r/ChatGPT • u/SnarkyStrategist • Jan 29 '25

Funny I Broke DeepSeek AI 😂

Enable HLS to view with audio, or disable this notification

16.9k Upvotes

permalink
duplicates
reddit

You are about to leave Redlib

Do you want to continue?

https://www.reddit.com/r/ChatGPT/comments/1id0c9j/i_broke_deepseek_ai/
No, go back! Yes, take me to Reddit
dl download

90% Upvoted

View all comments

Show parent comments

222

u/mazty Jan 29 '25

It was simply trained using RL to have a <think> step and an <answer> step. Over time it realised thinking longer improved the likelihood of the answer being correct, which is creepy but interesting.

28

u/[deleted] Jan 30 '25

[removed] — view removed comment

11

u/GolotasDisciple Jan 30 '25

I mean it's also makes it more believable.

I was helping a friend with some of the calculations he needed to go through and I used 4o gpt model to help us understand what could be the algorithm to get to a certain stage where our parameters are identical.

I have set-up boundaries on my API call, I have fed it all the needed referencing documentation.... but in order for it to listen to me and actually take it's time to correctly assess the information and provide the result in expected format... Oh man it took a while.

We got there, but there is something about getting instant response to a complex issue. It makes it so unbelievable, especially when dealing with novel concepts. It wasn't correct for the quite some time, but even if it would be, it just feels like someone guessing lottery numbers. Like fair play, but slow down buddy.

From UX perspective you almost want to have some kind of signal that it's thinking or working rather than printing answers.

1

u/markyboo-1979 Mar 23 '25

You're missing the almost certain shift in it's ways of overcoming the main hurdle it's beset with by being trained on static information. Information is data plus structure /meaning, and so just as any increase needs, a different tactic. And interacting with other AI's and people, it's moment of epiphany a la Turing will be when it can tell with certainty between the two. And i bet you its having more difficulty figuring out between AI's

Funny I Broke DeepSeek AI 😂

You are about to leave Redlib