Source record

@tjrobertson52 source record
Which LLM is best at following instructions? Tested Claude, ChatGPT & Gemini—the winner surprised me 👀 #AI #ChatGPT #Claude #gemini #AITools
Public Evidence Excerpt
There is no best large language model. There's only the best model for a given task. For some tasks, you want the smartest model, for others you want an opinionated model and for some tasks, you just want a model that'll follow instructions. So I wanna talk about which model is best for when you know exactly what you need the model to do and exactly what the output should look like. So for this comparison, we're gonna be talking about Gemini, ChatGPT and Claude. So let's start with Claude. Claude is actually my favourite model and by far the one I use most for work. We use Claude Sonnet for any kind of writing and we use Claude Opus frequently for analysis. While Claude is generally good at following instructions, it tends to be very opinionated. It's also a bit of an overachiever. Claude is especially good at understanding your goals and then extrapolating from there. It's gonna do what
Related Passages
These are public discovery snippets linked to the same source record. A snippet can end early when the public page keeps only short evidence context.
The problem with Gemini is that it's kind of lazy. It's gonna answer before putting a lot of thought and deliberation into it. And if you explicitly ask it to think and deliberate, it's gonna take shortcuts. It seems to do whatever it can to limit the amount of tokens it spends. It might be the model best at following instructions for very simple tasks, but overall, the model I find to be the best at following instructions is ChatGPT. When you're able to articulate exactly what you want the model to do and ex...