Discussion about this post

Ben

If a top-tier model like Claude can misrepresent its own identity because of contamination, what does that say about relying on any LLM's self-reporting for critical tasks?

I find this interesting. My own testing over six months with several Chinese models (under the Hermes update) showed something different. Every time I restarted the session and asked “What model are you using?”, the Chinese models consistently gave the correct answer.
