...

I Tried Making my Own (Bad) LLM Benchmark to Cheat in Escape Rooms

I Tried Making my Own (Bad) LLM Benchmark to Cheat in Escape Rooms
Recently, DeepSeek announced their latest model, R1, and article after article came out praising its performance relative to cost, and ...
Read more