Anthropic has finally confirmed the existence of Claude Mythos, but they will not release the model publicly, because it isa “cybersecurity reckoning.” Instead, they have organized a 40-company coalition called “Project Glassing” to allow cybersecurity professionals a head start in locking down critical software. This is an urgent initiative and it’s time to pay attention to alignment, because these models are getting seriously capable. Mythos’ benchmarks are absurd: It scored 77.8% on SWE_bench Pro, versus 53.4% for Opus 4.6. 59% on SWE-bench multimodal, versus 27.1% for Opus 4.6 That should give you an idea. It is a tremendous leap. Imagine what you will be able to build with Mythos when it eventually drops. Also imagine, the damage you could do with it If they were to prematurely release it… #anthropic #claude #mythos #ai #alignment
Anthropic has finally confirmed the existence of Claude Mythos, but they will not release the model publicly, because it isa “cybersecurity reckoning.” Instead, they have organized a 40-company coalition called “Project Glassing” to allow cybersecurity professionals a head start in locking down critical software. This is an urgent initiative and it’s time to pay attention to alignment, because these models are getting seriously capable. Mythos’ benchmarks are absurd: It scored 77.8% on SWE_bench Pro, versus 53.4% for Opus 4.6. 59% on SWE-bench multimodal, versus 27.1% for Opus 4.6 That should give you an idea. It is a tremendous leap. Imagine what you will be able to build with Mythos when it eventually drops. Also imagine, the damage you could do with it If they were to prematurely release it… #anthropic #claude #mythos #ai #alignment