HarnessX automates the assembly and adaptation of agent harnesses from execution traces, achieving an average +14.5% performance improvement without model scaling.
A new benchmark enables identification of the exact point where medical AI models produce hallucinations and enables targeted countermeasures through trace-supervised fine-tuning.
A trainable classifier predicts with a 0.7 Macro-F1-Score based on early hidden states whether activation steering will succeed without requiring complete generations.
Language models are evolving from chatbots with simple next-token prediction into Digital Colleagues with working memory, persistent workspaces, reusable skills, and reliable problem-solving.
A security vulnerability reported by Amazon to the White House triggered an AI model release freeze, exposing deep structural tensions between private AI makers, the US government, and national export control interests.
The White House removed Anthropic’s Fable model from the market with export controls after concerns about bypassed security safeguards, following failed intensive negotiations between government officials and CEO Amodei.
Anthropic must immediately disable Fable 5 and Mythos 5 following a US government export control directive citing national security concerns; the company disputes that the identified bypass method constitutes a substantive threat.
The US government blocks two high-performance Anthropic AI models for foreign nationals over concerns about a workaround to security restrictions — a step Anthropic criticizes as non-transparent and technically unjustified.