WARP: Recovering Training Data Mixtures from Model Weights

5. July 2026
AI Models

WARP reconstructs the training source mixtures of language models from their weights, achieving mean absolute errors of 0.046 for BERT and 0.104 for GPT-2.


OpenThoughts-Agent: Systematic Data Curation for Agentic Models

24. June 2026 / 4. July 2026
AI Models

A systematic data curation pipeline enables agentic models to generalize across diverse task types while matching or outperforming specialized models.



Lumi AI News
