Mistral

3 weeks ago · ai

I Trained Probes to Catch AI Models Sandbagging

TL;DR: I extracted “sandbagging directions” from three open‑weight models and trained linear probes that detect sandbagging intent with 90‑96 % accuracy. The mo...

#sandbagging #model probing #linear probes #AI safety #Mistral #Gemma #evaluation gaming #model steering
1 month ago · ai

Mistral closes in on Big AI rivals with new open-weight frontier and small models

Mistral unveils its Mistral 3 lineup, including a frontier model and efficient small models designed for offline, customizable enterprise use—aiming to prove sm...

#Mistral #large language models #frontier model #small models #open-weight AI #fine-tuning #enterprise AI #offline AI
1 month ago · ai

Mistral 3 family of models released

Article URL: https://mistral.ai/news/mistral-3 Comments URL: https://news.ycombinator.com/item?id=46121889 Points: 138 Comments: 38...

#Mistral #large language model #LLM #AI model release #generative AI