Make Trust Irrelevant: A Gamer's Take on Agentic AI Safety
I wrote a short position paper arguing that current agentic AI safety failures are the confused deputy problem on repeat. We are handing agents ambient authorit...
I wrote a short position paper arguing that current agentic AI safety failures are the confused deputy problem on repeat. We are handing agents ambient authorit...
!Cover image for AI is getting scaryhttps://media2.dev.to/dynamic/image/width=1000,height=420,fit=cover,gravity=auto,format=auto/https%3A%2F%2Fdev-to-uploads.s3...
As AI systems grow more powerful, Anthropic’s resident philosopher says the startup is betting Claude itself can learn the wisdom needed to avoid disaster....
OpenAI shares its approach to AI localization, showing how globally shared frontier models can be adapted to local languages, laws, and cultures without comprom...
Article URL: https://arxiv.org/abs/2512.04124 Comments URL: https://news.ycombinator.com/item?id=46902855 Points: 8 Comments: 3...
For years, Alphabet-owned Waymo has tried to set itself apart from other self-driving startups by emphasizing a culture of caution and safety. Now, just ahead o...
markdown JAN. 29, 2026 !Ajeet Mirwanihttps://developers.google.com/static/images/author/Ajeet_Mirwani.pnghttps://developers.googleblog.com/search/?author=Ajeet+...
Intelligence was never the threat. Coordination is. Every existing governance framework breaks at that point. The Real Shift: Coordination Over Intelligence For...
'JAN. 29, 2026
Article URL: https://alignment.anthropic.com/2026/hot-mess-of-ai/ Comments URL: https://news.ycombinator.com/item?id=46864498 Points: 61 Comments: 14...
Discover the Sora feed philosophy—built to spark creativity, foster connections, and keep experiences safe with personalized recommendations, parental controls,...
'markdown JAN. 29, 2026