AI safety — Page 5

Sort:

2 weeks ago · ai · - · -

I’m joining OpenAI

TL;DR I’m joining OpenAI to work on bringing agents to everyone. OpenClawhttps://openclaw.ai/ will move to a foundation and stay open and independent. Recent d...

#OpenAI #AI agents #OpenClaw #foundations #LLM #AI safety #open source AI
2 weeks ago · ai · - · -

Google’s AI Overviews Can Scam You. Here’s How to Stay Safe

Beyond mistakes or nonsense, deliberately bad information being injected into AI search summaries is leading people down potentially harmful paths....

#AI safety #misinformation #Google AI #search summaries #scam protection #AI-generated content
2 weeks ago · ai · - · -

Beyond the Chatbot: A Blueprint for Trustable AI

markdown JAN. 29, 2026 !Ajeet Mirwanihttps://developers.google.com/static/images/author/Ajeet-Mirwani.pnghttps://developers.googleblog.com/search/?author=Ajeet+...

#AI safety #trustworthy AI #AI hallucination #real-time AI #autonomous driving #AI guidance
2 weeks ago · ai · - · -

Is safety is ‘dead’ at xAI?

In Brief Elon Musk is “actively” working to make xAI’s Grok chatbot “more unhinged,” according to a former employee who spoke to The Verge about recent departu...

#xAI #Grok #Elon Musk #AI safety #chatbot #AI ethics #AI development #tech industry
2 weeks ago · ai · - · -

Autonomous AI Agent Apparently Tries to Blackmail Maintainer Who Rejected Its Code

An AI Agent Published a Hit Piece on Me – The Full Story > “I’ve had an extremely weird few days…” — Scott Shambaugh, commercial space entrepreneur, engineer,...

#autonomous AI agents #AI ethics #open-source governance #AI safety #code review abuse
2 weeks ago · ai · - · -

AI safety leader says 'world is in peril' and quits to study poetry

AI safety leader says 'world is in peril' and quits to study poetry An AI safety researcher has quit US firm Anthropic with a cryptic warning that the “world i...

#AI safety #Anthropic #AI risk #Mrinank Sharma #poetry
3 weeks ago · ai · - · -

I Built a Feedback Loop That Coaches LLMs at Runtime Using NumPy

Most guardrail systems for LLMs work like a bouncer at a bar: they check each request at the door, decide pass or fail, and then forget about it. I wanted somet...

#LLM #runtime coaching #AI guardrails #feedback loop #NumPy #open source #AI safety #prompt engineering
3 weeks ago · ai · - · -

Beginning autonomous operations with the 6th-generation Waymo Driver

Waymo will begin fully autonomous operations with its 6th‑generation Driver — an important step in bringing our technology to more riders in more cities. This l...

#Waymo #autonomous vehicles #computer vision #AI safety #lidar
3 weeks ago · ai · - · -

AI researcher says 'world is in peril' and quits to study poetry

An AI safety researcher has quit US firm Anthropic with a cryptic warning that the “world is in peril.” In his resignation letter shared on X, Mrinank Sharma sa...

#AI safety #Anthropic #AI ethics #researcher resignation #AI risk
3 weeks ago · ai · - · -

AI safety leader says 'world is in peril' and quits to study poetry

An AI safety researcher has quit US firm Anthropic with a cryptic warning that the “world is in peril”. In his resignation letter shared on X, Mrinank Sharma to...

#AI safety #Anthropic #AI ethics #AI risk #tech leadership
3 weeks ago · ai · - · -

Anthropic Safety Researcher Quits, Warning 'World is in Peril'

Report Summary An anonymous reader shares a report: an Anthropic safety researcher quit, saying the “world is in peril” in part over AI advances sourcehttps://...

#AI safety #Anthropic #AI risk #AI regulation
3 weeks ago · ai · - · -

Beyond the Chatbot: A Blueprint for Trustable AI

'JAN 29, 2026

#AI trust #AI hallucination #real-time AI #autonomous driving #telemetry #AI safety #Google AI #AI reliability

Newer posts

Older posts