· ai
The 'truth serum' for AI: OpenAI’s new method for training models to confess their mistakes
OpenAI researchers have introduced a novel method that acts as a 'truth serum' for large language models LLMs, compelling them to self-report their own misbehav...