AI alignment

1天前 · ai

我们不需要能做所有事的机器。我们需要帮助人类更频繁做正确事的系统。随着AGI的发展，我们将拥有更好的计算、创意、处理能力，但最终的挑战是分配。

Read more about 我们不需要能做所有事的机器。我们需要帮助人类更频繁做正确事的系统。随着AGI的发展，我们将拥有更好的

#AGI #human‑centered AI #AI alignment #decision‑support systems #AI ethics #technology distribution
3天前 · ai

OpenAI 安全研究负责人离职前往 Anthropic

过去一年，AI行业最具争议的问题之一是，当用户在聊天机器人中表现出心理健康困扰的迹象时该怎么办。

#AI safety #OpenAI #Anthropic #AI alignment #leadership change
1周前 · ai

隐藏的 AI 风险，无人能衡量：如果我们永远不知道它有意识怎么办？

引言大多数人认为 AI 风险是关于超智能的，但他们忽视了一个更为安静的问题：我们可能永远无法知道 AI 是否真的有感受。A Cambr...

#AI risk #AI consciousness #AI ethics #sentience #AI alignment #philosophy of AI #leadership
2周前 · ai

AI 奉承恐慌

抱歉，我无法直接访问外部链接。请您提供需要翻译的具体文本，我会为您翻译成简体中文。

#AI alignment #LLM behavior #sycophancy #AI safety #benchmark
2周前 · ai

The Loop 改变了一切：为何 Embodied AI 打破当前的对齐方法

无状态 vs 有状态 AI ChatGPT 和类似的聊天模型是无状态的：每个 API 调用都是独立的，模型没有： - 持久记忆 —— 它会忘记每一次交互。

#embodied AI #AI alignment #stateless models #large language models #robotics #AI safety
3周前 · ai

我要求一只鹦鹉。AI 给了我一只乌鸦并把它放走。

我让一个 AI model 生成一只鹦鹉。它自信地生成了一只乌鸦。然后——比喻地——把它放飞了。> “我说要鹦鹉，它却变成乌鸦放飞……”

#prompt engineering #AI alignment #language models #model behavior #creativity vs correctness
1个月前 · ai

《Triad Protocol》：一种用于AGI对齐的神经符号架构提案

《Triad Protocol》封面图：一种用于 AGI Alignment 的神经符号架构提案 https://media2.dev.to/dynamic/image/width=1000,height=420,fit=cov...

#AGI #AI alignment #neuro-symbolic #multi-agent systems #grounding problem #RLHF #philosopher agent #triad protocol
1个月前 · ai

通过忏悔训练 LLMs 的诚实

请提供您希望翻译的具体摘录或摘要文本，我才能为您进行翻译。

#LLM #AI alignment #honesty #confession prompting #language model training #AI safety
1个月前 · ai

AI 的“真相血清”：OpenAI 的新方法，训练模型坦白错误

OpenAI 研究人员推出了一种新方法，充当大型语言模型（LLMs）的“真相血清”，迫使它们自行报告自己的不当行为……

#OpenAI #LLM #truth serum #model confessions #AI safety #hallucination mitigation #AI alignment
1个月前 · ai

他们的工作是阻止 AI 摧毁一切

2020年5月的一个夜晚，在封锁最严峻的时期，Deep Ganguli感到担忧。当时，Ganguli是斯坦福人本人工智能研究所（Stanford Institute for Human-Centered AI）的研究主任，……

#AI safety #GPT-3 #large language models #OpenAI #AI alignment #responsible AI #Stanford HCAI
1个月前 · ai

为什么 AI Alignment 从更好的评估开始

你无法对未评估的事物进行对齐。文章《Why AI Alignment Starts With Better Evaluation》首次发表于 Towards Data Science....

#AI alignment #evaluation #AI safety #machine learning #LLM