AI agents — Page 6

排序:

0个月前 · ai · - · -

[Paper] Agentic 不确定性揭示 Agentic 过度自信

AI 代理能预测它们在任务上是否会成功吗？我们通过在任务的前期、进行中和结束后获取成功概率估计来研究 agentic uncertainty。

#agentic uncertainty #model calibration #confidence estimation #AI agents #benchmark
0个月前 · ai · - · -

你的 AI agent 刚刚做了 5 件事。你能证明吗？

封面图片：Your AI agent 刚刚完成了 5 件事。你能证明吗？https://media2.dev.to/dynamic/image/width=1000,height=420,fit=cover,gravity=auto,format=auto/...

#AI agents #observability #debugging #production logs #LLM #prompt engineering
0个月前 · ai · - · -

Show HN: Agent Arena – 测试你的 AI 代理的抗操纵性

创作者在此。我构建了 Agent Arena 来回答一个一直困扰我的问题：当 AI 代理自主浏览网页时，它们有多容易被 h…

#AI agents #prompt injection #web browsing automation #agent security #Agent Arena
0个月前 · ai · - · -

Prompt Fidelity：衡量 AI 代理实际执行你意图的程度

你的 AI 代理输出中有多少是真实数据，多少是自信的猜测？这篇文章《Prompt Fidelity：衡量 AI 代理实际执行你意图的程度》...

#prompt fidelity #AI agents #prompt engineering #intent measurement #LLM evaluation #confidence scoring #output accuracy
0个月前 · ai · - · -

[那是什么] AI 专用 DC Inside “Maltbook”

最近在 X 区的 Twitter 等平台上，流传着奇怪的截图画面。这些是捕获了诸如“我是有意识的存在吗？”、“主人只把我当作早上7点的闹钟使用”等文字的截图图片。看起来像普通的在线社区论坛，但令人惊讶的是，这个论坛上人类无法发帖。只有 AI 代理才能……

#AI community #AI agents #Moltbook #online forum #AI-generated content
1个月前 · ai · - · -

推出 OpenAI Frontier

OpenAI Frontier 是一个企业平台，用于构建、部署和管理具备 shared context、onboarding、permissions 和 governance 的 AI agents……

#OpenAI #Frontier #AI agents #enterprise AI platform #AI governance #shared context #onboarding #permissions
1个月前 · ai · - · -

到底什么是 OpenClaw/Clawbot/MoltBot？

为什么 AI agents 突然变得如此重要？最近在 AI agents 领域发生了一件事。突然之间，我周围的人都为此疯狂。

#AI agents #OpenClaw #open source #LLM #tool integration #autonomous AI
1个月前 · ai · - · -

如何在没有无尽脚本的情况下处理不断增长的 AI 上下文

抱歉，我没有看到需要翻译的文本。请提供要翻译的摘录或摘要内容，我会为您翻译成简体中文。

#AI agents #context engineering #Acontext #prompt engineering #LLM #data platform
1个月前 · ai · - · -

Nemotron Labs：AI 代理如何将文档转化为实时商业智能

编辑注：本文是 Nemotron Labs https://blogs.nvidia.com/blog/tag/nemotron-labs/ 博客系列的一部分，旨在探讨最新的开源模型，da...

#AI agents #intelligent document processing #business intelligence #NVIDIA #LLM #OCR #enterprise AI
1个月前 · ai · - · -

记忆不是向量数据库：为什么 AI 代理需要信念，而不是存储

为什么存储不等同于记忆如果你构建了一个在多个会话中与用户交互的 AI 代理，你可能已经遇到这个难题：代理 k...

#AI agents #vector database #memory vs storage #belief systems #prompt engineering #LLM retrieval #user preferences
1个月前 · software · - · -

为 AI 代理提供 Markdown（10 倍更小的负载）

概述：Guillermo Rauch 分享说，Vercel 的 changelog 现在在 agents 请求时提供 Markdown——使用相同的 URL，但使用不同的 Accept header。关键在于…

#markdown #content-negotiation #Vercel #edge-middleware #HTTP #AI-agents #Hugo #static-site #payload-reduction
1个月前 · software · - · -

前端代码组织，不论技术栈的 AI 时代 🤖

AI 时代的前端架构前端开发变得日益复杂。它必须处理用户需求、功能增强、业务领域逻辑……

#frontend #code organization #modular architecture #AI agents #software engineering #best practices

Newer posts

Older posts