ai — 页 32 | EUNO.NEWS

排序:

1个月前 · ai · - · -

OpenAI 在 Anthropic 推出其自有模型几分钟后发布新的 agentic coding 模型

新模型旨在加速 Codex 的功能，Codex 是 OpenAI 本周早些时候推出的具备代理能力的编码工具……

#OpenAI #agentic coding model #Codex #Anthropic #AI coding assistants #large language models #generative AI
1个月前 · ai · - · -

[Paper] 伪可逆神经网络

Moore‑Penrose 伪逆 (PInv) 是线性系统的基本解。在本文中，我们提出了一种对 PInv 的自然推广……

#research #paper #ai #machine-learning #computer-vision
1个月前 · ai · - · -

[Paper] 共享 LoRA 子空间用于几乎严格的持续学习

高效且持续地将 large pretrained models 适配到新任务对于 real‑world deployment 至关重要，但由于 catastrophic forgetting 等挑战仍然困难。

#research #paper #ai #machine-learning #computer-vision
1个月前 · ai · - · -

[Paper] 从透视描述预测相机姿态用于空间推理

多图像空间推理仍然是当前多模态大语言模型（MLLMs）的挑战。虽然单视角感知本质上是二维的，推理……

#research #paper #ai #computer-vision
1个月前 · ai · - · -

[Paper] DyTopo：通过语义匹配的多智能体推理动态拓扑路由

由提示的大型语言模型构建的多代理系统可以提升多轮推理能力，然而大多数现有的流水线依赖于固定的、跨轨迹的通信……

#research #paper #ai #machine-learning
1个月前 · ai · - · -

[Paper] SwimBird: 在混合自回归 MLLMs 中引发可切换的推理模式

多模态大型语言模型（MLLMs）通过连接视觉和语言，在多模态感知和推理方面取得了显著进展。然而，大多数现有...

#research #paper #ai #computer-vision
1个月前 · ai · - · -

[论文] CommCP：通过基于LLM的通信与共形预测实现高效多智能体协同

为了完成人类以 natural language 提供的任务，机器人必须解释指令，生成并回答与 scene understanding 相关的问题，……

#research #paper #ai #machine-learning #computer-vision
1个月前 · ai · - · -

[Paper] 用几何思考：Active Geometry Integration 用于空间推理

近期在空间推理方面的进展，使用多模态大语言模型（MLLMs）越来越多地利用来自3D编码器的几何先验。然而，大多数现存……

#research #paper #ai #computer-vision
1个月前 · ai · - · -

[Paper] DFlash：块扩散用于 Flash 投机解码

自回归大型语言模型（LLMs）表现出色，但需要本质上顺序的解码，导致推理延迟高且 GPU 利用率差……

#research #paper #ai #nlp
1个月前 · ai · - · -

[Paper] InterPrior：用于基于物理的人体-物体交互的可扩展生成控制

人类很少在显式的全身动作层面上规划与物体的全身交互。高级意图，例如 affordance，定义了目标……

#research #paper #ai #computer-vision
1个月前 · ai · - · -

[Paper] V-Retrver: 基于证据驱动的主体推理用于通用多模态检索

多模态大语言模型（MLLMs）最近被用于通用多模态检索，其中链式思考（CoT）推理能够提升候选项的质量。

#research #paper #ai #computer-vision
1个月前 · ai · - · -

[Paper] 视觉语言模型能从交互中学习直观物理吗？

预训练的视觉语言模型对物理世界没有良好的直觉。最近的研究表明，监督微调可以提升模型的……

#research #paper #ai #machine-learning
1个月前 · ai · - · -

[论文] Splat and Distill：通过前馈 3D 重建增强教师，实现 3D 感知蒸馏

Vision Foundation Models (VFMs) 在应用于各种下游 2D 任务时取得了显著成功。尽管它们效果显著，但它们常常表现出……

#research #paper #ai #computer-vision
1个月前 · ai · - · -

[论文] AP-OOD：Attention Pooling 用于分布外检测

Out-of-distribution（OOD）检测，将高维数据映射为标量 OOD 分数，对于机器学习模型的可靠部署至关重要……

#research #paper #ai #machine-learning
1个月前 · ai · - · -

[Paper] PhysicsAgentABM：物理引导的生成式基于代理的建模

基于大型语言模型（LLM）的多代理系统能够实现富表达的代理推理，但其扩展成本高，并且在时间步对齐的场景下校准性较差。

#research #paper #ai #machine-learning
1个月前 · ai · - · -

[Paper] 好奇心即知识：自洽学习与主动推断下的无后悔优化

主动推断（AIF）通过最小化期望自由能（EFE）统一了探索与利用，平衡认知价值（信息增益）和实际价值（...）。

#research #paper #ai #machine-learning
1个月前 · ai · - · -

[Paper] 上下文强制：具有长上下文的一致自回归视频生成

近期针对实时长视频生成的研究通常采用 streaming tuning 策略，尝试使用 short‑cont（短上下文）来训练 long‑context student。

#research #paper #ai #computer-vision
1个月前 · ai · - · -

[Paper] 学习查询感知 Budget-Tier 路由用于 Runtime Agent Memory

记忆在超出单个上下文窗口运行的大型语言模型（LLM）代理中变得日益核心，然而大多数现有系统仍依赖离线的、查询式的…

#research #paper #ai #machine-learning #nlp
1个月前 · ai · - · -

[Paper] 学习基于事件的射击模型来自虚拟现实实验

虚拟现实（VR）已成为评估学校安全措施的强大工具，尤其在学校枪击等高风险情境中，提供实验……

#research #paper #ai #machine-learning
1个月前 · ai · - · -

[Paper] 正确性优化的残差激活透镜 (CORAL)：可转移且校准感知的推理时引导

大型语言模型（LLMs）表现出持续的误校准，尤其是在指令微调和偏好对齐之后。修改后的训练目标可以 i...

#research #paper #ai #machine-learning
1个月前 · ai · - · -

[Paper] 扩散模型的泛化可以通过对数据依赖的 Ridge 流形的归纳偏置来刻画

当 diffusion model 并未记忆 training data set 时，它到底是如何实现 generalize 的？对它生成的 distribution 进行 quantitative understanding …

#research #paper #ai #machine-learning
1个月前 · ai · - · -

[论文] 通过自蒸馏的多标记预测

现有的加速语言模型推理的技术，例如 speculative decoding，需要训练辅助的 speculator 模型并构建和部署…

#research #paper #ai #machine-learning #nlp
1个月前 · ai · - · -

[Paper] 大语言模型在 PTSD 严重程度估计中的系统评估：上下文知识与建模策略的作用

大型语言模型（LLMs）正日益以零样本方式用于评估心理健康状况，但我们对哪些因素了解有限，...

#research #paper #ai #nlp
1个月前 · ai · - · -

乐观性使 Thompson Sampling 在自适应推断中更稳健

Thompson 采样（TS）在随机多臂赌博机中被广泛使用，但其在自适应数据收集下的推断属性非常微妙。经典的……

#research #paper #ai #machine-learning
1个月前 · ai · - · -

[Paper] GenArena：我们如何实现对视觉生成任务的人类对齐评估？

视觉生成模型的快速发展已经超出了传统评估方法的步伐，迫切需要采用 Vision-Language Models 作为替代……

#research #paper #ai #machine-learning #computer-vision
1个月前 · ai · - · -

[Paper] AgenticPay：用于买卖交易的多代理 LLM 谈判系统

基于大型语言模型（LLM）的代理正日益被期望能够自主进行谈判、协调和交易，然而现有的基准缺乏原则性的……

#research #paper #ai #machine-learning
1个月前 · ai · - · -

[Paper] 利用 OpenAI Whisper 表征和注意力池化方法的语音情感识别

语音情感识别（Speech Emotion Recognition, SER）研究由于缺乏标准且足够大的数据集而受到限制。最近的研究利用了预训练…

#research #paper #ai #machine-learning #nlp
1个月前 · ai · - · -

[论文] DSB：用于 Diffusion LLM 的动态滑动块调度

扩散大语言模型（dLLMs）已成为文本生成的有前景的替代方案，其特点是原生支持并行解码……

#research #paper #ai #nlp
1个月前 · ai · - · -

[Paper] SAGE：基准测试与改进深度研究智能体的检索

深度研究代理已经成为处理复杂查询的强大系统。与此同时，基于LLM的检索器在fol方面展示了强大的能力。

#research #paper #ai #nlp
1个月前 · ai · - · -

[Paper] 将人类在概念生成中的语义导航表征为Embedding Space中的轨迹

语义表征可以被构建为一种结构化、动态的知识空间，人类在其中导航以检索和操作意义。为了研究……

#research #paper #ai #machine-learning #nlp
1个月前 · ai · - · -

心理测量Jailbreaks揭示前沿模型的内部冲突

请提供您希望翻译的具体摘录或摘要文本，我才能为您进行简体中文翻译。

#psychometric testing #jailbreak #frontier models #large language models #AI safety #model evaluation
1个月前 · ai · - · -

GPT-5.3-Codex

请提供您需要翻译的具体摘录或摘要文本。

#GPT-5.3 #Codex #OpenAI #large language model #code generation #AI research
1个月前 · ai · - · -

Claude 正在走红——它能保持吗？

Boris Cherny 在公共场合相对经常被认出来。无论是在酒吧、机场，还是在一般的公共空间，人们都想和这位 cre...

#Anthropic #Claude #AI coding assistant #large language model #generative AI #AI tools
1个月前 · ai · - · -

[Paper] 自我改进的多语言长推理通过翻译-推理集成训练

长推理模型在多语言环境中常常遇到困难：它们倾向于对非英语问题使用英语进行推理；当被限制在...

#research #paper #ai #nlp
1个月前 · ai · - · -

Claude 代码代理团队

请提供您希望翻译的具体摘录或摘要文本，我才能为您进行简体中文翻译。

#Claude #code agent #AI coding assistant #software development #automation
1个月前 · ai · - · -

Claude Opus 4.6 现已在 GitHub Copilot 中普遍可用

Claude Opus 4.6，Anthropic 的最新模型，现已在 GitHub Copilot 中推出。在早期测试中，Claude Opus 4.6 在 agentic coding 方面表现出色，具备 specialization…

#Claude Opus #GitHub Copilot #large language model #AI coding assistant #agentic coding
1个月前 · ai · - · -

[Paper] 多语者还是众多？多语言LLM 对价值取向的多项选择题的回答

多项选择题（MCQs）常用于评估大型语言模型（LLMs）中编码的知识、推理能力，甚至价值观。虽然...

#research #paper #ai #nlp
1个月前 · ai · - · -

Claude Opus 4.6

请提供您希望翻译的具体摘录或摘要文本，我才能为您进行简体中文翻译。

#Claude #Anthropic #LLM #AI model release
1个月前 · ai · - · -

[Paper] DARWIN：动态代理式重写自我改进网络

DARWIN 是一种进化型 GPT 模型，利用类似遗传算法的优化结构，对多个独立的 GPT 代理进行单独训练……

#research #paper #ai #machine-learning #nlp
1个月前 · ai · - · -

Google 在超级碗播出 Gemini 广告：“新家” [Video]

在 Super Bowl——抱歉，我是指 “Big Game”——本周日，Google 正在播出一则关于 Gemini 应用的广告。更多…

#Google #Gemini #Super Bowl #AI advertising #large language model #AI product launch
1个月前 · ai · - · -

[Paper] 使用语义范围对企业代码仓库的 LLM 自动化定制

代码补全（Code completion，CC）是开发者在与基于 LLM 的编程助手协作时常用的任务。尽管性能有所提升……

#research #paper #ai #machine-learning
1个月前 · ai · - · -

[Paper] RocqSmith：自动优化能打造更好的证明代理吗？

本工作研究了自动 AI 代理优化方法在形式验证环境中对真实世界代理的适用性，重点关注自动定理证明……

#research #paper #ai #machine-learning
1个月前 · ai · - · -

Fundamental 从隐身模式出现，推出首个针对表格数据的重大基础模型

深度学习革命有一个奇怪的盲点：电子表格。虽然大型语言模型（LLMs）已经掌握了人类散文和图像生成的细微差别……

#foundation model #tabular data #deep learning #structured data #LLM #machine learning #AI research #VentureBeat
1个月前 · ai · - · -

[Paper] TimelyFreeze：用于流水线并行的自适应参数冻结机制

Pipeline parallelism 使得训练超出单设备内存限制的模型成为可能，但实际吞吐量仍受到 pipeline bubbles 的限制。虽然 parameter …

#research #paper #ai #machine-learning
1个月前 · ai · - · -

我没注意到 AI 为我设定优先级，直到为时已晚

AI 如何影响优先级我以为我在使用 AI 来更快执行。我没有意识到的是，它悄悄地在塑造我在第一阶段所工作的内容……

#AI productivity #prompt engineering #workflow automation #task prioritization #AI bias in work
1个月前 · ai · - · -

[Paper] 神经启发的视觉模式识别通过生物 Reservoir Computing

在本文中，我们提出了一种受神经启发的 reservoir computing (RC) 方法，其中体外培养的皮层神经元网络作为物理……

#research #paper #ai #computer-vision
1个月前 · ai · - · -

机制可解释性：窥探 LLM 内部

LLM 的类人认知能力是真实的还是虚假的？信息在神经网络中是如何传播的？LLM 内部是否存在隐藏的知识？……

#mechanistic interpretability #LLM #large language models #neural network analysis #AI explainability #deep learning
1个月前 · ai · - · -

ElevenLabs CEO：语音是 AI 的下一个界面

ElevenLabs CEO 在卡塔尔 Web Summit 上表示，语音将成为 AI 的下一个界面，因为 OpenAI、Google 和 Apple 正在将对话系统推向可穿戴设备以及新的……

#voice AI #conversational interfaces #ElevenLabs #speech synthesis #Web Summit #AI assistants

Newer posts

Older posts