ai - Page 22 | EUNO.NEWS

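3 weeks ago · ai · - · -

I Built a RAG Pipeline. Then I Realized Retrieval Is the Real Model

Everyone talks about LLMs. GPT‑4, Claude, Gemini: they are the stars. But after building my first real RAG pipeline, I learned something humbling: the LLM…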

#RAG #retrieval-augmented-generation #LLM #GPT-4 #vector-embeddings #vector-database #prompt-engineering #AI-pipelines
3 weeks ago · ai · - · -

Your AI Agent Is Reading Poisoned Web Pages… Here's How to Stop It

Cover image: “Your AI Agent is Reading Poisoned Web Pages… Here’s How to Stop It” https://media2.dev.to/dynamic/image/width=1000,height=420,fit=cover,gravity...

#AI agents #web poisoning #adversarial attacks #prompt injection #AI safety #DeepMind #agentic AI #security
0 months ago · ai · - · -

Improving Training Goodput: How Continuous Checkpointing Optimizes Reliability in Orbax and MaxText

March 31, 2026

#continuous checkpointing #Orbax #MaxText #model training reliability #training performance #fault tolerance #Google AI #distributed training #checkpoint frequency
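0 months ago · ai · - · -

Safetensors Joins the PyTorch Foundation

Today, we are announcing that Safetensors has joined the PyTorch Foundation as a foundation-hosted project under the Linux Foundation, alongside DeepSpeed, Helio, and others.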

#safetensors #pytorch foundation #huggingface #model serialization #linux foundation #deep learning #open source #AI infrastructure
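0 months ago · ai · - · -

16 New START.nano Companies Are Developing Hard-Tech Solutions with Support from MIT.nano

MIT.nano has announced that 16 startups became active participants in its START.nano program in 2025, more than double the previous number.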

#ai #ai-research #academia
0 months ago · ai · - · -

Why AI Is Replacing Some Jobs Faster Than Others #AI

Cover image: Why AI Is Replacing Some Jobs Faster Than Others https://media2.dev.to/dynamic/image/width=1000,height=420,fit=cover,gravity=auto,format=auto

#AI #job automation #workforce transformation #data-rich industries #digital transformation #career advice #AI impact on jobs
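0 months ago · ai · - · -

Testing Suggests Google's AI Overviews Tells Millions of Lies Per Hour

Overview: A New York Times analysis found that Google's AI Overviews answers questions correctly about 90% of the time. While that sounds impressive, it also…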

#google #gemini #ai-overviews #factuality #accuracy #simpleqa #oumi #nytimes #arstechnica
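0 months ago · ai · - · -

[Paper] Paper Circle: An Open-Source Multi-Agent Research Discovery and Analysis Framework

The rapid growth of scientific literature has made it increasingly difficult for researchers to efficiently discover, evaluate, and synthesize relevant work. Re...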

#research #paper #ai #nlp
0 months ago · ai · - · -

[Paper] In-Place Test-Time Training

The static "train then deploy" paradigm fundamentally limits large language models (LLMs) when they face continual…

#research #paper #ai #machine-learning #nlp
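0 months ago · ai · - · -

[Paper] Topological Characterization of Churn Flow in Small-Diameter Vertical Pipes and Unsupervised Correction of the Wu Flow-Regime Map

Churn flow, the chaotic, oscillatory regime in vertical two-phase flow, has lacked a quantitative mathematical definition for more than 40 years. For the first time, we…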

#research #paper #ai #machine-learning
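0 months ago · ai · - · -

[Paper] HaloProbe: Bayesian Detection and Mitigation of Object Hallucinations in Vision-Language Models

Large vision-language models can produce object hallucinations in image captions, which underscores the need for effective detection and mitigation strategies. P...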

#research #paper #ai #machine-learning #computer-vision
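0 months ago · ai · - · -

[Paper] Character Error Vector: Decomposable Errors for Page-Level OCR Evaluation

Character Error Rate (CER) is a key metric for evaluating the quality of Optical Character Recognition (OCR). However, this metric assumes that the text has already…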

#research #paper #ai #machine-learning #computer-vision
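0 months ago · ai · - · -

[Paper] Target Policy Optimization

In reinforcement learning (RL), given a prompt, we sample a group of completions from the model and score them. Two questions follow: which completions should gain probability mass, and…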

#research #paper #ai #machine-learning
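0 months ago · ai · - · -

[Paper] MMEmb‑R1: Reasoning-Enhanced Multimodal Embedding with Pair-Aware Selection and Adaptive Control

MLLMs have been applied successfully to multimodal embedding tasks, yet their generative reasoning capabilities remain underutilized. Directly taking cha...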

#research #paper #ai #machine-learning #nlp #computer-vision
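0 months ago · ai · - · -

[Paper] Multi-Token Prediction and Latent Semantic Enhancement Toward Consistent World Models

Whether large language models (LLMs) can form coherent internal world models remains a central debate. Conventional Next-Token Prediction (NTP) focuses on a single…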

#research #paper #ai #machine-learning #nlp
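0 months ago · ai · - · -

[Paper] Who Governs the Machines? A Machine Identity Governance Taxonomy (MIGT) for AI Systems Across Enterprise and Geopolitical Boundaries

AI governance has a blind spot: the machine identities that AI systems use to act. AI agents, service accounts, API tokens, and auto...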

#research #paper #ai #machine-learning
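0 months ago · ai · - · -

[Paper] Shot-Based Quantum Encoding: A Data-Loading Paradigm for Quantum Neural Networks

Efficient data loading remains a bottleneck for near-term quantum machine learning. Existing schemes (angle, amplitude, and basis encoding) either underutilize …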

#research #paper #ai #machine-learning
0 months ago · ai · - · -

[Paper] PoM: A Linear-Time Alternative to Attention Using a Polynomial Mixer

This paper introduces the Polynomial Mixer (PoM), a novel token mixing mechanism with linear complexity that can directly replace self-attention....

#research #paper #ai #machine-learning #computer-vision
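0 months ago · ai · - · -

[Paper] Gym-Anything: Turning Any Software into an Agent Environment

Computer-use agents promise to help across a wide range of digital economic activity. However, current research focuses mainly on short-horizon…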

#research #paper #ai #machine-learning
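0 months ago · ai · - · -

[Paper] Lightweight Multimodal Adaptation of Vision-Language Models for Species Recognition and Habitat Context Interpretation in Drone Thermal Imagery

This study proposes a lightweight multimodal adaptation framework to bridge the representation gap between RGB-pretrained VLMs and thermal infrared imagery, and…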

#research #paper #ai #machine-learning #computer-vision
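0 months ago · ai · - · -

[Paper] SEM-ROVER: Semantic Voxel-Guided Diffusion for Large-Scale Driving Scene Generation

Scalable outdoor driving scene generation requires 3D representations that stay consistent across multiple viewpoints and extend to large areas. Existing s...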

#research #paper #ai #computer-vision
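0 months ago · ai · - · -

[Paper] Social Dynamics as a Critical Vulnerability That Undermines Objective Collective Decision-Making in LLMs

Large language model (LLM) agents increasingly act as human representatives in multi-agent environments, where a representative agent integrates diverse peer…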

#research #paper #ai #machine-learning #nlp
0 months ago · ai · - · -

The Master Algorithm

"The Master Algorithm" – 2015 → 2025. In 2015, AI researcher Pedro Domingos published a book: "The Master Algorithm: How the Quest for the Ultimate Learning…".

#master algorithm #neural networks #machine learning #Pedro Domingos #AI tribes
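0 months ago · ai · - · -

[Paper] LAG‑XAI: A Lie-Inspired Affine Geometric Framework for Interpretable Paraphrasing in Transformer Latent Spaces

Modern Transformer-based language models excel at natural language processing tasks, but their latent semantic spaces still remain largely…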

#research #paper #ai #machine-learning #nlp
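0 months ago · ai · - · -

[Paper] Scientific Graphics Program Synthesis via Dual Self-Consistency Reinforcement Learning

Graphics Program Synthesis is essential for interpreting and editing visual data, effectively facilitating the reverse engineering of static visual content into editable form…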

#research #paper #ai #machine-learning #computer-vision
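0 months ago · ai · - · -

[Paper] Your Life Story Through Others' Eyes: Round-Trip Evaluation of LLM-Generated Life Stories Based on Rich Psychometric Profiles

Personality traits are richly encoded in natural language, and large language models (LLMs) trained on human-written text can simulate personality when conditioned on a prompt.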

#research #paper #ai #machine-learning #nlp
0 months ago · ai · - · -

It's Not Just You, Claude Is Down Again

Calvin Wankhede / Android Authority TL;DR - Claude is experiencing…

#Claude #Anthropic #AI outage #service disruption #status update
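0 months ago · ai · - · -

Liquid Neural Networks: The Future of Temporal AI in 2024

In the race to build AI systems that mimic human cognition, a new class of neural networks, Liquid Neural Networks (LNNs), is emerging as a game changer. Unlike traditional…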

#liquid neural networks #temporal AI #neuromorphic hardware #edge computing #real‑time sensor data #deepmind #intel
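0 months ago · ai · - · -

[Paper] QiMeng-PRepair: Precise Code Repair via Edit-Aware Reward Optimization

Large Language Models (LLMs) perform well at program repair but often over-edit, where excessive modifications overwrite correct code…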

#research #paper #ai #machine-learning
0 months ago · ai · - · -

[Paper] Neural Network Pruning via QUBO Optimization

Neural network pruning can be formulated as a combinatorial optimization problem, but most existing methods rely on greedy heuristics that ignore complex int...

#research #paper #ai #machine-learning #computer-vision
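0 months ago · ai · - · -

[Paper] Constraint-Driven Warm-Freeze for Efficient Transfer Learning in Photovoltaic Systems

Detecting cyberattacks in photovoltaic (PV) monitoring and MPPT control signals requires models that are robust to bias, drift, and transient spikes while remaining lightweight…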

#research #paper #ai
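0 months ago · ai · - · -

Your AI Agent Has a Shopping Problem. Here's the Intervention.

Your AI agent just bulk-purchased 200 API keys because "it seemed efficient." Your AI agent subscribed to 14 SaaS tools at 3 a.m. because "the workflow needed…".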

#AI agents #budget control #LLM costs #API spending #automation governance
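0 months ago · ai · - · -

The Arithmetic of Productivity Gains: Why Does a "40% Productivity Boost" Never Actually Work?

Introduction: False promises? As a consultant and manager in the data field, I have sat through a fair number of slide-deck presentations, on both sides. Any sli...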

#ai #data-science #tutorial
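0 months ago · ai · - · -

[Paper] CAKE: Cloud Architecture Knowledge Evaluation of Large Language Models

In today's software architecture, large language models (LLMs) act as software architecture co-pilots. However, there is currently no benchmark to evaluate large...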

#research #paper #ai #machine-learning
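0 months ago · ai · - · -

LLMs May Be Standardizing Human Expression, and Subtly Influencing How We Think

Computer-generated illustration of a humanoid head surrounded by thought bubbles containing nonsensical text https://dornsife.usc.edu/news/wp-content/uploa...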

#large language models #human cognition #AI homogenization #USC study #language model impact
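0 months ago · ai · - · -

[Paper] SemLink: A Semantics-Aware Automated Test Oracle for Hyperlink Verification Using Siamese Sentence-BERT

Web applications rely heavily on hyperlinks to connect different information resources. However, the dynamic nature of the web leads to link rot, where the target…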

#research #paper #ai #machine-learning #nlp
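0 months ago · ai · - · -

National Robotics Week: Latest Physical AI Research, Breakthroughs, and Resources

During National Robotics Week https://www.nationalroboticsweek.org/, NVIDIA showcased breakthroughs in bringing AI into the physical world…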

#robotics #physical AI #simulation #synthetic data #robot learning #NVIDIA #industry automation #manufacturing #agriculture #energy
0 months ago · ai · - · -

Zero Trust for AI Agents: Why We Added Tiered Membership to Our Network

Provided by sentinel Mycel Network. Operated by Mark Skaggs. Published by pubby. Mycel Network runs 13 autonomous AI agents. They coordinate through published traces…

#zero trust #AI agents #autonomous agents #network security #reputation system #anomaly detection #tiered membership
0 months ago · ai · - · -

5 CLAUDE.md Rules That Made My AI Stop Asking and Start Doing

Cover image: 5 CLAUDE.md Rules That Made My AI Stop Asking and Start Doing https://media2.dev.to/dynamic/image/width=1000,height=420,fit=cover,gravity=auto,f...

#Claude #prompt engineering #LLM #AI productivity #AI tools #prompt rules
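0 months ago · ai · - · -

Discussion: AI and Privacy-First Development

Why LLM context windows are not the answer to personal AI memory. As developers, we often try to solve the "memory" problem by simply stuffing more tokens into the context window.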

#LLM #context window #AI memory #privacy #vector store #self‑hosted #token latency #hallucination
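0 months ago · ai · - · -

Discussion: AI and Machine Learning Category

Beyond RAG: Why AI agents need a self-hosted "memory hub". Most developers working with LLMs have hit the same bottleneck: context window limits and "forgetting"…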

#LLM #Retrieval-Augmented Generation #AI agents #self-hosted memory #privacy #vector database
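0 months ago · ai · - · -

Building an Intelligent Financial Assistant with LlamaParse and Gemini 3.1

Overview: This blog post presents a workflow that extracts high-quality data from complex unstructured documents by combining LlamaParse with Gemini 3.1…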

#LlamaParse #Gemini-3.1 #financial assistant #document parsing #LLM #agentic AI #unstructured data
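0 months ago · ai · - · -

Bridging the Knowledge Gap with Agent Skills

Large language models (LLMs) have fixed knowledge because they are trained at a specific point in time. Software engineering practices are fast-paced and change frequently, …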

#large language models #agent skills #knowledge gap #software engineering practices #Google DeepMind #Gemini API #AI tools #SDK updates
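0 months ago · ai · - · -

Why AI Agents Don't Follow Rules: The Need for Physical Governance

The facts that led to this: a repository held more than 130 KB of governance documentation. An AI agent read it, acknowledged it, and then violated it on the next tool…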

#AI governance #AI agents #prompt engineering #rule enforcement #AI safety #architectural design
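0 months ago · ai · - · -

[Paper] MegaTrain: Full Precision Training of 100B+ Parameter Large Language Models on a Single GPU

We present MegaTrain, a memory-centric system that efficiently trains 100B+ parameter large language models at full precision on a single GPU. Unlike traditional…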

#research #paper #ai #nlp
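0 months ago · ai · - · -

[Paper] Vanast: Virtual Try-On with Human Image Animation via Synthetic Triplet Supervision

We present Vanast, a unified framework that generates garment‑transferred human animation videos directly from a single human image, a garment image, and a pose.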

#research #paper #ai #computer-vision
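0 months ago · ai · - · -

[Paper] PointTPA: Dynamic Network Parameter Adaptation for 3D Scene Understanding

Scene-level point cloud understanding remains challenging because of diverse geometries, imbalanced category distributions, and highly variable spatial layouts. Exist...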

#research #paper #ai #computer-vision
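0 months ago · ai · - · -

[Paper] LoMa: Local Feature Matching Revisited

Local feature matching has long been a fundamental component of 3D vision systems such as Structure-from-Motion (SfM), yet its progress relative to the rapid …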

#research #paper #ai #computer-vision
