large-language-models — Page 13

排序:

2个月前 · ai · - · -

仅在1913年前文本上训练的LLMs

请提供您希望翻译的摘录或摘要文本，我才能为您进行翻译。

#large-language-models #historical-data #training-data #open-source #AI-research
2个月前 · ai · - · -

UC San Diego 实验室借助 NVIDIA DGX B200 系统推进生成式 AI 研究

加州大学圣地亚哥分校实验室利用 NVIDIA DGX B200 系统推动生成式 AI 研究 2025 年 12 月 17 日作者 Zoe Kessler https://blogs.nvidia.com/blog/author/zoekessler/

#generative AI #NVIDIA DGX B200 #large language models #LLM inference #UC San Diego #Hao AI Lab #AI hardware
2个月前 · ai · - · -

面向生产的高效上下文感知多代理框架架构

AI 代理开发的格局正在快速变化。我们已经超越了原型单轮聊天机器人。如今，组织正在部署复杂的、a...

#AI agents #context engineering #large language models #autonomous agents #production systems #workflow automation #LLM context windows
2个月前 · ai · - · -

AI的真正超能力：消费，而非创造

请提供您希望翻译的具体摘录或摘要文本，我才能为您进行简体中文翻译。

#artificial intelligence #large language models #AI productivity #AI consumption paradigm #generative AI
2个月前 · ai · - · -

为什么 Google 的新 Interactions API 对 AI 开发者而言如此重要

在过去的两年里，生成式 AI 开发的基本单元是“completion”。你向模型发送文本提示，它会返回文本，……

#Google #Interactions API #generative AI #large language models #LLM developers #AI APIs #stateful interactions #prompt engineering
2个月前 · ai · - · -

大型语言模型（如ChatGPT）实际工作原理（实用开发者指南）

🔍 什么是真正的 LLM？从本质上讲，LLM 是一个 next‑token 预测系统。给定一系列 token（词或词片），模型预测最…

#large language models #LLM #ChatGPT #next-token prediction #pre‑training #AI fundamentals #developer guide
2个月前 · ai · - · -

AdaSPEC：用于高效投机解码器的选择性知识蒸馏

引言 AdaSPEC 是一种新方法，通过使用小型草稿模型进行初始生成阶段，然后进行验证，以加速大语言模型。

#speculative decoding #knowledge distillation #large language models #inference acceleration #draft model #AdaSPEC #AI efficiency #model compression
2个月前 · ai · - · -

为你的LLMs设立护栏

!Forem 标志 https://media2.dev.to/dynamic/image/width=65,height=,fit=scale-down,gravity=auto,format=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%...

#LLM #guardrails #AI safety #prompt engineering #large language models
2个月前 · ai · - · -

学习反思：Kaggle 与 Google 的 5 天 AI Agents 密集课程

概述本提交回顾了 Google AI Agents Writing Challenge，并总结了我在 Kaggle 为期 5 天的 AI Agents Intensive 中的经历。

#Kaggle #AI agents #large language models #prompt engineering #tool integration #multi‑agent systems #agent evaluation #Google AI Agents Writing Challenge
2个月前 · ai · - · -

Nvidia 推出 Nemotron 3，采用混合 MoE 和 Mamba‑Transformer，推动高效的 agentic AI

Nvidia 推出了其前沿模型的新版本 Nemotron 3，采用了一种模型架构，全球最有价值的公司称其提供更多……

#Nvidia #Nemotron 3 #Mixture of Experts #Mamba-Transformer #agentic AI #large language models #AI efficiency
2个月前 · ai · - · -

构建用于生产的高效上下文感知多代理框架

AI 代理开发的格局正在快速变化。我们已经超越了原型阶段的单轮聊天机器人。今天，组织正在部署复杂的、……

#AI agents #context engineering #large language models #multi-agent systems #production AI #scalable AI #LLM context windows
2个月前 · ai · - · -

提示工程的终结：进入 Agent 控制时代

Prompt Engineering 的终结：进入 Agent Control 时代在过去的两年里，prompt engineering 是主要的焦点。它既有趣，又混乱，也充满创意……

#prompt engineering #AI agents #agent control #non-deterministic AI #large language models #generative AI

Newer posts

Older posts