AI agents — Page 2

2 weeks ago · ai · - · -

Why AI agent teams are just hoping their agents behave

The Problem Every trending AI project is giving agents more autonomy—running shell commands, browsing the web, calling APIs, moving money, even performing pene...

#AI agents #agent safety #prompt engineering #LangChain #OpenAI Agents SDK #autonomous AI #AI governance
2 weeks ago · software · - · -

OAuth Token Vault Patterns for AI Agents

OAuth Token Vault Patterns for AI Agents AI agents that access third‑party APIs on behalf of users GitHub, Slack, Google Calendar face a hard security problem:...

#oauth #token-vault #security #ai-agents #authentication #best-practices
2 weeks ago · ai · - · -

I stopped trusting AI agents to “do the right thing” - so I built a governance system

#AI agents #AI governance #LLM #prompt engineering #automation safety #trustworthy AI
2 weeks ago · ai · - · -

I Built a Persistent Memory API for AI Agents — Here's Why Vector Search Alone Isn't Enough

The Problem Every autonomous agent framework has the same silent failure: memory decay. Your agent works great on day 1. By week 3, it’s confidently using stal...

#AI agents #persistent memory #vector search #embeddings #memory decay #LLM #agent architecture
3 weeks ago · ai · - · -

Secure AI Agent Architecture

Introduction I’ve started writing an open book on the architecture of secure AI agents. The goal is to build a practical engineering reference — not a collecti...

#AI agents #secure AI #architecture #production engineering #observability #governance #open source
3 weeks ago · ai · - · -

AI Agent Memory Systems: How to Give Your AI Persistent Memory

Memory‑First AI Agents The biggest limitation of most AI setups isn’t intelligence — it’s memory. You can have the most powerful model in the world, but if it...

#AI agents #persistent memory #LLM #chatbot architecture #context management #prompt engineering #memory systems
3 weeks ago · ai · - · -

InformationWeek Says Control AI Agent Costs With Process. Here's Why That Won't Scale.

Overview InformationWeek recently published “A Practical Guide to Controlling AI Agent Costs Before They Spiral” https://www.informationweek.com/ai-or-machine-l...

#AI agents #cost management #token quotas #API usage #scalability #LLM expenses #operational AI
3 weeks ago · ai · - · -

AI-Powered Data Science Team for Accelerated Task Completion

#AI agents #data science automation #open-source #machine learning workflow #productivity #LLM-powered tools
3 weeks ago · ai · - · -

Troubleshooting AI Agent File Input Failures: A Guide to Robust Testing and Data Handling for LLM Applications

Why File Inputs Go Sideways for LLM Agents File input seems straightforward. It's just a file, right? For a human, yes. For an AI agent powered by a large lang...

#LLM #AI agents #file ingestion #data handling #testing #robustness
3 weeks ago · software · - · -

Building a Desktop Control Center for OpenClaw with Tauri and Rust

#Tauri #Rust #OpenClaw #desktop application #control center #AI agents #cross‑platform UI
3 weeks ago · ai · - · -

Your AI Agent Just Made a $50K Mistake. Can You Explain Why?

#AI agents #AI explainability #AI safety #incident analysis #Meta #decision tracing #LLM governance
3 weeks ago · ai · - · -

Your AI Agents Are Exploring Blind. Here's How to Give Them a Map.

The Exploration Tax In a multi‑agent workflow, every agent pays an exploration tax at the start of each session. Before it can do anything useful, it has to or...

#AI agents #autonomous agents #large language models #agent memory #indexing #prompt engineering #Claude #Gemini #Codex #multi‑agent workflows
3 weeks ago · ai · - · -

Why Your AI Agent Needs Memory

The Core Problem Most agent frameworks treat memory as an afterthought. They give your agent tools, prompts, and orchestration patterns — but when you restart...

#AI agents #memory #persistent state #retrieval #knowledge layer #LLM #Claude #GPT #Gemini #MCP
3 weeks ago · ai · - · -

Agent-to-agent pair programming

Overview What if you could let Claude and Codex work together as pair programmers, talking to each other directly? One acts as the main worker while the other...

#pair programming #AI agents #Claude #Codex #code review #multi-agent workflow #software development
3 weeks ago · ai · - · -

Query Live AI Inference Pricing with the ATOM MCP Server

Introduction If you've ever tried to compare LLM pricing across vendors you know how painful it is. One charges per token, another per character, another per r...

#LLM pricing #ATOM MCP server #AI inference cost #model pricing API #AI agents #token pricing normalization #live AI data source
3 weeks ago · ai · - · -

How to Build Production-Ready Multi-Agent Systems: Lessons from Running 8+ Agents

Everyone talks about AI agents. Few discuss what happens when you run 10, 50, or 100 of them simultaneously. After building and operating a multi‑agent system i...

#multi-agent systems #AI agents #orchestration #production deployment #scalability #LLM integration
3 weeks ago · software · - · -

Show HN: Nit – I rebuilt Git in Zig to save AI agents 71% on tokens

AI agents call git constantly—status, diff, log, show. I pulled data from 3,156 real coding sessions and git accounted for roughly 459 000 tokens of output, abo...

#git #zig #nit #AI agents #token optimization #libgit2 #command-line tools #performance
3 weeks ago · ai · - · -

Most AI agent systems fail within 48 hours of going live

Most AI agent systems fail within 48 hours of going live. Not because the code is bad, but because nobody thought about what happens when an agent times out at...

#AI agents #production AI #operational reliability #memory context #LLM deployment #AI ops
3 weeks ago · ai · - · -

Claude Can Use Your Computer Now. Here's How to Make It Verify Trust First.

The Problem A Claude Desktop agent that calls an external API is trusting that API implicitly. There's no verification, no trust score, no audit trail of what...

#Claude #Anthropic #computer use #AI agents #API trust #security #tool integration
3 weeks ago · ai · - · -

Anthropic's Claude Can Now Use Your Computer To Finish Tasks

New Claude Feature: Computer Use Anthropic is testing a new Claude feature that lets users send a request from their phone and have the AI carry it out directl...

#Anthropic #Claude #AI agents #computer automation #task automation #AI productivity
3 weeks ago · ai · - · -

Implementing a RAG system: Crawl

Introduction I'm starting a “Crawl, walk, run” series of posts on various topics and decided to begin with Retrieval‑Augmented Generation (RAG). In this phase we...

#retrieval-augmented generation #RAG #LLM #vector database #embeddings #knowledge base #AI agents #document chunking
3 weeks ago · ai · - · -

AI 102

Prompt A prompt is your instruction to the LLM—the text you write before you press send. Because the LLM doesn’t “understand” you the way a person would, it pa...

#prompt engineering #LLM #AI agents #tool chaining #workflow design #prompt design
3 weeks ago · ai · - · -

The three disciplines separating AI agent demos from real-world deployment

Getting AI agents to perform reliably in production — not just in demos — is turning out to be harder than enterprises anticipated. Fragmented data, unclear wor...

#AI agents #enterprise deployment #production AI #data virtualization #agent dashboards #autonomous AI #AI workflow management
3 weeks ago · ai · - · -

Claude Code can now take over your computer to complete tasks

Safety Measures Anthropic says it has safeguards in place to prevent common risks like prompt injection, and it will limit access to certain “off limits” apps...

#Claude #Anthropic #AI agents #computer control #prompt injection #AI safety #risk mitigation
3 weeks ago · it · - · -

How CVE-2026-25253 exposed every OpenClaw user to RCE — and how to fix it in one command

CVE‑2026‑25253: A Wake‑Up Call for Autonomous AI Agents. CVSS score: 8.8. Impact: any website could steal your OpenClaw auth token and achieve remote code execu...

#CVE-2026-25253 #OpenClaw #remote code execution #security vulnerability #AI agents #cybersecurity
0 month ago · ai · - · -

Stop Writing AI Agent Prompts Like It's 2023: The Framework That Makes Your OpenClaw Agent Actually Work

Your agent isn’t broken. Your SOUL.md is. I’ve deployed dozens of AI agents—WhatsApp bots, Telegram assistants, Discord helpers—you name it. For months I kept...

#AI agents #prompt engineering #OpenClaw #LEONIDAS framework #system prompts #LLM #agent consistency
0 month ago · ai · - · -

Day 16 – Designing Agent Prompts That Actually Work

Why Most Agents Fail: It’s Not the Model. Teams often blame weak models, bad tools, or missing memory. In practice, 70% of agent failures come from poor prompt...

#prompt engineering #agent prompts #AI agents #LLM behavior #role definition #decision policy #execution guide
1 month ago · software · - · -

I've spent 12 years putting Python inside museum walls. Now I'm putting AI agents inside sandboxes.

Introduction I've spent over a dozen years experimenting with Python in environments where it traditionally doesn't belong. From mobile app tooling to interact...

#python #ai-agents #sandbox #kivy #mobile-development #interactive-installations #museum-technology
1 month ago · ai · - · -

Understanding How AI Agents Work

#AI agents #automation #machine learning #intelligent decision‑making #agent‑based systems
1 month ago · ai · - · -

Context Engineering Has a Blind Spot

The biggest shift in agent design over the past year has been context engineering rather than improved models. Most of the published guidance focuses on codebas...

#context engineering #email data #AI agents #large language models #prompt engineering #enterprise AI
1 month ago · ai · - · -

The Deterministic Control Plane: Building Reliable AI Agents

#AI agents #deterministic control plane #reliability #guardrails #probabilistic AI #production AI
1 month ago · ai · - · -

Approval Gates: How to Make AI Agents Safe for Real-World Operations

AI agents with real‑world tool access (email, phone, browser, payments) are powerful but also dangerous. Without guardrails, an agent could send emails to custome...

#AI agents #safety #approval gates #human‑in‑the‑loop #automation control #tool access #AI security
1 month ago · ai · - · -

I built a cognitive layer for AI agents that learns without LLM calls

The problem Every time your agent starts a conversation, it starts from zero. Sure, you can stuff a summary into the system prompt, use RAG, or call Mem0 or Ze...

#AI agents #cognitive layer #local learning #LLM-free #AuraSDK #memory management #RAG alternatives #token cost reduction
1 month ago · ai · - · -

What we found when an AI audited an AI (real findings, no sanitising)

Most operators assume their agents are running efficiently. They're not. Not because anyone built them badly, but because nobody audits them. You build the thin...

#AI auditing #token waste #LLM efficiency #GPT-4 #Claude Sonnet #AI agents #cost optimization
1 month ago · ai · - · -

How to Stream AI Agent Responses in 5 Min

The Code python import asyncio from agents import Agent, Runner, function_tool from openai.types.responses import ResponseTextDeltaEvent @function_tool def loo...

#AI agents #streaming responses #OpenAI #tool calling #async Python
1 month ago · ai · - · -

We built a living canvas for our AI agent team — here's what that actually means

Most AI tools make their agents invisible. You kick off a job, wait, and get a result. Somewhere in between, agents did things—but you have no idea what, when,...

#AI agents #agent visualization #observability #Reflectt canvas #AI tooling #live monitoring
1 month ago · ai · - · -

EVAL #004: AI Agent Frameworks — LangGraph vs CrewAI vs AutoGen vs Smolagents vs OpenAI Agents SDK

Every week there's a new AI agent framework on Hacker News. The GitHub stars pile up, the demo videos look magical, and six months later half of them are abando...

#AI agents #LangGraph #CrewAI #AutoGen #Smolagents #OpenAI Agents SDK #framework comparison
1 month ago · ai · - · -

Chatbots, AI Agents, and Agentic AI: Understanding the Evolution of Intelligent Systems

Introduction Artificial Intelligence is rapidly transforming how software interacts with humans and performs tasks. Over the past few years three related conce...

#chatbots #AI agents #agentic AI #autonomous AI #conversational AI #intelligent systems #AI evolution
1 month ago · ai · - · -

I Built a Control Plane for My AI Agent — Because It Kept Making the Same Mistakes

I run a Claude agent 24/7. It writes code, deploys services, manages my side projects. Sounds cool, right? Except it kept doing dumb things. And I'd only find...

#AI agents #autonomous AI #control plane #Claude #agent safety #automation #devops integration
1 month ago · ai · - · -

What Is Agentic AI?

Agentic AI refers to AI systems that can take actions in pursuit of a goal rather than simply producing single responses. Capabilities of a...

#agentic AI #AI agents #language models #orchestration frameworks #tool integration #workflow automation #AI research
1 month ago · ai · - · -

I asked my AI agent to audit himself. He scored 62/100.

Introduction Before you sell something, you should make sure it actually works on yourself. That’s the rule I gave my agent — Gary Botlington IV — when we deci...

#AI agents #self‑audit #token optimization #LLM cost reduction #Claude Sonnet #prompt engineering #automation #agentic systems
1 month ago · ai · - · -

How API Data Bloat is Ruining Your AI Agents (And How I Cut Token Usage by 98% in Python)

The 50KB JSON Problem When your AI agent calls a tool—e.g., searching for a user profile in a database—the API often returns a massive JSON payload e.g., 40 KB...

#AI agents #token optimization #API data bloat #Python #prompt engineering #LLM efficiency
1 month ago · software · - · -

LedgerMind 3.0 → 3.3.2: How We Turned 'It Works' into 'It Works Brilliantly'

Spoiler: 497 commits, three sleepless nights with SQLite, and one very stubborn race condition that refused to die. Reading time: ~12 minutes · For: AI‑agent de...

#ledgermind #performance #race-condition #sqlite #ai-agents #engineering-drama #version-upgrade #reliability
1 month ago · ai · - · -

Add Cryptographic Identity to Your LangChain Agent in 5 Minutes

#LangChain #Agent Identity Protocol #cryptographic identity #AI agents #tool integration #security
1 month ago · ai · - · -

How to Build Your First AI Agent in 2026: A Practical Guide

How to build your first autonomous AI agent in 2026. The AI agent revolution is here—Anthropic released multi‑agent code review, OpenAI shipped Codex Security,...

#AI agents #autonomous agents #large language models #prompt engineering #tool integration #multi‑agent systems #OpenAI #Anthropic #NVIDIA
1 month ago · it · - · -

Digg shuts down for a 'hard reset' because it was flooded with bots

Shutdown Details Digg has shut down, for now, just a few months after its open beta launched. The company’s CEO, Justin Mezzell, explained on the home page tha...

#Digg #bots #AI agents #SEO spam #platform shutdown #social media #automation
1 month ago · ai · - · -

The Three Reliability Modes I See in Production AI Agents

Why Most AI Agents Fail in Production (And How to Fix It). After running autonomous agents in production for months, I've noticed a pattern: agents fail in predic...

#AI agents #production reliability #context decay #tool drift #objective misalignment #LLM deployment #autonomous agents #prompt engineering
1 month ago · ai · - · -

Optimizing Content for Agents

Just as useless of an idea as LLMs.txt was. It’s all dumb abstractions that AI doesn’t need, because AIs are as smart as humans, so they can just use what was alre...

#LLM #AI agents #content optimization #prompt engineering #context management
