AI Agents

A topic hub collecting every article tagged AI Agents. Use it to explore related posts and follow this theme across the site.

35 articles

Explore More Topics

AI Agno Claude Code LLM MCP Cursor

AI Engineering AI Agents Developer Productivity Product

How AI Helps Engineers Evolve and Scale Modern Apps

AI can help teams move faster, but the real unlock is designing agents, skills, evals, traces, and self-correction loops around your app.

Jun 1, 2026 15 min read

Software Architecture System Design Engineering DevOps AI Agents

The Pieces of Modern, Effective Software Design

A practical ladder for growing a software system from one app and one database into observable, event-driven, permission-aware, AI-ready architecture.

Jun 1, 2026 18 min read

Auth0 CIMD MCP AI Agents OAuth

Auth0 CIMD for Modern Apps, MCP, and Agent Workflows

A practical guide to Auth0 Client ID Metadata Documents, Auth for MCP, on-behalf-of token exchange, and secure agent access patterns for modern web apps.

May 8, 2026 18 min read

OpenFGA Authorization MCP AI Agents Security

OpenFGA for Granular Permissions in Modern Apps, MCP, and Agent Workflows

A practical guide to using OpenFGA for fine-grained authorization in SaaS apps, MCP servers, workflow agents, and agent orchestrators.

May 8, 2026 32 min read

Agno LangWatch Scenario Evaluation AI Agents Testing LLMOps AgentOS

Agno Evals vs LangWatch Scenario: Native Agent Metrics or Simulated Agent Tests?

A source-backed comparison of Agno's latest native eval docs and LangWatch Scenario's simulation-based testing model, with practical guidance on when to use each and how to combine them.

Apr 28, 2026 15 min read

AI Security AI Agents Databases Authorization PostgreSQL

Preparing Databases for Secure AI Agents

AI agents feel like they can touch every system. The practical answer is not more trust in the model, but database roles, row-level policies, semantic layers, tool scopes, approval gates, and audit trails.

Apr 28, 2026 16 min read

AI Security AI Agents Secure Coding DevSecOps LLM

A Secure Agentic Coding Process Is Not Just a Bigger LLM

The current evidence does not support trusting model size alone for secure code generation. Secure agentic coding needs threat modeling, constrained tools, scanners, evals, and human approval gates.

Apr 28, 2026 20 min read

AI Agents Prompt Engineering LLM Software Architecture Evaluation Anthropic OpenAI

How to Write Robust System Prompts for AI Agents Across LLMs

There is no truly bulletproof system prompt. But there is a practical engineering standard for making prompts far more robust across Sonnet, Haiku, GPT, and reasoning-style models.

Apr 22, 2026 20 min read

AI Agents Web Standards MCP Cloudflare llms.txt Developer Infrastructure

Agent-Ready Infrastructure: How to Make Your Website Work With AI Agents

AI agents are the new crawlers. Learn how to signal discoverability, serve machine-readable content, control bot access, and expose capabilities through MCP — with practical code examples.

Apr 21, 2026 13 min read

Code Review AI Agents LLM Software Engineering Evaluation

How to Build a Good Agentic Code Reviewer

Most AI code review bots fail for a simple reason: they optimize for visible comments instead of reviewer trust. This guide pulls together current benchmarks, practitioner reports, product limitations, and design patterns for building a code review agent that is fast, quieter, less biased, and less hallucination-prone.

Apr 15, 2026 23 min read

AI Agents Agno AgentOS LangChain LangGraph FastAPI Python Architecture Production

Agno/AgentOS + FastAPI vs LangChain/LangGraph + FastAPI

A source-audited, side-by-side guide to choosing between Agno/AgentOS + FastAPI and LangChain/LangGraph + FastAPI for production Python agent backends.

Apr 7, 2026 26 min read

AI Agents Engineering Leadership Product Strategy MCP OpenAI Anthropic Google Cloud Microsoft

Leading High-Velocity Agentic Systems: From Internal Sandboxes to Member-Facing Betas

A practical guide for engineers and PMs on how to lead fast iteration cycles for agentic systems, design useful sandbox UIs and APIs, and graduate the best prototypes into production experiments and real betas.

Apr 7, 2026 13 min read

AI Agents A2UI Agent UI React TypeScript A2A MCP

How to Build Agent-Driven Interfaces with A2UI: From Zero to Advanced

A practical guide to A2UI with real deployments, official sample patterns, and complete code examples that take you from a static renderer to server-driven, interactive agent UI.

Apr 6, 2026 19 min read

AI Agents Anthropic OpenAI Computer Use Workflows TypeScript Agent UX

Persistent Agents: Dispatch, Cowork, Computer Use, and Always-On Workflows

Persistent agents are becoming product features, not just backend architecture. Learn how Dispatch, Cowork, computer use, and OpenAI background mode fit together, with complete TypeScript examples from zero to advanced.

Apr 6, 2026 18 min read

Astro Cloudflare D1 Cloudflare Containers Agno AgentOS Architecture Blog Platform AI Agents Cloudflare Workers

How to Build a Modern Blog App with Astro, Cloudflare D1, and Cloudflare Containers for Agno Agents

A practical guide for engineers and architects designing an Astro blog platform with draft/publish workflows in D1, a lightweight markdown editor, and a Cloudflare Containers deployment for an Agno/AgentOS research service.

Apr 2, 2026 26 min read

Spec Kit Cursor Beads Claude Code Git Worktrees Engineering Workflow AI Agents

From Spec to Parallel Delivery: Spec Kit, Cursor, Beads, and Claude Code on a Real Feature

A practical engineer-first guide to turning a messy enterprise feature into specs, acceptance criteria, Beads task graphs, git worktrees, and parallel delivery across Cursor and Claude Code.

Apr 2, 2026 16 min read

AI Agents LangChain LangGraph Agno AgentOS Architecture Production Python

12-Factor Agents in Practice: LangChain/LangGraph and Agno/AgentOS

A source-audited translation of HumanLayer's 12-factor agent principles into practical LangGraph and Agno/AgentOS architecture, with production-minded Python examples.

Mar 31, 2026 24 min read

AI Agents Learning Roadmap OpenAI Anthropic Google Cloud Microsoft Engineering

How to Learn Production-Grade AI Agent Engineering

A practical roadmap for software engineers who want to move from toy chatbots to production-grade AI agents, with the right study order, common gaps, and portfolio projects.

Mar 31, 2026 24 min read

LangWatch Agno AgentOS Evaluation AI Agents Testing Observability LLMOps

Implementing LangWatch Evaluations with Agno and AgentOS: From Zero to Production

A practical guide for implementing LangWatch evaluations in Agno and AgentOS systems, from first traces and batch experiments to structured-output scoring, production monitors, and background evaluation hooks.

Mar 31, 2026 17 min read

AI Agents Subagents Agno LangChain AI SDK Architecture Production TypeScript Python

Designing Subagents in the Cloud: Agno vs LangChain vs AI SDK

A practical, source-audited guide to designing subagents for real cloud workloads, with concrete patterns and tradeoffs across Agno, LangChain, and Vercel AI SDK.

Mar 30, 2026 14 min read

AI Agents OpenAI AI SDK Astro MCP UX Product Strategy Agent UI

From Chat to Agent UI: ChatKit, A2UI, and Structured Interaction Surfaces

Agent UI is becoming its own stack. Here's how ChatKit, A2UI, and MCP Apps fit together, where plain chat breaks down, and how to design structured interaction surfaces that actually help users get work done.

Mar 30, 2026 24 min read

AI SDK LangChain LangGraph Streaming AI Agents TypeScript Next.js Architecture

How to Stream LangChain and LangGraph into AI SDK

A source-audited, practical guide to building streaming APIs with LangChain and LangGraph, then consuming them cleanly with AI SDK from simple chat to durable agents.

Mar 30, 2026 19 min read

AI Agents Evaluation LLM Testing LLMOps Observability

LLM-as-a-Judge for Agent Apps: Biases, Blind Spots, and Fixes

LLM-as-a-judge can be one of the most useful patterns in agent evaluation, but only if you understand where it breaks: order bias, self-preference, verbosity bias, weak judges, and evidence-free scoring. This guide explains the pattern, the common traps, and the fixes that make it practical.

Mar 30, 2026 17 min read

AI Agents Evaluation Observability LangWatch LangSmith Langfuse LLMOps Testing

Trace Grading vs Scenario Testing: How to Evaluate Agents in Production

Why production agent evaluation is moving beyond output-only checks, how trace-aware grading complements scenario testing, and how LangWatch, LangSmith, and Langfuse compare.

Mar 30, 2026 12 min read

AI Agents Computer Use QA Browser Automation Playwright OpenAI Anthropic Stagehand

Computer-Use Agents for QA, Legacy Systems, and Browser Automation

A source-audited guide to where computer-use agents are already practical, where they still break, and how to deploy them safely for QA, legacy enterprise workflows, and browser automation.

Mar 24, 2026 12 min read

Cursor Claude Code Subagents AI Agents Code Review Developer Tools

Cursor Subagents and Claude Code /simplify: A Practical Workflow

A grounded look at how Cursor's subagents and skills fit with Claude Code's subagents, worktrees, and the new /simplify command for research, implementation, and cleanup.

Mar 24, 2026 10 min read

AI Agents Architecture Production AI SDK LangChain Agno AgentOS TypeScript Python

Stateful vs Stateless Agents in Production: AI SDK, LangChain, and Agno/AgentOS

A source-audited, architecture-first guide to deciding where agent state should live in production, with code examples in Vercel AI SDK, LangChain, and Agno/AgentOS.

Mar 24, 2026 16 min read

Astro Agno AI Agents Python Web Development Performance Islands Architecture Multi-Agent Workflows

Astro & Agno: Where Modern Web Performance Meets AI-Native Agent Design

A deep dive into why Astro is the best framework for fast, content-rich sites and why Agno is the simplest path to production-grade AI agents — from a Hello World agent to a full multi-agent workflow dashboard.

Mar 21, 2026 14 min read

Agno AgentOS OpenRouter Astro AI Agents Workflows Content Automation GitHub Claude Gemini GLM LangWatch Observability

How to Build an Autonomous Research-to-Astro Publishing Pipeline with Agno, AgentOS, and OpenRouter

A practical, multi-level guide to building an agent app that researches topics, drafts markdown articles, writes them into an Astro site, monitors reliability with LangWatch, and opens a GitHub PR automatically using Agno, AgentOS, and OpenRouter.

Mar 20, 2026 27 min read

Claude Code MCP AI Agents Cursor IDE Skills Orchestration Developer Tools

The Evolution of AI Agent Orchestration: System Prompts, Skills, MCP, and Plugins

How AI-assisted engineering workflows mature from a simple system prompt into skills, MCP tools, and full plugins — with real engineering examples at every stage.

Mar 19, 2026 19 min read

LangWatch AI Agents Evaluation LLM Testing Observability

Evaluating AI Agents with LangWatch: From Vibes to Scores

Unit tests tell you if your code works. Scenario tests tell you if your agent behaves. But how do you measure quality across hundreds of examples and track it over time? LangWatch evaluations fill that gap.

Mar 19, 2026 11 min read

LangWatch MCP TDD AI Agents Observability LLMOps Testing

LangWatch MCP: TDD Testing and Monitoring Inference Quality for AI Agents

A deep dive into LangWatch MCP Server — from basic setup to legendary architectural patterns for test-driven AI development and production inference monitoring.

Mar 18, 2026 14 min read

QA AI Agents MCP LiteLLM LangWatch Agno Claude Agent SDK Testing Observability LLMOps

Quality Assurance for Web Apps, AI Agents, and MCP Servers: A Complete Guide with LiteLLM, LangWatch, Agno & Claude Agent SDK

A comprehensive, multi-level deep dive into testing and quality assurance for modern AI-powered systems — from static web apps to agentic pipelines and MCP servers — using LiteLLM, LangWatch, Agno, AgentOS, and the Claude Agent SDK.

Mar 18, 2026 23 min read

LangWatch AI Agents Testing Scenario Skills TDD CI/CD

Testing AI Agent Skills with LangWatch Scenario: A Comprehensive Guide

From vibes to verification — how to build testable, reliable agent skills using LangWatch's simulation-based Scenario framework with multi-turn conversations, judges, and CI/CD integration.

Mar 18, 2026 17 min read

AI Agents OpenAI AI SDK Claude Code Agno FastAPI TypeScript Python

How to Build Modern Web App Agents

A deep comparison of four approaches to building AI agents — OpenAI with raw fetch, Vercel AI SDK, Claude Agent SDK, and Agno with FastAPI — and which one you should pick.

Mar 17, 2026 17 min read

Luis Mori Guerra

Recent Articles

Topics