agentao

CLAUDE.md

This file provides guidance to Claude Code (claude.ai/code) when working with code in this repository.

Package Management

Always use uv for package management, not pip:

# Install dependencies
uv sync

# Add a new dependency
uv add package-name

# Run Python scripts
uv run python script.py

# Run the CLI
uv run agentao
# or
uv run python main.py

Running and Testing

Start the Agent

# Quick start
./run.sh

# Or directly
uv run agentao

# Or via Python
uv run python main.py

Run Tests

# Run all tests with pytest
uv run python -m pytest tests/

# Run a specific test file
uv run python tests/test_imports.py
uv run python tests/test_tool_confirmation.py
uv run python tests/test_readchar_confirmation.py
uv run python tests/test_date_in_prompt.py

# All test files are in the tests/ directory

Configuration

Copy and edit .env from .env.example:

cp .env.example .env
# Edit .env with your API key and settings

Required: OPENAI_API_KEY
Optional: OPENAI_BASE_URL, OPENAI_MODEL
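A minimal sketch of how these variables could be validated at startup. The helper name load_settings and the returned keys are hypothetical, and the sketch reads a plain mapping rather than using python-dotenv; the actual loading code in the repository may differ.

```python
import os
from typing import Mapping, Optional


def load_settings(env: Optional[Mapping[str, str]] = None) -> dict:
    """Collect the OpenAI settings from an environment mapping.

    Hypothetical helper for illustration; not part of Agentao.
    """
    env = os.environ if env is None else env
    api_key = env.get("OPENAI_API_KEY")
    if not api_key:
        raise RuntimeError("OPENAI_API_KEY is required; set it in .env")
    return {
        "api_key": api_key,
        # Optional values stay None when unset so the caller picks defaults.
        "base_url": env.get("OPENAI_BASE_URL"),
        "model": env.get("OPENAI_MODEL"),
    }
```

Failing fast on a missing key gives a clearer error than a rejected API call later in the session.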

Architecture

Three-Layer Design

Agentao uses a Tool-Agent-CLI architecture:

CLI Layer (cli.py): User interface with Rich, handles commands, manages session state (like allow_all_tools)
Agent Layer (agent.py): Orchestrates LLM, tools, skills, and conversation history
Tool Layer (tools/): Individual tool implementations following the Tool base class

User → CLI → Agent → LLM + Tools
                  ↓
            SkillManager (loads from skills/)

Tool System

All tools inherit from the Tool base class (tools/base.py):

class MyTool(Tool):
    @property
    def name(self) -> str:
        return "my_tool"

    @property
    def description(self) -> str:
        return "Description for LLM"

    @property
    def parameters(self) -> Dict[str, Any]:
        return {...}  # JSON Schema

    @property
    def requires_confirmation(self) -> bool:
        return False  # True for dangerous operations

    def execute(self, **kwargs) -> str:
        return "Result"

Tool Registration: Tools are registered in agent.py::_register_tools(). The ToolRegistry converts them to OpenAI function calling format.
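The conversion to function calling format can be sketched as follows. The Tool class here is a stand-in for the one in tools/base.py, and EchoTool and to_openai_function are hypothetical names; only the output shape (type "function" wrapping name, description, and parameters) follows the OpenAI chat completions tools format.

```python
from abc import ABC, abstractmethod
from typing import Any, Dict


class Tool(ABC):
    """Stand-in for the Tool base class; the real one lives in tools/base.py."""

    @property
    @abstractmethod
    def name(self) -> str: ...

    @property
    @abstractmethod
    def description(self) -> str: ...

    @property
    @abstractmethod
    def parameters(self) -> Dict[str, Any]: ...

    @abstractmethod
    def execute(self, **kwargs) -> str: ...


class EchoTool(Tool):
    """Hypothetical example tool that returns its input."""

    @property
    def name(self) -> str:
        return "echo"

    @property
    def description(self) -> str:
        return "Return the given text unchanged"

    @property
    def parameters(self) -> Dict[str, Any]:
        # JSON Schema describing the arguments the LLM may pass.
        return {
            "type": "object",
            "properties": {"text": {"type": "string"}},
            "required": ["text"],
        }

    def execute(self, **kwargs) -> str:
        return kwargs["text"]


def to_openai_function(tool: Tool) -> Dict[str, Any]:
    """Build one entry for the `tools` list of a chat completions request."""
    return {
        "type": "function",
        "function": {
            "name": tool.name,
            "description": tool.description,
            "parameters": tool.parameters,
        },
    }
```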

Tool Confirmation: Tools with requires_confirmation=True (Shell, Web, File Writing) pause execution and prompt the user via the confirmation_callback passed in from the CLI.

Tools requiring confirmation:

run_shell_command - Shell command execution (allowlist for safe read-only commands)
web_fetch - Fetch web content (domain-tiered: allowlist/blocklist/ask)
google_web_search - Web search
write_file - File writing/overwriting (prevents data loss)

Domain-Based Permissions (web_fetch): The PermissionEngine supports "domain" rules with allowlist/blocklist matching. Default presets auto-allow trusted docs sites (.github.com, .docs.python.org, etc.) and auto-deny SSRF targets (localhost, 127.0.0.1, 169.254.169.254, etc.). Customizable via .agentao/permissions.json. See docs/features/TOOL_CONFIRMATION_FEATURE.md for details.
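A rough sketch of allowlist/blocklist matching for a URL. The function names, the rule that a leading dot covers the domain and its subdomains, and the choice to check the blocklist first are all assumptions made for this example, not the PermissionEngine's documented behavior.

```python
from urllib.parse import urlparse


def _matches(host: str, pattern: str) -> bool:
    # Assumption: a leading dot means "this domain and any subdomain".
    if pattern.startswith("."):
        return host == pattern[1:] or host.endswith(pattern)
    return host == pattern


def decide(url: str, allowlist: list, blocklist: list) -> str:
    """Return "deny", "allow", or "ask" for a URL."""
    host = (urlparse(url).hostname or "").lower()
    # Assumption: the blocklist is checked first so a deny rule always wins.
    if any(_matches(host, p) for p in blocklist):
        return "deny"
    if any(_matches(host, p) for p in allowlist):
        return "allow"
    return "ask"
```

Matching on the parsed hostname rather than the raw URL string keeps a path such as /github.com from satisfying an allow rule.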

Skills System

Dynamic Loading: Skills are auto-discovered from the skills/ directory. Each subdirectory contains:

SKILL.md - Main file with YAML frontmatter (name:, description:)
reference/*.md (optional) - Additional documentation loaded on-demand

Skill Manager (skills/manager.py):

Parses YAML frontmatter from SKILL.md files
Maintains available_skills dict (all skills)
Maintains active_skills dict (currently activated)
Injects active skill context into system prompt
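Frontmatter parsing can be illustrated with a small standard-library sketch. parse_frontmatter is a hypothetical name and it handles only flat key: value pairs; the real manager may well hand the header to a YAML parser instead.

```python
from typing import Dict, Tuple


def parse_frontmatter(text: str) -> Tuple[Dict[str, str], str]:
    """Split a SKILL.md string into (metadata, body).

    Only flat `key: value` lines are understood in this sketch.
    """
    lines = text.splitlines()
    if not lines or lines[0].strip() != "---":
        return {}, text
    meta: Dict[str, str] = {}
    for i, line in enumerate(lines[1:], start=1):
        if line.strip() == "---":
            # Everything after the closing delimiter is the skill body.
            body = "\n".join(lines[i + 1:])
            return meta, body.lstrip("\n")
        key, sep, value = line.partition(":")
        if sep:
            meta[key.strip()] = value.strip()
    return {}, text  # no closing delimiter: treat the file as plain markdown
```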

Activation: Use activate_skill tool or /skills command. Active skills add their documentation to the system prompt.

System Prompt Composition

The system prompt is dynamically built in agent.py::_build_system_prompt():

AGENTAO.md (if exists in cwd) - Project-specific instructions
Agent Instructions - Base Agentao capabilities
Current Date/Time - Auto-injected: YYYY-MM-DD HH:MM:SS (Day)
Available Skills - List with descriptions
Active Skills Context - Full documentation of activated skills

This composition happens on every chat() call to keep skills context fresh.
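The ordering above can be sketched as a plain function. The signature, the section labels, and the use of the full weekday name for (Day) are assumptions for illustration; _build_system_prompt() in agent.py may format things differently.

```python
from datetime import datetime
from pathlib import Path
from typing import Dict, Optional


def build_system_prompt(
    base_instructions: str,
    available_skills: Dict[str, str],  # name -> description
    active_skills: Dict[str, str],     # name -> full documentation
    cwd: Path,
    now: Optional[datetime] = None,
) -> str:
    """Assemble the prompt sections in the documented order."""
    now = now or datetime.now()
    parts = []
    project_file = cwd / "AGENTAO.md"
    if project_file.exists():
        parts.append(project_file.read_text(encoding="utf-8"))
    parts.append(base_instructions)
    # YYYY-MM-DD HH:MM:SS (Day); the full weekday name is an assumption.
    parts.append("Current Date/Time: " + now.strftime("%Y-%m-%d %H:%M:%S (%A)"))
    if available_skills:
        listing = "\n".join(
            f"- {name}: {desc}" for name, desc in sorted(available_skills.items())
        )
        parts.append("Available Skills:\n" + listing)
    for name, doc in sorted(active_skills.items()):
        parts.append(f"Active Skill: {name}\n{doc}")
    return "\n\n".join(parts)
```

Because the function is rebuilt from its inputs on every call, activating a skill between two turns changes the next prompt without any cache to invalidate.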

Conversation Flow

# agent.py::chat()
1. User message added to self.messages
2. System prompt built (includes AGENTAO.md, date, skills)
3. LLM called with messages + tools
4. Loop (max 100 iterations):
   a. If the response has no tool_calls: return it as the final response
   b. Otherwise, for each tool call:
      - Check requires_confirmation
      - Call confirmation_callback if needed
      - Execute tool or cancel based on response
   c. Add tool results to messages
   d. Call LLM again with updated messages
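A self-contained sketch of this loop, with the LLM and tools passed in as plain callables so it can run without a network. chat_loop and its parameters are hypothetical, and the message dictionaries only approximate the OpenAI chat format; the real chat() method also rebuilds the system prompt and logs each step.

```python
import json
from typing import Any, Callable, Dict, List, Optional

Message = Dict[str, Any]


def chat_loop(
    messages: List[Message],
    call_llm: Callable[[List[Message]], Message],
    tools: Dict[str, Callable[..., str]],
    needs_confirmation: Callable[[str], bool],
    confirm: Optional[Callable[[str, Dict[str, Any]], bool]] = None,
    max_iterations: int = 100,
) -> str:
    """Run the tool loop until the model answers without tool calls."""
    for _ in range(max_iterations):
        reply = call_llm(messages)
        messages.append(reply)
        tool_calls = reply.get("tool_calls") or []
        if not tool_calls:
            return reply.get("content") or ""
        for call in tool_calls:
            name = call["function"]["name"]
            args = json.loads(call["function"]["arguments"])
            # Ask the user only for tools flagged as needing confirmation.
            if needs_confirmation(name) and confirm and not confirm(name, args):
                result = "Tool execution cancelled by user"
            else:
                result = tools[name](**args)
            messages.append(
                {"role": "tool", "tool_call_id": call["id"], "content": result}
            )
    # Hypothetical fallback text; the source does not say what is returned here.
    return "Stopped: reached the iteration limit"
```

A cancelled call still produces a tool message, so the model sees that the tool did not run and can respond accordingly.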

Logging System

Complete LLM interaction logging to agentao.log:

Every request/response (full content, no truncation)
All tool calls with formatted JSON arguments
Tool results
Token usage
Timestamps

Logger is in llm/client.py. To debug tool execution or LLM behavior, check this log file.

CLI Commands

User commands (start with /):

/clear - Clears history AND resets allow_all_tools to False
/reset-confirm - Resets allow_all_tools only (keeps history)
/status - Shows message count, model, active skills, confirmation mode
/model [name] - List models or switch to specified model
/skills - List available/active skills
/memory - Show saved memories
/mcp - List MCP servers and tools
/help - Show help

Session state allow_all_tools persists across tool confirmations within one session.

MCP (Model Context Protocol) System

Agentao supports connecting to external MCP servers that provide additional tools.

Configuration: .agentao/mcp.json (project) and <home>/.agentao/mcp.json (global):

{
  "mcpServers": {
    "server-name": {
      "command": "npx",
      "args": ["-y", "@modelcontextprotocol/server-filesystem", "/path"],
      "env": { "TOKEN": "$MY_TOKEN" },
      "trust": false
    },
    "remote-server": {
      "url": "https://api.example.com/sse",
      "headers": { "Authorization": "Bearer $API_KEY" },
      "timeout": 30
    }
  }
}

Transport types: command (stdio subprocess) or url (SSE).

Tool naming: MCP tools are registered as mcp_{server}_{tool} (e.g. mcp_github_create_issue).
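The $VAR placeholders in the config and the naming scheme can be sketched with the standard library. expand_env and mcp_tool_name are hypothetical names, and leaving unknown variables untouched is an assumption about how agentao/mcp/config.py behaves.

```python
from string import Template
from typing import Any, Mapping


def expand_env(value: Any, env: Mapping[str, str]) -> Any:
    """Recursively replace $VAR / ${VAR} in every string of a config value.

    Unknown variables are left as written (safe_substitute behaviour);
    whether Agentao does the same is an assumption.
    """
    if isinstance(value, str):
        return Template(value).safe_substitute(env)
    if isinstance(value, dict):
        return {k: expand_env(v, env) for k, v in value.items()}
    if isinstance(value, list):
        return [expand_env(v, env) for v in value]
    return value  # numbers, booleans, None pass through unchanged


def mcp_tool_name(server: str, tool: str) -> str:
    """Apply the mcp_{server}_{tool} naming scheme."""
    return f"mcp_{server}_{tool}"
```

In practice the env argument would be os.environ; passing it explicitly keeps the function easy to test.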

Architecture:

.agentao/mcp.json → McpConfig → McpClientManager → McpClient (per server)
                                                          ↓
                                                   list_tools() / call_tool()
                                                          ↓
                                                   McpTool(Tool) → ToolRegistry

Key files:

agentao/mcp/config.py - Config loading, env var expansion
agentao/mcp/client.py - McpClient (single server), McpClientManager (multi-server)
agentao/mcp/tool.py - McpTool wrapper adapting MCP tools to Tool base class

Async bridge: MCP SDK is async-only; McpClientManager uses a dedicated event loop with run_until_complete() to bridge into sync Agentao code.
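The bridging idea can be shown in a few lines of asyncio. SyncBridge and fake_list_tools are hypothetical names; the sketch shows only the dedicated-loop pattern, not McpClientManager itself.

```python
import asyncio
from typing import Any, Coroutine


class SyncBridge:
    """Owns one event loop and runs coroutines on it to completion."""

    def __init__(self) -> None:
        self._loop = asyncio.new_event_loop()

    def run(self, coro: Coroutine[Any, Any, Any]) -> Any:
        # Blocks the calling (sync) thread until the coroutine finishes.
        return self._loop.run_until_complete(coro)

    def close(self) -> None:
        self._loop.close()


async def fake_list_tools() -> list:
    """Stand-in for an async MCP SDK call."""
    await asyncio.sleep(0)
    return ["create_issue", "list_issues"]
```

Reusing one loop matters because asyncio.run() creates and closes a fresh loop on every call, which would not suit connections that must stay open between calls.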

CLI: /mcp list, /mcp add <name> <command|url>, /mcp remove <name>

Adding New Components

Adding a Tool

Create tool class in agentao/tools/<module>.py
Implement Tool interface (name, description, parameters, execute)
Set requires_confirmation=True if dangerous:
- Shell commands (arbitrary execution)
- Web access (network requests, privacy)
- File writing/overwriting (data loss risk)
- File deletion (irreversible)

Register the tool in agent.py:

from .tools.mymodule import MyTool

# In _register_tools():
tools_to_register.append(MyTool())

Adding a Skill

Create directory: skills/my-skill/

Create SKILL.md with YAML frontmatter:

---
name: my-skill
description: Use when... (trigger conditions)
---

# Skill Documentation
...

(Optional) Add reference/*.md files for on-demand loading
Restart agent - skill auto-discovered

Reference files are loaded only when the skill is activated (saves memory).

Important Patterns

Confirmation Callback Pattern

CLI creates callback and passes to Agent:

# cli.py
def confirm_tool_execution(self, name, desc, args) -> bool:
    # Show menu, get user choice
    # Return True to run the tool, False to cancel
    ...

# When constructing the agent:
self.agent = Agentao(
    confirmation_callback=self.confirm_tool_execution
)

Agent checks before tool execution:

# agent.py
if tool.requires_confirmation and self.confirmation_callback:
    confirmed = self.confirmation_callback(name, desc, args)
    if not confirmed:
        result = "Tool execution cancelled by user"

Single-Key Input

Uses readchar library for instant response:

import readchar
key = readchar.readkey()  # No Enter needed

Supports: 1, 2, 3, Esc, Ctrl+C. Invalid keys are silently ignored.

Memory System

Architecture: SQLite-backed storage managed by MemoryManager (agentao/memory/manager.py).

SQLite databases:

Database       Path                       Content
Project store  .agentao/memory.db         Project-scoped persistent memories + session summaries
User store     <home>/.agentao/memory.db  Cross-project user-scoped persistent memories

Three data types:

Persistent memories (MemoryRecord) — rows in the memories table. Soft-deleted (never physically removed). Scoped to user or project. Types: preference, profile, project_fact, workflow, decision, constraint, note. Source: explicit (LLM-written) or auto/crystallized. Fields: id, scope, type, key_normalized, title, content, tags, keywords, source, confidence, sensitivity, created_at, updated_at, deleted_at.
Session summaries (SessionSummaryRecord) — rows in the session_summaries table. Written by the context-compression pipeline (microcompaction / full LLM summarization) to preserve conversation continuity across compaction events. Scoped to a session_id.
Recall candidates (RecallCandidate) — transient, in-memory only. Scored at query time by MemoryRetriever using a keyword/Jaccard/tag/recency formula. Never stored.
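To make the keyword/Jaccard/tag/recency idea concrete, here is a small scoring sketch. The function names, the weights (0.6 / 0.3 / 0.1), and the 30-day decay are invented for illustration; the formula MemoryRetriever actually uses is not given here.

```python
import math
import re
from typing import Set


def tokens(text: str) -> Set[str]:
    """Lowercase alphanumeric words of a string."""
    return set(re.findall(r"[a-z0-9]+", text.lower()))


def jaccard(a: Set[str], b: Set[str]) -> float:
    """Size of the intersection over size of the union; 0.0 if both are empty."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def score(query: str, content: str, tags: Set[str], age_days: float) -> float:
    """Blend text overlap, tag hits, and recency into one number.

    The weights and the decay constant are made up for this sketch.
    """
    q = tokens(query)
    text_sim = jaccard(q, tokens(content))
    tag_sim = len(q & {t.lower() for t in tags}) / len(tags) if tags else 0.0
    recency = math.exp(-age_days / 30.0)  # 1.0 for brand-new, decays with age
    return 0.6 * text_sim + 0.3 * tag_sim + 0.1 * recency
```

Candidates would be sorted by this score and the top k rendered into the prompt; nothing needs to be stored because the score depends on the current query.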

Prompt injection (per turn, two blocks):

<memory-stable> — rendered by MemoryPromptRenderer.render_stable_block(): stable persistent memories only (budget-limited, selection policy applied). Session summaries are intentionally excluded — they already live in the conversation message history as [Conversation Summary] blocks.
<memory-context> — rendered by render_dynamic_block(): top-k recall candidates scored against the current user message.

LLM tool (write-only):

save_memory(key, value, tags?) — the only memory tool exposed to the LLM

CLI commands (full management):

/memory / /memory list — list all entries
/memory search <query> — keyword search across title, value, tags
/memory tag <tag> — filter by tag
/memory user / /memory project — show a single scope
/memory delete <key> — soft-delete by title
/memory clear — soft-delete all entries + clear session summaries (with confirmation)
/memory session — show current session summary
/memory status — entry counts, session size, archive count
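Soft deletion, as used by /memory delete and /memory clear, can be sketched with sqlite3. The schema below is a reduced, hypothetical version of the memories table (only a few of the listed fields), and soft_delete and list_live are illustrative names rather than MemoryManager methods.

```python
import sqlite3
from datetime import datetime, timezone

# Reduced schema for illustration; the real table has more columns.
SCHEMA = """
CREATE TABLE memories (
    id INTEGER PRIMARY KEY,
    title TEXT NOT NULL,
    content TEXT NOT NULL,
    deleted_at TEXT
)
"""


def soft_delete(conn: sqlite3.Connection, title: str) -> int:
    """Mark live rows with this title as deleted; return how many changed."""
    now = datetime.now(timezone.utc).isoformat()
    cur = conn.execute(
        "UPDATE memories SET deleted_at = ? WHERE title = ? AND deleted_at IS NULL",
        (now, title),
    )
    conn.commit()
    return cur.rowcount


def list_live(conn: sqlite3.Connection) -> list:
    """Return titles of rows that have not been soft-deleted."""
    rows = conn.execute(
        "SELECT title FROM memories WHERE deleted_at IS NULL ORDER BY id"
    )
    return [r[0] for r in rows]
```

The row stays in the table after deletion, so listings must filter on deleted_at IS NULL.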

Separation of concerns: The LLM can only write (save_memory). Search, delete, and clear are CLI-only operations that call MemoryManager methods directly — they are never exposed to the LLM as callable tools.

See docs/features/memory-management.md for detailed documentation.

File Organization

agentao/
├── agentao/            # Main package
│   ├── agent.py        # Core orchestration
│   ├── cli.py          # CLI interface with Rich
│   ├── llm/
│   │   └── client.py   # OpenAI client wrapper
│   ├── tools/          # Tool implementations
│   │   ├── base.py     # Tool base class + registry
│   │   ├── file_ops.py # Read, write, edit, list
│   │   ├── search.py   # Glob, grep
│   │   ├── shell.py    # Shell execution
│   │   ├── web.py      # Fetch, search
│   │   ├── memory.py   # Persistent memory
│   │   ├── agents.py   # Helper agents
│   │   └── skill.py    # Skill activation
│   ├── mcp/            # MCP (Model Context Protocol) support
│   │   ├── config.py   # Config loading + env var expansion
│   │   ├── client.py   # McpClient + McpClientManager
│   │   └── tool.py     # McpTool wrapper for Tool interface
│   └── skills/
│       └── manager.py  # Skill loading + management
├── skills/             # Skill definitions (SKILL.md files)
├── tests/              # Test files (test_*.py)
├── docs/               # Documentation
│   ├── features/       # Feature documentation
│   ├── updates/        # Update logs
│   ├── implementation/ # Technical implementation details
│   └── dev-notes/      # Development notes (archived)
├── CLAUDE.md           # Claude Code guidance
├── AGENTAO.md          # Project-specific instructions
└── main.py             # Entry point

Key Dependencies

openai - LLM client (OpenAI-compatible APIs)
rich - CLI interface (markdown, panels, prompts)
readchar - Single-key input (no Enter needed)
httpx - HTTP client for web tools
beautifulsoup4 - HTML parsing
python-dotenv - Environment configuration
mcp - Model Context Protocol client SDK
