code-context-agent¶

AI-powered CLI tool for automated codebase analysis and context generation.

code-context-agent uses Claude Opus 4.6 (via Amazon Bedrock) with 45+ tools to analyze unfamiliar codebases and produce structured context documentation for AI coding assistants. It combines semantic analysis (LSP), structural pattern matching (ast-grep), graph algorithms (NetworkX), git history analysis, and intelligent code bundling (repomix) to generate narrated markdown that helps developers and AI assistants understand a codebase's architecture and business logic.

Autonomous Agent

This CLI runs a fully autonomous AI agent loop. The agent decides which tools to invoke, what files to read, and what shell commands to run. While shell commands are restricted to a read-only allowlist and all inputs are validated, the agent makes its own decisions within those bounds. Review all generated output before using it in production.

AI-Generated Output

Generative AI can make mistakes. Review all output and monitor costs generated by your chosen AI model. Analysis of a single repository typically consumes 50K--500K input tokens and 10K--50K output tokens on Claude Opus 4.6.

Disclaimer

The author is an AWS employee. This is not an official AWS project or service. It is not maintained, supported, or endorsed by AWS. This project runs fully autonomous agent loops with access to your filesystem (read-only). You are solely responsible for any consequences of running this tool. The CLI and source code are provided AS IS without warranty of any kind. User discretion advised.

Key Capabilities¶

Capability	Description
45+ analysis tools	LSP, ast-grep, ripgrep, repomix, git history, NetworkX graph
Multi-language LSP	Python (ty), TypeScript, Rust, Go, Java with ordered fallback chains
Graph-based insights	Hotspots (betweenness centrality), foundations (PageRank/TrustRank), modules (Louvain/Leiden), triangle detection
Git-aware bundling	Embeds diffs, commit history, and coupling data in context bundles
Tree-sitter compression	Extracts signatures/types only, stripping function bodies for token efficiency
Structured output	Pydantic-typed `AnalysisResult` with ranked business logic, risks, and graph stats
Security hardened	Shell allowlist, input validation, path traversal prevention, CI security pipeline
Full mode	`--full` for exhaustive analysis with no size limits, fail-fast errors, and per-module output
MCP server	Expose graph algorithms and analysis as MCP tools for Claude Code, Cursor, and other agents

Architecture¶

flowchart TD
    A[CLI: cyclopts] --> B[run_analysis]
    B --> C[create_agent]
    C --> D[Strands Agent<br/>Opus 4.6 + adaptive thinking]
    D --> E[Jinja2 System Prompt]
    D --> F[HookProviders<br/>quality + efficiency + fail-fast]
    D --> G[AnalysisResult<br/>structured output]
    D --> H[Tool Execution]
    H --> I[Discovery<br/>ripgrep, repomix]
    H --> J[LSP<br/>ty, ts-server, rust-analyzer]
    H --> K[AST<br/>ast-grep patterns]
    H --> L[Graph<br/>NetworkX analysis]
    H --> M[Git<br/>coupling, churn, blame]
    H --> N[Shell<br/>bounded execution]
    H --> O[Output Files<br/>.code-context/ directory]

Quick Start¶

# Install
uv tool install code-context-agent

# Analyze a repository
code-context-agent analyze /path/to/repo

# Focus on a specific area
code-context-agent analyze . --focus "authentication system"

# Verify tool dependencies
code-context-agent check

# Exhaustive analysis (no size limits, fail-fast)
code-context-agent analyze . --full

# Full analysis focused on specific area
code-context-agent analyze . --full --focus "authentication"

See the Installation and Quick Start guides for details.

Output¶

All outputs are written to .code-context/ (or custom --output-dir):

File	Description
`CONTEXT.md`	Main narrated context (<=300 lines in standard mode)
`CONTEXT.orientation.md`	Token distribution tree
`CONTEXT.bundle.md`	Bundled source code (compressed)
`CONTEXT.signatures.md`	Signatures-only structural view
`files.all.txt`	Complete file manifest
`files.business.txt`	Curated business logic files
`code_graph.json`	Persisted graph data
`FILE_INDEX.md`	File index with graph metrics (complex repos)
`analysis_result.json`	Structured analysis result (Pydantic JSON)
`CONTEXT.modules/`	Per-module context files (full mode)
`CONTEXT.business.*.md`	Category-specific business logic (full mode)

Tech Stack¶

Component	Technology
Agent framework	Strands Agents
LLM	Claude Opus 4.6 via Amazon Bedrock
CLI	cyclopts
Prompt templates	Jinja2
Data models	Pydantic + pydantic-settings
Graph analysis	NetworkX
Terminal UI	Rich
Code search	ripgrep
Code bundling	repomix (Tree-sitter)
Pattern matching	ast-grep
Type checker / LSP	ty, typescript-language-server