
Get the AI Gateway running locally and deploy it to AWS in under 5 minutes.


By the end of this quickstart, you will have a fully operational LLM inference gateway on AWS that:

  • Accepts requests in both OpenAI and Anthropic API formats
  • Routes to multiple model providers (Bedrock, OpenAI, Anthropic, Google, Azure OpenAI)
  • Authenticates callers via Cognito M2M JWT tokens
  • Auto-scales on ECS Fargate behind an Application Load Balancer
  • Collects traces, metrics, and logs via OpenTelemetry
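Once deployed, sending a request in OpenAI chat-completions format can be sketched as below. The `/v1/chat/completions` path and the `<model-id>` placeholder are assumptions; `GATEWAY_URL` and `TOKEN` come from the deployment and token steps that follow.

```sh
# Hedged sketch of an OpenAI-format request through the gateway.
# The path and model ID are placeholders; adjust to your deployment.
BODY='{"model":"<model-id>","messages":[{"role":"user","content":"Hello"}]}'

# Only fire the request when the deployment values are available.
if [ -n "${GATEWAY_URL:-}" ] && [ -n "${TOKEN:-}" ]; then
  curl -s "$GATEWAY_URL/v1/chat/completions" \
    -H "Authorization: Bearer $TOKEN" \
    -H "Content-Type: application/json" \
    -d "$BODY"
fi
```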

Clone the repository and install the pinned toolchain:

```sh
git clone git@github.com:theagenticguy/ai-gateway.git
cd ai-gateway
# Install all tool versions defined in mise.toml
# (Python 3.13, Terraform 1.10.5, lefthook, checkov, trivy, hadolint, gitleaks)
mise install
```
Then install the Python dependencies and git hooks:

```sh
# Install Python dependencies
uv sync
# Install git hooks (pre-commit, pre-push, commit-msg)
lefthook install
```
Initialize Terraform with the dev backend configuration:

```sh
cd infrastructure
terraform init -backend-config=environments/dev.tfvars
```
Preview the changes:

```sh
terraform plan -var-file=environments/dev.tfvars
```
Apply to deploy:

```sh
terraform apply -var-file=environments/dev.tfvars
```
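Once the apply completes, the values the next steps need can be read back with `terraform output`. The output names below are assumptions; run `terraform output` with no arguments to list the real ones.

```sh
# Hypothetical output names; check `terraform output` for the actual list.
OUTPUTS="alb_dns_name cognito_token_endpoint"
for out in $OUTPUTS; do
  # -raw prints the bare value without quotes, convenient for shell capture
  echo "terraform output -raw $out"
done
```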

After deployment, Terraform outputs the ALB DNS name and Cognito token endpoint. You will need these to:

  1. Get a token — Use scripts/get-gateway-token.sh to obtain a JWT from Cognito
  2. Configure your agent — Point your AI coding agent at the gateway URL
  3. Start routing requests — Send LLM requests through the gateway to any provider
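The token step above (which scripts/get-gateway-token.sh automates) boils down to a standard OAuth2 client-credentials exchange against the Cognito token endpoint. A minimal sketch, with every endpoint and credential value as a placeholder:

```sh
# Hedged sketch of the client-credentials exchange behind
# scripts/get-gateway-token.sh. All values here are placeholders taken
# from your Terraform outputs and Cognito app client.
TOKEN_ENDPOINT="${TOKEN_ENDPOINT:-}"   # e.g. https://<domain>.auth.<region>.amazoncognito.com/oauth2/token
GRANT="grant_type=client_credentials"

if [ -n "$TOKEN_ENDPOINT" ] && [ -n "${CLIENT_ID:-}" ] && [ -n "${CLIENT_SECRET:-}" ]; then
  # Cognito answers with JSON like {"access_token":"...","expires_in":3600,...}
  TOKEN="$(curl -s -X POST "$TOKEN_ENDPOINT" \
    -u "$CLIENT_ID:$CLIENT_SECRET" \
    -H "Content-Type: application/x-www-form-urlencoded" \
    -d "$GRANT" \
    | python3 -c 'import json,sys; print(json.load(sys.stdin)["access_token"])')"
  export TOKEN
fi
```

The resulting `TOKEN` is what you pass as the `Authorization: Bearer` header on gateway requests.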

Learn more:

  • Prerequisites — Detailed tool and AWS account requirements
  • Authentication — How Cognito M2M auth works and how to get tokens
  • Agent Setup — Configure Claude Code, OpenCode, Goose, and more