Quick Start¶

Get up and running with AgentTel in minutes. Choose your integration path:

Path	Best For	Effort
Spring Boot Starter	Spring Boot applications	Add dependency + YAML config
Go SDK	Go services (net/http, Gin, gRPC)	`go get` + YAML config
Node.js SDK	Express / Fastify services	`npm install` + YAML config
Python SDK	FastAPI / Django / Flask services	`pip install` + YAML config
JavaAgent	Any JVM app (no code changes)	JVM flag + YAML config
Frontend SDK	Browser / SPA applications	`npm install` + init call
Agent SDK	AI agent observability	Add dependency + AgentTracer
Instrument Agent	AI-assisted setup in your IDE	Run MCP server, ask your agent

Try the Docker Demo¶

The fastest way to see AgentTel in action — starts a demo payment service with OTel Collector and Jaeger:

cd examples/spring-boot-example
docker compose -f docker/docker-compose.yml up --build

Or run the guided demo script (generates traffic and queries all MCP tools):

./docker-demo.sh

Dashboard	URL
Jaeger (traces)	http://localhost:16686
Swagger UI	http://localhost:8080/swagger-ui.html
MCP Tool Docs	http://localhost:8081/mcp/docs

Teardown: docker compose -f docker/docker-compose.yml down -v

Backend: Spring Boot¶

1. Add Dependencies¶

MavenGradle (Kotlin)Gradle (Groovy)

<dependencies>
    <!-- Core: span enrichment, baselines, anomaly detection, SLO tracking -->
    <dependency>
        <groupId>dev.agenttel</groupId>
        <artifactId>agenttel-spring-boot-starter</artifactId>
        <version>0.2.0-alpha</version>
    </dependency>

    <!-- Optional: GenAI instrumentation -->
    <dependency>
        <groupId>dev.agenttel</groupId>
        <artifactId>agenttel-genai</artifactId>
        <version>0.2.0-alpha</version>
    </dependency>

    <!-- Optional: Agent interface layer (MCP server, incident context, remediation) -->
    <dependency>
        <groupId>dev.agenttel</groupId>
        <artifactId>agenttel-agent</artifactId>
        <version>0.2.0-alpha</version>
    </dependency>
</dependencies>

dependencies {
    // Core: span enrichment, baselines, anomaly detection, SLO tracking
    implementation("dev.agenttel:agenttel-spring-boot-starter:0.2.0-alpha")

    // Optional: GenAI instrumentation
    implementation("dev.agenttel:agenttel-genai:0.2.0-alpha")

    // Optional: Agent interface layer (MCP server, incident context, remediation)
    implementation("dev.agenttel:agenttel-agent:0.2.0-alpha")
}

dependencies {
    // Core: span enrichment, baselines, anomaly detection, SLO tracking
    implementation 'dev.agenttel:agenttel-spring-boot-starter:0.2.0-alpha'

    // Optional: GenAI instrumentation
    implementation 'dev.agenttel:agenttel-genai:0.2.0-alpha'

    // Optional: Agent interface layer (MCP server, incident context, remediation)
    implementation 'dev.agenttel:agenttel-agent:0.2.0-alpha'
}

2. Configure Your Service¶

All enrichment is driven by YAML configuration — no code changes needed:

# application.yml
agenttel:
  # Topology: set once on the OTel Resource, associated with all telemetry
  topology:
    team: payments-platform
    tier: critical
    domain: commerce
    on-call-channel: "#payments-oncall"
  dependencies:
    - name: postgres
      type: database
      criticality: required
      timeout-ms: 5000
      circuit-breaker: true
    - name: stripe-api
      type: rest_api
      criticality: required
      fallback: "Return cached pricing"

  # Reusable operational profiles — reduce repetition across operations
  profiles:
    critical-write:
      retryable: false
      escalation-level: page_oncall
      safe-to-restart: false
    read-only:
      retryable: true
      idempotent: true
      escalation-level: notify_team

  # Per-operation baselines and decision metadata
  # Use bracket notation [key] for operation names with special characters
  operations:
    "[POST /api/payments]":
      profile: critical-write
      expected-latency-p50: "45ms"
      expected-latency-p99: "200ms"
      expected-error-rate: 0.001
      retryable: true               # overrides profile default
      idempotent: true
      runbook-url: "https://wiki/runbooks/process-payment"
    "[GET /api/payments/{id}]":
      profile: read-only
      expected-latency-p50: "15ms"
      expected-latency-p99: "80ms"

  baselines:
    rolling-window-size: 1000
    rolling-min-samples: 10
  anomaly-detection:
    z-score-threshold: 3.0

3. Optional: Annotate for IDE Support¶

Annotations are optional — YAML config above is sufficient. Use @AgentOperation when you want IDE autocomplete and compile-time validation:

@AgentOperation(profile = "critical-write")
@PostMapping("/api/payments")
public ResponseEntity<PaymentResult> processPayment(@RequestBody PaymentRequest req) {
    // Your business logic — spans are enriched automatically
}

Info

When both YAML config and annotations define the same operation, YAML config takes priority. Per-operation values override profile defaults.

4. Start the MCP Server (Optional)¶

// Expose telemetry to AI agents via MCP
McpServer mcp = new AgentTelMcpServerBuilder()
    .port(8081)
    .contextProvider(agentContextProvider)
    .remediationExecutor(remediationExecutor)
    .build();
mcp.start();

AI agents can now call tools like get_service_health, get_incident_context, list_remediation_actions, and execute_remediation over JSON-RPC.

5. What You Get¶

Resource attributes (set once per service, associated with all telemetry):

agenttel.topology.team         = "payments-platform"
agenttel.topology.tier         = "critical"
agenttel.topology.domain       = "commerce"
agenttel.topology.on_call_channel = "#payments-oncall"
agenttel.topology.dependencies = [{"name":"postgres","type":"database",...}]

Span attributes (per operation, only on operations with registered metadata):

agenttel.baseline.latency_p50_ms = 45.0
agenttel.baseline.latency_p99_ms = 200.0
agenttel.baseline.error_rate     = 0.001
agenttel.baseline.source         = "static"
agenttel.decision.retryable      = true
agenttel.decision.runbook_url    = "https://wiki/runbooks/process-payment"
agenttel.decision.escalation_level = "page_oncall"
agenttel.anomaly.detected        = false
agenttel.slo.budget_remaining    = 0.85

Incident context via MCP when something goes wrong:

=== INCIDENT inc-a3f2b1c4 ===
SEVERITY: HIGH
SUMMARY: POST /api/payments experiencing elevated error rate (5.2%)

## WHAT IS HAPPENING
Error Rate: 5.2% (baseline: 0.1%)
Latency P50: 312ms (baseline: 45ms)
Patterns: ERROR_RATE_SPIKE

## WHAT CHANGED
Last Deploy: v2.1.0 at 2025-01-15T14:30:00Z

## WHAT IS AFFECTED
Scope: operation_specific
User-Facing: YES
Affected Deps: stripe-api

## SUGGESTED ACTIONS
  - [HIGH] rollback_deployment: Rollback to previous version (NEEDS APPROVAL)
  - [MEDIUM] enable_circuit_breakers: Circuit break stripe-api

Backend: Go¶

Full-featured Go SDK with middleware for net/http, Gin, and gRPC.

1. Install¶

go get go.agenttel.dev/agenttel-go@latest

2. Configure¶

# agenttel.yml
agenttel:
  topology:
    service-name: payment-service
    team: payments-platform
    tier: critical
    domain: commerce
  operations:
    "POST /api/payments":
      expected-latency-p50: 45ms
      expected-latency-p99: 200ms
      retryable: true
  anomaly-detection:
    z-score-threshold: 3.0

3. Instrument¶

net/httpGingRPC

import (
    agenttel "go.agenttel.dev/agenttel-go"
    agmw "go.agenttel.dev/agenttel-go/middleware/http"
)

cfg, _ := agenttel.LoadConfigFile("agenttel.yml")
engine := agenttel.NewEngineBuilder(cfg).Build()

mux := http.NewServeMux()
// ... register handlers ...

handler := agmw.Middleware(mux,
    agmw.WithBaselineProvider(engine.BaselineProvider()),
    agmw.WithTopology(engine.TopologyRegistry()),
)
http.ListenAndServe(":8080", handler)

import agmw "go.agenttel.dev/agenttel-go/middleware/gin"

r := gin.Default()
r.Use(agmw.Middleware(engine.BaselineProvider(), engine.TopologyRegistry()))

import agmw "go.agenttel.dev/agenttel-go/middleware/grpc"

srv := grpc.NewServer(
    grpc.UnaryInterceptor(agmw.UnaryServerInterceptor(
        engine.BaselineProvider(), engine.TopologyRegistry(),
    )),
)

All spans are automatically enriched with topology, baselines, anomaly detection, and SLO tracking — identical attributes to the JVM SDK.

4. GenAI Instrumentation (Optional)¶

import "go.agenttel.dev/agenttel-go/genai"

builder := genai.NewSpanBuilder(tracer)
ctx, span := builder.StartChatSpan(ctx, "gpt-4o", "openai")
// ... make LLM call ...
genai.EndSpanSuccess(span, promptTokens, completionTokens, "gpt-4o")

5. Agentic Observability (Optional)¶

import "go.agenttel.dev/agenttel-go/agentic"

at := agentic.NewTracer(tracer).
    AgentName("incident-responder").
    AgentType(enums.AgentTypeSingle).
    Build()

ctx, inv := at.Invoke(ctx, "Diagnose high latency")
defer inv.End()

ctx, step := inv.Step(ctx, enums.StepTypeThought, "analyzing metrics")
defer step.End()

Compatibility¶

Component	Versions
Go	1.22+
OpenTelemetry SDK	1.33.0+
net/http, Gin, gRPC	Latest

Backend: Node.js¶

Full-featured Node.js SDK with middleware for Express and Fastify.

1. Install¶

npm install @agenttel/node @opentelemetry/api @opentelemetry/sdk-trace-node

2. Configure¶

# agenttel.yml
agenttel:
  topology:
    service-name: payment-service
    team: payments-platform
    tier: critical
    domain: commerce
  operations:
    "POST /api/payments":
      expected-latency-p50: 45ms
      expected-latency-p99: 200ms
      retryable: true
  anomaly-detection:
    z-score-threshold: 3.0

3. Instrument¶

ExpressFastify

import { AgentTelEngineBuilder, expressMiddleware, loadConfig } from '@agenttel/node';

const config = loadConfig('agenttel.yml');
const engine = new AgentTelEngineBuilder()
  .withTeam(config.topology?.team ?? 'payments-platform')
  .withTier(config.topology?.tier ?? 'critical')
  .build();

const app = express();
app.use(expressMiddleware({
  baselineProvider: engine.baselineProvider,
  topology: engine.topologyRegistry,
}));

import Fastify from 'fastify';
import { fastifyPlugin } from '@agenttel/node';

const fastify = Fastify();
fastify.register(fastifyPlugin, {
  baselineProvider: engine.baselineProvider,
  topology: engine.topologyRegistry,
});

All spans are automatically enriched with topology, baselines, anomaly detection, and SLO tracking — identical attributes to the JVM SDK.

4. GenAI Instrumentation (Optional)¶

import { GenAiSpanBuilder } from '@agenttel/node';

const builder = new GenAiSpanBuilder(tracer);
const span = builder.startChatSpan('gpt-4o', 'openai');
// ... make LLM call ...
GenAiSpanBuilder.endSpanSuccess(span, promptTokens, completionTokens, 'gpt-4o');

5. Agentic Observability (Optional)¶

import { AgentTracer, AgentType, StepType } from '@agenttel/node';

const at = new AgentTracer({
  agentName: 'incident-responder',
  agentType: AgentType.SINGLE,
});

const inv = at.invoke('Diagnose high latency');
const step = inv.step(StepType.THOUGHT);
step.end();
inv.end();

Compatibility¶

Component	Versions
Node.js	18+
TypeScript	5.0+
OpenTelemetry SDK	1.30.0+
Express, Fastify	Latest

Backend: Python¶

Full-featured Python SDK with FastAPI integration.

1. Install¶

pip install agenttel[fastapi]

# Optional extras
pip install agenttel[openai]       # OpenAI instrumentation
pip install agenttel[anthropic]    # Anthropic instrumentation
pip install agenttel[all]          # Everything

2. Configure¶

# agenttel.yml
agenttel:
  topology:
    service-name: payment-service
    team: payments-platform
    tier: critical
    domain: commerce
  operations:
    "POST /api/payments":
      expected-latency-p50: 45ms
      expected-latency-p99: 200ms
      retryable: true

3. Instrument¶

from fastapi import FastAPI
from agenttel.fastapi import instrument_fastapi

app = FastAPI()
instrument_fastapi(app)  # One-line integration

All spans are automatically enriched with topology, baselines, anomaly detection, and SLO tracking — identical attributes to the JVM SDK.

4. GenAI Instrumentation (Optional)¶

from agenttel.genai import instrument_openai, instrument_anthropic

# Wrap your LLM clients — all calls get traced with token/cost tracking
instrument_openai()      # Patches openai.ChatCompletion
instrument_anthropic()   # Patches anthropic.Anthropic

5. Agentic Observability (Optional)¶

from agenttel.agentic import AgentTracer, StepType

tracer = AgentTracer(agent_name="incident-responder", agent_type="single")

with tracer.invoke("Diagnose high latency") as inv:
    inv.step(StepType.THOUGHT, "analyzing metrics")
    inv.step(StepType.ACTION, "calling get_service_health")
    inv.step(StepType.OBSERVATION, "error rate elevated at 5.2%")

Compatibility¶

Component	Versions
Python	3.11+
FastAPI	0.100+
OpenTelemetry SDK	1.20.0+
Django, Flask	Coming soon

Zero-Code Mode (JavaAgent)¶

For applications where you cannot add a library dependency, use the AgentTel javaagent. No code changes, no Spring dependency — a single JAR that bundles the OTel agent with AgentTel enrichment:

java -javaagent:agenttel-javaagent.jar \
     -Dagenttel.config.file=agenttel.yml \
     -jar myapp.jar

The javaagent reads configuration from agenttel.yml (same YAML format as above), system properties (-Dagenttel.topology.team=payments), or environment variables (AGENTTEL_TOPOLOGY_TEAM=payments).

See the Zero-Code Mode Guide for full details including Docker/Kubernetes deployment.

Frontend: Browser SDK¶

Add agent-ready telemetry to your browser application with @agenttel/web.

1. Install¶

npm install @agenttel/web

2. Initialize¶

import { AgentTelWeb } from '@agenttel/web';

AgentTelWeb.init({
  appName: 'checkout-web',
  appVersion: '1.0.0',
  environment: 'production',
  collectorEndpoint: '/otlp',

  // Per-route baselines and decision metadata
  routes: {
    '/checkout/:step': {
      businessCriticality: 'revenue',
      baseline: {
        pageLoadP50Ms: 800,
        pageLoadP99Ms: 2000,
        apiCallP50Ms: 300,
      },
      decision: {
        escalationLevel: 'page_oncall',
        runbookUrl: 'https://wiki/runbooks/checkout',
      },
    },
    '/products': {
      businessCriticality: 'engagement',
      baseline: { pageLoadP50Ms: 500 },
    },
  },

  // Multi-step user journey tracking
  journeys: {
    checkout: {
      steps: ['/products', '/cart', '/checkout/shipping', '/checkout/payment', '/confirmation'],
      baseline: { completionRate: 0.65, avgDurationS: 300 },
    },
  },

  // Client-side anomaly detection
  anomalyDetection: {
    rageClickThreshold: 3,
    rageClickWindowMs: 2000,
    apiFailureCascadeThreshold: 3,
    errorLoopThreshold: 5,
    errorLoopWindowMs: 30000,
  },
});

3. What You Get¶

The SDK automatically instruments — no further code changes needed:

Category	What's Captured
Page Loads	DOM load time, TTFB, transfer size via Navigation Timing API
Navigation	SPA route changes with timing
API Calls	`fetch` and `XMLHttpRequest` interception with response status and duration
Interactions	Clicks and form submits with semantic target identification
Errors	JavaScript errors with error loop detection
Journeys	Step completion, abandonment, funnel drop-off rates
Anomalies	Rage clicks, API failure cascades, slow page loads, error loops
Correlation	W3C Trace Context (`traceparent`) injected on all outgoing requests for backend linking

Span attributes emitted by the SDK:

agenttel.client.page.route              = "/checkout/payment"
agenttel.client.page.business_criticality = "revenue"
agenttel.client.baseline.page_load_p50_ms = 800
agenttel.client.decision.escalation_level = "page_oncall"
agenttel.client.anomaly.detected        = true
agenttel.client.anomaly.pattern         = "rage_click"
agenttel.client.correlation.backend_trace_id = "abc123..."

4. Manual API (Optional)¶

const agenttel = AgentTelWeb.getInstance();

// Track custom interactions
agenttel.trackInteraction('custom', {
  target: 'button#submit',
  outcome: 'success',
  durationMs: 150,
});

// Control journeys manually
agenttel.startJourney('checkout');
agenttel.advanceJourney('checkout');
agenttel.completeJourney('checkout');

// Flush and shutdown
await agenttel.flush();
agenttel.shutdown();

Tip

The SDK uses data-agenttel-target attributes to identify interaction targets without capturing PII. Add data-agenttel-target="pay-button" to your HTML elements for clean span names.

Agent SDK¶

Add observability to your AI agent's lifecycle — invocations, reasoning steps, tool calls, orchestration patterns, cost tracking, and safety guardrails.

1. Add Dependency¶

MavenGradle (Kotlin)

<dependency>
    <groupId>dev.agenttel</groupId>
    <artifactId>agenttel-agentic</artifactId>
    <version>0.2.0-alpha</version>
</dependency>

implementation("dev.agenttel:agenttel-agentic:0.2.0-alpha")

2. Instrument Your Agent¶

Choose your integration style — programmatic, annotation, or YAML config:

Programmatic@AgentMethod AnnotationYAML Config

import io.agenttel.agentic.trace.AgentTracer;
import io.agenttel.agentic.AgentType;
import io.agenttel.agentic.StepType;

AgentTracer tracer = AgentTracer.create(openTelemetry)
    .agentName("incident-responder")
    .agentType(AgentType.SINGLE)
    .build();

try (AgentInvocation inv = tracer.invoke("Diagnose high latency")) {
    inv.step(StepType.THOUGHT, "Need to check service metrics");

    try (ToolCallScope tool = inv.toolCall("get_service_health")) {
        // call tool...
        tool.success();
    }

    inv.step(StepType.OBSERVATION, "Error rate elevated at 5.2%");
    inv.complete(true);
}

@AgentMethod(name = "incident-responder", type = "single", maxSteps = 100)
public IncidentReport diagnose(String incidentId) {
    // Automatically wrapped in AgentInvocation
    // On success: complete(true), on exception: span records error
    return analyzeAndRespond(incidentId);
}

# application.yml
agenttel:
  agentic:
    agents:
      incident-responder:
        type: single
        framework: custom
        max-steps: 100
        loop-threshold: 5

// Use @AgentMethod with name only — identity comes from YAML
@AgentMethod(name = "incident-responder")
public IncidentReport diagnose(String incidentId) {
    return analyzeAndRespond(incidentId);
}

3. What You Get¶

invoke_agent
  agenttel.agentic.agent.name        = "incident-responder"
  agenttel.agentic.agent.type        = "single"
  agenttel.agentic.invocation.goal   = "Diagnose high latency"
  agenttel.agentic.invocation.status = "success"
  agenttel.agentic.invocation.steps  = 3
  └── agenttel.agentic.step  (type=thought)
  └── agenttel.agentic.tool_call  (tool=get_service_health, status=success)
  └── agenttel.agentic.step  (type=observation)

Tip

With Spring Boot, just add the dependency — AgentTracer, AgentConfigRegistry, and AgentCostAggregator beans are created automatically. See the Agent Observability Guide for the full API.

Instrument Agent (IDE Tooling)¶

The instrument agent is a Python MCP server that helps AI assistants in your IDE (Cursor, Claude Code, VS Code) analyze your codebase and generate AgentTel configuration automatically.

1. Install¶

pip install agenttel-instrument

2. Configure¶

# instrument.yml
server:
  host: 0.0.0.0
  port: 8082
backend_mcp:
  host: localhost
  port: 8081
  timeout_seconds: 30.0

3. Run¶

agenttel-instrument --config instrument.yml

4. Use with Your IDE Agent¶

Once running, add it as an MCP server in your IDE and ask your AI assistant:

Prompt	What Happens
"Analyze my codebase and generate AgentTel config"	Scans endpoints, detects dependencies, generates `agenttel.yml`
"Instrument the frontend"	Detects React routes, infers criticality, generates SDK config
"Validate my instrumentation"	Cross-references config against source code, finds gaps
"Suggest improvements"	Finds missing baselines, uncovered endpoints, stale thresholds
"Apply low-risk improvements"	Auto-calibrates baselines from live health data

Available MCP Tools¶

Tool	Description
`analyze_codebase`	Scan Java/Spring Boot source — detect endpoints, dependencies, framework
`instrument_backend`	Generate backend config — dependencies, annotations, agenttel.yml
`instrument_frontend`	Generate frontend config — React route detection, SDK initialization
`validate_instrumentation`	Validate agenttel.yml completeness against source code
`suggest_improvements`	Detect missing baselines, uncovered endpoints, missing runbooks
`apply_improvements`	Auto-apply low-risk improvements using live health data
`apply_single`	Apply a single specific improvement

Examples¶

Example	Description	Run Command
Spring Boot Example	Payment service with span enrichment, topology, baselines, anomaly detection, and MCP server	`docker compose -f docker/docker-compose.yml up --build`
Go Service Example	Go payment service with net/http middleware, topology, baselines, anomaly detection	`cd examples/go-service-example && go run .`
Express Example	Node.js payment service with Express middleware, topology, baselines, anomaly detection	`cd examples/express-example && npm run dev`
LangChain4j Example	GenAI tracing with LangChain4j — chat spans, token tracking, and cost calculation	`./gradlew :examples:langchain4j-example:run`
React Checkout Example	React SPA with frontend telemetry — journey tracking, anomaly detection, cross-stack correlation	`cd agenttel-web/examples/react-checkout && npm start`

Next Steps¶

Architecture — understand how AgentTel works under the hood
MCP Server & Agent Tools — set up the agent interface layer
Attribute Dictionary — every attribute AgentTel adds to spans
Configuration Reference — all configuration properties