Jarvis Registry | Governed AI | 2 min 51 sec | Jul 2026

Enterprise AI Observability | Track LLM Token Usage with Jarvis Registry

As AI agents scale across planning, reasoning, tool calling, and cross-workflow collaboration, token usage becomes harder to monitor and connect to business outcomes. Jarvis Registry introduces LLM token usage tracking that captures consumption across agents, chat, and MCP context automatically, so teams finally know where tokens go, which agents drive usage, and whether the spend justifies the ROI.

Every agent registered in Jarvis Registry has its token usage tracked automatically, whether it is invoked through chat, API, or workflow. In this demo, a Bedrock spending agent is called through a natural-language chat request, and Jarvis Registry captures its LLM token usage in real time, showing a call that consumed 12.8K tokens alongside a full history of prior agent usage.

Token data is exported through standard OpenTelemetry, making it ready to visualize in Grafana, Datadog, New Relic, or any OTEL-compatible platform. Beyond token counts, Jarvis Registry gives engineering teams visibility across the full agentic flow, including tool calls, traces, metrics, authentication activity, latency, and execution status, turning every step of an agent's run into something measurable and accountable.
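
To make the OpenTelemetry export more concrete, here is a minimal sketch of how a single call's token usage could be expressed as OTEL-style attributes. The attribute keys follow the OpenTelemetry generative AI semantic conventions, which are still evolving; whether Jarvis Registry emits exactly these keys is an assumption, not something the demo confirms. The TokenUsage class, the agent and model names, and the input/output split are all hypothetical (only the 12.8K total comes from the demo). The sketch uses the standard library only, so it shows the data shape rather than a real exporter.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class TokenUsage:
    """One LLM call's token usage (hypothetical record shape)."""
    agent: str          # hypothetical agent identifier
    model: str          # hypothetical model identifier
    input_tokens: int
    output_tokens: int


def to_otel_attributes(usage: TokenUsage) -> dict[str, object]:
    """Map a usage record onto OpenTelemetry GenAI-style attribute names.

    Assumption: these keys mirror the OTel GenAI semantic conventions;
    the keys Jarvis Registry actually emits may differ.
    """
    return {
        "gen_ai.agent.name": usage.agent,
        "gen_ai.request.model": usage.model,
        "gen_ai.usage.input_tokens": usage.input_tokens,
        "gen_ai.usage.output_tokens": usage.output_tokens,
    }


# Illustrative values: the 9,600 / 3,200 split is invented and sums to 12,800.
usage = TokenUsage("bedrock-spending-agent", "example-model", 9_600, 3_200)
attrs = to_otel_attributes(usage)
total_tokens = attrs["gen_ai.usage.input_tokens"] + attrs["gen_ai.usage.output_tokens"]
print(attrs, total_tokens)
```

Keeping the attribute names vendor-neutral is what lets the same record land in Grafana, Datadog, or New Relic without per-platform mapping.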

What you'll learn

How Jarvis Registry automatically tracks LLM token usage across chat, API, and workflow invocations

What real-time Grafana visualization reveals about per-agent token consumption, such as the 12.8K-token Bedrock call

Why standard OpenTelemetry export enables token data to flow into Grafana, Datadog, or New Relic

How full agentic flow visibility into tool calls, traces, and latency supports cost and ROI accountability

0:02 - AI agents scale, visibility lags

As AI agents scale across planning, reasoning, and tool calling, token usage becomes harder to monitor and tie to business outcomes. Without visibility, token spend becomes a blind spot in the enterprise AI stack.

0:26 - Introducing token usage tracking

Jarvis Registry introduces LLM token usage tracking that captures consumption across agents, chat, and MCP context. Data is collected through standard OpenTelemetry for use in Grafana, Datadog, New Relic, or any OTEL-compatible platform.

0:51 - Automatic tracking on every agent

Every agent registered in Jarvis has its token usage tracked automatically regardless of how it's invoked, whether through chat, API, or workflow. The demo invokes a Bedrock spending agent as an example.
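
One way to picture tracking that is independent of the invocation path is a thin wrapper around the agent call that records usage under the agent name and the channel. This is a sketch under stated assumptions, not how Jarvis Registry is implemented: the tracked helper, the usage_log store, and fake_agent are hypothetical, and the one-token-per-word count is a stand-in for real model accounting.

```python
from collections import defaultdict
from typing import Callable

# In-memory stand-in for a telemetry backend: (agent, channel) -> total tokens.
usage_log: dict[tuple[str, str], int] = defaultdict(int)


def tracked(agent: str, channel: str,
            call: Callable[[str], tuple[str, int]]) -> Callable[[str], str]:
    """Wrap an agent call so its token usage is recorded on every invocation."""
    def wrapper(prompt: str) -> str:
        answer, tokens = call(prompt)
        usage_log[(agent, channel)] += tokens
        return answer
    return wrapper


def fake_agent(prompt: str) -> tuple[str, int]:
    """Hypothetical agent: echoes the prompt and reports one token per word."""
    return f"echo: {prompt}", len(prompt.split())


# The same agent reached through two different channels.
via_chat = tracked("demo-agent", "chat", fake_agent)
via_api = tracked("demo-agent", "api", fake_agent)

via_chat("how much did we spend")  # 5 words
via_api("weekly spend")            # 2 words
print(dict(usage_log))
```

Because the wrapper sits around the call itself, the caller does not have to opt in, which is the property the chapter describes.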

1:04 - Asking Jarvis in natural language

A user asks Jarvis, from within a tool like Claude, about this week's Amazon Bedrock spending in plain language. Jarvis understands the question, finds the right registered tool, and runs it to return an answer.

1:30 - Real-time token capture in Grafana

As the agent runs, Jarvis Registry captures its LLM token usage in real time. In Grafana, the Bedrock spending agent's call shows 12.8K tokens used, alongside a full record of prior agent usage.
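
The per-agent view described here amounts to summing token counts by agent and abbreviating large numbers. A minimal sketch of that aggregation follows. The call records are illustrative: the 12,800 figure mirrors the 12.8K call from the demo, while the other entries and the second agent name are invented. The format_tokens helper is hypothetical and is not a Grafana function.

```python
from collections import Counter


def format_tokens(n: int) -> str:
    """Abbreviate a token count the way dashboards commonly do."""
    if n >= 1_000_000:
        return f"{n / 1_000_000:.1f}M"
    if n >= 1_000:
        return f"{n / 1_000:.1f}K"
    return str(n)


# (agent, total tokens for one call); illustrative data only.
calls = [
    ("bedrock-spending-agent", 12_800),
    ("bedrock-spending-agent", 4_100),
    ("ticket-triage-agent", 950),
]

# Sum usage per agent, then list agents from highest to lowest consumption.
totals: Counter[str] = Counter()
for agent, tokens in calls:
    totals[agent] += tokens

for agent, tokens in totals.most_common():
    print(f"{agent}: {format_tokens(tokens)}")
```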

2:08 - Full agentic flow visibility

Jarvis Registry tracks more than tokens, covering tool calls, traces, metrics, authentication activity, latency, and execution status. This makes token usage visible, standardized, and measurable, helping teams understand cost and ROI and optimize agent workflows.
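
Turning token counts into a cost and ROI figure is simple arithmetic once prices are known. The sketch below assumes per-1K-token pricing with separate input and output rates. The prices, run count, and outcome count are placeholders chosen for illustration, not real provider pricing or demo results, and both function names are hypothetical.

```python
def estimate_cost(input_tokens: int, output_tokens: int,
                  usd_per_1k_input: float, usd_per_1k_output: float) -> float:
    """Estimate spend in USD from token counts and per-1K-token prices."""
    return ((input_tokens / 1000) * usd_per_1k_input
            + (output_tokens / 1000) * usd_per_1k_output)


def cost_per_outcome(total_cost: float, outcomes: int) -> float:
    """Spend divided by the number of completed business outcomes."""
    if outcomes <= 0:
        raise ValueError("outcomes must be positive")
    return total_cost / outcomes


# Placeholder prices in USD per 1K tokens (assumptions, not real pricing).
PRICE_IN = 0.002
PRICE_OUT = 0.01

# One illustrative call: 9,600 input + 3,200 output tokens.
call_cost = estimate_cost(9_600, 3_200, PRICE_IN, PRICE_OUT)

# Hypothetical week: 50 similar runs that produced 40 useful outcomes.
weekly_cost = call_cost * 50
per_outcome = cost_per_outcome(weekly_cost, 40)
print(round(call_cost, 4), round(weekly_cost, 2), round(per_outcome, 3))
```

Cost per outcome is only one possible ROI proxy; the right denominator depends on what the agent is meant to deliver.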

Topics

Governed AI Jarvis Registry, Jarvis Registry token usage tracking, LLM token usage monitoring, enterprise AI observability, AI agent cost visibility, OpenTelemetry LLM token tracking, Grafana AI agent dashboard, AI agent ROI measurement, track token usage across AI agents, OTEL compatible AI observability, Datadog New Relic LLM monitoring