adk-observability-guide
MUST READ before setting up observability for ADK agents or when analyzing production traffic, debugging agent behavior, or improving agent performance. ADK observability guide — Cloud Trace, prompt-response logging, BigQuery Agent Analytics, third-party integrations, and troubleshooting. Use when configuring monitoring, tracing, or logging for agents, or when understanding how a deployed agent handles real traffic.
ADK Observability Guide
Scaffolded project? Cloud Trace and prompt-response logging are pre-configured by Terraform. See `references/cloud-trace-and-logging.md` for infrastructure details, env vars, and verification commands.
No scaffold? Follow the ADK docs links below for manual setup. For production infrastructure, scaffold with `/adk-scaffold`.
Reference Files
| File | Contents |
|---|---|
| `references/cloud-trace-and-logging.md` | Scaffolded project details: Terraform-provisioned resources, environment variables, verification commands, enabling/disabling locally |
| `references/third-party.md` | Third-party integration setup patterns, trade-offs, and ADK docs links for each provider |
Observability Tiers
Choose the right level of observability based on your needs:
| Tier | What It Does | Scope | Default State | Best For |
|---|---|---|---|---|
| Cloud Trace | Distributed tracing — execution flow, latency, errors via OpenTelemetry spans | All templates, all environments | Always enabled | Debugging latency, understanding agent execution flow |
| Prompt-Response Logging | GenAI interactions exported to GCS, BigQuery, and Cloud Logging | ADK agents only | Disabled locally, enabled when deployed | Auditing LLM interactions, compliance |
| BigQuery Agent Analytics | Structured agent events (LLM calls, tool use, outcomes) to BigQuery | ADK agents with plugin enabled | Opt-in ( | Conversational analytics, custom dashboards, LLM-as-judge evals |
| Third-Party Integrations | External observability platforms (AgentOps, Phoenix, MLflow, etc.) | Any ADK agent | Opt-in, per-provider setup | Team collaboration, specialized visualization, prompt management |
Ask the user which tier(s) they need — they can be combined. Cloud Trace is always on; the others are additive.
Cloud Trace
ADK uses OpenTelemetry to emit distributed traces. Every agent invocation produces spans that track the full execution flow.
Span Hierarchy
invocation
└── agent_run (one per agent in the chain)
    ├── call_llm (model request/response)
    └── execute_tool (tool execution)
Setup by Deployment Type
| Deployment | Setup |
|---|---|
| Agent Engine | Automatic — traces are exported to Cloud Trace by default |
| Cloud Run (scaffolded) | Automatic — |
| Cloud Run (manual) | Configure OpenTelemetry exporter in your app |
| Local dev | Works with |
View traces: Cloud Console → Trace → Trace explorer
For detailed setup instructions (Agent Engine CLI/SDK, Cloud Run, custom deployments), fetch the ADK docs:
WebFetch: https://google.github.io/adk-docs/integrations/cloud-trace/index.md
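As a rough illustration of how the span hierarchy above can be read, the sketch below rebuilds the tree from flat span records and picks the slowest leaf span. The record fields (`id`, `parent`, `name`, `ms`) and the durations are hypothetical stand-ins, not ADK's or Cloud Trace's actual export schema; only the span names come from this guide.

```python
from collections import defaultdict

# Hypothetical flat span records. Field names and durations are
# illustrative only, not the real Cloud Trace export format.
SPANS = [
    {"id": "1", "parent": None, "name": "invocation", "ms": 1200},
    {"id": "2", "parent": "1", "name": "agent_run", "ms": 1150},
    {"id": "3", "parent": "2", "name": "call_llm", "ms": 800},
    {"id": "4", "parent": "2", "name": "execute_tool", "ms": 300},
]


def render_tree(spans):
    """Return indented lines, one per span, parents before children."""
    children = defaultdict(list)
    for span in spans:
        children[span["parent"]].append(span)

    lines = []

    def walk(parent_id, depth):
        for span in children[parent_id]:
            lines.append(f"{'  ' * depth}{span['name']} ({span['ms']} ms)")
            walk(span["id"], depth + 1)

    walk(None, 0)
    return lines


def slowest_leaf(spans):
    """Return the span with no children that has the longest duration."""
    parent_ids = {span["parent"] for span in spans}
    leaves = [span for span in spans if span["id"] not in parent_ids]
    return max(leaves, key=lambda span: span["ms"])


if __name__ == "__main__":
    print("\n".join(render_tree(SPANS)))
    print("slowest leaf:", slowest_leaf(SPANS)["name"])
```

Looking at leaf spans first is one reasonable way to attribute latency, since `invocation` and `agent_run` durations include the time of everything nested under them.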
Prompt-Response Logging
Captures GenAI interactions (model name, tokens, timing) and exports to GCS (JSONL), BigQuery (external tables), and Cloud Logging (dedicated bucket).
Privacy Modes
Prompt-response logging is privacy-preserving by default: only metadata is logged. Controlled by `OTEL_INSTRUMENTATION_GENAI_CAPTURE_MESSAGE_CONTENT`:
| Value | Behavior |
|---|---|
| `false` | Logging disabled |
| `NO_CONTENT` | Enabled, metadata only: tokens, model name, timing (default in deployed environments) |
| `true` | Enabled with full prompt/response content (not recommended for production) |
For Agent Engine: the platform requires `true` during deployment, but the app overrides to `NO_CONTENT` at runtime.
Behavior by Environment
| Environment | Prompt-Response Logging | Why |
|---|---|---|
| Local dev ( | Disabled | No |
| Dev (Terraform deployed) | Enabled ( | Terraform sets env vars |
| Staging / Production | Enabled ( | Terraform sets env vars |
To enable locally, set `LOGS_BUCKET_NAME` and `OTEL_INSTRUMENTATION_GENAI_CAPTURE_MESSAGE_CONTENT=NO_CONTENT` before running `make playground`.
To disable in a deployed environment, set `OTEL_INSTRUMENTATION_GENAI_CAPTURE_MESSAGE_CONTENT=false` in `deployment/terraform/service.tf` and re-apply.
For scaffolded project infrastructure details (Terraform resources, env vars, verification), see `references/cloud-trace-and-logging.md`.
For ADK logging docs (log levels, configuration, debugging):
WebFetch: https://google.github.io/adk-docs/observability/logging/index.md
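The privacy modes described in this section can be sketched as a small classifier over the environment. This is a minimal sketch under stated assumptions: that `false` (or an unset variable) disables logging, `NO_CONTENT` means metadata only, and `true` means full content capture. The function name and the returned labels are hypothetical, and the real instrumentation may parse values differently.

```python
import os

ENV_VAR = "OTEL_INSTRUMENTATION_GENAI_CAPTURE_MESSAGE_CONTENT"


def logging_mode(env=None):
    """Classify the prompt-response logging mode from an env mapping.

    Assumption: an unset or unrecognized value is treated as disabled,
    matching the local-dev default described in this guide.
    The returned labels are illustrative, not names ADK uses.
    """
    env = os.environ if env is None else env
    raw = env.get(ENV_VAR, "").strip()
    if raw.upper() == "NO_CONTENT":
        return "metadata_only"
    if raw.lower() == "true":
        return "full_content"
    return "disabled"


if __name__ == "__main__":
    print(f"{ENV_VAR} -> {logging_mode()}")
```

A check like this could run at startup to warn when a production service resolves to `full_content`, which this guide advises against.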
BigQuery Agent Analytics Plugin
An optional plugin that logs structured agent events directly to BigQuery via the Storage Write API. Enables:
- Conversational analytics — session flows, user interaction patterns
- LLM-as-judge evals — structured data for evaluation pipelines
- Custom dashboards — Looker Studio integration
- Tool provenance tracking — LOCAL, MCP, SUB_AGENT, A2A, TRANSFER_AGENT
Enabling
| Method | How |
|---|---|
| At scaffold time | |
| Post-scaffold | Add the plugin manually to |
Infrastructure (BigQuery dataset, GCS offloading) is provisioned automatically by Terraform when enabled at scaffold time.
Key Features
- Auto-schema upgrade (new fields added without migration)
- GCS offloading for multimodal content (images, audio)
- Distributed tracing via OpenTelemetry span context
- SQL-queryable event log for all agent interactions
For full schema, SQL query examples, and Looker Studio setup:
WebFetch: https://google.github.io/adk-docs/integrations/bigquery-agent-analytics/index.md
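To make the tool provenance categories listed above more concrete, here is a hedged sketch that tallies events by provenance. The five category names come from this guide; the event rows, their field names (`tool`, `provenance`), and the helper are hypothetical and do not reflect the plugin's real BigQuery schema, which is documented at the link above.

```python
from collections import Counter
from enum import Enum


class ToolProvenance(Enum):
    """Provenance categories named in this guide."""

    LOCAL = "LOCAL"
    MCP = "MCP"
    SUB_AGENT = "SUB_AGENT"
    A2A = "A2A"
    TRANSFER_AGENT = "TRANSFER_AGENT"


# Hypothetical rows, as if already fetched from the analytics table.
# Tool names and field names are made up for illustration.
EVENTS = [
    {"tool": "search_docs", "provenance": "MCP"},
    {"tool": "get_weather", "provenance": "LOCAL"},
    {"tool": "billing_agent", "provenance": "SUB_AGENT"},
    {"tool": "get_weather", "provenance": "LOCAL"},
]


def provenance_counts(events):
    """Count events per provenance, including categories with zero events."""
    counts = Counter(ToolProvenance(event["provenance"]) for event in events)
    return {p.value: counts.get(p, 0) for p in ToolProvenance}


if __name__ == "__main__":
    for name, count in provenance_counts(EVENTS).items():
        print(f"{name}: {count}")
```

Parsing through the enum means an unexpected provenance string raises a `ValueError` instead of being silently counted.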
Third-Party Integrations
ADK supports six third-party observability platforms. Each uses OpenTelemetry or custom instrumentation to capture agent behavior.
| Platform | Key Differentiator | Setup Complexity | Self-Hosted Option |
|---|---|---|---|
| AgentOps | Session replays, 2-line setup, replaces native telemetry | Minimal | No (SaaS) |
| Phoenix | Open-source, custom evaluators, experiment testing | Low | Yes |
| MLflow | OTel traces to MLflow Tracking Server, span tree visualization | Medium (needs SQL backend) | Yes |
| Monocle | 1-call setup, VS Code Gantt chart visualizer | Minimal | Yes (local files) |
| Weave | W&B platform, team collaboration, timeline views | Low | No (SaaS) |
| Freeplay | Prompt management + evals + observability in one platform | Low | No (SaaS) |
Ask the user which platform they prefer: present the trade-offs and let them choose. For setup details on each, see `references/third-party.md`.
Troubleshooting
| Issue | Solution |
|---|---|
| No traces in Cloud Trace | Verify |
| Prompt-response data not appearing | Check |
| Privacy mode misconfigured | Check |
| BigQuery Analytics not logging | Verify plugin is configured in |
| Third-party integration not capturing spans | Check provider-specific env vars (API keys, endpoints); some providers (AgentOps) replace native telemetry |
| Traces missing tool spans | Tool execution spans appear under |
| High telemetry costs | Switch to |
Deep Dive: ADK Docs (WebFetch URLs)
For detailed documentation beyond what this skill covers, fetch these pages:
| Topic | URL |
|---|---|
| Observability overview | |
| Agent activity logging | https://google.github.io/adk-docs/observability/logging/index.md |
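| Cloud Trace integration | https://google.github.io/adk-docs/integrations/cloud-trace/index.md |
| BigQuery Agent Analytics | https://google.github.io/adk-docs/integrations/bigquery-agent-analytics/index.md |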
| AgentOps | |
| Phoenix (Arize) | |
| MLflow tracing | |
| Monocle | |
| W&B Weave | |
| Freeplay | |