signoz-creating-dashboards
Create a new SigNoz dashboard from a natural-language intent — import a curated template (PostgreSQL, Redis, JVM, k8s, hostmetrics, APM, LLM, etc.) when one fits, or build a custom dashboard from scratch with metric / trace / log panels. Make sure to use this skill whenever the user says "create a dashboard for…", "set up monitoring for…", "build me a dashboard…", "I need observability for…", "import a dashboard template", or asks to track / visualize a service, database, cluster, or AI/LLM platform — even if they don't explicitly say "dashboard". Also use it when someone wants to "monitor", "watch", or "see metrics for" a technology and the natural answer is a dashboard.
Skill body
Dashboard Create
Build a SigNoz dashboard from a user’s natural-language intent. The skill targets two consumers: an autonomous AI SRE agent that runs without a human in the loop, and a human at a Claude Code / Codex / Cursor prompt. Both go through the same flow.
Prerequisites
This skill calls SigNoz MCP server tools (signoz:signoz_create_dashboard,
signoz:signoz_list_dashboards, signoz:signoz_list_dashboard_templates,
signoz:signoz_import_dashboard, signoz:signoz_get_dashboard,
signoz:signoz_update_dashboard, signoz:signoz_list_metrics,
signoz:signoz_get_field_keys, signoz:signoz_get_field_values,
signoz:signoz_aggregate_logs, signoz:signoz_aggregate_traces, etc.).
Before running the workflow, confirm the signoz:signoz_* tools are
available. If they are not, the SigNoz MCP server is not installed or
configured — run signoz-mcp-setup first to initialize or repair the MCP
connection. Do not fall back to raw HTTP calls or fabricate dashboard JSON
without the MCP tools.
When to use
Use this skill when the user wants to:
- Create, set up, or build a new dashboard.
- “Monitor” or “set up observability” for a service, database, infrastructure component, or AI/LLM platform.
- Import a curated dashboard template.
- Visualize a set of metrics / traces / logs together on one screen.
Do NOT use when the user wants to:
- Modify an existing dashboard →
signoz-modifying-dashboards. - Understand what an existing dashboard shows →
signoz-explaining-dashboards. - Run a one-off query without persisting it →
signoz-generating-queries.
Required inputs (strict)
Dashboard creation is a write operation. Guessing here clutters the shared workspace with empty or wrongly-scoped dashboards someone else has to clean up. The skill enforces a soft input contract — most fields have sensible defaults, but a few cannot be guessed:
| Input | Required | Source if missing |
|---|---|---|
| Dashboard intent (NL goal) | yes | $ARGUMENTS or recent user turn |
| Technology / domain (e.g. PostgreSQL, Redis, “payment pipeline”) | yes | parse from intent; otherwise ask |
| Modify-or-create choice when duplicates exist | yes | ask the user (Step 2) |
| Resource scope for custom builds (service / namespace / cluster) | yes for custom builds | discover via signoz:signoz_get_field_keys + signoz:signoz_get_field_values; fall back to a dashboard variable |
| Specific metrics / signals for custom builds | inferred | derive from technology + MCP signoz://dashboard/* resources; surface in preview |
| Default time range, refresh, layout | inferred | apply defaults (see “Defaults” below) |
If a required input is missing and cannot be discovered, stop before
calling any write tool and ask the user. The host application decides
how the question is surfaced (a structured clarification tool, inline
<assistant_question> tags, an interactive prompt, etc.) — follow the
host’s UI rendering rules.
What to include in the question:
- What is missing — name the input concretely (e.g. “no service or cluster specified for the custom build”).
- Candidate lists populated from your discovery calls — concrete
values per attribute the user can pick from. Example shape:
service.name→frontend,checkout,payments,inventory;k8s.cluster.name→prod-us-east-1,staging. - Allow free-form input so the user can name a value you didn’t surface.
In autonomous mode (no human), escalate to the caller or fill the gap
from upstream context. Either way, do not proceed to
signoz:signoz_create_dashboard / signoz:signoz_import_dashboard with
a guessed value.
Workflow
The flow runs in order: duplicate check → user picks modify-or-create
→ on create, template lookup decides template-import vs custom-build →
no-data probe → per-panel dry-run → preview → save. Duplicate check
comes first so we never silently create a second copy of something that
already exists. Once the user has chosen to create, the template lookup
is an internal implementation detail — if a curated template fits we
use it, otherwise we build from scratch. The per-panel dry-run
(signoz:signoz_execute_builder_query against every query-bearing
panel) is mandatory before save — a saved empty panel from a typo’d
attribute or wrong severity filter is the worst failure mode for this
skill, and dry-run is the only step that catches it. The user is
offered exactly two upfront choices: modify an existing dashboard, or
create a new one.
Step 1: Check for duplicates
Call signoz:signoz_list_dashboards. Most installs fit in the default
page (limit=50); only paginate when pagination.hasMore=true. Use
string values for limit / offset (e.g. "50", "0") — the schema
expects strings, not integers.
Match by relevance Compare each existing
dashboard’s lowercased name, description, and tags against the
user’s technology/domain. Surface only matches a human would recognize
as the same thing — a “redis” dashboard does not match a “postgresql”
request just because both have a database tag. Collect each match’s
name, uuid, and createdAt for the next step.
Step 2: Ask the user — modify or create
Present exactly two options (no template-import as a separate top-level choice — that’s an internal decision in Step 3b):
- Duplicates found: “There are already these similar dashboards: [list with name, UUID, created-at]. Want me to (a) modify one of these, (b) create a new dashboard anyway, or (c) stop?”
- No duplicates: “I’ll create a new dashboard for this. Proceed?” (No “modify” option when there’s nothing to modify.)
Wait for the user’s choice. “modify” → Step 3a. “create new” / confirm → Step 3b. “stop” → stop.
Step 3: Create or modify
Step 3a: Modify an existing dashboard
Hand off immediately to signoz-modifying-dashboards with the chosen
dashboard UUID and the user’s intent. Do not call
signoz:signoz_update_dashboard from this skill — modification is out
of scope. (See “Scope boundary” in Guardrails.)
Step 3b: Create a new dashboard
Run the template lookup first. The user has already agreed to create new — the lookup decides how we build it.
Call signoz:signoz_list_dashboard_templates once with no arguments.
The full catalog (~95 entries) returns in a single call — read it
in-context and pick the best match for the user’s intent. When several
entries plausibly fit, present the top 3–5 and let the user choose.
Branch on the result:
- Single clear template match — proceed to Step 3b-i (template import). Briefly tell the user “I found a pre-built [title] template and will use it” so they know what’s being created; do not block on yes/no.
- Multiple plausible matches — present them and ask the user to pick. Once picked, proceed to Step 3b-i.
- No template — proceed to Step 3b-ii (custom build).
Step 3b-i: Import the template
Tool guardrail The only template tools are
signoz:signoz_list_dashboard_templatesandsignoz:signoz_import_dashboard. Do not shell out, fetch raw GitHub URLs, or invent other tool names.signoz:signoz_import_dashboardtakes the templatepathfrom the catalog entry and creates the dashboard in one call — you do not need to fetch the JSON yourself or callsignoz:signoz_create_dashboardafterwards.
Step 3b-i.1: Pre-flight no-data probe (fail fast)
Before calling signoz:signoz_import_dashboard, confirm the template’s
signals are actually being ingested. The most common silent failure for
template imports is “the template imports cleanly but every panel reads
‘No data’ because the technology isn’t being scraped” — the user only
discovers it after clicking through to a useless dashboard.
Since we don’t fetch the template body up front, base the probe on the
catalog entry’s category, title, and keywords plus the user’s
stated technology. Pick up to ~5 representative signals and check them
— keep the total small:
- Metric-based templates (most infra/runtime templates): call
signoz:signoz_list_metricswithsearchTextset to the technology prefix (e.g.searchText="postgresql"). Empty result → metric family is not being ingested. Early out: if this returns empty, declare “None present” and skip the rest of the metric probes — they will all return zero. Note:signoz:signoz_list_metricshas notimeRangeparameter; passstart/end(unix-ms strings) only if you need a window other than the server default. - Trace-based templates (APM-style): call
signoz:signoz_aggregate_traceswithaggregation=count,timeRange=1h. No filter is needed for the “is anything flowing” probe — addingfilter="service.name EXISTS"is fragile and unnecessary. Zero count → no traces flowing. - Log-based templates: call
signoz:signoz_aggregate_logswithaggregation=count,timeRange=1h, no filter. Zero count → no logs. - Variable values (when the template clearly relies on a resource
attribute, e.g.
service.name,k8s.cluster.name): callsignoz:signoz_get_field_valuesto confirm there are values to pick from. A dashboard whose top-level dropdown is empty is barely better than one full of empty panels.
Branch on the probe result:
- All signals present → proceed silently to Step 3b-i.2.
- Some present, some missing → list which are missing and ask the user to confirm before continuing. Many templates are useful even with partial coverage; let them decide.
- None present → tell the user no data was found for this technology in the probe window, explain the dashboard will show “No data” until ingestion is set up, and offer to create it anyway or stop. Wait for the user’s choice.
This probe is cheap (a handful of queries, ~hundreds of ms total), and catching the no-data case early avoids the worst UX failure mode of the template path.
Step 3b-i.2: Preview, import, report
- Preview Tell the user what’s about to happen in one short
paragraph: which template (
title,path), what category, what the probe found. In autonomous mode the consumer proceeds; in interactive mode the human can intervene. - Import Call
signoz:signoz_import_dashboardwith thepathfrom the chosen catalog entry (e.g.postgresql/postgresql.json). The server fetches the JSON, validates it, and creates the dashboard in one call. - Report Read the response and tell the user the dashboard’s
title, panel count, and section breakdown. Surface the dashboard’s
variables (“filter by
service.name”, “filter byk8s.cluster.name”) so the user knows what knobs they have. Offer two follow-ups: “Want me to adjust panels, layout, or variables?” and “Want me to wire alerts for any of these signals? (signoz-creating-alerts)”. - Customization handling If the user asks for any change to the
imported dashboard, hand off to
signoz-modifying-dashboardswith the new dashboard’s UUID and the requested changes. Do not callsignoz:signoz_update_dashboardfrom this skill.
Step 3b-ii: Custom build (no template, or import failed)
Run this path when the Step 3b template lookup found no match, the user
explicitly rejected the suggested template, or
signoz:signoz_import_dashboard failed.
Step 3b-ii.1: Gather requirements
Ask the user (skip questions whose answer is already clear from intent):
- Signals — metrics, traces, logs, or a combination.
- Specific signals — which metrics, which span attributes, which log severities matter most.
- Resource scope — which service(s), namespace(s), cluster(s), or environment(s).
- Variables — what should be a dropdown vs. a hard-coded filter
(typical:
service.name,deployment.environment.name,k8s.cluster.name). - Sections — group panels into Overview / Latency / Errors / Saturation, or another structure that fits the domain.
If the user is non-specific (“just make me something useful for X”), apply the defaults table below and surface them in the preview.
Step 3b-ii.2: Discover names and probe data
The MCP guideline applies: always prefer resource-attribute filters. Before authoring panels, confirm the names you’ll use exist and emit data:
- Metrics — call
signoz:signoz_list_metricswithsearchTexttied to the technology (e.g.searchText="postgresql") to get the exact OTel metric names. Catalog presence ≠ data flowing — for any metric you intend to use, follow up withsignoz:signoz_query_metricson a representative window to confirm it actually has datapoints. - Resource attributes — call
signoz:signoz_get_field_keyswithfieldContext=resourcefor the relevant signal to enumerate available attributes; callsignoz:signoz_get_field_valueson the ones you’ll use as variables to confirm concrete values exist. Note that the live data may use older OTel semconv (e.g.deployment.environmentrather thandeployment.environment.name) — always trust the discovered key over the one in the defaults table.
Per-panel validation happens later as a hard requirement — see Step 3b-ii.6 and the “Mandatory dry-run before save” guardrail.
If none of the discovered signals return data, tell the user the dashboard’s data isn’t being ingested yet, explain the panels will show “No data” until ingestion is set up, and offer to build anyway or stop. Wait for the user’s choice before building.
Step 3b-ii.3: Read the dashboard MCP resources
These are the source of truth for the JSON schema, panel types, query builder shape, and layout rules — do not transcribe schema text into this skill, it will rot out of sync with the server. Read the four core resources before authoring widget JSON.
Fallback when the MCP resource-reader is unavailable Some MCP client harnesses do not expose a resource-reading tool. If you cannot read
signoz://...URIs in this session, fall back tosignoz:signoz_list_dashboards+signoz:signoz_get_dashboardon an existing dashboard of the same signal type (metrics / traces / logs) and read itswidgetsarray for v5 widget shapes. The mandatory dry-run in Step 3b-ii.6 then backstops any shape errors the fallback misses.
signoz://dashboard/instructions— title, tags, description, variables.signoz://dashboard/widgets-instructions— 7 panel types and layout rules.signoz://dashboard/widgets-examples— complete widget configs with all required fields (the most important resource — every widget must includeid,panelTypes,title,query,selectedLogFields,selectedTracesFields,thresholds,contextLinks).signoz://dashboard/query-builder-example— query builder reference, including operator semantics forfilters.items[].op.
Add signal-specific resources as needed:
- Metrics (PromQL):
signoz://promql/instructions. - Metrics (ClickHouse):
signoz://dashboard/clickhouse-schema-for-metricssignoz://dashboard/clickhouse-metrics-example.
- Metrics (Query Builder aggregation rules):
signoz://metrics-aggregation-guide— required for picking validtimeAggregation/spaceAggregationper metric type. - Traces (Query Builder):
signoz://traces/query-builder-guide. - Traces (ClickHouse):
signoz://dashboard/clickhouse-schema-for-tracessignoz://dashboard/clickhouse-traces-example.
- Logs (ClickHouse):
signoz://dashboard/clickhouse-schema-for-logssignoz://dashboard/clickhouse-logs-example.
Step 3b-ii.4: Build the dashboard JSON
Follow the v5 schema as documented in the resources above. Use OTel semantic attribute names (not shorthand) in filters, groupBy, and variables. Apply the defaults below unless the user specified otherwise.
All panels are validated in Step 3b-ii.6 via the mandatory dry-run
before save. Author the JSON here as you intend to save it — the
dry-run uses the exact shape from queryData.
Defaults the skill applies (and surfaces in the preview):
| Field | Default | When to override |
|---|---|---|
| Time range | last 1h | longer for capacity planning, shorter for live debugging |
| Refresh | manual (no auto-refresh) | set 30s–1m only when the user explicitly wants live updates |
| Section structure (APM/services) | Overview / Latency / Errors / Throughput | domain-specific (e.g. DB: Overview / Connections / Throughput / Slow Queries) |
| Section structure (infra/runtime) | Overview / Saturation / Errors / Latency | domain-specific |
| Headline panels (services) | request rate, error rate, p50/p95/p99 latency, throughput | omit those that don’t apply |
| Headline panels (infra) | resource utilization (CPU, mem), saturation, error/restart counts, throughput | tailor to the technology |
| Variables (services) | service.name, deployment.environment (or deployment.environment.name — verify which exists via signoz:signoz_get_field_keys) |
add k8s.cluster.name / k8s.namespace.name when k8s-flavored |
| Variables (k8s/infra) | k8s.cluster.name, k8s.namespace.name (or host.name for hostmetrics) |
drop service.name — it is rarely populated on infra signals |
| Layout | 2-column grid (w: 6), 12 columns wide |
full-width (w: 12) for tables and time-series with many series |
| GroupBy on per-service panels | service.name resource attribute |
drop when filtering to a single service |
Title and description The dashboard title should name the technology and the scope clearly: “PostgreSQL — prod-us-east-1”, not just “PostgreSQL”. Description should answer “what is this for” in one sentence. Tags: technology + signal types + environment when known.
Step 3b-ii.5: Shape check before save
signoz://dashboard/widgets-examples is the source of truth for widget
required fields, panel-type-specific shapes, the canonical
filters.items[].key.id form, operator casing, and common write-shape
errors. Re-skim it before serialising any custom widget JSON.
One rule widgets-examples does not call out, but
signoz:signoz_create_dashboard enforces: no JSON.stringify on
arrays/objects layout, widgets, tags, and variables are
native JSON — stringifying them produces errors like
cannot unmarshal string into ... layout of type []LayoutItem.
Step 3b-ii.6: Dry-run before save (mandatory)
Call signoz:signoz_execute_builder_query per panel. The dry-run
validates the query is well-formed and confirms data flows under
that filter — the per-panel data probe folds in here.
Envelope translation Widget JSON wraps queries in
compositeQuery.builder.queryData[] and queryFormulas[], but
signoz:signoz_execute_builder_query takes
compositeQuery.queries[].{type, spec}. Translate per panel: each
queryData[i] → { type: "builder_query", spec: { name, signal,
filter: {expression}, groupBy, aggregations } }; each
queryFormulas[i] → { type: "builder_formula", spec: { name,
expression } }. Preserve the original name (A, B, …) on
every builder_query.spec — formula expressions reference inputs
by that name (e.g., A * 100 / B), and dropping it makes the
dry-run shape diverge from the saved panel so formulas can’t
resolve their inputs. The endpoint cannot consume widget JSON
directly.
Non-empty response = pass; server error, “filter type mismatch”, or unexpected zero rows = fail (fix the panel JSON before save).
Coverage: dry-run every query-bearing panel, regardless of count
or shape. Trivial panels fail silently too (wrong severity filter,
wrong resource scope, attribute name shorthand like service instead
of service.name) — the same footguns that bite non-trivial panels.
Row / header panels (panelTypes: "row") have no query to execute —
validate their shape against signoz://dashboard/widgets-examples
instead and skip them here.
Step 3b-ii.7: Preview, save, report
-
Preview Emit a one-paragraph plain-language summary of what will be created — no JSON dump. A 20–30 widget payload is hundreds of lines the user cannot meaningfully review in chat, and the dry-run has already validated every query-bearing panel against live data.
Summary: This dashboard tracks [signals] for [scope], with sections [list]. Variables: [list]. Time range default 1h. Dry-run: all [N] query-bearing panels validated against live data (any failures have been fixed pre-save).
-
Save Call
signoz:signoz_create_dashboardwith the payload. -
Report Tell the user:
- The created dashboard’s UUID and title.
- Panel count and section breakdown.
- Which variables are wired.
- Two follow-up offers: “Want me to adjust panels, layout, or
variables?” and “Want me to wire alerts for any of these signals?
(
signoz-creating-alerts)”.
Guardrails
- Strict inputs over guessing Resource scope is required for custom builds. If missing, stop and ask the user (see Required inputs above). A guessed scope on a shared dashboard is harder to clean up than asking.
- Always paginate
signoz:signoz_list_dashboardsStopping at page 1 misses duplicates and produces clutter. - Duplicate check first The user’s only two upfront options are “modify an existing one” or “create a new one” — never offer template-import as a separate top-level choice.
- Template-first on the create path Once the user has chosen to
create, always run
signoz:signoz_list_dashboard_templatesbefore anysignoz:signoz_create_dashboardcall. If a matching template exists, import it viasignoz:signoz_import_dashboard(just inform the user); only build from scratch when no template matches. - No-data probe is mandatory before save Run the pre-flight probe
(Step 3b-i.1 / Step 3b-ii.2) before
signoz:signoz_import_dashboard/signoz:signoz_create_dashboard. A “No data” dashboard is a worse outcome than one extra confirmation prompt. Skip only if the user has explicitly opted out for this request. - Mandatory dry-run before save on custom builds Before
signoz:signoz_create_dashboard, runsignoz:signoz_execute_builder_queryper Step 3b-ii.6 — translate each panel’sbuilder.queryData[]/queryFormulas[]into the endpoint’squeries[].{type, spec}envelope (mapping in Step 3b-ii.6). Skipping is equivalent to skipping the duplicate check. The create-dashboard schema accepts queries that 500 at query time — agroupByon a numeric attribute, an aggregation incompatible with the metric type — and the result ships as a silently empty panel. - Preview before save on custom builds Emit the plain-language
summary before
signoz:signoz_create_dashboardso the human can intervene on intent. - Prefer OTel attribute names
service.namenotservice,host.namenothost. Wrong names produce empty panels. Verify the exact key (deployment.environmentvsdeployment.environment.name, for instance) againstsignoz:signoz_get_field_keysrather than guessing — installs running classic OTel semconv emit the no-.nameform. - No metric guessing For custom builds, verify metric names with
signoz:signoz_list_metricsbefore authoring. Wrong names produce empty panels and the user only finds out later. - Valid JSON shapes only Follow the v5 schema documented in
signoz://dashboard/*MCP resources. Required widget andqueryDatafields are listed insignoz://dashboard/widgets-instructionsandsignoz://dashboard/widgets-examples. Never wrap arrays/objects inJSON.stringify. - Scope boundary This skill creates dashboards. The moment the
user asks to modify, edit, rearrange, or extend an existing dashboard
— including immediately after import — hand off to
signoz-modifying-dashboards. Do not callsignoz:signoz_update_dashboardfrom this skill.
Examples
Four canonical flows — template happy path, template choice, duplicate
found, custom build — live in references/examples.md.