Config reference: tracing
-
Enterprise tuning surface
Defaults + constraints are rendered directly from Pydantic.
-
Env keys when available
Many fields have an env-style alias (from
TriBridConfig.to_flat_dict()). -
Tooltip-level guidance
If a matching glossary entry exists, you’ll see deeper tuning notes.
Config reference Config API & workflow Glossary
Total parameters: 17
Group index
(root)
(root)
| JSON key | Env key(s) | Type | Default | Constraints | Summary |
|---|---|---|---|---|---|
tracing.alert_include_resolved | ALERT_INCLUDE_RESOLVED | int | 1 | ≥ 0, ≤ 1 | Include resolved alerts |
tracing.alert_notify_severities | ALERT_NOTIFY_SEVERITIES | str | "critical,warning" | — | Alert severities to notify |
tracing.alert_webhook_timeout | ALERT_WEBHOOK_TIMEOUT | int | 5 | ≥ 1, ≤ 30 | Alert webhook timeout (seconds) |
tracing.langchain_endpoint | LANGCHAIN_ENDPOINT | str | "https://api.smith.langchain.com" | — | LangChain/LangSmith API endpoint |
tracing.langchain_project | LANGCHAIN_PROJECT | str | "tribrid" | — | LangChain project name |
tracing.langchain_tracing_v2 | LANGCHAIN_TRACING_V2 | int | 0 | ≥ 0, ≤ 1 | Enable LangChain v2 tracing |
tracing.langtrace_api_host | LANGTRACE_API_HOST | str | "" | — | LangTrace API host |
tracing.langtrace_project_id | LANGTRACE_PROJECT_ID | str | "" | — | LangTrace project ID |
tracing.log_level | LOG_LEVEL | str | "INFO" | pattern=^(DEBUG|INFO|WARNING|ERROR)$ | Logging level |
tracing.metrics_enabled | METRICS_ENABLED | int | 1 | ≥ 0, ≤ 1 | Enable metrics collection |
tracing.prometheus_port | PROMETHEUS_PORT | int | 9090 | ≥ 1024, ≤ 65535 | Prometheus metrics port |
tracing.trace_auto_ls | TRACE_AUTO_LS | int | 1 | ≥ 0, ≤ 1 | Auto-enable LangSmith tracing |
tracing.trace_retention | TRACE_RETENTION | int | 50 | ≥ 10, ≤ 500 | Number of traces to retain |
tracing.trace_sampling_rate | TRACE_SAMPLING_RATE | float | 1.0 | ≥ 0.0, ≤ 1.0 | Trace sampling rate (0.0-1.0) |
tracing.tracing_enabled | TRACING_ENABLED | int | 1 | ≥ 0, ≤ 1 | Enable distributed tracing |
tracing.tracing_mode | TRACING_MODE | str | "langsmith" | pattern=^(langsmith|local|none|off)$ | Tracing backend mode |
tracing.tribrid_log_path | TRIBRID_LOG_PATH | str | "data/logs/queries.jsonl" | — | Query log file path |
Details (glossary)
tracing.alert_include_resolved (ALERT_INCLUDE_RESOLVED) — Alert Include Resolved
Category: general
Controls whether resolved events are included in outbound alert notifications. Set to on (1) to receive recovery/resolution notices alongside active alerts, or off (0) to only notify on active issues.
tracing.alert_notify_severities (ALERT_NOTIFY_SEVERITIES) — Alert Notify Severities
Category: general
Comma-separated severity levels that are allowed to trigger outbound alert notifications (for example: critical,warning). This acts as the final routing filter after alert generation, so misconfigured values can silently suppress notifications you expected to see.
Use this to align operational noise with team on-call policy: start narrow (critical only), then expand once signal quality is stable. Keep values synchronized with the severity vocabulary emitted by your tracing/alert pipeline.
- Typical progression: critical -> critical,warning -> critical,warning,info
- Affects: webhook fan-out volume, pager noise, incident MTTA
tracing.alert_webhook_timeout (ALERT_WEBHOOK_TIMEOUT) — Alert Webhook Timeout
Category: general
Maximum time in seconds to wait for alert webhook delivery before the request is treated as failed. This timeout protects retrieval/tracing workflows from hanging on slow third-party incident tooling.
Set this high enough for normal network jitter, but low enough to avoid backlog growth during outages. If notifications are frequently timing out, fix endpoint reliability before increasing this value aggressively.
- Lower timeout: faster failure detection, more dropped notifications under transient latency
- Higher timeout: better delivery chance, slower failure feedback
Links: - Webhook Timeouts - Slack Webhooks
tracing.langchain_endpoint (LANGCHAIN_ENDPOINT) — LangChain Endpoint
Category: general
Base endpoint used for LangSmith/LangChain trace ingestion. This controls where trace payloads are sent when tracing integration is enabled.
Override this for self-hosted gateways, enterprise proxies, or region-specific endpoints. Keep endpoint, project name, and API key aligned or traces will fail silently or fragment across projects.
- Default commonly points to api.smith.langchain.com
- Verify network egress and TLS policy in production
Links: - LangSmith API
tracing.langchain_project (LANGCHAIN_PROJECT) — LangChain Project
Category: general
Logical project/container name used when publishing traces to LangSmith. It determines where runs appear in dashboards and how teams segment environments (dev/staging/prod) or product areas.
Use a stable naming convention so trend analysis remains coherent over time. Avoid ad-hoc names that split observability history across many near-duplicate projects.
- Example convention: ragweld-dev, ragweld-staging, ragweld-prod
- Affects: trace discoverability, dashboard hygiene
Links: - LangSmith Projects
tracing.langchain_tracing_v2 (LANGCHAIN_TRACING_V2) — LangChain Tracing
Category: general
Reserved for future LangSmith integration (v2 tracing protocol). TriBridRAG currently captures local, in-memory request traces for UI preview and does not export traces to LangSmith yet.
Badges: - Requires API key
Links: - LangSmith Setup - Tracing Guide - How to Enable
tracing.langtrace_api_host (LANGTRACE_API_HOST) — LangTrace API Host
Category: infrastructure
LangTrace API endpoint host (optional). Stored in config field tracing.langtrace_api_host (and surfaced as LANGTRACE_API_HOST in env exports). Reserved for future external trace export.
Links: - Langtrace Docs
tracing.langtrace_project_id (LANGTRACE_PROJECT_ID) — LangTrace Project ID
Category: general
Project identifier for LangTrace (optional). Stored in config field tracing.langtrace_project_id (and surfaced as LANGTRACE_PROJECT_ID in env exports). Reserved for future external trace export.
Links: - Langtrace Projects
tracing.log_level (LOG_LEVEL) — Log Level
Category: general
Logging verbosity level. Options: DEBUG (verbose, dev), INFO (normal, recommended), WARNING (errors + warnings only), ERROR (errors only). Higher levels reduce noise but may hide useful diagnostics.
Links: - Python Logging Levels - Logging Best Practices
tracing.metrics_enabled (METRICS_ENABLED) — Metrics Enabled
Category: evaluation
Master toggle for exporting internal metrics (for example Prometheus-compatible counters/gauges/histograms). When disabled, trace logs may still exist but metric endpoints/series emission are reduced or silent.
Enable in environments where dashboards/alerts depend on metric streams. Disable only for constrained local runs where telemetry overhead must be minimized.
- Affects: observability dashboards, alert pipelines, SLO reporting
- Pair with: PROMETHEUS_PORT, TRACE_SAMPLING_RATE
Links: - Prometheus Metrics - Monitoring Best Practices
tracing.prometheus_port (PROMETHEUS_PORT) — Prometheus Port
Category: infrastructure
Network port used to expose Prometheus-compatible metrics endpoint. Monitoring systems scrape this port to collect retrieval/tracing health and performance metrics.
Ensure this port is reachable from your monitoring plane and does not conflict with other services on the host. In containerized setups, verify both container and host mapping are correct.
- Typical default: 9090
- Affects: metrics ingestion and dashboard freshness
Links: - Prometheus Basics - Metrics Endpoint
tracing.trace_auto_ls (TRACE_AUTO_LS) — Auto-open LangSmith
Category: general
UI convenience flag intended to auto-open LangSmith after a request (1=yes, 0=no). TriBridRAG does not currently implement LangSmith deep-linking; this setting is reserved for future integration.
Links: - LangSmith Setup
tracing.trace_retention (TRACE_RETENTION) — Trace Retention
Category: general
Retention cap for stored trace records/events in local history. Older traces beyond this limit are pruned to keep storage and UI performance manageable.
Set based on debugging horizon and storage budget. Higher retention helps long-tail incident analysis but increases disk usage and list/query overhead.
- Lower retention: lean local storage
- Higher retention: richer historical diagnostics
Links: - Data Retention
tracing.trace_sampling_rate (TRACE_SAMPLING_RATE) — Trace Sampling Rate
Category: general
Percentage of requests to trace with LangSmith/observability (0.0-1.0). 1.0 = trace everything (100%), 0.1 = trace 10% of requests, 0.0 = no tracing. Lower sampling reduces LangSmith costs and overhead while still providing visibility into system behavior. Use 1.0 during development/debugging, 0.05-0.2 in production for cost-effective monitoring. Sampling is random - every request has this probability of being traced.
Recommended: 1.0 for development, 0.1-0.2 for production monitoring, 0.05 for high-traffic systems.
Badges: - Cost control - Observability
Links: - LangSmith Tracing - Sampling Strategies - Trace Costs
tracing.tracing_enabled (TRACING_ENABLED) — Tracing Enabled
Category: general
Enable TriBridRAG request tracing. This records an in-memory per-request event trace (used by the UI “Routing Trace” preview) for debugging routing/retrieval decisions and latency. This does not export traces to external providers.
Links: - Distributed Tracing (concepts)
tracing.tracing_mode (TRACING_MODE) — Tracing Mode
Category: general
Selects the tracing backend mode used by the retrieval pipeline (for example off, local, or langsmith export). Mode determines where trace data is emitted and how debugging/observability workflows operate.
Use local mode for quick iteration and deterministic local introspection; use external modes for team-level dashboards and long-term telemetry. Keep credentials/endpoints aligned with selected mode.
- off: minimal overhead, no trace diagnostics
- local: local debugging visibility
- langsmith: remote observability and sharing
Links: - LangSmith
tracing.tribrid_log_path (TRIBRID_LOG_PATH) — Reranker Log Path
Category: general
Filesystem path for local query/trace log persistence. This file is used for audit trails, debugging replay, and compatibility tooling that reads historical run data.
Choose a stable writable location with predictable rotation/backup behavior. In production, pair with retention management to avoid unbounded log growth.
- Affects: local trace replay and diagnostics
- Verify path permissions in containerized environments
Links: - Python logging