Config reference: `semantic_cache`

Enterprise tuning surface

Defaults + constraints are rendered directly from Pydantic.
Env keys when available

Many fields have an env-style alias (from TriBridConfig.to_flat_dict()).
Tooltip-level guidance

If a matching glossary entry exists, you’ll see deeper tuning notes.

Config reference Config API & workflow Glossary

Total parameters: 13

Group index

(root)

`(root)`

JSON key	Env key(s)	Type	Default	Constraints	Summary
`semantic_cache.bypass_if_images`	`SEMANTIC_CACHE_BYPASS_IF_IMAGES`	`int`	`1`	≥ 0, ≤ 1	Bypass chat generation cache when images are attached.
`semantic_cache.chat_history_window`	`SEMANTIC_CACHE_CHAT_HISTORY_WINDOW`	`int`	`6`	≥ 0, ≤ 50	Number of prior conversation turns included in chat cache fingerprint.
`semantic_cache.enabled`	`SEMANTIC_CACHE_ENABLED`	`int`	`0`	≥ 0, ≤ 1	Enable semantic cache reads/writes (0=off, 1=on).
`semantic_cache.max_entries`	`SEMANTIC_CACHE_MAX_ENTRIES`	`int`	`5000`	≥ 100, ≤ 500000	Maximum cache rows to retain per scope/endpoint.
`semantic_cache.max_temperature_for_write`	`SEMANTIC_CACHE_MAX_TEMPERATURE_FOR_WRITE`	`float`	`0.5`	≥ 0.0, ≤ 2.0	Skip generation-cache writes when temperature exceeds this value.
`semantic_cache.min_query_chars`	`SEMANTIC_CACHE_MIN_QUERY_CHARS`	`int`	`3`	≥ 1, ≤ 200	Minimum query length before cache is eligible.
`semantic_cache.mode`	`SEMANTIC_CACHE_MODE`	`Literal["read_write", "read_only", "write_only"]`	`"read_write"`	allowed="read_write", "read_only", "write_only"	Cache mode when enabled.
`semantic_cache.similarity_threshold_answer`	`SEMANTIC_CACHE_THRESHOLD_ANSWER`	`float`	`0.93`	≥ 0.0, ≤ 1.0	Minimum cosine similarity for semantic answer cache hits.
`semantic_cache.similarity_threshold_chat`	`SEMANTIC_CACHE_THRESHOLD_CHAT`	`float`	`0.95`	≥ 0.0, ≤ 1.0	Minimum cosine similarity for semantic chat cache hits.
`semantic_cache.similarity_threshold_search`	`SEMANTIC_CACHE_THRESHOLD_SEARCH`	`float`	`0.9`	≥ 0.0, ≤ 1.0	Minimum cosine similarity for semantic search cache hits.
`semantic_cache.ttl_seconds_answer`	`SEMANTIC_CACHE_TTL_ANSWER_SEC`	`int`	`1800`	≥ 10, ≤ 86400	TTL in seconds for answer cache entries.
`semantic_cache.ttl_seconds_chat`	`SEMANTIC_CACHE_TTL_CHAT_SEC`	`int`	`600`	≥ 10, ≤ 86400	TTL in seconds for chat cache entries.
`semantic_cache.ttl_seconds_search`	`SEMANTIC_CACHE_TTL_SEARCH_SEC`	`int`	`900`	≥ 10, ≤ 86400	TTL in seconds for search cache entries.

Config reference: semantic_cache

(root)

Config reference: `semantic_cache`

`(root)`