diff --git a/docs/openapi.md b/docs/openapi.md index ac3f33344..b2f1b512d 100644 --- a/docs/openapi.md +++ b/docs/openapi.md @@ -2,6 +2,398 @@ Lightspeed Core Service (LCS) service API specification. + + + + * [🌍 Base URL](#-base-url) +* [🛠️ APIs](#-apis) + * [List of REST API endpoints](#list-of-rest-api-endpoints) + * [GET `/`](#get-) + * [Parameters:](#parameters) + * [Returns:](#returns) + * [✅ Responses](#-responses) + * [GET `/v1/info`](#get-v1info) + * [Parameters:](#parameters-1) + * [Raises:](#raises) + * [Returns:](#returns-1) + * [✅ Responses](#-responses-1) + * [GET `/v1/models`](#get-v1models) + * [Parameters:](#parameters-2) + * [Raises:](#raises-1) + * [Returns:](#returns-2) + * [🔗 Parameters](#-parameters) + * [✅ Responses](#-responses-2) + * [GET `/v1/tools`](#get-v1tools) + * [Parameters:](#parameters-3) + * [Raises:](#raises-2) + * [Returns:](#returns-3) + * [✅ Responses](#-responses-3) + * [GET `/v1/mcp-auth/client-options`](#get-v1mcp-authclient-options) + * [Parameters:](#parameters-4) + * [Returns:](#returns-4) + * [✅ Responses](#-responses-4) + * [GET `/v1/mcp-servers`](#get-v1mcp-servers) + * [Parameters:](#parameters-5) + * [Raises:](#raises-3) + * [Returns:](#returns-5) + * [✅ Responses](#-responses-5) + * [POST `/v1/mcp-servers`](#post-v1mcp-servers) + * [Parameters:](#parameters-6) + * [Raises:](#raises-4) + * [Returns:](#returns-6) + * [📦 Request Body](#-request-body) + * [✅ Responses](#-responses-6) + * [DELETE `/v1/mcp-servers/{name}`](#delete-v1mcp-serversname) + * [Parameters:](#parameters-7) + * [Raises:](#raises-5) + * [Returns:](#returns-7) + * [🔗 Parameters](#-parameters-1) + * [✅ Responses](#-responses-7) + * [GET `/v1/shields`](#get-v1shields) + * [Parameters:](#parameters-8) + * [Returns:](#returns-8) + * [Raises:](#raises-6) + * [✅ Responses](#-responses-8) + * [GET `/v1/providers`](#get-v1providers) + * [Parameters:](#parameters-9) + * [Returns:](#returns-9) + * [Raises:](#raises-7) + * [✅ Responses](#-responses-9) + * [GET `/v1/providers/{provider_id}`](#get-v1providersprovider_id) + * [Parameters:](#parameters-10) + * [Returns:](#returns-10) + * [Raises:](#raises-8) + * [🔗 Parameters](#-parameters-2) + * [✅ Responses](#-responses-10) + * [GET `/v1/prompts`](#get-v1prompts) + * [Parameters:](#parameters-11) + * [Raises:](#raises-9) + * [Returns:](#returns-11) + * [✅ Responses](#-responses-11) + * [POST `/v1/prompts`](#post-v1prompts) + * [Parameters:](#parameters-12) + * [Raises:](#raises-10) + * [Returns:](#returns-12) + * [📦 Request Body](#-request-body-1) + * [✅ Responses](#-responses-12) + * [GET `/v1/prompts/{prompt_id}`](#get-v1promptsprompt_id) + * [Parameters:](#parameters-13) + * [Raises:](#raises-11) + * [Returns:](#returns-13) + * [🔗 Parameters](#-parameters-3) + * [✅ Responses](#-responses-13) + * [PUT `/v1/prompts/{prompt_id}`](#put-v1promptsprompt_id) + * [Parameters:](#parameters-14) + * [Raises:](#raises-12) + * [Returns:](#returns-14) + * [🔗 Parameters](#-parameters-4) + * [📦 Request Body](#-request-body-2) + * [✅ Responses](#-responses-14) + * [DELETE `/v1/prompts/{prompt_id}`](#delete-v1promptsprompt_id) + * [Parameters:](#parameters-15) + * [Raises:](#raises-13) + * [Returns:](#returns-15) + * [🔗 Parameters](#-parameters-5) + * [✅ Responses](#-responses-15) + * [GET `/v1/rags`](#get-v1rags) + * [✅ Responses](#-responses-16) + * [GET `/v1/rags/{rag_id}`](#get-v1ragsrag_id) + * [🔗 Parameters](#-parameters-6) + * [✅ Responses](#-responses-17) + * [POST `/v1/query`](#post-v1query) + * [Parameters:](#parameters-16) + * [Returns:](#returns-16) + * [Raises:](#raises-14) + * [📦 Request Body](#-request-body-3) + * [✅ Responses](#-responses-18) + * [POST `/v1/streaming_query`](#post-v1streaming_query) + * [Parameters:](#parameters-17) + * [Returns:](#returns-17) + * [Raises:](#raises-15) + * [📦 Request Body](#-request-body-4) + * [✅ Responses](#-responses-19) + * [POST `/v1/streaming_query/interrupt`](#post-v1streaming_queryinterrupt) + * [Parameters:](#parameters-18) + * [Returns:](#returns-18) + * [Raises:](#raises-16) + * [📦 Request Body](#-request-body-5) + * [✅ Responses](#-responses-20) + * [GET `/v1/config`](#get-v1config) + * [Parameters:](#parameters-19) + * [Returns:](#returns-19) + * [✅ Responses](#-responses-21) + * [POST `/v1/feedback`](#post-v1feedback) + * [Parameters:](#parameters-20) + * [Returns:](#returns-20) + * [Raises:](#raises-17) + * [📦 Request Body](#-request-body-6) + * [✅ Responses](#-responses-22) + * [GET `/v1/feedback/status`](#get-v1feedbackstatus) + * [Parameters:](#parameters-21) + * [Returns:](#returns-21) + * [✅ Responses](#-responses-23) + * [PUT `/v1/feedback/status`](#put-v1feedbackstatus) + * [Parameters:](#parameters-22) + * [Returns:](#returns-22) + * [📦 Request Body](#-request-body-7) + * [✅ Responses](#-responses-24) + * [GET `/v1/conversations`](#get-v1conversations) + * [✅ Responses](#-responses-25) + * [GET `/v1/conversations/{conversation_id}`](#get-v1conversationsconversation_id) + * [🔗 Parameters](#-parameters-7) + * [✅ Responses](#-responses-26) + * [DELETE `/v1/conversations/{conversation_id}`](#delete-v1conversationsconversation_id) + * [🔗 Parameters](#-parameters-8) + * [✅ Responses](#-responses-27) + * [PUT `/v1/conversations/{conversation_id}`](#put-v1conversationsconversation_id) + * [🔗 Parameters](#-parameters-9) + * [📦 Request Body](#-request-body-8) + * [✅ Responses](#-responses-28) + * [GET `/v2/conversations`](#get-v2conversations) + * [✅ Responses](#-responses-29) + * [GET `/v2/conversations/{conversation_id}`](#get-v2conversationsconversation_id) + * [🔗 Parameters](#-parameters-10) + * [✅ Responses](#-responses-30) + * [DELETE `/v2/conversations/{conversation_id}`](#delete-v2conversationsconversation_id) + * [🔗 Parameters](#-parameters-11) + * [✅ Responses](#-responses-31) + * [PUT `/v2/conversations/{conversation_id}`](#put-v2conversationsconversation_id) + * [🔗 Parameters](#-parameters-12) + * [📦 Request Body](#-request-body-9) + * [✅ Responses](#-responses-32) + * [POST `/v1/responses`](#post-v1responses) + * [📦 Request Body](#-request-body-10) + * [✅ Responses](#-responses-33) + * [POST `/v1/infer`](#post-v1infer) + * [📦 Request Body](#-request-body-11) + * [✅ Responses](#-responses-34) + * [GET `/readiness`](#get-readiness) + * [Parameters:](#parameters-23) + * [Returns:](#returns-23) + * [✅ Responses](#-responses-35) + * [GET `/liveness`](#get-liveness) + * [Parameters:](#parameters-24) + * [Returns:](#returns-24) + * [✅ Responses](#-responses-36) + * [POST `/authorized`](#post-authorized) + * [Parameters:](#parameters-25) + * [Returns:](#returns-25) + * [✅ Responses](#-responses-37) + * [GET `/metrics`](#get-metrics) + * [Parameters:](#parameters-26) + * [Returns:](#returns-26) + * [✅ Responses](#-responses-38) + * [GET `/.well-known/agent-card.json`](#get-well-knownagent-cardjson) + * [✅ Responses](#-responses-39) + * [GET `/.well-known/agent.json`](#get-well-knownagentjson) + * [✅ Responses](#-responses-40) + * [GET `/a2a`](#get-a2a) + * [✅ Responses](#-responses-41) + * [POST `/a2a`](#post-a2a) + * [✅ Responses](#-responses-42) + * [GET `/a2a/health`](#get-a2ahealth) + * [✅ Responses](#-responses-43) + * [| 200 | Successful Response | object |](#-200--successful-response--object-) +* [📋 Components](#-components) + * [A2AStateConfiguration](#a2astateconfiguration) + * [APIKeySecurityScheme](#apikeysecurityscheme) + * [APIKeyTokenConfiguration](#apikeytokenconfiguration) + * [AccessRule](#accessrule) + * [Action](#action) + * [AgentCapabilities](#agentcapabilities) + * [AgentCard](#agentcard) + * [AgentCardSignature](#agentcardsignature) + * [AgentExtension](#agentextension) + * [AgentInterface](#agentinterface) + * [AgentProvider](#agentprovider) + * [AgentSkill](#agentskill) + * [AllowedToolsFilter](#allowedtoolsfilter) + * [ApprovalFilter](#approvalfilter) + * [ApprovalFilter-Output](#approvalfilter-output) + * [ApprovalsConfiguration](#approvalsconfiguration) + * [Attachment](#attachment) + * [AuthenticationConfiguration](#authenticationconfiguration) + * [AuthorizationCodeOAuthFlow](#authorizationcodeoauthflow) + * [AuthorizationConfiguration](#authorizationconfiguration) + * [AuthorizedResponse](#authorizedresponse) + * [AzureEntraIdConfiguration](#azureentraidconfiguration) + * [BadRequestResponse](#badrequestresponse) + * [Body_create_file_v1_files_post](#body_create_file_v1_files_post) + * [ByokRag](#byokrag) + * [CORSConfiguration](#corsconfiguration) + * [ClientCredentialsOAuthFlow](#clientcredentialsoauthflow) + * [CompactionConfiguration](#compactionconfiguration) + * [Configuration](#configuration) + * [ConfigurationResponse](#configurationresponse) + * [ConflictResponse](#conflictresponse) + * [ConversationData](#conversationdata) + * [ConversationDeleteResponse](#conversationdeleteresponse) + * [ConversationDetails](#conversationdetails) + * [ConversationHistoryConfiguration](#conversationhistoryconfiguration) + * [ConversationResponse](#conversationresponse) + * [ConversationTurn](#conversationturn) + * [ConversationUpdateRequest](#conversationupdaterequest) + * [ConversationUpdateResponse](#conversationupdateresponse) + * [ConversationsListResponse](#conversationslistresponse) + * [ConversationsListResponseV2](#conversationslistresponsev2) + * [CustomProfile](#customprofile) + * [Customization](#customization) + * [DatabaseConfiguration](#databaseconfiguration) + * [DetailModel](#detailmodel) + * [FeedbackCategory](#feedbackcategory) + * [FeedbackRequest](#feedbackrequest) + * [FeedbackResponse](#feedbackresponse) + * [FeedbackStatusUpdateRequest](#feedbackstatusupdaterequest) + * [FeedbackStatusUpdateResponse](#feedbackstatusupdateresponse) + * [FileResponse](#fileresponse) + * [FileTooLargeResponse](#filetoolargeresponse) + * [ForbiddenResponse](#forbiddenresponse) + * [HTTPAuthSecurityScheme](#httpauthsecurityscheme) + * [HTTPValidationError](#httpvalidationerror) + * [ImplicitOAuthFlow](#implicitoauthflow) + * [In](#in) + * [InMemoryCacheConfig](#inmemorycacheconfig) + * [IncludeParameter](#includeparameter) + * [InferenceConfiguration](#inferenceconfiguration) + * [InfoResponse](#inforesponse) + * [InternalServerErrorResponse](#internalservererrorresponse) + * [JsonPathOperator](#jsonpathoperator) + * [JwkConfiguration](#jwkconfiguration) + * [JwtConfiguration](#jwtconfiguration) + * [JwtRoleRule](#jwtrolerule) + * [LivenessResponse](#livenessresponse) + * [LlamaStackConfiguration](#llamastackconfiguration) + * [MCPClientAuthOptionsResponse](#mcpclientauthoptionsresponse) + * [MCPListToolsTool](#mcplisttoolstool) + * [MCPServerAuthInfo](#mcpserverauthinfo) + * [MCPServerDeleteResponse](#mcpserverdeleteresponse) + * [MCPServerInfo](#mcpserverinfo) + * [MCPServerListResponse](#mcpserverlistresponse) + * [MCPServerRegistrationRequest](#mcpserverregistrationrequest) + * [MCPServerRegistrationResponse](#mcpserverregistrationresponse) + * [Message](#message) + * [ModelContextProtocolServer](#modelcontextprotocolserver) + * [ModelsResponse](#modelsresponse) + * [MutualTLSSecurityScheme](#mutualtlssecurityscheme) + * [NotFoundResponse](#notfoundresponse) + * [OAuth2SecurityScheme](#oauth2securityscheme) + * [OAuthFlows](#oauthflows) + * [OkpConfiguration](#okpconfiguration) + * [OpenAIResponseAnnotationCitation](#openairesponseannotationcitation) + * [OpenAIResponseAnnotationContainerFileCitation](#openairesponseannotationcontainerfilecitation) + * [OpenAIResponseAnnotationFileCitation](#openairesponseannotationfilecitation) + * [OpenAIResponseAnnotationFilePath](#openairesponseannotationfilepath) + * [OpenAIResponseContentPartRefusal](#openairesponsecontentpartrefusal) + * [OpenAIResponseError](#openairesponseerror) + * [OpenAIResponseInputFunctionToolCallOutput](#openairesponseinputfunctiontoolcalloutput) + * [OpenAIResponseInputMessageContentFile](#openairesponseinputmessagecontentfile) + * [OpenAIResponseInputMessageContentImage](#openairesponseinputmessagecontentimage) + * [OpenAIResponseInputMessageContentText](#openairesponseinputmessagecontenttext) + * [OpenAIResponseInputToolChoiceAllowedTools](#openairesponseinputtoolchoiceallowedtools) + * [OpenAIResponseInputToolChoiceCustomTool](#openairesponseinputtoolchoicecustomtool) + * [OpenAIResponseInputToolChoiceFileSearch](#openairesponseinputtoolchoicefilesearch) + * [OpenAIResponseInputToolChoiceFunctionTool](#openairesponseinputtoolchoicefunctiontool) + * [OpenAIResponseInputToolChoiceMCPTool](#openairesponseinputtoolchoicemcptool) + * [OpenAIResponseInputToolChoiceMode](#openairesponseinputtoolchoicemode) + * [OpenAIResponseInputToolChoiceWebSearch](#openairesponseinputtoolchoicewebsearch) + * [OpenAIResponseInputToolFileSearch](#openairesponseinputtoolfilesearch) + * [OpenAIResponseInputToolFunction](#openairesponseinputtoolfunction) + * [OpenAIResponseInputToolMCP](#openairesponseinputtoolmcp) + * [OpenAIResponseInputToolWebSearch](#openairesponseinputtoolwebsearch) + * [OpenAIResponseMCPApprovalRequest](#openairesponsemcpapprovalrequest) + * [OpenAIResponseMCPApprovalResponse](#openairesponsemcpapprovalresponse) + * [OpenAIResponseMessage-Input](#openairesponsemessage-input) + * [OpenAIResponseMessage-Output](#openairesponsemessage-output) + * [OpenAIResponseOutputMessageContentOutputText-Input](#openairesponseoutputmessagecontentoutputtext-input) + * [OpenAIResponseOutputMessageContentOutputText-Output](#openairesponseoutputmessagecontentoutputtext-output) + * [OpenAIResponseOutputMessageFileSearchToolCall](#openairesponseoutputmessagefilesearchtoolcall) + * [OpenAIResponseOutputMessageFileSearchToolCallResults](#openairesponseoutputmessagefilesearchtoolcallresults) + * [OpenAIResponseOutputMessageFunctionToolCall](#openairesponseoutputmessagefunctiontoolcall) + * [OpenAIResponseOutputMessageMCPCall](#openairesponseoutputmessagemcpcall) + * [OpenAIResponseOutputMessageMCPListTools](#openairesponseoutputmessagemcplisttools) + * [OpenAIResponseOutputMessageWebSearchToolCall](#openairesponseoutputmessagewebsearchtoolcall) + * [OpenAIResponsePrompt](#openairesponseprompt) + * [OpenAIResponseReasoning](#openairesponsereasoning) + * [OpenAIResponseText](#openairesponsetext) + * [OpenAIResponseTextFormat](#openairesponsetextformat) + * [OpenAIResponseToolMCP](#openairesponsetoolmcp) + * [OpenAIResponseUsage](#openairesponseusage) + * [OpenAIResponseUsageInputTokensDetails](#openairesponseusageinputtokensdetails) + * [OpenAIResponseUsageOutputTokensDetails](#openairesponseusageoutputtokensdetails) + * [OpenAITokenLogProb](#openaitokenlogprob) + * [OpenAITopLogProb](#openaitoplogprob) + * [OpenIdConnectSecurityScheme](#openidconnectsecurityscheme) + * [PasswordOAuthFlow](#passwordoauthflow) + * [PostgreSQLDatabaseConfiguration](#postgresqldatabaseconfiguration) + * [PromptCreateRequest](#promptcreaterequest) + * [PromptDeleteResponse](#promptdeleteresponse) + * [PromptResourceResponse](#promptresourceresponse) + * [PromptTooLongResponse](#prompttoolongresponse) + * [PromptUpdateRequest](#promptupdaterequest) + * [PromptsListResponse](#promptslistresponse) + * [ProviderHealthStatus](#providerhealthstatus) + * [ProviderResponse](#providerresponse) + * [ProvidersListResponse](#providerslistresponse) + * [QueryRequest](#queryrequest) + * [QueryResponse](#queryresponse) + * [QuotaExceededResponse](#quotaexceededresponse) + * [QuotaHandlersConfiguration](#quotahandlersconfiguration) + * [QuotaLimiterConfiguration](#quotalimiterconfiguration) + * [QuotaSchedulerConfiguration](#quotaschedulerconfiguration) + * [RAGChunk](#ragchunk) + * [RAGInfoResponse](#raginforesponse) + * [RAGListResponse](#raglistresponse) + * [RHIdentityConfiguration](#rhidentityconfiguration) + * [RagConfiguration](#ragconfiguration) + * [ReadinessResponse](#readinessresponse) + * [ReferencedDocument](#referenceddocument) + * [RerankerConfiguration](#rerankerconfiguration) + * [ResponseInput](#responseinput) + * [ResponseItem](#responseitem) + * [ResponsesRequest](#responsesrequest) + * [ResponsesResponse](#responsesresponse) + * [RlsapiV1Attachment](#rlsapiv1attachment) + * [RlsapiV1CLA](#rlsapiv1cla) + * [RlsapiV1Configuration](#rlsapiv1configuration) + * [RlsapiV1Context](#rlsapiv1context) + * [RlsapiV1InferData](#rlsapiv1inferdata) + * [RlsapiV1InferRequest](#rlsapiv1inferrequest) + * [RlsapiV1InferResponse](#rlsapiv1inferresponse) + * [RlsapiV1SystemInfo](#rlsapiv1systeminfo) + * [RlsapiV1Terminal](#rlsapiv1terminal) + * [SQLiteDatabaseConfiguration](#sqlitedatabaseconfiguration) + * [SearchRankingOptions](#searchrankingoptions) + * [SecurityScheme](#securityscheme) + * [ServiceConfiguration](#serviceconfiguration) + * [ServiceUnavailableResponse](#serviceunavailableresponse) + * [ShieldsResponse](#shieldsresponse) + * [SkillsConfiguration](#skillsconfiguration) + * [SolrVectorSearchRequest](#solrvectorsearchrequest) + * [SplunkConfiguration](#splunkconfiguration) + * [StatusResponse](#statusresponse) + * [StreamingInterruptRequest](#streaminginterruptrequest) + * [StreamingInterruptResponse](#streaminginterruptresponse) + * [TLSConfiguration](#tlsconfiguration) + * [ToolCallSummary](#toolcallsummary) + * [ToolResultSummary](#toolresultsummary) + * [ToolsResponse](#toolsresponse) + * [UnauthorizedResponse](#unauthorizedresponse) + * [UnprocessableEntityResponse](#unprocessableentityresponse) + * [UserDataCollection](#userdatacollection) + * [ValidationError](#validationerror) + * [VectorStoreCreateRequest](#vectorstorecreaterequest) + * [VectorStoreDeleteResponse](#vectorstoredeleteresponse) + * [VectorStoreFileCreateRequest](#vectorstorefilecreaterequest) + * [VectorStoreFileDeleteResponse](#vectorstorefiledeleteresponse) + * [VectorStoreFileResponse](#vectorstorefileresponse) + * [VectorStoreFilesListResponse](#vectorstorefileslistresponse) + * [VectorStoreResponse](#vectorstoreresponse) + * [VectorStoreUpdateRequest](#vectorstoreupdaterequest) + * [VectorStoresListResponse](#vectorstoreslistresponse) + + + + ## 🌍 Base URL @@ -5021,10 +5413,44 @@ Filter configuration for MCP tool approval requirements. | never | | | +## ApprovalFilter-Output + + +Granular approval control for specific MCP tools. + +Attributes: + always: Tool names that always require human approval before execution. + never: Tool names that never require approval (pre-approved). + + +| Field | Type | Description | +|-------|------|-------------| +| always | array | List of tool names that always require human approval | +| never | array | List of tool names that never require approval | + + +## ApprovalsConfiguration + + +Configuration for human-in-the-loop approvals. + +Attributes: + approval_timeout_seconds: How long approval requests remain pending + before expiring. + approval_retention_days: How long to retain decided approvals for audit + purposes before cleanup. + + +| Field | Type | Description | +|-------|------|-------------| +| approval_timeout_seconds | integer | Seconds before pending approval requests expire | +| approval_retention_days | integer | Days to retain decided approvals before cleanup | + + ## Attachment -Model representing an attachment that can be send from the UI as part of query. +Model representing an attachment that can be sent from the UI as part of query. A list of attachments can be an optional part of 'query' request. @@ -5060,6 +5486,7 @@ Authentication configuration. | module | string | | | skip_tls_verification | boolean | | | skip_for_health_probes | boolean | Skip authorization for readiness and liveness probes | +| skip_for_metrics | boolean | Skip authorization for the /metrics endpoint | | k8s_cluster_api | | | | k8s_ca_cert_path | | | | jwk_config | | | @@ -5199,6 +5626,44 @@ Defines configuration details for the OAuth 2.0 Client Credentials flow. | tokenUrl | string | | +## CompactionConfiguration + + +Configuration for conversation history compaction. + +Compaction summarizes older conversation turns when their estimated +token count approaches the context window limit, keeping the +conversation usable instead of failing with HTTP 413. The +configuration here controls when compaction triggers and how much +recent context is preserved verbatim. + +Attributes: + enabled: Master switch. When False, compaction never triggers + and other fields are inert. + threshold_ratio: Trigger compaction when estimated input tokens + exceed this fraction of the model's context window + (clamped to 0.0..1.0). + token_floor: Minimum estimated token count before compaction + can trigger, regardless of threshold_ratio. Prevents + triggering on very small context windows. + buffer_turns: Initial number of recent turns to keep verbatim. + The runtime applies a degrading guard — if these turns + exceed the available budget, it reduces buffer_turns by + one repeatedly until the budget fits, down to zero. + buffer_max_ratio: Hard cap on the fraction of the context + window the buffer zone may occupy, regardless of + buffer_turns. + + +| Field | Type | Description | +|-------|------|-------------| +| enabled | boolean | When true, older conversation turns are summarized when estimated tokens approach the context window limit. | +| threshold_ratio | number | Trigger compaction when estimated tokens exceed this fraction of the model's context window (0.0-1.0). | +| token_floor | integer | Minimum token count before compaction can trigger. Prevents triggering on very small context windows. | +| buffer_turns | integer | Number of recent turns to keep verbatim. | +| buffer_max_ratio | number | Maximum fraction of context window the buffer zone can occupy, regardless of buffer_turns. | + + ## Configuration @@ -5218,6 +5683,8 @@ Global service configuration. | customization | | It is possible to customize Lightspeed Core Stack via this section. System prompt can be customized and also different parts of the service can be replaced by custom Python modules. | | inference | | One LLM provider and one its model might be selected as default ones. When no provider+model pair is specified in REST API calls (query endpoints), the default provider and model are used. | | conversation_cache | | | +| compaction | | Controls when conversation history is summarized to keep the model's input below the context window limit. Disabled by default — when disabled, requests that exceed the window continue to surface as HTTP 413. | +| approvals | | Settings for human-in-the-loop approval of MCP tool invocations | | byok_rag | array | BYOK RAG configuration. This configuration can be used to reconfigure Llama Stack through its run.yaml configuration file | | a2a_state | | Configuration for A2A protocol persistent state storage. | | quota_handlers | | Quota handlers configuration | @@ -5227,6 +5694,8 @@ Global service configuration. | deployment_environment | string | Deployment environment name (e.g., 'development', 'staging', 'production'). Used in telemetry events. | | rag | | Configuration for all RAG strategies (inline and tool-based). | | okp | | OKP provider settings. Only used when 'okp' is listed in rag.inline or rag.tool. | +| reranker | | Configuration for neural reranking of RAG chunks using cross-encoder. | +| skills | | Agent skills configuration. Specifies paths to skill directories. | ## ConfigurationResponse @@ -5234,6 +5703,9 @@ Global service configuration. Success response model for the config endpoint. +Attributes: + configuration: Parsed application configuration returned to the client. + | Field | Type | Description | |-------|------|-------------|