37 commits
0e72363
refactor: migrate ai-groq + ai-openrouter onto @tanstack/openai-base …
tombeckenham May 11, 2026
2c52bd1
ci: apply automated fixes
autofix-ci[bot] May 11, 2026
0171b18
fix(openai-base, ai-openrouter, ai): silent failures in chat-completi…
tombeckenham May 11, 2026
d2fea9d
feat(ai-openrouter, openai-base): OpenRouter Responses (beta) adapter
tombeckenham May 12, 2026
fb29464
chore(ai-groq): remove dead unused message-param types
tombeckenham May 12, 2026
62c610b
fix(ai-openrouter): pass UNKNOWN-fallback events through verbatim
tombeckenham May 12, 2026
1da0efb
Merge remote-tracking branch 'origin/main' into 543-migrate-ai-groq-a…
tombeckenham May 12, 2026
5afeea3
refactor(adapters): remove asChunk casts, enforce satisfies StreamChunk
tombeckenham May 12, 2026
3e39b8d
Merge remote-tracking branch 'origin/main' into cr-545
AlemTuzlak May 12, 2026
dd9e509
fix(ai-openrouter): preserve assistant/tool message content fidelity
AlemTuzlak May 12, 2026
bde61e6
fix(ai-groq): correct ChatCompletionNamedToolChoice shape
AlemTuzlak May 12, 2026
e83df33
test(ai-groq): reset pendingMockCreate between tests
AlemTuzlak May 12, 2026
fb8cf48
test(e2e): route OpenRouter summarize through createOpenRouterSummarize
AlemTuzlak May 12, 2026
0a14005
chore(ai-openrouter): declare zod as peer dependency
AlemTuzlak May 12, 2026
7bb3d8b
fix(ai-groq): drop spurious timestamp field from processStreamChunks …
AlemTuzlak May 12, 2026
c1cda01
fix(ai-openrouter): stringify error.code on response.failed events
AlemTuzlak May 12, 2026
4ba13c9
fix(ai-openrouter): default image data URI mime type to octet-stream
AlemTuzlak May 12, 2026
d1f80e1
fix(openai-base): stop processing chunks after top-level error event
AlemTuzlak May 12, 2026
a701cb2
fix(openai-base, ai-openrouter): route Responses structuredOutput thr…
AlemTuzlak May 12, 2026
21e6b4e
fix(ai-openrouter): extract text from array-shaped tool message content
AlemTuzlak May 12, 2026
9b0bdbd
chore(ai-groq): declare @tanstack/ai as workspace devDependency
AlemTuzlak May 12, 2026
9460493
fix(ai-openrouter): route audio URLs to text fallback on chat-complet…
AlemTuzlak May 12, 2026
9fd3168
docs(ai-groq): correct message-types header β€” Groq SDK was dropped
AlemTuzlak May 12, 2026
290b0e7
fix(ai-openrouter): reject inline document data on chat-completions
AlemTuzlak May 12, 2026
06d3d8c
refactor: rename @tanstack/openai-base β†’ @tanstack/openai-compatible
tombeckenham May 13, 2026
16ce307
ci: apply automated fixes
autofix-ci[bot] May 13, 2026
7d45389
refactor: rename @tanstack/openai-compatible β†’ @tanstack/ai-openai-co…
tombeckenham May 13, 2026
3101dbe
docs(ai-openai-compatible, ai-openrouter): explain the protocol-vs-pr…
tombeckenham May 13, 2026
5e15d2b
ci: apply automated fixes
autofix-ci[bot] May 13, 2026
90a6018
docs(adapters/openrouter): add Chat Completions vs Responses (beta) s…
tombeckenham May 13, 2026
03bbe46
refactor(ai-openai-compatible): narrow to chat/responses; decouple fr…
tombeckenham May 13, 2026
bf36ae8
ci: apply automated fixes
autofix-ci[bot] May 13, 2026
d57a44e
refactor(ai): rename chat-stream-wrapper to chat-stream-summarize
tombeckenham May 13, 2026
07115a3
refactor(summarize): unify provider summarize adapters on chat-stream…
tombeckenham May 13, 2026
f9b2294
ci: apply automated fixes
autofix-ci[bot] May 13, 2026
e30a3ca
refactor(ai-openai-compatible): vendor wire types; drop openai dep
tombeckenham May 13, 2026
2c0fd29
ci: apply automated fixes
autofix-ci[bot] May 13, 2026
20 changes: 20 additions & 0 deletions .changeset/migrate-groq-openrouter-to-openai-base.md
@@ -0,0 +1,20 @@
---
'@tanstack/ai-openai-compatible': minor
'@tanstack/ai-groq': patch
'@tanstack/ai-openrouter': patch
'@tanstack/ai': patch
---

Migrate `ai-groq` and `ai-openrouter` onto `OpenAICompatibleChatCompletionsTextAdapter` so they share the stream accumulator, partial-JSON tool-call buffer, RUN_ERROR taxonomy, and lifecycle gates with `ai-openai` / `ai-grok`. Removes ~1k LOC of duplicated stream processing.

`@tanstack/ai-openai-compatible` adds four protected hooks on `OpenAICompatibleChatCompletionsTextAdapter` so providers with non-OpenAI SDK shapes can reuse the base: `callChatCompletion` and `callChatCompletionStream` (SDK call sites for non-streaming and streaming Chat Completions), `extractReasoning` (surface reasoning content from chunk shapes that carry it, e.g. OpenRouter's `delta.reasoningDetails`, into the base's `REASONING_*` + legacy `STEP_STARTED`/`STEP_FINISHED` lifecycle), and `transformStructuredOutput` (subclasses like OpenRouter can preserve nulls in structured output instead of converting them to undefined).
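
A minimal sketch of what overriding these hooks can look like. The class and chunk shapes here are illustrative stand-ins, not the real base class — the shipped `OpenAICompatibleChatCompletionsTextAdapter` surface may differ:

```typescript
// Hypothetical chunk-delta shape carrying OpenRouter-style reasoning details.
interface ChunkDelta {
  content?: string;
  reasoningDetails?: Array<{ text?: string }>;
}

// Simplified stand-in for the base adapter's hook surface.
class ChatCompletionsBase {
  protected extractReasoning(_delta: ChunkDelta): string | undefined {
    return undefined; // base: most providers have no reasoning surface
  }
  protected transformStructuredOutput(value: unknown): unknown {
    return value; // base: pass through
  }
}

class OpenRouterLikeAdapter extends ChatCompletionsBase {
  // Surface reasoning text so the base can emit its REASONING_* lifecycle.
  protected override extractReasoning(delta: ChunkDelta): string | undefined {
    return (
      delta.reasoningDetails?.map((d) => d.text ?? "").join("") || undefined
    );
  }
}
```

The point of the hook design is that subclasses only touch the provider-specific surface; the stream accumulator and lifecycle gating stay in the base.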

`@tanstack/ai-openai-compatible` fixes two error-handling regressions in the shared base: `structuredOutput` now throws a distinct `"response contained no content"` error rather than letting empty content cascade into a misleading JSON-parse error, and the post-loop tool-args drain block now logs malformed JSON via `logger.errors` (matching the in-loop finish_reason path) so truncated streams emitting partial tool args are debuggable instead of silently invoking the tool with `{}`.
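
The empty-content guard can be sketched as follows — the function name and shape are assumptions for illustration; the real check lives inside the shared base's `structuredOutput` path:

```typescript
// Illustrative guard: fail loudly on empty content instead of letting
// JSON.parse("") produce a misleading parse error downstream.
function parseStructuredOutput(content: string | null | undefined): unknown {
  if (!content || content.trim() === "") {
    throw new Error("response contained no content");
  }
  return JSON.parse(content);
}
```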

`@tanstack/ai` normalizes abort-shaped errors (`AbortError`, `APIUserAbortError`, `RequestAbortedError`) to a stable `{ message: 'Request aborted', code: 'aborted' }` payload in `toRunErrorPayload`, so consumers can discriminate user-initiated cancellation from other failures without matching on provider-specific message strings.
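
A hedged sketch of the normalization described above — the real `toRunErrorPayload` in `@tanstack/ai` handles more cases; the error-name set and payload shape here mirror only what the changeset states:

```typescript
interface RunErrorPayload {
  message: string;
  code?: string;
}

// Abort-shaped error names from the DOM, OpenAI SDK, and OpenRouter SDK.
const ABORT_ERROR_NAMES = new Set([
  "AbortError",
  "APIUserAbortError",
  "RequestAbortedError",
]);

function toRunErrorPayload(error: unknown): RunErrorPayload {
  if (error instanceof Error && ABORT_ERROR_NAMES.has(error.name)) {
    // Collapse every abort flavour to one stable payload so consumers can
    // branch on `code === 'aborted'` instead of matching message strings.
    return { message: "Request aborted", code: "aborted" };
  }
  return { message: error instanceof Error ? error.message : String(error) };
}
```

For example, an error whose `name` is `"APIUserAbortError"` normalizes to `{ message: 'Request aborted', code: 'aborted' }`, while any other error keeps its original message with no `code`.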

`@tanstack/ai-groq` drops the `groq-sdk` dependency in favour of the OpenAI SDK pointed at `https://api.groq.com/openai/v1` (the same pattern as `ai-grok` against xAI). The Groq-specific quirk where streaming usage arrives under `chunk.x_groq.usage` is preserved via a small `processStreamChunks` wrapper that promotes it to the standard `chunk.usage` slot.
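
The usage-promotion wrapper can be sketched like this; the chunk types are simplified assumptions and the real wrapper lives in `@tanstack/ai-groq`:

```typescript
interface Usage {
  prompt_tokens: number;
  completion_tokens: number;
  total_tokens: number;
}

interface ChatChunk {
  usage?: Usage | null;
  x_groq?: { usage?: Usage };
}

// Groq streams usage under `chunk.x_groq.usage`; lift it into the standard
// `chunk.usage` slot so the shared base adapter can read it unchanged.
async function* promoteGroqUsage(
  chunks: AsyncIterable<ChatChunk>,
): AsyncIterable<ChatChunk> {
  for await (const chunk of chunks) {
    const promoted = chunk.x_groq?.usage;
    if (!chunk.usage && promoted) {
      yield { ...chunk, usage: promoted };
    } else {
      yield chunk;
    }
  }
}
```

Chunks that already carry `usage` pass through untouched, so the wrapper is a no-op against any provider that follows the standard shape.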

`@tanstack/ai-openrouter` keeps `@openrouter/sdk` (the source of truth for OpenRouter's typed provider routing, plugins, and metadata) but routes the SDK call through the base via overridden hooks. A small request shape converter (`max_tokens` β†’ `maxCompletionTokens`, etc.) and chunk shape adapter (camelCase β†’ snake_case for the base's reader) bridge the SDKs. No public API changes; provider routing, app attribution headers (`httpReferer`, `appTitle`), reasoning variants (`:thinking`), and `RequestAbortedError` handling are preserved. Fixes: `stream_options.include_usage` is now correctly camelCased to `includeUsage` so streaming `RUN_FINISHED.usage` is populated (previously silently dropped by the SDK Zod schema); mid-stream `chunk.error.code` is stringified so provider error codes (401, 429, 500, …) survive the `toRunErrorPayload` narrow; assistant `toolCalls[].function.arguments` is stringified to match the SDK's `string` contract; and `convertMessage` now mirrors the base's fail-loud guards (throws on empty user content and unsupported content parts) instead of silently sending empty paid requests.
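
The request-shape bridging can be sketched as below. The field names are examples drawn from the fixes above, not the adapter's full mapping:

```typescript
// OpenAI-wire request fields (snake_case), as the base adapter produces them.
interface OpenAIShapedRequest {
  max_tokens?: number;
  stream_options?: { include_usage?: boolean };
}

// @openrouter/sdk request fields (camelCase), as its Zod schema expects.
interface OpenRouterSdkRequest {
  maxCompletionTokens?: number;
  streamOptions?: { includeUsage?: boolean };
}

function toOpenRouterRequest(req: OpenAIShapedRequest): OpenRouterSdkRequest {
  return {
    maxCompletionTokens: req.max_tokens,
    // The camelCasing here is the fix called out above: sending snake_case
    // `stream_options.include_usage` was silently dropped by the SDK's Zod
    // schema, leaving streaming RUN_FINISHED.usage empty.
    streamOptions: req.stream_options
      ? { includeUsage: req.stream_options.include_usage }
      : undefined,
  };
}
```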

`ai-ollama` remains on `BaseTextAdapter` β€” its native API uses a different wire format from Chat Completions (different chunk shape, request shape, tool-call streaming, and reasoning surface) and doesn't fit the OpenAI base without rebuilding most of the processing it would otherwise inherit. Migrating it remains a separate effort.
27 changes: 27 additions & 0 deletions .changeset/rename-openai-base-to-ai-openai-compatible.md
@@ -0,0 +1,27 @@
---
'@tanstack/ai-openai-compatible': minor
'@tanstack/ai-openai': patch
'@tanstack/ai-openrouter': patch
'@tanstack/ai-groq': patch
'@tanstack/ai-grok': patch
---

Rename `@tanstack/openai-base` β†’ `@tanstack/ai-openai-compatible`.

The previous "base" name implied this package tracked OpenAI's product roadmap. In reality it implements two OpenAI-shaped _wire-format protocols_ that multiple providers ship:

- **Chat Completions** (`/v1/chat/completions`) β€” natively implemented by OpenAI, Groq, Grok, OpenRouter, vLLM, SGLang, Together, etc.
- **Responses** (`/v1/responses`) β€” OpenAI's reference implementation plus OpenRouter's beta routing implementation (which fans out to Anthropic, Google, and other underlying models).

"OpenAI-compatible" is the actual industry term for this family of wire formats (cf. Vercel's `@ai-sdk/openai-compatible`, LiteLLM's "OpenAI-compatible endpoint", BentoML / Lightning AI docs). The renamed package makes the boundary explicit: it holds the protocol, while OpenAI-specific tools, models, and behaviors continue to live in `@tanstack/ai-openai`.

No runtime behavior changes. Class names (`OpenAICompatibleChatCompletionsTextAdapter`, `OpenAICompatibleResponsesTextAdapter`, …) and protected hook contracts are unchanged. Consumer packages (`ai-openai`, `ai-openrouter`, `ai-groq`, `ai-grok`) only update their internal import paths β€” public API is unchanged.

If you were importing from `@tanstack/openai-base` directly (uncommon β€” the package was not yet documented as a public extension point), update your imports:

```diff
- import { OpenAICompatibleChatCompletionsTextAdapter } from '@tanstack/openai-base'
+ import { OpenAICompatibleChatCompletionsTextAdapter } from '@tanstack/ai-openai-compatible'
```

`@tanstack/openai-base@0.2.x` remains published on npm for anyone with a pinned lockfile reference but will receive no further updates.
63 changes: 49 additions & 14 deletions docs/adapters/openrouter.md
@@ -35,16 +35,17 @@ const stream = chat({
## Configuration

```diff
-import { createOpenRouter, type OpenRouterConfig } from "@tanstack/ai-openrouter";
-
-const config: OpenRouterConfig = {
-  apiKey: process.env.OPENROUTER_API_KEY!,
-  baseURL: "https://openrouter.ai/api/v1", // Optional
-  httpReferer: "https://your-app.com", // Optional, for rankings
-  xTitle: "Your App Name", // Optional, for rankings
-};
-
-const adapter = createOpenRouter(config.apiKey, config);
+import { createOpenRouterText } from "@tanstack/ai-openrouter";
+
+const adapter = createOpenRouterText(
+  "openai/gpt-5",
+  process.env.OPENROUTER_API_KEY!,
+  {
+    serverURL: "https://openrouter.ai/api/v1", // Optional
+    httpReferer: "https://your-app.com", // Optional, for rankings
+    appTitle: "Your App Name", // Optional, for rankings
+  },
+);
```

## Available Models
@@ -122,18 +123,52 @@ OpenRouter can automatically route requests to the best available provider:
```diff
 const stream = chat({
   adapter: openRouterText("openrouter/auto"),
-  messages,
-  providerOptions: {
+  messages,
+  modelOptions: {
     models: [
       "openai/gpt-4o",
       "anthropic/claude-3.5-sonnet",
       "google/gemini-pro",
     ],
     route: "fallback", // Use fallback if primary fails
   },
 });
```


## Chat Completions vs Responses (beta)

OpenRouter exposes two OpenAI-compatible wire formats, and the adapter
package ships one of each:

| Adapter | Endpoint | Status | When to use |
| -------------------------- | ------------------------- | -------- | ---------------------------------------------------------------------------- |
| `openRouterText` | `/v1/chat/completions` | Stable | Default for almost everything. Broadest model + tool support. |
| `openRouterResponsesText` | `/v1/responses` | Beta | OpenAI Responses-shaped request/response; richer multi-turn state on OpenAI-style models. |

Both adapters route to any underlying model OpenRouter supports
(`anthropic/...`, `google/...`, `meta-llama/...`, etc.) β€” the wire format
describes how your client talks to OpenRouter, not which provider answers.
`/v1/responses` is OpenAI's newer API surface; OpenRouter implements it so
clients that prefer that wire format can use it across the same 300+
model catalogue.

```typescript
import { chat } from "@tanstack/ai";
import { openRouterResponsesText } from "@tanstack/ai-openrouter";

const stream = chat({
adapter: openRouterResponsesText("anthropic/claude-sonnet-4.5"),
messages: [{ role: "user", content: "Hello!" }],
});
```

Caveats while the Responses adapter is in beta:

- Function tools are supported; OpenRouter's branded server-tools (web
search, file search, …) are not yet wired through this path β€” use
`openRouterText` if you need those.
- If in doubt, prefer `openRouterText`. The Chat Completions endpoint has
broader provider coverage and feature parity today.

## Next Steps

- [Getting Started](../getting-started/quick-start) - Learn the basics
239 changes: 28 additions & 211 deletions packages/typescript/ai-anthropic/src/adapters/summarize.ts
@@ -1,237 +1,54 @@
import { BaseSummarizeAdapter } from '@tanstack/ai/adapters'
import {
createAnthropicClient,
generateId,
getAnthropicApiKeyFromEnv,
} from '../utils'
import { ChatStreamSummarizeAdapter } from '@tanstack/ai/adapters'
import { getAnthropicApiKeyFromEnv } from '../utils'
import { AnthropicTextAdapter } from './text'
import type { InferTextProviderOptions } from '@tanstack/ai/adapters'
import type { ANTHROPIC_MODELS } from '../model-meta'
import type {
StreamChunk,
SummarizationOptions,
SummarizationResult,
} from '@tanstack/ai'
import type { AnthropicClientConfig } from '../utils'

/** Cast an event object to StreamChunk. */
const asChunk = (chunk: Record<string, unknown>) =>
chunk as unknown as StreamChunk

/**
* Configuration for Anthropic summarize adapter
*/
export interface AnthropicSummarizeConfig extends AnthropicClientConfig {}

/**
* Anthropic-specific provider options for summarization
*/
export interface AnthropicSummarizeProviderOptions {
/** Temperature for response generation (0-1) */
temperature?: number
/** Maximum tokens in the response */
maxTokens?: number
}

/** Model type for Anthropic summarization */
export type AnthropicSummarizeModel = (typeof ANTHROPIC_MODELS)[number]

/**
* Anthropic Summarize Adapter
*
* Tree-shakeable adapter for Anthropic summarization functionality.
* Import only what you need for smaller bundle sizes.
*/
export class AnthropicSummarizeAdapter<
TModel extends AnthropicSummarizeModel,
> extends BaseSummarizeAdapter<TModel, AnthropicSummarizeProviderOptions> {
readonly kind = 'summarize' as const
readonly name = 'anthropic' as const

private client: ReturnType<typeof createAnthropicClient>

constructor(config: AnthropicSummarizeConfig, model: TModel) {
super({}, model)
this.client = createAnthropicClient(config)
}

async summarize(options: SummarizationOptions): Promise<SummarizationResult> {
const { logger } = options
const systemPrompt = this.buildSummarizationPrompt(options)

logger.request(`activity=summarize provider=anthropic`, {
provider: 'anthropic',
model: options.model,
})

try {
const response = await this.client.messages.create({
model: options.model,
messages: [{ role: 'user', content: options.text }],
system: systemPrompt,
max_tokens: options.maxLength || 500,
temperature: 0.3,
stream: false,
})

const content = response.content
.map((c) => (c.type === 'text' ? c.text : ''))
.join('')

return {
id: response.id,
model: response.model,
summary: content,
usage: {
promptTokens: response.usage.input_tokens,
completionTokens: response.usage.output_tokens,
totalTokens:
response.usage.input_tokens + response.usage.output_tokens,
},
}
} catch (error) {
logger.errors('anthropic.summarize fatal', {
error,
source: 'anthropic.summarize',
})
throw error
}
}

async *summarizeStream(
options: SummarizationOptions,
): AsyncIterable<StreamChunk> {
const { logger } = options
const systemPrompt = this.buildSummarizationPrompt(options)
const id = generateId(this.name)
const model = options.model
let accumulatedContent = ''
let inputTokens = 0
let outputTokens = 0

logger.request(`activity=summarize provider=anthropic`, {
provider: 'anthropic',
model,
stream: true,
})

try {
const stream = await this.client.messages.create({
model: options.model,
messages: [{ role: 'user', content: options.text }],
system: systemPrompt,
max_tokens: options.maxLength || 500,
temperature: 0.3,
stream: true,
})

for await (const event of stream) {
logger.provider(`provider=anthropic type=${event.type}`, {
chunk: event,
})

if (event.type === 'message_start') {
inputTokens = event.message.usage.input_tokens
} else if (event.type === 'content_block_delta') {
if (event.delta.type === 'text_delta') {
const delta = event.delta.text
accumulatedContent += delta
yield asChunk({
type: 'TEXT_MESSAGE_CONTENT',
messageId: id,
model,
timestamp: Date.now(),
delta,
content: accumulatedContent,
})
}
} else if (event.type === 'message_delta') {
outputTokens = event.usage.output_tokens
yield asChunk({
type: 'RUN_FINISHED',
runId: id,
model,
timestamp: Date.now(),
finishReason: event.delta.stop_reason as
| 'stop'
| 'length'
| 'content_filter'
| null,
usage: {
promptTokens: inputTokens,
completionTokens: outputTokens,
totalTokens: inputTokens + outputTokens,
},
})
}
}
} catch (error) {
logger.errors('anthropic.summarize fatal', {
error,
source: 'anthropic.summarize',
})
throw error
}
}

private buildSummarizationPrompt(options: SummarizationOptions): string {
let prompt = 'You are a professional summarizer. '

switch (options.style) {
case 'bullet-points':
prompt += 'Provide a summary in bullet point format. '
break
case 'paragraph':
prompt += 'Provide a summary in paragraph format. '
break
case 'concise':
prompt += 'Provide a very concise summary in 1-2 sentences. '
break
default:
prompt += 'Provide a clear and concise summary. '
}

if (options.focus && options.focus.length > 0) {
prompt += `Focus on the following aspects: ${options.focus.join(', ')}. `
}

if (options.maxLength) {
prompt += `Keep the summary under ${options.maxLength} tokens. `
}

return prompt
}
}

/**
* Creates an Anthropic summarize adapter with explicit API key.
* Type resolution happens here at the call site.
*
* @param model - The model name (e.g., 'claude-sonnet-4-5', 'claude-3-5-haiku-latest')
* @param apiKey - Your Anthropic API key
* @param config - Optional additional configuration
* @returns Configured Anthropic summarize adapter instance with resolved types
* @example
* ```typescript
* const adapter = createAnthropicSummarize('claude-sonnet-4-5', 'sk-ant-...');
* ```
*/
export function createAnthropicSummarize<
TModel extends AnthropicSummarizeModel,
>(
model: TModel,
apiKey: string,
config?: Omit<AnthropicSummarizeConfig, 'apiKey'>,
): AnthropicSummarizeAdapter<TModel> {
return new AnthropicSummarizeAdapter({ apiKey, ...config }, model)
): ChatStreamSummarizeAdapter<
TModel,
InferTextProviderOptions<AnthropicTextAdapter<TModel>>
> {
return new ChatStreamSummarizeAdapter(
new AnthropicTextAdapter({ apiKey, ...config }, model),
model,
'anthropic',
)
}

/**
* Creates an Anthropic summarize adapter with automatic API key detection.
* Type resolution happens here at the call site.
* Creates an Anthropic summarize adapter with API key from `ANTHROPIC_API_KEY`.
*
* @param model - The model name (e.g., 'claude-sonnet-4-5', 'claude-3-5-haiku-latest')
* @param config - Optional configuration (excluding apiKey which is auto-detected)
* @returns Configured Anthropic summarize adapter instance with resolved types
* @example
* ```typescript
* const adapter = anthropicSummarize('claude-sonnet-4-5');
* await summarize({ adapter, text: 'Long article text...' });
* ```
*/
export function anthropicSummarize<TModel extends AnthropicSummarizeModel>(
model: TModel,
config?: Omit<AnthropicSummarizeConfig, 'apiKey'>,
): AnthropicSummarizeAdapter<TModel> {
const apiKey = getAnthropicApiKeyFromEnv()
return createAnthropicSummarize(model, apiKey, config)
): ChatStreamSummarizeAdapter<
TModel,
InferTextProviderOptions<AnthropicTextAdapter<TModel>>
> {
return createAnthropicSummarize(model, getAnthropicApiKeyFromEnv(), config)
}