Commit Graph

2 Commits

Author SHA1 Message Date
Charles-Henri ROBICHE
c50d5c7f34
fix(litellm): enable prompt caching for Anthropic models
- Add LiteLLM + Claude model detection to isCacheTtlEligibleProvider
- Reduces cost by 90% for Claude models through LiteLLM proxy
- Add test coverage for cache eligibility detection
- Document prompt caching behavior and cost savings

Before: $0.47 per message (no caching)
After: $0.05 per message (90% cached)

Closes #2683
2026-01-28 23:44:44 +01:00
Charles-Henri ROBICHE
efd827b526
feat(providers): add LiteLLM provider support
Add LiteLLM as a new OpenAI-compatible proxy provider:

- Add onboarding flow with API key, base URL, and model selection
- Fetch available models from LiteLLM /v1/models endpoint
- Auto-detect context window from /model/info endpoint
- Set supportsStore: false to avoid "Extra inputs are not permitted" errors
  with providers that don't support the OpenAI Responses API store parameter
- Preserve compat settings through model resolution pipeline
- Add provider documentation

Closes #2639
Closes #2305

Co-Authored-By: Claude <noreply@anthropic.com>
2026-01-28 08:55:48 +01:00