- Add LiteLLM + Claude model detection to isCacheTtlEligibleProvider
- Reduces cost by 90% for Claude models through LiteLLM proxy
- Add test coverage for cache eligibility detection
- Document prompt caching behavior and cost savings
Before: $0.47 per message (no caching)
After: $0.05 per message (90% cached)
Closes#2683
Add LiteLLM as a new OpenAI-compatible proxy provider:
- Add onboarding flow with API key, base URL, and model selection
- Fetch available models from LiteLLM /v1/models endpoint
- Auto-detect context window from /model/info endpoint
- Set supportsStore: false to avoid "Extra inputs are not permitted" errors
with providers that don't support the OpenAI Responses API store parameter
- Preserve compat settings through model resolution pipeline
- Add provider documentation
Closes#2639Closes#2305
Co-Authored-By: Claude <noreply@anthropic.com>