Workers AI Model Deprecations - May 2026
Key Points
- 17 models deprecated on May 30, 2026
- Kimi K2.5 auto-aliases to K2.6 with higher pricing
- Fast and LoRA variants remain active
Summary
Cloudflare is refreshing the Workers AI model catalog by deprecating older models on May 30, 2026. Applications must be updated to remove references to deprecated models and migrate to recommended replacements before the deadline.
Key Points
- Deprecation Date: May 30, 2026 for most models; Kimi K2.5 extended from May 10 to May 30, 2026
- Automatic Aliasing: Kimi K2.5 requests will automatically route to Kimi K2.6 (note: higher pricing)
- Recommended Replacements:
- GLM-4.7-Flash: fast multilingual model with tool calling and coding
- Gemma-4-26b: efficient open model with vision and tool calling
- Kimi K2.6: capable tool-calling and vision model for agentic workloads
- Active Variants: Fast and LoRA variants remain active (e.g., llama-3.3-70b-instruct-fp8-fast, gemma-7b-it-lora)
- Affected Models: 17 base models deprecated including Llama 3/3.1, Mistral, Gemma, Phi-2, and others
- Action Required: Review pricing and capabilities of replacement models before May 30, 2026
Resources
Refer to the Workers AI model catalog and pricing page for complete details.