Z.AI Model Config
Z.AI Model Config
Hermes model hierarchy and fallback chain for Z.AI provider.
Current Hierarchy
- glm-5.2 (primary)
- glm-5.1 (fallback 1)
- glm-5-turbo (fallback 2)
- glm-4.7 (fallback 3)
- glm-4.5-air (fallback 4)
- gemini (fallback 5)
- cerebras (fallback 6)
- groq (fallback 7)
Known Quirks
- GLM 5.2 reasoning overhead: Uses 1000-2000 reasoning_tokens internally before output.
max_tokensmust be 8000+ for essay-length content or output comes back empty. - Vision fallback: glm-4.5v
- Bare model names only: No
[1m]suffix
API Endpoint
https://api.z.ai/api/coding/paas/v4
Related
- Hermes Infra
- Morning Brief — hit fallback issues when glm-5.2/5.1 rate-limited