Overview
Support for Cerebras API which uses custom hardware for super fast inference. Cerebras provides Llama models. Inference feature compatibility:- tool calling (supported)
- native JSON object response_format (supported)
- native JSON schema response_format (supported)
- Instructor markdown-JSON fallback (fallback)