Overview
Polyglot is a PHP library that provides a unified API for interacting with various Large Language Model (LLM) providers. It allows developers to build applications that use LLMs without being locked into a specific provider or having to rewrite code when switching between providers.
The core philosophy behind Polyglot is to create a consistent, provider-agnostic interface that abstracts away the differences between LLM APIs, while still allowing access to provider-specific features when needed. This enables developers to:
- Write code once and use it with any supported LLM provider
- Easily switch between providers without changing application code
- Use different providers in different environments (development, testing, production)
- Fall back to alternative providers if one becomes unavailable
Polyglot was developed as part of the Instructor for PHP library, which focuses on structured outputs from LLMs, but it can also be used as a standalone library for general LLM interactions.
Key Features
Unified API
Polyglot’s primary feature is its unified API that works across multiple LLM providers:
- Consistent interface for making inference requests
- Common message format across all providers
- Standardized response handling
- Unified error handling
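For example, the same request can be pointed at different providers by changing a single preset name. This is a minimal sketch assuming the library's fluent `Inference` API (`using()`, `withMessages()`, `get()`); exact class and method names may differ between versions.

```php
<?php
use Cognesy\Polyglot\Inference\Inference;

$messages = [
    ['role' => 'user', 'content' => 'What is the capital of France?'],
];

// Identical call shape for two different providers --
// only the preset name passed to using() changes.
$fromOpenAI = (new Inference)->using('openai')->withMessages($messages)->get();
$fromClaude = (new Inference)->using('anthropic')->withMessages($messages)->get();
```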
Comprehensive Provider Support
Polyglot supports a wide range of LLM providers, including:
- OpenAI (GPT models)
- Anthropic (Claude models)
- Google Gemini (native and OpenAI-compatible)
- Mistral AI
- Azure OpenAI
- Cohere
- And many others (see full list below)
Multiple Interaction Modes
Polyglot supports various modes of interaction with LLMs:
- Text mode: Simple text completion/chat
- JSON mode: Structured JSON responses
- JSON Schema mode: Responses validated against a schema
- Tools mode: Function/tool calling for task execution
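Selecting a mode might look like the sketch below; the `OutputMode` enum and the `withOutputMode()` method are assumptions based on the library's fluent style, so verify the exact names against the current docs.

```php
<?php
use Cognesy\Polyglot\Inference\Inference;
use Cognesy\Polyglot\Inference\Enums\OutputMode; // assumed namespace

// Ask for a structured JSON answer instead of free-form text.
$json = (new Inference)
    ->using('openai')
    ->withMessages([
        ['role' => 'user', 'content' => 'List the 3 largest cities in France as JSON.'],
    ])
    ->withOutputMode(OutputMode::Json) // assumed enum case
    ->get();

echo $json;
```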
Streaming Support
Real-time streaming of responses is supported across compatible providers:
- Token-by-token streaming
- Progress handling
- Partial response accumulation
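A streaming call might be sketched as follows, assuming a `withStreaming()` toggle, a `stream()` accessor that yields partial responses, and a `contentDelta` field holding each new chunk (all three are assumptions to check against the docs):

```php
<?php
use Cognesy\Polyglot\Inference\Inference;

// Print tokens as they arrive instead of waiting for the full response.
$stream = (new Inference)
    ->using('openai')
    ->withMessages([
        ['role' => 'user', 'content' => 'Write a short poem about the sea.'],
    ])
    ->withStreaming()
    ->stream();

foreach ($stream->responses() as $partial) {
    echo $partial->contentDelta; // newly received text only
}
```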
Embeddings Generation
Beyond text generation, Polyglot includes support for vector embeddings:
- Generate embeddings from text
- Support for multiple embedding providers
- Utilities for finding similar documents
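Generating a vector might look like this sketch; the `Embeddings` class is modeled on the inference API shown above, and the `with(input: ...)` signature is an assumption:

```php
<?php
use Cognesy\Polyglot\Embeddings\Embeddings;

// Embed a document so it can be indexed for similarity search.
$result = (new Embeddings)
    ->using('openai')
    ->with(input: 'The quick brown fox jumps over the lazy dog.') // assumed signature
    ->get();
```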
Configuration Flexibility
Polyglot offers a flexible configuration system:
- Configure multiple providers simultaneously
- Environment-based configuration
- Runtime provider switching
- Per-request customization
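One pattern this enables is picking a provider preset per environment. The sketch below is hypothetical: the `APP_ENV` variable and the preset-to-environment mapping are illustrative choices, not library defaults.

```php
<?php
use Cognesy\Polyglot\Inference\Inference;

// Hypothetical: choose a provider preset based on the runtime environment.
$preset = match (getenv('APP_ENV') ?: 'development') {
    'production' => 'openai',  // hosted, high-quality models in production
    'staging'    => 'mistral', // cheaper hosted models for staging
    default      => 'ollama',  // local models during development
};

$answer = (new Inference)
    ->using($preset)
    ->withMessages([['role' => 'user', 'content' => 'Ping?']])
    ->get();
```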
Middleware and Extensibility
The library is built with extensibility in mind:
- HTTP client middleware for customization
- Event system for request/response monitoring
- Ability to add custom providers
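To illustrate the middleware idea, here is a PSR-18 style logging decorator. This shows the general pattern only; Polyglot's actual middleware contract may differ, so treat it as a sketch rather than the library's interface.

```php
<?php
use Psr\Http\Client\ClientInterface;
use Psr\Http\Message\RequestInterface;
use Psr\Http\Message\ResponseInterface;

// Illustrative decorator: logs every LLM HTTP round-trip.
final class LoggingHttpClient implements ClientInterface
{
    public function __construct(private ClientInterface $inner) {}

    public function sendRequest(RequestInterface $request): ResponseInterface
    {
        error_log('LLM request: ' . $request->getMethod() . ' ' . $request->getUri());
        $response = $this->inner->sendRequest($request);
        error_log('LLM response: HTTP ' . $response->getStatusCode());
        return $response;
    }
}
```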
Supported LLM Providers
Polyglot currently supports the following LLM providers for chat completion:
- AI21: API access to Jamba models
- Anthropic: Claude family of models
- Microsoft Azure: Azure-hosted OpenAI models
- Cerebras: Cerebras LLMs
- Cohere: Command models (both native and OpenAI-compatible interfaces)
- Deepseek: Deepseek models including reasoning capabilities
- Google Gemini: Google’s Gemini models (both native and OpenAI-compatible)
- Groq: High-performance inference platform
- Minimaxi: MiniMax models
- Mistral: Mistral AI models
- Moonshot: Kimi models
- Ollama: Self-hosted open-source models
- OpenAI: GPT models family
- OpenRouter: Multi-provider routing service
- Perplexity: Perplexity models
- SambaNova: SambaNova hosted models
- Together: Together AI hosted models
- xAI: xAI’s Grok models
Supported Embeddings Providers
For embeddings generation, Polyglot supports:
- Microsoft Azure: Azure-hosted OpenAI embeddings
- Cohere: Cohere embeddings models
- Google Gemini: Google’s embedding models
- Jina: Jina embeddings
- Mistral: Mistral embedding models
- Ollama: Self-hosted embedding models
- OpenAI: OpenAI embeddings
Use Cases
Polyglot is ideal for a variety of use cases:
- Applications requiring LLM provider flexibility: Switch between providers based on cost, performance, or feature needs
- Multi-environment deployments: Use different LLM providers in development, staging, and production
- Redundancy and fallback: Implement fallback strategies when a provider is unavailable (see the sketch after this list)
- Hybrid approaches: Combine different providers for different tasks based on their strengths
- Local + cloud development: Use local models (via Ollama) for development and cloud providers for production
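A minimal fallback loop might look like the following sketch. The `askWithFallback()` helper is hypothetical, and it reuses the assumed fluent `Inference` API from the examples above; error handling is simplified to catching `Throwable`.

```php
<?php
use Cognesy\Polyglot\Inference\Inference;

// Hypothetical helper: try provider presets in order until one answers.
function askWithFallback(string $question, array $presets): string
{
    foreach ($presets as $preset) {
        try {
            return (new Inference)
                ->using($preset)
                ->withMessages([['role' => 'user', 'content' => $question]])
                ->get();
        } catch (\Throwable $e) {
            continue; // provider failed or unavailable -- try the next one
        }
    }
    throw new \RuntimeException('All providers failed.');
}

echo askWithFallback('What is 2 + 2?', ['openai', 'anthropic', 'ollama']);
```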
Getting Started
To start using Polyglot, you’ll need to:
- Install the library via Composer
- Configure your LLM provider credentials
- Create your first inference request
Basic example:
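(A minimal sketch; the package name in the install comment and the `Inference` API are assumptions to verify against the current documentation.)

```php
<?php
// Install first, e.g.: composer require cognesy/polyglot
// (package name is an assumption -- check the installation docs)

use Cognesy\Polyglot\Inference\Inference;

// Requires a provider credential, e.g. OPENAI_API_KEY, in your environment.
$answer = (new Inference)
    ->using('openai')
    ->withMessages([
        ['role' => 'user', 'content' => 'Hello! What can you do?'],
    ])
    ->get();

echo $answer;
```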