AI Providers

Overview

AI Puffer uses a model. Add your provider credentials, sync models, then choose the provider and model inside each module.

When you purchase AI Puffer Pro, you get access to additional plugin features. API usage is separate. You still need your own provider account and billing setup for OpenAI, Google, Anthropic, xAI, or any other provider you connect. AI Puffer does not include API credits.

OpenAI

Text, images, embeddings, speech, web, and realtime.

Google

Gemini, images, video, embeddings, TTS, and grounding.

Anthropic

Text, web search, and supported image analysis.

OpenRouter

Access many text, image, embedding, and web-capable models.

Azure

Azure OpenAI deployments for text, images, embeddings, and speech.

DeepSeek

Text generation for chat, writing, forms, and automations.

xAI

Grok text models, web search, supported image analysis, and image generation.

Ollama

Local or self-hosted text and embedding models.

AI providers power text, image, audio, , and retrieval features.

Provider	Text	Images	Embeddings	Audio	Video	Web	Realtime
OpenAI	✓	✓	✓	TTS + STT	-	✓	✓
Google	✓	✓	✓	TTS	✓	✓	-
Anthropic	✓	-	-	-	-	✓	-
OpenRouter	✓	✓	✓	-	-	✓	-
Azure	✓	✓	✓	STT	-	-	-
DeepSeek	✓	-	-	-	-	-	-
xAI	✓	✓	-	-	-	✓	-
Ollama	✓	-	✓	-	-	-	-

✓ means supported. - means not supported in the current provider strategy. AI Puffer > Settings is the central area for managing your connections to different AI providers.

OpenAI

OpenAI supports the widest set of AI Puffer features.

Open AI Puffer > Settings > AI.
Select OpenAI.
Paste your OpenAI API key.
Keep the default base URL unless you use a compatible custom endpoint.
Open Advanced and click Sync Models.
Select the model you want to use by default.

For knowledge features, OpenAI embedding models are available in embedding model selectors. OpenAI Vector Stores are created and managed in Knowledge Base > Stores.

Google

Google provides Gemini, embeddings, image/video models, text to speech, and grounding.

Open AI Puffer > Settings > AI.
Select Google.
Paste your Google API key.
Open Advanced and click Sync Models.
Select the default model.
Review Google safety settings if you need to change blocking thresholds.

Google image and video model settings are configured in Images.

Google safety settings can block some responses. Change them only when the default behavior is too restrictive for your site.

Anthropic

Anthropic is available for text workflows, web search, and supported Claude image analysis models.

Open AI Puffer > Settings > AI.
Select Anthropic.
Paste your Anthropic API key.
Open Advanced and click Sync Models.
Select the default model.

Anthropic does not provide embeddings through the current AI Puffer provider strategy.

OpenRouter

OpenRouter gives access to models available in your OpenRouter account.

Open AI Puffer > Settings > AI.
Select OpenRouter.
Paste your OpenRouter API key.
Open Advanced and click Sync Models.
Select the default model.

Some capabilities depend on the selected model.

OpenRouter support depends on the model you select. Check the model capabilities before using image, embedding, or web features.

Azure

Azure uses your Azure OpenAI .

Open AI Puffer > Settings > AI.
Select Azure.
Paste your Azure API key.
Enter the Azure endpoint URL for your resource.
Open Advanced.
Check the API versions if your Azure resource requires different versions.
Click Sync Models to load deployments.
Select the deployment to use as the default model.

Azure uses deployment names. If a deployment does not appear, confirm it exists in Azure and that the endpoint and API key belong to the same resource.

DeepSeek

DeepSeek is available for text workflows.

Open AI Puffer > Settings > AI.
Select DeepSeek.
Paste your DeepSeek API key.
Open Advanced and click Sync Models.
Select the default model.

DeepSeek embeddings are not supported by the current provider strategy. Use OpenAI, Google, Azure, OpenRouter, or Ollama for embedding workflows.

xAI

xAI is available for text workflows, Grok models, web search, supported image analysis models, and image generation.

Open AI Puffer > Settings > AI.
Select xAI.
Paste your xAI API key.
Keep the default base URL unless you use a compatible xAI endpoint.
Keep the API version as v1.
Open Advanced and click Sync Models.
Select the default text model.

AI Puffer uses xAI’s Responses API for text generation and streaming. Legacy completions and chat completions are not used by this provider integration. xAI image generation and image editing use xAI image models such as grok-imagine-image. xAI image understanding uses Grok language models that support image input. xAI is not available as an embedding provider, vector store provider, video provider, speech provider, or realtime voice provider in the current integration.

Ollama

Ollama connects local or self-hosted models.

Install Ollama from ollama.com/download.
Run Ollama on the computer or server you want to use as the AI server.
Pull a model:

ollama pull llama3

Open AI Puffer > Settings > AI.
Select Ollama.
Enter the Ollama base URL. The default is http://localhost:11434.
Click Sync Models.
Select the default model.

Install notes:

System	Notes
macOS	Download the macOS ZIP, move Ollama to Applications, then run the app.
Windows	Run the Windows installer. Ollama runs in the background after installation.
Linux	Run `curl -fsSL https://ollama.com/install.sh \| sh`. The installer sets up Ollama as a service.

You can pull more than one model. AI Puffer shows synced Ollama models after Sync Models runs.

If WordPress and Ollama are on different servers, do not use localhost unless Ollama is running on the same server as WordPress. Use the reachable server URL instead.

Do not expose Ollama publicly without access controls. Anyone who can reach the Ollama server can send model requests to it.

Troubleshooting

Quota exceeded

The provider account is out of credits, has reached a spend limit, or is blocked by billing settings.Check these items:

Open the provider billing page.
Check credits, usage limits, and monthly spend limits.
Add credits or update billing if needed.
If you just changed billing, wait a few minutes and try again.

For OpenAI, see error codes.

Incorrect API key

The saved key is wrong, old, revoked, copied with extra spaces, or belongs to a different account or organization.Check these items:

Create or copy a fresh API key from the provider dashboard.
Paste it again in AI Puffer > Settings > AI.
Save the setting.
Sync models again.

For OpenAI, see Incorrect API key provided.

Model is missing

AI Puffer syncs models when you switch providers, but the list can be out of date.Check these items:

Open AI Puffer > Settings > AI.
Select the provider.
Open Advanced.
Click Sync Models.
Open Settings > Others.
Click Clear Model Cache.
Click Clear Transients.
Click Sync All.
Return to Settings > AI and check the model list again.

If the model still does not appear, confirm the provider account has access to that model.

Organization must be verified

Some OpenAI models require a verified API organization.Check these items:

Open OpenAI Platform settings.
Go to Organization > General.
Complete organization verification.
Wait up to 30 minutes.
Generate a new API key if the error continues.
Make sure AI Puffer is using a key from the verified organization.

For OpenAI, see API Organization Verification.

Unexpected JSON error

A WordPress security plugin or firewall may be blocking AI Puffer settings requests.Check these items:

Check your security plugin or firewall logs.
Whitelist AI Puffer admin requests if your tool supports allow rules.
If using Wordfence or a similar firewall, switch to learning mode.
Save AI Puffer settings and use the affected module a few times.
Switch the firewall back to normal mode after it learns the requests.

Overview

Getting Started

Setup

Modules

Utilities

Developers

Overview

OpenAI

Google

Anthropic

OpenRouter

Azure

DeepSeek

xAI

Ollama

OpenAI

Google

Anthropic

OpenRouter

Azure

DeepSeek

xAI

Ollama

Troubleshooting

Overview

Getting Started

Setup

Modules

Utilities

Developers

Documentation Index

​Overview

OpenAI

Google

Anthropic

OpenRouter

Azure

DeepSeek

xAI

Ollama

​OpenAI

​Google

​Anthropic

​OpenRouter

​Azure

​DeepSeek

​xAI

​Ollama

​Troubleshooting

Overview

OpenAI

Google

Anthropic

OpenRouter

Azure

DeepSeek

xAI

Ollama

Troubleshooting