AI Models
The three AI tiers of Pollo Assistance and the multi-AI brain system.
How it works: the Brain Orchestrator
Pollo Assistance uses a multi-AI architecture. Instead of being locked to a single model, each tier (Palos, Luces, Summum) has a brain orchestrator — an AI that analyzes every message you send and decides which AI from its pool is best suited to answer it.
The brain also decides automatically when to:
- Search the web — when it needs up-to-date information, it performs a live search and shows a Sources button with the pages consulted.
- Draw an interactive diagram — complex explanations are rendered as interactive mind maps inside the chat.
- Activate think mode — for difficult tasks, the brain shows a collapsible panel with its internal reasoning before the response (Luces and Summum only).
- Open a chess game — when you ask to play chess, a playable board appears inline in the chat.
- Generate images — the brain can create images from text descriptions using Pollinations AI.
Important: Once you send the first message in a chat, the tier is locked for that conversation. If you want to switch tiers, create a new chat.
Available AIs in the Pool
The brain orchestrator picks from the following AI models depending on what each message needs:
| AI Model | Provider | Parameters | Speed | Best for |
|---|---|---|---|---|
| Llama 3.1 8B | Meta | 8B | Very fast | Quick chat, translations, simple questions |
| Llama 3.3 70B | Meta | 70B | Fast | Code, analysis, math, reasoning, creative writing |
| Llama 4 Scout | Meta | 17B (16 experts) | Fast | Vision (image understanding), multilingual, code, reasoning |
| Qwen 3 32B | Alibaba | 32B | Medium | Code, math, science, multilingual, reasoning |
| GPT OSS 20B | OpenAI | 20B | Medium | General chat, creative writing, analysis |
| GPT OSS 120B | OpenAI | 120B | Slower | Deep analysis, complex reasoning, science, code |
| Kimi K2 | Moonshot | Large | Slower | Deep analysis, complex reasoning, math, science |
All models run through the Groq infrastructure, which provides very low response times thanks to its specialized hardware (LPU — Language Processing Unit).
Image AI
In addition to text models, all tiers have access to:
- Pollinations AI — generates images from text descriptions (text-to-image).
- Llama 4 Scout (Vision) — analyzes images you upload and answers questions about them.
Pollo Palos v1.0
| Brain model | Llama 3.1 8B |
|---|---|
| AI pool | Llama 3.1 8B |
| Availability | Free |
| Max tokens | ~600 per response |
| Streaming | No |
| Features | Chat, voice |
| Best for | Quick responses and simple questions |
Pollo Palos is the lightest and fastest tier. Its brain and pool consist of a single fast model (Llama 3.1 8B with 8 billion parameters), delivering near-instant responses. Ideal for short questions, quick translations, simple calculations, or anything that doesn't require deep analysis.
Because it uses a smaller model, responses are more concise and it may struggle with very complex tasks. It's the perfect choice when you need something fast and to the point.
Pollo Luces v1.0
| Brain model | Llama 3.3 70B |
|---|---|
| AI pool | Llama 8B, Llama 70B, Llama 4 Scout, Qwen 3 32B, GPT OSS 20B |
| Availability | Free |
| Max tokens | Up to 2,000 per response (depends on AI selected) |
| Streaming | Yes (progressive response) |
| Features | Chat, voice, images, code, chess |
| Best for | Daily use, normal conversations |
Pollo Luces is the recommended tier for daily use. Its brain (Llama 3.3 70B) analyzes each message and picks the best AI from a pool of 5 models. Need a quick answer? The brain routes to the fast 8B model. Writing code? It picks Qwen or Llama 70B. Analyzing an image? Llama 4 Scout takes over.
It supports streaming (the response appears word by word), think mode (shows internal reasoning for hard tasks), web search, diagram generation, and image generation. The best option for most everyday uses.
Pollo Summum v1.0
| Brain model | Llama 3.3 70B |
|---|---|
| AI pool | Llama 8B, Llama 70B, Llama 4 Scout, Qwen 3 32B, GPT OSS 20B, GPT OSS 120B, Kimi K2 |
| Availability | Premium only |
| Max tokens | Up to 2,500 per response (depends on AI selected) |
| Streaming | Yes (progressive response) |
| Features | Chat, voice, images, code, chess, file analysis, chat memory |
| Best for | Deep analysis, complex tasks |
Pollo Summum is the most powerful tier, exclusive to premium users. It has access to all 7 AI models in the pool, including the heavyweight GPT OSS 120B and Kimi K2 — the most capable models for deep reasoning, science, and complex problem-solving.
Summum also unlocks exclusive features: file/document analysis (Pollo FILES integration) and chat memory (the AI remembers context from previous conversations). It reviews its own approach on every single message, ensuring maximum quality.
Tier Comparison
| Feature | Palos | Luces | Summum |
|---|---|---|---|
| Brain model | Llama 3.1 8B | Llama 3.3 70B | Llama 3.3 70B |
| AI pool size | 1 AI | 5 AIs | 7 AIs (all) |
| Max tokens/response | ~600 | Up to 2,000 | Up to 2,500 |
| Streaming | No | Yes | Yes |
| Speed | Very fast | Fast | Fast |
| Think mode | No | Yes | Yes |
| Web search | No | Yes | Yes |
| Image generation | No | Yes | Yes |
| Vision (image analysis) | No | Yes | Yes |
| Chess | No | Yes | Yes |
| File analysis | No | No | Yes |
| Chat memory | No | No | Yes |
| Availability | Free | Free | Premium |
Which one to choose
Palos if you need something fast and simple. Luces for daily use — it's the recommended option with access to 5 AIs and all core features. Summum when you need maximum depth and power, with access to all 7 AIs plus exclusive features like file analysis and chat memory (requires premium).