AI Models

The three AI tiers of Pollo Assistance and the multi-AI brain system.

How it works: the Brain Orchestrator

Pollo Assistance uses a multi-AI architecture. Instead of being locked to a single model, each tier (Palos, Luces, Summum) has a brain orchestrator — an AI that analyzes every message you send and decides which AI from its pool is best suited to answer it.

The brain also decides automatically when to:

Important: Once you send the first message in a chat, the tier is locked for that conversation. If you want to switch tiers, create a new chat.
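
The locking rule can be pictured with a small sketch. This is only an illustration: the `Chat` class, its `set_tier` and `send` methods, and the lowercase tier identifiers are hypothetical names, not part of the actual Pollo Assistance code.

```python
from dataclasses import dataclass, field

TIERS = ("palos", "luces", "summum")


@dataclass
class Chat:
    """Hypothetical chat object; not the real Pollo Assistance data model."""
    tier: str = "luces"
    messages: list = field(default_factory=list)

    def set_tier(self, tier: str) -> None:
        if tier not in TIERS:
            raise ValueError(f"unknown tier: {tier}")
        if self.messages:
            # The tier is locked once the first message has been sent.
            raise RuntimeError("tier is locked; create a new chat to switch")
        self.tier = tier

    def send(self, text: str) -> None:
        self.messages.append(text)
```

Under these assumptions, switching tiers works freely on an empty chat and fails after the first `send`, which mirrors the "create a new chat" advice.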

Available AIs in the Pool

The brain orchestrator picks from the following AI models depending on what each message needs:

| AI Model | Provider | Parameters | Speed | Best for |
| --- | --- | --- | --- | --- |
| Llama 3.1 8B | Meta | 8B | Very fast | Quick chat, translations, simple questions |
| Llama 3.3 70B | Meta | 70B | Fast | Code, analysis, math, reasoning, creative writing |
| Llama 4 Scout | Meta | 17B (16 experts) | Fast | Vision (image understanding), multilingual, code, reasoning |
| Qwen 3 32B | Alibaba | 32B | Medium | Code, math, science, multilingual, reasoning |
| GPT OSS 20B | OpenAI | 20B | Medium | General chat, creative writing, analysis |
| GPT OSS 120B | OpenAI | 120B | Slower | Deep analysis, complex reasoning, science, code |
| Kimi K2 | Moonshot | Large | Slower | Deep analysis, complex reasoning, math, science |

All models run through the Groq infrastructure, which provides very low response times thanks to its specialized hardware (LPU — Language Processing Unit).
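
To make the pool easier to reason about, the table of models can be mirrored as plain data. The sketch below copies the names, providers, speeds, and "Best for" entries from the table; the `Model` type, the `POOL` list, and the `candidates` helper are hypothetical and only illustrate how a lookup by task might work.

```python
from typing import NamedTuple


class Model(NamedTuple):
    name: str
    provider: str
    speed: str
    best_for: frozenset


# Rows copied from the pool table; the data structure itself is an assumption.
POOL = [
    Model("Llama 3.1 8B", "Meta", "Very fast",
          frozenset({"quick chat", "translations", "simple questions"})),
    Model("Llama 3.3 70B", "Meta", "Fast",
          frozenset({"code", "analysis", "math", "reasoning", "creative writing"})),
    Model("Llama 4 Scout", "Meta", "Fast",
          frozenset({"vision", "multilingual", "code", "reasoning"})),
    Model("Qwen 3 32B", "Alibaba", "Medium",
          frozenset({"code", "math", "science", "multilingual", "reasoning"})),
    Model("GPT OSS 20B", "OpenAI", "Medium",
          frozenset({"general chat", "creative writing", "analysis"})),
    Model("GPT OSS 120B", "OpenAI", "Slower",
          frozenset({"deep analysis", "complex reasoning", "science", "code"})),
    Model("Kimi K2", "Moonshot", "Slower",
          frozenset({"deep analysis", "complex reasoning", "math", "science"})),
]


def candidates(task: str) -> list[str]:
    """Return the names of pool models whose 'Best for' column lists the task."""
    return [m.name for m in POOL if task in m.best_for]
```

For example, `candidates("vision")` yields only Llama 4 Scout, while `candidates("science")` yields Qwen 3 32B, GPT OSS 120B, and Kimi K2.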

Image AI

In addition to text models, all tiers have access to:


Pollo Palos v1.0

Brain model: Llama 3.1 8B
AI pool: Llama 3.1 8B
Availability: Free
Max tokens: ~600 per response
Streaming: No
Features: Chat, voice
Best for: Quick responses and simple questions

Pollo Palos is the lightest and fastest tier. Its brain and pool consist of a single fast model (Llama 3.1 8B, with 8 billion parameters), which delivers near-instant responses. It is ideal for short questions, quick translations, simple calculations, or anything else that doesn't require deep analysis.

Because it uses a smaller model, responses are more concise and it may struggle with very complex tasks. It's the perfect choice when you need something fast and to the point.

Pollo Luces v1.0

Brain model: Llama 3.3 70B
AI pool: Llama 8B, Llama 70B, Llama 4 Scout, Qwen 3 32B, GPT OSS 20B
Availability: Free
Max tokens: Up to 2,000 per response (depends on AI selected)
Streaming: Yes (progressive response)
Features: Chat, voice, images, code, chess
Best for: Daily use, normal conversations

Pollo Luces is the recommended tier for daily use. Its brain (Llama 3.3 70B) analyzes each message and picks the best AI from a pool of 5 models. Need a quick answer? The brain routes to the fast 8B model. Writing code? It picks Qwen or Llama 70B. Analyzing an image? Llama 4 Scout takes over.
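
The routing examples in the previous paragraph can be sketched as a toy function. This is not how the brain is implemented: the real orchestrator is itself an AI (Llama 3.3 70B), whereas this sketch swaps in simple keyword rules as a stand-in. The function name `route_luces`, the keyword list, and the 12-word threshold are all assumptions made for illustration.

```python
CODE_HINTS = ("code", "function", "bug", "script")  # hypothetical keyword list


def route_luces(message: str, has_image: bool = False) -> str:
    """Toy stand-in for the Luces brain: keyword rules instead of an LLM call."""
    text = message.lower()
    if has_image:
        # Image understanding goes to the vision-capable model.
        return "Llama 4 Scout"
    if any(hint in text for hint in CODE_HINTS):
        # Coding requests go to one of the code-oriented models.
        return "Qwen 3 32B"
    if len(text.split()) <= 12:
        # Short, simple messages go to the fast 8B model (threshold is arbitrary).
        return "Llama 3.1 8B"
    # Everything else falls back to the larger general-purpose model.
    return "Llama 3.3 70B"
```

A production router would presumably weigh many more signals; the point here is only the shape of the decision, one model chosen per message.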

It supports streaming (the response appears word by word), think mode (shows internal reasoning for hard tasks), web search, diagram generation, and image generation. It is the best option for most everyday uses.

Pollo Summum v1.0

Brain model: Llama 3.3 70B
AI pool: Llama 8B, Llama 70B, Llama 4 Scout, Qwen 3 32B, GPT OSS 20B, GPT OSS 120B, Kimi K2
Availability: Premium only
Max tokens: Up to 2,500 per response (depends on AI selected)
Streaming: Yes (progressive response)
Features: Chat, voice, images, code, chess, file analysis, chat memory
Best for: Deep analysis, complex tasks

Pollo Summum is the most powerful tier, exclusive to premium users. It has access to all 7 AI models in the pool, including the heavyweight GPT OSS 120B and Kimi K2 — the most capable models for deep reasoning, science, and complex problem-solving.

Summum also unlocks exclusive features: file/document analysis (Pollo FILES integration) and chat memory (the AI remembers context from previous conversations). It also reviews its own approach on every message, with the aim of keeping response quality as high as possible.

Tier Comparison

| Feature | Palos | Luces | Summum |
| --- | --- | --- | --- |
| Brain model | Llama 3.1 8B | Llama 3.3 70B | Llama 3.3 70B |
| AI pool size | 1 AI | 5 AIs | 7 AIs (all) |
| Max tokens/response | ~600 | Up to 2,000 | Up to 2,500 |
| Streaming | No | Yes | Yes |
| Speed | Very fast | Fast | Fast |
| Think mode | No | Yes | Yes |
| Web search | No | Yes | Yes |
| Image generation | No | Yes | Yes |
| Vision (image analysis) | No | Yes | Yes |
| Chess | No | Yes | Yes |
| File analysis | No | No | Yes |
| Chat memory | No | No | Yes |
| Availability | Free | Free | Premium |
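
The Yes/No rows of the comparison can also be read as a feature matrix. The sketch below encodes those rows and picks the lightest tier that covers a set of required features; `TIER_FEATURES`, `TIER_ORDER`, and `lightest_tier` are hypothetical names introduced only for this example.

```python
# Yes/No rows from the comparison table, encoded as sets (illustrative only).
LUCES_FEATURES = {
    "streaming", "think mode", "web search",
    "image generation", "vision", "chess",
}

TIER_FEATURES = {
    "Palos": set(),
    "Luces": LUCES_FEATURES,
    "Summum": LUCES_FEATURES | {"file analysis", "chat memory"},
}

# Ordered from lightest to most capable.
TIER_ORDER = ["Palos", "Luces", "Summum"]


def lightest_tier(required: set) -> str:
    """Return the first tier whose features cover everything in `required`."""
    for tier in TIER_ORDER:
        if required <= TIER_FEATURES[tier]:
            return tier
    raise ValueError(f"no tier offers: {sorted(required)}")
```

With this encoding, an empty requirement set resolves to Palos, `{"web search"}` resolves to Luces, and anything that needs file analysis or chat memory resolves to Summum.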

Which one to choose

Palos: choose it when you need something fast and simple.

Luces: choose it for daily use. It is the recommended option, with access to 5 AIs and all core features.

Summum: choose it when you need maximum depth and power, with access to all 7 AIs plus exclusive features such as file analysis and chat memory (requires premium).