What is a Large Language Model (LLM)?
A large language model is the AI behind ChatGPT, Claude, and Gemini. Plain-English explanation of how LLMs work, what they can and cannot do, and how to choose between them.

Why “large”?
“Large” refers to the number of parameters in the model: billions of numerical values that encode everything the model has learned. GPT-3 (2020) had 175 billion parameters. Modern frontier models are estimated at hundreds of billions to over a trillion parameters.
Parameters are not directly interpretable (you cannot look at parameter number 4.2 billion and see “this is the concept of democracy”). They are numerical weights that collectively produce language behaviour when combined. The “large” matters because more parameters generally means the model can represent more complex patterns, though this relationship is not linear.
How an LLM generates text
An LLM generates text one token at a time. A token is roughly a word or word fragment:
- “unbelievable” might become
un,believ,able - Most common words are single tokens
- Rare words and most non-English words use multiple tokens
When you send a message, the model processes all the text in the conversation and predicts the most likely next token. Then it appends that token and predicts the next one. This continues until the model generates a stop signal or reaches a length limit.
Prompt: "The capital of Austria is"
Token 1: "Vienna" ← predicted with ~95% probability
Token 2: "." ← predicted with high probability
Token 3: [STOP]The key insight: an LLM does not retrieve facts from a database. It generates text from learned statistical patterns. This is why it can write fluently about any topic, and also why it can be wrong.
The components of a modern LLM system
What LLMs are good at
- Writing: Drafts, emails, reports, summaries, translations
- Code: Writing, explaining, debugging, reviewing, and refactoring code
- Analysis: Extracting structure from unstructured text, classifying content, identifying patterns
- Question answering: Answering questions about documents you provide (the model reads what you paste in)
- Reasoning: Multi-step logical problems, comparing options, planning
- Conversation: Maintaining context across a long dialogue
What LLMs are not good at
- Real-time information: Training data has a cut-off date. The model does not know what happened yesterday unless you tell it or connect it to a search tool.
- Precise arithmetic: LLMs are not calculators. They can do basic maths but make errors on multi-step calculations. Use code execution for precise numbers.
- Remembering between sessions: Each conversation is independent. The model does not remember your name from last week’s chat unless explicitly told.
- Guaranteed accuracy: LLMs hallucinate. They generate plausible-sounding text that may be factually wrong. Never rely on LLM output for medical, legal, or financial decisions without expert verification.
Major LLMs compared
| GPT-4o (OpenAI) | Claude Sonnet 4.6 (Anthropic) | Gemini 2.0 Flash (Google) | Mistral Large | |
|---|---|---|---|---|
| Context window | 128K tokens | 200K tokens | 1M tokens | 128K tokens |
| Multimodal (images) | Yes | Yes | Yes | No |
| Best at | General tasks, GPT ecosystem | Long documents, coding | Speed, large context | EU residency, cost |
| API pricing (input/1M) | ~€4.50 | ~€3.00 | ~€0.10 | €2.00 |
| Data residency | US | US | US or EU | EU (Paris) |
What’s next
- What is an AI Agent? : LLMs that take actions in the world, not just chat
- What is AI Hallucination? : Why LLMs produce confident wrong answers
- What is Fine-tuning? : Adapting a pre-trained LLM for a specific domain
- Building RAG Systems : Giving an LLM access to your private knowledge base
Further reading
- LLM Landscape 2026 : Side-by-side comparison of all major models
- What is Generative AI? : Broader context for the content-generation capability of LLMs
- What is Machine Learning? : The foundational technique behind LLMs
- Attention is All You Need (2017) : The original Transformer paper that made modern LLMs possible
Frequently asked questions
