Correct. Tokens are the units text gets split into before processing.

How does a language model choose the next word?

Correct. That's the core mechanism – token by token, based on learned probabilities.

Why can a language model sometimes give confidently wrong answers ("hallucinate")?

Exactly. The missing truth check is the actual cause of hallucinations.

How does a language model actually think? Tokens, probability & hallucination

A language model often seems remarkably smart – but it doesn't "understand" text the way a human does. This module shows the four simple mechanisms behind it, so you can judge when to trust an answer and when not to.

The trick behind seemingly clever answers

A language model like ChatGPT or Claude only predicts, at each step, which piece of text most likely comes next – trained on huge amounts of text. Out of this one simple ability, sheer scale produces behavior that looks remarkably intelligent.

The four building blocks that explain the behavior

Tokens – the building blocks of language

A language model doesn't read or write in whole words, but in tokens – small text chunks that can also be parts of words.

Probability – how the next word gets chosen

For every next token, the model computes probabilities for all the candidates in question and picks one – token by token, until a whole answer emerges.

Context window – the memory for a conversation

The model only "knows" what's in the current context window. Start a new chat, and that memory is gone – unless an app builds a separate memory layer on top.

No truth check – why hallucinations happen

The model has no built-in mechanism that checks whether a statement is true – it optimizes for plausible-sounding text. That's why false statements can sound just as confident as correct ones.

Why this matters for you as a decision-maker

These four points explain why a language model isn't a database: important facts should be checked, important context has to be given explicitly, and answers are probabilities, not guarantees. This is exactly why Beyond Prompt deliberately uses deterministic logic instead of generative models for pricing and calculations (see the blog article on quote automation) – the language model gets to understand and phrase things, but a rule decides.

How does a language model actually "think"?