Skip to main content

GLOSSARY

What is LLM (Large Language Model)?

An AI system trained on massive text datasets to predict and generate human-like text — the technology behind ChatGPT, Claude, Gemini, and most modern AI chatbots.

A Large Language Model is a neural network — usually with hundreds of billions of parameters — trained on trillions of tokens (words, code, math, conversation) scraped from the internet, books, and licensed sources. Once trained, it can answer questions, write code, summarize documents, translate languages, and generate creative content. The 'Large' part matters: smaller language models existed for decades (autocorrect, predictive text), but it wasn't until OpenAI's GPT-3 (2020) that scale crossed a threshold where the models could handle open-ended reasoning. The dominant LLMs of 2026 — GPT-5, Claude 4.x, Gemini 3 — share the same underlying transformer architecture but differ on training data, alignment approach, and inference cost. LLMs do not 'understand' in the human sense — they predict statistically likely next tokens. This is why they sometimes confidently produce plausible-sounding nonsense (see: hallucination).

RELATED TERMS

Back to the AI Glossary