What is LLM (Large Language Model)? — AI Glossary

A Large Language Model is a neural network — usually with hundreds of billions of parameters — trained on trillions of tokens (words, code, math, conversation) scraped from the internet, books, and licensed sources. Once trained, it can answer questions, write code, summarize documents, translate languages, and generate creative content. The 'Large' part matters: smaller language models existed for decades (autocorrect, predictive text), but it wasn't until OpenAI's GPT-3 (2020) that scale crossed a threshold where the models could handle open-ended reasoning. The dominant LLMs of 2026 — GPT-5, Claude 4.x, Gemini 3 — share the same underlying transformer architecture but differ on training data, alignment approach, and inference cost. LLMs do not 'understand' in the human sense — they predict statistically likely next tokens. This is why they sometimes confidently produce plausible-sounding nonsense (see: hallucination).

RELATED TERMS