LLM
Large Language Model
A neural network trained on vast amounts of text data that can generate, understand, and analyze natural language text.
What Is an LLM
A Large Language Model (LLM) is a type of AI built on the transformer architecture. The model is trained on billions of text documents and can:
- Generate text based on instructions (prompts)
- Answer questions
- Translate between languages
- Write and analyze code
- Act as an agent with tools
Popular LLMs
- Claude (Anthropic) — Opus, Sonnet, Haiku
- GPT (OpenAI) — GPT-4o, o3
- Gemini (Google) — Gemini 2.5 Pro
- Llama (Meta) — open-weight model family
Key Parameters
- Context window — how much text the model can process at once, measured in tokens and covering both the prompt and the generated response
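- Temperature — how much randomness is used when choosing the next token (0 = nearly deterministic, always favoring the most likely token; values around 1 = more varied, “creative” output)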
- Tokens — units of text the model works with (~4 characters = 1 token in English)
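The three parameters above can be illustrated with a short Python sketch. It is an illustration under stated assumptions, not how any particular model or API implements them: the logits are made-up scores, the function names are hypothetical, and the token count uses only the rough 4-characters-per-token rule for English rather than a real tokenizer.

```python
import math


def softmax_with_temperature(logits, temperature):
    """Convert raw scores into probabilities, scaled by temperature.

    Low temperature concentrates probability on the top score;
    high temperature spreads it more evenly.
    """
    if temperature <= 0:
        # Temperature 0 is usually treated as "pick the top token" instead.
        raise ValueError("temperature must be > 0")
    scaled = [x / temperature for x in logits]
    peak = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(s - peak) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def estimate_tokens(text, chars_per_token=4):
    """Rough token estimate for English text (assumption: ~4 chars per token)."""
    return math.ceil(len(text) / chars_per_token)


def fits_in_context(text, context_window_tokens, reserved_for_output=0):
    """Check whether the estimated prompt size, plus room for the reply, fits."""
    return estimate_tokens(text) + reserved_for_output <= context_window_tokens


# Hypothetical scores for three candidate next tokens.
logits = [2.0, 1.0, 0.1]
for t in (0.2, 1.0, 2.0):
    probs = softmax_with_temperature(logits, t)
    print(t, [round(p, 3) for p in probs])

prompt = "Summarize the following article in three sentences."
print(estimate_tokens(prompt), fits_in_context(prompt, context_window_tokens=8000))
```

Running the loop shows the top candidate taking almost all of the probability at a low temperature and losing share as the temperature rises. For exact token counts, use the tokenizer that belongs to the specific model; the character-based estimate is only good for quick budgeting.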