Glossary
Recent articles
Showing 24 of 441
Tool Use (in Language Models)
The capability of a language model to invoke external tools (APIs, code execution, retrieval, computation) and …
Structured Output
Constraining a language model to emit output that conforms to a specified schema (JSON, regex, grammar). The …
Reasoning Models
Language models post-trained to allocate substantial inference-time compute to internal reasoning before …
Prompt Caching
Server-side caching of attention key/value tensors for repeated prompt prefixes, reducing latency and cost for …
Model Context Protocol (MCP)
An open protocol that standardises how language models connect to tools, data sources, and external systems …
Mixture of Experts (MoE)
A neural network architecture in which only a small subset of parameters is activated for each input, enabling …
LLM-as-a-Judge
Using a language model as an automated evaluator of another model's outputs: methodology, calibration with …
LLM Routing
Architectures that direct each request to one of several available language models based on cost, capability, …
Function Calling
Structured tool invocation by language models: how the model emits typed function calls, how runtimes execute …
Direct Preference Optimization (DPO)
An alignment method that fine-tunes language models directly on preference data without training an explicit …
Chain-of-Thought (CoT) Prompting
Eliciting intermediate reasoning steps from language models to improve performance on multi-step problems, …
Agentic RAG
Retrieval-Augmented Generation systems in which the language model actively plans, queries, critiques, and …
Zero-Shot Learning
What zero-shot learning is, how models perform tasks without examples, and when zero-shot approaches are …
Zero Trust Architecture
What zero trust means, how it replaces perimeter-based security, and why AI model serving and data access …
Zachman Framework
A two-dimensional classification schema for organizing the descriptive representations of an enterprise, …
YAGNI Principle - You Aren't Gonna Need It
A software development principle from Extreme Programming stating that functionality should not be added until …
XGBoost
What XGBoost is, why it dominates structured data tasks, and practical guidance for using gradient-boosted …
Workflow Engine
Software that automates the execution of business processes by coordinating tasks, decisions, and integrations …
Work Breakdown Structure (WBS)
A hierarchical decomposition of project scope into manageable deliverables and work packages.
WebSocket
What WebSockets are, how they enable real-time bidirectional communication, and why they are used for …
Webhooks
Webhooks are user-defined HTTP callbacks that deliver real-time event notifications between web applications, …
Web Components
Web Components are a set of W3C standards (Custom Elements, Shadow DOM, HTML Templates) for creating reusable, …
VPC - Virtual Private Cloud
What a VPC is, how it provides network isolation on AWS, and essential VPC design considerations for AI …
Vite
The next-generation frontend build tool created by Evan You in 2020 that leverages native ES modules for …