LLM

127 articles Use search to find specific topics

All articles 127 total

DeepSeek Sparse Attention (DSA) New A trainable sparse-attention mechanism that scores query-key relevance … Glossary

Added 6 Jul · Upd 6 Jul ·2 min

Diffusion Language Model (dLLM) New A non-autoregressive approach to text generation that produces tokens by … Glossary

Added 6 Jul · Upd 6 Jul ·3 min

Microscaling (MX) Formats New A family of block-scaled low-precision numeric formats standardized by … Glossary

Added 6 Jul · Upd 6 Jul ·2 min

Multi-Head Latent Attention (MLA) New An attention mechanism that compresses the key-value cache into a … Glossary

Added 6 Jul · Upd 6 Jul ·2 min

Test-Time Training (TTT) New A class of layers whose hidden state is itself a small model updated by … Glossary

Added 6 Jul · Upd 6 Jul ·2 min

Titans (Learning to Memorize at Test Time) New A neural architecture with a long-term memory module that updates its … Glossary

Added 6 Jul · Upd 6 Jul ·2 min

Context Rot New The measurable degradation of an LLM's output quality as its input grows … Glossary

Added 5 Jul · Upd 5 Jul ·3 min

RLHF (Reinforcement Learning from Human Feedback) New How RLHF aligns language models with human preferences using a reward … Glossary

Added 2 Jul · Upd 2 Jul ·5 min

Catastrophic Forgetting New Why a neural network loses previously learned capabilities when trained … Glossary

Added 1 Jul · Upd 1 Jul ·4 min

LoRA and QLoRA New Low-Rank Adaptation freezes a model and trains a small pair of low-rank … Glossary

Added 1 Jul · Upd 1 Jul ·7 min

AI Evaluation New The whole practice of judging whether an AI system is fit for use, …

Added 29 Jun · Upd 29 Jun ·4 min

AI21 Labs New AI21 Labs is an enterprise AI company behind the Jamba hybrid …

Added 29 Jun · Upd 29 Jun ·4 min

Alibaba Qwen New Qwen is Alibaba Cloud's family of large language models, many released …

Added 29 Jun · Upd 29 Jun ·5 min

Cohere New Enterprise-focused model provider offering Command generation models …

Added 29 Jun · Upd 29 Jun ·4 min

Continuous Batching New An inference-serving technique that packs many users' requests onto one …

Added 29 Jun · Upd 29 Jun ·4 min

DeepSeek New DeepSeek is a Chinese AI lab known for open-weight large language models …

Added 29 Jun · Upd 29 Jun ·5 min

FinOps for AI: Controlling the Cost of LLMs and GPUs New A practical guide to controlling AI spend. Learn the cost drivers behind … Guides

Added 29 Jun · Upd 29 Jun ·9 min

Google Gemini New Google's family of frontier multimodal models, available through the …

Added 29 Jun · Upd 29 Jun ·5 min

Groq New Groq builds the LPU, a custom inference chip, and GroqCloud, a fast, …

Added 29 Jun · Upd 29 Jun ·5 min

Meta Llama New Meta's family of open-weight large language models, downloadable for …

Added 29 Jun · Upd 29 Jun ·5 min

Mistral AI New A French model provider offering open-weight and commercial LLMs plus a …

Added 29 Jun · Upd 29 Jun ·4 min

Model Evaluation New Model evaluation tests an AI model in isolation, measuring its raw …

Added 29 Jun · Upd 29 Jun ·4 min

Reka AI New Reka AI is a research lab building natively multimodal models that read …

Added 29 Jun · Upd 29 Jun ·5 min

SGLang New SGLang is an open-source high-performance serving framework for large …

Added 29 Jun · Upd 29 Jun ·5 min

Speculative Decoding New An inference speedup where a small draft model proposes several tokens …

Added 29 Jun · Upd 29 Jun ·5 min

System Evaluation New Testing an AI model plus everything around it - retrieval, prompts, …

Added 29 Jun · Upd 29 Jun ·5 min

Text Generation Inference (TGI) New Hugging Face's open-source, production-grade server for deploying open …

Added 29 Jun · Upd 5 Jul ·5 min

xAI Grok New xAI is the company behind the Grok family of large language models, …

Added 29 Jun · Upd 29 Jun ·5 min

Claude vs ChatGPT Updated Comparing the Apps and the Models Comparisons

Added 24 Mar · Upd 25 Jun ·8 min

How ChatGPT Actually Works Behind the Scenes New A plain-words walk through the request lifecycle of ChatGPT: … Guides

Added 25 Jun · Upd 25 Jun ·7 min

AI Agent Memory Management New How to give AI agents short-term and long-term memory using … Guides

Added 23 Jun · Upd 23 Jun ·8 min

Context Engineering New Curate the optimal set of tokens for every model call to cut cost and … Guides

Added 23 Jun · Upd 23 Jun ·9 min

Context Window New The Token Budget of a Language Model Glossary

Added 23 Jun · Upd 23 Jun ·3 min

Multi-Model Routing New How to route each query to the right LLM to cut cost and add … Guides

Added 23 Jun · Upd 23 Jun ·9 min

Small Language Models vs Large Language Models New How to choose between a small on-device model and a large general model … Comparisons

Added 23 Jun · Upd 23 Jun ·8 min

AI Concepts Knowledge Graph New Interactive learning-path map of 124 concepts and 237 connections. Click …

Added 22 Jun · Upd 22 Jun

LLM Landscape 2026: Every Major Model Compared Updated A comprehensive reference for every major large language model available … Comparisons

Added 1 Jun · Upd 22 Jun ·27 min

Mistral AI New European LLM provider with open-weight models and a commercial API. …

Added 22 Jun · Upd 22 Jun ·4 min

Perplexity New AI Search Engine

Added 22 Jun · Upd 22 Jun ·7 min

What is a Large Language Model (LLM)? New A large language model is the AI behind ChatGPT, Claude, and Gemini. … Basics

Added 22 Jun · Upd 22 Jun ·5 min

What is AI Hallucination? New AI hallucination is when a language model produces confident, fluent, … Basics

Added 22 Jun · Upd 22 Jun ·5 min

What is an AI Agent? New An AI agent is software that uses an LLM to plan and take actions … Basics

Added 22 Jun · Upd 22 Jun ·4 min

What is ChatGPT? New ChatGPT is an AI chatbot built by OpenAI on top of the GPT-4o language … Basics

Added 22 Jun · Upd 22 Jun ·5 min

What is Fine-tuning? New Fine-tuning adapts a pre-trained AI model to a specific task or domain … Basics

Added 22 Jun · Upd 22 Jun ·5 min

What is Generative AI? New Generative AI is software that creates new content: text, images, audio, … Basics

Added 22 Jun · Upd 22 Jun ·4 min

What is Natural Language Processing (NLP)? New Natural language processing (NLP) is the field of AI concerned with … Basics

Added 22 Jun · Upd 22 Jun ·4 min

Agent Harness The software scaffolding wrapped around a language model that turns it … Glossary

Added 14 Jun · Upd 14 Jun ·3 min

Agent Memory How an AI agent retains and recalls information beyond a single context … Glossary

Added 14 Jun · Upd 14 Jun ·3 min

AI Agents Autonomous Task Execution Glossary

Added 24 Mar · Upd 30 May ·4 min

AI Cost Optimization Patterns Model selection by task, caching strategies, batch vs real-time … Patterns

Added 24 Mar · Upd 30 May ·3 min

AI Gateway A centralized proxy layer that routes, governs, monitors, and optimizes … Glossary

Added 28 Mar · Upd 30 May ·2 min

AI Guardrails Safety and Compliance Controls Glossary

Added 24 Mar · Upd 30 May ·3 min

Amazon Bedrock Enterprise AI Foundation Models

Added 24 Mar · Upd 14 Jun ·6 min

Amazon Bedrock vs Google Vertex AI Cloud AI Platforms Compared Comparisons

Added 28 Mar · Upd 14 Jun ·5 min

AutoGen vs CrewAI Multi-Agent Systems Compared Comparisons

Added 28 Mar · Upd 14 Jun ·5 min

Azure OpenAI Enterprise GPT on Microsoft Cloud Tools

Added 28 Mar · Upd 30 May ·3 min

Building AI Chatbots From Prototype to Production Guides

Added 28 Mar · Upd 30 May ·5 min

Caching Patterns for AI Applications Semantic caching, Anthropic prompt caching, response caching, and … Patterns

Added 26 Mar · Upd 30 May ·5 min

Case Pattern: Automated Content Generation for a News Agency How a news agency automated structured report generation from data feeds … Case Patterns

Added 24 Mar · Upd 30 May ·3 min

Chain-of-Thought (CoT) Prompting Eliciting intermediate reasoning steps from language models to improve … Glossary

Added 8 May · Upd 30 May ·6 min

Claude by Anthropic Enterprise AI Assistant Tools

Added 24 Mar · Upd 30 May ·4 min

Context Engineering The practice of curating and maintaining the optimal set of tokens an … Glossary

Added 14 Jun · Upd 14 Jun ·3 min

Context Window Management Patterns Summarization, sliding window, retrieval-augmented, and hierarchical … Patterns

Added 24 Mar · Upd 30 May ·3 min

CrewAI Multi-Agent Orchestration Framework Tools

Added 24 Mar · Upd 30 May ·4 min

Custom ML Models vs Foundation Models When to Build vs Buy Comparisons

Added 24 Mar · Upd 14 Jun ·5 min

Daily AI Sparks One Automation Idea Per Day Ideas

Added 24 Mar · Upd 30 May ·3 min

DeepEval vs Promptfoo for LLM Evaluation in CI Comparing DeepEval and Promptfoo for automated LLM evaluation: metrics, … Comparisons

Added 28 Mar · Upd 14 Jun ·7 min

DSPy Programming with Foundation Models Tools

Added 28 Mar · Upd 30 May ·3 min

Few-Shot Learning What few-shot learning is, how it enables models to generalize from … Glossary

Added 28 Mar · Upd 30 May ·2 min

Fine-Tuning LLMs A Practical Guide Guides

Added 28 Mar · Upd 30 May ·10 min

Fine-Tuning vs Prompt Engineering Tradeoffs Comparing fine-tuning and prompt engineering for customizing LLM … Comparisons

Added 28 Mar · Upd 14 Jun ·5 min

Fine-Tuning vs Prompt Engineering vs RAG The three main approaches to customizing LLM behavior for specific use … Glossary

Added 24 Mar · Upd 30 May ·4 min

Foundation Models What foundation models are, how they differ from task-specific models, … Glossary

Added 24 Mar · Upd 30 May ·4 min

Full-Stack Observability for AI Systems How to implement comprehensive observability for AI applications … Guides

Added 28 Mar · Upd 30 May ·3 min

Function Calling Structured tool invocation by language models: how the model emits typed … Glossary

Added 8 May · Upd 30 May ·6 min

Getting Started with Amazon Bedrock for Enterprise AI A practical introduction to Amazon Bedrock: what it is, which models are … Guides

Added 24 Mar · Upd 30 May ·3 min

GPT-4 vs Claude for Enterprise Use A practical comparison of GPT-4 and Claude for enterprise applications, … Comparisons

Added 28 Mar · Upd 14 Jun ·7 min

Guardrails AI LLM Output Validation Tools

Added 28 Mar · Upd 30 May ·4 min

Hallucination What AI hallucination is, why language models generate plausible but … Glossary

Added 28 Mar · Upd 30 May ·3 min

How to Prepare for Sudden AI Provider Restrictions A resilience playbook for builders: multi-provider abstraction, … Guides

Added 14 Jun · Upd 14 Jun ·4 min

Inference Running AI Models in Production Glossary

Added 24 Mar · Upd 30 May ·3 min

Inference-Time Compute The practice of allocating additional computation during model inference … Glossary

Added 28 Mar · Upd 30 May ·2 min

LangChain LLM Application Framework Tools

Added 28 Mar · Upd 14 Jun ·6 min

LangChain vs DSPy LLM Application Development Compared Comparisons

Added 28 Mar · Upd 14 Jun ·5 min

LangChain vs LlamaIndex LLM Framework Comparison Comparisons

Added 28 Mar · Upd 14 Jun ·6 min

Level 4: AI and Building Production AI, vibe coding, and language models. How AI systems actually …

Added 29 May · Upd 30 May ·10 min

LLM Large Language Model Glossary

Added 24 Mar · Upd 30 May ·3 min

LLM Evaluation Methods Measuring Language Model Quality Guides

Added 28 Mar · Upd 30 May ·7 min

LLM Gateway Architecture How to design a centralized LLM access layer that handles routing, rate … Guides

Added 28 Mar · Upd 30 May ·3 min

LLM Routing Architectures that direct each request to one of several available … Glossary

Added 8 May · Upd 30 May ·5 min

LLM-as-a-Judge Using a language model as an automated evaluator of another model's … Glossary

Added 8 May · Upd 30 May ·5 min

LLMOps LLM Operations Glossary

Added 28 Mar · Upd 30 May ·3 min

Managing Prompts at Scale: Versioning, Testing, Deployment How to treat prompts as first-class software artifacts with version … Guides

Added 28 Mar · Upd 30 May ·4 min

Mixture of Agents How multi-LLM collaboration frameworks improve response quality by … Glossary

Added 28 Mar · Upd 30 May ·2 min

Mixture of Experts (MoE) A neural network architecture in which only a small subset of parameters … Glossary

Added 8 May · Upd 30 May ·5 min

Multi-Agent AI Systems When One Model Is Not Enough Guides

Added 24 Mar · Upd 30 May ·3 min

Multi-Agent Systems Definition, architecture patterns, and frameworks for multi-agent AI … Glossary

Added 24 Mar · Upd 30 May ·4 min

Multi-Modal AI Working with Text, Images, and Beyond Guides

Added 28 Mar · Upd 30 May ·5 min

Multi-Provider LLM Failover Automatic failover between LLM providers for high availability: health … Patterns

Added 28 Mar · Upd 30 May ·3 min

Ollama Local LLM Inference Engine Tools

Added 28 Mar · Upd 30 May ·3 min

OpenAI API GPT and DALL-E Integration Tools

Added 28 Mar · Upd 30 May ·4 min

OpenAI vs Anthropic Platform and Model Comparison Comparisons

Added 28 Mar · Upd 14 Jun ·7 min

OWASP Top 10 for LLM Applications (2025) Practical guide to the OWASP Top 10 vulnerabilities for LLM … Guides

Added 28 Mar · Upd 30 May ·4 min

PII Redaction Pipeline Automated detection and removal of personally identifiable information … Patterns

Added 28 Mar · Upd 30 May ·3 min

Prompt Caching Server-side caching of attention key/value tensors for repeated prompt … Glossary

Added 8 May · Upd 30 May ·5 min

Prompt Chaining Breaking Complex Tasks into Steps Guides

Added 28 Mar · Upd 30 May ·5 min

Prompt Engineering What prompt engineering is, why it matters in enterprise AI … Glossary

Added 24 Mar · Upd 30 May ·4 min

Prompt Engineering for Enterprise AI Applications Practical prompt engineering patterns for production AI systems: system … Guides

Added 26 Mar · Upd 30 May ·5 min

Prompt Engineering Patterns for Enterprise Applications Proven prompt patterns for enterprise AI applications: structured … Patterns

Added 24 Mar · Upd 30 May ·4 min

Prompt Injection An attack technique where malicious input manipulates an LLM into … Glossary

Added 28 Mar · Upd 30 May ·3 min

Prompt Injection Defense Layered defense strategies against prompt injection attacks in … Patterns

Added 28 Mar · Upd 30 May ·3 min

RAG vs Long Context Windows for Knowledge Access Comparing retrieval-augmented generation and long context windows as … Comparisons

Added 28 Mar · Upd 14 Jun ·5 min

Rate Limiting for LLM and AI Endpoints How to implement rate limiting for AI API endpoints: token bucket and … Guides

Added 28 Mar · Upd 30 May ·4 min

Reasoning Models Language models post-trained to allocate substantial inference-time … Glossary

Added 8 May · Upd 30 May ·6 min

Reducing LLM Inference Costs in Production Practical strategies for reducing LLM API and hosting costs without … Guides

Added 28 Mar · Upd 30 May ·3 min

Single Agent vs Multi-Agent Architectures When to use a single AI agent versus a multi-agent system, covering … Comparisons

Added 28 Mar · Upd 14 Jun ·5 min

Structured Output Constraining a language model to emit output that conforms to a … Glossary

Added 8 May · Upd 30 May ·5 min

Testing AI Systems Unit Tests to Production Monitoring Guides

Added 25 Mar · Upd 30 May ·5 min

Testing LLM Applications LLM-specific testing strategies: prompt template testing, structured … Guides

Added 28 Mar · Upd 30 May ·5 min

Tiered Analysis Pattern Progressive Depth for AI Processing Patterns

Added 26 Mar · Upd 30 May ·5 min

Token Budget The maximum number of tokens allocated for an LLM request or workflow, … Glossary

Added 28 Mar · Upd 30 May ·2 min

Tokenization in AI What tokens are, how different models tokenize text, why token count … Glossary

Added 24 Mar · Upd 30 May ·3 min

Tool Use (in Language Models) The capability of a language model to invoke external tools: APIs, code … Glossary

Added 8 May · Upd 30 May ·5 min

vLLM High-Performance LLM Serving Engine Tools

Added 28 Mar · Upd 30 May ·3 min

WebSocket What WebSockets are, how they enable real-time bidirectional … Glossary

Added 28 Mar · Upd 30 May ·3 min

What is AI? AI is software that learns patterns from data instead of following … Basics

Added 24 May · Upd 30 May ·7 min

Zero-Shot Learning What zero-shot learning is, how models perform tasks without examples, … Glossary

Added 28 Mar · Upd 30 May ·2 min

127 articles in this section. Search for a specific topic.

Open source projects

Freelancer Templates Contracts, proposals, SOWs

Freelancer Automation Workflow recipes, AI playbooks

Work with Linda

Workshop Series €2,000/mo x 3

1:1 Consulting 60 min session