Advanced

38 articles Use search to find specific topics
Showing 24 of 38
Async Job Queues - A Production Pattern for AI Applications How to offload slow operations: AI inference, video processing, file handling: from HTTP request cycles using …Reasoning Models Language models post-trained to allocate substantial inference-time compute to internal reasoning before …Mixture of Experts (MoE) A neural network architecture in which only a small subset of parameters is activated for each input, enabling …LLM-as-a-Judge Using a language model as an automated evaluator of another model's outputs: methodology, calibration with …Direct Preference Optimization (DPO) An alignment method that fine-tunes language models directly on preference data without training an explicit …Agentic RAG Retrieval-Augmented Generation systems in which the language model actively plans, queries, critiques, and …Hybrid Cloud AI Video Pipeline with Amazon FSx for NetApp ONTAP How to build an AI video processing pipeline that spans on-premises storage and AWS cloud using FSx for NetApp …Twelve-Factor AI - Applying 12-Factor App Principles to AI Systems How each of the 12 original 12-factor app principles applies to AI and LLM-based systems: model configuration, …Strangler Fig Pattern for AI Migration How to gradually replace manual processes and legacy rule-based systems with AI using the strangler fig …Microservices Architecture for AI Systems How to decompose AI systems into independent services with clear boundaries, API contracts, and independent …Event Sourcing and CQRS for AI Pipelines Using event-driven architecture patterns for AI data pipelines: immutable event logs, replay capability, audit …CI/CD for AI Projects - A Complete Pipeline Guide A detailed walkthrough of a CI/CD pipeline for AI: source control, Docker builds, model evaluation, staged …Amazon Bedrock AgentCore - Serverless AI Agent Hosting How Amazon Bedrock AgentCore provides managed infrastructure for running AI agents at scale without managing …RAG Implementation Patterns - Retrieval Augmented Generation in Practice Practical patterns for building production RAG systems: chunking strategies, retrieval optimization, …GIS and AI Architecture on AWS How to combine geospatial data processing (GeoPandas, Shapely, satellite imagery) with AI services (Bedrock, …Case Pattern: Multi-Track Audio Analysis for Film Production Architecture for an AI system that processes multi-track audio from film production, identifying issues, …Case Pattern: Insurance Claims Modernization with AI Architecture and lessons from modernizing an insurance claims processing workflow using AI for document …Case Pattern: Building a Geospatial AI Platform from Public Data How a team built a geospatial intelligence platform combining satellite imagery, public datasets, and AI …Case Pattern: Automated Content Generation for a News Agency How a news agency automated structured report generation from data feeds - producing hundreds of articles per …Case Pattern: AI Video Processing Pipeline for a National Broadcaster Architecture and lessons from building a production AI pipeline that processes, indexes, and makes searchable …Building RAG Systems - A Step-by-Step Guide Document ingestion, chunking strategies, embedding models, vector stores, retrieval tuning, and generation …Building an AI Video Pipeline on AWS Architecture guide for an end-to-end AI video pipeline: S3 ingest, Lambda trigger, Rekognition analysis, …Amazon SageMaker - Custom ML Model Training and Deployment What SageMaker is, when to use it instead of Bedrock, key capabilities, pricing model, and the workflows that …AI Video Editing Automation for Broadcasters How AI automates the most time-consuming parts of broadcast video editing - rough cuts, highlight generation, …

38 articles in this section. Search for a specific topic.