AI Systems Are Software Systems
Why production AI requires the same engineering discipline as any distributed system, and how this wiki covers the full stack of AI …
Why production AI requires the same engineering discipline as any distributed system, and how this wiki covers the full stack of AI …
A comprehensive guide to GitHub Actions security vulnerabilities, common exploit patterns, and how to audit and harden your CI/CD pipelines …
A step-by-step guide to creating professional demo and explainer videos entirely in code using Remotion, React, and AI-assisted development. …
The principle of defining infrastructure, configuration, documentation, policy, video, and design as version-controlled code artifacts - and …
A design principle stating that systems work best when they are kept simple rather than made complex, favoring straightforward solutions …
The process of eliciting, analyzing, documenting, and validating the needs and constraints that a software system must satisfy.
The discipline of tracking and controlling changes to software artifacts, rooted in military standards and formalized by IEEE 828, …
The structured process of planning, creating, testing, and deploying software systems through defined phases.
Overview of the IEEE Software Engineering Body of Knowledge Version 4, covering its knowledge areas and relevance to AI/ML engineering.
The accumulated cost of shortcuts, compromises, and deferred improvements in a software system that increase future maintenance effort.
The Unified Modeling Language, a standardized visual notation for specifying, constructing, and documenting software systems through 14 …
What an API is, REST vs GraphQL vs gRPC, authentication patterns, rate limiting, and how AI services are accessed through standardized API …
A practical guide to the three languages used across a modern AI stack: Python for agents and models, TypeScript for frontends and video …
Exponential backoff with jitter, retry budgets, and idempotency patterns for production AI systems. Why AI services require different retry …
What event sourcing is, why it matters for AI audit trails and pipeline replay, its relationship to CQRS, and when to apply it in AI …
What feature flags are, how they enable safe AI model rollouts, A/B testing, and instant rollback - and the tools available for implementing …
Using feature flags to safely roll out AI model changes: A/B testing models, canary deployments, gradual traffic shifting, and instant …
What property-based testing is, why it is ideal for AI systems that cannot be tested with exact-output assertions, and the tools available …
A practical testing strategy for AI systems: property-based testing, integration testing with mocked models, evaluation frameworks, and …
Using AWS Amplify to deploy front-end applications, host static sites, and connect to AWS AI backends.
What event-driven architecture is, how S3 triggers, EventBridge, and Step Functions patterns enable scalable AI pipelines.
Using Hugo to build fast, maintainable documentation sites and AI solution landing pages, with GitHub Pages and Amplify deployment.
Notion API for structured data, MCP integration, and using Notion databases as knowledge stores for AI agents. When it works and when to …