Scrum and Kanban are both agile frameworks, but they manage work differently. Scrum uses time-boxed sprints with defined commitments. Kanban uses continuous flow with work-in-progress limits. For ML teams, the choice depends on the type of work and how predictable it is.

Framework Comparison

AspectScrumKanban
Work cadenceFixed sprints (1-4 weeks)Continuous flow
PlanningSprint planning per sprintOn-demand (pull when capacity available)
CommitmentsSprint goal and backlogWIP limits only
RolesProduct Owner, Scrum Master, Dev TeamNo prescribed roles
CeremoniesPlanning, standup, review, retroDaily board review (optional)
MetricsVelocity (points per sprint)Cycle time, throughput
Change during cycleDiscouraged within sprintAllowed anytime
BoardSprint backlog (refreshed per sprint)Continuous (work flows through)

ML Work Type Analysis

Research and Experimentation

ML research (trying new model architectures, feature engineering experiments, hyperparameter tuning) is inherently unpredictable. An experiment might take 2 hours or 2 weeks.

Scrum challenges: Research stories are hard to estimate. Sprint commitments feel artificial because outcomes are uncertain. A sprint where all experiments “failed” (produced useful negative results) looks like zero velocity.

Kanban advantages: No sprint commitments means no penalty for experiments that take longer than expected. Continuous flow accommodates variable-duration work naturally. WIP limits prevent the team from running too many experiments simultaneously.

Recommendation: Kanban for research-heavy phases

Production Engineering

Building data pipelines, model serving APIs, monitoring dashboards, and CI/CD infrastructure is predictable engineering work with clear requirements and estimable effort.

Scrum advantages: Sprint commitments provide delivery cadence. Stakeholders get regular demonstrations. Velocity tracking enables capacity planning. Sprint goals focus the team.

Kanban challenges: Without sprint cadence, there is no natural point for stakeholder demos or team retrospectives. Continuous flow can feel directionless without sprint goals.

Recommendation: Scrum for engineering-heavy phases

Operations and Maintenance

Model monitoring, incident response, retraining, and infrastructure maintenance are interrupt-driven. Work arrives unpredictably and varies in urgency.

Scrum challenges: Unplanned work disrupts sprint commitments. Reserving capacity for “unknown” interrupts is imprecise. Sprint planning for unpredictable work is wasteful.

Kanban advantages: Work items enter the backlog as they arrive. The team pulls work based on priority and capacity. Service-level expectations (incidents resolved within 4 hours, standard requests within 5 days) replace sprint commitments.

Recommendation: Kanban for operations teams

Metrics Comparison

Scrum Metrics for ML

Velocity (story points per sprint) is the primary Scrum metric. For ML teams, velocity is unreliable because research story estimates have high variance. A more useful metric is the number of experiments completed per sprint (experiment throughput), regardless of their outcome.

Sprint goal completion tracks whether the team achieves its sprint objective. For ML sprints, goals should be outcome-based (“determine whether approach X is viable”) rather than output-based (“complete stories A, B, C”).

Kanban Metrics for ML

Cycle time measures how long work items take from start to finish. Track separately by work type: experiment cycle time, engineering cycle time, incident resolution time. Decreasing cycle time indicates improving team efficiency.

Throughput measures completed items per time period. For ML teams, track experiments completed per week and engineering items completed per week separately.

Cumulative flow diagram shows work distribution across board columns over time. Widening bands indicate bottlenecks. For ML teams, this often reveals that “in review” or “waiting for data” columns accumulate work.

Hybrid Approaches

Many ML teams use elements of both:

Scrumban

Sprint cadence with Kanban-style work management:

  • Sprint planning sets a goal and timeframe
  • During the sprint, work is pulled from a prioritized backlog (Kanban style)
  • No rigid sprint commitment - the sprint goal provides direction without rigid story commitment
  • Sprint reviews and retrospectives provide regular feedback points
  • WIP limits prevent overcommitment

Dual-Track

Separate tracks with different frameworks:

  • Research track uses Kanban (continuous flow, WIP limits)
  • Engineering track uses Scrum (sprint commitments, velocity tracking)
  • Both tracks align on shared goals and dependencies

This approach acknowledges that different types of ML work need different management approaches.

Transitioning Between Frameworks

ML projects naturally transition between phases that favor different frameworks:

Early phase (exploration): Kanban. The team explores data, tests approaches, and experiments freely. Work is unpredictable.

Middle phase (development): Scrum. The approach is validated, and the team builds the production system. Work is more predictable and benefits from sprint structure.

Late phase (operations): Kanban. The system is in production, and the team handles monitoring, incidents, and improvements. Work is interrupt-driven.

Rather than picking one framework for the entire project, adapt the framework to the current phase.

Recommendation

Start with Kanban if your ML team is new to agile practices. Kanban’s simplicity (visualize work, limit WIP, manage flow) is easier to adopt than Scrum’s full ceremony structure. Add Scrum elements (sprint cadence, planning, reviews) as the team matures and as the work becomes more predictable.

Start with Scrum if your ML team needs structure, stakeholder communication rhythm, and delivery predictability. Adapt ceremonies and metrics for ML work (experiment stories, research velocity, time-boxed spikes).

The best framework is the one the team will actually follow. Choose based on team preference and work type, then iterate.