experiment-tracking

Original🇺🇸 English
Translated

Use when "experiment tracking", "MLflow", "Weights & Biases", "wandb", "model registry", "hyperparameter logging", "ML experiments", "training metrics"

6installs
Added on

NPX Install

npx skill4agent add eyadsibai/ltk experiment-tracking

Tags

Translated version includes tags in frontmatter

Experiment Tracking

Track ML experiments, metrics, and models.

Comparison

PlatformBest ForSelf-hostedVisualization
MLflowOpen-source, model registryYesBasic
W&BCollaboration, sweepsLimitedExcellent
NeptuneTeam collaborationNoGood
ClearMLFull MLOpsYesGood

MLflow

Open-source platform from Databricks.
Core components:
  • Tracking: Log parameters, metrics, artifacts
  • Projects: Reproducible runs (MLproject file)
  • Models: Package and deploy models
  • Registry: Model versioning and staging
Strengths: Self-hosted, open-source, model registry, framework integrations Limitations: Basic visualization, less collaborative features
Key concept: Autologging for major frameworks - automatic metric capture with one line.

Weights & Biases (W&B)

Cloud-first experiment tracking with excellent visualization.
Core features:
  • Experiment tracking: Metrics, hyperparameters, system stats
  • Sweeps: Hyperparameter search (grid, random, Bayesian)
  • Artifacts: Dataset and model versioning
  • Reports: Shareable documentation
Strengths: Beautiful visualizations, team collaboration, hyperparameter sweeps Limitations: Cloud-dependent, limited self-hosting
Key concept:
wandb.init()
+
wandb.log()
- simple API, powerful features.

What to Track

CategoryExamples
HyperparametersLearning rate, batch size, architecture
MetricsLoss, accuracy, F1, per-epoch values
ArtifactsModel checkpoints, configs, datasets
SystemGPU usage, memory, runtime
CodeGit commit, diff, requirements

Model Registry Concepts

StagePurpose
NoneJust logged, not registered
StagingTesting, validation
ProductionServing live traffic
ArchivedDeprecated, kept for reference

Decision Guide

ScenarioRecommendation
Self-hosted requirementMLflow
Team collaborationW&B
Model registry focusMLflow
Hyperparameter sweepsW&B
Beautiful dashboardsW&B
Full MLOps pipelineMLflow + deployment tools

Resources