Monitoring, drift & retraining

Before we begin

Models degrade when reality diverges from training data. Production CV needs telemetry, alerts, and a retraining playbook.

Examples of data (input) drift: a new camera sensor, seasonal lighting changes, or deployment in a different geographic region.

Monitor histograms of brightness, blur estimates, and aspect ratios, and alert when the KL divergence from the training-time distribution, or a simple z-score, exceeds a threshold.
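A minimal sketch of such a check for one statistic (per-image mean brightness), using only the standard library. The function names, the 16-bin histogram, and both thresholds are illustrative assumptions, not values from this module; in practice thresholds would be tuned on historical data.

```python
import math
from typing import List, Sequence

# Hypothetical thresholds; tune them on your own historical traffic.
KL_THRESHOLD = 0.1
Z_THRESHOLD = 1.0


def histogram(values: Sequence[float], bins: int = 16,
              lo: float = 0.0, hi: float = 255.0) -> List[float]:
    """Normalized histogram with a tiny additive floor so no bin is zero."""
    counts = [1e-6] * bins  # the floor keeps log() finite in kl_divergence
    width = (hi - lo) / bins
    for v in values:
        idx = int((v - lo) / width)
        idx = max(0, min(idx, bins - 1))  # clamp out-of-range values
        counts[idx] += 1.0
    total = sum(counts)
    return [c / total for c in counts]


def kl_divergence(p: Sequence[float], q: Sequence[float]) -> float:
    """KL(p || q) in nats for two histograms over the same bins."""
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q))


def mean_z_score(live: Sequence[float], ref: Sequence[float]) -> float:
    """How many reference standard deviations the live mean has moved."""
    ref_mean = sum(ref) / len(ref)
    ref_std = math.sqrt(sum((v - ref_mean) ** 2 for v in ref) / len(ref))
    live_mean = sum(live) / len(live)
    if ref_std == 0.0:
        return 0.0 if live_mean == ref_mean else math.inf
    return (live_mean - ref_mean) / ref_std


def drift_alert(live: Sequence[float], ref: Sequence[float]) -> bool:
    """True when either the KL divergence or the z-score crosses its threshold."""
    kl = kl_divergence(histogram(live), histogram(ref))
    return kl > KL_THRESHOLD or abs(mean_z_score(live, ref)) > Z_THRESHOLD


if __name__ == "__main__":
    reference = [100 + (i % 40) for i in range(400)]  # training-time brightness
    darker_site = [v - 60 for v in reference]         # simulated lighting change
    print("alert on reference:", drift_alert(reference, reference))
    print("alert on darker site:", drift_alert(darker_site, reference))
```

The same pattern applies to blur estimates or aspect ratios by changing the histogram range. Running one check per statistic keeps alerts easy to interpret.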

Concept drift: same pixels, different meaning. When a new product SKU appears or defect types change, the labels learned in training no longer match reality.

Fixing this requires new labeled data and a model update. Input statistics alone will not reveal it, so monitoring an accuracy proxy, such as a human-reviewed sample of predictions, is essential.
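One way to turn a small human-reviewed sample into an alertable signal is to put a confidence interval around the observed accuracy. The sketch below uses the Wilson score interval; the function names and the 0.90 accuracy target are assumptions for illustration, not part of this module.

```python
import math
from typing import Tuple


def wilson_interval(correct: int, total: int, z: float = 1.96) -> Tuple[float, float]:
    """Wilson score interval for a proportion (z=1.96 is roughly 95%)."""
    if total <= 0:
        raise ValueError("total must be positive")
    p = correct / total
    denom = 1.0 + z * z / total
    center = (p + z * z / (2 * total)) / denom
    half = z * math.sqrt(p * (1 - p) / total + z * z / (4 * total * total)) / denom
    return center - half, center + half


def accuracy_alert(correct: int, total: int, min_accuracy: float = 0.90) -> bool:
    """Alert only when even the optimistic bound is below the target.

    min_accuracy is a hypothetical service-level target.
    """
    _, upper = wilson_interval(correct, total)
    return upper < min_accuracy


if __name__ == "__main__":
    # Reviewers checked 100 sampled predictions and agreed with 78 of them.
    print(wilson_interval(78, 100))
    print("alert:", accuracy_alert(78, 100))
```

Alerting on the upper bound avoids paging on noise from a small sample. A stricter policy could alert on the lower bound instead.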

Shadow deployment: run the candidate model on live traffic in parallel with production and log its disagreements with the production model. Users see no impact until the candidate is validated.
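The shadow pattern can be sketched as a thin wrapper around two callables. Everything here (the `ShadowRunner` class, its fields, the in-memory disagreement log) is a hypothetical illustration; a real service would likely run the candidate asynchronously and write to a proper log store.

```python
from dataclasses import dataclass, field
from typing import Any, Callable, Dict, List


@dataclass
class ShadowRunner:
    production: Callable[[Any], str]  # model whose output is served
    candidate: Callable[[Any], str]   # model under evaluation, never served
    disagreements: List[Dict[str, Any]] = field(default_factory=list)
    total: int = 0

    def predict(self, request_id: str, image: Any) -> str:
        """Serve the production result; record where the candidate differs."""
        served = self.production(image)
        self.total += 1
        try:
            shadow = self.candidate(image)
        except Exception as exc:
            # A failing candidate must never affect the user-facing response.
            self.disagreements.append(
                {"id": request_id, "production": served, "error": repr(exc)}
            )
            return served
        if shadow != served:
            self.disagreements.append(
                {"id": request_id, "production": served, "candidate": shadow}
            )
        return served

    def disagreement_rate(self) -> float:
        """Fraction of requests where the candidate differed or failed."""
        return len(self.disagreements) / self.total if self.total else 0.0


if __name__ == "__main__":
    # Toy stand-ins for real models: classify by a single brightness number.
    runner = ShadowRunner(
        production=lambda x: "defect" if x > 0.5 else "ok",
        candidate=lambda x: "defect" if x > 0.4 else "ok",
    )
    for i, x in enumerate([0.1, 0.45, 0.9]):
        runner.predict(f"req-{i}", x)
    print(runner.disagreements, runner.disagreement_rate())
```

Reviewing the logged disagreements, rather than only their rate, shows whether the candidate is fixing production errors or introducing new ones.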

Route low-confidence predictions to human reviewers; the corrected labels feed the next training round (active learning).
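A minimal sketch of that routing loop, assuming the model returns a label with a confidence in [0, 1]. The `ReviewLoop` class, its method names, and the 0.6 threshold are hypothetical; the in-memory containers stand in for a real review queue and dataset store.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Tuple


@dataclass
class ReviewLoop:
    threshold: float = 0.6  # hypothetical cutoff; tune against reviewer capacity
    pending: Dict[str, str] = field(default_factory=dict)
    training_examples: List[Tuple[str, str]] = field(default_factory=list)

    def handle(self, image_id: str, label: str, confidence: float) -> str:
        """Return 'review' for low-confidence predictions, otherwise 'auto'."""
        if confidence < self.threshold:
            self.pending[image_id] = label  # remember what the model said
            return "review"
        return "auto"

    def submit_review(self, image_id: str, corrected_label: str) -> bool:
        """Store the reviewer's label for retraining.

        Returns True when the reviewer overrode the model. Raises KeyError
        if the image was never queued for review.
        """
        model_label = self.pending.pop(image_id)
        self.training_examples.append((image_id, corrected_label))
        return model_label != corrected_label


if __name__ == "__main__":
    loop = ReviewLoop()
    print(loop.handle("img-1", "scratch", 0.95))  # confident, served directly
    print(loop.handle("img-2", "scratch", 0.41))  # uncertain, sent to a reviewer
    print(loop.submit_review("img-2", "dent"))    # reviewer corrects the label
    print(loop.training_examples)
```

The threshold trades reviewer workload against label coverage: raising it sends more traffic to humans and yields more training examples per round.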

Signal	Action
Small drift, backbone still valid	Fine-tune on new batch
Large domain change	New data collection + full pipeline review
Latency regression	Profile ONNX/TensorRT inference; consider a smaller model

Next: take the Module 7 quiz, then move on to the deploy vision API project.