Machine Learning

Training-Serving Skew

A discrepancy between how features are computed during model training versus how they are computed during production serving. This is one of the most common and hardest-to-detect causes of model failure.

Why It Matters

Training-serving skew silently degrades model performance. Models appear to work in testing but fail in production because the data pipeline differs.

Example

A model trained with features computed in batch (using future data that would not be available in real time) that then receives different feature values in real-time serving.

Think of it like...

Like practicing basketball with a regulation ball but playing the game with a slightly different ball — the differences are subtle but affect performance.

Related Terms