Machine Learning

CatBoost

A gradient boosting library by Yandex that handles categorical features natively without requiring manual encoding. CatBoost also addresses prediction shift and target leakage.

Why It Matters

CatBoost simplifies the ML pipeline by eliminating the need for manual categorical encoding — a common source of bugs and data leakage.

Example

Training a model on a dataset with features like 'city,' 'product_category,' and 'day_of_week' without converting them to numbers first — CatBoost handles it natively.

Think of it like...

Like a chef who can work with any ingredient in its raw form — no need to pre-process or convert before cooking.

Related Terms