Sparse Model
A neural network in which most parameters are zero or inactive for any given input. Sparse models achieve high capacity at lower computational cost by activating only the parameters relevant to each input.
Why It Matters
Sparsity enables building models with massive total knowledge that are still efficient to run — the key insight behind Mixture of Experts architectures.
Example
A model with 1 trillion total parameters where only 100 billion are active for any single input — massive knowledge, manageable compute.
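The arithmetic above can be made concrete. This sketch uses the example's numbers, plus the common rough estimate that a dense forward pass costs about 2 FLOPs per active parameter per token (an approximation, not a property of any specific model):

```python
total_params = 1_000_000_000_000   # 1 trillion total parameters
active_params = 100_000_000_000    # 100 billion active per input

# Only a tenth of the model participates in any single forward pass.
active_fraction = active_params / total_params
print(f"Active fraction: {active_fraction:.0%}")

# Rough per-token compute estimate: ~2 FLOPs per active parameter.
dense_flops = 2 * total_params
sparse_flops = 2 * active_params
print(f"Compute relative to a dense model: {sparse_flops / dense_flops:.0%}")
```

The model stores the knowledge of a trillion parameters but pays roughly the compute bill of a hundred-billion-parameter one.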
Think of it like...
Like a university with thousands of professors where each student attends classes only from the 20 professors most relevant to their major — the institution's total knowledge is vast, but each student's individual cost is manageable.
Related Terms
Mixture of Experts
An architecture where a model consists of multiple specialized sub-networks (experts) and a gating mechanism that routes each input to only the most relevant experts. Only a fraction of the total parameters are active per input.
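The routing idea can be sketched in a few lines of NumPy. This is a toy illustration with made-up sizes and random weights, not any particular model's architecture: a gate scores every expert, but only the top-k actually run for a given input.

```python
import numpy as np

rng = np.random.default_rng(0)
d_model, n_experts, top_k = 8, 4, 2

# Each "expert" is a small weight matrix (stand-in for a feed-forward block).
experts = [rng.standard_normal((d_model, d_model)) for _ in range(n_experts)]
W_gate = rng.standard_normal((d_model, n_experts))

def moe_forward(x):
    # Gating: score every expert, then keep only the top-k for this input.
    logits = x @ W_gate
    top = np.argsort(logits)[-top_k:]      # indices of the k highest-scoring experts
    weights = np.exp(logits[top])
    weights /= weights.sum()               # softmax over the selected experts only
    # Only the selected experts compute; the others stay inactive for this input.
    return sum(w * (x @ experts[i]) for w, i in zip(weights, top))

x = rng.standard_normal(d_model)
y = moe_forward(x)
print(y.shape)  # one output vector, produced by just 2 of the 4 experts
```

Here 2 of 4 expert matrices are multiplied per input, so half the expert parameters are inactive — the same principle that lets a trillion-parameter MoE run at a fraction of its total cost.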
Pruning
A model compression technique that removes unnecessary or redundant weights, neurons, or layers from a trained neural network. Like pruning a plant, it removes parts that are not contributing to overall health.
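A minimal sketch of one common pruning criterion, magnitude pruning: weights with the smallest absolute values are assumed to contribute least and are set to zero. The matrix and sparsity level below are illustrative, not from a real trained model.

```python
import numpy as np

rng = np.random.default_rng(1)
W = rng.standard_normal((6, 6))  # stand-in for a trained weight matrix

def magnitude_prune(weights, sparsity=0.5):
    """Zero out the smallest-magnitude weights until `sparsity` fraction are zero."""
    k = int(weights.size * sparsity)
    if k == 0:
        return weights.copy()
    # Threshold = the k-th smallest absolute value across the whole matrix.
    threshold = np.sort(np.abs(weights), axis=None)[k - 1]
    return np.where(np.abs(weights) <= threshold, 0.0, weights)

pruned = magnitude_prune(W, sparsity=0.5)
print(f"Zeroed fraction: {(pruned == 0).mean():.0%}")
```

In practice pruning is usually followed by fine-tuning so the remaining weights can compensate for the removed ones.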
Parameter
Any learnable value in a machine learning model that is adjusted during training. Parameters include weights and biases in neural networks. Model size is often described by parameter count.
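Counting parameters is simple arithmetic over the layer shapes. This sketch uses a hypothetical fully connected network (sizes chosen to resemble an MNIST classifier) where each layer contributes inputs × outputs weights plus one bias per output:

```python
# Hypothetical fully connected network: 784 inputs, one hidden layer, 10 outputs.
layer_sizes = [784, 128, 10]

total = 0
for n_in, n_out in zip(layer_sizes, layer_sizes[1:]):
    total += n_in * n_out + n_out  # weight matrix + bias vector per layer

print(f"{total:,} learnable parameters")  # 101,770
```

The same bookkeeping, applied to attention and feed-forward blocks, is how headline figures like "1 trillion parameters" are computed.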