Concept Bottleneck Model
CBM
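A neural network architecture that first predicts human-understandable concepts from the input, then makes its final prediction from those concepts alone. Because every prediction passes through this concept layer (the "bottleneck"), it can be traced back to the concepts that produced it.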
In Plain Language
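An AI that first identifies understandable concepts (e.g., "has stripes," "has four legs") before making its final prediction ("it's a zebra"). Because the final answer is built from those concepts, a person can see why the model decided as it did and can fix a wrongly detected concept to correct the prediction.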
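The two-stage idea can be sketched in a few lines of Python. This is a toy illustration, not a trained model: the concept names, label names, and all weights below are hand-picked assumptions, and the input is assumed to be three numeric features standing in for an image. The function names (predict_concepts, predict_label, cbm_predict) are hypothetical as well.

```python
import math

# Hypothetical concepts and labels, chosen only for this illustration.
CONCEPTS = ["has_stripes", "has_four_legs", "has_mane"]
LABELS = ["zebra", "horse", "tiger"]

# Stage 1 (input -> concepts): one weight row per concept.
# Hand-picked values; a real model would learn these from concept annotations.
CONCEPT_WEIGHTS = [
    [4.0, 0.0, 0.0],
    [0.0, 4.0, 0.0],
    [0.0, 0.0, 4.0],
]
CONCEPT_BIASES = [-2.0, -2.0, -2.0]

# Stage 2 (concepts -> label): one weight row per label.
# This stage sees only the concept values, never the raw input.
LABEL_WEIGHTS = [
    [2.0, 0.5, 0.5],   # zebra: stripes matter most
    [-2.0, 1.0, 1.0],  # horse: stripes count against it
    [2.0, 1.0, -2.0],  # tiger: stripes, but no mane
]


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def predict_concepts(features):
    """Stage 1: map input features to concept probabilities in (0, 1)."""
    return [
        sigmoid(sum(w * x for w, x in zip(row, features)) + bias)
        for row, bias in zip(CONCEPT_WEIGHTS, CONCEPT_BIASES)
    ]


def predict_label(concepts):
    """Stage 2: pick the label with the highest score given the concepts."""
    scores = [sum(w * c for w, c in zip(row, concepts)) for row in LABEL_WEIGHTS]
    return LABELS[scores.index(max(scores))]


def cbm_predict(features, interventions=None):
    """Run both stages; optionally overwrite concepts before stage 2."""
    concepts = predict_concepts(features)
    for name, value in (interventions or {}).items():
        concepts[CONCEPTS.index(name)] = value  # human correction of a concept
    return predict_label(concepts), dict(zip(CONCEPTS, concepts))


if __name__ == "__main__":
    label, concepts = cbm_predict([1.0, 1.0, 1.0])
    print(label, concepts)
    # Override a concept the model got "wrong" and re-run stage 2.
    corrected, _ = cbm_predict([1.0, 1.0, 1.0], {"has_stripes": 0.0})
    print(corrected)
```

The interventions argument shows the "correct" part of the definition: since the label is computed only from the concept values, overwriting a concept changes the prediction without retraining anything.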
