
Neuron Populations Exhibit Divergent Selectivity with Scale
Researchers have discovered that neurons exhibiting consistent activation patterns across independently trained models (Rosetta Neurons) follow predictable scaling laws, but with a counterintuitive twist: while their absolute count grows, they shrink as a fraction of total neurons. More significantly, these neurons become increasingly specialized and monosemantic at scale, suggesting that model scaling drives functional consolidation rather than uniform expansion. This finding extends mechanistic interpretability beyond loss curves into neuron-level behavior, offering practitioners a new lens for understanding how model internals reorganize during training and potentially informing architecture design decisions.62
























