LughM to [email protected]English • 6 months agoAnthropic just published new research that successfully identified and mapped millions of human-interpretable concepts, called “features”, within the neural networks of Claude.www.anthropic.comexternal-linkmessage-square3fedilinkarrow-up140arrow-down11
arrow-up139arrow-down1external-linkAnthropic just published new research that successfully identified and mapped millions of human-interpretable concepts, called “features”, within the neural networks of Claude.www.anthropic.comLughM to [email protected]English • 6 months agomessage-square3fedilink
Wow. This is potentially huge.