Machine Unlearning: The Illusion of Forgetting
RULER exposes the gaps in current unlearning methods, revealing that models might still retain what they're supposed to forget. The AI community can't ignore this.
AI, machine unlearning is supposed to be the magic trick that lets models forget unwanted data without starting from scratch. But are they truly forgetting? That's where RULER steps in.
RULER's Revelation
RULER's not just another tool. It's a breakthrough for verifying if a model has genuinely forgotten certain data. Forget the usual output-level checks. RULER dives deep, looking at what's happening under the hood. And what it found is wild.
The two metrics, M2 and M4, are at the heart of RULER. M2 compares the representation of forgotten data to a model retrained without it. M4 skips retraining altogether, checking internal similarities. And guess what? Twelve conditions were tested, and in 10 of them, M2 cried foul. The forgotten data is still lurking. That's scary.
Unmasking the Illusion
This isn't just some theoretical exercise. Four popular unlearning methods passed all the usual output-level tests but failed miserably in M2. And even when using a different method, whimsically called Bad Teacher, the residuals stuck around.
Now, someone might ask, "Why does this matter?" Well, if machine learning's about trust, these findings shake its core. Imagine a face-recognition model that says it's forgotten your face but hasn't. That's not just sloppy. It's dangerous.
So, What's Next?
The labs are scrambling. They can't ignore this. With M4 signaling issues across tabular, image, clinical text, and face-identity datasets, the stakes are high. If models aren't truly unlearning, what's the point?
And just like that, everything changes. The leaderboard shifts. What's the use of bragging about a model's unlearning prowess if it's based on shaky ground? Trust, accuracy, and privacy are at risk. The AI community needs to wake up and smell the coffee.
Get AI news in your inbox
Daily digest of what matters in AI.