Auditing AI: Precision in a World of Noise
AI governance hinges on verifying claims about AI workloads. But non-determinism in GPU calculations creates challenges. A new approach suggests rounding errors in AI outputs can actually aid auditing.
Verifying AI workloads is essential for credible governance. But here's the catch: GPU floating-point arithmetic introduces a level of uncertainty that complicates the process. When auditors can't achieve exact output matches, the door is open for covert adversaries to manipulate AI systems in ways that defy detection.
The Challenge of Non-Determinism
AI auditors are tasked with ensuring that adversaries aren't exploiting unverified computational freedom. This includes sneaky tactics like steganography or unreported software changes. In a landscape where AI is synonymous with precision, why does non-determinism matter so much? Simply put, it allows adversaries to mask their actions within the noise of normal operations.
Yet, the reality is, exact output matches aren't always feasible. Auditors have to rely on approximate matches, which can be inherently problematic. The numbers tell a different story when you consider the intricacies of GPU calculations. Even minor variations in computation can lead to vastly different outcomes, making traditional verification methods unreliable.
Re-computation as a Solution
Surprisingly, there's a silver lining. Researchers have found that modern inference engines like vLLM and HF transformers can produce consistent outputs without compromising performance. This is contingent on having the right information for re-computation and avoiding certain backend functions. But why stop there? By focusing on accumulated rounding errors, auditors can identify the software and hardware setup used during AI inference. It's an auditable signature rather than a constraint.
What does this mean for the auditing process? It shifts the focus from chasing perfect matches to understanding the underlying architecture. The architecture matters more than the parameter count verifying AI systems. By embracing the nuances of rounding errors, we get a clearer picture of the computational environment.
The Path Forward
But there's more. This approach doesn't require access to identical hardware, which is a big deal for scalability. Using a software-only emulation across various NVIDIA GPU variants, auditors can achieve bitwise-precise re-computation. This flexibility is key in a world where hardware evolves rapidly, and access to specific setups isn't always feasible.
In a field dominated by complexity, simplification seems counterintuitive. Yet, by stripping away the noise and focusing on what truly matters, auditors can outsmart even the most sophisticated adversaries. So, will this new method redefine AI governance? Frankly, it just might. It's an innovative way to turn the challenges of non-determinism into an opportunity for clearer, more reliable audits.
Get AI news in your inbox
Daily digest of what matters in AI.