Auditing AI: Precision in a World of Noise

Verifying AI workloads is essential for credible governance. But here's the catch: GPU floating-point arithmetic introduces a level of uncertainty that complicates the process. When auditors can't achieve exact output matches, the door is open for covert adversaries to manipulate AI systems in ways that defy detection.

The Challenge of Non-Determinism

AI auditors are tasked with ensuring that adversaries aren't exploiting unverified computational freedom. This includes sneaky tactics like steganography or unreported software changes. In a landscape where AI is synonymous with precision, why does non-determinism matter so much? Simply put, it allows adversaries to mask their actions within the noise of normal operations.

Yet, the reality is, exact output matches aren't always feasible. Auditors have to rely on approximate matches, which can be inherently problematic. The numbers tell a different story when you consider the intricacies of GPU calculations. Even minor variations in computation can lead to vastly different outcomes, making traditional verification methods unreliable.

Re-computation as a Solution

Surprisingly, there's a silver lining. Researchers have found that modern inference engines like vLLM and HF transformers can produce consistent outputs without compromising performance. This is contingent on having the right information for re-computation and avoiding certain backend functions. But why stop there? By focusing on accumulated rounding errors, auditors can identify the software and hardware setup used during AI inference. It's an auditable signature rather than a constraint.

What does this mean for the auditing process? It shifts the focus from chasing perfect matches to understanding the underlying architecture. The architecture matters more than the parameter count verifying AI systems. By embracing the nuances of rounding errors, we get a clearer picture of the computational environment.

The Path Forward

But there's more. This approach doesn't require access to identical hardware, which is a big deal for scalability. Using a software-only emulation across various NVIDIA GPU variants, auditors can achieve bitwise-precise re-computation. This flexibility is key in a world where hardware evolves rapidly, and access to specific setups isn't always feasible.

In a field dominated by complexity, simplification seems counterintuitive. Yet, by stripping away the noise and focusing on what truly matters, auditors can outsmart even the most sophisticated adversaries. So, will this new method redefine AI governance? Frankly, it just might. It's an innovative way to turn the challenges of non-determinism into an opportunity for clearer, more reliable audits.

Auditing AI: Precision in a World of Noise

The Challenge of Non-Determinism

Re-computation as a Solution

The Path Forward

Key Terms Explained