Kathleen: A Lean, Mean Text-Processing Machine
Kathleen upends traditional NLP with a minimalist approach, using only 733K parameters. Its unique frequency-domain processing challenges complex architectures.
Kathleen is making waves in text classification with a minimalist architecture that sidesteps conventional NLP complexities. With only 733K parameters, it's challenging the status quo by operating directly on raw UTF-8 bytes without a tokenizer or attention mechanism.
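Tokenizer-free input is easy to picture: every UTF-8 string is already a sequence of integers in [0, 255], so the "vocabulary" is fixed at 256 symbols, with no tokenizer training, no subword merges, and no out-of-vocabulary handling. A minimal illustration:

```python
# Any UTF-8 text is already a sequence of byte values in [0, 255]:
# a fixed 256-symbol "vocabulary" that needs no learned tokenizer.
text = "Kathleen"
byte_ids = list(text.encode("utf-8"))
print(byte_ids)             # [75, 97, 116, 104, 108, 101, 101, 110]
print(max(byte_ids) < 256)  # True -- always, by construction
```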
Innovative Components
The paper's key contribution lies in three novel components. First, the RecurrentOscillatorBanks use damped-sinusoid convolutions with temporal memory to process sequences in O(L) time. Second, the FFT-Rotate Wavetable Encoder maps all 256 byte values from a single learnable vector of 256 floats, replacing a traditional embedding table (typically around 65K parameters) while actually improving accuracy. Third, PhaseHarmonics introduces a sinusoidal non-linearity with just six learnable phase parameters; the ablation study identifies it as the most impactful component, worth 2.6% in accuracy.
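The paper's exact formulations aren't reproduced here, but the three ideas can be caricatured in a few lines of Python. Everything below is a hypothetical sketch: the function names, the shift-based reading of "FFT-Rotate" (a circular shift of the wavetable, which in the frequency domain is a per-bin phase rotation of its FFT), the single-oscillator recurrence, and the phase values are illustrative assumptions, not the authors' code.

```python
import math

N = 256  # byte vocabulary size == wavetable length

# (1) FFT-Rotate Wavetable Encoder -- one plausible reading: the
# embedding for byte value b is the shared 256-float wavetable
# circularly shifted by b positions. A shift in the time domain is
# a per-bin phase rotation of the FFT, hence "FFT-Rotate".
def byte_embedding(wavetable, b):
    return wavetable[-b:] + wavetable[:-b] if b else list(wavetable)

# (2) RecurrentOscillatorBank -- a damped sinusoid as a linear
# recurrence: the 2-D state rotates by theta and decays by r each
# step, so a length-L sequence is processed in O(L) time, O(1) state.
def oscillator_scan(xs, r=0.95, theta=0.3):
    re = im = 0.0
    out = []
    for x in xs:  # h_t = r * rotate(theta) @ h_{t-1} + x_t * (1, 0)
        re, im = (r * (re * math.cos(theta) - im * math.sin(theta)) + x,
                  r * (re * math.sin(theta) + im * math.cos(theta)))
        out.append(re)
    return out

# (3) PhaseHarmonics -- a sinusoidal non-linearity governed by a few
# learnable phase offsets (six in the paper; these values are made up).
def phase_harmonics(x, phases=(0.0, 0.5, 1.0, 1.5, 2.0, 2.5)):
    return sum(math.sin(x + p) for p in phases) / len(phases)

# 256 learnable floats stand in for a conventional 256 x d table.
wavetable = [math.sin(2 * math.pi * n / N) for n in range(N)]
emb = byte_embedding(wavetable, ord("K"))    # byte value 75
feats = oscillator_scan(emb)                 # one O(L) pass
acts = [phase_harmonics(f) for f in feats]   # bounded activations
assert emb[75] == wavetable[0] and len(acts) == N
```

The point of the sketch is the parameter accounting: the encoder owns 256 floats, each oscillator a damping and a frequency, and PhaseHarmonics six phases, which is how the whole model stays in the hundreds of thousands of parameters.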
Performance and Impact
Why should this matter to anyone in the NLP community? Kathleen-Clean achieves impressive results: 88.6% on IMDB, 92.3% on AG News, and 83.3% on SST-2. It even outperforms a tokenized counterpart packing 16 times more parameters, beating it by 1.6% on IMDB and 2.1% on AG News. This isn't just a numbers game; it's about efficiency and scalability. As sequence lengths grow, traditional O(L^2) Transformers buckle under GPU memory constraints. Kathleen, processing in O(L) time, tackles this challenge head-on.
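The quadratic-versus-linear gap is easy to make concrete. A rough back-of-the-envelope sketch (the sequence lengths and fp32 accounting are illustrative, not figures from the paper):

```python
# Memory for a single L x L fp32 attention score map versus a linear
# pass that keeps one fp32 value per position. Byte-level inputs make
# L large quickly, which is where O(L^2) storage starts to hurt.
for L in (1_024, 16_384, 131_072):
    attn_bytes = L * L * 4   # one L x L attention map, 4 bytes/score
    linear_bytes = L * 4     # O(L): one value per position
    print(f"L={L:>7}: attention {attn_bytes / 2**20:9.1f} MiB, "
          f"linear {linear_bytes / 2**10:7.1f} KiB")
```

Even before counting heads and layers, a single attention map at L = 131,072 needs tens of GiB, while the linear pass stays in kilobytes.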
Beyond the Numbers
Critically, this work questions the necessity of ever-more complex architectures in NLP. Does bigger always mean better? Not according to Kathleen. By stripping away architectural complexity and focusing on frequency-domain processing, it shows that sometimes less is indeed more. The ablation study reveals how Kathleen's pared-down components systematically outshine larger, bio-inspired alternatives: removing a hefty 560K-parameter bio-inspired framework drops performance by only 0.2%, whereas losing the 6-parameter PhaseHarmonics costs 2.6%.
In a field where chasing state-of-the-art (SOTA) results often leads to bloated models, Kathleen is a refreshing counter-narrative. It reminds us that innovation thrives when we dare to rethink foundational assumptions. Code and data are available for those eager to test its claims. Is this the start of a minimalist revolution in NLP? Only time, and further benchmarks, will tell.
Key Terms Explained
Attention mechanism
A mechanism that lets neural networks focus on the most relevant parts of their input when producing output.

Classification
A machine learning task where the model assigns input data to predefined categories.

Embedding
A dense numerical representation of data (words, images, etc.).