LocateAnything: Accelerating Visual Grounding with Parallel Box Decoding
LocateAnything introduces a new framework for visual grounding, leveraging Parallel Box Decoding to boost efficiency and precision. With a dataset of over 138 million samples, it redefines speed-accuracy dynamics.