Training-free framework that converts SAM3 into a real-time multi-class open-vocabulary detector. Achieves 55.8 AP on COCO val2017 (80 classes) at 15.8 FPS (4 classes, 1008px) on a single RTX 4080.
Abstract: Recognizing behaviors within classroom settings is vital for gauging educational progress and optimizing teaching methodologies. The complexity of classroom environments often poses ...
Abstract: This paper focuses on real-time object detection systems, analyzing existing Field-Programmable Gate Array (FPGA) implementations that aim to achieve the best efficiency, performance, and ...
OV-DEIM is a real-time DETR-style framework for open-vocabulary object detection. It extends DEIMv2 to the open-vocabulary setting and achieves state-of-the-art performance on open-vocabulary ...