Abstract: Compared with the object detection task, the Human-Object Interaction (HOI) detection is more complicated. There are interactions between humans and objects, but multiple pairs of ...
Abstract: As the core building block of vision transformers, attention is a powerful tool to capture long-range dependency. However, such power comes at a cost: it incurs a huge computation burden and ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results