Abstract: With the increasing prevalence of digital documents replacing traditional paper documents, there is an urgent need to develop techniques to extract information from them. In particular, ...
A novel multimodal object detection framework with sparse transformer and explicit attention module. The illustration of our proposed multimodal object detection framework is shown in the following ...