With the success of Transformers in natural language processing, object detection with Transformers (DETR) has attracted widespread attention. In previous Transformer-based 2D detectors, the object queries are a set of learned embeddings. However, these detectors are hard to apply to the 3D domain because learned object queries lack explicit physical meaning and position priors. In this paper, we introduce the concept of anchors and propose a novel query design based on anchor points. In our query design, we use foreground points as anchor points and encode them as the object queries. Consequently, each object query has an explicit physical meaning and focuses only on its nearby object. Additionally, we propose an instance-aware sampling strategy to select a small set of representative foreground points from the scene point cloud. Extensive experiments on several large-scale 3D object detection datasets demonstrate that the proposed AnchorPoint detector achieves promising accuracy and efficiency. In particular, AnchorPoint achieves an average precision (AP) of 83.21 at 61 frames per second (FPS) on the moderate level of the KITTI-DET Car subset. Moreover, we model each object as its corresponding anchor point and extend the AnchorPoint model to 3D multi-object tracking by adding an extra tracking head. We show that our method achieves performance comparable to existing state-of-the-art methods on the KITTI-MOT dataset.
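
The query design described in the abstract (foreground points used as anchor points, then encoded as object queries) can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not specify the encoding or the sampling procedure, so the sinusoidal encoding, the top-k selection by foreground score (a simple stand-in for the instance-aware sampling strategy), and the names `select_anchor_points`, `encode_anchor_queries`, `dim`, and `temperature` are all assumptions made for illustration.

```python
import numpy as np


def select_anchor_points(points, fg_scores, k):
    """Keep the k points with the highest foreground score.

    Assumption: a plain top-k stand-in, NOT the paper's
    instance-aware sampling strategy.
    """
    k = min(k, len(points))
    idx = np.argsort(-fg_scores)[:k]  # indices of the k largest scores
    return points[idx]


def encode_anchor_queries(anchors, dim=48, temperature=10000.0):
    """Map (N, 3) anchor coordinates to (N, dim) query vectors.

    Assumption: a sinusoidal positional encoding; the paper may
    use a different (e.g. learned) encoder.
    """
    if dim % 6 != 0:
        raise ValueError("dim must be divisible by 6 (3 axes x sin/cos)")
    n_freq = dim // 6
    # Geometrically spaced frequencies, as in standard positional encodings.
    freqs = temperature ** (-np.arange(n_freq) / n_freq)      # (n_freq,)
    angles = anchors[:, :, None] * freqs[None, None, :]       # (N, 3, n_freq)
    enc = np.concatenate([np.sin(angles), np.cos(angles)], axis=-1)
    return enc.reshape(len(anchors), dim)                     # (N, dim)


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    points = rng.uniform(-10.0, 10.0, size=(100, 3))  # synthetic point cloud
    fg_scores = rng.uniform(size=100)                 # synthetic foreground scores
    anchors = select_anchor_points(points, fg_scores, k=16)
    queries = encode_anchor_queries(anchors, dim=48)
    print(anchors.shape, queries.shape)
```

Because each query is a deterministic function of a 3D location, it carries an explicit position prior, in contrast to a free learned embedding that has no fixed association with any point in the scene.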





    Title :

    AnchorPoint: Query Design for Transformer-Based 3D Object Detection and Tracking


    Contributors:
    Liu, Hao (author) / Ma, Yanni (author) / Wang, Hanyun (author) / Zhang, Chaobo (author) / Guo, Yulan (author)

    Published in:

    Publication date :

    2023-10-01


    Size :

    2396469 byte




    Type of media :

    Article (Journal)


    Type of material :

    Electronic Resource


    Language :

    English



    Bus tracking query system

    MAO MAOJUN | European Patent Office | 2015

    Free access

    Fast Vision Transformer via Query Vector Decoupling

    Sun, Donghao / Liu, Jiashun / Liu, Jiaxing et al. | IEEE | 2021


    Space Object Query Tool

    Phillips, Veronica J. | NTRS | 2017


    Transformer Sub-Patch Matching for High-Performance Visual Object Tracking

    Tang, Chuanming / Hu, Qintao / Zhou, Gaofan et al. | IEEE | 2023


    OBJECT DETECTION AND TRACKING

    DAS SUBHASIS / PHILBIN JAMES WILLIAM VAISEY / ZWIEBEL BENJAMIN ISAAC et al. | European Patent Office | 2021

    Free access