arxiv Cross Modal Transformer: Towards Fast and Robust 3D Object Detection