arXiv: Revisiting Weakly Supervised Pre-Training of Visual Perception Models