arXiv VeCLIP: Improving CLIP Training via Visual-enriched Captions