arxiv Mass-Producing Failures of Multimodal Systems with Language Models