arxiv Understanding and Overcoming the Challenges of Efficient Transformer Quantization