arxiv TextMonkey: An OCR-Free Large Multimodal Model for Understanding Document