arxiv Integer or Floating Point? New Outlooks for Low-Bit Quantization on Large Language Models

名称
Integer or Floating Point? New Outlooks for Low-Bit Quantization on Large Language Models
首页
https://yiyibooks.cn/arxiv/2305.12356v1/index.html
原始地址
https://arxiv.org/pdf/2305.12356
描述
有效的大型语言模型(LLMS)需要低位量化以最大程度地减少模型大小和推理成本。而低位整数格式(例如 ...