arXiv:2211.10438 [cs.CL]AbstractReferencesReviewsResources Classifications Subjects Themes Keywords large language models, efficient post-training quantization, smoothquant, achieve faster inference, reduces hardware costs Tags Journal Information Publisher Journal Year Month Volume Number Pages DOI URL Miscellaneous Typesetting Pages Language License Submit Reset