Exploring foci of:
arXiv (Cornell University)
NeUQI: Near-Optimal Uniform Quantization Parameter Initialization
May 2025 • Lin Li, Xinyu Hu, Xiaojun Wan
Large language models (LLMs) achieve impressive performance across domains but face significant challenges when deployed on consumer-grade GPUs or personal devices such as laptops, due to high memory consumption and inference costs. Post-training quantization (PTQ) of LLMs offers a promising solution that reduces their memory footprint and decoding latency. In practice, PTQ with uniform quantization representation is favored for its efficiency and ease of deployment since uniform quantization is widely supported b…
Main Page
Zohran Mamdani
Weapons (2025 Film)
Squid Game Season 3
Victoria Mboko
Lauren Sánchez
Jeff Bezos
Shefali Jariwala
F1 (Film)
Takahiro Shiraishi
The 1975
Matty Healy
Mira Nair
Kannappa (Film)
Squid Game
Alanis Morissette
Truth And Reconciliation Commission Of Canada
2025 Nba Draft
28 Years Later
Mahmood Mamdani
Reich Ministry Of Public Enlightenment And Propaganda
Rick Hurst
Fuck
Degenerate Art Exhibition
Mutual Aid
Kpop Demon Hunters
Anna Wintour