Quantization in Machine Learning

Joint Privacy Enhancement and Quantization in Federated Learning

Abstract: Federated learning (FL) is an emerging paradigm for training machine learning models using possibly private data available at edge devices. The distributed operation of FL gives rise to ...

3 days ago

Changing AI math could reduce the hardware burden, researchers show

Sophisticated AI models tend to require a lot of memory and take up a lot of storage space. One of the ways to reduce that ...

The Manila Times

Dnotitia Unveils STAR-KV, Achieving Up to 20x KV Cache Compression, Selected as an ICML ...

Introduces a low-rank-based approach to KV cache compression, one of the key bottlenecks in long-context AI; speeds up attention computation by up to 6.9x and overall generation throughput by up to 3.1x ...

Vietnam Investment Review

Dnotitia's STAR-KV cuts KV cache by up to 20x, earns ICML 2026 Spotlight selection

KV, a low-rank KV cache compression method achieving up to 20x reduction, with the paper selected as a Spotlight at ICML 2026 ...

Yahoo Finance

Nota AI Has Two MoE Quantization Papers Accepted at ICML 2026 Workshop, Demonstrating ...

SEOUL, South Korea, June 11, 2026 /PRNewswire/ -- Nota AI, a company specializing in AI model compression and optimization, announced that two of its papers on MoE-specific quantization algorithms ...

Yehey.com

Yehey.com - Machine Learning in 2026: Building the Future of Intelligent Systems

Image courtesy of QUE.com. As we navigate the landscape of 2026, we find ourselves no longer merely using Machine Learning (ML) but ...

IEEE

Quantizing Heavy-Tailed Data in Statistical Estimation: (Near) Minimax Rates, Covariate ...

Abstract: Modern datasets often exhibit heavy-tailed behavior, while quantization is inevitable in digital signal processing and many machine learning problems. This paper studies the quantization of ...

manilatimes

Nota AI Has Two MoE Quantization Papers Accepted at ICML 2026 Workshop, Demonstrating ...

Two papers on MoE-specific quantization algorithms accepted at a workshop held in conjunction with ICML 2026; recognition follows Nota AI's overall win at the NVIDIA Nemotron Hackathon; strengthening ...

Utusan Malaysia

Nota AI Has Two MoE Quantization Papers Accepted at ICML 2026 Workshop, Demonstrating ...

Two papers on MoE-specific quantization algorithms accepted at a workshop held in conjunction with ICML 2026; recognition follows Nota AI’s overall win at the NVIDIA Nemotron Hackathon; strengthening ...
