Google’s TurboQuant Compresses ‘KV Cache’ to One-Sixth, Rattling Chip Stocks
AI-summarised brief · reviewed before publication
Google Research has unveiled TurboQuant, an AI memory optimization technology that compresses "KV cache" to one-sixth, lowering costs. This technology eases the memory bottleneck in AI adoption, but has raised concerns it may reduce demand for memory chips. Developed with DeepMind and New York University, TurboQuant boosts computational speed up to eight-fold, sparking hopes for wider AI adoption and rattling semiconductor stocks, including Samsung and SK hynix, with potential long-term impact on the industry.