Mar 27
In-depth: Google TurboQuant cuts LLM memory 6x, resets AI inference cost curve

Google has introduced TurboQuant, a compression algorithm that reduces large language model (LLM) memory usage by at least 6x while boosting performance, targeting one of AI's most persistent bottlenecks: memory. The breakthrough lowers inference costs and expands deployment across cloud and edge environments.

AI-driven demand is tightening global memory supply, pushing NAND flash and server DRAM into shortages, price hikes, and capacity constraints. Server memory demand is expected to grow more than 40% in 2026, accounting for over half of total storage usage.

The global DRAM industry is approaching a structural inflection point, as traditional scaling methods struggle to deliver the performance gains required by artificial intelligence workloads. With next-generation architectures such as 4F-squared (4F²) and 3D DRAM facing rising complexity and potential delays, manufacturers are being forced to reassess near-term roadmaps and rely more heavily on incremental and material-level improvements.

Semiconductor manufacturers are racing to secure critical materials as Middle East tensions disrupt supply chains, with the risk of production disruption outweighing rising costs.

SMIC on March 26 outlined an action plan to strengthen core operations and identify new growth drivers in 2026, targeting revenue growth above the industry average as it reinforces its role in China's semiconductor self-sufficiency strategy.

SMIC, China's largest chipmaker, is accused by senior Trump administration officials of supplying chipmaking equipment to Iran's military, escalating tensions linked to the ongoing US-Israel conflict with Tehran.

Samsung Electronics is facing a looming labor strike in May as its memory and foundry businesses take off, marking a more complex challenge than the initial 2024 walkout. The upcoming strike reflects significant changes in industry conditions and union size, highlighting Samsung's structural difficulties with labor issues amid evolving laws and its broad business portfolio.

Chang Wah Technology (CWTC) approved a plan to build a new factory in Weihai, Shandong, and confirmed leadership changes as it presses on with overseas expansion to boost production capacity and operational growth. The board's extraordinary meeting authorized a tentative CNY1 billion (US$145 million) investment through a subsidiary.

Innodisk told attendees at the 2026 AI EXPO that effective AI deployment requires more than raw computing power; it depends on tight integration between software and hardware, and on selecting components tailored to specific environments. The company argued that edge AI has progressed from image recognition and language models to autonomous learning and decision-making.

Amid the rapid development of generative artificial intelligence (GenAI) and large language models (LLMs), global demand for high-performance computing (HPC) continues to rise. Memory module maker Adata Technology announced a US$3 million investment in the Series A funding round of AI computing infrastructure provider KonstTech (Konst).

SEMICON China 2026 spotlighted the scale of the AI investment boom, with Handel Jones, CEO of International Business Strategies (IBS), estimating that global AI and data center capital expenditure has surged from about US$110 billion in 2020 to roughly US$600 billion in 2026.

Despite a surge in demand driven by generative artificial intelligence, the fundamental economics of the memory industry remain largely intact. While high-bandwidth memory (HBM) has created a premium segment, the broader market continues to operate on standardized, high-volume production rather than structural product differentiation.