CONNECT WITH US
NEWS TAGGED AI INFERENCE
Friday 21 March 2025
Nvidia CEO dismisses ASIC threat as "noncompetitive" — but AI inference competition is heating up
At a GTC media roundtable, when pressed on whether application-specific integrated circuits (ASICs) threaten Nvidia's AI dominance, CEO Jensen Huang didn't mince words.
Thursday 20 March 2025
Nvidia launches Blackwell Ultra AI inference platform, Taiwanese manufacturers eye 2H25 deployment
Nvidia CEO Jensen Huang officially unveiled the Blackwell artificial intelligence (AI) factory platform "Blackwell Ultra" during his keynote speech at GTC. This new platform enhances...
Thursday 20 March 2025
China scales up DeepSeek AI inference clusters; Huawei Ascend takes lead
Since emerging in mid-January 2025, DeepSeek has rapidly reshaped the AI landscape. As its presence grows into a second month, OpenAI has escalated its opposition—first threatening...
Thursday 20 March 2025
Nvidia unleashes RTX Pro 6000 Blackwell; key partners ready to ship
Nvidia officially launched the RTX Pro 6000 Blackwell series of workstations and servers, redefining professional workflows for AI, technology, creative, engineering, and design professionals...
Wednesday 19 March 2025
GTC 2025: Neousys showcases rugged Edge AI computing solutions for smart applications
Neousys Technology Inc., a leading industrial PC (IPC) manufacturer, is unveiling its next-generation rugged edge AI computing platform at GTC 2025 to drive real-world AI adoption...
Wednesday 19 March 2025
GTC 2025: Nvidia redefines AI computing with Blackwell Ultra DGX SuperPOD, instant AI factory
Nvidia unveiled its next-generation DGX SuperPOD AI infrastructure at GTC 2025, powered by Blackwell Ultra GPUs. Engineered for agent-based AI inference, the system provides enterprises...
Wednesday 12 March 2025
Deepseek fuels China's AI boom, driving SSD demand, says DapuStor chairman
Deepseek's rapid adoption is accelerating AI application growth in China, fueling demand for enterprise-grade SSDs. DapuStor Chairman Yafei Yang said AI all-in-one machines are lowering...
Wednesday 5 March 2025
Nvidia's Blackwell bets big on AI inference despite Deepseek disruption
Since DeepSeek's open-source debut in January, its V3 and R1 models have rocked Silicon Valley, raising questions about whether high-performance AI can be built without costly GPU...
Tuesday 11 February 2025
Amazon expects Trainium chip series to reach 4th gen amid growing adoption
Amazon has revealed that its in-house designed Trainium 2 processor is gaining traction among potential users, including Qualcomm, who are participating in its early-stage evaluation...
Monday 10 February 2025
DeepSeek API price hikes signal the end of low-cost AI
DeepSeek's conclusion of its promotional trial period brings substantial price increases to its API services, potentially shifting enterprise strategies toward on-premise deployments...
Saturday 8 February 2025
IBM CEO backs open-source small models to cut AI inference costs
The low-cost open-source AI model from Chinese startup DeepSeek has ignited widespread debate, drawing responses from major American tech giants. IBM, which has recently prioritized...
Friday 7 February 2025
Amazon acknowledges FX impacts, sees promise in DeepSeek and AI inference
During the earnings call that followed the release of Amazon's financial results, senior executives acknowledged the adverse effects of foreign exchange fluctuations while expressing...
Wednesday 8 January 2025
Arm predicts AI boom and smartphones' enduring dominance in consumer tech
Arm has provided insights on technology trends for 2025 and beyond, including chip design, AI, and market dynamics.
Monday 18 November 2024
Foxconn advances into the AI server market at SC24, showcasing GB200 racks, new HGX products
Foxconn, through its subsidiary FII and Ingrasys, made a strong showing at Supercomputing 2024 (SC24) in Atlanta, the US, demonstrating its latest AI server technology portfolio....
Wednesday 30 October 2024
OpenAI, Broadcom working to develop AI inference chip
OpenAI is working with Broadcom Inc. to develop a new artificial intelligence chip specifically focused on running AI models after they've been trained, according to two people familiar...