Navigation
Search
|
Alibaba Cloud Says It Cut Nvidia AI GPU Use By 82% With New Pooling System
Tuesday October 21, 2025. 12:00 PM , from Slashdot
![]() The system was tested in production over several months, according to the paper, which lists authors from both Peking University and Alibaba's infrastructure division, including CTO Jingren Zhou. During that window, the number of GPUs needed to support dozens of different LLMs -- ranging in size up to 72 billion parameters -- fell from 1,192 to just 213. While the paper does not break down which models contributed most to the savings, reporting by the South China Morning Post says the tests were conducted using Nvidia's H20, one of the few accelerators still legally available to Chinese buyers under current U.S. export controls. Read more of this story at Slashdot.
https://hardware.slashdot.org/story/25/10/21/005243/alibaba-cloud-says-it-cut-nvidia-ai-gpu-use-by-8...
Related News |
25 sources
Current Date
Oct, Tue 21 - 21:28 CEST
|