Marvell Custom HBM Compute Architecture to Optimize Cloud AI Accelerators
AI accelerator (XPU) architecture enables up to 25% more compute, 33% greater memory while improving power efficiency and collaboration with Micron, Samsung and SK hynix on custom high-bandwidth memory (HBM) solutions to deliver custom XPUs
This is a Press Release edited by StorageNewsletter.com on December 13, 2024 at 2:01 pmSummary:
- Marvell AI accelerator (XPU) architecture enables up to 25% more compute, 33% greater memory while improving power efficiency.
- Marvell collaborating with Micron, Samsung and SK hynix on custom high-bandwidth memory (HBM) solutions to deliver custom XPUs.
- Architecture comprises advanced die-to-die interfaces, HBM base dies, controller logic and advanced packaging for new XPU designs.
Marvell Technology, Inc. announced that it has pioneered a new custom HBM compute architecture that enables XPUs to achieve greater compute and memory density.
This technology is available to all of its custom silicon customers to improve the performance, efficiency and TCO of their custom XPUs. The company is collaborating with its cloud customers and leading HBM manufacturers, Micron, Samsung Electronics, and SK hynix to define and develop custom HBM solutions for next-gen XPUs.
Diagram of an XPU with standard HBM. Below, an artist’s rendering of an XPU with custom HBM from Marvell. Note the reduced size of the I/O. This can be accomplished by customizing the controller or PHY that are part of the base die.
(source: Marvell)
HBM is a critical component integrated within the XPU using advanced 2.5D packaging technology and high-speed industry-standard interfaces. However, the scaling of XPUs is limited by the current standard interface-based architecture. The Marvell custom HBM compute architecture introduces tailored interfaces to optimize performance, power, die size, and cost for specific XPU designs. This approach considers the compute silicon, HBM stacks, and packaging. By customizing the HBM memory subsystem, including the stack itself, the company is advancing customization in cloud data center infrastructure. The firm is collaborating with major HBM makers to implement this new architecture and meet cloud data center operators’ needs.
The Marvell custom HBM compute architecture enhances XPUs by serializing and speeding up the I/O interfaces between its internal AI compute accelerator silicon dies and the HBM base dies. This results in greater performance and up to 70% lower interface power compared to standard HBM interfaces. The optimized interfaces also reduce the required silicon real estate in each die, allowing HBM support logic to be integrated onto the base die. These real-estate savings, up to 25%, can be used to enhance compute capabilities, add new features, and support up to 33% more HBM stacks, increasing memory capacity per XPU. These improvements boost XPU performance and power efficiency while lowering TCO for cloud operators.
“The leading cloud data center operators have scaled with custom infrastructure. Enhancing XPUs by tailoring HBM for specific performance, power, and total cost of ownership is the latest step in a new paradigm in the way AI accelerators are designed and delivered,” said Will Chu, SVP and GM, custom, compute and storage group, Marvell. “We’re very grateful to work with leading memory designers to accelerate this revolution and, help cloud data center operators continue to scale their XPUs and infrastructure for the AI era.“
“Increased memory capacity and bandwidth will help cloud operators efficiently scale their infrastructure for the AI era,” said Raj Narasimhan, SVP and GM, compute and networking business unit, Micron Technology, Inc. “Strategic collaborations focused on power efficiency, such as the one we have with Marvell, will build on Micron’s industry-leading HBM power specs, and provide hyperscalers with a robust platform to deliver the capabilities and optimal performance required to scale AI.“
“Optimizing HBM for specific XPUs and software environments will greatly improve the performance of cloud operators’ infrastructure and ensure efficient power use,” said Harry Yoon, corporate EVP, and head of Americas products and solutions planning, Samsung Electronics Co. Ltd. “The advancement of AI depends on such focused efforts. We look forward to collaborating with Marvell, a leader in custom compute silicon innovation.“
“By collaborating with Marvell, we can help our customers produce a more optimized solution for their workloads and infrastructure,” said Sunny Kang, VP, DRAM technology, SK hynix America. “As one of the leading pioneers of HBM, we look forward to shaping this next evolutionary stage for the technology.“
“Custom XPUs deliver superior performance and performance per watt compared to merchant, general-purpose solutions for specific, cloud-unique workloads,” said Patrick Moorhead, CEO and founder, Moor Insights & Strategy. “Marvell, already a player in custom compute silicon, is already delivering tailored solutions to leading cloud companies. Their latest custom compute HBM architecture platform provides an additional lever to enhance the TCO for custom silicon. Through strategic collaboration with leading memory makers, Marvell is poised to empower cloud operators in scaling their XPUs and accelerated infrastructure, thereby paving the way for them to enable the future of AI.”
Resource:
Blog: Custom HBM: What Is It and Why It’s the Future