Memory Bottleneck in Hardware Design
AI's Storage Demands: Can Hardware Designers Keep Up?

By Justin Sears* | Translated by AI | Reading time: 5 min

Artificial intelligence is becoming part of everyday life, whether in wearables, autonomous vehicles, or cloud models. This puts unprecedented pressure on hardware development, and memory is emerging as the critical bottleneck for AI systems at every level.

When discussing hardware in AI applications, the focus is often on GPUs and other hardware accelerators. In hardware design, however, it is actually memory that increasingly challenges developers, not only in capacity but also in access times.
(Image: freely licensed / Pixabay)

AI workloads rely on fast, high-density memory to feed data-hungry models. Whether a generative model is being trained in the data center or object recognition is running on an edge device, memory bandwidth and capacity are now the key factors limiting performance, energy efficiency, and thermal stability.
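A back-of-the-envelope roofline-style check illustrates why bandwidth, not raw compute, so often sets the limit. All hardware figures below are hypothetical examples, not datasheet values:

```python
# Back-of-the-envelope roofline check: is a kernel compute-bound or
# memory-bound? All hardware figures are hypothetical examples.

def bound_by(flops, bytes_moved, peak_flops, peak_bw):
    """Return the limiting resource plus both idealized runtimes."""
    t_compute = flops / peak_flops      # seconds if compute-limited
    t_memory = bytes_moved / peak_bw    # seconds if bandwidth-limited
    return ("memory" if t_memory > t_compute else "compute",
            t_compute, t_memory)

# Example: one matrix-vector pass of autoregressive inference at batch
# size 1, which reads every weight once per generated token.
params = 7e9                 # 7B-parameter model (hypothetical)
bytes_moved = params * 2     # fp16 weights read once
flops = params * 2           # one multiply-add per weight

# Hypothetical accelerator: 300 TFLOP/s fp16, 1 TB/s memory bandwidth
limit, t_c, t_m = bound_by(flops, bytes_moved, 300e12, 1e12)
print(limit)                         # -> memory
print(f"{t_m / t_c:.0f}x the compute time")  # -> 300x the compute time
```

With these illustrative numbers, reading the weights takes 300 times longer than the arithmetic itself, which is exactly the regime the article describes: the accelerator waits on memory.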

To meet these requirements, hardware teams draw on a range of memory architectures: High-Bandwidth Memory (HBM) for wide I/O channels and high throughput in AI training; GDDR6/GDDR7 for graphics-intensive or inference-heavy tasks; LPDDR5/LPDDR5X for power-constrained edge AI applications; and 3D-stacked DRAM for more capacity in a smaller footprint. Emerging nonvolatile technologies such as MRAM and ReRAM look promising for persistent AI state storage and faster boot times on edge devices, but remain largely in development for mainstream AI applications. Each of these technologies comes with specific limitations in power consumption, compatibility, thermal design, availability, and performance, forcing design trade-offs at the system level.
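Such trade-offs can be screened programmatically before committing to a part. The sketch below filters memory options against system constraints; every figure is an illustrative placeholder, not a datasheet value:

```python
# Screening memory options against system constraints.
# All numbers are illustrative placeholders, not datasheet values.

CANDIDATES = {
    #                GB/s per device  pJ/bit  interposer-class packaging
    "HBM3":    dict(bw=800, energy=4.0, adv_pkg=True),
    "GDDR6":   dict(bw=64,  energy=7.0, adv_pkg=False),
    "LPDDR5X": dict(bw=8.5, energy=3.0, adv_pkg=False),
}

def shortlist(min_bw_gbs, max_pj_per_bit, allow_adv_pkg=True):
    """Return memory types meeting bandwidth, energy, and packaging limits."""
    return [name for name, c in CANDIDATES.items()
            if c["bw"] >= min_bw_gbs
            and c["energy"] <= max_pj_per_bit
            and (allow_adv_pkg or not c["adv_pkg"])]

# A power-constrained edge design: modest bandwidth, tight energy budget,
# and no interposer-based packaging allowed.
print(shortlist(min_bw_gbs=5, max_pj_per_bit=5, allow_adv_pkg=False))
# -> ['LPDDR5X']
```

In practice, the candidate table would be populated from vendor datasheets and availability data, but even a toy version makes the constraint interactions explicit early in the design phase.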

Hardware Design for AI Is Defined by Memory

In traditional hardware development processes, memory selection occurred only after decisions about the CPU or GPU had been made. In the era of AI, this order has reversed: hardware developers now find that the choice of memory architecture shapes the entire hardware stack, affecting board layout, power-delivery design, and the form factor of the final product.

The selection of high-speed GDDR6 can enable faster AI inference but requires dedicated power rails and more complex PCB layouts, which introduce additional thermal and EMC challenges. Using LPDDR5 in battery-powered mobile devices reduces power consumption but limits the available bandwidth, potentially constraining model size or inference rates. High Bandwidth Memory (HBM), on the other hand, offers enormous throughput potential but demands advanced packaging and cooling technologies, such as vapor chambers or liquid cooling systems, both of which drive up costs.

These are not theoretical concerns. As memory requirements for AI evolve rapidly, developers must define the memory configuration and interfaces earlier—often before software models and firmware are stabilized. This increases the risk associated with early design decisions: a poorly calculated memory choice may necessitate a PCB redesign or limit future upgrade paths.

Volatility in the Memory Supply Chain

With the rapid proliferation of AI models, demand for high-performance memory technologies is rising as well. DRAM and NAND, however, have always been cyclical, highly price-sensitive markets, and the current AI-driven surge in demand is amplifying that volatility.

HBM, GDDR6, and LPDDR5 are now considered strategic key technologies. Their production is concentrated in South Korea, which dominates the DRAM market. Taiwan, on the other hand, leads in advanced packaging and foundry services. Japan supplies materials and specialty memory.

This geographical concentration entails certain risks, including geopolitical instability, such as tensions in the Taiwan Strait. Additionally, export controls and trade restrictions pose challenges. Manufacturing bottlenecks are also an issue, as EUV lithography and DRAM-specific equipment are only available in a limited number of factories. Material shortages, such as for fluorinated process gases or specialized photoresists, can further disrupt the supply chain and delay deliveries.

For developers working on AI-driven new launches, this means longer lead times and a higher risk of component obsolescence—such as when memory roadmaps shift or a specific component becomes scarce.

It's Not Just About Memory Capacity: Access Matters Too

In AI hardware, more memory helps only if it is the right type of memory, installed in the right location and connected in the right way.

The transition from universal computing tasks to AI-centric workloads opens up new design approaches for product development teams. Tightly coupled memory reduces latency but requires deeper integration with processors or SoCs. Loosely coupled memory offers more flexibility but can lead to bottlenecks depending on the architecture. Memory access patterns—such as tensor reuse, stride accesses, or sparsity—must be optimized according to the model structure and compute pipeline. Partitioning decisions—such as storing weights in HBM, activations in LPDDR, and intermediate data in NVM—also significantly impact performance, thermal behavior, and battery life.
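The impact of such partitioning decisions can be estimated before any silicon exists. The sketch below computes per-inference data-movement time for a hypothetical three-tier split (weights in HBM, activations in LPDDR, intermediates in NVM); the bandwidths and traffic volumes are placeholders, not measurements:

```python
# Estimating per-inference data-movement time for a tiered memory
# partition. Bandwidths and traffic volumes are hypothetical placeholders.

TIERS = {            # sustained bandwidth in bytes/s (illustrative)
    "HBM":   800e9,
    "LPDDR": 60e9,
    "NVM":   5e9,
}

TRAFFIC = {          # bytes moved per inference (illustrative)
    "HBM":   2e9,    # weight reads
    "LPDDR": 200e6,  # activation traffic
    "NVM":   50e6,   # checkpointed intermediate data
}

def tier_times(traffic, tiers):
    """Seconds spent moving data on each tier. If transfers overlap,
    the slowest tier sets the floor; if they serialize, the sum does."""
    return {t: traffic[t] / tiers[t] for t in traffic}

times = tier_times(TRAFFIC, TIERS)
for tier, t in sorted(times.items(), key=lambda kv: -kv[1]):
    print(f"{tier}: {t * 1e3:.2f} ms")
```

With these placeholder numbers, the slow NVM tier dominates despite carrying the least data, which is why partitioning decisions deserve simulation rather than intuition.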

Compatibility is a technical minefield. Developers must ensure that the memory is not only electrically compatible with AI chips, FPGAs, or SoCs but also logically aligned with parameters such as bandwidth, latency, and parallelism. An incorrect configuration can significantly limit computational performance, increase energy consumption, or result in costly hardware accelerators being underutilized.
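One such logical-alignment check can be done with simple arithmetic: does the chosen interface sustain the target inference rate once link efficiency is derated? The names and numbers below are hypothetical:

```python
# Sanity check: does the chosen memory interface sustain the target
# inference rate? All names and numbers are hypothetical.

def required_bw(bytes_per_inference, inferences_per_sec):
    """Minimum sustained bandwidth (bytes/s) for a target rate."""
    return bytes_per_inference * inferences_per_sec

# Hypothetical edge model: 50 MB of memory traffic per inference,
# target of 30 inferences per second.
need = required_bw(50e6, 30)      # 1.5 GB/s required
have = 0.8 * 8.5e9                # hypothetical link, derated to 80%
print("OK" if have >= need else "bandwidth shortfall")  # -> OK
```

Even this crude check catches the failure mode the article warns about: a costly accelerator sitting idle because the memory feeding it was never matched to the workload.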

Proactive Memory Planning As A Clear Competitive Advantage

Companies that successfully bring AI hardware to market today not only optimize performance but also embed resilience into their memory strategy from the very beginning.

This means that risks related to procurement and lifecycle status must be considered during memory selection. Memory access and throughput should be simulated early in the design phase. Hardware, software, and supply chain teams need to collaborate to identify obstacles in a timely manner. Additionally, investing in design platforms that support real-time collaboration and component intelligence is worthwhile.

When design and procurement teams work in isolation, memory decisions are often delayed or made without coordination. The result is costly errors and missed market opportunities. However, when teams collaborate from the start, they can identify alternatives, reduce availability risks, and develop systems that balance power consumption, computational performance, and supply chain security.

As developers increasingly deploy AI across all areas—from edge sensors to data center infrastructure—memory becomes a critical lever in the competitive landscape, where strategies for performance, scalability, and component availability intersect. Those who view memory not as a secondary factor but as a key design constraint will successfully bring to market products that meet the demands of evolving AI workloads. This provides a clear advantage in the race for the next product generation. (sg)

*Justin Sears is responsible for product marketing of B2B SaaS platforms at Altium. As "Head of Product Marketing for SaaS," he leads the team that positions cloud solutions for electronics development in the market.