AI Inference Using CPU Why GPU-Only Systems Are No Longer Sufficient for Autonomous AI Agents

From Manuel Christa | Translated by AI 2 min Reading Time

Related Vendor

AI applications are increasingly working independently. However, pure graphics processors are reaching their limits. Intel and Sambanova are now developing an architecture that combines different chip types.

Infrastructure for AI agents: Intel and Sambanova share computing load across three processor types(Image: Gemini / AI-generated)
Infrastructure for AI agents: Intel and Sambanova share computing load across three processor types
(Image: Gemini / AI-generated)

When AI no longer just generates texts, but writes code and queries databases independently as an agent, previous hardware structures come to a standstill. AI systems based on graphics processing units (GPUs) work inefficiently with these complex tasks. To resolve this bottleneck, Intel and Sambanova have presented a blueprint for future server infrastructures. From the second half of 2026, they want to offer data centers and cloud providers hardware that specifically divides up the computing work.

The concept is based on a heterogeneous architecture. Instead of burdening a single chip type with all phases of AI calculation, the partners assign the appropriate processor to each step.

Special Chips for Each Calculation Phase

The design provides for a precise division of tasks. GPUs take over the starting signal: in the prefill phase, they process long input texts into usable data records. Then the specialized RDU chips from Sambanova take over. They generate the responses of the language models and ensure a high data throughput during decoding.

Intel contributes the Xeon 6 server CPU as a control center. This coordinates the tasks of the AI agents and distributes the workload. At the same time, the Intel chip acts as an executing element that compiles and checks the code. Rodrigo Liang, head of Sambanova, summarizes: "Agent AI is going into production—and the success pattern we see is GPUs to start the job, Intel Xeon 6 to run it, and Sambanova RDUs to finish it quickly."

Software Compatibility As A Basis

Compatibility is a key advantage of the architecture. As Intel supplies the host processors, companies can continue to use their x86 software environment without any adjustments. No single type of chip is optimal for every phase of a workflow. Intel and Sambanova's blueprint stands out because it combines high performance with fewer chips and full compatibility with the existing software ecosystem.

Systems in comparison: According to the manufacturers, the architecture from Intel and Sambanova for large AI models requires significantly fewer chips and less power than competing solutions and is easier to integrate into existing data centers.(Image: Sambanova)
Systems in comparison: According to the manufacturers, the architecture from Intel and Sambanova for large AI models requires significantly fewer chips and less power than competing solutions and is easier to integrate into existing data centers.
(Image: Sambanova)

Sambanova's own measurements support this approach. According to the company's data, the Xeon 6 compiles code more than 50 percent faster than server CPUs based on the Arm architecture. The Intel chip also works up to 70 percent faster with vector databases than comparable competitor products from the x86 camp. The cooperation between the two companies thus marks a clear step from pure feasibility studies to the broad commercial use of AI agents in data centers. (mc)

Subscribe to the newsletter now

Don't Miss out on Our Best Content

By clicking on „Subscribe to Newsletter“ I agree to the processing and use of my data according to the consent form (please expand for details) and accept the Terms of Use. For more information, please see our Privacy Policy. The consent declaration relates, among other things, to the sending of editorial newsletters by email and to data matching for marketing purposes with selected advertising partners (e.g., LinkedIn, Google, Meta)

Unfold for details of your consent