How AMD & Red Hat Are Rebuilding Gen AI Infrastructure

Tech industry heavyweights AMD and Red Hat are set to broaden their strategic partnership, with the aim of bringing enhanced AI infrastructure to modern enterprises.
It's a collaboration that was first introduced at the Red Hat Summit.
Their work together is designed to provide innovative solutions for organisations that are grappling to maintain equilibrium between their existing IT systems and the growing demands of AI workloads.
Both companies are keen on expanding their offerings across hybrid cloud environments, focusing on optimised AI models and the cost-effective modernisation of traditional virtual machines (VMs).
- Red Hat and AMD power leading-edge AI inference performance with vLLM on AMD Instinct GPUs
- Red Hat OpenShift Virtualization on AMD EPYC CPUs to help organisations more easily modernise existing systems for future innovations
A highlight of this partnership is set to be the full integration of AMD Instinct GPUs on Red Hat OpenShift AI, with the former's GPUs pivotal in processing AI across diverse cloud settings.
"Fully realising the benefits of AI means that organisations must have the choice and flexibility to optimise their IT footprint for the rigors of scaling demand," explains Ashesh Badani, Senior Vice President and Chief Product Officer at Red Hat.
AMD and Red Hat's work is also expected to modernise existing structures on high-performing CPU architectures and virtualisation platforms, as well as helping to prepare businesses for AI production with cutting-edge hardware accelerators and open-source AI technologies.
Addressing the demand for AI
As things stand, the average data centre predominantly supports traditional IT infrastructures which leaves little wiggle room for addressing the sector's growing demand for AI.
This is the main issue that has sparked the collaboration between Red Hat’s open-source technology and AMD’s high-performance computing (HPC) expertise.
- Improved performance on AMD GPUs
- Enhanced multi-GPU support
- Expanded vLLM ecosystem engagement
By combining AMD’s x86-based processors and GPU architectures with Red Hat AI, a more cost-efficient and scalable environment is made available to the industry, which is a compelling proposition.
The joint efforts include testing on Microsoft Azure ND MI300X v5, demonstrating effective AI inferencing for various language models.
These models operate across multiple GPUs on a single VM, reducing the need for multiple deployments and cutting performance costs.
Red Hat and AMD are actively involved in the vLLM community, giving the two a sharp focus on enhancing GPU performance, improving multi-GPU support and expanding ecosystem engagement through collaboration with industry giants like IBM.
These initiatives aim to optimise GPU server performance and boost ROI for high-intensity AI workloads.
Modernising the data centre landscape
By optimising current data centre structures, enterprises are able to channel more resources towards nurturing AI innovation.
Red Hat OpenShift Virtualisation is structured to facilitate the migration and management of VM workloads with the cloud-native simplicity and adaptability.
Validated for AMD EPYC processors, Red Hat OpenShift Virtualisation harnesses their performance and efficiency on the hybrid cloud, paving the way for a cloud-enabled future.
This modernisation approach promises higher infrastructure consolidation ratios, potentially lowering total costs related to hardware, software licensing and energy consumption.
For data centre executives tasked with balancing traditional systems with the adoption of future AI advancements, this partnership offers a viable path for streamlining existing infrastructure while accommodating upcoming technological demands.
"As enterprise customer workloads grow more diverse and demanding, they require solutions that can scale," says Philip Guido, Executive Vice President and Chief Commercial Officer at AMD.
"By combining Red Hat’s industry-leading open source platforms with world-class AMD Instinct GPUs and AMD EPYC CPUs, we’re delivering the performance and efficiency customers demand to accelerate AI, virtualisation and hybrid-cloud innovation."
Explore the latest edition of Technology Magazine and be part of the conversation at our global conference series, Tech & AI LIVE.
Discover all our upcoming events and secure your tickets today.
Technology Magazine is a BizClik brand

