Red Hat and Google Cloud have recently expanded their strategic alliance to drive innovation in artificial intelligence (AI) and strengthen enterprise AI applications. The collaboration pairs Red Hat’s strengths in open-source technology with Google Cloud’s robust infrastructure and advanced AI models, particularly the Gemma family, to deliver more scalable and efficient AI solutions.
The partnership brings new tools to Google Cloud users, including the newly launched Red Hat AI Inference Server, designed to deliver cost-effective AI inference while giving users greater flexibility. Central to the collaboration is the open-source project llm-d, which aims to streamline generative AI inference across diverse platforms. As many organisations grapple with turning AI research into operational reality, the initiative is positioned to address the complexities of deploying AI in varied environments.
In particular, the partnership focuses on enhancing AI inference through the vLLM project, where Red Hat has taken a leading role by adding support for Google’s latest AI model, Gemma 3. vLLM is an open-source inference server engineered to accelerate generative AI applications, ensuring swift execution and adaptability. By leveraging Google’s Cloud TPUs, the company’s purpose-built AI accelerators, developers can optimise their resources while maintaining the performance and efficiency that real-world applications demand.
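As a concrete illustration, the sketch below shows how a developer might run a Gemma 3 model through vLLM’s offline Python API. The model identifier and sampling settings are illustrative choices rather than details from the announcement, and a Cloud TPU deployment would additionally require installing vLLM’s TPU backend.

```python
from vllm import LLM, SamplingParams

# Load a Gemma 3 checkpoint into vLLM's engine. The model id below is an
# illustrative Hugging Face identifier; any vLLM-supported Gemma 3 variant
# works the same way. On Cloud TPUs you would install vLLM's TPU build
# instead of the default GPU build.
llm = LLM(model="google/gemma-3-4b-it")

# Example sampling settings; tune these for your workload.
params = SamplingParams(temperature=0.7, top_p=0.9, max_tokens=128)

outputs = llm.generate(
    ["Explain AI inference to a product manager in two sentences."],
    params,
)
for output in outputs:
    print(output.outputs[0].text)
```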
The collaboration also targets workload efficiency and cost reduction, crucial factors as organisations increasingly pivot towards distributed computing strategies. Through initiatives like llm-d, Red Hat and Google Cloud aim not only to make AI tools more accessible but also to encourage ongoing innovation across the AI landscape.
The partnership also reflects a commitment to community-driven innovation. The Red Hat AI Inference Server is now available on Google Cloud, enabling enterprises to deploy generative AI models that are both responsive and cost-efficient across hybrid cloud environments. The integration promises more consistent model inference across today’s diverse computational ecosystems, allowing businesses to benefit from the open-source community’s advancements.
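Because the Red Hat AI Inference Server builds on vLLM, a deployed instance can typically be queried through an OpenAI-compatible API. The sketch below assumes such an endpoint; the base URL, API key, and served model name are all deployment-specific placeholders, not values from the announcement.

```python
from openai import OpenAI

# Placeholder endpoint for a Red Hat AI Inference Server deployment;
# substitute your own service URL, credentials, and served model name.
client = OpenAI(
    base_url="http://inference.example.internal:8000/v1",
    api_key="EMPTY",
)

response = client.chat.completions.create(
    model="google/gemma-3-4b-it",  # illustrative served model
    messages=[
        {
            "role": "user",
            "content": "Summarise hybrid cloud deployment in one sentence.",
        },
    ],
)
print(response.choices[0].message.content)
```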
The two companies have been collaborating for over a decade, laying a strong foundation with Red Hat Enterprise Linux (RHEL) that has been pivotal for cloud migration strategies. Their ongoing efforts, including the integration of Red Hat OpenShift with Google Cloud, provide a cohesive platform for managing applications, further simplifying the deployment process for enterprises embracing hybrid cloud strategies.
Moreover, the partnership has proven beneficial in practical applications, such as the development of Shadowbot, an enterprise chat assistant that leverages Google Cloud’s infrastructure to deliver real-time answers efficiently. This application exemplifies how the synergy between Red Hat and Google Cloud is enhancing productivity through automated solutions in operational settings.
This collaborative vision extends to projects like the LeaderWorkerSet (LWS) API, which extends Kubernetes to manage the coordinated groups of pods that distributed generative AI inference requires. Additionally, Google Cloud’s recent certification of Red Hat Enterprise Linux 8 adds further utility, supporting the container-based workloads that underpin modern AI environments.
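To make the LWS idea concrete, the hedged sketch below creates a minimal LeaderWorkerSet through the official Kubernetes Python client. The resource name, group sizes, and container image are illustrative placeholders under the assumption that the LWS controller is installed in the cluster.

```python
from kubernetes import client, config

config.load_kube_config()  # assumes a locally configured kubeconfig

# Minimal LeaderWorkerSet manifest (fields per the sigs.k8s.io/lws v1 API).
# Replica count, group size, and image are example values only.
lws = {
    "apiVersion": "leaderworkerset.x-k8s.io/v1",
    "kind": "LeaderWorkerSet",
    "metadata": {"name": "gemma-inference"},
    "spec": {
        "replicas": 2,  # two independent leader/worker groups
        "leaderWorkerTemplate": {
            "size": 4,  # one leader plus three workers per group
            "workerTemplate": {
                "spec": {
                    "containers": [
                        {
                            "name": "inference",
                            "image": "vllm/vllm-openai:latest",
                        }
                    ]
                }
            },
        },
    },
}

client.CustomObjectsApi().create_namespaced_custom_object(
    group="leaderworkerset.x-k8s.io",
    version="v1",
    namespace="default",
    plural="leaderworkersets",
    body=lws,
)
```

Scheduling the leader and its workers as one unit is what makes LWS useful for inference workloads whose model shards must start, scale, and fail together.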
As the AI landscape continues to evolve, the alliance between Red Hat and Google Cloud showcases a commitment not only to advancing technology but also to fostering an open ecosystem where enterprises can flourish through collaborative innovation. The journey ahead promises to redefine how businesses leverage AI to meet strategic objectives, providing an infrastructure where complex AI workflows can operate seamlessly across a variety of platforms.
Source: Noah Wire Services