Hewlett Packard Enterprise has expanded its partnership with NVIDIA to build an enterprise computing solution for generative AI (Gen AI). The co-engineered, preconfigured AI tuning and inferencing solution enables enterprises of any size to quickly customise foundation models using private data and deploy production applications anywhere, from edge to cloud.
The offering removes the complexity of developing and deploying Gen AI infrastructure with a full-stack AI tuning and inferencing solution from HPE and NVIDIA.
As enterprises develop and deploy Gen AI models for use cases such as conversational search, business process automation and content creation, they require a software and infrastructure stack that can be deployed quickly and from wherever the business needs it, HPE said.
The new enterprise computing solution for generative AI is part of an expanded collaboration between HPE and NVIDIA that delivers full-stack, out-of-the-box AI solutions. These solutions integrate HPE Machine Learning Development Environment Software, HPE Ezmeral Software, HPE ProLiant Compute and HPE Cray Supercomputers with the NVIDIA AI Enterprise software suite, including the NVIDIA NeMo framework.
“Together, HPE and NVIDIA are in a unique position to deliver a comprehensive AI-native solution that will dramatically ease the journey to develop and deploy AI models with a portfolio of pre-configured solutions,” said Antonio Neri, President and CEO, HPE.
“The strategic collaboration between HPE and NVIDIA will dramatically reduce barriers for customers looking to transform their businesses with AI.”
“The generative AI era is ramping at full speed, with enterprises racing to reimagine their businesses,” said Jensen Huang, founder and CEO, NVIDIA.
“Our expanded collaboration with HPE will help enterprises drive unprecedented productivity through AI applications that connect with business data to power accurate assistants, informed chatbots and semantic search.”
The new solution is an AI tuning and inferencing data centre tool that provides an entry point for enterprises of all sizes with a ready-out-of-the-box offering. Enterprises can use pretrained foundation models with their private data to create production applications such as AI chatbots. In addition, retrieval-augmented generation (RAG) workflows, which supply the model with relevant private data at query time, further improve the data quality and accuracy of the application.
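As a rough illustration of the retrieval step behind RAG, the sketch below ranks a few private documents against a question and places the best match in the prompt sent to a model. It is a toy under stated assumptions: the word-overlap score, the function names (tokenize, score, retrieve, build_prompt) and the sample documents are all hypothetical and are not part of the HPE or NVIDIA software, and production systems would typically use embedding-based search instead.

```python
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into alphanumeric words."""
    return re.findall(r"[a-z0-9]+", text.lower())


def score(query, doc):
    """Count the words shared by the query and the document (toy relevance score)."""
    query_words = Counter(tokenize(query))
    doc_words = Counter(tokenize(doc))
    return sum(min(count, doc_words[word]) for word, count in query_words.items())


def retrieve(query, docs, k=1):
    """Return the k documents with the highest overlap score."""
    return sorted(docs, key=lambda doc: score(query, doc), reverse=True)[:k]


def build_prompt(query, docs, k=1):
    """Place the retrieved private data ahead of the question."""
    context = "\n".join(retrieve(query, docs, k))
    return f"Context:\n{context}\n\nQuestion: {query}"


# Hypothetical private documents, for illustration only.
DOCS = [
    "Refunds are processed within 14 days.",
    "Our office is closed on public holidays.",
]

if __name__ == "__main__":
    print(build_prompt("How long do refunds take?", DOCS))
```

The model then answers from the supplied context rather than from its pretraining data alone, which is why the approach can improve accuracy on company-specific questions.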
Purpose-built and optimised for AI
The solution uses a rack-scale architecture featuring HPE ProLiant Compute DL380a servers preconfigured with NVIDIA L40S GPUs, NVIDIA BlueField-3 DPUs and the NVIDIA Spectrum-X Ethernet Networking Platform for hyperscale AI. It is sized to fine-tune a 70 billion-parameter Llama 2 model and features 16 HPE ProLiant DL380a servers and 64 L40S GPUs.
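The sizing figures can be put in perspective with some back-of-the-envelope arithmetic. The sketch below uses the server, GPU and parameter counts quoted above; the 48 GB of memory per L40S and the 2 bytes per parameter (16-bit weights) are assumptions that do not come from HPE's announcement, and the function names are illustrative.

```python
# Figures quoted in the article.
SERVERS = 16
GPUS = 64
MODEL_PARAMS = 70e9  # 70 billion-parameter Llama 2

# Assumptions, not taken from the article.
GPU_MEMORY_GB = 48    # assumed memory per L40S GPU
BYTES_PER_PARAM = 2   # assumed 16-bit weights


def gpus_per_server(gpus, servers):
    """Number of GPUs in each server, assuming an even split."""
    return gpus // servers


def aggregate_gpu_memory_gb(gpus, memory_per_gpu_gb):
    """Total GPU memory across the rack, in GB."""
    return gpus * memory_per_gpu_gb


def weights_size_gb(params, bytes_per_param):
    """Memory needed to hold the model weights alone, in GB."""
    return params * bytes_per_param / 1e9


if __name__ == "__main__":
    print("GPUs per server:", gpus_per_server(GPUS, SERVERS))
    print("Aggregate GPU memory (GB):", aggregate_gpu_memory_gb(GPUS, GPU_MEMORY_GB))
    print("Model weights (GB):", weights_size_gb(MODEL_PARAMS, BYTES_PER_PARAM))
```

Under these assumptions the rack holds four GPUs per server and about 3 TB of GPU memory in total, against roughly 140 GB for the weights alone. Full fine-tuning also has to store gradients and optimiser state, so the weight size is only a lower bound on what the job needs.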
HPE AI software
The solution includes HPE Machine Learning Development Environment Software, with new generative AI studio capabilities to rapidly prototype and test models, and HPE Ezmeral Software, with new GPU-aware capabilities to simplify deployment and accelerate data preparation for AI workloads across the hybrid cloud.
NVIDIA AI software
NVIDIA AI Enterprise accelerates production AI development and deployment with security, stability, manageability and support. It offers the NVIDIA NeMo framework, guardrailing toolkits, data curation tools and pretrained models to streamline enterprise Gen AI.
HPE Services provides a broad portfolio of consulting services, workforce training and deployment solutions for AI. The new AI services take customers through every step of the journey: from Gen AI and LLM discovery to implementation, where customers develop the optimum operational models and hybrid cloud data strategies needed to build, deploy and scale solutions into transformative outcomes. These comprehensive services are supported by new Global Centers of Excellence for AI and Data around the world, including in India.
At SC23, HPE announced a turnkey supercomputing solution powered by NVIDIA for large enterprises, research institutions and government organisations to address the first phase of the AI lifecycle: developing and training foundation models. The enterprise computing solution for generative AI is a smaller form-factor AI solution for enterprise customers that are focused on tuning and inferencing.
At HPE Discover Barcelona 2023, HPE also announced an open, full-stack AI-native architecture and the next series of AI-native and hybrid cloud offerings for machine learning development, data analytics, AI-optimised file storage, AI tuning and inferencing, and professional services.
Availability
The enterprise computing solution for generative AI can be ordered from Q1 2024.