Amazon Web Services (AWS) and NVIDIA have expanded their strategic collaboration to provide a cutting-edge infrastructure, software, and services ecosystem for generative AI innovations. This collaboration marks a significant step forward in the world of AI, leveraging the strengths of both AWS and NVIDIA technologies.
AWS is introducing NVIDIA GH200 Grace Hopper Superchips with multi-node NVLink technology to its Amazon Elastic Compute Cloud (EC2) instances. This move will empower customers with on-demand access to supercomputer-class performance, featuring up to 20 TB of shared memory to tackle terabyte-scale workloads.
NVIDIA and AWS are joining forces to host NVIDIA DGX Cloud on AWS. This collaboration features the GH200 NVL32, providing developers access to the largest shared memory in a single instance. DGX Cloud on AWS is poised to expedite the training of cutting-edge generative AI and large language models.
Project Ceiba
The collaborative efforts extend to Project Ceiba, where NVIDIA and AWS are designing the world’s fastest GPU-powered AI supercomputer. This at-scale system, featuring GH200 NVL32 and Amazon EFA interconnect, boasts 16,384 NVIDIA GH200 Superchips. It also possesses the capability to process 65 exaflops of AI.
Set to make a significant stride, AWS will introduce three new Amazon EC2 instances powered by NVIDIA GPUs, catering to various applications. Equipped with NVIDIA H200, L4, and L40S GPUs, these instances go by P5e, G6, and G6e. They promise to handle tasks ranging from large-scale generative AI and HPC workloads to fine-tuning graphics, video, and AI applications.
NVIDIA Software on AWS Enhances Generative AI Development
NVIDIA’s software on AWS takes center stage in boosting generative AI development. The NeMo Retriever microservice offers tools for creating highly accurate chatbots and summarization tools using accelerated semantic retrieval. Additionally, BioNeMo, available on Amazon SageMaker and planned for AWS on NVIDIA DGX Cloud, simplifies and accelerates drug discovery processes for pharmaceutical companies.
This collaboration signifies a landmark moment in the evolution of AI. It showcases AWS and NVIDIA’s commitment to advancing technology in generative AI.