The Amazon Elastic Compute Cloud (Amazon EC2) accelerated computing portfolio offers the broadest choice of accelerators to power your artificial intelligence (AI), machine learning (ML), graphics, and high performance computing (HPC) workloads. We’re excited to announce the expansion of this portfolio with three new instances featuring the latest NVIDIA GPUs: Amazon EC2 P5e instances powered by NVIDIA H200 GPUs, Amazon EC2 G6 instances featuring NVIDIA L4 GPUs, and Amazon EC2 G6e instances powered by NVIDIA L40S GPUs. All three instances will be available in 2024, and we look forward to seeing what you can do with them.
AWS and NVIDIA have collaborated for over 13 years and have pioneered large-scale, highly performant, and cost-effective GPU-based solutions for developers and enterprises across the spectrum. We have combined NVIDIA’s powerful GPUs with differentiated AWS technologies such as the AWS Nitro System, 3,200 Gbps of Elastic Fabric Adapter (EFA) v2 networking, hundreds of GB/s of data throughput with Amazon FSx for Lustre, and exascale computing with Amazon EC2 UltraClusters to deliver the most performant infrastructure for AI/ML, graphics, and HPC. Coupled with other managed services such as Amazon Bedrock, Amazon SageMaker, and Amazon Elastic Kubernetes Service (Amazon EKS), these instances give developers the industry’s best platform for building and deploying generative AI, HPC, and graphics applications.
High-performance, cost-effective GPU-based instances for AI, HPC, and graphics workloads
To power the development, training, and inference of the largest large language models (LLMs), EC2 P5e instances will feature NVIDIA’s latest H200 GPUs, which offer 141 GB of HBM3e GPU memory that is 1.7 times larger and 1.4 times faster than that of H100 GPUs. This increase in GPU memory, along with up to 3,200 Gbps of EFA networking enabled by the AWS Nitro System, will enable you to continue to build, train, and deploy your cutting-edge models on AWS.
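As a quick sanity check of those ratios, the announced H200 figures can be compared against the H100's published specifications. The H100 SXM numbers below (80 GB of HBM3 at roughly 3.35 TB/s) are assumed from NVIDIA's public datasheet, not stated in this post:

```python
# Compare announced H200 memory specs against assumed H100 SXM specs.
# Assumption: H100 SXM has 80 GB of HBM3 at ~3.35 TB/s (NVIDIA datasheet);
# H200 offers 141 GB of HBM3e at ~4.8 TB/s.
h100_mem_gb, h100_bw_tb_s = 80, 3.35
h200_mem_gb, h200_bw_tb_s = 141, 4.8

capacity_ratio = h200_mem_gb / h100_mem_gb    # capacity: ~1.7x larger
bandwidth_ratio = h200_bw_tb_s / h100_bw_tb_s  # bandwidth: ~1.4x faster

print(f"Capacity: {capacity_ratio:.2f}x, Bandwidth: {bandwidth_ratio:.2f}x")
# prints: Capacity: 1.76x, Bandwidth: 1.43x
```

Both ratios line up with the "1.7 times larger and 1.4 times faster" claim when rounded down to one decimal place.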
EC2 G6e instances, featuring NVIDIA L40S GPUs, are built to provide developers with a broadly available option for training and inference of publicly available LLMs, as well as to support the growing adoption of small language models (SLMs). They are also optimal for digital twin applications that use NVIDIA Omniverse for describing and simulating across 3D tools and applications, and for developing virtual worlds and advanced workflows for industrial digitalization.
EC2 G6 instances, featuring NVIDIA L4 GPUs, will deliver a lower-cost, energy-efficient solution for deploying ML models for natural language processing, language translation, video and image analysis, speech recognition, and personalization, as well as for graphics workloads such as creating and rendering real-time, cinematic-quality graphics and game streaming.
About the Author
Chetan Kapoor is the Director of Product Management for the Amazon EC2 Accelerated Computing Portfolio.