2024-12-23

12/23/2024, 12:00:00 AM ~ 12/24/2024, 12:00:00 AM (UTC) Recent Announcements AWS Neuron introduces support for Trainium2 and NxD Inference Today, AWS announces the release of Neuron 2.21, introducing support for AWS Trainium2 chips and Amazon EC2 Trn2 instances, including the trn2.48xlarge instance type and Trn2 UltraServer. This release also adds support for PyTorch 2.5 and introduces NxD Inference and Neuron Profiler 2.0 (beta). NxD Inference, a new PyTorch-based library integrated with vLLM, simplifies the deployment of large language and multi-modality models and enables PyTorch model onboarding with minimal code changes, and Neuron Profiler 2....

December 23, 2024

2023-08-24

8/24/2023, 12:00:00 AM ~ 8/25/2023, 12:00:00 AM (UTC) Recent Announcements Announcing AWS ROSA console support for the ROSA with hosted control planes preview In May, Red Hat announced the preview of Red Hat OpenShift Service on AWS (ROSA) with hosted control planes (HCP), a new deployment model for ROSA clusters. Today, we are introducing an AWS account configuration workflow for ROSA with HCP on the AWS Management Console. Under the original ROSA deployment model, now called ROSA classic, all AWS infrastructure required to run the ROSA control plane is hosted in your AWS account....

August 24, 2023

2022-10-10

10/10/2022, 12:00:00 AM ~ 10/11/2022, 12:00:00 AM (UTC) Recent Announcements AWS Neuron adds support for Amazon EC2 Trn1 instances to unlock high-performance, cost-effective deep learning training at scale AWS Neuron adds support for AWS Trainium powered Amazon EC2 Trn1 instances to unlock high-performance, cost-effective deep learning training at scale. The Neuron SDK includes a compiler, runtime libraries, and profiling tools that integrate with popular ML frameworks such as PyTorch and TensorFlow....

October 10, 2022