2026-04-08
4/8/2026, 12:00:00 AM ~ 4/9/2026, 12:00:00 AM (UTC)

Recent Announcements

SageMaker HyperPod now supports gang scheduling for distributed training workloads

Amazon SageMaker HyperPod task governance now supports gang scheduling, which ensures all pods required for a distributed training job are ready before training begins. Administrators can configure gang scheduling to prevent wasted compute from partial job runs and avoid deadlocks from jobs waiting for resources.

Data scientists running distributed AI/ML training jobs on Amazon SageMaker HyperPod clusters using the EKS orchestrator require multiple pods to work together across nodes with pod-to-pod communication....
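
The all-or-nothing idea behind gang scheduling can be illustrated with a small, self-contained Python sketch. This is not HyperPod's or EKS's implementation: the `Job` class, the `gang_admit` function, and the notion of a single pool of "free slots" are hypothetical simplifications, chosen only to show why a job is either started with every pod it needs or left pending without consuming capacity.

```python
from dataclasses import dataclass


@dataclass
class Job:
    """Hypothetical distributed training job (illustrative only)."""
    name: str
    pods_required: int  # the job cannot make progress with fewer pods


def gang_admit(jobs, free_slots):
    """Admit jobs all-or-nothing against a simplified pool of pod slots.

    A job is admitted only if every one of its pods fits in the remaining
    capacity. Otherwise it stays pending and reserves nothing, so it cannot
    hold partial resources that another job is waiting for.
    """
    admitted, pending = [], []
    for job in jobs:
        if job.pods_required <= free_slots:
            free_slots -= job.pods_required  # place the whole gang at once
            admitted.append(job.name)
        else:
            pending.append(job.name)  # no partial placement
    return admitted, pending, free_slots


if __name__ == "__main__":
    queue = [Job("train-a", 4), Job("train-b", 6), Job("train-c", 2)]
    print(gang_admit(queue, free_slots=8))
```

In this sketch, a job that does not fit leaves capacity untouched for smaller jobs behind it. Without the all-or-nothing check, two large jobs could each hold part of the pool and wait indefinitely for the rest, which is the deadlock the announcement refers to.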