2025-12-04

12/4/2025, 12:00:00 AM ~ 12/5/2025, 12:00:00 AM (UTC) Recent Announcements Amazon Bedrock now supports Responses API from OpenAI Amazon Bedrock now supports Responses API on new OpenAI API-compatible service endpoints. Responses API enables developers to achieve asynchronous inference for long-running inference workloads, simplifies tool use integration for agentic workflows, and also supports stateful conversation management. Instead of requiring developers to pass the entire conversation history with each request, Responses API enables them to automatically rebuild context without manual history management....

December 4, 2025

2025-12-03

12/3/2025, 12:00:00 AM ~ 12/4/2025, 12:00:00 AM (UTC) Recent Announcements Amazon SageMaker HyperPod now supports checkpointless training Amazon SageMaker HyperPod now supports checkpointless training, a new foundational model training capability that mitigates the need for a checkpoint-based job-level restart for fault recovery. Checkpointless training maintains forward training momentum despite failures, reducing recovery time from hours to minutes. This represents a fundamental shift from traditional checkpoint-based recovery, where failures require pausing the entire training cluster, diagnosing issues manually, and restoring from saved checkpoints, a process that can leave expensive AI accelerators idle for hours, costing your organization wasted compute....

December 3, 2025

2025-12-02

12/2/2025, 12:00:00 AM ~ 12/3/2025, 12:00:00 AM (UTC) Recent Announcements Announcing Amazon EC2 General purpose M8azn instances (Preview) Starting today, new general purpose high-frequency high-network Amazon Elastic Compute Cloud (Amazon EC2) M8azn instances are available for preview. These instances are powered by fifth generation AMD EPYC (formerly code named Turin) processors, offering the highest maximum CPU frequency, 5GHz in the cloud. The M8azn instances offer up to 2x compute performance versus previous generation M5zn instances....

December 2, 2025

2025-11-26

11/26/2025, 12:00:00 AM ~ 11/27/2025, 12:00:00 AM (UTC) Recent Announcements SageMaker HyperPod now supports Managed tiered KV cache and intelligent routing Amazon SageMaker HyperPod now supports Managed Tiered KV Cache and Intelligent Routing for large language model (LLM) inference, enabling customers to optimize inference performance for long-context prompts and multi-turn conversations. Customers deploying production LLM applications need fast response times while processing lengthy documents or maintaining conversation context, but traditional inference approaches require recalculating attention mechanisms for all previous tokens with each new token generation, creating computational overhead and escalating costs....

November 26, 2025

2025-11-24

11/24/2025, 12:00:00 AM ~ 11/25/2025, 12:00:00 AM (UTC) Recent Announcements OpenSearch Service Enhances Log Analytics with New PPL Experience Today, AWS announces enhanced log analytics capabilities in Amazon OpenSearch Service, making Piped Processing Language (PPL) and natural language the default experience in OpenSearch UI’s Observability workspace. This update combines proven pipeline syntax with simplified workflows to deliver an intuitive observability experience, helping customers analyze growing data volumes while controlling costs. The new experience includes 35+ new commands for deep analysis, faceted exploration, and natural language querying to help customers gain deeper insights across infrastructure, security, and business metrics....

November 24, 2025