9/12/2024, 12:00:00 AM ~ 9/13/2024, 12:00:00 AM (UTC)
Recent Announcements
Announcing sticky session routing for Amazon SageMaker Inference
Today, we are announcing the availability of sticky session routing on Amazon SageMaker Inference which helps customers improve the performance and user experience of their generative AI applications by leveraging their previously processed information. Amazon SageMaker makes it easier to deploy ML models including foundation models (FMs) to make inference requests at the best price performance for any use case.\n By enabling sticky sessions, all requests for the same session will be routed to the same instance, allowing your ML application to reuse previously processed information to reduce latency and improve user experience. This is particularly valuable when customers want to use large data payloads or have the need for seamless interactive experiences. By leveraging their previous inference requests, customers can now take advantage of this feature to build innovative state-aware AI applications on SageMaker. To do this customers will have to create a session id with their first request and then use that session id to indicate that SageMaker should route all the subsequent requests to the same instance. Sessions can also be deleted when done to free up resources for new sessions. This feature is available in all regions where SageMaker is available. You can learn more about deploying models on SageMaker here and more about this feature in our documentation.
AWS Network Firewall now supports AWS PrivateLink
AWS Network Firewall now supports AWS PrivateLink. Customers can now access and manage their Network Firewalls privately, without going through the public internet. AWS PrivateLink provides private connectivity between VPCs, AWS services, and on-premises applications, securely over the Amazon network. When AWS PrivateLink is used with AWS Network Firewall, all management and control traffic between clients and Network Firewall flows over a private network.\n AWS Network Firewall is a managed firewall service that makes it easy to deploy essential network protections for all your Amazon VPCs. Customers can use AWS PrivateLink with Network Firewall in regions where AWS Network Firewall is available today, including the AWS GovCloud (US) Regions. For more information about the AWS Regions where AWS Network Firewall is available, see the AWS Region table. To learn more about configuring AWS Network Firewall, please refer to the service documentation.
Amazon EC2 R6in and R6idn instances are now available in an additional region
Starting today, Amazon Elastic Compute Cloud (Amazon EC2) R6in and R6idn instances are available in AWS Region Asia Pacific (Sydney). These sixth-generation network optimized instances, powered by 3rd Generation Intel Xeon Scalable processors and built on the AWS Nitro System, deliver up to 200Gbps network bandwidth, 2x more network bandwidth, and up to 2x higher packet-processing performance over comparable fifth-generation instances. Customers can use R6in and R6idn instances to scale the performance and throughput of network-intensive workloads such as memory-intensive SQL and NoSQL databases, distributed web scale in-memory caches (Memcached, Redis), in-memory databases (SAP HANA), and real-time big data analytics (Apache Hadoop, Apache Spark).\n R6in and R6idn instances are available in 10 different instance sizes including metal, with up to 128 vCPUs and 1024 GiB of memory. They deliver up to 100 Gbps of Amazon Elastic Block Store (EBS) bandwidth, and up to 400K IOPS. R6in and R6idn instances offer Elastic Fabric Adapter (EFA) networking support on 32xlarge and metal sizes. R6idn instances offer up to 7.6 TB of high-speed, low-latency instance storage. With this regional expansion, R6in and R6idn instances are available in the following AWS Regions: US East (Ohio), US East (N. Virginia, Oregon), Europe (Ireland, Frankfurt, Stockholm), Asia Pacific (Singapore, Sydney, Tokyo), and AWS GovCloud (US-West). Customers can purchase the new instances through Savings Plans, Reserved, On-Demand, and Spot instances. To learn more, see R6in and R6idn instances page. To get started, see AWS Management Console, AWS Command Line Interface (AWS CLI), and AWS SDKs.
AWS Application Migration Service supports Trend Micro post-launch action
Starting today, AWS Application Migration Service (AWS MGN) provides an action for installing the Trend Micro Vision One Server & Workload Protection Agent on your migrated instances. For each migrated server, you can choose to automatically install the agent to support your security needs.\n Application Migration Service minimizes time-intensive, error-prone manual processes by automating the conversion of your source servers to run natively on AWS. It also helps simplify modernization of your migrated applications by allowing you to select preconfigured and custom optimization options during migration. This feature is now available in all of the Commercial regions where Application Migration Service is available. Access the AWS Regional Services List for the most up-to-date availability information. To start using Application Migration Service for free, sign in through the AWS Management Console. For more information, visit the Application Migration Service product page. For more information on Trend Micro and to create a trial account, visit the Trend Micro sign-up page.
AWS Elemental Media Services now support live AV1 encoding
The AV1 video codec is now supported in the AWS Elemental Media Services. You can use AV1 in AWS Elemental MediaLive, MediaPackage, MediaTailor, and MediaConvert to produce both live and on-demand streams with ad insertion.\n This launch enables live streaming and packaging of AV1 encoded content, preparation of AV1 VOD assets and ads, and ad insertion into AV1 encoded videos. AV1 provides a lower bitrate with a similar viewing experience when compared to HEVC and AVC, reducing both the bandwidth required to deliver live events and CDN costs. AV1 can also deliver better video quality for viewing on mobile devices and in network constrained environments. AWS Media Services enable you to transport, prepare, process, and deliver live and on-demand content in the cloud. These managed services let you build and adapt video workflows quickly, eliminate capacity planning, easily scale with growth, and benefit from pay-as-you-go pricing. Connect with other AWS services and third-party applications for live and on demand video streaming, media storage, machine learning, content protection, advertising and monetization, and more. To learn how the AV1 codec can reduce bandwidth and improve the viewing experience, read the blog post. For more information about live AV1 pricing, please review the AWS Elemental MediaLive pricing page. To learn more about the services, please visit the AWS Media Services page.
Announcing AWS Elemental MediaLive Anywhere for live video encoding on your own hardware
Today, AWS announces the general availability of AWS Elemental MediaLive Anywhere, which allows you to run live video transcoding on your on-premises hardware. MediaLive Anywhere brings the cloud control and pay-as-you-go pricing of AWS Elemental MediaLive to compute resources you manage. With MediaLive Anywhere, you can take advantage of MediaLive’s centralized configuration, control, and monitoring capabilities while processing live video on premises close to video sources and outputs.\n With MediaLive Anywhere, you deploy the same broadcast-grade video encoding engine used in AWS Elemental MediaLive on your hardware. MediaLive Anywhere supports a wide range of hardware configurations and can ingest video from multicast, SDI, and standard internet-based transport protocols. You get a consistent set of APIs, channel profiles, logs, and monitoring metrics across your cloud and on-premises live video workflows. AWS Elemental MediaLive Anywhere is available in all AWS Regions where AWS Elemental MediaLive is available. To learn more, visit the AWS Elemental MediaLive Anywhere page.
Amazon ECR announces support for dual-layer server-side encryption in the AWS GovCloud (US) Regions
Amazon Elastic Container Registry (ECR) now supports dual-layer server-side encryption in the AWS GovCloud (US) Regions. This capability allows you to apply two independent layers of server-side encryption to images stored in Amazon ECR. Dual-layer server-side encryption with keys stored in AWS Key Management Service (DSSE-KMS) enables you to meet stronger compliance and regulatory requirements of applying multiple layers of encryption to your container images.\n ECR supports server-side encryption of ECR images using either Amazon S3-managed encryption keys or keys stored in Amazon Key Management Service (KMS). This often meets your security requirements as it protects data at rest, however, if you operate in highly regulated environments that require rigorous security standards, you may require a second layer of encryption for your images. Now with DSSE-KMS, you can easily apply two layers of encryption and control the keys used for both layers. Once this feature is enabled, ECR automatically encrypts your images twice when pushed and decrypts twice when pulled using your encryption keys managed by Amazon Key Management Service (KMS). AWS KMS is a simple to use key management service that makes it easy for you to create, manage, and control keys by setting permissions per key and specifying key rotation schedules. DSSE-KMS with ECR is available for use in the AWS GovCloud (US) Regions at an additional cost. For pricing information, visit the Amazon ECR pricing page. To learn more about all available encryption options on Amazon ECR and get started with this feature, visit our user guide.
AWS Blogs
AWS Japan Blog (Japanese)
AWS Japan Startup Blog (Japanese)
- Balancing reconstruction support and DX promotion [CTO Night & Day 2024 Kanazawa — Day 2 Digest]
- CTOs and VPoE of startups and tech companies representing Japan gather in Kanazawa [CTO Night & Day 2024 Kanazawa — Day 1 Digest]
AWS News Blog
AWS Big Data Blog
- Harness Zero Copy data sharing from Salesforce Data Cloud to Amazon Redshift for Unified Analytics – Part 2
- The AWS Glue Data Catalog now supports storage optimization of Apache Iceberg tables
- Differentiate generative AI applications with your data using AWS analytics and managed databases
- How ZS built a clinical knowledge repository for semantic search using Amazon OpenSearch Service and Amazon Neptune
AWS Database Blog
AWS Developer Tools Blog
- Enhancing Observability in the AWS SDK for .NET with OpenTelemetry
- Linux Support Updates for AWS CLI v2
AWS HPC Blog
AWS for Industries
- Building a Charging Station Management System with AWS
- Barcelo Hotel Group achieves campaign efficiency by accelerating first-party audience effectiveness with Tealium and Amazon Marketing Cloud
AWS Machine Learning Blog
- Build a RAG-based QnA application using Llama3 models from SageMaker JumpStart
- Best prompting practices for using Meta Llama 3 with Amazon SageMaker JumpStart
- How healthcare payers and plans can empower members with generative AI
- Enabling production-grade generative AI: New capabilities lower costs, streamline production, and boost security
- Scaling Thomson Reuters’ language model research with Amazon SageMaker HyperPod