Classmethod Data Analytics Newsletter (AWS Data Analytics Edition) – June 2026 Issue

Classmethod Data Analytics Newsletter (AWS Data Analytics Edition) – June 2026 Issue

Here are the AWS data analytics-related updates for May 2026. It was a month in which improvements in price-performance and support for agentic workloads advanced simultaneously, including the general availability of Graviton-based Redshift RG instances, next-generation OpenSearch Serverless achieving zero standby costs, and Apache Spark 4.0.2 support. Please take a look!
2026.06.01

This page has been translated by machine translation. View original

This is Ishikawa from the Consulting Department of the Cloud Business Division. Here is the AWS data analytics-related update information for May 2026. This month's major topics are the general availability of AWS Graviton-based Amazon Redshift RG instances, the general availability of the next-generation Amazon OpenSearch Serverless achieving zero idle costs, and Apache Spark 4.0.2 support for Amazon EMR. It was a month in which price-performance improvements driven by Graviton and feature enhancements targeting agentic/generative AI workloads progressed simultaneously. There are other updates as well, so let me introduce them!

Amazon Redshift

New Features & Updates

2026/05/07 - Amazon Redshift now scales data ingestion automatically with concurrency scaling for batch workloads

https://aws.amazon.com/about-aws/whats-new/2026/05/concurrencyscaling-support-for-copy/

Amazon Redshift's concurrency scaling now supports data ingestion for batch processing, automatically scaling for COPY queries in Parquet and ORC formats from Amazon S3. This eliminates the trade-off between ingestion speed and query performance during traffic spikes. For Redshift Serverless, it is enabled automatically based on demand, and for the provisioned version, it is enabled based on pre-configuration.

2026/05/12 - Amazon Redshift launches RG instances powered by AWS Graviton

https://aws.amazon.com/about-aws/whats-new/2026/05/amazon-redshift-rg-instances-powered-by-graviton/

The new generation RG instances equipped with AWS Graviton processors are now generally available. They achieve up to 2.2x faster performance for data warehouse workloads and up to 2.4x faster for data lake workloads, at a price 30% lower per vCPU compared to RA3. The integrated data lake query engine eliminates the per-TB scan charges of Redshift Spectrum. Available in two sizes, rg.xlarge and rg.4xlarge, across multiple regions including Tokyo (ap-northeast-1).

https://aws.amazon.com/blogs/big-data/meet-amazon-redshift-rg-aws-graviton-based-instances-with-an-integrated-data-lake-query-engine-delivering-up-to-2-4x-better-performance-at-30-lower-price-than-ra3/

https://dev.classmethod.jp/articles/amazon-redshift-rg-instances-graviton/

AWS Glue

New Features & Updates

API Changes

2026/05/06 - AWS Glue - 3 updated methods

https://awsapichanges.com/archive/changes/7068f3-glue.html

A CustomLogGroupPrefix parameter has been added to StartDataQualityRulesetEvaluationRun for specifying a CloudWatch log group path. Additionally, RulesetName has been added to ListDataQualityRulesetEvaluationRuns to filter evaluation runs by ruleset name.

2026/05/14 - AWS Glue - 1 updated methods

https://awsapichanges.com/archive/changes/bd1fb2-glue.html

A --has-databases parameter has been added to the get-catalogs API, enabling filtering to return only catalogs that can hold databases. Additionally, model-level validation on the size of a table's partition index list has been removed.

Amazon OpenSearch Service

New Features & Updates

2026/05/05 - Amazon OpenSearch Service expands Cluster Insights with a new insight

https://aws.amazon.com/about-aws/whats-new/2026/05/amazon-opensearch-cluster-insights/

Cluster Insights has been expanded to cover all OpenSearch versions and Elasticsearch 6.8 and later. Cluster health and performance can be proactively visualized from the console, enabling early detection of potential issues.

2026/05/28 - The next generation of Amazon OpenSearch Serverless is now generally available

https://aws.amazon.com/about-aws/whats-new/2026/05/amazon-opensearch-serverless-next-generation-generally-available/

The next-generation Amazon OpenSearch Serverless, redesigned from the ground up, is now generally available. Its compute and storage separated architecture achieves autoscaling 20x faster than before, provisioning in seconds, and scale-to-zero during idle periods. It enables up to 60% cost reduction compared to provisioning clusters for peak loads and is optimized for agentic workloads.

https://aws.amazon.com/blogs/big-data/the-next-generation-of-amazon-opensearch-serverless-built-from-the-ground-up-for-agents/

https://dev.classmethod.jp/articles/20260531-amazon-opensearch-service-nxgn-ga/

API Changes

2026/05/05 - Amazon OpenSearch Service - 10 updated methods

https://awsapichanges.com/archive/changes/2d415e-es.html

Support has been added for VPC egress, which privately routes outbound traffic from OpenSearch domains through a VPC instead of the public internet.

2026/05/13 - Amazon OpenSearch Service - 7 updated methods

https://awsapichanges.com/archive/changes/74501c-es.html

Support has been added for an automated snapshot pause option (AutomatedSnapshotPauseOptions).

Amazon Quick

New Features & Updates

2026/05/01 - AWS Transform now offers BI migration agents for Power BI and Tableau to Amazon Quick

https://aws.amazon.com/about-aws/whats-new/2026/05/quick-bi-migration/

BI migration agents have been added to AWS Transform that automatically convert Power BI and Tableau dashboards into Amazon Quick (QuickSight) assets. They automate dataset reconstruction, visual conversion, and field mapping, reducing migration work from months to days. The agents can be purchased from AWS Marketplace, and data processing is completed within the customer's AWS account.

2026/05/04 - Amazon Quick introduces Dataset Q&A for conversational analytics against enterprise data

https://aws.amazon.com/about-aws/whats-new/2026/05/amazon-quick-dataset-qa/

Dataset Q&A has been added to Amazon Quick, enabling users to ask questions directly to datasets in natural language without pre-creating topics. A text-to-SQL agent interprets questions and generates SQL against multiple data sources including Amazon Redshift, Amazon Athena, Aurora PostgreSQL, and Apache Iceberg tables on S3. An Explain feature is also included to validate the generated results.

https://dev.classmethod.jp/articles/amazon-quick-dataset-qa/

2026/05/04 - Amazon Quick upgrades the extension for Microsoft Outlook (Preview)

https://aws.amazon.com/about-aws/whats-new/2026/05/amazon-quick-microsoft-outlook/

A preview of the Amazon Quick extension for Microsoft Outlook has been made available. Users can summarize unread emails, organize their inbox, schedule meetings, and draft inline replies in natural language without leaving Outlook. Available in 7 regions including Tokyo (ap-northeast-1).

2026/05/05 - Amazon Quick now integrates with New Relic for observability-driven AI agents

https://aws.amazon.com/about-aws/whats-new/2026/05/amazon-quick-new-relic/

Amazon Quick now integrates with New Relic's AI agents, enabling incident investigation, generation of root cause analysis briefs, and task creation directly from Quick. Through New Relic's remote MCP server, users can leverage alert insights, log analysis, and natural language NRQL queries.

API Changes

2026/05/01 - Amazon QuickSight - 23 updated methods

https://awsapichanges.com/archive/changes/6338dd-quicksight.html

Numerous features have been added, including private CA certificate specification for OAuth data sources (IdentityProviderCACertificatesBundleS3Uri), 256-character support for theme font families, ControlTitleFormatText for all 13 filter types, ContextRegion for cross-region support, and the addition of Story/scenario to the CreateCustomCapability API.

2026/05/13 - Amazon QuickSight - 4 updated methods

https://awsapichanges.com/archive/changes/74501c-quicksight.html

Five custom permission options have been added for Quick Apps, and these features can now be controlled from the public SDK/CLI.

2026/05/18 - Amazon QuickSight - 3 updated methods

https://awsapichanges.com/archive/changes/faaa92-quicksight.html

Dataset enrichment and geospatial features in the new data preparation experience are now supported.

2026/05/29 - Amazon QuickSight - 5 new methods

https://awsapichanges.com/archive/changes/96246a-quicksight.html

Support has been added for creating, updating, describing, listing, and deleting the new OAuthClientApplication resource, which stores OAuth configurations for connecting to databases using 3-Legged OAuth.

Amazon EMR

New Features & Updates

2026/05/27 - Amazon EMR now supports Apache Spark 4.0.2 in general availability

https://aws.amazon.com/about-aws/whats-new/2026/05/amazon-emr-apache-spark/

Amazon EMR has made Apache Spark 4.0.2 generally available across all three deployment models (EMR on EC2/EKS/Serverless). It supports ANSI SQL, VARIANT data types, fine-grained access control (FGAC) at the row and column level, and the Apache Iceberg v3 table format, enabling stronger transaction guarantees and audit trail maintenance through data lineage tracking.

Amazon SageMaker Unified Studio

New Features & Updates

2026/05/11 - Amazon SageMaker Unified Studio adds getting started tutorials and in-product release notes

https://aws.amazon.com/about-aws/whats-new/2026/05/smus-getting-started/

Getting started tutorials have been added to Amazon SageMaker Unified Studio, allowing users to learn key workflows using sample data. The interface now automatically switches between dark/light mode based on system settings, and a "What's New" section has been added within the product to check new features and release notes.

2026/05/13 - Amazon SageMaker Data Agent now available for IAM Identity Center domains

https://aws.amazon.com/about-aws/whats-new/2026/05/amazon-sagemaker-data-agent-idc/

Amazon SageMaker Data Agent is now available for domains configured with IAM Identity Center. Users can describe the purpose of their analysis in plain English to generate Python or SQL code, with support for connections to Amazon Athena, Amazon Redshift, Amazon S3, and AWS Glue Data Catalog. A debugging feature via "Fix with AI" is also included.

2026/05/20 - Amazon SageMaker Unified Studio now supports data quality rule authoring and evaluation

https://aws.amazon.com/about-aws/whats-new/2026/05/smus-data-quality/

Data quality rule authoring and evaluation powered by AWS Glue Data Quality is now available within Amazon SageMaker Unified Studio. Users can define rules, run ruleset evaluations, and review results directly from Studio, detecting issues in both data at rest and data in transit before they impact analytics pipelines.

2026/05/21 - SageMaker Unified Studio automates Glue connector provisioning for cross-subnet job retries

https://aws.amazon.com/about-aws/whats-new/2026/05/sagemaker-unified-studio-glue/

SageMaker Unified Studio has automated the provisioning of AWS Glue connectors across multiple subnets. Jobs can now automatically retry when a failure occurs in the primary subnet, reducing unplanned downtime for business-critical data pipelines and helping maintain SLA compliance without manually configuring backup connectors.

2026/05/22 - Amazon SageMaker adds business metadata and governance in IAM-based domains

https://aws.amazon.com/about-aws/whats-new/2026/05/sagemaker-catalog-iam-domains/

Business context, metadata, and data governance features that were previously available only in IdC-based domains are now also available in IAM-based domains. Business names, descriptions, and READMEs can be added to AWS Glue Data Catalog tables, with support for automatic generation of business names and descriptions using AI-generated metadata.

https://dev.classmethod.jp/articles/20260522-sagemaker-catalog-iam-domains/

2026/05/22 - Amazon SageMaker expands domain management across domain types

https://aws.amazon.com/about-aws/whats-new/2026/05/domain-management-iam-idc/

Domain management features previously available only for IAM domains have been extended to IAM Identity Center-based domains. Administrators can create and manage projects, configure execution roles, manage VPC settings, and handle cross-account access within the SageMaker Unified Studio portal without using the AWS Management Console.

https://dev.classmethod.jp/articles/20260522-amazon-smus-domain-management-iam-idc/

API Changes

2026/05/05 - Amazon SageMaker Service - 12 updated methods

https://awsapichanges.com/archive/changes/2d415e-api.sagemaker.html

The ml.p5.4xlarge instance type is now supported in multiple regions including Tokyo for JupyterLab and CodeEditor apps in SageMaker Studio.

2026/05/06 - Amazon SageMaker Service - 3 updated methods

https://awsapichanges.com/archive/changes/7068f3-api.sagemaker.html

SageMaker HyperPod now returns ImageVersionStatus in the responses of DescribeCluster, DescribeClusterNode, and ListClusterNodes, making it possible to verify whether cluster instances are running with the latest image version.

2026/05/13 - Amazon SageMaker Service - 27 updated methods

https://awsapichanges.com/archive/changes/74501c-api.sagemaker.html

Features added include an execution role session name mode that reflects user identity in Studio, Flexible Training Plans for Studio apps, and IAM-based access control for proprietary model artifacts (restricted model packages).

2026/05/19 - Amazon SageMaker Service - 4 updated methods

https://awsapichanges.com/archive/changes/7f1dfc-api.sagemaker.html

The ml.p5.4xlarge and ml.p5en.48xlarge instances are now supported on the SageMaker Notebook Instances platform.

2026/05/21 - Amazon SageMaker Service - 3 updated methods

https://awsapichanges.com/archive/changes/8bd61f-api.sagemaker.html

SageMaker domains can now disable the creation of the home EFS file system.

2026/05/27 - Amazon SageMaker Service - 7 updated methods

https://awsapichanges.com/archive/changes/0a7d57-api.sagemaker.html

Shared environments are now supported in Restricted Instance Groups (RIGs) for SageMaker HyperPod, enabling workload scheduling across RIGs and FSx sharing. Support for p6 instances in recommendation jobs has also been added.

Amazon DataZone

New Features & Updates

API Changes

2026/05/14 - Amazon DataZone - 8 new methods

https://awsapichanges.com/archive/changes/bd1fb2-datazone.html

Eight methods have been added to support notebook operations (including import/export) in SageMaker Unified Studio.

2026/05/22 - Amazon DataZone - 4 updated methods

https://awsapichanges.com/archive/changes/f23ff3-datazone.html

Support for VPC connections has been added.

2026/05/26 - Amazon DataZone - 3 updated methods

https://awsapichanges.com/archive/changes/e7ba6b-datazone.html

The resourceConfigurations and allowUserProvidedConfigurations fields have been added to the environment blueprint configuration API, enabling customers who have migrated from V1 to V2 domains to programmatically update resource configurations such as lineage schedules via the SDK.

AWS Clean Rooms

New Features & Updates

API Changes

2026/05/21 - AWS Clean Rooms Service - 13 updated methods

https://awsapichanges.com/archive/changes/8bd61f-cleanrooms.html

Collaboration creators can now update payment configurations without recreating the collaboration. When multiple payment candidates are configured per cost type, the analysis runner can specify the actual payer at the time of submission, enabling fine-grained control over billing.

2026/05/21 - AWS Clean Rooms ML - 15 updated methods

https://awsapichanges.com/archive/changes/8bd61f-cleanrooms-ml.html

AWS Clean Rooms ML similarly now supports updating payment configurations without recreating the collaboration, as well as specifying the payer at the time of submission.

In Closing

May 2026 was a month of significant advances in price-performance improvement and cost optimization, as exemplified by the general availability of AWS Graviton-based Amazon Redshift RG instances and the next-generation Amazon OpenSearch Serverless. At the same time, enhancements to the analytics experience with agentic/generative AI in mind were also notable, including Dataset Q&A and New Relic integration for Amazon Quick, IAM Identity Center support for Amazon SageMaker Data Agent, and Apache Spark 4.0.2 support for Amazon EMR. Additionally, Amazon SageMaker Unified Studio continues to expand governance features and domain management capabilities to IAM-based domains, steadily evolving as a governance foundation for the lakehouse.

If any of these updates catch your interest, please give them a try. We hope this article is helpful to someone.

Share this article

AWSのお困り事はクラスメソッドへ

Related articles