[Update] AWS Launches Amazon Redshift Integration with Apache Spark#reinvent

2022.11.30

この記事は公開されてから1年以上経過しています。情報が古い可能性がありますので、ご注意ください。

Hello Everyone!

AWS announced Amazon Redshift Integration with Apache Spark at the Keynote Session of re:Invent 2022.

Amazon Redshift Integration with Apache Spark

Amazon Redshift Integration with Apache Spark was made GA[General Available] at the Keynote session. With this engineers can run Apache Spark applications on Redshift data, i.e, Spark applications can read and write data from Amazon Redshift cluster.

 

 

Refer to the following blog for the official announcement about Amazon Redshift Integration with Apache Spark.

 

Redshift integration with Apache Spark is available for EMR [starting from EMR 6.9] - EC2, EKS, and Serverless, AWS Glue[starting from Glue 4.0] and Amazon Redshift in all regions. With this integration third-party Apache Spark connector is no longer required for using Spark applications with Amazon EMR, Amazon SageMaker and AWS Glue. And also Apache Spark runs 3x faster on AWS.

 

 

Refer to the following blog to know how to integrate Apache spark with Redshift.

Summary

Redshift Integration with Apache Spark makes it easier to build and run Spark applications on Redshift.

Reference :

Amazon Redshift Integration for Apache Spark

Using Amazon Redshift integration for Apache Spark with Amazon EMR