Big Data – Page 6

Grelup
December 12, 2024
0 Comments

Use open table format libraries on AWS Glue 5.0 for Apache Spark

Open table formats are emerging in the rapidly evolving domain of big data management, fundamentally altering the landscape of data storage and analysis. These formats, exemplified by Apache Iceberg, Apache … Read more

Grelup
December 12, 2024
0 Comments

Enforce fine-grained access control on data lake tables using AWS Glue 5.0 integrated with AWS Lake Formation

AWS Glue 5.0 supports fine-grained access control (FGAC) based on your policies defined in AWS Lake Formation. FGAC enables you to granularly control access to your data lake resources at … Read more

Grelup
December 12, 2024
0 Comments

Simplify data access for your enterprise using Amazon SageMaker Lakehouse

Organizations are increasingly using data to make decisions and drive innovation. However, building data-driven applications can be challenging. It often requires multiple teams working together and integrating various data sources, … Read more

Grelup
December 11, 2024
0 Comments

How REA Group approaches Amazon MSK cluster capacity planning

This post was written by Eunice Aguilar and Francisco Rodera from REA Group. Enterprises that need to share and access large amounts of data across multiple domains and services need … Read more

Grelup
December 11, 2024
0 Comments

An integrated experience for all your data and AI with Amazon SageMaker Unified Studio (preview)

Organizations are building data-driven applications to guide business decisions, improve agility, and drive innovation. Many of these applications are complex to build because they require collaboration across teams and the … Read more

Grelup
December 10, 2024
0 Comments

Implement historical record lookup and Slowly Changing Dimensions Type-2 using Apache Iceberg

In today’s data-driven world, tracking and analyzing changes over time has become essential. As organizations process vast amounts of data, maintaining an accurate historical record is crucial. History management in data … Read more

Grelup
December 10, 2024
0 Comments

Build Write-Audit-Publish pattern with Apache Iceberg branching and AWS Glue Data Quality

Given the importance of data in the world today, organizations face the dual challenges of managing large-scale, continuously incoming data while vetting its quality and reliability. The importance of publishing … Read more

Bayte
October 20, 2024
0 Comments

Get started with Amazon DynamoDB zero-ETL integration with Amazon Redshift

We’re excited to announce the general availability (GA) of Amazon DynamoDB zero-ETL integration with Amazon Redshift, which enables you to run high-performance analytics on your DynamoDB data in Amazon Redshift … Read more

Bayte
October 19, 2024
0 Comments

Single sign-on SSO for Amazon OpenSearch Service using SAML and Keycloak

A standard use case for customers is to integrate existing identity providers (IdPs) with Amazon OpenSearch Service. OpenSearch Service offers built-in support for single sign-on (SSO) authentication for OpenSearch Dashboards, … Read more

Bayte
October 18, 2024
0 Comments

A customer’s journey with Amazon OpenSearch Ingestion pipelines

This is a guest post co-written with Mike Mosher, Sr. Principal Cloud Platform Network Architect at a multi-national financial credit reporting company. I work for a multi-national financial credit reporting … Read more