top of page
Optimizing Data Lakehouses with Starburst Data
Optimizing Data Lakehouses with Starburst Data

Mon, Oct 07

|

Virtual

Optimizing Data Lakehouses with Starburst Data

Revolutionize your Data Lakehouse efficiency with Starburst Galaxy! Unlock lightning-fast analytics and seamless scalability

Time & Location

Oct 07, 2024, 9:00 AM GMT+1 – Oct 09, 2024, 5:00 PM GMT+1

Virtual

About the event

Be a part of the ultimate event that's set to transform the way you optimize data lakehouses with Starburst Data! Whether you're a data engineer, data architect, or an experienced data analyst/scientist, this is your chance to revolutionize your data strategies.

Join us for a comprehensive dive into "Optimizing Data Lakehouses with Starburst Data." After attending this hands-on event, you would be able to:

  • Gain insights into using Starburst Galaxy as a unified access point for multiple data sources
  • Execute federate queries seamlessly across all data sources.
  • Understand the intricacies of query execution within a Starburst cluster and explore the power of Hive and Iceberg table formats.
  • Master the art of constructing, populating, querying, and modifying partitioned tables.
  • Discover strategies to improve query performance through intelligent file size, format, and hierarchy decisions.
  • Unlock the potential of the Cost-based optimizer and learn to read query plans to ensure optimizations and troubleshoot potential issues.
  • Create robust role-based access control policies for efficient table operations.

This event is designed for data professionals seeking to elevate their data engineering and optimization skills. If you have an intermediate understanding of SQL, this is the perfect opportunity to further your expertise. Don't miss out on building a cutting-edge data engineering pipeline with Starburst Galaxy. Take the leap towards data excellence!

Secure your spot today and unlock the potential of your data strategies.

Topics Covered

Introducing Starburst Galaxy

• Overview 

• Architecture 

• Web UI 

• Connectors & Catalogs 

• Client tools integrations

Data Lake Performance

• Foundations and use case 

• Limit Data Exchanges 

• File format options 

• Small files problem 

• Partitioning & bucketing

Table formats

• Moving beyond Hive 

• Compare/contrast alternatives 

• Explore Delta Lake

Apache Iceberg

• Creating tables 

• Insert, update & delete 

• CDC with merge 

• Schema & partition evolution 

• Snapshots & compaction

Parallel processing

• Divide & conquer 

• Beyond single-stage queries

Cost-based optimizer

• Benefits of statistics 

• Query plan analysis 

• Using EXPLAIN/EXPLAIN PLAN

Access control

• Configuration options 

• Role-based access control

Data pipelines

• Definition & differentiation 

• Reference architecture

DATES & TIMES: Oct 07, 2024, 9:00 AM GMT+1 – Oct 07, 2024, 5:00 PM GMT+1

                                 Oct 08, 2024, 9:00 AM GMT+1 – Oct 08, 2024, 5:00 PM GMT+1

                                 Oct 09, 2024, 9:00 AM GMT+1 – Oct 09, 2024, 5:00 PM GMT+1

Tickets

  • Optimizing Data Lakehouses

    Sale ends: Oct 04, 11:50 PM GMT+1

    *Inclusive of all taxes

    $1,800.00

Total

$0.00

Share this event

bottom of page