Mon, Nov 04
| Virtual
Optimizing Data Lakehouses with Starburst Data
Revolutionize your data lakehouse efficiency with Starburst Galaxy! Unlock lightning-fast analytics and seamless scalability.
Time & Location
Nov 04, 2024, 9:00 AM GMT – Nov 06, 2024, 5:00 PM GMT
Virtual
About the Event
Be a part of the ultimate event that's set to transform the way you optimize data lakehouses with Starburst Data! Whether you're a data engineer, data architect, or an experienced data analyst/scientist, this is your chance to revolutionize your data strategies.
Join us for a comprehensive dive into "Optimizing Data Lakehouses with Starburst Data." After attending this hands-on event, you will be able to:
- Gain insights into using Starburst Galaxy as a unified access point for multiple data sources.
- Execute federated queries seamlessly across all data sources.
- Understand the intricacies of query execution within a Starburst cluster and explore the power of Hive and Iceberg table formats.
- Master the art of constructing, populating, querying, and modifying partitioned tables.
- Discover strategies to improve query performance through intelligent file size, format, and hierarchy decisions.
- Unlock the potential of the cost-based optimizer and learn to read query plans to verify optimizations and troubleshoot potential issues.
- Create robust role-based access control policies for efficient table operations.
This event is designed for data professionals seeking to elevate their data engineering and optimization skills. If you have an intermediate understanding of SQL, this is the perfect opportunity to further your expertise. Don't miss out on building a cutting-edge data engineering pipeline with Starburst Galaxy. Take the leap towards data excellence!
Secure your spot today and unlock the potential of your data strategies.
Topics Covered
Introducing Starburst Galaxy
• Overview
• Architecture
• Web UI
• Connectors & Catalogs
• Client tools integrations
Data Lake Performance
• Foundations and use case
• Limit Data Exchanges
• File format options
• Small files problem
• Partitioning & bucketing
Table formats
• Moving beyond Hive
• Compare/contrast alternatives
• Explore Delta Lake
Apache Iceberg
• Creating tables
• Insert, update & delete
• CDC with merge
• Schema & partition evolution
• Snapshots & compaction
Parallel processing
• Divide & conquer
• Beyond single-stage queries
Cost-based optimizer
• Benefits of statistics
• Query plan analysis
• Using EXPLAIN/EXPLAIN PLAN
Access control
• Configuration options
• Role-based access control
Data pipelines
• Definition & differentiation
• Reference architecture
DATES & TIMES: Nov 04, 2024, 9:00 AM GMT – Nov 04, 2024, 5:00 PM GMT
Nov 05, 2024, 9:00 AM GMT – Nov 05, 2024, 5:00 PM GMT
Nov 06, 2024, 9:00 AM GMT – Nov 06, 2024, 5:00 PM GMT
Tickets
Optimizing Data Lakehouses
*Inclusive of all taxes
US$1,800.00 (sale ended)