top of page
Getting started with Apache Iceberg®
Getting started with Apache Iceberg®

Sat, May 25

|

Virtual

Getting started with Apache Iceberg®

Analyze your massic data with high-performance and reliability

Time & Location

May 25, 2024, 9:00 AM – 5:00 PM GMT+1

Virtual

About the event

In the fast-paced world of data management, staying ahead of the curve is crucial to harness the full potential of your data assets. Apache Iceberg® is a groundbreaking technology that is revolutionizing the way data is stored, managed, and queried within data lakes. Welcome to our exclusive event & master data engineering with Apache Iceberg®". With hands-on sessions you can unlock advanced big data analytics with Apache iceberg® for your organizations, virtually from anywhere. Data engineering with Apache Iceberg® is as smooth as it can get!

Course Highlights

  • Unlock the Apache Iceberg® Advantage Leverage the unparalleled advantages by learning how this cutting-edge technology can empower you to take control of your data like never before. Unlock advanced big data analytics with Apache iceberg® and excite your customers.
  • Explore the Intricacies of Apache Iceberg® Architecture: Data engineering with Apache Iceberg® starts with exploring Apache Iceberg's® architecture and understanding its core concepts that make it a game-changer.
  • Craft Efficient Data Queries: Efficiency is the name of the game. Discover how to write data queries that not only fetch results swiftly but also optimize your data lakes for peak performance.
  • Unravel Apache Iceberg® Internals: Go beneath the surface and uncover the inner workings of Apache Iceberg®. Gain insights into its internal mechanisms, enabling you to make informed decisions when working with your data to know what makes Data engineering with Apache Iceberg® a cake walk!.
  • Master the Art of Management, Monitoring, and Optimization: Managing data is an art, and we'll teach you the brushstrokes. Learn how to effectively manage, monitor, and optimize Apache Iceberg® to ensure your data lakes run like well-oiled machines.
  • Hands-on Implementation: Theory is important, but practice brings real progress. Roll up your sleeves and get ready to dirty you hands in an exciting hands-on session to solidify your understanding of Apache Iceberg®.

Prerequisites

Before diving into this transformative journey, make sure you have:

  • Familiarity with basic SQL concepts.
  • Basic Python programming skills.
  • Knowledge of Hadoop & Apache Spark will be helpful.

Who Should Attend?

This event is tailor-made for:

  • Data professionals aiming to supercharge their performance by leveraging Apache Iceberg®.
  • Data Engineers looking to establish or transition form Data lakes to Data Lakehouses using Apache Iceberg®.

Why You Shouldn't Miss This Event

By attending "Getting started with Apache Iceberg®," you'll gain:

  • In-Depth Knowledge: Walk away with a profound understanding of Apache Iceberg® and how it can elevate your data management game.
  • Practical Skills: Acquire hands-on skills that you can immediately apply in your data projects.
  • Industry Insights: Stay ahead of the curve by learning about the latest advancements in data engineering.
  • Networking Opportunities: Connect with like-minded professionals and expand your network in the data industry.
  • Career Advancement: Enhance your career prospects by adding Apache Iceberg® expertise to your skill set.

Don't miss out on this opportunity to level up your data engineering game. Join us for "Getting Started with Apache Iceberg®" and become a data management maestro. Reserve your spot today and unlock the power of Apache Iceberg®!

Topics Covered

Understanding Apache Iceberg®

  • Evolution of Data Platforms
  • Understanding Data Lakes and Technologies available
  • Challenges with Data Lakes
  • Introduction to Apache Iceberg®
  • Benefits of Apache Iceberg®
  • Apache Iceberg® vs Delta Lake vs Hudi
  • When to choose Apache Iceberg® over other formats for data lake storage?

Apache Iceberg Architecture

  • Overview of Apache Iceberg® architecture
  • Various Apache Iceberg® Components
  • How does Apache Iceberg® handle metadata and data versioning?
  • Integration of Apache Iceberg® with key data processing engines like Starburst, Spark

Setting Up Apache Iceberg

  • Installation and setup of Apache Iceberg®
  • Configuring metadata storage for Apache Iceberg® tables

Creating Apache Iceberg Tables

  • Apache Iceberg® table structure
  • Step-by-step guide to creating Apache Iceberg® tables using
    • Apache Spark
    • Presto/Trino on Starburst
    • Hive

Writing and Reading Data

  • Inserting data into Apache Iceberg® tables
    • Batch inserts
    • Streaming inserts
    • Upserts
  • Efficiently querying Apache Iceberg® tables
  • Demonstrating how Apache Iceberg's® data layout optimization enhances query performance

Internals of Apache Iceberg®

  • The Iceberg Catalog
  • The Metadata Layer
    • Metadata File
    • Manifest List
    • Manifest File
  • The Data Layer
  • A look under the covers when CRUDing

Management, Monitoring and Optimization

  • Managing schema evolution
  • Enabling partitions in Iceberg Tables
    • Hidden Partitioning
    • Partition Layer Evolution
  • Understanding Time Travel
  • Version Rollback
  • Data Compaction
  • Metrics and Alerts
  • Monitoring Iceberg Tables

Tickets

  • Apache Iceberg®

    Sale ends: May 22, 11:50 PM GMT+1

    *Inclusive of all taxes

    $440.00

Total

$0.00

Share this event

bottom of page