
Data Lineage: Understanding Data Lineage at Scale with Julien Le Dem


Big Data has exploded over the past decade as cloud computing and more efficient hardware have made scaling essentially limitless. Products like Uber revolve entirely around analyzing data to provide rides. According to an EMC/IDC study, there was approximately 5.2 TB of data for every person in 2020. That estimate predates the transition to remote work, so the actual figure is likely much higher.

The term “data lineage” refers to the collection, origin, storage, transfer, and use of data over time. Given the size of the Big Data industry and related industries, maintaining thorough data lineage can be very difficult even within small companies, and it becomes especially challenging at scale. What innovative tools make understanding all this information possible? Can data really continue growing at this rate?

In this episode, we talk with Julien Le Dem, CTO and Co-Founder at Datakin. We discuss the challenges, the available tools, and the future of big data and data lineage.

Sponsorship inquiries: sponsor@softwareengineeringdaily.com

The post Data Lineage: Understanding Data Lineage at Scale with Julien Le Dem appeared first on Software Engineering Daily.



Software Engineering Daily

Technical interviews about software topics.
