Advertisement · 728 × 90

Posts by Julien Le Dem

Knock knock!

1 day ago 0 0 0 0
Post image

Thank you for letting me keynote day 2 of the Iceberg summit. I hope you all enjoyed it!

1 week ago 7 2 0 0

I don’t really remember but it’s quite possible :)

1 week ago 0 0 0 0
Post image

And that’s a wrap! Thank you for a great Iceberg Summit!
@thedanicafine.bsky.social, Matt and everyone involved in the organization.

1 week ago 3 0 0 0

I have LLM jokes, they write themselves.

4 weeks ago 4 1 0 0

Paging @dickc

4 weeks ago 1 0 0 0
Variant Type in Apache Parquet for Semi-Structured Data Introducing Native Variant Type in Apache Parquet

If you want to learn more about the Variant type in Parquet, Aihua Xu and @andrewlamb1111.bsky.social wrote a great blog post on the project blog.

parquet.apache.org/blog/2026/02...

1 month ago 8 3 0 0
Native Geospatial Types in Apache Parquet Native Geospatial Types in Apache Parquet

Great inaugural post about the geospatial types on the Parquet blog.

Thank you Jia Yu, Dewey Dunnington , Kristin Cowalcijk, Feng Zhang.

More posts coming !

parquet.apache.org/blog/2026/02...

2 months ago 8 2 0 0
Preview
Apache Arrow is 10 years old 🎉 The Apache Arrow project was officially established and had its first git commit on February 5th 2016, and we are therefore enthusiastic to announce its 10-year anniversary! Looking back over these 10...

Apache Arrow is ten years old!
I can't believe that Apache board meeting when the project was instated was 10 years ago!
This went fast!

arrow.apache.org/blog/2026/02...

2 months ago 8 1 0 0

Congratulations to @andrewlamb1111.bsky.social on becoming the latest Parquet PMC member!

Thank you, Andrew, for your leadership in the community. I look forward to our continued collaboration!

2 months ago 4 1 0 0
Advertisement

File formats!
Great references 😎

3 months ago 3 0 0 0

Happy new year

3 months ago 2 0 1 0

Bonne année!

3 months ago 4 0 0 0

And that applies to sparkling wine as well ;)

3 months ago 1 0 0 0

I always dip the fries in the tartar sauce with my fish and chips.

3 months ago 1 0 0 0

😒Is it a kissing book?

4 months ago 1 0 0 0
Preview
Column Storage for the AI era (illustration hand generated in 1958) “Column Storage for the AI era” © 2025 by Julien Le Dem is licensed under CC BY-NC-SA 4.0. To view a copy of this license, visit https://creativecommons.org/licen...

I also gave a talk on this topic.

Slides: docs.google.com/presentation...

Recording here: www.youtube.com/watch?v=S_ao...

4 months ago 2 0 1 0
Advertisement
Column Storage for the AI Era In the past few years, we’ve seen a cambrian explosion of new columnar formats, challenging the hegemony of Parquet: Lance, Fastlanes, Nimble, Vortex, AnyBlox, F3 (File Format for the Future). The thi...

In the past few years, we’ve seen a cambrian explosion of new columnar formats, challenging the hegemony of Parquet. Presumably, the design of yore is not going to cut it moving forward. I spent some time to understand a bit better how things actually changed.

sympathetic.ink/2025/12/11/C...

4 months ago 31 4 1 1
The advent of the open data lake | Julien Le Dem, AI By the Bay 2025
The advent of the open data lake | Julien Le Dem, AI By the Bay 2025 YouTube video by FunctionalTV

If you missed my talk "The advent of the open data lake" at AI By the Bay, the recording is now available.
ai.bythebay.io/talks/the-ad...

www.youtube.com/watch?v=xHGV...

4 months ago 3 0 0 0

Earthquake!

4 months ago 5 0 0 0

It turns out that Friday’s NYT’s mini crosswords was written by my 13-year-old.

4 months ago 3 0 1 0
Introducing DuckLake
Introducing DuckLake YouTube video by DuckDB

Parquet praise in the wild :) Nice chatting with you at Datacouncil Hannes!
www.youtube.com/watch?v=zeon...

4 months ago 3 2 0 0
Preview
The advent of the open data lake | AI By the Bay

As compute and storage can be efficiently decoupled, a common storage layer enables a vibrant ecosystem of on-demand tools specialized to specific use cases that avoids vendor lock-in.
ai.bythebay.io/talks/the-ad...

5 months ago 0 0 0 0

In this talk I’ll discuss the impact of the cloud and the advent of the Open Data Lake breaking silos to form the foundation of this ecosystem.

5 months ago 0 0 1 0

It’s been incredible to see the adoption of key components like Parquet, Arrow, Iceberg, and DataFusion. They provide an interoperability layer that enables using data without creating silos and duplication.

5 months ago 0 0 1 0

The components of databases, distributed or not, have been commoditized as individual parts that anyone can compose into use-case specific engines. Define your constraints and build a query engine that solves your problem.

5 months ago 0 0 1 0
Advertisement

Over the past decade, the big data ecosystem has matured and evolved from a melting pot of competing projects into a composable ecosystem organized around a few open source standards.

5 months ago 1 0 1 0
Preview
The advent of the open data lake | AI By the Bay

Come say hi, Wednesday at 10am. I'll be speaking at the AI By the Bay Conference about "The Advent of The Open Data Lake".

ai.bythebay.io/talks/the-ad...

5 months ago 4 0 1 0
Circular limit. Yellow and green

Circular limit. Yellow and green

Step 2
3 more to go

5 months ago 0 0 0 0
Circle limit green layer

Circle limit green layer

Take 2 step 1

5 months ago 1 0 1 0