Advertisement · 728 × 90

Posts by

Preview
Support Materialized CTE by novikd · Pull Request #94849 · ClickHouse/ClickHouse Changelog category (leave one): New Feature Changelog entry (a user-readable short description of the changes that goes into CHANGELOG.md): Support Materialized CTE. Allow evaluating CTEs only on...

Materialized CTE support finally coming to Clickhouse. Goodbye unnecessary DB round trips and temp tables with consistency issues. github.com/ClickHouse/C...

1 month ago 0 0 0 1
Preview
GitHub - duckdb/duckdb-skills Contribute to duckdb/duckdb-skills development by creating an account on GitHub.

We're excited to announce duckdb-skills, a DuckDB plugin for Claude Code!

We think the embedded nature of DuckDB makes it a perfect companion for Claude in your local workflows.

Check out the repository at github.com/duckdb/duckd...

1 month ago 54 14 1 0
Post image

DuckDB just casually in the nVidia keynote. AI doesn't make databases irrelevant, it makes them more relevant than ever.

1 month ago 7 2 0 0
Post image

We released DuckDB v1.5!

This release comes with a “friendly CLI” client, a new (opt-in) PEG parser, support for VARIANT types and many lakehouse features. It also ships a new network stack, a reworked geospatial extension, Azure writes and an ODBC scanner.

Read more at duckdb.org/2026/03/09/a...

1 month ago 63 16 0 1
How Nvidia GPUs Compare To Google’s And Amazon’s AI Chips
How Nvidia GPUs Compare To Google’s And Amazon’s AI Chips YouTube video by CNBC

Good non-technical summary of some of the various chips out there. www.youtube.com/watch?v=RBmO...

2 months ago 0 0 0 0

Yeah, that or the opposite where it's biased toward a tool because of training data. I think you can suggest a tool in the global CLAUDEmd file but would be more seamless from cowork plugin.

2 months ago 0 0 0 0
Post image

Claude cowork w/ Opus 4.6 is definitely smart, but got stuck on a data task, I stopped it, pointed it to DuckDB, done instantly. LLMs still have much to learn 🤔

2 months ago 0 0 1 0
The $200M Machine that Prints Microchips:  The EUV Photolithography System
The $200M Machine that Prints Microchips: The EUV Photolithography System YouTube video by Branch Education

Great overview of chip manufacturing with EUV. We really do have supercomputers in our pockets, on our desk and on our wrists. www.youtube.com/watch?v=B248...

2 months ago 0 0 0 0
Post image

🎞️ The slide decks and talk recordings of last Friday's developer meeting are out! duckdb.org/events/2026/...

2 months ago 22 6 0 0
Advertisement
Post image

Great talks at South Bay Systems hosted at databricks on xNVMe, fast SSD query processing, and using NPUs for DB work. Much work using DuckDB extensions. Need for async I/O as bottleneck a common topic, mainly at larger scale. luma.com/8a54z94d?tk=...

2 months ago 1 0 0 0
Preview
NVIDIA CUDA-X Powers the New Sirius GPU Engine for DuckDB, Setting ClickBench Records | NVIDIA Technical Blog Sirius, an open-source GPU native SQL engine, achieved a new performance record on Clickbench—a widely used analytics benchmark. Developed by University of Wisconsin-Madison with support from NVIDIA…

GPU-powered analytical query engines going mainstream? Needs nVidia GPU and limited to memory for now, but neat use of Substrait+Arrow for interop. DuckDB still easier to run anywhere, but this is useful for acceleration if needed.
developer.nvidia.com/blog/nvidia-...

3 months ago 2 0 0 1

How are the compile times?

5 months ago 0 0 1 0
KEYNOTE: Hannes Mühleisen - Data Architecture Turned Upside Down | PyData Amsterdam 2025
KEYNOTE: Hannes Mühleisen - Data Architecture Turned Upside Down | PyData Amsterdam 2025 YouTube video by PyData

The PyData Amsterdam 2025 keynote “Minus Three Tier: Data Architecture Turned Upside Down” by @hannes.muehleisen.org is out now.

www.youtube.com/watch?v=DxwD...

5 months ago 25 4 1 1
SQL Arena Planner Ranking (November 2025)

SQL Arena Planner Ranking (November 2025)

New database leaderboard from Yellowbrick ranks the quality of DBMS optimizer estimates and plans. They only evaluate TPC-H for now and report results for Postgres + DuckDB + MSSQL: sql-arena.com/components/p...
Repo: github.com/sql-arena/db...
LinkedIn Group: www.linkedin.com/groups/15775...

5 months ago 14 3 1 0
Preview
[Future Data] Where We're Going, We Don't Need Rows: Columnar Data Connectivity with ADBC - Carnegie Mellon Database Group ADBC (Arrow Database Connectivity) is Apache Arrow’s answer to ODBC and JDBC:... Read More +

Today's Future Data Systems Seminar Speaker: Ian Cook (@ian.columnar.tech) will present @columnar.tech's work on Apache Arrow's database connectivity API (ADBC). ADBC is available in modern DBMSs. Zoom talk open to public at 4:30pm ET. YouTube video available after: db.cs.cmu.edu/events/futur...

6 months ago 15 8 0 1
Preview
[Future Data] Vortex: LLVM for File Formats - Carnegie Mellon Database Group Apache Parquet revolutionized columnar storage after its initial release in 2013, but... Read More +

Today's Future Data Systems Seminar Speaker: Will Manning (@willmanning.com) will present @spiraldb.com's Vortex file format. Vortex is now a @linuxfoundation.org project. Zoom talk open to public at 4:30pm ET. YouTube video available after: db.cs.cmu.edu/events/futur...

6 months ago 4 4 0 0
Benchmark Results for DuckDB v1.4 LTS DuckDB v1.4 LTS is both fast and scalable. In in-memory mode, it is the fastest system on ClickBench. In disk-based mode, it can run complex analytical queries on a dataset equivalent to 100 TB CSV fi...

Processing 100Tb of CSV files on a single machine is insane, little over 1hr per query, even if on a powerful AWS instance. Question heavily the need for complex systems when this is what’s possible now. Can’t wait for full write-up. Incredible work.

duckdb.org/2025/10/09/b...

6 months ago 8 1 0 0
Advertisement

It’s interesting the tradeoffs if the main goal is no operating cost and decent startup time. Definitely painful to develop on regularly but for a one and done this makes a lot of sense. I wonder if Rust compile times will come down further one day.

6 months ago 1 0 0 0
Post image

Taking the DuckDb hoodie on a trip. Not exactly Amsterdam but I’ve heard they like columnar databases here too.

6 months ago 3 0 0 0
Preview
Push Hive filtering into Glob() by xevix · Pull Request #18518 · duckdb/duckdb Summary Addresses part of #7620 for local filesystem. Part 1 of the work split off from the original PR #18430. The next part will handle fallback to eager loading in case of Hive issues. Push down...

I didn’t quite make it in time for Hive filtering lazy list to speed up filtering Hive folder with many partitions, but will pick up again before next release w/ luck 🙇‍♂️ github.com/duckdb/duckd...

7 months ago 2 0 0 0

Congrats to DuckDB team on LTS release w/ many great improvements! Hidden among them you can now use Hive filtering with read_blob, and SHOW TABLES FROM specific db w/o USE.

7 months ago 2 0 1 0
Post image

📈 DuckDB 1.4.0 is out! This is our first LTS release which comes with *one year of community support*. It also supports database encryption, the MERGE SQL statement and Iceberg writes.

For more details, read the announcement blog post at
duckdb.org/2025/09/16/a...

7 months ago 52 22 0 3
Preview
eBird in DuckDB I saw this post by the Clickhouse team which was doing a cool test of the eBird dataset from Cornell University, and wondered how DuckDB…

I tried loading eBird data (1.5B rows CSV ZIP) using DuckDB for fun, inspired by a Clickhouse blog post and a bit of curiosity. Both did well, DuckDB slightly faster querying and Parquet ingest, Clickhouse w/ native zip support, optimized for ingest and multitenancy. xevix.medium.com/ebird-in-duc...

7 months ago 3 0 0 0
Thumbnail: Saving Private Hash Join

Thumbnail: Saving Private Hash Join

Vol:18 No:8 → Saving Private Hash Join
👥 Authors: Laurens Kuiper, Paul Gross, Peter Boncz, Hannes Mühleisen
📄 PDF: https://www.vldb.org/pvldb/vol18/p2748-kuiper.pdf

8 months ago 14 4 0 1
Preview
Data Tool Component Sharing There are many partly overlapping tools in the data world, which is what inspired things like Calcite to have modular components for…

Is there too much duplicated effort in data tools? I sometimes wonder about this.

xevix.medium.com/data-tool-co...

7 months ago 0 0 0 0

Yeah, I don’t think MS is interested in 3rd party devs so much.

7 months ago 0 0 0 0
Advertisement

Unfortunately yes, I was already going to get one for something else and this put me over the edge. Maybe I'll also build a gaming rig one day in the distant future haha.

7 months ago 0 0 1 0
Post image

Compiling DuckDB on Windows 11 (ARM) using UTM VM on macOS to debug Windows compile issues. It's a shame msvc doesn't exist outside of Windows, mingw/clang don't work the same and cross-compiling is tricky. Compiling takes 5-10 mins (instead of 1-2 mins native), but it works 🎉!

7 months ago 3 0 1 0
Post image

Just the 1 day of data above is ~125GiB compressed, ~585GiB uncompressed. One month is about 3.75TiB compressed, or 17.5TiB. It makes sense this dataset is so popular for testing and analysis, wow.

8 months ago 6 0 0 0
Post image

Stretching DuckDB w/ Common Crawl, ~1.7B rows, ~300 parquet files. ~2-3s for single-column aggregations, ~2-3 mins to SUMMARIZE the data, peaking at ~12-14GB memory usage. Not exactly real-time, but the fact you can do this on a laptop with no server setups or Spark pipelines is still amazing.

8 months ago 44 9 1 1