Advertisement · 728 × 90

Posts by Mimoune Djouallah

Preview
Text-to-SQL using Semantic Models and Small language models The high-end version of this problem is mostly solved. GPT-4 or Claude Opus paired with a mature proprietary semantic layer like Microsoft Power BI’s will handle natural language queries reli…

Text-to-SQL using Semantic Models and Small language models

datamonkeysite.com/2026/03/31/t...

3 weeks ago 0 0 0 0
Post image

analytics as code, everything inside github

#vscode #dbt #duckdb #apacheiceberg #wasm #claude djouallah.github.io/analytics-as...

1 month ago 7 0 1 0
Preview
Building a Data Pipeline Using VSCode and Claude Out of Thin Air A complete data pipeline running on Microsoft Fabric that downloads public data, transforms it into a star schema, exports it as Delta Lake tables, and serves it through a Power BI semantic model w…

Building a Data Pipeline Using VSCode and Claude Out of Thin Air

#duckdb #onelake #vscode

datamonkeysite.com/2026/03/05/b...

1 month ago 4 0 0 0
Preview
a stuffed white cat is sitting on top of a box and asking why ? ALT: a stuffed white cat is sitting on top of a box and asking why ?

@hannes.muehleisen.org

2 months ago 0 0 0 0

Duckdb 1.5 is planned to be released next week and it the first time where apache iceberg support is substantially better than Delta , and I don't know how to feel about it.

2 months ago 4 0 1 0
Preview
GitHub - djouallah/delta_classic: DuckDB extension to attach a directory of Delta tables as a database DuckDB extension to attach a directory of Delta tables as a database - djouallah/delta_classic

I built a community extension for #DuckDB called Delta Classic.
Point it to a path containing Delta tables and it mounts them as a read-only database. all the code was written by #claude but i wrote the #python tests 👇

github.com/djouallah/de...

2 months ago 4 0 0 0
Preview
Querying a Onelake Table with RLS and CLS Using DuckDB’s MSSQL Extension Microsoft Fabric Lakehouse supports Row-Level Security (RLS) and Column-Level Security (CLS) . it works with trusted engines like PowerBI, Spark running inside Fabric etc, but if you use Python not…

when you apply RLS and CLS to a #onelake table, direct storage access get blocked, which means, #duckdb can not read it, there is a solution though using #SQLServer extension
datamonkeysite.com/2026/02/14/q...

2 months ago 3 0 0 0

It is already!!! But that doesn't mean all other format will disappear

2 months ago 1 0 0 0
Post image

Seriously impressive performance from #Polars with delta_sink to #onelake 🚀
Fully reproducible test:
github.com/djouallah/Fa...

3 months ago 1 0 1 0
Advertisement
Post image

a sql script that export #deltatable metadata from a #ducklkake db

never thought that #sql can do that !!!!

github.com/djouallah/Fa...

3 months ago 6 0 0 0
Post image

#duckdb #apacheiceberg is becoming good enough to be useful

the script will download zip files from the web, extract them, create 4 tables and generate a summary table for reporting all in pure #SQL

github.com/djouallah/Fa...

4 months ago 6 0 1 0
Second Look at Ducklake in Onelake
Second Look at Ducklake in Onelake YouTube video by DataMonkey

DuckDB just added native write support to OneLake. It is still early, and for now it only works through the blob API, There is no Delta write yet, but it is a very exciting first step.
#MicrosoftFabric #duckdb #Python #SQL
www.youtube.com/watch?v=V01W...

4 months ago 8 0 0 0
Querying Onelake Iceberg REST Catalog using DuckDB CLI with pure SQL
Querying Onelake Iceberg REST Catalog using DuckDB CLI with pure SQL YouTube video by DataMonkey

how to query #onelake iceberg catalog using pure sql with #duckdb
the query cross join a table from #MicrosoftFabric Warehouse and #Snowflake, why not :)
duckdb has no read only mode, so every query check the remote table state, which create a not so great experience
www.youtube.com/watch?v=DK8i...

4 months ago 9 0 0 0
Post image

Explaining how Python engines read and write #DeltaTable is not for the faint of heart.
The theory is everything will depends on the delta kernet rust for read and write, but we are not there yet
github.com/djouallah/Fa...
#duckdb #delta_rs #datafusion #chdb #daft #polars #rust #lakesail

5 months ago 5 0 0 0

Any system that allows exchanging real money for stuff with an element of chance is morally equivalent to a casino.

Corollary: Pokémon cards, Roblox, Labubus, and even claw machines should all be 18+

6 months ago 19 3 2 1
Post image

running #duckdb at 10 TB scale using #tpch like benchmark

#delta #onelake #singlenode #MicrosoftFabric
datamonkeysite.com/2025/10/19/r...

6 months ago 6 1 0 0
Post image

you are looking at #duckdb running tpch 1 TB with only 16 cores
it used to crash even with 64

pip install duckdb --upgrade is an act of faith basically

6 months ago 14 0 0 0
Post image
6 months ago 2 0 0 0
Post image

Put together a small python package duckrun :) point it at a folder of SQL/Python files, define a pipeline, and it will create Delta tables in #OneLake with #DuckDB and #delta_rs

github.com/djouallah/du...

6 months ago 10 1 2 0
Advertisement
Post image

actually #Microsoftfabric Datawarehouse automatically expose an Iceberg rest Catalog
thanks to #duckdb UI extension, you can see proper catalog

6 months ago 2 0 0 0
First Look at Onelake Iceberg REST Catalog
First Look at Onelake Iceberg REST Catalog YouTube video by DataMonkey

First Look at #onelake #apacheiceberg REST Catalog, please notice it is coming soon and not in production yet #MicrosoftFabric
www.youtube.com/watch?v=_QRE...

7 months ago 2 1 0 0

#pyconau @mimdj.bsky.social Life Beyond Pandas: Workflows with DuckDB, Daft, Polars, and Datafusion http://youtu.be/SnogunyMnE8

7 months ago 1 1 0 0
Post image

2 months ago, I got access to a beta release of #onelake #Apacheiceberg REST Catalog, first thing I run it with #duckdb 😀

7 months ago 6 0 0 0
Post image

storage format should not be tied to #SQL logic, #duckdb got it so right !!! but a bit sad that #deltalake is left behind :(

7 months ago 7 0 0 0

you know me too well :)

7 months ago 0 0 0 0
Preview
First Look at Incremental Framing in Power BI TL;DR: Incremental framing is like CDC to RAM :) It significantly improves cold-run performance of Direct Lake mode in some scenarios, there is an excellent documentation that explain everything in…

First look at incremental framing in #PowerBI directlake mode

#Parquet #OLAP #optimization #Performance #onelake
datamonkeysite.com/2025/09/09/f...

7 months ago 0 0 0 0
Preview
First Look at Geometry Types in Parquet Getting different parties in the software industry to agree on a common standard is rare. Most of the time, a dominant player sets the rules. Occasionally, however, collaboration happens organicall…

first look at geometry data type in #parquet
datamonkeysite.com/2025/09/07/f...

7 months ago 2 0 0 0
Post image

first #apacheiceberg table written by #duckdb

7 months ago 6 0 0 0
Post image

good news #duckdb added support for reading and writing geometry data type

Bad news : other Fabric engines don't support it yet, so it is not very useful for now :(

7 months ago 3 0 0 0
Advertisement

I nearly get the logic behind delta parquet, but clearly people did not like it 😅

7 months ago 0 0 1 0