Advertisement · 728 × 90

Posts by Georg Heiler

Grafeo - High-Performance Graph Database - Grafeo A high-performance, embeddable graph database with a Rust core and no required C dependencies. Python, Node.js, Go, C, C#, Dart and WebAssembly bindings. GQL (ISO standard) query language.

Nice #rust and #graphs grafeo.dev good fit for #databases

3 weeks ago 0 0 0 0

Just published in JOSS: 'Discovering the SUPER in computing - dagster-slurm for reproducible research on HPC' https://doi.org/10.21105/joss.09795

3 weeks ago 2 2 0 1

I hope that github.com/ascii-supply... can help to a) simplify access to sovereign AI compute for newcomers and b) help experts with tigher observability.

3 weeks ago 0 0 0 0
Preview
Metaxy + Dagster-Slurm for Efficient Multimodal Pipelines | Georg Heiler How to combine Metaxy, Dagster-Slurm, Docling, and Ray to run incremental multimodal pipelines on sovereign AI infrastructure.

See here in action how EU sovereign HPC AI + ray allow to easily scale document processing with docling georgheiler.com/2026/02/22/m...

1 month ago 0 0 0 0
Preview
Metaxy + Dagster-Slurm for Efficient Multimodal Pipelines | Georg Heiler How to combine Metaxy, Dagster-Slurm, Docling, and Ray to run incremental multimodal pipelines on sovereign AI infrastructure.

Multimodal data handling is different! Especially with regards to complexity and cost. Daniel Gafni and I built Metaxy (docs.metaxy.io) to simplify Efficient Multimodal Pipelines

1 month ago 0 0 1 0
Magenta Telekom Case Study | Dagster Learn how Magenta Telekom replaced fragmented, manual data workflows with a modular, Dagster-powered platform that reduced onboarding from 3 months to 1 day and laid the foundation for AI-driven decis...

Read how Magenta (👋 @geoheil.com and @milicevica23.bsky.social ) uses Dagster+ to feel a bit of that joy: dagster.io/customers/ho...

1 month ago 0 1 0 0
Preview
Introduction - Metaxy A high level introduction to Metaxy.

And docs.metaxy.io/main/ also integrating with I.e. lance for again a different kind of versioning

1 month ago 1 0 1 0
Preview
Branching and Shallow Cloning in Lance: Towards a "Git for AI Data" A deep dive into how table formats handle version management for ML/AI experimentation, and how Lance unifies branching, tagging, and shallow clone on top of …

Don’t forget lance lancedb.com/blog/branchi... for multimodal fit for data

1 month ago 1 0 1 0
Advertisement
The stacking workflow Stacked PRs. Stacked diffs. Stacked changes. A better workflow to manage pull requests.

great implementation for www.stacking.dev

1 month ago 1 0 0 0
Preview
GitHub - cesarferreira/stax: The fastest stacked-branch workflow for Git. Interactive TUI, smart PRs, safe undo. Written in Rust. The fastest stacked-branch workflow for Git. Interactive TUI, smart PRs, safe undo. Written in Rust. - cesarferreira/stax

github.com/cesarferreir... #rust #stacking is awesome

1 month ago 1 0 1 0
Hannes Werthner "The Role of Computer Science in the Age of AI (or Digital Humanism?)"
Hannes Werthner "The Role of Computer Science in the Age of AI (or Digital Humanism?)" YouTube video by Digital Humanism

www.youtube.com/watch?v=DGA0...

1 month ago 0 0 0 0

An European sovereign GPU cloud does not come out of nowhere maybe this project can support making HPC systems more accessible. The recently started projects will take a long time to complete. I hope github.com/ascii-supply... will help.

3 months ago 0 0 0 0
Modern Architecture 101 for New Engineers & Forgetful Experts - Jerry Nixon - NDC Copenhagen 2025
Modern Architecture 101 for New Engineers & Forgetful Experts - Jerry Nixon - NDC Copenhagen 2025 YouTube video by NDC Conferences

#great talk www.youtube.com/watch?v=WRg1... on #architecture for #engineers

3 months ago 1 0 0 0
Scaling data pipelines @ Magenta Telekom - OSA Con 2025
Scaling data pipelines @ Magenta Telekom - OSA Con 2025 Presented by Georg Heiler at OSA Con 2025. Magenta Telekom ingests many terabytes of new data every day, and every downstream consumer wants it immediately. The real bottleneck turned out not to be…

The OSACon recordings are available now www.youtube.com/watch?v=31LH...

4 months ago 1 0 0 0
Preview
PyTogether - Real-Time Collaborative Python IDE Collaborative Python IDE for students, educators, and teams.

#python #together pytogether.org nice

4 months ago 2 0 0 0
Advertisement
Reconstructing History with XTDB (Jeremy Taylor + James Henderson)
Reconstructing History with XTDB (Jeremy Taylor + James Henderson) YouTube video by CMU Database Group

interesting new #timeseries #database #xtdb xtdb.com see the great #cmu video for details www.youtube.com/watch?v=zzqD...

4 months ago 1 0 0 0
Preview
GitHub - mxschmitt/action-tmate: Debug your GitHub Actions via SSH by using tmate to get access to the runner system itself. Debug your GitHub Actions via SSH by using tmate to get access to the runner system itself. - mxschmitt/action-tmate

github.com/mxschmitt/ac... #tmux #action-tmate - really neat #debugging

5 months ago 2 0 0 0
DEF CON 33 - Exploiting Shadow Data from AI Models and Embeddings - Patrick Walsh
DEF CON 33 - Exploiting Shadow Data from AI Models and Embeddings - Patrick Walsh This talk explores the hidden risks in apps leveraging modern AI systems—especially those using large language models (LLMs) and retrieval-augmented generation (RAG) workflows. We demonstrate how…

A great video about LLMs and the data they can provide to the world - even though perhaps they should not | www.youtube.com/watch?v=O7BI... - DEF CON 33 - Exploiting Shadow Data from AI Models and Embeddings - Patrick Walsh

5 months ago 0 0 0 0
Preview
Introducing Apache Fory™ Rust: A Versatile Serialization Framework for the Modern Age | Apache Fory™ TL;DR: Apache Fory Rust is a blazingly-fast, cross-language serialization framework that delivers ultra-fast serialization performance while automatically handling circular references, trait objects, ...

#rust #fory #serialization fory.apache.org/blog/fory_ru...

5 months ago 1 0 0 0
Post image

The real AI win isn't superhuman agents, it's scaled mediocrity.
Doing less with less at massive scale unlocks tasks that were once uneconomical.
The magic is in aggregate value, not perfect outputs. Empower teams with practical AI tools. 
🔗 dlthub.com/blog/the-real-ai-win-sca...

5 months ago 3 1 1 0
Preview
GitHub - l-mds/dsc-dach-tutorial-dagster: Introduction to using and scaling dagster Introduction to using and scaling dagster. Contribute to l-mds/dsc-dach-tutorial-dagster development by creating an account on GitHub.

#dsc-dach #data it was a. pleasure to share an introductory workshop about spark and data pipelines. Thank you Aleks for the great collaboration!

Find the workshop files here if you want to follow along github.com/l-mds/dsc-da...

5 months ago 0 0 0 0
DuckLake: Learning from Cloud Data Warehouses to Build a Robust “Lakehouse” (Jordan Tigani)
DuckLake: Learning from Cloud Data Warehouses to Build a Robust “Lakehouse” (Jordan Tigani) YouTube video by CMU Database Group

#duckdb #ducklake #cmu www.youtube.com/watch?v=z2Gh...

6 months ago 8 1 0 0
Preview
feat: build SLURM integration for dagster by HPicatto · Pull Request #19 · ascii-supply-networks/dagster-slurm Type of Change feat: New feature fix: Bug fix docs: Documentation style: Code style refactor: Code refactor perf: Performance improvement test: Tests chore: Maintenance Description adds ...

Something about super and computing in the making anyone daring out there who wants to explore? Or folks who want to exchange ideas about SLURM, HET jobs and advanced resource management? github.com/ascii-supply...

6 months ago 0 0 0 0

good point. I think I only have < 1 hour so BI/vis will have to wait a bit. But otherwise it would be a great addition

6 months ago 1 0 0 0

#duckdb #dagster #ray #ducklake

6 months ago 0 0 1 0
Advertisement
Preview
Simple Sovereign Scalable Data Stack | Georg Heiler Tired of cloud lock-in and surprise bills? This talk shows how to build a fast, portable analytics stack around DuckDB and Dagster. Along the way of our journey to sovereignty and scale we touch on…

Simple Sovereign Scalable Data Stack georgheiler.com/event/tdwi-2... precursor: pypi.org/project/dags... github.com/dagster-io/c... if you want to see this in action join in Nürnberg or Vienna for some sovereign, scalable data talks in the coming weeks

6 months ago 6 0 2 0
When the duck quacks: Multimodal querying with FlockMTL
When the duck quacks: Multimodal querying with FlockMTL YouTube video by DuckDB

#duckdb #multimodal #rag www.youtube.com/watch?v=2qSZ... blobs.duckdb.org/events/duckd...

6 months ago 3 0 0 0
Katharine Jarmul - Anonymization: Why is it so hard? (PyData Prague #27)
Katharine Jarmul - Anonymization: Why is it so hard? (PyData Prague #27) YouTube video by PyData

#compliance #anonymization #python www.youtube.com/watch?v=EqQd...

6 months ago 0 0 0 0
Introducing SedonaDB: A single-node analytical database engine with geospatial as a first-class citizen - Apache Sedona Apache Sedona is a cluster computing system for processing large-scale spatial data. Sedona extends existing cluster computing systems, such as Apache Spark, Apache Flink, and Snowflake, with a set of...

#gis #medium-data #sedona #rust #datafusion sedona.apache.org/latest/blog/...

6 months ago 0 0 0 0
Post image

📈 DuckDB 1.4.0 is out! This is our first LTS release which comes with *one year of community support*. It also supports database encryption, the MERGE SQL statement and Iceberg writes.

For more details, read the announcement blog post at
duckdb.org/2025/09/16/a...

6 months ago 52 22 0 3