Advertisement · 728 × 90

Posts by Nikhil Benesch

They’re calling it our most boring feature yet.

boring(n): mundane; works exactly as expected; the highest praise you can give a database

8 months ago 4 0 0 0
Post image

tpuf Python client went async today! 🐡 ❤️ 🐍

Async client perf slightly edges out sync client perf under heavy query load.

github.com/turbopuffer/...

10 months ago 5 0 0 0

Big day today at turbopuffer HQ.

11 months ago 9 1 0 0
Post image

when we said "coming soon" we really meant it

now puffin' in aws-us-east-1 and aws-eu-central-1

1 year ago 3 1 0 0

I did! Couldn't pass up the opportunity to work with this team!

1 year ago 1 0 0 0
Post image

Things move fast at turbopuffer. Now puffin' in aws/us-east-1 and aws/eu-central-1 too.

1 year ago 3 0 1 0

I've been dreaming about conditional writes on S3 for years. I couldn't have asked for a better way to celebrate than getting to ship tpuf on AWS. 🐡

1 year ago 13 1 0 0
Preview
Try C4A, the first Google Axion Processor | Google Cloud Blog The custom Arm-based processor is designed for general-purpose workloads like web and app servers, databases, analytics, CPU-based AI, and more.

C4A: cloud.google.com/blog/product...

1 year ago 1 0 1 0

It’s a rare day that my love of going to battle with build systems pays off like this. Kudos to GCP for a very impressive new SKU. 🐡💨

1 year ago 13 0 0 0
Advertisement

Couldn't be more excited to be joining the team at @turbopuffer.bsky.social. 🐡💨

1 year ago 17 0 0 0

Just catching up on my NULL BITMAPS and this is easily the best intuition for write skew I've ever seen described. The analogy to merge skew in a codebase is genius.

1 year ago 9 0 0 0

Well well well: www.crunchydata.com/blog/pg_incr...

Incremental pipelines come to Postgres via Crunchy Data! This is like "dbt incremental", not true incremental view maintenance like @materialize.com or Snowflake's dynamic tables, but it's a neat step towards IVM.

1 year ago 22 3 0 0

Of course! Will be very excited to give the new DynamoDB-free SlateDB a spin.

1 year ago 1 0 0 0
Post image Post image

The 6th (and last of 2024!) NYC Systems talks are next Thursday! We've got @jaronoff.com of Omlet and @benesch.bsky.social of Materialize speaking. :)

nycsystems.xyz/december-202...

1 year ago 20 2 0 0

Same! I’ve been using Amethyst for over ten years at this point. When I discovered the project in 2013 I never expected that ianyh would still be lovingly maintaining it a decade later. One of my most loved pieces of software, for sure.

1 year ago 1 0 1 0

tl;dr the 10x TPS claim actually *is* based on a small but novel optimization inside of S3! Table buckets understand Iceberg naming conventions and adjust S3's rate limiting so that you start with budget for 55k/35k GETs/PUTs per sec rather than the general purpose default of 5.5k/3.5k.

1 year ago 6 0 0 0
AWS re:Invent 2024 - [NEW LAUNCH] Store tabular data at scale with Amazon S3 Tables (STG367-NEW)
AWS re:Invent 2024 - [NEW LAUNCH] Store tabular data at scale with Amazon S3 Tables (STG367-NEW) YouTube video by AWS Events

It just occurred to me that while the 3x query performance claim is based on a comparison to uncomplicated tables, I hadn't actually seen the evidence for the 10x TPS claim.

After some digging I just found the explanation in the re:Invent talk (start at 14:24): youtu.be/1U7yX4HTLCI?...

1 year ago 2 0 1 0
Advertisement

Exciting! I'm so here for it. 👀

1 year ago 0 0 0 0

Disclaimer: I'm not involved with DuckDB, ClickHouse, or Iceberg, so take this all with a grain of salt.

1 year ago 1 0 1 0
Preview
More Than 2x Faster Hashing in ClickHouse Using Rust Rust’s rich type system and ownership model guarantee memory-safety and thread-safety. There are a fairly large number of useful libraries written on it, so we considered using them in ClickHouse.

Rust might be feasible! In fact I think DuckDB's Delta extension is based on a Rust library. And ClickHouse is starting to integrate some Rust too: clickhouse.com/blog/more-th...

Sadly Rust's Iceberg library is still relatively immature (e.g. no support for writes: github.com/apache/icebe...).

1 year ago 4 0 1 0
Preview
Lies we tell ourselves to keep using Golang In the two years since I've posted I want off Mr Golang's Wild Ride, it's made the rounds time and time again, on Reddit, on Lobste.rs, on HackerNews, and elsewhere. And every time, it elicits the ...

Calling Go code from another language is hard enough that' it's rarely done—especially in high performance contexts. (🌶️ alert, but fasterthanli.me/articles/lie... covers this—see "the only good boundary with Go is a network boundary.")

1 year ago 4 0 1 0

Nice, looking forward to it!

1 year ago 0 0 0 0
Preview
Friendlier SQL with DuckDB DuckDB offers several extensions to the SQL syntax. For a full list of these features, see the Friendly SQL documentation page.

I’m not 100% sure on the chronology but I believe DuckDB originated many of these pragmatic SQL UX improvements: duckdb.org/2022/05/04/f...

1 year ago 2 0 0 0
Preview
GitHub - apache/iceberg-cpp: Apache Iceberg C++ Apache Iceberg C++. Contribute to apache/iceberg-cpp development by creating an account on GitHub.

Keep an eye on github.com/apache/icebe...! AFAIU the lack of a C++ Iceberg library is what’s been holding back full Iceberg support in DuckDB and ClickHouse.

1 year ago 3 1 1 0
Preview
Databases Are Commodities. Now What? How will vendors differentiate when databases are commoditized? I've got three ideas.

@chris.blue you might be interested in this. Something like this fits in very neatly to your views on the commoditization of the PostgreSQL dialect/protocol (materializedview.io/p/databases-...).

1 year ago 4 0 1 0

Indeed. That's one of the more cursed stories I've heard. Might be worth a uv bug report, if the constraints are indeed solvable!

1 year ago 0 0 0 0
uv

Have you tried `uv` yet? docs.astral.sh/uv/

It's night and day compared to pip.

1 year ago 1 0 1 0
Advertisement

👋🏽 Well hello, Spiral.

1 year ago 1 0 0 0

(for posterity for anyone else following along: it seems to be sufficient to define the external consistency guarantee in terms of T1's commit timestamp and T2's read timestamp because SI already implies that a txn can't commit before it starts: bsky.app/profile/bene...)

1 year ago 2 1 1 0

This is so cool to see!

Out of curiosity, which sort of features were hard to test via SQL and/or PL/pgSQL? I would have naively assumed that all observable behavior of PostgreSQL was observable via the SQL interface. 😅

1 year ago 1 0 1 0