I'll be speaking at PostgresConf San Jose tomorrow (April 21). Short notice, but here you go: postgresconf.org/conferences/...
Posts by Philippe Noël
I'm thrilled about this case study coming out. The Modern Treasury team have been an incredible design partner from the early days.
They had very strict requirements around data freshness, and we're honoured to be the database powering many of their user-facing dashboards and endpoints.
3/3. I'm pretty excited for this feature to come out. Not only do we have many customers who use Citus, but we're also backed by the founders Umur and Ozgun, who are now the team behind Ubicloud.
If you use Citus, go give it a try and let us know how it goes.
2/3. Citus transforms PostgreSQL into a distributed database with horizontal sharding. ParadeDB is fully compatible with Citus, enabling distributed full-text search across sharded tables.
You can read more on how to enable it here: docs.paradedb.com/deploy/citus
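For anyone wondering what distributed search over shards looks like conceptually, here is a minimal Python sketch of the general scatter-gather pattern: each shard returns its own top K, and a coordinator merges them. This is an illustration only, not how Citus or ParadeDB actually executes queries. The function names and the `(doc_id, score)` tuples are hypothetical, and it assumes scores are comparable across shards.

```python
import heapq

def local_top_k(shard_docs, k):
    """Return the k highest-scoring (doc_id, score) pairs from one shard."""
    return heapq.nlargest(k, shard_docs, key=lambda d: d[1])

def distributed_top_k(shards, k):
    """Scatter the query to every shard, then merge the partial results.

    Any document in the global top k must also be in its own shard's
    top k, so merging the per-shard top-k lists is sufficient.
    """
    candidates = []
    for shard in shards:
        candidates.extend(local_top_k(shard, k))
    return heapq.nlargest(k, candidates, key=lambda d: d[1])

# Hypothetical data: three shards holding (doc_id, score) pairs.
shards = [
    [("a", 1.0), ("b", 3.0)],
    [("c", 2.5), ("d", 0.5)],
    [("e", 2.9)],
]
top2 = distributed_top_k(shards, 2)
```

The coordinator only ever sees at most `k` rows per shard, which is what keeps the merge step cheap as the number of shards grows.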
1/3. A frequent question I get about ParadeDB is "Can it support distributed search?" Elasticsearch is sharded, after all.
Starting today, the answer is yes: distributed search is now possible in Postgres thanks to ParadeDB x Citus.
What shocked me the most in the AI wave so far is seeing many smart people start to advocate for quantity over quality.
In a world where so many can "make", quality is so clearly the defining factor. In fact, it seems obvious that AI will raise the bar for quality, not lower it.
The ParadeDB Community Slack has just reached 1,000 members! The most unexpected part is certainly the growth in external contributors, which has been really wonderful to witness. We are deeply thankful to everyone who contributed.
Stu's talk is now available for anyone who couldn't attend South Bay Systems to watch: youtu.be/TeFsBVIYBis?...
I highly recommend watching it. It's ~30 minutes and goes into details on how ParadeDB achieves faster-than-Postgres JOINs.
4/4. Open source is the foundation of all software, and Paul and team really built a foundation that reaches deep. I personally found his story incredibly interesting, and am proud that ParadeDB is a small part of the Tantivy project. I highly recommend a read.
3/4. While we work closer to the project than most, we're not the only ones to have integrated Tantivy. Aside from Paul's own startup (now at Datadog) and ParadeDB, I have met dozens of companies and projects that have integrated Tantivy to power full-text search in their product.
2/4. For people not familiar with Tantivy, it is a popular full-text search library, written entirely in Rust, and inspired by Apache Lucene (the search library behind Elasticsearch). Alongside Postgres, it is a core dependency of ParadeDB.
1/4. Today, a bit of a different blog post. James from our team sat down with @fulmicoton.bsky.social, the main author of Tantivy, to discuss his journey and how he created such a world-changing project. 🧵
The Terrapin team has been an incredible partner. Read up on how they power 25X faster bond search with ParadeDB.
Happening tonight!
For the people who love to ask "how does this work?", Mithun spoke at PGConf India about ParadeDB's architecture. Highly recommend a watch: www.youtube.com/watch?v=gh68...
ParadeDB is hiring more engineers to work on DB internals. We're headed towards v1.0 later this year, and have some really exciting projects coming up.
There's still so much to build in Postgres. The technical bar is very high, and you'll get a lot of ownership. DMs open
I'm incredibly excited for this talk. Top K queries are the backbone of search engines, which have historically been NoSQL databases unable to execute JOINs.
We're fixing this with ParadeDB. Highly recommend attending, I promise you'll learn something!
Excited for this blog to come out! Local development is key to developer velocity. I've personally used LocalStack before and am thrilled to partner with them to make it easy to develop with LocalStack + ParadeDB natively.
Our first integration, Django, is in the Django Newsletter this week.
ParadeDB is now officially available from the Python / Django ecosystem via `pip install django-paradedb`
Source: github.com/paradedb/dja...
Highly highly recommend attending if you're in the Vancouver area (or want to travel to it!). Stu will be going deep on how ParadeDB is architected from the ground up.
And just like that, 100 contributors to ParadeDB. Thank you to all who have contributed, and here's to 100 more!
I don't like advertising third-party products, but my testimonial is up on runs-on.com/testimonials/, and it's worth a look. @crohr has made something amazing.
We need an easy way to run benchmarks on bare-metal nodes via GitHub Actions, and nothing else compares. 12/10 product👌
4/4. I've long said that search engines and columnar databases have a lot in common... We'll likely write more about this in the future, but in the meantime, I recommend checking out the blog.
And of course don't hesitate to star the project: github.com/paradedb/par...
3/4. To accomplish this, we use two core technologies:
- Columnar storage, for fast, cache-friendly lookups
- Block WAND, which enables early pruning when doing BM25 scoring
Neither is native to Postgres, but we support both in ParadeDB's BM25 index thanks to Tantivy.
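To make the early-pruning idea concrete, here is a minimal Python sketch of block-level pruning for a Top K query. It is a simplified, single-list illustration of the idea behind Block WAND, not ParadeDB's or Tantivy's actual implementation. The function name, the `(doc_id, score)` layout, and the non-empty blocks are all assumptions made for the example.

```python
import heapq

def top_k_block_max(blocks, k):
    """Return (top-k results, number of blocks skipped).

    blocks: list of non-empty lists of (doc_id, score) pairs.
    A block is skipped when its maximum score cannot beat the
    current k-th best score, so none of its documents are scored.
    """
    # A real engine would store each block's max score at index time.
    block_max = [max(score for _, score in block) for block in blocks]

    heap = []      # min-heap of (score, doc_id); heap[0] is the k-th best
    skipped = 0
    for block, upper_bound in zip(blocks, block_max):
        if len(heap) == k and upper_bound <= heap[0][0]:
            skipped += 1   # early pruning: nothing here can enter the top k
            continue
        for doc_id, score in block:
            if len(heap) < k:
                heapq.heappush(heap, (score, doc_id))
            elif score > heap[0][0]:
                heapq.heapreplace(heap, (score, doc_id))

    ranked = sorted(heap, reverse=True)   # highest score first
    return [(doc_id, score) for score, doc_id in ranked], skipped

# Hypothetical data: three blocks of pre-scored documents.
blocks = [
    [(1, 0.9), (2, 0.8)],
    [(3, 0.1), (4, 0.2)],
    [(5, 0.95), (6, 0.3)],
]
results, skipped = top_k_block_max(blocks, 2)
```

In this example the second block is never scored, because its best document (0.2) cannot displace the current second-best result (0.8).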
2/4. Top K queries are the core workload of any search engine. Whether you use Google Search as a consumer or a complex B2B search engine in a SaaS product, the core goal of search is "find me the K most relevant results." That's Top K optimization.
1/4. The simplest questions are often the hardest to answer. In this technical blog, our CTO dives into how we optimized Top K queries in ParadeDB.
I *highly* recommend reading. But if you need some more convincing, here's why this matters 🧵
ParadeDB is at 99 contributors. Who will be the 100th? 👀
We now support Nix/NixOS thanks to the incredible work of Luc Perkins!