"Arrow has the intricacy of a fine Swiss watch." The co-creator of Apache Arrow (@wesmckinney.com) on why AI agents cannot replicate decade-long infrastructure design.
#ApacheArrow #DataRenegades
If you keep hearing about Apache Arrow, but never quite got what it really is about, check out my blog post. I did a deep dive on Apache Arrow and wrote an educational introduction: thingsworthsharing.dev/arrow/
#dataengineering #apachearrow #softwareengineering
Wes McKinney built pandas in a mouse-infested NYC apartment on founder hours. Now he runs parallel Claude Code sessions and says AI is forcing "radical accountability" on every software vendor shipping mediocre products.
Full conversation: youtu.be/Uso8-yaERkE
#DataRenegades #pandas #ApacheArrow
I finally got around to writing my first ADBC driver and it doesn't do anything (and that's the point!): amoeba.github.io/tiniest-adbc...
#apachearrow
image for R Consortium webinar: Scaling Up Data Analysis in R with Arrow
R Consortium webinar: Scale up data analysis in R with Arrow—fast, memory-efficient analytics without a DB or cluster. With Dr Nic Crane (Arrow R maintainer, Apache Arrow PMC). Register:
Don't miss it!
r-consortium.org/webinars/sca...
#rstats #apachearrow
We're excited to announce the release of {arrow} 23.0.0 🏹📦
Here's a roundup of the new features and changes in a 🧵
Full details can be found at arrow.apache.org/docs/r/news/
#rstats #apachearrow
📢 Rerun is hiring a Software Engineer (Rust) - Backend
Salary: $130K - $225K
Locations: 🇺🇸 East Coast - United States (Remote), 🇪🇺 EU (Remote)
#ai #rustlang #aws #gcp #azure #apachearrow #apachedatafusion
www.remotehiro.com/jobs/softwar...
It's nice to see people bringing up ADBC in conversations like this one: www.reddit.com/r/dataengine... #apachearrow
⚙️ Optimiza UDFs en #Python para Arrow en Spark
✳️ El uso de UDFs en PySpark ha sido una solución flexible pero ineficiente
✳️ Desde #ApacheSpark 3.5, la integración con #ApacheArrow ha supuesto una mejora significativa de rendimiento
➡️ blog.damavis.com/como-optimiz...
#Spark #Arrow
Was anyone else like me, wondering if you can use ADBC with $5 Postgres from @planetscale.com? Well, you can! (No surprise)
I wrote up my test at brycemecum.com/2025/11/15/a...
#apachearrow #adbc
Two More Days till Subsuface NYC!
Register at Dremio.com/subsurface
#DataLakehouse #NYC #ApacheIceberg #ApachePolaris #ApacheArrow
REGISTER FOR SUBSURFACE (NOV 6 in San Jose, NOV 13 in NYC)
Register here:
#DataLakehouse #ApacheIceberg #ApachePolaris #ApacheArrow
REGISTER FOR SUBSURFACE (NOV 6 in San Jose, NOV 13 in NYC)
Register here:
#DataLakehouse #ApacheIceberg #ApachePolaris #ApacheArrow
REGISTER FOR SUBSURFACE (NOV 6 in San Jose, NOV 13 in NYC)
Register here:
#DataLakehouse #ApacheIceberg #ApachePolaris #ApacheArrow
REGISTER FOR SUBSURFACE (NOV 6 in San Jose, NOV 13 in NYC)
Register here:
#DataLakehouse #ApacheIceberg #ApachePolaris #ApacheArrow
I may have fallen down the rabbit hole here but...
#apacheArrow had my curiosity but now had my attention...
And I'm building wal2arrow for #postgres and #mysql...
It's kind of awesome not to have json /avro around in the hot paths.
I updated my @val.town for my daily GitHub repo email digest so it can handle multiple repos: www.val.town/x/amoeba/git.... I like the new, more condensed view. #apachearrow
Check out @andrewlamb1111.bsky.social 's talk at the recent Iceberg meetup for a condensed overview of the the new Variant type coming to Parquet #apacheparquet #apachearrow
Ruby on Rails (ActiveRecord) now supports ADBC with a new adapter written by Sutou Kouhei. Check out the blog post (in Japanese): www.clear-code.com/blog/2025/8/.... The gem is available at rubygems.org/gems/activer.... #apachearrow #rubyonrails
Centralized storage, decentralized compute: maximizing data use (OLAP) while minimizing ETL and disparate versions of data.
#hotTake #dataEngineering #dataAnalytics #dataScience #machineLearning #ai #cloudComputing #s3 #apacheArrow #apacheParquet #apacheIceberg
Looking forward to attending this! #apachearrow
Compile-time Arrow schemas for Rust. Looks pretty nice for use cases where the data schema is well-defined, i.e. either internal to an app, or part of a client/server protocol. #RustLang #ApacheArrow
PyArrow 21 was a great release, especially for @hf.co users: PyArrow now seamlessly handles hf:// URIs and does content-defined chunking to reduce transfer and storage costs on HF. Check out this blog post: huggingface.co/blog/parquet... #apachearrow #apacheparquet
New post up on the Arrow blog about some of the recent improvements to the embedded query engine inside Arrow C++: arrow.apache.org/blog/2025/07... #apachearrow
@duckdb.org just merged a PR for a new, simpler, and -- mostly importantly -- non-deprecated @arrow.apache.org C API: github.com/duckdb/duckd.... #apachearrow #duckdb
Next Tuesday, get ready to meet the mind behind #Pandas & #ApacheArrow!
@wesmckinney.com shares his origin story (Part 1) on #TheTestSet. From speedruns to shaping the data stack, this is one you won't want to miss.
Mark your calendar for Tuesday & subscribe at thetestset.co!
#DataScience #Python
This update (v0.4.x) provides complete #ApacheArrow data models for 11 file formats and counting, including the GA4GH/htslib formats and UCSC’s BigWig/BigBed.
This update (v0.4.x) provides complete #ApacheArrow data models for 11 file formats and counting, including the GA4GH/htslib formats and UCSC’s BigWig/BigBed.
This is an exciting release! The Swift implementation of Arrow has been split off into its own repo which means we can now publish it on the @SwiftPackageIndex.mas.to.ap.brid.gy: swiftpackageindex.com/apache/arrow.... #apachearrow #swift #swiftlang
It’s true! #apachearrow