We watched data teams waste years building the same ingestion pipelines for weather data.
So we fixed it. Zero ingestion. High velocity. Cloud-native.
Proud to launch with incredible partners like
@brightband.com, #zeusai, and dynamical.org. Give it a spin! ๐
Posts by Joe Hamman
Wildfire season may be abating, but our need to understand future scenarios has not: Come join CTO
@jhamman.bsky.social Thursday morning for a workshop: Predicting cloud native wildfire risk with the Earthmover platform: m.earthmover.io/predictingwi...
Meet @rabernat.bsky.social, @jhamman.bsky.social and more of the Earthmover team at our #ClimateWeekNYC events. PSA: the hands-on tech workshop Wednesday and panel on Thursday still have capacity, but are filling up fast.
Super excited to see #icechunk v1.0 ship today. Stable format, stable API, and ready for production. Take it for a spin and let us know how it goes. ๐๐๐
Just back from #CNG2025, I wrote up some thoughts on the traction the @zarr.dev format is gaining in the broader geospatial industry (specifically Earth Observation). I also summarize what features and tools the user and developer communities are asking for to fully leverage Zarr. ๐๐๐
I'll be at the CNG conference in Snowbird next week. I wrote a short blog post about what the Earthmover team will be up to.
tldr; we'll be talking about @zarr.dev, #icechunk, @xarray.bsky.social and cloud-native data cubes.
Details in the blog post ๐
Most people think of @zarr.dev as a "file format". With #Icechunk, we've turned Zarr into a database. @functionth.bsky.social's post shows how Icechunk can be used to solve a problem where transactional databases are often required.
1/ ๐จ New Blog Post Alert: "๐ฟ๐๐๐๐๐๐๐ ๐ด๐๐๐ข๐ก ๐ผ๐๐๐โ๐ข๐๐ ๐ถ๐๐๐ ๐๐ ๐ก๐๐๐๐ฆ ๐ค๐๐กโ ๐ ๐ถ๐๐๐โ๐ฬ๐ ๐๐ข๐ก ๐ผ๐๐ ๐ก๐๐ข๐๐ก๐๐ฃ๐ ๐ธ๐ฅ๐๐๐๐๐" ๐ฆ๐
๐ Read it here: earthmover.io/blog/learnin...
There's still time to sign up for tomorrow's webinar on โกFlux โก
๐
share.hsforms.com/1FLrE9CpwSLq...
@zarr.dev and #icechunk are amazing but they are not magic. They are part of a thoughtfully designed cloud-native data architecture. @tegnicholas.bsky.social peels back the covers on cloud-optimized scientific data formats in our latest "Fundamentals" post ๐
We found similar results when we first benchmarked #icechunk. Our conclusion: doing IO with a Rust backend is much faster than Python.
๐Really exciting to see @kylebarron.dev's Obstore backend for Zarr-Python ship today.
๐ More details: earthmover.io/blog/announc...
๐
Webinar April 22: share.hsforms.com/1FLrE9CpwSLq...
๐ฌ DMs open โ happy to chat or schedule a demo.
No more bespoke APIs. No glue code.
Just fast, interoperable access via tools you already use โ QGIS, ArcGIS, web maps, Python, R, MATLAB, and more.
Flux is a cheat code for building data products.
Flux serves data directly from Icechunk via standard APIs like EDR, WMS, and OPeNDAP.
Itโs stateless, scalable, and cloud-native.
From raw data to production-grade API endpoints in seconds.
We built Flux to solve a core pain point: turning massive geospatial datasets into usable APIs was slow, brittle, and distracting.
Flux makes it trivial for any data producer to become a data provider.
๐ Big launch today: introducing Flux โ a game-changing way to serve geospatial data via standard APIs in seconds. Built for scale, speed, and simplicity.
Training AI models at scale from data stored in cloud object storage requires thinking carefully about both bandwidth and concurrency. In this post, @functionth.bsky.social getโs into the details of concurrent reads at scale, showing how #Icechunk and S3 can easily scale beyond 200k requests/second!
๐ฃย Blog post alert! ๐๐ฑ๐ฉ๐ฅ๐จ๐ซ๐ข๐ง๐ ๐๐๐๐๐ก๐ฎ๐ง๐ค ๐ฌ๐๐๐ฅ๐๐๐ข๐ฅ๐ข๐ญ๐ฒ: ๐ฎ๐ง๐ญ๐๐ง๐ ๐ฅ๐ข๐ง๐ ๐๐'๐ฌ ๐ฉ๐ซ๐๐๐ข๐ฑ ๐ฌ๐ญ๐จ๐ซ๐ฒ. This technical post by @functionth.bsky.social dives deep into the internals of how S3 shards data, showing that distributed Icechunk can easily perform 230,000 object reads/sec and beyond. earthmover.io/blog/explori...
Weโll look into it!
We often see folks try to convince tabular data tools to perform well with multi-dimensional array data. This post by @rabernat.bsky.social explains, from first principles, why this rarely works. Its a good one! ๐๐๐
Super exciting to see @copernicusecmwf.bsky.social starting to adopt @zarr.dev!
๐จ New blog post ๐จ
In it, we show off our recent work deploying #icechunk on top of #NASA's existing archives of Earth observation data. The results: 100x speed up when extracting time series from existing datasets stored as netCDF.
This session is going to be a blast! If you are headed to CNG next month (and you should be!), consider joining us for this workshop on @xarray.bsky.social , @zarr.dev , and #icechunk. ๐๐๐
1/ As a data platform company, we get a unique view of how other companies use data. In light of the recent cuts at @noaa.gov, our CEO, @rabernat.bsky.social, reached out to our customers to get their perspective on the value of NOAAโs data and services.
I'll be giving a short update tomorrow on the state of Zarr-Python following the 3.0 release back in January. Swing by the Pangeo showcase to catch the latest progress.
@zarr.dev
Denver folks โ join us for a Geospatial Happy Hour on Wednesday, March 12th at 7 PM MST. Come enjoy drinks, tacos, discussing geospatial data, and meeting the Earthmover team. Register: lu.ma/o6r39p6j
We now have a public slack channel for VirtualiZarr development and questions, feel free to join!
earthmover-community.slack.com/archives/C08...
Very, very stoked to have @tegnicholas.bsky.social join the Earthmover team!
I'm curious what sort of issues you are seeing. I'm able to open datasets on s3 without any problems.