π C-Blosc2 3.0.0 RC1 is out!
What's new?
π¦ VL-blocks for irregular data (strings, JSON)
ποΈ LZ4/LZ4HC dictionary compression
π‘οΈ Important safety fixes
Check out the full release notes here: github.com/Blosc/c-blos...
Give it a spin and provide feedback!
#Blosc2 #OpenSource #C
π Our #PyDataGlobal 2025 tutorial on modern #Blosc2 & #Caterva2 features is out!
Learn how compression boosts array performance & enables cloud computing without downloads. βοΈ
Watch here: π www.youtube.com/watch?v=tUvS...
#Python #HPC #DataScience #BigData
π€ Can a library that computes on compressed data actually outperform performance heavyweights like NumPy and NumExpr?
Find some surprising answers π www.blosc.org/posts/roofli...
During the walk, I'm also introducing a funny anecdote back when NumPy was a πΆ
Enjoy!
#Blosc2 #HPC #NumPy #Numexpr
Data browsing and computing right in your phone.
I've built a demo Jupyter notebook that connects to a Cat2Cloud server from an Android phone and slices into an 8 TB dataset, downloading a 1 MB chunk in under 100 milliseconds. β‘
Interact with your vasts remote datasets right in your phone!
cat2.cloud/demo/roots/@...
#Cat2Cloud #Blosc2 #BigData
πIronPill 2π
See how Blosc2 powers heavy-duty linear algebra (100GB!) workflows
β‘1.5-2x faster than PyTorch + h5py!
π§± optimised chunking for your cache hierarchy
π one line syntax ππππππΈ.ππππππ(π°, π±, πππππππ='πππ.ππΈππ')
See blog here: ironarray.io/blog/la-blosc
#Blosc2
#Data
#LinearAlgebra
π Want to boost performance in your complex calculations? Avoiding temporary arrays is critical!
Check out our new video π¬ to see how #Blosc2 eliminates these temporariesβ‘
Compress Better, Compute Larger
π IronPill 1π
Blosc2 calculates Fourier approximations:
β‘5x faster than NumPy
π£ fraction of the memory footprint
π one-line syntax πππ(πβ*βπππ(π)β+βπβ*βπππ(π),βππ‘ππ=π·)
Full notebook: github.com/Blosc/python...
(inspiration: towardsdatascience.com/numexpr-the-...)
#Blosc2
#Data
#SignalProcessing
Inspired by the ESA's Gaia Milky Way map π!
At SciPy 2023, we showcased visualizing a massive 7.3 TB Gaia dataset on a standard laptop by compressing it to 1.8 GB using #Blosc2. Big data analysis for everyone! π
Dataset: digistorage.es/links/652403...
Paper: www.blosc.org/docs/Explori...
#napari
π£οΈ Announcing Python-Blosc2 3.8.0 π
A step closer to compliance with the array-api standard: data-apis.org/array-api!
This is an effort across all array-based libraries so that your code works (e.g. for both blosc2 and NumPy) by simply changing the import statement below!
#arrayapi #Blosc2
Struggling to get performant code from LLMs? π€
We can help! πͺ We've spent countless hours profiling and micro-benchmarking Blosc2 for you. A thing that LLMs cannot do (yet).
Ready to dive in? π
πΉ EuroSciPy Talk: www.blosc.org/docs/2025-Eu...
πΉ Tutorial: github.com/Blosc/EuroSc...
#HPC #Blosc2
π£Great to see the community building powerful tools on Blosc2! π
Check out compress-image: a new C++/Python library for working with compressed images directly in memory.
Kudos to Emil Dohne for this fantastic work!
Project here: github.com/EmilDohne/co...
#Blosc2 #FOSS #ImageProcessing
π We're thrilled to announce *TreeStore*, a new class in Python-Blosc2! Endow your datasets with a hierarchical structure! β‘οΈ
π We've blogged about it:
www.blosc.org/posts/new-tr...
It's in beta, and available in Python-Blosc2 v3.7.2. Enjoy!
#Python #Blosc2 #TreeStore #DataScience #OpenSource
While we've poured lots of β€οΈ into #Blosc2 internal algorithms, every use case is unique. No worries! You can always tweak the internal knobs to get the best performance for your needs πͺ
New Blosc2 fancy indexing performance against other competitors, including NumPy. Blosc2 is quite efficient.
π£οΈ Announcing Python-Blosc2 3.6.1
Β‘Unlock new levels of data manipulation with #Blosc2! π
We've tamed the complexity of fancy indexing to make it intuitive, efficient, and consistent with NumPy's behavior.
π www.blosc.org/posts/blosc2...
Enjoy
#Python #DataScience #BigData #NumPy #Performance
Tired of trial-and-error with #Blosc2 compression?
You can do better with #Btune! π
Btune can be trained to automatically find the best parameters for your specific datasets and use case.
Learn more about optimizing your compression workflow: ironarray.io/btune
#DataCompression #Optimization
Mixing #JupyterLite, #WebAssembly and #Cat2Cloud allows incredible speed and flexibility in accessing large remote data silos, both in #Blosc2 and #HDF5 formats β‘
Accessing remote datasets in less than 100 ms
Thanks to the advanced double partitioning techniques in #Blosc2, our #Caterva2 package can serve small slices of big datasets (3.8 GB) through internet in less than the blink of an eye.
Try it out by yourself! π cat2.cloud/demo/roots/@...
Share Data Faster!
#BigData #RemoteAccess #Performance
#Blosc2 now runs directly in your browser! Leveraging the power of #WASM, #Pyodide, and #JupyterLite, you can harness efficient, adaptable compression through the web's universal interface.
Compress Better, Compute Bigger, Share Faster
#WebAssembly #DataCompression #WebDevelopment #DataScience
Struggling with slow I/O in #HDF5? Try #Blosc2 as a filter or I/O data handler β faster data, less pain!
π www.blosc.org/posts/pytabl...
#Performance #DataScience
π’ We are pleased to announce the integration of a new stack feature in #Blosc2 π, which allows for stacking large arrays along a new axis.
We've updated our recent blog post:
Check it out! π
www.blosc.org/posts/blosc2...
Compress Better, Compute Bigger
#Python #DataScience #Performance #OpenSource
In this chapter, our Luke Shaw explains how filters work in Caterva2, a free software that you can run on your own laptop (or server, if you are more serious about large datasets). It works for either #Blosc2 or #HDF5 datasets.
Have it a try!
#WebGUI #DataAnalysis #LargeData
π Excited to share more about Caterva2, your ultimate gateway to #Blosc2 / #HDF5 repositories! π
#Caterva2 is designed to revolutionize how you interact with large datasets. Want to see it in action? π€ See our new video showcasing Caterva2's main functionalities! π¬
ironarray.io/caterva2
Enjoy!
#Python-Blosc2 is hitting 1 million weekly downloads on PyPI! π
Users are rapidly adopting #Blosc2, which now accounts for over 95% of downloads compared to Blosc1. π Thanks to our amazing users. π π
Our motto: Compress Better, Compute Bigger! πͺ
#Milestone #CommunitySupport #DataCompression
π For over a year, the #Blosc2 plugin for #HDF5 has empowered users to bypass the HDF5 pipeline, proving its production-ready capabilities! π
#Performance #Compression
π‘ Did you know you can supercharge your #HDF5 datasets with #Blosc2? π
Leverage hdf5plugin (hdf5plugin.readthedocs.io) to integrate Blosc2 as a filter within HDF5. Create and read data using popular Python wrappers like h5py or PyTables, while achieving excellent performance! π¨
Compress Better!
π Meet b2h5py: Supercharge your HDF5 data access! β‘οΈ
Unlock faster reading of Blosc2-compressed HDF5 datasets. Achieve up to 2x-5x speed-ups in reading compared to standard methods! π¨ github.com/Blosc/b2h5py
Compress better!
#HDF5 #DataAccess #Blosc2 #DataScience #Optimization #Performance
Python-Blosc2 3.3.3 is live! β¨ Bug fixes, smoother string lazy expressions, and Windows mmap improvements thanks to the latest C-Blosc2.
Upgrade now: pip install blosc2 --update
Learn more: github.com/Blosc/python...
#Python #Blosc2 #DataScience
Compress Better, Compute Larger
Python-Blosc2 3.3.3 is live! β¨ Bug fixes, smoother string lazy expressions, and Windows mmap improvements thanks to the latest C-Blosc2.
Upgrade now: pip install blosc2 --update
Learn more: github.com/Blosc/python...
#Python #Blosc2 #DataScience
π Just published my blog post as part of the @blosc.org team:
Make NDArrays Transposition Fast (and Compressed!) in #Blosc2 π
I dive into optimizing matrix transposition using smart chunk layouts β and how Blosc2 can rival NumPy's speed with compression included.
Compress Better, Compute Faster π
Performance of Blosc2 transposition depending on the chunk size
π£οΈNew blog entry of Ricardo Sales Piquer, our intern working in linear algebra problems:
Make NDArrays Transposition Fast (and Compressed!) in #Blosc2 π
www.blosc.org/posts/optimi...
Great work Ricardo! π
#Compression #LinearAlgebra #Optimization #DataScience
Compress Better, Compute Faster π