Advertisement Β· 728 Γ— 90
#
Hashtag
#Blosc2
Advertisement Β· 728 Γ— 90

πŸš€ C-Blosc2 3.0.0 RC1 is out!

What's new?

πŸ“¦ VL-blocks for irregular data (strings, JSON)

πŸ—œοΈ LZ4/LZ4HC dictionary compression

πŸ›‘οΈ Important safety fixes

Check out the full release notes here: github.com/Blosc/c-blos...

Give it a spin and provide feedback!

#Blosc2 #OpenSource #C

3 2 0 0
Francesc Alted+Luke Shaw - Hands-on with Blosc2: Accelerating Your Python Data Workflows-PyData 2025
Francesc Alted+Luke Shaw - Hands-on with Blosc2: Accelerating Your Python Data Workflows-PyData 2025 YouTube video by PyData

πŸš€ Our #PyDataGlobal 2025 tutorial on modern #Blosc2 & #Caterva2 features is out!

Learn how compression boosts array performance & enables cloud computing without downloads. ☁️

Watch here: πŸ‘‡ www.youtube.com/watch?v=tUvS...

#Python #HPC #DataScience #BigData

0 2 0 0
Post image Post image Post image Post image

πŸ€” Can a library that computes on compressed data actually outperform performance heavyweights like NumPy and NumExpr?

Find some surprising answers πŸ‘‰ www.blosc.org/posts/roofli...

During the walk, I'm also introducing a funny anecdote back when NumPy was a πŸ‘Ά

Enjoy!

#Blosc2 #HPC #NumPy #Numexpr

7 3 1 0
Data browsing and computing right in your phone.

Data browsing and computing right in your phone.

Post image Post image

I've built a demo Jupyter notebook that connects to a Cat2Cloud server from an Android phone and slices into an 8 TB dataset, downloading a 1 MB chunk in under 100 milliseconds. ⚑

Interact with your vasts remote datasets right in your phone!

cat2.cloud/demo/roots/@...

#Cat2Cloud #Blosc2 #BigData

0 0 0 0
Video

πŸ’ŠIronPill 2πŸ’Š
See how Blosc2 powers heavy-duty linear algebra (100GB!) workflows
⚑1.5-2x faster than PyTorch + h5py!
🧱 optimised chunking for your cache hierarchy
🐍 one line syntax πš‹πš•πš˜πšœπšŒπŸΈ.πš–πšŠπšπš–πšžπš•(𝙰, 𝙱, πšžπš›πš•πš™πšŠπšπš‘='𝚘𝚞𝚝.πš‹πŸΈπš—πš')

See blog here: ironarray.io/blog/la-blosc

#Blosc2
#Data
#LinearAlgebra

4 1 0 1

πŸš€ Want to boost performance in your complex calculations? Avoiding temporary arrays is critical!

Check out our new video 🎬 to see how #Blosc2 eliminates these temporaries⚑

Compress Better, Compute Larger

0 0 0 0
Video

πŸ’Š IronPill 1πŸ’Š
Blosc2 calculates Fourier approximations:
⚑5x faster than NumPy
πŸ‘£ fraction of the memory footprint
🐍 one-line syntax πšœπšžπš–(πšŠβ€‚*β€‚πšŒπš˜πšœ(𝚝) +β€‚πš‹β€‚*β€‚πšœπš’πš—(𝚝),β€‚πšŠπš‘πš’πšœ=𝟷)

Full notebook: github.com/Blosc/python...

(inspiration: towardsdatascience.com/numexpr-the-...)

#Blosc2
#Data
#SignalProcessing

4 2 0 1
Video

Inspired by the ESA's Gaia Milky Way map 🌌!

At SciPy 2023, we showcased visualizing a massive 7.3 TB Gaia dataset on a standard laptop by compressing it to 1.8 GB using #Blosc2. Big data analysis for everyone! πŸš€

Dataset: digistorage.es/links/652403...
Paper: www.blosc.org/docs/Explori...

#napari

2 0 0 1
Post image

πŸ—£οΈ Announcing Python-Blosc2 3.8.0 πŸš€

A step closer to compliance with the array-api standard: data-apis.org/array-api!
This is an effort across all array-based libraries so that your code works (e.g. for both blosc2 and NumPy) by simply changing the import statement below!

#arrayapi #Blosc2

5 3 1 1
Post image

Struggling to get performant code from LLMs? πŸ€”

We can help! πŸ’ͺ We've spent countless hours profiling and micro-benchmarking Blosc2 for you. A thing that LLMs cannot do (yet).

Ready to dive in? πŸš€
πŸ”Ή EuroSciPy Talk: www.blosc.org/docs/2025-Eu...
πŸ”Ή Tutorial: github.com/Blosc/EuroSc...

#HPC #Blosc2

3 1 0 1
Preview
GitHub - EmilDohne/compressed-image: Fast and memory-efficient, compressed in-memory image storage for C++ and Python. Fast and memory-efficient, compressed in-memory image storage for C++ and Python. - EmilDohne/compressed-image

πŸ“£Great to see the community building powerful tools on Blosc2! πŸš€

Check out compress-image: a new C++/Python library for working with compressed images directly in memory.

Kudos to Emil Dohne for this fantastic work!

Project here: github.com/EmilDohne/co...

#Blosc2 #FOSS #ImageProcessing

0 0 0 1
Post image

πŸš€ We're thrilled to announce *TreeStore*, a new class in Python-Blosc2! Endow your datasets with a hierarchical structure! ⚑️

πŸ“ We've blogged about it:
www.blosc.org/posts/new-tr...

It's in beta, and available in Python-Blosc2 v3.7.2. Enjoy!

#Python #Blosc2 #TreeStore #DataScience #OpenSource

3 2 0 1

While we've poured lots of ❀️ into #Blosc2 internal algorithms, every use case is unique. No worries! You can always tweak the internal knobs to get the best performance for your needs πŸ’ͺ

0 0 0 0
New Blosc2 fancy indexing performance against other competitors, including NumPy. Blosc2 is quite efficient.

New Blosc2 fancy indexing performance against other competitors, including NumPy. Blosc2 is quite efficient.

πŸ—£οΈ Announcing Python-Blosc2 3.6.1

Β‘Unlock new levels of data manipulation with #Blosc2! πŸš€
We've tamed the complexity of fancy indexing to make it intuitive, efficient, and consistent with NumPy's behavior.

πŸ‘‰ www.blosc.org/posts/blosc2...

Enjoy

#Python #DataScience #BigData #NumPy #Performance

1 0 0 1
Post image

Tired of trial-and-error with #Blosc2 compression?

You can do better with #Btune! πŸš€

Btune can be trained to automatically find the best parameters for your specific datasets and use case.

Learn more about optimizing your compression workflow: ironarray.io/btune

#DataCompression #Optimization

0 0 0 1

Mixing #JupyterLite, #WebAssembly and #Cat2Cloud allows incredible speed and flexibility in accessing large remote data silos, both in #Blosc2 and #HDF5 formats ⚑

1 0 0 0
Accessing remote datasets in less than 100 ms

Accessing remote datasets in less than 100 ms

Thanks to the advanced double partitioning techniques in #Blosc2, our #Caterva2 package can serve small slices of big datasets (3.8 GB) through internet in less than the blink of an eye.

Try it out by yourself! πŸ‘‰ cat2.cloud/demo/roots/@...

Share Data Faster!

#BigData #RemoteAccess #Performance

1 0 0 1
Post image

#Blosc2 now runs directly in your browser! Leveraging the power of #WASM, #Pyodide, and #JupyterLite, you can harness efficient, adaptable compression through the web's universal interface.

Compress Better, Compute Bigger, Share Faster

#WebAssembly #DataCompression #WebDevelopment #DataScience

3 1 0 1
Post image

Struggling with slow I/O in #HDF5? Try #Blosc2 as a filter or I/O data handler β€” faster data, less pain!

πŸ‘‰ www.blosc.org/posts/pytabl...

#Performance #DataScience

1 0 0 0
Post image

πŸ“’ We are pleased to announce the integration of a new stack feature in #Blosc2 πŸš€, which allows for stacking large arrays along a new axis.

We've updated our recent blog post:
Check it out! πŸ‘‡
www.blosc.org/posts/blosc2...

Compress Better, Compute Bigger
#Python #DataScience #Performance #OpenSource

4 1 0 0

In this chapter, our Luke Shaw explains how filters work in Caterva2, a free software that you can run on your own laptop (or server, if you are more serious about large datasets). It works for either #Blosc2 or #HDF5 datasets.

Have it a try!

#WebGUI #DataAnalysis #LargeData

2 0 0 0
Post image

πŸš€ Excited to share more about Caterva2, your ultimate gateway to #Blosc2 / #HDF5 repositories! πŸš€

#Caterva2 is designed to revolutionize how you interact with large datasets. Want to see it in action? πŸ€” See our new video showcasing Caterva2's main functionalities! 🎬

ironarray.io/caterva2

Enjoy!

0 0 0 1
Post image

#Python-Blosc2 is hitting 1 million weekly downloads on PyPI! πŸŽ‰

Users are rapidly adopting #Blosc2, which now accounts for over 95% of downloads compared to Blosc1. πŸ“ˆ Thanks to our amazing users. πŸ™ πŸš€

Our motto: Compress Better, Compute Bigger! πŸ’ͺ

#Milestone #CommunitySupport #DataCompression

1 0 0 1

πŸš€ For over a year, the #Blosc2 plugin for #HDF5 has empowered users to bypass the HDF5 pipeline, proving its production-ready capabilities! πŸŽ‰

#Performance #Compression

1 0 0 0
Post image

πŸ’‘ Did you know you can supercharge your #HDF5 datasets with #Blosc2? πŸš€

Leverage hdf5plugin (hdf5plugin.readthedocs.io) to integrate Blosc2 as a filter within HDF5. Create and read data using popular Python wrappers like h5py or PyTables, while achieving excellent performance! πŸ’¨

Compress Better!

0 0 1 1
Preview
GitHub - Blosc/b2h5py: Transparent optimized reading of n-dimensional Blosc2 slices for h5py Transparent optimized reading of n-dimensional Blosc2 slices for h5py - Blosc/b2h5py

πŸš€ Meet b2h5py: Supercharge your HDF5 data access! ⚑️

Unlock faster reading of Blosc2-compressed HDF5 datasets. Achieve up to 2x-5x speed-ups in reading compared to standard methods! πŸ’¨ github.com/Blosc/b2h5py

Compress better!

#HDF5 #DataAccess #Blosc2 #DataScience #Optimization #Performance

1 0 0 1
Preview
Releases Β· Blosc/python-blosc2 A high-performance library for compressed ndarrays, with a flexible computational engine - Blosc/python-blosc2

Python-Blosc2 3.3.3 is live! ✨ Bug fixes, smoother string lazy expressions, and Windows mmap improvements thanks to the latest C-Blosc2.

Upgrade now: pip install blosc2 --update

Learn more: github.com/Blosc/python...

#Python #Blosc2 #DataScience

Compress Better, Compute Larger

2 1 0 0
Preview
Releases Β· Blosc/python-blosc2 A high-performance library for compressed ndarrays, with a flexible computational engine - Blosc/python-blosc2

Python-Blosc2 3.3.3 is live! ✨ Bug fixes, smoother string lazy expressions, and Windows mmap improvements thanks to the latest C-Blosc2.

Upgrade now: pip install blosc2 --update

Learn more: github.com/Blosc/python...

#Python #Blosc2 #DataScience

1 0 0 0

πŸ“ Just published my blog post as part of the @blosc.org team:

Make NDArrays Transposition Fast (and Compressed!) in #Blosc2 πŸš€
I dive into optimizing matrix transposition using smart chunk layouts β€” and how Blosc2 can rival NumPy's speed with compression included.

Compress Better, Compute Faster πŸ˜„

1 1 0 0
Performance of Blosc2 transposition depending on the chunk size

Performance of Blosc2 transposition depending on the chunk size

πŸ—£οΈNew blog entry of Ricardo Sales Piquer, our intern working in linear algebra problems:

Make NDArrays Transposition Fast (and Compressed!) in #Blosc2 πŸš€

www.blosc.org/posts/optimi...

Great work Ricardo! πŸŽ‰

#Compression #LinearAlgebra #Optimization #DataScience

Compress Better, Compute Faster πŸ˜€

2 3 0 1