Advertisement · 728 × 90

Posts by Ritchie Vink

Preview
Release Polars Cloud client 0.5.0 · pola-rs/polars-cloud-client Highlights Launch of the Compute Dashboard This release marks the launch of the compute dashboard that is tied to the cluster directly. This allows for direct compute metrics and advanced query pr...

We shipped a major release of Polars Cloud

Live query profiles. See all data flowing through and exactly which nodes take up your compute.

Aside from that it also lands:

- Streaming shuffles
- Defaulting to our cost based planner
- Streaming and broadcasting ASOF joins
github.com/pola-rs/pola...

2 months ago 4 0 0 0
Post image

In 1-2 weeks we land live query profiling in Polars Cloud.

See exactly how many rows are consumed and produced per operation. Which operation takes most runtime, and watch the data flow through live, like water. 😍

2 months ago 10 1 0 0
Post image

ClickBench now runs the Polars streaming engine.

Polars is the fastest solution on that benchmark on Parquet file(s) 😎

The speed is there. This year, we will tackle out of core (spill to disk) and distributed to truly tackle scale.

benchmark.clickhouse.com#system=-ahi|...

3 months ago 8 0 0 0

I am sorry about that. We do welcome contributions, however design-wise we have to be strict. I would always recommend picking accepted issues and asking if now is the time to implement them before putting in the effort.

3 months ago 0 0 1 0
Preview
Release Python Polars 1.36.0-beta.2 · pola-rs/polars 🏆 Highlights Add Extension types (#25322) 🚀 Performance improvements Reduce HuggingFace API calls (#25521) Use strong hash instead of traversal for CSPE equality (#25537) Fix panic in is_between...

The pre-release of Polars 1.36 is out. Please give it a try so that we can ensure a stable final release with minimal regressions.

It lands a lot of goodies:

- Extension types
- Lazy pivots
- Streaming group_by_dynamic
- Float16 support
- Nested .over() expressions

github.com/pola-rs/pola...

4 months ago 7 1 0 0
Post image

Polars 1.34.0 is out!

Any Polars query can be turned into a generator!

Aside from that Polars now properly supports decimal types, scan_iceberg is completely native, cross joins can maintain order and much more.

Changelogs here:

- github.com/pola-rs/pola...
- github.com/pola-rs/pola...

6 months ago 6 0 0 0

Struct:

Think of a tuple with named fields. Or multiple columns in a single column.

7 months ago 0 0 0 0
Preview
Release Python Polars 1.32.0 · pola-rs/polars 🏆 Highlights Make Selector a concrete part of the DSL (#23351) Rework Categorical/Enum to use (Frozen)Categories (#23016) 🚀 Performance improvements Lower Expr.slice to streaming engine (#23683)...

4/4
Many new expressions are lowered to the streaming engine. This means you can run more queries faster!

See the full changelog here:

github.com/pola-rs/pola...

8 months ago 0 0 0 0
Advertisement
Post image

3/4
More joins will be lowered to more strict variants based on predicates. This can save a lot of intermediate rows!

8 months ago 0 0 1 0
Preview
refactor: Rework Categorical/Enum to use (Frozen)Categories by orlp · Pull Request #23016 · pola-rs/polars NoteTLDR Categoricals are completely reimplemented to be streaming compatible and fit better into the Polars Data model. They should generally be faster, more stable and more reliable. Physical ord...

2/4
The Categorical type is now streaming! No `StringCache` anymore and working Categoricals in distributed Polars.

github.com/pola-rs/pola...

8 months ago 0 0 1 0
Post image

Polars 1.32 is out and it lands a lot!

Let's go through a few:

1/4
Selectors are now implemented in Rust and we can finally select arbitrary nested types:

8 months ago 17 1 1 1
Preview
Polars Meetup - Polars Cloud and Acceleration · Luma Join the second edition of our Polars Meetup with talks from Ritchie Vink (Polars) and Vyas Ramasubramani (NVIDIA) to discuss accelerating and scaling…

Join me the 24th in SF for a @pola.rs meetup!

I will be having a talk about Polars, Polars-Cloud and the upcoming distributed engine.

NVIDIA will also be doing a talk about their GPU acceleration with Polars-CuDF

Hope to see you there!

lu.ma/60b6wfs8

9 months ago 1 0 1 0

No more `with pl.StringCache()`

Soon... 🌈

9 months ago 0 0 0 0
Preview
GP1085: Scaling DataFrames With Polars - Theme: undefined Location: NVIDIA GTC PARIS - Pavillon 7 - June 12 1:00 PM 1:45 PM - CET | Resume: Room: N03 Polars is a query engine with a DataFrame frontend designed for fast, efficient data processing. This sess...

This Thursday I will join Lawrence Mitchell from @nvidia
on the podium during the NVIDIA GTC in Paris.

We'll discuss how we made Polars work on the GPU and how it will scale to multi-GPU in the future.

On se voit là-bas !

vivatechnology.com/sessions/ses...

10 months ago 2 0 0 0
Post image

Polars has gotten 4x faster than Polars! 🚀

In the last months, the team has worked incredibly hard on the new-streaming engine and the results pay off. It is incredibly fast, and beats the Polars in-memory engine by a factor of 4 on a 96vCPU machine.

11 months ago 16 3 4 0
Post image

** Sponsor announcement ** Polars is a Supporter of RustWeek! 
Find out more about them here: pola.rs

Thank you @pola.rs for your support! 🙏

More info about RustWeek and tickets: rustweek.org

#rustweek #rustlang

1 year ago 2 1 0 0
Advertisement

Yeah, or even for single machine remotely. E.g. let's say you run a very small node as airflow orchestrator, but need a big VM for the ETL job. That orchestrator can initiate the remote query and doesn't have to worry about hardware setup/teardown.

1 year ago 1 0 1 0

Already got all TPC-H queries running distributed!

1 year ago 7 0 0 0

He is a notorious Polars hater and has tweeted he wants the project to fail.

He fears the fact that Polars deviation from the pandas API will splinter the landscape and doesn't appreciate new API development. I haven't read a technical reason from his side.

1 year ago 1 0 2 0

That was all pre 1.0.

We've released 1.0 in july last year. The API is stable now.

1 year ago 2 0 1 0

I would call that a weakness of the AI. 😅

1 year ago 1 0 1 0
Post image

This weeks Polars release we shipped initial Unity Catalog support. This makes integration with Databricks much smoother.

Writing features are under development and will follow soon. Full release notes: github.com/pola-rs/pola...

1 year ago 11 4 0 0
Preview
Modern Polars A side-by-side comparison of the Polars and Pandas libraries.

Learning polars has been ... actually a joy? It just makes sense to my #rstats #dplyr trained data muscles #databs #python

kevinheavey.github.io/modern-polars/

1 year ago 45 12 2 2
Advertisement
Preview
Release Python Polars 1.20.0 · pola-rs/polars ⚠️ Deprecations Make parameter of str.to_decimal keyword-only (#20570) 🚀 Performance improvements Extend functionality on BitmapBuilder and use in Growables (#20754) Specialize first/last agg fo...

This weeks Polars release has a huge improvement for window functions. They can be an order of magnitude faster.

And we can run 20/22 TPC-H queries on the new streaming engine and all on Polars cloud. More will follow soon! ;)

See the full release docs here:

github.com/pola-rs/pola...

1 year ago 20 2 0 0
A screenshot of a Pyodide REPL executing Polars code:

import polars as pl
import requests
r = requests.get("https://raw.githubusercontent.com/pola-rs/polars/refs/heads/main/examples/datasets/foods2.csv")
pl.read_csv(r.content).group_by("category").mean()

A screenshot of a Pyodide REPL executing Polars code: import polars as pl import requests r = requests.get("https://raw.githubusercontent.com/pola-rs/polars/refs/heads/main/examples/datasets/foods2.csv") pl.read_csv(r.content).group_by("category").mean()

A screenshot of a Quarto Live code cell executing Polars code:

import polars as pl
import requests
r = requests.get("https://raw.githubusercontent.com/pola-rs/polars/refs/heads/main/examples/datasets/foods2.csv")
pl.read_csv(r.content).group_by("category").mean()

A screenshot of a Quarto Live code cell executing Polars code: import polars as pl import requests r = requests.get("https://raw.githubusercontent.com/pola-rs/polars/refs/heads/main/examples/datasets/foods2.csv") pl.read_csv(r.content).group_by("category").mean()

A screenshot of a Shinylive app using Polars code:

from shiny import App, render, ui
import polars as pl
from pathlib import Path

app_ui = ui.page_fluid(
    ui.input_select("cyl", "Select Cylinders", choices=["4", "6", "8"]),
    ui.output_data_frame("filtered_data")
)

def server(input, output, session):
    df = pl.read_csv(Path(__file__).parent / "mtcars.csv")
    
    @output
    @render.data_frame
    def filtered_data():
        return (df
                .filter(pl.col("cyl") == int(input.cyl()))
                .select(["mpg", "cyl", "hp"]))

app = App(app_ui, server)

A screenshot of a Shinylive app using Polars code: from shiny import App, render, ui import polars as pl from pathlib import Path app_ui = ui.page_fluid( ui.input_select("cyl", "Select Cylinders", choices=["4", "6", "8"]), ui.output_data_frame("filtered_data") ) def server(input, output, session): df = pl.read_csv(Path(__file__).parent / "mtcars.csv") @output @render.data_frame def filtered_data(): return (df .filter(pl.col("cyl") == int(input.cyl())) .select(["mpg", "cyl", "hp"])) app = App(app_ui, server)

Recently I've been working on getting #polars running in #pyodide. This was a fun one, even requiring patches to LLVM's #wasm writer! Everything has now been upstreamed and earlier this week Pyodide v0.27.0 released, including a Wasm build of Polars usable in Pyodide, Shinylive and Quarto Live 🎉

1 year ago 49 9 0 0
demo of dt.replace

demo of dt.replace

✨ New temporal feature in the next Polars release!

⏲️ dt.replace lets you replace components of Date / Datetime columns

⚡🦀 It's an expressified vectorised rustified version of the Python standard library datetime.replace

1 year ago 9 1 0 1
Post image

We removed serde from our Series struct and saw a significant drop in Polars' binary size (of all features activated). The amount of codegen is huge. 😮

1 year ago 5 0 0 0
Post image

We finally support writing to cloud storage natively and seamlessly!

1 year ago 18 2 0 1
Making a recommender by just using Polars!
Making a recommender by just using Polars! YouTube video by probabl

Join us this Friday if you're eager to see what it can be like to design a recommender while limiting ourselves to just a DataFrame API. It is somewhat unconventional, but a great excuse to show off a Polars trick or two.

www.youtube.com/watch?v=U3Fi...

1 year ago 14 2 0 1

Interesting. is it an authentication error with Azure Storage?

1 year ago 0 0 1 0