Gaël Varoquaux (@gaelvaroquaux) Bsky

Claude Code's Source: 3,167-Line Function, Regex Sentiment
Anthropic claimed 100% of Claude Code is AI-written. A source leak exposed a 3,167-line function, regex sentiment analysis, and 250K wasted API calls daily.

"AI amplifies whatever is already there. Good discipline becomes great output. No discipline becomes technical debt at machine speed. Anthropic chose a direction. Go faster. Have Claude check Claude. And when it breaks, go faster still." substack.com/home/post/p-...

1 day ago 22 10 2 1

Call for Proposals — Compute! Paris 2026 Submit your talk proposal for Compute! Paris 2026. The Call for Proposals is open from April 15th to May 24th, 2026.

The team of JupyterCon 2023, PyData Paris 2024 & 2025 organizes a new conference named Compute! Paris 2026 on open source computation and data. The event will take place on November 25–26, 2026 at Sorbonne Université in Paris.

CfP deadline: May 24, 2026: compute.events/paris2026/cf...

17 hours ago 5 6 1 0

She may be interested, among other things, in the language used.

5 days ago 1 0 1 0

And so is switching careers to hot air.

1 week ago 2 0 1 0

Yes, it is a management strategy embodied by the chain of command.

1 week ago 2 0 0 0

The ZRRs are something the Ministry of the Interior wants. One can read this as a war between the Interior and research.

HDRs, on the other hand, are largely created and endured by the research community itself.

Don't imagine that an administration, or a state, is a coherent whole; it is rather a juxtaposition of interests.

2 weeks ago 2 0 0 0

Oh, and there is an influence side to it, with companies better placed to whisper ideas about how the money should be spent.

2 weeks ago 3 0 0 0

It is possible that your remark applies more to that environment.

2 weeks ago 0 0 0 0

Administrative burdens are power strategies on the part of several hierarchical levels, including intermediate ones.
These tendencies are natural in any organization, but there is too little effort from above to counter them.

2 weeks ago 1 0 1 0

My experience of rubbing shoulders with the governor leads me to say that it is not that the political class hates academics, but that it systematically arbitrates in favor of industry.

2 weeks ago 1 0 2 0

Use of restricted-access zones (zones à régime restrictif, ZRR) within public research laboratories

Gems from the Senate. The Minister of ESRE:
"I want to point out (...) that there is no political discrimination here [in the refusals to hire in ZRRs]. The proof is that the vast majority of ZRRs concern the so-called hard sciences, in particular technologies."
www.senat.fr/questions/ba...

3 weeks ago 4 5 2 0

New skrub release ✨️
I'm really excited about the more general ApplyToCols.

I've found that it lets me write complex data transformations on dataframes very naturally, as I combine it with skrub's selectors to choose which columns I apply transformations to.

skrub-data.org/stable/refer...

3 weeks ago 7 1 0 0

The minimum required version of polars has been increased from 0.20 to 1.5.

3 weeks ago 2 1 0 0

The TableReport custom filters have been improved and expanded: they can now take skrub selectors for filtering columns. The interface has also been simplified.

3 weeks ago 2 1 1 0

The has_nulls selector can now select columns based on a user-specified threshold of null values.
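
To illustrate the idea of selecting columns by their share of null values, here is a minimal plain-Python sketch. It does not use skrub: the helper names `null_fraction` and `select_has_nulls`, the `threshold` parameter, and the strict comparison are all assumptions made for this illustration, so check the skrub documentation for the actual selector signature.

```python
def null_fraction(values):
    """Fraction of entries in a column that are None."""
    if not values:
        return 0.0
    return sum(v is None for v in values) / len(values)


def select_has_nulls(table, threshold=0.0):
    """Return the names of columns whose null fraction exceeds `threshold`.

    `table` maps column names to lists of values. The strict comparison
    is an assumption of this sketch, not a statement about skrub.
    """
    return [
        name
        for name, values in table.items()
        if null_fraction(values) > threshold
    ]


table = {
    "age": [31, None, 45, 52],              # 25% nulls
    "city": ["Paris", None, None, None],    # 75% nulls
    "id": [1, 2, 3, 4],                     # no nulls
}

any_null = select_has_nulls(table)                    # default: any null at all
mostly_null = select_has_nulls(table, threshold=0.5)  # more than half null
```

With a threshold, the same selector can separate "has a few gaps" from "is mostly empty", which is often the distinction that matters when deciding which columns to drop or impute.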

3 weeks ago 2 1 1 0

It is now possible to provide custom null values to the Cleaner, so that they are marked as nulls (for example, the string "unknown").
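
As a rough picture of what marking custom null values means, the plain-Python sketch below replaces user-chosen placeholder strings with `None`. The function `mark_nulls` and its `null_values` parameter are hypothetical names for this illustration only; they are not the Cleaner's actual interface.

```python
def mark_nulls(values, null_values=("", "unknown")):
    """Replace user-specified placeholder values with None.

    `null_values` is a hypothetical parameter name for this sketch.
    """
    markers = set(null_values)
    return [None if v in markers else v for v in values]


raw = ["Paris", "unknown", "Lyon", "N/A"]
cleaned = mark_nulls(raw, null_values=("unknown", "N/A"))
```

Once placeholders are real nulls, downstream steps such as null-based column selection or imputation can treat them consistently.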

3 weeks ago 2 1 1 0

The performance of DataOps with many computational nodes has been improved. Additionally, DataOps CV splitters can now take kwargs. For example, this makes it possible to specify groups when creating train/test splits.

3 weeks ago 2 1 1 0

The SingleColumnTransformer and RejectColumn classes allow the construction of custom-made transformers for specific use cases.

3 weeks ago 2 1 1 0

The ApplyToCols transformer is now a powerful alternative to the regular scikit-learn ColumnTransformer. It is now possible to apply any transformer to a subset of chosen columns using the skrub selectors.
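
The underlying pattern, applying a transformation only to the columns a selector picks and passing the rest through, can be sketched in plain Python. This is a concept illustration under stated assumptions, not skrub's implementation: `apply_to_cols`, `is_numeric`, and `center` are hypothetical helpers, and a table is represented as a dict of lists rather than a dataframe.

```python
def apply_to_cols(table, transform, selector):
    """Apply `transform` to columns chosen by `selector`; copy the others.

    `selector(name, values)` returns True for columns to transform.
    """
    return {
        name: transform(values) if selector(name, values) else list(values)
        for name, values in table.items()
    }


def is_numeric(name, values):
    """Selector: keep columns whose values are all ints or floats."""
    return all(isinstance(v, (int, float)) for v in values)


def center(values):
    """Transformation: subtract the column mean from every value."""
    mean = sum(values) / len(values)
    return [v - mean for v in values]


table = {"height": [1.0, 2.0, 3.0], "name": ["a", "b", "c"]}
result = apply_to_cols(table, center, is_numeric)
```

Separating "which columns" (the selector) from "what to do" (the transformer) is what keeps such pipelines readable as the number of columns grows.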

3 weeks ago 3 1 1 0

Release history Release 0.8.0: New Features: The eager_data_ops configuration option has been added. When set to False, no previews are computed and validation is deferred until the DataOp is actually used (e.g. w...

✨ skrub version 0.8.0 has been released ✨

This version includes several new features, including multiple improvements to the functionality and performance of the Data Ops, along with a few bug fixes and improvements to the docs.

Changelog:
skrub-data.org/stable/CHANG...

Highlights below ⤵️

3 weeks ago 8 4 1 1

skrub: Machine learning for dataframes

Try skrub (skrub-data.org) for machine learning with dataframes, and skore (docs.skore.probabl.ai), still young but aiming to help with the evaluation and tracking of data science.

Both by creators of scikit-learn

Claude should be using them, but doesn't.

4 weeks ago 0 0 0 0

Today NeurIPS is announcing our official satellite event in Paris.

After responding to the call from Ellis following the success of EurIPS in December, we are pleased to reach a new milestone by joining forces with the NeurIPS organizing committee for the 2026 edition.

4 weeks ago 89 32 1 9

TabICLv2: A better, faster, scalable, and open tabular foundation model Tabular foundation models, such as TabPFNv2 and TabICL, have recently dethroned gradient-boosted trees at the top of predictive benchmarks, demonstrating the value of in-context learning for tabular d...

The state of the art for learning on tabular data is:
arxiv.org/abs/2602.11139
It comes with a high-quality software implementation:
tabicl.readthedocs.io/en/latest/

4 weeks ago 14 1 0 2

💰 €1M award pool for Medical Imaging AGI’s Last Exam (MEDAL)

Medical AI papers are booming - but are we solving the right problems?

Too often, research follows available data, not real clinical needs.

1 month ago 9 5 1 1

Looking forward to being back in Berkeley and seeing you all!

Thanks for hosting me

1 month ago 1 0 0 0

Cementing a machine-learning ecosystem: scikit-learn and beyond Please join us for a special event with Gaël Varoquaux (Probabl, scikit-learn), co-sponsored by BIDS, CDSS, and the Department of Statistics! Varo...

"Cementing a machine-learning ecosystem: scikit-learn and beyond": on Friday March 20, we at @ucbids.bsky.social and the Berkeley Statistics department are delighted to host @gaelvaroquaux.bsky.social for a seminar. Join us in person or online!

events.berkeley.edu/BIDS/event/3...

1 month ago 23 11 1 1

For a moratorium on the full conversion of Inria into a ZRR

The petition zrr.collectif-inria.fr has passed 900 signatories. We are starting to compile statistics: in particular, it has been signed by 40% of the permanent researchers paid by Inria (300 people) and 33% of project-team leaders.

1 month ago 7 5 1 0

Chief Executive Officer - New York City, New York (US) job with arXiv | 37961678 arXiv seeks its first CEO to champion open, free scientific discovery and guide the platform’s next chapter as an independent nonprofit.

@arxiv.bsky.social is hiring a CEO. Job advert is here: jobs.chronicle.com/job/37961678...

1 month ago 21 13 0 1

The next scikit-learn release will allow inspecting the type and values of attributes of fitted estimators in Jupyter notebooks & example code rendered as HTML in sphinx-gallery powered project websites.

scikit-learn.org/dev/auto_exa...

1 month ago 13 6 2 2

Skore Is Live: Track Your Data Science Skore by Probabl: The collaboration platform for data scientists. Evaluate models, automate reports, and bridge the gap from notebooks to production.

📝 Read the announcement: blog.probabl.ai/skore-is-live
🚀 Try Skore: skore.probabl.ai
👨‍💻 Explore the code: github.com/probabl-ai/s...
📖 Read the docs: docs.skore.probabl.ai

1 month ago 3 1 0 5
