Vlad Niculae (@vn-ml) Bsky

Your data is low-rank, so stop wasting compute! In our new paper on low-rank thinning, we share one weird trick to speed up Transformer inference, SGD training, and hypothesis testing at scale. Come by ICML poster W-1012 Tuesday at 4:30!

9 months ago 25 7 2 2

So you want to skip our thinning proofs—but you’d still like our out-of-the-box attention speedups? I’ll be presenting the Thinformer at two ICML workshop posters tomorrow!

Catch me at Es-FoMo (1-2:30, East hall A) and at LCFM (10:45-11:30 & 3:30-4:30, West 202-204)

9 months ago 5 4 0 0

new working paper! we (me, Su Lin Blodgett, @ninamarkl.bsky.social) examine how recent marketing of LLMs extends older discourses that cast workers as bundles of skills, and unpack the false promises of empowerment these discourses embed, in times of precarity

tisjune.github.io/papers/aarhu...

9 months ago 35 3 3 5

Looking forward to this year's edition! With great speakers: Ryan McDonald Yulan He @vn-ml.bsky.social @antonisa.bsky.social Raquel Fernandez @annarogers.bsky.social Preslav Nakov @mohitbansal.bsky.social @eunsol.bsky.social Marie-Catherine de Marnefffe !

10 months ago 6 3 0 0

Language and Computation in Neural Systems We are an international group of scientists consisting of linguists, cognitive scientists, cognitive neuroscientists, computational neuroscientists, computational modellers, computational scientists, ...

my lab (lacns.github.io) at @mpi-nl.bsky.social and @dondersinst.bsky.social is recruiting for two PhD and two postdoctoral positions funded by an @erc.europa.eu Consolidator - come join us!

PhD: www.mpi.nl/career-educa...

Postdoc: www.mpi.nl/career-educa...

(please share widely)

11 months ago 81 53 3 9

Does the "agreement" part refer only the previous question or to something else, and does the answer there have any consequences in the review process (can we review regardless of option? can we submit papers regardless of option?)

thanks in advance!

11 months ago 2 0 0 0

The "attribution" section only has an option "yes", signaling agreement to deanonymize your reviews. Is there an option to say no? (eg. by not selecting anything?) This is not communicated and is different from most other entries in the form.

11 months ago 2 0 1 0

hi, i'm struggling with the author registration form. i can't work out how to navigate the dark design patterns used in the "attribution" and "agreement" part of the form.

could you please provide some details about those choices?

most imp., are there choices that result in desk reject?

11 months ago 2 0 1 0

Schematic illustration of a scalar-valued residual deep GP with L hidden layers. The last layer is a scalar-valued GP on the manifold. If it is not present, the model is manifold-valued. If it is replaced with a Gaussian vector field (GVF), the model is a vector field on the manifold.

Excited to share our ICLR 2025 oral "Residual Deep Gaussian Processes on Manifolds"!

With @vabor112.bsky.social & @arkrause.bsky.social, we introduce manifold-to-manifold GPs that can be composed together, generalising deep GPs to manifolds. Applications include wind prediction & Bayes opt! 1/n

1 year ago 37 9 1 2

i can't believe how long we've spent fooling ourselves about the value of fully specified, massive matmuls instead of embracing the gods of sparsity

1 year ago 5 2 0 0

Vlad Niculae

Recruiting a PhD candidate at U. of Amsterdam (funded, 4yr). We will use ML&NLP, prob. models, and user studies, to make adaptive scientific-assistant systems that communicate & justify decisions in ways helpful to experts.

More: vene.ro/jobs.html
Apply by May 18: werkenbij.uva.nl/en/vacancies...

11 months ago 11 6 0 0

Variational approximation with Gaussian mixtures is looking cute! So here it's just gradient descent on K(q||p) for optimising the mixtures means & covariances & weights...
@lacerbi.bsky.social

1 year ago 33 7 2 0

This review paper by @guillaume-garrigos.com on SGD-related algorithms is a fantastic resource, offering elegant, self-contained, and concise proofs in a single, accessible reference. arxiv.org/pdf/2301.11235

1 year ago 189 40 1 0

These phenomenon have been observed since early vision systems. It is important to report these things, though. Maybe it will permeate and we won’t keep making the same mistakes over and over

1 year ago 12 3 1 0

This is such a beautiful algorithm (and a nice analysis): to check if an array is sorted vs. far from being sorted (many entries need to be changed), just:
- pick an element uniformly at random in the array
- "forget" where it was
- try to find it again via binary search
Repeat this a few times.

1 year ago 28 6 0 0

It is in our hands: To protect our safety and the right to protest, do more than re-formulating the house rules. Column by anonymous FNV-member from UvA Protest is a fundamental right, and universities have a duty to facilitate and protect it. The violent events surrounding pro-Palestine protests this past year ...

I and hundreds other workers at the University of Amsterdam are on strike with @fnv.bsky.social

www.linkedin.com/pulse/our-ha...

1 year ago 2 1 0 0

"AI can be bad but also it can be good" is just a really dumb way to talk about anything...it's the grade-school exercise of "make a list of pros and cons" but pressed into service for producing a sense of inevitability and making the medicine go down

1 year ago 4 1 1 0

OpenAI’s new defense contract completes its military pivot A new partnership with Anduril, announced today, will deploy AI on the battlefield. It represents an overhaul of the company’s position in just a year.

OpenAI in 2024:

“No AI for weapons or military”

“Do use our AI to make weapons to hurt yourself or others”

“Military is fine, but no AI for weapons”

“Sure put it on battlefield drones”

www.technologyreview.com/2024/12/04/1...

1 year ago 195 71 6 17

Blue skies 🦋 , hot (?) takes 🔥

Constrained output for LLMs, e.g., outlines library for vllm which forces models to output json/pydantic schemas, is cool!

But, because output tokens cost much more latency than input tokens, if speed matters: bespoke, low-token output formats are often better.

1 year ago 8 1 2 0

I hope I am not late to the party (was away post-quals chilling) but here are some thoughts on why this is bad IMO:

First, a disclaimer that I am writing this as an African who is a speaker of multiple African languages, NLP researcher of African languages, and HCI researcher focusing broadly on..

1 year ago 125 60 9 8

Posts by Vlad Niculae