Rasmus Aagaard (@rasgaard.com) Bsky

For a recent lab meeting, I wrote up a grab bag of ways to think about your development as a researcher during a PhD: emerge-lab.github.io/papers/an-un...

Sharing in case folks find it useful or have feedback!

3 weeks ago 102 12 6 5

Flash-KMeans

An IO-aware implementation of exact k-means that rethinks the algorithm around modern GPU bottlenecks. Flash-KMeans achieves 30x speedup over cuML and 200x speedup over FAISS.

Paper: arxiv.org/abs/2603.09229
Code: github.com/svg-project/...

3 weeks ago 28 6 0 1

AI i valgkampen: Hjælper det kandidater, eller skræmmer det vælgerne væk? Når politikere bruger kunstig intelligens, kan det svække deres troværdighed og skabe større afstand til vælgerne.

AI i valgkampen: Hjælper det kandidater, eller skræmmer det vælgerne væk? #dkvid #dkforsk

1 month ago 5 4 2 0

Claude Code Copenhagen meetup 🍻🚀

1 month ago 1 0 1 0

Open source er en strategisk nødvendighed i AI: Sådan kommer vi med i kapløbet | DataTech På tværs af it-branchen spiller open source software en fundamental rolle for både offentlige og private løsninger. Dette gør sig også gældende i AI-verdenen.Transformer-arkitekturen, grundstenen i nu...

Together with fellow ddsc.io members I helped a bit with this piece on @ing.dk :)

1 month ago 4 0 0 0

First blogpost of my cofounder post-Delhi on the deafening silence of public/political sphere as we come closer to another industrial revolution. anastasiastasenko.substack.com/p/the-countr...

1 month ago 73 19 3 8

The Danoliterate Generative Large Language Model Benchmark

Strongly agree! Sounds like a fantastic idea for a special course :) Maybe you can take inspiration from Danoliterate danoliterate.compute.dtu.dk which was a master's project at DTU

1 month ago 3 0 1 0

Helt klart. Der er stadig tonsvis af quirks. Jeg er mest begejstret for tendensen der bliver vist. At software og hardware begge udvikler sig i retning af at lokal inference bliver konkurrencedygtigt ift. cloud.

1 month ago 0 0 0 0

LLM cloud inference dominates usage, but should it? Local models and accelerators have improved massively over recent years.

Perfect routing to best local model "reduce energy consumption by 80.4%, compute by 77.3%, and cost by 73.8% versus cloud-only deployment"

arxiv.org/pdf/2511.07885

1 month ago 9 3 3 1

Introducing ✨Tiny Aya✨, a family of massively multilingual small language models built to run where people actually are.

Tiny Aya delivers strong multilingual performance in 70+ global languages in a 3.35B parameter model, efficient enough to run locally, even on a phone.

1 month ago 97 15 2 5

Excited to be in fantastic company amongst the speakers for IDA Driving AI 2026 :)

ida.dk/driving-ai/t...

1 month ago 5 0 1 0

The paper suggests that poor reasoning abilities is one failure mode that comes from removing deeper layers. But yeah, wonder what else unexpectedly takes a hit

1 month ago 2 0 0 0

Due to residual structures in Transformer-models it's possible that many layers contribute very little to the downstream performance of the network, allowing for removal those redundant layers with little impact.

openreview.net/pdf/72eeca23...

1 month ago 12 2 0 2

Local AI inference through the browser is great. Transformers.js makes it feasible for someone like me who has little web dev experience :)

Here's a simple transcription tool using Whisper-large-v3-turbo, making use of your local hardware and ensuring privacy.
rasgaard.com/webai-stuff/...

2 months ago 3 1 0 0

ofc there's already a python plotting library for this sort of thing

github.com/jndean/LossR...

2 months ago 7 4 2 0

This bullshit has been going on for generations. A 1996 Simpsons episode tried to teach us. YouTube video by Gangstagrass

This is crazy:

youtu.be/v81b0XllvgI

2 months ago 374 130 14 15

AI-utopierne: En morgen i Elon Musks drømmeverden YouTube video by Verden forsøgt forklaret

www.youtube.com/watch?v=b64v...

2 months ago 1 0 0 0

A photograph of sunny Copenhagen in the summer!

📢 I am hiring a highly-motivated Ph.D student at the University of Copenhagen to work on tokenization-free NLP.

Read our previous work in this topic: aclanthology.org/2025.emnlp-m...
aclanthology.org/2023.emnlp-m...
openreview.net/forum?id=FkS...

Apply by March 8: employment.ku.dk/phd/?show=1563

2 months ago 20 9 0 0

Sometimes a single line changes the reproducibility-game completely:))

The paper is really cool though: Can the transformer+conv Mimi (NAC) decoder be replaced with a purely transformer-based one?

Answer is yes, and it's 10x faster! Benchmarked on actual mobile hardware.

arxiv.org/pdf/2601.20094

2 months ago 1 1 1 0

WE ARE BACK! 🚀🚀🚀

📢 #databeers #copenhagen x Absalon is back on March 10th, SAVE THE DATE!

Many thanks to our sponsors, the Danish Data Science Academy and the Danmarks Tekniske Universitet

Speaker announcement and tickets out soon, stay tuned!

2 months ago 15 11 1 1

Slowly but surely getting a better intuition for how and when compression methods such as quantization is beneficial and, perhaps more crucially, detrimental to overall model efficiency

2 months ago 2 0 0 0

Understanding Efficiency: Quantization, Batching, and Serving Strategies in LLM Energy Use Large Language Models (LLMs) are increasingly deployed in production, contributing towards shifting the burden in terms of computational resources and energy demands from training to inference. While ...

"numerical precision reduction yields the most benefit in the prefill phase of large models, where compute dominates. In contrast, the decode phase remains memory-limited, and aggressive quantization (e.g., int8 or int4) may incur overheads that outweigh theoretical savings"

2 months ago 16 0 2 2

It took me weeks, but finally it's there: an overlong blogpost on synthetic pretraining. vintagedata.org/blog/posts/s...

2 months ago 86 21 3 2

Ny open source-baseret platform kan fjerne de store tech-virksomheders greb om danske skoler · Dataetisk Tænkehandletank Photo: publiccode.eu Open source-værktøjer i skoler og kommuner vil ikke kun spare samfundet penge og...

"Når 32 % af alle danske skoler bruger OS2 Skole, går det i nul. Omkostningerne ved OS2 Skole vil ikke være de licenser, der betales til Big Tech, men penge, der betales til de [skattebetalende] personer, der ansættes til at vedligeholde og udvikle systemet."
dataethics.eu/da/new-open-...

2 months ago 2 1 0 0

This reddit post made me laugh

2 months ago 19 4 0 0

Kan man klare sig uden amerikansk 'big tech'? Her er nogle af alternativerne Der findes masser af alternativer til de store amerikanske platforme og programmer, siger dataetisk rådgiver og forbrugerråd.

Kan man klare sig uden amerikansk 'big tech'? Her er nogle af alternativerne www.dr.dk/nyheder/viden/teknologi/...

#dk #nyheder #dkbot

2 months ago 22 6 1 0

In Grenoble/France where I work, a steep rise in bicycle commutes (green) happened, mainly because of strong support from the local municipality through infrastructure changes - bike lanes! This coincides with a drop in car usage (red), while walking (yellow) and public transp. (black) stayed flat.

2 months ago 52 6 2 0

Creating the world’s best model for Scandinavian languages The Tech Collective offers the world’s best embedding model for Danish, Swedish, and Norwegian, transforming text into numerical data for AI applications.

Der er en artikel om det her thetechcollective.eu/insights/cre... :)

2 months ago 1 0 0 0

The winners were,
- CoRal: huggingface.co/CoRal-project/
- Top Scandinavian text embedding model: huggingface.co/jealk/TTC-L2...
- EuroEval: euroeval.com

2 months ago 4 0 1 0

Had a great time at @danskerhverv.dk today where the Danish Data Science Community (ddsc.io - join the Slack!) revealed the winners of the first Open Source Awards

2 months ago 3 1 1 1

Posts by Rasmus Aagaard