If you're in Berkeley or like a nice streamed talk, I'm about to give a talk at the Simons Institute today: "You Know It Or You Don't: Compositionality and Phase Transitions in LMs". Tune in at 4PM Pacific!
Posts by Sasha Rush
I'm hanging around with Theorists
What to know about DeepSeek
youtu.be/0eMzc-WnBfQ?...
In which we attempt to figure out MoE, o1, scaling, tech reporting, modern semiconductors, microeconomics, and international geopolitics.
These are great recommendations thank you.
For reasons, I find myself thinking a lot about the history of US/USSR Cold War science, particularly in applied math. Does anyone have a recommendation for a good book on this topic?
Yeah vertical is kind of dumb, but I thought I'd try it out.
I noticed I have trouble saying that word.
however if you listen to your own videos then you will never manage to release anything.
10 short videos about LLM infrastructure to help you appreciate Pages 12-18 of the DeepSeek-v3 paper (arxiv.org/abs/2412.19437)
www.youtube.com/watch?v=76gu...
Thought this "Bill Gates" guy was on the level.
I'll try it out. Good to check once a year to see if I'm secretly an existential risk guy.
I mean Microsoft under the table bought his company, right?
Luckily there is no indication from the review he read the book.
Maybe I'll just use bluesky to rant about topics I'm too scared to talk about on twitter.
The casual conflation of AI with gene editing is intellectual malpractice. These two things have nothing to do with each other!
Would love to read a good book about this topic if anyone wants to give it a try.
I've been listening to too much If Books Could Kill, so now I'm convinced these airport books are actually the only thing that matters.
I tried reading this book, and I was just shocked at how little insight it had, and its sheer inability to focus. The fact that it is being recommended to policy makers...
www.gatesnotes.com/The-Coming-W...
I'm going to do a live coding stream for the next couple of hours. We'll start by running through some WebGPU tutorials. Can also talk about some AI stuff.
www.youtube.com/watch?v=sqKq...
We outperform Llama 70B with Llama 3B on hard math by scaling test-time compute
How? By combining step-wise reward models with tree search algorithms :)
We're open sourcing the full recipe and sharing a detailed blog post
huh, so maybe OCaml should be the target for verifiable generation? I heard you guys have ways to build fast
As coding LLMs get faster at inference, iterating verification-in-the-loop tests becomes the bottleneck for coding agents. Probably need quite different programming systems for these settings, or even things like "batchable" runtimes, whatever that means.
We organised a lively poster fest with many students rehearsing for the upcoming @neuripsconf.bsky.social next week and others discussing their cool work!
Thanks to #GAIL, the #Generative #AI lab in #Edinburgh for sponsoring the event!
I wanted to make my first post about a project close to my heart. Linear algebra is an underappreciated foundation for machine learning. Our new framework CoLA (Compositional Linear Algebra) exploits algebraic structure arising from modelling assumptions for significant computational savings! 1/4
Screenshot of BBC 100 picture of Sasha and blurb; linked in post.
Proud of my amazing colleague @sashamtl.bsky.social for her much deserved recognition on advancing the science of AI energy use.
BBC100: www.bbc.co.uk/news/resourc...
Fast Company: www.fastcompany.com/91233692/why...
Sasha has been working tirelessly moving things fwd--endurance & brilliance in one.
NEW: we have an exciting opportunity for a tenure-track professor at the #KempnerInstitute and the John A. Paulson School of Engineering and Applied Sciences (SEAS). Read the full description & apply today: academicpositions.harvard.edu/postings/14362
#ML #AI
We're hiring another predoctoral researcher for my team at Ai2/OLMo next year. The goal of this position is to mentor and grow future academic stars of NLP/AI over 1-2 years before grad school.
This ends up being people done with BS or MS who want to continue to a PhD soon.
https://buff.ly/49nuggo
Unfortunately Yoav's question is a bit more interesting and subtle than this talk.
Is there a community that writes RL-first programming languages? Something like (Num)Pyro that takes seriously the idea of separating the policy specification from the learning process.