Advertisement · 728 × 90

Posts by Tal Schuster

Post image Post image

🥁Introducing Gemini 2.5, our most intelligent model with impressive capabilities in advanced reasoning and coding.

Now integrating thinking capabilities, 2.5 Pro Experimental is our most performant Gemini model yet. It’s #1 on the LM Arena leaderboard. 🥇

1 year ago 215 66 34 11
Post image

Making LLMs run efficiently can feel scary, but scaling isn’t magic, it’s math! We wanted to demystify the “systems view” of LLMs and wrote a little textbook called “How To Scale Your Model” which we’re releasing today. 1/n

1 year ago 94 29 3 8
Post image

*Relaxed Recursive Transformers*
by @talschuster.bsky.social et al.

Converts pre-trained transformers to a more efficient version by turning blocks of layers into a single layer which is iterated. Lots of interesting tricks!

arxiv.org/abs/2410.20672

1 year ago 5 2 1 0

Will be at NeurIPS. Reach out if you're interested in discussing adaptive compute in LLMs or other topics

1 year ago 0 0 0 0
Post image

New Gemini model grabs first place in all domains.

Happy one year anniversary Gemini team!

1 year ago 2 1 0 0
Blue sky, green grass, bicycle looks good, bird riding it is almost recognizable as a pelican

Blue sky, green grass, bicycle looks good, bird riding it is almost recognizable as a pelican

Google just released a new Gemini model - gemini-exp-1206

I upgraded my llm-gemini plugin to support it and then got the best result yet for my "Generate an SVG of a pelican riding a bicycle" benchmark

simonwillison.net/2024/Dec/6/g...

1 year ago 143 10 7 0

Lol

1 year ago 0 0 0 0
Advertisement

Dear algorithm,
I would like to view:
70% new ML and LLM research and cool results.
10% funny videos with cute animals.
5% sports (but no spoilers if I'm planning to watch the reply later).
5% travel and life hacks.
5% general tech.
5% random.
Regards,

1 year ago 9 2 1 1