Shivam Raval (@sraval) Bsky

This map shows the hour of sunrise globally through the year. It reveals time zones following national and, sometimes, regional boundaries, and slicing through the oceans.

11 months ago 41 13 3 2

Thinking in analogies Connecting the dots Shivam Raval How analogical thinking can drive breakthroughs in Human-AI Collaboration @sraval.bsky.social

Here are the slides from the presentation: docs.google.com/presentation...

Feedback, comments or suggestions are most welcome 😄

1 year ago 1 0 0 0

Update on the VIS+AI meetup: I'm a speaker now!

1 year ago 1 0 1 0

On the Biology of a Large Language Model

Can we understand the mechanisms of a frontier AI model?

📝 Blog post: www.anthropic.com/research/tra...
🧪 "Biology" paper: transformer-circuits.pub/2025/attribu...
⚙️ Methods paper: transformer-circuits.pub/2025/attribu...

Featuring basic multi-step reasoning, planning, introspection and more!

1 year ago 125 28 4 3

bsky.app/profile/ieee...

1 year ago 3 0 0 0

Update: The event is free now, thanks to generous funding from Northeastern University!

1 year ago 2 0 0 0

Join us for our first Vis+AI meetup on April 3rd at Northeastern University, a meetup to gather people interested in the intersection of Data Visualization and Artificial Intelligence. Sign up as soon as possible! We have a limited number of spots. lnkd.in/e8whS6v2.

1 year ago 5 2 1 0

The wind map at hint.fm/wind/ has been running since 2012, relying on weather data from NOAA. We added a notice like this today. Thanks to @cambecc.bsky.social for the inspiration.

1 year ago 78 20 1 1

Great thread describing the new ARBOR open interpretability project, which has some fascinating projects already. Take a look!

1 year ago 8 2 0 0

GitHub - ARBORproject/arborproject.github.io Contribute to ARBORproject/arborproject.github.io development by creating an account on GitHub.

Today we're launching a multi-lab open collaboration, the ARBOR project, to accelerate AI interpretability research for reasoning models. Please join us!

github.com/ARBORproject...

(ARBOR = Analysis of Reasoning Behavior through Open Research)

1 year ago 44 9 1 0

DeepSeek R1 shows how important it is to be studying the internals of reasoning models. Try our code: Here @canrager.bsky.social shows a method for auditing AI bias by probing the internal monologue.

dsthoughts.baulab.info

I'd be interested in your thoughts.

1 year ago 28 9 1 1

In 1897, Alfred G. Mayer created his butterfly wing projections, an attempt to gain new insights into natural patterns and laws. Vertical blocks denote individual wings, distorted and stretched mathematically to fill a tidy rectangular space. More here: publicdomainreview.org/collection/m...

1 year ago 142 40 3 4

DeepSeek is a side project 🔥

1 year ago 299 40 4 6

1 year ago 12 1 2 0

Tailscan website now uses v4!

Also updated the Tailwind CSS color palette cheat sheet 👀 added a button to see the old v3 and new v4 color Tailwind color palette.

#buildinpublic

1 year ago 6 1 0 0

I'm teaching my first course! A seminar on "Machine Behavior."

Readings are a mix of NLP, CSS-y, and ML work on how machines (focus LLMs) "behave" within sociotechnical systems and on how they can be used to study human behavior.

Syllabus: manoelhortaribeiro.github.io/teaching/spr...

1 year ago 46 8 3 1

Can LLMs Generate Novel Research Ideas? A Large-Scale Human Study with 100+ NLP Researchers Recent advancements in large language models (LLMs) have sparked optimism about their potential to accelerate scientific discovery, with a growing number of works proposing research agents that autono...

1) In the narrow area of prompt generation techniques LLMs can generate ideas rated as more novel and exciting. They are sometimes less feasible. Out of 4000 ideas generated, only 200 were potentially unique. arxiv.org/abs/2409.04109

1 year ago 0 1 0 0

hahahahah there were actually two technical reports for RL reasoning models today, kimi 1.5 also has good stuff on reward shaping + RL infra

kimi 1.5 report: https://buff.ly/4jqgCOa

1 year ago 29 7 0 0

Dimensionality reduction beyond neural subspaces with slice tensor component analysis - Nature Neuroscience Neural activity does not always lie in a low-dimensional subspace. The authors extend this classic view to show that task-relevant information is distributed across multiple covariability classes and ...

Dimensionality reduction beyond neural subspaces with slice tensor component analysis

www.nature.com/articles/s41...

1 year ago 14 7 1 1

What a beauty! This is comet C/2024 G3 (ATLAS) passing through the field of view of the LASCO C3 coronagraph.

It wasn't for certain whether it would survive it's closest approach to the sun on January 13th, but it did and delivered us a spectacular show!

#comet #C2024G3 🔭

1 year ago 527 197 14 16

AI Agents Are Here. What Now? We’re on a journey to advance and democratize artificial intelligence through open source and open science.

To wrap your head around agents and think through the ethics, our Society and Ethics team put together a great resource - 👏 @mmitchell.bsky.social @sashamtl.bsky.social @evijit.io @giadapistilli.com

huggingface.co/blog/ethics-...

1 year ago 15 3 0 0

Pie and donut charts get a bad rep, but they work well if used for the right data and tasks. Read about what the science has to say about them in our new blog post: https://buff.ly/3DURbnS

1 year ago 9 3 0 2

*Deep Learning Through A Telescoping Lens*
by @alanjeffares.bsky.social @aliciacurth.bsky.social

Shows that tracking 1st-order approximations to the training dynamics provides insights into many phenomena (e.g., double descent, grokking).

arxiv.org/abs/2411.00247

1 year ago 10 1 0 0

New paper <3
Interested in inference-time scaling? In-context Learning? Mech Interp?
LMs can solve novel in-context tasks, with sufficient examples (longer contexts). Why? Bc they dynamically form *in-context representations*!
1/N

1 year ago 53 16 2 1

I've started a Research Integrity Feed populated by hashtags below & choice users. For the #SciPub / #AcademicPublishing sleuths 📊🔍👀

Plus it's got a cute furry mascot! 😘
bsky.app/profile/did:...

#ResearchIntegrity
#PredatoryPublisher
#PredatoryPublishing
#EditorialIndependence
#SciRetraction

1 year ago 57 24 5 0

ByteDance has open-sourced a lip-sync model called LatentSync. LatentSync is an end-to-end lip-sync framework that does not rely on any intermediate motion representation, but instead models complex audio-visual correlations directly in the latent space.

1 year ago 17 3 1 1

Genuary 2025, Day 3: "Exactly 42 lines of code." A tool for drawing with the osculating (kissing) circles of one's stroke.

#genuary #genuary2025 #genuary3

1 year ago 62 7 1 0

What was the most important machine learning paper in 2024?

My Famous Deep Learning Papers list (that I use in teaching) does not include any new ideas from the last year.

papers.baulab.info

Which single new paper would you add?

1 year ago 55 11 10 0

Noteworthy AI Research Papers of 2024 (Part One) Six influential AI papers from January to June

Incredibly interesting Roundup of AI papers.

open.substack.com/pub/sebastia...

1 year ago 14 2 0 0

Happy New Year! 🎆🎇
[Throwback to 2016 when I was in Sydney on New Year’s Eve]

1 year ago 2 0 0 0

Posts by Shivam Raval