rohit (@rnair) Bsky - nopzon.com

claude 3.7 performance on Pokemon

of all the benchmarks being sent around for Claude 3.7 this is the one i'm paying the most attention to. they're cheating a little bit by giving it the oldest, original pokemon game (red/blue) which is more than 20 years old and will have plenty of info online to learn from.

1 year ago 30 3 3 2

sheesh! what a day for AI. QwQ-Max, sonnet-3.7 AND open source of FlashMLA

1 year ago 9 2 1 0

the only interview question i need to ask is:

how many sqlite databases do you have on your machine and what do you use them for?

if you're able to answer with a specific number, you're not ready yet unless you have a custom cron job cleaning up unused sqlite files.

1 year ago 0 0 0 0

a screen of various ai assistants hinting at the user's digital hoarding proclivities

i have a hoarding problem

1 year ago 3 0 1 0

opening discord though on drop days, that's a whole other beast.

1 year ago 0 0 0 0

i don't check social media for 8h and people are already playing with claude 3.7 sonnet

1 year ago 0 0 1 0

edtech companies who've amassed a bank of their own content in the last decade are sitting on a goldmine.

1 year ago 2 0 0 0

saving all of my generated deepseek r1's reasoning traces to a USB drive so when the time comes i can send it back to my 12 year old self to prevent him from bashing his head against the desk for being stuck on the 2nd problem of an AMC 12.

1 year ago 1 0 0 0

good thing i'm intentionally curating everything to discreetly bend the ai agents to my will.

1 year ago 4 0 1 0

GitHub - deepseek-ai/DeepSeek-V3 Contribute to deepseek-ai/DeepSeek-V3 development by creating an account on GitHub.

DeepSeek, a LLM trained for a fraction of the cost of GPT-Xx models, in 2 months for 6 million, on limited GPUs due to export restrictions, and competing head to head. This is crazy.

It's not the AI part I'm excited about, it's the level of efficiency. github.com/deepseek-ai/...

1 year ago 269 37 10 10

engineering blogs and white papers from the heyday of pre-AI tech are a treasure trove of insights and decision making processes without the burden of validating AI slop.

don't discount them merely on their being of outdated.

1 year ago 3 0 1 0

plus if you do decide to pause the 9-5 lifestyle to become an AI consultant, you'll have a solid foundation and not have to start from scratch.

1 year ago 0 0 0 0

if you're a cs student aspiring to become a software engineer in 2025, make sure to hone adjacent skills like writing, marketing, & sales.

in the age of agents, establishing human connection & building an audience organically will make set you apart from the rest.

1 year ago 1 0 1 0

Gonna think more thoughts in 2025

1 year ago 86 10 8 2

thumbnail that says introducing smolagents

supercharge your LLM apps with smolagents 🔥

however cool your LLM is, without being agentic it can only go so far

enter smolagents: a new agent library by @hf.co to make the LLM write code, do analysis and automate boring stuff! huggingface.co/blog/smolage...

1 year ago 85 16 2 2

the intersection between hardcore biotech, precision therapeutics, & AI is an interesting one.

i'm particularly bullish about the work being done with organoid intelligence - mainly because of more energy efficient computing as well as possible contributions to neurodegenerative disease research

1 year ago 6 0 0 0

every PR is another step towards the dream of having a universal near zero latency jarvis+second brain ai mesh

seemed like a gargantuan task back when i discovered obsidian/roam in college but current tech & capabilities make it easier to bootstrap a decent representation of this.

1 year ago 2 0 1 0

smol.ai News and Hackathons for AI Engineers!

the smol.ai newsletter is truly a godsend (thanks @swyx.io)

with all of the model releases this week, neurips, and discord chats popping off, having a single place to start from really helps

highly recommend.

1 year ago 19 1 1 0

when you've been on the internet as long as i have, you would understand that everything can be used for anything and leaves a trail.

the difference today is a lot more security theatre & forced transparency

it's why i've always written stuff envisioning that they'd be immortalized by a super AGI

1 year ago 5 0 0 0

annoy an ML engineer with these simple phrases:

"cosine distance"
"L2 similarity"
"but did you ship it?"

1 year ago 56 3 5 1

My deep learning course at the University of Geneva is available on-line. 1000+ slides, ~20h of screen-casts. Full of examples in PyTorch.

fleuret.org/dlc/

And my "Little Book of Deep Learning" is available as a phone-formatted pdf (nearing 700k downloads!)

fleuret.org/lbdl/

1 year ago 1250 247 46 17

wanna become a prolific open source contributor?

just try using open source llm agent frameworks with rough asynchronous programming patterns in prod

1 year ago 1 0 1 0

@georgehotz.bsky.social is here! Bluesky is going to be so much fun.

1 year ago 6 1 0 0

deploying llm apps on modal labs at night is a much better experience than my daytime woes with terraform, cloud build, k8s, and docker

1 year ago 4 0 0 0

lol nah I think i'll stick to modal

1 year ago 1 0 0 0

a cartoon of superman flying through the air with a red cape . ALT: a cartoon of superman flying through the air with a red cape .

1 year ago 1 0 0 0

distributed systems man. why do we do this to ourselves

1 year ago 601 57 31 6

the good thing about this new generation of voice assistants is that everyone can have their own hunter s thompson like narrators for the most mundane parts of their daily routines

1 year ago 3 0 0 0

rohit's mind palace - a menagerie of tech, books, posters, holograms and chessboards

i finally asked chatgpt to generate an image given what it knew about me.

spot on

1 year ago 3 0 1 0

Marco-o1: Towards Open Reasoning Models for Open-Ended Solutions Currently OpenAI o1 has sparked a surge of interest in the study of large reasoning models (LRM). Building on this momentum, Marco-o1 not only focuses on disciplines with standard answers, such as mat...

Alibaba has their own version on GPT-o1. This might be the best description of “o1-type”systems so far arxiv.org/abs/2411.14405

1 year ago 267 39 8 2

Posts by rohit