Advertisement · 728 × 90

Posts by vik / λh.(h h)

Post image

Examples of toxic prompts we removed

4 months ago 12 0 0 1
Post image

Announcing RefCOCO-M, a refreshed RefCOCO with pixel-accurate masks and the problematic prompts removed. Better data for better evaluation.

huggingface.co/datasets/moo...

4 months ago 12 2 1 0
go to moondream.ai/c/playground for description

go to moondream.ai/c/playground for description

lone pip user among uv sheeple

6 months ago 5 0 0 0

matz is nice and so we kick out people starting ridiculous witch hunts

6 months ago 1 0 0 0

you sound like a markov chain. just parroting random things without any underlying comprehension

6 months ago 5 0 1 0

wrong. that's exactly how human cognition works

6 months ago 2 0 2 0

i bet she can't even write down the flash attention 2 backward pass without looking it up

6 months ago 8 0 0 0

wrong yet highly confident is not a good look

6 months ago 8 0 0 0
Advertisement
Boars detected!

Boars detected!

Are there any boars?

Are there any boars?

The Moondream VLM (vision language model) is amazing. It's small (2B parameters) and yet accurately detects boars in a dark infrared camera image. Asking for both boar and pig increases accuracy though. #sanglicam

6 months ago 14 1 0 0

One night the pupil came in tears to Suiwo. "I must return south in shame and embarrassment," he said, "for I cannot solve my problem."

Suiwo said: "Meditate for three days longer, then if you fail to attain enlightenment, you had better kill yourself."

On the second day the pupil was enlightened.

6 months ago 3 0 0 0

😭

6 months ago 1 0 0 0
Post image
8 months ago 3 0 0 0
Post image
1 year ago 5 0 0 0
Post image

new moondream, new me

huggingface.co/vikhyatk/moo...

1 year ago 16 3 1 0
Video
1 year ago 41 2 0 1
Video

Gaze detection will be in the upcoming moondream release!

Live demo: huggingface.co/spaces/moond...
Blog post: moondream.ai/blog/announc...

1 year ago 7 2 1 0
Advertisement

yeah… in latest torch on h100s it’s basically the same speed

1 year ago 5 0 2 0

bsky.app/profile/jay....

1 year ago 1 0 1 0

i heard they have prebuilt wheels but you have to go to the github repo to find them... just stopped using flash attention instead

1 year ago 5 0 1 0

how much do we actually love the open web?

1 year ago 32 0 3 0

I noticed that my posts are not in this dataset, may I ask why? Have I offended you somehow?

1 year ago 74 2 1 0
Video

Moondream playground is back, running the latest upcoming version of the model, and now with prompt suggestions! Look at how blazingly fast it is running on an L40S...

Try it out here: moondream.ai/playground

1 year ago 22 2 0 0

💀

1 year ago 1 0 0 0
Post image
1 year ago 21 0 1 0
Advertisement

while we're ruining your weekend here's another one i think you'll like 😂 arxiv.org/pdf/2410.06205

1 year ago 5 0 1 0
Preview
When Precision Meets Position: BFloat16 Breaks Down RoPE in Long-Context Training Extending context window sizes allows large language models (LLMs) to process longer sequences and handle more complex tasks. Rotary Positional Embedding (RoPE) has become the de facto standard due to...

my post was prompted by this paper btw i just realized i didn't share context arxiv.org/abs/2411.13476

1 year ago 6 0 2 0

is the amazing length extrapolation it enables really worth all of the suffering entailed in debugging when it breaks? 🥲

1 year ago 4 0 1 0

when are we going to admit RoPE was a mistake?

1 year ago 7 0 1 0
Post image

working on screen understanding

1 year ago 15 0 2 0

talking to customers is overrated

1 year ago 8 1 0 0