
Posts by Pranav

Gemini has been surprisingly good at resolving bugs. Claude in Cursor has been a lifesaver. I used DeepSeek for DB design and was blown away by its reasoning. Overall, a great time to be alive!

1 year ago 0 0 1 0

Looks like it analyzes the image to click on screen coordinates, and doesn't care about the DOM.

1 year ago 0 0 0 0
Qwen2.5 VL! Qwen2.5 VL! Qwen2.5 VL! We release Qwen2.5-VL, the new flagship vision-language model of Qwen and also a significant leap from the previous Qwen2-VL. To try the latest model, ...

Has anyone tested the new Qwen2.5-VL model's computer/mobile agent capabilities? It just launched, and they are calling it a "Superior Computer and Mobile Agent" 🤔
qwenlm.github.io/blog/qwen2.5...

1 year ago 0 0 1 0

Is the DeepSeek API down already?

1 year ago 0 0 0 0
GitHub - browser-use/browser-use: Make websites accessible for AI agents. Contribute to browser-use/browser-use development by creating an account on GitHub.

Try it out 👉 github.com/browser-use/...

Got ideas for what I should automate next? Drop them in the comments.

1 year ago 0 0 0 0

OpenAI's Operator is cool, but imagine this:
An open-source version that runs locally on your machine, logged into your Google account with all your saved passwords. No cloud. Full privacy.

💡 Check out the demo below 👇 (4x speed)

1 year ago 1 0 1 0

/pranav

Cheers!

1 year ago 1 0 0 0

After several sleepless nights of building AI side projects, I'm finally taking the leap. In 7 days, I leave my corporate job to start my indie hacker journey. There's never been a better time to build, and I'm going ALL IN! 🚀 #BuildInPublic

1 year ago 11 0 8 0