Advertisement · 728 × 90

Posts by Apratim Bhattacharyya

Video

🚨🚨🚨Take part in the AI Coach: Fitness challenge and the Low Power Computer Vision Challenge @ CVPR 2026

🎯Both challenges use the Qualcomm Exercise Video Dataset (QEVD) dataset.

👉Quick start guides and sample solutions: apratimbh.github.io/whatandwhen/

@cvprconference.bsky.social

1 month ago 0 1 0 0
FY26 Intern - Deep Learning Research Internship - Embodied AI - Canada (4 months) Company: Qualcomm Canada ULC Job Area: Interns Group, Interns Group > Interim Engineering Intern - SW Qualcomm Overview: Qualcomm is a company of inventors that unlocked 5G ushering in an age of ra...

📣📣📣Our team at Qualcomm AI Research is hiring Research Interns for Summer 2026 in Toronto to work on multi-modal LLMs and embodied AI.

👉Apply here:
1) Embodied AI:
qualcomm.wd12.myworkdayjobs.com/External/job...

2) Multi-modal LLMs:

6 months ago 0 0 0 0
Post image

🚨Submit by 1st May @cvprconference.bsky.social: extended abstracts on streaming vision-language models, real-time activity understanding, grounding, ego-centric video understanding, language and robot learning. Contributions are encouraged to include a demo!

👉Details: varworkshop.github.io/calls/

11 months ago 1 1 0 0

🚨🚨🚨 We are now accepting submissions!

1 year ago 1 0 0 0
Post image

Call for Participation @cvprconference.bsky.social: Multi-Modal LLMs - prepare to engage in a dynamic, face-to-face conversation with a real human user!

Details: varworkshop.github.io/challenges/

🚨🚨🚨The winning teams will receive a prize and a contributed talk.

P.S. GPT-4o does not do too well.

1 year ago 3 3 0 2
Post image

Call for Participation: We're excited to announce a challenge focused on developing AI assistants that can guide users through workout sessions with intelligent feedback!

🚨The winning teams will receive a prize along with a contributed talk. 🚨

Website: varworkshop.github.io/challenges/

1 year ago 4 2 0 0

🚨Submission are now open!

1 year ago 0 0 0 0
Post image

Call for Papers and Demos @cvprconference.bsky.social: on topics such as streaming vision-language models, real-time activity understanding, grounding, ego-centric video understanding, language and robot learning. Contributions are encouraged to include a demo!

Link: varworkshop.github.io/calls/

1 year ago 8 6 0 2
Advertisement

Join us at the @cvprconference.bsky.social Workshop on Vision-based Assistants in the Real-world (VAR) and tackle one of AI's biggest challenges: building systems that can comprehend and reason about dynamic, real-world scenes.

Workshop Page: varworkshop.github.io

1 year ago 4 1 0 1

By popular demand, we are extending #CVPR2025 coverage to Bluesky. Stay tuned!

1 year ago 124 17 5 2

Accepted to CVPR2025 🥳🥳

#CVPR2025

1 year ago 1 0 0 0
Post image

🚨We present in "Enhancing Hallucination Detection through Noise Injection" [https://arxiv.org/pdf/2502.03799] an efficient approach to detect hallucinations in LLMs, within a Bayesian framework.

TL; DR - We use noise injection to capture both epistemic and aleatoric uncertainty!

1 year ago 1 1 0 0
Post image

Join us at the CVPR 2025 Workshop on Vision-based Assistants in the Real-world (VAR) and tackle one of AI's biggest challenges: building systems that can comprehend and reason about dynamic, real-world scenes.

Workshop Page: varworkshop.github.io

1 year ago 9 1 0 0
Post image

🚨Check out our new work on distilling reasoning skills from LLMs into efficient driving policies, to deal with critical "long-tail" scenarios.

arXiv: arxiv.org/abs/2501.09757

1 year ago 1 0 0 1
Preview
ClevrSkills AI Dataset

🚨 The code for our NeurIPS 2024 (D&B track) paper: ClevrSkills: Compositional Language And Visual Understanding in Robotics (arxiv.org/abs/2411.09052), is now available.

GitHub Repo: github.com/Qualcomm-AI-...
Dataset Page: www.qualcomm.com/developer/so...

1 year ago 0 0 0 0
Advertisement