
Posts by Lerrel Pinto

Video

How is AI helping robots to generalise their skills to unfamiliar environments? 🤖 🏠

In the latest episode, I chatted to Prof. Lerrel Pinto (@lerrelpinto.com) from New York University about #robot learning and decision making.

Available wherever you get your podcasts: linktr.ee/robottalkpod

10 months ago 2 1 0 0

This project, which combines hardware design with learning-based controllers, was a monumental effort led by @anyazorin.bsky.social and Irmak Guzey. More links and information about RUKA are below:

Website: ruka-hand.github.io
Assembly Instructions: ruka.gitbook.io/instructions

11 months ago 1 0 0 0
Video

We just released RUKA, a $1300 humanoid hand that is 3D-printable, strong, precise, and fully open-sourced!

The key technical breakthrough here is that we can control joints and fingertips of the robot **without joint encoders**. All we need here is self-supervised data collection and learning.

11 months ago 29 7 1 0

This would be funny! 😂

1 year ago 0 0 0 0
Video

When life gives you lemons, you pick them up.

(trained with robotutilitymodels.com)

1 year ago 15 4 1 0
A photo of Lerrel looking happy.

What would you love to know about #robot learning and decision making?

Later this season, I'll be chatting to Prof. Lerrel Pinto (@lerrelpinto.com) from NYU about using machine learning to train robots to adapt to new environments.

Send me your questions for Lerrel: robottalk.org/ask-a-question/

1 year ago 13 7 0 1

Is there a word for the feeling when you want to cheer for the other team?

1 year ago 5 0 1 0
Point Policy: Unifying Observations and Actions with Key Points for Robot Manipulation

This project was an almost solo effort from @haldarsiddhant.bsky.social. And as always, this project is fully open-sourced.

Project page: point-policy.github.io
Paper: arxiv.org/abs/2502.20391

1 year ago 3 0 0 0

The overall algorithm is simple:
1. Extract key points from human videos.
2. Train a transformer policy to predict future robot key points.
3. Convert predicted key points to robot actions.

1 year ago 2 0 1 0
Video

Point Policy uses sparse key points to represent both human demonstrators and robots, bridging the morphology gap. The scene is hence encoded through semantically meaningful key points from minimal human annotations.

1 year ago 0 0 1 0
Video

The robot behaviors shown below are trained without any teleop, sim2real, genai, or motion planning. Simply show the robot a few examples of doing the task yourself, and our new method, called Point Policy, spits out a robot-compatible policy!

1 year ago 21 5 1 1
Post image

This is important because the humble iPhone is one of the best accessories for embodied AI out there, if not actually the best. It's got a depth sensor, good camera, built-in internet, decent compute, and -- uniquely -- it has really good SLAM already built in.

1 year ago 15 4 3 0

It should be accessible in EU now!

1 year ago 1 0 1 0
AnySense is an open-source iPhone app that enables multi-sensory data collection by integrating the iPhone’s sensory suite with external sensors via Bluetooth and wired interfaces, enabling both offl...

AnySense is built to empower researchers with better tools for robotics. Try it out below.

Download on App store: apps.apple.com/us/app/anyse...
Open-source code on GitHub: github.com/NYU-robot-le...
Website: anysense.app

AnySense is led by @raunaqb.bsky.social with several collaborators from NYU.

1 year ago 3 0 0 0

The 'wild' robot data collected by AnySense can then be used to train multimodal policies! In the video above, we use the Robot Utility Models framework to train visuo-tactile policies for a whiteboard erasing task. You can use it for so much more though!

1 year ago 3 0 1 0
Video

We just released AnySense, an iPhone app for effortless data acquisition and streaming for robotics. We leverage Apple’s development frameworks to record and stream:

1. RGBD + Pose data
2. Audio from the mic or custom contact microphones
3. Seamless Bluetooth integration for external sensors

1 year ago 34 10 2 0

A useful “productivity” trick is to remind yourself that research should be fun and inspiring, and if it’s not, then something should change.

1 year ago 72 7 2 1
Post image

Just found a new winner for the most hype-baiting, unscientific plot I have seen. (From the recent Figure AI release)

1 year ago 37 6 1 1

One reason to be intolerant of misleading hype in tech and science is that tolerating the small lies and deception is how you get tolerance of big lies

1 year ago 185 27 4 0

Thanks Tucker! The timing of this is great given the uncertainty with other funding mechanisms.

1 year ago 0 0 0 0

Thank you to @sloanfoundation.bsky.social for this generous award to our lab. Hopefully this will bring us closer to building truly general-purpose robots!

1 year ago 22 4 3 0

Yes, this is one of our inspirations!

1 year ago 1 0 1 0

A fun, clever idea from @upiter.bsky.social : treat code generation as a sequential editing problem -- this gives you loads of training data from synthetically editing existing code

And it works! Higher performance on HumanEval, MBPP, and CodeContests across small LMs like Gemma-2, Phi-3, Llama 3.1

1 year ago 5 0 1 0

Thanks Eugene! Sounds exciting!

1 year ago 1 0 0 0

Hi Eugene, this sounds cool! Could you comment a bit on how well simulated driving agents translate to real world driving?

1 year ago 7 0 1 0

We have been working a bunch on offline world models. Pre-trained features from DINOv2 seem really powerful for modeling. I hope this opens up a whole set of applications for decision making and robotics!

Check out the thread from @gaoyuezhou.bsky.social for more details.

1 year ago 4 0 0 0

nah they are friendly cat food by folks around NYU AD.

1 year ago 0 0 0 0

Your robot looks cool!

1 year ago 0 0 1 0

If you’re in grad school, finding a therapist can be really helpful. The thing you’re doing is hard and it’s harder if you don’t have help managing imposter syndrome, stress, self esteem, and a whole bunch of other things.

1 year ago 65 13 5 3

omg a student somehow accidentally wrote an email addressed to a faculty-wide NYU listserv and my inbox is now a master class on who understands the difference between a listserv and an email chain

1 year ago 5406 933 203 839