#PaperILike hashtag - Bluesky

@tomssilver.bsky.social

1 day ago

INQUIRE: INteractive Querying for User-aware Informative REasoning Research on Interactive Robot Learning has yielded several modalities for querying a human for training data, including demonstrations, preferences, and corrections. While prior work in this space ...

This week's #PaperILike is "INQUIRE: INteractive Querying for User-aware Informative REasoning" (Fitzgerald et al., CoRL 2022).

A very nice way to unify different forms of information gathering and preference learning for human-robot assistance.

PDF: proceedings.mlr.press/v205/fitzger...

0 0 0 0

Tom Silver

@tomssilver.bsky.social

1 week ago

AssistanceZero: Scalably Solving Assistance Games Assistance games are a promising alternative to reinforcement learning from human feedback (RLHF) for training AI assistants. Assistance games resolve key drawbacks of RLHF, such as incentives for dec...

This week's #PaperILike is "AssistanceZero: Scalably Solving Assistance Games" (Laidlaw et al., ICML 2025).

AlphaZero-like combo of learning & planning for assistance games, where robot & human share reward fn that robot doesn't know. + Minecraft!

PDF: arxiv.org/abs/2504.07091

17 4 0 1

Tom Silver

@tomssilver.bsky.social

2 weeks ago

Cooperative Inverse Reinforcement Learning For an autonomous system to be helpful to humans and to pose no unwarranted risks, it needs to align its values with those of the humans in its environment in such a way that its actions contribute to...

This week's #PaperILike is "Cooperative Inverse Reinforcement Learning" (Hadfield-Menell et al., 2016).

Seminal work. My favorite part is the simple example showing that demonstration-by-expert is suboptimal.

PDF: arxiv.org/abs/1606.03137

22 4 0 0

Tom Silver

@tomssilver.bsky.social

3 weeks ago

RoboCook: Long-Horizon Elasto-Plastic Object Manipulation with Diverse Tools Humans excel in complex long-horizon soft body manipulation tasks via flexible tool use: bread baking requires a knife to slice the dough and a rolling pin to flatten it. Often regarded as a hallmark ...

This week's #PaperILike is "RoboCook: Long-Horizon Elasto-Plastic Object Manipulation with Diverse Tools" (Shi et al., CoRL 2023).

So much to like in one paper: planning, learning, deformable manipulation, GNNs, 15 3D-printed tools, and dumplings!

PDF: arxiv.org/abs/2306.14447

11 1 0 1

Tom Silver

@tomssilver.bsky.social

4 weeks ago

This week's #PaperILike is "Efficient memory-based learning for robot control" (Andrew Moore's dissertation, 1990).

This and Moore's follow-up work from the 90s are worth revisiting, especially now that VLAs are starting to remember!

PDF: www.cl.cam.ac.uk/techreports/...

5 1 0 0

Tom Silver

@tomssilver.bsky.social

1 month ago

Learning Montezuma's Revenge from a Single Demonstration We propose a new method for learning from a single demonstration to solve hard exploration tasks like the Atari game Montezuma's Revenge. Instead of imitating human demonstrations, as proposed in othe...

This week's #PaperILike is "Learning Montezuma’s Revenge from a Single Demonstration" (Salimans & Chen, 2018).

1 demo + known world model = very natural and still under-explored problem setting.

PDF: arxiv.org/abs/1812.03381

20 4 0 0

Tom Silver

@tomssilver.bsky.social

1 month ago

Partially Observable Task and Motion Planning with Uncertainty and Risk Awareness Integrated task and motion planning (TAMP) has proven to be a valuable approach to generalizable long-horizon robotic manipulation and navigation problems. However, the typical TAMP problem formulatio...

This week's #PaperILike is "Partially Observable Task and Motion Planning with Uncertainty and Risk Awareness" (Curtis et al., RSS 2024).

State of the art for TAMP + POMDPs. I learn more every time I read this paper.

PDF: arxiv.org/abs/2403.10454

5 0 0 0

Tom Silver

@tomssilver.bsky.social

1 month ago

This week's #PaperILike is "Empowerment: A Universal Agent-Centric Measure of Control" (Klyubin et al., 2005).

An important idea in RL, and a fun read -- mentions bacteria, chimpanzees, Newtonian mechanics, and Othello all within a few sentences.

PDF: uhra.herts.ac.uk/id/eprint/28...

12 1 1 0

Tom Silver

@tomssilver.bsky.social

1 month ago

This week's #PaperILike is "Integrating Planning and Learning: The PRODIGY Architecture" (Veloso et al., 1995).

A foundational project in the history of robot planning + learning, and a good place to look for old ideas that are worth resurfacing.

PDF: www.cs.cmu.edu/~jgc/publica...

0 0 0 0

Tom Silver

@tomssilver.bsky.social

2 months ago

Continuous Deep Q-Learning with Model-based Acceleration Model-free reinforcement learning has been successfully applied to a range of challenging problems, and has recently been extended to handle large neural network policies and value functions. However,...

This week's #PaperILike is "Continuous Deep Q-Learning with Model-based Acceleration" (Gu et al., 2016).

Got swept away by other deep RL, but I always liked the idea of parameterizing Q in a form where the optimal policy can be derived analytically.

PDF: arxiv.org/abs/1603.00748

1 0 0 0

Tom Silver

@tomssilver.bsky.social

2 months ago

virtualtools Many animals, and an increasing number of artificial agents, display sophisticated capabilities to perceive and manipulate objects. But human beings remain distinctive in their capacity for flexible, ...

This week's #PaperILike is "Rapid trial-and-error learning with simulation supports flexible tool use and physical reasoning" (Allen et al., PNAS 2020).

Their "Virtual Tools Game" is one I revisit often when brainstorming open challenges.

PDF & game: sites.google.com/view/virtual...

1 0 0 0

Tom Silver

@tomssilver.bsky.social

2 months ago

This week's #PaperILike is "Robot Task Planning Under Local Observability" (Merlin et al., 2024).

LOMDPs are a very natural middle ground between MDPs and POMDPs with enough structure for interesting planning and learning.

PDF: maxmerl.in/papers/lomdp...

1 0 0 0

Tom Silver

@tomssilver.bsky.social

2 months ago

Bayesian Residual Policy Optimization: Scalable Bayesian Reinforcement Learning with Clairvoyant Experts Informed and robust decision making in the face of uncertainty is critical for robots that perform physical tasks alongside people. We formulate this as Bayesian Reinforcement Learning over latent Mar...

This week's #PaperILike is "Bayesian Residual Policy Optimization" (Lee et al., 2020).

I like this POMDP approach because it reduces the problem to figuring out a good set of "clairvoyant experts".

PDF: arxiv.org/abs/2002.03042

5 0 0 0

Tom Silver

@tomssilver.bsky.social

3 months ago

Differentiable GPU-Parallelized Task and Motion Planning Planning long-horizon robot manipulation requires making discrete decisions about which objects to interact with and continuous decisions about how to interact with them. A robot planner must select g...

This week's #PaperILike is "Differentiable GPU-Parallelized Task and Motion Planning" (Shen et al., RSS 2025).

As always, meticulous work from @WillShenSaysHi and team. TAMP + GPU is long overdue!

PDF: arxiv.org/abs/2411.11833

3 0 0 0

Tom Silver

@tomssilver.bsky.social

3 months ago

Kinodynamic Task and Motion Planning using VLM-guided and Interleaved Sampling Task and Motion Planning (TAMP) integrates high-level task planning with low-level motion feasibility, but existing methods are costly in long-horizon problems due to excessive motion sampling. While ...

This week's #PaperILike is "Kinodynamic Task and Motion Planning using VLM-guided and Interleaved Sampling" (Kwon & Kim, 2025).

I particularly like using VLMs to guide backtracking in TAMP. Outperforms PDDLStream and LLM3.

PDF: arxiv.org/abs/2510.26139

1 1 0 0

Tom Silver

@tomssilver.bsky.social

3 months ago

Home Learning Exploration Strategies to Solve Real-World Marble Runs

This week's #PaperILike is "Learning Exploration Strategies to Solve Real-World Marble Runs" (Allaire & Atkeson, ICRA 2023).

A very fun and creative challenge for robot physical reasoning.

Video: sites.google.com/view/learnin...
PDF: arxiv.org/abs/2303.04928

0 0 0 0

Tom Silver

@tomssilver.bsky.social

3 months ago

Elephants Don't Pack Groceries: Robot Task Planning for Low Entropy Belief States Recent advances in computational perception have significantly improved the ability of autonomous robots to perform state estimation with low entropy. Such advances motivate a reconsideration of robot...

This week's #PaperILike is "Elephants Don't Pack Groceries: Robot Task Planning for Low Entropy Belief States" (Adu-Bredu, RAL 2022).

Love the focus on planning with "low entropy beliefs" -- not full-fledged POMDPs, but also not full observability.

PDF: arxiv.org/abs/2011.09105

18 4 0 0

Tom Silver

@tomssilver.bsky.social

3 months ago

This week's #PaperILike is "Sloppy Programming" (Little et al., 2010).

Vibe coding before it was cool.

PDF: dspace.mit.edu/bitstream/ha...

0 0 0 0

Tom Silver

@tomssilver.bsky.social

4 months ago

This week's #PaperILike is "Robot Programming" (Tomas Lozano-Perez, 1983).

A prescient paper that asks how we might generally program robots like we program computers. Much remains true 42 years later.

PDF: homes.cs.washington.edu/~ztatlock/59...

0 0 0 0

Tom Silver

@tomssilver.bsky.social

4 months ago

This week's #PaperILike is "Learning Proofs of Motion Planning Infeasibility" (Li & Dantam, RSS 2021).

I like using learning to "fail fast", with guarantees. Important for TAMP, where there are other MP problems to try next.

PDF: www.roboticsproceedings.org/rss17/p064.pdf

1 0 0 0

Tom Silver

@tomssilver.bsky.social

4 months ago

Interleaving Monte Carlo Tree Search and Self-Supervised Learning for Object Retrieval in Clutter In this study, working with the task of object retrieval in clutter, we have developed a robot learning framework in which Monte Carlo Tree Search (MCTS) is first applied to enable a Deep Neural Netwo...

This week's #PaperILike is "Interleaving Monte Carlo Tree Search and Self-Supervised Learning for Object Retrieval in Clutter" (Huang et al., ICRA 2022).

Impressive results on a difficult and subtle problem, with a nice combo of planning + learning.

PDF: arxiv.org/abs/2202.01426

2 0 0 0

Tom Silver

@tomssilver.bsky.social

4 months ago

Goal-Oriented End-User Programming of Robots End-user programming (EUP) tools must balance user control with the robot's ability to plan and act autonomously. Many existing task-oriented EUP tools enforce a specific level of control, e.g., by re...

This week's #PaperILike is "Goal-Oriented End-User Programming of Robots" (Porfirio et al., HRI 2024).

I like this use of planning to fill in the gaps between subgoals that are directly programmed by end users.

PDF: arxiv.org/abs/2403.13988

3 0 0 0

Tom Silver

@tomssilver.bsky.social

5 months ago

Lifelong Robot Library Learning: Bootstrapping Composable and Generalizable Skills for Embodied Control with Language Models Large Language Models (LLMs) have emerged as a new paradigm for embodied reasoning and control, most recently by generating robot policy code that utilizes a custom library of vision and control primi...

This week's #PaperILike is "Lifelong Robot Library Learning: Bootstrapping Composable and Generalizable Skills for Embodied Control with Language Models" (Tziafas & Kasaei, ICRA 2024).

DreamCoder-like robot skill learning. Refactoring helps!

PDF: arxiv.org/abs/2406.18746

3 1 0 0

Tom Silver

@tomssilver.bsky.social

5 months ago

Monte Carlo Tree Search with Spectral Expansion for Planning with Dynamical Systems The ability of a robot to plan complex behaviors with real-time computation, rather than adhering to predesigned or offline-learned routines, alleviates the need for specialized algorithms or training...

This week's #PaperILike is "Monte Carlo Tree Search with Spectral Expansion for Planning with Dynamical Systems" (Riviere et al., Science Robotics 2024).

A creative synthesis of control theory and search. I like using the Gramian to branch.

PDF: arxiv.org/abs/2412.11270

3 0 0 0

Tom Silver

@tomssilver.bsky.social

5 months ago

Reality Promises

This week's #PaperILike is "Reality Promises: Virtual-Physical Decoupling Illusions in Mixed Reality via Invisible Mobile Robots" (Kari & Abtahi, UIST 2025).

This is some Tony Stark level stuff! XR + robots = future.

Website: mkari.de/reality-prom...
PDF: mkari.de/reality-prom...

2 0 0 0

Tom Silver

@tomssilver.bsky.social

5 months ago

Learning to guide task and motion planning using score-space representation In this paper, we propose a learning algorithm that speeds up the search in task and motion planning problems. Our algorithm proposes solutions to three different challenges that arise in learning to ...

This week's #PaperILike is "Learning to Guide Task and Motion Planning Using Score-Space Representation" (Kim et al., IJRR 2019).

This is one of those papers that I return to over the years and appreciate more every time. Chock full of ideas.

PDF: arxiv.org/abs/1807.09962

1 0 0 0

Tom Silver

@tomssilver.bsky.social

6 months ago

On the Utility of Koopman Operator Theory in Learning Dexterous Manipulation Skills Despite impressive dexterous manipulation capabilities enabled by learning-based approaches, we are yet to witness widespread adoption beyond well-resourced laboratories. This is likely due to practic...

This week's #PaperILike is "On the Utility of Koopman Operator Theory in Learning Dexterous Manipulation Skills" (Han et al., CoRL 2023).

This and others have convinced me that I need to learn Koopman! Another perspective on abstraction learning.

PDF: arxiv.org/abs/2303.13446

2 0 0 0

Tom Silver

@tomssilver.bsky.social

6 months ago

This week's #PaperILike is "Predictive Representations of State" (Littman et al., 2001).

A lesser known classic that is overdue for a revival. Fans of POMDPs will enjoy.

PDF: web.eecs.umich.edu/~baveja/Pape...

3 0 1 0

Tom Silver

@tomssilver.bsky.social

6 months ago

Influence-Augmented Local Simulators: A Scalable Solution for Fast Deep RL in Large Networked Systems Learning effective policies for real-world problems is still an open challenge for the field of reinforcement learning (RL). The main limitation being the amount of data needed and the pace at which t...

This week's #PaperILike is "Influence-Augmented Local Simulators: A Scalable Solution for Fast Deep RL in Large Networked Systems" (Suau et al., ICML 2022).

Nice work on using fast local simulators to plan & learn in large partially observed worlds.

PDF: arxiv.org/abs/2202.01534

3 0 0 0

Tom Silver

@tomssilver.bsky.social

6 months ago

Optimal Interactive Learning on the Job via Facility Location Planning Collaborative robots must continually adapt to novel tasks and user preferences without overburdening the user. While prior interactive robot learning methods aim to reduce human effort, they are typi...

This week's #PaperILike is "Optimal Interactive Learning on the Job via Facility Location Planning" (Vats et al., RSS 2025).

I always enjoy a surprising connection between one problem (COIL) and another (UFL). And I always like work by Shivam Vats!

PDF: arxiv.org/abs/2505.00490

0 0 0 0