
Helping robots learn skills independently to adapt to unfamiliar environments

The phrase “practice makes perfect” is usually reserved for humans, but it’s also a fitting maxim for robots newly deployed in unfamiliar environments.

Imagine a robot arriving at a warehouse, bringing with it the abilities it has already learned, such as placing an object, and now having to pick items from a shelf it doesn’t recognize. At first, the machine struggles because it must familiarize itself with its new environment. To improve, the robot needs to figure out which skills within an overall task it should work on, and then specialize (or parameterize) that action.

A human on site could program the robot to optimize its performance, but researchers at MIT's Computer Science and Artificial Intelligence Laboratory (CSAIL) and the AI Institute have developed a more practical alternative. Their Estimate, Extrapolate, and Situate (EES) algorithm, presented last month at the Robotics: Science and Systems Conference, enables these machines to practice on their own, potentially helping them better perform useful tasks in factories, homes, and hospitals.

Assess the situation

To help robots get better at tasks like sweeping floors, EES works with a vision system that locates and tracks the machine's surroundings. The algorithm estimates how reliably the robot performs an action (like sweeping) and whether that action would be worth practicing more. EES predicts how well the robot could perform the overall task if it refined that particular skill, and then has the robot practice it. After each attempt, the vision system checks whether the skill was performed correctly.
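The article does not give implementation details, but the estimate-extrapolate-situate loop it describes can be illustrated with a minimal sketch. Everything below is an assumption for illustration: the skill names, the Laplace-smoothed competence estimate, and the optimistic extrapolation rule are hypothetical stand-ins, not the authors' actual method.

```python
class Skill:
    """A parameterized skill with a record of observed practice outcomes."""

    def __init__(self, name, successes=0, attempts=0):
        self.name = name
        self.successes = successes
        self.attempts = attempts

    def competence(self):
        # ESTIMATE: Laplace-smoothed success rate from observed attempts,
        # as reported by the vision system after each try.
        return (self.successes + 1) / (self.attempts + 2)


def extrapolated_gain(skill, practice_budget=10):
    # EXTRAPOLATE: optimistically assume the next `practice_budget`
    # attempts all succeed, and measure how much competence would rise.
    optimistic = (skill.successes + practice_budget + 1) / (
        skill.attempts + practice_budget + 2
    )
    return optimistic - skill.competence()


def choose_skill_to_practice(skills):
    # SITUATE: in the current environment, practice the skill whose
    # improvement promises the largest gain for the overall task.
    return max(skills, key=extrapolated_gain)


skills = [
    Skill("place-object", successes=18, attempts=20),  # already reliable
    Skill("sweep", successes=1, attempts=5),           # unreliable, high upside
]
best = choose_skill_to_practice(skills)
print(best.name)  # → sweep
```

The unreliable skill wins because a reliable one has little headroom: a few more successes barely move its estimated competence, so practice time is better spent elsewhere.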

EES could be useful in hospitals, factories, private homes, or cafes. For example, if you wanted a robot to clean your living room, it would need help practicing skills like sweeping. But according to Nishanth Kumar SM '24 and his colleagues, EES could help that robot improve without human intervention, using just a few practice attempts.

“When we started this project, we wondered whether this specialization would be possible with a reasonable number of samples on a real robot,” says Kumar, co-lead author of a paper describing the work, a PhD student in electrical engineering and computer science, and a CSAIL affiliate. “Now we have an algorithm that enables robots to meaningfully improve specific skills in a reasonable amount of time with tens or hundreds of data points, an upgrade from the thousands or millions of samples that a standard reinforcement learning algorithm requires.”

See Spot Sweep

EES's knack for efficient learning was evident when it was implemented on Boston Dynamics' quadruped Spot during research trials at the AI Institute. The robot, which has an arm attached to its back, completed manipulation tasks after a few hours of practice. In one demonstration, the robot learned how to securely place a ball and ring on a slanted table in roughly three hours. In another, the algorithm guided the machine to improve at sweeping toys into a bin within about two hours. Both results appear to be an improvement over previous frameworks, which would likely have taken more than 10 hours per task.

“We wanted the robot to gain its own experience so it could better decide which strategies will work well in its deployment,” says co-lead author Tom Silver SM '20, PhD '24, an electrical engineering and computer science (EECS) graduate and CSAIL member who is now an assistant professor at Princeton University. “By reasoning about what the robot knows, we wanted to answer a key question: Which of the skills the robot has would be most useful to practice right now?”

EES could eventually help streamline robots' autonomous practice in new deployment environments, but for now it has some limitations. For example, the researchers used tables that were low to the ground, which made it easier for the robot to see its objects. Kumar and Silver also 3D printed an attachable handle that helped Spot grab the brush. The robot failed to detect some items and identified objects in the wrong places, so the researchers counted those errors as failures.

Giving robots homework

The researchers note that the practice speeds from the physical experiments could be accelerated further with a simulator. Instead of physically training each skill autonomously, the robot could eventually combine real and virtual practice. They also hope to make their system faster, with less latency, by engineering EES to overcome the imaging delays the researchers experienced. In the future, they may investigate an algorithm that reasons over sequences of practice attempts instead of planning which skills to refine.

“Enabling robots to learn on their own is both incredibly useful and extremely challenging,” says Danfei Xu, an assistant professor in the School of Interactive Computing at Georgia Tech and a research scientist at NVIDIA AI, who was not involved in this work. “In the future, home robots will be sold to all sorts of households and expected to perform a wide range of tasks. We can't possibly program everything they need to know in advance, so it's essential that they can learn on the job. However, letting robots loose to explore and learn without guidance can be very slow and might lead to unintended consequences. The research by Silver and his colleagues introduces an algorithm that allows robots to practice their skills autonomously in a structured way. This is a big step toward creating home robots that can continuously evolve and improve on their own.”

Silver and Kumar's co-authors are AI Institute researchers Stephen Proulx and Jennifer Barry, plus four CSAIL members: Northeastern University PhD student and visiting researcher Linfeng Zhao, MIT EECS PhD student Willie McClinton, and MIT EECS professors Leslie Pack Kaelbling and Tomás Lozano-Pérez. Their work was supported, in part, by the AI Institute, the U.S. National Science Foundation, the U.S. Air Force Office of Scientific Research, the U.S. Office of Naval Research, the U.S. Army Research Office, and MIT Quest for Intelligence, with high-performance computing resources from the MIT SuperCloud and Lincoln Laboratory Supercomputing Center.
