IndustReal: Transferring Contact-Rich Assembly Tasks from Simulation to Reality – RSS 2023
Tags: ,

Algorithms, systems, and tools that solve assembly tasks in simulation with reinforcement learning (RL) and successfully achieve policy transfer to a real-world robotic assembly system.


A Study on Multirobot Quantile Estimation in Natural Environments
Tags: ,

Authors: Isabel M. Rayas Fernández, Christopher E. Denniston, Gaurav S. Sukhatme

Abstract— Quantiles of a natural phenomena can provide scientists with an important understanding of different spreads of concentrations. When there are several available robots, it may be advantageous to pool resources in a collaborative way to improve performance. A multirobot team can be difficult
to practically bring together and coordinate. To this end, we present a study across several axes of the impact of using multiple robots to estimate quantiles of a distribution of interest using an informative path planning formulation. We measure quantile estimation accuracy with increasing team
size to understand what benefits result from a multirobot approach in a drone exploration task of analyzing the algae concentration in lakes. We additionally perform an analysis on several parameters, including the spread of robot initial positions, the planning budget, and inter-robot communication, and find that while using more robots generally results in
lower estimation error, this benefit is achieved under certain conditions. We present our findings in the context of real field robotic applications and discuss the implications of the results and interesting directions for future work.


Reducing Network Load via Message Utility Estimation for Decentralized Multirobot Teams
Tags: ,

Authors: Isabel M. Rayas Fernández, Christopher E. Denniston, Gaurav S. Sukhatme

Abstract— We are motivated by quantile estimation of algae concentration in lakes and how decentralized multirobot teams can effectively tackle this problem. We find that multirobot teams improve performance in this task over single robots, and communication-enabled teams further over communication-deprived teams; however, real robots are resource-constrained, and communication networks cannot support arbitrary message loads, making naive, constant information-sharing but also complex modeling and decision-making infeasible. With this in mind, we propose online, locally computable metrics for determining the utility of transmitting a given message to the other team members and a decision-theoretic approach that chooses to transmit only the most useful messages, using a decentralized and independent framework for maintaining beliefs of other teammates. We validate our approach in simulation on a real-world aquatic dataset, and we show that restricting communication via a utility estimation method based on the expected impact of a message on future teammate behavior results in a 42% decrease in network load while simultaneously decreasing quantile estimation error by 1.84%.


Square grid trajectory flight on a Crazyflie2.1. with a 1 hidden layer 4 neuron neural controller
Tags: ,

A tiny (single hidden layer 4 neuron) neural network, estimated by HyperPPO, driving a quadcopter to track a square grid trajectory.


Bezier Curve trajectory flight on a Crazyflie2.1. with a 1 hidden layer 4 neuron neural controller
Tags: ,

A tiny (single hidden layer 4 neuron) neural network, estimated by HyperPPO, driving a quadcopter to track a 3D random Bezier curve.


MRNAV Long Term Execution with 8 Quadrotors in a Cluttered Environment
Tags:

MRNAV Experiment Recordings
Tags:

DREAM: Decentralized Real-time Asynchronous Trajectory Planning for Collision-free Navigation
Tags: , ,

Decentralized Real-time Asynchronous Probabilistic Trajectory Planning for Collision-free Multi-Robot Navigation in Cluttered Environments.


Related Publications

Senbaslar, Baskin; Sukhatme, Gaurav S.: Asynchronous Real-time Decentralized Multi-Robot Trajectory Planning. IEEE/RSJ International Conference on Intelligent Robots and Systems, IROS 2022, Kyoto, Japan, October 23-27, 2022, pp. 9972-9979, IEEE, 2022
Efficiently Learning Small Policies for Locomotion and Manipulation
Tags: ,

Leveraging graph hyper networks to learn graph hyper policies trained with off-policy reinforcement learning.


Deformable Manipulation from Demonstrations (DMfD)
Tags: ,

A novel Learning from Demonstration (LfD) method, Deformable Manipulation from Demonstrations (DMfD), to solve deformable manipulation tasks using states or images as inputs, given expert demonstrations.


Efficient Multi-Task Learning via Iterated Single-Task Transfer
Tags: ,

Multi-task Reinforcement Learning (MTRL) for robots to acquire new skills.


Selective Object Rearrangement in Clutter – CoRL 2022
Tags: ,

An image-based, learned method for selective tabletop object rearrangement in clutter using a parallel jaw gripper.


IROS 2022 Video Presentation IKFlow Generating Diverse Inverse Kinematics Solutions

Generating Diverse Inverse Kinematics Solutions


Informative Path Planning to Estimate Quantiles for Environmental Analysis (IROS 2022)
Tags: ,

Choosing locations for scientific analysis by using a robot to perform an informative path planning survey.


Decentralized Control of Quadrotor Swarms with End to end Deep Reinforcement Learning (Physical)
Tags: ,

Decentralized Control of Quadrotor Swarms with End to end Deep Reinforcement Learning (Simulation)
Tags: ,

Adaptive Sampling using POMDPs with Domain-Specific Considerations – ICRA 2021
Tags: , ,

We investigate improving Monte Carlo Tree Search based solvers for Partially Observable Markov Decision Processes (POMDPs), when applied to adaptive sampling problems. We propose improvements in rollout allocation, the action exploration algorithm, and plan commitment. The first allocates a different number of rollouts depending on how many actions the agent has taken in an episode. We find that rollouts are more valuable after some initial information is gained about the environment. Thus, a linear increase in the number of rollouts, i.e. allocating a fixed number at each step, is not appropriate for adaptive sampling tasks. The second alters which actions the agent chooses to explore when building the planning tree. We find that by using knowledge of the number of rollouts allocated, the agent can more effectively choose actions to explore. The third improvement is in determining how many actions the agent should take from one plan. Typically, an agent will plan to take the first action from the planning tree and then call the planner again from the new state. Using statistical techniques, we show that it is possible to greatly reduce the number of rollouts by increasing the number of actions taken from a single planning tree without affecting the agent’s final reward. Finally, we demonstrate experimentally, on simulated and real aquatic data from an underwater robot, that these improvements can be combined, leading to better adaptive sampling. Accepted at ICRA 2021 The first two authors had an equal contribution


Sampling-Based Motion Planning on Manifold Sequences
Tags: , ,

Sampling-Based Motion Planning on Sequenced Manifolds (PSM*)
Tags: , ,

Learning Equality Constraints for Motion Planning on Manifolds
Tags: ,

Constrained robot motion planning is a widely used technique to solve complex robot tasks. We consider the problem of learning representations of constraints from demonstrations with a deep neural network, which we call Equality Constraint Manifold Neural Network (ECoMaNN). The idea behind it is to learn a level-set function of the constraint by aligning subspaces in the network with sub-spaces of the data such that it can be integrated into a constrained sampling-based motion planner. We combine both learned constraints and analytically described constraints into the planner and use a projection-based strategy to find valid points. We evaluate ECoMaNN on its representation capabilities of constraint manifolds, the impact of its individual loss terms, and the motions produced with it.


Related Publications

Sutanto, Giovanni; Rayas Fernández, Isabel M.; Englert, Peter; Ramachandran, Ragesh K.; Sukhatme, Gaurav S.: Learning Equality Constraints for Motion Planning on Manifolds. CoRR, vol. abs/2009.11852, 2020
[PDF] [Video]
Sim-to-(Multi)-Real: Transfer of Low-Level Robust Control Policies to Multiple Quadrotors
Tags: , ,

Quadrotor stabilizing controllers often require careful, model-specific tuning for safe operation. We use reinforcement learning to train policies in a simulation that transfer remarkably well to multiple different physical quadrotors. Our policies are low-level, i.e., we map the rotorcrafts’ state directly to the motor outputs. The trained control policies are very robust to external disturbances and can withstand harsh initial conditions such as throws. To the best of our knowledge, this is the first work that demonstrates that a simple neural network can learn a robust stabilizing low-level quadrotor controller without the use of a stabilizing PD controller; as well as the first work that analyses the transfer capability of a single policy to multiple quadrotors.


Downwash-Aware Trajectory Planning for Large Quadcopter Teams
Tags: , ,

This work is in collaboration with the USC ACT Lab.


Related Publications

Preiss, James A.; Hünig, Wolfgang; Ayanian, Nora; Sukhatme, Gaurav S.: Downwash-aware trajectory planning for large quadrotor teams. 2017 IEEE/RSJ International Conference on Intelligent Robots and Systems, IROS 2017, Vancouver, BC, Canada, September 24-28, 2017, pp. 250-257, IEEE, 2017
Cooperative Control for Target Tracking with Onboard Sensing – 2 drones in a string topology
Tags: ,

We consider the cooperative control of a team of robots to estimate the position of a moving target using onboard sensing. In particular, we do not assume that the robot positions are known, but also estimate their positions using relative onboard sensing. Our probabilistic localization and control method takes into account the motion and sensing capabilities of the individual robots to minimize the expected future uncertainty of the target position. It reasons about multiple possible sensing topologies and incorporates an efficient topology switching technique to generate locally optimal controls in polynomial time complexity. Simulations show the performance of our approach and prove its flexibility to find suitable sensing topologies depending on the limited sensing capabilities of the robots and the movements of the target. Furthermore, we demonstrate the applicability of our method in various experiments with single and multiple quadrotor robots tracking a ground vehicle in an indoor environment.


Dancebot 2012

USC’s Robotic Embedded Systems Lab and the Interaction Lab jointly present the sensational new DanceBot 2012.


Using Manipulation Primitives for Brick Sorting in Clutter
Tags:

Using Manipulation Primitives for Brick Sorting in Clutter.


Related Publications

Gupta, Megha; Sukhatme, Gaurav S.: Using manipulation primitives for brick sorting in clutter. IEEE International Conference on Robotics and Automation, ICRA 2012, 14-18 May, 2012, St. Paul, Minnesota, USA, pp. 3883-3889, IEEE, 2012
Motivation behind Minimum Risk Planning
Tags:

Toward Risk Aware Mission Planning for Autonomous Underwater Vehicles.


Related Publications

Pereira, Arvind; Binney, Jonathan; Hollinger, Geoffrey A.; Sukhatme, Gaurav S.: Risk-aware Path Planning for Autonomous Underwater Vehicles using Predictive Ocean Models. J. Field Robotics, vol. 30, no. 5, pp. 741-762, 2013
ICRA 2011 – Cooperative Control of Autonomous Surface Vehicles for Oil Skimming and Cleanup
Tags: ,

Cooperative Control of Autonomous Surface Vehicles for Oil Skimming and Cleanup.


Related Publications

Bhattacharya, Subhrajit; Heidarsson, Hordur Kristinn; Sukhatme, Gaurav S.; Kumar, Vijay: Cooperative control of autonomous surface vehicles for oil skimming and cleanup. IEEE International Conference on Robotics and Automation, ICRA 2011, Shanghai, China, 9-13 May 2011, pp. 2374-2379, IEEE, 2011
Finding Rusalka
Tags:

Little video from a recent recovery mission when we went out to pick up one of our gliders just off the coast from Santa Catalina Island.


Collective Transport of Robots: Coherent, Minimalist Multi-Robot Leader Following
Tags: ,

Collective Transport of Robots: Emergent Flocking from Minimalist Multi-robot Leader-following: Create robots were fitted with Wii remotes and made to follow a person wearing some Infrared LED harness. The robots are unable to sense each other, but are able to communicate.


Related Publications

Gupta, Megha; Das, Jnaneshwar; Vieira, Marcos Augusto M.; Heidarsson, Hordur Kristinn; Vathsangam, Harshvardhan; Sukhatme, Gaurav S.: Collective transport of robots: Coherent, minimalist multi-robot leader-following. 2009 IEEE/RSJ International Conference on Intelligent Robots and Systems, October 11-15, 2009, St. Louis, MO, USA, pp. 5834-5840, IEEE, 2009
Rusalka – Slocum Glider
Tags:

Rusalka, one of USC’s Webb Slocum Gliders, diving off the coast of Catalina Island while doing hardware testing. Shot with a Olympus Stylus sw1030 waterproof camera. We use our gliders in applications that aid biologists and oceanographers.


Simulations for Path Planning for Gliders
Tags:

Path planning for underwater vehicles in the presence of ocean currents using “Pseudo-Waypoints”.


Duplo Sorting by PR2
Tags:

A PR2 robot sorting Duplo bricks by size and color using visual data from a head-mounted Kinect sensor. 100% accuracy is achieved for uncluttered scenes. Duplo Bricks Sorting by PR2


Multi-Robot Collaboration With Range Limited Communication
Tags: ,

Multi-Robot Collaboration with Range-Limited Communication: Experiments with Two Underactuated ASVs.


Related Publications

Arrichiello, Filippo; Das, Jnaneshwar; Heidarsson, Hordur Kristinn; Pereira, Arvind; Chiaverini, Stefano; Sukhatme, Gaurav S.: Multi-Robot Collaboration with Range-Limited Communication: Experiments with Two Underactuated ASVs. Howard, Andrew; Iagnemma, Karl; Kelly, Alonzo (Ed.): Field and Service Robotics, Results of the 7th International Conference, FSR 2009, Cambridge, Massachusetts, USA, 14-16 July 2009, pp. 443-453, Springer, 2009