Tags: Deep reinforcement learning, Learning from demonstrations
Algorithms, systems, and tools that solve assembly tasks in simulation with reinforcement learning (RL) and successfully achieve policy transfer to a real-world robotic assembly system.
Tags: Multi-robot coordination and planning, Multi-robot SLAM
Authors: Isabel M. Rayas Fernández, Christopher E. Denniston, Gaurav S. Sukhatme
Abstract— Quantiles of a natural phenomena can provide scientists with an important understanding of different spreads of concentrations. When there are several available robots, it may be advantageous to pool resources in a collaborative way to improve performance. A multirobot team can be difficult
to practically bring together and coordinate. To this end, we present a study across several axes of the impact of using multiple robots to estimate quantiles of a distribution of interest using an informative path planning formulation. We measure quantile estimation accuracy with increasing team
size to understand what benefits result from a multirobot approach in a drone exploration task of analyzing the algae concentration in lakes. We additionally perform an analysis on several parameters, including the spread of robot initial positions, the planning budget, and inter-robot communication, and find that while using more robots generally results in
lower estimation error, this benefit is achieved under certain conditions. We present our findings in the context of real field robotic applications and discuss the implications of the results and interesting directions for future work.
Tags: Multi-robot coordination and planning, Multi-robot SLAM
Authors: Isabel M. Rayas Fernández, Christopher E. Denniston, Gaurav S. Sukhatme
Abstract— We are motivated by quantile estimation of algae concentration in lakes and how decentralized multirobot teams can effectively tackle this problem. We find that multirobot teams improve performance in this task over single robots, and communication-enabled teams further over communication-deprived teams; however, real robots are resource-constrained, and communication networks cannot support arbitrary message loads, making naive, constant information-sharing but also complex modeling and decision-making infeasible. With this in mind, we propose online, locally computable metrics for determining the utility of transmitting a given message to the other team members and a decision-theoretic approach that chooses to transmit only the most useful messages, using a decentralized and independent framework for maintaining beliefs of other teammates. We validate our approach in simulation on a real-world aquatic dataset, and we show that restricting communication via a utility estimation method based on the expected impact of a message on future teammate behavior results in a 42% decrease in network load while simultaneously decreasing quantile estimation error by 1.84%.
Tags: Deep reinforcement learning, Meta learning
A tiny (single hidden layer 4 neuron) neural network, estimated by HyperPPO, driving a quadcopter to track a square grid trajectory.
Tags: Deep reinforcement learning, Meta learning
A tiny (single hidden layer 4 neuron) neural network, estimated by HyperPPO, driving a quadcopter to track a 3D random Bezier curve.
Tags: Multi-robot coordination and planning
Tags: Informative path planning, Motion planning, Multi-robot coordination and planning
Decentralized Real-time Asynchronous Probabilistic Trajectory Planning for Collision-free Multi-Robot Navigation in Cluttered Environments.
Related Publications
Senbaslar, Baskin; Sukhatme, Gaurav S.: Asynchronous Real-time Decentralized Multi-Robot Trajectory Planning. IEEE/RSJ International Conference on Intelligent Robots and Systems, IROS 2022, Kyoto, Japan, October 23-27, 2022, pp. 9972-9979, IEEE, 2022Tags: Deep reinforcement learning, Meta learning
Leveraging graph hyper networks to learn graph hyper policies trained with off-policy reinforcement learning.
Tags: Deformable manipulation, Learning from demonstrations
A novel Learning from Demonstration (LfD) method, Deformable Manipulation from Demonstrations (DMfD), to solve deformable manipulation tasks using states or images as inputs, given expert demonstrations.
Tags: Deep reinforcement learning, Meta learning
Multi-task Reinforcement Learning (MTRL) for robots to acquire new skills.
Tags: Deformable manipulation, Learning from demonstrations
An image-based, learned method for selective tabletop object rearrangement in clutter using a parallel jaw gripper.
Generating Diverse Inverse Kinematics Solutions
Tags: Informative path planning, Motion planning
Choosing locations for scientific analysis by using a robot to perform an informative path planning survey.
Tags: Deep reinforcement learning, Learning from demonstrations
Tags: Deep reinforcement learning, Learning from demonstrations
Tags: Informative path planning, Meta learning, Motion planning
We investigate improving Monte Carlo Tree Search based solvers for Partially Observable Markov Decision Processes (POMDPs), when applied to adaptive sampling problems. We propose improvements in rollout allocation, the action exploration algorithm, and plan commitment. The first allocates a different number of rollouts depending on how many actions the agent has taken in an episode. We find that rollouts are more valuable after some initial information is gained about the environment. Thus, a linear increase in the number of rollouts, i.e. allocating a fixed number at each step, is not appropriate for adaptive sampling tasks. The second alters which actions the agent chooses to explore when building the planning tree. We find that by using knowledge of the number of rollouts allocated, the agent can more effectively choose actions to explore. The third improvement is in determining how many actions the agent should take from one plan. Typically, an agent will plan to take the first action from the planning tree and then call the planner again from the new state. Using statistical techniques, we show that it is possible to greatly reduce the number of rollouts by increasing the number of actions taken from a single planning tree without affecting the agent’s final reward. Finally, we demonstrate experimentally, on simulated and real aquatic data from an underwater robot, that these improvements can be combined, leading to better adaptive sampling. Accepted at ICRA 2021 The first two authors had an equal contribution
Tags: Informative path planning, Motion planning, Multi-robot coordination and planning
Tags: Informative path planning, Motion planning, Multi-robot coordination and planning
Tags: Deep reinforcement learning, Motion planning
Constrained robot motion planning is a widely used technique to solve complex robot tasks. We consider the problem of learning representations of constraints from demonstrations with a deep neural network, which we call Equality Constraint Manifold Neural Network (ECoMaNN). The idea behind it is to learn a level-set function of the constraint by aligning subspaces in the network with sub-spaces of the data such that it can be integrated into a constrained sampling-based motion planner. We combine both learned constraints and analytically described constraints into the planner and use a projection-based strategy to find valid points. We evaluate ECoMaNN on its representation capabilities of constraint manifolds, the impact of its individual loss terms, and the motions produced with it.
Related Publications
Sutanto, Giovanni; Rayas Fernández, Isabel M.; Englert, Peter; Ramachandran, Ragesh K.; Sukhatme, Gaurav S.: Learning Equality Constraints for Motion Planning on Manifolds. CoRR, vol. abs/2009.11852, 2020[PDF] [Video]
Tags: Deep reinforcement learning, Learning from demonstrations, Meta learning
Quadrotor stabilizing controllers often require careful, model-specific tuning for safe operation. We use reinforcement learning to train policies in a simulation that transfer remarkably well to multiple different physical quadrotors. Our policies are low-level, i.e., we map the rotorcrafts’ state directly to the motor outputs. The trained control policies are very robust to external disturbances and can withstand harsh initial conditions such as throws. To the best of our knowledge, this is the first work that demonstrates that a simple neural network can learn a robust stabilizing low-level quadrotor controller without the use of a stabilizing PD controller; as well as the first work that analyses the transfer capability of a single policy to multiple quadrotors.
Tags: Informative path planning, Motion planning, Multi-robot coordination and planning
This work is in collaboration with the USC ACT Lab.
Related Publications
Preiss, James A.; Hünig, Wolfgang; Ayanian, Nora; Sukhatme, Gaurav S.: Downwash-aware trajectory planning for large quadrotor teams. 2017 IEEE/RSJ International Conference on Intelligent Robots and Systems, IROS 2017, Vancouver, BC, Canada, September 24-28, 2017, pp. 250-257, IEEE, 2017Tags: Multi-robot coordination and planning, Multi-robot SLAM
We consider the cooperative control of a team of robots to estimate the position of a moving target using onboard sensing. In particular, we do not assume that the robot positions are known, but also estimate their positions using relative onboard sensing. Our probabilistic localization and control method takes into account the motion and sensing capabilities of the individual robots to minimize the expected future uncertainty of the target position. It reasons about multiple possible sensing topologies and incorporates an efficient topology switching technique to generate locally optimal controls in polynomial time complexity. Simulations show the performance of our approach and prove its flexibility to find suitable sensing topologies depending on the limited sensing capabilities of the robots and the movements of the target. Furthermore, we demonstrate the applicability of our method in various experiments with single and multiple quadrotor robots tracking a ground vehicle in an indoor environment.
USC’s Robotic Embedded Systems Lab and the Interaction Lab jointly present the sensational new DanceBot 2012.
Tags: Planning
Using Manipulation Primitives for Brick Sorting in Clutter.
Related Publications
Gupta, Megha; Sukhatme, Gaurav S.: Using manipulation primitives for brick sorting in clutter. IEEE International Conference on Robotics and Automation, ICRA 2012, 14-18 May, 2012, St. Paul, Minnesota, USA, pp. 3883-3889, IEEE, 2012Tags: Planning
Toward Risk Aware Mission Planning for Autonomous Underwater Vehicles.
Related Publications
Pereira, Arvind; Binney, Jonathan; Hollinger, Geoffrey A.; Sukhatme, Gaurav S.: Risk-aware Path Planning for Autonomous Underwater Vehicles using Predictive Ocean Models. J. Field Robotics, vol. 30, no. 5, pp. 741-762, 2013Tags: Multi-robot coordination and planning, Multi-robot SLAM
Cooperative Control of Autonomous Surface Vehicles for Oil Skimming and Cleanup.
Related Publications
Bhattacharya, Subhrajit; Heidarsson, Hordur Kristinn; Sukhatme, Gaurav S.; Kumar, Vijay: Cooperative control of autonomous surface vehicles for oil skimming and cleanup. IEEE International Conference on Robotics and Automation, ICRA 2011, Shanghai, China, 9-13 May 2011, pp. 2374-2379, IEEE, 2011Tags: Aquatic
Little video from a recent recovery mission when we went out to pick up one of our gliders just off the coast from Santa Catalina Island.
Tags: Multi-robot coordination and planning, Multi-robot SLAM
Collective Transport of Robots: Emergent Flocking from Minimalist Multi-robot Leader-following: Create robots were fitted with Wii remotes and made to follow a person wearing some Infrared LED harness. The robots are unable to sense each other, but are able to communicate.
Related Publications
Gupta, Megha; Das, Jnaneshwar; Vieira, Marcos Augusto M.; Heidarsson, Hordur Kristinn; Vathsangam, Harshvardhan; Sukhatme, Gaurav S.: Collective transport of robots: Coherent, minimalist multi-robot leader-following. 2009 IEEE/RSJ International Conference on Intelligent Robots and Systems, October 11-15, 2009, St. Louis, MO, USA, pp. 5834-5840, IEEE, 2009Tags: Aquatic
Rusalka, one of USC’s Webb Slocum Gliders, diving off the coast of Catalina Island while doing hardware testing. Shot with a Olympus Stylus sw1030 waterproof camera. We use our gliders in applications that aid biologists and oceanographers.
Tags: Planning
Path planning for underwater vehicles in the presence of ocean currents using “Pseudo-Waypoints”.
Tags: Planning
A PR2 robot sorting Duplo bricks by size and color using visual data from a head-mounted Kinect sensor. 100% accuracy is achieved for uncluttered scenes. Duplo Bricks Sorting by PR2
Tags: Multi-robot coordination and planning, Multi-robot SLAM
Multi-Robot Collaboration with Range-Limited Communication: Experiments with Two Underactuated ASVs.