A note from Dr. Otte

This part of our website is a perpetual work in progress, loved, yet woefully neglected. The last time the content below accurately reflected the entire scope of our lab's research was in 2017 — when I was a postdoc in the process of applying for academic jobs. That is, before our lab was even a lab :-)

If what appears below seems out of date, then please check our publications page for our most recent work.

The rest of this page is divided into two different sections. The first half contains a brief summary of the research areas I believe form the core of my identity as a researcher, along with a list of topics our lab has explored within each area. The second half contains an in-depth look at selected projects. Until I find more hours in the day, the sole selection criterion for inclusion in the more detailed second half is simply "did this content exist before 2018?"

— Michael, August, 2023

Areas of research interest

(More narrowly defined areas of interest appear higher on this list. There is overlap between areas.)

How can multiple robots pool their computational resources to collectively solve their common problems?

Robots have on-board computational resources. It seems natural that multiple robots should pool their on-board computational resources to collectively solve their common problems. This goes beyond mere coordination — the robots should actively cooperate or collaborate computationally. I am interested in applying ideas from distributed and parallel computing to multi-robot problems, subject to the constraints of on-board computation and robot-to-robot communication. In other words, robots are the computational nodes of a distributed computer that has coalesced (perhaps temporarily) to calculate the solution to a common problem. I am also interested in new forms of collective computation that become possible due to the physical nature of multi-robot systems. Our lab has explored these ideas in a number of settings as listed below.

Clustering algorithms distributed across a robot swarm: WAFR'22.
Neural networks distributed across a robot swarm: IJRR'18, ISER'16.
Multi-robot motion planning distributed across a multi-robot team: AURO'18, ICRA'14, TRO'13, PhD Disertation'11, ISER'10, DARS'10.

How can autonomous robots/vehicles efficiently replan in response to changes in the environment, robot, or mission objectives?

The world will often change in ways that invalidate an existing plan. There are many sceneries in which we cannot anticipate which changes will occur, but we can observe changes after they occur using on-board sensors. A classic example of this is path replanning in an environment where obstacles may appear, disappear, or move. Indeed, there are many other types of changes that can occur during a mission that affect the interaction between a robot/vehicle and the environment. While it is always possible to perform a new "brute-force" replan from scratch, doing so is computationally inefficient — not well suited to high-speed operation. I am interested in finding efficient ways to repair or remodel the existing (outdated) plan so that it respects the new state of the world. Sometimes a quick repair/remodel is not possible; in these cases, I am interested in ways that the old plan can be used to expedite the calculation of a new plan. Our lab has looked at replanning in a number of scenarios as listed below.

Replanning after a change in vehicle dynamics: ICRA'23, ISER'20.
Quick motion replanning in networks of control funnels: WAFR'22.
Replanning after a change in environmental dynamics: IAC'20, ICRA'16.
Target search in changing environments: ICRA'17.
Quick replanning in sampling based motion planning: IJRR'15, WAFR'14.
Path replanning in fields built with linear interpolation: IROS'09.

How can multiple robots work together when communication is limited?

Robots use communication for coordination, cooperation, and collaboration — both physically and computationally. Yet, wireless communication is not perfect in most real-world settings and may be limited, disrupted, or unavailable. Multi-robot algorithms that exhibit similar performance when communication is perfect often degrade in very different ways as communication becomes limited. During my PhD, we invented the term any-com to describe multi-robot algorithms that degrade gracefully as communication gets worse. I am interested in understanding why different algorithms have different any-com properties, and then leveraging this understanding to design new methods with better performance in communication limited scenarios. I am also interested in understanding the game theoretic and adversarial aspects of communication and/or the lack of communication. See below for a list of our lab's previous work at the intersection of multi-robot algorithms and limited communication.

Detecting hazards in communication denied environments: ArXive'23, DARS'22, TASE'21, WAFR'18.
Hacking and preventing hacking in robot swarms: WAFR'22.
Distributed multi-robot task allocation with limited communication: Access'22, Access'21, RAL'20.
Auctions (centralized multi-robot task allocation) with limited communication: AURO'20, MRS'17.
Distributed neural networks with imperfect communication between computational nodes: IJRR'18, ISER'16.
Multi-agent search games with communication constraints: AURO'17, WAFR'16.
Any-Com multi-robot motion planning (imperfect communication between robots): AURO'18, ICRA'14, TRO'13, PhD Disertation'11, ISER'10, DARS'10.

How can swarms of autonomous agents collaborate to achieve a complex goal?

The term swarm is often used to indicate that multiple agents act in concert to achieve a common goal, but it also used in subtly different ways depending on one's discourse community. For example, 'swarm' may carry connotations of scalability, emergent behavior, numerosity, overwhelming an adversary, or biological inspiration. Sometimes 'swarm' is simply used as a synonym for 'multi-agent system.' While our lab's work has involved many of these elements, I am most excited by problems related to scalability and numerosity. I am interested in how we design algorithms and swarm behaviors that still work if the number of agents is increased by multiple orders of magnitude. As we increase the number of agents from 1 to 100 to 10,000 ... maintaining computational tractability requires sacrificing algorithmic performance guarantees (such as optimality, completeness, etc.). On the other hand, new statistical tools and performance guarantees become available, and useful behaviors may emerge at the swarm-size scale (interesting unanticipated or unhelpful behaviors may also emerge). A list of our lab's previous work on swarms appears below.

Robot swarms used as smart fluids: RAL'22, ISER'22,
Clustering algorithms distributed across a robot swarm: WAFR'22.
Hacking/hijacking robot swarms: WAFR'22.
Collision avoidance in robot swarms: RAL'22.
Swarms for distributed sensing: IJRR'18, SSCI'18. SSCI'17. ISER'16. IROS'10.
Neural networks distributed across a robot swarm: IJRR'18, ISER'16.

What are new fun/interesting/useful path and motion planning problems, and how can they be solved?

Path and motion planning algorithms are used to calculate how a physical system can move through an environment to achieve a long-term goal. In the real world, environments may be highly non-convex due to obstacles or other constraints. We often consider the kinematics, dynamics, and/or controllability of a robot or vehicle. Mission objectives may require reaching a goal, performing coverage, or doing other things that require long-term navigation and motion. I am interested in discovering new path and motion planning problems that have not be addressed, and then designing algorithms to solve these problems. I am also interested in gaining theoretical insight into how these algorithms work (and when they might not work) using analytical tools. A list of problems that our lab has studied appears below.

Path-based sensors (sensors shaped like paths): ArXive'23, DARS'22, TASE'21, WAFR'18.
Replanning (note: the replanning sub-section, above, contains finer details): ICRA'23, WAFR'22, ISER'20, IAC'20, ICRA'17, ICRA'16, IJRR'15, WAFR'14, IROS'09.
Multi-agent path and motion planning: RAL'23, RAL'22, RAL'19, AURO'18, ICRA'14, TRO'13, PhD Disertation'11, ISER'10, DARS'10.
Bidirectional search when we cannot solve a vehicle's two-point boundary value problem: TRO'22, arXive'20.
Motion planning + Control: WAFR'22. arXive'15,
Path planning for target search, coverage, and information gathering: ArXive'23, DARS'22, TASE'21, ICRA'21, WAFR'18, SSCI'18, AURO'17, Sensors'17, ICRA'17, WAFR'16.
Efficient collision checking in sampling based motion planning: IJRR'16, ICRA'14, IROS'13, WAFR'12.
Navigation with foraging: IROS'13.
Path planning in image space: JFR'09, TechReport'09, MS Disertation'07, IROS'07.
Path planning + learning: PhD Preliminary Exam'08.

More information about selected projects prior to 2018

Work since 2018 is not reflected below but can be found in the Publications part of our website. Publications are kept more up-to-date so that our papers can be indexed by search engines and therefore more readily found by people that would like to read them.

A brief summery of selected research projects that appear below:

Collective Cognition & Sensing in Robotic Swarms via Emergent Group Minds
RRT-X: Real-Time Replanning in Dynamic Environments (unpredictable obstacles)
Efficient Collision Checking for Sampling Based Motion Planning
C-FOREST: Parallel Shortest-Path Planning with Super Linear Speedup
Any-Com Multi-Robot Path Planning
2D Robotic Path Planning (Extracting paths from Field D*)
Path Planning in Image Space (DARPA LAGR)
Machine Learning Applied to Systems Data

Collective Cognition & Sensing in Robotic Swarms via Emergent Group Minds

People: Michael Otte

The idea of a "group mind" has long been an element of science fiction. We use a similar idea across a robotic swarm to enable distributed sensing and decision making, and to facilitate human-swarm interaction.

One Node

All robots receive identical individual programming a priori and are dispersed into the environment. A distributed neural network emerges across the swarm at runtime as each robot maintains a slice of neurons and forms neural links with other robots within communication range. The resulting "artificial group mind" can be taught to differentiate between images, and trained to perform a prescribed (possibly heterogeneous) behavior in response to each image that it can recognize.

One Node

This animated video contains high level introduction to some of these ideas (it starts with a simpler idea to get started). See below for more videos of actual experiments.

In experiments we taught such a group mind to differentiate between three images: (1) peace sign, (2) biohazard sign, and (3) neutral gray pattern. The desired output behaviors the swarm learns to associate with each image are respectively: (1) form a blue smiley face, (2) form a red frowny face, or (3) keep training and display a yellow LED when finished.

In the first video (above) the swarm detects a peace sign and so creates a smiley face.

In the second video (above) the swarm detects a biohazard sign and so creates a frowny face.

A playlist of videos showing more experiments on swarms of 160-260 Kilobot robots can be found here.

It is important to understand that the group mind learns to differentiate between the patterns as a meta-entity. Many robots would be unable to differentiate between the smiley and frowny faces on their own. For example, any robot that receives the same input signal from both faces. In contrast, the swarm is able to leverage its distributed sensor network to detect patterns across the environment (similar to cells in a eye retina). The group mind neural network learns a mapping from different images to desired behaviors (which may vary spatially between robots for the same image).

Movement breaks the neural links and causes the group mind to dissolve. However, each robot retains the knowledge of which pattern was sensed, and which output behavior it should perform. The faces are physically created as robots that are not yet part of the face randomly move until they either join the face or leave the environment. It is assumed that robots have a library of such possible low level behaviors (e.g., randomly walk until some condition is met) but the mapping from possible images to output behaviors is learned at run-time.

The best place for more information is the IJRR 2018 journal paper , which includes many experiments, proofs of convergence , and a detailed algorithmic description. The earlier ISER 2016 conference paper is much shorter, with a limited focus on selected experiments.

top

Efficient Collision Checking in Motion Planning

People: Joshua Bialkowski, Sertac Karaman, Michael Otte, and Emilio Frazzoli

Collision checking has been considered one of the main computational bottlenecks in motion planning algorithms; yet, collision checking has also been predominantly (if not entirely) employed as a black-box subroutine by those same motion planning algorithms. We discovered that a more intelligent, yet simple, collision checking procedure that essentially eliminates this bottleneck. Specifically (in math jargon), the amortized computational complexity of collision checking approaches zero in the limit as the number of nodes in the graph approaches infinity.

1 One Node 2 Two Nodes

1) Instead of just returning a Boolean (true/false) value that represents "in collision" vs. "not in collision," our collision checking procedure returns the minimum distance to the closest obstacle. This obstacle-distance data (D i) is stored with the node (Vi) for which it was calculated (dotted black line), and represents a safe-ball (blue) around Vi (in non-Euclidean space the shape may not be a ball, but if the shape is convex then our method will still work).

2) When a new node (Vj) tries to connect to Vi with a new edge (black line) it can skip collision checking whenever Vj is closer than Di to Vi (in other words, if the new node is inside the safe-ball of the old node). Vj then remembers a pointer (dotted yellow) to Vi.

3 Three Nodes 4 Underestimate

3) If a third node (Vk) wants to connect with Vj then collision checking can again be skipped if Vk is closer than Di to Vi (thanks to the convexity of the safe-ball around Vi). Vk then also remembers a pointer to Vi in case new nodes wish to attach to it (dotted yellow).

4) The same basic strategy also works if Di is an under-estimate of the distance from Vi to the nearest obstacle.

Click here to see the WAFR paper, which includes experiments and proofs. Or click here for the final IJRR journal version, which includes a more thorough treatment and includes extensions.

top

C-FOREST: Parallel Shortest-Path Planning with Super Linear Speedup

People: Michael Otte and Nikolaus Correll

Increasing the speed of path planning algorithms will enable robots to perform more complex tasks, while also increasing the speed and efficiently of robotic systems. Harnessing the power of parallel computation is one way that this can be achieved.

C-FOREST (Coupled Forests of Random Engrafting Search Trees) is an algorithmic framework for parallelizing sampling-based shortest-path planning algorithms.

shortest-path planning algorithm: a path-planning algorithm that tries to find the shortest path between the start and the goal positions.

Sampling-based planning algorithm: an algorithm draws random samples from the search-space in order to figure out where the robot can and cannot go.

C-FOREST works as follows:

Each of the above three images represents the planning process that is happening on one of three different CPUs (red, blue, yellow). The problem being solved is for a robot (light gray, located in the lower-center of each image) to get to the tower (dark gray and brown above and to the right of the robot). Each CPU performs independent random planning until a better solution is found (this happens on the red CPU and the path is also red), and then the new (red) solution is exchanged between CPUs (as explained in the following images).

The length of this solution defines a boundary (here is is shown as red ellipse, as it would be if the robot were planning in 2D Euclidean space). Future samples are drawn from inside the ellipse (because those outside cannot yield better solutions). This increases the probability that an even better solution is found each time new random sample is chosen, and thus speeds up future planning. Existing nodes/branches can also be pruned (pruned nodes/branches are shown in gray), which decreases the time that it takes to connect future nodes to the tree (each time a new node is connected to the tree, we must find the best old node to connect the new node to; this search takes longer when there are more nodes, even though we use advanced data structures such as KD-trees). Sharing the length of the solution (from red CPU to blue and yellow CPUs) gives the advantages of knowing the red path to the other CPUs.

The dark shaded regions represent areas in the search space from which new samples will yield better paths. Note that those depicted here are only a cartoon approximation; in practice it is usually impossible to calculate this sub-space explicitly---which is why we use random sampling to begin with. Sharing the path itself increases the size of the sub-space from which new samples will yield better paths on the other CPUs. In this example, sharing the path from the red CPU to the blue and yellow CPUs further increases the probability of finding an even better solution on the blue and yellow CPUs.

As more solutions are found (blue in this example), sharing the data among CPUs ensures that all CPUs can always prune, sample and improve their solutions based on the best solution known to any CPU.

Here are some of our results comparing C-FOREST to OR-parallelization (both use the RRT* planning algorithm) on a manipulator-arm path planning problem. Note that results are shown in terms of Speedup and Efficiency.

Speedup: (S) How many times faster an algorithm runs on a parallel architecture with N CPUs than it does on an architecture with 1 CPU.
Efficiency: (E) is calculated as E = S/N and measures the relative power use of solving the same problem on an N CPU architecture vs. a 1 CPU architecture.

The color of the lines indicates how good of a solution the algorithm is asked to return (Ltarget). Warmer colors indicate better (and harder to find) paths. The horizontal axis shows the number of CPUs. Experiments with C-FOREST appear on the left, those with OR-parralelization appear on the right.

Not only does C-FOREST perform better than OR-parallelization on nearly all trials, C-FOREST also has super-linear speedup on nearly all trials!

super-linear speedup: speedup such that S > N (or E > 1).

C-FOREST can theoretically be used with any sample-based shortest-path planning algorithm such that, if it were allowed to use an infinite number of samples, then it would find the optimal solution with probability 1. (In formal math jargon: any algorithm that almost surely finds the optimal solution, in the limit as the number of samples approaches infinity).

See the paper and/or my PhD thesis for more experiments, as well as a full description of C-FOREST.

Here is a link to C-FOREST Code.

top

Any-Com Multi-Robot Path Planning

People: Michael Otte and Nikolaus Correll

Multi-robot path planning algorithms find coordinated paths for all robots operating in a shared environment. 'Centralized' algorithms have the best guarantees on completeness, but they are also the most expensive. Centralized solutions have traditionally been calculated on a single computer.

Solutions via a single robot (left) and an ad-hoc distributed computer (right).

In contrast, I combine all robots into a distributed computer using an ad-hoc wireless network. Since the network is wireless, the distributed planning algorithm must cope with communication disruption. The term 'Any-Com' implies graceful performance declines vs. increasing packet loss.

My Any-Com path planning algorithm works as follows:

Each robot performs independent search in the combined configuration space of the entire team using a random tree algorithm.
Better solutions are broadcast throughout the team whenever they are found.
Robots that receive better solutions can use them to prune their own search trees, and also to avoid building new branches in places that cannot possibly lead to even better solutions.

Since each robot maintains its own tree, lost communication does not prevent a better solution from being found. On the other hand, successful communication helps better solutions to be found more quickly. In experiments, packet loss as high as 97% has little effect on solution quality.

Here is the paper:

Michael Otte and Nikolaus Correll.
Any-Com Multi-Robot Path-Planning: Maximizing Collaboration For Variable Bandwidth. In The 10th International Symposium on Distributed Autonomous Robotic Systems (DARS), Lausanne, Switzerland, November 2010.

More recently I have been using multiple ad-hoc distributed computers in a single environment. Each robot starts in its own team, and teams are combined if their solutions conflict. This helps reduce the complexity of the problem per distributed computer.

Two non-conflicting ad-hoc distributed computers.

This is discussed in the following paper:

Michael Otte and Nikolaus Correll. Any-Com Multi-Robot Path-Planning with Dynamic Teams: Multi-Robot Coordination under Communication Constraints. In International Symposium on Experimental Robotics (ISER), New Delhi, India, December 2010.

In early work on the Any-Com idea, I also experimented with task allocation:

Michael Otte and Nikolaus Correll. The Any-Com Approach to Multi-Robot Coordination. In IEEE International Conference on Robotics and Automation (ICRA), Workshop on Network Science and Systems Issues in Multi-Robot Autonomy (NETSS), Anchorage, USA, 2010. Poster.

Finally, the most recent version of all of this work (and more!) is in my PhD Thesis.

top

Path Planning in Image Space (DARPA LAGR)

People: Michael Otte, Scott Richardson, Jane Mulligan, Greg Grudic

I began working with mobile robotics while involved with the Defense Advanced Research Project Agency (DARPA) Learning Applied to Ground Robotics (LAGR) program. Here is a picture of the LAGR robotic platform:

(Note that I borrowed this image from a slide presentation given by former LAGR P.I. Dr. Larry Jackel)

The robot's task was to navigate to a predefined coordinate (as fast as possible given a 1.3 m/s speed cap). The basic idea of LAGR was that the robot should be able to learn about the environment as it interacts with it, thus improving the ability to navigate through the environment as time progresses. The robot's primary sensors include four digital cameras that are arranged in two stereo pairs. The stereo camera pairs provide scene depth information (similar to the way that human eyes do) and also color information. The range of the depth information is accurate to a maximum of 5 meters, but color information is only limited by the robot's line-of-sight.

One of the primary goals was to learn what colors were likely to be associated with the ground or obstacles, given a particular environment. This relationship can be learned in the near field, within the range of accurate depth information from the stereo cameras. Once learned, the ground vs. obstacle information can be extrapolated to the far-field as a function of color. This is important because it significantly increases the usable range of the sensor, giving the robot more information about the environment, and enabling it to make better decisions about what to do.

My part in this project was to create and maintain the mapping/path-planning system---the piece of software charged with remembering information that the robot has discovered about world traversability, and then calculating the "best" way to reach the goal given the robot's current position.

Here is a map that has been created from environmental traversability information:

In the map, green-black represents ground-obstacle, dark blue means "unknown," the purple dashed line is the path that the robot has taken to get to its current location, the white triangle represents the robots field-of-view, the light blue rectangle is the robot, and the big white square is the goal. This is a happy robot because it has reached the goal.

Most path planners either operate in a Cartesian based top-down map like the one shown above, or a 3D representation of the environment. This means that camera information must be transformed into another coordinate space before it can be used. I developed methods that allow paths to be found directly in the coordinate space of raw camera information. This is like steering a car by looking out of the front windshield, instead of by looking at the car's position on a road-map.

Here is a picture showing ground vs. obstacle information (right) that is extracted from a stereo disparity image (center)---disparity is inversely related to depth. The RGB image is also shown (left).

As demonstrated by a simple bale obstacle, paths can be found directly in this cost information (right):

Note that the width of obstacles have been increased by 1/2 the width of the robot so that it will not collide with the obstacles if it follows the path.

Here is the paper I wrote about this idea (my Master's Thesis was an earlier iteration of this work):

Michael W. Otte, Scott G. Richardson, Jane Mulligan, Gregory Grudic. Path planning in image space for the autonomous robot navigation in unstructured outdoor environments. Journal of Field Robotics, Volume 26, Issue 2, February, 2009, p. 212-240. preprint version.

I found that using a version of this planner that simulates a 360 degree field-of-view in memory can provide a good local planner--that is, a system that is used to quickly find paths around immediate obstacles. However, the image planner cannot remember information about locations that have moved far away from the robot, thus it does not make a robust global planner--that is, a system that finds long-range paths.

top

Machine Learning Applied to Systems Data

People: Scott Richadson, Michael Otte, and Mike Mozer

Modern computer programs are very complicated, often hard to analyze, and may interact with lower levels of code in unexpected ways. Often programmers cannot predict how a piece of code will affect the given computer architecture that it runs on until it is actually executed (i.e. with respect to memory usage or runtime). Also, different architectures may respond differently to the same piece of code.

It has been shown that computer architectures exhibits phase behavior. That is, if a record of the architecture's operation sequence (i.e. execution trace) is sliced into many pieces, than many of these pieces can be divided into groups defined by similar behavior. For instance, here is a plot of the cycles per instruction (CPI) of a simple benchmarking program (gzip,ref2):

Thanks to Scott Richardson for making this figure.

As you can see, there appear to be two or three distinct phases of behavior, as well as a few intervals that are unique.

CPI is just one of many metrics that are used to evaluate an architecture's performance. When a new architecture is designed, it must either be fabricated or simulated in software. The former is expensive, while the latter is time intensive. For hardware dependent metrics such as CPI, architecture simulation may taking months of computation time (even to run a trivial program).

In order to speed up development, designers often choose to simulate only a fraction of the entire program. To date, simple machine learning tools have been used to cluster program intervals based on easily obtained software metrics, such as non-branching pieces of code (i.e. basic blocks). Once the clustering of a program execution trace is found, only one interval per cluster must be executed on the software simulated architecture to determine how the architecture will likely perform over the entire program.

In previous research we investigated whether or not more sophisticated machine learning techniques could provide advantages over the relatively simple methods that are currently used.

top

Research