Visibility-Aware Mobile Grasping
in Dynamic Environments

Tianrun Hu, Anxing Xiao, David Hsu, Hanbo Zhang

Teaser: A mobile manipulator navigating around dynamic obstacles to grasp a target object.
A Fetch mobile manipulator must jointly reason about where to look and how to move its whole body to safely grasp objects in dynamic, unknown environments with limited field-of-view.

Abstract

This paper addresses the problem of mobile grasping in dynamic, unknown environments where a robot must operate under a limited field-of-view. The fundamental challenge is the inherent trade-off between "seeing" around to reduce environmental uncertainty and "moving" the body to achieve task progress in a high-dimensional configuration space, subject to visibility constraints.

We propose a unified mobile grasping system comprising two core components: (1) an iterative low-level whole-body planner coupled with velocity-aware active perception to navigate dynamic environments safely; and (2) a hierarchical high-level planner based on behavior trees that adaptively generates subgoals to guide the robot through exploration and runtime failures.

We evaluate the system across 400 simulation scenarios and in real-world deployment on a Fetch mobile manipulator. The results show that in static unknown environments and in dynamic environments with suddenly appearing obstacles, our system achieves success rates of 68.75% and 58.0%, respectively, significantly outperforming baselines in both robustness and safety.

Video

Method Overview

System architecture diagram
System architecture. Our receding-horizon framework tightly couples perception and planning, jointly optimizing observation and whole-body motion at runtime.

Velocity-Aware Active Gaze

A state-dependent gaze policy πv switches between observing the target during planning and monitoring the swept volume during execution, prioritizing collision-critical regions based on obstacle velocity and temporal proximity.
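The switching logic can be illustrated with a minimal sketch. This is not the authors' implementation: the function name, the phase labels, and the speed-over-distance criticality score are illustrative assumptions standing in for the paper's velocity- and time-aware prioritization.

```python
import math

def select_gaze_target(phase, target_pos, obstacles, robot_pos):
    """Hypothetical state-dependent gaze selection.

    phase: "planning"  -> look at the grasp target to refine its estimate;
           "execution" -> look at the most collision-critical obstacle.
    obstacles: list of dicts with 'pos' (x, y) and 'vel' (vx, vy).
    """
    if phase == "planning" or not obstacles:
        return target_pos

    def criticality(ob):
        # Proxy for temporal proximity: fast, nearby obstacles score highest.
        dist = math.hypot(robot_pos[0] - ob["pos"][0],
                          robot_pos[1] - ob["pos"][1])
        speed = math.hypot(*ob["vel"])
        return speed / (dist + 1e-6)

    return max(obstacles, key=criticality)["pos"]
```

During execution the camera is thus steered toward whichever obstacle is most likely to intersect the robot's motion soonest, rather than staying fixed on the target.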

Hierarchical Subgoal Generation

An adaptive behavior-tree policy πg generates subgoals through three progressive strategies: direct grasping, pre-grasp repositioning, and observation gathering, enabling runtime recovery from failures.
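The progressive fallback structure can be sketched as a minimal behavior-tree Fallback node that tries the three strategies in order. The node and strategy functions below are illustrative assumptions, not the paper's code; each strategy returns a status plus a hypothetical subgoal tuple.

```python
def fallback(children, state):
    """Behavior-tree Fallback: return the first child's successful subgoal."""
    for child in children:
        status, subgoal = child(state)
        if status == "SUCCESS":
            return status, subgoal
    return "FAILURE", None

def direct_grasp(state):
    # Strategy 1: grasp immediately if the target is currently reachable.
    if state.get("grasp_reachable"):
        return "SUCCESS", ("grasp", state["target"])
    return "FAILURE", None

def pre_grasp_reposition(state):
    # Strategy 2: move the base to a better pre-grasp pose, if one was found.
    if state.get("base_pose_found"):
        return "SUCCESS", ("move_base", state["base_pose"])
    return "FAILURE", None

def gather_observation(state):
    # Strategy 3 (last resort): pick a viewpoint to reduce map uncertainty.
    return "SUCCESS", ("observe", state.get("frontier", "nearest_frontier"))
```

Because the tree is re-ticked at runtime, a failed grasp attempt naturally falls through to repositioning or further observation on the next tick, which is the recovery behavior described above.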

Fast Whole-Body Planning

A novel vmRRT-C (Vectorized Mobile RRT-Connect) planner achieves 50–80 ms planning times, enabling real-time replanning across the 11-DoF heterogeneous configuration space.
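The "vectorized" ingredient can be illustrated by batching an RRT-Connect edge check: instead of testing interpolated configurations one by one in a Python loop, all samples along an edge are tested against all obstacles in one array operation. This sketch uses 2D points and spherical obstacles purely for illustration; the paper's planner operates on the full 11-DoF configuration, and the function below is an assumption of ours, not the vmRRT-C implementation.

```python
import numpy as np

def edge_collision_free(q_from, q_to, obstacle_centers, radius, n_samples=16):
    """Batched edge validity check for a sampling-based planner (sketch).

    Interpolates n_samples configurations along the edge and tests all of
    them against all spherical obstacles in a single broadcasted distance
    computation, the kind of vectorization that makes tens-of-ms planning
    plausible.
    """
    t = np.linspace(0.0, 1.0, n_samples)[:, None]              # (n, 1)
    pts = (1 - t) * np.asarray(q_from) + t * np.asarray(q_to)  # (n, d)
    # Pairwise sample-obstacle distances via broadcasting: shape (n, m).
    d = np.linalg.norm(
        pts[:, None, :] - np.asarray(obstacle_centers)[None, :, :], axis=-1
    )
    return bool(np.all(d > radius))
```

A straight edge from (0, 0) to (2, 0) passes when the only obstacle sits far away at (1, 5), and fails when an obstacle of radius 0.2 sits on the midpoint (1, 0).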

Execution Example

Full execution sequence showing third-person view, first-person view, and belief map.
Full execution sequence. Top: third-person view. Middle: onboard camera (first-person). Bottom: occupancy belief map. The robot actively explores, repositions, and grasps the target while avoiding dynamic obstacles.

Results

68.75% Success — Unknown Static
58.0% Success — Dynamic
400 Simulation Scenarios
50–80 ms Planning Time

Simulation Evaluation

Success rate comparison across methods
Success rate comparison against baselines across 400 scenarios in 20 ReplicaCAD scenes.

Failure Analysis

Failure flow in unknown static environments
Failure decomposition in unknown static environments.

Real-World Deployment

Real robot in static environments
Static environment: 65% success across 5 indoor locations.
Real robot with dynamic obstacles
Dynamic environment with moving pedestrians: 55% success.

Spatial Failure Distribution

Spatial distribution of successes and failures across scenes
Spatial distribution of successes and failures. Failures cluster in deep table areas (reachability limits), wall-adjacent regions (constrained approach angles), and cluttered zones (precision requirements).

Evaluation Environments

Diverse evaluation environments in simulation and real world
Our system is evaluated across diverse environments in both simulation (ReplicaCAD scenes) and real-world indoor locations.

Qualitative Results

Initial configuration
Initial Configuration
Active observation
Active Observation
Pre-grasp repositioning
Pre-Grasp Repositioning
Grasping pose
Grasping

BibTeX

@article{hu2025visibility,
  title   = {Visibility-Aware Mobile Grasping in Dynamic Environments},
  author  = {Hu, Tianrun and Xiao, Anxing and Hsu, David and Zhang, Hanbo},
  journal = {arXiv preprint},
  year    = {2025}
}