Human Following Robot
Integrating YOLO and SLAM with a Clearpath Robotics Jackal to locate and track a specific person while responding to gesture-based commands.
Summary
Implemented autonomous human-following on a Clearpath Jackal mobile robot. The system uses ROS 2 with a full Nav2 stack and YOLO pose detection to track, pursue, and navigate toward a designated person in real time, without relying on wearable sensors or tags.
System Architecture
Vision-Based Tracking
- Deploys YOLO to identify the target and continuously orients the robot to keep the person centered within the camera frame.
- Extracts RGB and depth data from a RealSense D435i to continuously calculate the robot's distance to the person.
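
The centering and ranging steps above can be sketched as follows. This is a minimal illustration, not the project's actual code: the function names, the proportional gain, the roughly 69 degree horizontal field of view, and the depth validity limits are all assumptions chosen for the example. The only convention taken as given is the ROS one, where a positive angular velocity about z turns the robot counterclockwise (left).

```python
import math
from statistics import median


def yaw_rate_to_center(bbox_center_x, image_width, hfov_deg=69.0,
                       gain=1.0, max_rate=1.0):
    """Proportional yaw command (rad/s) that turns the robot toward the person.

    hfov_deg, gain, and max_rate are assumed values for illustration.
    """
    # Horizontal offset in [-0.5, 0.5]; positive means the person is right of center.
    offset = bbox_center_x / image_width - 0.5
    # Pinhole-camera approximation of the bearing to the person, in radians.
    half_fov = math.radians(hfov_deg) / 2.0
    bearing = math.atan(2.0 * offset * math.tan(half_fov))
    # Positive angular z is a left turn in ROS, so a person on the right
    # needs a negative command.
    command = -gain * bearing
    return max(-max_rate, min(max_rate, command))


def person_distance(depth_samples_m, min_valid=0.1, max_valid=10.0):
    """Median of the valid depth readings (meters) sampled inside the bounding box.

    Returns None when no reading falls inside the assumed valid range.
    """
    valid = [d for d in depth_samples_m if min_valid <= d <= max_valid]
    return median(valid) if valid else None
```

Using the median rather than the mean keeps zero-valued depth holes and background pixels inside the bounding box from skewing the estimate.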
Gesture Control
- Recognizes and responds to visual gesture commands, allowing the user to initiate or halt the following behavior hands-free.
- Maps individual joint keypoints to specific control commands using a YOLO pose model trained on the Hand Keypoints dataset.
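
One way to turn hand keypoints into commands is to count extended fingers, sketched below. Several things here are assumptions: the write-up does not say which gestures were used, so the open-palm "STOP" and fist "FOLLOW" mapping is hypothetical, as are the function names. The keypoint indices assume the 21-point, MediaPipe-style hand layout (wrist at index 0, four joints per finger).

```python
import math

WRIST = 0
# (fingertip, middle joint) index pairs for the index, middle, ring, and pinky
# fingers, assuming a 21-keypoint MediaPipe-style hand layout.
FINGERS = [(8, 6), (12, 10), (16, 14), (20, 18)]


def count_extended_fingers(keypoints):
    """Count fingers whose tip is farther from the wrist than its middle joint.

    keypoints is a sequence of 21 (x, y) pixel coordinates for one hand.
    Comparing distances instead of raw coordinates keeps the test independent
    of hand size and image position.
    """
    wrist = keypoints[WRIST]
    return sum(
        1
        for tip, mid in FINGERS
        if math.dist(keypoints[tip], wrist) > math.dist(keypoints[mid], wrist)
    )


def classify_gesture(keypoints):
    """Map a hand pose to a command string, or None if the pose is ambiguous.

    Hypothetical mapping: open palm -> "STOP", closed fist -> "FOLLOW".
    """
    extended = count_extended_fingers(keypoints)
    if extended == 4:
        return "STOP"
    if extended == 0:
        return "FOLLOW"
    return None
```

In practice a command would likely only be accepted after the same gesture is seen over several consecutive frames, so a single misdetected pose does not toggle the behavior.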
Autonomous Navigation & Obstacle Avoidance
- Safely maneuvers through environments, calculating dynamic paths to the target while preventing collisions.
- Remains approximately 1 meter away from the person to give them space and keep them in the camera frame.
- Integrates the Nav2 stack, SLAM Toolbox, and a Velodyne LiDAR sensor to achieve reliable spatial awareness.
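
The 1 meter standoff can be expressed as a goal pose placed on the line between the robot and the person. The sketch below shows only the geometry and assumes the robot pose in the map frame plus the person's range and bearing are already known. The function name and argument layout are hypothetical, and sending the resulting pose to Nav2 is left out.

```python
import math


def follow_goal(robot_x, robot_y, robot_yaw, person_range, person_bearing,
                standoff=1.0):
    """Return a goal (x, y, yaw) in the map frame that stops short of the person.

    person_range is in meters; person_bearing is in radians relative to the
    robot's heading, positive to the left. standoff is the gap to keep.
    """
    # Direction from the robot to the person, expressed in the map frame.
    heading = robot_yaw + person_bearing
    # Distance to drive; zero if the robot is already inside the standoff.
    travel = max(0.0, person_range - standoff)
    goal_x = robot_x + travel * math.cos(heading)
    goal_y = robot_y + travel * math.sin(heading)
    # Face the person on arrival so they stay in the camera frame.
    return goal_x, goal_y, heading
```

For example, a person detected 3 m straight ahead of a robot at the origin gives a goal 2 m ahead with an unchanged heading.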
Search & Reacquisition
- If line of sight is broken, the system uses the target's last known position to autonomously explore and reacquire visual contact.
- Initiates a rotational search pattern in the direction the person exited the frame if they are no longer detected at their last known coordinates.
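
The two-stage recovery described above maps naturally onto a small state machine. The sketch below is one possible arrangement, not the project's implementation: the mode names, the function signatures, and the 0.5 rad/s search speed are assumptions made for illustration.

```python
from enum import Enum, auto


class Mode(Enum):
    FOLLOW = auto()           # person visible, normal following
    GO_TO_LAST_SEEN = auto()  # person lost, driving to their last known position
    ROTATE_SEARCH = auto()    # arrived at that position, rotating to find them


def next_mode(mode, person_visible, reached_last_seen):
    """Advance the recovery state machine by one step."""
    if person_visible:
        # Any detection immediately resumes following.
        return Mode.FOLLOW
    if mode is Mode.FOLLOW:
        return Mode.GO_TO_LAST_SEEN
    if mode is Mode.GO_TO_LAST_SEEN and reached_last_seen:
        return Mode.ROTATE_SEARCH
    return mode


def search_yaw_rate(last_bbox_center_x, image_width, speed=0.5):
    """Rotate toward the side of the frame where the person was last seen.

    Positive values turn counterclockwise (left), following the ROS convention.
    """
    exited_left = last_bbox_center_x < image_width / 2
    return speed if exited_left else -speed
```

Keeping the transition logic in a pure function like `next_mode` makes it straightforward to test without a running robot.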
Discussion
The tracking performed reliably, especially considering that all processing ran onboard a 4th-generation Intel Core i5 CPU. The project demonstrated the feasibility of dynamic, markerless human tracking using only onboard vision and SLAM. Future iterations would benefit from upgraded compute hardware to allow higher-frequency pose processing and more aggressive dynamic tracking.