This is a work-in-progress repository that holds a probabilistic vision pipeline.
I found it helpful to increase the socket buffer size on Linux:
# Temporarily raise to 16 MB
sudo sysctl -w net.core.rmem_max=16777216
sudo sysctl -w net.core.rmem_default=16777216
# Verify
sysctl net.core.rmem_max net.core.rmem_default
To make it permanent, add a file /etc/sysctl.d/60-cyclone-rmem.conf
with:
net.core.rmem_max = 16777216
net.core.rmem_default = 16777216
Then either reboot or reload via sudo sysctl --system.
To start the camera topics, SSH into your robot where the RealSense cameras are connected and create a vision pipeline workspace:
user@robot mkdir ~/vp_ws/
user@robot mkdir ~/vp_ws/src
user@robot cd ~/vp_ws/src/
Then clone and build the vision pipeline and custom messages:
user@robot git clone [email protected]:correlllab/vision_pipeline.git
user@robot git clone [email protected]:correlllab/custom_ros_messages.git
user@robot cd ~/vp_ws/
user@robot colcon build
user@robot source ./install/setup.sh
user@robot ros2 launch vision_pipeline realsense_cameras.launch.py launch_head:=true launch_left_hand:=true launch_right_hand:=true
You should see all the topics start; you can verify this in a new terminal with
user@robot ros2 topic list
...
/realsense/head/aligned_depth_to_color/camera_info
/realsense/head/aligned_depth_to_color/image_raw
/realsense/head/aligned_depth_to_color/image_raw/compressed
/realsense/head/aligned_depth_to_color/image_raw/compressedDepth
/realsense/head/aligned_depth_to_color/image_raw/theora
/realsense/head/color/camera_info
/realsense/head/color/image_raw
/realsense/head/color/image_raw/compressed
/realsense/head/color/image_raw/compressedDepth
/realsense/head/color/image_raw/theora
/realsense/head/color/metadata
/realsense/head/depth/camera_info
/realsense/head/depth/image_rect_raw
/realsense/head/depth/image_rect_raw/compressed
/realsense/head/depth/image_rect_raw/compressedDepth
/realsense/head/depth/image_rect_raw/theora
/realsense/head/depth/metadata
/realsense/head/extrinsics/depth_to_color
/realsense/head/extrinsics/depth_to_depth
/realsense/left_hand/aligned_depth_to_color/camera_info
/realsense/left_hand/aligned_depth_to_color/image_raw
/realsense/left_hand/aligned_depth_to_color/image_raw/compressed
/realsense/left_hand/aligned_depth_to_color/image_raw/compressedDepth
/realsense/left_hand/aligned_depth_to_color/image_raw/theora
/realsense/left_hand/color/camera_info
/realsense/left_hand/color/image_raw
/realsense/left_hand/color/image_raw/compressed
/realsense/left_hand/color/image_raw/compressedDepth
/realsense/left_hand/color/image_raw/theora
/realsense/left_hand/color/metadata
/realsense/left_hand/depth/camera_info
/realsense/left_hand/depth/image_rect_raw
/realsense/left_hand/depth/image_rect_raw/compressed
/realsense/left_hand/depth/image_rect_raw/compressedDepth
/realsense/left_hand/depth/image_rect_raw/theora
/realsense/left_hand/depth/metadata
/realsense/left_hand/extrinsics/depth_to_color
/realsense/right_hand/aligned_depth_to_color/camera_info
/realsense/right_hand/aligned_depth_to_color/image_raw
/realsense/right_hand/aligned_depth_to_color/image_raw/compressed
/realsense/right_hand/aligned_depth_to_color/image_raw/compressedDepth
/realsense/right_hand/aligned_depth_to_color/image_raw/theora
/realsense/right_hand/color/camera_info
/realsense/right_hand/color/image_raw
/realsense/right_hand/color/image_raw/compressed
/realsense/right_hand/color/image_raw/compressedDepth
/realsense/right_hand/color/image_raw/theora
/realsense/right_hand/color/metadata
/realsense/right_hand/depth/camera_info
/realsense/right_hand/depth/image_rect_raw
/realsense/right_hand/depth/image_rect_raw/compressed
/realsense/right_hand/depth/image_rect_raw/compressedDepth
/realsense/right_hand/depth/image_rect_raw/theora
/realsense/right_hand/depth/metadata
/realsense/right_hand/extrinsics/depth_to_color
...
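If you want to check that image data is actually flowing, and not just that the topics exist, a minimal rclpy subscriber like the sketch below can help. It assumes rclpy and sensor_msgs are available on the robot and uses one of the color topics listed above; adjust the topic name for the camera you care about.
import rclpy
from rclpy.node import Node
from sensor_msgs.msg import Image

class TopicCheck(Node):
    def __init__(self):
        super().__init__("topic_check")
        # Count messages on the head color stream to confirm data is flowing.
        self.count = 0
        self.create_subscription(Image, "/realsense/head/color/image_raw", self.callback, 10)

    def callback(self, msg):
        self.count += 1
        if self.count % 30 == 0:
            self.get_logger().info(f"received {self.count} frames ({msg.width}x{msg.height})")

def main():
    rclpy.init()
    rclpy.spin(TopicCheck())

if __name__ == "__main__":
    main()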
On your desktop, first create a vision pipeline workspace with
user@desktop mkdir ~/vp_ws/
user@desktop mkdir ~/vp_ws/src
Then clone the vision pipeline and the custom ROS messages into src:
user@desktop cd ~/vp_ws/src/
user@desktop git clone [email protected]:correlllab/vision_pipeline.git
user@desktop git clone [email protected]:correlllab/custom_ros_messages.git
Next, edit the Dockerfile with
user@desktop nano ~/vp_ws/src/vision_pipeline/Docker/Dockerfile
change the line
ENV CYCLONEDDS_URI="<CycloneDDS><Domain><General><Interfaces><NetworkInterface name=\"enp4s0\" priority=\"default\" multicast=\"default\"/></Interfaces></General></Domain></CycloneDDS>"
so that the NetworkInterface name attribute matches the name of your network interface (i.e. replace enp4s0).
Finally, build the Docker image with
user@desktop cd ~/vp_ws/
user@desktop ./src/vision_pipeline/Docker/docker_build.sh ./src/vision_pipeline/Docker/Dockerfile
To resolve an error like
> [ 6/15] RUN apt-get update && apt-get install -y --no-install-recommends ros-humble-rmw-cyclonedds-cpp && rm -rf /var/lib/apt/lists/*:
0.131 E: Conflicting values set for option Signed-By regarding source http://packages.ros.org/ros2/ubuntu/ jammy: /usr/share/keyrings/ros-archive-keyring.gpg !=
-----BEGIN PGP PUBLIC KEY BLOCK-----
...
0.131 -----END PGP PUBLIC KEY BLOCK-----
0.131 E: The list of sources could not be read.
------
Dockerfile:27
--------------------
26 | # Install Cyclone DDS
27 | >>> RUN apt-get update && \
28 | >>> apt-get install -y --no-install-recommends \
29 | >>> ros-humble-rmw-cyclonedds-cpp && \
30 | >>> rm -rf /var/lib/apt/lists/*
31 |
--------------------
ERROR: failed to solve: process "/bin/sh -c apt-get update && apt-get install -y --no-install-recommends ros-humble-rmw-cyclonedds-cpp && rm -rf /var/lib/apt/lists/*" did not complete successfully: exit code: 100
comment out the following lines in the Dockerfile:
# Re-import ROS 2 key and repo
RUN curl -sSL https://raw.githubusercontent.com/ros/rosdistro/master/ros.asc \
| gpg --dearmor \
| tee /usr/share/keyrings/ros-archive-keyring.gpg > /dev/null && \
echo "deb [signed-by=/usr/share/keyrings/ros-archive-keyring.gpg] \
http://packages.ros.org/ros2/ubuntu \
$(lsb_release -cs) main" \
| tee /etc/apt/sources.list.d/ros2-latest.list
To resolve an error like
Package ros-humble-rmw-cyclonedds-cpp is not available, but is referred to by another package.
This may mean that the package is missing, has been obsoleted, or
is only available from another source
E: Package 'ros-humble-rmw-cyclonedds-cpp' has no installation candidate
The command '/bin/sh -c apt-get update && apt-get install -y --no-install-recommends ros-humble-rmw-cyclonedds-cpp && rm -rf /var/lib/apt/lists/*' returned a non-zero code: 100
make sure the following lines in the Dockerfile
# Re-import ROS 2 key and repo
RUN curl -sSL https://raw.githubusercontent.com/ros/rosdistro/master/ros.asc \
| gpg --dearmor \
| tee /usr/share/keyrings/ros-archive-keyring.gpg > /dev/null && \
echo "deb [signed-by=/usr/share/keyrings/ros-archive-keyring.gpg] \
http://packages.ros.org/ros2/ubuntu \
$(lsb_release -cs) main" \
| tee /etc/apt/sources.list.d/ros2-latest.list
are not commented out.
With the Docker image built, we just need to run the container.
Modify the file vision_pipeline/Docker/docker_run.sh
so that the line
-v /home/max/vp_ws/src/vision_pipeline:/ros2_ws/src/vision_pipeline
mounts your vision_pipeline checkout; you likely only need to change the /home/max/
part.
Finally, you can run the interactive container with
user@desktop ./src/vision_pipeline/Docker/docker_run.sh
You should see colcon build the vision pipeline and custom message packages, like
user@desktop:~/vp_ws$ ./src/vision_pipeline/Docker/docker_run.sh
non-network local connections being added to access control list
Sourcing ROS 2...
Building vision_pipeline...
Starting >>> custom_ros_messages
Starting >>> vision_pipeline
Finished <<< vision_pipeline [0.46s]
Finished <<< custom_ros_messages [4.31s]
Summary: 2 packages finished [4.41s]
Sourcing overlay workspace...
root@DockerContainer:/ros2_ws#
From there you can run vision_pipeline entry points with
root@DockerContainer:/ros2_ws# ros2 run vision_pipeline camera
root@DockerContainer:/ros2_ws# ros2 run vision_pipeline foundationmodels
root@DockerContainer:/ros2_ws# ros2 run vision_pipeline visionpipeline
To start the visionpipeline directly, you can just use
user@desktop ./src/vision_pipeline/Docker/docker_run.sh ros2 run vision_pipeline visionpipeline
Utils holds utility functions used elsewhere, such as:
get_points_and_colors
iou_2d
iou_3d
pose_to_matrix
matrix_to_pose
quat_to_euler
decode_compressed_depth_image
in_image
nms
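For orientation, here is a rough sketch of what two of these helpers might look like; the actual implementations and signatures in Utils may differ, so treat this as illustrative only.
import numpy as np
from scipy.spatial.transform import Rotation

def iou_2d(box_a, box_b):
    # Boxes as (x_min, y_min, x_max, y_max); IoU = intersection area / union area.
    x1 = max(box_a[0], box_b[0]); y1 = max(box_a[1], box_b[1])
    x2 = min(box_a[2], box_b[2]); y2 = min(box_a[3], box_b[3])
    inter = max(0.0, x2 - x1) * max(0.0, y2 - y1)
    area_a = (box_a[2] - box_a[0]) * (box_a[3] - box_a[1])
    area_b = (box_b[2] - box_b[0]) * (box_b[3] - box_b[1])
    return inter / (area_a + area_b - inter + 1e-9)

def pose_to_matrix(position_xyz, quaternion_xyzw):
    # Build a 4x4 homogeneous transform from a position and an xyzw quaternion.
    T = np.eye(4)
    T[:3, :3] = Rotation.from_quat(quaternion_xyzw).as_matrix()
    T[:3, 3] = position_xyz
    return T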
Foundation Models holds two key classes, OWLv2 and SAM2_PC. The OWLv2 class consumes an RGB image and text queries and produces a dictionary of 2D candidates like
candidates: {
    query_1: {
        boxes: list[n_1],
        scores: list[n_1]
    },
    ...
    query_m: {
        boxes: list[n_m],
        scores: list[n_m]
    }
}
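Assuming the dictionary has exactly the shape above, consuming it is a nested iteration over queries and their parallel box/score lists; the threshold below is an arbitrary illustrative value.
# candidates is the dictionary returned for one image and m text queries.
for query, detections in candidates.items():
    for box, score in zip(detections["boxes"], detections["scores"]):
        if score > 0.3:  # arbitrary confidence threshold for illustration
            print(f"{query}: box={box}, score={score:.2f}")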
The SAM2_PC class consumes an RGB image, a depth image, the lists of 2D bounding boxes and scores produced for a single query by OWLv2, and the camera intrinsics. It produces a set of 3D candidates like
pointclouds[n]
boundingboxes3d[n]
scores[n]
rgb_masks[n]
depth_masks[n]
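These five outputs are parallel lists indexed by candidate, so they are typically consumed together. A minimal sketch, assuming they are returned as plain Python lists and that the point clouds are Open3D-style objects with a points attribute:
# Hypothetical unpacking of the n 3D candidates produced for one query.
for pcd, box3d, score, rgb_mask, depth_mask in zip(
        pointclouds, boundingboxes3d, scores, rgb_masks, depth_masks):
    # Each candidate pairs a segmented point cloud with its 3D box, masks, and score.
    print(f"candidate with {len(pcd.points)} points, score {score:.2f}")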
The vision pipeline orchestrates OWLv2 and SAM2_PC to continuously update beliefs about objects that have been queried for. The vision pipeline maintains a dictionary of tracked objects like
tracked_objects[object_name] = {
"boxes": List of 3D bounding boxes,
"scores": Tensor of belief scores,
"pcds": List of point clouds,
"rgb_masks": List of lists of RGB masks,
"depth_masks": List of lists of depth masks
"names": List of strings for object names
}
The vision pipeline updates with the update method, which takes an RGB image, a depth image, a set of query strings, the camera intrinsics, and the observation pose. For each query in the update call, a set of 3D candidates made up of point clouds, scores, 3D bounding boxes, and masks is generated. Then, using the IoU metric, the candidates from this call to update are matched to objects already in the tracked objects dictionary, and their beliefs are fused by converting their scores into odds and using the odds to perform a Bayesian belief update. If an object was not updated but should have been in view, its belief is decayed, and low-belief tracked objects are pruned.
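To make the fusion step concrete, here is the standard odds-form Bayesian update the description refers to, as a small self-contained sketch; the pipeline's actual decay and pruning constants are not documented here, so the values below are placeholders.
def fuse_beliefs(prior_score, new_score, eps=1e-6):
    # Convert probabilities to odds, multiply (Bayesian update in odds form),
    # then convert back to a probability.
    prior_odds = prior_score / max(1.0 - prior_score, eps)
    new_odds = new_score / max(1.0 - new_score, eps)
    fused_odds = prior_odds * new_odds
    return fused_odds / (1.0 + fused_odds)

def decay_belief(score, decay=0.9):
    # Placeholder decay applied when an object should have been visible but was not matched.
    return score * decay

# Example: an existing belief of 0.6 fused with a new detection scored 0.7.
print(fuse_beliefs(0.6, 0.7))  # ~0.78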
RosRealsense contains the class RealSenseSubscriber, which takes a camera name like head, left_hand, or right_hand and subscribes to the necessary topics created in Starting the camera topics.
This subscriber should only be used with the get_data method, which returns a tuple containing the RGB image, depth image, camera info, and pose.
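A minimal usage sketch follows; the import path and constructor arguments are assumptions, so check the class definition for the exact interface.
# Sketch only: module path and constructor arguments are assumed, not confirmed.
import rclpy
from vision_pipeline.RosRealsense import RealSenseSubscriber

rclpy.init()
sub = RealSenseSubscriber("head")      # camera name as used in the launch file
rclpy.spin_once(sub, timeout_sec=1.0)  # assuming the subscriber is itself a node
rgb, depth, camera_info, pose = sub.get_data()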
RosRealsense.py also contains the TestSubscriber and TestFoundationModel functions, which are entry points that can be run with
root@DockerContainer:/ros2_ws# ros2 run vision_pipeline camera
root@DockerContainer:/ros2_ws# ros2 run vision_pipeline foundationmodels
respectively.
This is the main node for this repo; it combines everything above. It contains the ROS_VisionPipe class, which runs a node that exposes services to use the functionality from VisionPipeline.py and publishes markers and point clouds from the tracked object dictionary. RosVisionPipeline relies on the UpdateTrackedObject and Query services from our custom_ros_messages package.
This file also provides an example client for interacting with the services
After the Docker container has built successfully, you can run the node with
user@desktop ./src/vision_pipeline/Docker/docker_run.sh ros2 run vision_pipeline visionpipeline
You can run the example client that looks for a drill with
user@desktop ./src/vision_pipeline/Docker/docker_run.sh ros2 run vision_pipeline exampleclient
If you just want to manually set a tracked object and query for it, you can use
ros2 service call /vp_update_tracked_object custom_ros_messages/srv/UpdateTrackedObject "{object: 'object to track', action: 'add'}"
ros2 service call /vp_update_tracked_object custom_ros_messages/srv/UpdateTrackedObject "{object: 'object to track', action: 'remove'}"
ros2 service call /vp_query_tracked_objects custom_ros_messages/srv/Query "{query: 'object being tracked'}"
To programmatically use the services, you first need to import the service types
from custom_ros_messages.srv import Query, UpdateTrackedObject
then you can create service clients inside a node class like
self.update_client = self.create_client(UpdateTrackedObject, 'vp_update_tracked_object')
self.query_client = self.create_client(Query, 'vp_query_tracked_objects')
Once the clients are created, you can add and remove a track string with the UpdateTrackedObject service like
add:
req = UpdateTrackedObject.Request()
req.object = track_string
req.action = "add"
future = self.update_client.call_async(req)
rclpy.spin_until_future_complete(self, future)
result = future.result()
remove:
req = UpdateTrackedObject.Request()
req.object = track_string
req.action = "remove"
future = self.update_client.call_async(req)
rclpy.spin_until_future_complete(self, future)
result = future.result()
You can query for where one of your tracked objects is by using the Query service like
req = Query.Request()
req.query = track_string
future = self.query_client.call_async(req)
rclpy.spin_until_future_complete(self, future)
result = future.result()
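Putting the pieces together, a minimal client node might look like the sketch below. The service types and names match those shown above; the node name and the "drill" query string are illustrative.
import rclpy
from rclpy.node import Node
from custom_ros_messages.srv import Query, UpdateTrackedObject

class VisionPipelineClient(Node):
    def __init__(self):
        super().__init__("vision_pipeline_client")
        self.update_client = self.create_client(UpdateTrackedObject, "vp_update_tracked_object")
        self.query_client = self.create_client(Query, "vp_query_tracked_objects")
        # Block until the vision pipeline node is up and offering its services.
        self.update_client.wait_for_service()
        self.query_client.wait_for_service()

    def track(self, track_string, action="add"):
        req = UpdateTrackedObject.Request()
        req.object = track_string
        req.action = action
        future = self.update_client.call_async(req)
        rclpy.spin_until_future_complete(self, future)
        return future.result()

    def query(self, track_string):
        req = Query.Request()
        req.query = track_string
        future = self.query_client.call_async(req)
        rclpy.spin_until_future_complete(self, future)
        return future.result()

def main():
    rclpy.init()
    client = VisionPipelineClient()
    client.track("drill", action="add")
    print(client.query("drill"))
    rclpy.shutdown()

if __name__ == "__main__":
    main()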