Open-Set Segmentation

Setting Up

The open-set segmentation interface works with and without ROS. For working with ROS, we assume you have already built your catkin workspace with this repository in it beforehand (i.e., by running catkin build).

Note
If you intend only to use the open-set segmentation interface, you may want to turn off building against TensorRT, which you can do by the following:
catkin config -a -DSEMANTIC_INFERENCE_USE_TRT=OFF

Installing

We assume you are using a virtual environment. You may want to install virtualenv (usually sudo apt install python3-virtualenv) if you haven't already. To set up a virtual environment for use with ROS:

python3 -m virtualenv -p /usr/bin/python3 --system-site-packages /desired/path/to/environment

Otherwise, omit the --system-site-packages option:

python3 -m virtualenv -p /usr/bin/python3 --download /desired/path/to/environment

⚠️ Warning
Note that newer versions of setuptools are incompatible with --system-site-packages on Ubuntu 20.04. Do not use --download and --system-site-packages and expect the installation to work (specifically with external packages specified by git url).

Then, install semantic_inference

cd /path/to/repo
source /path/to/environment
pip install ./semantic_inference[openset]  # note that the openset extra is required for open-set semantic segmentation

You may see dependency version errors from pip if installing into an environment created with --system-site-packages. This is expected. Other open-set segmentation features require other extras (e.g., running with SAM instead of FastSAM requires the sam extra, and using dense language feature interpolation requires f3rm as an extra). It is also possible to install via an editable install (i.e., by using -e when running pip install).

Models

Note that both CLIP and FastSAM automatically download the relevant model weights when they are first run. Running with the original SAM may require downloading the model weights. See the official SAM repository here for more details.

Using open-set segmentation online

To use the open-set segmentation as part of a larger system, include openset_segmentation.launch in your launch file. Often this will look like this:

<launch>

    <!-- ... rest of launch file ... -->

    <remap from="semantic_inference/color/image_raw" to="YOUR_INPUT_TOPIC_HERE"/>
    <include file="$(find semantic_inference_ros)/launch/openset_segmentation.launch"/>

</launch>

Note there are some arguments you may want to specify when including openset_segmentation.launch that are not shown here, specifically the configuration for the model to use.

Pre-generating semantics

It is also possible to pre-generate semantics when working with recorded data. To create a rosbag containing the original bag contents plus the resulting open-set segmentation, run the following

rosrun semantic_inference_ros make_rosbag --copy /path/to/input_bag      \
                                          /color_topic:/output_topic     \
                                          -o /path/to/desired/output_bag

replacing /color_topic and /output_topic with appropriate topic names (usually /camera_name/color/image_raw and /camera_name/semantic/image_raw).

Additional options exist. Running without --copy will output just the open-set segmentation at the path specified by -o. If no output path is specified, the semantics will be added in-place to the bag after a confirmation prompt (you can disable the prompt with -y). Additional information and documentation is available via --help.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

open_set.md

open_set.md

Open-Set Segmentation

Setting Up

Installing

Models

Using open-set segmentation online

Pre-generating semantics

Files

open_set.md

Latest commit

History

open_set.md

File metadata and controls

Open-Set Segmentation

Setting Up

Installing

Models

Using open-set segmentation online

Pre-generating semantics