Semantic Segmentation Mode

In semantic segmentation mode, the model classifies each pixel in an image into a predefined set of classes. By defining a set of classes during the dataset annotation process, the user can train the model to recognize and segment different regions of interest within an image.

Dataset Creation

To start creating a dataset for semantic segmentation, follow these steps:

Click the View Collections button on the Home page.
Click the Add a new Segmentation dataset button to create a new dataset.
Fill the appropriate information for your dataset, such as name, description and register your classes.
Upload images to the dataset by clicking on the Upload Images region.
Press Submit and Open Button to proceed with dataset annotation.

The dataset creation page will look like the following:

CVAT

Annotation Process

Once the dataset is created, you can start annotating the images. The annotation interface provides various tools to facilitate the annotation process:

Draw mask Tool: Allows users to paint over regions of interest using a brush to assign them to a specific class.
Eraser Tool: Enables users to remove parts of the mask that were incorrectly annotated.
Polygon/Rectangle Selection Tool: Enables users to create precise masks by outlining the area of interest with polygons or rectangles.

CVAT

Important Note: For Semantic Segmentation, the Draw mask Tool is primarily used especially when dealing with complex shapes and regions.

Once you navigate through all images and complete the annotations, click the Menu button on the top left corner and change the job state to completed. Once this process is done, you will be able to see the dataset ready for training in the Collections page for the corresponding task type.

Training Initiation

After completing the annotation process, you can initiate the training of your semantic segmentation model. Follow these steps:

Navigate to the Train an ML model page.
Select the Semantic Segmentation mode.

There are two main fields that need to be filled:

Select Model: Choose the model architecture you want to use for training.
Select Collection: Provide a name for your model.

For the available models, you can find information about them by clicking on the Currently Available models arrow and click on the preferred model to see more details about its architecture.

Inference Process

Once the model is trained, you can use it for inference on new images. To perform inference, follow these steps:

Navigate to the Inference page.
Select the trained semantic segmentation model from the list.
Upload the image you want to perform inference on.
Click the Run Inference button to see the segmentation results.

Once the Inference process is complete, the segmented image will be displayed directly into the same page, showing the different regions classified according to the predefined classes.

CVAT