Bounding Box (Rectangle) Tool
In this guide, you'll learn how to use bounding box tool in the Supervisely image labeling toolbox and explore additional features that can help streamline your annotation experience.
Last updated
In this guide, you'll learn how to use bounding box tool in the Supervisely image labeling toolbox and explore additional features that can help streamline your annotation experience.
Last updated
The Bounding Box is a fundamental tool in Computer Vision used for image annotation, object detection and tracking tasks.
Rectangles that precisely enclose an object are commonly referred to as bounding boxes. These boxes are defined by coordinates that indicate the position and size of the object. Coordinates are usually specified as the top-left and bottom-right corners (or alternatively, by the center point, width, and height).
The main goal of a bounding box is to provide a clear boundary that the machine learning model can use to identify and locate objects in images or videos.
Watch our 5-minute video tutorial that provides clear and simple instructions on how to create and use Bounding Boxes for image annotation in Supervisely Image Labeling Toolbox.
We'll explore how to create and manually apply bounding boxes, including advanced features and tips for efficient and accurate labeling.
You can create a new class directly from the Annotation Toolbox. To do this:
Click the Rectangle Icon in the toolbar of the labeling interface.
Or select an existing object class or add a new class by clicking Add new class definition.
In the modal window, enter the class name, choose the Bounding Box or Any shape, and configure additional settings (e.g., color, hotkeys).
Click the Create
button to add the new class to the definitions list.
Select the newly created class and put a bounding box around the object in the image or video.
Select the object of interest in the image or video and place a rectangle around it.
Click to set the first point (top-left corner), then drag the rectangle to the bottom-right corner.
To edit the bounding box, drag its edges or use hotkeys for precise positioning.
You can quickly move to the next object without pressing the SPACE bar to finish annotating the previous object; just set new points.
Always aim to create bounding boxes that tightly fit around the object to minimize background noise and enhance model performance.
Use Auto-select to switch between classes by hovering over the desired object. Also easily edit existing bounding boxes, including those predicted by Neural Networks.
Use the Object Color Randomizer to instantly change the color of any object with a single click, as often as you like. This is particularly useful if you have many objects of the same class, or if they are close together, so you can easily differentiate between bounding boxes.
Control the BBox tool more efficiently with HOTKEYS
.
Create & add point
Edit point
Drag
Drag bounding box
Drag bounding box
Alt + Arrow Keys
You can create new bounding box immediately after.
Bounding boxes can be seamlessly combined with both semi-automated and fully automated object detection tools to improve and speed up your image annotation.
OWL-ViT (Vision Transformer for Open-World Localization) uses bounding boxes as a reference to improve object detection. Here’s how it works:
Reference Image Mode: Annotate an object with a bounding box in a reference image, and OWL-ViT will use this to identify similar objects in other images.
Text Prompt Mode: Define objects using text descriptions, and OWL-ViT will detect these objects across your dataset.
YOLOv8 offers a fully automated approach to object detection. It uses bounding boxes to quickly and accurately label objects in large datasets:
Automatic Detection: YOLOv8 generates bounding boxes around objects without the need for manual input.
Batch Processing: Apply the model to all images in a project for efficient pre-labeling.
For more information on using bounding boxes with OWL-ViT and YOLOv8 models, check out our comprehensive Bounding Box guide.
,
Alt + Hold
Zoom with Mouse wheel
. Hold to move scene.