Supervisely
AboutAPI ReferenceSDK Reference
  • 🤖What's Supervisely
  • 🚀Ecosystem of Supervisely Apps
  • 💡FAQ
  • 📌Getting started
    • How to import
    • How to annotate
    • How to invite team members
    • How to connect agents
    • How to train models
  • 🔁Import and Export
    • Import
      • Overview
      • Import using Web UI
      • Supported annotation formats
        • Images
          • 🤖Supervisely JSON
          • 🤖Supervisely Blob
          • COCO
          • Yolo
          • Pascal VOC
          • Cityscapes
          • Images with PNG masks
          • Links from CSV, TXT and TSV
          • PDF files to images
          • Multiview images
          • Multispectral images
          • Medical 2D images
          • LabelMe
          • LabelStudio
          • Fisheye
          • High Color Depth
        • Videos
          • Supervisely
        • Pointclouds
          • Supervisely
          • .PCD, .PLY, .LAS, .LAZ pointclouds
          • Lyft
          • nuScenes
          • KITTI 3D
        • Pointcloud Episodes
          • Supervisely
          • .PCD, .PLY, .LAS, .LAZ pointclouds
          • Lyft
          • nuScenes
          • KITTI 360
        • Volumes
          • Supervisely
          • .NRRD, .DCM volumes
          • NIfTI
      • Import sample dataset
      • Import into an existing dataset
      • Import using Team Files
      • Import from Cloud
      • Import using API & SDK
      • Import using agent
    • Migrations
      • Roboflow to Supervisely
      • Labelbox to Supervisely
      • V7 to Supervisely
      • CVAT to Supervisely
    • Export
  • 📂Data Organization
    • Core concepts
    • MLOps Workflow
    • Projects
      • Datasets
      • Definitions
      • Collections
    • Team Files
    • Disk usage & Cleanup
    • Quality Assurance & Statistics
      • Practical applications of statistics
    • Operations with Data
      • Data Filtration
        • How to use advanced filters
      • Pipelines
      • Augmentations
      • Splitting data
      • Converting data
        • Convert to COCO
        • Convert to YOLO
        • Convert to Pascal VOC
    • Data Commander
      • Clone Project Meta
  • 📝Labeling
    • Labeling Toolboxes
      • Images
      • Videos 2.0
      • Videos 3.0
      • 3D Point Clouds
      • DICOM
      • Multiview images
      • Fisheye
    • Labeling Tools
      • Navigation & Selection Tools
      • Point Tool
      • Bounding Box (Rectangle) Tool
      • Polyline Tool
      • Polygon Tool
      • Brush Tool
      • Mask Pen Tool
      • Smart Tool
      • Graph (Keypoints) Tool
      • Frame-based tagging
    • Labeling Jobs
      • Labeling Queues
      • Labeling Consensus
      • Labeling Statistics
    • Labeling with AI-Assistance
  • 🤝Collaboration
    • Admin panel
      • Users management
      • Teams management
      • Server disk usage
      • Server trash bin
      • Server cleanup
      • Server stats and errors
    • Teams & workspaces
    • Members
    • Issues
    • Guides & exams
    • Activity log
    • Sharing
  • 🖥️Agents
    • Installation
      • Linux
      • Windows
      • AMI AWS
      • Kubernetes
    • How agents work
    • Restart and delete agents
    • Status and monitoring
    • Storage and cleanup
    • Integration with Docker
  • 🔮Neural Networks
    • Overview
    • Inference & Deployment
      • Overview
      • Supervisely Serving Apps
      • Deploy & Predict with Supervisely SDK
      • Using trained models outside of Supervisely
    • Model Evaluation Benchmark
      • Object Detection
      • Instance Segmentation
      • Semantic Segmentation
      • Custom Benchmark Integration
    • Custom Model Integration
      • Overview
      • Custom Inference
      • Custom Training
    • Legacy
      • Starting with Neural Networks
      • Train custom Neural Networks
      • Run pre-trained models
  • 👔Enterprise Edition
    • Get Supervisely
      • Installation
      • Post-installation
      • Upgrade
      • License Update
    • Kubernetes
      • Overview
      • Installation
      • Connect cluster
    • Advanced Tuning
      • HTTPS
      • Remote Storage
      • Single Sign-On (SSO)
      • CDN
      • Notifications
      • Moving Instance
      • Generating Troubleshoot Archive
      • Storage Cleanup
      • Private Apps
      • Data Folder
      • Firewall
      • HTTP Proxy
      • Offline usage
      • Multi-disk usage
      • Managed Postgres
      • Scalability Tuning
  • 🔧Customization and Integration
    • Supervisely .JSON Format
      • Project Structure
      • Project Meta: Classes, Tags, Settings
      • Tags
      • Objects
      • Single-Image Annotation
      • Single-Video Annotation
      • Point Cloud Episodes
      • Volumes Annotation
    • Developer Portal
    • SDK
    • API
  • 💡Resources
    • Changelog
    • GitHub
    • Blog
    • Ecosystem
Powered by GitBook
On this page
  • What is Visual Object Tracking?
  • Deploying an AI tracking model
  • Some of the integrated AI tracking models:
  • Types of Object Tracking
  • Single Object Tracking (SOT)
  • Multiple Object Tracking (MOT)
  • Semi-Supervised Class-Agnostic Multiple Object Tracking
  • Video Object Segmentation (VOS)

Was this helpful?

  1. Labeling
  2. Labeling Toolboxes

Video tracking

Last updated 8 months ago

Was this helpful?

One of the most common tasks in the video labeling toolbox is the video tracking. Supervisely has the top AI models and automatic video annotation tools you can use to efficiently track objects on videos.

What is Visual Object Tracking?

Visual Object Tracking involves predicting the position of a target object in each frame of a video. The primary subtasks in Visual Object Tracking include:

  • Single Object Tracking (SOT)

  • Multiple Object Tracking (MOT)

  • Semi-Supervised Class-Agnostic Multiple Object Tracking

  • Video Object Segmentation (VOS)

Why video object tracking is challenging

Creating custom datasets for tracking is labor-intensive. For example, a one-hour video at 24 frames per second contains 86,400 frames. If each frame contains 8-12 objects, this results in about a million objects to track. Automating this process with AI models and tools can significantly reduce the workload.

Deploying an AI tracking model

Apart from other solutions, Supervisely is built like an OS. Instead of having a fixed number of video tracking algorithms, we provide a constantly growing Ecosystem of the best models. Pick the one you like, deploy it on your agent, select it in the Track Settings panel — and enjoy!

For our Community Edition users we have a MixFormer model deployed on our GPUs and shared with everyone — but you can always run an agent on your computer and deploy any other model you like!

Some of the integrated AI tracking models:

Types of Object Tracking

Single Object Tracking (SOT)

SOT involves tracking one object throughout the video based on a manual annotation on the first frame. The annotation, called a template, is used by a neural network to locate the object in subsequent frames. These models are class-agnostic, meaning they can track any object based on the initial annotation.

Models and Tools:

How to use:

  1. Label the first frame: Annotate the target object with a bounding box on the first frame.

  2. Track automatically: Use a class-agnostic neural network to track the object in subsequent frames automatically.

Multiple Object Tracking (MOT)

MOT detects and tracks multiple objects of predefined classes, estimating their trajectories. The process involves two steps: detecting objects in each frame and associating detections across frames to form tracklets. This approach is known as the Tracking-by-Detection paradigm.

Models and Tools:

  • YOLO

  • DeepSort Algorithm

How to use:

  1. Object detection: Use a detection model like YOLOv5 to predict bounding boxes on each frame.

  2. Tracking algorithm: Apply a tracking algorithm (e.g., DeepSort) to link detections and form object trajectories.

Semi-Supervised Class-Agnostic Multiple Object Tracking

This approach combines the simplicity of SOT with the ability to track multiple objects. By applying an SOT model to each object on the first frame, users can track and correct multiple objects simultaneously, enhancing annotation speed and accuracy.

How to use:

  1. Apply SOT to each object: Use an SOT model to track each object from the first frame.

  2. Correct and re-track: Correct tracking predictions and re-track objects as needed.

Video Object Segmentation (VOS)

VOS tracks objects in videos using masks instead of bounding boxes. The user labels the object mask on the first frame, and the model segments and tracks the object in subsequent frames.

Models and Tools:

How to use with steps:

Step 1. Connect GPU to Supervisely:

  • Use a single command in the terminal to connect your personal computer with GPU to the Supervisely account.

Step 2. Run Segment Anything Model (SAM):

  • Deploy the SAM for fast object segmentation on the first frame.

  • Start the app, select a pre-trained model, and deploy via the GUI.

  • Interactive segmentation allows users to provide feedback to the model by marking positive and negative points on the image.

Step 3. Run XMem Model:

  • Serve the XMem model on a computer with GPU.

  • Go to the Neural Networks page and select the XMem model for video segmentation.

Step 4. Segment and Track Objects on Videos:

  • Use the video labeling toolbox to select objects and track masks in subsequent frames.

  • Apply the tracking algorithm and review results.

Check out our tutorial or or watch this 5-minute video to learn what object tracking is and how to track objects in your videos with the best models and tools.

Supervisely Apps: ,

Watch this

📝
TransT object tracking (CVPR2021)
MixFormer object tracking (CVPR2022)
PIPs object tracking
TAP-Net object tracking
CoTracker object tracking
MixFormer object tracking (CVPR2022)
TransT object tracking (CVPR2021)
Supervisely Video Labeling Toolbox
Serve YOLO v5
Apply NN to Videos Project
Segment Anything 2 Model
XMem Video Object Segmentation Model
1.5-minute how-to video.
Complete Guide to Object Tracking: Best AI Models, Tools and Methods in 2023 - SuperviselySupervisely
Logo