Supervisely
AboutAPI ReferenceSDK Reference
  • 🤖What's Supervisely
  • 🚀Ecosystem of Supervisely Apps
  • 💡FAQ
  • 📌Getting started
    • How to import
    • How to annotate
    • How to invite team members
    • How to connect agents
    • How to train models
  • 🔁Import and Export
    • Import
      • Overview
      • Import using Web UI
      • Supported annotation formats
        • Images
          • 🤖Supervisely JSON
          • 🤖Supervisely Blob
          • COCO
          • Yolo
          • Pascal VOC
          • Cityscapes
          • Images with PNG masks
          • Links from CSV, TXT and TSV
          • PDF files to images
          • Multiview images
          • Multispectral images
          • Medical 2D images
          • LabelMe
          • LabelStudio
          • Fisheye
          • High Color Depth
        • Videos
          • Supervisely
        • Pointclouds
          • Supervisely
          • .PCD, .PLY, .LAS, .LAZ pointclouds
          • Lyft
          • nuScenes
          • KITTI 3D
        • Pointcloud Episodes
          • Supervisely
          • .PCD, .PLY, .LAS, .LAZ pointclouds
          • Lyft
          • nuScenes
          • KITTI 360
        • Volumes
          • Supervisely
          • .NRRD, .DCM volumes
          • NIfTI
      • Import sample dataset
      • Import into an existing dataset
      • Import using Team Files
      • Import from Cloud
      • Import using API & SDK
      • Import using agent
    • Migrations
      • Roboflow to Supervisely
      • Labelbox to Supervisely
      • V7 to Supervisely
      • CVAT to Supervisely
    • Export
  • 📂Data Organization
    • Core concepts
    • MLOps Workflow
    • Projects
      • Datasets
      • Definitions
      • Collections
    • Team Files
    • Disk usage & Cleanup
    • Quality Assurance & Statistics
      • Practical applications of statistics
    • Operations with Data
      • Data Filtration
        • How to use advanced filters
      • Pipelines
      • Augmentations
      • Splitting data
      • Converting data
        • Convert to COCO
        • Convert to YOLO
        • Convert to Pascal VOC
    • Data Commander
      • Clone Project Meta
  • 📝Labeling
    • Labeling Toolboxes
      • Images
      • Videos 2.0
      • Videos 3.0
      • 3D Point Clouds
      • DICOM
      • Multiview images
      • Fisheye
    • Labeling Tools
      • Navigation & Selection Tools
      • Point Tool
      • Bounding Box (Rectangle) Tool
      • Polyline Tool
      • Polygon Tool
      • Brush Tool
      • Mask Pen Tool
      • Smart Tool
      • Graph (Keypoints) Tool
      • Frame-based tagging
    • Labeling Jobs
      • Labeling Queues
      • Labeling Consensus
      • Labeling Statistics
    • Labeling with AI-Assistance
  • 🤝Collaboration
    • Admin panel
      • Users management
      • Teams management
      • Server disk usage
      • Server trash bin
      • Server cleanup
      • Server stats and errors
    • Teams & workspaces
    • Members
    • Issues
    • Guides & exams
    • Activity log
    • Sharing
  • 🖥️Agents
    • Installation
      • Linux
      • Windows
      • AMI AWS
      • Kubernetes
    • How agents work
    • Restart and delete agents
    • Status and monitoring
    • Storage and cleanup
    • Integration with Docker
  • 🔮Neural Networks
    • Overview
    • Inference & Deployment
      • Overview
      • Supervisely Serving Apps
      • Deploy & Predict with Supervisely SDK
      • Using trained models outside of Supervisely
    • Model Evaluation Benchmark
      • Object Detection
      • Instance Segmentation
      • Semantic Segmentation
      • Custom Benchmark Integration
    • Custom Model Integration
      • Overview
      • Custom Inference
      • Custom Training
    • Legacy
      • Starting with Neural Networks
      • Train custom Neural Networks
      • Run pre-trained models
  • 👔Enterprise Edition
    • Get Supervisely
      • Installation
      • Post-installation
      • Upgrade
      • License Update
    • Kubernetes
      • Overview
      • Installation
      • Connect cluster
    • Advanced Tuning
      • HTTPS
      • Remote Storage
      • Single Sign-On (SSO)
      • CDN
      • Notifications
      • Moving Instance
      • Generating Troubleshoot Archive
      • Storage Cleanup
      • Private Apps
      • Data Folder
      • Firewall
      • HTTP Proxy
      • Offline usage
      • Multi-disk usage
      • Managed Postgres
      • Scalability Tuning
  • 🔧Customization and Integration
    • Supervisely .JSON Format
      • Project Structure
      • Project Meta: Classes, Tags, Settings
      • Tags
      • Objects
      • Single-Image Annotation
      • Single-Video Annotation
      • Point Cloud Episodes
      • Volumes Annotation
    • Developer Portal
    • SDK
    • API
  • 💡Resources
    • Changelog
    • GitHub
    • Blog
    • Ecosystem
Powered by GitBook
On this page
  • Open the project and navigate to the Filter tab
  • Create custom conditional filters
  • Video tutorial
  • Tracking image status in labeling jobs
  • Labeling job management
  • Data operations: copy and move
  • Use сases for advanced dataset filters
  • 1. Find unique data with custom filters
  • 2. Manage huge datasets at any scale with ease
  • 3. Use filters to explore model predictions
  • 4. Labeling job tracking for team collaboration

Was this helpful?

  1. 📂Data Organization
  2. Operations with Data
  3. Data Filtration

How to use advanced filters

PreviousData FiltrationNextPipelines

Last updated 9 months ago

Was this helpful?

Open the project and navigate to the Filter tab

Open the images project you are working on and navigate to the Filter tab. Here, you can search among all images, subsample the desired dataset, configure filters, and preview filtering results.

Create custom conditional filters

Click the Filter button to open a modal window where you can customize your filters based on various criteria. Fine-tune your view by including or excluding specific criteria, such as conditions on the number of objects. For example, you can filter images with the tag validation that contain more than 5 objects of the class plant.

Let's create custom filters step-by-step

  1. Open filter configuration: Click the Filter button to open the filter configuration modal.

  2. Set criteria: Choose criteria based on datasets, names, image tags, object classes, assignees, issues, or labeling job status.

  3. Fine-tune filters: Include or exclude specific conditions, such as the number of objects within a specified range.

  4. Apply filters: Click the Apply button to retrieve and preview the data that meets your specified conditions.


Video tutorial

In this 3-minute video tutorial, you will learn how to use advanced conditional filters for Computer Vision datasets, helping you search, filter, and explore annotated images of any size.

Tracking image status in labeling jobs

Users can easily view the status of each image within a labeling job, simplifying the monitoring of annotation progress. By clicking on the status of a labeling job, users can access statistics on job activity, labeling time per object, and total labeling time. Detailed statistics are available for individual images, including labeler time in the tool, editing durations, and object counts. This comprehensive information ensures thorough tracking and analysis of the annotation process, enabling managers to accurately assess their annotation workflow.

Labeling job management

When selecting multiple images, users can quickly create labeling jobs or delete unnecessary data, allowing for more precise data selection for labeling.

Data operations: copy and move

Users can easily copy or move images with annotations from filtering results to other datasets, streamlining training data management.

Use сases for advanced dataset filters

Explore some illustrative examples and common use cases of how filters can be used in real projects.

1. Find unique data with custom filters

You can easily find unique data by combining different filters. Let's look at an example using the Pascal VOC dataset. Suppose we need to find images from the "train" dataset that contain both buses and cars. The number of buses in the image should not exceed 5, and the number of cars should be 1 or more. Here are the steps you can follow:

  1. Apply a simple filter to subsample images from the "train" dataset.

  2. Add a filter to find images with buses and set the maximum number of buses in the image to 5.

  3. Add another filter to find images with cars and set the minimum number of cars in the image to 1.

  4. Press the Apply button to retrieve the data that meets all of the specified conditions and to view the filter results.

2. Manage huge datasets at any scale with ease

Conditional dataset filters make managing large datasets simple. These filters help you to explore and identify images that can be merged into a new labeling job, moved or submitted to additional review and verification. Instead of using API and writing custom Python scripts, data annotation managers can quickly configure custom conditions and integrate them into their labeling pipelines in a few clicks.

Creating a new labeling job from unannotated images

3. Use filters to explore model predictions

Data scientists can use custom filters to evaluate predictions from custom Neural Networks. For example, Supervisely users can easily apply custom object detection model, save model predictions and further use them for analysis or as an initial data prelabeling. In that case, every object (bounding box) will have the tag Confidence with some value from 0 to 1. Thus you can create custom filter to find all images with the most or less confident predictions (e.g. "Confidence < 0.5"). Analyzing bounding boxes with low confidence levels can help to gain insights and better understand model performance and uncover the ways to improve it.

Or developers can leverage the Supervisely platform for exploratory data analysis, finding data outliers or possible errors in training data. It can be used to improve adaptive learning strategies by dynamically filtering images based on model feedback, performance metrics, or user-defined criteria to iteratively sample, annotate, and improve model performance.

Also filters can be utilized to make smaller training datasets from the large ones for model evaluation also known K-fold cross-validation training technique.

Searching for images in the training dataset with a low threshold and removing them

4. Labeling job tracking for team collaboration

Efficient team collaboration requires streamlined processes for tracking and managing annotation tasks. Supervisely's advanced dataset filtering capabilities optimize the job tracking process. By filtering datasets based on labeling jobs and their status and combining with other conditional filters, you can quickly find relevant images, explore unusual patters in your annotations, find hidden mistakes and perform simple yet effective quality assurance operations. This also improves visibility of job progress and ensures quick access to job activity and statistics.

Labeling job statuses indicates the progress of the annotation or the quality of the data annotation: for example, images with rejected status may require review or correction. Sorting by these statuses helps to quickly identify and respond to potential data quality issues. Optimized job tracking through dataset filtering enables your team to be more productive, resulting in faster annotation project completion.

Combining filters to search for object classes "bus" and "car" in specific quantities