CtrlK
BlogDocsLog inGet started
Tessl Logo

roboflow-data-management

Use when uploading images, labeling, organizing datasets, creating Roboflow projects (detection/segmentation/keypoint/classification), tags, splits, versions, or RoboQL search.

77

1.40x
Quality

66%

Does it follow best practices?

Impact

94%

1.40x

Average score across 3 eval scenarios

SecuritybySnyk

Advisory

Suggest reviewing before use

Optimize this skill with Tessl

npx tessl skill review --optimize ./skills/data-management/SKILL.md
SKILL.md
Quality
Evals
Security

For agents — source-of-truth: This skill is authored in roboflow/computer-vision-skills and shipped with the Roboflow plugin. If your client has loaded the plugin (you'll see roboflow:<name> skills in your available skills list), use those local skills — they're read fresh from disk every session. The same content served as MCP resources at roboflow://skills/<name>/... is a fallback for clients without the plugin and may lag this repo. Don't call ReadMcpResourceTool for roboflow://skills/... URIs when a local roboflow:<name> skill is available.

Data Management on Roboflow

Project Types

TypeAnnotation FormatUse Case
Object DetectionBounding box (polygon/mask auto-converted)Locate objects with boxes
Instance SegmentationPolygon, MaskPixel-level per-object boundaries
Semantic SegmentationPolygon, MaskPixel-level class regions
Keypoint DetectionKeypoints (skeleton)Pose/skeleton estimation
Single-Label ClassificationImage-level label (no drawn annotations)One class per image
Multi-Label ClassificationImage-level labelsMultiple classes per image

Project type is set at creation and cannot be changed later.

Uploading Data

Methods

MethodBest ForFormats
Web UI drag-and-drop< 1,000 imagesJPG, PNG, WEBP, AVIF, BMP, MOV, MP4, PDF + 40+ annotation formats
CLI (roboflow import)> 1,000 images (images only)Same image formats, no video
Dataset Upload Workflow BlockCollecting from production WorkflowsProgrammatic
Universe forkStarting from a public datasetAny Universe dataset

Limits: Max 20 MB per image, max 16,400 x 10,900 px. Duplicate images are skipped automatically.

Video Upload

Videos are split into frames at a configurable rate (1 frame/60s to 60 fps). Supported formats depend on browser (MP4 H.264 most compatible).

CLI Upload

pip install roboflow
roboflow import -w <workspace> -p <project-id> /path/to/dataset

Tags

Tags are free-form labels on images for organization and filtering.

ActionHow
Add during uploadTag selector in upload dialog or via API
Add to existing imagesSelect images -> "Images Selected" -> "Apply tags"
Rename/delete in bulkProject Settings -> Tags -> "Modify Tags"
Filter by tagSearch with tag:<name> or use Assign page filter
Use in versions"Filter by Tag" preprocessing step (require/exclude/allow)

Dataset Search (RoboQL)

Search images via the Images page search bar. Combine filters with boolean logic.

Filters

FilterExampleDescription
(free text)person on sidewalkSemantic search (CLIP-based)
like-image:<ID>like-image:abc123Find visually similar images
filename:filename:*factory*Filename match (* for partial)
tag:tag:factoryFilter by tag
split:split:trainFilter by split
job:job:<JOB_ID>Filter by annotation job
class:class:helmetHas annotation with class
metadata:metadata:key=valueFilter by user metadata
project:project:my-projectFilter by project (workspace search)
sort:sort:updatedSort results
min-width: / max-width:min-width:1000Image dimension filters
min-height: / max-height:max-height:800Image dimension filters
min-annotations: / max-annotations:max-annotations:1Annotation count filters

Boolean Logic

  • AND, OR, NOT, parentheses: class:helmet AND NOT (tag:v1 OR tag:v2)
  • Inverted filter with -: -class:vest
  • Comparison operators on numeric filters: >, <, >=, <=, = (e.g., class:helmet>=3)

Splits (Train / Valid / Test)

Images are assigned to train, valid, or test splits. Splits are rebalanced during version generation (Step 2 in version creation). Augmentations only apply to train split.

Dataset Versions

A version is a frozen snapshot of the dataset at a point in time. Changes to the project after version creation do not affect existing versions.

Version Creation Pipeline

  1. Source selection — images from the dataset split
  2. Train/Test split — rebalance percentages
  3. Preprocessing — applied to all splits (train + valid + test)
  4. Augmentation — applied only to train split
  5. Generate — creates immutable version

Preprocessing Options

StepEffect
Auto-OrientStrips EXIF, normalizes orientation
ResizeStretch to / Fit within / Fit (black edges) / Fit (white edges)
GrayscaleConvert RGB to single channel
Auto-Adjust ContrastContrast Stretching / Histogram Equalization / Adaptive (CLAHE)
Isolate ObjectsCrop each bbox into separate image (converts OD to classification)
Static CropCrop all images to fixed region
TileSplit images into NxN grid (default 2x2, helps small object detection)
Dynamic CropCrop images around a specific class
Modify ClassesRemap/omit classes for this version only
Filter NullControl percentage of unannotated images
Filter by TagRequire / Exclude / Allow images by tag
Random SampleSample a percentage of images per split

Augmentation Options

Applied to train images only. Configurable max version size (e.g., 3x = source + 2 augmented copies).

AugmentationImage LevelBBox LevelTier
FlipyesyesBasic
90 deg RotateyesyesBasic
CropyesyesBasic
RotationyesyesBasic
ShearyesyesBasic
GrayscaleyesnoBasic
HueyesnoBasic
SaturationyesnoBasic
BrightnessyesyesBasic
ExposureyesyesBasic
BluryesyesBasic
NoiseyesyesBasic
Camera GainyesyesBasic
Motion BluryesyesBasic
CutoutyesnoEnhanced (paid)
MosaicyesnoEnhanced (paid)

Dataset Analytics

Available at project sidebar -> "Analytics". Shows:

  • Image count, annotation count, avg image size, median aspect ratio
  • Missing and null annotation counts
  • Class distribution across train/valid/test
  • Image dimension insights (size + aspect ratio distribution)
  • Annotation heatmap (click-drag to filter images by region)
  • Object count histogram (click bars to see matching images)

Classes

Managed at Project Settings -> Classes.

ActionDescription
RenameType new name in Override column
MergeOverride multiple classes to same name
DeleteCheck Delete checkbox
Lock"Lock Annotation Classes" prevents new class creation

Warning: Class changes at project level affect all images (irreversible). Use version-level "Modify Classes" preprocessing for non-destructive changes.

Annotation Groups

Annotation group = the category encompassing all classes in a project. Projects sharing the same annotation group share their class list and annotations.

  • Enable during project creation: "Share image annotations with other projects"
  • Shared annotations: editing in one project affects all linked projects
  • Look for chain-link icon to identify shared images/projects
  • Images shared across projects count only once toward usage

Project Folders

Folders group projects for organization. SSO workspaces can restrict folder access to specific team members.

ActionHow
Create"+ New Folder" from workspace view
Move projectProject menu -> "Move Project"
Delete folderFolder menu -> "Delete" (projects move to workspace root, not deleted)

Export Formats

Versions can be exported as .zip download or curl command. 40+ formats supported including COCO, YOLO, Pascal VOC, TFRecord, and more. Full list at roboflow.com/formats.

Export via Python SDK:

project.version(1).download("yolov8")

MCP apps vs plain tools

Prefab MCP apps (create_project_app) exist when parameters are unclear, you need real UX, or a human must confirm after seeing form fields — plain chat/MCP calls should not guess project type and license alone.

MCP Tools Available

ToolPurpose
projects_createCreate a new project (specify type, annotation group)
projects_list / projects_getList or get project details
images_searchSearch images using RoboQL filters
images_prepare_uploadPrepare image upload to a project
versions_generateGenerate a dataset version with preprocessing/augmentation
versions_getInspect a version
versions_exportExport a version in a given format

Related Pages

  • roboflow://skills/roboflow-labeling/SKILL — annotation tools, AI labeling, Label Assist, Smart Polygon, Auto Label, annotation jobs
Repository
roboflow/computer-vision-skills
Last updated
Created

Is this your skill?

If you maintain this skill, you can claim it as your own. Once claimed, you can manage eval scenarios, bundle related skills, attach documentation or rules, and ensure cross-agent compatibility.