senior-computer-vision

Computer vision engineering skill for object detection, image segmentation, and visual AI systems. Covers CNN and Vision Transformer architectures, YOLO/Faster R-CNN/DETR detection, Mask R-CNN/SAM segmentation, and production deployment with ONNX/TensorRT. Includes PyTorch, torchvision, Ultralytics, Detectron2, and MMDetection frameworks. Use when building detection pipelines, training custom models, optimizing inference, or deploying vision systems.

1.37x

Quality

78%

Does it follow best practices?

Impact

92%

1.37x

Average score across 6 eval scenarios

Securityby

Passed

No known issues

Optimize this skill with Tessl

npx tessl skill review --optimize ./engineering-team/senior-computer-vision/SKILL.md

Evaluation results

82%

76%

Aerial Surveillance Dataset Preparation

Dataset preparation pipeline

Criteria

Without context

With context

Uses pipeline script

100%

COCO output format

100%

Correct split ratios

100%

Stratified split

100%

Seed 42

100%

Mosaic augmentation

100%

Mixup augmentation

100%

Cutout parameters

100%

Horizontal flip probability

100%

Rotate limit

Brightness/contrast limits

Hue saturation values

pycocotools verification

100%

25%

Retail Shelf Detection System

Architecture selection and training setup

Criteria

Without context

With context

YOLOv8 or RT-DETR chosen

100%

Uses vision_model_trainer.py

100%

YOLO CLI training command

100%

mAP@50 threshold

100%

mAP@50:95 threshold

100%

Precision threshold

100%

Recall threshold

100%

Inference time target

100%

CNN vs ViT data reasoning

100%

Detection task flag

100%

97%

30%

Multi-Target Model Deployment

Model optimization and deployment

Criteria

Without context

With context

Uses inference_optimizer.py

100%

Baseline benchmark step

70%

100%

Benchmark parameters

62%

100%

Dynamic batch flag

37%

100%

Simplify flag

50%

100%

ONNX verification

100%

Cloud path: TensorRT FP16

100%

Intel path: OpenVINO

100%

ONNX intermediary for Intel

100%

INT8 calibration samples

100%

Opset version 17

100%

50%

97%

Warehouse Visual Scene Understanding System

Segmentation architecture selection

Criteria

Without context

With context

Instance seg architecture

100%

Real-time instance choice

100%

Semantic seg architecture

100%

SegFormer speed advantage

100%

62%

SAM for zero-shot

100%

SAM prompt types

100%

segment-anything package

100%

Mask R-CNN quality trade-off

100%

mmsegmentation or torchvision

100%

Architecture decision format

100%

Instance vs semantic distinction

100%

93%

15%

Retail Store Customer Flow Analytics

Video tracking pipeline

Criteria

Without context

With context

ByteTrack or SORT

100%

YOLOv8 detector

100%

Real-time FPS target

100%

Latency P99 target

50%

GPU memory target

100%

Model size target

57%

Persistent track IDs

100%

Video capture loop

100%

Detection then tracking flow

100%

Script is executable

100%

FPS measurement

100%

84%

-1%

Aerial Wildlife Monitoring on Embedded Hardware

Small object detection and edge deployment

Criteria

Without context

With context

Edge architecture choice

58%

50%

SAHI or high-res for small objects

100%

CNN over ViT justification

100%

Edge FPS target

100%

Edge mAP@50 target

100%

Edge GPU memory target

100%

Edge model size target

100%

NVIDIA edge optimization path

Copy-paste for small objects

100%

Architecture plan document

100%

P2 FPN level or anchor adjustment

100%

Repository: alirezarezvani/claude-skills
Commit: f567c61

Evaluated: about 2 months ago
Agent: Claude Code
Model: Claude Sonnet 4.6

Table of Contents

Aerial Surveillance Dataset Preparation Retail Shelf Detection System Multi-Target Model Deployment Warehouse Visual Scene Understanding System Retail Store Customer Flow Analytics Aerial Wildlife Monitoring on Embedded Hardware

Is this your skill?

If you maintain this skill, you can claim it as your own. Once claimed, you can manage eval scenarios, bundle related skills, attach documentation or rules, and ensure cross-agent compatibility.