Training Your Own Models
Whether you're detecting objects, classifying scenes, or identifying unique patterns in your videos, Vidsy.ai lets you train your own models from scratch or on top of pre-trained ones.
This guide walks you through the entire process — from setting up an index to reviewing training results.
1. Create a New Project
- Launch Vidsy.ai.
- Open the Home tab and click on New to create a new project
- Open the Engine tab from the top menu.
- Click the Setup button.
- In the left column, click "Add new..." to create a new index.
An index is a database that stores the data and configurations for a single model. You can create multiple indexes.
- Fill in the required fields:
- Name the index.
- Set it as primary.
- Add a description.
The primary model is used as the entry point by the analysis engine. There can only be 1 entry point per project.
2. Configure the Model Pipeline
Next, set up the model’s purpose and behavior.
-
Choose the model purpose:
Locate
: Find object positions.Classify
: Label the entire image.Identify
: Recognize specific instances of objects.
-
Adjust the slider to balance between:
- Performance (speed, size)
- Accuracy
-
Enable detection options:
- Bounding boxes
- Keypoints
-
Choose a starting model:
- Pretrained (COCO 2017)
- Blank model
-
If using classification:
- Choose between single-label or multi-object detection.
-
Click Save to finish setting up the index and model.
3. Import Your Dataset
Vidsy supports the following dataset formats:
- COCO (JSON)
- Labeled folders (one folder per class)
- Previous Vidsy exports
To import:
- Open your index
- Click Import Dataset or drag and drop your dataset
4. Open the Trainer View
- Click the Chart button
in the interface.
- This switches the view from index content to Trainer View — a dedicated space for visualizing training metrics.
In Trainer View, you’ll see charts for accuracy, loss, learning rate, and more.
5. Configure and Start Training
-
Click the Play button ▶️ to open training options.
-
Choose a training strategy:
- Fine-tune the full model
- Train only the top layer
-
Training parameters (prefilled with defaults):
- Batch size
- Minimum epochs
- Optimizer, learning rate, momentum, decay
-
Prepare dataset (first time only):
- Vidsy will organize training and validation sets.
-
Optional:
- Clip and align keypoints (if available)
- Enable data augmentations:
- Rotate, scale, flip, brightness, contrast, saturation, hue, translateX/Y
- You can also set how many combinations per image are allowed
-
Click Start to begin training.
6. Monitor Training Progress
During training, you'll see:
- Accuracy/loss over time
- Validation results (data not seen during training)
- Live feedback in charts
⏱️ A dataset of ~600 images over 50 epochs typically trains in a few minutes on an NVIDIA 3070 GPU.
7. Save and Use Your Model
Once training finishes:
- Choose Save to keep the model.
- Or discard it if you’re not satisfied.
Saved models are automatically available inside the Vidsy.ai application for:
- Analyzing new videos
- Real-time camera feed processing
- Future fine-tuning
Models stay within Vidsy — there’s no need to export them.
Tips
- Use the Trainer View to understand training performance.
- You can retrain the model later with better data or more epochs.
- If you annotate incorrect predictions during use, Vidsy can incorporate them into retraining.