Workflow tutorial

ML Research Pipelines

An end-to-end tutorial: build a complete ML pipeline as a Connectify graph, from data loading through training to evaluation, then run two ablation variants and compare.

What you'll build

A CIFAR-10 image classification pipeline:

Dataset node loading CIFAR-10.
Two Logic augmentation nodes (random crop + horizontal flip).
A Model node training a small CNN.
An evaluation node reporting top-1 accuracy.
Two variants — Baseline and No augmentation — for comparison.

Estimated time: 10–15 minutes.

Step-by-step

Create a new project

From the Graphs Hub, click New graph. Name it "CIFAR-10 ablation". You'll land in the editor with one empty variant ("v1") and a blank canvas.
Add the Dataset node

Click + Node → Dataset. In the Inspector, set source to cifar-10 and split to train. Its single output port produces image data with a label sidecar.
Add augmentation

Add two Logic nodes — random_crop (padding=4) and horizontal_flip (p=0.5). Wire Dataset → random_crop → horizontal_flip. Both nodes pass image through unchanged.
Add the Model node

Click + Node → Model. Choose small_cnn from the model library. The model has two inputs (image, label) and one output (tensor for predictions). Wire augmented images and labels in.
Configure training

Click the Model node. In the Config tab set epochs to 10, lr to 1e-3, batch_size to 128. Leave the optimizer at the default (AdamW).
Add evaluation

Add a Logic → eval/top1_accuracy node. Wire the model's predictions and the labels in. Its output is a float — the metric you care about.
Run the baseline

Click Run. Watch edges animate as data flows. When the model finishes training, click the eval node and check Run Data — you should see something around 78–82% accuracy on a freshly initialized CNN.
Create the ablation variant

Click + Variant. Name it "No augmentation". In the new variant, delete both augmentation nodes and wire Dataset → Model directly.
Set the baseline and compare

Right-click the "v1" tab → Set as baseline. Switch to "No augmentation", click Run. When it finishes, open Compare → v1 (Baseline). You'll see the structural diff and the accuracy delta side by side.
Trace the result

On the baseline, switch to the Path tool and click the accuracy output. Save the path as "Top-1 → augmented input". Anyone opening the project later can click that chip to see exactly what produced the metric.

Going further

Add more variants for a learning-rate sweep, swap in a ResNet, or add a third augmentation. Variants are cheap — make as many as you need.

What you'll build

Step-by-step

Create a new project

Add the Dataset node

Add augmentation

Add the Model node

Configure training

Add evaluation

Run the baseline

Create the ablation variant

Set the baseline and compare

Trace the result

Related