ML Research Pipelines
An end-to-end tutorial: build a complete ML pipeline as a Connectify graph, from data loading through training to evaluation, then run two ablation variants and compare.
What you'll build
A CIFAR-10 image classification pipeline:
- Dataset node loading CIFAR-10.
- Two Logic augmentation nodes (random crop + horizontal flip).
- A Model node training a small CNN.
- An evaluation node reporting top-1 accuracy.
- Two variants — Baseline and No augmentation — for comparison.
Estimated time: 10–15 minutes.
Step-by-step
-
Create a new project
From the Graphs Hub, click New graph. Name it "CIFAR-10 ablation". You'll land in the editor with one empty variant ("v1") and a blank canvas.
-
Add the Dataset node
Click + Node → Dataset. In the Inspector, set
sourcetocifar-10andsplittotrain. Its single output port produces image data with a label sidecar. -
Add augmentation
Add two Logic nodes —
random_crop(padding=4) andhorizontal_flip(p=0.5). Wire Dataset → random_crop → horizontal_flip. Both nodes pass image through unchanged. -
Add the Model node
Click + Node → Model. Choose
small_cnnfrom the model library. The model has two inputs (image, label) and one output (tensor for predictions). Wire augmented images and labels in. -
Configure training
Click the Model node. In the Config tab set
epochsto10,lrto1e-3,batch_sizeto128. Leave the optimizer at the default (AdamW). -
Add evaluation
Add a Logic → eval/top1_accuracy node. Wire the model's predictions and the labels in. Its output is a float — the metric you care about.
-
Run the baseline
Click Run. Watch edges animate as data flows. When the model finishes training, click the eval node and check Run Data — you should see something around 78–82% accuracy on a freshly initialized CNN.
-
Create the ablation variant
Click + Variant. Name it "No augmentation". In the new variant, delete both augmentation nodes and wire Dataset → Model directly.
-
Set the baseline and compare
Right-click the "v1" tab → Set as baseline. Switch to "No augmentation", click Run. When it finishes, open Compare → v1 (Baseline). You'll see the structural diff and the accuracy delta side by side.
-
Trace the result
On the baseline, switch to the Path tool and click the accuracy output. Save the path as "Top-1 → augmented input". Anyone opening the project later can click that chip to see exactly what produced the metric.
Going further
Add more variants for a learning-rate sweep, swap in a ResNet, or add a third augmentation. Variants are cheap — make as many as you need.