This document describes the diagrams needed for the article. Create these as PNG files in this folder.
Purpose: Show the three-stage measurement pipeline with distributed execution
Style: Flowchart with parallel branches, clean modern look
Content:
ββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ
β MEASUREMENT PIPELINE β
ββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ
β
βββββββββββββββββ΄ββββββββββββββββ
βΌ βΌ
βββββββββββββββββββββββββββ βββββββββββββββββββββββββββ
β CONFIGURATION β β EXECUTION β
β βββββββββββββββββββ β β βββββββββββββββββββ β
β β’ Model selection ββββββΆβ β’ HuggingFace Accelerate
β β’ Hardware config β β β’ Tensor parallelism β
β β’ Generation params β β β’ Barrier sync β
β β’ YAML inheritance β ββββββββββββββ¬βββββββββββββ
βββββββββββββββββββββββββββ β
ββββββββββββββββΌβββββββββββββββ
βΌ βΌ βΌ
ββββββββββ ββββββββββ ββββββββββ
β GPU 0 β β GPU 1 β β GPU N β
β ββββββ β β ββββββ β β ββββββ β
β Energy β β Energy β β Energy β
β Tokens β β Tokens β β Tokens β
β Memory β β Memory β β Memory β
ββββββ¬ββββ ββββββ¬ββββ ββββββ¬ββββ
β β β
ββββββββββββββββΌβββββββββββββββ
β
βΌ
βββββββββββββββββββββββββββ
β AGGREGATION β
β βββββββββββββββββββ β
β β’ Late aggregation β
β β’ Raw results preservedβ
β β’ Flexible analysis β
βββββββββββββββββββββββββββ
Colours:
Dimensions: ~800Γ600px
Purpose: Show what happens during a single measurement run
Style: Vertical timeline / sequence diagram
Content:
βββββββββββββββββββββββββββββββββββββββββββββββ
β 1. INITIALISATION β
β β’ Load model onto GPU(s) β
β β’ Configure distributed backend β
β β’ Start CodeCarbon tracker β
βββββββββββββββββββββββββββββββββββββββββββββββ
β
βΌ
βββββββββββββββββββββββββββββββββββββββββββββββ
β 2. WARM-UP (discarded) β
β β’ 3 dummy forward passes β
β β’ Trigger lazy initialisations β
β β’ Stabilise GPU clocks β
βββββββββββββββββββββββββββββββββββββββββββββββ
β
βΌ
βββββββββββββββββββββββββββββββββββββββββββββββ
β 3. MEASUREMENT β
β βββββββββββββββββββββββββββββββββββββββ β
β β for batch in dataloader: β β
β β outputs = model.generate(batch) β β
β β record_tokens(outputs) β β
β βββββββββββββββββββββββββββββββββββββββ β
β β’ Energy tracked continuously β
β β’ Tokens counted per process β
βββββββββββββββββββββββββββββββββββββββββββββββ
β
βΌ
βββββββββββββββββββββββββββββββββββββββββββββββ
β 4. COLLECTION β
β β’ Stop CodeCarbon β
β β’ Gather per-GPU metrics β
β β’ Save raw results to JSON β
βββββββββββββββββββββββββββββββββββββββββββββββ
Colours: Gradient from top (light) to bottom (darker)
Dimensions: ~500Γ700px
Purpose: Pie chart or stacked bar showing energy breakdown
Style: Simple data visualisation
Content:
Typical Energy Breakdown (LLaMA-3B, 4ΓA100)
ββββββββββββββββββββββββββββββββββββββββββββββ
β ββββββββββββββββββββββββββββββββββββββββ β GPU: 89%
β ββββββββββββββββββββββββββββββββββββββββ β CPU: 7%
β ββββββββββββββββββββββββββββββββββββββββ β RAM: 4%
ββββββββββββββββββββββββββββββββββββββββββββββ
Or as a pie:
Dimensions: ~400Γ400px
Export as PNG at 2x resolution for crisp display on retina screens.
Powered by Jekyll and Minimal Light theme.