0
0
Prompt Engineering / GenAIml~12 mins

Environmental impact of AI in Prompt Engineering / GenAI - Model Pipeline Trace

Choose your learning style9 modes available
Model Pipeline - Environmental impact of AI

This pipeline shows how training a large AI model affects the environment through energy use and carbon emissions. It tracks data from raw input to model training and measures energy consumption and emissions at each step.

Data Flow - 6 Stages
1Raw Data Collection
1000000 rows x 10 columnsGathering text and image data from the internet1000000 rows x 10 columns
Text samples and images collected for training
2Data Preprocessing
1000000 rows x 10 columnsCleaning, filtering, and formatting data950000 rows x 10 columns
Removed duplicates and irrelevant data
3Feature Engineering
950000 rows x 10 columnsConverting raw data into numerical features950000 rows x 512 features
Text converted to 512-dimensional vectors
4Model Training
950000 rows x 512 featuresTraining a deep neural network with 12 layersModel weights updated after 10 epochs
Model learns patterns from features
5Energy Consumption Measurement
Model training processMeasuring electricity used by GPUsTotal energy used: 1500 kWh
Energy meter records power during training
6Carbon Emission Estimation
Energy consumption dataCalculating CO2 emissions based on energy sourceEstimated emissions: 750 kg CO2
Using local electricity carbon intensity
Training Trace - Epoch by Epoch

Loss
1.2 |*        
0.9 | *       
0.7 |  *      
0.55|   *     
0.45|    *    
0.38|     *   
0.33|      *  
0.29|       * 
0.26|        *
0.24|         *
     ----------------
      Epochs 1 to 10
EpochLoss ↓Accuracy ↑Observation
11.20.45High loss and low accuracy at start
20.90.60Loss decreases, accuracy improves
30.70.72Model learns important features
40.550.80Training progressing well
50.450.85Good balance of loss and accuracy
60.380.88Model converging
70.330.90Stable improvement
80.290.92Approaching best performance
90.260.93Small improvements
100.240.94Training complete with good accuracy
Prediction Trace - 3 Layers
Layer 1: Input Layer
Layer 2: Hidden Layers (12 layers)
Layer 3: Output Layer with Softmax
Model Quiz - 3 Questions
Test your understanding
What happens to the data size after preprocessing?
AIt decreases because irrelevant data is removed
BIt increases because new features are added
CIt stays the same size
DIt becomes empty
Key Insight
Training large AI models uses significant energy, which leads to carbon emissions. Monitoring energy and emissions during data processing and training helps understand and reduce AI's environmental impact.