0
0
TensorFlowml~5 mins

Data augmentation in pipeline in TensorFlow - Cheat Sheet & Quick Revision

Choose your learning style9 modes available
Recall & Review
beginner
What is data augmentation in a machine learning pipeline?
Data augmentation is a technique to create new training data by making small changes to existing data. It helps the model learn better by showing it more varied examples.
Click to reveal answer
beginner
Why do we use data augmentation in the training pipeline?
We use data augmentation to increase the size and diversity of the training data. This reduces overfitting and helps the model generalize better to new data.
Click to reveal answer
beginner
Name three common data augmentation techniques for images.
Common techniques include flipping images horizontally, rotating images by small angles, and zooming in or out slightly.
Click to reveal answer
intermediate
How can data augmentation be integrated into a TensorFlow pipeline?
In TensorFlow, data augmentation can be added as part of the tf.data pipeline using functions like map() to apply augmentation operations on the fly during training.
Click to reveal answer
intermediate
What is the benefit of applying data augmentation on the fly during training instead of beforehand?
Applying augmentation on the fly saves storage space and creates new variations each epoch, making the training data more diverse without needing to save all augmented images.
Click to reveal answer
Which of the following is NOT a typical image data augmentation technique?
ASorting pixels by brightness
BAdding random noise
CHorizontal flipping
DRandom rotation
In TensorFlow, which method is commonly used to apply data augmentation in a pipeline?
Atf.constant()
Btf.keras.Model.compile()
Ctf.Variable.assign()
Dtf.data.Dataset.map()
What is a key advantage of using data augmentation during training?
AIt increases training data diversity
BIt reduces the model size
CIt speeds up training
DIt removes the need for validation data
Which statement about data augmentation is TRUE?
AIt always decreases model accuracy
BIt can help prevent overfitting
CIt requires manual labeling of new data
DIt is only useful for text data
When applying data augmentation on the fly, what happens each training epoch?
AThe same augmented images are reused
BThe dataset size shrinks
CNew random augmentations are applied each time
DNo augmentation is applied after the first epoch
Explain how data augmentation improves model training and how it can be implemented in a TensorFlow pipeline.
Think about why showing the model more varied images helps it learn better.
You got /4 concepts.
    Describe the difference between applying data augmentation before training and applying it on the fly during training.
    Consider storage needs and data variety over multiple training passes.
    You got /3 concepts.