0
0
TensorFlowml~20 mins

Dataset from files in TensorFlow - Practice Problems & Coding Challenges

Choose your learning style9 modes available
Challenge - 5 Problems
🎖️
TensorFlow Dataset Master
Get all challenges correct to earn this badge!
Test your skills under time pressure!
Predict Output
intermediate
2:00remaining
Output of loading text files with tf.data
What is the output of this code snippet that loads text lines from two files using TensorFlow's tf.data API?
TensorFlow
import tensorflow as tf

# Create two small text files
with open('file1.txt', 'w') as f:
    f.write('apple\nbanana')
with open('file2.txt', 'w') as f:
    f.write('cherry\ndate')

# Load dataset from files
files = ['file1.txt', 'file2.txt']
dataset = tf.data.TextLineDataset(files)

# Collect all lines into a list
lines = list(dataset.as_numpy_iterator())
print(lines)
A[b'file1.txt', b'file2.txt']
BSyntaxError
C[b'apple banana', b'cherry date']
D[b'apple', b'banana', b'cherry', b'date']
Attempts:
2 left
💡 Hint
tf.data.TextLineDataset reads each line from all files in order.
data_output
intermediate
1:30remaining
Number of elements in a dataset from multiple CSV files
Given three CSV files each with 2 rows, what is the number of elements in the dataset created by tf.data.experimental.CsvDataset loading all files?
TensorFlow
import tensorflow as tf

# Assume files: data1.csv, data2.csv, data3.csv each with 2 rows
files = ['data1.csv', 'data2.csv', 'data3.csv']
dataset = tf.data.experimental.CsvDataset(files, [tf.float32, tf.int32])

count = 0
for _ in dataset:
    count += 1
print(count)
A6
B3
C2
D9
Attempts:
2 left
💡 Hint
Each file has 2 rows, and dataset reads all rows from all files.
🔧 Debug
advanced
1:30remaining
Error raised when loading non-existent files
What error will this code raise when trying to create a TextLineDataset from a file that does not exist?
TensorFlow
import tensorflow as tf

files = ['missing_file.txt']
dataset = tf.data.TextLineDataset(files)

for line in dataset:
    print(line.numpy())
ANo error, prints nothing
Btf.errors.NotFoundError
CValueError
DFileNotFoundError
Attempts:
2 left
💡 Hint
TensorFlow raises its own error type for missing files.
🚀 Application
advanced
2:30remaining
Creating a dataset from image files with labels
You have a folder with images and a CSV file mapping image filenames to labels. Which code snippet correctly creates a tf.data.Dataset yielding (image_tensor, label) pairs?
AUse tf.data.TextLineDataset on CSV, parse lines, then map to load images with tf.io.read_file and tf.image.decode_jpeg
BUse tf.data.Dataset.list_files on images, then map to load images and assign labels manually
CUse tf.data.experimental.CsvDataset on CSV, then map to load images and parse labels
DUse tf.data.Dataset.from_tensor_slices with image paths and labels loaded into memory
Attempts:
2 left
💡 Hint
CsvDataset is designed to read CSV files with typed columns.
🧠 Conceptual
expert
3:00remaining
Effect of interleave on dataset from multiple files
What is the main difference between tf.data.TextLineDataset(files) and tf.data.Dataset.from_tensor_slices(files).interleave(tf.data.TextLineDataset, cycle_length=2) when reading multiple text files?
ATextLineDataset reads files sequentially; interleave reads lines from files in parallel, mixing lines
BTextLineDataset reads files in parallel; interleave reads files sequentially
CBoth produce the same output order
DTextLineDataset reads only first file; interleave reads all files
Attempts:
2 left
💡 Hint
Interleave cycles through datasets to mix their elements.