Challenge - 5 Problems

🎖️

DataFrame Mastery

Get all challenges correct to earn this badge!

Test your skills under time pressure!

❓ Predict Output

intermediate

2:00remaining

What is the output shape of this DataFrame?

Consider the following code creating a DataFrame from a dictionary of lists. What is the shape (rows, columns) of the resulting DataFrame?

Pandas

import pandas as pd
data = {'A': [1, 2, 3], 'B': [4, 5, 6], 'C': [7, 8, 9]}
df = pd.DataFrame(data)
print(df.shape)

A(2, 3)

B(3, 2)

C(3, 3)

D(9, 3)

Attempts:

2 left

❓ data_output

intermediate

2:00remaining

What is the DataFrame content after creation?

Given this code, what is the content of the DataFrame?

Pandas

import pandas as pd
data = {'Name': ['Alice', 'Bob'], 'Age': [25, 30]}
df = pd.DataFrame(data)
print(df)

Name  Age
1  Alice  25
2  Bob    30

   Name  Age
0  Alice   25
1    Bob   30

   Name  Age
1  Alice   25
2    Bob   30

Name  Age
0  Alice  25
1  Bob    30

Attempts:

2 left

🔧 Debug

advanced

2:00remaining

Why does this DataFrame creation raise an error?

Look at this code snippet. Why does it raise an error?

Pandas

import pandas as pd
data = {'A': [1, 2], 'B': [3, 4, 5]}
df = pd.DataFrame(data)

AValueError because columns have different length lists

BSyntaxError due to missing colon

CTypeError because dictionary keys are not strings

DNo error, DataFrame created successfully

Attempts:

2 left

🧠 Conceptual

advanced

2:00remaining

Why is DataFrame creation method important for performance?

Which statement best explains why the method of creating a DataFrame matters for performance?

ADataFrame creation speed depends only on the number of columns, not rows.

BAll DataFrame creation methods have the same performance because pandas optimizes automatically.

CCreating a DataFrame from a CSV file is always faster than from any Python data structure.

DCreating a DataFrame from a list of dictionaries is slower than from a dictionary of lists because of internal data alignment.

Attempts:

2 left

🚀 Application

expert

3:00remaining

Which code produces a DataFrame with a MultiIndex?

Choose the code snippet that creates a DataFrame with a MultiIndex on rows.

import pandas as pd
index = pd.MultiIndex.from_tuples([('a', 1), ('a', 2)])
data = {'value': [10, 20]}
df = pd.DataFrame(data, index=index)

import pandas as pd
data = {('A', 1): [10, 20], ('A', 2): [30, 40]}
df = pd.DataFrame(data)

import pandas as pd
data = {'A': [10, 20], 'B': [30, 40]}
df = pd.DataFrame(data)
df.set_index(['A', 'B'], inplace=True)

import pandas as pd
data = {'A': [10, 20], 'B': [30, 40]}
df = pd.DataFrame(data)
df.index = ['a', 'b']

Attempts:

2 left