Challenge - 5 Problems

🎖️

Sample() Master

Get all challenges correct to earn this badge!

Test your skills under time pressure!

❓ Predict Output

intermediate

2:00remaining

What is the output of this sample() code?

Given the DataFrame df below, what will df.sample(n=3, random_state=1) return?

Data Analysis Python

import pandas as pd

df = pd.DataFrame({'A': [10, 20, 30, 40, 50], 'B': ['a', 'b', 'c', 'd', 'e']})
sample_df = df.sample(n=3, random_state=1)
print(sample_df)

Attempts:

2 left

❓ data_output

intermediate

1:00remaining

How many rows are returned by sample() with frac=0.4?

If a DataFrame has 10 rows, what is the number of rows returned by df.sample(frac=0.4, random_state=5)?

Data Analysis Python

import pandas as pd

df = pd.DataFrame({'X': range(10)})
sample_df = df.sample(frac=0.4, random_state=5)
print(len(sample_df))

Attempts:

2 left

🔧 Debug

advanced

1:30remaining

What error does this sample() code raise?

What error will this code produce?

Data Analysis Python

import pandas as pd

df = pd.DataFrame({'A': [1, 2, 3]})
sample_df = df.sample(n=5)

AKeyError: 'n'

BTypeError: sample() got an unexpected keyword argument 'n'

CNo error, returns 5 rows with NaNs

DValueError: Cannot take a larger sample than population when 'replace=False'

Attempts:

2 left

🚀 Application

advanced

1:30remaining

Which code produces a random sample with replacement?

You want to randomly select 4 rows from a DataFrame of 3 rows, allowing repeats. Which code does this?

Data Analysis Python

import pandas as pd

df = pd.DataFrame({'A': [1, 2, 3]})

Adf.sample(frac=1.33, random_state=0)

Bdf.sample(n=4, replace=False, random_state=0)

Cdf.sample(n=4, replace=True, random_state=0)

Ddf.sample(n=4)

Attempts:

2 left

🧠 Conceptual

expert

1:00remaining

What is the effect of setting random_state in sample()?

Why do we set the random_state parameter in df.sample()?

ATo ensure the sample is the same every time the code runs

BTo increase the sample size automatically

CTo speed up the sampling process

DTo sort the sampled rows by their index

Attempts:

2 left