Challenge - 5 Problems

🎖️

Advanced Data Science Mastery

Get all challenges correct to earn this badge!

Test your skills under time pressure!

❓ Predict Output

intermediate

2:00remaining

Output of complex data aggregation with groupby

What is the output of this Python code that groups data and calculates the sum?

Data Analysis Python

import pandas as pd

data = {'Category': ['A', 'B', 'A', 'B', 'C'], 'Value': [10, 20, 30, 40, 50]}
df = pd.DataFrame(data)
result = df.groupby('Category').sum()
print(result)

Value
A    40
B    60
C    50

   Value
Category       
A       40
B       60
C       50

Category  Value
A         40
B         60
C         50

   Category  Value
0        A     40
1        B     60
2        C     50

Attempts:

2 left

❓ data_output

intermediate

2:00remaining

Number of unique values after complex filtering

After filtering a DataFrame with multiple conditions, how many unique 'Type' values remain?

Data Analysis Python

import pandas as pd

data = {'Type': ['X', 'Y', 'X', 'Z', 'Y', 'Z', 'X'], 'Score': [5, 10, 15, 20, 25, 30, 35]}
df = pd.DataFrame(data)
filtered = df[(df['Score'] > 10) & (df['Type'] != 'Z')]
unique_count = filtered['Type'].nunique()
print(unique_count)

Attempts:

2 left

❓ visualization

advanced

2:00remaining

Identify correct plot for multi-dimensional data

Which plot correctly visualizes the relationship between three variables in the dataset?

Data Analysis Python

import matplotlib.pyplot as plt
import pandas as pd
import seaborn as sns

data = {'X': [1, 2, 3, 4, 5], 'Y': [5, 4, 3, 2, 1], 'Z': [2, 3, 4, 5, 6]}
df = pd.DataFrame(data)

plt.figure(figsize=(6,4))
sns.scatterplot(data=df, x='X', y='Y', hue='Z', palette='viridis')
plt.show()

AA bar chart showing sum of X grouped by Y

BA line plot connecting points of X and Y ignoring Z

CA scatter plot with points colored by 'Z' values showing X vs Y

DA histogram of Z values

Attempts:

2 left

🔧 Debug

advanced

2:00remaining

Error type from incorrect DataFrame merge

What error does this code raise when merging two DataFrames with no common columns specified?

Data Analysis Python

import pandas as pd

df1 = pd.DataFrame({'A': [1, 2], 'B': [3, 4]})
df2 = pd.DataFrame({'C': [5, 6], 'D': [7, 8]})
merged = pd.merge(df1, df2)
print(merged)

AValueError: No common columns to perform merge on

BKeyError: 'A'

CTypeError: merge() missing required positional argument

DNo error, merges with Cartesian product

Attempts:

2 left

🚀 Application

expert

3:00remaining

Choosing the right method for handling missing data in complex datasets

You have a large dataset with missing values scattered in multiple columns. Which method is best to handle missing data while preserving as much information as possible?

AUse advanced imputation methods like K-Nearest Neighbors (KNN) imputation

BFill missing values with the mean of each column

CReplace missing values with zero

DDrop all rows with any missing values

Attempts:

2 left