0
0
Data Analysis Pythondata~20 mins

Basic DataFrame info (shape, dtypes, describe) in Data Analysis Python - Practice Problems & Coding Challenges

Choose your learning style9 modes available
Challenge - 5 Problems
🎖️
DataFrame Info Master
Get all challenges correct to earn this badge!
Test your skills under time pressure!
Predict Output
intermediate
2:00remaining
What is the shape of the DataFrame?
Given the following DataFrame, what will be the output of df.shape?
Data Analysis Python
import pandas as pd

data = {'Name': ['Alice', 'Bob', 'Charlie'], 'Age': [25, 30, 35], 'City': ['NY', 'LA', 'SF']}
df = pd.DataFrame(data)
print(df.shape)
A(2, 3)
B(3, 2)
C(3, 3)
D(3, 4)
Attempts:
2 left
💡 Hint
Shape shows (rows, columns). Count rows and columns in the data.
Predict Output
intermediate
2:00remaining
What are the data types of each column?
What will be the output of df.dtypes for this DataFrame?
Data Analysis Python
import pandas as pd

data = {'Name': ['Alice', 'Bob'], 'Age': [25, 30], 'Height': [5.5, 6.0]}
df = pd.DataFrame(data)
print(df.dtypes)
A
Name     int64
Age      object
Height   float64
dtype: object
B
Name     object
Age       int64
Height  float64
dtype: object
C
Name     float64
Age      int64
Height   object
dtype: object
D
Name     object
Age      float64
Height   int64
dtype: object
Attempts:
2 left
💡 Hint
Names are text, ages are whole numbers, heights are decimals.
data_output
advanced
3:00remaining
What does describe() show for numeric columns?
Given this DataFrame, what will df.describe() output?
Data Analysis Python
import pandas as pd

data = {'Score': [80, 90, 100, 70, 85], 'Attempts': [1, 2, 1, 3, 2]}
df = pd.DataFrame(data)
print(df.describe())
A
       Score  Attempts
count    5.0       5.0
mean    85.0       1.8
std      10.0      0.84
min     70.0       1.0
25%     80.0       1.0
50%     85.0       2.0
75%     90.0       2.0
max    100.0       3.0
B
       Score  Attempts
count    5.0       5.0
mean    85.0       1.8
std      11.18     0.84
min     70.0       1.0
25%     75.0       1.0
50%     85.0       2.0
75%     90.0       3.0
max    100.0       3.0
C
       Score  Attempts
count    5.0       5.0
mean    85.0       2.0
std      11.18     1.0
min     70.0       1.0
25%     80.0       1.0
50%     85.0       2.0
75%     90.0       2.0
max    100.0       3.0
D
       Score  Attempts
count    5.0       5.0
mean    85.0       1.8
std      11.18     0.84
min     70.0       1.0
25%     80.0       1.0
50%     85.0       2.0
75%     90.0       2.0
max    100.0       3.0
Attempts:
2 left
💡 Hint
Check the mean and standard deviation carefully.
🧠 Conceptual
advanced
1:30remaining
Which statement about DataFrame dtypes is true?
Consider a DataFrame with columns of different types. Which statement is correct about df.dtypes?
AIt returns a Series showing the data type of each column.
BIt returns the data type of the first column only.
CIt returns a list of all unique data types in the DataFrame.
DIt returns the number of columns for each data type.
Attempts:
2 left
💡 Hint
Think about what you want to know about each column's type.
🔧 Debug
expert
2:30remaining
Why does this describe() call miss some columns?
Given this DataFrame, why does df.describe() only show one column?
Data Analysis Python
import pandas as pd
import numpy as np

data = {'A': [1, 2, 3], 'B': ['x', 'y', 'z'], 'C': [np.nan, np.nan, np.nan]}
df = pd.DataFrame(data)
print(df.describe())
Adescribe() by default only summarizes numeric columns, so only column 'A' appears.
Bdescribe() fails because column 'C' has NaN values and stops processing.
Cdescribe() only shows columns with no missing values, so 'C' is excluded.
Ddescribe() shows all columns but output is truncated to one column by default.
Attempts:
2 left
💡 Hint
Check pandas describe default behavior with non-numeric columns.