beginner

What is the purpose of the pivot_table() function in pandas?

The pivot_table() function is used to summarize and aggregate data in a DataFrame by creating a new table that groups data by one or more keys and applies aggregation functions like sum, mean, or count.

Click to reveal answer

beginner

Which parameters are commonly used in pivot_table()?

data: The DataFrame to summarize.
index: Column(s) to group by on rows.
columns: Column(s) to group by on columns.
values: Column(s) to aggregate.
aggfunc: Aggregation function like sum, mean, or count.

Click to reveal answer

intermediate

How does pivot_table() handle missing values by default?

By default, pivot_table() fills missing values with NaN. You can change this behavior using the fill_value parameter to replace missing values with a specific value like 0.

Click to reveal answer

intermediate

What is the difference between pivot() and pivot_table()?

pivot() reshapes data without aggregation and requires unique index/column pairs. pivot_table() allows aggregation and can handle duplicate entries by applying aggregation functions.

Click to reveal answer

beginner

Write a simple example of using pivot_table() to find the average sales per product category.

Example:<br><pre>import pandas as pd

data = {'Category': ['A', 'A', 'B', 'B'], 'Sales': [100, 150, 200, 250]}
df = pd.DataFrame(data)

pivot = df.pivot_table(index='Category', values='Sales', aggfunc='mean')
print(pivot)</pre>

Click to reveal answer

What does the aggfunc parameter in pivot_table() specify?

AThe columns to use as column labels

BThe columns to use as row labels

CThe aggregation function to apply to grouped data

DThe DataFrame to summarize

Which parameter in pivot_table() controls the rows of the new table?

Aindex

Bvalues

Ccolumns

Daggfunc

If your data has duplicate entries for the same index and column, which function should you use to summarize it?

Apivot()

Bgroupby()

Cmerge()

Dpivot_table()

What will pivot_table() fill missing values with by default?

ANaN

BEmpty string

C0

DMean of the column

Which of these is NOT a valid aggregation function for aggfunc?

Amean

Bsort

Ccount

Dsum

Explain how you would use pivot_table() to summarize sales data by region and product.

Describe the difference between pivot() and pivot_table() and when to use each.