0
0
Pandasdata~5 mins

Memory savings with categoricals in Pandas - Cheat Sheet & Quick Revision

Choose your learning style9 modes available
Recall & Review
beginner
What is a categorical data type in pandas?
A categorical data type in pandas is a way to store data that has a limited set of possible values, called categories. It saves memory by storing the data as integers internally and mapping them to the categories.
Click to reveal answer
beginner
How does using categorical data types save memory compared to using strings?
Using categorical data types saves memory because it stores each unique value only once and uses integer codes to represent the data, instead of storing full strings repeatedly.
Click to reveal answer
beginner
Which pandas method converts a column to a categorical type?
The pandas method to convert a column to categorical type is astype('category').
Click to reveal answer
beginner
What is one common use case for converting data to categorical type?
A common use case is when a column has repeated values from a small set, like days of the week or product categories, to reduce memory and speed up operations.
Click to reveal answer
beginner
True or False: Categorical data types can only be used with text data.
False. Categorical data types can be used with any data that has a limited set of values, including numbers and strings.
Click to reveal answer
What does pandas store internally when using a categorical data type?
AFloating point numbers
BInteger codes representing categories
CFull string values for each row
DBoolean values
Which pandas function converts a column to categorical?
Aastype('category')
Bto_categorical()
Cconvert_category()
Dcategory_convert()
Why do categorical types save memory?
AThey remove missing values
BThey compress data using zip
CThey store unique values once and use codes
DThey convert data to integers only
Which data is best suited for categorical type?
AUnique IDs with many different values
BContinuous numerical data
CRandom floating point numbers
DRepeated values from a small set
Can categorical types be used for numeric data?
AYes, if the numbers represent categories
BNo, only for text data
COnly for floating point numbers
DOnly for boolean data
Explain how converting a pandas column to categorical type helps save memory.
Think about how repeated values are stored in normal strings versus categorical.
You got /4 concepts.
    Describe a real-life example where using categorical data type would be beneficial.
    Consider data with many repeated labels.
    You got /4 concepts.