A DataFrame is like a smart table that helps you organize and analyze data easily. It lets you work with rows and columns just like a spreadsheet, making data simple to understand and use.
0
0
Why DataFrame is the core data structure in Data Analysis Python
Introduction
When you have data with rows and columns, like a list of sales records.
When you want to clean or change data before analysis.
When you need to calculate summaries like averages or totals.
When you want to combine data from different sources into one table.
When you want to filter or select specific parts of your data.
Syntax
Data Analysis Python
import pandas as pd df = pd.DataFrame(data, columns=['Column1', 'Column2'])
DataFrame is created using the pandas library in Python.
You can create a DataFrame from lists, dictionaries, or other data sources.
Examples
This creates a DataFrame from a dictionary where keys are column names.
Data Analysis Python
import pandas as pd data = {'Name': ['Anna', 'Bob'], 'Age': [28, 34]} df = pd.DataFrame(data)
This creates a DataFrame from a list of lists and assigns column names.
Data Analysis Python
import pandas as pd list_of_lists = [['Anna', 28], ['Bob', 34]] df = pd.DataFrame(list_of_lists, columns=['Name', 'Age'])
Sample Program
This program creates a DataFrame with product sales data, calculates total sales, and prints the table.
Data Analysis Python
import pandas as pd # Create a simple DataFrame with sales data data = {'Product': ['Apple', 'Banana', 'Cherry'], 'Price': [1.2, 0.5, 2.5], 'Quantity': [10, 20, 15]} df = pd.DataFrame(data) # Calculate total sales for each product df['Total'] = df['Price'] * df['Quantity'] # Show the DataFrame print(df)
OutputSuccess
Important Notes
DataFrames make it easy to handle different types of data in one table.
They support many useful functions for data analysis like filtering, grouping, and joining.
Summary
DataFrames organize data in rows and columns like a table.
They are easy to create and work with using pandas in Python.
DataFrames help you analyze, clean, and summarize data efficiently.