Pandas Library
Pandas Library in Python
Overview:
Pandas is a fast, powerful, flexible, and easy-to-use open-source data analysis and data
manipulation library built on top of the Python programming language.
Key Features:
- Data structures: Series (1D) and DataFrame (2D)
- Tools for reading and writing data between in-memory data structures and different formats: CSV,
Excel, SQL, JSON, etc.
- Data alignment and integrated handling of missing data
- Reshaping and pivoting of datasets
- Label-based slicing, indexing and subsetting of large datasets
- Data aggregation using group by functionality
- High-performance merging and joining of datasets
- Time series functionality
Sample Code:
```python
import pandas as pd
# Create a DataFrame
data = {'Name': ['Alice', 'Bob', 'Charlie'],
'Age': [25, 30, 35],
'City': ['New York', 'Paris', 'London']}
df = pd.DataFrame(data)
# Display DataFrame
print(df)
# Filter data
print(df[df['Age'] > 30])
```
Use Cases:
- Financial data analysis
- Cleaning and preprocessing data for machine learning
- Time series analysis
- Automating data reporting pipelines
Pandas is widely used in academia, finance, data journalism, and many fields where structured data
analysis is needed.