pranalibose/PandasOptimization

Pandas Optimization

This repository explores optimization techniques for working with Pandas. It covers best practices and advanced concepts, each illustrated with practical examples, and includes a demonstration of reading large CSV files in chunks.

While there are alternative libraries for data manipulation, such as Polars, Dask, and Spark, this repository focuses on enhancing performance specifically within the context of Pandas.

Recommended Reading Flow

  1. The Deck: An overview and summary of the methods.
  2. Best Practices: Practical examples of effective best practices for using Pandas.
  3. Advanced Concepts: Insights into advanced techniques that significantly improve performance with large datasets.
  4. Use Case: A detailed example of reading large CSV files through chunking.
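
As a rough illustration of the chunking idea in the use case above, here is a minimal sketch that is not taken from the repository's own notebooks. It assumes only the standard `chunksize` and `usecols` parameters of `pd.read_csv`. The function name `sum_column_in_chunks`, the column names, and the in-memory sample data are hypothetical stand-ins for a real large file.

```python
import io

import pandas as pd


def sum_column_in_chunks(csv_source, column, chunksize=100_000):
    """Sum one numeric column without loading the whole file at once.

    Hypothetical helper for illustration; not part of the repository.
    """
    total = 0.0
    rows = 0
    # With chunksize set, read_csv yields DataFrames of at most
    # `chunksize` rows instead of returning a single DataFrame.
    # usecols limits parsing to the one column that is needed.
    for chunk in pd.read_csv(csv_source, usecols=[column], chunksize=chunksize):
        total += chunk[column].sum()
        rows += len(chunk)
    return total, rows


# Small in-memory stand-in for a large CSV file (assumed data).
sample = io.StringIO("id,amount\n1,10.5\n2,20.0\n3,4.5\n4,15.0\n5,0.0\n")
total, rows = sum_column_in_chunks(sample, "amount", chunksize=2)
print(total, rows)
```

Only one chunk is held in memory at a time, so peak memory depends on `chunksize` rather than on the file size. This pattern suits aggregations that can be combined across chunks, such as sums and counts; operations that need the whole dataset at once, such as a global sort, need a different approach.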
