neiloconnor/pyspark-local

PySpark Local

A minimal implementation of the PySpark API that makes it easier to write and locally test PySpark code. Run your code against this local API to verify that it works with sample data before switching to the real PySpark API in production.
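
To illustrate the idea, here is a rough sketch of how a list-backed stand-in for part of the RDD API could look. The names `LocalRDD` and `LocalSparkContext` are hypothetical and are not taken from this repository's pyspark.py; the sketch only assumes that `parallelize`, `map`, `filter`, `reduce` and `collect` behave as they do for small in-memory data in real PySpark.

```python
from functools import reduce


class LocalRDD:
    """Hypothetical list-backed stand-in for an RDD (illustration only)."""

    def __init__(self, items):
        # Materialise the data eagerly; a real RDD is lazy and distributed.
        self._items = list(items)

    def map(self, fn):
        return LocalRDD(fn(x) for x in self._items)

    def filter(self, predicate):
        return LocalRDD(x for x in self._items if predicate(x))

    def reduce(self, fn):
        return reduce(fn, self._items)

    def collect(self):
        # Return a copy so callers cannot mutate the internal list.
        return list(self._items)


class LocalSparkContext:
    """Hypothetical stand-in that exposes only parallelize()."""

    def parallelize(self, items):
        return LocalRDD(items)


sc = LocalSparkContext()
squares_of_evens = (
    sc.parallelize(range(1, 7))
    .filter(lambda n: n % 2 == 0)
    .map(lambda n: n * n)
)
print(squares_of_evens.collect())                   # [4, 16, 36]
print(squares_of_evens.reduce(lambda a, b: a + b))  # 56
```

Because the pipeline only calls methods that also exist on real RDDs, the same chain of calls should carry over to a real `SparkContext` unchanged, which is the kind of local check this project is meant to support.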

Getting started

Copy pyspark.py into your project directory.

Interactive mode

You can emulate PySpark's interactive mode by running:

python -i pyspark.py

Script mode

Alternatively, import it in your Python scripts:

from pyspark import SparkContext

Disclaimer

The feature set is incomplete. The commonly used features that are included were chosen based on https://s3.amazonaws.com/assets.datacamp.com/blog_assets/PySpark_Cheat_Sheet_Python.pdf
