Data Cleaning for Text Fields in Aggregate Analyses Using dplyr

Text-Field-Data-Cleaning

Data Cleaning for Text Fields in Aggregate Analyses Using dplyr

Code developed by Arielle Landau

This R. Script is designed to clean text fields in large datasets. The code is specifically centered around shorter text fields, like names, titles, etc., with misspellings, weird capitalizations, and other common mistakes that occur when humans enter data. After cleaning the data, formally messy text fields can now be matched and counted in aggregate analyses.

Use Cases:

Counting the number of times different corporations are involved in environmental conflicts with user entered data from the Environmental Justice Atlas

Potential Use Cases:

Keeping track of donor involvement in non-profits despite name misspellings
Standardizing answers to surveys with text fields

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
Text-Field-Data-Cleaning		Text-Field-Data-Cleaning
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Text-Field-Data-Cleaning

Data Cleaning for Text Fields in Aggregate Analyses Using dplyr

Code developed by Arielle Landau

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Text-Field-Data-Cleaning

Data Cleaning for Text Fields in Aggregate Analyses Using dplyr

Code developed by Arielle Landau

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages