Extract the data frame of errors?

I find that I often require two things from the same assumption-checking code:

1. Fail the analysis if the assumptions are incorrect,
2. Separate out two data frames: (1) a dataframe of the rows with faulty assumptions (to remand to data collection) and (2) a data frame that passes the checks (for further data analysis).
3. ~~Alternatively, get a single data frame with a column that indicates whether they passed the check.~~

I understand the original intention of `engarde` is to fail early, and it does provide some tools for (2), but there are two particular pain points:

1. Getting back to a data frame with and without errors is a little tough. In some cases, that's easy: `verify_all` returns a dataframe in `AssertionError.args[1]`. In others, it is less so: `none_missing` returns a list of `(index, column)` tuples, which all have to be passed to `pandas.DataFrame.loc` separately.
2. Engarde throws the first errors it encounters, which means that any other checks that might fail will only be discovered when this error is worked around.

Can `engarde` be used for my use case, or is that too far away from `engarde`'s philosophy? 

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Extract the data frame of errors? #51

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Extract the data frame of errors? #51

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions