Evidpath helps teams validate a recommender before launch by running interaction tests, saving clear evidence, and comparing two versions side by side. With it you can:
- check that your recommender endpoint is wired correctly
- run a repeatable audit against a real target URL
- open a report that shows who struggled and why
- compare a baseline and a candidate before launch
Install the package:

```
python -m pip install evidpath
```

Check your endpoint:

```
evidpath check-target --domain recommender --target-url http://127.0.0.1:8051
```

Run one audit:

```
evidpath audit --domain recommender --target-url http://127.0.0.1:8051 --scenario returning-user-home-feed --seed 7
```

That run writes an output folder with files such as report.md, results.json, and traces.jsonl.
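The output files can be inspected directly or post-processed in a few lines. The sketch below is a minimal, hypothetical example: the `results.json` schema it assumes (a top-level `"checks"` list with a `"status"` per entry) is an illustration, not the documented Evidpath format.

```python
import json
import tempfile
from pathlib import Path

def summarize_run(run_dir):
    """Count failed checks in a run folder's results.json.

    The schema used here (a "checks" list with a "status" field per
    entry) is an assumption for illustration only.
    """
    results = json.loads((Path(run_dir) / "results.json").read_text())
    checks = results.get("checks", [])
    failed = [c for c in checks if c.get("status") != "pass"]
    return {"total": len(checks), "failed": len(failed)}

# Demo against a synthetic run folder so the sketch is self-contained.
run_dir = Path(tempfile.mkdtemp())
(run_dir / "results.json").write_text(
    json.dumps({"checks": [{"status": "pass"}, {"status": "fail"}]})
)
summary = summarize_run(run_dir)
print(summary)  # {'total': 2, 'failed': 1}
```

Replace the synthetic folder with a real audit output folder once you have run `evidpath audit`, adjusting the field names to whatever the actual report contains.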
Product links:
- product guide: products/evidpath/README.md
- PyPI: https://pypi.org/project/evidpath/
- TestPyPI: https://test.pypi.org/project/evidpath/
- releases: https://github.com/AlankritVerma01/limitation/releases
- demo guide: products/evidpath/DEMO.md
- external target contract: products/evidpath/EXTERNAL_TARGET_CONTRACT.md
- contributing: CONTRIBUTING.md
This repository contains two closely related things:
- the product package under products/evidpath
- the public proof and study under studies/01-recommender-offline-eval
If you are here to use the product, start with the product guide. If you are here to understand the original proof behind the direction, read the study.
The study package shows the original argument behind Evidpath: offline ranking metrics can miss important user-level tradeoffs.
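A tiny synthetic example (made-up NDCG numbers, not taken from the study) illustrates the argument: two models can tie on an aggregate metric while one of them regresses badly for a whole segment of users.

```python
# Synthetic illustration with made-up numbers: per-segment NDCG for a
# baseline model and a candidate model.
baseline = {"power_users": 0.80, "new_users": 0.60}
candidate = {"power_users": 0.95, "new_users": 0.45}

def avg(scores):
    # Unweighted mean over segments, rounded for a clean comparison.
    return round(sum(scores.values()) / len(scores), 2)

print(avg(baseline), avg(candidate))  # 0.7 0.7 -- a tie in aggregate

# Per-segment deltas expose the tradeoff the average hides:
# power users gain, new users lose.
deltas = {seg: round(candidate[seg] - baseline[seg], 2) for seg in baseline}
print(deltas)  # {'power_users': 0.15, 'new_users': -0.15}
```

An offline leaderboard that reports only the aggregate would call these models equivalent; a per-user or per-segment breakdown, which is what Evidpath's audits aim to surface, would not.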
Useful links:
- study README: studies/01-recommender-offline-eval/README.md
- canonical report: studies/01-recommender-offline-eval/artifacts/canonical/official_demo_report.md
- canonical JSON: studies/01-recommender-offline-eval/artifacts/canonical/official_demo_results.json
- product docs: products/evidpath/README.md
- PyPI README: products/evidpath/README_PYPI.md
- plans: plans/evidpath-v0/README.md
- code of conduct: CODE_OF_CONDUCT.md
- security: SECURITY.md
- support: SUPPORT.md
The earlier public write-up that motivated this direction is here:
https://dev.to/alankritverma/why-offline-evaluation-is-not-enough-for-recommendation-systems-15ii