feat: Implementation of embed_and_evaluate #702

Merged

hanxiao merged 13 commits into main from feat-add-embed-and-evaluate-function
Nov 2, 2022
Conversation

@guenthermi
Contributor

@guenthermi guenthermi commented Oct 28, 2022

Goals:

  • Add a function that does embed, match, and evaluate at once
  • Do embed and match in batches to reduce the memory footprint of the method
  • Enable sampling of query vectors to avoid a quadratic increase in time complexity
  • Check and update documentation, if required. See guide

Example Code:

import numpy as np
from docarray import Document, DocumentArray


def emb_func(da):
    # toy embedding function: seed with the document's text so each
    # document gets a deterministic 5-dimensional embedding
    for d in da:
        np.random.seed(int(d.text))
        d.embedding = np.random.random(5)

# 1,000 documents labelled 0-9 in round-robin fashion
da = DocumentArray(
    [Document(text=str(i), tags={'label': i % 10}) for i in range(1_000)]
)

# embed, match, and evaluate in one call, sampling 100 query vectors
da.embed_and_evaluate(
    metrics=['precision_at_k'], embed_funcs=emb_func, query_sample_size=100
)
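
As background for the metric used in the example, precision@k for a single query is the fraction of its top-k matches that share the query's label. A minimal sketch, independent of the docarray API (the function name here is illustrative, not docarray's implementation):

```python
def precision_at_k(query_label, retrieved_labels, k):
    """Fraction of the top-k retrieved labels equal to the query's label."""
    top_k = retrieved_labels[:k]
    return sum(1 for lbl in top_k if lbl == query_label) / k


# query labelled 3; labels of its top-5 retrieved matches
score = precision_at_k(3, [3, 3, 7, 3, 1], k=5)  # 3 of 5 match -> 0.6
```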

Reduction of memory usage when evaluating 100 query vectors against 500,000 index vectors with 500 dimensions:

Manual Evaluation:

Line #    Mem usage    Increment  Occurrences   Line Contents
=============================================================
    28   1130.7 MiB   1130.7 MiB           1   @profile
    29                                         def run_evaluation_old_style(queries, index, model):
    30   1133.1 MiB      2.5 MiB           1       queries.embed(model)
    31   2345.6 MiB   1212.4 MiB           1       index.embed(model)
    32   2360.4 MiB     14.8 MiB           1       queries.match(index)
    33   2360.4 MiB      0.0 MiB           1       return queries.evaluate(metrics=['reciprocal_rank'])

Evaluation with `embed_and_evaluate` (batch_size 100,000):

Line #    Mem usage    Increment  Occurrences   Line Contents
=============================================================
    23   1130.6 MiB   1130.6 MiB           1   @profile
    24                                         def run_evaluation(queries, index, model, batch_size=None):
    25   1130.6 MiB      0.0 MiB           1       kwargs = {'match_batch_size':batch_size} if batch_size else {}
    26   1439.9 MiB    309.3 MiB           1       return queries.embed_and_evaluate(metrics=['reciprocal_rank'], index_data=index, embed_models=model, **kwargs)
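
The memory saving comes from processing the index in batches rather than embedding it all at once. A plain-NumPy sketch of the batching idea (not the docarray implementation; a running top-k is merged batch by batch so only one batch of index vectors is held at a time):

```python
import numpy as np

rng = np.random.default_rng(0)
queries = rng.random((100, 16))   # 100 query vectors
index = rng.random((1000, 16))    # stand-in for 500,000 index vectors
batch_size, k = 250, 5

# running top-k distances/indices per query, initialised to "worse than anything"
best_dist = np.full((len(queries), k), np.inf)
best_idx = np.full((len(queries), k), -1)

for start in range(0, len(index), batch_size):
    batch = index[start : start + batch_size]
    # squared Euclidean distances, shape (n_queries, len(batch))
    d = ((queries[:, None, :] - batch[None, :, :]) ** 2).sum(-1)
    # merge this batch's candidates with the running top-k and re-select
    cand_dist = np.concatenate([best_dist, d], axis=1)
    cand_idx = np.concatenate(
        [best_idx, np.tile(np.arange(start, start + len(batch)), (len(queries), 1))],
        axis=1,
    )
    order = np.argsort(cand_dist, axis=1)[:, :k]
    best_dist = np.take_along_axis(cand_dist, order, axis=1)
    best_idx = np.take_along_axis(cand_idx, order, axis=1)
```

Peak memory is bounded by the batch size instead of the full index, at the cost of one merge per batch.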

@guenthermi
Contributor Author

This is only a draft atm. I have to add more tests, benchmark the memory usage, and add documentation if everything seems to work well.

@codecov

codecov bot commented Oct 28, 2022

Codecov Report

Merging #702 (ebbd9dd) into main (3f07f52) will increase coverage by 2.20%.
The diff coverage is 96.82%.

@@            Coverage Diff             @@
##             main     #702      +/-   ##
==========================================
+ Coverage   85.95%   88.16%   +2.20%     
==========================================
  Files         133      133              
  Lines        6538     6600      +62     
==========================================
+ Hits         5620     5819     +199     
+ Misses        918      781     -137     
Flag Coverage Δ
docarray 88.16% <96.82%> (+2.20%) ⬆️

Flags with carried forward coverage won't be shown.

Impacted Files Coverage Δ
docarray/array/mixins/embed.py 92.30% <ø> (ø)
docarray/array/mixins/match.py 91.66% <ø> (ø)
docarray/array/mixins/evaluation.py 88.11% <96.82%> (+6.63%) ⬆️
docarray/array/storage/redis/find.py 62.26% <0.00%> (-26.42%) ⬇️
docarray/array/storage/sqlite/getsetdel.py 100.00% <0.00%> (+2.38%) ⬆️
docarray/array/storage/annlite/getsetdel.py 100.00% <0.00%> (+2.63%) ⬆️
docarray/array/storage/elastic/getsetdel.py 100.00% <0.00%> (+3.38%) ⬆️
docarray/array/storage/weaviate/getsetdel.py 100.00% <0.00%> (+5.26%) ⬆️
docarray/array/storage/base/getsetdel.py 91.21% <0.00%> (+5.40%) ⬆️
docarray/array/mixins/getitem.py 90.47% <0.00%> (+11.11%) ⬆️
... and 6 more


@guenthermi guenthermi marked this pull request as ready for review October 31, 2022 10:44
Member

@samsja samsja left a comment


lgtm, could we change the docstring of embed to mention that this new method exists and should be used when the user needs to embed and evaluate directly?

**kwargs,
) -> Optional[Union[float, List[float]]]: # average for each metric
"""
Computes ranking evaluation metrics for a given `DocumentArray`. Moreover, this
Member


no need for moreover here IMO

is possible with the :func:``evaluate`` function.

:param metrics: List of metric names or metric functions to be computed
:param index_data: The other DocumentArray to match against, if not given,
Member


can we describe what it means to match against itself? Will all of the documents inside the da be matched against all the others? Or does it split the da into query and index randomly and perform the search from query to index?

Contributor Author

@guenthermi guenthermi Oct 31, 2022


Ok, yes, I can explain this. Atm it will match everything against everything. I might do another small PR afterwards to enable sampling.
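
The sampling idea being discussed can be sketched in a few lines of plain Python (stand-in lists, not the docarray API): instead of matching every document against every other one, draw a random subset of queries and match only those against the full set.

```python
import random

docs = list(range(10_000))        # stand-in for a DocumentArray
query_sample_size = 100           # as in the example's query_sample_size

random.seed(42)                   # fixed seed for reproducibility
queries = random.sample(docs, query_sample_size)

# cost drops from quadratic in len(docs) to linear per sampled query
comparisons_full = len(docs) ** 2
comparisons_sampled = len(queries) * len(docs)
```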

Member


matching everything against everything is not tractable tbh, why not just do the sampling right now?

Contributor Author


Note: We discussed that it might make sense, but it is better to implement it in this PR to avoid a breaking change.

'Either a ground_truth `DocumentArray` or labels are '
'required for the evaluation.'
)
if not label_tag in index_data[0].tags:
Member


technically we need to do this for all of the documents in index_data ...

Contributor Author


Yes, but I don't think it is necessary to do a full data validation. If a user provides only partial labels, I think it is ok if it crashes with a key error.
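
For comparison, the full up-front validation being debated could look roughly like this (a sketch with plain dicts standing in for Documents with a `tags` mapping; the error message and variable names are illustrative):

```python
label_tag = 'label'
# plain dicts standing in for Documents; every entry here carries the label
index_data = [{'tags': {'label': i % 10}} for i in range(100)]

# collect the positions of all documents missing the label tag
missing = [i for i, d in enumerate(index_data) if label_tag not in d['tags']]
if missing:
    raise ValueError(
        f'{len(missing)} documents are missing the label tag {label_tag!r}'
    )
```

This trades one linear pass over `index_data` for an early, descriptive failure instead of a later KeyError.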

Member


@JohannesMessner @alaeddine-13 what is your opinion here? For me it is either we do it for every doc or we don't do validation, but I might be wrong

@samsja samsja marked this pull request as draft November 1, 2022 13:12
@github-actions

github-actions bot commented Nov 2, 2022

📝 Docs are deployed on https://ft-feat-add-embed-and-evaluate-function--jina-docs.netlify.app 🎉

@guenthermi guenthermi marked this pull request as ready for review November 2, 2022 08:07
@guenthermi guenthermi changed the title feat: add first implemenation of embed_and_evaluate feat: Implementation of embed_and_evaluate Nov 2, 2022
@hanxiao hanxiao merged commit c38d82d into main Nov 2, 2022
@hanxiao hanxiao deleted the feat-add-embed-and-evaluate-function branch November 2, 2022 11:49
