docs: index predefined documents by JohannesMessner · Pull Request #1434 · docarray/docarray

JohannesMessner · 2023-04-24T09:48:43Z

Explains how to index predefined documents into a document index

Signed-off-by: Johannes Messner <[email protected]>

github-actions · 2023-04-24T09:53:11Z

📝 Docs are deployed on https://ft-docs-predefined-index--jina-docs.netlify.app 🎉

alexcg1 · 2023-04-24T10:06:38Z

docs/user_guide/storing/docindex.md

+
+### Using a predefined Document as schema
+
+DocArray offers a number of predefined Documents, like [ImageDoce][docarray.documents.ImageDoc] and [TextDoc][docarray.documents.TextDoc].


Suggested change

DocArray offers a number of predefined Documents, like [ImageDoce][docarray.documents.ImageDoc] and [TextDoc][docarray.documents.TextDoc].

DocArray offers a number of predefined Documents, like [ImageDoc][docarray.documents.ImageDoc] and [TextDoc][docarray.documents.TextDoc].

alexcg1 · 2023-04-24T10:06:56Z

docs/user_guide/storing/docindex.md

+
+DocArray offers a number of predefined Documents, like [ImageDoce][docarray.documents.ImageDoc] and [TextDoc][docarray.documents.TextDoc].
+If you try to use these directly as a schema for a Document Index, you will get unexpected behavior:
+Depending on the backend, and exception will be raised, or no vector index for ANN lookup will be built.


Suggested change

Depending on the backend, and exception will be raised, or no vector index for ANN lookup will be built.

Depending on the backend, an exception will be raised, or no vector index for ANN lookup will be built.

alexcg1 · 2023-04-24T10:07:26Z

docs/user_guide/storing/docindex.md

+    ```
+
+Once the schema of your Document Index is defined in this way, the data that you are indexing can be either of the
+predefined Document type, or of your custom Document type.


Suggested change

predefined Document type, or of your custom Document type.

predefined Document types, or your custom Document type.

alexcg1 · 2023-04-24T10:07:50Z

docs/user_guide/storing/docindex.md

    - A and B have the same field names and field types
    - A and B have the same field names, and, for every field, the type of B is a subclass of the type of A

+    In particular this means that you can easily [index predefined Documents](#using-a-predefined-document-as-schema) into a Document Index.


Suggested change

In particular this means that you can easily [index predefined Documents](#using-a-predefined-document-as-schema) into a Document Index.

In particular, this means that you can easily [index predefined Documents](#using-a-predefined-document-as-schema) into a Document Index.

What's the policy on capitalizing Document now that we don't use that class name? I think @samsja mentioned on Discord we don't do that any more.

I think we should still capitalize it, since it is a concept in our library. Lowercased it looks a bit weird and "unofficial" to me. Plus, I think the rule of thumb was always that "concepts" are capitalized, whereas classes go in between backticks

No strong feeling here. But tehcnically speaking Document is not a concept in term of code in the library

I think it is a concept but just not a class, otherwise "concept" and "class" would be synonyms. But I just checked the pydantic documentation, they don't capitalize "model". So no strong feeling either

JohannesMessner · 2023-04-24T10:18:31Z

@alexcg1 I was a bit fast on the trigger there, your suggested fixes are here: #1436

docs: index predefined documents

62da10d

Signed-off-by: Johannes Messner <[email protected]>

JohannesMessner requested review from AnneYang720 and alexcg1 April 24, 2023 09:48

github-actions bot added size/s area/docs labels Apr 24, 2023

samsja approved these changes Apr 24, 2023

View reviewed changes

JohannesMessner merged commit fad1290 into main Apr 24, 2023

JohannesMessner deleted the docs-predefined-index branch April 24, 2023 10:06

alexcg1 suggested changes Apr 24, 2023

View reviewed changes

samsja mentioned this pull request Apr 26, 2023

v0.31.0 release note draft #1456

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

docs: index predefined documents#1434

docs: index predefined documents#1434
JohannesMessner merged 1 commit intomainfrom
docs-predefined-index

JohannesMessner commented Apr 24, 2023

Uh oh!

github-actions bot commented Apr 24, 2023

Uh oh!

alexcg1 Apr 24, 2023

Uh oh!

alexcg1 Apr 24, 2023

Uh oh!

alexcg1 Apr 24, 2023

Uh oh!

alexcg1 Apr 24, 2023

Uh oh!

alexcg1 Apr 24, 2023

Uh oh!

JohannesMessner Apr 24, 2023

Uh oh!

samsja Apr 24, 2023

Uh oh!

JohannesMessner Apr 24, 2023

Uh oh!

JohannesMessner commented Apr 24, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants


		### Using a predefined Document as schema

		DocArray offers a number of predefined Documents, like [ImageDoce][docarray.documents.ImageDoc] and [TextDoc][docarray.documents.TextDoc].

	Depending on the backend, and exception will be raised, or no vector index for ANN lookup will be built.
	Depending on the backend, an exception will be raised, or no vector index for ANN lookup will be built.

	predefined Document type, or of your custom Document type.
	predefined Document types, or your custom Document type.

	In particular this means that you can easily [index predefined Documents](#using-a-predefined-document-as-schema) into a Document Index.
	In particular, this means that you can easily [index predefined Documents](#using-a-predefined-document-as-schema) into a Document Index.

Conversation

JohannesMessner commented Apr 24, 2023

Uh oh!

github-actions bot commented Apr 24, 2023

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

JohannesMessner commented Apr 24, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants