Version 1.2.0 #446

Merged
JaMe76 merged 9 commits into master from version_1.2.0
Mar 2, 2026

@JaMe76 JaMe76 commented Mar 2, 2026

This PR

  • refactors subclasses of ObjectTypes.

    • Subclasses are now divided by concept: each is either a key, a label, a code (LanguageCode), or a kind. Some more ObjectTypes members have been added as well (DocumentFileLabel).
    • Two functions, register_custom_token_tag and register_string_categories_from_list, have been added for registering ObjectTypes dynamically.
  • adds a container annotation registry. Subclasses of ContainerAnnotation can now be built so that they are serialized correctly: add _container_type to the serialized version and register the subclass.

  • adds a new class Document.
    • Document is a class for bundling multiple pages. It supports lazy loading, document-level annotations, JSON serialization/deserialization, and Pipeline metadata tracking.
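
To illustrate the registry idea behind the ContainerAnnotation change, here is a minimal, self-contained sketch. It does not use the library's API: `BaseContainer`, `ScoredContainer`, `register_container` and `_CONTAINER_REGISTRY` are hypothetical names made up for this example. Only the `_container_type` key is taken from the description above. The point is that a type tag stored in the serialized dict lets deserialization pick the registered subclass again.

```python
from dataclasses import asdict, dataclass
from typing import Any, Dict, Type

# Hypothetical registry: maps a type tag to the class that can rebuild it.
_CONTAINER_REGISTRY: Dict[str, Type["BaseContainer"]] = {}


def register_container(cls):
    """Hypothetical class decorator that records a class under its name."""
    _CONTAINER_REGISTRY[cls.__name__] = cls
    return cls


@dataclass
class BaseContainer:
    """Stand-in for a container annotation; not the library's class."""

    value: Any = None

    def as_dict(self) -> Dict[str, Any]:
        data = asdict(self)
        # Store the concrete type so deserialization can pick the right class.
        data["_container_type"] = type(self).__name__
        return data

    @staticmethod
    def from_dict(data: Dict[str, Any]) -> "BaseContainer":
        payload = dict(data)  # copy so the caller's dict is not mutated
        type_name = payload.pop("_container_type", "BaseContainer")
        # Unknown tags fall back to the base class in this sketch.
        cls = _CONTAINER_REGISTRY.get(type_name, BaseContainer)
        return cls(**payload)


register_container(BaseContainer)


@register_container
@dataclass
class ScoredContainer(BaseContainer):
    """Example subclass carrying one extra field."""

    score: float = 0.0


# Round trip: the subclass survives serialization because it was registered.
restored = BaseContainer.from_dict(
    ScoredContainer(value="total", score=0.9).as_dict()
)
```

Without the registration step, the round trip in this sketch would silently fall back to the base class and fail on the unexpected `score` field, which is the kind of problem a registry is meant to prevent.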

@JaMe76 JaMe76 merged commit dd00f4f into master Mar 2, 2026
32 checks passed