chore: bump version 3.4.0 into chart and pyproject.toml files #159
Merged
a-klos merged 25 commits into stackitcloud:main on Nov 17, 2025
- Added DoclingFileExtractor to handle PDF, DOCX, PPTX, XLSX, and HTML formats using the Docling library.
- Introduced MarkitdownFileExtractor for processing various document types with MarkItDown.
- Updated pyproject.toml to restrict Python version to >=3.13,<3.14.
- Created unit tests for DoclingFileExtractor to ensure proper serialization of tables and handling of cell content.
…age and improve table handling
…nd add origin metadata in PDFExtractor
- Added new file types (MD, ASCIIDOC, CSV, TXT, IMAGE) to FileType enum.
- Refactored test cases to utilize new _FakeTable and _FakeDataFrame classes.
- Updated document extractor Dockerfile to include libleptonica-dev and pkg-config.
- Upgraded docling package to version 2.61.2 and added tesserocr as a dependency.
- Improved document upload component to dynamically display allowed file types.
- Localized upload descriptions to include allowed file types in both English and German.
- Introduced utility functions for document type validation, including MIME type and file extension checks.
- Added unit tests for document type validation logic.

- Created a plain text sample file with multiple lines.
- Added a sample Excel file with a significant amount of data.
- Introduced an XML sample file containing a title and a paragraph.

- Added module docstrings to various Python files for better clarity and maintainability.
- Improved function docstrings to provide detailed parameter and return information.
- Updated `flake8` and `pydocstyle` dependencies to ensure code quality checks are in place.
- Bumped version numbers for `admin-backend`, `document-extractor`, `mcp-server`, and `rag-backend` to 3.3.0.
- Added new dependencies for `flake8-docstrings` and `flake8-logging-format` to enhance linting capabilities.
- Cleaned up unnecessary whitespace and comments in test files.
…date TesseractImageExtractor's PSM default value
…cling_extractor_test
…nce test coverage
…ctor pipeline details
… templates and clear existing tags in values.yaml
a-klos approved these changes on Nov 17, 2025
This pull request updates the RAG application's version to 3.4.0 across the Helm chart, Python libraries, and service components. It also improves the Helm chart's image tag handling: the templates now default to the chart's app version when no explicit tag is set, and the default image tags in the values files are now empty, so the chart's app version is used unless overridden.
Version Updates
- Updated the version to 3.4.0 in the Python libraries and services (`pyproject.toml` files) and the Helm chart (`Chart.yaml`).
Helm Chart Improvements
- Image tag helpers now default to `.Chart.AppVersion` if no explicit tag is provided, making image versioning more robust and consistent. (`_admin_backend_and_extractor_helpers.tpl`, `_admin_frontend_helpers.tpl`, `_backend_helpers.tpl`, `_frontend_helpers.tpl`)
- Default image tags are now empty in `values.yaml` for all components, so the templates will use the chart's app version by default. (`backend`, `frontend`, `adminBackend`, `extractor`, `adminFrontend` sections)
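The tag fallback described in this pull request can be sketched roughly as follows. This is a minimal illustration, not the chart's actual code: the registry, file names, and key layout under `backend.image` are assumptions, and the real logic lives in the helper `.tpl` files rather than inline in a manifest. It relies only on Helm's built-in `default` template function and the `.Chart.AppVersion` object.

```yaml
# values.yaml (hypothetical excerpt)
# An empty tag means "no explicit tag": the template falls back to appVersion.
backend:
  image:
    repository: registry.example.com/rag-backend  # placeholder, not the real registry
    tag: ""

# templates/backend-deployment.yaml (hypothetical excerpt, shown as a comment
# so this block stays a single YAML document):
#
#   containers:
#     - name: backend
#       image: "{{ .Values.backend.image.repository }}:{{ .Values.backend.image.tag | default .Chart.AppVersion }}"
```

With `appVersion: 3.4.0` in `Chart.yaml`, the image above would be expected to render with the tag `3.4.0`. Passing an explicit tag, for example with `--set backend.image.tag=...`, would override the fallback, because `default` only substitutes when the given value is empty.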