Skip to content

refactor: remove table level stats on TableProvider#4174

Merged
ion-elgreco merged 3 commits intodelta-io:mainfrom
roeap:refactor/remote-df-table-stats
Feb 9, 2026
Merged

refactor: remove table level stats on TableProvider#4174
ion-elgreco merged 3 commits intodelta-io:mainfrom
roeap:refactor/remote-df-table-stats

Conversation

@roeap
Copy link
Copy Markdown
Collaborator

@roeap roeap commented Feb 7, 2026

Description

For delta tables, table level statistics are not quite as useful as file level stats. However we do go through quite some trouble to expose table level stats which also assume we always have a materialised log to expose these stats. As such, it hinders us in migration to a lazy architecture.

In fact the datafusion native file-based table implementation (parquet, json, csv, ...) only expose stats on the execution pkan level, and not on the table provider level.

In this PR we therefore remote the table level stats from the current table provider and remove the associated code.

@roeap roeap requested review from hntd187 and rtyler as code owners February 7, 2026 10:56
@github-actions github-actions Bot added the binding/rust Issues for the Rust crate label Feb 7, 2026
@roeap roeap requested a review from ion-elgreco February 7, 2026 10:57
@codecov
Copy link
Copy Markdown

codecov Bot commented Feb 7, 2026

Codecov Report

✅ All modified and coverable lines are covered by tests.
✅ Project coverage is 76.63%. Comparing base (a0cf6b1) to head (98425fa).
⚠️ Report is 1 commits behind head on main.

Additional details and impacted files
@@            Coverage Diff             @@
##             main    #4174      +/-   ##
==========================================
- Coverage   76.63%   76.63%   -0.01%     
==========================================
  Files         166      166              
  Lines       47052    46745     -307     
  Branches    47052    46745     -307     
==========================================
- Hits        36059    35823     -236     
+ Misses       9250     9187      -63     
+ Partials     1743     1735       -8     

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:
  • ❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.
  • 📦 JS Bundle Analysis: Save yourself from yourself by tracking and limiting bundle sizes in JS merges.

ion-elgreco
ion-elgreco previously approved these changes Feb 7, 2026
@ion-elgreco ion-elgreco enabled auto-merge (squash) February 8, 2026 04:22
@ion-elgreco ion-elgreco force-pushed the refactor/remote-df-table-stats branch from 0de6c32 to ad7c6c0 Compare February 8, 2026 04:22
This started failing after some changes that touch the table provider
because the order of rows returned was not the same as expected before.
My opinion is that ordering is not guaranteed unless an ordering
predicate is supplied 😄

Signed-off-by: R. Tyler Croy <[email protected]>
@github-actions github-actions Bot added the binding/python Issues for the Python package label Feb 8, 2026
@ion-elgreco ion-elgreco merged commit c4f9d42 into delta-io:main Feb 9, 2026
29 checks passed
@roeap roeap deleted the refactor/remote-df-table-stats branch February 9, 2026 14:48
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

binding/python Issues for the Python package binding/rust Issues for the Rust crate

Projects

Status: Done

Development

Successfully merging this pull request may close these issues.

3 participants