Added
- Benchmarking module (
comprexx.benchmark):cx.benchmark()measures inference
latency with warmup, percentiles (p50/p90/p99), and throughput.
cx.compare_benchmarks()returns before/after comparisons with speedup deltas.
Newcomprexx benchCLI command. - Example notebooks with cell outputs: ResNet18 edge deployment (fuse, prune,
benchmark, ONNX export) and linear layer compression (SVD, weight-only INT4,
dynamic INT8). - GitHub Actions CI: pytest on Python 3.10/3.11/3.12 + ruff lint.
- CHANGELOG.md covering v0.1.0 through v0.3.0.
Changed
- Silenced
torch.ao.quantization is deprecatedwarnings in PTQ stages. - Fixed
__version__to report the correct version. - Cleaned up all ruff lint errors across the codebase.
Stats
- 174 tests passing
- 9 compression techniques + sensitivity analysis
- Python 3.10+, PyTorch 2.0+
Full changelog: v0.2.0...v0.3.0