v0.3.0

@maskedsyntax released this 11 Apr 08:41
· 1 commit to master since this release

Added

  • Benchmarking module (comprexx.benchmark): cx.benchmark() measures inference
    latency with warmup, percentiles (p50/p90/p99), and throughput.
    cx.compare_benchmarks() returns before/after comparisons with speedup deltas.
    New comprexx bench CLI command.
  • Example notebooks with cell outputs: ResNet18 edge deployment (fuse, prune,
    benchmark, ONNX export) and linear layer compression (SVD, weight-only INT4,
    dynamic INT8).
  • GitHub Actions CI: pytest on Python 3.10/3.11/3.12 + ruff lint.
  • CHANGELOG.md covering v0.1.0 through v0.3.0.
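
For readers unfamiliar with the measurement approach named in the benchmarking entry (warmup, percentiles, throughput, speedup), here is a minimal standard-library sketch. It is not comprexx's implementation, and it does not reproduce the signatures of cx.benchmark() or cx.compare_benchmarks(). The names benchmark_callable, compare_results, and percentile, along with every parameter and dictionary key, are hypothetical and chosen only for illustration.

```python
import math
import time
from typing import Callable, Dict, List


def percentile(sorted_values: List[float], q: int) -> float:
    """Nearest-rank percentile of an ascending list; q is an integer in 1..100."""
    rank = max(1, math.ceil(q * len(sorted_values) / 100))
    return sorted_values[rank - 1]


def benchmark_callable(fn: Callable[[], object], warmup: int = 10,
                       runs: int = 100, batch_size: int = 1) -> Dict[str, float]:
    """Time fn() and summarise latency. Hypothetical helper, not the comprexx API."""
    # Warmup iterations are executed but not timed, so one-off costs
    # (caches, lazy initialisation) do not skew the statistics.
    for _ in range(warmup):
        fn()

    latencies_ms = []
    for _ in range(runs):
        start = time.perf_counter()
        fn()
        latencies_ms.append((time.perf_counter() - start) * 1000.0)

    latencies_ms.sort()
    mean_ms = sum(latencies_ms) / runs
    return {
        "mean_ms": mean_ms,
        "p50_ms": percentile(latencies_ms, 50),
        "p90_ms": percentile(latencies_ms, 90),
        "p99_ms": percentile(latencies_ms, 99),
        # Samples processed per second, derived from the mean latency.
        "throughput_per_s": batch_size * 1000.0 / mean_ms if mean_ms > 0 else float("inf"),
    }


def compare_results(before: Dict[str, float], after: Dict[str, float]) -> Dict[str, float]:
    """Before/after comparison: speedup > 1.0 means `after` is faster."""
    return {
        "speedup": before["mean_ms"] / after["mean_ms"],
        "p50_delta_ms": after["p50_ms"] - before["p50_ms"],
        "p99_delta_ms": after["p99_ms"] - before["p99_ms"],
    }
```

With a PyTorch model, fn would typically be something like lambda: model(x) run under torch.no_grad(). On a GPU, the timed region also needs torch.cuda.synchronize() so that asynchronous kernels are included in the measurement.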

Changed

  • Silenced the "torch.ao.quantization is deprecated" warnings emitted in
    post-training quantization (PTQ) stages.
  • Fixed __version__ to report the correct version.
  • Cleaned up all ruff lint errors across the codebase.
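
As a rough illustration of how a deprecation warning like the one mentioned above can be silenced locally, here is a sketch using Python's standard warnings module. How comprexx actually suppresses the warning is not stated in these notes. The helper name run_quietly and the message pattern are assumptions; the pattern is based only on the wording quoted in this changelog.

```python
import warnings
from typing import Callable, TypeVar

T = TypeVar("T")

# Assumed message pattern, taken from the wording in the release notes.
# The `message` argument is a regex matched against the start of the warning text.
DEPRECATION_PATTERN = r".*torch\.ao\.quantization is deprecated"


def run_quietly(fn: Callable[[], T], pattern: str = DEPRECATION_PATTERN) -> T:
    """Call fn() with matching warnings ignored. Hypothetical helper for illustration."""
    # catch_warnings() restores the previous filter state on exit, so the
    # suppression stays scoped to this call and other warnings still surface.
    with warnings.catch_warnings():
        warnings.filterwarnings("ignore", message=pattern)
        return fn()
```

Scoping the filter to a context manager, rather than installing it globally at import time, avoids hiding the same warning in the user's own code.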

Stats

  • 174 tests passing
  • 9 compression techniques + sensitivity analysis
  • Python 3.10+, PyTorch 2.0+

Full changelog: v0.2.0...v0.3.0