Releases: google/budoux
v0.8.1
What's Changed
- docs(README): fix minor typo by @strogonoff in #1168
- Upgrade GitHub Actions to latest versions by @salmanmkc in #1167
- chore(deps-dev): bump @vitest/browser from 4.0.18 to 4.1.0 in /javascript by @dependabot[bot] in #1175
- chore(deps-dev): bump org.apache.maven.plugins:maven-source-plugin from 3.3.0 to 3.4.0 in /java by @dependabot[bot] in #1169
- chore(deps): bump step-security/harden-runner from 2.15.1 to 2.16.0 by @dependabot[bot] in #1171
- chore(deps): bump github/codeql-action from 4.32.6 to 4.33.0 by @dependabot[bot] in #1170
- chore(deps-dev): bump @types/node from 25.3.5 to 25.5.0 in /javascript by @dependabot[bot] in #1174
- chore(deps): bump the patch group across 2 directories with 3 updates by @dependabot[bot] in #1172
- Fix package.json error by @tushuhei in #1165
- Update pom.xml to include javadoc jar and GPG signatures in release by @tushuhei in #1166
- chore(deps-dev): bump org.apache.maven.plugins:maven-gpg-plugin from 3.2.7 to 3.2.8 in /java in the patch group by @dependabot[bot] in #1177
- chore(deps-dev): bump @biomejs/biome from 2.4.7 to 2.4.8 in /javascript in the patch group across 1 directory by @dependabot[bot] in #1178
- chore(deps-dev): bump the patch group across 1 directory with 4 updates by @dependabot[bot] in #1181
- chore(deps): bump requests from 2.32.5 to 2.33.0 in the uv group across 1 directory by @dependabot[bot] in #1184
- chore(deps): bump github/codeql-action from 4.33.0 to 4.34.1 by @dependabot[bot] in #1180
- chore(deps): bump actions/deploy-pages from 4.0.5 to 5.0.0 by @dependabot[bot] in #1179
- Fix Typescript 6.0.2 compilation errors by @tushuhei in #1183
- chore(deps): bump cryptography from 46.0.5 to 46.0.6 in the uv group across 1 directory by @dependabot[bot] in #1185
- Bump picomatch to 4.0.4 by @tushuhei in #1187
- Bump to v0.8.1 by @tushuhei in #1186
New Contributors
- @strogonoff made their first contribution in #1168
- @salmanmkc made their first contribution in #1167
Full Changelog: v0.8.0...v0.8.1
v0.8.0
What's Changed
- fix(js): add missing exports for webcomponents by @tushuhei in #1018
- Update Java to JDK 21 by @tushuhei in #1060
- Migrate from Karma + Jasmine to Vitest by @tushuhei in #1110
- Replace http-server with serve by @tushuhei in #1113
- Modernize Python package management to use pyproject.toml and uv by @tushuhei in #1124
- Add conflict finder by @tushuhei in #1135
- Migrate from ESLint + Prettier to Biome by @tushuhei in #1142
- [experimental] Add GitHub Action for KNBC model training on Apple Silicon by @tushuhei
- Refactor license and version attribute in pyproject.toml by @tushuhei in #1150
- Cache base score for parsing by @shuofengzhang in #1151
- feat: Make parser model effectively read-only by @tushuhei in #1152
- feat: Improve Python 3.10 compatibility for dev environment by @tushuhei in #1153
- java: update pom.xml for release and artifact registry publishing by @tushuhei in #1155
New Contributors
- @shuofengzhang made their first contribution in #1151
Full Changelog: v0.7.0...v0.8.0
v0.8.0rc1
What's Changed
- fix(js): add missing exports for webcomponents by @tushuhei in #1018
- Update Java to JDK 21 by @tushuhei in #1060
- Migrate from Karma + Jasmine to Vitest by @tushuhei in #1110
- Replace http-server with serve by @tushuhei in #1113
- Modernize Python package management to use pyproject.toml and uv by @tushuhei in #1124
- Add conflict finder by @tushuhei in #1135
- Migrate from ESLint + Prettier to Biome by @tushuhei in #1142
Full Changelog: v0.7.0...v0.8.0-rc
v0.7.0
Breaking changes
- JSDOM is replaced with linkedom, which makes our JavaScript port SSR-friendly. #888
- The deprecated `HTMLProcessor.applyElement` method is deleted. #926
What's Changed
- Add exports in package.json by @tushuhei in #892
- Remove duplicate NodeType definition by @tushuhei in #904
- Replace JSDOM with linkedom by @tushuhei in #905
- Isolate functions only used by tests by @tushuhei in #906
- Delete the deprecated `applyElement` function by @tushuhei in #926
Full Changelog: v0.6.4...v0.7.0
v0.6.4
What's Changed
- Use updated maven-javadoc-plugin by @tushuhei in #779
- Make BudouX Web Components work without setInnerHtml by @tushuhei in #813
- Deploy demo site directly with GitHub Actions by @tushuhei in #821
- Remove unused blank parser in the abstract webcomponent class by @tushuhei in #829
- Add notes on Web Components by @tushuhei in #831
- Add type guards for NodeOrText by @tushuhei in #830
- Fix python typing issue by @tushuhei in #845
- Move applyWrapStyle to html_processor.ts by @tushuhei in #846
- Add the `--phrase-delimiter` option to the Python command line by @kojiishi in #842
- Add sep option to JavaScript CLI by @tushuhei in #849
Full Changelog: v0.6.3...v0.6.4
v0.6.3
What's Changed
- Show failing sentences for quality test by @tushuhei in #453
- Japanese model improvement by @tushuhei in #454
- Bundle budoux-th Web Components by @tushuhei in #452
- Support new CSS `display` syntax by @kojiishi in #483
- Remove Node.js 16 support and add Node.js 22 support by @tushuhei in #484
- Replace WBR with ZWSP in demo page by @tushuhei in #494
- Sort ICU format output by key by @tushuhei in #495
- Update JS README about Web Workers by @tushuhei in #509
- [security] Include DOMPurify in the demo bundle by @tushuhei in #658
- [nodejs] Override with [email protected] to suppress punycode deprecation warning by @tushuhei in #657
- [demo] Bind the input content and the query param by @tushuhei in #656
- Migrate to eslint flat config using @eslint/migrate-config by @tushuhei in #673
- Mention Korean support in README.md by @tushuhei in #701
- Correct a small typo/missing word in README.md by @adamsilverstein in #746
- [Java] Handle comment nodes by @tushuhei in #764
- [Java] Skip node at the end of input by @tushuhei in #765
New Contributors
- @adamsilverstein made their first contribution in #746
Full Changelog: v0.6.2...v0.6.3
v0.6.2
Thai is now supported! 🎉
What's Changed
- Add the scale argument to encode_data.py by @tushuhei in #408
- Nit fix for an ignored test by @tushuhei in #407
- Ja model improvement by @tushuhei in #410
- Add granularity option to prepare_knbc.py by @tushuhei in #417
- Add Thai language support by @tushuhei in #421
- Improve typing by @amitmarkel in #426
- Update README for Thai support by @tushuhei in #429
- Rename @returns to @return by @tushuhei in #415
New Contributors
- @amitmarkel made their first contribution in #426
Full Changelog: v0.6.1...v0.6.2
v0.6.1
What's Changed
- Bump @typescript-eslint/eslint-plugin from 6.9.1 to 6.10.0 in /javascript by @dependabot in #353
- Bump org.apache.maven.plugins:maven-surefire-plugin from 3.2.1 to 3.2.2 in /java by @dependabot in #354
- Bump actions/dependency-review-action from 3.1.1 to 3.1.2 by @dependabot in #357
- Bump @types/node from 20.8.3 to 20.9.0 in /javascript by @dependabot in #356
- Support weighted samples by @tushuhei in #358
- Fix unpaired close tags and self-closing tags by @kojiishi in #360
- [Java] Stop emitting close tags if self-closing by @kojiishi in #362
- Update Google Java Format action by @tushuhei in #363
- Bump actions/dependency-review-action from 3.1.2 to 3.1.3 by @dependabot in #364
- Bump @typescript-eslint/eslint-plugin from 6.10.0 to 6.11.0 in /javascript by @dependabot in #365
- [java] Fix errors by collapsed white spaces and `<br>` by @kojiishi in #367
- Bump github/codeql-action from 2.22.5 to 2.22.6 by @dependabot in #368
- [java] Replace `wholeText()` with `NodeVisitor` by @kojiishi in #369
- Implement tail for node visitor by @tushuhei in #370
- Update jsoup to 1.16.2 by @tushuhei in #371
- Version up to 0.6.1 by @tushuhei in #372
Full Changelog: v0.6.0...v0.6.1
v0.6.0
Noteworthy changes
- BudouX Web Components don't use Shadow DOM anymore. The segmentation results will be reflected in their Light DOM, where the global styles can apply. #291
- Phrases are segmented by ZWSP (U+200B) not `<wbr>` for a better screen reader experience. #346
- You can insert non-breaking markup (`<nobr>` and `white-space: nowrap`) when you have a phrase you don't want to break. #240
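To illustrate the ZWSP change above, here is a minimal sketch of what separating phrases with U+200B looks like. It does not call the BudouX API: `join_phrases` is a hypothetical helper, and the phrase list is a hand-written stand-in for a segmentation result.

```python
# Minimal sketch: join already-segmented phrases with ZWSP (U+200B).
# `join_phrases` is a hypothetical helper, not part of BudouX, and the
# phrase list below is a hand-written example rather than parser output.

ZWSP = "\u200b"  # ZERO WIDTH SPACE: an invisible line-break opportunity


def join_phrases(phrases: list[str], separator: str = ZWSP) -> str:
    """Return the phrases concatenated with the separator between them."""
    return separator.join(phrases)


phrases = ["今日は", "天気です。"]
joined = join_phrases(phrases)

# The separator is invisible, so removing it restores the original text.
restored = joined.replace(ZWSP, "")
```

Because the separator is a plain character rather than an element, the visible text is unchanged and can be recovered by stripping U+200B.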
What's Changed
- Remove dependency to gts by @tushuhei in #187
- Add `Parser.parseBoundaries` for JavaScript by @kojiishi in #234
- Replace `slice` with `substring` by @kojiishi in #241
- Support non-breaking content (`<nobr>` and `white-space: nowrap`) by @kojiishi in #240
- Make scripts run without install by @tushuhei in #239
- Add permissions to style check action by @tushuhei in #246
- Specify maxsplit to handle colon symbols properly by @tushuhei in #247
- Support non-breaking content in java by @kojiishi in #248
- Support non-breaking content in Python by @kojiishi in #251
- Nit: use get_nowait instead of get by @tushuhei in #253
- Remove utils from JavaScript module by @tushuhei in #262
- Move hasChildTextNode to HTML Processor by @tushuhei in #274
- Fix mypy issues by @tushuhei in #308
- Fix Python dependency issues by @tushuhei in #316
- Avoid inserting separators to where the source has one by @kojiishi in #342
- [Web Components] Use Light DOM instead of Shadow DOM by @tushuhei in #291
- Use ZWSP instead of WBR by @tushuhei in #346
- [Java] Use ArrayDeque instead of Stack by @tushuhei in #349
- Rename applyElement to applyToElement by @tushuhei in #348
- Update README to use ZWSP by @tushuhei in #347
- Version up to 0.6.0 by @tushuhei in #343
Full Changelog: v0.5.2...v0.6.0
v0.5.2
What's Changed
- Use overflow-wrap: anywhere; instead of overflow-wrap: break-word; by @tamanyan in #144
- Add a script to finetune models. by @tushuhei in #145
- Add quality regression test by @tushuhei in #146
- Release finetuned model by @tushuhei in #147 #154 #161
- Add validation data arg to train.py by @tushuhei in #148
- Remove direct dependency to NumPy by @tushuhei in #149
- Add a README for BudouX Scripts by @tushuhei in #155
- Add score scale arg to build_model.py by @tushuhei in #156
- Separate HTML processing as a mixin by @tushuhei in #159
New Contributors
- @step-security-bot made their first contribution in #163
Full Changelog: v0.5.1...v0.5.2