Redlines

Redlines produces a text showing the differences between two strings/text. The changes are represented with strike-throughs and underlines, which looks similar to Microsoft Word's track changes. This method of showing changes is more familiar to lawyers and is more compact for long series of characters.

Redlines uses SequenceMatcher to find differences between words used. The output can be in HTML, Markdown, or rich format.

Example

Given an original string:

The quick brown fox jumps over the lazy dog.

And the string to be tested with:

The quick brown fox walks past the lazy dog.

The library gives a result of:

The quick brown fox <del>jumps over </del><ins>walks past </ins>the lazy dog.

Which is rendered like this:

The quick brown fox ~~jumps over~~ walks past the lazy dog.

The library can also output the results in Markdown, HTML or rich format, and for a variety of environments like Streamlit, Jupyter Notebooks, Google Colab and the terminal.

Install

pip install redlines

Optional: Install with NupunktProcessor support

For advanced sentence boundary detection (requires Python 3.11+):

pip install redlines[nupunkt]

The NupunktProcessor provides intelligent sentence tokenization that handles:

Abbreviations (Dr., Mr., etc.)
Decimals and numbers (3.14, $5.99)
URLs and email addresses
Legal citations and complex punctuation

See the Usage section below for more details.

Usage

The library contains one class: Redlines, which is used to compare text.

from redlines import Redlines

test = Redlines(
    "The quick brown fox jumps over the lazy dog.",
  "The quick brown fox walks past the lazy dog.", markdown_style="none",
)
assert (
        test.output_markdown
        == "The quick brown fox <del>jumps over </del><ins>walks past </ins>the lazy dog."
)

Alternatively, you can create Redline with the text to be tested, and compare several times to see the results.

from redlines import Redlines

test = Redlines("The quick brown fox jumps over the lazy dog.", markdown_style="none")
assert (
        test.compare("The quick brown fox walks past the lazy dog.")
        == "The quick brown fox <del>jumps over </del><ins>walks past </ins>the lazy dog."
)

assert (
        test.compare("The quick brown fox jumps over the dog.")
        == "The quick brown fox jumps over the <del>lazy </del>dog."
)

Advanced: Custom Processors

Redlines supports custom processors for different tokenization strategies. By default, it uses WholeDocumentProcessor which tokenizes at the paragraph level.

Using NupunktProcessor

For sentence-level tokenization with intelligent boundary detection (requires pip install redlines[nupunkt]):

from redlines import Redlines
from redlines.processor import NupunktProcessor

# Use NupunktProcessor for better handling of abbreviations and complex punctuation
processor = NupunktProcessor()
test = Redlines(
    "Dr. Smith said hello. Mr. Jones replied.",
    "Dr. Smith said hi. Mr. Jones replied.",
    processor=processor
)

When to use NupunktProcessor:

Legal or technical documents with many abbreviations
Text with URLs, emails, or complex citations
When you need sentence-level granularity
Documents with decimal numbers that shouldn't be treated as sentence boundaries

When to use WholeDocumentProcessor (default):

Simple documents without complex sentence structures
When speed is critical (5-6x faster than NupunktProcessor)
When paragraph-level granularity is sufficient

See the demo comparison for detailed performance and accuracy benchmarks.

Command Line Tool

Redlines also features a simple command line tool redlines to visualise the differences in text in the terminal.

 Usage: redlines text [OPTIONS] SOURCE TEST

 Compares the strings SOURCE and TEST and produce a redline in the terminal.

You may also want to check out the demo project redlines-textual.

Documentation

Read the available Documentation.

Uses

View and mark changes in legislation: PLUS Explorer
Visualise changes after ChatGPT transforms a text: ChatGPT Prompt Engineering for Developers Lesson 6

License

MIT License

Name		Name	Last commit message	Last commit date
Latest commit History 113 Commits
.github/workflows		.github/workflows
demo		demo
redlines		redlines
tests		tests
.gitignore		.gitignore
CONTRIBUTING.md		CONTRIBUTING.md
Code_of_Conduct.md		Code_of_Conduct.md
LICENSE.txt		LICENSE.txt
README.md		README.md
poetry.lock		poetry.lock
pyproject.toml		pyproject.toml
repository-open-graph.png		repository-open-graph.png
tox.ini		tox.ini
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Redlines

Example

Install

Optional: Install with NupunktProcessor support

Usage

Advanced: Custom Processors

Using NupunktProcessor

Command Line Tool

Documentation

Uses

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Redlines

Example

Install

Optional: Install with NupunktProcessor support

Usage

Advanced: Custom Processors

Using NupunktProcessor

Command Line Tool

Documentation

Uses

License

About

Resources

License

Code of conduct

Contributing

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages