HASMAC-AS/ttl-chunker

ttl-chunker

Splits a Turtle file into smaller Turtle files.

Prerequisites

You need a JDK and Apache Maven installed. Verify your installations:

java -version
mvn -version

Building

Compile the project and package it into a JAR file:

mvn package

The resulting JAR is written to target/ttl-chunker-1.0-SNAPSHOT.jar.

Running

Pass the input file and the target chunk size:

java -jar target/ttl-chunker-1.0-SNAPSHOT.jar input.ttl 128MB

An optional third argument sets the output directory:

java -jar target/ttl-chunker-1.0-SNAPSHOT.jar input.ttl 128MB ./chunks

The chunk size argument accepts:

  • raw bytes: 50000000
  • binary-ish suffixes: 64KB, 128MB, 2GB
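
To make the accepted size formats concrete, here is a minimal sketch of how such an argument could be parsed. It is not the project's actual parser: the class name ChunkSizeSketch is hypothetical, and treating KB, MB and GB as multiples of 1024 is an assumption based only on the "binary-ish" wording above.

```java
import java.util.Locale;

// Hypothetical helper for illustration; not part of ttl-chunker itself.
final class ChunkSizeSketch {

    // Parses "50000000", "64KB", "128MB" or "2GB" into a byte count.
    // Assumption: suffixes are powers of 1024 (the README only says "binary-ish").
    static long parse(String text) {
        String s = text.trim().toUpperCase(Locale.ROOT);
        long multiplier = 1L;
        if (s.endsWith("KB")) {
            multiplier = 1024L;
        } else if (s.endsWith("MB")) {
            multiplier = 1024L * 1024L;
        } else if (s.endsWith("GB")) {
            multiplier = 1024L * 1024L * 1024L;
        }
        // Strip the two-character suffix when one was recognised.
        String digits = multiplier == 1L ? s : s.substring(0, s.length() - 2).trim();
        long value = Long.parseLong(digits); // NumberFormatException on malformed input
        if (value <= 0) {
            throw new IllegalArgumentException("chunk size must be positive: " + text);
        }
        // multiplyExact guards against silent overflow for very large values.
        return Math.multiplyExact(value, multiplier);
    }
}
```

Under these assumptions, ChunkSizeSketch.parse("128MB") yields 134217728 bytes, and a bare number such as 50000000 is returned unchanged.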

Behavior:

  • splits on Turtle statement/directive boundaries
  • keeps chunk size approximate, not exact
  • prepends every chunk with all @prefix / @base directives seen so far
  • default output dir: <input-name>-chunks/
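
The behavior listed above can be sketched roughly as follows. This is a simplified illustration, not the code in TurtleChunker.java: it assumes the input has already been split into complete statements and directives (the job of the block reader), keeps chunks in memory as strings instead of writing files, and counts only the chunk body toward the size target. The class and method names are hypothetical.

```java
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.List;

// Hypothetical sketch of the chunking rules; not the project's implementation.
final class ChunkingSketch {

    // Assumption: only @prefix / @base forms are treated as directives here.
    static boolean isDirective(String block) {
        String t = block.trim();
        return t.startsWith("@prefix") || t.startsWith("@base");
    }

    // blocks: complete Turtle statements or directives, in input order.
    // targetBytes: approximate chunk size; a chunk closes once it reaches it.
    static List<String> chunk(List<String> blocks, long targetBytes) {
        List<String> chunks = new ArrayList<>();
        StringBuilder directivesSoFar = new StringBuilder();
        StringBuilder current = null;
        long bodyBytes = 0;

        for (String block : blocks) {
            if (current == null) {
                // Start a new chunk with every directive seen so far.
                current = new StringBuilder(directivesSoFar);
                bodyBytes = 0;
            }
            current.append(block).append('\n');
            bodyBytes += block.getBytes(StandardCharsets.UTF_8).length + 1;
            if (isDirective(block)) {
                directivesSoFar.append(block).append('\n');
            }
            // Blocks are never cut in half, so sizes are approximate.
            if (bodyBytes >= targetBytes) {
                chunks.add(current.toString());
                current = null;
            }
        }
        if (current != null) {
            chunks.add(current.toString());
        }
        return chunks;
    }
}
```

Because a block is always written whole, a chunk can overshoot the target by up to the size of its last block. Repeating the directives at the top of each chunk is what lets every output file be parsed on its own.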

Project Structure

ttl-chunker/
├── pom.xml                                          # Maven build descriptor
└── src/
    ├── main/java/no/hasmac/ttlchunker/
    │   ├── TurtleBlockReader.java                   # Streaming Turtle block reader
    │   └── TurtleChunker.java                       # CLI + chunk writer
    └── test/java/no/hasmac/ttlchunker/
        └── TurtleChunkerTest.java                   # Regression coverage
