- Eindhoven, The Netherlands
- https://proycon.anaproy.nl
Research software engineer - NLP - 🐧 Linux & open-source enthusiast - 🦀 Rust / 🐍 Python / 🌊 C/C++ / 🐚 Shell - 🔐 InfoSec - Also at https://git.sr.ht/~proycon - Working at KNAW Humanities Cluster & Radboud University Nijmegen
Joined on 2024-08-29
Harvest and aggregate codemeta/schema.org software metadata from source repositories and service endpoints, automatically converting from known metadata schemes in the process
Updated 2026-03-18 14:43:57 +01:00
Convert software metadata descriptions in codemeta to html
Updated 2026-03-18 14:42:10 +01:00
A Python package for generating and working with codemeta
Updated 2026-03-18 13:43:13 +01:00
Command line tools for working with standoff text annotations (STAM)
Updated 2026-03-17 22:04:04 +01:00
Stand-off Text Annotation Model (STAM) is a data model for stand-off text annotation, in which any information about a text is represented as an annotation. This repository contains the model's full specification, extensions, schemas, examples and documentation.
Updated 2026-02-25 13:38:25 +01:00
Webservice for working with stand-off annotations on text (STAM)
Updated 2026-02-10 23:34:11 +01:00
An approximate string matching or fuzzy-matching system for spelling correction, normalisation or post-OCR correction
Updated 2026-02-10 23:16:26 +01:00
Colibri core is an NLP tool as well as a C++ and Python library for working with basic linguistic constructions such as n-grams and skipgrams (i.e. patterns with one or more gaps, either of fixed or dynamic size) in a quick and memory-efficient way.
Updated 2026-02-09 22:52:35 +01:00
codemeta to SSHOC Open Marketplace converter
Updated 2026-02-09 16:23:12 +01:00
Python binding for working with STAM, the Stand-off Text Annotation Model. Written in Rust.
Updated 2026-01-05 14:56:47 +01:00