VioPTT: Violin Technique-Aware Transcription from Synthetic Data Augmentation

Installation

pip install -r requirements.txt

How to Use

Note: Before running any script, edit the path variables at the top of that script to match your setup.

Option A: Two-step pipeline

Run transcription and technique classification separately.

# Step 1. Audio to MIDI
# Edit AUDIO_DIR, OUTPUT_DIR, and CHECKPOINT_PATH in the script before running
bash scripts/run_HRPT_inference.sh

# Step 2. Recognize playing techniques for each note
# Requires the audio directory and the corresponding MIDI directory produced by Step 1
# Edit AUDIO_DIR, MIDI_DIR, OUTPUT_DIR, and checkpoint paths in the script before running
bash scripts/run_HRPT_inference_note_tech.sh

Option B: End-to-end

Transcription and technique classification in a single step; no pre-existing MIDI is required.

# Audio → MIDI + per-note technique labels (CSV) in one pass
# Edit AUDIO_DIR, OUTPUT_DIR, and checkpoint paths in the script before running
bash scripts/run_infer_technique.sh

Description

scripts/run_HRPT_inference.sh: Convert audio files to MIDI format
- Supports .wav, .mp3, .flac formats
- Can process a single file or an entire directory
- Configuration required: Edit the following variables at the top of the script:
  - AUDIO_DIR: Path to audio file or directory
  - OUTPUT_DIR: Output directory for MIDI files
  - CHECKPOINT_PATH: Path to transcription model checkpoint
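
The input rules above (a single file or a whole directory, limited to .wav, .mp3, and .flac) can be checked before launching the script. The following is a minimal sketch of such a pre-flight check, not the repository's own loader: the function name `collect_audio_files` is hypothetical, and the non-recursive directory scan is an assumption.

```python
from pathlib import Path

# Extensions listed in this README as supported input formats.
SUPPORTED_EXTENSIONS = {".wav", ".mp3", ".flac"}


def collect_audio_files(audio_path):
    """Return a sorted list of supported audio files found at audio_path.

    audio_path may be a single file or a directory. Directories are scanned
    non-recursively (an assumption; the actual script may behave differently).
    """
    path = Path(audio_path)
    if path.is_file():
        candidates = [path]
    elif path.is_dir():
        candidates = sorted(path.iterdir())
    else:
        raise FileNotFoundError(f"No such file or directory: {path}")
    # Keep regular files whose extension matches, ignoring case.
    return [
        p for p in candidates
        if p.is_file() and p.suffix.lower() in SUPPORTED_EXTENSIONS
    ]
```

Calling `collect_audio_files` on the value you plan to use for AUDIO_DIR shows which files would qualify as input; an empty list suggests the path or the file extensions need another look.
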
scripts/run_HRPT_inference_note_tech.sh: Recognize playing techniques for each note
- Requires an audio directory and the corresponding MIDI directory
- Writes the playing technique of each note to a CSV file
- Configuration required: Edit the following variables at the top of the script:
  - AUDIO_DIR: Directory containing audio files
  - MIDI_DIR: Directory containing corresponding MIDI files
  - OUTPUT_DIR: Output directory for technique CSV files
  - NOTE_MODEL_CHECKPOINT: Path to the note technique model checkpoint
  - TRANSCRIPTOR_CHECKPOINT: Path to the transcription model checkpoint
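
Because this step needs a MIDI file for every audio file, it can help to confirm the two directories line up before running it. The sketch below assumes that each MIDI file shares its filename stem with its audio file and ends in .mid or .midi; the README does not state this naming rule, and `pair_audio_with_midi` is a hypothetical helper, not part of the repository.

```python
from pathlib import Path

AUDIO_EXTENSIONS = {".wav", ".mp3", ".flac"}
# Assumption: Step 1 writes MIDI files with one of these extensions.
MIDI_EXTENSIONS = {".mid", ".midi"}


def pair_audio_with_midi(audio_dir, midi_dir):
    """Match audio and MIDI files by filename stem.

    Returns (pairs, unmatched), where pairs is a list of (audio, midi) paths
    and unmatched lists the audio files that have no MIDI counterpart.
    """
    midi_by_stem = {
        p.stem: p
        for p in sorted(Path(midi_dir).iterdir())
        if p.suffix.lower() in MIDI_EXTENSIONS
    }
    pairs, unmatched = [], []
    for audio in sorted(Path(audio_dir).iterdir()):
        if audio.suffix.lower() not in AUDIO_EXTENSIONS:
            continue  # skip non-audio files such as logs or notes
        midi = midi_by_stem.get(audio.stem)
        if midi is None:
            unmatched.append(audio)
        else:
            pairs.append((audio, midi))
    return pairs, unmatched
```

Any entries in `unmatched` point to audio files for which Step 1 has not yet produced output, or to a naming scheme that differs from the one assumed here.
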
scripts/run_infer_technique.sh: End-to-end transcription + technique classification
- Takes raw audio as input; no pre-existing MIDI needed
- Outputs per-note technique labels to CSV and a technique-annotated MIDI file
- Uses the full model (all transcription features, no ablation)
- Configuration required: Edit the following variables at the top of the script:
  - AUDIO_DIR: Directory containing audio files (.wav, .mp3, .flac)
  - OUTPUT_DIR: Output directory for CSV and MIDI files
  - NOTE_MODEL_CHECKPOINT: Path to note technique model (checkpoints/note_tech_model.pth)
  - TRANSCRIPTOR_CHECKPOINT: Path to transcription model checkpoint
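
Once a technique CSV has been written, a quick tally per label is a convenient sanity check. The sketch below uses only the Python standard library. The README does not document the CSV layout, so the default column name `technique` is a placeholder; pass the actual header name from your output file. `count_techniques` is a hypothetical helper, not part of the repository.

```python
import csv
from collections import Counter


def count_techniques(csv_path, technique_column="technique"):
    """Count how often each value appears in technique_column of a CSV file.

    technique_column is an assumed header name; check the real output file.
    """
    with open(csv_path, newline="", encoding="utf-8") as handle:
        reader = csv.DictReader(handle)
        if technique_column not in (reader.fieldnames or []):
            raise KeyError(
                f"Column {technique_column!r} not found; "
                f"available columns: {reader.fieldnames}"
            )
        # Counter consumes the rows while the file is still open.
        return Counter(row[technique_column] for row in reader)
```

If the column name is wrong, the raised error lists the headers that are present, which also reveals the real layout of the file.
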

