TrackID

Automatically identify tracklists from SoundCloud DJ mixes using audio fingerprinting. This is a hobby project, it is not in active development, so best to not take this too seriously.

Features

🎵 Downloads audio from SoundCloud URLs
🔍 Identifies tracks using Shazam's audio fingerprinting
📊 Generates timestamped tracklists with metadata
🌐 Web interface for easy interaction
🔄 Support for proxy rotation to avoid rate limits
💾 Saves results locally for quick re-access
📚 Browse and reload previously processed mixes
⚡ Flexible processing stages (skip download or recognition steps)
🎯 Dynamic confidence scoring with customizable thresholds

Technologies

yt-dlp - Audio downloading
ShazamIO - Track identification
Streamlit - Web interface / frontend
FFmpeg - Audio segmentation

Getting Started

Prerequisites

Before installing, ensure you have the following installed:

Python 3.12.4 (managed via pyenv)
pyenv - Python version management (Installation guide)
pyenv-virtualenv - Python virtual environment plugin (Installation guide)
Poetry - Python dependency management (Installation guide)

FFmpeg - Audio processing library

# macOS
brew install ffmpeg

# Ubuntu/Debian
sudo apt-get install ffmpeg

Installation

Clone the repository:

git clone https://github.com/yourusername/trackid.git
cd trackid

Install the development environment:

make install_dev

This command will:

Install Python 3.12.4 via pyenv
Create a virtual environment named trackid-3.12.4
Install all dependencies via Poetry
Set up the Jupyter kernel for notebooks

Configuration

Settings File

Configure the application's default settings by editing trackid/settings.py:

Audio Processing Settings

SEGMENT_LENGTH = 30  # Duration of each audio segment in seconds
MAX_AUDIO_HOURS = 4  # Maximum audio length to process (in hours)
MAX_AUDIO_LENGTH = 60 * 60 * MAX_AUDIO_HOURS  # Calculated time limit in seconds

SEGMENT_LENGTH: The mix is split into segments of this duration for recognition. Shorter segments = more API calls but better accuracy for transitions.
MAX_AUDIO_HOURS: Limits processing time to avoid excessive API usage on very long mixes.

Recognition Settings

SEMAPHORE_LIMIT = 100  # Number of concurrent API requests
BATCH_SIZE = 100  # Number of segments per batch
REQUEST_BATCH_DELAY = 3 # Delay (seconds) between batches

SEMAPHORE_LIMIT: Controls concurrent requests to Shazam API. Higher = faster but may hit rate limits.
BATCH_SIZE: Process segments in batches to manage memory and track progress.
REQUEST_BATCH_DELAY: Wait time between batches to avoid rate limiting.

Proxy Configuration

USE_PROXY = True  # Enable/disable proxy usage

Using Proxies (Recommended for Heavy Usage)

To avoid rate limiting when processing many mixes, we recommend using rotating proxies. For example, Webshare.io:

Sign up at Webshare.io
Get your proxy credentials from the dashboard
Set environment variables:

export PROXY_USER_NAME="your-username-rotate"
export PROXY_PASSWORD="your-password"
export PROXY_HOST="p.webshare.io"
export PROXY_PORT="80"

Or add them to your .env file:

PROXY_USER_NAME=your-username-rotate
PROXY_PASSWORD=your-password
PROXY_HOST=p.webshare.io
PROXY_PORT=80

The rotating proxy will automatically change IPs for each request, helping avoid detection and rate limits.

Processing Stage Settings

PROCESSING_STAGE = "full"  # Options: "full", "recognition", "postprocessing"

full: Complete pipeline (download → segment → recognize → postprocess)
recognition: Skip download if audio exists (segment → recognize → postprocess)
postprocessing: Only reprocess existing raw results with new settings

Confidence Scoring

MIN_DETECTIONS_FOR_PROBABLE = 2  # Minimum detections to mark track as "probable"
MAX_TIME_GAP_FOR_SAME_TRACK = 600  # Max seconds between detections for same track

Tracks detected multiple times across the mix are merged based on these settings. Adjusting thresholds helps filter out false positives.

Environment Variables

Optional Discogs integration for enhanced metadata:

export DISCOGS_TOKEN="your-discogs-api-token"
export DISCOGS_USER_AGENT="TrackID/1.0"

Usage

Command Line

Process a mix directly:

# Edit trackid/main.py to set your SoundCloud URL
poetry run python -m trackid.main

Web Interface

Launch the Streamlit app:

make serve_local

Then open your browser to http://localhost:8501.

Features:

📚 Browse Saved Mixes: View and reload previously processed mixes
🎵 Process New Mixes: Enter SoundCloud URL and click "Process Mix"
⚙️ Configurable Settings: Adjust all processing parameters via sidebar
🎯 Confidence Filtering: Toggle probable/uncertain tracks with dynamic thresholds
⏰ Time Navigation: Click timestamps to jump to specific tracks in player
🔄 Reprocess: Re-run with different settings without re-downloading
🎬 YouTube Preview: Search and play tracks directly in the interface

Output Files

Processed results are stored in data/outputs/track_list/:

data/outputs/track_list/
└── Mix Name/
    ├── tracklist_raw.json      # Raw Shazam API responses
    └── tracklist_processed.json # Cleaned, merged, and sorted tracklist

Each tracklist includes:

Track title and artist
Album and release year
Start and end timestamps
Shazam URL
Cover artwork URL
Streaming links (Spotify, Apple Music, etc.)
Discogs search URL (if configured)

Project Structure

trackid/
├── trackid/                 # Core package
│   ├── main.py             # Entry point
│   ├── core.py             # Main processing logic
│   ├── downloader.py       # Audio downloading
│   ├── splitter.py         # Audio segmentation
│   ├── recognizer.py       # Track recognition
│   ├── postprocess.py      # Tracklist processing
│   ├── metadata_manager.py # Mix metadata CRUD
│   ├── schemas.py          # Data models
│   └── settings.py         # Configuration
├── frontend/               # Streamlit web interface
│   ├── streamlit_app.py   # Main UI
│   ├── ui_components.py   # Reusable UI components
│   └── settings.py        # Frontend config
├── data/
│   ├── inputs/            # Downloaded audio
│   └── outputs/           # Generated tracklists & metadata
└── Makefile               # Development commands

Architecture

Processing Pipeline

URL Input → Download Audio → Segment → Recognize → Postprocess → Save
                 ↓              ↓          ↓           ↓
              (cached)     (temp files) (Shazam)  (merge/enrich)

Key Design Decisions

Modular Processing Stages: Allows reprocessing with new settings without re-downloading or re-recognizing tracks.

Metadata Management: All processed mixes tracked in data/outputs/metadata.json for browsing and reloading.

Dynamic Confidence Scoring: Track confidence recalculated in real-time based on UI settings without reprocessing.

Memory Efficiency: Streamlit app uses caching (@st.cache_resource) and lazy loading to minimize memory footprint.

Error Resilience: Comprehensive error handling with backup/restore for metadata writes.

Development

Available Commands

make serve_local   # Run Streamlit app
make run           # Run main script
make install_dev   # Set up development environment
make fix_all       # Auto-fix linting issues
make check_ruff    # Check code with Ruff linter
make check_black   # Check code formatting
make nuke_venv     # Delete virtual environment

Troubleshooting

Rate Limiting: Enable proxy usage (USE_PROXY = True) or reduce SEMAPHORE_LIMIT and BATCH_SIZE

Missing Tracks: Try adjusting SEGMENT_LENGTH (shorter = more API calls but better accuracy) or use "recognition" stage to retry with different settings

Audio Errors: MP3 decoder warnings (mpa: invalid main_data_begin) are harmless and automatically suppressed

Name		Name	Last commit message	Last commit date
Latest commit History 29 Commits
.vscode		.vscode
assets/images		assets/images
frontend		frontend
script		script
trackid		trackid
.DS_Store		.DS_Store
.gitignore		.gitignore
.pre-commit-config.yaml		.pre-commit-config.yaml
Makefile		Makefile
README.MD		README.MD
poetry.lock		poetry.lock
poetry.toml		poetry.toml
pyproject.toml		pyproject.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

TrackID

Features

Technologies

Getting Started

Prerequisites

Installation

Configuration

Settings File

Audio Processing Settings

Recognition Settings

Proxy Configuration

Processing Stage Settings

Confidence Scoring

Environment Variables

Usage

Command Line

Web Interface

Output Files

Project Structure

Architecture

Processing Pipeline

Key Design Decisions

Development

Available Commands

Troubleshooting

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

TrackID

Features

Technologies

Getting Started

Prerequisites

Installation

Configuration

Settings File

Audio Processing Settings

Recognition Settings

Proxy Configuration

Processing Stage Settings

Confidence Scoring

Environment Variables

Usage

Command Line

Web Interface

Output Files

Project Structure

Architecture

Processing Pipeline

Key Design Decisions

Development

Available Commands

Troubleshooting

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages