Skip to content

sorukumar/orange-dev-data

Folders and files

NameName
Last commit message
Last commit date

Latest commit

ย 

History

27 Commits
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 

Repository files navigation

๐ŸŸง Orange Dev Data Hub

The open-source Data Hub for the Bitcoin development ecosystem. This repository provides unified ingestion and forensic analytics for Bitcoin Core (git-logs), BIPs, Mailing Lists, and Delving Bitcoin research.


๐Ÿ Mandatory Infrastructure

To ensure data integrity and full support for the analytical pipeline, all scripts MUST be run using the Anaconda environment:

# Core execution path for AI and Humans:
/opt/anaconda3/bin/python3 scripts/rebuild_daily.py

๐Ÿ—๏ธ The Numbered Chain Pipeline

The scripts are organized by functional stage to maintain a clean Sources โ†’ Raw โ†’ Enriched โ†’ Output lifecycle:

  1. 01_ingest/: Raw extraction from Git mirrors and Discourse APIs.
  2. 02_process/: Identity resolution, social merging, and technical categorization.
  3. 03_analyze/: Global PageRank influence and expertise fingerprinting.
  4. 04_deliver/: Final public artifact generation for the UI dashboards.

๐Ÿš€ Getting Started

Daily Update (Fast)

/opt/anaconda3/bin/python3 scripts/rebuild_daily.py

Monthly Rebuild (Deep NLP & Graphs)

/opt/anaconda3/bin/python3 scripts/rebuild_monthly.py

๐Ÿ“‚ Documentation

For detailed architectural maps, reference the /docs folder:

About

The open-source Data Hub for the Bitcoin development ecosystem. ๐ŸŸง Unified ingestion and forensic analytics for Bitcoin Core (git-logs), BIPs, Mailing Lists, and Delving Bitcoin research. Providing structured, high-fidelity datasets (Parquet/JSON) for contributor tracking, protocol maturation, and technical governance metrics.

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages