mmnn

This project creates NCAA Division I college basketball tournament bracket predictions (men's or women's) using a neural network.

Inspiration

The starting point is a student research paper in research/—Comparing Various Machine Learning Statistical Methods Using Vari—that compares several machine learning and classical statistical approaches to predicting NCAA Division I basketball tournament outcomes from team-level statistics. The paper frames the problem as a classification task (e.g. which side of a matchup wins) and evaluates how different estimators behave on that data.

mmnn turns that idea into a small, usable codebase: it fetches and normalizes tournament data, builds the same style of stat-based features, trains a neural network as one such model, and adds commands to score a full bracket (holdout year) or a single head-to-head matchup. Men’s and women’s tournaments are both supported.

Tournament data for 2010–2026 (men's) is included under data/men/. To add more years, use mmnn data fetch. Use -w / --women on mmnn data … and mmnn nn … to use women's data under data/women/ (Sports Reference URLs use women instead of men in the path).

Usage

Basic workflow (men's, default): Processed *-data.csv files are already in data/men/ and data/women/, so you can train immediately. To add or refresh a year, run mmnn data fetch <year> and mmnn data process <year> first.

mmnn nn train
mmnn nn bracket 2025
mmnn nn predict Duke Siena

Women's tournament — add -w or --women to each command below. Bracket evaluation needs at least two processed years total (e.g. another year already in data/women/ plus the year you evaluate):

mmnn data fetch 2025 --women
mmnn data process 2025 --women
mmnn nn train --women
mmnn nn bracket 2025 --women
mmnn nn predict "South Carolina" UConn --women

Fetch data

mmnn data fetch <year>
mmnn data fetch <year> --women

Fetch the raw bracket and team stats for the given year from Sports Reference.

Process data

Process raw data for a given year into the format needed by the neural network. Reads data/men|women/YEAR-teams.csv and YEAR-games.csv, then writes YEAR-data.csv with per-game delta features and a Winner label.

mmnn data process <year>
mmnn data process <year> --women

Development (with Hatch):

hatch run mmnn data process 2025

Train the neural network

Train the model on all *-data.csv files in data/men/ or data/women/ (90% train / 10% test split), then save weights to data/men/model.pt or data/women/model.pt:

mmnn nn train
mmnn nn train --women

Evaluate a full bracket (holdout year)

Retrain the network on every *-data.csv except the bracket year, then predict each game in that year’s tournament and print per-game results plus accuracy, log loss, and related metrics. The model is fit in memory only; it does not read or overwrite data/men/model.pt or data/women/model.pt.

You need at least one other processed year besides the bracket year (mmnn data process <year>), and that year’s {year}-games.csv and {year}-teams.csv must exist.

mmnn nn bracket 2025
mmnn nn bracket 2025 --women

Optional --epochs sets the training epoch count (same default as mmnn nn train). Useful for quicker runs while iterating:

mmnn nn bracket 2025 --epochs 50

Predict a single matchup

Predict which team wins (higher- or lower-ranked) given two team names. Team stats are looked up from data/men/2026-teams.csv (or data/women/2026-teams.csv with --women):

mmnn nn predict <team1> <team2>
mmnn nn predict "Ohio State" TCU

Look in the appropriate 2026-teams.csv for the correct team names to use.

Installation

From PyPI (end users):

pip install mmnn

From source (development):

Development uses Hatch for environments and commands—not pip install -e .. Clone the repo and run the CLI or tests through Hatch:

hatch run mmnn data fetch 2024
hatch run mmnn data process 2024
hatch run mmnn nn train
hatch run mmnn nn bracket 2025
hatch run mmnn nn predict Duke UConn
hatch run test:test

test:test runs the test script in the test environment (pytest over tests/; see [tool.hatch.envs.test] in pyproject.toml). Use hatch shell if you want an interactive shell with the project and its dependencies on the path.

License

mmnn is distributed under the terms of the MIT license.

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
data		data
research		research
src/mmnn		src/mmnn
tests		tests
.gitignore		.gitignore
LICENSE.txt		LICENSE.txt
README.md		README.md
pyproject.toml		pyproject.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

mmnn

Inspiration

Usage

Fetch data

Process data

Train the neural network

Evaluate a full bracket (holdout year)

Predict a single matchup

Installation

License

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

mmnn

Inspiration

Usage

Fetch data

Process data

Train the neural network

Evaluate a full bracket (holdout year)

Predict a single matchup

Installation

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages