The aim of this project is to investigate the scaling properties of language models when they extrapolate beyond their training data. We'll do this by training a series of language models on chess PGN data with an Elo cap, then attempting to surpass that cap in two ways: (1) generalising over the Elo value used to condition the model, and (2) using MCTS to improve the resulting model's play. The result should be a compute/Elo scaling plot with the pretraining dataset's Elo cap somewhere in the middle.
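A minimal sketch of the data-preparation idea described above: games above the Elo cap are filtered out of pretraining, and each remaining game is prefixed with an Elo conditioning token, so that at sampling time the model can be conditioned on an out-of-distribution Elo value. All names here (`make_training_example`, `ELO_CAP`, the `[Elo N]` tag format) are illustrative assumptions, not taken from the repo.

```python
# Hypothetical sketch of Elo-capped, Elo-conditioned training examples.
# The tag format and helper name are assumptions, not the repo's actual code.

from typing import Optional

ELO_CAP = 1800  # assumed pretraining cap; stronger games are filtered out


def make_training_example(white_elo: int, black_elo: int, movetext: str) -> Optional[str]:
    """Prefix movetext with an Elo conditioning token, or drop capped games.

    Returns None for games above the cap, so the model never sees play
    stronger than ELO_CAP during pretraining.
    """
    game_elo = max(white_elo, black_elo)
    if game_elo > ELO_CAP:
        return None
    return f"[Elo {game_elo}] {movetext}"


# At sampling time, conditioning on an out-of-distribution value such as
# "[Elo 2500]" tests whether the model generalises beyond its training cap.
example = make_training_example(1650, 1702, "1. e4 e5 2. Nf3 Nc6")
```

The conditioning token lets a single model represent many skill levels, which is what makes the extrapolation experiment possible without retraining.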
tomMcGrath/gambit