Sudoku-RWKV

A specialized RWKV model for solving Sudoku puzzles.

Requirements

rwkv
tkinter
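
Before running the scripts, it can help to confirm that both modules are importable. The snippet below is a minimal sketch and not part of the repository; the helper name `missing_modules` is hypothetical. It uses only `importlib.util.find_spec` from the standard library, which locates a module without importing it, so a module that is found may still fail at import time.

```python
from importlib.util import find_spec


def missing_modules(names):
    """Return the subset of `names` that cannot be found on the import path."""
    return [name for name in names if find_spec(name) is None]


if __name__ == "__main__":
    # The two requirements listed above.
    missing = missing_modules(["rwkv", "tkinter"])
    if missing:
        print("Missing modules:", ", ".join(missing))
    else:
        print("All requirements are importable.")
```

`rwkv` is usually installed with `pip install rwkv`. `tkinter` ships with most Python distributions, but some Linux distributions package it separately.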

Quick Start

Run demo.py or minimum_inference.py to solve Sudoku puzzles.
Run generate_sudoku_data.py to generate training data.

Model

The current checkpoint, sudoku_rwkv_20241120.pth, is an RWKV-v6 model trained specifically to solve Sudoku puzzles, using 2M Sudoku samples (~39.2B tokens).
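
As a quick back-of-the-envelope check, the two training figures quoted above imply an average sample length. The variable names are illustrative, and the inputs are the rounded values from this README, so the result is approximate.

```python
# Figures quoted above (approximate, as stated in the README).
num_samples = 2_000_000          # 2M Sudoku samples
total_tokens = 39_200_000_000    # ~39.2B tokens

# Average length of one training sample, in tokens.
tokens_per_sample = total_tokens / num_samples
print(f"{tokens_per_sample:,.0f} tokens per sample on average")
# prints "19,600 tokens per sample on average"
```

A bare puzzle plus its solution is only 162 cells, so an average near 19,600 tokens suggests that each sample encodes far more than the two grids, presumably a step-by-step solving trace.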

Model specifications:

Parameters: ~12.7M
Vocabulary size: 133
Architecture: 8 layers, 320 dimensions

The model includes a simple improvement for better performance (see model.py line 372). Corresponding modifications were made in the inference code (rwkv_model.py lines 852, 893-896).

Training

The model was trained using the RWKV-LM repository.

Hyperparameters:

M_BSZ: 48
CTX_LEN: 8192
LR: 12e-4 to 3e-5
ADAM_EPS: 1e-18
ADAM_BETA1: 0.9
ADAM_BETA2: 0.95
WEIGHT_DECAY: 0.1
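
The LR entry gives only the start and end values, not the shape of the schedule. The sketch below assumes exponential (log-linear) decay between the two endpoints. That shape is an assumption made for illustration, not something this README confirms, and the function name `lr_at` is hypothetical.

```python
import math

LR_INIT = 12e-4   # starting learning rate, from the list above
LR_FINAL = 3e-5   # final learning rate, from the list above


def lr_at(progress, lr_init=LR_INIT, lr_final=LR_FINAL):
    """Learning rate at `progress` in [0, 1].

    ASSUMPTION: exponential decay from lr_init to lr_final; the actual
    schedule used for this model is not stated in the README.
    """
    if not 0.0 <= progress <= 1.0:
        raise ValueError("progress must be in [0, 1]")
    return lr_init * math.exp(math.log(lr_final / lr_init) * progress)


if __name__ == "__main__":
    for p in (0.0, 0.25, 0.5, 0.75, 1.0):
        print(f"progress={p:.2f}  lr={lr_at(p):.2e}")
```

Under this assumption the rate falls by the same factor over every equal slice of training, so it spends most of the run well below the starting value.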

Loss Curve:

Experiments

The results below are from an older version of the model. The current model appears to solve any solvable Sudoku; if you find a case where it fails, please let me know.

I tested the model on samples of varying difficulty levels, with results shown below:

Note: Difficulty is measured by the number of empty cells in the Sudoku puzzle
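
To make the difficulty measure concrete, and to help confirm a suspected failure before reporting it, here is a minimal sketch that is independent of the repository code. It assumes a grid is a 9x9 list of lists of integers with 0 marking an empty cell. That representation and the names `count_empty` and `is_valid_solution` are hypothetical; the model's own input format is defined by the repository scripts.

```python
def count_empty(grid):
    """Difficulty as defined above: the number of empty cells (0 = empty)."""
    return sum(cell == 0 for row in grid for cell in row)


def is_valid_solution(puzzle, solution):
    """True if `solution` is a complete, rule-abiding fill of `puzzle`."""
    # Reject anything that is not a 9x9 grid.
    if len(solution) != 9 or any(len(row) != 9 for row in solution):
        return False
    digits = set(range(1, 10))
    rows = [set(row) for row in solution]
    cols = [set(col) for col in zip(*solution)]
    boxes = [
        {solution[r][c] for r in range(br, br + 3) for c in range(bc, bc + 3)}
        for br in (0, 3, 6)
        for bc in (0, 3, 6)
    ]
    # Every row, column, and 3x3 box must contain each digit 1-9 exactly once.
    if any(unit != digits for unit in rows + cols + boxes):
        return False
    # Every given clue in the puzzle must be preserved in the solution.
    return all(
        p == 0 or p == s
        for prow, srow in zip(puzzle, solution)
        for p, s in zip(prow, srow)
    )
```

A puzzle with more empty cells scores as harder under this measure, and any model output for which `is_valid_solution` returns False on a solvable puzzle would be a failed case worth reporting.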
