ROOK-CLF-9m — Interactive Chess Analysis

Autoplay

Current FEN: rnbqkbnr/pppppppp/8/8/8/8/PPPPPPPP/RNBQKBNR w KQkq - 0 1

Analysis Results

Top Predicted Moves (click to play)

Loading model...

Model Information

Architecture: LlamaForSequenceClassification
Parameters: 9M
Classes: 1968 chess moves
Input: Customized FEN notation
Runtime: ONNX WebAssembly

Research Benchmark Evaluation

Evaluate ROOK-CLF-9M performance across established research benchmarks using authentic datasets and methodologies

Benchmark:

Accuracy Over Time

Performance

0.0%

Throughput

0.0

Progress

0 / 0

ETA

--:--

Overall Progress

Recent Results

Start a benchmark to see results

ChessBench Puzzles Benchmark

Target: 49% 1,000 positions

Original Source

Tactical puzzle solve rate evaluation from "Grandmaster-level chess without search" paper. Measures model's ability to find correct moves in tactical sequences, not isolated best-move accuracy.

📄 Paper (arXiv:2402.04494) 💻 Code Repository 📊 Original Dataset

Evaluation Methodology

Original Research

CSV Format:

PuzzleId,FEN,Moves,Solution
00MTG,4r1k1/2p1q...,h4f2 f1f2 e2f2,Bf2+ Rxf2 ...

Evaluation: Puzzle solve rate: model tested on every other move in tactical sequences (when model's turn to play)

ROOK-CLF Adaptation

JSON Format:

{"fen": "4r1k1/2p1q...", "correct_move": "f1f2"}

Evaluation: Position-based tactical move prediction at model decision points, evaluating puzzle-solving capability

Citation

Ruoss et al. 2024. Grandmaster-level chess without search. arXiv:2402.04494

Attention Rollout

Compute square relevance from exported attentions and explore how focus evolves across layers.

FEN

Layer all

Attention Rollout

Residual‑corrected attention aggregated across layers. Shows token‑to‑decision relevance per layer. Use the slider to scrub layers.

Early Logit Lens

For each layer’s decision-token state, projects with the final classifier to show layer-wise move beliefs. Hover a move to highlight its squares.

Awaiting run…

Model Diagram & Heads

Interactive overview of layers and heads. Click a head to view that head’s attention at the selected layer; click “Mean over heads” to return.

Top‑k Attention Paths

Strongest paths to the decision token. Opacity scales with path strength.

Occlusion Heatmap

Forward‑only masking. Shows Δ for the selected move; click a move in the Logit Lens.

Enter a FEN and click Run. Use the slider to explore layers. Hover moves in the Logit Lens to see overlays.

ROOK-CLF-9M Chess Analysis

Loading ROOK-CLF-9M Model

Analysis Results

Top Predicted Moves (click to play)

Model Information

Research Benchmark Evaluation

Accuracy Over Time

Overall Progress

Recent Results

ChessBench Puzzles Benchmark

Original Source

Evaluation Methodology

Original Research

ROOK-CLF Adaptation

Citation

Attention Rollout

Attention Rollout

Early Logit Lens

Model Diagram & Heads

Top‑k Attention Paths

Occlusion Heatmap

Loading ROOK-CLF-9M Model

Analysis Results

Top Predicted Moves (click to play)

Model Information

FEN Tokenization Process

Motivation

Processing Steps:

Example Transformation:

Benefits of Fixed-Length Encoding:

Token Details:

Benchmark Complete

Research Benchmark Evaluation

Accuracy Over Time

Overall Progress

Recent Results

ChessBench Puzzles Benchmark

Original Source

Evaluation Methodology

Original Research

ROOK-CLF Adaptation

Citation

Attention Rollout

Attention Rollout

Early Logit Lens

Model Diagram & Heads

Top‑k Attention Paths

Occlusion Heatmap