Research Projects

Interactive Demos & Experiments

Interactive Demos

RookWorld-LM Reasoning & World Model Demo

Experience transparent reasoning with ROOK-LM and RookWorld-LM. Watch the models think step-by-step through chess positions with streaming chain-of-thought visualization.

Key Achievements:

  • 🏆 32.1% Checkmate-in-One - outperforms ChessGPT-Base (26.5%) with 24x fewer parameters
  • ChessGPT: 3B params, NeurIPS'23 dataset award (Feng et al.)
  • 99.9% environment simulation accuracy
  • Self-play without external engines

ROOK-CLF-9M Multi-Component Analysis Demo

Comprehensive evaluation platform for the 9M parameter chess language model featuring interactive analysis, professional benchmarking, and attention visualization. Explore strategic reasoning capabilities through three specialized interfaces.

Features:

  • Selfplay: Interactive chess board with real-time move analysis and autoplay
  • Benchmark: Professional evaluation against research datasets (ChessBench, BIG-bench, Lichess)
  • Interpretability (WIP 🚧): Attention rollout heatmaps and early logit lens visualization
  • Performance: WebGPU acceleration with IndexedDB model caching

Research Benchmarks:

  • 49% action accuracy (ChessBench)
  • 57% checkmate-in-one accuracy (BIG-bench)
  • Real-time evaluation with performance metrics
  • Authentic research methodologies and datasets

Publications

ROOK: Reasoning Over Organized Knowledge

LAION research note detailing the development of language models for strategic reasoning through chess, including architectural innovations and training methodologies.

Datasets & Models

Datasets

Models