david-carteau
diff --git a/‎LICENSE‎
Lines changed: 2 additions & 4 deletions b/‎LICENSE‎
Lines changed: 2 additions & 4 deletions
diff --git a/‎README.md‎
Lines changed: 51 additions & 52 deletions b/‎README.md‎
Lines changed: 51 additions & 52 deletions
diff --git a/‎v2.0/1. data preparation (optional)/bonus/remove_checks.py‎
Lines changed: 58 additions & 0 deletions b/‎v2.0/1. data preparation (optional)/bonus/remove_checks.py‎
Lines changed: 58 additions & 0 deletions
@@ -1,7 +1,5 @@
-MIT License
-
-The Cerebrum library and engine
-Copyright (c) 2025, by David Carteau. All rights reserved.
+The Cerebrum library
+Copyright (c) 2020-2025, by David Carteau. All rights reserved.
 
 Permission is hereby granted, free of charge, to any person obtaining a copy
 of this software and associated documentation files (the "Software"), to deal
 
@@ -1,14 +1,25 @@
-## The Cerebrum library and engine
+## The Cerebrum library
 
-![Logo](/v1.1/logo.png)
+![Logo](/v2.0/logo.png)
 
-The **Cerebrum library** can be used to easily train a **first** "[NNUE](https://www.chessprogramming.org/NNUE)-like" neural network for a chess engine. It was originally designed and built for the [Orion UCI chess engine](https://www.orionchess.com/).
+The **Cerebrum library** can be used to easily train a "[NNUE](https://www.chessprogramming.org/NNUE)-like" neural network for a chess engine. It was originally designed and built for the [Orion UCI chess engine](https://www.orionchess.com/).
 
-Its originality lies in using only game results, parsed from pgn files provided by the user, and material values, computed on the fly, as targets for prediction. Both predicted values (game result and material) can be used for board evaluation.
+It is composed of a few Python scripts for data preparation (optional), one Python script for **training**, and C code for **inference**.
 
-Inference code is provided for embedding and using the trained network in a C/C++ or Python project, in two alternatives: standard (for accuracy) or quantized (for speed).
+Default network architecture is perspective-based with one hidden layer. Network weights are quantised to maximise inference speed.
 
-Do not hesitate to adapt the library to your own needs, and/or to use newer/better NNUE libraries for more flexibility/performance (e.g. [Bullet](https://github.com/jw1912/bullet/tree/main))!
+Code is also provided to train a **first** network using only game results, parsed from PGN files provided by the user, and material values, computed on the fly (optional).
+
+Feel free to adapt the library to your own needs and/or use newer/better NNUE libraries for greater flexibility and performance (e.g. [Bullet](https://github.com/jw1912/bullet/tree/main))!
+
+<br/>
+
+## Changes in 2.0
+
+- **Change in network outputs**: networks now directly predict scores in centipawns → _breaking change!_
+- **Tiny change to the data format** for data preparation → _breaking change!_
+- **Lower disk usage** for data preparation
+- **New default architecture**: `2x(768→256)→32→1` (1 hidden layer)
 
 <br/>
 
@@ -23,7 +34,7 @@ Do not hesitate to adapt the library to your own needs, and/or to use newer/bett
 
 ## Changes in 1.0
 
-- Training now relies on game results (from which a win ratio is deduced for each position during a game) and material only !
+- Training now relies on game results (from which a win ratio is deduced for each position during a game) and material only!
 - Data preparation scripts are provided to automate the preparation of training data (using one or several pgn files)
 - Network quantization is performed at the end of each training epoch, allowing the choice between better accuracy or increased inference speed
 - A basic UCI chess engine is provided in two versions (standard or quantized) to demonstrate how to load and use the network
@@ -33,44 +44,42 @@ Do not hesitate to adapt the library to your own needs, and/or to use newer/bett
 
 ## Content and prerequisites (Windows)
 
-The library consists of four main parts:
+To use the library, you will first need to:
 
-1. Data preparation code (Python scripts)
-2. Training code (Python script)
-3. Inference code (C files)
-4. A basic UCI chess engine for demonstration purposes (Python script)
+- Download the `v2.0` folder of this repository
+- Install a Python runtime: https://www.python.org/
+- Install some Python librairies: `pip install tqdm chess`
+- Install PyTorch librairy: `pip install torch` or, if you have an NVIDIA GPU, `pip install torch --index-url https://download.pytorch.org/whl/cu128`
 
 <br/>
 
-To use the library, you will first need to:
+Optionally, if you want to train a **first** network from PGN files:
 
-- Download the `v1.1` folder of this repository
-- Install a Python runtime: https://www.python.org/
-- Install some Python librairies: `pip install torch tqdm chess`
-- Download the [pgn-extract](https://www.cs.kent.ac.uk/people/staff/djb/pgn-extract/) tool and put the `pgn-extract.exe` file in the folder `./1. data preparation/`
+- Download the [pgn-extract](https://www.cs.kent.ac.uk/people/staff/djb/pgn-extract/) tool and put the `pgn-extract.exe` file in the folder `./1. data preparation (optional)/`
 
 <br/>
 
-Optionally (for better results):
-
-- Download the [3, 4, 5 pieces](http://tablebase.sesse.net/syzygy/3-4-5/) endgame Syzygy tablebases and put them in the folder `./1. data preparation/syzygy/3-4-5/`
-- Download the [6 pieces](http://tablebase.sesse.net/syzygy/6-WDL/) endgame Syzygy tablebases and put them in the folder `./1. data preparation/syzygy/6-pieces/`
+## Usage (Windows)
 
-Optionally (for faster training, if you have an NVIDIA GPU):
+### Data preparation (standard)
 
-- `pip install torch --index-url https://download.pytorch.org/whl/cu124`
+Prepare a file containing positions and evaluations. Each line of the file must contain a fenstring followed by its evaluation (in pawns), separated with a comma, e.g.:
 
-<br/>
+Example:
+- _r5k1/5pp1/pR2p3/1p1rP3/7P/R3P3/P6P/6K1 w - -,-4.5000_
+- _6k1/ppp5/8/3P1p2/PP1b4/5pPp/5P1K/8 b - -,6.5000_
+- _3r3k/p4pp1/4p2p/2pRq3/8/PP2P2P/2Q2PP1/2R3K1 b - -,-2.5000_
+- _1r4k1/q2pbp1p/4n1p1/p1pQP2P/Rr1nB1N1/4B1P1/5PK1/2R5 w - -,3.5000_
 
-## Usage (Windows)
+Copy the `positions-shuffled.txt` file to the folder `./2. training/positions/`.
 
-### Data preparation
+<br/>
 
-Prepare one or several pgn files containing full games and put it/them in the folder `./1. data preparation/pgn/`.
+### Data preparation (alternative)
 
-Then launch the script `prepare.bat` in the folder `./1. data preparation/` to obtain a file named `positions-shuffled.txt` which will be stored in the same folder.
+Prepare one or several pgn files containing full games and put it/them in the folder `./1. data preparation (optional)/pgn/`.
 
-This script will parse games and compute the average win ratio for each encountered position in all the games. It will also add some other statistical information (popcount, number of occurences of each position).
+Then launch the script `prepare.bat` in the folder `./1. data preparation (optional)/` to obtain a file named `positions-shuffled.txt`, which will be stored in the same folder.
 
 Copy the `positions-shuffled.txt` file to the folder `./2. training/positions/`.
 
@@ -81,9 +90,9 @@ Copy the `positions-shuffled.txt` file to the folder `./2. training/positions/`.
 You can configure the network architecture by modifying the script `train.py` in the folder `./2. training/`.
 
 Supported architectures are:
-- `2x(768→A)→2` (no hidden layer)
-- `2x(768→A)→B→2` (one hidden layer)
-- `2x(768→A)→B→C→2` (two hidden layers)
+- `2x(768→A)→1` (no hidden layer)
+- `2x(768→A)→B→1` (one hidden layer)
+- `2x(768→A)→B→C→1` (two hidden layers)
 
 _(where A, B and C are mutliples of 32, e.g. `2x(768→128)→32→2` for `A=128` and `B=32`)_
 
@@ -101,47 +110,37 @@ This script will parse the `positions-shuffled.txt` file in the folder `./2. tra
 
 Trained networks will be located in the folder `./2. training/networks/`. One network will be saved at the end of each training epoch.
 
-By default:
-
-- `epoch-11.txt` will be the last standard network (i.e. full precision: weights and biases are stored as `float`)
-- `epoch-11-q.txt` will be the last quantized network (i.e. less precision, but high inference speed: weights and biases are stored as `int8`)
+By default, `epoch-11-q.txt` will be the last quantized network.
 
 <br/>
 
 ### How to use trained networks
 
-These networks can now be used in your own engine, using your own code, or:
-
-- using the provided inference C code in `./3. inference/1. standard/` or `./3. inference/2. quantized/` folders
-- using the provided inference Python code located in the `./4. engine/1. standard/` or `./4. engine/2. quantized/` folders
-
-<br/>
-
-In order to use your own trained network with the provided Cerebrum UCI chess engine:
-
-- Copy the `epoch-11.txt` (resp. the `epoch-11-q.txt`) file in the folder `4. engine/1. standard/` (resp. `4. engine/2. quantized/`)
-- Rename it to `network.txt`
-- Launch the engine
+Trained networks can now be used in your own engine, using your own code, or using the provided inference C code, provided in the `./3. inference/` folder.
 
 <br/>
 
 ## How to configure name and author
 
 You can adjust the name and author of the trained networks:
 
-- Before training, by modifying the `NN_NAME` (default = "Cerebrum 1.1") and `NN_AUTHOR` (default = "David Carteau") variables in the script `train.py` located in the folder `./2. training/`
-- After training, by modifying the first two lines of the generated networks (default = "name=Cerebrum 1.1" and "author=David Carteau")
+- Before training, by modifying the `NN_NAME` (default = "Cerebrum 2.0") and `NN_AUTHOR` (default = "David Carteau") variables in the script `train.py` located in the folder `./2. training/`
+- After training, by modifying the first two lines of the generated networks (default = "name=Cerebrum 2.0" and "author=David Carteau")
+
+<br/>
+
+You can adjust more parameters: open and inspect the provided Python scripts!
 
 <br/>
 
 ## Contribute
 
-If you want to help me improve the library, do not hesitate to contact me via the [talkchess.com](https://www.talkchess.com) forum !
+If you want to help me improve the library, do not hesitate to contact me via the [talkchess.com](https://www.talkchess.com) forum!
 
 <br/>
 
 ## Copyright, license
 
 Copyright 2025 by David Carteau. All rights reserved.
 
-The Cerebrum library is licensed under the **MIT License** (see "LICENSE" and "/v1.1/license.txt" files).
+The Cerebrum library is licensed under the **MIT License** (see "LICENSE" and "/v2.0/license.txt" files).
@@ -0,0 +1,58 @@
+"""
+The Cerebrum library
+Copyright (c) 2020-2025, by David Carteau. All rights reserved.
+
+Permission is hereby granted, free of charge, to any person obtaining a copy
+of this software and associated documentation files (the "Software"), to deal
+in the Software without restriction, including without limitation the rights
+to use, copy, modify, merge, publish, distribute, sublicense, and/or sell
+copies of the Software, and to permit persons to whom the Software is
+furnished to do so, subject to the following conditions:
+
+The above copyright notice and this permission notice shall be included in all
+copies or substantial portions of the Software.
+
+THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
+IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
+FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
+AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
+LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
+OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE
+SOFTWARE.
+"""
+
+##############################################################################
+## NAME: remove_checks.py                                                   ##
+## AUTHOR: David Carteau, France, November 2025                             ##
+## LICENSE: MIT (see above and "license.txt" file content)                  ##
+##############################################################################
+
+##############################################################################
+## PURPOSE:                                                                 ##
+## Remove positions positions in check (optional)                           ##
+##############################################################################
+
+import tqdm
+import chess
+
+
+def main():
+    with open('./positions-shuffled.txt', 'rt') as i_file:
+        with open('./positions-shuffled-without-checks.txt', 'wt') as o_file:
+            for line in tqdm.tqdm(i_file, unit_scale=True):
+                fen, stm, pop, cnt, evl = line.split()
+                
+                board = chess.Board(f'{fen} {stm} - -')
+                
+                if not board.is_check():
+                    o_file.write(line)
+                #end if
+            #end for
+        #end with
+    #end with
+#end def
+
+
+if __name__ == "__main__":
+    main()
+#end if