Cutechess Framework from the author of Maverick chess engine http://immortalchess.net/forum/showthread.php?t=21629
Цитата:
Chess Engine Testing Framework By Steve Maughan on Computer Chess, Testing, Tools ![]() Since Maverick is now playing chess I really need a testing framework. Testing has always been important but I think Fabian Letouzey really “turned some heads” when he started making rapid progress with Fruit back in 2004. His secret turned out to be making decisions about engine parameters and potential improvements based on cold hard facts i.e. testing. Since then we’ve seen much more discussion about testing. Then came Rybka and Houdini who championed, what I’d call, turbo-testing. This is where the games are super short – something like 60 moves in 20 seconds. Ten years ago I think this would have been frowned upon as too fast and the results will likely not scale to longer time controls. As it turns out this fear seems to be incorrect. Turbo-testing seems to be the best way to quickly test potential improvements. When developing Monarch my testing really wasn’t systematic. I’d run games of varying lengths. Maybe 50 games or at most 100 games. In hindsight Monarch really wasn’t tested well. I’m determined not to do that with Maverick. Thankfully there are also some great tools to help with the testing. The tool which seems to be regarded as the best is CuteChess-Cli. Another one is LittleBlitzer. Both enable you to run multiple games in parallel and with minimal screen updates. I decided to plump for CuteChess-Cli as my tool of choice. Here’s how I went about setting up the testing framework:
As you can see this, runs one match of 500 games between Maverick and Predateur. Four games are played concurrently on the same machine. All the games are stored in a PGN which contains today’s date (you may need to adjust the code at the top of the bat file for different date formats – mine is US). For editing these types of files I’d suggest NotePad++. When installed you simply right click on a .bat file and you have the option to edit it using NotePad++. It also color code the formatting – very useful. Since it may be useful to some of you, here’s my whole testing framework zipped up and ready for download. |