Cutechess-cli and SPRT

silentshark · Post by **silentshark** » Tue Jan 19, 2021 6:37 pm

Hi all,

Looking for some help re: SPRT. I've been happily using cutechess-cli for automated testing, and it works great.

I'm now trying to use SPRT to terminate matches early and save time. I'm using the following to test a new version of my engine against a very old version (several hundred Elo weaker). I'd have expected the SPRT bit to terminate the match quite early, but it seems to play on and on and not terminate. Here's my command line:

cutechess-cli.exe -tournament round-robin -rounds 30000 -concurrency 10 -engine name=jan2021_13_10 dir=..\wccc99\jan2021_13_10 cmd=jan2021_13_10.exe book=2moves_v2.bin bookdepth=10 proto=xboard -engine name=mad019 dir=..\wccc99\mad019 cmd=mad019.exe book=2moves_v2.bin bookdepth=10 proto=xboard -each tc=60/60 -pgnout sprt1.pgn -resign movecount=3 score=500 -draw movenumber=100 movecount=5 score=10 -maxmoves 200 -wait 100 -recover -sprt elo0=0 elo1=5 alpha=0.05 beta=0.05

I'm probably missing something stupid, so please shout

Running cutechess 1.20 under win10, fyi.

Thanks in advance!

brianr · Post by **brianr** » Tue Jan 19, 2021 7:47 pm

A 5 Elo range is quite small to detect (with alpha and beta at 0.05) without thousands of games, unless there is a huge difference in the strength of the engines.
You mentioned expecting hundreds of Elo, but it might be less.
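To put rough numbers on "thousands of games", here is a back-of-the-envelope sketch in Python. It assumes the standard logistic Elo model and a fixed-length test; the per-game standard deviation `sigma=0.4` is an assumed value (it depends on the draw ratio), and `rough_games_needed` is a hypothetical helper, not anything cutechess-cli provides. Cutechess-cli may also use a different Elo model internally, so treat this as an order-of-magnitude estimate only.

```python
import math


def expected_score(elo_diff: float) -> float:
    """Expected score for a given Elo difference (logistic Elo model)."""
    return 1.0 / (1.0 + 10.0 ** (-elo_diff / 400.0))


def rough_games_needed(elo_diff: float, sigma: float = 0.4,
                       z_alpha: float = 1.645, z_beta: float = 1.645) -> int:
    """Rough fixed-length sample size to tell elo_diff apart from 0 Elo.

    sigma is an ASSUMED per-game standard deviation of the score
    (at most 0.5; lower when there are many draws).
    z_alpha and z_beta are the one-sided 95% normal quantiles.
    elo_diff must be non-zero.
    """
    delta = expected_score(elo_diff) - 0.5
    return math.ceil(((z_alpha + z_beta) * sigma / delta) ** 2)


if __name__ == "__main__":
    for elo in (5, 50, 300):
        print(elo, round(expected_score(elo), 4), rough_games_needed(elo))
```

A 5 Elo edge is an expected score of only about 50.7%, which is why so many games are needed to resolve it.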

Also, the time controls are very long for what is typically used with SPRT.
Draw movenumber 100 seems like more than what many people use.

Depending on how many games are anticipated, the 2 move book seems rather shallow.
SCID can be used to scan for duplicate games.
You can try to reduce the number of draws with a slightly imbalanced book too.

Finally, using syzygy tablebase adjudication will speed things up some also.

Perhaps trying a "fast" sort of test run with much faster time controls would show a difference.
If they are traditional A/B engines, something like 0:10+0.1 would be where I would start.
For timed games, concurrency 10 seems fine with 12+ cores.

Others may have better suggestions; I don't actually use SPRT very much and prefer to use Ordo with cutechess-cli myself.

Michel · Post by **Michel** » Tue Jan 19, 2021 8:52 pm

You should be aware that SPRT is only efficient if the Elo difference is comparable to the bounds. If this is not the case then SPRT is very inefficient (compared to a standard fixed length test).
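To make the quantities involved concrete, here is a minimal Python sketch of the SPRT stopping bounds and a normal-approximation log-likelihood ratio (LLR). This is not cutechess-cli's actual implementation; the logistic Elo conversion, the normal approximation, and the function names `sprt_bounds` and `approx_llr` are all assumptions made for illustration.

```python
import math


def expected_score(elo_diff: float) -> float:
    """Expected score for a given Elo difference (logistic Elo model)."""
    return 1.0 / (1.0 + 10.0 ** (-elo_diff / 400.0))


def sprt_bounds(alpha: float, beta: float) -> tuple[float, float]:
    """Wald's stopping bounds: accept H0 below lower, accept H1 above upper."""
    lower = math.log(beta / (1.0 - alpha))
    upper = math.log((1.0 - beta) / alpha)
    return lower, upper


def approx_llr(wins: int, draws: int, losses: int,
               elo0: float, elo1: float) -> float:
    """Normal-approximation LLR of H1 (elo1) against H0 (elo0).

    ASSUMPTION: the mean score is treated as normally distributed with
    the per-game variance estimated from the results so far.
    """
    n = wins + draws + losses
    if n == 0:
        return 0.0
    mean = (wins + 0.5 * draws) / n
    var = (wins + 0.25 * draws) / n - mean ** 2
    if var <= 0.0:
        # All results identical: the estimate is undefined in this
        # approximation, so this sketch simply reports 0.0.
        return 0.0
    s0, s1 = expected_score(elo0), expected_score(elo1)
    return n * (s1 - s0) * (2.0 * mean - s0 - s1) / (2.0 * var)


if __name__ == "__main__":
    print(sprt_bounds(0.05, 0.05))
    print(approx_llr(60, 30, 10, elo0=0, elo1=5))
```

With elo0=0 and elo1=5 the factor `s1 - s0` is tiny, so in this approximation each game moves the LLR only a little, even when the observed score is far above both hypotheses.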

silentshark · Post by **silentshark** » Tue Jan 19, 2021 10:07 pm

Michel wrote: ↑Tue Jan 19, 2021 8:52 pm You should be aware that SPRT is only efficient if the Elo difference is comparable to the bounds. If this is not the case then SPRT is very inefficient (compared to a standard fixed length test).

Interesting... why would that be? So the parameters I'm using would be more efficient if there were only a small difference in Elo?

Ferdy · Post by **Ferdy** » Tue Jan 19, 2021 10:15 pm

Add this to your command line to see the progress of the LLR.

-ratinginterval 10

silentshark · Post by **silentshark** » Wed Jan 27, 2021 7:46 pm

Ferdy wrote: ↑Tue Jan 19, 2021 10:15 pm
Add this in your command line. See the progress of llr.
-ratinginterval 10

A good tip, I'm doing that now.. cheers.
