OpenBench

OpenBench Testing Framework

Active : 6 Machines / 94 Threads / 157.22 MNPS
Priority -1
Terje	Weiss	NoisyUnderPromotions	diff	10.0+0.10	LLR: 1.85 (-2.94, 2.94) [-3.00, 0.00] Games: 69222 W: 19435 L: 19488 D: 30299 Ptnml(0-2): 1292, 8360, 15353, 8321, 1285	Generate under-promotions as noisy (now with faster bench)
Terje	Weiss	UpcomingRepetition	diff	10.0+0.10	LLR: 0.39 (-2.94, 2.94) [-3.00, 0.00] Games: 96730 W: 27223 L: 27457 D: 42050 Ptnml(0-2): 1904, 11667, 21366, 11615, 1813

Finished
Mhoupp	Stash	thread_voting_v2	diff	20.0+0.20	LLR: 2.96 (-2.94, 2.94) [0.00, 5.00] Games: 7848 W: 1877 L: 1745 D: 4226 Ptnml(0-2): 24, 831, 2081, 965, 23	Add thread voting to search (Note: impl derived from Stormphrax which is itself ported from Stockfish)
Terje	Weiss	NoisyUnderPromotions	diff	10.0+0.10	LLR: -1.18 (-2.94, 2.94) [-3.00, 0.00] Games: 58908 W: 16460 L: 16704 D: 25744 Ptnml(0-2): 1146, 7104, 13096, 7064, 1044	Generate under-promotions as noisy
Mhoupp	Stash	thread_voting_v2	diff	5.0+0.05	LLR: 2.99 (-2.94, 2.94) [0.00, 5.00] Games: 10940 W: 2732 L: 2572 D: 5636 Ptnml(0-2): 96, 1250, 2608, 1430, 86	Add thread voting to search (Note: impl derived from Stormphrax which is itself ported from Stockfish)
Mhoupp	Stash	thread_voting	diff	5.0+0.05	LLR: 0.82 (-2.94, 2.94) [0.00, 5.00] Games: 41554 W: 9991 L: 9795 D: 21768 Ptnml(0-2): 316, 4908, 10170, 5030, 353	Add thread voting to search (Note: impl derived from Stormphrax which is itself ported from Stockfish)
Mhoupp	Stash	master	diff	40.0+0.40	Elo: 32.48 +- 2.34 (95%) [N=20000] Games: 20000 W: 3634 L: 1770 D: 14596 Ptnml(0-2): 57, 1142, 5919, 2644, 238	Progression test
Kieren	Halogen	spsa_tune_21	diff	40.0+0.40	Tuning 122 Parameters 1888/25000 Iterations 30220/400000 Games Played
Eren	Stash	depth-8-fp	diff	40.0+0.40	LLR: -2.68 (-2.94, 2.94) [0.00, 5.00] Games: 10618 W: 2552 L: 2611 D: 5455 Ptnml(0-2): 71, 1291, 2640, 1240, 67	[SPEC] LTC
Eren	Stash	depth-8-fp	diff	8.0+0.08	LLR: -2.97 (-2.94, 2.94) [0.00, 5.00] Games: 15034 W: 3832 L: 3890 D: 7312 Ptnml(0-2): 215, 1804, 3513, 1794, 191	more deeper quiet futility pruning
Eren	Stash	improving-probcut	diff	8.0+0.08	LLR: -3.58 (-2.94, 2.94) [0.00, 5.00] Games: 16088 W: 4071 L: 4150 D: 7867 Ptnml(0-2): 223, 2005, 3658, 1944, 214	apply improving margin to probcut
Eren	Stash	depth-5-se	diff	8.0+0.08	LLR: -3.00 (-2.94, 2.94) [0.00, 5.00] Games: 6410 W: 1561 L: 1655 D: 3194 Ptnml(0-2): 84, 810, 1490, 758, 63	shallower SE attempts
Eren	Stash	better-razoring	diff	8.0+0.08	LLR: 3.05 (-2.94, 2.94) [0.00, 5.00] Games: 13690 W: 3659 L: 3477 D: 6554 Ptnml(0-2): 161, 1592, 3188, 1712, 192	an idea from Halogengine
Eren	Stash	better-razoring	diff	40.0+0.40	LLR: 2.97 (-2.94, 2.94) [0.00, 5.00] Games: 34204 W: 8355 L: 8112 D: 17737 Ptnml(0-2): 212, 3966, 8504, 4207, 213	an idea from Halogengine
Mhoupp	Stash	chp_lmr_depth	diff	8.0+0.08	LLR: -2.97 (-2.94, 2.94) [0.00, 5.00] Games: 20852 W: 5155 L: 5188 D: 10509 Ptnml(0-2): 259, 2571, 4795, 2546, 255	Use LMR depth for Continuation History Pruning
Mhoupp	Stash	lmr_cutnode_no_ttmove	diff	40.0+0.40	LLR: -2.96 (-2.94, 2.94) [0.00, 5.00] Games: 17832 W: 4154 L: 4195 D: 9483 Ptnml(0-2): 115, 2096, 4497, 2131, 77	Increase LMR for expected cutnodes with no TT move
Mhoupp	Stash	lmr_cutnode_no_ttmove	diff	8.0+0.08	LLR: 3.15 (-2.94, 2.94) [0.00, 5.00] Games: 37742 W: 9551 L: 9263 D: 18928 Ptnml(0-2): 475, 4450, 8738, 4728, 480	Increase LMR for expected cutnodes with no TT move
Rosent	Winter	net_end_p *	diff	8.0+0.08	LLR: -2.96 (-2.94, 2.94) [-5.00, 0.00] Games: 20464 W: 5192 L: 5430 D: 9842 Ptnml(0-2): 599, 2484, 4207, 2440, 502	A much more reasonable middle ground.
Rosent	Winter	net_end_p *	diff	8.0+0.08	LLR: -2.98 (-2.94, 2.94) [-5.00, 0.00] Games: 1216 W: 243 L: 393 D: 580 Ptnml(0-2): 54, 189, 236, 111, 18	Regression testing net training idea.
Mhoupp	Stash	node_timeman	diff	40.0+0.40	LLR: -3.00 (-2.94, 2.94) [0.00, 5.00] Games: 3628 W: 777 L: 871 D: 1980 Ptnml(0-2): 29, 436, 958, 382, 9	Add node repartition scaling to time management
Rosent	Winter	net_end_p *	diff	8.0+0.08	LLR: 3.08 (-2.94, 2.94) [-5.00, 0.00] Games: 13636 W: 3293 L: 3219 D: 7124 Ptnml(0-2): 272, 1627, 2966, 1661, 292	Non-regression sanity check.
Mhoupp	Stash	node_timeman	diff	8.0+0.08	LLR: -3.04 (-2.94, 2.94) [0.00, 5.00] Games: 8042 W: 1941 L: 2029 D: 4072 Ptnml(0-2): 81, 1028, 1906, 910, 96	Add node repartition scaling to time management
Mhoupp	Stash	lmr_less_for_checks	diff	40.0+0.40	LLR: -2.95 (-2.94, 2.94) [0.00, 5.00] Games: 20020 W: 4691 L: 4723 D: 10606 Ptnml(0-2): 107, 2377, 5054, 2385, 87	Decrease LMR if the move gives check
Rosent	Winter	net_end_p	diff	N=64000	Elo: 7.25 +- 13.90 (95%) [N=1000] Games: 1006 W: 272 L: 251 D: 483 Ptnml(0-2): 19, 118, 213, 129, 24	Sanity test of net training
Kieren	Halogen	r140	diff	40.0+0.40	LLR: -3.00 (-2.94, 2.94) [0.00, 3.00] Games: 23402 W: 5751 L: 5874 D: 11777 Ptnml(0-2): 58, 2740, 6228, 2617, 58
Kieren	Halogen	r140	diff	8.0+0.08	LLR: -2.96 (-2.94, 2.94) [0.00, 3.00] Games: 21304 W: 5467 L: 5604 D: 10233 Ptnml(0-2): 179, 2559, 5287, 2474, 153
Kieren	Halogen	r141	diff	40.0+0.40	LLR: -2.96 (-2.94, 2.94) [0.00, 3.00] Games: 16686 W: 4066 L: 4202 D: 8418 Ptnml(0-2): 43, 2006, 4373, 1886, 35

1 2 3 1331 1332 1333