Active : 0 Machines / 0 Threads / 0 MNPS
Priority 0
MhouppStashse_negext_pvdiff8.0+0.08LLR: -1.47 (-2.94, 2.94) [0.00, 5.00]
Games: 4446 W: 1088 L: 1129 D: 2229
Ptnml(0-2): 54, 564, 1033, 513, 59
Increase Negative Singular Extensions by one ply for PV nodes Bench: 4,382,866
Priority -1
RosentWinternet_311m *diff8.0+0.08LLR: -0.71 (-2.94, 2.94) [-2.50, 2.50]
Games: 3340 W: 824 L: 857 D: 1659
Ptnml(0-2): 86, 426, 674, 403, 81
Trying to better understand the hybrid loss. Both nets are not fully trained, but are at a similar point in training.
RosentWinternet_311l *diff8.0+0.08LLR: -1.17 (-2.94, 2.94) [-3.00, 2.00]
Games: 2048 W: 480 L: 535 D: 1033
Ptnml(0-2): 57, 252, 448, 223, 44
Trying to better understand the hybrid loss. Both nets are not fully trained, but are at a similar point in training. 311n removes the CE loss component completely. Even if worth Elo, we may not want this.
Finished
RosentWinternet_311d *diff8.0+0.08LLR: -2.95 (-2.94, 2.94) [0.00, 5.00]
Games: 11346 W: 2730 L: 2809 D: 5807
Ptnml(0-2): 236, 1434, 2411, 1357, 235
Draw calibration handled with MSE loss directly on the draw probability should be less sensitive to outliers that are common in the high entropy STC time controls the training data stems from.
RosentWinternet_311 *diff40.0+0.40LLR: 2.96 (-2.94, 2.94) [0.00, 5.00]
Games: 3948 W: 860 L: 731 D: 2357
Ptnml(0-2): 32, 395, 1004, 498, 45
Fully trained model with identical hyperparameters to prior gen, but extra data. This should clearly beat the master branch.
VshcheIgelttc2 *diff60.0+0.60LLR: 1.88 (-2.94, 2.94) [-3.00, 1.00]
Games: 25814 W: 3241 L: 3202 D: 19371
Ptnml(0-2): 80, 2105, 8522, 2096, 104
3.6.10: fix cache entry
VshcheIgelttc2diff60.0+0.60LLR: 0.39 (-2.94, 2.94) [-3.00, 1.00]
Games: 2680 W: 678 L: 665 D: 1337
Ptnml(0-2): 8, 289, 738, 292, 13
3.6.10: fix cache entry
VshcheIgelttc2diff5.0+0.05LLR: 2.96 (-2.94, 2.94) [-3.00, 1.00]
Games: 9688 W: 2554 L: 2436 D: 4698
Ptnml(0-2): 41, 1061, 2524, 1175, 43
3.6.10: fix cache entry
RosentWinternet_311 *diff8.0+0.08LLR: 2.96 (-2.94, 2.94) [0.00, 5.00]
Games: 8158 W: 2101 L: 1925 D: 4132
Ptnml(0-2): 193, 922, 1694, 1056, 214
Testing net with same hyperparameters but extra self play data a bit prematurely to get a sense of where things lie.
VshcheIgelttc2 *diff5.0+0.05LLR: 2.97 (-2.94, 2.94) [-3.00, 1.00]
Games: 10058 W: 1610 L: 1497 D: 6951
Ptnml(0-2): 53, 936, 2964, 997, 79
3.6.10: fix cache entry
VshcheIgelttc2diff10.0+0.10LLR: 3.00 (-2.94, 2.94) [-3.00, 1.00]
Games: 54722 W: 14226 L: 14171 D: 26325
Ptnml(0-2): 407, 6133, 14232, 6176, 413
3.6.10: fix cache entry
RosentWinternet_test *diff40.0+0.40LLR: 2.95 (-2.94, 2.94) [0.00, 5.00]
Games: 3618 W: 807 L: 675 D: 2136
Ptnml(0-2): 37, 366, 880, 480, 46
Trained with new data. The new training data has a very different distribution and has neutral effect on validation metrics.
RosentWinternet_test *diff8.0+0.08LLR: 2.97 (-2.94, 2.94) [-2.00, 3.00]
Games: 7204 W: 1851 L: 1710 D: 3643
Ptnml(0-2): 160, 820, 1495, 973, 154
Trained with new data. The new training data has a very different distribution and has neutral effect on validation metrics.
RosentWinternet_test *diff40.0+0.40LLR: 1.77 (-2.94, 2.94) [0.00, 5.00]
Games: 4820 W: 1009 L: 922 D: 2889
Ptnml(0-2): 42, 513, 1225, 576, 54
Net trained with some extra data I had laying around and not yet utilized.
VshcheIgelttc2 *diff10.0+0.10LLR: 2.96 (-2.94, 2.94) [-3.00, 1.00]
Games: 16216 W: 3056 L: 2936 D: 10224
Ptnml(0-2): 220, 1683, 4185, 1797, 223
3.6.10: fix cache entry
VshcheIgelnn-epoch537diff10.0+0.10LLR: -1.02 (-2.94, 2.94) [0.00, 3.00]
Games: 5340 W: 1383 L: 1436 D: 2521
Ptnml(0-2): 46, 669, 1292, 618, 45
VshcheIgelnn-epoch568diff10.0+0.10LLR: -1.21 (-2.94, 2.94) [0.00, 3.00]
Games: 7160 W: 1892 L: 1953 D: 3315
Ptnml(0-2): 54, 917, 1722, 810, 77
VshcheIgelnn-epoch518diff10.0+0.10LLR: -2.95 (-2.94, 2.94) [0.00, 3.00]
Games: 14124 W: 3616 L: 3772 D: 6736
Ptnml(0-2): 117, 1791, 3404, 1631, 119
VshcheIgelnn-epoch462diff10.0+0.10LLR: -1.65 (-2.94, 2.94) [0.00, 3.00]
Games: 5138 W: 1290 L: 1387 D: 2461
Ptnml(0-2): 59, 665, 1215, 574, 56
VshcheIgelnn-epoch467diff10.0+0.10LLR: -1.10 (-2.94, 2.94) [0.00, 3.00]
Games: 3382 W: 825 L: 889 D: 1668
Ptnml(0-2): 32, 441, 810, 375, 33
VshcheIgelnn-epoch554diff10.0+0.10LLR: -1.18 (-2.94, 2.94) [0.00, 3.00]
Games: 5018 W: 1267 L: 1333 D: 2418
Ptnml(0-2): 51, 644, 1190, 568, 56
VshcheIgelnn-epoch535diff10.0+0.10LLR: -1.27 (-2.94, 2.94) [0.00, 3.00]
Games: 2194 W: 533 L: 613 D: 1048
Ptnml(0-2): 24, 311, 505, 235, 22
RosentWinternet_test *diff8.0+0.08LLR: 2.95 (-2.94, 2.94) [0.00, 5.00]
Games: 4334 W: 1208 L: 1050 D: 2076
Ptnml(0-2): 92, 482, 903, 556, 134
Net trained with some extra data I had laying around and not yet utilized.
RosentWinternet_test *diff40.0+0.40LLR: 2.17 (-2.94, 2.94) [0.00, 5.00]
Games: 8822 W: 1791 L: 1673 D: 5358
Ptnml(0-2): 80, 938, 2276, 1018, 99
Confirmed 2 year old net reproduced and slightly improved
VshcheIgelhtypodiff10.0+0.10LLR: 0.57 (-2.94, 2.94) [-3.00, 1.00]
Games: 2096 W: 558 L: 535 D: 1003
Ptnml(0-2): 13, 229, 544, 246, 16
3.6.10: fix history typo
RosentWinternet_test *diff8.0+0.08LLR: 3.03 (-2.94, 2.94) [0.00, 5.00]
Games: 27442 W: 6402 L: 6150 D: 14890
Ptnml(0-2): 464, 3213, 6148, 3399, 497
RosentWinternet_test *diff8.0+0.08LLR: -3.11 (-2.94, 2.94) [0.00, 5.00]
Games: 4948 W: 1188 L: 1304 D: 2456
Ptnml(0-2): 109, 648, 1070, 544, 103
RosentWinternet_test *diff8.0+0.08LLR: -3.06 (-2.94, 2.94) [0.00, 5.00]
Games: 7424 W: 1778 L: 1880 D: 3766
Ptnml(0-2): 163, 943, 1583, 879, 144