nodchip's Computer Shogi Blog

This blog publishes experimental results for the "tanuki-" series of computer shogi programs.

tanuki- 2018-02-18 NNUE evaluation function + zoukin shibori ("rag wringing", iterative retraining)

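  • Game record generation
    • tanuki- game record generation routine
    • Evaluation function
      • eval_apery_4.5G_limit3000
    • Starting positions
      • Selected from roots.hcp
      • Each game starts from the position reached by playing one random move from the selected position
    • Search depth: 8
    • Absolute evaluation value at which self-play is cut off: 3000
    • Number of positions generated: 2 billion
  • Shuffle
    • Yes
  • Machine learning
    • NNUE machine learning routine
      • Training method: mini-batch SGD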
      • batchsize 1000000
      • lambda 1.0 or 0.5
      • eta 1.0
      • newbob_decay 0.5
      • nn_batch_size 1000
      • eval_save_interval 100000000
      • loss_output_interval 1000000
      • mirror_percentage 50
      • eval_limit 3000
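
The lambda setting above takes the values 1.0 and 0.5, and the evaluation file paths in the results are labeled "normal" and "elmo" respectively. The following is a minimal sketch of how such a lambda is commonly used in elmo-style training: it blends the win rate implied by the teacher's search score with the actual game result. The function names, the sigmoid scale of 600, and the exact blending formula are assumptions for illustration and are not taken from this post.

```python
import math


def win_rate(eval_value, scale=600.0):
    """Map an evaluation value to an estimated win rate with a sigmoid.

    The scale of 600 is an assumed convention, not a value from this post.
    """
    return 1.0 / (1.0 + math.exp(-eval_value / scale))


def blended_target(teacher_eval, game_result, lam):
    """Blend the teacher's win rate with the game result (1 win, 0.5 draw, 0 loss).

    lam = 1.0 uses only the teacher evaluation; lam = 0.5 weights both equally.
    """
    return lam * win_rate(teacher_eval) + (1.0 - lam) * game_result


def loss_gradient(model_eval, teacher_eval, game_result, lam):
    """Gradient of the cross-entropy loss with respect to the sigmoid input."""
    return win_rate(model_eval) - blended_target(teacher_eval, game_result, lam)
```

Under these assumptions, a position the teacher scores as 0 in a game the side to move went on to win has a target of 0.5 with lambda=1.0 and 0.75 with lambda=0.5.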

      Experimental results

      Experiment 1

      Zoukin shibori (iterative retraining)

      lambda=1.0

      03:28:03 engine1=nnue\source\YaneuraOu-by-gcc.exe eval1=F:\hnoda\nnue\eval\zoukin2.2018-02-11.normal\final
      03:28:03 engine2=hakubishin-private\build\2017Early\YaneuraOu-2017-early.exe eval2=F:\hnoda\hakubishin\exe\eval\apery_sdt5_eval_twig_format
      03:28:03 T1,b5000,388 - 37 - 575(40.29% R-68.33) win black: white = 53.79% : 46.21% declaration win engine1=0 engine2=0

      11:28:24 engine1=nnue\source\YaneuraOu-by-gcc.exe eval1=F:\hnoda\nnue\eval\zoukin2.2018-02-11.normal\final
      11:28:24 engine2=nnue\source\YaneuraOu-by-gcc.exe eval2=F:\ynasu\nnue-v0.2\p-k-halfkp_96w-32\eval_apery_4.5G_limit3000
      11:28:24 T1,b5000,471 - 55 - 474(49.84% R-1.10) win black: white = 54.39% : 45.61% declaration win engine1=0 engine2=0
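
The result lines in these logs can be checked by hand. The sketch below reads the three numbers as engine1 wins, draws, and engine1 losses, an interpretation that matches the printed percentage and R value when draws are excluded. The function names are hypothetical, and the rating formula is the standard logistic Elo difference, which happens to reproduce the logged numbers; the match script itself is not shown in this post.

```python
import math
import re

# Matches the "wins - draws - losses(" part of a result line.
RESULT_RE = re.compile(r"(\d+) - (\d+) - (\d+)\(")


def parse_result(line):
    """Extract (wins, draws, losses) for engine1 from a result line."""
    match = RESULT_RE.search(line)
    if match is None:
        raise ValueError("no result triple found in line")
    wins, draws, losses = (int(g) for g in match.groups())
    return wins, draws, losses


def win_rate(wins, losses):
    """Win rate of engine1 with draws excluded."""
    return wins / (wins + losses)


def rating_diff(wins, losses):
    """Elo-style rating difference of engine1 relative to engine2."""
    if wins == 0 or losses == 0:
        raise ValueError("rating difference is unbounded without both wins and losses")
    return 400.0 * math.log10(wins / losses)


line = "T1,b5000,388 - 37 - 575(40.29% R-68.33)"
wins, draws, losses = parse_result(line)
print(f"{100 * win_rate(wins, losses):.2f}% R{rating_diff(wins, losses):.2f}")
```

Run on the first result line of Experiment 1, this prints the same 40.29% and R-68.33 that appear in the log.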

      Experiment 2

      lambda=0.5

      20:26:43 engine1=nnue\source\YaneuraOu-by-gcc.exe eval1=F:\hnoda\nnue\eval\zoukin2.2018-02-11.elmo\final
      20:26:43 engine2=hakubishin-private\build\2017Early\YaneuraOu-2017-early.exe eval2=F:\hnoda\hakubishin\exe\eval\apery_sdt5_eval_twig_format
      20:26:43 T1,b5000,375 - 36 - 589(38.90% R-78.43) win black: white = 52.80% : 47.20% declaration win engine1=0 engine2=0

      04:34:56 engine1=nnue\source\YaneuraOu-by-gcc.exe eval1=F:\hnoda\nnue\eval\zoukin2.2018-02-11.elmo\final
      04:34:56 engine2=nnue\source\YaneuraOu-by-gcc.exe eval2=F:\ynasu\nnue-v0.2\p-k-halfkp_96w-32\eval_apery_4.5G_limit3000
      04:34:56 T1,b5000,476 - 54 - 470(50.32% R2.20) win black: white = 51.90% : 48.10% declaration win engine1=0 engine2=0

      Experiment 3

      Discount applied to the evaluation values before shuffling

      18:48:15 engine1=nnue\source\YaneuraOu-by-gcc.exe eval1=F:\hnoda\nnue\eval\zoukin2.2018-02-11.discount\final
      18:48:15 engine2=hakubishin-private\build\2017Early\YaneuraOu-2017-early.exe eval2=F:\hnoda\hakubishin\exe\eval\apery_sdt5_eval_twig_format
      18:48:15 T1,b5000,388 - 30 - 582(40.00% R-70.44) win black: white = 51.65% : 48.35% declaration win engine1=0 engine2=0

      02:47:15 engine1=nnue\source\YaneuraOu-by-gcc.exe eval1=F:\hnoda\nnue\eval\zoukin2.2018-02-11.discount\final
      02:47:15 engine2=nnue\source\YaneuraOu-by-gcc.exe eval2=F:\ynasu\nnue-v0.2\p-k-halfkp_96w-32\eval_apery_4.5G_limit3000
      02:47:15 T1,b5000,455 - 53 - 492(48.05% R-13.58) win black: white = 49.95% : 50.05% declaration win engine1=0 engine2=0

      Experiment 4

      Evaluation values slid along the move-number axis before shuffling

      Slide amount = 8

      16:49:16 engine1=nnue\source\YaneuraOu-by-gcc.exe eval1=F:\hnoda\nnue\eval\zoukin2.2018-02-11.slide\final
      16:49:16 engine2=hakubishin-private\build\2017Early\YaneuraOu-2017-early.exe eval2=F:\hnoda\hakubishin\exe\eval\apery_sdt5_eval_twig_format
      16:49:16 T1,b5000,382 - 42 - 576(39.87% R-71.34) win black: white = 52.82% : 47.18% declaration win engine1=0 engine2=0

      00:48:02 engine1=nnue\source\YaneuraOu-by-gcc.exe eval1=F:\hnoda\nnue\eval\zoukin2.2018-02-11.slide\final
      00:48:02 engine2=nnue\source\YaneuraOu-by-gcc.exe eval2=F:\ynasu\nnue-v0.2\p-k-halfkp_96w-32\eval_apery_4.5G_limit3000
      00:48:02 T1,b5000,474 - 54 - 472(50.11% R0.73) win black: white = 52.75% : 47.25% declaration win engine1=0 engine2=0
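
The post does not spell out how the slide in Experiment 4 is implemented. One possible reading, sketched below purely as an assumption, is that each position in a game is paired with the evaluation value recorded a fixed number of plies later in the same game, and positions too close to the end of the game are dropped. The function name and the list-based game representation are hypothetical.

```python
def slide_evals(evals, slide=8):
    """Pair each ply index with the evaluation recorded `slide` plies later.

    `evals` holds one evaluation value per ply of a single game. Plies whose
    shifted index would fall past the end of the game are dropped. This is an
    assumed interpretation of the slide, not the post's actual routine.
    """
    if slide < 0:
        raise ValueError("slide must be non-negative")
    return [(ply, evals[ply + slide]) for ply in range(len(evals) - slide)]
```

If the values are stored from the side to move's point of view, an even slide such as 8 lands on a ply with the same side to move, so under this reading no sign flip would be needed.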