Being a curious sort, before I installed 1.5a into the Fritz GUI I re-checked autotune. So it appears that on my quad split depth 12 is the best setting.
Summary: Houdini 1.5
=======================================================
Split_Depth 10, Position 1: 2092 kN/s
Split_Depth 10, Position 2: 5805 kN/s
Split_Depth 10, Position 3: 5789 kN/s
Split_Depth 10, Position 4: 5855 kN/s
Split_Depth 10 - Average speed: 4885 kN/s
=======================================================
Split_Depth 12, Position 1: 5819 kN/s
Split_Depth 12, Position 2: 5804 kN/s
Split_Depth 12, Position 3: 5793 kN/s
Split_Depth 12, Position 4: 5905 kN/s
Split_Depth 12 - Average speed: 5831 kN/s
=======================================================
Split_Depth 14, Position 1: 5805 kN/s
Split_Depth 14, Position 2: 5873 kN/s
Split_Depth 14, Position 3: 5744 kN/s
Split_Depth 14, Position 4: 5810 kN/s
Split_Depth 14 - Average speed: 5808 kN/s
=======================================================
Split_Depth 16, Position 1: 5882 kN/s
Split_Depth 16, Position 2: 5856 kN/s
Split_Depth 16, Position 3: 5605 kN/s
Split_Depth 16, Position 4: 5770 kN/s
Split_Depth 16 - Average speed: 5778 kN/s
=======================================================
Split_Depth 18, Position 1: 5645 kN/s
Split_Depth 18, Position 2: 5751 kN/s
Split_Depth 18, Position 3: 5644 kN/s
Split_Depth 18, Position 4: 5553 kN/s
Split_Depth 18 - Average speed: 5648 kN/s
=======================================================
Average CPU time per thread: 624.507 s
CPU-corrected average speed: 5371 kN/s
=======================================================
Summary: Houdini 1.5a
=======================================================
Split_Depth 10, Position 1: 2092 kN/s
Split_Depth 10, Position 2: 5805 kN/s
Split_Depth 10, Position 3: 5789 kN/s
Split_Depth 10, Position 4: 5855 kN/s
Split_Depth 10 - Average speed: 4885 kN/s
=======================================================
Split_Depth 12, Position 1: 5819 kN/s
Split_Depth 12, Position 2: 5804 kN/s
Split_Depth 12, Position 3: 5793 kN/s
Split_Depth 12, Position 4: 5905 kN/s
Split_Depth 12 - Average speed: 5831 kN/s
=======================================================
Split_Depth 14, Position 1: 5805 kN/s
Split_Depth 14, Position 2: 5873 kN/s
Split_Depth 14, Position 3: 5744 kN/s
Split_Depth 14, Position 4: 5810 kN/s
Split_Depth 14 - Average speed: 5808 kN/s
=======================================================
Split_Depth 16, Position 1: 5882 kN/s
Split_Depth 16, Position 2: 5856 kN/s
Split_Depth 16, Position 3: 5605 kN/s
Split_Depth 16, Position 4: 5770 kN/s
Split_Depth 16 - Average speed: 5778 kN/s
=======================================================
Split_Depth 18, Position 1: 5645 kN/s
Split_Depth 18, Position 2: 5751 kN/s
Split_Depth 18, Position 3: 5644 kN/s
Split_Depth 18, Position 4: 5553 kN/s
Split_Depth 18 - Average speed: 5648 kN/s
=======================================================
Average CPU time per thread: 624.507 s
CPU-corrected average speed: 5371 kN/s
=======================================================
Mike
Houdini 1.5a Autotune
Moderators: hgm, Rebel, chrisw
-
- Posts: 352
- Joined: Thu Mar 09, 2006 6:00 pm
Re: Houdini 1.5a Autotune
I wonder what position 1 is....
-
- Posts: 1471
- Joined: Tue Mar 16, 2010 12:00 am
Re: Houdini 1.5a Autotune
The 4 test positions are:
[D] r2qk2r/1b1n1ppp/1P1p1b2/pBpP4/P5Q1/2N5/1P3PPP/R3K1NR b KQkq - 0 1
[D] 5rk1/p2qr1b1/1Pn3pp/2pn4/2Np1p2/1N1P2P1/1P1BPP1P/R2QR1K1 b - - 0 1
[D] rqb2rk1/p4p1p/2pb1np1/3pn3/NP1B1N2/P3P3/2Q1BPPP/R4RK1 w - - 0 1
[D] r4rk1/p1qbnppp/1pn1p3/2ppP3/P2P4/2PB1N2/2P2PPP/R1BQ1RK1 w - - 6 1
Robert
[D] r2qk2r/1b1n1ppp/1P1p1b2/pBpP4/P5Q1/2N5/1P3PPP/R3K1NR b KQkq - 0 1
[D] 5rk1/p2qr1b1/1Pn3pp/2pn4/2Np1p2/1N1P2P1/1P1BPP1P/R2QR1K1 b - - 0 1
[D] rqb2rk1/p4p1p/2pb1np1/3pn3/NP1B1N2/P3P3/2Q1BPPP/R4RK1 w - - 0 1
[D] r4rk1/p1qbnppp/1pn1p3/2ppP3/P2P4/2PB1N2/2P2PPP/R1BQ1RK1 w - - 6 1
Robert
-
- Posts: 352
- Joined: Thu Mar 09, 2006 6:00 pm
Re: Houdini 1.5a Autotune
I was ignorant about the autotune command in Houdini but gave it a try.
On my Q6600 default is best. I wonder why Mike got such a low nps on position 1 and SD=10. What CPU is that Mike?
Code: Select all
Summary:
=======================================================
Split_Depth 10, Position 1: 7573 kN/s
Split_Depth 10, Position 2: 7458 kN/s
Split_Depth 10, Position 3: 7518 kN/s
Split_Depth 10, Position 4: 7513 kN/s
Split_Depth 10 - Average speed: 7515 kN/s
=======================================================
Split_Depth 12, Position 1: 7347 kN/s
Split_Depth 12, Position 2: 7460 kN/s
Split_Depth 12, Position 3: 7114 kN/s
Split_Depth 12, Position 4: 7360 kN/s
Split_Depth 12 - Average speed: 7320 kN/s
=======================================================
Split_Depth 14, Position 1: 7066 kN/s
Split_Depth 14, Position 2: 7240 kN/s
Split_Depth 14, Position 3: 7163 kN/s
Split_Depth 14, Position 4: 6998 kN/s
Split_Depth 14 - Average speed: 7117 kN/s
=======================================================
Split_Depth 16, Position 1: 6945 kN/s
Split_Depth 16, Position 2: 7234 kN/s
Split_Depth 16, Position 3: 7256 kN/s
Split_Depth 16, Position 4: 7468 kN/s
Split_Depth 16 - Average speed: 7225 kN/s
=======================================================
Split_Depth 18, Position 1: 7090 kN/s
Split_Depth 18, Position 2: 7284 kN/s
Split_Depth 18, Position 3: 6939 kN/s
Split_Depth 18, Position 4: 6767 kN/s
Split_Depth 18 - Average speed: 7020 kN/s
=======================================================
Average CPU time per thread: 566.904 s
CPU-corrected average speed: 7662 kN/s
=======================================================
-
- Posts: 1471
- Joined: Tue Mar 16, 2010 12:00 am
Re: Houdini 1.5a Autotune
Probably another process stealing CPU time from the engine.ernst wrote:I wonder why Mike got such a low nps on position 1 and SD=10.
It's better to restart the test to make sure.
-
- Posts: 2041
- Joined: Wed Mar 08, 2006 8:30 pm
Re: Houdini 1.5a Autotune
Actually, if you run it again, with your Position 1 bug (at Split_Depth 10) corrected, you will see that your performance at Split_Depth 10, 12, 14, (and maybe even 16) is practically the same!...mhurd wrote:So it appears that on my quad split depth 12 is the best setting.
(run again => different numbers anyway, because of MP non-reproducibility)
-
- Posts: 20943
- Joined: Mon Feb 27, 2006 7:30 pm
- Location: Birmingham, AL
Re: Houdini 1.5a Autotune
Split depth is about more than just raw NPS. Time to depth is also important. You drive NPS up by splitting deeper in the tree. But you also drive up overhead since ordering is worse near the tips. There is a two-pronged test you have to do, measure both NPS _and_ time to depth...mhurd wrote:Being a curious sort, before I installed 1.5a into the Fritz GUI I re-checked autotune. So it appears that on my quad split depth 12 is the best setting.
Summary: Houdini 1.5
=======================================================
Split_Depth 10, Position 1: 2092 kN/s
Split_Depth 10, Position 2: 5805 kN/s
Split_Depth 10, Position 3: 5789 kN/s
Split_Depth 10, Position 4: 5855 kN/s
Split_Depth 10 - Average speed: 4885 kN/s
=======================================================
Split_Depth 12, Position 1: 5819 kN/s
Split_Depth 12, Position 2: 5804 kN/s
Split_Depth 12, Position 3: 5793 kN/s
Split_Depth 12, Position 4: 5905 kN/s
Split_Depth 12 - Average speed: 5831 kN/s
=======================================================
Split_Depth 14, Position 1: 5805 kN/s
Split_Depth 14, Position 2: 5873 kN/s
Split_Depth 14, Position 3: 5744 kN/s
Split_Depth 14, Position 4: 5810 kN/s
Split_Depth 14 - Average speed: 5808 kN/s
=======================================================
Split_Depth 16, Position 1: 5882 kN/s
Split_Depth 16, Position 2: 5856 kN/s
Split_Depth 16, Position 3: 5605 kN/s
Split_Depth 16, Position 4: 5770 kN/s
Split_Depth 16 - Average speed: 5778 kN/s
=======================================================
Split_Depth 18, Position 1: 5645 kN/s
Split_Depth 18, Position 2: 5751 kN/s
Split_Depth 18, Position 3: 5644 kN/s
Split_Depth 18, Position 4: 5553 kN/s
Split_Depth 18 - Average speed: 5648 kN/s
=======================================================
Average CPU time per thread: 624.507 s
CPU-corrected average speed: 5371 kN/s
=======================================================
Summary: Houdini 1.5a
=======================================================
Split_Depth 10, Position 1: 2092 kN/s
Split_Depth 10, Position 2: 5805 kN/s
Split_Depth 10, Position 3: 5789 kN/s
Split_Depth 10, Position 4: 5855 kN/s
Split_Depth 10 - Average speed: 4885 kN/s
=======================================================
Split_Depth 12, Position 1: 5819 kN/s
Split_Depth 12, Position 2: 5804 kN/s
Split_Depth 12, Position 3: 5793 kN/s
Split_Depth 12, Position 4: 5905 kN/s
Split_Depth 12 - Average speed: 5831 kN/s
=======================================================
Split_Depth 14, Position 1: 5805 kN/s
Split_Depth 14, Position 2: 5873 kN/s
Split_Depth 14, Position 3: 5744 kN/s
Split_Depth 14, Position 4: 5810 kN/s
Split_Depth 14 - Average speed: 5808 kN/s
=======================================================
Split_Depth 16, Position 1: 5882 kN/s
Split_Depth 16, Position 2: 5856 kN/s
Split_Depth 16, Position 3: 5605 kN/s
Split_Depth 16, Position 4: 5770 kN/s
Split_Depth 16 - Average speed: 5778 kN/s
=======================================================
Split_Depth 18, Position 1: 5645 kN/s
Split_Depth 18, Position 2: 5751 kN/s
Split_Depth 18, Position 3: 5644 kN/s
Split_Depth 18, Position 4: 5553 kN/s
Split_Depth 18 - Average speed: 5648 kN/s
=======================================================
Average CPU time per thread: 624.507 s
CPU-corrected average speed: 5371 kN/s
=======================================================
Mike
-
- Posts: 46
- Joined: Wed Dec 30, 2009 9:27 pm
Re: Houdini 1.5a Autotune
Robert wrote
Summary:
=======================================================
Split_Depth 10, Position 1: 4867 kN/s
Split_Depth 10, Position 2: 5113 kN/s
Split_Depth 10, Position 3: 4755 kN/s
Split_Depth 10, Position 4: 4785 kN/s
Split_Depth 10 - Average speed: 4880 kN/s
=======================================================
Split_Depth 12, Position 1: 4922 kN/s
Split_Depth 12, Position 2: 4942 kN/s
Split_Depth 12, Position 3: 4626 kN/s
Split_Depth 12, Position 4: 4610 kN/s
Split_Depth 12 - Average speed: 4775 kN/s
=======================================================
Split_Depth 14, Position 1: 4726 kN/s
Split_Depth 14, Position 2: 5289 kN/s
Split_Depth 14, Position 3: 4470 kN/s
Split_Depth 14, Position 4: 4748 kN/s
Split_Depth 14 - Average speed: 4808 kN/s
=======================================================
Split_Depth 16, Position 1: 4699 kN/s
Split_Depth 16, Position 2: 4682 kN/s
Split_Depth 16, Position 3: 4650 kN/s
Split_Depth 16, Position 4: 4968 kN/s
Split_Depth 16 - Average speed: 4750 kN/s
=======================================================
Split_Depth 18, Position 1: 4735 kN/s
Split_Depth 18, Position 2: 5289 kN/s
Split_Depth 18, Position 3: 5365 kN/s
Split_Depth 18, Position 4: 4650 kN/s
Split_Depth 18 - Average speed: 5010 kN/s
=======================================================
Average CPU time per thread: 551.440 s
CPU-corrected average speed: 5271 kN/s
=======================================================
Ernst wrote:
Mike
I ran G boost to stop all non-essential services then ran the test again with nothing else running. This time split depth 18 comes out best. Other runs have had 10 come out best. Maybe due to non reproducibility I had better leave at default.Probably another process stealing CPU time from the engine.
It's better to restart the test to make sure.
Summary:
=======================================================
Split_Depth 10, Position 1: 4867 kN/s
Split_Depth 10, Position 2: 5113 kN/s
Split_Depth 10, Position 3: 4755 kN/s
Split_Depth 10, Position 4: 4785 kN/s
Split_Depth 10 - Average speed: 4880 kN/s
=======================================================
Split_Depth 12, Position 1: 4922 kN/s
Split_Depth 12, Position 2: 4942 kN/s
Split_Depth 12, Position 3: 4626 kN/s
Split_Depth 12, Position 4: 4610 kN/s
Split_Depth 12 - Average speed: 4775 kN/s
=======================================================
Split_Depth 14, Position 1: 4726 kN/s
Split_Depth 14, Position 2: 5289 kN/s
Split_Depth 14, Position 3: 4470 kN/s
Split_Depth 14, Position 4: 4748 kN/s
Split_Depth 14 - Average speed: 4808 kN/s
=======================================================
Split_Depth 16, Position 1: 4699 kN/s
Split_Depth 16, Position 2: 4682 kN/s
Split_Depth 16, Position 3: 4650 kN/s
Split_Depth 16, Position 4: 4968 kN/s
Split_Depth 16 - Average speed: 4750 kN/s
=======================================================
Split_Depth 18, Position 1: 4735 kN/s
Split_Depth 18, Position 2: 5289 kN/s
Split_Depth 18, Position 3: 5365 kN/s
Split_Depth 18, Position 4: 4650 kN/s
Split_Depth 18 - Average speed: 5010 kN/s
=======================================================
Average CPU time per thread: 551.440 s
CPU-corrected average speed: 5271 kN/s
=======================================================
Ernst wrote:
My CPU is an Intel Quad Q8200 @2.33 GHZ with 4 MB ram.On my Q6600 default is best. I wonder why Mike got such a low nps on position 1 and SD=10. What CPU is that Mike?
Mike
-
- Posts: 162
- Joined: Thu Dec 17, 2009 10:46 am
Re: Houdini 1.5a Autotune
I second Bobs remark. Using NPS to tune split depth is dubious, because different trees are searched and hash table interactions take place. A better test is time to depth or - as i prefer - time to solution, over a reasonable number of positions and using a large number of runs per position. Imagine larger subtrees, leading most likely to higher nps but also to more tree overlap and therefore to slower time to depth/solution. As your autotune code is already in place, maybe you can test this easily and give us feedback whether a time to depth/solution gives different results.
bob wrote:Split depth is about more than just raw NPS. Time to depth is also important. You drive NPS up by splitting deeper in the tree. But you also drive up overhead since ordering is worse near the tips. There is a two-pronged test you have to do, measure both NPS _and_ time to depth...mhurd wrote:Being a curious sort, before I installed 1.5a into the Fritz GUI I re-checked autotune. So it appears that on my quad split depth 12 is the best setting.
Summary: Houdini 1.5
=======================================================
Split_Depth 10, Position 1: 2092 kN/s
Split_Depth 10, Position 2: 5805 kN/s
Split_Depth 10, Position 3: 5789 kN/s
Split_Depth 10, Position 4: 5855 kN/s
Split_Depth 10 - Average speed: 4885 kN/s
=======================================================
Split_Depth 12, Position 1: 5819 kN/s
Split_Depth 12, Position 2: 5804 kN/s
Split_Depth 12, Position 3: 5793 kN/s
Split_Depth 12, Position 4: 5905 kN/s
Split_Depth 12 - Average speed: 5831 kN/s
=======================================================
Split_Depth 14, Position 1: 5805 kN/s
Split_Depth 14, Position 2: 5873 kN/s
Split_Depth 14, Position 3: 5744 kN/s
Split_Depth 14, Position 4: 5810 kN/s
Split_Depth 14 - Average speed: 5808 kN/s
=======================================================
Split_Depth 16, Position 1: 5882 kN/s
Split_Depth 16, Position 2: 5856 kN/s
Split_Depth 16, Position 3: 5605 kN/s
Split_Depth 16, Position 4: 5770 kN/s
Split_Depth 16 - Average speed: 5778 kN/s
=======================================================
Split_Depth 18, Position 1: 5645 kN/s
Split_Depth 18, Position 2: 5751 kN/s
Split_Depth 18, Position 3: 5644 kN/s
Split_Depth 18, Position 4: 5553 kN/s
Split_Depth 18 - Average speed: 5648 kN/s
=======================================================
Average CPU time per thread: 624.507 s
CPU-corrected average speed: 5371 kN/s
=======================================================
Summary: Houdini 1.5a
=======================================================
Split_Depth 10, Position 1: 2092 kN/s
Split_Depth 10, Position 2: 5805 kN/s
Split_Depth 10, Position 3: 5789 kN/s
Split_Depth 10, Position 4: 5855 kN/s
Split_Depth 10 - Average speed: 4885 kN/s
=======================================================
Split_Depth 12, Position 1: 5819 kN/s
Split_Depth 12, Position 2: 5804 kN/s
Split_Depth 12, Position 3: 5793 kN/s
Split_Depth 12, Position 4: 5905 kN/s
Split_Depth 12 - Average speed: 5831 kN/s
=======================================================
Split_Depth 14, Position 1: 5805 kN/s
Split_Depth 14, Position 2: 5873 kN/s
Split_Depth 14, Position 3: 5744 kN/s
Split_Depth 14, Position 4: 5810 kN/s
Split_Depth 14 - Average speed: 5808 kN/s
=======================================================
Split_Depth 16, Position 1: 5882 kN/s
Split_Depth 16, Position 2: 5856 kN/s
Split_Depth 16, Position 3: 5605 kN/s
Split_Depth 16, Position 4: 5770 kN/s
Split_Depth 16 - Average speed: 5778 kN/s
=======================================================
Split_Depth 18, Position 1: 5645 kN/s
Split_Depth 18, Position 2: 5751 kN/s
Split_Depth 18, Position 3: 5644 kN/s
Split_Depth 18, Position 4: 5553 kN/s
Split_Depth 18 - Average speed: 5648 kN/s
=======================================================
Average CPU time per thread: 624.507 s
CPU-corrected average speed: 5371 kN/s
=======================================================
Mike