История изменений
Исправление aist1, (текущая версия) :
Dell Precision 7540, Ubuntu 20.04 in Hyper-V (Windows 10 Pro), 64GB RAM. i7-9850H.
Current date/time: Sun Aug 8 19:06:28 2021
CPU frequency: 4.174 GHz
Number of CPUs: 1
Number of cores: 6
Number of threads: 6
Parameters are set to:
Number of tests: 15
Number of equations to solve (problem size) : 1000 2000 5000 10000 15000 18000 20000 22000 25000 26000 27000 30000 35000 40000 45000
Leading dimension of array : 1000 2000 5008 10000 15000 18008 20016 22008 25000 26000 27000 30000 35000 40000 45000
Number of trials to run : 4 2 2 2 2 2 2 2 2 2 1 1 1 1 1
Data alignment value (in Kbytes) : 4 4 4 4 4 4 4 3 1 1 1 1 1 1 1
Maximum memory requested that can be used=16200901024, at the size=45000
=================== Timing linear equation system solver ===================
Size LDA Align. Time(s) GFlops Residual Residual(norm) Check
1000 1000 4 0.005 146.8603 9.394430e-13 3.203742e-02 pass
1000 1000 4 0.004 148.9902 9.394430e-13 3.203742e-02 pass
1000 1000 4 0.004 164.2513 9.394430e-13 3.203742e-02 pass
1000 1000 4 0.004 171.4927 9.394430e-13 3.203742e-02 pass
2000 2000 4 0.036 146.3781 3.842024e-12 3.342090e-02 pass
2000 2000 4 0.036 147.4116 3.842024e-12 3.342090e-02 pass
5000 5008 4 0.474 176.0769 2.313949e-11 3.226615e-02 pass
5000 5008 4 0.458 181.9024 2.313949e-11 3.226615e-02 pass
10000 10000 4 3.585 186.0133 9.955517e-11 3.510416e-02 pass
10000 10000 4 3.703 180.1103 9.955517e-11 3.510416e-02 pass
15000 15000 4 12.117 185.7226 2.246575e-10 3.538393e-02 pass
15000 15000 4 12.459 180.6297 2.246575e-10 3.538393e-02 pass
18000 18008 4 21.695 179.2436 2.518988e-10 2.758602e-02 pass
18000 18008 4 21.669 179.4534 2.518988e-10 2.758602e-02 pass
20000 20016 4 30.633 174.1317 3.520981e-10 3.116839e-02 pass
20000 20016 4 30.106 177.1758 3.520981e-10 3.116839e-02 pass
22000 22008 3 39.537 179.5689 4.260146e-10 3.120390e-02 pass
22000 22008 3 39.690 178.8786 4.260146e-10 3.120390e-02 pass
25000 25000 1 58.186 179.0438 5.639031e-10 3.206715e-02 pass
25000 25000 1 58.295 178.7112 5.639031e-10 3.206715e-02 pass
26000 26000 1 65.652 178.4961 6.647586e-10 3.495500e-02 pass
26000 26000 1 65.890 177.8518 6.647586e-10 3.495500e-02 pass
27000 27000 1 74.438 176.3011 6.293582e-10 3.069070e-02 pass
30000 30000 1 98.852 182.1086 8.721390e-10 3.437981e-02 pass
35000 35000 1 160.014 178.6459 1.021299e-09 2.964677e-02 pass
40000 40000 1 236.581 180.3609 1.302909e-09 2.897713e-02 pass
45000 45000 1 337.243 180.1492 1.909955e-09 3.360364e-02 pass
Performance Summary (GFlops)
Size LDA Align. Average Maximal
1000 1000 4 157.8986 171.4927
2000 2000 4 146.8949 147.4116
5000 5008 4 178.9897 181.9024
10000 10000 4 183.0618 186.0133
15000 15000 4 183.1762 185.7226
18000 18008 4 179.3485 179.4534
20000 20016 4 175.6538 177.1758
22000 22008 3 179.2238 179.5689
25000 25000 1 178.8775 179.0438
26000 26000 1 178.1740 178.4961
27000 27000 1 176.3011 176.3011
30000 30000 1 182.1086 182.1086
35000 35000 1 178.6459 178.6459
40000 40000 1 180.3609 180.3609
45000 45000 1 180.1492 180.1492
Residual checks PASSED
End of tests
Т.е. тут видно, что лэптоп класса рабочей станции выйдет на плато и там и будет сидеть. Частоту весь тест держал 3.5GHz стабильно.
Исправление aist1, :
Dell Precision 7540, Ubuntu 20.04 in Hyper-V (Windows 10 Pro), 64GB RAM.
Current date/time: Sun Aug 8 19:06:28 2021
CPU frequency: 4.174 GHz
Number of CPUs: 1
Number of cores: 6
Number of threads: 6
Parameters are set to:
Number of tests: 15
Number of equations to solve (problem size) : 1000 2000 5000 10000 15000 18000 20000 22000 25000 26000 27000 30000 35000 40000 45000
Leading dimension of array : 1000 2000 5008 10000 15000 18008 20016 22008 25000 26000 27000 30000 35000 40000 45000
Number of trials to run : 4 2 2 2 2 2 2 2 2 2 1 1 1 1 1
Data alignment value (in Kbytes) : 4 4 4 4 4 4 4 3 1 1 1 1 1 1 1
Maximum memory requested that can be used=16200901024, at the size=45000
=================== Timing linear equation system solver ===================
Size LDA Align. Time(s) GFlops Residual Residual(norm) Check
1000 1000 4 0.005 146.8603 9.394430e-13 3.203742e-02 pass
1000 1000 4 0.004 148.9902 9.394430e-13 3.203742e-02 pass
1000 1000 4 0.004 164.2513 9.394430e-13 3.203742e-02 pass
1000 1000 4 0.004 171.4927 9.394430e-13 3.203742e-02 pass
2000 2000 4 0.036 146.3781 3.842024e-12 3.342090e-02 pass
2000 2000 4 0.036 147.4116 3.842024e-12 3.342090e-02 pass
5000 5008 4 0.474 176.0769 2.313949e-11 3.226615e-02 pass
5000 5008 4 0.458 181.9024 2.313949e-11 3.226615e-02 pass
10000 10000 4 3.585 186.0133 9.955517e-11 3.510416e-02 pass
10000 10000 4 3.703 180.1103 9.955517e-11 3.510416e-02 pass
15000 15000 4 12.117 185.7226 2.246575e-10 3.538393e-02 pass
15000 15000 4 12.459 180.6297 2.246575e-10 3.538393e-02 pass
18000 18008 4 21.695 179.2436 2.518988e-10 2.758602e-02 pass
18000 18008 4 21.669 179.4534 2.518988e-10 2.758602e-02 pass
20000 20016 4 30.633 174.1317 3.520981e-10 3.116839e-02 pass
20000 20016 4 30.106 177.1758 3.520981e-10 3.116839e-02 pass
22000 22008 3 39.537 179.5689 4.260146e-10 3.120390e-02 pass
22000 22008 3 39.690 178.8786 4.260146e-10 3.120390e-02 pass
25000 25000 1 58.186 179.0438 5.639031e-10 3.206715e-02 pass
25000 25000 1 58.295 178.7112 5.639031e-10 3.206715e-02 pass
26000 26000 1 65.652 178.4961 6.647586e-10 3.495500e-02 pass
26000 26000 1 65.890 177.8518 6.647586e-10 3.495500e-02 pass
27000 27000 1 74.438 176.3011 6.293582e-10 3.069070e-02 pass
30000 30000 1 98.852 182.1086 8.721390e-10 3.437981e-02 pass
35000 35000 1 160.014 178.6459 1.021299e-09 2.964677e-02 pass
40000 40000 1 236.581 180.3609 1.302909e-09 2.897713e-02 pass
45000 45000 1 337.243 180.1492 1.909955e-09 3.360364e-02 pass
Performance Summary (GFlops)
Size LDA Align. Average Maximal
1000 1000 4 157.8986 171.4927
2000 2000 4 146.8949 147.4116
5000 5008 4 178.9897 181.9024
10000 10000 4 183.0618 186.0133
15000 15000 4 183.1762 185.7226
18000 18008 4 179.3485 179.4534
20000 20016 4 175.6538 177.1758
22000 22008 3 179.2238 179.5689
25000 25000 1 178.8775 179.0438
26000 26000 1 178.1740 178.4961
27000 27000 1 176.3011 176.3011
30000 30000 1 182.1086 182.1086
35000 35000 1 178.6459 178.6459
40000 40000 1 180.3609 180.3609
45000 45000 1 180.1492 180.1492
Residual checks PASSED
End of tests
Т.е. тут видно, что лэптоп класса рабочей станции выйдет на плато и там и будет сидеть. Частоту весь тест держал 3.5GHz стабильно.
Исправление aist1, :
Dell Precision 7540, Ubuntu 20.04 in Hyper-V (Windows 10 Pro), 64GB RAM.
Current date/time: Sun Aug 8 19:06:28 2021
CPU frequency: 4.174 GHz
Number of CPUs: 1
Number of cores: 6
Number of threads: 6
Parameters are set to:
Number of tests: 15
Number of equations to solve (problem size) : 1000 2000 5000 10000 15000 18000 20000 22000 25000 26000 27000 30000 35000 40000 45000
Leading dimension of array : 1000 2000 5008 10000 15000 18008 20016 22008 25000 26000 27000 30000 35000 40000 45000
Number of trials to run : 4 2 2 2 2 2 2 2 2 2 1 1 1 1 1
Data alignment value (in Kbytes) : 4 4 4 4 4 4 4 3 1 1 1 1 1 1 1
Maximum memory requested that can be used=16200901024, at the size=45000
=================== Timing linear equation system solver ===================
Size LDA Align. Time(s) GFlops Residual Residual(norm) Check
1000 1000 4 0.005 146.8603 9.394430e-13 3.203742e-02 pass
1000 1000 4 0.004 148.9902 9.394430e-13 3.203742e-02 pass
1000 1000 4 0.004 164.2513 9.394430e-13 3.203742e-02 pass
1000 1000 4 0.004 171.4927 9.394430e-13 3.203742e-02 pass
2000 2000 4 0.036 146.3781 3.842024e-12 3.342090e-02 pass
2000 2000 4 0.036 147.4116 3.842024e-12 3.342090e-02 pass
5000 5008 4 0.474 176.0769 2.313949e-11 3.226615e-02 pass
5000 5008 4 0.458 181.9024 2.313949e-11 3.226615e-02 pass
10000 10000 4 3.585 186.0133 9.955517e-11 3.510416e-02 pass
10000 10000 4 3.703 180.1103 9.955517e-11 3.510416e-02 pass
15000 15000 4 12.117 185.7226 2.246575e-10 3.538393e-02 pass
15000 15000 4 12.459 180.6297 2.246575e-10 3.538393e-02 pass
18000 18008 4 21.695 179.2436 2.518988e-10 2.758602e-02 pass
18000 18008 4 21.669 179.4534 2.518988e-10 2.758602e-02 pass
20000 20016 4 30.633 174.1317 3.520981e-10 3.116839e-02 pass
20000 20016 4 30.106 177.1758 3.520981e-10 3.116839e-02 pass
22000 22008 3 39.537 179.5689 4.260146e-10 3.120390e-02 pass
22000 22008 3 39.690 178.8786 4.260146e-10 3.120390e-02 pass
25000 25000 1 58.186 179.0438 5.639031e-10 3.206715e-02 pass
25000 25000 1 58.295 178.7112 5.639031e-10 3.206715e-02 pass
26000 26000 1 65.652 178.4961 6.647586e-10 3.495500e-02 pass
26000 26000 1 65.890 177.8518 6.647586e-10 3.495500e-02 pass
27000 27000 1 74.438 176.3011 6.293582e-10 3.069070e-02 pass
30000 30000 1 98.852 182.1086 8.721390e-10 3.437981e-02 pass
35000 35000 1 160.014 178.6459 1.021299e-09 2.964677e-02 pass
40000 40000 1 236.581 180.3609 1.302909e-09 2.897713e-02 pass
45000 45000 1 337.243 180.1492 1.909955e-09 3.360364e-02 pass
Performance Summary (GFlops)
Size LDA Align. Average Maximal
1000 1000 4 157.8986 171.4927
2000 2000 4 146.8949 147.4116
5000 5008 4 178.9897 181.9024
10000 10000 4 183.0618 186.0133
15000 15000 4 183.1762 185.7226
18000 18008 4 179.3485 179.4534
20000 20016 4 175.6538 177.1758
22000 22008 3 179.2238 179.5689
25000 25000 1 178.8775 179.0438
26000 26000 1 178.1740 178.4961
27000 27000 1 176.3011 176.3011
30000 30000 1 182.1086 182.1086
35000 35000 1 178.6459 178.6459
40000 40000 1 180.3609 180.3609
45000 45000 1 180.1492 180.1492
Residual checks PASSED
End of tests
Т.е. тут видно, что лэптоп класса рабочей станции выйдет на плато и там и будет сидеть.
Исходная версия aist1, :
Dell Precision 7540, Ubuntu 20.04 in Hyper-V (Windows 10 Pro), 64GB RAM.
Current date/time: Sun Aug 8 19:06:28 2021
CPU frequency: 4.174 GHz
Number of CPUs: 1
Number of cores: 6
Number of threads: 6
Parameters are set to:
Number of tests: 15
Number of equations to solve (problem size) : 1000 2000 5000 10000 15000 18000 20000 22000 25000 26000 27000 30000 35000 40000 45000
Leading dimension of array : 1000 2000 5008 10000 15000 18008 20016 22008 25000 26000 27000 30000 35000 40000 45000
Number of trials to run : 4 2 2 2 2 2 2 2 2 2 1 1 1 1 1
Data alignment value (in Kbytes) : 4 4 4 4 4 4 4 3 1 1 1 1 1 1 1
Maximum memory requested that can be used=16200901024, at the size=45000
=================== Timing linear equation system solver ===================
Size LDA Align. Time(s) GFlops Residual Residual(norm) Check
1000 1000 4 0.005 146.8603 9.394430e-13 3.203742e-02 pass
1000 1000 4 0.004 148.9902 9.394430e-13 3.203742e-02 pass
1000 1000 4 0.004 164.2513 9.394430e-13 3.203742e-02 pass
1000 1000 4 0.004 171.4927 9.394430e-13 3.203742e-02 pass
2000 2000 4 0.036 146.3781 3.842024e-12 3.342090e-02 pass
2000 2000 4 0.036 147.4116 3.842024e-12 3.342090e-02 pass
5000 5008 4 0.474 176.0769 2.313949e-11 3.226615e-02 pass
5000 5008 4 0.458 181.9024 2.313949e-11 3.226615e-02 pass
10000 10000 4 3.585 186.0133 9.955517e-11 3.510416e-02 pass
10000 10000 4 3.703 180.1103 9.955517e-11 3.510416e-02 pass
15000 15000 4 12.117 185.7226 2.246575e-10 3.538393e-02 pass
15000 15000 4 12.459 180.6297 2.246575e-10 3.538393e-02 pass
18000 18008 4 21.695 179.2436 2.518988e-10 2.758602e-02 pass
18000 18008 4 21.669 179.4534 2.518988e-10 2.758602e-02 pass
20000 20016 4 30.633 174.1317 3.520981e-10 3.116839e-02 pass
20000 20016 4 30.106 177.1758 3.520981e-10 3.116839e-02 pass
22000 22008 3 39.537 179.5689 4.260146e-10 3.120390e-02 pass
22000 22008 3 39.690 178.8786 4.260146e-10 3.120390e-02 pass
25000 25000 1 58.186 179.0438 5.639031e-10 3.206715e-02 pass
25000 25000 1 58.295 178.7112 5.639031e-10 3.206715e-02 pass
26000 26000 1 65.652 178.4961 6.647586e-10 3.495500e-02 pass
26000 26000 1 65.890 177.8518 6.647586e-10 3.495500e-02 pass
27000 27000 1 74.438 176.3011 6.293582e-10 3.069070e-02 pass
30000 30000 1 98.852 182.1086 8.721390e-10 3.437981e-02 pass
35000 35000 1 160.014 178.6459 1.021299e-09 2.964677e-02 pass
40000 40000 1 236.581 180.3609 1.302909e-09 2.897713e-02 pass
45000 45000 1 337.243 180.1492 1.909955e-09 3.360364e-02 pass
Performance Summary (GFlops)
Size LDA Align. Average Maximal
1000 1000 4 157.8986 171.4927
2000 2000 4 146.8949 147.4116
5000 5008 4 178.9897 181.9024
10000 10000 4 183.0618 186.0133
15000 15000 4 183.1762 185.7226
18000 18008 4 179.3485 179.4534
20000 20016 4 175.6538 177.1758
22000 22008 3 179.2238 179.5689
25000 25000 1 178.8775 179.0438
26000 26000 1 178.1740 178.4961
27000 27000 1 176.3011 176.3011
30000 30000 1 182.1086 182.1086
35000 35000 1 178.6459 178.6459
40000 40000 1 180.3609 180.3609
45000 45000 1 180.1492 180.1492
Residual checks PASSED
End of tests