LINUX.ORG.RU

История изменений

Исправление aist1, (текущая версия) :

Dell Precision 7540, Ubuntu 20.04 in Hyper-V (Windows 10 Pro), 64GB RAM. i7-9850H.

Current date/time: Sun Aug  8 19:06:28 2021

CPU frequency:    4.174 GHz
Number of CPUs: 1
Number of cores: 6
Number of threads: 6

Parameters are set to:

Number of tests: 15
Number of equations to solve (problem size) : 1000  2000  5000  10000 15000 18000 20000 22000 25000 26000 27000 30000 35000 40000 45000
Leading dimension of array                  : 1000  2000  5008  10000 15000 18008 20016 22008 25000 26000 27000 30000 35000 40000 45000
Number of trials to run                     : 4     2     2     2     2     2     2     2     2     2     1     1     1     1     1    
Data alignment value (in Kbytes)            : 4     4     4     4     4     4     4     3     1     1     1     1     1     1     1    

Maximum memory requested that can be used=16200901024, at the size=45000

=================== Timing linear equation system solver ===================

Size   LDA    Align. Time(s)    GFlops   Residual     Residual(norm) Check
1000   1000   4      0.005      146.8603 9.394430e-13 3.203742e-02   pass
1000   1000   4      0.004      148.9902 9.394430e-13 3.203742e-02   pass
1000   1000   4      0.004      164.2513 9.394430e-13 3.203742e-02   pass
1000   1000   4      0.004      171.4927 9.394430e-13 3.203742e-02   pass
2000   2000   4      0.036      146.3781 3.842024e-12 3.342090e-02   pass
2000   2000   4      0.036      147.4116 3.842024e-12 3.342090e-02   pass
5000   5008   4      0.474      176.0769 2.313949e-11 3.226615e-02   pass
5000   5008   4      0.458      181.9024 2.313949e-11 3.226615e-02   pass
10000  10000  4      3.585      186.0133 9.955517e-11 3.510416e-02   pass
10000  10000  4      3.703      180.1103 9.955517e-11 3.510416e-02   pass
15000  15000  4      12.117     185.7226 2.246575e-10 3.538393e-02   pass
15000  15000  4      12.459     180.6297 2.246575e-10 3.538393e-02   pass
18000  18008  4      21.695     179.2436 2.518988e-10 2.758602e-02   pass
18000  18008  4      21.669     179.4534 2.518988e-10 2.758602e-02   pass
20000  20016  4      30.633     174.1317 3.520981e-10 3.116839e-02   pass
20000  20016  4      30.106     177.1758 3.520981e-10 3.116839e-02   pass
22000  22008  3      39.537     179.5689 4.260146e-10 3.120390e-02   pass
22000  22008  3      39.690     178.8786 4.260146e-10 3.120390e-02   pass
25000  25000  1      58.186     179.0438 5.639031e-10 3.206715e-02   pass
25000  25000  1      58.295     178.7112 5.639031e-10 3.206715e-02   pass
26000  26000  1      65.652     178.4961 6.647586e-10 3.495500e-02   pass
26000  26000  1      65.890     177.8518 6.647586e-10 3.495500e-02   pass
27000  27000  1      74.438     176.3011 6.293582e-10 3.069070e-02   pass
30000  30000  1      98.852     182.1086 8.721390e-10 3.437981e-02   pass
35000  35000  1      160.014    178.6459 1.021299e-09 2.964677e-02   pass
40000  40000  1      236.581    180.3609 1.302909e-09 2.897713e-02   pass
45000  45000  1      337.243    180.1492 1.909955e-09 3.360364e-02   pass

Performance Summary (GFlops)

Size   LDA    Align.  Average  Maximal
1000   1000   4       157.8986 171.4927
2000   2000   4       146.8949 147.4116
5000   5008   4       178.9897 181.9024
10000  10000  4       183.0618 186.0133
15000  15000  4       183.1762 185.7226
18000  18008  4       179.3485 179.4534
20000  20016  4       175.6538 177.1758
22000  22008  3       179.2238 179.5689
25000  25000  1       178.8775 179.0438
26000  26000  1       178.1740 178.4961
27000  27000  1       176.3011 176.3011
30000  30000  1       182.1086 182.1086
35000  35000  1       178.6459 178.6459
40000  40000  1       180.3609 180.3609
45000  45000  1       180.1492 180.1492

Residual checks PASSED

End of tests

Т.е. тут видно, что лэптоп класса рабочей станции выйдет на плато и там и будет сидеть. Частоту весь тест держал 3.5GHz стабильно.

Исправление aist1, :

Dell Precision 7540, Ubuntu 20.04 in Hyper-V (Windows 10 Pro), 64GB RAM.

Current date/time: Sun Aug  8 19:06:28 2021

CPU frequency:    4.174 GHz
Number of CPUs: 1
Number of cores: 6
Number of threads: 6

Parameters are set to:

Number of tests: 15
Number of equations to solve (problem size) : 1000  2000  5000  10000 15000 18000 20000 22000 25000 26000 27000 30000 35000 40000 45000
Leading dimension of array                  : 1000  2000  5008  10000 15000 18008 20016 22008 25000 26000 27000 30000 35000 40000 45000
Number of trials to run                     : 4     2     2     2     2     2     2     2     2     2     1     1     1     1     1    
Data alignment value (in Kbytes)            : 4     4     4     4     4     4     4     3     1     1     1     1     1     1     1    

Maximum memory requested that can be used=16200901024, at the size=45000

=================== Timing linear equation system solver ===================

Size   LDA    Align. Time(s)    GFlops   Residual     Residual(norm) Check
1000   1000   4      0.005      146.8603 9.394430e-13 3.203742e-02   pass
1000   1000   4      0.004      148.9902 9.394430e-13 3.203742e-02   pass
1000   1000   4      0.004      164.2513 9.394430e-13 3.203742e-02   pass
1000   1000   4      0.004      171.4927 9.394430e-13 3.203742e-02   pass
2000   2000   4      0.036      146.3781 3.842024e-12 3.342090e-02   pass
2000   2000   4      0.036      147.4116 3.842024e-12 3.342090e-02   pass
5000   5008   4      0.474      176.0769 2.313949e-11 3.226615e-02   pass
5000   5008   4      0.458      181.9024 2.313949e-11 3.226615e-02   pass
10000  10000  4      3.585      186.0133 9.955517e-11 3.510416e-02   pass
10000  10000  4      3.703      180.1103 9.955517e-11 3.510416e-02   pass
15000  15000  4      12.117     185.7226 2.246575e-10 3.538393e-02   pass
15000  15000  4      12.459     180.6297 2.246575e-10 3.538393e-02   pass
18000  18008  4      21.695     179.2436 2.518988e-10 2.758602e-02   pass
18000  18008  4      21.669     179.4534 2.518988e-10 2.758602e-02   pass
20000  20016  4      30.633     174.1317 3.520981e-10 3.116839e-02   pass
20000  20016  4      30.106     177.1758 3.520981e-10 3.116839e-02   pass
22000  22008  3      39.537     179.5689 4.260146e-10 3.120390e-02   pass
22000  22008  3      39.690     178.8786 4.260146e-10 3.120390e-02   pass
25000  25000  1      58.186     179.0438 5.639031e-10 3.206715e-02   pass
25000  25000  1      58.295     178.7112 5.639031e-10 3.206715e-02   pass
26000  26000  1      65.652     178.4961 6.647586e-10 3.495500e-02   pass
26000  26000  1      65.890     177.8518 6.647586e-10 3.495500e-02   pass
27000  27000  1      74.438     176.3011 6.293582e-10 3.069070e-02   pass
30000  30000  1      98.852     182.1086 8.721390e-10 3.437981e-02   pass
35000  35000  1      160.014    178.6459 1.021299e-09 2.964677e-02   pass
40000  40000  1      236.581    180.3609 1.302909e-09 2.897713e-02   pass
45000  45000  1      337.243    180.1492 1.909955e-09 3.360364e-02   pass

Performance Summary (GFlops)

Size   LDA    Align.  Average  Maximal
1000   1000   4       157.8986 171.4927
2000   2000   4       146.8949 147.4116
5000   5008   4       178.9897 181.9024
10000  10000  4       183.0618 186.0133
15000  15000  4       183.1762 185.7226
18000  18008  4       179.3485 179.4534
20000  20016  4       175.6538 177.1758
22000  22008  3       179.2238 179.5689
25000  25000  1       178.8775 179.0438
26000  26000  1       178.1740 178.4961
27000  27000  1       176.3011 176.3011
30000  30000  1       182.1086 182.1086
35000  35000  1       178.6459 178.6459
40000  40000  1       180.3609 180.3609
45000  45000  1       180.1492 180.1492

Residual checks PASSED

End of tests

Т.е. тут видно, что лэптоп класса рабочей станции выйдет на плато и там и будет сидеть. Частоту весь тест держал 3.5GHz стабильно.

Исправление aist1, :

Dell Precision 7540, Ubuntu 20.04 in Hyper-V (Windows 10 Pro), 64GB RAM.

Current date/time: Sun Aug  8 19:06:28 2021

CPU frequency:    4.174 GHz
Number of CPUs: 1
Number of cores: 6
Number of threads: 6

Parameters are set to:

Number of tests: 15
Number of equations to solve (problem size) : 1000  2000  5000  10000 15000 18000 20000 22000 25000 26000 27000 30000 35000 40000 45000
Leading dimension of array                  : 1000  2000  5008  10000 15000 18008 20016 22008 25000 26000 27000 30000 35000 40000 45000
Number of trials to run                     : 4     2     2     2     2     2     2     2     2     2     1     1     1     1     1    
Data alignment value (in Kbytes)            : 4     4     4     4     4     4     4     3     1     1     1     1     1     1     1    

Maximum memory requested that can be used=16200901024, at the size=45000

=================== Timing linear equation system solver ===================

Size   LDA    Align. Time(s)    GFlops   Residual     Residual(norm) Check
1000   1000   4      0.005      146.8603 9.394430e-13 3.203742e-02   pass
1000   1000   4      0.004      148.9902 9.394430e-13 3.203742e-02   pass
1000   1000   4      0.004      164.2513 9.394430e-13 3.203742e-02   pass
1000   1000   4      0.004      171.4927 9.394430e-13 3.203742e-02   pass
2000   2000   4      0.036      146.3781 3.842024e-12 3.342090e-02   pass
2000   2000   4      0.036      147.4116 3.842024e-12 3.342090e-02   pass
5000   5008   4      0.474      176.0769 2.313949e-11 3.226615e-02   pass
5000   5008   4      0.458      181.9024 2.313949e-11 3.226615e-02   pass
10000  10000  4      3.585      186.0133 9.955517e-11 3.510416e-02   pass
10000  10000  4      3.703      180.1103 9.955517e-11 3.510416e-02   pass
15000  15000  4      12.117     185.7226 2.246575e-10 3.538393e-02   pass
15000  15000  4      12.459     180.6297 2.246575e-10 3.538393e-02   pass
18000  18008  4      21.695     179.2436 2.518988e-10 2.758602e-02   pass
18000  18008  4      21.669     179.4534 2.518988e-10 2.758602e-02   pass
20000  20016  4      30.633     174.1317 3.520981e-10 3.116839e-02   pass
20000  20016  4      30.106     177.1758 3.520981e-10 3.116839e-02   pass
22000  22008  3      39.537     179.5689 4.260146e-10 3.120390e-02   pass
22000  22008  3      39.690     178.8786 4.260146e-10 3.120390e-02   pass
25000  25000  1      58.186     179.0438 5.639031e-10 3.206715e-02   pass
25000  25000  1      58.295     178.7112 5.639031e-10 3.206715e-02   pass
26000  26000  1      65.652     178.4961 6.647586e-10 3.495500e-02   pass
26000  26000  1      65.890     177.8518 6.647586e-10 3.495500e-02   pass
27000  27000  1      74.438     176.3011 6.293582e-10 3.069070e-02   pass
30000  30000  1      98.852     182.1086 8.721390e-10 3.437981e-02   pass
35000  35000  1      160.014    178.6459 1.021299e-09 2.964677e-02   pass
40000  40000  1      236.581    180.3609 1.302909e-09 2.897713e-02   pass
45000  45000  1      337.243    180.1492 1.909955e-09 3.360364e-02   pass

Performance Summary (GFlops)

Size   LDA    Align.  Average  Maximal
1000   1000   4       157.8986 171.4927
2000   2000   4       146.8949 147.4116
5000   5008   4       178.9897 181.9024
10000  10000  4       183.0618 186.0133
15000  15000  4       183.1762 185.7226
18000  18008  4       179.3485 179.4534
20000  20016  4       175.6538 177.1758
22000  22008  3       179.2238 179.5689
25000  25000  1       178.8775 179.0438
26000  26000  1       178.1740 178.4961
27000  27000  1       176.3011 176.3011
30000  30000  1       182.1086 182.1086
35000  35000  1       178.6459 178.6459
40000  40000  1       180.3609 180.3609
45000  45000  1       180.1492 180.1492

Residual checks PASSED

End of tests

Т.е. тут видно, что лэптоп класса рабочей станции выйдет на плато и там и будет сидеть.

Исходная версия aist1, :

Dell Precision 7540, Ubuntu 20.04 in Hyper-V (Windows 10 Pro), 64GB RAM.

Current date/time: Sun Aug  8 19:06:28 2021

CPU frequency:    4.174 GHz
Number of CPUs: 1
Number of cores: 6
Number of threads: 6

Parameters are set to:

Number of tests: 15
Number of equations to solve (problem size) : 1000  2000  5000  10000 15000 18000 20000 22000 25000 26000 27000 30000 35000 40000 45000
Leading dimension of array                  : 1000  2000  5008  10000 15000 18008 20016 22008 25000 26000 27000 30000 35000 40000 45000
Number of trials to run                     : 4     2     2     2     2     2     2     2     2     2     1     1     1     1     1    
Data alignment value (in Kbytes)            : 4     4     4     4     4     4     4     3     1     1     1     1     1     1     1    

Maximum memory requested that can be used=16200901024, at the size=45000

=================== Timing linear equation system solver ===================

Size   LDA    Align. Time(s)    GFlops   Residual     Residual(norm) Check
1000   1000   4      0.005      146.8603 9.394430e-13 3.203742e-02   pass
1000   1000   4      0.004      148.9902 9.394430e-13 3.203742e-02   pass
1000   1000   4      0.004      164.2513 9.394430e-13 3.203742e-02   pass
1000   1000   4      0.004      171.4927 9.394430e-13 3.203742e-02   pass
2000   2000   4      0.036      146.3781 3.842024e-12 3.342090e-02   pass
2000   2000   4      0.036      147.4116 3.842024e-12 3.342090e-02   pass
5000   5008   4      0.474      176.0769 2.313949e-11 3.226615e-02   pass
5000   5008   4      0.458      181.9024 2.313949e-11 3.226615e-02   pass
10000  10000  4      3.585      186.0133 9.955517e-11 3.510416e-02   pass
10000  10000  4      3.703      180.1103 9.955517e-11 3.510416e-02   pass
15000  15000  4      12.117     185.7226 2.246575e-10 3.538393e-02   pass
15000  15000  4      12.459     180.6297 2.246575e-10 3.538393e-02   pass
18000  18008  4      21.695     179.2436 2.518988e-10 2.758602e-02   pass
18000  18008  4      21.669     179.4534 2.518988e-10 2.758602e-02   pass
20000  20016  4      30.633     174.1317 3.520981e-10 3.116839e-02   pass
20000  20016  4      30.106     177.1758 3.520981e-10 3.116839e-02   pass
22000  22008  3      39.537     179.5689 4.260146e-10 3.120390e-02   pass
22000  22008  3      39.690     178.8786 4.260146e-10 3.120390e-02   pass
25000  25000  1      58.186     179.0438 5.639031e-10 3.206715e-02   pass
25000  25000  1      58.295     178.7112 5.639031e-10 3.206715e-02   pass
26000  26000  1      65.652     178.4961 6.647586e-10 3.495500e-02   pass
26000  26000  1      65.890     177.8518 6.647586e-10 3.495500e-02   pass
27000  27000  1      74.438     176.3011 6.293582e-10 3.069070e-02   pass
30000  30000  1      98.852     182.1086 8.721390e-10 3.437981e-02   pass
35000  35000  1      160.014    178.6459 1.021299e-09 2.964677e-02   pass
40000  40000  1      236.581    180.3609 1.302909e-09 2.897713e-02   pass
45000  45000  1      337.243    180.1492 1.909955e-09 3.360364e-02   pass

Performance Summary (GFlops)

Size   LDA    Align.  Average  Maximal
1000   1000   4       157.8986 171.4927
2000   2000   4       146.8949 147.4116
5000   5008   4       178.9897 181.9024
10000  10000  4       183.0618 186.0133
15000  15000  4       183.1762 185.7226
18000  18008  4       179.3485 179.4534
20000  20016  4       175.6538 177.1758
22000  22008  3       179.2238 179.5689
25000  25000  1       178.8775 179.0438
26000  26000  1       178.1740 178.4961
27000  27000  1       176.3011 176.3011
30000  30000  1       182.1086 182.1086
35000  35000  1       178.6459 178.6459
40000  40000  1       180.3609 180.3609
45000  45000  1       180.1492 180.1492

Residual checks PASSED

End of tests