Update results for 64C rome with larger working set. Authored by Jan Eitzinger
@@ -15,7 +15,7 @@
| Compiler | AMD clang |
|----------|-------------------------------------------------------------------|
| Version | AMD clang version 10.0.0 (CLANG: AOCC_2.2.0-Build#93 2020_06_25) |
+----------+-------------------------------------------------------------------+
|----------|-------------------------------------------------------------------|
```
Optimizing flags: ```-Ofast -fnt-store=aggressive -std=c99 -fopenmp```
@@ -26,33 +26,33 @@ All results are in ```GB/s```.
Summary results:
```
+------------------------------------------------+
| Single core | 35.83 (Triad) |
| Memory domain | 43.15 (Sum with 16 cores) |
| Socket | 182.28 (Update with 16 cores) |
| Node | 414.18 (Update with 16 cores) |
+------------------------------------------------+
+-----------------------------------------------+
| Single core | 34.28 (STriad) |
| Memory domain | 43.30 (Sum with 16 cores) |
| Socket | 171.57 (Sum with 16 cores) |
| Node | 338.88 (Update with 16 cores)|
+-----------------------------------------------+
```
Results for scaling within a memory domain:
```
#nt Init Sum Copy Update Triad Daxpy STriad SDaxpy
1 23.42 8.17 33.14 32.21 35.83 35.13 35.21 35.34
2 23.43 16.24 40.58 41.31 40.87 39.69 40.03 39.25
3 23.44 24.04 40.64 39.47 39.31 37.95 38.23 37.44
4 23.42 31.52 40.02 38.78 38.53 37.03 37.45 36.51
5 23.41 36.46 39.65 38.67 38.16 36.84 37.18 36.27
6 23.42 35.78 39.13 38.34 37.49 36.12 36.57 35.62
7 23.44 35.96 38.59 37.69 36.85 35.48 36.00 35.02
8 23.45 36.72 38.08 37.15 36.35 35.00 35.53 34.56
9 25.88 39.38 39.13 38.56 37.67 36.54 36.83 36.05
10 28.14 40.35 39.57 39.16 38.53 37.63 37.75 37.12
11 30.25 40.67 40.00 39.74 39.17 38.49 38.39 37.95
12 32.27 41.30 40.30 40.24 39.72 39.20 38.94 38.68
13 34.16 41.96 40.62 40.96 40.21 39.90 39.52 39.37
14 35.96 42.56 40.78 41.29 40.59 40.42 39.93 39.89
15 37.53 43.02 40.78 41.49 40.81 40.77 40.19 40.20
16 39.00 43.15 40.69 41.70 40.89 40.90 40.22 40.38
1 23.42 8.16 33.47 32.14 33.56 33.27 34.28 33.88
2 23.42 16.13 40.56 41.25 40.53 39.68 40.00 39.35
3 23.44 24.04 40.67 39.39 39.37 37.92 38.23 37.40
4 23.42 31.51 40.00 38.61 38.42 36.90 37.32 36.36
5 23.43 36.40 39.69 38.42 38.14 36.75 37.11 36.15
6 23.44 35.69 39.11 38.05 37.44 36.10 36.51 35.56
7 23.44 36.00 38.59 37.44 36.83 35.47 35.96 35.00
8 23.45 36.78 38.05 36.92 36.27 34.95 35.44 34.49
9 26.11 39.46 39.11 38.32 37.63 36.57 36.80 36.05
10 28.31 40.39 39.58 38.86 38.42 37.56 37.66 37.05
11 30.29 40.70 39.99 39.47 39.12 38.46 38.38 37.92
12 32.30 41.36 40.31 39.91 39.66 39.17 38.93 38.65
13 34.22 42.05 40.64 40.42 40.22 39.85 39.52 39.35
14 36.03 42.66 40.78 40.75 40.58 40.36 39.92 39.85
15 37.66 43.09 40.80 40.88 40.79 40.69 40.14 40.19
16 39.11 43.30 40.74 40.86 40.84 40.85 40.21 40.34
```
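The single-core and memory-domain entries in the summary can be cross-checked against the table above by taking the fastest kernel in a given row. The following is a minimal sketch, not part of the benchmark itself: the helper name `best_kernel` is hypothetical, and the two rows are copied from the updated results.

```python
# Kernel names in the order of the table header above.
KERNELS = ["Init", "Sum", "Copy", "Update", "Triad", "Daxpy", "STriad", "SDaxpy"]


def best_kernel(row):
    """Parse one '#nt v1 v2 ...' row; return (threads, kernel, GB/s) of the fastest kernel.

    Hypothetical helper for illustration only.
    """
    fields = row.split()
    threads = int(fields[0])
    values = [float(v) for v in fields[1:]]
    if len(values) != len(KERNELS):
        raise ValueError("expected %d bandwidth values, got %d" % (len(KERNELS), len(values)))
    # Index of the largest bandwidth value in this row.
    idx = max(range(len(values)), key=values.__getitem__)
    return threads, KERNELS[idx], values[idx]


# Rows for 1 and 16 threads, copied from the updated table.
single_core = best_kernel("1 23.42 8.16 33.47 32.14 33.56 33.27 34.28 33.88")
memory_domain = best_kernel("16 39.11 43.30 40.74 40.86 40.84 40.85 40.21 40.34")

print(single_core)    # (1, 'STriad', 34.28)
print(memory_domain)  # (16, 'Sum', 43.3)
```

Both values agree with the updated summary: 34.28 GB/s (STriad) for a single core and 43.30 GB/s (Sum with 16 cores) for one memory domain.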
......@@ -61,110 +61,110 @@ Results for scaling across memory domains. Shown are the results for the number
Init:
```
#nm 1 2 3 4 5 6 7 8
1 23.42 46.80 70.17 93.39 116.72 139.90 163.25 186.51
2 23.43 46.81 70.20 93.40 116.91 140.07 163.24 186.93
3 23.44 46.83 70.24 93.48 116.78 140.02 163.63 186.35
4 23.42 46.83 70.14 93.64 116.82 140.02 163.03 186.49
5 23.41 46.86 70.25 93.63 116.89 140.05 163.55 186.24
6 23.42 46.86 70.19 93.65 116.96 140.53 163.64 186.85
7 23.44 46.86 70.30 93.67 117.12 140.58 163.98 186.95
8 23.45 46.89 70.32 93.76 117.13 140.42 163.94 187.27
9 25.88 51.73 77.62 103.46 129.34 155.20 180.96 206.69
10 28.14 56.28 84.35 112.41 140.65 168.66 196.57 224.67
11 30.25 60.49 90.74 120.87 151.30 181.45 211.47 241.65
12 32.27 64.49 96.82 129.12 161.07 193.26 225.11 257.60
13 34.16 68.39 102.58 136.61 170.89 204.79 239.30 273.49
14 35.96 71.92 107.87 143.59 179.81 215.61 250.92 287.76
15 37.53 75.10 112.48 150.01 187.76 225.06 263.02 300.74
16 39.00 78.07 117.07 155.87 194.85 234.03 272.51 312.25
1 23.42 46.80 70.20 93.61 116.81 140.21 163.33 186.71
2 23.42 46.82 70.17 93.62 116.99 140.37 163.66 186.86
3 23.44 46.85 70.21 93.68 117.03 140.48 163.60 186.99
4 23.42 46.83 70.12 93.63 117.03 140.22 163.42 186.97
5 23.43 46.86 70.24 93.67 117.07 140.45 163.61 187.13
6 23.44 46.86 70.26 93.72 117.12 140.45 163.88 187.24
7 23.44 46.88 70.30 93.74 117.16 140.60 163.96 187.38
8 23.45 46.89 70.33 93.76 117.19 140.57 163.94 187.42
9 26.11 52.20 77.61 103.47 129.32 155.15 180.97 206.74
10 28.31 56.28 84.40 112.49 140.60 168.64 196.69 224.91
11 30.29 60.55 90.76 121.05 151.34 181.42 211.60 241.80
12 32.30 64.48 96.75 128.97 161.31 193.35 225.73 257.81
13 34.22 68.35 102.53 136.58 170.71 204.93 239.13 273.26
14 36.03 71.95 107.94 144.08 179.85 216.01 251.73 287.91
15 37.66 75.21 112.84 150.47 188.07 225.62 263.23 300.96
16 39.11 77.91 116.95 156.07 195.04 234.14 272.70 311.39
```
Sum:
```
#nm 1 2 3 4 5 6 7 8
1 8.17 16.28 24.26 32.33 40.33 48.46 56.48 64.87
2 16.24 32.45 48.33 64.36 80.45 96.45 112.24 128.16
3 24.04 48.10 72.43 96.67 119.72 144.96 167.81 192.66
4 31.52 62.56 94.35 125.68 86.72 186.48 218.07 252.19
5 36.46 73.01 109.20 146.27 182.43 218.32 254.82 292.77
6 35.78 71.55 107.27 142.58 128.45 213.63 246.21 284.28
7 35.96 71.96 107.48 143.37 179.03 212.19 248.61 287.04
8 36.72 73.41 109.78 146.40 182.22 218.01 255.47 291.35
9 39.38 78.64 117.82 156.32 195.02 234.33 273.41 311.02
10 40.35 80.60 120.55 160.54 200.38 239.11 278.60 319.35
11 40.67 81.10 121.70 162.13 201.35 241.95 279.86 322.24
12 41.30 82.52 123.36 164.17 204.71 245.30 285.33 326.99
13 41.96 83.88 125.24 166.52 208.33 249.05 289.74 330.07
14 42.56 84.96 127.11 168.45 211.13 252.30 291.75 334.65
15 43.02 85.76 127.99 170.28 212.49 253.65 294.62 336.58
16 43.15 86.15 128.82 170.71 212.87 254.60 296.24 337.88
1 8.16 16.18 24.17 32.29 40.25 48.36 56.48 64.51
2 16.13 32.23 48.33 64.57 80.47 96.39 112.42 128.20
3 24.04 48.08 72.15 96.03 119.97 144.10 167.85 191.76
4 31.51 62.51 94.47 125.98 157.17 188.51 219.76 250.79
5 36.40 72.95 109.28 145.62 181.83 218.18 253.92 290.27
6 35.69 71.35 107.10 142.49 178.26 213.63 248.47 283.31
7 36.00 71.93 107.82 143.69 179.91 214.79 250.68 285.62
8 36.78 73.53 110.07 146.76 182.97 219.33 255.07 290.99
9 39.46 78.91 118.11 157.37 196.08 234.54 272.76 310.78
10 40.39 80.79 120.94 161.00 201.01 240.46 279.21 318.95
11 40.70 81.38 121.87 162.07 202.32 242.50 281.34 320.99
12 41.36 82.62 123.74 164.71 205.44 245.52 285.50 325.30
13 42.05 83.95 125.76 167.21 208.10 248.89 289.02 329.41
14 42.66 85.17 127.55 169.72 211.13 252.78 293.64 333.37
15 43.09 85.93 128.61 171.14 213.21 255.18 295.19 336.38
16 43.30 86.33 129.25 171.57 213.80 255.15 296.23 336.86
```
Copy
```
#nm 1 2 3 4 5 6 7 8
1 33.14 66.22 100.61 132.80 165.62 198.30 232.71 263.91
2 40.58 80.93 121.07 161.23 201.26 241.39 280.37 322.12
3 40.64 81.29 121.72 162.27 202.93 243.46 283.27 323.31
4 40.02 79.96 119.80 159.40 197.78 238.99 278.07 316.26
5 39.65 79.32 118.89 158.29 198.21 238.02 278.26 317.75
6 39.13 78.24 117.07 156.25 195.10 233.99 272.62 310.97
7 38.59 77.18 115.70 153.83 192.31 229.83 268.74 305.98
8 38.08 76.07 114.03 152.01 189.34 227.66 265.25 302.02
9 39.13 78.23 117.47 156.45 195.88 235.18 274.36 312.44
10 39.57 79.13 118.75 158.06 197.35 236.92 277.64 316.05
11 40.00 79.99 119.85 159.56 199.29 238.81 279.20 317.96
12 40.30 80.56 120.77 160.86 200.73 240.13 280.81 318.94
13 40.62 81.31 122.19 163.75 205.94 247.65 289.42 331.40
14 40.78 81.48 122.10 162.53 203.86 245.19 288.91 335.03
15 40.78 81.54 122.24 162.58 203.24 244.12 286.56 331.70
16 40.69 81.32 121.87 162.54 202.72 243.18 285.64 329.20
1 33.47 65.72 98.72 134.01 166.29 197.55 228.82 259.89
2 40.56 81.02 121.53 161.79 202.20 242.45 283.13 322.62
3 40.67 81.24 122.03 162.61 203.09 243.69 284.12 324.14
4 40.00 79.92 119.84 159.79 199.55 239.05 278.52 317.78
5 39.69 79.30 119.09 158.51 197.86 237.57 276.90 315.90
6 39.11 78.11 117.28 156.32 195.17 234.26 273.25 311.62
7 38.59 77.14 115.71 154.25 192.60 231.05 269.28 307.35
8 38.05 76.15 114.09 152.09 189.82 227.73 265.38 303.09
9 39.11 78.22 117.37 156.33 195.20 233.89 272.85 311.33
10 39.58 79.16 118.62 158.18 197.61 236.59 275.47 314.54
11 39.99 79.95 119.95 159.70 199.52 239.03 278.54 318.17
12 40.31 80.54 120.89 161.01 200.86 240.71 280.51 319.87
13 40.64 81.24 121.81 162.35 202.66 242.91 283.15 322.65
14 40.78 81.53 122.24 162.75 203.25 243.50 283.65 323.49
15 40.80 81.56 122.30 162.90 203.41 243.66 284.17 323.90
16 40.74 81.38 122.09 162.56 203.08 243.23 283.37 323.38
```
Update
```
#nm 1 2 3 4 5 6 7 8
1 32.21 64.50 96.82 129.53 162.49 193.57 224.44 259.21
2 41.31 82.63 124.20 165.50 206.95 248.68 289.79 332.64
3 39.47 79.20 119.26 159.33 200.45 241.61 280.88 323.21
4 38.78 77.86 117.09 156.39 195.74 235.17 275.95 315.99
5 38.67 78.07 118.46 158.73 200.38 242.20 281.01 327.63
6 38.34 77.22 116.79 157.32 198.43 239.64 282.76 326.79
7 37.69 76.09 115.36 154.97 195.94 237.15 279.56 322.33
8 37.15 74.95 113.25 152.32 192.08 232.98 273.96 315.53
9 38.56 77.88 117.97 158.68 200.08 242.66 284.03 328.92
10 39.16 79.03 119.90 161.39 204.38 248.19 292.73 338.33
11 39.74 80.39 121.76 164.08 207.84 251.65 297.63 343.84
12 40.24 81.28 123.19 166.43 210.07 254.30 300.00 346.20
13 40.96 83.53 127.59 173.09 219.23 265.50 314.49 364.42
14 41.29 84.26 128.40 174.20 222.00 272.58 325.20 374.63
15 41.49 84.46 128.84 175.51 225.82 277.54 335.09 395.36
16 41.70 85.97 132.67 182.28 235.50 290.26 348.92 414.18
1 32.14 64.33 95.57 128.60 160.72 190.89 222.81 256.62
2 41.25 82.58 123.52 164.81 205.70 247.46 289.00 329.62
3 39.39 78.76 118.38 158.02 197.89 237.18 276.75 316.25
4 38.61 77.26 116.14 154.66 193.32 232.05 270.46 308.92
5 38.42 76.98 115.74 154.28 193.22 232.44 271.33 310.38
6 38.05 76.15 114.57 152.86 191.43 230.39 268.81 308.06
7 37.44 74.94 112.73 150.53 188.30 226.36 264.44 303.16
8 36.92 73.96 111.04 148.40 185.76 223.08 260.63 298.21
9 38.32 76.75 115.41 153.85 192.52 231.71 270.41 309.85
10 38.86 77.88 117.03 156.31 195.53 235.15 274.93 315.11
11 39.47 79.05 118.84 158.64 198.61 238.89 279.14 319.47
12 39.91 79.93 120.17 160.56 201.14 241.60 282.40 323.30
13 40.42 81.06 121.99 163.16 204.68 246.34 288.20 330.82
14 40.75 81.69 122.97 164.49 206.24 248.34 290.81 333.33
15 40.88 82.01 123.41 165.13 207.02 249.33 291.69 334.77
16 40.86 81.96 123.73 165.81 208.43 251.34 294.79 338.88
```
Triad
```
#nm 1 2 3 4 5 6 7 8
1 35.83 71.78 107.20 142.32 177.47 213.48 247.02 283.48
2 40.87 81.62 122.39 163.07 203.44 243.03 283.41 323.40
3 39.31 78.59 118.07 157.10 196.21 235.74 273.83 313.68
4 38.53 76.79 115.30 153.26 191.35 229.25 266.80 304.19
5 38.16 76.21 114.58 152.45 190.61 228.50 266.43 304.31
6 37.49 74.84 112.32 149.64 187.13 224.39 261.21 298.23
7 36.85 73.67 110.37 147.05 183.74 220.05 256.51 292.53
8 36.35 72.67 108.75 145.25 181.45 217.76 253.14 288.85
9 37.67 75.36 113.14 150.45 188.15 225.74 262.60 299.94
10 38.53 76.91 115.50 154.15 192.40 230.79 269.53 307.39
11 39.17 78.37 117.71 156.74 196.09 235.00 273.80 312.87
12 39.72 79.33 119.20 159.03 198.52 238.05 277.01 316.48
13 40.21 80.50 120.74 160.92 200.90 241.13 280.83 321.22
14 40.59 81.17 121.82 162.15 202.54 242.53 282.45 323.20
15 40.81 81.66 122.33 163.02 204.04 244.11 284.20 324.86
16 40.89 81.67 122.67 163.15 203.70 244.54 284.19 325.05
1 33.56 68.24 101.69 135.64 169.67 203.60 238.03 270.19
2 40.53 80.99 121.39 161.72 202.04 242.32 282.53 322.45
3 39.37 78.66 118.11 157.44 196.61 235.50 274.99 314.15
4 38.42 76.91 115.20 153.27 191.32 229.47 267.65 305.44
5 38.14 76.31 114.41 152.41 190.44 228.38 266.20 303.83
6 37.44 74.83 112.27 149.65 186.89 224.32 261.48 298.76
7 36.83 73.65 110.44 147.10 183.69 220.34 256.95 292.91
8 36.27 72.65 108.85 145.22 181.10 217.42 253.03 288.64
9 37.63 75.31 113.03 150.47 187.96 225.38 262.41 299.74
10 38.42 76.86 115.21 153.61 191.78 229.86 267.91 305.70
11 39.12 78.22 117.32 156.31 195.31 233.93 272.55 311.10
12 39.66 79.29 118.90 158.45 197.68 236.93 275.77 314.88
13 40.22 80.43 120.51 160.61 200.47 240.20 279.94 319.04
14 40.58 81.10 121.57 161.97 202.00 242.27 282.02 321.13
15 40.79 81.57 122.30 162.85 202.97 243.38 283.19 323.15
16 40.84 81.68 122.42 162.89 203.14 243.63 283.35 323.26
```
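One way to read the tables above is as scaling efficiency across memory domains, that is, measured bandwidth divided by the single-domain bandwidth times the domain count. The sketch below makes two assumptions that the truncated table description does not confirm: the first field of each row is the thread count per domain, and the eight columns are 1 to 8 memory domains. The helper name `scaling_efficiency` is hypothetical; the row is the 16-thread Triad line from the updated results.

```python
def scaling_efficiency(row):
    """Return bw(n) / (n * bw(1)) for n = 1..8 memory domains.

    Hypothetical helper; assumes the first field is the thread count per
    domain and the remaining fields are bandwidths in GB/s for 1..8 domains.
    """
    fields = row.split()
    values = [float(v) for v in fields[1:]]
    base = values[0]  # bandwidth of a single memory domain
    return [bw / (n * base) for n, bw in enumerate(values, start=1)]


# 16-thread Triad row, copied from the updated table.
triad_16 = "16 40.84 81.68 122.42 162.89 203.14 243.63 283.35 323.26"

for domains, eff in enumerate(scaling_efficiency(triad_16), start=1):
    print(f"{domains} domains: efficiency {eff:.3f}")
```

A value close to 1.0 in every column would indicate that bandwidth grows almost linearly with the number of memory domains for that kernel and thread count.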