Raspberry Pi 400 PC 32 Bit and 64 Bit Benchmarks and Stress TestsRoy Longbottom
The benchmark uses small to rather excessive simple objects to measure drawing performance in Frames Per Second (FPS). Five tests draw on a background of continuously changing colour shades, each test adding to the load.
|
********************* Pi 4B 1500 MHz 32 bits ********************
Java Drawing Benchmark, May 15 2019, 18:55:41
Produced by OpenJDK 11 javac
Test Frames FPS
Display PNG Bitmap Twice Pass 1 877 87.65
Display PNG Bitmap Twice Pass 2 1042 104.18
Plus 2 SweepGradient Circles 1015 101.47
Plus 200 Random Small Circles 779 77.85
Plus 320 Long Lines 336 33.52
Plus 4000 Random Small Circles 83 8.25
Total Elapsed Time 60.1 seconds
Operating System Linux, Arch. arm, Version 4.19.37-v7l+
Java Vendor BellSoft, Version 11.0.2-BellSoft
******************** Pi 400 1800 MHz 32 bits ********************
Java Drawing Benchmark, Jul 30 2020, 12:01:08
Produced by javac 1.7.0_02
Test Frames FPS
Display PNG Bitmap Twice Pass 1 904 90.36
Display PNG Bitmap Twice Pass 2 1038 103.79
Plus 2 SweepGradient Circles 1019 101.84
Plus 200 Random Small Circles 855 85.41
Plus 320 Long Lines 391 39.08
Plus 4000 Random Small Circles 102 10.11
Total Elapsed Time 60.1 seconds
Operating System Linux, Arch. arm, Version 5.4.51-v7l+
Java Vendor Raspbian, Version 11.0.8
******************* Dual Monitor + TV Involved *****************
Display PNG Bitmap Twice Pass 1 698 69.75
Display PNG Bitmap Twice Pass 2 909 90.84
Plus 2 SweepGradient Circles 918 91.78
** 32 bit Pi 400 after enabling experimental desktop GL driver **
Java Drawing Benchmark, Jul 31 2020, 10:08:07
Produced by javac 1.6.0_27
Test Frames FPS
Display PNG Bitmap Twice Pass 1 1164 116.33
Display PNG Bitmap Twice Pass 2 1346 134.49
Plus 2 SweepGradient Circles 1317 131.62
Plus 200 Random Small Circles 976 97.53
Plus 320 Long Lines 402 40.12
Plus 4000 Random Small Circles 103 10.27
Total Elapsed Time 60.1 seconds
Operating System Linux, Arch. arm, Version 5.4.51-v7l+
Java Vendor Raspbian, Version 11.0.8
|
******************** Pi 400 1800 MHz 64 bits ********************
Java Drawing Benchmark, Aug 26 2020, 19:38:46
Produced by javac 1.8.0_222
Test Frames FPS
Display PNG Bitmap Twice Pass 1 860 85.92
Display PNG Bitmap Twice Pass 2 957 95.68
Plus 2 SweepGradient Circles 1002 100.18
Plus 200 Random Small Circles 843 84.24
Plus 320 Long Lines 402 40.12
Plus 4000 Random Small Circles 99 9.86
Total Elapsed Time 60.1 seconds
Operating System Linux, Arch. aarch64, Version 5.4.51-v8+
Java Vendor Debian, Version 11.0.8
** 64 bit Pi 400 after enabling experimental desktop GL driver **
Java Drawing Benchmark, Aug 26 2020, 20:09:05
Produced by javac 1.8.0_222
Test Frames FPS
Display PNG Bitmap Twice Pass 1 800 79.94
Display PNG Bitmap Twice Pass 2 966 96.51
Plus 2 SweepGradient Circles 999 99.81
Plus 200 Random Small Circles 864 86.30
Plus 320 Long Lines 409 40.83
Plus 4000 Random Small Circles 109 10.86
Total Elapsed Time 60.1 seconds
Operating System Linux, Arch. aarch64, Version 5.4.51-v8+
Java Vendor Debian, Version 11.0.8
******************* Dual Monitor + TV Involved *****************
Dual Monitor Part on monitor and part on TV
Display PNG Bitmap Twice Pass 1 748 74.72
Display PNG Bitmap Twice Pass 2 872 87.15
Plus 2 SweepGradient Circles 914 91.37
|
The benchmark measures graphics speed in terms of Frames Per Second (FPS) via six simple and more complex tests. The first four tests portray moving up and down a tunnel including various independently moving objects, with and without texturing. The last two tests, represent a real application for designing kitchens. The first is in wireframe format, drawn with 23,000 straight lines. The second has colours and textures applied to the surfaces.
As a benchmark, it was run using the following script file, the first command needed to avoid VSYNC, allowing FPS to be greater than 60.
export vblank_mode=0 ./videogl32 Width 320, Height 240, NoEnd ./videogl32 Width 640, Height 480, NoHeading, NoEnd ./videogl32 Width 1024, Height 768, NoHeading, NoEnd ./videogl32 Width 1920, Height 1080, NoHeading
The first Pi 400 results indicated that performance was slower on all tests, excluding those for the kitchen displays, the latter being more CPU speed limited, providing hoped for 20% performance improvement. Then, I remembered using an experimental desktop GL driver, enabled via sudo raspi-config. This was used on the Pi 400, where G3 GL OpenGL desktop driver with full KMS was selected. This produced the same or better Pi 400 performance than the Pi 4B.
As indicated below, the dual monitor connections enabled this option to be tested, the default for monitor full screen pixel settings across both monitors being applied, 2 x 1920 wide in this case.
********************* Pi 4B 1500 MHz 32 bits ********************
GLUT OpenGL Benchmark 32 Bit Version 1, Thu May 2 19:01:05 2019
Running Time Approximately 5 Seconds Each Test
Window Size Coloured Objects Textured Objects WireFrm Texture
Pixels Few All Few All Kitchen Kitchen
Wide High FPS FPS FPS FPS FPS FPS
320 240 766.7 371.4 230.6 130.2 32.5 22.7
640 480 427.3 276.5 206.0 121.8 31.7 22.2
1024 768 193.1 178.8 150.5 110.4 31.9 21.5
1920 1080 81.4 79.4 74.6 68.3 30.8 20.0
******************** Pi 400 1800 MHz 32 bits ********************
GLUT OpenGL Benchmark 32 Bit Version 1, Thu Jul 30 12:31:31 2020
Running Time Approximately 5 Seconds Each Test
Window Size Coloured Objects Textured Objects WireFrm Texture
Pixels Few All Few All Kitchen Kitchen
Wide High FPS FPS FPS FPS FPS FPS
320 240 688.1 405.2 223.1 138.2 42.8 29.0
640 480 319.4 281.4 200.1 126.8 41.4 27.8
1024 768 140.3 134.5 113.9 103.0 40.2 27.1
1920 1080 57.7 56.3 53.5 49.6 37.4 24.0
******************* Pi 400 New Driver 32 bits ******************
GLUT OpenGL Benchmark 32 Bit Version 1, Thu Jul 30 13:59:55 2020
Running Time Approximately 5 Seconds Each Test
Window Size Coloured Objects Textured Objects WireFrm Texture
Pixels Few All Few All Kitchen Kitchen
Wide High FPS FPS FPS FPS FPS FPS
320 240 823.6 435.1 244.5 140.7 42.5 28.7
640 480 427.8 310.0 219.6 134.3 42.1 28.3
1024 768 192.3 181.9 149.9 116.3 40.9 27.0
1920 1080 81.7 79.0 73.7 67.4 38.1 24.5
****************** Pi 400 Dual Monitor 32 bits ******************
3840 1080 27.0 26.6 26.3 25.1 27.3 19.3
|
The dual monitor test results were similar to those at 32 bits.
******************** Pi 400 1800 MHz 64 bits ********************
GLUT OpenGL Benchmark 64 Bit gcc 9, Wed Aug 26 19:53:43 2020
Running Time Approximately 5 Seconds Each Test
Window Size Coloured Objects Textured Objects WireFrm Texture
Pixels Few All Few All Kitchen Kitchen
Wide High FPS FPS FPS FPS FPS FPS
160 120 334.3 162.1 173.9 90.0 27.1 23.7
320 240 220.5 131.9 128.7 74.1 25.0 21.5
640 480 109.4 81.0 80.6 55.7 22.2 17.9
1024 768 57.5 47.5 45.4 34.2 18.2 13.4
1920 1080 27.0 24.3 22.0 18.9 14.3 8.4
******************* Pi 400 New Driver 64 bits ******************
GLUT OpenGL Benchmark 64 Bit gcc 9, Wed Aug 26 20:03:54 2020
Running Time Approximately 5 Seconds Each Test
Window Size Coloured Objects Textured Objects WireFrm Texture
Pixels Few All Few All Kitchen Kitchen
Wide High FPS FPS FPS FPS FPS FPS
160 120 783.4 446.7 286.4 170.7 50.9 35.5
320 240 659.3 406.0 265.5 160.8 51.9 35.1
640 480 319.2 276.9 229.0 144.2 47.5 32.7
1024 768 140.2 134.4 122.4 113.2 48.1 32.5
1920 1080 57.8 56.5 55.6 52.4 46.7 29.8
****************** Pi 400 Dual Monitor 64 bits ******************
3840 1080 27.2 26.6 27.0 26.0 27.5 21.4
|
The default 64 bit benchmark produced similar performance as the 32 bit version. However, a major advantage of the former, is its ability to handle much larger files, as illustrated below at 3 and 6 GB.
******************** Pi 4B 1500 MHz 32 bits ******************
MBytes/Second
MB Write1 Write2 Write3 Read1 Read2 Read3
8 67.82 12.97 90.19 99.84 93.49 96.83
16 92.25 92.66 92.96 103.9 105.28 91.17
Random Read Write
From MB 4 8 16 4 8 16
msecs 0.007 0.01 0.04 1.01 0.85 0.91
200 Files Write Read Delete
File KB 4 8 16 4 8 16 secs
MB/sec 1.47 2.80 5.14 2.47 4.71 8.61
ms/file 2.78 2.92 3.19 1.66 1.74 1.90 0.256
Large File Write MBytes/Second Read MBytes/Second
1 GB 96.13 93.34 94.98 114.51 112.16 114.91
2 GB Error writing file Segmentation fault
******************* Pi 400 1800 MHz 32 bits ******************
MBytes/Second
MB Write1 Write2 Write3 Read1 Read2 Read3
8 47.07 87.12 90.94 102.11 100.03 100.24
16 82.75 90.84 91.03 106.19 106.39 105.10
Random Read Write
From MB 4 8 16 4 8 16
msecs 0.007 0.02 0.43 0.98 0.90 0.89
200 Files Write Read Delete
File KB 4 8 16 4 8 16 secs
MB/sec 1.35 2.62 5.04 2.21 4.10 6.88
ms/file 3.03 3.12 3.25 1.85 2.00 2.38 0.184
Large File Write MBytes/Second Read MBytes/Second
1 GB 109.69 111.03 107.39 112.28 112.72 112.02
2 GB Error writing file Segmentation fault
******************* Pi 400 1800 MHz 64 bits ******************
MBytes/Second
MB Write1 Write2 Write3 Read1 Read2 Read3
8 46.59 89.13 93.19 103.35 73.78 65.73
16 65.89 96.57 67.83 90.43 105.20 105.43
Random Read Write
From MB 4 8 16 4 8 16
msecs 0.004 0.017 0.397 1.09 1.02 1.05
200 Files Write Read Delete
File KB 4 8 16 4 8 16 secs
MB/sec 1.36 2.64 5.11 1.95 3.33 8.55
ms/file 3.01 3.11 3.21 2.10 2.46 1.92 0.194
Large File Write MBytes/Second Read MBytes/Second
3 GB 114.00 114.11 114.93 112.31 114.79 116.96
6 GB 92.46 92.06 114.06 115.22 115.57 113.66
|
Results at 64 bits were similar to those at 32 bits and it took many more attempts to run at 5 GHz.
**************** Pi 4B 1500 MHz 2.4 GHz 32 bit **************
MBytes/Second
MB Write1 Write2 Write3 Read1 Read2 Read3
8 6.35 6.33 6.38 7.05 6.98 7.10
16 6.70 6.82 6.76 7.19 6.53 7.22
Random Read Write
From MB 4 8 16 4 8 16
msecs 2.691 2.875 3.048 3.13 2.93 2.84
200 Files Write Read Delete
File KB 4 8 16 4 8 16 secs
MB/sec 0.34 0.44 1.04 0.37 0.37 1.26
ms/file 12.14 18.59 15.7 11.1 22.2 12.99 2.153
***************** Pi 4B 1500 MHz 5 GHz 32 bit ***************
MBytes/Second
MB Write1 Write2 Write3 Read1 Read2 Read3
8 11.90 12.96 13.16 10.11 9.55 9.66
16 11.50 13.93 14.13 9.91 8.88 9.92
200 Files Write Read Delete
File KB 4 8 16 4 8 16 secs
MB/sec 0.13 0.46 0.91 0.25 0.55 1.02
ms/file 30.85 17.83 18.10 16.62 14.93 16.01 3.361
Random similar to 2.4 GHz
*************** Pi 400 1800 MHz 2.4 GHz 32 bit **************
MBytes/Second
MB Write1 Write2 Write3 Read1 Read2 Read3
8 2.02 6.08 6.59 6.91 5.82 7.01
16 6.78 6.64 6.70 7.04 6.05 6.36
Random Read Write
From MB 4 8 16 4 8 16
msecs 3.234 3.354 3.637 4.12 3.72 3.72
200 Files Write Read Delete
File KB 4 8 16 4 8 16 secs
MB/sec 0.36 0.61 1.07 0.46 0.85 1.55
ms/file 11.50 13.37 15.34 8.88 9.59 10.55 2.924
**************** Pi 400 1800 MHz 5 GHz 32 bit ***************
MBytes/Second
MB Write1 Write2 Write3 Read1 Read2 Read3
8 2.85 9.75 9.82 4.03 4.20 4.14
16 11.42 10.20 10.14 4.18 4.17 4.16
Random Read Write
From MB 4 8 16 4 8 16
msecs 3.006 3.206 3.276 3.55 3.29 3.28
200 Files Write Read Delete
File KB 4 8 16 4 8 16 secs
MB/sec 0.42 0.50 0.34 0.48 0.88 1.44
ms/file 9.72 16.44 48.26 8.61 9.30 11.39 2.812
|
*************** Pi 400 1800 MHz 2.4 GHz 64 bit **************
MBytes/Second
MB Write1 Write2 Write3 Read1 Read2 Read3
8 5.93 5.91 5.98 6.79 5.75 6.62
16 6.51 3.23 6.61 6.08 5.72 6.19
Random Read Write
From MB 4 8 16 4 8 16
msecs 3.240 3.720 3.651 4.14 3.92 4.16
200 Files Write Read Delete
File KB 4 8 16 4 8 16 secs
MB/sec 0.32 0.58 1.00 0.42 0.79 1.44
ms/file 12.92 14.14 16.39 9.80 10.42 11.36 1.335
*************** Pi 400 1800 MHz 5 GHz 64 bit ****************
MBytes/Second
MB Write1 Write2 Write3 Read1 Read2 Read3
8 11.55 12.00 12.24 4.16 4.32 4.28
16 12.21 12.41 12.34 4.13 4.28 4.24
Random Read Write
From MB 4 8 16 4 8 16
msecs 2.738 2.882 2.967 3.10 2.87 2.89
200 Files Write Read Delete
File KB 4 8 16 4 8 16 secs
MB/sec 0.49 0.43 0.64 0.54 0.92 1.54
ms/file 8.42 19.06 25.54 7.65 8.87 10.66 1.009
|
As indicated later, the USB 3 drives produced higher data transfer speeds than the 32 GB SD card, but were slower on booting, as shown in the following early life measurements, that could change. Part of the reason for slow booting is explained in an initial display, indicating that an SD card cannot be found and later, apparently searching for a bootable device.
Seconds Initial Total To Reboot To Drive Display Desktop Desktop 32 bit 8 GB SD card N/A 46 68 32 bit 32 GB SD card N/A 22 26 32 bit 32 GB SD USB Reader 7 31 30 32 bit 64 GB USB Stick 25 46 29 32 bit 64 GB HD Partition 29 63 64 64 bit 32 GB SD card N/A 22 26 64 bit 32 GB SD USB Reader 7 25 28
Random Access - The measured access times can vary widely, with the reasons for differences difficult to identify. Traditionally, hard disk drive times would normally be greater than half the revolution time, 5.5 ms, in this case. Then this Toshiba Canvio is said to have a 8 MB buffer, indicating that most accesses could be to the buffer, at bus speeds.
200 Small Files - Under EXT4 format, hard drive performance was indicated as being far superior, with the 8 GB SD card worst. Hard drive performance at FAT32 was exceptionally bad, with sector size of 32 KB, when each of the 200 files were that size. Pi 400 and Pi 4B performance was essentially the same running all these tests (those identical results were double checked).
Large Files MBytes/Second
MB Write1 Write2 Write3 Read1 Read2 Read3
8 GB SD Card 16 7.44 5.23 6.13 22.88 22.71 22.12
32 GB SD Card 16 19.02 17.56 17.39 44.71 43.50 44.84
64 GB USB Drive 16 74.42 77.55 76.80 129.86 130.65 129.75
64 GB USB Drive 500 30.92 23.74 29.67 132.11 131.10 132.16
64 GB USB Drive 2000 28.78 28.77 29.45 131.87 132.27 132.33
64 GB HD Partition 16 55.80 81.05 52.98 134.06 142.09 143.91
64 GB HD Partition 2000 149.83 148.52 146.76 151.64 151.99 150.15
64 GB HD Pi 4B USB 2000 147.17 146.79 146.45 148.38 151.14 97.80
Same HD Pi400 FAT 2000 83.27 82.66 83.22 143.79 144.02 144.06
Same HD Pi400 EXT4 2000 125.74 123.60 120.72 130.20 128.47 124.88
Same HD Pi4B FAT 2000 68.10 66.83 67.67 148.63 148.69 149.25
Same HD Pi4B EXT4 2000 125.36 118.11 122.68 130.29 127.10 128.45
Random Read Write
From MB 4 8 16 4 8 16
8 GB SD Card msecs 0.436 0.417 0.406 2.86 2.87 79.52
32 GB SD Card msecs 0.250 0.249 0.279 1.61 1.50 1.55
64 GB USB Drive msecs 0.671 0.675 0.671 2.14 2.20 2.18
64 GB HD Partition msecs 0.170 0.647 0.426 5.18 11.79 11.13
64 GB HD Pi 4B USB msecs 0.976 0.356 0.367 0.68 0.64 0.68
Same HD Pi400 FAT msecs 0.169 0.170 0.170 0.66 0.63 0.70
Same HD Pi400 EXT4 msecs 0.436 0.486 0.314 0.71 0.64 0.70
Same HD Pi4B FAT msecs 0.573 0.515 0.368 0.63 0.58 0.65
Same HD Pi4B EXT4 msecs 1.087 0.391 0.286 0.68 0.63 0.68
200 Small Files Write Read
File KB 4 8 16 4 8 16
8 GB SD Card MB/sec 0.42 2.59 2.61 5.63 8.95 12.15
32 GB SD Card MB/sec 2.57 5.10 5.59 9.08 12.42 20.69
64 GB USB Drive MB/sec 1.95 2.55 4.58 7.33 11.85 21.22
64 GB HD Partition MB/sec 4.20 16.53 13.64 13.32 20.21 50.28
64 GB HD Pi 4B USB MB/sec 8.58 20.83 35.28 20.83 36.94 61.32
Same HD Pi400 FAT MB/sec 0.04 0.07 0.15 0.37 0.73 1.46
Same HD Pi400 EXT4 MB/sec 8.15 15.02 20.04 8.86 12.86 34.40
Same HD Pi4B FAT MB/sec 0.04 0.07 0.15 0.37 0.73 1.46
Same HD Pi4B EXT4 MB/sec 9.90 15.22 14.05 13.42 7.95 19.51
|
Large Files - Compared with 32 bit operation and using the appropriate formatting, performance was similar using Ext4 partitions, but much larger files could be handled at 64 bits. At FAT32, files of twice the size could be dealt with, but performance on writing was much worse.
Random Access - All Pi 400 reading times do not represent drive hardware performance, accelerated by caching or HD buffering, but 32 bit reading was also faster than expectations. Writing times produced inexplicable variations.
Small Files - FAT32 performance was again particularly slow. Then, Ext4 reported speed via LanSpeed was accelerated by buffering.
Large Files MBytes/Second
MB Write1 Write2 Write3 Read1 Read2 Read3
HD Ext4 LanSpeed 4096 130.60 112.66 110.96 85.23 118.60 119.20
HD Ext4 LanSpeed 8192* 122.62 111.55 103.17 101.44 124.52 119.92
HD FAT32 LanSpeed 4096 Error writing file Segmentation fault
HD FAT32 LanSpeed 4000= 125.45 137.00 137.94 147.63 146.50 146.16
HD Ext4 DriveSpeed Error writing file Segmentation fault
HD FAT32 DriveSpeed 4096 Error writing file Segmentation fault
HD FAT32 DriveSpeed 4000= 20.50 20.56 12.53 143.32 146.59 146.32
SD Main LanSpeed 4096 21.34 18.22 17.40 34.78 45.86 45.33
SD Main Write/Read 4096# 18.73 18.87 18.83 45.96 46.01 46.04
SD Main Read only 4096 Memory allocation failed asked for 3 x 4096 MB
SD Main Read only 1333 Memory allocation failed asked for 3 x 1333 MB
SD Main Read only 1024 N/A N/A N/A 46.26 46.23 45.87
SD FAT32 LanSpeed 4096 Error writing file Segmentation fault
SD FAT32 LanSpeed 4000 20.14 20.12 20.09 95.33 95.21 95.32
SD FAT32 DriveSpeed 4096 Error writing file Segmentation fault
SD FAT32 DriveSpeed 4000 17.13 17.20 17.25 95.79 95.71 95.54
32 Bit From Above For Comparison
HD EXT4 DriveSpeed 2000* 125.74 123.60 120.72 130.20 128.47 124.88
HD FAT32 DriveSpeed 2000= 83.27 82.66 83.22 143.79 144.02 144.06
SD Main DriveSpeed 16# 19.02 17.56 17.39 44.71 43.50 44.84
Random Read Write
From MB 4 8 16 4 8 16
HD Ext4 LanSpeed msecs* 0.002 0.002 0.002 43.48 45.76 41.66
HD FAT32 LanSpeed msecs= 0.003 0.003 0.003 12.22 12.24 16.22
HD FAT32 DriveSpeed msecs= 0.003 0.003 0.004 12.68 12.37 12.26
SD Main LanSpeed msecs# 0.002 0.002 0.002 4.46 4.17 4.63
SD FAT32 LanSpeed msecs 0.003 0.003 0.003 6.05 5.87 6.05
SD FAT32 DriveSpeed msecs 0.004 0.004 0.010 2.97 2.55 2.42
32 Bit From Above For Comparison
HD EXT4 DriveSpeed msecs* 0.436 0.486 0.314 0.71 0.64 0.70
HD FAT32 DriveSpeed msecs= 0.169 0.170 0.170 0.66 0.63 0.70
SD Main DriveSpeed msecs# 0.250 0.249 0.279 1.61 1.50 1.55
200 Small Files Write Read
File KB 4 8 16 4 8 16
HD Ext4 LanSpeed MB/sec* 69.10 115.19 175.42 232.95 395.30 624.64
HD FAT32 LanSpeed MB/sec= 0.04 0.08 0.16 296.33 485.65 736.98
HD FAT32 DriveSpeed MB/sec= 0.04 0.07 0.15 292.47 40.44 391.07
SD Main LanSpeed MB/sec# 83.78 36.88 148.72 335.38 216.77 786.13
SD FAT32 LanSpeed MB/sec 0.04 0.08 0.15 306.56 493.11 730.44
SD FAT32 DriveSpeed MB/sec 0.04 0.08 0.15 299.67 130.72 34.53
32 Bit From Above For Comparison
HD EXT4 DriveSpeed MB/sec* 8.15 15.02 20.04 8.86 12.86 34.40
HD FAT32 DriveSpeed MB/sec= 0.04 0.07 0.15 0.37 0.73 1.46
SD Main DriveSpeed MB/sec# 2.57 5.10 5.59 9.08 12.42 20.69
|
Below is an example of the main output from the first set of tests on the Pi 400, followed by a summary of later results, comprising five runs over around 50 minutes at N = 20000. There are start and end overheads not reported in benchmark execution time. Performance is shown to be constant over this period. Then are details of VMSTAT system monitor results, showing use of 3.2 GB RAM and 100% CPU utilisation of four cores.
Then are details of VMSTAT system monitor results, showing use of 3.2 GB RAM and 100% CPU utilisation of four cores. This is followed by CPU and Power Management IC temperatures, during the five runs, nowhere near where CPU MHz throttling might be expected. Room temperature was 27°C and hot spot readings on the keyboard up to 36°C.
================================================================================ T/V N NB P Q Time Gflops -------------------------------------------------------------------------------- WR11C2R4 20000 128 2 2 451.75 1.181e+01 HPL_pdgesv() start time Thu Jul 23 21:28:59 2020 HPL_pdgesv() end time Thu Jul 23 21:36:31 2020 -------------------------------------------------------------------------------- ||Ax-b||_oo/(eps*(||A||_oo*||x||_oo+||b||_oo)*N)= 0.0010188 ...... PASSED ================================================================================ |
System Fan N GFLOPS Seconds Max °C Min MHz
Pi 4B No 16000 6.8 404 86 750/600
Yes 16000 10.4 263 70 1500
Later 4B No 16000 8.6 319 83 1000
Yes 16000 10.4 263 63 1500
Pi 400 16000 11.4 239 57 1800
Pi 4B No 20000 6.2 856 87 750/600
Yes 20000 10.8 494 71 1500
Later 4B No 20000 8.8 604 85 1000
Yes 20000 10.7 497 63 1500
Pi 400 20000 11.8 452 62 1800
|
Start Time N NB P Q Time Gflops SumCheck
Sep 8 12:07:45 20000 128 2 2 456.61 1.168e+01 0.0009306 .. PASSED
Sep 8 12:16:31 20000 128 2 2 459.68 1.160e+01 0.0009602 .. PASSED
Sep 8 12:25:20 20000 128 2 2 460.25 1.159e+01 0.0011412 .. PASSED
Sep 8 12:34:10 20000 128 2 2 454.22 1.174e+01 0.0009636 .. PASSED
Temperature and CPU MHz Measurement Start at Tue Sep 8 12:07:12 2020
Seconds
0.0 ARM MHz=1800, core volt=0.9500V, CPU temp=39.0'C, pmic temp=41.1'C
60.0 ARM MHz=1800, core volt=0.9500V, CPU temp=52.0'C, pmic temp=45.8'C
121.4 ARM MHz=1800, core volt=0.9500V, CPU temp=53.0'C, pmic temp=48.6'C
182.6 ARM MHz=1800, core volt=0.9500V, CPU temp=54.0'C, pmic temp=49.6'C
243.8 ARM MHz=1800, core volt=0.9500V, CPU temp=55.0'C, pmic temp=50.5'C
304.9 ARM MHz=1800, core volt=0.9500V, CPU temp=55.0'C, pmic temp=51.4'C
366.1 ARM MHz=1800, core volt=0.9500V, CPU temp=56.0'C, pmic temp=52.4'C
427.2 ARM MHz=1800, core volt=0.9500V, CPU temp=57.0'C, pmic temp=52.4'C
488.4 ARM MHz=1800, core volt=0.9500V, CPU temp=54.0'C, pmic temp=52.4'C
549.3 ARM MHz=1800, core volt=0.9500V, CPU temp=54.0'C, pmic temp=51.4'C
610.2 ARM MHz=1800, core volt=0.9500V, CPU temp=57.0'C, pmic temp=53.3'C
671.5 ARM MHz=1800, core volt=0.9500V, CPU temp=59.0'C, pmic temp=54.3'C
732.7 ARM MHz=1800, core volt=0.9500V, CPU temp=57.0'C, pmic temp=54.3'C
794.0 ARM MHz=1800, core volt=0.9500V, CPU temp=59.0'C, pmic temp=54.3'C
855.2 ARM MHz=1800, core volt=0.9500V, CPU temp=58.0'C, pmic temp=55.2'C
916.3 ARM MHz=1800, core volt=0.9500V, CPU temp=58.0'C, pmic temp=55.2'C
977.5 ARM MHz=1800, core volt=0.9500V, CPU temp=59.0'C, pmic temp=55.2'C
1038.7 ARM MHz=1800, core volt=0.9500V, CPU temp=56.0'C, pmic temp=54.3'C
1099.7 ARM MHz=1800, core volt=0.9500V, CPU temp=59.0'C, pmic temp=54.3'C
1160.8 ARM MHz=1800, core volt=0.9500V, CPU temp=61.0'C, pmic temp=55.2'C
1222.0 ARM MHz=1800, core volt=0.9500V, CPU temp=60.0'C, pmic temp=55.2'C
1283.1 ARM MHz=1800, core volt=0.9500V, CPU temp=60.0'C, pmic temp=56.2'C
1344.2 ARM MHz=1800, core volt=0.9500V, CPU temp=60.0'C, pmic temp=56.2'C
1405.4 ARM MHz=1800, core volt=0.9500V, CPU temp=60.0'C, pmic temp=57.1'C
1466.5 ARM MHz=1800, core volt=0.9500V, CPU temp=61.0'C, pmic temp=57.1'C
1527.8 ARM MHz=1800, core volt=0.9500V, CPU temp=59.0'C, pmic temp=56.2'C
1589.0 ARM MHz=1800, core volt=0.9500V, CPU temp=57.0'C, pmic temp=55.2'C
1649.9 ARM MHz=1800, core volt=0.9500V, CPU temp=60.0'C, pmic temp=55.2'C
1710.9 ARM MHz=1800, core volt=0.9500V, CPU temp=60.0'C, pmic temp=57.1'C
1772.1 ARM MHz=1800, core volt=0.9500V, CPU temp=61.0'C, pmic temp=57.1'C
1833.4 ARM MHz=1800, core volt=0.9500V, CPU temp=61.0'C, pmic temp=57.1'C
1894.5 ARM MHz=1800, core volt=0.9500V, CPU temp=62.0'C, pmic temp=57.1'C
1955.6 ARM MHz=1800, core volt=0.9500V, CPU temp=62.0'C, pmic temp=58.0'C
2016.7 ARM MHz=1800, core volt=0.9500V, CPU temp=62.0'C, pmic temp=58.0'C
2077.9 ARM MHz=1800, core volt=0.9500V, CPU temp=59.0'C, pmic temp=57.1'C
vmstat 60 seconds sampling
procs -----------memory---------- ---swap-- ----io---- --system- ------cpu------
r b swpd free buff cache si so bi bo in cs us sy id wa st
0 0 0 3436000 23368 250372 0 0 286 6 204 293 2 2 95 1 0
4 0 15616 266040 880 131528 0 259 109 263 1250 462 94 2 4 0 0
4 0 18688 258616 3928 136892 4 49 251 51 1114 119 100 0 0 0 0
4 0 21504 262664 4240 132936 0 46 13 49 1102 92 100 0 0 0 0
4 0 21504 262160 4264 132940 1 0 1 2 1097 84 100 0 0 0 0
4 0 21504 262932 4288 132932 0 0 0 2 1095 79 100 0 0 0 0
4 0 21504 262428 4316 132944 0 0 0 2 1102 85 100 0 0 0 0
4 0 21504 265200 4340 130812 1 0 1 2 1092 74 100 0 0 0 0
4 0 21504 264948 4708 132172 3 0 31 2 1099 93 100 0 0 0 0
4 0 21504 2423512 4852 135656 0 0 20 3 1105 100 99 1 0 0 0
4 0 21504 280360 4880 117156 1 0 3 3 1100 89 99 1 0 0 0
4 0 27904 294848 4908 105784 0 105 79 107 1115 110 100 0 0 0 0
4 0 27904 293336 4928 105984 0 0 3 2 1097 79 100 0 0 0 0
4 0 57600 301452 9764 120232 13 495 758 517 1458 809 99 1 0 0 0
4 0 73728 312128 9576 123948 25 283 257 305 1336 548 99 1 0 0 0
4 0 73728 311372 9740 124008 0 0 3 2 1099 86 100 0 0 0 0
4 0 73728 310868 9752 124016 1 0 1 2 1096 80 100 0 0 0 0
4 0 73728 309828 9764 124624 0 0 10 2 1098 85 100 0 0 0 0
4 0 73472 1445224 10136 127356 1 0 73 3 1118 128 98 2 0 0 0
4 0 73472 306776 10172 128404 1 0 1 4 1098 87 99 1 0 0 0
4 0 73472 306280 10196 128488 1 0 3 2 1166 219 100 0 0 0 0
5 0 73472 305920 10216 128516 0 0 0 2 1095 78 100 0 0 0 0
4 0 73472 305044 10244 128524 1 0 1 2 1100 90 100 0 0 0 0
4 0 73216 305412 10268 128620 0 0 2 2 1094 80 100 0 0 0 0
4 0 73216 305040 10292 128632 0 0 0 2 1091 75 100 0 0 0 0
4 0 72960 304916 10320 128640 0 0 0 2 1100 81 100 0 0 0 0
4 0 72960 302436 10348 131852 1 0 1 3 1096 80 100 0 0 0 0
4 0 72704 470192 10380 127388 0 0 0 3 1111 110 98 2 0 0 0
4 0 72704 306264 10412 128684 0 0 0 2 1126 146 100 0 0 0 0
4 0 72704 305768 10440 128696 1 0 1 3 1095 82 100 0 0 0 0
4 0 72704 305768 10464 128708 0 0 0 2 1095 79 100 0 0 0 0
4 0 72704 305876 10488 128752 0 0 1 2 1096 77 100 0 0 0 0
4 0 72704 305752 10516 128760 0 0 0 3 1094 77 100 0 0 0 0
4 0 72704 305504 10540 128768 1 0 1 2 1099 87 100 0 0 0 0
4 0 72704 305380 10568 128784 0 0 0 3 1090 74 100 0 0 0 0
4 0 72704 299800 10596 134108 5 0 19 2 1227 353 100 0 0 0 0
|
Floating Point - Below are Pi 4B single precision speeds in MFLOPS then sumchecks, followed by those for double precision working, then the same for Pi 400 and comparisons. The latter indicate a 20% Pi 400 performance gain for cache based tests and no difference on those that were RAM speed dependent The sumcheck comparisons show that the two system produced the same numeric results carrying out millions of calculations.
Integer - The test loop comprises 32 add or subtract instructions, operating on hexadecimal data patterns, with sequences of 8 subtracts then 8 adds to restore the original pattern. Performance is measured in MBytes per second. Results show the varying hexadecimal data patters used and compared verification. Comparative performance again show that the PI 400 was 20% faster on CPU speed dependent tasks and no different when reliant on RAM speed
32 Bit Single Precision Floating Point 32 Bit Double Precision Floating Point
------ MFLOPS ----- ----- Sumchecks --- ------ MFLOPS ----- ----- Sumchecks ---
* * *
-------------------------------------- Pi 4B -------------------------------------
Ops KB KB MB * KB KB MB * KB KB MB * KB KB MB
Thrds /Wd 12.8 128 12.8 * 12.8 128 12.8 " 12.8 128 12.8 * 12.8 128 12.8
T1 2 2603 2607 651 " 40392 76406 99700 * 992 990 317 * 40395 76384 99700
T2 2 5017 5138 645 * 40392 76406 99700 * 1940 1993 319 * 40395 76384 99700
T4 2 7045 9724 656 * 40392 76406 99700 * 3639 3925 329 * 40395 76384 99700
T8 2 8747 9690 633 * 40392 76406 99700 " 3690 3913 331 * 40395 76384 99700
T1 8 5542 5427 2479 * 54756 85091 99820 " 2390 2435 1266 * 54805 85108 99820
T2 8 10774 10716 2579 * 54756 85091 99820 * 4608 4853 1170 * 54805 85108 99820
T4 8 19196 20561 2595 * 54756 85091 99820 * 8902 9081 1165 * 54805 85108 99820
T8 8 18718 20629 2512 * 54756 85091 99820 * 8852 8971 1098 * 54805 85108 99820
T1 32 5307 5244 5217 * 35296 66020 99519 * 2703 2724 2672 * 35159 66065 99521
T2 32 10559 10521 9764 * 35296 66020 99519 * 5385 5442 5009 * 35159 66065 99521
T4 32 20070 20557 9864 * 35296 66020 99519 * 10582 10836 4824 * 35159 66065 99521
T8 32 19793 20919 9460 * 35296 66020 99519 * 10484 10749 4765 * 35159 66065 99521
------------------------------------- Pi 400 -------------------------------------
T1 2 3163 3129 646 * 40392 76406 99700 * 1192 1187 321 * 40395 76384 99700
T2 2 6145 6144 646 * 40392 76406 99700 * 2362 2392 324 * 40395 76384 99700
T4 2 8974 10119 655 * 40392 76406 99700 * 4155 4692 278 * 40395 76384 99700
T8 2 9584 11780 645 * 40392 76406 99700 * 4232 4730 272 * 40395 76384 99700
T1 8 6606 6514 2515 * 54756 85091 99820 * 2899 2931 1250 * 54805 85108 99820
T2 8 13028 12755 2831 * 54756 85091 99820 * 5643 5829 1128 * 54805 85108 99820
T4 8 22820 25005 2778 " 54756 85091 99820 * 10637 11351 1208 * 54805 85108 99820
T8 8 23260 24714 2345 * 54756 85091 99820 * 10850 10938 1217 * 54805 85108 99820
T1 32 6368 6327 6115 * 35296 66020 99519 * 3252 3257 3156 * 35159 66065 99521
T2 32 12643 12602 10838 * 35296 66020 99519 * 6484 6538 5455 * 35159 66065 99521
T4 32 24016 25146 10124 * 35296 66020 99519 * 12833 12791 4790 * 35159 66065 99521
T8 32 23811 24068 8760 * 35296 66020 99519 * 12093 12226 4463 * 35159 66065 99521
--------------------------------- Pi 400 / Pi 4B --------------------------------
L1 L2 RAM * ---- Sumchecks --- * L1 L2 RAM " ---- Sumchecks ---
T1 2 1.22 1.20 0.99 * 1.00 1.00 1.00 * 1.20 1.20 1.01 " 1.00 1.00 1.00
T2 2 1.22 1.20 1.00 * 1.00 1.00 1.00 * 1.22 1.20 1.02 * 1.00 1.00 1.00
T4 2 1.27 1.04 1.00 * 1.00 1.00 1.00 * 1.14 1.20 0.84 * 1.00 1.00 1.00
T8 2 1.10 1.22 1.02 * 1.00 1.00 1.00 * 1.15 1.21 0.82 * 1.00 1.00 1.00
T1 8 1.19 1.20 1.01 * 1.00 1.00 1.00 * 1.21 1.20 0.99 * 1.00 1.00 1.00
T2 8 1.21 1.19 1.10 * 1.00 1.00 1.00 * 1.22 1.20 0.96 * 1.00 1.00 1.00
T4 8 1.19 1.22 1.07 * 1.00 1.00 1.00 * 1.19 1.25 1.04 * 1.00 1.00 1.00
T8 8 1.24 1.20 0.93 * 1.00 1.00 1.00 * 1.23 1.22 1.11 * 1.00 1.00 1.00
T1 32 1.20 1.21 1.17 * 1.00 1.00 1.00 * 1.20 1.20 1.18 * 1.00 1.00 1.00
T2 32 1.20 1.20 1.11 * 1.00 1.00 1.00 * 1.20 1.20 1.09 * 1.00 1.00 1.00
T4 32 1.20 1.22 1.03 * 1.00 1.00 1.00 * 1.21 1.18 0.99 * 1.00 1.00 1.00
T8 32 1.20 1.15 0.93 * 1.00 1.00 1.00 * 1.15 1.14 0.94 * 1.00 1.00 1.00
---------------------------- 32 Bit Integers ---------------------------
Pi 4B MB/second Same Pi 400 MB/second Pi 400/Pi4B
KB KB MB All KB KB MB KB KB MB
Threads 16 160 16 Sumcheck Tests 16 160 16 16 160 16
1 5751 5755 3882 00000000 Yes 7062 6907 3825 1.23 1.20 0.99
2 11820 11302 3772 FFFFFFFF Yes 14215 13724 3736 1.20 1.21 0.99
4 22467 21906 3375 5A5A5A5A Yes 27026 26533 3397 1.20 1.21 1.01
8 22019 22094 3415 AAAAAAAA Yes 26959 25993 3419 1.22 1.18 1.00
16 22891 22448 3395 CCCCCCCC Yes 27424 27479 3413 1.20 1.22 1.01
32 22574 23412 3436 0F0F0F0F Yes 27143 27869 3458 1.20 1.19 1.01
|
64 Bit Single Precision Floating Point 64 Bit Double Precision Floating Point
------ MFLOPS ----- ----- Sumchecks --- ------ MFLOPS ----- ----- Sumchecks ---
* * *
Ops KB KB MB * KB KB MB * KB KB MB * KB KB MB
Thrds /Wd 12.8 128 12.8 * 12.8 128 12.8 " 12.8 128 12.8 * 12.8 128 12.8
T1 2 3114 4852 1191 * 40394 76395 99700 * 1822 2252 613 * 40395 76384 99700
T2 2 9362 9555 1236 * 40394 76395 99700 * 4190 4493 604 * 40395 76384 99700
T4 2 16966 15205 1119 * 40394 76395 99700 * 8082 8708 603 * 40395 76384 99700
T8 2 16096 17963 1027 * 40394 76395 99700 + 8275 7905 603 * 40395 76384 99700
T1 8 5645 5697 3695 * 54764 85092 99820 * 3342 3354 2190 * 54805 85108 99820
T2 8 11333 11335 4125 * 54764 85092 99820 * 6643 6718 2142 * 54805 85108 99820
T4 8 21208 22499 4151 * 54764 85092 99820 * 12734 13322 2058 * 54805 85108 99820
T8 8 21585 21456 4115 * 54764 85092 99820 * 12919 12523 2101 * 54805 85108 99820
T1 32 7025 7049 7006 * 35206 66015 99520 * 4002 4009 3961 * 35159 66065 99521
T2 32 14081 14047 13565 * 35206 66015 99520 * 7993 8016 7511 * 35159 66065 99521
T4 32 27027 28036 16116 * 35206 66015 99520 * 15462 15988 8132 * 35159 66065 99521
T8 32 26548 27040 16049 * 35206 66015 99520 * 15722 15825 8038 * 35159 66065 99521
------------------------------- Pi 400 64 Bit/32 Bit ------------------------------
L1 L2 RAM * ---- Sumchecks --- * L1 L2 RAM " ---- Sumchecks ---
T1 2 0.98 1.55 1.84 * 1.00 1.00 1.00 * 1.53 1.90 1.91 * 1.00 1.00 1.00
T2 2 1.52 1.56 1.91 * 1.00 1.00 1.00 * 1.77 1.88 1.86 * 1.00 1.00 1.00
T4 2 1.89 1.50 1.71 * 1.00 1.00 1.00 * 1.95 1.86 2.17 * 1.00 1.00 1.00
T8 2 1.68 1.52 1.59 * 1.00 1.00 1.00 * 1.96 1.67 2.22 * 1.00 1.00 1.00
T1 8 0.85 0.87 1.47 * 1.00 1.00 1.00 * 1.15 1.14 1.75 * 1.00 1.00 1.00
T2 8 0.87 0.89 1.46 * 1.00 1.00 1.00 * 1.18 1.15 1.90 * 1.00 1.00 1.00
T4 8 0.93 0.90 1.49 * 1.00 1.00 1.00 * 1.20 1.17 1.70 * 1.00 1.00 1.00
T8 8 0.93 0.87 1.75 * 1.00 1.00 1.00 * 1.19 1.14 1.73 * 1.00 1.00 1.00
T1 32 1.10 1.11 1.15 * 1.00 1.00 1.00 * 1.23 1.23 1.26 * 1.00 1.00 1.00
T2 32 1.11 1.11 1.25 * 1.00 1.00 1.00 * 1.23 1.23 1.38 * 1.00 1.00 1.00
T4 32 1.13 1.11 1.59 * 1.00 1.00 1.00 * 1.20 1.25 1.70 * 1.00 1.00 1.00
T8 32 1.11 1.12 1.83 * 1.00 1.00 1.00 * 1.30 1.29 1.80 * 1.00 1.00 1.00
------------------------------ Pi 400 Integers ----------------------------
gcc 8 MB/second Same Pi 400 64 Bit/32 Bit Version 1 MB/sec
KB KB MB All KB KB MB KB KB MB
Threads 16 160 16 Sumcheck Tests 16 160 16 16 160 16
1 3455 3481 3074 00000000 Yes 0.49 0.50 0.80 8774 8150 3772
2 7047 6975 3507 FFFFFFFF Yes 0.50 0.51 0.94 17241 15941 3687
4 13712 13977 3357 5A5A5A5A Yes 0.51 0.53 0.99 32768 29966 3339
8 13631 13696 3353 AAAAAAAA Yes 0.51 0.53 0.98 32845 33055 3366
16 13184 13906 3351 CCCCCCCC Yes 0.48 0.51 0.98 32959 34188 3364
32 12617 13960 3414 0F0F0F0F Yes 0.46 0.50 0.99 31531 33694 3388
|
./MP-FPUStress Threads tt, Minutes mm, KB kk, Ops 00, Log ll ./MP-FPUStressDP Threads tt, Minutes mm, KB kk, Ops 00, Log ll ./MP-IntStress Threads tt, Minutes mm, KB kk, Log ll ./RPiHeatMHzVolts2 Passes pp, Seconds ss, Log ll vmstat ss pp tt = Threads 1, 2, 4, 8, 16, 32, (64 FPU) mm = Minutes greater than 0 kk = KBytes 12 to 15624 oo = Operations Per Word 2, 8 or 32 ll = number added to log file name, 0 to 99 pp = Passes (at ss econd intervals) ss = Second intervals |
An extra test was carried out on the Pi 4B, with the fan disabled, demonstrating severe CPU MHz throttling, much worst performance and reflecting the Pi 400 advantage. Average temperature, over half an hour, was 84°C accompanied by a 42% reduction in performance.
The Pi 400, keyboard temperature was measured during the stress tests, reaching warm to touch 40°C, according to my infrared thermometer.
Pi 4B Pi 4B No Fan Pi 400 Fanless
Single Double Single Single Double
MFLOPS Avg 20896 10797 12151 25056 12953
MFLOPS Min 20541 10587 10946 24587 12754
MHx Avg 1500 1500 870 1800 1800
MHz Min 1500 1500 600 1800 1800
Volts 0.8600 0.8600 0.8600 0.9500 0.9500
Temperatures CPU PMIC CPU PMIC CPU PMIC CPU PMIC CPU PMIC
°C °C °C °C °C °C °C °C °C °C
Avg 66.6 49.5 60.3 47.1 84.0 71.2 61.8 57.7 61.1 59.5
Max 69.0 50.5 62.0 48.6 86.0 72.2 68.0 61.8 64.0 60.9
Minutes
0 43 38.2 44 41.1 61 57.1 38 43.9 48 53.3
1 63 45.8 59 43.9 82 65.6 56 49.6 60 57.1
2 66 48.6 61 46.7 83 68.4 57 51.4 59 58.0
3 67 49.6 61 47.7 84 70.3 58 53.3 61 59.0
4 67 49.6 62 46.7 85 70.3 59 54.3 62 59.0
5 67 49.6 61 47.7 84 70.3 61 55.2 61 59.0
6 68 49.6 61 47.7 85 71.2 61 55.2 62 59.0
7 67 49.6 62 47.7 85 72.2 62 57.1 63 59.0
8 69 49.6 61 47.7 85 72.2 62 57.1 54 58.0
9 69 49.6 62 47.7 86 72.2 63 58.0 61 59.0
10 68 49.6 62 47.7 85 72.2 63 58.0 62 59.0
11 67 49.6 61 46.7 86 72.2 64 59.0 62 59.0
12 67 49.6 61 46.7 86 72.2 65 59.0 62 59.0
13 68 50.5 59 46.7 86 72.2 53 55.2 62 59.0
14 68 50.5 60 46.7 86 72.2 64 59.0 63 59.0
15 68 50.5 60 46.7 85 72.2 61 57.1 64 60.9
16 69 50.5 59 46.7 85 72.2 64 59.0 64 60.9
17 69 50.5 60 46.7 85 72.2 66 59.0 63 60.9
18 68 49.6 60 46.7 86 72.2 66 59.9 61 60.9
19 68 49.6 61 47.7 85 72.2 66 60.9 64 60.9
20 69 50.5 61 47.7 85 72.2 66 60.9 56 59.0
21 68 50.5 62 47.7 86 72.2 66 60.9 63 59.0
22 68 50.5 61 47.7 85 72.2 66 60.9 62 60.9
23 68 50.5 62 47.7 85 72.2 67 60.9 64 60.9
24 67 50.5 62 48.6 85 72.2 67 61.8 63 60.9
25 68 50.5 62 48.6 84 72.2 54 58.0 63 60.9
26 68 50.5 61 47.7 86 72.2 67 60.9 63 60.9
27 68 50.5 62 47.7 85 72.2 66 61.8 64 60.9
28 69 50.5 61 47.7 86 72.2 68 61.8 63 60.9
29 68 50.5 60 47.7 85 72.2 68 61.8 63 60.9
30 57 50.5 59 47.7 78 72.2 51 57.1 53 59.0
|
With temperatures remaining relatively low, CPU MHz and measured performance were constant and a little faster than at 32 bits. Average Pi 400 64 bit double precision performance of 15.9 GFLOPS can be judged against 11.6 GFLOPS from High Performance Linpack.
10-Sep-20 09:45 10:18 10:49
Precision Single Single Double
MFLOPS Avg 28011 27995 15948
MFLOPS Min 26336 27019 15467
MHz Avg 1800 1800 1800
MHz Min 1800 1800 1800
Volts 0.9500 0.9500 0.9500
Temperatures CPU PMIC CPU PMIC CPU PMIC
°C °C °C °C °C °C
Avg 52.3 51.7 56.6 56.2 60.5 58.5
Max 57.0 55.2 59.0 58.0 63.0 60.9
Minutes
0 35 39.2 47 50.5 47 50.5
1 45 43.9 55 53.3 58 55.2
2 48 45.8 55 54.3 58 55.2
3 49 46.7 54 55.2 59 57.1
4 50 47.7 56 55.2 59 57.1
5 49 48.6 56 55.2 59 57.1
6 51 49.6 57 55.2 61 58.0
7 52 49.6 55 55.2 60 58.0
8 51 50.5 57 55.2 61 59.0
9 52 50.5 56 55.2 60 59.0
10 52 51.4 57 56.2 60 59.0
11 53 51.4 58 56.2 61 59.0
12 53 51.4 57 57.1 62 59.0
13 54 52.4 57 57.1 61 59.0
14 54 52.4 57 57.1 61 59.0
15 54 53.3 58 57.1 61 59.0
16 54 53.3 57 57.1 61 59.0
17 54 53.3 58 57.1 62 59.0
18 55 53.3 58 57.1 62 59.0
19 55 54.3 58 57.1 62 59.0
20 56 54.3 57 57.1 62 59.0
21 55 54.3 59 57.1 63 59.0
22 55 54.3 58 57.1 63 59.0
23 55 54.3 58 57.1 61 59.0
24 54 55.2 58 57.1 62 59.0
25 56 55.2 58 57.1 63 59.0
26 55 55.2 59 57.1 63 60.9
27 55 55.2 59 57.1 63 60.9
28 56 55.2 58 58.0 63 60.9
29 57 55.2 59 58.0 63 60.9
30 48 54.3 50 56.2 53 59.0
|
An additional test was carried out on the Pi 400 outside on a sheltered table, where the local temperature was initially 40°C, increasing to 44°C with the sun shining on part of the keyboard. The keyboard temperature increased to 51°C for the last minute of the test.
Over the testing time, maximum temperatures increased by around 7°C, not sufficient to invoke throttling and providing virtually the same performance as in the earlier test.
Pi 4B Fan Pi 4B No Fan Pi 400 Fanless Pi 400 Outside
MB/S Avg 22164 15736 26395 26215
MB/S Min 21472 13756 25541 25779
MHz Avg 1500 1053 1800 1800
MHz Min 1500 600 1800 1800
Volts 0.8600 0.8600 0.9500 0.9500
Temperatures CPU PMIC CPU PMIC CPU PMIC CPU PMIC
°C °C °C °C °C °C °C °C
Avg 61.5 47.6 82.7 69.9 62.1 59.8 65.1 63.5
Max 64.0 48.6 86.0 72.2 64.0 61.8 71.0 69.4
Minutes
0 45 41.1 60 55.2 48 53.3 45 49.6
1 59 43.9 78 62.8 58 55.2 56 54.3
2 63 46.7 82 66.5 62 57.1 58 56.2
3 63 48.6 83 69.4 60 59.0 60 58.0
4 63 48.6 83 70.3 62 59.0 62 59.0
5 63 48.6 83 70.3 62 59.0 63 60.9
6 63 47.7 84 70.3 62 59.0 63 61.8
7 64 47.7 84 70.3 63 59.0 63 61.8
8 62 47.7 83 70.3 61 59.0 64 62.8
9 60 46.7 84 70.3 64 59.0 65 62.8
10 62 47.7 83 70.3 63 59.0 65 62.8
11 62 47.7 83 70.3 62 59.0 64 62.8
12 63 48.6 83 70.3 63 59.0 66 62.8
13 62 48.6 83 70.3 63 60.9 67 62.8
14 63 48.6 83 70.3 63 59.0 68 64.6
15 64 48.6 83 70.3 62 59.0 68 64.6
16 63 48.6 83 70.3 63 60.9 67 65.6
17 61 47.7 83 70.3 63 60.9 68 65.6
18 63 47.7 84 71.2 62 60.9 67 65.6
19 62 47.7 83 70.3 64 60.9 68 65.6
20 63 47.7 83 70.3 64 60.9 67 65.6
21 63 48.6 84 70.3 61 60.9 69 66.5
22 63 48.6 84 70.3 63 60.9 67 65.6
23 64 47.7 85 72.2 63 60.9 67 65.6
24 62 48.6 84 72.2 63 61.8 68 66.5
25 61 47.7 86 72.2 63 60.9 69 66.5
26 62 47.7 86 72.2 64 61.8 70 67.5
27 61 47.7 84 72.2 63 61.8 70 67.5
28 62 47.7 86 72.2 62 60.9 71 68.4
29 62 48.6 84 72.2 64 61.8 71 69.4
30 54 47.7 84 72.2 64 61.8 62 68.4
|
Again, temperatures were low and performance constant, within normal variations.
Memory KB 16 256 3500000 vmstat Memory
3500000 KB
MB/sec Avg 34803 28756 3571
MB/sec Min 32815 26940 2804
MHz Avg 1800 1800 1800
MHz Min 1800 1800 1800
Volts 0.9500 0.9500 0.9500
Temperatures CPU PMIC CPU PMIC CPU PMIC
°C °C °C °C °C °C
Avg 55.8 55.3 59.9 58.6 48.2 51.6
Max 59.0 57.1 63.0 61.8 50.0 52.4
swpd free
Minutes
0 42 46.7 44 49.6 42 46.7 0 3417124
1 51 50.5 56 53.3 46 49.6 77312 115776
2 53 52.4 57 55.2 48 50.5 76800 107948
3 55 52.4 57 55.2 47 50.5 76800 107648
4 54 53.3 58 57.1 48 50.5 76800 107884
5 54 54.3 59 57.1 48 51.4 76800 107884
6 55 54.3 58 57.1 48 51.4 76800 107128
7 55 54.3 61 57.1 48 51.4 76800 108388
8 55 55.2 60 58.0 48 51.4 76544 106120
9 55 55.2 61 59.0 48 51.4 76544 105868
10 56 55.2 60 59.0 49 51.4 76544 104356
11 55 55.2 59 59.0 49 51.4 76544 105364
12 58 55.2 60 59.0 49 51.4 76544 104356
13 57 55.2 61 59.0 49 51.4 74240 116292
14 58 55.2 62 59.0 49 52.4 74240 129052
15 57 57.1 63 59.0 48 52.4 74240 129052
16 56 55.2 61 59.0 49 52.4 74240 128548
17 57 57.1 62 59.0 49 52.4 74240 128664
18 57 57.1 63 59.0 49 52.4 74240 128012
19 58 57.1 62 59.0 49 52.4 74240 129272
20 58 57.1 62 60.9 50 52.4 74240 128784
21 58 57.1 62 60.9 49 52.4 74240 128028
22 59 57.1 61 59.0 49 52.4 74240 128532
23 58 57.1 63 60.9 48 52.4 74240 128280
24 58 57.1 63 60.9 49 52.4 74240 127020
25 58 57.1 63 60.9 49 52.4 74240 128532
26 58 57.1 62 60.9 48 52.4 74240 128532
27 58 57.1 60 60.9 50 52.4 74240 128280
28 59 57.1 62 60.9 48 52.4 73728 127776
29 58 57.1 63 61.8 49 52.4 73728 126516
30 50 55.2 52 59.0 44 51.4 73472 127524
|
The script file, shown below, was used to kick off the programs at the same time (within 10 seconds, validated by provided results logs). The tests were run on the latest 8 GB Pi 4B, with cooling fan, and the fanless Pi 400 PC. The 4B drive was a 32 GB SD card with the Pi 400 using a higher speed USB 3 booted disk drive.
######################## Script File ######################## lxterminal -e ./RPiHeatMHzVolts2 Passes 16 Seconds 60 Log 31 & lxterminal -e ./liverloopsPiA7R Seconds 12 Log 31 & lxterminal -e ./MP-IntStress Threads 1 KB 15000 Mins 15 Log 31 & lxterminal -e ./burnindrive2 Repeats 16, Minutes 12, Log 31, Seconds 1 & export vblank_mode=0 & lxterminal -e ./videogl32 Test 6 Mins 15 Log 31 & vmstat 60 16 > vmstat31.txt
The following results cover CPU MHz, voltage, temperatures and utilisation of memory, drives and CPU, with details for other programs on the next page. Both systems appeared to run continuously at maximum CPU MHz, without temperatures increasing anywhere near the point where throttling would occur. The Pi 4B CPU started 5°C higher, continuing with the same difference until the end. The Pi 400 PMIC started 4°C higher and that increased to 6°C.
VMSTAT shows that not much RAM was needed for these tests, both systems having similar CPU + Wait For I/O utilisations, with around 1% idle time. The main difference was main drive MB/second, with the Pi 400 disk drive some 75% faster than the Pi 4B SD card. Not too much can be read into that. It might have been the opposite effect, with the Pi 4B using the hard drive.
Results on the next page indicate that the Pi 400 obtained an official Livermore Loops average of 592.1 MFLOPS, compared with 494.4 on the Pi 4B, a difference of 20%. The two systems obtained similar speeds during the integer RAM tests, of over 2.3 GB/second, with the Pi 400 producing an 11% performance advantage, running running the OpenGL Textured Kitchen routine.
################# Pi 4B ################# ################# Pi 400 ################
================== CPU MHz CPU Voltage and Temperature Measurement =================
Secs Start at Wed Aug 12 14:03:08 2020 Secs Start at Wed Aug 12 14:02:58 2020
0 ARM MHz=1500 0.86V CPU=46°C pmic=42°C 0 ARM MHz=1800 0.95V CPU=41°C pmic=46°C
60 ARM MHz=1500 0.86V CPU=56°C pmic=47°C 60 ARM MHz=1800 0.95V CPU=49°C pmic=50°C
121 ARM MHz=1500 0.86V CPU=59°C pmic=49°C 121 ARM MHz=1800 0.95V CPU=51°C pmic=51°C
182 ARM MHz=1500 0.86V CPU=59°C pmic=49°C 182 ARM MHz=1800 0.95V CPU=50°C pmic=52°C
243 ARM MHz=1500 0.86V CPU=57°C pmic=49°C 243 ARM MHz=1800 0.95V CPU=51°C pmic=52°C
304 ARM MHz=1500 0.86V CPU=59°C pmic=49°C 303 ARM MHz=1800 0.95V CPU=53°C pmic=53°C
365 ARM MHz=1500 0.86V CPU=59°C pmic=50°C 364 ARM MHz=1800 0.95V CPU=54°C pmic=54°C
426 ARM MHz=1500 0.86V CPU=59°C pmic=50°C 425 ARM MHz=1800 0.95V CPU=53°C pmic=54°C
486 ARM MHz=1500 0.86V CPU=59°C pmic=50°C 486 ARM MHz=1800 0.95V CPU=52°C pmic=55°C
547 ARM MHz=1500 0.86V CPU=59°C pmic=49°C 548 ARM MHz=1800 0.95V CPU=53°C pmic=55°C
608 ARM MHz=1500 0.86V CPU=59°C pmic=49°C 609 ARM MHz=1800 0.95V CPU=55°C pmic=55°C
669 ARM MHz=1500 0.86V CPU=58°C pmic=50°C 670 ARM MHz=1800 0.95V CPU=55°C pmic=55°C
730 ARM MHz=1500 0.86V CPU=59°C pmic=49°C 731 ARM MHz=1800 0.95V CPU=54°C pmic=55°C
790 ARM MHz=1500 0.86V CPU=58°C pmic=49°C 792 ARM MHz=1800 0.95V CPU=54°C pmic=55°C
851 ARM MHz=1500 0.86V CPU=59°C pmic=49°C 854 ARM MHz=1800 0.95V CPU=54°C pmic=56°C
912 ARM MHz=1500 0.86V CPU=51°C pmic=48°C 915 ARM MHz=1800 0.95V CPU=48°C pmic=55°C
End at Wed Aug 12 14:19:21 2020 End at Wed Aug 12 14:19:14 2020
============================== vmstat 60 second samples =============================
Memory MB MB/sec CPU %utilise Wait Memory MB MB/sec CPU %utilise Wait
free buff cache in out user sys idle I/O free buff cache in out user sys idle I/O
7505 23 224 0 0 3 1 95 1 3377 89 229 0 0 3 1 94 2
7428 24 263 0 7 75 7 3 15 3313 89 256 33 11 76 12 2 10
7424 24 266 15 4 76 9 0 15 3315 89 252 42 0 76 11 0 12
7424 24 265 24 0 76 10 1 13 3316 89 252 42 0 76 11 1 12
7423 24 265 25 0 75 10 1 15 3315 89 252 43 0 76 10 2 12
7423 24 265 24 0 75 10 1 15 3312 89 256 42 0 76 11 1 13
7422 24 266 24 0 75 9 1 16 3313 89 254 41 0 76 10 1 13
7422 24 268 24 0 75 10 1 14 3311 89 256 41 0 77 11 0 12
7422 24 268 24 0 76 10 1 13 3310 89 257 42 0 76 11 1 12
7420 24 269 24 0 75 9 1 14 3311 89 255 41 0 77 10 1 12
7422 24 267 24 0 76 10 1 13 3310 89 257 41 0 77 11 0 12
7423 24 267 24 0 74 10 1 15 3308 89 258 41 0 77 11 1 11
7420 24 269 24 0 75 9 0 15 3308 89 258 43 0 76 11 2 12
7419 25 270 24 0 74 9 0 16 3309 89 256 64 0 77 1 5 1
7420 25 268 24 0 75 10 1 14 3309 90 256 75 0 77 1 6 1
7423 25 266 25 0 70 10 4 16 3309 90 258 78 0 63 1 6 1
|
MP Integer RAM Exerciser and OpenGL Benchmark - These report results as the tests progress, and performance for both is provided together below. There can be performance variations over the testing time, depending on activities in other programs or manual interventions.
BurnInDrive uses 64 KB block sizes, with 164 variations of data patterns, where a parameter controls file size, in this case 16 blocks for 164 MB files. Four of these are written then read by random selection for a specified time. Finally, blocks are read continuously for a specified number of seconds. Further information is in Raspberry Pi 4B Stress Tests Including High Performance Linpack.htm.
Performance from the Pi 400 hard drive was clearly superior to that from the Pi 4B SD card. Calculated reading speeds were effectively the same as indicated by VMSTAT.
======= Livermore Loops 64 Bit Reliability test 12 seconds each loop x 24 x 3 =======
Pi 4B Pi 400
Wed Aug 12 14:03:08 2020 Wed Aug 12 14:02:58 2020
Numeric results were as expected Numeric results were as expected
MFLOPS for 24 loops MFLOPS for 24 loops
734.0 933.4 982.3 939.1 204.1 717.0 820.5 1060.4 1063.9 1066.8 233.7 517.4
1128.8 1600.7 1225.6 383.5 211.8 184.8 1358.0 1911.3 1521.3 487.6 251.1 220.1
135.9 267.9 710.0 619.7 731.0 1012.8 188.6 363.1 842.4 734.0 868.1 1177.8
315.4 330.3 305.8 352.7 681.0 186.4 379.2 393.9 303.3 416.7 835.5 207.1
Maximum Average Geomean Harmean Minimum Maximum Average Geomean Harmean Minimum
1600.7 610.8 494.4 390.8 117.9 1911.3 728.0 592.1 472.4 164.5
End of test Wed Aug 12 14:17:56 2020 End of test Wed Aug 12 14:17:24 2020
===================== MP Integer RAM and OpenGL Tests ======================
Pi 4B Pi 400
Start Aug 12 2020 14:03:08 14:03:08 14:02:58 14:02:58
Secs Kbytes Thrds Pattern All Same MB/sec FPS MB/sec FPS
30 15000 1 00000000 Yes 2528 13 2428 14
60 15000 1 FFFFFFFF Yes 2501 13 2379 15
90 15000 1 FFFFFFFF Yes 2217 13 2539 13
To
840 15000 1 AAAAAAAA Yes 2217 14 2175 17
870 15000 1 CCCCCCCC Yes 2569 12 2343 15
900 15000 1 CCCCCCCC Yes 2455 13 2348 16
Average 2351 13.2 2394 14.7
End Aug 12 2020 14:18:12 14:18:11 14:18:03 14:18:01
======================== burnindrive2 Main Drive =========================
pi 400 Pi 4B
Start Wed Aug 12 14:02:58 2020 14:03:08 2020
Write seconds 164.00 MB x 4 files 11.93 82.49
Read files for 12+ minutes Files Minutes
x 4
Read passes 1 x 4 Files x 164.00 MB in 0.26 minutes 1 0.44
Read passes 2 x 4 Files x 164.00 MB in 0.52 minutes 2 0.92
TO
Read passes 25 x 4 Files x 164.00 MB in 6.52 minutes 13 5.88
Read passes 26 x 4 Files x 164.00 MB in 6.78 minutes 14 6.34
To
Read passes 45 x 4 Files x 164.00 MB in 11.79 minutes 26 11.79
Read passes 46 x 4 Files x 164.00 MB in 12.08 minutes 27 12.25
Calculated MB/second over 12+ minutes 41.6 24.1
Passes in 1 second(s) for each of 164 blocks of 64KB:
Examples
1140 1180 1160 1220 1280 1360 1520 1520 1460 1240 420 420
1260 1200 1160 1140 1140 1140 1160 1140 1160 1120 380 400
To
1320 1400 1360 1300 1160 1240 1360 1380 1400 1140 540 560
1240 1240 1240 1220 1220 1240 1180 1180 1160 1180 560 560
Passes Minutes
200220 read passes of 64KB blocks in 2.76 minutes 79580 2.80
No errors found during reading tests
End Wed Aug 12 14:18:00 2020 14:19:34 2020
|
The vmstat report shows that all these programs ran without memory swapping, with nearly all four cores being used continuously. Recorded data transfer speeds confirmed those measured by the drive program. Processor speed and and measured OpenGL frames per second were constant, with low temperatures being maintained.
vmstat RPiHeatMHzVolts MP-Int OpenGL
Memory MB------- MB/sec CPU %util-- %wait ARM Volts CPU PMIC Stress Test 6
Minutes swpd free cache in out usr sys idl I/O MHz °C °C MB/sec FPS
0 0 3285 298 0 0 8 0 91 0 1800 0.95 38 41
1 0 291 326 0 11 74 8 2 17 1800 0.95 46 45 2198 22
2 0 277 329 28 0 77 8 1 14 1800 0.95 47 46 2202 21
3 0 273 332 28 0 76 7 1 16 1800 0.95 46 47 2203 21
4 0 273 333 28 0 76 7 1 16 1800 0.95 48 48 2211 21
5 0 272 334 28 0 76 7 1 15 1800 0.95 49 48 2201 22
6 0 275 331 28 0 76 7 1 16 1800 0.95 48 48 2196 22
7 0 275 330 28 0 76 7 1 16 1800 0.95 50 49 2193 21
8 0 270 334 28 0 76 7 1 16 1800 0.95 48 49 2189 21
9 0 275 330 28 0 76 7 1 16 1800 0.95 49 49 2175 22
10 0 274 331 28 0 76 7 1 15 1800 0.95 51 50 2169 21
11 0 273 331 28 0 76 7 1 15 1800 0.95 51 50 2166 20
12 0 271 333 28 0 76 7 1 16 1800 0.95 51 50 2162 21
13 0 271 334 28 0 76 7 2 15 1800 0.95 51 50 2156 21
14 0 270 335 30 0 76 7 1 16 1800 0.95 51 50 2148 21
15 0 271 335 30 0 70 7 6 16 1800 0.95 46 49 2129 20
Avg 1800 0.95 48 48 2180 21
Min 1800 0.95 38 41 2129 20
Max 1800 0.95 51 50 2211 22
|
Later, two terminals were connected from Putty, on a PC. Sysstat software was installed from there, to enable monitoring of network data transfer speeds. VMSTAT system utilisation monitor was started from the second terminal, both saving results on the Pi 400 SD card.
Received network data mainly arrived continuously at around 214k Bytes per second. Taking into account extra overhead bits, that is similar to 1700k bits per second. The increases to more than 250 kB/s and associated transmitted bytes were included after I opened VNC Viewer, on my Smart Phone, to have a look at the TV picture there. It was really bad, with jumpy rather than smooth flow. Assuming that full screen data is transferred, rather than in compressed input format, 960 x 540 pixels at 4 bytes per pixel indicates over 2000 kB, implying supplied data to the phone would result in an extremely low displayed frames per second.
VMSTAT indicates low Pi 400 CPU utilisation. The only noticeable activity is data output to the main drive being the same as kB/s received over the network. The burst of reading from the drive, near the end, occurred following pausing the iPlayer for a short time, followed by continuing playing the recording.
sar -n DEV 1800 20 Communications Traffic
10:38:54 rxpck/s txpck/s rxkB/s txkB/s rxcmp/s txcmp/s rxmcst/s %ifutil
11:08:54 147.41 61.02 214.87 4.27 0.00 0.00 0.04 0.18
11:38:54 146.16 62.63 211.44 4.38 0.00 0.00 0.04 0.17
12:08:54 898.76 1548.94 261.53 2104.84 0.00 0.00 0.54 1.72
12:38:54 1028.00 1794.83 273.61 155.17 0.00 0.00 1.40 0.22
13:08:54 148.80 62.42 216.92 4.36 0.00 0.00 0.04 0.18
13:38:54 148.77 62.69 216.94 4.38 0.00 0.00 0.03 0.18
14:08:54 3266.34 3909.49 155.45 2299.59 0.00 0.00 2.33 1.88
14:38:54 147.26 62.17 214.69 4.34 0.00 0.00 0.05 0.18
15:08:54 149.15 62.01 216.70 4.33 0.00 0.00 0.66 0.18
15:38:54 146.12 62.80 211.66 4.39 0.00 0.00 1.17 0.17
16:08:54 148.26 61.73 216.21 4.31 0.00 0.00 0.03 0.18
16:38:54 148.85 62.48 217.04 4.37 0.00 0.00 0.03 0.18
vmstat 1800 20 System Utilisation
procs -----------memory---------- ---swap-- -----io---- -system-- ------cpu-----
r b swpd free buff cache si so bi bo in cs us sy id wa st
1 1 0 2772060 39876 714888 0 0 14 58 680 1022 6 3 91 0 0
0 0 0 2659720 46688 797124 0 0 31 259 2771 4188 6 3 91 0 0
0 0 0 2670300 49392 790776 0 0 0 233 2697 4046 5 3 92 0 0
1 0 0 2636548 52248 812228 0 0 3 232 3014 4191 14 4 81 0 0
1 0 0 2606896 55732 819748 0 0 4 239 3271 4198 24 6 70 0 0
0 1 0 2651508 58032 812232 0 0 0 237 2704 4037 6 3 91 0 0
0 0 0 2626876 60160 822588 0 0 0 235 2687 4038 5 3 92 0 0
0 0 0 2631420 62128 821656 0 0 0 238 2703 4034 5 3 92 0 0
0 0 0 2634884 64036 817980 0 0 0 235 2688 4033 5 3 92 0 0
0 0 0 2643896 65856 813900 0 0 0 237 2684 4032 5 3 92 0 0
0 0 0 2629000 67704 816040 0 0 0 233 2682 4036 5 3 91 0 0
4 0 0 2529104 68992 899540 0 0 40 238 2818 4258 6 3 90 0 0
0 0 0 2529352 70632 896856 0 0 0 237 2693 4034 5 2 92 0 0
Temperature and CPU MHz Measurement
Start at Wed Aug 19 08:42:13 2020
Using samples at 1800 second intervals
Seconds
0.0 ARM MHz=1800, core volt=0.9500V, CPU temp=32.0'C, pmic temp=32.6'C
1800.0 ARM MHz=1800, core volt=0.9500V, CPU temp=35.0'C, pmic temp=38.2'C
3600.3 ARM MHz=1800, core volt=0.9500V, CPU temp=36.0'C, pmic temp=39.2'C
5400.5 ARM MHz=1800, core volt=0.9500V, CPU temp=36.0'C, pmic temp=40.1'C
7200.8 ARM MHz=1800, core volt=0.9500V, CPU temp=37.0'C, pmic temp=40.1'C
9001.1 ARM MHz=1800, core volt=0.9500V, CPU temp=36.0'C, pmic temp=40.1'C
10801.3 ARM MHz=1800, core volt=0.9500V, CPU temp=38.0'C, pmic temp=40.1'C
12601.7 ARM MHz=1800, core volt=0.9500V, CPU temp=41.0'C, pmic temp=42.9'C
14401.9 ARM MHz=1800, core volt=0.9500V, CPU temp=38.0'C, pmic temp=42.0'C
16202.2 ARM MHz=1800, core volt=0.9500V, CPU temp=38.0'C, pmic temp=41.1'C
18002.4 ARM MHz=1800, core volt=0.9500V, CPU temp=37.0'C, pmic temp=41.1'C
19802.7 ARM MHz=1800, core volt=0.9500V, CPU temp=37.0'C, pmic temp=41.1'C
21603.0 ARM MHz=1800, core volt=0.9500V, CPU temp=37.0'C, pmic temp=41.1'C
23403.3 ARM MHz=1800, core volt=0.9500V, CPU temp=37.0'C, pmic temp=41.1'C
25203.5 ARM MHz=1800, core volt=0.9500V, CPU temp=37.0'C, pmic temp=41.1'C
27003.8 ARM MHz=1800, core volt=0.9500V, CPU temp=37.0'C, pmic temp=41.1'C
28804.1 ARM MHz=1800, core volt=0.9500V, CPU temp=37.0'C, pmic temp=41.1'C
Terminated Wed Aug 19 16:42
|
Without paying serious attention, full screen close up picture quality was acceptable. Then, right clicking on the screen, from time to time, indicated the following properties, showing large differences in the amount of data handled and displayed. There were corresponding variations in monitored statistics, with network received traffic (rxkB/s), vmstat drive kB data out (bo), and CPU utilisation (us + sy). CPU and PMIC temperatures reduced, but that might have been due to the room becoming cooler approaching midnight. I suppose that the varying traffic levels were caused by network congestion (but was it?).
Bluetooth - I found it difficult to connect bluetooth devices (in my environment?). After failing to pair, I could find no menu based operation to prevent further error indications. Executing the commands, shown below, allowed more attempts and sometimes successful connection.
kbps pixels
Periodic 1700 960 x 540
Properties 5166 1280 x 720 at 18:30
Displayed 932 704 x 396
533 512 x 288
5166 1280 x 720
sar -n DEV 1800 20 Communications Traffic
16:25:17 rxpck/s txpck/s rxkB/s txkB/s rxcmp/s txcmp/s rxmcst/s %ifutil
16:55:17 142.08 73.26 205.16 5.13 0.00 0.00 1.93 16.81
17:25:17 137.60 70.84 198.17 4.97 0.00 0.00 2.10 16.23
17:55:17 130.85 67.14 188.22 4.71 0.00 0.00 2.18 15.42
18:25:17 397.85 193.62 576.05 13.28 0.00 0.00 0.49 47.19
18:55:17 455.64 221.75 661.41 15.08 0.00 0.00 0.03 54.18
19:25:17 450.00 217.99 654.41 14.73 0.00 0.00 0.04 53.61
19:55:17 446.87 216.61 649.94 14.68 0.00 0.00 0.03 53.24
20:25:17 176.93 88.05 256.34 6.11 0.00 0.00 1.43 21.00
20:55:17 134.94 68.95 193.33 4.90 0.00 0.00 2.08 15.84
21:25:17 84.61 44.44 117.08 3.32 0.00 0.00 2.04 9.59
21:55:17 79.75 42.79 110.32 3.24 0.00 0.00 2.07 9.04
22:25:17 51.18 28.03 69.52 2.26 0.00 0.00 1.99 5.70
22:55:17 51.49 28.67 69.98 2.32 0.00 0.00 2.07 5.73
23:25:17 37.40 21.18 49.87 1.82 0.00 0.00 2.04 4.09
vmstat 1800 20 System Utilisation
procs -----------memory---------- ---swap-- -----io---- -system-- ------cpu-----
r b swpd free buff cache si so bi bo in cs us sy id wa st
2 0 0 2586672 50628 714756 0 0 39 4 148 161 2 1 97 0 0
0 0 0 2314608 54248 897128 0 0 0 235 6690 5244 15 4 81 0 0
4 0 0 2289668 57772 911392 0 0 0 219 6638 5156 14 3 82 0 0
1 0 0 2287436 60836 913288 0 0 0 226 6688 5248 14 4 82 0 0
4 0 0 2123628 64124 970940 0 0 1 356 5991 4691 27 5 68 0 0
3 0 0 2102296 67052 973788 0 0 0 650 7919 5869 50 9 41 0 0
10 0 0 2019576 69760 1020768 0 0 0 665 7963 5894 51 9 40 0 0
7 0 0 2011040 72504 1028204 0 0 0 636 7882 5837 50 9 41 0 0
1 0 0 2017004 74848 1021604 0 0 0 473 7445 5680 37 7 56 0 0
2 0 0 2009756 77008 1013644 0 0 0 230 6684 5016 15 3 82 0 0
0 0 0 2019832 79008 1002208 0 0 0 164 6491 4966 12 3 85 0 0
0 0 0 2006692 80804 1013468 0 0 0 140 6471 4992 11 3 86 0 0
0 0 0 2005796 82740 1007964 0 0 0 107 6381 4839 9 3 88 0 0
1 0 0 1986264 84220 1029168 0 0 0 92 6344 4781 8 3 89 0 0
0 0 0 1995508 85804 1010960 0 0 0 85 6351 4789 8 3 90 0 0
1 0 0 1995000 87296 1011092 0 0 0 75 6295 4743 7 2 90 0 0
Temperature and CPU MHz Measurement Start at Sat Sep 12 16:11:35 2020
Seconds
0.0 ARM MHz=1800, core volt=0.9500V, CPU temp=42.0'C, pmic temp=43.9'C
1800.0 ARM MHz=1800, core volt=0.9500V, CPU temp=42.0'C, pmic temp=46.7'C
3600.3 ARM MHz=1800, core volt=0.9500V, CPU temp=42.0'C, pmic temp=46.7'C
5400.5 ARM MHz=1800, core volt=0.9500V, CPU temp=43.0'C, pmic temp=47.7'C
7200.8 ARM MHz=1800, core volt=0.9500V, CPU temp=47.0'C, pmic temp=50.5'C
9001.3 ARM MHz=1800, core volt=0.9500V, CPU temp=48.0'C, pmic temp=51.4'C
10801.7 ARM MHz=1800, core volt=0.9500V, CPU temp=48.0'C, pmic temp=51.4'C
12602.2 ARM MHz=1800, core volt=0.9500V, CPU temp=48.0'C, pmic temp=51.4'C
14402.6 ARM MHz=1800, core volt=0.9500V, CPU temp=44.0'C, pmic temp=48.6'C
16202.9 ARM MHz=1800, core volt=0.9500V, CPU temp=44.0'C, pmic temp=47.7'C
18003.2 ARM MHz=1800, core volt=0.9500V, CPU temp=42.0'C, pmic temp=46.7'C
19803.4 ARM MHz=1800, core volt=0.9500V, CPU temp=41.0'C, pmic temp=46.7'C
21603.7 ARM MHz=1800, core volt=0.9500V, CPU temp=41.0'C, pmic temp=45.8'C
23404.0 ARM MHz=1800, core volt=0.9500V, CPU temp=40.0'C, pmic temp=45.8'C
25204.2 ARM MHz=1800, core volt=0.9500V, CPU temp=41.0'C, pmic temp=45.8'C
27004.4 ARM MHz=1800, core volt=0.9500V, CPU temp=40.0'C, pmic temp=45.8'C
Buluetooth Commands sudo hciconfig hci0 reset
sudo invoke-rc.d bluetooth restart
|
For the first two examples, although using -O2 produced faster single precision and integer calculations from cached data, performance using RAM was reduced to half speed. The integer stress test also regained appropriate cache based speeds, but with no loss on RAM performance.
It seems that anyone hoping for faster SIMD operation, with these types of program, should also try to compile not using vectorisation, to verify performance gains.
########### Memory Reading Speed Test 64 Bit gcc 8 0pt -02 ###########
Memory x[m]=x[m]+s*y[m] Int+ x[m]=x[m]+y[m] x[m]=y[m]
KBytes Dble Sngl Int32 Dble Sngl Int32 Dble Sngl Int32
Used MB/S MB/S MB/S MB/S MB/S MB/S MB/S MB/S MB/S
8 16065 11315 8296 16099 9473 9459 12353 8037 9349
16 16245 11407 8309 16259 9522 9513 12569 7993 9466
32 14290 10468 7747 14377 8451 8248 12673 8039 9525
64 12853 10212 7867 13049 7747 7975 10854 7452 9026
128 12970 10307 7958 13149 7852 8070 10159 7610 9094
256 13021 10286 7986 13157 7958 8078 9714 7706 8986
512 12781 10259 7958 13009 7951 8079 9631 7665 9033
1024 3689 4372 3978 4432 3886 3902 5865 5469 5928
2048 1800 1792 1722 1805 1769 1750 3023 2984 2949
4096 1921 1933 1905 1918 1910 1894 2658 2678 2686
8192 1962 1961 1809 1952 1955 1926 2596 2601 2613
########### Memory Reading Speed Test 64 Bit gcc 8 0pt -03 ###########
8 18133 4792 4749 18693 5259 5275 13962 11182 11182
256 14783 4646 4716 14698 5053 5063 9666 9768 9809
8192 2036 3940 3882 2034 3935 3995 2642 2643 2638
##### NEON Speed Test 64 Bit gcc 8 Opt -02 #####
Vector Reading Speed in MBytes/Second
Memory Float v=v+s*v Int v=v+v+s Neon v=v+v
KBytes Norm Neon Norm Neon Float Int
16 11286 19667 8090 18132 19678 22533
32 10394 14494 7193 13225 14233 14562
64 10765 13825 7457 12642 13846 14040
128 11057 14324 7769 13237 14394 14612
256 11113 14477 7844 13318 14530 14674
512 11149 14560 7893 13392 14627 14637
1024 4513 4758 3637 3808 4211 4770
4096 2063 2053 2086 2042 2060 2062
16384 2058 2051 2054 2056 2054 2043
65536 2059 2045 2049 2064 2049 2050
##### NEON Speed Test 64 Bit gcc 8 Opt -03 #####
16 4496 19696 4790 17870 18908 21817
256 3992 14148 4716 13508 14311 14312
65536 3319 2057 3803 2011 2059 2063
#### MP-Integer-Test 64 Bit v2-gcc8 opt -02 ####
MB/second
KB KB MB Same All
Secs Thrds 16 160 16 Sumcheck Tests
4.2 1 8040 7892 3783 00000000 Yes
3.2 2 17193 15430 3685 FFFFFFFF Yes
3.0 4 29261 29819 3329 5A5A5A5A Yes
3.0 8 29886 31708 3383 AAAAAAAA Yes
3.0 16 30410 33010 3365 CCCCCCCC Yes
2.9 32 30375 33435 3392 0F0F0F0F Yes
#### MP-Integer-Test 64 Bit v2-gcc8 opt -03 ####
7.4 1 3455 3481 3074 00000000 Yes
4.7 2 7047 6975 3507 FFFFFFFF Yes
3.6 4 13712 13977 3357 5A5A5A5A Yes
3.6 8 13631 13696 3353 AAAAAAAA Yes
3.7 16 13184 13906 3351 CCCCCCCC Yes
3.6 32 12617 13960 3414 0F0F0F0F Yes
|