# Android Benchmarks for 32 bit and 64 bit CPUs from ARM, Intel and MIPS - Roy Longbottom's PC benchmark Collection

Roy Longbottom at Linkedin  Roy Longbottom's Android Benchmarks
For 32 Bit and 64 Bit CPUs from ARM, Intel and MIPS


Contents


General Logged Configuration Whetstone Benchmark
Dhrystone Benchmark Linpack Benchmark Livermore Loops Benchmark
MemSpeed Benchmark NeonSpeed Benchmark BusSpeed Benchmark
RandMem Benchmark FFT Benchmarks MP-Whetstone Benchmark
MP-Dhrystone Benchmark NEON-Linpack-MP Benchmark MP-BusSpeed Benchmark
MP-RandMem Benchmark MP-MFLOPS Benchmark NEON-MFLOPS-MP Benchmark
OpenGL Benchmark Java Drawing Benchmark CPU MHz Benchmark
Battery Drain Test DriveSpeed Benchmark System Details

Download Benchmark Apps


A Settings, Security option may need changing to allow installation of non-Market applications.
All have an option to save results via Email. The first set automatically select benchmark code for ARM,
Intel or MIPS processors at run time, for 32 bit architecture or 64 bit when supported.

Logo NativeWhetstone2.apk
First standard benchmark
Download
Logo Dhrystone2i.apk
First integer benchmark
Download
Logo LinpackDP2.apk
First comptutational benchmark
Download
Logo LinpackSP2.apk
Single precision Linpack
Download
Logo LivermoreLoops2.apk
First supercomputer benchmark
Download
Logo MemSpeedi.apk
Floating Point Cache and
RAM Test
Download
Logo BusSpeedv7i.apk
Integer Bus, Cache and RAM
Test
Download
Logo RandMemi.apk
Random/Serial Access
Cache and RAM Test
Download
Logo fft1.apk
Original FFT Benchmark
Download
Logo fft3c.apk
Optimised FFT Benchmark
Download
Logo MP-WHETSi.apk
Whetstone Floating and Fixed Point Tests
Download
Logo MP-Dhryi.apk
Dhrystone Integer Benchmark
Download
Logo MP-MFLOPS2i.apk
CPU, Cache, RAM MFLOPS Long Running Test
Download
Logo MP-BusSpd2i.apk
Long running vesion
with staggered start
Download
Logo MP-RndMemi.apk
Multithreaded RandMem
Benchmark
Download
Logo NEON-Linpacki.apk
Linpack Benchmark using ARM
NEON Intrinsic Functions
Download
Logo NeonSpeedi.apk
NEON Memory Speed Test
Using Intrinsic Functions
Download
Logo NEON-MFLOPS2i-MP.apk
MP-MFLOPS using ARM
NEON Intrinsic Functions
Download
Logo NEON-Linpacki-MP.apk
Linpack MP Benchmark using
NEON Intrinsic Functions
Download
Logo ZIP File 32 Bit Versions
To check performance gains of 64 bit benchmarks
Download

The above were produced using gcc 4.8, via Eclipse, running under Linux Ubuntu 14.04
Following are older 32 bit benchmarks that are still relevant.

Logo Java Whetstone.apk
First standard benchmark
Download
Logo LinpackJava.apk
All Java version
Download
Logo JavaOpenGL1.apk
3D Graphics Frames Per second
Download
Logo JavaDraw.apk
Draw Frames Per Second
Download
Logo CP_MHz2.apk
Measure CPU MHz
Download
Logo BatteryTest.apk
Battery Drain Test using graphics
Download
Logo DriveSpeed.apk
SD card/internal drive tests
Download
Logo DriveSpd2.apk
Drive tests, user defined path
Download

General

My original Android benchmarks were compiled to only run on ARM CPUs using 32 bit instructions. These are available from a copy in British Library Archives or from here. The newer ones automatically select benchmark code for ARM, Intel or MIPS processors at run time, for 32 bit architecture or 64 bit when supported. These were produced using a later version of the gcc compiler. When evaluating performance differences of 64 bit operation, those at 32 bits should be produced by the same compiler version. These are in the 32 bit zip file that can be downloaded from above. It should be noted that these are recognised, by Android, as identical to the 64 bit versions, that might need to be reinstalled. The version is identified in the output display.

The original ARM native code benchmarks will run on Intel CPUs, but slowly, via an Android based compatibility layer, called Houdini, that maps ARM instructions into those for X86 processors. The new ones use native Intel instructions. After installing Android 5.0, on the Intel tablet, the original ARM native code benchmarks were rerun. As shown in the results below, significant speed gains could be obtained.

The latest benchmarks were compiled using gcc 4.8, via Eclipse Android Development Tools. The project files, with source code, are in Android Intel-ARM Benchmarks.zip. Limited tests show that these projects can also be used to produce the benchmarks via Android Studio. The zip file now includes the projects for the above earlier tests, in an folder named Old.

All Java and native C/C++ based benchmarks use the same Java front end to run the benchmarks and display the results, an example being below. There are Run, Information and Save buttons, the latter to eMail results to me and/or whoever. The results are also saved in text based log files that also identifies system characteristics. As indicated, results identify whether 32 bit or 64 bit code has been executed.

Logo

New Version of android benchmarks.htm - The revised version of this report will contain results from running a wide range of the 32/64 bit benchmarks on a particular tablet or phone, plus any from newer top end devices. For detail and results from the original benchmarks see the last report.

Strategy - These benchmarks, based on 50 years experience, do not attempt to provide an overall performance rating (the Lies, Damned Lies and Benchmarks type), as it is meaningless in representing the diverse variety of user activities. The programs are intended to identify best and worst performance characteristics that might explain why a particular application is fast or slow.

CPU Benchmarks - The first set the Classic Benchmarks that were the original programs that set standards of performance for computers, comprising Whetstone, Dhrystone, Linpack (including NEON-Linpack) and Livermore Loops.

Memory Benchmarks - Next are programs that measure performance with data from caches and RAM. MemSpeed (including NeonSpeed variant), BusSpeed and RandMrm all use the same range of data sizes beteen 4 KB and 64 MB. Then there is a Fast Fourier Transform benchmark with multiple data sizes.

MultiThreading Benchmarks - These all measure performance using 1, 2, 4 and 8 threads. The first are MP-Whetstone, MP-Dhrystone and MP-Linpack (including NEON-Linpack-MP). The next batch all use memory sized 12.8 KB, 128 KB and 12.8 MB, comprising MP-MFLOPS (including NEON-MFLOPS MP), MP-BusSpeed and MP-RandMem.

Older Benchmarks - These include graphics and SD drive benchmarks.

Windows 10 Tablet - The C code part of the benchmarks has been used as the basis of programs compiled, as 32 bits and 64 bits, to run on Intel processors via Windows. Results are included below for comparison purposes, but performance might not be the same as that from Android versions running on the same Intel processor model (See system W1). The benchmark execution files are in WinTablet.zip.

March 2016 - A second Windows 10 tablet was obtained, using the same Atom CPU, with the added dual boot option to use Android. This uses a 64 bit Linux kernel but, unfortunately, Android is a 32 bit variety. Results for both Operating Systems are in=ncluded below.

To Start


Logged Configuration

Following are examples of ARM and Intel based system information included in the log files.


 ARM CPU Based System Information

 Device Asus Nexus 7
 Screen pixels w x h 800 x 1205
 Android Build Version      5.0.2
 Processor : ARMv7 Processor rev 9 (v7l)
 processor : 0
 BogoMIPS : 1993.93
 processor : 1
 BogoMIPS : 1993.93
 processor : 2
 BogoMIPS : 1993.93
 processor : 3
 BogoMIPS : 1993.93
 Features : swp half thumb fastmult vfp edsp neon vfpv3 tls
 CPU implementer : 0x41
 CPU architecture: 7
 CPU variant : 0x2
 CPU part : 0xc09
 CPU revision : 9
 Hardware : grouper
 Revision : 0000
 Linux version 3.1.10-g6ff7a51 (android-build@vpbs1.?mtv.?corp.?google.?com?)
 (gcc version 4.7 (GCC) ) #1 SMP PREEMPT Wed Dec 10 19:55:59 UTC 2014


 Intel CPU Based System Information

 Device Asus K013
 Screen pixels w x h 800 x 1216
 Android Build Version      4.4.2
 d : 0, siblings : 4, core id : 3, cpu cores : 4, apicid : 6, initial apicid : 6
 fdiv_bug : no, f00f_bug : no, coma_bug : no, fpu : yes, fpu_exception : yes
 cpuid level : 11, wp : yes
 flags : fpu vme + numerous others including up to SSE4
 bogomips : 2666.77
 clflush size : 64
 cache_alignment : 64
 address sizes : 36 bits physical, 48 bits virtual
 processor : 3
 vendor_id : GenuineIntel
 cpu family : 6
 model : 55
 model name : Intel(R) Atom(TM) CPU  Z3745  @ 1.33GHz
 stepping : 8
 microcode : 0x81b
 cpu MHz : 1862.000
 cache size : 1024 KB
 physical i
 Linux version 3.10.20-g268162b (3.2.23.182) (gcc version 4.7 (GCC) ) #1 SMP
 PREEMPT Tue Sep 16 10:49:37 CST 2014
   

To Start


Whetstone Benchmark - NativeWhetstone2.apk, Java Whetstone.apk, WhetsNN.exe

This provides an overall rating in MWIPS, plus separate results for the eight test procedures in MFLOPS (floating point) and MOPS (functions and integer). For full details and results via Windows. Linux, Android and via different programming languages, see all modern Whetstone Benchmark results (including Windows tablet versions running on desktop PCs), also British Library Archived Files Results up to 2012 and Whetstone Benchmark History and Results from the 1960’s.

Below are results from the original benchmark for comparison with the new one, compiled for 32 bit systems. The initial aim was to show performance improvements of using native code on Intel Atom processors, rather than via the Houdini compatibility translation, where speeds of system A1 were around twice as fast. Note original ARM version (from here) performance differences on A1, Intel Atom based tablet, following upgrade to Android 5.0.

The downside of the later gcc 4.8 compilation were much slower MWIPS ratings using ARM CPUs. This was due to the extremely slow speeds on the EXP tests that dominate overall running time.

On a given platform, as other CPU only benchmarks, performance tends to be proportional to CPU MHz. Considering this, the particular code appears to suit the Qualcomm Snapdragon 800 and shows no real advantage of the ARM v8-A53 over the V7 varieties.

Java results are also included, particularly to show the effects of Android 5 using ART virtual machine instead of Dalvik. For this particular benchmark, there are gains and losses, but all are slower than the native compiled versions.

A5 and W2 Dual Boot Tablet - Differences in results from Microsoft and Android compilers are reflected. Atom Z8300 results for W2 are slower than W1. Later, similar results were obtained.



 System   ARM   MHz Android MWIPS  ------MFLOPS-------   ------------MOPS--------------
 See      CPU        Build           1      2      3     COS   EXP  FIXPT      IF  EQUAL

 Original ARM Version
 A1    Z3745   1866  4.4.2 1075.4  373.8  311.5  284.5  21.9  14.2 1421.1  1839.2  797.0
 A1    Z3745   1866  5.0    773.4  548.6  371.9  368.2  11.4  11.3 1545.4  2139.5  793.1
 A5    Z8300   1840  5.1    747.7  475.7  376.8  315.2  11.1  11.6 1424.5  1988.9  686.0
 T7    v7-A9   1200  4.1.2 1115.0  271.3  250.7  256.4  25.8  14.6 1190.0  1797.0 1198.7
 T22   v8-A53  1300  5.0.2 1433.7  348.0  319.3  308.2  36.3  19.8 1551.4  1861.9  611.0
 T11   v7-A15  1700  4.2.2 1477.7  363.9  220.6  307.5  39.7  18.0 1690.5  2527.9 1127.9
 T21   QU-800  2150  4.4.3 2035.1  665.7  640.0  531.6  45.2  23.1 3535.2  3180.4 2120.0

 ARM/Intel 32 Bit Version
 A1    Z3745   1866  4.4.2 1888.4  665.8  504.4  492.0  35.7  27.5 3191.4  3585.8 2146.7
 A5 ## Z8300   1840  5.1   1896.6  612.8  472.8  455.5  39.0  26.5 2916.9  3458.9 1943.3
 T7     v7-A9  1200  4.1.2  731.1  273.6  253.0  252.8  28.0   5.0 1185.2  2383.4 1192.1
 T11   v7-A15  1700  4.2.2  907.4  363.3  327.1  303.1  33.6   6.3 1506.9  2476.5 1122.6
 T21   QU-800  2150  4.4.3 1973.8  679.6  648.4  525.6  44.7  21.9 3516.7  3147.2 1567.7
 T22   v8-A53  1300  5.0.2  834.7  348.9  312.7  310.9  36.7   5.4 1556.7  1867.2  570.5

 ARM/Intel 64 Bit Version
 T22   v8-A53  1300  5.0.2 1494.2  347.1  307.0  305.9  37.5  20.6 1552.2  1863.7 1239.1

 Intel/Windows 32 Bit Version 
 W1 Atom Z8300 1840 Win10  1833    630    661    478    35.0  27.3 1302    1094  10833
 W2 ##   Z8300 1840 Win10  1710    604    616    439    32.6  25.7 1232    1060  10148

 Intel/Windows 64 Bit Version 
 W1 Atom Z8300 1840 Win10  1985    662    477    476    43.0  28.9 1244    1053  10931
 W2 ##   Z8300 1840 Win10  1918    584    458    461    40.7  28.8 1205     992  10527

 Java Version
 A1    Z3745   1866  4.4.2  687.9  428.4  346.8  105.8  27.4  20.0  723.8   172.6   60.1
 A1    Z3745   1866  5.0    826.9  182.0  307.1  126.8  32.5  22.3  382.1   126.6  129.6
 A5 ## Z8300   1840  5.1    833.2  286.1  292.3  130.9  32.3  22.1  449.4   127.1  116.9
 T7    v7-A9   1200  4.4.2  341.8   75.6  104.9   62.1  16.1   5.2  214.3   194.1   43.4
 T7    v7-A9   1200  5.1.1  411.3  102.2  157.5   70.4  12.6   8.3  266.1   104.8   78.9
 T11   v7-A15  1700  4.2.2  533.9  131.4  209.4  102.5  20.4   6.7  475.8   174.8  105.7
 T21   QU-800  2150  4.4.3  855.1  124.0  272.9  159.3  36.3  20.1  572.2   169.3   78.3
 T22   v8-A53  1300  5.0.2  363.9   83.0  149.3   65.1  10.8   7.6  268.1    88.2   59.6
 T22   v8-A53  1300  5.1    391.3   86.0  155.7   66.8  12.3   8.6  277.8    89.8   61.4

 W1 Atom Z8300 1840  Java8  843.6  435.6  336.0  174.3  20.3  12.7  826.3   154.8  788.4
 W2 ##   Z8300 1840  Java8  749.5  374.7  290.7  151.4  17.8  12.0  738.2   133.5  664.3
  
                          ## A5 and W2 Same Dual Boot Tablet
  

To Start


Dhrystone Benchmark - Dhrystone2i.apk, Dhry2NN.exe

The Dhrystone integer benchmark produces a performance rating in Vax MIPS (AKA DMIPS). Further details of the Dhrystone benchmark, and results from Windows and Linux based PCs, can be found in all modern Dhrystone Benchmark results (including Windows tablet versions running on desktop PCs) with those up to late 2012 also in British Library Archives. The shown ratio, MIPS/MHz, is often quoted, with this depending on compiler optimisation (or over-optimisation) but is normally constant using the same benchmark on the same range of processors.

Using native x86 code, performance of the Intel Atom based tablet A1 is 30% faster than the original ARM to Intel translated program but, on the other systems, the newer 32 bit compilations are slower. At least tablet T22 is nearly twice as fast when compiled for 64 bit operation. Following an upgrade to Android 5.0, A1 ARM to Intel translation produced performance equivalent to native code. Original can be obtained from here.


 System   ARM    MHz   Android       Vax       MIPS
 See                                MIPS       /MHz

 Original ARM Version
 A1    Z3745   1866     4.4.2       1840       0.99
 A1    Z3745   1866     5.0         2488       1.33
 A5 ## Z8300   1840     5.1         2365       1.29
 T7    v7-A9   1200     4.1.2       1610       1.34
 T22   v8-A53  1300     5.0.2       1683       1.29
 T11   v7-A15  1700     4.2.2       3189       1.88
 T21   QU-800  2150     4.4.3       3854       1.79

 ARM/Intel 32 Bit Version
 A1    Z3745   1866     4.4.2       2451       1.31
 A4    Z8300   1840     5.1.1       2430       1.32
 A5 ## Z8300   1840     5.1         2318       1.26
 T7    v7-A9   1200     4.1.2       1317       1.10
 T11   v7-A15  1700     4.2.2       2551       1.50
 T21   QU-800  2150     4.4.3       3319       1.54

 T22   v8-A53  1300     5.0.2       1423       1.09

 ARM/Intel 64 Bit Version
 T22   v8-A53  1300     5.0.2       2569       1.98

 Intel/Windows 32 Bit Version 
 W1 Atom Z8300 1840  Win 10         3044       1.65
 W2 ##   Z8300 1840  Win 10         2906       1.60

 Intel/Windows 64 Bit Version 
 W1 Atom Z8300 1840  Win 10         3291       1.79
 W2 ##   Z8300 1840  Win 10         3195       1.74

      ## A5 and W2 Same Dual Boot Tablet
   

To Start


Linpack Benchmark - LinpackDP2.apk, LinpackSP2.apk, LinpackJava.apk,
NEON-Linpacki.apk, Linpack32SSE.exe, Linpack64SSE.exe

The Linpack benchmark speed is measured in MFLOPS, officially for double precision floating point calculations. A version was produced using NEON functions, that only provides single precision operation. So, for comparison purposes, an available C code option, to define single precision data, was used to produce a new version and this has usually lead to a higher MFLOPS speed. Results from various hardware and software platforms can be found in all modern Linpack Benchmark results (including Windows tablet versions running on desktop PCs), with those up to 2013 in British Library Archives.

Single and Double Precision (SP and DP) - Performance of the Linpack benchmark is almost entirely dependent on the calculation x[i]=x[i]+c*y[i], where changes in CPU instructions used can have a dramatic effect. Using the original benchmarks, SP was often nearly twice as fast as DP, affected by use of such as ARM vfpv4 instructions that execute fused multiply-accumulate instructions. Tablet T11 has vfpv4 but T7 does not - See System Details. As a reminder, the original programs on Intel CPUs used a slower instruction conversion procedure, with the later ones with native Intel code. The newer compilations could make further use of advanced instructions, changing relative SP/DP and 32/64 bit performance.

Java results are also included, particularly to show the effects of Android 5 using ART virtual machine instead of Dalvik. For this particular benchmark, ART produces improvements, except for the original A1 Intel Atom results, all are slower than the native compiled versions

NEON-Linpack - This is a single precision version, using NEON intrinsic functions, where further details can be found in android neon benchmarks.htm. Performance can be double that of normal SP results. Note little gain on T22, at 64 bits, where normal SP might also use advanced SIMD instructions.

Following an upgrade to Android 5.0, A1 ARM to Intel translation produced some significant performance gains but Java based speeds were slower. Original ARM only version can be obtained from here.

Windows - both the latest 64 bit and 32 bit compilations use SSE2 instructions (32 bit incompatible with old systems), with similar speeds. Sample Single Precision compilations also produced similar performance.

 
 System   ARM    MHz   Android  LinpackDP  LinpackSP NEONLinpack LinpackJava
 See                              MFLOPS     MFLOPS  SP MFLOPS     MFLOPS

 Original ARM Version
  A1    Z3745   1866     4.4.2    168.16     296.63     443.42     252.49
  A1    Z3745   1866     5.0      253.83     293.20     680.85     166.09
  A5 ## Z8300   1840     5.1      238.04     318.00     746.36     174.67
  T7    v7-A9   1200     4.1.2    151.05     201.30     376.00      56.44
  T22   v8-A53  1300     5.0.2    156.70     184.09     393.34      86.09
  T11   v7-A15  1700     4.2.2    459.17     803.04    1334.90     143.06
  T21   QU-800  2150     4.4.3    389.52     751.95    1250.14     340.44

 ARM/Intel 32 Bit Version
  A1    Z3745   1866     4.4.2    362.63     408.87     900.17    See above 
  A1    Z3745   1866     5.0      363.98     406.59     900.64    See above
  A5 ## Z8300   1840     5.1      609.39     644.32     942.12    See above
  T7    v7-A9   1200     4.1.2    159.34     199.84     346.78
  T7    v7-A9   1200     5.1.1    160.25     198.96     346.12      89.50
  T22   v8-A53  1300     5.0.2    172.28     180.64     407.08
  T22   v8-A53  1300     5.1      178.04     187.03     421.86      91.28
  T11   v7-A15  1700     4.2.2    826.36     952.88    1411.86    See above
  T21   QU-800  2150     4.4.3    629.92     790.83    1325.00    See above

 ARM/Intel 64 Bit Version
  T22   v8-A53  1300     5.0.2    338.00     479.69     505.12
  T22   v8-A53  1300     5.1      347.55     492.78     520.79
  P33   QU-810  2000     5.0.2              1277.76 

Intel/Windows 32 Bit Version 
  W1 Atom Z8300 1840    Win 10    615.80                          See 64b 
  W2 ##   Z8300 1840    Win 10    613.50                          See 64b 

Intel/Windows 64 Bit Version 
  W1 Atom Z8300 1840    Win 10    638.75                           254.73
  W2 ##   Z8300 1840    Win 10    636.00                           265.66

              ## A5 and W2 Same Dual Boot Tablet
    

To Start


Livermore Loops Benchmark - LivermoreLoops2.apk, Lloops32.exe, Lloops64.exe

The Livermore Loops comprise 24 kernels of numerical application with speeds calculated in MFLOPS. A summary is also produced, with maximum, minimum and various mean values, geometric mean being the official average. As for other of these benchmarks, details and results from various hardware and software platforms are provided in all modern Livermore Loops Benchmark results (including Windows tablet versions running on desktop PCs), with results up to late 2012 in British Library Archives.

The following comparisons are derived from shown MFLOPS of the 24 kernels for each system. divided by CPU MHz, and compared to those from T7 Cortex-A9 CPU. They can indicate the effectiveness of particular levels of hardware and compiler technology. The large maximum ratios are due to different CPUs executing certain kernels much faster.

These are followed by MFLOPS comparisons for A1 Atom to show performance improvement in the ARM to Intel instruction conversion, that came with Android 5, comparing the new native Intel code version with the latter. he Cortex-A7 New/Original ratios are also provided (with similar average but wide variety).

MFLOPS per MHz comparisons with Cortex A9 CPU Bits Avg Min Max CPU Bits Avg Min Max Cortex-A15 Android 32 1.52 0.85 3.23 Qualcomm 800 Android 32 1.30 0.65 5.94 Cortex-A53 Android 32 0.87 0.62 1.26 Cortex-A53 Android 64 1.48 0.57 7.38 Atom Z8300 Windows 32 1.95 1.16 7.27 Atom Z8300 Windows 64 1.62 0.92 5.16 Atom Z3745 Android 32 1.97 1.06 8.11 Other MFLOPS Comparisons Atom Z3745 Android 5/4.4 1.39 1.06 2.02 Atom Z3745 Android Nat/5 2.03 1.11 3.67 Cortex-A7 Android New/Old 1.05 0.16 1.51

The Android and Windows Intel Atom speeds perform much better than the ARM CPUs of the same MHz at 32 bits, but the 64 bit compilation is slower than at 32 bits (old floating point can be faster than SSE2 instructions for double precision). On the other hand, the Cortex-A53 can be slower than than the Cortex-A9 at 32 bits, but 64 bit operation is much faster. The Cortex-A15 is best of the ARM compatible CPU bunch at 32 bits.

Original ARM only version can be obtained from here.


 System   CPU   MHz Android                           MFLOPS 24 Loops
 
 Original ARM Version ----------------------------------------------------------------

   A1    Z3745 1866  4.4.2  9.5 secs     201.2   257.3   237.5   205.6   122.5   180.0
                                         308.3   450.0   535.3   370.4   104.8    77.1
  Max   Average Geomean Harmean   Min     80.0    95.1   153.8   136.4   202.0   268.9
 535.8   201.9   172.4   146.7    48.8   179.5   209.7   145.0    95.0   254.2    51.3
 
   A1    Z3745 1866  5.0       9.9 secs  374.9   274.8   327.6   295.6   247.9   227.8
                                         468.5   538.6   569.2   396.2   167.9   141.9
  Max   Average Geomean Harmean   Min    109.6   114.5   210.5   150.5   250.6   333.4
 569.8   266.6   233.5   199.8    59.9   287.9   238.0   261.3   114.9   372.8    64.0

   T7   v7-A9 1200  4.1.2     10.0 secs  241.7   233.4   383.5   388.7    98.4   147.1
                                         293.1   258.5   314.6   181.1    99.1    95.3
  Max   Average Geomean Harmean   Min     80.6    68.1   171.6   226.9   346.2   176.9
 391.9   202.1   181.3   160.9    68.1   202.6   184.9   119.5   102.1   200.9    88.5

   T11  v7-A15 1700  4.2.2    10.0 secs  646.8   671.1   839.9   789.7   176.2   671.6
                                        1078.4  1243.4  1018.8   367.0   130.0   165.9
  Max   Average Geomean Harmean   Min    117.6   210.7   370.5   521.1   657.3   625.4
1252.8   476.0   375.8   288.8    90.8   270.8   269.1   458.3   196.3   432.5   112.7

   T21  QU-800 2150  4.4.3    10.0 secs  570.4   624.2   915.6   861.4   175.5   545.4
                                         636.9   911.1   750.6   293.9   130.5   207.0
  Max   Average Geomean Harmean   Min    115.0   159.8   330.5   327.1   608.7   592.8
1075.5   437.1   356.7   284.4   100.3   330.2   267.3   244.2   153.8   356.2   106.2
  
 ARM/Intel 32 Bit Version ------------------------------------------------------------

   A1    Z3745 1866  4.4.2     9.5 secs  484.6   529.2  1031.2   929.2   274.5   365.6
                                         661.9   873.1   825.6   479.1   612.9   520.7
  Max   Average Geomean Harmean   Min    156.8   324.4   339.4   497.8   693.1   481.8
1031.2   480.0   429.8   378.6   154.7   373.0   329.1   388.6   181.8   650.1   169.2

   A5 ## Z8300 1840  5.1      9.6 secs   689.4   701.4  1108.3   873.6   230.1   488.4
                                         662.2   770.0   876.7   404.9   439.6   428.2
  Max   Average Geomean Harmean   Min    141.2   280.7   293.4   466.1   540.3   432.7
1108.3   495.8   433.6   370.6   133.2   313.9   307.8   649.7   176.1   662.0   148.3

   T11  v7-A15 1700  4.2.2    10.0 secs  496.9   814.9   843.7   801.7   175.5   188.6
                                        1223.8  1411.4   760.3   452.5   132.7   120.7
  Max   Average Geomean Harmean   Min    107.1   264.7    34.3   529.0   592.6   728.2
1411.4   471.2   342.1   219.5    34.3   275.2   266.8   530.7   198.8   502.8   117.8


   T21  QU-800 2150  4.4.3    10.1 secs  640.9   814.9   813.8   808.4   201.6   182.0
                                         643.0  1158.9   779.9   351.4   133.1   176.2
  Max   Average Geomean Harmean   Min    113.6   178.4   286.5   294.7   516.7   667.5
1159.4   446.9   356.0   280.3   112.3   327.5   281.7   297.9   153.6   613.1   117.0

   T7    v7-A9 1200  4.4.2    10.2 secs  245.2   268.8   394.7   390.7   118.2   157.2
                                         297.4   308.1   344.7   226.7    90.8    74.7
  Max   Average Geomean Harmean   Min     85.6    81.7    26.9   227.5   338.9   240.3
 396.6   207.6   175.6   136.1    26.8   204.9   180.6   179.9   110.8   271.4    78.5

 ARM/Intel 32 Bit Version Then 64 Bit ------------------------------------------------

   T22  v8-A53 1300  5.0.2     9.7 secs  163.4   243.4   272.1   270.3   109.5   111.2
                                         282.2   389.0   360.6   219.6   124.0    61.8
  Max   Average Geomean Harmean   Min     67.6    87.4    27.3   224.2   340.1   241.9
 393.4   188.3   158.3   124.6    27.1   168.5   198.8   120.2   120.6   277.7    79.1

 ARM/Intel 64 Bit Version ------------------------------------------------------------

   T22  v8-A53 1300  5.0.2     9.7 secs  451.4   191.4   243.2   272.4   144.9   144.5
                                         749.4   411.1   453.6   261.1   138.0   206.1
  Max   Average Geomean Harmean   Min    122.5   130.1   215.0   249.8   411.6   395.4
 772.2   265.9   232.5   206.3    97.8   241.7   248.1   152.8   118.7   317.2   103.7


 Intel/Windows 32 Bit Version --------------------------------------------------------

  W1  Atom Z8300 1840 MHz Win10          721.4   702.3   862.7   988.7   245.3   489.6
                                         875.8   794.8   980.5   441.3   201.1   446.7
  Max   Average Geomean Harmean   Min    201.0   240.8   299.8   499.9   603.5   459.3
 988.7   504.9   448.8   395.8   189.6   446.6   336.0   607.8   199.0   705.3   277.8


  W2 ##    Z8300 1840 MHz Win10          749.7   731.3   894.0   988.1   251.2   489.2
                                         883.7   797.3   968.3   434.5   200.5   454.0
  Max   Average Geomean Harmean   Min    202.9   240.7   301.1   521.4   604.3   457.0
 988.1   503.5   446.9   393.5   183.5   443.6   333.7   587.1   200.6   697.1   276.4

 Intel/Windows 64 Bit Version --------------------------------------------------------

  W1  Atom Z8300 1840 MHz Win10          655.2   651.9   728.1   688.9   217.3   457.6
                                         732.2   735.7   965.5   378.3   170.7   381.6
  Max   Average Geomean Harmean   Min    196.6   196.4   213.0   434.4   522.6   420.6
 965.5   433.6   375.8   320.0   117.2   385.6   283.1   572.9   156.7   584.5   129.7

  W2 ##    Z8300 1840 MHz Win10          743.0   734.7   834.0   808.1   233.4   547.9
                                         878.6   857.4  1074.4   440.8   201.7   450.4
  Max   Average Geomean Harmean   Min    215.4   228.6   247.9   500.4   608.1   484.8
1074.4   500.3   433.7   369.8   143.4   440.7   327.7   650.4   180.8   682.2   151.6

                          ## A5 and W2 Same Dual Boot Tablet
  

To Start


MemSpeed Benchmark - MemSpeedi.apk, MemSpeed32.exe, MemSpeed64.exe

This benchmark measures data reading speeds in MegaBytes per second carrying out calculations on arrays of cache and RAM data, sized 2 x 8 KB to 2 x 32 MB. Calculations are x[m]=x[m]+s*y[m] and x[m]=x[m]+y[m], using double and single precision floating point and x[m]=x[m]+s+y[m] and x[m]=x[m]+y[m] with integers. Million Floating Point Operations Per Second (MFLOPS) speed can be calculated by dividing double precision MB/second by 8 and 16, for the two tests, and single precision speeds by 4 and 8. Assembly listings for integer tests show that Millions of Instructions Per Second (MIPS) can be found by multiplying MB/second by 0.78 with 2 adds and 0.66 for the other test. Cache sizes are indicated by varying performance as memory usage changes. For more details and older results see here, with results up to 2013 in British Library Archives.

The native ARM/Intel results, on Intel Atom based A1, averaged 44% faster than the original translated speeds via L1 cache data, 27% using L2 and 14% from RAM. Running under Android 5.0, the translated benchmark speeds were similar to the new version, in most cases. (Original ARM only version can be obtained from here).

Initial measurement, running the new 32 bit version on ARM CPUs, produced similar results to the original benchmark.

First results, to provide 64/32 bit comparisons on ARM CPUs, were on Tablet T22, where average 64/32 bit speed ratios, were 2.20 times, using cached data, and 1.58 times from RAM.

The benchmark is based on, and is similar to, my original Windows MemSpeed bencmarks, where details and results can he found here. These can be compared with the new Windows tablet version, from later compiler, with 32 bit and 64 bit results included below for Tablet W1 and a fast Core i7 desktop CPU.

A5 and W2 Dual Boot Tablet - Android and Windows speeds are quite similar. RAM speed appears to be slower than W1, that has the same hardware specification.

Following the results are processor technology comparisons with the ARM Cortex-A9 CPU, based on MB/second divided by CPU MHz, demonstrating that each has its strengths and weaknesses.


 ##################### T7 Original ######################

 T7, ARM Cortex-A9 1200 MHz, Android 4.1.2, DDR3 5.3 GB/s
       4 x 32 KB L1 cache, 1 MB shared L2 cache

 Android MemSpeed Benchmark 17-Oct-2012 20.19

              Reading Speed in MBytes/Second
  Memory  x[m]=x[m]+s*y[m] Int+   x[m]=x[m]+y[m]
  KBytes   Dble   Sngl    Int   Dble   Sngl    Int

      16   1735    888   2456   2726   1364   2818 L1
      32   1448    760   1474   1700   1039   1648
      64   1318    719   1290   1468    952   1385 L2
     128   1279    715   1289   1443    944   1336
     256   1268    714   1279   1435    943   1313
     512   1158    691   1204   1321    892   1228
    1024    729    553    735    772    632    742
    4096    445    392    425    442    421    439 RAM
   16384    435    390    428    435    412    431
   65536    445    404    393    450    432    449

          Total Elapsed Time   12.2 seconds

 #################### T7 ARM-Intel #####################

 ARM/Intel MemSpeed Benchmark 1.1 25-Apr-2015 12.24

              Reading Speed in MBytes/Second
  Memory  x[m]=x[m]+s*y[m] Int+   x[m]=x[m]+y[m]
  KBytes   Dble   Sngl    Int   Dble   Sngl    Int

      16   1856   1019   2537   2913   1459   2544
      32   1416    832   1327   1508    920   1345
      64   1286    779   1198   1418    908   1296
     128   1282    781   1195   1424    912   1305
     256   1278    774   1190   1433    878   1298
     512   1197    752   1122   1340    862   1216
    1024    833    626    822    903    695    857
    4096    463    420    456    463    440    459
   16384    459    426    453    455    435    458
   65536    463    430    411    462    443    452

          Total Elapsed Time   11.5 seconds

 
 #################### T11 Original #####################

 T11 Samsung EXYNOS 5250 1700 MHz Cortex-A15, Android 4.2.2
  2 GB DDR3-1600 RAM, dual channel, 12.8 GB/sec
    Dual core, 2 x 32 KB L1 cache, 1 MB shared L2 cache

  Android MemSpeed Benchmark 1.1 09-Aug-2013 17.04

              Reading Speed in MBytes/Second
  Memory  x[m]=x[m]+s*y[m] Int+   x[m]=x[m]+y[m]
  KBytes   Dble   Sngl    Int   Dble   Sngl    Int

      16   7296   4159   3513   9375   5453   6211 L1
      32   7253   4540   3882   7364   4873   4839
      64   6902   4265   3878   7026   4373   4274 L2
     128   6735   4032   2480   4005   2797   3288
     256   5859   3775   2192   4527   3263   3676
     512   5795   3781   3568   6282   3819   3818
    1024   2609   1757   1754   2607   1805   1825
    4096   1614   1422   1471   1654   1342   1441 RAM
   16384   1624   1412   1474   1642   1336   1443
   65536   1617   1408   1479   1368   1321   1423

          Total Elapsed Time   10.7 seconds
 
 #################### T11 ARM-Intel ####################

 ARM/Intel MemSpeed Benchmark 1.1 23-Apr-2015 12.26

              Reading Speed in MBytes/Second
  Memory  x[m]=x[m]+s*y[m] Int+   x[m]=x[m]+y[m]
  KBytes   Dble   Sngl    Int   Dble   Sngl    Int

      16   6540   4359   4580  10119   6292   6502
      32   8185   5132   4682   8729   4622   4465
      64   5770   3530   3473   5780   3447   3782
     128   5311   3386   3475   5225   3441   3451
     256   5667   3642   3678   5805   3643   3726
     512   5047   3318   3334   4869   3303   3337
    1024   2015   1469   1423   2050   1452   1386
    4096   1535   1322   1342   1598   1381   1385
   16384   1505   1379   1406   1584   1387   1384
   65536   1509   1306   1332   1585   1387   1382

          Total Elapsed Time   10.8 seconds


 #################### T21 Original #####################

 T21 Qualcomm Snapdragon 800 2150 MHz, Android 4.4.4
    Dual Channel 32 Bit LPDDR3-1866 RAM 14.9 GB/s
     L1 caches 4 x 16 KB, L2 cache shared 2048 KB

 Android MemSpeed Benchmark 1.1 02-Jun-2015 11.01

              Reading Speed in MBytes/Second
  Memory  x[m]=x[m]+s*y[m] Int+   x[m]=x[m]+y[m]
  KBytes   Dble   Sngl    Int   Dble   Sngl    Int

      16   8922   4635   3566  12412   5648   3774 L1
      32   5116   3542   2773   7594   4827   3657 L2 
      64   5174   3393   2684   5652   3757   3130
     128   5286   3387   2648   5443   3758   3194
     256   4937   3446   2889   7469   4624   3449
     512   4941   3459   2915   7452   4566   3724
    1024   4837   3449   2848   7065   4455   3722
    4096   2840   2606   2343   2581   2458   2567 RAM
   16384   2606   2423   2232   2395   2238   2338
   65536   2653   2453   2257   2457   2312   2420

          Total Elapsed Time    9.7 seconds

     Maximum SP MFLOPS 1159 Integer MIPS 2802


 #################### T21 ARM-Intel ####################

 ARM/Intel MemSpeed Benchmark 1.1 02-Jun-2015 11.27

              Reading Speed in MBytes/Second
  Memory  x[m]=x[m]+s*y[m] Int+   x[m]=x[m]+y[m]
  KBytes   Dble   Sngl    Int   Dble   Sngl    Int

      16   8074   4831   2603  11252   5065   3892 L1
      32   5302   4138   3709   7252   4985   3693 L2
      64   4801   3510   2832   5739   3684   3015 
     128   4502   3783   3577   5991   3914   3547
     256   4907   3913   3934   6876   4280   4056
     512   4686   3883   3921   6236   4215   4060
    1024   4716   3808   3823   6131   4185   3942 
    4096   2691   2603   2679   2249   2634   2709 RAM
   16384   2227   2223   2420   1798   2191   2445
   65536   2099   2106   2306   1738   2040   2346

          Total Elapsed Time    9.9 seconds

     Maximum SP MFLOPS 1207 Integer MIPS 2898


 
 ###################### T22 32 Bit ######################

  T22, ARM Cortex-A53 1300 MHz, Android 5.0.2 
  Single Channel RAM, LPDDR3 666 MHz, 5.3 GB/second
        4 x 32 KB L1 cache, 512 KB L2 cache

  ARM/Intel MemSpeed Benchmark 1.2 05-Aug-2015 17.16
           Compiled for 32 bit ARM v7a

              Reading Speed in MBytes/Second
  Memory  x[m]=x[m]+s*y[m] Int+   x[m]=x[m]+y[m]
  KBytes   Dble   Sngl    Int   Dble   Sngl    Int

      16   1940    971   1693   2470   1278   2084 L1
      32   1879    955   1676   2378   1255   1967
      64   1801    938   1615   2254   1218   1912 L2
     128   1706    941   1620   2279   1224   1872
     256   1818    935   1570   2291   1155   1875
     512   1633    884   1451   2008   1132   1704
    1024   1276    781   1181   1454    938   1324 RAM
    4096   1335    808   1260   1533   1010   1386
   16384   1342    813   1270   1487   1013   1419
   65536   1346    809   1274   1546   1031   1252

          Total Elapsed Time   11.7 seconds


###################### T22 64 Bit ######################

 ARM/Intel MemSpeed Benchmark 1.2 05-Aug-2015 17.29
           Compiled for 64 bit ARM v8a

              Reading Speed in MBytes/Second
  Memory  x[m]=x[m]+s*y[m] Int+   x[m]=x[m]+y[m]
  KBytes   Dble   Sngl    Int   Dble   Sngl    Int

      16   4092   2198   3951   5293   3611   4408 L1
      32   3753   2496   3630   4651   3300   3992
      64   3407   2388   3368   3715   3023   3677 L2
     128   3496   2462   3521   4137   3139   3844
     256   3535   2481   3573   4199   3322   3911
     512   3054   2248   3126   3556   2548   3372
    1024   1714   1704   2029   2069   1854   2099 RAM
    4096   1832   1595   1841   1914   1780   1897
   16384   1844   1601   1850   1925   1798   1891
   65536   1859   1608   1837   1921   1795   1812

          Total Elapsed Time   10.2 seconds


 #################### A1 Original #######################

 A1 Quad Core 1.86 GHz Intel Atom Z3745, Android 4.4
   Dual Channel LPDDR3-1066 Bandwidth 17.1 GB/s
            4 x 24 KB L1, 2 x 1 MB L2

  Android MemSpeed Benchmark 1.1 01-Feb-2015 10.06

              Reading Speed in MBytes/Second
  Memory  x[m]=x[m]+s*y[m] Int+   x[m]=x[m]+y[m]
  KBytes   Dble   Sngl    Int   Dble   Sngl    Int

      16   2773   1745   2821   5993   3274   3094 L1
      32   3088   1690   2451   4849   2769   2896 L2
      64   3066   1694   2245   3883   2434   2568 
     128   3084   1695   2261   3886   2466   2524
     256   3158   1732   2285   3964   2264   2176
     512   2666   1721   2295   3959   2505   2561
    1024   2938   1659   2163   3567   2356   2443
    4096   2775   1653   2123   3055   2307   2395 RAM
   16384   2827   1659   2121   3208   2321   2411
   65536   2840   1661   2112   3248   2314   2406
 
         Total Elapsed Time   10.8 seconds
 
 ############### A1 Original Android 5.0 ################

 Android MemSpeed Benchmark 1.1 26-Oct-2015 16.01

              Reading Speed in MBytes/Second
  Memory  x[m]=x[m]+s*y[m] Int+   x[m]=x[m]+y[m]
  KBytes   Dble   Sngl    Int   Dble   Sngl    Int

      16   3033   1738   4531   8287   3925   4532 L1
      32   3048   1701   3690   5913   3371   3737 L2
      64   3041   1727   3253   4613   2798   3137
     128   3040   1748   3263   4613   2848   3190
     256   3072   1746   3253   4568   2835   3070
     512   3055   1740   3229   4541   2824   3163
    1024   2841   1672   2969   4114   2689   2949
    4096   2727   1658   2797   3157   2643   2784 RAM
   16384   2746   1658   2786   3156   2646   2814
   65536   2672   1658   2807   3150   2643   2808

          Total Elapsed Time   10.4 seconds
 
 #################### A1 ARM-Intel ######################

 ARM/Intel MemSpeed Benchmark 1.1 23-Apr-2015 11.46

              Reading Speed in MBytes/Second
  Memory  x[m]=x[m]+s*y[m] Int+   x[m]=x[m]+y[m]
  KBytes   Dble   Sngl    Int   Dble   Sngl    Int

      16   3287   1859   4560   9789   4688   7316 L1
      32   3233   1856   3807   6633   3990   4030 L2
      64   3304   1860   2965   4457   2996   3894
     128   3303   1855   3006   4463   3113   3992
     256   3306   1860   2978   4463   3093   3946
     512   3307   1862   2964   4377   3097   3958
    1024   3031   1778   2766   3993   2867   3472
    4096   2863   1776   2692   3129   2763   3046 RAM
   16384   2857   1776   2702   3063   2768   3050
   65536   2865   1765   2702   3176   2782   3087

          Total Elapsed Time   10.1 seconds


 ########### A5 ARM-Intel Dual Boot With W2 #############

   Intel Atom Z8300 quad core 1.44 GHz Turbo 1.84
         Android 5.1, 4 GB DDR 3 1600
             4 x 24 KB L1, 2 x 1 MB L2

 ARM/Intel MemSpeed Benchmark 1.2 24-Mar-2016 16.38
           Compiled for 32 bit Intel x86

              Reading Speed in MBytes/Second
  Memory  x[m]=x[m]+s*y[m] Int+   x[m]=x[m]+y[m]
  KBytes   Dble   Sngl    Int   Dble   Sngl    Int

      16   6580   3487   4236   6996   3453   6933 L1
      32   4446   2719   3320   5506   3068   3445 l2
      64   4781   2870   2783   4127   2670   3449
     128   4800   2800   2692   4107   2722   3482
     256   4847   2885   2642   4043   2715   3460
     512   4740   2954   2603   3948   2633   3220
    1024   3623   2444   2424   3376   2317   2850
    4096   2645   2429   2353   2583   2301   2577 RAM
   16384   2576   2390   2335   2603   2295   2545
   65536   2594   2213   2203   2658   2318   1902

          Total Elapsed Time   10.1 seconds

 
 ################# W1 Windows 10 32 bit #################

   Intel Atom Z8300 quad core 1.44 GHz Turbo 1.84
         Windows 10, 4 GB DDR 3 1600
             4 x 24 KB L1, 2 x 1 MB L2

 Memory Reading Speed Benchmark From C/C++ 18.00.21005.1 for x86

                Start of test Wed Dec 23 20:57:00 2015

  Memory   x[m]=x[m]+s*y[m] Int+   x[m]=x[m]+y[m]         x[m]=y[m]
  KBytes    Dble   Sngl  Int32   Dble   Sngl  Int32   Dble   Sngl  Int32
    Used    MB/S   MB/S   MB/S   MB/S   MB/S   MB/S   MB/S   MB/S   MB/S

      16    6533   3377   6232   6541   3239   7031  14212   3650   3444 L1
      32    4955   3008   3732   4656   2764   3582   8596   2339   2268 L2
      64    4502   2616   3202   4891   2841   3393   5115   1992   2106
     128    4741   2818   3767   4963   2996   3206   5177   2110   2165
     256    4795   2889   3096   4662   2864   3251   4947   2059   2088
     512    4792   2921   3452   4518   2924   3292   4936   2022   2230
    1024    4316   2491   3175   4201   2535   3163   4546   1974   1952
    4096    3585   2618   2993   3485   2503   2961   1856   1710   1798 RAM
   16384    3621   2676   3163   3629   2521   3218   1860   1825   1727
   65536    3567   2579   3219   3521   2555   3250   1858   1749   1771

                End of test Wed Dec 23 20:57:15 2015

 
 ################# W1 Windows 10 64 bit #################

 Memory Reading Speed Benchmark From C/C++ 18.00.21005.1 for x64

             Start of test Wed Dec 23 21:14:38 2015

  Memory   x[m]=x[m]+s*y[m] Int+   x[m]=x[m]+y[m]         x[m]=y[m]
  KBytes    Dble   Sngl  Int32   Dble   Sngl  Int32   Dble   Sngl  Int32
    Used    MB/S   MB/S   MB/S   MB/S   MB/S   MB/S   MB/S   MB/S   MB/S

      16    6553   3300   6519   6338   3134   6891   6406   3565   3518 L1
      32    5132   2814   3431   5130   3119   3451   5114   2156   2196 L2
      64    4348   2973   3207   4674   2802   3189   3334   2075   2216
     128    4600   2929   3082   4335   2974   3391   3234   2164   2020
     256    4351   2826   3529   4507   2930   3192   3256   1945   2059
     512    4673   3009   3415   4594   2698   3357   3278   2059   2072
    1024    4430   2519   2807   4154   2691   2938   2947   2049   1811
    4096    3541   2629   3227   3563   2654   2860   1858   1763   1829 RAM
   16384    3587   2762   3086   3541   2433   3172   1847   1733   1741
   65536    3508   2616   3097   3591   2568   3095   1841   1794   1684

                End of test Wed Dec 23 21:14:52 2015


  ######## W2 Windows 10 32 bit Dual Boot With A5 ########

   Intel Atom Z8300 quad core 1.44 GHz Turbo 1.84
         Windows 10, 4 GB DDR 3 1600
             4 x 24 KB L1, 2 x 1 MB L2

     Memory Reading Speed Benchmark From C/C++ 18.00.21005.1 for x86

               Start of test Mon Apr 11 23:42:19 2016

  Memory   x[m]=x[m]+s*y[m] Int+   x[m]=x[m]+y[m]         x[m]=y[m]
  KBytes    Dble   Sngl  Int32   Dble   Sngl  Int32   Dble   Sngl  Int32
    Used    MB/S   MB/S   MB/S   MB/S   MB/S   MB/S   MB/S   MB/S   MB/S

      16    6614   3624   6378   6504   3641   7247  14229   3548   3399 L1
      32    5123   3040   4017   5262   3055   3889   8664   2453   2423 K2
      64    4945   3006   3551   4974   2971   3531   5599   2293   2233
     128    4644   3009   3709   4983   3009   3516   5499   2134   2229
     256    4979   2964   3545   4986   3007   3520   5405   2251   2244
     512    4926   2991   3381   4914   2995   3550   5072   2221   1671
    1024    3400   2670   3343   4601   2723   3166   4483   2089   2093
    4096    2785   2582   2741   2747   2573   2720   1417   1415   1386 RAM
   16384    2678   2391   2730   2778   2501   2719   1409   1376   1412
   65536    2771   2540   2719   2754   2587   2749   1403   1403   1413

                End of test Mon Apr 11 23:42:34 2016

  ######## W2 Windows 10 64 bit Dual Boot With A5 ########

    Memory Reading Speed Benchmark From C/C++ 18.00.21005.1 for x64

               Start of test Mon Apr 11 23:46:41 2016

  Memory   x[m]=x[m]+s*y[m] Int+   x[m]=x[m]+y[m]         x[m]=y[m]
  KBytes    Dble   Sngl  Int32   Dble   Sngl  Int32   Dble   Sngl  Int32
    Used    MB/S   MB/S   MB/S   MB/S   MB/S   MB/S   MB/S   MB/S   MB/S

      16    6304   3285   7069   7052   3639   7304   7244   3641   3634 L1
      32    5264   3120   3665   5317   3117   3687   5451   2462   2406 L2
      64    4998   3074   3529   4999   3084   3568   3553   2294   2184
     128    4718   3084   3541   4973   3052   3552   3557   2226   2240
     256    4971   3074   3552   4977   3003   3538   3421   2245   2228
     512    4932   3083   3514   4953   3049   3546   3352   2228   2034
    1024    4664   2953   3329   4631   2916   3367   3106   2101   2078
    4096    2712   2553   2668   2686   2604   2687   1379   1354   1402 RAM
   16384    2787   2602   2737   2751   2586   2728   1417   1414   1413
   65536    2784   2620   2748   2703   2576   2751   1396   1418   1399

                End of test Mon Apr 11 23:46:56 2016


 ========================================================
        Top end 2015 PC - Core i7-4820K at 3.9 GHz 
 ========================================================

  Memory   x[m]=x[m]+s*y[m] Int+   x[m]=x[m]+y[m]         x[m]=y[m]
  KBytes    Dble   Sngl  Int32   Dble   Sngl  Int32   Dble   Sngl  Int32
    Used    MB/S   MB/S   MB/S   MB/S   MB/S   MB/S   MB/S   MB/S   MB/S
 32 Bit
      16   33625  18739  18205  33727  19627  19643  53146  14729  14722 L1
     256   31208  18912  17930  30724  19812  20422  29081  13025  12998 L2
    4096   24497  17539  17846  22309  14066  16152  17040  10884  10876 L3
   65536   14722  12402  12522  14576  12782  12889   7398   7598   7585 RAM

 64 Bit
      16   33775  16842  14901  33782  16948  16924  29180  14652  14666 L1
     256   31182  17652  15752  33216  17847  17604  23216  13254  13286 L2
    4096   25009  17093  15360  25601  17336  17079  14941  10864  10880 L3
   65536   14696  10754  12497  15026  13235  13400   7496   7652   7667 RAM


 ========================================================
             MB/sec/MHz Comparisons With Cortex-A9
 ========================================================

               x[m]=x[m]+s*y[m]Int+   x[m]=x[m]+y[m
                Dble   Sngl    Int   Dble   Sngl    Int

 T11      L1    2.49   3.02   1.27   2.45   3.04   1.80
 Cortex   L2    3.13   3.32   2.18   2.86   2.93   2.03
 A15      RAM   2.30   2.14   2.29   2.42   2.21   2.16

 T21      L1    2.43   2.65   0.57   2.16   1.94   0.85
 Qualcomm L2    2.14   2.82   1.85   2.68   2.72   1.74
 800      RAM   2.53   2.73   3.13   2.10   2.57   2.90

 T22      L1    0.96   0.88   0.62   0.78   0.81   0.76
 Cortex   L2    1.31   1.12   1.22   1.48   1.21   1.33
 A53      RAM   2.68   1.74   2.86   3.09   2.15   2.56

 A1       L1    1.14   1.18   1.16   2.17   2.07   1.86
 Atom     L2    1.67   1.55   1.61   2.00   2.27   1.96
 Z3745    RAM   3.98   2.64   4.23   4.42   4.04   4.39

 W1       L1    2.30   2.16   1.60   1.46   1.45   1.80
 Atom     L2    2.45   2.43   1.70   2.12   2.13   1.63
 Z8300    RAM   5.02   3.91   5.11   4.97   3.76   4.69

 PC       L1    5.57   5.66   2.21   3.56   4.14   2.38
 Core i7  L2    6.78   6.99   4.16   6.27   6.63   4.67
 4820K    RAM   3.52   4.90   3.22   3.16   4.33   3.06


 ========================================================
           64 bit compilations compared with 32 bit
 ARM shows significant gains (different instructions) but
 Intel/Windows does not (probably same SSE type instructions).            
 ========================================================

               x[m]=x[m]+s*y[m]Int+   x[m]=x[m]+y[m
                Dble   Sngl    Int   Dble   Sngl    Int

 T22      L1    2.11   2.26   2.33   2.14   2.83   2.12
 Cortex   L2    1.94   2.65   2.28   1.83   2.88   2.09
 A53      RAM   1.38   1.99   1.44   1.24   1.74   1.45

 W1       L1    1.00   0.98   1.05   0.97   0.97   0.98
 Atom     L2    0.91   0.98   1.14   0.97   1.02   0.98
 Z8300    RAM   0.98   1.01   0.96   1.02   1.01   0.95
   

To Start


NeonSpeed Benchmark - NeonSpeedi.apk

This benchmark carries out the same calculations as the MemSpeed Benchmark measuring data reading speeds in Mega Bytes per second, with functions accessing arrays of cache and RAM based data, sized 2 x 8 KB to 2 x 32 MB. Calculations are x[m]=x[m]+s*y[m] and x[m]=x[m]+y[m] single precision floating point with x[m]=x[m]+s+y[m] and x[m]=x[m]+y[m] with integers. Million Floating Point Operations Per Second (MFLOPS) speed can calculated by dividing single precision MB/second by 4 and 8, for the two tests. The first set of calculations use normal functions followed by some using NEON Intrinsic Functions. The last two columns are NEON only results. For further details and results see android neon benchmarks.htm.

On tablet A1, with the Intel Atom CPU, the 32 bit native code version produced some significant performance gains over the original ARM benchmark (available from here), but rerunning this via Android 5.0 produced much faster speeds, some better than native code compilation.

The later compiler produced some slower and some faster speeds on ARM based tablets.

Details are provided for the 64 bit version on T22. As with NEON-Linpack, many results from 32 bit and 64 bit compilations, via NEON intrinsic functions, were similar. With normal code, the 64 bit compilations were up to near four times faster than those at 32 bits.

Following the results are further MB per second/CPU MHz comparisons. Subject to variations due to cache occupancy, the comparisons for normal calculations are the same as MemSpeed. Then, more modern processors performed relatively better, using NEON instructions.


 ##################### T7 Original ######################

  T7, ARM Cortex-A9 1200 MHz, Android 4.1.2, 

   Android NeonSpeed Benchmark 15-Dec-2012 14.38

       Vector Reading Speed in MBytes/Second
  Memory  Float v=v+s*v  Int v=v+v+s   Neon v=v+v
  KBytes   Norm   Neon   Norm   Neon  Float    Int

      16    860   2575   2325   2918   3053   3245 L1
      32    950   2551   2400   2823   2944   3131
      64    744   1396   1329   1434   1465   1496 L2
     128    713   1342   1319   1365   1392   1417
     256    714   1339   1311   1357   1377   1400
     512    708   1323   1299   1348   1358   1383
    1024    608    875    869    917    930    952
    4096    460    493    492    481    488    504 RAM
   16384    460    498    487    507    506    504
   65536    459    495    469    251    503    505

          Total Elapsed Time   11.5 seconds

#################### T7 ARM-Intel #####################

 ARM/Intel NeonSpeed Benchmark V1.1 09-May-2015 18.07

       Vector Reading Speed in MBytes/Second
  Memory  Float v=v+s*v  Int v=v+v+s   Neon v=v+v
  KBytes   Norm   Neon   Norm   Neon  Float    Int

      16    881   2440   2501   3334   3206   3465
      32    901   1868   1705   2260   2083   2186
      64    801   1395   1365   1573   1548   1581
     128    784   1282   1278   1405   1389   1411
     256    787   1279   1285   1420   1380   1409
     512    777   1266   1267   1409   1370   1394
    1024    604    786    762    769    770    828
    4096    458    479    477    463    486    488
   16384    436    447    448    469    470    469
   65536    450    472    469    240    482    483

          Total Elapsed Time   11.5 seconds


 #################### T11 Original #####################

 T11 Samsung EXYNOS 5250 1.7 GHz Cortex-A15, Android 4.2.2

 Android NeonSpeed Benchmark V1.1 09-Aug-2013 17.10

       Vector Reading Speed in MBytes/Second
  Memory  Float v=v+s*v  Int v=v+v+s   Neon v=v+v
  KBytes   Norm   Neon   Norm   Neon  Float    Int

      16   3793   9641   4375  13023  13456  13562 L1
      32   5777  11410   4993  11718  11365  11143
      64   4122   6692   3855   6539   6682   7210 L2
     128   4017   6565   3849   6475   6520   6983
     256   4067   6562   3836   6459   6495   7038
     512   3900   6531   3820   6428   6490   7095
    1024   1821   2544   1774   2532   2554   2539
    4096   1141   1645   1536   1612   1615   1635 RAM
   16384   1437   1695   1490   1576   1694   1668
   65536   1424   1675   1475   1699   1687   1694

          Total Elapsed Time   11.2 seconds

 #################### T11 ARM-Intel ####################

 ARM/Intel NeonSpeed Benchmark V1.1 09-May-2015 18.17

       Vector Reading Speed in MBytes/Second
  Memory  Float v=v+s*v  Int v=v+v+s   Neon v=v+v
  KBytes   Norm   Neon   Norm   Neon  Float    Int

      16   2252   4964   3321   6602   7304   7237
      32   4202   8364   4543   8366   8553   8101
      64   3710   6096   3860   6570   6348   6182
     128   3802   5581   3874   6044   5624   5877
     256   3654   5618   3501   6154   5655   5783
     512   3597   5688   3723   6130   5812   5684
    1024   1727   2466   1659   2481   2454   2472
    4096   1479   1718   1421   1714   1713   1706
   16384   1488   1704   1435   1576   1705   1694
   65536   1477   1755   1453   1754   1759   1752

          Total Elapsed Time   10.8 seconds


 #################### T21 Original #####################

 T21 Qualcomm Snapdragon 800 2150 MHz, Android 4.4.4
     Dual Channel 32 Bit LPDDR3-1866 RAM 14.9 GB/s

 Android NeonSpeed Benchmark V1.1 23-Jul-2015 13.00

       Vector Reading Speed in MBytes/Second
  Memory  Float v=v+s*v  Int v=v+v+s   Neon v=v+v
  KBytes   Norm   Neon   Norm   Neon  Float    Int

      16   4324  13809   4498  14660  17501  18186 L1
      32   3587   6845   2922   8073   6981   7035 L2
      64   3347   6894   2912   8078   6964   6938
     128   3343   6651   2919   7922   6726   6999
     256   3511   6963   3002   8071   6902   6897
     512   3476   6628   3025   7827   6613   6818
    1024   3172   4627   2773   6424   4800   4806
    4096   2653   2051   2378   3613   2090   2054 RAM
   16384   2356   1891   2118   3165   1955   1962
   65536   2424   1923   2167   3368   1933   1925

          Total Elapsed Time    9.9 seconds

 #################### T21 ARM-Intel #################### 

 ARM/Intel NeonSpeed Benchmark V1.1 23-Jul-2015 13.03

       Vector Reading Speed in MBytes/Second
  Memory  Float v=v+s*v  Int v=v+v+s   Neon v=v+v
  KBytes   Norm   Neon   Norm   Neon  Float    Int

      16   3623  16704   4623  15187  17446  16719
      32   3455   9210   2997   8723   9280   9112
      64   3336   7721   3002   8544   8469   8581
     128   3415   7664   3111   8481   7549   7638
     256   3584   7526   3087   8500   7849   7805
     512   3538   7422   3154   8266   7567   7541
    1024   3513   7227   3067   7789   7294   7261
    4096   2302   1673   2413   3107   1693   1677
   16384   2286   1616   2323   3024   1620   1617
   65536   2322   1617   2271   2505   1634   1600

          Total Elapsed Time    9.9 seconds


 ###################### T22 32 Bit ######################

  T22, Quad Core ARM Cortex-A53 1300 MHz, Android 5.0.2 

 ARM/Intel NeonSpeed Benchmark V1.2 13-Aug-2015 16.32
           Compiled for 32 bit ARM v7a

       Vector Reading Speed in MBytes/Second
  Memory  Float v=v+s*v  Int v=v+v+s   Neon v=v+v
  KBytes   Norm   Neon   Norm   Neon  Float    Int

      16    971   3853   1807   4059   3957   4397 L1
      32    970   3812   1800   3983   3891   4323
      64    927   3228   1605   3038   3269   3521 L2
     128    926   3321   1681   3343   3354   3596
     256    936   3386   1693   3449   3413   3667
     512    898   2889   1578   2996   2927   3118
    1024    794   1859   1345   2057   1996   1924 RAM
    4096    794   1796   1250   1788   1813   1835
   16384    792   1773   1270   1820   1829   1864
   65536    796   1811   1289   1852   1832   1880

          Total Elapsed Time   11.3 seconds

 ###################### T22 64 Bit ######################

 ARM/Intel NeonSpeed Benchmark V1.2 13-Aug-2015 16.37
           Compiled for 64 bit ARM v8a

       Vector Reading Speed in MBytes/Second
  Memory  Float v=v+s*v  Int v=v+v+s   Neon v=v+v
  KBytes   Norm   Neon   Norm   Neon  Float    Int

      16   3054   4055   3605   4376   4911   5094
      32   2922   3787   3435   4198   4546   4682
      64   2795   3514   3259   3658   4050   4116
     128   2886   3529   3373   3924   4148   3963
     256   2883   3641   3264   3942   4193   4276
     512   2454   3165   2985   3385   3586   3542
    1024   1633   2000   1835   2043   2114   2105
    4096   1738   1893   1899   1900   1956   1955
   16384   1757   1870   1886   1802   1921   1846
   65536   1755   1875   1870   1903   1936   1937

          Total Elapsed Time   10.2 seconds


 #################### A1 Original #######################

 A1 Quad Core 1.86 GHz Intel Atom Z3745, Android 4.4
   Dual Channel LPDDR3-1066 Bandwidth 17.1 GB/s

 Android NeonSpeed Benchmark V1.1 02-Feb-2015 17.09

       Vector Reading Speed in MBytes/Second
  Memory  Float v=v+s*v  Int v=v+v+s   Neon v=v+v
  KBytes   Norm   Neon   Norm   Neon  Float    Int

      16   1778   3940   2807   5474   4997   5062 L1
      32   1781   3576   2636   4431   4316   4291 L2
      64   1772   3589   2639   4480   4337   4332
     128   1784   3589   2641   4423   4320   4320
     256   1766   3592   2642   4400   4347   4358
     512   1784   3585   2633   4375   4350   4355
    1024   1705   3253   2448   3760   3789   3788
    4096   1673   3021   2366   3257   3245   3237 RAM
   16384   1672   2948   2349   3062   3157   3151
   65536   1675   2967   2345   3190   3168   3168

          Total Elapsed Time   10.8 seconds
 
 ############### A1 Original Android 5.0 ################

 Android NeonSpeed Benchmark V1.1 04-Nov-2015 10.49

       Vector Reading Speed in MBytes/Second
  Memory  Float v=v+s*v  Int v=v+v+s   Neon v=v+v
  KBytes   Norm   Neon   Norm   Neon  Float    Int

      16   1746   5026   4902   7781   8155   8196
      32   1720   4652   4031   5868   6168   6192
      64   1641   4611   4043   5909   6233   6240
     128   1740   4709   4022   5811   6209   6216
     256   1740   4710   4040   5887   6222   6218
     512   1714   4526   3862   5594   5886   5893
    1024   1630   4134   3642   4876   5021   4740
    4096   1589   3344   3183   3407   3415   3412
   16384   1608   3329   3180   3385   3409   3390
   65536   1470   2962   3166   2778   3380   3384

          Total Elapsed Time   10.1 seconds

 #################### A1 ARM-Intel ######################

 ARM/Intel NeonSpeed Benchmark V1.1 09-May-2015 16.54

       Vector Reading Speed in MBytes/Second
  Memory  Float v=v+s*v  Int v=v+v+s   Neon v=v+v
  KBytes   Norm   Neon   Norm   Neon  Float    Int

      16   1816   5996   4916   6244   6882   6880
      32   1851   4703   3985   5200   5609   5711
      64   1862   3845   3121   4174   4441   4520
     128   1841   3929   3110   4179   4411   4487
     256   1863   3932   3092   4179   4412   4493
     512   1861   3938   3090   3894   4215   4415
    1024   1784   3475   2738   3130   3223   3443
    4096   1741   2376   2649   2998   3112   3139
   16384   1774   3086   2780   3116   3140   3145
   65536   1774   2987   2547   2328   3126   3072

          Total Elapsed Time   10.1 seconds


 #################### A5 ARM Intel ######################

   Intel Atom Z8300 quad core 1.44 GHz Turbo 1.84
         Android 5.1, 4 GB DDR 3 1600
             4 x 24 KB L1, 2 x 1 MB L2

 ARM/Intel NeonSpeed Benchmark V1.2 28-Mar-2016 11.19
           Compiled for 32 bit Intel x86

       Vector Reading Speed in MBytes/Second
  Memory  Float v=v+s*v  Int v=v+v+s   Neon v=v+v
  KBytes   Norm   Neon   Norm   Neon  Float    Int

      16   3523   6058   4676   6052   6245   6163
      32   2926   5163   3585   5275   5711   5322
      64   2988   4406   2894   4892   4786   4713
     128   2691   4085   2876   4648   4657   4662
     256   2856   4274   2912   4865   4752   4714
     512   2848   3929   2635   4550   4557   4509
    1024   2613   3536   2634   3629   3597   3633
    4096   2259   1928   2229   2407   2512   2503
   16384   1654   1849   2373   2543   2592   2595
   65536   2375   2632   2429   1902   2711   2731

          Total Elapsed Time   10.3 seconds


 ========================================================
             MB/sec/MHz Comparisons With Cortex-A9
 ========================================================

              Float v=v+s*v   Int v=v+v+s   Neon  v=v+v
                Norm   Neon   Norm   Neon  Float    Int

 A1       L1    1.32   1.62   1.50   1.48   1.73   1.68
 Atom     L2    1.52   1.98   1.55   1.89   2.06   2.05
 Z3745    RAM   2.54   4.07   3.49   6.24   4.17   4.09

 T11      L1    3.29   3.16   1.88   2.61   2.90   2.62
 Cortex   L2    3.27   3.17   2.07   3.07   2.99   2.88
 A15      RAM   2.32   2.62   2.19   5.16   2.58   2.56

 T21      L1    2.30   3.82   1.03   2.54   3.04   2.69
 Qualcomm L2    2.54   3.28   1.34   3.34   3.17   3.09
 800      RAM   2.88   1.91   2.70   5.83   1.89   1.85

 T22      L1    0.99   1.88   0.97   1.63   1.72   1.83
 Cortex   L2    1.10   2.44   1.22   2.24   2.28   2.40
 A53      RAM   1.63   3.54   2.54   7.12   3.51   3.59
   

To Start


BusSpeed Benchmark - BusSpeedv7i.apk, BusSpeed32.exe, BusSpeed64.exe

This benchmark (based on PC version with details and results here) is designed to identify reading data in bursts over buses. The program starts by reading a word (4 bytes) with an address increment of 32 words (128 bytes) before reading another word. The increment is reduced by half on successive tests, until all data is read. On reading data from RAM, 64 Byte bursts are typically used. Then, measured reading speed reduces from a maximum, when all data is read, to a minimum on using 16 word increments (64 bytes). Potential maximum speed can be estimated by multiplying this minimum value by 16. With this burst rate, measured speed at 32 word and 16 word increments are likely to be the same. Cache sizes are indicated by varying speed as memory use changes. Note, with smallest L1 cache demands, measured speed can be low due to overheads when reading little data. For more details and further results see here, with results up to 2013 in British Library Archives.

Comparing results from different versions, on a particular system, there can be unusual differences on burst reading speeds. Those quoted here are for the most important measurements for reading all data.

On Intel Atom based tablet A1, there was little difference between the old ARM version, with conversion, to the new 32 bit native code program, nor using Android 5.0 instead of 4.4.

Average revised 32 bit version performance improvements, via caches/RAM, were 8%/17% for T7 Cortex-A9, 11%/27% for T11 Cortex-A15 and 27%/-8% on T21 Snapdragon 800. Corresponding T22 Cortex-A53 64/32 bit improvements were 61%/25%.

After the results are further MB per second/CPU MHz comparisons, for this integer data streaming benchmark that can demonstrate maximum data transfer speed from RAM. As the latter might not be dependent on CPU speed, direct MB/second comparisons are also provided. These are dependent on bus speed, 32 bit or 64 bit bus width and whether one or two channels are available, one problem being that is it is often difficult to identify what is provided. Note that multithreaded benchmarks might be needed to fully utilise memory bandwidth - see later results.

Results of the Windows version are also included for a tablet and, for comparison purposes, a desk top PC with 4 memory channels. Intel systems have 64 bit bus widths.

A5 and W2 Dual Boot Tablet - Android and Windows speeds are again quite similar. Simillarly, with RAM speed slower than W1.


 ##################### T7 Original ######################

 T7, ARM Cortex-A9 1200 MHz, Android 4.1.2, DDR3 5.3 GB/s
      4 x 32 KB L1 cache, 1 MB shared L2 cache

 Android BusSpeed Benchmark 19-Oct-2012 17.29

    Reading Speed 4 Byte Words in MBytes/Second
  Memory  Inc32  Inc16   Inc8   Inc4   Inc2   Read
  KBytes  Words  Words  Words  Words  Words    All

      16   2723   2420   3044   3364   3499   3500 L1
      32   1054   1087   1061   1382   1565   2145
      64    436    433    419    652    751   1160 L2
     128    345    337    337    542    633    943
     256    329    309    322    522    614    961
     512    339    299    311    506    574    937
    1024    170    168    180    269    349    629
    4096     59     55     84    127    176    338 RAM
   16384     56     56     83    125    173    335
   65536     56     56     82    125    174    334

          Total Elapsed Time    5.7 seconds
 
 #################### T7 ARM-Intel #####################

 ARM/Intel BusSpeed Benchmark 1.1 v7 25-Apr-2015 12.30

    Reading Speed 4 Byte Words in MBytes/Second
  Memory  Inc32  Inc16   Inc8   Inc4   Inc2   Read
  KBytes  Words  Words  Words  Words  Words    All

      16   2940   3344   3625   3866   3862   3893
      32    698    707    682   1071   1208   1826
      64    448    477    465    726    851   1357
     128    367    355    292    542    657   1070
     256    334    344    341    546    651   1059
     512    326    336    336    531    629   1025
    1024    169    175    197    309    411    749
    4096     58     58     83    131    191    395
   16384     56     57     83    129    189    392
   65536     56     48     82    129    187    388

          Total Elapsed Time    5.6 seconds


 #################### T11 Original #####################

 T11 Samsung EXYNOS 5250 1.7 GHz Cortex-A15, Android 4.2.2
   2 GB DDR3-1600 RAM, dual channel, 12.8 GB/sec
     2 x 32 KB L1 cache, 1 MB shared L2 cache

 Android BusSpeed Benchmark 1.1 v7 09-Aug-2013 17.07

    Reading Speed 4 Byte Words in MBytes/Second
  Memory  Inc32  Inc16   Inc8   Inc4   Inc2   Read
  KBytes  Words  Words  Words  Words  Words    All

      16   3193   3451   4412   5272   5389   6191 L1
      32   1298   1558   1990   3478   4264   4420
      64    804    928   1209   2442   3263   3426 L2
     128    784    904   1175   2321   3148   3333
     256    780    908   1181   2336   3142   3327
     512    788    907   1165   2312   3120   3300
    1024    360    387    384    803   1348   1744
    4096    145    146    194    507    648   1378 RAM
   16384    141    136    190    507    638   1373
   65536    142    141    191    506    643   1371

          Total Elapsed Time    5.3 seconds

 #################### T11 ARM-Intel ####################

 ARM/Intel BusSpeed Benchmark 1.1 v7 23-Apr-2015 12.15

    Reading Speed 4 Byte Words in MBytes/Second
  Memory  Inc32  Inc16   Inc8   Inc4   Inc2   Read
  KBytes  Words  Words  Words  Words  Words    All

      16   2085   3208   4055   4553   5272   5758
      32   1282   1811   2498   4182   4867   5163
      64    600    864   1309   2974   3504   3841
     128    614    892   1310   3027   3500   3826
     256    614    892   1337   3050   3509   3828
     512    618    888   1319   3042   3382   3811
    1024    425    479    444   1244   1803   2291
    4096    146    146    191    590   1050   1751
   16384    141    139    186    585   1039   1725
   65536    139    139    187    585   1039   1721

          Total Elapsed Time    5.3 seconds


 #################### T21 Original #####################

 T21 Qualcomm Snapdragon 800 2150 MHz, Android 4.4.4
   Dual Channel 32 Bit LPDDR3-1866 RAM 14.9 GB/s
   L1 caches 4 x 16 KB, L2 cache shared 2048 KB

 Android BusSpeed Benchmark 1.1 v7 04-Jun-2015 17.00

    Reading Speed 4 Byte Words in MBytes/Second
  Memory  Inc32  Inc16   Inc8   Inc4   Inc2   Read
  KBytes  Words  Words  Words  Words  Words    All

      16   1382   1350   3122   4300   4938   5283 L1
      32   1106   1118   2026   2637   3786   5210 L2
      64   1064   1118   2058   2679   3820   5251
     128   1123   1170   2081   2688   3669   4166
     256   1121   1196   2109   2623   3873   3429
     512    940   1127   2050   2684   3777   4795
    1024    951   1124   2038   2655   3759   4950
    4096    239    375    472    806   1486   2679 RAM
   16384    239    370    464    806   1476   2656
   65536    239    368    495    854   1537   2792

          Total Elapsed Time    5.0 seconds


 #################### P36 ARM-Intel ####################

 P36 LGE LG-H811 Qualcomm® Snapdragon 808, 1.8 GHz 
        64-bit Hexa-Core, Android 5.1

 ARM/Intel BusSpeed Benchmark 1.2 05-Jan-2016 12.24
           Compiled for 64 bit ARM v8a

    Reading Speed 4 Byte Words in MBytes/Second
  Memory  Inc32  Inc16   Inc8   Inc4   Inc2   Read
  KBytes  Words  Words  Words  Words  Words    All

      16   1211   1532   3673   5686   6244   6484
      32   1514   1952   3190   5034   6258   6576
      64    794   1096   2109   3840   5643   6171
     128    721   1005   1928   3603   5416   6032
     256    721   1004   1938   3606   5423   6025
     512    724   1009   1941   3610   5428   6022
    1024    384    411    698   1112   1732   2607
    4096    198    228    459    859   1529   2659
   16384    199    231    462    930   1624   2936
   65536    194    232    467    923   1635   3036

          Total Elapsed Time    5.2 seconds


 #################### T21 ARM-Intel ####################

 ARM/Intel BusSpeed Benchmark 1.1 v7 04-Jun-2015 17.00

    Reading Speed 4 Byte Words in MBytes/Second
  Memory  Inc32  Inc16   Inc8   Inc4   Inc2   Read
  KBytes  Words  Words  Words  Words  Words    All

      16   1328   1442   2797   4291   4699   5685
      32   1165   1100   1933   2848   3603   5844
      64   1147   1055   2007   2846   3586   5890
     128   1181   1136   2008   2711   3600   5878
     256   1185   1126   2018   2716   3568   5873
     512   1022   1026   1805   2525   3378   5611
    1024    796    843   1584   2202   3088   5053
    4096    199    294    431    657   1166   2409
   16384    200    299    430    659   1167   2408
   65536    205    301    436    668   1173   2380

          Total Elapsed Time    5.2 seconds


 ###################### T22 32 Bit ######################

 T22, ARM Cortex-A53 1300 MHz, Android 5.0.2 
 LPDDR3 RAM, single channel, 1333 MHz = 5.3 GB/sec
     4 x 32 KB L1 cache, 512 KB shared L2 cache

 ARM/Intel BusSpeed Benchmark 1.2 06-Aug-2015 10.57
           Compiled for 32 bit ARM v7a

    Reading Speed 4 Byte Words in MBytes/Second
  Memory  Inc32  Inc16   Inc8   Inc4   Inc2   Read
  KBytes  Words  Words  Words  Words  Words    All

      16    874    932   1814   2302   2355   2263 L1
      32    758    803   1309   1820   2323   2386
      64    653    671   1203   1741   2206   2332 L2
     128    603    620   1107   1693   2222   2351
     256    574    589   1075   1711   2211   2327
     512    332    372    681   1075   1863   2120
    1024    137    193    371    578   1322   2129 RAM
    4096    172    179    351    567   1151   2126
   16384    172    178    351    504   1117   2136
   65536    172    177    349    478    882   2129

          Total Elapsed Time    5.3 seconds


 ###################### T22 64 Bit ######################

 T22, ARM Cortex-A53 1300 MHz, Android 5.0.2 

 ARM/Intel BusSpeed Benchmark 1.2 06-Aug-2015 11.02
           Compiled for 64 bit ARM v8a

    Reading Speed 4 Byte Words in MBytes/Second
  Memory  Inc32  Inc16   Inc8   Inc4   Inc2   Read
  KBytes  Words  Words  Words  Words  Words    All

      16   3188   3635   3937   4327   4372   4462
      32   1478   1607   2246   3382   3853   4144
      64    600    622   1163   2011   2972   3585
     128    558    575   1056   1889   2892   3525
     256    538    550   1028   1826   2837   3260
     512    371    425    813   1490   2403   3202
    1024    136    196    382    728   1423   2750
    4096    170    177    346    669   1340   2652
   16384    169    174    341    678   1352   2663
   65536    168    174    341    676   1347   2611

          Total Elapsed Time    5.2 seconds


 #################### A1 Original #######################

 A1 Quad Core 1.86 GHz Intel Atom Z3745, Android 4.4
   Dual Channel LPDDR3-1066 Bandwidth 17.1 GB/s
            4 x 24 KB L1, 2 x 1 MB L2

 Android BusSpeed Benchmark 1.1 v7 21-Dec-2014 16.06

    Reading Speed 4 Byte Words in MBytes/Second
  Memory  Inc32  Inc16   Inc8   Inc4   Inc2   Read
  KBytes  Words  Words  Words  Words  Words    All

      16   4178   3473   6270   6713   6759   6869 L1
      32   1420   1529   2252   2686   3702   5108 L2
      64   1385   1498   2276   2629   3657   5108
     128   1394   1542   2278   2614   3640   5092
     256   1410   1576   2258   2607   3259   5110
     512   1417   1574   2274   2602   3700   5119
    1024    349    428    888   1431   2848   4306 
    4096    215    265    593   1181   2289   3891 RAM
   16384    210    266    596   1181   2278   3897
   65536    220    272    600   1193   2346   3886

          Total Elapsed Time    5.1 seconds

 ################## A1 V1 Android 5.0 ###################

 Android BusSpeed Benchmark 1.1 v7 29-Oct-2015 11.31

    Reading Speed 4 Byte Words in MBytes/Second
  Memory  Inc32  Inc16   Inc8   Inc4   Inc2   Read
  KBytes  Words  Words  Words  Words  Words    All

      16   5747   5958   6843   7041   7163   7220
      32   1446   1722   2244   2725   3745   5288
      64   1436   1745   2322   2738   3777   5314
     128   1461   1779   2348   2734   3749   5259
     256   1462   1787   2367   2735   3761   5285
     512   1450   1757   2331   2593   3707   5258
    1024    376    484   1107   1845   2954   4545
    4096    217    258    585   1179   2310   3895
   16384    217    255    593   1172   2302   3894
   65536    214    257    593   1163   2305   3916

          Total Elapsed Time    5.1 seconds

 #################### A1 ARM-Intel ######################

 ARM/Intel BusSpeed Benchmark 1.1 v7 22-Apr-2015 21.42

    Reading Speed 4 Byte Words in MBytes/Second
  Memory  Inc32  Inc16   Inc8   Inc4   Inc2   Read
  KBytes  Words  Words  Words  Words  Words    All

      16   4845   5705   6403   6926   7094   7167
      32   1407   1716   2255   2646   3713   5094
      64   1395   1703   2257   2689   3754   4843
     128   1283   1571   2108   2620   3671   5135
     256   1416   1753   2288   2679   3687   5178
     512   1439   1372   2251   2510   3679   5183
    1024    350    409    942   1696   2792   4403
    4096    213    253    564   1188   2173   3631
   16384    219    259    600   1189   2330   3920
   65536    218    259    599   1102   2323   3716

          Total Elapsed Time    5.1 seconds


 ########### A5 ARM-Intel Dual Boot With W2 #############

   Intel Atom Z8300 quad core 1.44 GHz Turbo 1.84
         Android 5.1, 4 GB DDR 3 1600
             4 x 24 KB L1, 2 x 1 MB L2

 ARM/Intel BusSpeed Benchmark 1.2 28-Mar-2016 11.44
           Compiled for 32 bit Intel x86

    Reading Speed 4 Byte Words in MBytes/Second
  Memory  Inc32  Inc16   Inc8   Inc4   Inc2   Read
  KBytes  Words  Words  Words  Words  Words    All

      16   5857   5444   6672   6835   6777   6923
      32   1432   1525   2134   2492   3511   4828
      64   1400   1523   2392   2688   3485   4690
     128   1444   1546   2253   2419   3153   4427
     256   1410   1588   2387   2750   3593   4949
     512   1464   1567   2367   2692   3530   4643
    1024    236    284    601   1276   2118   3462
    4096    176    202    322    561   1505   3000
   16384    173    202    417    796   1585   3061
   65536    172    199    429    655   1095   2189

          Total Elapsed Time    5.0 seconds


 ################# W1 Windows 10 32 bit #################

   Intel Atom Z8300 quad core 1.44 GHz Turbo 1.84
         Windows 10, 4 GB DDR 3 1600

    BusSpeed From C/C++ 18.00.21005.1 for x86

     Start of test Wed Dec 23 20:52:24 2015

    Reading Speed 4 Byte Words in MBytes/Second
  Memory  Inc32  Inc16   Inc8   Inc4   Inc2   Read
  KBytes  Words  Words  Words  Words  Words    All

      16   5481   5833   6688   6396   6329   6647 L1
      32   1407   1581   2161   2678   3546   4436 L2
      64   1409   1480   2297   2678   3534   5089
     128   1461   1588   2441   2618   3505   4762
     256   1346   1593   2441   2631   3571   4988
     512   1398   1576   2299   2636   3283   4980
    1024    902   1038   1845   2011   2994   4384
    4096    237    270    570   1095   2100   3684 RAM
   16384    239    273    565   1083   2118   3870
   65536    240    275    547   1089   2088   3746

        End of test Wed Dec 23 20:52:34 2015

 ################# W1 Windows 10 64 bit #################

    BusSpeed From C/C++ 18.00.21005.1 for x64

     Start of test Wed Dec 23 21:06:27 2015

    Reading Speed 4 Byte Words in MBytes/Second
  Memory  Inc32  Inc16   Inc8   Inc4   Inc2   Read
  KBytes  Words  Words  Words  Words  Words    All

      16   5047   5323   6188   6349   6742   6265
      32   1456   1518   2177   2725   3657   5063
      64   1364   1475   2274   2884   3535   4991
     128   1485   1584   2180   2797   3627   5055
     256   1368   1397   2257   2739   3387   4724
     512   1477   1518   2377   2595   3448   4851
    1024    707    709   1295   2038   2529   3913
    4096    239    270    559   1089   2131   3693
   16384    239    272    567   1095   2094   3635
   65536    236    269    566   1096   2115   3854

        End of test Wed Dec 23 21:06:37 2015


  ######## W2 Windows 10 32 bit Dual Boot With A5 ########

   Intel Atom Z8300 quad core 1.44 GHz Turbo 1.84
         Windows 10, 4 GB DDR 3 1600
             4 x 24 KB L1, 2 x 1 MB L2

    BusSpeed From C/C++ 18.00.21005.1 for x86

     Start of test Mon Apr 11 23:59:13 2016

    Reading Speed 4 Byte Words in MBytes/Second
  Memory  Inc32  Inc16   Inc8   Inc4   Inc2   Read
  KBytes  Words  Words  Words  Words  Words    All

      16   5166   5420   6209   6078   6748   6330
      32   1371   1475   2118   2587   3272   4429
      64   1388   1397   2211   2488   3167   4437
     128   1296   1444   2245   2586   3353   4427
     256   1414   1445   2293   2511   3299   4468
     512   1309   1404   2128   2795   3473   4563
    1024    677    748   1433   1906   2295   3853
    4096    180    210    446    858   1674   3273
   16384    182    215    438    845   1649   3244
   65536    180    212    434    851   1666   3251

        End of test Mon Apr 11 23:59:23 2016


  ######## W2 Windows 10 64 bit Dual Boot With A5 ########

    BusSpeed From C/C++ 18.00.21005.1 for x64

     Start of test Mon Apr 11 23:47:35 2016

    Reading Speed 4 Byte Words in MBytes/Second
  Memory  Inc32  Inc16   Inc8   Inc4   Inc2   Read
  KBytes  Words  Words  Words  Words  Words    All

      16   4767   5335   6232   6677   6468   6875
      32   1473   1440   2227   2691   3362   4752
      64   1398   1556   2234   2702   3616   4768
     128   1404   1571   2256   2606   3485   4869
     256   1416   1422   2228   2535   3567   4747
     512   1488   1593   2271   2614   3444   4854
    1024    940    755   1698   2256   3179   4449
    4096    184    212    443    860   1647   3246
   16384    179    211    437    856   1654   3154
   65536    183    213    441    842   1651   3271

        End of test Mon Apr 11 23:47:45 2016


 ========================================================
        Top end 2015 PC - Core i7-4820K at 3.9 GHz 
 ========================================================

    Reading Speed 4 Byte Words in MBytes/Second
  Memory  Inc32  Inc16   Inc8   Inc4   Inc2   Read
  KBytes  Words  Words  Words  Words  Words    All
32 Bit
      16  13170  13179  18786  19440  19399  19373
     256   5411   5708  10003  14452  17994  20098
    4096   2713   2780   5358   9510  15915  19871
   65536    660    963   2064   4220   8659  12891

64 Bit
      16  13183  13185  20016  20441  20481  20495
     256   6436   6636  11038  16140  19734  21099
    4096   2701   2776   5339   9573  16245  20827
   65536    685   1041   2183   4265   8731  15120


 ========================================================
             MB/sec/MHz Comparisons With Cortex-A9
 ========================================================

               Inc32  Inc16   Inc8   Inc4   Inc2   Read    RAM
               Words  Words  Words  Words  Words    All MB/sec

 T11      L1    0.50   0.68   0.79   0.83   0.96   1.04
 Cortex   L2    1.30   1.83   2.77   3.94   3.80   2.55
 A15      RAM   1.75   2.04   1.61   3.20   3.92   3.13   4.44

 T21      L1    0.25   0.24   0.43   0.62   0.68   0.82
 Qualcomm L2    1.98   1.83   3.30   2.78   3.06   3.10
 800      RAM   2.04   3.50   2.97   2.89   3.50   3.42   6.13

 T22      L1    0.27   0.26   0.46   0.55   0.56   0.54
 Cortex   L2    1.59   1.58   2.91   2.89   3.14   2.03
 A53      RAM   2.84   3.40   3.93   3.42   4.35   5.07   5.49

 A1       L1    1.06   1.10   1.14   1.16   1.19   1.19
 Atom     L2    2.74   3.29   4.33   3.17   3.65   3.15
 Z3745    RAM   2.51   3.48   4.71   5.51   8.01   6.18   9.58

 W1       L1    1.22   1.14   1.20   1.08   1.07   1.11
 Atom     L2    2.63   3.02   4.67   3.14   3.58   3.07
 Z8300    RAM   2.80   3.74   4.35   5.51   7.28   6.30   9.65

 PC       L1    1.38   1.21   1.59   1.55   1.55   1.53
 Core i7  L2    4.98   5.11   9.03   8.14   8.50   5.84
 4820K    RAM   3.63   6.17   7.74  10.07  14.25  10.22  33.22


 ========================================================
           64 bit compilations compared with 32 bit
 ========================================================

 T22      L1    3.65   3.90   2.17   1.88   1.86   1.97
 Cortex   L2    0.94   0.93   0.96   1.07   1.28   1.40
 A53      RAM   0.98   0.98   0.98   1.41   1.53   1.23

 W1       L1    0.92   0.91   0.93   0.99   1.07   0.94
 Atom     L2    1.02   0.88   0.92   1.04   0.95   0.95
 Z8300    RAM   0.98   0.98   1.03   1.01   1.01   1.03

 PC       L1    1.00   1.00   1.07   1.05   1.06   1.06
 Core i7  L2    1.19   1.16   1.10   1.12   1.10   1.05
 4820K    RAM   1.04   1.08   1.06   1.01   1.01   1.17
   

To Start


RandMem Benchmark - RandMemi.apk, RandMem32.exe, RandMem64.exe

RandMem benchmark carries out four tests at increasing data sizes to produce data transfer speeds in MBytes Per Second from caches and memory. Serial and random address selections are employed, using the same program structure, with read and read/write tests using 32 bit integers. The main purpose is to demonstrate how much slower performance can be through using random access. Here, speed can be considerably influenced by reading and writing in bursts, where much of the data is not used, and by the size of preceding caches. For more details and further results see here, with results up to 2013 in British Library Archives.

On tablet A1, with the Intel Atom processor, results for the new 32 bit version were essentially the same as the Houdini instruction conversion of original ARM code via Android 5, both averaging 30% improvement, over the original Android 4 speeds on read only tests, but similar with reading and writing. The latter pattern of improvements were also apparent for 64 bit versus 32 bit benchmark modes on tablet T22, with the ARM Cortex-A53 processor, but only using cache based data. The later 32 bit benchmark produced inconsistent gains and some losses, running on the other ARM compatible systems (up to October 2015).

The benchmark code is the same as used on the Windows and Linux PC versions, with details and results here, where some of these results are also included.

Further MB per second/CPU MHz and 64 bit/32 bit comparisons are provided, showing the usual variability in performance.


 ##################### T7 Original ######################

 T7,   ARM Cortex-A9 1200 MHz, Android 4.1.2, 

  Android RandMem Benchmark 20-Oct-2012 11.14

    MBytes/Second Transferring 4 Byte Words  
   Memory     Serial.......     Random.......
   KBytes     Read   Rd/Wrt     Read   Rd/Wrt

       16     2788     3041     2795     3041 L1
       32     2769     3011     2767     3020
       64     1027     1038      839      911 L2
      128      916      918      616      649
      256      904      905      514      538
      512      899      907      475      499
     1024      712      699      345      354
     4096      323      284       92       88 RAM
    16384      316      282       73       70
    65536      314      281       65       62

          Total Elapsed Time   10.9 seconds

 #################### T7 ARM-Intel #####################

 ARM/Intel RandMem Benchmark 1.1 25-Apr-2015 12.33

    MBytes/Second Transferring 4 Byte Words
   Memory     Serial.......     Random.......
   KBytes     Read   Rd/Wrt     Read   Rd/Wrt

       16     2521     3175     2490     3038
       32     1427     1451     1218     1446
       64     1133     1052      853      907
      128     1039      871      646      650
      256     1028      909      543      518
      512     1025      895      499      502
     1024      700      489      242      236
     4096      487      282       90       88
    16384      483      281       71       70
    65536      478      274       63       62

          Total Elapsed Time   11.3 seconds


 #################### T11 Original #####################

 T11 Samsung EXYNOS 5250 2.0 GHz Cortex-A15, Android 4.2.2
                Measured 1.7 GHz

 Android RandMem Benchmark 1.1 13-Aug-2013 17.29

    MBytes/Second Transferring 4 Byte Words  
   Memory     Serial.......     Random.......
   KBytes     Read   Rd/Wrt     Read   Rd/Wrt

       16     2881     2478     3388     3650 L1
       32     4301     2968     3197     3249
       64     3669     2511     2201     2249 L2
      128     3566     2560     1571     1566
      256     3557     2461     1334     1256
      512     3524     2547     1136     1098
     1024     1933     1144      534      513
     4096     1993     1064      184      173 RAM
    16384     1970     1086      141      144
    65536     1973     1117      106      104

          Total Elapsed Time    9.1 seconds

 #################### T11 ARM-Intel ####################

 ARM/Intel RandMem Benchmark 1.1 23-Apr-2015 20.42

    MBytes/Second Transferring 4 Byte Words
   Memory     Serial.......     Random.......
   KBytes     Read   Rd/Wrt     Read   Rd/Wrt

       16     3642     3102     5464     4114
       32     5462     3409     4096     3737
       64     4800     2785     2028     2064
      128     4308     2575     1572     1589
      256     4381     2574     1332     1260
      512     4311     2544     1215     1097
     1024     2033     1156      513      471
     4096     1891     1042      213      178
    16384     2028     1032      154      139
    65536     2033     1055      109      106

          Total Elapsed Time    9.2 seconds


 #################### T21 Original #####################

 T21 Qualcomm Snapdragon 800 2150 MHz, Android 4.4.4

 Android RandMem Benchmark 1.1 10-Jun-2015 12.43

    MBytes/Second Transferring 4 Byte Words  
   Memory     Serial.......     Random.......
   KBytes     Read   Rd/Wrt     Read   Rd/Wrt

       16     4407     4704     3995     4900 L1
       32     2611     3071     2207     2703 L2
       64     2496     2797     1821     2139
      128     2080     3173     1668     1758
      256     2425     3183     1439     1520
      512     2359     3116     1193     1355
     1024     2366     3117      368      382
     4096     2293     2280      201      209 RAM
    16384     2293     2237      170      175
    65536     2299     2261      146      150

          Total Elapsed Time    8.5 seconds


#################### T21 ARM-Intel ####################

 ARM/Intel RandMem Benchmark 1.1 10-Jun-2015 12.45

    MBytes/Second Transferring 4 Byte Words  
   Memory     Serial.......     Random.......
   KBytes     Read   Rd/Wrt     Read   Rd/Wrt

       16     5005     4626     4067     4863
       32     3253     2994     2246     2622
       64     3223     2855     1986     2072
      128     2861     3128     1912     1776
      256     3246     3174     1666     1523
      512     3195     3111     1469     1372
     1024     3190     3079      369      383
     4096     3027     2381      212      213
    16384     3065     2300      174      177
    65536     3080     2281      150      150

          Total Elapsed Time    8.6 seconds


 ###################### T22 32 Bit ######################

  T22, ARM Cortex-A53 1300 MHz, Android 5.0.2 

 ARM/Intel RandMem Benchmark 1.2 06-Aug-2015 12.29
           Compiled for 32 bit ARM v7a

    MBytes/Second Transferring 4 Byte Words
   Memory     Serial.......     Random.......
   KBytes     Read   Rd/Wrt     Read   Rd/Wrt

       16     2807     3606     2753     3595 L1
       32     2719     3433     1429     1930
       64     2615     3266      914     1166 L2
      128     2592     3243      705      828
      256     2570     3223      637      720
      512     2367     2684      237      347
     1024     2137     1855      120      163 RAM
     4096     1918     1658       83       97
    16384     2152     1665       74       85
    65536     2104     1652       72       64

          Total Elapsed Time   11.6 seconds


###################### T22 64 Bit ######################

  T22, ARM Cortex-A53 1300 MHz, Android 5.0.2 

 ARM/Intel RandMem Benchmark 1.2 06-Aug-2015 12.32
           Compiled for 64 bit ARM v8a

    MBytes/Second Transferring 4 Byte Words
   Memory     Serial.......     Random.......
   KBytes     Read   Rd/Wrt     Read   Rd/Wrt

       16     3865     3033     3798     3027
       32     3622     2760     3105     2734
       64     3094     2803     1011     1077
      128     3074     2740      776      801
      256     3050     2771      718      693
      512     2420     2463      270      371
     1024     1322     1853      131      164
     4096     1754     1598       87      100
    16384     1791     1586       75       91
    65536     1856     1609       57       68

          Total Elapsed Time   14.6 seconds


 #################### A1 Original #######################

 A1 Quad Core 1.86 GHz Intel Atom Z3745, Android 4.4
   Dual Channel LPDDR3-1066 Bandwidth 17.1 GB/s

  Android RandMem Benchmark 1.1 01-Feb-2015 10.12

    MBytes/Second Transferring 4 Byte Words
   Memory     Serial.......     Random.......
   KBytes     Read   Rd/Wrt     Read   Rd/Wrt

       16     3434     5064     3462     5113 L1
       32     2833     4042     2652     3645 L2
       64     2837     4058     2068     2561 
      128     2822     4041     1809     2205
      256     2828     4040     1435     1755
      512     2816     3997     1245     1456
     1024     2578     3256      379      445
     4096     2412     1946      209      268 RAM
    16384     2485     2039      179      217
    65536     2457     2041      140      170

          Total Elapsed Time   11.8 seconds

 ################## A1 V1 Android 5.0 ###################

 Android RandMem Benchmark 1.1 29-Oct-2015 12.11

    MBytes/Second Transferring 4 Byte Words
   Memory     Serial.......     Random.......
   KBytes     Read   Rd/Wrt     Read   Rd/Wrt

       16     5138     5638     5113     5611
       32     3778     4226     3683     3774
       64     3767     4249     2715     2647
      128     3747     4234     2305     2261
      256     3652     4218     1805     1847
      512     3739     4209     1521     1565
     1024     3300     3442      562      653
     4096     3026     2094      268      277
    16384     3009     2075      213      220
    65536     2918     2064      157      176

          Total Elapsed Time   16.6 seconds

 #################### A1 ARM-Intel ######################

 ARM/Intel RandMem Benchmark 1.1 23-Apr-2015 17.27

    MBytes/Second Transferring 4 Byte Words
   Memory     Serial.......     Random.......
   KBytes     Read   Rd/Wrt     Read   Rd/Wrt

       16     4291     5626     4584     5630
       32     3217     3792     3492     3783
       64     3677     4253     2629     2644
      128     3666     4241     2299     2289
      256     3688     3930     1829     1850
      512     3682     4189     1522     1592
     1024     3285     3558      562      667
     4096     2999     2007      272      274
    16384     3019     2065      210      220
    65536     2989     2068      141      186

          Total Elapsed Time    8.8 seconds


 ########### A5 ARM-Intel Dual Boot With W2 #############

   Intel Atom Z8300 quad core 1.44 GHz Turbo 1.84
         Android 5.1, 4 GB DDR 3 1600
             4 x 24 KB L1, 2 x 1 MB L2

 ARM/Intel RandMem Benchmark 1.2 28-Mar-2016 11.46
           Compiled for 32 bit Intel x86

    MBytes/Second Transferring 4 Byte Words
   Memory     Serial.......     Random.......
   KBytes     Read   Rd/Wrt     Read   Rd/Wrt

       16     4167     5227     4397     5225
       32     3425     3779     3123     3518
       64     3249     3853     2386     2284
      128     3169     3635     2110     2127
      256     3271     3604     1752     1758
      512     3361     3847     1515     1503
     1024     2782     2843      457      564
     4096     2565     1845      216      233
    16384     2592     1922      149      186
    65536     2583     1286      125      144

          Total Elapsed Time    9.5 seconds


 ################# W1 Windows 10 32 bit #################

   Intel Atom Z8300 quad core 1.44 GHz Turbo 1.84
         Windows 10, 4 GB DDR 3 1600

 RandMem Benchmark From C/C++ 18.00.21005.1 for x86

     Start of test Wed Dec 23 21:04:02 2015

     MBytes/Second Transferring 4 Byte Words  
   Memory     Serial.......     Random.......
   KBytes     Read   Rd/Wrt     Read   Rd/Wrt

       16     4081     5576     4641     5577
       32     3436     3900     3561     3247
       64     3413     3923     2703     2738
      128     3416     3891     2336     2386
      256     3437     3901     1819     1830
      512     3385     3897     1574     1592
     1024     3204     3592     1256     1328
     4096     2831     2745      248      272
    16384     2882     2733      203      219
    65536     2812     2735      146      160

       End of test Wed Dec 23 21:04:13 2015

 ################# W1 Windows 10 64 bit #################

 RandMem Benchmark From C/C++ 18.00.21005.1 for x64

     Start of test Wed Dec 23 21:21:13 2015

     MBytes/Second Transferring 4 Byte Words  
   Memory     Serial.......     Random.......
   KBytes     Read   Rd/Wrt     Read   Rd/Wrt

       16     3075     3919     3494     4406
       32     2691     3496     2958     3159
       64     2952     3174     2312     2405
      128     2734     3342     2002     1984
      256     2720     3257     1742     1627
      512     2864     3261     1517     1503
     1024     2573     2994     1036     1148
     4096     2555     2603      248      303
    16384     2543     2576      230      204
    65536     2670     2646      142      173

       End of test Wed Dec 23 21:21:26 2015


  ######## W2 Windows 10 32 bit Dual Boot With A5 ########

   Intel Atom Z8300 quad core 1.44 GHz Turbo 1.84
         Windows 10, 4 GB DDR 3 1600
             4 x 24 KB L1, 2 x 1 MB L2

 RandMem Benchmark From C/C++ 18.00.21005.1 for x86

     Start of test Tue Apr 12 00:04:05 2016

     MBytes/Second Transferring 4 Byte Words  
   Memory     Serial.......     Random.......
   KBytes     Read   Rd/Wrt     Read   Rd/Wrt

       16     4036     5400     4334     5181
       32     2992     3674     3178     3591
       64     2999     3415     2345     2402
      128     2985     3405     2162     2087
      256     3082     3374     1638     1688
      512     3251     3879     1578     1564
     1024     3047     3002      905     1099
     4096     2469     2054      238      253
    16384     2682     2029      194      205
    65536     2532     2050      127      152

       End of test Tue Apr 12 00:04:17 2016

  ######## W2 Windows 10 64 bit Dual Boot With A5 ########

 RandMem Benchmark From C/C++ 18.00.21005.1 for x64

     Start of test Tue Apr 12 00:05:25 2016

     MBytes/Second Transferring 4 Byte Words  
   Memory     Serial.......     Random.......
   KBytes     Read   Rd/Wrt     Read   Rd/Wrt

       16     3144     4044     3490     4284
       32     2855     3283     2864     3277
       64     2660     3387     2408     2396
      128     2955     3260     2220     2115
      256     2910     3301     1756     1683
      512     2935     3463     1600     1482
     1024     2545     2705     1127     1223
     4096     2368     2031      238      254
    16384     2533     2030      196      199
    65536     2466     2043      136      158

       End of test Tue Apr 12 00:05:43 2016


 ==============================================
   Top end 2015 PC - Core i7-4820K at 3.9 GHz 
 ==============================================

     MBytes/Second Transferring 4 Byte Words  
   Memory     Serial.......     Random.......
   KBytes     Read   Rd/Wrt     Read   Rd/Wrt
32 Bit
       16    25790    26457    24349    25976
      256    22820    22093    10557     7046
     4096    20516    16317     2212     2036
    65536    13013    10116      654      644

64 Bit
       16    25947    25477    25277    26063
      256    21456    21136     9080     6107
     4096    21368    17510     2205     2173
    65536    12410    10451      676      643


 ===========================================
    MB/sec/MHz Comparisons With Cortex-A9
 ===========================================

                  Serial.....   Random.....
                  Read Rd/Wrt   Read Rd/Wrt

 T11      L1      1.02   0.69   1.55   0.96
 Cortex   L2      3.01   2.00   1.73   1.72
 A15      RAM     3.00   2.72   1.22   1.21

 T21      L1      1.11   0.81   0.91   0.89
 Qualcomm L2      1.76   1.95   1.71   1.64
 800      RAM     3.60   4.65   1.33   1.35

 T22      L1      1.03   1.05   1.02   1.09
 Cortex   L2      2.31   3.27   1.08   1.28
 A53      RAM     4.06   5.57   1.05   0.95

 A1       L1      1.10   1.14   1.19   1.20
 Atom     L2      2.31   2.79   2.17   2.30
 Z3745    RAM     4.03   4.87   1.44   1.94

 W1       L1      1.06   1.15   1.22   1.20
 Atom     L2      2.18   2.80   2.18   2.30
 Z8300    RAM     3.84   6.51   1.51   1.68

 PC       L1      3.15   2.56   3.01   2.63
 Core i7  L2      6.83   7.48   5.98   4.19
 4820K    RAM     8.38  11.36   3.19   3.20

 ===========================================
   64 bit compilations compared with 32 bit
 ===========================================

 T22      L1      1.38   0.84   1.38   0.84
 Cortex   L2      1.19   0.86   1.13   0.96
 A53      RAM     0.88   0.97   0.79   1.06

 W1       L1      0.75   0.70   0.75   0.79
 Atom     L2      0.79   0.83   0.96   0.89
 Z8300    RAM     0.95   0.97   0.97   1.08

 PC       L1      1.01   0.96   1.04   1.00
 Core i7  L2      0.94   0.96   0.86   0.87
 4820K    RAM     0.95   1.03   1.03   1.00
   

To Start

FFT Benchmarks - fft1.apk, fft3c.apk,
For Windows - fft1-32.exe, fft3c-32.exe, fft1-64.exe, fft3c-64.exe

The benchmarks run code for single and double precision Fast Fourier Transforms of size 1024 to 1048576 (1K to 1024K), each one being run three times to identify variance. Results are displayed and saved in a log file (FFT-tests.txt), with FFT running time in milliseconds. Besides Android, the bechmarks are available to run via Windows and Linux. Two versions are available FFT1, original version and with optimised C code as FFT3c. Further details, results, and links for benchmarks and source code are in FFTBenchmarks.htm. The Android benchmarks are only available in the later 32 or 64 bit mode. Example results are below.

Version 3 Improvements - All systems produced significant gains, using the optimised benchmark, but some struggled running the smaller FFTs.

64 Bit Differences - Initially, only one tablet was available that runs at 64 bits, a Lenovo TAB 2 A8-50F using Android 5. In this case, 64 bit and 32 bit results were similar for the non-optimised version, but averaged 40% faster with the more efficient code.

Double and Single Precision - Using 64 bit DP numbers, instead of 32 bit for SP, can produce much slower speeds when a lower level cache space is exceeded and also though using more RAM based data. Other than these, there are slower and faster results.

Android Upgrades - First identified upgrades to Android 5, indicated better average performance but with wide variations on individual tests.

Intel/Windows 10 - 32 bit and 64 bit Intel/Windows results are now included for Atom and Core i7 CPUs.

A5 and W2 Dual Boot Tablet - Android and Windows speeds are again generally, similar, except for Version 3, where W2 is faster. Again W2 results uning RAM are slower than W1.


               Single Precision and Double Precision Results in milliseconds 

         T7 Nexus 7                            T11 VOYO A15       T21 Kindle HDX 7
         Cortex-A9 1.2 GHz                     Cortex-A15 1.7 GHz Qualcomm 800 2.1 GHz
L1/L2 KB 32/1024                               32/2048            16/2048
         Android 4.1.2      Android 5.0.2      Android 4.2.2      Android 4.4.3
          32 Bit             32 Bit             32 Bit             32 Bit
   K Size      SP      DP         SP      DP         SP      DP         SP      DP
Version 1.0
        1    0.64    0.38       0.18    0.21       0.10    0.17       0.14    0.18
        2    0.77    0.97       0.40    0.67       0.22    0.36       0.33    0.53
        4    1.14    1.77       1.13    1.86       0.57    0.90       1.03    1.30
        8    3.28    4.40       3.26    5.12       2.12    2.31       2.50    3.09
       16    7.76    9.39       7.74    9.69       4.71    5.97       1.95    2.20
       32   17.80   22.26      18.09   22.73      10.76   11.37       4.18    5.77
       64   61.05  140.58      41.64   84.68      20.10   49.70      14.61   20.01
      128  153.19  289.15     139.98  274.54      77.67  213.70      33.19   60.52
      256  450.16  645.72     444.09  645.70     408.51  448.95     107.49  310.93
      512 1084.11 1457.85    1102.20 1438.29     782.85 1101.70     584.54  497.23
     1024 2388.33 3129.21    2388.56 3185.93    1799.89 2280.30     875.95  963.37

Version 3c.0
        1    0.66    0.21       0.27    0.25       0.23    0.08       0.35    0.07
        2    1.09    0.55       0.65    0.65       0.50    0.17       0.81    0.19
        4    2.67    1.38       1.67    1.45       1.07    0.41       1.66    0.41
        8    3.56    3.09       4.30    3.23       2.41    0.90       1.08    0.90
       16    7.78    9.08       8.33   10.35       5.26    3.23       3.36    2.66
       32   17.85   22.02      19.23   25.38      11.88    8.88       6.54    6.07
       64   39.52   52.11      46.41   58.90      23.75   23.08      12.57   13.56
      128   89.73  118.45     103.31  128.44      49.74   53.11      27.41   33.09
      256  203.34  258.56     221.99  267.12     100.25  120.66      63.39   72.55
      512  437.25  552.00     464.30  558.13     226.76  264.30     150.38  156.30
     1024  918.32 1175.65     933.05 1182.49     505.68  586.18     306.32  337.07


         T22 Lenovo TAB 2 A8-50F
         ARM Cortex-A53 1.3 GHz
L1/L2 KB 32/512
         Android 5.0.2
          64 Bit             32 Bit
   K Size      SP      DP         SP      DP
Version 1.0
        1    0.20    0.21       0.21    0.21
        2    0.44    0.50       0.43    0.53
        4    1.06    1.26       1.03    1.24
        8    2.52    3.03       2.52    2.85
       16    5.89    6.41       5.68    6.60
       32   14.09   25.29      13.05   30.59
       64   49.97  109.32      45.80   92.16
      128  188.37  256.98     153.25  221.98
      256  447.62  583.33     362.62  504.60
      512  826.77 1019.84     840.44 1107.14
     1024 1846.27 2299.97    1835.82 2423.72

Version 3c.0
        1    0.17    0.20       0.34    0.20
        2    0.37    0.48       0.74    0.47
        4    2.55    1.07       1.62    1.06
        8    1.93    2.40       3.63    2.33
       16    4.59    5.64       8.07    9.12
       32   10.68   15.40      18.20   22.93
       64   28.17   36.16      45.33   50.41
      128   66.87   82.23     101.38  112.46
      256  148.69  193.91     222.13  264.79
      512  347.25  424.72     501.52  550.88
     1024  760.74  960.28    1085.65 1206.83


                             Intel CPUs Android

                                               Dual Boot with W2
         A1 Asus MemoPad 7                     A5 Teclast X98 Plus
         Atom Z3745 1.86 GHz                   Atom Z8300 1.84 GHz
 L1/L2/L324/1024                               KB 24/1024/0
         Android 4.4.2      Android 5.0        Android 5.1
          32 Bit             32 Bit             32 Bit
   K Size      SP      DP         SP      DP         SP      DP
 Version 1.0
        1    0.09    0.11       0.10    0.09       0.09    0.12
        2    0.21    0.29       0.16    0.23       0.18    0.31
        4    0.61    0.66       0.48    0.52       0.61    0.57
        8    1.35    1.17       1.07    1.17       1.17    1.56
       16    3.20    2.57       2.38    2.59       3.15    3.34
       32    5.41    5.75       5.30    6.02       6.65    9.20
       64   11.74   29.95      11.77   28.31      15.62   45.48
      128   67.54   99.31      54.05   97.58      49.67  110.14
      256  194.13  225.94     189.11  219.98     222.78  264.65
      512  438.49  501.59     433.06  487.49     521.72  602.38
     1024  970.84 1121.61     968.37 1116.94    1187.13 1433.75

Version 3c.0
        1    0.09    0.08       0.10    0.08       0.15    0.13
        2    0.21    0.20       0.16    0.20       0.20    0.21
        4    0.50    0.43       1.66    0.43       0.45    0.52
        8    1.12    0.96       0.87    0.96       0.97    1.05
       16    2.64    2.86       2.01    2.34       2.14    2.61
       32    4.87    5.56       4.51    5.73       4.82    6.53
       64   11.11   15.03      10.01   14.47      11.10   17.79
      128   27.29   34.77      26.80   33.71      29.95   43.74
      256   62.57   72.93      61.16   72.04      77.43   86.13
      512  132.64  157.56     131.10  152.68     152.95  185.74
     1024  282.99  332.37     274.01  363.60     314.54  460.91


                             Intel CPUs - Windows

                                               Dual Boot with A5
         W1 Pipo W1S Tablet                    W2 Teclast X98 Plus
         Atom Z8300 1.84 GHz                   Atom Z8300 1.84 GHz
L1/L2/L3 KB 24/1024/0                          KB 24/1024/0
         Windows 10                            Windows 10
          32 bit             64 bit             32 Bit             64 Bit
    K Size     SP      DP         SP      DP         SP      DP         SP      DP
   Version 1.0
        1    0.11    0.12       0.10    0.12       0.11    0.12       0.10    0.12
        2    0.24    0.45       0.23    0.35       0.24    0.34       0.22    0.33
        4    0.67    0.75       0.63    0.74       0.65    0.74       0.72    0.74
        8    1.44    1.80       1.50    1.69       1.46    1.66       1.37    1.68
       16    3.29    3.71       3.16    3.65       3.25    3.61       3.21    3.78
       32    7.32    7.83       5.94    6.98       7.33    8.10       6.98    7.97
       64   14.36   31.51      13.95   25.44      16.40   28.29      15.96   29.96
      128   46.45  120.79      50.90  115.44      38.56  121.13      76.10  136.39
      256  209.39  235.36     203.02  266.34     232.47  266.35     259.73  298.24
      512  455.89  534.68     491.49  576.91     565.20  597.42     596.50  629.28
     1024 1024.78 1195.81    1040.39 1182.20    1205.59 1450.84    1288.20 1439.44

   Version 3c.0
        1    0.08    0.08       0.08    0.09       0.08    0.09       0.09    0.08
        2    0.19    0.20       0.20    0.22       0.19    0.23       0.18    0.19
        4    0.46    0.44       0.46    0.48       0.45    0.51       0.48    0.43
        8    1.20    0.97       1.06    1.07       1.00    1.12       1.08    0.93
       16    2.27    2.26       2.26    2.25       2.67    2.68       2.51    2.50
       32    5.11    5.54       5.31    5.83       5.54    5.59       5.74    6.06
       64   12.48   14.29      11.22   15.59      10.64   14.72      12.54   14.77
      128   27.62   34.25      27.47   31.65      32.82   36.71      28.28   36.95
      256   71.32   70.99      62.74   67.95      66.71   77.48      67.25   78.47
      512  143.07  144.60     140.50  146.76     157.72  153.43     150.14  168.63
     1024  298.00  322.13     289.98  334.07     332.39  365.36     300.79  370.48


         2015 Top End Desktop PC
         Corei7-4820K 3.9 GHz
 L1/L2/L332/256/10 MB
         Windows 10
          32 bit             64 bit
   K Size      SP      DP         SP      DP
 Version 1.0
        1    0.02    0.02       0.02    0.02
        2    0.04    0.04       0.04    0.04
        4    0.09    0.12       0.08    0.12
        8    0.26    0.31       0.25    0.30
       16    0.65    0.77       0.62    0.76
       32    1.59    1.96       1.51    1.93
       64    4.33    4.87       3.91    4.78
      128    9.94   10.57       9.21   10.60
      256   21.87   22.00      21.01   22.06
      512   45.09   55.15      44.72   58.29
     1024  105.75  199.77     111.23  199.11

Version 3c.0
        1    0.02    0.02       0.01    0.01
        2    0.03    0.03       0.03    0.03
        4    0.07    0.08       0.06    0.07
        8    0.16    0.18       0.14    0.16
       16    0.37    0.41       0.33    0.38
       32    0.81    0.86       0.73    0.82
       64    1.76    1.86       1.56    1.75
      128    3.77    4.05       3.38    3.76
      256    8.24    9.36       7.38    8.78
      512   19.09   22.96      17.28   22.50
     1024   45.68   57.37      42.19   56.66
   

To Start


MP-Whetstone Benchmark - MP-WHETSi, MPWhets32.exe, MPWhets64.exe

For more information on Whetstone Benchmark see stand alone version, above. The multithreading version runs multiple copies of the same shared code, with separate variables. In this case, performance of each of the eight test functions and overall MWIPS ratings is invariably (nearly) proportional to the number of CPU cores available. The driving program checks that calculations on every thread produce consistent numeric results.

The gcc 4.8 based ARM/Intel version, running on the Intel Atom tablet, is rated at twice the speed of the original, due to the use of native code. The fixed point results indicate overoptimisation, but the test uses little of the overall time, this being mainly dependent on the Cos, Exp and third MFLOPS tests. Running the original ARM converted code version via Android 5.0, mainly produced better performance, but an overall lower rating, due to slower Cos and Exp tests, same as stand alone version above.

Also the same as the stand alone version, the new native ARM program was generally slower, running on tablets T7, T11 and T21,

On T22, with the Cortex-A53 CPU, the new 32 bit single thread tests appeared to be slower than the stand alone version, but that was not the case at 64 bits, apparently indicating a 64 bit performance gain.

Intel/Windows 10 - 32 bit and 64 bit Intel/Windows results are now included for Atom and Core i7 CPUs. Note that, based overall seconds, on the Atom/Windows 10 tablet is not as efficient as the other systems (4 cores 347% of 1 core).

A5 and W2 Dual Boot Tablet - Android and Windows speeds are significantly different, on some tests, because of the different compilers, particularly due to optimisation, but these tests do not affect the overall MWIPS results much. The latter averages 18% faster via Android but both show 2 and 4 thread performance gains of around 1.9 and 3.5 times.


 ##################### T7 Original ######################

 T7,   ARM Cortex-A9 1300 MHz, Android 4.1.2, 
            Measured 1200 MHz

     Android MP-Whetstone Benchmark V1.0 17-Oct-2012 13.49

                    Using 1, 2, 4 and 8 Threads
      MWIPS MFLOPS MFLOPS MFLOPS   Cos   Exp   Fixpt      If  Equal
                 1      2      3  MOPS  MOPS    MOPS    MOPS   MOPS

 1T  1033.7  247.4  235.4  266.0  25.3  15.0   448.4   630.9  513.5
 2T  2058.1  456.3  473.0  532.4  50.0  30.1   898.1  1198.4 1026.6
 4T  4122.8  831.9  944.7 1064.6 100.7  60.1  1797.0  2392.2 2053.4
 8T  4163.2 1016.0  948.2 1069.5 101.8  60.9  1808.0  2414.2 2051.5

  Overall Seconds   5.28 1T,   5.34 2T,   5.42 4T,  10.81 8T

 #################### T7 ARM-Intel #####################

    ARM/Intel MP-Whetstone Benchmark V1.1 30-Apr-2015 21.32

                    Using 1, 2, 4 and 8 Threads
      MWIPS MFLOPS MFLOPS MFLOPS   Cos   Exp   Fixpt      If  Equal
                 1      2      3  MOPS  MOPS    MOPS    MOPS   MOPS

 1T   602.2  242.3  242.3  140.2  27.2   4.9   482.8  1425.2  239.1
 2T  1208.7  481.2  484.2  280.8  55.0   9.9   970.0  2869.6  478.7
 4T  2398.7  805.4  966.7  562.5 109.5  19.5  1938.2  5722.5  957.1
 8T  2429.1  974.6 1076.2  562.4 110.9  19.7  1981.5  5816.1  963.6

  Overall Seconds   4.94 1T,   4.93 2T,   5.08 4T,   9.93 8T


 #################### T11 Original #####################

 T11 Samsung EXYNOS 5250 2.0 GHz Cortex-A15, Android 4.2.2
                Measured 1.7 GHz

   Android MP-Whetstone Benchmark V1.1 06-Sep-2013 12.49
 
                    Using 1, 2, 4 and 8 Threads
      MWIPS MFLOPS MFLOPS MFLOPS   Cos   Exp   Fixpt      If  Equal
                 1      2      3  MOPS  MOPS    MOPS    MOPS   MOPS

 1T  1308.2  345.9  379.0  294.1  30.8  17.2  1351.4  1265.7  843.1
 2T  2886.6  782.1  782.6  614.0  80.1  34.3  2775.2  2463.7 1667.5
 4T  3086.0  998.6  788.1  610.6  79.2  44.5  3472.0  2526.4 2191.4
 8T  2930.0  788.2  843.5  616.5  80.5  35.0  2846.0  2799.1 1686.2

  Overall Seconds   3.54 1T,   3.30 2T,   6.62 4T,  13.16 8T

 #################### T11 ARM-Intel ####################

    ARM/Intel MP-Whetstone Benchmark V1.1 30-Apr-2015 21.23

                    Using 1, 2, 4 and 8 Threads
      MWIPS MFLOPS MFLOPS MFLOPS   Cos   Exp   Fixpt      If  Equal
                 1      2      3  MOPS  MOPS    MOPS    MOPS   MOPS

 1T   837.2  340.1  341.7  191.2  39.1   6.2  1521.1  2532.8  629.3
 2T  1676.2  596.2  683.2  387.3  77.8  12.4  3056.9  5055.1 1263.6
 4T  1697.7  687.5  869.4  394.5  78.1  12.4  2980.7  6518.4 1258.8
 8T  1685.2  685.9  691.0  389.7  78.3  12.4  3086.3  5113.7 1262.0

  Overall Seconds   4.06 1T,   4.07 2T,   8.12 4T,  16.19 8T

 
 #################### T21 Original #####################

    T21 Qualcomm Snapdragon 800 2150 MHz, Android 4.4.4

    Android MP-Whetstone Benchmark V1.1 06-Jul-2015 10.42

                    Using 1, 2, 4 and 8 Threads
      MWIPS MFLOPS MFLOPS MFLOPS   Cos   Exp  Fixpt     If  Equal
                 1      2      3  MOPS  MOPS   MOPS   MOPS   MOPS

 1T  1877.1  645.2  642.6  524.1  44.0  22.3 1364.7 1572.1  898.9
 2T  3668.6 1220.2 1262.4 1021.9  85.9  43.8 2663.5 3078.4 1753.4
 4T  7426.9 2375.5 2474.7 2097.7 175.7  88.2 5052.6 6240.4 3555.0
 8T  7706.6 2692.2 2746.2 2186.9 180.1  90.3 5822.5 6902.7 3681.3

 Overall Seconds   4.44 1T,   4.62 2T,   4.64 4T,   9.00 8T

          Total Elapsed Time   24.1 seconds

 #################### T21 ARM-Intel #################### 

    ARM/Intel MP-Whetstone Benchmark V1.1 22-Jul-2015 12.02
 
                    Using 1, 2, 4 and 8 Threads
      MWIPS MFLOPS MFLOPS MFLOPS   Cos   Exp   Fixpt      If  Equal
                 1      2      3  MOPS  MOPS    MOPS    MOPS   MOPS

 1T  1598.0  512.1  508.7  311.7  43.6  22.1  1142.9  2123.3  598.4
 2T  3161.2  960.0  996.7  614.2  86.7  43.8  2258.9  3820.9 1194.7
 4T  6348.0 1593.5 2019.5 1231.5 174.2  88.5  4471.1  8139.4 2398.3
 8T  6419.6 2058.2 2077.5 1252.6 175.0  88.7  4520.9  8875.0 2409.0

 Overall Seconds   4.88 1T,   5.00 2T,   5.05 4T,   9.92 8T

          Total Elapsed Time   29.2 seconds


 ###################### T22 32 Bit ######################

  T22, Quad Core ARM Cortex-A53 1300 MHz, Android 5.0.2 

     ARM/Intel MP-Whetstone Benchmark V1.2 10-Aug-2015 11.30
            Compiled for 32 bit ARM v7a

                    Using 1, 2, 4 and 8 Threads
      MWIPS MFLOPS MFLOPS MFLOPS   Cos   Exp   Fixpt      If  Equal
                 1      2      3  MOPS  MOPS    MOPS    MOPS   MOPS

 1T   676.4  275.9  281.9  147.9  35.4   5.3   600.3   901.0  285.5
 2T  1362.5  533.8  561.7  298.0  70.9  10.8  1203.1  1838.9  574.0
 4T  2698.6  903.9 1071.7  594.4 141.2  21.5  2346.1  3305.5 1138.5
 8T  2830.1 1463.2 1393.0  614.2 152.5  21.9  3243.9  4418.3 1171.4

 Overall Seconds   4.95 1T,   4.94 2T,   5.11 4T,  10.09 8T

###################### T22 64 Bit ######################

     ARM/Intel MP-Whetstone Benchmark V1.2 10-Aug-2015 11.34
            Compiled for 64 bit ARM v8a

                    Using 1, 2, 4 and 8 Threads
      MWIPS MFLOPS MFLOPS MFLOPS   Cos   Exp   Fixpt      If  Equal
                 1      2      3  MOPS  MOPS    MOPS    MOPS   MOPS

 1T  1524.8  328.6  348.8  297.6  37.3  19.9 1462579  1867.2 1238.0
 2T  3062.5  688.8  697.9  596.0  75.5  39.8 2097113  3726.7 2481.3
 4T  6085.4 1214.9 1360.5 1185.4 150.5  79.4 2449153  7055.0 4951.8
 8T  6222.4 1495.2 1545.6 1204.2 152.2  80.6 3869846  9218.8 5154.1

 Overall Seconds   4.92 1T,   4.90 2T,   5.05 4T,   9.97 8T


 #################### A1 Original #######################

 A1 Quad Core 1.86 GHz Intel Atom Z3745, Android 4.4
   Dual Channel LPDDR3-1066 Bandwidth 17.1 GB/s

   Android MP-Whetstone Benchmark V1.1 04-Feb-2015 11.39

                    Using 1, 2, 4 and 8 Threads
      MWIPS MFLOPS MFLOPS MFLOPS   Cos   Exp   Fixpt      If  Equal
                 1      2      3  MOPS  MOPS    MOPS    MOPS   MOPS

 1T   953.7  363.0  382.4  267.8  21.0  13.2   413.1  1842.4  392.3
 2T  1921.2  726.0  663.5  541.4  42.6  27.0   816.1  3662.6  793.3
 4T  3820.6 1419.2 1514.6 1081.5  84.1  54.0  1543.8  6292.4 1588.5
 8T  4003.8 1912.9 1872.4 1114.1  86.5  56.4  2053.1  8292.6 1599.7

  Overall Seconds   4.88 1T,   4.87 2T,   4.96 4T,  10.05 8T

 ################## A1 V1 Android 5.0 ###################

   Android MP-Whetstone Benchmark V1.1 05-Nov-2015 11.06

                    Using 1, 2, 4 and 8 Threads
      MWIPS MFLOPS MFLOPS MFLOPS   Cos   Exp   Fixpt      If  Equal
                 1      2      3  MOPS  MOPS    MOPS    MOPS   MOPS

 1T   748.8  405.9  411.8  367.0  11.3  11.1   898.0  2129.1  459.8
 2T  1468.5  822.0  827.5  744.8  22.4  22.2  1088.8  4228.4  924.5
 4T  2781.0 1242.8 1638.6 1415.5  40.3  44.3  3404.6  8283.2 1852.1
 8T  3050.7 1854.5 1831.0 1566.7  45.4  45.3  4519.7 10332.5 1844.5

  Overall Seconds   5.00 1T,   5.09 2T,   5.72 4T,  10.30 8T

 #################### A1 ARM-Intel ######################

    ARM/Intel MP-Whetstone Benchmark V1.1 30-Apr-2015 17.35
 
                    Using 1, 2, 4 and 8 Threads
      MWIPS MFLOPS MFLOPS MFLOPS   Cos   Exp   Fixpt      If  Equal
                 1      2      3  MOPS  MOPS    MOPS    MOPS   MOPS

 1T  1916.9  691.4  691.3  497.2  35.3  27.6 10209.8  2787.3 1351.8
 2T  3800.3 1377.6 1381.2  980.0  70.1  54.7 20248.0  5252.8 2748.7
 4T  7604.9 2713.2 2711.8 1977.1 140.2 110.0 33906.3  9526.5 5550.8
 8T  7798.1 3141.5 3627.2 2064.2 141.2 110.2 59590.6 12743.7 5711.5

  Overall Seconds   4.94 1T,   5.00 2T,   5.06 4T,  10.11 8T


 ########### A5 ARM-Intel Dual Boot With W2 #############

   Intel Atom Z8300 quad core 1.44 GHz Turbo 1.84
          Android 5.1, 4 GB DDR 3 1600
             4 x 24 KB L1, 2 x 1 MB L2

 ARM/Intel MP-Whetstone Benchmark V1.2 14-Apr-2016 17.09
           Compiled for 32 bit Intel x86

                    Using 1, 2, 4 and 8 Threads
      MWIPS MFLOPS MFLOPS MFLOPS   Cos   Exp   Fixpt      If  Equal
                 1      2      3  MOPS  MOPS    MOPS    MOPS   MOPS

 1T  2121.9  695.0  695.7  483.5  39.6  34.8 10102.2  2700.8 1358.9
 2T  4123.2 1319.0 1351.2  903.1  78.9  67.2 19593.6  5336.0 2604.5
 4T  7368.1 2394.0 2375.9 1668.8 139.0 119.8 35711.8  9359.2 4603.0
 8T  7391.0 2397.4 2769.0 1658.4 137.7 121.8 36643.4  9953.9 4670.9

 Overall Seconds   4.88 1T,   5.04 2T,   5.84 4T,  11.52 8T


 
 ################# W1 Windows 10 32 bit #################

   Intel Atom Z8300 quad core 1.44 GHz Turbo 1.84
         Windows 10, 4 GB DDR 3 1600

     MP-Whetstone Benchmark From C/C++ 18.00.21005.1 for x86

                   Using 1, 2, 4 and 8 Threads
      MWIPS MFLOPS MFLOPS MFLOPS   Cos   Exp   Fixpt      If   Equal
                 1      2      3  MOPS  MOPS    MOPS    MOPS    MOPS

 1T  1816.7  568.3  580.3  477.8  34.9  26.9  1395.8  1100.4  7327.8
 2T  3469.7 1145.9 1086.9  905.6  66.1  52.6  2684.4  2118.7 13383.7
 4T  6337.0 2026.1 2029.6 1658.4 121.2  95.1  4886.7  3800.8 24933.3
 8T  6900.2 2162.4 2326.0 1870.2 134.7  98.8  6089.9  4071.4 29659.9

 Overall Seconds   4.80 1T,   5.02 2T,   5.53 4T,  13.07 8T

 ################# W1 Windows 10 64 bit #################

     MP-Whetstone Benchmark From C/C++ 18.00.21005.1 for x64

                   Using 1, 2, 4 and 8 Threads
      MWIPS MFLOPS MFLOPS MFLOPS   Cos   Exp   Fixpt      If   Equal
                 1      2      3  MOPS  MOPS    MOPS    MOPS    MOPS

 1T  1994.3  537.7  536.4  476.9  42.3  28.8  1420.0  1099.2  7305.8
 2T  3760.6 1080.6 1075.4  894.9  79.9  53.3  2842.5  2115.5 12762.4
 4T  6946.5 1850.0 1883.3 1655.9 146.8 101.3  4946.3  3787.9 25246.0
 8T  7556.2 1891.4 2159.3 1867.7 163.1 104.8  5362.5  4283.3 26001.8

 Overall Seconds   4.89 1T,   5.19 2T,   5.66 4T,  13.26 8T


 ######## W2 Windows 10 32 bit Dual Boot With A5 ########

   Intel Atom Z8300 quad core 1.44 GHz Turbo 1.84
         Windows 10, 4 GB DDR 3 1600
             4 x 24 KB L1, 2 x 1 MB L2

     MP-Whetstone Benchmark From C/C++ 18.00.21005.1 for x86

      Start of test Fri Apr 15 16:28:12 2016

                   Using 1, 2, 4 and 8 Threads
      MWIPS MFLOPS MFLOPS MFLOPS   Cos   Exp   Fixpt      If   Equal
                 1      2      3  MOPS  MOPS    MOPS    MOPS    MOPS

 1T  1776.5  561.5  581.1  466.3  34.1  26.2  1402.2  1093.2  6981.4
 2T  3364.9 1014.1 1020.8  832.8  65.6  51.6  2643.0  2027.2 12415.1
 4T  6316.1 1987.1 2016.5 1655.2 121.2  94.2  4860.8  3793.2 24941.8
 8T  6563.4 2372.8 2031.4 1850.4 122.8  96.6  5667.8  3844.8 28561.7

 Overall Seconds   4.75 1T,   5.06 2T,   5.39 4T,  11.56 8T

 ######## W2 Windows 10 64 bit Dual Boot With A5 ########
 
    MP-Whetstone Benchmark From C/C++ 18.00.21005.1 for x64

      Start of test Fri Apr 15 16:38:09 2016

                   Using 1, 2, 4 and 8 Threads
      MWIPS MFLOPS MFLOPS MFLOPS   Cos   Exp   Fixpt      If   Equal
                 1      2      3  MOPS  MOPS    MOPS    MOPS    MOPS

 1T  1954.1  506.3  538.0  469.7  40.4  29.1  1411.3  1091.8  7280.9
 2T  3615.7 1011.5  989.7  873.6  77.1  51.7  2477.6  1907.0 13107.0
 4T  6941.8 1877.9 1879.3 1652.7 147.1 100.9  4946.8  3789.6 25046.5
 8T  7124.5 2128.2 1975.4 1705.5 149.7 103.3  5058.7  4284.8 28862.8

 Overall Seconds   4.95 1T,   5.36 2T,   5.59 4T,  11.72 8T


 ==================================================================
           Top end 2015 PC - Core i7-4820K at 3.9 GHz 
 ==================================================================
 32 Bit
     MP-Whetstone Benchmark From C/C++ 18.00.21005.1 for x86

                   Using 1, 2, 4 and 8 Threads
      MWIPS MFLOPS MFLOPS MFLOPS   Cos   Exp   Fixpt      If   Equal
                 1      2      3  MOPS  MOPS    MOPS    MOPS    MOPS

 1T  5273.9 1114.8 1119.2  921.1 129.4  90.7  3404.0  5351.3 22213.6
 2T 11031.8 2238.4 2304.9 1938.0 271.1 189.4  6973.5 11713.2 46821.3
 4T 21347.8 4713.1 4718.0 3879.9 493.4 375.2 14335.7 21161.6 89584.4
 8T 39679.6 9374.0 9397.5 7687.6 874.8 726.5 24631.8 23418.6 93465.8

 Overall Seconds   4.97 1T,   4.76 2T,   4.99 4T,   5.59 8T

 64 Bit
     MP-Whetstone Benchmark From C/C++ 18.00.21005.1 for x64

                     Using 1, 2, 4 and 8 Threads
       MWIPS MFLOPS  MFLOPS MFLOPS    Cos   Exp   Fixpt      If   Equal
                  1       2      3   MOPS  MOPS    MOPS    MOPS    MOPS

 1T  6200.6  1236.5  1236.2  870.8  206.0 108.8  3359.1  4767.4 23413.2
 2T 13050.4  2603.8  2606.2 1891.4  432.6 217.5  7076.8 10041.6 46840.3
 4T 25336.0  5195.2  5211.7 3707.1  832.8 422.9 13626.9 16962.6 78345.9
 8T 46141.7 10293.2 10379.0 7242.4 1332.7 814.2 24394.5 23451.3 93587.6

 Overall Seconds   4.82 1T,   4.60 2T,   4.91 4T,   5.50 8T
   

To Start


MP Dhrystone Benchmark - MP-Dhryi.apk, MPDhry32.exe, MPDhry64.exe

For further details see Dhrystone Benchmark above and the following, including further results Android MultiThreading Benchmark Apps. This multithreading benchmark runs using 1, 2, 4 and 8 threads, executing multiple copies of the same program. An initial calibration, using a single thread, determines the number of passes needed for an overall execution time of 1 second. Then all threads are run using the same pass count, running time being extended when there are more threads than CPUs. The same calculations are carried out on each thread. Separate data arrays are used for each thread but some variables can be used by all threads. The latter is probably responsible for failure to increase throughput much, using multiple threads or, in the case of A1, with the Atom CPU, reduced throughput using more than one thread.

On all the initial results shown, there was little difference in performance between the original and the new 32 bit version but T22, with the Cortex-A53, produced significant gains at 64 bits.

T21, the Kindle Fire with a Quad Core Qualcomm Snapdragon 800 CPU, failed to run using the new ARM/Intel version, and obtained a rather excessive score with 8 threads via the original benchmark (but similar to a possible 4 x 2850).

W1 and PC Windows/Intel results now included. Note that the systems using ARM processors increased performance with multiple threads but those with Intel CPUs did not.

A5 and W2 Dual Boot Tablet - Android and Windows results demonstrate similar MP performance degradation, with Windows speeds being somewhat faster. Multiple runs via W1 and W2 can produce virtually the same performance, but there can be wide variations (as shown here).


                                               VAX MIPS or DMIPS

                                                     Threads
 System CPU      MHz   Android         1        2        4        8     None 
 See                               

 Original ARM Version
 A1    Z3745   1866 x4   4.4.2      2360     1394     1334     1321     1840
 A1    Z3745   1866 x4   5.0        2411     1633     1313     1298     2488
 
 T7    v7-A9   1200 x4   4.1.2      1584     2749     3836     3569     1610
 T22   v8-A53  1300 x4   5.0.2      1686     2943     4232     4323     1683
 T11   v7-A15  1700 x2   4.2.2      2271     4281     4326     4171     3189
 T21   QU-800  2150 x4   4.4.3      2850     4395     7736    11821     3854

 ARM/Intel 32 Bit Version
 A1    Z3745   1866 x4   4.4.2      2365     1322     1323     1319     2451
 A5 ## z8300   1840 x4   5.1        2256     1155     1163     1054     2318

 T7    v7-A9   1200 x4   4.1.2      1464     2399     3575     3737     1317
 T22   v8-A53  1300 x4   5.0.2      1412     2559     4038     4291     1423
 T11   v7-A15  1700 x2   4.2.2      2295     4057     3902     4096     2551
 T21   QU-800  2150 x4   4.4.3      Failed to run                       3319

 ARM/Intel 64 Bit Version
 T22   v8-A53  1300 x4   5.0.2      2548     4311     5560     5613     2569

 Intel/Windows 32 Bit Version
 W1    Z8300   1840 x4   Win10      3284     1477     1235     1313     3044
 W2 ## Z8300   1840 x4   Win10      2521     1730     1333     1285     2906
 PC   Core i7  3900 x4   Win10     12776     7175     6116     7876    12090

 Intel/Windows 64 Bit Version
 W1    Z8300   1840 x4   Win10      3745     1625     1400     1436     3291
 W2 ## Z8300   1840 x4   Win10      3717     1566     1386     1441     3195
 PC   Core i7  3900 x4   Win10     15129     8535     7278     8769    11686

                       ## A5 and W2 Same Dual Boot Tablet
    

To Start


NEON-Linpack-MP Benchmark - NEON-Linpacki-MP.apk
Windows SSE - MPLinpackBBPP.exe - BB 32 or 64, PP SP or DP

This is a multithreading version of the above. Further details and results can be found in here. The benchmark is run on 100x100, 500x500 and 1000x1000 matrices using 0, 1, 2 and 4 separate threads, the programming code for zero theads being the same as the above example. Multithreading performance, using this standard linear equation solver, is severely degraded, due to overheads, the zero thread results being the only ones of real use.

Performance of A1, with the Intel CPU and using native Intel compilation, is shown to be twice as fast as the Houdini ARM to Intel converted version, except at N = 1000, which is mainly dependent on calculations from data in RAM. Then, when running the ARM only version, using Android upgraded to 5.0, the performance difference was considerably reduced.

On ARM CPUs, speeds obtained from 32 bit and 64 bit compilations were similar, due to the programs use a limited number of identical NEON intrinsic functions. For the same reason, the new ARM/Intel version produced similar results as the original.

The program checks that the same numeric results are produced, irrespective of the number of threads used, at each matrix size. Then, due to rounding effects, these are slightly different from ARM and Intel hardware, as shown below.

A5 and W2 Dual Boot Tablet - The Windows benchmark uses only the N=500 parameter but 32 bit and 64 bit versions are available using single and double precision numbers and SSE/SSE2 instructions. The no thread speeds are faster via the Android compiler.


    MFLOPS 0 to 4 Threads, N 100, 500, 1000

 ##################### T7 Original ######################

 Android Linpack NEON SP MP Benchmark 31-Jan-2013 12.14

  T7, ARM Cortex-A9 1200 MHz, Android 4.1.2, 

 Threads      None        1        2        4

  N  100     413.47    45.95    48.22    48.34
  N  500     253.08   187.51   189.69   189.94
  N 1000     148.76   135.49   136.08   136.17

#################### T7 ARM-Intel #####################

 ARM/Intel Linpack NEON SP MP Benchmark 14-May-2015 15.40

  Threads      None        1        2        4

  N  100     385.49    28.79    29.06    29.25
  N  500     272.07   184.85   183.70   183.18
  N 1000     147.09   131.92   132.44   130.05


 #################### T11 Original #####################

 Android Linpack NEON SP MP Benchmark 13-Aug-2013 23.28

 T11 Samsung EXYNOS 5250 1.7 GHz Cortex-A15, Android 4.2.2

  Threads      None        1        2        4

  N  100    1399.82    54.86    55.31    54.66
  N  500    1154.21   434.16   434.06   436.97
  N 1000     571.26   482.57   487.25   485.80

 #################### T11 ARM-Intel ####################

 ARM/Intel Linpack NEON SP MP Benchmark 14-May-2015 15.44

  Threads      None        1        2        4

  N  100    1497.90    61.13    63.13    61.87
  N  500    1399.10   491.49   489.29   494.69
  N 1000     586.14   499.00   504.97   497.49


 #################### T21 Original #####################

 T21 Qualcomm Snapdragon 800 2150 MHz, Android 4.4.4

 Android Linpack NEON SP MP Benchmark 26-Jul-2015 11.46

  Threads      None        1        2        4

  N  100    1311.08    12.38    12.93    15.05
  N  500    2271.56   344.04   419.52   381.73
  N 1000     837.30   540.99   523.52   564.87


 #################### T21 ARM-Intel #################### 

 ARM/Intel Linpack NEON SP MP Benchmark 26-Jul-2015 11.51

  Threads      None        1        2        4

  N  100    1308.07    14.89    11.77    11.63
  N  500    2341.17   407.96   481.02   415.12
  N 1000     901.21   551.80   566.77   564.31


 ###################### T22 32 Bit ######################

  T22, Quad Core ARM Cortex-A53 1300 MHz, Android 5.0.2 

  ARM/Intel Linpack NEON SP MP Benchmark 1.2 13-Aug-2015 12.52
            Compiled for 32 bit ARM v7a

  Threads      None        1        2        4

  N  100     460.74    22.35    23.16    23.82
  N  500     480.63   336.52   339.94   303.66
  N 1000     470.02   405.86   403.01   405.98

 ###################### T22 64 Bit ######################

 ARM/Intel Linpack NEON SP MP Benchmark 1.2 13-Aug-2015 12.57
           Compiled for 64 bit ARM v8a

  Threads      None        1        2        4

  N  100     548.67    27.70    33.93    37.00
  N  500     470.04   285.95   297.79   301.67
  N 1000     519.02   441.84   443.47   441.91

 
 #################### A1 Original #######################

 Android Linpack NEON SP MP Benchmark 07-Feb-2015 18.42

 A1 Quad Core 1.86 GHz Intel Atom Z3745, Android 4.4

 Threads      None        1        2        4

 N  100     452.39    21.00    23.48    17.48
 N  500     663.38   275.56    88.66   312.71
 N 1000     617.04   380.60   191.26   195.61

 ################## A1 V1 Android 5.0 ###################

 Android Linpack NEON SP MP Benchmark 05-Nov-2015 11.49

    MFLOPS 0 to 4 Threads, N 100, 500, 1000
 Threads      None        1        2        4

 N  100     662.21    25.84    25.59    25.43
 N  500    1022.76   317.51   310.52   311.49
 N 1000     861.75   549.32   558.52   547.91

 #################### A1 ARM-Intel ######################

 ARM/Intel Linpack NEON SP MP Benchmark 1.2 06-Nov-2015 22.11
           Compiled for 32 bit Intel x86

    MFLOPS 0 to 4 Threads, N 100, 500, 1000
 Threads      None        1        2        4

 N  100     979.81    49.01    42.69    45.34
 N  500    1160.24   369.43   349.04   334.87
 N 1000     716.94   560.86   535.46   486.61
 

 ########## A5 ARM-Intel Dual Boot With W2 ############

   Intel Atom Z8300 quad core 1.44 GHz Turbo 1.84
          Android 5.1, 4 GB DDR 3 1600
             4 x 24 KB L1, 2 x 1 MB L2

 ARM/Intel Linpack NEON SP MP Benchmark 1.2 14-Apr-2016 17.22
           Compiled for 32 bit Intel x86

    MFLOPS 0 to 4 Threads, N 100, 500, 1000
 Threads      None        1        2        4

 N  100    1131.44    16.52    16.05    17.00
 N  500    1427.56   234.84   231.15   266.46
 N 1000     874.35   474.20   423.36   577.54


 ################# W1 Windows 10 32 bit #################

   Intel Atom Z8300 quad core 1.44 GHz Turbo 1.84
         Windows 10, 4 GB DDR 3 1600

 Linpack Single Precision MultiThreaded Benchmark
   32 Bit, N=500, Wed Dec 23 21:01:12 2015

 Threads          0        1        2        4
 MFLOPS      740.71   256.40   226.44   163.99

 Linpack Double Precision MultiThreaded Benchmark
   32 Bit, N=500, Wed Dec 23 21:00:30 2015

 Threads          0        1        2        4
 MFLOPS      480.73   194.42   196.76   148.52

 ################# W1 Windows 10 64 bit #################

 Linpack Single Precision MultiThreaded Benchmark
      64 Bit, N=500, Wed Dec 23 21:17:19 2015

 Threads          0        1        2        4
 MFLOPS      707.50   263.47   240.46   197.31

 Linpack Double Precision MultiThreaded Benchmark
      64 Bit, N=500, Wed Dec 23 21:16:42 2015

 Threads          0        1        2        4
 MFLOPS      488.12   205.02   202.39   165.47


 ######## W2 Windows 10 32 bit Dual Boot With A5 ########
 
  Intel Atom Z8300 quad core 1.44 GHz Turbo 1.84
          Android 5.1, 4 GB DDR 3 1600
             4 x 24 KB L1, 2 x 1 MB L2

 Linpack Single Precision MultiThreaded Benchmark
     32 Bit, N=500, Fri Apr 15 16:23:55 2016

 Threads          0        1        2        4
 MFLOPS      626.40   231.31   183.87   129.48

 Linpack Double Precision MultiThreaded Benchmark
      32 Bit, N=500, Fri Apr 15 16:23:21 2016

 Threads          0        1        2        4
 MFLOPS      412.89   221.03   148.56    94.62

 ######## W2 Windows 10 64 bit Dual Boot With A5 ########

 Linpack Single Precision MultiThreaded Benchmark
      64 Bit, N=500, Fri Apr 15 16:36:10 2016

 Threads          0        1        2        4
 MFLOPS      662.15   241.59   228.59   195.97
 ResidN        3.96     3.96     3.96     3.96

 Linpack Double Precision MultiThreaded Benchmark
 64 Bit, N=500, Fri Apr 15 16:35:42 2016

 Threads          0        1        2        4
 MFLOPS      527.64   195.54   180.62   154.02


################### Numeric Results ###################

 NR=norm resid RE=resid MA=machep X0=x[0]-1 XN=x[n-1]-1

 Single Precision

 N              100             500            1000

 ARM
 NR            1.60            3.96           11.32
 RE  3.80277634e-05  4.72068787e-04  2.70068645e-03
 MA  1.19209290e-07  1.19209290e-07  1.19209290e-07
 X0 -1.38282776e-05  5.26905060e-05  1.62243843e-04
 XN -7.51018524e-06  3.26633453e-05 -6.65783882e-05

 Intel
 NR            1.68            3.96           11.39
 RE  4.00543213e-05  4.72545624e-04  2.71725655e-03
 MA  1.19209290e-07  1.19209290e-07  1.19209290e-07
 X0 -1.38282776e-05  5.26905060e-05  1.62243843e-04
 XN -7.51018524e-06  3.26633453e-05 -6.65783882e-05


 Double Precision Intel SSE2

                               5.76
                    1.27986510e-012
                    2.22044605e-016
                    5.59552404e-014
                    3.39728246e-014
   

To Start


MP-BusSpeed Benchmark - MP-BusSpdi.apk, MP-BusSpd2i.apk, MPbusSpeed32.exe, MPbusSpeed64.exe

This is a multithreading version of the above. and here for further results. In the original MP-BusSpdi benchmark, all threads read data from the beginning. With large shared caches, this could lead to exaggerated data transfer speeds for RAM based data, using multiple threads. The revised MP-BusSpd2i attempts to avoid this by arranging for threads to have staggered starting points, but each still reading all the data, besides having a much longer running time for consistent scores. Performance using a single thread is similar to the non-threaded version and it is clear that multiple threads are needed to demonstrate maximum throughput. As usual, maximum RAM speeds can be estimated from burst transfer results, such as 16 times Inc16 MB/second. some results are provided below.

MP-BusSpdi.apk ican be downloded from here.

Using A1, with the Intel Atom CPU, the initial Houdini ARM to Intel conversion speeds were slightly slower than the results from the native code compilations, but this was made up on running via Android 5.

Results for the original version, running on ARM CPUs, are not all shown, as they were similar to those for the new version. See here. On T22, with the Cortex-A53, performance could be more than twice as fast, reading all data, using the 64 bit compilation.

The problem associated with shared caches is probably best identified by wide variations in the burst reading tests, that are not apparent in the long running versions (see T7 and T21 below ).

Intel/Windows Versions - Comparing the Atom CPUs, performance is quite similar between Android and Windows compilations. On the Core i7, it seems clear that the huge L3 cache affects the measured RAM speed.

A5 and W2 Dual Boot Tablet - Results via Windows and Android are quite similar. Windows 64 bit compilation is somewhat faster than the 32 bit version. RAM speed is slower than tablet W1.


 #################### T7 ARM-Intel #####################

 T7, ARM Cortex-A9 1.2 GHz, DDR3-1333, 5.3 GB/s Android 4.1.2, 
        4 x 32 KB L1 cache, 1 MB shared L2 cache

 ARM/Intel MP-BusSpd v7 Benchmark V1.1 05-May-2015 14.35

   MB/Second Reading Data, 1, 2, 4 and 8 Threads
  KB      Inc32  Inc16   Inc8   Inc4   Inc2  RdAll

 12.3 1T   2853   3392   3376   3511   3551   3494
      2T   2857   3389   3542   5540   5730   5595
      4T   7257  10326  10289  10997  11373  11100
      8T   6584  10325  10485  11175  11322  11189
122.9 1T    362    379    347    546    623    978
      2T    516    530    508    726   1227   1840
      4T    598    658    548   1181   1556   2657
      8T    721    733    736   1181   1548   2653
12288 1T     58     57     84    123    173    334
      2T    111    111    182    248    348    664
      4T     87     85    276    463    687   1290
      8T    154    107    147    429    441   1242

          Total Elapsed Time   12.7 seconds

 ########## T7 New Long Version

 ARM/Intel MP-BusSpd2 Benchmark V1.0 24-Jul-2015 15.59

   MB/Second Reading Data, 1, 2, 4 and 8 Threads
  KB      Inc32  Inc16   Inc8   Inc4   Inc2  RdAll

 12.3 1T   2166   2774   3181   3307   3377   3263
      2T   3924   5188   5207   5754   5759   5805
      4T   7570  10011  10252  11165  11375  11777
      8T   3510   4786   9011   8318  11351  11544
122.9 1T    383    409    359    558    663    983
      2T    525    541    520    741   1241   1814
      4T    739    752    753   1219   1590   2776
      8T    735    741    753   1218   1607   2737
49152 1T     56     51     81    126    172    330
      2T     65     67    107    196    335    620
      4T     70     68    108    215    426    835
      8T     70     68    109    215    428    851

          Total Elapsed Time   48.2 seconds


 Maximum RAM Speed Estimate = 68 x 16 = 1088 MB/second


 #################### T11 ARM-Intel ####################

 T11 Samsung EXYNOS 5250 1.7 GHz Cortex-A15, Android 4.2.2
  Dual core, 2 x 32 KB L1 cache, 1 MB shared L2 cache

 ARM/Intel MP-BusSpd v7 Benchmark V1.1 05-May-2015 14.45

   MB/Second Reading Data, 1, 2, 4 and 8 Threads
  KB      Inc32  Inc16   Inc8   Inc4   Inc2  RdAll

 12.3 1T   2165   3591   4256   5587   5998   6109
      2T   4121   6469   9530  11381  11846  11936
      4T   4106   6438   8827   6793   9802  12080
      8T   4098   6390   9534  10141  10996  11603
122.9 1T    464    740   1173   2395   3276   3340
      2T    579    989   1934   3994   5431   5792
      4T    579    988   1930   3873   5469   5821
      8T    580    985   1915   3999   5408   5812
12288 1T    134    172    211    462    602   1904
      2T    269    343    387    934   1217   2685
      4T    252    231    374    768    991   2625
      8T    231    254    367    781   1104   2782

          Total Elapsed Time   12.1 seconds

 ########## T11 New Long Version

 ARM/Intel MP-BusSpd2 Benchmark V1.0 24-Jul-2015 17.07

   MB/Second Reading Data, 1, 2, 4 and 8 Threads
  KB      Inc32  Inc16   Inc8   Inc4   Inc2  RdAll

 12.3 1T   3499   4539   5499   5505   6134   6045
      2T   3775   7202   8377  10605  10457  11319
      4T   3982   6676   7687   9326   9707  10807
      8T   2546   3643   7891   8003  10725  11097
122.9 1T    672    901   1336   2784   3274   3334
      2T    568    969   1931   3894   5427   5221
      4T    574    971   1912   3831   5256   4811
      8T    559    971   1917   3878   5387   5162
49152 1T    140    142    193    575    989   1499
      2T    221    223    342    769   1379   2355
      4T    228    223    344    783   1382   2376
      8T    223    223    342    787   1385   2352

          Total Elapsed Time   49.9 seconds

 Maximum RAM Speed Estimate = 223 x 16 = 2568 MB/second


 #################### T21 Original #####################

 T21 Qualcomm Snapdragon 800 2150 MHz, Android 4.4.4
     Dual Channel 32 Bit LPDDR3-1866 RAM 14.9 GB/s
     L1 caches 4 x 16 KB, L2 cache shared 2048 KB

 Android MP-BusSpd v7 Benchmark V1.1 29-Jun-2015 18.37

   MB/Second Reading Data, 1, 2, 4 and 8 Threads
  KB      Inc32  Inc16   Inc8   Inc4   Inc2  RdAll

 12.3 1T   2580   2206   5048   5176   5679   5989
      2T   4062   5175   9340   9868  10971  11281
      4T   4688  10324  16552  17196  21714  23708
      8T   8467   9834  16698  18183  21936  23693
122.9 1T   1152   1052   2068   3035   3927   5723
      2T   1710   1840   3094   5001   7963  11475
      4T   2047   2002   5031   9267  14698  22920
      8T   2235   2275   5223   9348  14234  21783
12288 1T    262    382    508    867   1466   2661
      2T    464    766   1049   1754   3186   5735
      4T    612   1018   1796   3149   5892   9095
      8T    575    680   1277   2308   4987   7948

          Total Elapsed Time   12.7 seconds

 Impossible Maximum RAM Speed 1018 x 16 = 16288 MB/second

 #################### T21 ARM-Intel #################### 

 ARM/Intel MP-BusSpd v7 Benchmark V1.1 23-May-2015 17.05

   MB/Second Reading Data, 1, 2, 4 and 8 Threads
  KB      Inc32  Inc16   Inc8   Inc4   Inc2  RdAll

 12.3 1T   1840   2073   3512   3554   4829   5243
      2T   3432   4591   7128   7651   9120   9821
      4T   4398   7855  13752  15428  18530  20235
      8T   6692   9507  13857  16110  18143  18796
122.9 1T    860    753   2011   2841   3205   5282
      2T   1505   1609   3076   5038   8089  10421
      4T   1924   1981   4299   7588  14614  20754
      8T   1909   1988   4264   7980  13884  19027
12288 1T    270    379    538    856   1626   2859
      2T    471    677   1098   1849   3304   5924
      4T    549    787   1066   1874   6274  10781
      8T    713    853   1649   2258   4664   8321

          Total Elapsed Time   13.1 seconds

 ########## T21 New Long Version

 ARM/Intel MP-BusSpd2 Benchmark V1.0 24-Jul-2015 15.39

   MB/Second Reading Data, 1, 2, 4 and 8 Threads
  KB      Inc32  Inc16   Inc8   Inc4   Inc2  RdAll

 12.3 1T   2247   2616   4010   4443   4909   5614
      2T   3558   4725   7241   9048   9747  10892
      4T   6074   8303  13442  16937  18525  21068
      8T   3998   5106  14314  13615  18200  20740
122.9 1T    874   1198   2024   2935   4529   5345
      2T   1686   1702   3174   5357   7688  10545
      4T   1988   2139   4465   8171  14969  21169
      8T   1972   2139   4468   8195  15261  21132
49152 1T    292    406    516    899   1663   2929
      2T    449    541    962   1569   2851   4776
      4T    495    605   1109   2439   4161   8243
      8T    530    564   1156   2149   4172   7907

          Total Elapsed Time   48.0 seconds

 Maximum RAM Speed Estimate = 605 x 16 = 9680 MB/second


 ###################### T22 32 Bit ######################

 T22, Tab 2 A8-50, 1.3 GHz quad core 64 bit ARM Cortex-A53
  Single Channel RAM, LPDDR3 666 MHz, 5.3 GB/second
     4 x 32 KB L1 cache, 512 KB L2 cache

 ARM/Intel MP-BusSpd Benchmark V1.2 12-Aug-2015 16.13
           Compiled for 32 bit ARM v7a

   MB/Second Reading Data, 1, 2, 4 and 8 Threads
  KB      Inc32  Inc16   Inc8   Inc4   Inc2  RdAll

 12.3 1T   1849   2140   2079   2211   2270   2297
      2T   3663   4252   4294   4400   4370   4580
      4T   4630   5574   5691   5893   6015   6083
      8T   5331   5775   6033   6622   7968   8023
122.9 1T    597    621   1119   1815   2135   2237
      2T    869    943   1644   2992   3740   4412
      4T    949    951   1922   3736   6468   7779
      8T    948    978   1911   3717   6464   7542
12288 1T    123    174    344    678   1215   1840
      2T    243    310    672   1332   2383   3974
      4T    302    285    594   1282   2271   4606
      8T    279    295    654   1198   2749   4660

          Total Elapsed Time   12.8 seconds

 ########## T22 Long Version

 ARM/Intel MP-BusSpd2 Benchmark V1.2 12-Aug-2015 16.14
           Compiled for 32 bit ARM v7a

   MB/Second Reading Data, 1, 2, 4 and 8 Threads
  KB      Inc32  Inc16   Inc8   Inc4   Inc2  RdAll

 12.3 1T   1877   2124   2176   2266   2296   2343
      2T   3625   4198   4341   4468   4536   4613
      4T   5733   7541   8293   8830   8024   9042
      8T   2985   3829   7438   6117   8108   8923
122.9 1T    604    625   1142   1846   2150   2284
      2T    924    950   1793   3277   4270   4504
      4T    962    989   1939   3765   6798   8862
      8T    965    993   1933   3748   6651   8239
49152 1T    165    175    344    677   1285   1979
      2T    234    238    482    961   1907   3547
      4T    266    298    562   1224   2296   4478
      8T    272    275    538   1098   2149   4282

          Total Elapsed Time   48.8 seconds

 Maximum RAM Speed Estimate = 298 x 16 = 4768 MB/second

 ###################### T22 64 Bit ######################

 ARM/Intel MP-BusSpd2 Benchmark V1.2 12-Aug-2015 16.18
           Compiled for 64 bit ARM v8a

   MB/Second Reading Data, 1, 2, 4 and 8 Threads
  KB      Inc32  Inc16   Inc8   Inc4   Inc2  RdAll

 12.3 1T   2610   2472   2586   2727   2748   5841
      2T   4404   4681   4994   5369   5420  11297
      4T   6546   8125   9105  10243  10319  20610
      8T   3380   4023   7919   7146   9871  19852
122.9 1T    604    621   1110   1872   2446   5100
      2T    919    948   1855   3433   4853  10037
      4T    961    974   1984   3924   7491  14935
      8T    963    942   1931   3915   7572  14689
49152 1T    173    177    340    692   1300   2653
      2T    266    241    479    968   1883   3724
      4T    304    277    556   1130   2126   4328
      8T    279    278    544   1138   2179   4275

          Total Elapsed Time   49.4 seconds

 #################### A1 Original #######################

 A1 Quad Core 1.86 GHz Intel Atom Z3745, Android 4.4
   Dual Channel LPDDR3-1066 Bandwidth 17.1 GB/s
            4 x 24 KB L1, 2 x 1 MB L2

 Android MP-BusSpd v7 Benchmark V1.1 05-May-2015 13.02

   MB/Second Reading Data, 1, 2, 4 and 8 Threads
  KB      Inc32  Inc16   Inc8   Inc4   Inc2  RdAll

 12.3 1T   3990   4458   6123   6512   6438   6729
      2T   3894   5699   8948  10299  11800  12555
      4T   5046   7109  11952  14750  15533  23304
      8T   4533   7464  13097  16970  21674  22225
122.9 1T   1304   1613   2291   2661   3667   5063
      2T   2568   3145   4529   5365   7440  10147
      4T   4117   4801   7963   7495   8239  18911
      8T   3130   5016   7355   8543  11648  15845
12288 1T    190    265    601   1203   2316   3832
      2T    244    448    995   1771   3599   6575
      4T    427    584    860   1741   3439   7449
      8T    395    510    855   1613   3547   6776

          Total Elapsed Time   13.5 seconds

 ################## A1 V1 Android 5.0 ###################

 Android MP-BusSpd v7 Benchmark V1.1 05-Nov-2015 11.52

   MB/Second Reading Data, 1, 2, 4 and 8 Threads
  KB      Inc32  Inc16   Inc8   Inc4   Inc2  RdAll

 12.3 1T   5509   6152   6796   6937   7060   7056
      2T   4635   6757   9294  11284  12612  13486
      4T   4545   9383  15861  21378  15369  23493
      8T   4473   8723  15965  18476  23438  22747
122.9 1T   1467   1782   2386   2737   3799   5299
      2T   2225   3460   4683   5421   7507  10514
      4T   2493   5703   8165   9941  11313  11259
      8T   4119   5481   6992   8726  12919  17166
12288 1T    213    253    589   1176   2309   3903
      2T    252    396    842   1668   3325   6759
      4T    404    437   1130   1659   4562   6911
      8T    414    507    836   1902   3607   6670

          Total Elapsed Time   13.9 seconds

 #################### A1 ARM-Intel ######################

 ARM/Intel MP-BusSpd v7 Benchmark V1.1 05-May-2015 14.28

   MB/Second Reading Data, 1, 2, 4 and 8 Threads
  KB      Inc32  Inc16   Inc8   Inc4   Inc2  RdAll

 12.3 1T   5925   6494   6778   6979   7047   7026
      2T   3966   7029   9689  11689  12856  13654
      4T   4438   8698  16739  22057  23946  25729
      8T   4455   8619  15787  19934  22576  20804
122.9 1T   1490   1975   2360   2802   3818   5330
      2T   2881   3798   4647   5531   7536  10546
      4T   4452   6338   5910  10217  14650  19903
      8T   4096   5075   6264   9213  12610  15821
12288 1T    206    273    593   1198   2343   3935
      2T    276    455    842   1821   3319   6591
      4T    445    730   1401   2076   4457   7525
      8T    424    539    954   1829   3688   7064

          Total Elapsed Time   13.0 seconds

 ########## A1 New Long Version

 ARM/Intel MP-BusSpd2 Benchmark V1.0 24-Jul-2015 15.50

   MB/Second Reading Data, 1, 2, 4 and 8 Threads
  KB      Inc32  Inc16   Inc8   Inc4   Inc2  RdAll

 12.3 1T   5431   6110   6780   6262   6655   7313
      2T   3550   4464   7375   9825  11777  12442
      4T   2027   4442   4399   8841  17611  23509
      8T    983   2477   5063   4433   8568  15867
122.9 1T   1499   1991   2357   2839   3818   5382
      2T   2816   3808   4708   5592   7557  10677
      4T   4316   6313   7991   9816  14335  19993
      8T   4235   5610   7917   8791  12828  19661
49152 1T    215    275    611   1183   2328   3922
      2T    276    435    787   1671   3323   6507
      4T    398    455    884   1754   3490   6971
      8T    376    511    867   1746   3512   7510

          Total Elapsed Time   48.6 seconds

 Maximum RAM Speed Estimate = 511 x 16 = 8176 MB/second


 ########### A5 ARM-Intel Dual Boot With W2 #############

   Intel Atom Z8300 quad core 1.44 GHz Turbo 1.84
           Android 5.1, 4 GB DDR 3 1600
             4 x 24 KB L1, 2 x 1 MB L2

 ARM/Intel MP-BusSpd2 Benchmark V1.2 14-Apr-2016 17.28
           Compiled for 32 bit Intel x86

   MB/Second Reading Data, 1, 2, 4 and 8 Threads
  KB      Inc32  Inc16   Inc8   Inc4   Inc2  RdAll

 12.3 1T   5322   6275   6475   6901   6959   6925
      2T   4625   4163   6792   8964  10879  11027
      4T   2221   3775   4091   8006  15158  19631
      8T   1178   1840   3907   3884   8002  15691
122.9 1T   1438   1891   2342   2601   3477   4957
      2T   2509   3489   4597   5115   6807   9275
      4T   3591   4849   6905   8356  11204  14596
      8T   3868   5327   7014   7860  10754  15998
49152 1T    179    205    391    802   1372   3023
      2T    238    310    495   1204   2397   4559
      4T    240    336    653   1170   2008   4969
      8T    291    321    681   1316   2378   5329

          Total Elapsed Time   50.3 seconds

 Maximum RAM Speed Estimate = 336 x 16 = 5376 MB/second


 ################# W1 Windows 10 32 bit #################

   Intel Atom Z8300 quad core 1.44 GHz Turbo 1.84
  Windows 10 4 GB DDR3 1600 dual channel 12.8 GB/s

   MP-BusSpeed From C/C++ 18.00.21005.1 for x86

      Start of test Wed Dec 23 20:57:34 2015

   MB/Second Reading Data, 1, 2, 4 and 8 Threads
  KB      Inc32  Inc16   Inc8   Inc4   Inc2  RdAll

 12.3 1T   6170   6348   6836   6869   7029   6743
      2T   1859   3059   5657   7800   9685  10880
      4T    989   1804   3289   5900  10157  16055
      8T    473    843   1578   3101   5665  10124
122.9 1T   1476   1532   2319   2679   3515   4824
      2T   2234   2733   4337   5226   6710   9655
      4T   3428   4628   6956   8606  10978  16225
      8T   2675   3965   6432   8355  11139  15714
49152 1T    241    273    565   1090   2130   3848
      2T    346    409    734   1591   3082   5762
      4T    499    496    947   1887   3818   7634
      8T    476    500    930   1888   3932   7625

          End of test Wed Dec 23 20:58:22 2015

 Maximum RAM Speed Estimate = 500 x 16 = 8000 MB/second


 ################# W1 Windows 10 64 bit #################

   MPbusSpeed64 From C/C++ 18.00.21005.1 for x64

      Start of test Wed Dec 23 21:15:07 2015

   MB/Second Reading Data, 1, 2, 4 and 8 Threads
  KB      Inc32  Inc16   Inc8   Inc4   Inc2  RdAll

 12.3 1T   5222   6158   6233   6523   6404   6580
      2T   1882   3670   6113   8124   9540  10760
      4T   1089   1817   3378   6083  10832  15242
      8T    505    837   1846   3250   5899   9788
122.9 1T   1424   1540   2285   2544   3490   4854
      2T   2567   2756   4233   4920   6579   9820
      4T   3444   4858   6699   8186  11628  16690
      8T   2593   3644   5671   7370   9304  13630
49152 1T    240    268    566   1097   2070   3860
      2T    342    411    754   1448   2940   5836
      4T    451    494    894   1902   3804   7526
      8T    424    503    935   1830   3710   7180

          End of test Wed Dec 23 21:15:55 2015


 ######## W2 Windows 10 32 bit Dual Boot With A5 ########

  Intel Atom Z8300 quad core 1.44 GHz Turbo 1.84
          Android 5.1, 4 GB DDR 3 1600
             4 x 24 KB L1, 2 x 1 MB L2

   MP-BusSpeed From C/C++ 18.00.21005.1 for x86

      Start of test Fri Apr 15 16:19:46 2016

   MB/Second Reading Data, 1, 2, 4 and 8 Threads
  KB      Inc32  Inc16   Inc8   Inc4   Inc2  RdAll

 12.3 1T   5387   5874   6023   6023   6158   6175
      2T   2051   3414   5527   6968   9063   9875
      4T   1105   1897   3213   5706   9238  13066
      8T    452    830   1874   3063   5620   8967
122.9 1T   1266   1286   2041   2420   3084   4283
      2T   2258   2657   3976   4624   5973   8438
      4T   3163   4119   5893   7241  10447  15588
      8T   2540   3404   5628   8170   8647  12274
49152 1T    139    170    319    592    986   2063
      2T    202    225    442    802   1633   3542
      4T    295    359    597   1220   2489   5001
      8T    282    313    651   1159   2359   5166

          End of test Fri Apr 15 16:20:38 2016

 Maximum RAM Speed Estimate = 313 x 16 = 5008 MB/second


 ######## W2 Windows 10 64 bit Dual Boot With A5 ########

   MPbusSpeed64 From C/C++ 18.00.21005.1 for x64

      Start of test Fri Apr 15 16:31:03 2016

   MB/Second Reading Data, 1, 2, 4 and 8 Threads
  KB      Inc32  Inc16   Inc8   Inc4   Inc2  RdAll

 12.3 1T   5414   5881   5982   6593   6320   6915
      2T   2004   3844   6095   8469  10032  11237
      4T    977   1709   3311   6239  12238  17994
      8T    498    862   1737   3185   5915  10456
122.9 1T   1515   1537   2447   2750   3625   5040
      2T   2330   2730   4064   4923   6364   9105
      4T   3702   4830   7300   8835  11707  16740
      8T   2587   3613   5718   7715   9699  16216
49152 1T    183    198    429    834   1652   3143
      2T    244    303    565   1144   2221   4537
      4T    346    324    644   1284   2552   5123
      8T    306    307    618   1249   2421   4874

          End of test Fri Apr 15 16:31:54 2016


 ==============================================
   Top end 2015 PC - Core i7-4820K at 3.9 GHz 
   Quad core, 8 threads, 10 MB shared L3 cache
   RAM 1600 MHz, quad channel, 51.2 GB/sec
 ==============================================

  Intel/Windows 32 Bit Version 

   MP-BusSpeed From C/C++ 18.00.21005.1 for x86

      Start of test Sun Feb 14 18:30:05 2016

   MB/Second Reading Data, 1, 2, 4 and 8 Threads
  KB      Inc32  Inc16   Inc8   Inc4   Inc2  RdAll

 12.3 1T  14262  14567  19724  19553  19374  19743
      2T  10737  12187  18359  23285  31442  31491
      4T   5537   7660  13862  24507  32888  42530
      8T   3967   6138  14340  22999  39199  60117
122.9 1T   7263   7213  11664  16448  19425  20552
      2T  10361   9428  20446  31143  34263  40155
      4T  18846  21063  38732  54792  57770  56587
      8T  22328  32794  54749  69742  79276  80967
49152 1T    668   1031   2141   4185   8650  14974
      2T   1210   1726   3867   7731  15627  28522
      4T   2161   3177   6122  11449  25009  41192
      8T   4728   4106   9842  23118  43257  61779

          End of test Sun Feb 14 18:31:00 2016

  Intel/Windows 64 Bit Version 

   MPbusSpeed64 From C/C++ 18.00.21005.1 for x64

      Start of test Sun Feb 14 18:46:52 2016

   MB/Second Reading Data, 1, 2, 4 and 8 Threads
  KB      Inc32  Inc16   Inc8   Inc4   Inc2  RdAll

 12.3 1T  14760  14788  21402  20729  20934  21032
      2T  12570  19878  27089  35589  37688  41618
      4T   7000  11473  21725  34776  51827  74198
      8T   3728   6525  14160  23059  40659  66975
122.9 1T   7571   7448  11828  16724  20283  21671
      2T  13291  13676  22360  32586  39872  42740
      4T  18270  21303  37555  62890  78583  84191
      8T  21030  30880  53098  71255  91804 103575
49152 1T    663   1037   2159   4187   8611  15218
      2T   1207   1720   3908   6418  15470  27796
      4T   2319   2382   7002  13639  23754  46951
      8T   4728   5602  12178  21784  35170  80274

          End of test Sun Feb 14 18:47:43 2016
  

To Start


MP-RandMem Benchmark - MP-RndMemi.apk, MPRandMem32.exe, MPRandMem64.exe

This is an ARM/Intel version of the longer running MP-RndMem Benchmark, as the original, short version, produced inconsistent performance measurements. It is a multithreading variety of RandMem above. For further details and more results see here.

On tablet A1, with the Intel Atom CPU, the initial Houdini ARM to Intel conversion speeds were significantly slower than the results from the native code compilations. This problem was overcome via Android 5 procedures, when most results were faster.

On ARM based tablets, the new ARM/Intel compilations were generally slower than the original produced by an earlier compiler, but most of the difference was regained on a 64 bit version.

Intel/Windows Versions - Maximum data size of 12.3 MB was fine for early Android devices but performance can be affected by shared L2 caches on later ones. The Core i7 test, at this size, is mainly using the 10 MB L3 cache.

A5 and W2 Dual Boot Tablet - The Windows compilation is faster than the Android version on cache based read/write tests. RAM speed is again showed to be slower than tablet W1.


 ##################### T7 Original ######################

  T7, ARM Cortex-A9 1200 MHz, Android 4.1.2, 
  4 x 32 KB L1 cache, 1 MB shared L2 cache

 Android MP-RndMem2 Benchmark V2.1 06-May-2015 12.17

   MB/Second Using 1, 2, 4 and 8 Threads
   KB       SerRD SerRDWR   RndRD RndRDWR

 12.29 1T    3120    3060    3128    3078
       2T    6098    3003    6083    3004
       4T   11354    2948   11188    2942
       8T   11403    2857   10412    2872
 122.9 1T     996     983     661     699
       2T    1868     984    1012     697
       4T    2600     982    1483     699
       8T    2534     976    1459     694
 12288 1T     335     286      91      80
       2T     640     288     113      82
       4T     892     286     130      82
       8T     925     287     127      81

          Total Elapsed Time   44.7 seconds

 #################### T7 ARM-Intel #####################

 ARM/Intel MP-RndMem Benchmark V1.1 06-May-2015 11.59

   MB/Second Using 1, 2, 4 and 8 Threads
   KB       SerRD SerRDWR   RndRD RndRDWR

 12.29 1T    3060    2001    2867    1904
       2T    5459    1879    5463    1867
       4T   10797    1852   10537    1856
       8T   10090    1802   10608    1813
 122.9 1T     968     823     588     547
       2T    1749     785     902     618
       4T    2716     812    1328     672
       8T    2733     810    1407     673
 12288 1T     329     274      90      82
       2T     636     272     112      82
       4T     849     271     128      82
       8T     869     271     126      81

          Total Elapsed Time   45.4 seconds


 #################### T11 Original #####################

 T11 Samsung EXYNOS 5250 1.7 GHz Cortex-A15, Android 4.2.2
     2 x 32 KB L1 cache, 1 MB shared L2 cache

 Android MP-RndMem2 Benchmark V2.1 06-May-2015 12.13

   MB/Second Using 1, 2, 4 and 8 Threads
   KB       SerRD SerRDWR   RndRD RndRDWR

 12.29 1T    6696    4438    6594    4483
       2T   12338    3078   12263    3573
       4T   12419    2834   12166    2907
       8T   12314    2903   11991    2934
 122.9 1T    3371    2916    1639    1748
       2T    6409    1922    2052    1097
       4T    6155    1892    2027    1186
       8T    6045    2105    2015    1192
 12288 1T    1394    1048     153     133
       2T    2245     985     285     123
       4T    2277    1002     285     132
       8T    2165    1001     286     127

          Total Elapsed Time   44.0 seconds

 #################### T11 ARM-Intel ####################

 ARM/Intel MP-RndMem Benchmark V1.1 06-May-2015 12.07

   MB/Second Using 1, 2, 4 and 8 Threads
   KB       SerRD SerRDWR   RndRD RndRDWR

 12.29 1T    6315    4486    6345    4484
       2T   11837    2910   11846    3112
       4T   11864    2835   11553    2858
       8T   11821    3003   11805    3198
 122.9 1T    3963    2681    1670    1704
       2T    6672    1782    2040    1125
       4T    6493    1817    2033    1218
       8T    6673    1738    2038    1303
 12288 1T    1805    1081     177     145
       2T    2543    1066     279     137
       4T    2600    1065     276     136
       8T    2662    1073     281     138

          Total Elapsed Time   43.7 seconds


 #################### T21 Original #####################

 T21 Qualcomm Snapdragon 800 2150 MHz, Android 4.4.4
     Dual Channel 32 Bit LPDDR3-1866 RAM 14.9 GB/s
     L1 caches 4 x 16 KB, L2 cache shared 2048 KB
  
 Android MP-RndMem2 Benchmark V2.1 08-Jul-2015 16.33

   MB/Second Using 1, 2, 4 and 8 Threads
   KB       SerRD SerRDWR   RndRD RndRDWR

 12.29 1T    5088    5325    4262    4711
       2T    9752    4902    8895    4570
       4T   17379    4653   17434    4096
       8T   19771    4698   17358    4424
 122.9 1T    2714    2578    1923    2163
       2T    5614    2502    3483    2107
       4T   10859    2219    4835    1972
       8T   10654    2410    4904    1923
 12288 1T    1798     952     186     204
       2T    3489     974     341     195
       4T    6515     943     563     196
       8T    6218     922     563     187

          Total Elapsed Time   42.3 seconds

 #################### T21 ARM-Intel #################### 

 ARM/Intel MP-RndMem Benchmark V1.1 09-Jul-2015 11.48

   MB/Second Using 1, 2, 4 and 8 Threads
   KB       SerRD SerRDWR   RndRD RndRDWR

 12.29 1T    4186    3777    4055    3933
       2T    9324    3541    7710    3619
       4T   16594    3350   15731    3142
       8T   18117    3291   16187    3262
 122.9 1T    2423    2043    1610    1683
       2T    5235    2029    3013    1641
       4T   10148    1935    4662    1565
       8T   10015    1834    4611    1474
 12288 1T    1363     886     171     186
       2T    2643     845     325     187
       4T    5197     823     534     184
       8T    4801     835     542     184

          Total Elapsed Time   42.6 seconds

 
 #################### T22 Original ######################

  T22, Quad Core ARM Cortex-A53 1300 MHz, Android 5.0.2 
        4 x 32 KB L1 cache, 512 KB shared L2 cache

 Android MP-RndMem2 Benchmark V2.1 11-Nov-2015 13.03

   MB/Second Using 1, 2, 4 and 8 Threads
   KB       SerRD SerRDWR   RndRD RndRDWR

 12.29 1T    3401    3874    3435    3892
       2T    6777    3817    6592    3773
       4T   13025    3729   12630    3685
       8T   12848    3654   12113    3654
 122.9 1T    3257    3583     827     946
       2T    6416    3572    1481     943
       4T   11897    3564    2205     934
       8T   11106    3550    2173     945
 12288 1T    2397    1734      82      93
       2T    4652    1725     161      94
       4T    5834    1748     287      94
       8T    4774    1743     276      93

          Total Elapsed Time   45.9 seconds

 ###################### T22 32 Bit ######################

 ARM/Intel MP-RndMem Benchmark V1.2 12-Aug-2015 17.13
           Compiled for 32 bit ARM v7a

   MB/Second Using 1, 2, 4 and 8 Threads
   KB       SerRD SerRDWR   RndRD RndRDWR

 12.29 1T    2894    2438    2887    2433
       2T    5665    2402    5663    2403
       4T   10922    2369   11100    2310
       8T   10065    2293   10648    2265
 122.9 1T    2681    2368     757     758
       2T    5351    2360    1398     769
       4T   10056    2308    2121     772
       8T    8838    2351    1916     742
 12288 1T    2309    1662      80      78
       2T    3986    1683     164      73
       4T    5419    1684     283      82
       8T    4658    1694     279      82

          Total Elapsed Time   44.6 seconds

###################### T22 64 Bit ######################

 ARM/Intel MP-RndMem Benchmark V1.2 12-Aug-2015 17.15
           Compiled for 64 bit ARM v8a

 12.29 1T    4445    3109    4455    3089
       2T    8010    3100    8072    3105
       4T   15909    3057   14711    3040
       8T   14764    3036   14570    3037
 122.9 1T    3457    2888     842     876
       2T    6537    2924    1524     876
       4T   11095    2892    2119     861
       8T   11729    2916    2080     874
 12288 1T    2475    1679      81      78
       2T    4155    1713     163      73
       4T    5503    1711     285      89
       8T    4519    1717     281      89

          Total Elapsed Time   48.1 seconds


 #################### A1 Original #######################

 A1 Quad Core 1.86 GHz Intel Atom Z3745, Android 4.4
   Dual Channel LPDDR3-1066 Bandwidth 17.1 GB/s
            4 x 24 KB L1, 2 x 1 MB L2

 Android MP-RndMem2 Benchmark V2.1 06-May-2015 12.14

   MB/Second Using 1, 2, 4 and 8 Threads
   KB       SerRD SerRDWR   RndRD RndRDWR

 12.29 1T    1337    2505    1337    2509
       2T    2637    2513    2657    2521
       4T    3535    2420    3484    2454
       8T    3195    2403    3088    2406
 122.9 1T    1305    2280     963    1758
       2T    2581    2285    1945    1748
       4T    3588    2130    3125    1740
       8T    3211    2269    2949    1745
 12288 1T    1248    1962     101     215
       2T    2469    1940     191     214
       4T    3462    1954     323     214
       8T    3127    1926     318     212

          Total Elapsed Time   43.7 seconds

 ################## A1 V1 Android 5.0 ###################

 Android MP-RndMem2 Benchmark V2.1 05-Nov-2015 11.55

   MB/Second Using 1, 2, 4 and 8 Threads
   KB       SerRD SerRDWR   RndRD RndRDWR

 12.29 1T    5580    5533    5554    5455
       2T   10460    5393    8625    5336
       4T   15584    5013   12183    5211
       8T   14687    4850    9754    4882
 122.9 1T    4180    4368    2557    2522
       2T    8301    4276    5072    2511
       4T   15613    4238    7764    2425
       8T   14496    4259    7278    2466
 12288 1T    3360    2180     239     239
       2T    6219    2140     379     240
       4T    6758    2135     418     238
       8T    6991    2131     418     232

          Total Elapsed Time   47.6 seconds

 #################### A1 ARM-Intel ######################

 ARM/Intel MP-RndMem Benchmark V1.1 06-May-2015 11.54

   MB/Second Using 1, 2, 4 and 8 Threads
   KB       SerRD SerRDWR   RndRD RndRDWR

 12.29 1T    4643    3593    4710    3641
       2T    8583    3552    8761    3564
       4T   12707    3450   12496    3384
       8T   10410    3389   10796    3408
 122.9 1T    3733    2874    2408    2150
       2T    7259    2871    4781    2165
       4T   11726    2897    7656    2133
       8T   11673    2853    7100    2113
 12288 1T    3153    2087     226     238
       2T    5782    2073     327     238
       4T    6451    1997     447     236
       8T    6471    2071     446     233

          Total Elapsed Time   41.5 seconds


 ########### A5 ARM-Intel Dual Boot With W2 #############

   Intel Atom Z8300 quad core 1.44 GHz Turbo 1.84
          Android 5.1, 4 GB DDR 3 1600
             4 x 24 KB L1, 2 x 1 MB L2

 ARM/Intel MP-RndMem Benchmark V1.2 14-Apr-2016 17.41
           Compiled for 32 bit Intel x86

   MB/Second Using 1, 2, 4 and 8 Threads
  KB        SerRD SerRDWR   RndRD RndRDWR

 12.29 1T    4395    3558    4562    3346
       2T    8094    3465    7975    3372
       4T   11923    3377   11375    3220
       8T   10165    3207   10220    3205
 122.9 1T    3519    2796    2360    1993
       2T    6875    2591    4233    1970
       4T   10225    2761    5943    1935
       8T   10158    2755    6363    2052
 12288 1T    2586    1846     187     192
       2T    3890    1728     310     213
       4T    5035    1986     373     194
       8T    3972    1887     359     186

          Total Elapsed Time   44.0 seconds


 ################# W1 Windows 10 32 bit #################

   Intel Atom Z8300 quad core 1.44 GHz Turbo 1.84
  Windows 10 4 GB DDR3 1600 dual channel 12.8 GB/s

   MPRandMem32 From C/C++ 18.00.21005.1 for x86

      Start of test Tue Feb 23 16:32:45 2016

   MB/Second Using 1, 2, 4 and 8 Threads
   KB       SerRD SerRDWR   RndRD RndRDWR

  12.3 1T    3579    4426    3810    4392
       2T    6705    4466    6222    3999
       4T    8756    4276    8634    4208
       8T    7881    4522    7667    4559
 122.9 1T    3038    3157    1830    1926
       2T    5168    3298    3378    1962
       4T    9739    3190    5510    1917
       8T   10773    3522    6569    2147
 12288 1T    2212    1944     212     230
       2T    4813    2513     350     234
       4T    5409    2568     514     257
       8T    6343    2396     534     236

          End of test Tue Feb 23 16:33:28 2016

 ################# W1 Windows 10 64 bit #################

   MPRandMem64 From C/C++ 18.00.21005.1 for x64

      Start of test Tue Feb 23 16:34:58 2016

   MB/Second Using 1, 2, 4 and 8 Threads
   KB       SerRD SerRDWR   RndRD RndRDWR

  12.3 1T    3939    4626    3289    4071
       2T    6506    3974    6369    3885
       4T   12623    3903   12118    3808
       8T   11890    3548   12392    3683
 122.9 1T    3126    3144    2110    2102
       2T    5601    3058    4076    1998
       4T    9730    2974    6727    1934
       8T   10426    2982    6760    1874
 12288 1T    2407    1806     227     249
       2T    4526    2282     364     235
       4T    5573    2160     481     245
       8T    6040    2304     557     247

          End of test Tue Feb 23 16:35:43 2016


 ######## W2 Windows 10 32 bit Dual Boot With A5 ########

  Intel Atom Z8300 quad core 1.44 GHz Turbo 1.84
          Android 5.1, 4 GB DDR 3 1600
             4 x 24 KB L1, 2 x 1 MB L2
 
 MPRandMem32 From C/C++ 18.00.21005.1 for x86

  MB/Second Using 1, 2, 4 and 8 Threads
  KB       SerRD SerRDWR   RndRD RndRDWR

 12.3 1T    4007    4947    4173    4934
      2T    7649    4864    7601    4856
      4T    9434    4785    8546    4760
      8T    7251    4551    7335    4551
122.9 1T    3186    3554    2128    2130
      2T    6264    3612    3886    2163
      4T   10021    3650    6521    2422
      8T    9185    3993    6634    2457
12288 1T    2869    2146     208     221
      2T    4116    2139     317     222
      4T    5210    2149     387     221
      8T    4897    1907     378     192

          End of test Fri Apr 15 16:27:44 2016

 ######## W2 Windows 10 64 bit Dual Boot With A5 ########

 MPRandMem64 From C/C++ 18.00.21005.1 for x64

      Start of test Fri Apr 15 16:37:15 2016

  MB/Second Using 1, 2, 4 and 8 Threads
  KB       SerRD SerRDWR   RndRD RndRDWR

 12.3 1T    3981    4466    3901    4354
      2T    6715    4593    6536    4441
      4T   12576    4447   12493    4373
      8T   10432    4399   10103    4391
122.9 1T    3238    3528    2488    2340
      2T    5875    3651    4422    2329
      4T   11142    3680    6866    2355
      8T    8803    3657    5702    2341
12288 1T    2576    2131     214     209
      2T    4088    2123     339     222
      4T    4850    2133     407     223
      8T    4963    2142     405     225

          End of test Fri Apr 15 16:38:00 2016


 ==============================================
   Top end 2015 PC - Core i7-4820K at 3.9 GHz 
   Quad core, 8 threads, 10 MB shared L3 cache
   RAM 1600 MHz, quad channel, 51.2 GB/sec
 ==============================================

  Intel/Windows 32 Bit Version 
 
   MPRandMem32 From C/C++ 18.00.21005.1 for x86

      Start of test Tue Feb 23 16:05:00 2016

   MB/Second Using 1, 2, 4 and 8 Threads
   KB       SerRD SerRDWR   RndRD RndRDWR

  12.3 1T   26590   29369   27321   27593
       2T   52121   30063   48980   27757
       4T   66651   29464   72519   27466
       8T   58774   28464   57426   26236
 122.9 1T   25876   28670   13416    8815
       2T   46692   28183   21803    8767
       4T   82678   28469   46885    8497
       8T   83158   28482   49158    8677
 12288 1T   16527   13042    1196    1191
       2T   27888   12767    2389    1188
       4T   49291   13049    3393    1191
       8T   84109   12954    4176    1192

          End of test Tue Feb 23 16:05:41 2016

  Intel/Windows 64 Bit Version

   MPRandMem64 From C/C++ 18.00.21005.1 for x64

      Start of test Tue Feb 23 16:06:04 2016

   MB/Second Using 1, 2, 4 and 8 Threads
   KB       SerRD SerRDWR   RndRD RndRDWR

  12.3 1T   26322   28220   25930   28695
       2T   54658   30081   39512   27874
       4T   99694   29950   89274   27925
       8T   88620   29773   85848   27924
 122.9 1T   25196   27993   13424    8633
       2T   44627   28207   21816    8785
       4T   65329   28108   44155    8620
       8T   91445   28208   53751    8715
 12288 1T   17662   13110    1301    1198
       2T   32242   12856    2595    1198
       4T   57536   13117    4905    1197
       8T   85697   13079    4645    1197

          End of test Tue Feb 23 16:06:46 2016

   

To Start


MP-MFLOPS Benchmark - MP-MFLOPS2i, MPmflops32.exe, MPmflops64.exe

The arithmetic operations executed are of the form x[i] = (x[i] + a) * b - (x[i] + c) * d + (x[i] + e) * f with 2 and 32 operations per input data word, using 1, 2, 4 and 8 threads. Data sizes are limited to three to use L1 cache, L2 cache and RAM at 12.8, 128 and 12800 KB (3200, 32000 and 3200000 single precision floating point words). Each thread uses the same calculations but accessing different segments of the data. The program checks for consistent numeric results, primarily to show that all calculations are carried out and can be run. The numeric results start with values of 1.0, with subsequent calculations reducing the values, the amount depending on the number of calculations. Further details, results and links to download original MP-MFLOPS benchmark can be found here, with more details of the latest MP-MFLOP2S compilations here. The newer versions have longer running times that avoid inconsistent speeds produced by the original.

Using Tablet A1, with the Intel Atom CPU, the original ARM only version was much slower than the native code variety, at 32 operations per word, and running via Android 5.0 was not much faster. Similarly, there was little difference on ARM based systems, between the original and later compilations.

Tablet T22 results, from the 64 bit compilation, showed that it could be much faster than the 32 bit benchmark, up to 3.7 times at 2 operations per word. The reason is that 64 bit vector SIMD instructions were produced, instead of scalars.

MFLOPS/MHz Comparisons - These are provided to compare different CPU technology. None of these are particularly good, the best being The Cortex A53 at 64 bits, producing just over 1 result per cycle per CPU.

Intel/Windows Versions - The compiler used for these appears to be somewhat more advanced than that used for Intel/Android, implementing full SIMD SSE instructions for 64 bit and 32 bit benchmarks. The result is that a Z8300 Atom CPU core produced up to 1.66 MFLOPS/MHz. The maximum speed of a Core i7, using SSE instructions, is 4 multiplies and 4 linked adds per cycle (8 MFLOPS/MHz). This benchmark demonstrated more than 5.5 MFLOPS/MHz.

A5 and W2 Dual Boot Tablet - The Windows compilation is much faster than the Android version, as SSE SIMD type instructions are used. For comparable performance see A5 results below in section NEON-MFLOPS-MP Benchmark. This uses hand coded NEON intrinsic functions, rather than compiler generated machine code. Speeds from the 64 bit version appear to be somewhat faster than the 32 bit variety. However, note that there can ve wide variations in recorded results.


 ##################### T7 Original ######################

 T7,   ARM Cortex-A9 1200 MHz, Android 4.1.2, 

 Android MP-MFLOPS2 Benchmark V2.1 05-Feb-2015 11.37

     FPU Add & Multiply using 1, 2, 4 and 8 Threads
         2 Ops/Word              32 Ops/Word
 KB     12.8     128   12800    12.8     128   12800
 MFLOPS
 1T      182     156     114     598     578     572
 2T      365     321     194    1194    1163    1141
 4T      716     655     233    2367    2316    2240
 8T      717     682     233    2347    2371    2246

          Total Elapsed Time  135.5 seconds

 #################### T7 ARM-Intel #####################

 ARM/Intel MP-MFLOPS2 Benchmark V2.1 28-Apr-2015 17.44

     FPU Add & Multiply using 1, 2, 4 and 8 Threads
         2 Ops/Word              32 Ops/Word
 KB     12.8     128   12800    12.8     128   12800
 MFLOPS
 1T      188     156     116     598     578     574
 2T      365     319     197    1195    1161    1145
 4T      682     709     237    2372    2345    2249
 8T      678     731     237    2361    2381    2254

           Total Elapsed Time  135.0 seconds


 #################### T11 Original #####################

 T11 Samsung EXYNOS 5250 1.7 GHz Cortex-A15, Android 4.2.2

 Android MP-MFLOPS2 Benchmark V2.1 29-Apr-2015 10.22

     FPU Add & Multiply using 1, 2, 4 and 8 Threads
         2 Ops/Word              32 Ops/Word
 KB     12.8     128   12800    12.8     128   12800
 MFLOPS
 1T      845     817     544    1546    1539    1512
 2T     1593    1668     648    3140    3067    2977
 4T     1974    1775     645    2963    3093    2845
 8T     1935    2059     652    3108    3147    2985

          Total Elapsed Time   58.5 seconds

 #################### T11 ARM-Intel ####################

 ARM/Intel MP-MFLOPS2 Benchmark V2.1 28-Apr-2015 20.30

     FPU Add & Multiply using 1, 2, 4 and 8 Threads
         2 Ops/Word              32 Ops/Word
 KB     12.8     128   12800    12.8     128   12800
 MFLOPS
 1T      695     756     536    1537    1501    1476
 2T     1319    1527     645    3151    3077    3000
 4T     1604    1567     657    3035    3095    2997
 8T     1604    1639     658    3108    3125    2996
 
         Total Elapsed Time  59.1 seconds


 #################### T21 Original #####################

 T21 Qualcomm Snapdragon 800 2150 MHz, Android 4.4.4

Android MP-MFLOPS2 Benchmark V2.1 05-Jul-2015 15.35

    FPU Add & Multiply using 1, 2, 4 and 8 Threads
        2 Ops/Word              32 Ops/Word
 KB     12.8     128   12800    12.8     128   12800
 MFLOPS
 1T      718     781     590    1214    1220    1228
 2T     1572    1583    1118    2406    2436    2442
 4T     2338    2959    1836    4867    4911    4859
 8T     3148    3266    1866    4870    4916    4888

          Total Elapsed Time   56.4 seconds

 #################### T21 ARM-Intel #################### 

 ARM/Intel MP-MFLOPS2 Benchmark V2.1 05-Jul-2015 16.50

    FPU Add & Multiply using 1, 2, 4 and 8 Threads
        2 Ops/Word              32 Ops/Word
 KB     12.8     128   12800    12.8     128   12800
 MFLOPS
 1T      822     768     636    1232    1228    1231
 2T     1662    1637    1184    2460    2463    2446
 4T     2509    3216    1659    4519    4762    4900
 8T     2965    3193    1881    4847    4925    4880


 ###################### T22 32 Bit ######################

  T22, Quad Core ARM Cortex-A53 1300 MHz, Android 5.0.2 

 ARM/Intel MP-MFLOPS2 Benchmark V2.2 09-Aug-2015 21.17
            Compiled for 32 bit ARM v7a

     FPU Add & Multiply using 1, 2, 4 and 8 Threads
         2 Ops/Word              32 Ops/Word
 KB     12.8     128   12800    12.8     128   12800
 MFLOPS
 1T      190     190     184     670     672     664
 2T      377     378     370    1343    1345    1329
 4T      707     755     725    2657    2669    2621
 8T      722     736     714    2640    2672    2631

           Total Elapsed Time  113.0 seconds

###################### T22 64 Bit ######################

 ARM/Intel MP-MFLOPS2 Benchmark V2.2 09-Aug-2015 21.24
            Compiled for 64 bit ARM v8a

     FPU Add & Multiply using 1, 2, 4 and 8 Threads
         2 Ops/Word              32 Ops/Word
 KB     12.8     128   12800    12.8     128   12800
 MFLOPS
 1T      705     701     636    1398    1394    1362
 2T     1376    1395     942    2794    2797    2757
 4T     2063    2602     962    5491    5546    5336
 8T     2474    2611     957    5367    5500    5417

           Total Elapsed Time   51.6 seconds


 #################### A1 Original #######################

 A1 Quad Core 1.86 GHz Intel Atom Z3745, Android 4.4
   Dual Channel LPDDR3-1066 Bandwidth 17.1 GB/s

 Android MP-MFLOPS2 Benchmark V2.1 04-Feb-2015 11.03

     FPU Add & Multiply using 1, 2, 4 and 8 Threads
         2 Ops/Word              32 Ops/Word
 KB     12.8     128   12800    12.8     128   12800
 MFLOPS
 1T      502     501     476     575     575     573
 2T     1012     975     921    1133    1140    1115
 4T     1571    1627     979    2238    2255    2258
 8T     1550    1890    1007    2235    2239    2217

          Total Elapsed Time  117.4 seconds

 ################## A1 V1 Android 5.0 ##################

 Android MP-MFLOPS2 Benchmark V2.1 05-Nov-2015 11.59

     FPU Add & Multiply using 1, 2, 4 and 8 Threads
         2 Ops/Word              32 Ops/Word
 KB     12.8     128   12800    12.8     128   12800
 MFLOPS
 1T      607     586     559     556     553     555
 2T     1174    1153    1057    1111    1115    1112
 4T     1539    2220     992    2181    2207    2179
 8T     1736    2097    1011    2184    2194    2178

          Total Elapsed Time  119.2 seconds

 #################### A1 ARM-Intel ######################

 ARM/Intel MP-MFLOPS2 Benchmark V2.1 28-Apr-2015 17.24

     FPU Add & Multiply using 1, 2, 4 and 8 Threads
         2 Ops/Word              32 Ops/Word
 KB     12.8     128   12800    12.8     128   12800
 MFLOPS
 1T      695     696     661    1061    1061    1055
 2T     1335    1382    1058    2088    2086    2102
 4T     1832    2635     979    3993    4125    4145
 8T     2026    2557    1007    3842    4044    4110

         Total Elapsed Time   65.8 seconds


 ########### A5 ARM-Intel Dual Boot With W2 #############

   Intel Atom Z8300 quad core 1.44 GHz Turbo 1.84
          Android 5.1, 4 GB DDR 3 1600
             4 x 24 KB L1, 2 x 1 MB L2

 ARM/Intel MP-MFLOPS2 Benchmark V2.2 14-Apr-2016 17.53
           Compiled for 32 bit Intel x86

     FPU Add & Multiply using 1, 2, 4 and 8 Threads
         2 Ops/Word              32 Ops/Word
 KB     12.8     128   12800    12.8     128   12800
 MFLOPS
 1T      422     450     401     945     964     939
 2T      795     849     754    1809    1859    1815
 4T     1161    1514    1084    3043    3159    3144
 8T     1141    1376    1065    3173    3241    3234

          Total Elapsed Time   78.8 seconds


 ################# W1 Windows 10 32 bit #################

   Intel Atom Z8300 quad core 1.44 GHz Turbo 1.84
         Windows 10, 4 GB DDR 3 1600

    MP-MFLOPS From C/C++ 18.00.21005.1 for x86

      FPU Add & Multiply using 1, 2, 4 and 8 Threads
        2 Ops/Word              32 Ops/Word
 KB     12.8     128   12800    12.8     128   12800
 MFLOPS
 1T     1467    1388    1215    2537    2529    2486
 2T     2773    2825    1659    4937    4958    4740
 4T     3334    4845    1512    8453    8813    8694
 8T     2818    5068    1575    8338    8896    8627

 ################# W1 Windows 10 64 bit #################

    MP-MFLOPS From C/C++ 18.00.21005.1 for x64

      FPU Add & Multiply using 1, 2, 4 and 8 Threads
        2 Ops/Word              32 Ops/Word
 KB     12.8     128   12800    12.8     128   12800
 MFLOPS
 1T     1470    1471    1252    2936    3060    2996
 2T     2775    2982    1653    5593    5860    5680
 4T     3610    5290    1520    9401   10488   10326
 8T     3132    5178    1562    8957    8365   10433


 ######## W2 Windows 10 32 bit Dual Boot With A5 ########

  Intel Atom Z8300 quad core 1.44 GHz Turbo 1.84
          Android 5.1, 4 GB DDR 3 1600
             4 x 24 KB L1, 2 x 1 MB L2

    MP-MFLOPS From C/C++ 18.00.21005.1 for x86

      Start of test Sat May 21 19:10:08 2016

      FPU Add & Multiply using 1, 2, 4 and 8 Threads
        2 Ops/Word              32 Ops/Word
 KB     12.8     128   12800    12.8     128   12800
 MFLOPS
 1T     1415    1346     968    2368    2274    2227
 2T     2336    2436     857    4460    4433    4181
 4T     2718    4196    1046    7192    7984    7678
 8T     3073    3220    1071    6133    8773    6413

 ######## W2 Windows 10 64 bit Dual Boot With A5 ########
 
   MP-MFLOPS From C/C++ 18.00.21005.1 for x64

      Start of test Fri Apr 15 16:41:27 2016

      FPU Add & Multiply using 1, 2, 4 and 8 Threads
        2 Ops/Word              32 Ops/Word
 KB     12.8     128   12800    12.8     128   12800
 MFLOPS
 1T     1560    1584    1034    2952    2965    2877
 2T     2590    2757    1160    5369    5862    5333
 4T     3852    5094    1090    9407   10478   10331
 8T     3480    4973    1133    7748   10417    7742


 ################# PC Windows 10 32 bit #################

    Top end 2015 PC - Core i7-4820K at 3.9 GHz 

    MP-MFLOPS From C/C++ 18.00.21005.1 for x86

      FPU Add & Multiply using 1, 2, 4 and 8 Threads
        2 Ops/Word              32 Ops/Word
 KB     12.8     128   12800    12.8     128   12800
 MFLOPS
 1T    11945   10323    6088   21760   21813   21691
 2T    18020   20096   11072   34309   43919   45673
 4T    25662   42897   13955   55831   89194   90429
 8T    22256   49955   14299   80928   90240   88848

 ################# PC Windows 10 64 bit #################

    MP-MFLOPS From C/C++ 18.00.21005.1 for x64

      FPU Add & Multiply using 1, 2, 4 and 8 Threads
        2 Ops/Word              32 Ops/Word
 KB     12.8     128   12800    12.8     128   12800
 MFLOPS
 1T    14218   12522    6044   22097   22201   22087
 2T    21473   24706   11189   42464   44797   46061
 4T    24241   28250   15774   59471   90548   81144
 8T    27512   57442   14238   82808   92377   92959


################ Comparison MFLOPS/MHz ################

     FPU Add & Multiply using 1, 2, 4 and Threads
                     2 Ops/Word          32 Ops/Word
        KB       12.8    128  12800   12.8    128  12800

 Quad      1T    0.16   0.13   0.10   0.50   0.48   0.48
 Cortex    2T    0.30   0.27   0.16   1.00   0.97   0.95
 A9        4T    0.57   0.59   0.20   1.98   1.95   1.87

 Dual      1T    0.41   0.44   0.32   0.90   0.88   0.87
 Cortex    2T    0.78   0.90   0.38   1.85   1.81   1.76
 A15       4T    0.94   0.92   0.39   1.79   1.82   1.76

 Quad      1T    0.26   0.24   0.20   0.39   0.39   0.39
 Qualcomm  2T    0.53   0.52   0.38   0.78   0.78   0.78
 800       4T    0.80   1.02   0.53   1.43   1.51   1.56

 Quad      1T    0.15   0.15   0.14   0.52   0.52   0.51
 Cortex    2T    0.29   0.29   0.28   1.03   1.03   1.02
 A53 32b   4T    0.54   0.58   0.56   2.04   2.05   2.02

 Quad      1T    0.54   0.54   0.49   1.08   1.07   1.05
 Cortex    2T    1.06   1.07   0.72   2.15   2.15   2.12
 A53 64b   4T    1.59   2.00   0.74   4.22   4.27   4.10

 Quad      1T    0.37   0.37   0.36   0.57   0.57   0.57
 Atom      2T    0.72   0.74   0.57   1.12   1.12   1.13
 Z3745     4T    0.98   1.42   0.53   2.15   2.22   2.23

 Windows

 Quad      1T    0.80   0.75   0.66   1.38   1.37   1.35
 Atom      2T    1.51   1.54   0.90   2.68   2.69   2.58
 Z8300 32b 4T    1.81   2.63   0.82   4.59   4.79   4.73

 Quad      1T    0.80   0.80   0.68   1.60   1.66   1.63
 Atom      2T    1.51   1.62   0.90   3.04   3.18   3.09
 Z8300 64b 4T    1.96   2.88   0.83   5.11   5.70   5.61

 Quad + HT 1T    3.06   2.65   1.56   5.58   5.59   5.56
 Core i7   2T    4.62   5.15   2.84   8.80  11.26  11.71
 4820K 32b 4T    6.58  11.00   3.58  14.32  22.87  23.19
           8T    5.71  12.81   3.67  20.75  23.14  22.78

 Quad + HT 1T    3.65   3.21   1.55   5.67   5.69   5.66
 Core i7   2T    5.51   6.33   2.87  10.89  11.49  11.81
 4820K 64b 4T    6.22   7.24   4.04  15.25  23.22  20.81
           8T    7.05  14.73   3.65  21.23  23.69  23.84
    

To Start


NEON-MFLOPS-MP Benchmark - NEON-MFLOPS2i-MP.apk

NEON-MFLOPS-MP carries out the same calculations as MP-MFLOPS Benchmarks above, but with NEON intrinsic functions used for all calculations. For further results see here. The effect of using these functions, instead of leaving it to the compiler, is that 32 bit performance, on ARM based systems, was similar between the original and new benchmarks.

T22 NEON 64 bit compilation produced a small performance gain over 32 bit results, at 2 operations per word, but near double speed at 32 operations, the latter benefiting from availability of sufficient registers for all the variables.

On the Intel Atom based tablet A1, via the ARM to Intel conversion layer, performance was similar via Android 4 and 5, but the native code version was more than twice as fast at 32 operations per word.

MFLOPS/MHz Comparisons are also provided, including examples on maximum speeds from the non-NEON version, demonstrating NEON gains of up to twice as fast. A result submitted for P33, with an ARM Cortex-A57 produced the best single core performance (at November 2015) of 3.47 results per cycle at 64 bits, followed by the Cortex-A53 at 2.13. This is still disappointing, compared with Intel desktop processors, such as the Core 2 onwards, at 6 per clock cycle out of a maximum of 8, with SSE SIMD code (See Linux results).

Following the performance details are the numeric results of calculations from the fixed parameters used in the new version, for both ARM and Intel. It seems that Tablet T11 has an intermittent fault, as it occasionally fails to calculate a correct answer or causes the Tablet to crash and reboot. Now, this also appears to happen using the older version. The benchmark appeared to run successfully with an Energy Saving On setting, where performance was much slower and CPU MHz was measured as 1000 MHz instead of 1700 (see results below).

 
 ##################### T7 Original ######################

  T7, ARM Cortex-A9 1200 MHz, Android 4.1.2, 

 Android NEON-MFLOPS-MP Benchmark V1.0 20-Dec-2012 16.57

   FPU Add & Multiply using 1, 2, 4 and 8 Threads
        2 Ops/Word              32 Ops/Word
 KB     12.8     128   12800    12.8     128   12800
 MFLOPS
 1T      532     402     124    1135    1044     960
 2T     1255     798     213    2041    1987    1916
 4T     2441    1553     229    4185    4034    3450
 8T     1922    2403     226    3774    3996    3346

          Total Elapsed Time    4.5 seconds

 #################### T7 ARM-Intel #####################

 ARM/Intel NEON-MFLOPS2-MP Benchmark V2.1 13-May-2015 12.24

     FPU Add & Multiply using 1, 2, 4 and 8 Threads
         2 Ops/Word              32 Ops/Word
 KB     12.8     128   12800    12.8     128   12800
 MFLOPS
 1T      657     407     132    1077    1074    1053
 2T     1265     817     222    2147    2150    2078
 4T     2024    1695     234    4214    4276    3555
 8T     2435    2495     234    4196    4100    3523

          Total Elapsed Time   39.0 seconds


 #################### T11 Original #####################

 T11 Samsung EXYNOS 5250 1.7 GHz Cortex-A15, Android 4.2.2
                    Dual Core

 Android NEON-MFLOPS-MP Benchmark V1.1 13-Sep-2013 13.44

         2 Ops/Word              32 Ops/Word
 KB     12.8     128   12800    12.8     128   12800
 MFLOPS
 1T     1847    1415     597    3772    4096    3545
 2T     3649    3309     664    8065    7966    7505
 4T     3670    3922     658    7753    8148    7490
 8T     5664    5570     681    8092    8355    7672
 
        Total Elapsed Time   13.0 seconds

 #################### T11 ARM-Intel ####################

 ARM/Intel NEON-MFLOPS2-MP Benchmark V2.1 13-May-2015 12.07

     FPU Add & Multiply using 1, 2, 4 and 8 Threads
         2 Ops/Word              32 Ops/Word
 KB     12.8     128   12800    12.8     128   12800
 MFLOPS
 1T     1965    1630     582    3792    4077    3521
 2T     3789    2690     663    8497    8133    7297
 4T     5714    4883     654    8364    8192    7554
 8T     5414    6316     673    7976    8437    6635

          Total Elapsed Time   13.0 seconds


 ######## T11 ARM-Intel Power Saving On 1.0 GHz ########

 ARM/Intel NEON-MFLOPS2-MP Benchmark V2.1 13-Nov-2015 16.55

     FPU Add & Multiply using 1, 2, 4 and 8 Threads
         2 Ops/Word              32 Ops/Word
 KB     12.8     128   12800    12.8     128   12800
 MFLOPS
 1T     1935    1290     645    2516    2397    2339
 2T     3664    2644     684    4945    4780    4657
 4T     3436    3337     690    4911    4931    4674
 8T     3133    3543     689    4818    4959    4651

          Total Elapsed Time   19.2 seconds


 #################### T21 Original #####################

 T21 Qualcomm Snapdragon 800 2150 MHz, Android 4.4.4
     Dual Channel 32 Bit LPDDR3-1866 RAM 14.9 GB/s

 Android NEON-MFLOPS2-MP Benchmark V2.1 25-Jul-2015 18.44

    FPU Add & Multiply using 1, 2, 4 and 8 Threads
        2 Ops/Word              32 Ops/Word
 KB     12.8     128   12800    12.8     128   12800
 MFLOPS
 1T     2757    2576     771    2808    2825    2800
 2T     5662    5525    1516    5631    5664    5570
 4T     6550    7846    1945   11167   11281   10939
 8T    10273   10928    1981   10851   11211   11350

          Total Elapsed Time   40.0 seconds

 #################### T21 ARM-Intel #################### 

 ARM/Intel NEON-MFLOPS2-MP Benchmark V2.1 28-Jun-2015 16.32

    FPU Add & Multiply using 1, 2, 4 and 8 Threads
        2 Ops/Word              32 Ops/Word
 KB     12.8     128   12800    12.8     128   12800
 MFLOPS
 1T     3049    2857     622    2923    2874    2098
 2T     5508    4887    1009    5477    5736    4349
 4T     5643    5282    1410   11244   11601    8564
 8T     9294   11156    1681   11288   11605    8946

          Total Elapsed Time   14.0 seconds


 ###################### T22 32 Bit ######################

  T22, Quad Core ARM Cortex-A53 1300 MHz, Android 5.0.2 

 ARM/Intel NEON-MFLOPS2-MP Benchmark V2.2 13-Aug-2015 16.35
           Compiled for 32 bit ARM v7a

     FPU Add & Multiply using 1, 2, 4 and 8 Threads
         2 Ops/Word              32 Ops/Word
 KB     12.8     128   12800    12.8     128   12800
 MFLOPS
 1T      619     613     575    1444    1446    1426
 2T     1174    1206     889    2894    2902    2839
 4T     1585    1616     901    5679    5726    5596
 8T     2075    2130     944    5400    5585    5519

          Total Elapsed Time   25.8 seconds

 ###################### T22 64 Bit ######################

 ARM/Intel NEON-MFLOPS2-MP Benchmark V2.2 13-Aug-2015 16.38
           Compiled for 64 bit ARM v8a

      FPU Add & Multiply using 1, 2, 4 and 8 Threads
        2 Ops/Word              32 Ops/Word
 KB     12.8     128   12800    12.8     128   12800
 MFLOPS
 1T      726     745     647    2766    2774    2639
 2T     1397    1402     903    5523    5552    5371
 4T     1871    1930     898   10780   10479   10439
 8T     2496    2876    1011    9736   10679    9900

          Total Elapsed Time   15.1 seconds


##################### P33 64 Bit ##################### 

 P33 Quad-core 2 GHz Qualcomm Snapdragon 810, Android 5.0.2 
            4 x Cortex-A57 and 4 x Cortex-A53

 ARM/Intel NEON-MFLOPS2-MP Benchmark V2.2 16-Sep-2015 17.59
           Compiled for 64 bit ARM v8a

     FPU Add & Multiply using 1, 2, 4 and 8 Threads
         2 Ops/Word              32 Ops/Word
 KB     12.8     128   12800    12.8     128   12800
 MFLOPS
 1T     2811    3126    1089    6943    6589    6342
 2T     2488    4114    1541   12084   10559    8809
 4T     4759    5480    2038   16516   14826   11960
 8T     4840    8985    2452   22082   23563   12461

          Total Elapsed Time    7.6 seconds


 #################### A1 Original #######################

 A1 Quad Core 1.86 GHz Intel Atom Z3745, Android 4.4
   Dual Channel LPDDR3-1066 Bandwidth 17.1 GB/s

 Android NEON-MFLOPS2-MP Benchmark V2.1 07-Feb-2015 18.38

    FPU Add & Multiply using 1, 2, 4 and 8 Threads
        2 Ops/Word              32 Ops/Word
 KB     12.8     128   12800    12.8     128   12800
 MFLOPS
 1T     1796    1520    1025    1231    1228    1227
 2T     3354    2959    1047    2427    2445    2445
 4T     4627    5508     978    4690    4791    4733
 8T     3861    6307    1030    4611    4869    4742

          Total Elapsed Time   88.3 seconds

 ################## A1 V1 Android 5.0 ##################

 Android NEON-MFLOPS2-MP Benchmark V2.1 05-Nov-2015 12.09

     FPU Add & Multiply using 1, 2, 4 and 8 Threads
         2 Ops/Word              32 Ops/Word
 KB     12.8     128   12800    12.8     128   12800
 MFLOPS
 1T     1969    1913     832    1230    1245    1225
 2T     3537    3632    1046    2482    2487    2445
 4T     3388    6497     982    4546    4847    4819
 8T     4197    6863    1026    4640    4899    4828

          Total Elapsed Time   87.7 seconds


 #################### A1 ARM-Intel ######################

 ARM/Intel NEON-MFLOPS2-MP Benchmark V2.1 13-May-2015 12.17

     FPU Add & Multiply using 1, 2, 4 and 8 Threads
         2 Ops/Word              32 Ops/Word
 KB     12.8     128   12800    12.8     128   12800
 MFLOPS
 1T     2151    1962    1064    2619    2694    2650
 2T     4421    3849    1048    5296    5463    5343
 4T     5886    6652     982    9592   10735   10362
 8T     3744    7284    1018    9085   10791    9493

          Total Elapsed Time   13.8 seconds

 
 #################### A5 ARM Intel ######################

   Intel Atom Z8300 quad core 1.44 GHz Turbo 1.84
          Android 5.1, 4 GB DDR 3 1600
             4 x 24 KB L1, 2 x 1 MB L2

 ARM/Intel NEON-MFLOPS2-MP Benchmark V2.2 14-Apr-2016 17.57
           Compiled for 32 bit Intel x86

     FPU Add & Multiply using 1, 2, 4 and 8 Threads
         2 Ops/Word              32 Ops/Word
 KB     12.8     128   12800    12.8     128   12800
 MFLOPS
 1T     1501    1551    1030    2520    2485    2301
 2T     2300    2957    1161    4699    4999    4632
 4T     3106    5126    1097    7929    8173    8015
 8T     2692    4623    1108    7830    8432    7989

          Total Elapsed Time   15.7 second


################ Comparison MFLOPS/MHz ################

               FPU Add & Multiply using 1, 2, 4 and Threads
                     2 Ops/Word          32 Ops/Word     Normal
          KB    12.8    128  12800   12.8    128  12800    12.8

 A1       1T    1.16   1.05   0.57   1.41   1.45   1.42    0.57
 Atom     2T    2.38   2.07   0.56   2.85   2.94   2.87    1.12
 Z3745    4T    3.16   3.58   0.53   5.16   5.77   5.57    2.15

 T7       1T    0.55   0.34   0.11   0.90   0.90   0.88    0.50
 Cortex   2T    1.05   0.68   0.19   1.79   1.79   1.73    1.00
 A9       4T    1.69   1.41   0.20   3.51   3.56   2.96    1.98

 T11      1T    1.16   0.96   0.34   2.23   2.40   2.07    0.90
 Cortex   2T    2.23   1.58   0.39   5.00   4.78   4.29    1.85
 A15      4T    3.36   2.87   0.38   4.92   4.82   4.44    1.79

 T21      1T    0.97   0.91   0.20   0.93   0.91   0.67    0.39
 Qualcomm 2T    1.75   1.55   0.32   1.74   1.82   1.38    0.78
 800      4T    1.79   1.68   0.45   3.57   3.68   2.72    1.43

 T22      1T    0.48   0.47   0.44   1.11   1.11   1.10    0.52
 Cortex   2T    0.90   0.93   0.68   2.23   2.23   2.18    1.03
 A53 32b  4T    1.22   1.24   0.69   4.37   4.40   4.30    2.04

 T22      1T    0.56   0.57   0.50   2.13   2.13   2.03    1.08
 Cortex   2T    1.07   1.08   0.69   4.25   4.27   4.13    2.15
 A53 64b  4T    1.44   1.48   0.69   8.29   8.06   8.03    4.22

 P33      1T    1.41   1.56   0.54   3.47   3.29   3.17
 Cortex   2T    1.24   2.06   0.77   6.04   5.28   4.40
 A57 64b  4T    2.38   2.74   1.02   8.26   7.41   5.98


 ##################### New Results #####################

       Results x 100000, 12345 indicates ERRORS

       ARM/Intel NEON-MFLOPS2-MP Benchmark V2.1
 1T    44934   86735   99850   36770   79897   99759
 2T    44934   86735   99850   36770   79897   99759
 4T    44934   86735   99850   36770   79897   99759
 8T    44934   86735   99850   36770   79897   99759
 
 T11   44934   12345   99850   36770   79897   99759
 
       Android NEON-MFLOPS-MP Benchmark V1.1
 1T    86735   98519   99984   79897   97638   99975
 2T    86735   98519   99984   79897   97638   99975
 4T    86735   98519   99984   79897   97638   99975
 8T    86735   98519   99984   79897   97638   99975
 
       Android NEON-MFLOPS2-MP Benchmark V2.1 
 1T    40015   66980   99522   35216   54898   99234
 2T    40015   66980   99522   35216   54898   99234
 4T    40015   66980   99522   35216   54898   99234
 8T    40015   66980   99522   35216   54898   99234
  


To Start


OpenGL Benchmark - JavaOpenGL1.apk

The benchmark does not rely on complex visual scenes or mathematical functions. The objective being to generate moderate to excessive loading via multiple simple objects. It uses all Java code, with OpenGL ES GL10 statements, to measure graphics performance in Frames Per Second (FPS). Four tests draw a background of 50 cubes first as wireframes then colour shaded. The third test views the cubes in and out of a tunnel with slotted sides and roof, also containing rotating plates. The last test adds textures to the cubes and plates. The 50 cubes are redrawn 15, 30 and 60 times, with randomised positions, colours rotational settings. With 6 x 2 triangles per cube, minimum triangles per frame for the three sets of tests are 9000, 18000 and 36000.

An example of the last scene is on the right. The tunnel is provided to show 3D effects, the plates rotating in fixed positions. The numerous cubes are in the distant background, the tunnel slots showing that they are still there, with size varying according to proximity. The cubes appear more as jumping objects, with changing colours and position.

Android 5 has switched to ART virtual machine for Java, instead of Dalvik. First results indicate severe degradation in performance with this benchmark.

Further details and results can be found here. This includes information on Vertical Synchronisation (VSYNC) that limits Frames Per Second (FPS) to 60 and can lead to heavier loading reducing speed in 50% steps. as is apparent in the results below. Links to my Windows and Linux OpenGL benchmarks are also provided.


OpenGL Screen

On tablets A1 and T7 Android was upgraded to version 5.0, leading a reduction in measured speeds by up to 50%, possibly suggesting that VSYNC had change to 30 FPS. The graphics in A5 appear to be slightly faster than A1, but maximum speed appears to be similarly restricted to 30 FPS.

Except for tablet T15, none of the results are particularly good at the heavier loading. T15 results were also produced via Android 5, with several measurements at near 60 FPS, suggesting that speed reductions on the other tablets are not solely dependent on Android 5.


 ########################## A1 ##########################

 A1 Asus MemoPad 7, Quad Core 1.86 GHz Intel Atom Z3745
        Intel HD Graphics, Android 4.4.2 

 Android Java OpenGL Benchmark 21-Dec-2014 16.30

            --------- Frames Per Second --------
 Triangles WireFrame   Shaded  Shaded+ Textured

    9000+      37.95    37.64    29.86    23.63
   18000+      19.44    19.70    17.26    13.26
   36000+       9.99     9.93     9.35     7.17

      Screen Pixels 1280 Wide 736 High

      Total Elapsed Time  120.6 seconds

 #################### A1 Android 5.0 ####################
 
 Android Java OpenGL Benchmark 10-Oct-2015 13.44

           --------- Frames Per Second --------
 Triangles WireFrame   Shaded  Shaded+ Textured

    9000+      25.87    25.89    20.27    16.29
   18000+      13.43    13.56    11.72     9.38
   36000+       6.92     6.73     6.32     4.98

      Screen Pixels 800 Wide 1216 High

      Total Elapsed Time  120.9 seconds


 #################### A5 Android 5.1 ######################

   Intel Atom Z8300 quad core 1.44 GHz Turbo 1.84
        Intel HD Graphics, Android 5.1 

 Android Java OpenGL Benchmark 21-May-2016 13.00

           --------- Frames Per Second --------
 Triangles WireFrame   Shaded  Shaded+ Textured

    9000+      29.77    30.17    22.58    18.54
   18000+      16.09    16.03    13.70    10.78
   36000+       8.31     8.27     7.79     5.76

      Screen Pixels 2048 Wide 1440 High

      Total Elapsed Time  121.0 seconds


 ########################## T7 ##########################

 T7 Nexus 7 Quad 1200 MHz Cortex-A9, Android 4.1.2
   nVidia ULP GeForce Graphics 12 core, 416 MHz

 Android Java OpenGL Benchmark 06-Mar-2013 21.51

           --------- Frames Per Second --------
 Triangles WireFrame   Shaded  Shaded+ Textured
 
   9000+      42.18    43.57    33.38    23.54
  18000+      23.68    23.47    19.91    13.38
  36000+      12.05    11.95    11.00     7.10

      Screen Pixels 1280 Wide 736 High

      Total Elapsed Time  121.0 seconds

 #################### T7 Android 5.0 ####################

 Android Java OpenGL Benchmark 12-Oct-2015 16.06

           --------- Frames Per Second --------
 Triangles WireFrame   Shaded  Shaded+ Textured

    9000+      22.61    23.23    17.71    13.46
   18000+      12.03    12.11    10.36     7.57
   36000+       6.14     6.01     5.64     4.03

      Screen Pixels 1280 Wide 736 High

      Total Elapsed Time  121.5 seconds


 ########################## T11 #########################

 T11 Samsung EXYNOS 5250 Dual 1.7 GHz Cortex-A15, Android 4.2.2
             Mali-T604 Quad Core GPU

 Android Java OpenGL Benchmark 09-Aug-2013 09.42

           --------- Frames Per Second --------
 Triangles WireFrame   Shaded  Shaded+ Textured
 
   9000+      39.13    41.52    32.19    27.25
  18000+      22.03    20.73    19.69    16.30
  36000+      12.24    12.23    10.75     8.68

      Screen Pixels 1920 Wide 1032 High

      Total Elapsed Time  120.8 seconds



 ########################## T15 #########################

 T15  HTC Nexus 9, dual core Denver CPU 2400 MHz, Android 5.0.1 
              Kepler DX1 Graphics

 Android Java OpenGL Benchmark 28-Jan-2015 22.38

           --------- Frames Per Second --------
 Triangles WireFrame   Shaded  Shaded+ Textured

   9000+      59.79    59.84    59.84    57.79
  18000+      59.97    59.26    52.64    32.74
  36000+      31.33    30.95    29.02    17.59

      Screen Pixels 2048 Wide 1440 High

      Total Elapsed Time  121.0 seconds


 ########################## T21 #########################

 T21 Quad Core 2.2 GHz Snapdragon 800, Android 4.4.3
         GPU Qualcomm Adreno 330, 578 MHz

 Android Java OpenGL Benchmark 27-Jul-2015 16.50

           --------- Frames Per Second --------
 Triangles WireFrame   Shaded  Shaded+ Textured

   9000+      35.05    35.45    25.60    21.58
  18000+      18.04    18.05    15.32    12.73
  36000+       9.28     9.33     8.47     6.91

      Screen Pixels 1200 Wide 1803 High

      Total Elapsed Time  120.8 seconds


 ########################## T22 #########################

  T22 1.3 GHz quad core 64 bit MediaTek ARM Cortex-A53
         Android 5.0, GPU Mali T720  MP2

 Android Java OpenGL Benchmark 26-Aug-2015 16.24

           --------- Frames Per Second --------
 Triangles WireFrame   Shaded  Shaded+ Textured

    9000+      22.55    22.11    16.67    14.27
   18000+      11.55    11.60     9.98     8.27
   36000+       5.92     5.98     5.48     4.48

       Screen Pixels 800 Wide 1216 High

       Total Elapsed Time  120.9 seconds

  


To Start


OpenGL Drawing Benchmark - JavaDraw.apk

This all Java benchmark uses small to rather excessive simple objects to measure drawing performance, again via Frames Per Second (FPS). Five tests draw on a background of continuously changing colour shades. The image on the right is after four tests.

  • Test 1 loads a PNG file twice, the bitmaps moving for each frame, one right to left and back, the other circling.

  • Plus Test 2 generates 2 SweepGradient multi-coloured circles moving towards the centre and back.

  • Plus Test 3 draws 200 random small circles in the middle of the screen (mainly hidden on a small screen).

  • Plus Test 4 draws 80 lines from the centre of each side to the opposite side, again with changing colours.

  • Plus Test 5 draws the same small random circles as Test 3 but with 4000, filling the screen.

  • Each test runs for approximately 10 seconds.

Further details and results can be found here, that includes links to an off line version that runs on PCs via Windows and Linux.


OpenGL Screen

As with Java OpenGL, speeds are limited to 60 FPS by imposed VSYNC. In general, there was not a great deal of differences in performance on the initial systems shown here. In the cases of Android upgrades to version 5. it was virtually identical to tablet A1 but T7 speed was much faster on the tests least dependent on CPU speed.

March 2016 - Results from W1, the Windows 10 based tablet, indicate that VSYNC is not imposed, producing the fastest speeds at this time. Windows/Android dual boot tablet W2/A5, confirms the faster Windows performance (via Java). However, the android version runs at full screen, as opposed to a fixed 1280 x 720 with the Windows variety. The latter was recompiled to use full screen, producing much slower speeds (see below). Windows results from the PC, with a reasonably powerful graphics card, are also shown, to reflect the huge difference in performance.

A5 and W2 Dual Boot Tablet - At Screen pixels 2048 x 1440, the Windows speed is slower than via Android, on the first test, but faster on others. A second test on W2, at 1280 x 720, demonstrates faster speed using a smaller window.


 ########################## T7 ##########################

 T7 Nexus 7 Quad 1200 MHz Cortex-A9, Android 4.2.1
   nVidia ULP GeForce Graphics 12 core, 416 MHz

 Android Java Drawing Benchmark 12-Apr-2013 19.50

 Test                            Frames     FPS

 Display PNG Bitmap Twice          204    20.38
 Plus 2 SweepGradient Circles      165    16.48
 Plus 200 Random Small Circles     145    14.50
 Plus 320 Long Lines               113    11.30
 Plus 4000 Random Small Circles     39     3.81

      Screen pixels 1280 Wide 736 High

      Total Elapsed Time   50.4 seconds

     Maximum 19.2 Million Pixels Per Second


 #################### T7 Android 5.0 ####################

 Android Java Drawing Benchmark 01-Oct-2015 12.24

 Test                            Frames     FPS

 Display PNG Bitmap Twice          487    48.70
 Plus 2 SweepGradient Circles      297    29.66
 Plus 200 Random mall Circles      231    23.02
 Plus 320 Long Lines               149    14.85
 Plus 4000 Random Small Circles     39     3.90

      Screen pixels 1280 Wide 736 High

      Total Elapsed Time   50.1 seconds


 ########################## T11 #########################

 T11 Samsung EXYNOS 5250 2.0 GHz Cortex-A15, Android 4.2.2
          Mali-T604 quad core GPU

 Android Java Drawing Benchmark 09-Aug-2013 09.39

 Test                            Frames     FPS

 Display PNG Bitmap Twice          558    55.74
 Plus 2 SweepGradient Circles      277    27.66
 Plus 200 Random Small Circles     244    24.36
 Plus 320 Long Lines               169    16.84
 Plus 4000 Random Small Circles     68     6.72

      Screen pixels 1920 Wide 1032 High

      Total Elapsed Time   50.4 seconds

    Maximum 110 Million Pixels Per Second       


 ########################## T21 #########################

 T21 2.2 GHz Quad Core Snapdragon 800, Android 4.4.3
         GPU Qualcomm Adreno 330, 578 MHz

 Android Java Drawing Benchmark 27-Jul-2015 16.47

 Test                            Frames     FPS

 Display PNG Bitmap Twice          533    53.24
 Plus 2 SweepGradient Circles      248    24.73
 Plus 200 Random Small Circles     218    21.72
 Plus 320 Long Lines               158    15.75
 Plus 4000 Random Small Circles     57     5.61

      Screen pixels 1200 Wide 1803 High

      Total Elapsed Time   50.3 seconds


 ########################## T22 #########################

 T22 1.3 GHz quad core 64 bit MediaTek ARM Cortex-A53
       Android 5.0, GPU Mali T720  MP2

 Android Java Drawing Benchmark 26-Aug-2015 16.21

 Test                            Frames     FPS

 Display PNG Bitmap Twice          558    55.72
 Plus 2 SweepGradient Circles      368    36.70
 Plus 200 Random Small Circles     286    28.52
 Plus 320 Long Lines               178    17.76
 Plus 4000 Random Small Circles     50     4.99

      Screen pixels 800 Wide 1216 High

      Total Elapsed Time   51.5 seconds


 ########################## A1 ##########################

 A1 Asus MemoPad 7, Quad Core 1.86 GHz Intel Atom Z3745
         Intel HD Graphics, Android 4.4.2 

 Android Java Drawing Benchmark 21-Dec-2014 16.35

 Test                            Frames     FPS

 Display PNG Bitmap Twice          599    59.79
 Plus 2 SweepGradient Circles      486    48.55
 Plus 200 Random Small Circles     383    38.25
 Plus 320 Long Lines               219    21.88
 Plus 4000 Random Small Circles     64     6.38

      Screen pixels 1280 Wide 736 High

      Total Elapsed Time   50.1 seconds

 #################### A1 Android 5.0 ####################

  Android Java Drawing Benchmark 10-Oct-2015 13.42

 Test                            Frames     FPS

 Display PNG Bitmap Twice          595    59.40
 Plus 2 SweepGradient Circles      458    45.79
 Plus 200 Random Small Circles     383    38.27
 Plus 320 Long Lines               199    19.81
 Plus 4000 Random Small Circles     56     5.60

      Screen pixels 800 Wide 1216 High

      Total Elapsed Time   50.1 seconds


 #################### A5 Android 5.1 ####################
 Same Tablet as W2

 Teclast X98 Plus, Intel Atom Z8300 1.44 GHz, Turbo 1.84
         Intel HD Graphics, Android 5.1

 Android Java Drawing Benchmark 02-Mar-2016 17.37

 Test                            Frames     FPS

 Display PNG Bitmap Twice          447    44.62
 Plus 2 SweepGradient Circles      212    21.12
 Plus 200 Random Small Circles     171    17.02
 Plus 320 Long Lines                93     9.25
 Plus 4000 Random Small Circles     32     3.13

      Screen pixels 2048 Wide 1440 High

      Total Elapsed Time   50.4 seconds


 ############## W1 Windows 10 1280 x 720 ##############

   Intel Atom Z8300 quad core 1.44 GHz Turbo 1.84
      Windows 10, Intel HD Graphics Gen8

   Java Drawing Benchmark, Dec 27 2015, 21:51:45
            Produced by javac 1.7.0_2

  Test                              Frames      FPS

  Display PNG Bitmap Twice Pass 1      872    87.13
  Display PNG Bitmap Twice Pass 2      991    98.95
  Plus 2 SweepGradient Circles         961    95.98
  Plus 200 Random Small Circles        782    78.08
  Plus 320 Long Lines                  605    60.44
  Plus 4000 Random Small Circles       164    16.32

         Total Elapsed Time  60.1 seconds

  Operating System    Windows 10, Arch. x86, Version 10.0
  Java Vendor         Oracle Corporation, Version  1.8.0_66
  Intel64 Family 6 Model 76 Stepping 3, GenuineIntel, 4 CPUs


 ############## W2 Windows 10 1280 x 720 ##############
 Same Tablet as A5

 Teclast X98 Plus, Intel Atom Z8300 1.44 GHz, Turbo 1.84
       Windows 10, Intel HD Graphics Gen8

   Java Drawing Benchmark, Mar 2 2016, 21:30:58
            Produced by javac 1.7.0_2

  Test                              Frames      FPS

  Display PNG Bitmap Twice Pass 1      748    74.78
  Display PNG Bitmap Twice Pass 2      833    83.24
  Plus 2 SweepGradient Circles         828    82.78
  Plus 200 Random Small Circles        690    68.99
  Plus 320 Long Lines                  560    55.94
  Plus 4000 Random Small Circles       163    16.30

         Total Elapsed Time  60.0 seconds

  Operating System    Windows 10, Arch. x86, Version 10.0
  Java Vendor         Oracle Corporation, Version  1.8.0_66
  Intel64 Family 6 Model 76 Stepping 3, GenuineIntel, 4 CPUs

 ############ W2 Windows 10 2048 x 1440 #############

   Java Drawing Benchmark, Mar 3 2016, 12:22:42
     Produced by javac 1.7.0_2 2048 x 1440

  Test                              Frames      FPS

  Display PNG Bitmap Twice Pass 1      275    27.42
  Display PNG Bitmap Twice Pass 2      301    30.01
  Plus 2 SweepGradient Circles         296    29.54
  Plus 200 Random Small Circles        286    28.51
  Plus 320 Long Lines                  225    22.45
  Plus 4000 Random Small Circles       118    11.72

         Total Elapsed Time  60.3 seconds

  Operating System    Windows 10, Arch. x86, Version 10.0
  Java Vendor         Oracle Corporation, Version  1.8.0_66
  Intel64 Family 6 Model 76 Stepping 3, GenuineIntel, 4 CPUs


 ########### PC Windows 10 GeForce GTX 650 ###########
               Core i7-4820K at 3.9 GHz 

   Java Drawing Benchmark, Mar 7 2016, 10:56:24
     Produced by javac 1.7.0_2 2048 x 1440

  Test                              Frames      FPS

  Display PNG Bitmap Twice Pass 1     5237   523.39
  Display PNG Bitmap Twice Pass 2     5477   547.04
  Plus 2 SweepGradient Circles        5484   548.07
  Plus 200 Random Small Circles       5144   513.58
  Plus 320 Long Lines                 4736   473.32
  Plus 4000 Random Small Circles       735    73.49

         Total Elapsed Time  60.0 seconds

  Operating System    Windows 10, Arch. x86, Version 10.0
  Java Vendor         Oracle Corporation, Version  1.8.0_60
  Intel64 Family 6 Model 62 Stepping 4, GenuineIntel, 8 CPUs
 

To Start


CPU MHz Benchmark - CP_MHz2.apk

This program measures CPU MHz samples over 30 seconds, with 300 reports at 100 millisecond intervals (timing functions and overheads increase this time to 120 ms or above). The procedures are open a benchmark, open MHz program and run, switch in benchmark from recent screens and run, save benchmark results, switch in MHz program from recent screens and save results when finished. Further details and results can be found here. and here

Below is an example, over the first 18 seconds, whilst running NEON-MFLOPS-MP benchmark (taking 14.6 seconds). In this case, MHz is fairly constant, but the frequency can vary a lot on other devices, or might run at a constant low value, if power saving is switched on.



  T22, Quad Core ARM Cortex-A53 1300 MHz, Android 5.0.2 

 ARM/Intel NEON-MFLOPS2-MP Benchmark V2.2 15-Nov-2015 17.21
            Compiled for 64 bit ARM v8a

     FPU Add & Multiply using 1, 2, 4 and 8 Threads
         2 Ops/Word              32 Ops/Word
 KB     12.8     128   12800    12.8     128   12800
 MFLOPS
 1T      785     771     669    2862    2851    2739
 2T     1485    1499     895    5654    5729    5606
 4T     1937    2074     995   10862   11024   10636
 8T     2678    3021    1012    9971   10730   10534

        Total Elapsed Time   14.6 seconds

  Android CPU MHz 100 ms Sampling 15-Nov-2015 17:21:45

  0.00   1300   0.12   1300   0.23    299   0.38    299   0.53   1300
  0.67   1300   0.88   1300   1.01   1300   1.15   1300   1.29   1300
  1.43   1300   1.57   1300   1.70   1300   1.83   1300   1.97   1300
  2.11   1300   2.25   1300   2.39   1300   2.53   1300   2.67   1300 X
  2.81   1300   2.95   1300   3.09   1300   3.22   1300   3.36   1300
  3.49   1300   3.63   1300   3.76   1300   3.90   1300   4.04   1300
  4.18   1235   4.30   1300   4.42   1300   4.59   1300   4.76   1300
  4.91   1300   5.08   1300   5.23    819   5.40   1300   5.55    299 X
  5.74   1300   5.91   1300   6.09   1300   6.25   1300   6.41   1300
  6.59   1300   6.76   1300   6.92   1300   7.08   1300   7.24   1300
  7.40   1300   7.52   1300   7.68    299   7.88    442   8.06   1300
  8.24   1300   8.40    819   8.56   1300   8.71   1300   8.86   1300 X
  9.01   1300   9.16   1300   9.32   1300   9.48   1300   9.64   1300
  9.80   1300   9.97   1300  10.13   1300  10.27   1300  10.43   1300
 10.57   1300  10.72   1300  10.88   1300  11.02   1300  11.17   1300
 11.33   1300  11.47   1300  11.62   1300  11.78   1300  11.92   1300 X
 12.07   1300  12.22   1300  12.37   1300  12.53   1300  12.68   1300
 12.84   1300  12.99   1300  13.15   1300  13.30   1300  13.46   1300
 13.61   1300  13.76   1300  13.92   1300  14.08   1300  14.24   1300
 14.40   1300  14.56   1300  14.72   1300  14.88   1300  15.04   1300 X
 15.20   1300  15.36   1300  15.52   1300  15.69   1300  15.86   1300
 16.02   1300  16.21   1300  16.39   1300  16.55   1300  16.71   1300
 16.85   1300  17.00   1300  17.14   1300  17.30   1300  17.45   1300
 17.60   1300  17.75   1300  17.91   1300  18.05   1300  18.21   1300 
  

To Start


Battery Drain Test - BatteryTest.apk

The program runs the second most demanding OpenGL drawing benchmark test except CPU MHz is displayed, along with Frames Per Second (FPS) and running time in minutes, the MHz figure being the average of one measurement per frame. Default running arrangements are 60 passes of one second each, producing two columns of results that are displayed and saved on the Internal Drive. The CPU MHz is the average of samples taken once per frame. These results are to demonstrate any reductions as the battery capacity reduces. Before running, Display/Power Settings should be changed to never switch off and CPU to run at maximum speed, if possible.

Three buttons are provided where, besides the Run and usual Email option, to save results, there is a Time button, enabling manual input of the number of seconds for each pass. After rebooting, following a flat battery turning the device off, and after recharging, restarting the program reads and displays the saved results, ready for E-mailing. NOTE: some Android versions will not open a log file for saving results.

Following are results from a test set to run for 2 hours (60 x 120 seconds) and run twice. Displayed MHz, whilst the test was running, showed rapid variations that affected the final speed and FPS had a similar variation, but these were fairly constant over 4 hours.


 T21 Quad Core Qualcomm Snapdragon 800, Android 4.4.3
         GPU Qualcomm Adreno 330, 578 MHz

 Up to 60 120 second runs, MHz 1 sample/frame
 Log File /storage/emulated/0/BatteryTest.txt

 Android Battery Test 28-Jul-2015 11.08  28-Jul-2015 13.28

Run    FPS   MHz   Run    FPS   MHz     Run    FPS   MHz   Run    FPS   MHz
/b>
  1   12.0  2100     2   12.1  1937       1   12.1  2014     2   12.2  2005
  3    7.6  1874     4   12.1  1966       3   12.0  1975     4   12.0  2005
  5   12.2  1993     6   12.2  1996       5   12.1  1962     6   12.1  1948
  7   12.2  1996     8   12.2  1966       7   12.2  1979     8   12.2  2004
  9   12.0  1935    10   12.3  1925       9   12.1  1959    10   12.1  2060
 11   12.3  1983    12   12.0  2015      11   12.0  2017    12   12.2  1992
 13   11.9  2013    14   12.1  2000      13   12.1  1987    14   12.2  1964
 15   12.1  1934    16   12.1  2005      15   12.1  1973    16   12.1  1978
 17   12.0  1948    18   12.0  2000      17   12.1  1998    18   12.1  1977
 19   11.9  1979    20   12.0  1972      19   12.0  2007    20   12.0  1956
 21   12.0  1997    22   12.0  1994      21   11.9  1966    22   12.1  1975
 23   12.2  2035    24   12.1  2013      23   12.0  1978    24   12.0  2012
 25   12.2  1981    26   12.1  1977      25   12.1  1988    26   12.1  2010
 27   12.2  1976    28   12.2  1991      27   12.1  2004    28   12.0  1994
 29   12.2  2000    30   12.3  1984      29   12.1  1989    30   12.2  2004
 31   12.2  1986    32   12.3  1964      31   12.2  2009    32   12.1  1979
 33   12.2  1955    34   12.1  1980      33   12.1  1945    34   12.0  1951
 35   12.1  2002    36   12.2  2045      35   12.1  1997    36   12.1  2022
 37   12.1  1993    38   12.2  2010      37   12.2  2038    38   12.1  2024
 39   12.1  1947    40   12.1  1959      39   12.1  1997    40   12.1  2049
 41   11.9  1949    42   12.0  1993      41   12.2  1996    42   12.1  1994
 43   12.1  1953    44   12.2  2005      43   12.0  1978    44   12.0  1985
 45   12.4  1928    46   12.3  1989      45   11.9  1947    46   12.2  1982
 47   12.1  1987    48   12.1  1969      47   12.1  2022    48   11.8  1964
 49   12.1  1992    50   12.1  1999      49   11.9  1985    50   12.1  1991
 51   12.2  1929    52   12.1  1955      51   12.1  1988    52   12.0  2002
 53   12.4  1950    54   12.3  1990      53   12.0  2009    54   11.9  2018
 55   12.3  1930    56   12.2  1922      55   12.0  1994    56   12.0  1974
 57   12.4  1952    58   12.1  1977      57   12.1  1950    58   12.1  2009
 59   12.1  1986    60   12.2  1962      59   12.0  1976    60   12.0  2010

 Total Elapsed Time  7614.9 seconds      Total Elapsed Time  7202.9 seconds
  

To Start


DriveSpeed Benchmarks - DriveSpeed.apk, DriveSpd2.apk, drivespeed32.exe

This is primarily intended for measuring performance of SD cards and internal drives, but can also be used to test USB drives. DriveSpeed carries out four tests.

Test 1 - Write and read three 8 and 16 MB; Results given in MBytes/second
Test 2 - Write 8 MB, read can be cached in RAM; Results given in MBytes/second
Test 3 - Random write and read 1 KB from 4 to 16 MB; Results are Average time in milliseconds
Test 4 - Write and read 200 files 4 KB to 16 KB; Results in MB/sec, msecs/file and delete seconds.

The first DriveSpeed benchmark has two run buttons, RunS for an SD card and RunI for the internal drive, the file path being identified by standard functions. The external SD test worked on earlier Android tablets but failed on later Android versions. RunS ran but provided distorted reading speeds by caching data in RAM. An extra button was added to prevent large files from being deleted and a read only option to measure uncached speeds after rebooting.

DriveSpd2 requires input of the file path to use and this might be identified using a file browser app. The file path can sometimes be selected for internal drives, SD cards and USB devices but there are complications associated with permissions and caching.

Running these benchmarks can require a lot of experimentation. Lots of paths, results and explanations are here and here. Following are example DriveSpd2 results from T22 ( Lenovo Tab 2 A8-50) testing an external SD card, T11 (Voyo A15) from a USB 3 flash drive and read only benchmark results.

Intel/Windows Versions - Results for Tablet W1 main drive are below, with USB 3 and SD card speeds Here along with some via Windows and Linux.


 ########################## T22 #########################

 T22 1.3 GHz quad core 64 bit MediaTek ARM Cortex-A53

 Android DriveSpeed2 Benchmark 1.0 28-Aug-2015 12.56
                 Data Not Cached

                      MBytes/Second
   MB    Write1 Write2 Write3  Read1  Read2  Read3
    8       3.7    3.7    3.6   20.3   20.6   20.4
   16       2.6    3.7    3.7   20.5   20.5   20.5
 Cached
    8      52.4  107.8   13.2  228.8  226.3  226.7

 Random      Write                Read
 From MB     4      8     16      4      8     16
 msecs    4.65   4.91  18.23   0.01   0.01   0.66

 200 Files   Write                Read            Delete
 File KB     4      8     16      4      8     16   secs
 MB/sec   0.07   0.18   0.49   2.16   3.79   6.51
 msecs   59.14  44.59  33.61   1.90   2.16   2.52  2.099

        Total Elapsed Time   85.4 seconds
        File Path Used - /storage/sdcard1/
          Drive MB   15258 Free   14687


 ########################## T11 #########################

 T11 Samsung EXYNOS 5250 Dual 1.7 GHz Cortex-A15,

 Android DriveSpeed2 Benchmark 1.0 10-Dec-2013 12.52
            Data Not Cached

                      MBytes/Second
   MB    Write1 Write2 Write3  Read1  Read2  Read3
    8      40.9   46.6   46.2  100.7   95.9   71.4
   16      45.2   51.9   51.1   98.8   70.7   66.2
  Cached
    8     150.4  127.7   50.9  687.6  688.7  709.2

 Random      Write                Read
 From MB     4      8     16      4      8     16
 msecs    0.91   0.90   0.82   0.01   0.01   0.02

 200 Files   Write                Read            Delete 
 File KB     4      8     16      4      8     16   secs 
 MB/sec   0.56   1.18   1.85   4.20  13.33  34.79  
 msecs    7.29   6.96   8.88   0.98   0.61   0.47  0.149

       Total Elapsed Time   24.8 seconds
       File Path Used - /mnt/udisk/
         Drive MB   30517 Free   30466


 ###################### Read Only #######################

 Android DriveSpeed Benchmark Internal Drive Read Only

                     MBytes/Second
 Device  Write1 Write2 Write3  Read1  Read2  Read3

  T7        0.0    0.0    0.0   41.7   42.8   39.0
  T11       0.0    0.0    0.0   53.7   53.5   53.9
  T21       0.0    0.0    0.0  102.9  104.0  103.6
  T22       0.0    0.0    0.0  127.7  145.7  139.9
  A1        0.0    0.0    0.0  155.7  128.6  156.2


 ################## W1 DriveSpeed32.exe W1 Windows 10 #################

 Current Directory Path: C:\Test
 Total MB   58722, Free MB   45286, Used MB   13436

 Windows Storage Speed Test 32-Bit Version 1.2, Mon Jan 04 16:09:25 2016

                Copyright (C) Roy Longbottom 2011

     8 MB File         1          2          3          4          5
 Writing MB/sec     100.68     101.04     110.81     105.04     113.32
 Reading MB/sec     154.58     155.78     132.18     153.97     153.86

    16 MB File         1          2          3          4          5
 Writing MB/sec     115.96     117.50     118.53     113.16     116.46
 Reading MB/sec     150.29     155.47     156.13     150.62     157.92

    32 MB File         1          2          3          4          5
 Writing MB/sec     118.84     118.26     123.01     123.42     125.39
 Reading MB/sec     146.70     153.65     146.41     148.77     155.54

 ---------------------------------------------------------------------
 8 MB Cached File      1          2          3          4          5
 Writing MB/sec     176.10     292.34     462.14     201.19     452.46
 Reading MB/sec     599.06     830.94     992.19     878.99    1033.57

 ---------------------------------------------------------------------
 Bus Speed Block KB     64        128        256        512       1024
 Reading MB/sec     101.09     107.71     123.43     139.70     136.62

 ---------------------------------------------------------------------
 1 KB Blocks File MB >    2      4      8     16     32     64    128
 Random Read  msecs    0.22   0.18   0.18   0.18   0.18   0.19   0.19
 Random Write msecs    0.13   0.13   0.13   0.13   0.14   0.19   0.21

 ---------------------------------------------------------------------
 500 Files   Write             Read             Delete
 File KB     MB/sec  ms/File   MB/sec  ms/File  Seconds
       2       0.56     3.68     3.00     0.68    0.629
       4       0.84     4.85     6.79     0.60    0.541
       8       1.92     4.27    13.34     0.61    0.502
      16       1.01    16.17    22.14     0.74    0.528
      32       1.95    16.81    38.21     0.86    0.527
      64       3.75    17.50    59.57     1.10    0.490

              End of test Mon Jan 04 16:10:53 2016
  


To Start

System Details




 T7      Nexus 7 quad core CPU 1.3, GHz 1.2 GHz > 1 core
         Device Asus Nexus 7
         RAM 1 GB DDR3L-1333 Bandwidth 5.3 GB/sec
         Screen pixels w x h 1280 x 736 MHz 
         Twelve-core Nvidia GeForce ULP graphics 416 MHz
         Android Build Version      4.1.2
         Processor : ARMv7 Processor rev 9 (v7l)
         processor : 0  BogoMIPS : 1993.93
         processor : 1  BogoMIPS : 1993.93
         processor : 2  BogoMIPS : 1993.93
         processor : 3  BogoMIPS : 1993.93
         Features  : swp half thumb fastmult vfp edsp neon vfpv3 tls 
         CPU implementer : 0x41
         CPU architecture: 7
         CPU variant     : 0x2
         CPU part        : 0xc09             - Cortex-A9
         CPU revision    : 9
         Hardware        : grouper           - nVidia Tegra 3 T30L
         Revision        : 0000
         Linux version    3.1.10
         Runs at 1.2 GHz

 T11     Voyo A15, Samsung EXYNOS 5250 Dual core 2.0 GHz Cortex-A15, 
         Device Urbetter VOYO A15
         Mali-T604 GPU, 2 GB DDR3-1600 RAM, dual channel, 12.8 GB/s
         Screen pixels w x h 1920 x 1032 
         Android Build Version      4.2.2  - Jelly Bean
         Processor       : ARMv7 Processor rev 4 (v7l)
         processor       : 0
         BogoMIPS        : 992.87
         processor       : 1
         BogoMIPS        : 997.78
         Features        : swp half thumb fastmult vfp edsp neon vfpv3 tls vfpv4
                           idiva idivt 
         CPU implementer : 0x41
         CPU architecture: 7
         CPU variant     : 0x0
         CPU part        : 0xc0f
         CPU revision    : 4
         Hardware        : SMDK5250
         Linux version 3.4.35Ut
         Runs at 1.7 GHz

 T15     HTC Nexus 9, dual core Denver CPU 2400 MHz 
         Screen pixels w x h 2048 x 1440
         Android Build Version      5.0.1
         Processor : NVIDIA Denver 1.0 rev 0 (aarch64)
         processor : 0 & 1
         Features : fp asimd aes pmull sha1 sha2 crc32
         CPU implementer : 0x4e
         CPU architecture: AArch64
         CPU variant : 0x0
         CPU part : 0x000
         CPU revision : 0
         Hardware : Flounder
         Revision : 0000
         MTS version : 33410787
         Linux version 3.10.40

 T21     Kindle Fire HDX 7, 2.2 GHz  Quad Core Qualcomm Snapdragon 800 (Krait 400) 
         2 x 32 Bit LPDDR3-1866 Memory, 14.9 GB/s, GPU Qualcomm Adreno 330, 578 MHz
         Device Amazon KFTHWI
         Screen pixels w x h 1200 x 1803 
         Android Build Version      4.4.3
         Processor       : ARMv7 Processor rev 0 (v7l)
         processor       :  0, 1, 2, 3
         BogoMIPS        : 38.40
         Features        : swp half thumb fastmult vfp edsp neon vfpv3 tls vfpv4 idiva idivt 
         CPU implementer : 0x51
         CPU architecture: 7
         CPU variant     : 0x2
         CPU part        : 0x06f
         CPU revision    : 0
         Hardware        : Qualcomm MSM8974
         Revision        : 0000
         Linux version 3.4.0-perf (gcc version 4.7)  

 T22     Lenovo Tab 2 A8-50, 1.3 GHz quad core 64 bit MediaTek ARM Cortex-A53 
         1 GB LPDDR3, GPU Mali T720  MP2
         Device LENOVO Lenovo TAB 2 A8-50F
         Screen pixels w x h 800 x 1216
         Android Build Version      5.0.2
         Processor : AArch64 Processor rev 3 (aarch64)
         processor : 0, 1, 2
         BogoMIPS  : 26.0
         Features : fp asimd aes pmull sha1 sha2 crc32
         CPU implementer : 0x41
         CPU architecture: AArch64
         CPU variant : 0x0
         CPU part : 0xd03
         CPU revision : 3
         Hardware : MT8161
         Linux version 3.10.65 

 P33     Sony Xperia Z3+ E6533, Quad-core 1.5 GHz & Quad-core 2 GHz Qualcomm
         Snapdragon 810 64-bit CPU
         Screen pixels w x h 1080 x 1776
         Android Build Version      5.0.2
         Processor : AArch64 Processor rev 1 (aarch64)
         processor : 0 to 7
         Features : fp asimd evtstrm aes pmull sha1 sha2 crc32
         CPU implementer : 0x41
         CPU architecture: 8
         CPU variant : 0x1
         CPU part : 0xd07
         CPU revision : 1
         Hardware : Qualcomm Technologies, Inc MSM8994
         Linux version 3.?10.?49

 P36     LGE LG-H811 Qualcomm Snapdragon 808, 1.8 GHz 64-bit Hexa-Core 
         Device LGE LG-H811
         Screen pixels w x h 1440 x 2392
         Android Build Version      5.1
         Processor : AArch64 Processor rev 2 (aarch64)
         processor : 0, 1, 2, 3, 4, 5
         Features : fp asimd evtstrm aes pmull sha1 sha2 crc32
         CPU implementer : 0x41
         CPU architecture: 8
         CPU variant : 0x1
         CPU part : 0xd07
         CPU revision : 2
         Hardware : Qualcomm Technologies, Inc MSM8992
         Revision : 000b
         Linux version 3.10.49-

 A1      Asus MemoPad 7 ME176CEX, 1.86 GHz Atom Intel Atom Z3745 
         Device Asus K013
         Screen pixels w x h 800 x 1216
         Android Build Version      4.4.2
         Processor : ARMv7 processor rev 1 (v7l)
         BogoMIPS : 1500.0
         Features : neon vfp swp half thumb fastmult edsp vfpv3
         CPU implementer : 0x69
         CPU architecture: 7
         CPU variant : 0x1
         CPU part : 0x001
         CPU revision : 1
         Hardware : placeholder
         Revision : 0001
         Linux version 3.10.20
         Mainly runs at 1.86 GHz Turbo Boost

 A4      Intel(R) Atom x5-Z8300 1.84 GHz (turbo) 
         Device Intel cht_cr_rvp
         Screen pixels w x h 800 x 1216
         Android Build Version      5.1.1
         : 6 initial apicid : 6
         fpu : yes
         fpu_exception : yes
         cpuid level : 11
         wp : yes
         flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat
          pse36 clflush dts acpi mmx fxsr sse sse2 ss ht tm pbe syscall nx rdtscp lm
          constant_tsc arch_perfmon pebs bts rep_good nopl xtopology nonstop_tsc
          aperfmperf nonstop_tsc_s3 pni pclmulqdq dtes64 monitor ds_cpl vmx est tm2
          ssse3 cx16 xtpr pdcm sse4_1 sse4_2 movbe popcnt tsc_deadline_timer aes
          rdrand lahf_lm 3dnowprefetch ida arat epb dtherm tpr_shadow vnmi
          flexpriority ept vpid tsc_adjust smep erms
         bogomips : 2879.90
         clflush size : 64
         cache_alignment : 64
         address sizes : 36 bits physical, 48 bits virtual
         power management:
         processor : 3
         vendor_id : GenuineIntel
         cpu family : 6
         model : 76
         model name : Intel(R) Atom(TM) x5-Z8300  CPU @ 1.44GHz
         stepping : 3
         microcode : 0x358
         cpu MHz : 1840.000
         cache size : 1024 KB
         physical id : 0
         siblings : 4
         core id : 3
         cpu cores : 4
         apicid
         Linux version 3.14.37 


 A5      Same tablet as W2 - Intel Atom Z8300 1.44 GHz, Turbo 1.84
         Device Teclast X98 Plus(A5C8)
         Screen pixels w x h 2048 x 1440
         Android Build Version      5.1
         Processor : ARMv7 processor rev 1 (v7l)
         BogoMIPS : 1500.0
         Features : neon vfp swp half thumb fastmult edsp vfpv3 vfpv4 idiva idivt
         CPU implementer : 0x69
         CPU architecture: 7
         CPU variant : 0x1
         CPU part : 0x001
         CPU revision : 1
         Hardware : placeholder
         Revision : 0001
         Linux version 3.14.37-x86_64-L1-R429

 W1      Pipo W1S Tablet. Intel Z8300 quad core 1.44 GHz Turbo 1.84
         Windows 10, 4 GB DDR 3 1600
         CPU GenuineIntel, Features Code BFEBFBFF, Model Code 000406C3
         Intel(R) Atom(TM) x5-Z8300  CPU @ 1.44GHz Measured 1440 MHz
         Has MMX, Has SSE, Has SSE2, Has SSE3, No 3DNow,
         AMD64 processor architecture, 4 CPUs 
         Windows NT  Version 6.2, build 9200, 
         Memory 4020 MB, Free 2520 MB

 W2      Same tablet as A5
         Teclast X98 Plus, Intel Atom Z8300 1.44 GHz, Turbo 1.84
         CPU GenuineIntel, Features Code BFEBFBFF, Model Code 000406C3
         Intel(R) Atom(TM) x5-Z8300  CPU @ 1.44GHz Measured 1440 MHz
         Has MMX, Has SSE, Has SSE2, Has SSE3, No 3DNow,
         Intel processor architecture, 4 CPUs 
         Windows NT  Version 6.2, build 9200, 
         Memory 4021 MB, Free 2540 MB
         User Virtual Space 4096 MB, Free 4083 MB
         64 Bit
         AMD64 processor architecture, 4 CPUs
         User Virtual Space 134217728 MB, Free 134217716 MB

 PC      Core i7 4820K quad core + HT at 3900 MHz Turbo
         CPU GenuineIntel, Features Code BFEBFBFF, Model Code 000306E4
         Intel(R) Core(TM) i7-4820K CPU @ 3.70GHz Measured 3711 MHz
         Has MMX, Has SSE, Has SSE2, Has SSE3, No 3DNow,
         AMD64 processor architecture, 8 CPUs 
         Windows NT  Version 6.2, build 9200, 
         Memory 32705 MB, Free 30584 MB
         User Virtual Space 134217728 MB, Free 134217715 MB
   

To Start


Roy Longbottom at Linkedin  Roy Longbottom May 2016



The Official Internet Home for my Benchmarks is via the link
Roy Longbottom's PC Benchmark Collection