Last active
November 6, 2023 19:20
-
-
Save dougallj/2764fe3d70955858c45ab11910b3dd21 to your computer and use it in GitHub Desktop.
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Raw data. These were dumped from iPhones/iPads using wall-timers, not | |
perf-counters. They contain some likely issues and inconsistencies that | |
haven't been fully investigated. Mostly correct, but it's worth | |
double-checking anything odd. (For example, "TBL (two register table)" | |
can have better throughput than is listed sometimes, as can some other | |
three-operand SIMD things iirc.) | |
The goal is to find the fastest rate at which an instruction can run. If | |
there are multiple rows with the same label, the "correct" value is the | |
minimum. For example: | |
ADCS (64-bit) 1->2 1.000200 | |
ADCS (64-bit) 1->3 1.001804 | |
ADCS (64-bit) 1->4 1.000273 | |
ADCS (64-bit) 4->2 0.998992 | |
ADCS (64-bit) 4->3 1.003760 | |
ADCS (64-bit) 4->4 1.000002 | |
ADCS (64-bit) throughput 0.668505 | |
ADCS (64-bit) throughput 0.334375 | |
The first six rows show the latencies from each output operand to each | |
input operand. In this case, one-cycle (plus some noise), regardless of | |
operand, so ADCS has 1c latency. | |
The last two rows have the same label. This is because the script used | |
two different methods to try to measure the throughput. In that case, | |
take the minimum. Here, that's 0.334375 (3 operations-per-cycle, plus | |
some noise). | |
Details of the experiments typically will match the experiments linked at: | |
https://dougallj.github.io/applecpu/firestorm-int.html | |
For some more useful notes on A15/A16, see: | |
https://twitter.com/dougallj/status/1534002276091629569 | |
https://twitter.com/dougallj/status/1581109455269556224 | |
Some operations added since A14/M1 are listed in a separate file at the top. | |
(Not for any particular reason.) I recall checking the uop counts using | |
Instruments.app, but the results weren't surprising, just throughput/0.25. |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
USDOT (vector, 8B) 1->1 3.011408 | |
USDOT (vector, 8B) 1->2 2.999861 | |
USDOT (vector, 8B) 1->3 3.000007 | |
USDOT (vector, 8B) throughput 0.250289 | |
USDOT (vector, 8B) throughput 0.249937 | |
USDOT (vector, 16B) 1->1 3.011412 | |
USDOT (vector, 16B) 1->2 3.011416 | |
USDOT (vector, 16B) 1->3 3.005572 | |
USDOT (vector, 16B) throughput 0.250307 | |
USDOT (vector, 16B) throughput 0.250705 | |
USDOT (by element, 8B) 1->1 2.999560 | |
USDOT (by element, 8B) 1->2 2.999421 | |
USDOT (by element, 8B) 1->3 2.999712 | |
USDOT (by element, 8B) throughput 0.250379 | |
USDOT (by element, 8B) throughput 0.250677 | |
USDOT (by element, 16B) 1->1 2.999133 | |
USDOT (by element, 16B) 1->2 3.011844 | |
USDOT (by element, 16B) 1->3 3.000281 | |
USDOT (by element, 16B) throughput 0.251697 | |
USDOT (by element, 16B) throughput 0.250786 | |
SUDOT (by element, 8B) 1->1 2.999716 | |
SUDOT (by element, 8B) 1->2 3.011997 | |
SUDOT (by element, 8B) 1->3 3.012864 | |
SUDOT (by element, 8B) throughput 0.250307 | |
SUDOT (by element, 8B) throughput 0.250786 | |
SUDOT (by element, 16B) 1->1 2.999560 | |
SUDOT (by element, 16B) 1->2 2.999851 | |
SUDOT (by element, 16B) 1->3 3.000149 | |
SUDOT (by element, 16B) throughput 0.250362 | |
SUDOT (by element, 16B) throughput 0.250749 | |
SMMLA (vector) 1->1 6.012858 | |
SMMLA (vector) 1->2 6.011124 | |
SMMLA (vector) 1->3 6.013146 | |
SMMLA (vector) throughput 0.501499 | |
SMMLA (vector) throughput 0.526893 | |
UMMLA (vector) 1->1 6.012425 | |
UMMLA (vector) 1->2 6.014019 | |
UMMLA (vector) 1->3 6.011566 | |
UMMLA (vector) throughput 0.501572 | |
UMMLA (vector) throughput 0.533232 | |
USMMLA (vector) 1->1 6.013001 | |
USMMLA (vector) 1->2 6.024128 | |
USMMLA (vector) 1->3 6.013001 | |
USMMLA (vector) throughput 0.501571 | |
USMMLA (vector) throughput 0.524005 | |
BFCVT (scalar) 1->2 3.000572 | |
BFCVT (scalar) throughput 0.250072 | |
BFCVTN (vector, 4H) 1->2 2.998987 | |
BFCVTN (vector, 4H) throughput 0.249982 | |
BFCVTN2 (vector, 8H) 1->1 3.011275 | |
BFCVTN2 (vector, 8H) 1->2 3.011997 | |
BFCVTN2 (vector, 8H) throughput 0.250397 | |
BFCVTN2 (vector, 8H) throughput 0.250750 | |
BFDOT (by element, 2S) 1->1 3.000295 | |
BFDOT (by element, 2S) 1->2 10.022082 | |
BFDOT (by element, 2S) 1->3 10.010851 | |
BFDOT (by element, 2S) throughput 0.751477 | |
BFDOT (by element, 2S) throughput 0.752962 | |
BFDOT (by element, 4S) 1->1 3.011704 | |
BFDOT (by element, 4S) 1->2 10.023098 | |
BFDOT (by element, 4S) 1->3 10.057951 | |
BFDOT (by element, 4S) throughput 1.001733 | |
BFDOT (by element, 4S) throughput 1.002404 | |
BFDOT (vector, 2S) 1->1 3.001285 | |
BFDOT (vector, 2S) 1->2 10.016038 | |
BFDOT (vector, 2S) 1->3 10.024926 | |
BFDOT (vector, 2S) throughput 0.751645 | |
BFDOT (vector, 2S) throughput 0.746704 | |
BFDOT (vector, 4S) 1->1 3.000149 | |
BFDOT (vector, 4S) 1->2 10.015276 | |
BFDOT (vector, 4S) 1->3 10.013981 | |
BFDOT (vector, 4S) throughput 1.002420 | |
BFDOT (vector, 4S) throughput 1.001788 | |
BFMLALB 1->1 4.010544 | |
BFMLALB 1->2 4.003002 | |
BFMLALB 1->3 4.005643 | |
BFMLALB throughput 0.250342 | |
BFMLALB throughput 0.250740 | |
BFMLALT 1->1 4.000135 | |
BFMLALT 1->2 4.011854 | |
BFMLALT 1->3 4.011854 | |
BFMLALT throughput 0.250307 | |
BFMLALT throughput 0.250823 | |
BFMMLA 1->1 6.011416 | |
BFMMLA 1->2 13.012826 | |
BFMMLA 1->3 13.018100 | |
BFMMLA throughput 2.002597 | |
BFMMLA throughput 2.003052 |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
ADC (32-bit) 1->2 1.000328 | |
ADC (32-bit) 1->3 1.065060 | |
ADC (32-bit) 1->4 1.026448 | |
ADC (32-bit) throughput 0.501056 | |
ADC (64-bit) 1->2 1.018105 | |
ADC (64-bit) 1->3 1.000130 | |
ADC (64-bit) 1->4 1.014143 | |
ADC (64-bit) throughput 0.529629 | |
ADCS (32-bit) 1->2 1.000064 | |
ADCS (32-bit) 1->3 1.185754 | |
ADCS (32-bit) 1->4 0.992562 | |
ADCS (32-bit) 4->2 1.010281 | |
ADCS (32-bit) 4->3 1.101900 | |
ADCS (32-bit) 4->4 0.979480 | |
ADCS (32-bit) throughput 1.470419 | |
ADCS (32-bit) throughput 3.098103 | |
ADCS (64-bit) 1->2 1.000459 | |
ADCS (64-bit) 1->3 1.206819 | |
ADCS (64-bit) 1->4 1.137327 | |
ADCS (64-bit) 4->2 1.016239 | |
ADCS (64-bit) 4->3 1.065384 | |
ADCS (64-bit) 4->4 0.993688 | |
ADCS (64-bit) throughput 1.628992 | |
ADCS (64-bit) throughput 1.216955 | |
ADD (sxtb, 32-bit) 1->2 2.305780 | |
ADD (sxtb, 32-bit) 1->3 2.069723 | |
ADD (sxtb, 32-bit) throughput 1.141637 | |
ADD (sxtb, 64-bit) 1->2 2.008446 | |
ADD (sxtb, 64-bit) 1->3 1.996458 | |
ADD (sxtb, 64-bit) throughput 1.026466 | |
ADD (uxtb, 32-bit) 1->2 2.045769 | |
ADD (uxtb, 32-bit) 1->3 2.267341 | |
ADD (uxtb, 32-bit) throughput 1.076846 | |
ADD (uxtb, 64-bit) 1->2 2.248260 | |
ADD (uxtb, 64-bit) 1->3 2.244789 | |
ADD (uxtb, 64-bit) throughput 1.018928 | |
ADD (sxth, 32-bit) 1->2 1.968989 | |
ADD (sxth, 32-bit) 1->3 2.146923 | |
ADD (sxth, 32-bit) throughput 1.025622 | |
ADD (sxth, 64-bit) 1->2 1.638829 | |
ADD (sxth, 64-bit) 1->3 5.096116 | |
ADD (sxth, 64-bit) throughput 1.819292 | |
ADD (uxth, 32-bit) 1->2 2.783994 | |
ADD (uxth, 32-bit) 1->3 2.041513 | |
ADD (uxth, 32-bit) throughput 1.338542 | |
ADD (uxth, 64-bit) 1->2 2.267933 | |
ADD (uxth, 64-bit) 1->3 2.148907 | |
ADD (uxth, 64-bit) throughput 1.123890 | |
ADD (sxtw, 32-bit) 1->2 1.118829 | |
ADD (sxtw, 32-bit) 1->3 1.287855 | |
ADD (sxtw, 32-bit) throughput 0.672615 | |
ADD (sxtw, 64-bit) 1->2 1.981503 | |
ADD (sxtw, 64-bit) 1->3 2.075520 | |
ADD (sxtw, 64-bit) throughput 1.029882 | |
ADD (uxtw, 32-bit) 1->2 1.003474 | |
ADD (uxtw, 32-bit) 1->3 1.005366 | |
ADD (uxtw, 32-bit) throughput 0.530983 | |
ADD (uxtw, 64-bit) 1->2 1.954458 | |
ADD (uxtw, 64-bit) 1->3 2.023042 | |
ADD (uxtw, 64-bit) throughput 0.981864 | |
ADD (uxtx, 64-bit) 1->2 1.011896 | |
ADD (uxtx, 64-bit) 1->3 1.031497 | |
ADD (uxtx, 64-bit) throughput 0.501686 | |
ADD (sxtx, 64-bit) 1->2 1.019828 | |
ADD (sxtx, 64-bit) 1->3 1.002947 | |
ADD (sxtx, 64-bit) throughput 0.506215 | |
ADD (immediate, 32-bit) 1->2 1.005660 | |
ADD (immediate, 32-bit) throughput 0.507280 | |
ADD (immediate, 64-bit) 1->2 1.005872 | |
ADD (immediate, 64-bit) throughput 0.649187 | |
ADD (shifted immediate, 32-bit) 1->2 1.245265 | |
ADD (shifted immediate, 32-bit) throughput 0.454882 | |
ADD (shifted immediate, 64-bit) 1->2 0.862306 | |
ADD (shifted immediate, 64-bit) throughput 0.503656 | |
ADD (register, 32-bit) 1->2 1.003475 | |
ADD (register, 32-bit) 1->3 1.008386 | |
ADD (register, 32-bit) throughput 0.519958 | |
ADD (register, 64-bit) 1->2 0.991370 | |
ADD (register, 64-bit) 1->3 1.000461 | |
ADD (register, 64-bit) throughput 0.497339 | |
ADD (register, lsl, 32-bit) 1->2 2.135982 | |
ADD (register, lsl, 32-bit) 1->3 2.009442 | |
ADD (register, lsl, 32-bit) throughput 1.024767 | |
ADD (register, lsl, 64-bit) 1->2 1.977639 | |
ADD (register, lsl, 64-bit) 1->3 2.055476 | |
ADD (register, lsl, 64-bit) throughput 1.011385 | |
ADD (register, lsr, 32-bit) 1->2 2.035674 | |
ADD (register, lsr, 32-bit) 1->3 1.928535 | |
ADD (register, lsr, 32-bit) throughput 1.010973 | |
ADD (register, lsr, 64-bit) 1->2 1.969817 | |
ADD (register, lsr, 64-bit) 1->3 2.064701 | |
ADD (register, lsr, 64-bit) throughput 0.987366 | |
ADD (register, asr, 32-bit) 1->2 1.972884 | |
ADD (register, asr, 32-bit) 1->3 1.938710 | |
ADD (register, asr, 32-bit) throughput 1.011145 | |
ADD (register, asr, 64-bit) 1->2 1.997084 | |
ADD (register, asr, 64-bit) 1->3 2.192536 | |
ADD (register, asr, 64-bit) throughput 1.002014 | |
ADDS (sxtb, 32-bit) 1->2 2.003509 | |
ADDS (sxtb, 32-bit) 1->3 2.012358 | |
ADDS (sxtb, 32-bit) 4->2 2.369218 | |
ADDS (sxtb, 32-bit) 4->3 2.681993 | |
ADDS (sxtb, 32-bit) throughput 0.972342 | |
ADDS (sxtb, 64-bit) 1->2 1.887766 | |
ADDS (sxtb, 64-bit) 1->3 2.017153 | |
ADDS (sxtb, 64-bit) 4->2 2.006051 | |
ADDS (sxtb, 64-bit) 4->3 2.034557 | |
ADDS (sxtb, 64-bit) throughput 1.013898 | |
ADDS (uxtb, 32-bit) 1->2 2.041362 | |
ADDS (uxtb, 32-bit) 1->3 2.023481 | |
ADDS (uxtb, 32-bit) 4->2 2.047022 | |
ADDS (uxtb, 32-bit) 4->3 2.010306 | |
ADDS (uxtb, 32-bit) throughput 0.968183 | |
ADDS (uxtb, 64-bit) 1->2 1.969487 | |
ADDS (uxtb, 64-bit) 1->3 2.030044 | |
ADDS (uxtb, 64-bit) 4->2 2.020575 | |
ADDS (uxtb, 64-bit) 4->3 2.028597 | |
ADDS (uxtb, 64-bit) throughput 0.991116 | |
ADDS (sxth, 32-bit) 1->2 1.961602 | |
ADDS (sxth, 32-bit) 1->3 2.029689 | |
ADDS (sxth, 32-bit) 4->2 2.060730 | |
ADDS (sxth, 32-bit) 4->3 2.049295 | |
ADDS (sxth, 32-bit) throughput 1.009880 | |
ADDS (sxth, 64-bit) 1->2 2.021403 | |
ADDS (sxth, 64-bit) 1->3 2.005044 | |
ADDS (sxth, 64-bit) 4->2 1.999518 | |
ADDS (sxth, 64-bit) 4->3 1.471938 | |
ADDS (sxth, 64-bit) throughput 0.994147 | |
ADDS (uxth, 32-bit) 1->2 1.715751 | |
ADDS (uxth, 32-bit) 1->3 2.060796 | |
ADDS (uxth, 32-bit) 4->2 1.737027 | |
ADDS (uxth, 32-bit) 4->3 1.717218 | |
ADDS (uxth, 32-bit) throughput 1.365911 | |
ADDS (uxth, 64-bit) 1->2 1.715717 | |
ADDS (uxth, 64-bit) 1->3 2.009413 | |
ADDS (uxth, 64-bit) 4->2 1.996251 | |
ADDS (uxth, 64-bit) 4->3 2.155195 | |
ADDS (uxth, 64-bit) throughput 1.178284 | |
ADDS (sxtw, 32-bit) 1->2 0.840275 | |
ADDS (sxtw, 32-bit) 1->3 0.968779 | |
ADDS (sxtw, 32-bit) 4->2 1.412684 | |
ADDS (sxtw, 32-bit) 4->3 1.101602 | |
ADDS (sxtw, 32-bit) throughput 0.746584 | |
ADDS (sxtw, 64-bit) 1->2 2.239178 | |
ADDS (sxtw, 64-bit) 1->3 2.223924 | |
ADDS (sxtw, 64-bit) 4->2 2.230632 | |
ADDS (sxtw, 64-bit) 4->3 2.093078 | |
ADDS (sxtw, 64-bit) throughput 1.000601 | |
ADDS (uxtw, 32-bit) 1->2 0.977771 | |
ADDS (uxtw, 32-bit) 1->3 0.999934 | |
ADDS (uxtw, 32-bit) 4->2 1.050133 | |
ADDS (uxtw, 32-bit) 4->3 0.993755 | |
ADDS (uxtw, 32-bit) throughput 0.711428 | |
ADDS (uxtw, 64-bit) 1->2 2.139346 | |
ADDS (uxtw, 64-bit) 1->3 2.027802 | |
ADDS (uxtw, 64-bit) 4->2 2.050998 | |
ADDS (uxtw, 64-bit) 4->3 1.964709 | |
ADDS (uxtw, 64-bit) throughput 0.992627 | |
ADDS (uxtx, 64-bit) 1->2 0.989438 | |
ADDS (uxtx, 64-bit) 1->3 1.004255 | |
ADDS (uxtx, 64-bit) 4->2 1.018702 | |
ADDS (uxtx, 64-bit) 4->3 1.008356 | |
ADDS (uxtx, 64-bit) throughput 0.696606 | |
ADDS (sxtx, 64-bit) 1->2 0.972288 | |
ADDS (sxtx, 64-bit) 1->3 1.079045 | |
ADDS (sxtx, 64-bit) 4->2 1.006488 | |
ADDS (sxtx, 64-bit) 4->3 1.019511 | |
ADDS (sxtx, 64-bit) throughput 0.690283 | |
ADDS (immediate, 32-bit) 1->2 1.002903 | |
ADDS (immediate, 32-bit) 3->2 1.006094 | |
ADDS (immediate, 32-bit) throughput 0.706096 | |
ADDS (immediate, 64-bit) 1->2 0.996735 | |
ADDS (immediate, 64-bit) 3->2 1.005236 | |
ADDS (immediate, 64-bit) throughput 0.705760 | |
ADDS (shifted immediate, 32-bit) 1->2 0.990086 | |
ADDS (shifted immediate, 32-bit) 3->2 1.021732 | |
ADDS (shifted immediate, 32-bit) throughput 0.696755 | |
ADDS (shifted immediate, 64-bit) 1->2 0.996352 | |
ADDS (shifted immediate, 64-bit) 3->2 1.041453 | |
ADDS (shifted immediate, 64-bit) throughput 0.698475 | |
ADDS (register, 32-bit) 1->2 1.000044 | |
ADDS (register, 32-bit) 1->3 1.012763 | |
ADDS (register, 32-bit) 4->2 1.064203 | |
ADDS (register, 32-bit) 4->3 1.002127 | |
ADDS (register, 32-bit) throughput 0.697693 | |
ADDS (register, 64-bit) 1->2 0.980761 | |
ADDS (register, 64-bit) 1->3 0.999978 | |
ADDS (register, 64-bit) 4->2 1.041001 | |
ADDS (register, 64-bit) 4->3 1.001272 | |
ADDS (register, 64-bit) throughput 0.695792 | |
ADDS (register, lsl, 32-bit) 1->2 1.996622 | |
ADDS (register, lsl, 32-bit) 1->3 2.006598 | |
ADDS (register, lsl, 32-bit) 4->2 2.009544 | |
ADDS (register, lsl, 32-bit) 4->3 2.097717 | |
ADDS (register, lsl, 32-bit) throughput 1.076571 | |
ADDS (register, lsl, 64-bit) 1->2 2.620855 | |
ADDS (register, lsl, 64-bit) 1->3 1.912899 | |
ADDS (register, lsl, 64-bit) 4->2 2.019897 | |
ADDS (register, lsl, 64-bit) 4->3 2.026723 | |
ADDS (register, lsl, 64-bit) throughput 1.008496 | |
ADDS (register, lsr, 32-bit) 1->2 1.925519 | |
ADDS (register, lsr, 32-bit) 1->3 2.014567 | |
ADDS (register, lsr, 32-bit) 4->2 2.055343 | |
ADDS (register, lsr, 32-bit) 4->3 2.009699 | |
ADDS (register, lsr, 32-bit) throughput 1.019827 | |
ADDS (register, lsr, 64-bit) 1->2 1.994836 | |
ADDS (register, lsr, 64-bit) 1->3 2.012895 | |
ADDS (register, lsr, 64-bit) 4->2 2.039148 | |
ADDS (register, lsr, 64-bit) 4->3 2.122175 | |
ADDS (register, lsr, 64-bit) throughput 1.008376 | |
ADDS (register, asr, 32-bit) 1->2 1.997324 | |
ADDS (register, asr, 32-bit) 1->3 1.978939 | |
ADDS (register, asr, 32-bit) 4->2 2.032429 | |
ADDS (register, asr, 32-bit) 4->3 1.992028 | |
ADDS (register, asr, 32-bit) throughput 1.007937 | |
ADDS (register, asr, 64-bit) 1->2 2.001048 | |
ADDS (register, asr, 64-bit) 1->3 2.017696 | |
ADDS (register, asr, 64-bit) 4->2 2.002177 | |
ADDS (register, asr, 64-bit) 4->3 2.007157 | |
ADDS (register, asr, 64-bit) throughput 0.997310 | |
ADR throughput 0.500687 | |
ADRP throughput 0.507934 | |
AND (immediate, 32-bit) 1->2 1.002492 | |
AND (immediate, 32-bit) throughput 0.503703 | |
AND (immediate, 64-bit) 1->2 1.021403 | |
AND (immediate, 64-bit) throughput 0.501949 | |
AND (register, 32-bit) 1->2 1.025722 | |
AND (register, 32-bit) 1->3 0.994129 | |
AND (register, 32-bit) throughput 0.505942 | |
AND (register, 64-bit) 1->2 1.011855 | |
AND (register, 64-bit) 1->3 1.003968 | |
AND (register, 64-bit) throughput 0.503544 | |
AND (register, lsl, 32-bit) 1->2 2.012762 | |
AND (register, lsl, 32-bit) 1->3 1.985060 | |
AND (register, lsl, 32-bit) throughput 1.006270 | |
AND (register, lsl, 64-bit) 1->2 2.000220 | |
AND (register, lsl, 64-bit) 1->3 1.999609 | |
AND (register, lsl, 64-bit) throughput 0.859892 | |
AND (register, lsr, 32-bit) 1->2 1.993760 | |
AND (register, lsr, 32-bit) 1->3 1.980518 | |
AND (register, lsr, 32-bit) throughput 1.018310 | |
AND (register, lsr, 64-bit) 1->2 1.984590 | |
AND (register, lsr, 64-bit) 1->3 2.000064 | |
AND (register, lsr, 64-bit) throughput 0.985867 | |
AND (register, asr, 32-bit) 1->2 2.073742 | |
AND (register, asr, 32-bit) 1->3 2.001008 | |
AND (register, asr, 32-bit) throughput 1.012320 | |
AND (register, asr, 64-bit) 1->2 2.038101 | |
AND (register, asr, 64-bit) 1->3 1.999871 | |
AND (register, asr, 64-bit) throughput 1.022001 | |
AND (register, ror, 32-bit) 1->2 2.021421 | |
AND (register, ror, 32-bit) 1->3 2.032253 | |
AND (register, ror, 32-bit) throughput 1.005094 | |
AND (register, ror, 64-bit) 1->2 1.946970 | |
AND (register, ror, 64-bit) 1->3 2.026964 | |
AND (register, ror, 64-bit) throughput 0.999297 | |
ADDS (immediate, 32-bit) 1->2 1.004686 | |
ADDS (immediate, 32-bit) 3->2 1.011073 | |
ADDS (immediate, 32-bit) throughput 0.691243 | |
ADDS (immediate, 64-bit) 1->2 0.997127 | |
ADDS (immediate, 64-bit) 3->2 0.998684 | |
ADDS (immediate, 64-bit) throughput 0.701587 | |
ANDS (register, 32-bit) 1->2 1.000260 | |
ANDS (register, 32-bit) 1->3 1.013965 | |
ANDS (register, 32-bit) 4->2 1.025220 | |
ANDS (register, 32-bit) 4->3 1.013095 | |
ANDS (register, 32-bit) throughput 0.693797 | |
ANDS (register, 64-bit) 1->2 1.013034 | |
ANDS (register, 64-bit) 1->3 0.985058 | |
ANDS (register, 64-bit) 4->2 1.052209 | |
ANDS (register, 64-bit) 4->3 0.986524 | |
ANDS (register, 64-bit) throughput 0.700339 | |
ANDS (register, lsl, 32-bit) 1->2 2.031674 | |
ANDS (register, lsl, 32-bit) 1->3 1.996120 | |
ANDS (register, lsl, 32-bit) 4->2 2.033221 | |
ANDS (register, lsl, 32-bit) 4->3 1.998323 | |
ANDS (register, lsl, 32-bit) throughput 1.166100 | |
ANDS (register, lsl, 64-bit) 1->2 2.050727 | |
ANDS (register, lsl, 64-bit) 1->3 1.933125 | |
ANDS (register, lsl, 64-bit) 4->2 1.950764 | |
ANDS (register, lsl, 64-bit) 4->3 2.076381 | |
ANDS (register, lsl, 64-bit) throughput 0.991434 | |
ANDS (register, lsr, 32-bit) 1->2 2.013980 | |
ANDS (register, lsr, 32-bit) 1->3 1.993236 | |
ANDS (register, lsr, 32-bit) 4->2 2.007127 | |
ANDS (register, lsr, 32-bit) 4->3 2.000964 | |
ANDS (register, lsr, 32-bit) throughput 1.009948 | |
ANDS (register, lsr, 64-bit) 1->2 2.001183 | |
ANDS (register, lsr, 64-bit) 1->3 1.998653 | |
ANDS (register, lsr, 64-bit) 4->2 2.101789 | |
ANDS (register, lsr, 64-bit) 4->3 2.041889 | |
ANDS (register, lsr, 64-bit) throughput 1.006352 | |
ANDS (register, asr, 32-bit) 1->2 1.956188 | |
ANDS (register, asr, 32-bit) 1->3 2.007627 | |
ANDS (register, asr, 32-bit) 4->2 2.021969 | |
ANDS (register, asr, 32-bit) 4->3 2.561366 | |
ANDS (register, asr, 32-bit) throughput 0.999221 | |
ANDS (register, asr, 64-bit) 1->2 2.076139 | |
ANDS (register, asr, 64-bit) 1->3 1.985512 | |
ANDS (register, asr, 64-bit) 4->2 2.025977 | |
ANDS (register, asr, 64-bit) 4->3 2.039046 | |
ANDS (register, asr, 64-bit) throughput 1.008617 | |
ANDS (register, ror, 32-bit) 1->2 2.013392 | |
ANDS (register, ror, 32-bit) 1->3 1.955192 | |
ANDS (register, ror, 32-bit) 4->2 2.061305 | |
ANDS (register, ror, 32-bit) 4->3 1.999389 | |
ANDS (register, ror, 32-bit) throughput 1.014483 | |
ANDS (register, ror, 64-bit) 1->2 1.999495 | |
ANDS (register, ror, 64-bit) 1->3 2.006040 | |
ANDS (register, ror, 64-bit) 4->2 1.986745 | |
ANDS (register, ror, 64-bit) 4->3 2.002430 | |
ANDS (register, ror, 64-bit) throughput 1.010648 | |
ASR (immediate, 32-bit) 1->2 1.003795 | |
ASR (immediate, 32-bit) throughput 0.378440 | |
ASR (immediate, 64-bit) 1->2 1.000786 | |
ASR (immediate, 64-bit) throughput 0.514935 | |
ASR (register, 32-bit) 1->2 1.000044 | |
ASR (register, 32-bit) 1->3 1.003242 | |
ASR (register, 32-bit) throughput 0.536185 | |
ASR (register, 64-bit) 1->2 0.999946 | |
ASR (register, 64-bit) 1->3 0.988798 | |
ASR (register, 64-bit) throughput 0.625165 | |
B throughput 2.957976 | |
B.cc (not taken) throughput 0.656999 | |
B.cc (taken) throughput 3.936628 | |
BFC (32-bit) 1->1 1.018429 | |
BFC (32-bit) throughput 1.011443 | |
BFC (64-bit) 1->1 1.011030 | |
BFC (64-bit) throughput 1.010017 | |
BFI (32-bit) 1->1 1.001702 | |
BFI (32-bit) 1->2 1.010644 | |
BFI (32-bit) throughput 1.009519 | |
BFI (64-bit) 1->1 0.978997 | |
BFI (64-bit) 1->2 1.053014 | |
BFI (64-bit) throughput 1.003776 | |
BFXIL (32-bit) 1->1 1.004178 | |
BFXIL (32-bit) 1->2 0.984636 | |
BFXIL (32-bit) throughput 0.991258 | |
BFXIL (64-bit) 1->1 1.003853 | |
BFXIL (64-bit) 1->2 0.994393 | |
BFXIL (64-bit) throughput 1.004087 | |
BIC (register, 32-bit) 1->2 0.986266 | |
BIC (register, 32-bit) 1->3 1.008514 | |
BIC (register, 32-bit) throughput 0.502199 | |
BIC (register, 64-bit) 1->2 0.991586 | |
BIC (register, 64-bit) 1->3 1.001294 | |
BIC (register, 64-bit) throughput 0.503478 | |
BIC (register, lsl, 32-bit) 1->2 2.064351 | |
BIC (register, lsl, 32-bit) 1->3 1.982097 | |
BIC (register, lsl, 32-bit) throughput 1.001350 | |
BIC (register, lsl, 64-bit) 1->2 1.970603 | |
BIC (register, lsl, 64-bit) 1->3 2.041813 | |
BIC (register, lsl, 64-bit) throughput 0.990221 | |
BIC (register, lsr, 32-bit) 1->2 1.998182 | |
BIC (register, lsr, 32-bit) 1->3 2.027896 | |
BIC (register, lsr, 32-bit) throughput 1.008181 | |
BIC (register, lsr, 64-bit) 1->2 1.998270 | |
BIC (register, lsr, 64-bit) 1->3 2.010987 | |
BIC (register, lsr, 64-bit) throughput 1.013852 | |
BIC (register, asr, 32-bit) 1->2 2.046867 | |
BIC (register, asr, 32-bit) 1->3 2.061921 | |
BIC (register, asr, 32-bit) throughput 0.993025 | |
BIC (register, asr, 64-bit) 1->2 2.006994 | |
BIC (register, asr, 64-bit) 1->3 1.998007 | |
BIC (register, asr, 64-bit) throughput 1.007149 | |
BIC (register, ror, 32-bit) 1->2 1.993305 | |
BIC (register, ror, 32-bit) 1->3 2.011490 | |
BIC (register, ror, 32-bit) throughput 0.970330 | |
BIC (register, ror, 64-bit) 1->2 1.969328 | |
BIC (register, ror, 64-bit) 1->3 2.016551 | |
BIC (register, ror, 64-bit) throughput 1.018243 | |
BICS (register, 32-bit) 1->2 0.993881 | |
BICS (register, 32-bit) 1->3 0.997237 | |
BICS (register, 32-bit) 4->2 1.037582 | |
BICS (register, 32-bit) 4->3 1.019124 | |
BICS (register, 32-bit) throughput 0.692911 | |
BICS (register, 64-bit) 1->2 1.019357 | |
BICS (register, 64-bit) 1->3 0.985674 | |
BICS (register, 64-bit) 4->2 1.000168 | |
BICS (register, 64-bit) 4->3 0.959482 | |
BICS (register, 64-bit) throughput 0.700310 | |
BICS (register, lsl, 32-bit) 1->2 2.063021 | |
BICS (register, lsl, 32-bit) 1->3 1.993847 | |
BICS (register, lsl, 32-bit) 4->2 2.009846 | |
BICS (register, lsl, 32-bit) 4->3 1.996666 | |
BICS (register, lsl, 32-bit) throughput 1.003766 | |
BICS (register, lsl, 64-bit) 1->2 2.006840 | |
BICS (register, lsl, 64-bit) 1->3 2.028959 | |
BICS (register, lsl, 64-bit) 4->2 2.028586 | |
BICS (register, lsl, 64-bit) 4->3 1.920076 | |
BICS (register, lsl, 64-bit) throughput 1.007291 | |
BICS (register, lsr, 32-bit) 1->2 1.974773 | |
BICS (register, lsr, 32-bit) 1->3 2.001184 | |
BICS (register, lsr, 32-bit) 4->2 2.056956 | |
BICS (register, lsr, 32-bit) 4->3 1.997960 | |
BICS (register, lsr, 32-bit) throughput 0.993692 | |
BICS (register, lsr, 64-bit) 1->2 1.990053 | |
BICS (register, lsr, 64-bit) 1->3 2.015915 | |
BICS (register, lsr, 64-bit) 4->2 1.993354 | |
BICS (register, lsr, 64-bit) 4->3 1.999038 | |
BICS (register, lsr, 64-bit) throughput 1.010170 | |
BICS (register, asr, 32-bit) 1->2 2.001161 | |
BICS (register, asr, 32-bit) 1->3 2.046813 | |
BICS (register, asr, 32-bit) 4->2 2.010782 | |
BICS (register, asr, 32-bit) 4->3 2.022168 | |
BICS (register, asr, 32-bit) throughput 1.014096 | |
BICS (register, asr, 64-bit) 1->2 2.040863 | |
BICS (register, asr, 64-bit) 1->3 2.232306 | |
BICS (register, asr, 64-bit) 4->2 2.002359 | |
BICS (register, asr, 64-bit) 4->3 1.994684 | |
BICS (register, asr, 64-bit) throughput 1.017351 | |
BICS (register, ror, 32-bit) 1->2 2.022315 | |
BICS (register, ror, 32-bit) 1->3 2.007930 | |
BICS (register, ror, 32-bit) 4->2 2.014116 | |
BICS (register, ror, 32-bit) 4->3 1.919306 | |
BICS (register, ror, 32-bit) throughput 1.004646 | |
BICS (register, ror, 64-bit) 1->2 2.001445 | |
BICS (register, ror, 64-bit) 1->3 2.002983 | |
BICS (register, ror, 64-bit) 4->2 2.072538 | |
BICS (register, ror, 64-bit) 4->3 2.048727 | |
BICS (register, ror, 64-bit) throughput 1.011328 | |
BL throughput 3.213531 | |
CBNZ (not taken) throughput 0.713315 | |
CBNZ (taken) throughput 3.901890 | |
CBZ (not taken) throughput 0.716811 | |
CBZ (taken) throughput 3.927248 | |
CCMN (immediate, 32-bit) 2->1 1.060855 | |
CCMN (immediate, 32-bit) 2->2 0.998708 | |
CCMN (immediate, 32-bit) throughput 1.386021 | |
CCMN (immediate, 32-bit) throughput 1.196692 | |
CCMN (immediate, 64-bit) 2->1 0.981321 | |
CCMN (immediate, 64-bit) 2->2 1.004255 | |
CCMN (immediate, 64-bit) throughput 1.404276 | |
CCMN (immediate, 64-bit) throughput 1.201834 | |
CCMN (register, 32-bit) 3->1 1.017644 | |
CCMN (register, 32-bit) 3->2 0.977802 | |
CCMN (register, 32-bit) 3->3 1.002969 | |
CCMN (register, 32-bit) throughput 1.406677 | |
CCMN (register, 32-bit) throughput 1.188983 | |
CCMN (register, 64-bit) 3->1 0.990562 | |
CCMN (register, 64-bit) 3->2 1.012457 | |
CCMN (register, 64-bit) 3->3 0.998708 | |
CCMN (register, 64-bit) throughput 1.369157 | |
CCMN (register, 64-bit) throughput 1.200490 | |
CCMP (immediate, 32-bit) 2->1 1.015986 | |
CCMP (immediate, 32-bit) 2->2 1.001798 | |
CCMP (immediate, 32-bit) throughput 1.396633 | |
CCMP (immediate, 32-bit) throughput 1.211639 | |
CCMP (immediate, 64-bit) 2->1 1.068745 | |
CCMP (immediate, 64-bit) 2->2 0.986029 | |
CCMP (immediate, 64-bit) throughput 1.354471 | |
CCMP (immediate, 64-bit) throughput 1.209199 | |
CCMP (register, 32-bit) 3->1 1.001644 | |
CCMP (register, 32-bit) 3->2 1.065740 | |
CCMP (register, 32-bit) 3->3 1.001869 | |
CCMP (register, 32-bit) throughput 1.395859 | |
CCMP (register, 32-bit) throughput 1.206721 | |
CCMP (register, 64-bit) 3->1 0.997215 | |
CCMP (register, 64-bit) 3->2 1.015132 | |
CCMP (register, 64-bit) 3->3 1.003267 | |
CCMP (register, 64-bit) throughput 1.403281 | |
CCMP (register, 64-bit) throughput 1.208350 | |
CINC (32-bit) 1->2 1.051189 | |
CINC (32-bit) 1->3 1.073586 | |
CINC (32-bit) throughput 0.529602 | |
CINC (64-bit) 1->2 1.000000 | |
CINC (64-bit) 1->3 1.051228 | |
CINC (64-bit) throughput 0.499346 | |
CINV (32-bit) 1->2 0.996329 | |
CINV (32-bit) 1->3 1.046101 | |
CINV (32-bit) throughput 0.505966 | |
CINV (64-bit) 1->2 0.985391 | |
CINV (64-bit) 1->3 0.989185 | |
CINV (64-bit) throughput 0.495994 | |
CLREX throughput 5.080461 | |
CLREX throughput 5.037555 | |
CLS (32-bit) 1->2 1.002947 | |
CLS (32-bit) throughput 0.508090 | |
CLS (64-bit) 1->2 1.009779 | |
CLS (64-bit) throughput 0.505589 | |
CLZ (32-bit) 1->2 1.012262 | |
CLZ (32-bit) throughput 0.495622 | |
CLZ (64-bit) 1->2 1.003211 | |
CLZ (64-bit) throughput 0.504741 | |
CMN (sxtb, 32-bit) 3->1 2.027740 | |
CMN (sxtb, 32-bit) 3->2 1.940013 | |
CMN (sxtb, 32-bit) throughput 1.008648 | |
CMN (sxtb, 64-bit) 3->1 2.013335 | |
CMN (sxtb, 64-bit) 3->2 2.002066 | |
CMN (sxtb, 64-bit) throughput 0.980217 | |
CMN (uxtb, 32-bit) 3->1 1.962957 | |
CMN (uxtb, 32-bit) 3->2 2.025733 | |
CMN (uxtb, 32-bit) throughput 1.048162 | |
CMN (uxtb, 64-bit) 3->1 2.067263 | |
CMN (uxtb, 64-bit) 3->2 1.961703 | |
CMN (uxtb, 64-bit) throughput 0.976382 | |
CMN (sxth, 32-bit) 3->1 2.009017 | |
CMN (sxth, 32-bit) 3->2 1.909491 | |
CMN (sxth, 32-bit) throughput 0.993616 | |
CMN (sxth, 64-bit) 3->1 2.000088 | |
CMN (sxth, 64-bit) 3->2 2.059811 | |
CMN (sxth, 64-bit) throughput 1.011895 | |
CMN (uxth, 32-bit) 3->1 2.050756 | |
CMN (uxth, 32-bit) 3->2 2.008979 | |
CMN (uxth, 32-bit) throughput 1.001007 | |
CMN (uxth, 64-bit) 3->1 1.975705 | |
CMN (uxth, 64-bit) 3->2 1.993955 | |
CMN (uxth, 64-bit) throughput 1.015369 | |
CMN (sxtw, 32-bit) 3->1 0.971203 | |
CMN (sxtw, 32-bit) 3->2 1.023594 | |
CMN (sxtw, 32-bit) throughput 0.679017 | |
CMN (sxtw, 64-bit) 3->1 2.056970 | |
CMN (sxtw, 64-bit) 3->2 2.008730 | |
CMN (sxtw, 64-bit) throughput 1.005698 | |
CMN (uxtw, 32-bit) 3->1 0.963861 | |
CMN (uxtw, 32-bit) 3->2 1.048408 | |
CMN (uxtw, 32-bit) throughput 0.673679 | |
CMN (uxtw, 64-bit) 3->1 1.978097 | |
CMN (uxtw, 64-bit) 3->2 2.048142 | |
CMN (uxtw, 64-bit) throughput 1.005932 | |
CMN (uxtx, 64-bit) 3->1 0.975829 | |
CMN (uxtx, 64-bit) 3->2 1.013634 | |
CMN (uxtx, 64-bit) throughput 0.679276 | |
CMN (sxtx, 64-bit) 3->1 1.029008 | |
CMN (sxtx, 64-bit) 3->2 1.008693 | |
CMN (sxtx, 64-bit) throughput 0.675731 | |
CMN (immediate, 32-bit) 2->1 0.932579 | |
CMN (immediate, 32-bit) throughput 0.665653 | |
CMN (immediate, 64-bit) 2->1 0.998686 | |
CMN (immediate, 64-bit) throughput 0.674255 | |
CMN (shifted immediate, 32-bit) 2->1 1.003443 | |
CMN (shifted immediate, 32-bit) throughput 0.681622 | |
CMN (shifted immediate, 64-bit) 2->1 0.994525 | |
CMN (shifted immediate, 64-bit) throughput 0.670779 | |
CMN (register, 32-bit) 3->1 1.004304 | |
CMN (register, 32-bit) 3->2 0.961878 | |
CMN (register, 32-bit) throughput 0.665655 | |
CMN (register, 64-bit) 3->1 1.023289 | |
CMN (register, 64-bit) 3->2 1.006833 | |
CMN (register, 64-bit) throughput 0.676722 | |
CMN (register, lsl, 32-bit) 3->1 2.018392 | |
CMN (register, lsl, 32-bit) 3->2 2.009321 | |
CMN (register, lsl, 32-bit) throughput 1.001064 | |
CMN (register, lsl, 64-bit) 3->1 1.999715 | |
CMN (register, lsl, 64-bit) 3->2 2.025291 | |
CMN (register, lsl, 64-bit) throughput 1.009126 | |
CMN (register, lsr, 32-bit) 3->1 2.021829 | |
CMN (register, lsr, 32-bit) 3->2 2.016429 | |
CMN (register, lsr, 32-bit) throughput 1.008137 | |
CMN (register, lsr, 64-bit) 3->1 2.024138 | |
CMN (register, lsr, 64-bit) 3->2 2.007664 | |
CMN (register, lsr, 64-bit) throughput 1.005756 | |
CMN (register, asr, 32-bit) 3->1 2.029997 | |
CMN (register, asr, 32-bit) 3->2 2.077151 | |
CMN (register, asr, 32-bit) throughput 1.007367 | |
CMN (register, asr, 64-bit) 3->1 1.997116 | |
CMN (register, asr, 64-bit) 3->2 1.997284 | |
CMN (register, asr, 64-bit) throughput 1.112375 | |
CMP (sxtb, 32-bit) 3->1 2.002555 | |
CMP (sxtb, 32-bit) 3->2 1.994041 | |
CMP (sxtb, 32-bit) throughput 1.008776 | |
CMP (sxtb, 64-bit) 3->1 2.024136 | |
CMP (sxtb, 64-bit) 3->2 2.018441 | |
CMP (sxtb, 64-bit) throughput 1.010538 | |
CMP (uxtb, 32-bit) 3->1 2.022679 | |
CMP (uxtb, 32-bit) 3->2 2.018249 | |
CMP (uxtb, 32-bit) throughput 1.018622 | |
CMP (uxtb, 64-bit) 3->1 2.000634 | |
CMP (uxtb, 64-bit) 3->2 1.998026 | |
CMP (uxtb, 64-bit) throughput 0.999044 | |
CMP (sxth, 32-bit) 3->1 2.032000 | |
CMP (sxth, 32-bit) 3->2 2.017935 | |
CMP (sxth, 32-bit) throughput 1.007475 | |
CMP (sxth, 64-bit) 3->1 2.250734 | |
CMP (sxth, 64-bit) 3->2 2.034505 | |
CMP (sxth, 64-bit) throughput 1.102485 | |
CMP (uxth, 32-bit) 3->1 1.994480 | |
CMP (uxth, 32-bit) 3->2 2.093771 | |
CMP (uxth, 32-bit) throughput 1.007628 | |
CMP (uxth, 64-bit) 3->1 2.032084 | |
CMP (uxth, 64-bit) 3->2 2.069161 | |
CMP (uxth, 64-bit) throughput 0.987011 | |
CMP (sxtw, 32-bit) 3->1 1.003742 | |
CMP (sxtw, 32-bit) 3->2 1.011842 | |
CMP (sxtw, 32-bit) throughput 0.672947 | |
CMP (sxtw, 64-bit) 3->1 1.999430 | |
CMP (sxtw, 64-bit) 3->2 1.998926 | |
CMP (sxtw, 64-bit) throughput 1.006540 | |
CMP (uxtw, 32-bit) 3->1 1.040665 | |
CMP (uxtw, 32-bit) 3->2 1.006774 | |
CMP (uxtw, 32-bit) throughput 0.667814 | |
CMP (uxtw, 64-bit) 3->1 2.089135 | |
CMP (uxtw, 64-bit) 3->2 2.004452 | |
CMP (uxtw, 64-bit) throughput 1.016573 | |
CMP (uxtx, 64-bit) 3->1 1.000943 | |
CMP (uxtx, 64-bit) 3->2 1.080769 | |
CMP (uxtx, 64-bit) throughput 0.670455 | |
CMP (sxtx, 64-bit) 3->1 1.001009 | |
CMP (sxtx, 64-bit) 3->2 0.989583 | |
CMP (sxtx, 64-bit) throughput 0.675626 | |
CMP (immediate, 32-bit) 2->1 1.019167 | |
CMP (immediate, 32-bit) throughput 0.670765 | |
CMP (immediate, 64-bit) 2->1 0.981843 | |
CMP (immediate, 64-bit) throughput 0.670153 | |
CMP (shifted immediate, 32-bit) 2->1 1.459658 | |
CMP (shifted immediate, 32-bit) throughput 0.673227 | |
CMP (shifted immediate, 64-bit) 2->1 0.944707 | |
CMP (shifted immediate, 64-bit) throughput 0.672941 | |
CMP (register, 32-bit) 3->1 1.002300 | |
CMP (register, 32-bit) 3->2 0.962852 | |
CMP (register, 32-bit) throughput 0.680460 | |
CMP (register, 64-bit) 3->1 0.955695 | |
CMP (register, 64-bit) 3->2 1.003602 | |
CMP (register, 64-bit) throughput 0.675589 | |
CMP (register, lsl, 32-bit) 3->1 2.042338 | |
CMP (register, lsl, 32-bit) 3->2 2.001645 | |
CMP (register, lsl, 32-bit) throughput 1.022590 | |
CMP (register, lsl, 64-bit) 3->1 2.016142 | |
CMP (register, lsl, 64-bit) 3->2 2.038679 | |
CMP (register, lsl, 64-bit) throughput 0.995140 | |
CMP (register, lsr, 32-bit) 3->1 2.026776 | |
CMP (register, lsr, 32-bit) 3->2 2.059450 | |
CMP (register, lsr, 32-bit) throughput 1.004158 | |
CMP (register, lsr, 64-bit) 3->1 2.000154 | |
CMP (register, lsr, 64-bit) 3->2 2.057775 | |
CMP (register, lsr, 64-bit) throughput 1.005062 | |
CMP (register, asr, 32-bit) 3->1 2.013027 | |
CMP (register, asr, 32-bit) 3->2 1.939340 | |
CMP (register, asr, 32-bit) throughput 1.004357 | |
CMP (register, asr, 64-bit) 3->1 2.073163 | |
CMP (register, asr, 64-bit) 3->2 1.988741 | |
CMP (register, asr, 64-bit) throughput 1.001130 | |
CNEG (32-bit) 1->2 1.022786 | |
CNEG (32-bit) 1->3 1.008123 | |
CNEG (32-bit) throughput 0.502571 | |
CNEG (64-bit) 1->2 0.997128 | |
CNEG (64-bit) 1->3 0.998621 | |
CNEG (64-bit) throughput 0.501432 | |
CSDB throughput 0.339313 | |
CSEL (32-bit) 1->2 1.003387 | |
CSEL (32-bit) 1->3 1.003706 | |
CSEL (32-bit) 1->4 1.039943 | |
CSEL (32-bit) throughput 0.496596 | |
CSEL (64-bit) 1->2 1.002434 | |
CSEL (64-bit) 1->3 1.015220 | |
CSEL (64-bit) 1->4 0.960055 | |
CSEL (64-bit) throughput 0.506917 | |
CSET (32-bit) 1->2 1.018177 | |
CSET (32-bit) throughput 0.503708 | |
CSET (64-bit) 1->2 1.006950 | |
CSET (64-bit) throughput 0.528508 | |
CSETM (32-bit) 1->2 1.002684 | |
CSETM (32-bit) throughput 0.500933 | |
CSETM (64-bit) 1->2 1.021662 | |
CSETM (64-bit) throughput 0.495254 | |
CSINC (32-bit) 1->2 1.004266 | |
CSINC (32-bit) 1->3 0.999003 | |
CSINC (32-bit) 1->4 1.036318 | |
CSINC (32-bit) throughput 0.481779 | |
CSINC (64-bit) 1->2 1.019120 | |
CSINC (64-bit) 1->3 1.010315 | |
CSINC (64-bit) 1->4 1.017646 | |
CSINC (64-bit) throughput 0.503925 | |
CSINV (32-bit) 1->2 1.025028 | |
CSINV (32-bit) 1->3 1.009668 | |
CSINV (32-bit) 1->4 1.042550 | |
CSINV (32-bit) throughput 0.500829 | |
CSINV (64-bit) 1->2 1.008961 | |
CSINV (64-bit) 1->3 0.992428 | |
CSINV (64-bit) 1->4 1.095996 | |
CSINV (64-bit) throughput 0.504084 | |
CSNEG (32-bit) 1->2 0.993489 | |
CSNEG (32-bit) 1->3 0.965814 | |
CSNEG (32-bit) 1->4 1.006920 | |
CSNEG (32-bit) throughput 0.503568 | |
CSNEG (64-bit) 1->2 1.005702 | |
CSNEG (64-bit) 1->3 1.005402 | |
CSNEG (64-bit) 1->4 1.039318 | |
CSNEG (64-bit) throughput 0.500810 | |
DMB (SY) throughput 8.443930 | |
DMB (ST) throughput 9.034754 | |
DMB (LD) throughput 8.922060 | |
DMB (ISH) throughput 8.911093 | |
DMB (ISHST) throughput 9.028898 | |
DMB (ISHLD) throughput 9.033158 | |
DMB (NSH) throughput 8.840704 | |
DMB (NSHST) throughput 9.008664 | |
DMB (NSHLD) throughput 9.014649 | |
DMB (OSH) throughput 8.984112 | |
DMB (OSHST) throughput 8.954245 | |
DMB (OSHLD) throughput 9.040394 | |
DSB (SY) throughput 17.760345 | |
DSB (ST) throughput 17.974107 | |
DSB (LD) throughput 17.590773 | |
DSB (ISH) throughput 17.751116 | |
DSB (ISHST) throughput 17.919762 | |
DSB (ISHLD) throughput 17.614600 | |
DSB (NSH) throughput 16.760877 | |
DSB (NSHST) throughput 17.931979 | |
DSB (NSHLD) throughput 17.722172 | |
DSB (OSH) throughput 17.428704 | |
DSB (OSHST) throughput 17.693256 | |
DSB (OSHLD) throughput 17.724142 | |
EON (register, 32-bit) 1->2 0.977097 | |
EON (register, 32-bit) 1->3 1.032401 | |
EON (register, 32-bit) throughput 0.504260 | |
EON (register, 64-bit) 1->2 1.015548 | |
EON (register, 64-bit) 1->3 0.998948 | |
EON (register, 64-bit) throughput 0.507833 | |
EON (register, lsl, 32-bit) 1->2 2.051940 | |
EON (register, lsl, 32-bit) 1->3 2.023947 | |
EON (register, lsl, 32-bit) throughput 1.011491 | |
EON (register, lsl, 64-bit) 1->2 1.986951 | |
EON (register, lsl, 64-bit) 1->3 2.022498 | |
EON (register, lsl, 64-bit) throughput 1.016258 | |
EON (register, lsr, 32-bit) 1->2 1.999673 | |
EON (register, lsr, 32-bit) 1->3 2.033105 | |
EON (register, lsr, 32-bit) throughput 1.001836 | |
EON (register, lsr, 64-bit) 1->2 1.983962 | |
EON (register, lsr, 64-bit) 1->3 2.001118 | |
EON (register, lsr, 64-bit) throughput 0.992152 | |
EON (register, asr, 32-bit) 1->2 1.984804 | |
EON (register, asr, 32-bit) 1->3 2.017043 | |
EON (register, asr, 32-bit) throughput 1.007630 | |
EON (register, asr, 64-bit) 1->2 1.990773 | |
EON (register, asr, 64-bit) 1->3 2.009911 | |
EON (register, asr, 64-bit) throughput 1.011806 | |
EON (register, ror, 32-bit) 1->2 2.068936 | |
EON (register, ror, 32-bit) 1->3 2.048219 | |
EON (register, ror, 32-bit) throughput 0.995500 | |
EON (register, ror, 64-bit) 1->2 1.999430 | |
EON (register, ror, 64-bit) 1->3 2.016683 | |
EON (register, ror, 64-bit) throughput 1.181655 | |
EOR (immediate, 32-bit) 1->2 0.997128 | |
EOR (immediate, 32-bit) throughput 0.504190 | |
EOR (immediate, 64-bit) 1->2 0.998839 | |
EOR (immediate, 64-bit) throughput 0.501308 | |
EOR (register, 32-bit) 1->2 0.988429 | |
EOR (register, 32-bit) 1->3 1.002926 | |
EOR (register, 32-bit) throughput 0.504486 | |
EOR (register, 64-bit) 1->2 0.990176 | |
EOR (register, 64-bit) 1->3 0.998664 | |
EOR (register, 64-bit) throughput 0.507206 | |
EOR (register, lsl, 32-bit) 1->2 2.008577 | |
EOR (register, lsl, 32-bit) 1->3 2.024034 | |
EOR (register, lsl, 32-bit) throughput 1.038339 | |
EOR (register, lsl, 64-bit) 1->2 2.081573 | |
EOR (register, lsl, 64-bit) 1->3 2.012720 | |
EOR (register, lsl, 64-bit) throughput 1.017374 | |
EOR (register, lsr, 32-bit) 1->2 2.030325 | |
EOR (register, lsr, 32-bit) 1->3 2.008929 | |
EOR (register, lsr, 32-bit) throughput 0.998852 | |
EOR (register, lsr, 64-bit) 1->2 2.008022 | |
EOR (register, lsr, 64-bit) 1->3 2.074639 | |
EOR (register, lsr, 64-bit) throughput 1.004551 | |
EOR (register, asr, 32-bit) 1->2 2.027857 | |
EOR (register, asr, 32-bit) 1->3 2.014910 | |
EOR (register, asr, 32-bit) throughput 0.970906 | |
EOR (register, asr, 64-bit) 1->2 2.047870 | |
EOR (register, asr, 64-bit) 1->3 2.002250 | |
EOR (register, asr, 64-bit) throughput 1.004677 | |
EOR (register, ror, 32-bit) 1->2 2.030404 | |
EOR (register, ror, 32-bit) 1->3 2.000920 | |
EOR (register, ror, 32-bit) throughput 1.001243 | |
EOR (register, ror, 64-bit) 1->2 1.996160 | |
EOR (register, ror, 64-bit) 1->3 2.042421 | |
EOR (register, ror, 64-bit) throughput 1.016226 | |
EXTR (register, 32-bit) 1->2 1.010227 | |
EXTR (register, 32-bit) 1->3 2.005497 | |
EXTR (register, 32-bit) throughput 1.035548 | |
EXTR (register, 64-bit) 1->2 0.991818 | |
EXTR (register, 64-bit) 1->3 2.008446 | |
EXTR (register, 64-bit) throughput 1.019914 | |
ISB (SY) throughput 25.581788 | |
LDNP (32-bit) 1->3 (with chain penalty) 4.044393 | |
LDNP (32-bit) 2->3 (with chain penalty) 4.070474 | |
LDNP (32-bit) throughput 1.003932 | |
LDNP (64-bit) 1->3 (with chain penalty) 4.046541 | |
LDNP (64-bit) 2->3 (with chain penalty) 4.026934 | |
LDNP (64-bit) throughput 1.009622 | |
LDP (32-bit) 1->3 (with chain penalty) 4.047958 | |
LDP (32-bit) 2->3 (with chain penalty) 4.878055 | |
LDP (32-bit) throughput 1.004853 | |
LDP (64-bit) 1->3 (with chain penalty) 4.174227 | |
LDP (64-bit) 2->3 (with chain penalty) 4.087767 | |
LDP (64-bit) throughput 1.005140 | |
LDP (post-index, 32-bit) 1->3 (with chain penalty) 3.903878 | |
LDP (post-index, 32-bit) 2->3 (with chain penalty) 4.143860 | |
LDP (post-index, 32-bit) throughput 1.084549 | |
LDP (post-index, 64-bit) 1->3 (with chain penalty) 12.546301 | |
LDP (post-index, 64-bit) 2->3 (with chain penalty) 11.669506 | |
LDP (post-index, 64-bit) throughput 1.074776 | |
LDP (pre-index, 32-bit) 1->3 (with chain penalty) 4.230391 | |
LDP (pre-index, 32-bit) 2->3 (with chain penalty) 4.139245 | |
LDP (pre-index, 32-bit) throughput 1.081937 | |
LDP (pre-index, 64-bit) 1->3 (with chain penalty) 12.194854 | |
LDP (pre-index, 64-bit) 2->3 (with chain penalty) 12.465035 | |
LDP (pre-index, 64-bit) throughput 1.081925 | |
LDP (signed offset, 32-bit) 1->3 (with chain penalty) 3.841953 | |
LDP (signed offset, 32-bit) 2->3 (with chain penalty) 4.108388 | |
LDP (signed offset, 32-bit) throughput 1.006257 | |
LDP (signed offset, 64-bit) 1->3 (with chain penalty) 4.057196 | |
LDP (signed offset, 64-bit) 2->3 (with chain penalty) 4.163538 | |
LDP (signed offset, 64-bit) throughput 1.012624 | |
LDPSW (post-index) 1->3 (with chain penalty) 4.179043 | |
LDPSW (post-index) 2->3 (with chain penalty) 4.079035 | |
LDPSW (post-index) throughput 1.075887 | |
LDPSW (pre-index) 1->3 (with chain penalty) 4.177090 | |
LDPSW (pre-index) 2->3 (with chain penalty) 4.218474 | |
LDPSW (pre-index) throughput 1.026233 | |
LDPSW (signed offset) 1->3 (with chain penalty) 4.209227 | |
LDPSW (signed offset) 2->3 (with chain penalty) 4.013489 | |
LDPSW (signed offset) throughput 1.022170 | |
LDR (32-bit) 1->2 (with chain penalty) 3.971448 | |
LDR (32-bit) throughput 1.011684 | |
LDR (64-bit) 1->2 (with chain penalty) 4.072593 | |
LDR (64-bit) throughput 0.998471 | |
LDR (post-index, 32-bit) 1->2 (with chain penalty) 4.351603 | |
LDR (post-index, 32-bit) throughput 1.083341 | |
LDR (post-index, 64-bit) 1->2 (with chain penalty) 4.229440 | |
LDR (post-index, 64-bit) throughput 1.033304 | |
LDR (pre-index, 32-bit) 1->2 (with chain penalty) 4.243514 | |
LDR (pre-index, 32-bit) throughput 1.070808 | |
LDR (pre-index, 64-bit) 1->2 (with chain penalty) 4.253933 | |
LDR (pre-index, 64-bit) throughput 1.080068 | |
LDR (unsigned offset, 32-bit) 1->2 (with chain penalty) 4.024049 | |
LDR (unsigned offset, 32-bit) throughput 1.017205 | |
LDR (unsigned offset, 64-bit) 1->2 (with chain penalty) 3.969632 | |
LDR (unsigned offset, 64-bit) throughput 1.007177 | |
LDR (literal, 32-bit) throughput 1.011905 | |
LDR (literal, 64-bit) throughput 1.015191 | |
LDR (register, 32-bit) 1->2 (with chain penalty) 4.092763 | |
LDR (register, 32-bit) 1->3 (with chain penalty) 4.017211 | |
LDR (register, 32-bit) throughput 0.992980 | |
LDR (register, 64-bit) 1->2 (with chain penalty) 4.057472 | |
LDR (register, 64-bit) 1->3 (with chain penalty) 4.090575 | |
LDR (register, 64-bit) throughput 1.010506 | |
LDR (register, uxtw, 32-bit) 1->2 (with chain penalty) 4.093121 | |
LDR (register, uxtw, 32-bit) 1->3 (with chain penalty) 4.078751 | |
LDR (register, uxtw, 32-bit) throughput 0.993836 | |
LDR (register, uxtw, 64-bit) 1->2 (with chain penalty) 3.815059 | |
LDR (register, uxtw, 64-bit) 1->3 (with chain penalty) 4.122432 | |
LDR (register, uxtw, 64-bit) throughput 1.007306 | |
LDR (register, sxtw, 32-bit) 1->2 (with chain penalty) 4.058102 | |
LDR (register, sxtw, 32-bit) 1->3 (with chain penalty) 4.003536 | |
LDR (register, sxtw, 32-bit) throughput 1.004945 | |
LDR (register, sxtw, 64-bit) 1->2 (with chain penalty) 4.047149 | |
LDR (register, sxtw, 64-bit) 1->3 (with chain penalty) 4.089495 | |
LDR (register, sxtw, 64-bit) throughput 0.997869 | |
LDR (register, lsl, 32-bit) 1->2 (with chain penalty) 4.106239 | |
LDR (register, lsl, 32-bit) 1->3 (with chain penalty) 4.148664 | |
LDR (register, lsl, 32-bit) throughput 1.006492 | |
LDR (register, lsl, 64-bit) 1->2 (with chain penalty) 4.064046 | |
LDR (register, lsl, 64-bit) 1->3 (with chain penalty) 4.006514 | |
LDR (register, lsl, 64-bit) throughput 1.005486 | |
LDRB 1->2 (with chain penalty) 3.938439 | |
LDRB throughput 0.987648 | |
LDRB (post-index) 1->2 (with chain penalty) 4.391652 | |
LDRB (post-index) throughput 1.083751 | |
LDRB (pre-index) 1->2 (with chain penalty) 4.209724 | |
LDRB (pre-index) throughput 1.025403 | |
LDRB (unsigned offset) 1->2 (with chain penalty) 4.067285 | |
LDRB (unsigned offset) throughput 0.841919 | |
LDRB (register) 1->2 (with chain penalty) 4.078150 | |
LDRB (register) 1->3 (with chain penalty) 4.075915 | |
LDRB (register) throughput 1.010469 | |
LDRB (register, uxtw) 1->2 (with chain penalty) 4.121407 | |
LDRB (register, uxtw) 1->3 (with chain penalty) 3.981325 | |
LDRB (register, uxtw) throughput 1.003415 | |
LDRB (register, sxtw) 1->2 (with chain penalty) 4.057791 | |
LDRB (register, sxtw) 1->3 (with chain penalty) 4.060007 | |
LDRB (register, sxtw) throughput 1.027547 | |
LDRH 1->2 (with chain penalty) 4.032823 | |
LDRH throughput 1.013230 | |
LDRH (post-index) 1->2 (with chain penalty) 4.189693 | |
LDRH (post-index) throughput 1.085782 | |
LDRH (pre-index) 1->2 (with chain penalty) 4.286687 | |
LDRH (pre-index) throughput 1.023389 | |
LDRH (unsigned offset) 1->2 (with chain penalty) 4.001576 | |
LDRH (unsigned offset) throughput 1.014868 | |
LDRH (register) 1->2 (with chain penalty) 4.143153 | |
LDRH (register) 1->3 (with chain penalty) 3.881328 | |
LDRH (register) throughput 1.005329 | |
LDRH (register, uxtw) 1->2 (with chain penalty) 4.039911 | |
LDRH (register, uxtw) 1->3 (with chain penalty) 4.044468 | |
LDRH (register, uxtw) throughput 1.005168 | |
LDRH (register, sxtw) 1->2 (with chain penalty) 4.116180 | |
LDRH (register, sxtw) 1->3 (with chain penalty) 4.128620 | |
LDRH (register, sxtw) throughput 1.014622 | |
LDRH (register, lsl) 1->2 (with chain penalty) 5.110571 | |
LDRH (register, lsl) 1->3 (with chain penalty) 4.053479 | |
LDRH (register, lsl) throughput 0.992300 | |
LDRSB (32-bit) 1->2 (with chain penalty) 4.060524 | |
LDRSB (32-bit) throughput 1.005136 | |
LDRSB (64-bit) 1->2 (with chain penalty) 4.040987 | |
LDRSB (64-bit) throughput 0.997351 | |
LDRSB (post-index, 32-bit) 1->2 (with chain penalty) 4.304442 | |
LDRSB (post-index, 32-bit) throughput 1.098438 | |
LDRSB (post-index, 64-bit) 1->2 (with chain penalty) 4.300424 | |
LDRSB (post-index, 64-bit) throughput 1.076438 | |
LDRSB (pre-index, 32-bit) 1->2 (with chain penalty) 4.476465 | |
LDRSB (pre-index, 32-bit) throughput 0.988187 | |
LDRSB (pre-index, 64-bit) 1->2 (with chain penalty) 4.214619 | |
LDRSB (pre-index, 64-bit) throughput 1.060600 | |
LDRSB (unsigned offset, 32-bit) 1->2 (with chain penalty) 4.021815 | |
LDRSB (unsigned offset, 32-bit) throughput 1.003236 | |
LDRSB (unsigned offset, 64-bit) 1->2 (with chain penalty) 4.058586 | |
LDRSB (unsigned offset, 64-bit) throughput 1.000011 | |
LDRSB (register, 32-bit) 1->2 (with chain penalty) 4.059067 | |
LDRSB (register, 32-bit) 1->3 (with chain penalty) 4.025416 | |
LDRSB (register, 32-bit) throughput 0.971683 | |
LDRSB (register, 64-bit) 1->2 (with chain penalty) 4.452249 | |
LDRSB (register, 64-bit) 1->3 (with chain penalty) 3.696003 | |
LDRSB (register, 64-bit) throughput 1.003044 | |
LDRSB (register, uxtw, 32-bit) 1->2 (with chain penalty) 4.017409 | |
LDRSB (register, uxtw, 32-bit) 1->3 (with chain penalty) 4.109331 | |
LDRSB (register, uxtw, 32-bit) throughput 1.005799 | |
LDRSB (register, uxtw, 64-bit) 1->2 (with chain penalty) 4.065447 | |
LDRSB (register, uxtw, 64-bit) 1->3 (with chain penalty) 3.746108 | |
LDRSB (register, uxtw, 64-bit) throughput 0.994660 | |
LDRSB (register, sxtw, 32-bit) 1->2 (with chain penalty) 3.869439 | |
LDRSB (register, sxtw, 32-bit) 1->3 (with chain penalty) 4.098361 | |
LDRSB (register, sxtw, 32-bit) throughput 1.022140 | |
LDRSB (register, sxtw, 64-bit) 1->2 (with chain penalty) 4.108026 | |
LDRSB (register, sxtw, 64-bit) 1->3 (with chain penalty) 3.999254 | |
LDRSB (register, sxtw, 64-bit) throughput 0.978809 | |
LDRSH (32-bit) 1->2 (with chain penalty) 4.041448 | |
LDRSH (32-bit) throughput 1.008837 | |
LDRSH (64-bit) 1->2 (with chain penalty) 4.055635 | |
LDRSH (64-bit) throughput 1.008577 | |
LDRSH (post-index, 32-bit) 1->2 (with chain penalty) 4.134653 | |
LDRSH (post-index, 32-bit) throughput 1.087029 | |
LDRSH (post-index, 64-bit) 1->2 (with chain penalty) 4.317774 | |
LDRSH (post-index, 64-bit) throughput 1.091839 | |
LDRSH (pre-index, 32-bit) 1->2 (with chain penalty) 4.128648 | |
LDRSH (pre-index, 32-bit) throughput 1.087311 | |
LDRSH (pre-index, 64-bit) 1->2 (with chain penalty) 4.047829 | |
LDRSH (pre-index, 64-bit) throughput 1.049689 | |
LDRSH (unsigned offset, 32-bit) 1->2 (with chain penalty) 4.038525 | |
LDRSH (unsigned offset, 32-bit) throughput 1.000895 | |
LDRSH (unsigned offset, 64-bit) 1->2 (with chain penalty) 4.063464 | |
LDRSH (unsigned offset, 64-bit) throughput 1.011666 | |
LDRSH (register, 32-bit) 1->2 (with chain penalty) 4.034206 | |
LDRSH (register, 32-bit) 1->3 (with chain penalty) 4.130790 | |
LDRSH (register, 32-bit) throughput 0.991864 | |
LDRSH (register, 64-bit) 1->2 (with chain penalty) 4.729945 | |
LDRSH (register, 64-bit) 1->3 (with chain penalty) 4.023932 | |
LDRSH (register, 64-bit) throughput 1.007128 | |
LDRSH (register, uxtw, 32-bit) 1->2 (with chain penalty) 4.111685 | |
LDRSH (register, uxtw, 32-bit) 1->3 (with chain penalty) 4.121450 | |
LDRSH (register, uxtw, 32-bit) throughput 1.010207 | |
LDRSH (register, uxtw, 64-bit) 1->2 (with chain penalty) 4.103386 | |
LDRSH (register, uxtw, 64-bit) 1->3 (with chain penalty) 3.974509 | |
LDRSH (register, uxtw, 64-bit) throughput 1.007329 | |
LDRSH (register, sxtw, 32-bit) 1->2 (with chain penalty) 4.028790 | |
LDRSH (register, sxtw, 32-bit) 1->3 (with chain penalty) 3.934001 | |
LDRSH (register, sxtw, 32-bit) throughput 1.013163 | |
LDRSH (register, sxtw, 64-bit) 1->2 (with chain penalty) 4.005696 | |
LDRSH (register, sxtw, 64-bit) 1->3 (with chain penalty) 3.831607 | |
LDRSH (register, sxtw, 64-bit) throughput 1.009420 | |
LDRSH (register, lsl, 32-bit) 1->2 (with chain penalty) 3.819916 | |
LDRSH (register, lsl, 32-bit) 1->3 (with chain penalty) 4.060493 | |
LDRSH (register, lsl, 32-bit) throughput 0.995939 | |
LDRSH (register, lsl, 64-bit) 1->2 (with chain penalty) 4.039633 | |
LDRSH (register, lsl, 64-bit) 1->3 (with chain penalty) 4.054429 | |
LDRSH (register, lsl, 64-bit) throughput 1.010285 | |
LDRSW 1->2 (with chain penalty) 4.056849 | |
LDRSW throughput 1.004890 | |
LDRSW (post-index) 1->2 (with chain penalty) 4.221380 | |
LDRSW (post-index) throughput 1.043827 | |
LDRSW (pre-index) 1->2 (with chain penalty) 4.272394 | |
LDRSW (pre-index) throughput 1.101287 | |
LDRSW (unsigned offset) 1->2 (with chain penalty) 3.883019 | |
LDRSW (unsigned offset) throughput 1.014983 | |
LDRSW (literal) throughput 1.009622 | |
LDRSW (register) 1->2 (with chain penalty) 3.981519 | |
LDRSW (register) 1->3 (with chain penalty) 4.068463 | |
LDRSW (register) throughput 0.965625 | |
LDRSW (register, uxtw) 1->2 (with chain penalty) 4.106054 | |
LDRSW (register, uxtw) 1->3 (with chain penalty) 3.942762 | |
LDRSW (register, uxtw) throughput 1.000514 | |
LDRSW (register, sxtw) 1->2 (with chain penalty) 3.810174 | |
LDRSW (register, sxtw) 1->3 (with chain penalty) 4.046573 | |
LDRSW (register, sxtw) throughput 1.005332 | |
LDRSW (register, lsl) 1->2 (with chain penalty) 3.953119 | |
LDRSW (register, lsl) 1->3 (with chain penalty) 4.076571 | |
LDRSW (register, lsl) throughput 0.995241 | |
LDUR (32-bit) 1->2 (with chain penalty) 4.104774 | |
LDUR (32-bit) throughput 1.001598 | |
LDUR (64-bit) 1->2 (with chain penalty) 4.080134 | |
LDUR (64-bit) throughput 1.009448 | |
LDURB 1->2 (with chain penalty) 4.139776 | |
LDURB throughput 1.015552 | |
LDURH 1->2 (with chain penalty) 4.025947 | |
LDURH throughput 1.018075 | |
LDURSB (32-bit) 1->2 (with chain penalty) 4.085610 | |
LDURSB (32-bit) throughput 1.006289 | |
LDURSB (64-bit) 1->2 (with chain penalty) 4.080201 | |
LDURSB (64-bit) throughput 1.007633 | |
LDURSH (32-bit) 1->2 (with chain penalty) 4.068606 | |
LDURSH (32-bit) throughput 1.004755 | |
LDURSH (64-bit) 1->2 (with chain penalty) 4.048513 | |
LDURSH (64-bit) throughput 0.991765 | |
LDURSW 1->2 (with chain penalty) 4.019505 | |
LDURSW throughput 0.979031 | |
LSL (immediate, 32-bit) 1->2 0.997302 | |
LSL (immediate, 32-bit) throughput 0.503338 | |
LSL (immediate, 64-bit) 1->2 1.015301 | |
LSL (immediate, 64-bit) throughput 0.503262 | |
LSL (register, 32-bit) 1->2 1.022436 | |
LSL (register, 32-bit) 1->3 1.016407 | |
LSL (register, 32-bit) throughput 0.499357 | |
LSL (register, 64-bit) 1->2 0.989473 | |
LSL (register, 64-bit) 1->3 1.030173 | |
LSL (register, 64-bit) throughput 0.499789 | |
LSR (immediate, 32-bit) 1->2 0.988585 | |
LSR (immediate, 32-bit) throughput 0.501864 | |
LSR (immediate, 64-bit) 1->2 1.002246 | |
LSR (immediate, 64-bit) throughput 0.511032 | |
LSR (register, 32-bit) 1->2 1.002903 | |
LSR (register, 32-bit) 1->3 0.998532 | |
LSR (register, 32-bit) throughput 0.501050 | |
LSR (register, 64-bit) 1->2 1.020321 | |
LSR (register, 64-bit) 1->3 0.998730 | |
LSR (register, 64-bit) throughput 0.501147 | |
MADD (32-bit) 1->2 3.067119 | |
MADD (32-bit) 1->3 3.002686 | |
MADD (32-bit) 1->4 1.000504 | |
MADD (32-bit) throughput 1.007380 | |
MADD (64-bit) 1->2 4.102906 | |
MADD (64-bit) 1->3 4.026845 | |
MADD (64-bit) 1->4 1.994087 | |
MADD (64-bit) throughput 2.009217 | |
MNEG (32-bit) 1->2 2.981975 | |
MNEG (32-bit) 1->3 2.990850 | |
MNEG (32-bit) throughput 1.009844 | |
MNEG (64-bit) 1->2 4.170855 | |
MNEG (64-bit) 1->3 3.963927 | |
MNEG (64-bit) throughput 2.042000 | |
MOV (bitmask immediate, 32-bit) throughput 0.503976 | |
MOV (bitmask immediate, 64-bit) throughput 0.512462 | |
MOV (from sp, 32-bit) throughput 0.518603 | |
MOV (from sp, 64-bit) throughput 0.336146 | |
MOVK (32-bit) 1->1 1.003659 | |
MOVK (32-bit) throughput 0.511871 | |
MOVK (64-bit) 1->1 0.996474 | |
MOVK (64-bit) throughput 0.498991 | |
MOVN (32-bit) throughput 0.504761 | |
MOVN (64-bit) throughput 0.500471 | |
MOVZ (32-bit) throughput 0.501836 | |
MOVZ (64-bit) throughput 0.502475 | |
MRS (CNTFRQ_EL0) throughput 15.925776 | |
MRS (CNTPCT_EL0) throughput 15.943505 | |
MRS (CNTVCT_EL0) throughput 16.165839 | |
MRS (DCZID_EL0) throughput 1.269430 | |
MRS (FPCR) throughput 1.257801 | |
MRS (FPSR) throughput 7.062337 | |
MRS (NZCV) throughput 0.507324 | |
MRS (TPIDRRO_EL0) throughput 1.256763 | |
MRS (TPIDR_EL0) throughput 1.287417 | |
MSR (FPCR) throughput 8.550029 | |
MSR (FPSR) throughput 8.252713 | |
MSR (TPIDR_EL0) throughput 10.792015 | |
MSR (NZCV) throughput 0.713828 | |
MSUB (32-bit) 1->2 2.960646 | |
MSUB (32-bit) 1->3 2.967032 | |
MSUB (32-bit) 1->4 1.010344 | |
MSUB (32-bit) throughput 1.009293 | |
MSUB (64-bit) 1->2 4.047751 | |
MSUB (64-bit) 1->3 4.025354 | |
MSUB (64-bit) 1->4 1.992074 | |
MSUB (64-bit) throughput 2.034131 | |
MUL (32-bit) 1->2 2.976164 | |
MUL (32-bit) 1->3 3.076357 | |
MUL (32-bit) throughput 1.002403 | |
MUL (64-bit) 1->2 4.051503 | |
MUL (64-bit) 1->3 4.064561 | |
MUL (64-bit) throughput 2.042069 | |
MVN (register, 32-bit) 1->2 0.987009 | |
MVN (register, 32-bit) throughput 0.508398 | |
MVN (register, 64-bit) 1->2 1.001342 | |
MVN (register, 64-bit) throughput 0.495858 | |
MVN (register, lsl, 32-bit) 1->2 1.994074 | |
MVN (register, lsl, 32-bit) throughput 1.009698 | |
MVN (register, lsl, 64-bit) 1->2 2.016489 | |
MVN (register, lsl, 64-bit) throughput 1.014539 | |
MVN (register, lsr, 32-bit) 1->2 2.004031 | |
MVN (register, lsr, 32-bit) throughput 0.882336 | |
MVN (register, lsr, 64-bit) 1->2 1.988706 | |
MVN (register, lsr, 64-bit) throughput 0.997821 | |
MVN (register, asr, 32-bit) 1->2 1.931022 | |
MVN (register, asr, 32-bit) throughput 1.010703 | |
MVN (register, asr, 64-bit) 1->2 2.098831 | |
MVN (register, asr, 64-bit) throughput 0.976565 | |
MVN (register, ror, 32-bit) 1->2 2.024842 | |
MVN (register, ror, 32-bit) throughput 0.972185 | |
MVN (register, ror, 64-bit) 1->2 2.001163 | |
MVN (register, ror, 64-bit) throughput 1.019052 | |
NEG (register, 32-bit) 1->2 0.996874 | |
NEG (register, 32-bit) throughput 0.504799 | |
NEG (register, 64-bit) 1->2 1.009016 | |
NEG (register, 64-bit) throughput 0.494766 | |
NEG (register, lsl, 32-bit) 1->2 2.002033 | |
NEG (register, lsl, 32-bit) throughput 1.010907 | |
NEG (register, lsl, 64-bit) 1->2 2.006840 | |
NEG (register, lsl, 64-bit) throughput 0.997460 | |
NEG (register, lsr, 32-bit) 1->2 1.994174 | |
NEG (register, lsr, 32-bit) throughput 0.998729 | |
NEG (register, lsr, 64-bit) 1->2 1.972403 | |
NEG (register, lsr, 64-bit) throughput 1.010143 | |
NEG (register, asr, 32-bit) 1->2 2.019339 | |
NEG (register, asr, 32-bit) throughput 1.011852 | |
NEG (register, asr, 64-bit) 1->2 2.009973 | |
NEG (register, asr, 64-bit) throughput 0.989676 | |
NEGS (register, 32-bit) 1->2 1.022263 | |
NEGS (register, 32-bit) 3->2 1.059463 | |
NEGS (register, 32-bit) throughput 0.694140 | |
NEGS (register, 64-bit) 1->2 0.990512 | |
NEGS (register, 64-bit) 3->2 1.015714 | |
NEGS (register, 64-bit) throughput 0.704142 | |
NEGS (register, lsl, 32-bit) 1->2 1.999871 | |
NEGS (register, lsl, 32-bit) 3->2 1.928226 | |
NEGS (register, lsl, 32-bit) throughput 0.996164 | |
NEGS (register, lsl, 64-bit) 1->2 1.993877 | |
NEGS (register, lsl, 64-bit) 3->2 2.017083 | |
NEGS (register, lsl, 64-bit) throughput 1.010918 | |
NEGS (register, lsr, 32-bit) 1->2 2.038549 | |
NEGS (register, lsr, 32-bit) 3->2 2.018510 | |
NEGS (register, lsr, 32-bit) throughput 1.013227 | |
NEGS (register, lsr, 64-bit) 1->2 2.029009 | |
NEGS (register, lsr, 64-bit) 3->2 1.980736 | |
NEGS (register, lsr, 64-bit) throughput 1.010742 | |
NEGS (register, asr, 32-bit) 1->2 2.007103 | |
NEGS (register, asr, 32-bit) 3->2 1.998093 | |
NEGS (register, asr, 32-bit) throughput 1.005952 | |
NEGS (register, asr, 64-bit) 1->2 2.059078 | |
NEGS (register, asr, 64-bit) 3->2 2.073004 | |
NEGS (register, asr, 64-bit) throughput 0.978139 | |
NGC (register, 32-bit) 1->2 0.997031 | |
NGC (register, 32-bit) 1->3 1.034986 | |
NGC (register, 32-bit) throughput 0.503635 | |
NGC (register, 64-bit) 1->2 0.984671 | |
NGC (register, 64-bit) 1->3 1.037006 | |
NGC (register, 64-bit) throughput 0.468107 | |
NGCS (register, 32-bit) 1->2 1.101673 | |
NGCS (register, 32-bit) 1->3 1.047002 | |
NGCS (register, 32-bit) 3->2 0.962422 | |
NGCS (register, 32-bit) 3->3 1.019873 | |
NGCS (register, 32-bit) throughput 1.324228 | |
NGCS (register, 32-bit) throughput 1.215830 | |
NGCS (register, 64-bit) 1->2 0.976148 | |
NGCS (register, 64-bit) 1->3 1.023024 | |
NGCS (register, 64-bit) 3->2 1.019287 | |
NGCS (register, 64-bit) 3->3 0.994054 | |
NGCS (register, 64-bit) throughput 1.400749 | |
NGCS (register, 64-bit) throughput 1.206793 | |
NOP throughput 0.344755 | |
ORN (register, 32-bit) 1->2 0.988865 | |
ORN (register, 32-bit) 1->3 0.996737 | |
ORN (register, 32-bit) throughput 0.502343 | |
ORN (register, 64-bit) 1->2 1.004090 | |
ORN (register, 64-bit) 1->3 1.002838 | |
ORN (register, 64-bit) throughput 0.507296 | |
ORN (register, lsl, 32-bit) 1->2 2.002500 | |
ORN (register, lsl, 32-bit) 1->3 2.031274 | |
ORN (register, lsl, 32-bit) throughput 1.003409 | |
ORN (register, lsl, 64-bit) 1->2 2.000262 | |
ORN (register, lsl, 64-bit) 1->3 2.058051 | |
ORN (register, lsl, 64-bit) throughput 1.006899 | |
ORN (register, lsr, 32-bit) 1->2 2.031945 | |
ORN (register, lsr, 32-bit) 1->3 2.011339 | |
ORN (register, lsr, 32-bit) throughput 1.009045 | |
ORN (register, lsr, 64-bit) 1->2 1.969180 | |
ORN (register, lsr, 64-bit) 1->3 2.058440 | |
ORN (register, lsr, 64-bit) throughput 1.012134 | |
ORN (register, asr, 32-bit) 1->2 2.044316 | |
ORN (register, asr, 32-bit) 1->3 2.058993 | |
ORN (register, asr, 32-bit) throughput 0.999094 | |
ORN (register, asr, 64-bit) 1->2 2.049511 | |
ORN (register, asr, 64-bit) 1->3 2.000569 | |
ORN (register, asr, 64-bit) throughput 1.003979 | |
ORN (register, ror, 32-bit) 1->2 2.028455 | |
ORN (register, ror, 32-bit) 1->3 1.970518 | |
ORN (register, ror, 32-bit) throughput 0.997469 | |
ORN (register, ror, 64-bit) 1->2 1.998223 | |
ORN (register, ror, 64-bit) 1->3 1.994437 | |
ORN (register, ror, 64-bit) throughput 1.008887 | |
ORR (immediate, 32-bit) 1->2 0.997324 | |
ORR (immediate, 32-bit) throughput 0.506800 | |
ORR (immediate, 64-bit) 1->2 1.006224 | |
ORR (immediate, 64-bit) throughput 0.501054 | |
ORR (register, 32-bit) 1->2 0.997120 | |
ORR (register, 32-bit) 1->3 1.010859 | |
ORR (register, 32-bit) throughput 0.502856 | |
ORR (register, 64-bit) 1->2 0.998905 | |
ORR (register, 64-bit) 1->3 1.007217 | |
ORR (register, 64-bit) throughput 0.495476 | |
ORR (register, lsl, 32-bit) 1->2 2.110541 | |
ORR (register, lsl, 32-bit) 1->3 2.009559 | |
ORR (register, lsl, 32-bit) throughput 0.961392 | |
ORR (register, lsl, 64-bit) 1->2 1.998378 | |
ORR (register, lsl, 64-bit) 1->3 2.036976 | |
ORR (register, lsl, 64-bit) throughput 1.005420 | |
ORR (register, lsr, 32-bit) 1->2 2.000473 | |
ORR (register, lsr, 32-bit) 1->3 2.005124 | |
ORR (register, lsr, 32-bit) throughput 0.988392 | |
ORR (register, lsr, 64-bit) 1->2 2.018910 | |
ORR (register, lsr, 64-bit) 1->3 2.030744 | |
ORR (register, lsr, 64-bit) throughput 1.004932 | |
ORR (register, asr, 32-bit) 1->2 2.013788 | |
ORR (register, asr, 32-bit) 1->3 2.001206 | |
ORR (register, asr, 32-bit) throughput 1.014340 | |
ORR (register, asr, 64-bit) 1->2 2.032318 | |
ORR (register, asr, 64-bit) 1->3 2.000304 | |
ORR (register, asr, 64-bit) throughput 1.013947 | |
ORR (register, ror, 32-bit) 1->2 2.002750 | |
ORR (register, ror, 32-bit) 1->3 2.008731 | |
ORR (register, ror, 32-bit) throughput 1.007724 | |
ORR (register, ror, 64-bit) 1->2 2.033155 | |
ORR (register, ror, 64-bit) 1->3 1.998070 | |
ORR (register, ror, 64-bit) throughput 1.007996 | |
PRFM (register, PLDL1KEEP) throughput 3.986847 | |
PRFM (register, PLDL1KEEP) throughput 1.007763 | |
PRFM (register, PLDL1STRM) throughput 4.597682 | |
PRFM (register, PLDL1STRM) throughput 1.002881 | |
PRFM (register, PLDL2KEEP) throughput 3.515471 | |
PRFM (register, PLDL2KEEP) throughput 2.174587 | |
PRFM (register, PLDL2STRM) throughput 3.485936 | |
PRFM (register, PLDL2STRM) throughput 2.146856 | |
PRFM (register, PLDL3KEEP) throughput 3.335165 | |
PRFM (register, PLDL3KEEP) throughput 2.023664 | |
PRFM (register, PLDL3STRM) throughput 3.290304 | |
PRFM (register, PLDL3STRM) throughput 2.009286 | |
PRFM (register, PLIL1KEEP) throughput 3.515846 | |
PRFM (register, PLIL1KEEP) throughput 2.220636 | |
PRFM (register, PLIL1STRM) throughput 3.974945 | |
PRFM (register, PLIL1STRM) throughput 2.164438 | |
PRFM (register, PLIL2KEEP) throughput 3.601024 | |
PRFM (register, PLIL2KEEP) throughput 2.111450 | |
PRFM (register, PLIL2STRM) throughput 3.479894 | |
PRFM (register, PLIL2STRM) throughput 2.165973 | |
PRFM (register, PLIL3KEEP) throughput 3.336266 | |
PRFM (register, PLIL3KEEP) throughput 2.149180 | |
PRFM (register, PLIL3STRM) throughput 3.368316 | |
PRFM (register, PLIL3STRM) throughput 2.156700 | |
PRFM (register, PSTL1KEEP) throughput 3.943530 | |
PRFM (register, PSTL1KEEP) throughput 1.004560 | |
PRFM (register, PSTL1STRM) throughput 4.335877 | |
PRFM (register, PSTL1STRM) throughput 1.025412 | |
PRFM (register, PSTL2KEEP) throughput 3.498117 | |
PRFM (register, PSTL2KEEP) throughput 2.236191 | |
PRFM (register, PSTL2STRM) throughput 3.493368 | |
PRFM (register, PSTL2STRM) throughput 2.175155 | |
PRFM (register, PSTL3KEEP) throughput 3.400666 | |
PRFM (register, PSTL3KEEP) throughput 2.021109 | |
PRFM (register, PSTL3STRM) throughput 3.370850 | |
PRFM (register, PSTL3STRM) throughput 2.044863 | |
PSSBB throughput 17.742926 | |
RBIT (32-bit) 1->2 1.001773 | |
RBIT (32-bit) throughput 0.491451 | |
RBIT (64-bit) 1->2 0.995512 | |
RBIT (64-bit) throughput 0.499946 | |
REV (32-bit) 1->2 1.007455 | |
REV (32-bit) throughput 0.500317 | |
REV (64-bit) 1->2 0.998839 | |
REV (64-bit) throughput 0.504681 | |
REV16 (32-bit) 1->2 1.012279 | |
REV16 (32-bit) throughput 0.506150 | |
REV16 (64-bit) 1->2 0.998773 | |
REV16 (64-bit) throughput 0.494626 | |
REV32 1->2 1.004292 | |
REV32 throughput 0.504542 | |
ROR (immediate, 32-bit) 1->2 1.001425 | |
ROR (immediate, 32-bit) throughput 0.505124 | |
ROR (immediate, 64-bit) 1->2 0.998751 | |
ROR (immediate, 64-bit) throughput 0.511169 | |
ROR (register, 32-bit) 1->2 1.002706 | |
ROR (register, 32-bit) 1->3 1.000129 | |
ROR (register, 32-bit) throughput 0.538705 | |
ROR (register, 64-bit) 1->2 1.000066 | |
ROR (register, 64-bit) 1->3 1.000416 | |
ROR (register, 64-bit) throughput 0.504569 | |
SBC (32-bit) 1->2 1.006299 | |
SBC (32-bit) 1->3 1.018122 | |
SBC (32-bit) 1->4 1.050022 | |
SBC (32-bit) throughput 0.505852 | |
SBC (64-bit) 1->2 0.995534 | |
SBC (64-bit) 1->3 1.041184 | |
SBC (64-bit) 1->4 0.999745 | |
SBC (64-bit) throughput 0.503993 | |
SBCS (32-bit) 1->2 0.994929 | |
SBCS (32-bit) 1->3 1.030072 | |
SBCS (32-bit) 1->4 0.999718 | |
SBCS (32-bit) 4->2 0.998038 | |
SBCS (32-bit) 4->3 1.002144 | |
SBCS (32-bit) 4->4 1.003968 | |
SBCS (32-bit) throughput 1.393591 | |
SBCS (32-bit) throughput 1.221082 | |
SBCS (64-bit) 1->2 0.996983 | |
SBCS (64-bit) 1->3 0.998774 | |
SBCS (64-bit) 1->4 1.018107 | |
SBCS (64-bit) 4->2 1.004970 | |
SBCS (64-bit) 4->3 1.027721 | |
SBCS (64-bit) 4->4 1.043388 | |
SBCS (64-bit) throughput 1.394769 | |
SBCS (64-bit) throughput 1.214808 | |
SBFIZ (32-bit) 1->2 0.997150 | |
SBFIZ (32-bit) throughput 0.501254 | |
SBFIZ (64-bit) 1->2 0.990069 | |
SBFIZ (64-bit) throughput 0.501864 | |
SBFX (32-bit) 1->2 0.991299 | |
SBFX (32-bit) throughput 0.500232 | |
SBFX (64-bit) 1->2 0.977471 | |
SBFX (64-bit) throughput 0.504415 | |
SDIV (fast, 32-bit) 1->2 7.091602 | |
SDIV (fast, 32-bit) 1->3 6.889388 | |
SDIV (fast, 32-bit) throughput 6.959454 | |
SDIV (slow, 32-bit) 1->2 20.155832 | |
SDIV (slow, 32-bit) 1->3 20.143558 | |
SDIV (slow, 32-bit) throughput 20.043998 | |
SDIV (slow, 32-bit) 1->2 20.093463 | |
SDIV (slow, 32-bit) 1->3 20.131564 | |
SDIV (slow, 32-bit) throughput 20.181549 | |
SDIV (fast, 64-bit) 1->2 7.041108 | |
SDIV (fast, 64-bit) 1->3 7.023381 | |
SDIV (fast, 64-bit) throughput 6.987216 | |
SDIV (medium, 64-bit) 1->2 19.394689 | |
SDIV (medium, 64-bit) 1->3 20.205211 | |
SDIV (medium, 64-bit) throughput 20.179404 | |
SDIV (medium, 64-bit) 1->2 20.178163 | |
SDIV (medium, 64-bit) 1->3 19.559730 | |
SDIV (medium, 64-bit) throughput 20.251444 | |
SDIV (slow, 64-bit) 1->2 36.004227 | |
SDIV (slow, 64-bit) 1->3 36.812225 | |
SDIV (slow, 64-bit) throughput 35.062594 | |
SDIV (slow, 64-bit) 1->2 36.267698 | |
SDIV (slow, 64-bit) 1->3 35.982284 | |
SDIV (slow, 64-bit) throughput 35.348887 | |
SMADDL 1->2 3.012322 | |
SMADDL 1->3 2.984304 | |
SMADDL 1->4 0.992845 | |
SMADDL throughput 1.011557 | |
SMNEGL 1->2 3.051580 | |
SMNEGL 1->3 3.008761 | |
SMNEGL throughput 1.002101 | |
SMSUBL 1->2 3.052942 | |
SMSUBL 1->3 3.095829 | |
SMSUBL 1->4 0.997583 | |
SMSUBL throughput 0.988684 | |
SMULH 1->2 3.981830 | |
SMULH 1->3 4.058964 | |
SMULH throughput 1.989850 | |
SMULL 1->2 3.031260 | |
SMULL 1->3 2.987869 | |
SMULL throughput 1.000167 | |
SSBB throughput 17.954255 | |
STLRB throughput 4.993618 | |
STLRH throughput 5.018081 | |
STLXP (32-bit) throughput 3.265198 | |
STLXP (32-bit) throughput 2.983491 | |
STLXP (64-bit) throughput 3.756547 | |
STLXP (64-bit) throughput 3.002096 | |
STLXR (32-bit) throughput 3.357687 | |
STLXR (32-bit) throughput 3.032819 | |
STLXR (64-bit) throughput 3.543574 | |
STLXR (64-bit) throughput 2.996474 | |
STLXRB throughput 3.030090 | |
STLXRB throughput 2.969336 | |
STLXRH throughput 3.067380 | |
STLXRH throughput 3.017659 | |
STNP (32-bit) throughput 1.623523 | |
STNP (32-bit) throughput 1.629658 | |
STNP (64-bit) throughput 1.654872 | |
STNP (64-bit) throughput 1.605381 | |
STP (32-bit) throughput 1.301634 | |
STP (64-bit) throughput 1.310045 | |
STP (post-index, 32-bit) 3->3 1.325378 | |
STP (post-index, 32-bit) throughput 1.312650 | |
STP (post-index, 64-bit) 3->3 1.341117 | |
STP (post-index, 64-bit) throughput 1.131411 | |
STP (pre-index, 32-bit) 3->3 1.307511 | |
STP (pre-index, 32-bit) throughput 1.343719 | |
STP (pre-index, 64-bit) 3->3 1.351292 | |
STP (pre-index, 64-bit) throughput 1.360616 | |
STP (signed offset, 32-bit) throughput 1.303443 | |
STP (signed offset, 64-bit) throughput 1.309789 | |
STR (32-bit) throughput 1.317332 | |
STR (64-bit) throughput 1.304332 | |
STR (post-index, 32-bit) 2->2 1.317997 | |
STR (post-index, 32-bit) throughput 1.389737 | |
STR (post-index, 64-bit) 2->2 1.296906 | |
STR (post-index, 64-bit) throughput 1.313127 | |
STR (pre-index, 32-bit) 2->2 1.326055 | |
STR (pre-index, 32-bit) throughput 1.329539 | |
STR (pre-index, 64-bit) 2->2 1.316946 | |
STR (pre-index, 64-bit) throughput 1.325778 | |
STR (unsigned offset, 32-bit) throughput 1.299059 | |
STR (unsigned offset, 64-bit) throughput 1.302797 | |
STR (register, 32-bit) throughput 1.309067 | |
STR (register, 64-bit) throughput 1.299706 | |
STR (register, uxtw, 32-bit) throughput 1.297272 | |
STR (register, uxtw, 64-bit) throughput 1.307501 | |
STR (register, sxtw, 32-bit) throughput 1.315486 | |
STR (register, sxtw, 64-bit) throughput 1.314574 | |
STR (register, lsl, 32-bit) throughput 1.309826 | |
STR (register, lsl, 64-bit) throughput 1.247463 | |
STRB throughput 1.305440 | |
STRB (post-index) 2->2 1.317801 | |
STRB (post-index) throughput 1.313126 | |
STRB (pre-index) 2->2 1.308737 | |
STRB (pre-index) throughput 1.332777 | |
STRB (unsigned offset) throughput 1.318997 | |
STRB (register) throughput 1.296288 | |
STRB (register, uxtw) throughput 1.303047 | |
STRB (register, sxtw) throughput 1.293752 | |
STRH throughput 1.311604 | |
STRH (post-index) 2->2 1.322000 | |
STRH (post-index) throughput 1.334765 | |
STRH (pre-index) 2->2 1.301857 | |
STRH (pre-index) throughput 1.335888 | |
STRH (unsigned offset) throughput 1.301120 | |
STRH (register) throughput 1.308421 | |
STRH (register, uxtw) throughput 1.297228 | |
STRH (register, sxtw) throughput 1.316197 | |
STUR (32-bit) throughput 1.311619 | |
STUR (64-bit) throughput 1.315147 | |
STURB throughput 1.299996 | |
STURH throughput 1.295586 | |
STXP (64-bit) throughput 2.968561 | |
STXP (64-bit) throughput 2.988119 | |
STXR (32-bit) throughput 2.316949 | |
STXR (32-bit) throughput 3.051856 | |
STXR (64-bit) throughput 2.523231 | |
STXR (64-bit) throughput 2.958454 | |
STXRB throughput 2.154190 | |
STXRB throughput 3.026010 | |
STXRH throughput 2.061933 | |
STXRH throughput 3.037008 | |
SUB (sxtb, 32-bit) 1->2 2.028968 | |
SUB (sxtb, 32-bit) 1->3 2.022052 | |
SUB (sxtb, 32-bit) throughput 1.003985 | |
SUB (sxtb, 64-bit) 1->2 2.062456 | |
SUB (sxtb, 64-bit) 1->3 2.007082 | |
SUB (sxtb, 64-bit) throughput 1.008219 | |
SUB (uxtb, 32-bit) 1->2 2.013572 | |
SUB (uxtb, 32-bit) 1->3 1.998071 | |
SUB (uxtb, 32-bit) throughput 1.011570 | |
SUB (uxtb, 64-bit) 1->2 2.020628 | |
SUB (uxtb, 64-bit) 1->3 2.002365 | |
SUB (uxtb, 64-bit) throughput 1.010586 | |
SUB (sxth, 32-bit) 1->2 1.908815 | |
SUB (sxth, 32-bit) 1->3 2.068727 | |
SUB (sxth, 32-bit) throughput 1.008354 | |
SUB (sxth, 64-bit) 1->2 2.015370 | |
SUB (sxth, 64-bit) 1->3 2.016440 | |
SUB (sxth, 64-bit) throughput 1.012354 | |
SUB (uxth, 32-bit) 1->2 2.011405 | |
SUB (uxth, 32-bit) 1->3 2.000417 | |
SUB (uxth, 32-bit) throughput 1.014511 | |
SUB (uxth, 64-bit) 1->2 1.998246 | |
SUB (uxth, 64-bit) 1->3 2.018493 | |
SUB (uxth, 64-bit) throughput 1.004645 | |
SUB (sxtw, 32-bit) 1->2 0.995900 | |
SUB (sxtw, 32-bit) 1->3 0.985586 | |
SUB (sxtw, 32-bit) throughput 0.507496 | |
SUB (sxtw, 64-bit) 1->2 1.997109 | |
SUB (sxtw, 64-bit) 1->3 2.082160 | |
SUB (sxtw, 64-bit) throughput 0.995020 | |
SUB (uxtw, 32-bit) 1->2 1.225960 | |
SUB (uxtw, 32-bit) 1->3 1.003079 | |
SUB (uxtw, 32-bit) throughput 0.500851 | |
SUB (uxtw, 64-bit) 1->2 2.028259 | |
SUB (uxtw, 64-bit) 1->3 2.016029 | |
SUB (uxtw, 64-bit) throughput 1.016495 | |
SUB (uxtx, 64-bit) 1->2 0.998970 | |
SUB (uxtx, 64-bit) 1->3 0.994604 | |
SUB (uxtx, 64-bit) throughput 0.504904 | |
SUB (sxtx, 64-bit) 1->2 1.003541 | |
SUB (sxtx, 64-bit) 1->3 1.050796 | |
SUB (sxtx, 64-bit) throughput 0.509152 | |
SUB (immediate, 32-bit) 1->2 0.984442 | |
SUB (immediate, 32-bit) throughput 0.500609 | |
SUB (immediate, 64-bit) 1->2 0.986519 | |
SUB (immediate, 64-bit) throughput 0.513428 | |
SUB (shifted immediate, 32-bit) 1->2 1.021567 | |
SUB (shifted immediate, 32-bit) throughput 0.505276 | |
SUB (shifted immediate, 64-bit) 1->2 1.008930 | |
SUB (shifted immediate, 64-bit) throughput 0.505390 | |
SUB (register, 32-bit) 1->2 0.986456 | |
SUB (register, 32-bit) 1->3 1.000613 | |
SUB (register, 32-bit) throughput 0.499673 | |
SUB (register, 64-bit) 1->2 0.994728 | |
SUB (register, 64-bit) 1->3 0.995186 | |
SUB (register, 64-bit) throughput 0.549387 | |
SUB (register, lsl, 32-bit) 1->2 2.007060 | |
SUB (register, lsl, 32-bit) 1->3 2.007201 | |
SUB (register, lsl, 32-bit) throughput 1.011864 | |
SUB (register, lsl, 64-bit) 1->2 2.004065 | |
SUB (register, lsl, 64-bit) 1->3 2.015465 | |
SUB (register, lsl, 64-bit) throughput 0.993700 | |
SUB (register, lsr, 32-bit) 1->2 2.018297 | |
SUB (register, lsr, 32-bit) 1->3 2.014869 | |
SUB (register, lsr, 32-bit) throughput 1.015836 | |
SUB (register, lsr, 64-bit) 1->2 2.032766 | |
SUB (register, lsr, 64-bit) 1->3 1.995118 | |
SUB (register, lsr, 64-bit) throughput 1.006461 | |
SUB (register, asr, 32-bit) 1->2 2.045744 | |
SUB (register, asr, 32-bit) 1->3 1.994463 | |
SUB (register, asr, 32-bit) throughput 1.006441 | |
SUB (register, asr, 64-bit) 1->2 2.004473 | |
SUB (register, asr, 64-bit) 1->3 2.006773 | |
SUB (register, asr, 64-bit) throughput 1.023835 | |
SUBS (sxtb, 32-bit) 1->2 1.994482 | |
SUBS (sxtb, 32-bit) 1->3 1.941896 | |
SUBS (sxtb, 32-bit) 4->2 2.011105 | |
SUBS (sxtb, 32-bit) 4->3 2.020295 | |
SUBS (sxtb, 32-bit) throughput 1.011824 | |
SUBS (sxtb, 64-bit) 1->2 2.011528 | |
SUBS (sxtb, 64-bit) 1->3 2.015547 | |
SUBS (sxtb, 64-bit) 4->2 2.015461 | |
SUBS (sxtb, 64-bit) 4->3 2.025203 | |
SUBS (sxtb, 64-bit) throughput 1.018512 | |
SUBS (uxtb, 32-bit) 1->2 1.957742 | |
SUBS (uxtb, 32-bit) 1->3 2.007600 | |
SUBS (uxtb, 32-bit) 4->2 2.009017 | |
SUBS (uxtb, 32-bit) 4->3 2.029060 | |
SUBS (uxtb, 32-bit) throughput 0.993527 | |
SUBS (uxtb, 64-bit) 1->2 2.028840 | |
SUBS (uxtb, 64-bit) 1->3 2.001294 | |
SUBS (uxtb, 64-bit) 4->2 2.022637 | |
SUBS (uxtb, 64-bit) 4->3 2.181614 | |
SUBS (uxtb, 64-bit) throughput 1.011328 | |
SUBS (sxth, 32-bit) 1->2 2.001489 | |
SUBS (sxth, 32-bit) 1->3 2.029052 | |
SUBS (sxth, 32-bit) 4->2 2.022827 | |
SUBS (sxth, 32-bit) 4->3 2.132333 | |
SUBS (sxth, 32-bit) throughput 0.997502 | |
SUBS (sxth, 64-bit) 1->2 1.940230 | |
SUBS (sxth, 64-bit) 1->3 2.006950 | |
SUBS (sxth, 64-bit) 4->2 2.026225 | |
SUBS (sxth, 64-bit) 4->3 2.032140 | |
SUBS (sxth, 64-bit) throughput 1.006579 | |
SUBS (uxth, 32-bit) 1->2 2.024235 | |
SUBS (uxth, 32-bit) 1->3 1.983973 | |
SUBS (uxth, 32-bit) 4->2 2.039366 | |
SUBS (uxth, 32-bit) 4->3 2.031664 | |
SUBS (uxth, 32-bit) throughput 1.007411 | |
SUBS (uxth, 64-bit) 1->2 1.990545 | |
SUBS (uxth, 64-bit) 1->3 2.009302 | |
SUBS (uxth, 64-bit) 4->2 1.998488 | |
SUBS (uxth, 64-bit) 4->3 2.006288 | |
SUBS (uxth, 64-bit) throughput 1.011150 | |
SUBS (sxtw, 32-bit) 1->2 0.993160 | |
SUBS (sxtw, 32-bit) 1->3 0.987266 | |
SUBS (sxtw, 32-bit) 4->2 1.027440 | |
SUBS (sxtw, 32-bit) 4->3 1.010220 | |
SUBS (sxtw, 32-bit) throughput 0.692947 | |
SUBS (sxtw, 64-bit) 1->2 2.028740 | |
SUBS (sxtw, 64-bit) 1->3 2.016122 | |
SUBS (sxtw, 64-bit) 4->2 2.019401 | |
SUBS (sxtw, 64-bit) 4->3 1.971867 | |
SUBS (sxtw, 64-bit) throughput 1.014582 | |
SUBS (uxtw, 32-bit) 1->2 0.999246 | |
SUBS (uxtw, 32-bit) 1->3 0.991171 | |
SUBS (uxtw, 32-bit) 4->2 1.011213 | |
SUBS (uxtw, 32-bit) 4->3 0.998113 | |
SUBS (uxtw, 32-bit) throughput 0.695417 | |
SUBS (uxtw, 64-bit) 1->2 1.989032 | |
SUBS (uxtw, 64-bit) 1->3 1.995315 | |
SUBS (uxtw, 64-bit) 4->2 2.026973 | |
SUBS (uxtw, 64-bit) 4->3 2.033164 | |
SUBS (uxtw, 64-bit) throughput 1.000916 | |
SUBS (uxtx, 64-bit) 1->2 1.005220 | |
SUBS (uxtx, 64-bit) 1->3 1.003167 | |
SUBS (uxtx, 64-bit) 4->2 1.040271 | |
SUBS (uxtx, 64-bit) 4->3 1.063937 | |
SUBS (uxtx, 64-bit) throughput 0.748907 | |
SUBS (sxtx, 64-bit) 1->2 1.162885 | |
SUBS (sxtx, 64-bit) 1->3 0.932625 | |
SUBS (sxtx, 64-bit) 4->2 0.966853 | |
SUBS (sxtx, 64-bit) 4->3 1.027689 | |
SUBS (sxtx, 64-bit) throughput 0.715815 | |
SUBS (immediate, 32-bit) 1->2 0.990309 | |
SUBS (immediate, 32-bit) 3->2 1.017803 | |
SUBS (immediate, 32-bit) throughput 0.693814 | |
SUBS (immediate, 64-bit) 1->2 0.996059 | |
SUBS (immediate, 64-bit) 3->2 1.040072 | |
SUBS (immediate, 64-bit) throughput 0.697542 | |
SUBS (shifted immediate, 32-bit) 1->2 1.004537 | |
SUBS (shifted immediate, 32-bit) 3->2 1.029406 | |
SUBS (shifted immediate, 32-bit) throughput 0.692940 | |
SUBS (shifted immediate, 64-bit) 1->2 0.998861 | |
SUBS (shifted immediate, 64-bit) 3->2 0.997523 | |
SUBS (shifted immediate, 64-bit) throughput 0.699309 | |
SUBS (register, 32-bit) 1->2 0.993789 | |
SUBS (register, 32-bit) 1->3 1.006864 | |
SUBS (register, 32-bit) 4->2 1.368141 | |
SUBS (register, 32-bit) 4->3 1.002423 | |
SUBS (register, 32-bit) throughput 0.688424 | |
SUBS (register, 64-bit) 1->2 0.997193 | |
SUBS (register, 64-bit) 1->3 1.001820 | |
SUBS (register, 64-bit) 4->2 0.972127 | |
SUBS (register, 64-bit) 4->3 1.024015 | |
SUBS (register, 64-bit) throughput 0.689549 | |
SUBS (register, lsl, 32-bit) 1->2 1.969119 | |
SUBS (register, lsl, 32-bit) 1->3 2.036333 | |
SUBS (register, lsl, 32-bit) 4->2 2.027163 | |
SUBS (register, lsl, 32-bit) 4->3 1.904781 | |
SUBS (register, lsl, 32-bit) throughput 1.013826 | |
SUBS (register, lsl, 64-bit) 1->2 2.007545 | |
SUBS (register, lsl, 64-bit) 1->3 2.006664 | |
SUBS (register, lsl, 64-bit) 4->2 2.012141 | |
SUBS (register, lsl, 64-bit) 4->3 1.947231 | |
SUBS (register, lsl, 64-bit) throughput 1.006104 | |
SUBS (register, lsr, 32-bit) 1->2 2.001251 | |
SUBS (register, lsr, 32-bit) 1->3 1.998289 | |
SUBS (register, lsr, 32-bit) 4->2 1.995401 | |
SUBS (register, lsr, 32-bit) 4->3 2.087861 | |
SUBS (register, lsr, 32-bit) throughput 0.983165 | |
SUBS (register, lsr, 64-bit) 1->2 1.986526 | |
SUBS (register, lsr, 64-bit) 1->3 2.003114 | |
SUBS (register, lsr, 64-bit) 4->2 2.064560 | |
SUBS (register, lsr, 64-bit) 4->3 2.056404 | |
SUBS (register, lsr, 64-bit) throughput 1.009377 | |
SUBS (register, asr, 32-bit) 1->2 2.055502 | |
SUBS (register, asr, 32-bit) 1->3 2.000811 | |
SUBS (register, asr, 32-bit) 4->2 2.051997 | |
SUBS (register, asr, 32-bit) 4->3 2.041100 | |
SUBS (register, asr, 32-bit) throughput 1.001872 | |
SUBS (register, asr, 64-bit) 1->2 1.946877 | |
SUBS (register, asr, 64-bit) 1->3 2.016297 | |
SUBS (register, asr, 64-bit) 4->2 2.002303 | |
SUBS (register, asr, 64-bit) 4->3 2.001711 | |
SUBS (register, asr, 64-bit) throughput 0.998324 | |
SXTB (32-bit) 1->2 0.962689 | |
SXTB (32-bit) throughput 0.510085 | |
SXTB (64-bit) 1->2 1.004468 | |
SXTB (64-bit) throughput 0.505368 | |
SXTH (32-bit) 1->2 1.004553 | |
SXTH (32-bit) throughput 0.506993 | |
SXTH (64-bit) 1->2 1.000132 | |
SXTH (64-bit) throughput 0.501385 | |
SXTW 1->2 1.003048 | |
SXTW throughput 0.491165 | |
TBNZ (not taken) throughput 0.718616 | |
TBNZ (taken) throughput 3.941054 | |
TBZ (not taken) throughput 0.715419 | |
TBZ (taken) throughput 3.975367 | |
TST (immediate, 32-bit) 2->1 1.032814 | |
TST (immediate, 32-bit) throughput 0.667282 | |
TST (immediate, 64-bit) 2->1 0.977192 | |
TST (immediate, 64-bit) throughput 0.669894 | |
TST (register, 32-bit) 3->1 0.998292 | |
TST (register, 32-bit) 3->2 0.998267 | |
TST (register, 32-bit) throughput 0.678333 | |
TST (register, 64-bit) 3->1 1.022280 | |
TST (register, 64-bit) 3->2 0.990647 | |
TST (register, 64-bit) throughput 0.670197 | |
TST (register, lsl, 32-bit) 3->1 2.057407 | |
TST (register, lsl, 32-bit) 3->2 1.997828 | |
TST (register, lsl, 32-bit) throughput 1.000786 | |
TST (register, lsl, 64-bit) 3->1 2.031343 | |
TST (register, lsl, 64-bit) 3->2 2.008092 | |
TST (register, lsl, 64-bit) throughput 1.007465 | |
TST (register, lsr, 32-bit) 3->1 2.027355 | |
TST (register, lsr, 32-bit) 3->2 1.990792 | |
TST (register, lsr, 32-bit) throughput 0.998200 | |
TST (register, lsr, 64-bit) 3->1 2.022151 | |
TST (register, lsr, 64-bit) 3->2 1.970962 | |
TST (register, lsr, 64-bit) throughput 1.011328 | |
TST (register, asr, 32-bit) 3->1 1.993933 | |
TST (register, asr, 32-bit) 3->2 2.040365 | |
TST (register, asr, 32-bit) throughput 1.010991 | |
TST (register, asr, 64-bit) 3->1 2.069563 | |
TST (register, asr, 64-bit) 3->2 2.014430 | |
TST (register, asr, 64-bit) throughput 1.056777 | |
UBFIZ (32-bit) 1->2 1.006937 | |
UBFIZ (32-bit) throughput 0.501135 | |
UBFIZ (64-bit) 1->2 1.003400 | |
UBFIZ (64-bit) throughput 0.508060 | |
UBFX (32-bit) 1->2 1.030094 | |
UBFX (32-bit) throughput 0.503354 | |
UBFX (64-bit) 1->2 1.000439 | |
UBFX (64-bit) throughput 0.509494 | |
UDIV (fast, 32-bit) 1->2 6.987534 | |
UDIV (fast, 32-bit) 1->3 7.080969 | |
UDIV (fast, 32-bit) throughput 6.906448 | |
UDIV (slow, 32-bit) 1->2 20.033958 | |
UDIV (slow, 32-bit) 1->3 20.136273 | |
UDIV (slow, 32-bit) throughput 20.152933 | |
UDIV (fast, 64-bit) 1->2 7.084540 | |
UDIV (fast, 64-bit) 1->3 6.904647 | |
UDIV (fast, 64-bit) throughput 7.069000 | |
UDIV (medium, 64-bit) 1->2 20.153040 | |
UDIV (medium, 64-bit) 1->3 20.176154 | |
UDIV (medium, 64-bit) throughput 19.176891 | |
UDIV (slow, 64-bit) 1->2 36.615219 | |
UDIV (slow, 64-bit) 1->3 35.385052 | |
UDIV (slow, 64-bit) throughput 36.007720 | |
UMADDL 1->2 3.007907 | |
UMADDL 1->3 3.046113 | |
UMADDL 1->4 0.998301 | |
UMADDL throughput 0.991808 | |
UMNEGL 1->2 3.073124 | |
UMNEGL 1->3 3.027567 | |
UMNEGL throughput 1.006605 | |
UMSUBL 1->2 3.004029 | |
UMSUBL 1->3 3.007454 | |
UMSUBL 1->4 1.012594 | |
UMSUBL throughput 1.006393 | |
UMULH 1->2 3.533482 | |
UMULH 1->3 4.018509 | |
UMULH throughput 2.017083 | |
UMULL 1->2 3.042159 | |
UMULL 1->3 3.013704 | |
UMULL throughput 1.002097 | |
UXTB 1->2 1.008607 | |
UXTB throughput 0.524309 | |
UXTH 1->2 1.001381 | |
UXTH throughput 0.498791 | |
YIELD throughput 0.338875 | |
ABS (vector, 8B) 1->2 3.007250 | |
ABS (vector, 8B) throughput 1.015256 | |
ABS (vector, 16B) 1->2 2.997544 | |
ABS (vector, 16B) throughput 1.005269 | |
ABS (vector, 4H) 1->2 2.939025 | |
ABS (vector, 4H) throughput 1.002923 | |
ABS (vector, 8H) 1->2 3.025956 | |
ABS (vector, 8H) throughput 1.020965 | |
ABS (vector, 2S) 1->2 2.999697 | |
ABS (vector, 2S) throughput 1.022875 | |
ABS (vector, 4S) 1->2 2.996147 | |
ABS (vector, 4S) throughput 1.068286 | |
ABS (vector, 2D) 1->2 2.991116 | |
ABS (vector, 2D) throughput 1.007554 | |
ABS (vector, D) 1->2 3.007523 | |
ABS (vector, D) throughput 1.013336 | |
ADD (vector, 8B) 1->2 2.033059 | |
ADD (vector, 8B) 1->3 2.028099 | |
ADD (vector, 8B) throughput 1.007672 | |
ADD (vector, 16B) 1->2 2.036088 | |
ADD (vector, 16B) 1->3 1.998401 | |
ADD (vector, 16B) throughput 1.011445 | |
ADD (vector, 4H) 1->2 1.996914 | |
ADD (vector, 4H) 1->3 2.018701 | |
ADD (vector, 4H) throughput 1.013600 | |
ADD (vector, 8H) 1->2 2.020899 | |
ADD (vector, 8H) 1->3 2.024322 | |
ADD (vector, 8H) throughput 1.010555 | |
ADD (vector, 2S) 1->2 1.999805 | |
ADD (vector, 2S) 1->3 2.015027 | |
ADD (vector, 2S) throughput 1.008079 | |
ADD (vector, 4S) 1->2 2.001659 | |
ADD (vector, 4S) 1->3 1.989961 | |
ADD (vector, 4S) throughput 1.018138 | |
ADD (vector, 2D) 1->2 2.032458 | |
ADD (vector, 2D) 1->3 1.994197 | |
ADD (vector, 2D) throughput 1.002314 | |
ADD (vector, D) 1->2 2.159004 | |
ADD (vector, D) 1->3 1.995849 | |
ADD (vector, D) throughput 1.012542 | |
ADDHN (vector, 8H) 1->2 3.983935 | |
ADDHN (vector, 8H) 1->3 4.059027 | |
ADDHN (vector, 8H) throughput 1.013663 | |
ADDHN (vector, 4S) 1->2 4.004232 | |
ADDHN (vector, 4S) 1->3 4.038881 | |
ADDHN (vector, 4S) throughput 1.003135 | |
ADDHN (vector, 2D) 1->2 4.015611 | |
ADDHN (vector, 2D) 1->3 4.106365 | |
ADDHN (vector, 2D) throughput 1.009300 | |
ADDHN2 (vector, 8H) 1->1 4.003108 | |
ADDHN2 (vector, 8H) 1->2 3.985854 | |
ADDHN2 (vector, 8H) 1->3 4.035528 | |
ADDHN2 (vector, 8H) throughput 1.002281 | |
ADDHN2 (vector, 8H) throughput 1.006784 | |
ADDHN2 (vector, 4S) 1->1 3.937920 | |
ADDHN2 (vector, 4S) 1->2 4.019440 | |
ADDHN2 (vector, 4S) 1->3 4.030841 | |
ADDHN2 (vector, 4S) throughput 1.012815 | |
ADDHN2 (vector, 4S) throughput 0.924417 | |
ADDHN2 (vector, 2D) 1->1 3.999506 | |
ADDHN2 (vector, 2D) 1->2 4.010053 | |
ADDHN2 (vector, 2D) 1->3 4.030814 | |
ADDHN2 (vector, 2D) throughput 0.991461 | |
ADDHN2 (vector, 2D) throughput 1.010206 | |
ADDP (scalar) 1->2 2.003254 | |
ADDP (scalar) throughput 1.006389 | |
ADDP (vector, 8B) 1->2 1.979064 | |
ADDP (vector, 8B) 1->3 2.053154 | |
ADDP (vector, 8B) throughput 1.005871 | |
ADDP (vector, 16B) 1->2 2.037051 | |
ADDP (vector, 16B) 1->3 2.017942 | |
ADDP (vector, 16B) throughput 1.006277 | |
ADDP (vector, 4H) 1->2 1.997258 | |
ADDP (vector, 4H) 1->3 1.971871 | |
ADDP (vector, 4H) throughput 0.984254 | |
ADDP (vector, 8H) 1->2 2.005846 | |
ADDP (vector, 8H) 1->3 2.010812 | |
ADDP (vector, 8H) throughput 0.993541 | |
ADDP (vector, 2S) 1->2 2.007545 | |
ADDP (vector, 2S) 1->3 2.053019 | |
ADDP (vector, 2S) throughput 0.985483 | |
ADDP (vector, 4S) 1->2 2.034625 | |
ADDP (vector, 4S) 1->3 2.053833 | |
ADDP (vector, 4S) throughput 0.983663 | |
ADDP (vector, 2D) 1->2 1.998465 | |
ADDP (vector, 2D) 1->3 2.020057 | |
ADDP (vector, 2D) throughput 1.007501 | |
ADDV (8B) 1->2 2.971574 | |
ADDV (8B) throughput 1.007575 | |
ADDV (16B) 1->2 2.979880 | |
ADDV (16B) throughput 1.007326 | |
ADDV (4H) 1->2 2.952905 | |
ADDV (4H) throughput 1.015763 | |
ADDV (8H) 1->2 3.019495 | |
ADDV (8H) throughput 0.994723 | |
ADDV (4S) 1->2 2.998971 | |
ADDV (4S) throughput 1.010202 | |
AESD 1->1 5.039451 | |
AESD 1->2 5.004992 | |
AESD throughput 2.022245 | |
AESD throughput 2.003695 | |
AESD + AESIMC 1->1 5.034084 | |
AESD + AESIMC 1->2 5.022808 | |
AESD + AESIMC throughput 2.018294 | |
AESD + AESIMC throughput 2.018709 | |
AESE 1->1 4.996964 | |
AESE 1->2 5.038355 | |
AESE throughput 2.031805 | |
AESE throughput 2.012394 | |
AESE + AESMC 1->1 5.032591 | |
AESE + AESMC 1->2 4.920683 | |
AESE + AESMC throughput 1.995480 | |
AESE + AESMC throughput 1.979431 | |
AESIMC 1->2 2.014255 | |
AESIMC throughput 0.987556 | |
AESMC 1->2 2.006111 | |
AESMC throughput 1.004387 | |
AND (vector, 8B) 1->2 1.957855 | |
AND (vector, 8B) 1->3 2.032406 | |
AND (vector, 8B) throughput 1.008990 | |
AND (vector, 16B) 1->2 2.018490 | |
AND (vector, 16B) 1->3 2.062440 | |
AND (vector, 16B) throughput 1.242809 | |
BIC (vector, immediate, 4H) 1->1 2.025167 | |
BIC (vector, immediate, 4H) throughput 1.182451 | |
BIC (vector, immediate, 4H) throughput 0.945917 | |
BIC (vector, immediate, 8H) 1->1 2.614613 | |
BIC (vector, immediate, 8H) throughput 0.877937 | |
BIC (vector, immediate, 8H) throughput 1.046293 | |
BIC (vector, immediate, 2S) 1->1 2.014907 | |
BIC (vector, immediate, 2S) throughput 1.021426 | |
BIC (vector, immediate, 2S) throughput 1.002870 | |
BIC (vector, immediate, 4S) 1->1 2.029656 | |
BIC (vector, immediate, 4S) throughput 0.998721 | |
BIC (vector, immediate, 4S) throughput 1.011562 | |
BIC (vector, register, 8B) 1->2 2.018957 | |
BIC (vector, register, 8B) 1->3 1.919775 | |
BIC (vector, register, 8B) throughput 0.996863 | |
BIC (vector, register, 16B) 1->2 1.985062 | |
BIC (vector, register, 16B) 1->3 2.006509 | |
BIC (vector, register, 16B) throughput 1.001146 | |
BIF (vector, 8B) 1->1 2.005091 | |
BIF (vector, 8B) 1->2 2.006887 | |
BIF (vector, 8B) 1->3 2.013004 | |
BIF (vector, 8B) throughput 0.987802 | |
BIF (vector, 8B) throughput 1.010952 | |
BIF (vector, 16B) 1->1 2.037882 | |
BIF (vector, 16B) 1->2 1.998377 | |
BIF (vector, 16B) 1->3 1.993803 | |
BIF (vector, 16B) throughput 1.011696 | |
BIF (vector, 16B) throughput 0.988295 | |
BIT (vector, 8B) 1->1 2.010047 | |
BIT (vector, 8B) 1->2 2.010549 | |
BIT (vector, 8B) 1->3 2.120874 | |
BIT (vector, 8B) throughput 1.004719 | |
BIT (vector, 8B) throughput 0.937619 | |
BIT (vector, 16B) 1->1 1.998202 | |
BIT (vector, 16B) 1->2 2.029733 | |
BIT (vector, 16B) 1->3 2.005593 | |
BIT (vector, 16B) throughput 1.007276 | |
BIT (vector, 16B) throughput 1.005503 | |
BSL (vector, 8B) 1->1 1.957963 | |
BSL (vector, 8B) 1->2 1.987802 | |
BSL (vector, 8B) 1->3 2.098683 | |
BSL (vector, 8B) throughput 1.002056 | |
BSL (vector, 8B) throughput 1.023296 | |
BSL (vector, 16B) 1->1 2.042226 | |
BSL (vector, 16B) 1->2 2.008775 | |
BSL (vector, 16B) 1->3 2.055397 | |
BSL (vector, 16B) throughput 0.835919 | |
BSL (vector, 16B) throughput 1.005752 | |
CLS (vector, 8B) 1->2 2.002580 | |
CLS (vector, 8B) throughput 1.009771 | |
CLS (vector, 16B) 1->2 2.022808 | |
CLS (vector, 16B) throughput 1.003902 | |
CLS (vector, 4H) 1->2 1.979671 | |
CLS (vector, 4H) throughput 0.988978 | |
CLS (vector, 8H) 1->2 2.045748 | |
CLS (vector, 8H) throughput 0.993252 | |
CLS (vector, 2S) 1->2 1.993378 | |
CLS (vector, 2S) throughput 1.008890 | |
CLS (vector, 4S) 1->2 1.973815 | |
CLS (vector, 4S) throughput 0.998547 | |
CLZ (vector, 8B) 1->2 2.026835 | |
CLZ (vector, 8B) throughput 0.994124 | |
CLZ (vector, 16B) 1->2 2.039297 | |
CLZ (vector, 16B) throughput 1.006855 | |
CLZ (vector, 4H) 1->2 1.884179 | |
CLZ (vector, 4H) throughput 1.008123 | |
CLZ (vector, 8H) 1->2 2.070992 | |
CLZ (vector, 8H) throughput 1.013810 | |
CLZ (vector, 2S) 1->2 2.011339 | |
CLZ (vector, 2S) throughput 1.011075 | |
CLZ (vector, 4S) 1->2 2.020322 | |
CLZ (vector, 4S) throughput 0.989674 | |
CMEQ (register, 8B) 1->2 3.017637 | |
CMEQ (register, 8B) 1->3 3.001862 | |
CMEQ (register, 8B) throughput 1.001866 | |
CMEQ (register, 16B) 1->2 3.503537 | |
CMEQ (register, 16B) 1->3 3.003091 | |
CMEQ (register, 16B) throughput 0.980771 | |
CMEQ (register, 4H) 1->2 3.051864 | |
CMEQ (register, 4H) 1->3 3.051241 | |
CMEQ (register, 4H) throughput 1.000321 | |
CMEQ (register, 8H) 1->2 3.066953 | |
CMEQ (register, 8H) 1->3 2.998652 | |
CMEQ (register, 8H) throughput 1.007762 | |
CMEQ (register, 2S) 1->2 2.948785 | |
CMEQ (register, 2S) 1->3 2.937869 | |
CMEQ (register, 2S) throughput 1.041115 | |
CMEQ (register, 4S) 1->2 2.966070 | |
CMEQ (register, 4S) 1->3 2.995876 | |
CMEQ (register, 4S) throughput 1.004902 | |
CMEQ (register, 2D) 1->2 2.991247 | |
CMEQ (register, 2D) 1->3 3.026451 | |
CMEQ (register, 2D) throughput 1.013013 | |
CMEQ (register, D) 1->2 3.014670 | |
CMEQ (register, D) 1->3 3.029199 | |
CMEQ (register, D) throughput 1.010273 | |
CMEQ (zero, 8B) 1->2 3.044741 | |
CMEQ (zero, 8B) throughput 0.988561 | |
CMEQ (zero, 16B) 1->2 3.015411 | |
CMEQ (zero, 16B) throughput 1.010857 | |
CMEQ (zero, 4H) 1->2 3.023272 | |
CMEQ (zero, 4H) throughput 1.003591 | |
CMEQ (zero, 8H) 1->2 2.998620 | |
CMEQ (zero, 8H) throughput 1.012617 | |
CMEQ (zero, 2S) 1->2 3.048844 | |
CMEQ (zero, 2S) throughput 1.002140 | |
CMEQ (zero, 4S) 1->2 3.010727 | |
CMEQ (zero, 4S) throughput 1.007749 | |
CMEQ (zero, 2D) 1->2 3.021259 | |
CMEQ (zero, 2D) throughput 1.006377 | |
CMEQ (zero, D) 1->2 3.015079 | |
CMEQ (zero, D) throughput 0.993009 | |
CMGE (register, 8B) 1->2 3.003197 | |
CMGE (register, 8B) 1->3 2.994217 | |
CMGE (register, 8B) throughput 1.003785 | |
CMGE (register, 16B) 1->2 3.021376 | |
CMGE (register, 16B) 1->3 3.125078 | |
CMGE (register, 16B) throughput 0.996632 | |
CMGE (register, 4H) 1->2 2.963498 | |
CMGE (register, 4H) 1->3 3.048888 | |
CMGE (register, 4H) throughput 1.023741 | |
CMGE (register, 8H) 1->2 3.024323 | |
CMGE (register, 8H) 1->3 3.006578 | |
CMGE (register, 8H) throughput 0.998227 | |
CMGE (register, 2S) 1->2 2.958471 | |
CMGE (register, 2S) 1->3 3.057998 | |
CMGE (register, 2S) throughput 1.017653 | |
CMGE (register, 4S) 1->2 3.004656 | |
CMGE (register, 4S) 1->3 3.019552 | |
CMGE (register, 4S) throughput 0.996919 | |
CMGE (register, 2D) 1->2 3.074881 | |
CMGE (register, 2D) 1->3 3.014300 | |
CMGE (register, 2D) throughput 1.006788 | |
CMGE (register, D) 1->2 3.014212 | |
CMGE (register, D) 1->3 3.083429 | |
CMGE (register, D) throughput 1.170342 | |
CMGE (zero, 8B) 1->2 2.716496 | |
CMGE (zero, 8B) throughput 1.002402 | |
CMGE (zero, 16B) 1->2 3.001949 | |
CMGE (zero, 16B) throughput 1.009566 | |
CMGE (zero, 4H) 1->2 3.021817 | |
CMGE (zero, 4H) throughput 1.008171 | |
CMGE (zero, 8H) 1->2 3.029009 | |
CMGE (zero, 8H) throughput 1.019968 | |
CMGE (zero, 2S) 1->2 2.947408 | |
CMGE (zero, 2S) throughput 1.050242 | |
CMGE (zero, 4S) 1->2 2.298464 | |
CMGE (zero, 4S) throughput 1.058524 | |
CMGE (zero, 2D) 1->2 3.025326 | |
CMGE (zero, 2D) throughput 1.013574 | |
CMGE (zero, D) 1->2 3.045478 | |
CMGE (zero, D) throughput 1.039020 | |
CMGT (register, 8B) 1->2 3.121036 | |
CMGT (register, 8B) 1->3 3.013067 | |
CMGT (register, 8B) throughput 1.010115 | |
CMGT (register, 16B) 1->2 3.037771 | |
CMGT (register, 16B) 1->3 3.023097 | |
CMGT (register, 16B) throughput 1.012334 | |
CMGT (register, 4H) 1->2 3.016516 | |
CMGT (register, 4H) 1->3 3.025002 | |
CMGT (register, 4H) throughput 0.997583 | |
CMGT (register, 8H) 1->2 3.009520 | |
CMGT (register, 8H) 1->3 2.998078 | |
CMGT (register, 8H) throughput 1.004880 | |
CMGT (register, 2S) 1->2 3.017148 | |
CMGT (register, 2S) 1->3 3.016181 | |
CMGT (register, 2S) throughput 1.027698 | |
CMGT (register, 4S) 1->2 3.006284 | |
CMGT (register, 4S) 1->3 3.098666 | |
CMGT (register, 4S) throughput 1.004115 | |
CMGT (register, 2D) 1->2 3.012786 | |
CMGT (register, 2D) 1->3 3.024081 | |
CMGT (register, 2D) throughput 1.007721 | |
CMGT (register, D) 1->2 3.004547 | |
CMGT (register, D) 1->3 2.994373 | |
CMGT (register, D) throughput 1.000727 | |
CMGT (zero, 8B) 1->2 3.090299 | |
CMGT (zero, 8B) throughput 1.008163 | |
CMGT (zero, 16B) 1->2 3.004462 | |
CMGT (zero, 16B) throughput 1.010336 | |
CMGT (zero, 4H) 1->2 2.971203 | |
CMGT (zero, 4H) throughput 1.014112 | |
CMGT (zero, 8H) 1->2 2.991186 | |
CMGT (zero, 8H) throughput 0.980895 | |
CMGT (zero, 2S) 1->2 2.924302 | |
CMGT (zero, 2S) throughput 1.018588 | |
CMGT (zero, 4S) 1->2 3.041080 | |
CMGT (zero, 4S) throughput 1.017192 | |
CMGT (zero, 2D) 1->2 2.932733 | |
CMGT (zero, 2D) throughput 1.007295 | |
CMGT (zero, D) 1->2 3.032086 | |
CMGT (zero, D) throughput 1.020047 | |
CMHI (register, 8B) 1->2 3.034346 | |
CMHI (register, 8B) 1->3 2.997679 | |
CMHI (register, 8B) throughput 1.012243 | |
CMHI (register, 16B) 1->2 3.012712 | |
CMHI (register, 16B) 1->3 3.675759 | |
CMHI (register, 16B) throughput 1.117940 | |
CMHI (register, 4H) 1->2 3.069672 | |
CMHI (register, 4H) 1->3 3.136666 | |
CMHI (register, 4H) throughput 1.120472 | |
CMHI (register, 8H) 1->2 3.050667 | |
CMHI (register, 8H) 1->3 3.011052 | |
CMHI (register, 8H) throughput 1.135903 | |
CMHI (register, 2S) 1->2 2.977206 | |
CMHI (register, 2S) 1->3 2.635471 | |
CMHI (register, 2S) throughput 1.020468 | |
CMHI (register, 4S) 1->2 3.065508 | |
CMHI (register, 4S) 1->3 2.993282 | |
CMHI (register, 4S) throughput 1.128183 | |
CMHI (register, 2D) 1->2 3.103759 | |
CMHI (register, 2D) 1->3 3.002096 | |
CMHI (register, 2D) throughput 1.127916 | |
CMHI (register, D) 1->2 3.030767 | |
CMHI (register, D) 1->3 2.642310 | |
CMHI (register, D) throughput 1.046791 | |
CMHS (register, 8B) 1->2 2.996946 | |
CMHS (register, 8B) 1->3 3.005851 | |
CMHS (register, 8B) throughput 1.156565 | |
CMHS (register, 16B) 1->2 3.011669 | |
CMHS (register, 16B) 1->3 3.019267 | |
CMHS (register, 16B) throughput 1.092231 | |
CMHS (register, 4H) 1->2 2.987132 | |
CMHS (register, 4H) 1->3 3.022592 | |
CMHS (register, 4H) throughput 1.277792 | |
CMHS (register, 8H) 1->2 3.140781 | |
CMHS (register, 8H) 1->3 3.057998 | |
CMHS (register, 8H) throughput 1.067974 | |
CMHS (register, 2S) 1->2 3.095741 | |
CMHS (register, 2S) 1->3 3.188748 | |
CMHS (register, 2S) throughput 1.085919 | |
CMHS (register, 4S) 1->2 2.902468 | |
CMHS (register, 4S) 1->3 3.045540 | |
CMHS (register, 4S) throughput 1.115180 | |
CMHS (register, 2D) 1->2 3.977539 | |
CMHS (register, 2D) 1->3 4.083088 | |
CMHS (register, 2D) throughput 1.118848 | |
CMHS (register, D) 1->2 2.942961 | |
CMHS (register, D) 1->3 2.919819 | |
CMHS (register, D) throughput 0.928489 | |
CMLE (zero, 8B) 1->2 3.020901 | |
CMLE (zero, 8B) throughput 1.145843 | |
CMLE (zero, 16B) 1->2 3.032013 | |
CMLE (zero, 16B) throughput 1.113597 | |
CMLE (zero, 4H) 1->2 2.840190 | |
CMLE (zero, 4H) throughput 1.155493 | |
CMLE (zero, 8H) 1->2 3.024597 | |
CMLE (zero, 8H) throughput 1.138283 | |
CMLE (zero, 2S) 1->2 3.094546 | |
CMLE (zero, 2S) throughput 1.026355 | |
CMLE (zero, 4S) 1->2 3.059740 | |
CMLE (zero, 4S) throughput 1.030018 | |
CMLE (zero, 2D) 1->2 3.002029 | |
CMLE (zero, 2D) throughput 1.019380 | |
CMLE (zero, D) 1->2 3.095277 | |
CMLE (zero, D) throughput 1.046601 | |
CMLT (zero, 8B) 1->2 3.019816 | |
CMLT (zero, 8B) throughput 1.002855 | |
CMLT (zero, 16B) 1->2 2.998929 | |
CMLT (zero, 16B) throughput 1.037633 | |
CMLT (zero, 4H) 1->2 3.101886 | |
CMLT (zero, 4H) throughput 1.030073 | |
CMLT (zero, 8H) 1->2 3.032422 | |
CMLT (zero, 8H) throughput 0.886951 | |
CMLT (zero, 2S) 1->2 3.000154 | |
CMLT (zero, 2S) throughput 0.985943 | |
CMLT (zero, 4S) 1->2 2.938466 | |
CMLT (zero, 4S) throughput 0.997628 | |
CMLT (zero, 2D) 1->2 3.048376 | |
CMLT (zero, 2D) throughput 1.018682 | |
CMLT (zero, D) 1->2 3.001443 | |
CMLT (zero, D) throughput 1.012767 | |
CMTST (register, 8B) 1->2 2.038135 | |
CMTST (register, 8B) 1->3 2.043341 | |
CMTST (register, 8B) throughput 1.015485 | |
CMTST (register, 16B) 1->2 1.996475 | |
CMTST (register, 16B) 1->3 1.977750 | |
CMTST (register, 16B) throughput 1.003309 | |
CMTST (register, 4H) 1->2 2.097947 | |
CMTST (register, 4H) 1->3 2.014694 | |
CMTST (register, 4H) throughput 1.007663 | |
CMTST (register, 8H) 1->2 2.229339 | |
CMTST (register, 8H) 1->3 2.002552 | |
CMTST (register, 8H) throughput 1.033817 | |
CMTST (register, 2S) 1->2 1.992121 | |
CMTST (register, 2S) 1->3 2.023400 | |
CMTST (register, 2S) throughput 1.006595 | |
CMTST (register, 4S) 1->2 2.014266 | |
CMTST (register, 4S) 1->3 1.966606 | |
CMTST (register, 4S) throughput 1.020447 | |
CMTST (register, 2D) 1->2 2.034957 | |
CMTST (register, 2D) 1->3 2.011556 | |
CMTST (register, 2D) throughput 1.009277 | |
CMTST (register, D) 1->2 2.026841 | |
CMTST (register, D) 1->3 1.973072 | |
CMTST (register, D) throughput 1.021966 | |
CNT (8B) 1->2 2.016255 | |
CNT (8B) throughput 1.007778 | |
CNT (16B) 1->2 2.067843 | |
CNT (16B) throughput 0.991455 | |
DUP (element, vector, 8B) 1->2 2.012469 | |
DUP (element, vector, 8B) throughput 1.006932 | |
DUP (element, vector, 16B) 1->2 2.027843 | |
DUP (element, vector, 16B) throughput 1.013051 | |
DUP (element, vector, 4H) 1->2 1.984385 | |
DUP (element, vector, 4H) throughput 0.993924 | |
DUP (element, vector, 8H) 1->2 2.089872 | |
DUP (element, vector, 8H) throughput 1.005545 | |
DUP (element, vector, 2S) 1->2 2.030527 | |
DUP (element, vector, 2S) throughput 1.013533 | |
DUP (element, vector, 4S) 1->2 2.008498 | |
DUP (element, vector, 4S) throughput 1.005530 | |
DUP (element, vector, 2D) 1->2 1.994050 | |
DUP (element, vector, 2D) throughput 1.009440 | |
DUP (element, scalar, B) 1->2 2.013076 | |
DUP (element, scalar, B) throughput 1.009509 | |
DUP (element, scalar, H) 1->2 1.991251 | |
DUP (element, scalar, H) throughput 1.005150 | |
DUP (element, scalar, S) 1->2 2.011195 | |
DUP (element, scalar, S) throughput 0.994445 | |
DUP (element, scalar, D) 1->2 2.005768 | |
DUP (element, scalar, D) throughput 1.010762 | |
DUP (general, 8B) 1->2 roundtrip 9.077797 | |
DUP (general, 8B) throughput 1.011090 | |
DUP (general, 16B) 1->2 roundtrip 9.070450 | |
DUP (general, 16B) throughput 1.007342 | |
DUP (general, 4H) 1->2 roundtrip 9.107848 | |
DUP (general, 4H) throughput 1.018217 | |
DUP (general, 8H) 1->2 roundtrip 8.951617 | |
DUP (general, 8H) throughput 1.007653 | |
DUP (general, 2S) 1->2 roundtrip 8.921037 | |
DUP (general, 2S) throughput 1.009119 | |
DUP (general, 4S) 1->2 roundtrip 9.172240 | |
DUP (general, 4S) throughput 1.012397 | |
DUP (general, 2D) 1->2 roundtrip 9.059852 | |
DUP (general, 2D) throughput 1.008670 | |
EOR (vector, 8B) 1->2 1.998796 | |
EOR (vector, 8B) 1->3 2.023973 | |
EOR (vector, 8B) throughput 1.002293 | |
EOR (vector, 16B) 1->2 1.932684 | |
EOR (vector, 16B) 1->3 2.011900 | |
EOR (vector, 16B) throughput 1.045204 | |
EXT (vector, 8B) 1->2 2.009566 | |
EXT (vector, 8B) 1->3 2.018625 | |
EXT (vector, 8B) throughput 1.004671 | |
EXT (vector, 16B) 1->2 1.974049 | |
EXT (vector, 16B) 1->3 2.017323 | |
EXT (vector, 16B) throughput 1.003258 | |
FABD (vector, 2S) 1->2 4.020232 | |
FABD (vector, 2S) 1->3 4.062504 | |
FABD (vector, 2S) throughput 0.992576 | |
FABD (vector, 4S) 1->2 4.058655 | |
FABD (vector, 4S) 1->3 3.973624 | |
FABD (vector, 4S) throughput 1.010846 | |
FABD (vector, 2D) 1->2 4.013813 | |
FABD (vector, 2D) 1->3 4.087835 | |
FABD (vector, 2D) throughput 1.013977 | |
FABD (scalar, S) 1->2 4.011514 | |
FABD (scalar, S) 1->3 4.064278 | |
FABD (scalar, S) throughput 1.000141 | |
FABD (scalar, D) 1->2 4.014078 | |
FABD (scalar, D) 1->3 4.007827 | |
FABD (scalar, D) throughput 0.952025 | |
FABS (scalar, S) 1->2 2.079151 | |
FABS (scalar, S) throughput 1.045693 | |
FABS (scalar, D) 1->2 2.051216 | |
FABS (scalar, D) throughput 1.002666 | |
FABS (vector, 2S) 1->2 1.991951 | |
FABS (vector, 2S) throughput 1.025060 | |
FABS (vector, 4S) 1->2 1.999739 | |
FABS (vector, 4S) throughput 0.998324 | |
FABS (vector, 2D) 1->2 1.998181 | |
FABS (vector, 2D) throughput 0.984424 | |
FACGE (vector, 2S) 1->2 2.019408 | |
FACGE (vector, 2S) 1->3 2.004124 | |
FACGE (vector, 2S) throughput 1.006681 | |
FACGE (vector, 4S) 1->2 2.050559 | |
FACGE (vector, 4S) 1->3 1.997645 | |
FACGE (vector, 4S) throughput 1.004183 | |
FACGE (vector, 2D) 1->2 2.006489 | |
FACGE (vector, 2D) 1->3 2.005144 | |
FACGE (vector, 2D) throughput 1.019028 | |
FACGE (scalar, S) 1->2 1.993269 | |
FACGE (scalar, S) 1->3 1.982889 | |
FACGE (scalar, S) throughput 1.006143 | |
FACGE (scalar, D) 1->2 2.030526 | |
FACGE (scalar, D) 1->3 2.028616 | |
FACGE (scalar, D) throughput 0.973117 | |
FACGT (vector, 2S) 1->2 2.018682 | |
FACGT (vector, 2S) 1->3 2.000251 | |
FACGT (vector, 2S) throughput 1.009477 | |
FACGT (vector, 4S) 1->2 1.962496 | |
FACGT (vector, 4S) 1->3 2.029663 | |
FACGT (vector, 4S) throughput 0.993334 | |
FACGT (vector, 2D) 1->2 1.985457 | |
FACGT (vector, 2D) 1->3 2.018949 | |
FACGT (vector, 2D) throughput 1.007604 | |
FACGT (scalar, S) 1->2 2.006470 | |
FACGT (scalar, S) 1->3 2.008751 | |
FACGT (scalar, S) throughput 1.014913 | |
FACGT (scalar, D) 1->2 2.005913 | |
FACGT (scalar, D) 1->3 2.015529 | |
FACGT (scalar, D) throughput 0.999891 | |
FADD (vector, 2S) 1->2 4.186355 | |
FADD (vector, 2S) 1->3 4.004781 | |
FADD (vector, 2S) throughput 0.991373 | |
FADD (vector, 4S) 1->2 4.032909 | |
FADD (vector, 4S) 1->3 4.028926 | |
FADD (vector, 4S) throughput 1.013261 | |
FADD (vector, 2D) 1->2 4.015857 | |
FADD (vector, 2D) 1->3 3.977230 | |
FADD (vector, 2D) throughput 1.020233 | |
FADD (scalar, S) 1->2 4.023823 | |
FADD (scalar, S) 1->3 4.030685 | |
FADD (scalar, S) throughput 1.003422 | |
FADD (scalar, D) 1->2 4.018040 | |
FADD (scalar, D) 1->3 3.997376 | |
FADD (scalar, D) throughput 1.017668 | |
FADDP (scalar, S) 1->2 4.078243 | |
FADDP (scalar, S) throughput 1.009822 | |
FADDP (scalar, D) 1->2 4.000109 | |
FADDP (scalar, D) throughput 1.009813 | |
FADDP (vector, 2S) 1->2 4.021927 | |
FADDP (vector, 2S) 1->3 3.977572 | |
FADDP (vector, 2S) throughput 1.000250 | |
FADDP (vector, 4S) 1->2 4.047988 | |
FADDP (vector, 4S) 1->3 4.005085 | |
FADDP (vector, 4S) throughput 1.007634 | |
FADDP (vector, 2D) 1->2 4.019850 | |
FADDP (vector, 2D) 1->3 4.064151 | |
FADDP (vector, 2D) throughput 1.007084 | |
FCCMP (scalar, S) 3->1 2.026000 | |
FCCMP (scalar, S) 3->2 1.990341 | |
FCCMP (scalar, S) 3->3 2.007662 | |
FCCMP (scalar, S) throughput 1.961695 | |
FCCMP (scalar, D) 3->1 2.041952 | |
FCCMP (scalar, D) 3->2 2.024981 | |
FCCMP (scalar, D) 3->3 2.037518 | |
FCCMP (scalar, D) throughput 1.962348 | |
FCCMPE (scalar, S) 3->1 2.010596 | |
FCCMPE (scalar, S) 3->2 2.035419 | |
FCCMPE (scalar, S) 3->3 1.893853 | |
FCCMPE (scalar, S) throughput 2.011223 | |
FCCMPE (scalar, D) 3->1 2.221593 | |
FCCMPE (scalar, D) 3->2 2.031535 | |
FCCMPE (scalar, D) 3->3 1.979717 | |
FCCMPE (scalar, D) throughput 1.962478 | |
FCMEQ (vector, 2S) 1->2 1.994281 | |
FCMEQ (vector, 2S) 1->3 2.019096 | |
FCMEQ (vector, 2S) throughput 0.985184 | |
FCMEQ (vector, 4S) 1->2 2.006378 | |
FCMEQ (vector, 4S) 1->3 2.002407 | |
FCMEQ (vector, 4S) throughput 1.023163 | |
FCMEQ (vector, 2D) 1->2 1.998442 | |
FCMEQ (vector, 2D) 1->3 2.021900 | |
FCMEQ (vector, 2D) throughput 1.008448 | |
FCMEQ (scalar, S) 1->2 2.044424 | |
FCMEQ (scalar, S) 1->3 1.941633 | |
FCMEQ (scalar, S) throughput 0.993240 | |
FCMEQ (scalar, D) 1->2 2.034857 | |
FCMEQ (scalar, D) 1->3 1.993322 | |
FCMEQ (scalar, D) throughput 1.010470 | |
FCMEQ (vector, zero, 2S) 1->2 1.999609 | |
FCMEQ (vector, zero, 2S) throughput 0.958830 | |
FCMEQ (vector, zero, 4S) 1->2 2.022439 | |
FCMEQ (vector, zero, 4S) throughput 1.005521 | |
FCMEQ (vector, zero, 2D) 1->2 2.000299 | |
FCMEQ (vector, zero, 2D) throughput 1.004661 | |
FCMEQ (scalar, zero, S) 1->2 1.997701 | |
FCMEQ (scalar, zero, S) throughput 1.014353 | |
FCMEQ (scalar, zero, D) 1->2 2.016910 | |
FCMEQ (scalar, zero, D) throughput 1.010781 | |
FCMGE (vector, 2S) 1->2 2.052891 | |
FCMGE (vector, 2S) 1->3 2.017369 | |
FCMGE (vector, 2S) throughput 1.009269 | |
FCMGE (vector, 4S) 1->2 2.042709 | |
FCMGE (vector, 4S) 1->3 2.051338 | |
FCMGE (vector, 4S) throughput 1.007357 | |
FCMGE (vector, 2D) 1->2 1.999868 | |
FCMGE (vector, 2D) 1->3 2.003926 | |
FCMGE (vector, 2D) throughput 1.001823 | |
FCMGE (scalar, S) 1->2 2.013379 | |
FCMGE (scalar, S) 1->3 2.074096 | |
FCMGE (scalar, S) throughput 1.010846 | |
FCMGE (scalar, D) 1->2 2.024731 | |
FCMGE (scalar, D) 1->3 1.998662 | |
FCMGE (scalar, D) throughput 1.003274 | |
FCMGE (vector, zero, 2S) 1->2 2.019012 | |
FCMGE (vector, zero, 2S) throughput 0.998160 | |
FCMGE (vector, zero, 4S) 1->2 2.088223 | |
FCMGE (vector, zero, 4S) throughput 1.003577 | |
FCMGE (vector, zero, 2D) 1->2 1.981894 | |
FCMGE (vector, zero, 2D) throughput 1.005990 | |
FCMGE (scalar, zero, S) 1->2 1.998820 | |
FCMGE (scalar, zero, S) throughput 0.997829 | |
FCMGE (scalar, zero, D) 1->2 1.955271 | |
FCMGE (scalar, zero, D) throughput 1.007795 | |
FCMGT (vector, 2S) 1->2 2.033346 | |
FCMGT (vector, 2S) 1->3 2.060248 | |
FCMGT (vector, 2S) throughput 1.005667 | |
FCMGT (vector, 4S) 1->2 2.063154 | |
FCMGT (vector, 4S) 1->3 2.018164 | |
FCMGT (vector, 4S) throughput 0.999238 | |
FCMGT (vector, 2D) 1->2 2.000109 | |
FCMGT (vector, 2D) 1->3 1.996513 | |
FCMGT (vector, 2D) throughput 1.006019 | |
FCMGT (scalar, S) 1->2 1.998377 | |
FCMGT (scalar, S) 1->3 1.718885 | |
FCMGT (scalar, S) throughput 1.019332 | |
FCMGT (scalar, D) 1->2 1.974903 | |
FCMGT (scalar, D) 1->3 2.010380 | |
FCMGT (scalar, D) throughput 0.969292 | |
FCMGT (vector, zero, 2S) 1->2 2.002879 | |
FCMGT (vector, zero, 2S) throughput 0.997094 | |
FCMGT (vector, zero, 4S) 1->2 2.020541 | |
FCMGT (vector, zero, 4S) throughput 0.988926 | |
FCMGT (vector, zero, 2D) 1->2 2.006422 | |
FCMGT (vector, zero, 2D) throughput 1.007257 | |
FCMGT (scalar, zero, S) 1->2 2.023738 | |
FCMGT (scalar, zero, S) throughput 0.990396 | |
FCMGT (scalar, zero, D) 1->2 2.038861 | |
FCMGT (scalar, zero, D) throughput 1.020140 | |
FCMLE (vector, zero, 2S) 1->2 2.002696 | |
FCMLE (vector, zero, 2S) throughput 1.013642 | |
FCMLE (vector, zero, 4S) 1->2 2.000965 | |
FCMLE (vector, zero, 4S) throughput 1.008865 | |
FCMLE (vector, zero, 2D) 1->2 1.994207 | |
FCMLE (vector, zero, 2D) throughput 1.015452 | |
FCMLE (scalar, zero, S) 1->2 1.950953 | |
FCMLE (scalar, zero, S) throughput 0.988528 | |
FCMLE (scalar, zero, D) 1->2 2.060134 | |
FCMLE (scalar, zero, D) throughput 1.019344 | |
FCMLT (vector, zero, 2S) 1->2 1.953004 | |
FCMLT (vector, zero, 2S) throughput 1.005601 | |
FCMLT (vector, zero, 4S) 1->2 2.004144 | |
FCMLT (vector, zero, 4S) throughput 1.011701 | |
FCMLT (vector, zero, 2D) 1->2 2.042188 | |
FCMLT (vector, zero, 2D) throughput 0.997285 | |
FCMLT (scalar, zero, S) 1->2 2.003444 | |
FCMLT (scalar, zero, S) throughput 0.994145 | |
FCMLT (scalar, zero, D) 1->2 2.001075 | |
FCMLT (scalar, zero, D) throughput 1.013383 | |
FCMP (scalar, S) 3->1 2.259199 | |
FCMP (scalar, S) 3->2 1.994852 | |
FCMP (scalar, S) throughput 1.053492 | |
FCMP (scalar, D) 3->1 2.031906 | |
FCMP (scalar, D) 3->2 2.031709 | |
FCMP (scalar, D) throughput 1.051698 | |
FCMPE (scalar, S) 3->1 2.000606 | |
FCMPE (scalar, S) 3->2 2.026060 | |
FCMPE (scalar, S) throughput 1.054464 | |
FCMPE (scalar, D) 3->1 1.978239 | |
FCMPE (scalar, D) 3->2 2.015925 | |
FCMPE (scalar, D) throughput 1.057230 | |
FCSEL (scalar, S) 1->2 2.006553 | |
FCSEL (scalar, S) 1->3 1.994042 | |
FCSEL (scalar, S) 1->4 2.128829 | |
FCSEL (scalar, S) throughput 1.119522 | |
FCSEL (scalar, D) 1->2 2.008249 | |
FCSEL (scalar, D) 1->3 1.998399 | |
FCSEL (scalar, D) 1->4 2.030249 | |
FCSEL (scalar, D) throughput 0.999905 | |
FCVT (S to D) 1->2 2.006682 | |
FCVT (S to D) throughput 1.006180 | |
FCVT (D to S) 1->2 3.039506 | |
FCVT (D to S) throughput 0.988332 | |
FCVTAS (scalar, S to 32-bit GPR) 1->2 roundtrip 10.061319 | |
FCVTAS (scalar, S to 32-bit GPR) throughput 2.138660 | |
FCVTAS (scalar, S to 64-bit GPR) 1->2 roundtrip 10.071778 | |
FCVTAS (scalar, S to 64-bit GPR) throughput 2.343706 | |
FCVTAS (scalar, D to 32-bit GPR) 1->2 roundtrip 9.630054 | |
FCVTAS (scalar, D to 32-bit GPR) throughput 2.138890 | |
FCVTAS (scalar, D to 64-bit GPR) 1->2 roundtrip 9.997730 | |
FCVTAS (scalar, D to 64-bit GPR) throughput 2.159988 | |
FCVTAS (scalar, S to S) 1->2 3.011380 | |
FCVTAS (scalar, S to S) throughput 1.007939 | |
FCVTAS (scalar, S to S) 1->2 3.035954 | |
FCVTAS (scalar, S to S) throughput 1.008043 | |
FCVTAS (scalar, D to D) 1->2 3.543495 | |
FCVTAS (scalar, D to D) throughput 0.966808 | |
FCVTAS (scalar, D to D) 1->2 2.994436 | |
FCVTAS (scalar, D to D) throughput 1.008486 | |
FCVTAS (vector, 2S) 1->2 3.006437 | |
FCVTAS (vector, 2S) throughput 0.990713 | |
FCVTAS (vector, 4S) 1->2 2.995379 | |
FCVTAS (vector, 4S) throughput 1.008809 | |
FCVTAS (vector, 2D) 1->2 3.043062 | |
FCVTAS (vector, 2D) throughput 1.021970 | |
FCVTAU (scalar, S to 32-bit GPR) 1->2 roundtrip 10.031291 | |
FCVTAU (scalar, S to 32-bit GPR) throughput 2.134542 | |
FCVTAU (scalar, S to 64-bit GPR) 1->2 roundtrip 10.005041 | |
FCVTAU (scalar, S to 64-bit GPR) throughput 2.138267 | |
FCVTAU (scalar, D to 32-bit GPR) 1->2 roundtrip 10.077352 | |
FCVTAU (scalar, D to 32-bit GPR) throughput 2.140064 | |
FCVTAU (scalar, D to 64-bit GPR) 1->2 roundtrip 10.268318 | |
FCVTAU (scalar, D to 64-bit GPR) throughput 2.116247 | |
FCVTAU (scalar, S to S) 1->2 2.962183 | |
FCVTAU (scalar, S to S) throughput 1.016910 | |
FCVTAU (scalar, S to S) 1->2 3.002125 | |
FCVTAU (scalar, S to S) throughput 1.011476 | |
FCVTAU (scalar, D to D) 1->2 3.016996 | |
FCVTAU (scalar, D to D) throughput 1.010180 | |
FCVTAU (scalar, D to D) 1->2 3.010248 | |
FCVTAU (scalar, D to D) throughput 0.997116 | |
FCVTAU (vector, 2S) 1->2 3.005802 | |
FCVTAU (vector, 2S) throughput 1.009040 | |
FCVTAU (vector, 4S) 1->2 2.934530 | |
FCVTAU (vector, 4S) throughput 1.003303 | |
FCVTAU (vector, 2D) 1->2 3.018729 | |
FCVTAU (vector, 2D) throughput 1.006175 | |
FCVTL (vector, 4S) 1->2 2.000548 | |
FCVTL (vector, 4S) throughput 1.012766 | |
FCVTL (vector, 2D) 1->2 2.018332 | |
FCVTL (vector, 2D) throughput 0.957642 | |
FCVTL2 (vector, 4S) 1->2 2.020370 | |
FCVTL2 (vector, 4S) throughput 0.991783 | |
FCVTL2 (vector, 2D) 1->2 1.896799 | |
FCVTL2 (vector, 2D) throughput 1.005209 | |
FCVTMS (scalar, S to 32-bit GPR) 1->2 roundtrip 10.092557 | |
FCVTMS (scalar, S to 32-bit GPR) throughput 2.136479 | |
FCVTMS (scalar, S to 64-bit GPR) 1->2 roundtrip 10.068027 | |
FCVTMS (scalar, S to 64-bit GPR) throughput 2.144211 | |
FCVTMS (scalar, D to 32-bit GPR) 1->2 roundtrip 9.968189 | |
FCVTMS (scalar, D to 32-bit GPR) throughput 2.078078 | |
FCVTMS (scalar, D to 64-bit GPR) 1->2 roundtrip 10.049909 | |
FCVTMS (scalar, D to 64-bit GPR) throughput 2.131093 | |
FCVTMS (scalar, S to S) 1->2 2.995375 | |
FCVTMS (scalar, S to S) throughput 1.034098 | |
FCVTMS (scalar, S to S) 1->2 2.983652 | |
FCVTMS (scalar, S to S) throughput 1.005147 | |
FCVTMS (scalar, D to D) 1->2 3.014731 | |
FCVTMS (scalar, D to D) throughput 0.980925 | |
FCVTMS (scalar, D to D) 1->2 3.048954 | |
FCVTMS (scalar, D to D) throughput 1.007998 | |
FCVTMS (vector, 2S) 1->2 2.996018 | |
FCVTMS (vector, 2S) throughput 1.001127 | |
FCVTMS (vector, 4S) 1->2 3.035517 | |
FCVTMS (vector, 4S) throughput 1.012515 | |
FCVTMS (vector, 2D) 1->2 3.044227 | |
FCVTMS (vector, 2D) throughput 1.010762 | |
FCVTMU (scalar, S to 32-bit GPR) 1->2 roundtrip 9.657614 | |
FCVTMU (scalar, S to 32-bit GPR) throughput 2.137284 | |
FCVTMU (scalar, S to 64-bit GPR) 1->2 roundtrip 10.035573 | |
FCVTMU (scalar, S to 64-bit GPR) throughput 3.986068 | |
FCVTMU (scalar, D to 32-bit GPR) 1->2 roundtrip 9.969303 | |
FCVTMU (scalar, D to 32-bit GPR) throughput 2.156926 | |
FCVTMU (scalar, D to 64-bit GPR) 1->2 roundtrip 10.176627 | |
FCVTMU (scalar, D to 64-bit GPR) throughput 2.120289 | |
FCVTMU (scalar, S to S) 1->2 3.154726 | |
FCVTMU (scalar, S to S) throughput 0.990315 | |
FCVTMU (scalar, S to S) 1->2 2.850040 | |
FCVTMU (scalar, S to S) throughput 1.004179 | |
FCVTMU (scalar, D to D) 1->2 3.011799 | |
FCVTMU (scalar, D to D) throughput 1.023759 | |
FCVTMU (scalar, D to D) 1->2 3.019573 | |
FCVTMU (scalar, D to D) throughput 0.996169 | |
FCVTMU (vector, 2S) 1->2 2.965877 | |
FCVTMU (vector, 2S) throughput 1.005303 | |
FCVTMU (vector, 4S) 1->2 3.021353 | |
FCVTMU (vector, 4S) throughput 1.003691 | |
FCVTMU (vector, 2D) 1->2 3.008732 | |
FCVTMU (vector, 2D) throughput 1.006584 | |
FCVTN (vector, 4S) 1->2 3.008211 | |
FCVTN (vector, 4S) throughput 0.967936 | |
FCVTN (vector, 2D) 1->2 3.035143 | |
FCVTN (vector, 2D) throughput 1.011916 | |
FCVTN2 (vector, 4S) 1->1 3.034493 | |
FCVTN2 (vector, 4S) 1->2 3.066599 | |
FCVTN2 (vector, 4S) throughput 0.997495 | |
FCVTN2 (vector, 4S) throughput 0.990466 | |
FCVTN2 (vector, 2D) 1->1 2.930046 | |
FCVTN2 (vector, 2D) 1->2 3.027742 | |
FCVTN2 (vector, 2D) throughput 1.016622 | |
FCVTN2 (vector, 2D) throughput 1.018647 | |
FCVTNS (scalar, S to 32-bit GPR) 1->2 roundtrip 10.106616 | |
FCVTNS (scalar, S to 32-bit GPR) throughput 2.137110 | |
FCVTNS (scalar, S to 64-bit GPR) 1->2 roundtrip 10.169629 | |
FCVTNS (scalar, S to 64-bit GPR) throughput 2.142879 | |
FCVTNS (scalar, D to 32-bit GPR) 1->2 roundtrip 9.910934 | |
FCVTNS (scalar, D to 32-bit GPR) throughput 2.128367 | |
FCVTNS (scalar, D to 64-bit GPR) 1->2 roundtrip 9.935200 | |
FCVTNS (scalar, D to 64-bit GPR) throughput 2.110051 | |
FCVTNS (scalar, S to S) 1->2 3.085578 | |
FCVTNS (scalar, S to S) throughput 1.004500 | |
FCVTNS (scalar, S to S) 1->2 3.021324 | |
FCVTNS (scalar, S to S) throughput 1.010441 | |
FCVTNS (scalar, D to D) 1->2 2.918875 | |
FCVTNS (scalar, D to D) throughput 1.009264 | |
FCVTNS (scalar, D to D) 1->2 3.057155 | |
FCVTNS (scalar, D to D) throughput 1.004902 | |
FCVTNS (vector, 2S) 1->2 3.055068 | |
FCVTNS (vector, 2S) throughput 1.006224 | |
FCVTNS (vector, 4S) 1->2 3.005351 | |
FCVTNS (vector, 4S) throughput 1.005869 | |
FCVTNS (vector, 2D) 1->2 2.998025 | |
FCVTNS (vector, 2D) throughput 1.008698 | |
FCVTNU (scalar, S to 32-bit GPR) 1->2 roundtrip 10.023124 | |
FCVTNU (scalar, S to 32-bit GPR) throughput 2.168133 | |
FCVTNU (scalar, S to 64-bit GPR) 1->2 roundtrip 10.064284 | |
FCVTNU (scalar, S to 64-bit GPR) throughput 2.153950 | |
FCVTNU (scalar, D to 32-bit GPR) 1->2 roundtrip 10.088122 | |
FCVTNU (scalar, D to 32-bit GPR) throughput 2.140951 | |
FCVTNU (scalar, D to 64-bit GPR) 1->2 roundtrip 10.031406 | |
FCVTNU (scalar, D to 64-bit GPR) throughput 2.155204 | |
FCVTNU (scalar, S to S) 1->2 3.002529 | |
FCVTNU (scalar, S to S) throughput 0.985668 | |
FCVTNU (scalar, S to S) 1->2 3.015989 | |
FCVTNU (scalar, S to S) throughput 1.007821 | |
FCVTNU (scalar, D to D) 1->2 2.956014 | |
FCVTNU (scalar, D to D) throughput 1.012010 | |
FCVTNU (scalar, D to D) 1->2 3.004490 | |
FCVTNU (scalar, D to D) throughput 1.007665 | |
FCVTNU (vector, 2S) 1->2 3.009386 | |
FCVTNU (vector, 2S) throughput 1.009267 | |
FCVTNU (vector, 4S) 1->2 3.013269 | |
FCVTNU (vector, 4S) throughput 1.005183 | |
FCVTNU (vector, 2D) 1->2 2.994656 | |
FCVTNU (vector, 2D) throughput 1.011827 | |
FCVTPS (scalar, S to 32-bit GPR) 1->2 roundtrip 10.115852 | |
FCVTPS (scalar, S to 32-bit GPR) throughput 2.119694 | |
FCVTPS (scalar, S to 64-bit GPR) 1->2 roundtrip 10.060751 | |
FCVTPS (scalar, S to 64-bit GPR) throughput 2.139101 | |
FCVTPS (scalar, D to 32-bit GPR) 1->2 roundtrip 10.129125 | |
FCVTPS (scalar, D to 32-bit GPR) throughput 2.135400 | |
FCVTPS (scalar, D to 64-bit GPR) 1->2 roundtrip 9.901799 | |
FCVTPS (scalar, D to 64-bit GPR) throughput 2.144636 | |
FCVTPS (scalar, S to S) 1->2 3.054144 | |
FCVTPS (scalar, S to S) throughput 1.016800 | |
FCVTPS (scalar, S to S) 1->2 2.970039 | |
FCVTPS (scalar, S to S) throughput 1.015167 | |
FCVTPS (scalar, D to D) 1->2 2.992329 | |
FCVTPS (scalar, D to D) throughput 1.007597 | |
FCVTPS (scalar, D to D) 1->2 3.011568 | |
FCVTPS (scalar, D to D) throughput 0.993552 | |
FCVTPS (vector, 2S) 1->2 2.996585 | |
FCVTPS (vector, 2S) throughput 1.027355 | |
FCVTPS (vector, 4S) 1->2 2.876506 | |
FCVTPS (vector, 4S) throughput 1.009946 | |
FCVTPS (vector, 2D) 1->2 3.018495 | |
FCVTPS (vector, 2D) throughput 1.004521 | |
FCVTPU (scalar, S to 32-bit GPR) 1->2 roundtrip 10.102810 | |
FCVTPU (scalar, S to 32-bit GPR) throughput 2.139521 | |
FCVTPU (scalar, S to 64-bit GPR) 1->2 roundtrip 10.057561 | |
FCVTPU (scalar, S to 64-bit GPR) throughput 2.137785 | |
FCVTPU (scalar, D to 32-bit GPR) 1->2 roundtrip 10.068706 | |
FCVTPU (scalar, D to 32-bit GPR) throughput 2.118206 | |
FCVTPU (scalar, D to 64-bit GPR) 1->2 roundtrip 10.078257 | |
FCVTPU (scalar, D to 64-bit GPR) throughput 2.136209 | |
FCVTPU (scalar, S to S) 1->2 3.019607 | |
FCVTPU (scalar, S to S) throughput 0.995515 | |
FCVTPU (scalar, S to S) 1->2 3.023934 | |
FCVTPU (scalar, S to S) throughput 1.005547 | |
FCVTPU (scalar, D to D) 1->2 2.996166 | |
FCVTPU (scalar, D to D) throughput 1.012636 | |
FCVTPU (scalar, D to D) 1->2 3.068843 | |
FCVTPU (scalar, D to D) throughput 1.025063 | |
FCVTPU (vector, 2S) 1->2 2.998772 | |
FCVTPU (vector, 2S) throughput 1.007055 | |
FCVTPU (vector, 4S) 1->2 3.064272 | |
FCVTPU (vector, 4S) throughput 1.007256 | |
FCVTPU (vector, 2D) 1->2 2.999999 | |
FCVTPU (vector, 2D) throughput 1.000789 | |
FCVTXN (scalar, D) 1->2 3.036707 | |
FCVTXN (scalar, D) throughput 1.010853 | |
FCVTXN (vector, 2D) 1->2 3.044755 | |
FCVTXN (vector, 2D) throughput 1.005045 | |
FCVTXN2 (vector, 2D) 1->1 3.021704 | |
FCVTXN2 (vector, 2D) 1->2 3.028494 | |
FCVTXN2 (vector, 2D) throughput 0.996499 | |
FCVTXN2 (vector, 2D) throughput 0.978953 | |
FCVTZS (scalar, integer, S to 32-bit GPR) 1->2 roundtrip 9.957923 | |
FCVTZS (scalar, integer, S to 32-bit GPR) throughput 2.117176 | |
FCVTZS (scalar, integer, S to 64-bit GPR) 1->2 roundtrip 10.053207 | |
FCVTZS (scalar, integer, S to 64-bit GPR) throughput 2.145150 | |
FCVTZS (scalar, integer, D to 32-bit GPR) 1->2 roundtrip 9.974343 | |
FCVTZS (scalar, integer, D to 32-bit GPR) throughput 2.155256 | |
FCVTZS (scalar, integer, D to 64-bit GPR) 1->2 roundtrip 10.107432 | |
FCVTZS (scalar, integer, D to 64-bit GPR) throughput 2.169202 | |
FCVTZS (scalar, integer, S to S) 1->2 3.020957 | |
FCVTZS (scalar, integer, S to S) throughput 1.002141 | |
FCVTZS (scalar, integer, S to S) 1->2 3.049770 | |
FCVTZS (scalar, integer, S to S) throughput 1.007672 | |
FCVTZS (scalar, integer, D to D) 1->2 3.025703 | |
FCVTZS (scalar, integer, D to D) throughput 0.991487 | |
FCVTZS (scalar, integer, D to D) 1->2 3.029313 | |
FCVTZS (scalar, integer, D to D) throughput 1.011726 | |
FCVTZS (vector, integer, 2S) 1->2 3.037974 | |
FCVTZS (vector, integer, 2S) throughput 1.002672 | |
FCVTZS (vector, integer, 4S) 1->2 3.008892 | |
FCVTZS (vector, integer, 4S) throughput 1.010439 | |
FCVTZS (vector, integer, 2D) 1->2 2.977815 | |
FCVTZS (vector, integer, 2D) throughput 1.005455 | |
FCVTZS (scalar, fixed-point, S to 32-bit GPR) 1->2 roundtrip 10.117074 | |
FCVTZS (scalar, fixed-point, S to 32-bit GPR) throughput 2.115324 | |
FCVTZS (scalar, fixed-point, S to 64-bit GPR) 1->2 roundtrip 10.090627 | |
FCVTZS (scalar, fixed-point, S to 64-bit GPR) throughput 2.128979 | |
FCVTZS (scalar, fixed-point, D to 32-bit GPR) 1->2 roundtrip 9.970467 | |
FCVTZS (scalar, fixed-point, D to 32-bit GPR) throughput 2.139603 | |
FCVTZS (scalar, fixed-point, D to 64-bit GPR) 1->2 roundtrip 10.108589 | |
FCVTZS (scalar, fixed-point, D to 64-bit GPR) throughput 2.136684 | |
FCVTZS (scalar, fixed-point, S to S) 1->2 2.967562 | |
FCVTZS (scalar, fixed-point, S to S) throughput 0.991106 | |
FCVTZS (scalar, fixed-point, S to S) 1->2 3.006859 | |
FCVTZS (scalar, fixed-point, S to S) throughput 1.010855 | |
FCVTZS (scalar, fixed-point, D to D) 1->2 3.008519 | |
FCVTZS (scalar, fixed-point, D to D) throughput 1.004345 | |
FCVTZS (scalar, fixed-point, D to D) 1->2 3.025204 | |
FCVTZS (scalar, fixed-point, D to D) throughput 1.011902 | |
FCVTZS (vector, fixed-point, 2S) 1->2 3.010901 | |
FCVTZS (vector, fixed-point, 2S) throughput 1.002275 | |
FCVTZS (vector, fixed-point, 4S) 1->2 3.027401 | |
FCVTZS (vector, fixed-point, 4S) throughput 1.021032 | |
FCVTZS (vector, fixed-point, 2D) 1->2 3.021803 | |
FCVTZS (vector, fixed-point, 2D) throughput 1.013874 | |
FCVTZU (scalar, integer, S to 32-bit GPR) 1->2 roundtrip 10.127905 | |
FCVTZU (scalar, integer, S to 32-bit GPR) throughput 2.144489 | |
FCVTZU (scalar, integer, S to 64-bit GPR) 1->2 roundtrip 9.924366 | |
FCVTZU (scalar, integer, S to 64-bit GPR) throughput 2.145475 | |
FCVTZU (scalar, integer, D to 32-bit GPR) 1->2 roundtrip 10.004019 | |
FCVTZU (scalar, integer, D to 32-bit GPR) throughput 2.144170 | |
FCVTZU (scalar, integer, D to 64-bit GPR) 1->2 roundtrip 10.009939 | |
FCVTZU (scalar, integer, D to 64-bit GPR) throughput 2.137347 | |
FCVTZU (scalar, integer, S to S) 1->2 2.996139 | |
FCVTZU (scalar, integer, S to S) throughput 0.989946 | |
FCVTZU (scalar, integer, S to S) 1->2 3.028813 | |
FCVTZU (scalar, integer, S to S) throughput 1.006595 | |
FCVTZU (scalar, integer, D to D) 1->2 2.997894 | |
FCVTZU (scalar, integer, D to D) throughput 1.008997 | |
FCVTZU (scalar, integer, D to D) 1->2 3.009142 | |
FCVTZU (scalar, integer, D to D) throughput 1.016126 | |
FCVTZU (vector, integer, 2S) 1->2 3.001444 | |
FCVTZU (vector, integer, 2S) throughput 1.027382 | |
FCVTZU (vector, integer, 4S) 1->2 3.062963 | |
FCVTZU (vector, integer, 4S) throughput 0.987467 | |
FCVTZU (vector, integer, 2D) 1->2 3.003308 | |
FCVTZU (vector, integer, 2D) throughput 1.010305 | |
FCVTZU (scalar, fixed-point, S to 32-bit GPR) 1->2 roundtrip 10.035348 | |
FCVTZU (scalar, fixed-point, S to 32-bit GPR) throughput 2.148109 | |
FCVTZU (scalar, fixed-point, S to 64-bit GPR) 1->2 roundtrip 10.132975 | |
FCVTZU (scalar, fixed-point, S to 64-bit GPR) throughput 2.143680 | |
FCVTZU (scalar, fixed-point, D to 32-bit GPR) 1->2 roundtrip 8.016477 | |
FCVTZU (scalar, fixed-point, D to 32-bit GPR) throughput 2.093145 | |
FCVTZU (scalar, fixed-point, D to 64-bit GPR) 1->2 roundtrip 9.948674 | |
FCVTZU (scalar, fixed-point, D to 64-bit GPR) throughput 2.133497 | |
FCVTZU (scalar, fixed-point, S to S) 1->2 2.975704 | |
FCVTZU (scalar, fixed-point, S to S) throughput 1.010735 | |
FCVTZU (scalar, fixed-point, S to S) 1->2 3.024078 | |
FCVTZU (scalar, fixed-point, S to S) throughput 1.004389 | |
FCVTZU (scalar, fixed-point, D to D) 1->2 3.055684 | |
FCVTZU (scalar, fixed-point, D to D) throughput 1.007572 | |
FCVTZU (scalar, fixed-point, D to D) 1->2 2.957352 | |
FCVTZU (scalar, fixed-point, D to D) throughput 1.000410 | |
FCVTZU (vector, fixed-point, 2S) 1->2 3.025273 | |
FCVTZU (vector, fixed-point, 2S) throughput 1.008710 | |
FCVTZU (vector, fixed-point, 4S) 1->2 3.004430 | |
FCVTZU (vector, fixed-point, 4S) throughput 0.896252 | |
FCVTZU (vector, fixed-point, 2D) 1->2 3.008015 | |
FCVTZU (vector, fixed-point, 2D) throughput 1.008929 | |
FDIV (vector, 2S) 1->2 11.883940 | |
FDIV (vector, 2S) 1->3 12.123440 | |
FDIV (vector, 2S) throughput 11.168964 | |
FDIV (vector, 4S) 1->2 23.266495 | |
FDIV (vector, 4S) 1->3 23.078169 | |
FDIV (vector, 4S) throughput 21.705846 | |
FDIV (vector, 2D) 1->2 37.311787 | |
FDIV (vector, 2D) 1->3 36.626648 | |
FDIV (vector, 2D) throughput 33.593491 | |
FDIV (scalar, S) 1->2 11.873748 | |
FDIV (scalar, S) 1->3 11.978797 | |
FDIV (scalar, S) throughput 10.935595 | |
FDIV (scalar, D) 1->2 19.266442 | |
FDIV (scalar, D) 1->3 18.849536 | |
FDIV (scalar, D) throughput 18.058017 | |
FMADD (scalar, S) 1->2 4.082473 | |
FMADD (scalar, S) 1->3 4.741659 | |
FMADD (scalar, S) 1->4 4.070124 | |
FMADD (scalar, S) throughput 1.003896 | |
FMADD (scalar, D) 1->2 4.034459 | |
FMADD (scalar, D) 1->3 4.000263 | |
FMADD (scalar, D) 1->4 3.996803 | |
FMADD (scalar, D) throughput 1.007117 | |
FMAX (vector, 2S) 1->2 2.058751 | |
FMAX (vector, 2S) 1->3 1.994328 | |
FMAX (vector, 2S) throughput 0.997165 | |
FMAX (vector, 4S) 1->2 2.017198 | |
FMAX (vector, 4S) 1->3 1.968115 | |
FMAX (vector, 4S) throughput 0.864791 | |
FMAX (vector, 2D) 1->2 2.043467 | |
FMAX (vector, 2D) 1->3 1.952274 | |
FMAX (vector, 2D) throughput 1.009664 | |
FMAX (scalar, S) 1->2 1.994350 | |
FMAX (scalar, S) 1->3 2.029510 | |
FMAX (scalar, S) throughput 1.088810 | |
FMAX (scalar, D) 1->2 1.998907 | |
FMAX (scalar, D) 1->3 2.024255 | |
FMAX (scalar, D) throughput 0.993129 | |
FMAXNM (vector, 2S) 1->2 1.998202 | |
FMAXNM (vector, 2S) 1->3 2.006422 | |
FMAXNM (vector, 2S) throughput 1.011905 | |
FMAXNM (vector, 4S) 1->2 2.006369 | |
FMAXNM (vector, 4S) 1->3 1.996996 | |
FMAXNM (vector, 4S) throughput 1.007701 | |
FMAXNM (vector, 2D) 1->2 1.993662 | |
FMAXNM (vector, 2D) 1->3 2.020417 | |
FMAXNM (vector, 2D) throughput 0.983012 | |
FMAXNM (scalar, S) 1->2 2.031086 | |
FMAXNM (scalar, S) 1->3 2.010665 | |
FMAXNM (scalar, S) throughput 1.008022 | |
FMAXNM (scalar, D) 1->2 2.018790 | |
FMAXNM (scalar, D) 1->3 2.007214 | |
FMAXNM (scalar, D) throughput 1.011180 | |
FMAXNMP (scalar) 1->2 2.039425 | |
FMAXNMP (scalar) throughput 0.989602 | |
FMAXNMP (vector, 2S) 1->2 2.001535 | |
FMAXNMP (vector, 2S) 1->3 1.994370 | |
FMAXNMP (vector, 2S) throughput 1.003765 | |
FMAXNMP (vector, 4S) 1->2 1.950738 | |
FMAXNMP (vector, 4S) 1->3 1.999036 | |
FMAXNMP (vector, 4S) throughput 1.002633 | |
FMAXNMP (vector, 2D) 1->2 2.013135 | |
FMAXNMP (vector, 2D) 1->3 2.027319 | |
FMAXNMP (vector, 2D) throughput 1.001035 | |
FMAXNMV (4S) 1->2 3.055475 | |
FMAXNMV (4S) throughput 0.992706 | |
FMAXP (scalar) 1->2 2.024463 | |
FMAXP (scalar) throughput 1.009954 | |
FMAXP (vector, 2S) 1->2 1.999738 | |
FMAXP (vector, 2S) 1->3 1.994195 | |
FMAXP (vector, 2S) throughput 0.985423 | |
FMAXP (vector, 4S) 1->2 2.015513 | |
FMAXP (vector, 4S) 1->3 1.968863 | |
FMAXP (vector, 4S) throughput 1.006171 | |
FMAXP (vector, 2D) 1->2 1.991650 | |
FMAXP (vector, 2D) 1->3 2.023224 | |
FMAXP (vector, 2D) throughput 1.010471 | |
FMAXV (4S) 1->2 3.008885 | |
FMAXV (4S) throughput 1.007646 | |
FMIN (vector, 2S) 1->2 1.999013 | |
FMIN (vector, 2S) 1->3 2.005168 | |
FMIN (vector, 2S) throughput 1.019480 | |
FMIN (vector, 4S) 1->2 2.001579 | |
FMIN (vector, 4S) 1->3 1.993973 | |
FMIN (vector, 4S) throughput 1.014117 | |
FMIN (vector, 2D) 1->2 2.008661 | |
FMIN (vector, 2D) 1->3 1.992335 | |
FMIN (vector, 2D) throughput 1.007207 | |
FMIN (scalar, S) 1->2 1.954927 | |
FMIN (scalar, S) 1->3 2.038642 | |
FMIN (scalar, S) throughput 1.007022 | |
FMIN (scalar, D) 1->2 2.000483 | |
FMIN (scalar, D) 1->3 2.022813 | |
FMIN (scalar, D) throughput 0.996324 | |
FMINNM (vector, 2S) 1->2 2.008665 | |
FMINNM (vector, 2S) 1->3 1.935620 | |
FMINNM (vector, 2S) throughput 1.012933 | |
FMINNM (vector, 4S) 1->2 1.998422 | |
FMINNM (vector, 4S) 1->3 2.035447 | |
FMINNM (vector, 4S) throughput 1.008743 | |
FMINNM (vector, 2D) 1->2 2.024284 | |
FMINNM (vector, 2D) 1->3 1.973579 | |
FMINNM (vector, 2D) throughput 1.029452 | |
FMINNM (scalar, S) 1->2 1.982217 | |
FMINNM (scalar, S) 1->3 2.010575 | |
FMINNM (scalar, S) throughput 0.992056 | |
FMINNM (scalar, D) 1->2 2.036164 | |
FMINNM (scalar, D) 1->3 1.994406 | |
FMINNM (scalar, D) throughput 1.008352 | |
FMINNMP (scalar) 1->2 1.999873 | |
FMINNMP (scalar) throughput 1.004880 | |
FMINNMP (vector, 2S) 1->2 2.017897 | |
FMINNMP (vector, 2S) 1->3 2.016706 | |
FMINNMP (vector, 2S) throughput 1.009574 | |
FMINNMP (vector, 4S) 1->2 2.013970 | |
FMINNMP (vector, 4S) 1->3 1.971694 | |
FMINNMP (vector, 4S) throughput 1.016926 | |
FMINNMP (vector, 2D) 1->2 2.016093 | |
FMINNMP (vector, 2D) 1->3 2.020953 | |
FMINNMP (vector, 2D) throughput 0.988013 | |
FMINNMV (4S) 1->2 3.373177 | |
FMINNMV (4S) throughput 0.920609 | |
FMINP (scalar) 1->2 1.999483 | |
FMINP (scalar) throughput 1.014325 | |
FMINP (vector, 2S) 1->2 2.035823 | |
FMINP (vector, 2S) 1->3 2.014370 | |
FMINP (vector, 2S) throughput 1.007597 | |
FMINP (vector, 4S) 1->2 2.007389 | |
FMINP (vector, 4S) 1->3 2.014145 | |
FMINP (vector, 4S) throughput 1.028193 | |
FMINP (vector, 2D) 1->2 2.000921 | |
FMINP (vector, 2D) 1->3 2.005968 | |
FMINP (vector, 2D) throughput 1.005843 | |
FMINV (4S) 1->2 3.029391 | |
FMINV (4S) throughput 1.010122 | |
FMLA (by element, S) 1->1 4.006175 | |
FMLA (by element, S) 1->2 4.984882 | |
FMLA (by element, S) 1->3 3.991444 | |
FMLA (by element, S) throughput 1.006591 | |
FMLA (by element, S) throughput 1.010021 | |
FMLA (by element, D) 1->1 3.975259 | |
FMLA (by element, D) 1->2 3.940885 | |
FMLA (by element, D) 1->3 4.038907 | |
FMLA (by element, D) throughput 1.008316 | |
FMLA (by element, D) throughput 0.991591 | |
FMLA (by element, 2S) 1->1 4.037765 | |
FMLA (by element, 2S) 1->2 3.995187 | |
FMLA (by element, 2S) 1->3 3.998575 | |
FMLA (by element, 2S) throughput 1.012940 | |
FMLA (by element, 2S) throughput 0.970028 | |
FMLA (by element, 4S) 1->1 3.940953 | |
FMLA (by element, 4S) 1->2 4.026599 | |
FMLA (by element, 4S) 1->3 4.010439 | |
FMLA (by element, 4S) throughput 1.008649 | |
FMLA (by element, 4S) throughput 1.004096 | |
FMLA (by element, 2D) 1->1 4.037325 | |
FMLA (by element, 2D) 1->2 3.922274 | |
FMLA (by element, 2D) 1->3 4.004814 | |
FMLA (by element, 2D) throughput 1.007994 | |
FMLA (by element, 2D) throughput 1.004828 | |
FMLA (vector, 2S) 1->1 3.828071 | |
FMLA (vector, 2S) 1->2 3.854801 | |
FMLA (vector, 2S) 1->3 4.049547 | |
FMLA (vector, 2S) throughput 1.014533 | |
FMLA (vector, 2S) throughput 1.012293 | |
FMLA (vector, 4S) 1->1 4.011280 | |
FMLA (vector, 4S) 1->2 4.148114 | |
FMLA (vector, 4S) 1->3 4.065493 | |
FMLA (vector, 4S) throughput 0.985554 | |
FMLA (vector, 4S) throughput 0.991961 | |
FMLA (vector, 2D) 1->1 4.034901 | |
FMLA (vector, 2D) 1->2 4.042159 | |
FMLA (vector, 2D) 1->3 3.985739 | |
FMLA (vector, 2D) throughput 1.051972 | |
FMLA (vector, 2D) throughput 0.997011 | |
FMLS (by element, S) 1->1 3.989459 | |
FMLS (by element, S) 1->2 4.018233 | |
FMLS (by element, S) 1->3 3.999159 | |
FMLS (by element, S) throughput 1.007718 | |
FMLS (by element, S) throughput 1.012744 | |
FMLS (by element, D) 1->1 4.013681 | |
FMLS (by element, D) 1->2 3.992698 | |
FMLS (by element, D) 1->3 3.994065 | |
FMLS (by element, D) throughput 1.003612 | |
FMLS (by element, D) throughput 0.995715 | |
FMLS (by element, 2S) 1->1 4.312570 | |
FMLS (by element, 2S) 1->2 3.991247 | |
FMLS (by element, 2S) 1->3 3.963227 | |
FMLS (by element, 2S) throughput 1.009872 | |
FMLS (by element, 2S) throughput 1.005650 | |
FMLS (by element, 4S) 1->1 4.026918 | |
FMLS (by element, 4S) 1->2 3.963492 | |
FMLS (by element, 4S) 1->3 4.041039 | |
FMLS (by element, 4S) throughput 1.001368 | |
FMLS (by element, 4S) throughput 0.997427 | |
FMLS (by element, 2D) 1->1 3.992732 | |
FMLS (by element, 2D) 1->2 4.022931 | |
FMLS (by element, 2D) 1->3 4.010768 | |
FMLS (by element, 2D) throughput 1.015805 | |
FMLS (by element, 2D) throughput 1.009735 | |
FMLS (vector, 2S) 1->1 4.037870 | |
FMLS (vector, 2S) 1->2 4.060799 | |
FMLS (vector, 2S) 1->3 4.012879 | |
FMLS (vector, 2S) throughput 1.008993 | |
FMLS (vector, 2S) throughput 1.012918 | |
FMLS (vector, 4S) 1->1 4.016206 | |
FMLS (vector, 4S) 1->2 4.003382 | |
FMLS (vector, 4S) 1->3 4.106987 | |
FMLS (vector, 4S) throughput 1.018002 | |
FMLS (vector, 4S) throughput 1.012029 | |
FMLS (vector, 2D) 1->1 4.001770 | |
FMLS (vector, 2D) 1->2 4.009655 | |
FMLS (vector, 2D) 1->3 4.003818 | |
FMLS (vector, 2D) throughput 1.009772 | |
FMLS (vector, 2D) throughput 1.003348 | |
FMOV (S to 32-bit GPR) 1->2 roundtrip 7.030600 | |
FMOV (S to 32-bit GPR) throughput 1.006643 | |
FMOV (D to 64-bit GPR) 1->2 roundtrip 7.084774 | |
FMOV (D to 64-bit GPR) throughput 0.997339 | |
FMOV (D[1] to 64-bit GPR) 1->2 roundtrip 7.048773 | |
FMOV (D[1] to 64-bit GPR) throughput 1.010402 | |
FMOV (S from 32-bit GPR) 1->2 roundtrip 7.396586 | |
FMOV (S from 32-bit GPR) throughput 1.001215 | |
FMOV (D from 64-bit GPR) 1->2 roundtrip 6.888423 | |
FMOV (D from 64-bit GPR) throughput 1.012986 | |
FMOV (D[1] from 64-bit GPR) 1->2 roundtrip 9.024353 | |
FMOV (D[1] from 64-bit GPR) throughput 1.014697 | |
FMSUB (scalar, S) 1->2 4.082884 | |
FMSUB (scalar, S) 1->3 3.978226 | |
FMSUB (scalar, S) 1->4 3.994261 | |
FMSUB (scalar, S) throughput 1.006920 | |
FMSUB (scalar, D) 1->2 4.038553 | |
FMSUB (scalar, D) 1->3 4.129726 | |
FMSUB (scalar, D) 1->4 4.038282 | |
FMSUB (scalar, D) throughput 1.038617 | |
FMUL (by element, 2S) 1->2 4.018129 | |
FMUL (by element, 2S) 1->3 4.048136 | |
FMUL (by element, 2S) throughput 1.009439 | |
FMUL (by element, 4S) 1->2 4.031579 | |
FMUL (by element, 4S) 1->3 4.048261 | |
FMUL (by element, 4S) throughput 1.007131 | |
FMUL (by element, 2D) 1->2 4.027147 | |
FMUL (by element, 2D) 1->3 4.026787 | |
FMUL (by element, 2D) throughput 1.004884 | |
FMUL (vector, 2S) 1->2 4.064166 | |
FMUL (vector, 2S) 1->3 3.959252 | |
FMUL (vector, 2S) throughput 1.009615 | |
FMUL (vector, 4S) 1->2 3.982433 | |
FMUL (vector, 4S) 1->3 4.002582 | |
FMUL (vector, 4S) throughput 1.085657 | |
FMUL (vector, 2D) 1->2 3.999198 | |
FMUL (vector, 2D) 1->3 3.865610 | |
FMUL (vector, 2D) throughput 1.019729 | |
FMUL (scalar, S) 1->2 4.055596 | |
FMUL (scalar, S) 1->3 4.038593 | |
FMUL (scalar, S) throughput 1.009986 | |
FMUL (scalar, D) 1->2 3.956153 | |
FMUL (scalar, D) 1->3 4.018195 | |
FMUL (scalar, D) throughput 1.007575 | |
FMULX (vector, 2S) 1->2 3.988963 | |
FMULX (vector, 2S) 1->3 4.043888 | |
FMULX (vector, 2S) throughput 0.999541 | |
FMULX (vector, 4S) 1->2 3.991698 | |
FMULX (vector, 4S) 1->3 4.046189 | |
FMULX (vector, 4S) throughput 0.994239 | |
FMULX (vector, 2D) 1->2 4.040918 | |
FMULX (vector, 2D) 1->3 3.994809 | |
FMULX (vector, 2D) throughput 1.019513 | |
FMULX (scalar, S) 1->2 3.989386 | |
FMULX (scalar, S) 1->3 4.005870 | |
FMULX (scalar, S) throughput 1.015183 | |
FMULX (scalar, D) 1->2 3.999210 | |
FMULX (scalar, D) 1->3 4.042303 | |
FMULX (scalar, D) throughput 1.006547 | |
FMULX (by element, 2S) 1->2 4.011035 | |
FMULX (by element, 2S) 1->3 4.024376 | |
FMULX (by element, 2S) throughput 1.011603 | |
FMULX (by element, 4S) 1->2 4.006432 | |
FMULX (by element, 4S) 1->3 4.246269 | |
FMULX (by element, 4S) throughput 0.997508 | |
FMULX (by element, 2D) 1->2 4.080346 | |
FMULX (by element, 2D) 1->3 4.030396 | |
FMULX (by element, 2D) throughput 1.000696 | |
FNEG (scalar, S) 1->2 2.006492 | |
FNEG (scalar, S) throughput 1.004427 | |
FNEG (scalar, D) 1->2 2.023274 | |
FNEG (scalar, D) throughput 1.010066 | |
FNEG (vector, 2S) 1->2 2.004948 | |
FNEG (vector, 2S) throughput 0.994966 | |
FNEG (vector, 4S) 1->2 2.007939 | |
FNEG (vector, 4S) throughput 0.998960 | |
FNEG (vector, 2D) 1->2 2.003487 | |
FNEG (vector, 2D) throughput 0.984485 | |
FNMADD (scalar, S) 1->2 4.233963 | |
FNMADD (scalar, S) 1->3 3.927177 | |
FNMADD (scalar, S) 1->4 4.043547 | |
FNMADD (scalar, S) throughput 0.992799 | |
FNMADD (scalar, D) 1->2 4.027616 | |
FNMADD (scalar, D) 1->3 4.014648 | |
FNMADD (scalar, D) 1->4 4.055288 | |
FNMADD (scalar, D) throughput 1.014207 | |
FNMSUB (scalar, S) 1->2 3.433737 | |
FNMSUB (scalar, S) 1->3 4.060290 | |
FNMSUB (scalar, S) 1->4 3.996146 | |
FNMSUB (scalar, S) throughput 1.009507 | |
FNMSUB (scalar, D) 1->2 4.016078 | |
FNMSUB (scalar, D) 1->3 4.114792 | |
FNMSUB (scalar, D) 1->4 3.925991 | |
FNMSUB (scalar, D) throughput 1.034054 | |
FNMUL (scalar, S) 1->2 4.166939 | |
FNMUL (scalar, S) 1->3 4.342061 | |
FNMUL (scalar, S) throughput 0.996482 | |
FNMUL (scalar, D) 1->2 4.013942 | |
FNMUL (scalar, D) 1->3 4.003179 | |
FNMUL (scalar, D) throughput 0.998093 | |
FRECPE (scalar, S) 1->2 3.997818 | |
FRECPE (scalar, S) throughput 1.009859 | |
FRECPE (scalar, D) 1->2 3.950194 | |
FRECPE (scalar, D) throughput 1.011271 | |
FRECPE (vector, 2S) 1->2 4.041366 | |
FRECPE (vector, 2S) throughput 0.988755 | |
FRECPE (vector, 4S) 1->2 3.990729 | |
FRECPE (vector, 4S) throughput 1.035258 | |
FRECPE (vector, 2D) 1->2 3.997526 | |
FRECPE (vector, 2D) throughput 1.008873 | |
FRECPS (vector, 2S) 1->2 4.037676 | |
FRECPS (vector, 2S) 1->3 4.011194 | |
FRECPS (vector, 2S) throughput 1.002013 | |
FRECPS (vector, 4S) 1->2 3.998033 | |
FRECPS (vector, 4S) 1->3 3.998490 | |
FRECPS (vector, 4S) throughput 1.000093 | |
FRECPS (vector, 2D) 1->2 4.068930 | |
FRECPS (vector, 2D) 1->3 4.032468 | |
FRECPS (vector, 2D) throughput 1.004159 | |
FRECPS (scalar, S) 1->2 4.061499 | |
FRECPS (scalar, S) 1->3 4.015724 | |
FRECPS (scalar, S) throughput 1.014283 | |
FRECPS (scalar, D) 1->2 4.014294 | |
FRECPS (scalar, D) 1->3 4.060255 | |
FRECPS (scalar, D) throughput 1.006518 | |
FRECPX (scalar, S) 1->2 4.011629 | |
FRECPX (scalar, S) throughput 1.006801 | |
FRECPX (scalar, D) 1->2 4.016711 | |
FRECPX (scalar, D) throughput 1.005877 | |
FRINTA (scalar, S) 1->2 2.992317 | |
FRINTA (scalar, S) throughput 1.003425 | |
FRINTA (scalar, D) 1->2 3.019803 | |
FRINTA (scalar, D) throughput 1.016749 | |
FRINTA (vector, 2S) 1->2 3.009917 | |
FRINTA (vector, 2S) throughput 1.002605 | |
FRINTA (vector, 4S) 1->2 3.009200 | |
FRINTA (vector, 4S) throughput 1.014306 | |
FRINTA (vector, 2D) 1->2 2.971849 | |
FRINTA (vector, 2D) throughput 1.011259 | |
FRINTI (scalar, S) 1->2 3.016120 | |
FRINTI (scalar, S) throughput 0.993472 | |
FRINTI (scalar, D) 1->2 2.988140 | |
FRINTI (scalar, D) throughput 0.996758 | |
FRINTI (vector, 2S) 1->2 3.032104 | |
FRINTI (vector, 2S) throughput 0.997301 | |
FRINTI (vector, 4S) 1->2 2.984252 | |
FRINTI (vector, 4S) throughput 1.010048 | |
FRINTI (vector, 2D) 1->2 3.022348 | |
FRINTI (vector, 2D) throughput 1.004951 | |
FRINTM (scalar, S) 1->2 2.961350 | |
FRINTM (scalar, S) throughput 1.014867 | |
FRINTM (scalar, D) 1->2 2.986126 | |
FRINTM (scalar, D) throughput 1.002337 | |
FRINTM (vector, 2S) 1->2 3.030927 | |
FRINTM (vector, 2S) throughput 1.014066 | |
FRINTM (vector, 4S) 1->2 3.001897 | |
FRINTM (vector, 4S) throughput 0.989276 | |
FRINTM (vector, 2D) 1->2 3.034166 | |
FRINTM (vector, 2D) throughput 1.012167 | |
FRINTN (scalar, S) 1->2 2.994305 | |
FRINTN (scalar, S) throughput 1.009897 | |
FRINTN (scalar, D) 1->2 2.991695 | |
FRINTN (scalar, D) throughput 1.118566 | |
FRINTN (vector, 2S) 1->2 3.027700 | |
FRINTN (vector, 2S) throughput 1.073678 | |
FRINTN (vector, 4S) 1->2 3.076270 | |
FRINTN (vector, 4S) throughput 1.079775 | |
FRINTN (vector, 2D) 1->2 2.250889 | |
FRINTN (vector, 2D) throughput 1.020851 | |
FRINTP (scalar, S) 1->2 3.095631 | |
FRINTP (scalar, S) throughput 1.034992 | |
FRINTP (scalar, D) 1->2 3.042085 | |
FRINTP (scalar, D) throughput 1.011978 | |
FRINTP (vector, 2S) 1->2 2.998376 | |
FRINTP (vector, 2S) throughput 0.999987 | |
FRINTP (vector, 4S) 1->2 2.939942 | |
FRINTP (vector, 4S) throughput 1.006159 | |
FRINTP (vector, 2D) 1->2 3.027959 | |
FRINTP (vector, 2D) throughput 1.014666 | |
FRINTX (scalar, S) 1->2 2.971286 | |
FRINTX (scalar, S) throughput 1.017913 | |
FRINTX (scalar, D) 1->2 3.008794 | |
FRINTX (scalar, D) throughput 1.009592 | |
FRINTX (vector, 2S) 1->2 3.048071 | |
FRINTX (vector, 2S) throughput 1.003441 | |
FRINTX (vector, 4S) 1->2 2.990690 | |
FRINTX (vector, 4S) throughput 0.985103 | |
FRINTX (vector, 2D) 1->2 3.028110 | |
FRINTX (vector, 2D) throughput 1.052700 | |
FRINTZ (scalar, S) 1->2 3.018930 | |
FRINTZ (scalar, S) throughput 0.995798 | |
FRINTZ (scalar, D) 1->2 3.027587 | |
FRINTZ (scalar, D) throughput 1.007381 | |
FRINTZ (vector, 2S) 1->2 2.994327 | |
FRINTZ (vector, 2S) throughput 1.030017 | |
FRINTZ (vector, 4S) 1->2 2.994568 | |
FRINTZ (vector, 4S) throughput 1.031143 | |
FRINTZ (vector, 2D) 1->2 3.036996 | |
FRINTZ (vector, 2D) throughput 1.013634 | |
FRSQRTE (scalar, S) 1->2 3.995433 | |
FRSQRTE (scalar, S) throughput 0.989990 | |
FRSQRTE (scalar, D) 1->2 4.024469 | |
FRSQRTE (scalar, D) throughput 1.003917 | |
FRSQRTE (vector, 2S) 1->2 4.039535 | |
FRSQRTE (vector, 2S) throughput 1.017098 | |
FRSQRTE (vector, 4S) 1->2 3.996559 | |
FRSQRTE (vector, 4S) throughput 0.996325 | |
FRSQRTE (vector, 2D) 1->2 4.023388 | |
FRSQRTE (vector, 2D) throughput 1.010604 | |
FRSQRTS (vector, 2S) 1->2 4.063842 | |
FRSQRTS (vector, 2S) 1->3 4.025462 | |
FRSQRTS (vector, 2S) throughput 1.008591 | |
FRSQRTS (vector, 4S) 1->2 3.823533 | |
FRSQRTS (vector, 4S) 1->3 4.019797 | |
FRSQRTS (vector, 4S) throughput 1.013359 | |
FRSQRTS (vector, 2D) 1->2 4.074096 | |
FRSQRTS (vector, 2D) 1->3 3.998948 | |
FRSQRTS (vector, 2D) throughput 1.002740 | |
FRSQRTS (scalar, S) 1->2 3.998574 | |
FRSQRTS (scalar, S) 1->3 4.052372 | |
FRSQRTS (scalar, S) throughput 1.017829 | |
FRSQRTS (scalar, D) 1->2 4.011187 | |
FRSQRTS (scalar, D) 1->3 4.056374 | |
FRSQRTS (scalar, D) throughput 1.009142 | |
FSQRT (scalar, S) 1->2 17.139374 | |
FSQRT (scalar, S) throughput 16.229360 | |
FSQRT (scalar, D) 1->2 31.141787 | |
FSQRT (scalar, D) throughput 28.601205 | |
FSQRT (vector, 2S) 1->2 16.866213 | |
FSQRT (vector, 2S) throughput 16.106713 | |
FSQRT (vector, 4S) 1->2 32.078250 | |
FSQRT (vector, 4S) throughput 30.735117 | |
FSQRT (vector, 2D) 1->2 60.562890 | |
FSQRT (vector, 2D) throughput 59.485221 | |
FSUB (vector, 2S) 1->2 4.011366 | |
FSUB (vector, 2S) 1->3 4.024436 | |
FSUB (vector, 2S) throughput 1.009158 | |
FSUB (vector, 4S) 1->2 4.056793 | |
FSUB (vector, 4S) 1->3 4.031407 | |
FSUB (vector, 4S) throughput 1.007814 | |
FSUB (vector, 2D) 1->2 4.015526 | |
FSUB (vector, 2D) 1->3 4.043335 | |
FSUB (vector, 2D) throughput 1.003129 | |
FSUB (scalar, S) 1->2 4.013624 | |
FSUB (scalar, S) 1->3 3.991329 | |
FSUB (scalar, S) throughput 0.992512 | |
FSUB (scalar, D) 1->2 4.046151 | |
FSUB (scalar, D) 1->3 3.991030 | |
FSUB (scalar, D) throughput 1.004966 | |
INS (element, B) 1->1 2.006290 | |
INS (element, B) 1->2 1.994043 | |
INS (element, B) throughput 1.010848 | |
INS (element, B) throughput 0.997193 | |
INS (element, H) 1->1 1.999562 | |
INS (element, H) 1->2 2.027488 | |
INS (element, H) throughput 1.009209 | |
INS (element, H) throughput 1.006922 | |
INS (element, S) 1->1 1.999934 | |
INS (element, S) 1->2 1.995473 | |
INS (element, S) throughput 1.007201 | |
INS (element, S) throughput 1.001724 | |
INS (element, D) 1->1 2.018831 | |
INS (element, D) 1->2 2.022761 | |
INS (element, D) throughput 1.008592 | |
INS (element, D) throughput 0.992961 | |
INS (general, B) 1->1 1.967488 | |
INS (general, B) 1->2 roundtrip 9.053829 | |
INS (general, B) throughput 0.996189 | |
INS (general, H) 1->1 1.995948 | |
INS (general, H) 1->2 roundtrip 9.839381 | |
INS (general, H) throughput 1.009186 | |
INS (general, S) 1->1 1.998664 | |
INS (general, S) 1->2 roundtrip 9.044913 | |
INS (general, S) throughput 0.992849 | |
INS (general, D) 1->1 1.999210 | |
INS (general, D) 1->2 roundtrip 9.086861 | |
INS (general, D) throughput 1.010689 | |
LD1 (multiple, one register, 8B) 1->2 roundtrip 7.094598 | |
LD1 (multiple, one register, 8B) throughput 1.007628 | |
LD1 (multiple, one register, 16B) 1->2 roundtrip 7.106324 | |
LD1 (multiple, one register, 16B) throughput 1.018383 | |
LD1 (multiple, one register, 4H) 1->2 roundtrip 7.088227 | |
LD1 (multiple, one register, 4H) throughput 1.011977 | |
LD1 (multiple, one register, 8H) 1->2 roundtrip 7.069806 | |
LD1 (multiple, one register, 8H) throughput 1.003262 | |
LD1 (multiple, one register, 2S) 1->2 roundtrip 7.114671 | |
LD1 (multiple, one register, 2S) throughput 1.018454 | |
LD1 (multiple, one register, 4S) 1->2 roundtrip 7.097158 | |
LD1 (multiple, one register, 4S) throughput 1.001019 | |
LD1 (multiple, one register, 1D) 1->2 roundtrip 7.076132 | |
LD1 (multiple, one register, 1D) throughput 0.979773 | |
LD1 (multiple, one register, 2D) 1->2 roundtrip 7.150492 | |
LD1 (multiple, one register, 2D) throughput 0.986771 | |
LD1 (multiple, two registers, 8B) 1->3 roundtrip 6.996186 | |
LD1 (multiple, two registers, 8B) 2->3 roundtrip 7.978079 | |
LD1 (multiple, two registers, 8B) throughput 2.017906 | |
LD1 (multiple, two registers, 16B) 1->3 roundtrip 6.975963 | |
LD1 (multiple, two registers, 16B) 2->3 roundtrip 8.001197 | |
LD1 (multiple, two registers, 16B) throughput 1.991158 | |
LD1 (multiple, two registers, 4H) 1->3 roundtrip 7.082772 | |
LD1 (multiple, two registers, 4H) 2->3 roundtrip 7.865790 | |
LD1 (multiple, two registers, 4H) throughput 2.016546 | |
LD1 (multiple, two registers, 8H) 1->3 roundtrip 7.054840 | |
LD1 (multiple, two registers, 8H) 2->3 roundtrip 8.092806 | |
LD1 (multiple, two registers, 8H) throughput 2.031060 | |
LD1 (multiple, two registers, 2S) 1->3 roundtrip 7.190780 | |
LD1 (multiple, two registers, 2S) 2->3 roundtrip 8.197140 | |
LD1 (multiple, two registers, 2S) throughput 2.014596 | |
LD1 (multiple, two registers, 4S) 1->3 roundtrip 7.078812 | |
LD1 (multiple, two registers, 4S) 2->3 roundtrip 8.041236 | |
LD1 (multiple, two registers, 4S) throughput 2.025199 | |
LD1 (multiple, two registers, 1D) 1->3 roundtrip 7.034100 | |
LD1 (multiple, two registers, 1D) 2->3 roundtrip 8.088614 | |
LD1 (multiple, two registers, 1D) throughput 2.070803 | |
LD1 (multiple, two registers, 2D) 1->3 roundtrip 7.150959 | |
LD1 (multiple, two registers, 2D) 2->3 roundtrip 8.071093 | |
LD1 (multiple, two registers, 2D) throughput 2.023146 | |
LD1 (multiple, three registers, 8B) throughput 3.026971 | |
LD1 (multiple, three registers, 16B) throughput 2.999183 | |
LD1 (multiple, three registers, 4H) throughput 2.960124 | |
LD1 (multiple, three registers, 8H) throughput 2.955356 | |
LD1 (multiple, three registers, 2S) throughput 2.993971 | |
LD1 (multiple, three registers, 4S) throughput 3.017486 | |
LD1 (multiple, three registers, 1D) throughput 3.033198 | |
LD1 (multiple, three registers, 2D) throughput 2.945340 | |
LD1 (multiple, four registers, 8B) throughput 4.015912 | |
LD1 (multiple, four registers, 16B) throughput 3.987346 | |
LD1 (multiple, four registers, 4H) throughput 4.033705 | |
LD1 (multiple, four registers, 8H) throughput 4.038455 | |
LD1 (multiple, four registers, 2S) throughput 3.949162 | |
LD1 (multiple, four registers, 4S) throughput 3.914522 | |
LD1 (multiple, four registers, 1D) throughput 4.017619 | |
LD1 (multiple, four registers, 2D) throughput 3.971328 | |
LD1 (single structure, B) 1->2 roundtrip 9.071130 | |
LD1 (single structure, B) throughput 2.009144 | |
LD1 (single structure, H) 1->2 roundtrip 9.009524 | |
LD1 (single structure, H) throughput 2.021958 | |
LD1 (single structure, S) 1->2 roundtrip 9.099139 | |
LD1 (single structure, S) throughput 1.999768 | |
LD1 (single structure, D) 1->2 roundtrip 9.411967 | |
LD1 (single structure, D) throughput 1.982592 | |
LD1R (8B) 1->2 roundtrip 8.523228 | |
LD1R (8B) throughput 1.006268 | |
LD1R (16B) 1->2 roundtrip 8.948784 | |
LD1R (16B) throughput 1.000139 | |
LD1R (4H) 1->2 roundtrip 9.131817 | |
LD1R (4H) throughput 1.008283 | |
LD1R (8H) 1->2 roundtrip 9.104459 | |
LD1R (8H) throughput 1.004553 | |
LD1R (2S) 1->2 roundtrip 9.039798 | |
LD1R (2S) throughput 1.166126 | |
LD1R (4S) 1->2 roundtrip 10.799477 | |
LD1R (4S) throughput 0.973315 | |
LD1R (1D) 1->2 roundtrip 8.873077 | |
LD1R (1D) throughput 0.997192 | |
LD1R (2D) 1->2 roundtrip 9.148959 | |
LD1R (2D) throughput 0.998921 | |
LD2 (multiple structures, 8B) 1->3 roundtrip 9.121930 | |
LD2 (multiple structures, 8B) 2->3 roundtrip 8.994152 | |
LD2 (multiple structures, 8B) throughput 2.012176 | |
LD2 (multiple structures, 16B) 1->3 roundtrip 10.076148 | |
LD2 (multiple structures, 16B) 2->3 roundtrip 11.123065 | |
LD2 (multiple structures, 16B) throughput 1.962271 | |
LD2 (multiple structures, 4H) 1->3 roundtrip 9.090301 | |
LD2 (multiple structures, 4H) 2->3 roundtrip 9.875551 | |
LD2 (multiple structures, 4H) throughput 1.991854 | |
LD2 (multiple structures, 8H) 1->3 roundtrip 10.048960 | |
LD2 (multiple structures, 8H) 2->3 roundtrip 11.112125 | |
LD2 (multiple structures, 8H) throughput 1.891502 | |
LD2 (multiple structures, 2S) 1->3 roundtrip 9.100471 | |
LD2 (multiple structures, 2S) 2->3 roundtrip 9.901684 | |
LD2 (multiple structures, 2S) throughput 2.033195 | |
LD2 (multiple structures, 4S) 1->3 roundtrip 10.164552 | |
LD2 (multiple structures, 4S) 2->3 roundtrip 11.193533 | |
LD2 (multiple structures, 4S) throughput 2.013163 | |
LD2 (multiple structures, 2D) 1->3 roundtrip 10.082453 | |
LD2 (multiple structures, 2D) 2->3 roundtrip 11.165924 | |
LD2 (multiple structures, 2D) throughput 2.014126 | |
LD2 (single structure, B) 1->3 roundtrip 9.013802 | |
LD2 (single structure, B) 2->3 roundtrip 10.203858 | |
LD2 (single structure, B) throughput 1.996736 | |
LD2 (single structure, H) 1->3 roundtrip 9.172477 | |
LD2 (single structure, H) 2->3 roundtrip 10.246846 | |
LD2 (single structure, H) throughput 2.036272 | |
LD2 (single structure, S) 1->3 roundtrip 9.340898 | |
LD2 (single structure, S) 2->3 roundtrip 9.672291 | |
LD2 (single structure, S) throughput 2.011843 | |
LD2 (single structure, D) 1->3 roundtrip 9.107647 | |
LD2 (single structure, D) 2->3 roundtrip 10.040064 | |
LD2 (single structure, D) throughput 2.014421 | |
LD2R (8B) 1->3 roundtrip 8.864812 | |
LD2R (8B) 2->3 roundtrip 10.217025 | |
LD2R (8B) throughput 1.949422 | |
LD2R (16B) 1->3 roundtrip 9.100755 | |
LD2R (16B) 2->3 roundtrip 10.039998 | |
LD2R (16B) throughput 2.026419 | |
LD2R (4H) 1->3 roundtrip 9.057277 | |
LD2R (4H) 2->3 roundtrip 10.046369 | |
LD2R (4H) throughput 2.028164 | |
LD2R (8H) 1->3 roundtrip 9.120667 | |
LD2R (8H) 2->3 roundtrip 10.151903 | |
LD2R (8H) throughput 2.023252 | |
LD2R (2S) 1->3 roundtrip 9.155496 | |
LD2R (2S) 2->3 roundtrip 10.078405 | |
LD2R (2S) throughput 2.008488 | |
LD2R (4S) 1->3 roundtrip 8.331569 | |
LD2R (4S) 2->3 roundtrip 10.163671 | |
LD2R (4S) throughput 2.009089 | |
LD2R (1D) 1->3 roundtrip 7.358907 | |
LD2R (1D) 2->3 roundtrip 10.789044 | |
LD2R (1D) throughput 2.029207 | |
LD2R (2D) 1->3 roundtrip 9.012012 | |
LD2R (2D) 2->3 roundtrip 10.171791 | |
LD2R (2D) throughput 2.030970 | |
LD3 (multiple structures, 8B) throughput 2.993885 | |
LD3 (multiple structures, 16B) throughput 3.132193 | |
LD3 (multiple structures, 4H) throughput 3.029481 | |
LD3 (multiple structures, 8H) throughput 3.184917 | |
LD3 (multiple structures, 2S) throughput 3.025650 | |
LD3 (multiple structures, 4S) throughput 3.169154 | |
LD3 (multiple structures, 2D) throughput 3.157428 | |
LD3 (single structure, B) throughput 3.040378 | |
LD3 (single structure, H) throughput 3.042053 | |
LD3 (single structure, S) throughput 3.020677 | |
LD3 (single structure, D) throughput 3.026556 | |
LD3R (8B) throughput 3.048117 | |
LD3R (16B) throughput 3.003918 | |
LD3R (4H) throughput 2.973801 | |
LD3R (8H) throughput 3.008481 | |
LD3R (2S) throughput 3.174389 | |
LD3R (4S) throughput 3.057643 | |
LD3R (1D) throughput 3.035130 | |
LD3R (2D) throughput 3.026668 | |
LD4 (multiple structures, 8B) throughput 3.956560 | |
LD4 (multiple structures, 16B) throughput 8.062444 | |
LD4 (multiple structures, 4H) throughput 4.069654 | |
LD4 (multiple structures, 8H) throughput 8.066979 | |
LD4 (multiple structures, 2S) throughput 3.999731 | |
LD4 (multiple structures, 4S) throughput 7.948301 | |
LD4 (multiple structures, 2D) throughput 8.060163 | |
LD4 (multiple, 8B, post-index) throughput 4.010692 | |
LD4 (multiple, 16B, post-index) throughput 7.981206 | |
LD4 (multiple, 4H, post-index) throughput 3.962781 | |
LD4 (multiple, 8H, post-index) throughput 8.093958 | |
LD4 (multiple, 2S, post-index) throughput 3.957269 | |
LD4 (multiple, 4S, post-index) throughput 8.088370 | |
LD4 (multiple, 2D, post-index) throughput 8.020004 | |
LD4 (single structure, B) throughput 3.825810 | |
LD4 (single structure, H) throughput 4.008274 | |
LD4 (single structure, S) throughput 4.000988 | |
LD4 (single structure, D) throughput 4.027281 | |
LD4 (single, B, post-index) throughput 3.956026 | |
LD4 (single, H, post-index) throughput 4.041723 | |
LD4 (single, S, post-index) throughput 3.989958 | |
LD4 (single, D, post-index) throughput 4.122687 | |
LD4R (8B) throughput 4.053561 | |
LD4R (16B) throughput 4.037118 | |
LD4R (4H) throughput 4.039982 | |
LD4R (8H) throughput 4.041912 | |
LD4R (2S) throughput 4.023787 | |
LD4R (4S) throughput 4.034545 | |
LD4R (1D) throughput 3.986303 | |
LD4R (2D) throughput 3.977720 | |
LDNP (S) 1->3 roundtrip 7.000762 | |
LDNP (S) 2->3 roundtrip 8.119773 | |
LDNP (S) throughput 1.997603 | |
LDNP (D) 1->3 roundtrip 6.992467 | |
LDNP (D) 2->3 roundtrip 7.865003 | |
LDNP (D) throughput 2.009390 | |
LDNP (Q) 1->3 roundtrip 7.140608 | |
LDNP (Q) 2->3 roundtrip 8.078744 | |
LDNP (Q) throughput 1.970361 | |
LDNP (signed offset, S) 1->3 roundtrip 6.980402 | |
LDNP (signed offset, S) 2->3 roundtrip 8.000654 | |
LDNP (signed offset, S) throughput 2.000679 | |
LDNP (signed offset, D) 1->3 roundtrip 6.897616 | |
LDNP (signed offset, D) 2->3 roundtrip 7.811365 | |
LDNP (signed offset, D) throughput 1.997081 | |
LDNP (signed offset, Q) 1->3 roundtrip 7.085803 | |
LDNP (signed offset, Q) 2->3 roundtrip 11.023365 | |
LDNP (signed offset, Q) throughput 1.947348 | |
LDP (S) 1->3 roundtrip 7.003742 | |
LDP (S) 2->3 roundtrip 8.083244 | |
LDP (S) throughput 2.022976 | |
LDP (D) 1->3 roundtrip 7.130645 | |
LDP (D) 2->3 roundtrip 8.163688 | |
LDP (D) throughput 2.043748 | |
LDP (Q) 1->3 roundtrip 7.064457 | |
LDP (Q) 2->3 roundtrip 8.172987 | |
LDP (Q) throughput 2.018059 | |
LDP (post-index, S) 1->3 roundtrip 7.297353 | |
LDP (post-index, S) 2->3 roundtrip 7.994153 | |
LDP (post-index, S) throughput 2.015806 | |
LDP (post-index, D) 1->3 roundtrip 7.202317 | |
LDP (post-index, D) 2->3 roundtrip 8.219693 | |
LDP (post-index, D) throughput 2.104758 | |
LDP (post-index, Q) 1->3 roundtrip 7.153011 | |
LDP (post-index, Q) 2->3 roundtrip 8.082561 | |
LDP (post-index, Q) throughput 2.079079 | |
LDP (pre-index, S) 1->3 roundtrip 7.059594 | |
LDP (pre-index, S) 2->3 roundtrip 8.324455 | |
LDP (pre-index, S) throughput 2.101429 | |
LDP (pre-index, D) 1->3 roundtrip 7.325863 | |
LDP (pre-index, D) 2->3 roundtrip 8.219350 | |
LDP (pre-index, D) throughput 2.070732 | |
LDP (pre-index, Q) 1->3 roundtrip 7.081176 | |
LDP (pre-index, Q) 2->3 roundtrip 7.872452 | |
LDP (pre-index, Q) throughput 2.113459 | |
LDP (signed offset, S) 1->3 roundtrip 8.130788 | |
LDP (signed offset, S) 2->3 roundtrip 7.815204 | |
LDP (signed offset, S) throughput 2.021555 | |
LDP (signed offset, D) 1->3 roundtrip 7.051319 | |
LDP (signed offset, D) 2->3 roundtrip 8.174056 | |
LDP (signed offset, D) throughput 1.791227 | |
LDP (signed offset, Q) 1->3 roundtrip 6.905685 | |
LDP (signed offset, Q) 2->3 roundtrip 8.148309 | |
LDP (signed offset, Q) throughput 1.998897 | |
LDR (S) 1->2 roundtrip 7.149626 | |
LDR (S) throughput 1.014067 | |
LDR (D) 1->2 roundtrip 7.037952 | |
LDR (D) throughput 1.003665 | |
LDR (Q) 1->2 roundtrip 6.968110 | |
LDR (Q) throughput 1.022553 | |
LDR (post-index, S) 1->2 roundtrip 7.103144 | |
LDR (post-index, S) throughput 1.070907 | |
LDR (post-index, D) 1->2 roundtrip 6.889222 | |
LDR (post-index, D) throughput 1.057968 | |
LDR (post-index, Q) 1->2 roundtrip 14.647482 | |
LDR (post-index, Q) throughput 1.096726 | |
LDR (pre-index, S) 1->2 roundtrip 7.332851 | |
LDR (pre-index, S) throughput 1.098441 | |
LDR (pre-index, D) 1->2 roundtrip 7.304321 | |
LDR (pre-index, D) throughput 1.066280 | |
LDR (pre-index, Q) 1->2 roundtrip 15.798588 | |
LDR (pre-index, Q) throughput 1.091894 | |
LDR (unsigned offset, S) 1->2 roundtrip 7.195555 | |
LDR (unsigned offset, S) throughput 1.010922 | |
LDR (unsigned offset, D) 1->2 roundtrip 7.020637 | |
LDR (unsigned offset, D) throughput 1.006117 | |
LDR (unsigned offset, Q) 1->2 roundtrip 7.126580 | |
LDR (unsigned offset, Q) throughput 1.017005 | |
LDR (literal, S) throughput 1.012070 | |
LDR (literal, D) throughput 1.013506 | |
LDR (literal, Q) throughput 0.977463 | |
LDR (register, S) 1->2 roundtrip 6.999976 | |
LDR (register, S) 1->3 roundtrip 7.081642 | |
LDR (register, S) throughput 1.000024 | |
LDR (register, D) 1->2 roundtrip 7.153651 | |
LDR (register, D) 1->3 roundtrip 7.025644 | |
LDR (register, D) throughput 1.008530 | |
LDR (register, Q) 1->2 roundtrip 7.096594 | |
LDR (register, Q) 1->3 roundtrip 7.137113 | |
LDR (register, Q) throughput 1.007492 | |
LDR (register, uxtw, S) 1->2 roundtrip 7.040530 | |
LDR (register, uxtw, S) 1->3 roundtrip 7.149083 | |
LDR (register, uxtw, S) throughput 1.005010 | |
LDR (register, uxtw, D) 1->2 roundtrip 7.069544 | |
LDR (register, uxtw, D) 1->3 roundtrip 6.927661 | |
LDR (register, uxtw, D) throughput 1.010564 | |
LDR (register, uxtw, Q) 1->2 roundtrip 7.034146 | |
LDR (register, uxtw, Q) 1->3 roundtrip 7.044710 | |
LDR (register, uxtw, Q) throughput 1.015642 | |
LDR (register, sxtw, S) 1->2 roundtrip 7.204588 | |
LDR (register, sxtw, S) 1->3 roundtrip 7.086510 | |
LDR (register, sxtw, S) throughput 1.020610 | |
LDR (register, sxtw, D) 1->2 roundtrip 6.894803 | |
LDR (register, sxtw, D) 1->3 roundtrip 7.061228 | |
LDR (register, sxtw, D) throughput 1.011801 | |
LDR (register, sxtw, Q) 1->2 roundtrip 7.050658 | |
LDR (register, sxtw, Q) 1->3 roundtrip 7.432550 | |
LDR (register, sxtw, Q) throughput 1.017093 | |
LDR (register, lsl, S) 1->2 roundtrip 7.061313 | |
LDR (register, lsl, S) 1->3 roundtrip 6.983034 | |
LDR (register, lsl, S) throughput 1.006468 | |
LDR (register, lsl, D) 1->2 roundtrip 7.291773 | |
LDR (register, lsl, D) 1->3 roundtrip 6.832418 | |
LDR (register, lsl, D) throughput 1.006824 | |
LDR (register, lsl, Q) 1->2 roundtrip 7.240369 | |
LDR (register, lsl, Q) 1->3 roundtrip 8.062310 | |
LDR (register, lsl, Q) throughput 1.031528 | |
LDUR (S) 1->2 roundtrip 7.010993 | |
LDUR (S) throughput 1.004433 | |
LDUR (D) 1->2 roundtrip 7.080629 | |
LDUR (D) throughput 1.002511 | |
LDUR (Q) 1->2 roundtrip 7.027892 | |
LDUR (Q) throughput 1.013174 | |
MLA (by element, 4H) 1->1 1.994123 | |
MLA (by element, 4H) 1->2 3.040918 | |
MLA (by element, 4H) 1->3 3.002875 | |
MLA (by element, 4H) throughput 1.008161 | |
MLA (by element, 4H) throughput 1.001151 | |
MLA (by element, 8H) 1->1 2.060267 | |
MLA (by element, 8H) 1->2 3.038995 | |
MLA (by element, 8H) 1->3 3.024616 | |
MLA (by element, 8H) throughput 1.001055 | |
MLA (by element, 8H) throughput 1.001348 | |
MLA (by element, 2S) 1->1 1.969948 | |
MLA (by element, 2S) 1->2 3.008906 | |
MLA (by element, 2S) 1->3 3.014796 | |
MLA (by element, 2S) throughput 1.029021 | |
MLA (by element, 2S) throughput 1.024570 | |
MLA (by element, 4S) 1->1 2.066809 | |
MLA (by element, 4S) 1->2 3.008056 | |
MLA (by element, 4S) 1->3 3.010495 | |
MLA (by element, 4S) throughput 1.012663 | |
MLA (by element, 4S) throughput 1.021976 | |
MLA (vector, 8B) 1->1 2.010361 | |
MLA (vector, 8B) 1->2 3.007129 | |
MLA (vector, 8B) 1->3 3.015417 | |
MLA (vector, 8B) throughput 0.993066 | |
MLA (vector, 8B) throughput 1.009545 | |
MLA (vector, 16B) 1->1 1.998514 | |
MLA (vector, 16B) 1->2 3.043332 | |
MLA (vector, 16B) 1->3 3.005717 | |
MLA (vector, 16B) throughput 1.009101 | |
MLA (vector, 16B) throughput 0.994515 | |
MLA (vector, 4H) 1->1 2.017646 | |
MLA (vector, 4H) 1->2 3.022631 | |
MLA (vector, 4H) 1->3 3.052392 | |
MLA (vector, 4H) throughput 1.010587 | |
MLA (vector, 4H) throughput 1.008197 | |
MLA (vector, 8H) 1->1 2.007711 | |
MLA (vector, 8H) 1->2 3.063713 | |
MLA (vector, 8H) 1->3 3.020140 | |
MLA (vector, 8H) throughput 1.006693 | |
MLA (vector, 8H) throughput 1.016858 | |
MLA (vector, 2S) 1->1 1.954744 | |
MLA (vector, 2S) 1->2 3.049163 | |
MLA (vector, 2S) 1->3 3.059576 | |
MLA (vector, 2S) throughput 1.012389 | |
MLA (vector, 2S) throughput 1.007875 | |
MLA (vector, 4S) 1->1 1.978865 | |
MLA (vector, 4S) 1->2 3.017502 | |
MLA (vector, 4S) 1->3 3.009832 | |
MLA (vector, 4S) throughput 1.001708 | |
MLA (vector, 4S) throughput 1.007259 | |
MLS (by element, 4H) 1->1 2.030876 | |
MLS (by element, 4H) 1->2 2.997982 | |
MLS (by element, 4H) 1->3 3.020230 | |
MLS (by element, 4H) throughput 1.005687 | |
MLS (by element, 4H) throughput 1.107040 | |
MLS (by element, 8H) 1->1 1.959786 | |
MLS (by element, 8H) 1->2 3.025610 | |
MLS (by element, 8H) 1->3 3.132079 | |
MLS (by element, 8H) throughput 0.844740 | |
MLS (by element, 8H) throughput 1.013551 | |
MLS (by element, 2S) 1->1 2.034451 | |
MLS (by element, 2S) 1->2 3.018182 | |
MLS (by element, 2S) 1->3 3.019767 | |
MLS (by element, 2S) throughput 1.007092 | |
MLS (by element, 2S) throughput 1.004681 | |
MLS (by element, 4S) 1->1 2.027798 | |
MLS (by element, 4S) 1->2 3.014254 | |
MLS (by element, 4S) 1->3 2.994567 | |
MLS (by element, 4S) throughput 1.007058 | |
MLS (by element, 4S) throughput 1.013739 | |
MLS (vector, 8B) 1->1 1.997500 | |
MLS (vector, 8B) 1->2 3.077964 | |
MLS (vector, 8B) 1->3 2.974763 | |
MLS (vector, 8B) throughput 1.006896 | |
MLS (vector, 8B) throughput 0.998205 | |
MLS (vector, 16B) 1->1 2.014013 | |
MLS (vector, 16B) 1->2 2.978360 | |
MLS (vector, 16B) 1->3 3.011215 | |
MLS (vector, 16B) throughput 1.015056 | |
MLS (vector, 16B) throughput 1.008860 | |
MLS (vector, 4H) 1->1 2.007282 | |
MLS (vector, 4H) 1->2 3.056088 | |
MLS (vector, 4H) 1->3 3.171090 | |
MLS (vector, 4H) throughput 1.037902 | |
MLS (vector, 4H) throughput 0.968117 | |
MLS (vector, 8H) 1->1 2.017309 | |
MLS (vector, 8H) 1->2 3.031487 | |
MLS (vector, 8H) 1->3 2.998027 | |
MLS (vector, 8H) throughput 1.008959 | |
MLS (vector, 8H) throughput 0.999855 | |
MLS (vector, 2S) 1->1 2.034893 | |
MLS (vector, 2S) 1->2 3.044404 | |
MLS (vector, 2S) 1->3 3.028880 | |
MLS (vector, 2S) throughput 1.014938 | |
MLS (vector, 2S) throughput 1.013332 | |
MLS (vector, 4S) 1->1 2.024152 | |
MLS (vector, 4S) 1->2 3.048691 | |
MLS (vector, 4S) 1->3 3.021361 | |
MLS (vector, 4S) throughput 0.985544 | |
MLS (vector, 4S) throughput 1.004311 | |
MOV (vector, 8B) 1->2 2.001052 | |
MOV (vector, 8B) throughput 1.008864 | |
MOV (vector, 16B) 1->2 1.515897 | |
MOV (vector, 16B) throughput 0.674997 | |
MOVI (vector, 4H, zero) throughput 0.335973 | |
MOVI (vector, 8H, zero) throughput 0.334946 | |
MOVI (vector, 2S, zero) throughput 0.336656 | |
MOVI (vector, 4S, zero) throughput 0.334022 | |
MOVI (vector, 4H) throughput 1.006349 | |
MOVI (vector, 8H) throughput 1.132170 | |
MOVI (vector, 2S) throughput 1.028326 | |
MOVI (vector, 4S) throughput 0.994822 | |
MOVI (vector, 4H, lsl) throughput 1.006999 | |
MOVI (vector, 8H, lsl) throughput 1.013197 | |
MOVI (vector, 2S, lsl) throughput 1.010297 | |
MOVI (vector, 4S, lsl) throughput 1.003825 | |
MOVI (vector, 2S, msl) throughput 1.001414 | |
MOVI (vector, 4S, msl) throughput 0.990220 | |
MOVI (vector, 64-bit, 2D) throughput 1.005672 | |
MOVI (vector, 64-bit, D) throughput 1.015231 | |
MUL (by element, 4H) 1->2 3.399468 | |
MUL (by element, 4H) 1->3 3.001490 | |
MUL (by element, 4H) throughput 0.998619 | |
MUL (by element, 8H) 1->2 3.033811 | |
MUL (by element, 8H) 1->3 2.968722 | |
MUL (by element, 8H) throughput 1.012007 | |
MUL (by element, 2S) 1->2 3.091594 | |
MUL (by element, 2S) 1->3 3.000657 | |
MUL (by element, 2S) throughput 1.005459 | |
MUL (by element, 4S) 1->2 2.987555 | |
MUL (by element, 4S) 1->3 3.001425 | |
MUL (by element, 4S) throughput 1.007751 | |
MUL (vector, 8B) 1->2 3.021184 | |
MUL (vector, 8B) 1->3 3.040622 | |
MUL (vector, 8B) throughput 1.002308 | |
MUL (vector, 16B) 1->2 3.000043 | |
MUL (vector, 16B) 1->3 3.030257 | |
MUL (vector, 16B) throughput 1.014655 | |
MUL (vector, 4H) 1->2 3.015437 | |
MUL (vector, 4H) 1->3 3.082729 | |
MUL (vector, 4H) throughput 1.015050 | |
MUL (vector, 8H) 1->2 3.030969 | |
MUL (vector, 8H) 1->3 3.011160 | |
MUL (vector, 8H) throughput 0.996488 | |
MUL (vector, 2S) 1->2 2.992596 | |
MUL (vector, 2S) 1->3 3.008077 | |
MUL (vector, 2S) throughput 1.007121 | |
MUL (vector, 4S) 1->2 3.048376 | |
MUL (vector, 4S) 1->3 3.013681 | |
MUL (vector, 4S) throughput 1.013426 | |
MVNI (vector, 4H) throughput 1.004394 | |
MVNI (vector, 8H) throughput 1.022595 | |
MVNI (vector, 2S) throughput 1.007236 | |
MVNI (vector, 4S) throughput 1.009667 | |
MVNI (vector, 4H, lsl) throughput 1.010625 | |
MVNI (vector, 8H, lsl) throughput 0.975818 | |
MVNI (vector, 2S, lsl) throughput 1.008869 | |
MVNI (vector, 4S, lsl) throughput 1.001930 | |
MVNI (vector, 2S, msl) throughput 0.995008 | |
MVNI (vector, 4S, msl) throughput 1.008937 | |
NEG (vector, 8B) 1->2 2.062360 | |
NEG (vector, 8B) throughput 1.008300 | |
NEG (vector, 16B) 1->2 2.006906 | |
NEG (vector, 16B) throughput 1.029551 | |
NEG (vector, 4H) 1->2 2.013239 | |
NEG (vector, 4H) throughput 1.008943 | |
NEG (vector, 8H) 1->2 2.005176 | |
NEG (vector, 8H) throughput 0.933436 | |
NEG (vector, 2S) 1->2 2.035211 | |
NEG (vector, 2S) throughput 1.007089 | |
NEG (vector, 4S) 1->2 2.001447 | |
NEG (vector, 4S) throughput 1.012813 | |
NEG (vector, 2D) 1->2 2.006949 | |
NEG (vector, 2D) throughput 1.004561 | |
NOT (vector, 8B) 1->2 2.013655 | |
NOT (vector, 8B) throughput 0.996764 | |
NOT (vector, 16B) 1->2 1.946171 | |
NOT (vector, 16B) throughput 1.000896 | |
ORN (vector, 8B) 1->2 2.051137 | |
ORN (vector, 8B) 1->3 2.043809 | |
ORN (vector, 8B) throughput 1.005183 | |
ORN (vector, 16B) 1->2 2.009745 | |
ORN (vector, 16B) 1->3 2.011862 | |
ORN (vector, 16B) throughput 1.009445 | |
ORR (vector, immediate, 4H) 1->1 2.014141 | |
ORR (vector, immediate, 4H) throughput 1.001582 | |
ORR (vector, immediate, 4H) throughput 1.004559 | |
ORR (vector, immediate, 8H) 1->1 2.005124 | |
ORR (vector, immediate, 8H) throughput 1.012243 | |
ORR (vector, immediate, 8H) throughput 1.000840 | |
ORR (vector, immediate, 2S) 1->1 2.023244 | |
ORR (vector, immediate, 2S) throughput 1.007483 | |
ORR (vector, immediate, 2S) throughput 1.056901 | |
ORR (vector, immediate, 4S) 1->1 1.928399 | |
ORR (vector, immediate, 4S) throughput 1.039218 | |
ORR (vector, immediate, 4S) throughput 1.035488 | |
ORR (vector, 8B) 1->2 2.040337 | |
ORR (vector, 8B) 1->3 2.112669 | |
ORR (vector, 8B) throughput 1.039529 | |
ORR (vector, 16B) 1->2 2.014870 | |
ORR (vector, 16B) 1->3 2.105945 | |
ORR (vector, 16B) throughput 1.008895 | |
PMUL (8B) 1->2 3.017152 | |
PMUL (8B) 1->3 3.006557 | |
PMUL (8B) throughput 1.023880 | |
PMUL (16B) 1->2 3.001964 | |
PMUL (16B) 1->3 3.020032 | |
PMUL (16B) throughput 1.006121 | |
PMULL (1Q) 1->2 3.009735 | |
PMULL (1Q) 1->3 3.014877 | |
PMULL (1Q) throughput 1.036657 | |
PMULL (8H) 1->2 3.004653 | |
PMULL (8H) 1->3 3.028874 | |
PMULL (8H) throughput 1.007143 | |
PMULL2 (1Q) 1->2 3.012720 | |
PMULL2 (1Q) 1->3 3.023533 | |
PMULL2 (1Q) throughput 0.998412 | |
PMULL2 (8H) 1->2 3.008883 | |
PMULL2 (8H) 1->3 3.003038 | |
PMULL2 (8H) throughput 1.003411 | |
PMULL + EOR (1Q) 1->2 4.560702 | |
PMULL + EOR (1Q) 1->3 4.429133 | |
PMULL + EOR (1Q) 1->4 4.519806 | |
PMULL + EOR (1Q) throughput 1.007774 | |
PMULL + EOR (8H) 1->2 4.462245 | |
PMULL + EOR (8H) 1->3 4.395711 | |
PMULL + EOR (8H) 1->4 4.494503 | |
PMULL + EOR (8H) throughput 1.013351 | |
RADDHN (8H) 1->2 4.052868 | |
RADDHN (8H) 1->3 4.036968 | |
RADDHN (8H) throughput 1.007992 | |
RADDHN (4S) 1->2 3.954327 | |
RADDHN (4S) 1->3 4.012892 | |
RADDHN (4S) throughput 1.008545 | |
RADDHN (2D) 1->2 4.050395 | |
RADDHN (2D) 1->3 5.055486 | |
RADDHN (2D) throughput 1.004871 | |
RADDHN2 (8H) 1->1 4.013042 | |
RADDHN2 (8H) 1->2 4.013946 | |
RADDHN2 (8H) 1->3 4.018853 | |
RADDHN2 (8H) throughput 1.010915 | |
RADDHN2 (8H) throughput 1.004368 | |
RADDHN2 (4S) 1->1 3.979254 | |
RADDHN2 (4S) 1->2 4.066697 | |
RADDHN2 (4S) 1->3 4.011824 | |
RADDHN2 (4S) throughput 0.981022 | |
RADDHN2 (4S) throughput 1.004288 | |
RADDHN2 (2D) 1->1 4.170248 | |
RADDHN2 (2D) 1->2 3.992816 | |
RADDHN2 (2D) 1->3 3.987506 | |
RADDHN2 (2D) throughput 0.997334 | |
RADDHN2 (2D) throughput 1.006608 | |
RBIT (vector, 8B) 1->2 2.010620 | |
RBIT (vector, 8B) throughput 1.005727 | |
RBIT (vector, 16B) 1->2 1.937261 | |
RBIT (vector, 16B) throughput 0.999526 | |
REV16 (vector, 8B) 1->2 2.015333 | |
REV16 (vector, 8B) throughput 1.016656 | |
REV16 (vector, 16B) 1->2 2.040223 | |
REV16 (vector, 16B) throughput 0.987701 | |
REV32 (vector, 8B) 1->2 2.010075 | |
REV32 (vector, 8B) throughput 1.025881 | |
REV32 (vector, 16B) 1->2 2.015111 | |
REV32 (vector, 16B) throughput 0.988869 | |
REV32 (vector, 4H) 1->2 1.891187 | |
REV32 (vector, 4H) throughput 1.011735 | |
REV32 (vector, 8H) 1->2 1.989317 | |
REV32 (vector, 8H) throughput 0.994255 | |
REV64 (vector, 8B) 1->2 1.980373 | |
REV64 (vector, 8B) throughput 1.013282 | |
REV64 (vector, 16B) 1->2 2.020988 | |
REV64 (vector, 16B) throughput 1.005975 | |
REV64 (vector, 4H) 1->2 2.026497 | |
REV64 (vector, 4H) throughput 1.012111 | |
REV64 (vector, 8H) 1->2 2.009837 | |
REV64 (vector, 8H) throughput 1.006337 | |
REV64 (vector, 2S) 1->2 2.008773 | |
REV64 (vector, 2S) throughput 0.991170 | |
REV64 (vector, 4S) 1->2 2.017196 | |
REV64 (vector, 4S) throughput 1.010071 | |
RSHRN (8H) 1->2 4.021408 | |
RSHRN (8H) throughput 0.984314 | |
RSHRN (4S) 1->2 4.059482 | |
RSHRN (4S) throughput 1.011370 | |
RSHRN (2D) 1->2 3.988616 | |
RSHRN (2D) throughput 1.003271 | |
RSHRN2 (8H) 1->1 4.035963 | |
RSHRN2 (8H) 1->2 4.016628 | |
RSHRN2 (8H) throughput 0.999328 | |
RSHRN2 (8H) throughput 0.997781 | |
RSHRN2 (4S) 1->1 4.082512 | |
RSHRN2 (4S) 1->2 3.924463 | |
RSHRN2 (4S) throughput 1.014018 | |
RSHRN2 (4S) throughput 1.001584 | |
RSHRN2 (2D) 1->1 4.005322 | |
RSHRN2 (2D) 1->2 4.014959 | |
RSHRN2 (2D) throughput 1.004425 | |
RSHRN2 (2D) throughput 1.004984 | |
RSUBHN (8H) 1->2 4.006977 | |
RSUBHN (8H) 1->3 4.043029 | |
RSUBHN (8H) throughput 1.012152 | |
RSUBHN (4S) 1->2 4.001765 | |
RSUBHN (4S) 1->3 4.151385 | |
RSUBHN (4S) throughput 0.996860 | |
RSUBHN (2D) 1->2 4.042812 | |
RSUBHN (2D) 1->3 4.009043 | |
RSUBHN (2D) throughput 1.009465 | |
RSUBHN2 (8H) 1->1 4.022807 | |
RSUBHN2 (8H) 1->2 4.035021 | |
RSUBHN2 (8H) 1->3 4.020748 | |
RSUBHN2 (8H) throughput 0.997086 | |
RSUBHN2 (8H) throughput 1.010594 | |
RSUBHN2 (4S) 1->1 4.034768 | |
RSUBHN2 (4S) 1->2 4.004540 | |
RSUBHN2 (4S) 1->3 4.237777 | |
RSUBHN2 (4S) throughput 1.010794 | |
RSUBHN2 (4S) throughput 1.001294 | |
RSUBHN2 (2D) 1->1 3.984003 | |
RSUBHN2 (2D) 1->2 4.048976 | |
RSUBHN2 (2D) 1->3 4.031761 | |
RSUBHN2 (2D) throughput 1.008312 | |
RSUBHN2 (2D) throughput 1.002700 | |
SABA (vector, 8B) 1->1 1.994392 | |
SABA (vector, 8B) 1->2 3.047644 | |
SABA (vector, 8B) 1->3 3.028680 | |
SABA (vector, 8B) throughput 1.005149 | |
SABA (vector, 8B) throughput 0.862255 | |
SABA (vector, 16B) 1->1 1.998728 | |
SABA (vector, 16B) 1->2 3.054772 | |
SABA (vector, 16B) 1->3 3.201108 | |
SABA (vector, 16B) throughput 1.022395 | |
SABA (vector, 16B) throughput 1.002724 | |
SABA (vector, 4H) 1->1 1.955581 | |
SABA (vector, 4H) 1->2 3.097438 | |
SABA (vector, 4H) 1->3 3.143701 | |
SABA (vector, 4H) throughput 1.016649 | |
SABA (vector, 4H) throughput 1.014533 | |
SABA (vector, 8H) 1->1 2.001360 | |
SABA (vector, 8H) 1->2 3.004486 | |
SABA (vector, 8H) 1->3 2.864721 | |
SABA (vector, 8H) throughput 1.003880 | |
SABA (vector, 8H) throughput 1.008136 | |
SABA (vector, 2S) 1->1 2.043830 | |
SABA (vector, 2S) 1->2 3.014002 | |
SABA (vector, 2S) 1->3 3.067800 | |
SABA (vector, 2S) throughput 1.004516 | |
SABA (vector, 2S) throughput 1.011657 | |
SABA (vector, 4S) 1->1 1.994240 | |
SABA (vector, 4S) 1->2 3.034458 | |
SABA (vector, 4S) 1->3 3.040355 | |
SABA (vector, 4S) throughput 1.012051 | |
SABA (vector, 4S) throughput 1.023626 | |
SABAL (vector, 8H) 1->1 2.000457 | |
SABAL (vector, 8H) 1->2 2.997173 | |
SABAL (vector, 8H) 1->3 3.046583 | |
SABAL (vector, 8H) throughput 1.012400 | |
SABAL (vector, 8H) throughput 1.014495 | |
SABAL (vector, 4S) 1->1 2.000153 | |
SABAL (vector, 4S) 1->2 3.023993 | |
SABAL (vector, 4S) 1->3 2.975585 | |
SABAL (vector, 4S) throughput 1.021654 | |
SABAL (vector, 4S) throughput 1.000362 | |
SABAL (vector, 2D) 1->1 2.001028 | |
SABAL (vector, 2D) 1->2 3.027742 | |
SABAL (vector, 2D) 1->3 2.990141 | |
SABAL (vector, 2D) throughput 0.987691 | |
SABAL (vector, 2D) throughput 0.986973 | |
SABAL2 (vector, 8H) 1->1 2.027271 | |
SABAL2 (vector, 8H) 1->2 3.021909 | |
SABAL2 (vector, 8H) 1->3 3.000709 | |
SABAL2 (vector, 8H) throughput 1.002900 | |
SABAL2 (vector, 8H) throughput 1.003088 | |
SABAL2 (vector, 4S) 1->1 1.997083 | |
SABAL2 (vector, 4S) 1->2 3.004890 | |
SABAL2 (vector, 4S) 1->3 2.996424 | |
SABAL2 (vector, 4S) throughput 1.016295 | |
SABAL2 (vector, 4S) throughput 1.008308 | |
SABAL2 (vector, 2D) 1->1 1.996212 | |
SABAL2 (vector, 2D) 1->2 3.008902 | |
SABAL2 (vector, 2D) 1->3 3.046473 | |
SABAL2 (vector, 2D) throughput 1.007528 | |
SABAL2 (vector, 2D) throughput 1.010111 | |
SABD (vector, 8B) 1->2 3.028707 | |
SABD (vector, 8B) 1->3 3.024048 | |
SABD (vector, 8B) throughput 1.005396 | |
SABD (vector, 16B) 1->2 2.959471 | |
SABD (vector, 16B) 1->3 2.967510 | |
SABD (vector, 16B) throughput 0.990578 | |
SABD (vector, 4H) 1->2 3.009212 | |
SABD (vector, 4H) 1->3 2.967480 | |
SABD (vector, 4H) throughput 1.009673 | |
SABD (vector, 8H) 1->2 3.047693 | |
SABD (vector, 8H) 1->3 3.002095 | |
SABD (vector, 8H) throughput 0.970576 | |
SABD (vector, 2S) 1->2 3.011112 | |
SABD (vector, 2S) 1->3 2.995547 | |
SABD (vector, 2S) throughput 1.132258 | |
SABD (vector, 4S) 1->2 2.974151 | |
SABD (vector, 4S) 1->3 3.009981 | |
SABD (vector, 4S) throughput 1.048315 | |
SABDL (vector, 8H) 1->2 3.118561 | |
SABDL (vector, 8H) 1->3 3.007303 | |
SABDL (vector, 8H) throughput 0.984402 | |
SABDL (vector, 4S) 1->2 3.022789 | |
SABDL (vector, 4S) 1->3 2.962503 | |
SABDL (vector, 4S) throughput 1.005068 | |
SABDL (vector, 2D) 1->2 3.000389 | |
SABDL (vector, 2D) 1->3 3.021050 | |
SABDL (vector, 2D) throughput 1.019036 | |
SABDL2 (vector, 8H) 1->2 2.972069 | |
SABDL2 (vector, 8H) 1->3 3.016052 | |
SABDL2 (vector, 8H) throughput 1.004494 | |
SABDL2 (vector, 4S) 1->2 3.022530 | |
SABDL2 (vector, 4S) 1->3 3.028053 | |
SABDL2 (vector, 4S) throughput 1.015183 | |
SABDL2 (vector, 2D) 1->2 3.004781 | |
SABDL2 (vector, 2D) 1->3 3.006666 | |
SABDL2 (vector, 2D) throughput 1.007565 | |
SADALP (vector, 4H) 1->1 2.008577 | |
SADALP (vector, 4H) 1->2 2.952439 | |
SADALP (vector, 4H) throughput 0.989935 | |
SADALP (vector, 4H) throughput 1.015091 | |
SADALP (vector, 8H) 1->1 2.004141 | |
SADALP (vector, 8H) 1->2 3.846702 | |
SADALP (vector, 8H) throughput 1.012482 | |
SADALP (vector, 8H) throughput 0.995170 | |
SADALP (vector, 2S) 1->1 1.985839 | |
SADALP (vector, 2S) 1->2 3.027417 | |
SADALP (vector, 2S) throughput 1.006622 | |
SADALP (vector, 2S) throughput 1.006187 | |
SADALP (vector, 4S) 1->1 1.999058 | |
SADALP (vector, 4S) 1->2 2.977011 | |
SADALP (vector, 4S) throughput 1.009454 | |
SADALP (vector, 4S) throughput 1.044498 | |
SADALP (vector, 1D) 1->1 1.993797 | |
SADALP (vector, 1D) 1->2 3.034943 | |
SADALP (vector, 1D) throughput 1.008253 | |
SADALP (vector, 1D) throughput 1.006941 | |
SADALP (vector, 2D) 1->1 2.046205 | |
SADALP (vector, 2D) 1->2 2.999386 | |
SADALP (vector, 2D) throughput 1.010839 | |
SADALP (vector, 2D) throughput 0.989324 | |
SADDL (vector, 8H) 1->2 2.005225 | |
SADDL (vector, 8H) 1->3 2.012942 | |
SADDL (vector, 8H) throughput 0.987288 | |
SADDL (vector, 4S) 1->2 2.003268 | |
SADDL (vector, 4S) 1->3 2.005789 | |
SADDL (vector, 4S) throughput 1.002487 | |
SADDL (vector, 2D) 1->2 1.965871 | |
SADDL (vector, 2D) 1->3 1.974976 | |
SADDL (vector, 2D) throughput 1.008132 | |
SADDL2 (vector, 8H) 1->2 2.030874 | |
SADDL2 (vector, 8H) 1->3 2.007753 | |
SADDL2 (vector, 8H) throughput 1.001471 | |
SADDL2 (vector, 4S) 1->2 2.004473 | |
SADDL2 (vector, 4S) 1->3 1.964516 | |
SADDL2 (vector, 4S) throughput 1.000653 | |
SADDL2 (vector, 2D) 1->2 2.057561 | |
SADDL2 (vector, 2D) 1->3 1.997192 | |
SADDL2 (vector, 2D) throughput 1.005264 | |
SADDLP (vector, 4H) 1->2 2.033428 | |
SADDLP (vector, 4H) throughput 0.975096 | |
SADDLP (vector, 8H) 1->2 2.025358 | |
SADDLP (vector, 8H) throughput 1.017959 | |
SADDLP (vector, 2S) 1->2 1.994348 | |
SADDLP (vector, 2S) throughput 0.994529 | |
SADDLP (vector, 4S) 1->2 2.003772 | |
SADDLP (vector, 4S) throughput 1.000782 | |
SADDLP (vector, 1D) 1->2 2.001098 | |
SADDLP (vector, 1D) throughput 0.991205 | |
SADDLP (vector, 2D) 1->2 1.987527 | |
SADDLP (vector, 2D) throughput 0.982868 | |
SADDLV (vector, 8B) 1->2 2.876083 | |
SADDLV (vector, 8B) throughput 1.027311 | |
SADDLV (vector, 16B) 1->2 3.026380 | |
SADDLV (vector, 16B) throughput 1.036184 | |
SADDLV (vector, 4H) 1->2 3.029262 | |
SADDLV (vector, 4H) throughput 1.019246 | |
SADDLV (vector, 8H) 1->2 3.037059 | |
SADDLV (vector, 8H) throughput 1.004186 | |
SADDLV (vector, 4S) 1->2 2.936999 | |
SADDLV (vector, 4S) throughput 1.015364 | |
SADDW (vector, 8H) 1->2 2.022099 | |
SADDW (vector, 8H) 1->3 2.028720 | |
SADDW (vector, 8H) throughput 1.007651 | |
SADDW (vector, 4S) 1->2 2.062081 | |
SADDW (vector, 4S) 1->3 2.003254 | |
SADDW (vector, 4S) throughput 1.011508 | |
SADDW (vector, 2D) 1->2 2.046820 | |
SADDW (vector, 2D) 1->3 1.989716 | |
SADDW (vector, 2D) throughput 1.005253 | |
SADDW2 (vector, 8H) 1->2 2.016670 | |
SADDW2 (vector, 8H) 1->3 2.046380 | |
SADDW2 (vector, 8H) throughput 0.986631 | |
SADDW2 (vector, 4S) 1->2 2.006650 | |
SADDW2 (vector, 4S) 1->3 2.024316 | |
SADDW2 (vector, 4S) throughput 1.022784 | |
SADDW2 (vector, 2D) 1->2 2.010169 | |
SADDW2 (vector, 2D) 1->3 1.994109 | |
SADDW2 (vector, 2D) throughput 1.017668 | |
SCVTF (scalar, fixed-point, S from 32-bit GPR) 1->2 roundtrip 9.918009 | |
SCVTF (scalar, fixed-point, S from 32-bit GPR) throughput 1.009686 | |
SCVTF (scalar, fixed-point, D from 32-bit GPR) 1->2 roundtrip 10.074537 | |
SCVTF (scalar, fixed-point, D from 32-bit GPR) throughput 1.007089 | |
SCVTF (scalar, fixed-point, S from 64-bit GPR) 1->2 roundtrip 10.100717 | |
SCVTF (scalar, fixed-point, S from 64-bit GPR) throughput 1.007194 | |
SCVTF (scalar, fixed-point, D from 64-bit GPR) 1->2 roundtrip 10.044557 | |
SCVTF (scalar, fixed-point, D from 64-bit GPR) throughput 1.012941 | |
SCVTF (scalar, integer, S from 32-bit GPR) 1->2 roundtrip 9.935330 | |
SCVTF (scalar, integer, S from 32-bit GPR) throughput 1.006438 | |
SCVTF (scalar, integer, D from 32-bit GPR) 1->2 roundtrip 10.027651 | |
SCVTF (scalar, integer, D from 32-bit GPR) throughput 1.015946 | |
SCVTF (scalar, integer, S from 64-bit GPR) 1->2 roundtrip 10.006854 | |
SCVTF (scalar, integer, S from 64-bit GPR) throughput 1.006996 | |
SCVTF (scalar, integer, D from 64-bit GPR) 1->2 roundtrip 10.039849 | |
SCVTF (scalar, integer, D from 64-bit GPR) throughput 1.004608 | |
SCVTF (scalar, fixed-point, S from S) 1->2 2.999579 | |
SCVTF (scalar, fixed-point, S from S) throughput 1.012036 | |
SCVTF (scalar, fixed-point, D from D) 1->2 3.011327 | |
SCVTF (scalar, fixed-point, D from D) throughput 0.999926 | |
SCVTF (vector, fixed-point, 2S from 2S) 1->2 3.024322 | |
SCVTF (vector, fixed-point, 2S from 2S) throughput 1.012638 | |
SCVTF (vector, fixed-point, 4S from 4S) 1->2 2.986756 | |
SCVTF (vector, fixed-point, 4S from 4S) throughput 1.005748 | |
SCVTF (vector, fixed-point, 2D from 2D) 1->2 3.023377 | |
SCVTF (vector, fixed-point, 2D from 2D) throughput 0.974618 | |
SCVTF (scalar, integer, S from S) 1->2 3.066739 | |
SCVTF (scalar, integer, S from S) throughput 1.009499 | |
SCVTF (scalar, integer, D from D) 1->2 3.029985 | |
SCVTF (scalar, integer, D from D) throughput 1.001533 | |
SCVTF (vector, integer, 2S from 2S) 1->2 3.036120 | |
SCVTF (vector, integer, 2S from 2S) throughput 1.008807 | |
SCVTF (vector, integer, 4S from 4S) 1->2 2.884876 | |
SCVTF (vector, integer, 4S from 4S) throughput 0.971467 | |
SCVTF (vector, integer, 2D from 2D) 1->2 3.038833 | |
SCVTF (vector, integer, 2D from 2D) throughput 1.014511 | |
SHA1C 1->1 5.024894 | |
SHA1C 1->2 5.060340 | |
SHA1C 1->3 5.028331 | |
SHA1C throughput 4.036938 | |
SHA1C throughput 3.908395 | |
SHA1H 1->2 1.955366 | |
SHA1H throughput 1.005860 | |
SHA1M 1->1 5.073646 | |
SHA1M 1->2 4.985275 | |
SHA1M 1->3 5.038137 | |
SHA1M throughput 4.028662 | |
SHA1M throughput 4.035788 | |
SHA1P 1->1 5.012074 | |
SHA1P 1->2 4.998680 | |
SHA1P 1->3 5.025577 | |
SHA1P throughput 3.996057 | |
SHA1P throughput 4.022321 | |
SHA1SU0 1->1 2.005135 | |
SHA1SU0 1->2 1.999934 | |
SHA1SU0 1->3 2.047283 | |
SHA1SU0 throughput 0.994366 | |
SHA1SU0 throughput 1.015655 | |
SHA1SU1 1->1 1.994633 | |
SHA1SU1 1->2 1.998092 | |
SHA1SU1 throughput 1.003034 | |
SHA1SU1 throughput 1.007335 | |
SHA256H 1->1 5.081018 | |
SHA256H 1->2 5.041674 | |
SHA256H 1->3 4.990472 | |
SHA256H throughput 4.060842 | |
SHA256H throughput 4.023415 | |
SHA256H2 1->1 5.027326 | |
SHA256H2 1->2 5.018499 | |
SHA256H2 1->3 5.026376 | |
SHA256H2 throughput 3.985535 | |
SHA256H2 throughput 3.992722 | |
SHA256SU0 1->1 1.994962 | |
SHA256SU0 1->2 2.011590 | |
SHA256SU0 throughput 1.016746 | |
SHA256SU0 throughput 1.020090 | |
SHA256SU1 1->1 3.037013 | |
SHA256SU1 1->2 2.952537 | |
SHA256SU1 1->3 3.006110 | |
SHA256SU1 throughput 1.004076 | |
SHA256SU1 throughput 1.000225 | |
SHADD (vector, 8B) 1->2 2.014433 | |
SHADD (vector, 8B) 1->3 2.000811 | |
SHADD (vector, 8B) throughput 1.005955 | |
SHADD (vector, 16B) 1->2 1.975699 | |
SHADD (vector, 16B) 1->3 2.009890 | |
SHADD (vector, 16B) throughput 1.014230 | |
SHADD (vector, 4H) 1->2 1.981427 | |
SHADD (vector, 4H) 1->3 2.018562 | |
SHADD (vector, 4H) throughput 0.988409 | |
SHADD (vector, 8H) 1->2 2.015649 | |
SHADD (vector, 8H) 1->3 1.996123 | |
SHADD (vector, 8H) throughput 1.008086 | |
SHADD (vector, 2S) 1->2 2.007543 | |
SHADD (vector, 2S) 1->3 2.037379 | |
SHADD (vector, 2S) throughput 1.010055 | |
SHADD (vector, 4S) 1->2 2.018575 | |
SHADD (vector, 4S) 1->3 1.983412 | |
SHADD (vector, 4S) throughput 1.009104 | |
SHL (vector, 8B) 1->2 2.007887 | |
SHL (vector, 8B) throughput 1.005609 | |
SHL (vector, 16B) 1->2 1.994632 | |
SHL (vector, 16B) throughput 1.013253 | |
SHL (vector, 4H) 1->2 2.019109 | |
SHL (vector, 4H) throughput 1.009391 | |
SHL (vector, 8H) 1->2 1.998490 | |
SHL (vector, 8H) throughput 1.024142 | |
SHL (vector, 2S) 1->2 2.012197 | |
SHL (vector, 2S) throughput 0.998608 | |
SHL (vector, 4S) 1->2 2.025752 | |
SHL (vector, 4S) throughput 1.004070 | |
SHL (vector, 2D) 1->2 1.999145 | |
SHL (vector, 2D) throughput 1.004371 | |
SHL (scalar, D) 1->2 2.008872 | |
SHL (scalar, D) throughput 0.992533 | |
SHLL (vector, 8H) 1->2 2.048405 | |
SHLL (vector, 8H) throughput 1.021637 | |
SHLL (vector, 4S) 1->2 2.024587 | |
SHLL (vector, 4S) throughput 1.006487 | |
SHLL (vector, 2D) 1->2 2.018170 | |
SHLL (vector, 2D) throughput 1.006623 | |
SHLL2 (vector, 8H) 1->2 1.958272 | |
SHLL2 (vector, 8H) throughput 0.997561 | |
SHLL2 (vector, 4S) 1->2 2.022873 | |
SHLL2 (vector, 4S) throughput 1.007682 | |
SHLL2 (vector, 2D) 1->2 2.011115 | |
SHLL2 (vector, 2D) throughput 0.996469 | |
SHRN (8H) 1->2 4.362977 | |
SHRN (8H) throughput 0.964550 | |
SHRN (4S) 1->2 3.081054 | |
SHRN (4S) throughput 1.130533 | |
SHRN (2D) 1->2 3.165798 | |
SHRN (2D) throughput 0.864060 | |
SHRN2 (8H) 1->1 4.047584 | |
SHRN2 (8H) 1->2 3.972984 | |
SHRN2 (8H) throughput 1.000829 | |
SHRN2 (8H) throughput 1.012462 | |
SHRN2 (4S) 1->1 3.856936 | |
SHRN2 (4S) 1->2 4.028558 | |
SHRN2 (4S) throughput 1.008792 | |
SHRN2 (4S) throughput 0.999261 | |
SHRN2 (2D) 1->1 4.066367 | |
SHRN2 (2D) 1->2 4.048712 | |
SHRN2 (2D) throughput 1.004621 | |
SHRN2 (2D) throughput 1.010732 | |
SHSUB (vector, 8B) 1->2 1.996891 | |
SHSUB (vector, 8B) 1->3 2.005031 | |
SHSUB (vector, 8B) throughput 1.000687 | |
SHSUB (vector, 16B) 1->2 2.037747 | |
SHSUB (vector, 16B) 1->3 2.268977 | |
SHSUB (vector, 16B) throughput 1.013036 | |
SHSUB (vector, 4H) 1->2 2.005428 | |
SHSUB (vector, 4H) 1->3 2.028858 | |
SHSUB (vector, 4H) throughput 1.007361 | |
SHSUB (vector, 8H) 1->2 1.933147 | |
SHSUB (vector, 8H) 1->3 2.011587 | |
SHSUB (vector, 8H) throughput 1.006291 | |
SHSUB (vector, 2S) 1->2 2.009723 | |
SHSUB (vector, 2S) 1->3 2.003049 | |
SHSUB (vector, 2S) throughput 1.007588 | |
SHSUB (vector, 4S) 1->2 2.000811 | |
SHSUB (vector, 4S) 1->3 2.006818 | |
SHSUB (vector, 4S) throughput 1.013420 | |
SLI (vector, 8B) 1->1 1.994415 | |
SLI (vector, 8B) 1->2 2.018824 | |
SLI (vector, 8B) throughput 0.995906 | |
SLI (vector, 8B) throughput 1.008599 | |
SLI (vector, 16B) 1->1 1.999433 | |
SLI (vector, 16B) 1->2 2.002454 | |
SLI (vector, 16B) throughput 0.999471 | |
SLI (vector, 16B) throughput 1.017845 | |
SLI (vector, 4H) 1->1 2.017881 | |
SLI (vector, 4H) 1->2 2.013723 | |
SLI (vector, 4H) throughput 0.993576 | |
SLI (vector, 4H) throughput 1.011954 | |
SLI (vector, 8H) 1->1 2.009149 | |
SLI (vector, 8H) 1->2 2.012060 | |
SLI (vector, 8H) throughput 0.978534 | |
SLI (vector, 8H) throughput 0.945842 | |
SLI (vector, 2S) 1->1 1.996716 | |
SLI (vector, 2S) 1->2 2.001229 | |
SLI (vector, 2S) throughput 1.005853 | |
SLI (vector, 2S) throughput 0.987195 | |
SLI (vector, 4S) 1->1 2.052204 | |
SLI (vector, 4S) 1->2 2.004190 | |
SLI (vector, 4S) throughput 1.000006 | |
SLI (vector, 4S) throughput 1.006689 | |
SLI (vector, 2D) 1->1 1.998370 | |
SLI (vector, 2D) 1->2 2.031189 | |
SLI (vector, 2D) throughput 1.004385 | |
SLI (vector, 2D) throughput 1.008674 | |
SLI (scalar, D) 1->1 2.005833 | |
SLI (scalar, D) 1->2 1.990370 | |
SLI (scalar, D) throughput 1.005812 | |
SLI (scalar, D) throughput 1.015759 | |
SMAX (vector, 8B) 1->2 3.014671 | |
SMAX (vector, 8B) 1->3 3.019427 | |
SMAX (vector, 8B) throughput 1.012925 | |
SMAX (vector, 16B) 1->2 2.996147 | |
SMAX (vector, 16B) 1->3 3.001992 | |
SMAX (vector, 16B) throughput 1.061324 | |
SMAX (vector, 4H) 1->2 3.009434 | |
SMAX (vector, 4H) 1->3 2.922535 | |
SMAX (vector, 4H) throughput 1.034461 | |
SMAX (vector, 8H) 1->2 3.000109 | |
SMAX (vector, 8H) 1->3 3.031359 | |
SMAX (vector, 8H) throughput 1.010554 | |
SMAX (vector, 2S) 1->2 3.026101 | |
SMAX (vector, 2S) 1->3 3.007740 | |
SMAX (vector, 2S) throughput 0.998060 | |
SMAX (vector, 4S) 1->2 2.999604 | |
SMAX (vector, 4S) 1->3 2.993899 | |
SMAX (vector, 4S) throughput 1.009445 | |
SMAXP (vector, 8B) 1->2 3.048076 | |
SMAXP (vector, 8B) 1->3 2.996582 | |
SMAXP (vector, 8B) throughput 1.009835 | |
SMAXP (vector, 16B) 1->2 2.998057 | |
SMAXP (vector, 16B) 1->3 3.039413 | |
SMAXP (vector, 16B) throughput 0.997744 | |
SMAXP (vector, 4H) 1->2 2.996512 | |
SMAXP (vector, 4H) 1->3 3.102324 | |
SMAXP (vector, 4H) throughput 1.431876 | |
SMAXP (vector, 8H) 1->2 3.079655 | |
SMAXP (vector, 8H) 1->3 3.011058 | |
SMAXP (vector, 8H) throughput 1.007411 | |
SMAXP (vector, 2S) 1->2 3.027074 | |
SMAXP (vector, 2S) 1->3 3.013224 | |
SMAXP (vector, 2S) throughput 1.011341 | |
SMAXP (vector, 4S) 1->2 3.011237 | |
SMAXP (vector, 4S) 1->3 3.043309 | |
SMAXP (vector, 4S) throughput 0.992858 | |
SMAXV (vector, 8B) 1->2 3.165228 | |
SMAXV (vector, 8B) throughput 0.985150 | |
SMAXV (vector, 16B) 1->2 3.038462 | |
SMAXV (vector, 16B) throughput 1.007722 | |
SMAXV (vector, 4H) 1->2 3.003219 | |
SMAXV (vector, 4H) throughput 1.081817 | |
SMAXV (vector, 8H) 1->2 3.002671 | |
SMAXV (vector, 8H) throughput 1.005036 | |
SMAXV (vector, 4S) 1->2 3.010799 | |
SMAXV (vector, 4S) throughput 1.009986 | |
SMIN (vector, 8B) 1->2 3.016535 | |
SMIN (vector, 8B) 1->3 3.014031 | |
SMIN (vector, 8B) throughput 1.016578 | |
SMIN (vector, 16B) 1->2 3.005854 | |
SMIN (vector, 16B) 1->3 3.020804 | |
SMIN (vector, 16B) throughput 1.007599 | |
SMIN (vector, 4H) 1->2 3.025290 | |
SMIN (vector, 4H) 1->3 3.019607 | |
SMIN (vector, 4H) throughput 1.012207 | |
SMIN (vector, 8H) 1->2 2.996149 | |
SMIN (vector, 8H) 1->3 3.296800 | |
SMIN (vector, 8H) throughput 0.989174 | |
SMIN (vector, 2S) 1->2 3.023693 | |
SMIN (vector, 2S) 1->3 3.039746 | |
SMIN (vector, 2S) throughput 1.008986 | |
SMIN (vector, 4S) 1->2 2.991694 | |
SMIN (vector, 4S) 1->3 3.018861 | |
SMIN (vector, 4S) throughput 1.003750 | |
SMINP (vector, 8B) 1->2 3.030001 | |
SMINP (vector, 8B) 1->3 3.028025 | |
SMINP (vector, 8B) throughput 1.008092 | |
SMINP (vector, 16B) 1->2 3.018684 | |
SMINP (vector, 16B) 1->3 3.003794 | |
SMINP (vector, 16B) throughput 1.008781 | |
SMINP (vector, 4H) 1->2 3.045878 | |
SMINP (vector, 4H) 1->3 3.049751 | |
SMINP (vector, 4H) throughput 1.000417 | |
SMINP (vector, 8H) 1->2 2.949443 | |
SMINP (vector, 8H) 1->3 3.033535 | |
SMINP (vector, 8H) throughput 0.965571 | |
SMINP (vector, 2S) 1->2 3.013553 | |
SMINP (vector, 2S) 1->3 3.515670 | |
SMINP (vector, 2S) throughput 1.007884 | |
SMINP (vector, 4S) 1->2 3.090913 | |
SMINP (vector, 4S) 1->3 3.040656 | |
SMINP (vector, 4S) throughput 1.023803 | |
SMINV (vector, 8B) 1->2 2.997982 | |
SMINV (vector, 8B) throughput 1.008893 | |
SMINV (vector, 16B) 1->2 2.997939 | |
SMINV (vector, 16B) throughput 1.015140 | |
SMINV (vector, 4H) 1->2 3.008753 | |
SMINV (vector, 4H) throughput 1.000623 | |
SMINV (vector, 8H) 1->2 3.011327 | |
SMINV (vector, 8H) throughput 1.009928 | |
SMINV (vector, 4S) 1->2 3.001358 | |
SMINV (vector, 4S) throughput 1.011418 | |
SMLAL (by element, 4S) 1->1 2.018334 | |
SMLAL (by element, 4S) 1->2 3.018965 | |
SMLAL (by element, 4S) 1->3 3.043051 | |
SMLAL (by element, 4S) throughput 1.007378 | |
SMLAL (by element, 4S) throughput 1.010860 | |
SMLAL (by element, 2D) 1->1 2.036607 | |
SMLAL (by element, 2D) 1->2 2.998465 | |
SMLAL (by element, 2D) 1->3 2.998466 | |
SMLAL (by element, 2D) throughput 1.006621 | |
SMLAL (by element, 2D) throughput 1.016241 | |
SMLAL2 (by element, 4S) 1->1 1.982368 | |
SMLAL2 (by element, 4S) 1->2 3.009029 | |
SMLAL2 (by element, 4S) 1->3 3.013799 | |
SMLAL2 (by element, 4S) throughput 1.001305 | |
SMLAL2 (by element, 4S) throughput 0.987019 | |
SMLAL2 (by element, 2D) 1->1 2.036859 | |
SMLAL2 (by element, 2D) 1->2 3.003571 | |
SMLAL2 (by element, 2D) 1->3 3.023052 | |
SMLAL2 (by element, 2D) throughput 1.007320 | |
SMLAL2 (by element, 2D) throughput 1.023619 | |
SMLAL (vector, 8H) 1->1 1.991925 | |
SMLAL (vector, 8H) 1->2 2.965061 | |
SMLAL (vector, 8H) 1->3 3.031171 | |
SMLAL (vector, 8H) throughput 1.006054 | |
SMLAL (vector, 8H) throughput 1.006391 | |
SMLAL (vector, 4S) 1->1 2.022681 | |
SMLAL (vector, 4S) 1->2 2.968041 | |
SMLAL (vector, 4S) 1->3 2.982587 | |
SMLAL (vector, 4S) throughput 1.008911 | |
SMLAL (vector, 4S) throughput 0.988664 | |
SMLAL (vector, 2D) 1->1 2.016841 | |
SMLAL (vector, 2D) 1->2 3.021547 | |
SMLAL (vector, 2D) 1->3 3.013640 | |
SMLAL (vector, 2D) throughput 1.008892 | |
SMLAL (vector, 2D) throughput 1.007548 | |
SMLAL2 (vector, 8H) 1->1 2.020276 | |
SMLAL2 (vector, 8H) 1->2 2.999020 | |
SMLAL2 (vector, 8H) 1->3 3.048935 | |
SMLAL2 (vector, 8H) throughput 1.006715 | |
SMLAL2 (vector, 8H) throughput 1.000328 | |
SMLAL2 (vector, 4S) 1->1 1.964508 | |
SMLAL2 (vector, 4S) 1->2 3.030549 | |
SMLAL2 (vector, 4S) 1->3 3.024920 | |
SMLAL2 (vector, 4S) throughput 0.991948 | |
SMLAL2 (vector, 4S) throughput 0.995149 | |
SMLAL2 (vector, 2D) 1->1 1.998314 | |
SMLAL2 (vector, 2D) 1->2 2.899380 | |
SMLAL2 (vector, 2D) 1->3 3.026814 | |
SMLAL2 (vector, 2D) throughput 1.007750 | |
SMLAL2 (vector, 2D) throughput 1.039592 | |
SMLSL (by element, 4S) 1->1 2.009166 | |
SMLSL (by element, 4S) 1->2 3.137904 | |
SMLSL (by element, 4S) 1->3 2.987486 | |
SMLSL (by element, 4S) throughput 1.015109 | |
SMLSL (by element, 4S) throughput 1.012478 | |
SMLSL (by element, 2D) 1->1 2.005500 | |
SMLSL (by element, 2D) 1->2 3.011338 | |
SMLSL (by element, 2D) 1->3 2.970615 | |
SMLSL (by element, 2D) throughput 1.011992 | |
SMLSL (by element, 2D) throughput 1.004947 | |
SMLSL2 (by element, 4S) 1->1 1.989763 | |
SMLSL2 (by element, 4S) 1->2 3.047848 | |
SMLSL2 (by element, 4S) 1->3 2.951297 | |
SMLSL2 (by element, 4S) throughput 1.001514 | |
SMLSL2 (by element, 4S) throughput 1.026799 | |
SMLSL2 (by element, 2D) 1->1 2.032012 | |
SMLSL2 (by element, 2D) 1->2 2.992554 | |
SMLSL2 (by element, 2D) 1->3 3.045947 | |
SMLSL2 (by element, 2D) throughput 1.008740 | |
SMLSL2 (by element, 2D) throughput 1.007644 | |
SMLSL (vector, 8H) 1->1 1.996996 | |
SMLSL (vector, 8H) 1->2 3.065685 | |
SMLSL (vector, 8H) 1->3 3.135816 | |
SMLSL (vector, 8H) throughput 1.006559 | |
SMLSL (vector, 8H) throughput 1.007429 | |
SMLSL (vector, 4S) 1->1 1.994349 | |
SMLSL (vector, 4S) 1->2 2.819862 | |
SMLSL (vector, 4S) 1->3 3.024500 | |
SMLSL (vector, 4S) throughput 1.010604 | |
SMLSL (vector, 4S) throughput 0.978929 | |
SMLSL (vector, 2D) 1->1 1.932752 | |
SMLSL (vector, 2D) 1->2 3.046364 | |
SMLSL (vector, 2D) 1->3 3.002125 | |
SMLSL (vector, 2D) throughput 1.010234 | |
SMLSL (vector, 2D) throughput 1.014273 | |
SMLSL2 (vector, 8H) 1->1 2.002562 | |
SMLSL2 (vector, 8H) 1->2 2.738021 | |
SMLSL2 (vector, 8H) 1->3 3.145145 | |
SMLSL2 (vector, 8H) throughput 1.167923 | |
SMLSL2 (vector, 8H) throughput 1.011521 | |
SMLSL2 (vector, 4S) 1->1 1.998554 | |
SMLSL2 (vector, 4S) 1->2 3.027955 | |
SMLSL2 (vector, 4S) 1->3 3.034694 | |
SMLSL2 (vector, 4S) throughput 1.035149 | |
SMLSL2 (vector, 4S) throughput 1.005616 | |
SMLSL2 (vector, 2D) 1->1 1.971424 | |
SMLSL2 (vector, 2D) 1->2 2.998181 | |
SMLSL2 (vector, 2D) 1->3 3.005257 | |
SMLSL2 (vector, 2D) throughput 0.993260 | |
SMLSL2 (vector, 2D) throughput 1.013377 | |
SMULL (by element, 4S) 1->2 3.028554 | |
SMULL (by element, 4S) 1->3 3.002851 | |
SMULL (by element, 4S) throughput 1.017259 | |
SMULL (by element, 2D) 1->2 3.028333 | |
SMULL (by element, 2D) 1->3 2.994239 | |
SMULL (by element, 2D) throughput 1.007325 | |
SMULL2 (by element, 4S) 1->2 2.888566 | |
SMULL2 (by element, 4S) 1->3 2.939609 | |
SMULL2 (by element, 4S) throughput 1.017117 | |
SMULL2 (by element, 2D) 1->2 3.025217 | |
SMULL2 (by element, 2D) 1->3 3.050674 | |
SMULL2 (by element, 2D) throughput 1.004841 | |
SMULL (vector, 8H) 1->2 2.997894 | |
SMULL (vector, 8H) 1->3 2.962427 | |
SMULL (vector, 8H) throughput 1.012578 | |
SMULL (vector, 4S) 1->2 3.021884 | |
SMULL (vector, 4S) 1->3 3.038248 | |
SMULL (vector, 4S) throughput 1.011746 | |
SMULL (vector, 2D) 1->2 2.996088 | |
SMULL (vector, 2D) 1->3 2.996722 | |
SMULL (vector, 2D) throughput 1.010501 | |
SMULL2 (vector, 8H) 1->2 2.997676 | |
SMULL2 (vector, 8H) 1->3 3.038463 | |
SMULL2 (vector, 8H) throughput 0.973446 | |
SMULL2 (vector, 4S) 1->2 3.026666 | |
SMULL2 (vector, 4S) 1->3 3.076801 | |
SMULL2 (vector, 4S) throughput 0.997702 | |
SMULL2 (vector, 2D) 1->2 2.994197 | |
SMULL2 (vector, 2D) 1->3 3.029254 | |
SMULL2 (vector, 2D) throughput 0.987966 | |
SQABS (scalar, B) 1->2 3.030896 | |
SQABS (scalar, B) throughput 1.008203 | |
SQABS (scalar, H) 1->2 2.981377 | |
SQABS (scalar, H) throughput 1.009071 | |
SQABS (scalar, S) 1->2 3.028390 | |
SQABS (scalar, S) throughput 1.009327 | |
SQABS (scalar, D) 1->2 3.052132 | |
SQABS (scalar, D) throughput 0.972796 | |
SQABS (vector, 8B) 1->2 3.040826 | |
SQABS (vector, 8B) throughput 1.006626 | |
SQABS (vector, 16B) 1->2 2.997793 | |
SQABS (vector, 16B) throughput 0.988591 | |
SQABS (vector, 4H) 1->2 2.998026 | |
SQABS (vector, 4H) throughput 1.003934 | |
SQABS (vector, 8H) 1->2 2.997916 | |
SQABS (vector, 8H) throughput 1.004963 | |
SQABS (vector, 2S) 1->2 3.021002 | |
SQABS (vector, 2S) throughput 1.007884 | |
SQABS (vector, 4S) 1->2 3.005438 | |
SQABS (vector, 4S) throughput 1.005262 | |
SQABS (vector, 2D) 1->2 3.037397 | |
SQABS (vector, 2D) throughput 1.026496 | |
SQADD (scalar, B) 1->2 3.059032 | |
SQADD (scalar, B) 1->3 3.012455 | |
SQADD (scalar, B) throughput 0.988183 | |
SQADD (scalar, H) 1->2 3.036362 | |
SQADD (scalar, H) 1->3 3.054491 | |
SQADD (scalar, H) throughput 1.012725 | |
SQADD (scalar, S) 1->2 2.940832 | |
SQADD (scalar, S) 1->3 3.030529 | |
SQADD (scalar, S) throughput 1.011217 | |
SQADD (scalar, D) 1->2 3.048315 | |
SQADD (scalar, D) 1->3 2.972213 | |
SQADD (scalar, D) throughput 0.999130 | |
SQADD (vector, 8B) 1->2 3.011260 | |
SQADD (vector, 8B) 1->3 3.037475 | |
SQADD (vector, 8B) throughput 1.015563 | |
SQADD (vector, 16B) 1->2 3.029073 | |
SQADD (vector, 16B) 1->3 3.022583 | |
SQADD (vector, 16B) throughput 1.007246 | |
SQADD (vector, 4H) 1->2 3.036450 | |
SQADD (vector, 4H) 1->3 3.007324 | |
SQADD (vector, 4H) throughput 1.013047 | |
SQADD (vector, 8H) 1->2 3.024412 | |
SQADD (vector, 8H) 1->3 3.021251 | |
SQADD (vector, 8H) throughput 1.009738 | |
SQADD (vector, 2S) 1->2 2.995616 | |
SQADD (vector, 2S) 1->3 3.040906 | |
SQADD (vector, 2S) throughput 1.005874 | |
SQADD (vector, 4S) 1->2 3.022564 | |
SQADD (vector, 4S) 1->3 3.025945 | |
SQADD (vector, 4S) throughput 1.000129 | |
SQADD (vector, 2D) 1->2 3.001302 | |
SQADD (vector, 2D) 1->3 3.000317 | |
SQADD (vector, 2D) throughput 1.005721 | |
SQDMLAL (by element, S) 1->1 2.063067 | |
SQDMLAL (by element, S) 1->2 3.031570 | |
SQDMLAL (by element, S) 1->3 3.020869 | |
SQDMLAL (by element, S) throughput 1.011596 | |
SQDMLAL (by element, S) throughput 1.009479 | |
SQDMLAL (by element, D) 1->1 2.001521 | |
SQDMLAL (by element, D) 1->2 3.070309 | |
SQDMLAL (by element, D) 1->3 2.965584 | |
SQDMLAL (by element, D) throughput 1.006835 | |
SQDMLAL (by element, D) throughput 1.000458 | |
SQDMLAL (by element, 4S) 1->1 1.996842 | |
SQDMLAL (by element, 4S) 1->2 2.995337 | |
SQDMLAL (by element, 4S) 1->3 2.997131 | |
SQDMLAL (by element, 4S) throughput 1.063145 | |
SQDMLAL (by element, 4S) throughput 1.008735 | |
SQDMLAL (by element, 2D) 1->1 2.001141 | |
SQDMLAL (by element, 2D) 1->2 3.010072 | |
SQDMLAL (by element, 2D) 1->3 2.984179 | |
SQDMLAL (by element, 2D) throughput 1.013829 | |
SQDMLAL (by element, 2D) throughput 1.003164 | |
SQDMLAL2 (by element, 4S) 1->1 1.992423 | |
SQDMLAL2 (by element, 4S) 1->2 2.978450 | |
SQDMLAL2 (by element, 4S) 1->3 2.994436 | |
SQDMLAL2 (by element, 4S) throughput 1.007557 | |
SQDMLAL2 (by element, 4S) throughput 1.007814 | |
SQDMLAL2 (by element, 2D) 1->1 2.058875 | |
SQDMLAL2 (by element, 2D) 1->2 2.996821 | |
SQDMLAL2 (by element, 2D) 1->3 3.011843 | |
SQDMLAL2 (by element, 2D) throughput 1.010135 | |
SQDMLAL2 (by element, 2D) throughput 1.008760 | |
SQDMLAL (scalar, S) 1->1 2.020211 | |
SQDMLAL (scalar, S) 1->2 2.950131 | |
SQDMLAL (scalar, S) 1->3 3.000282 | |
SQDMLAL (scalar, S) throughput 1.008622 | |
SQDMLAL (scalar, S) throughput 1.010580 | |
SQDMLAL (scalar, D) 1->1 2.035017 | |
SQDMLAL (scalar, D) 1->2 3.015484 | |
SQDMLAL (scalar, D) 1->3 3.082880 | |
SQDMLAL (scalar, D) throughput 1.011757 | |
SQDMLAL (scalar, D) throughput 1.019818 | |
SQDMLAL (vector, 4S) 1->1 2.050013 | |
SQDMLAL (vector, 4S) 1->2 3.016384 | |
SQDMLAL (vector, 4S) 1->3 3.035496 | |
SQDMLAL (vector, 4S) throughput 1.010305 | |
SQDMLAL (vector, 4S) throughput 0.991992 | |
SQDMLAL (vector, 2D) 1->1 1.988543 | |
SQDMLAL (vector, 2D) 1->2 3.002507 | |
SQDMLAL (vector, 2D) 1->3 2.994396 | |
SQDMLAL (vector, 2D) throughput 1.007203 | |
SQDMLAL (vector, 2D) throughput 1.000294 | |
SQDMLAL2 (vector, 4S) 1->1 2.016075 | |
SQDMLAL2 (vector, 4S) 1->2 2.960048 | |
SQDMLAL2 (vector, 4S) 1->3 2.990349 | |
SQDMLAL2 (vector, 4S) throughput 1.012169 | |
SQDMLAL2 (vector, 4S) throughput 0.978944 | |
SQDMLAL2 (vector, 2D) 1->1 2.008731 | |
SQDMLAL2 (vector, 2D) 1->2 2.946622 | |
SQDMLAL2 (vector, 2D) 1->3 2.956194 | |
SQDMLAL2 (vector, 2D) throughput 1.005384 | |
SQDMLAL2 (vector, 2D) throughput 0.999726 | |
SQDMLSL (by element, S) 1->1 2.008769 | |
SQDMLSL (by element, S) 1->2 2.997732 | |
SQDMLSL (by element, S) 1->3 3.029536 | |
SQDMLSL (by element, S) throughput 1.000715 | |
SQDMLSL (by element, S) throughput 1.033963 | |
SQDMLSL (by element, D) 1->1 3.077649 | |
SQDMLSL (by element, D) 1->2 3.019847 | |
SQDMLSL (by element, D) 1->3 3.006729 | |
SQDMLSL (by element, D) throughput 1.003004 | |
SQDMLSL (by element, D) throughput 1.006773 | |
SQDMLSL (by element, 4S) 1->1 2.006883 | |
SQDMLSL (by element, 4S) 1->2 2.988174 | |
SQDMLSL (by element, 4S) 1->3 3.011317 | |
SQDMLSL (by element, 4S) throughput 1.014966 | |
SQDMLSL (by element, 4S) throughput 1.005343 | |
SQDMLSL (by element, 2D) 1->1 1.980101 | |
SQDMLSL (by element, 2D) 1->2 2.963060 | |
SQDMLSL (by element, 2D) 1->3 3.024235 | |
SQDMLSL (by element, 2D) throughput 1.004592 | |
SQDMLSL (by element, 2D) throughput 1.003652 | |
SQDMLSL2 (by element, 4S) 1->1 2.009494 | |
SQDMLSL2 (by element, 4S) 1->2 3.008662 | |
SQDMLSL2 (by element, 4S) 1->3 3.017479 | |
SQDMLSL2 (by element, 4S) throughput 1.012346 | |
SQDMLSL2 (by element, 4S) throughput 1.010358 | |
SQDMLSL2 (by element, 2D) 1->1 2.047914 | |
SQDMLSL2 (by element, 2D) 1->2 3.003910 | |
SQDMLSL2 (by element, 2D) 1->3 3.008754 | |
SQDMLSL2 (by element, 2D) throughput 0.999300 | |
SQDMLSL2 (by element, 2D) throughput 1.010722 | |
SQDMLSL (scalar, S) 1->1 2.012161 | |
SQDMLSL (scalar, S) 1->2 3.006063 | |
SQDMLSL (scalar, S) 1->3 3.041839 | |
SQDMLSL (scalar, S) throughput 1.003093 | |
SQDMLSL (scalar, S) throughput 0.944744 | |
SQDMLSL (scalar, D) 1->1 1.952814 | |
SQDMLSL (scalar, D) 1->2 3.001665 | |
SQDMLSL (scalar, D) 1->3 2.981992 | |
SQDMLSL (scalar, D) throughput 1.005814 | |
SQDMLSL (scalar, D) throughput 1.008966 | |
SQDMLSL (vector, 4S) 1->1 2.033534 | |
SQDMLSL (vector, 4S) 1->2 3.017303 | |
SQDMLSL (vector, 4S) 1->3 3.036988 | |
SQDMLSL (vector, 4S) throughput 1.011478 | |
SQDMLSL (vector, 4S) throughput 0.990710 | |
SQDMLSL (vector, 2D) 1->1 1.976681 | |
SQDMLSL (vector, 2D) 1->2 3.097330 | |
SQDMLSL (vector, 2D) 1->3 3.050067 | |
SQDMLSL (vector, 2D) throughput 0.978688 | |
SQDMLSL (vector, 2D) throughput 1.002714 | |
SQDMLSL2 (vector, 4S) 1->1 2.046299 | |
SQDMLSL2 (vector, 4S) 1->2 3.007451 | |
SQDMLSL2 (vector, 4S) 1->3 3.046911 | |
SQDMLSL2 (vector, 4S) throughput 1.012941 | |
SQDMLSL2 (vector, 4S) throughput 1.005545 | |
SQDMLSL2 (vector, 2D) 1->1 2.001096 | |
SQDMLSL2 (vector, 2D) 1->2 3.030086 | |
SQDMLSL2 (vector, 2D) 1->3 2.998171 | |
SQDMLSL2 (vector, 2D) throughput 1.006908 | |
SQDMLSL2 (vector, 2D) throughput 1.023380 | |
SQDMULH (by element, H) 1->2 3.018221 | |
SQDMULH (by element, H) 1->3 2.994414 | |
SQDMULH (by element, H) throughput 1.010454 | |
SQDMULH (by element, S) 1->2 3.017036 | |
SQDMULH (by element, S) 1->3 3.003224 | |
SQDMULH (by element, S) throughput 0.994935 | |
SQDMULH (by element, 4H) 1->2 2.999781 | |
SQDMULH (by element, 4H) 1->3 3.008819 | |
SQDMULH (by element, 4H) throughput 1.008895 | |
SQDMULH (by element, 8H) 1->2 2.997152 | |
SQDMULH (by element, 8H) 1->3 3.051179 | |
SQDMULH (by element, 8H) throughput 1.010914 | |
SQDMULH (by element, 2S) 1->2 3.001971 | |
SQDMULH (by element, 2S) 1->3 3.011320 | |
SQDMULH (by element, 2S) throughput 1.026674 | |
SQDMULH (by element, 4S) 1->2 2.994590 | |
SQDMULH (by element, 4S) 1->3 3.073794 | |
SQDMULH (by element, 4S) throughput 1.010619 | |
SQDMULH (scalar, H) 1->2 3.003027 | |
SQDMULH (scalar, H) 1->3 3.027681 | |
SQDMULH (scalar, H) throughput 1.008266 | |
SQDMULH (scalar, S) 1->2 2.957018 | |
SQDMULH (scalar, S) 1->3 3.019689 | |
SQDMULH (scalar, S) throughput 1.012640 | |
SQDMULH (vector, 4H) 1->2 3.029588 | |
SQDMULH (vector, 4H) 1->3 3.052536 | |
SQDMULH (vector, 4H) throughput 0.995427 | |
SQDMULH (vector, 8H) 1->2 3.007565 | |
SQDMULH (vector, 8H) 1->3 3.043273 | |
SQDMULH (vector, 8H) throughput 1.015234 | |
SQDMULH (vector, 2S) 1->2 3.009883 | |
SQDMULH (vector, 2S) 1->3 2.991053 | |
SQDMULH (vector, 2S) throughput 1.010461 | |
SQDMULH (vector, 4S) 1->2 3.016951 | |
SQDMULH (vector, 4S) 1->3 3.027263 | |
SQDMULH (vector, 4S) throughput 1.006139 | |
SQDMULL (by element, S) 1->2 3.021293 | |
SQDMULL (by element, S) 1->3 3.023933 | |
SQDMULL (by element, S) throughput 1.009658 | |
SQDMULL (by element, D) 1->2 2.996364 | |
SQDMULL (by element, D) 1->3 3.091142 | |
SQDMULL (by element, D) throughput 0.985792 | |
SQDMULL (by element, 4S) 1->2 3.005023 | |
SQDMULL (by element, 4S) 1->3 3.032151 | |
SQDMULL (by element, 4S) throughput 1.009910 | |
SQDMULL (by element, 2D) 1->2 3.010271 | |
SQDMULL (by element, 2D) 1->3 3.027407 | |
SQDMULL (by element, 2D) throughput 1.156127 | |
SQDMULL2 (by element, 4S) 1->2 2.939879 | |
SQDMULL2 (by element, 4S) 1->3 3.010096 | |
SQDMULL2 (by element, 4S) throughput 1.012005 | |
SQDMULL2 (by element, 2D) 1->2 3.041270 | |
SQDMULL2 (by element, 2D) 1->3 2.994283 | |
SQDMULL2 (by element, 2D) throughput 1.010870 | |
SQDMULL (scalar, S) 1->2 3.024021 | |
SQDMULL (scalar, S) 1->3 2.999913 | |
SQDMULL (scalar, S) throughput 0.990838 | |
SQDMULL (scalar, D) 1->2 3.043764 | |
SQDMULL (scalar, D) 1->3 3.005856 | |
SQDMULL (scalar, D) throughput 1.006343 | |
SQDMULL (vector, 4S) 1->2 2.998201 | |
SQDMULL (vector, 4S) 1->3 3.006724 | |
SQDMULL (vector, 4S) throughput 1.020151 | |
SQDMULL (vector, 2D) 1->2 2.930049 | |
SQDMULL (vector, 2D) 1->3 3.048686 | |
SQDMULL (vector, 2D) throughput 1.006360 | |
SQDMULL2 (vector, 4S) 1->2 3.013088 | |
SQDMULL2 (vector, 4S) 1->3 3.026165 | |
SQDMULL2 (vector, 4S) throughput 1.008745 | |
SQDMULL2 (vector, 2D) 1->2 2.960073 | |
SQDMULL2 (vector, 2D) 1->3 3.006176 | |
SQDMULL2 (vector, 2D) throughput 1.022609 | |
SQNEG (scalar, B) 1->2 3.022970 | |
SQNEG (scalar, B) throughput 0.991153 | |
SQNEG (scalar, H) 1->2 2.994067 | |
SQNEG (scalar, H) throughput 0.999937 | |
SQNEG (scalar, S) 1->2 3.007601 | |
SQNEG (scalar, S) throughput 1.014851 | |
SQNEG (scalar, D) 1->2 3.008448 | |
SQNEG (scalar, D) throughput 1.013411 | |
SQNEG (vector, 8B) 1->2 3.015064 | |
SQNEG (vector, 8B) throughput 0.992666 | |
SQNEG (vector, 16B) 1->2 3.023612 | |
SQNEG (vector, 16B) throughput 0.992200 | |
SQNEG (vector, 4H) 1->2 3.458001 | |
SQNEG (vector, 4H) throughput 0.982479 | |
SQNEG (vector, 8H) 1->2 3.012727 | |
SQNEG (vector, 8H) throughput 1.012153 | |
SQNEG (vector, 2S) 1->2 3.000043 | |
SQNEG (vector, 2S) throughput 1.010406 | |
SQNEG (vector, 4S) 1->2 3.024931 | |
SQNEG (vector, 4S) throughput 0.989045 | |
SQNEG (vector, 2D) 1->2 3.080291 | |
SQNEG (vector, 2D) throughput 0.997592 | |
SQRDMULH (by element, H) 1->2 2.988678 | |
SQRDMULH (by element, H) 1->3 3.018002 | |
SQRDMULH (by element, H) throughput 0.997913 | |
SQRDMULH (by element, S) 1->2 3.062211 | |
SQRDMULH (by element, S) 1->3 3.045826 | |
SQRDMULH (by element, S) throughput 1.004968 | |
SQRDMULH (by element, 4H) 1->2 2.992526 | |
SQRDMULH (by element, 4H) 1->3 3.094218 | |
SQRDMULH (by element, 4H) throughput 1.035839 | |
SQRDMULH (by element, 8H) 1->2 3.012952 | |
SQRDMULH (by element, 8H) 1->3 2.996358 | |
SQRDMULH (by element, 8H) throughput 1.005388 | |
SQRDMULH (by element, 2S) 1->2 3.033996 | |
SQRDMULH (by element, 2S) 1->3 2.980525 | |
SQRDMULH (by element, 2S) throughput 1.010288 | |
SQRDMULH (by element, 4S) 1->2 3.018518 | |
SQRDMULH (by element, 4S) 1->3 2.964894 | |
SQRDMULH (by element, 4S) throughput 1.004546 | |
SQRDMULH (scalar, H) 1->2 3.033961 | |
SQRDMULH (scalar, H) 1->3 3.028367 | |
SQRDMULH (scalar, H) throughput 0.994179 | |
SQRDMULH (scalar, S) 1->2 3.027590 | |
SQRDMULH (scalar, S) 1->3 2.991406 | |
SQRDMULH (scalar, S) throughput 1.006067 | |
SQRDMULH (vector, 4H) 1->2 3.032329 | |
SQRDMULH (vector, 4H) 1->3 3.035980 | |
SQRDMULH (vector, 4H) throughput 1.001773 | |
SQRDMULH (vector, 8H) 1->2 3.024292 | |
SQRDMULH (vector, 8H) 1->3 3.047216 | |
SQRDMULH (vector, 8H) throughput 1.008829 | |
SQRDMULH (vector, 2S) 1->2 3.111107 | |
SQRDMULH (vector, 2S) 1->3 3.002881 | |
SQRDMULH (vector, 2S) throughput 0.991666 | |
SQRDMULH (vector, 4S) 1->2 3.022658 | |
SQRDMULH (vector, 4S) 1->3 3.034676 | |
SQRDMULH (vector, 4S) throughput 1.009273 | |
SQRSHL (scalar, B) 1->2 2.941774 | |
SQRSHL (scalar, B) 1->3 2.961367 | |
SQRSHL (scalar, B) throughput 1.015579 | |
SQRSHL (scalar, H) 1->2 3.023225 | |
SQRSHL (scalar, H) 1->3 3.061363 | |
SQRSHL (scalar, H) throughput 1.003523 | |
SQRSHL (scalar, S) 1->2 3.024466 | |
SQRSHL (scalar, S) 1->3 3.011196 | |
SQRSHL (scalar, S) throughput 1.006383 | |
SQRSHL (scalar, D) 1->2 3.029044 | |
SQRSHL (scalar, D) 1->3 2.995321 | |
SQRSHL (scalar, D) throughput 0.969640 | |
SQRSHL (vector, 8B) 1->2 3.021308 | |
SQRSHL (vector, 8B) 1->3 3.028128 | |
SQRSHL (vector, 8B) throughput 0.993309 | |
SQRSHL (vector, 16B) 1->2 2.947511 | |
SQRSHL (vector, 16B) 1->3 3.027136 | |
SQRSHL (vector, 16B) throughput 1.006448 | |
SQRSHL (vector, 4H) 1->2 3.013844 | |
SQRSHL (vector, 4H) 1->3 3.009637 | |
SQRSHL (vector, 4H) throughput 0.997915 | |
SQRSHL (vector, 8H) 1->2 3.033295 | |
SQRSHL (vector, 8H) 1->3 3.017171 | |
SQRSHL (vector, 8H) throughput 1.005028 | |
SQRSHL (vector, 2S) 1->2 2.986493 | |
SQRSHL (vector, 2S) 1->3 3.028522 | |
SQRSHL (vector, 2S) throughput 1.041037 | |
SQRSHL (vector, 4S) 1->2 2.999002 | |
SQRSHL (vector, 4S) 1->3 3.012777 | |
SQRSHL (vector, 4S) throughput 1.020321 | |
SQRSHL (vector, 2D) 1->2 3.022022 | |
SQRSHL (vector, 2D) 1->3 2.992322 | |
SQRSHL (vector, 2D) throughput 1.011341 | |
SQRSHRN (H) 1->2 4.044909 | |
SQRSHRN (H) throughput 1.008622 | |
SQRSHRN (S) 1->2 3.990626 | |
SQRSHRN (S) throughput 1.007941 | |
SQRSHRN (D) 1->2 4.016668 | |
SQRSHRN (D) throughput 1.007334 | |
SQRSHRN (8H) 1->2 4.001891 | |
SQRSHRN (8H) throughput 0.997130 | |
SQRSHRN (4S) 1->2 4.075342 | |
SQRSHRN (4S) throughput 1.005955 | |
SQRSHRN (2D) 1->2 4.060986 | |
SQRSHRN (2D) throughput 1.012811 | |
SQRSHRN2 (8H) 1->1 3.945436 | |
SQRSHRN2 (8H) 1->2 4.043918 | |
SQRSHRN2 (8H) throughput 1.007572 | |
SQRSHRN2 (8H) throughput 1.012020 | |
SQRSHRN2 (4S) 1->1 4.010794 | |
SQRSHRN2 (4S) 1->2 4.062548 | |
SQRSHRN2 (4S) throughput 0.993664 | |
SQRSHRN2 (4S) throughput 1.023827 | |
SQRSHRN2 (2D) 1->1 4.041180 | |
SQRSHRN2 (2D) 1->2 4.047548 | |
SQRSHRN2 (2D) throughput 1.012155 | |
SQRSHRN2 (2D) throughput 1.005845 | |
SQRSHRUN (H) 1->2 4.046964 | |
SQRSHRUN (H) throughput 0.993446 | |
SQRSHRUN (S) 1->2 4.020155 | |
SQRSHRUN (S) throughput 1.015091 | |
SQRSHRUN (D) 1->2 3.996869 | |
SQRSHRUN (D) throughput 1.007691 | |
SQRSHRUN (8H) 1->2 4.020304 | |
SQRSHRUN (8H) throughput 1.002897 | |
SQRSHRUN (4S) 1->2 4.050080 | |
SQRSHRUN (4S) throughput 1.005837 | |
SQRSHRUN (2D) 1->2 3.995072 | |
SQRSHRUN (2D) throughput 0.975688 | |
SQRSHRUN2 (8H) 1->1 3.992269 | |
SQRSHRUN2 (8H) 1->2 4.036976 | |
SQRSHRUN2 (8H) throughput 1.011830 | |
SQRSHRUN2 (8H) throughput 1.019330 | |
SQRSHRUN2 (4S) 1->1 4.040445 | |
SQRSHRUN2 (4S) 1->2 4.026185 | |
SQRSHRUN2 (4S) throughput 1.003689 | |
SQRSHRUN2 (4S) throughput 1.024804 | |
SQRSHRUN2 (2D) 1->1 4.008378 | |
SQRSHRUN2 (2D) 1->2 3.992595 | |
SQRSHRUN2 (2D) throughput 1.006839 | |
SQRSHRUN2 (2D) throughput 1.005814 | |
SQSHL (immediate, scalar, B) 1->2 2.002807 | |
SQSHL (immediate, scalar, B) throughput 1.004761 | |
SQSHL (immediate, scalar, H) 1->2 2.001141 | |
SQSHL (immediate, scalar, H) throughput 1.022186 | |
SQSHL (immediate, scalar, S) 1->2 1.998204 | |
SQSHL (immediate, scalar, S) throughput 1.022394 | |
SQSHL (immediate, scalar, D) 1->2 2.093115 | |
SQSHL (immediate, scalar, D) throughput 0.992772 | |
SQSHL (immediate, vector, 8B) 1->2 1.990732 | |
SQSHL (immediate, vector, 8B) throughput 0.997495 | |
SQSHL (immediate, vector, 16B) 1->2 1.998796 | |
SQSHL (immediate, vector, 16B) throughput 1.011386 | |
SQSHL (immediate, vector, 4H) 1->2 2.043820 | |
SQSHL (immediate, vector, 4H) throughput 1.016537 | |
SQSHL (immediate, vector, 8H) 1->2 2.042974 | |
SQSHL (immediate, vector, 8H) throughput 0.985141 | |
SQSHL (immediate, vector, 2S) 1->2 2.004539 | |
SQSHL (immediate, vector, 2S) throughput 1.012469 | |
SQSHL (immediate, vector, 4S) 1->2 2.019520 | |
SQSHL (immediate, vector, 4S) throughput 1.006394 | |
SQSHL (immediate, vector, 2D) 1->2 2.054789 | |
SQSHL (immediate, vector, 2D) throughput 1.007273 | |
SQSHL (scalar, B) 1->2 2.029979 | |
SQSHL (scalar, B) 1->3 2.005345 | |
SQSHL (scalar, B) throughput 0.976494 | |
SQSHL (scalar, H) 1->2 2.008876 | |
SQSHL (scalar, H) 1->3 2.024345 | |
SQSHL (scalar, H) throughput 1.006258 | |
SQSHL (scalar, S) 1->2 1.984060 | |
SQSHL (scalar, S) 1->3 2.024626 | |
SQSHL (scalar, S) throughput 0.990092 | |
SQSHL (scalar, D) 1->2 2.008170 | |
SQSHL (scalar, D) 1->3 2.045979 | |
SQSHL (scalar, D) throughput 1.005439 | |
SQSHL (vector, 8B) 1->2 1.957525 | |
SQSHL (vector, 8B) 1->3 2.007276 | |
SQSHL (vector, 8B) throughput 1.000290 | |
SQSHL (vector, 16B) 1->2 1.998577 | |
SQSHL (vector, 16B) 1->3 2.021055 | |
SQSHL (vector, 16B) throughput 1.011017 | |
SQSHL (vector, 4H) 1->2 1.996001 | |
SQSHL (vector, 4H) 1->3 2.031004 | |
SQSHL (vector, 4H) throughput 1.026959 | |
SQSHL (vector, 8H) 1->2 2.022018 | |
SQSHL (vector, 8H) 1->3 1.997215 | |
SQSHL (vector, 8H) throughput 1.013342 | |
SQSHL (vector, 2S) 1->2 1.980555 | |
SQSHL (vector, 2S) 1->3 1.998180 | |
SQSHL (vector, 2S) throughput 1.010001 | |
SQSHL (vector, 4S) 1->2 2.014343 | |
SQSHL (vector, 4S) 1->3 2.042580 | |
SQSHL (vector, 4S) throughput 0.991113 | |
SQSHL (vector, 2D) 1->2 2.050667 | |
SQSHL (vector, 2D) 1->3 1.992473 | |
SQSHL (vector, 2D) throughput 0.991783 | |
SQSHLU (scalar, B) 1->2 1.997987 | |
SQSHLU (scalar, B) throughput 1.010819 | |
SQSHLU (scalar, H) 1->2 2.015922 | |
SQSHLU (scalar, H) throughput 1.009796 | |
SQSHLU (scalar, S) 1->2 2.040741 | |
SQSHLU (scalar, S) throughput 1.009644 | |
SQSHLU (scalar, D) 1->2 2.028307 | |
SQSHLU (scalar, D) throughput 0.998967 | |
SQSHLU (vector, 8B) 1->2 2.024871 | |
SQSHLU (vector, 8B) throughput 1.011381 | |
SQSHLU (vector, 16B) 1->2 2.006400 | |
SQSHLU (vector, 16B) throughput 1.010938 | |
SQSHLU (vector, 4H) 1->2 2.005169 | |
SQSHLU (vector, 4H) throughput 1.007738 | |
SQSHLU (vector, 8H) 1->2 1.956433 | |
SQSHLU (vector, 8H) throughput 1.007000 | |
SQSHLU (vector, 2S) 1->2 2.014839 | |
SQSHLU (vector, 2S) throughput 0.995066 | |
SQSHLU (vector, 4S) 1->2 2.053489 | |
SQSHLU (vector, 4S) throughput 1.016482 | |
SQSHLU (vector, 2D) 1->2 1.975397 | |
SQSHLU (vector, 2D) throughput 0.997049 | |
SQSHRN (H) 1->2 4.074271 | |
SQSHRN (H) throughput 1.004941 | |
SQSHRN (S) 1->2 4.024059 | |
SQSHRN (S) throughput 1.001946 | |
SQSHRN (D) 1->2 4.011118 | |
SQSHRN (D) throughput 1.005894 | |
SQSHRN (8H) 1->2 4.061210 | |
SQSHRN (8H) throughput 1.011185 | |
SQSHRN (4S) 1->2 4.048537 | |
SQSHRN (4S) throughput 1.006635 | |
SQSHRN (2D) 1->2 4.050717 | |
SQSHRN (2D) throughput 1.009690 | |
SQSHRN2 (8H) 1->1 4.048465 | |
SQSHRN2 (8H) 1->2 4.020062 | |
SQSHRN2 (8H) throughput 1.015373 | |
SQSHRN2 (8H) throughput 0.999781 | |
SQSHRN2 (4S) 1->1 4.022341 | |
SQSHRN2 (4S) 1->2 4.019364 | |
SQSHRN2 (4S) throughput 1.015449 | |
SQSHRN2 (4S) throughput 1.010501 | |
SQSHRN2 (2D) 1->1 4.070353 | |
SQSHRN2 (2D) 1->2 3.913278 | |
SQSHRN2 (2D) throughput 1.015200 | |
SQSHRN2 (2D) throughput 0.994711 | |
SQSHRUN (H) 1->2 4.019676 | |
SQSHRUN (H) throughput 1.007204 | |
SQSHRUN (S) 1->2 4.055879 | |
SQSHRUN (S) throughput 1.004375 | |
SQSHRUN (D) 1->2 4.016405 | |
SQSHRUN (D) throughput 1.006677 | |
SQSHRUN (8H) 1->2 4.067435 | |
SQSHRUN (8H) throughput 1.005798 | |
SQSHRUN (4S) 1->2 4.014255 | |
SQSHRUN (4S) throughput 1.007143 | |
SQSHRUN (2D) 1->2 4.004206 | |
SQSHRUN (2D) throughput 1.070318 | |
SQSHRUN2 (8H) 1->1 4.017183 | |
SQSHRUN2 (8H) 1->2 3.989229 | |
SQSHRUN2 (8H) throughput 1.001304 | |
SQSHRUN2 (8H) throughput 1.004631 | |
SQSHRUN2 (4S) 1->1 4.037875 | |
SQSHRUN2 (4S) 1->2 4.033689 | |
SQSHRUN2 (4S) throughput 1.006090 | |
SQSHRUN2 (4S) throughput 0.994905 | |
SQSHRUN2 (2D) 1->1 3.917931 | |
SQSHRUN2 (2D) 1->2 4.054396 | |
SQSHRUN2 (2D) throughput 0.998261 | |
SQSHRUN2 (2D) throughput 1.007131 | |
SQSUB (scalar, B) 1->2 2.981465 | |
SQSUB (scalar, B) 1->3 3.008732 | |
SQSUB (scalar, B) throughput 0.985692 | |
SQSUB (scalar, H) 1->2 3.005388 | |
SQSUB (scalar, H) 1->3 2.999611 | |
SQSUB (scalar, H) throughput 0.993227 | |
SQSUB (scalar, S) 1->2 3.050701 | |
SQSUB (scalar, S) 1->3 2.942637 | |
SQSUB (scalar, S) throughput 1.007241 | |
SQSUB (scalar, D) 1->2 3.017470 | |
SQSUB (scalar, D) 1->3 3.031626 | |
SQSUB (scalar, D) throughput 1.005404 | |
SQSUB (vector, 8B) 1->2 3.535745 | |
SQSUB (vector, 8B) 1->3 3.002093 | |
SQSUB (vector, 8B) throughput 1.005792 | |
SQSUB (vector, 16B) 1->2 2.992246 | |
SQSUB (vector, 16B) 1->3 3.019290 | |
SQSUB (vector, 16B) throughput 1.004578 | |
SQSUB (vector, 4H) 1->2 2.992596 | |
SQSUB (vector, 4H) 1->3 3.011118 | |
SQSUB (vector, 4H) throughput 0.996988 | |
SQSUB (vector, 8H) 1->2 3.030460 | |
SQSUB (vector, 8H) 1->3 3.004093 | |
SQSUB (vector, 8H) throughput 1.008856 | |
SQSUB (vector, 2S) 1->2 3.056952 | |
SQSUB (vector, 2S) 1->3 3.002063 | |
SQSUB (vector, 2S) throughput 1.009047 | |
SQSUB (vector, 4S) 1->2 3.010814 | |
SQSUB (vector, 4S) 1->3 2.894054 | |
SQSUB (vector, 4S) throughput 1.039455 | |
SQSUB (vector, 2D) 1->2 3.002015 | |
SQSUB (vector, 2D) 1->3 3.007554 | |
SQSUB (vector, 2D) throughput 0.987977 | |
SQXTN (H) 1->2 3.999101 | |
SQXTN (H) throughput 1.009455 | |
SQXTN (S) 1->2 4.027434 | |
SQXTN (S) throughput 1.006020 | |
SQXTN (D) 1->2 4.020745 | |
SQXTN (D) throughput 0.992342 | |
SQXTN (8H) 1->2 3.944556 | |
SQXTN (8H) throughput 1.004991 | |
SQXTN (4S) 1->2 4.061731 | |
SQXTN (4S) throughput 1.110145 | |
SQXTN (2D) 1->2 4.017352 | |
SQXTN (2D) throughput 0.989306 | |
SQXTN2 (8H) 1->1 4.120973 | |
SQXTN2 (8H) 1->2 4.002409 | |
SQXTN2 (8H) throughput 1.006755 | |
SQXTN2 (8H) throughput 0.997769 | |
SQXTN2 (4S) 1->1 4.001798 | |
SQXTN2 (4S) 1->2 3.996399 | |
SQXTN2 (4S) throughput 1.007424 | |
SQXTN2 (4S) throughput 1.006744 | |
SQXTN2 (2D) 1->1 4.025506 | |
SQXTN2 (2D) 1->2 4.001699 | |
SQXTN2 (2D) throughput 1.003951 | |
SQXTN2 (2D) throughput 1.008574 | |
SQXTUN (H) 1->2 3.995158 | |
SQXTUN (H) throughput 0.956994 | |
SQXTUN (S) 1->2 4.501508 | |
SQXTUN (S) throughput 1.305454 | |
SQXTUN (D) 1->2 4.002290 | |
SQXTUN (D) throughput 1.065169 | |
SQXTUN (8H) 1->2 4.145597 | |
SQXTUN (8H) throughput 0.857118 | |
SQXTUN (4S) 1->2 5.614371 | |
SQXTUN (4S) throughput 0.992712 | |
SQXTUN (2D) 1->2 4.043530 | |
SQXTUN (2D) throughput 0.999016 | |
SQXTUN2 (8H) 1->1 4.068083 | |
SQXTUN2 (8H) 1->2 4.009159 | |
SQXTUN2 (8H) throughput 1.001524 | |
SQXTUN2 (8H) throughput 1.000270 | |
SQXTUN2 (4S) 1->1 4.042815 | |
SQXTUN2 (4S) 1->2 4.025439 | |
SQXTUN2 (4S) throughput 1.015190 | |
SQXTUN2 (4S) throughput 1.014095 | |
SQXTUN2 (2D) 1->1 3.964880 | |
SQXTUN2 (2D) 1->2 4.016418 | |
SQXTUN2 (2D) throughput 0.866530 | |
SQXTUN2 (2D) throughput 1.003073 | |
SRHADD (vector, 8B) 1->2 1.971217 | |
SRHADD (vector, 8B) 1->3 1.994021 | |
SRHADD (vector, 8B) throughput 1.007886 | |
SRHADD (vector, 16B) 1->2 1.986813 | |
SRHADD (vector, 16B) 1->3 2.014738 | |
SRHADD (vector, 16B) throughput 1.003170 | |
SRHADD (vector, 4H) 1->2 2.008650 | |
SRHADD (vector, 4H) 1->3 2.016537 | |
SRHADD (vector, 4H) throughput 0.979590 | |
SRHADD (vector, 8H) 1->2 2.015288 | |
SRHADD (vector, 8H) 1->3 2.006422 | |
SRHADD (vector, 8H) throughput 1.006593 | |
SRHADD (vector, 2S) 1->2 2.035875 | |
SRHADD (vector, 2S) 1->3 1.990089 | |
SRHADD (vector, 2S) throughput 0.979099 | |
SRHADD (vector, 4S) 1->2 2.006707 | |
SRHADD (vector, 4S) 1->3 1.986371 | |
SRHADD (vector, 4S) throughput 1.013202 | |
SRI (vector, 8B) 1->1 1.981956 | |
SRI (vector, 8B) 1->2 2.002940 | |
SRI (vector, 8B) throughput 0.999081 | |
SRI (vector, 8B) throughput 0.996341 | |
SRI (vector, 16B) 1->1 1.998596 | |
SRI (vector, 16B) 1->2 2.015045 | |
SRI (vector, 16B) throughput 1.014571 | |
SRI (vector, 16B) throughput 0.985127 | |
SRI (vector, 4H) 1->1 2.053954 | |
SRI (vector, 4H) 1->2 1.999979 | |
SRI (vector, 4H) throughput 1.041140 | |
SRI (vector, 4H) throughput 0.949864 | |
SRI (vector, 8H) 1->1 2.038325 | |
SRI (vector, 8H) 1->2 2.032219 | |
SRI (vector, 8H) throughput 1.028275 | |
SRI (vector, 8H) throughput 0.977902 | |
SRI (vector, 2S) 1->1 1.957389 | |
SRI (vector, 2S) 1->2 2.016164 | |
SRI (vector, 2S) throughput 1.003177 | |
SRI (vector, 2S) throughput 1.010862 | |
SRI (vector, 4S) 1->1 1.945017 | |
SRI (vector, 4S) 1->2 2.010639 | |
SRI (vector, 4S) throughput 1.036512 | |
SRI (vector, 4S) throughput 1.005793 | |
SRI (vector, 2D) 1->1 2.012017 | |
SRI (vector, 2D) 1->2 1.986241 | |
SRI (vector, 2D) throughput 1.012600 | |
SRI (vector, 2D) throughput 0.976597 | |
SRI (vector, D) 1->1 2.015145 | |
SRI (vector, D) 1->2 2.032360 | |
SRI (vector, D) throughput 1.005016 | |
SRI (vector, D) throughput 0.753173 | |
SRSHL (vector, 8B) 1->2 2.783539 | |
SRSHL (vector, 8B) 1->3 3.007711 | |
SRSHL (vector, 8B) throughput 1.027569 | |
SRSHL (vector, 16B) 1->2 3.071892 | |
SRSHL (vector, 16B) 1->3 3.010542 | |
SRSHL (vector, 16B) throughput 0.994178 | |
SRSHL (vector, 4H) 1->2 3.011301 | |
SRSHL (vector, 4H) 1->3 3.065755 | |
SRSHL (vector, 4H) throughput 1.010668 | |
SRSHL (vector, 8H) 1->2 3.013410 | |
SRSHL (vector, 8H) 1->3 2.944867 | |
SRSHL (vector, 8H) throughput 1.007963 | |
SRSHL (vector, 2S) 1->2 3.053339 | |
SRSHL (vector, 2S) 1->3 3.005630 | |
SRSHL (vector, 2S) throughput 0.987873 | |
SRSHL (vector, 4S) 1->2 3.026103 | |
SRSHL (vector, 4S) 1->3 3.027968 | |
SRSHL (vector, 4S) throughput 0.978106 | |
SRSHL (vector, 2D) 1->2 2.994525 | |
SRSHL (vector, 2D) 1->3 3.064982 | |
SRSHL (vector, 2D) throughput 0.980113 | |
SRSHL (vector, D) 1->2 3.009105 | |
SRSHL (vector, D) 1->3 3.025024 | |
SRSHL (vector, D) throughput 1.017968 | |
SRSHR (vector, 8B) 1->2 2.950108 | |
SRSHR (vector, 8B) throughput 1.002073 | |
SRSHR (vector, 16B) 1->2 3.068673 | |
SRSHR (vector, 16B) throughput 1.005291 | |
SRSHR (vector, 4H) 1->2 2.998245 | |
SRSHR (vector, 4H) throughput 1.006767 | |
SRSHR (vector, 8H) 1->2 2.998387 | |
SRSHR (vector, 8H) throughput 1.004037 | |
SRSHR (vector, 2S) 1->2 2.967227 | |
SRSHR (vector, 2S) throughput 0.998034 | |
SRSHR (vector, 4S) 1->2 3.248131 | |
SRSHR (vector, 4S) throughput 0.990468 | |
SRSHR (vector, 2D) 1->2 3.080760 | |
SRSHR (vector, 2D) throughput 1.007021 | |
SRSHR (vector, D) 1->2 3.048300 | |
SRSHR (vector, D) throughput 0.994890 | |
SRSRA (vector, 8B) 1->1 2.035251 | |
SRSRA (vector, 8B) 1->2 3.027963 | |
SRSRA (vector, 8B) throughput 0.978757 | |
SRSRA (vector, 8B) throughput 1.010549 | |
SRSRA (vector, 16B) 1->1 2.023224 | |
SRSRA (vector, 16B) 1->2 3.022920 | |
SRSRA (vector, 16B) throughput 1.020323 | |
SRSRA (vector, 16B) throughput 1.008015 | |
SRSRA (vector, 4H) 1->1 1.974387 | |
SRSRA (vector, 4H) 1->2 3.023729 | |
SRSRA (vector, 4H) throughput 1.011265 | |
SRSRA (vector, 4H) throughput 0.958811 | |
SRSRA (vector, 8H) 1->1 2.011432 | |
SRSRA (vector, 8H) 1->2 3.003690 | |
SRSRA (vector, 8H) throughput 1.004595 | |
SRSRA (vector, 8H) throughput 0.991288 | |
SRSRA (vector, 2S) 1->1 2.036990 | |
SRSRA (vector, 2S) 1->2 2.993069 | |
SRSRA (vector, 2S) throughput 0.998871 | |
SRSRA (vector, 2S) throughput 0.924663 | |
SRSRA (vector, 4S) 1->1 1.994502 | |
SRSRA (vector, 4S) 1->2 2.994567 | |
SRSRA (vector, 4S) throughput 1.009606 | |
SRSRA (vector, 4S) throughput 1.030082 | |
SRSRA (vector, 2D) 1->1 2.009475 | |
SRSRA (vector, 2D) 1->2 3.008994 | |
SRSRA (vector, 2D) throughput 0.995329 | |
SRSRA (vector, 2D) throughput 1.002546 | |
SRSRA (vector, D) 1->1 2.006443 | |
SRSRA (vector, D) 1->2 3.016337 | |
SRSRA (vector, D) throughput 1.008227 | |
SRSRA (vector, D) throughput 1.008709 | |
SSHL (vector, 8B) 1->2 2.014519 | |
SSHL (vector, 8B) 1->3 1.968412 | |
SSHL (vector, 8B) throughput 1.009319 | |
SSHL (vector, 16B) 1->2 1.997039 | |
SSHL (vector, 16B) 1->3 2.012074 | |
SSHL (vector, 16B) throughput 1.007759 | |
SSHL (vector, 4H) 1->2 1.973273 | |
SSHL (vector, 4H) 1->3 2.006620 | |
SSHL (vector, 4H) throughput 0.992478 | |
SSHL (vector, 8H) 1->2 1.994042 | |
SSHL (vector, 8H) 1->3 1.997039 | |
SSHL (vector, 8H) throughput 1.006698 | |
SSHL (vector, 2S) 1->2 2.000042 | |
SSHL (vector, 2S) 1->3 2.016488 | |
SSHL (vector, 2S) throughput 1.009561 | |
SSHL (vector, 4S) 1->2 2.056980 | |
SSHL (vector, 4S) 1->3 1.994125 | |
SSHL (vector, 4S) throughput 0.980274 | |
SSHL (vector, 2D) 1->2 2.033418 | |
SSHL (vector, 2D) 1->3 2.006444 | |
SSHL (vector, 2D) throughput 1.015772 | |
SSHL (vector, D) 1->2 2.006799 | |
SSHL (vector, D) 1->3 2.023207 | |
SSHL (vector, D) throughput 1.012886 | |
SSHLL (8H) 1->2 1.998752 | |
SSHLL (8H) throughput 1.004640 | |
SSHLL (4S) 1->2 2.006532 | |
SSHLL (4S) throughput 1.004575 | |
SSHLL (2D) 1->2 1.994462 | |
SSHLL (2D) throughput 0.997030 | |
SSHLL2 (8H) 1->2 2.006465 | |
SSHLL2 (8H) throughput 1.013222 | |
SSHLL2 (4S) 1->2 2.023415 | |
SSHLL2 (4S) throughput 1.002942 | |
SSHLL2 (2D) 1->2 1.977051 | |
SSHLL2 (2D) throughput 1.009587 | |
SSHR (vector, 8B) 1->2 2.006707 | |
SSHR (vector, 8B) throughput 1.001741 | |
SSHR (vector, 16B) 1->2 2.006642 | |
SSHR (vector, 16B) throughput 1.000596 | |
SSHR (vector, 4H) 1->2 2.025920 | |
SSHR (vector, 4H) throughput 1.010956 | |
SSHR (vector, 8H) 1->2 2.028874 | |
SSHR (vector, 8H) throughput 1.017312 | |
SSHR (vector, 2S) 1->2 1.992076 | |
SSHR (vector, 2S) throughput 0.981985 | |
SSHR (vector, 4S) 1->2 2.023256 | |
SSHR (vector, 4S) throughput 1.023756 | |
SSHR (vector, 2D) 1->2 1.985794 | |
SSHR (vector, 2D) throughput 0.993969 | |
SSHR (vector, D) 1->2 1.998558 | |
SSHR (vector, D) throughput 1.013726 | |
SSRA (vector, 8B) 1->1 2.027862 | |
SSRA (vector, 8B) 1->2 3.019492 | |
SSRA (vector, 8B) throughput 1.007081 | |
SSRA (vector, 8B) throughput 1.007624 | |
SSRA (vector, 16B) 1->1 2.008972 | |
SSRA (vector, 16B) 1->2 3.027808 | |
SSRA (vector, 16B) throughput 1.017795 | |
SSRA (vector, 16B) throughput 1.019948 | |
SSRA (vector, 4H) 1->1 2.007990 | |
SSRA (vector, 4H) 1->2 3.036741 | |
SSRA (vector, 4H) throughput 0.992146 | |
SSRA (vector, 4H) throughput 1.010888 | |
SSRA (vector, 8H) 1->1 2.028587 | |
SSRA (vector, 8H) 1->2 3.056328 | |
SSRA (vector, 8H) throughput 0.997658 | |
SSRA (vector, 8H) throughput 1.021438 | |
SSRA (vector, 2S) 1->1 1.767569 | |
SSRA (vector, 2S) 1->2 3.395435 | |
SSRA (vector, 2S) throughput 1.025766 | |
SSRA (vector, 2S) throughput 1.026062 | |
SSRA (vector, 4S) 1->1 1.969858 | |
SSRA (vector, 4S) 1->2 3.022897 | |
SSRA (vector, 4S) throughput 1.011108 | |
SSRA (vector, 4S) throughput 0.993433 | |
SSRA (vector, 2D) 1->1 2.041933 | |
SSRA (vector, 2D) 1->2 2.994046 | |
SSRA (vector, 2D) throughput 0.999225 | |
SSRA (vector, 2D) throughput 1.007019 | |
SSRA (vector, D) 1->1 2.040981 | |
SSRA (vector, D) 1->2 3.056104 | |
SSRA (vector, D) throughput 1.025781 | |
SSRA (vector, D) throughput 1.001879 | |
SSUBL (vector, 8H) 1->2 1.992582 | |
SSUBL (vector, 8H) 1->3 1.992373 | |
SSUBL (vector, 8H) throughput 0.996047 | |
SSUBL (vector, 4S) 1->2 1.993531 | |
SSUBL (vector, 4S) 1->3 2.006908 | |
SSUBL (vector, 4S) throughput 0.999679 | |
SSUBL (vector, 2D) 1->2 1.998838 | |
SSUBL (vector, 2D) 1->3 1.993977 | |
SSUBL (vector, 2D) throughput 1.019006 | |
SSUBL2 (vector, 8H) 1->2 1.993868 | |
SSUBL2 (vector, 8H) 1->3 2.021144 | |
SSUBL2 (vector, 8H) throughput 1.001925 | |
SSUBL2 (vector, 4S) 1->2 2.047370 | |
SSUBL2 (vector, 4S) 1->3 2.010686 | |
SSUBL2 (vector, 4S) throughput 1.011077 | |
SSUBL2 (vector, 2D) 1->2 2.035030 | |
SSUBL2 (vector, 2D) 1->3 2.126175 | |
SSUBL2 (vector, 2D) throughput 0.819675 | |
SSUBW (vector, 8H) 1->2 2.022038 | |
SSUBW (vector, 8H) 1->3 2.000394 | |
SSUBW (vector, 8H) throughput 1.005196 | |
SSUBW (vector, 4S) 1->2 2.014389 | |
SSUBW (vector, 4S) 1->3 2.009567 | |
SSUBW (vector, 4S) throughput 1.015396 | |
SSUBW (vector, 2D) 1->2 2.123727 | |
SSUBW (vector, 2D) 1->3 2.125596 | |
SSUBW (vector, 2D) throughput 1.017504 | |
SSUBW2 (vector, 8H) 1->2 2.011207 | |
SSUBW2 (vector, 8H) 1->3 2.018348 | |
SSUBW2 (vector, 8H) throughput 1.003725 | |
SSUBW2 (vector, 4S) 1->2 2.001141 | |
SSUBW2 (vector, 4S) 1->3 2.036051 | |
SSUBW2 (vector, 4S) throughput 1.007235 | |
SSUBW2 (vector, 2D) 1->2 2.011189 | |
SSUBW2 (vector, 2D) 1->3 2.057219 | |
SSUBW2 (vector, 2D) throughput 1.008526 | |
STNP (S) throughput 1.605318 | |
STNP (D) throughput 1.584840 | |
STNP (Q) throughput 3.220081 | |
STNP (signed offset, S) throughput 1.608980 | |
STNP (signed offset, D) throughput 1.607139 | |
STNP (signed offset, Q) throughput 3.179019 | |
STP (S) throughput 1.305135 | |
STP (D) throughput 1.301307 | |
STP (Q) throughput 2.617626 | |
STP (post-index, S) 3->3 1.856977 | |
STP (post-index, S) throughput 1.386138 | |
STP (post-index, D) 3->3 2.271303 | |
STP (post-index, D) throughput 1.288280 | |
STP (post-index, Q) 3->3 2.557226 | |
STP (post-index, Q) throughput 2.665786 | |
STP (pre-index, S) 3->3 2.032385 | |
STP (pre-index, S) throughput 1.319163 | |
STP (pre-index, D) 3->3 2.160882 | |
STP (pre-index, D) throughput 1.278568 | |
STP (pre-index, Q) 3->3 2.581071 | |
STP (pre-index, Q) throughput 2.516391 | |
STP (signed offset, S) throughput 1.319398 | |
STP (signed offset, D) throughput 1.307083 | |
STP (signed offset, Q) throughput 2.514538 | |
STR (S) throughput 1.308694 | |
STR (D) throughput 1.309687 | |
STR (Q) throughput 1.308230 | |
STR (post-index, S) 3->3 1.967066 | |
STR (post-index, S) throughput 1.312436 | |
STR (post-index, D) 3->3 2.135685 | |
STR (post-index, D) throughput 1.292746 | |
STR (post-index, Q) 3->3 2.170121 | |
STR (post-index, Q) throughput 1.288302 | |
STR (pre-index, S) 3->3 3.461508 | |
STR (pre-index, S) throughput 1.326403 | |
STR (pre-index, D) 3->3 2.078380 | |
STR (pre-index, D) throughput 1.310177 | |
STR (pre-index, Q) 3->3 2.197522 | |
STR (pre-index, Q) throughput 1.310133 | |
STR (signed offset, S) throughput 1.290264 | |
STR (signed offset, D) throughput 1.306027 | |
STR (signed offset, Q) throughput 1.315530 | |
STR (register, S) throughput 1.309290 | |
STR (register, D) throughput 1.316808 | |
STR (register, Q) throughput 1.315633 | |
STR (register, uxtw, S) throughput 1.315043 | |
STR (register, uxtw, D) throughput 1.322165 | |
STR (register, uxtw, Q) throughput 1.279477 | |
STR (register, sxtw, S) throughput 1.309526 | |
STR (register, sxtw, D) throughput 1.335142 | |
STR (register, sxtw, Q) throughput 1.389897 | |
STR (register, lsl, S) throughput 1.286922 | |
STR (register, lsl, D) throughput 1.123577 | |
STR (register, lsl, Q) throughput 1.295475 | |
STUR (S) throughput 1.341832 | |
STUR (D) throughput 1.324120 | |
STUR (Q) throughput 1.287557 | |
SUB (vector, 8B) 1->2 3.787274 | |
SUB (vector, 8B) 1->3 2.112790 | |
SUB (vector, 8B) throughput 0.936980 | |
SUB (vector, 16B) 1->2 1.997610 | |
SUB (vector, 16B) 1->3 2.183081 | |
SUB (vector, 16B) throughput 1.058447 | |
SUB (vector, 4H) 1->2 2.177791 | |
SUB (vector, 4H) 1->3 1.793947 | |
SUB (vector, 4H) throughput 1.010547 | |
SUB (vector, 8H) 1->2 2.042766 | |
SUB (vector, 8H) 1->3 1.984387 | |
SUB (vector, 8H) throughput 1.011733 | |
SUB (vector, 2S) 1->2 2.002896 | |
SUB (vector, 2S) 1->3 2.030517 | |
SUB (vector, 2S) throughput 1.007648 | |
SUB (vector, 4S) 1->2 2.030794 | |
SUB (vector, 4S) 1->3 2.004010 | |
SUB (vector, 4S) throughput 1.010720 | |
SUB (vector, 2D) 1->2 2.009429 | |
SUB (vector, 2D) 1->3 2.005740 | |
SUB (vector, 2D) throughput 1.012890 | |
SUB (vector, D) 1->2 2.031210 | |
SUB (vector, D) 1->3 2.032180 | |
SUB (vector, D) throughput 1.007506 | |
SUBHN (vector, 8H) 1->2 4.096382 | |
SUBHN (vector, 8H) 1->3 3.986949 | |
SUBHN (vector, 8H) throughput 1.009415 | |
SUBHN (vector, 4S) 1->2 4.005146 | |
SUBHN (vector, 4S) 1->3 4.035747 | |
SUBHN (vector, 4S) throughput 0.986602 | |
SUBHN (vector, 2D) 1->2 4.031215 | |
SUBHN (vector, 2D) 1->3 4.025931 | |
SUBHN (vector, 2D) throughput 1.010565 | |
SUBHN2 (vector, 8H) 1->1 4.038957 | |
SUBHN2 (vector, 8H) 1->2 3.971453 | |
SUBHN2 (vector, 8H) 1->3 4.015193 | |
SUBHN2 (vector, 8H) throughput 1.002936 | |
SUBHN2 (vector, 8H) throughput 1.177842 | |
SUBHN2 (vector, 4S) 1->1 3.979886 | |
SUBHN2 (vector, 4S) 1->2 4.004277 | |
SUBHN2 (vector, 4S) 1->3 4.031952 | |
SUBHN2 (vector, 4S) throughput 1.017204 | |
SUBHN2 (vector, 4S) throughput 1.004404 | |
SUBHN2 (vector, 2D) 1->1 4.040523 | |
SUBHN2 (vector, 2D) 1->2 4.035214 | |
SUBHN2 (vector, 2D) 1->3 4.198219 | |
SUBHN2 (vector, 2D) throughput 1.007856 | |
SUBHN2 (vector, 2D) throughput 1.016118 | |
SUQADD (scalar, B) 1->1 3.021792 | |
SUQADD (scalar, B) 1->2 3.676615 | |
SUQADD (scalar, B) throughput 1.004278 | |
SUQADD (scalar, B) throughput 1.009586 | |
SUQADD (scalar, H) 1->1 3.003772 | |
SUQADD (scalar, H) 1->2 3.010316 | |
SUQADD (scalar, H) throughput 0.990544 | |
SUQADD (scalar, H) throughput 1.011364 | |
SUQADD (scalar, S) 1->1 3.005067 | |
SUQADD (scalar, S) 1->2 3.048904 | |
SUQADD (scalar, S) throughput 0.976357 | |
SUQADD (scalar, S) throughput 1.008679 | |
SUQADD (scalar, D) 1->1 3.111536 | |
SUQADD (scalar, D) 1->2 3.056644 | |
SUQADD (scalar, D) throughput 1.002018 | |
SUQADD (scalar, D) throughput 0.997049 | |
SUQADD (vector, 8B) 1->1 3.003197 | |
SUQADD (vector, 8B) 1->2 2.955460 | |
SUQADD (vector, 8B) throughput 1.006053 | |
SUQADD (vector, 8B) throughput 0.992366 | |
SUQADD (vector, 16B) 1->1 2.986354 | |
SUQADD (vector, 16B) 1->2 3.001346 | |
SUQADD (vector, 16B) throughput 1.003372 | |
SUQADD (vector, 16B) throughput 1.022567 | |
SUQADD (vector, 4H) 1->1 3.027812 | |
SUQADD (vector, 4H) 1->2 2.980867 | |
SUQADD (vector, 4H) throughput 1.012929 | |
SUQADD (vector, 4H) throughput 1.004507 | |
SUQADD (vector, 8H) 1->1 2.921773 | |
SUQADD (vector, 8H) 1->2 3.042960 | |
SUQADD (vector, 8H) throughput 1.006304 | |
SUQADD (vector, 8H) throughput 1.009438 | |
SUQADD (vector, 2S) 1->1 3.025636 | |
SUQADD (vector, 2S) 1->2 3.020711 | |
SUQADD (vector, 2S) throughput 0.998366 | |
SUQADD (vector, 2S) throughput 1.009839 | |
SUQADD (vector, 4S) 1->1 2.997149 | |
SUQADD (vector, 4S) 1->2 3.013688 | |
SUQADD (vector, 4S) throughput 1.009714 | |
SUQADD (vector, 4S) throughput 1.010211 | |
SUQADD (vector, 2D) 1->1 3.021993 | |
SUQADD (vector, 2D) 1->2 3.014762 | |
SUQADD (vector, 2D) throughput 0.996278 | |
SUQADD (vector, 2D) throughput 1.009553 | |
SXTL (8H) 1->2 2.036239 | |
SXTL (8H) throughput 1.012692 | |
SXTL (4S) 1->2 2.005374 | |
SXTL (4S) throughput 1.028773 | |
SXTL (2D) 1->2 1.958011 | |
SXTL (2D) throughput 0.973716 | |
SXTL2 (8H) 1->2 1.993839 | |
SXTL2 (8H) throughput 1.005548 | |
SXTL2 (4S) 1->2 2.008258 | |
SXTL2 (4S) throughput 0.997502 | |
SXTL2 (2D) 1->2 2.018356 | |
SXTL2 (2D) throughput 1.009324 | |
TBL (single register table, 8B) 1->2 2.014520 | |
TBL (single register table, 8B) 1->3 2.007906 | |
TBL (single register table, 8B) throughput 0.999135 | |
TBL (single register table, 16B) 1->2 2.030085 | |
TBL (single register table, 16B) 1->3 2.001338 | |
TBL (single register table, 16B) throughput 1.008466 | |
TBL (two register table, 8B) 1->2 2.087254 | |
TBL (two register table, 8B) 1->3 1.980670 | |
TBL (two register table, 8B) 1->4 2.019703 | |
TBL (two register table, 8B) throughput 0.956559 | |
TBL (two register table, 16B) 1->2 2.006641 | |
TBL (two register table, 16B) 1->3 1.975868 | |
TBL (two register table, 16B) 1->4 2.035648 | |
TBL (two register table, 16B) throughput 1.005680 | |
TBL (three register table, 8B) 1->2 3.973592 | |
TBL (three register table, 8B) 1->3 3.988427 | |
TBL (three register table, 8B) 1->4 2.121615 | |
TBL (three register table, 8B) 1->5 4.006453 | |
TBL (three register table, 8B) throughput 1.986089 | |
TBL (three register table, 16B) 1->2 4.057108 | |
TBL (three register table, 16B) 1->3 4.019171 | |
TBL (three register table, 16B) 1->4 2.015637 | |
TBL (three register table, 16B) 1->5 4.004942 | |
TBL (three register table, 16B) throughput 2.037553 | |
TBL (four register table, 8B) 1->2 4.045370 | |
TBL (four register table, 8B) 1->3 5.233667 | |
TBL (four register table, 8B) 1->4 3.505042 | |
TBL (four register table, 8B) 1->5 4.105238 | |
TBL (four register table, 8B) 1->6 4.829315 | |
TBL (four register table, 8B) throughput 3.029285 | |
TBL (four register table, 16B) 1->2 4.064715 | |
TBL (four register table, 16B) 1->3 4.091787 | |
TBL (four register table, 16B) 1->4 4.076072 | |
TBL (four register table, 16B) 1->5 4.128892 | |
TBL (four register table, 16B) 1->6 5.045918 | |
TBL (four register table, 16B) throughput 3.038301 | |
TBX (single register table, 8B) 1->1 1.988499 | |
TBX (single register table, 8B) 1->2 2.031637 | |
TBX (single register table, 8B) 1->3 1.984515 | |
TBX (single register table, 8B) throughput 1.009464 | |
TBX (single register table, 8B) throughput 0.983699 | |
TBX (single register table, 16B) 1->1 1.998443 | |
TBX (single register table, 16B) 1->2 2.084051 | |
TBX (single register table, 16B) 1->3 2.004628 | |
TBX (single register table, 16B) throughput 1.003466 | |
TBX (single register table, 16B) throughput 1.004976 | |
TBX (two register table, 8B) 1->1 3.962639 | |
TBX (two register table, 8B) 1->2 4.003470 | |
TBX (two register table, 8B) 1->3 2.023147 | |
TBX (two register table, 8B) 1->4 4.047642 | |
TBX (two register table, 8B) throughput 2.024486 | |
TBX (two register table, 16B) 1->1 4.012691 | |
TBX (two register table, 16B) 1->2 4.010335 | |
TBX (two register table, 16B) 1->3 2.037535 | |
TBX (two register table, 16B) 1->4 3.746288 | |
TBX (two register table, 16B) throughput 1.963504 | |
TBX (three register table, 8B) 1->1 6.013150 | |
TBX (three register table, 8B) 1->2 6.090373 | |
TBX (three register table, 8B) 1->3 4.041663 | |
TBX (three register table, 8B) 1->4 2.022061 | |
TBX (three register table, 8B) 1->5 6.082229 | |
TBX (three register table, 8B) throughput 3.007549 | |
TBX (three register table, 16B) 1->1 6.055121 | |
TBX (three register table, 16B) 1->2 6.042619 | |
TBX (three register table, 16B) 1->3 4.046945 | |
TBX (three register table, 16B) 1->4 2.071180 | |
TBX (three register table, 16B) 1->5 6.148657 | |
TBX (three register table, 16B) throughput 2.402872 | |
TBX (four register table, 8B) 1->1 8.209048 | |
TBX (four register table, 8B) 1->2 8.033494 | |
TBX (four register table, 8B) 1->3 6.774442 | |
TBX (four register table, 8B) 1->4 4.078132 | |
TBX (four register table, 8B) 1->5 3.038819 | |
TBX (four register table, 8B) 1->6 8.188945 | |
TBX (four register table, 8B) throughput 3.997747 | |
TBX (four register table, 16B) 1->1 8.103817 | |
TBX (four register table, 16B) 1->2 8.247256 | |
TBX (four register table, 16B) 1->3 6.067567 | |
TBX (four register table, 16B) 1->4 3.993432 | |
TBX (four register table, 16B) 1->5 3.019235 | |
TBX (four register table, 16B) 1->6 8.079493 | |
TBX (four register table, 16B) throughput 4.040168 | |
TRN1 (vector, 8B) 1->2 2.007544 | |
TRN1 (vector, 8B) 1->3 1.930856 | |
TRN1 (vector, 8B) throughput 1.043681 | |
TRN1 (vector, 16B) 1->2 2.053645 | |
TRN1 (vector, 16B) 1->3 2.002106 | |
TRN1 (vector, 16B) throughput 1.011553 | |
TRN1 (vector, 4H) 1->2 1.924824 | |
TRN1 (vector, 4H) 1->3 1.998574 | |
TRN1 (vector, 4H) throughput 1.011740 | |
TRN1 (vector, 8H) 1->2 2.006664 | |
TRN1 (vector, 8H) 1->3 2.012696 | |
TRN1 (vector, 8H) throughput 1.011721 | |
TRN1 (vector, 2S) 1->2 2.063406 | |
TRN1 (vector, 2S) 1->3 1.994130 | |
TRN1 (vector, 2S) throughput 1.007461 | |
TRN1 (vector, 4S) 1->2 2.016518 | |
TRN1 (vector, 4S) 1->3 1.986830 | |
TRN1 (vector, 4S) throughput 1.001483 | |
TRN1 (vector, 2D) 1->2 2.028750 | |
TRN1 (vector, 2D) 1->3 1.990383 | |
TRN1 (vector, 2D) throughput 1.008522 | |
TRN2 (vector, 8B) 1->2 1.996841 | |
TRN2 (vector, 8B) 1->3 2.007698 | |
TRN2 (vector, 8B) throughput 1.000368 | |
TRN2 (vector, 16B) 1->2 2.487689 | |
TRN2 (vector, 16B) 1->3 1.994349 | |
TRN2 (vector, 16B) throughput 1.011857 | |
TRN2 (vector, 4H) 1->2 2.020175 | |
TRN2 (vector, 4H) 1->3 2.012585 | |
TRN2 (vector, 4H) throughput 1.015541 | |
TRN2 (vector, 8H) 1->2 2.025428 | |
TRN2 (vector, 8H) 1->3 2.009167 | |
TRN2 (vector, 8H) throughput 0.991419 | |
TRN2 (vector, 2S) 1->2 2.049184 | |
TRN2 (vector, 2S) 1->3 1.992182 | |
TRN2 (vector, 2S) throughput 1.011293 | |
TRN2 (vector, 4S) 1->2 2.015944 | |
TRN2 (vector, 4S) 1->3 1.992119 | |
TRN2 (vector, 4S) throughput 1.010913 | |
TRN2 (vector, 2D) 1->2 2.006443 | |
TRN2 (vector, 2D) 1->3 2.003992 | |
TRN2 (vector, 2D) throughput 1.011887 | |
UABA (vector, 8B) 1->1 2.025334 | |
UABA (vector, 8B) 1->2 3.028766 | |
UABA (vector, 8B) 1->3 3.000193 | |
UABA (vector, 8B) throughput 1.013294 | |
UABA (vector, 8B) throughput 0.996226 | |
UABA (vector, 16B) 1->1 2.077354 | |
UABA (vector, 16B) 1->2 3.008709 | |
UABA (vector, 16B) 1->3 2.955273 | |
UABA (vector, 16B) throughput 1.005379 | |
UABA (vector, 16B) throughput 1.009153 | |
UABA (vector, 4H) 1->1 1.849448 | |
UABA (vector, 4H) 1->2 3.031629 | |
UABA (vector, 4H) 1->3 3.028632 | |
UABA (vector, 4H) throughput 1.004494 | |
UABA (vector, 4H) throughput 0.969859 | |
UABA (vector, 8H) 1->1 1.999311 | |
UABA (vector, 8H) 1->2 2.994042 | |
UABA (vector, 8H) 1->3 3.039247 | |
UABA (vector, 8H) throughput 1.004616 | |
UABA (vector, 8H) throughput 1.007720 | |
UABA (vector, 2S) 1->1 2.029559 | |
UABA (vector, 2S) 1->2 3.000395 | |
UABA (vector, 2S) 1->3 3.034126 | |
UABA (vector, 2S) throughput 0.995009 | |
UABA (vector, 2S) throughput 1.017373 | |
UABA (vector, 4S) 1->1 2.025555 | |
UABA (vector, 4S) 1->2 3.031482 | |
UABA (vector, 4S) 1->3 3.030172 | |
UABA (vector, 4S) throughput 1.010970 | |
UABA (vector, 4S) throughput 1.000708 | |
UABAL (vector, 8H) 1->1 2.001469 | |
UABAL (vector, 8H) 1->2 2.990790 | |
UABAL (vector, 8H) 1->3 3.023720 | |
UABAL (vector, 8H) throughput 0.998667 | |
UABAL (vector, 8H) throughput 1.257772 | |
UABAL (vector, 4S) 1->1 1.978897 | |
UABAL (vector, 4S) 1->2 3.118654 | |
UABAL (vector, 4S) 1->3 3.038871 | |
UABAL (vector, 4S) throughput 1.015919 | |
UABAL (vector, 4S) throughput 0.994891 | |
UABAL (vector, 2D) 1->1 1.998245 | |
UABAL (vector, 2D) 1->2 3.034318 | |
UABAL (vector, 2D) 1->3 3.050697 | |
UABAL (vector, 2D) throughput 1.007248 | |
UABAL (vector, 2D) throughput 0.993672 | |
UABAL2 (vector, 8H) 1->1 1.980325 | |
UABAL2 (vector, 8H) 1->2 2.982690 | |
UABAL2 (vector, 8H) 1->3 3.040837 | |
UABAL2 (vector, 8H) throughput 1.008014 | |
UABAL2 (vector, 8H) throughput 0.997037 | |
UABAL2 (vector, 4S) 1->1 2.102074 | |
UABAL2 (vector, 4S) 1->2 3.008445 | |
UABAL2 (vector, 4S) 1->3 3.002530 | |
UABAL2 (vector, 4S) throughput 0.991396 | |
UABAL2 (vector, 4S) throughput 1.016143 | |
UABAL2 (vector, 2D) 1->1 1.988253 | |
UABAL2 (vector, 2D) 1->2 2.991552 | |
UABAL2 (vector, 2D) 1->3 2.993824 | |
UABAL2 (vector, 2D) throughput 1.003021 | |
UABAL2 (vector, 2D) throughput 1.012586 | |
UABD (vector, 8B) 1->2 2.991651 | |
UABD (vector, 8B) 1->3 3.030113 | |
UABD (vector, 8B) throughput 1.009928 | |
UABD (vector, 16B) 1->2 3.043518 | |
UABD (vector, 16B) 1->3 3.504196 | |
UABD (vector, 16B) throughput 0.985546 | |
UABD (vector, 4H) 1->2 2.996623 | |
UABD (vector, 4H) 1->3 3.005343 | |
UABD (vector, 4H) throughput 1.008001 | |
UABD (vector, 8H) 1->2 3.022037 | |
UABD (vector, 8H) 1->3 2.978581 | |
UABD (vector, 8H) throughput 1.007682 | |
UABD (vector, 2S) 1->2 2.972213 | |
UABD (vector, 2S) 1->3 3.015398 | |
UABD (vector, 2S) throughput 1.007016 | |
UABD (vector, 4S) 1->2 3.072031 | |
UABD (vector, 4S) 1->3 3.033995 | |
UABD (vector, 4S) throughput 1.011398 | |
UABDL (vector, 8H) 1->2 3.000525 | |
UABDL (vector, 8H) 1->3 2.989370 | |
UABDL (vector, 8H) throughput 0.964277 | |
UABDL (vector, 4S) 1->2 3.030226 | |
UABDL (vector, 4S) 1->3 3.009964 | |
UABDL (vector, 4S) throughput 1.006557 | |
UABDL (vector, 2D) 1->2 3.026273 | |
UABDL (vector, 2D) 1->3 2.976180 | |
UABDL (vector, 2D) throughput 1.010979 | |
UABDL2 (vector, 8H) 1->2 3.021883 | |
UABDL2 (vector, 8H) 1->3 3.008730 | |
UABDL2 (vector, 8H) throughput 1.005069 | |
UABDL2 (vector, 4S) 1->2 3.023556 | |
UABDL2 (vector, 4S) 1->3 2.995619 | |
UABDL2 (vector, 4S) throughput 1.006814 | |
UABDL2 (vector, 2D) 1->2 2.994151 | |
UABDL2 (vector, 2D) 1->3 3.079710 | |
UABDL2 (vector, 2D) throughput 1.009071 | |
UADALP (vector, 4H) 1->1 1.994255 | |
UADALP (vector, 4H) 1->2 2.992615 | |
UADALP (vector, 4H) throughput 0.992759 | |
UADALP (vector, 4H) throughput 1.009698 | |
UADALP (vector, 8H) 1->1 2.021764 | |
UADALP (vector, 8H) 1->2 3.030940 | |
UADALP (vector, 8H) throughput 1.003931 | |
UADALP (vector, 8H) throughput 1.006304 | |
UADALP (vector, 2S) 1->1 2.002831 | |
UADALP (vector, 2S) 1->2 3.130221 | |
UADALP (vector, 2S) throughput 1.006881 | |
UADALP (vector, 2S) throughput 1.031648 | |
UADALP (vector, 4S) 1->1 1.963902 | |
UADALP (vector, 4S) 1->2 3.071909 | |
UADALP (vector, 4S) throughput 0.982427 | |
UADALP (vector, 4S) throughput 0.994983 | |
UADALP (vector, 1D) 1->1 1.998574 | |
UADALP (vector, 1D) 1->2 3.006449 | |
UADALP (vector, 1D) throughput 1.002862 | |
UADALP (vector, 1D) throughput 1.004361 | |
UADALP (vector, 2D) 1->1 2.005539 | |
UADALP (vector, 2D) 1->2 3.008710 | |
UADALP (vector, 2D) throughput 1.031721 | |
UADALP (vector, 2D) throughput 1.000400 | |
UADDL (vector, 8H) 1->2 1.994502 | |
UADDL (vector, 8H) 1->3 2.001535 | |
UADDL (vector, 8H) throughput 1.011830 | |
UADDL (vector, 4S) 1->2 2.013393 | |
UADDL (vector, 4S) 1->3 1.988295 | |
UADDL (vector, 4S) throughput 1.003737 | |
UADDL (vector, 2D) 1->2 1.993236 | |
UADDL (vector, 2D) 1->3 2.001229 | |
UADDL (vector, 2D) throughput 1.011995 | |
UADDL2 (vector, 8H) 1->2 1.936694 | |
UADDL2 (vector, 8H) 1->3 2.006554 | |
UADDL2 (vector, 8H) throughput 1.010324 | |
UADDL2 (vector, 4S) 1->2 1.987523 | |
UADDL2 (vector, 4S) 1->3 2.016340 | |
UADDL2 (vector, 4S) throughput 0.997261 | |
UADDL2 (vector, 2D) 1->2 2.009808 | |
UADDL2 (vector, 2D) 1->3 1.819076 | |
UADDL2 (vector, 2D) throughput 1.010621 | |
UADDLP (vector, 4H) 1->2 1.998728 | |
UADDLP (vector, 4H) throughput 0.984378 | |
UADDLP (vector, 8H) 1->2 2.017955 | |
UADDLP (vector, 8H) throughput 1.010852 | |
UADDLP (vector, 2S) 1->2 2.053370 | |
UADDLP (vector, 2S) throughput 1.006622 | |
UADDLP (vector, 4S) 1->2 2.022849 | |
UADDLP (vector, 4S) throughput 1.006401 | |
UADDLP (vector, 1D) 1->2 2.000695 | |
UADDLP (vector, 1D) throughput 0.996908 | |
UADDLP (vector, 2D) 1->2 1.998723 | |
UADDLP (vector, 2D) throughput 1.007794 | |
UADDLV (vector, 8B) 1->2 2.991642 | |
UADDLV (vector, 8B) throughput 1.011196 | |
UADDLV (vector, 16B) 1->2 3.114601 | |
UADDLV (vector, 16B) throughput 0.986153 | |
UADDLV (vector, 4H) 1->2 2.961399 | |
UADDLV (vector, 4H) throughput 1.022958 | |
UADDLV (vector, 8H) 1->2 3.037784 | |
UADDLV (vector, 8H) throughput 1.003171 | |
UADDLV (vector, 4S) 1->2 2.994788 | |
UADDLV (vector, 4S) throughput 1.012952 | |
UADDW (vector, 8H) 1->2 2.000942 | |
UADDW (vector, 8H) 1->3 2.033173 | |
UADDW (vector, 8H) throughput 1.016678 | |
UADDW (vector, 4S) 1->2 2.016507 | |
UADDW (vector, 4S) 1->3 2.024363 | |
UADDW (vector, 4S) throughput 1.000960 | |
UADDW (vector, 2D) 1->2 2.024504 | |
UADDW (vector, 2D) 1->3 2.083938 | |
UADDW (vector, 2D) throughput 1.008246 | |
UADDW2 (vector, 8H) 1->2 2.025464 | |
UADDW2 (vector, 8H) 1->3 1.993823 | |
UADDW2 (vector, 8H) throughput 1.006465 | |
UADDW2 (vector, 4S) 1->2 2.012603 | |
UADDW2 (vector, 4S) 1->3 2.019359 | |
UADDW2 (vector, 4S) throughput 1.009338 | |
UADDW2 (vector, 2D) 1->2 1.997301 | |
UADDW2 (vector, 2D) 1->3 2.023038 | |
UADDW2 (vector, 2D) throughput 1.012700 | |
UCVTF (scalar, fixed-point, S from 32-bit GPR) 1->2 roundtrip 10.167072 | |
UCVTF (scalar, fixed-point, S from 32-bit GPR) throughput 1.056138 | |
UCVTF (scalar, fixed-point, D from 32-bit GPR) 1->2 roundtrip 10.126884 | |
UCVTF (scalar, fixed-point, D from 32-bit GPR) throughput 1.016231 | |
UCVTF (scalar, fixed-point, S from 64-bit GPR) 1->2 roundtrip 11.843969 | |
UCVTF (scalar, fixed-point, S from 64-bit GPR) throughput 1.015234 | |
UCVTF (scalar, fixed-point, D from 64-bit GPR) 1->2 roundtrip 10.087749 | |
UCVTF (scalar, fixed-point, D from 64-bit GPR) throughput 1.008776 | |
UCVTF (scalar, integer, S from 32-bit GPR) 1->2 roundtrip 10.122001 | |
UCVTF (scalar, integer, S from 32-bit GPR) throughput 1.161307 | |
UCVTF (scalar, integer, D from 32-bit GPR) 1->2 roundtrip 10.032361 | |
UCVTF (scalar, integer, D from 32-bit GPR) throughput 1.001224 | |
UCVTF (scalar, integer, S from 64-bit GPR) 1->2 roundtrip 10.067420 | |
UCVTF (scalar, integer, S from 64-bit GPR) throughput 1.022808 | |
UCVTF (scalar, integer, D from 64-bit GPR) 1->2 roundtrip 9.668760 | |
UCVTF (scalar, integer, D from 64-bit GPR) throughput 1.012275 | |
UCVTF (scalar, fixed-point, S from S) 1->2 3.014423 | |
UCVTF (scalar, fixed-point, S from S) throughput 1.012312 | |
UCVTF (scalar, fixed-point, D from D) 1->2 3.025641 | |
UCVTF (scalar, fixed-point, D from D) throughput 0.988661 | |
UCVTF (vector, fixed-point, 2S from 2S) 1->2 2.998245 | |
UCVTF (vector, fixed-point, 2S from 2S) throughput 1.006363 | |
UCVTF (vector, fixed-point, 4S from 4S) 1->2 3.080194 | |
UCVTF (vector, fixed-point, 4S from 4S) throughput 1.018183 | |
UCVTF (vector, fixed-point, 2D from 2D) 1->2 3.057292 | |
UCVTF (vector, fixed-point, 2D from 2D) throughput 1.010477 | |
UCVTF (scalar, integer, S from S) 1->2 3.006262 | |
UCVTF (scalar, integer, S from S) throughput 1.032024 | |
UCVTF (scalar, integer, D from D) 1->2 3.008126 | |
UCVTF (scalar, integer, D from D) throughput 1.003946 | |
UCVTF (vector, integer, 2S from 2S) 1->2 2.962326 | |
UCVTF (vector, integer, 2S from 2S) throughput 1.003804 | |
UCVTF (vector, integer, 4S from 4S) 1->2 3.054815 | |
UCVTF (vector, integer, 4S from 4S) throughput 1.000136 | |
UCVTF (vector, integer, 2D from 2D) 1->2 3.028542 | |
UCVTF (vector, integer, 2D from 2D) throughput 1.016484 | |
UHADD (vector, 8B) 1->2 2.009030 | |
UHADD (vector, 8B) 1->3 2.033149 | |
UHADD (vector, 8B) throughput 1.004222 | |
UHADD (vector, 16B) 1->2 2.033261 | |
UHADD (vector, 16B) 1->3 2.001116 | |
UHADD (vector, 16B) throughput 0.999159 | |
UHADD (vector, 4H) 1->2 2.068931 | |
UHADD (vector, 4H) 1->3 2.025008 | |
UHADD (vector, 4H) throughput 1.013447 | |
UHADD (vector, 8H) 1->2 1.970305 | |
UHADD (vector, 8H) 1->3 2.008202 | |
UHADD (vector, 8H) throughput 1.013495 | |
UHADD (vector, 2S) 1->2 1.974647 | |
UHADD (vector, 2S) 1->3 2.009359 | |
UHADD (vector, 2S) throughput 0.990893 | |
UHADD (vector, 4S) 1->2 1.960995 | |
UHADD (vector, 4S) 1->3 2.002873 | |
UHADD (vector, 4S) throughput 0.988823 | |
UHSUB (vector, 8B) 1->2 1.996995 | |
UHSUB (vector, 8B) 1->3 1.922087 | |
UHSUB (vector, 8B) throughput 1.017265 | |
UHSUB (vector, 16B) 1->2 2.012182 | |
UHSUB (vector, 16B) 1->3 1.932348 | |
UHSUB (vector, 16B) throughput 1.002147 | |
UHSUB (vector, 4H) 1->2 2.019808 | |
UHSUB (vector, 4H) 1->3 2.016630 | |
UHSUB (vector, 4H) throughput 0.983520 | |
UHSUB (vector, 8H) 1->2 1.997193 | |
UHSUB (vector, 8H) 1->3 2.016865 | |
UHSUB (vector, 8H) throughput 0.995683 | |
UHSUB (vector, 2S) 1->2 2.039859 | |
UHSUB (vector, 2S) 1->3 1.974754 | |
UHSUB (vector, 2S) throughput 1.015508 | |
UHSUB (vector, 4S) 1->2 2.003210 | |
UHSUB (vector, 4S) 1->3 2.002828 | |
UHSUB (vector, 4S) throughput 0.996355 | |
UMAX (vector, 8B) 1->2 3.031515 | |
UMAX (vector, 8B) 1->3 3.032825 | |
UMAX (vector, 8B) throughput 1.024084 | |
UMAX (vector, 16B) 1->2 3.037696 | |
UMAX (vector, 16B) 1->3 3.072905 | |
UMAX (vector, 16B) throughput 1.043218 | |
UMAX (vector, 4H) 1->2 3.012145 | |
UMAX (vector, 4H) 1->3 3.025105 | |
UMAX (vector, 4H) throughput 0.999206 | |
UMAX (vector, 8H) 1->2 2.905015 | |
UMAX (vector, 8H) 1->3 3.030832 | |
UMAX (vector, 8H) throughput 1.005483 | |
UMAX (vector, 2S) 1->2 3.003422 | |
UMAX (vector, 2S) 1->3 3.039803 | |
UMAX (vector, 2S) throughput 1.012589 | |
UMAX (vector, 4S) 1->2 3.041508 | |
UMAX (vector, 4S) 1->3 2.887797 | |
UMAX (vector, 4S) throughput 1.002913 | |
UMAXP (vector, 8B) 1->2 3.067574 | |
UMAXP (vector, 8B) 1->3 3.008709 | |
UMAXP (vector, 8B) throughput 1.010730 | |
UMAXP (vector, 16B) 1->2 3.030111 | |
UMAXP (vector, 16B) 1->3 3.336696 | |
UMAXP (vector, 16B) throughput 1.000513 | |
UMAXP (vector, 4H) 1->2 3.007833 | |
UMAXP (vector, 4H) 1->3 3.027140 | |
UMAXP (vector, 4H) throughput 1.020824 | |
UMAXP (vector, 8H) 1->2 3.020410 | |
UMAXP (vector, 8H) 1->3 3.011385 | |
UMAXP (vector, 8H) throughput 1.012852 | |
UMAXP (vector, 2S) 1->2 3.022850 | |
UMAXP (vector, 2S) 1->3 2.963282 | |
UMAXP (vector, 2S) throughput 1.004999 | |
UMAXP (vector, 4S) 1->2 3.034874 | |
UMAXP (vector, 4S) 1->3 3.023905 | |
UMAXP (vector, 4S) throughput 0.995294 | |
UMAXV (vector, 8B) 1->2 2.997126 | |
UMAXV (vector, 8B) throughput 1.011659 | |
UMAXV (vector, 16B) 1->2 3.091381 | |
UMAXV (vector, 16B) throughput 1.019404 | |
UMAXV (vector, 4H) 1->2 3.089994 | |
UMAXV (vector, 4H) throughput 1.000755 | |
UMAXV (vector, 8H) 1->2 2.998202 | |
UMAXV (vector, 8H) throughput 0.994510 | |
UMAXV (vector, 4S) 1->2 3.007655 | |
UMAXV (vector, 4S) throughput 1.007664 | |
UMIN (vector, 8B) 1->2 3.001753 | |
UMIN (vector, 8B) 1->3 2.996605 | |
UMIN (vector, 8B) throughput 0.991748 | |
UMIN (vector, 16B) 1->2 2.998005 | |
UMIN (vector, 16B) 1->3 3.055716 | |
UMIN (vector, 16B) throughput 1.005181 | |
UMIN (vector, 4H) 1->2 2.899232 | |
UMIN (vector, 4H) 1->3 3.004140 | |
UMIN (vector, 4H) throughput 1.006132 | |
UMIN (vector, 8H) 1->2 3.026972 | |
UMIN (vector, 8H) 1->3 2.998158 | |
UMIN (vector, 8H) throughput 1.006189 | |
UMIN (vector, 2S) 1->2 3.088714 | |
UMIN (vector, 2S) 1->3 3.034049 | |
UMIN (vector, 2S) throughput 1.010415 | |
UMIN (vector, 4S) 1->2 3.085099 | |
UMIN (vector, 4S) 1->3 2.993523 | |
UMIN (vector, 4S) throughput 0.988339 | |
UMINP (vector, 8B) 1->2 3.097436 | |
UMINP (vector, 8B) 1->3 3.040114 | |
UMINP (vector, 8B) throughput 1.003416 | |
UMINP (vector, 16B) 1->2 3.019396 | |
UMINP (vector, 16B) 1->3 3.013622 | |
UMINP (vector, 16B) throughput 1.026317 | |
UMINP (vector, 4H) 1->2 3.086303 | |
UMINP (vector, 4H) 1->3 3.086346 | |
UMINP (vector, 4H) throughput 1.014763 | |
UMINP (vector, 8H) 1->2 3.014796 | |
UMINP (vector, 8H) 1->3 2.997784 | |
UMINP (vector, 8H) throughput 1.010398 | |
UMINP (vector, 2S) 1->2 3.037185 | |
UMINP (vector, 2S) 1->3 2.987227 | |
UMINP (vector, 2S) throughput 1.006856 | |
UMINP (vector, 4S) 1->2 3.046842 | |
UMINP (vector, 4S) 1->3 3.048700 | |
UMINP (vector, 4S) throughput 1.017885 | |
UMINV (vector, 8B) 1->2 2.967196 | |
UMINV (vector, 8B) throughput 1.050506 | |
UMINV (vector, 16B) 1->2 3.008446 | |
UMINV (vector, 16B) throughput 0.949252 | |
UMINV (vector, 4H) 1->2 2.995852 | |
UMINV (vector, 4H) throughput 0.983359 | |
UMINV (vector, 8H) 1->2 3.039547 | |
UMINV (vector, 8H) throughput 1.010157 | |
UMINV (vector, 4S) 1->2 3.050322 | |
UMINV (vector, 4S) throughput 1.007656 | |
UMLAL (by element, 4S) 1->1 1.998467 | |
UMLAL (by element, 4S) 1->2 3.003487 | |
UMLAL (by element, 4S) 1->3 2.974830 | |
UMLAL (by element, 4S) throughput 1.007085 | |
UMLAL (by element, 4S) throughput 1.009129 | |
UMLAL (by element, 2D) 1->1 1.928287 | |
UMLAL (by element, 2D) 1->2 3.105260 | |
UMLAL (by element, 2D) 1->3 3.007597 | |
UMLAL (by element, 2D) throughput 1.010491 | |
UMLAL (by element, 2D) throughput 1.008750 | |
UMLAL2 (by element, 4S) 1->1 2.029532 | |
UMLAL2 (by element, 4S) 1->2 3.024084 | |
UMLAL2 (by element, 4S) 1->3 2.999294 | |
UMLAL2 (by element, 4S) throughput 1.010988 | |
UMLAL2 (by element, 4S) throughput 1.004247 | |
UMLAL2 (by element, 2D) 1->1 2.038134 | |
UMLAL2 (by element, 2D) 1->2 3.014036 | |
UMLAL2 (by element, 2D) 1->3 3.012720 | |
UMLAL2 (by element, 2D) throughput 1.004774 | |
UMLAL2 (by element, 2D) throughput 1.005051 | |
UMLAL (vector, 8H) 1->1 2.020636 | |
UMLAL (vector, 8H) 1->2 3.017248 | |
UMLAL (vector, 8H) 1->3 3.008361 | |
UMLAL (vector, 8H) throughput 1.007186 | |
UMLAL (vector, 8H) throughput 1.010548 | |
UMLAL (vector, 4S) 1->1 1.988178 | |
UMLAL (vector, 4S) 1->2 3.057062 | |
UMLAL (vector, 4S) 1->3 3.010645 | |
UMLAL (vector, 4S) throughput 1.003670 | |
UMLAL (vector, 4S) throughput 1.006535 | |
UMLAL (vector, 2D) 1->1 2.017056 | |
UMLAL (vector, 2D) 1->2 3.014316 | |
UMLAL (vector, 2D) 1->3 3.048868 | |
UMLAL (vector, 2D) throughput 1.006027 | |
UMLAL (vector, 2D) throughput 0.984009 | |
UMLAL2 (vector, 8H) 1->1 2.045406 | |
UMLAL2 (vector, 8H) 1->2 3.027867 | |
UMLAL2 (vector, 8H) 1->3 3.076053 | |
UMLAL2 (vector, 8H) throughput 0.926665 | |
UMLAL2 (vector, 8H) throughput 1.009435 | |
UMLAL2 (vector, 4S) 1->1 2.049499 | |
UMLAL2 (vector, 4S) 1->2 3.059788 | |
UMLAL2 (vector, 4S) 1->3 3.766172 | |
UMLAL2 (vector, 4S) throughput 1.006591 | |
UMLAL2 (vector, 4S) throughput 1.010946 | |
UMLAL2 (vector, 2D) 1->1 1.940643 | |
UMLAL2 (vector, 2D) 1->2 3.010820 | |
UMLAL2 (vector, 2D) 1->3 3.010540 | |
UMLAL2 (vector, 2D) throughput 1.002830 | |
UMLAL2 (vector, 2D) throughput 0.996555 | |
UMLSL (by element, 4S) 1->1 2.063145 | |
UMLSL (by element, 4S) 1->2 3.039866 | |
UMLSL (by element, 4S) 1->3 3.019272 | |
UMLSL (by element, 4S) throughput 0.998752 | |
UMLSL (by element, 4S) throughput 1.009123 | |
UMLSL (by element, 2D) 1->1 2.017440 | |
UMLSL (by element, 2D) 1->2 3.091270 | |
UMLSL (by element, 2D) 1->3 3.015936 | |
UMLSL (by element, 2D) throughput 1.007633 | |
UMLSL (by element, 2D) throughput 1.000212 | |
UMLSL2 (by element, 4S) 1->1 2.008010 | |
UMLSL2 (by element, 4S) 1->2 2.996190 | |
UMLSL2 (by element, 4S) 1->3 3.069508 | |
UMLSL2 (by element, 4S) throughput 1.010648 | |
UMLSL2 (by element, 4S) throughput 1.010992 | |
UMLSL2 (by element, 2D) 1->1 2.052582 | |
UMLSL2 (by element, 2D) 1->2 2.999543 | |
UMLSL2 (by element, 2D) 1->3 3.031053 | |
UMLSL2 (by element, 2D) throughput 0.972927 | |
UMLSL2 (by element, 2D) throughput 1.008893 | |
UMLSL (vector, 8H) 1->1 1.998399 | |
UMLSL (vector, 8H) 1->2 2.967244 | |
UMLSL (vector, 8H) 1->3 3.019974 | |
UMLSL (vector, 8H) throughput 1.022772 | |
UMLSL (vector, 8H) throughput 0.998327 | |
UMLSL (vector, 4S) 1->1 2.010600 | |
UMLSL (vector, 4S) 1->2 3.001144 | |
UMLSL (vector, 4S) 1->3 3.028859 | |
UMLSL (vector, 4S) throughput 1.008636 | |
UMLSL (vector, 4S) throughput 1.005847 | |
UMLSL (vector, 2D) 1->1 2.012623 | |
UMLSL (vector, 2D) 1->2 3.033399 | |
UMLSL (vector, 2D) 1->3 3.028052 | |
UMLSL (vector, 2D) throughput 1.015179 | |
UMLSL (vector, 2D) throughput 1.007337 | |
UMLSL2 (vector, 8H) 1->1 2.008621 | |
UMLSL2 (vector, 8H) 1->2 3.027545 | |
UMLSL2 (vector, 8H) 1->3 3.075542 | |
UMLSL2 (vector, 8H) throughput 0.985468 | |
UMLSL2 (vector, 8H) throughput 1.012031 | |
UMLSL2 (vector, 4S) 1->1 2.020032 | |
UMLSL2 (vector, 4S) 1->2 2.967901 | |
UMLSL2 (vector, 4S) 1->3 3.013099 | |
UMLSL2 (vector, 4S) throughput 1.008094 | |
UMLSL2 (vector, 4S) throughput 0.954831 | |
UMLSL2 (vector, 2D) 1->1 1.994480 | |
UMLSL2 (vector, 2D) 1->2 3.013142 | |
UMLSL2 (vector, 2D) 1->3 2.985072 | |
UMLSL2 (vector, 2D) throughput 1.009912 | |
UMLSL2 (vector, 2D) throughput 1.022621 | |
UMULL (by element, 4S) 1->2 3.027131 | |
UMULL (by element, 4S) 1->3 2.998051 | |
UMULL (by element, 4S) throughput 1.005675 | |
UMULL (by element, 2D) 1->2 3.022762 | |
UMULL (by element, 2D) 1->3 3.015626 | |
UMULL (by element, 2D) throughput 1.005252 | |
UMULL2 (by element, 4S) 1->2 3.009984 | |
UMULL2 (by element, 4S) 1->3 3.001293 | |
UMULL2 (by element, 4S) throughput 1.007478 | |
UMULL2 (by element, 2D) 1->2 3.010865 | |
UMULL2 (by element, 2D) 1->3 3.069726 | |
UMULL2 (by element, 2D) throughput 1.013972 | |
UMULL (vector, 8H) 1->2 3.074791 | |
UMULL (vector, 8H) 1->3 3.008818 | |
UMULL (vector, 8H) throughput 0.985439 | |
UMULL (vector, 4S) 1->2 3.006264 | |
UMULL (vector, 4S) 1->3 3.038752 | |
UMULL (vector, 4S) throughput 0.991156 | |
UMULL (vector, 2D) 1->2 3.028496 | |
UMULL (vector, 2D) 1->3 3.032567 | |
UMULL (vector, 2D) throughput 1.003612 | |
UMULL2 (vector, 8H) 1->2 3.024235 | |
UMULL2 (vector, 8H) 1->3 3.002347 | |
UMULL2 (vector, 8H) throughput 1.004558 | |
UMULL2 (vector, 4S) 1->2 3.040400 | |
UMULL2 (vector, 4S) 1->3 2.965595 | |
UMULL2 (vector, 4S) throughput 1.010069 | |
UMULL2 (vector, 2D) 1->2 3.042244 | |
UMULL2 (vector, 2D) 1->3 2.997939 | |
UMULL2 (vector, 2D) throughput 1.003202 | |
UQADD (scalar, B) 1->2 3.017723 | |
UQADD (scalar, B) 1->3 3.510872 | |
UQADD (scalar, B) throughput 0.988537 | |
UQADD (scalar, H) 1->2 3.064452 | |
UQADD (scalar, H) 1->3 3.032481 | |
UQADD (scalar, H) throughput 1.000646 | |
UQADD (scalar, S) 1->2 3.010050 | |
UQADD (scalar, S) 1->3 3.021750 | |
UQADD (scalar, S) throughput 1.012607 | |
UQADD (scalar, D) 1->2 2.984389 | |
UQADD (scalar, D) 1->3 2.992490 | |
UQADD (scalar, D) throughput 1.011660 | |
UQADD (vector, 8B) 1->2 3.019990 | |
UQADD (vector, 8B) 1->3 3.007503 | |
UQADD (vector, 8B) throughput 1.004246 | |
UQADD (vector, 16B) 1->2 3.037617 | |
UQADD (vector, 16B) 1->3 3.032984 | |
UQADD (vector, 16B) throughput 1.000465 | |
UQADD (vector, 4H) 1->2 3.070741 | |
UQADD (vector, 4H) 1->3 3.060261 | |
UQADD (vector, 4H) throughput 1.007271 | |
UQADD (vector, 8H) 1->2 3.008059 | |
UQADD (vector, 8H) 1->3 3.022693 | |
UQADD (vector, 8H) throughput 1.009694 | |
UQADD (vector, 2S) 1->2 2.988833 | |
UQADD (vector, 2S) 1->3 3.004143 | |
UQADD (vector, 2S) throughput 1.011703 | |
UQADD (vector, 4S) 1->2 3.011646 | |
UQADD (vector, 4S) 1->3 2.967322 | |
UQADD (vector, 4S) throughput 1.011497 | |
UQADD (vector, 2D) 1->2 3.018596 | |
UQADD (vector, 2D) 1->3 3.040144 | |
UQADD (vector, 2D) throughput 0.990765 | |
UQRSHL (scalar, B) 1->2 3.010950 | |
UQRSHL (scalar, B) 1->3 3.074329 | |
UQRSHL (scalar, B) throughput 0.991977 | |
UQRSHL (scalar, H) 1->2 3.018930 | |
UQRSHL (scalar, H) 1->3 3.077714 | |
UQRSHL (scalar, H) throughput 1.008894 | |
UQRSHL (scalar, S) 1->2 2.983085 | |
UQRSHL (scalar, S) 1->3 3.016295 | |
UQRSHL (scalar, S) throughput 1.011751 | |
UQRSHL (scalar, D) 1->2 3.030985 | |
UQRSHL (scalar, D) 1->3 2.992553 | |
UQRSHL (scalar, D) throughput 1.007919 | |
UQRSHL (vector, 8B) 1->2 3.053187 | |
UQRSHL (vector, 8B) 1->3 3.008179 | |
UQRSHL (vector, 8B) throughput 1.014104 | |
UQRSHL (vector, 16B) 1->2 3.068276 | |
UQRSHL (vector, 16B) 1->3 3.072716 | |
UQRSHL (vector, 16B) throughput 0.989206 | |
UQRSHL (vector, 4H) 1->2 3.009084 | |
UQRSHL (vector, 4H) 1->3 3.026347 | |
UQRSHL (vector, 4H) throughput 1.012484 | |
UQRSHL (vector, 8H) 1->2 3.004299 | |
UQRSHL (vector, 8H) 1->3 2.997744 | |
UQRSHL (vector, 8H) throughput 1.000080 | |
UQRSHL (vector, 2S) 1->2 3.132499 | |
UQRSHL (vector, 2S) 1->3 3.396190 | |
UQRSHL (vector, 2S) throughput 1.031019 | |
UQRSHL (vector, 4S) 1->2 3.009965 | |
UQRSHL (vector, 4S) 1->3 3.020303 | |
UQRSHL (vector, 4S) throughput 1.005361 | |
UQRSHL (vector, 2D) 1->2 3.011075 | |
UQRSHL (vector, 2D) 1->3 3.023453 | |
UQRSHL (vector, 2D) throughput 1.000492 | |
UQRSHRN (H) 1->2 3.983331 | |
UQRSHRN (H) throughput 1.096999 | |
UQRSHRN (S) 1->2 4.336671 | |
UQRSHRN (S) throughput 1.009975 | |
UQRSHRN (D) 1->2 4.033624 | |
UQRSHRN (D) throughput 1.007755 | |
UQRSHRN (8H) 1->2 4.078490 | |
UQRSHRN (8H) throughput 1.016285 | |
UQRSHRN (4S) 1->2 3.994370 | |
UQRSHRN (4S) throughput 0.998247 | |
UQRSHRN (2D) 1->2 4.055310 | |
UQRSHRN (2D) throughput 1.051889 | |
UQRSHRN2 (8H) 1->1 5.284969 | |
UQRSHRN2 (8H) 1->2 3.946257 | |
UQRSHRN2 (8H) throughput 0.775218 | |
UQRSHRN2 (8H) throughput 1.037457 | |
UQRSHRN2 (4S) 1->1 4.018465 | |
UQRSHRN2 (4S) 1->2 4.001314 | |
UQRSHRN2 (4S) throughput 1.008083 | |
UQRSHRN2 (4S) throughput 0.987702 | |
UQRSHRN2 (2D) 1->1 4.067674 | |
UQRSHRN2 (2D) 1->2 4.034710 | |
UQRSHRN2 (2D) throughput 1.006705 | |
UQRSHRN2 (2D) throughput 1.011347 | |
UQSHL (immediate, scalar, B) 1->2 1.973461 | |
UQSHL (immediate, scalar, B) throughput 1.002886 | |
UQSHL (immediate, scalar, H) 1->2 2.152295 | |
UQSHL (immediate, scalar, H) throughput 1.017848 | |
UQSHL (immediate, scalar, S) 1->2 2.060653 | |
UQSHL (immediate, scalar, S) throughput 0.989302 | |
UQSHL (immediate, scalar, D) 1->2 1.999935 | |
UQSHL (immediate, scalar, D) throughput 1.002383 | |
UQSHL (immediate, vector, 8B) 1->2 1.981970 | |
UQSHL (immediate, vector, 8B) throughput 1.004521 | |
UQSHL (immediate, vector, 16B) 1->2 2.008682 | |
UQSHL (immediate, vector, 16B) throughput 0.994756 | |
UQSHL (immediate, vector, 4H) 1->2 1.998398 | |
UQSHL (immediate, vector, 4H) throughput 1.006707 | |
UQSHL (immediate, vector, 8H) 1->2 2.012395 | |
UQSHL (immediate, vector, 8H) throughput 1.008146 | |
UQSHL (immediate, vector, 2S) 1->2 1.999430 | |
UQSHL (immediate, vector, 2S) throughput 1.019712 | |
UQSHL (immediate, vector, 4S) 1->2 1.966046 | |
UQSHL (immediate, vector, 4S) throughput 1.005506 | |
UQSHL (immediate, vector, 2D) 1->2 1.996747 | |
UQSHL (immediate, vector, 2D) throughput 1.009917 | |
UQSHL (scalar, B) 1->2 1.994056 | |
UQSHL (scalar, B) 1->3 2.039280 | |
UQSHL (scalar, B) throughput 0.994195 | |
UQSHL (scalar, H) 1->2 2.037171 | |
UQSHL (scalar, H) 1->3 2.006466 | |
UQSHL (scalar, H) throughput 1.010734 | |
UQSHL (scalar, S) 1->2 1.999614 | |
UQSHL (scalar, S) 1->3 1.998333 | |
UQSHL (scalar, S) throughput 1.016805 | |
UQSHL (scalar, D) 1->2 2.044420 | |
UQSHL (scalar, D) 1->3 1.992309 | |
UQSHL (scalar, D) throughput 1.013328 | |
UQSHL (vector, 8B) 1->2 1.993776 | |
UQSHL (vector, 8B) 1->3 2.002620 | |
UQSHL (vector, 8B) throughput 0.989713 | |
UQSHL (vector, 16B) 1->2 2.004972 | |
UQSHL (vector, 16B) 1->3 2.036153 | |
UQSHL (vector, 16B) throughput 1.008922 | |
UQSHL (vector, 4H) 1->2 2.026565 | |
UQSHL (vector, 4H) 1->3 1.997033 | |
UQSHL (vector, 4H) throughput 1.007798 | |
UQSHL (vector, 8H) 1->2 2.017110 | |
UQSHL (vector, 8H) 1->3 1.964820 | |
UQSHL (vector, 8H) throughput 1.010416 | |
UQSHL (vector, 2S) 1->2 2.065557 | |
UQSHL (vector, 2S) 1->3 2.030695 | |
UQSHL (vector, 2S) throughput 1.005824 | |
UQSHL (vector, 4S) 1->2 2.048863 | |
UQSHL (vector, 4S) 1->3 2.001206 | |
UQSHL (vector, 4S) throughput 1.004400 | |
UQSHL (vector, 2D) 1->2 2.031710 | |
UQSHL (vector, 2D) 1->3 1.999736 | |
UQSHL (vector, 2D) throughput 1.006574 | |
UQSHRN (H) 1->2 4.006342 | |
UQSHRN (H) throughput 1.009384 | |
UQSHRN (S) 1->2 4.056936 | |
UQSHRN (S) throughput 1.011226 | |
UQSHRN (D) 1->2 4.040596 | |
UQSHRN (D) throughput 0.991970 | |
UQSHRN (8H) 1->2 4.055417 | |
UQSHRN (8H) throughput 1.007980 | |
UQSHRN (4S) 1->2 4.010813 | |
UQSHRN (4S) throughput 1.013092 | |
UQSHRN (2D) 1->2 4.110466 | |
UQSHRN (2D) throughput 1.025580 | |
UQSHRN2 (8H) 1->1 4.318087 | |
UQSHRN2 (8H) 1->2 3.705922 | |
UQSHRN2 (8H) throughput 1.011685 | |
UQSHRN2 (8H) throughput 1.006353 | |
UQSHRN2 (4S) 1->1 4.037083 | |
UQSHRN2 (4S) 1->2 4.015565 | |
UQSHRN2 (4S) throughput 0.988904 | |
UQSHRN2 (4S) throughput 1.006362 | |
UQSHRN2 (2D) 1->1 4.069686 | |
UQSHRN2 (2D) 1->2 4.028105 | |
UQSHRN2 (2D) throughput 1.008035 | |
UQSHRN2 (2D) throughput 0.993281 | |
UQSUB (scalar, B) 1->2 2.994239 | |
UQSUB (scalar, B) 1->3 3.036126 | |
UQSUB (scalar, B) throughput 0.994878 | |
UQSUB (scalar, H) 1->2 3.001974 | |
UQSUB (scalar, H) 1->3 3.055673 | |
UQSUB (scalar, H) throughput 1.010550 | |
UQSUB (scalar, S) 1->2 2.967294 | |
UQSUB (scalar, S) 1->3 3.024720 | |
UQSUB (scalar, S) throughput 0.993717 | |
UQSUB (scalar, D) 1->2 3.018015 | |
UQSUB (scalar, D) 1->3 3.010519 | |
UQSUB (scalar, D) throughput 0.986013 | |
UQSUB (vector, 8B) 1->2 3.033175 | |
UQSUB (vector, 8B) 1->3 3.034664 | |
UQSUB (vector, 8B) throughput 1.031823 | |
UQSUB (vector, 16B) 1->2 2.742839 | |
UQSUB (vector, 16B) 1->3 3.020584 | |
UQSUB (vector, 16B) throughput 1.006164 | |
UQSUB (vector, 4H) 1->2 3.038129 | |
UQSUB (vector, 4H) 1->3 2.939930 | |
UQSUB (vector, 4H) throughput 1.008952 | |
UQSUB (vector, 8H) 1->2 3.032636 | |
UQSUB (vector, 8H) 1->3 3.011427 | |
UQSUB (vector, 8H) throughput 0.998015 | |
UQSUB (vector, 2S) 1->2 3.014097 | |
UQSUB (vector, 2S) 1->3 3.034707 | |
UQSUB (vector, 2S) throughput 1.009144 | |
UQSUB (vector, 4S) 1->2 3.000702 | |
UQSUB (vector, 4S) 1->3 2.995871 | |
UQSUB (vector, 4S) throughput 1.009187 | |
UQSUB (vector, 2D) 1->2 2.979049 | |
UQSUB (vector, 2D) 1->3 3.010512 | |
UQSUB (vector, 2D) throughput 1.010357 | |
UQXTN (H) 1->2 4.016167 | |
UQXTN (H) throughput 1.015597 | |
UQXTN (S) 1->2 4.068655 | |
UQXTN (S) throughput 0.997725 | |
UQXTN (D) 1->2 3.961083 | |
UQXTN (D) throughput 1.018110 | |
UQXTN (8H) 1->2 4.009096 | |
UQXTN (8H) throughput 0.999611 | |
UQXTN (4S) 1->2 4.026732 | |
UQXTN (4S) throughput 1.008459 | |
UQXTN (2D) 1->2 4.055073 | |
UQXTN (2D) throughput 1.019639 | |
UQXTN2 (8H) 1->1 3.978407 | |
UQXTN2 (8H) 1->2 4.030115 | |
UQXTN2 (8H) throughput 1.014675 | |
UQXTN2 (8H) throughput 1.005044 | |
UQXTN2 (4S) 1->1 4.020703 | |
UQXTN2 (4S) 1->2 4.014321 | |
UQXTN2 (4S) throughput 1.010416 | |
UQXTN2 (4S) throughput 1.007642 | |
UQXTN2 (2D) 1->1 4.074411 | |
UQXTN2 (2D) 1->2 4.077278 | |
UQXTN2 (2D) throughput 1.004530 | |
UQXTN2 (2D) throughput 0.989893 | |
URECPE (vector, 2S) 1->2 4.057679 | |
URECPE (vector, 2S) throughput 1.006313 | |
URECPE (vector, 4S) 1->2 4.021441 | |
URECPE (vector, 4S) throughput 1.006110 | |
URHADD (vector, 8B) 1->2 1.998509 | |
URHADD (vector, 8B) 1->3 2.027202 | |
URHADD (vector, 8B) throughput 1.006082 | |
URHADD (vector, 16B) 1->2 2.001536 | |
URHADD (vector, 16B) 1->3 2.017913 | |
URHADD (vector, 16B) throughput 0.989461 | |
URHADD (vector, 4H) 1->2 2.002751 | |
URHADD (vector, 4H) 1->3 2.012138 | |
URHADD (vector, 4H) throughput 1.002063 | |
URHADD (vector, 8H) 1->2 2.000466 | |
URHADD (vector, 8H) 1->3 2.013224 | |
URHADD (vector, 8H) throughput 0.979901 | |
URHADD (vector, 2S) 1->2 2.016209 | |
URHADD (vector, 2S) 1->3 2.006204 | |
URHADD (vector, 2S) throughput 1.007315 | |
URHADD (vector, 4S) 1->2 2.013993 | |
URHADD (vector, 4S) 1->3 1.977044 | |
URHADD (vector, 4S) throughput 1.010831 | |
URSHL (vector, 8B) 1->2 3.004358 | |
URSHL (vector, 8B) 1->3 3.026894 | |
URSHL (vector, 8B) throughput 1.008933 | |
URSHL (vector, 16B) 1->2 2.999457 | |
URSHL (vector, 16B) 1->3 2.991505 | |
URSHL (vector, 16B) throughput 1.011705 | |
URSHL (vector, 4H) 1->2 3.042107 | |
URSHL (vector, 4H) 1->3 2.956567 | |
URSHL (vector, 4H) throughput 1.015617 | |
URSHL (vector, 8H) 1->2 2.972683 | |
URSHL (vector, 8H) 1->3 3.054060 | |
URSHL (vector, 8H) throughput 1.040882 | |
URSHL (vector, 2S) 1->2 3.050813 | |
URSHL (vector, 2S) 1->3 3.006094 | |
URSHL (vector, 2S) throughput 1.011961 | |
URSHL (vector, 4S) 1->2 2.999438 | |
URSHL (vector, 4S) 1->3 3.010005 | |
URSHL (vector, 4S) throughput 0.989823 | |
URSHL (vector, 2D) 1->2 2.994172 | |
URSHL (vector, 2D) 1->3 3.023405 | |
URSHL (vector, 2D) throughput 1.014368 | |
URSHL (vector, D) 1->2 3.082364 | |
URSHL (vector, D) 1->3 3.020212 | |
URSHL (vector, D) throughput 1.006835 | |
URSHR (vector, 8B) 1->2 3.030027 | |
URSHR (vector, 8B) throughput 0.901813 | |
URSHR (vector, 16B) 1->2 3.005746 | |
URSHR (vector, 16B) throughput 1.010552 | |
URSHR (vector, 4H) 1->2 3.019908 | |
URSHR (vector, 4H) throughput 1.009390 | |
URSHR (vector, 8H) 1->2 2.993977 | |
URSHR (vector, 8H) throughput 1.009997 | |
URSHR (vector, 2S) 1->2 3.009017 | |
URSHR (vector, 2S) throughput 1.011055 | |
URSHR (vector, 4S) 1->2 3.002940 | |
URSHR (vector, 4S) throughput 1.011984 | |
URSHR (vector, 2D) 1->2 3.001294 | |
URSHR (vector, 2D) throughput 0.995331 | |
URSHR (vector, D) 1->2 3.001898 | |
URSHR (vector, D) throughput 0.954483 | |
URSQRTE (vector, 2S) 1->2 4.046286 | |
URSQRTE (vector, 2S) throughput 1.012579 | |
URSQRTE (vector, 4S) 1->2 4.036071 | |
URSQRTE (vector, 4S) throughput 1.007617 | |
URSRA (vector, 8B) 1->1 1.970056 | |
URSRA (vector, 8B) 1->2 3.013414 | |
URSRA (vector, 8B) throughput 1.015592 | |
URSRA (vector, 8B) throughput 1.012971 | |
URSRA (vector, 16B) 1->1 2.049146 | |
URSRA (vector, 16B) 1->2 3.006316 | |
URSRA (vector, 16B) throughput 0.989434 | |
URSRA (vector, 16B) throughput 1.021480 | |
URSRA (vector, 4H) 1->1 2.011957 | |
URSRA (vector, 4H) 1->2 3.049123 | |
URSRA (vector, 4H) throughput 1.003221 | |
URSRA (vector, 4H) throughput 1.003317 | |
URSRA (vector, 8H) 1->1 2.012375 | |
URSRA (vector, 8H) 1->2 3.037451 | |
URSRA (vector, 8H) throughput 1.003205 | |
URSRA (vector, 8H) throughput 0.998360 | |
URSRA (vector, 2S) 1->1 1.995227 | |
URSRA (vector, 2S) 1->2 3.017663 | |
URSRA (vector, 2S) throughput 0.998065 | |
URSRA (vector, 2S) throughput 0.973124 | |
URSRA (vector, 4S) 1->1 2.021207 | |
URSRA (vector, 4S) 1->2 2.970649 | |
URSRA (vector, 4S) throughput 1.002866 | |
URSRA (vector, 4S) throughput 1.010577 | |
URSRA (vector, 2D) 1->1 2.029386 | |
URSRA (vector, 2D) 1->2 3.008742 | |
URSRA (vector, 2D) throughput 1.009473 | |
URSRA (vector, 2D) throughput 1.016716 | |
URSRA (vector, D) 1->1 1.990011 | |
URSRA (vector, D) 1->2 3.030444 | |
URSRA (vector, D) throughput 1.018320 | |
URSRA (vector, D) throughput 0.998967 | |
USHL (vector, 8B) 1->2 1.982262 | |
USHL (vector, 8B) 1->3 2.044721 | |
USHL (vector, 8B) throughput 1.004979 | |
USHL (vector, 16B) 1->2 1.994720 | |
USHL (vector, 16B) 1->3 2.001096 | |
USHL (vector, 16B) throughput 1.009338 | |
USHL (vector, 4H) 1->2 2.020958 | |
USHL (vector, 4H) 1->3 1.966442 | |
USHL (vector, 4H) throughput 1.011940 | |
USHL (vector, 8H) 1->2 2.006465 | |
USHL (vector, 8H) 1->3 2.007446 | |
USHL (vector, 8H) throughput 0.997243 | |
USHL (vector, 2S) 1->2 1.971311 | |
USHL (vector, 2S) 1->3 2.014619 | |
USHL (vector, 2S) throughput 1.023315 | |
USHL (vector, 4S) 1->2 2.007411 | |
USHL (vector, 4S) 1->3 1.966042 | |
USHL (vector, 4S) throughput 1.014678 | |
USHL (vector, 2D) 1->2 2.000679 | |
USHL (vector, 2D) 1->3 2.010819 | |
USHL (vector, 2D) throughput 1.007399 | |
USHL (vector, D) 1->2 2.007406 | |
USHL (vector, D) 1->3 2.026892 | |
USHL (vector, D) throughput 1.019861 | |
USHLL (8H) 1->2 2.074269 | |
USHLL (8H) throughput 1.006221 | |
USHLL (4S) 1->2 1.964782 | |
USHLL (4S) throughput 1.013665 | |
USHLL (2D) 1->2 1.964603 | |
USHLL (2D) throughput 1.003747 | |
USHLL2 (8H) 1->2 2.059353 | |
USHLL2 (8H) throughput 1.010832 | |
USHLL2 (4S) 1->2 1.998271 | |
USHLL2 (4S) throughput 1.005987 | |
USHLL2 (2D) 1->2 2.000964 | |
USHLL2 (2D) throughput 1.011923 | |
USHR (vector, 8B) 1->2 2.049367 | |
USHR (vector, 8B) throughput 0.997892 | |
USHR (vector, 16B) 1->2 2.036010 | |
USHR (vector, 16B) throughput 0.998202 | |
USHR (vector, 4H) 1->2 2.005965 | |
USHR (vector, 4H) throughput 1.009395 | |
USHR (vector, 8H) 1->2 2.018181 | |
USHR (vector, 8H) throughput 1.006311 | |
USHR (vector, 2S) 1->2 2.125660 | |
USHR (vector, 2S) throughput 1.032366 | |
USHR (vector, 4S) 1->2 2.105794 | |
USHR (vector, 4S) throughput 1.001931 | |
USHR (vector, 2D) 1->2 2.037103 | |
USHR (vector, 2D) throughput 1.010747 | |
USHR (vector, D) 1->2 2.009817 | |
USHR (vector, D) throughput 1.007375 | |
USQADD (scalar, B) 1->1 3.070526 | |
USQADD (scalar, B) 1->2 2.998311 | |
USQADD (scalar, B) throughput 1.015586 | |
USQADD (scalar, B) throughput 1.007416 | |
USQADD (scalar, H) 1->1 3.024895 | |
USQADD (scalar, H) 1->2 3.026139 | |
USQADD (scalar, H) throughput 1.007012 | |
USQADD (scalar, H) throughput 1.019739 | |
USQADD (scalar, S) 1->1 2.898774 | |
USQADD (scalar, S) 1->2 3.013050 | |
USQADD (scalar, S) throughput 0.997145 | |
USQADD (scalar, S) throughput 1.013852 | |
USQADD (scalar, D) 1->1 3.014540 | |
USQADD (scalar, D) 1->2 3.008864 | |
USQADD (scalar, D) throughput 1.011575 | |
USQADD (scalar, D) throughput 1.008720 | |
USQADD (vector, 8B) 1->1 2.964971 | |
USQADD (vector, 8B) 1->2 2.994524 | |
USQADD (vector, 8B) throughput 1.013431 | |
USQADD (vector, 8B) throughput 1.004135 | |
USQADD (vector, 16B) 1->1 3.023662 | |
USQADD (vector, 16B) 1->2 2.964636 | |
USQADD (vector, 16B) throughput 0.990499 | |
USQADD (vector, 16B) throughput 1.006895 | |
USQADD (vector, 4H) 1->1 2.994415 | |
USQADD (vector, 4H) 1->2 3.030149 | |
USQADD (vector, 4H) throughput 1.005547 | |
USQADD (vector, 4H) throughput 1.008261 | |
USQADD (vector, 8H) 1->1 3.042432 | |
USQADD (vector, 8H) 1->2 3.039167 | |
USQADD (vector, 8H) throughput 0.996934 | |
USQADD (vector, 8H) throughput 0.992876 | |
USQADD (vector, 2S) 1->1 3.025645 | |
USQADD (vector, 2S) 1->2 3.036349 | |
USQADD (vector, 2S) throughput 1.012227 | |
USQADD (vector, 2S) throughput 0.972064 | |
USQADD (vector, 4S) 1->1 2.998899 | |
USQADD (vector, 4S) 1->2 3.028727 | |
USQADD (vector, 4S) throughput 1.006914 | |
USQADD (vector, 4S) throughput 1.009266 | |
USQADD (vector, 2D) 1->1 2.916537 | |
USQADD (vector, 2D) 1->2 3.002162 | |
USQADD (vector, 2D) throughput 0.992278 | |
USQADD (vector, 2D) throughput 1.016504 | |
USRA (vector, 8B) 1->1 2.008662 | |
USRA (vector, 8B) 1->2 3.026691 | |
USRA (vector, 8B) throughput 0.995876 | |
USRA (vector, 8B) throughput 1.007489 | |
USRA (vector, 16B) 1->1 1.998401 | |
USRA (vector, 16B) 1->2 3.019364 | |
USRA (vector, 16B) throughput 1.009325 | |
USRA (vector, 16B) throughput 1.004239 | |
USRA (vector, 4H) 1->1 2.009497 | |
USRA (vector, 4H) 1->2 2.970091 | |
USRA (vector, 4H) throughput 1.011410 | |
USRA (vector, 4H) throughput 1.012972 | |
USRA (vector, 8H) 1->1 2.010711 | |
USRA (vector, 8H) 1->2 3.028507 | |
USRA (vector, 8H) throughput 1.010049 | |
USRA (vector, 8H) throughput 0.988095 | |
USRA (vector, 2S) 1->1 1.968524 | |
USRA (vector, 2S) 1->2 3.010405 | |
USRA (vector, 2S) throughput 0.993129 | |
USRA (vector, 2S) throughput 1.013219 | |
USRA (vector, 4S) 1->1 2.043365 | |
USRA (vector, 4S) 1->2 3.019625 | |
USRA (vector, 4S) throughput 1.008218 | |
USRA (vector, 4S) throughput 0.997444 | |
USRA (vector, 2D) 1->1 2.036521 | |
USRA (vector, 2D) 1->2 3.025509 | |
USRA (vector, 2D) throughput 1.015594 | |
USRA (vector, 2D) throughput 1.008616 | |
USRA (vector, D) 1->1 1.997215 | |
USRA (vector, D) 1->2 3.026202 | |
USRA (vector, D) throughput 1.009821 | |
USRA (vector, D) throughput 0.897408 | |
USUBL (vector, 8H) 1->2 2.015614 | |
USUBL (vector, 8H) 1->3 2.013863 | |
USUBL (vector, 8H) throughput 0.991812 | |
USUBL (vector, 4S) 1->2 1.997192 | |
USUBL (vector, 4S) 1->3 2.023092 | |
USUBL (vector, 4S) throughput 1.003124 | |
USUBL (vector, 2D) 1->2 1.997841 | |
USUBL (vector, 2D) 1->3 2.014361 | |
USUBL (vector, 2D) throughput 1.010381 | |
USUBL2 (vector, 8H) 1->2 2.021050 | |
USUBL2 (vector, 8H) 1->3 2.006532 | |
USUBL2 (vector, 8H) throughput 0.995777 | |
USUBL2 (vector, 4S) 1->2 2.019819 | |
USUBL2 (vector, 4S) 1->3 1.973316 | |
USUBL2 (vector, 4S) throughput 1.011425 | |
USUBL2 (vector, 2D) 1->2 2.004993 | |
USUBL2 (vector, 2D) 1->3 1.993156 | |
USUBL2 (vector, 2D) throughput 1.005047 | |
USUBW (vector, 8H) 1->2 2.041016 | |
USUBW (vector, 8H) 1->3 2.006377 | |
USUBW (vector, 8H) throughput 1.014379 | |
USUBW (vector, 4S) 1->2 2.006329 | |
USUBW (vector, 4S) 1->3 2.006708 | |
USUBW (vector, 4S) throughput 0.985698 | |
USUBW (vector, 2D) 1->2 1.998554 | |
USUBW (vector, 2D) 1->3 2.053920 | |
USUBW (vector, 2D) throughput 1.014242 | |
USUBW2 (vector, 8H) 1->2 2.000614 | |
USUBW2 (vector, 8H) 1->3 2.007719 | |
USUBW2 (vector, 8H) throughput 1.007974 | |
USUBW2 (vector, 4S) 1->2 2.015023 | |
USUBW2 (vector, 4S) 1->3 1.946863 | |
USUBW2 (vector, 4S) throughput 0.987427 | |
USUBW2 (vector, 2D) 1->2 2.046619 | |
USUBW2 (vector, 2D) 1->3 2.434172 | |
USUBW2 (vector, 2D) throughput 1.002191 | |
UXTL (8H) 1->2 1.996426 | |
UXTL (8H) throughput 1.022438 | |
UXTL (4S) 1->2 2.027412 | |
UXTL (4S) throughput 1.013041 | |
UXTL (2D) 1->2 1.997022 | |
UXTL (2D) throughput 0.993712 | |
UXTL2 (8H) 1->2 1.983242 | |
UXTL2 (8H) throughput 1.018955 | |
UXTL2 (4S) 1->2 2.004753 | |
UXTL2 (4S) throughput 0.995269 | |
UXTL2 (2D) 1->2 2.022418 | |
UXTL2 (2D) throughput 1.009490 | |
UZP1 (vector, 8B) 1->2 2.037658 | |
UZP1 (vector, 8B) 1->3 1.996821 | |
UZP1 (vector, 8B) throughput 1.128574 | |
UZP1 (vector, 16B) 1->2 2.028158 | |
UZP1 (vector, 16B) 1->3 1.996201 | |
UZP1 (vector, 16B) throughput 1.000340 | |
UZP1 (vector, 4H) 1->2 2.006850 | |
UZP1 (vector, 4H) 1->3 1.997983 | |
UZP1 (vector, 4H) throughput 1.004215 | |
UZP1 (vector, 8H) 1->2 1.998969 | |
UZP1 (vector, 8H) 1->3 2.034084 | |
UZP1 (vector, 8H) throughput 1.012748 | |
UZP1 (vector, 2S) 1->2 2.021062 | |
UZP1 (vector, 2S) 1->3 1.979898 | |
UZP1 (vector, 2S) throughput 0.987572 | |
UZP1 (vector, 4S) 1->2 1.998533 | |
UZP1 (vector, 4S) 1->3 1.978211 | |
UZP1 (vector, 4S) throughput 1.003713 | |
UZP1 (vector, 2D) 1->2 2.009481 | |
UZP1 (vector, 2D) 1->3 2.022678 | |
UZP1 (vector, 2D) throughput 0.994420 | |
UZP2 (vector, 8B) 1->2 2.006969 | |
UZP2 (vector, 8B) 1->3 2.063862 | |
UZP2 (vector, 8B) throughput 1.019601 | |
UZP2 (vector, 16B) 1->2 2.003443 | |
UZP2 (vector, 16B) 1->3 2.011363 | |
UZP2 (vector, 16B) throughput 1.009820 | |
UZP2 (vector, 4H) 1->2 2.005262 | |
UZP2 (vector, 4H) 1->3 2.008599 | |
UZP2 (vector, 4H) throughput 1.012967 | |
UZP2 (vector, 8H) 1->2 1.993956 | |
UZP2 (vector, 8H) 1->3 2.008664 | |
UZP2 (vector, 8H) throughput 1.007638 | |
UZP2 (vector, 2S) 1->2 1.967523 | |
UZP2 (vector, 2S) 1->3 2.023072 | |
UZP2 (vector, 2S) throughput 1.014149 | |
UZP2 (vector, 4S) 1->2 2.021310 | |
UZP2 (vector, 4S) 1->3 2.021648 | |
UZP2 (vector, 4S) throughput 0.989008 | |
UZP2 (vector, 2D) 1->2 2.046774 | |
UZP2 (vector, 2D) 1->3 2.011616 | |
UZP2 (vector, 2D) throughput 1.006382 | |
XTN (8H) 1->2 2.045587 | |
XTN (8H) throughput 1.012241 | |
XTN (4S) 1->2 1.993432 | |
XTN (4S) throughput 1.010385 | |
XTN (2D) 1->2 1.997324 | |
XTN (2D) throughput 0.993312 | |
XTN2 (8H) 1->1 1.958505 | |
XTN2 (8H) 1->2 1.998959 | |
XTN2 (8H) throughput 1.012375 | |
XTN2 (8H) throughput 1.002959 | |
XTN2 (4S) 1->1 1.999672 | |
XTN2 (4S) 1->2 2.011361 | |
XTN2 (4S) throughput 0.993897 | |
XTN2 (4S) throughput 1.001464 | |
XTN2 (2D) 1->1 2.001335 | |
XTN2 (2D) 1->2 1.969894 | |
XTN2 (2D) throughput 0.995561 | |
XTN2 (2D) throughput 1.014650 | |
ZIP1 (vector, 8B) 1->2 1.981046 | |
ZIP1 (vector, 8B) 1->3 2.033533 | |
ZIP1 (vector, 8B) throughput 1.010053 | |
ZIP1 (vector, 16B) 1->2 2.032022 | |
ZIP1 (vector, 16B) 1->3 2.047964 | |
ZIP1 (vector, 16B) throughput 1.018360 | |
ZIP1 (vector, 4H) 1->2 1.991134 | |
ZIP1 (vector, 4H) 1->3 2.040148 | |
ZIP1 (vector, 4H) throughput 1.019063 | |
ZIP1 (vector, 8H) 1->2 2.004231 | |
ZIP1 (vector, 8H) 1->3 2.010182 | |
ZIP1 (vector, 8H) throughput 0.994850 | |
ZIP1 (vector, 2S) 1->2 2.059960 | |
ZIP1 (vector, 2S) 1->3 2.010262 | |
ZIP1 (vector, 2S) throughput 1.005501 | |
ZIP1 (vector, 4S) 1->2 1.998114 | |
ZIP1 (vector, 4S) 1->3 2.039771 | |
ZIP1 (vector, 4S) throughput 1.001410 | |
ZIP1 (vector, 2D) 1->2 1.994941 | |
ZIP1 (vector, 2D) 1->3 2.046762 | |
ZIP1 (vector, 2D) throughput 1.007975 | |
ZIP2 (vector, 8B) 1->2 1.985348 | |
ZIP2 (vector, 8B) 1->3 2.044116 | |
ZIP2 (vector, 8B) throughput 1.021643 | |
ZIP2 (vector, 16B) 1->2 1.999058 | |
ZIP2 (vector, 16B) 1->3 1.943940 | |
ZIP2 (vector, 16B) throughput 1.004225 | |
ZIP2 (vector, 4H) 1->2 2.022657 | |
ZIP2 (vector, 4H) 1->3 2.001246 | |
ZIP2 (vector, 4H) throughput 1.007186 | |
ZIP2 (vector, 8H) 1->2 2.017150 | |
ZIP2 (vector, 8H) 1->3 2.000989 | |
ZIP2 (vector, 8H) throughput 1.003736 | |
ZIP2 (vector, 2S) 1->2 1.998356 | |
ZIP2 (vector, 2S) 1->3 2.024065 | |
ZIP2 (vector, 2S) throughput 1.004968 | |
ZIP2 (vector, 4S) 1->2 2.020453 | |
ZIP2 (vector, 4S) 1->3 2.018754 | |
ZIP2 (vector, 4S) throughput 1.004913 | |
ZIP2 (vector, 2D) 1->2 1.997973 | |
ZIP2 (vector, 2D) 1->3 2.011876 | |
ZIP2 (vector, 2D) throughput 1.034460 |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
ADC (32-bit) 1->2 1.001903 | |
ADC (32-bit) 1->3 1.001863 | |
ADC (32-bit) 1->4 1.003920 | |
ADC (32-bit) throughput 0.333057 | |
ADC (64-bit) 1->2 1.002713 | |
ADC (64-bit) 1->3 0.990760 | |
ADC (64-bit) 1->4 0.993828 | |
ADC (64-bit) throughput 0.354186 | |
ADCS (32-bit) 1->2 1.088066 | |
ADCS (32-bit) 1->3 1.039229 | |
ADCS (32-bit) 1->4 1.016314 | |
ADCS (32-bit) 4->2 0.996122 | |
ADCS (32-bit) 4->3 1.001621 | |
ADCS (32-bit) 4->4 1.003887 | |
ADCS (32-bit) throughput 0.859593 | |
ADCS (32-bit) throughput 0.623696 | |
ADCS (64-bit) 1->2 0.997536 | |
ADCS (64-bit) 1->3 1.006640 | |
ADCS (64-bit) 1->4 1.017710 | |
ADCS (64-bit) 4->2 1.007450 | |
ADCS (64-bit) 4->3 1.038165 | |
ADCS (64-bit) 4->4 1.005506 | |
ADCS (64-bit) throughput 0.796344 | |
ADCS (64-bit) throughput 0.620338 | |
ADD (sxtb, 32-bit) 1->2 1.995841 | |
ADD (sxtb, 32-bit) 1->3 1.991248 | |
ADD (sxtb, 32-bit) throughput 0.667155 | |
ADD (sxtb, 64-bit) 1->2 2.004213 | |
ADD (sxtb, 64-bit) 1->3 2.005386 | |
ADD (sxtb, 64-bit) throughput 0.668243 | |
ADD (uxtb, 32-bit) 1->2 2.002470 | |
ADD (uxtb, 32-bit) 1->3 1.999839 | |
ADD (uxtb, 32-bit) throughput 0.692735 | |
ADD (uxtb, 64-bit) 1->2 2.003438 | |
ADD (uxtb, 64-bit) 1->3 1.993274 | |
ADD (uxtb, 64-bit) throughput 0.672232 | |
ADD (sxth, 32-bit) 1->2 1.999515 | |
ADD (sxth, 32-bit) 1->3 2.822200 | |
ADD (sxth, 32-bit) throughput 0.684144 | |
ADD (sxth, 64-bit) 1->2 2.241766 | |
ADD (sxth, 64-bit) 1->3 2.006549 | |
ADD (sxth, 64-bit) throughput 0.629518 | |
ADD (uxth, 32-bit) 1->2 1.992725 | |
ADD (uxth, 32-bit) 1->3 2.000277 | |
ADD (uxth, 32-bit) throughput 0.673479 | |
ADD (uxth, 64-bit) 1->2 2.003079 | |
ADD (uxth, 64-bit) 1->3 2.006722 | |
ADD (uxth, 64-bit) throughput 0.750779 | |
ADD (sxtw, 32-bit) 1->2 1.001660 | |
ADD (sxtw, 32-bit) 1->3 1.002632 | |
ADD (sxtw, 32-bit) throughput 0.333937 | |
ADD (sxtw, 64-bit) 1->2 2.001741 | |
ADD (sxtw, 64-bit) 1->3 2.001538 | |
ADD (sxtw, 64-bit) throughput 0.666080 | |
ADD (uxtw, 32-bit) 1->2 1.000258 | |
ADD (uxtw, 32-bit) 1->3 1.004697 | |
ADD (uxtw, 32-bit) throughput 0.334053 | |
ADD (uxtw, 64-bit) 1->2 2.000728 | |
ADD (uxtw, 64-bit) 1->3 1.978352 | |
ADD (uxtw, 64-bit) throughput 0.668252 | |
ADD (uxtx, 64-bit) 1->2 0.998141 | |
ADD (uxtx, 64-bit) 1->3 1.001660 | |
ADD (uxtx, 64-bit) throughput 0.333881 | |
ADD (sxtx, 64-bit) 1->2 1.002510 | |
ADD (sxtx, 64-bit) 1->3 1.003240 | |
ADD (sxtx, 64-bit) throughput 0.334626 | |
ADD (immediate, 32-bit) 1->2 1.000322 | |
ADD (immediate, 32-bit) throughput 0.333512 | |
ADD (immediate, 64-bit) 1->2 1.001538 | |
ADD (immediate, 64-bit) throughput 0.357442 | |
ADD (shifted immediate, 32-bit) 1->2 1.002101 | |
ADD (shifted immediate, 32-bit) throughput 0.334842 | |
ADD (shifted immediate, 64-bit) 1->2 0.999151 | |
ADD (shifted immediate, 64-bit) throughput 0.332331 | |
ADD (register, 32-bit) 1->2 0.999273 | |
ADD (register, 32-bit) 1->3 0.997214 | |
ADD (register, 32-bit) throughput 0.333728 | |
ADD (register, 64-bit) 1->2 1.000566 | |
ADD (register, 64-bit) 1->3 0.997938 | |
ADD (register, 64-bit) throughput 0.334417 | |
ADD (register, lsl, 32-bit) 1->2 2.024686 | |
ADD (register, lsl, 32-bit) 1->3 2.000444 | |
ADD (register, lsl, 32-bit) throughput 0.643954 | |
ADD (register, lsl, 64-bit) 1->2 1.342847 | |
ADD (register, lsl, 64-bit) 1->3 1.809081 | |
ADD (register, lsl, 64-bit) throughput 0.804207 | |
ADD (register, lsr, 32-bit) 1->2 2.000342 | |
ADD (register, lsr, 32-bit) 1->3 1.999929 | |
ADD (register, lsr, 32-bit) throughput 0.662794 | |
ADD (register, lsr, 64-bit) 1->2 2.006398 | |
ADD (register, lsr, 64-bit) 1->3 2.284870 | |
ADD (register, lsr, 64-bit) throughput 0.666871 | |
ADD (register, asr, 32-bit) 1->2 2.133751 | |
ADD (register, asr, 32-bit) 1->3 2.001721 | |
ADD (register, asr, 32-bit) throughput 0.668116 | |
ADD (register, asr, 64-bit) 1->2 2.003726 | |
ADD (register, asr, 64-bit) 1->3 1.999637 | |
ADD (register, asr, 64-bit) throughput 0.694452 | |
ADDS (sxtb, 32-bit) 1->2 2.031534 | |
ADDS (sxtb, 32-bit) 1->3 1.984801 | |
ADDS (sxtb, 32-bit) 4->2 2.024389 | |
ADDS (sxtb, 32-bit) 4->3 1.997249 | |
ADDS (sxtb, 32-bit) throughput 0.666822 | |
ADDS (sxtb, 64-bit) 1->2 1.998144 | |
ADDS (sxtb, 64-bit) 1->3 1.995963 | |
ADDS (sxtb, 64-bit) 4->2 1.993183 | |
ADDS (sxtb, 64-bit) 4->3 1.998624 | |
ADDS (sxtb, 64-bit) throughput 0.676349 | |
ADDS (uxtb, 32-bit) 1->2 2.001621 | |
ADDS (uxtb, 32-bit) 1->3 2.004855 | |
ADDS (uxtb, 32-bit) 4->2 2.004900 | |
ADDS (uxtb, 32-bit) 4->3 2.008262 | |
ADDS (uxtb, 32-bit) throughput 0.668861 | |
ADDS (uxtb, 64-bit) 1->2 1.991500 | |
ADDS (uxtb, 64-bit) 1->3 1.974317 | |
ADDS (uxtb, 64-bit) 4->2 2.043171 | |
ADDS (uxtb, 64-bit) 4->3 2.004292 | |
ADDS (uxtb, 64-bit) throughput 0.668924 | |
ADDS (sxth, 32-bit) 1->2 2.003840 | |
ADDS (sxth, 32-bit) 1->3 2.002063 | |
ADDS (sxth, 32-bit) 4->2 1.997857 | |
ADDS (sxth, 32-bit) 4->3 1.995239 | |
ADDS (sxth, 32-bit) throughput 0.666426 | |
ADDS (sxth, 64-bit) 1->2 1.979793 | |
ADDS (sxth, 64-bit) 1->3 1.999111 | |
ADDS (sxth, 64-bit) 4->2 2.003928 | |
ADDS (sxth, 64-bit) 4->3 1.991963 | |
ADDS (sxth, 64-bit) throughput 0.702496 | |
ADDS (uxth, 32-bit) 1->2 2.000687 | |
ADDS (uxth, 32-bit) 1->3 1.970247 | |
ADDS (uxth, 32-bit) 4->2 2.054057 | |
ADDS (uxth, 32-bit) 4->3 2.003314 | |
ADDS (uxth, 32-bit) throughput 0.711197 | |
ADDS (uxth, 64-bit) 1->2 2.139254 | |
ADDS (uxth, 64-bit) 1->3 1.933987 | |
ADDS (uxth, 64-bit) 4->2 2.041828 | |
ADDS (uxth, 64-bit) 4->3 2.148362 | |
ADDS (uxth, 64-bit) throughput 0.700568 | |
ADDS (sxtw, 32-bit) 1->2 1.002915 | |
ADDS (sxtw, 32-bit) 1->3 1.001701 | |
ADDS (sxtw, 32-bit) 4->2 1.002911 | |
ADDS (sxtw, 32-bit) 4->3 0.990223 | |
ADDS (sxtw, 32-bit) throughput 0.500420 | |
ADDS (sxtw, 64-bit) 1->2 2.005507 | |
ADDS (sxtw, 64-bit) 1->3 2.002511 | |
ADDS (sxtw, 64-bit) 4->2 2.002626 | |
ADDS (sxtw, 64-bit) 4->3 2.007413 | |
ADDS (sxtw, 64-bit) throughput 0.674444 | |
ADDS (uxtw, 32-bit) 1->2 0.999274 | |
ADDS (uxtw, 32-bit) 1->3 1.001701 | |
ADDS (uxtw, 32-bit) 4->2 1.004850 | |
ADDS (uxtw, 32-bit) 4->3 1.019735 | |
ADDS (uxtw, 32-bit) throughput 0.500784 | |
ADDS (uxtw, 64-bit) 1->2 1.987062 | |
ADDS (uxtw, 64-bit) 1->3 1.999313 | |
ADDS (uxtw, 64-bit) 4->2 2.002387 | |
ADDS (uxtw, 64-bit) 4->3 2.002871 | |
ADDS (uxtw, 64-bit) throughput 0.670504 | |
ADDS (uxtx, 64-bit) 1->2 0.999031 | |
ADDS (uxtx, 64-bit) 1->3 1.001862 | |
ADDS (uxtx, 64-bit) 4->2 1.035277 | |
ADDS (uxtx, 64-bit) 4->3 1.007976 | |
ADDS (uxtx, 64-bit) throughput 0.711329 | |
ADDS (sxtx, 64-bit) 1->2 0.998111 | |
ADDS (sxtx, 64-bit) 1->3 1.000688 | |
ADDS (sxtx, 64-bit) 4->2 0.996325 | |
ADDS (sxtx, 64-bit) 4->3 1.005022 | |
ADDS (sxtx, 64-bit) throughput 0.500774 | |
ADDS (immediate, 32-bit) 1->2 1.003280 | |
ADDS (immediate, 32-bit) 3->2 0.999031 | |
ADDS (immediate, 32-bit) throughput 0.502610 | |
ADDS (immediate, 64-bit) 1->2 0.998180 | |
ADDS (immediate, 64-bit) 3->2 1.004170 | |
ADDS (immediate, 64-bit) throughput 0.500971 | |
ADDS (shifted immediate, 32-bit) 1->2 1.072839 | |
ADDS (shifted immediate, 32-bit) 3->2 1.011981 | |
ADDS (shifted immediate, 32-bit) throughput 0.501484 | |
ADDS (shifted immediate, 64-bit) 1->2 0.998383 | |
ADDS (shifted immediate, 64-bit) 3->2 1.023486 | |
ADDS (shifted immediate, 64-bit) throughput 0.500106 | |
ADDS (register, 32-bit) 1->2 1.000000 | |
ADDS (register, 32-bit) 1->3 1.025081 | |
ADDS (register, 32-bit) 4->2 1.005262 | |
ADDS (register, 32-bit) 4->3 1.003354 | |
ADDS (register, 32-bit) throughput 0.500232 | |
ADDS (register, 64-bit) 1->2 1.000887 | |
ADDS (register, 64-bit) 1->3 0.772217 | |
ADDS (register, 64-bit) 4->2 1.005539 | |
ADDS (register, 64-bit) 4->3 0.749781 | |
ADDS (register, 64-bit) throughput 1.016991 | |
ADDS (register, lsl, 32-bit) 1->2 3.589544 | |
ADDS (register, lsl, 32-bit) 1->3 1.153530 | |
ADDS (register, lsl, 32-bit) 4->2 3.351728 | |
ADDS (register, lsl, 32-bit) 4->3 2.030090 | |
ADDS (register, lsl, 32-bit) throughput 0.667407 | |
ADDS (register, lsl, 64-bit) 1->2 1.907195 | |
ADDS (register, lsl, 64-bit) 1->3 2.008221 | |
ADDS (register, lsl, 64-bit) 4->2 2.005304 | |
ADDS (register, lsl, 64-bit) 4->3 2.000323 | |
ADDS (register, lsl, 64-bit) throughput 0.672743 | |
ADDS (register, lsr, 32-bit) 1->2 1.999514 | |
ADDS (register, lsr, 32-bit) 1->3 2.007331 | |
ADDS (register, lsr, 32-bit) 4->2 2.007451 | |
ADDS (register, lsr, 32-bit) 4->3 2.000807 | |
ADDS (register, lsr, 32-bit) throughput 0.668367 | |
ADDS (register, lsr, 64-bit) 1->2 2.001454 | |
ADDS (register, lsr, 64-bit) 1->3 1.999273 | |
ADDS (register, lsr, 64-bit) 4->2 2.008180 | |
ADDS (register, lsr, 64-bit) 4->3 2.292598 | |
ADDS (register, lsr, 64-bit) throughput 0.671835 | |
ADDS (register, asr, 32-bit) 1->2 2.001940 | |
ADDS (register, asr, 32-bit) 1->3 2.002349 | |
ADDS (register, asr, 32-bit) 4->2 2.003929 | |
ADDS (register, asr, 32-bit) 4->3 2.003807 | |
ADDS (register, asr, 32-bit) throughput 0.669334 | |
ADDS (register, asr, 64-bit) 1->2 1.996407 | |
ADDS (register, asr, 64-bit) 1->3 2.006953 | |
ADDS (register, asr, 64-bit) 4->2 2.054001 | |
ADDS (register, asr, 64-bit) 4->3 1.994355 | |
ADDS (register, asr, 64-bit) throughput 0.672331 | |
ADR throughput 0.261793 | |
ADRP throughput 0.249558 | |
AND (immediate, 32-bit) 1->2 1.002658 | |
AND (immediate, 32-bit) throughput 0.335666 | |
AND (immediate, 64-bit) 1->2 0.976435 | |
AND (immediate, 64-bit) throughput 0.333986 | |
AND (register, 32-bit) 1->2 1.001863 | |
AND (register, 32-bit) 1->3 1.001660 | |
AND (register, 32-bit) throughput 0.334299 | |
AND (register, 64-bit) 1->2 1.000243 | |
AND (register, 64-bit) 1->3 1.001782 | |
AND (register, 64-bit) throughput 0.334081 | |
AND (register, lsl, 32-bit) 1->2 2.003927 | |
AND (register, lsl, 32-bit) 1->3 2.001699 | |
AND (register, lsl, 32-bit) throughput 0.651652 | |
AND (register, lsl, 64-bit) 1->2 1.998019 | |
AND (register, lsl, 64-bit) 1->3 2.004050 | |
AND (register, lsl, 64-bit) throughput 0.674301 | |
AND (register, lsr, 32-bit) 1->2 2.002871 | |
AND (register, lsr, 32-bit) 1->3 1.999071 | |
AND (register, lsr, 32-bit) throughput 0.667530 | |
AND (register, lsr, 64-bit) 1->2 2.006964 | |
AND (register, lsr, 64-bit) 1->3 2.002389 | |
AND (register, lsr, 64-bit) throughput 0.666111 | |
AND (register, asr, 32-bit) 1->2 2.004258 | |
AND (register, asr, 32-bit) 1->3 2.001666 | |
AND (register, asr, 32-bit) throughput 0.666394 | |
AND (register, asr, 64-bit) 1->2 2.003887 | |
AND (register, asr, 64-bit) 1->3 2.063459 | |
AND (register, asr, 64-bit) throughput 0.668464 | |
AND (register, ror, 32-bit) 1->2 2.004980 | |
AND (register, ror, 32-bit) 1->3 1.991184 | |
AND (register, ror, 32-bit) throughput 0.667424 | |
AND (register, ror, 64-bit) 1->2 2.002633 | |
AND (register, ror, 64-bit) 1->3 1.996365 | |
AND (register, ror, 64-bit) throughput 0.668071 | |
ADDS (immediate, 32-bit) 1->2 0.998282 | |
ADDS (immediate, 32-bit) 3->2 0.999636 | |
ADDS (immediate, 32-bit) throughput 0.501762 | |
ADDS (immediate, 64-bit) 1->2 1.002062 | |
ADDS (immediate, 64-bit) 3->2 1.000566 | |
ADDS (immediate, 64-bit) throughput 0.499869 | |
ANDS (register, 32-bit) 1->2 1.332210 | |
ANDS (register, 32-bit) 1->3 1.000171 | |
ANDS (register, 32-bit) 4->2 0.969679 | |
ANDS (register, 32-bit) 4->3 0.891175 | |
ANDS (register, 32-bit) throughput 0.507547 | |
ANDS (register, 64-bit) 1->2 1.056775 | |
ANDS (register, 64-bit) 1->3 1.008949 | |
ANDS (register, 64-bit) 4->2 1.001171 | |
ANDS (register, 64-bit) 4->3 1.007882 | |
ANDS (register, 64-bit) throughput 0.501807 | |
ANDS (register, lsl, 32-bit) 1->2 2.004252 | |
ANDS (register, lsl, 32-bit) 1->3 2.002389 | |
ANDS (register, lsl, 32-bit) 4->2 2.002954 | |
ANDS (register, lsl, 32-bit) 4->3 1.997171 | |
ANDS (register, lsl, 32-bit) throughput 0.668558 | |
ANDS (register, lsl, 64-bit) 1->2 2.003362 | |
ANDS (register, lsl, 64-bit) 1->3 2.043286 | |
ANDS (register, lsl, 64-bit) 4->2 2.002628 | |
ANDS (register, lsl, 64-bit) 4->3 1.997778 | |
ANDS (register, lsl, 64-bit) throughput 0.667960 | |
ANDS (register, lsr, 32-bit) 1->2 2.002631 | |
ANDS (register, lsr, 32-bit) 1->3 2.002343 | |
ANDS (register, lsr, 32-bit) 4->2 2.008707 | |
ANDS (register, lsr, 32-bit) 4->3 2.004291 | |
ANDS (register, lsr, 32-bit) throughput 0.668623 | |
ANDS (register, lsr, 64-bit) 1->2 1.998425 | |
ANDS (register, lsr, 64-bit) 1->3 1.996082 | |
ANDS (register, lsr, 64-bit) 4->2 1.983005 | |
ANDS (register, lsr, 64-bit) 4->3 1.706663 | |
ANDS (register, lsr, 64-bit) throughput 0.671978 | |
ANDS (register, asr, 32-bit) 1->2 2.002512 | |
ANDS (register, asr, 32-bit) 1->3 2.002469 | |
ANDS (register, asr, 32-bit) 4->2 2.004566 | |
ANDS (register, asr, 32-bit) 4->3 2.004656 | |
ANDS (register, asr, 32-bit) throughput 0.667100 | |
ANDS (register, asr, 64-bit) 1->2 2.001578 | |
ANDS (register, asr, 64-bit) 1->3 2.002834 | |
ANDS (register, asr, 64-bit) 4->2 2.114553 | |
ANDS (register, asr, 64-bit) 4->3 2.011183 | |
ANDS (register, asr, 64-bit) throughput 0.675008 | |
ANDS (register, ror, 32-bit) 1->2 2.007975 | |
ANDS (register, ror, 32-bit) 1->3 2.594237 | |
ANDS (register, ror, 32-bit) 4->2 2.001386 | |
ANDS (register, ror, 32-bit) 4->3 1.957118 | |
ANDS (register, ror, 32-bit) throughput 0.670391 | |
ANDS (register, ror, 64-bit) 1->2 2.003685 | |
ANDS (register, ror, 64-bit) 1->3 2.003848 | |
ANDS (register, ror, 64-bit) 4->2 2.000405 | |
ANDS (register, ror, 64-bit) 4->3 2.004898 | |
ANDS (register, ror, 64-bit) throughput 0.668192 | |
ASR (immediate, 32-bit) 1->2 0.998910 | |
ASR (immediate, 32-bit) throughput 0.333391 | |
ASR (immediate, 64-bit) 1->2 1.002713 | |
ASR (immediate, 64-bit) throughput 0.335044 | |
ASR (register, 32-bit) 1->2 1.001701 | |
ASR (register, 32-bit) 1->3 0.999071 | |
ASR (register, 32-bit) throughput 0.334031 | |
ASR (register, 64-bit) 1->2 0.998342 | |
ASR (register, 64-bit) 1->3 1.001741 | |
ASR (register, 64-bit) throughput 0.334268 | |
B throughput 1.027178 | |
B.cc (not taken) throughput 1.018246 | |
B.cc (taken) throughput 0.614678 | |
BFC (32-bit) 1->1 0.992769 | |
BFC (32-bit) throughput 1.203510 | |
BFC (64-bit) 1->1 0.997577 | |
BFC (64-bit) throughput 1.004566 | |
BFI (32-bit) 1->1 1.000162 | |
BFI (32-bit) 1->2 0.997620 | |
BFI (32-bit) throughput 1.019555 | |
BFI (64-bit) 1->1 0.997415 | |
BFI (64-bit) 1->2 0.840565 | |
BFI (64-bit) throughput 1.001483 | |
BFXIL (32-bit) 1->1 0.936552 | |
BFXIL (32-bit) 1->2 1.415581 | |
BFXIL (32-bit) throughput 1.050595 | |
BFXIL (64-bit) 1->1 1.007046 | |
BFXIL (64-bit) 1->2 1.004738 | |
BFXIL (64-bit) throughput 1.010355 | |
BIC (register, 32-bit) 1->2 1.001822 | |
BIC (register, 32-bit) 1->3 0.998423 | |
BIC (register, 32-bit) throughput 0.334734 | |
BIC (register, 64-bit) 1->2 1.001819 | |
BIC (register, 64-bit) 1->3 1.003927 | |
BIC (register, 64-bit) throughput 0.333488 | |
BIC (register, lsl, 32-bit) 1->2 2.002491 | |
BIC (register, lsl, 32-bit) 1->3 2.006006 | |
BIC (register, lsl, 32-bit) throughput 0.667773 | |
BIC (register, lsl, 64-bit) 1->2 2.006173 | |
BIC (register, lsl, 64-bit) 1->3 2.022625 | |
BIC (register, lsl, 64-bit) throughput 0.670981 | |
BIC (register, lsr, 32-bit) 1->2 2.004739 | |
BIC (register, lsr, 32-bit) 1->3 2.003239 | |
BIC (register, lsr, 32-bit) throughput 0.669672 | |
BIC (register, lsr, 64-bit) 1->2 2.001580 | |
BIC (register, lsr, 64-bit) 1->3 2.001564 | |
BIC (register, lsr, 64-bit) throughput 0.677387 | |
BIC (register, asr, 32-bit) 1->2 2.004899 | |
BIC (register, asr, 32-bit) 1->3 2.000203 | |
BIC (register, asr, 32-bit) throughput 0.668210 | |
BIC (register, asr, 64-bit) 1->2 2.001658 | |
BIC (register, asr, 64-bit) 1->3 2.088013 | |
BIC (register, asr, 64-bit) throughput 0.665782 | |
BIC (register, ror, 32-bit) 1->2 1.981316 | |
BIC (register, ror, 32-bit) 1->3 2.001661 | |
BIC (register, ror, 32-bit) throughput 0.669165 | |
BIC (register, ror, 64-bit) 1->2 2.004050 | |
BIC (register, ror, 64-bit) 1->3 2.001253 | |
BIC (register, ror, 64-bit) throughput 0.688740 | |
BICS (register, 32-bit) 1->2 1.046807 | |
BICS (register, 32-bit) 1->3 1.001822 | |
BICS (register, 32-bit) 4->2 0.996446 | |
BICS (register, 32-bit) 4->3 1.040572 | |
BICS (register, 32-bit) throughput 0.500860 | |
BICS (register, 64-bit) 1->2 1.000162 | |
BICS (register, 64-bit) 1->3 0.998383 | |
BICS (register, 64-bit) 4->2 1.027822 | |
BICS (register, 64-bit) 4->3 1.101789 | |
BICS (register, 64-bit) throughput 0.502201 | |
BICS (register, lsl, 32-bit) 1->2 2.006761 | |
BICS (register, lsl, 32-bit) 1->3 1.968792 | |
BICS (register, lsl, 32-bit) 4->2 1.947245 | |
BICS (register, lsl, 32-bit) 4->3 2.003887 | |
BICS (register, lsl, 32-bit) throughput 0.684048 | |
BICS (register, lsl, 64-bit) 1->2 2.002551 | |
BICS (register, lsl, 64-bit) 1->3 1.992461 | |
BICS (register, lsl, 64-bit) 4->2 2.007167 | |
BICS (register, lsl, 64-bit) 4->3 2.006277 | |
BICS (register, lsl, 64-bit) throughput 0.668038 | |
BICS (register, lsr, 32-bit) 1->2 2.003441 | |
BICS (register, lsr, 32-bit) 1->3 2.006681 | |
BICS (register, lsr, 32-bit) 4->2 2.003478 | |
BICS (register, lsr, 32-bit) 4->3 2.004004 | |
BICS (register, lsr, 32-bit) throughput 0.668066 | |
BICS (register, lsr, 64-bit) 1->2 2.007289 | |
BICS (register, lsr, 64-bit) 1->3 2.005225 | |
BICS (register, lsr, 64-bit) 4->2 1.997013 | |
BICS (register, lsr, 64-bit) 4->3 2.023081 | |
BICS (register, lsr, 64-bit) throughput 0.667725 | |
BICS (register, asr, 32-bit) 1->2 1.814106 | |
BICS (register, asr, 32-bit) 1->3 2.005629 | |
BICS (register, asr, 32-bit) 4->2 2.036902 | |
BICS (register, asr, 32-bit) 4->3 2.058713 | |
BICS (register, asr, 32-bit) throughput 0.668840 | |
BICS (register, asr, 64-bit) 1->2 2.001538 | |
BICS (register, asr, 64-bit) 1->3 2.002633 | |
BICS (register, asr, 64-bit) 4->2 1.998181 | |
BICS (register, asr, 64-bit) 4->3 2.009800 | |
BICS (register, asr, 64-bit) throughput 0.667197 | |
BICS (register, ror, 32-bit) 1->2 2.265385 | |
BICS (register, ror, 32-bit) 1->3 2.559332 | |
BICS (register, ror, 32-bit) 4->2 2.001334 | |
BICS (register, ror, 32-bit) 4->3 2.016278 | |
BICS (register, ror, 32-bit) throughput 0.666107 | |
BICS (register, ror, 64-bit) 1->2 2.032618 | |
BICS (register, ror, 64-bit) 1->3 2.427435 | |
BICS (register, ror, 64-bit) 4->2 1.868111 | |
BICS (register, ror, 64-bit) 4->3 2.006479 | |
BICS (register, ror, 64-bit) throughput 0.668118 | |
BL throughput 1.037941 | |
CBNZ (not taken) throughput 0.590057 | |
CBNZ (taken) throughput 1.047260 | |
CBZ (not taken) throughput 0.584017 | |
CBZ (taken) throughput 1.275696 | |
CCMN (immediate, 32-bit) 2->1 1.013080 | |
CCMN (immediate, 32-bit) 2->2 0.998221 | |
CCMN (immediate, 32-bit) throughput 0.785009 | |
CCMN (immediate, 32-bit) throughput 0.597447 | |
CCMN (immediate, 64-bit) 2->1 1.002628 | |
CCMN (immediate, 64-bit) 2->2 0.998383 | |
CCMN (immediate, 64-bit) throughput 0.787317 | |
CCMN (immediate, 64-bit) throughput 0.599692 | |
CCMN (register, 32-bit) 3->1 1.003482 | |
CCMN (register, 32-bit) 3->2 1.001821 | |
CCMN (register, 32-bit) 3->3 1.001620 | |
CCMN (register, 32-bit) throughput 0.956969 | |
CCMN (register, 32-bit) throughput 0.617040 | |
CCMN (register, 64-bit) 3->1 1.072848 | |
CCMN (register, 64-bit) 3->2 0.981262 | |
CCMN (register, 64-bit) 3->3 1.000000 | |
CCMN (register, 64-bit) throughput 0.721782 | |
CCMN (register, 64-bit) throughput 0.597850 | |
CCMP (immediate, 32-bit) 2->1 0.991903 | |
CCMP (immediate, 32-bit) 2->2 1.183874 | |
CCMP (immediate, 32-bit) throughput 0.791766 | |
CCMP (immediate, 32-bit) throughput 0.599388 | |
CCMP (immediate, 64-bit) 2->1 1.002789 | |
CCMP (immediate, 64-bit) 2->2 0.998343 | |
CCMP (immediate, 64-bit) throughput 0.854149 | |
CCMP (immediate, 64-bit) throughput 0.557530 | |
CCMP (register, 32-bit) 3->1 0.948494 | |
CCMP (register, 32-bit) 3->2 1.033404 | |
CCMP (register, 32-bit) 3->3 0.995646 | |
CCMP (register, 32-bit) throughput 0.786003 | |
CCMP (register, 32-bit) throughput 0.598332 | |
CCMP (register, 64-bit) 3->1 1.003801 | |
CCMP (register, 64-bit) 3->2 1.002715 | |
CCMP (register, 64-bit) 3->3 1.002186 | |
CCMP (register, 64-bit) throughput 0.785822 | |
CCMP (register, 64-bit) throughput 0.599190 | |
CINC (32-bit) 1->2 1.001620 | |
CINC (32-bit) 1->3 0.999919 | |
CINC (32-bit) throughput 0.334717 | |
CINC (64-bit) 1->2 0.998628 | |
CINC (64-bit) 1->3 1.005061 | |
CINC (64-bit) throughput 0.334305 | |
CINV (32-bit) 1->2 1.002430 | |
CINV (32-bit) 1->3 1.002708 | |
CINV (32-bit) throughput 0.333917 | |
CINV (64-bit) 1->2 1.001618 | |
CINV (64-bit) 1->3 0.999757 | |
CINV (64-bit) throughput 0.334271 | |
CLREX throughput 5.042392 | |
CLREX throughput 5.040897 | |
CLS (32-bit) 1->2 0.998384 | |
CLS (32-bit) throughput 0.334799 | |
CLS (64-bit) 1->2 1.052502 | |
CLS (64-bit) throughput 0.369045 | |
CLZ (32-bit) 1->2 0.997415 | |
CLZ (32-bit) throughput 0.333446 | |
CLZ (64-bit) 1->2 0.999072 | |
CLZ (64-bit) throughput 0.334386 | |
CMN (sxtb, 32-bit) 3->1 2.001008 | |
CMN (sxtb, 32-bit) 3->2 2.066975 | |
CMN (sxtb, 32-bit) throughput 0.695930 | |
CMN (sxtb, 64-bit) 3->1 2.007411 | |
CMN (sxtb, 64-bit) 3->2 1.896898 | |
CMN (sxtb, 64-bit) throughput 0.673156 | |
CMN (uxtb, 32-bit) 3->1 1.998749 | |
CMN (uxtb, 32-bit) 3->2 2.073315 | |
CMN (uxtb, 32-bit) throughput 0.696378 | |
CMN (uxtb, 64-bit) 3->1 2.011985 | |
CMN (uxtb, 64-bit) 3->2 2.007530 | |
CMN (uxtb, 64-bit) throughput 0.668089 | |
CMN (sxth, 32-bit) 3->1 1.998423 | |
CMN (sxth, 32-bit) 3->2 2.000687 | |
CMN (sxth, 32-bit) throughput 0.730747 | |
CMN (sxth, 64-bit) 3->1 2.002425 | |
CMN (sxth, 64-bit) 3->2 2.025975 | |
CMN (sxth, 64-bit) throughput 0.671030 | |
CMN (uxth, 32-bit) 3->1 1.997376 | |
CMN (uxth, 32-bit) 3->2 2.003928 | |
CMN (uxth, 32-bit) throughput 0.674851 | |
CMN (uxth, 64-bit) 3->1 1.998869 | |
CMN (uxth, 64-bit) 3->2 2.005540 | |
CMN (uxth, 64-bit) throughput 0.671287 | |
CMN (sxtw, 32-bit) 3->1 1.002589 | |
CMN (sxtw, 32-bit) 3->2 1.006277 | |
CMN (sxtw, 32-bit) throughput 0.362176 | |
CMN (sxtw, 64-bit) 3->1 2.003727 | |
CMN (sxtw, 64-bit) 3->2 2.009111 | |
CMN (sxtw, 64-bit) throughput 0.667497 | |
CMN (uxtw, 32-bit) 3->1 0.998385 | |
CMN (uxtw, 32-bit) 3->2 1.005143 | |
CMN (uxtw, 32-bit) throughput 0.362898 | |
CMN (uxtw, 64-bit) 3->1 2.004010 | |
CMN (uxtw, 64-bit) 3->2 2.010285 | |
CMN (uxtw, 64-bit) throughput 0.720575 | |
CMN (uxtx, 64-bit) 3->1 1.020367 | |
CMN (uxtx, 64-bit) 3->2 1.226029 | |
CMN (uxtx, 64-bit) throughput 0.371948 | |
CMN (sxtx, 64-bit) 3->1 0.998667 | |
CMN (sxtx, 64-bit) 3->2 0.994804 | |
CMN (sxtx, 64-bit) throughput 0.362751 | |
CMN (immediate, 32-bit) 2->1 1.078844 | |
CMN (immediate, 32-bit) throughput 0.368025 | |
CMN (immediate, 64-bit) 2->1 1.005965 | |
CMN (immediate, 64-bit) throughput 0.363733 | |
CMN (shifted immediate, 32-bit) 2->1 0.957338 | |
CMN (shifted immediate, 32-bit) throughput 0.362752 | |
CMN (shifted immediate, 64-bit) 2->1 0.996808 | |
CMN (shifted immediate, 64-bit) throughput 0.362332 | |
CMN (register, 32-bit) 3->1 0.996688 | |
CMN (register, 32-bit) 3->2 1.002593 | |
CMN (register, 32-bit) throughput 0.364062 | |
CMN (register, 64-bit) 3->1 1.003563 | |
CMN (register, 64-bit) 3->2 1.009620 | |
CMN (register, 64-bit) throughput 0.362005 | |
CMN (register, lsl, 32-bit) 3->1 2.004697 | |
CMN (register, lsl, 32-bit) 3->2 2.004861 | |
CMN (register, lsl, 32-bit) throughput 0.667960 | |
CMN (register, lsl, 64-bit) 3->1 2.000848 | |
CMN (register, lsl, 64-bit) 3->2 2.004277 | |
CMN (register, lsl, 64-bit) throughput 0.668306 | |
CMN (register, lsr, 32-bit) 3->1 2.005134 | |
CMN (register, lsr, 32-bit) 3->2 2.001980 | |
CMN (register, lsr, 32-bit) throughput 0.666564 | |
CMN (register, lsr, 64-bit) 3->1 1.995033 | |
CMN (register, lsr, 64-bit) 3->2 2.004931 | |
CMN (register, lsr, 64-bit) throughput 0.663833 | |
CMN (register, asr, 32-bit) 3->1 2.002425 | |
CMN (register, asr, 32-bit) 3->2 1.996159 | |
CMN (register, asr, 32-bit) throughput 0.678795 | |
CMN (register, asr, 64-bit) 3->1 2.014695 | |
CMN (register, asr, 64-bit) 3->2 1.932653 | |
CMN (register, asr, 64-bit) throughput 0.725067 | |
CMP (sxtb, 32-bit) 3->1 1.782066 | |
CMP (sxtb, 32-bit) 3->2 1.934255 | |
CMP (sxtb, 32-bit) throughput 0.677646 | |
CMP (sxtb, 64-bit) 3->1 1.997577 | |
CMP (sxtb, 64-bit) 3->2 2.004009 | |
CMP (sxtb, 64-bit) throughput 0.667853 | |
CMP (uxtb, 32-bit) 3->1 2.129626 | |
CMP (uxtb, 32-bit) 3->2 1.804826 | |
CMP (uxtb, 32-bit) throughput 0.669218 | |
CMP (uxtb, 64-bit) 3->1 2.003684 | |
CMP (uxtb, 64-bit) 3->2 2.041748 | |
CMP (uxtb, 64-bit) throughput 0.667110 | |
CMP (sxth, 32-bit) 3->1 2.017980 | |
CMP (sxth, 32-bit) 3->2 1.997291 | |
CMP (sxth, 32-bit) throughput 0.669837 | |
CMP (sxth, 64-bit) 3->1 2.002998 | |
CMP (sxth, 64-bit) 3->2 1.993540 | |
CMP (sxth, 64-bit) throughput 0.668360 | |
CMP (uxth, 32-bit) 3->1 1.999111 | |
CMP (uxth, 32-bit) 3->2 2.004253 | |
CMP (uxth, 32-bit) throughput 0.666936 | |
CMP (uxth, 64-bit) 3->1 2.006924 | |
CMP (uxth, 64-bit) 3->2 2.009259 | |
CMP (uxth, 64-bit) throughput 0.667296 | |
CMP (sxtw, 32-bit) 3->1 1.001539 | |
CMP (sxtw, 32-bit) 3->2 0.998302 | |
CMP (sxtw, 32-bit) throughput 0.363393 | |
CMP (sxtw, 64-bit) 3->1 2.019436 | |
CMP (sxtw, 64-bit) 3->2 2.272226 | |
CMP (sxtw, 64-bit) throughput 0.667147 | |
CMP (uxtw, 32-bit) 3->1 1.213414 | |
CMP (uxtw, 32-bit) 3->2 1.007384 | |
CMP (uxtw, 32-bit) throughput 0.363343 | |
CMP (uxtw, 64-bit) 3->1 2.000568 | |
CMP (uxtw, 64-bit) 3->2 1.996850 | |
CMP (uxtw, 64-bit) throughput 0.694523 | |
CMP (uxtx, 64-bit) 3->1 1.092283 | |
CMP (uxtx, 64-bit) 3->2 0.997899 | |
CMP (uxtx, 64-bit) throughput 0.363333 | |
CMP (sxtx, 64-bit) 3->1 1.002710 | |
CMP (sxtx, 64-bit) 3->2 1.002991 | |
CMP (sxtx, 64-bit) throughput 0.366922 | |
CMP (immediate, 32-bit) 2->1 1.002389 | |
CMP (immediate, 32-bit) throughput 0.330037 | |
CMP (immediate, 64-bit) 2->1 1.006937 | |
CMP (immediate, 64-bit) throughput 0.381874 | |
CMP (shifted immediate, 32-bit) 2->1 1.034162 | |
CMP (shifted immediate, 32-bit) throughput 0.363702 | |
CMP (shifted immediate, 64-bit) 2->1 1.003887 | |
CMP (shifted immediate, 64-bit) throughput 0.366069 | |
CMP (register, 32-bit) 3->1 0.994874 | |
CMP (register, 32-bit) 3->2 1.001418 | |
CMP (register, 32-bit) throughput 0.368467 | |
CMP (register, 64-bit) 3->1 1.005102 | |
CMP (register, 64-bit) 3->2 1.023646 | |
CMP (register, 64-bit) throughput 0.363029 | |
CMP (register, lsl, 32-bit) 3->1 2.003926 | |
CMP (register, lsl, 32-bit) 3->2 1.997943 | |
CMP (register, lsl, 32-bit) throughput 0.674608 | |
CMP (register, lsl, 64-bit) 3->1 2.007573 | |
CMP (register, lsl, 64-bit) 3->2 2.000806 | |
CMP (register, lsl, 64-bit) throughput 0.667873 | |
CMP (register, lsr, 32-bit) 3->1 2.003768 | |
CMP (register, lsr, 32-bit) 3->2 1.993827 | |
CMP (register, lsr, 32-bit) throughput 0.668364 | |
CMP (register, lsr, 64-bit) 3->1 2.003199 | |
CMP (register, lsr, 64-bit) 3->2 2.007922 | |
CMP (register, lsr, 64-bit) throughput 0.667000 | |
CMP (register, asr, 32-bit) 3->1 2.004089 | |
CMP (register, asr, 32-bit) 3->2 2.000363 | |
CMP (register, asr, 32-bit) throughput 0.667257 | |
CMP (register, asr, 64-bit) 3->1 2.000605 | |
CMP (register, asr, 64-bit) 3->2 2.007289 | |
CMP (register, asr, 64-bit) throughput 0.687485 | |
CNEG (32-bit) 1->2 0.998181 | |
CNEG (32-bit) 1->3 1.001661 | |
CNEG (32-bit) throughput 0.335098 | |
CNEG (64-bit) 1->2 1.002592 | |
CNEG (64-bit) 1->3 1.004374 | |
CNEG (64-bit) throughput 0.334435 | |
CSDB throughput 0.254710 | |
CSEL (32-bit) 1->2 0.998504 | |
CSEL (32-bit) 1->3 0.978022 | |
CSEL (32-bit) 1->4 1.005963 | |
CSEL (32-bit) throughput 0.337953 | |
CSEL (64-bit) 1->2 0.998707 | |
CSEL (64-bit) 1->3 1.008017 | |
CSEL (64-bit) 1->4 1.001661 | |
CSEL (64-bit) throughput 0.334278 | |
CSET (32-bit) 1->2 1.005427 | |
CSET (32-bit) throughput 0.334071 | |
CSET (64-bit) 1->2 1.002951 | |
CSET (64-bit) throughput 0.333640 | |
CSETM (32-bit) 1->2 0.999030 | |
CSETM (32-bit) throughput 0.334416 | |
CSETM (64-bit) 1->2 1.002835 | |
CSETM (64-bit) throughput 0.333673 | |
CSINC (32-bit) 1->2 1.004292 | |
CSINC (32-bit) 1->3 1.002390 | |
CSINC (32-bit) 1->4 0.998019 | |
CSINC (32-bit) throughput 0.334250 | |
CSINC (64-bit) 1->2 1.003239 | |
CSINC (64-bit) 1->3 0.998262 | |
CSINC (64-bit) 1->4 1.003279 | |
CSINC (64-bit) throughput 0.334273 | |
CSINV (32-bit) 1->2 0.996369 | |
CSINV (32-bit) 1->3 1.000040 | |
CSINV (32-bit) 1->4 1.002430 | |
CSINV (32-bit) throughput 0.334417 | |
CSINV (64-bit) 1->2 1.001781 | |
CSINV (64-bit) 1->3 1.001660 | |
CSINV (64-bit) 1->4 1.003720 | |
CSINV (64-bit) throughput 0.334479 | |
CSNEG (32-bit) 1->2 0.998302 | |
CSNEG (32-bit) 1->3 1.042200 | |
CSNEG (32-bit) 1->4 0.998787 | |
CSNEG (32-bit) throughput 0.333498 | |
CSNEG (64-bit) 1->2 0.941456 | |
CSNEG (64-bit) 1->3 1.001862 | |
CSNEG (64-bit) 1->4 0.999636 | |
CSNEG (64-bit) throughput 0.333817 | |
DMB (SY) throughput 4.006277 | |
DMB (ST) throughput 3.994141 | |
DMB (LD) throughput 3.738653 | |
DMB (ISH) throughput 4.100051 | |
DMB (ISHST) throughput 4.022104 | |
DMB (ISHLD) throughput 3.998259 | |
DMB (NSH) throughput 3.995758 | |
DMB (NSHST) throughput 4.476901 | |
DMB (NSHLD) throughput 3.992562 | |
DMB (OSH) throughput 4.060819 | |
DMB (OSHST) throughput 4.039963 | |
DMB (OSHLD) throughput 4.036362 | |
DSB (SY) throughput 16.687661 | |
DSB (ST) throughput 16.149862 | |
DSB (LD) throughput 16.039126 | |
DSB (ISH) throughput 16.123987 | |
DSB (ISHST) throughput 16.055434 | |
DSB (ISHLD) throughput 16.166586 | |
DSB (NSH) throughput 16.124712 | |
DSB (NSHST) throughput 16.026861 | |
DSB (NSHLD) throughput 16.112095 | |
DSB (OSH) throughput 16.772379 | |
DSB (OSHST) throughput 16.135574 | |
DSB (OSHLD) throughput 16.123735 | |
EON (register, 32-bit) 1->2 1.001741 | |
EON (register, 32-bit) 1->3 1.000444 | |
EON (register, 32-bit) throughput 0.334200 | |
EON (register, 64-bit) 1->2 0.998343 | |
EON (register, 64-bit) 1->3 1.001579 | |
EON (register, 64-bit) throughput 0.334223 | |
EON (register, lsl, 32-bit) 1->2 2.001617 | |
EON (register, lsl, 32-bit) 1->3 2.003604 | |
EON (register, lsl, 32-bit) throughput 0.670625 | |
EON (register, lsl, 64-bit) 1->2 1.999152 | |
EON (register, lsl, 64-bit) 1->3 1.973580 | |
EON (register, lsl, 64-bit) throughput 0.668622 | |
EON (register, lsr, 32-bit) 1->2 2.018586 | |
EON (register, lsr, 32-bit) 1->3 2.002430 | |
EON (register, lsr, 32-bit) throughput 0.668155 | |
EON (register, lsr, 64-bit) 1->2 1.817771 | |
EON (register, lsr, 64-bit) 1->3 2.005466 | |
EON (register, lsr, 64-bit) throughput 0.704457 | |
EON (register, asr, 32-bit) 1->2 1.996405 | |
EON (register, asr, 32-bit) 1->3 2.002948 | |
EON (register, asr, 32-bit) throughput 0.667892 | |
EON (register, asr, 64-bit) 1->2 2.005872 | |
EON (register, asr, 64-bit) 1->3 2.007882 | |
EON (register, asr, 64-bit) throughput 0.668209 | |
EON (register, ror, 32-bit) 1->2 2.003401 | |
EON (register, ror, 32-bit) 1->3 2.003760 | |
EON (register, ror, 32-bit) throughput 0.667914 | |
EON (register, ror, 64-bit) 1->2 2.729193 | |
EON (register, ror, 64-bit) 1->3 2.024914 | |
EON (register, ror, 64-bit) throughput 0.673839 | |
EOR (immediate, 32-bit) 1->2 1.002672 | |
EOR (immediate, 32-bit) throughput 0.333061 | |
EOR (immediate, 64-bit) 1->2 0.999233 | |
EOR (immediate, 64-bit) throughput 0.336303 | |
EOR (register, 32-bit) 1->2 0.998065 | |
EOR (register, 32-bit) 1->3 1.001620 | |
EOR (register, 32-bit) throughput 0.334175 | |
EOR (register, 64-bit) 1->2 1.001701 | |
EOR (register, 64-bit) 1->3 1.003604 | |
EOR (register, 64-bit) throughput 0.332755 | |
EOR (register, lsl, 32-bit) 1->2 1.720575 | |
EOR (register, lsl, 32-bit) 1->3 2.000945 | |
EOR (register, lsl, 32-bit) throughput 0.668329 | |
EOR (register, lsl, 64-bit) 1->2 2.018316 | |
EOR (register, lsl, 64-bit) 1->3 2.004492 | |
EOR (register, lsl, 64-bit) throughput 0.671717 | |
EOR (register, lsr, 32-bit) 1->2 2.001658 | |
EOR (register, lsr, 32-bit) 1->3 2.006235 | |
EOR (register, lsr, 32-bit) throughput 0.668018 | |
EOR (register, lsr, 64-bit) 1->2 2.004738 | |
EOR (register, lsr, 64-bit) 1->3 2.001376 | |
EOR (register, lsr, 64-bit) throughput 0.667064 | |
EOR (register, asr, 32-bit) 1->2 1.998950 | |
EOR (register, asr, 32-bit) 1->3 1.999193 | |
EOR (register, asr, 32-bit) throughput 0.668187 | |
EOR (register, asr, 64-bit) 1->2 1.998463 | |
EOR (register, asr, 64-bit) 1->3 1.998800 | |
EOR (register, asr, 64-bit) throughput 0.666610 | |
EOR (register, ror, 32-bit) 1->2 2.009069 | |
EOR (register, ror, 32-bit) 1->3 1.998788 | |
EOR (register, ror, 32-bit) throughput 0.668621 | |
EOR (register, ror, 64-bit) 1->2 1.993436 | |
EOR (register, ror, 64-bit) 1->3 2.002350 | |
EOR (register, ror, 64-bit) throughput 0.859259 | |
EXTR (register, 32-bit) 1->2 1.004043 | |
EXTR (register, 32-bit) 1->3 2.004244 | |
EXTR (register, 32-bit) throughput 1.007592 | |
EXTR (register, 64-bit) 1->2 0.998423 | |
EXTR (register, 64-bit) 1->3 2.002425 | |
EXTR (register, 64-bit) throughput 1.073627 | |
ISB (SY) throughput 26.152276 | |
LDNP (32-bit) 1->3 (with chain penalty) 4.011216 | |
LDNP (32-bit) 2->3 (with chain penalty) 4.067778 | |
LDNP (32-bit) throughput 0.500773 | |
LDNP (64-bit) 1->3 (with chain penalty) 4.013928 | |
LDNP (64-bit) 2->3 (with chain penalty) 4.029279 | |
LDNP (64-bit) throughput 0.501316 | |
LDP (32-bit) 1->3 (with chain penalty) 3.962296 | |
LDP (32-bit) 2->3 (with chain penalty) 4.993629 | |
LDP (32-bit) throughput 0.422988 | |
LDP (64-bit) 1->3 (with chain penalty) 4.818747 | |
LDP (64-bit) 2->3 (with chain penalty) 5.571466 | |
LDP (64-bit) throughput 0.606076 | |
LDP (post-index, 32-bit) 1->3 (with chain penalty) 4.834397 | |
LDP (post-index, 32-bit) 2->3 (with chain penalty) 4.158612 | |
LDP (post-index, 32-bit) throughput 0.833657 | |
LDP (post-index, 64-bit) 1->3 (with chain penalty) 27.169619 | |
LDP (post-index, 64-bit) 2->3 (with chain penalty) 20.001005 | |
LDP (post-index, 64-bit) throughput 0.854760 | |
LDP (pre-index, 32-bit) 1->3 (with chain penalty) 7.875741 | |
LDP (pre-index, 32-bit) 2->3 (with chain penalty) 5.425403 | |
LDP (pre-index, 32-bit) throughput 0.801130 | |
LDP (pre-index, 64-bit) 1->3 (with chain penalty) 29.096643 | |
LDP (pre-index, 64-bit) 2->3 (with chain penalty) 20.744503 | |
LDP (pre-index, 64-bit) throughput 0.829911 | |
LDP (signed offset, 32-bit) 1->3 (with chain penalty) 4.479390 | |
LDP (signed offset, 32-bit) 2->3 (with chain penalty) 4.421192 | |
LDP (signed offset, 32-bit) throughput 0.553061 | |
LDP (signed offset, 64-bit) 1->3 (with chain penalty) 4.445606 | |
LDP (signed offset, 64-bit) 2->3 (with chain penalty) 4.520398 | |
LDP (signed offset, 64-bit) throughput 0.614190 | |
LDPSW (post-index) 1->3 (with chain penalty) 4.531448 | |
LDPSW (post-index) 2->3 (with chain penalty) 2.807169 | |
LDPSW (post-index) throughput 0.886689 | |
LDPSW (pre-index) 1->3 (with chain penalty) 1.880322 | |
LDPSW (pre-index) 2->3 (with chain penalty) 4.373434 | |
LDPSW (pre-index) throughput 0.829710 | |
LDPSW (signed offset) 1->3 (with chain penalty) 4.470898 | |
LDPSW (signed offset) 2->3 (with chain penalty) 3.879052 | |
LDPSW (signed offset) throughput 0.503425 | |
LDR (32-bit) 1->2 (with chain penalty) 3.971645 | |
LDR (32-bit) throughput 0.512350 | |
LDR (64-bit) 1->2 (with chain penalty) 7.773384 | |
LDR (64-bit) throughput 0.485995 | |
LDR (post-index, 32-bit) 1->2 (with chain penalty) 4.752876 | |
LDR (post-index, 32-bit) throughput 0.413392 | |
LDR (post-index, 64-bit) 1->2 (with chain penalty) 4.852585 | |
LDR (post-index, 64-bit) throughput 0.652391 | |
LDR (pre-index, 32-bit) 1->2 (with chain penalty) 3.134445 | |
LDR (pre-index, 32-bit) throughput 0.613696 | |
LDR (pre-index, 64-bit) 1->2 (with chain penalty) 3.927234 | |
LDR (pre-index, 64-bit) throughput 0.501048 | |
LDR (unsigned offset, 32-bit) 1->2 (with chain penalty) 4.297109 | |
LDR (unsigned offset, 32-bit) throughput 0.513772 | |
LDR (unsigned offset, 64-bit) 1->2 (with chain penalty) 4.029887 | |
LDR (unsigned offset, 64-bit) throughput 0.503639 | |
LDR (literal, 32-bit) throughput 0.501545 | |
LDR (literal, 64-bit) throughput 0.541524 | |
LDR (register, 32-bit) 1->2 (with chain penalty) 2.986009 | |
LDR (register, 32-bit) 1->3 (with chain penalty) 1.779754 | |
LDR (register, 32-bit) throughput 0.802882 | |
LDR (register, 64-bit) 1->2 (with chain penalty) 3.205355 | |
LDR (register, 64-bit) 1->3 (with chain penalty) 4.677191 | |
LDR (register, 64-bit) throughput 0.532140 | |
LDR (register, uxtw, 32-bit) 1->2 (with chain penalty) 4.019748 | |
LDR (register, uxtw, 32-bit) 1->3 (with chain penalty) 5.475215 | |
LDR (register, uxtw, 32-bit) throughput 0.472974 | |
LDR (register, uxtw, 64-bit) 1->2 (with chain penalty) 3.816836 | |
LDR (register, uxtw, 64-bit) 1->3 (with chain penalty) 2.874984 | |
LDR (register, uxtw, 64-bit) throughput 0.547399 | |
LDR (register, sxtw, 32-bit) 1->2 (with chain penalty) 2.974502 | |
LDR (register, sxtw, 32-bit) 1->3 (with chain penalty) 4.922296 | |
LDR (register, sxtw, 32-bit) throughput 0.610794 | |
LDR (register, sxtw, 64-bit) 1->2 (with chain penalty) 11.515596 | |
LDR (register, sxtw, 64-bit) 1->3 (with chain penalty) 4.959931 | |
LDR (register, sxtw, 64-bit) throughput 0.518055 | |
LDR (register, lsl, 32-bit) 1->2 (with chain penalty) 4.280039 | |
LDR (register, lsl, 32-bit) 1->3 (with chain penalty) 5.023660 | |
LDR (register, lsl, 32-bit) throughput 0.510059 | |
LDR (register, lsl, 64-bit) 1->2 (with chain penalty) 4.748237 | |
LDR (register, lsl, 64-bit) 1->3 (with chain penalty) 5.069458 | |
LDR (register, lsl, 64-bit) throughput 0.486183 | |
LDRB 1->2 (with chain penalty) 4.388431 | |
LDRB throughput 0.502891 | |
LDRB (post-index) 1->2 (with chain penalty) 5.046648 | |
LDRB (post-index) throughput 0.536790 | |
LDRB (pre-index) 1->2 (with chain penalty) 5.823114 | |
LDRB (pre-index) throughput 0.557724 | |
LDRB (unsigned offset) 1->2 (with chain penalty) 5.454965 | |
LDRB (unsigned offset) throughput 0.535845 | |
LDRB (register) 1->2 (with chain penalty) 8.835895 | |
LDRB (register) 1->3 (with chain penalty) 5.253677 | |
LDRB (register) throughput 0.501086 | |
LDRB (register, uxtw) 1->2 (with chain penalty) 4.131632 | |
LDRB (register, uxtw) 1->3 (with chain penalty) 3.917433 | |
LDRB (register, uxtw) throughput 0.476065 | |
LDRB (register, sxtw) 1->2 (with chain penalty) 4.304624 | |
LDRB (register, sxtw) 1->3 (with chain penalty) 5.237230 | |
LDRB (register, sxtw) throughput 0.504031 | |
LDRH 1->2 (with chain penalty) 4.374333 | |
LDRH throughput 0.604542 | |
LDRH (post-index) 1->2 (with chain penalty) 6.336118 | |
LDRH (post-index) throughput 0.384406 | |
LDRH (pre-index) 1->2 (with chain penalty) 5.237825 | |
LDRH (pre-index) throughput 0.546486 | |
LDRH (unsigned offset) 1->2 (with chain penalty) 5.005581 | |
LDRH (unsigned offset) throughput 0.560254 | |
LDRH (register) 1->2 (with chain penalty) 5.544064 | |
LDRH (register) 1->3 (with chain penalty) 5.557906 | |
LDRH (register) throughput 0.505116 | |
LDRH (register, uxtw) 1->2 (with chain penalty) 4.830699 | |
LDRH (register, uxtw) 1->3 (with chain penalty) 4.132452 | |
LDRH (register, uxtw) throughput 0.500236 | |
LDRH (register, sxtw) 1->2 (with chain penalty) 5.597845 | |
LDRH (register, sxtw) 1->3 (with chain penalty) 4.202584 | |
LDRH (register, sxtw) throughput 0.505879 | |
LDRH (register, lsl) 1->2 (with chain penalty) 4.340692 | |
LDRH (register, lsl) 1->3 (with chain penalty) 4.054429 | |
LDRH (register, lsl) throughput 0.500633 | |
LDRSB (32-bit) 1->2 (with chain penalty) 4.121659 | |
LDRSB (32-bit) throughput 0.523786 | |
LDRSB (64-bit) 1->2 (with chain penalty) 4.143839 | |
LDRSB (64-bit) throughput 0.528274 | |
LDRSB (post-index, 32-bit) 1->2 (with chain penalty) 7.419567 | |
LDRSB (post-index, 32-bit) throughput 0.563418 | |
LDRSB (post-index, 64-bit) 1->2 (with chain penalty) 5.373216 | |
LDRSB (post-index, 64-bit) throughput 0.595227 | |
LDRSB (pre-index, 32-bit) 1->2 (with chain penalty) 4.227090 | |
LDRSB (pre-index, 32-bit) throughput 0.606379 | |
LDRSB (pre-index, 64-bit) 1->2 (with chain penalty) 2.787223 | |
LDRSB (pre-index, 64-bit) throughput 0.641231 | |
LDRSB (unsigned offset, 32-bit) 1->2 (with chain penalty) 4.518528 | |
LDRSB (unsigned offset, 32-bit) throughput 0.583465 | |
LDRSB (unsigned offset, 64-bit) 1->2 (with chain penalty) 5.918985 | |
LDRSB (unsigned offset, 64-bit) throughput 0.527817 | |
LDRSB (register, 32-bit) 1->2 (with chain penalty) 4.073166 | |
LDRSB (register, 32-bit) 1->3 (with chain penalty) 6.104889 | |
LDRSB (register, 32-bit) throughput 0.600752 | |
LDRSB (register, 64-bit) 1->2 (with chain penalty) 4.183647 | |
LDRSB (register, 64-bit) 1->3 (with chain penalty) 4.353711 | |
LDRSB (register, 64-bit) throughput 0.529125 | |
LDRSB (register, uxtw, 32-bit) 1->2 (with chain penalty) 4.133659 | |
LDRSB (register, uxtw, 32-bit) 1->3 (with chain penalty) 4.051926 | |
LDRSB (register, uxtw, 32-bit) throughput 0.511480 | |
LDRSB (register, uxtw, 64-bit) 1->2 (with chain penalty) 4.035595 | |
LDRSB (register, uxtw, 64-bit) 1->3 (with chain penalty) 4.073428 | |
LDRSB (register, uxtw, 64-bit) throughput 0.503281 | |
LDRSB (register, sxtw, 32-bit) 1->2 (with chain penalty) 4.043933 | |
LDRSB (register, sxtw, 32-bit) 1->3 (with chain penalty) 4.355496 | |
LDRSB (register, sxtw, 32-bit) throughput 0.513134 | |
LDRSB (register, sxtw, 64-bit) 1->2 (with chain penalty) 4.343835 | |
LDRSB (register, sxtw, 64-bit) 1->3 (with chain penalty) 4.168091 | |
LDRSB (register, sxtw, 64-bit) throughput 0.520422 | |
LDRSH (32-bit) 1->2 (with chain penalty) 4.394305 | |
LDRSH (32-bit) throughput 0.512486 | |
LDRSH (64-bit) 1->2 (with chain penalty) 4.028591 | |
LDRSH (64-bit) throughput 0.500825 | |
LDRSH (post-index, 32-bit) 1->2 (with chain penalty) 4.001970 | |
LDRSH (post-index, 32-bit) throughput 0.560562 | |
LDRSH (post-index, 64-bit) 1->2 (with chain penalty) 4.232196 | |
LDRSH (post-index, 64-bit) throughput 0.642203 | |
LDRSH (pre-index, 32-bit) 1->2 (with chain penalty) 3.929703 | |
LDRSH (pre-index, 32-bit) throughput 0.387418 | |
LDRSH (pre-index, 64-bit) 1->2 (with chain penalty) 4.364699 | |
LDRSH (pre-index, 64-bit) throughput 0.543606 | |
LDRSH (unsigned offset, 32-bit) 1->2 (with chain penalty) 5.008532 | |
LDRSH (unsigned offset, 32-bit) throughput 0.627982 | |
LDRSH (unsigned offset, 64-bit) 1->2 (with chain penalty) 4.635835 | |
LDRSH (unsigned offset, 64-bit) throughput 0.501159 | |
LDRSH (register, 32-bit) 1->2 (with chain penalty) 2.920338 | |
LDRSH (register, 32-bit) 1->3 (with chain penalty) 4.382840 | |
LDRSH (register, 32-bit) throughput 0.485965 | |
LDRSH (register, 64-bit) 1->2 (with chain penalty) 4.343374 | |
LDRSH (register, 64-bit) 1->3 (with chain penalty) 4.378748 | |
LDRSH (register, 64-bit) throughput 0.556687 | |
LDRSH (register, uxtw, 32-bit) 1->2 (with chain penalty) 4.567619 | |
LDRSH (register, uxtw, 32-bit) 1->3 (with chain penalty) 4.846350 | |
LDRSH (register, uxtw, 32-bit) throughput 0.499067 | |
LDRSH (register, uxtw, 64-bit) 1->2 (with chain penalty) 4.752022 | |
LDRSH (register, uxtw, 64-bit) 1->3 (with chain penalty) 4.173441 | |
LDRSH (register, uxtw, 64-bit) throughput 0.381087 | |
LDRSH (register, sxtw, 32-bit) 1->2 (with chain penalty) 4.445727 | |
LDRSH (register, sxtw, 32-bit) 1->3 (with chain penalty) 4.475037 | |
LDRSH (register, sxtw, 32-bit) throughput 0.534104 | |
LDRSH (register, sxtw, 64-bit) 1->2 (with chain penalty) 5.571131 | |
LDRSH (register, sxtw, 64-bit) 1->3 (with chain penalty) 4.402152 | |
LDRSH (register, sxtw, 64-bit) throughput 0.515916 | |
LDRSH (register, lsl, 32-bit) 1->2 (with chain penalty) 4.381762 | |
LDRSH (register, lsl, 32-bit) 1->3 (with chain penalty) 4.556984 | |
LDRSH (register, lsl, 32-bit) throughput 0.539430 | |
LDRSH (register, lsl, 64-bit) 1->2 (with chain penalty) 4.290519 | |
LDRSH (register, lsl, 64-bit) 1->3 (with chain penalty) 4.291225 | |
LDRSH (register, lsl, 64-bit) throughput 0.572051 | |
LDRSW 1->2 (with chain penalty) 3.624541 | |
LDRSW throughput 0.664266 | |
LDRSW (post-index) 1->2 (with chain penalty) 4.458256 | |
LDRSW (post-index) throughput 0.572662 | |
LDRSW (pre-index) 1->2 (with chain penalty) 4.321967 | |
LDRSW (pre-index) throughput 0.500068 | |
LDRSW (unsigned offset) 1->2 (with chain penalty) 4.686736 | |
LDRSW (unsigned offset) throughput 0.312305 | |
LDRSW (literal) throughput 0.533538 | |
LDRSW (register) 1->2 (with chain penalty) 4.086919 | |
LDRSW (register) 1->3 (with chain penalty) 4.538716 | |
LDRSW (register) throughput 0.576869 | |
LDRSW (register, uxtw) 1->2 (with chain penalty) 4.522640 | |
LDRSW (register, uxtw) 1->3 (with chain penalty) 4.548308 | |
LDRSW (register, uxtw) throughput 0.575413 | |
LDRSW (register, sxtw) 1->2 (with chain penalty) 4.219498 | |
LDRSW (register, sxtw) 1->3 (with chain penalty) 4.346348 | |
LDRSW (register, sxtw) throughput 0.541591 | |
LDRSW (register, lsl) 1->2 (with chain penalty) 4.630674 | |
LDRSW (register, lsl) 1->3 (with chain penalty) 4.193037 | |
LDRSW (register, lsl) throughput 0.425122 | |
LDUR (32-bit) 1->2 (with chain penalty) 4.321447 | |
LDUR (32-bit) throughput 0.488910 | |
LDUR (64-bit) 1->2 (with chain penalty) 4.434077 | |
LDUR (64-bit) throughput 0.655802 | |
LDURB 1->2 (with chain penalty) 4.712086 | |
LDURB throughput 0.520942 | |
LDURH 1->2 (with chain penalty) 4.392170 | |
LDURH throughput 0.481354 | |
LDURSB (32-bit) 1->2 (with chain penalty) 4.473807 | |
LDURSB (32-bit) throughput 0.571954 | |
LDURSB (64-bit) 1->2 (with chain penalty) 4.490645 | |
LDURSB (64-bit) throughput 0.546561 | |
LDURSH (32-bit) 1->2 (with chain penalty) 5.691255 | |
LDURSH (32-bit) throughput 0.513398 | |
LDURSH (64-bit) 1->2 (with chain penalty) 4.418924 | |
LDURSH (64-bit) throughput 0.591257 | |
LDURSW 1->2 (with chain penalty) 4.959228 | |
LDURSW throughput 0.503837 | |
LSL (immediate, 32-bit) 1->2 1.110558 | |
LSL (immediate, 32-bit) throughput 0.344373 | |
LSL (immediate, 64-bit) 1->2 1.001450 | |
LSL (immediate, 64-bit) throughput 0.339696 | |
LSL (register, 32-bit) 1->2 0.943788 | |
LSL (register, 32-bit) 1->3 1.011176 | |
LSL (register, 32-bit) throughput 0.358331 | |
LSL (register, 64-bit) 1->2 1.002224 | |
LSL (register, 64-bit) 1->3 0.995526 | |
LSL (register, 64-bit) throughput 0.340318 | |
LSR (immediate, 32-bit) 1->2 0.999998 | |
LSR (immediate, 32-bit) throughput 0.334506 | |
LSR (immediate, 64-bit) 1->2 0.974744 | |
LSR (immediate, 64-bit) throughput 0.334901 | |
LSR (register, 32-bit) 1->2 0.759411 | |
LSR (register, 32-bit) 1->3 1.017538 | |
LSR (register, 32-bit) throughput 0.324230 | |
LSR (register, 64-bit) 1->2 1.006537 | |
LSR (register, 64-bit) 1->3 1.002235 | |
LSR (register, 64-bit) throughput 0.420014 | |
MADD (32-bit) 1->2 3.138926 | |
MADD (32-bit) 1->3 2.553080 | |
MADD (32-bit) 1->4 1.196828 | |
MADD (32-bit) throughput 1.056894 | |
MADD (64-bit) 1->2 3.257690 | |
MADD (64-bit) 1->3 2.933732 | |
MADD (64-bit) 1->4 0.961741 | |
MADD (64-bit) throughput 0.906788 | |
MNEG (32-bit) 1->2 3.001942 | |
MNEG (32-bit) 1->3 3.316489 | |
MNEG (32-bit) throughput 1.105070 | |
MNEG (64-bit) 1->2 3.482142 | |
MNEG (64-bit) 1->3 2.991488 | |
MNEG (64-bit) throughput 1.139201 | |
MOV (bitmask immediate, 32-bit) throughput 0.304318 | |
MOV (bitmask immediate, 64-bit) throughput 0.251238 | |
MOV (from sp, 32-bit) throughput 0.343093 | |
MOV (from sp, 64-bit) throughput 0.270773 | |
MOVK (32-bit) 1->1 1.278714 | |
MOVK (32-bit) throughput 0.259996 | |
MOVK (64-bit) 1->1 0.999928 | |
MOVK (64-bit) throughput 0.322862 | |
MOVN (32-bit) throughput 0.212576 | |
MOVN (64-bit) throughput 0.251179 | |
MOVZ (32-bit) throughput 0.251117 | |
MOVZ (64-bit) throughput 0.250952 | |
MRS (CNTFRQ_EL0) throughput 19.621549 | |
MRS (CNTPCT_EL0) throughput 1.285935 | |
MRS (CNTVCT_EL0) throughput 1.452263 | |
MRS (DCZID_EL0) throughput 1.324984 | |
MRS (FPCR) throughput 1.305691 | |
MRS (FPSR) throughput 7.396338 | |
MRS (NZCV) throughput 0.331167 | |
MRS (TPIDRRO_EL0) throughput 1.242838 | |
MRS (TPIDR_EL0) throughput 1.507344 | |
MSR (FPCR) throughput 10.534700 | |
MSR (FPSR) throughput 11.260972 | |
MSR (TPIDR_EL0) throughput 10.333137 | |
MSR (NZCV) throughput 0.378226 | |
MSUB (32-bit) 1->2 2.987768 | |
MSUB (32-bit) 1->3 3.016360 | |
MSUB (32-bit) 1->4 1.010691 | |
MSUB (32-bit) throughput 1.073758 | |
MSUB (64-bit) 1->2 3.016238 | |
MSUB (64-bit) 1->3 3.545916 | |
MSUB (64-bit) 1->4 0.993373 | |
MSUB (64-bit) throughput 0.990174 | |
MUL (32-bit) 1->2 3.835368 | |
MUL (32-bit) 1->3 1.896296 | |
MUL (32-bit) throughput 1.084675 | |
MUL (64-bit) 1->2 2.656042 | |
MUL (64-bit) 1->3 2.900016 | |
MUL (64-bit) throughput 1.142993 | |
MVN (register, 32-bit) 1->2 1.007733 | |
MVN (register, 32-bit) throughput 0.340498 | |
MVN (register, 64-bit) 1->2 0.980389 | |
MVN (register, 64-bit) throughput 0.335059 | |
MVN (register, lsl, 32-bit) 1->2 2.441820 | |
MVN (register, lsl, 32-bit) throughput 0.579095 | |
MVN (register, lsl, 64-bit) 1->2 1.982837 | |
MVN (register, lsl, 64-bit) throughput 0.702864 | |
MVN (register, lsr, 32-bit) 1->2 1.966313 | |
MVN (register, lsr, 32-bit) throughput 0.698766 | |
MVN (register, lsr, 64-bit) 1->2 1.996533 | |
MVN (register, lsr, 64-bit) throughput 0.737580 | |
MVN (register, asr, 32-bit) 1->2 1.998020 | |
MVN (register, asr, 32-bit) throughput 0.539739 | |
MVN (register, asr, 64-bit) 1->2 1.968144 | |
MVN (register, asr, 64-bit) throughput 0.721580 | |
MVN (register, ror, 32-bit) 1->2 2.002996 | |
MVN (register, ror, 32-bit) throughput 0.707091 | |
MVN (register, ror, 64-bit) 1->2 2.005102 | |
MVN (register, ror, 64-bit) throughput 0.522015 | |
NEG (register, 32-bit) 1->2 0.998181 | |
NEG (register, 32-bit) throughput 0.332776 | |
NEG (register, 64-bit) 1->2 1.001579 | |
NEG (register, 64-bit) throughput 0.371654 | |
NEG (register, lsl, 32-bit) 1->2 2.004817 | |
NEG (register, lsl, 32-bit) throughput 0.352660 | |
NEG (register, lsl, 64-bit) 1->2 1.999029 | |
NEG (register, lsl, 64-bit) throughput 0.642792 | |
NEG (register, lsr, 32-bit) 1->2 2.017135 | |
NEG (register, lsr, 32-bit) throughput 0.751452 | |
NEG (register, lsr, 64-bit) 1->2 2.001334 | |
NEG (register, lsr, 64-bit) throughput 0.774564 | |
NEG (register, asr, 32-bit) 1->2 1.827614 | |
NEG (register, asr, 32-bit) throughput 0.772062 | |
NEG (register, asr, 64-bit) 1->2 2.038299 | |
NEG (register, asr, 64-bit) throughput 0.654993 | |
NEGS (register, 32-bit) 1->2 1.003690 | |
NEGS (register, 32-bit) 3->2 1.014235 | |
NEGS (register, 32-bit) throughput 0.691212 | |
NEGS (register, 64-bit) 1->2 1.029062 | |
NEGS (register, 64-bit) 3->2 1.264785 | |
NEGS (register, 64-bit) throughput 0.488020 | |
NEGS (register, lsl, 32-bit) 1->2 2.118354 | |
NEGS (register, lsl, 32-bit) 3->2 2.121212 | |
NEGS (register, lsl, 32-bit) throughput 1.037693 | |
NEGS (register, lsl, 64-bit) 1->2 2.052487 | |
NEGS (register, lsl, 64-bit) 3->2 2.403352 | |
NEGS (register, lsl, 64-bit) throughput 0.711306 | |
NEGS (register, lsr, 32-bit) 1->2 1.994714 | |
NEGS (register, lsr, 32-bit) 3->2 2.554842 | |
NEGS (register, lsr, 32-bit) throughput 0.508404 | |
NEGS (register, lsr, 64-bit) 1->2 1.994434 | |
NEGS (register, lsr, 64-bit) 3->2 2.005952 | |
NEGS (register, lsr, 64-bit) throughput 0.816902 | |
NEGS (register, asr, 32-bit) 1->2 1.987701 | |
NEGS (register, asr, 32-bit) 3->2 2.346507 | |
NEGS (register, asr, 32-bit) throughput 0.742936 | |
NEGS (register, asr, 64-bit) 1->2 2.024417 | |
NEGS (register, asr, 64-bit) 3->2 2.615057 | |
NEGS (register, asr, 64-bit) throughput 0.482044 | |
NGC (register, 32-bit) 1->2 1.007653 | |
NGC (register, 32-bit) 1->3 1.009880 | |
NGC (register, 32-bit) throughput 0.337560 | |
NGC (register, 64-bit) 1->2 0.993044 | |
NGC (register, 64-bit) 1->3 1.383560 | |
NGC (register, 64-bit) throughput 0.344979 | |
NGCS (register, 32-bit) 1->2 1.005012 | |
NGCS (register, 32-bit) 1->3 1.034039 | |
NGCS (register, 32-bit) 3->2 1.016602 | |
NGCS (register, 32-bit) 3->3 0.997427 | |
NGCS (register, 32-bit) throughput 0.823515 | |
NGCS (register, 32-bit) throughput 0.622982 | |
NGCS (register, 64-bit) 1->2 0.995766 | |
NGCS (register, 64-bit) 1->3 0.994647 | |
NGCS (register, 64-bit) 3->2 1.520986 | |
NGCS (register, 64-bit) 3->3 0.994531 | |
NGCS (register, 64-bit) throughput 0.873412 | |
NGCS (register, 64-bit) throughput 0.640679 | |
NOP throughput 0.240462 | |
ORN (register, 32-bit) 1->2 1.002183 | |
ORN (register, 32-bit) 1->3 1.008706 | |
ORN (register, 32-bit) throughput 0.309213 | |
ORN (register, 64-bit) 1->2 1.001701 | |
ORN (register, 64-bit) 1->3 1.335583 | |
ORN (register, 64-bit) throughput 0.330946 | |
ORN (register, lsl, 32-bit) 1->2 1.922455 | |
ORN (register, lsl, 32-bit) 1->3 1.700878 | |
ORN (register, lsl, 32-bit) throughput 0.717911 | |
ORN (register, lsl, 64-bit) 1->2 1.996530 | |
ORN (register, lsl, 64-bit) 1->3 2.011210 | |
ORN (register, lsl, 64-bit) throughput 0.710283 | |
ORN (register, lsr, 32-bit) 1->2 2.003401 | |
ORN (register, lsr, 32-bit) 1->3 2.222689 | |
ORN (register, lsr, 32-bit) throughput 0.601278 | |
ORN (register, lsr, 64-bit) 1->2 1.979435 | |
ORN (register, lsr, 64-bit) 1->3 1.990490 | |
ORN (register, lsr, 64-bit) throughput 0.787249 | |
ORN (register, asr, 32-bit) 1->2 2.362830 | |
ORN (register, asr, 32-bit) 1->3 2.108351 | |
ORN (register, asr, 32-bit) throughput 0.709955 | |
ORN (register, asr, 64-bit) 1->2 2.178064 | |
ORN (register, asr, 64-bit) 1->3 2.159680 | |
ORN (register, asr, 64-bit) throughput 0.737944 | |
ORN (register, ror, 32-bit) 1->2 2.004898 | |
ORN (register, ror, 32-bit) 1->3 1.994786 | |
ORN (register, ror, 32-bit) throughput 0.669571 | |
ORN (register, ror, 64-bit) 1->2 1.818422 | |
ORN (register, ror, 64-bit) 1->3 2.219273 | |
ORN (register, ror, 64-bit) throughput 0.623367 | |
ORR (immediate, 32-bit) 1->2 0.997734 | |
ORR (immediate, 32-bit) throughput 0.339660 | |
ORR (immediate, 64-bit) 1->2 0.998866 | |
ORR (immediate, 64-bit) throughput 0.332465 | |
ORR (register, 32-bit) 1->2 0.998302 | |
ORR (register, 32-bit) 1->3 0.996370 | |
ORR (register, 32-bit) throughput 0.339626 | |
ORR (register, 64-bit) 1->2 1.001660 | |
ORR (register, 64-bit) 1->3 0.995164 | |
ORR (register, 64-bit) throughput 0.424834 | |
ORR (register, lsl, 32-bit) 1->2 2.002062 | |
ORR (register, lsl, 32-bit) 1->3 1.994905 | |
ORR (register, lsl, 32-bit) throughput 0.703444 | |
ORR (register, lsl, 64-bit) 1->2 1.912905 | |
ORR (register, lsl, 64-bit) 1->3 2.003516 | |
ORR (register, lsl, 64-bit) throughput 0.672556 | |
ORR (register, lsr, 32-bit) 1->2 2.002593 | |
ORR (register, lsr, 32-bit) 1->3 2.003603 | |
ORR (register, lsr, 32-bit) throughput 0.668405 | |
ORR (register, lsr, 64-bit) 1->2 2.001620 | |
ORR (register, lsr, 64-bit) 1->3 1.998141 | |
ORR (register, lsr, 64-bit) throughput 0.667745 | |
ORR (register, asr, 32-bit) 1->2 2.031339 | |
ORR (register, asr, 32-bit) 1->3 1.998748 | |
ORR (register, asr, 32-bit) throughput 0.669550 | |
ORR (register, asr, 64-bit) 1->2 2.001553 | |
ORR (register, asr, 64-bit) 1->3 1.923998 | |
ORR (register, asr, 64-bit) throughput 0.898453 | |
ORR (register, ror, 32-bit) 1->2 1.359639 | |
ORR (register, ror, 32-bit) 1->3 2.015131 | |
ORR (register, ror, 32-bit) throughput 0.856870 | |
ORR (register, ror, 64-bit) 1->2 2.339577 | |
ORR (register, ror, 64-bit) 1->3 2.057684 | |
ORR (register, ror, 64-bit) throughput 0.737398 | |
PRFM (register, PLDL1KEEP) throughput 2.366658 | |
PRFM (register, PLDL1KEEP) throughput 0.448491 | |
PRFM (register, PLDL1STRM) throughput 3.432769 | |
PRFM (register, PLDL1STRM) throughput 0.500840 | |
PRFM (register, PLDL2KEEP) throughput 2.183492 | |
PRFM (register, PLDL2KEEP) throughput 2.044444 | |
PRFM (register, PLDL2STRM) throughput 2.474707 | |
PRFM (register, PLDL2STRM) throughput 2.337495 | |
PRFM (register, PLDL3KEEP) throughput 1.903101 | |
PRFM (register, PLDL3KEEP) throughput 2.383121 | |
PRFM (register, PLDL3STRM) throughput 1.649719 | |
PRFM (register, PLDL3STRM) throughput 2.293085 | |
PRFM (register, PLIL1KEEP) throughput 2.304429 | |
PRFM (register, PLIL1KEEP) throughput 1.972693 | |
PRFM (register, PLIL1STRM) throughput 2.067304 | |
PRFM (register, PLIL1STRM) throughput 2.090284 | |
PRFM (register, PLIL2KEEP) throughput 2.319946 | |
PRFM (register, PLIL2KEEP) throughput 2.078469 | |
PRFM (register, PLIL2STRM) throughput 2.096154 | |
PRFM (register, PLIL2STRM) throughput 2.053392 | |
PRFM (register, PLIL3KEEP) throughput 1.966426 | |
PRFM (register, PLIL3KEEP) throughput 2.210865 | |
PRFM (register, PLIL3STRM) throughput 1.958898 | |
PRFM (register, PLIL3STRM) throughput 2.223792 | |
PRFM (register, PSTL1KEEP) throughput 2.796336 | |
PRFM (register, PSTL1KEEP) throughput 0.497483 | |
PRFM (register, PSTL1STRM) throughput 3.631398 | |
PRFM (register, PSTL1STRM) throughput 0.498949 | |
PRFM (register, PSTL2KEEP) throughput 2.415593 | |
PRFM (register, PSTL2KEEP) throughput 2.245132 | |
PRFM (register, PSTL2STRM) throughput 2.311644 | |
PRFM (register, PSTL2STRM) throughput 2.134797 | |
PRFM (register, PSTL3KEEP) throughput 1.930098 | |
PRFM (register, PSTL3KEEP) throughput 1.654864 | |
PRFM (register, PSTL3STRM) throughput 2.009697 | |
PRFM (register, PSTL3STRM) throughput 1.521765 | |
PSSBB throughput 29.430606 | |
RBIT (32-bit) 1->2 1.007912 | |
RBIT (32-bit) throughput 0.334038 | |
RBIT (64-bit) 1->2 0.988039 | |
RBIT (64-bit) throughput 0.351251 | |
REV (32-bit) 1->2 0.993001 | |
REV (32-bit) throughput 0.343402 | |
REV (64-bit) 1->2 0.993762 | |
REV (64-bit) throughput 0.420122 | |
REV16 (32-bit) 1->2 1.249206 | |
REV16 (32-bit) throughput 0.350687 | |
REV16 (64-bit) 1->2 1.000258 | |
REV16 (64-bit) throughput 0.339909 | |
REV32 1->2 1.008991 | |
REV32 throughput 0.366831 | |
ROR (immediate, 32-bit) 1->2 0.784685 | |
ROR (immediate, 32-bit) throughput 0.345924 | |
ROR (immediate, 64-bit) 1->2 0.931590 | |
ROR (immediate, 64-bit) throughput 0.338662 | |
ROR (register, 32-bit) 1->2 0.928486 | |
ROR (register, 32-bit) 1->3 0.973620 | |
ROR (register, 32-bit) throughput 0.334887 | |
ROR (register, 64-bit) 1->2 1.165951 | |
ROR (register, 64-bit) 1->3 0.996057 | |
ROR (register, 64-bit) throughput 0.333370 | |
SBC (32-bit) 1->2 0.933983 | |
SBC (32-bit) 1->3 0.933785 | |
SBC (32-bit) 1->4 0.938271 | |
SBC (32-bit) throughput 0.289599 | |
SBC (64-bit) 1->2 0.999797 | |
SBC (64-bit) 1->3 1.001092 | |
SBC (64-bit) 1->4 1.003517 | |
SBC (64-bit) throughput 0.375187 | |
SBCS (32-bit) 1->2 0.995463 | |
SBCS (32-bit) 1->3 1.049725 | |
SBCS (32-bit) 1->4 1.224936 | |
SBCS (32-bit) 4->2 1.005587 | |
SBCS (32-bit) 4->3 0.988058 | |
SBCS (32-bit) 4->4 0.998342 | |
SBCS (32-bit) throughput 1.084498 | |
SBCS (32-bit) throughput 0.687237 | |
SBCS (64-bit) 1->2 0.936380 | |
SBCS (64-bit) 1->3 0.987722 | |
SBCS (64-bit) 1->4 1.242600 | |
SBCS (64-bit) 4->2 1.007963 | |
SBCS (64-bit) 4->3 1.006514 | |
SBCS (64-bit) 4->4 1.014612 | |
SBCS (64-bit) throughput 0.816001 | |
SBCS (64-bit) throughput 0.607842 | |
SBFIZ (32-bit) 1->2 1.000362 | |
SBFIZ (32-bit) throughput 0.344130 | |
SBFIZ (64-bit) 1->2 1.046099 | |
SBFIZ (64-bit) throughput 0.334362 | |
SBFX (32-bit) 1->2 1.005992 | |
SBFX (32-bit) throughput 0.368015 | |
SBFX (64-bit) 1->2 1.139500 | |
SBFX (64-bit) throughput 0.330851 | |
SDIV (fast, 32-bit) 1->2 9.518495 | |
SDIV (fast, 32-bit) 1->3 7.695666 | |
SDIV (fast, 32-bit) throughput 8.582758 | |
SDIV (slow, 32-bit) 1->2 9.345429 | |
SDIV (slow, 32-bit) 1->3 13.881416 | |
SDIV (slow, 32-bit) throughput 15.943366 | |
SDIV (slow, 32-bit) 1->2 18.561055 | |
SDIV (slow, 32-bit) 1->3 10.478884 | |
SDIV (slow, 32-bit) throughput 16.808209 | |
SDIV (fast, 64-bit) 1->2 5.942182 | |
SDIV (fast, 64-bit) 1->3 7.637275 | |
SDIV (fast, 64-bit) throughput 10.470722 | |
SDIV (medium, 64-bit) 1->2 14.345385 | |
SDIV (medium, 64-bit) 1->3 15.115875 | |
SDIV (medium, 64-bit) throughput 16.932808 | |
SDIV (medium, 64-bit) 1->2 12.990643 | |
SDIV (medium, 64-bit) 1->3 15.933564 | |
SDIV (medium, 64-bit) throughput 19.733054 | |
SDIV (slow, 64-bit) 1->2 21.932172 | |
SDIV (slow, 64-bit) 1->3 23.227049 | |
SDIV (slow, 64-bit) throughput 33.841319 | |
SDIV (slow, 64-bit) 1->2 23.680575 | |
SDIV (slow, 64-bit) 1->3 33.495257 | |
SDIV (slow, 64-bit) throughput 28.844340 | |
SMADDL 1->2 2.860401 | |
SMADDL 1->3 3.059803 | |
SMADDL 1->4 0.997017 | |
SMADDL throughput 1.206069 | |
SMNEGL 1->2 2.949267 | |
SMNEGL 1->3 2.522393 | |
SMNEGL throughput 0.803665 | |
SMSUBL 1->2 3.174978 | |
SMSUBL 1->3 2.726028 | |
SMSUBL 1->4 0.996155 | |
SMSUBL throughput 0.855693 | |
SMULH 1->2 3.036645 | |
SMULH 1->3 3.071626 | |
SMULH throughput 1.114094 | |
SMULL 1->2 2.923614 | |
SMULL 1->3 2.500358 | |
SMULL throughput 0.769158 | |
SSBB throughput 23.477772 | |
STLRB throughput 4.702660 | |
STLRH throughput 6.038870 | |
STLXP (32-bit) throughput 3.168703 | |
STLXP (32-bit) throughput 2.996401 | |
STLXP (64-bit) throughput 3.293522 | |
STLXP (64-bit) throughput 3.034821 | |
STLXR (32-bit) throughput 3.155222 | |
STLXR (32-bit) throughput 3.139267 | |
STLXR (64-bit) throughput 3.247223 | |
STLXR (64-bit) throughput 3.252623 | |
STLXRB throughput 3.039395 | |
STLXRB throughput 3.004130 | |
STLXRH throughput 3.036415 | |
STLXRH throughput 3.013581 | |
STNP (32-bit) throughput 1.006236 | |
STNP (32-bit) throughput 1.040422 | |
STNP (64-bit) throughput 1.017249 | |
STNP (64-bit) throughput 1.033935 | |
STP (32-bit) throughput 1.000935 | |
STP (64-bit) throughput 0.999757 | |
STP (post-index, 32-bit) 3->3 0.910829 | |
STP (post-index, 32-bit) throughput 1.012611 | |
STP (post-index, 64-bit) 3->3 1.065837 | |
STP (post-index, 64-bit) throughput 1.026523 | |
STP (pre-index, 32-bit) 3->3 1.023981 | |
STP (pre-index, 32-bit) throughput 1.288093 | |
STP (pre-index, 64-bit) 3->3 1.010690 | |
STP (pre-index, 64-bit) throughput 1.804051 | |
STP (signed offset, 32-bit) throughput 1.776866 | |
STP (signed offset, 64-bit) throughput 1.631674 | |
STR (32-bit) throughput 1.037049 | |
STR (64-bit) throughput 1.066534 | |
STR (post-index, 32-bit) 2->2 0.992272 | |
STR (post-index, 32-bit) throughput 1.122895 | |
STR (post-index, 64-bit) 2->2 0.959052 | |
STR (post-index, 64-bit) throughput 1.291874 | |
STR (pre-index, 32-bit) 2->2 0.980873 | |
STR (pre-index, 32-bit) throughput 1.192080 | |
STR (pre-index, 64-bit) 2->2 1.011426 | |
STR (pre-index, 64-bit) throughput 1.158453 | |
STR (unsigned offset, 32-bit) throughput 1.061188 | |
STR (unsigned offset, 64-bit) throughput 1.324361 | |
STR (register, 32-bit) throughput 1.046200 | |
STR (register, 64-bit) throughput 1.072395 | |
STR (register, uxtw, 32-bit) throughput 1.089956 | |
STR (register, uxtw, 64-bit) throughput 1.018048 | |
STR (register, sxtw, 32-bit) throughput 0.887770 | |
STR (register, sxtw, 64-bit) throughput 1.062124 | |
STR (register, lsl, 32-bit) throughput 1.107123 | |
STR (register, lsl, 64-bit) throughput 1.038845 | |
STRB throughput 0.869107 | |
STRB (post-index) 2->2 1.001037 | |
STRB (post-index) throughput 1.080036 | |
STRB (pre-index) 2->2 0.838151 | |
STRB (pre-index) throughput 1.041129 | |
STRB (unsigned offset) throughput 1.067273 | |
STRB (register) throughput 1.056076 | |
STRB (register, uxtw) throughput 1.171605 | |
STRB (register, sxtw) throughput 1.065470 | |
STRH throughput 1.085214 | |
STRH (post-index) 2->2 1.001321 | |
STRH (post-index) throughput 1.001626 | |
STRH (pre-index) 2->2 0.982245 | |
STRH (pre-index) throughput 1.087158 | |
STRH (unsigned offset) throughput 1.071208 | |
STRH (register) throughput 1.097764 | |
STRH (register, uxtw) throughput 0.917595 | |
STRH (register, sxtw) throughput 1.044746 | |
STUR (32-bit) throughput 1.130743 | |
STUR (64-bit) throughput 1.049623 | |
STURB throughput 0.964056 | |
STURH throughput 0.914970 | |
STXP (64-bit) throughput 3.281451 | |
STXP (64-bit) throughput 2.546651 | |
STXR (32-bit) throughput 2.267968 | |
STXR (32-bit) throughput 3.296121 | |
STXR (64-bit) throughput 2.382106 | |
STXR (64-bit) throughput 3.009963 | |
STXRB throughput 1.939569 | |
STXRB throughput 2.892976 | |
STXRH throughput 2.085682 | |
STXRH throughput 3.540716 | |
SUB (sxtb, 32-bit) 1->2 2.004211 | |
SUB (sxtb, 32-bit) 1->3 2.043246 | |
SUB (sxtb, 32-bit) throughput 0.736602 | |
SUB (sxtb, 64-bit) 1->2 2.005738 | |
SUB (sxtb, 64-bit) 1->3 2.002146 | |
SUB (sxtb, 64-bit) throughput 0.576874 | |
SUB (uxtb, 32-bit) 1->2 2.014217 | |
SUB (uxtb, 32-bit) 1->3 2.044544 | |
SUB (uxtb, 32-bit) throughput 0.696961 | |
SUB (uxtb, 64-bit) 1->2 1.974298 | |
SUB (uxtb, 64-bit) 1->3 2.010651 | |
SUB (uxtb, 64-bit) throughput 0.684330 | |
SUB (sxth, 32-bit) 1->2 2.000087 | |
SUB (sxth, 32-bit) 1->3 2.121735 | |
SUB (sxth, 32-bit) throughput 0.695633 | |
SUB (sxth, 64-bit) 1->2 1.997632 | |
SUB (sxth, 64-bit) 1->3 1.975452 | |
SUB (sxth, 64-bit) throughput 0.710938 | |
SUB (uxth, 32-bit) 1->2 2.051740 | |
SUB (uxth, 32-bit) 1->3 1.573538 | |
SUB (uxth, 32-bit) throughput 0.516153 | |
SUB (uxth, 64-bit) 1->2 1.974093 | |
SUB (uxth, 64-bit) 1->3 2.027425 | |
SUB (uxth, 64-bit) throughput 0.802003 | |
SUB (sxtw, 32-bit) 1->2 0.973586 | |
SUB (sxtw, 32-bit) 1->3 0.987429 | |
SUB (sxtw, 32-bit) throughput 0.300871 | |
SUB (sxtw, 64-bit) 1->2 2.096520 | |
SUB (sxtw, 64-bit) 1->3 2.015650 | |
SUB (sxtw, 64-bit) throughput 0.655648 | |
SUB (uxtw, 32-bit) 1->2 1.001863 | |
SUB (uxtw, 32-bit) 1->3 0.998423 | |
SUB (uxtw, 32-bit) throughput 0.334941 | |
SUB (uxtw, 64-bit) 1->2 1.993801 | |
SUB (uxtw, 64-bit) 1->3 2.001698 | |
SUB (uxtw, 64-bit) throughput 0.750693 | |
SUB (uxtx, 64-bit) 1->2 0.995164 | |
SUB (uxtx, 64-bit) 1->3 1.003968 | |
SUB (uxtx, 64-bit) throughput 0.334201 | |
SUB (sxtx, 64-bit) 1->2 0.991052 | |
SUB (sxtx, 64-bit) 1->3 1.003441 | |
SUB (sxtx, 64-bit) throughput 0.334432 | |
SUB (immediate, 32-bit) 1->2 1.002390 | |
SUB (immediate, 32-bit) throughput 0.392191 | |
SUB (immediate, 64-bit) 1->2 0.996047 | |
SUB (immediate, 64-bit) throughput 0.338321 | |
SUB (shifted immediate, 32-bit) 1->2 1.003848 | |
SUB (shifted immediate, 32-bit) throughput 0.342450 | |
SUB (shifted immediate, 64-bit) 1->2 1.003274 | |
SUB (shifted immediate, 64-bit) throughput 0.334743 | |
SUB (register, 32-bit) 1->2 1.018484 | |
SUB (register, 32-bit) 1->3 0.993164 | |
SUB (register, 32-bit) throughput 0.348630 | |
SUB (register, 64-bit) 1->2 1.008058 | |
SUB (register, 64-bit) 1->3 1.023849 | |
SUB (register, 64-bit) throughput 0.332068 | |
SUB (register, lsl, 32-bit) 1->2 2.234117 | |
SUB (register, lsl, 32-bit) 1->3 1.994112 | |
SUB (register, lsl, 32-bit) throughput 0.733360 | |
SUB (register, lsl, 64-bit) 1->2 1.973130 | |
SUB (register, lsl, 64-bit) 1->3 2.020804 | |
SUB (register, lsl, 64-bit) throughput 0.601726 | |
SUB (register, lsr, 32-bit) 1->2 2.336680 | |
SUB (register, lsr, 32-bit) 1->3 2.189090 | |
SUB (register, lsr, 32-bit) throughput 0.670024 | |
SUB (register, lsr, 64-bit) 1->2 1.959530 | |
SUB (register, lsr, 64-bit) 1->3 2.005255 | |
SUB (register, lsr, 64-bit) throughput 0.525386 | |
SUB (register, asr, 32-bit) 1->2 2.056567 | |
SUB (register, asr, 32-bit) 1->3 2.012673 | |
SUB (register, asr, 32-bit) throughput 0.733580 | |
SUB (register, asr, 64-bit) 1->2 2.034259 | |
SUB (register, asr, 64-bit) 1->3 2.025747 | |
SUB (register, asr, 64-bit) throughput 0.661467 | |
SUBS (sxtb, 32-bit) 1->2 1.716725 | |
SUBS (sxtb, 32-bit) 1->3 2.000566 | |
SUBS (sxtb, 32-bit) 4->2 2.478841 | |
SUBS (sxtb, 32-bit) 4->3 2.000123 | |
SUBS (sxtb, 32-bit) throughput 0.581329 | |
SUBS (sxtb, 64-bit) 1->2 2.040935 | |
SUBS (sxtb, 64-bit) 1->3 1.962865 | |
SUBS (sxtb, 64-bit) 4->2 2.436956 | |
SUBS (sxtb, 64-bit) 4->3 2.535168 | |
SUBS (sxtb, 64-bit) throughput 0.753128 | |
SUBS (uxtb, 32-bit) 1->2 1.993831 | |
SUBS (uxtb, 32-bit) 1->3 2.003435 | |
SUBS (uxtb, 32-bit) 4->2 1.998302 | |
SUBS (uxtb, 32-bit) 4->3 2.013407 | |
SUBS (uxtb, 32-bit) throughput 0.555973 | |
SUBS (uxtb, 64-bit) 1->2 1.927597 | |
SUBS (uxtb, 64-bit) 1->3 1.953495 | |
SUBS (uxtb, 64-bit) 4->2 2.254119 | |
SUBS (uxtb, 64-bit) 4->3 1.953660 | |
SUBS (uxtb, 64-bit) throughput 0.670060 | |
SUBS (sxth, 32-bit) 1->2 2.478737 | |
SUBS (sxth, 32-bit) 1->3 2.004324 | |
SUBS (sxth, 32-bit) 4->2 2.686343 | |
SUBS (sxth, 32-bit) 4->3 1.983611 | |
SUBS (sxth, 32-bit) throughput 0.713921 | |
SUBS (sxth, 64-bit) 1->2 1.985050 | |
SUBS (sxth, 64-bit) 1->3 2.011864 | |
SUBS (sxth, 64-bit) 4->2 2.576234 | |
SUBS (sxth, 64-bit) 4->3 5.129637 | |
SUBS (sxth, 64-bit) throughput 0.523050 | |
SUBS (uxth, 32-bit) 1->2 2.018709 | |
SUBS (uxth, 32-bit) 1->3 2.009619 | |
SUBS (uxth, 32-bit) 4->2 2.008260 | |
SUBS (uxth, 32-bit) 4->3 1.950511 | |
SUBS (uxth, 32-bit) throughput 0.713970 | |
SUBS (uxth, 64-bit) 1->2 1.964434 | |
SUBS (uxth, 64-bit) 1->3 1.744140 | |
SUBS (uxth, 64-bit) 4->2 2.394311 | |
SUBS (uxth, 64-bit) 4->3 2.007369 | |
SUBS (uxth, 64-bit) throughput 0.774438 | |
SUBS (sxtw, 32-bit) 1->2 0.999394 | |
SUBS (sxtw, 32-bit) 1->3 0.998140 | |
SUBS (sxtw, 32-bit) 4->2 1.005628 | |
SUBS (sxtw, 32-bit) 4->3 1.633143 | |
SUBS (sxtw, 32-bit) throughput 0.564462 | |
SUBS (sxtw, 64-bit) 1->2 2.007989 | |
SUBS (sxtw, 64-bit) 1->3 1.997505 | |
SUBS (sxtw, 64-bit) 4->2 2.671290 | |
SUBS (sxtw, 64-bit) 4->3 2.962775 | |
SUBS (sxtw, 64-bit) throughput 0.738461 | |
SUBS (uxtw, 32-bit) 1->2 0.998222 | |
SUBS (uxtw, 32-bit) 1->3 0.911758 | |
SUBS (uxtw, 32-bit) 4->2 1.007395 | |
SUBS (uxtw, 32-bit) 4->3 0.998224 | |
SUBS (uxtw, 32-bit) throughput 0.511613 | |
SUBS (uxtw, 64-bit) 1->2 2.031998 | |
SUBS (uxtw, 64-bit) 1->3 2.005174 | |
SUBS (uxtw, 64-bit) 4->2 1.998384 | |
SUBS (uxtw, 64-bit) 4->3 1.972726 | |
SUBS (uxtw, 64-bit) throughput 0.803577 | |
SUBS (uxtx, 64-bit) 1->2 0.983823 | |
SUBS (uxtx, 64-bit) 1->3 0.997179 | |
SUBS (uxtx, 64-bit) 4->2 0.999915 | |
SUBS (uxtx, 64-bit) 4->3 1.019736 | |
SUBS (uxtx, 64-bit) throughput 0.538506 | |
SUBS (sxtx, 64-bit) 1->2 1.000040 | |
SUBS (sxtx, 64-bit) 1->3 1.007167 | |
SUBS (sxtx, 64-bit) 4->2 0.885515 | |
SUBS (sxtx, 64-bit) 4->3 0.993622 | |
SUBS (sxtx, 64-bit) throughput 0.543007 | |
SUBS (immediate, 32-bit) 1->2 0.957980 | |
SUBS (immediate, 32-bit) 3->2 1.329272 | |
SUBS (immediate, 32-bit) throughput 0.544901 | |
SUBS (immediate, 64-bit) 1->2 0.929336 | |
SUBS (immediate, 64-bit) 3->2 1.101514 | |
SUBS (immediate, 64-bit) throughput 0.506838 | |
SUBS (shifted immediate, 32-bit) 1->2 0.962757 | |
SUBS (shifted immediate, 32-bit) 3->2 1.031988 | |
SUBS (shifted immediate, 32-bit) throughput 0.534780 | |
SUBS (shifted immediate, 64-bit) 1->2 1.008543 | |
SUBS (shifted immediate, 64-bit) 3->2 1.012385 | |
SUBS (shifted immediate, 64-bit) throughput 0.565785 | |
SUBS (register, 32-bit) 1->2 1.000203 | |
SUBS (register, 32-bit) 1->3 1.001863 | |
SUBS (register, 32-bit) 4->2 1.015914 | |
SUBS (register, 32-bit) 4->3 0.996084 | |
SUBS (register, 32-bit) throughput 0.486281 | |
SUBS (register, 64-bit) 1->2 1.001741 | |
SUBS (register, 64-bit) 1->3 1.002632 | |
SUBS (register, 64-bit) 4->2 1.002789 | |
SUBS (register, 64-bit) 4->3 0.998545 | |
SUBS (register, 64-bit) throughput 0.581190 | |
SUBS (register, lsl, 32-bit) 1->2 2.041629 | |
SUBS (register, lsl, 32-bit) 1->3 1.771928 | |
SUBS (register, lsl, 32-bit) 4->2 1.971644 | |
SUBS (register, lsl, 32-bit) 4->3 2.018543 | |
SUBS (register, lsl, 32-bit) throughput 0.747687 | |
SUBS (register, lsl, 64-bit) 1->2 2.001804 | |
SUBS (register, lsl, 64-bit) 1->3 2.005041 | |
SUBS (register, lsl, 64-bit) 4->2 2.162206 | |
SUBS (register, lsl, 64-bit) 4->3 2.009880 | |
SUBS (register, lsl, 64-bit) throughput 0.719842 | |
SUBS (register, lsr, 32-bit) 1->2 1.932946 | |
SUBS (register, lsr, 32-bit) 1->3 2.281339 | |
SUBS (register, lsr, 32-bit) 4->2 1.783663 | |
SUBS (register, lsr, 32-bit) 4->3 2.220550 | |
SUBS (register, lsr, 32-bit) throughput 0.698893 | |
SUBS (register, lsr, 64-bit) 1->2 2.002951 | |
SUBS (register, lsr, 64-bit) 1->3 1.996205 | |
SUBS (register, lsr, 64-bit) 4->2 2.005103 | |
SUBS (register, lsr, 64-bit) 4->3 2.203084 | |
SUBS (register, lsr, 64-bit) throughput 0.674770 | |
SUBS (register, asr, 32-bit) 1->2 2.004216 | |
SUBS (register, asr, 32-bit) 1->3 2.002840 | |
SUBS (register, asr, 32-bit) 4->2 2.276786 | |
SUBS (register, asr, 32-bit) 4->3 2.004126 | |
SUBS (register, asr, 32-bit) throughput 0.668377 | |
SUBS (register, asr, 64-bit) 1->2 2.080564 | |
SUBS (register, asr, 64-bit) 1->3 2.287363 | |
SUBS (register, asr, 64-bit) 4->2 2.066296 | |
SUBS (register, asr, 64-bit) 4->3 2.001721 | |
SUBS (register, asr, 64-bit) throughput 0.671191 | |
SXTB (32-bit) 1->2 0.999913 | |
SXTB (32-bit) throughput 0.334218 | |
SXTB (64-bit) 1->2 1.061738 | |
SXTB (64-bit) throughput 0.339234 | |
SXTH (32-bit) 1->2 0.999913 | |
SXTH (32-bit) throughput 0.338622 | |
SXTH (64-bit) 1->2 1.000000 | |
SXTH (64-bit) throughput 0.335595 | |
SXTW 1->2 1.000002 | |
SXTW throughput 0.333986 | |
TBNZ (not taken) throughput 0.587511 | |
TBNZ (taken) throughput 1.411866 | |
TBZ (not taken) throughput 0.588943 | |
TBZ (taken) throughput 1.337055 | |
TST (immediate, 32-bit) 2->1 0.999998 | |
TST (immediate, 32-bit) throughput 0.362335 | |
TST (immediate, 64-bit) 2->1 0.996204 | |
TST (immediate, 64-bit) throughput 0.366183 | |
TST (register, 32-bit) 3->1 0.998949 | |
TST (register, 32-bit) 3->2 1.002551 | |
TST (register, 32-bit) throughput 0.366800 | |
TST (register, 64-bit) 3->1 1.004296 | |
TST (register, 64-bit) 3->2 1.585223 | |
TST (register, 64-bit) throughput 0.428843 | |
TST (register, lsl, 32-bit) 3->1 2.005090 | |
TST (register, lsl, 32-bit) 3->2 2.019315 | |
TST (register, lsl, 32-bit) throughput 0.668674 | |
TST (register, lsl, 64-bit) 3->1 2.003076 | |
TST (register, lsl, 64-bit) 3->2 2.005181 | |
TST (register, lsl, 64-bit) throughput 0.659913 | |
TST (register, lsr, 32-bit) 3->1 2.004331 | |
TST (register, lsr, 32-bit) 3->2 2.004090 | |
TST (register, lsr, 32-bit) throughput 0.668040 | |
TST (register, lsr, 64-bit) 3->1 2.001214 | |
TST (register, lsr, 64-bit) 3->2 2.103861 | |
TST (register, lsr, 64-bit) throughput 0.693147 | |
TST (register, asr, 32-bit) 3->1 2.028614 | |
TST (register, asr, 32-bit) 3->2 2.000968 | |
TST (register, asr, 32-bit) throughput 0.669397 | |
TST (register, asr, 64-bit) 3->1 1.955793 | |
TST (register, asr, 64-bit) 3->2 2.022348 | |
TST (register, asr, 64-bit) throughput 0.669822 | |
UBFIZ (32-bit) 1->2 0.916562 | |
UBFIZ (32-bit) throughput 0.346592 | |
UBFIZ (64-bit) 1->2 1.028164 | |
UBFIZ (64-bit) throughput 0.331585 | |
UBFX (32-bit) 1->2 0.995410 | |
UBFX (32-bit) throughput 0.335061 | |
UBFX (64-bit) 1->2 1.002672 | |
UBFX (64-bit) throughput 0.334148 | |
UDIV (fast, 32-bit) 1->2 6.995709 | |
UDIV (fast, 32-bit) 1->3 7.035936 | |
UDIV (fast, 32-bit) throughput 7.239954 | |
UDIV (slow, 32-bit) 1->2 13.583309 | |
UDIV (slow, 32-bit) 1->3 13.090824 | |
UDIV (slow, 32-bit) throughput 13.173184 | |
UDIV (fast, 64-bit) 1->2 7.029979 | |
UDIV (fast, 64-bit) 1->3 7.078598 | |
UDIV (fast, 64-bit) throughput 7.045103 | |
UDIV (medium, 64-bit) 1->2 13.018100 | |
UDIV (medium, 64-bit) 1->3 12.219357 | |
UDIV (medium, 64-bit) throughput 13.337779 | |
UDIV (slow, 64-bit) 1->2 21.171416 | |
UDIV (slow, 64-bit) 1->3 21.291294 | |
UDIV (slow, 64-bit) throughput 21.235447 | |
UMADDL 1->2 3.000649 | |
UMADDL 1->3 3.009396 | |
UMADDL 1->4 1.107719 | |
UMADDL throughput 1.002065 | |
UMNEGL 1->2 3.305753 | |
UMNEGL 1->3 3.545043 | |
UMNEGL throughput 1.023279 | |
UMSUBL 1->2 3.009543 | |
UMSUBL 1->3 3.009273 | |
UMSUBL 1->4 0.994410 | |
UMSUBL throughput 1.005992 | |
UMULH 1->2 3.007371 | |
UMULH 1->3 3.040855 | |
UMULH throughput 0.999651 | |
UMULL 1->2 3.052099 | |
UMULL 1->3 3.002870 | |
UMULL throughput 1.000560 | |
UXTB 1->2 1.001051 | |
UXTB throughput 0.334248 | |
UXTH 1->2 1.001660 | |
UXTH throughput 0.332955 | |
YIELD throughput 0.254358 | |
ABS (vector, 8B) 1->2 3.007129 | |
ABS (vector, 8B) throughput 0.500126 | |
ABS (vector, 16B) 1->2 3.001739 | |
ABS (vector, 16B) throughput 0.501514 | |
ABS (vector, 4H) 1->2 3.006235 | |
ABS (vector, 4H) throughput 0.500707 | |
ABS (vector, 8H) 1->2 3.046647 | |
ABS (vector, 8H) throughput 0.499743 | |
ABS (vector, 2S) 1->2 2.999596 | |
ABS (vector, 2S) throughput 0.506094 | |
ABS (vector, 4S) 1->2 2.998949 | |
ABS (vector, 4S) throughput 0.501265 | |
ABS (vector, 2D) 1->2 3.000807 | |
ABS (vector, 2D) throughput 0.502202 | |
ABS (vector, D) 1->2 3.132653 | |
ABS (vector, D) throughput 0.504906 | |
ADD (vector, 8B) 1->2 1.998828 | |
ADD (vector, 8B) 1->3 2.010204 | |
ADD (vector, 8B) throughput 0.500486 | |
ADD (vector, 16B) 1->2 1.998020 | |
ADD (vector, 16B) 1->3 1.956177 | |
ADD (vector, 16B) throughput 0.513662 | |
ADD (vector, 4H) 1->2 2.003894 | |
ADD (vector, 4H) 1->3 2.004860 | |
ADD (vector, 4H) throughput 0.501321 | |
ADD (vector, 8H) 1->2 1.999109 | |
ADD (vector, 8H) 1->3 2.004777 | |
ADD (vector, 8H) throughput 0.500596 | |
ADD (vector, 2S) 1->2 1.997091 | |
ADD (vector, 2S) 1->3 1.992823 | |
ADD (vector, 2S) throughput 0.500380 | |
ADD (vector, 4S) 1->2 1.996446 | |
ADD (vector, 4S) 1->3 2.003119 | |
ADD (vector, 4S) throughput 0.501766 | |
ADD (vector, 2D) 1->2 2.009273 | |
ADD (vector, 2D) 1->3 2.002915 | |
ADD (vector, 2D) throughput 0.501035 | |
ADD (vector, D) 1->2 1.999031 | |
ADD (vector, D) 1->3 1.995198 | |
ADD (vector, D) throughput 0.500850 | |
ADDHN (vector, 8H) 1->2 3.050794 | |
ADDHN (vector, 8H) 1->3 3.004698 | |
ADDHN (vector, 8H) throughput 0.499985 | |
ADDHN (vector, 4S) 1->2 3.005184 | |
ADDHN (vector, 4S) 1->3 3.003886 | |
ADDHN (vector, 4S) throughput 0.502014 | |
ADDHN (vector, 2D) 1->2 3.006722 | |
ADDHN (vector, 2D) 1->3 3.114175 | |
ADDHN (vector, 2D) throughput 0.483302 | |
ADDHN2 (vector, 8H) 1->1 3.003695 | |
ADDHN2 (vector, 8H) 1->2 3.553686 | |
ADDHN2 (vector, 8H) 1->3 2.895089 | |
ADDHN2 (vector, 8H) throughput 0.503806 | |
ADDHN2 (vector, 8H) throughput 0.516070 | |
ADDHN2 (vector, 4S) 1->1 3.003928 | |
ADDHN2 (vector, 4S) 1->2 3.027697 | |
ADDHN2 (vector, 4S) 1->3 3.003928 | |
ADDHN2 (vector, 4S) throughput 0.505403 | |
ADDHN2 (vector, 4S) throughput 0.567695 | |
ADDHN2 (vector, 2D) 1->1 3.008586 | |
ADDHN2 (vector, 2D) 1->2 3.015104 | |
ADDHN2 (vector, 2D) 1->3 2.997130 | |
ADDHN2 (vector, 2D) throughput 0.500000 | |
ADDHN2 (vector, 2D) throughput 0.501769 | |
ADDP (scalar) 1->2 1.999740 | |
ADDP (scalar) throughput 0.485770 | |
ADDP (vector, 8B) 1->2 1.945793 | |
ADDP (vector, 8B) 1->3 2.073932 | |
ADDP (vector, 8B) throughput 0.506383 | |
ADDP (vector, 16B) 1->2 2.002389 | |
ADDP (vector, 16B) 1->3 2.004050 | |
ADDP (vector, 16B) throughput 0.499657 | |
ADDP (vector, 4H) 1->2 1.998949 | |
ADDP (vector, 4H) 1->3 1.911593 | |
ADDP (vector, 4H) throughput 0.469217 | |
ADDP (vector, 8H) 1->2 1.928630 | |
ADDP (vector, 8H) 1->3 2.001374 | |
ADDP (vector, 8H) throughput 0.508104 | |
ADDP (vector, 2S) 1->2 2.041583 | |
ADDP (vector, 2S) 1->3 1.995039 | |
ADDP (vector, 2S) throughput 0.500121 | |
ADDP (vector, 4S) 1->2 1.998425 | |
ADDP (vector, 4S) 1->3 2.002755 | |
ADDP (vector, 4S) throughput 0.500258 | |
ADDP (vector, 2D) 1->2 2.006924 | |
ADDP (vector, 2D) 1->3 2.006438 | |
ADDP (vector, 2D) throughput 0.501918 | |
ADDV (8B) 1->2 2.999030 | |
ADDV (8B) throughput 0.501711 | |
ADDV (16B) 1->2 3.138402 | |
ADDV (16B) throughput 0.564285 | |
ADDV (4H) 1->2 3.041059 | |
ADDV (4H) throughput 0.504261 | |
ADDV (8H) 1->2 3.002345 | |
ADDV (8H) throughput 0.499465 | |
ADDV (4S) 1->2 2.267006 | |
ADDV (4S) throughput 0.502526 | |
AESD 1->1 3.002176 | |
AESD 1->2 2.980445 | |
AESD throughput 0.501584 | |
AESD throughput 0.503072 | |
AESD + AESIMC 1->1 3.005629 | |
AESD + AESIMC 1->2 3.150999 | |
AESD + AESIMC throughput 1.003892 | |
AESD + AESIMC throughput 0.503682 | |
AESE 1->1 2.997372 | |
AESE 1->2 2.999030 | |
AESE throughput 0.500672 | |
AESE throughput 0.503077 | |
AESE + AESMC 1->1 2.985516 | |
AESE + AESMC 1->2 3.008607 | |
AESE + AESMC throughput 1.010763 | |
AESE + AESMC throughput 0.503375 | |
AESIMC 1->2 2.005255 | |
AESIMC throughput 0.490820 | |
AESMC 1->2 1.999232 | |
AESMC throughput 0.500673 | |
AND (vector, 8B) 1->2 2.003888 | |
AND (vector, 8B) 1->3 2.002384 | |
AND (vector, 8B) throughput 0.499328 | |
AND (vector, 16B) 1->2 2.001388 | |
AND (vector, 16B) 1->3 1.992553 | |
AND (vector, 16B) throughput 0.500997 | |
BIC (vector, immediate, 4H) 1->1 1.998060 | |
BIC (vector, immediate, 4H) throughput 0.509014 | |
BIC (vector, immediate, 4H) throughput 0.511099 | |
BIC (vector, immediate, 8H) 1->1 2.020976 | |
BIC (vector, immediate, 8H) throughput 0.502404 | |
BIC (vector, immediate, 8H) throughput 0.483333 | |
BIC (vector, immediate, 2S) 1->1 1.998547 | |
BIC (vector, immediate, 2S) throughput 0.500066 | |
BIC (vector, immediate, 2S) throughput 0.502164 | |
BIC (vector, immediate, 4S) 1->1 2.049391 | |
BIC (vector, immediate, 4S) throughput 0.338233 | |
BIC (vector, immediate, 4S) throughput 0.500222 | |
BIC (vector, register, 8B) 1->2 2.001376 | |
BIC (vector, register, 8B) 1->3 2.051676 | |
BIC (vector, register, 8B) throughput 0.500863 | |
BIC (vector, register, 16B) 1->2 2.003846 | |
BIC (vector, register, 16B) 1->3 1.999111 | |
BIC (vector, register, 16B) throughput 0.502505 | |
BIF (vector, 8B) 1->1 2.005589 | |
BIF (vector, 8B) 1->2 2.003887 | |
BIF (vector, 8B) 1->3 1.996849 | |
BIF (vector, 8B) throughput 0.502516 | |
BIF (vector, 8B) throughput 0.504366 | |
BIF (vector, 16B) 1->1 2.004334 | |
BIF (vector, 16B) 1->2 2.001499 | |
BIF (vector, 16B) 1->3 2.015893 | |
BIF (vector, 16B) throughput 0.501276 | |
BIF (vector, 16B) throughput 0.557655 | |
BIT (vector, 8B) 1->1 1.999396 | |
BIT (vector, 8B) 1->2 2.004328 | |
BIT (vector, 8B) 1->3 2.008941 | |
BIT (vector, 8B) throughput 0.503053 | |
BIT (vector, 8B) throughput 0.514536 | |
BIT (vector, 16B) 1->1 2.015329 | |
BIT (vector, 16B) 1->2 2.001538 | |
BIT (vector, 16B) 1->3 2.002713 | |
BIT (vector, 16B) throughput 0.502566 | |
BIT (vector, 16B) throughput 0.523688 | |
BSL (vector, 8B) 1->1 2.001636 | |
BSL (vector, 8B) 1->2 2.007250 | |
BSL (vector, 8B) 1->3 2.008503 | |
BSL (vector, 8B) throughput 0.521760 | |
BSL (vector, 8B) throughput 0.502481 | |
BSL (vector, 16B) 1->1 2.006034 | |
BSL (vector, 16B) 1->2 2.025592 | |
BSL (vector, 16B) 1->3 2.002505 | |
BSL (vector, 16B) throughput 0.498998 | |
BSL (vector, 16B) throughput 0.505919 | |
CLS (vector, 8B) 1->2 1.996446 | |
CLS (vector, 8B) throughput 0.500612 | |
CLS (vector, 16B) 1->2 2.000647 | |
CLS (vector, 16B) throughput 0.500795 | |
CLS (vector, 4H) 1->2 2.000201 | |
CLS (vector, 4H) throughput 0.500506 | |
CLS (vector, 8H) 1->2 1.996122 | |
CLS (vector, 8H) throughput 0.525698 | |
CLS (vector, 2S) 1->2 2.006266 | |
CLS (vector, 2S) throughput 0.499495 | |
CLS (vector, 4S) 1->2 1.999597 | |
CLS (vector, 4S) throughput 0.501263 | |
CLZ (vector, 8B) 1->2 1.998302 | |
CLZ (vector, 8B) throughput 0.500269 | |
CLZ (vector, 16B) 1->2 1.971159 | |
CLZ (vector, 16B) throughput 0.503386 | |
CLZ (vector, 4H) 1->2 2.005548 | |
CLZ (vector, 4H) throughput 0.499646 | |
CLZ (vector, 8H) 1->2 2.000445 | |
CLZ (vector, 8H) throughput 0.501068 | |
CLZ (vector, 2S) 1->2 1.999434 | |
CLZ (vector, 2S) throughput 0.553521 | |
CLZ (vector, 4S) 1->2 2.042433 | |
CLZ (vector, 4S) throughput 0.505118 | |
CMEQ (register, 8B) 1->2 1.997214 | |
CMEQ (register, 8B) 1->3 2.002388 | |
CMEQ (register, 8B) throughput 0.499884 | |
CMEQ (register, 16B) 1->2 2.173807 | |
CMEQ (register, 16B) 1->3 2.114626 | |
CMEQ (register, 16B) throughput 0.489782 | |
CMEQ (register, 4H) 1->2 1.935435 | |
CMEQ (register, 4H) 1->3 1.999433 | |
CMEQ (register, 4H) throughput 0.499959 | |
CMEQ (register, 8H) 1->2 2.000121 | |
CMEQ (register, 8H) 1->3 2.005658 | |
CMEQ (register, 8H) throughput 0.500838 | |
CMEQ (register, 2S) 1->2 2.006908 | |
CMEQ (register, 2S) 1->3 2.004293 | |
CMEQ (register, 2S) throughput 0.505572 | |
CMEQ (register, 4S) 1->2 1.999675 | |
CMEQ (register, 4S) 1->3 2.003320 | |
CMEQ (register, 4S) throughput 0.500628 | |
CMEQ (register, 2D) 1->2 2.001580 | |
CMEQ (register, 2D) 1->3 2.001498 | |
CMEQ (register, 2D) throughput 0.499859 | |
CMEQ (register, D) 1->2 2.001456 | |
CMEQ (register, D) 1->3 1.987589 | |
CMEQ (register, D) throughput 0.501445 | |
CMEQ (zero, 8B) 1->2 2.032940 | |
CMEQ (zero, 8B) throughput 0.501969 | |
CMEQ (zero, 16B) 1->2 1.990272 | |
CMEQ (zero, 16B) throughput 0.500658 | |
CMEQ (zero, 4H) 1->2 2.006763 | |
CMEQ (zero, 4H) throughput 0.500522 | |
CMEQ (zero, 8H) 1->2 2.002307 | |
CMEQ (zero, 8H) throughput 0.499334 | |
CMEQ (zero, 2S) 1->2 1.995399 | |
CMEQ (zero, 2S) throughput 0.499455 | |
CMEQ (zero, 4S) 1->2 2.001982 | |
CMEQ (zero, 4S) throughput 0.501007 | |
CMEQ (zero, 2D) 1->2 1.937706 | |
CMEQ (zero, 2D) throughput 0.532663 | |
CMEQ (zero, D) 1->2 2.001661 | |
CMEQ (zero, D) throughput 0.501109 | |
CMGE (register, 8B) 1->2 2.000605 | |
CMGE (register, 8B) 1->3 2.004128 | |
CMGE (register, 8B) throughput 0.501564 | |
CMGE (register, 16B) 1->2 1.994195 | |
CMGE (register, 16B) 1->3 2.013586 | |
CMGE (register, 16B) throughput 0.420625 | |
CMGE (register, 4H) 1->2 2.035997 | |
CMGE (register, 4H) 1->3 1.999272 | |
CMGE (register, 4H) throughput 0.501260 | |
CMGE (register, 8H) 1->2 2.001499 | |
CMGE (register, 8H) 1->3 2.007896 | |
CMGE (register, 8H) throughput 0.502019 | |
CMGE (register, 2S) 1->2 2.037051 | |
CMGE (register, 2S) 1->3 2.004732 | |
CMGE (register, 2S) throughput 0.487067 | |
CMGE (register, 4S) 1->2 2.018829 | |
CMGE (register, 4S) 1->3 1.996811 | |
CMGE (register, 4S) throughput 0.501660 | |
CMGE (register, 2D) 1->2 1.989375 | |
CMGE (register, 2D) 1->3 2.001091 | |
CMGE (register, 2D) throughput 0.505736 | |
CMGE (register, D) 1->2 2.005376 | |
CMGE (register, D) 1->3 2.001990 | |
CMGE (register, D) throughput 0.499728 | |
CMGE (zero, 8B) 1->2 2.001660 | |
CMGE (zero, 8B) throughput 0.499207 | |
CMGE (zero, 16B) 1->2 2.002143 | |
CMGE (zero, 16B) throughput 0.500010 | |
CMGE (zero, 4H) 1->2 2.003604 | |
CMGE (zero, 4H) throughput 0.503835 | |
CMGE (zero, 8H) 1->2 2.006360 | |
CMGE (zero, 8H) throughput 0.500096 | |
CMGE (zero, 2S) 1->2 1.988250 | |
CMGE (zero, 2S) throughput 0.500926 | |
CMGE (zero, 4S) 1->2 2.004698 | |
CMGE (zero, 4S) throughput 0.504475 | |
CMGE (zero, 2D) 1->2 1.999798 | |
CMGE (zero, 2D) throughput 0.480170 | |
CMGE (zero, D) 1->2 2.001814 | |
CMGE (zero, D) throughput 0.500951 | |
CMGT (register, 8B) 1->2 1.996122 | |
CMGT (register, 8B) 1->3 1.994071 | |
CMGT (register, 8B) throughput 0.501203 | |
CMGT (register, 16B) 1->2 2.001702 | |
CMGT (register, 16B) 1->3 2.001537 | |
CMGT (register, 16B) throughput 0.591036 | |
CMGT (register, 4H) 1->2 2.013325 | |
CMGT (register, 4H) 1->3 1.987754 | |
CMGT (register, 4H) throughput 0.512092 | |
CMGT (register, 8H) 1->2 2.005629 | |
CMGT (register, 8H) 1->3 2.007126 | |
CMGT (register, 8H) throughput 0.501545 | |
CMGT (register, 2S) 1->2 1.998345 | |
CMGT (register, 2S) 1->3 1.994602 | |
CMGT (register, 2S) throughput 0.551390 | |
CMGT (register, 4S) 1->2 2.001172 | |
CMGT (register, 4S) 1->3 2.000444 | |
CMGT (register, 4S) throughput 0.579435 | |
CMGT (register, 2D) 1->2 1.990171 | |
CMGT (register, 2D) 1->3 2.004938 | |
CMGT (register, 2D) throughput 0.512744 | |
CMGT (register, D) 1->2 1.998949 | |
CMGT (register, D) 1->3 2.001499 | |
CMGT (register, D) throughput 0.500116 | |
CMGT (zero, 8B) 1->2 2.001499 | |
CMGT (zero, 8B) throughput 0.500333 | |
CMGT (zero, 16B) 1->2 2.003922 | |
CMGT (zero, 16B) throughput 0.502181 | |
CMGT (zero, 4H) 1->2 2.003968 | |
CMGT (zero, 4H) throughput 0.499843 | |
CMGT (zero, 8H) 1->2 1.996044 | |
CMGT (zero, 8H) throughput 0.501601 | |
CMGT (zero, 2S) 1->2 2.005231 | |
CMGT (zero, 2S) throughput 0.502294 | |
CMGT (zero, 4S) 1->2 2.003645 | |
CMGT (zero, 4S) throughput 0.505179 | |
CMGT (zero, 2D) 1->2 2.002471 | |
CMGT (zero, 2D) throughput 0.499990 | |
CMGT (zero, D) 1->2 2.003808 | |
CMGT (zero, D) throughput 0.447960 | |
CMHI (register, 8B) 1->2 2.000343 | |
CMHI (register, 8B) 1->3 1.999473 | |
CMHI (register, 8B) throughput 0.506225 | |
CMHI (register, 16B) 1->2 1.999718 | |
CMHI (register, 16B) 1->3 1.998629 | |
CMHI (register, 16B) throughput 0.500648 | |
CMHI (register, 4H) 1->2 1.989102 | |
CMHI (register, 4H) 1->3 2.035119 | |
CMHI (register, 4H) throughput 0.506180 | |
CMHI (register, 8H) 1->2 2.006965 | |
CMHI (register, 8H) 1->3 1.999231 | |
CMHI (register, 8H) throughput 0.500946 | |
CMHI (register, 2S) 1->2 2.001378 | |
CMHI (register, 2S) 1->3 2.035977 | |
CMHI (register, 2S) throughput 0.500461 | |
CMHI (register, 4S) 1->2 2.008586 | |
CMHI (register, 4S) 1->3 1.996220 | |
CMHI (register, 4S) throughput 0.504428 | |
CMHI (register, 2D) 1->2 2.003078 | |
CMHI (register, 2D) 1->3 1.992025 | |
CMHI (register, 2D) throughput 0.502781 | |
CMHI (register, D) 1->2 2.004010 | |
CMHI (register, D) 1->3 2.000485 | |
CMHI (register, D) throughput 0.499646 | |
CMHS (register, 8B) 1->2 1.991622 | |
CMHS (register, 8B) 1->3 2.002019 | |
CMHS (register, 8B) throughput 0.496626 | |
CMHS (register, 16B) 1->2 1.999516 | |
CMHS (register, 16B) 1->3 2.002226 | |
CMHS (register, 16B) throughput 0.501696 | |
CMHS (register, 4H) 1->2 2.005998 | |
CMHS (register, 4H) 1->3 2.001783 | |
CMHS (register, 4H) throughput 0.690763 | |
CMHS (register, 8H) 1->2 2.065523 | |
CMHS (register, 8H) 1->3 2.454696 | |
CMHS (register, 8H) throughput 0.499153 | |
CMHS (register, 2S) 1->2 1.995297 | |
CMHS (register, 2S) 1->3 2.228693 | |
CMHS (register, 2S) throughput 0.511217 | |
CMHS (register, 4S) 1->2 1.995689 | |
CMHS (register, 4S) 1->3 2.005425 | |
CMHS (register, 4S) throughput 0.501531 | |
CMHS (register, 2D) 1->2 2.041059 | |
CMHS (register, 2D) 1->3 2.001739 | |
CMHS (register, 2D) throughput 0.499582 | |
CMHS (register, D) 1->2 2.011261 | |
CMHS (register, D) 1->3 2.012988 | |
CMHS (register, D) throughput 0.498172 | |
CMLE (zero, 8B) 1->2 1.970814 | |
CMLE (zero, 8B) throughput 0.500550 | |
CMLE (zero, 16B) 1->2 2.073166 | |
CMLE (zero, 16B) throughput 0.521689 | |
CMLE (zero, 4H) 1->2 2.291393 | |
CMLE (zero, 4H) throughput 0.445284 | |
CMLE (zero, 8H) 1->2 1.997589 | |
CMLE (zero, 8H) throughput 0.484635 | |
CMLE (zero, 2S) 1->2 2.007736 | |
CMLE (zero, 2S) throughput 0.503221 | |
CMLE (zero, 4S) 1->2 1.873106 | |
CMLE (zero, 4S) throughput 0.507329 | |
CMLE (zero, 2D) 1->2 2.284267 | |
CMLE (zero, 2D) throughput 0.504268 | |
CMLE (zero, D) 1->2 2.001492 | |
CMLE (zero, D) throughput 0.501570 | |
CMLT (zero, 8B) 1->2 2.312165 | |
CMLT (zero, 8B) throughput 0.500992 | |
CMLT (zero, 16B) 1->2 1.999069 | |
CMLT (zero, 16B) throughput 0.500911 | |
CMLT (zero, 4H) 1->2 1.986973 | |
CMLT (zero, 4H) throughput 0.503219 | |
CMLT (zero, 8H) 1->2 2.078910 | |
CMLT (zero, 8H) throughput 0.503502 | |
CMLT (zero, 2S) 1->2 2.004264 | |
CMLT (zero, 2S) throughput 0.482685 | |
CMLT (zero, 4S) 1->2 1.997697 | |
CMLT (zero, 4S) throughput 0.506626 | |
CMLT (zero, 2D) 1->2 1.924074 | |
CMLT (zero, 2D) throughput 0.500628 | |
CMLT (zero, D) 1->2 2.002547 | |
CMLT (zero, D) throughput 0.500795 | |
CMTST (register, 8B) 1->2 1.956177 | |
CMTST (register, 8B) 1->3 2.266587 | |
CMTST (register, 8B) throughput 0.501472 | |
CMTST (register, 16B) 1->2 2.003147 | |
CMTST (register, 16B) 1->3 1.994916 | |
CMTST (register, 16B) throughput 0.507358 | |
CMTST (register, 4H) 1->2 1.997420 | |
CMTST (register, 4H) 1->3 1.998788 | |
CMTST (register, 4H) throughput 0.548552 | |
CMTST (register, 8H) 1->2 2.056292 | |
CMTST (register, 8H) 1->3 2.007087 | |
CMTST (register, 8H) throughput 0.499939 | |
CMTST (register, 2S) 1->2 2.001334 | |
CMTST (register, 2S) 1->3 2.007207 | |
CMTST (register, 2S) throughput 0.500491 | |
CMTST (register, 4S) 1->2 1.995359 | |
CMTST (register, 4S) 1->3 1.995922 | |
CMTST (register, 4S) throughput 0.497137 | |
CMTST (register, 2D) 1->2 1.992997 | |
CMTST (register, 2D) 1->3 2.034011 | |
CMTST (register, 2D) throughput 0.499068 | |
CMTST (register, D) 1->2 2.057699 | |
CMTST (register, D) 1->3 2.003239 | |
CMTST (register, D) throughput 0.503841 | |
CNT (8B) 1->2 2.011778 | |
CNT (8B) throughput 0.482074 | |
CNT (16B) 1->2 1.834248 | |
CNT (16B) throughput 0.481441 | |
DUP (element, vector, 8B) 1->2 2.148641 | |
DUP (element, vector, 8B) throughput 0.531609 | |
DUP (element, vector, 16B) 1->2 2.004050 | |
DUP (element, vector, 16B) throughput 0.507774 | |
DUP (element, vector, 4H) 1->2 2.040938 | |
DUP (element, vector, 4H) throughput 0.572068 | |
DUP (element, vector, 8H) 1->2 1.976484 | |
DUP (element, vector, 8H) throughput 0.502851 | |
DUP (element, vector, 2S) 1->2 1.996122 | |
DUP (element, vector, 2S) throughput 0.500768 | |
DUP (element, vector, 4S) 1->2 2.005224 | |
DUP (element, vector, 4S) throughput 0.499849 | |
DUP (element, vector, 2D) 1->2 1.983744 | |
DUP (element, vector, 2D) throughput 0.526271 | |
DUP (element, scalar, B) 1->2 2.001419 | |
DUP (element, scalar, B) throughput 0.501853 | |
DUP (element, scalar, H) 1->2 2.003967 | |
DUP (element, scalar, H) throughput 0.501200 | |
DUP (element, scalar, S) 1->2 2.005183 | |
DUP (element, scalar, S) throughput 0.608651 | |
DUP (element, scalar, D) 1->2 1.954670 | |
DUP (element, scalar, D) throughput 0.507820 | |
DUP (general, 8B) 1->2 roundtrip 9.066604 | |
DUP (general, 8B) throughput 0.501842 | |
DUP (general, 16B) 1->2 roundtrip 9.039889 | |
DUP (general, 16B) throughput 0.501340 | |
DUP (general, 4H) 1->2 roundtrip 9.482526 | |
DUP (general, 4H) throughput 0.433388 | |
DUP (general, 8H) 1->2 roundtrip 9.025672 | |
DUP (general, 8H) throughput 0.502759 | |
DUP (general, 2S) 1->2 roundtrip 9.032102 | |
DUP (general, 2S) throughput 0.500430 | |
DUP (general, 4S) 1->2 roundtrip 9.012267 | |
DUP (general, 4S) throughput 0.502364 | |
DUP (general, 2D) 1->2 roundtrip 9.078728 | |
DUP (general, 2D) throughput 0.501462 | |
EOR (vector, 8B) 1->2 2.002408 | |
EOR (vector, 8B) 1->3 2.084994 | |
EOR (vector, 8B) throughput 0.517456 | |
EOR (vector, 16B) 1->2 1.995085 | |
EOR (vector, 16B) 1->3 2.004534 | |
EOR (vector, 16B) throughput 0.502114 | |
EXT (vector, 8B) 1->2 1.999919 | |
EXT (vector, 8B) 1->3 2.043421 | |
EXT (vector, 8B) throughput 0.505026 | |
EXT (vector, 16B) 1->2 1.997172 | |
EXT (vector, 16B) 1->3 2.014941 | |
EXT (vector, 16B) throughput 0.500496 | |
FABD (vector, 2S) 1->2 2.997977 | |
FABD (vector, 2S) 1->3 3.017332 | |
FABD (vector, 2S) throughput 0.545956 | |
FABD (vector, 4S) 1->2 2.999115 | |
FABD (vector, 4S) 1->3 2.987863 | |
FABD (vector, 4S) throughput 0.498720 | |
FABD (vector, 2D) 1->2 3.041788 | |
FABD (vector, 2D) 1->3 3.003807 | |
FABD (vector, 2D) throughput 0.499232 | |
FABD (scalar, S) 1->2 3.002386 | |
FABD (scalar, S) 1->3 3.136969 | |
FABD (scalar, S) throughput 0.504517 | |
FABD (scalar, D) 1->2 3.021868 | |
FABD (scalar, D) 1->3 3.013891 | |
FABD (scalar, D) throughput 0.500425 | |
FABS (scalar, S) 1->2 2.002389 | |
FABS (scalar, S) throughput 0.501584 | |
FABS (scalar, D) 1->2 1.987221 | |
FABS (scalar, D) throughput 0.501301 | |
FABS (vector, 2S) 1->2 2.002505 | |
FABS (vector, 2S) throughput 0.502177 | |
FABS (vector, 4S) 1->2 2.002507 | |
FABS (vector, 4S) throughput 0.508736 | |
FABS (vector, 2D) 1->2 2.002551 | |
FABS (vector, 2D) throughput 0.500835 | |
FACGE (vector, 2S) 1->2 2.002752 | |
FACGE (vector, 2S) 1->3 2.006364 | |
FACGE (vector, 2S) throughput 0.500379 | |
FACGE (vector, 4S) 1->2 1.996688 | |
FACGE (vector, 4S) 1->3 1.877471 | |
FACGE (vector, 4S) throughput 0.499821 | |
FACGE (vector, 2D) 1->2 1.991583 | |
FACGE (vector, 2D) 1->3 1.996003 | |
FACGE (vector, 2D) throughput 0.499525 | |
FACGE (scalar, S) 1->2 2.004414 | |
FACGE (scalar, S) 1->3 1.999913 | |
FACGE (scalar, S) throughput 0.501225 | |
FACGE (scalar, D) 1->2 2.006074 | |
FACGE (scalar, D) 1->3 2.000203 | |
FACGE (scalar, D) throughput 0.504247 | |
FACGT (vector, 2S) 1->2 2.008307 | |
FACGT (vector, 2S) 1->3 1.999915 | |
FACGT (vector, 2S) throughput 0.501108 | |
FACGT (vector, 4S) 1->2 2.058673 | |
FACGT (vector, 4S) 1->3 2.001821 | |
FACGT (vector, 4S) throughput 0.501898 | |
FACGT (vector, 2D) 1->2 2.001417 | |
FACGT (vector, 2D) 1->3 2.016845 | |
FACGT (vector, 2D) throughput 0.498137 | |
FACGT (scalar, S) 1->2 1.904207 | |
FACGT (scalar, S) 1->3 2.002876 | |
FACGT (scalar, S) throughput 0.505936 | |
FACGT (scalar, D) 1->2 2.004412 | |
FACGT (scalar, D) 1->3 1.997336 | |
FACGT (scalar, D) throughput 0.500476 | |
FADD (vector, 2S) 1->2 3.039415 | |
FADD (vector, 2S) 1->3 3.042276 | |
FADD (vector, 2S) throughput 0.500370 | |
FADD (vector, 4S) 1->2 3.005264 | |
FADD (vector, 4S) 1->3 2.216904 | |
FADD (vector, 4S) throughput 0.512810 | |
FADD (vector, 2D) 1->2 3.064412 | |
FADD (vector, 2D) 1->3 3.024382 | |
FADD (vector, 2D) throughput 0.512824 | |
FADD (scalar, S) 1->2 3.017706 | |
FADD (scalar, S) 1->3 3.137789 | |
FADD (scalar, S) throughput 0.499500 | |
FADD (scalar, D) 1->2 2.997294 | |
FADD (scalar, D) 1->3 3.000085 | |
FADD (scalar, D) throughput 0.500457 | |
FADDP (scalar, S) 1->2 3.017252 | |
FADDP (scalar, S) throughput 0.500131 | |
FADDP (scalar, D) 1->2 3.048722 | |
FADDP (scalar, D) throughput 0.514400 | |
FADDP (vector, 2S) 1->2 2.980663 | |
FADDP (vector, 2S) 1->3 3.023402 | |
FADDP (vector, 2S) throughput 0.500871 | |
FADDP (vector, 4S) 1->2 2.942020 | |
FADDP (vector, 4S) 1->3 3.047022 | |
FADDP (vector, 4S) throughput 0.517003 | |
FADDP (vector, 2D) 1->2 2.997292 | |
FADDP (vector, 2D) 1->3 3.043937 | |
FADDP (vector, 2D) throughput 0.499511 | |
FCCMP (scalar, S) 3->1 1.997130 | |
FCCMP (scalar, S) 3->2 2.021058 | |
FCCMP (scalar, S) 3->3 2.005629 | |
FCCMP (scalar, S) throughput 1.058537 | |
FCCMP (scalar, D) 3->1 2.010040 | |
FCCMP (scalar, D) 3->2 2.042685 | |
FCCMP (scalar, D) 3->3 2.004211 | |
FCCMP (scalar, D) throughput 1.076800 | |
FCCMPE (scalar, S) 3->1 2.005750 | |
FCCMPE (scalar, S) 3->2 2.001013 | |
FCCMPE (scalar, S) 3->3 1.997357 | |
FCCMPE (scalar, S) throughput 0.917496 | |
FCCMPE (scalar, D) 3->1 2.001292 | |
FCCMPE (scalar, D) 3->2 2.005608 | |
FCCMPE (scalar, D) 3->3 2.002110 | |
FCCMPE (scalar, D) throughput 1.078223 | |
FCMEQ (vector, 2S) 1->2 2.006604 | |
FCMEQ (vector, 2S) 1->3 1.867895 | |
FCMEQ (vector, 2S) throughput 0.506825 | |
FCMEQ (vector, 4S) 1->2 2.008853 | |
FCMEQ (vector, 4S) 1->3 1.609889 | |
FCMEQ (vector, 4S) throughput 0.479951 | |
FCMEQ (vector, 2D) 1->2 2.179490 | |
FCMEQ (vector, 2D) 1->3 1.998181 | |
FCMEQ (vector, 2D) throughput 0.500106 | |
FCMEQ (scalar, S) 1->2 1.846760 | |
FCMEQ (scalar, S) 1->3 2.008772 | |
FCMEQ (scalar, S) throughput 0.500204 | |
FCMEQ (scalar, D) 1->2 2.008257 | |
FCMEQ (scalar, D) 1->3 1.976208 | |
FCMEQ (scalar, D) throughput 0.485502 | |
FCMEQ (vector, zero, 2S) 1->2 2.025717 | |
FCMEQ (vector, zero, 2S) throughput 0.502888 | |
FCMEQ (vector, zero, 4S) 1->2 1.997710 | |
FCMEQ (vector, zero, 4S) throughput 0.509201 | |
FCMEQ (vector, zero, 2D) 1->2 2.475723 | |
FCMEQ (vector, zero, 2D) throughput 0.500729 | |
FCMEQ (scalar, zero, S) 1->2 2.064302 | |
FCMEQ (scalar, zero, S) throughput 0.513147 | |
FCMEQ (scalar, zero, D) 1->2 2.003712 | |
FCMEQ (scalar, zero, D) throughput 0.503948 | |
FCMGE (vector, 2S) 1->2 2.004453 | |
FCMGE (vector, 2S) 1->3 1.999152 | |
FCMGE (vector, 2S) throughput 0.500764 | |
FCMGE (vector, 4S) 1->2 2.001011 | |
FCMGE (vector, 4S) 1->3 1.988415 | |
FCMGE (vector, 4S) throughput 0.500795 | |
FCMGE (vector, 2D) 1->2 2.002828 | |
FCMGE (vector, 2D) 1->3 2.002794 | |
FCMGE (vector, 2D) throughput 0.501266 | |
FCMGE (scalar, S) 1->2 2.004365 | |
FCMGE (scalar, S) 1->3 2.001819 | |
FCMGE (scalar, S) throughput 0.499687 | |
FCMGE (scalar, D) 1->2 2.012643 | |
FCMGE (scalar, D) 1->3 2.007910 | |
FCMGE (scalar, D) throughput 0.501121 | |
FCMGE (vector, zero, 2S) 1->2 2.050359 | |
FCMGE (vector, zero, 2S) throughput 0.504173 | |
FCMGE (vector, zero, 4S) 1->2 1.957363 | |
FCMGE (vector, zero, 4S) throughput 0.502483 | |
FCMGE (vector, zero, 2D) 1->2 1.996365 | |
FCMGE (vector, zero, 2D) throughput 0.505976 | |
FCMGE (scalar, zero, S) 1->2 2.006184 | |
FCMGE (scalar, zero, S) throughput 0.500101 | |
FCMGE (scalar, zero, D) 1->2 1.986841 | |
FCMGE (scalar, zero, D) throughput 0.507546 | |
FCMGT (vector, 2S) 1->2 2.007614 | |
FCMGT (vector, 2S) 1->3 1.592154 | |
FCMGT (vector, 2S) throughput 0.491833 | |
FCMGT (vector, 4S) 1->2 2.002790 | |
FCMGT (vector, 4S) 1->3 2.001863 | |
FCMGT (vector, 4S) throughput 0.501749 | |
FCMGT (vector, 2D) 1->2 2.001904 | |
FCMGT (vector, 2D) 1->3 2.005222 | |
FCMGT (vector, 2D) throughput 0.500749 | |
FCMGT (scalar, S) 1->2 2.000807 | |
FCMGT (scalar, S) 1->3 2.002748 | |
FCMGT (scalar, S) throughput 0.514061 | |
FCMGT (scalar, D) 1->2 2.018995 | |
FCMGT (scalar, D) 1->3 2.007733 | |
FCMGT (scalar, D) throughput 0.605849 | |
FCMGT (vector, zero, 2S) 1->2 1.953895 | |
FCMGT (vector, zero, 2S) throughput 0.504525 | |
FCMGT (vector, zero, 4S) 1->2 2.021187 | |
FCMGT (vector, zero, 4S) throughput 0.487339 | |
FCMGT (vector, zero, 2D) 1->2 2.023363 | |
FCMGT (vector, zero, 2D) throughput 0.767000 | |
FCMGT (scalar, zero, S) 1->2 3.610909 | |
FCMGT (scalar, zero, S) throughput 0.474408 | |
FCMGT (scalar, zero, D) 1->2 1.980071 | |
FCMGT (scalar, zero, D) throughput 0.659380 | |
FCMLE (vector, zero, 2S) 1->2 1.973843 | |
FCMLE (vector, zero, 2S) throughput 0.502182 | |
FCMLE (vector, zero, 4S) 1->2 2.006478 | |
FCMLE (vector, zero, 4S) throughput 0.501270 | |
FCMLE (vector, zero, 2D) 1->2 2.001941 | |
FCMLE (vector, zero, 2D) throughput 0.613984 | |
FCMLE (scalar, zero, S) 1->2 2.001172 | |
FCMLE (scalar, zero, S) throughput 0.525673 | |
FCMLE (scalar, zero, D) 1->2 2.018028 | |
FCMLE (scalar, zero, D) throughput 0.492320 | |
FCMLT (vector, zero, 2S) 1->2 1.995014 | |
FCMLT (vector, zero, 2S) throughput 0.488937 | |
FCMLT (vector, zero, 4S) 1->2 2.220259 | |
FCMLT (vector, zero, 4S) throughput 0.503450 | |
FCMLT (vector, zero, 2D) 1->2 2.011091 | |
FCMLT (vector, zero, 2D) throughput 0.459751 | |
FCMLT (scalar, zero, S) 1->2 2.014313 | |
FCMLT (scalar, zero, S) throughput 0.499631 | |
FCMLT (scalar, zero, D) 1->2 2.002511 | |
FCMLT (scalar, zero, D) throughput 0.441847 | |
FCMP (scalar, S) 3->1 2.003352 | |
FCMP (scalar, S) 3->2 2.003953 | |
FCMP (scalar, S) throughput 0.984456 | |
FCMP (scalar, D) 3->1 1.974929 | |
FCMP (scalar, D) 3->2 2.038346 | |
FCMP (scalar, D) throughput 1.002465 | |
FCMPE (scalar, S) 3->1 1.998951 | |
FCMPE (scalar, S) 3->2 2.007125 | |
FCMPE (scalar, S) throughput 1.002804 | |
FCMPE (scalar, D) 3->1 2.007896 | |
FCMPE (scalar, D) 3->2 2.038025 | |
FCMPE (scalar, D) throughput 1.007608 | |
FCSEL (scalar, S) 1->2 1.997943 | |
FCSEL (scalar, S) 1->3 2.006446 | |
FCSEL (scalar, S) 1->4 2.074971 | |
FCSEL (scalar, S) throughput 0.478855 | |
FCSEL (scalar, D) 1->2 1.996204 | |
FCSEL (scalar, D) 1->3 1.996890 | |
FCSEL (scalar, D) 1->4 2.009475 | |
FCSEL (scalar, D) throughput 0.514374 | |
FCVT (S to D) 1->2 2.993055 | |
FCVT (S to D) throughput 0.521553 | |
FCVT (D to S) 1->2 2.997575 | |
FCVT (D to S) throughput 0.396494 | |
FCVTAS (scalar, S to 32-bit GPR) 1->2 roundtrip 11.603616 | |
FCVTAS (scalar, S to 32-bit GPR) throughput 1.036901 | |
FCVTAS (scalar, S to 64-bit GPR) 1->2 roundtrip 13.430579 | |
FCVTAS (scalar, S to 64-bit GPR) throughput 1.067614 | |
FCVTAS (scalar, D to 32-bit GPR) 1->2 roundtrip 10.658934 | |
FCVTAS (scalar, D to 32-bit GPR) throughput 0.849991 | |
FCVTAS (scalar, D to 64-bit GPR) 1->2 roundtrip 9.003498 | |
FCVTAS (scalar, D to 64-bit GPR) throughput 1.221277 | |
FCVTAS (scalar, S to S) 1->2 3.031498 | |
FCVTAS (scalar, S to S) throughput 0.505448 | |
FCVTAS (scalar, S to S) 1->2 2.869918 | |
FCVTAS (scalar, S to S) throughput 0.560695 | |
FCVTAS (scalar, D to D) 1->2 3.169740 | |
FCVTAS (scalar, D to D) throughput 0.468550 | |
FCVTAS (scalar, D to D) 1->2 2.782702 | |
FCVTAS (scalar, D to D) throughput 0.568616 | |
FCVTAS (vector, 2S) 1->2 3.999831 | |
FCVTAS (vector, 2S) throughput 0.509222 | |
FCVTAS (vector, 4S) 1->2 2.866493 | |
FCVTAS (vector, 4S) throughput 0.501625 | |
FCVTAS (vector, 2D) 1->2 2.986932 | |
FCVTAS (vector, 2D) throughput 0.549963 | |
FCVTAU (scalar, S to 32-bit GPR) 1->2 roundtrip 8.818556 | |
FCVTAU (scalar, S to 32-bit GPR) throughput 1.029471 | |
FCVTAU (scalar, S to 64-bit GPR) 1->2 roundtrip 10.525896 | |
FCVTAU (scalar, S to 64-bit GPR) throughput 1.223013 | |
FCVTAU (scalar, D to 32-bit GPR) 1->2 roundtrip 10.470033 | |
FCVTAU (scalar, D to 32-bit GPR) throughput 1.035602 | |
FCVTAU (scalar, D to 64-bit GPR) 1->2 roundtrip 10.495007 | |
FCVTAU (scalar, D to 64-bit GPR) throughput 1.046352 | |
FCVTAU (scalar, S to S) 1->2 2.986819 | |
FCVTAU (scalar, S to S) throughput 0.500172 | |
FCVTAU (scalar, S to S) 1->2 2.889492 | |
FCVTAU (scalar, S to S) throughput 0.506585 | |
FCVTAU (scalar, D to D) 1->2 3.264348 | |
FCVTAU (scalar, D to D) throughput 0.543383 | |
FCVTAU (scalar, D to D) 1->2 2.892832 | |
FCVTAU (scalar, D to D) throughput 0.589352 | |
FCVTAU (vector, 2S) 1->2 2.990128 | |
FCVTAU (vector, 2S) throughput 0.548115 | |
FCVTAU (vector, 4S) 1->2 2.990863 | |
FCVTAU (vector, 4S) throughput 0.658071 | |
FCVTAU (vector, 2D) 1->2 2.956689 | |
FCVTAU (vector, 2D) throughput 0.503791 | |
FCVTL (vector, 4S) 1->2 3.037293 | |
FCVTL (vector, 4S) throughput 0.500050 | |
FCVTL (vector, 2D) 1->2 3.042538 | |
FCVTL (vector, 2D) throughput 0.500810 | |
FCVTL2 (vector, 4S) 1->2 3.116172 | |
FCVTL2 (vector, 4S) throughput 0.507461 | |
FCVTL2 (vector, 2D) 1->2 3.030071 | |
FCVTL2 (vector, 2D) throughput 0.499581 | |
FCVTMS (scalar, S to 32-bit GPR) 1->2 roundtrip 10.101886 | |
FCVTMS (scalar, S to 32-bit GPR) throughput 1.010978 | |
FCVTMS (scalar, S to 64-bit GPR) 1->2 roundtrip 10.024300 | |
FCVTMS (scalar, S to 64-bit GPR) throughput 0.999414 | |
FCVTMS (scalar, D to 32-bit GPR) 1->2 roundtrip 9.993498 | |
FCVTMS (scalar, D to 32-bit GPR) throughput 1.180365 | |
FCVTMS (scalar, D to 64-bit GPR) 1->2 roundtrip 9.539635 | |
FCVTMS (scalar, D to 64-bit GPR) throughput 1.572712 | |
FCVTMS (scalar, S to S) 1->2 2.661297 | |
FCVTMS (scalar, S to S) throughput 0.497103 | |
FCVTMS (scalar, S to S) 1->2 2.834145 | |
FCVTMS (scalar, S to S) throughput 0.641390 | |
FCVTMS (scalar, D to D) 1->2 3.007653 | |
FCVTMS (scalar, D to D) throughput 0.571661 | |
FCVTMS (scalar, D to D) 1->2 2.941764 | |
FCVTMS (scalar, D to D) throughput 0.571747 | |
FCVTMS (vector, 2S) 1->2 3.691594 | |
FCVTMS (vector, 2S) throughput 0.510426 | |
FCVTMS (vector, 4S) 1->2 3.250913 | |
FCVTMS (vector, 4S) throughput 0.464128 | |
FCVTMS (vector, 2D) 1->2 3.285809 | |
FCVTMS (vector, 2D) throughput 0.566691 | |
FCVTMU (scalar, S to 32-bit GPR) 1->2 roundtrip 10.195782 | |
FCVTMU (scalar, S to 32-bit GPR) throughput 0.864335 | |
FCVTMU (scalar, S to 64-bit GPR) 1->2 roundtrip 12.414602 | |
FCVTMU (scalar, S to 64-bit GPR) throughput 1.018955 | |
FCVTMU (scalar, D to 32-bit GPR) 1->2 roundtrip 10.727224 | |
FCVTMU (scalar, D to 32-bit GPR) throughput 1.038673 | |
FCVTMU (scalar, D to 64-bit GPR) 1->2 roundtrip 10.574723 | |
FCVTMU (scalar, D to 64-bit GPR) throughput 1.073544 | |
FCVTMU (scalar, S to S) 1->2 2.859100 | |
FCVTMU (scalar, S to S) throughput 0.535914 | |
FCVTMU (scalar, S to S) 1->2 3.069142 | |
FCVTMU (scalar, S to S) throughput 0.551570 | |
FCVTMU (scalar, D to D) 1->2 2.664293 | |
FCVTMU (scalar, D to D) throughput 0.533239 | |
FCVTMU (scalar, D to D) 1->2 2.839785 | |
FCVTMU (scalar, D to D) throughput 0.577255 | |
FCVTMU (vector, 2S) 1->2 3.009597 | |
FCVTMU (vector, 2S) throughput 0.558682 | |
FCVTMU (vector, 4S) 1->2 5.205633 | |
FCVTMU (vector, 4S) throughput 0.507248 | |
FCVTMU (vector, 2D) 1->2 2.992145 | |
FCVTMU (vector, 2D) throughput 0.534786 | |
FCVTN (vector, 4S) 1->2 3.375417 | |
FCVTN (vector, 4S) throughput 0.546083 | |
FCVTN (vector, 2D) 1->2 3.008991 | |
FCVTN (vector, 2D) throughput 0.597376 | |
FCVTN2 (vector, 4S) 1->1 2.825799 | |
FCVTN2 (vector, 4S) 1->2 3.068806 | |
FCVTN2 (vector, 4S) throughput 0.520936 | |
FCVTN2 (vector, 4S) throughput 0.519790 | |
FCVTN2 (vector, 2D) 1->1 3.562772 | |
FCVTN2 (vector, 2D) 1->2 3.884408 | |
FCVTN2 (vector, 2D) throughput 0.459407 | |
FCVTN2 (vector, 2D) throughput 0.523584 | |
FCVTNS (scalar, S to 32-bit GPR) 1->2 roundtrip 11.140945 | |
FCVTNS (scalar, S to 32-bit GPR) throughput 0.915345 | |
FCVTNS (scalar, S to 64-bit GPR) 1->2 roundtrip 11.418873 | |
FCVTNS (scalar, S to 64-bit GPR) throughput 1.033333 | |
FCVTNS (scalar, D to 32-bit GPR) 1->2 roundtrip 10.348987 | |
FCVTNS (scalar, D to 32-bit GPR) throughput 1.108204 | |
FCVTNS (scalar, D to 64-bit GPR) 1->2 roundtrip 11.268111 | |
FCVTNS (scalar, D to 64-bit GPR) throughput 1.066431 | |
FCVTNS (scalar, S to S) 1->2 1.997886 | |
FCVTNS (scalar, S to S) throughput 0.534696 | |
FCVTNS (scalar, S to S) 1->2 3.027061 | |
FCVTNS (scalar, S to S) throughput 0.499325 | |
FCVTNS (scalar, D to D) 1->2 3.003336 | |
FCVTNS (scalar, D to D) throughput 0.519442 | |
FCVTNS (scalar, D to D) 1->2 3.006380 | |
FCVTNS (scalar, D to D) throughput 0.500126 | |
FCVTNS (vector, 2S) 1->2 3.035755 | |
FCVTNS (vector, 2S) throughput 0.534818 | |
FCVTNS (vector, 4S) 1->2 2.993950 | |
FCVTNS (vector, 4S) throughput 0.549736 | |
FCVTNS (vector, 2D) 1->2 3.008178 | |
FCVTNS (vector, 2D) throughput 0.558094 | |
FCVTNU (scalar, S to 32-bit GPR) 1->2 roundtrip 10.574919 | |
FCVTNU (scalar, S to 32-bit GPR) throughput 1.055316 | |
FCVTNU (scalar, S to 64-bit GPR) 1->2 roundtrip 10.450965 | |
FCVTNU (scalar, S to 64-bit GPR) throughput 1.087463 | |
FCVTNU (scalar, D to 32-bit GPR) 1->2 roundtrip 11.447600 | |
FCVTNU (scalar, D to 32-bit GPR) throughput 1.001384 | |
FCVTNU (scalar, D to 64-bit GPR) 1->2 roundtrip 10.571521 | |
FCVTNU (scalar, D to 64-bit GPR) throughput 1.029495 | |
FCVTNU (scalar, S to S) 1->2 2.965977 | |
FCVTNU (scalar, S to S) throughput 0.492871 | |
FCVTNU (scalar, S to S) 1->2 2.941985 | |
FCVTNU (scalar, S to S) throughput 0.500926 | |
FCVTNU (scalar, D to D) 1->2 3.646507 | |
FCVTNU (scalar, D to D) throughput 0.752619 | |
FCVTNU (scalar, D to D) 1->2 3.282938 | |
FCVTNU (scalar, D to D) throughput 0.718735 | |
FCVTNU (vector, 2S) 1->2 4.070864 | |
FCVTNU (vector, 2S) throughput 0.571288 | |
FCVTNU (vector, 4S) 1->2 3.004129 | |
FCVTNU (vector, 4S) throughput 0.576835 | |
FCVTNU (vector, 2D) 1->2 2.538863 | |
FCVTNU (vector, 2D) throughput 0.628689 | |
FCVTPS (scalar, S to 32-bit GPR) 1->2 roundtrip 10.717571 | |
FCVTPS (scalar, S to 32-bit GPR) throughput 1.073475 | |
FCVTPS (scalar, S to 64-bit GPR) 1->2 roundtrip 13.414665 | |
FCVTPS (scalar, S to 64-bit GPR) throughput 1.101739 | |
FCVTPS (scalar, D to 32-bit GPR) 1->2 roundtrip 12.572712 | |
FCVTPS (scalar, D to 32-bit GPR) throughput 1.423803 | |
FCVTPS (scalar, D to 64-bit GPR) 1->2 roundtrip 13.185186 | |
FCVTPS (scalar, D to 64-bit GPR) throughput 1.017215 | |
FCVTPS (scalar, S to S) 1->2 3.145810 | |
FCVTPS (scalar, S to S) throughput 0.575828 | |
FCVTPS (scalar, S to S) 1->2 3.146623 | |
FCVTPS (scalar, S to S) throughput 0.502160 | |
FCVTPS (scalar, D to D) 1->2 3.056686 | |
FCVTPS (scalar, D to D) throughput 0.505396 | |
FCVTPS (scalar, D to D) 1->2 3.215751 | |
FCVTPS (scalar, D to D) throughput 0.591519 | |
FCVTPS (vector, 2S) 1->2 3.006195 | |
FCVTPS (vector, 2S) throughput 0.568417 | |
FCVTPS (vector, 4S) 1->2 2.995118 | |
FCVTPS (vector, 4S) throughput 0.495055 | |
FCVTPS (vector, 2D) 1->2 2.999836 | |
FCVTPS (vector, 2D) throughput 0.819517 | |
FCVTPU (scalar, S to 32-bit GPR) 1->2 roundtrip 10.446611 | |
FCVTPU (scalar, S to 32-bit GPR) throughput 0.913548 | |
FCVTPU (scalar, S to 64-bit GPR) 1->2 roundtrip 9.523916 | |
FCVTPU (scalar, S to 64-bit GPR) throughput 1.070175 | |
FCVTPU (scalar, D to 32-bit GPR) 1->2 roundtrip 11.426467 | |
FCVTPU (scalar, D to 32-bit GPR) throughput 1.037729 | |
FCVTPU (scalar, D to 64-bit GPR) 1->2 roundtrip 10.584542 | |
FCVTPU (scalar, D to 64-bit GPR) throughput 1.070458 | |
FCVTPU (scalar, S to S) 1->2 3.041627 | |
FCVTPU (scalar, S to S) throughput 0.527059 | |
FCVTPU (scalar, S to S) 1->2 2.829353 | |
FCVTPU (scalar, S to S) throughput 0.506241 | |
FCVTPU (scalar, D to D) 1->2 3.523051 | |
FCVTPU (scalar, D to D) throughput 0.480371 | |
FCVTPU (scalar, D to D) 1->2 3.007856 | |
FCVTPU (scalar, D to D) throughput 0.526266 | |
FCVTPU (vector, 2S) 1->2 2.711960 | |
FCVTPU (vector, 2S) throughput 0.466104 | |
FCVTPU (vector, 4S) 1->2 2.925887 | |
FCVTPU (vector, 4S) throughput 0.568106 | |
FCVTPU (vector, 2D) 1->2 3.004981 | |
FCVTPU (vector, 2D) throughput 0.491573 | |
FCVTXN (scalar, D) 1->2 3.041785 | |
FCVTXN (scalar, D) throughput 0.500409 | |
FCVTXN (vector, 2D) 1->2 3.009921 | |
FCVTXN (vector, 2D) throughput 0.513317 | |
FCVTXN2 (vector, 2D) 1->1 3.064671 | |
FCVTXN2 (vector, 2D) 1->2 3.527573 | |
FCVTXN2 (vector, 2D) throughput 0.438883 | |
FCVTXN2 (vector, 2D) throughput 0.521689 | |
FCVTZS (scalar, integer, S to 32-bit GPR) 1->2 roundtrip 10.041067 | |
FCVTZS (scalar, integer, S to 32-bit GPR) throughput 1.039943 | |
FCVTZS (scalar, integer, S to 64-bit GPR) 1->2 roundtrip 13.757289 | |
FCVTZS (scalar, integer, S to 64-bit GPR) throughput 1.749955 | |
FCVTZS (scalar, integer, D to 32-bit GPR) 1->2 roundtrip 10.774294 | |
FCVTZS (scalar, integer, D to 32-bit GPR) throughput 1.056831 | |
FCVTZS (scalar, integer, D to 64-bit GPR) 1->2 roundtrip 10.506338 | |
FCVTZS (scalar, integer, D to 64-bit GPR) throughput 1.103998 | |
FCVTZS (scalar, integer, S to S) 1->2 2.999879 | |
FCVTZS (scalar, integer, S to S) throughput 0.583946 | |
FCVTZS (scalar, integer, S to S) 1->2 3.072139 | |
FCVTZS (scalar, integer, S to S) throughput 0.626512 | |
FCVTZS (scalar, integer, D to D) 1->2 3.119773 | |
FCVTZS (scalar, integer, D to D) throughput 0.453751 | |
FCVTZS (scalar, integer, D to D) 1->2 2.892484 | |
FCVTZS (scalar, integer, D to D) throughput 0.601942 | |
FCVTZS (vector, integer, 2S) 1->2 3.007290 | |
FCVTZS (vector, integer, 2S) throughput 0.373931 | |
FCVTZS (vector, integer, 4S) 1->2 3.387503 | |
FCVTZS (vector, integer, 4S) throughput 0.498858 | |
FCVTZS (vector, integer, 2D) 1->2 3.004697 | |
FCVTZS (vector, integer, 2D) throughput 0.499621 | |
FCVTZS (scalar, fixed-point, S to 32-bit GPR) 1->2 roundtrip 10.398485 | |
FCVTZS (scalar, fixed-point, S to 32-bit GPR) throughput 1.005020 | |
FCVTZS (scalar, fixed-point, S to 64-bit GPR) 1->2 roundtrip 10.056076 | |
FCVTZS (scalar, fixed-point, S to 64-bit GPR) throughput 1.006716 | |
FCVTZS (scalar, fixed-point, D to 32-bit GPR) 1->2 roundtrip 10.816793 | |
FCVTZS (scalar, fixed-point, D to 32-bit GPR) throughput 1.008710 | |
FCVTZS (scalar, fixed-point, D to 64-bit GPR) 1->2 roundtrip 10.099570 | |
FCVTZS (scalar, fixed-point, D to 64-bit GPR) throughput 1.002450 | |
FCVTZS (scalar, fixed-point, S to S) 1->2 3.006682 | |
FCVTZS (scalar, fixed-point, S to S) throughput 0.503361 | |
FCVTZS (scalar, fixed-point, S to S) 1->2 3.004332 | |
FCVTZS (scalar, fixed-point, S to S) throughput 0.501524 | |
FCVTZS (scalar, fixed-point, D to D) 1->2 3.004171 | |
FCVTZS (scalar, fixed-point, D to D) throughput 0.512814 | |
FCVTZS (scalar, fixed-point, D to D) 1->2 2.985498 | |
FCVTZS (scalar, fixed-point, D to D) throughput 0.414732 | |
FCVTZS (vector, fixed-point, 2S) 1->2 3.020926 | |
FCVTZS (vector, fixed-point, 2S) throughput 0.559360 | |
FCVTZS (vector, fixed-point, 4S) 1->2 5.346975 | |
FCVTZS (vector, fixed-point, 4S) throughput 0.498832 | |
FCVTZS (vector, fixed-point, 2D) 1->2 3.007552 | |
FCVTZS (vector, fixed-point, 2D) throughput 0.495271 | |
FCVTZU (scalar, integer, S to 32-bit GPR) 1->2 roundtrip 10.145493 | |
FCVTZU (scalar, integer, S to 32-bit GPR) throughput 1.034596 | |
FCVTZU (scalar, integer, S to 64-bit GPR) 1->2 roundtrip 11.097452 | |
FCVTZU (scalar, integer, S to 64-bit GPR) throughput 1.060496 | |
FCVTZU (scalar, integer, D to 32-bit GPR) 1->2 roundtrip 12.247709 | |
FCVTZU (scalar, integer, D to 32-bit GPR) throughput 1.035967 | |
FCVTZU (scalar, integer, D to 64-bit GPR) 1->2 roundtrip 11.609988 | |
FCVTZU (scalar, integer, D to 64-bit GPR) throughput 0.976461 | |
FCVTZU (scalar, integer, S to S) 1->2 3.016074 | |
FCVTZU (scalar, integer, S to S) throughput 0.466359 | |
FCVTZU (scalar, integer, S to S) 1->2 3.004079 | |
FCVTZU (scalar, integer, S to S) throughput 0.498811 | |
FCVTZU (scalar, integer, D to D) 1->2 3.009625 | |
FCVTZU (scalar, integer, D to D) throughput 0.459779 | |
FCVTZU (scalar, integer, D to D) 1->2 3.222534 | |
FCVTZU (scalar, integer, D to D) throughput 0.546707 | |
FCVTZU (vector, integer, 2S) 1->2 3.676983 | |
FCVTZU (vector, integer, 2S) throughput 0.572760 | |
FCVTZU (vector, integer, 4S) 1->2 3.831612 | |
FCVTZU (vector, integer, 4S) throughput 0.547168 | |
FCVTZU (vector, integer, 2D) 1->2 3.278282 | |
FCVTZU (vector, integer, 2D) throughput 0.502514 | |
FCVTZU (scalar, fixed-point, S to 32-bit GPR) 1->2 roundtrip 8.421535 | |
FCVTZU (scalar, fixed-point, S to 32-bit GPR) throughput 1.001660 | |
FCVTZU (scalar, fixed-point, S to 64-bit GPR) 1->2 roundtrip 10.507977 | |
FCVTZU (scalar, fixed-point, S to 64-bit GPR) throughput 1.019138 | |
FCVTZU (scalar, fixed-point, D to 32-bit GPR) 1->2 roundtrip 10.230273 | |
FCVTZU (scalar, fixed-point, D to 32-bit GPR) throughput 0.912596 | |
FCVTZU (scalar, fixed-point, D to 64-bit GPR) 1->2 roundtrip 12.176381 | |
FCVTZU (scalar, fixed-point, D to 64-bit GPR) throughput 1.092583 | |
FCVTZU (scalar, fixed-point, S to S) 1->2 3.238013 | |
FCVTZU (scalar, fixed-point, S to S) throughput 0.550197 | |
FCVTZU (scalar, fixed-point, S to S) 1->2 2.951210 | |
FCVTZU (scalar, fixed-point, S to S) throughput 0.500965 | |
FCVTZU (scalar, fixed-point, D to D) 1->2 3.234381 | |
FCVTZU (scalar, fixed-point, D to D) throughput 0.506615 | |
FCVTZU (scalar, fixed-point, D to D) 1->2 3.012834 | |
FCVTZU (scalar, fixed-point, D to D) throughput 0.377403 | |
FCVTZU (vector, fixed-point, 2S) 1->2 3.011905 | |
FCVTZU (vector, fixed-point, 2S) throughput 0.568191 | |
FCVTZU (vector, fixed-point, 4S) 1->2 2.986890 | |
FCVTZU (vector, fixed-point, 4S) throughput 0.426544 | |
FCVTZU (vector, fixed-point, 2D) 1->2 2.998424 | |
FCVTZU (vector, fixed-point, 2D) throughput 0.604811 | |
FDIV (vector, 2S) 1->2 8.524710 | |
FDIV (vector, 2S) 1->3 8.484255 | |
FDIV (vector, 2S) throughput 1.042281 | |
FDIV (vector, 4S) 1->2 8.820111 | |
FDIV (vector, 4S) 1->3 11.110762 | |
FDIV (vector, 4S) throughput 1.584961 | |
FDIV (vector, 2D) 1->2 11.640530 | |
FDIV (vector, 2D) 1->3 11.415876 | |
FDIV (vector, 2D) throughput 2.203663 | |
FDIV (scalar, S) 1->2 8.402121 | |
FDIV (scalar, S) 1->3 8.582267 | |
FDIV (scalar, S) throughput 1.072185 | |
FDIV (scalar, D) 1->2 10.786627 | |
FDIV (scalar, D) 1->3 8.159040 | |
FDIV (scalar, D) throughput 1.001098 | |
FMADD (scalar, S) 1->2 4.003807 | |
FMADD (scalar, S) 1->3 4.005183 | |
FMADD (scalar, S) 1->4 4.005709 | |
FMADD (scalar, S) throughput 0.504211 | |
FMADD (scalar, D) 1->2 4.009919 | |
FMADD (scalar, D) 1->3 4.051048 | |
FMADD (scalar, D) 1->4 3.995396 | |
FMADD (scalar, D) throughput 0.499899 | |
FMAX (vector, 2S) 1->2 1.989215 | |
FMAX (vector, 2S) 1->3 1.996366 | |
FMAX (vector, 2S) throughput 0.503665 | |
FMAX (vector, 4S) 1->2 2.001537 | |
FMAX (vector, 4S) 1->3 2.010570 | |
FMAX (vector, 4S) throughput 0.500496 | |
FMAX (vector, 2D) 1->2 2.001564 | |
FMAX (vector, 2D) 1->3 2.025912 | |
FMAX (vector, 2D) throughput 0.649603 | |
FMAX (scalar, S) 1->2 2.005848 | |
FMAX (scalar, S) 1->3 2.005427 | |
FMAX (scalar, S) throughput 0.537431 | |
FMAX (scalar, D) 1->2 1.774451 | |
FMAX (scalar, D) 1->3 2.105003 | |
FMAX (scalar, D) throughput 0.618553 | |
FMAXNM (vector, 2S) 1->2 1.992366 | |
FMAXNM (vector, 2S) 1->3 6.331647 | |
FMAXNM (vector, 2S) throughput 0.489815 | |
FMAXNM (vector, 4S) 1->2 1.804837 | |
FMAXNM (vector, 4S) 1->3 2.040985 | |
FMAXNM (vector, 4S) throughput 0.564068 | |
FMAXNM (vector, 2D) 1->2 2.419328 | |
FMAXNM (vector, 2D) 1->3 2.105843 | |
FMAXNM (vector, 2D) throughput 0.885836 | |
FMAXNM (scalar, S) 1->2 5.322586 | |
FMAXNM (scalar, S) 1->3 2.043239 | |
FMAXNM (scalar, S) throughput 0.679429 | |
FMAXNM (scalar, D) 1->2 2.160803 | |
FMAXNM (scalar, D) 1->3 2.849304 | |
FMAXNM (scalar, D) throughput 0.922383 | |
FMAXNMP (scalar) 1->2 0.576800 | |
FMAXNMP (scalar) throughput 0.499077 | |
FMAXNMP (vector, 2S) 1->2 1.999655 | |
FMAXNMP (vector, 2S) 1->3 1.966016 | |
FMAXNMP (vector, 2S) throughput 0.509083 | |
FMAXNMP (vector, 4S) 1->2 2.003527 | |
FMAXNMP (vector, 4S) 1->3 2.023556 | |
FMAXNMP (vector, 4S) throughput 0.721581 | |
FMAXNMP (vector, 2D) 1->2 2.360014 | |
FMAXNMP (vector, 2D) 1->3 1.996395 | |
FMAXNMP (vector, 2D) throughput 0.500226 | |
FMAXNMV (4S) 1->2 3.010154 | |
FMAXNMV (4S) throughput 0.550183 | |
FMAXP (scalar) 1->2 2.223625 | |
FMAXP (scalar) throughput 0.516044 | |
FMAXP (vector, 2S) 1->2 2.085603 | |
FMAXP (vector, 2S) 1->3 1.996345 | |
FMAXP (vector, 2S) throughput 0.568192 | |
FMAXP (vector, 4S) 1->2 1.998261 | |
FMAXP (vector, 4S) 1->3 2.001661 | |
FMAXP (vector, 4S) throughput 0.507957 | |
FMAXP (vector, 2D) 1->2 2.019602 | |
FMAXP (vector, 2D) 1->3 3.541763 | |
FMAXP (vector, 2D) throughput 0.542707 | |
FMAXV (4S) 1->2 3.079730 | |
FMAXV (4S) throughput 0.537902 | |
FMIN (vector, 2S) 1->2 2.004649 | |
FMIN (vector, 2S) 1->3 2.001660 | |
FMIN (vector, 2S) throughput 0.528470 | |
FMIN (vector, 4S) 1->2 1.859301 | |
FMIN (vector, 4S) 1->3 1.996568 | |
FMIN (vector, 4S) throughput 0.559163 | |
FMIN (vector, 2D) 1->2 2.042478 | |
FMIN (vector, 2D) 1->3 1.985137 | |
FMIN (vector, 2D) throughput 0.527988 | |
FMIN (scalar, S) 1->2 2.001660 | |
FMIN (scalar, S) 1->3 1.893246 | |
FMIN (scalar, S) throughput 0.499292 | |
FMIN (scalar, D) 1->2 1.938327 | |
FMIN (scalar, D) 1->3 2.014141 | |
FMIN (scalar, D) throughput 0.552703 | |
FMINNM (vector, 2S) 1->2 1.998302 | |
FMINNM (vector, 2S) 1->3 2.002712 | |
FMINNM (vector, 2S) throughput 0.500273 | |
FMINNM (vector, 4S) 1->2 2.235015 | |
FMINNM (vector, 4S) 1->3 1.991421 | |
FMINNM (vector, 4S) throughput 0.443074 | |
FMINNM (vector, 2D) 1->2 2.108962 | |
FMINNM (vector, 2D) 1->3 2.025632 | |
FMINNM (vector, 2D) throughput 0.508916 | |
FMINNM (scalar, S) 1->2 2.596537 | |
FMINNM (scalar, S) 1->3 1.922710 | |
FMINNM (scalar, S) throughput 0.547823 | |
FMINNM (scalar, D) 1->2 2.124868 | |
FMINNM (scalar, D) 1->3 1.952563 | |
FMINNM (scalar, D) throughput 0.569998 | |
FMINNMP (scalar) 1->2 2.042396 | |
FMINNMP (scalar) throughput 0.582984 | |
FMINNMP (vector, 2S) 1->2 2.001051 | |
FMINNMP (vector, 2S) 1->3 2.002632 | |
FMINNMP (vector, 2S) throughput 0.552115 | |
FMINNMP (vector, 4S) 1->2 1.948231 | |
FMINNMP (vector, 4S) 1->3 2.001200 | |
FMINNMP (vector, 4S) throughput 0.552306 | |
FMINNMP (vector, 2D) 1->2 2.001619 | |
FMINNMP (vector, 2D) 1->3 2.284364 | |
FMINNMP (vector, 2D) throughput 0.520191 | |
FMINNMV (4S) 1->2 3.261729 | |
FMINNMV (4S) throughput 0.530006 | |
FMINP (scalar) 1->2 2.112255 | |
FMINP (scalar) throughput 0.534266 | |
FMINP (vector, 2S) 1->2 2.009556 | |
FMINP (vector, 2S) 1->3 1.966754 | |
FMINP (vector, 2S) throughput 0.536985 | |
FMINP (vector, 4S) 1->2 1.990420 | |
FMINP (vector, 4S) 1->3 2.237802 | |
FMINP (vector, 4S) throughput 0.663710 | |
FMINP (vector, 2D) 1->2 1.983723 | |
FMINP (vector, 2D) 1->3 2.004440 | |
FMINP (vector, 2D) throughput 0.536236 | |
FMINV (4S) 1->2 3.236783 | |
FMINV (4S) throughput 0.471801 | |
FMLA (by element, S) 1->1 4.510554 | |
FMLA (by element, S) 1->2 4.481910 | |
FMLA (by element, S) 1->3 4.062277 | |
FMLA (by element, S) throughput 0.582743 | |
FMLA (by element, S) throughput 0.529383 | |
FMLA (by element, D) 1->1 4.530256 | |
FMLA (by element, D) 1->2 4.094104 | |
FMLA (by element, D) 1->3 3.994275 | |
FMLA (by element, D) throughput 0.540946 | |
FMLA (by element, D) throughput 0.498731 | |
FMLA (by element, 2S) 1->1 4.859091 | |
FMLA (by element, 2S) 1->2 4.123458 | |
FMLA (by element, 2S) 1->3 3.212912 | |
FMLA (by element, 2S) throughput 0.500852 | |
FMLA (by element, 2S) throughput 0.524038 | |
FMLA (by element, 4S) 1->1 4.158025 | |
FMLA (by element, 4S) 1->2 4.016117 | |
FMLA (by element, 4S) 1->3 4.487265 | |
FMLA (by element, 4S) throughput 0.502025 | |
FMLA (by element, 4S) throughput 0.501211 | |
FMLA (by element, 2D) 1->1 4.317128 | |
FMLA (by element, 2D) 1->2 4.020326 | |
FMLA (by element, 2D) 1->3 4.189396 | |
FMLA (by element, 2D) throughput 0.541501 | |
FMLA (by element, 2D) throughput 0.466347 | |
FMLA (vector, 2S) 1->1 4.276643 | |
FMLA (vector, 2S) 1->2 4.325360 | |
FMLA (vector, 2S) 1->3 4.652830 | |
FMLA (vector, 2S) throughput 0.500935 | |
FMLA (vector, 2S) throughput 0.528830 | |
FMLA (vector, 4S) 1->1 4.569606 | |
FMLA (vector, 4S) 1->2 4.003032 | |
FMLA (vector, 4S) 1->3 3.997783 | |
FMLA (vector, 4S) throughput 0.563335 | |
FMLA (vector, 4S) throughput 0.532048 | |
FMLA (vector, 2D) 1->1 3.391062 | |
FMLA (vector, 2D) 1->2 4.005708 | |
FMLA (vector, 2D) 1->3 4.572441 | |
FMLA (vector, 2D) throughput 0.563265 | |
FMLA (vector, 2D) throughput 0.560861 | |
FMLS (by element, S) 1->1 4.605425 | |
FMLS (by element, S) 1->2 3.092525 | |
FMLS (by element, S) 1->3 4.012552 | |
FMLS (by element, S) throughput 0.543858 | |
FMLS (by element, S) throughput 0.558041 | |
FMLS (by element, D) 1->1 5.271855 | |
FMLS (by element, D) 1->2 4.060299 | |
FMLS (by element, D) 1->3 4.334880 | |
FMLS (by element, D) throughput 0.501913 | |
FMLS (by element, D) throughput 0.522991 | |
FMLS (by element, 2S) 1->1 4.648784 | |
FMLS (by element, 2S) 1->2 4.029255 | |
FMLS (by element, 2S) 1->3 3.655245 | |
FMLS (by element, 2S) throughput 0.927654 | |
FMLS (by element, 2S) throughput 0.687133 | |
FMLS (by element, 4S) 1->1 3.863717 | |
FMLS (by element, 4S) 1->2 4.357051 | |
FMLS (by element, 4S) 1->3 3.688806 | |
FMLS (by element, 4S) throughput 0.523209 | |
FMLS (by element, 4S) throughput 0.531783 | |
FMLS (by element, 2D) 1->1 4.364202 | |
FMLS (by element, 2D) 1->2 4.489878 | |
FMLS (by element, 2D) 1->3 3.998704 | |
FMLS (by element, 2D) throughput 0.504997 | |
FMLS (by element, 2D) throughput 0.506586 | |
FMLS (vector, 2S) 1->1 4.252711 | |
FMLS (vector, 2S) 1->2 3.949617 | |
FMLS (vector, 2S) 1->3 4.765188 | |
FMLS (vector, 2S) throughput 0.500541 | |
FMLS (vector, 2S) throughput 0.541270 | |
FMLS (vector, 4S) 1->1 4.011409 | |
FMLS (vector, 4S) 1->2 4.270038 | |
FMLS (vector, 4S) 1->3 4.024905 | |
FMLS (vector, 4S) throughput 0.534261 | |
FMLS (vector, 4S) throughput 0.536869 | |
FMLS (vector, 2D) 1->1 4.005548 | |
FMLS (vector, 2D) 1->2 3.482984 | |
FMLS (vector, 2D) 1->3 4.002141 | |
FMLS (vector, 2D) throughput 0.502889 | |
FMLS (vector, 2D) throughput 0.509518 | |
FMOV (S to 32-bit GPR) 1->2 roundtrip 7.262409 | |
FMOV (S to 32-bit GPR) throughput 1.138684 | |
FMOV (D to 64-bit GPR) 1->2 roundtrip 7.532818 | |
FMOV (D to 64-bit GPR) throughput 1.114386 | |
FMOV (D[1] to 64-bit GPR) 1->2 roundtrip 7.387935 | |
FMOV (D[1] to 64-bit GPR) throughput 1.098989 | |
FMOV (S from 32-bit GPR) 1->2 roundtrip 7.336815 | |
FMOV (S from 32-bit GPR) throughput 0.550816 | |
FMOV (D from 64-bit GPR) 1->2 roundtrip 7.315407 | |
FMOV (D from 64-bit GPR) throughput 0.535606 | |
FMOV (D[1] from 64-bit GPR) 1->2 roundtrip 9.993927 | |
FMOV (D[1] from 64-bit GPR) throughput 0.532862 | |
FMSUB (scalar, S) 1->2 4.882739 | |
FMSUB (scalar, S) 1->3 4.128418 | |
FMSUB (scalar, S) 1->4 4.246758 | |
FMSUB (scalar, S) throughput 0.531251 | |
FMSUB (scalar, D) 1->2 4.327763 | |
FMSUB (scalar, D) 1->3 4.469558 | |
FMSUB (scalar, D) 1->4 4.775833 | |
FMSUB (scalar, D) throughput 0.520588 | |
FMUL (by element, 2S) 1->2 4.186752 | |
FMUL (by element, 2S) 1->3 4.001174 | |
FMUL (by element, 2S) throughput 0.437302 | |
FMUL (by element, 4S) 1->2 4.127683 | |
FMUL (by element, 4S) 1->3 4.001418 | |
FMUL (by element, 4S) throughput 0.549621 | |
FMUL (by element, 2D) 1->2 4.036038 | |
FMUL (by element, 2D) 1->3 4.007074 | |
FMUL (by element, 2D) throughput 0.649999 | |
FMUL (vector, 2S) 1->2 4.045654 | |
FMUL (vector, 2S) 1->3 4.262703 | |
FMUL (vector, 2S) throughput 0.498826 | |
FMUL (vector, 4S) 1->2 4.905140 | |
FMUL (vector, 4S) 1->3 4.243440 | |
FMUL (vector, 4S) throughput 0.500774 | |
FMUL (vector, 2D) 1->2 4.002231 | |
FMUL (vector, 2D) 1->3 4.000685 | |
FMUL (vector, 2D) throughput 0.531302 | |
FMUL (scalar, S) 1->2 4.107715 | |
FMUL (scalar, S) 1->3 4.478802 | |
FMUL (scalar, S) throughput 0.535267 | |
FMUL (scalar, D) 1->2 4.002019 | |
FMUL (scalar, D) 1->3 4.081938 | |
FMUL (scalar, D) throughput 0.511456 | |
FMULX (vector, 2S) 1->2 4.075666 | |
FMULX (vector, 2S) 1->3 4.104338 | |
FMULX (vector, 2S) throughput 0.617016 | |
FMULX (vector, 4S) 1->2 4.052475 | |
FMULX (vector, 4S) 1->3 4.568221 | |
FMULX (vector, 4S) throughput 0.586538 | |
FMULX (vector, 2D) 1->2 4.291056 | |
FMULX (vector, 2D) 1->3 4.038773 | |
FMULX (vector, 2D) throughput 0.501289 | |
FMULX (scalar, S) 1->2 4.525187 | |
FMULX (scalar, S) 1->3 5.017153 | |
FMULX (scalar, S) throughput 0.528340 | |
FMULX (scalar, D) 1->2 4.084123 | |
FMULX (scalar, D) 1->3 4.098681 | |
FMULX (scalar, D) throughput 0.507264 | |
FMULX (by element, 2S) 1->2 4.200224 | |
FMULX (by element, 2S) 1->3 4.011031 | |
FMULX (by element, 2S) throughput 0.569895 | |
FMULX (by element, 4S) 1->2 4.537691 | |
FMULX (by element, 4S) 1->3 4.338192 | |
FMULX (by element, 4S) throughput 0.566259 | |
FMULX (by element, 2D) 1->2 4.499638 | |
FMULX (by element, 2D) 1->3 4.005844 | |
FMULX (by element, 2D) throughput 0.536081 | |
FNEG (scalar, S) 1->2 2.013848 | |
FNEG (scalar, S) throughput 0.533214 | |
FNEG (scalar, D) 1->2 2.005183 | |
FNEG (scalar, D) throughput 0.557990 | |
FNEG (vector, 2S) 1->2 2.008787 | |
FNEG (vector, 2S) throughput 0.782475 | |
FNEG (vector, 4S) 1->2 2.013588 | |
FNEG (vector, 4S) throughput 0.490816 | |
FNEG (vector, 2D) 1->2 2.002299 | |
FNEG (vector, 2D) throughput 0.603482 | |
FNMADD (scalar, S) 1->2 3.502479 | |
FNMADD (scalar, S) 1->3 4.642398 | |
FNMADD (scalar, S) 1->4 4.241711 | |
FNMADD (scalar, S) throughput 0.580311 | |
FNMADD (scalar, D) 1->2 4.304259 | |
FNMADD (scalar, D) 1->3 4.088608 | |
FNMADD (scalar, D) 1->4 3.949732 | |
FNMADD (scalar, D) throughput 0.571812 | |
FNMSUB (scalar, S) 1->2 6.207944 | |
FNMSUB (scalar, S) 1->3 4.475177 | |
FNMSUB (scalar, S) 1->4 4.184392 | |
FNMSUB (scalar, S) throughput 0.509574 | |
FNMSUB (scalar, D) 1->2 4.173437 | |
FNMSUB (scalar, D) 1->3 3.948129 | |
FNMSUB (scalar, D) 1->4 4.049152 | |
FNMSUB (scalar, D) throughput 0.511223 | |
FNMUL (scalar, S) 1->2 4.283304 | |
FNMUL (scalar, S) 1->3 3.865035 | |
FNMUL (scalar, S) throughput 0.497975 | |
FNMUL (scalar, D) 1->2 4.072647 | |
FNMUL (scalar, D) 1->3 4.518981 | |
FNMUL (scalar, D) throughput 0.500854 | |
FRECPE (scalar, S) 1->2 3.032585 | |
FRECPE (scalar, S) throughput 1.057892 | |
FRECPE (scalar, D) 1->2 2.646545 | |
FRECPE (scalar, D) throughput 1.041507 | |
FRECPE (vector, 2S) 1->2 3.541906 | |
FRECPE (vector, 2S) throughput 1.152230 | |
FRECPE (vector, 4S) 1->2 4.388301 | |
FRECPE (vector, 4S) throughput 2.123917 | |
FRECPE (vector, 2D) 1->2 4.826315 | |
FRECPE (vector, 2D) throughput 1.762349 | |
FRECPS (vector, 2S) 1->2 3.871055 | |
FRECPS (vector, 2S) 1->3 4.210768 | |
FRECPS (vector, 2S) throughput 0.499793 | |
FRECPS (vector, 4S) 1->2 3.732778 | |
FRECPS (vector, 4S) 1->3 4.434309 | |
FRECPS (vector, 4S) throughput 0.532724 | |
FRECPS (vector, 2D) 1->2 4.123998 | |
FRECPS (vector, 2D) 1->3 5.038989 | |
FRECPS (vector, 2D) throughput 0.590944 | |
FRECPS (scalar, S) 1->2 3.956695 | |
FRECPS (scalar, S) 1->3 3.521623 | |
FRECPS (scalar, S) throughput 0.570431 | |
FRECPS (scalar, D) 1->2 2.521919 | |
FRECPS (scalar, D) 1->3 4.282170 | |
FRECPS (scalar, D) throughput 0.609473 | |
FRECPX (scalar, S) 1->2 2.815102 | |
FRECPX (scalar, S) throughput 1.049834 | |
FRECPX (scalar, D) 1->2 2.996078 | |
FRECPX (scalar, D) throughput 1.052182 | |
FRINTA (scalar, S) 1->2 3.169889 | |
FRINTA (scalar, S) throughput 0.580934 | |
FRINTA (scalar, D) 1->2 3.003875 | |
FRINTA (scalar, D) throughput 0.510362 | |
FRINTA (vector, 2S) 1->2 3.268979 | |
FRINTA (vector, 2S) throughput 0.508253 | |
FRINTA (vector, 4S) 1->2 3.019030 | |
FRINTA (vector, 4S) throughput 0.562411 | |
FRINTA (vector, 2D) 1->2 3.047137 | |
FRINTA (vector, 2D) throughput 0.658554 | |
FRINTI (scalar, S) 1->2 3.028459 | |
FRINTI (scalar, S) throughput 0.548685 | |
FRINTI (scalar, D) 1->2 2.996289 | |
FRINTI (scalar, D) throughput 0.500429 | |
FRINTI (vector, 2S) 1->2 3.554761 | |
FRINTI (vector, 2S) throughput 0.505823 | |
FRINTI (vector, 4S) 1->2 3.005918 | |
FRINTI (vector, 4S) throughput 0.471539 | |
FRINTI (vector, 2D) 1->2 3.265465 | |
FRINTI (vector, 2D) throughput 0.539730 | |
FRINTM (scalar, S) 1->2 3.561399 | |
FRINTM (scalar, S) throughput 0.411683 | |
FRINTM (scalar, D) 1->2 3.158083 | |
FRINTM (scalar, D) throughput 0.576857 | |
FRINTM (vector, 2S) 1->2 3.008568 | |
FRINTM (vector, 2S) throughput 0.592108 | |
FRINTM (vector, 4S) 1->2 2.997332 | |
FRINTM (vector, 4S) throughput 0.573206 | |
FRINTM (vector, 2D) 1->2 2.955198 | |
FRINTM (vector, 2D) throughput 0.535116 | |
FRINTN (scalar, S) 1->2 3.231363 | |
FRINTN (scalar, S) throughput 0.506383 | |
FRINTN (scalar, D) 1->2 3.094444 | |
FRINTN (scalar, D) throughput 0.503457 | |
FRINTN (vector, 2S) 1->2 3.611484 | |
FRINTN (vector, 2S) throughput 0.476992 | |
FRINTN (vector, 4S) 1->2 3.009720 | |
FRINTN (vector, 4S) throughput 0.522611 | |
FRINTN (vector, 2D) 1->2 2.374987 | |
FRINTN (vector, 2D) throughput 0.503199 | |
FRINTP (scalar, S) 1->2 3.008043 | |
FRINTP (scalar, S) throughput 0.505219 | |
FRINTP (scalar, D) 1->2 3.073819 | |
FRINTP (scalar, D) throughput 0.545873 | |
FRINTP (vector, 2S) 1->2 2.974266 | |
FRINTP (vector, 2S) throughput 0.589942 | |
FRINTP (vector, 4S) 1->2 2.980045 | |
FRINTP (vector, 4S) throughput 0.583509 | |
FRINTP (vector, 2D) 1->2 4.459906 | |
FRINTP (vector, 2D) throughput 0.516048 | |
FRINTX (scalar, S) 1->2 3.010046 | |
FRINTX (scalar, S) throughput 0.526302 | |
FRINTX (scalar, D) 1->2 3.064139 | |
FRINTX (scalar, D) throughput 0.560471 | |
FRINTX (vector, 2S) 1->2 3.004931 | |
FRINTX (vector, 2S) throughput 0.580727 | |
FRINTX (vector, 4S) 1->2 2.825430 | |
FRINTX (vector, 4S) throughput 0.576709 | |
FRINTX (vector, 2D) 1->2 3.191691 | |
FRINTX (vector, 2D) throughput 0.582464 | |
FRINTZ (scalar, S) 1->2 2.999111 | |
FRINTZ (scalar, S) throughput 0.574588 | |
FRINTZ (scalar, D) 1->2 3.001292 | |
FRINTZ (scalar, D) throughput 0.501257 | |
FRINTZ (vector, 2S) 1->2 3.220337 | |
FRINTZ (vector, 2S) throughput 0.491193 | |
FRINTZ (vector, 4S) 1->2 3.759242 | |
FRINTZ (vector, 4S) throughput 0.503341 | |
FRINTZ (vector, 2D) 1->2 3.917434 | |
FRINTZ (vector, 2D) throughput 0.562542 | |
FRSQRTE (scalar, S) 1->2 3.022026 | |
FRSQRTE (scalar, S) throughput 1.093789 | |
FRSQRTE (scalar, D) 1->2 2.882875 | |
FRSQRTE (scalar, D) throughput 1.217684 | |
FRSQRTE (vector, 2S) 1->2 3.706096 | |
FRSQRTE (vector, 2S) throughput 1.005127 | |
FRSQRTE (vector, 4S) 1->2 3.121202 | |
FRSQRTE (vector, 4S) throughput 2.319847 | |
FRSQRTE (vector, 2D) 1->2 4.861275 | |
FRSQRTE (vector, 2D) throughput 2.019493 | |
FRSQRTS (vector, 2S) 1->2 3.996850 | |
FRSQRTS (vector, 2S) 1->3 4.006966 | |
FRSQRTS (vector, 2S) throughput 0.505021 | |
FRSQRTS (vector, 4S) 1->2 3.999151 | |
FRSQRTS (vector, 4S) 1->3 4.044041 | |
FRSQRTS (vector, 4S) throughput 0.500739 | |
FRSQRTS (vector, 2D) 1->2 3.999272 | |
FRSQRTS (vector, 2D) 1->3 3.998989 | |
FRSQRTS (vector, 2D) throughput 0.500142 | |
FRSQRTS (scalar, S) 1->2 3.987286 | |
FRSQRTS (scalar, S) 1->3 4.005225 | |
FRSQRTS (scalar, S) throughput 0.500627 | |
FRSQRTS (scalar, D) 1->2 4.038138 | |
FRSQRTS (scalar, D) 1->3 4.005337 | |
FRSQRTS (scalar, D) throughput 0.500820 | |
FSQRT (scalar, S) 1->2 10.082928 | |
FSQRT (scalar, S) throughput 2.015929 | |
FSQRT (scalar, D) 1->2 13.037594 | |
FSQRT (scalar, D) throughput 2.232481 | |
FSQRT (vector, 2S) 1->2 9.151747 | |
FSQRT (vector, 2S) throughput 2.009602 | |
FSQRT (vector, 4S) 1->2 12.848573 | |
FSQRT (vector, 4S) throughput 3.237254 | |
FSQRT (vector, 2D) 1->2 15.225853 | |
FSQRT (vector, 2D) throughput 4.182419 | |
FSUB (vector, 2S) 1->2 2.997502 | |
FSUB (vector, 2S) 1->3 5.328179 | |
FSUB (vector, 2S) throughput 0.537653 | |
FSUB (vector, 4S) 1->2 3.532436 | |
FSUB (vector, 4S) 1->3 3.004130 | |
FSUB (vector, 4S) throughput 0.499550 | |
FSUB (vector, 2D) 1->2 3.002221 | |
FSUB (vector, 2D) 1->3 2.993416 | |
FSUB (vector, 2D) throughput 0.501114 | |
FSUB (scalar, S) 1->2 3.010752 | |
FSUB (scalar, S) 1->3 3.188527 | |
FSUB (scalar, S) throughput 0.506147 | |
FSUB (scalar, D) 1->2 3.004645 | |
FSUB (scalar, D) 1->3 3.022191 | |
FSUB (scalar, D) throughput 0.460967 | |
INS (element, B) 1->1 1.988376 | |
INS (element, B) 1->2 2.000774 | |
INS (element, B) throughput 0.508113 | |
INS (element, B) throughput 0.489807 | |
INS (element, H) 1->1 2.008252 | |
INS (element, H) 1->2 1.909536 | |
INS (element, H) throughput 0.503999 | |
INS (element, H) throughput 0.504581 | |
INS (element, S) 1->1 1.981414 | |
INS (element, S) 1->2 2.001459 | |
INS (element, S) throughput 0.500328 | |
INS (element, S) throughput 0.501240 | |
INS (element, D) 1->1 2.002874 | |
INS (element, D) 1->2 2.002227 | |
INS (element, D) throughput 0.519410 | |
INS (element, D) throughput 0.521239 | |
INS (general, B) 1->1 1.524256 | |
INS (general, B) 1->2 roundtrip 9.034778 | |
INS (general, B) throughput 0.757842 | |
INS (general, H) 1->1 1.958713 | |
INS (general, H) 1->2 roundtrip 9.091431 | |
INS (general, H) throughput 0.753812 | |
INS (general, S) 1->1 2.002261 | |
INS (general, S) 1->2 roundtrip 9.011616 | |
INS (general, S) throughput 0.761611 | |
INS (general, D) 1->1 1.987297 | |
INS (general, D) 1->2 roundtrip 9.160768 | |
INS (general, D) throughput 0.750141 | |
LD1 (multiple, one register, 8B) 1->2 roundtrip 7.496864 | |
LD1 (multiple, one register, 8B) throughput 0.492584 | |
LD1 (multiple, one register, 16B) 1->2 roundtrip 6.905282 | |
LD1 (multiple, one register, 16B) throughput 0.500638 | |
LD1 (multiple, one register, 4H) 1->2 roundtrip 7.037339 | |
LD1 (multiple, one register, 4H) throughput 0.501437 | |
LD1 (multiple, one register, 8H) 1->2 roundtrip 7.033447 | |
LD1 (multiple, one register, 8H) throughput 0.499682 | |
LD1 (multiple, one register, 2S) 1->2 roundtrip 6.995109 | |
LD1 (multiple, one register, 2S) throughput 0.456356 | |
LD1 (multiple, one register, 4S) 1->2 roundtrip 7.020445 | |
LD1 (multiple, one register, 4S) throughput 0.501930 | |
LD1 (multiple, one register, 1D) 1->2 roundtrip 7.064167 | |
LD1 (multiple, one register, 1D) throughput 0.501518 | |
LD1 (multiple, one register, 2D) 1->2 roundtrip 7.034202 | |
LD1 (multiple, one register, 2D) throughput 0.500762 | |
LD1 (multiple, two registers, 8B) 1->3 roundtrip 7.011493 | |
LD1 (multiple, two registers, 8B) 2->3 roundtrip 7.186841 | |
LD1 (multiple, two registers, 8B) throughput 0.999586 | |
LD1 (multiple, two registers, 16B) 1->3 roundtrip 7.120672 | |
LD1 (multiple, two registers, 16B) 2->3 roundtrip 7.392044 | |
LD1 (multiple, two registers, 16B) throughput 1.090149 | |
LD1 (multiple, two registers, 4H) 1->3 roundtrip 7.154723 | |
LD1 (multiple, two registers, 4H) 2->3 roundtrip 7.095731 | |
LD1 (multiple, two registers, 4H) throughput 1.001407 | |
LD1 (multiple, two registers, 8H) 1->3 roundtrip 7.077802 | |
LD1 (multiple, two registers, 8H) 2->3 roundtrip 7.102367 | |
LD1 (multiple, two registers, 8H) throughput 0.999379 | |
LD1 (multiple, two registers, 2S) 1->3 roundtrip 8.869814 | |
LD1 (multiple, two registers, 2S) 2->3 roundtrip 6.627348 | |
LD1 (multiple, two registers, 2S) throughput 1.001687 | |
LD1 (multiple, two registers, 4S) 1->3 roundtrip 7.089131 | |
LD1 (multiple, two registers, 4S) 2->3 roundtrip 7.070537 | |
LD1 (multiple, two registers, 4S) throughput 1.015609 | |
LD1 (multiple, two registers, 1D) 1->3 roundtrip 6.990187 | |
LD1 (multiple, two registers, 1D) 2->3 roundtrip 7.239993 | |
LD1 (multiple, two registers, 1D) throughput 1.002241 | |
LD1 (multiple, two registers, 2D) 1->3 roundtrip 7.013282 | |
LD1 (multiple, two registers, 2D) 2->3 roundtrip 8.550091 | |
LD1 (multiple, two registers, 2D) throughput 1.034062 | |
LD1 (multiple, three registers, 8B) throughput 1.517993 | |
LD1 (multiple, three registers, 16B) throughput 1.510336 | |
LD1 (multiple, three registers, 4H) throughput 1.511464 | |
LD1 (multiple, three registers, 8H) throughput 1.502121 | |
LD1 (multiple, three registers, 2S) throughput 1.518675 | |
LD1 (multiple, three registers, 4S) throughput 1.508180 | |
LD1 (multiple, three registers, 1D) throughput 1.503664 | |
LD1 (multiple, three registers, 2D) throughput 1.515746 | |
LD1 (multiple, four registers, 8B) throughput 1.902273 | |
LD1 (multiple, four registers, 16B) throughput 2.024473 | |
LD1 (multiple, four registers, 4H) throughput 2.030051 | |
LD1 (multiple, four registers, 8H) throughput 2.073090 | |
LD1 (multiple, four registers, 2S) throughput 2.034621 | |
LD1 (multiple, four registers, 4S) throughput 2.031685 | |
LD1 (multiple, four registers, 1D) throughput 2.018009 | |
LD1 (multiple, four registers, 2D) throughput 2.003928 | |
LD1 (single structure, B) 1->2 roundtrip 9.237689 | |
LD1 (single structure, B) throughput 2.013528 | |
LD1 (single structure, H) 1->2 roundtrip 9.105559 | |
LD1 (single structure, H) throughput 2.047518 | |
LD1 (single structure, S) 1->2 roundtrip 9.015028 | |
LD1 (single structure, S) throughput 2.010168 | |
LD1 (single structure, D) 1->2 roundtrip 8.175292 | |
LD1 (single structure, D) throughput 2.252838 | |
LD1R (8B) 1->2 roundtrip 9.032480 | |
LD1R (8B) throughput 0.507917 | |
LD1R (16B) 1->2 roundtrip 8.973735 | |
LD1R (16B) throughput 0.502272 | |
LD1R (4H) 1->2 roundtrip 9.197005 | |
LD1R (4H) throughput 0.501230 | |
LD1R (8H) 1->2 roundtrip 9.032142 | |
LD1R (8H) throughput 0.501438 | |
LD1R (2S) 1->2 roundtrip 9.083982 | |
LD1R (2S) throughput 0.820501 | |
LD1R (4S) 1->2 roundtrip 11.281370 | |
LD1R (4S) throughput 0.500339 | |
LD1R (1D) 1->2 roundtrip 9.281731 | |
LD1R (1D) throughput 0.501154 | |
LD1R (2D) 1->2 roundtrip 9.010712 | |
LD1R (2D) throughput 0.501346 | |
LD2 (multiple structures, 8B) 1->3 roundtrip 9.105677 | |
LD2 (multiple structures, 8B) 2->3 roundtrip 9.083969 | |
LD2 (multiple structures, 8B) throughput 1.009460 | |
LD2 (multiple structures, 16B) 1->3 roundtrip 9.218350 | |
LD2 (multiple structures, 16B) 2->3 roundtrip 9.042767 | |
LD2 (multiple structures, 16B) throughput 0.979966 | |
LD2 (multiple structures, 4H) 1->3 roundtrip 7.646544 | |
LD2 (multiple structures, 4H) 2->3 roundtrip 9.016837 | |
LD2 (multiple structures, 4H) throughput 1.001600 | |
LD2 (multiple structures, 8H) 1->3 roundtrip 9.016312 | |
LD2 (multiple structures, 8H) 2->3 roundtrip 9.527361 | |
LD2 (multiple structures, 8H) throughput 1.047649 | |
LD2 (multiple structures, 2S) 1->3 roundtrip 9.056487 | |
LD2 (multiple structures, 2S) 2->3 roundtrip 9.027327 | |
LD2 (multiple structures, 2S) throughput 1.002177 | |
LD2 (multiple structures, 4S) 1->3 roundtrip 9.228103 | |
LD2 (multiple structures, 4S) 2->3 roundtrip 11.305949 | |
LD2 (multiple structures, 4S) throughput 1.115639 | |
LD2 (multiple structures, 2D) 1->3 roundtrip 9.130457 | |
LD2 (multiple structures, 2D) 2->3 roundtrip 9.018417 | |
LD2 (multiple structures, 2D) throughput 1.003042 | |
LD2 (single structure, B) 1->3 roundtrip 9.129090 | |
LD2 (single structure, B) 2->3 roundtrip 9.060652 | |
LD2 (single structure, B) throughput 1.250162 | |
LD2 (single structure, H) 1->3 roundtrip 9.018789 | |
LD2 (single structure, H) 2->3 roundtrip 9.088440 | |
LD2 (single structure, H) throughput 0.633054 | |
LD2 (single structure, S) 1->3 roundtrip 9.102636 | |
LD2 (single structure, S) 2->3 roundtrip 9.164477 | |
LD2 (single structure, S) throughput 1.332354 | |
LD2 (single structure, D) 1->3 roundtrip 8.996682 | |
LD2 (single structure, D) 2->3 roundtrip 9.171231 | |
LD2 (single structure, D) throughput 1.363277 | |
LD2R (8B) 1->3 roundtrip 8.151490 | |
LD2R (8B) 2->3 roundtrip 9.055318 | |
LD2R (8B) throughput 1.002141 | |
LD2R (16B) 1->3 roundtrip 9.095853 | |
LD2R (16B) 2->3 roundtrip 9.056817 | |
LD2R (16B) throughput 0.998748 | |
LD2R (4H) 1->3 roundtrip 8.842675 | |
LD2R (4H) 2->3 roundtrip 9.147393 | |
LD2R (4H) throughput 1.006504 | |
LD2R (8H) 1->3 roundtrip 9.031579 | |
LD2R (8H) 2->3 roundtrip 9.784526 | |
LD2R (8H) throughput 1.010883 | |
LD2R (2S) 1->3 roundtrip 9.166738 | |
LD2R (2S) 2->3 roundtrip 9.096007 | |
LD2R (2S) throughput 1.006395 | |
LD2R (4S) 1->3 roundtrip 8.813419 | |
LD2R (4S) 2->3 roundtrip 12.025902 | |
LD2R (4S) throughput 1.042096 | |
LD2R (1D) 1->3 roundtrip 9.013919 | |
LD2R (1D) 2->3 roundtrip 9.474108 | |
LD2R (1D) throughput 1.061019 | |
LD2R (2D) 1->3 roundtrip 9.009001 | |
LD2R (2D) 2->3 roundtrip 9.014220 | |
LD2R (2D) throughput 1.001026 | |
LD3 (multiple structures, 8B) throughput 1.985801 | |
LD3 (multiple structures, 16B) throughput 2.020741 | |
LD3 (multiple structures, 4H) throughput 2.425026 | |
LD3 (multiple structures, 8H) throughput 2.012357 | |
LD3 (multiple structures, 2S) throughput 2.027656 | |
LD3 (multiple structures, 4S) throughput 2.003097 | |
LD3 (multiple structures, 2D) throughput 2.009708 | |
LD3 (single structure, B) throughput 2.034950 | |
LD3 (single structure, H) throughput 2.042560 | |
LD3 (single structure, S) throughput 2.031384 | |
LD3 (single structure, D) throughput 2.024957 | |
LD3R (8B) throughput 1.459800 | |
LD3R (16B) throughput 1.507324 | |
LD3R (4H) throughput 1.506469 | |
LD3R (8H) throughput 1.479041 | |
LD3R (2S) throughput 1.514882 | |
LD3R (4S) throughput 1.519725 | |
LD3R (1D) throughput 2.033667 | |
LD3R (2D) throughput 2.028952 | |
LD4 (multiple structures, 8B) throughput 2.066458 | |
LD4 (multiple structures, 16B) throughput 4.011827 | |
LD4 (multiple structures, 4H) throughput 2.009874 | |
LD4 (multiple structures, 8H) throughput 4.051732 | |
LD4 (multiple structures, 2S) throughput 2.000805 | |
LD4 (multiple structures, 4S) throughput 4.020400 | |
LD4 (multiple structures, 2D) throughput 4.098653 | |
LD4 (multiple, 8B, post-index) throughput 2.286229 | |
LD4 (multiple, 16B, post-index) throughput 4.078362 | |
LD4 (multiple, 4H, post-index) throughput 2.016536 | |
LD4 (multiple, 8H, post-index) throughput 3.831365 | |
LD4 (multiple, 2S, post-index) throughput 2.006732 | |
LD4 (multiple, 4S, post-index) throughput 4.019459 | |
LD4 (multiple, 2D, post-index) throughput 4.011161 | |
LD4 (single structure, B) throughput 2.032379 | |
LD4 (single structure, H) throughput 2.001791 | |
LD4 (single structure, S) throughput 2.087423 | |
LD4 (single structure, D) throughput 2.002626 | |
LD4 (single, B, post-index) throughput 2.008120 | |
LD4 (single, H, post-index) throughput 2.033647 | |
LD4 (single, S, post-index) throughput 1.780095 | |
LD4 (single, D, post-index) throughput 2.013273 | |
LD4R (8B) throughput 2.022372 | |
LD4R (16B) throughput 2.034385 | |
LD4R (4H) throughput 2.021174 | |
LD4R (8H) throughput 2.007375 | |
LD4R (2S) throughput 2.020434 | |
LD4R (4S) throughput 2.010825 | |
LD4R (1D) throughput 2.000273 | |
LD4R (2D) throughput 2.022669 | |
LDNP (S) 1->3 roundtrip 7.129860 | |
LDNP (S) 2->3 roundtrip 7.035929 | |
LDNP (S) throughput 1.001716 | |
LDNP (D) 1->3 roundtrip 7.058020 | |
LDNP (D) 2->3 roundtrip 7.215500 | |
LDNP (D) throughput 1.006747 | |
LDNP (Q) 1->3 roundtrip 7.033402 | |
LDNP (Q) 2->3 roundtrip 4.138727 | |
LDNP (Q) throughput 0.992942 | |
LDNP (signed offset, S) 1->3 roundtrip 7.098021 | |
LDNP (signed offset, S) 2->3 roundtrip 7.080957 | |
LDNP (signed offset, S) throughput 1.001782 | |
LDNP (signed offset, D) 1->3 roundtrip 7.070861 | |
LDNP (signed offset, D) 2->3 roundtrip 7.151612 | |
LDNP (signed offset, D) throughput 1.001563 | |
LDNP (signed offset, Q) 1->3 roundtrip 6.995954 | |
LDNP (signed offset, Q) 2->3 roundtrip 7.163104 | |
LDNP (signed offset, Q) throughput 1.007582 | |
LDP (S) 1->3 roundtrip 7.237894 | |
LDP (S) 2->3 roundtrip 7.097876 | |
LDP (S) throughput 1.004284 | |
LDP (D) 1->3 roundtrip 6.999353 | |
LDP (D) 2->3 roundtrip 7.555351 | |
LDP (D) throughput 1.662848 | |
LDP (Q) 1->3 roundtrip 8.648435 | |
LDP (Q) 2->3 roundtrip 7.959010 | |
LDP (Q) throughput 1.004211 | |
LDP (post-index, S) 1->3 roundtrip 7.761390 | |
LDP (post-index, S) 2->3 roundtrip 7.084072 | |
LDP (post-index, S) throughput 1.102760 | |
LDP (post-index, D) 1->3 roundtrip 6.691063 | |
LDP (post-index, D) 2->3 roundtrip 9.554171 | |
LDP (post-index, D) throughput 1.249167 | |
LDP (post-index, Q) 1->3 roundtrip 6.596199 | |
LDP (post-index, Q) 2->3 roundtrip 7.253716 | |
LDP (post-index, Q) throughput 2.055624 | |
LDP (pre-index, S) 1->3 roundtrip 6.779389 | |
LDP (pre-index, S) 2->3 roundtrip 7.762803 | |
LDP (pre-index, S) throughput 1.245505 | |
LDP (pre-index, D) 1->3 roundtrip 8.912175 | |
LDP (pre-index, D) 2->3 roundtrip 8.151609 | |
LDP (pre-index, D) throughput 1.333533 | |
LDP (pre-index, Q) 1->3 roundtrip 8.130755 | |
LDP (pre-index, Q) 2->3 roundtrip 8.356180 | |
LDP (pre-index, Q) throughput 1.191473 | |
LDP (signed offset, S) 1->3 roundtrip 7.574074 | |
LDP (signed offset, S) 2->3 roundtrip 7.816022 | |
LDP (signed offset, S) throughput 1.072364 | |
LDP (signed offset, D) 1->3 roundtrip 8.323065 | |
LDP (signed offset, D) 2->3 roundtrip 8.036336 | |
LDP (signed offset, D) throughput 1.023175 | |
LDP (signed offset, Q) 1->3 roundtrip 9.448364 | |
LDP (signed offset, Q) 2->3 roundtrip 7.566425 | |
LDP (signed offset, Q) throughput 1.051238 | |
LDR (S) 1->2 roundtrip 7.746717 | |
LDR (S) throughput 0.499114 | |
LDR (D) 1->2 roundtrip 6.939503 | |
LDR (D) throughput 0.535427 | |
LDR (Q) 1->2 roundtrip 10.153085 | |
LDR (Q) throughput 0.506620 | |
LDR (post-index, S) 1->2 roundtrip 7.068683 | |
LDR (post-index, S) throughput 0.660144 | |
LDR (post-index, D) 1->2 roundtrip 4.896653 | |
LDR (post-index, D) throughput 0.506181 | |
LDR (post-index, Q) 1->2 roundtrip 17.190853 | |
LDR (post-index, Q) throughput 0.673746 | |
LDR (pre-index, S) 1->2 roundtrip 8.663215 | |
LDR (pre-index, S) throughput 0.475521 | |
LDR (pre-index, D) 1->2 roundtrip 9.099869 | |
LDR (pre-index, D) throughput 0.463561 | |
LDR (pre-index, Q) 1->2 roundtrip 18.213276 | |
LDR (pre-index, Q) throughput 0.497461 | |
LDR (unsigned offset, S) 1->2 roundtrip 8.594967 | |
LDR (unsigned offset, S) throughput 0.500802 | |
LDR (unsigned offset, D) 1->2 roundtrip 6.723945 | |
LDR (unsigned offset, D) throughput 0.553039 | |
LDR (unsigned offset, Q) 1->2 roundtrip 5.054318 | |
LDR (unsigned offset, Q) throughput 0.517326 | |
LDR (literal, S) throughput 0.501477 | |
LDR (literal, D) throughput 0.540213 | |
LDR (literal, Q) throughput 0.495296 | |
LDR (register, S) 1->2 roundtrip 7.468402 | |
LDR (register, S) 1->3 roundtrip 8.114378 | |
LDR (register, S) throughput 0.504002 | |
LDR (register, D) 1->2 roundtrip 8.178510 | |
LDR (register, D) 1->3 roundtrip 7.503237 | |
LDR (register, D) throughput 0.566229 | |
LDR (register, Q) 1->2 roundtrip 8.265896 | |
LDR (register, Q) 1->3 roundtrip 7.296197 | |
LDR (register, Q) throughput 0.271045 | |
LDR (register, uxtw, S) 1->2 roundtrip 9.204787 | |
LDR (register, uxtw, S) 1->3 roundtrip 7.163584 | |
LDR (register, uxtw, S) throughput 0.570560 | |
LDR (register, uxtw, D) 1->2 roundtrip 7.521730 | |
LDR (register, uxtw, D) 1->3 roundtrip 7.347308 | |
LDR (register, uxtw, D) throughput 0.482927 | |
LDR (register, uxtw, Q) 1->2 roundtrip 7.301325 | |
LDR (register, uxtw, Q) 1->3 roundtrip 7.270031 | |
LDR (register, uxtw, Q) throughput 0.575110 | |
LDR (register, sxtw, S) 1->2 roundtrip 7.537086 | |
LDR (register, sxtw, S) 1->3 roundtrip 6.562206 | |
LDR (register, sxtw, S) throughput 0.541562 | |
LDR (register, sxtw, D) 1->2 roundtrip 7.555881 | |
LDR (register, sxtw, D) 1->3 roundtrip 8.085580 | |
LDR (register, sxtw, D) throughput 0.434113 | |
LDR (register, sxtw, Q) 1->2 roundtrip 8.257919 | |
LDR (register, sxtw, Q) 1->3 roundtrip 7.470659 | |
LDR (register, sxtw, Q) throughput 0.534491 | |
LDR (register, lsl, S) 1->2 roundtrip 5.657690 | |
LDR (register, lsl, S) 1->3 roundtrip 8.016595 | |
LDR (register, lsl, S) throughput 0.500096 | |
LDR (register, lsl, D) 1->2 roundtrip 7.409360 | |
LDR (register, lsl, D) 1->3 roundtrip 7.796671 | |
LDR (register, lsl, D) throughput 0.549861 | |
LDR (register, lsl, Q) 1->2 roundtrip 7.604372 | |
LDR (register, lsl, Q) 1->3 roundtrip 9.329175 | |
LDR (register, lsl, Q) throughput 0.502061 | |
LDUR (S) 1->2 roundtrip 7.952640 | |
LDUR (S) throughput 0.552248 | |
LDUR (D) 1->2 roundtrip 7.898053 | |
LDUR (D) throughput 0.542755 | |
LDUR (Q) 1->2 roundtrip 6.684032 | |
LDUR (Q) throughput 0.502185 | |
MLA (by element, 4H) 1->1 2.651300 | |
MLA (by element, 4H) 1->2 2.919289 | |
MLA (by element, 4H) 1->3 3.005718 | |
MLA (by element, 4H) throughput 0.416492 | |
MLA (by element, 4H) throughput 0.551420 | |
MLA (by element, 8H) 1->1 3.004814 | |
MLA (by element, 8H) 1->2 2.980716 | |
MLA (by element, 8H) 1->3 4.867201 | |
MLA (by element, 8H) throughput 0.421445 | |
MLA (by element, 8H) throughput 0.562966 | |
MLA (by element, 2S) 1->1 2.991071 | |
MLA (by element, 2S) 1->2 3.026394 | |
MLA (by element, 2S) 1->3 3.025500 | |
MLA (by element, 2S) throughput 0.542866 | |
MLA (by element, 2S) throughput 0.563320 | |
MLA (by element, 4S) 1->1 2.905926 | |
MLA (by element, 4S) 1->2 3.038773 | |
MLA (by element, 4S) 1->3 3.046851 | |
MLA (by element, 4S) throughput 0.507081 | |
MLA (by element, 4S) throughput 0.529790 | |
MLA (vector, 8B) 1->1 3.014456 | |
MLA (vector, 8B) 1->2 3.010854 | |
MLA (vector, 8B) 1->3 3.623993 | |
MLA (vector, 8B) throughput 0.499141 | |
MLA (vector, 8B) throughput 0.533033 | |
MLA (vector, 16B) 1->1 3.424985 | |
MLA (vector, 16B) 1->2 3.524273 | |
MLA (vector, 16B) 1->3 2.997898 | |
MLA (vector, 16B) throughput 0.574058 | |
MLA (vector, 16B) throughput 0.537056 | |
MLA (vector, 4H) 1->1 3.011372 | |
MLA (vector, 4H) 1->2 3.005741 | |
MLA (vector, 4H) 1->3 3.364581 | |
MLA (vector, 4H) throughput 0.558823 | |
MLA (vector, 4H) throughput 0.447700 | |
MLA (vector, 8H) 1->1 3.457334 | |
MLA (vector, 8H) 1->2 3.010852 | |
MLA (vector, 8H) 1->3 2.120557 | |
MLA (vector, 8H) throughput 0.980558 | |
MLA (vector, 8H) throughput 0.510988 | |
MLA (vector, 2S) 1->1 3.000564 | |
MLA (vector, 2S) 1->2 3.671882 | |
MLA (vector, 2S) 1->3 3.077388 | |
MLA (vector, 2S) throughput 0.530708 | |
MLA (vector, 2S) throughput 0.512276 | |
MLA (vector, 4S) 1->1 4.062491 | |
MLA (vector, 4S) 1->2 2.988690 | |
MLA (vector, 4S) 1->3 2.979174 | |
MLA (vector, 4S) throughput 0.504614 | |
MLA (vector, 4S) throughput 0.507961 | |
MLS (by element, 4H) 1->1 2.957716 | |
MLS (by element, 4H) 1->2 3.106535 | |
MLS (by element, 4H) 1->3 3.535993 | |
MLS (by element, 4H) throughput 0.498676 | |
MLS (by element, 4H) throughput 0.556905 | |
MLS (by element, 8H) 1->1 2.986776 | |
MLS (by element, 8H) 1->2 2.962045 | |
MLS (by element, 8H) 1->3 3.000323 | |
MLS (by element, 8H) throughput 0.579716 | |
MLS (by element, 8H) throughput 0.546364 | |
MLS (by element, 2S) 1->1 3.607027 | |
MLS (by element, 2S) 1->2 3.431976 | |
MLS (by element, 2S) 1->3 2.872706 | |
MLS (by element, 2S) throughput 0.500515 | |
MLS (by element, 2S) throughput 0.526552 | |
MLS (by element, 4S) 1->1 3.451403 | |
MLS (by element, 4S) 1->2 3.019598 | |
MLS (by element, 4S) 1->3 2.999959 | |
MLS (by element, 4S) throughput 0.590091 | |
MLS (by element, 4S) throughput 0.517215 | |
MLS (vector, 8B) 1->1 3.181490 | |
MLS (vector, 8B) 1->2 3.001738 | |
MLS (vector, 8B) 1->3 3.307790 | |
MLS (vector, 8B) throughput 0.529128 | |
MLS (vector, 8B) throughput 0.443394 | |
MLS (vector, 16B) 1->1 3.013403 | |
MLS (vector, 16B) 1->2 3.216815 | |
MLS (vector, 16B) 1->3 4.056943 | |
MLS (vector, 16B) throughput 0.503182 | |
MLS (vector, 16B) throughput 0.515519 | |
MLS (vector, 4H) 1->1 3.012123 | |
MLS (vector, 4H) 1->2 3.081776 | |
MLS (vector, 4H) 1->3 3.751146 | |
MLS (vector, 4H) throughput 0.501721 | |
MLS (vector, 4H) throughput 0.533658 | |
MLS (vector, 8H) 1->1 2.988269 | |
MLS (vector, 8H) 1->2 3.000808 | |
MLS (vector, 8H) 1->3 3.458138 | |
MLS (vector, 8H) throughput 0.563467 | |
MLS (vector, 8H) throughput 0.531125 | |
MLS (vector, 2S) 1->1 2.993176 | |
MLS (vector, 2S) 1->2 3.438311 | |
MLS (vector, 2S) 1->3 2.991025 | |
MLS (vector, 2S) throughput 0.501554 | |
MLS (vector, 2S) throughput 0.392994 | |
MLS (vector, 4S) 1->1 3.051061 | |
MLS (vector, 4S) 1->2 2.742058 | |
MLS (vector, 4S) 1->3 3.001465 | |
MLS (vector, 4S) throughput 0.528878 | |
MLS (vector, 4S) throughput 0.516196 | |
MOV (vector, 8B) 1->2 1.999597 | |
MOV (vector, 8B) throughput 0.590581 | |
MOV (vector, 16B) 1->2 1.247376 | |
MOV (vector, 16B) throughput 0.248444 | |
MOVI (vector, 4H, zero) throughput 0.251940 | |
MOVI (vector, 8H, zero) throughput 0.252171 | |
MOVI (vector, 2S, zero) throughput 0.258209 | |
MOVI (vector, 4S, zero) throughput 0.219978 | |
MOVI (vector, 4H) throughput 0.537250 | |
MOVI (vector, 8H) throughput 0.501582 | |
MOVI (vector, 2S) throughput 0.535327 | |
MOVI (vector, 4S) throughput 0.430070 | |
MOVI (vector, 4H, lsl) throughput 0.529857 | |
MOVI (vector, 8H, lsl) throughput 0.500820 | |
MOVI (vector, 2S, lsl) throughput 0.594961 | |
MOVI (vector, 4S, lsl) throughput 0.472988 | |
MOVI (vector, 2S, msl) throughput 0.503963 | |
MOVI (vector, 4S, msl) throughput 0.609193 | |
MOVI (vector, 64-bit, 2D) throughput 0.502693 | |
MOVI (vector, 64-bit, D) throughput 0.584680 | |
MUL (by element, 4H) 1->2 3.001615 | |
MUL (by element, 4H) 1->3 2.877025 | |
MUL (by element, 4H) throughput 0.568466 | |
MUL (by element, 8H) 1->2 3.021402 | |
MUL (by element, 8H) 1->3 3.673616 | |
MUL (by element, 8H) throughput 0.500567 | |
MUL (by element, 2S) 1->2 3.597019 | |
MUL (by element, 2S) 1->3 3.446838 | |
MUL (by element, 2S) throughput 0.503001 | |
MUL (by element, 4S) 1->2 3.228488 | |
MUL (by element, 4S) 1->3 3.008003 | |
MUL (by element, 4S) throughput 0.542315 | |
MUL (vector, 8B) 1->2 3.010203 | |
MUL (vector, 8B) 1->3 3.293325 | |
MUL (vector, 8B) throughput 0.537211 | |
MUL (vector, 16B) 1->2 2.885216 | |
MUL (vector, 16B) 1->3 2.667125 | |
MUL (vector, 16B) throughput 0.464154 | |
MUL (vector, 4H) 1->2 3.332741 | |
MUL (vector, 4H) 1->3 2.987184 | |
MUL (vector, 4H) throughput 0.499980 | |
MUL (vector, 8H) 1->2 3.616624 | |
MUL (vector, 8H) 1->3 3.296550 | |
MUL (vector, 8H) throughput 0.467136 | |
MUL (vector, 2S) 1->2 3.282825 | |
MUL (vector, 2S) 1->3 3.022032 | |
MUL (vector, 2S) throughput 0.538245 | |
MUL (vector, 4S) 1->2 3.175769 | |
MUL (vector, 4S) 1->3 2.998626 | |
MUL (vector, 4S) throughput 0.556138 | |
MVNI (vector, 4H) throughput 0.573552 | |
MVNI (vector, 8H) throughput 0.497753 | |
MVNI (vector, 2S) throughput 0.595928 | |
MVNI (vector, 4S) throughput 0.528199 | |
MVNI (vector, 4H, lsl) throughput 0.437369 | |
MVNI (vector, 8H, lsl) throughput 0.565829 | |
MVNI (vector, 2S, lsl) throughput 0.500683 | |
MVNI (vector, 4S, lsl) throughput 0.501579 | |
MVNI (vector, 2S, msl) throughput 0.579338 | |
MVNI (vector, 4S, msl) throughput 0.488313 | |
NEG (vector, 8B) 1->2 2.013323 | |
NEG (vector, 8B) throughput 0.508059 | |
NEG (vector, 16B) 1->2 1.917469 | |
NEG (vector, 16B) throughput 0.501281 | |
NEG (vector, 4H) 1->2 2.008097 | |
NEG (vector, 4H) throughput 0.527345 | |
NEG (vector, 8H) 1->2 1.993436 | |
NEG (vector, 8H) throughput 0.497871 | |
NEG (vector, 2S) 1->2 2.057741 | |
NEG (vector, 2S) throughput 0.548211 | |
NEG (vector, 4S) 1->2 1.995479 | |
NEG (vector, 4S) throughput 0.507916 | |
NEG (vector, 2D) 1->2 2.010483 | |
NEG (vector, 2D) throughput 0.830210 | |
NOT (vector, 8B) 1->2 2.005074 | |
NOT (vector, 8B) throughput 0.512108 | |
NOT (vector, 16B) 1->2 2.185401 | |
NOT (vector, 16B) throughput 0.497840 | |
ORN (vector, 8B) 1->2 1.994586 | |
ORN (vector, 8B) 1->3 2.015508 | |
ORN (vector, 8B) throughput 0.508539 | |
ORN (vector, 16B) 1->2 2.162469 | |
ORN (vector, 16B) 1->3 2.228269 | |
ORN (vector, 16B) throughput 0.467943 | |
ORR (vector, immediate, 4H) 1->1 2.029656 | |
ORR (vector, immediate, 4H) throughput 0.481015 | |
ORR (vector, immediate, 4H) throughput 0.568080 | |
ORR (vector, immediate, 8H) 1->1 2.064586 | |
ORR (vector, immediate, 8H) throughput 0.776736 | |
ORR (vector, immediate, 8H) throughput 0.473577 | |
ORR (vector, immediate, 2S) 1->1 2.223360 | |
ORR (vector, immediate, 2S) throughput 0.515830 | |
ORR (vector, immediate, 2S) throughput 0.412866 | |
ORR (vector, immediate, 4S) 1->1 2.002350 | |
ORR (vector, immediate, 4S) throughput 0.506261 | |
ORR (vector, immediate, 4S) throughput 0.502726 | |
ORR (vector, 8B) 1->2 1.999571 | |
ORR (vector, 8B) 1->3 2.002751 | |
ORR (vector, 8B) throughput 0.392356 | |
ORR (vector, 16B) 1->2 2.667962 | |
ORR (vector, 16B) 1->3 2.002825 | |
ORR (vector, 16B) throughput 0.349877 | |
PMUL (8B) 1->2 3.778660 | |
PMUL (8B) 1->3 1.895518 | |
PMUL (8B) throughput 0.712787 | |
PMUL (16B) 1->2 4.838376 | |
PMUL (16B) 1->3 2.448032 | |
PMUL (16B) throughput 0.512456 | |
PMULL (1Q) 1->2 3.028828 | |
PMULL (1Q) 1->3 2.997009 | |
PMULL (1Q) throughput 0.500430 | |
PMULL (8H) 1->2 3.007371 | |
PMULL (8H) 1->3 3.092862 | |
PMULL (8H) throughput 0.513959 | |
PMULL2 (1Q) 1->2 3.095582 | |
PMULL2 (1Q) 1->3 2.933247 | |
PMULL2 (1Q) throughput 0.617001 | |
PMULL2 (8H) 1->2 2.849522 | |
PMULL2 (8H) 1->3 5.254128 | |
PMULL2 (8H) throughput 0.770522 | |
PMULL + EOR (1Q) 1->2 4.188216 | |
PMULL + EOR (1Q) 1->3 4.196204 | |
PMULL + EOR (1Q) 1->4 4.247578 | |
PMULL + EOR (1Q) throughput 0.504712 | |
PMULL + EOR (8H) 1->2 4.180045 | |
PMULL + EOR (8H) 1->3 4.939697 | |
PMULL + EOR (8H) 1->4 5.004923 | |
PMULL + EOR (8H) throughput 0.578347 | |
RADDHN (8H) 1->2 3.005673 | |
RADDHN (8H) 1->3 3.030740 | |
RADDHN (8H) throughput 0.497274 | |
RADDHN (4S) 1->2 3.023591 | |
RADDHN (4S) 1->3 3.025277 | |
RADDHN (4S) throughput 0.502520 | |
RADDHN (2D) 1->2 2.996732 | |
RADDHN (2D) 1->3 3.056888 | |
RADDHN (2D) throughput 0.503019 | |
RADDHN2 (8H) 1->1 3.156757 | |
RADDHN2 (8H) 1->2 3.623012 | |
RADDHN2 (8H) 1->3 3.010248 | |
RADDHN2 (8H) throughput 0.505579 | |
RADDHN2 (8H) throughput 0.371679 | |
RADDHN2 (4S) 1->1 3.150907 | |
RADDHN2 (4S) 1->2 3.745533 | |
RADDHN2 (4S) 1->3 2.694099 | |
RADDHN2 (4S) throughput 0.501379 | |
RADDHN2 (4S) throughput 0.500970 | |
RADDHN2 (2D) 1->1 2.999084 | |
RADDHN2 (2D) 1->2 3.005224 | |
RADDHN2 (2D) 1->3 3.002875 | |
RADDHN2 (2D) throughput 0.501953 | |
RADDHN2 (2D) throughput 0.555179 | |
RBIT (vector, 8B) 1->2 2.004617 | |
RBIT (vector, 8B) throughput 0.574678 | |
RBIT (vector, 16B) 1->2 1.898328 | |
RBIT (vector, 16B) throughput 0.659125 | |
REV16 (vector, 8B) 1->2 2.012634 | |
REV16 (vector, 8B) throughput 0.624666 | |
REV16 (vector, 16B) 1->2 2.005384 | |
REV16 (vector, 16B) throughput 0.498250 | |
REV32 (vector, 8B) 1->2 2.015148 | |
REV32 (vector, 8B) throughput 0.500532 | |
REV32 (vector, 16B) 1->2 1.848963 | |
REV32 (vector, 16B) throughput 0.566067 | |
REV32 (vector, 4H) 1->2 2.537875 | |
REV32 (vector, 4H) throughput 0.498430 | |
REV32 (vector, 8H) 1->2 1.983819 | |
REV32 (vector, 8H) throughput 0.624582 | |
REV64 (vector, 8B) 1->2 2.504411 | |
REV64 (vector, 8B) throughput 0.496911 | |
REV64 (vector, 16B) 1->2 2.015870 | |
REV64 (vector, 16B) throughput 0.581281 | |
REV64 (vector, 4H) 1->2 1.972754 | |
REV64 (vector, 4H) throughput 0.499645 | |
REV64 (vector, 8H) 1->2 1.983081 | |
REV64 (vector, 8H) throughput 0.452944 | |
REV64 (vector, 2S) 1->2 1.992644 | |
REV64 (vector, 2S) throughput 0.590899 | |
REV64 (vector, 4S) 1->2 1.914631 | |
REV64 (vector, 4S) throughput 0.602440 | |
RSHRN (8H) 1->2 3.014105 | |
RSHRN (8H) throughput 0.597651 | |
RSHRN (4S) 1->2 3.103403 | |
RSHRN (4S) throughput 0.601344 | |
RSHRN (2D) 1->2 3.009539 | |
RSHRN (2D) throughput 0.567662 | |
RSHRN2 (8H) 1->1 3.009717 | |
RSHRN2 (8H) 1->2 3.617428 | |
RSHRN2 (8H) throughput 0.501913 | |
RSHRN2 (8H) throughput 0.543457 | |
RSHRN2 (4S) 1->1 3.004216 | |
RSHRN2 (4S) 1->2 3.026157 | |
RSHRN2 (4S) throughput 0.417719 | |
RSHRN2 (4S) throughput 0.566053 | |
RSHRN2 (2D) 1->1 3.668177 | |
RSHRN2 (2D) 1->2 2.581415 | |
RSHRN2 (2D) throughput 0.451969 | |
RSHRN2 (2D) throughput 0.539838 | |
RSUBHN (8H) 1->2 2.982652 | |
RSUBHN (8H) 1->3 2.994159 | |
RSUBHN (8H) throughput 0.421083 | |
RSUBHN (4S) 1->2 3.008464 | |
RSUBHN (4S) 1->3 2.989121 | |
RSUBHN (4S) throughput 0.564006 | |
RSUBHN (2D) 1->2 3.557729 | |
RSUBHN (2D) 1->3 3.031143 | |
RSUBHN (2D) throughput 0.627254 | |
RSUBHN2 (8H) 1->1 3.019074 | |
RSUBHN2 (8H) 1->2 3.041829 | |
RSUBHN2 (8H) 1->3 3.072098 | |
RSUBHN2 (8H) throughput 0.540833 | |
RSUBHN2 (8H) throughput 0.542734 | |
RSUBHN2 (4S) 1->1 4.368918 | |
RSUBHN2 (4S) 1->2 3.020609 | |
RSUBHN2 (4S) 1->3 3.240532 | |
RSUBHN2 (4S) throughput 0.483360 | |
RSUBHN2 (4S) throughput 0.621139 | |
RSUBHN2 (2D) 1->1 2.689037 | |
RSUBHN2 (2D) 1->2 3.174594 | |
RSUBHN2 (2D) 1->3 3.023611 | |
RSUBHN2 (2D) throughput 0.604233 | |
RSUBHN2 (2D) throughput 0.544724 | |
SABA (vector, 8B) 1->1 2.998870 | |
SABA (vector, 8B) 1->2 2.868923 | |
SABA (vector, 8B) 1->3 3.009475 | |
SABA (vector, 8B) throughput 0.599454 | |
SABA (vector, 8B) throughput 0.536865 | |
SABA (vector, 16B) 1->1 2.775812 | |
SABA (vector, 16B) 1->2 3.006600 | |
SABA (vector, 16B) 1->3 2.826876 | |
SABA (vector, 16B) throughput 0.518390 | |
SABA (vector, 16B) throughput 0.523267 | |
SABA (vector, 4H) 1->1 3.005871 | |
SABA (vector, 4H) 1->2 2.860097 | |
SABA (vector, 4H) 1->3 3.009153 | |
SABA (vector, 4H) throughput 0.503031 | |
SABA (vector, 4H) throughput 0.519432 | |
SABA (vector, 8H) 1->1 3.012635 | |
SABA (vector, 8H) 1->2 3.639463 | |
SABA (vector, 8H) 1->3 3.465409 | |
SABA (vector, 8H) throughput 0.684085 | |
SABA (vector, 8H) throughput 0.495022 | |
SABA (vector, 2S) 1->1 3.011420 | |
SABA (vector, 2S) 1->2 3.020408 | |
SABA (vector, 2S) 1->3 2.880928 | |
SABA (vector, 2S) throughput 0.568116 | |
SABA (vector, 2S) throughput 0.426606 | |
SABA (vector, 4S) 1->1 3.610996 | |
SABA (vector, 4S) 1->2 3.028709 | |
SABA (vector, 4S) 1->3 2.521606 | |
SABA (vector, 4S) throughput 0.451987 | |
SABA (vector, 4S) throughput 0.542796 | |
SABAL (vector, 8H) 1->1 3.052599 | |
SABAL (vector, 8H) 1->2 2.862481 | |
SABAL (vector, 8H) 1->3 2.901147 | |
SABAL (vector, 8H) throughput 0.532919 | |
SABAL (vector, 8H) throughput 0.519014 | |
SABAL (vector, 4S) 1->1 2.955786 | |
SABAL (vector, 4S) 1->2 3.014858 | |
SABAL (vector, 4S) 1->3 3.396685 | |
SABAL (vector, 4S) throughput 0.482266 | |
SABAL (vector, 4S) throughput 0.442707 | |
SABAL (vector, 2D) 1->1 3.055825 | |
SABAL (vector, 2D) 1->2 3.021014 | |
SABAL (vector, 2D) 1->3 3.685871 | |
SABAL (vector, 2D) throughput 0.403480 | |
SABAL (vector, 2D) throughput 0.432392 | |
SABAL2 (vector, 8H) 1->1 3.546463 | |
SABAL2 (vector, 8H) 1->2 3.742812 | |
SABAL2 (vector, 8H) 1->3 2.668307 | |
SABAL2 (vector, 8H) throughput 0.503630 | |
SABAL2 (vector, 8H) throughput 0.521434 | |
SABAL2 (vector, 4S) 1->1 3.619215 | |
SABAL2 (vector, 4S) 1->2 3.009677 | |
SABAL2 (vector, 4S) 1->3 3.381162 | |
SABAL2 (vector, 4S) throughput 0.434552 | |
SABAL2 (vector, 4S) throughput 0.556649 | |
SABAL2 (vector, 2D) 1->1 3.637452 | |
SABAL2 (vector, 2D) 1->2 3.000567 | |
SABAL2 (vector, 2D) 1->3 3.003960 | |
SABAL2 (vector, 2D) throughput 0.570905 | |
SABAL2 (vector, 2D) throughput 0.539789 | |
SABD (vector, 8B) 1->2 3.312987 | |
SABD (vector, 8B) 1->3 2.769235 | |
SABD (vector, 8B) throughput 0.502359 | |
SABD (vector, 16B) 1->2 3.567490 | |
SABD (vector, 16B) 1->3 2.989731 | |
SABD (vector, 16B) throughput 0.614586 | |
SABD (vector, 4H) 1->2 3.355146 | |
SABD (vector, 4H) 1->3 3.081927 | |
SABD (vector, 4H) throughput 0.520798 | |
SABD (vector, 8H) 1->2 3.045998 | |
SABD (vector, 8H) 1->3 3.092206 | |
SABD (vector, 8H) throughput 0.500790 | |
SABD (vector, 2S) 1->2 2.996344 | |
SABD (vector, 2S) 1->3 3.032403 | |
SABD (vector, 2S) throughput 0.639312 | |
SABD (vector, 4S) 1->2 2.964865 | |
SABD (vector, 4S) 1->3 2.935895 | |
SABD (vector, 4S) throughput 0.536731 | |
SABDL (vector, 8H) 1->2 3.013685 | |
SABDL (vector, 8H) 1->3 3.092662 | |
SABDL (vector, 8H) throughput 0.508309 | |
SABDL (vector, 4S) 1->2 2.797812 | |
SABDL (vector, 4S) 1->3 3.014093 | |
SABDL (vector, 4S) throughput 0.522330 | |
SABDL (vector, 2D) 1->2 3.010288 | |
SABDL (vector, 2D) 1->3 3.593778 | |
SABDL (vector, 2D) throughput 0.586483 | |
SABDL2 (vector, 8H) 1->2 2.997375 | |
SABDL2 (vector, 8H) 1->3 3.006155 | |
SABDL2 (vector, 8H) throughput 0.589966 | |
SABDL2 (vector, 4S) 1->2 3.121799 | |
SABDL2 (vector, 4S) 1->3 3.394735 | |
SABDL2 (vector, 4S) throughput 0.500081 | |
SABDL2 (vector, 2D) 1->2 3.685663 | |
SABDL2 (vector, 2D) 1->3 3.000686 | |
SABDL2 (vector, 2D) throughput 0.561512 | |
SADALP (vector, 4H) 1->1 3.004996 | |
SADALP (vector, 4H) 1->2 3.007693 | |
SADALP (vector, 4H) throughput 0.394840 | |
SADALP (vector, 4H) throughput 0.541280 | |
SADALP (vector, 8H) 1->1 3.383163 | |
SADALP (vector, 8H) 1->2 3.309917 | |
SADALP (vector, 8H) throughput 0.518616 | |
SADALP (vector, 8H) throughput 0.940106 | |
SADALP (vector, 2S) 1->1 2.822840 | |
SADALP (vector, 2S) 1->2 2.997860 | |
SADALP (vector, 2S) throughput 0.629988 | |
SADALP (vector, 2S) throughput 0.504692 | |
SADALP (vector, 4S) 1->1 2.996531 | |
SADALP (vector, 4S) 1->2 3.189474 | |
SADALP (vector, 4S) throughput 0.544209 | |
SADALP (vector, 4S) throughput 0.621157 | |
SADALP (vector, 1D) 1->1 3.822951 | |
SADALP (vector, 1D) 1->2 2.815154 | |
SADALP (vector, 1D) throughput 0.538329 | |
SADALP (vector, 1D) throughput 0.547505 | |
SADALP (vector, 2D) 1->1 3.184567 | |
SADALP (vector, 2D) 1->2 2.998870 | |
SADALP (vector, 2D) throughput 0.501885 | |
SADALP (vector, 2D) throughput 0.518254 | |
SADDL (vector, 8H) 1->2 2.074264 | |
SADDL (vector, 8H) 1->3 1.947960 | |
SADDL (vector, 8H) throughput 0.539708 | |
SADDL (vector, 4S) 1->2 2.461415 | |
SADDL (vector, 4S) 1->3 2.002957 | |
SADDL (vector, 4S) throughput 0.500723 | |
SADDL (vector, 2D) 1->2 1.996255 | |
SADDL (vector, 2D) 1->3 2.002956 | |
SADDL (vector, 2D) throughput 0.409344 | |
SADDL2 (vector, 8H) 1->2 2.003802 | |
SADDL2 (vector, 8H) 1->3 2.135944 | |
SADDL2 (vector, 8H) throughput 0.520689 | |
SADDL2 (vector, 4S) 1->2 1.760975 | |
SADDL2 (vector, 4S) 1->3 1.991428 | |
SADDL2 (vector, 4S) throughput 0.526342 | |
SADDL2 (vector, 2D) 1->2 2.067015 | |
SADDL2 (vector, 2D) 1->3 1.995926 | |
SADDL2 (vector, 2D) throughput 0.572240 | |
SADDLP (vector, 4H) 1->2 2.094150 | |
SADDLP (vector, 4H) throughput 0.501276 | |
SADDLP (vector, 8H) 1->2 2.005872 | |
SADDLP (vector, 8H) throughput 0.403376 | |
SADDLP (vector, 2S) 1->2 1.925611 | |
SADDLP (vector, 2S) throughput 0.701203 | |
SADDLP (vector, 4S) 1->2 1.999915 | |
SADDLP (vector, 4S) throughput 0.626526 | |
SADDLP (vector, 1D) 1->2 1.896445 | |
SADDLP (vector, 1D) throughput 0.515437 | |
SADDLP (vector, 2D) 1->2 2.322773 | |
SADDLP (vector, 2D) throughput 0.550845 | |
SADDLV (vector, 8B) 1->2 2.999071 | |
SADDLV (vector, 8B) throughput 0.636241 | |
SADDLV (vector, 16B) 1->2 3.019153 | |
SADDLV (vector, 16B) throughput 0.575372 | |
SADDLV (vector, 4H) 1->2 3.030367 | |
SADDLV (vector, 4H) throughput 0.578967 | |
SADDLV (vector, 8H) 1->2 2.999272 | |
SADDLV (vector, 8H) throughput 0.635103 | |
SADDLV (vector, 4S) 1->2 3.055673 | |
SADDLV (vector, 4S) throughput 0.278926 | |
SADDW (vector, 8H) 1->2 2.022393 | |
SADDW (vector, 8H) 1->3 1.939439 | |
SADDW (vector, 8H) throughput 0.809379 | |
SADDW (vector, 4S) 1->2 2.142636 | |
SADDW (vector, 4S) 1->3 2.007137 | |
SADDW (vector, 4S) throughput 0.429380 | |
SADDW (vector, 2D) 1->2 2.003597 | |
SADDW (vector, 2D) 1->3 1.467885 | |
SADDW (vector, 2D) throughput 0.580798 | |
SADDW2 (vector, 8H) 1->2 2.026643 | |
SADDW2 (vector, 8H) 1->3 2.024498 | |
SADDW2 (vector, 8H) throughput 0.503265 | |
SADDW2 (vector, 4S) 1->2 2.004170 | |
SADDW2 (vector, 4S) 1->3 2.010447 | |
SADDW2 (vector, 4S) throughput 0.499028 | |
SADDW2 (vector, 2D) 1->2 2.020449 | |
SADDW2 (vector, 2D) 1->3 1.996132 | |
SADDW2 (vector, 2D) throughput 0.504555 | |
SCVTF (scalar, fixed-point, S from 32-bit GPR) 1->2 roundtrip 10.747176 | |
SCVTF (scalar, fixed-point, S from 32-bit GPR) throughput 0.514834 | |
SCVTF (scalar, fixed-point, D from 32-bit GPR) 1->2 roundtrip 10.475107 | |
SCVTF (scalar, fixed-point, D from 32-bit GPR) throughput 0.508775 | |
SCVTF (scalar, fixed-point, S from 64-bit GPR) 1->2 roundtrip 10.016344 | |
SCVTF (scalar, fixed-point, S from 64-bit GPR) throughput 0.502420 | |
SCVTF (scalar, fixed-point, D from 64-bit GPR) 1->2 roundtrip 11.965442 | |
SCVTF (scalar, fixed-point, D from 64-bit GPR) throughput 1.263495 | |
SCVTF (scalar, integer, S from 32-bit GPR) 1->2 roundtrip 11.604058 | |
SCVTF (scalar, integer, S from 32-bit GPR) throughput 0.447383 | |
SCVTF (scalar, integer, D from 32-bit GPR) 1->2 roundtrip 10.031948 | |
SCVTF (scalar, integer, D from 32-bit GPR) throughput 0.503654 | |
SCVTF (scalar, integer, S from 64-bit GPR) 1->2 roundtrip 10.027090 | |
SCVTF (scalar, integer, S from 64-bit GPR) throughput 0.505900 | |
SCVTF (scalar, integer, D from 64-bit GPR) 1->2 roundtrip 11.149041 | |
SCVTF (scalar, integer, D from 64-bit GPR) throughput 0.499189 | |
SCVTF (scalar, fixed-point, S from S) 1->2 3.291355 | |
SCVTF (scalar, fixed-point, S from S) throughput 0.560211 | |
SCVTF (scalar, fixed-point, D from D) 1->2 3.128059 | |
SCVTF (scalar, fixed-point, D from D) throughput 0.571024 | |
SCVTF (vector, fixed-point, 2S from 2S) 1->2 3.236119 | |
SCVTF (vector, fixed-point, 2S from 2S) throughput 0.357278 | |
SCVTF (vector, fixed-point, 4S from 4S) 1->2 3.007992 | |
SCVTF (vector, fixed-point, 4S from 4S) throughput 0.447131 | |
SCVTF (vector, fixed-point, 2D from 2D) 1->2 3.082175 | |
SCVTF (vector, fixed-point, 2D from 2D) throughput 0.604206 | |
SCVTF (scalar, integer, S from S) 1->2 3.059918 | |
SCVTF (scalar, integer, S from S) throughput 0.328453 | |
SCVTF (scalar, integer, D from D) 1->2 3.025104 | |
SCVTF (scalar, integer, D from D) throughput 0.486123 | |
SCVTF (vector, integer, 2S from 2S) 1->2 3.003636 | |
SCVTF (vector, integer, 2S from 2S) throughput 0.478421 | |
SCVTF (vector, integer, 4S from 4S) 1->2 3.049093 | |
SCVTF (vector, integer, 4S from 4S) throughput 0.638962 | |
SCVTF (vector, integer, 2D from 2D) 1->2 3.003180 | |
SCVTF (vector, integer, 2D from 2D) throughput 0.496317 | |
SHA1C 1->1 4.051272 | |
SHA1C 1->2 5.016176 | |
SHA1C 1->3 3.424852 | |
SHA1C throughput 4.260635 | |
SHA1C throughput 9.748699 | |
SHA1H 1->2 2.212774 | |
SHA1H throughput 1.138778 | |
SHA1M 1->1 6.134673 | |
SHA1M 1->2 4.875985 | |
SHA1M 1->3 3.976708 | |
SHA1M throughput 4.697822 | |
SHA1M throughput 5.052569 | |
SHA1P 1->1 4.413201 | |
SHA1P 1->2 5.006492 | |
SHA1P 1->3 4.649625 | |
SHA1P throughput 4.088976 | |
SHA1P throughput 5.241915 | |
SHA1SU0 1->1 1.317131 | |
SHA1SU0 1->2 1.779497 | |
SHA1SU0 1->3 2.030424 | |
SHA1SU0 throughput 1.039081 | |
SHA1SU0 throughput 1.166148 | |
SHA1SU1 1->1 1.998886 | |
SHA1SU1 1->2 1.985158 | |
SHA1SU1 throughput 1.008883 | |
SHA1SU1 throughput 1.008367 | |
SHA256H 1->1 4.006509 | |
SHA256H 1->2 5.040704 | |
SHA256H 1->3 5.554881 | |
SHA256H throughput 2.902630 | |
SHA256H throughput 2.328667 | |
SHA256H2 1->1 4.008517 | |
SHA256H2 1->2 5.028506 | |
SHA256H2 1->3 5.271121 | |
SHA256H2 throughput 1.938203 | |
SHA256H2 throughput 2.837457 | |
SHA256SU0 1->1 2.110497 | |
SHA256SU0 1->2 2.008347 | |
SHA256SU0 throughput 1.018245 | |
SHA256SU0 throughput 1.167320 | |
SHA256SU1 1->1 3.045821 | |
SHA256SU1 1->2 2.995285 | |
SHA256SU1 1->3 3.006964 | |
SHA256SU1 throughput 1.207115 | |
SHA256SU1 throughput 1.178254 | |
SHADD (vector, 8B) 1->2 1.626995 | |
SHADD (vector, 8B) 1->3 1.903988 | |
SHADD (vector, 8B) throughput 0.514430 | |
SHADD (vector, 16B) 1->2 1.915966 | |
SHADD (vector, 16B) 1->3 1.993592 | |
SHADD (vector, 16B) throughput 0.520691 | |
SHADD (vector, 4H) 1->2 1.854030 | |
SHADD (vector, 4H) 1->3 2.035794 | |
SHADD (vector, 4H) throughput 0.399484 | |
SHADD (vector, 8H) 1->2 2.370750 | |
SHADD (vector, 8H) 1->3 2.111178 | |
SHADD (vector, 8H) throughput 0.421902 | |
SHADD (vector, 2S) 1->2 3.026012 | |
SHADD (vector, 2S) 1->3 1.934143 | |
SHADD (vector, 2S) throughput 0.559937 | |
SHADD (vector, 4S) 1->2 1.672959 | |
SHADD (vector, 4S) 1->3 5.758187 | |
SHADD (vector, 4S) throughput 0.566609 | |
SHL (vector, 8B) 1->2 2.105348 | |
SHL (vector, 8B) throughput 0.574626 | |
SHL (vector, 16B) 1->2 2.003999 | |
SHL (vector, 16B) throughput 0.445028 | |
SHL (vector, 4H) 1->2 2.044053 | |
SHL (vector, 4H) throughput 0.700117 | |
SHL (vector, 8H) 1->2 1.990233 | |
SHL (vector, 8H) throughput 0.500839 | |
SHL (vector, 2S) 1->2 2.002233 | |
SHL (vector, 2S) throughput 0.620495 | |
SHL (vector, 4S) 1->2 1.999740 | |
SHL (vector, 4S) throughput 0.455981 | |
SHL (vector, 2D) 1->2 2.038344 | |
SHL (vector, 2D) throughput 0.528903 | |
SHL (scalar, D) 1->2 1.947001 | |
SHL (scalar, D) throughput 0.592678 | |
SHLL (vector, 8H) 1->2 1.859510 | |
SHLL (vector, 8H) throughput 0.475248 | |
SHLL (vector, 4S) 1->2 2.162726 | |
SHLL (vector, 4S) throughput 0.516511 | |
SHLL (vector, 2D) 1->2 2.620067 | |
SHLL (vector, 2D) throughput 0.491338 | |
SHLL2 (vector, 8H) 1->2 1.862501 | |
SHLL2 (vector, 8H) throughput 0.545502 | |
SHLL2 (vector, 4S) 1->2 1.959241 | |
SHLL2 (vector, 4S) throughput 0.582233 | |
SHLL2 (vector, 2D) 1->2 1.994776 | |
SHLL2 (vector, 2D) throughput 0.525899 | |
SHRN (8H) 1->2 3.078216 | |
SHRN (8H) throughput 0.661883 | |
SHRN (4S) 1->2 4.815494 | |
SHRN (4S) throughput 0.371687 | |
SHRN (2D) 1->2 3.316214 | |
SHRN (2D) throughput 0.507678 | |
SHRN2 (8H) 1->1 3.007827 | |
SHRN2 (8H) 1->2 3.006640 | |
SHRN2 (8H) throughput 0.503419 | |
SHRN2 (8H) throughput 0.499430 | |
SHRN2 (4S) 1->1 2.997977 | |
SHRN2 (4S) 1->2 3.003805 | |
SHRN2 (4S) throughput 0.500379 | |
SHRN2 (4S) throughput 0.501873 | |
SHRN2 (2D) 1->1 2.997010 | |
SHRN2 (2D) 1->2 3.002507 | |
SHRN2 (2D) throughput 0.501437 | |
SHRN2 (2D) throughput 0.507549 | |
SHSUB (vector, 8B) 1->2 2.004899 | |
SHSUB (vector, 8B) 1->3 2.004039 | |
SHSUB (vector, 8B) throughput 0.501795 | |
SHSUB (vector, 16B) 1->2 2.000342 | |
SHSUB (vector, 16B) 1->3 1.957339 | |
SHSUB (vector, 16B) throughput 0.446011 | |
SHSUB (vector, 4H) 1->2 2.011948 | |
SHSUB (vector, 4H) 1->3 1.726524 | |
SHSUB (vector, 4H) throughput 0.487286 | |
SHSUB (vector, 8H) 1->2 1.990778 | |
SHSUB (vector, 8H) 1->3 2.006438 | |
SHSUB (vector, 8H) throughput 0.501997 | |
SHSUB (vector, 2S) 1->2 2.001495 | |
SHSUB (vector, 2S) 1->3 2.009630 | |
SHSUB (vector, 2S) throughput 0.504384 | |
SHSUB (vector, 4S) 1->2 2.007168 | |
SHSUB (vector, 4S) 1->3 1.935026 | |
SHSUB (vector, 4S) throughput 0.501028 | |
SLI (vector, 8B) 1->1 2.016049 | |
SLI (vector, 8B) 1->2 1.831947 | |
SLI (vector, 8B) throughput 0.506567 | |
SLI (vector, 8B) throughput 0.503098 | |
SLI (vector, 16B) 1->1 1.981928 | |
SLI (vector, 16B) 1->2 1.999109 | |
SLI (vector, 16B) throughput 0.542134 | |
SLI (vector, 16B) throughput 0.503133 | |
SLI (vector, 4H) 1->1 2.020328 | |
SLI (vector, 4H) 1->2 2.035310 | |
SLI (vector, 4H) throughput 0.501670 | |
SLI (vector, 4H) throughput 0.501104 | |
SLI (vector, 8H) 1->1 2.003808 | |
SLI (vector, 8H) 1->2 2.018126 | |
SLI (vector, 8H) throughput 0.501324 | |
SLI (vector, 8H) throughput 0.500663 | |
SLI (vector, 2S) 1->1 1.999071 | |
SLI (vector, 2S) 1->2 2.001741 | |
SLI (vector, 2S) throughput 0.501085 | |
SLI (vector, 2S) throughput 0.503613 | |
SLI (vector, 4S) 1->1 2.001417 | |
SLI (vector, 4S) 1->2 2.000485 | |
SLI (vector, 4S) throughput 0.501979 | |
SLI (vector, 4S) throughput 0.505544 | |
SLI (vector, 2D) 1->1 1.998383 | |
SLI (vector, 2D) 1->2 2.002066 | |
SLI (vector, 2D) throughput 0.501438 | |
SLI (vector, 2D) throughput 0.626713 | |
SLI (scalar, D) 1->1 2.325765 | |
SLI (scalar, D) 1->2 1.914288 | |
SLI (scalar, D) throughput 0.386480 | |
SLI (scalar, D) throughput 0.343077 | |
SMAX (vector, 8B) 1->2 2.734828 | |
SMAX (vector, 8B) 1->3 1.611673 | |
SMAX (vector, 8B) throughput 0.480095 | |
SMAX (vector, 16B) 1->2 2.007288 | |
SMAX (vector, 16B) 1->3 2.022247 | |
SMAX (vector, 16B) throughput 0.499990 | |
SMAX (vector, 4H) 1->2 1.994675 | |
SMAX (vector, 4H) 1->3 1.792808 | |
SMAX (vector, 4H) throughput 0.564807 | |
SMAX (vector, 8H) 1->2 2.002549 | |
SMAX (vector, 8H) 1->3 2.001720 | |
SMAX (vector, 8H) throughput 0.515603 | |
SMAX (vector, 2S) 1->2 1.846244 | |
SMAX (vector, 2S) 1->3 2.043013 | |
SMAX (vector, 2S) throughput 0.656958 | |
SMAX (vector, 4S) 1->2 2.296030 | |
SMAX (vector, 4S) 1->3 2.003608 | |
SMAX (vector, 4S) throughput 0.632772 | |
SMAXP (vector, 8B) 1->2 1.968545 | |
SMAXP (vector, 8B) 1->3 2.082002 | |
SMAXP (vector, 8B) throughput 0.516500 | |
SMAXP (vector, 16B) 1->2 2.298335 | |
SMAXP (vector, 16B) 1->3 2.024417 | |
SMAXP (vector, 16B) throughput 0.501979 | |
SMAXP (vector, 4H) 1->2 1.989820 | |
SMAXP (vector, 4H) 1->3 2.053554 | |
SMAXP (vector, 4H) throughput 0.499341 | |
SMAXP (vector, 8H) 1->2 2.030617 | |
SMAXP (vector, 8H) 1->3 1.991000 | |
SMAXP (vector, 8H) throughput 0.418627 | |
SMAXP (vector, 2S) 1->2 1.999742 | |
SMAXP (vector, 2S) 1->3 2.034116 | |
SMAXP (vector, 2S) throughput 0.479085 | |
SMAXP (vector, 4S) 1->2 2.086236 | |
SMAXP (vector, 4S) 1->3 2.012648 | |
SMAXP (vector, 4S) throughput 0.506180 | |
SMAXV (vector, 8B) 1->2 3.001274 | |
SMAXV (vector, 8B) throughput 0.512884 | |
SMAXV (vector, 16B) 1->2 3.007935 | |
SMAXV (vector, 16B) throughput 0.501903 | |
SMAXV (vector, 4H) 1->2 3.192244 | |
SMAXV (vector, 4H) throughput 0.499464 | |
SMAXV (vector, 8H) 1->2 3.014160 | |
SMAXV (vector, 8H) throughput 0.500308 | |
SMAXV (vector, 4S) 1->2 2.999595 | |
SMAXV (vector, 4S) throughput 0.527162 | |
SMIN (vector, 8B) 1->2 1.977120 | |
SMIN (vector, 8B) 1->3 2.020017 | |
SMIN (vector, 8B) throughput 0.498721 | |
SMIN (vector, 16B) 1->2 1.993190 | |
SMIN (vector, 16B) 1->3 2.000483 | |
SMIN (vector, 16B) throughput 0.511552 | |
SMIN (vector, 4H) 1->2 2.002025 | |
SMIN (vector, 4H) 1->3 2.003766 | |
SMIN (vector, 4H) throughput 0.520112 | |
SMIN (vector, 8H) 1->2 1.988933 | |
SMIN (vector, 8H) 1->3 2.069363 | |
SMIN (vector, 8H) throughput 0.500815 | |
SMIN (vector, 2S) 1->2 2.008381 | |
SMIN (vector, 2S) 1->3 1.999271 | |
SMIN (vector, 2S) throughput 0.504682 | |
SMIN (vector, 4S) 1->2 1.999557 | |
SMIN (vector, 4S) 1->3 1.995196 | |
SMIN (vector, 4S) throughput 0.500542 | |
SMINP (vector, 8B) 1->2 1.998342 | |
SMINP (vector, 8B) 1->3 2.002187 | |
SMINP (vector, 8B) throughput 0.395699 | |
SMINP (vector, 16B) 1->2 1.847285 | |
SMINP (vector, 16B) 1->3 1.999275 | |
SMINP (vector, 16B) throughput 0.506697 | |
SMINP (vector, 4H) 1->2 2.002141 | |
SMINP (vector, 4H) 1->3 2.346024 | |
SMINP (vector, 4H) throughput 0.523312 | |
SMINP (vector, 8H) 1->2 1.993068 | |
SMINP (vector, 8H) 1->3 2.034254 | |
SMINP (vector, 8H) throughput 0.500921 | |
SMINP (vector, 2S) 1->2 2.001130 | |
SMINP (vector, 2S) 1->3 1.996486 | |
SMINP (vector, 2S) throughput 0.504290 | |
SMINP (vector, 4S) 1->2 2.049841 | |
SMINP (vector, 4S) 1->3 2.000525 | |
SMINP (vector, 4S) throughput 0.500491 | |
SMINV (vector, 8B) 1->2 3.041788 | |
SMINV (vector, 8B) throughput 0.500172 | |
SMINV (vector, 16B) 1->2 3.005102 | |
SMINV (vector, 16B) throughput 0.500977 | |
SMINV (vector, 4H) 1->2 3.321040 | |
SMINV (vector, 4H) throughput 0.524513 | |
SMINV (vector, 8H) 1->2 3.443354 | |
SMINV (vector, 8H) throughput 0.477800 | |
SMINV (vector, 4S) 1->2 3.004535 | |
SMINV (vector, 4S) throughput 0.470443 | |
SMLAL (by element, 4S) 1->1 2.826182 | |
SMLAL (by element, 4S) 1->2 3.007856 | |
SMLAL (by element, 4S) 1->3 3.039942 | |
SMLAL (by element, 4S) throughput 0.553001 | |
SMLAL (by element, 4S) throughput 0.543355 | |
SMLAL (by element, 2D) 1->1 3.496292 | |
SMLAL (by element, 2D) 1->2 2.997418 | |
SMLAL (by element, 2D) 1->3 3.418896 | |
SMLAL (by element, 2D) throughput 0.598293 | |
SMLAL (by element, 2D) throughput 0.579837 | |
SMLAL2 (by element, 4S) 1->1 3.060125 | |
SMLAL2 (by element, 4S) 1->2 3.918842 | |
SMLAL2 (by element, 4S) 1->3 3.023200 | |
SMLAL2 (by element, 4S) throughput 0.644695 | |
SMLAL2 (by element, 4S) throughput 0.516338 | |
SMLAL2 (by element, 2D) 1->1 2.881804 | |
SMLAL2 (by element, 2D) 1->2 3.038486 | |
SMLAL2 (by element, 2D) 1->3 4.950226 | |
SMLAL2 (by element, 2D) throughput 0.510654 | |
SMLAL2 (by element, 2D) throughput 0.566586 | |
SMLAL (vector, 8H) 1->1 3.028980 | |
SMLAL (vector, 8H) 1->2 2.949429 | |
SMLAL (vector, 8H) 1->3 3.008869 | |
SMLAL (vector, 8H) throughput 0.514041 | |
SMLAL (vector, 8H) throughput 0.528330 | |
SMLAL (vector, 4S) 1->1 3.295244 | |
SMLAL (vector, 4S) 1->2 3.198064 | |
SMLAL (vector, 4S) 1->3 3.296404 | |
SMLAL (vector, 4S) throughput 0.490309 | |
SMLAL (vector, 4S) throughput 0.450080 | |
SMLAL (vector, 2D) 1->1 2.951768 | |
SMLAL (vector, 2D) 1->2 3.062477 | |
SMLAL (vector, 2D) 1->3 3.526685 | |
SMLAL (vector, 2D) throughput 0.589969 | |
SMLAL (vector, 2D) throughput 0.480925 | |
SMLAL2 (vector, 8H) 1->1 2.849970 | |
SMLAL2 (vector, 8H) 1->2 3.028642 | |
SMLAL2 (vector, 8H) 1->3 3.156189 | |
SMLAL2 (vector, 8H) throughput 0.500999 | |
SMLAL2 (vector, 8H) throughput 0.478366 | |
SMLAL2 (vector, 4S) 1->1 2.545938 | |
SMLAL2 (vector, 4S) 1->2 3.017366 | |
SMLAL2 (vector, 4S) 1->3 3.011268 | |
SMLAL2 (vector, 4S) throughput 0.491994 | |
SMLAL2 (vector, 4S) throughput 0.570666 | |
SMLAL2 (vector, 2D) 1->1 4.350049 | |
SMLAL2 (vector, 2D) 1->2 2.883715 | |
SMLAL2 (vector, 2D) 1->3 3.649403 | |
SMLAL2 (vector, 2D) throughput 0.508877 | |
SMLAL2 (vector, 2D) throughput 0.499391 | |
SMLSL (by element, 4S) 1->1 3.005588 | |
SMLSL (by element, 4S) 1->2 3.004089 | |
SMLSL (by element, 4S) 1->3 3.028720 | |
SMLSL (by element, 4S) throughput 0.502365 | |
SMLSL (by element, 4S) throughput 0.553020 | |
SMLSL (by element, 2D) 1->1 2.947778 | |
SMLSL (by element, 2D) 1->2 3.028599 | |
SMLSL (by element, 2D) 1->3 3.217926 | |
SMLSL (by element, 2D) throughput 0.494125 | |
SMLSL (by element, 2D) throughput 0.504300 | |
SMLSL2 (by element, 4S) 1->1 3.014650 | |
SMLSL2 (by element, 4S) 1->2 4.463565 | |
SMLSL2 (by element, 4S) 1->3 2.993067 | |
SMLSL2 (by element, 4S) throughput 0.506629 | |
SMLSL2 (by element, 4S) throughput 0.505831 | |
SMLSL2 (by element, 2D) 1->1 3.001555 | |
SMLSL2 (by element, 2D) 1->2 3.023283 | |
SMLSL2 (by element, 2D) 1->3 3.963480 | |
SMLSL2 (by element, 2D) throughput 0.462687 | |
SMLSL2 (by element, 2D) throughput 0.570078 | |
SMLSL (vector, 8H) 1->1 3.540537 | |
SMLSL (vector, 8H) 1->2 2.898409 | |
SMLSL (vector, 8H) 1->3 4.891859 | |
SMLSL (vector, 8H) throughput 0.503461 | |
SMLSL (vector, 8H) throughput 0.505369 | |
SMLSL (vector, 4S) 1->1 2.998453 | |
SMLSL (vector, 4S) 1->2 5.692473 | |
SMLSL (vector, 4S) 1->3 4.123894 | |
SMLSL (vector, 4S) throughput 0.542829 | |
SMLSL (vector, 4S) throughput 0.509587 | |
SMLSL (vector, 2D) 1->1 3.108366 | |
SMLSL (vector, 2D) 1->2 2.997250 | |
SMLSL (vector, 2D) 1->3 3.046002 | |
SMLSL (vector, 2D) throughput 0.520369 | |
SMLSL (vector, 2D) throughput 0.505021 | |
SMLSL2 (vector, 8H) 1->1 3.000003 | |
SMLSL2 (vector, 8H) 1->2 3.017474 | |
SMLSL2 (vector, 8H) 1->3 3.006357 | |
SMLSL2 (vector, 8H) throughput 0.476535 | |
SMLSL2 (vector, 8H) throughput 0.503824 | |
SMLSL2 (vector, 4S) 1->1 2.938481 | |
SMLSL2 (vector, 4S) 1->2 3.004765 | |
SMLSL2 (vector, 4S) 1->3 2.884797 | |
SMLSL2 (vector, 4S) throughput 0.596734 | |
SMLSL2 (vector, 4S) throughput 0.730539 | |
SMLSL2 (vector, 2D) 1->1 2.598412 | |
SMLSL2 (vector, 2D) 1->2 3.208983 | |
SMLSL2 (vector, 2D) 1->3 3.115093 | |
SMLSL2 (vector, 2D) throughput 0.500903 | |
SMLSL2 (vector, 2D) throughput 0.419695 | |
SMULL (by element, 4S) 1->2 3.005669 | |
SMULL (by element, 4S) 1->3 3.255625 | |
SMULL (by element, 4S) throughput 0.512605 | |
SMULL (by element, 2D) 1->2 2.996527 | |
SMULL (by element, 2D) 1->3 3.049404 | |
SMULL (by element, 2D) throughput 0.486058 | |
SMULL2 (by element, 4S) 1->2 3.016030 | |
SMULL2 (by element, 4S) 1->3 2.945528 | |
SMULL2 (by element, 4S) throughput 0.473397 | |
SMULL2 (by element, 2D) 1->2 3.840938 | |
SMULL2 (by element, 2D) 1->3 3.574636 | |
SMULL2 (by element, 2D) throughput 0.641237 | |
SMULL (vector, 8H) 1->2 2.752544 | |
SMULL (vector, 8H) 1->3 2.584957 | |
SMULL (vector, 8H) throughput 0.532668 | |
SMULL (vector, 4S) 1->2 3.103032 | |
SMULL (vector, 4S) 1->3 2.928861 | |
SMULL (vector, 4S) throughput 0.439363 | |
SMULL (vector, 2D) 1->2 2.989323 | |
SMULL (vector, 2D) 1->3 3.100492 | |
SMULL (vector, 2D) throughput 0.451012 | |
SMULL2 (vector, 8H) 1->2 2.896022 | |
SMULL2 (vector, 8H) 1->3 3.165623 | |
SMULL2 (vector, 8H) throughput 0.451292 | |
SMULL2 (vector, 4S) 1->2 2.779296 | |
SMULL2 (vector, 4S) 1->3 3.345433 | |
SMULL2 (vector, 4S) throughput 0.617391 | |
SMULL2 (vector, 2D) 1->2 3.195457 | |
SMULL2 (vector, 2D) 1->3 2.999587 | |
SMULL2 (vector, 2D) throughput 0.604326 | |
SQABS (scalar, B) 1->2 3.472400 | |
SQABS (scalar, B) throughput 0.627009 | |
SQABS (scalar, H) 1->2 2.303872 | |
SQABS (scalar, H) throughput 0.548397 | |
SQABS (scalar, S) 1->2 3.005932 | |
SQABS (scalar, S) throughput 0.512640 | |
SQABS (scalar, D) 1->2 3.088566 | |
SQABS (scalar, D) throughput 0.507566 | |
SQABS (vector, 8B) 1->2 3.003698 | |
SQABS (vector, 8B) throughput 0.515342 | |
SQABS (vector, 16B) 1->2 3.011217 | |
SQABS (vector, 16B) throughput 0.508145 | |
SQABS (vector, 4H) 1->2 3.005674 | |
SQABS (vector, 4H) throughput 0.502244 | |
SQABS (vector, 8H) 1->2 3.023524 | |
SQABS (vector, 8H) throughput 0.500373 | |
SQABS (vector, 2S) 1->2 2.864715 | |
SQABS (vector, 2S) throughput 0.498016 | |
SQABS (vector, 4S) 1->2 3.005181 | |
SQABS (vector, 4S) throughput 0.499646 | |
SQABS (vector, 2D) 1->2 3.009597 | |
SQABS (vector, 2D) throughput 0.508673 | |
SQADD (scalar, B) 1->2 3.048914 | |
SQADD (scalar, B) 1->3 3.003927 | |
SQADD (scalar, B) throughput 0.518105 | |
SQADD (scalar, H) 1->2 3.001883 | |
SQADD (scalar, H) 1->3 2.993458 | |
SQADD (scalar, H) throughput 0.500476 | |
SQADD (scalar, S) 1->2 3.012473 | |
SQADD (scalar, S) 1->3 3.157192 | |
SQADD (scalar, S) throughput 0.534810 | |
SQADD (scalar, D) 1->2 3.026652 | |
SQADD (scalar, D) 1->3 4.288236 | |
SQADD (scalar, D) throughput 0.152139 | |
SQADD (vector, 8B) 1->2 3.021192 | |
SQADD (vector, 8B) 1->3 3.055438 | |
SQADD (vector, 8B) throughput 0.505976 | |
SQADD (vector, 16B) 1->2 3.008360 | |
SQADD (vector, 16B) 1->3 2.996531 | |
SQADD (vector, 16B) throughput 0.623501 | |
SQADD (vector, 4H) 1->2 2.543183 | |
SQADD (vector, 4H) 1->3 3.006992 | |
SQADD (vector, 4H) throughput 0.500116 | |
SQADD (vector, 8H) 1->2 3.129619 | |
SQADD (vector, 8H) 1->3 3.234857 | |
SQADD (vector, 8H) throughput 0.497800 | |
SQADD (vector, 2S) 1->2 3.006642 | |
SQADD (vector, 2S) 1->3 3.027332 | |
SQADD (vector, 2S) throughput 0.500525 | |
SQADD (vector, 4S) 1->2 2.990222 | |
SQADD (vector, 4S) 1->3 3.020003 | |
SQADD (vector, 4S) throughput 0.520818 | |
SQADD (vector, 2D) 1->2 3.010245 | |
SQADD (vector, 2D) 1->3 2.998262 | |
SQADD (vector, 2D) throughput 0.498087 | |
SQDMLAL (by element, S) 1->1 3.003928 | |
SQDMLAL (by element, S) 1->2 3.049528 | |
SQDMLAL (by element, S) 1->3 3.000242 | |
SQDMLAL (by element, S) throughput 0.522752 | |
SQDMLAL (by element, S) throughput 0.504277 | |
SQDMLAL (by element, D) 1->1 3.017898 | |
SQDMLAL (by element, D) 1->2 2.999677 | |
SQDMLAL (by element, D) 1->3 2.985283 | |
SQDMLAL (by element, D) throughput 0.526290 | |
SQDMLAL (by element, D) throughput 0.541117 | |
SQDMLAL (by element, 4S) 1->1 2.949351 | |
SQDMLAL (by element, 4S) 1->2 3.192838 | |
SQDMLAL (by element, 4S) 1->3 3.086383 | |
SQDMLAL (by element, 4S) throughput 0.499584 | |
SQDMLAL (by element, 4S) throughput 0.505966 | |
SQDMLAL (by element, 2D) 1->1 3.009272 | |
SQDMLAL (by element, 2D) 1->2 3.030329 | |
SQDMLAL (by element, 2D) 1->3 3.044863 | |
SQDMLAL (by element, 2D) throughput 0.507021 | |
SQDMLAL (by element, 2D) throughput 0.514776 | |
SQDMLAL2 (by element, 4S) 1->1 3.343962 | |
SQDMLAL2 (by element, 4S) 1->2 2.995472 | |
SQDMLAL2 (by element, 4S) 1->3 3.096468 | |
SQDMLAL2 (by element, 4S) throughput 0.510379 | |
SQDMLAL2 (by element, 4S) throughput 0.509917 | |
SQDMLAL2 (by element, 2D) 1->1 2.020210 | |
SQDMLAL2 (by element, 2D) 1->2 3.084143 | |
SQDMLAL2 (by element, 2D) 1->3 2.998184 | |
SQDMLAL2 (by element, 2D) throughput 0.504110 | |
SQDMLAL2 (by element, 2D) throughput 0.502440 | |
SQDMLAL (scalar, S) 1->1 2.974457 | |
SQDMLAL (scalar, S) 1->2 3.009234 | |
SQDMLAL (scalar, S) 1->3 3.187135 | |
SQDMLAL (scalar, S) throughput 0.501851 | |
SQDMLAL (scalar, S) throughput 0.500837 | |
SQDMLAL (scalar, D) 1->1 3.012650 | |
SQDMLAL (scalar, D) 1->2 3.009392 | |
SQDMLAL (scalar, D) 1->3 3.008543 | |
SQDMLAL (scalar, D) throughput 0.501549 | |
SQDMLAL (scalar, D) throughput 0.509604 | |
SQDMLAL (vector, 4S) 1->1 3.065719 | |
SQDMLAL (vector, 4S) 1->2 2.812892 | |
SQDMLAL (vector, 4S) 1->3 3.027910 | |
SQDMLAL (vector, 4S) throughput 0.636140 | |
SQDMLAL (vector, 4S) throughput 0.502144 | |
SQDMLAL (vector, 2D) 1->1 3.006803 | |
SQDMLAL (vector, 2D) 1->2 3.006519 | |
SQDMLAL (vector, 2D) 1->3 2.999516 | |
SQDMLAL (vector, 2D) throughput 0.507610 | |
SQDMLAL (vector, 2D) throughput 0.536091 | |
SQDMLAL2 (vector, 4S) 1->1 2.838606 | |
SQDMLAL2 (vector, 4S) 1->2 3.033688 | |
SQDMLAL2 (vector, 4S) 1->3 3.027577 | |
SQDMLAL2 (vector, 4S) throughput 0.506570 | |
SQDMLAL2 (vector, 4S) throughput 0.501430 | |
SQDMLAL2 (vector, 2D) 1->1 2.821224 | |
SQDMLAL2 (vector, 2D) 1->2 3.032901 | |
SQDMLAL2 (vector, 2D) 1->3 3.007089 | |
SQDMLAL2 (vector, 2D) throughput 0.504479 | |
SQDMLAL2 (vector, 2D) throughput 0.505002 | |
SQDMLSL (by element, S) 1->1 3.013400 | |
SQDMLSL (by element, S) 1->2 2.996241 | |
SQDMLSL (by element, S) 1->3 2.961169 | |
SQDMLSL (by element, S) throughput 0.552747 | |
SQDMLSL (by element, S) throughput 0.507680 | |
SQDMLSL (by element, D) 1->1 3.130264 | |
SQDMLSL (by element, D) 1->2 2.993860 | |
SQDMLSL (by element, D) 1->3 2.996121 | |
SQDMLSL (by element, D) throughput 0.490468 | |
SQDMLSL (by element, D) throughput 0.501998 | |
SQDMLSL (by element, 4S) 1->1 2.988384 | |
SQDMLSL (by element, 4S) 1->2 3.001554 | |
SQDMLSL (by element, 4S) 1->3 3.041623 | |
SQDMLSL (by element, 4S) throughput 0.501407 | |
SQDMLSL (by element, 4S) throughput 0.523436 | |
SQDMLSL (by element, 2D) 1->1 2.999554 | |
SQDMLSL (by element, 2D) 1->2 3.024981 | |
SQDMLSL (by element, 2D) 1->3 3.039034 | |
SQDMLSL (by element, 2D) throughput 0.495917 | |
SQDMLSL (by element, 2D) throughput 0.500798 | |
SQDMLSL2 (by element, 4S) 1->1 3.006561 | |
SQDMLSL2 (by element, 4S) 1->2 3.004820 | |
SQDMLSL2 (by element, 4S) 1->3 3.006520 | |
SQDMLSL2 (by element, 4S) throughput 0.503037 | |
SQDMLSL2 (by element, 4S) throughput 0.505295 | |
SQDMLSL2 (by element, 2D) 1->1 2.900908 | |
SQDMLSL2 (by element, 2D) 1->2 2.993828 | |
SQDMLSL2 (by element, 2D) 1->3 3.001615 | |
SQDMLSL2 (by element, 2D) throughput 0.501341 | |
SQDMLSL2 (by element, 2D) throughput 0.501989 | |
SQDMLSL (scalar, S) 1->1 2.982338 | |
SQDMLSL (scalar, S) 1->2 3.004048 | |
SQDMLSL (scalar, S) 1->3 3.012401 | |
SQDMLSL (scalar, S) throughput 0.500045 | |
SQDMLSL (scalar, S) throughput 0.500863 | |
SQDMLSL (scalar, D) 1->1 3.007563 | |
SQDMLSL (scalar, D) 1->2 3.017593 | |
SQDMLSL (scalar, D) 1->3 2.997225 | |
SQDMLSL (scalar, D) throughput 0.522114 | |
SQDMLSL (scalar, D) throughput 0.502184 | |
SQDMLSL (vector, 4S) 1->1 2.993659 | |
SQDMLSL (vector, 4S) 1->2 3.019274 | |
SQDMLSL (vector, 4S) 1->3 2.999233 | |
SQDMLSL (vector, 4S) throughput 0.509191 | |
SQDMLSL (vector, 4S) throughput 0.506240 | |
SQDMLSL (vector, 2D) 1->1 3.106844 | |
SQDMLSL (vector, 2D) 1->2 3.141441 | |
SQDMLSL (vector, 2D) 1->3 2.979331 | |
SQDMLSL (vector, 2D) throughput 0.504981 | |
SQDMLSL (vector, 2D) throughput 0.517781 | |
SQDMLSL2 (vector, 4S) 1->1 2.998583 | |
SQDMLSL2 (vector, 4S) 1->2 3.021074 | |
SQDMLSL2 (vector, 4S) 1->3 3.253337 | |
SQDMLSL2 (vector, 4S) throughput 0.499372 | |
SQDMLSL2 (vector, 4S) throughput 0.414922 | |
SQDMLSL2 (vector, 2D) 1->1 2.998100 | |
SQDMLSL2 (vector, 2D) 1->2 2.997576 | |
SQDMLSL2 (vector, 2D) 1->3 3.009111 | |
SQDMLSL2 (vector, 2D) throughput 0.500768 | |
SQDMLSL2 (vector, 2D) throughput 0.500617 | |
SQDMULH (by element, H) 1->2 3.061268 | |
SQDMULH (by element, H) 1->3 3.034944 | |
SQDMULH (by element, H) throughput 0.500101 | |
SQDMULH (by element, S) 1->2 3.000768 | |
SQDMULH (by element, S) 1->3 3.005669 | |
SQDMULH (by element, S) throughput 0.612429 | |
SQDMULH (by element, 4H) 1->2 3.153676 | |
SQDMULH (by element, 4H) 1->3 3.092952 | |
SQDMULH (by element, 4H) throughput 0.583670 | |
SQDMULH (by element, 8H) 1->2 3.633149 | |
SQDMULH (by element, 8H) 1->3 2.857361 | |
SQDMULH (by element, 8H) throughput 0.591613 | |
SQDMULH (by element, 2S) 1->2 3.010811 | |
SQDMULH (by element, 2S) 1->3 3.013068 | |
SQDMULH (by element, 2S) throughput 0.476230 | |
SQDMULH (by element, 4S) 1->2 2.405200 | |
SQDMULH (by element, 4S) 1->3 3.000484 | |
SQDMULH (by element, 4S) throughput 0.508685 | |
SQDMULH (scalar, H) 1->2 2.057005 | |
SQDMULH (scalar, H) 1->3 3.003720 | |
SQDMULH (scalar, H) throughput 0.723287 | |
SQDMULH (scalar, S) 1->2 3.009720 | |
SQDMULH (scalar, S) 1->3 3.008325 | |
SQDMULH (scalar, S) throughput 0.502558 | |
SQDMULH (vector, 4H) 1->2 2.909365 | |
SQDMULH (vector, 4H) 1->3 4.915345 | |
SQDMULH (vector, 4H) throughput 0.587159 | |
SQDMULH (vector, 8H) 1->2 3.871524 | |
SQDMULH (vector, 8H) 1->3 3.007936 | |
SQDMULH (vector, 8H) throughput 0.556455 | |
SQDMULH (vector, 2S) 1->2 3.009840 | |
SQDMULH (vector, 2S) 1->3 3.242582 | |
SQDMULH (vector, 2S) throughput 0.578232 | |
SQDMULH (vector, 4S) 1->2 3.008221 | |
SQDMULH (vector, 4S) 1->3 3.063854 | |
SQDMULH (vector, 4S) throughput 0.546736 | |
SQDMULL (by element, S) 1->2 3.069566 | |
SQDMULL (by element, S) 1->3 3.005752 | |
SQDMULL (by element, S) throughput 0.503520 | |
SQDMULL (by element, D) 1->2 3.509877 | |
SQDMULL (by element, D) 1->3 3.441182 | |
SQDMULL (by element, D) throughput 0.593149 | |
SQDMULL (by element, 4S) 1->2 2.979054 | |
SQDMULL (by element, 4S) 1->3 3.257607 | |
SQDMULL (by element, 4S) throughput 0.501907 | |
SQDMULL (by element, 2D) 1->2 3.022577 | |
SQDMULL (by element, 2D) 1->3 3.006814 | |
SQDMULL (by element, 2D) throughput 0.499682 | |
SQDMULL2 (by element, 4S) 1->2 3.059888 | |
SQDMULL2 (by element, 4S) 1->3 3.067782 | |
SQDMULL2 (by element, 4S) throughput 0.534277 | |
SQDMULL2 (by element, 2D) 1->2 3.002980 | |
SQDMULL2 (by element, 2D) 1->3 3.060039 | |
SQDMULL2 (by element, 2D) throughput 0.535131 | |
SQDMULL (scalar, S) 1->2 3.007775 | |
SQDMULL (scalar, S) 1->3 5.528016 | |
SQDMULL (scalar, S) throughput 0.593691 | |
SQDMULL (scalar, D) 1->2 3.167561 | |
SQDMULL (scalar, D) 1->3 2.713940 | |
SQDMULL (scalar, D) throughput 0.491888 | |
SQDMULL (vector, 4S) 1->2 3.006153 | |
SQDMULL (vector, 4S) 1->3 3.554972 | |
SQDMULL (vector, 4S) throughput 0.506529 | |
SQDMULL (vector, 2D) 1->2 3.003226 | |
SQDMULL (vector, 2D) 1->3 3.034560 | |
SQDMULL (vector, 2D) throughput 0.568673 | |
SQDMULL2 (vector, 4S) 1->2 2.993657 | |
SQDMULL2 (vector, 4S) 1->3 3.801023 | |
SQDMULL2 (vector, 4S) throughput 0.578436 | |
SQDMULL2 (vector, 2D) 1->2 2.985018 | |
SQDMULL2 (vector, 2D) 1->3 2.990198 | |
SQDMULL2 (vector, 2D) throughput 0.517305 | |
SQNEG (scalar, B) 1->2 2.706624 | |
SQNEG (scalar, B) throughput 0.561048 | |
SQNEG (scalar, H) 1->2 3.091304 | |
SQNEG (scalar, H) throughput 0.581881 | |
SQNEG (scalar, S) 1->2 2.822549 | |
SQNEG (scalar, S) throughput 0.416655 | |
SQNEG (scalar, D) 1->2 3.270133 | |
SQNEG (scalar, D) throughput 0.454799 | |
SQNEG (vector, 8B) 1->2 2.842037 | |
SQNEG (vector, 8B) throughput 0.505356 | |
SQNEG (vector, 16B) 1->2 3.290256 | |
SQNEG (vector, 16B) throughput 0.536155 | |
SQNEG (vector, 4H) 1->2 3.228559 | |
SQNEG (vector, 4H) throughput 0.510857 | |
SQNEG (vector, 8H) 1->2 3.268666 | |
SQNEG (vector, 8H) throughput 0.573829 | |
SQNEG (vector, 2S) 1->2 3.066290 | |
SQNEG (vector, 2S) throughput 0.485769 | |
SQNEG (vector, 4S) 1->2 2.986320 | |
SQNEG (vector, 4S) throughput 0.519163 | |
SQNEG (vector, 2D) 1->2 4.952564 | |
SQNEG (vector, 2D) throughput 0.610702 | |
SQRDMULH (by element, H) 1->2 3.956337 | |
SQRDMULH (by element, H) 1->3 3.023799 | |
SQRDMULH (by element, H) throughput 0.500678 | |
SQRDMULH (by element, S) 1->2 3.039815 | |
SQRDMULH (by element, S) 1->3 3.004089 | |
SQRDMULH (by element, S) throughput 0.500623 | |
SQRDMULH (by element, 4H) 1->2 2.997493 | |
SQRDMULH (by element, 4H) 1->3 2.997089 | |
SQRDMULH (by element, 4H) throughput 0.502153 | |
SQRDMULH (by element, 8H) 1->2 2.999072 | |
SQRDMULH (by element, 8H) 1->3 3.006400 | |
SQRDMULH (by element, 8H) throughput 0.500661 | |
SQRDMULH (by element, 2S) 1->2 3.004778 | |
SQRDMULH (by element, 2S) 1->3 2.993297 | |
SQRDMULH (by element, 2S) throughput 0.507419 | |
SQRDMULH (by element, 4S) 1->2 3.010327 | |
SQRDMULH (by element, 4S) 1->3 3.005468 | |
SQRDMULH (by element, 4S) throughput 0.566450 | |
SQRDMULH (scalar, H) 1->2 3.004778 | |
SQRDMULH (scalar, H) 1->3 2.998587 | |
SQRDMULH (scalar, H) throughput 0.500910 | |
SQRDMULH (scalar, S) 1->2 3.003889 | |
SQRDMULH (scalar, S) 1->3 3.000767 | |
SQRDMULH (scalar, S) throughput 0.504085 | |
SQRDMULH (vector, 4H) 1->2 2.992858 | |
SQRDMULH (vector, 4H) 1->3 3.015767 | |
SQRDMULH (vector, 4H) throughput 0.549746 | |
SQRDMULH (vector, 8H) 1->2 3.006185 | |
SQRDMULH (vector, 8H) 1->3 2.991169 | |
SQRDMULH (vector, 8H) throughput 0.458436 | |
SQRDMULH (vector, 2S) 1->2 3.008182 | |
SQRDMULH (vector, 2S) 1->3 3.027859 | |
SQRDMULH (vector, 2S) throughput 0.500982 | |
SQRDMULH (vector, 4S) 1->2 3.006791 | |
SQRDMULH (vector, 4S) 1->3 2.996568 | |
SQRDMULH (vector, 4S) throughput 0.513310 | |
SQRSHL (scalar, B) 1->2 2.997212 | |
SQRSHL (scalar, B) 1->3 3.003806 | |
SQRSHL (scalar, B) throughput 0.502941 | |
SQRSHL (scalar, H) 1->2 2.999030 | |
SQRSHL (scalar, H) 1->3 3.005306 | |
SQRSHL (scalar, H) throughput 0.499329 | |
SQRSHL (scalar, S) 1->2 2.997331 | |
SQRSHL (scalar, S) 1->3 3.018343 | |
SQRSHL (scalar, S) throughput 0.502596 | |
SQRSHL (scalar, D) 1->2 2.998705 | |
SQRSHL (scalar, D) 1->3 3.003076 | |
SQRSHL (scalar, D) throughput 0.500071 | |
SQRSHL (vector, 8B) 1->2 3.003153 | |
SQRSHL (vector, 8B) 1->3 2.998908 | |
SQRSHL (vector, 8B) throughput 0.501647 | |
SQRSHL (vector, 16B) 1->2 2.994152 | |
SQRSHL (vector, 16B) 1->3 2.959500 | |
SQRSHL (vector, 16B) throughput 0.502137 | |
SQRSHL (vector, 4H) 1->2 3.010105 | |
SQRSHL (vector, 4H) 1->3 2.999029 | |
SQRSHL (vector, 4H) throughput 0.527158 | |
SQRSHL (vector, 8H) 1->2 2.998867 | |
SQRSHL (vector, 8H) 1->3 3.003509 | |
SQRSHL (vector, 8H) throughput 0.493901 | |
SQRSHL (vector, 2S) 1->2 2.997212 | |
SQRSHL (vector, 2S) 1->3 2.992458 | |
SQRSHL (vector, 2S) throughput 0.499939 | |
SQRSHL (vector, 4S) 1->2 4.260130 | |
SQRSHL (vector, 4S) 1->3 2.994592 | |
SQRSHL (vector, 4S) throughput 0.529662 | |
SQRSHL (vector, 2D) 1->2 2.934866 | |
SQRSHL (vector, 2D) 1->3 3.007734 | |
SQRSHL (vector, 2D) throughput 0.500106 | |
SQRSHRN (H) 1->2 2.998666 | |
SQRSHRN (H) throughput 0.499641 | |
SQRSHRN (S) 1->2 2.998949 | |
SQRSHRN (S) throughput 0.500439 | |
SQRSHRN (D) 1->2 3.010974 | |
SQRSHRN (D) throughput 0.546988 | |
SQRSHRN (8H) 1->2 3.888693 | |
SQRSHRN (8H) throughput 0.500157 | |
SQRSHRN (4S) 1->2 3.004079 | |
SQRSHRN (4S) throughput 0.500658 | |
SQRSHRN (2D) 1->2 2.998384 | |
SQRSHRN (2D) throughput 0.504177 | |
SQRSHRN2 (8H) 1->1 3.065678 | |
SQRSHRN2 (8H) 1->2 3.012512 | |
SQRSHRN2 (8H) throughput 0.501463 | |
SQRSHRN2 (8H) throughput 0.505671 | |
SQRSHRN2 (4S) 1->1 3.003928 | |
SQRSHRN2 (4S) 1->2 3.002771 | |
SQRSHRN2 (4S) throughput 0.503267 | |
SQRSHRN2 (4S) throughput 0.501387 | |
SQRSHRN2 (2D) 1->1 2.995481 | |
SQRSHRN2 (2D) 1->2 3.040777 | |
SQRSHRN2 (2D) throughput 0.506258 | |
SQRSHRN2 (2D) throughput 0.504319 | |
SQRSHRUN (H) 1->2 3.033688 | |
SQRSHRUN (H) throughput 0.500688 | |
SQRSHRUN (S) 1->2 3.033933 | |
SQRSHRUN (S) throughput 0.500952 | |
SQRSHRUN (D) 1->2 3.022434 | |
SQRSHRUN (D) throughput 0.544000 | |
SQRSHRUN (8H) 1->2 3.185138 | |
SQRSHRUN (8H) throughput 0.519237 | |
SQRSHRUN (4S) 1->2 3.006963 | |
SQRSHRUN (4S) throughput 0.505507 | |
SQRSHRUN (2D) 1->2 2.997414 | |
SQRSHRUN (2D) throughput 0.513439 | |
SQRSHRUN2 (8H) 1->1 2.995600 | |
SQRSHRUN2 (8H) 1->2 2.999959 | |
SQRSHRUN2 (8H) throughput 0.513489 | |
SQRSHRUN2 (8H) throughput 0.508692 | |
SQRSHRUN2 (4S) 1->1 2.997089 | |
SQRSHRUN2 (4S) 1->2 2.996284 | |
SQRSHRUN2 (4S) throughput 0.503411 | |
SQRSHRUN2 (4S) throughput 0.499205 | |
SQRSHRUN2 (2D) 1->1 3.006195 | |
SQRSHRUN2 (2D) 1->2 3.338471 | |
SQRSHRUN2 (2D) throughput 0.507309 | |
SQRSHRUN2 (2D) throughput 0.501063 | |
SQSHL (immediate, scalar, B) 1->2 2.001579 | |
SQSHL (immediate, scalar, B) throughput 0.581757 | |
SQSHL (immediate, scalar, H) 1->2 2.000406 | |
SQSHL (immediate, scalar, H) throughput 0.504105 | |
SQSHL (immediate, scalar, S) 1->2 2.004332 | |
SQSHL (immediate, scalar, S) throughput 0.508069 | |
SQSHL (immediate, scalar, D) 1->2 2.022099 | |
SQSHL (immediate, scalar, D) throughput 0.501779 | |
SQSHL (immediate, vector, 8B) 1->2 2.045715 | |
SQSHL (immediate, vector, 8B) throughput 0.507496 | |
SQSHL (immediate, vector, 16B) 1->2 1.996849 | |
SQSHL (immediate, vector, 16B) throughput 0.504752 | |
SQSHL (immediate, vector, 4H) 1->2 1.958491 | |
SQSHL (immediate, vector, 4H) throughput 0.475284 | |
SQSHL (immediate, vector, 8H) 1->2 1.924322 | |
SQSHL (immediate, vector, 8H) throughput 0.499945 | |
SQSHL (immediate, vector, 2S) 1->2 2.006104 | |
SQSHL (immediate, vector, 2S) throughput 0.499651 | |
SQSHL (immediate, vector, 4S) 1->2 1.999071 | |
SQSHL (immediate, vector, 4S) throughput 0.513914 | |
SQSHL (immediate, vector, 2D) 1->2 2.003401 | |
SQSHL (immediate, vector, 2D) throughput 0.501179 | |
SQSHL (scalar, B) 1->2 1.966152 | |
SQSHL (scalar, B) 1->3 1.995559 | |
SQSHL (scalar, B) throughput 0.501056 | |
SQSHL (scalar, H) 1->2 2.070208 | |
SQSHL (scalar, H) 1->3 2.067836 | |
SQSHL (scalar, H) throughput 0.473409 | |
SQSHL (scalar, S) 1->2 2.001609 | |
SQSHL (scalar, S) 1->3 2.003694 | |
SQSHL (scalar, S) throughput 0.546499 | |
SQSHL (scalar, D) 1->2 2.006682 | |
SQSHL (scalar, D) 1->3 2.005103 | |
SQSHL (scalar, D) throughput 0.499636 | |
SQSHL (vector, 8B) 1->2 1.998302 | |
SQSHL (vector, 8B) 1->3 2.002590 | |
SQSHL (vector, 8B) throughput 0.505026 | |
SQSHL (vector, 16B) 1->2 2.001416 | |
SQSHL (vector, 16B) 1->3 2.001658 | |
SQSHL (vector, 16B) throughput 0.500805 | |
SQSHL (vector, 4H) 1->2 1.990189 | |
SQSHL (vector, 4H) 1->3 1.999433 | |
SQSHL (vector, 4H) throughput 0.529579 | |
SQSHL (vector, 8H) 1->2 1.998546 | |
SQSHL (vector, 8H) 1->3 1.999193 | |
SQSHL (vector, 8H) throughput 0.597052 | |
SQSHL (vector, 2S) 1->2 2.001546 | |
SQSHL (vector, 2S) 1->3 1.975511 | |
SQSHL (vector, 2S) throughput 0.517262 | |
SQSHL (vector, 4S) 1->2 2.002390 | |
SQSHL (vector, 4S) 1->3 2.003684 | |
SQSHL (vector, 4S) throughput 0.500851 | |
SQSHL (vector, 2D) 1->2 2.003402 | |
SQSHL (vector, 2D) 1->3 2.000243 | |
SQSHL (vector, 2D) throughput 0.506491 | |
SQSHLU (scalar, B) 1->2 2.005021 | |
SQSHLU (scalar, B) throughput 0.558506 | |
SQSHLU (scalar, H) 1->2 1.946845 | |
SQSHLU (scalar, H) throughput 0.501963 | |
SQSHLU (scalar, S) 1->2 2.013523 | |
SQSHLU (scalar, S) throughput 0.499652 | |
SQSHLU (scalar, D) 1->2 1.996244 | |
SQSHLU (scalar, D) throughput 0.507304 | |
SQSHLU (vector, 8B) 1->2 2.007560 | |
SQSHLU (vector, 8B) throughput 0.500947 | |
SQSHLU (vector, 16B) 1->2 2.004050 | |
SQSHLU (vector, 16B) throughput 0.501488 | |
SQSHLU (vector, 4H) 1->2 2.004009 | |
SQSHLU (vector, 4H) throughput 0.500957 | |
SQSHLU (vector, 8H) 1->2 2.003806 | |
SQSHLU (vector, 8H) throughput 0.501832 | |
SQSHLU (vector, 2S) 1->2 2.002956 | |
SQSHLU (vector, 2S) throughput 0.502141 | |
SQSHLU (vector, 4S) 1->2 2.002025 | |
SQSHLU (vector, 4S) throughput 0.501807 | |
SQSHLU (vector, 2D) 1->2 2.002670 | |
SQSHLU (vector, 2D) throughput 0.501908 | |
SQSHRN (H) 1->2 3.002505 | |
SQSHRN (H) throughput 0.500192 | |
SQSHRN (S) 1->2 2.999718 | |
SQSHRN (S) throughput 0.504466 | |
SQSHRN (D) 1->2 3.014901 | |
SQSHRN (D) throughput 0.500673 | |
SQSHRN (8H) 1->2 3.004859 | |
SQSHRN (8H) throughput 0.500081 | |
SQSHRN (4S) 1->2 3.003968 | |
SQSHRN (4S) throughput 0.499925 | |
SQSHRN (2D) 1->2 3.004373 | |
SQSHRN (2D) throughput 0.520810 | |
SQSHRN2 (8H) 1->1 3.053101 | |
SQSHRN2 (8H) 1->2 2.998788 | |
SQSHRN2 (8H) throughput 0.501544 | |
SQSHRN2 (8H) throughput 0.506965 | |
SQSHRN2 (4S) 1->1 3.562566 | |
SQSHRN2 (4S) 1->2 3.062928 | |
SQSHRN2 (4S) throughput 0.507279 | |
SQSHRN2 (4S) throughput 0.449896 | |
SQSHRN2 (2D) 1->1 3.006561 | |
SQSHRN2 (2D) 1->2 2.998746 | |
SQSHRN2 (2D) throughput 0.501675 | |
SQSHRN2 (2D) throughput 0.500635 | |
SQSHRUN (H) 1->2 2.997495 | |
SQSHRUN (H) throughput 0.500825 | |
SQSHRUN (S) 1->2 3.007733 | |
SQSHRUN (S) throughput 0.499692 | |
SQSHRUN (D) 1->2 3.001255 | |
SQSHRUN (D) throughput 0.462327 | |
SQSHRUN (8H) 1->2 3.001550 | |
SQSHRUN (8H) throughput 0.514202 | |
SQSHRUN (4S) 1->2 3.017884 | |
SQSHRUN (4S) throughput 0.501276 | |
SQSHRUN (2D) 1->2 2.819247 | |
SQSHRUN (2D) throughput 0.499607 | |
SQSHRUN2 (8H) 1->1 3.002262 | |
SQSHRUN2 (8H) 1->2 2.993182 | |
SQSHRUN2 (8H) throughput 0.546163 | |
SQSHRUN2 (8H) throughput 0.500744 | |
SQSHRUN2 (4S) 1->1 3.059402 | |
SQSHRUN2 (4S) 1->2 3.002915 | |
SQSHRUN2 (4S) throughput 0.460710 | |
SQSHRUN2 (4S) throughput 0.503715 | |
SQSHRUN2 (2D) 1->1 3.003928 | |
SQSHRUN2 (2D) 1->2 2.997452 | |
SQSHRUN2 (2D) throughput 0.501465 | |
SQSHRUN2 (2D) throughput 0.503077 | |
SQSUB (scalar, B) 1->2 3.479109 | |
SQSUB (scalar, B) 1->3 3.039398 | |
SQSUB (scalar, B) throughput 0.496778 | |
SQSUB (scalar, H) 1->2 2.917777 | |
SQSUB (scalar, H) 1->3 3.018289 | |
SQSUB (scalar, H) throughput 0.506563 | |
SQSUB (scalar, S) 1->2 3.006438 | |
SQSUB (scalar, S) 1->3 3.004051 | |
SQSUB (scalar, S) throughput 0.500106 | |
SQSUB (scalar, D) 1->2 3.035833 | |
SQSUB (scalar, D) 1->3 2.993780 | |
SQSUB (scalar, D) throughput 0.500987 | |
SQSUB (vector, 8B) 1->2 2.994675 | |
SQSUB (vector, 8B) 1->3 2.998746 | |
SQSUB (vector, 8B) throughput 0.500668 | |
SQSUB (vector, 16B) 1->2 3.004051 | |
SQSUB (vector, 16B) 1->3 3.004128 | |
SQSUB (vector, 16B) throughput 0.501326 | |
SQSUB (vector, 4H) 1->2 3.021299 | |
SQSUB (vector, 4H) 1->3 3.003805 | |
SQSUB (vector, 4H) throughput 0.500927 | |
SQSUB (vector, 8H) 1->2 3.005508 | |
SQSUB (vector, 8H) 1->3 3.001179 | |
SQSUB (vector, 8H) throughput 0.520061 | |
SQSUB (vector, 2S) 1->2 3.037698 | |
SQSUB (vector, 2S) 1->3 3.015782 | |
SQSUB (vector, 2S) throughput 0.504872 | |
SQSUB (vector, 4S) 1->2 3.006562 | |
SQSUB (vector, 4S) 1->3 3.689516 | |
SQSUB (vector, 4S) throughput 0.497996 | |
SQSUB (vector, 2D) 1->2 3.019276 | |
SQSUB (vector, 2D) 1->3 3.012534 | |
SQSUB (vector, 2D) throughput 0.500789 | |
SQXTN (H) 1->2 3.056994 | |
SQXTN (H) throughput 0.486639 | |
SQXTN (S) 1->2 3.035064 | |
SQXTN (S) throughput 0.500693 | |
SQXTN (D) 1->2 3.052006 | |
SQXTN (D) throughput 0.508721 | |
SQXTN (8H) 1->2 3.006112 | |
SQXTN (8H) throughput 0.490159 | |
SQXTN (4S) 1->2 3.058840 | |
SQXTN (4S) throughput 0.483904 | |
SQXTN (2D) 1->2 3.137458 | |
SQXTN (2D) throughput 0.501248 | |
SQXTN2 (8H) 1->1 2.994751 | |
SQXTN2 (8H) 1->2 3.003726 | |
SQXTN2 (8H) throughput 0.501473 | |
SQXTN2 (8H) throughput 0.502220 | |
SQXTN2 (4S) 1->1 2.999313 | |
SQXTN2 (4S) 1->2 3.002951 | |
SQXTN2 (4S) throughput 0.501544 | |
SQXTN2 (4S) throughput 0.504336 | |
SQXTN2 (2D) 1->1 3.002870 | |
SQXTN2 (2D) 1->2 3.004010 | |
SQXTN2 (2D) throughput 0.500470 | |
SQXTN2 (2D) throughput 0.501084 | |
SQXTUN (H) 1->2 3.006763 | |
SQXTUN (H) throughput 0.501007 | |
SQXTUN (S) 1->2 3.008366 | |
SQXTUN (S) throughput 0.465829 | |
SQXTUN (D) 1->2 2.999960 | |
SQXTUN (D) throughput 0.500070 | |
SQXTUN (8H) 1->2 3.055923 | |
SQXTUN (8H) throughput 0.500217 | |
SQXTUN (4S) 1->2 2.537115 | |
SQXTUN (4S) throughput 0.487003 | |
SQXTUN (2D) 1->2 3.010213 | |
SQXTUN (2D) throughput 0.507268 | |
SQXTUN2 (8H) 1->1 3.004082 | |
SQXTUN2 (8H) 1->2 3.064016 | |
SQXTUN2 (8H) throughput 0.503944 | |
SQXTUN2 (8H) throughput 0.500930 | |
SQXTUN2 (4S) 1->1 2.999432 | |
SQXTUN2 (4S) 1->2 2.991989 | |
SQXTUN2 (4S) throughput 0.501832 | |
SQXTUN2 (4S) throughput 0.501009 | |
SQXTUN2 (2D) 1->1 2.998667 | |
SQXTUN2 (2D) 1->2 3.003564 | |
SQXTUN2 (2D) throughput 0.500733 | |
SQXTUN2 (2D) throughput 0.500245 | |
SRHADD (vector, 8B) 1->2 2.002430 | |
SRHADD (vector, 8B) 1->3 1.996245 | |
SRHADD (vector, 8B) throughput 0.527845 | |
SRHADD (vector, 16B) 1->2 2.001455 | |
SRHADD (vector, 16B) 1->3 2.002509 | |
SRHADD (vector, 16B) throughput 0.500481 | |
SRHADD (vector, 4H) 1->2 1.998302 | |
SRHADD (vector, 4H) 1->3 2.003888 | |
SRHADD (vector, 4H) throughput 0.500132 | |
SRHADD (vector, 8H) 1->2 2.034964 | |
SRHADD (vector, 8H) 1->3 1.999655 | |
SRHADD (vector, 8H) throughput 0.496352 | |
SRHADD (vector, 2S) 1->2 2.011419 | |
SRHADD (vector, 2S) 1->3 2.008199 | |
SRHADD (vector, 2S) throughput 0.486575 | |
SRHADD (vector, 4S) 1->2 2.001778 | |
SRHADD (vector, 4S) 1->3 1.998302 | |
SRHADD (vector, 4S) throughput 0.501200 | |
SRI (vector, 8B) 1->1 2.007837 | |
SRI (vector, 8B) 1->2 1.994514 | |
SRI (vector, 8B) throughput 0.501534 | |
SRI (vector, 8B) throughput 0.499879 | |
SRI (vector, 16B) 1->1 2.001659 | |
SRI (vector, 16B) 1->2 2.003643 | |
SRI (vector, 16B) throughput 0.711092 | |
SRI (vector, 16B) throughput 0.754190 | |
SRI (vector, 4H) 1->1 2.004212 | |
SRI (vector, 4H) 1->2 2.184784 | |
SRI (vector, 4H) throughput 0.500882 | |
SRI (vector, 4H) throughput 0.531105 | |
SRI (vector, 8H) 1->1 2.186553 | |
SRI (vector, 8H) 1->2 2.123729 | |
SRI (vector, 8H) throughput 0.518402 | |
SRI (vector, 8H) throughput 0.505493 | |
SRI (vector, 2S) 1->1 2.006189 | |
SRI (vector, 2S) 1->2 1.907257 | |
SRI (vector, 2S) throughput 0.465679 | |
SRI (vector, 2S) throughput 0.544131 | |
SRI (vector, 4S) 1->1 10.931492 | |
SRI (vector, 4S) 1->2 14.516744 | |
SRI (vector, 4S) throughput 0.508750 | |
SRI (vector, 4S) throughput 0.505805 | |
SRI (vector, 2D) 1->1 2.025453 | |
SRI (vector, 2D) 1->2 2.001374 | |
SRI (vector, 2D) throughput 0.500781 | |
SRI (vector, 2D) throughput 0.500091 | |
SRI (vector, D) 1->1 2.003644 | |
SRI (vector, D) 1->2 2.001539 | |
SRI (vector, D) throughput 0.501739 | |
SRI (vector, D) throughput 0.502198 | |
SRSHL (vector, 8B) 1->2 3.018916 | |
SRSHL (vector, 8B) 1->3 3.868938 | |
SRSHL (vector, 8B) throughput 0.497979 | |
SRSHL (vector, 16B) 1->2 3.034883 | |
SRSHL (vector, 16B) 1->3 3.002868 | |
SRSHL (vector, 16B) throughput 0.508326 | |
SRSHL (vector, 4H) 1->2 3.007155 | |
SRSHL (vector, 4H) 1->3 3.031486 | |
SRSHL (vector, 4H) throughput 0.499748 | |
SRSHL (vector, 8H) 1->2 3.004088 | |
SRSHL (vector, 8H) 1->3 3.000079 | |
SRSHL (vector, 8H) throughput 0.501984 | |
SRSHL (vector, 2S) 1->2 3.055885 | |
SRSHL (vector, 2S) 1->3 3.010813 | |
SRSHL (vector, 2S) throughput 0.501821 | |
SRSHL (vector, 4S) 1->2 3.035877 | |
SRSHL (vector, 4S) 1->3 2.992651 | |
SRSHL (vector, 4S) throughput 0.499924 | |
SRSHL (vector, 2D) 1->2 3.007206 | |
SRSHL (vector, 2D) 1->3 2.999312 | |
SRSHL (vector, 2D) throughput 0.500091 | |
SRSHL (vector, D) 1->2 3.005426 | |
SRSHL (vector, D) 1->3 3.009760 | |
SRSHL (vector, D) throughput 0.500227 | |
SRSHR (vector, 8B) 1->2 3.011845 | |
SRSHR (vector, 8B) throughput 0.501372 | |
SRSHR (vector, 16B) 1->2 3.002796 | |
SRSHR (vector, 16B) throughput 0.500866 | |
SRSHR (vector, 4H) 1->2 3.003765 | |
SRSHR (vector, 4H) throughput 0.500151 | |
SRSHR (vector, 8H) 1->2 3.000605 | |
SRSHR (vector, 8H) throughput 0.501129 | |
SRSHR (vector, 2S) 1->2 3.003969 | |
SRSHR (vector, 2S) throughput 0.500749 | |
SRSHR (vector, 4S) 1->2 2.616290 | |
SRSHR (vector, 4S) throughput 0.506145 | |
SRSHR (vector, 2D) 1->2 3.005104 | |
SRSHR (vector, 2D) throughput 0.499243 | |
SRSHR (vector, D) 1->2 2.998666 | |
SRSHR (vector, D) throughput 0.501806 | |
SRSRA (vector, 8B) 1->1 3.033130 | |
SRSRA (vector, 8B) 1->2 3.109935 | |
SRSRA (vector, 8B) throughput 0.511310 | |
SRSRA (vector, 8B) throughput 0.511036 | |
SRSRA (vector, 16B) 1->1 3.061165 | |
SRSRA (vector, 16B) 1->2 2.990229 | |
SRSRA (vector, 16B) throughput 0.500599 | |
SRSRA (vector, 16B) throughput 0.502277 | |
SRSRA (vector, 4H) 1->1 2.999274 | |
SRSRA (vector, 4H) 1->2 3.005337 | |
SRSRA (vector, 4H) throughput 0.499869 | |
SRSRA (vector, 4H) throughput 0.635998 | |
SRSRA (vector, 8H) 1->1 2.993943 | |
SRSRA (vector, 8H) 1->2 3.004010 | |
SRSRA (vector, 8H) throughput 0.502081 | |
SRSRA (vector, 8H) throughput 0.504531 | |
SRSRA (vector, 2S) 1->1 3.009961 | |
SRSRA (vector, 2S) 1->2 3.005792 | |
SRSRA (vector, 2S) throughput 0.503159 | |
SRSRA (vector, 2S) throughput 0.499235 | |
SRSRA (vector, 4S) 1->1 2.997251 | |
SRSRA (vector, 4S) 1->2 3.039173 | |
SRSRA (vector, 4S) throughput 0.501164 | |
SRSRA (vector, 4S) throughput 0.500768 | |
SRSRA (vector, 2D) 1->1 3.027452 | |
SRSRA (vector, 2D) 1->2 3.001657 | |
SRSRA (vector, 2D) throughput 0.503194 | |
SRSRA (vector, 2D) throughput 0.502951 | |
SRSRA (vector, D) 1->1 3.008625 | |
SRSRA (vector, D) 1->2 3.006600 | |
SRSRA (vector, D) throughput 0.503149 | |
SRSRA (vector, D) throughput 0.507012 | |
SSHL (vector, 8B) 1->2 2.004649 | |
SSHL (vector, 8B) 1->3 2.022005 | |
SSHL (vector, 8B) throughput 0.500693 | |
SSHL (vector, 16B) 1->2 1.996004 | |
SSHL (vector, 16B) 1->3 2.000406 | |
SSHL (vector, 16B) throughput 0.649285 | |
SSHL (vector, 4H) 1->2 1.966226 | |
SSHL (vector, 4H) 1->3 1.790707 | |
SSHL (vector, 4H) throughput 0.500085 | |
SSHL (vector, 8H) 1->2 2.201452 | |
SSHL (vector, 8H) 1->3 2.001886 | |
SSHL (vector, 8H) throughput 0.500476 | |
SSHL (vector, 2S) 1->2 2.001414 | |
SSHL (vector, 2S) 1->3 2.002389 | |
SSHL (vector, 2S) throughput 0.507486 | |
SSHL (vector, 4S) 1->2 2.009136 | |
SSHL (vector, 4S) 1->3 2.000566 | |
SSHL (vector, 4S) throughput 0.500957 | |
SSHL (vector, 2D) 1->2 1.998181 | |
SSHL (vector, 2D) 1->3 2.001378 | |
SSHL (vector, 2D) throughput 0.501248 | |
SSHL (vector, D) 1->2 2.005710 | |
SSHL (vector, D) 1->3 2.001011 | |
SSHL (vector, D) throughput 0.474733 | |
SSHLL (8H) 1->2 1.995962 | |
SSHLL (8H) throughput 0.504129 | |
SSHLL (4S) 1->2 2.002511 | |
SSHLL (4S) throughput 0.503263 | |
SSHLL (2D) 1->2 1.998989 | |
SSHLL (2D) throughput 0.500491 | |
SSHLL2 (8H) 1->2 1.999192 | |
SSHLL2 (8H) throughput 0.503169 | |
SSHLL2 (4S) 1->2 1.999556 | |
SSHLL2 (4S) throughput 0.500658 | |
SSHLL2 (2D) 1->2 2.008643 | |
SSHLL2 (2D) throughput 0.502465 | |
SSHR (vector, 8B) 1->2 2.067298 | |
SSHR (vector, 8B) throughput 0.500162 | |
SSHR (vector, 16B) 1->2 1.921283 | |
SSHR (vector, 16B) throughput 0.500937 | |
SSHR (vector, 4H) 1->2 1.997497 | |
SSHR (vector, 4H) throughput 0.502946 | |
SSHR (vector, 8H) 1->2 2.082301 | |
SSHR (vector, 8H) throughput 0.549584 | |
SSHR (vector, 2S) 1->2 2.004048 | |
SSHR (vector, 2S) throughput 0.501718 | |
SSHR (vector, 4S) 1->2 1.999798 | |
SSHR (vector, 4S) throughput 0.500881 | |
SSHR (vector, 2D) 1->2 2.001498 | |
SSHR (vector, 2D) throughput 0.500663 | |
SSHR (vector, D) 1->2 2.004696 | |
SSHR (vector, D) throughput 0.501124 | |
SSRA (vector, 8B) 1->1 3.005590 | |
SSRA (vector, 8B) 1->2 3.012716 | |
SSRA (vector, 8B) throughput 0.499995 | |
SSRA (vector, 8B) throughput 0.502622 | |
SSRA (vector, 16B) 1->1 3.007085 | |
SSRA (vector, 16B) 1->2 3.000365 | |
SSRA (vector, 16B) throughput 0.505188 | |
SSRA (vector, 16B) throughput 0.500040 | |
SSRA (vector, 4H) 1->1 3.005343 | |
SSRA (vector, 4H) 1->2 3.004172 | |
SSRA (vector, 4H) throughput 0.501270 | |
SSRA (vector, 4H) throughput 0.599157 | |
SSRA (vector, 8H) 1->1 3.028021 | |
SSRA (vector, 8H) 1->2 3.007656 | |
SSRA (vector, 8H) throughput 0.501559 | |
SSRA (vector, 8H) throughput 0.552259 | |
SSRA (vector, 2S) 1->1 3.002941 | |
SSRA (vector, 2S) 1->2 3.002506 | |
SSRA (vector, 2S) throughput 0.505633 | |
SSRA (vector, 2S) throughput 0.507162 | |
SSRA (vector, 4S) 1->1 2.999678 | |
SSRA (vector, 4S) 1->2 3.003969 | |
SSRA (vector, 4S) throughput 0.504049 | |
SSRA (vector, 4S) throughput 0.504373 | |
SSRA (vector, 2D) 1->1 3.078916 | |
SSRA (vector, 2D) 1->2 3.017494 | |
SSRA (vector, 2D) throughput 0.507461 | |
SSRA (vector, 2D) throughput 0.500593 | |
SSRA (vector, D) 1->1 3.034883 | |
SSRA (vector, D) 1->2 2.997292 | |
SSRA (vector, D) throughput 0.501989 | |
SSRA (vector, D) throughput 0.500010 | |
SSUBL (vector, 8H) 1->2 1.996770 | |
SSUBL (vector, 8H) 1->3 1.993630 | |
SSUBL (vector, 8H) throughput 0.500780 | |
SSUBL (vector, 4S) 1->2 1.996286 | |
SSUBL (vector, 4S) 1->3 1.998546 | |
SSUBL (vector, 4S) throughput 0.500096 | |
SSUBL (vector, 2D) 1->2 1.992507 | |
SSUBL (vector, 2D) 1->3 1.998060 | |
SSUBL (vector, 2D) throughput 0.499889 | |
SSUBL2 (vector, 8H) 1->2 2.005992 | |
SSUBL2 (vector, 8H) 1->3 2.003108 | |
SSUBL2 (vector, 8H) throughput 0.500536 | |
SSUBL2 (vector, 4S) 1->2 2.010679 | |
SSUBL2 (vector, 4S) 1->3 2.038265 | |
SSUBL2 (vector, 4S) throughput 0.369727 | |
SSUBL2 (vector, 2D) 1->2 2.003269 | |
SSUBL2 (vector, 2D) 1->3 2.001702 | |
SSUBL2 (vector, 2D) throughput 0.500886 | |
SSUBW (vector, 8H) 1->2 1.998060 | |
SSUBW (vector, 8H) 1->3 2.001618 | |
SSUBW (vector, 8H) throughput 0.505043 | |
SSUBW (vector, 4S) 1->2 1.985226 | |
SSUBW (vector, 4S) 1->3 2.017752 | |
SSUBW (vector, 4S) throughput 0.501812 | |
SSUBW (vector, 2D) 1->2 2.010447 | |
SSUBW (vector, 2D) 1->3 2.022046 | |
SSUBW (vector, 2D) throughput 0.502539 | |
SSUBW2 (vector, 8H) 1->2 2.001740 | |
SSUBW2 (vector, 8H) 1->3 2.001375 | |
SSUBW2 (vector, 8H) throughput 0.739194 | |
SSUBW2 (vector, 4S) 1->2 2.015306 | |
SSUBW2 (vector, 4S) 1->3 1.987937 | |
SSUBW2 (vector, 4S) throughput 0.502058 | |
SSUBW2 (vector, 2D) 1->2 2.006924 | |
SSUBW2 (vector, 2D) 1->3 2.001539 | |
SSUBW2 (vector, 2D) throughput 0.498596 | |
STNP (S) throughput 1.025963 | |
STNP (D) throughput 1.043861 | |
STNP (Q) throughput 2.078856 | |
STNP (signed offset, S) throughput 1.038707 | |
STNP (signed offset, D) throughput 1.041593 | |
STNP (signed offset, Q) throughput 2.090197 | |
STP (S) throughput 0.999960 | |
STP (D) throughput 1.139441 | |
STP (Q) throughput 2.008707 | |
STP (post-index, S) 3->3 1.272286 | |
STP (post-index, S) throughput 1.024868 | |
STP (post-index, D) 3->3 1.322916 | |
STP (post-index, D) throughput 1.014627 | |
STP (post-index, Q) 3->3 2.014026 | |
STP (post-index, Q) throughput 1.990785 | |
STP (pre-index, S) 3->3 1.486040 | |
STP (pre-index, S) throughput 1.007733 | |
STP (pre-index, D) 3->3 1.194301 | |
STP (pre-index, D) throughput 1.104939 | |
STP (pre-index, Q) 3->3 2.037017 | |
STP (pre-index, Q) throughput 2.112572 | |
STP (signed offset, S) throughput 0.950166 | |
STP (signed offset, D) throughput 1.072080 | |
STP (signed offset, Q) throughput 2.012709 | |
STR (S) throughput 1.056329 | |
STR (D) throughput 1.051909 | |
STR (Q) throughput 1.006660 | |
STR (post-index, S) 3->3 1.283754 | |
STR (post-index, S) throughput 0.948258 | |
STR (post-index, D) 3->3 1.221233 | |
STR (post-index, D) throughput 1.210981 | |
STR (post-index, Q) 3->3 1.154575 | |
STR (post-index, Q) throughput 0.598137 | |
STR (pre-index, S) 3->3 1.535841 | |
STR (pre-index, S) throughput 1.019057 | |
STR (pre-index, D) 3->3 1.046766 | |
STR (pre-index, D) throughput 1.056794 | |
STR (pre-index, Q) 3->3 1.472915 | |
STR (pre-index, Q) throughput 0.731031 | |
STR (signed offset, S) throughput 1.053465 | |
STR (signed offset, D) throughput 1.032076 | |
STR (signed offset, Q) throughput 1.022392 | |
STR (register, S) throughput 1.042215 | |
STR (register, D) throughput 1.008838 | |
STR (register, Q) throughput 1.001341 | |
STR (register, uxtw, S) throughput 1.002501 | |
STR (register, uxtw, D) throughput 1.095900 | |
STR (register, uxtw, Q) throughput 1.069675 | |
STR (register, sxtw, S) throughput 0.997972 | |
STR (register, sxtw, D) throughput 0.992167 | |
STR (register, sxtw, Q) throughput 1.278306 | |
STR (register, lsl, S) throughput 1.000283 | |
STR (register, lsl, D) throughput 0.999631 | |
STR (register, lsl, Q) throughput 1.269661 | |
STUR (S) throughput 1.304385 | |
STUR (D) throughput 1.038822 | |
STUR (Q) throughput 1.023398 | |
SUB (vector, 8B) 1->2 1.884181 | |
SUB (vector, 8B) 1->3 2.003527 | |
SUB (vector, 8B) throughput 0.431326 | |
SUB (vector, 16B) 1->2 2.000161 | |
SUB (vector, 16B) 1->3 2.013864 | |
SUB (vector, 16B) throughput 0.503737 | |
SUB (vector, 4H) 1->2 1.790820 | |
SUB (vector, 4H) 1->3 2.054359 | |
SUB (vector, 4H) throughput 0.500081 | |
SUB (vector, 8H) 1->2 1.881353 | |
SUB (vector, 8H) 1->3 2.370348 | |
SUB (vector, 8H) throughput 0.443212 | |
SUB (vector, 2S) 1->2 2.011606 | |
SUB (vector, 2S) 1->3 1.597435 | |
SUB (vector, 2S) throughput 0.502015 | |
SUB (vector, 4S) 1->2 1.997079 | |
SUB (vector, 4S) 1->3 2.006958 | |
SUB (vector, 4S) throughput 0.509589 | |
SUB (vector, 2D) 1->2 2.001537 | |
SUB (vector, 2D) 1->3 2.001619 | |
SUB (vector, 2D) throughput 0.499879 | |
SUB (vector, D) 1->2 2.003887 | |
SUB (vector, D) 1->3 2.002632 | |
SUB (vector, D) throughput 0.633295 | |
SUBHN (vector, 8H) 1->2 4.381085 | |
SUBHN (vector, 8H) 1->3 2.936640 | |
SUBHN (vector, 8H) throughput 0.534330 | |
SUBHN (vector, 4S) 1->2 6.561866 | |
SUBHN (vector, 4S) 1->3 3.026140 | |
SUBHN (vector, 4S) throughput 0.528292 | |
SUBHN (vector, 2D) 1->2 2.633643 | |
SUBHN (vector, 2D) 1->3 3.037886 | |
SUBHN (vector, 2D) throughput 0.534753 | |
SUBHN2 (vector, 8H) 1->1 2.937879 | |
SUBHN2 (vector, 8H) 1->2 3.372501 | |
SUBHN2 (vector, 8H) 1->3 3.031820 | |
SUBHN2 (vector, 8H) throughput 0.519114 | |
SUBHN2 (vector, 8H) throughput 0.501891 | |
SUBHN2 (vector, 4S) 1->1 3.043085 | |
SUBHN2 (vector, 4S) 1->2 3.004639 | |
SUBHN2 (vector, 4S) 1->3 3.009285 | |
SUBHN2 (vector, 4S) throughput 0.477181 | |
SUBHN2 (vector, 4S) throughput 0.503986 | |
SUBHN2 (vector, 2D) 1->1 3.000526 | |
SUBHN2 (vector, 2D) 1->2 3.004292 | |
SUBHN2 (vector, 2D) 1->3 3.005792 | |
SUBHN2 (vector, 2D) throughput 0.501655 | |
SUBHN2 (vector, 2D) throughput 0.502291 | |
SUQADD (scalar, B) 1->1 3.014213 | |
SUQADD (scalar, B) 1->2 3.020005 | |
SUQADD (scalar, B) throughput 0.499380 | |
SUQADD (scalar, B) throughput 0.500921 | |
SUQADD (scalar, H) 1->1 3.006482 | |
SUQADD (scalar, H) 1->2 3.003038 | |
SUQADD (scalar, H) throughput 0.500773 | |
SUQADD (scalar, H) throughput 0.503409 | |
SUQADD (scalar, S) 1->1 3.010488 | |
SUQADD (scalar, S) 1->2 2.998261 | |
SUQADD (scalar, S) throughput 0.500445 | |
SUQADD (scalar, S) throughput 0.607076 | |
SUQADD (scalar, D) 1->1 2.995514 | |
SUQADD (scalar, D) 1->2 2.998545 | |
SUQADD (scalar, D) throughput 0.501564 | |
SUQADD (scalar, D) throughput 0.500741 | |
SUQADD (vector, 8B) 1->1 3.004292 | |
SUQADD (vector, 8B) 1->2 3.009880 | |
SUQADD (vector, 8B) throughput 0.506692 | |
SUQADD (vector, 8B) throughput 0.501252 | |
SUQADD (vector, 16B) 1->1 3.003079 | |
SUQADD (vector, 16B) 1->2 1.968559 | |
SUQADD (vector, 16B) throughput 0.493915 | |
SUQADD (vector, 16B) throughput 0.499827 | |
SUQADD (vector, 4H) 1->1 2.996972 | |
SUQADD (vector, 4H) 1->2 3.004819 | |
SUQADD (vector, 4H) throughput 0.536828 | |
SUQADD (vector, 4H) throughput 0.499889 | |
SUQADD (vector, 8H) 1->1 2.996281 | |
SUQADD (vector, 8H) 1->2 3.004743 | |
SUQADD (vector, 8H) throughput 0.509034 | |
SUQADD (vector, 8H) throughput 0.503601 | |
SUQADD (vector, 2S) 1->1 2.995599 | |
SUQADD (vector, 2S) 1->2 3.003887 | |
SUQADD (vector, 2S) throughput 0.502420 | |
SUQADD (vector, 2S) throughput 0.500083 | |
SUQADD (vector, 4S) 1->1 2.995682 | |
SUQADD (vector, 4S) 1->2 2.993902 | |
SUQADD (vector, 4S) throughput 0.458729 | |
SUQADD (vector, 4S) throughput 0.761102 | |
SUQADD (vector, 2D) 1->1 2.858124 | |
SUQADD (vector, 2D) 1->2 3.452689 | |
SUQADD (vector, 2D) throughput 0.488925 | |
SUQADD (vector, 2D) throughput 0.503596 | |
SXTL (8H) 1->2 2.008300 | |
SXTL (8H) throughput 0.500658 | |
SXTL (4S) 1->2 1.977197 | |
SXTL (4S) throughput 0.502157 | |
SXTL (2D) 1->2 1.989008 | |
SXTL (2D) throughput 0.540235 | |
SXTL2 (8H) 1->2 1.999657 | |
SXTL2 (8H) throughput 0.514695 | |
SXTL2 (4S) 1->2 2.064282 | |
SXTL2 (4S) throughput 0.545762 | |
SXTL2 (2D) 1->2 1.996387 | |
SXTL2 (2D) throughput 0.583660 | |
TBL (single register table, 8B) 1->2 2.025149 | |
TBL (single register table, 8B) 1->3 2.035841 | |
TBL (single register table, 8B) throughput 0.645442 | |
TBL (single register table, 16B) 1->2 2.003846 | |
TBL (single register table, 16B) 1->3 1.953104 | |
TBL (single register table, 16B) throughput 0.501682 | |
TBL (two register table, 8B) 1->2 1.944759 | |
TBL (two register table, 8B) 1->3 1.942964 | |
TBL (two register table, 8B) 1->4 1.990899 | |
TBL (two register table, 8B) throughput 0.503022 | |
TBL (two register table, 16B) 1->2 2.449338 | |
TBL (two register table, 16B) 1->3 2.026723 | |
TBL (two register table, 16B) 1->4 2.027946 | |
TBL (two register table, 16B) throughput 0.546886 | |
TBL (three register table, 8B) 1->2 4.387148 | |
TBL (three register table, 8B) 1->3 4.742329 | |
TBL (three register table, 8B) 1->4 1.743119 | |
TBL (three register table, 8B) 1->5 4.164090 | |
TBL (three register table, 8B) throughput 1.116361 | |
TBL (three register table, 16B) 1->2 3.498044 | |
TBL (three register table, 16B) 1->3 5.012106 | |
TBL (three register table, 16B) 1->4 2.184351 | |
TBL (three register table, 16B) 1->5 4.004693 | |
TBL (three register table, 16B) throughput 1.082302 | |
TBL (four register table, 8B) 1->2 5.270441 | |
TBL (four register table, 8B) 1->3 4.002271 | |
TBL (four register table, 8B) 1->4 4.041704 | |
TBL (four register table, 8B) 1->5 6.698875 | |
TBL (four register table, 8B) 1->6 3.868545 | |
TBL (four register table, 8B) throughput 1.168290 | |
TBL (four register table, 16B) 1->2 4.224936 | |
TBL (four register table, 16B) 1->3 3.988502 | |
TBL (four register table, 16B) 1->4 4.245066 | |
TBL (four register table, 16B) 1->5 5.019683 | |
TBL (four register table, 16B) 1->6 3.940122 | |
TBL (four register table, 16B) throughput 1.726255 | |
TBX (single register table, 8B) 1->1 2.105161 | |
TBX (single register table, 8B) 1->2 2.035628 | |
TBX (single register table, 8B) 1->3 2.354108 | |
TBX (single register table, 8B) throughput 0.372355 | |
TBX (single register table, 8B) throughput 0.543319 | |
TBX (single register table, 16B) 1->1 2.003685 | |
TBX (single register table, 16B) 1->2 2.008868 | |
TBX (single register table, 16B) 1->3 1.995202 | |
TBX (single register table, 16B) throughput 0.581042 | |
TBX (single register table, 16B) throughput 0.570306 | |
TBX (two register table, 8B) 1->1 4.189078 | |
TBX (two register table, 8B) 1->2 4.246723 | |
TBX (two register table, 8B) 1->3 1.991254 | |
TBX (two register table, 8B) 1->4 4.386379 | |
TBX (two register table, 8B) throughput 1.129195 | |
TBX (two register table, 16B) 1->1 4.467096 | |
TBX (two register table, 16B) 1->2 4.558736 | |
TBX (two register table, 16B) 1->3 2.116799 | |
TBX (two register table, 16B) 1->4 3.800919 | |
TBX (two register table, 16B) throughput 1.063570 | |
TBX (three register table, 8B) 1->1 6.387795 | |
TBX (three register table, 8B) 1->2 4.213440 | |
TBX (three register table, 8B) 1->3 2.958374 | |
TBX (three register table, 8B) 1->4 2.989796 | |
TBX (three register table, 8B) 1->5 6.086680 | |
TBX (three register table, 8B) throughput 1.662913 | |
TBX (three register table, 16B) 1->1 6.525186 | |
TBX (three register table, 16B) 1->2 6.559284 | |
TBX (three register table, 16B) 1->3 4.520745 | |
TBX (three register table, 16B) 1->4 2.707402 | |
TBX (three register table, 16B) 1->5 6.332463 | |
TBX (three register table, 16B) throughput 1.735098 | |
TBX (four register table, 8B) 1->1 11.984942 | |
TBX (four register table, 8B) 1->2 8.525770 | |
TBX (four register table, 8B) 1->3 7.290039 | |
TBX (four register table, 8B) 1->4 4.365438 | |
TBX (four register table, 8B) 1->5 2.023570 | |
TBX (four register table, 8B) 1->6 8.652513 | |
TBX (four register table, 8B) throughput 1.933548 | |
TBX (four register table, 16B) 1->1 9.206829 | |
TBX (four register table, 16B) 1->2 8.447791 | |
TBX (four register table, 16B) 1->3 7.034643 | |
TBX (four register table, 16B) 1->4 4.389924 | |
TBX (four register table, 16B) 1->5 2.012877 | |
TBX (four register table, 16B) 1->6 8.522473 | |
TBX (four register table, 16B) throughput 2.084390 | |
TRN1 (vector, 8B) 1->2 2.040816 | |
TRN1 (vector, 8B) 1->3 2.038938 | |
TRN1 (vector, 8B) throughput 0.506167 | |
TRN1 (vector, 16B) 1->2 2.001782 | |
TRN1 (vector, 16B) 1->3 2.001414 | |
TRN1 (vector, 16B) throughput 0.557463 | |
TRN1 (vector, 4H) 1->2 1.927152 | |
TRN1 (vector, 4H) 1->3 1.997783 | |
TRN1 (vector, 4H) throughput 0.394201 | |
TRN1 (vector, 8H) 1->2 1.864094 | |
TRN1 (vector, 8H) 1->3 1.998667 | |
TRN1 (vector, 8H) throughput 0.472379 | |
TRN1 (vector, 2S) 1->2 1.798929 | |
TRN1 (vector, 2S) 1->3 2.114264 | |
TRN1 (vector, 2S) throughput 0.549210 | |
TRN1 (vector, 4S) 1->2 1.758556 | |
TRN1 (vector, 4S) 1->3 1.963672 | |
TRN1 (vector, 4S) throughput 0.529229 | |
TRN1 (vector, 2D) 1->2 2.035978 | |
TRN1 (vector, 2D) 1->3 1.976727 | |
TRN1 (vector, 2D) throughput 0.834853 | |
TRN2 (vector, 8B) 1->2 2.009228 | |
TRN2 (vector, 8B) 1->3 2.205048 | |
TRN2 (vector, 8B) throughput 0.484013 | |
TRN2 (vector, 16B) 1->2 2.030527 | |
TRN2 (vector, 16B) 1->3 1.995024 | |
TRN2 (vector, 16B) throughput 0.539536 | |
TRN2 (vector, 4H) 1->2 2.000162 | |
TRN2 (vector, 4H) 1->3 2.010245 | |
TRN2 (vector, 4H) throughput 0.345918 | |
TRN2 (vector, 8H) 1->2 2.004454 | |
TRN2 (vector, 8H) 1->3 2.009575 | |
TRN2 (vector, 8H) throughput 0.547113 | |
TRN2 (vector, 2S) 1->2 1.997097 | |
TRN2 (vector, 2S) 1->3 2.006387 | |
TRN2 (vector, 2S) throughput 0.499409 | |
TRN2 (vector, 4S) 1->2 2.014498 | |
TRN2 (vector, 4S) 1->3 2.036159 | |
TRN2 (vector, 4S) throughput 0.545007 | |
TRN2 (vector, 2D) 1->2 2.721746 | |
TRN2 (vector, 2D) 1->3 2.001173 | |
TRN2 (vector, 2D) throughput 0.557248 | |
UABA (vector, 8B) 1->1 3.012376 | |
UABA (vector, 8B) 1->2 3.015588 | |
UABA (vector, 8B) 1->3 3.043489 | |
UABA (vector, 8B) throughput 0.584227 | |
UABA (vector, 8B) throughput 0.659806 | |
UABA (vector, 16B) 1->1 3.606709 | |
UABA (vector, 16B) 1->2 2.973085 | |
UABA (vector, 16B) 1->3 3.424069 | |
UABA (vector, 16B) throughput 0.521626 | |
UABA (vector, 16B) throughput 0.505386 | |
UABA (vector, 4H) 1->1 3.004285 | |
UABA (vector, 4H) 1->2 3.004890 | |
UABA (vector, 4H) 1->3 3.004782 | |
UABA (vector, 4H) throughput 0.516618 | |
UABA (vector, 4H) throughput 0.509410 | |
UABA (vector, 8H) 1->1 2.998547 | |
UABA (vector, 8H) 1->2 2.999555 | |
UABA (vector, 8H) 1->3 2.998910 | |
UABA (vector, 8H) throughput 0.499657 | |
UABA (vector, 8H) throughput 0.501855 | |
UABA (vector, 2S) 1->1 2.998988 | |
UABA (vector, 2S) 1->2 3.004938 | |
UABA (vector, 2S) 1->3 3.004981 | |
UABA (vector, 2S) throughput 0.500450 | |
UABA (vector, 2S) throughput 0.545581 | |
UABA (vector, 4S) 1->1 3.000541 | |
UABA (vector, 4S) 1->2 3.020530 | |
UABA (vector, 4S) 1->3 3.041100 | |
UABA (vector, 4S) throughput 0.480946 | |
UABA (vector, 4S) throughput 0.499843 | |
UABAL (vector, 8H) 1->1 2.999192 | |
UABAL (vector, 8H) 1->2 2.999313 | |
UABAL (vector, 8H) 1->3 3.937148 | |
UABAL (vector, 8H) throughput 0.510577 | |
UABAL (vector, 8H) throughput 0.513663 | |
UABAL (vector, 4S) 1->1 3.884978 | |
UABAL (vector, 4S) 1->2 3.001205 | |
UABAL (vector, 4S) 1->3 2.751259 | |
UABAL (vector, 4S) throughput 0.501767 | |
UABAL (vector, 4S) throughput 0.506250 | |
UABAL (vector, 2D) 1->1 3.102301 | |
UABAL (vector, 2D) 1->2 3.013262 | |
UABAL (vector, 2D) 1->3 3.071869 | |
UABAL (vector, 2D) throughput 0.503591 | |
UABAL (vector, 2D) throughput 0.517312 | |
UABAL2 (vector, 8H) 1->1 3.069438 | |
UABAL2 (vector, 8H) 1->2 3.000484 | |
UABAL2 (vector, 8H) 1->3 3.013500 | |
UABAL2 (vector, 8H) throughput 0.501271 | |
UABAL2 (vector, 8H) throughput 0.499413 | |
UABAL2 (vector, 4S) 1->1 3.016508 | |
UABAL2 (vector, 4S) 1->2 2.149381 | |
UABAL2 (vector, 4S) 1->3 2.968712 | |
UABAL2 (vector, 4S) throughput 0.832119 | |
UABAL2 (vector, 4S) throughput 0.521277 | |
UABAL2 (vector, 2D) 1->1 2.955655 | |
UABAL2 (vector, 2D) 1->2 2.866641 | |
UABAL2 (vector, 2D) 1->3 2.743871 | |
UABAL2 (vector, 2D) throughput 0.489441 | |
UABAL2 (vector, 2D) throughput 0.815735 | |
UABD (vector, 8B) 1->2 3.542901 | |
UABD (vector, 8B) 1->3 3.458645 | |
UABD (vector, 8B) throughput 0.736395 | |
UABD (vector, 16B) 1->2 3.777826 | |
UABD (vector, 16B) 1->3 3.014343 | |
UABD (vector, 16B) throughput 0.607980 | |
UABD (vector, 4H) 1->2 2.968354 | |
UABD (vector, 4H) 1->3 3.257936 | |
UABD (vector, 4H) throughput 0.582999 | |
UABD (vector, 8H) 1->2 3.530229 | |
UABD (vector, 8H) 1->3 6.125389 | |
UABD (vector, 8H) throughput 0.546821 | |
UABD (vector, 2S) 1->2 3.091853 | |
UABD (vector, 2S) 1->3 2.688652 | |
UABD (vector, 2S) throughput 0.504279 | |
UABD (vector, 4S) 1->2 3.323197 | |
UABD (vector, 4S) 1->3 3.126994 | |
UABD (vector, 4S) throughput 0.515446 | |
UABDL (vector, 8H) 1->2 2.940565 | |
UABDL (vector, 8H) 1->3 2.894123 | |
UABDL (vector, 8H) throughput 0.525405 | |
UABDL (vector, 4S) 1->2 3.007571 | |
UABDL (vector, 4S) 1->3 3.490585 | |
UABDL (vector, 4S) throughput 0.554998 | |
UABDL (vector, 2D) 1->2 4.685013 | |
UABDL (vector, 2D) 1->3 2.950823 | |
UABDL (vector, 2D) throughput 0.589253 | |
UABDL2 (vector, 8H) 1->2 3.011346 | |
UABDL2 (vector, 8H) 1->3 2.478033 | |
UABDL2 (vector, 8H) throughput 0.456040 | |
UABDL2 (vector, 4S) 1->2 2.384823 | |
UABDL2 (vector, 4S) 1->3 2.737678 | |
UABDL2 (vector, 4S) throughput 0.499366 | |
UABDL2 (vector, 2D) 1->2 3.006923 | |
UABDL2 (vector, 2D) 1->3 4.055362 | |
UABDL2 (vector, 2D) throughput 0.436354 | |
UADALP (vector, 4H) 1->1 3.005983 | |
UADALP (vector, 4H) 1->2 2.897276 | |
UADALP (vector, 4H) throughput 0.507710 | |
UADALP (vector, 4H) throughput 0.511346 | |
UADALP (vector, 8H) 1->1 3.007908 | |
UADALP (vector, 8H) 1->2 3.022958 | |
UADALP (vector, 8H) throughput 0.511524 | |
UADALP (vector, 8H) throughput 0.610777 | |
UADALP (vector, 2S) 1->1 2.961537 | |
UADALP (vector, 2S) 1->2 2.992182 | |
UADALP (vector, 2S) throughput 0.504693 | |
UADALP (vector, 2S) throughput 0.495274 | |
UADALP (vector, 4S) 1->1 2.994225 | |
UADALP (vector, 4S) 1->2 3.008910 | |
UADALP (vector, 4S) throughput 0.503443 | |
UADALP (vector, 4S) throughput 0.498574 | |
UADALP (vector, 1D) 1->1 3.021862 | |
UADALP (vector, 1D) 1->2 2.347373 | |
UADALP (vector, 1D) throughput 0.559677 | |
UADALP (vector, 1D) throughput 0.697949 | |
UADALP (vector, 2D) 1->1 3.150477 | |
UADALP (vector, 2D) 1->2 3.185844 | |
UADALP (vector, 2D) throughput 0.500667 | |
UADALP (vector, 2D) throughput 0.508501 | |
UADDL (vector, 8H) 1->2 2.008423 | |
UADDL (vector, 8H) 1->3 1.998180 | |
UADDL (vector, 8H) throughput 0.499430 | |
UADDL (vector, 4S) 1->2 1.986666 | |
UADDL (vector, 4S) 1->3 1.927467 | |
UADDL (vector, 4S) throughput 0.207887 | |
UADDL (vector, 2D) 1->2 4.468753 | |
UADDL (vector, 2D) 1->3 2.010923 | |
UADDL (vector, 2D) throughput 0.596283 | |
UADDL2 (vector, 8H) 1->2 2.069321 | |
UADDL2 (vector, 8H) 1->3 1.957000 | |
UADDL2 (vector, 8H) throughput 0.628416 | |
UADDL2 (vector, 4S) 1->2 2.301002 | |
UADDL2 (vector, 4S) 1->3 2.038065 | |
UADDL2 (vector, 4S) throughput 0.515035 | |
UADDL2 (vector, 2D) 1->2 2.371403 | |
UADDL2 (vector, 2D) 1->3 1.990831 | |
UADDL2 (vector, 2D) throughput 0.562891 | |
UADDLP (vector, 4H) 1->2 1.602521 | |
UADDLP (vector, 4H) throughput 0.790884 | |
UADDLP (vector, 8H) 1->2 2.007051 | |
UADDLP (vector, 8H) throughput 0.873410 | |
UADDLP (vector, 2S) 1->2 1.917857 | |
UADDLP (vector, 2S) throughput 0.443352 | |
UADDLP (vector, 4S) 1->2 2.044721 | |
UADDLP (vector, 4S) throughput 0.518982 | |
UADDLP (vector, 1D) 1->2 1.997618 | |
UADDLP (vector, 1D) throughput 0.500838 | |
UADDLP (vector, 2D) 1->2 3.511391 | |
UADDLP (vector, 2D) throughput 0.687573 | |
UADDLV (vector, 8B) 1->2 2.841930 | |
UADDLV (vector, 8B) throughput 0.609047 | |
UADDLV (vector, 16B) 1->2 2.831086 | |
UADDLV (vector, 16B) throughput 0.463608 | |
UADDLV (vector, 4H) 1->2 2.854879 | |
UADDLV (vector, 4H) throughput 0.560475 | |
UADDLV (vector, 8H) 1->2 3.003847 | |
UADDLV (vector, 8H) throughput 0.499995 | |
UADDLV (vector, 4S) 1->2 3.012268 | |
UADDLV (vector, 4S) throughput 0.507081 | |
UADDW (vector, 8H) 1->2 2.000041 | |
UADDW (vector, 8H) 1->3 2.004939 | |
UADDW (vector, 8H) throughput 0.501410 | |
UADDW (vector, 4S) 1->2 2.001418 | |
UADDW (vector, 4S) 1->3 1.992226 | |
UADDW (vector, 4S) throughput 0.499940 | |
UADDW (vector, 2D) 1->2 2.026859 | |
UADDW (vector, 2D) 1->3 2.001660 | |
UADDW (vector, 2D) throughput 0.500916 | |
UADDW2 (vector, 8H) 1->2 2.002429 | |
UADDW2 (vector, 8H) 1->3 2.007305 | |
UADDW2 (vector, 8H) throughput 0.506165 | |
UADDW2 (vector, 4S) 1->2 2.019518 | |
UADDW2 (vector, 4S) 1->3 1.737755 | |
UADDW2 (vector, 4S) throughput 0.531340 | |
UADDW2 (vector, 2D) 1->2 2.004495 | |
UADDW2 (vector, 2D) 1->3 2.434771 | |
UADDW2 (vector, 2D) throughput 0.504535 | |
UCVTF (scalar, fixed-point, S from 32-bit GPR) 1->2 roundtrip 10.457200 | |
UCVTF (scalar, fixed-point, S from 32-bit GPR) throughput 0.698316 | |
UCVTF (scalar, fixed-point, D from 32-bit GPR) 1->2 roundtrip 7.191101 | |
UCVTF (scalar, fixed-point, D from 32-bit GPR) throughput 0.512621 | |
UCVTF (scalar, fixed-point, S from 64-bit GPR) 1->2 roundtrip 11.849162 | |
UCVTF (scalar, fixed-point, S from 64-bit GPR) throughput 0.619549 | |
UCVTF (scalar, fixed-point, D from 64-bit GPR) 1->2 roundtrip 10.648273 | |
UCVTF (scalar, fixed-point, D from 64-bit GPR) throughput 0.566978 | |
UCVTF (scalar, integer, S from 32-bit GPR) 1->2 roundtrip 10.610343 | |
UCVTF (scalar, integer, S from 32-bit GPR) throughput 0.528529 | |
UCVTF (scalar, integer, D from 32-bit GPR) 1->2 roundtrip 10.524389 | |
UCVTF (scalar, integer, D from 32-bit GPR) throughput 0.556105 | |
UCVTF (scalar, integer, S from 64-bit GPR) 1->2 roundtrip 9.621484 | |
UCVTF (scalar, integer, S from 64-bit GPR) throughput 0.432386 | |
UCVTF (scalar, integer, D from 64-bit GPR) 1->2 roundtrip 10.426746 | |
UCVTF (scalar, integer, D from 64-bit GPR) throughput 0.475069 | |
UCVTF (scalar, fixed-point, S from S) 1->2 2.998061 | |
UCVTF (scalar, fixed-point, S from S) throughput 0.466083 | |
UCVTF (scalar, fixed-point, D from D) 1->2 3.176450 | |
UCVTF (scalar, fixed-point, D from D) throughput 0.436402 | |
UCVTF (vector, fixed-point, 2S from 2S) 1->2 3.280614 | |
UCVTF (vector, fixed-point, 2S from 2S) throughput 0.503234 | |
UCVTF (vector, fixed-point, 4S from 4S) 1->2 3.073526 | |
UCVTF (vector, fixed-point, 4S from 4S) throughput 0.506796 | |
UCVTF (vector, fixed-point, 2D from 2D) 1->2 2.747564 | |
UCVTF (vector, fixed-point, 2D from 2D) throughput 0.500288 | |
UCVTF (scalar, integer, S from S) 1->2 3.064312 | |
UCVTF (scalar, integer, S from S) throughput 0.555895 | |
UCVTF (scalar, integer, D from D) 1->2 2.887607 | |
UCVTF (scalar, integer, D from D) throughput 0.679989 | |
UCVTF (vector, integer, 2S from 2S) 1->2 3.152348 | |
UCVTF (vector, integer, 2S from 2S) throughput 0.540997 | |
UCVTF (vector, integer, 4S from 4S) 1->2 3.006375 | |
UCVTF (vector, integer, 4S from 4S) throughput 0.506009 | |
UCVTF (vector, integer, 2D from 2D) 1->2 3.224033 | |
UCVTF (vector, integer, 2D from 2D) throughput 0.461208 | |
UHADD (vector, 8B) 1->2 1.822367 | |
UHADD (vector, 8B) 1->3 2.006196 | |
UHADD (vector, 8B) throughput 0.510822 | |
UHADD (vector, 16B) 1->2 2.101755 | |
UHADD (vector, 16B) 1->3 2.019476 | |
UHADD (vector, 16B) throughput 0.707074 | |
UHADD (vector, 4H) 1->2 2.001859 | |
UHADD (vector, 4H) 1->3 2.010407 | |
UHADD (vector, 4H) throughput 0.557679 | |
UHADD (vector, 8H) 1->2 2.113739 | |
UHADD (vector, 8H) 1->3 1.977806 | |
UHADD (vector, 8H) throughput 0.548886 | |
UHADD (vector, 2S) 1->2 2.124150 | |
UHADD (vector, 2S) 1->3 2.000000 | |
UHADD (vector, 2S) throughput 0.530922 | |
UHADD (vector, 4S) 1->2 2.019514 | |
UHADD (vector, 4S) 1->3 2.091906 | |
UHADD (vector, 4S) throughput 0.838553 | |
UHSUB (vector, 8B) 1->2 1.995457 | |
UHSUB (vector, 8B) 1->3 1.677583 | |
UHSUB (vector, 8B) throughput 0.519878 | |
UHSUB (vector, 16B) 1->2 2.002910 | |
UHSUB (vector, 16B) 1->3 1.965772 | |
UHSUB (vector, 16B) throughput 0.443214 | |
UHSUB (vector, 4H) 1->2 2.012813 | |
UHSUB (vector, 4H) 1->3 2.195572 | |
UHSUB (vector, 4H) throughput 0.811867 | |
UHSUB (vector, 8H) 1->2 2.010722 | |
UHSUB (vector, 8H) 1->3 2.004332 | |
UHSUB (vector, 8H) throughput 0.503533 | |
UHSUB (vector, 2S) 1->2 1.985147 | |
UHSUB (vector, 2S) 1->3 2.002343 | |
UHSUB (vector, 2S) throughput 0.499525 | |
UHSUB (vector, 4S) 1->2 1.996174 | |
UHSUB (vector, 4S) 1->3 2.006441 | |
UHSUB (vector, 4S) throughput 0.500719 | |
UMAX (vector, 8B) 1->2 2.246729 | |
UMAX (vector, 8B) 1->3 2.016166 | |
UMAX (vector, 8B) throughput 0.500709 | |
UMAX (vector, 16B) 1->2 2.034838 | |
UMAX (vector, 16B) 1->3 2.002227 | |
UMAX (vector, 16B) throughput 0.806014 | |
UMAX (vector, 4H) 1->2 2.485459 | |
UMAX (vector, 4H) 1->3 2.042385 | |
UMAX (vector, 4H) throughput 0.500302 | |
UMAX (vector, 8H) 1->2 1.998223 | |
UMAX (vector, 8H) 1->3 2.003523 | |
UMAX (vector, 8H) throughput 0.550813 | |
UMAX (vector, 2S) 1->2 2.001579 | |
UMAX (vector, 2S) 1->3 1.995321 | |
UMAX (vector, 2S) throughput 0.500653 | |
UMAX (vector, 4S) 1->2 2.003945 | |
UMAX (vector, 4S) 1->3 2.484453 | |
UMAX (vector, 4S) throughput 0.542465 | |
UMAXP (vector, 8B) 1->2 1.995252 | |
UMAXP (vector, 8B) 1->3 2.011448 | |
UMAXP (vector, 8B) throughput 0.498453 | |
UMAXP (vector, 16B) 1->2 2.000404 | |
UMAXP (vector, 16B) 1->3 2.129050 | |
UMAXP (vector, 16B) throughput 0.522371 | |
UMAXP (vector, 4H) 1->2 2.040910 | |
UMAXP (vector, 4H) 1->3 2.008082 | |
UMAXP (vector, 4H) throughput 0.433786 | |
UMAXP (vector, 8H) 1->2 1.998452 | |
UMAXP (vector, 8H) 1->3 2.001662 | |
UMAXP (vector, 8H) throughput 0.483485 | |
UMAXP (vector, 2S) 1->2 1.775554 | |
UMAXP (vector, 2S) 1->3 2.007881 | |
UMAXP (vector, 2S) throughput 0.501685 | |
UMAXP (vector, 4S) 1->2 2.003274 | |
UMAXP (vector, 4S) 1->3 2.004558 | |
UMAXP (vector, 4S) throughput 0.491161 | |
UMAXV (vector, 8B) 1->2 3.412627 | |
UMAXV (vector, 8B) throughput 0.675986 | |
UMAXV (vector, 16B) 1->2 3.030833 | |
UMAXV (vector, 16B) throughput 0.502907 | |
UMAXV (vector, 4H) 1->2 3.019840 | |
UMAXV (vector, 4H) throughput 0.499808 | |
UMAXV (vector, 8H) 1->2 3.004698 | |
UMAXV (vector, 8H) throughput 0.500673 | |
UMAXV (vector, 4S) 1->2 3.003847 | |
UMAXV (vector, 4S) throughput 0.509343 | |
UMIN (vector, 8B) 1->2 2.001459 | |
UMIN (vector, 8B) 1->3 2.002390 | |
UMIN (vector, 8B) throughput 0.497056 | |
UMIN (vector, 16B) 1->2 1.999401 | |
UMIN (vector, 16B) 1->3 2.000554 | |
UMIN (vector, 16B) throughput 0.511671 | |
UMIN (vector, 4H) 1->2 2.497732 | |
UMIN (vector, 4H) 1->3 2.454370 | |
UMIN (vector, 4H) throughput 0.502363 | |
UMIN (vector, 8H) 1->2 2.001539 | |
UMIN (vector, 8H) 1->3 2.007289 | |
UMIN (vector, 8H) throughput 0.504434 | |
UMIN (vector, 2S) 1->2 1.996405 | |
UMIN (vector, 2S) 1->3 2.005142 | |
UMIN (vector, 2S) throughput 0.499692 | |
UMIN (vector, 4S) 1->2 1.997416 | |
UMIN (vector, 4S) 1->3 1.994159 | |
UMIN (vector, 4S) throughput 0.500925 | |
UMINP (vector, 8B) 1->2 2.002469 | |
UMINP (vector, 8B) 1->3 1.999192 | |
UMINP (vector, 8B) throughput 0.506676 | |
UMINP (vector, 16B) 1->2 2.024820 | |
UMINP (vector, 16B) 1->3 2.003846 | |
UMINP (vector, 16B) throughput 0.428618 | |
UMINP (vector, 4H) 1->2 2.009754 | |
UMINP (vector, 4H) 1->3 1.971866 | |
UMINP (vector, 4H) throughput 0.497687 | |
UMINP (vector, 8H) 1->2 1.997458 | |
UMINP (vector, 8H) 1->3 2.004089 | |
UMINP (vector, 8H) throughput 0.501812 | |
UMINP (vector, 2S) 1->2 2.103062 | |
UMINP (vector, 2S) 1->3 2.001458 | |
UMINP (vector, 2S) throughput 0.551793 | |
UMINP (vector, 4S) 1->2 1.991057 | |
UMINP (vector, 4S) 1->3 1.993393 | |
UMINP (vector, 4S) throughput 0.500491 | |
UMINV (vector, 8B) 1->2 3.000484 | |
UMINV (vector, 8B) throughput 0.509009 | |
UMINV (vector, 16B) 1->2 3.006317 | |
UMINV (vector, 16B) throughput 0.500971 | |
UMINV (vector, 4H) 1->2 3.004088 | |
UMINV (vector, 4H) throughput 0.499263 | |
UMINV (vector, 8H) 1->2 3.003928 | |
UMINV (vector, 8H) throughput 0.509704 | |
UMINV (vector, 4S) 1->2 3.005669 | |
UMINV (vector, 4S) throughput 0.501567 | |
UMLAL (by element, 4S) 1->1 2.999839 | |
UMLAL (by element, 4S) 1->2 3.654811 | |
UMLAL (by element, 4S) 1->3 2.994952 | |
UMLAL (by element, 4S) throughput 0.501321 | |
UMLAL (by element, 4S) throughput 0.501198 | |
UMLAL (by element, 2D) 1->1 3.004009 | |
UMLAL (by element, 2D) 1->2 3.004371 | |
UMLAL (by element, 2D) 1->3 2.998261 | |
UMLAL (by element, 2D) throughput 0.501119 | |
UMLAL (by element, 2D) throughput 0.500744 | |
UMLAL2 (by element, 4S) 1->1 3.008909 | |
UMLAL2 (by element, 4S) 1->2 3.005014 | |
UMLAL2 (by element, 4S) 1->3 3.003887 | |
UMLAL2 (by element, 4S) throughput 0.495833 | |
UMLAL2 (by element, 4S) throughput 0.525928 | |
UMLAL2 (by element, 2D) 1->1 2.993212 | |
UMLAL2 (by element, 2D) 1->2 2.993753 | |
UMLAL2 (by element, 2D) 1->3 2.958075 | |
UMLAL2 (by element, 2D) throughput 0.502463 | |
UMLAL2 (by element, 2D) throughput 0.499606 | |
UMLAL (vector, 8H) 1->1 3.006318 | |
UMLAL (vector, 8H) 1->2 3.004172 | |
UMLAL (vector, 8H) 1->3 2.995597 | |
UMLAL (vector, 8H) throughput 0.502310 | |
UMLAL (vector, 8H) throughput 0.501108 | |
UMLAL (vector, 4S) 1->1 3.005658 | |
UMLAL (vector, 4S) 1->2 3.006318 | |
UMLAL (vector, 4S) 1->3 2.998707 | |
UMLAL (vector, 4S) throughput 0.501954 | |
UMLAL (vector, 4S) throughput 0.500863 | |
UMLAL (vector, 2D) 1->1 2.998503 | |
UMLAL (vector, 2D) 1->2 3.003765 | |
UMLAL (vector, 2D) 1->3 3.003059 | |
UMLAL (vector, 2D) throughput 0.506492 | |
UMLAL (vector, 2D) throughput 0.547277 | |
UMLAL2 (vector, 8H) 1->1 3.006034 | |
UMLAL2 (vector, 8H) 1->2 3.003969 | |
UMLAL2 (vector, 8H) 1->3 3.004293 | |
UMLAL2 (vector, 8H) throughput 0.501139 | |
UMLAL2 (vector, 8H) throughput 0.500333 | |
UMLAL2 (vector, 4S) 1->1 2.944956 | |
UMLAL2 (vector, 4S) 1->2 2.999759 | |
UMLAL2 (vector, 4S) 1->3 3.021586 | |
UMLAL2 (vector, 4S) throughput 0.513671 | |
UMLAL2 (vector, 4S) throughput 0.500454 | |
UMLAL2 (vector, 2D) 1->1 3.117640 | |
UMLAL2 (vector, 2D) 1->2 2.989494 | |
UMLAL2 (vector, 2D) 1->3 2.995324 | |
UMLAL2 (vector, 2D) throughput 0.502070 | |
UMLAL2 (vector, 2D) throughput 0.535454 | |
UMLSL (by element, 4S) 1->1 2.998826 | |
UMLSL (by element, 4S) 1->2 2.979390 | |
UMLSL (by element, 4S) 1->3 3.005345 | |
UMLSL (by element, 4S) throughput 0.505578 | |
UMLSL (by element, 4S) throughput 0.504171 | |
UMLSL (by element, 2D) 1->1 3.005872 | |
UMLSL (by element, 2D) 1->2 3.045706 | |
UMLSL (by element, 2D) 1->3 2.999595 | |
UMLSL (by element, 2D) throughput 0.502419 | |
UMLSL (by element, 2D) throughput 0.504071 | |
UMLSL2 (by element, 4S) 1->1 2.999473 | |
UMLSL2 (by element, 4S) 1->2 3.032361 | |
UMLSL2 (by element, 4S) 1->3 3.007817 | |
UMLSL2 (by element, 4S) throughput 0.511140 | |
UMLSL2 (by element, 4S) throughput 0.506101 | |
UMLSL2 (by element, 2D) 1->1 3.008587 | |
UMLSL2 (by element, 2D) 1->2 3.003928 | |
UMLSL2 (by element, 2D) 1->3 3.001823 | |
UMLSL2 (by element, 2D) throughput 0.500258 | |
UMLSL2 (by element, 2D) throughput 0.509506 | |
UMLSL (vector, 8H) 1->1 2.988960 | |
UMLSL (vector, 8H) 1->2 3.516208 | |
UMLSL (vector, 8H) 1->3 2.996323 | |
UMLSL (vector, 8H) throughput 0.500490 | |
UMLSL (vector, 8H) throughput 0.588719 | |
UMLSL (vector, 4S) 1->1 3.000284 | |
UMLSL (vector, 4S) 1->2 3.043092 | |
UMLSL (vector, 4S) 1->3 3.009342 | |
UMLSL (vector, 4S) throughput 0.502667 | |
UMLSL (vector, 4S) throughput 0.500668 | |
UMLSL (vector, 2D) 1->1 3.006399 | |
UMLSL (vector, 2D) 1->2 3.000729 | |
UMLSL (vector, 2D) 1->3 3.000041 | |
UMLSL (vector, 2D) throughput 0.508637 | |
UMLSL (vector, 2D) throughput 0.505424 | |
UMLSL2 (vector, 8H) 1->1 2.982321 | |
UMLSL2 (vector, 8H) 1->2 3.026275 | |
UMLSL2 (vector, 8H) 1->3 3.003522 | |
UMLSL2 (vector, 8H) throughput 0.507018 | |
UMLSL2 (vector, 8H) throughput 0.507179 | |
UMLSL2 (vector, 4S) 1->1 3.033138 | |
UMLSL2 (vector, 4S) 1->2 3.008344 | |
UMLSL2 (vector, 4S) 1->3 3.004212 | |
UMLSL2 (vector, 4S) throughput 0.511546 | |
UMLSL2 (vector, 4S) throughput 0.500382 | |
UMLSL2 (vector, 2D) 1->1 3.003846 | |
UMLSL2 (vector, 2D) 1->2 3.028994 | |
UMLSL2 (vector, 2D) 1->3 2.991364 | |
UMLSL2 (vector, 2D) throughput 0.567365 | |
UMLSL2 (vector, 2D) throughput 0.499688 | |
UMULL (by element, 4S) 1->2 2.999758 | |
UMULL (by element, 4S) 1->3 3.010757 | |
UMULL (by element, 4S) throughput 0.510123 | |
UMULL (by element, 2D) 1->2 3.044865 | |
UMULL (by element, 2D) 1->3 3.000604 | |
UMULL (by element, 2D) throughput 0.500536 | |
UMULL2 (by element, 4S) 1->2 3.011146 | |
UMULL2 (by element, 4S) 1->3 2.997209 | |
UMULL2 (by element, 4S) throughput 0.511687 | |
UMULL2 (by element, 2D) 1->2 2.732818 | |
UMULL2 (by element, 2D) 1->3 2.643800 | |
UMULL2 (by element, 2D) throughput 0.505435 | |
UMULL (vector, 8H) 1->2 3.250536 | |
UMULL (vector, 8H) 1->3 3.234991 | |
UMULL (vector, 8H) throughput 0.500527 | |
UMULL (vector, 4S) 1->2 3.001032 | |
UMULL (vector, 4S) 1->3 3.079452 | |
UMULL (vector, 4S) throughput 0.527473 | |
UMULL (vector, 2D) 1->2 3.001379 | |
UMULL (vector, 2D) 1->3 3.001637 | |
UMULL (vector, 2D) throughput 0.528311 | |
UMULL2 (vector, 8H) 1->2 3.052963 | |
UMULL2 (vector, 8H) 1->3 3.001457 | |
UMULL2 (vector, 8H) throughput 0.501687 | |
UMULL2 (vector, 4S) 1->2 3.168095 | |
UMULL2 (vector, 4S) 1->3 3.113388 | |
UMULL2 (vector, 4S) throughput 0.513803 | |
UMULL2 (vector, 2D) 1->2 3.231967 | |
UMULL2 (vector, 2D) 1->3 3.006792 | |
UMULL2 (vector, 2D) throughput 0.521272 | |
UQADD (scalar, B) 1->2 3.017023 | |
UQADD (scalar, B) 1->3 3.009117 | |
UQADD (scalar, B) throughput 0.500387 | |
UQADD (scalar, H) 1->2 3.001292 | |
UQADD (scalar, H) 1->3 3.355431 | |
UQADD (scalar, H) throughput 0.499081 | |
UQADD (scalar, S) 1->2 3.001723 | |
UQADD (scalar, S) 1->3 2.996384 | |
UQADD (scalar, S) throughput 0.501174 | |
UQADD (scalar, D) 1->2 3.004698 | |
UQADD (scalar, D) 1->3 3.002997 | |
UQADD (scalar, D) throughput 0.500501 | |
UQADD (vector, 8B) 1->2 3.042355 | |
UQADD (vector, 8B) 1->3 3.007143 | |
UQADD (vector, 8B) throughput 0.442692 | |
UQADD (vector, 16B) 1->2 3.203748 | |
UQADD (vector, 16B) 1->3 3.003240 | |
UQADD (vector, 16B) throughput 0.501757 | |
UQADD (vector, 4H) 1->2 3.003078 | |
UQADD (vector, 4H) 1->3 3.004050 | |
UQADD (vector, 4H) throughput 0.501683 | |
UQADD (vector, 8H) 1->2 3.009369 | |
UQADD (vector, 8H) 1->3 3.006356 | |
UQADD (vector, 8H) throughput 0.500719 | |
UQADD (vector, 2S) 1->2 3.010152 | |
UQADD (vector, 2S) 1->3 3.045059 | |
UQADD (vector, 2S) throughput 0.503016 | |
UQADD (vector, 4S) 1->2 3.497136 | |
UQADD (vector, 4S) 1->3 3.036524 | |
UQADD (vector, 4S) throughput 0.501969 | |
UQADD (vector, 2D) 1->2 3.020203 | |
UQADD (vector, 2D) 1->3 3.006276 | |
UQADD (vector, 2D) throughput 0.501473 | |
UQRSHL (scalar, B) 1->2 2.994993 | |
UQRSHL (scalar, B) 1->3 3.028942 | |
UQRSHL (scalar, B) throughput 0.499328 | |
UQRSHL (scalar, H) 1->2 3.005913 | |
UQRSHL (scalar, H) 1->3 2.999392 | |
UQRSHL (scalar, H) throughput 0.505886 | |
UQRSHL (scalar, S) 1->2 3.004048 | |
UQRSHL (scalar, S) 1->3 3.013089 | |
UQRSHL (scalar, S) throughput 0.499495 | |
UQRSHL (scalar, D) 1->2 3.005224 | |
UQRSHL (scalar, D) 1->3 3.005655 | |
UQRSHL (scalar, D) throughput 0.500658 | |
UQRSHL (vector, 8B) 1->2 3.007126 | |
UQRSHL (vector, 8B) 1->3 3.024457 | |
UQRSHL (vector, 8B) throughput 0.503057 | |
UQRSHL (vector, 16B) 1->2 3.005468 | |
UQRSHL (vector, 16B) 1->3 3.008626 | |
UQRSHL (vector, 16B) throughput 0.503547 | |
UQRSHL (vector, 4H) 1->2 2.780224 | |
UQRSHL (vector, 4H) 1->3 3.007546 | |
UQRSHL (vector, 4H) throughput 0.987806 | |
UQRSHL (vector, 8H) 1->2 2.832677 | |
UQRSHL (vector, 8H) 1->3 3.602381 | |
UQRSHL (vector, 8H) throughput 0.574786 | |
UQRSHL (vector, 2S) 1->2 3.007937 | |
UQRSHL (vector, 2S) 1->3 3.538890 | |
UQRSHL (vector, 2S) throughput 0.514295 | |
UQRSHL (vector, 4S) 1->2 3.014361 | |
UQRSHL (vector, 4S) 1->3 2.637247 | |
UQRSHL (vector, 4S) throughput 0.664171 | |
UQRSHL (vector, 2D) 1->2 2.719851 | |
UQRSHL (vector, 2D) 1->3 2.971758 | |
UQRSHL (vector, 2D) throughput 0.521039 | |
UQRSHRN (H) 1->2 3.285268 | |
UQRSHRN (H) throughput 0.514991 | |
UQRSHRN (S) 1->2 3.573747 | |
UQRSHRN (S) throughput 0.525182 | |
UQRSHRN (D) 1->2 3.079974 | |
UQRSHRN (D) throughput 0.508211 | |
UQRSHRN (8H) 1->2 6.921608 | |
UQRSHRN (8H) throughput 0.500784 | |
UQRSHRN (4S) 1->2 8.954140 | |
UQRSHRN (4S) throughput 0.505460 | |
UQRSHRN (2D) 1->2 3.302136 | |
UQRSHRN (2D) throughput 0.607179 | |
UQRSHRN2 (8H) 1->1 2.679829 | |
UQRSHRN2 (8H) 1->2 4.643218 | |
UQRSHRN2 (8H) throughput 0.626300 | |
UQRSHRN2 (8H) throughput 0.612307 | |
UQRSHRN2 (4S) 1->1 2.806391 | |
UQRSHRN2 (4S) 1->2 4.021526 | |
UQRSHRN2 (4S) throughput 0.523523 | |
UQRSHRN2 (4S) throughput 0.507590 | |
UQRSHRN2 (2D) 1->1 3.029720 | |
UQRSHRN2 (2D) 1->2 3.010251 | |
UQRSHRN2 (2D) throughput 0.512515 | |
UQRSHRN2 (2D) throughput 0.506653 | |
UQSHL (immediate, scalar, B) 1->2 2.011004 | |
UQSHL (immediate, scalar, B) throughput 0.501908 | |
UQSHL (immediate, scalar, H) 1->2 2.138286 | |
UQSHL (immediate, scalar, H) throughput 0.504419 | |
UQSHL (immediate, scalar, S) 1->2 2.006277 | |
UQSHL (immediate, scalar, S) throughput 0.537711 | |
UQSHL (immediate, scalar, D) 1->2 2.012642 | |
UQSHL (immediate, scalar, D) throughput 0.491334 | |
UQSHL (immediate, vector, 8B) 1->2 1.910435 | |
UQSHL (immediate, vector, 8B) throughput 0.502025 | |
UQSHL (immediate, vector, 16B) 1->2 2.005506 | |
UQSHL (immediate, vector, 16B) throughput 0.501063 | |
UQSHL (immediate, vector, 4H) 1->2 1.873589 | |
UQSHL (immediate, vector, 4H) throughput 0.500495 | |
UQSHL (immediate, vector, 8H) 1->2 1.999960 | |
UQSHL (immediate, vector, 8H) throughput 0.483492 | |
UQSHL (immediate, vector, 2S) 1->2 1.995321 | |
UQSHL (immediate, vector, 2S) throughput 0.510465 | |
UQSHL (immediate, vector, 4S) 1->2 2.003523 | |
UQSHL (immediate, vector, 4S) throughput 0.500481 | |
UQSHL (immediate, vector, 2D) 1->2 2.189327 | |
UQSHL (immediate, vector, 2D) throughput 0.511200 | |
UQSHL (scalar, B) 1->2 2.005699 | |
UQSHL (scalar, B) 1->3 2.002309 | |
UQSHL (scalar, B) throughput 0.500141 | |
UQSHL (scalar, H) 1->2 1.999029 | |
UQSHL (scalar, H) 1->3 2.001499 | |
UQSHL (scalar, H) throughput 0.502495 | |
UQSHL (scalar, S) 1->2 2.002713 | |
UQSHL (scalar, S) 1->3 2.027615 | |
UQSHL (scalar, S) throughput 0.500989 | |
UQSHL (scalar, D) 1->2 2.011378 | |
UQSHL (scalar, D) 1->3 2.005669 | |
UQSHL (scalar, D) throughput 0.516109 | |
UQSHL (vector, 8B) 1->2 1.998342 | |
UQSHL (vector, 8B) 1->3 2.004252 | |
UQSHL (vector, 8B) throughput 0.505346 | |
UQSHL (vector, 16B) 1->2 2.004697 | |
UQSHL (vector, 16B) 1->3 2.003767 | |
UQSHL (vector, 16B) throughput 0.499464 | |
UQSHL (vector, 4H) 1->2 2.003962 | |
UQSHL (vector, 4H) 1->3 2.001657 | |
UQSHL (vector, 4H) throughput 0.500810 | |
UQSHL (vector, 8H) 1->2 1.995239 | |
UQSHL (vector, 8H) 1->3 2.508322 | |
UQSHL (vector, 8H) throughput 0.497732 | |
UQSHL (vector, 2S) 1->2 1.988098 | |
UQSHL (vector, 2S) 1->3 2.015091 | |
UQSHL (vector, 2S) throughput 0.502505 | |
UQSHL (vector, 4S) 1->2 1.996445 | |
UQSHL (vector, 4S) 1->3 2.004730 | |
UQSHL (vector, 4S) throughput 0.499939 | |
UQSHL (vector, 2D) 1->2 2.042112 | |
UQSHL (vector, 2D) 1->3 2.001620 | |
UQSHL (vector, 2D) throughput 0.477072 | |
UQSHRN (H) 1->2 3.392759 | |
UQSHRN (H) throughput 0.500182 | |
UQSHRN (S) 1->2 3.012148 | |
UQSHRN (S) throughput 0.500121 | |
UQSHRN (D) 1->2 3.009032 | |
UQSHRN (D) throughput 0.500921 | |
UQSHRN (8H) 1->2 2.997577 | |
UQSHRN (8H) throughput 0.501807 | |
UQSHRN (4S) 1->2 3.007369 | |
UQSHRN (4S) throughput 0.499591 | |
UQSHRN (2D) 1->2 2.995474 | |
UQSHRN (2D) throughput 0.500622 | |
UQSHRN2 (8H) 1->1 3.004051 | |
UQSHRN2 (8H) 1->2 3.006347 | |
UQSHRN2 (8H) throughput 0.501301 | |
UQSHRN2 (8H) throughput 0.591390 | |
UQSHRN2 (4S) 1->1 2.945206 | |
UQSHRN2 (4S) 1->2 3.006018 | |
UQSHRN2 (4S) throughput 0.471756 | |
UQSHRN2 (4S) throughput 0.513580 | |
UQSHRN2 (2D) 1->1 3.052552 | |
UQSHRN2 (2D) 1->2 2.997815 | |
UQSHRN2 (2D) throughput 0.501554 | |
UQSHRN2 (2D) throughput 0.502942 | |
UQSUB (scalar, B) 1->2 3.002060 | |
UQSUB (scalar, B) 1->3 3.268618 | |
UQSUB (scalar, B) throughput 0.502396 | |
UQSUB (scalar, H) 1->2 2.997059 | |
UQSUB (scalar, H) 1->3 3.033528 | |
UQSUB (scalar, H) throughput 0.501882 | |
UQSUB (scalar, S) 1->2 3.004132 | |
UQSUB (scalar, S) 1->3 3.001090 | |
UQSUB (scalar, S) throughput 0.500780 | |
UQSUB (scalar, D) 1->2 3.004332 | |
UQSUB (scalar, D) 1->3 3.007651 | |
UQSUB (scalar, D) throughput 0.505710 | |
UQSUB (vector, 8B) 1->2 2.999029 | |
UQSUB (vector, 8B) 1->3 2.999517 | |
UQSUB (vector, 8B) throughput 0.500086 | |
UQSUB (vector, 16B) 1->2 3.004657 | |
UQSUB (vector, 16B) 1->3 2.997735 | |
UQSUB (vector, 16B) throughput 0.499303 | |
UQSUB (vector, 4H) 1->2 3.010649 | |
UQSUB (vector, 4H) 1->3 2.998220 | |
UQSUB (vector, 4H) throughput 0.504550 | |
UQSUB (vector, 8H) 1->2 3.086739 | |
UQSUB (vector, 8H) 1->3 3.016121 | |
UQSUB (vector, 8H) throughput 0.489916 | |
UQSUB (vector, 2S) 1->2 3.059730 | |
UQSUB (vector, 2S) 1->3 1.996872 | |
UQSUB (vector, 2S) throughput 0.573582 | |
UQSUB (vector, 4S) 1->2 4.739487 | |
UQSUB (vector, 4S) 1->3 2.591698 | |
UQSUB (vector, 4S) throughput 1.121370 | |
UQSUB (vector, 2D) 1->2 3.148319 | |
UQSUB (vector, 2D) 1->3 4.411490 | |
UQSUB (vector, 2D) throughput 0.599011 | |
UQXTN (H) 1->2 2.977129 | |
UQXTN (H) throughput 0.586779 | |
UQXTN (S) 1->2 3.033272 | |
UQXTN (S) throughput 0.756405 | |
UQXTN (D) 1->2 3.023385 | |
UQXTN (D) throughput 0.495257 | |
UQXTN (8H) 1->2 4.256384 | |
UQXTN (8H) throughput 0.832011 | |
UQXTN (4S) 1->2 4.341512 | |
UQXTN (4S) throughput 0.505235 | |
UQXTN (2D) 1->2 3.025028 | |
UQXTN (2D) throughput 0.504063 | |
UQXTN2 (8H) 1->1 2.587378 | |
UQXTN2 (8H) 1->2 4.379300 | |
UQXTN2 (8H) throughput 0.520148 | |
UQXTN2 (8H) throughput 0.727016 | |
UQXTN2 (4S) 1->1 2.773772 | |
UQXTN2 (4S) 1->2 2.628527 | |
UQXTN2 (4S) throughput 0.624190 | |
UQXTN2 (4S) throughput 0.510067 | |
UQXTN2 (2D) 1->1 3.435462 | |
UQXTN2 (2D) 1->2 2.918382 | |
UQXTN2 (2D) throughput 0.300850 | |
UQXTN2 (2D) throughput 0.603676 | |
URECPE (vector, 2S) 1->2 3.013039 | |
URECPE (vector, 2S) throughput 1.010032 | |
URECPE (vector, 4S) 1->2 4.005748 | |
URECPE (vector, 4S) throughput 2.037750 | |
URHADD (vector, 8B) 1->2 2.004009 | |
URHADD (vector, 8B) 1->3 2.461513 | |
URHADD (vector, 8B) throughput 0.384811 | |
URHADD (vector, 16B) 1->2 2.120179 | |
URHADD (vector, 16B) 1->3 1.919364 | |
URHADD (vector, 16B) throughput 0.770809 | |
URHADD (vector, 4H) 1->2 1.303291 | |
URHADD (vector, 4H) 1->3 2.494275 | |
URHADD (vector, 4H) throughput 0.569315 | |
URHADD (vector, 8H) 1->2 2.002632 | |
URHADD (vector, 8H) 1->3 2.003766 | |
URHADD (vector, 8H) throughput 0.501377 | |
URHADD (vector, 2S) 1->2 2.004901 | |
URHADD (vector, 2S) 1->3 1.988352 | |
URHADD (vector, 2S) throughput 0.508580 | |
URHADD (vector, 4S) 1->2 2.021343 | |
URHADD (vector, 4S) 1->3 1.998022 | |
URHADD (vector, 4S) throughput 0.509050 | |
URSHL (vector, 8B) 1->2 3.007653 | |
URSHL (vector, 8B) 1->3 2.835124 | |
URSHL (vector, 8B) throughput 0.501543 | |
URSHL (vector, 16B) 1->2 3.745608 | |
URSHL (vector, 16B) 1->3 3.000669 | |
URSHL (vector, 16B) throughput 0.388147 | |
URSHL (vector, 4H) 1->2 5.821141 | |
URSHL (vector, 4H) 1->3 2.970527 | |
URSHL (vector, 4H) throughput 0.610191 | |
URSHL (vector, 8H) 1->2 3.286739 | |
URSHL (vector, 8H) 1->3 3.360378 | |
URSHL (vector, 8H) throughput 0.319726 | |
URSHL (vector, 2S) 1->2 2.488568 | |
URSHL (vector, 2S) 1->3 3.125006 | |
URSHL (vector, 2S) throughput 0.423143 | |
URSHL (vector, 4S) 1->2 4.679257 | |
URSHL (vector, 4S) 1->3 3.022717 | |
URSHL (vector, 4S) throughput 0.509860 | |
URSHL (vector, 2D) 1->2 3.009247 | |
URSHL (vector, 2D) 1->3 3.068553 | |
URSHL (vector, 2D) throughput 0.500045 | |
URSHL (vector, D) 1->2 3.006317 | |
URSHL (vector, D) 1->3 3.001816 | |
URSHL (vector, D) throughput 0.500673 | |
URSHR (vector, 8B) 1->2 2.998058 | |
URSHR (vector, 8B) throughput 0.500698 | |
URSHR (vector, 16B) 1->2 3.009920 | |
URSHR (vector, 16B) throughput 0.500678 | |
URSHR (vector, 4H) 1->2 3.004941 | |
URSHR (vector, 4H) throughput 0.509312 | |
URSHR (vector, 8H) 1->2 2.954608 | |
URSHR (vector, 8H) throughput 0.501139 | |
URSHR (vector, 2S) 1->2 2.946709 | |
URSHR (vector, 2S) throughput 0.531821 | |
URSHR (vector, 4S) 1->2 2.997495 | |
URSHR (vector, 4S) throughput 0.501422 | |
URSHR (vector, 2D) 1->2 2.993660 | |
URSHR (vector, 2D) throughput 0.508366 | |
URSHR (vector, D) 1->2 3.045716 | |
URSHR (vector, D) throughput 0.499031 | |
URSQRTE (vector, 2S) 1->2 3.004246 | |
URSQRTE (vector, 2S) throughput 1.003665 | |
URSQRTE (vector, 4S) 1->2 4.040411 | |
URSQRTE (vector, 4S) throughput 2.008019 | |
URSRA (vector, 8B) 1->1 3.003355 | |
URSRA (vector, 8B) 1->2 3.003968 | |
URSRA (vector, 8B) throughput 0.503118 | |
URSRA (vector, 8B) throughput 0.512767 | |
URSRA (vector, 16B) 1->1 3.244222 | |
URSRA (vector, 16B) 1->2 3.004008 | |
URSRA (vector, 16B) throughput 0.514830 | |
URSRA (vector, 16B) throughput 0.502523 | |
URSRA (vector, 4H) 1->1 3.028914 | |
URSRA (vector, 4H) 1->2 2.998747 | |
URSRA (vector, 4H) throughput 0.537276 | |
URSRA (vector, 4H) throughput 0.499389 | |
URSRA (vector, 8H) 1->1 3.047338 | |
URSRA (vector, 8H) 1->2 2.995999 | |
URSRA (vector, 8H) throughput 0.577622 | |
URSRA (vector, 8H) throughput 0.523333 | |
URSRA (vector, 2S) 1->1 3.011381 | |
URSRA (vector, 2S) 1->2 3.006235 | |
URSRA (vector, 2S) throughput 0.504517 | |
URSRA (vector, 2S) throughput 0.503449 | |
URSRA (vector, 4S) 1->1 3.079847 | |
URSRA (vector, 4S) 1->2 3.000443 | |
URSRA (vector, 4S) throughput 0.500712 | |
URSRA (vector, 4S) throughput 0.500036 | |
URSRA (vector, 2D) 1->1 3.000202 | |
URSRA (vector, 2D) 1->2 2.998706 | |
URSRA (vector, 2D) throughput 0.502314 | |
URSRA (vector, 2D) throughput 0.502417 | |
URSRA (vector, D) 1->1 3.013890 | |
URSRA (vector, D) 1->2 3.039845 | |
URSRA (vector, D) throughput 0.501417 | |
URSRA (vector, D) throughput 0.501303 | |
USHL (vector, 8B) 1->2 1.989495 | |
USHL (vector, 8B) 1->3 1.999152 | |
USHL (vector, 8B) throughput 0.502374 | |
USHL (vector, 16B) 1->2 1.994234 | |
USHL (vector, 16B) 1->3 1.991499 | |
USHL (vector, 16B) throughput 0.526541 | |
USHL (vector, 4H) 1->2 1.996527 | |
USHL (vector, 4H) 1->3 1.996398 | |
USHL (vector, 4H) throughput 0.501022 | |
USHL (vector, 8H) 1->2 1.996244 | |
USHL (vector, 8H) 1->3 1.997013 | |
USHL (vector, 8H) throughput 0.500187 | |
USHL (vector, 2S) 1->2 1.996202 | |
USHL (vector, 2S) 1->3 1.997496 | |
USHL (vector, 2S) throughput 0.499995 | |
USHL (vector, 4S) 1->2 1.927933 | |
USHL (vector, 4S) 1->3 2.036458 | |
USHL (vector, 4S) throughput 0.457522 | |
USHL (vector, 2D) 1->2 2.034435 | |
USHL (vector, 2D) 1->3 2.138450 | |
USHL (vector, 2D) throughput 0.589878 | |
USHL (vector, D) 1->2 1.999655 | |
USHL (vector, D) 1->3 1.996566 | |
USHL (vector, D) throughput 0.477089 | |
USHLL (8H) 1->2 2.001011 | |
USHLL (8H) throughput 0.501119 | |
USHLL (4S) 1->2 1.998870 | |
USHLL (4S) throughput 0.505809 | |
USHLL (2D) 1->2 1.875590 | |
USHLL (2D) throughput 0.504577 | |
USHLL2 (8H) 1->2 2.052146 | |
USHLL2 (8H) throughput 0.446601 | |
USHLL2 (4S) 1->2 1.999394 | |
USHLL2 (4S) throughput 0.502997 | |
USHLL2 (2D) 1->2 2.018545 | |
USHLL2 (2D) throughput 0.502672 | |
USHR (vector, 8B) 1->2 2.005020 | |
USHR (vector, 8B) throughput 0.477208 | |
USHR (vector, 16B) 1->2 2.004049 | |
USHR (vector, 16B) throughput 0.504016 | |
USHR (vector, 4H) 1->2 2.001456 | |
USHR (vector, 4H) throughput 0.502161 | |
USHR (vector, 8H) 1->2 2.001416 | |
USHR (vector, 8H) throughput 0.501605 | |
USHR (vector, 2S) 1->2 1.749885 | |
USHR (vector, 2S) throughput 0.550078 | |
USHR (vector, 4S) 1->2 2.005507 | |
USHR (vector, 4S) throughput 0.502693 | |
USHR (vector, 2D) 1->2 1.995962 | |
USHR (vector, 2D) throughput 0.500121 | |
USHR (vector, D) 1->2 2.041423 | |
USHR (vector, D) throughput 0.494804 | |
USQADD (scalar, B) 1->1 2.997816 | |
USQADD (scalar, B) 1->2 3.003724 | |
USQADD (scalar, B) throughput 0.503462 | |
USQADD (scalar, B) throughput 0.501184 | |
USQADD (scalar, H) 1->1 6.566940 | |
USQADD (scalar, H) 1->2 2.973582 | |
USQADD (scalar, H) throughput 0.503567 | |
USQADD (scalar, H) throughput 0.500800 | |
USQADD (scalar, S) 1->1 3.007369 | |
USQADD (scalar, S) 1->2 2.998706 | |
USQADD (scalar, S) throughput 0.502809 | |
USQADD (scalar, S) throughput 0.500716 | |
USQADD (scalar, D) 1->1 3.000890 | |
USQADD (scalar, D) 1->2 3.033487 | |
USQADD (scalar, D) throughput 0.513847 | |
USQADD (scalar, D) throughput 0.499771 | |
USQADD (vector, 8B) 1->1 2.999030 | |
USQADD (vector, 8B) 1->2 2.997737 | |
USQADD (vector, 8B) throughput 0.502815 | |
USQADD (vector, 8B) throughput 0.506357 | |
USQADD (vector, 16B) 1->1 2.998708 | |
USQADD (vector, 16B) 1->2 3.000404 | |
USQADD (vector, 16B) throughput 0.501625 | |
USQADD (vector, 16B) throughput 0.510639 | |
USQADD (vector, 4H) 1->1 3.000766 | |
USQADD (vector, 4H) 1->2 2.994345 | |
USQADD (vector, 4H) throughput 0.500242 | |
USQADD (vector, 4H) throughput 0.504551 | |
USQADD (vector, 8H) 1->1 2.997130 | |
USQADD (vector, 8H) 1->2 2.985403 | |
USQADD (vector, 8H) throughput 0.475672 | |
USQADD (vector, 8H) throughput 0.505702 | |
USQADD (vector, 2S) 1->1 3.004091 | |
USQADD (vector, 2S) 1->2 3.003768 | |
USQADD (vector, 2S) throughput 0.590591 | |
USQADD (vector, 2S) throughput 0.493307 | |
USQADD (vector, 4S) 1->1 3.036975 | |
USQADD (vector, 4S) 1->2 3.004050 | |
USQADD (vector, 4S) throughput 0.504719 | |
USQADD (vector, 4S) throughput 0.502641 | |
USQADD (vector, 2D) 1->1 2.998869 | |
USQADD (vector, 2D) 1->2 2.865945 | |
USQADD (vector, 2D) throughput 0.501979 | |
USQADD (vector, 2D) throughput 0.514321 | |
USRA (vector, 8B) 1->1 3.001335 | |
USRA (vector, 8B) 1->2 2.947156 | |
USRA (vector, 8B) throughput 0.501108 | |
USRA (vector, 8B) throughput 0.503346 | |
USRA (vector, 16B) 1->1 3.010770 | |
USRA (vector, 16B) 1->2 3.003035 | |
USRA (vector, 16B) throughput 0.504724 | |
USRA (vector, 16B) throughput 0.589196 | |
USRA (vector, 4H) 1->1 2.993619 | |
USRA (vector, 4H) 1->2 2.998624 | |
USRA (vector, 4H) throughput 0.501589 | |
USRA (vector, 4H) throughput 0.500894 | |
USRA (vector, 8H) 1->1 3.803372 | |
USRA (vector, 8H) 1->2 2.293926 | |
USRA (vector, 8H) throughput 0.603799 | |
USRA (vector, 8H) throughput 0.759979 | |
USRA (vector, 2S) 1->1 2.623391 | |
USRA (vector, 2S) 1->2 3.010146 | |
USRA (vector, 2S) throughput 0.314360 | |
USRA (vector, 2S) throughput 0.873608 | |
USRA (vector, 4S) 1->1 2.998970 | |
USRA (vector, 4S) 1->2 3.022581 | |
USRA (vector, 4S) throughput 0.509754 | |
USRA (vector, 4S) throughput 0.505011 | |
USRA (vector, 2D) 1->1 2.812467 | |
USRA (vector, 2D) 1->2 2.506845 | |
USRA (vector, 2D) throughput 0.380819 | |
USRA (vector, 2D) throughput 0.506094 | |
USRA (vector, D) 1->1 2.992441 | |
USRA (vector, D) 1->2 2.139951 | |
USRA (vector, D) throughput 0.556692 | |
USRA (vector, D) throughput 0.817266 | |
USUBL (vector, 8H) 1->2 2.753544 | |
USUBL (vector, 8H) 1->3 2.007528 | |
USUBL (vector, 8H) throughput 0.517599 | |
USUBL (vector, 4S) 1->2 1.994681 | |
USUBL (vector, 4S) 1->3 2.013432 | |
USUBL (vector, 4S) throughput 0.520950 | |
USUBL (vector, 2D) 1->2 2.013319 | |
USUBL (vector, 2D) 1->3 2.005394 | |
USUBL (vector, 2D) throughput 0.500893 | |
USUBL2 (vector, 8H) 1->2 2.014213 | |
USUBL2 (vector, 8H) 1->3 2.003954 | |
USUBL2 (vector, 8H) throughput 0.472374 | |
USUBL2 (vector, 4S) 1->2 1.577588 | |
USUBL2 (vector, 4S) 1->3 1.999742 | |
USUBL2 (vector, 4S) throughput 0.372265 | |
USUBL2 (vector, 2D) 1->2 2.098169 | |
USUBL2 (vector, 2D) 1->3 2.038050 | |
USUBL2 (vector, 2D) throughput 1.150694 | |
USUBW (vector, 8H) 1->2 1.682018 | |
USUBW (vector, 8H) 1->3 2.008733 | |
USUBW (vector, 8H) throughput 0.965066 | |
USUBW (vector, 4S) 1->2 1.996234 | |
USUBW (vector, 4S) 1->3 2.033434 | |
USUBW (vector, 4S) throughput 0.499165 | |
USUBW (vector, 2D) 1->2 1.365220 | |
USUBW (vector, 2D) 1->3 2.073189 | |
USUBW (vector, 2D) throughput 0.499442 | |
USUBW2 (vector, 8H) 1->2 1.997746 | |
USUBW2 (vector, 8H) 1->3 2.001944 | |
USUBW2 (vector, 8H) throughput 0.522561 | |
USUBW2 (vector, 4S) 1->2 2.337822 | |
USUBW2 (vector, 4S) 1->3 0.591963 | |
USUBW2 (vector, 4S) throughput 1.147010 | |
USUBW2 (vector, 2D) 1->2 1.948438 | |
USUBW2 (vector, 2D) 1->3 1.832933 | |
USUBW2 (vector, 2D) throughput 0.336697 | |
UXTL (8H) 1->2 1.664685 | |
UXTL (8H) throughput 0.530700 | |
UXTL (4S) 1->2 2.001617 | |
UXTL (4S) throughput 0.501073 | |
UXTL (2D) 1->2 1.996365 | |
UXTL (2D) throughput 0.384898 | |
UXTL2 (8H) 1->2 1.933410 | |
UXTL2 (8H) throughput 0.503815 | |
UXTL2 (4S) 1->2 1.927957 | |
UXTL2 (4S) throughput 0.782738 | |
UXTL2 (2D) 1->2 1.862616 | |
UXTL2 (2D) throughput 0.862132 | |
UZP1 (vector, 8B) 1->2 2.007480 | |
UZP1 (vector, 8B) 1->3 2.048063 | |
UZP1 (vector, 8B) throughput 0.529028 | |
UZP1 (vector, 16B) 1->2 2.028290 | |
UZP1 (vector, 16B) 1->3 1.998299 | |
UZP1 (vector, 16B) throughput 0.510807 | |
UZP1 (vector, 4H) 1->2 2.080379 | |
UZP1 (vector, 4H) 1->3 2.001418 | |
UZP1 (vector, 4H) throughput 0.448635 | |
UZP1 (vector, 8H) 1->2 2.013215 | |
UZP1 (vector, 8H) 1->3 2.060852 | |
UZP1 (vector, 8H) throughput 0.432702 | |
UZP1 (vector, 2S) 1->2 2.005752 | |
UZP1 (vector, 2S) 1->3 2.007434 | |
UZP1 (vector, 2S) throughput 0.536685 | |
UZP1 (vector, 4S) 1->2 1.400319 | |
UZP1 (vector, 4S) 1->3 2.638996 | |
UZP1 (vector, 4S) throughput 0.539946 | |
UZP1 (vector, 2D) 1->2 2.003700 | |
UZP1 (vector, 2D) 1->3 2.259963 | |
UZP1 (vector, 2D) throughput 0.528597 | |
UZP2 (vector, 8B) 1->2 1.940772 | |
UZP2 (vector, 8B) 1->3 2.443712 | |
UZP2 (vector, 8B) throughput 0.501203 | |
UZP2 (vector, 16B) 1->2 1.937812 | |
UZP2 (vector, 16B) 1->3 1.913362 | |
UZP2 (vector, 16B) throughput 0.623700 | |
UZP2 (vector, 4H) 1->2 1.822546 | |
UZP2 (vector, 4H) 1->3 1.977655 | |
UZP2 (vector, 4H) throughput 0.513033 | |
UZP2 (vector, 8H) 1->2 2.005845 | |
UZP2 (vector, 8H) 1->3 1.998345 | |
UZP2 (vector, 8H) throughput 0.503386 | |
UZP2 (vector, 2S) 1->2 2.011903 | |
UZP2 (vector, 2S) 1->3 1.991578 | |
UZP2 (vector, 2S) throughput 0.310717 | |
UZP2 (vector, 4S) 1->2 4.347735 | |
UZP2 (vector, 4S) 1->3 1.997954 | |
UZP2 (vector, 4S) throughput 0.548875 | |
UZP2 (vector, 2D) 1->2 2.001721 | |
UZP2 (vector, 2D) 1->3 2.774334 | |
UZP2 (vector, 2D) throughput 0.375724 | |
XTN (8H) 1->2 2.000087 | |
XTN (8H) throughput 0.433367 | |
XTN (4S) 1->2 2.017537 | |
XTN (4S) throughput 0.514384 | |
XTN (2D) 1->2 1.927030 | |
XTN (2D) throughput 0.503530 | |
XTN2 (8H) 1->1 1.999742 | |
XTN2 (8H) 1->2 2.023476 | |
XTN2 (8H) throughput 0.540852 | |
XTN2 (8H) throughput 0.512553 | |
XTN2 (4S) 1->1 2.052769 | |
XTN2 (4S) 1->2 2.039766 | |
XTN2 (4S) throughput 0.579195 | |
XTN2 (4S) throughput 0.537616 | |
XTN2 (2D) 1->1 1.990717 | |
XTN2 (2D) 1->2 2.163577 | |
XTN2 (2D) throughput 0.671383 | |
XTN2 (2D) throughput 0.612017 | |
ZIP1 (vector, 8B) 1->2 2.863807 | |
ZIP1 (vector, 8B) 1->3 1.739579 | |
ZIP1 (vector, 8B) throughput 0.557652 | |
ZIP1 (vector, 16B) 1->2 1.804700 | |
ZIP1 (vector, 16B) 1->3 2.086436 | |
ZIP1 (vector, 16B) throughput 0.584670 | |
ZIP1 (vector, 4H) 1->2 2.678185 | |
ZIP1 (vector, 4H) 1->3 1.765431 | |
ZIP1 (vector, 4H) throughput 0.744936 | |
ZIP1 (vector, 8H) 1->2 2.069380 | |
ZIP1 (vector, 8H) 1->3 0.972466 | |
ZIP1 (vector, 8H) throughput 0.469937 | |
ZIP1 (vector, 2S) 1->2 1.958476 | |
ZIP1 (vector, 2S) 1->3 1.795895 | |
ZIP1 (vector, 2S) throughput 0.517368 | |
ZIP1 (vector, 4S) 1->2 1.592768 | |
ZIP1 (vector, 4S) 1->3 1.996014 | |
ZIP1 (vector, 4S) throughput 0.645911 | |
ZIP1 (vector, 2D) 1->2 2.002308 | |
ZIP1 (vector, 2D) 1->3 1.772047 | |
ZIP1 (vector, 2D) throughput 0.499879 | |
ZIP2 (vector, 8B) 1->2 2.007073 | |
ZIP2 (vector, 8B) 1->3 2.026400 | |
ZIP2 (vector, 8B) throughput 0.500886 | |
ZIP2 (vector, 16B) 1->2 1.906780 | |
ZIP2 (vector, 16B) 1->3 2.001659 | |
ZIP2 (vector, 16B) throughput 0.500764 | |
ZIP2 (vector, 4H) 1->2 2.001617 | |
ZIP2 (vector, 4H) 1->3 2.004940 | |
ZIP2 (vector, 4H) throughput 0.500015 | |
ZIP2 (vector, 8H) 1->2 2.001500 | |
ZIP2 (vector, 8H) 1->3 2.085533 | |
ZIP2 (vector, 8H) throughput 0.618345 | |
ZIP2 (vector, 2S) 1->2 1.944943 | |
ZIP2 (vector, 2S) 1->3 1.967563 | |
ZIP2 (vector, 2S) throughput 0.507152 | |
ZIP2 (vector, 4S) 1->2 1.962262 | |
ZIP2 (vector, 4S) 1->3 2.004400 | |
ZIP2 (vector, 4S) throughput 0.433273 | |
ZIP2 (vector, 2D) 1->2 1.553542 | |
ZIP2 (vector, 2D) 1->3 4.027645 | |
ZIP2 (vector, 2D) throughput 0.426050 |
This file has been truncated, but you can view the full file.
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
ADC (32-bit) 1->2 1.001724 | |
ADC (32-bit) 1->3 0.997957 | |
ADC (32-bit) 1->4 1.031768 | |
ADC (32-bit) throughput 0.252231 | |
ADC (64-bit) 1->2 1.001924 | |
ADC (64-bit) 1->3 1.006011 | |
ADC (64-bit) 1->4 1.003799 | |
ADC (64-bit) throughput 0.253425 | |
ADCS (32-bit) 1->2 1.000050 | |
ADCS (32-bit) 1->3 1.013846 | |
ADCS (32-bit) 1->4 1.011756 | |
ADCS (32-bit) 4->2 0.997830 | |
ADCS (32-bit) 4->3 1.016450 | |
ADCS (32-bit) 4->4 0.998229 | |
ADCS (32-bit) throughput 0.753265 | |
ADCS (32-bit) throughput 0.574634 | |
ADCS (64-bit) 1->2 0.998628 | |
ADCS (64-bit) 1->3 1.011108 | |
ADCS (64-bit) 1->4 0.998153 | |
ADCS (64-bit) 4->2 1.005563 | |
ADCS (64-bit) 4->3 1.005975 | |
ADCS (64-bit) 4->4 0.985275 | |
ADCS (64-bit) throughput 0.751609 | |
ADCS (64-bit) throughput 0.567030 | |
ADD (sxtb, 32-bit) 1->2 2.005350 | |
ADD (sxtb, 32-bit) 1->3 2.009502 | |
ADD (sxtb, 32-bit) throughput 0.503392 | |
ADD (sxtb, 64-bit) 1->2 1.997356 | |
ADD (sxtb, 64-bit) 1->3 1.990158 | |
ADD (sxtb, 64-bit) throughput 0.500940 | |
ADD (uxtb, 32-bit) 1->2 0.999027 | |
ADD (uxtb, 32-bit) 1->3 0.998802 | |
ADD (uxtb, 32-bit) throughput 0.250812 | |
ADD (uxtb, 64-bit) 1->2 1.000599 | |
ADD (uxtb, 64-bit) 1->3 1.000549 | |
ADD (uxtb, 64-bit) throughput 0.250596 | |
ADD (sxth, 32-bit) 1->2 1.997554 | |
ADD (sxth, 32-bit) 1->3 1.997131 | |
ADD (sxth, 32-bit) throughput 0.500022 | |
ADD (sxth, 64-bit) 1->2 2.002900 | |
ADD (sxth, 64-bit) 1->3 2.003974 | |
ADD (sxth, 64-bit) throughput 0.500025 | |
ADD (uxth, 32-bit) 1->2 1.002500 | |
ADD (uxth, 32-bit) 1->3 1.003875 | |
ADD (uxth, 32-bit) throughput 0.253688 | |
ADD (uxth, 64-bit) 1->2 0.999526 | |
ADD (uxth, 64-bit) 1->3 1.002025 | |
ADD (uxth, 64-bit) throughput 0.250703 | |
ADD (sxtw, 32-bit) 1->2 0.999801 | |
ADD (sxtw, 32-bit) 1->3 1.001573 | |
ADD (sxtw, 32-bit) throughput 0.251906 | |
ADD (sxtw, 64-bit) 1->2 2.002474 | |
ADD (sxtw, 64-bit) 1->3 2.006486 | |
ADD (sxtw, 64-bit) throughput 0.502009 | |
ADD (uxtw, 32-bit) 1->2 0.999401 | |
ADD (uxtw, 32-bit) 1->3 0.998253 | |
ADD (uxtw, 32-bit) throughput 0.250781 | |
ADD (uxtw, 64-bit) 1->2 1.001400 | |
ADD (uxtw, 64-bit) 1->3 0.999051 | |
ADD (uxtw, 64-bit) throughput 0.250359 | |
ADD (uxtx, 64-bit) 1->2 1.000699 | |
ADD (uxtx, 64-bit) 1->3 1.000000 | |
ADD (uxtx, 64-bit) throughput 0.250515 | |
ADD (sxtx, 64-bit) 1->2 0.998129 | |
ADD (sxtx, 64-bit) 1->3 0.998153 | |
ADD (sxtx, 64-bit) throughput 0.250690 | |
ADD (immediate, 32-bit) 1->2 0.999801 | |
ADD (immediate, 32-bit) throughput 0.250362 | |
ADD (immediate, 64-bit) 1->2 1.001824 | |
ADD (immediate, 64-bit) throughput 0.250340 | |
ADD (shifted immediate, 32-bit) 1->2 1.000849 | |
ADD (shifted immediate, 32-bit) throughput 0.250800 | |
ADD (shifted immediate, 64-bit) 1->2 1.001800 | |
ADD (shifted immediate, 64-bit) throughput 0.250268 | |
ADD (register, 32-bit) 1->2 1.002400 | |
ADD (register, 32-bit) 1->3 1.001825 | |
ADD (register, 32-bit) throughput 0.250306 | |
ADD (register, 64-bit) 1->2 0.998154 | |
ADD (register, 64-bit) 1->3 0.900994 | |
ADD (register, 64-bit) throughput 0.251913 | |
ADD (register, lsl, 32-bit) 1->2 1.999230 | |
ADD (register, lsl, 32-bit) 1->3 2.017136 | |
ADD (register, lsl, 32-bit) throughput 0.501649 | |
ADD (register, lsl, 64-bit) 1->2 2.001312 | |
ADD (register, lsl, 64-bit) 1->3 1.995312 | |
ADD (register, lsl, 64-bit) throughput 0.504281 | |
ADD (register, lsr, 32-bit) 1->2 2.007464 | |
ADD (register, lsr, 32-bit) 1->3 2.017951 | |
ADD (register, lsr, 32-bit) throughput 0.501388 | |
ADD (register, lsr, 64-bit) 1->2 2.013573 | |
ADD (register, lsr, 64-bit) 1->3 2.011425 | |
ADD (register, lsr, 64-bit) throughput 0.502631 | |
ADD (register, asr, 32-bit) 1->2 1.999724 | |
ADD (register, asr, 32-bit) 1->3 2.003575 | |
ADD (register, asr, 32-bit) throughput 0.501634 | |
ADD (register, asr, 64-bit) 1->2 2.015700 | |
ADD (register, asr, 64-bit) 1->3 2.003551 | |
ADD (register, asr, 64-bit) throughput 0.497531 | |
ADDS (sxtb, 32-bit) 1->2 1.994942 | |
ADDS (sxtb, 32-bit) 1->3 1.999376 | |
ADDS (sxtb, 32-bit) 4->2 2.007260 | |
ADDS (sxtb, 32-bit) 4->3 2.002750 | |
ADDS (sxtb, 32-bit) throughput 0.501850 | |
ADDS (sxtb, 64-bit) 1->2 2.003599 | |
ADDS (sxtb, 64-bit) 1->3 2.002845 | |
ADDS (sxtb, 64-bit) 4->2 1.999176 | |
ADDS (sxtb, 64-bit) 4->3 2.002700 | |
ADDS (sxtb, 64-bit) throughput 0.502311 | |
ADDS (uxtb, 32-bit) 1->2 1.001822 | |
ADDS (uxtb, 32-bit) 1->3 1.001950 | |
ADDS (uxtb, 32-bit) 4->2 0.999301 | |
ADDS (uxtb, 32-bit) 4->3 1.006266 | |
ADDS (uxtb, 32-bit) throughput 0.500996 | |
ADDS (uxtb, 64-bit) 1->2 1.001899 | |
ADDS (uxtb, 64-bit) 1->3 1.009424 | |
ADDS (uxtb, 64-bit) 4->2 1.006266 | |
ADDS (uxtb, 64-bit) 4->3 1.007464 | |
ADDS (uxtb, 64-bit) throughput 0.501490 | |
ADDS (sxth, 32-bit) 1->2 2.002974 | |
ADDS (sxth, 32-bit) 1->3 2.009724 | |
ADDS (sxth, 32-bit) 4->2 1.627990 | |
ADDS (sxth, 32-bit) 4->3 1.939374 | |
ADDS (sxth, 32-bit) throughput 0.546155 | |
ADDS (sxth, 64-bit) 1->2 1.984059 | |
ADDS (sxth, 64-bit) 1->3 1.998253 | |
ADDS (sxth, 64-bit) 4->2 2.018298 | |
ADDS (sxth, 64-bit) 4->3 1.951331 | |
ADDS (sxth, 64-bit) throughput 0.502869 | |
ADDS (uxth, 32-bit) 1->2 1.002321 | |
ADDS (uxth, 32-bit) 1->3 0.998154 | |
ADDS (uxth, 32-bit) 4->2 0.997979 | |
ADDS (uxth, 32-bit) 4->3 0.999178 | |
ADDS (uxth, 32-bit) throughput 0.503615 | |
ADDS (uxth, 64-bit) 1->2 0.999875 | |
ADDS (uxth, 64-bit) 1->3 0.998128 | |
ADDS (uxth, 64-bit) 4->2 1.000125 | |
ADDS (uxth, 64-bit) 4->3 1.078642 | |
ADDS (uxth, 64-bit) throughput 0.500879 | |
ADDS (sxtw, 32-bit) 1->2 0.998378 | |
ADDS (sxtw, 32-bit) 1->3 0.998403 | |
ADDS (sxtw, 32-bit) 4->2 0.999276 | |
ADDS (sxtw, 32-bit) 4->3 1.025513 | |
ADDS (sxtw, 32-bit) throughput 0.501026 | |
ADDS (sxtw, 64-bit) 1->2 2.002749 | |
ADDS (sxtw, 64-bit) 1->3 2.012298 | |
ADDS (sxtw, 64-bit) 4->2 1.997255 | |
ADDS (sxtw, 64-bit) 4->3 1.999451 | |
ADDS (sxtw, 64-bit) throughput 0.499841 | |
ADDS (uxtw, 32-bit) 1->2 0.998851 | |
ADDS (uxtw, 32-bit) 1->3 0.998802 | |
ADDS (uxtw, 32-bit) 4->2 1.002724 | |
ADDS (uxtw, 32-bit) 4->3 0.986746 | |
ADDS (uxtw, 32-bit) throughput 0.501441 | |
ADDS (uxtw, 64-bit) 1->2 1.001975 | |
ADDS (uxtw, 64-bit) 1->3 1.001225 | |
ADDS (uxtw, 64-bit) 4->2 1.002801 | |
ADDS (uxtw, 64-bit) 4->3 0.999301 | |
ADDS (uxtw, 64-bit) throughput 0.499794 | |
ADDS (uxtx, 64-bit) 1->2 0.998252 | |
ADDS (uxtx, 64-bit) 1->3 0.998079 | |
ADDS (uxtx, 64-bit) 4->2 0.999251 | |
ADDS (uxtx, 64-bit) 4->3 1.010952 | |
ADDS (uxtx, 64-bit) throughput 0.501163 | |
ADDS (sxtx, 64-bit) 1->2 0.998951 | |
ADDS (sxtx, 64-bit) 1->3 1.001274 | |
ADDS (sxtx, 64-bit) 4->2 1.007417 | |
ADDS (sxtx, 64-bit) 4->3 1.003523 | |
ADDS (sxtx, 64-bit) throughput 0.500452 | |
ADDS (immediate, 32-bit) 1->2 0.998802 | |
ADDS (immediate, 32-bit) 3->2 1.002825 | |
ADDS (immediate, 32-bit) throughput 0.500680 | |
ADDS (immediate, 64-bit) 1->2 1.000623 | |
ADDS (immediate, 64-bit) 3->2 1.003692 | |
ADDS (immediate, 64-bit) throughput 0.500884 | |
ADDS (shifted immediate, 32-bit) 1->2 0.998851 | |
ADDS (shifted immediate, 32-bit) 3->2 0.999351 | |
ADDS (shifted immediate, 32-bit) throughput 0.501528 | |
ADDS (shifted immediate, 64-bit) 1->2 1.000574 | |
ADDS (shifted immediate, 64-bit) 3->2 0.999226 | |
ADDS (shifted immediate, 64-bit) throughput 0.499782 | |
ADDS (register, 32-bit) 1->2 0.998079 | |
ADDS (register, 32-bit) 1->3 1.001972 | |
ADDS (register, 32-bit) 4->2 1.002950 | |
ADDS (register, 32-bit) 4->3 1.006691 | |
ADDS (register, 32-bit) throughput 0.502149 | |
ADDS (register, 64-bit) 1->2 0.999476 | |
ADDS (register, 64-bit) 1->3 0.998128 | |
ADDS (register, 64-bit) 4->2 1.006088 | |
ADDS (register, 64-bit) 4->3 0.997930 | |
ADDS (register, 64-bit) throughput 0.500820 | |
ADDS (register, lsl, 32-bit) 1->2 2.003225 | |
ADDS (register, lsl, 32-bit) 1->3 2.009483 | |
ADDS (register, lsl, 32-bit) 4->2 2.002700 | |
ADDS (register, lsl, 32-bit) 4->3 2.009255 | |
ADDS (register, lsl, 32-bit) throughput 0.501276 | |
ADDS (register, lsl, 64-bit) 1->2 2.003100 | |
ADDS (register, lsl, 64-bit) 1->3 2.002949 | |
ADDS (register, lsl, 64-bit) 4->2 2.011675 | |
ADDS (register, lsl, 64-bit) 4->3 1.992481 | |
ADDS (register, lsl, 64-bit) throughput 0.502322 | |
ADDS (register, lsr, 32-bit) 1->2 2.004125 | |
ADDS (register, lsr, 32-bit) 1->3 2.010249 | |
ADDS (register, lsr, 32-bit) 4->2 2.003549 | |
ADDS (register, lsr, 32-bit) 4->3 2.002774 | |
ADDS (register, lsr, 32-bit) throughput 0.502409 | |
ADDS (register, lsr, 64-bit) 1->2 1.997729 | |
ADDS (register, lsr, 64-bit) 1->3 2.002951 | |
ADDS (register, lsr, 64-bit) 4->2 1.998054 | |
ADDS (register, lsr, 64-bit) 4->3 1.997230 | |
ADDS (register, lsr, 64-bit) throughput 0.495598 | |
ADDS (register, asr, 32-bit) 1->2 2.003100 | |
ADDS (register, asr, 32-bit) 1->3 1.999327 | |
ADDS (register, asr, 32-bit) 4->2 1.997355 | |
ADDS (register, asr, 32-bit) 4->3 2.010573 | |
ADDS (register, asr, 32-bit) throughput 0.501348 | |
ADDS (register, asr, 64-bit) 1->2 1.999026 | |
ADDS (register, asr, 64-bit) 1->3 1.999301 | |
ADDS (register, asr, 64-bit) 4->2 2.012235 | |
ADDS (register, asr, 64-bit) 4->3 1.998777 | |
ADDS (register, asr, 64-bit) throughput 0.501207 | |
ADR throughput 0.250932 | |
ADRP throughput 0.250684 | |
AND (immediate, 32-bit) 1->2 1.003515 | |
AND (immediate, 32-bit) throughput 0.250353 | |
AND (immediate, 64-bit) 1->2 1.005138 | |
AND (immediate, 64-bit) throughput 0.250112 | |
AND (register, 32-bit) 1->2 1.001225 | |
AND (register, 32-bit) 1->3 1.001225 | |
AND (register, 32-bit) throughput 0.250787 | |
AND (register, 64-bit) 1->2 1.001850 | |
AND (register, 64-bit) 1->3 1.000574 | |
AND (register, 64-bit) throughput 0.251455 | |
AND (register, lsl, 32-bit) 1->2 2.012233 | |
AND (register, lsl, 32-bit) 1->3 2.006241 | |
AND (register, lsl, 32-bit) throughput 0.500041 | |
AND (register, lsl, 64-bit) 1->2 2.008009 | |
AND (register, lsl, 64-bit) 1->3 1.999200 | |
AND (register, lsl, 64-bit) throughput 0.500253 | |
AND (register, lsr, 32-bit) 1->2 2.012505 | |
AND (register, lsr, 32-bit) 1->3 2.005789 | |
AND (register, lsr, 32-bit) throughput 0.501500 | |
AND (register, lsr, 64-bit) 1->2 1.997929 | |
AND (register, lsr, 64-bit) 1->3 1.994108 | |
AND (register, lsr, 64-bit) throughput 0.502383 | |
AND (register, asr, 32-bit) 1->2 1.998703 | |
AND (register, asr, 32-bit) 1->3 2.002849 | |
AND (register, asr, 32-bit) throughput 0.501784 | |
AND (register, asr, 64-bit) 1->2 1.999250 | |
AND (register, asr, 64-bit) 1->3 1.999251 | |
AND (register, asr, 64-bit) throughput 0.496600 | |
AND (register, ror, 32-bit) 1->2 1.999825 | |
AND (register, ror, 32-bit) 1->3 2.001872 | |
AND (register, ror, 32-bit) throughput 0.500060 | |
AND (register, ror, 64-bit) 1->2 1.998054 | |
AND (register, ror, 64-bit) 1->3 1.991853 | |
AND (register, ror, 64-bit) throughput 0.500069 | |
ADDS (immediate, 32-bit) 1->2 0.999900 | |
ADDS (immediate, 32-bit) 3->2 1.014133 | |
ADDS (immediate, 32-bit) throughput 0.501335 | |
ADDS (immediate, 64-bit) 1->2 0.999026 | |
ADDS (immediate, 64-bit) 3->2 1.003050 | |
ADDS (immediate, 64-bit) throughput 0.501131 | |
ANDS (register, 32-bit) 1->2 0.999026 | |
ANDS (register, 32-bit) 1->3 1.007533 | |
ANDS (register, 32-bit) 4->2 0.995536 | |
ANDS (register, 32-bit) 4->3 0.999551 | |
ANDS (register, 32-bit) throughput 0.500766 | |
ANDS (register, 64-bit) 1->2 1.001800 | |
ANDS (register, 64-bit) 1->3 0.999925 | |
ANDS (register, 64-bit) 4->2 1.006450 | |
ANDS (register, 64-bit) 4->3 0.998054 | |
ANDS (register, 64-bit) throughput 0.500234 | |
ANDS (register, lsl, 32-bit) 1->2 2.000299 | |
ANDS (register, lsl, 32-bit) 1->3 1.998129 | |
ANDS (register, lsl, 32-bit) 4->2 1.999825 | |
ANDS (register, lsl, 32-bit) 4->3 2.011886 | |
ANDS (register, lsl, 32-bit) throughput 0.501613 | |
ANDS (register, lsl, 64-bit) 1->2 2.002850 | |
ANDS (register, lsl, 64-bit) 1->3 2.005558 | |
ANDS (register, lsl, 64-bit) 4->2 2.009274 | |
ANDS (register, lsl, 64-bit) 4->3 2.002548 | |
ANDS (register, lsl, 64-bit) throughput 0.500944 | |
ANDS (register, lsr, 32-bit) 1->2 2.002951 | |
ANDS (register, lsr, 32-bit) 1->3 2.003475 | |
ANDS (register, lsr, 32-bit) 4->2 2.003817 | |
ANDS (register, lsr, 32-bit) 4->3 1.997581 | |
ANDS (register, lsr, 32-bit) throughput 0.501497 | |
ANDS (register, lsr, 64-bit) 1->2 2.002899 | |
ANDS (register, lsr, 64-bit) 1->3 1.999427 | |
ANDS (register, lsr, 64-bit) 4->2 1.999326 | |
ANDS (register, lsr, 64-bit) 4->3 2.004575 | |
ANDS (register, lsr, 64-bit) throughput 0.499851 | |
ANDS (register, asr, 32-bit) 1->2 2.004824 | |
ANDS (register, asr, 32-bit) 1->3 2.005592 | |
ANDS (register, asr, 32-bit) 4->2 2.004248 | |
ANDS (register, asr, 32-bit) 4->3 2.011249 | |
ANDS (register, asr, 32-bit) throughput 0.502803 | |
ANDS (register, asr, 64-bit) 1->2 2.003394 | |
ANDS (register, asr, 64-bit) 1->3 2.004893 | |
ANDS (register, asr, 64-bit) 4->2 2.002700 | |
ANDS (register, asr, 64-bit) 4->3 1.998980 | |
ANDS (register, asr, 64-bit) throughput 0.500670 | |
ANDS (register, ror, 32-bit) 1->2 2.010674 | |
ANDS (register, ror, 32-bit) 1->3 2.005569 | |
ANDS (register, ror, 32-bit) 4->2 2.002649 | |
ANDS (register, ror, 32-bit) 4->3 2.003325 | |
ANDS (register, ror, 32-bit) throughput 0.500187 | |
ANDS (register, ror, 64-bit) 1->2 1.999401 | |
ANDS (register, ror, 64-bit) 1->3 1.998228 | |
ANDS (register, ror, 64-bit) 4->2 1.998005 | |
ANDS (register, ror, 64-bit) 4->3 2.010586 | |
ANDS (register, ror, 64-bit) throughput 0.496239 | |
ASR (immediate, 32-bit) 1->2 1.000574 | |
ASR (immediate, 32-bit) throughput 0.250125 | |
ASR (immediate, 64-bit) 1->2 1.001250 | |
ASR (immediate, 64-bit) throughput 0.250962 | |
ASR (register, 32-bit) 1->2 1.000050 | |
ASR (register, 32-bit) 1->3 1.000025 | |
ASR (register, 32-bit) throughput 0.250125 | |
ASR (register, 64-bit) 1->2 1.004844 | |
ASR (register, 64-bit) 1->3 0.999975 | |
ASR (register, 64-bit) throughput 0.250368 | |
B throughput 1.003642 | |
B.cc (not taken) throughput 1.003006 | |
B.cc (taken) throughput 0.501071 | |
BFC (32-bit) 1->1 0.998129 | |
BFC (32-bit) throughput 1.001878 | |
BFC (64-bit) 1->1 1.000575 | |
BFC (64-bit) throughput 1.000896 | |
BFI (32-bit) 1->1 1.001925 | |
BFI (32-bit) 1->2 0.999276 | |
BFI (32-bit) throughput 1.001425 | |
BFI (64-bit) 1->1 0.999401 | |
BFI (64-bit) 1->2 1.003024 | |
BFI (64-bit) throughput 0.999807 | |
BFXIL (32-bit) 1->1 1.001925 | |
BFXIL (32-bit) 1->2 0.999352 | |
BFXIL (32-bit) throughput 1.000415 | |
BFXIL (64-bit) 1->1 0.998253 | |
BFXIL (64-bit) 1->2 1.006591 | |
BFXIL (64-bit) throughput 1.001947 | |
BIC (register, 32-bit) 1->2 1.000624 | |
BIC (register, 32-bit) 1->3 1.008371 | |
BIC (register, 32-bit) throughput 0.250371 | |
BIC (register, 64-bit) 1->2 1.000000 | |
BIC (register, 64-bit) 1->3 1.000650 | |
BIC (register, 64-bit) throughput 0.250888 | |
BIC (register, lsl, 32-bit) 1->2 2.003550 | |
BIC (register, lsl, 32-bit) 1->3 2.005691 | |
BIC (register, lsl, 32-bit) throughput 0.501860 | |
BIC (register, lsl, 64-bit) 1->2 2.004267 | |
BIC (register, lsl, 64-bit) 1->3 2.003100 | |
BIC (register, lsl, 64-bit) throughput 0.500078 | |
BIC (register, lsr, 32-bit) 1->2 2.011073 | |
BIC (register, lsr, 32-bit) 1->3 2.005068 | |
BIC (register, lsr, 32-bit) throughput 0.499838 | |
BIC (register, lsr, 64-bit) 1->2 2.011235 | |
BIC (register, lsr, 64-bit) 1->3 2.003524 | |
BIC (register, lsr, 64-bit) throughput 0.500804 | |
BIC (register, asr, 32-bit) 1->2 1.997654 | |
BIC (register, asr, 32-bit) 1->3 2.036382 | |
BIC (register, asr, 32-bit) throughput 0.500702 | |
BIC (register, asr, 64-bit) 1->2 2.002999 | |
BIC (register, asr, 64-bit) 1->3 2.002975 | |
BIC (register, asr, 64-bit) throughput 0.503371 | |
BIC (register, ror, 32-bit) 1->2 2.005589 | |
BIC (register, ror, 32-bit) 1->3 1.999225 | |
BIC (register, ror, 32-bit) throughput 0.501612 | |
BIC (register, ror, 64-bit) 1->2 1.997555 | |
BIC (register, ror, 64-bit) 1->3 2.003049 | |
BIC (register, ror, 64-bit) throughput 0.501768 | |
BICS (register, 32-bit) 1->2 1.000624 | |
BICS (register, 32-bit) 1->3 1.000150 | |
BICS (register, 32-bit) 4->2 1.005089 | |
BICS (register, 32-bit) 4->3 1.005243 | |
BICS (register, 32-bit) throughput 0.503072 | |
BICS (register, 64-bit) 1->2 1.001224 | |
BICS (register, 64-bit) 1->3 0.998286 | |
BICS (register, 64-bit) 4->2 0.997530 | |
BICS (register, 64-bit) 4->3 0.998876 | |
BICS (register, 64-bit) throughput 0.501400 | |
BICS (register, lsl, 32-bit) 1->2 2.011003 | |
BICS (register, lsl, 32-bit) 1->3 1.999252 | |
BICS (register, lsl, 32-bit) 4->2 2.006848 | |
BICS (register, lsl, 32-bit) 4->3 1.999525 | |
BICS (register, lsl, 32-bit) throughput 0.500864 | |
BICS (register, lsl, 64-bit) 1->2 1.999601 | |
BICS (register, lsl, 64-bit) 1->3 2.003250 | |
BICS (register, lsl, 64-bit) 4->2 1.998103 | |
BICS (register, lsl, 64-bit) 4->3 2.003668 | |
BICS (register, lsl, 64-bit) throughput 0.497650 | |
BICS (register, lsr, 32-bit) 1->2 2.003226 | |
BICS (register, lsr, 32-bit) 1->3 2.003075 | |
BICS (register, lsr, 32-bit) 4->2 2.002899 | |
BICS (register, lsr, 32-bit) 4->3 2.005489 | |
BICS (register, lsr, 32-bit) throughput 0.502021 | |
BICS (register, lsr, 64-bit) 1->2 2.002975 | |
BICS (register, lsr, 64-bit) 1->3 2.009680 | |
BICS (register, lsr, 64-bit) 4->2 2.002724 | |
BICS (register, lsr, 64-bit) 4->3 2.002850 | |
BICS (register, lsr, 64-bit) throughput 0.500945 | |
BICS (register, asr, 32-bit) 1->2 1.998228 | |
BICS (register, asr, 32-bit) 1->3 2.002850 | |
BICS (register, asr, 32-bit) 4->2 2.002775 | |
BICS (register, asr, 32-bit) 4->3 2.002900 | |
BICS (register, asr, 32-bit) throughput 0.499991 | |
BICS (register, asr, 64-bit) 1->2 1.999301 | |
BICS (register, asr, 64-bit) 1->3 2.002900 | |
BICS (register, asr, 64-bit) 4->2 1.997207 | |
BICS (register, asr, 64-bit) 4->3 2.010032 | |
BICS (register, asr, 64-bit) throughput 0.499969 | |
BICS (register, ror, 32-bit) 1->2 1.997904 | |
BICS (register, ror, 32-bit) 1->3 1.997830 | |
BICS (register, ror, 32-bit) 4->2 2.011977 | |
BICS (register, ror, 32-bit) 4->3 1.999527 | |
BICS (register, ror, 32-bit) throughput 0.500025 | |
BICS (register, ror, 64-bit) 1->2 2.011604 | |
BICS (register, ror, 64-bit) 1->3 1.999250 | |
BICS (register, ror, 64-bit) 4->2 2.002650 | |
BICS (register, ror, 64-bit) 4->3 2.020721 | |
BICS (register, ror, 64-bit) throughput 0.499844 | |
BL throughput 1.005465 | |
CBNZ (not taken) throughput 0.501323 | |
CBNZ (taken) throughput 1.006102 | |
CBZ (not taken) throughput 0.502231 | |
CBZ (taken) throughput 1.003983 | |
CCMN (immediate, 32-bit) 2->1 1.002900 | |
CCMN (immediate, 32-bit) 2->2 1.001525 | |
CCMN (immediate, 32-bit) throughput 0.630423 | |
CCMN (immediate, 32-bit) throughput 0.519367 | |
CCMN (immediate, 64-bit) 2->1 0.997829 | |
CCMN (immediate, 64-bit) 2->2 0.999476 | |
CCMN (immediate, 64-bit) throughput 0.631394 | |
CCMN (immediate, 64-bit) throughput 0.518466 | |
CCMN (register, 32-bit) 3->1 1.015423 | |
CCMN (register, 32-bit) 3->2 0.999825 | |
CCMN (register, 32-bit) 3->3 1.001575 | |
CCMN (register, 32-bit) throughput 0.631630 | |
CCMN (register, 32-bit) throughput 0.518700 | |
CCMN (register, 64-bit) 3->1 1.008433 | |
CCMN (register, 64-bit) 3->2 0.997854 | |
CCMN (register, 64-bit) 3->3 1.000350 | |
CCMN (register, 64-bit) throughput 0.631065 | |
CCMN (register, 64-bit) throughput 0.517972 | |
CCMP (immediate, 32-bit) 2->1 0.997579 | |
CCMP (immediate, 32-bit) 2->2 1.001450 | |
CCMP (immediate, 32-bit) throughput 0.631015 | |
CCMP (immediate, 32-bit) throughput 0.519040 | |
CCMP (immediate, 64-bit) 2->1 1.002824 | |
CCMP (immediate, 64-bit) 2->2 1.001800 | |
CCMP (immediate, 64-bit) throughput 0.629861 | |
CCMP (immediate, 64-bit) throughput 0.505524 | |
CCMP (register, 32-bit) 3->1 0.997978 | |
CCMP (register, 32-bit) 3->2 0.998977 | |
CCMP (register, 32-bit) 3->3 0.998752 | |
CCMP (register, 32-bit) throughput 0.631543 | |
CCMP (register, 32-bit) throughput 0.519354 | |
CCMP (register, 64-bit) 3->1 1.002675 | |
CCMP (register, 64-bit) 3->2 1.011399 | |
CCMP (register, 64-bit) 3->3 1.000524 | |
CCMP (register, 64-bit) throughput 0.631163 | |
CCMP (register, 64-bit) throughput 0.520718 | |
CINC (32-bit) 1->2 1.000100 | |
CINC (32-bit) 1->3 0.997480 | |
CINC (32-bit) throughput 0.250062 | |
CINC (64-bit) 1->2 1.001250 | |
CINC (64-bit) 1->3 0.999251 | |
CINC (64-bit) throughput 0.250778 | |
CINV (32-bit) 1->2 1.001900 | |
CINV (32-bit) 1->3 1.002824 | |
CINV (32-bit) throughput 0.250147 | |
CINV (64-bit) 1->2 0.999377 | |
CINV (64-bit) 1->3 0.998029 | |
CINV (64-bit) throughput 0.250806 | |
CLREX throughput 5.012049 | |
CLREX throughput 5.005650 | |
CLS (32-bit) 1->2 1.008857 | |
CLS (32-bit) throughput 0.251572 | |
CLS (64-bit) 1->2 1.000499 | |
CLS (64-bit) throughput 0.250909 | |
CLZ (32-bit) 1->2 1.001150 | |
CLZ (32-bit) throughput 0.250091 | |
CLZ (64-bit) 1->2 0.998876 | |
CLZ (64-bit) throughput 0.248278 | |
CMN (sxtb, 32-bit) 3->1 2.003024 | |
CMN (sxtb, 32-bit) 3->2 1.999026 | |
CMN (sxtb, 32-bit) throughput 0.502303 | |
CMN (sxtb, 64-bit) 3->1 2.003618 | |
CMN (sxtb, 64-bit) 3->2 2.009198 | |
CMN (sxtb, 64-bit) throughput 0.500708 | |
CMN (uxtb, 32-bit) 3->1 0.974359 | |
CMN (uxtb, 32-bit) 3->2 1.008283 | |
CMN (uxtb, 32-bit) throughput 0.293192 | |
CMN (uxtb, 64-bit) 3->1 1.007249 | |
CMN (uxtb, 64-bit) 3->2 1.006548 | |
CMN (uxtb, 64-bit) throughput 0.292180 | |
CMN (sxth, 32-bit) 3->1 2.003941 | |
CMN (sxth, 32-bit) 3->2 2.026015 | |
CMN (sxth, 32-bit) throughput 0.500693 | |
CMN (sxth, 64-bit) 3->1 1.998202 | |
CMN (sxth, 64-bit) 3->2 2.002723 | |
CMN (sxth, 64-bit) throughput 0.500683 | |
CMN (uxth, 32-bit) 3->1 1.004499 | |
CMN (uxth, 32-bit) 3->2 0.997530 | |
CMN (uxth, 32-bit) throughput 0.291944 | |
CMN (uxth, 64-bit) 3->1 1.005917 | |
CMN (uxth, 64-bit) 3->2 1.004474 | |
CMN (uxth, 64-bit) throughput 0.292589 | |
CMN (sxtw, 32-bit) 3->1 0.997830 | |
CMN (sxtw, 32-bit) 3->2 0.998029 | |
CMN (sxtw, 32-bit) throughput 0.292710 | |
CMN (sxtw, 64-bit) 3->1 2.009079 | |
CMN (sxtw, 64-bit) 3->2 2.002775 | |
CMN (sxtw, 64-bit) throughput 0.501884 | |
CMN (uxtw, 32-bit) 3->1 0.998029 | |
CMN (uxtw, 32-bit) 3->2 0.997978 | |
CMN (uxtw, 32-bit) throughput 0.293496 | |
CMN (uxtw, 64-bit) 3->1 1.011302 | |
CMN (uxtw, 64-bit) 3->2 1.012908 | |
CMN (uxtw, 64-bit) throughput 0.292717 | |
CMN (uxtx, 64-bit) 3->1 0.998876 | |
CMN (uxtx, 64-bit) 3->2 0.999551 | |
CMN (uxtx, 64-bit) throughput 0.291955 | |
CMN (sxtx, 64-bit) 3->1 1.008923 | |
CMN (sxtx, 64-bit) 3->2 0.999327 | |
CMN (sxtx, 64-bit) throughput 0.292627 | |
CMN (immediate, 32-bit) 2->1 0.997930 | |
CMN (immediate, 32-bit) throughput 0.293324 | |
CMN (immediate, 64-bit) 2->1 1.005101 | |
CMN (immediate, 64-bit) throughput 0.292095 | |
CMN (shifted immediate, 32-bit) 2->1 0.997380 | |
CMN (shifted immediate, 32-bit) throughput 0.294060 | |
CMN (shifted immediate, 64-bit) 2->1 0.999426 | |
CMN (shifted immediate, 64-bit) throughput 0.291615 | |
CMN (register, 32-bit) 3->1 0.998403 | |
CMN (register, 32-bit) 3->2 1.007790 | |
CMN (register, 32-bit) throughput 0.293121 | |
CMN (register, 64-bit) 3->1 1.002799 | |
CMN (register, 64-bit) 3->2 0.998478 | |
CMN (register, 64-bit) throughput 0.293687 | |
CMN (register, lsl, 32-bit) 3->1 2.003176 | |
CMN (register, lsl, 32-bit) 3->2 2.003349 | |
CMN (register, lsl, 32-bit) throughput 0.500671 | |
CMN (register, lsl, 64-bit) 3->1 2.003350 | |
CMN (register, lsl, 64-bit) 3->2 2.005750 | |
CMN (register, lsl, 64-bit) throughput 0.501875 | |
CMN (register, lsr, 32-bit) 3->1 1.999950 | |
CMN (register, lsr, 32-bit) 3->2 2.002850 | |
CMN (register, lsr, 32-bit) throughput 0.499863 | |
CMN (register, lsr, 64-bit) 3->1 2.003598 | |
CMN (register, lsr, 64-bit) 3->2 2.003450 | |
CMN (register, lsr, 64-bit) throughput 0.500555 | |
CMN (register, asr, 32-bit) 3->1 2.003349 | |
CMN (register, asr, 32-bit) 3->2 2.005217 | |
CMN (register, asr, 32-bit) throughput 0.500953 | |
CMN (register, asr, 64-bit) 3->1 2.027215 | |
CMN (register, asr, 64-bit) 3->2 1.998727 | |
CMN (register, asr, 64-bit) throughput 0.500900 | |
CMP (sxtb, 32-bit) 3->1 1.997878 | |
CMP (sxtb, 32-bit) 3->2 2.009930 | |
CMP (sxtb, 32-bit) throughput 0.502275 | |
CMP (sxtb, 64-bit) 3->1 2.002948 | |
CMP (sxtb, 64-bit) 3->2 1.997855 | |
CMP (sxtb, 64-bit) throughput 0.500865 | |
CMP (uxtb, 32-bit) 3->1 0.999625 | |
CMP (uxtb, 32-bit) 3->2 0.999376 | |
CMP (uxtb, 32-bit) throughput 0.295216 | |
CMP (uxtb, 64-bit) 3->1 0.998003 | |
CMP (uxtb, 64-bit) 3->2 1.013148 | |
CMP (uxtb, 64-bit) throughput 0.293722 | |
CMP (sxth, 32-bit) 3->1 2.006311 | |
CMP (sxth, 32-bit) 3->2 2.015798 | |
CMP (sxth, 32-bit) throughput 0.501815 | |
CMP (sxth, 64-bit) 3->1 1.997879 | |
CMP (sxth, 64-bit) 3->2 1.999226 | |
CMP (sxth, 64-bit) throughput 0.501350 | |
CMP (uxth, 32-bit) 3->1 1.005941 | |
CMP (uxth, 32-bit) 3->2 0.999252 | |
CMP (uxth, 32-bit) throughput 0.292656 | |
CMP (uxth, 64-bit) 3->1 1.002950 | |
CMP (uxth, 64-bit) 3->2 1.002950 | |
CMP (uxth, 64-bit) throughput 0.292058 | |
CMP (sxtw, 32-bit) 3->1 1.007809 | |
CMP (sxtw, 32-bit) 3->2 1.001423 | |
CMP (sxtw, 32-bit) throughput 0.292150 | |
CMP (sxtw, 64-bit) 3->1 2.004115 | |
CMP (sxtw, 64-bit) 3->2 2.005474 | |
CMP (sxtw, 64-bit) throughput 0.501884 | |
CMP (uxtw, 32-bit) 3->1 0.998129 | |
CMP (uxtw, 32-bit) 3->2 1.002775 | |
CMP (uxtw, 32-bit) throughput 0.292259 | |
CMP (uxtw, 64-bit) 3->1 1.003075 | |
CMP (uxtw, 64-bit) 3->2 0.998078 | |
CMP (uxtw, 64-bit) throughput 0.292053 | |
CMP (uxtx, 64-bit) 3->1 0.997980 | |
CMP (uxtx, 64-bit) 3->2 1.005838 | |
CMP (uxtx, 64-bit) throughput 0.292599 | |
CMP (sxtx, 64-bit) 3->1 1.002950 | |
CMP (sxtx, 64-bit) 3->2 1.002874 | |
CMP (sxtx, 64-bit) throughput 0.293237 | |
CMP (immediate, 32-bit) 2->1 0.999151 | |
CMP (immediate, 32-bit) throughput 0.292355 | |
CMP (immediate, 64-bit) 2->1 1.005543 | |
CMP (immediate, 64-bit) throughput 0.292698 | |
CMP (shifted immediate, 32-bit) 2->1 0.999352 | |
CMP (shifted immediate, 32-bit) throughput 0.292642 | |
CMP (shifted immediate, 64-bit) 2->1 1.006589 | |
CMP (shifted immediate, 64-bit) throughput 0.294072 | |
CMP (register, 32-bit) 3->1 0.999650 | |
CMP (register, 32-bit) 3->2 0.997888 | |
CMP (register, 32-bit) throughput 0.292622 | |
CMP (register, 64-bit) 3->1 1.000896 | |
CMP (register, 64-bit) 3->2 0.997331 | |
CMP (register, 64-bit) throughput 0.292715 | |
CMP (register, lsl, 32-bit) 3->1 1.997504 | |
CMP (register, lsl, 32-bit) 3->2 2.010810 | |
CMP (register, lsl, 32-bit) throughput 0.501080 | |
CMP (register, lsl, 64-bit) 3->1 2.002650 | |
CMP (register, lsl, 64-bit) 3->2 1.999376 | |
CMP (register, lsl, 64-bit) throughput 0.501568 | |
CMP (register, lsr, 32-bit) 3->1 2.014695 | |
CMP (register, lsr, 32-bit) 3->2 1.998801 | |
CMP (register, lsr, 32-bit) throughput 0.500714 | |
CMP (register, lsr, 64-bit) 3->1 1.998403 | |
CMP (register, lsr, 64-bit) 3->2 1.999000 | |
CMP (register, lsr, 64-bit) throughput 0.500840 | |
CMP (register, asr, 32-bit) 3->1 2.006361 | |
CMP (register, asr, 32-bit) 3->2 1.999202 | |
CMP (register, asr, 32-bit) throughput 0.500901 | |
CMP (register, asr, 64-bit) 3->1 1.999425 | |
CMP (register, asr, 64-bit) 3->2 2.003500 | |
CMP (register, asr, 64-bit) throughput 0.503881 | |
CNEG (32-bit) 1->2 1.001824 | |
CNEG (32-bit) 1->3 1.003768 | |
CNEG (32-bit) throughput 0.250343 | |
CNEG (64-bit) 1->2 1.001350 | |
CNEG (64-bit) 1->3 1.003849 | |
CNEG (64-bit) throughput 0.251497 | |
CSDB throughput 0.203890 | |
CSEL (32-bit) 1->2 1.000924 | |
CSEL (32-bit) 1->3 1.001423 | |
CSEL (32-bit) 1->4 1.010389 | |
CSEL (32-bit) throughput 0.262741 | |
CSEL (64-bit) 1->2 1.403981 | |
CSEL (64-bit) 1->3 0.998168 | |
CSEL (64-bit) 1->4 0.993724 | |
CSEL (64-bit) throughput 0.250844 | |
CSET (32-bit) 1->2 0.999600 | |
CSET (32-bit) throughput 0.250362 | |
CSET (64-bit) 1->2 0.998877 | |
CSET (64-bit) throughput 0.250964 | |
CSETM (32-bit) 1->2 0.998777 | |
CSETM (32-bit) throughput 0.250750 | |
CSETM (64-bit) 1->2 1.007863 | |
CSETM (64-bit) throughput 0.250744 | |
CSINC (32-bit) 1->2 1.000999 | |
CSINC (32-bit) 1->3 1.000775 | |
CSINC (32-bit) 1->4 0.999177 | |
CSINC (32-bit) throughput 0.252337 | |
CSINC (64-bit) 1->2 1.001825 | |
CSINC (64-bit) 1->3 1.001850 | |
CSINC (64-bit) 1->4 1.002800 | |
CSINC (64-bit) throughput 0.251876 | |
CSINV (32-bit) 1->2 0.998851 | |
CSINV (32-bit) 1->3 0.998827 | |
CSINV (32-bit) 1->4 0.999925 | |
CSINV (32-bit) throughput 0.249922 | |
CSINV (64-bit) 1->2 0.998203 | |
CSINV (64-bit) 1->3 1.001775 | |
CSINV (64-bit) 1->4 1.003343 | |
CSINV (64-bit) throughput 0.250797 | |
CSNEG (32-bit) 1->2 0.999075 | |
CSNEG (32-bit) 1->3 1.000100 | |
CSNEG (32-bit) 1->4 1.013328 | |
CSNEG (32-bit) throughput 0.251188 | |
CSNEG (64-bit) 1->2 1.001850 | |
CSNEG (64-bit) 1->3 1.001349 | |
CSNEG (64-bit) 1->4 1.007264 | |
CSNEG (64-bit) throughput 0.250784 | |
DMB (SY) throughput 4.019746 | |
DMB (ST) throughput 3.997205 | |
DMB (LD) throughput 3.975467 | |
DMB (ISH) throughput 3.974113 | |
DMB (ISHST) throughput 3.997902 | |
DMB (ISHLD) throughput 3.986851 | |
DMB (NSH) throughput 4.011021 | |
DMB (NSHST) throughput 4.014446 | |
DMB (NSHLD) throughput 4.003493 | |
DMB (OSH) throughput 4.002045 | |
DMB (OSHST) throughput 4.001422 | |
DMB (OSHLD) throughput 4.023491 | |
DSB (SY) throughput 16.045199 | |
DSB (ST) throughput 16.024004 | |
DSB (LD) throughput 16.011502 | |
DSB (ISH) throughput 15.950617 | |
DSB (ISHST) throughput 16.022783 | |
DSB (ISHLD) throughput 16.022867 | |
DSB (NSH) throughput 16.029418 | |
DSB (NSHST) throughput 16.002994 | |
DSB (NSHLD) throughput 16.006345 | |
DSB (OSH) throughput 16.018524 | |
DSB (OSHST) throughput 16.027838 | |
DSB (OSHLD) throughput 16.049716 | |
EON (register, 32-bit) 1->2 0.998253 | |
EON (register, 32-bit) 1->3 1.002051 | |
EON (register, 32-bit) throughput 0.250356 | |
EON (register, 64-bit) 1->2 1.000050 | |
EON (register, 64-bit) 1->3 1.000250 | |
EON (register, 64-bit) throughput 0.250909 | |
EON (register, lsl, 32-bit) 1->2 2.003550 | |
EON (register, lsl, 32-bit) 1->3 2.003750 | |
EON (register, lsl, 32-bit) throughput 0.501207 | |
EON (register, lsl, 64-bit) 1->2 1.999377 | |
EON (register, lsl, 64-bit) 1->3 2.006587 | |
EON (register, lsl, 64-bit) throughput 0.502593 | |
EON (register, lsr, 32-bit) 1->2 1.999325 | |
EON (register, lsr, 32-bit) 1->3 2.002824 | |
EON (register, lsr, 32-bit) throughput 0.500844 | |
EON (register, lsr, 64-bit) 1->2 2.004418 | |
EON (register, lsr, 64-bit) 1->3 2.010007 | |
EON (register, lsr, 64-bit) throughput 0.501241 | |
EON (register, asr, 32-bit) 1->2 2.005390 | |
EON (register, asr, 32-bit) 1->3 2.003175 | |
EON (register, asr, 32-bit) throughput 0.500184 | |
EON (register, asr, 64-bit) 1->2 2.004966 | |
EON (register, asr, 64-bit) 1->3 1.999151 | |
EON (register, asr, 64-bit) throughput 0.501491 | |
EON (register, ror, 32-bit) 1->2 1.997355 | |
EON (register, ror, 32-bit) 1->3 1.997854 | |
EON (register, ror, 32-bit) throughput 0.502700 | |
EON (register, ror, 64-bit) 1->2 1.999201 | |
EON (register, ror, 64-bit) 1->3 2.007165 | |
EON (register, ror, 64-bit) throughput 0.500843 | |
EOR (immediate, 32-bit) 1->2 1.001825 | |
EOR (immediate, 32-bit) throughput 0.251678 | |
EOR (immediate, 64-bit) 1->2 0.999376 | |
EOR (immediate, 64-bit) throughput 0.250371 | |
EOR (register, 32-bit) 1->2 0.998129 | |
EOR (register, 32-bit) 1->3 0.999650 | |
EOR (register, 32-bit) throughput 0.251932 | |
EOR (register, 64-bit) 1->2 1.001200 | |
EOR (register, 64-bit) 1->3 0.998826 | |
EOR (register, 64-bit) throughput 0.250869 | |
EOR (register, lsl, 32-bit) 1->2 2.002649 | |
EOR (register, lsl, 32-bit) 1->3 2.003125 | |
EOR (register, lsl, 32-bit) throughput 0.501360 | |
EOR (register, lsl, 64-bit) 1->2 1.998103 | |
EOR (register, lsl, 64-bit) 1->3 1.999350 | |
EOR (register, lsl, 64-bit) throughput 0.501497 | |
EOR (register, lsr, 32-bit) 1->2 2.003149 | |
EOR (register, lsr, 32-bit) 1->3 1.997554 | |
EOR (register, lsr, 32-bit) throughput 0.503515 | |
EOR (register, lsr, 64-bit) 1->2 2.005992 | |
EOR (register, lsr, 64-bit) 1->3 2.005664 | |
EOR (register, lsr, 64-bit) throughput 0.500887 | |
EOR (register, asr, 32-bit) 1->2 2.002625 | |
EOR (register, asr, 32-bit) 1->3 1.998976 | |
EOR (register, asr, 32-bit) throughput 0.501519 | |
EOR (register, asr, 64-bit) 1->2 2.003550 | |
EOR (register, asr, 64-bit) 1->3 2.003499 | |
EOR (register, asr, 64-bit) throughput 0.500406 | |
EOR (register, ror, 32-bit) 1->2 2.003600 | |
EOR (register, ror, 32-bit) 1->3 2.017399 | |
EOR (register, ror, 32-bit) throughput 0.499997 | |
EOR (register, ror, 64-bit) 1->2 1.999250 | |
EOR (register, ror, 64-bit) 1->3 2.002376 | |
EOR (register, ror, 64-bit) throughput 0.501550 | |
EXTR (register, 32-bit) 1->2 1.000674 | |
EXTR (register, 32-bit) 1->3 2.008863 | |
EXTR (register, 32-bit) throughput 1.002214 | |
EXTR (register, 64-bit) 1->2 1.001250 | |
EXTR (register, 64-bit) 1->3 1.999375 | |
EXTR (register, 64-bit) throughput 1.001978 | |
ISB (SY) throughput 25.067329 | |
LDNP (32-bit) 1->3 (with chain penalty) 4.030096 | |
LDNP (32-bit) 2->3 (with chain penalty) 4.063344 | |
LDNP (32-bit) throughput 0.501579 | |
LDNP (64-bit) 1->3 (with chain penalty) 4.015050 | |
LDNP (64-bit) 2->3 (with chain penalty) 4.016775 | |
LDNP (64-bit) throughput 0.502062 | |
LDP (32-bit) 1->3 (with chain penalty) 4.001899 | |
LDP (32-bit) 2->3 (with chain penalty) 4.007611 | |
LDP (32-bit) throughput 0.501006 | |
LDP (64-bit) 1->3 (with chain penalty) 4.017296 | |
LDP (64-bit) 2->3 (with chain penalty) 4.013423 | |
LDP (64-bit) throughput 0.498423 | |
LDP (post-index, 32-bit) 1->3 (with chain penalty) 4.141027 | |
LDP (post-index, 32-bit) 2->3 (with chain penalty) 4.395065 | |
LDP (post-index, 32-bit) throughput 0.820692 | |
LDP (post-index, 64-bit) 1->3 (with chain penalty) 7.154856 | |
LDP (post-index, 64-bit) 2->3 (with chain penalty) 4.799932 | |
LDP (post-index, 64-bit) throughput 0.766868 | |
LDP (pre-index, 32-bit) 1->3 (with chain penalty) 4.043588 | |
LDP (pre-index, 32-bit) 2->3 (with chain penalty) 4.015238 | |
LDP (pre-index, 32-bit) throughput 0.757041 | |
LDP (pre-index, 64-bit) 1->3 (with chain penalty) 4.441302 | |
LDP (pre-index, 64-bit) 2->3 (with chain penalty) 4.051144 | |
LDP (pre-index, 64-bit) throughput 0.768783 | |
LDP (signed offset, 32-bit) 1->3 (with chain penalty) 4.005763 | |
LDP (signed offset, 32-bit) 2->3 (with chain penalty) 4.005661 | |
LDP (signed offset, 32-bit) throughput 0.502540 | |
LDP (signed offset, 64-bit) 1->3 (with chain penalty) 4.009807 | |
LDP (signed offset, 64-bit) 2->3 (with chain penalty) 4.005790 | |
LDP (signed offset, 64-bit) throughput 0.500922 | |
LDPSW (post-index) 1->3 (with chain penalty) 4.072938 | |
LDPSW (post-index) 2->3 (with chain penalty) 4.035088 | |
LDPSW (post-index) throughput 0.755173 | |
LDPSW (pre-index) 1->3 (with chain penalty) 4.070140 | |
LDPSW (pre-index) 2->3 (with chain penalty) 3.968215 | |
LDPSW (pre-index) throughput 0.756324 | |
LDPSW (signed offset) 1->3 (with chain penalty) 4.010297 | |
LDPSW (signed offset) 2->3 (with chain penalty) 4.007974 | |
LDPSW (signed offset) throughput 0.500834 | |
LDR (32-bit) 1->2 (with chain penalty) 4.017048 | |
LDR (32-bit) throughput 0.500134 | |
LDR (64-bit) 1->2 (with chain penalty) 4.021792 | |
LDR (64-bit) throughput 0.501223 | |
LDR (post-index, 32-bit) 1->2 (with chain penalty) 4.334915 | |
LDR (post-index, 32-bit) throughput 0.580015 | |
LDR (post-index, 64-bit) 1->2 (with chain penalty) 4.245224 | |
LDR (post-index, 64-bit) throughput 0.561074 | |
LDR (pre-index, 32-bit) 1->2 (with chain penalty) 4.082148 | |
LDR (pre-index, 32-bit) throughput 0.553236 | |
LDR (pre-index, 64-bit) 1->2 (with chain penalty) 4.185985 | |
LDR (pre-index, 64-bit) throughput 0.559594 | |
LDR (unsigned offset, 32-bit) 1->2 (with chain penalty) 4.026397 | |
LDR (unsigned offset, 32-bit) throughput 0.502865 | |
LDR (unsigned offset, 64-bit) 1->2 (with chain penalty) 4.008151 | |
LDR (unsigned offset, 64-bit) throughput 0.501223 | |
LDR (literal, 32-bit) throughput 0.499729 | |
LDR (literal, 64-bit) throughput 0.503729 | |
LDR (register, 32-bit) 1->2 (with chain penalty) 4.006511 | |
LDR (register, 32-bit) 1->3 (with chain penalty) 4.007987 | |
LDR (register, 32-bit) throughput 0.501176 | |
LDR (register, 64-bit) 1->2 (with chain penalty) 4.031471 | |
LDR (register, 64-bit) 1->3 (with chain penalty) 4.017546 | |
LDR (register, 64-bit) throughput 0.499794 | |
LDR (register, uxtw, 32-bit) 1->2 (with chain penalty) 4.026899 | |
LDR (register, uxtw, 32-bit) 1->3 (with chain penalty) 4.027169 | |
LDR (register, uxtw, 32-bit) throughput 0.499738 | |
LDR (register, uxtw, 64-bit) 1->2 (with chain penalty) 4.020587 | |
LDR (register, uxtw, 64-bit) 1->3 (with chain penalty) 4.020409 | |
LDR (register, uxtw, 64-bit) throughput 0.501560 | |
LDR (register, sxtw, 32-bit) 1->2 (with chain penalty) 4.004694 | |
LDR (register, sxtw, 32-bit) 1->3 (with chain penalty) 3.928103 | |
LDR (register, sxtw, 32-bit) throughput 0.499094 | |
LDR (register, sxtw, 64-bit) 1->2 (with chain penalty) 4.011432 | |
LDR (register, sxtw, 64-bit) 1->3 (with chain penalty) 4.020095 | |
LDR (register, sxtw, 64-bit) throughput 0.501357 | |
LDR (register, lsl, 32-bit) 1->2 (with chain penalty) 4.071491 | |
LDR (register, lsl, 32-bit) 1->3 (with chain penalty) 4.024270 | |
LDR (register, lsl, 32-bit) throughput 0.503504 | |
LDR (register, lsl, 64-bit) 1->2 (with chain penalty) 4.006266 | |
LDR (register, lsl, 64-bit) 1->3 (with chain penalty) 4.007374 | |
LDR (register, lsl, 64-bit) throughput 0.500668 | |
LDRB 1->2 (with chain penalty) 4.014158 | |
LDRB throughput 0.499869 | |
LDRB (post-index) 1->2 (with chain penalty) 4.316805 | |
LDRB (post-index) throughput 0.591563 | |
LDRB (pre-index) 1->2 (with chain penalty) 4.162812 | |
LDRB (pre-index) throughput 0.572155 | |
LDRB (unsigned offset) 1->2 (with chain penalty) 4.009531 | |
LDRB (unsigned offset) throughput 0.502548 | |
LDRB (register) 1->2 (with chain penalty) 4.003618 | |
LDRB (register) 1->3 (with chain penalty) 4.017565 | |
LDRB (register) throughput 0.501220 | |
LDRB (register, uxtw) 1->2 (with chain penalty) 4.021651 | |
LDRB (register, uxtw) 1->3 (with chain penalty) 4.005988 | |
LDRB (register, uxtw) throughput 0.499719 | |
LDRB (register, sxtw) 1->2 (with chain penalty) 4.007448 | |
LDRB (register, sxtw) 1->3 (with chain penalty) 3.944590 | |
LDRB (register, sxtw) throughput 0.500109 | |
LDRH 1->2 (with chain penalty) 4.012178 | |
LDRH throughput 0.500621 | |
LDRH (post-index) 1->2 (with chain penalty) 4.090203 | |
LDRH (post-index) throughput 0.560347 | |
LDRH (pre-index) 1->2 (with chain penalty) 4.558019 | |
LDRH (pre-index) throughput 0.580368 | |
LDRH (unsigned offset) 1->2 (with chain penalty) 3.999175 | |
LDRH (unsigned offset) throughput 0.501345 | |
LDRH (register) 1->2 (with chain penalty) 4.012034 | |
LDRH (register) 1->3 (with chain penalty) 4.017408 | |
LDRH (register) throughput 0.499869 | |
LDRH (register, uxtw) 1->2 (with chain penalty) 4.005716 | |
LDRH (register, uxtw) 1->3 (with chain penalty) 4.000771 | |
LDRH (register, uxtw) throughput 0.501997 | |
LDRH (register, sxtw) 1->2 (with chain penalty) 4.031594 | |
LDRH (register, sxtw) 1->3 (with chain penalty) 3.998403 | |
LDRH (register, sxtw) throughput 0.500452 | |
LDRH (register, lsl) 1->2 (with chain penalty) 3.994460 | |
LDRH (register, lsl) 1->3 (with chain penalty) 4.008461 | |
LDRH (register, lsl) throughput 0.499991 | |
LDRSB (32-bit) 1->2 (with chain penalty) 4.004769 | |
LDRSB (32-bit) throughput 0.500976 | |
LDRSB (64-bit) 1->2 (with chain penalty) 4.010871 | |
LDRSB (64-bit) throughput 0.501706 | |
LDRSB (post-index, 32-bit) 1->2 (with chain penalty) 4.171054 | |
LDRSB (post-index, 32-bit) throughput 0.599933 | |
LDRSB (post-index, 64-bit) 1->2 (with chain penalty) 4.557743 | |
LDRSB (post-index, 64-bit) throughput 0.604768 | |
LDRSB (pre-index, 32-bit) 1->2 (with chain penalty) 4.556681 | |
LDRSB (pre-index, 32-bit) throughput 0.605927 | |
LDRSB (pre-index, 64-bit) 1->2 (with chain penalty) 4.447756 | |
LDRSB (pre-index, 64-bit) throughput 0.597689 | |
LDRSB (unsigned offset, 32-bit) 1->2 (with chain penalty) 4.008184 | |
LDRSB (unsigned offset, 32-bit) throughput 0.500124 | |
LDRSB (unsigned offset, 64-bit) 1->2 (with chain penalty) 3.993088 | |
LDRSB (unsigned offset, 64-bit) throughput 0.501294 | |
LDRSB (register, 32-bit) 1->2 (with chain penalty) 4.018748 | |
LDRSB (register, 32-bit) 1->3 (with chain penalty) 4.003118 | |
LDRSB (register, 32-bit) throughput 0.499716 | |
LDRSB (register, 64-bit) 1->2 (with chain penalty) 4.008370 | |
LDRSB (register, 64-bit) 1->3 (with chain penalty) 4.025615 | |
LDRSB (register, 64-bit) throughput 0.497757 | |
LDRSB (register, uxtw, 32-bit) 1->2 (with chain penalty) 4.014577 | |
LDRSB (register, uxtw, 32-bit) 1->3 (with chain penalty) 4.018925 | |
LDRSB (register, uxtw, 32-bit) throughput 0.502091 | |
LDRSB (register, uxtw, 64-bit) 1->2 (with chain penalty) 4.005093 | |
LDRSB (register, uxtw, 64-bit) 1->3 (with chain penalty) 4.007334 | |
LDRSB (register, uxtw, 64-bit) throughput 0.501025 | |
LDRSB (register, sxtw, 32-bit) 1->2 (with chain penalty) 4.009398 | |
LDRSB (register, sxtw, 32-bit) 1->3 (with chain penalty) 3.914832 | |
LDRSB (register, sxtw, 32-bit) throughput 0.500056 | |
LDRSB (register, sxtw, 64-bit) 1->2 (with chain penalty) 4.029021 | |
LDRSB (register, sxtw, 64-bit) 1->3 (with chain penalty) 4.021049 | |
LDRSB (register, sxtw, 64-bit) throughput 0.500622 | |
LDRSH (32-bit) 1->2 (with chain penalty) 4.009380 | |
LDRSH (32-bit) throughput 0.501507 | |
LDRSH (64-bit) 1->2 (with chain penalty) 4.006037 | |
LDRSH (64-bit) throughput 0.500979 | |
LDRSH (post-index, 32-bit) 1->2 (with chain penalty) 4.247946 | |
LDRSH (post-index, 32-bit) throughput 0.587233 | |
LDRSH (post-index, 64-bit) 1->2 (with chain penalty) 4.618051 | |
LDRSH (post-index, 64-bit) throughput 0.617256 | |
LDRSH (pre-index, 32-bit) 1->2 (with chain penalty) 4.492430 | |
LDRSH (pre-index, 32-bit) throughput 0.555038 | |
LDRSH (pre-index, 64-bit) 1->2 (with chain penalty) 4.494413 | |
LDRSH (pre-index, 64-bit) throughput 0.598118 | |
LDRSH (unsigned offset, 32-bit) 1->2 (with chain penalty) 4.031631 | |
LDRSH (unsigned offset, 32-bit) throughput 0.501308 | |
LDRSH (unsigned offset, 64-bit) 1->2 (with chain penalty) 4.008537 | |
LDRSH (unsigned offset, 64-bit) throughput 0.500528 | |
LDRSH (register, 32-bit) 1->2 (with chain penalty) 4.006938 | |
LDRSH (register, 32-bit) 1->3 (with chain penalty) 4.003145 | |
LDRSH (register, 32-bit) throughput 0.499856 | |
LDRSH (register, 64-bit) 1->2 (with chain penalty) 4.016650 | |
LDRSH (register, 64-bit) 1->3 (with chain penalty) 4.012051 | |
LDRSH (register, 64-bit) throughput 0.500724 | |
LDRSH (register, uxtw, 32-bit) 1->2 (with chain penalty) 4.000874 | |
LDRSH (register, uxtw, 32-bit) 1->3 (with chain penalty) 4.008335 | |
LDRSH (register, uxtw, 32-bit) throughput 0.503037 | |
LDRSH (register, uxtw, 64-bit) 1->2 (with chain penalty) 4.014573 | |
LDRSH (register, uxtw, 64-bit) 1->3 (with chain penalty) 4.029748 | |
LDRSH (register, uxtw, 64-bit) throughput 0.499732 | |
LDRSH (register, sxtw, 32-bit) 1->2 (with chain penalty) 4.004316 | |
LDRSH (register, sxtw, 32-bit) 1->3 (with chain penalty) 3.923781 | |
LDRSH (register, sxtw, 32-bit) throughput 0.500647 | |
LDRSH (register, sxtw, 64-bit) 1->2 (with chain penalty) 4.064896 | |
LDRSH (register, sxtw, 64-bit) 1->3 (with chain penalty) 3.998374 | |
LDRSH (register, sxtw, 64-bit) throughput 0.500537 | |
LDRSH (register, lsl, 32-bit) 1->2 (with chain penalty) 4.018027 | |
LDRSH (register, lsl, 32-bit) 1->3 (with chain penalty) 4.005689 | |
LDRSH (register, lsl, 32-bit) throughput 0.501001 | |
LDRSH (register, lsl, 64-bit) 1->2 (with chain penalty) 3.999303 | |
LDRSH (register, lsl, 64-bit) 1->3 (with chain penalty) 4.009365 | |
LDRSH (register, lsl, 64-bit) throughput 0.500166 | |
LDRSW 1->2 (with chain penalty) 3.999899 | |
LDRSW throughput 0.499778 | |
LDRSW (post-index) 1->2 (with chain penalty) 4.413461 | |
LDRSW (post-index) throughput 0.594854 | |
LDRSW (pre-index) 1->2 (with chain penalty) 4.594317 | |
LDRSW (pre-index) throughput 0.609555 | |
LDRSW (unsigned offset) 1->2 (with chain penalty) 4.009487 | |
LDRSW (unsigned offset) throughput 0.501868 | |
LDRSW (literal) throughput 0.500390 | |
LDRSW (register) 1->2 (with chain penalty) 4.013461 | |
LDRSW (register) 1->3 (with chain penalty) 4.258063 | |
LDRSW (register) throughput 0.549595 | |
LDRSW (register, uxtw) 1->2 (with chain penalty) 3.983973 | |
LDRSW (register, uxtw) 1->3 (with chain penalty) 4.080187 | |
LDRSW (register, uxtw) throughput 0.500647 | |
LDRSW (register, sxtw) 1->2 (with chain penalty) 4.036974 | |
LDRSW (register, sxtw) 1->3 (with chain penalty) 4.036920 | |
LDRSW (register, sxtw) throughput 0.501317 | |
LDRSW (register, lsl) 1->2 (with chain penalty) 4.009140 | |
LDRSW (register, lsl) 1->3 (with chain penalty) 3.998153 | |
LDRSW (register, lsl) throughput 0.499916 | |
LDUR (32-bit) 1->2 (with chain penalty) 4.017552 | |
LDUR (32-bit) throughput 0.499384 | |
LDUR (64-bit) 1->2 (with chain penalty) 4.010527 | |
LDUR (64-bit) throughput 0.500430 | |
LDURB 1->2 (with chain penalty) 4.018989 | |
LDURB throughput 0.500309 | |
LDURH 1->2 (with chain penalty) 3.999677 | |
LDURH throughput 0.500578 | |
LDURSB (32-bit) 1->2 (with chain penalty) 3.994110 | |
LDURSB (32-bit) throughput 0.500412 | |
LDURSB (64-bit) 1->2 (with chain penalty) 4.026342 | |
LDURSB (64-bit) throughput 0.501109 | |
LDURSH (32-bit) 1->2 (with chain penalty) 4.005794 | |
LDURSH (32-bit) throughput 0.497604 | |
LDURSH (64-bit) 1->2 (with chain penalty) 4.018915 | |
LDURSH (64-bit) throughput 0.500368 | |
LDURSW 1->2 (with chain penalty) 4.032879 | |
LDURSW throughput 0.501715 | |
LSL (immediate, 32-bit) 1->2 1.001225 | |
LSL (immediate, 32-bit) throughput 0.250781 | |
LSL (immediate, 64-bit) 1->2 0.998980 | |
LSL (immediate, 64-bit) throughput 0.250859 | |
LSL (register, 32-bit) 1->2 1.001824 | |
LSL (register, 32-bit) 1->3 1.002199 | |
LSL (register, 32-bit) throughput 0.250325 | |
LSL (register, 64-bit) 1->2 1.002097 | |
LSL (register, 64-bit) 1->3 0.998826 | |
LSL (register, 64-bit) throughput 0.250143 | |
LSR (immediate, 32-bit) 1->2 1.001475 | |
LSR (immediate, 32-bit) throughput 0.250318 | |
LSR (immediate, 64-bit) 1->2 0.999226 | |
LSR (immediate, 64-bit) throughput 0.250947 | |
LSR (register, 32-bit) 1->2 1.000449 | |
LSR (register, 32-bit) 1->3 1.001800 | |
LSR (register, 32-bit) throughput 0.250524 | |
LSR (register, 64-bit) 1->2 1.003221 | |
LSR (register, 64-bit) 1->3 1.002246 | |
LSR (register, 64-bit) throughput 0.250318 | |
MADD (32-bit) 1->2 3.002074 | |
MADD (32-bit) 1->3 2.999299 | |
MADD (32-bit) 1->4 1.000125 | |
MADD (32-bit) throughput 1.002728 | |
MADD (64-bit) 1->2 3.016174 | |
MADD (64-bit) 1->3 3.006840 | |
MADD (64-bit) 1->4 1.001450 | |
MADD (64-bit) throughput 1.004038 | |
MNEG (32-bit) 1->2 2.997455 | |
MNEG (32-bit) 1->3 3.008650 | |
MNEG (32-bit) throughput 0.996317 | |
MNEG (64-bit) 1->2 3.014297 | |
MNEG (64-bit) 1->3 3.011079 | |
MNEG (64-bit) throughput 1.005815 | |
MOV (bitmask immediate, 32-bit) throughput 0.251603 | |
MOV (bitmask immediate, 64-bit) throughput 0.251812 | |
MOV (from sp, 32-bit) throughput 0.251258 | |
MOV (from sp, 64-bit) throughput 0.250719 | |
MOVK (32-bit) 1->1 1.000924 | |
MOVK (32-bit) throughput 0.252103 | |
MOVK (64-bit) 1->1 1.007164 | |
MOVK (64-bit) throughput 0.250325 | |
MOVN (32-bit) throughput 0.250225 | |
MOVN (64-bit) throughput 0.251223 | |
MOVZ (32-bit) throughput 0.251697 | |
MOVZ (64-bit) throughput 0.250246 | |
MRS (CNTFRQ_EL0) throughput 16.060659 | |
MRS (CNTPCT_EL0) throughput 1.250827 | |
MRS (CNTVCT_EL0) throughput 1.254877 | |
MRS (DCZID_EL0) throughput 1.253462 | |
MRS (FPCR) throughput 1.250896 | |
MRS (FPSR) throughput 7.027045 | |
MRS (NZCV) throughput 0.251500 | |
MRS (TPIDRRO_EL0) throughput 0.751328 | |
MRS (TPIDR_EL0) throughput 0.751586 | |
MSR (FPCR) throughput 8.011875 | |
MSR (FPSR) throughput 10.013823 | |
MSR (TPIDR_EL0) throughput 10.007260 | |
MSR (NZCV) throughput 0.292710 | |
MSUB (32-bit) 1->2 2.993969 | |
MSUB (32-bit) 1->3 2.999301 | |
MSUB (32-bit) 1->4 1.000724 | |
MSUB (32-bit) throughput 1.001202 | |
MSUB (64-bit) 1->2 3.003200 | |
MSUB (64-bit) 1->3 3.011005 | |
MSUB (64-bit) 1->4 0.999252 | |
MSUB (64-bit) throughput 1.001960 | |
MUL (32-bit) 1->2 3.012875 | |
MUL (32-bit) 1->3 3.003150 | |
MUL (32-bit) throughput 1.001591 | |
MUL (64-bit) 1->2 3.003292 | |
MUL (64-bit) 1->3 3.006587 | |
MUL (64-bit) throughput 1.000786 | |
MVN (register, 32-bit) 1->2 0.998777 | |
MVN (register, 32-bit) throughput 0.250334 | |
MVN (register, 64-bit) 1->2 0.999975 | |
MVN (register, 64-bit) throughput 0.250837 | |
MVN (register, lsl, 32-bit) 1->2 1.999301 | |
MVN (register, lsl, 32-bit) throughput 0.501083 | |
MVN (register, lsl, 64-bit) 1->2 1.997556 | |
MVN (register, lsl, 64-bit) throughput 0.500580 | |
MVN (register, lsr, 32-bit) 1->2 1.980332 | |
MVN (register, lsr, 32-bit) throughput 0.500834 | |
MVN (register, lsr, 64-bit) 1->2 2.003549 | |
MVN (register, lsr, 64-bit) throughput 0.500552 | |
MVN (register, asr, 32-bit) 1->2 1.998078 | |
MVN (register, asr, 32-bit) throughput 0.500881 | |
MVN (register, asr, 64-bit) 1->2 2.002850 | |
MVN (register, asr, 64-bit) throughput 0.502186 | |
MVN (register, ror, 32-bit) 1->2 1.999401 | |
MVN (register, ror, 32-bit) throughput 0.499875 | |
MVN (register, ror, 64-bit) 1->2 2.009107 | |
MVN (register, ror, 64-bit) throughput 0.501687 | |
NEG (register, 32-bit) 1->2 0.998801 | |
NEG (register, 32-bit) throughput 0.250787 | |
NEG (register, 64-bit) 1->2 0.998802 | |
NEG (register, 64-bit) throughput 0.250781 | |
NEG (register, lsl, 32-bit) 1->2 1.998004 | |
NEG (register, lsl, 32-bit) throughput 0.501202 | |
NEG (register, lsl, 64-bit) 1->2 1.999051 | |
NEG (register, lsl, 64-bit) throughput 0.500059 | |
NEG (register, lsr, 32-bit) 1->2 2.003675 | |
NEG (register, lsr, 32-bit) throughput 0.502004 | |
NEG (register, lsr, 64-bit) 1->2 2.011361 | |
NEG (register, lsr, 64-bit) throughput 0.500075 | |
NEG (register, asr, 32-bit) 1->2 2.000623 | |
NEG (register, asr, 32-bit) throughput 0.500861 | |
NEG (register, asr, 64-bit) 1->2 1.998901 | |
NEG (register, asr, 64-bit) throughput 0.501204 | |
NEGS (register, 32-bit) 1->2 1.001850 | |
NEGS (register, 32-bit) 3->2 0.999300 | |
NEGS (register, 32-bit) throughput 0.500156 | |
NEGS (register, 64-bit) 1->2 1.000824 | |
NEGS (register, 64-bit) 3->2 1.017573 | |
NEGS (register, 64-bit) throughput 0.500823 | |
NEGS (register, lsl, 32-bit) 1->2 2.000274 | |
NEGS (register, lsl, 32-bit) 3->2 2.006937 | |
NEGS (register, lsl, 32-bit) throughput 0.501332 | |
NEGS (register, lsl, 64-bit) 1->2 2.002874 | |
NEGS (register, lsl, 64-bit) 3->2 1.998501 | |
NEGS (register, lsl, 64-bit) throughput 0.500181 | |
NEGS (register, lsr, 32-bit) 1->2 1.998005 | |
NEGS (register, lsr, 32-bit) 3->2 2.005948 | |
NEGS (register, lsr, 32-bit) throughput 0.501975 | |
NEGS (register, lsr, 64-bit) 1->2 1.999226 | |
NEGS (register, lsr, 64-bit) 3->2 1.997654 | |
NEGS (register, lsr, 64-bit) throughput 0.501350 | |
NEGS (register, asr, 32-bit) 1->2 2.002850 | |
NEGS (register, asr, 32-bit) 3->2 1.999252 | |
NEGS (register, asr, 32-bit) throughput 0.501751 | |
NEGS (register, asr, 64-bit) 1->2 2.005768 | |
NEGS (register, asr, 64-bit) 3->2 1.999127 | |
NEGS (register, asr, 64-bit) throughput 0.500708 | |
NGC (register, 32-bit) 1->2 1.000424 | |
NGC (register, 32-bit) 1->3 1.003099 | |
NGC (register, 32-bit) throughput 0.250913 | |
NGC (register, 64-bit) 1->2 1.000600 | |
NGC (register, 64-bit) 1->3 0.997101 | |
NGC (register, 64-bit) throughput 0.250193 | |
NGCS (register, 32-bit) 1->2 0.998927 | |
NGCS (register, 32-bit) 1->3 1.008249 | |
NGCS (register, 32-bit) 3->2 0.999376 | |
NGCS (register, 32-bit) 3->3 1.000575 | |
NGCS (register, 32-bit) throughput 0.751531 | |
NGCS (register, 32-bit) throughput 0.568298 | |
NGCS (register, 64-bit) 1->2 0.999451 | |
NGCS (register, 64-bit) 1->3 1.004142 | |
NGCS (register, 64-bit) 3->2 1.002650 | |
NGCS (register, 64-bit) 3->3 0.999925 | |
NGCS (register, 64-bit) throughput 0.751479 | |
NGCS (register, 64-bit) throughput 0.566243 | |
NOP throughput 0.200606 | |
ORN (register, 32-bit) 1->2 1.000624 | |
ORN (register, 32-bit) 1->3 0.998254 | |
ORN (register, 32-bit) throughput 0.250203 | |
ORN (register, 64-bit) 1->2 1.001900 | |
ORN (register, 64-bit) 1->3 1.002050 | |
ORN (register, 64-bit) throughput 0.250790 | |
ORN (register, lsl, 32-bit) 1->2 2.003126 | |
ORN (register, lsl, 32-bit) 1->3 1.999724 | |
ORN (register, lsl, 32-bit) throughput 0.501787 | |
ORN (register, lsl, 64-bit) 1->2 1.999700 | |
ORN (register, lsl, 64-bit) 1->3 1.999126 | |
ORN (register, lsl, 64-bit) throughput 0.494400 | |
ORN (register, lsr, 32-bit) 1->2 1.997555 | |
ORN (register, lsr, 32-bit) 1->3 1.997954 | |
ORN (register, lsr, 32-bit) throughput 0.50076 |
View raw
(Sorry about that, but we can’t show files that are this big right now.)
View raw
(Sorry about that, but we can’t show files that are this big right now.)
View raw
(Sorry about that, but we can’t show files that are this big right now.)
View raw
(Sorry about that, but we can’t show files that are this big right now.)
View raw
(Sorry about that, but we can’t show files that are this big right now.)
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment