simdna 1.0.2

High-performance SIMD-accelerated DNA sequence encoding supporting all IUPAC nucleotide codes
Documentation
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
346
347
348
349
350
351
352
353
354
355
356
357
358
359
360
361
362
363
364
365
366
367
368
369
370
371
372
373
374
375
376
377
378
379
380
381
382
383
384
385
386
387
388
389
390
391
392
393
394
395
396
397
398
399
400
401
402
403
404
405
406
407
408
409
410
411
412
413
414
415
416
417
418
419
420
421
422
423
424
425
426
427
428
429
430
431
432
433
434
435
436
437
438
439
440
441
442
443
444
445
446
447
448
449
450
451
452
453
454
455
456
457
458
459
460
461
462
463
464
465
466
467
468
469
470
471
472
473
474
475
476
477
478
479
480
481
482
483
484
485
486
487
488
489
490
491
492
493
494
495
496
497
498
499
500
501
502
503
504
505
506
507
508
509
510
511
512
513
514
515
516
517
518
519
520
521
522
523
524
525
526
527
528
529
530
531
532
533
534
535
536
537
538
539
540
541
542
543
544
545
546
547
548
549
550
551
552
553
554
555
556
557
558
559
560
561
562
563
564
565
566
567
568
569
570
571
572
573
574
575
576
577
578
579
580
581
582
583
584
585
586
587
588
589
590
591
592
593
594
595
596
597
598
599
600
601
602
603
604
605
606
607
608
609
610
611
612
613
614
615
616
617
618
619
620
621
622
623
624
625
626
627
628
629
630
631
632
633
634
635
636
637
638
639
640
641
642
643
644
645
646
647
648
649
650
651
652
653
654
655
656
657
658
659
660
661
662
663
664
665
666
667
668
669
670
671
672
673
674
675
676
677
678
679
680
681
682
683
684
685
686
687
688
689
690
691
692
693
694
695
696
697
698
699
700
701
702
703
704
705
706
707
708
709
710
711
712
713
714
715
716
717
718
719
720
721
722
723
724
725
726
727
728
729
730
731
732
733
734
735
736
737
738
739
740
741
742
743
744
745
746
747
748
749
750
751
752
753
754
755
756
757
758
759
760
761
762
763
764
765
766
767
768
769
770
771
772
773
774
775
776
777
778
779
780
781
782
783
784
785
786
787
788
789
790
791
792
793
794
795
796
797
798
799
800
801
802
803
804
805
806
807
808
809
810
811
812
813
814
815
816
817
818
819
820
821
822
823
824
825
826
827
828
829
830
831
832
833
834
835
836
837
838
839
840
841
842
843
844
845
846
847
848
849
850
851
852
853
854
855
856
857
858
859
860
861
862
863
864
865
866
867
868
869
870
871
872
873
874
875
876
877
878
879
880
881
882
883
884
885
886
887
888
889
890
891
892
893
894
895
896
897
898
899
900
901
902
903
904
905
906
907
908
909
910
911
912
913
914
915
916
917
918
919
920
921
922
923
924
925
926
927
928
929
930
931
932
933
934
935
936
937
938
939
940
941
942
943
944
945
946
947
948
949
950
951
952
953
954
955
956
957
958
959
960
961
962
963
964
965
966
967
968
969
970
971
972
973
974
975
976
977
978
979
980
981
982
983
984
985
986
987
988
989
990
991
992
993
994
995
996
997
998
999
1000
1001
1002
1003
1004
1005
1006
1007
1008
1009
1010
1011
1012
1013
1014
1015
1016
1017
1018
1019
1020
1021
1022
1023
1024
1025
1026
1027
1028
1029
1030
1031
1032
1033
1034
1035
1036
1037
1038
1039
1040
1041
1042
1043
1044
1045
1046
1047
1048
1049
1050
1051
1052
1053
1054
1055
1056
1057
1058
1059
1060
1061
1062
1063
1064
1065
1066
1067
1068
1069
1070
1071
1072
1073
1074
1075
1076
1077
1078
1079
1080
1081
1082
1083
1084
1085
1086
1087
1088
1089
1090
1091
1092
1093
1094
1095
1096
1097
1098
1099
1100
1101
1102
1103
1104
1105
1106
1107
1108
1109
1110
1111
1112
1113
1114
1115
1116
1117
1118
1119
1120
1121
1122
1123
1124
1125
1126
1127
1128
1129
1130
1131
1132
1133
1134
1135
1136
1137
1138
1139
1140
1141
1142
1143
1144
1145
1146
1147
1148
1149
1150
1151
1152
1153
1154
1155
1156
1157
1158
1159
1160
1161
1162
1163
1164
1165
1166
1167
1168
1169
1170
1171
1172
1173
1174
1175
1176
1177
1178
1179
1180
1181
1182
1183
1184
1185
1186
1187
1188
1189
1190
1191
1192
1193
1194
1195
1196
1197
1198
1199
1200
1201
1202
1203
1204
1205
1206
1207
1208
1209
1210
1211
1212
1213
1214
1215
1216
1217
1218
1219
1220
1221
1222
1223
1224
1225
1226
1227
1228
1229
1230
1231
1232
1233
1234
1235
1236
1237
1238
1239
1240
1241
1242
1243
1244
1245
1246
1247
1248
1249
1250
1251
1252
1253
1254
1255
1256
1257
1258
1259
1260
1261
1262
1263
1264
1265
1266
1267
1268
1269
1270
1271
1272
1273
1274
1275
1276
1277
1278
1279
1280
1281
1282
1283
1284
1285
1286
1287
1288
1289
1290
1291
1292
1293
1294
1295
1296
1297
1298
1299
1300
1301
1302
1303
1304
1305
1306
1307
1308
1309
1310
1311
1312
1313
1314
1315
1316
1317
1318
1319
1320
1321
1322
1323
1324
1325
1326
1327
1328
1329
1330
1331
1332
1333
1334
1335
1336
1337
1338
1339
1340
1341
1342
1343
1344
1345
1346
1347
1348
1349
1350
1351
1352
1353
1354
1355
1356
1357
1358
1359
1360
1361
1362
1363
1364
1365
1366
1367
1368
1369
1370
1371
1372
1373
1374
1375
1376
1377
1378
1379
1380
1381
1382
1383
1384
1385
1386
1387
1388
1389
1390
1391
1392
1393
1394
1395
1396
1397
1398
1399
1400
1401
1402
1403
1404
1405
1406
1407
1408
1409
1410
1411
1412
1413
1414
1415
1416
1417
1418
1419
1420
1421
1422
1423
1424
1425
1426
1427
1428
1429
1430
1431
1432
1433
1434
1435
1436
1437
1438
1439
1440
1441
1442
1443
1444
1445
1446
1447
1448
1449
1450
1451
1452
1453
1454
1455
1456
1457
1458
1459
1460
1461
1462
1463
1464
1465
1466
1467
1468
1469
1470
1471
1472
1473
1474
1475
1476
1477
1478
1479
1480
1481
1482
1483
1484
1485
1486
1487
1488
1489
1490
1491
1492
1493
1494
1495
1496
1497
1498
1499
1500
1501
1502
1503
1504
1505
1506
1507
1508
1509
1510
1511
1512
1513
1514
1515
1516
1517
1518
1519
1520
1521
1522
1523
1524
1525
1526
1527
1528
1529
1530
1531
1532
1533
1534
1535
1536
1537
1538
1539
1540
1541
1542
1543
1544
1545
1546
1547
1548
1549
1550
1551
1552
1553
1554
1555
1556
1557
1558
1559
1560
1561
1562
1563
1564
1565
1566
1567
1568
1569
1570
1571
1572
1573
1574
1575
1576
1577
1578
1579
1580
1581
1582
1583
1584
1585
1586
1587
1588
1589
1590
1591
1592
1593
1594
1595
1596
1597
1598
1599
1600
1601
1602
1603
1604
1605
1606
1607
1608
1609
1610
1611
1612
1613
1614
1615
1616
1617
1618
1619
1620
1621
1622
1623
1624
1625
1626
1627
1628
1629
1630
1631
1632
1633
1634
1635
1636
1637
1638
1639
1640
1641
1642
1643
1644
1645
1646
1647
1648
1649
1650
1651
1652
1653
1654
1655
1656
1657
1658
1659
1660
1661
1662
1663
1664
1665
1666
1667
1668
1669
1670
1671
1672
1673
1674
1675
1676
1677
1678
1679
1680
1681
1682
1683
1684
1685
1686
1687
1688
1689
1690
1691
1692
1693
1694
1695
1696
1697
1698
1699
1700
1701
1702
1703
1704
1705
1706
1707
1708
1709
1710
1711
1712
1713
1714
1715
1716
1717
1718
1719
1720
1721
1722
1723
1724
1725
1726
1727
1728
1729
1730
1731
1732
1733
1734
1735
1736
1737
1738
1739
1740
1741
1742
1743
1744
1745
1746
1747
1748
1749
1750
1751
1752
1753
1754
1755
1756
1757
1758
1759
1760
1761
1762
1763
1764
1765
1766
1767
1768
1769
1770
1771
1772
1773
1774
1775
1776
1777
1778
1779
1780
1781
1782
1783
1784
1785
1786
1787
1788
1789
1790
1791
1792
1793
1794
1795
1796
1797
1798
1799
1800
1801
1802
1803
1804
1805
1806
1807
1808
1809
1810
1811
1812
1813
1814
1815
1816
1817
1818
1819
1820
1821
1822
1823
1824
1825
1826
1827
1828
1829
1830
1831
1832
1833
1834
1835
1836
1837
1838
1839
1840
1841
1842
1843
1844
1845
1846
1847
1848
1849
1850
1851
1852
1853
1854
1855
1856
1857
1858
1859
1860
1861
1862
1863
1864
1865
1866
1867
1868
1869
1870
1871
1872
1873
1874
1875
1876
1877
1878
1879
1880
1881
1882
1883
1884
1885
1886
1887
1888
1889
1890
1891
1892
1893
1894
1895
1896
1897
1898
1899
1900
1901
1902
1903
1904
1905
1906
1907
1908
1909
1910
1911
1912
1913
1914
1915
1916
1917
1918
1919
1920
1921
1922
1923
1924
1925
1926
1927
1928
1929
1930
1931
1932
1933
1934
1935
1936
1937
1938
1939
1940
1941
1942
1943
1944
1945
1946
1947
1948
1949
1950
1951
1952
1953
1954
1955
1956
1957
1958
1959
1960
1961
1962
1963
1964
1965
1966
1967
1968
1969
1970
1971
1972
1973
1974
1975
1976
1977
1978
1979
1980
1981
1982
1983
1984
1985
1986
1987
1988
1989
1990
1991
1992
1993
1994
1995
1996
1997
1998
1999
2000
2001
2002
2003
2004
2005
2006
2007
2008
2009
2010
2011
2012
2013
2014
2015
2016
2017
2018
2019
2020
2021
2022
2023
2024
2025
2026
2027
2028
2029
2030
2031
2032
2033
2034
2035
2036
2037
2038
2039
2040
2041
2042
2043
2044
2045
2046
2047
2048
2049
2050
2051
2052
2053
2054
2055
2056
2057
2058
2059
2060
2061
2062
2063
2064
2065
2066
2067
2068
2069
2070
2071
2072
2073
2074
2075
2076
2077
2078
2079
2080
2081
2082
2083
2084
2085
2086
2087
2088
2089
2090
2091
2092
2093
2094
2095
2096
2097
2098
2099
2100
2101
2102
2103
2104
2105
2106
2107
2108
2109
2110
2111
2112
2113
2114
2115
2116
2117
2118
2119
2120
2121
2122
2123
2124
2125
2126
2127
2128
2129
2130
2131
2132
2133
2134
2135
2136
2137
2138
2139
2140
2141
2142
2143
2144
2145
2146
2147
2148
2149
2150
2151
2152
2153
2154
2155
2156
2157
2158
2159
2160
2161
2162
2163
2164
2165
2166
2167
2168
2169
2170
2171
2172
2173
2174
2175
2176
2177
2178
2179
2180
2181
2182
2183
2184
2185
2186
2187
2188
2189
2190
2191
2192
2193
2194
2195
2196
2197
2198
2199
2200
2201
2202
2203
2204
2205
2206
2207
2208
2209
2210
2211
2212
2213
2214
2215
2216
2217
2218
2219
2220
2221
2222
2223
2224
2225
2226
2227
2228
2229
2230
2231
2232
2233
2234
2235
2236
2237
2238
2239
2240
2241
2242
2243
2244
2245
2246
2247
2248
2249
2250
2251
2252
2253
2254
2255
2256
2257
2258
2259
2260
2261
2262
2263
2264
2265
2266
2267
2268
2269
2270
2271
2272
2273
2274
2275
2276
2277
2278
2279
2280
2281
2282
2283
2284
2285
2286
2287
2288
2289
2290
2291
2292
2293
2294
2295
2296
2297
2298
2299
2300
2301
2302
2303
2304
2305
2306
2307
2308
2309
2310
2311
2312
2313
2314
2315
2316
2317
2318
2319
2320
2321
2322
2323
2324
2325
2326
2327
2328
2329
2330
2331
2332
2333
2334
2335
2336
2337
2338
2339
2340
2341
2342
2343
2344
2345
2346
2347
2348
2349
2350
2351
2352
2353
2354
2355
2356
2357
2358
2359
2360
2361
2362
2363
2364
2365
2366
2367
2368
2369
2370
2371
2372
2373
2374
2375
2376
2377
2378
2379
2380
2381
2382
2383
2384
2385
2386
2387
2388
2389
2390
2391
2392
2393
2394
2395
2396
2397
2398
2399
2400
2401
2402
2403
2404
2405
2406
2407
2408
2409
2410
2411
2412
2413
2414
2415
2416
2417
2418
2419
2420
2421
2422
2423
2424
2425
2426
2427
2428
2429
2430
2431
2432
2433
2434
2435
2436
2437
2438
2439
2440
2441
2442
2443
2444
2445
2446
2447
2448
2449
2450
2451
2452
2453
2454
2455
2456
2457
2458
2459
2460
2461
2462
2463
2464
2465
2466
2467
2468
2469
2470
2471
2472
2473
2474
2475
2476
2477
2478
2479
2480
2481
2482
2483
2484
2485
2486
2487
2488
2489
2490
2491
2492
2493
2494
2495
2496
2497
2498
2499
2500
2501
2502
2503
2504
2505
2506
2507
2508
2509
2510
2511
2512
2513
2514
2515
2516
2517
2518
2519
2520
2521
2522
2523
2524
2525
2526
2527
2528
2529
2530
2531
2532
2533
2534
2535
2536
2537
2538
2539
2540
2541
2542
2543
2544
2545
2546
2547
2548
2549
2550
2551
2552
2553
2554
2555
2556
2557
2558
2559
2560
2561
2562
2563
2564
2565
2566
2567
2568
2569
2570
2571
2572
2573
2574
2575
2576
2577
2578
2579
2580
2581
2582
2583
2584
2585
2586
2587
2588
2589
2590
2591
2592
2593
2594
2595
2596
2597
2598
2599
2600
2601
2602
2603
2604
2605
2606
2607
2608
2609
2610
2611
2612
2613
2614
2615
2616
2617
2618
2619
2620
2621
2622
2623
2624
2625
2626
2627
2628
2629
2630
2631
2632
2633
2634
2635
2636
2637
2638
2639
2640
2641
2642
2643
2644
2645
2646
2647
2648
2649
2650
2651
2652
2653
2654
2655
2656
2657
2658
2659
2660
2661
2662
2663
2664
2665
2666
2667
2668
2669
2670
2671
2672
2673
2674
2675
2676
2677
2678
2679
2680
2681
2682
2683
2684
2685
2686
2687
2688
2689
2690
2691
2692
2693
2694
2695
2696
2697
2698
2699
2700
2701
2702
2703
2704
2705
2706
2707
2708
2709
2710
2711
2712
2713
2714
2715
2716
2717
2718
2719
2720
2721
2722
2723
2724
2725
2726
2727
2728
2729
2730
2731
2732
2733
2734
2735
2736
2737
2738
2739
2740
2741
2742
2743
2744
2745
2746
2747
2748
2749
2750
2751
2752
2753
2754
2755
2756
2757
2758
2759
2760
2761
2762
2763
2764
2765
2766
2767
2768
2769
2770
2771
2772
2773
2774
2775
2776
2777
2778
2779
2780
2781
2782
2783
2784
2785
2786
2787
2788
2789
2790
2791
2792
2793
2794
2795
2796
2797
2798
2799
2800
2801
2802
2803
2804
2805
2806
2807
2808
2809
2810
2811
2812
2813
2814
2815
2816
2817
2818
2819
2820
2821
2822
2823
2824
2825
2826
2827
2828
2829
2830
2831
2832
2833
2834
2835
2836
2837
2838
2839
2840
2841
2842
2843
2844
2845
2846
2847
2848
2849
2850
2851
2852
2853
2854
2855
2856
2857
2858
2859
2860
2861
2862
2863
2864
2865
2866
2867
2868
2869
2870
2871
2872
2873
2874
2875
2876
2877
2878
2879
2880
2881
2882
2883
2884
2885
2886
2887
2888
2889
2890
2891
2892
2893
2894
2895
2896
2897
2898
2899
2900
2901
2902
2903
2904
2905
2906
2907
2908
2909
2910
2911
2912
2913
2914
2915
2916
2917
2918
2919
2920
2921
2922
2923
2924
2925
2926
2927
2928
2929
2930
2931
2932
2933
2934
2935
2936
2937
2938
2939
2940
2941
2942
2943
2944
2945
2946
2947
2948
2949
2950
2951
2952
2953
2954
2955
2956
2957
2958
2959
2960
2961
2962
2963
2964
2965
2966
2967
2968
2969
2970
2971
2972
2973
2974
2975
2976
2977
2978
2979
2980
2981
2982
2983
2984
2985
2986
2987
2988
2989
2990
2991
2992
2993
2994
2995
2996
2997
2998
2999
3000
3001
3002
3003
3004
3005
3006
3007
3008
3009
3010
3011
3012
3013
3014
3015
3016
3017
3018
3019
3020
3021
3022
3023
3024
3025
3026
3027
3028
3029
3030
3031
3032
3033
3034
3035
3036
3037
3038
3039
3040
3041
3042
3043
3044
3045
3046
3047
3048
3049
3050
3051
3052
3053
3054
3055
3056
3057
3058
3059
3060
3061
3062
3063
3064
3065
3066
3067
3068
3069
3070
3071
3072
3073
3074
3075
3076
3077
3078
3079
3080
3081
3082
3083
3084
3085
3086
3087
3088
3089
3090
3091
3092
3093
3094
3095
3096
3097
3098
3099
3100
3101
3102
3103
3104
3105
3106
3107
3108
3109
3110
3111
3112
3113
3114
3115
3116
3117
3118
3119
3120
3121
3122
3123
3124
3125
3126
3127
3128
3129
3130
3131
3132
3133
3134
3135
3136
3137
3138
3139
3140
3141
3142
3143
3144
3145
3146
3147
3148
3149
3150
3151
3152
3153
3154
3155
3156
3157
3158
3159
3160
3161
3162
3163
3164
3165
3166
3167
3168
3169
3170
3171
3172
3173
3174
3175
3176
3177
3178
3179
3180
3181
3182
3183
3184
3185
3186
3187
3188
3189
3190
3191
3192
3193
3194
3195
3196
3197
3198
3199
3200
3201
3202
3203
3204
3205
3206
3207
3208
3209
3210
3211
3212
3213
3214
3215
3216
3217
3218
3219
3220
3221
3222
3223
3224
3225
3226
3227
3228
3229
3230
3231
3232
3233
3234
3235
3236
3237
3238
3239
3240
3241
3242
3243
3244
3245
3246
3247
3248
3249
3250
3251
3252
3253
3254
3255
3256
3257
3258
3259
3260
3261
3262
3263
3264
3265
3266
3267
3268
3269
3270
3271
3272
3273
3274
3275
3276
3277
3278
3279
3280
3281
3282
3283
3284
3285
3286
3287
3288
3289
3290
3291
3292
3293
3294
3295
3296
3297
3298
3299
    Finished `bench` profile [optimized] target(s) in 0.03s
     Running benchmark/benchmark.rs (target/release/deps/benchmark-1dbf0b03f7f99eab)

╔════════════════════════════════════════════════════════════╗
║ simdna benchmark v1.0.1                                     ║
║ Run date: 2025-12-18 23:01:19 UTC                              ║
║ Platform: aarch64                                           ║
╚════════════════════════════════════════════════════════════╝

Benchmarking encode/simd_4bit/15
Benchmarking encode/simd_4bit/15: Warming up for 3.0000 s
Benchmarking encode/simd_4bit/15: Collecting 100 samples in estimated 5.0000 s (174M iterations)
Benchmarking encode/simd_4bit/15: Analyzing
encode/simd_4bit/15     time:   [28.682 ns 28.969 ns 29.353 ns]
                        thrpt:  [487.35 MiB/s 493.81 MiB/s 498.74 MiB/s]
                 change:
                        time:   [−9.9254% −6.5528% −3.7401%] (p = 0.00 < 0.05)
                        thrpt:  [+3.8854% +7.0123% +11.019%]
                        Performance has improved.
Found 17 outliers among 100 measurements (17.00%)
  8 (8.00%) high mild
  9 (9.00%) high severe
Benchmarking encode/scalar_2bit/15
Benchmarking encode/scalar_2bit/15: Warming up for 3.0000 s
Benchmarking encode/scalar_2bit/15: Collecting 100 samples in estimated 5.0004 s (53M iterations)
Benchmarking encode/scalar_2bit/15: Analyzing
encode/scalar_2bit/15   time:   [94.188 ns 94.264 ns 94.386 ns]
                        thrpt:  [151.56 MiB/s 151.76 MiB/s 151.88 MiB/s]
                 change:
                        time:   [−7.9751% −5.4861% −3.4814%] (p = 0.00 < 0.05)
                        thrpt:  [+3.6070% +5.8045% +8.6662%]
                        Performance has improved.
Found 7 outliers among 100 measurements (7.00%)
  2 (2.00%) high mild
  5 (5.00%) high severe
Benchmarking encode/scalar_4bit/15
Benchmarking encode/scalar_4bit/15: Warming up for 3.0000 s
Benchmarking encode/scalar_4bit/15: Collecting 100 samples in estimated 5.0001 s (160M iterations)
Benchmarking encode/scalar_4bit/15: Analyzing
encode/scalar_4bit/15   time:   [31.281 ns 31.309 ns 31.350 ns]
                        thrpt:  [456.30 MiB/s 456.89 MiB/s 457.31 MiB/s]
                 change:
                        time:   [+4.6355% +7.1587% +8.7070%] (p = 0.00 < 0.05)
                        thrpt:  [−8.0096% −6.6804% −4.4302%]
                        Performance has regressed.
Found 10 outliers among 100 measurements (10.00%)
  3 (3.00%) high mild
  7 (7.00%) high severe
Benchmarking encode/simd_4bit/16
Benchmarking encode/simd_4bit/16: Warming up for 3.0000 s
Benchmarking encode/simd_4bit/16: Collecting 100 samples in estimated 5.0000 s (186M iterations)
Benchmarking encode/simd_4bit/16: Analyzing
encode/simd_4bit/16     time:   [26.802 ns 26.833 ns 26.871 ns]
                        thrpt:  [567.86 MiB/s 568.66 MiB/s 569.31 MiB/s]
                 change:
                        time:   [−7.6430% −4.8339% −2.7452%] (p = 0.00 < 0.05)
                        thrpt:  [+2.8227% +5.0795% +8.2755%]
                        Performance has improved.
Found 8 outliers among 100 measurements (8.00%)
  5 (5.00%) high mild
  3 (3.00%) high severe
Benchmarking encode/scalar_2bit/16
Benchmarking encode/scalar_2bit/16: Warming up for 3.0000 s
Benchmarking encode/scalar_2bit/16: Collecting 100 samples in estimated 5.0002 s (99M iterations)
Benchmarking encode/scalar_2bit/16: Analyzing
encode/scalar_2bit/16   time:   [50.590 ns 50.622 ns 50.661 ns]
                        thrpt:  [301.19 MiB/s 301.43 MiB/s 301.62 MiB/s]
                 change:
                        time:   [−7.2551% −5.0751% −3.2373%] (p = 0.00 < 0.05)
                        thrpt:  [+3.3456% +5.3464% +7.8226%]
                        Performance has improved.
Found 12 outliers among 100 measurements (12.00%)
  5 (5.00%) high mild
  7 (7.00%) high severe
Benchmarking encode/scalar_4bit/16
Benchmarking encode/scalar_4bit/16: Warming up for 3.0000 s
Benchmarking encode/scalar_4bit/16: Collecting 100 samples in estimated 5.0001 s (174M iterations)
Benchmarking encode/scalar_4bit/16: Analyzing
encode/scalar_4bit/16   time:   [28.465 ns 28.482 ns 28.502 ns]
                        thrpt:  [535.36 MiB/s 535.73 MiB/s 536.06 MiB/s]
                 change:
                        time:   [−7.6730% −5.1625% −3.1190%] (p = 0.00 < 0.05)
                        thrpt:  [+3.2194% +5.4435% +8.3107%]
                        Performance has improved.
Found 12 outliers among 100 measurements (12.00%)
  7 (7.00%) high mild
  5 (5.00%) high severe
Benchmarking encode/simd_4bit/17
Benchmarking encode/simd_4bit/17: Warming up for 3.0000 s
Benchmarking encode/simd_4bit/17: Collecting 100 samples in estimated 5.0001 s (184M iterations)
Benchmarking encode/simd_4bit/17: Analyzing
encode/simd_4bit/17     time:   [26.787 ns 26.804 ns 26.825 ns]
                        thrpt:  [604.37 MiB/s 604.85 MiB/s 605.25 MiB/s]
                 change:
                        time:   [−6.9235% −4.5472% −2.6912%] (p = 0.00 < 0.05)
                        thrpt:  [+2.7656% +4.7638% +7.4385%]
                        Performance has improved.
Found 11 outliers among 100 measurements (11.00%)
  4 (4.00%) high mild
  7 (7.00%) high severe
Benchmarking encode/scalar_2bit/17
Benchmarking encode/scalar_2bit/17: Warming up for 3.0000 s
Benchmarking encode/scalar_2bit/17: Collecting 100 samples in estimated 5.0004 s (47M iterations)
Benchmarking encode/scalar_2bit/17: Analyzing
encode/scalar_2bit/17   time:   [105.59 ns 105.68 ns 105.79 ns]
                        thrpt:  [153.24 MiB/s 153.42 MiB/s 153.54 MiB/s]
                 change:
                        time:   [−6.8068% −4.5177% −2.8268%] (p = 0.00 < 0.05)
                        thrpt:  [+2.9090% +4.7315% +7.3039%]
                        Performance has improved.
Found 12 outliers among 100 measurements (12.00%)
  1 (1.00%) low mild
  5 (5.00%) high mild
  6 (6.00%) high severe
Benchmarking encode/scalar_4bit/17
Benchmarking encode/scalar_4bit/17: Warming up for 3.0000 s
Benchmarking encode/scalar_4bit/17: Collecting 100 samples in estimated 5.0001 s (172M iterations)
Benchmarking encode/scalar_4bit/17: Analyzing
encode/scalar_4bit/17   time:   [28.987 ns 29.009 ns 29.039 ns]
                        thrpt:  [558.30 MiB/s 558.87 MiB/s 559.30 MiB/s]
                 change:
                        time:   [−6.5438% −4.5124% −2.9735%] (p = 0.00 < 0.05)
                        thrpt:  [+3.0646% +4.7256% +7.0020%]
                        Performance has improved.
Found 10 outliers among 100 measurements (10.00%)
  4 (4.00%) high mild
  6 (6.00%) high severe
Benchmarking encode/simd_4bit/32
Benchmarking encode/simd_4bit/32: Warming up for 3.0000 s
Benchmarking encode/simd_4bit/32: Collecting 100 samples in estimated 5.0001 s (180M iterations)
Benchmarking encode/simd_4bit/32: Analyzing
encode/simd_4bit/32     time:   [27.789 ns 27.974 ns 28.316 ns]
                        thrpt:  [1.0525 GiB/s 1.0654 GiB/s 1.0725 GiB/s]
                 change:
                        time:   [−7.0776% −4.4293% −2.3353%] (p = 0.00 < 0.05)
                        thrpt:  [+2.3912% +4.6345% +7.6167%]
                        Performance has improved.
Found 10 outliers among 100 measurements (10.00%)
  3 (3.00%) high mild
  7 (7.00%) high severe
Benchmarking encode/scalar_2bit/32
Benchmarking encode/scalar_2bit/32: Warming up for 3.0000 s
Benchmarking encode/scalar_2bit/32: Collecting 100 samples in estimated 5.0003 s (73M iterations)
Benchmarking encode/scalar_2bit/32: Analyzing
encode/scalar_2bit/32   time:   [68.103 ns 68.205 ns 68.326 ns]
                        thrpt:  [446.65 MiB/s 447.44 MiB/s 448.11 MiB/s]
                 change:
                        time:   [−3.8059% −0.8056% +1.5033%] (p = 0.64 > 0.05)
                        thrpt:  [−1.4810% +0.8122% +3.9565%]
                        No change in performance detected.
Found 6 outliers among 100 measurements (6.00%)
  1 (1.00%) low mild
  2 (2.00%) high mild
  3 (3.00%) high severe
Benchmarking encode/scalar_4bit/32
Benchmarking encode/scalar_4bit/32: Warming up for 3.0000 s
Benchmarking encode/scalar_4bit/32: Collecting 100 samples in estimated 5.0001 s (135M iterations)
Benchmarking encode/scalar_4bit/32: Analyzing
encode/scalar_4bit/32   time:   [36.964 ns 37.145 ns 37.466 ns]
                        thrpt:  [814.53 MiB/s 821.58 MiB/s 825.61 MiB/s]
                 change:
                        time:   [−10.132% −7.0515% −4.3110%] (p = 0.00 < 0.05)
                        thrpt:  [+4.5053% +7.5864% +11.275%]
                        Performance has improved.
Found 11 outliers among 100 measurements (11.00%)
  3 (3.00%) high mild
  8 (8.00%) high severe
Benchmarking encode/simd_4bit/33
Benchmarking encode/simd_4bit/33: Warming up for 3.0000 s
Benchmarking encode/simd_4bit/33: Collecting 100 samples in estimated 5.0000 s (160M iterations)
Benchmarking encode/simd_4bit/33: Analyzing
encode/simd_4bit/33     time:   [31.195 ns 31.291 ns 31.447 ns]
                        thrpt:  [1000.8 MiB/s 1005.7 MiB/s 1008.9 MiB/s]
                 change:
                        time:   [−8.8303% −5.0691% −1.6947%] (p = 0.00 < 0.05)
                        thrpt:  [+1.7239% +5.3398% +9.6856%]
                        Performance has improved.
Found 11 outliers among 100 measurements (11.00%)
  3 (3.00%) high mild
  8 (8.00%) high severe
Benchmarking encode/scalar_2bit/33
Benchmarking encode/scalar_2bit/33: Warming up for 3.0000 s
Benchmarking encode/scalar_2bit/33: Collecting 100 samples in estimated 5.0001 s (44M iterations)
Benchmarking encode/scalar_2bit/33: Analyzing
encode/scalar_2bit/33   time:   [113.66 ns 113.71 ns 113.77 ns]
                        thrpt:  [276.63 MiB/s 276.77 MiB/s 276.90 MiB/s]
                 change:
                        time:   [−24.062% −14.980% −6.9311%] (p = 0.00 < 0.05)
                        thrpt:  [+7.4473% +17.620% +31.687%]
                        Performance has improved.
Found 9 outliers among 100 measurements (9.00%)
  2 (2.00%) high mild
  7 (7.00%) high severe
Benchmarking encode/scalar_4bit/33
Benchmarking encode/scalar_4bit/33: Warming up for 3.0000 s
Benchmarking encode/scalar_4bit/33: Collecting 100 samples in estimated 5.0000 s (123M iterations)
Benchmarking encode/scalar_4bit/33: Analyzing
encode/scalar_4bit/33   time:   [40.703 ns 40.756 ns 40.827 ns]
                        thrpt:  [770.83 MiB/s 772.19 MiB/s 773.20 MiB/s]
                 change:
                        time:   [−1.9727% −1.7306% −1.4383%] (p = 0.00 < 0.05)
                        thrpt:  [+1.4593% +1.7611% +2.0124%]
                        Performance has improved.
Found 13 outliers among 100 measurements (13.00%)
  4 (4.00%) high mild
  9 (9.00%) high severe
Benchmarking encode/simd_4bit/63
Benchmarking encode/simd_4bit/63: Warming up for 3.0000 s
Benchmarking encode/simd_4bit/63: Collecting 100 samples in estimated 5.0000 s (123M iterations)
Benchmarking encode/simd_4bit/63: Analyzing
encode/simd_4bit/63     time:   [40.570 ns 40.599 ns 40.633 ns]
                        thrpt:  [1.4440 GiB/s 1.4452 GiB/s 1.4462 GiB/s]
                 change:
                        time:   [−2.7859% −2.6522% −2.5147%] (p = 0.00 < 0.05)
                        thrpt:  [+2.5796% +2.7244% +2.8658%]
                        Performance has improved.
Found 6 outliers among 100 measurements (6.00%)
  2 (2.00%) high mild
  4 (4.00%) high severe
Benchmarking encode/scalar_2bit/63
Benchmarking encode/scalar_2bit/63: Warming up for 3.0000 s
Benchmarking encode/scalar_2bit/63: Collecting 100 samples in estimated 5.0002 s (39M iterations)
Benchmarking encode/scalar_2bit/63: Analyzing
encode/scalar_2bit/63   time:   [129.11 ns 129.30 ns 129.52 ns]
                        thrpt:  [463.88 MiB/s 464.68 MiB/s 465.35 MiB/s]
                 change:
                        time:   [−3.8814% −3.1020% −2.6180%] (p = 0.00 < 0.05)
                        thrpt:  [+2.6884% +3.2013% +4.0381%]
                        Performance has improved.
Found 9 outliers among 100 measurements (9.00%)
  1 (1.00%) low mild
  3 (3.00%) high mild
  5 (5.00%) high severe
Benchmarking encode/scalar_4bit/63
Benchmarking encode/scalar_4bit/63: Warming up for 3.0000 s
Benchmarking encode/scalar_4bit/63: Collecting 100 samples in estimated 5.0003 s (84M iterations)
Benchmarking encode/scalar_4bit/63: Analyzing
encode/scalar_4bit/63   time:   [59.341 ns 59.769 ns 60.404 ns]
                        thrpt:  [994.66 MiB/s 1005.2 MiB/s 1012.5 MiB/s]
                 change:
                        time:   [−2.2651% −1.4190% −0.4231%] (p = 0.00 < 0.05)
                        thrpt:  [+0.4249% +1.4394% +2.3176%]
                        Change within noise threshold.
Found 13 outliers among 100 measurements (13.00%)
  6 (6.00%) high mild
  7 (7.00%) high severe
Benchmarking encode/simd_4bit/64
Benchmarking encode/simd_4bit/64: Warming up for 3.0000 s
Benchmarking encode/simd_4bit/64: Collecting 100 samples in estimated 5.0001 s (133M iterations)
Benchmarking encode/simd_4bit/64: Analyzing
encode/simd_4bit/64     time:   [37.465 ns 37.520 ns 37.593 ns]
                        thrpt:  [1.5855 GiB/s 1.5886 GiB/s 1.5910 GiB/s]
                 change:
                        time:   [−7.5128% −4.7399% −2.5136%] (p = 0.00 < 0.05)
                        thrpt:  [+2.5784% +4.9758% +8.1231%]
                        Performance has improved.
Found 14 outliers among 100 measurements (14.00%)
  5 (5.00%) high mild
  9 (9.00%) high severe
Benchmarking encode/scalar_2bit/64
Benchmarking encode/scalar_2bit/64: Warming up for 3.0000 s
Benchmarking encode/scalar_2bit/64: Collecting 100 samples in estimated 5.0003 s (60M iterations)
Benchmarking encode/scalar_2bit/64: Analyzing
encode/scalar_2bit/64   time:   [82.801 ns 82.855 ns 82.919 ns]
                        thrpt:  [736.08 MiB/s 736.65 MiB/s 737.13 MiB/s]
                 change:
                        time:   [−9.1266% −6.4501% −4.1788%] (p = 0.00 < 0.05)
                        thrpt:  [+4.3610% +6.8949% +10.043%]
                        Performance has improved.
Found 11 outliers among 100 measurements (11.00%)
  6 (6.00%) high mild
  5 (5.00%) high severe
Benchmarking encode/scalar_4bit/64
Benchmarking encode/scalar_4bit/64: Warming up for 3.0000 s
Benchmarking encode/scalar_4bit/64: Collecting 100 samples in estimated 5.0002 s (84M iterations)
Benchmarking encode/scalar_4bit/64: Analyzing
encode/scalar_4bit/64   time:   [59.701 ns 59.750 ns 59.809 ns]
                        thrpt:  [1020.5 MiB/s 1021.5 MiB/s 1022.4 MiB/s]
                 change:
                        time:   [−12.686% −8.3319% −4.4741%] (p = 0.00 < 0.05)
                        thrpt:  [+4.6837% +9.0892% +14.529%]
                        Performance has improved.
Found 14 outliers among 100 measurements (14.00%)
  5 (5.00%) high mild
  9 (9.00%) high severe
Benchmarking encode/simd_4bit/127
Benchmarking encode/simd_4bit/127: Warming up for 3.0000 s
Benchmarking encode/simd_4bit/127: Collecting 100 samples in estimated 5.0002 s (84M iterations)
Benchmarking encode/simd_4bit/127: Analyzing
encode/simd_4bit/127    time:   [59.158 ns 59.194 ns 59.244 ns]
                        thrpt:  [1.9965 GiB/s 1.9981 GiB/s 1.9994 GiB/s]
                 change:
                        time:   [−7.2269% −4.9502% −3.0501%] (p = 0.00 < 0.05)
                        thrpt:  [+3.1461% +5.2080% +7.7898%]
                        Performance has improved.
Found 6 outliers among 100 measurements (6.00%)
  4 (4.00%) high mild
  2 (2.00%) high severe
Benchmarking encode/scalar_2bit/127
Benchmarking encode/scalar_2bit/127: Warming up for 3.0000 s
Benchmarking encode/scalar_2bit/127: Collecting 100 samples in estimated 5.0003 s (28M iterations)
Benchmarking encode/scalar_2bit/127: Analyzing
encode/scalar_2bit/127  time:   [175.69 ns 175.89 ns 176.14 ns]
                        thrpt:  [687.62 MiB/s 688.58 MiB/s 689.37 MiB/s]
                 change:
                        time:   [−9.9969% −6.6344% −3.8573%] (p = 0.00 < 0.05)
                        thrpt:  [+4.0120% +7.1059% +11.107%]
                        Performance has improved.
Found 17 outliers among 100 measurements (17.00%)
  3 (3.00%) low mild
  7 (7.00%) high mild
  7 (7.00%) high severe
Benchmarking encode/scalar_4bit/127
Benchmarking encode/scalar_4bit/127: Warming up for 3.0000 s
Benchmarking encode/scalar_4bit/127: Collecting 100 samples in estimated 5.0001 s (45M iterations)
Benchmarking encode/scalar_4bit/127: Analyzing
encode/scalar_4bit/127  time:   [112.17 ns 112.48 ns 112.95 ns]
                        thrpt:  [1.0472 GiB/s 1.0515 GiB/s 1.0545 GiB/s]
                 change:
                        time:   [−9.8301% −5.8489% −2.0777%] (p = 0.00 < 0.05)
                        thrpt:  [+2.1218% +6.2122% +10.902%]
                        Performance has improved.
Found 10 outliers among 100 measurements (10.00%)
  4 (4.00%) high mild
  6 (6.00%) high severe
Benchmarking encode/simd_4bit/128
Benchmarking encode/simd_4bit/128: Warming up for 3.0000 s
Benchmarking encode/simd_4bit/128: Collecting 100 samples in estimated 5.0001 s (88M iterations)
Benchmarking encode/simd_4bit/128: Analyzing
encode/simd_4bit/128    time:   [56.197 ns 56.239 ns 56.297 ns]
                        thrpt:  [2.1175 GiB/s 2.1197 GiB/s 2.1213 GiB/s]
                 change:
                        time:   [−8.0066% −4.9757% −2.6265%] (p = 0.00 < 0.05)
                        thrpt:  [+2.6974% +5.2362% +8.7034%]
                        Performance has improved.
Found 9 outliers among 100 measurements (9.00%)
  4 (4.00%) high mild
  5 (5.00%) high severe
Benchmarking encode/scalar_2bit/128
Benchmarking encode/scalar_2bit/128: Warming up for 3.0000 s
Benchmarking encode/scalar_2bit/128: Collecting 100 samples in estimated 5.0005 s (38M iterations)
Benchmarking encode/scalar_2bit/128: Analyzing
encode/scalar_2bit/128  time:   [132.54 ns 132.64 ns 132.81 ns]
                        thrpt:  [919.12 MiB/s 920.28 MiB/s 921.04 MiB/s]
                 change:
                        time:   [−24.253% −17.672% −11.056%] (p = 0.00 < 0.05)
                        thrpt:  [+12.431% +21.465% +32.018%]
                        Performance has improved.
Found 8 outliers among 100 measurements (8.00%)
  1 (1.00%) low mild
  3 (3.00%) high mild
  4 (4.00%) high severe
Benchmarking encode/scalar_4bit/128
Benchmarking encode/scalar_4bit/128: Warming up for 3.0000 s
Benchmarking encode/scalar_4bit/128: Collecting 100 samples in estimated 5.0003 s (44M iterations)
Benchmarking encode/scalar_4bit/128: Analyzing
encode/scalar_4bit/128  time:   [112.41 ns 112.51 ns 112.63 ns]
                        thrpt:  [1.0584 GiB/s 1.0596 GiB/s 1.0605 GiB/s]
                 change:
                        time:   [−8.1299% −5.2212% −2.8951%] (p = 0.00 < 0.05)
                        thrpt:  [+2.9815% +5.5088% +8.8494%]
                        Performance has improved.
Found 3 outliers among 100 measurements (3.00%)
  1 (1.00%) high mild
  2 (2.00%) high severe
Benchmarking encode/simd_4bit/255
Benchmarking encode/simd_4bit/255: Warming up for 3.0000 s
Benchmarking encode/simd_4bit/255: Collecting 100 samples in estimated 5.0001 s (55M iterations)
Benchmarking encode/simd_4bit/255: Analyzing
encode/simd_4bit/255    time:   [90.928 ns 90.977 ns 91.040 ns]
                        thrpt:  [2.6086 GiB/s 2.6104 GiB/s 2.6118 GiB/s]
                 change:
                        time:   [−7.0510% −5.0758% −3.5295%] (p = 0.00 < 0.05)
                        thrpt:  [+3.6586% +5.3472% +7.5859%]
                        Performance has improved.
Found 14 outliers among 100 measurements (14.00%)
  7 (7.00%) high mild
  7 (7.00%) high severe
Benchmarking encode/scalar_2bit/255
Benchmarking encode/scalar_2bit/255: Warming up for 3.0000 s
Benchmarking encode/scalar_2bit/255: Collecting 100 samples in estimated 5.0012 s (20M iterations)
Benchmarking encode/scalar_2bit/255: Analyzing
encode/scalar_2bit/255  time:   [255.44 ns 255.58 ns 255.76 ns]
                        thrpt:  [950.84 MiB/s 951.50 MiB/s 952.03 MiB/s]
                 change:
                        time:   [−5.6748% −3.3821% −1.5942%] (p = 0.00 < 0.05)
                        thrpt:  [+1.6200% +3.5004% +6.0162%]
                        Performance has improved.
Found 17 outliers among 100 measurements (17.00%)
  1 (1.00%) low severe
  1 (1.00%) low mild
  2 (2.00%) high mild
  13 (13.00%) high severe
Benchmarking encode/scalar_4bit/255
Benchmarking encode/scalar_4bit/255: Warming up for 3.0000 s
Benchmarking encode/scalar_4bit/255: Collecting 100 samples in estimated 5.0007 s (27M iterations)
Benchmarking encode/scalar_4bit/255: Analyzing
encode/scalar_4bit/255  time:   [189.41 ns 191.12 ns 193.98 ns]
                        thrpt:  [1.2243 GiB/s 1.2426 GiB/s 1.2539 GiB/s]
                 change:
                        time:   [−8.2224% −5.9111% −3.9733%] (p = 0.00 < 0.05)
                        thrpt:  [+4.1377% +6.2824% +8.9590%]
                        Performance has improved.
Found 4 outliers among 100 measurements (4.00%)
  2 (2.00%) high mild
  2 (2.00%) high severe
Benchmarking encode/simd_4bit/256
Benchmarking encode/simd_4bit/256: Warming up for 3.0000 s
Benchmarking encode/simd_4bit/256: Collecting 100 samples in estimated 5.0004 s (57M iterations)
Benchmarking encode/simd_4bit/256: Analyzing
encode/simd_4bit/256    time:   [88.061 ns 88.670 ns 89.644 ns]
                        thrpt:  [2.6596 GiB/s 2.6888 GiB/s 2.7074 GiB/s]
                 change:
                        time:   [−3.6293% −3.0406% −2.2066%] (p = 0.00 < 0.05)
                        thrpt:  [+2.2564% +3.1359% +3.7660%]
                        Performance has improved.
Found 13 outliers among 100 measurements (13.00%)
  5 (5.00%) high mild
  8 (8.00%) high severe
Benchmarking encode/scalar_2bit/256
Benchmarking encode/scalar_2bit/256: Warming up for 3.0000 s
Benchmarking encode/scalar_2bit/256: Collecting 100 samples in estimated 5.0001 s (24M iterations)
Benchmarking encode/scalar_2bit/256: Analyzing
encode/scalar_2bit/256  time:   [204.91 ns 205.06 ns 205.24 ns]
                        thrpt:  [1.1617 GiB/s 1.1627 GiB/s 1.1635 GiB/s]
                 change:
                        time:   [−1.3453% −1.1611% −0.9810%] (p = 0.00 < 0.05)
                        thrpt:  [+0.9907% +1.1748% +1.3636%]
                        Change within noise threshold.
Found 13 outliers among 100 measurements (13.00%)
  4 (4.00%) high mild
  9 (9.00%) high severe
Benchmarking encode/scalar_4bit/256
Benchmarking encode/scalar_4bit/256: Warming up for 3.0000 s
Benchmarking encode/scalar_4bit/256: Collecting 100 samples in estimated 5.0000 s (26M iterations)
Benchmarking encode/scalar_4bit/256: Analyzing
encode/scalar_4bit/256  time:   [188.90 ns 189.03 ns 189.21 ns]
                        thrpt:  [1.2601 GiB/s 1.2613 GiB/s 1.2622 GiB/s]
                 change:
                        time:   [−4.5887% −4.4056% −4.2322%] (p = 0.00 < 0.05)
                        thrpt:  [+4.4192% +4.6086% +4.8094%]
                        Performance has improved.
Found 5 outliers among 100 measurements (5.00%)
  3 (3.00%) high mild
  2 (2.00%) high severe
Benchmarking encode/simd_4bit/512
Benchmarking encode/simd_4bit/512: Warming up for 3.0000 s
Benchmarking encode/simd_4bit/512: Collecting 100 samples in estimated 5.0001 s (32M iterations)
Benchmarking encode/simd_4bit/512: Analyzing
encode/simd_4bit/512    time:   [153.53 ns 153.63 ns 153.76 ns]
                        thrpt:  [3.1011 GiB/s 3.1038 GiB/s 3.1058 GiB/s]
                 change:
                        time:   [−3.2902% −3.1298% −2.9685%] (p = 0.00 < 0.05)
                        thrpt:  [+3.0594% +3.2309% +3.4021%]
                        Performance has improved.
Found 13 outliers among 100 measurements (13.00%)
  4 (4.00%) high mild
  9 (9.00%) high severe
Benchmarking encode/scalar_2bit/512
Benchmarking encode/scalar_2bit/512: Warming up for 3.0000 s
Benchmarking encode/scalar_2bit/512: Collecting 100 samples in estimated 5.0008 s (14M iterations)
Benchmarking encode/scalar_2bit/512: Analyzing
encode/scalar_2bit/512  time:   [359.48 ns 360.54 ns 361.79 ns]
                        thrpt:  [1.3180 GiB/s 1.3226 GiB/s 1.3265 GiB/s]
                 change:
                        time:   [−8.7252% −5.6871% −3.0136%] (p = 0.00 < 0.05)
                        thrpt:  [+3.1073% +6.0300% +9.5592%]
                        Performance has improved.
Found 13 outliers among 100 measurements (13.00%)
  5 (5.00%) high mild
  8 (8.00%) high severe
Benchmarking encode/scalar_4bit/512
Benchmarking encode/scalar_4bit/512: Warming up for 3.0000 s
Benchmarking encode/scalar_4bit/512: Collecting 100 samples in estimated 5.0016 s (14M iterations)
Benchmarking encode/scalar_4bit/512: Analyzing
encode/scalar_4bit/512  time:   [351.72 ns 352.04 ns 352.41 ns]
                        thrpt:  [1.3531 GiB/s 1.3545 GiB/s 1.3557 GiB/s]
                 change:
                        time:   [−7.0509% −4.7817% −2.9240%] (p = 0.00 < 0.05)
                        thrpt:  [+3.0120% +5.0218% +7.5857%]
                        Performance has improved.
Found 3 outliers among 100 measurements (3.00%)
  3 (3.00%) high mild
Benchmarking encode/simd_4bit/1023
Benchmarking encode/simd_4bit/1023: Warming up for 3.0000 s
Benchmarking encode/simd_4bit/1023: Collecting 100 samples in estimated 5.0005 s (18M iterations)
Benchmarking encode/simd_4bit/1023: Analyzing
encode/simd_4bit/1023   time:   [284.26 ns 287.19 ns 291.57 ns]
                        thrpt:  [3.2676 GiB/s 3.3175 GiB/s 3.3517 GiB/s]
                 change:
                        time:   [−7.8173% −5.2437% −3.2637%] (p = 0.00 < 0.05)
                        thrpt:  [+3.3738% +5.5339% +8.4802%]
                        Performance has improved.
Found 9 outliers among 100 measurements (9.00%)
  3 (3.00%) high mild
  6 (6.00%) high severe
Benchmarking encode/scalar_2bit/1023
Benchmarking encode/scalar_2bit/1023: Warming up for 3.0000 s
Benchmarking encode/scalar_2bit/1023: Collecting 100 samples in estimated 5.0020 s (7.0M iterations)
Benchmarking encode/scalar_2bit/1023: Analyzing
encode/scalar_2bit/1023 time:   [713.86 ns 714.91 ns 716.20 ns]
                        thrpt:  [1.3303 GiB/s 1.3327 GiB/s 1.3346 GiB/s]
                 change:
                        time:   [−2.8623% −2.7263% −2.5761%] (p = 0.00 < 0.05)
                        thrpt:  [+2.6443% +2.8027% +2.9466%]
                        Performance has improved.
Found 8 outliers among 100 measurements (8.00%)
  3 (3.00%) high mild
  5 (5.00%) high severe
Benchmarking encode/scalar_4bit/1023
Benchmarking encode/scalar_4bit/1023: Warming up for 3.0000 s
Benchmarking encode/scalar_4bit/1023: Collecting 100 samples in estimated 5.0021 s (7.4M iterations)
Benchmarking encode/scalar_4bit/1023: Analyzing
encode/scalar_4bit/1023 time:   [669.73 ns 670.06 ns 670.45 ns]
                        thrpt:  [1.4210 GiB/s 1.4219 GiB/s 1.4226 GiB/s]
                 change:
                        time:   [−6.2511% −4.2598% −2.6699%] (p = 0.00 < 0.05)
                        thrpt:  [+2.7432% +4.4494% +6.6679%]
                        Performance has improved.
Found 11 outliers among 100 measurements (11.00%)
  5 (5.00%) high mild
  6 (6.00%) high severe
Benchmarking encode/simd_4bit/1024
Benchmarking encode/simd_4bit/1024: Warming up for 3.0000 s
Benchmarking encode/simd_4bit/1024: Collecting 100 samples in estimated 5.0001 s (18M iterations)
Benchmarking encode/simd_4bit/1024: Analyzing
encode/simd_4bit/1024   time:   [281.41 ns 283.33 ns 287.21 ns]
                        thrpt:  [3.3204 GiB/s 3.3660 GiB/s 3.3889 GiB/s]
                 change:
                        time:   [−8.0081% −5.6175% −3.6129%] (p = 0.00 < 0.05)
                        thrpt:  [+3.7483% +5.9518% +8.7052%]
                        Performance has improved.
Found 9 outliers among 100 measurements (9.00%)
  2 (2.00%) high mild
  7 (7.00%) high severe
Benchmarking encode/scalar_2bit/1024
Benchmarking encode/scalar_2bit/1024: Warming up for 3.0000 s
Benchmarking encode/scalar_2bit/1024: Collecting 100 samples in estimated 5.0006 s (7.6M iterations)
Benchmarking encode/scalar_2bit/1024: Analyzing
encode/scalar_2bit/1024 time:   [654.37 ns 655.45 ns 656.76 ns]
                        thrpt:  [1.4521 GiB/s 1.4550 GiB/s 1.4574 GiB/s]
                 change:
                        time:   [−8.0942% −4.9551% −2.5184%] (p = 0.00 < 0.05)
                        thrpt:  [+2.5834% +5.2134% +8.8071%]
                        Performance has improved.
Found 11 outliers among 100 measurements (11.00%)
  3 (3.00%) high mild
  8 (8.00%) high severe
Benchmarking encode/scalar_4bit/1024
Benchmarking encode/scalar_4bit/1024: Warming up for 3.0000 s
Benchmarking encode/scalar_4bit/1024: Collecting 100 samples in estimated 5.0015 s (7.4M iterations)
Benchmarking encode/scalar_4bit/1024: Analyzing
encode/scalar_4bit/1024 time:   [672.82 ns 673.83 ns 675.27 ns]
                        thrpt:  [1.4123 GiB/s 1.4153 GiB/s 1.4174 GiB/s]
                 change:
                        time:   [−10.950% −6.9555% −3.2416%] (p = 0.00 < 0.05)
                        thrpt:  [+3.3502% +7.4754% +12.297%]
                        Performance has improved.
Found 13 outliers among 100 measurements (13.00%)
  1 (1.00%) high mild
  12 (12.00%) high severe
Benchmarking encode/simd_4bit/2048
Benchmarking encode/simd_4bit/2048: Warming up for 3.0000 s
Benchmarking encode/simd_4bit/2048: Collecting 100 samples in estimated 5.0023 s (9.1M iterations)
Benchmarking encode/simd_4bit/2048: Analyzing
encode/simd_4bit/2048   time:   [545.29 ns 545.68 ns 546.22 ns]
                        thrpt:  [3.4919 GiB/s 3.4953 GiB/s 3.4979 GiB/s]
                 change:
                        time:   [−10.159% −6.8216% −4.1322%] (p = 0.00 < 0.05)
                        thrpt:  [+4.3103% +7.3210% +11.308%]
                        Performance has improved.
Found 9 outliers among 100 measurements (9.00%)
  3 (3.00%) high mild
  6 (6.00%) high severe
Benchmarking encode/scalar_2bit/2048
Benchmarking encode/scalar_2bit/2048: Warming up for 3.0000 s
Benchmarking encode/scalar_2bit/2048: Collecting 100 samples in estimated 5.0048 s (4.0M iterations)
Benchmarking encode/scalar_2bit/2048: Analyzing
encode/scalar_2bit/2048 time:   [1.2534 µs 1.2572 µs 1.2636 µs]
                        thrpt:  [1.5094 GiB/s 1.5172 GiB/s 1.5218 GiB/s]
                 change:
                        time:   [−9.1204% −5.6029% −2.7605%] (p = 0.00 < 0.05)
                        thrpt:  [+2.8388% +5.9355% +10.036%]
                        Performance has improved.
Found 11 outliers among 100 measurements (11.00%)
  6 (6.00%) high mild
  5 (5.00%) high severe
Benchmarking encode/scalar_4bit/2048
Benchmarking encode/scalar_4bit/2048: Warming up for 3.0000 s
Benchmarking encode/scalar_4bit/2048: Collecting 100 samples in estimated 5.0028 s (3.7M iterations)
Benchmarking encode/scalar_4bit/2048: Analyzing
encode/scalar_4bit/2048 time:   [1.3348 µs 1.3362 µs 1.3378 µs]
                        thrpt:  [1.4257 GiB/s 1.4275 GiB/s 1.4290 GiB/s]
                 change:
                        time:   [−2.8649% −2.6898% −2.5144%] (p = 0.00 < 0.05)
                        thrpt:  [+2.5793% +2.7641% +2.9494%]
                        Performance has improved.
Found 13 outliers among 100 measurements (13.00%)
  7 (7.00%) high mild
  6 (6.00%) high severe
Benchmarking encode/simd_4bit/4095
Benchmarking encode/simd_4bit/4095: Warming up for 3.0000 s
Benchmarking encode/simd_4bit/4095: Collecting 100 samples in estimated 5.0038 s (4.6M iterations)
Benchmarking encode/simd_4bit/4095: Analyzing
encode/simd_4bit/4095   time:   [1.0782 µs 1.0793 µs 1.0808 µs]
                        thrpt:  [3.5287 GiB/s 3.5337 GiB/s 3.5372 GiB/s]
                 change:
                        time:   [−3.5986% −3.4261% −3.2258%] (p = 0.00 < 0.05)
                        thrpt:  [+3.3333% +3.5477% +3.7329%]
                        Performance has improved.
Found 16 outliers among 100 measurements (16.00%)
  2 (2.00%) low mild
  6 (6.00%) high mild
  8 (8.00%) high severe
Benchmarking encode/scalar_2bit/4095
Benchmarking encode/scalar_2bit/4095: Warming up for 3.0000 s
Benchmarking encode/scalar_2bit/4095: Collecting 100 samples in estimated 5.0062 s (2.0M iterations)
Benchmarking encode/scalar_2bit/4095: Analyzing
encode/scalar_2bit/4095 time:   [2.5308 µs 2.5319 µs 2.5333 µs]
                        thrpt:  [1.5055 GiB/s 1.5063 GiB/s 1.5069 GiB/s]
                 change:
                        time:   [−3.9385% −2.7050% −2.0027%] (p = 0.00 < 0.05)
                        thrpt:  [+2.0436% +2.7802% +4.1000%]
                        Performance has improved.
Found 10 outliers among 100 measurements (10.00%)
  2 (2.00%) high mild
  8 (8.00%) high severe
Benchmarking encode/scalar_4bit/4095
Benchmarking encode/scalar_4bit/4095: Warming up for 3.0000 s
Benchmarking encode/scalar_4bit/4095: Collecting 100 samples in estimated 5.0032 s (1.9M iterations)
Benchmarking encode/scalar_4bit/4095: Analyzing
encode/scalar_4bit/4095 time:   [2.6100 µs 2.6244 µs 2.6517 µs]
                        thrpt:  [1.4382 GiB/s 1.4532 GiB/s 1.4612 GiB/s]
                 change:
                        time:   [−6.4035% −4.3325% −2.6874%] (p = 0.00 < 0.05)
                        thrpt:  [+2.7616% +4.5287% +6.8416%]
                        Performance has improved.
Found 13 outliers among 100 measurements (13.00%)
  1 (1.00%) low mild
  3 (3.00%) high mild
  9 (9.00%) high severe
Benchmarking encode/simd_4bit/4096
Benchmarking encode/simd_4bit/4096: Warming up for 3.0000 s
Benchmarking encode/simd_4bit/4096: Collecting 100 samples in estimated 5.0037 s (4.6M iterations)
Benchmarking encode/simd_4bit/4096: Analyzing
encode/simd_4bit/4096   time:   [1.0843 µs 1.0891 µs 1.0986 µs]
                        thrpt:  [3.4723 GiB/s 3.5025 GiB/s 3.5180 GiB/s]
                 change:
                        time:   [−6.0015% −4.0177% −2.4503%] (p = 0.00 < 0.05)
                        thrpt:  [+2.5119% +4.1859% +6.3846%]
                        Performance has improved.
Found 11 outliers among 100 measurements (11.00%)
  4 (4.00%) high mild
  7 (7.00%) high severe
Benchmarking encode/scalar_2bit/4096
Benchmarking encode/scalar_2bit/4096: Warming up for 3.0000 s
Benchmarking encode/scalar_2bit/4096: Collecting 100 samples in estimated 5.0020 s (2.1M iterations)
Benchmarking encode/scalar_2bit/4096: Analyzing
encode/scalar_2bit/4096 time:   [2.4253 µs 2.4270 µs 2.4290 µs]
                        thrpt:  [1.5705 GiB/s 1.5718 GiB/s 1.5729 GiB/s]
                 change:
                        time:   [−7.8110% −5.0725% −2.8280%] (p = 0.00 < 0.05)
                        thrpt:  [+2.9103% +5.3436% +8.4728%]
                        Performance has improved.
Found 10 outliers among 100 measurements (10.00%)
  1 (1.00%) low mild
  4 (4.00%) high mild
  5 (5.00%) high severe
Benchmarking encode/scalar_4bit/4096
Benchmarking encode/scalar_4bit/4096: Warming up for 3.0000 s
Benchmarking encode/scalar_4bit/4096: Collecting 100 samples in estimated 5.0011 s (1.9M iterations)
Benchmarking encode/scalar_4bit/4096: Analyzing
encode/scalar_4bit/4096 time:   [2.6103 µs 2.6121 µs 2.6143 µs]
                        thrpt:  [1.4592 GiB/s 1.4604 GiB/s 1.4614 GiB/s]
                 change:
                        time:   [−2.6323% −2.4620% −2.3070%] (p = 0.00 < 0.05)
                        thrpt:  [+2.3615% +2.5242% +2.7034%]
                        Performance has improved.
Found 12 outliers among 100 measurements (12.00%)
  6 (6.00%) high mild
  6 (6.00%) high severe
Benchmarking encode/simd_4bit/8192
Benchmarking encode/simd_4bit/8192: Warming up for 3.0000 s
Benchmarking encode/simd_4bit/8192: Collecting 100 samples in estimated 5.0008 s (2.3M iterations)
Benchmarking encode/simd_4bit/8192: Analyzing
encode/simd_4bit/8192   time:   [2.1416 µs 2.1434 µs 2.1459 µs]
                        thrpt:  [3.5553 GiB/s 3.5595 GiB/s 3.5626 GiB/s]
                 change:
                        time:   [−2.5584% −2.3565% −2.1368%] (p = 0.00 < 0.05)
                        thrpt:  [+2.1835% +2.4134% +2.6256%]
                        Performance has improved.
Found 10 outliers among 100 measurements (10.00%)
  2 (2.00%) high mild
  8 (8.00%) high severe
Benchmarking encode/scalar_2bit/8192
Benchmarking encode/scalar_2bit/8192: Warming up for 3.0000 s
Benchmarking encode/scalar_2bit/8192: Collecting 100 samples in estimated 5.0125 s (1.1M iterations)
Benchmarking encode/scalar_2bit/8192: Analyzing
encode/scalar_2bit/8192 time:   [4.7642 µs 4.8092 µs 4.8753 µs]
                        thrpt:  [1.5649 GiB/s 1.5864 GiB/s 1.6014 GiB/s]
                 change:
                        time:   [−2.6914% −2.1702% −1.4550%] (p = 0.00 < 0.05)
                        thrpt:  [+1.4765% +2.2184% +2.7658%]
                        Performance has improved.
Found 15 outliers among 100 measurements (15.00%)
  6 (6.00%) high mild
  9 (9.00%) high severe
Benchmarking encode/scalar_4bit/8192
Benchmarking encode/scalar_4bit/8192: Warming up for 3.0000 s
Benchmarking encode/scalar_4bit/8192: Collecting 100 samples in estimated 5.0219 s (970k iterations)
Benchmarking encode/scalar_4bit/8192: Analyzing
encode/scalar_4bit/8192 time:   [5.1796 µs 5.2087 µs 5.2548 µs]
                        thrpt:  [1.4519 GiB/s 1.4647 GiB/s 1.4730 GiB/s]
                 change:
                        time:   [−5.6229% −3.5506% −1.9904%] (p = 0.00 < 0.05)
                        thrpt:  [+2.0308% +3.6813% +5.9579%]
                        Performance has improved.
Found 19 outliers among 100 measurements (19.00%)
  1 (1.00%) low mild
  8 (8.00%) high mild
  10 (10.00%) high severe
Benchmarking encode/simd_4bit/9999
Benchmarking encode/simd_4bit/9999: Warming up for 3.0000 s
Benchmarking encode/simd_4bit/9999: Collecting 100 samples in estimated 5.0059 s (1.9M iterations)
Benchmarking encode/simd_4bit/9999: Analyzing
encode/simd_4bit/9999   time:   [2.6205 µs 2.6227 µs 2.6255 µs]
                        thrpt:  [3.5469 GiB/s 3.5507 GiB/s 3.5537 GiB/s]
                 change:
                        time:   [−6.5716% −4.3890% −2.7087%] (p = 0.00 < 0.05)
                        thrpt:  [+2.7841% +4.5905% +7.0339%]
                        Performance has improved.
Found 14 outliers among 100 measurements (14.00%)
  7 (7.00%) high mild
  7 (7.00%) high severe
Benchmarking encode/scalar_2bit/9999
Benchmarking encode/scalar_2bit/9999: Warming up for 3.0000 s
Benchmarking encode/scalar_2bit/9999: Collecting 100 samples in estimated 5.0103 s (838k iterations)
Benchmarking encode/scalar_2bit/9999: Analyzing
encode/scalar_2bit/9999 time:   [5.9676 µs 5.9715 µs 5.9766 µs]
                        thrpt:  [1.5581 GiB/s 1.5595 GiB/s 1.5605 GiB/s]
                 change:
                        time:   [−7.2158% −4.7419% −2.8093%] (p = 0.00 < 0.05)
                        thrpt:  [+2.8905% +4.9779% +7.7769%]
                        Performance has improved.
Found 10 outliers among 100 measurements (10.00%)
  3 (3.00%) high mild
  7 (7.00%) high severe
Benchmarking encode/scalar_4bit/9999
Benchmarking encode/scalar_4bit/9999: Warming up for 3.0000 s
Benchmarking encode/scalar_4bit/9999: Collecting 100 samples in estimated 5.0239 s (793k iterations)
Benchmarking encode/scalar_4bit/9999: Analyzing
encode/scalar_4bit/9999 time:   [6.3284 µs 6.3322 µs 6.3370 µs]
                        thrpt:  [1.4695 GiB/s 1.4706 GiB/s 1.4715 GiB/s]
                 change:
                        time:   [−6.8919% −4.3845% −2.5593%] (p = 0.00 < 0.05)
                        thrpt:  [+2.6265% +4.5856% +7.4021%]
                        Performance has improved.
Found 8 outliers among 100 measurements (8.00%)
  3 (3.00%) high mild
  5 (5.00%) high severe
Benchmarking encode/simd_4bit/10000
Benchmarking encode/simd_4bit/10000: Warming up for 3.0000 s
Benchmarking encode/simd_4bit/10000: Collecting 100 samples in estimated 5.0117 s (1.9M iterations)
Benchmarking encode/simd_4bit/10000: Analyzing
encode/simd_4bit/10000  time:   [2.6158 µs 2.6176 µs 2.6201 µs]
                        thrpt:  [3.5545 GiB/s 3.5579 GiB/s 3.5603 GiB/s]
                 change:
                        time:   [−7.1777% −5.0450% −3.2852%] (p = 0.00 < 0.05)
                        thrpt:  [+3.3968% +5.3130% +7.7327%]
                        Performance has improved.
Found 9 outliers among 100 measurements (9.00%)
  2 (2.00%) high mild
  7 (7.00%) high severe
Benchmarking encode/scalar_2bit/10000
Benchmarking encode/scalar_2bit/10000: Warming up for 3.0000 s
Benchmarking encode/scalar_2bit/10000: Collecting 100 samples in estimated 5.0144 s (864k iterations)
Benchmarking encode/scalar_2bit/10000: Analyzing
encode/scalar_2bit/10000
                        time:   [5.7885 µs 5.7923 µs 5.7978 µs]
                        thrpt:  [1.6063 GiB/s 1.6079 GiB/s 1.6089 GiB/s]
                 change:
                        time:   [−7.9622% −5.2748% −3.1425%] (p = 0.00 < 0.05)
                        thrpt:  [+3.2445% +5.5685% +8.6510%]
                        Performance has improved.
Found 13 outliers among 100 measurements (13.00%)
  1 (1.00%) low mild
  7 (7.00%) high mild
  5 (5.00%) high severe
Benchmarking encode/scalar_4bit/10000
Benchmarking encode/scalar_4bit/10000: Warming up for 3.0000 s
Benchmarking encode/scalar_4bit/10000: Collecting 100 samples in estimated 5.0094 s (783k iterations)
Benchmarking encode/scalar_4bit/10000: Analyzing
encode/scalar_4bit/10000
                        time:   [6.3252 µs 6.3456 µs 6.3818 µs]
                        thrpt:  [1.4593 GiB/s 1.4677 GiB/s 1.4724 GiB/s]
                 change:
                        time:   [−5.7173% −3.9249% −2.4827%] (p = 0.00 < 0.05)
                        thrpt:  [+2.5459% +4.0852% +6.0640%]
                        Performance has improved.
Found 12 outliers among 100 measurements (12.00%)
  2 (2.00%) low mild
  2 (2.00%) high mild
  8 (8.00%) high severe

Benchmarking decode/simd_4bit/15
Benchmarking decode/simd_4bit/15: Warming up for 3.0000 s
Benchmarking decode/simd_4bit/15: Collecting 100 samples in estimated 5.0000 s (171M iterations)
Benchmarking decode/simd_4bit/15: Analyzing
decode/simd_4bit/15     time:   [29.147 ns 29.162 ns 29.180 ns]
                        thrpt:  [490.24 MiB/s 490.53 MiB/s 490.79 MiB/s]
                 change:
                        time:   [−7.5753% −4.9710% −2.9325%] (p = 0.00 < 0.05)
                        thrpt:  [+3.0211% +5.2310% +8.1961%]
                        Performance has improved.
Found 16 outliers among 100 measurements (16.00%)
  1 (1.00%) low mild
  5 (5.00%) high mild
  10 (10.00%) high severe
Benchmarking decode/scalar_2bit/15
Benchmarking decode/scalar_2bit/15: Warming up for 3.0000 s
Benchmarking decode/scalar_2bit/15: Collecting 100 samples in estimated 5.0001 s (171M iterations)
Benchmarking decode/scalar_2bit/15: Analyzing
decode/scalar_2bit/15   time:   [29.194 ns 29.208 ns 29.223 ns]
                        thrpt:  [489.52 MiB/s 489.77 MiB/s 489.99 MiB/s]
                 change:
                        time:   [+4.5789% +7.2819% +9.4721%] (p = 0.00 < 0.05)
                        thrpt:  [−8.6525% −6.7876% −4.3784%]
                        Performance has regressed.
Found 13 outliers among 100 measurements (13.00%)
  5 (5.00%) high mild
  8 (8.00%) high severe
Benchmarking decode/scalar_4bit/15
Benchmarking decode/scalar_4bit/15: Warming up for 3.0000 s
Benchmarking decode/scalar_4bit/15: Collecting 100 samples in estimated 5.0001 s (201M iterations)
Benchmarking decode/scalar_4bit/15: Analyzing
decode/scalar_4bit/15   time:   [25.080 ns 25.221 ns 25.376 ns]
                        thrpt:  [563.73 MiB/s 567.19 MiB/s 570.37 MiB/s]
                 change:
                        time:   [−3.2316% −2.5250% −1.8087%] (p = 0.00 < 0.05)
                        thrpt:  [+1.8420% +2.5904% +3.3395%]
                        Performance has improved.
Benchmarking decode/simd_4bit/16
Benchmarking decode/simd_4bit/16: Warming up for 3.0000 s
Benchmarking decode/simd_4bit/16: Collecting 100 samples in estimated 5.0001 s (187M iterations)
Benchmarking decode/simd_4bit/16: Analyzing
decode/simd_4bit/16     time:   [26.733 ns 26.753 ns 26.779 ns]
                        thrpt:  [569.81 MiB/s 570.37 MiB/s 570.78 MiB/s]
                 change:
                        time:   [−3.5704% −3.3565% −3.1494%] (p = 0.00 < 0.05)
                        thrpt:  [+3.2518% +3.4731% +3.7026%]
                        Performance has improved.
Found 10 outliers among 100 measurements (10.00%)
  1 (1.00%) low mild
  6 (6.00%) high mild
  3 (3.00%) high severe
Benchmarking decode/scalar_2bit/16
Benchmarking decode/scalar_2bit/16: Warming up for 3.0000 s
Benchmarking decode/scalar_2bit/16: Collecting 100 samples in estimated 5.0001 s (170M iterations)
Benchmarking decode/scalar_2bit/16: Analyzing
decode/scalar_2bit/16   time:   [29.450 ns 29.487 ns 29.532 ns]
                        thrpt:  [516.69 MiB/s 517.47 MiB/s 518.12 MiB/s]
                 change:
                        time:   [+9.7767% +9.9978% +10.252%] (p = 0.00 < 0.05)
                        thrpt:  [−9.2983% −9.0891% −8.9060%]
                        Performance has regressed.
Found 12 outliers among 100 measurements (12.00%)
  1 (1.00%) high mild
  11 (11.00%) high severe
Benchmarking decode/scalar_4bit/16
Benchmarking decode/scalar_4bit/16: Warming up for 3.0000 s
Benchmarking decode/scalar_4bit/16: Collecting 100 samples in estimated 5.0000 s (199M iterations)
Benchmarking decode/scalar_4bit/16: Analyzing
decode/scalar_4bit/16   time:   [25.333 ns 25.474 ns 25.649 ns]
                        thrpt:  [594.90 MiB/s 598.99 MiB/s 602.33 MiB/s]
                 change:
                        time:   [−2.5723% −1.8775% −1.0821%] (p = 0.00 < 0.05)
                        thrpt:  [+1.0940% +1.9134% +2.6402%]
                        Performance has improved.
Found 2 outliers among 100 measurements (2.00%)
  1 (1.00%) high mild
  1 (1.00%) high severe
Benchmarking decode/simd_4bit/17
Benchmarking decode/simd_4bit/17: Warming up for 3.0000 s
Benchmarking decode/simd_4bit/17: Collecting 100 samples in estimated 5.0000 s (187M iterations)
Benchmarking decode/simd_4bit/17: Analyzing
decode/simd_4bit/17     time:   [26.784 ns 26.841 ns 26.895 ns]
                        thrpt:  [602.81 MiB/s 604.01 MiB/s 605.30 MiB/s]
                 change:
                        time:   [−13.806% −8.9475% −4.3737%] (p = 0.00 < 0.05)
                        thrpt:  [+4.5738% +9.8268% +16.017%]
                        Performance has improved.
Found 8 outliers among 100 measurements (8.00%)
  1 (1.00%) low mild
  5 (5.00%) high mild
  2 (2.00%) high severe
Benchmarking decode/scalar_2bit/17
Benchmarking decode/scalar_2bit/17: Warming up for 3.0000 s
Benchmarking decode/scalar_2bit/17: Collecting 100 samples in estimated 5.0001 s (170M iterations)
Benchmarking decode/scalar_2bit/17: Analyzing
decode/scalar_2bit/17   time:   [28.945 ns 28.969 ns 28.997 ns]
                        thrpt:  [559.11 MiB/s 559.65 MiB/s 560.12 MiB/s]
                 change:
                        time:   [−9.3230% −6.2537% −3.6847%] (p = 0.00 < 0.05)
                        thrpt:  [+3.8257% +6.6709% +10.281%]
                        Performance has improved.
Found 13 outliers among 100 measurements (13.00%)
  4 (4.00%) high mild
  9 (9.00%) high severe
Benchmarking decode/scalar_4bit/17
Benchmarking decode/scalar_4bit/17: Warming up for 3.0000 s
Benchmarking decode/scalar_4bit/17: Collecting 100 samples in estimated 5.0001 s (182M iterations)
Benchmarking decode/scalar_4bit/17: Analyzing
decode/scalar_4bit/17   time:   [27.451 ns 27.478 ns 27.511 ns]
                        thrpt:  [589.31 MiB/s 590.01 MiB/s 590.59 MiB/s]
                 change:
                        time:   [−10.002% −7.0692% −4.5919%] (p = 0.00 < 0.05)
                        thrpt:  [+4.8130% +7.6069% +11.114%]
                        Performance has improved.
Found 15 outliers among 100 measurements (15.00%)
  8 (8.00%) high mild
  7 (7.00%) high severe
Benchmarking decode/simd_4bit/32
Benchmarking decode/simd_4bit/32: Warming up for 3.0000 s
Benchmarking decode/simd_4bit/32: Collecting 100 samples in estimated 5.0001 s (176M iterations)
Benchmarking decode/simd_4bit/32: Analyzing
decode/simd_4bit/32     time:   [28.437 ns 28.474 ns 28.520 ns]
                        thrpt:  [1.0450 GiB/s 1.0466 GiB/s 1.0480 GiB/s]
                 change:
                        time:   [−10.773% −6.6079% −3.0140%] (p = 0.00 < 0.05)
                        thrpt:  [+3.1076% +7.0754% +12.074%]
                        Performance has improved.
Found 12 outliers among 100 measurements (12.00%)
  5 (5.00%) high mild
  7 (7.00%) high severe
Benchmarking decode/scalar_2bit/32
Benchmarking decode/scalar_2bit/32: Warming up for 3.0000 s
Benchmarking decode/scalar_2bit/32: Collecting 100 samples in estimated 5.0001 s (147M iterations)
Benchmarking decode/scalar_2bit/32: Analyzing
decode/scalar_2bit/32   time:   [33.667 ns 33.709 ns 33.762 ns]
                        thrpt:  [903.89 MiB/s 905.32 MiB/s 906.45 MiB/s]
                 change:
                        time:   [−15.847% −11.312% −7.2120%] (p = 0.00 < 0.05)
                        thrpt:  [+7.7726% +12.755% +18.832%]
                        Performance has improved.
Found 14 outliers among 100 measurements (14.00%)
  5 (5.00%) high mild
  9 (9.00%) high severe
Benchmarking decode/scalar_4bit/32
Benchmarking decode/scalar_4bit/32: Warming up for 3.0000 s
Benchmarking decode/scalar_4bit/32: Collecting 100 samples in estimated 5.0000 s (148M iterations)
Benchmarking decode/scalar_4bit/32: Analyzing
decode/scalar_4bit/32   time:   [33.594 ns 33.821 ns 34.232 ns]
                        thrpt:  [891.50 MiB/s 902.33 MiB/s 908.42 MiB/s]
                 change:
                        time:   [−18.089% −11.396% −5.8256%] (p = 0.00 < 0.05)
                        thrpt:  [+6.1859% +12.861% +22.083%]
                        Performance has improved.
Found 11 outliers among 100 measurements (11.00%)
  1 (1.00%) high mild
  10 (10.00%) high severe
Benchmarking decode/simd_4bit/33
Benchmarking decode/simd_4bit/33: Warming up for 3.0000 s
Benchmarking decode/simd_4bit/33: Collecting 100 samples in estimated 5.0001 s (172M iterations)
Benchmarking decode/simd_4bit/33: Analyzing
decode/simd_4bit/33     time:   [29.167 ns 29.396 ns 29.750 ns]
                        thrpt:  [1.0331 GiB/s 1.0455 GiB/s 1.0537 GiB/s]
                 change:
                        time:   [−3.3002% −2.7595% −2.1612%] (p = 0.00 < 0.05)
                        thrpt:  [+2.2089% +2.8379% +3.4128%]
                        Performance has improved.
Found 6 outliers among 100 measurements (6.00%)
  2 (2.00%) high mild
  4 (4.00%) high severe
Benchmarking decode/scalar_2bit/33
Benchmarking decode/scalar_2bit/33: Warming up for 3.0000 s
Benchmarking decode/scalar_2bit/33: Collecting 100 samples in estimated 5.0002 s (146M iterations)
Benchmarking decode/scalar_2bit/33: Analyzing
decode/scalar_2bit/33   time:   [34.319 ns 34.341 ns 34.375 ns]
                        thrpt:  [915.54 MiB/s 916.43 MiB/s 917.03 MiB/s]
                 change:
                        time:   [−9.7619% −9.6234% −9.4851%] (p = 0.00 < 0.05)
                        thrpt:  [+10.479% +10.648% +10.818%]
                        Performance has improved.
Found 10 outliers among 100 measurements (10.00%)
  1 (1.00%) low mild
  2 (2.00%) high mild
  7 (7.00%) high severe
Benchmarking decode/scalar_4bit/33
Benchmarking decode/scalar_4bit/33: Warming up for 3.0000 s
Benchmarking decode/scalar_4bit/33: Collecting 100 samples in estimated 5.0001 s (144M iterations)
Benchmarking decode/scalar_4bit/33: Analyzing
decode/scalar_4bit/33   time:   [34.624 ns 34.645 ns 34.674 ns]
                        thrpt:  [907.64 MiB/s 908.38 MiB/s 908.94 MiB/s]
                 change:
                        time:   [−1.9972% −1.8279% −1.6617%] (p = 0.00 < 0.05)
                        thrpt:  [+1.6897% +1.8620% +2.0379%]
                        Performance has improved.
Found 16 outliers among 100 measurements (16.00%)
  1 (1.00%) low mild
  7 (7.00%) high mild
  8 (8.00%) high severe
Benchmarking decode/simd_4bit/63
Benchmarking decode/simd_4bit/63: Warming up for 3.0000 s
Benchmarking decode/simd_4bit/63: Collecting 100 samples in estimated 5.0001 s (146M iterations)
Benchmarking decode/simd_4bit/63: Analyzing
decode/simd_4bit/63     time:   [34.253 ns 34.282 ns 34.318 ns]
                        thrpt:  [1.7097 GiB/s 1.7115 GiB/s 1.7130 GiB/s]
                 change:
                        time:   [−10.486% −10.341% −10.191%] (p = 0.00 < 0.05)
                        thrpt:  [+11.347% +11.534% +11.714%]
                        Performance has improved.
Found 9 outliers among 100 measurements (9.00%)
  5 (5.00%) high mild
  4 (4.00%) high severe
Benchmarking decode/scalar_2bit/63
Benchmarking decode/scalar_2bit/63: Warming up for 3.0000 s
Benchmarking decode/scalar_2bit/63: Collecting 100 samples in estimated 5.0002 s (119M iterations)
Benchmarking decode/scalar_2bit/63: Analyzing
decode/scalar_2bit/63   time:   [41.998 ns 42.025 ns 42.057 ns]
                        thrpt:  [1.3951 GiB/s 1.3962 GiB/s 1.3970 GiB/s]
                 change:
                        time:   [−3.0702% −2.3140% −1.8482%] (p = 0.00 < 0.05)
                        thrpt:  [+1.8830% +2.3688% +3.1675%]
                        Performance has improved.
Found 16 outliers among 100 measurements (16.00%)
  6 (6.00%) high mild
  10 (10.00%) high severe
Benchmarking decode/scalar_4bit/63
Benchmarking decode/scalar_4bit/63: Warming up for 3.0000 s
Benchmarking decode/scalar_4bit/63: Collecting 100 samples in estimated 5.0000 s (113M iterations)
Benchmarking decode/scalar_4bit/63: Analyzing
decode/scalar_4bit/63   time:   [44.217 ns 44.517 ns 44.981 ns]
                        thrpt:  [1.3044 GiB/s 1.3180 GiB/s 1.3269 GiB/s]
                 change:
                        time:   [−8.6881% −5.6312% −2.9314%] (p = 0.00 < 0.05)
                        thrpt:  [+3.0199% +5.9673% +9.5148%]
                        Performance has improved.
Found 12 outliers among 100 measurements (12.00%)
  4 (4.00%) high mild
  8 (8.00%) high severe
Benchmarking decode/simd_4bit/64
Benchmarking decode/simd_4bit/64: Warming up for 3.0000 s
Benchmarking decode/simd_4bit/64: Collecting 100 samples in estimated 5.0000 s (156M iterations)
Benchmarking decode/simd_4bit/64: Analyzing
decode/simd_4bit/64     time:   [32.129 ns 32.282 ns 32.518 ns]
                        thrpt:  [1.8330 GiB/s 1.8464 GiB/s 1.8552 GiB/s]
                 change:
                        time:   [−20.338% −15.585% −11.547%] (p = 0.00 < 0.05)
                        thrpt:  [+13.055% +18.462% +25.530%]
                        Performance has improved.
Found 14 outliers among 100 measurements (14.00%)
  7 (7.00%) high mild
  7 (7.00%) high severe
Benchmarking decode/scalar_2bit/64
Benchmarking decode/scalar_2bit/64: Warming up for 3.0000 s
Benchmarking decode/scalar_2bit/64: Collecting 100 samples in estimated 5.0001 s (118M iterations)
Benchmarking decode/scalar_2bit/64: Analyzing
decode/scalar_2bit/64   time:   [42.375 ns 42.391 ns 42.409 ns]
                        thrpt:  [1.4055 GiB/s 1.4061 GiB/s 1.4066 GiB/s]
                 change:
                        time:   [−2.3426% −2.1553% −1.9450%] (p = 0.00 < 0.05)
                        thrpt:  [+1.9836% +2.2028% +2.3988%]
                        Performance has improved.
Found 13 outliers among 100 measurements (13.00%)
  5 (5.00%) high mild
  8 (8.00%) high severe
Benchmarking decode/scalar_4bit/64
Benchmarking decode/scalar_4bit/64: Warming up for 3.0000 s
Benchmarking decode/scalar_4bit/64: Collecting 100 samples in estimated 5.0002 s (113M iterations)
Benchmarking decode/scalar_4bit/64: Analyzing
decode/scalar_4bit/64   time:   [44.338 ns 44.374 ns 44.429 ns]
                        thrpt:  [1.3416 GiB/s 1.3432 GiB/s 1.3443 GiB/s]
                 change:
                        time:   [+1.7972% +2.0603% +2.3255%] (p = 0.00 < 0.05)
                        thrpt:  [−2.2726% −2.0187% −1.7655%]
                        Performance has regressed.
Found 9 outliers among 100 measurements (9.00%)
  1 (1.00%) low mild
  1 (1.00%) high mild
  7 (7.00%) high severe
Benchmarking decode/simd_4bit/127
Benchmarking decode/simd_4bit/127: Warming up for 3.0000 s
Benchmarking decode/simd_4bit/127: Collecting 100 samples in estimated 5.0002 s (103M iterations)
Benchmarking decode/simd_4bit/127: Analyzing
decode/simd_4bit/127    time:   [48.255 ns 48.305 ns 48.375 ns]
                        thrpt:  [2.4450 GiB/s 2.4486 GiB/s 2.4511 GiB/s]
                 change:
                        time:   [−22.120% −14.817% −7.8581%] (p = 0.00 < 0.05)
                        thrpt:  [+8.5283% +17.394% +28.402%]
                        Performance has improved.
Found 9 outliers among 100 measurements (9.00%)
  5 (5.00%) high mild
  4 (4.00%) high severe
Benchmarking decode/scalar_2bit/127
Benchmarking decode/scalar_2bit/127: Warming up for 3.0000 s
Benchmarking decode/scalar_2bit/127: Collecting 100 samples in estimated 5.0003 s (75M iterations)
Benchmarking decode/scalar_2bit/127: Analyzing
decode/scalar_2bit/127  time:   [66.622 ns 66.706 ns 66.816 ns]
                        thrpt:  [1.7702 GiB/s 1.7731 GiB/s 1.7754 GiB/s]
                 change:
                        time:   [+2.1427% +2.7097% +3.1385%] (p = 0.00 < 0.05)
                        thrpt:  [−3.0430% −2.6383% −2.0978%]
                        Performance has regressed.
Found 13 outliers among 100 measurements (13.00%)
  1 (1.00%) high mild
  12 (12.00%) high severe
Benchmarking decode/scalar_4bit/127
Benchmarking decode/scalar_4bit/127: Warming up for 3.0000 s
Benchmarking decode/scalar_4bit/127: Collecting 100 samples in estimated 5.0001 s (81M iterations)
Benchmarking decode/scalar_4bit/127: Analyzing
decode/scalar_4bit/127  time:   [61.489 ns 61.711 ns 62.112 ns]
                        thrpt:  [1.9043 GiB/s 1.9167 GiB/s 1.9236 GiB/s]
                 change:
                        time:   [−4.3466% −3.4892% −2.5309%] (p = 0.00 < 0.05)
                        thrpt:  [+2.5966% +3.6154% +4.5441%]
                        Performance has improved.
Found 14 outliers among 100 measurements (14.00%)
  3 (3.00%) high mild
  11 (11.00%) high severe
Benchmarking decode/simd_4bit/128
Benchmarking decode/simd_4bit/128: Warming up for 3.0000 s
Benchmarking decode/simd_4bit/128: Collecting 100 samples in estimated 5.0002 s (104M iterations)
Benchmarking decode/simd_4bit/128: Analyzing
decode/simd_4bit/128    time:   [47.814 ns 48.002 ns 48.304 ns]
                        thrpt:  [2.4679 GiB/s 2.4834 GiB/s 2.4932 GiB/s]
                 change:
                        time:   [−2.4237% −1.8759% −1.1004%] (p = 0.00 < 0.05)
                        thrpt:  [+1.1127% +1.9118% +2.4839%]
                        Performance has improved.
Found 14 outliers among 100 measurements (14.00%)
  4 (4.00%) high mild
  10 (10.00%) high severe
Benchmarking decode/scalar_2bit/128
Benchmarking decode/scalar_2bit/128: Warming up for 3.0000 s
Benchmarking decode/scalar_2bit/128: Collecting 100 samples in estimated 5.0002 s (75M iterations)
Benchmarking decode/scalar_2bit/128: Analyzing
decode/scalar_2bit/128  time:   [66.916 ns 66.956 ns 67.007 ns]
                        thrpt:  [1.7790 GiB/s 1.7804 GiB/s 1.7815 GiB/s]
                 change:
                        time:   [+2.4146% +2.5932% +2.7764%] (p = 0.00 < 0.05)
                        thrpt:  [−2.7014% −2.5277% −2.3577%]
                        Performance has regressed.
Found 16 outliers among 100 measurements (16.00%)
  2 (2.00%) low mild
  6 (6.00%) high mild
  8 (8.00%) high severe
Benchmarking decode/scalar_4bit/128
Benchmarking decode/scalar_4bit/128: Warming up for 3.0000 s
Benchmarking decode/scalar_4bit/128: Collecting 100 samples in estimated 5.0001 s (81M iterations)
Benchmarking decode/scalar_4bit/128: Analyzing
decode/scalar_4bit/128  time:   [61.800 ns 61.835 ns 61.880 ns]
                        thrpt:  [1.9265 GiB/s 1.9279 GiB/s 1.9290 GiB/s]
                 change:
                        time:   [−3.3936% −3.2132% −3.0453%] (p = 0.00 < 0.05)
                        thrpt:  [+3.1409% +3.3198% +3.5128%]
                        Performance has improved.
Found 11 outliers among 100 measurements (11.00%)
  4 (4.00%) high mild
  7 (7.00%) high severe
Benchmarking decode/simd_4bit/255
Benchmarking decode/simd_4bit/255: Warming up for 3.0000 s
Benchmarking decode/simd_4bit/255: Collecting 100 samples in estimated 5.0003 s (71M iterations)
Benchmarking decode/simd_4bit/255: Analyzing
decode/simd_4bit/255    time:   [70.420 ns 70.462 ns 70.515 ns]
                        thrpt:  [3.3679 GiB/s 3.3704 GiB/s 3.3724 GiB/s]
                 change:
                        time:   [−6.6812% −4.4354% −2.7512%] (p = 0.00 < 0.05)
                        thrpt:  [+2.8291% +4.6412% +7.1596%]
                        Performance has improved.
Found 9 outliers among 100 measurements (9.00%)
  5 (5.00%) high mild
  4 (4.00%) high severe
Benchmarking decode/scalar_2bit/255
Benchmarking decode/scalar_2bit/255: Warming up for 3.0000 s
Benchmarking decode/scalar_2bit/255: Collecting 100 samples in estimated 5.0005 s (45M iterations)
Benchmarking decode/scalar_2bit/255: Analyzing
decode/scalar_2bit/255  time:   [110.42 ns 110.50 ns 110.61 ns]
                        thrpt:  [2.1471 GiB/s 2.1492 GiB/s 2.1507 GiB/s]
                 change:
                        time:   [−7.3716% −4.5111% −2.4213%] (p = 0.00 < 0.05)
                        thrpt:  [+2.4814% +4.7242% +7.9583%]
                        Performance has improved.
Found 17 outliers among 100 measurements (17.00%)
  1 (1.00%) low mild
  6 (6.00%) high mild
  10 (10.00%) high severe
Benchmarking decode/scalar_4bit/255
Benchmarking decode/scalar_4bit/255: Warming up for 3.0000 s
Benchmarking decode/scalar_4bit/255: Collecting 100 samples in estimated 5.0006 s (43M iterations)
Benchmarking decode/scalar_4bit/255: Analyzing
decode/scalar_4bit/255  time:   [115.34 ns 116.49 ns 118.22 ns]
                        thrpt:  [2.0089 GiB/s 2.0387 GiB/s 2.0590 GiB/s]
                 change:
                        time:   [−5.7105% −2.6932% −0.2897%] (p = 0.04 < 0.05)
                        thrpt:  [+0.2906% +2.7677% +6.0564%]
                        Change within noise threshold.
Found 8 outliers among 100 measurements (8.00%)
  5 (5.00%) high mild
  3 (3.00%) high severe
Benchmarking decode/simd_4bit/256
Benchmarking decode/simd_4bit/256: Warming up for 3.0000 s
Benchmarking decode/simd_4bit/256: Collecting 100 samples in estimated 5.0001 s (73M iterations)
Benchmarking decode/simd_4bit/256: Analyzing
decode/simd_4bit/256    time:   [68.529 ns 69.084 ns 69.851 ns]
                        thrpt:  [3.4133 GiB/s 3.4512 GiB/s 3.4791 GiB/s]
                 change:
                        time:   [−12.305% −7.3128% −3.3605%] (p = 0.00 < 0.05)
                        thrpt:  [+3.4774% +7.8898% +14.032%]
                        Performance has improved.
Found 9 outliers among 100 measurements (9.00%)
  2 (2.00%) high mild
  7 (7.00%) high severe
Benchmarking decode/scalar_2bit/256
Benchmarking decode/scalar_2bit/256: Warming up for 3.0000 s
Benchmarking decode/scalar_2bit/256: Collecting 100 samples in estimated 5.0003 s (45M iterations)
Benchmarking decode/scalar_2bit/256: Analyzing
decode/scalar_2bit/256  time:   [110.92 ns 111.01 ns 111.11 ns]
                        thrpt:  [2.1458 GiB/s 2.1478 GiB/s 2.1494 GiB/s]
                 change:
                        time:   [−7.4010% −4.9252% −2.9358%] (p = 0.00 < 0.05)
                        thrpt:  [+3.0246% +5.1804% +7.9925%]
                        Performance has improved.
Found 9 outliers among 100 measurements (9.00%)
  1 (1.00%) low mild
  3 (3.00%) high mild
  5 (5.00%) high severe
Benchmarking decode/scalar_4bit/256
Benchmarking decode/scalar_4bit/256: Warming up for 3.0000 s
Benchmarking decode/scalar_4bit/256: Collecting 100 samples in estimated 5.0002 s (44M iterations)
Benchmarking decode/scalar_4bit/256: Analyzing
decode/scalar_4bit/256  time:   [112.30 ns 112.39 ns 112.53 ns]
                        thrpt:  [2.1187 GiB/s 2.1213 GiB/s 2.1231 GiB/s]
                 change:
                        time:   [−4.4918% −2.4908% −1.0533%] (p = 0.00 < 0.05)
                        thrpt:  [+1.0645% +2.5544% +4.7030%]
                        Performance has improved.
Found 15 outliers among 100 measurements (15.00%)
  2 (2.00%) low mild
  8 (8.00%) high mild
  5 (5.00%) high severe
Benchmarking decode/simd_4bit/512
Benchmarking decode/simd_4bit/512: Warming up for 3.0000 s
Benchmarking decode/simd_4bit/512: Collecting 100 samples in estimated 5.0005 s (46M iterations)
Benchmarking decode/simd_4bit/512: Analyzing
decode/simd_4bit/512    time:   [108.37 ns 108.42 ns 108.47 ns]
                        thrpt:  [4.3961 GiB/s 4.3981 GiB/s 4.3999 GiB/s]
                 change:
                        time:   [−12.097% −8.0578% −4.6783%] (p = 0.00 < 0.05)
                        thrpt:  [+4.9079% +8.7640% +13.762%]
                        Performance has improved.
Found 7 outliers among 100 measurements (7.00%)
  3 (3.00%) high mild
  4 (4.00%) high severe
Benchmarking decode/scalar_2bit/512
Benchmarking decode/scalar_2bit/512: Warming up for 3.0000 s
Benchmarking decode/scalar_2bit/512: Collecting 100 samples in estimated 5.0003 s (25M iterations)
Benchmarking decode/scalar_2bit/512: Analyzing
decode/scalar_2bit/512  time:   [202.50 ns 202.77 ns 203.14 ns]
                        thrpt:  [2.3474 GiB/s 2.3516 GiB/s 2.3548 GiB/s]
                 change:
                        time:   [−4.5225% −2.9690% −1.7322%] (p = 0.00 < 0.05)
                        thrpt:  [+1.7627% +3.0599% +4.7367%]
                        Performance has improved.
Found 11 outliers among 100 measurements (11.00%)
  9 (9.00%) high mild
  2 (2.00%) high severe
Benchmarking decode/scalar_4bit/512
Benchmarking decode/scalar_4bit/512: Warming up for 3.0000 s
Benchmarking decode/scalar_4bit/512: Collecting 100 samples in estimated 5.0004 s (26M iterations)
Benchmarking decode/scalar_4bit/512: Analyzing
decode/scalar_4bit/512  time:   [190.73 ns 190.93 ns 191.19 ns]
                        thrpt:  [2.4940 GiB/s 2.4974 GiB/s 2.5001 GiB/s]
                 change:
                        time:   [−6.9158% −4.2518% −2.1372%] (p = 0.00 < 0.05)
                        thrpt:  [+2.1839% +4.4407% +7.4297%]
                        Performance has improved.
Found 12 outliers among 100 measurements (12.00%)
  5 (5.00%) high mild
  7 (7.00%) high severe
Benchmarking decode/simd_4bit/1023
Benchmarking decode/simd_4bit/1023: Warming up for 3.0000 s
Benchmarking decode/simd_4bit/1023: Collecting 100 samples in estimated 5.0005 s (25M iterations)
Benchmarking decode/simd_4bit/1023: Analyzing
decode/simd_4bit/1023   time:   [196.52 ns 197.09 ns 197.97 ns]
                        thrpt:  [4.8126 GiB/s 4.8340 GiB/s 4.8482 GiB/s]
                 change:
                        time:   [−7.3024% −3.7124% −0.2308%] (p = 0.04 < 0.05)
                        thrpt:  [+0.2313% +3.8555% +7.8776%]
                        Change within noise threshold.
Found 13 outliers among 100 measurements (13.00%)
  6 (6.00%) high mild
  7 (7.00%) high severe
Benchmarking decode/scalar_2bit/1023
Benchmarking decode/scalar_2bit/1023: Warming up for 3.0000 s
Benchmarking decode/scalar_2bit/1023: Collecting 100 samples in estimated 5.0017 s (13M iterations)
Benchmarking decode/scalar_2bit/1023: Analyzing
decode/scalar_2bit/1023 time:   [387.48 ns 388.19 ns 389.55 ns]
                        thrpt:  [2.4458 GiB/s 2.4543 GiB/s 2.4588 GiB/s]
                 change:
                        time:   [−6.4623% −3.7651% −1.1562%] (p = 0.00 < 0.05)
                        thrpt:  [+1.1697% +3.9124% +6.9087%]
                        Performance has improved.
Found 11 outliers among 100 measurements (11.00%)
  1 (1.00%) low mild
  5 (5.00%) high mild
  5 (5.00%) high severe
Benchmarking decode/scalar_4bit/1023
Benchmarking decode/scalar_4bit/1023: Warming up for 3.0000 s
Benchmarking decode/scalar_4bit/1023: Collecting 100 samples in estimated 5.0011 s (14M iterations)
Benchmarking decode/scalar_4bit/1023: Analyzing
decode/scalar_4bit/1023 time:   [355.18 ns 355.50 ns 355.88 ns]
                        thrpt:  [2.6771 GiB/s 2.6800 GiB/s 2.6824 GiB/s]
                 change:
                        time:   [−8.9032% −6.0167% −3.6475%] (p = 0.00 < 0.05)
                        thrpt:  [+3.7856% +6.4018% +9.7734%]
                        Performance has improved.
Found 5 outliers among 100 measurements (5.00%)
  3 (3.00%) high mild
  2 (2.00%) high severe
Benchmarking decode/simd_4bit/1024
Benchmarking decode/simd_4bit/1024: Warming up for 3.0000 s
Benchmarking decode/simd_4bit/1024: Collecting 100 samples in estimated 5.0001 s (26M iterations)
Benchmarking decode/simd_4bit/1024: Analyzing
decode/simd_4bit/1024   time:   [195.01 ns 195.08 ns 195.17 ns]
                        thrpt:  [4.8864 GiB/s 4.8886 GiB/s 4.8905 GiB/s]
                 change:
                        time:   [−8.0570% −4.8149% −2.2900%] (p = 0.00 < 0.05)
                        thrpt:  [+2.3437% +5.0585% +8.7631%]
                        Performance has improved.
Found 9 outliers among 100 measurements (9.00%)
  2 (2.00%) high mild
  7 (7.00%) high severe
Benchmarking decode/scalar_2bit/1024
Benchmarking decode/scalar_2bit/1024: Warming up for 3.0000 s
Benchmarking decode/scalar_2bit/1024: Collecting 100 samples in estimated 5.0001 s (13M iterations)
Benchmarking decode/scalar_2bit/1024: Analyzing
decode/scalar_2bit/1024 time:   [381.41 ns 381.62 ns 381.89 ns]
                        thrpt:  [2.4972 GiB/s 2.4990 GiB/s 2.5004 GiB/s]
                 change:
                        time:   [−6.1743% −3.7635% −1.9771%] (p = 0.00 < 0.05)
                        thrpt:  [+2.0170% +3.9106% +6.5806%]
                        Performance has improved.
Found 13 outliers among 100 measurements (13.00%)
  2 (2.00%) low mild
  7 (7.00%) high mild
  4 (4.00%) high severe
Benchmarking decode/scalar_4bit/1024
Benchmarking decode/scalar_4bit/1024: Warming up for 3.0000 s
Benchmarking decode/scalar_4bit/1024: Collecting 100 samples in estimated 5.0013 s (14M iterations)
Benchmarking decode/scalar_4bit/1024: Analyzing
decode/scalar_4bit/1024 time:   [350.72 ns 351.18 ns 351.76 ns]
                        thrpt:  [2.7112 GiB/s 2.7156 GiB/s 2.7192 GiB/s]
                 change:
                        time:   [−6.7505% −4.4176% −2.4897%] (p = 0.00 < 0.05)
                        thrpt:  [+2.5532% +4.6218% +7.2392%]
                        Performance has improved.
Found 17 outliers among 100 measurements (17.00%)
  5 (5.00%) high mild
  12 (12.00%) high severe
Benchmarking decode/simd_4bit/2048
Benchmarking decode/simd_4bit/2048: Warming up for 3.0000 s
Benchmarking decode/simd_4bit/2048: Collecting 100 samples in estimated 5.0007 s (13M iterations)
Benchmarking decode/simd_4bit/2048: Analyzing
decode/simd_4bit/2048   time:   [377.87 ns 379.97 ns 384.12 ns]
                        thrpt:  [4.9655 GiB/s 5.0198 GiB/s 5.0477 GiB/s]
                 change:
                        time:   [−5.5625% −3.6461% −2.0339%] (p = 0.00 < 0.05)
                        thrpt:  [+2.0761% +3.7841% +5.8901%]
                        Performance has improved.
Found 11 outliers among 100 measurements (11.00%)
  1 (1.00%) low mild
  2 (2.00%) high mild
  8 (8.00%) high severe
Benchmarking decode/scalar_2bit/2048
Benchmarking decode/scalar_2bit/2048: Warming up for 3.0000 s
Benchmarking decode/scalar_2bit/2048: Collecting 100 samples in estimated 5.0033 s (6.7M iterations)
Benchmarking decode/scalar_2bit/2048: Analyzing
decode/scalar_2bit/2048 time:   [741.90 ns 747.26 ns 755.39 ns]
                        thrpt:  [2.5250 GiB/s 2.5524 GiB/s 2.5709 GiB/s]
                 change:
                        time:   [−6.6816% −4.0199% −1.8993%] (p = 0.00 < 0.05)
                        thrpt:  [+1.9361% +4.1883% +7.1600%]
                        Performance has improved.
Found 14 outliers among 100 measurements (14.00%)
  1 (1.00%) low mild
  6 (6.00%) high mild
  7 (7.00%) high severe
Benchmarking decode/scalar_4bit/2048
Benchmarking decode/scalar_4bit/2048: Warming up for 3.0000 s
Benchmarking decode/scalar_4bit/2048: Collecting 100 samples in estimated 5.0031 s (7.3M iterations)
Benchmarking decode/scalar_4bit/2048: Analyzing
decode/scalar_4bit/2048 time:   [688.46 ns 688.99 ns 689.70 ns]
                        thrpt:  [2.7655 GiB/s 2.7683 GiB/s 2.7705 GiB/s]
                 change:
                        time:   [−8.0907% −5.3017% −3.1239%] (p = 0.00 < 0.05)
                        thrpt:  [+3.2246% +5.5985% +8.8029%]
                        Performance has improved.
Found 11 outliers among 100 measurements (11.00%)
  2 (2.00%) low mild
  4 (4.00%) high mild
  5 (5.00%) high severe
Benchmarking decode/simd_4bit/4095
Benchmarking decode/simd_4bit/4095: Warming up for 3.0000 s
Benchmarking decode/simd_4bit/4095: Collecting 100 samples in estimated 5.0021 s (6.8M iterations)
Benchmarking decode/simd_4bit/4095: Analyzing
decode/simd_4bit/4095   time:   [726.37 ns 727.13 ns 728.22 ns]
                        thrpt:  [5.2371 GiB/s 5.2449 GiB/s 5.2505 GiB/s]
                 change:
                        time:   [−6.7046% −3.9436% −2.0955%] (p = 0.00 < 0.05)
                        thrpt:  [+2.1403% +4.1056% +7.1865%]
                        Performance has improved.
Found 11 outliers among 100 measurements (11.00%)
  7 (7.00%) high mild
  4 (4.00%) high severe
Benchmarking decode/scalar_2bit/4095
Benchmarking decode/scalar_2bit/4095: Warming up for 3.0000 s
Benchmarking decode/scalar_2bit/4095: Collecting 100 samples in estimated 5.0013 s (3.4M iterations)
Benchmarking decode/scalar_2bit/4095: Analyzing
decode/scalar_2bit/4095 time:   [1.4725 µs 1.4732 µs 1.4742 µs]
                        thrpt:  [2.5871 GiB/s 2.5887 GiB/s 2.5900 GiB/s]
                 change:
                        time:   [−8.0642% −5.3659% −3.1148%] (p = 0.00 < 0.05)
                        thrpt:  [+3.2149% +5.6702% +8.7716%]
                        Performance has improved.
Found 15 outliers among 100 measurements (15.00%)
  1 (1.00%) low mild
  2 (2.00%) high mild
  12 (12.00%) high severe
Benchmarking decode/scalar_4bit/4095
Benchmarking decode/scalar_4bit/4095: Warming up for 3.0000 s
Benchmarking decode/scalar_4bit/4095: Collecting 100 samples in estimated 5.0003 s (3.6M iterations)
Benchmarking decode/scalar_4bit/4095: Analyzing
decode/scalar_4bit/4095 time:   [1.3718 µs 1.3732 µs 1.3749 µs]
                        thrpt:  [2.7739 GiB/s 2.7773 GiB/s 2.7800 GiB/s]
                 change:
                        time:   [−8.1882% −5.6707% −3.4842%] (p = 0.00 < 0.05)
                        thrpt:  [+3.6100% +6.0116% +8.9185%]
                        Performance has improved.
Found 12 outliers among 100 measurements (12.00%)
  4 (4.00%) high mild
  8 (8.00%) high severe
Benchmarking decode/simd_4bit/4096
Benchmarking decode/simd_4bit/4096: Warming up for 3.0000 s
Benchmarking decode/simd_4bit/4096: Collecting 100 samples in estimated 5.0034 s (6.9M iterations)
Benchmarking decode/simd_4bit/4096: Analyzing
decode/simd_4bit/4096   time:   [724.05 ns 724.45 ns 724.93 ns]
                        thrpt:  [5.2622 GiB/s 5.2656 GiB/s 5.2686 GiB/s]
                 change:
                        time:   [−5.3827% −3.6574% −2.2513%] (p = 0.00 < 0.05)
                        thrpt:  [+2.3032% +3.7962% +5.6889%]
                        Performance has improved.
Found 11 outliers among 100 measurements (11.00%)
  3 (3.00%) high mild
  8 (8.00%) high severe
Benchmarking decode/scalar_2bit/4096
Benchmarking decode/scalar_2bit/4096: Warming up for 3.0000 s
Benchmarking decode/scalar_2bit/4096: Collecting 100 samples in estimated 5.0008 s (3.4M iterations)
Benchmarking decode/scalar_2bit/4096: Analyzing
decode/scalar_2bit/4096 time:   [1.4723 µs 1.4732 µs 1.4743 µs]
                        thrpt:  [2.5875 GiB/s 2.5895 GiB/s 2.5910 GiB/s]
                 change:
                        time:   [−7.4514% −4.7830% −2.6831%] (p = 0.00 < 0.05)
                        thrpt:  [+2.7571% +5.0233% +8.0513%]
                        Performance has improved.
Found 13 outliers among 100 measurements (13.00%)
  5 (5.00%) high mild
  8 (8.00%) high severe
Benchmarking decode/scalar_4bit/4096
Benchmarking decode/scalar_4bit/4096: Warming up for 3.0000 s
Benchmarking decode/scalar_4bit/4096: Collecting 100 samples in estimated 5.0066 s (3.7M iterations)
Benchmarking decode/scalar_4bit/4096: Analyzing
decode/scalar_4bit/4096 time:   [1.3686 µs 1.3766 µs 1.3887 µs]
                        thrpt:  [2.7469 GiB/s 2.7711 GiB/s 2.7873 GiB/s]
                 change:
                        time:   [−2.3305% −1.9031% −1.3443%] (p = 0.00 < 0.05)
                        thrpt:  [+1.3626% +1.9400% +2.3861%]
                        Performance has improved.
Found 10 outliers among 100 measurements (10.00%)
  4 (4.00%) high mild
  6 (6.00%) high severe
Benchmarking decode/simd_4bit/8192
Benchmarking decode/simd_4bit/8192: Warming up for 3.0000 s
Benchmarking decode/simd_4bit/8192: Collecting 100 samples in estimated 5.0002 s (3.5M iterations)
Benchmarking decode/simd_4bit/8192: Analyzing
decode/simd_4bit/8192   time:   [1.4249 µs 1.4261 µs 1.4275 µs]
                        thrpt:  [5.3447 GiB/s 5.3498 GiB/s 5.3542 GiB/s]
                 change:
                        time:   [−7.5865% −5.2729% −3.3327%] (p = 0.00 < 0.05)
                        thrpt:  [+3.4476% +5.5665% +8.2093%]
                        Performance has improved.
Found 9 outliers among 100 measurements (9.00%)
  4 (4.00%) high mild
  5 (5.00%) high severe
Benchmarking decode/scalar_2bit/8192
Benchmarking decode/scalar_2bit/8192: Warming up for 3.0000 s
Benchmarking decode/scalar_2bit/8192: Collecting 100 samples in estimated 5.0079 s (1.7M iterations)
Benchmarking decode/scalar_2bit/8192: Analyzing
decode/scalar_2bit/8192 time:   [2.9218 µs 2.9233 µs 2.9252 µs]
                        thrpt:  [2.6082 GiB/s 2.6098 GiB/s 2.6112 GiB/s]
                 change:
                        time:   [−6.2533% −4.1826% −2.6307%] (p = 0.00 < 0.05)
                        thrpt:  [+2.7018% +4.3652% +6.6704%]
                        Performance has improved.
Found 6 outliers among 100 measurements (6.00%)
  2 (2.00%) high mild
  4 (4.00%) high severe
Benchmarking decode/scalar_4bit/8192
Benchmarking decode/scalar_4bit/8192: Warming up for 3.0000 s
Benchmarking decode/scalar_4bit/8192: Collecting 100 samples in estimated 5.0106 s (1.9M iterations)
Benchmarking decode/scalar_4bit/8192: Analyzing
decode/scalar_4bit/8192 time:   [2.6939 µs 2.6970 µs 2.7008 µs]
                        thrpt:  [2.8249 GiB/s 2.8289 GiB/s 2.8321 GiB/s]
                 change:
                        time:   [−14.033% −7.9536% −3.4872%] (p = 0.00 < 0.05)
                        thrpt:  [+3.6132% +8.6409% +16.324%]
                        Performance has improved.
Found 12 outliers among 100 measurements (12.00%)
  2 (2.00%) low mild
  5 (5.00%) high mild
  5 (5.00%) high severe
Benchmarking decode/simd_4bit/9999
Benchmarking decode/simd_4bit/9999: Warming up for 3.0000 s
Benchmarking decode/simd_4bit/9999: Collecting 100 samples in estimated 5.0010 s (2.9M iterations)
Benchmarking decode/simd_4bit/9999: Analyzing
decode/simd_4bit/9999   time:   [1.7362 µs 1.7380 µs 1.7404 µs]
                        thrpt:  [5.3508 GiB/s 5.3580 GiB/s 5.3637 GiB/s]
                 change:
                        time:   [−9.4257% −6.2354% −3.6412%] (p = 0.00 < 0.05)
                        thrpt:  [+3.7788% +6.6501% +10.407%]
                        Performance has improved.
Found 10 outliers among 100 measurements (10.00%)
  5 (5.00%) high mild
  5 (5.00%) high severe
Benchmarking decode/scalar_2bit/9999
Benchmarking decode/scalar_2bit/9999: Warming up for 3.0000 s
Benchmarking decode/scalar_2bit/9999: Collecting 100 samples in estimated 5.0136 s (1.4M iterations)
Benchmarking decode/scalar_2bit/9999: Analyzing
decode/scalar_2bit/9999 time:   [3.5626 µs 3.5838 µs 3.6251 µs]
                        thrpt:  [2.5688 GiB/s 2.5984 GiB/s 2.6139 GiB/s]
                 change:
                        time:   [−8.8694% −5.9123% −3.3702%] (p = 0.00 < 0.05)
                        thrpt:  [+3.4877% +6.2838% +9.7326%]
                        Performance has improved.
Found 6 outliers among 100 measurements (6.00%)
  2 (2.00%) high mild
  4 (4.00%) high severe
Benchmarking decode/scalar_4bit/9999
Benchmarking decode/scalar_4bit/9999: Warming up for 3.0000 s
Benchmarking decode/scalar_4bit/9999: Collecting 100 samples in estimated 5.0128 s (1.5M iterations)
Benchmarking decode/scalar_4bit/9999: Analyzing
decode/scalar_4bit/9999 time:   [3.2829 µs 3.2990 µs 3.3305 µs]
                        thrpt:  [2.7961 GiB/s 2.8228 GiB/s 2.8366 GiB/s]
                 change:
                        time:   [−7.9427% −5.1150% −2.7547%] (p = 0.00 < 0.05)
                        thrpt:  [+2.8327% +5.3908% +8.6280%]
                        Performance has improved.
Found 13 outliers among 100 measurements (13.00%)
  5 (5.00%) high mild
  8 (8.00%) high severe
Benchmarking decode/simd_4bit/10000
Benchmarking decode/simd_4bit/10000: Warming up for 3.0000 s
Benchmarking decode/simd_4bit/10000: Collecting 100 samples in estimated 5.0015 s (2.9M iterations)
Benchmarking decode/simd_4bit/10000: Analyzing
decode/simd_4bit/10000  time:   [1.7358 µs 1.7376 µs 1.7398 µs]
                        thrpt:  [5.3531 GiB/s 5.3597 GiB/s 5.3655 GiB/s]
                 change:
                        time:   [−11.016% −6.8875% −3.5091%] (p = 0.00 < 0.05)
                        thrpt:  [+3.6367% +7.3970% +12.380%]
                        Performance has improved.
Found 7 outliers among 100 measurements (7.00%)
  4 (4.00%) high mild
  3 (3.00%) high severe
Benchmarking decode/scalar_2bit/10000
Benchmarking decode/scalar_2bit/10000: Warming up for 3.0000 s
Benchmarking decode/scalar_2bit/10000: Collecting 100 samples in estimated 5.0060 s (1.4M iterations)
Benchmarking decode/scalar_2bit/10000: Analyzing
decode/scalar_2bit/10000
                        time:   [3.5610 µs 3.5642 µs 3.5679 µs]
                        thrpt:  [2.6103 GiB/s 2.6130 GiB/s 2.6153 GiB/s]
                 change:
                        time:   [−7.9356% −5.2629% −3.1533%] (p = 0.00 < 0.05)
                        thrpt:  [+3.2559% +5.5552% +8.6196%]
                        Performance has improved.
Found 10 outliers among 100 measurements (10.00%)
  7 (7.00%) high mild
  3 (3.00%) high severe
Benchmarking decode/scalar_4bit/10000
Benchmarking decode/scalar_4bit/10000: Warming up for 3.0000 s
Benchmarking decode/scalar_4bit/10000: Collecting 100 samples in estimated 5.0039 s (1.5M iterations)
Benchmarking decode/scalar_4bit/10000: Analyzing
decode/scalar_4bit/10000
                        time:   [3.2765 µs 3.2781 µs 3.2800 µs]
                        thrpt:  [2.8394 GiB/s 2.8410 GiB/s 2.8424 GiB/s]
                 change:
                        time:   [−6.1278% −4.2717% −2.7686%] (p = 0.00 < 0.05)
                        thrpt:  [+2.8474% +4.4623% +6.5278%]
                        Performance has improved.
Found 13 outliers among 100 measurements (13.00%)
  1 (1.00%) low mild
  4 (4.00%) high mild
  8 (8.00%) high severe

Benchmarking roundtrip/simd_4bit/15
Benchmarking roundtrip/simd_4bit/15: Warming up for 3.0000 s
Benchmarking roundtrip/simd_4bit/15: Collecting 100 samples in estimated 5.0002 s (93M iterations)
Benchmarking roundtrip/simd_4bit/15: Analyzing
roundtrip/simd_4bit/15  time:   [53.910 ns 53.980 ns 54.069 ns]
                        thrpt:  [264.57 MiB/s 265.01 MiB/s 265.35 MiB/s]
                 change:
                        time:   [−7.7050% −5.3254% −3.4065%] (p = 0.00 < 0.05)
                        thrpt:  [+3.5267% +5.6249% +8.3482%]
                        Performance has improved.
Found 15 outliers among 100 measurements (15.00%)
  2 (2.00%) low mild
  2 (2.00%) high mild
  11 (11.00%) high severe
Benchmarking roundtrip/scalar_2bit/15
Benchmarking roundtrip/scalar_2bit/15: Warming up for 3.0000 s
Benchmarking roundtrip/scalar_2bit/15: Collecting 100 samples in estimated 5.0002 s (42M iterations)
Benchmarking roundtrip/scalar_2bit/15: Analyzing
roundtrip/scalar_2bit/15
                        time:   [118.95 ns 119.65 ns 120.72 ns]
                        thrpt:  [118.50 MiB/s 119.56 MiB/s 120.26 MiB/s]
                 change:
                        time:   [−7.0059% −4.4029% −2.3297%] (p = 0.00 < 0.05)
                        thrpt:  [+2.3853% +4.6057% +7.5337%]
                        Performance has improved.
Found 16 outliers among 100 measurements (16.00%)
  8 (8.00%) high mild
  8 (8.00%) high severe
Benchmarking roundtrip/scalar_4bit/15
Benchmarking roundtrip/scalar_4bit/15: Warming up for 3.0000 s
Benchmarking roundtrip/scalar_4bit/15: Collecting 100 samples in estimated 5.0002 s (90M iterations)
Benchmarking roundtrip/scalar_4bit/15: Analyzing
roundtrip/scalar_4bit/15
                        time:   [55.381 ns 55.415 ns 55.458 ns]
                        thrpt:  [257.95 MiB/s 258.14 MiB/s 258.30 MiB/s]
                 change:
                        time:   [+2.2415% +2.4236% +2.6211%] (p = 0.00 < 0.05)
                        thrpt:  [−2.5541% −2.3662% −2.1923%]
                        Performance has regressed.
Found 7 outliers among 100 measurements (7.00%)
  2 (2.00%) high mild
  5 (5.00%) high severe
Benchmarking roundtrip/simd_4bit/16
Benchmarking roundtrip/simd_4bit/16: Warming up for 3.0000 s
Benchmarking roundtrip/simd_4bit/16: Collecting 100 samples in estimated 5.0002 s (96M iterations)
Benchmarking roundtrip/simd_4bit/16: Analyzing
roundtrip/simd_4bit/16  time:   [51.837 ns 51.894 ns 51.971 ns]
                        thrpt:  [293.60 MiB/s 294.04 MiB/s 294.36 MiB/s]
                 change:
                        time:   [+2.7225% +5.4013% +10.523%] (p = 0.01 < 0.05)
                        thrpt:  [−9.5207% −5.1245% −2.6503%]
                        Performance has regressed.
Found 13 outliers among 100 measurements (13.00%)
  2 (2.00%) low mild
  6 (6.00%) high mild
  5 (5.00%) high severe
Benchmarking roundtrip/scalar_2bit/16
Benchmarking roundtrip/scalar_2bit/16: Warming up for 3.0000 s
Benchmarking roundtrip/scalar_2bit/16: Collecting 100 samples in estimated 5.0000 s (66M iterations)
Benchmarking roundtrip/scalar_2bit/16: Analyzing
roundtrip/scalar_2bit/16
                        time:   [75.943 ns 75.993 ns 76.062 ns]
                        thrpt:  [200.61 MiB/s 200.79 MiB/s 200.92 MiB/s]
                 change:
                        time:   [−2.6522% −2.4586% −2.2770%] (p = 0.00 < 0.05)
                        thrpt:  [+2.3300% +2.5206% +2.7245%]
                        Performance has improved.
Found 12 outliers among 100 measurements (12.00%)
  7 (7.00%) high mild
  5 (5.00%) high severe
Benchmarking roundtrip/scalar_4bit/16
Benchmarking roundtrip/scalar_4bit/16: Warming up for 3.0000 s
Benchmarking roundtrip/scalar_4bit/16: Collecting 100 samples in estimated 5.0001 s (89M iterations)
Benchmarking roundtrip/scalar_4bit/16: Analyzing
roundtrip/scalar_4bit/16
                        time:   [56.091 ns 56.135 ns 56.192 ns]
                        thrpt:  [271.55 MiB/s 271.82 MiB/s 272.03 MiB/s]
                 change:
                        time:   [+2.0514% +2.2141% +2.3805%] (p = 0.00 < 0.05)
                        thrpt:  [−2.3251% −2.1661% −2.0102%]
                        Performance has regressed.
Found 9 outliers among 100 measurements (9.00%)
  3 (3.00%) high mild
  6 (6.00%) high severe
Benchmarking roundtrip/simd_4bit/17
Benchmarking roundtrip/simd_4bit/17: Warming up for 3.0000 s
Benchmarking roundtrip/simd_4bit/17: Collecting 100 samples in estimated 5.0001 s (88M iterations)
Benchmarking roundtrip/simd_4bit/17: Analyzing
roundtrip/simd_4bit/17  time:   [56.848 ns 56.922 ns 57.021 ns]
                        thrpt:  [284.32 MiB/s 284.82 MiB/s 285.19 MiB/s]
                 change:
                        time:   [+1.3150% +1.9960% +2.5075%] (p = 0.00 < 0.05)
                        thrpt:  [−2.4462% −1.9569% −1.2979%]
                        Performance has regressed.
Found 12 outliers among 100 measurements (12.00%)
  1 (1.00%) low mild
  4 (4.00%) high mild
  7 (7.00%) high severe
Benchmarking roundtrip/scalar_2bit/17
Benchmarking roundtrip/scalar_2bit/17: Warming up for 3.0000 s
Benchmarking roundtrip/scalar_2bit/17: Collecting 100 samples in estimated 5.0004 s (38M iterations)
Benchmarking roundtrip/scalar_2bit/17: Analyzing
roundtrip/scalar_2bit/17
                        time:   [132.25 ns 133.15 ns 134.57 ns]
                        thrpt:  [120.48 MiB/s 121.76 MiB/s 122.59 MiB/s]
                 change:
                        time:   [−4.7158% −4.3449% −3.7960%] (p = 0.00 < 0.05)
                        thrpt:  [+3.9458% +4.5423% +4.9492%]
                        Performance has improved.
Found 10 outliers among 100 measurements (10.00%)
  3 (3.00%) high mild
  7 (7.00%) high severe
Benchmarking roundtrip/scalar_4bit/17
Benchmarking roundtrip/scalar_4bit/17: Warming up for 3.0000 s
Benchmarking roundtrip/scalar_4bit/17: Collecting 100 samples in estimated 5.0000 s (88M iterations)
Benchmarking roundtrip/scalar_4bit/17: Analyzing
roundtrip/scalar_4bit/17
                        time:   [56.716 ns 57.255 ns 58.049 ns]
                        thrpt:  [279.29 MiB/s 283.16 MiB/s 285.85 MiB/s]
                 change:
                        time:   [−3.3289% −2.7532% −1.9262%] (p = 0.00 < 0.05)
                        thrpt:  [+1.9640% +2.8311% +3.4435%]
                        Performance has improved.
Found 15 outliers among 100 measurements (15.00%)
  2 (2.00%) high mild
  13 (13.00%) high severe
Benchmarking roundtrip/simd_4bit/32
Benchmarking roundtrip/simd_4bit/32: Warming up for 3.0000 s
Benchmarking roundtrip/simd_4bit/32: Collecting 100 samples in estimated 5.0002 s (80M iterations)
Benchmarking roundtrip/simd_4bit/32: Analyzing
roundtrip/simd_4bit/32  time:   [62.398 ns 62.444 ns 62.494 ns]
                        thrpt:  [488.33 MiB/s 488.72 MiB/s 489.08 MiB/s]
                 change:
                        time:   [+1.6547% +1.9368% +2.2210%] (p = 0.00 < 0.05)
                        thrpt:  [−2.1728% −1.9000% −1.6278%]
                        Performance has regressed.
Found 10 outliers among 100 measurements (10.00%)
  4 (4.00%) high mild
  6 (6.00%) high severe
Benchmarking roundtrip/scalar_2bit/32
Benchmarking roundtrip/scalar_2bit/32: Warming up for 3.0000 s
Benchmarking roundtrip/scalar_2bit/32: Collecting 100 samples in estimated 5.0000 s (51M iterations)
Benchmarking roundtrip/scalar_2bit/32: Analyzing
roundtrip/scalar_2bit/32
                        time:   [97.372 ns 97.449 ns 97.539 ns]
                        thrpt:  [312.88 MiB/s 313.16 MiB/s 313.41 MiB/s]
                 change:
                        time:   [−2.3735% −2.1471% −1.9005%] (p = 0.00 < 0.05)
                        thrpt:  [+1.9373% +2.1942% +2.4312%]
                        Performance has improved.
Found 11 outliers among 100 measurements (11.00%)
  3 (3.00%) high mild
  8 (8.00%) high severe
Benchmarking roundtrip/scalar_4bit/32
Benchmarking roundtrip/scalar_4bit/32: Warming up for 3.0000 s
Benchmarking roundtrip/scalar_4bit/32: Collecting 100 samples in estimated 5.0000 s (71M iterations)
Benchmarking roundtrip/scalar_4bit/32: Analyzing
roundtrip/scalar_4bit/32
                        time:   [70.706 ns 70.751 ns 70.804 ns]
                        thrpt:  [431.02 MiB/s 431.34 MiB/s 431.61 MiB/s]
                 change:
                        time:   [−10.142% −7.0660% −4.4600%] (p = 0.00 < 0.05)
                        thrpt:  [+4.6682% +7.6032% +11.287%]
                        Performance has improved.
Found 7 outliers among 100 measurements (7.00%)
  2 (2.00%) high mild
  5 (5.00%) high severe
Benchmarking roundtrip/simd_4bit/33
Benchmarking roundtrip/simd_4bit/33: Warming up for 3.0000 s
Benchmarking roundtrip/simd_4bit/33: Collecting 100 samples in estimated 5.0001 s (78M iterations)
Benchmarking roundtrip/simd_4bit/33: Analyzing
roundtrip/simd_4bit/33  time:   [63.449 ns 63.536 ns 63.643 ns]
                        thrpt:  [494.50 MiB/s 495.33 MiB/s 496.00 MiB/s]
                 change:
                        time:   [−18.868% −11.261% −4.7286%] (p = 0.00 < 0.05)
                        thrpt:  [+4.9633% +12.689% +23.256%]
                        Performance has improved.
Found 8 outliers among 100 measurements (8.00%)
  4 (4.00%) high mild
  4 (4.00%) high severe
Benchmarking roundtrip/scalar_2bit/33
Benchmarking roundtrip/scalar_2bit/33: Warming up for 3.0000 s
Benchmarking roundtrip/scalar_2bit/33: Collecting 100 samples in estimated 5.0002 s (34M iterations)
Benchmarking roundtrip/scalar_2bit/33: Analyzing
roundtrip/scalar_2bit/33
                        time:   [147.53 ns 147.68 ns 147.92 ns]
                        thrpt:  [212.76 MiB/s 213.10 MiB/s 213.31 MiB/s]
                 change:
                        time:   [−4.7084% −4.5467% −4.3743%] (p = 0.00 < 0.05)
                        thrpt:  [+4.5744% +4.7632% +4.9410%]
                        Performance has improved.
Found 15 outliers among 100 measurements (15.00%)
  2 (2.00%) low mild
  6 (6.00%) high mild
  7 (7.00%) high severe
Benchmarking roundtrip/scalar_4bit/33
Benchmarking roundtrip/scalar_4bit/33: Warming up for 3.0000 s
Benchmarking roundtrip/scalar_4bit/33: Collecting 100 samples in estimated 5.0000 s (63M iterations)
Benchmarking roundtrip/scalar_4bit/33: Analyzing
roundtrip/scalar_4bit/33
                        time:   [78.916 ns 79.512 ns 80.423 ns]
                        thrpt:  [391.32 MiB/s 395.81 MiB/s 398.80 MiB/s]
                 change:
                        time:   [+1.1626% +2.0470% +2.8257%] (p = 0.00 < 0.05)
                        thrpt:  [−2.7481% −2.0059% −1.1493%]
                        Performance has regressed.
Found 16 outliers among 100 measurements (16.00%)
  8 (8.00%) high mild
  8 (8.00%) high severe
Benchmarking roundtrip/simd_4bit/63
Benchmarking roundtrip/simd_4bit/63: Warming up for 3.0000 s
Benchmarking roundtrip/simd_4bit/63: Collecting 100 samples in estimated 5.0000 s (67M iterations)
Benchmarking roundtrip/simd_4bit/63: Analyzing
roundtrip/simd_4bit/63  time:   [74.949 ns 75.016 ns 75.116 ns]
                        thrpt:  [799.85 MiB/s 800.92 MiB/s 801.63 MiB/s]
                 change:
                        time:   [−7.0672% −6.7600% −6.5358%] (p = 0.00 < 0.05)
                        thrpt:  [+6.9929% +7.2501% +7.6047%]
                        Performance has improved.
Found 10 outliers among 100 measurements (10.00%)
  1 (1.00%) low mild
  3 (3.00%) high mild
  6 (6.00%) high severe
Benchmarking roundtrip/scalar_2bit/63
Benchmarking roundtrip/scalar_2bit/63: Warming up for 3.0000 s
Benchmarking roundtrip/scalar_2bit/63: Collecting 100 samples in estimated 5.0001 s (29M iterations)
Benchmarking roundtrip/scalar_2bit/63: Analyzing
roundtrip/scalar_2bit/63
                        time:   [170.46 ns 170.94 ns 171.49 ns]
                        thrpt:  [350.36 MiB/s 351.47 MiB/s 352.47 MiB/s]
                 change:
                        time:   [−4.7045% −4.5072% −4.2961%] (p = 0.00 < 0.05)
                        thrpt:  [+4.4890% +4.7200% +4.9367%]
                        Performance has improved.
Found 13 outliers among 100 measurements (13.00%)
  1 (1.00%) low mild
  3 (3.00%) high mild
  9 (9.00%) high severe
Benchmarking roundtrip/scalar_4bit/63
Benchmarking roundtrip/scalar_4bit/63: Warming up for 3.0000 s
Benchmarking roundtrip/scalar_4bit/63: Collecting 100 samples in estimated 5.0001 s (47M iterations)
Benchmarking roundtrip/scalar_4bit/63: Analyzing
roundtrip/scalar_4bit/63
                        time:   [105.96 ns 106.07 ns 106.25 ns]
                        thrpt:  [565.49 MiB/s 566.44 MiB/s 567.01 MiB/s]
                 change:
                        time:   [−8.1951% −4.8405% −2.1673%] (p = 0.00 < 0.05)
                        thrpt:  [+2.2153% +5.0867% +8.9267%]
                        Performance has improved.
Found 11 outliers among 100 measurements (11.00%)
  7 (7.00%) high mild
  4 (4.00%) high severe
Benchmarking roundtrip/simd_4bit/64
Benchmarking roundtrip/simd_4bit/64: Warming up for 3.0000 s
Benchmarking roundtrip/simd_4bit/64: Collecting 100 samples in estimated 5.0000 s (71M iterations)
Benchmarking roundtrip/simd_4bit/64: Analyzing
roundtrip/simd_4bit/64  time:   [72.096 ns 72.328 ns 72.557 ns]
                        thrpt:  [841.20 MiB/s 843.87 MiB/s 846.58 MiB/s]
                 change:
                        time:   [−11.904% −9.0189% −6.6104%] (p = 0.00 < 0.05)
                        thrpt:  [+7.0783% +9.9130% +13.512%]
                        Performance has improved.
Found 1 outliers among 100 measurements (1.00%)
  1 (1.00%) high severe
Benchmarking roundtrip/scalar_2bit/64
Benchmarking roundtrip/scalar_2bit/64: Warming up for 3.0000 s
Benchmarking roundtrip/scalar_2bit/64: Collecting 100 samples in estimated 5.0001 s (39M iterations)
Benchmarking roundtrip/scalar_2bit/64: Analyzing
roundtrip/scalar_2bit/64
                        time:   [124.61 ns 126.16 ns 128.35 ns]
                        thrpt:  [475.54 MiB/s 483.81 MiB/s 489.79 MiB/s]
                 change:
                        time:   [−8.6250% −6.4275% −4.5484%] (p = 0.00 < 0.05)
                        thrpt:  [+4.7651% +6.8690% +9.4391%]
                        Performance has improved.
Found 13 outliers among 100 measurements (13.00%)
  2 (2.00%) high mild
  11 (11.00%) high severe
Benchmarking roundtrip/scalar_4bit/64
Benchmarking roundtrip/scalar_4bit/64: Warming up for 3.0000 s
Benchmarking roundtrip/scalar_4bit/64: Collecting 100 samples in estimated 5.0004 s (47M iterations)
Benchmarking roundtrip/scalar_4bit/64: Analyzing
roundtrip/scalar_4bit/64
                        time:   [106.53 ns 107.62 ns 109.09 ns]
                        thrpt:  [559.48 MiB/s 567.12 MiB/s 572.93 MiB/s]
                 change:
                        time:   [−6.3169% −3.5101% −1.4877%] (p = 0.00 < 0.05)
                        thrpt:  [+1.5102% +3.6378% +6.7428%]
                        Performance has improved.
Found 14 outliers among 100 measurements (14.00%)
  1 (1.00%) low mild
  3 (3.00%) high mild
  10 (10.00%) high severe
Benchmarking roundtrip/simd_4bit/127
Benchmarking roundtrip/simd_4bit/127: Warming up for 3.0000 s
Benchmarking roundtrip/simd_4bit/127: Collecting 100 samples in estimated 5.0005 s (48M iterations)
Benchmarking roundtrip/simd_4bit/127: Analyzing
roundtrip/simd_4bit/127 time:   [103.42 ns 103.50 ns 103.60 ns]
                        thrpt:  [1.1417 GiB/s 1.1428 GiB/s 1.1437 GiB/s]
                 change:
                        time:   [−8.8195% −6.8690% −5.2980%] (p = 0.00 < 0.05)
                        thrpt:  [+5.5944% +7.3756% +9.6726%]
                        Performance has improved.
Found 13 outliers among 100 measurements (13.00%)
  1 (1.00%) low mild
  4 (4.00%) high mild
  8 (8.00%) high severe
Benchmarking roundtrip/scalar_2bit/127
Benchmarking roundtrip/scalar_2bit/127: Warming up for 3.0000 s
Benchmarking roundtrip/scalar_2bit/127: Collecting 100 samples in estimated 5.0011 s (21M iterations)
Benchmarking roundtrip/scalar_2bit/127: Analyzing
roundtrip/scalar_2bit/127
                        time:   [235.16 ns 235.32 ns 235.53 ns]
                        thrpt:  [514.22 MiB/s 514.69 MiB/s 515.04 MiB/s]
                 change:
                        time:   [−7.1063% −5.4602% −4.5169%] (p = 0.00 < 0.05)
                        thrpt:  [+4.7306% +5.7755% +7.6500%]
                        Performance has improved.
Found 12 outliers among 100 measurements (12.00%)
  1 (1.00%) low mild
  7 (7.00%) high mild
  4 (4.00%) high severe
Benchmarking roundtrip/scalar_4bit/127
Benchmarking roundtrip/scalar_4bit/127: Warming up for 3.0000 s
Benchmarking roundtrip/scalar_4bit/127: Collecting 100 samples in estimated 5.0004 s (29M iterations)
Benchmarking roundtrip/scalar_4bit/127: Analyzing
roundtrip/scalar_4bit/127
                        time:   [174.85 ns 174.93 ns 175.02 ns]
                        thrpt:  [692.00 MiB/s 692.38 MiB/s 692.70 MiB/s]
                 change:
                        time:   [−2.2928% −2.1442% −1.9915%] (p = 0.00 < 0.05)
                        thrpt:  [+2.0320% +2.1912% +2.3466%]
                        Performance has improved.
Found 9 outliers among 100 measurements (9.00%)
  5 (5.00%) high mild
  4 (4.00%) high severe
Benchmarking roundtrip/simd_4bit/128
Benchmarking roundtrip/simd_4bit/128: Warming up for 3.0000 s
Benchmarking roundtrip/simd_4bit/128: Collecting 100 samples in estimated 5.0001 s (50M iterations)
Benchmarking roundtrip/simd_4bit/128: Analyzing
roundtrip/simd_4bit/128 time:   [99.560 ns 99.679 ns 99.824 ns]
                        thrpt:  [1.1942 GiB/s 1.1959 GiB/s 1.1974 GiB/s]
                 change:
                        time:   [−4.9233% −4.7340% −4.5410%] (p = 0.00 < 0.05)
                        thrpt:  [+4.7570% +4.9693% +5.1783%]
                        Performance has improved.
Found 16 outliers among 100 measurements (16.00%)
  2 (2.00%) low mild
  8 (8.00%) high mild
  6 (6.00%) high severe
Benchmarking roundtrip/scalar_2bit/128
Benchmarking roundtrip/scalar_2bit/128: Warming up for 3.0000 s
Benchmarking roundtrip/scalar_2bit/128: Collecting 100 samples in estimated 5.0003 s (26M iterations)
Benchmarking roundtrip/scalar_2bit/128: Analyzing
roundtrip/scalar_2bit/128
                        time:   [191.58 ns 191.74 ns 191.95 ns]
                        thrpt:  [635.94 MiB/s 636.64 MiB/s 637.19 MiB/s]
                 change:
                        time:   [−5.6373% −4.6106% −3.8728%] (p = 0.00 < 0.05)
                        thrpt:  [+4.0288% +4.8334% +5.9740%]
                        Performance has improved.
Found 12 outliers among 100 measurements (12.00%)
  4 (4.00%) high mild
  8 (8.00%) high severe
Benchmarking roundtrip/scalar_4bit/128
Benchmarking roundtrip/scalar_4bit/128: Warming up for 3.0000 s
Benchmarking roundtrip/scalar_4bit/128: Collecting 100 samples in estimated 5.0002 s (28M iterations)
Benchmarking roundtrip/scalar_4bit/128: Analyzing
roundtrip/scalar_4bit/128
                        time:   [175.81 ns 176.60 ns 178.06 ns]
                        thrpt:  [685.57 MiB/s 691.24 MiB/s 694.33 MiB/s]
                 change:
                        time:   [−10.703% −7.1316% −4.0227%] (p = 0.00 < 0.05)
                        thrpt:  [+4.1914% +7.6792% +11.985%]
                        Performance has improved.
Found 17 outliers among 100 measurements (17.00%)
  1 (1.00%) low mild
  8 (8.00%) high mild
  8 (8.00%) high severe
Benchmarking roundtrip/simd_4bit/255
Benchmarking roundtrip/simd_4bit/255: Warming up for 3.0000 s
Benchmarking roundtrip/simd_4bit/255: Collecting 100 samples in estimated 5.0004 s (32M iterations)
Benchmarking roundtrip/simd_4bit/255: Analyzing
roundtrip/simd_4bit/255 time:   [158.25 ns 158.84 ns 160.01 ns]
                        thrpt:  [1.4842 GiB/s 1.4952 GiB/s 1.5007 GiB/s]
                 change:
                        time:   [−4.1869% −3.8877% −3.4383%] (p = 0.00 < 0.05)
                        thrpt:  [+3.5607% +4.0449% +4.3698%]
                        Performance has improved.
Found 9 outliers among 100 measurements (9.00%)
  5 (5.00%) high mild
  4 (4.00%) high severe
Benchmarking roundtrip/scalar_2bit/255
Benchmarking roundtrip/scalar_2bit/255: Warming up for 3.0000 s
Benchmarking roundtrip/scalar_2bit/255: Collecting 100 samples in estimated 5.0010 s (14M iterations)
Benchmarking roundtrip/scalar_2bit/255: Analyzing
roundtrip/scalar_2bit/255
                        time:   [358.83 ns 359.09 ns 359.43 ns]
                        thrpt:  [676.60 MiB/s 677.24 MiB/s 677.73 MiB/s]
                 change:
                        time:   [−13.764% −9.1705% −5.3748%] (p = 0.00 < 0.05)
                        thrpt:  [+5.6801% +10.096% +15.961%]
                        Performance has improved.
Found 12 outliers among 100 measurements (12.00%)
  1 (1.00%) low mild
  5 (5.00%) high mild
  6 (6.00%) high severe
Benchmarking roundtrip/scalar_4bit/255
Benchmarking roundtrip/scalar_4bit/255: Warming up for 3.0000 s
Benchmarking roundtrip/scalar_4bit/255: Collecting 100 samples in estimated 5.0006 s (16M iterations)
Benchmarking roundtrip/scalar_4bit/255: Analyzing
roundtrip/scalar_4bit/255
                        time:   [305.20 ns 305.43 ns 305.70 ns]
                        thrpt:  [795.50 MiB/s 796.20 MiB/s 796.80 MiB/s]
                 change:
                        time:   [−3.3063% −3.1541% −2.9943%] (p = 0.00 < 0.05)
                        thrpt:  [+3.0867% +3.2568% +3.4193%]
                        Performance has improved.
Found 5 outliers among 100 measurements (5.00%)
  3 (3.00%) high mild
  2 (2.00%) high severe
Benchmarking roundtrip/simd_4bit/256
Benchmarking roundtrip/simd_4bit/256: Warming up for 3.0000 s
Benchmarking roundtrip/simd_4bit/256: Collecting 100 samples in estimated 5.0000 s (33M iterations)
Benchmarking roundtrip/simd_4bit/256: Analyzing
roundtrip/simd_4bit/256 time:   [153.03 ns 153.20 ns 153.42 ns]
                        thrpt:  [1.5540 GiB/s 1.5562 GiB/s 1.5580 GiB/s]
                 change:
                        time:   [−4.5924% −4.3903% −4.1841%] (p = 0.00 < 0.05)
                        thrpt:  [+4.3668% +4.5919% +4.8134%]
                        Performance has improved.
Found 14 outliers among 100 measurements (14.00%)
  4 (4.00%) high mild
  10 (10.00%) high severe
Benchmarking roundtrip/scalar_2bit/256
Benchmarking roundtrip/scalar_2bit/256: Warming up for 3.0000 s
Benchmarking roundtrip/scalar_2bit/256: Collecting 100 samples in estimated 5.0003 s (16M iterations)
Benchmarking roundtrip/scalar_2bit/256: Analyzing
roundtrip/scalar_2bit/256
                        time:   [307.69 ns 307.84 ns 308.02 ns]
                        thrpt:  [792.61 MiB/s 793.07 MiB/s 793.46 MiB/s]
                 change:
                        time:   [−4.8947% −3.9199% −3.3322%] (p = 0.00 < 0.05)
                        thrpt:  [+3.4471% +4.0798% +5.1466%]
                        Performance has improved.
Found 9 outliers among 100 measurements (9.00%)
  2 (2.00%) high mild
  7 (7.00%) high severe
Benchmarking roundtrip/scalar_4bit/256
Benchmarking roundtrip/scalar_4bit/256: Warming up for 3.0000 s
Benchmarking roundtrip/scalar_4bit/256: Collecting 100 samples in estimated 5.0008 s (16M iterations)
Benchmarking roundtrip/scalar_4bit/256: Analyzing
roundtrip/scalar_4bit/256
                        time:   [303.12 ns 305.62 ns 309.38 ns]
                        thrpt:  [789.12 MiB/s 798.84 MiB/s 805.41 MiB/s]
                 change:
                        time:   [−3.0633% −2.6241% −1.9973%] (p = 0.00 < 0.05)
                        thrpt:  [+2.0380% +2.6948% +3.1601%]
                        Performance has improved.
Found 9 outliers among 100 measurements (9.00%)
  4 (4.00%) high mild
  5 (5.00%) high severe
Benchmarking roundtrip/simd_4bit/512
Benchmarking roundtrip/simd_4bit/512: Warming up for 3.0000 s
Benchmarking roundtrip/simd_4bit/512: Collecting 100 samples in estimated 5.0009 s (19M iterations)
Benchmarking roundtrip/simd_4bit/512: Analyzing
roundtrip/simd_4bit/512 time:   [261.13 ns 261.26 ns 261.41 ns]
                        thrpt:  [1.8241 GiB/s 1.8252 GiB/s 1.8260 GiB/s]
                 change:
                        time:   [−3.3202% −3.1760% −3.0180%] (p = 0.00 < 0.05)
                        thrpt:  [+3.1119% +3.2802% +3.4342%]
                        Performance has improved.
Found 10 outliers among 100 measurements (10.00%)
  2 (2.00%) low mild
  1 (1.00%) high mild
  7 (7.00%) high severe
Benchmarking roundtrip/scalar_2bit/512
Benchmarking roundtrip/scalar_2bit/512: Warming up for 3.0000 s
Benchmarking roundtrip/scalar_2bit/512: Collecting 100 samples in estimated 5.0019 s (8.9M iterations)
Benchmarking roundtrip/scalar_2bit/512: Analyzing
roundtrip/scalar_2bit/512
                        time:   [560.80 ns 561.06 ns 561.36 ns]
                        thrpt:  [869.82 MiB/s 870.29 MiB/s 870.68 MiB/s]
                 change:
                        time:   [−2.5183% −2.3621% −2.2020%] (p = 0.00 < 0.05)
                        thrpt:  [+2.2516% +2.4192% +2.5833%]
                        Performance has improved.
Found 12 outliers among 100 measurements (12.00%)
  2 (2.00%) low mild
  4 (4.00%) high mild
  6 (6.00%) high severe
Benchmarking roundtrip/scalar_4bit/512
Benchmarking roundtrip/scalar_4bit/512: Warming up for 3.0000 s
Benchmarking roundtrip/scalar_4bit/512: Collecting 100 samples in estimated 5.0011 s (9.2M iterations)
Benchmarking roundtrip/scalar_4bit/512: Analyzing
roundtrip/scalar_4bit/512
                        time:   [541.77 ns 542.06 ns 542.40 ns]
                        thrpt:  [900.22 MiB/s 900.79 MiB/s 901.27 MiB/s]
                 change:
                        time:   [−2.5389% −2.3674% −2.2023%] (p = 0.00 < 0.05)
                        thrpt:  [+2.2519% +2.4248% +2.6050%]
                        Performance has improved.
Found 12 outliers among 100 measurements (12.00%)
  5 (5.00%) high mild
  7 (7.00%) high severe
Benchmarking roundtrip/simd_4bit/1023
Benchmarking roundtrip/simd_4bit/1023: Warming up for 3.0000 s
Benchmarking roundtrip/simd_4bit/1023: Collecting 100 samples in estimated 5.0009 s (10M iterations)
Benchmarking roundtrip/simd_4bit/1023: Analyzing
roundtrip/simd_4bit/1023
                        time:   [478.39 ns 478.95 ns 479.72 ns]
                        thrpt:  [1.9860 GiB/s 1.9892 GiB/s 1.9915 GiB/s]
                 change:
                        time:   [−3.2619% −3.0424% −2.8188%] (p = 0.00 < 0.05)
                        thrpt:  [+2.9006% +3.1379% +3.3719%]
                        Performance has improved.
Found 12 outliers among 100 measurements (12.00%)
  4 (4.00%) high mild
  8 (8.00%) high severe
Benchmarking roundtrip/scalar_2bit/1023
Benchmarking roundtrip/scalar_2bit/1023: Warming up for 3.0000 s
Benchmarking roundtrip/scalar_2bit/1023: Collecting 100 samples in estimated 5.0027 s (4.5M iterations)
Benchmarking roundtrip/scalar_2bit/1023: Analyzing
roundtrip/scalar_2bit/1023
                        time:   [1.1175 µs 1.1181 µs 1.1188 µs]
                        thrpt:  [872.03 MiB/s 872.58 MiB/s 873.06 MiB/s]
                 change:
                        time:   [−3.5372% −3.1313% −2.8423%] (p = 0.00 < 0.05)
                        thrpt:  [+2.9254% +3.2325% +3.6669%]
                        Performance has improved.
Found 10 outliers among 100 measurements (10.00%)
  3 (3.00%) high mild
  7 (7.00%) high severe
Benchmarking roundtrip/scalar_4bit/1023
Benchmarking roundtrip/scalar_4bit/1023: Warming up for 3.0000 s
Benchmarking roundtrip/scalar_4bit/1023: Collecting 100 samples in estimated 5.0028 s (4.8M iterations)
Benchmarking roundtrip/scalar_4bit/1023: Analyzing
roundtrip/scalar_4bit/1023
                        time:   [1.0496 µs 1.0592 µs 1.0735 µs]
                        thrpt:  [908.82 MiB/s 921.07 MiB/s 929.50 MiB/s]
                 change:
                        time:   [−6.2725% −3.6062% −1.6738%] (p = 0.00 < 0.05)
                        thrpt:  [+1.7023% +3.7411% +6.6923%]
                        Performance has improved.
Found 11 outliers among 100 measurements (11.00%)
  5 (5.00%) high mild
  6 (6.00%) high severe
Benchmarking roundtrip/simd_4bit/1024
Benchmarking roundtrip/simd_4bit/1024: Warming up for 3.0000 s
Benchmarking roundtrip/simd_4bit/1024: Collecting 100 samples in estimated 5.0022 s (11M iterations)
Benchmarking roundtrip/simd_4bit/1024: Analyzing
roundtrip/simd_4bit/1024
                        time:   [474.43 ns 474.98 ns 475.68 ns]
                        thrpt:  [2.0049 GiB/s 2.0078 GiB/s 2.0102 GiB/s]
                 change:
                        time:   [−6.5018% −4.5857% −3.0778%] (p = 0.00 < 0.05)
                        thrpt:  [+3.1756% +4.8061% +6.9539%]
                        Performance has improved.
Found 11 outliers among 100 measurements (11.00%)
  2 (2.00%) high mild
  9 (9.00%) high severe
Benchmarking roundtrip/scalar_2bit/1024
Benchmarking roundtrip/scalar_2bit/1024: Warming up for 3.0000 s
Benchmarking roundtrip/scalar_2bit/1024: Collecting 100 samples in estimated 5.0046 s (4.8M iterations)
Benchmarking roundtrip/scalar_2bit/1024: Analyzing
roundtrip/scalar_2bit/1024
                        time:   [1.0308 µs 1.0314 µs 1.0322 µs]
                        thrpt:  [946.06 MiB/s 946.82 MiB/s 947.40 MiB/s]
                 change:
                        time:   [−6.7099% −4.5757% −2.8540%] (p = 0.00 < 0.05)
                        thrpt:  [+2.9379% +4.7951% +7.1925%]
                        Performance has improved.
Found 12 outliers among 100 measurements (12.00%)
  8 (8.00%) high mild
  4 (4.00%) high severe
Benchmarking roundtrip/scalar_4bit/1024
Benchmarking roundtrip/scalar_4bit/1024: Warming up for 3.0000 s
Benchmarking roundtrip/scalar_4bit/1024: Collecting 100 samples in estimated 5.0028 s (4.8M iterations)
Benchmarking roundtrip/scalar_4bit/1024: Analyzing
roundtrip/scalar_4bit/1024
                        time:   [1.0461 µs 1.0466 µs 1.0473 µs]
                        thrpt:  [932.45 MiB/s 933.08 MiB/s 933.57 MiB/s]
                 change:
                        time:   [−6.6791% −4.1979% −2.4498%] (p = 0.00 < 0.05)
                        thrpt:  [+2.5113% +4.3819% +7.1571%]
                        Performance has improved.
Found 10 outliers among 100 measurements (10.00%)
  4 (4.00%) high mild
  6 (6.00%) high severe
Benchmarking roundtrip/simd_4bit/2048
Benchmarking roundtrip/simd_4bit/2048: Warming up for 3.0000 s
Benchmarking roundtrip/simd_4bit/2048: Collecting 100 samples in estimated 5.0018 s (5.4M iterations)
Benchmarking roundtrip/simd_4bit/2048: Analyzing
roundtrip/simd_4bit/2048
                        time:   [923.47 ns 926.56 ns 932.53 ns]
                        thrpt:  [2.0454 GiB/s 2.0585 GiB/s 2.0654 GiB/s]
                 change:
                        time:   [−6.5838% −3.7132% −1.2014%] (p = 0.00 < 0.05)
                        thrpt:  [+1.2160% +3.8564% +7.0478%]
                        Performance has improved.
Found 12 outliers among 100 measurements (12.00%)
  6 (6.00%) high mild
  6 (6.00%) high severe
Benchmarking roundtrip/scalar_2bit/2048
Benchmarking roundtrip/scalar_2bit/2048: Warming up for 3.0000 s
Benchmarking roundtrip/scalar_2bit/2048: Collecting 100 samples in estimated 5.0054 s (2.5M iterations)
Benchmarking roundtrip/scalar_2bit/2048: Analyzing
roundtrip/scalar_2bit/2048
                        time:   [1.9897 µs 1.9915 µs 1.9938 µs]
                        thrpt:  [979.59 MiB/s 980.75 MiB/s 981.62 MiB/s]
                 change:
                        time:   [−10.647% −6.6346% −3.3597%] (p = 0.00 < 0.05)
                        thrpt:  [+3.4764% +7.1061% +11.916%]
                        Performance has improved.
Found 14 outliers among 100 measurements (14.00%)
  2 (2.00%) low mild
  4 (4.00%) high mild
  8 (8.00%) high severe
Benchmarking roundtrip/scalar_4bit/2048
Benchmarking roundtrip/scalar_4bit/2048: Warming up for 3.0000 s
Benchmarking roundtrip/scalar_4bit/2048: Collecting 100 samples in estimated 5.0012 s (2.5M iterations)
Benchmarking roundtrip/scalar_4bit/2048: Analyzing
roundtrip/scalar_4bit/2048
                        time:   [2.0203 µs 2.0218 µs 2.0240 µs]
                        thrpt:  [964.96 MiB/s 966.03 MiB/s 966.77 MiB/s]
                 change:
                        time:   [−2.7692% −2.5898% −2.4168%] (p = 0.00 < 0.05)
                        thrpt:  [+2.4766% +2.6587% +2.8481%]
                        Performance has improved.
Found 12 outliers among 100 measurements (12.00%)
  7 (7.00%) high mild
  5 (5.00%) high severe
Benchmarking roundtrip/simd_4bit/4095
Benchmarking roundtrip/simd_4bit/4095: Warming up for 3.0000 s
Benchmarking roundtrip/simd_4bit/4095: Collecting 100 samples in estimated 5.0009 s (2.7M iterations)
Benchmarking roundtrip/simd_4bit/4095: Analyzing
roundtrip/simd_4bit/4095
                        time:   [1.8100 µs 1.8119 µs 1.8144 µs]
                        thrpt:  [2.1020 GiB/s 2.1049 GiB/s 2.1070 GiB/s]
                 change:
                        time:   [−2.5755% −2.4443% −2.2969%] (p = 0.00 < 0.05)
                        thrpt:  [+2.3509% +2.5055% +2.6436%]
                        Performance has improved.
Found 8 outliers among 100 measurements (8.00%)
  6 (6.00%) high mild
  2 (2.00%) high severe
Benchmarking roundtrip/scalar_2bit/4095
Benchmarking roundtrip/scalar_2bit/4095: Warming up for 3.0000 s
Benchmarking roundtrip/scalar_2bit/4095: Collecting 100 samples in estimated 5.0130 s (1.2M iterations)
Benchmarking roundtrip/scalar_2bit/4095: Analyzing
roundtrip/scalar_2bit/4095
                        time:   [4.0162 µs 4.0291 µs 4.0530 µs]
                        thrpt:  [963.55 MiB/s 969.26 MiB/s 972.39 MiB/s]
                 change:
                        time:   [−2.2906% −2.0359% −1.6631%] (p = 0.00 < 0.05)
                        thrpt:  [+1.6913% +2.0782% +2.3443%]
                        Performance has improved.
Found 14 outliers among 100 measurements (14.00%)
  2 (2.00%) low mild
  4 (4.00%) high mild
  8 (8.00%) high severe
Benchmarking roundtrip/scalar_4bit/4095
Benchmarking roundtrip/scalar_4bit/4095: Warming up for 3.0000 s
Benchmarking roundtrip/scalar_4bit/4095: Collecting 100 samples in estimated 5.0017 s (1.3M iterations)
Benchmarking roundtrip/scalar_4bit/4095: Analyzing
roundtrip/scalar_4bit/4095
                        time:   [3.9885 µs 3.9918 µs 3.9962 µs]
                        thrpt:  [977.24 MiB/s 978.33 MiB/s 979.14 MiB/s]
                 change:
                        time:   [−2.5704% −2.3896% −2.2164%] (p = 0.00 < 0.05)
                        thrpt:  [+2.2666% +2.4481% +2.6382%]
                        Performance has improved.
Found 9 outliers among 100 measurements (9.00%)
  5 (5.00%) high mild
  4 (4.00%) high severe
Benchmarking roundtrip/simd_4bit/4096
Benchmarking roundtrip/simd_4bit/4096: Warming up for 3.0000 s
Benchmarking roundtrip/simd_4bit/4096: Collecting 100 samples in estimated 5.0034 s (2.8M iterations)
Benchmarking roundtrip/simd_4bit/4096: Analyzing
roundtrip/simd_4bit/4096
                        time:   [1.8070 µs 1.8083 µs 1.8100 µs]
                        thrpt:  [2.1076 GiB/s 2.1095 GiB/s 2.1111 GiB/s]
                 change:
                        time:   [−2.9676% −2.7009% −2.4610%] (p = 0.00 < 0.05)
                        thrpt:  [+2.5231% +2.7759% +3.0584%]
                        Performance has improved.
Found 8 outliers among 100 measurements (8.00%)
  5 (5.00%) high mild
  3 (3.00%) high severe
Benchmarking roundtrip/scalar_2bit/4096
Benchmarking roundtrip/scalar_2bit/4096: Warming up for 3.0000 s
Benchmarking roundtrip/scalar_2bit/4096: Collecting 100 samples in estimated 5.0129 s (1.3M iterations)
Benchmarking roundtrip/scalar_2bit/4096: Analyzing
roundtrip/scalar_2bit/4096
                        time:   [3.9031 µs 3.9070 µs 3.9118 µs]
                        thrpt:  [998.59 MiB/s 999.82 MiB/s 1000.8 MiB/s]
                 change:
                        time:   [−3.8001% −2.8221% −2.2519%] (p = 0.00 < 0.05)
                        thrpt:  [+2.3038% +2.9041% +3.9502%]
                        Performance has improved.
Found 12 outliers among 100 measurements (12.00%)
  1 (1.00%) low mild
  4 (4.00%) high mild
  7 (7.00%) high severe
Benchmarking roundtrip/scalar_4bit/4096
Benchmarking roundtrip/scalar_4bit/4096: Warming up for 3.0000 s
Benchmarking roundtrip/scalar_4bit/4096: Collecting 100 samples in estimated 5.0181 s (1.3M iterations)
Benchmarking roundtrip/scalar_4bit/4096: Analyzing
roundtrip/scalar_4bit/4096
                        time:   [3.9869 µs 3.9988 µs 4.0175 µs]
                        thrpt:  [972.31 MiB/s 976.84 MiB/s 979.78 MiB/s]
                 change:
                        time:   [−21.373% −13.781% −6.7724%] (p = 0.00 < 0.05)
                        thrpt:  [+7.2644% +15.983% +27.183%]
                        Performance has improved.
Found 11 outliers among 100 measurements (11.00%)
  3 (3.00%) high mild
  8 (8.00%) high severe
Benchmarking roundtrip/simd_4bit/8192
Benchmarking roundtrip/simd_4bit/8192: Warming up for 3.0000 s
Benchmarking roundtrip/simd_4bit/8192: Collecting 100 samples in estimated 5.0168 s (1.4M iterations)
Benchmarking roundtrip/simd_4bit/8192: Analyzing
roundtrip/simd_4bit/8192
                        time:   [3.5668 µs 3.5721 µs 3.5789 µs]
                        thrpt:  [2.1318 GiB/s 2.1359 GiB/s 2.1390 GiB/s]
                 change:
                        time:   [−2.6543% −2.4122% −2.1469%] (p = 0.00 < 0.05)
                        thrpt:  [+2.1940% +2.4718% +2.7267%]
                        Performance has improved.
Found 17 outliers among 100 measurements (17.00%)
  8 (8.00%) high mild
  9 (9.00%) high severe
Benchmarking roundtrip/scalar_2bit/8192
Benchmarking roundtrip/scalar_2bit/8192: Warming up for 3.0000 s
Benchmarking roundtrip/scalar_2bit/8192: Collecting 100 samples in estimated 5.0025 s (651k iterations)
Benchmarking roundtrip/scalar_2bit/8192: Analyzing
roundtrip/scalar_2bit/8192
                        time:   [7.6700 µs 7.6753 µs 7.6822 µs]
                        thrpt:  [1017.0 MiB/s 1017.9 MiB/s 1018.6 MiB/s]
                 change:
                        time:   [−8.8286% −5.7800% −3.2631%] (p = 0.00 < 0.05)
                        thrpt:  [+3.3731% +6.1346% +9.6835%]
                        Performance has improved.
Found 15 outliers among 100 measurements (15.00%)
  3 (3.00%) low mild
  5 (5.00%) high mild
  7 (7.00%) high severe
Benchmarking roundtrip/scalar_4bit/8192
Benchmarking roundtrip/scalar_4bit/8192: Warming up for 3.0000 s
Benchmarking roundtrip/scalar_4bit/8192: Collecting 100 samples in estimated 5.0207 s (636k iterations)
Benchmarking roundtrip/scalar_4bit/8192: Analyzing
roundtrip/scalar_4bit/8192
                        time:   [7.8804 µs 7.8858 µs 7.8937 µs]
                        thrpt:  [989.71 MiB/s 990.71 MiB/s 991.38 MiB/s]
                 change:
                        time:   [−2.6371% −2.4659% −2.2830%] (p = 0.00 < 0.05)
                        thrpt:  [+2.3363% +2.5282% +2.7085%]
                        Performance has improved.
Found 7 outliers among 100 measurements (7.00%)
  5 (5.00%) high mild
  2 (2.00%) high severe
Benchmarking roundtrip/simd_4bit/9999
Benchmarking roundtrip/simd_4bit/9999: Warming up for 3.0000 s
Benchmarking roundtrip/simd_4bit/9999: Collecting 100 samples in estimated 5.0214 s (1.2M iterations)
Benchmarking roundtrip/simd_4bit/9999: Analyzing
roundtrip/simd_4bit/9999
                        time:   [4.3553 µs 4.3590 µs 4.3638 µs]
                        thrpt:  [2.1340 GiB/s 2.1363 GiB/s 2.1381 GiB/s]
                 change:
                        time:   [−2.9188% −2.7449% −2.5597%] (p = 0.00 < 0.05)
                        thrpt:  [+2.6269% +2.8224% +3.0065%]
                        Performance has improved.
Found 7 outliers among 100 measurements (7.00%)
  5 (5.00%) high mild
  2 (2.00%) high severe
Benchmarking roundtrip/scalar_2bit/9999
Benchmarking roundtrip/scalar_2bit/9999: Warming up for 3.0000 s
Benchmarking roundtrip/scalar_2bit/9999: Collecting 100 samples in estimated 5.0072 s (525k iterations)
Benchmarking roundtrip/scalar_2bit/9999: Analyzing
roundtrip/scalar_2bit/9999
                        time:   [9.5244 µs 9.5360 µs 9.5508 µs]
                        thrpt:  [998.43 MiB/s 999.98 MiB/s 1001.2 MiB/s]
                 change:
                        time:   [−3.6346% −2.9437% −2.5018%] (p = 0.00 < 0.05)
                        thrpt:  [+2.5660% +3.0330% +3.7717%]
                        Performance has improved.
Found 15 outliers among 100 measurements (15.00%)
  2 (2.00%) low mild
  4 (4.00%) high mild
  9 (9.00%) high severe
Benchmarking roundtrip/scalar_4bit/9999
Benchmarking roundtrip/scalar_4bit/9999: Warming up for 3.0000 s
Benchmarking roundtrip/scalar_4bit/9999: Collecting 100 samples in estimated 5.0075 s (520k iterations)
Benchmarking roundtrip/scalar_4bit/9999: Analyzing
roundtrip/scalar_4bit/9999
                        time:   [9.6152 µs 9.6416 µs 9.6885 µs]
                        thrpt:  [984.24 MiB/s 989.03 MiB/s 991.74 MiB/s]
                 change:
                        time:   [−2.5921% −2.0435% −1.1772%] (p = 0.00 < 0.05)
                        thrpt:  [+1.1913% +2.0861% +2.6611%]
                        Performance has improved.
Found 8 outliers among 100 measurements (8.00%)
  8 (8.00%) high severe
Benchmarking roundtrip/simd_4bit/10000
Benchmarking roundtrip/simd_4bit/10000: Warming up for 3.0000 s
Benchmarking roundtrip/simd_4bit/10000: Collecting 100 samples in estimated 5.0188 s (1.1M iterations)
Benchmarking roundtrip/simd_4bit/10000: Analyzing
roundtrip/simd_4bit/10000
                        time:   [4.3508 µs 4.3540 µs 4.3579 µs]
                        thrpt:  [2.1371 GiB/s 2.1390 GiB/s 2.1406 GiB/s]
                 change:
                        time:   [−3.0323% −2.8465% −2.6468%] (p = 0.00 < 0.05)
                        thrpt:  [+2.7188% +2.9299% +3.1271%]
                        Performance has improved.
Found 10 outliers among 100 measurements (10.00%)
  5 (5.00%) high mild
  5 (5.00%) high severe
Benchmarking roundtrip/scalar_2bit/10000
Benchmarking roundtrip/scalar_2bit/10000: Warming up for 3.0000 s
Benchmarking roundtrip/scalar_2bit/10000: Collecting 100 samples in estimated 5.0024 s (535k iterations)
Benchmarking roundtrip/scalar_2bit/10000: Analyzing
roundtrip/scalar_2bit/10000
                        time:   [9.3328 µs 9.3389 µs 9.3464 µs]
                        thrpt:  [1020.4 MiB/s 1021.2 MiB/s 1021.9 MiB/s]
                 change:
                        time:   [−2.8071% −2.6646% −2.5203%] (p = 0.00 < 0.05)
                        thrpt:  [+2.5855% +2.7375% +2.8882%]
                        Performance has improved.
Found 13 outliers among 100 measurements (13.00%)
  6 (6.00%) high mild
  7 (7.00%) high severe
Benchmarking roundtrip/scalar_4bit/10000
Benchmarking roundtrip/scalar_4bit/10000: Warming up for 3.0000 s
Benchmarking roundtrip/scalar_4bit/10000: Collecting 100 samples in estimated 5.0069 s (520k iterations)
Benchmarking roundtrip/scalar_4bit/10000: Analyzing
roundtrip/scalar_4bit/10000
                        time:   [9.6174 µs 9.6231 µs 9.6294 µs]
                        thrpt:  [990.38 MiB/s 991.02 MiB/s 991.61 MiB/s]
                 change:
                        time:   [−2.7167% −2.5029% −2.3205%] (p = 0.00 < 0.05)
                        thrpt:  [+2.3756% +2.5672% +2.7926%]
                        Performance has improved.
Found 9 outliers among 100 measurements (9.00%)
  5 (5.00%) high mild
  4 (4.00%) high severe

Benchmarking reverse_complement/simd_high_level/15
Benchmarking reverse_complement/simd_high_level/15: Warming up for 3.0000 s
Benchmarking reverse_complement/simd_high_level/15: Collecting 100 samples in estimated 5.0001 s (56M iterations)
Benchmarking reverse_complement/simd_high_level/15: Analyzing
reverse_complement/simd_high_level/15
                        time:   [89.812 ns 89.884 ns 89.960 ns]
                        thrpt:  [159.02 MiB/s 159.15 MiB/s 159.28 MiB/s]
                 change:
                        time:   [−2.6407% −2.4855% −2.3224%] (p = 0.00 < 0.05)
                        thrpt:  [+2.3776% +2.5489% +2.7123%]
                        Performance has improved.
Found 5 outliers among 100 measurements (5.00%)
  1 (1.00%) high mild
  4 (4.00%) high severe
Benchmarking reverse_complement/simd_encoded/15
Benchmarking reverse_complement/simd_encoded/15: Warming up for 3.0000 s
Benchmarking reverse_complement/simd_encoded/15: Collecting 100 samples in estimated 5.0000 s (127M iterations)
Benchmarking reverse_complement/simd_encoded/15: Analyzing
reverse_complement/simd_encoded/15
                        time:   [39.278 ns 39.352 ns 39.429 ns]
                        thrpt:  [362.81 MiB/s 363.52 MiB/s 364.20 MiB/s]
                 change:
                        time:   [−0.8605% −0.2349% +0.2516%] (p = 0.47 > 0.05)
                        thrpt:  [−0.2509% +0.2355% +0.8680%]
                        No change in performance detected.
Found 5 outliers among 100 measurements (5.00%)
  1 (1.00%) low mild
  4 (4.00%) high mild
Benchmarking reverse_complement/scalar/15
Benchmarking reverse_complement/scalar/15: Warming up for 3.0000 s
Benchmarking reverse_complement/scalar/15: Collecting 100 samples in estimated 5.0001 s (202M iterations)
Benchmarking reverse_complement/scalar/15: Analyzing
reverse_complement/scalar/15
                        time:   [24.739 ns 24.826 ns 24.975 ns]
                        thrpt:  [572.77 MiB/s 576.22 MiB/s 578.25 MiB/s]
                 change:
                        time:   [−13.339% −12.610% −11.868%] (p = 0.00 < 0.05)
                        thrpt:  [+13.467% +14.430% +15.392%]
                        Performance has improved.
Found 11 outliers among 100 measurements (11.00%)
  4 (4.00%) high mild
  7 (7.00%) high severe
Benchmarking reverse_complement/simd_high_level/16
Benchmarking reverse_complement/simd_high_level/16: Warming up for 3.0000 s
Benchmarking reverse_complement/simd_high_level/16: Collecting 100 samples in estimated 5.0002 s (71M iterations)
Benchmarking reverse_complement/simd_high_level/16: Analyzing
reverse_complement/simd_high_level/16
                        time:   [70.135 ns 70.186 ns 70.246 ns]
                        thrpt:  [217.22 MiB/s 217.41 MiB/s 217.56 MiB/s]
                 change:
                        time:   [+0.0202% +5.9819% +13.878%] (p = 0.09 > 0.05)
                        thrpt:  [−12.187% −5.6442% −0.0202%]
                        No change in performance detected.
Found 12 outliers among 100 measurements (12.00%)
  2 (2.00%) high mild
  10 (10.00%) high severe
Benchmarking reverse_complement/simd_encoded/16
Benchmarking reverse_complement/simd_encoded/16: Warming up for 3.0000 s
Benchmarking reverse_complement/simd_encoded/16: Collecting 100 samples in estimated 5.0001 s (224M iterations)
Benchmarking reverse_complement/simd_encoded/16: Analyzing
reverse_complement/simd_encoded/16
                        time:   [22.311 ns 22.329 ns 22.355 ns]
                        thrpt:  [682.58 MiB/s 683.37 MiB/s 683.91 MiB/s]
                 change:
                        time:   [−0.9252% −0.7712% −0.6172%] (p = 0.00 < 0.05)
                        thrpt:  [+0.6210% +0.7772% +0.9338%]
                        Change within noise threshold.
Found 9 outliers among 100 measurements (9.00%)
  1 (1.00%) low mild
  4 (4.00%) high mild
  4 (4.00%) high severe
Benchmarking reverse_complement/scalar/16
Benchmarking reverse_complement/scalar/16: Warming up for 3.0000 s
Benchmarking reverse_complement/scalar/16: Collecting 100 samples in estimated 5.0001 s (207M iterations)
Benchmarking reverse_complement/scalar/16: Analyzing
reverse_complement/scalar/16
                        time:   [24.181 ns 24.189 ns 24.197 ns]
                        thrpt:  [630.59 MiB/s 630.82 MiB/s 631.02 MiB/s]
                 change:
                        time:   [+0.0675% +0.2949% +0.5355%] (p = 0.01 < 0.05)
                        thrpt:  [−0.5327% −0.2940% −0.0675%]
                        Change within noise threshold.
Found 14 outliers among 100 measurements (14.00%)
  3 (3.00%) high mild
  11 (11.00%) high severe
Benchmarking reverse_complement/simd_high_level/17
Benchmarking reverse_complement/simd_high_level/17: Warming up for 3.0000 s
Benchmarking reverse_complement/simd_high_level/17: Collecting 100 samples in estimated 5.0002 s (53M iterations)
Benchmarking reverse_complement/simd_high_level/17: Analyzing
reverse_complement/simd_high_level/17
                        time:   [94.386 ns 94.457 ns 94.551 ns]
                        thrpt:  [171.47 MiB/s 171.64 MiB/s 171.77 MiB/s]
                 change:
                        time:   [−0.8847% −0.6584% −0.4341%] (p = 0.00 < 0.05)
                        thrpt:  [+0.4360% +0.6627% +0.8926%]
                        Change within noise threshold.
Found 7 outliers among 100 measurements (7.00%)
  3 (3.00%) high mild
  4 (4.00%) high severe
Benchmarking reverse_complement/simd_encoded/17
Benchmarking reverse_complement/simd_encoded/17: Warming up for 3.0000 s
Benchmarking reverse_complement/simd_encoded/17: Collecting 100 samples in estimated 5.0002 s (119M iterations)
Benchmarking reverse_complement/simd_encoded/17: Analyzing
reverse_complement/simd_encoded/17
                        time:   [42.037 ns 42.070 ns 42.124 ns]
                        thrpt:  [384.88 MiB/s 385.37 MiB/s 385.67 MiB/s]
                 change:
                        time:   [−0.3475% −0.1359% +0.0671%] (p = 0.21 > 0.05)
                        thrpt:  [−0.0670% +0.1361% +0.3487%]
                        No change in performance detected.
Found 13 outliers among 100 measurements (13.00%)
  5 (5.00%) high mild
  8 (8.00%) high severe
Benchmarking reverse_complement/scalar/17
Benchmarking reverse_complement/scalar/17: Warming up for 3.0000 s
Benchmarking reverse_complement/scalar/17: Collecting 100 samples in estimated 5.0001 s (184M iterations)
Benchmarking reverse_complement/scalar/17: Analyzing
reverse_complement/scalar/17
                        time:   [27.051 ns 27.079 ns 27.114 ns]
                        thrpt:  [597.95 MiB/s 598.71 MiB/s 599.33 MiB/s]
                 change:
                        time:   [−0.0443% +0.1362% +0.3068%] (p = 0.13 > 0.05)
                        thrpt:  [−0.3059% −0.1360% +0.0443%]
                        No change in performance detected.
Found 12 outliers among 100 measurements (12.00%)
  4 (4.00%) high mild
  8 (8.00%) high severe
Benchmarking reverse_complement/simd_high_level/32
Benchmarking reverse_complement/simd_high_level/32: Warming up for 3.0000 s
Benchmarking reverse_complement/simd_high_level/32: Collecting 100 samples in estimated 5.0002 s (64M iterations)
Benchmarking reverse_complement/simd_high_level/32: Analyzing
reverse_complement/simd_high_level/32
                        time:   [78.237 ns 78.885 ns 79.854 ns]
                        thrpt:  [382.17 MiB/s 386.86 MiB/s 390.07 MiB/s]
                 change:
                        time:   [+0.0317% +0.6020% +1.3603%] (p = 0.07 > 0.05)
                        thrpt:  [−1.3421% −0.5984% −0.0317%]
                        No change in performance detected.
Found 12 outliers among 100 measurements (12.00%)
  6 (6.00%) high mild
  6 (6.00%) high severe
Benchmarking reverse_complement/simd_encoded/32
Benchmarking reverse_complement/simd_encoded/32: Warming up for 3.0000 s
Benchmarking reverse_complement/simd_encoded/32: Collecting 100 samples in estimated 5.0001 s (221M iterations)
Benchmarking reverse_complement/simd_encoded/32: Analyzing
reverse_complement/simd_encoded/32
                        time:   [22.591 ns 22.606 ns 22.625 ns]
                        thrpt:  [1.3172 GiB/s 1.3183 GiB/s 1.3192 GiB/s]
                 change:
                        time:   [−0.7645% −0.6327% −0.5081%] (p = 0.00 < 0.05)
                        thrpt:  [+0.5107% +0.6367% +0.7704%]
                        Change within noise threshold.
Found 6 outliers among 100 measurements (6.00%)
  2 (2.00%) high mild
  4 (4.00%) high severe
Benchmarking reverse_complement/scalar/32
Benchmarking reverse_complement/scalar/32: Warming up for 3.0000 s
Benchmarking reverse_complement/scalar/32: Collecting 100 samples in estimated 5.0000 s (149M iterations)
Benchmarking reverse_complement/scalar/32: Analyzing
reverse_complement/scalar/32
                        time:   [33.531 ns 33.552 ns 33.578 ns]
                        thrpt:  [908.87 MiB/s 909.57 MiB/s 910.12 MiB/s]
                 change:
                        time:   [−0.2281% −0.0525% +0.1055%] (p = 0.55 > 0.05)
                        thrpt:  [−0.1053% +0.0525% +0.2286%]
                        No change in performance detected.
Found 9 outliers among 100 measurements (9.00%)
  4 (4.00%) high mild
  5 (5.00%) high severe
Benchmarking reverse_complement/simd_high_level/33
Benchmarking reverse_complement/simd_high_level/33: Warming up for 3.0000 s
Benchmarking reverse_complement/simd_high_level/33: Collecting 100 samples in estimated 5.0006 s (40M iterations)
Benchmarking reverse_complement/simd_high_level/33: Analyzing
reverse_complement/simd_high_level/33
                        time:   [125.96 ns 126.06 ns 126.18 ns]
                        thrpt:  [249.42 MiB/s 249.65 MiB/s 249.85 MiB/s]
                 change:
                        time:   [−0.2320% +0.0157% +0.2575%] (p = 0.90 > 0.05)
                        thrpt:  [−0.2569% −0.0157% +0.2325%]
                        No change in performance detected.
Found 9 outliers among 100 measurements (9.00%)
  1 (1.00%) high mild
  8 (8.00%) high severe
Benchmarking reverse_complement/simd_encoded/33
Benchmarking reverse_complement/simd_encoded/33: Warming up for 3.0000 s
Benchmarking reverse_complement/simd_encoded/33: Collecting 100 samples in estimated 5.0001 s (76M iterations)
Benchmarking reverse_complement/simd_encoded/33: Analyzing
reverse_complement/simd_encoded/33
                        time:   [65.486 ns 65.680 ns 65.960 ns]
                        thrpt:  [477.13 MiB/s 479.16 MiB/s 480.58 MiB/s]
                 change:
                        time:   [+0.0793% +0.6627% +1.7372%] (p = 0.09 > 0.05)
                        thrpt:  [−1.7075% −0.6583% −0.0792%]
                        No change in performance detected.
Found 10 outliers among 100 measurements (10.00%)
  2 (2.00%) high mild
  8 (8.00%) high severe
Benchmarking reverse_complement/scalar/33
Benchmarking reverse_complement/scalar/33: Warming up for 3.0000 s
Benchmarking reverse_complement/scalar/33: Collecting 100 samples in estimated 5.0000 s (145M iterations)
Benchmarking reverse_complement/scalar/33: Analyzing
reverse_complement/scalar/33
                        time:   [34.391 ns 34.415 ns 34.449 ns]
                        thrpt:  [913.56 MiB/s 914.45 MiB/s 915.11 MiB/s]
                 change:
                        time:   [−0.3077% −0.1656% −0.0216%] (p = 0.02 < 0.05)
                        thrpt:  [+0.0216% +0.1659% +0.3086%]
                        Change within noise threshold.
Found 7 outliers among 100 measurements (7.00%)
  2 (2.00%) high mild
  5 (5.00%) high severe
Benchmarking reverse_complement/simd_high_level/63
Benchmarking reverse_complement/simd_high_level/63: Warming up for 3.0000 s
Benchmarking reverse_complement/simd_high_level/63: Collecting 100 samples in estimated 5.0004 s (43M iterations)
Benchmarking reverse_complement/simd_high_level/63: Analyzing
reverse_complement/simd_high_level/63
                        time:   [115.94 ns 116.50 ns 117.55 ns]
                        thrpt:  [511.12 MiB/s 515.71 MiB/s 518.19 MiB/s]
                 change:
                        time:   [−0.2204% +0.0510% +0.4619%] (p = 0.78 > 0.05)
                        thrpt:  [−0.4598% −0.0510% +0.2209%]
                        No change in performance detected.
Found 9 outliers among 100 measurements (9.00%)
  1 (1.00%) low mild
  4 (4.00%) high mild
  4 (4.00%) high severe
Benchmarking reverse_complement/simd_encoded/63
Benchmarking reverse_complement/simd_encoded/63: Warming up for 3.0000 s
Benchmarking reverse_complement/simd_encoded/63: Collecting 100 samples in estimated 5.0002 s (122M iterations)
Benchmarking reverse_complement/simd_encoded/63: Analyzing
reverse_complement/simd_encoded/63
                        time:   [40.992 ns 41.092 ns 41.229 ns]
                        thrpt:  [1.4231 GiB/s 1.4279 GiB/s 1.4313 GiB/s]
                 change:
                        time:   [−0.0815% +0.2028% +0.4840%] (p = 0.16 > 0.05)
                        thrpt:  [−0.4817% −0.2024% +0.0816%]
                        No change in performance detected.
Found 5 outliers among 100 measurements (5.00%)
  3 (3.00%) high mild
  2 (2.00%) high severe
Benchmarking reverse_complement/scalar/63
Benchmarking reverse_complement/scalar/63: Warming up for 3.0000 s
Benchmarking reverse_complement/scalar/63: Collecting 100 samples in estimated 5.0002 s (111M iterations)
Benchmarking reverse_complement/scalar/63: Analyzing
reverse_complement/scalar/63
                        time:   [45.153 ns 45.194 ns 45.246 ns]
                        thrpt:  [1.2968 GiB/s 1.2983 GiB/s 1.2994 GiB/s]
                 change:
                        time:   [−0.1244% +0.0456% +0.2158%] (p = 0.60 > 0.05)
                        thrpt:  [−0.2153% −0.0456% +0.1245%]
                        No change in performance detected.
Found 13 outliers among 100 measurements (13.00%)
  8 (8.00%) high mild
  5 (5.00%) high severe
Benchmarking reverse_complement/simd_high_level/64
Benchmarking reverse_complement/simd_high_level/64: Warming up for 3.0000 s
Benchmarking reverse_complement/simd_high_level/64: Collecting 100 samples in estimated 5.0002 s (52M iterations)
Benchmarking reverse_complement/simd_high_level/64: Analyzing
reverse_complement/simd_high_level/64
                        time:   [96.174 ns 96.269 ns 96.378 ns]
                        thrpt:  [633.29 MiB/s 634.01 MiB/s 634.64 MiB/s]
                 change:
                        time:   [−0.1711% +0.0182% +0.2174%] (p = 0.85 > 0.05)
                        thrpt:  [−0.2169% −0.0182% +0.1714%]
                        No change in performance detected.
Found 5 outliers among 100 measurements (5.00%)
  3 (3.00%) high mild
  2 (2.00%) high severe
Benchmarking reverse_complement/simd_encoded/64
Benchmarking reverse_complement/simd_encoded/64: Warming up for 3.0000 s
Benchmarking reverse_complement/simd_encoded/64: Collecting 100 samples in estimated 5.0001 s (198M iterations)
Benchmarking reverse_complement/simd_encoded/64: Analyzing
reverse_complement/simd_encoded/64
                        time:   [25.161 ns 25.209 ns 25.267 ns]
                        thrpt:  [2.3590 GiB/s 2.3644 GiB/s 2.3689 GiB/s]
                 change:
                        time:   [−0.0646% +0.4229% +0.9251%] (p = 0.09 > 0.05)
                        thrpt:  [−0.9166% −0.4211% +0.0647%]
                        No change in performance detected.
Found 15 outliers among 100 measurements (15.00%)
  10 (10.00%) high mild
  5 (5.00%) high severe
Benchmarking reverse_complement/scalar/64
Benchmarking reverse_complement/scalar/64: Warming up for 3.0000 s
Benchmarking reverse_complement/scalar/64: Collecting 100 samples in estimated 5.0002 s (110M iterations)
Benchmarking reverse_complement/scalar/64: Analyzing
reverse_complement/scalar/64
                        time:   [45.308 ns 45.360 ns 45.424 ns]
                        thrpt:  [1.3122 GiB/s 1.3140 GiB/s 1.3156 GiB/s]
                 change:
                        time:   [+0.3972% +1.3174% +1.9997%] (p = 0.00 < 0.05)
                        thrpt:  [−1.9605% −1.3002% −0.3956%]
                        Change within noise threshold.
Found 14 outliers among 100 measurements (14.00%)
  5 (5.00%) high mild
  9 (9.00%) high severe
Benchmarking reverse_complement/simd_high_level/127
Benchmarking reverse_complement/simd_high_level/127: Warming up for 3.0000 s
Benchmarking reverse_complement/simd_high_level/127: Collecting 100 samples in estimated 5.0007 s (33M iterations)
Benchmarking reverse_complement/simd_high_level/127: Analyzing
reverse_complement/simd_high_level/127
                        time:   [152.91 ns 153.49 ns 154.33 ns]
                        thrpt:  [784.81 MiB/s 789.08 MiB/s 792.09 MiB/s]
                 change:
                        time:   [−2.3606% −0.5091% +1.1146%] (p = 0.60 > 0.05)
                        thrpt:  [−1.1023% +0.5117% +2.4177%]
                        No change in performance detected.
Found 7 outliers among 100 measurements (7.00%)
  1 (1.00%) high mild
  6 (6.00%) high severe
Benchmarking reverse_complement/simd_encoded/127
Benchmarking reverse_complement/simd_encoded/127: Warming up for 3.0000 s
Benchmarking reverse_complement/simd_encoded/127: Collecting 100 samples in estimated 5.0000 s (93M iterations)
Benchmarking reverse_complement/simd_encoded/127: Analyzing
reverse_complement/simd_encoded/127
                        time:   [53.556 ns 53.693 ns 53.883 ns]
                        thrpt:  [2.1951 GiB/s 2.2029 GiB/s 2.2085 GiB/s]
                 change:
                        time:   [−0.3564% +0.4306% +1.7590%] (p = 0.51 > 0.05)
                        thrpt:  [−1.7286% −0.4288% +0.3576%]
                        No change in performance detected.
Found 13 outliers among 100 measurements (13.00%)
  6 (6.00%) high mild
  7 (7.00%) high severe
Benchmarking reverse_complement/scalar/127
Benchmarking reverse_complement/scalar/127: Warming up for 3.0000 s
Benchmarking reverse_complement/scalar/127: Collecting 100 samples in estimated 5.0001 s (78M iterations)
Benchmarking reverse_complement/scalar/127: Analyzing
reverse_complement/scalar/127
                        time:   [64.142 ns 64.269 ns 64.434 ns]
                        thrpt:  [1.8356 GiB/s 1.8404 GiB/s 1.8440 GiB/s]
                 change:
                        time:   [−0.0062% +0.1740% +0.3603%] (p = 0.07 > 0.05)
                        thrpt:  [−0.3590% −0.1737% +0.0062%]
                        No change in performance detected.
Found 8 outliers among 100 measurements (8.00%)
  4 (4.00%) high mild
  4 (4.00%) high severe
Benchmarking reverse_complement/simd_high_level/128
Benchmarking reverse_complement/simd_high_level/128: Warming up for 3.0000 s
Benchmarking reverse_complement/simd_high_level/128: Collecting 100 samples in estimated 5.0002 s (41M iterations)
Benchmarking reverse_complement/simd_high_level/128: Analyzing
reverse_complement/simd_high_level/128
                        time:   [122.37 ns 122.49 ns 122.65 ns]
                        thrpt:  [995.30 MiB/s 996.55 MiB/s 997.54 MiB/s]
                 change:
                        time:   [−0.4436% −0.2384% −0.0443%] (p = 0.02 < 0.05)
                        thrpt:  [+0.0444% +0.2390% +0.4456%]
                        Change within noise threshold.
Found 8 outliers among 100 measurements (8.00%)
  5 (5.00%) high mild
  3 (3.00%) high severe
Benchmarking reverse_complement/simd_encoded/128
Benchmarking reverse_complement/simd_encoded/128: Warming up for 3.0000 s
Benchmarking reverse_complement/simd_encoded/128: Collecting 100 samples in estimated 5.0000 s (199M iterations)
Benchmarking reverse_complement/simd_encoded/128: Analyzing
reverse_complement/simd_encoded/128
                        time:   [25.035 ns 25.058 ns 25.086 ns]
                        thrpt:  [4.7520 GiB/s 4.7573 GiB/s 4.7618 GiB/s]
                 change:
                        time:   [−0.4240% −0.2751% −0.1266%] (p = 0.00 < 0.05)
                        thrpt:  [+0.1268% +0.2759% +0.4258%]
                        Change within noise threshold.
Found 16 outliers among 100 measurements (16.00%)
  1 (1.00%) low mild
  7 (7.00%) high mild
  8 (8.00%) high severe
Benchmarking reverse_complement/scalar/128
Benchmarking reverse_complement/scalar/128: Warming up for 3.0000 s
Benchmarking reverse_complement/scalar/128: Collecting 100 samples in estimated 5.0003 s (79M iterations)
Benchmarking reverse_complement/scalar/128: Analyzing
reverse_complement/scalar/128
                        time:   [63.485 ns 63.560 ns 63.659 ns]
                        thrpt:  [1.8726 GiB/s 1.8755 GiB/s 1.8778 GiB/s]
                 change:
                        time:   [−1.0830% −0.4103% +0.0825%] (p = 0.20 > 0.05)
                        thrpt:  [−0.0825% +0.4120% +1.0948%]
                        No change in performance detected.
Found 9 outliers among 100 measurements (9.00%)
  4 (4.00%) high mild
  5 (5.00%) high severe
Benchmarking reverse_complement/simd_high_level/255
Benchmarking reverse_complement/simd_high_level/255: Warming up for 3.0000 s
Benchmarking reverse_complement/simd_high_level/255: Collecting 100 samples in estimated 5.0002 s (23M iterations)
Benchmarking reverse_complement/simd_high_level/255: Analyzing
reverse_complement/simd_high_level/255
                        time:   [215.20 ns 217.56 ns 221.03 ns]
                        thrpt:  [1.0744 GiB/s 1.0916 GiB/s 1.1035 GiB/s]
                 change:
                        time:   [−0.8639% −0.0855% +0.7283%] (p = 0.84 > 0.05)
                        thrpt:  [−0.7230% +0.0856% +0.8714%]
                        No change in performance detected.
Found 8 outliers among 100 measurements (8.00%)
  3 (3.00%) high mild
  5 (5.00%) high severe
Benchmarking reverse_complement/simd_encoded/255
Benchmarking reverse_complement/simd_encoded/255: Warming up for 3.0000 s
Benchmarking reverse_complement/simd_encoded/255: Collecting 100 samples in estimated 5.0002 s (84M iterations)
Benchmarking reverse_complement/simd_encoded/255: Analyzing
reverse_complement/simd_encoded/255
                        time:   [59.548 ns 60.093 ns 60.908 ns]
                        thrpt:  [3.8991 GiB/s 3.9520 GiB/s 3.9881 GiB/s]
                 change:
                        time:   [+0.3600% +0.8109% +1.3939%] (p = 0.00 < 0.05)
                        thrpt:  [−1.3747% −0.8043% −0.3588%]
                        Change within noise threshold.
Found 13 outliers among 100 measurements (13.00%)
  5 (5.00%) high mild
  8 (8.00%) high severe
Benchmarking reverse_complement/scalar/255
Benchmarking reverse_complement/scalar/255: Warming up for 3.0000 s
Benchmarking reverse_complement/scalar/255: Collecting 100 samples in estimated 5.0001 s (49M iterations)
Benchmarking reverse_complement/scalar/255: Analyzing
reverse_complement/scalar/255
                        time:   [102.71 ns 102.77 ns 102.87 ns]
                        thrpt:  [2.3086 GiB/s 2.3108 GiB/s 2.3123 GiB/s]
                 change:
                        time:   [−0.2748% −0.1055% +0.0530%] (p = 0.22 > 0.05)
                        thrpt:  [−0.0529% +0.1056% +0.2756%]
                        No change in performance detected.
Found 7 outliers among 100 measurements (7.00%)
  5 (5.00%) high mild
  2 (2.00%) high severe
Benchmarking reverse_complement/simd_high_level/256
Benchmarking reverse_complement/simd_high_level/256: Warming up for 3.0000 s
Benchmarking reverse_complement/simd_high_level/256: Collecting 100 samples in estimated 5.0008 s (27M iterations)
Benchmarking reverse_complement/simd_high_level/256: Analyzing
reverse_complement/simd_high_level/256
                        time:   [181.90 ns 182.06 ns 182.25 ns]
                        thrpt:  [1.3082 GiB/s 1.3096 GiB/s 1.3107 GiB/s]
                 change:
                        time:   [−0.1891% −0.0419% +0.0997%] (p = 0.57 > 0.05)
                        thrpt:  [−0.0996% +0.0420% +0.1895%]
                        No change in performance detected.
Found 6 outliers among 100 measurements (6.00%)
  4 (4.00%) high mild
  2 (2.00%) high severe
Benchmarking reverse_complement/simd_encoded/256
Benchmarking reverse_complement/simd_encoded/256: Warming up for 3.0000 s
Benchmarking reverse_complement/simd_encoded/256: Collecting 100 samples in estimated 5.0001 s (157M iterations)
Benchmarking reverse_complement/simd_encoded/256: Analyzing
reverse_complement/simd_encoded/256
                        time:   [31.728 ns 31.753 ns 31.784 ns]
                        thrpt:  [7.5013 GiB/s 7.5086 GiB/s 7.5144 GiB/s]
                 change:
                        time:   [−0.1134% +0.0231% +0.1559%] (p = 0.74 > 0.05)
                        thrpt:  [−0.1557% −0.0231% +0.1136%]
                        No change in performance detected.
Found 9 outliers among 100 measurements (9.00%)
  3 (3.00%) high mild
  6 (6.00%) high severe
Benchmarking reverse_complement/scalar/256
Benchmarking reverse_complement/scalar/256: Warming up for 3.0000 s
Benchmarking reverse_complement/scalar/256: Collecting 100 samples in estimated 5.0002 s (49M iterations)
Benchmarking reverse_complement/scalar/256: Analyzing
reverse_complement/scalar/256
                        time:   [101.91 ns 101.96 ns 102.02 ns]
                        thrpt:  [2.3369 GiB/s 2.3383 GiB/s 2.3395 GiB/s]
                 change:
                        time:   [−0.2255% −0.0234% +0.1651%] (p = 0.82 > 0.05)
                        thrpt:  [−0.1648% +0.0234% +0.2260%]
                        No change in performance detected.
Found 14 outliers among 100 measurements (14.00%)
  1 (1.00%) low mild
  6 (6.00%) high mild
  7 (7.00%) high severe
Benchmarking reverse_complement/simd_high_level/512
Benchmarking reverse_complement/simd_high_level/512: Warming up for 3.0000 s
Benchmarking reverse_complement/simd_high_level/512: Collecting 100 samples in estimated 5.0006 s (17M iterations)
Benchmarking reverse_complement/simd_high_level/512: Analyzing
reverse_complement/simd_high_level/512
                        time:   [296.05 ns 296.32 ns 296.66 ns]
                        thrpt:  [1.6074 GiB/s 1.6092 GiB/s 1.6107 GiB/s]
                 change:
                        time:   [+0.1597% +0.3553% +0.5553%] (p = 0.00 < 0.05)
                        thrpt:  [−0.5522% −0.3541% −0.1595%]
                        Change within noise threshold.
Found 14 outliers among 100 measurements (14.00%)
  4 (4.00%) high mild
  10 (10.00%) high severe
Benchmarking reverse_complement/simd_encoded/512
Benchmarking reverse_complement/simd_encoded/512: Warming up for 3.0000 s
Benchmarking reverse_complement/simd_encoded/512: Collecting 100 samples in estimated 5.0000 s (125M iterations)
Benchmarking reverse_complement/simd_encoded/512: Analyzing
reverse_complement/simd_encoded/512
                        time:   [39.678 ns 39.708 ns 39.749 ns]
                        thrpt:  [11.996 GiB/s 12.009 GiB/s 12.018 GiB/s]
                 change:
                        time:   [−0.1229% +0.0208% +0.1787%] (p = 0.79 > 0.05)
                        thrpt:  [−0.1784% −0.0207% +0.1231%]
                        No change in performance detected.
Found 11 outliers among 100 measurements (11.00%)
  1 (1.00%) low mild
  3 (3.00%) high mild
  7 (7.00%) high severe
Benchmarking reverse_complement/scalar/512
Benchmarking reverse_complement/scalar/512: Warming up for 3.0000 s
Benchmarking reverse_complement/scalar/512: Collecting 100 samples in estimated 5.0000 s (27M iterations)
Benchmarking reverse_complement/scalar/512: Analyzing
reverse_complement/scalar/512
                        time:   [181.62 ns 181.72 ns 181.82 ns]
                        thrpt:  [2.6225 GiB/s 2.6241 GiB/s 2.6254 GiB/s]
                 change:
                        time:   [−0.1250% +0.0421% +0.2121%] (p = 0.64 > 0.05)
                        thrpt:  [−0.2117% −0.0421% +0.1252%]
                        No change in performance detected.
Found 10 outliers among 100 measurements (10.00%)
  5 (5.00%) high mild
  5 (5.00%) high severe
Benchmarking reverse_complement/simd_high_level/1023
Benchmarking reverse_complement/simd_high_level/1023: Warming up for 3.0000 s
Benchmarking reverse_complement/simd_high_level/1023: Collecting 100 samples in estimated 5.0015 s (8.9M iterations)
Benchmarking reverse_complement/simd_high_level/1023: Analyzing
reverse_complement/simd_high_level/1023
                        time:   [562.94 ns 567.46 ns 574.38 ns]
                        thrpt:  [1.6587 GiB/s 1.6790 GiB/s 1.6924 GiB/s]
                 change:
                        time:   [+0.0333% +0.4636% +1.0425%] (p = 0.07 > 0.05)
                        thrpt:  [−1.0317% −0.4615% −0.0333%]
                        No change in performance detected.
Found 14 outliers among 100 measurements (14.00%)
  6 (6.00%) high mild
  8 (8.00%) high severe
Benchmarking reverse_complement/simd_encoded/1023
Benchmarking reverse_complement/simd_encoded/1023: Warming up for 3.0000 s
Benchmarking reverse_complement/simd_encoded/1023: Collecting 100 samples in estimated 5.0002 s (61M iterations)
Benchmarking reverse_complement/simd_encoded/1023: Analyzing
reverse_complement/simd_encoded/1023
                        time:   [81.852 ns 81.922 ns 82.016 ns]
                        thrpt:  [11.617 GiB/s 11.630 GiB/s 11.640 GiB/s]
                 change:
                        time:   [−0.6764% −0.2856% +0.0516%] (p = 0.13 > 0.05)
                        thrpt:  [−0.0516% +0.2864% +0.6810%]
                        No change in performance detected.
Found 9 outliers among 100 measurements (9.00%)
  4 (4.00%) high mild
  5 (5.00%) high severe
Benchmarking reverse_complement/scalar/1023
Benchmarking reverse_complement/scalar/1023: Warming up for 3.0000 s
Benchmarking reverse_complement/scalar/1023: Collecting 100 samples in estimated 5.0012 s (15M iterations)
Benchmarking reverse_complement/scalar/1023: Analyzing
reverse_complement/scalar/1023
                        time:   [339.36 ns 339.59 ns 339.88 ns]
                        thrpt:  [2.8031 GiB/s 2.8056 GiB/s 2.8075 GiB/s]
                 change:
                        time:   [−0.7051% −0.3842% −0.0930%] (p = 0.02 < 0.05)
                        thrpt:  [+0.0931% +0.3857% +0.7101%]
                        Change within noise threshold.
Found 7 outliers among 100 measurements (7.00%)
  2 (2.00%) high mild
  5 (5.00%) high severe
Benchmarking reverse_complement/simd_high_level/1024
Benchmarking reverse_complement/simd_high_level/1024: Warming up for 3.0000 s
Benchmarking reverse_complement/simd_high_level/1024: Collecting 100 samples in estimated 5.0024 s (9.5M iterations)
Benchmarking reverse_complement/simd_high_level/1024: Analyzing
reverse_complement/simd_high_level/1024
                        time:   [526.45 ns 526.83 ns 527.32 ns]
                        thrpt:  [1.8085 GiB/s 1.8102 GiB/s 1.8115 GiB/s]
                 change:
                        time:   [−0.1941% −0.0219% +0.1283%] (p = 0.80 > 0.05)
                        thrpt:  [−0.1282% +0.0219% +0.1944%]
                        No change in performance detected.
Found 13 outliers among 100 measurements (13.00%)
  7 (7.00%) high mild
  6 (6.00%) high severe
Benchmarking reverse_complement/simd_encoded/1024
Benchmarking reverse_complement/simd_encoded/1024: Warming up for 3.0000 s
Benchmarking reverse_complement/simd_encoded/1024: Collecting 100 samples in estimated 5.0002 s (94M iterations)
Benchmarking reverse_complement/simd_encoded/1024: Analyzing
reverse_complement/simd_encoded/1024
                        time:   [52.655 ns 52.696 ns 52.750 ns]
                        thrpt:  [18.079 GiB/s 18.098 GiB/s 18.112 GiB/s]
                 change:
                        time:   [−0.8077% −0.3002% +0.0201%] (p = 0.18 > 0.05)
                        thrpt:  [−0.0201% +0.3011% +0.8143%]
                        No change in performance detected.
Found 5 outliers among 100 measurements (5.00%)
  2 (2.00%) high mild
  3 (3.00%) high severe
Benchmarking reverse_complement/scalar/1024
Benchmarking reverse_complement/scalar/1024: Warming up for 3.0000 s
Benchmarking reverse_complement/scalar/1024: Collecting 100 samples in estimated 5.0014 s (15M iterations)
Benchmarking reverse_complement/scalar/1024: Analyzing
reverse_complement/scalar/1024
                        time:   [338.88 ns 339.11 ns 339.39 ns]
                        thrpt:  [2.8100 GiB/s 2.8123 GiB/s 2.8142 GiB/s]
                 change:
                        time:   [−0.4005% −0.2123% −0.0246%] (p = 0.03 < 0.05)
                        thrpt:  [+0.0246% +0.2128% +0.4021%]
                        Change within noise threshold.
Found 13 outliers among 100 measurements (13.00%)
  1 (1.00%) low mild
  6 (6.00%) high mild
  6 (6.00%) high severe
Benchmarking reverse_complement/simd_high_level/2048
Benchmarking reverse_complement/simd_high_level/2048: Warming up for 3.0000 s
Benchmarking reverse_complement/simd_high_level/2048: Collecting 100 samples in estimated 5.0035 s (5.0M iterations)
Benchmarking reverse_complement/simd_high_level/2048: Analyzing
reverse_complement/simd_high_level/2048
                        time:   [1.0073 µs 1.0081 µs 1.0091 µs]
                        thrpt:  [1.8901 GiB/s 1.8920 GiB/s 1.8934 GiB/s]
                 change:
                        time:   [−0.0207% +0.1545% +0.3370%] (p = 0.10 > 0.05)
                        thrpt:  [−0.3359% −0.1543% +0.0207%]
                        No change in performance detected.
Found 8 outliers among 100 measurements (8.00%)
  3 (3.00%) high mild
  5 (5.00%) high severe
Benchmarking reverse_complement/simd_encoded/2048
Benchmarking reverse_complement/simd_encoded/2048: Warming up for 3.0000 s
Benchmarking reverse_complement/simd_encoded/2048: Collecting 100 samples in estimated 5.0000 s (56M iterations)
Benchmarking reverse_complement/simd_encoded/2048: Analyzing
reverse_complement/simd_encoded/2048
                        time:   [89.972 ns 90.021 ns 90.085 ns]
                        thrpt:  [21.173 GiB/s 21.188 GiB/s 21.199 GiB/s]
                 change:
                        time:   [−0.1382% +0.0810% +0.3037%] (p = 0.49 > 0.05)
                        thrpt:  [−0.3028% −0.0810% +0.1383%]
                        No change in performance detected.
Found 13 outliers among 100 measurements (13.00%)
  3 (3.00%) high mild
  10 (10.00%) high severe
Benchmarking reverse_complement/scalar/2048
Benchmarking reverse_complement/scalar/2048: Warming up for 3.0000 s
Benchmarking reverse_complement/scalar/2048: Collecting 100 samples in estimated 5.0030 s (7.8M iterations)
Benchmarking reverse_complement/scalar/2048: Analyzing
reverse_complement/scalar/2048
                        time:   [637.68 ns 641.69 ns 649.52 ns]
                        thrpt:  [2.9365 GiB/s 2.9724 GiB/s 2.9911 GiB/s]
                 change:
                        time:   [−0.0904% +0.3854% +1.1201%] (p = 0.32 > 0.05)
                        thrpt:  [−1.1077% −0.3839% +0.0904%]
                        No change in performance detected.
Found 15 outliers among 100 measurements (15.00%)
  7 (7.00%) high mild
  8 (8.00%) high severe
Benchmarking reverse_complement/simd_high_level/4095
Benchmarking reverse_complement/simd_high_level/4095: Warming up for 3.0000 s
Benchmarking reverse_complement/simd_high_level/4095: Collecting 100 samples in estimated 5.0030 s (2.5M iterations)
Benchmarking reverse_complement/simd_high_level/4095: Analyzing
reverse_complement/simd_high_level/4095
                        time:   [2.0317 µs 2.0348 µs 2.0390 µs]
                        thrpt:  [1.8704 GiB/s 1.8742 GiB/s 1.8772 GiB/s]
                 change:
                        time:   [+0.0456% +0.3652% +0.6731%] (p = 0.03 < 0.05)
                        thrpt:  [−0.6686% −0.3639% −0.0456%]
                        Change within noise threshold.
Found 19 outliers among 100 measurements (19.00%)
  2 (2.00%) high mild
  17 (17.00%) high severe
Benchmarking reverse_complement/simd_encoded/4095
Benchmarking reverse_complement/simd_encoded/4095: Warming up for 3.0000 s
Benchmarking reverse_complement/simd_encoded/4095: Collecting 100 samples in estimated 5.0003 s (22M iterations)
Benchmarking reverse_complement/simd_encoded/4095: Analyzing
reverse_complement/simd_encoded/4095
                        time:   [223.67 ns 223.79 ns 223.92 ns]
                        thrpt:  [17.032 GiB/s 17.042 GiB/s 17.051 GiB/s]
                 change:
                        time:   [+0.0139% +0.1792% +0.3743%] (p = 0.04 < 0.05)
                        thrpt:  [−0.3729% −0.1789% −0.0139%]
                        Change within noise threshold.
Found 9 outliers among 100 measurements (9.00%)
  5 (5.00%) high mild
  4 (4.00%) high severe
Benchmarking reverse_complement/scalar/4095
Benchmarking reverse_complement/scalar/4095: Warming up for 3.0000 s
Benchmarking reverse_complement/scalar/4095: Collecting 100 samples in estimated 5.0015 s (4.0M iterations)
Benchmarking reverse_complement/scalar/4095: Analyzing
reverse_complement/scalar/4095
                        time:   [1.2469 µs 1.2486 µs 1.2508 µs]
                        thrpt:  [3.0491 GiB/s 3.0544 GiB/s 3.0586 GiB/s]
                 change:
                        time:   [−0.3234% −0.0638% +0.1845%] (p = 0.63 > 0.05)
                        thrpt:  [−0.1842% +0.0638% +0.3245%]
                        No change in performance detected.
Found 8 outliers among 100 measurements (8.00%)
  3 (3.00%) high mild
  5 (5.00%) high severe
Benchmarking reverse_complement/simd_high_level/4096
Benchmarking reverse_complement/simd_high_level/4096: Warming up for 3.0000 s
Benchmarking reverse_complement/simd_high_level/4096: Collecting 100 samples in estimated 5.0085 s (2.5M iterations)
Benchmarking reverse_complement/simd_high_level/4096: Analyzing
reverse_complement/simd_high_level/4096
                        time:   [1.9734 µs 1.9799 µs 1.9909 µs]
                        thrpt:  [1.9160 GiB/s 1.9267 GiB/s 1.9330 GiB/s]
                 change:
                        time:   [−0.3065% −0.0016% +0.3379%] (p = 0.99 > 0.05)
                        thrpt:  [−0.3368% +0.0016% +0.3074%]
                        No change in performance detected.
Found 7 outliers among 100 measurements (7.00%)
  3 (3.00%) high mild
  4 (4.00%) high severe
Benchmarking reverse_complement/simd_encoded/4096
Benchmarking reverse_complement/simd_encoded/4096: Warming up for 3.0000 s
Benchmarking reverse_complement/simd_encoded/4096: Collecting 100 samples in estimated 5.0009 s (29M iterations)
Benchmarking reverse_complement/simd_encoded/4096: Analyzing
reverse_complement/simd_encoded/4096
                        time:   [171.43 ns 171.55 ns 171.73 ns]
                        thrpt:  [22.214 GiB/s 22.237 GiB/s 22.252 GiB/s]
                 change:
                        time:   [−0.1378% +0.0007% +0.1411%] (p = 0.98 > 0.05)
                        thrpt:  [−0.1409% −0.0007% +0.1380%]
                        No change in performance detected.
Found 12 outliers among 100 measurements (12.00%)
  1 (1.00%) low mild
  6 (6.00%) high mild
  5 (5.00%) high severe
Benchmarking reverse_complement/scalar/4096
Benchmarking reverse_complement/scalar/4096: Warming up for 3.0000 s
Benchmarking reverse_complement/scalar/4096: Collecting 100 samples in estimated 5.0051 s (4.0M iterations)
Benchmarking reverse_complement/scalar/4096: Analyzing
reverse_complement/scalar/4096
                        time:   [1.2460 µs 1.2469 µs 1.2480 µs]
                        thrpt:  [3.0567 GiB/s 3.0594 GiB/s 3.0616 GiB/s]
                 change:
                        time:   [−0.1440% +0.0096% +0.1646%] (p = 0.90 > 0.05)
                        thrpt:  [−0.1643% −0.0096% +0.1442%]
                        No change in performance detected.
Found 9 outliers among 100 measurements (9.00%)
  4 (4.00%) high mild
  5 (5.00%) high severe
Benchmarking reverse_complement/simd_high_level/8192
Benchmarking reverse_complement/simd_high_level/8192: Warming up for 3.0000 s
Benchmarking reverse_complement/simd_high_level/8192: Collecting 100 samples in estimated 5.0093 s (1.3M iterations)
Benchmarking reverse_complement/simd_high_level/8192: Analyzing
reverse_complement/simd_high_level/8192
                        time:   [3.8772 µs 3.8810 µs 3.8859 µs]
                        thrpt:  [1.9633 GiB/s 1.9658 GiB/s 1.9678 GiB/s]
                 change:
                        time:   [+0.2562% +0.4177% +0.5715%] (p = 0.00 < 0.05)
                        thrpt:  [−0.5683% −0.4160% −0.2555%]
                        Change within noise threshold.
Found 7 outliers among 100 measurements (7.00%)
  4 (4.00%) high mild
  3 (3.00%) high severe
Benchmarking reverse_complement/simd_encoded/8192
Benchmarking reverse_complement/simd_encoded/8192: Warming up for 3.0000 s
Benchmarking reverse_complement/simd_encoded/8192: Collecting 100 samples in estimated 5.0001 s (16M iterations)
Benchmarking reverse_complement/simd_encoded/8192: Analyzing
reverse_complement/simd_encoded/8192
                        time:   [315.07 ns 318.12 ns 322.59 ns]
                        thrpt:  [23.651 GiB/s 23.982 GiB/s 24.215 GiB/s]
                 change:
                        time:   [−1.5424% −0.8871% −0.2596%] (p = 0.01 < 0.05)
                        thrpt:  [+0.2603% +0.8951% +1.5666%]
                        Change within noise threshold.
Found 10 outliers among 100 measurements (10.00%)
  4 (4.00%) high mild
  6 (6.00%) high severe
Benchmarking reverse_complement/scalar/8192
Benchmarking reverse_complement/scalar/8192: Warming up for 3.0000 s
Benchmarking reverse_complement/scalar/8192: Collecting 100 samples in estimated 5.0073 s (2.0M iterations)
Benchmarking reverse_complement/scalar/8192: Analyzing
reverse_complement/scalar/8192
                        time:   [2.4696 µs 2.4718 µs 2.4745 µs]
                        thrpt:  [3.0833 GiB/s 3.0865 GiB/s 3.0893 GiB/s]
                 change:
                        time:   [−0.1580% −0.0107% +0.1396%] (p = 0.88 > 0.05)
                        thrpt:  [−0.1394% +0.0107% +0.1582%]
                        No change in performance detected.
Found 10 outliers among 100 measurements (10.00%)
  5 (5.00%) high mild
  5 (5.00%) high severe
Benchmarking reverse_complement/simd_high_level/9999
Benchmarking reverse_complement/simd_high_level/9999: Warming up for 3.0000 s
Benchmarking reverse_complement/simd_high_level/9999: Collecting 100 samples in estimated 5.0142 s (1.0M iterations)
Benchmarking reverse_complement/simd_high_level/9999: Analyzing
reverse_complement/simd_high_level/9999
                        time:   [4.8118 µs 4.8152 µs 4.8192 µs]
                        thrpt:  [1.9323 GiB/s 1.9339 GiB/s 1.9353 GiB/s]
                 change:
                        time:   [−0.1254% +0.0364% +0.2069%] (p = 0.67 > 0.05)
                        thrpt:  [−0.2065% −0.0364% +0.1256%]
                        No change in performance detected.
Found 7 outliers among 100 measurements (7.00%)
  3 (3.00%) high mild
  4 (4.00%) high severe
Benchmarking reverse_complement/simd_encoded/9999
Benchmarking reverse_complement/simd_encoded/9999: Warming up for 3.0000 s
Benchmarking reverse_complement/simd_encoded/9999: Collecting 100 samples in estimated 5.0014 s (11M iterations)
Benchmarking reverse_complement/simd_encoded/9999: Analyzing
reverse_complement/simd_encoded/9999
                        time:   [465.52 ns 465.85 ns 466.27 ns]
                        thrpt:  [19.972 GiB/s 19.990 GiB/s 20.004 GiB/s]
                 change:
                        time:   [−0.0679% +0.0911% +0.2467%] (p = 0.26 > 0.05)
                        thrpt:  [−0.2461% −0.0910% +0.0680%]
                        No change in performance detected.
Found 11 outliers among 100 measurements (11.00%)
  5 (5.00%) high mild
  6 (6.00%) high severe
Benchmarking reverse_complement/scalar/9999
Benchmarking reverse_complement/scalar/9999: Warming up for 3.0000 s
Benchmarking reverse_complement/scalar/9999: Collecting 100 samples in estimated 5.0010 s (1.7M iterations)
Benchmarking reverse_complement/scalar/9999: Analyzing
reverse_complement/scalar/9999
                        time:   [3.0058 µs 3.0081 µs 3.0112 µs]
                        thrpt:  [3.0926 GiB/s 3.0958 GiB/s 3.0981 GiB/s]
                 change:
                        time:   [−0.5679% −0.3133% −0.0756%] (p = 0.01 < 0.05)
                        thrpt:  [+0.0757% +0.3143% +0.5712%]
                        Change within noise threshold.
Found 14 outliers among 100 measurements (14.00%)
  2 (2.00%) low mild
  6 (6.00%) high mild
  6 (6.00%) high severe
Benchmarking reverse_complement/simd_high_level/10000
Benchmarking reverse_complement/simd_high_level/10000: Warming up for 3.0000 s
Benchmarking reverse_complement/simd_high_level/10000: Collecting 100 samples in estimated 5.0123 s (1.1M iterations)
Benchmarking reverse_complement/simd_high_level/10000: Analyzing
reverse_complement/simd_high_level/10000
                        time:   [4.7373 µs 4.7423 µs 4.7481 µs]
                        thrpt:  [1.9614 GiB/s 1.9639 GiB/s 1.9659 GiB/s]
                 change:
                        time:   [−0.2354% −0.0511% +0.1321%] (p = 0.60 > 0.05)
                        thrpt:  [−0.1320% +0.0511% +0.2360%]
                        No change in performance detected.
Found 7 outliers among 100 measurements (7.00%)
  5 (5.00%) high mild
  2 (2.00%) high severe
Benchmarking reverse_complement/simd_encoded/10000
Benchmarking reverse_complement/simd_encoded/10000: Warming up for 3.0000 s
Benchmarking reverse_complement/simd_encoded/10000: Collecting 100 samples in estimated 5.0011 s (13M iterations)
Benchmarking reverse_complement/simd_encoded/10000: Analyzing
reverse_complement/simd_encoded/10000
                        time:   [388.48 ns 388.99 ns 389.68 ns]
                        thrpt:  [23.900 GiB/s 23.942 GiB/s 23.974 GiB/s]
                 change:
                        time:   [−0.1302% +0.0102% +0.1619%] (p = 0.89 > 0.05)
                        thrpt:  [−0.1616% −0.0102% +0.1303%]
                        No change in performance detected.
Found 12 outliers among 100 measurements (12.00%)
  2 (2.00%) low mild
  4 (4.00%) high mild
  6 (6.00%) high severe
Benchmarking reverse_complement/scalar/10000
Benchmarking reverse_complement/scalar/10000: Warming up for 3.0000 s
Benchmarking reverse_complement/scalar/10000: Collecting 100 samples in estimated 5.0021 s (1.6M iterations)
Benchmarking reverse_complement/scalar/10000: Analyzing
reverse_complement/scalar/10000
                        time:   [3.0051 µs 3.0070 µs 3.0093 µs]
                        thrpt:  [3.0949 GiB/s 3.0972 GiB/s 3.0991 GiB/s]
                 change:
                        time:   [−0.6127% −0.1608% +0.1304%] (p = 0.53 > 0.05)
                        thrpt:  [−0.1303% +0.1611% +0.6165%]
                        No change in performance detected.
Found 14 outliers among 100 measurements (14.00%)
  2 (2.00%) low mild
  7 (7.00%) high mild
  5 (5.00%) high severe