asm-lsp 0.10.1

Language Server for x86/x86_64, ARM, RISCV, and z80 Assembly Code
Documentation
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
346
347
348
349
350
351
352
353
354
355
356
357
358
359
360
361
362
363
364
365
366
367
368
369
370
371
372
373
374
375
376
377
378
379
380
381
382
383
384
385
386
387
388
389
390
391
392
393
394
395
396
397
398
399
400
401
402
403
404
405
406
407
408
409
410
411
412
413
414
415
416
417
418
419
420
421
422
423
424
425
426
427
428
429
430
431
432
433
434
435
436
437
438
439
440
441
442
443
444
445
446
447
448
449
450
451
452
453
454
455
456
457
458
459
460
461
462
463
464
465
466
467
468
469
470
471
472
473
474
475
476
477
478
479
480
481
482
483
484
485
486
487
488
489
490
491
492
493
494
495
496
497
498
499
500
501
502
503
504
505
506
507
508
509
510
511
512
513
514
515
516
517
518
519
520
521
522
523
524
525
526
527
528
529
530
531
532
533
534
535
536
537
538
539
540
541
542
543
544
545
546
547
548
549
550
551
552
553
554
555
556
557
558
559
560
561
562
563
564
565
566
567
568
569
570
571
572
573
574
575
576
577
578
579
580
581
582
583
584
585
586
587
588
589
590
591
592
593
594
595
596
597
598
599
600
601
602
603
604
605
606
607
608
609
610
611
612
613
614
615
616
617
618
619
620
621
622
623
624
625
626
627
628
629
630
631
632
633
634
635
636
637
638
639
640
641
642
643
644
645
646
647
648
649
650
651
652
653
654
655
656
657
658
659
660
661
662
663
664
665
666
667
668
669
670
671
672
673
674
675
676
677
678
679
680
681
682
683
684
685
686
687
688
689
690
691
692
693
694
695
696
697
698
699
700
701
702
703
704
705
706
707
708
709
710
711
712
713
714
715
716
717
718
719
720
721
722
723
724
725
726
727
728
729
730
731
732
733
734
735
736
737
738
739
740
741
742
743
744
745
746
747
748
749
750
751
752
753
754
755
756
757
758
759
760
761
762
763
764
765
766
767
768
769
770
771
772
773
774
775
776
777
778
779
780
781
782
783
784
785
786
787
788
789
790
791
792
793
794
795
796
797
798
799
800
801
802
803
804
805
806
807
808
809
810
811
812
813
814
815
816
817
818
819
820
821
822
823
824
825
826
827
828
829
830
831
832
833
834
835
836
837
838
839
840
841
842
843
844
845
846
847
848
849
850
851
852
853
854
855
856
857
858
859
860
861
862
863
864
865
866
867
868
869
870
871
872
873
874
875
876
877
878
879
880
881
882
883
884
885
886
887
888
889
890
891
892
893
894
895
896
897
898
899
900
901
902
903
904
905
906
907
908
909
910
911
912
913
914
915
916
917
918
�{setnbe4Set byte if not below or equal (CF == 0 and ZF == 0)setnbeSETHI	setnbeSETHI#vaesdeclast,Perform Last Round of an AES Decryption Flow
vaesdeclast vaesdeclastKvaesdeclast /vaesdeclastK/vaesdeclastvaesdeclastKvaesdeclast2vaesdeclastK2vaesdeclastHvaesdeclastH5jbJump if below (CF == 1)jbJCSNjbJCSOvfnmsub213phOFused Negative Multiply-Subtract of Packed Half-Precision Floating-Point Valuesvfnmsub213phK<vfnmsub213phKvfnmsub213phK>vfnmsub213phKvfnmsub213phR@vfnmsub213phRvfnmsub213phK<vfnmsub213phKvfnmsub213phK>vfnmsub213phKvfnmsub213phR@vfnmsub213phRvfnmsub213phRQvfnmsub213phRQvmulpd6Multiply Packed Double-Precision Floating-Point ValuesvmulpdH=vmulpdHvmulpdH?vmulpdHvmulpdHAvmulpdHvmulpdH=vmulpd vmulpdHvmulpd /vmulpdH?vmulpd vmulpdHvmulpd 2vmulpdHAvmulpdHvmulpdHQvmulpdHQ	vaddsubpdPacked Double-FP Add/Subtract	vaddsubpd 	vaddsubpd /	vaddsubpd 	vaddsubpd 2vpscatterdq=Scatter Packed Quadword Values with Signed Doubleword IndicesvpscatterdqHCvpscatterdqHCvpscatterdqHGvscatterdpdTScatter Packed Double-Precision Floating-Point Values with Signed Doubleword IndicesvscatterdpdHCvscatterdpdHCvscatterdpdHGstcSet Carry FlagstcSTCcmova#Move if above (CF == 0 and ZF == 0)cmovawcmovaw$cmovalcmoval'vpandq/Bitwise Logical AND of Packed Quadword IntegersvpandqH=vpandqHvpandqH?vpandqHvpandqHAvpandqHvpandqH=vpandqHvpandqH?vpandqHvpandqHAvpandqH
vcvttss2siIConvert with Truncation Scalar Single-Precision FP Value to Dword Integer
vcvttss2si 
vcvttss2siH
vcvttss2si '
vcvttss2siH'
vcvttss2siHRcmovnoMove if not overflow (OF == 0)cmovnowcmovnow$cmovnolcmovnol'ktestw#Bit Test 16-bit Masks and Set FlagsktestwJminsd;Return Minimum Scalar Double-Precision Floating-Point ValueminsdMINSDminsdMINSD+
aesdeclast,Perform Last Round of an AES Decryption Flow
aesdeclast'
aesdeclast'/cmppd5Compare Packed Double-Precision Floating-Point ValuescmppdCMPPDcmppdCMPPD/pmovzxwqBMove Packed Word Integers to Quadword Integers with Zero Extensionpmovzxwqpmovzxwq'	vmovshdup(Move Packed Single-FP High and Duplicate	vmovshdupH	vmovshdupH	vmovshdupH	vmovshdupH/	vmovshdupH2	vmovshdupH5	vmovshdup 	vmovshdupH	vmovshdup /	vmovshdupH/	vmovshdup 	vmovshdupH	vmovshdup 2	vmovshdupH2	vmovshdupH	vmovshdupH5	vpmovusdwMDown Convert Packed Doubleword Values to Word Values with Unsigned Saturation	vpmovusdwH	vpmovusdwH,	vpmovusdwH	vpmovusdwH0	vpmovusdwH	vpmovusdwH3	vpmovusdwH	vpmovusdwH	vpmovusdwH	vpmovusdwH+	vpmovusdwH/	vpmovusdwH2cwdConvert Word to Doublewordcwtd	vexpandpdKLoad Sparse Packed Double-Precision Floating-Point Values from Dense Memory	vexpandpdK	vexpandpdH	vexpandpdH	vexpandpdK/	vexpandpdH2	vexpandpdH5	vexpandpdK	vexpandpdK/	vexpandpdH	vexpandpdH2	vexpandpdH	vexpandpdH5vpshawPacked Shift Arithmetic Wordsvpshaw"vpshaw"/vpshaw"/vpminuw(Minimum of Packed Unsigned Word IntegersvpminuwIvpminuwI/vpminuwIvpminuwI2vpminuwIvpminuwI5vpminuw vpminuwIvpminuw /vpminuwI/vpminuw!vpminuwIvpminuw!2vpminuwI2vpminuwIvpminuwI5vrcpssOCompute Approximate Reciprocal of Scalar Single-Precision Floating-Point Valuesvrcpss vrcpss 'setneSet byte if not equal (ZF == 0)setneSETNE	setneSETNE#	vcvtph2pdLConvert Packed Half-Precision FP Values to Packed Double-Precision FP Values	vcvtph2pdK*	vcvtph2pdK.	vcvtph2pdR<	vcvtph2pdK	vcvtph2pdK	vcvtph2pdR	vcvtph2pdK*	vcvtph2pdK	vcvtph2pdK.	vcvtph2pdK	vcvtph2pdR<	vcvtph2pdR	vcvtph2pdRR	vcvtph2pdRR
prefetcht0'Prefetch Data Into Caches using T0 Hint
prefetcht0
PREFETCHT0
#blsic%Isolate Lowest Set Bit and Complementblsic6blsic6'vptestmq:Logical AND of Packed Quadword Integer Values and Set MaskvptestmqH=vptestmqH=vptestmqHvptestmqHvptestmqH?vptestmqH?vptestmqHvptestmqHvptestmqHAvptestmqHAvptestmqHvptestmqHvroundss3Round Scalar Single Precision Floating-Point Valuesvroundss vroundss 'roundss3Round Scalar Single Precision Floating-Point Valuesroundssroundss'cmovnbe0Move if not below or equal (CF == 0 and ZF == 0)cmovnbewcmovnbew$cmovnbelcmovnbel'vmovq
Move Quadwordvmovq vmovqHvmovq +vmovqH+vmovq +vmovqH+	vpblendmb*Blend Byte Vectors Using an OpMask Control	vpblendmbI	vpblendmbI/	vpblendmbI	vpblendmbI2	vpblendmbI	vpblendmbI5	vpblendmbI	vpblendmbI/	vpblendmbI	vpblendmbI2	vpblendmbI	vpblendmbI5jge#Jump if greater or equal (SF == OF)jgeJGENjgeJGEOcvtps2dqBConvert Packed Single-Precision FP Values to Packed Dword Integerscvtps2dqcvtps2dq/movntpsKStore Packed Single-Precision Floating-Point Values Using Non-Temporal HintmovntpsMOVNTPS/vfmsubsdHFused Multiply-Subtract of Scalar Double-Precision Floating-Point Valuesvfmsubsd$vfmsubsd$+vfmsubsd$+	movdir64bMOVe to DIRect store 64 Bytes	movdir64b15	prefetchw4Prefetch Data into Caches in Anticipation of a Write	prefetchwB#setnp Set byte if not parity (PF == 0)setnpSETPC	setnpSETPC#
vcvtph2psx>Convert Half-Precision FP Values to Single-Precision FP Values
vcvtph2psxK.
vcvtph2psxK<
vcvtph2psxR>
vcvtph2psxK
vcvtph2psxK
vcvtph2psxR
vcvtph2psxK.
vcvtph2psxK
vcvtph2psxK<
vcvtph2psxK
vcvtph2psxR>
vcvtph2psxR
vcvtph2psxRR
vcvtph2psxRRvpmulld?Multiply Packed Signed Doubleword Integers and Store Low ResultvpmulldH9vpmulldHvpmulldH:vpmulldHvpmulldH;vpmulldHvpmulldH9vpmulld vpmulldHvpmulld /vpmulldH:vpmulld!vpmulldHvpmulld!2vpmulldH;vpmulldH
vrsqrt14ssaCompute Approximate Reciprocal of a Square Root of a Scalar Single-Precision Floating-Point Value
vrsqrt14ssH
vrsqrt14ssH'
vrsqrt14ssH
vrsqrt14ssH'vsm3msg2=Perform Final Calculation for the Next Four SM3 Message Wordsvsm3msg2vsm3msg2/pslld)Shift Packed Doubleword Data Left Logicalpslldpslldpslld+pslldpslldpslld/vrcpshMCompute Approximate Reciprocal of Scalar Half-Precision Floating-Point ValuesvrcpshRvrcpshR$vrcpshRvrcpshR$
vcvtsd2usiSConvert Scalar Double-Precision Floating-Point Value to Unsigned Doubleword Integer
vcvtsd2usiH
vcvtsd2usiH+
vcvtsd2usiHQvcvtneobf162ps9Convert Odd Elements of Packed BF16 Values to FP32 Valuesvcvtneobf162psZ/vcvtneobf162psZ2vpaddsw6Add Packed Signed Word Integers with Signed SaturationvpaddswIvpaddswI/vpaddswIvpaddswI2vpaddswIvpaddswI5vpaddsw vpaddswIvpaddsw /vpaddswI/vpaddsw!vpaddswIvpaddsw!2vpaddswI2vpaddswIvpaddswI5vblendps4 Blend Packed Single Precision Floating-Point Valuesvblendps vblendps /vblendps vblendps 2rolRotate LeftrolbROLB	rolbROLB	rolbROLB	rolwROLWrolwROLWrolwROLWrollROLLrollROLLrollROLLrolbROLB#rolbROLB#rolbROLB#rolwROLW$rolwROLW$rolwROLW$rollROLL'rollROLL'rollROLL'pmovzxwdDMove Packed Word Integers to Doubleword Integers with Zero Extensionpmovzxwdpmovzxwd+vpdpbsudHPacked Dot Product of Signed-by-Unsinged Byte subvectors into DoublewordvpdpbsudXvpdpbsudX/vpdpbsudXvpdpbsudX2vgf2p8affineinvqb0Galois Field (2^8) Affine Inverse Transformationvgf2p8affineinvqbK=vgf2p8affineinvqbKvgf2p8affineinvqbK?vgf2p8affineinvqbKvgf2p8affineinvqbHAvgf2p8affineinvqbHvgf2p8affineinvqbK=vgf2p8affineinvqb vgf2p8affineinvqbKvgf2p8affineinvqb /vgf2p8affineinvqbK?vgf2p8affineinvqb vgf2p8affineinvqbKvgf2p8affineinvqb 2vgf2p8affineinvqbHAvgf2p8affineinvqbHcvtpd2psNConvert Packed Double-Precision FP Values to Packed Single-Precision FP Valuescvtpd2psCVTPD2PScvtpd2psCVTPD2PS/vfmaddsub132pdXFused Multiply-Alternating Add/Subtract of Packed Double-Precision Floating-Point Valuesvfmaddsub132pdH=vfmaddsub132pdHvfmaddsub132pdH?vfmaddsub132pdHvfmaddsub132pdHAvfmaddsub132pdHvfmaddsub132pdH=vfmaddsub132pd#vfmaddsub132pdHvfmaddsub132pd#/vfmaddsub132pdH?vfmaddsub132pd#vfmaddsub132pdHvfmaddsub132pd#2vfmaddsub132pdHAvfmaddsub132pdHvfmaddsub132pdHQvfmaddsub132pdHQ
vpcmpestrm3Packed Compare Explicit Length Strings, Return Maskvpcmpestrml vpcmpestrml /	vreducesdRPerform Reduction Transformation on a Scalar Double-Precision Floating-Point Value	vreducesdJ	vreducesdJ+	vreducesdJ	vreducesdJ+vshufpd5Shuffle Packed Double-Precision Floating-Point ValuesvshufpdH=vshufpdHvshufpdH?vshufpdHvshufpdHAvshufpdHvshufpdH=vshufpd vshufpdHvshufpd /vshufpdH?vshufpd vshufpdHvshufpd 2vshufpdHAvshufpdH
vpermil2pd:Permute Two-Source Double-Precision Floating-Point Vectors
vpermil2pd"
vpermil2pd"/
vpermil2pd"/
vpermil2pd"
vpermil2pd"2
vpermil2pd"2vmovsd1Move Scalar Double-Precision Floating-Point Value	vmovsdH,vmovsdH+vmovsd +vmovsdH+vmovsd +vmovsdH+vmovsdHvmovsd vmovsdHvfmadd231sdCFused Multiply-Add of Scalar Double-Precision Floating-Point Valuesvfmadd231sdHvfmadd231sdH+vfmadd231sd#vfmadd231sdHvfmadd231sd#+vfmadd231sdH+vfmadd231sdHQvfmadd231sdHQpmovzxbdDMove Packed Byte Integers to Doubleword Integers with Zero Extensionpmovzxbdpmovzxbd'vfrczss7Extract Fraction Scalar Single-Precision Floating Pointvfrczss"vfrczss"'
vrsqrt28sd�Approximation to the Reciprocal Square Root of a Scalar Double-Precision Floating-Point Value with Less Than 2^-28 Relative Error
vrsqrt28sdM
vrsqrt28sdM+
vrsqrt28sdM
vrsqrt28sdM+
vrsqrt28sdMR
vrsqrt28sdMR	vaddsubpsPacked Single-FP Add/Subtract	vaddsubps 	vaddsubps /	vaddsubps 	vaddsubps 2vandpsDBitwise Logical AND of Packed Single-Precision Floating-Point ValuesvandpsJ9vandpsJvandpsJ:vandpsJvandpsJ;vandpsJvandpsJ9vandps vandpsJvandps /vandpsJ:vandps vandpsJvandps 2vandpsJ;vandpsJvpshldw3Concatenate and Shift Packed Word Data Left LogicalvpshldwKvpshldwK/vpshldwKvpshldwK2vpshldwUvpshldwU5vpshldwKvpshldwK/vpshldwKvpshldwK2vpshldwUvpshldwU5pabsb&Packed Absolute Value of Byte Integerspabsbpabsb+pabsbpabsb/vblendpd3Blend Packed Double Precision Floating-Point Valuesvblendpd vblendpd /vblendpd vblendpd 2vpmuldqDMultiply Packed Signed Doubleword Integers and Store Quadword ResultvpmuldqH=vpmuldqHvpmuldqH?vpmuldqHvpmuldqHAvpmuldqHvpmuldqH=vpmuldq vpmuldqHvpmuldq /vpmuldqH?vpmuldq!vpmuldqHvpmuldq!2vpmuldqHAvpmuldqH	vsm4rnds4&Performs Four Rounds of SM4 Encryption	vsm4rnds4	vsm4rnds4/	vsm4rnds4	vsm4rnds42	vpmacsdqlCPacked Multiply Accumulate Signed Low Doubleword to Signed Quadword	vpmacsdql"	vpmacsdql"/bsfBit Scan ForwardbsfwBSFWbsfwBSFW$bsflBSFLbsflBSFL'vprotwPacked Rotate Wordsvprotw"vprotw"vprotw"/vprotw"/vprotw"/vfnmadd231psLFused Negative Multiply-Add of Packed Single-Precision Floating-Point Valuesvfnmadd231psH9vfnmadd231psHvfnmadd231psH:vfnmadd231psHvfnmadd231psH;vfnmadd231psHvfnmadd231psH9vfnmadd231ps#vfnmadd231psHvfnmadd231ps#/vfnmadd231psH:vfnmadd231ps#vfnmadd231psHvfnmadd231ps#2vfnmadd231psH;vfnmadd231psHvfnmadd231psHQvfnmadd231psHQxaddExchange and AddxaddbXADDB		xaddwXADDWxaddlXADDLxaddbXADDB#	xaddwXADDW$xaddlXADDL'xorps>Bitwise Logical XOR for Single-Precision Floating-Point ValuesxorpsXORPSxorpsXORPS/setnlSet byte if not less (SF == OF)setnlSETGE	setnlSETGE#kandnq$Bitwise Logical AND NOT 64-bit MaskskandnqIcmovbMove if below (CF == 1)cmovbwcmovbw$cmovblcmovbl'vinsertf64x4@Insert 256 Bits of Packed Double-Precision Floating-Point Valuesvinsertf64x4Hvinsertf64x4H2vinsertf64x4Hvinsertf64x4H2ud2Undefined Instructionud2jpoJump if parity odd (PF == 0)jpoJPCNjpoJPCOclcClear Carry FlagclcCLCpshufhwShuffle Packed High WordspshufhwPSHUFHWpshufhwPSHUFHW/kortestwOR 16-bit Masks and Set FlagskortestwHshrLogical Shift RightshrbSHRB	shrbSHRB	shrbSHRB	shrwSHRWshrwSHRWshrwSHRWshrlSHRLshrlSHRLshrlSHRLshrbSHRB#shrbSHRB#shrbSHRB#shrwSHRW$shrwSHRW$shrwSHRW$shrlSHRL'shrlSHRL'shrlSHRL'	vpermi2ps\Full Permute of Single-Precision Floating-Point Values From Two Tables Overwriting the Index	vpermi2psH9	vpermi2psH	vpermi2psH:	vpermi2psH	vpermi2psH;	vpermi2psH	vpermi2psH9	vpermi2psH	vpermi2psH:	vpermi2psH	vpermi2psH;	vpermi2psHvporPacked Bitwise Logical ORvpor vpor /vpor!vpor!2	vpermilpd.Permute Double-Precision Floating-Point Values 	vpermilpdH=	vpermilpdH?	vpermilpdHA	vpermilpdH=	vpermilpdH	vpermilpdH	vpermilpdH?	vpermilpdH	vpermilpdH	vpermilpdHA	vpermilpdH	vpermilpdH	vpermilpdH=	vpermilpdH=	vpermilpd 	vpermilpdH	vpermilpd 	vpermilpdH	vpermilpd /	vpermilpd /	vpermilpdH?	vpermilpdH?	vpermilpd 	vpermilpdH	vpermilpd 	vpermilpdH	vpermilpd 2	vpermilpd 2	vpermilpdHA	vpermilpdHA	vpermilpdH	vpermilpdHvphadddq:Packed Horizontal Add Signed Doubleword to Signed Quadwordvphadddq"vphadddq"/vfnmsub231pdQFused Negative Multiply-Subtract of Packed Double-Precision Floating-Point Valuesvfnmsub231pdH=vfnmsub231pdHvfnmsub231pdH?vfnmsub231pdHvfnmsub231pdHAvfnmsub231pdHvfnmsub231pdH=vfnmsub231pd#vfnmsub231pdHvfnmsub231pd#/vfnmsub231pdH?vfnmsub231pd#vfnmsub231pdHvfnmsub231pd#2vfnmsub231pdHAvfnmsub231pdHvfnmsub231pdHQvfnmsub231pdHQvmovlhps>Move Packed Single-Precision Floating-Point Values Low to Highvmovlhps vmovlhpsH
sha256msg2HPerform a Final Calculation for the Next Four SHA256 Message Doublewords
sha256msg2(
sha256msg2(/setnsSet byte if not sign (SF == 0)setnsSETPL	setnsSETPL#vucomisdNUnordered Compare Scalar Double-Precision Floating-Point Values and Set EFLAGSvucomisd vucomisdHvucomisd +vucomisdH+vucomisdHRxorpd>Bitwise Logical XOR for Double-Precision Floating-Point ValuesxorpdXORPDxorpdXORPD/vpcmpeqw%Compare Packed Word Data for EqualityvpcmpeqwIvpcmpeqwIvpcmpeqwI/vpcmpeqwI/vpcmpeqwIvpcmpeqwIvpcmpeqwI2vpcmpeqwI2vpcmpeqwIvpcmpeqwIvpcmpeqwI5vpcmpeqwI5vpcmpeqw vpcmpeqw /vpcmpeqw!vpcmpeqw!2vfmadd213sdCFused Multiply-Add of Scalar Double-Precision Floating-Point Valuesvfmadd213sdHvfmadd213sdH+vfmadd213sd#vfmadd213sdHvfmadd213sd#+vfmadd213sdH+vfmadd213sdHQvfmadd213sdHQjg&Jump if greater (ZF == 0 and SF == OF)jgJGTNjgJGTOinsertps3Insert Packed Single Precision Floating-Point Valueinsertpsinsertps'vfmsub231psHFused Multiply-Subtract of Packed Single-Precision Floating-Point Valuesvfmsub231psH9vfmsub231psHvfmsub231psH:vfmsub231psHvfmsub231psH;vfmsub231psHvfmsub231psH9vfmsub231ps#vfmsub231psHvfmsub231ps#/vfmsub231psH:vfmsub231ps#vfmsub231psHvfmsub231ps#2vfmsub231psH;vfmsub231psHvfmsub231psHQvfmsub231psHQvfmadd132sdCFused Multiply-Add of Scalar Double-Precision Floating-Point Valuesvfmadd132sdHvfmadd132sdH+vfmadd132sd#vfmadd132sdHvfmadd132sd#+vfmadd132sdH+vfmadd132sdHQvfmadd132sdHQsetbe/Set byte if below or equal (CF == 1 or ZF == 1)setbeSETLS	setbeSETLS#
vpclmulqdq"Carry-Less Quadword Multiplication

vpclmulqdq 
vpclmulqdqK
vpclmulqdq /
vpclmulqdqK/
vpclmulqdq
vpclmulqdqK
vpclmulqdq2
vpclmulqdqK2
vpclmulqdqH
vpclmulqdqH5vpcmpeqd+Compare Packed Doubleword Data for EqualityvpcmpeqdH9vpcmpeqdH9vpcmpeqdHvpcmpeqdHvpcmpeqdH:vpcmpeqdH:vpcmpeqdHvpcmpeqdHvpcmpeqdH;vpcmpeqdH;vpcmpeqdHvpcmpeqdHvpcmpeqd vpcmpeqd /vpcmpeqd!vpcmpeqd!2cmovnsMove if not sign (SF == 0)cmovnswcmovnsw$cmovnslcmovnsl'vmovdqu8Move Unaligned Byte Valuesvmovdqu8I0vmovdqu8Ivmovdqu8I3vmovdqu8Ivmovdqu8I6vmovdqu8Ivmovdqu8I/vmovdqu8I2vmovdqu8I5vmovdqu8Ivmovdqu8I/vmovdqu8Ivmovdqu8I2vmovdqu8Ivmovdqu8I5vmovdqu8I/vmovdqu8I2vmovdqu8I5vpshabPacked Shift Arithmetic Bytesvpshab"vpshab"/vpshab"/vsqrtssCCompute Square Root of Scalar Single-Precision Floating-Point ValuevsqrtssHvsqrtssH'vsqrtss vsqrtssHvsqrtss 'vsqrtssH'vsqrtssHQvsqrtssHQvpermbPermute Byte IntegersvpermbTvpermbT/vpermbTvpermbT2vpermbTvpermbT5vpermbTvpermbT/vpermbTvpermbT2vpermbTvpermbT5vcmpss5Compare Scalar Single-Precision Floating-Point ValuesvcmpssHvcmpssHvcmpssH'vcmpssH'vcmpss vcmpss 'vcmpssHRvcmpssHRvprotdPacked Rotate Doublewordsvprotd"vprotd"vprotd"/vprotd"/vprotd"/
vcvtusi2shFConvert Unsigned Integer to Scalar Half-Precision Floating-Point Valuevcvtusi2shlRvcvtusi2shlR'vcvtusi2shlRQpaddsb6Add Packed Signed Byte Integers with Signed Saturationpaddsbpaddsb+paddsbpaddsb/vpmuludq,Multiply Packed Unsigned Doubleword IntegersvpmuludqH=vpmuludqHvpmuludqH?vpmuludqHvpmuludqHAvpmuludqHvpmuludqH=vpmuludq vpmuludqHvpmuludq /vpmuludqH?vpmuludq!vpmuludqHvpmuludq!2vpmuludqHAvpmuludqHvinserti64x41Insert 256 Bits of Packed Quadword Integer Valuesvinserti64x4Hvinserti64x4H2vinserti64x4Hvinserti64x4H2vrndscalesh[Round Scalar Half-Precision Floating-Point Value To Include A Given Number Of Fraction BitsvrndscaleshRvrndscaleshR$vrndscaleshRvrndscaleshR$vrndscaleshRRvrndscaleshRR
vpmacssdqlSPacked Multiply Accumulate with Saturation Signed Low Doubleword to Signed Quadword
vpmacssdql"
vpmacssdql"/vpmadd52luqdPacked Multiply of Unsigned 52-bit Integers and Add the Low 52-bit Products to Quadword Accumulatorsvpmadd52luqK=vpmadd52luqKvpmadd52luqK?vpmadd52luqKvpmadd52luqOAvpmadd52luqOvpmadd52luqK=vpmadd52luqKvpmadd52luq[vpmadd52luq[/vpmadd52luqK?vpmadd52luqKvpmadd52luq[vpmadd52luq[2vpmadd52luqOAvpmadd52luqOmovss2Move Scalar Single-Precision Floating-Point ValuesmovssMOVSSmovssMOVSS'movssMOVSS'	vfmaddcshIFused Multiply-Add of Complex Scalar Half-Precision Floating-Point Values	vfmaddcshR	vfmaddcshR'	vfmaddcshR	vfmaddcshR'	vfmaddcshRQ	vfmaddcshRQvpshldvq@Concatenate and Variable Shift Packed Quadword Data Left LogicalvpshldvqK=vpshldvqKvpshldvqK?vpshldvqKvpshldvqUAvpshldvqUvpshldvqK=vpshldvqKvpshldvqK?vpshldvqKvpshldvqUAvpshldvqUvpermpd0Permute Double-Precision Floating-Point ElementsvpermpdH?vpermpdHAvpermpdH?vpermpdHvpermpdHvpermpdHAvpermpdHvpermpdHvpermpdH?vpermpdH?vpermpd!vpermpdHvpermpdHvpermpd!2vpermpdHAvpermpdHAvpermpdHvpermpdHvbroadcastss1Broadcast Single-Precision Floating-Point ElementvbroadcastssHvbroadcastssHvbroadcastssH'vbroadcastssH'vbroadcastss!vbroadcastss 'vbroadcastss!vbroadcastssHvbroadcastss 'vbroadcastssH'vbroadcastssHvbroadcastssH'setno"Set byte if not overflow (OF == 0)setnoSETOC	setnoSETOC#blcmskMask From Lowest Clear Bitblcmsk6blcmsk6'pfacc Packed Floating-Point Accumulatepfaccpfacc+pmaxub(Maximum of Packed Unsigned Byte IntegerspmaxubPMAXUB
pmaxubPMAXUB
+pmaxubPMAXUBpmaxubPMAXUB/vdppd<Dot Product of Packed Double Precision Floating-Point Valuesvdppd vdppd /das#Decimal Adjust AL after SubtractiondasDAScvtdq2pdBConvert Packed Dword Integers to Packed Double-Precision FP Valuescvtdq2pdcvtdq2pd+shlx*Logical Shift Left Without Affecting Flagsshlxl5shlxl5'vexp2pdyApproximation to the Exponential 2^x of Packed Double-Precision Floating-Point Values with Less Than 2^-23 Relative Errorvexp2pdMAvexp2pdMvexp2pdMAvexp2pdMvexp2pdMRvexp2pdMR	punpckhwd7Unpack and Interleave High-Order Words into Doublewords	punpckhwd	punpckhwd+	punpckhwd	punpckhwd/vfixupimmss;Fix Up Special Scalar Single-Precision Floating-Point ValuevfixupimmssHvfixupimmssH'vfixupimmssHvfixupimmssH'vfixupimmssHRvfixupimmssHRvfmsub132pdHFused Multiply-Subtract of Packed Double-Precision Floating-Point Valuesvfmsub132pdH=vfmsub132pdHvfmsub132pdH?vfmsub132pdHvfmsub132pdHAvfmsub132pdHvfmsub132pdH=vfmsub132pd#vfmsub132pdHvfmsub132pd#/vfmsub132pdH?vfmsub132pd#vfmsub132pdHvfmsub132pd#2vfmsub132pdHAvfmsub132pdHvfmsub132pdHQvfmsub132pdHQvfnmadd132sdLFused Negative Multiply-Add of Scalar Double-Precision Floating-Point Valuesvfnmadd132sdHvfnmadd132sdH+vfnmadd132sd#vfnmadd132sdHvfnmadd132sd#+vfnmadd132sdH+vfnmadd132sdHQvfnmadd132sdHQvfrczpd7Extract Fraction Packed Double-Precision Floating-Pointvfrczpd"vfrczpd"/vfrczpd"vfrczpd"2vpaddqAdd Packed Quadword IntegersvpaddqH=vpaddqHvpaddqH?vpaddqHvpaddqHAvpaddqHvpaddqH=vpaddq vpaddqHvpaddq /vpaddqH?vpaddq!vpaddqHvpaddq!2vpaddqHAvpaddqHpsignwPacked Sign of Word Integerspsignwpsignw+psignwpsignw/vpbroadcastwBroadcast Word IntegervpbroadcastwIvpbroadcastwIvpbroadcastwIvpbroadcastwIvpbroadcastwIvpbroadcastwIvpbroadcastwI$vpbroadcastwI$vpbroadcastwI$vpbroadcastwIvpbroadcastw!vpbroadcastwIvpbroadcastw!$vpbroadcastwI$vpbroadcastwIvpbroadcastw!vpbroadcastwIvpbroadcastw!$vpbroadcastwI$vpbroadcastwIvpbroadcastwIvpbroadcastwI$vfnmadd231ssLFused Negative Multiply-Add of Scalar Single-Precision Floating-Point Valuesvfnmadd231ssHvfnmadd231ssH'vfnmadd231ss#vfnmadd231ssHvfnmadd231ss#'vfnmadd231ssH'vfnmadd231ssHQvfnmadd231ssHQpaddwAdd Packed Word Integerspaddwpaddw+paddwpaddw/vpbroadcastmw2d?Broadcast Low Word of Mask Register to Packed Doubleword Valuesvpbroadcastmw2dNvpbroadcastmw2dNvpbroadcastmw2dNvpermt2b9Full Permute of Bytes From Two Tables Overwriting a Tablevpermt2bTvpermt2bT/vpermt2bTvpermt2bT2vpermt2bTvpermt2bT5vpermt2bTvpermt2bT/vpermt2bTvpermt2bT2vpermt2bTvpermt2bT5	vpmovusdbMDown Convert Packed Doubleword Values to Byte Values with Unsigned Saturation	vpmovusdbH	vpmovusdbH(	vpmovusdbH	vpmovusdbH,	vpmovusdbH	vpmovusdbH0	vpmovusdbH	vpmovusdbH	vpmovusdbH	vpmovusdbH'	vpmovusdbH+	vpmovusdbH/vmovntpsKStore Packed Single-Precision Floating-Point Values Using Non-Temporal Hintvmovntps /vmovntpsH/vmovntps 2vmovntpsH2vmovntpsH5
vpternlogd6Bitwise Ternary Logical Operation on Doubleword Values
vpternlogdH9
vpternlogdH
vpternlogdH:
vpternlogdH
vpternlogdH;
vpternlogdH
vpternlogdH9
vpternlogdH
vpternlogdH:
vpternlogdH
vpternlogdH;
vpternlogdHcvtps2piBConvert Packed Single-Precision FP Values to Packed Dword Integerscvtps2piCVTPS2PLcvtps2piCVTPS2PL+	vptestnmw7Logical NAND of Packed Word Integer Values and Set Mask	vptestnmwI	vptestnmwI	vptestnmwI/	vptestnmwI/	vptestnmwI	vptestnmwI	vptestnmwI2	vptestnmwI2	vptestnmwI	vptestnmwI	vptestnmwI5	vptestnmwI5vpscatterdd?Scatter Packed Doubleword Values with Signed Doubleword IndicesvpscatterddHCvpscatterddHGvpscatterddHKbtBit TestbtwBTWbtwBTWbtlBTLbtlBTLbtwBTW$btwBTW$btlBTL'btlBTL'rdpmc#Read Performance-Monitoring Counterrdpmc-kunpckwd"Unpack and Interleave 16-bit MaskskunpckwdIpopPop a Value from the StackpopwPOPWpoplPOPLpopwPOPW$poplPOPL'vphaddbq4Packed Horizontal Add Signed Byte to Signed Quadwordvphaddbq"vphaddbq"/vbroadcastsd1Broadcast Double-Precision Floating-Point Element
vbroadcastsdHvbroadcastsdHvbroadcastsdH+vbroadcastsdH+vbroadcastsd!vbroadcastsdHvbroadcastsd +vbroadcastsdH+vbroadcastsdHvbroadcastsdH+kortestqOR 64-bit Masks and Set FlagskortestqIvfnmsub231psQFused Negative Multiply-Subtract of Packed Single-Precision Floating-Point Valuesvfnmsub231psH9vfnmsub231psHvfnmsub231psH:vfnmsub231psHvfnmsub231psH;vfnmsub231psHvfnmsub231psH9vfnmsub231ps#vfnmsub231psHvfnmsub231ps#/vfnmsub231psH:vfnmsub231ps#vfnmsub231psHvfnmsub231ps#2vfnmsub231psH;vfnmsub231psHvfnmsub231psHQvfnmsub231psHQpinsrbInsert Bytepinsrbpinsrb#	vmovmskpd8Extract Packed Double-Precision Floating-Point Sign Mask	vmovmskpd 	vmovmskpd 
vrsqrt28ss�Approximation to the Reciprocal Square Root of a Scalar Single-Precision Floating-Point Value with Less Than 2^-28 Relative Error
vrsqrt28ssM
vrsqrt28ssM'
vrsqrt28ssM
vrsqrt28ssM'
vrsqrt28ssMR
vrsqrt28ssMRvstmxcsrStore MXCSR Register Statevstmxcsr 'adcAdd with CarryadcbADCBadcbADCB	adcbADCB		adcbADCB	#adcwADCWadcwADCWadcwADCWadcwADCWadcwADCW$adclADCLadclADCLadclADCLadclADCLadclADCL'adcbADCB#adcbADCB#	adcwADCW$adcwADCW$adcwADCW$adclADCL'adclADCL'adclADCL'
vcvtudq2ps\Convert Packed Unsigned Doubleword Integers to Packed Single-Precision Floating-Point Values
vcvtudq2psH9
vcvtudq2psH:
vcvtudq2psH;
vcvtudq2psH
vcvtudq2psH
vcvtudq2psH
vcvtudq2psH9
vcvtudq2psH
vcvtudq2psH:
vcvtudq2psH
vcvtudq2psH;
vcvtudq2psH
vcvtudq2psHQ
vcvtudq2psHQvdivpd4Divide Packed Double-Precision Floating-Point ValuesvdivpdH=vdivpdHvdivpdH?vdivpdHvdivpdHAvdivpdHvdivpdH=vdivpd vdivpdHvdivpd /vdivpdH?vdivpd vdivpdHvdivpd 2vdivpdHAvdivpdHvdivpdHQvdivpdHQvprorvq%Variable Rotate Packed Quadword RightvprorvqH=vprorvqHvprorvqH?vprorvqHvprorvqHAvprorvqHvprorvqH=vprorvqHvprorvqH?vprorvqHvprorvqHAvprorvqHcldClear Direction FlagcldCLDmulpd6Multiply Packed Double-Precision Floating-Point ValuesmulpdMULPDmulpdMULPD/pfrcp.Packed Floating-Point Reciprocal Approximationpfrcppfrcp+prefetchPrefetch Data into Cachesprefetch@#psubq!Subtract Packed Quadword IntegerspsubqPSUBQpsubqPSUBQ+psubqPSUBQpsubqPSUBQ/vdpps<Dot Product of Packed Single Precision Floating-Point Valuesvdpps vdpps /vdpps vdpps 2vfmsub132sdHFused Multiply-Subtract of Scalar Double-Precision Floating-Point Valuesvfmsub132sdHvfmsub132sdH+vfmsub132sd#vfmsub132sdHvfmsub132sd#+vfmsub132sdH+vfmsub132sdHQvfmsub132sdHQvmovsh0Move Scalar Half-Precision Floating-Point ValuesvmovshR%vmovshR$vmovshR$vmovshR$vmovshRvmovshR	vpdpwsudsXPacked Dot Product of Signed-by-Unsigned Word subvectors into Doubleword with Saturation	vpdpwsudsY	vpdpwsudsY/	vpdpwsudsY	vpdpwsudsY2	vpmacsdqhDPacked Multiply Accumulate Signed High Doubleword to Signed Quadword	vpmacsdqh"	vpmacsdqh"/vpshrdvqAConcatenate and Variable Shift Packed Quadword Data Right LogicalvpshrdvqK=vpshrdvqKvpshrdvqK?vpshrdvqKvpshrdvqUAvpshrdvqUvpshrdvqK=vpshrdvqKvpshrdvqK?vpshrdvqKvpshrdvqUAvpshrdvqU
vgetmantpdOExtract Normalized Mantissas from Packed Double-Precision Floating-Point Values
vgetmantpdH=
vgetmantpdH?
vgetmantpdHA
vgetmantpdH
vgetmantpdH
vgetmantpdH
vgetmantpdH=
vgetmantpdH
vgetmantpdH?
vgetmantpdH
vgetmantpdHA
vgetmantpdH
vgetmantpdHR
vgetmantpdHRvmulph4Multiply Packed Half-Precision Floating-Point ValuesvmulphK<vmulphKvmulphK>vmulphKvmulphR@vmulphRvmulphK<vmulphKvmulphK>vmulphKvmulphR@vmulphRvmulphRQvmulphRQvpbroadcastdBroadcast Doubleword IntegervpbroadcastdHvpbroadcastdHvpbroadcastdHvpbroadcastdHvpbroadcastdHvpbroadcastdHvpbroadcastdH'vpbroadcastdH'vpbroadcastdH'vpbroadcastdHvpbroadcastd!vpbroadcastdHvpbroadcastd!'vpbroadcastdH'vpbroadcastdHvpbroadcastd!vpbroadcastdHvpbroadcastd!'vpbroadcastdH'vpbroadcastdHvpbroadcastdHvpbroadcastdH'
vshufi32x40Shuffle 128-Bit Packed Doubleword Integer Values
vshufi32x4H:
vshufi32x4H
vshufi32x4H;
vshufi32x4H
vshufi32x4H:
vshufi32x4H
vshufi32x4H;
vshufi32x4HblsrReset Lowest Set Bitblsrl4blsrl4'blsfillFill From Lowest Set Bitblsfill6blsfill6'vphsubd.Packed Horizontal Subtract Doubleword Integersvphsubd vphsubd /vphsubd!vphsubd!2vmulsh:Fused Multiply Scalar Half-Precision Floating-Point ValuesvmulshRvmulshR$vmulshRvmulshR$vmulshRQvmulshRQfemmsFast Exit Multimedia Statefemmsvpsadbw#Compute Sum of Absolute Differences
vpsadbw vpsadbwIvpsadbw /vpsadbwI/vpsadbw!vpsadbwIvpsadbw!2vpsadbwI2vpsadbwIvpsadbwI5	vpblendmw*Blend Word Vectors Using an OpMask Control	vpblendmwI	vpblendmwI/	vpblendmwI	vpblendmwI2	vpblendmwI	vpblendmwI5	vpblendmwI	vpblendmwI/	vpblendmwI	vpblendmwI2	vpblendmwI	vpblendmwI5maxsd;Return Maximum Scalar Double-Precision Floating-Point ValuemaxsdMAXSDmaxsdMAXSD+vmovntdq-Store Double Quadword Using Non-Temporal Hintvmovntdq /vmovntdqH/vmovntdq 2vmovntdqH2vmovntdqH5psllw#Shift Packed Word Data Left Logicalpsllwpsllwpsllw+psllwpsllwpsllw/
vgatherqpdRGather Packed Double-Precision Floating-Point Values Using Signed Quadword Indices
vgatherqpdHD
vgatherqpdHH
vgatherqpdHL
vgatherqpd!D
vgatherqpd!HvpinsrbInsert Bytevpinsrb vpinsrbIvpinsrb #vpinsrbI#jnae$Jump if not above or equal (CF == 1)jnaeJCSNjnaeJCSOvminsd;Return Minimum Scalar Double-Precision Floating-Point ValuevminsdHvminsdH+vminsd vminsdHvminsd +vminsdH+vminsdHRvminsdHRvshufps5Shuffle Packed Single-Precision Floating-Point ValuesvshufpsH9vshufpsHvshufpsH:vshufpsHvshufpsH;vshufpsHvshufpsH9vshufps vshufpsHvshufps /vshufpsH:vshufps vshufpsHvshufps 2vshufpsH;vshufpsHmaxpd<Return Maximum Packed Double-Precision Floating-Point ValuesmaxpdMAXPDmaxpdMAXPD/pmovzxbw>Move Packed Byte Integers to Word Integers with Zero Extensionpmovzxbwpmovzxbw+vroundsd3Round Scalar Double Precision Floating-Point Valuesvroundsd vroundsd +pcmpgtq$Compare Packed Data for Greater Thanpcmpgtqpcmpgtq/kxnord!Bitwise Logical XNOR 32-bit MaskskxnordI
vgatherpf1dpsoSparse Prefetch Packed Single-Precision Floating-Point Data Values with Signed Doubleword Indices Using T1 Hint
vgatherpf1dpsLKvpcompresswBStore Sparse Packed Word Integer Values into Dense Memory/RegistervpcompresswK0vpcompresswKvpcompresswK3vpcompresswKvpcompresswU6vpcompresswUvpcompresswKvpcompresswKvpcompresswUvpcompresswK/vpcompresswK2vpcompresswU5	vpmovsxwqBMove Packed Word Integers to Quadword Integers with Sign Extension	vpmovsxwqH	vpmovsxwqH	vpmovsxwqH	vpmovsxwqH'	vpmovsxwqH+	vpmovsxwqH/	vpmovsxwq 	vpmovsxwqH	vpmovsxwq '	vpmovsxwqH'	vpmovsxwq!	vpmovsxwqH	vpmovsxwq!+	vpmovsxwqH+	vpmovsxwqH	vpmovsxwqH/vpmacsww5Packed Multiply Accumulate Signed Word to Signed Wordvpmacsww"vpmacsww"/	vphaddubq/Packed Horizontal Add Unsigned Byte to Quadword	vphaddubq"	vphaddubq"/vsubph4Subtract Packed Half-Precision Floating-Point ValuesvsubphK<vsubphKvsubphK>vsubphKvsubphR@vsubphRvsubphK<vsubphKvsubphK>vsubphKvsubphR@vsubphRvsubphRQvsubphRQxgetbv&Get Value of Extended Control Registerxgetbvpsrlq(Shift Packed Quadword Data Right Logicalpsrlqpsrlqpsrlq+psrlqpsrlqpsrlq/vfixupimmsd;Fix Up Special Scalar Double-Precision Floating-Point ValuevfixupimmsdHvfixupimmsdH+vfixupimmsdHvfixupimmsdH+vfixupimmsdHRvfixupimmsdHRpminsd,Minimum of Packed Signed Doubleword Integerspminsdpminsd/kandw Bitwise Logical AND 16-bit MaskskandwHkshiftldShift Left 32-bit MaskskshiftldItzcnt&Count the Number of Trailing Zero Bitstzcntw4tzcntw4$tzcntl4tzcntl4'vrangesdYRange Restriction Calculation For a pair of Scalar Double-Precision Floating-Point ValuesvrangesdJvrangesdJ+vrangesdJvrangesdJ+vrangesdJRvrangesdJRvpermdPermute Doubleword Integers
vpermdH:vpermdHvpermdH;vpermdHvpermdH:vpermd!vpermdHvpermd!2vpermdH;vpermdHvsubpd6Subtract Packed Double-Precision Floating-Point ValuesvsubpdH=vsubpdHvsubpdH?vsubpdHvsubpdHAvsubpdHvsubpdH=vsubpd vsubpdHvsubpd /vsubpdH?vsubpd vsubpdHvsubpd 2vsubpdHAvsubpdHvsubpdHQvsubpdHQvpshrdvw=Concatenate and Variable Shift Packed Word Data Right LogicalvpshrdvwKvpshrdvwK/vpshrdvwKvpshrdvwK2vpshrdvwUvpshrdvwU5vpshrdvwKvpshrdvwK/vpshrdvwKvpshrdvwK2vpshrdvwUvpshrdvwU5jnlJump if not less (SF == OF)jnlJGENjnlJGEOmovdiriMOVe to DIRect store Integermovdiri0'	vcvtph2uwZConvert Packed Half-Precision Floating-Point Values to Packed Unsigned Word Integer Values	vcvtph2uwK<	vcvtph2uwK>	vcvtph2uwR@	vcvtph2uwK	vcvtph2uwK	vcvtph2uwR	vcvtph2uwK<	vcvtph2uwK	vcvtph2uwK>	vcvtph2uwK	vcvtph2uwR@	vcvtph2uwR	vcvtph2uwRQ	vcvtph2uwRQcmovneMove if not equal (ZF == 0)cmovnewcmovnew$cmovnelcmovnel'extrq
Extract Fieldextrqextrqvfmsub231sdHFused Multiply-Subtract of Scalar Double-Precision Floating-Point Valuesvfmsub231sdHvfmsub231sdH+vfmsub231sd#vfmsub231sdHvfmsub231sd#+vfmsub231sdH+vfmsub231sdHQvfmsub231sdHQ	vpdpbuudsZPacked Dot Product of Unsigned-by-Unsinged Byte subvectors into Doubleword with Saturation	vpdpbuudsX	vpdpbuudsX/	vpdpbuudsX	vpdpbuudsX2	vpexpandbALoad Sparse Packed Byte Integer Values from Dense Memory/Register	vpexpandbK	vpexpandbK	vpexpandbU	vpexpandbK/	vpexpandbK2	vpexpandbU5	vpexpandbK	vpexpandbK/	vpexpandbK	vpexpandbK2	vpexpandbU	vpexpandbU5	pmaddubsw9Multiply and Add Packed Signed and Unsigned Byte Integers	pmaddubsw	pmaddubsw+	pmaddubsw	pmaddubsw/vfmadd231pdCFused Multiply-Add of Packed Double-Precision Floating-Point Valuesvfmadd231pdH=vfmadd231pdHvfmadd231pdH?vfmadd231pdHvfmadd231pdHAvfmadd231pdHvfmadd231pdH=vfmadd231pd#vfmadd231pdHvfmadd231pd#/vfmadd231pdH?vfmadd231pd#vfmadd231pdHvfmadd231pd#2vfmadd231pdHAvfmadd231pdHvfmadd231pdHQvfmadd231pdHQvphsubw(Packed Horizontal Subtract Word Integersvphsubw vphsubw /vphsubw!vphsubw!2vscatterpf0qps�Sparse Prefetch Packed Single-Precision Floating-Point Data Values with Signed Quadword Indices Using T0 Hint with Intent to Writevscatterpf0qpsLMvaesenclast,Perform Last Round of an AES Encryption Flow
vaesenclast vaesenclastKvaesenclast /vaesenclastK/vaesenclastvaesenclastKvaesenclast2vaesenclastK2vaesenclastHvaesenclastH5subSubtractsubbSUBBsubbSUBB	subbSUBB		subbSUBB	#subwSUBWsubwSUBWsubwSUBWsubwSUBWsubwSUBW$sublSUBLsublSUBLsublSUBLsublSUBLsublSUBL'subbSUBB#subbSUBB#	subwSUBW$subwSUBW$subwSUBW$sublSUBL'sublSUBL'sublSUBL'movhps7Move High Packed Single-Precision Floating-Point ValuesmovhpsMOVHPS+movhpsMOVHPS+vbcstnebf162ps;Load BF16 Element and Convert to FP32 Element With Broadcasvbcstnebf162psZ$vbcstnebf162psZ$rorx,Rotate Right Logical Without Affecting Flagsrorxl5rorxl5'	vcvtps2qq^Convert Packed Single Precision Floating-Point Values to Packed Singed Quadword Integer Values	vcvtps2qqJ8	vcvtps2qqJ9	vcvtps2qqJ:	vcvtps2qqJ	vcvtps2qqJ	vcvtps2qqJ	vcvtps2qqJ8	vcvtps2qqJ	vcvtps2qqJ9	vcvtps2qqJ	vcvtps2qqJ:	vcvtps2qqJ	vcvtps2qqJQ	vcvtps2qqJQvfmsub132shFFused Multiply-Subtract of Scalar Half-Precision Floating-Point Valuesvfmsub132shRvfmsub132shR$vfmsub132shRvfmsub132shR$vfmsub132shRQvfmsub132shRQvpabsw&Packed Absolute Value of Word IntegersvpabswIvpabswIvpabswIvpabswI/vpabswI2vpabswI5vpabsw vpabswIvpabsw /vpabswI/vpabsw!vpabswIvpabsw!2vpabswI2vpabswIvpabswI5vpmulhw:Multiply Packed Signed Word Integers and Store High ResultvpmulhwIvpmulhwI/vpmulhwIvpmulhwI2vpmulhwIvpmulhwI5vpmulhw vpmulhwIvpmulhw /vpmulhwI/vpmulhw!vpmulhwIvpmulhw!2vpmulhwI2vpmulhwIvpmulhwI5vpopcntb)Packed Population Count for Byte IntegersvpopcntbKvpopcntbKvpopcntbSvpopcntbK/vpopcntbK2vpopcntbS5vpopcntbKvpopcntbK/vpopcntbKvpopcntbK2vpopcntbSvpopcntbS5pmaddwd,Multiply and Add Packed Signed Word Integerspmaddwdpmaddwd+pmaddwdpmaddwd/vpscatterqq;Scatter Packed Quadword Values with Signed Quadword IndicesvpscatterqqHEvpscatterqqHIvpscatterqqHMvinserti64x21Insert 128 Bits of Packed Quadword Integer Valuesvinserti64x2Jvinserti64x2J/vinserti64x2Jvinserti64x2J/vinserti64x2Jvinserti64x2J/vinserti64x2Jvinserti64x2J/orps<Bitwise Logical OR of Single-Precision Floating-Point ValuesorpsORPSorpsORPS/vmaskmovdqu'Store Selected Bytes of Double Quadwordvmaskmovdqu vcvtneebf162ps:Convert Even Elements of Packed BF16 Values to FP32 Valuesvcvtneebf162psZ/vcvtneebf162psZ2movnti(Store Doubleword Using Non-Temporal Hintmovntil'psrad-Shift Packed Doubleword Data Right Arithmeticpsradpsradpsrad+psradpsradpsrad/	vblendmpdLBlend Packed Double-Precision Floating-Point Vectors Using an OpMask Control	vblendmpdH=	vblendmpdH	vblendmpdH?	vblendmpdH	vblendmpdHA	vblendmpdH	vblendmpdH=	vblendmpdH	vblendmpdH?	vblendmpdH	vblendmpdHA	vblendmpdHvbroadcastf32x26Broadcast Two Single-Precision Floating-Point Elementsvbroadcastf32x2Jvbroadcastf32x2Jvbroadcastf32x2J+vbroadcastf32x2J+vbroadcastf32x2Jvbroadcastf32x2J+vbroadcastf32x2Jvbroadcastf32x2J+
vpunpcklwd6Unpack and Interleave Low-Order Words into Doublewords
vpunpcklwdI
vpunpcklwdI/
vpunpcklwdI
vpunpcklwdI2
vpunpcklwdI
vpunpcklwdI5
vpunpcklwd 
vpunpcklwdI
vpunpcklwd /
vpunpcklwdI/
vpunpcklwd!
vpunpcklwdI
vpunpcklwd!2
vpunpcklwdI2
vpunpcklwdI
vpunpcklwdI5vpermwPermute Word IntegersvpermwIvpermwI/vpermwIvpermwI2vpermwIvpermwI5vpermwIvpermwI/vpermwIvpermwI2vpermwIvpermwI5incIncrement by 1incbINCB	incwINCWinclINCLincbINCB#incwINCW$inclINCL'	vmovdqu16Move Unaligned Word Values	vmovdqu16I0	vmovdqu16I	vmovdqu16I3	vmovdqu16I	vmovdqu16I6	vmovdqu16I	vmovdqu16I/	vmovdqu16I2	vmovdqu16I5	vmovdqu16I	vmovdqu16I/	vmovdqu16I	vmovdqu16I2	vmovdqu16I	vmovdqu16I5	vmovdqu16I/	vmovdqu16I2	vmovdqu16I5vbroadcastf64x47Broadcast Four Double-Precision Floating-Point Elementsvbroadcastf64x4H2vbroadcastf64x4H2	vpdpwuudsZPacked Dot Product of Unsigned-by-Unsigned Word subvectors into Doubleword with Saturation	vpdpwuudsY	vpdpwuudsY/	vpdpwuudsY	vpdpwuudsY2
vgf2p8mulbGalois Field Multiply Bytes
vgf2p8mulb
vgf2p8mulb/
vgf2p8mulb
vgf2p8mulb2
vgf2p8mulb
vgf2p8mulb5
vgf2p8mulb
vgf2p8mulb
vgf2p8mulb/
vgf2p8mulb/
vgf2p8mulb
vgf2p8mulb
vgf2p8mulb2
vgf2p8mulb2
vgf2p8mulb
vgf2p8mulb5	vgetexppslExtract Exponents of Packed Single-Precision Floating-Point Values as Single-Precision Floating-Point Values	vgetexppsH9	vgetexppsH:	vgetexppsH;	vgetexppsH	vgetexppsH	vgetexppsH	vgetexppsH9	vgetexppsH	vgetexppsH:	vgetexppsH	vgetexppsH;	vgetexppsH	vgetexppsHR	vgetexppsHR	vcvtqq2pdQConvert Packed Quadword Integers to Packed Double-Precision Floating-Point Values	vcvtqq2pdJ=	vcvtqq2pdJ?	vcvtqq2pdJA	vcvtqq2pdJ	vcvtqq2pdJ	vcvtqq2pdJ	vcvtqq2pdJ=	vcvtqq2pdJ	vcvtqq2pdJ?	vcvtqq2pdJ	vcvtqq2pdJA	vcvtqq2pdJ	vcvtqq2pdJQ	vcvtqq2pdJQvpermi2q?Full Permute of Quadwords From Two Tables Overwriting the Indexvpermi2qH=vpermi2qHvpermi2qH?vpermi2qHvpermi2qHAvpermi2qHvpermi2qH=vpermi2qHvpermi2qH?vpermi2qHvpermi2qHAvpermi2qHpaddusb:Add Packed Unsigned Byte Integers with Unsigned Saturationpaddusbpaddusb+paddusbpaddusb/	vcvtsi2sh7Convert Dword Integer to Scalar Half-Precision FP Value
vcvtsi2shlR
vcvtsi2shlR'
vcvtsi2shlRQ
vgetmantssMExtract Normalized Mantissa from Scalar Single-Precision Floating-Point Value
vgetmantssH
vgetmantssH'
vgetmantssH
vgetmantssH'
vgetmantssHR
vgetmantssHRvmulsd6Multiply Scalar Double-Precision Floating-Point ValuesvmulsdHvmulsdH+vmulsd vmulsdHvmulsd +vmulsdH+vmulsdHQvmulsdHQphaddd(Packed Horizontal Add Doubleword Integerphadddphaddd+phadddphaddd/
phminposuw3Packed Horizontal Minimum of Unsigned Word Integers
phminposuw
phminposuw/vpsubsw;Subtract Packed Signed Word Integers with Signed SaturationvpsubswIvpsubswI/vpsubswIvpsubswI2vpsubswIvpsubswI5vpsubsw vpsubswIvpsubsw /vpsubswI/vpsubsw!vpsubswIvpsubsw!2vpsubswI2vpsubswIvpsubswI5haddpsPacked Single-FP Horizontal Addhaddpshaddps/minss;Return Minimum Scalar Single-Precision Floating-Point ValueminssMINSSminssMINSS'	vpexpandwALoad Sparse Packed Word Integer Values from Dense Memory/Register	vpexpandwK	vpexpandwK	vpexpandwU	vpexpandwK/	vpexpandwK2	vpexpandwU5	vpexpandwK	vpexpandwK/	vpexpandwK	vpexpandwK2	vpexpandwU	vpexpandwU5vprotbPacked Rotate Bytesvprotb"vprotb"vprotb"/vprotb"/vprotb"/pmovzxbqBMove Packed Byte Integers to Quadword Integers with Zero Extensionpmovzxbqpmovzxbq$vprordRotate Packed Doubleword RightvprordH9vprordH:vprordH;vprordHvprordHvprordHvprordH9vprordHvprordH:vprordHvprordH;vprordHvfrczps7Extract Fraction Packed Single-Precision Floating-Pointvfrczps"vfrczps"/vfrczps"vfrczps"2cmpsd5Compare Scalar Double-Precision Floating-Point ValuescmpsdCMPSDcmpsdCMPSD+pfmaxPacked Floating-Point Maximumpfmaxpfmax+vplzcntq@Count the Number of Leading Zero Bits for Packed Quadword ValuesvplzcntqN=vplzcntqN?vplzcntqNAvplzcntqNvplzcntqNvplzcntqNvplzcntqN=vplzcntqNvplzcntqN?vplzcntqNvplzcntqNAvplzcntqN	vcvtsh2ssJConvert Scalar Half-Precision FP Value to Scalar Double-Precision FP Value	vcvtsh2ssR	vcvtsh2ssR$	vcvtsh2ssR	vcvtsh2ssR$	vcvtsh2ssRR	vcvtsh2ssRRvfnmadd132psLFused Negative Multiply-Add of Packed Single-Precision Floating-Point Valuesvfnmadd132psH9vfnmadd132psHvfnmadd132psH:vfnmadd132psHvfnmadd132psH;vfnmadd132psHvfnmadd132psH9vfnmadd132ps#vfnmadd132psHvfnmadd132ps#/vfnmadd132psH:vfnmadd132ps#vfnmadd132psHvfnmadd132ps#2vfnmadd132psH;vfnmadd132psHvfnmadd132psHQvfnmadd132psHQ	vreducessRPerform Reduction Transformation on a Scalar Single-Precision Floating-Point Value	vreducessJ	vreducessJ'	vreducessJ	vreducessJ'vpshrdw4Concatenate and Shift Packed Word Data Right LogicalvpshrdwKvpshrdwK/vpshrdwKvpshrdwK2vpshrdwUvpshrdwU5vpshrdwKvpshrdwK/vpshrdwKvpshrdwK2vpshrdwUvpshrdwU5vpmovq2m7Move Signs of Packed Quadword Integers to Mask Registervpmovq2mJvpmovq2mJvpmovq2mJvfmsub213pdHFused Multiply-Subtract of Packed Double-Precision Floating-Point Valuesvfmsub213pdH=vfmsub213pdHvfmsub213pdH?vfmsub213pdHvfmsub213pdHAvfmsub213pdHvfmsub213pdH=vfmsub213pd#vfmsub213pdHvfmsub213pd#/vfmsub213pdH?vfmsub213pd#vfmsub213pdHvfmsub213pd#2vfmsub213pdHAvfmsub213pdHvfmsub213pdHQvfmsub213pdHQsqrtssCCompute Square Root of Scalar Single-Precision Floating-Point ValuesqrtssSQRTSSsqrtssSQRTSS'vpermt2d?Full Permute of Doublewords From Two Tables Overwriting a Tablevpermt2dH9vpermt2dHvpermt2dH:vpermt2dHvpermt2dH;vpermt2dHvpermt2dH9vpermt2dHvpermt2dH:vpermt2dHvpermt2dH;vpermt2dHvpdpbuudJPacked Dot Product of Unsigned-by-Unsinged Byte subvectors into DoublewordvpdpbuudXvpdpbuudX/vpdpbuudXvpdpbuudX2	punpcklbw0Unpack and Interleave Low-Order Bytes into Words	punpcklbw	punpcklbw'	punpcklbw	punpcklbw/vphaddswAPacked Horizontal Add Signed Word Integers with Signed Saturationvphaddsw vphaddsw /vphaddsw!vphaddsw!2vpinsrdInsert Doublewordvpinsrd vpinsrdJvpinsrd 'vpinsrdJ'vxorpd>Bitwise Logical XOR for Double-Precision Floating-Point ValuesvxorpdJ=vxorpdJvxorpdJ?vxorpdJvxorpdJAvxorpdJvxorpdJ=vxorpd vxorpdJvxorpd /vxorpdJ?vxorpd vxorpdJvxorpd 2vxorpdJAvxorpdJvpslld)Shift Packed Doubleword Data Left LogicalvpslldH9vpslldH:vpslldH;vpslldHvpslldHvpslldH/vpslldHvpslldHvpslldH/vpslldHvpslldHvpslldH/vpslldH9vpslld vpslldHvpslld vpslldHvpslld /vpslldH/vpslldH:vpslld!vpslldHvpslld!vpslldHvpslld!/vpslldH/vpslldH;vpslldHvpslldHvpslldH/addpd1Add Packed Double-Precision Floating-Point ValuesaddpdADDPDaddpdADDPD/vprolvq$Variable Rotate Packed Quadword LeftvprolvqH=vprolvqHvprolvqH?vprolvqHvprolvqHAvprolvqHvprolvqH=vprolvqHvprolvqH?vprolvqHvprolvqHAvprolvqHkmovbMove 8-bit MaskkmovbJkmovbJkmovbJ#kmovbJkmovbJ#movddup Move One Double-FP and Duplicatemovddupmovddup+vpmovswbEDown Convert Packed Word Values to Byte Values with Signed SaturationvpmovswbIvpmovswbI,vpmovswbIvpmovswbI0vpmovswbIvpmovswbI3vpmovswbIvpmovswbIvpmovswbIvpmovswbI+vpmovswbI/vpmovswbI2vpsignbPacked Sign of Byte Integersvpsignb vpsignb /vpsignb!vpsignb!2movsldup'Move Packed Single-FP Low and Duplicatemovsldupmovsldup/
vcvttph2uwjConvert with Truncation Packed Half-Precision Floating-Point Values to Packed Unsigned Word Integer Values
vcvttph2uwK<
vcvttph2uwK>
vcvttph2uwR@
vcvttph2uwK
vcvttph2uwK
vcvttph2uwR
vcvttph2uwK<
vcvttph2uwK
vcvttph2uwK>
vcvttph2uwK
vcvttph2uwR@
vcvttph2uwR
vcvttph2uwRR
vcvttph2uwRRshld#Integer Double Precision Shift Leftshldwshldwshldlshldlshldw$shldw$shldl'shldl'vorps<Bitwise Logical OR of Single-Precision Floating-Point ValuesvorpsJ9vorpsJvorpsJ:vorpsJvorpsJ;vorpsJvorpsJ9vorps vorpsJvorps /vorpsJ:vorps vorpsJvorps 2vorpsJ;vorpsJvfnmadd132pdLFused Negative Multiply-Add of Packed Double-Precision Floating-Point Valuesvfnmadd132pdH=vfnmadd132pdHvfnmadd132pdH?vfnmadd132pdHvfnmadd132pdHAvfnmadd132pdHvfnmadd132pdH=vfnmadd132pd#vfnmadd132pdHvfnmadd132pd#/vfnmadd132pdH?vfnmadd132pd#vfnmadd132pdHvfnmadd132pd#2vfnmadd132pdHAvfnmadd132pdHvfnmadd132pdHQvfnmadd132pdHQrorRotate RightrorbRORB	rorbRORB	rorbRORB	rorwRORWrorwRORWrorwRORWrorlRORLrorlRORLrorlRORLrorbRORB#rorbRORB#rorbRORB#rorwRORW$rorwRORW$rorwRORW$rorlRORL'rorlRORL'rorlRORL'movdquMove Unaligned Double QuadwordmovdquMOVOUmovdquMOVOU/movdquMOVOU/pblendwBlend Packed Wordspblendwpblendw/kmovdMove 32-bit MaskkmovdIkmovdIkmovdI'kmovdIkmovdI'jnbe0Jump if not below or equal (CF == 0 and ZF == 0)jnbeJHINjnbeJHIOkshiftrbShift Right 8-bit MaskskshiftrbJvsm4key4(Perform Four Rounds of SM4 Key Expansionvsm4key4vsm4key4/vsm4key4vsm4key42vpcomub%Compare Packed Unsigned Byte Integersvpcomub"vpcomub"/vfmaddsub213psXFused Multiply-Alternating Add/Subtract of Packed Single-Precision Floating-Point Valuesvfmaddsub213psH9vfmaddsub213psHvfmaddsub213psH:vfmaddsub213psHvfmaddsub213psH;vfmaddsub213psHvfmaddsub213psH9vfmaddsub213ps#vfmaddsub213psHvfmaddsub213ps#/vfmaddsub213psH:vfmaddsub213ps#vfmaddsub213psHvfmaddsub213ps#2vfmaddsub213psH;vfmaddsub213psHvfmaddsub213psHQvfmaddsub213psHQvpminsw&Minimum of Packed Signed Word IntegersvpminswIvpminswI/vpminswIvpminswI2vpminswIvpminswI5vpminsw vpminswIvpminsw /vpminswI/vpminsw!vpminswIvpminsw!2vpminswI2vpminswIvpminswI5mwaitMonitor WaitmwaitDpsubusw?Subtract Packed Unsigned Word Integers with Unsigned SaturationpsubuswPSUBUSWpsubuswPSUBUSW+psubuswPSUBUSWpsubuswPSUBUSW/
vgatherpf0dpdoSparse Prefetch Packed Double-Precision Floating-Point Data Values with Signed Doubleword Indices Using T0 Hint
vgatherpf0dpdLGptestPacked Logical Compareptestptest/vpmaxsd,Maximum of Packed Signed Doubleword IntegersvpmaxsdH9vpmaxsdHvpmaxsdH:vpmaxsdHvpmaxsdH;vpmaxsdHvpmaxsdH9vpmaxsd vpmaxsdHvpmaxsd /vpmaxsdH:vpmaxsd!vpmaxsdHvpmaxsd!2vpmaxsdH;vpmaxsdHvpbroadcastbBroadcast Byte IntegervpbroadcastbIvpbroadcastbIvpbroadcastbIvpbroadcastbIvpbroadcastbIvpbroadcastbIvpbroadcastbI#vpbroadcastbI#vpbroadcastbI#vpbroadcastbIvpbroadcastb!vpbroadcastbIvpbroadcastb!#vpbroadcastbI#vpbroadcastbIvpbroadcastb!vpbroadcastbIvpbroadcastb!#vpbroadcastbI#vpbroadcastbIvpbroadcastbIvpbroadcastbI#kxord Bitwise Logical XOR 32-bit MaskskxordIpauseSpin Loop HintpausePAUSEsubpd6Subtract Packed Double-Precision Floating-Point ValuessubpdSUBPDsubpdSUBPD/	vfnmsubsdQFused Negative Multiply-Subtract of Scalar Double-Precision Floating-Point Values	vfnmsubsd$	vfnmsubsd$+	vfnmsubsd$+vmaxsd;Return Maximum Scalar Double-Precision Floating-Point ValuevmaxsdHvmaxsdH+vmaxsd vmaxsdHvmaxsd +vmaxsdH+vmaxsdHRvmaxsdHRvpmovsdwKDown Convert Packed Doubleword Values to Word Values with Signed SaturationvpmovsdwHvpmovsdwH,vpmovsdwHvpmovsdwH0vpmovsdwHvpmovsdwH3vpmovsdwHvpmovsdwHvpmovsdwHvpmovsdwH+vpmovsdwH/vpmovsdwH2vfnmadd213ssLFused Negative Multiply-Add of Scalar Single-Precision Floating-Point Valuesvfnmadd213ssHvfnmadd213ssH'vfnmadd213ss#vfnmadd213ssHvfnmadd213ss#'vfnmadd213ssH'vfnmadd213ssHQvfnmadd213ssHQvcmppd5Compare Packed Double-Precision Floating-Point ValuesvcmppdH=vcmppdH=vcmppdHvcmppdHvcmppdH?vcmppdH?vcmppdHvcmppdHvcmppdHAvcmppdHAvcmppdHvcmppdHvcmppd vcmppd /vcmppd vcmppd 2vcmppdHRvcmppdHR	vscalefph[Scale Packed Half-Precision Floating-Point Values With Half-Precision Floating-Point Values	vscalefphK<	vscalefphK	vscalefphK>	vscalefphK	vscalefphR@	vscalefphR	vscalefphK<	vscalefphK	vscalefphK>	vscalefphK	vscalefphR@	vscalefphR	vscalefphRQ	vscalefphRQ
vshufi64x2.Shuffle 128-Bit Packed Quadword Integer Values
vshufi64x2H?
vshufi64x2H
vshufi64x2HA
vshufi64x2H
vshufi64x2H?
vshufi64x2H
vshufi64x2HA
vshufi64x2Hroundsd3Round Scalar Double Precision Floating-Point Valuesroundsdroundsd+vpbroadcastqBroadcast Quadword IntegervpbroadcastqHvpbroadcastqHvpbroadcastqHvpbroadcastqH+vpbroadcastqH+vpbroadcastqH+vpbroadcastq!vpbroadcastqHvpbroadcastq!+vpbroadcastqH+vpbroadcastq!vpbroadcastqHvpbroadcastq!+vpbroadcastqH+vpbroadcastqHvpbroadcastqH+	vfnmsubpdQFused Negative Multiply-Subtract of Packed Double-Precision Floating-Point Values	vfnmsubpd$	vfnmsubpd$/	vfnmsubpd$/	vfnmsubpd$	vfnmsubpd$2	vfnmsubpd$2	vphadduwd1Packed Horizontal Add Unsigned Word to Doubleword	vphadduwd"	vphadduwd"/psubsb;Subtract Packed Signed Byte Integers with Signed SaturationpsubsbPSUBSBpsubsbPSUBSB+psubsbPSUBSBpsubsbPSUBSB/cvtpi2pdBConvert Packed Dword Integers to Packed Double-Precision FP Valuescvtpi2pdCVTPL2PDcvtpi2pdCVTPL2PD+pmovsxwqBMove Packed Word Integers to Quadword Integers with Sign Extensionpmovsxwqpmovsxwq'	punpckldq:Unpack and Interleave Low-Order Doublewords into Quadwords	punpckldq	punpckldq'	punpckldq	punpckldq/adcx9Unsigned Integer Addition of Two Operands with Carry Flagadcxl7adcxl7'pmulld?Multiply Packed Signed Doubleword Integers and Store Low Resultpmulldpmulld/vinsertf128#Insert Packed Floating-Point Valuesvinsertf128 vinsertf128 /	vpackssdw2Pack Doublewords into Words with Signed Saturation	vpackssdwI9	vpackssdwI	vpackssdwI:	vpackssdwI	vpackssdwI;	vpackssdwI	vpackssdwI9	vpackssdw 	vpackssdwI	vpackssdw /	vpackssdwI:	vpackssdw!	vpackssdwI	vpackssdw!2	vpackssdwI;	vpackssdwIvpermt2q=Full Permute of Quadwords From Two Tables Overwriting a Tablevpermt2qH=vpermt2qHvpermt2qH?vpermt2qHvpermt2qHAvpermt2qHvpermt2qH=vpermt2qHvpermt2qH?vpermt2qHvpermt2qHAvpermt2qHbtrBit Test and ResetbtrwBTRWbtrwBTRWbtrlBTRLbtrlBTRLbtrwBTRW$btrwBTRW$btrlBTRL'btrlBTRL'vrcp28pstApproximation to the Reciprocal of Packed Single-Precision Floating-Point Values with Less Than 2^-28 Relative Errorvrcp28psM;vrcp28psMvrcp28psM;vrcp28psMvrcp28psMRvrcp28psMR	pcmpistri4Packed Compare Implicit Length Strings, Return Index	pcmpistri	pcmpistri/
vcvtudq2phZConvert Packed Unsigned Doubleword Integers to Packed Half-Precision Floating-Point Valuesvcvtudq2phxK9vcvtudq2phyK:
vcvtudq2phR;vcvtudq2phxKvcvtudq2phyK
vcvtudq2phRvcvtudq2phxK9vcvtudq2phyK:vcvtudq2phxKvcvtudq2phyK
vcvtudq2phR;
vcvtudq2phR
vcvtudq2phRQ
vcvtudq2phRQvpmovm2w4Expand Bits of Mask Register to Packed Word Integersvpmovm2wIvpmovm2wIvpmovm2wIpackusdw4Pack Doublewords into Words with Unsigned Saturationpackusdwpackusdw/aamASCII Adjust AX After MultiplyaamAAMaamAAMvphaddd(Packed Horizontal Add Doubleword Integervphaddd vphaddd /vphaddd!vphaddd!2movntssKStore Scalar Single-Precision Floating-Point Values Using Non-Temporal Hintmovntss'	vpmacssddQPacked Multiply Accumulate with Saturation Signed Doubleword to Signed Doubleword	vpmacssdd"	vpmacssdd"/	pcmpestri4Packed Compare Explicit Length Strings, Return Index
pcmpestril
pcmpestril/	vunpckhpdHUnpack and Interleave High Packed Double-Precision Floating-Point Values	vunpckhpdH=	vunpckhpdH	vunpckhpdH?	vunpckhpdH	vunpckhpdHA	vunpckhpdH	vunpckhpdH=	vunpckhpd 	vunpckhpdH	vunpckhpd /	vunpckhpdH?	vunpckhpd 	vunpckhpdH	vunpckhpd 2	vunpckhpdHA	vunpckhpdHpmullw9Multiply Packed Signed Word Integers and Store Low Resultpmullwpmullw+pmullwpmullw/rdpru$Read Processor Register in User moderdpru.vfnmadd231phJFused Negative Multiply-Add of Packed Half-Precision Floating-Point Valuesvfnmadd231phK<vfnmadd231phKvfnmadd231phK>vfnmadd231phKvfnmadd231phR@vfnmadd231phRvfnmadd231phK<vfnmadd231phKvfnmadd231phK>vfnmadd231phKvfnmadd231phR@vfnmadd231phRvfnmadd231phRQvfnmadd231phRQ	vpblendmq.Blend Quadword Vectors Using an OpMask Control	vpblendmqH=	vpblendmqH	vpblendmqH?	vpblendmqH	vpblendmqHA	vpblendmqH	vpblendmqH=	vpblendmqH	vpblendmqH?	vpblendmqH	vpblendmqHA	vpblendmqH	vpdpwssdsVPacked Dot Product of Signed-by-Signed Word subvectors into Doubleword with Saturation	vpdpwssdsK9	vpdpwssdsK	vpdpwssdsK:	vpdpwssdsK	vpdpwssdsV;	vpdpwssdsV	vpdpwssdsK9	vpdpwssdsW	vpdpwssdsK	vpdpwssdsW/	vpdpwssdsK:	vpdpwssdsW	vpdpwssdsK	vpdpwssdsW2	vpdpwssdsV;	vpdpwssdsV
vpunpckldq:Unpack and Interleave Low-Order Doublewords into Quadwords
vpunpckldqH9
vpunpckldqH
vpunpckldqH:
vpunpckldqH
vpunpckldqH;
vpunpckldqH
vpunpckldqH9
vpunpckldq 
vpunpckldqH
vpunpckldq /
vpunpckldqH:
vpunpckldq!
vpunpckldqH
vpunpckldq!2
vpunpckldqH;
vpunpckldqH	vscalefsd_Scale Scalar Double-Precision Floating-Point Value With a Double-Precision Floating-Point Value	vscalefsdH	vscalefsdH+	vscalefsdH	vscalefsdH+	vscalefsdHQ	vscalefsdHQ
vmaskmovps>Conditional Move Packed Single-Precision Floating-Point Values
vmaskmovps /
vmaskmovps 2
vmaskmovps /
vmaskmovps 2vpmultishiftqb3Select Packed Unaligned Bytes from Quadword SourcesvpmultishiftqbK=vpmultishiftqbKvpmultishiftqbK?vpmultishiftqbKvpmultishiftqbTAvpmultishiftqbTvpmultishiftqbK=vpmultishiftqbKvpmultishiftqbK?vpmultishiftqbKvpmultishiftqbTAvpmultishiftqbTvfnmadd231pdLFused Negative Multiply-Add of Packed Double-Precision Floating-Point Valuesvfnmadd231pdH=vfnmadd231pdHvfnmadd231pdH?vfnmadd231pdHvfnmadd231pdHAvfnmadd231pdHvfnmadd231pdH=vfnmadd231pd#vfnmadd231pdHvfnmadd231pd#/vfnmadd231pdH?vfnmadd231pd#vfnmadd231pdHvfnmadd231pd#2vfnmadd231pdHAvfnmadd231pdHvfnmadd231pdHQvfnmadd231pdHQsetpo Set byte if parity odd (PF == 0)setpoSETPC	setpoSETPC#vpextrdExtract Doublewordvpextrd vpextrdJvpextrd 'vpextrdJ'vpermqPermute Quadword IntegersvpermqH?vpermqHAvpermqH?vpermqHvpermqHvpermqHAvpermqHvpermqHvpermqH?vpermqH?vpermq!vpermqHvpermqHvpermq!2vpermqHAvpermqHAvpermqHvpermqHvdivph2Divide Packed Half-Precision Floating-Point ValuesvdivphK<vdivphKvdivphK>vdivphKvdivphR@vdivphRvdivphK<vdivphKvdivphK>vdivphKvdivphR@vdivphRvdivphRQvdivphRQvfnmsub231ssQFused Negative Multiply-Subtract of Scalar Single-Precision Floating-Point Valuesvfnmsub231ssHvfnmsub231ssH'vfnmsub231ss#vfnmsub231ssHvfnmsub231ss#'vfnmsub231ssH'vfnmsub231ssHQvfnmsub231ssHQvpextrwExtract Wordvpextrw vpextrwIvpextrw $vpextrwI$vfmsub213ssHFused Multiply-Subtract of Scalar Single-Precision Floating-Point Valuesvfmsub213ssHvfmsub213ssH'vfmsub213ss#vfmsub213ssHvfmsub213ss#'vfmsub213ssH'vfmsub213ssHQvfmsub213ssHQ
vpgatherqd=Gather Packed Doubleword Values Using Signed Quadword Indices
vpgatherqdHD
vpgatherqdHH
vpgatherqdHL
vpgatherqd!D
vpgatherqd!HvfmulcphKFused Fused Multiply of Complex Packed Half-Precision Floating-Point ValuesvfmulcphK9vfmulcphKvfmulcphK:vfmulcphKvfmulcphR;vfmulcphRvfmulcphK9vfmulcphKvfmulcphK:vfmulcphKvfmulcphR;vfmulcphRvfmulcphRQvfmulcphRQpandnPacked Bitwise Logical AND NOTpandnpandn+pandnpandn/vldmxcsrLoad MXCSR Registervldmxcsr 'movntq)Store of Quadword Using Non-Temporal Hintmovntq
+kandq Bitwise Logical AND 64-bit MaskskandqIpmovzxdqHMove Packed Doubleword Integers to Quadword Integers with Zero Extensionpmovzxdqpmovzxdq+vpinsrwInsert Wordvpinsrw vpinsrwIvpinsrw $vpinsrwI$vpmacswd;Packed Multiply Accumulate Signed Word to Signed Doublewordvpmacswd"vpmacswd"/ktestb"Bit Test 8-bit Masks and Set FlagsktestbJbtsBit Test and SetbtswBTSWbtswBTSWbtslBTSLbtslBTSLbtswBTSW$btswBTSW$btslBTSL'btslBTSL'vpermi2b;Full Permute of Bytes From Two Tables Overwriting the Indexvpermi2bTvpermi2bT/vpermi2bTvpermi2bT2vpermi2bTvpermi2bT5vpermi2bTvpermi2bT/vpermi2bTvpermi2bT2vpermi2bTvpermi2bT5	vscalefss_Scale Scalar Single-Precision Floating-Point Value With a Single-Precision Floating-Point Value	vscalefssH	vscalefssH'	vscalefssH	vscalefssH'	vscalefssHQ	vscalefssHQvdivps4Divide Packed Single-Precision Floating-Point ValuesvdivpsH9vdivpsHvdivpsH:vdivpsHvdivpsH;vdivpsHvdivpsH9vdivps vdivpsHvdivps /vdivpsH:vdivps vdivpsHvdivps 2vdivpsH;vdivpsHvdivpsHQvdivpsHQpshuflwShuffle Packed Low WordspshuflwPSHUFLWpshuflwPSHUFLW/movdqaMove Aligned Double QuadwordmovdqaMOVOmovdqaMOVO/movdqaMOVO/
vpcmpestri4Packed Compare Explicit Length Strings, Return Indexvpcmpestril vpcmpestril /addAddaddbADDBaddbADDB	addbADDB		addbADDB	#addwADDWaddwADDWaddwADDWaddwADDWaddwADDW$addlADDLaddlADDLaddlADDLaddlADDLaddlADDL'addbADDB#addbADDB#	addwADDW$addwADDW$addwADDW$addlADDL'addlADDL'addlADDL'vfnmsub213ssQFused Negative Multiply-Subtract of Scalar Single-Precision Floating-Point Valuesvfnmsub213ssHvfnmsub213ssH'vfnmsub213ss#vfnmsub213ssHvfnmsub213ss#'vfnmsub213ssH'vfnmsub213ssHQvfnmsub213ssHQvfnmsub132psQFused Negative Multiply-Subtract of Packed Single-Precision Floating-Point Valuesvfnmsub132psH9vfnmsub132psHvfnmsub132psH:vfnmsub132psHvfnmsub132psH;vfnmsub132psHvfnmsub132psH9vfnmsub132ps#vfnmsub132psHvfnmsub132ps#/vfnmsub132psH:vfnmsub132ps#vfnmsub132psHvfnmsub132ps#2vfnmsub132psH;vfnmsub132psHvfnmsub132psHQvfnmsub132psHQknotwNOT 16-bit Mask RegisterknotwHvlddquLoad Unaligned Integer 128 Bitsvlddqu /vlddqu 2	vmovsldup'Move Packed Single-FP Low and Duplicate	vmovsldupH	vmovsldupH	vmovsldupH	vmovsldupH/	vmovsldupH2	vmovsldupH5	vmovsldup 	vmovsldupH	vmovsldup /	vmovsldupH/	vmovsldup 	vmovsldupH	vmovsldup 2	vmovsldupH2	vmovsldupH	vmovsldupH5movhpd6Move High Packed Double-Precision Floating-Point ValuemovhpdMOVHPD+movhpdMOVHPD+vaddph/Add Packed Half-Precision Floating-Point ValuesvaddphK<vaddphKvaddphK>vaddphKvaddphR@vaddphRvaddphK<vaddphKvaddphK>vaddphKvaddphR@vaddphRvaddphRQvaddphRQcmovnbMove if not below (CF == 0)cmovnbwcmovnbw$cmovnblcmovnbl'kordBitwise Logical OR 32-bit MaskskordI	sha1rnds4%Perform Four Rounds of SHA1 Operation	sha1rnds4(	sha1rnds4(/vfnmsub132sdQFused Negative Multiply-Subtract of Scalar Double-Precision Floating-Point Valuesvfnmsub132sdHvfnmsub132sdH+vfnmsub132sd#vfnmsub132sdHvfnmsub132sd#+vfnmsub132sdH+vfnmsub132sdHQvfnmsub132sdHQaaddAtomically ADDaadd'	vdbpsadbw>Double Block Packed Sum-Absolute-Differences on Unsigned Bytes	vdbpsadbwI	vdbpsadbwI/	vdbpsadbwI	vdbpsadbwI2	vdbpsadbwI	vdbpsadbwI5	vdbpsadbwI	vdbpsadbwI/	vdbpsadbwI	vdbpsadbwI2	vdbpsadbwI	vdbpsadbwI5sha1msg1NPerform an Intermediate Calculation for the Next Four SHA1 Message Doublewordssha1msg1(sha1msg1(/vsubps6Subtract Packed Single-Precision Floating-Point ValuesvsubpsH9vsubpsHvsubpsH:vsubpsHvsubpsH;vsubpsHvsubpsH9vsubps vsubpsHvsubps /vsubpsH:vsubps vsubpsHvsubps 2vsubpsH;vsubpsHvsubpsHQvsubpsHQandnpdHBitwise Logical AND NOT of Packed Double-Precision Floating-Point ValuesandnpdANDNPDandnpdANDNPD/pinsrdInsert DoublewordpinsrdPINSRDpinsrdPINSRD'vfmaddpsCFused Multiply-Add of Packed Single-Precision Floating-Point Valuesvfmaddps$vfmaddps$/vfmaddps$/vfmaddps$vfmaddps$2vfmaddps$2vcvtph2wQConvert Packed Half-Precision Floating-Point Values to Packed Word Integer Valuesvcvtph2wK<vcvtph2wK>vcvtph2wR@vcvtph2wKvcvtph2wKvcvtph2wRvcvtph2wK<vcvtph2wKvcvtph2wK>vcvtph2wKvcvtph2wR@vcvtph2wRvcvtph2wRQvcvtph2wRQ
vfpclasssd:Test Class of Scalar Double-Precision Floating-Point Value
vfpclasssdJ
vfpclasssdJ
vfpclasssdJ+
vfpclasssdJ+
vgatherpf1qpsmSparse Prefetch Packed Single-Precision Floating-Point Data Values with Signed Quadword Indices Using T1 Hint
vgatherpf1qpsLMsha1msg2FPerform a Final Calculation for the Next Four SHA1 Message Doublewordssha1msg2(sha1msg2(/andLogical ANDandbANDBandbANDB	andbANDB		andbANDB	#andwANDWandwANDWandwANDWandwANDWandwANDW$andlANDLandlANDLandlANDLandlANDLandlANDL'andbANDB#andbANDB#	andwANDW$andwANDW$andwANDW$andlANDL'andlANDL'andlANDL'jncJump if not carry (CF == 0)jncJCCNjncJCCOpxor#Packed Bitwise Logical Exclusive ORpxorPXORpxorPXOR+pxorPXORpxorPXOR/unpcklpdGUnpack and Interleave Low Packed Double-Precision Floating-Point ValuesunpcklpdUNPCKLPDunpcklpdUNPCKLPD/paddsw6Add Packed Signed Word Integers with Signed Saturationpaddswpaddsw+paddswpaddsw/mcommit
Memory COMMITmcommit>aandAtomically ANDaand'vfnmsub213sdQFused Negative Multiply-Subtract of Scalar Double-Precision Floating-Point Valuesvfnmsub213sdHvfnmsub213sdH+vfnmsub213sd#vfnmsub213sdHvfnmsub213sd#+vfnmsub213sdH+vfnmsub213sdHQvfnmsub213sdHQpfrsqrt:Packed Floating-Point Reciprocal Square Root Approximationpfrsqrtpfrsqrt+vmaxpd<Return Maximum Packed Double-Precision Floating-Point ValuesvmaxpdH=vmaxpdHvmaxpdH?vmaxpdHvmaxpdHAvmaxpdHvmaxpdH=vmaxpd vmaxpdHvmaxpd /vmaxpdH?vmaxpd vmaxpdHvmaxpd 2vmaxpdHAvmaxpdHvmaxpdHRvmaxpdHRcvtpd2piBConvert Packed Double-Precision FP Values to Packed Dword Integerscvtpd2piCVTPD2PLcvtpd2piCVTPD2PL/vfmadd231phAFused Multiply-Add of Packed Half-Precision Floating-Point Valuesvfmadd231phK<vfmadd231phKvfmadd231phK>vfmadd231phKvfmadd231phR@vfmadd231phRvfmadd231phK<vfmadd231phKvfmadd231phK>vfmadd231phKvfmadd231phR@vfmadd231phRvfmadd231phRQvfmadd231phRQ	vsm3rnds2#Perform Two Rounds of SM3 Operation	vsm3rnds2	vsm3rnds2/cmovnpMove if not parity (PF == 0)cmovnpwcmovnpw$cmovnplcmovnpl'vpsravq4Variable Shift Packed Quadword Data Right ArithmeticvpsravqH=vpsravqHvpsravqH?vpsravqHvpsravqHAvpsravqHvpsravqH=vpsravqHvpsravqH?vpsravqHvpsravqHAvpsravqH
vcvttps2dqRConvert with Truncation Packed Single-Precision FP Values to Packed Dword Integers
vcvttps2dqH9
vcvttps2dqH:
vcvttps2dqH;
vcvttps2dqH
vcvttps2dqH
vcvttps2dqH
vcvttps2dqH9
vcvttps2dq 
vcvttps2dqH
vcvttps2dq /
vcvttps2dqH:
vcvttps2dq 
vcvttps2dqH
vcvttps2dq 2
vcvttps2dqH;
vcvttps2dqH
vcvttps2dqHR
vcvttps2dqHR
vcvtuqq2phXConvert Packed Unsigned Quadword Integers to Packed Half-Precision Floating-Point Valuesvcvtuqq2phxK=vcvtuqq2phyK?vcvtuqq2phzRAvcvtuqq2phxKvcvtuqq2phyKvcvtuqq2phzRvcvtuqq2phxK=vcvtuqq2phyK?vcvtuqq2phzRAvcvtuqq2phxKvcvtuqq2phyKvcvtuqq2phzRvcvtuqq2phzRQvcvtuqq2phzRQvmovupd<Move Unaligned Packed Double-Precision Floating-Point ValuesvmovupdH0vmovupdHvmovupdH3vmovupdHvmovupdH6vmovupdHvmovupdH/vmovupdH2vmovupdH5vmovupd vmovupdHvmovupd /vmovupdH/vmovupd vmovupdHvmovupd 2vmovupdH2vmovupdHvmovupdH5vmovupd /vmovupdH/vmovupd 2vmovupdH2vmovupdH5	vcvtps2pdNConvert Packed Single-Precision FP Values to Packed Double-Precision FP Values	vcvtps2pdH8	vcvtps2pdK9	vcvtps2pdH:	vcvtps2pdH	vcvtps2pdK	vcvtps2pdH	vcvtps2pdH8	vcvtps2pd 	vcvtps2pdH	vcvtps2pd +	vcvtps2pdK9	vcvtps2pd 	vcvtps2pdK	vcvtps2pd /	vcvtps2pdH:	vcvtps2pdH	vcvtps2pdHR	vcvtps2pdHRsetng-Set byte if not greater (ZF == 1 or SF != OF)setngSETLE	setngSETLE#setncSet byte if not carry (CF == 0)setncSETCC	setncSETCC#vfmadd231psCFused Multiply-Add of Packed Single-Precision Floating-Point Valuesvfmadd231psH9vfmadd231psHvfmadd231psH:vfmadd231psHvfmadd231psH;vfmadd231psHvfmadd231psH9vfmadd231ps#vfmadd231psHvfmadd231ps#/vfmadd231psH:vfmadd231ps#vfmadd231psHvfmadd231ps#2vfmadd231psH;vfmadd231psHvfmadd231psHQvfmadd231psHQpextrbExtract Bytepextrbpextrb#vmovaps:Move Aligned Packed Single-Precision Floating-Point ValuesvmovapsH0vmovapsHvmovapsH3vmovapsHvmovapsH6vmovapsHvmovapsH/vmovapsH2vmovapsH5vmovaps vmovapsHvmovaps /vmovapsH/vmovaps vmovapsHvmovaps 2vmovapsH2vmovapsHvmovapsH5vmovaps /vmovapsH/vmovaps 2vmovapsH2vmovapsH5vrangepsXRange Restriction Calculation For Packed Pairs of Single-Precision Floating-Point ValuesvrangepsJ9vrangepsJvrangepsJ:vrangepsJvrangepsJ;vrangepsJvrangepsJ9vrangepsJvrangepsJ:vrangepsJvrangepsJ;vrangepsJvrangepsJRvrangepsJR	vblendmpsLBlend Packed Single-Precision Floating-Point Vectors Using an OpMask Control	vblendmpsH9	vblendmpsH	vblendmpsH:	vblendmpsH	vblendmpsH;	vblendmpsH	vblendmpsH9	vblendmpsH	vblendmpsH:	vblendmpsH	vblendmpsH;	vblendmpsHvpshrdq8Concatenate and Shift Packed Quadword Data Right LogicalvpshrdqK=vpshrdqKvpshrdqK?vpshrdqKvpshrdqUAvpshrdqUvpshrdqK=vpshrdqKvpshrdqK?vpshrdqKvpshrdqUAvpshrdqUvprolvd&Variable Rotate Packed Doubleword LeftvprolvdH9vprolvdHvprolvdH:vprolvdHvprolvdH;vprolvdHvprolvdH9vprolvdHvprolvdH:vprolvdHvprolvdH;vprolvdHvminss;Return Minimum Scalar Single-Precision Floating-Point ValuevminssHvminssH'vminss vminssHvminss 'vminssH'vminssHRvminssHRvcmpps5Compare Packed Single-Precision Floating-Point ValuesvcmppsH9vcmppsH9vcmppsHvcmppsHvcmppsH:vcmppsH:vcmppsHvcmppsHvcmppsH;vcmppsH;vcmppsHvcmppsHvcmpps vcmpps /vcmpps vcmpps 2vcmppsHRvcmppsHR
vpternlogq4Bitwise Ternary Logical Operation on Quadword Values
vpternlogqH=
vpternlogqH
vpternlogqH?
vpternlogqH
vpternlogqHA
vpternlogqH
vpternlogqH=
vpternlogqH
vpternlogqH?
vpternlogqH
vpternlogqHA
vpternlogqHpminsb&Minimum of Packed Signed Byte Integerspminsbpminsb/setcSet byte if carry (CF == 1)setcSETCS	setcSETCS#	vcvtpd2psNConvert Packed Double-Precision FP Values to Packed Single-Precision FP Values
vcvtpd2psxH=
vcvtpd2psyH?	vcvtpd2psHA
vcvtpd2psxH
vcvtpd2psyH	vcvtpd2psH
vcvtpd2psxH=
vcvtpd2psyH?
vcvtpd2psx 
vcvtpd2psxH
vcvtpd2psy 
vcvtpd2psyH
vcvtpd2psx /
vcvtpd2psy 2	vcvtpd2psHA	vcvtpd2psH	vcvtpd2psHQ	vcvtpd2psHQaxorAtomically XORaxor'insertqInsert FieldinsertqinsertqphsubswFPacked Horizontal Subtract Signed Word Integers with Signed Saturationphsubswphsubsw+phsubswphsubsw/
maskmovdqu'Store Selected Bytes of Double Quadword
maskmovdqusetbSet byte if below (CF == 1)setbSETCS	setbSETCS#vaddpd1Add Packed Double-Precision Floating-Point ValuesvaddpdH=vaddpdHvaddpdH?vaddpdHvaddpdHAvaddpdHvaddpdH=vaddpd vaddpdHvaddpd /vaddpdH?vaddpd vaddpdHvaddpd 2vaddpdHAvaddpdHvaddpdHQvaddpdHQ	vcvtpd2qqQConvert Packed Double-Precision Floating-Point Values to Packed Quadword Integers	vcvtpd2qqJ=	vcvtpd2qqJ?	vcvtpd2qqJA	vcvtpd2qqJ	vcvtpd2qqJ	vcvtpd2qqJ	vcvtpd2qqJ=	vcvtpd2qqJ	vcvtpd2qqJ?	vcvtpd2qqJ	vcvtpd2qqJA	vcvtpd2qqJ	vcvtpd2qqJQ	vcvtpd2qqJQ
vcvtph2udq`Convert Packed Half-Precision Floating-Point Values to Packed Unsigned Doubleword Integer Values
vcvtph2udqK.
vcvtph2udqK<
vcvtph2udqR>
vcvtph2udqK
vcvtph2udqK
vcvtph2udqR
vcvtph2udqK.
vcvtph2udqK
vcvtph2udqK<
vcvtph2udqK
vcvtph2udqR>
vcvtph2udqR
vcvtph2udqRQ
vcvtph2udqRQ
vfpclassps;Test Class of Packed Single-Precision Floating-Point ValuesvfpclasspsxJ9vfpclasspsxJ9vfpclasspsyJ:vfpclasspsyJ:vfpclasspszJ;vfpclasspszJ;vfpclasspsxJvfpclasspsxJvfpclasspsyJvfpclasspsyJvfpclasspszJvfpclasspszJ
vgatherpf1qpdmSparse Prefetch Packed Double-Precision Floating-Point Data Values with Signed Quadword Indices Using T1 Hint
vgatherpf1qpdLMpmulhuw<Multiply Packed Unsigned Word Integers and Store High Resultpmulhuw
pmulhuw
+pmulhuwpmulhuw/setsSet byte if sign (SF == 1)setsSETMI	setsSETMI#
vfpclassph9Test Class of Packed Half-Precision Floating-Point ValuesvfpclassphxK<vfpclassphxK<vfpclassphyK>vfpclassphyK>vfpclassphzR@vfpclassphzR@vfpclassphxKvfpclassphxKvfpclassphyKvfpclassphyKvfpclassphzRvfpclassphzR
vcvttps2qqnConvert with Truncation Packed Single Precision Floating-Point Values to Packed Singed Quadword Integer Values
vcvttps2qqJ8
vcvttps2qqJ9
vcvttps2qqJ:
vcvttps2qqJ
vcvttps2qqJ
vcvttps2qqJ
vcvttps2qqJ8
vcvttps2qqJ
vcvttps2qqJ9
vcvttps2qqJ
vcvttps2qqJ:
vcvttps2qqJ
vcvttps2qqJR
vcvttps2qqJRvpconflictqUDetect Conflicts Within a Vector of Packed Quadword Values into Dense Memory/RegistervpconflictqN=vpconflictqN?vpconflictqNAvpconflictqNvpconflictqNvpconflictqNvpconflictqN=vpconflictqNvpconflictqN?vpconflictqNvpconflictqNAvpconflictqNleaLoad Effective AddressleawLEAW"lealLEAL"monitorMonitor a Linear Address RangemonitorDpsraw'Shift Packed Word Data Right Arithmeticpsrawpsrawpsraw+psrawpsrawpsraw/	vcvtdq2pdBConvert Packed Dword Integers to Packed Double-Precision FP Values	vcvtdq2pdH8	vcvtdq2pdH9	vcvtdq2pdH:	vcvtdq2pdH	vcvtdq2pdH	vcvtdq2pdH	vcvtdq2pdH8	vcvtdq2pd 	vcvtdq2pdH	vcvtdq2pd +	vcvtdq2pdH9	vcvtdq2pd 	vcvtdq2pdH	vcvtdq2pd /	vcvtdq2pdH:	vcvtdq2pdH	vcvtph2dq@Convert Packed Half-Precision FP Values to Packed Dword Integers	vcvtph2dqK.	vcvtph2dqK<	vcvtph2dqR>	vcvtph2dqK	vcvtph2dqK	vcvtph2dqR	vcvtph2dqK.	vcvtph2dqK	vcvtph2dqK<	vcvtph2dqK	vcvtph2dqR>	vcvtph2dqR	vcvtph2dqRQ	vcvtph2dqRQvpermi2w;Full Permute of Words From Two Tables Overwriting the Indexvpermi2wIvpermi2wI/vpermi2wIvpermi2wI2vpermi2wIvpermi2wI5vpermi2wIvpermi2wI/vpermi2wIvpermi2wI2vpermi2wIvpermi2wI5cmpCompare Two OperandscmpbCMPBcmpbCMPB	cmpbCMPB		cmpbCMPB	#cmpwCMPWcmpwCMPWcmpwCMPWcmpwCMPWcmpwCMPW$cmplCMPLcmplCMPLcmplCMPLcmplCMPLcmplCMPL'cmpbCMPB#cmpbCMPB#	cmpwCMPW$cmpwCMPW$cmpwCMPW$cmplCMPL'cmplCMPL'cmplCMPL'vpminud.Minimum of Packed Unsigned Doubleword IntegersvpminudH9vpminudHvpminudH:vpminudHvpminudH;vpminudHvpminudH9vpminud vpminudHvpminud /vpminudH:vpminud!vpminudHvpminud!2vpminudH;vpminudHpsubusb?Subtract Packed Unsigned Byte Integers with Unsigned SaturationpsubusbPSUBUSBpsubusbPSUBUSB+psubusbPSUBUSBpsubusbPSUBUSB/cmovcMove if carry (CF == 1)cmovcwcmovcw$cmovclcmovcl'lzcnt%Count the Number of Leading Zero Bitslzcntw3lzcntw3$lzcntl3lzcntl3'vhsubps$Packed Single-FP Horizontal Subtractvhsubps vhsubps /vhsubps vhsubps 2vpdpwsudHPacked Dot Product of Signed-by-Unsigned Word subvectors into DoublewordvpdpwsudYvpdpwsudY/vpdpwsudYvpdpwsudY2movmskps8Extract Packed Single-Precision Floating-Point Sign MaskmovmskpsMOVMSKPSvphaddwd6Packed Horizontal Add Signed Word to Signed Doublewordvphaddwd"vphaddwd"/vpmovm2d:Expand Bits of Mask Register to Packed Doubleword Integersvpmovm2dJvpmovm2dJvpmovm2dJ
vcvtps2uqq`Convert Packed Single Precision Floating-Point Values to Packed Unsigned Quadword Integer Values
vcvtps2uqqJ8
vcvtps2uqqJ9
vcvtps2uqqJ:
vcvtps2uqqJ
vcvtps2uqqJ
vcvtps2uqqJ
vcvtps2uqqJ8
vcvtps2uqqJ
vcvtps2uqqJ9
vcvtps2uqqJ
vcvtps2uqqJ:
vcvtps2uqqJ
vcvtps2uqqJQ
vcvtps2uqqJQ
vshuff64x2=Shuffle 128-Bit Packed Double-Precision Floating-Point Values
vshuff64x2H?
vshuff64x2H
vshuff64x2HA
vshuff64x2H
vshuff64x2H?
vshuff64x2H
vshuff64x2HA
vshuff64x2HcldemoteCache Line Demotecldemote8#vbroadcasti128"Broadcast 128 Bits of Integer Datavbroadcasti128!/
vpcmpistri4Packed Compare Implicit Length Strings, Return Index
vpcmpistri 
vpcmpistri /vmovups<Move Unaligned Packed Single-Precision Floating-Point ValuesvmovupsH0vmovupsHvmovupsH3vmovupsHvmovupsH6vmovupsHvmovupsH/vmovupsH2vmovupsH5vmovups vmovupsHvmovups /vmovupsH/vmovups vmovupsHvmovups 2vmovupsH2vmovupsHvmovupsH5vmovups /vmovupsH/vmovups 2vmovupsH2vmovupsH5vfnmsub213psQFused Negative Multiply-Subtract of Packed Single-Precision Floating-Point Valuesvfnmsub213psH9vfnmsub213psHvfnmsub213psH:vfnmsub213psHvfnmsub213psH;vfnmsub213psHvfnmsub213psH9vfnmsub213ps#vfnmsub213psHvfnmsub213ps#/vfnmsub213psH:vfnmsub213ps#vfnmsub213psHvfnmsub213ps#2vfnmsub213psH;vfnmsub213psHvfnmsub213psHQvfnmsub213psHQvpcomq'Compare Packed Signed Quadword Integersvpcomq"vpcomq"/vpextrbExtract Bytevpextrb vpextrbIvpextrb #vpextrbI#pavgusbAverage Packed Byte Integerspavgusbpavgusb+	vcvtps2dqBConvert Packed Single-Precision FP Values to Packed Dword Integers	vcvtps2dqH9	vcvtps2dqH:	vcvtps2dqH;	vcvtps2dqH	vcvtps2dqH	vcvtps2dqH	vcvtps2dqH9	vcvtps2dq 	vcvtps2dqH	vcvtps2dq /	vcvtps2dqH:	vcvtps2dq 	vcvtps2dqH	vcvtps2dq 2	vcvtps2dqH;	vcvtps2dqH	vcvtps2dqHQ	vcvtps2dqHQvpmovdwDDown Convert Packed Doubleword Values to Word Values with TruncationvpmovdwHvpmovdwH,vpmovdwHvpmovdwH0vpmovdwHvpmovdwH3vpmovdwHvpmovdwHvpmovdwHvpmovdwH+vpmovdwH/vpmovdwH2vpaddsb6Add Packed Signed Byte Integers with Signed SaturationvpaddsbIvpaddsbI/vpaddsbIvpaddsbI2vpaddsbIvpaddsbI5vpaddsb vpaddsbIvpaddsb /vpaddsbI/vpaddsb!vpaddsbIvpaddsb!2vpaddsbI2vpaddsbIvpaddsbI5cmovbe+Move if below or equal (CF == 1 or ZF == 1)cmovbewcmovbew$cmovbelcmovbel'
vextractf64x4AExtract 256 Bits of Packed Double-Precision Floating-Point Values
vextractf64x4H
vextractf64x4H3
vextractf64x4H
vextractf64x4H2vxorps>Bitwise Logical XOR for Single-Precision Floating-Point ValuesvxorpsJ9vxorpsJvxorpsJ:vxorpsJvxorpsJ;vxorpsJvxorpsJ9vxorps vxorpsJvxorps /vxorpsJ:vxorps vxorpsJvxorps 2vxorpsJ;vxorpsJpmuludq,Multiply Packed Unsigned Doubleword Integerspmuludqpmuludq+pmuludqpmuludq/vcvttph2uqqnConvert with Truncation Packed Half Precision Floating-Point Values to Packed Unsigned Quadword Integer Valuesvcvttph2uqqK*vcvttph2uqqK.vcvttph2uqqR<vcvttph2uqqKvcvttph2uqqKvcvttph2uqqRvcvttph2uqqK*vcvttph2uqqKvcvttph2uqqK.vcvttph2uqqKvcvttph2uqqR<vcvttph2uqqRvcvttph2uqqRRvcvttph2uqqRRvpsrlw$Shift Packed Word Data Right LogicalvpsrlwIvpsrlwIvpsrlwI/vpsrlwIvpsrlwIvpsrlwI/vpsrlwIvpsrlwIvpsrlwI/vpsrlwI/vpsrlwI2vpsrlwI5vpsrlw vpsrlwIvpsrlw vpsrlwIvpsrlw /vpsrlwI/vpsrlwI/vpsrlw!vpsrlwIvpsrlw!vpsrlwIvpsrlw!/vpsrlwI/vpsrlwI2vpsrlwIvpsrlwIvpsrlwI/vpsrlwI5	vpdpbusdsXPacked Dot Product of Unsigned-by-Singed Byte subvectors into Doubleword with Saturation	vpdpbusdsK9	vpdpbusdsK	vpdpbusdsK:	vpdpbusdsK	vpdpbusdsV;	vpdpbusdsV	vpdpbusdsK9	vpdpbusdsW	vpdpbusdsK	vpdpbusdsW/	vpdpbusdsK:	vpdpbusdsW	vpdpbusdsK	vpdpbusdsW2	vpdpbusdsV;	vpdpbusdsVendbr32%END (terminate) BRanch in 32-bit modeendbr32addps1Add Packed Single-Precision Floating-Point ValuesaddpsADDPSaddpsADDPS/vtestpd/Packed Double-Precision Floating-Point Bit Testvtestpd vtestpd /vtestpd vtestpd 2kshiftrwShift Right 16-bit MaskskshiftrwHvpsrlq(Shift Packed Quadword Data Right LogicalvpsrlqH=vpsrlqH?vpsrlqHAvpsrlqHvpsrlqHvpsrlqH/vpsrlqHvpsrlqHvpsrlqH/vpsrlqHvpsrlqHvpsrlqH/vpsrlqH=vpsrlq vpsrlqHvpsrlq vpsrlqHvpsrlq /vpsrlqH/vpsrlqH?vpsrlq!vpsrlqHvpsrlq!vpsrlqHvpsrlq!/vpsrlqH/vpsrlqHAvpsrlqHvpsrlqHvpsrlqH/pcmpgtb4Compare Packed Signed Byte Integers for Greater Thanpcmpgtbpcmpgtb+pcmpgtbpcmpgtb/vfmadd132psCFused Multiply-Add of Packed Single-Precision Floating-Point Valuesvfmadd132psH9vfmadd132psHvfmadd132psH:vfmadd132psHvfmadd132psH;vfmadd132psHvfmadd132psH9vfmadd132ps#vfmadd132psHvfmadd132ps#/vfmadd132psH:vfmadd132ps#vfmadd132psHvfmadd132ps#2vfmadd132psH;vfmadd132psHvfmadd132psHQvfmadd132psHQvpandPacked Bitwise Logical ANDvpand vpand /vpand!vpand!2
vcvtuqq2psZConvert Packed Unsigned Quadword Integers to Packed Single-Precision Floating-Point Valuesvcvtuqq2psxJ=vcvtuqq2psyJ?
vcvtuqq2psJAvcvtuqq2psxJvcvtuqq2psyJ
vcvtuqq2psJvcvtuqq2psxJ=vcvtuqq2psyJ?vcvtuqq2psxJvcvtuqq2psyJ
vcvtuqq2psJA
vcvtuqq2psJ
vcvtuqq2psJQ
vcvtuqq2psJQvsha512msg1NPerform an Intermediate Calculation for the Next Four SHA512 Message Quadwordsvsha512msg1)unpcklpsGUnpack and Interleave Low Packed Single-Precision Floating-Point ValuesunpcklpsUNPCKLPSunpcklpsUNPCKLPS/vfnmsub231shOFused Negative Multiply-Subtract of Scalar Half-Precision Floating-Point Valuesvfnmsub231shRvfnmsub231shR$vfnmsub231shRvfnmsub231shR$vfnmsub231shRQvfnmsub231shRQ	vpmacsswwEPacked Multiply Accumulate with Saturation Signed Word to Signed Word	vpmacssww"	vpmacssww"/vextracti128Extract Packed Integer Valuesvextracti128!vextracti128!/rclRotate Left through Carry FlagrclbRCLB	rclbRCLB	rclbRCLB	rclwRCLWrclwRCLWrclwRCLWrcllRCLLrcllRCLLrcllRCLLrclbRCLB#rclbRCLB#rclbRCLB#rclwRCLW$rclwRCLW$rclwRCLW$rcllRCLL'rcllRCLL'rcllRCLL'blcfillFill From Lowest Clear Bitblcfill6blcfill6'vmovhps7Move High Packed Single-Precision Floating-Point Valuesvmovhps +vmovhpsH+vmovhps +vmovhpsH+kandnb#Bitwise Logical AND NOT 8-bit MaskskandnbJcmovnae$Move if not above or equal (CF == 1)cmovnaewcmovnaew$cmovnaelcmovnael'kmovqMove 64-bit MaskkmovqIkmovqI+kmovqI+	cmpxchg8bCompare and Exchange 8 Bytes	cmpxchg8b	CMPXCHG8B	+decDecrement by 1decbDECB	decwDECWdeclDECLdecbDECB#decwDECW$declDECL'int3Interrupt 3 (debug trap)int3	punpckhdq;Unpack and Interleave High-Order Doublewords into Quadwords	punpckhdq	punpckhdq+	punpckhdq	punpckhdq/pmovsxbdDMove Packed Byte Integers to Doubleword Integers with Sign Extensionpmovsxbdpmovsxbd'	vmovdqa64Move Aligned Quadword Values	vmovdqa64H0	vmovdqa64H	vmovdqa64H3	vmovdqa64H	vmovdqa64H6	vmovdqa64H	vmovdqa64H/	vmovdqa64H2	vmovdqa64H5	vmovdqa64H	vmovdqa64H/	vmovdqa64H	vmovdqa64H2	vmovdqa64H	vmovdqa64H5	vmovdqa64H/	vmovdqa64H2	vmovdqa64H5vpandd1Bitwise Logical AND of Packed Doubleword IntegersvpanddH9vpanddHvpanddH:vpanddHvpanddH;vpanddHvpanddH9vpanddHvpanddH:vpanddHvpanddH;vpanddH	vphadduwq/Packed Horizontal Add Unsigned Word to Quadword	vphadduwq"	vphadduwq"/	vpmovzxbqBMove Packed Byte Integers to Quadword Integers with Zero Extension	vpmovzxbqH	vpmovzxbqH	vpmovzxbqH	vpmovzxbqH$	vpmovzxbqH'	vpmovzxbqH+	vpmovzxbq 	vpmovzxbqH	vpmovzxbq $	vpmovzxbqH$	vpmovzxbq!	vpmovzxbqH	vpmovzxbq!'	vpmovzxbqH'	vpmovzxbqH	vpmovzxbqH+blendps4 Blend Packed Single Precision Floating-Point Valuesblendpsblendps/vpshufbPacked Shuffle BytesvpshufbIvpshufbI/vpshufbIvpshufbI2vpshufbIvpshufbI5vpshufb vpshufbIvpshufb /vpshufbI/vpshufb!vpshufbIvpshufb!2vpshufbI2vpshufbIvpshufbI5vcvttss2usiXConvert with Truncation Scalar Single-Precision Floating-Point Value to Unsigned Integervcvttss2usiHvcvttss2usiH'vcvttss2usiHR	vgetexpphhExtract Exponents of Packed Half-Precision Floating-Point Values as Half-Precision Floating-Point Values	vgetexpphK<	vgetexpphK>	vgetexpphR@	vgetexpphK	vgetexpphK	vgetexpphR	vgetexpphK<	vgetexpphK	vgetexpphK>	vgetexpphK	vgetexpphR@	vgetexpphR	vgetexpphRR	vgetexpphRRvinserti32x83Insert 256 Bits of Packed Doubleword Integer Valuesvinserti32x8Jvinserti32x8J2vinserti32x8Jvinserti32x8J2vphsubwd;Packed Horizontal Subtract Signed Word to Signed Doublewordvphsubwd"vphsubwd"/vpord0Bitwise Logical OR of Packed Doubleword IntegersvpordH9vpordHvpordH:vpordHvpordH;vpordHvpordH9vpordHvpordH:vpordHvpordH;vpordH	cvttss2siIConvert with Truncation Scalar Single-Precision FP Value to Dword Integer	cvttss2si	CVTTSS2SL	cvttss2si	CVTTSS2SL'	vpmovzxwdDMove Packed Word Integers to Doubleword Integers with Zero Extension	vpmovzxwdH	vpmovzxwdH	vpmovzxwdH	vpmovzxwdH+	vpmovzxwdH/	vpmovzxwdH2	vpmovzxwd 	vpmovzxwdH	vpmovzxwd +	vpmovzxwdH+	vpmovzxwd!	vpmovzxwdH	vpmovzxwd!/	vpmovzxwdH/	vpmovzxwdH	vpmovzxwdH2cmovae Move if above or equal (CF == 0)cmovaewcmovaew$cmovaelcmovael'adox<Unsigned Integer Addition of Two Operands with Overflow Flagadoxl7adoxl7'pcmpgtd:Compare Packed Signed Doubleword Integers for Greater Thanpcmpgtdpcmpgtd+pcmpgtdpcmpgtd/pfsubr&Packed Floating-Point Subtract Reversepfsubrpfsubr+vfmsubadd213pdXFused Multiply-Alternating Subtract/Add of Packed Double-Precision Floating-Point Valuesvfmsubadd213pdH=vfmsubadd213pdHvfmsubadd213pdH?vfmsubadd213pdHvfmsubadd213pdHAvfmsubadd213pdHvfmsubadd213pdH=vfmsubadd213pd#vfmsubadd213pdHvfmsubadd213pd#/vfmsubadd213pdH?vfmsubadd213pd#vfmsubadd213pdHvfmsubadd213pd#2vfmsubadd213pdHAvfmsubadd213pdHvfmsubadd213pdHQvfmsubadd213pdHQvpcomb#Compare Packed Signed Byte Integersvpcomb"vpcomb"/pmaxsw&Maximum of Packed Signed Word IntegerspmaxswPMAXSW
pmaxswPMAXSW
+pmaxswPMAXSWpmaxswPMAXSW/	vfmaddcphIFused Multiply-Add of Complex Packed Half-Precision Floating-Point Values	vfmaddcphK9	vfmaddcphK	vfmaddcphK:	vfmaddcphK	vfmaddcphR;	vfmaddcphR	vfmaddcphK9	vfmaddcphK	vfmaddcphK:	vfmaddcphK	vfmaddcphR;	vfmaddcphR	vfmaddcphRQ	vfmaddcphRQjna&Jump if not above (CF == 1 or ZF == 1)jnaJLSNjnaJLSOsetge'Set byte if greater or equal (SF == OF)setgeSETGE	setgeSETGE#	vfnmaddssLFused Negative Multiply-Add of Scalar Single-Precision Floating-Point Values	vfnmaddss$	vfnmaddss$'	vfnmaddss$'	vmovmskps8Extract Packed Single-Precision Floating-Point Sign Mask	vmovmskps 	vmovmskps vpaddbAdd Packed Byte IntegersvpaddbIvpaddbI/vpaddbIvpaddbI2vpaddbIvpaddbI5vpaddb vpaddbIvpaddb /vpaddbI/vpaddb!vpaddbIvpaddb!2vpaddbI2vpaddbIvpaddbI5vpmovsqbIDown Convert Packed Quadword Values to Byte Values with Signed SaturationvpmovsqbHvpmovsqbH%vpmovsqbHvpmovsqbH(vpmovsqbHvpmovsqbH,vpmovsqbHvpmovsqbHvpmovsqbHvpmovsqbH$vpmovsqbH'vpmovsqbH+vpsubusb?Subtract Packed Unsigned Byte Integers with Unsigned SaturationvpsubusbIvpsubusbI/vpsubusbIvpsubusbI2vpsubusbIvpsubusbI5vpsubusb vpsubusbIvpsubusb /vpsubusbI/vpsubusb!vpsubusbIvpsubusb!2vpsubusbI2vpsubusbIvpsubusbI5movdMove Doublewordmovdmovdmovdmovd'movdmovd'movd'movd'blciIsolate Lowest Clear Bitblci6blci6'	vpdpbssdsVPacked Dot Product of Signed-by-Singed Byte subvectors into Doubleword with Saturation	vpdpbssdsX	vpdpbssdsX/	vpdpbssdsX	vpdpbssdsX2movdq2q1Move Quadword from XMM to MMX Technology Registermovdq2q	vcvtss2si9Convert Scalar Single-Precision FP Value to Dword Integer	vcvtss2si 	vcvtss2siH	vcvtss2si '	vcvtss2siH'	vcvtss2siHQdpps<Dot Product of Packed Single Precision Floating-Point Valuesdppsdpps/vpmovb2m3Move Signs of Packed Byte Integers to Mask Registervpmovb2mIvpmovb2mIvpmovb2mIcomisdLCompare Scalar Ordered Double-Precision Floating-Point Values and Set EFLAGScomisdCOMISDcomisdCOMISD+vphaddwq4Packed Horizontal Add Signed Word to Signed Quadwordvphaddwq"vphaddwq"/	vunpcklpsGUnpack and Interleave Low Packed Single-Precision Floating-Point Values	vunpcklpsH9	vunpcklpsH	vunpcklpsH:	vunpcklpsH	vunpcklpsH;	vunpcklpsH	vunpcklpsH9	vunpcklps 	vunpcklpsH	vunpcklps /	vunpcklpsH:	vunpcklps 	vunpcklpsH	vunpcklps 2	vunpcklpsH;	vunpcklpsHvextractf128$Extract Packed Floating-Point Valuesvextractf128 vextractf128 /hsubpd$Packed Double-FP Horizontal Subtracthsubpdhsubpd/movmskpd8Extract Packed Double-Precision Floating-Point Sign MaskmovmskpdMOVMSKPDvcomisdLCompare Scalar Ordered Double-Precision Floating-Point Values and Set EFLAGSvcomisd vcomisdHvcomisd +vcomisdH+vcomisdHRvbroadcastf32x47Broadcast Four Single-Precision Floating-Point Elementsvbroadcastf32x4H/vbroadcastf32x4H/vbroadcastf32x4H/vbroadcastf32x4H/xchg&Exchange Register/Memory with Register
xchgbXCHGB		xchgbXCHGB	#xchgwXCHGWxchgwXCHGWxchgwXCHGWxchgwXCHGW$xchglXCHGLxchglXCHGLxchglXCHGLxchglXCHGL'xchgbXCHGB#	xchgwXCHGW$xchglXCHGL'kandnw$Bitwise Logical AND NOT 16-bit MaskskandnwHvbroadcastf128(Broadcast 128 Bit of Floating-Point Datavbroadcastf128 /cmovna&Move if not above (CF == 1 or ZF == 1)cmovnawcmovnaw$cmovnalcmovnal'
vpunpcklbw0Unpack and Interleave Low-Order Bytes into Words
vpunpcklbwI
vpunpcklbwI/
vpunpcklbwI
vpunpcklbwI2
vpunpcklbwI
vpunpcklbwI5
vpunpcklbw 
vpunpcklbwI
vpunpcklbw /
vpunpcklbwI/
vpunpcklbw!
vpunpcklbwI
vpunpcklbw!2
vpunpcklbwI2
vpunpcklbwI
vpunpcklbwI5vfnmadd132ssLFused Negative Multiply-Add of Scalar Single-Precision Floating-Point Valuesvfnmadd132ssHvfnmadd132ssH'vfnmadd132ss#vfnmadd132ssHvfnmadd132ss#'vfnmadd132ssH'vfnmadd132ssHQvfnmadd132ssHQ	vblendvps= Variable Blend Packed Single Precision Floating-Point Values	vblendvps 	vblendvps /	vblendvps 	vblendvps 2pfminPacked Floating-Point Minimumpfminpfmin+
gf2p8affineqb(Galois Field (2^8) Affine Transformation
gf2p8affineqb
gf2p8affineqb/rsqrtpsTCompute Reciprocals of Square Roots of Packed Single-Precision Floating-Point ValuesrsqrtpsRSQRTPSrsqrtpsRSQRTPS/vpmaxuw(Maximum of Packed Unsigned Word IntegersvpmaxuwIvpmaxuwI/vpmaxuwIvpmaxuwI2vpmaxuwIvpmaxuwI5vpmaxuw vpmaxuwIvpmaxuw /vpmaxuwI/vpmaxuw!vpmaxuwIvpmaxuw!2vpmaxuwI2vpmaxuwIvpmaxuwI5vpsllq'Shift Packed Quadword Data Left LogicalvpsllqH=vpsllqH?vpsllqHAvpsllqHvpsllqHvpsllqH/vpsllqHvpsllqHvpsllqH/vpsllqHvpsllqHvpsllqH/vpsllqH=vpsllq vpsllqHvpsllq vpsllqHvpsllq /vpsllqH/vpsllqH?vpsllq!vpsllqHvpsllq!vpsllqHvpsllq!/vpsllqH/vpsllqHAvpsllqHvpsllqHvpsllqH/vpsubusw?Subtract Packed Unsigned Word Integers with Unsigned SaturationvpsubuswIvpsubuswI/vpsubuswIvpsubuswI2vpsubuswIvpsubuswI5vpsubusw vpsubuswIvpsubusw /vpsubuswI/vpsubusw!vpsubuswIvpsubusw!2vpsubuswI2vpsubuswIvpsubuswI5vpshlqPacked Shift Logical Quadwordsvpshlq"vpshlq"/vpshlq"/pmovsxdqHMove Packed Doubleword Integers to Quadword Integers with Sign Extensionpmovsxdqpmovsxdq+setlSet byte if less (SF != OF)setlSETLT	setlSETLT#	vcvtph2qq\Convert Packed Half Precision Floating-Point Values to Packed Singed Quadword Integer Values	vcvtph2qqK*	vcvtph2qqK.	vcvtph2qqR<	vcvtph2qqK	vcvtph2qqK	vcvtph2qqR	vcvtph2qqK*	vcvtph2qqK	vcvtph2qqK.	vcvtph2qqK	vcvtph2qqR<	vcvtph2qqR	vcvtph2qqRQ	vcvtph2qqRQ	cvttpd2piRConvert with Truncation Packed Double-Precision FP Values to Packed Dword Integers	cvttpd2pi	CVTTPD2PL	cvttpd2pi	CVTTPD2PL/movntpdKStore Packed Double-Precision Floating-Point Values Using Non-Temporal HintmovntpdMOVNTPD/divpd4Divide Packed Double-Precision Floating-Point ValuesdivpdDIVPDdivpdDIVPD/vcomissLCompare Scalar Ordered Single-Precision Floating-Point Values and Set EFLAGSvcomiss vcomissHvcomiss 'vcomissH'vcomissHRpdepParallel Bits Depositpdepl5pdepl5'vscatterpf0dpd�Sparse Prefetch Packed Double-Precision Floating-Point Data Values with Signed Doubleword Indices Using T0 Hint with Intent to Writevscatterpf0dpdLGvfmsub132phFFused Multiply-Subtract of Packed Half-Precision Floating-Point Valuesvfmsub132phK<vfmsub132phKvfmsub132phK>vfmsub132phKvfmsub132phR@vfmsub132phRvfmsub132phK<vfmsub132phKvfmsub132phK>vfmsub132phKvfmsub132phR@vfmsub132phRvfmsub132phRQvfmsub132phRQt1mskcInverse Mask From Trailing Onest1mskc6t1mskc6'vsha512rnds2&Perform Two Rounds of SHA512 Operationvsha512rnds2)daa Decimal Adjust AL after AdditiondaaDAApi2fd6Packed Integer to Floating-Point Doubleword Conversionpi2fdpi2fd+ucomissNUnordered Compare Scalar Single-Precision Floating-Point Values and Set EFLAGSucomissUCOMISSucomissUCOMISS'vphsubbw5Packed Horizontal Subtract Signed Byte to Signed Wordvphsubbw"vphsubbw"/vpshuflwShuffle Packed Low WordsvpshuflwIvpshuflwIvpshuflwIvpshuflwI/vpshuflwI2vpshuflwI5vpshuflw vpshuflwIvpshuflw /vpshuflwI/vpshuflw!vpshuflwIvpshuflw!2vpshuflwI2vpshuflwIvpshuflwI5vscatterpf0qpd�Sparse Prefetch Packed Double-Precision Floating-Point Data Values with Signed Quadword Indices Using T0 Hint with Intent to Writevscatterpf0qpdLMvfmadd132phAFused Multiply-Add of Packed Half-Precision Floating-Point Valuesvfmadd132phK<vfmadd132phKvfmadd132phK>vfmadd132phKvfmadd132phR@vfmadd132phRvfmadd132phK<vfmadd132phKvfmadd132phK>vfmadd132phKvfmadd132phR@vfmadd132phRvfmadd132phRQvfmadd132phRQsetnae(Set byte if not above or equal (CF == 1)setnaeSETCS	setnaeSETCS#vpslldq)Shift Packed Double Quadword Left Logicalvpslldq vpslldqIvpslldqI/vpslldq!vpslldqIvpslldqI2vpslldqIvpslldqI5vpcomud+Compare Packed Unsigned Doubleword Integersvpcomud"vpcomud"/sha256rnds2&Perform Two Rounds of SHA256 Operationsha256rnds2(sha256rnds2(/	vptestnmq;Logical NAND of Packed Quadword Integer Values and Set Mask	vptestnmqH=	vptestnmqH=	vptestnmqH	vptestnmqH	vptestnmqH?	vptestnmqH?	vptestnmqH	vptestnmqH	vptestnmqHA	vptestnmqHA	vptestnmqH	vptestnmqHpextParallel Bits Extractpextl5pextl5'vfmaddsdCFused Multiply-Add of Scalar Double-Precision Floating-Point Valuesvfmaddsd$vfmaddsd$+vfmaddsd$+vpshufdShuffle Packed DoublewordsvpshufdH9vpshufdH:vpshufdH;vpshufdHvpshufdHvpshufdHvpshufdH9vpshufd vpshufdHvpshufd /vpshufdH:vpshufd!vpshufdHvpshufd!2vpshufdH;vpshufdHvcvtneeph2ps:Convert Even Elements of Packed FP16 Values to FP32 Valuesvcvtneeph2psZ/vcvtneeph2psZ2pminsw&Minimum of Packed Signed Word IntegerspminswPMINSW
pminswPMINSW
+pminswPMINSWpminswPMINSW/	vcvtps2ph<Convert Single-Precision FP value to Half-Precision FP value	vcvtps2phH	vcvtps2phH,	vcvtps2phH	vcvtps2phH0	vcvtps2phH	vcvtps2phH3	vcvtps2ph%	vcvtps2phH	vcvtps2ph%	vcvtps2phH	vcvtps2phH	vcvtps2ph%+	vcvtps2phH+	vcvtps2ph%/	vcvtps2phH/	vcvtps2phH2	vcvtps2phHR	vcvtps2phHR
vpgatherdd?Gather Packed Doubleword Values Using Signed Doubleword Indices
vpgatherddHB
vpgatherddHF
vpgatherddHJ
vpgatherdd!B
vpgatherdd!FvfmsubaddpdXFused Multiply-Alternating Subtract/Add of Packed Double-Precision Floating-Point Valuesvfmsubaddpd$vfmsubaddpd$/vfmsubaddpd$/vfmsubaddpd$vfmsubaddpd$2vfmsubaddpd$2kxorq Bitwise Logical XOR 64-bit MaskskxorqIvpcmpgtq$Compare Packed Data for Greater ThanvpcmpgtqH=vpcmpgtqH=vpcmpgtqHvpcmpgtqHvpcmpgtqH?vpcmpgtqH?vpcmpgtqHvpcmpgtqHvpcmpgtqHAvpcmpgtqHAvpcmpgtqHvpcmpgtqHvpcmpgtq vpcmpgtq /vpcmpgtq!vpcmpgtq!2pminub(Minimum of Packed Unsigned Byte IntegerspminubPMINUB
pminubPMINUB
+pminubPMINUBpminubPMINUB/vmulss6Multiply Scalar Single-Precision Floating-Point ValuesvmulssHvmulssH'vmulss vmulssHvmulss 'vmulssH'vmulssHQvmulssHQvinserti128Insert Packed Integer Valuesvinserti128!vinserti128!/vfmulcshEFused Multiply of Complex Scalar Half-Precision Floating-Point ValuesvfmulcshRvfmulcshR'vfmulcshRvfmulcshR'vfmulcshRQvfmulcshRQaesimc+Perform the AES InvMixColumn Transformationaesimc'aesimc'/mwaitxMonitor Wait with TimeoutmwaitxEvpopcntw)Packed Population Count for Word IntegersvpopcntwKvpopcntwKvpopcntwSvpopcntwK/vpopcntwK2vpopcntwS5vpopcntwKvpopcntwK/vpopcntwKvpopcntwK2vpopcntwSvpopcntwS5btcBit Test and ComplementbtcwBTCWbtcwBTCWbtclBTCLbtclBTCLbtcwBTCW$btcwBTCW$btclBTCL'btclBTCL'vproldRotate Packed Doubleword LeftvproldH9vproldH:vproldH;vproldHvproldHvproldHvproldH9vproldHvproldH:vproldHvproldH;vproldHjneJump if not equal (ZF == 0)jneJNENjneJNEO
clflushoptFlush Cache Line Optimized
clflushopt:#haddpdPacked Double-FP Horizontal Addhaddpdhaddpd/vpsubbSubtract Packed Byte IntegersvpsubbIvpsubbI/vpsubbIvpsubbI2vpsubbIvpsubbI5vpsubb vpsubbIvpsubb /vpsubbI/vpsubb!vpsubbIvpsubb!2vpsubbI2vpsubbIvpsubbI5	vmovdqu32 Move Unaligned Doubleword Values	vmovdqu32H0	vmovdqu32H	vmovdqu32H3	vmovdqu32H	vmovdqu32H6	vmovdqu32H	vmovdqu32H/	vmovdqu32H2	vmovdqu32H5	vmovdqu32H	vmovdqu32H/	vmovdqu32H	vmovdqu32H2	vmovdqu32H	vmovdqu32H5	vmovdqu32H/	vmovdqu32H2	vmovdqu32H5kunpckbw!Unpack and Interleave 8-bit MaskskunpckbwH	cvttpd2dqRConvert with Truncation Packed Double-Precision FP Values to Packed Dword Integers	cvttpd2dq	cvttpd2dq/kshiftlwShift Left 16-bit MaskskshiftlwHnopNo OperationnopNOPshrd$Integer Double Precision Shift Rightshrdwshrdwshrdlshrdlshrdw$shrdw$shrdl'shrdl'	vcvtsi2sd9Convert Dword Integer to Scalar Double-Precision FP Value
vcvtsi2sdl 
vcvtsi2sdlH
vcvtsi2sdl '
vcvtsi2sdlH'
vpgatherdq=Gather Packed Quadword Values Using Signed Doubleword Indices
vpgatherdqHB
vpgatherdqHB
vpgatherdqHF
vpgatherdq!B
vpgatherdq!BvppermPacked Permute Bytesvpperm"vpperm"/vpperm"/vpmaxud.Maximum of Packed Unsigned Doubleword IntegersvpmaxudH9vpmaxudHvpmaxudH:vpmaxudHvpmaxudH;vpmaxudHvpmaxudH9vpmaxud vpmaxudHvpmaxud /vpmaxudH:vpmaxud!vpmaxudHvpmaxud!2vpmaxudH;vpmaxudHvrndscaleph\Round Packed Half-Precision Floating-Point Values To Include A Given Number Of Fraction BitsvrndscalephK<vrndscalephK>vrndscalephR@vrndscalephKvrndscalephKvrndscalephRvrndscalephK<vrndscalephKvrndscalephK>vrndscalephKvrndscalephR@vrndscalephRvrndscalephRRvrndscalephRR	vgetexpssiExtract Exponent of Scalar Single-Precision Floating-Point Value as Single-Precision Floating-Point Value	vgetexpssH	vgetexpssH'	vgetexpssH	vgetexpssH'	vgetexpssHR	vgetexpssHRvpunpckhqdq@Unpack and Interleave High-Order Quadwords into Double QuadwordsvpunpckhqdqH=vpunpckhqdqHvpunpckhqdqH?vpunpckhqdqHvpunpckhqdqHAvpunpckhqdqHvpunpckhqdqH=vpunpckhqdq vpunpckhqdqHvpunpckhqdq /vpunpckhqdqH?vpunpckhqdq!vpunpckhqdqHvpunpckhqdq!2vpunpckhqdqHAvpunpckhqdqHjpJump if parity (PF == 1)jpJPSNjpJPSOblendpd3Blend Packed Double Precision Floating-Point Valuesblendpdblendpd/pfcmpgt.Packed Floating-Point Compare for Greater Thanpfcmpgtpfcmpgt+sarArithmetic Shift RightsarbSARB	sarbSARB	sarbSARB	sarwSARWsarwSARWsarwSARWsarlSARLsarlSARLsarlSARLsarbSARB#sarbSARB#sarbSARB#sarwSARW$sarwSARW$sarwSARW$sarlSARL'sarlSARL'sarlSARL'vscatterpf1dps�Sparse Prefetch Packed Single-Precision Floating-Point Data Values with Signed Doubleword Indices Using T1 Hint with Intent to Writevscatterpf1dpsLK	pclmulqdq"Carry-Less Quadword Multiplication	pclmulqdq&	pclmulqdq&/	vpmovsxwdDMove Packed Word Integers to Doubleword Integers with Sign Extension	vpmovsxwdH	vpmovsxwdH	vpmovsxwdH	vpmovsxwdH+	vpmovsxwdH/	vpmovsxwdH2	vpmovsxwd 	vpmovsxwdH	vpmovsxwd +	vpmovsxwdH+	vpmovsxwd!	vpmovsxwdH	vpmovsxwd!/	vpmovsxwdH/	vpmovsxwdH	vpmovsxwdH2vpshad#Packed Shift Arithmetic Doublewordsvpshad"vpshad"/vpshad"/pavgwAverage Packed Word Integerspavgw
pavgw
+pavgwpavgw/vporq.Bitwise Logical OR of Packed Quadword IntegersvporqH=vporqHvporqH?vporqHvporqHAvporqHvporqH=vporqHvporqH?vporqHvporqHAvporqHvsha512msg2FPerform a Final Calculation for the Next Four SHA512 Message Quadwordsvsha512msg2)mulsd6Multiply Scalar Double-Precision Floating-Point ValuesmulsdMULSDmulsdMULSD+vpsllvq0Variable Shift Packed Quadword Data Left LogicalvpsllvqH=vpsllvqHvpsllvqH?vpsllvqHvpsllvqHAvpsllvqHvpsllvqH=vpsllvq!vpsllvqHvpsllvq!/vpsllvqH?vpsllvq!vpsllvqHvpsllvq!2vpsllvqHAvpsllvqHvpshldd9Concatenate and Shift Packed Doubleword Data Left LogicalvpshlddK9vpshlddKvpshlddK:vpshlddKvpshlddU;vpshlddUvpshlddK9vpshlddKvpshlddK:vpshlddKvpshlddU;vpshlddUcmovpMove if parity (PF == 1)cmovpwcmovpw$cmovplcmovpl'joJump if overflow (OF == 1)joJOSNjoJOSOpfrsqit18Packed Floating-Point Reciprocal Square Root Iteration 1pfrsqit1pfrsqit1+vfixupimmps<Fix Up Special Packed Single-Precision Floating-Point ValuesvfixupimmpsK9vfixupimmpsKvfixupimmpsH:vfixupimmpsHvfixupimmpsH;vfixupimmpsHvfixupimmpsK9vfixupimmpsKvfixupimmpsH:vfixupimmpsHvfixupimmpsH;vfixupimmpsHvfixupimmpsHRvfixupimmpsHRpmovsxbqBMove Packed Byte Integers to Quadword Integers with Sign Extensionpmovsxbqpmovsxbq$crc32Accumulate CRC32 Valuecrc32b	crc32wcrc32lcrc32b#crc32w$crc32l'minps<Return Minimum Packed Single-Precision Floating-Point ValuesminpsMINPSminpsMINPS/
vperm2i128Permute 128-Bit Integer Values
vperm2i128!
vperm2i128!2vpmadd52huqjPacked Multiply of Unsigned 52-bit Unsigned Integers and Add High 52-bit Products to Quadword Accumulatorsvpmadd52huqK=vpmadd52huqKvpmadd52huqK?vpmadd52huqKvpmadd52huqOAvpmadd52huqOvpmadd52huqK=vpmadd52huqKvpmadd52huq[vpmadd52huq[/vpmadd52huqK?vpmadd52huqKvpmadd52huq[vpmadd52huq[2vpmadd52huqOAvpmadd52huqO	vfnmaddsdLFused Negative Multiply-Add of Scalar Double-Precision Floating-Point Values	vfnmaddsd$	vfnmaddsd$+	vfnmaddsd$+pfrcpit2,Packed Floating-Point Reciprocal Iteration 2pfrcpit2pfrcpit2+vpmovqbBDown Convert Packed Quadword Values to Byte Values with TruncationvpmovqbHvpmovqbH%vpmovqbHvpmovqbH(vpmovqbHvpmovqbH,vpmovqbHvpmovqbHvpmovqbHvpmovqbH$vpmovqbH'vpmovqbH+notOne's Complement NegationnotbNOTB	notwNOTWnotlNOTLnotbNOTB#notwNOTW$notlNOTL'vfmaddsubpsXFused Multiply-Alternating Add/Subtract of Packed Single-Precision Floating-Point Valuesvfmaddsubps$vfmaddsubps$/vfmaddsubps$/vfmaddsubps$vfmaddsubps$2vfmaddsubps$2	vpermt2pdZFull Permute of Double-Precision Floating-Point Values From Two Tables Overwriting a Table	vpermt2pdH=	vpermt2pdH	vpermt2pdH?	vpermt2pdH	vpermt2pdHA	vpermt2pdH	vpermt2pdH=	vpermt2pdH	vpermt2pdH?	vpermt2pdH	vpermt2pdHA	vpermt2pdHsalArithmetic Shift LeftsalbSALB	salbSALB	salbSALB	salwSALWsalwSALWsalwSALWsallSALLsallSALLsallSALLsalbSALB#salbSALB#salbSALB#salwSALW$salwSALW$salwSALW$sallSALL'sallSALL'sallSALL'vfnmsub231phOFused Negative Multiply-Subtract of Packed Half-Precision Floating-Point Valuesvfnmsub231phK<vfnmsub231phKvfnmsub231phK>vfnmsub231phKvfnmsub231phR@vfnmsub231phRvfnmsub231phK<vfnmsub231phKvfnmsub231phK>vfnmsub231phKvfnmsub231phR@vfnmsub231phRvfnmsub231phRQvfnmsub231phRQjnle0Jump if not less or equal (ZF == 0 and SF == OF)jnleJGTNjnleJGTOvpcmpgtw4Compare Packed Signed Word Integers for Greater ThanvpcmpgtwIvpcmpgtwIvpcmpgtwI/vpcmpgtwI/vpcmpgtwIvpcmpgtwIvpcmpgtwI2vpcmpgtwI2vpcmpgtwIvpcmpgtwIvpcmpgtwI5vpcmpgtwI5vpcmpgtw vpcmpgtw /vpcmpgtw!vpcmpgtw!2	vcvtpd2dqBConvert Packed Double-Precision FP Values to Packed Dword Integers
vcvtpd2dqxH=
vcvtpd2dqyH?	vcvtpd2dqHA
vcvtpd2dqxH
vcvtpd2dqyH	vcvtpd2dqH
vcvtpd2dqxH=
vcvtpd2dqyH?
vcvtpd2dqx 
vcvtpd2dqxH
vcvtpd2dqy 
vcvtpd2dqyH
vcvtpd2dqx /
vcvtpd2dqy 2	vcvtpd2dqHA	vcvtpd2dqH	vcvtpd2dqHQ	vcvtpd2dqHQvscatterdpsTScatter Packed Single-Precision Floating-Point Values with Signed Doubleword IndicesvscatterdpsHCvscatterdpsHGvscatterdpsHKvpshldq7Concatenate and Shift Packed Quadword Data Left LogicalvpshldqK=vpshldqKvpshldqK?vpshldqKvpshldqUAvpshldqUvpshldqK=vpshldqKvpshldqK?vpshldqKvpshldqUAvpshldqUcvtsi2sd9Convert Dword Integer to Scalar Double-Precision FP Value	cvtsi2sdlCVTSL2SD	cvtsi2sdlCVTSL2SD'movupd<Move Unaligned Packed Double-Precision Floating-Point ValuesmovupdMOVUPDmovupdMOVUPD/movupdMOVUPD/
vcvtss2usiSConvert Scalar Single-Precision Floating-Point Value to Unsigned Doubleword Integer
vcvtss2usiH
vcvtss2usiH'
vcvtss2usiHQvfmaddsub132psXFused Multiply-Alternating Add/Subtract of Packed Single-Precision Floating-Point Valuesvfmaddsub132psH9vfmaddsub132psHvfmaddsub132psH:vfmaddsub132psHvfmaddsub132psH;vfmaddsub132psHvfmaddsub132psH9vfmaddsub132ps#vfmaddsub132psHvfmaddsub132ps#/vfmaddsub132psH:vfmaddsub132ps#vfmaddsub132psHvfmaddsub132ps#2vfmaddsub132psH;vfmaddsub132psHvfmaddsub132psHQvfmaddsub132psHQretReturn from ProcedureretlRETretlcmovg&Move if greater (ZF == 0 and SF == OF)cmovgwcmovgw$cmovglcmovgl'
vshuff32x4=Shuffle 128-Bit Packed Single-Precision Floating-Point Values
vshuff32x4H:
vshuff32x4H
vshuff32x4H;
vshuff32x4H
vshuff32x4H:
vshuff32x4H
vshuff32x4H;
vshuff32x4H	vcvtss2shJConvert Scalar Single-Precision FP Value to Scalar Half-Precision FP Value	vcvtss2shR	vcvtss2shR'	vcvtss2shR	vcvtss2shR'	vcvtss2shRQ	vcvtss2shRQcmovle+Move if less or equal (ZF == 1 or SF != OF)cmovlewcmovlew$cmovlelcmovlel'kandnd$Bitwise Logical AND NOT 32-bit MaskskandndIpavgbAverage Packed Byte Integerspavgb
pavgb
+pavgbpavgb/jnge'Jump if not greater or equal (SF != OF)jngeJLTNjngeJLTO
vcvtusi2sdHConvert Unsigned Integer to Scalar Double-Precision Floating-Point Valuevcvtusi2sdlHvcvtusi2sdlH'vpdpbusdHPacked Dot Product of Unsigned-by-Singed Byte subvectors into DoublewordvpdpbusdK9vpdpbusdKvpdpbusdK:vpdpbusdKvpdpbusdV;vpdpbusdVvpdpbusdK9vpdpbusdWvpdpbusdKvpdpbusdW/vpdpbusdK:vpdpbusdWvpdpbusdKvpdpbusdW2vpdpbusdV;vpdpbusdVmovlpd5Move Low Packed Double-Precision Floating-Point ValuemovlpdMOVLPD+movlpdMOVLPD+	vpmadcswd?Packed Multiply Add Accumulate Signed Word to Signed Doubleword	vpmadcswd"	vpmadcswd"/vpshld Packed Shift Logical Doublewordsvpshld"vpshld"/vpshld"/
aesenclast,Perform Last Round of an AES Encryption Flow
aesenclast'
aesenclast'/	vpblendvbVariable Blend Packed Bytes	vpblendvb 	vpblendvb /	vpblendvb!	vpblendvb!2vpmaxsb&Maximum of Packed Signed Byte IntegersvpmaxsbIvpmaxsbI/vpmaxsbIvpmaxsbI2vpmaxsbIvpmaxsbI5vpmaxsb vpmaxsbIvpmaxsb /vpmaxsbI/vpmaxsb!vpmaxsbIvpmaxsb!2vpmaxsbI2vpmaxsbIvpmaxsbI5
vcvtpd2udq\Convert Packed Double-Precision Floating-Point Values to Packed Unsigned Doubleword Integersvcvtpd2udqxH=vcvtpd2udqyH?
vcvtpd2udqHAvcvtpd2udqxHvcvtpd2udqyH
vcvtpd2udqHvcvtpd2udqxH=vcvtpd2udqyH?vcvtpd2udqxHvcvtpd2udqyH
vcvtpd2udqHA
vcvtpd2udqH
vcvtpd2udqHQ
vcvtpd2udqHQblcic%Isolate Lowest Set Bit and Complementblcic6blcic6'vrcp28pdtApproximation to the Reciprocal of Packed Double-Precision Floating-Point Values with Less Than 2^-28 Relative Errorvrcp28pdMAvrcp28pdMvrcp28pdMAvrcp28pdMvrcp28pdMRvrcp28pdMRmaxss;Return Maximum Scalar Single-Precision Floating-Point ValuemaxssMAXSSmaxssMAXSS'kxnorq!Bitwise Logical XNOR 64-bit MaskskxnorqIcbwConvert Byte to Wordcbtwvpsraw'Shift Packed Word Data Right ArithmeticvpsrawIvpsrawIvpsrawI/vpsrawIvpsrawIvpsrawI/vpsrawIvpsrawIvpsrawI/vpsrawI/vpsrawI2vpsrawI5vpsraw vpsrawIvpsraw vpsrawIvpsraw /vpsrawI/vpsrawI/vpsraw!vpsrawIvpsraw!vpsrawIvpsraw!/vpsrawI/vpsrawI2vpsrawIvpsrawIvpsrawI/vpsrawI5vbroadcasti32x8#Broadcast Eight Doubleword Elementsvbroadcasti32x8J2vbroadcasti32x8J2vfmaddsub132phVFused Multiply-Alternating Add/Subtract of Packed Half-Precision Floating-Point Valuesvfmaddsub132phK<vfmaddsub132phKvfmaddsub132phK>vfmaddsub132phKvfmaddsub132phR@vfmaddsub132phRvfmaddsub132phK<vfmaddsub132phKvfmaddsub132phK>vfmaddsub132phKvfmaddsub132phR@vfmaddsub132phRvfmaddsub132phRQvfmaddsub132phRQkadddADD Two 32-bit MaskskadddIvpcompressbBStore Sparse Packed Byte Integer Values into Dense Memory/RegistervpcompressbK0vpcompressbKvpcompressbK3vpcompressbKvpcompressbU6vpcompressbUvpcompressbKvpcompressbKvpcompressbUvpcompressbK/vpcompressbK2vpcompressbU5vcmpph3Compare Packed Half-Precision Floating-Point ValuesvcmpphK<vcmpphK<vcmpphKvcmpphKvcmpphK>vcmpphK>vcmpphKvcmpphKvcmpphR@vcmpphR@vcmpphRvcmpphRvcmpphRRvcmpphRRvrcppsPCompute Approximate Reciprocals of Packed Single-Precision Floating-Point Valuesvrcpps vrcpps /vrcpps vrcpps 2vfmaddsub231psXFused Multiply-Alternating Add/Subtract of Packed Single-Precision Floating-Point Valuesvfmaddsub231psH9vfmaddsub231psHvfmaddsub231psH:vfmaddsub231psHvfmaddsub231psH;vfmaddsub231psHvfmaddsub231psH9vfmaddsub231ps#vfmaddsub231psHvfmaddsub231ps#/vfmaddsub231psH:vfmaddsub231ps#vfmaddsub231psHvfmaddsub231ps#2vfmaddsub231psH;vfmaddsub231psHvfmaddsub231psHQvfmaddsub231psHQvrcp28ssuApproximation to the Reciprocal of a Scalar Single-Precision Floating-Point Value with Less Than 2^-28 Relative Errorvrcp28ssMvrcp28ssM'vrcp28ssMvrcp28ssM'vrcp28ssMRvrcp28ssMRvfmsub231pdHFused Multiply-Subtract of Packed Double-Precision Floating-Point Valuesvfmsub231pdH=vfmsub231pdHvfmsub231pdH?vfmsub231pdHvfmsub231pdHAvfmsub231pdHvfmsub231pdH=vfmsub231pd#vfmsub231pdHvfmsub231pd#/vfmsub231pdH?vfmsub231pd#vfmsub231pdHvfmsub231pd#2vfmsub231pdHAvfmsub231pdHvfmsub231pdHQvfmsub231pdHQ
vcvtps2phx<Convert Single-Precision FP value to Half-Precision FP valuevcvtps2phxxK9vcvtps2phxyK:
vcvtps2phxR;vcvtps2phxxKvcvtps2phxyK
vcvtps2phxRvcvtps2phxxK9vcvtps2phxyK:vcvtps2phxxKvcvtps2phxyK
vcvtps2phxR;
vcvtps2phxR
vcvtps2phxRQ
vcvtps2phxRQ	vpmovmskbMove Byte Mask	vpmovmskb 	vpmovmskb!	pcmpestrm3Packed Compare Explicit Length Strings, Return Mask
pcmpestrml
pcmpestrml/	vexpandpsKLoad Sparse Packed Single-Precision Floating-Point Values from Dense Memory	vexpandpsH	vexpandpsH	vexpandpsH	vexpandpsH/	vexpandpsH2	vexpandpsH5	vexpandpsH	vexpandpsH/	vexpandpsH	vexpandpsH2	vexpandpsH	vexpandpsH5
vperm2f128Permute Floating-Point Values
vperm2f128 
vperm2f128 2vpermi2dAFull Permute of Doublewords From Two Tables Overwriting the Indexvpermi2dH9vpermi2dHvpermi2dH:vpermi2dHvpermi2dH;vpermi2dHvpermi2dH9vpermi2dHvpermi2dH:vpermi2dHvpermi2dH;vpermi2dH	vpermt2psZFull Permute of Single-Precision Floating-Point Values From Two Tables Overwriting a Table	vpermt2psH9	vpermt2psH	vpermt2psH:	vpermt2psH	vpermt2psH;	vpermt2psH	vpermt2psH9	vpermt2psH	vpermt2psH:	vpermt2psH	vpermt2psH;	vpermt2psHvfmsubpsHFused Multiply-Subtract of Packed Single-Precision Floating-Point Valuesvfmsubps$vfmsubps$/vfmsubps$/vfmsubps$vfmsubps$2vfmsubps$2vphaddw#Packed Horizontal Add Word Integersvphaddw vphaddw /vphaddw!vphaddw!2setnle4Set byte if not less or equal (ZF == 0 and SF == OF)setnleSETGT	setnleSETGT#pcmpeqb%Compare Packed Byte Data for EqualitypcmpeqbPCMPEQBpcmpeqbPCMPEQB+pcmpeqbPCMPEQBpcmpeqbPCMPEQB/cmovpoMove if parity odd (PF == 0)cmovpowcmovpow$cmovpolcmovpol'setle/Set byte if less or equal (ZF == 1 or SF != OF)setleSETLE	setleSETLE#kxorbBitwise Logical XOR 8-bit MaskskxorbJcmovnlMove if not less (SF == OF)cmovnlwcmovnlw$cmovnllcmovnll'psubbSubtract Packed Byte IntegerspsubbPSUBBpsubbPSUBB+psubbPSUBBpsubbPSUBB/vbroadcasti32x2!Broadcast Two Doubleword Elementsvbroadcasti32x2Jvbroadcasti32x2Jvbroadcasti32x2Jvbroadcasti32x2J+vbroadcasti32x2J+vbroadcasti32x2J+vbroadcasti32x2Jvbroadcasti32x2J+vbroadcasti32x2Jvbroadcasti32x2J+vbroadcasti32x2Jvbroadcasti32x2J+	vmovdqu64Move Unaligned Quadword Values	vmovdqu64H0	vmovdqu64H	vmovdqu64H3	vmovdqu64H	vmovdqu64H6	vmovdqu64H	vmovdqu64H/	vmovdqu64H2	vmovdqu64H5	vmovdqu64H	vmovdqu64H/	vmovdqu64H	vmovdqu64H2	vmovdqu64H	vmovdqu64H5	vmovdqu64H/	vmovdqu64H2	vmovdqu64H5cvtpi2psBConvert Packed Dword Integers to Packed Single-Precision FP Valuescvtpi2psCVTPL2PScvtpi2psCVTPL2PS+packssdw2Pack Doublewords into Words with Signed Saturationpackssdwpackssdw+packssdwpackssdw/vmovw	Move WordvmovwRvmovwRvmovwR$vmovwR$cmovlMove if less (SF != OF)cmovlwcmovlw$cmovllcmovll'vpabsb&Packed Absolute Value of Byte IntegersvpabsbIvpabsbIvpabsbIvpabsbI/vpabsbI2vpabsbI5vpabsb vpabsbIvpabsb /vpabsbI/vpabsb!vpabsbIvpabsb!2vpabsbI2vpabsbIvpabsbI5	vpblendmd0Blend Doubleword Vectors Using an OpMask Control	vpblendmdH9	vpblendmdH	vpblendmdH:	vpblendmdH	vpblendmdH;	vpblendmdH	vpblendmdH9	vpblendmdH	vpblendmdH:	vpblendmdH	vpblendmdH;	vpblendmdHvpabsd,Packed Absolute Value of Doubleword IntegersvpabsdH9vpabsdH:vpabsdH;vpabsdHvpabsdHvpabsdHvpabsdH9vpabsd vpabsdHvpabsd /vpabsdH:vpabsd!vpabsdHvpabsd!2vpabsdH;vpabsdHvpsubsb;Subtract Packed Signed Byte Integers with Signed SaturationvpsubsbIvpsubsbI/vpsubsbIvpsubsbI2vpsubsbIvpsubsbI5vpsubsb vpsubsbIvpsubsb /vpsubsbI/vpsubsb!vpsubsbIvpsubsb!2vpsubsbI2vpsubsbIvpsubsbI5vpmullw9Multiply Packed Signed Word Integers and Store Low ResultvpmullwIvpmullwI/vpmullwIvpmullwI2vpmullwIvpmullwI5vpmullw vpmullwIvpmullw /vpmullwI/vpmullw!vpmullwIvpmullw!2vpmullwI2vpmullwIvpmullwI5setnzSet byte if not zero (ZF == 0)setnzSETNE	setnzSETNE#	gf2p8mulbGalois Field Multiply Bytes	gf2p8mulb	gf2p8mulb/vandpdDBitwise Logical AND of Packed Double-Precision Floating-Point ValuesvandpdJ=vandpdJvandpdJ?vandpdJvandpdJAvandpdJvandpdJ=vandpd vandpdJvandpd /vandpdJ?vandpd vandpdJvandpd 2vandpdJAvandpdJvfmadd231ssCFused Multiply-Add of Scalar Single-Precision Floating-Point Valuesvfmadd231ssHvfmadd231ssH'vfmadd231ss#vfmadd231ssHvfmadd231ss#'vfmadd231ssH'vfmadd231ssHQvfmadd231ssHQmpsadbw3Compute Multiple Packed Sums of Absolute Differencempsadbwmpsadbw/vmovddup Move One Double-FP and DuplicatevmovddupHvmovddupHvmovddupHvmovddupH+vmovddupH2vmovddupH5vmovddup vmovddupHvmovddup +vmovddupH+vmovddup vmovddupHvmovddup 2vmovddupH2vmovddupHvmovddupH5vpbroadcastmb2q=Broadcast Low Byte of Mask Register to Packed Quadword Valuesvpbroadcastmb2qNvpbroadcastmb2qNvpbroadcastmb2qNvfmaddsub231phVFused Multiply-Alternating Add/Subtract of Packed Half-Precision Floating-Point Valuesvfmaddsub231phK<vfmaddsub231phKvfmaddsub231phK>vfmaddsub231phKvfmaddsub231phR@vfmaddsub231phRvfmaddsub231phK<vfmaddsub231phKvfmaddsub231phK>vfmaddsub231phKvfmaddsub231phR@vfmaddsub231phRvfmaddsub231phRQvfmaddsub231phRQvfmadd132pdCFused Multiply-Add of Packed Double-Precision Floating-Point Valuesvfmadd132pdH=vfmadd132pdHvfmadd132pdH?vfmadd132pdHvfmadd132pdHAvfmadd132pdHvfmadd132pdH=vfmadd132pd#vfmadd132pdHvfmadd132pd#/vfmadd132pdH?vfmadd132pd#vfmadd132pdHvfmadd132pd#2vfmadd132pdHAvfmadd132pdHvfmadd132pdHQvfmadd132pdHQ
vrsqrt28pd�Approximation to the Reciprocal Square Root of Packed Double-Precision Floating-Point Values with Less Than 2^-28 Relative Error
vrsqrt28pdMA
vrsqrt28pdM
vrsqrt28pdMA
vrsqrt28pdM
vrsqrt28pdMR
vrsqrt28pdMRclflushFlush Cache Lineclflush9#	cvttsd2siJConvert with Truncation Scalar Double-Precision FP Value to Signed Integer	cvttsd2si	CVTTSD2SL	cvttsd2si	CVTTSD2SL+vpcmpub#Compare Packed Unsigned Byte ValuesvpcmpubIvpcmpubIvpcmpubI/vpcmpubI/vpcmpubIvpcmpubIvpcmpubI2vpcmpubI2vpcmpubIvpcmpubIvpcmpubI5vpcmpubI5
vfpclasspd;Test Class of Packed Double-Precision Floating-Point ValuesvfpclasspdxJ=vfpclasspdxJ=vfpclasspdyJ?vfpclasspdyJ?vfpclasspdzJAvfpclasspdzJAvfpclasspdxJvfpclasspdxJvfpclasspdyJvfpclasspdyJvfpclasspdzJvfpclasspdzJsetpe!Set byte if parity even (PF == 1)setpeSETPS	setpeSETPS#palignrPacked Align Rightpalignrpalignr+palignrpalignr/vfmsub231shFFused Multiply-Subtract of Scalar Half-Precision Floating-Point Valuesvfmsub231shRvfmsub231shR$vfmsub231shRvfmsub231shR$vfmsub231shRQvfmsub231shRQvpshldvw<Concatenate and Variable Shift Packed Word Data Left LogicalvpshldvwKvpshldvwK/vpshldvwKvpshldvwK2vpshldvwUvpshldvwU5vpshldvwKvpshldvwK/vpshldvwKvpshldvwK2vpshldvwUvpshldvwU5	vpmovsxdqHMove Packed Doubleword Integers to Quadword Integers with Sign Extension	vpmovsxdqH	vpmovsxdqH	vpmovsxdqH	vpmovsxdqH+	vpmovsxdqH/	vpmovsxdqH2	vpmovsxdq 	vpmovsxdqH	vpmovsxdq +	vpmovsxdqH+	vpmovsxdq!	vpmovsxdqH	vpmovsxdq!/	vpmovsxdqH/	vpmovsxdqH	vpmovsxdqH2vptestmd<Logical AND of Packed Doubleword Integer Values and Set MaskvptestmdH9vptestmdH9vptestmdHvptestmdHvptestmdH:vptestmdH:vptestmdHvptestmdHvptestmdH;vptestmdH;vptestmdHvptestmdH
vpmaskmovd+Conditional Move Packed Doubleword Integers
vpmaskmovd!/
vpmaskmovd!2
vpmaskmovd!/
vpmaskmovd!2kxorw Bitwise Logical XOR 16-bit MaskskxorwH
vgatherpf1dpdoSparse Prefetch Packed Double-Precision Floating-Point Data Values with Signed Doubleword Indices Using T1 Hint
vgatherpf1dpdLGvrangessYRange Restriction Calculation For a pair of Scalar Single-Precision Floating-Point ValuesvrangessJvrangessJ'vrangessJvrangessJ'vrangessJRvrangessJRcvtdq2psBConvert Packed Dword Integers to Packed Single-Precision FP Valuescvtdq2pscvtdq2ps/jecxzJump if ECX register is 0jecxzJCXZLNmovbeMove Data After Swapping Bytesmovbew/$movbel/'movbew/$movbel/'xorLogical Exclusive ORxorbXORBxorbXORB	xorbXORB		xorbXORB	#xorwXORWxorwXORWxorwXORWxorwXORWxorwXORW$xorlXORLxorlXORLxorlXORLxorlXORLxorlXORL'xorbXORB#xorbXORB#	xorwXORW$xorwXORW$xorwXORW$xorlXORL'xorlXORL'xorlXORL'vrndscaleps^Round Packed Single-Precision Floating-Point Values To Include A Given Number Of Fraction BitsvrndscalepsH9vrndscalepsH:vrndscalepsH;vrndscalepsHvrndscalepsHvrndscalepsHvrndscalepsH9vrndscalepsHvrndscalepsH:vrndscalepsHvrndscalepsH;vrndscalepsHvrndscalepsHRvrndscalepsHRcmovnge'Move if not greater or equal (SF != OF)cmovngewcmovngew$cmovngelcmovngel'into#Interrupt 4 If Overflow Flag is SetintoINTOvaesdec+Perform One Round of an AES Decryption Flow
vaesdec vaesdecKvaesdec /vaesdecK/vaesdecvaesdecKvaesdec2vaesdecK2vaesdecHvaesdecH5jzJump if zero (ZF == 1)jzJEQNjzJEQOvfnmsub231sdQFused Negative Multiply-Subtract of Scalar Double-Precision Floating-Point Valuesvfnmsub231sdHvfnmsub231sdH+vfnmsub231sd#vfnmsub231sdHvfnmsub231sd#+vfnmsub231sdH+vfnmsub231sdHQvfnmsub231sdHQvfnmsub213shOFused Negative Multiply-Subtract of Scalar Half-Precision Floating-Point Valuesvfnmsub213shRvfnmsub213shR$vfnmsub213shRvfnmsub213shR$vfnmsub213shRQvfnmsub213shRQvmovdqaMove Aligned Double Quadwordvmovdqa vmovdqa /vmovdqa vmovdqa 2vmovdqa /vmovdqa 2vpcmpuw#Compare Packed Unsigned Word ValuesvpcmpuwIvpcmpuwIvpcmpuwI/vpcmpuwI/vpcmpuwIvpcmpuwIvpcmpuwI2vpcmpuwI2vpcmpuwIvpcmpuwIvpcmpuwI5vpcmpuwI5	vpmovzxwqBMove Packed Word Integers to Quadword Integers with Zero Extension	vpmovzxwqH	vpmovzxwqH	vpmovzxwqH	vpmovzxwqH'	vpmovzxwqH+	vpmovzxwqH/	vpmovzxwq 	vpmovzxwqH	vpmovzxwq '	vpmovzxwqH'	vpmovzxwq!	vpmovzxwqH	vpmovzxwq!+	vpmovzxwqH+	vpmovzxwqH	vpmovzxwqH/vpconflictdWDetect Conflicts Within a Vector of Packed Doubleword Values into Dense Memory/RegistervpconflictdN9vpconflictdN:vpconflictdN;vpconflictdNvpconflictdNvpconflictdNvpconflictdN9vpconflictdNvpconflictdN:vpconflictdNvpconflictdN;vpconflictdNjcJump if carry (CF == 1)jcJCSNjcJCSOvhaddpsPacked Single-FP Horizontal Addvhaddps vhaddps /vhaddps vhaddps 2setnge+Set byte if not greater or equal (SF != OF)setngeSETLT	setngeSETLT#cmpss5Compare Scalar Single-Precision Floating-Point ValuescmpssCMPSScmpssCMPSS'
vcvtph2uqq^Convert Packed Half Precision Floating-Point Values to Packed Unsigned Quadword Integer Values
vcvtph2uqqK*
vcvtph2uqqK.
vcvtph2uqqR<
vcvtph2uqqK
vcvtph2uqqK
vcvtph2uqqR
vcvtph2uqqK*
vcvtph2uqqK
vcvtph2uqqK.
vcvtph2uqqK
vcvtph2uqqR<
vcvtph2uqqR
vcvtph2uqqRQ
vcvtph2uqqRQvfmsubadd213phVFused Multiply-Alternating Subtract/Add of Packed Half-Precision Floating-Point Valuesvfmsubadd213phK<vfmsubadd213phKvfmsubadd213phK>vfmsubadd213phKvfmsubadd213phR@vfmsubadd213phRvfmsubadd213phK<vfmsubadd213phKvfmsubadd213phK>vfmsubadd213phKvfmsubadd213phR@vfmsubadd213phRvfmsubadd213phRQvfmsubadd213phRQvinserti32x43Insert 128 Bits of Packed Doubleword Integer Valuesvinserti32x4Hvinserti32x4H/vinserti32x4Hvinserti32x4H/vinserti32x4Hvinserti32x4H/vinserti32x4Hvinserti32x4H/	vphaddubd1Packed Horizontal Add Unsigned Byte to Doubleword	vphaddubd"	vphaddubd"/setpSet byte if parity (PF == 1)setpSETPS	setpSETPS#vpmovm2b4Expand Bits of Mask Register to Packed Byte Integersvpmovm2bIvpmovm2bIvpmovm2bI	vcvtuw2phTConvert Packed Unsigned Word Integers to Packed Half-Precision Floating-Point Values	vcvtuw2phK<	vcvtuw2phK>	vcvtuw2phR@	vcvtuw2phK	vcvtuw2phK	vcvtuw2phR	vcvtuw2phK<	vcvtuw2phK	vcvtuw2phK>	vcvtuw2phK	vcvtuw2phR@	vcvtuw2phR	vcvtuw2phRQ	vcvtuw2phRQkandbBitwise Logical AND 8-bit MaskskandbJ
vextracti32x84Extract 256 Bits of Packed Doubleword Integer Values
vextracti32x8J
vextracti32x8J3
vextracti32x8J
vextracti32x8J2vpminsd,Minimum of Packed Signed Doubleword IntegersvpminsdH9vpminsdHvpminsdH:vpminsdHvpminsdH;vpminsdHvpminsdH9vpminsd vpminsdHvpminsd /vpminsdH:vpminsd!vpminsdHvpminsd!2vpminsdH;vpminsdHsetoSet byte if overflow (OF == 1)setoSETOS	setoSETOS#tzmskMask From Trailing Zerostzmsk6tzmsk6'blsiIsolate Lowest Set Bitblsil4blsil4'knotdNOT 32-bit Mask RegisterknotdIrdtscRead Time-Stamp CounterrdtscRDTSCvfmsub132ssHFused Multiply-Subtract of Scalar Single-Precision Floating-Point Valuesvfmsub132ssHvfmsub132ssH'vfmsub132ss#vfmsub132ssHvfmsub132ss#'vfmsub132ssH'vfmsub132ssHQvfmsub132ssHQ	vpacksswb,Pack Words into Bytes with Signed Saturation	vpacksswbI	vpacksswbI/	vpacksswbI	vpacksswbI2	vpacksswbI	vpacksswbI5	vpacksswb 	vpacksswbI	vpacksswb /	vpacksswbI/	vpacksswb!	vpacksswbI	vpacksswb!2	vpacksswbI2	vpacksswbI	vpacksswbI5vfmaddpdCFused Multiply-Add of Packed Double-Precision Floating-Point Valuesvfmaddpd$vfmaddpd$/vfmaddpd$/vfmaddpd$vfmaddpd$2vfmaddpd$2	vgetexppdlExtract Exponents of Packed Double-Precision Floating-Point Values as Double-Precision Floating-Point Values	vgetexppdH=	vgetexppdH?	vgetexppdHA	vgetexppdH	vgetexppdH	vgetexppdH	vgetexppdH=	vgetexppdH	vgetexppdH?	vgetexppdH	vgetexppdHA	vgetexppdH	vgetexppdHR	vgetexppdHRvpxor#Packed Bitwise Logical Exclusive ORvpxor vpxor /vpxor!vpxor!2	serializeSerialize Instruction Execution	serializeF	vmovdqa32Move Aligned Doubleword Values	vmovdqa32H0	vmovdqa32H	vmovdqa32H3	vmovdqa32H	vmovdqa32H6	vmovdqa32H	vmovdqa32H/	vmovdqa32H2	vmovdqa32H5	vmovdqa32H	vmovdqa32H/	vmovdqa32H	vmovdqa32H2	vmovdqa32H	vmovdqa32H5	vmovdqa32H/	vmovdqa32H2	vmovdqa32H5
vgetmantphMExtract Normalized Mantissas from Packed Half-Precision Floating-Point Values
vgetmantphK<
vgetmantphK>
vgetmantphR@
vgetmantphK
vgetmantphK
vgetmantphR
vgetmantphK<
vgetmantphK
vgetmantphK>
vgetmantphK
vgetmantphR@
vgetmantphR
vgetmantphRR
vgetmantphRRkandd Bitwise Logical AND 32-bit MaskskanddI	vunpcklpdGUnpack and Interleave Low Packed Double-Precision Floating-Point Values	vunpcklpdH=	vunpcklpdH	vunpcklpdH?	vunpcklpdH	vunpcklpdHA	vunpcklpdH	vunpcklpdH=	vunpcklpd 	vunpcklpdH	vunpcklpd /	vunpcklpdH?	vunpcklpd 	vunpcklpdH	vunpcklpd 2	vunpcklpdHA	vunpcklpdHpmulhw:Multiply Packed Signed Word Integers and Store High Resultpmulhwpmulhw+pmulhwpmulhw/
vgetmantshKExtract Normalized Mantissa from Scalar Half-Precision Floating-Point Value
vgetmantshR
vgetmantshR$
vgetmantshR
vgetmantshR$
vgetmantshRR
vgetmantshRRpushPush Value Onto the StackpushqpushqpushwPUSHWpushlPUSHLpushwPUSHW$pushlPUSHL'vrndscalepd^Round Packed Double-Precision Floating-Point Values To Include A Given Number Of Fraction BitsvrndscalepdH=vrndscalepdH?vrndscalepdHAvrndscalepdHvrndscalepdHvrndscalepdHvrndscalepdH=vrndscalepdHvrndscalepdH?vrndscalepdHvrndscalepdHAvrndscalepdHvrndscalepdHRvrndscalepdHRmulss6Multiply Scalar Single-Precision Floating-Point ValuesmulssMULSSmulssMULSS'korqBitwise Logical OR 64-bit MaskskorqIroundps3Round Packed Single Precision Floating-Point Valuesroundpsroundps/vfmadd213phAFused Multiply-Add of Packed Half-Precision Floating-Point Valuesvfmadd213phK<vfmadd213phKvfmadd213phK>vfmadd213phKvfmadd213phR@vfmadd213phRvfmadd213phK<vfmadd213phKvfmadd213phK>vfmadd213phKvfmadd213phR@vfmadd213phRvfmadd213phRQvfmadd213phRQvptestmw6Logical AND of Packed Word Integer Values and Set MaskvptestmwIvptestmwIvptestmwI/vptestmwI/vptestmwIvptestmwIvptestmwI2vptestmwI2vptestmwIvptestmwIvptestmwI5vptestmwI5setg*Set byte if greater (ZF == 0 and SF == OF)setgSETGT	setgSETGT#vmovapd:Move Aligned Packed Double-Precision Floating-Point ValuesvmovapdH0vmovapdHvmovapdH3vmovapdHvmovapdH6vmovapdHvmovapdH/vmovapdH2vmovapdH5vmovapd vmovapdHvmovapd /vmovapdH/vmovapd vmovapdHvmovapd 2vmovapdH2vmovapdHvmovapdH5vmovapd /vmovapdH/vmovapd 2vmovapdH2vmovapdH5vpmaxsq*Maximum of Packed Signed Quadword IntegersvpmaxsqH=vpmaxsqHvpmaxsqH?vpmaxsqHvpmaxsqHAvpmaxsqHvpmaxsqH=vpmaxsqHvpmaxsqH?vpmaxsqHvpmaxsqHAvpmaxsqH	vinsertps3Insert Packed Single Precision Floating-Point Value	vinsertps 	vinsertpsH	vinsertps '	vinsertpsH'vpsignd"Packed Sign of Doubleword Integersvpsignd vpsignd /vpsignd!vpsignd!2vminpd<Return Minimum Packed Double-Precision Floating-Point ValuesvminpdH=vminpdHvminpdH?vminpdHvminpdHAvminpdHvminpdH=vminpd vminpdHvminpd /vminpdH?vminpd vminpdHvminpd 2vminpdHAvminpdHvminpdHRvminpdHR
vrsqrt28ps�Approximation to the Reciprocal Square Root of Packed Single-Precision Floating-Point Values with Less Than 2^-28 Relative Error
vrsqrt28psM;
vrsqrt28psM
vrsqrt28psM;
vrsqrt28psM
vrsqrt28psMR
vrsqrt28psMRvdivss4Divide Scalar Single-Precision Floating-Point ValuesvdivssHvdivssH'vdivss vdivssHvdivss 'vdivssH'vdivssHQvdivssHQandpdDBitwise Logical AND of Packed Double-Precision Floating-Point ValuesandpdANDPDandpdANDPD/ktestd#Bit Test 32-bit Masks and Set FlagsktestdImovsd1Move Scalar Double-Precision Floating-Point ValuemovsdMOVSDmovsdMOVSD+movsdMOVSD+vfmsubssHFused Multiply-Subtract of Scalar Single-Precision Floating-Point Valuesvfmsubss$vfmsubss$'vfmsubss$'vmpsadbw3Compute Multiple Packed Sums of Absolute Differencevmpsadbw vmpsadbw /vmpsadbw!vmpsadbw!2movlhps>Move Packed Single-Precision Floating-Point Values Low to HighmovlhpsMOVLHPSvpmaddwd,Multiply and Add Packed Signed Word IntegersvpmaddwdIvpmaddwdI/vpmaddwdIvpmaddwdI2vpmaddwdIvpmaddwdI5vpmaddwd vpmaddwdIvpmaddwd /vpmaddwdI/vpmaddwd!vpmaddwdIvpmaddwd!2vpmaddwdI2vpmaddwdIvpmaddwdI5kortestdOR 32-bit Masks and Set FlagskortestdIvfnmsub132ssQFused Negative Multiply-Subtract of Scalar Single-Precision Floating-Point Valuesvfnmsub132ssHvfnmsub132ssH'vfnmsub132ss#vfnmsub132ssHvfnmsub132ss#'vfnmsub132ssH'vfnmsub132ssHQvfnmsub132ssHQaddsubpsPacked Single-FP Add/Subtractaddsubpsaddsubps/vrcp14sdPCompute Approximate Reciprocal of a Scalar Double-Precision Floating-Point Valuevrcp14sdHvrcp14sdH+vrcp14sdHvrcp14sdH+movntdq-Store Double Quadword Using Non-Temporal HintmovntdqMOVNTO/vfmsub213sdHFused Multiply-Subtract of Scalar Double-Precision Floating-Point Valuesvfmsub213sdHvfmsub213sdH+vfmsub213sd#vfmsub213sdHvfmsub213sd#+vfmsub213sdH+vfmsub213sdHQvfmsub213sdHQvandnpdHBitwise Logical AND NOT of Packed Double-Precision Floating-Point ValuesvandnpdJ=vandnpdJvandnpdJ?vandnpdJvandnpdJAvandnpdJvandnpdJ=vandnpd vandnpdJvandnpd /vandnpdJ?vandnpd vandnpdJvandnpd 2vandnpdJAvandnpdJvbroadcastf64x26Broadcast Two Double-Precision Floating-Point Elementsvbroadcastf64x2J/vbroadcastf64x2J/vbroadcastf64x2J/vbroadcastf64x2J/gf2p8affineinvqb0Galois Field (2^8) Affine Inverse Transformationgf2p8affineinvqbgf2p8affineinvqb/clwbCache Line Write Backclwb;#valigndAlign Doubleword VectorsvaligndH9valigndHvaligndH:valigndHvaligndH;valigndHvaligndH9valigndHvaligndH:valigndHvaligndH;valigndH
vcvtusi2ssHConvert Unsigned Integer to Scalar Single-Precision Floating-Point Valuevcvtusi2sslHvcvtusi2sslH'vcvtusi2sslHQvpcmpd'Compare Packed Signed Doubleword ValuesvpcmpdH9vpcmpdH9vpcmpdHvpcmpdHvpcmpdH:vpcmpdH:vpcmpdHvpcmpdHvpcmpdH;vpcmpdH;vpcmpdHvpcmpdHvpsrlvw-Variable Shift Packed Word Data Right LogicalvpsrlvwIvpsrlvwI/vpsrlvwIvpsrlvwI2vpsrlvwIvpsrlvwI5vpsrlvwIvpsrlvwI/vpsrlvwIvpsrlvwI2vpsrlvwIvpsrlvwI5jbe+Jump if below or equal (CF == 1 or ZF == 1)jbeJLSNjbeJLSOjng)Jump if not greater (ZF == 1 or SF != OF)jngJLENjngJLEOjnsJump if not sign (SF == 0)jnsJPLNjnsJPLO
vgetmantpsOExtract Normalized Mantissas from Packed Single-Precision Floating-Point Values
vgetmantpsH9
vgetmantpsH:
vgetmantpsH;
vgetmantpsH
vgetmantpsH
vgetmantpsH
vgetmantpsH9
vgetmantpsH
vgetmantpsH:
vgetmantpsH
vgetmantpsH;
vgetmantpsH
vgetmantpsHR
vgetmantpsHRvaddps1Add Packed Single-Precision Floating-Point ValuesvaddpsH9vaddpsHvaddpsH:vaddpsHvaddpsH;vaddpsHvaddpsH9vaddps vaddpsHvaddps /vaddpsH:vaddps vaddpsHvaddps 2vaddpsH;vaddpsHvaddpsHQvaddpsHQvbroadcasti32x4"Broadcast Four Doubleword Elementsvbroadcasti32x4H/vbroadcasti32x4H/vbroadcasti32x4H/vbroadcasti32x4H/vpcmpb!Compare Packed Signed Byte ValuesvpcmpbIvpcmpbIvpcmpbI/vpcmpbI/vpcmpbIvpcmpbIvpcmpbI2vpcmpbI2vpcmpbIvpcmpbIvpcmpbI5vpcmpbI5pshufwShuffle Packed Wordspshufw
pshufw
+
vextracti32x44Extract 128 Bits of Packed Doubleword Integer Values
vextracti32x4H
vextracti32x4H0
vextracti32x4H
vextracti32x4H0
vextracti32x4H
vextracti32x4H
vextracti32x4H/
vextracti32x4H/vfnmsub132pdQFused Negative Multiply-Subtract of Packed Double-Precision Floating-Point Valuesvfnmsub132pdH=vfnmsub132pdHvfnmsub132pdH?vfnmsub132pdHvfnmsub132pdHAvfnmsub132pdHvfnmsub132pdH=vfnmsub132pd#vfnmsub132pdHvfnmsub132pd#/vfnmsub132pdH?vfnmsub132pd#vfnmsub132pdHvfnmsub132pd#2vfnmsub132pdHAvfnmsub132pdHvfnmsub132pdHQvfnmsub132pdHQvpminub(Minimum of Packed Unsigned Byte IntegersvpminubIvpminubI/vpminubIvpminubI2vpminubIvpminubI5vpminub vpminubIvpminub /vpminubI/vpminub!vpminubIvpminub!2vpminubI2vpminubIvpminubI5vcompresspsUStore Sparse Packed Single-Precision Floating-Point Values into Dense Memory/RegistervcompresspsHvcompresspsH0vcompresspsHvcompresspsH3vcompresspsHvcompresspsH6vcompresspsHvcompresspsHvcompresspsHvcompresspsH/vcompresspsH2vcompresspsH5	vpmovsxbw>Move Packed Byte Integers to Word Integers with Sign Extension	vpmovsxbwI	vpmovsxbwI	vpmovsxbwI	vpmovsxbwI+	vpmovsxbwI/	vpmovsxbwI2	vpmovsxbw 	vpmovsxbwI	vpmovsxbw +	vpmovsxbwI+	vpmovsxbw!	vpmovsxbwI	vpmovsxbw!/	vpmovsxbwI/	vpmovsxbwI	vpmovsxbwI2movMovemovbMOVB	movbMOVB		movbMOVB	#movwMOVWmovwMOVWmovwMOVW$movabsl movlMOVLmovlMOVLmovlMOVL'movbMOVB#movbMOVB#	movwMOVW$movwMOVW$movlMOVL'movlMOVL'movabsl movshdup(Move Packed Single-FP High and Duplicatemovshdupmovshdup/psrlw$Shift Packed Word Data Right Logicalpsrlwpsrlwpsrlw+psrlwpsrlwpsrlw/orLogical Inclusive ORorbORBorbORB	orbORB		orbORB	#orwORWorwORWorwORWorwORWorwORW$orlORLorlORLorlORLorlORLorlORL'orbORB#orbORB#	orwORW$orwORW$orwORW$orlORL'orlORL'orlORL'stmxcsrStore MXCSR Register Statestmxcsr'psubwSubtract Packed Word IntegerspsubwPSUBWpsubwPSUBW+psubwPSUBWpsubwPSUBW/vpsrad-Shift Packed Doubleword Data Right ArithmeticvpsradH9vpsradH:vpsradH;vpsradHvpsradHvpsradH/vpsradHvpsradHvpsradH/vpsradHvpsradHvpsradH/vpsradH9vpsrad vpsradHvpsrad vpsradHvpsrad /vpsradH/vpsradH:vpsrad!vpsradHvpsrad!vpsradHvpsrad!/vpsradH/vpsradH;vpsradHvpsradHvpsradH/jeJump if equal (ZF == 1)jeJEQNjeJEQOrdtscp(Read Time-Stamp Counter and Processor IDrdtscpvminph:Return Minimum Packed Half-Precision Floating-Point ValuesvminphK<vminphKvminphK>vminphKvminphR@vminphRvminphK<vminphKvminphK>vminphKvminphR@vminphRvminphRRvminphRRrcppsPCompute Approximate Reciprocals of Packed Single-Precision Floating-Point ValuesrcppsRCPPSrcppsRCPPS/vphsubswFPacked Horizontal Subtract Signed Word Integers with Signed Saturationvphsubsw vphsubsw /vphsubsw!vphsubsw!2vpcmovPacked Conditional Movevpcmov"vpcmov"/vpcmov"/vpcmov"vpcmov"2vpcmov"2
vgatherqpsRGather Packed Single-Precision Floating-Point Values Using Signed Quadword Indices
vgatherqpsHD
vgatherqpsHH
vgatherqpsHL
vgatherqps!D
vgatherqps!HvpcompressdHStore Sparse Packed Doubleword Integer Values into Dense Memory/RegistervpcompressdHvpcompressdH0vpcompressdHvpcompressdH3vpcompressdHvpcompressdH6vpcompressdHvpcompressdHvpcompressdHvpcompressdH/vpcompressdH2vpcompressdH5
vextracti64x22Extract 128 Bits of Packed Quadword Integer Values
vextracti64x2J
vextracti64x2J0
vextracti64x2J
vextracti64x2J0
vextracti64x2J
vextracti64x2J
vextracti64x2J/
vextracti64x2J/vpmovdbDDown Convert Packed Doubleword Values to Byte Values with TruncationvpmovdbHvpmovdbH(vpmovdbHvpmovdbH,vpmovdbHvpmovdbH0vpmovdbHvpmovdbHvpmovdbHvpmovdbH'vpmovdbH+vpmovdbH/packuswb.Pack Words into Bytes with Unsigned Saturationpackuswbpackuswb+packuswbpackuswb/pshufbPacked Shuffle BytespshufbPSHUFBpshufbPSHUFB+pshufbPSHUFBpshufbPSHUFB/rdrandRead Random Numberrdrand*rdrand*
vpunpckhwd7Unpack and Interleave High-Order Words into Doublewords
vpunpckhwdI
vpunpckhwdI/
vpunpckhwdI
vpunpckhwdI2
vpunpckhwdI
vpunpckhwdI5
vpunpckhwd 
vpunpckhwdI
vpunpckhwd /
vpunpckhwdI/
vpunpckhwd!
vpunpckhwdI
vpunpckhwd!2
vpunpckhwdI2
vpunpckhwdI
vpunpckhwdI5vsubsd6Subtract Scalar Double-Precision Floating-Point ValuesvsubsdHvsubsdH+vsubsd vsubsdHvsubsd +vsubsdH+vsubsdHQvsubsdHQbextrBit Field Extractbextrl6bextrl4bextrl6'bextrl4'phsubw(Packed Horizontal Subtract Word Integersphsubwphsubw+phsubwphsubw/pfpnacc2Packed Floating-Point Positive-Negative Accumulatepfpnaccpfpnacc+popcnt Count of Number of Bits Set to 1popcntw2popcntw2$popcntl2popcntl2'pfrcpit1,Packed Floating-Point Reciprocal Iteration 1pfrcpit1pfrcpit1+vbcstnesh2ps<Load FP16 Element and Convert to FP32 Element with Broadcastvbcstnesh2psZ$vbcstnesh2psZ$sbbSubtract with BorrowsbbbSBBBsbbbSBBB	sbbbSBBB		sbbbSBBB	#sbbwSBBWsbbwSBBWsbbwSBBWsbbwSBBWsbbwSBBW$sbblSBBLsbblSBBLsbblSBBLsbblSBBLsbblSBBL'sbbbSBBB#sbbbSBBB#	sbbwSBBW$sbbwSBBW$sbbwSBBW$sbblSBBL'sbblSBBL'sbblSBBL'vprolqRotate Packed Quadword LeftvprolqH=vprolqH?vprolqHAvprolqHvprolqHvprolqHvprolqH=vprolqHvprolqH?vprolqHvprolqHAvprolqHpfaddPacked Floating-Point Addpfaddpfadd+divss4Divide Scalar Single-Precision Floating-Point ValuesdivssDIVSSdivssDIVSS'	vpermilps.Permute Single-Precision Floating-Point Values 	vpermilpsH9	vpermilpsH:	vpermilpsH;	vpermilpsH9	vpermilpsH	vpermilpsH	vpermilpsH:	vpermilpsH	vpermilpsH	vpermilpsH;	vpermilpsH	vpermilpsH	vpermilpsH9	vpermilpsH9	vpermilps 	vpermilpsH	vpermilps 	vpermilpsH	vpermilps /	vpermilps /	vpermilpsH:	vpermilpsH:	vpermilps 	vpermilpsH	vpermilps 	vpermilpsH	vpermilps 2	vpermilps 2	vpermilpsH;	vpermilpsH;	vpermilpsH	vpermilpsHblsmskMask From Lowest Set Bitblsmskl4blsmskl4'vpavgbAverage Packed Byte IntegersvpavgbIvpavgbI/vpavgbIvpavgbI2vpavgbIvpavgbI5vpavgb vpavgbIvpavgb /vpavgbI/vpavgb!vpavgbIvpavgb!2vpavgbI2vpavgbIvpavgbI5pfnacc)Packed Floating-Point Negative Accumulatepfnaccpfnacc+phaddswAPacked Horizontal Add Signed Word Integers with Signed Saturationphaddswphaddsw+phaddswphaddsw/vpermt2w9Full Permute of Words From Two Tables Overwriting a Tablevpermt2wIvpermt2wI/vpermt2wIvpermt2wI2vpermt2wIvpermt2wI5vpermt2wIvpermt2wI/vpermt2wIvpermt2wI2vpermt2wIvpermt2wI5prefetchwt1APrefetch Vector Data Into Caches with Intent to Write and T1 Hintprefetchwt1C#movapd:Move Aligned Packed Double-Precision Floating-Point ValuesmovapdMOVAPDmovapdMOVAPD/movapdMOVAPD/	vgetexpsdiExtract Exponent of Scalar Double-Precision Floating-Point Value as Double-Precision Floating-Point Value	vgetexpsdH	vgetexpsdH+	vgetexpsdH	vgetexpsdH+	vgetexpsdHR	vgetexpsdHRvcvttpd2udqlConvert with Truncation Packed Double-Precision Floating-Point Values to Packed Unsigned Doubleword Integersvcvttpd2udqxH=vcvttpd2udqyH?vcvttpd2udqHAvcvttpd2udqxHvcvttpd2udqyHvcvttpd2udqHvcvttpd2udqxH=vcvttpd2udqyH?vcvttpd2udqxHvcvttpd2udqyHvcvttpd2udqHAvcvttpd2udqHvcvttpd2udqHRvcvttpd2udqHRintCall to Interrupt ProcedureintINT	vcvtqq2phOConvert Packed Quadword Integers to Packed Half-Precision Floating-Point Values
vcvtqq2phxK=
vcvtqq2phyK?
vcvtqq2phzRA
vcvtqq2phxK
vcvtqq2phyK
vcvtqq2phzR
vcvtqq2phxK=
vcvtqq2phyK?
vcvtqq2phzRA
vcvtqq2phxK
vcvtqq2phyK
vcvtqq2phzR
vcvtqq2phzRQ
vcvtqq2phzRQmfenceMemory FencemfenceMFENCEvbroadcasti64x4 Broadcast Four Quadword Elementsvbroadcasti64x4H2vbroadcasti64x4H2vmovhlps>Move Packed Single-Precision Floating-Point Values High to Lowvmovhlps vmovhlpsHbswap	Byte Swapbswapvpmaxub(Maximum of Packed Unsigned Byte IntegersvpmaxubIvpmaxubI/vpmaxubIvpmaxubI2vpmaxubIvpmaxubI5vpmaxub vpmaxubIvpmaxub /vpmaxubI/vpmaxub!vpmaxubIvpmaxub!2vpmaxubI2vpmaxubIvpmaxubI5vpmaxuq,Maximum of Packed Unsigned Quadword IntegersvpmaxuqH=vpmaxuqHvpmaxuqH?vpmaxuqHvpmaxuqHAvpmaxuqHvpmaxuqH=vpmaxuqHvpmaxuqH?vpmaxuqHvpmaxuqHAvpmaxuqHaddsd1Add Scalar Double-Precision Floating-Point ValuesaddsdADDSDaddsdADDSD+vprorqRotate Packed Quadword RightvprorqH=vprorqH?vprorqHAvprorqHvprorqHvprorqHvprorqH=vprorqHvprorqH?vprorqHvprorqHAvprorqHvpunpcklqdq?Unpack and Interleave Low-Order Quadwords into Double QuadwordsvpunpcklqdqH=vpunpcklqdqHvpunpcklqdqH?vpunpcklqdqHvpunpcklqdqHAvpunpcklqdqHvpunpcklqdqH=vpunpcklqdq vpunpcklqdqHvpunpcklqdq /vpunpcklqdqH?vpunpcklqdq!vpunpcklqdqHvpunpcklqdq!2vpunpcklqdqHAvpunpcklqdqH
vpcmpistrm3Packed Compare Implicit Length Strings, Return Mask
vpcmpistrm 
vpcmpistrm /
vpmaskmovq)Conditional Move Packed Quadword Integers
vpmaskmovq!/
vpmaskmovq!2
vpmaskmovq!/
vpmaskmovq!2
vcvtps2udqbConvert Packed Single-Precision Floating-Point Values to Packed Unsigned Doubleword Integer Values
vcvtps2udqH9
vcvtps2udqH:
vcvtps2udqH;
vcvtps2udqH
vcvtps2udqH
vcvtps2udqH
vcvtps2udqH9
vcvtps2udqH
vcvtps2udqH:
vcvtps2udqH
vcvtps2udqH;
vcvtps2udqH
vcvtps2udqHQ
vcvtps2udqHQvpopcntq-Packed Population Count for Quadword IntegersvpopcntqK=vpopcntqK?vpopcntqPAvpopcntqKvpopcntqKvpopcntqPvpopcntqK=vpopcntqKvpopcntqK?vpopcntqKvpopcntqPAvpopcntqPaddss1Add Scalar Single-Precision Floating-Point ValuesaddssADDSSaddssADDSS'cmovng)Move if not greater (ZF == 1 or SF != OF)cmovngwcmovngw$cmovnglcmovngl'vcomishJCompare Scalar Ordered Half-Precision Floating-Point Values and Set EFLAGSvcomishRvcomishR$vcomishRRvscatterqpsRScatter Packed Single-Precision Floating-Point Values with Signed Quadword IndicesvscatterqpsHEvscatterqpsHIvscatterqpsHMcdqConvert Doubleword to Quadwordcltdvcvtne2ps2bf16XConvert with Nearest-Even rounding 2 Single-Precision FP vectors into BFloat16 FP vectorvcvtne2ps2bf16K9vcvtne2ps2bf16Kvcvtne2ps2bf16K:vcvtne2ps2bf16Kvcvtne2ps2bf16Q;vcvtne2ps2bf16Qvcvtne2ps2bf16K9vcvtne2ps2bf16Kvcvtne2ps2bf16K:vcvtne2ps2bf16Kvcvtne2ps2bf16Q;vcvtne2ps2bf16Q
vmaskmovpd>Conditional Move Packed Double-Precision Floating-Point Values
vmaskmovpd /
vmaskmovpd 2
vmaskmovpd /
vmaskmovpd 2
vzeroupper Zero Upper Bits of YMM Registers
vzeroupper vhsubpd$Packed Double-FP Horizontal Subtractvhsubpd vhsubpd /vhsubpd vhsubpd 2roundpd3Round Packed Double Precision Floating-Point Valuesroundpdroundpd/vcompresspdUStore Sparse Packed Double-Precision Floating-Point Values into Dense Memory/RegistervcompresspdHvcompresspdH0vcompresspdHvcompresspdH3vcompresspdHvcompresspdH6vcompresspdHvcompresspdHvcompresspdHvcompresspdH/vcompresspdH2vcompresspdH5	punpckhbw1Unpack and Interleave High-Order Bytes into Words	punpckhbw	punpckhbw+	punpckhbw	punpckhbw/vfmaddssCFused Multiply-Add of Scalar Single-Precision Floating-Point Valuesvfmaddss$vfmaddss$'vfmaddss$'aeskeygenassistAES Round Key Generation Assistaeskeygenassist'aeskeygenassist'/	vscalefsh[Scale Scalar Half-Precision Floating-Point Value With a Half-Precision Floating-Point Value	vscalefshR	vscalefshR$	vscalefshR	vscalefshR$	vscalefshRQ	vscalefshRQpmaxuw(Maximum of Packed Unsigned Word Integerspmaxuwpmaxuw/psubd#Subtract Packed Doubleword IntegerspsubdPSUBLpsubdPSUBL+psubdPSUBLpsubdPSUBL/stdSet Direction FlagstdSTDrsqrtssQCompute Reciprocal of Square Root of Scalar Single-Precision Floating-Point ValuersqrtssRSQRTSSrsqrtssRSQRTSS'vdivsh2Divide Scalar Half-Precision Floating-Point ValuesvdivshRvdivshR$vdivshRvdivshR$vdivshRQvdivshRQvsm3msg1?Perform Initial Calculation for the Next Four SM3 Message Wordsvsm3msg1vsm3msg1/vphsubdq?Packed Horizontal Subtract Signed Doubleword to Signed Quadwordvphsubdq"vphsubdq"/vsqrtphCCompute Square Roots of Packed Half-Precision Floating-Point ValuesvsqrtphK<vsqrtphK>vsqrtphR@vsqrtphKvsqrtphKvsqrtphRvsqrtphK<vsqrtphKvsqrtphK>vsqrtphKvsqrtphR@vsqrtphRvsqrtphRQvsqrtphRQ	vcvtqq2psQConvert Packed Quadword Integers to Packed Single-Precision Floating-Point Values
vcvtqq2psxJ=
vcvtqq2psyJ?	vcvtqq2psJA
vcvtqq2psxJ
vcvtqq2psyJ	vcvtqq2psJ
vcvtqq2psxJ=
vcvtqq2psyJ?
vcvtqq2psxJ
vcvtqq2psyJ	vcvtqq2psJA	vcvtqq2psJ	vcvtqq2psJQ	vcvtqq2psJQpsllq'Shift Packed Quadword Data Left Logicalpsllqpsllqpsllq+psllqpsllqpsllq/	vfcmulcshOFused Conjugate Multiply of Complex Scalar Half-Precision Floating-Point Values	vfcmulcshR	vfcmulcshR'	vfcmulcshR	vfcmulcshR'	vfcmulcshRQ	vfcmulcshRQvexp2psyApproximation to the Exponential 2^x of Packed Single-Precision Floating-Point Values with Less Than 2^-23 Relative Errorvexp2psM;vexp2psMvexp2psM;vexp2psMvexp2psMRvexp2psMRvfmaddsubpdXFused Multiply-Alternating Add/Subtract of Packed Double-Precision Floating-Point Valuesvfmaddsubpd$vfmaddsubpd$/vfmaddsubpd$/vfmaddsubpd$vfmaddsubpd$2vfmaddsubpd$2ldmxcsrLoad MXCSR Registerldmxcsr'vpaddusw:Add Packed Unsigned Word Integers with Unsigned SaturationvpadduswIvpadduswI/vpadduswIvpadduswI2vpadduswIvpadduswI5vpaddusw vpadduswIvpaddusw /vpadduswI/vpaddusw!vpadduswIvpaddusw!2vpadduswI2vpadduswIvpadduswI5
vpmaddubsw9Multiply and Add Packed Signed and Unsigned Byte Integers
vpmaddubswI
vpmaddubswI/
vpmaddubswI
vpmaddubswI2
vpmaddubswI
vpmaddubswI5
vpmaddubsw 
vpmaddubswI
vpmaddubsw /
vpmaddubswI/
vpmaddubsw!
vpmaddubswI
vpmaddubsw!2
vpmaddubswI2
vpmaddubswI
vpmaddubswI5vmovss2Move Scalar Single-Precision Floating-Point Values	vmovssH(vmovssH'vmovss 'vmovssH'vmovss 'vmovssH'vmovssHvmovss vmovssHvpmulhuw<Multiply Packed Unsigned Word Integers and Store High ResultvpmulhuwIvpmulhuwI/vpmulhuwIvpmulhuwI2vpmulhuwIvpmulhuwI5vpmulhuw vpmulhuwIvpmulhuw /vpmulhuwI/vpmulhuw!vpmulhuwIvpmulhuw!2vpmulhuwI2vpmulhuwIvpmulhuwI5mulps6Multiply Packed Single-Precision Floating-Point ValuesmulpsMULPSmulpsMULPS/vsqrtpsECompute Square Roots of Packed Single-Precision Floating-Point ValuesvsqrtpsH9vsqrtpsH:vsqrtpsH;vsqrtpsHvsqrtpsHvsqrtpsHvsqrtpsH9vsqrtps vsqrtpsHvsqrtps /vsqrtpsH:vsqrtps vsqrtpsHvsqrtps 2vsqrtpsH;vsqrtpsHvsqrtpsHQvsqrtpsHQ	vpmovzxbw>Move Packed Byte Integers to Word Integers with Zero Extension	vpmovzxbwI	vpmovzxbwI	vpmovzxbwI	vpmovzxbwI+	vpmovzxbwI/	vpmovzxbwI2	vpmovzxbw 	vpmovzxbwI	vpmovzxbw +	vpmovzxbwI+	vpmovzxbw!	vpmovzxbwI	vpmovzxbw!/	vpmovzxbwI/	vpmovzxbwI	vpmovzxbwI2vfmadd132ssCFused Multiply-Add of Scalar Single-Precision Floating-Point Valuesvfmadd132ssHvfmadd132ssH'vfmadd132ss#vfmadd132ssHvfmadd132ss#'vfmadd132ssH'vfmadd132ssHQvfmadd132ssHQ
punpcklqdq?Unpack and Interleave Low-Order Quadwords into Double Quadwords
punpcklqdq
PUNPCKLQDQ
punpcklqdq
PUNPCKLQDQ/sqrtsdCCompute Square Root of Scalar Double-Precision Floating-Point ValuesqrtsdSQRTSDsqrtsdSQRTSD+vandnpsHBitwise Logical AND NOT of Packed Single-Precision Floating-Point ValuesvandnpsJ9vandnpsJvandnpsJ:vandnpsJvandnpsJ;vandnpsJvandnpsJ9vandnps vandnpsJvandnps /vandnpsJ:vandnps vandnpsJvandnps 2vandnpsJ;vandnpsJvfnmadd213phJFused Negative Multiply-Add of Packed Half-Precision Floating-Point Valuesvfnmadd213phK<vfnmadd213phKvfnmadd213phK>vfnmadd213phKvfnmadd213phR@vfnmadd213phRvfnmadd213phK<vfnmadd213phKvfnmadd213phK>vfnmadd213phKvfnmadd213phR@vfnmadd213phRvfnmadd213phRQvfnmadd213phRQvpmovsqdODown Convert Packed Quadword Values to Doubleword Values with Signed SaturationvpmovsqdHvpmovsqdH,vpmovsqdHvpmovsqdH0vpmovsqdHvpmovsqdH3vpmovsqdHvpmovsqdHvpmovsqdHvpmovsqdH+vpmovsqdH/vpmovsqdH2vpsrlvd3Variable Shift Packed Doubleword Data Right LogicalvpsrlvdH9vpsrlvdHvpsrlvdH:vpsrlvdHvpsrlvdH;vpsrlvdHvpsrlvdH9vpsrlvd!vpsrlvdHvpsrlvd!/vpsrlvdH:vpsrlvd!vpsrlvdHvpsrlvd!2vpsrlvdH;vpsrlvdHvbroadcastf32x88Broadcast Eight Single-Precision Floating-Point Elementsvbroadcastf32x8J2vbroadcastf32x8J2
vgatherpf0dpsoSparse Prefetch Packed Single-Precision Floating-Point Data Values with Signed Doubleword Indices Using T0 Hint
vgatherpf0dpsLK
vpunpckhbw1Unpack and Interleave High-Order Bytes into Words
vpunpckhbwI
vpunpckhbwI/
vpunpckhbwI
vpunpckhbwI2
vpunpckhbwI
vpunpckhbwI5
vpunpckhbw 
vpunpckhbwI
vpunpckhbw /
vpunpckhbwI/
vpunpckhbw!
vpunpckhbwI
vpunpckhbw!2
vpunpckhbwI2
vpunpckhbwI
vpunpckhbwI5	vptestnmd=Logical NAND of Packed Doubleword Integer Values and Set Mask	vptestnmdH9	vptestnmdH9	vptestnmdH	vptestnmdH	vptestnmdH:	vptestnmdH:	vptestnmdH	vptestnmdH	vptestnmdH;	vptestnmdH;	vptestnmdH	vptestnmdHvpsubq!Subtract Packed Quadword IntegersvpsubqH=vpsubqHvpsubqH?vpsubqHvpsubqHAvpsubqHvpsubqH=vpsubq vpsubqHvpsubq /vpsubqH?vpsubq!vpsubqHvpsubq!2vpsubqHAvpsubqHvroundps3Round Packed Single Precision Floating-Point Valuesvroundps vroundps /vroundps vroundps 2vrsqrtshOCompute Reciprocal of Square Root of Scalar Half-Precision Floating-Point ValuevrsqrtshRvrsqrtshR$vrsqrtshRvrsqrtshR$sahfStore AH into FlagssahfSAHFvmaxps<Return Maximum Packed Single-Precision Floating-Point ValuesvmaxpsH9vmaxpsHvmaxpsH:vmaxpsHvmaxpsH;vmaxpsHvmaxpsH9vmaxps vmaxpsHvmaxps /vmaxpsH:vmaxps vmaxpsHvmaxps 2vmaxpsH;vmaxpsHvmaxpsHRvmaxpsHR
vcvttpd2qqaConvert with Truncation Packed Double-Precision Floating-Point Values to Packed Quadword Integers
vcvttpd2qqJ=
vcvttpd2qqJ?
vcvttpd2qqJA
vcvttpd2qqJ
vcvttpd2qqJ
vcvttpd2qqJ
vcvttpd2qqJ=
vcvttpd2qqJ
vcvttpd2qqJ?
vcvttpd2qqJ
vcvttpd2qqJA
vcvttpd2qqJ
vcvttpd2qqJR
vcvttpd2qqJRvucomishLUnordered Compare Scalar Half-Precision Floating-Point Values and Set EFLAGSvucomishRvucomishR$vucomishRR	vblendvpd= Variable Blend Packed Double Precision Floating-Point Values	vblendvpd 	vblendvpd /	vblendvpd 	vblendvpd 2
vextractps4Extract Packed Single Precision Floating-Point Value
vextractps 
vextractpsH
vextractps '
vextractpsH'	vcvtsd2ssLConvert Scalar Double-Precision FP Value to Scalar Single-Precision FP Value	vcvtsd2ssH	vcvtsd2ssH+	vcvtsd2ss 	vcvtsd2ssH	vcvtsd2ss +	vcvtsd2ssH+	vcvtsd2ssHQ	vcvtsd2ssHQmovhlps>Move Packed Single-Precision Floating-Point Values High to LowmovhlpsMOVHLPSvpblendwBlend Packed Wordsvpblendw vpblendw /vpblendw!vpblendw!2vfmsub132psHFused Multiply-Subtract of Packed Single-Precision Floating-Point Valuesvfmsub132psH9vfmsub132psHvfmsub132psH:vfmsub132psHvfmsub132psH;vfmsub132psHvfmsub132psH9vfmsub132ps#vfmsub132psHvfmsub132ps#/vfmsub132psH:vfmsub132ps#vfmsub132psHvfmsub132ps#2vfmsub132psH;vfmsub132psHvfmsub132psHQvfmsub132psHQvprorvd'Variable Rotate Packed Doubleword RightvprorvdH9vprorvdHvprorvdH:vprorvdHvprorvdH;vprorvdHvprorvdH9vprorvdHvprorvdH:vprorvdHvprorvdH;vprorvdHjlJump if less (SF != OF)jlJLTNjlJLTO	vreducepsQPerform Reduction Transformation on Packed Single-Precision Floating-Point Values	vreducepsJ9	vreducepsJ:	vreducepsJ;	vreducepsJ	vreducepsJ	vreducepsJ	vreducepsJ9	vreducepsJ	vreducepsJ:	vreducepsJ	vreducepsJ;	vreducepsJcmcComplement Carry FlagcmcCMCvucomissNUnordered Compare Scalar Single-Precision Floating-Point Values and Set EFLAGSvucomiss vucomissHvucomiss 'vucomissH'vucomissHR
vextractf32x8AExtract 256 Bits of Packed Single-Precision Floating-Point Values
vextractf32x8J
vextractf32x8J3
vextractf32x8J
vextractf32x8J2vpcmpgtd:Compare Packed Signed Doubleword Integers for Greater ThanvpcmpgtdH9vpcmpgtdH9vpcmpgtdHvpcmpgtdHvpcmpgtdH:vpcmpgtdH:vpcmpgtdHvpcmpgtdHvpcmpgtdH;vpcmpgtdH;vpcmpgtdHvpcmpgtdHvpcmpgtd vpcmpgtd /vpcmpgtd!vpcmpgtd!2vcmpsd5Compare Scalar Double-Precision Floating-Point ValuesvcmpsdHvcmpsdHvcmpsdH+vcmpsdH+vcmpsd vcmpsd +vcmpsdHRvcmpsdHR	vcvtdq2ph@Convert Packed Dword Integers to Packed Half-Precision FP Values
vcvtdq2phxK9
vcvtdq2phyK:	vcvtdq2phR;
vcvtdq2phxK
vcvtdq2phyK	vcvtdq2phR
vcvtdq2phxK9
vcvtdq2phyK:
vcvtdq2phxK
vcvtdq2phyK	vcvtdq2phR;	vcvtdq2phR	vcvtdq2phRQ	vcvtdq2phRQ
vpmadcsswdOPacked Multiply Add Accumulate with Saturation Signed Word to Signed Doubleword
vpmadcsswd"
vpmadcsswd"/vfmsubadd231pdXFused Multiply-Alternating Subtract/Add of Packed Double-Precision Floating-Point Valuesvfmsubadd231pdH=vfmsubadd231pdHvfmsubadd231pdH?vfmsubadd231pdHvfmsubadd231pdHAvfmsubadd231pdHvfmsubadd231pdH=vfmsubadd231pd#vfmsubadd231pdHvfmsubadd231pd#/vfmsubadd231pdH?vfmsubadd231pd#vfmsubadd231pdHvfmsubadd231pd#2vfmsubadd231pdHAvfmsubadd231pdHvfmsubadd231pdHQvfmsubadd231pdHQpsrld*Shift Packed Doubleword Data Right Logicalpsrldpsrldpsrld+psrldpsrldpsrld/vmovntpdKStore Packed Double-Precision Floating-Point Values Using Non-Temporal Hintvmovntpd /vmovntpdH/vmovntpd 2vmovntpdH2vmovntpdH5vpsraq+Shift Packed Quadword Data Right ArithmeticvpsraqH=vpsraqH?vpsraqHAvpsraqHvpsraqHvpsraqH/vpsraqHvpsraqHvpsraqH/vpsraqHvpsraqHvpsraqH/vpsraqH=vpsraqHvpsraqHvpsraqH/vpsraqH?vpsraqHvpsraqHvpsraqH/vpsraqHAvpsraqHvpsraqHvpsraqH/vaddsh/Add Scalar Half-Precision Floating-Point ValuesvaddshRvaddshR$vaddshRvaddshR$vaddshRQvaddshRQ
vcvttph2dqPConvert with Truncation Packed Half-Precision FP Values to Packed Dword Integers
vcvttph2dqK.
vcvttph2dqK<
vcvttph2dqR>
vcvttph2dqK
vcvttph2dqK
vcvttph2dqR
vcvttph2dqK.
vcvttph2dqK
vcvttph2dqK<
vcvttph2dqK
vcvttph2dqR>
vcvttph2dqR
vcvttph2dqRR
vcvttph2dqRRpcmpgtw4Compare Packed Signed Word Integers for Greater Thanpcmpgtwpcmpgtw+pcmpgtwpcmpgtw/	vscalefpd_Scale Packed Double-Precision Floating-Point Values With Double-Precision Floating-Point Values	vscalefpdH=	vscalefpdH	vscalefpdH?	vscalefpdH	vscalefpdHA	vscalefpdH	vscalefpdH=	vscalefpdH	vscalefpdH?	vscalefpdH	vscalefpdHA	vscalefpdH	vscalefpdHQ	vscalefpdHQdivps4Divide Packed Single-Precision Floating-Point ValuesdivpsDIVPSdivpsDIVPS/ktestq#Bit Test 64-bit Masks and Set FlagsktestqI	vcvtpd2phLConvert Packed Double-Precision FP Values to Packed Half-Precision FP Values
vcvtpd2phxK=
vcvtpd2phyK?
vcvtpd2phzRA
vcvtpd2phxK
vcvtpd2phyK
vcvtpd2phzR
vcvtpd2phxK=
vcvtpd2phyK?
vcvtpd2phzRA
vcvtpd2phxK
vcvtpd2phyK
vcvtpd2phzR
vcvtpd2phzRQ
vcvtpd2phzRQ
vcvttph2qqlConvert with Truncation Packed Half Precision Floating-Point Values to Packed Singed Quadword Integer Values
vcvttph2qqK*
vcvttph2qqK.
vcvttph2qqR<
vcvttph2qqK
vcvttph2qqK
vcvttph2qqR
vcvttph2qqK*
vcvttph2qqK
vcvttph2qqK.
vcvttph2qqK
vcvttph2qqR<
vcvttph2qqR
vcvttph2qqRR
vcvttph2qqRRvfnmadd213sdLFused Negative Multiply-Add of Scalar Double-Precision Floating-Point Valuesvfnmadd213sdHvfnmadd213sdH+vfnmadd213sd#vfnmadd213sdHvfnmadd213sd#+vfnmadd213sdH+vfnmadd213sdHQvfnmadd213sdHQ
vgatherdpsTGather Packed Single-Precision Floating-Point Values Using Signed Doubleword Indices
vgatherdpsHB
vgatherdpsHF
vgatherdpsHJ
vgatherdps!B
vgatherdps!F	punpcklwd6Unpack and Interleave Low-Order Words into Doublewords	punpcklwd	punpcklwd'	punpcklwd	punpcklwd/vfnmadd213pdLFused Negative Multiply-Add of Packed Double-Precision Floating-Point Valuesvfnmadd213pdH=vfnmadd213pdHvfnmadd213pdH?vfnmadd213pdHvfnmadd213pdHAvfnmadd213pdHvfnmadd213pdH=vfnmadd213pd#vfnmadd213pdHvfnmadd213pd#/vfnmadd213pdH?vfnmadd213pd#vfnmadd213pdHvfnmadd213pd#2vfnmadd213pdHAvfnmadd213pdHvfnmadd213pdHQvfnmadd213pdHQvcvttpd2uqqjConvert with Truncation Packed Double-Precision Floating-Point Values to Packed Unsigned Quadword Integersvcvttpd2uqqJ=vcvttpd2uqqJ?vcvttpd2uqqJAvcvttpd2uqqJvcvttpd2uqqJvcvttpd2uqqJvcvttpd2uqqJ=vcvttpd2uqqJvcvttpd2uqqJ?vcvttpd2uqqJvcvttpd2uqqJAvcvttpd2uqqJvcvttpd2uqqJRvcvttpd2uqqJRvcvtw2phKConvert Packed Word Integers to Packed Half-Precision Floating-Point Valuesvcvtw2phK<vcvtw2phK>vcvtw2phR@vcvtw2phKvcvtw2phKvcvtw2phRvcvtw2phK<vcvtw2phKvcvtw2phK>vcvtw2phKvcvtw2phR@vcvtw2phRvcvtw2phRQvcvtw2phRQpcmpeqd+Compare Packed Doubleword Data for Equalitypcmpeqdpcmpeqd+pcmpeqdpcmpeqd/lddquLoad Unaligned Integer 128 Bitslddqu/pabsd,Packed Absolute Value of Doubleword Integerspabsdpabsd+pabsdpabsd/	vreduceshPPerform Reduction Transformation on a Scalar Half-Precision Floating-Point Value	vreduceshR	vreduceshR$	vreduceshR	vreduceshR$	vreduceshRR	vreduceshRRvpcmpuq'Compare Packed Unsigned Quadword ValuesvpcmpuqH=vpcmpuqH=vpcmpuqHvpcmpuqHvpcmpuqH?vpcmpuqH?vpcmpuqHvpcmpuqHvpcmpuqHAvpcmpuqHAvpcmpuqHvpcmpuqHvmulps6Multiply Packed Single-Precision Floating-Point ValuesvmulpsH9vmulpsHvmulpsH:vmulpsHvmulpsH;vmulpsHvmulpsH9vmulps vmulpsHvmulps /vmulpsH:vmulps vmulpsHvmulps 2vmulpsH;vmulpsHvmulpsHQvmulpsHQvpmaxsw&Maximum of Packed Signed Word IntegersvpmaxswIvpmaxswI/vpmaxswIvpmaxswI2vpmaxswIvpmaxswI5vpmaxsw vpmaxswIvpmaxsw /vpmaxswI/vpmaxsw!vpmaxswIvpmaxsw!2vpmaxswI2vpmaxswIvpmaxswI5psignd"Packed Sign of Doubleword Integerspsigndpsignd+psigndpsignd/
vcvtuqq2pdZConvert Packed Unsigned Quadword Integers to Packed Double-Precision Floating-Point Values
vcvtuqq2pdJ=
vcvtuqq2pdJ?
vcvtuqq2pdJA
vcvtuqq2pdJ
vcvtuqq2pdJ
vcvtuqq2pdJ
vcvtuqq2pdJ=
vcvtuqq2pdJ
vcvtuqq2pdJ?
vcvtuqq2pdJ
vcvtuqq2pdJA
vcvtuqq2pdJ
vcvtuqq2pdJQ
vcvtuqq2pdJQknotqNOT 64-bit Mask RegisterknotqIkshiftrqShift Right 64-bit MaskskshiftrqIcmovoMove if overflow (OF == 1)cmovowcmovow$cmovolcmovol'vaesimc+Perform the AES InvMixColumn Transformationvaesimc vaesimc /cvtsd2si3Convert Scalar Double-Precision FP Value to Integercvtsd2siCVTSD2SLcvtsd2siCVTSD2SL+cvtsd2ssLConvert Scalar Double-Precision FP Value to Scalar Single-Precision FP Valuecvtsd2ssCVTSD2SScvtsd2ssCVTSD2SS+vfmsubadd132phVFused Multiply-Alternating Subtract/Add of Packed Half-Precision Floating-Point Valuesvfmsubadd132phK<vfmsubadd132phKvfmsubadd132phK>vfmsubadd132phKvfmsubadd132phR@vfmsubadd132phRvfmsubadd132phK<vfmsubadd132phKvfmsubadd132phK>vfmsubadd132phKvfmsubadd132phR@vfmsubadd132phRvfmsubadd132phRQvfmsubadd132phRQpfrsqrtvAPacked Floating-Point Reciprocal Square Root Approximation Vectorpfrsqrtvpfrsqrtv+
vcvttpd2dqRConvert with Truncation Packed Double-Precision FP Values to Packed Dword Integersvcvttpd2dqxH=vcvttpd2dqyH?
vcvttpd2dqHAvcvttpd2dqxHvcvttpd2dqyH
vcvttpd2dqHvcvttpd2dqxH=vcvttpd2dqyH?vcvttpd2dqx vcvttpd2dqxHvcvttpd2dqy vcvttpd2dqyHvcvttpd2dqx /vcvttpd2dqy 2
vcvttpd2dqHA
vcvttpd2dqH
vcvttpd2dqHR
vcvttpd2dqHRvroundpd3Round Packed Double Precision Floating-Point Valuesvroundpd vroundpd /vroundpd vroundpd 2vhaddpdPacked Double-FP Horizontal Addvhaddpd vhaddpd /vhaddpd vhaddpd 2vpsravd6Variable Shift Packed Doubleword Data Right ArithmeticvpsravdH9vpsravdHvpsravdH:vpsravdHvpsravdH;vpsravdHvpsravdH9vpsravd!vpsravdHvpsravd!/vpsravdH:vpsravd!vpsravdHvpsravd!2vpsravdH;vpsravdHaadASCII Adjust AX Before DivisionaadAADaadAADvinsertf64x2@Insert 128 Bits of Packed Double-Precision Floating-Point Valuesvinsertf64x2Jvinsertf64x2J/vinsertf64x2Jvinsertf64x2J/vinsertf64x2Jvinsertf64x2J/vinsertf64x2Jvinsertf64x2J/vfmsub231ssHFused Multiply-Subtract of Scalar Single-Precision Floating-Point Valuesvfmsub231ssHvfmsub231ssH'vfmsub231ss#vfmsub231ssHvfmsub231ss#'vfmsub231ssH'vfmsub231ssHQvfmsub231ssHQ	vpexpanddGLoad Sparse Packed Doubleword Integer Values from Dense Memory/Register	vpexpanddH	vpexpanddH	vpexpanddH	vpexpanddH/	vpexpanddH2	vpexpanddH5	vpexpanddH	vpexpanddH/	vpexpanddH	vpexpanddH2	vpexpanddH	vpexpanddH5vfnmsub132shOFused Negative Multiply-Subtract of Scalar Half-Precision Floating-Point Valuesvfnmsub132shRvfnmsub132shR$vfnmsub132shRvfnmsub132shR$vfnmsub132shRQvfnmsub132shRQvfmsubaddpsXFused Multiply-Alternating Subtract/Add of Packed Single-Precision Floating-Point Valuesvfmsubaddps$vfmsubaddps$/vfmsubaddps$/vfmsubaddps$vfmsubaddps$2vfmsubaddps$2sarx.Arithmetic Shift Right Without Affecting Flagssarxl5sarxl5'seta'Set byte if above (CF == 0 and ZF == 0)setaSETHI	setaSETHI#vaeskeygenassistAES Round Key Generation Assistvaeskeygenassist vaeskeygenassist /pmaxsd,Maximum of Packed Signed Doubleword Integerspmaxsdpmaxsd/vcvttsd2usiXConvert with Truncation Scalar Double-Precision Floating-Point Value to Unsigned Integervcvttsd2usiHvcvttsd2usiH+vcvttsd2usiHRblendvps= Variable Blend Packed Single Precision Floating-Point Valuesblendvpsblendvps/	cvttps2dqRConvert with Truncation Packed Single-Precision FP Values to Packed Dword Integers	cvttps2dq	cvttps2dq/kunpckdq"Unpack and Interleave 32-bit MaskskunpckdqI
vextracti64x42Extract 256 Bits of Packed Quadword Integer Values
vextracti64x4H
vextracti64x4H3
vextracti64x4H
vextracti64x4H2vpmovd2m9Move Signs of Packed Doubleword Integers to Mask Registervpmovd2mJvpmovd2mJvpmovd2mJ	vreducepdQPerform Reduction Transformation on Packed Double-Precision Floating-Point Values	vreducepdJ=	vreducepdJ?	vreducepdJA	vreducepdJ	vreducepdJ	vreducepdJ	vreducepdJ=	vreducepdJ	vreducepdJ?	vreducepdJ	vreducepdJA	vreducepdJkorwBitwise Logical OR 16-bit MaskskorwHvpabsq*Packed Absolute Value of Quadword IntegersvpabsqH=vpabsqH?vpabsqHAvpabsqHvpabsqHvpabsqHvpabsqH=vpabsqHvpabsqH?vpabsqHvpabsqHAvpabsqHsetna*Set byte if not above (CF == 1 or ZF == 1)setnaSETLS	setnaSETLS#vfnmadd132phJFused Negative Multiply-Add of Packed Half-Precision Floating-Point Valuesvfnmadd132phK<vfnmadd132phKvfnmadd132phK>vfnmadd132phKvfnmadd132phR@vfnmadd132phRvfnmadd132phK<vfnmadd132phKvfnmadd132phK>vfnmadd132phKvfnmadd132phR@vfnmadd132phRvfnmadd132phRQvfnmadd132phRQpcmpeqq)Compare Packed Quadword Data for Equalitypcmpeqqpcmpeqq/	vdpbf16psLPacked Dot Product of BFloat16 FP subvectors into Single-Precision FP values	vdpbf16psK9	vdpbf16psK	vdpbf16psK:	vdpbf16psK	vdpbf16psQ;	vdpbf16psQ	vdpbf16psK9	vdpbf16psK	vdpbf16psK:	vdpbf16psK	vdpbf16psQ;	vdpbf16psQrcrRotate Right through Carry FlagrcrbRCRB	rcrbRCRB	rcrbRCRB	rcrwRCRWrcrwRCRWrcrwRCRWrcrlRCRLrcrlRCRLrcrlRCRLrcrbRCRB#rcrbRCRB#rcrbRCRB#rcrwRCRW$rcrwRCRW$rcrwRCRW$rcrlRCRL'rcrlRCRL'rcrlRCRL'cvtpd2dqBConvert Packed Double-Precision FP Values to Packed Dword Integerscvtpd2dqcvtpd2dq/psrldq*Shift Packed Double Quadword Right Logicalpsrldq
vcvttsd2siJConvert with Truncation Scalar Double-Precision FP Value to Signed Integer
vcvttsd2si 
vcvttsd2siH
vcvttsd2si +
vcvttsd2siH+
vcvttsd2siHRvpaddusb:Add Packed Unsigned Byte Integers with Unsigned SaturationvpaddusbIvpaddusbI/vpaddusbIvpaddusbI2vpaddusbIvpaddusbI5vpaddusb vpaddusbIvpaddusb /vpaddusbI/vpaddusb!vpaddusbIvpaddusb!2vpaddusbI2vpaddusbIvpaddusbI5clzeroZero-out 64-bit Cache Lineclzero?pminuw(Minimum of Packed Unsigned Word Integerspminuwpminuw/vfmsubadd231phVFused Multiply-Alternating Subtract/Add of Packed Half-Precision Floating-Point Valuesvfmsubadd231phK<vfmsubadd231phKvfmsubadd231phK>vfmsubadd231phKvfmsubadd231phR@vfmsubadd231phRvfmsubadd231phK<vfmsubadd231phKvfmsubadd231phK>vfmsubadd231phKvfmsubadd231phR@vfmsubadd231phRvfmsubadd231phRQvfmsubadd231phRQ
vgatherpf0qpdmSparse Prefetch Packed Double-Precision Floating-Point Data Values with Signed Quadword Indices Using T0 Hint
vgatherpf0qpdLMvpopcntd/Packed Population Count for Doubleword IntegersvpopcntdK9vpopcntdK:vpopcntdP;vpopcntdKvpopcntdKvpopcntdPvpopcntdK9vpopcntdKvpopcntdK:vpopcntdKvpopcntdP;vpopcntdPseteSet byte if equal (ZF == 1)seteSETEQ	seteSETEQ#vpshldvdBConcatenate and Variable Shift Packed Doubleword Data Left LogicalvpshldvdK9vpshldvdKvpshldvdK:vpshldvdKvpshldvdU;vpshldvdUvpshldvdK9vpshldvdKvpshldvdK:vpshldvdKvpshldvdU;vpshldvdUvfmaddsub231pdXFused Multiply-Alternating Add/Subtract of Packed Double-Precision Floating-Point Valuesvfmaddsub231pdH=vfmaddsub231pdHvfmaddsub231pdH?vfmaddsub231pdHvfmaddsub231pdHAvfmaddsub231pdHvfmaddsub231pdH=vfmaddsub231pd#vfmaddsub231pdHvfmaddsub231pd#/vfmaddsub231pdH?vfmaddsub231pd#vfmaddsub231pdHvfmaddsub231pd#2vfmaddsub231pdHAvfmaddsub231pdHvfmaddsub231pdHQvfmaddsub231pdHQvorpd<Bitwise Logical OR of Double-Precision Floating-Point ValuesvorpdJ=vorpdJvorpdJ?vorpdJvorpdJAvorpdJvorpdJ=vorpd vorpdJvorpd /vorpdJ?vorpd vorpdJvorpd 2vorpdJAvorpdJvscatterpf1qpd�Sparse Prefetch Packed Double-Precision Floating-Point Data Values with Signed Quadword Indices Using T1 Hint with Intent to Writevscatterpf1qpdLM
prefetcht2'Prefetch Data Into Caches using T2 Hint
prefetcht2
PREFETCHT2
#	vpackuswb.Pack Words into Bytes with Unsigned Saturation	vpackuswbI	vpackuswbI/	vpackuswbI	vpackuswbI2	vpackuswbI	vpackuswbI5	vpackuswb 	vpackuswbI	vpackuswb /	vpackuswbI/	vpackuswb!	vpackuswbI	vpackuswb!2	vpackuswbI2	vpackuswbI	vpackuswbI5psadbw#Compute Sum of Absolute DifferencespsadbwPSADBW
psadbwPSADBW
+psadbwPSADBWpsadbwPSADBW/vpminsq*Minimum of Packed Signed Quadword IntegersvpminsqH=vpminsqHvpminsqH?vpminsqHvpminsqHAvpminsqHvpminsqH=vpminsqHvpminsqH?vpminsqHvpminsqHAvpminsqHkxnorb Bitwise Logical XNOR 8-bit MaskskxnorbJcmovnzMove if not zero (ZF == 0)cmovnzwcmovnzw$cmovnzlcmovnzl'pextrwExtract Wordpextrw
pextrwpextrw$bzhi3Zero High Bits Starting with Specified Bit Positionbzhil5bzhil5'jmpJump UnconditionallyjmpJMPNjmpJMPOjmplJMPjmplJMP'setnbSet byte if not below (CF == 0)setnbSETCC	setnbSETCC#jsJump if sign (SF == 1)jsJMINjsJMIOumwaitUser mode Monitor WaitumwaitGvaddss1Add Scalar Single-Precision Floating-Point ValuesvaddssHvaddssH'vaddss vaddssHvaddss 'vaddssH'vaddssHQvaddssHQvpcmpgtb4Compare Packed Signed Byte Integers for Greater ThanvpcmpgtbIvpcmpgtbIvpcmpgtbI/vpcmpgtbI/vpcmpgtbIvpcmpgtbIvpcmpgtbI2vpcmpgtbI2vpcmpgtbIvpcmpgtbIvpcmpgtbI5vpcmpgtbI5vpcmpgtb vpcmpgtb /vpcmpgtb!vpcmpgtb!2sqrtpsECompute Square Roots of Packed Single-Precision Floating-Point ValuessqrtpsSQRTPSsqrtpsSQRTPS/vpcmpw!Compare Packed Signed Word ValuesvpcmpwIvpcmpwIvpcmpwI/vpcmpwI/vpcmpwIvpcmpwIvpcmpwI2vpcmpwI2vpcmpwIvpcmpwIvpcmpwI5vpcmpwI5vsqrtpdECompute Square Roots of Packed Double-Precision Floating-Point ValuesvsqrtpdH9vsqrtpdH:vsqrtpdHAvsqrtpdHvsqrtpdHvsqrtpdHvsqrtpdH9vsqrtpd vsqrtpdHvsqrtpd /vsqrtpdH:vsqrtpd vsqrtpdHvsqrtpd 2vsqrtpdHAvsqrtpdHvsqrtpdHQvsqrtpdHQvplzcntdBCount the Number of Leading Zero Bits for Packed Doubleword ValuesvplzcntdN9vplzcntdN:vplzcntdN;vplzcntdNvplzcntdNvplzcntdNvplzcntdN9vplzcntdNvplzcntdN:vplzcntdNvplzcntdN;vplzcntdNshufpd5Shuffle Packed Double-Precision Floating-Point Valuesshufpdshufpd/vpermps0Permute Single-Precision Floating-Point Elements
vpermpsH:vpermpsHvpermpsH;vpermpsHvpermpsH:vpermps!vpermpsHvpermps!2vpermpsH;vpermpsHvpxorq8Bitwise Logical Exclusive OR of Packed Quadword IntegersvpxorqH=vpxorqHvpxorqH?vpxorqHvpxorqHAvpxorqHvpxorqH=vpxorqHvpxorqH?vpxorqHvpxorqHAvpxorqHvpshlwPacked Shift Logical Wordsvpshlw"vpshlw"/vpshlw"/vpcomuw%Compare Packed Unsigned Word Integersvpcomuw"vpcomuw"/vfmsubadd132psXFused Multiply-Alternating Subtract/Add of Packed Single-Precision Floating-Point Valuesvfmsubadd132psH9vfmsubadd132psHvfmsubadd132psH:vfmsubadd132psHvfmsubadd132psH;vfmsubadd132psHvfmsubadd132psH9vfmsubadd132ps#vfmsubadd132psHvfmsubadd132ps#/vfmsubadd132psH:vfmsubadd132ps#vfmsubadd132psHvfmsubadd132ps#2vfmsubadd132psH;vfmsubadd132psHvfmsubadd132psHQvfmsubadd132psHQvpdpwusdHPacked Dot Product of Unsigned-by-Signed Word subvectors into DoublewordvpdpwusdYvpdpwusdY/vpdpwusdYvpdpwusdY2paddbAdd Packed Byte Integerspaddbpaddb+paddbpaddb/aesenc+Perform One Round of an AES Encryption FlowaesencAESENC'aesencAESENC'/cvtss2si9Convert Scalar Single-Precision FP Value to Dword Integercvtss2siCVTSS2SLcvtss2siCVTSS2SL'	vcvtsd2si3Convert Scalar Double-Precision FP Value to Integer	vcvtsd2si 	vcvtsd2siH	vcvtsd2si +	vcvtsd2siH+	vcvtsd2siHQvfmsubadd213psXFused Multiply-Alternating Subtract/Add of Packed Single-Precision Floating-Point Valuesvfmsubadd213psH9vfmsubadd213psHvfmsubadd213psH:vfmsubadd213psHvfmsubadd213psH;vfmsubadd213psHvfmsubadd213psH9vfmsubadd213ps#vfmsubadd213psHvfmsubadd213ps#/vfmsubadd213psH:vfmsubadd213ps#vfmsubadd213psHvfmsubadd213ps#2vfmsubadd213psH;vfmsubadd213psHvfmsubadd213psHQvfmsubadd213psHQpaddqAdd Packed Quadword IntegerspaddqPADDQpaddqPADDQ+paddqPADDQpaddqPADDQ/	vfnmsubssQFused Negative Multiply-Subtract of Scalar Single-Precision Floating-Point Values	vfnmsubss$	vfnmsubss$'	vfnmsubss$'pmovsxbw>Move Packed Byte Integers to Word Integers with Sign Extensionpmovsxbwpmovsxbw+cvtsi2ss9Convert Dword Integer to Scalar Single-Precision FP Value	cvtsi2sslCVTSL2SS	cvtsi2sslCVTSL2SS'vfixupimmpd<Fix Up Special Packed Double-Precision Floating-Point ValuesvfixupimmpdH=vfixupimmpdHvfixupimmpdH?vfixupimmpdHvfixupimmpdHAvfixupimmpdHvfixupimmpdH=vfixupimmpdHvfixupimmpdH?vfixupimmpdHvfixupimmpdHAvfixupimmpdHvfixupimmpdHRvfixupimmpdHRvpandnd5Bitwise Logical AND NOT of Packed Doubleword IntegersvpandndH9vpandndHvpandndH:vpandndHvpandndH;vpandndHvpandndH9vpandndHvpandndH:vpandndHvpandndH;vpandndHvpshlbPacked Shift Logical Bytesvpshlb"vpshlb"/vpshlb"/vpsllvd2Variable Shift Packed Doubleword Data Left LogicalvpsllvdH9vpsllvdHvpsllvdH:vpsllvdHvpsllvdH;vpsllvdHvpsllvdH9vpsllvd!vpsllvdHvpsllvd!/vpsllvdH:vpsllvd!vpsllvdHvpsllvd!2vpsllvdH;vpsllvdHvpcmpeqq)Compare Packed Quadword Data for EqualityvpcmpeqqH=vpcmpeqqH=vpcmpeqqHvpcmpeqqHvpcmpeqqH?vpcmpeqqH?vpcmpeqqHvpcmpeqqHvpcmpeqqHAvpcmpeqqHAvpcmpeqqHvpcmpeqqHvpcmpeqq vpcmpeqq /vpcmpeqq!vpcmpeqq!2	vpmovzxbdDMove Packed Byte Integers to Doubleword Integers with Zero Extension	vpmovzxbdH	vpmovzxbdH	vpmovzxbdH	vpmovzxbdH'	vpmovzxbdH+	vpmovzxbdH/	vpmovzxbd 	vpmovzxbdH	vpmovzxbd '	vpmovzxbdH'	vpmovzxbd!	vpmovzxbdH	vpmovzxbd!+	vpmovzxbdH+	vpmovzxbdH	vpmovzxbdH/cmovsMove if sign (SF == 1)cmovswcmovsw$cmovslcmovsl'pinsrwInsert Wordpinsrw
pinsrw
$pinsrwpinsrw$vfnmadd213psLFused Negative Multiply-Add of Packed Single-Precision Floating-Point Valuesvfnmadd213psH9vfnmadd213psHvfnmadd213psH:vfnmadd213psHvfnmadd213psH;vfnmadd213psHvfnmadd213psH9vfnmadd213ps#vfnmadd213psHvfnmadd213ps#/vfnmadd213psH:vfnmadd213ps#vfnmadd213psHvfnmadd213ps#2vfnmadd213psH;vfnmadd213psHvfnmadd213psHQvfnmadd213psHQ	vgetexpsheExtract Exponent of Scalar Half-Precision Floating-Point Value as Half-Precision Floating-Point Value	vgetexpshR	vgetexpshR$	vgetexpshR	vgetexpshR$	vgetexpshRR	vgetexpshRRnegTwo's Complement NegationnegbNEGB	negwNEGWneglNEGLnegbNEGB#negwNEGW$neglNEGL'pmulhrw!Packed Multiply High Rounded Wordpmulhrwpmulhrw+
vextractf32x4AExtract 128 Bits of Packed Single-Precision Floating-Point Values
vextractf32x4H
vextractf32x4H0
vextractf32x4H
vextractf32x4H0
vextractf32x4H
vextractf32x4H
vextractf32x4H/
vextractf32x4H/imulSigned MultiplyimulbIMULB	imulwIMULWimullIMULLimulbIMULB#imulwIMULW$imullIMULL'imulwIMULWimulwIMULW$imullIMULLimullIMULL'imulwimulwimulw$imulw$imullimullimull'imull'kaddwADD Two 16-bit MaskskaddwJ
vpermil2ps:Permute Two-Source Single-Precision Floating-Point Vectors
vpermil2ps"
vpermil2ps"/
vpermil2ps"/
vpermil2ps"
vpermil2ps"2
vpermil2ps"2blendvpd= Variable Blend Packed Double Precision Floating-Point Valuesblendvpdblendvpd/vpmovqdHDown Convert Packed Quadword Values to Doubleword Values with TruncationvpmovqdHvpmovqdH,vpmovqdHvpmovqdH0vpmovqdHvpmovqdH3vpmovqdHvpmovqdHvpmovqdHvpmovqdH+vpmovqdH/vpmovqdH2vpsubwSubtract Packed Word IntegersvpsubwIvpsubwI/vpsubwIvpsubwI2vpsubwIvpsubwI5vpsubw vpsubwIvpsubw /vpsubwI/vpsubw!vpsubwIvpsubw!2vpsubwI2vpsubwIvpsubwI5vrndscaless]Round Scalar Single-Precision Floating-Point Value To Include A Given Number Of Fraction BitsvrndscalessHvrndscalessH'vrndscalessHvrndscalessH'vrndscalessHRvrndscalessHR	vpdpbsudsXPacked Dot Product of Signed-by-Unsinged Byte subvectors into Doubleword with Saturation	vpdpbsudsX	vpdpbsudsX/	vpdpbsudsX	vpdpbsudsX2
vrsqrt14ps`Compute Approximate Reciprocals of Square Roots of Packed Single-Precision Floating-Point Values
vrsqrt14psH9
vrsqrt14psH:
vrsqrt14psH;
vrsqrt14psH
vrsqrt14psH
vrsqrt14psH
vrsqrt14psH9
vrsqrt14psH
vrsqrt14psH:
vrsqrt14psH
vrsqrt14psH;
vrsqrt14psH	vptestnmb7Logical NAND of Packed Byte Integer Values and Set Mask	vptestnmbI	vptestnmbI	vptestnmbI/	vptestnmbI/	vptestnmbI	vptestnmbI	vptestnmbI2	vptestnmbI2	vptestnmbI	vptestnmbI	vptestnmbI5	vptestnmbI5
punpckhqdq@Unpack and Interleave High-Order Quadwords into Double Quadwords
punpckhqdq
PUNPCKHQDQ
punpckhqdq
PUNPCKHQDQ/vfmsubadd231psXFused Multiply-Alternating Subtract/Add of Packed Single-Precision Floating-Point Valuesvfmsubadd231psH9vfmsubadd231psHvfmsubadd231psH:vfmsubadd231psHvfmsubadd231psH;vfmsubadd231psHvfmsubadd231psH9vfmsubadd231ps#vfmsubadd231psHvfmsubadd231ps#/vfmsubadd231psH:vfmsubadd231ps#vfmsubadd231psHvfmsubadd231ps#2vfmsubadd231psH;vfmsubadd231psHvfmsubadd231psHQvfmsubadd231psHQvpminuq,Minimum of Packed Unsigned Quadword IntegersvpminuqH=vpminuqHvpminuqH?vpminuqHvpminuqHAvpminuqHvpminuqH=vpminuqHvpminuqH?vpminuqHvpminuqHAvpminuqHaesdec+Perform One Round of an AES Decryption Flowaesdec'aesdec'/vpandnPacked Bitwise Logical AND NOTvpandn vpandn /vpandn!vpandn!2	vunpckhpsHUnpack and Interleave High Packed Single-Precision Floating-Point Values	vunpckhpsH9	vunpckhpsH	vunpckhpsH:	vunpckhpsH	vunpckhpsH;	vunpckhpsH	vunpckhpsH9	vunpckhps 	vunpckhpsH	vunpckhps /	vunpckhpsH:	vunpckhps 	vunpckhpsH	vunpckhps 2	vunpckhpsH;	vunpckhpsHvpmovwb>Down Convert Packed Word Values to Byte Values with TruncationvpmovwbIvpmovwbI,vpmovwbIvpmovwbI0vpmovwbIvpmovwbI3vpmovwbIvpmovwbIvpmovwbIvpmovwbI+vpmovwbI/vpmovwbI2lfence
Load FencelfenceLFENCE	vcvtph2ps>Convert Half-Precision FP Values to Single-Precision FP Values	vcvtph2psH	vcvtph2psH	vcvtph2psH	vcvtph2psH+	vcvtph2psH/	vcvtph2psH2	vcvtph2ps%	vcvtph2psH	vcvtph2ps%+	vcvtph2psH+	vcvtph2ps%	vcvtph2psH	vcvtph2ps%/	vcvtph2psH/	vcvtph2psH	vcvtph2psH2	vcvtph2psHR	vcvtph2psHRpmovsxwdDMove Packed Word Integers to Doubleword Integers with Sign Extensionpmovsxwdpmovsxwd+vpcompressqFStore Sparse Packed Quadword Integer Values into Dense Memory/RegistervpcompressqHvpcompressqH0vpcompressqHvpcompressqH3vpcompressqHvpcompressqH6vpcompressqHvpcompressqHvpcompressqHvpcompressqH/vpcompressqH2vpcompressqH5	vpmovusqbKDown Convert Packed Quadword Values to Byte Values with Unsigned Saturation	vpmovusqbH	vpmovusqbH%	vpmovusqbH	vpmovusqbH(	vpmovusqbH	vpmovusqbH,	vpmovusqbH	vpmovusqbH	vpmovusqbH	vpmovusqbH$	vpmovusqbH'	vpmovusqbH+rdseedRead Random SEEDrdseed+rdseed+unpckhpdHUnpack and Interleave High Packed Double-Precision Floating-Point ValuesunpckhpdUNPCKHPDunpckhpdUNPCKHPD/pi2fw0Packed Integer to Floating-Point Word Conversionpi2fwpi2fw+movq
Move QuadwordmovqMOVQmovqMOVQ+movqMOVQmovqMOVQ+movqMOVQ+movqMOVQ+prefetchnta(Prefetch Data Into Caches using NTA HintprefetchntaPREFETCHNTA
#vfmaddsub213phVFused Multiply-Alternating Add/Subtract of Packed Half-Precision Floating-Point Valuesvfmaddsub213phK<vfmaddsub213phKvfmaddsub213phK>vfmaddsub213phKvfmaddsub213phR@vfmaddsub213phRvfmaddsub213phK<vfmaddsub213phKvfmaddsub213phK>vfmaddsub213phKvfmaddsub213phR@vfmaddsub213phRvfmaddsub213phRQvfmaddsub213phRQvminsh9Return Minimum Scalar Half-Precision Floating-Point ValuevminshRvminshR$vminshRvminshR$vminshRRvminshRRvrcpphNCompute Approximate Reciprocals of Packed Half-Precision Floating-Point ValuesvrcpphK<vrcpphK>vrcpphR@vrcpphKvrcpphKvrcpphRvrcpphK<vrcpphKvrcpphK>vrcpphKvrcpphR@vrcpphR
vfpclasssh8Test Class of Scalar Half-Precision Floating-Point Value
vfpclassshR
vfpclassshR
vfpclassshR$
vfpclassshR$	vphaddubw+Packed Horizontal Add Unsigned Byte to Word	vphaddubw"	vphaddubw"/	vpmovuswbGDown Convert Packed Word Values to Byte Values with Unsigned Saturation	vpmovuswbI	vpmovuswbI,	vpmovuswbI	vpmovuswbI0	vpmovuswbI	vpmovuswbI3	vpmovuswbI	vpmovuswbI	vpmovuswbI	vpmovuswbI+	vpmovuswbI/	vpmovuswbI2pswapdPacked Swap Doublewordpswapdpswapd+movntdqa.Load Double Quadword Non-Temporal Aligned Hintmovntdqa/vaesenc+Perform One Round of an AES Encryption Flow
vaesenc vaesencKvaesenc /vaesencK/vaesencvaesencKvaesenc2vaesencK2vaesencHvaesencH5vfmadd213psCFused Multiply-Add of Packed Single-Precision Floating-Point Valuesvfmadd213psH9vfmadd213psHvfmadd213psH:vfmadd213psHvfmadd213psH;vfmadd213psHvfmadd213psH9vfmadd213ps#vfmadd213psHvfmadd213ps#/vfmadd213psH:vfmadd213ps#vfmadd213psHvfmadd213ps#2vfmadd213psH;vfmadd213psHvfmadd213psHQvfmadd213psHQ	vpmovsxbqBMove Packed Byte Integers to Quadword Integers with Sign Extension	vpmovsxbqH	vpmovsxbqH	vpmovsxbqH	vpmovsxbqH$	vpmovsxbqH'	vpmovsxbqH+	vpmovsxbq 	vpmovsxbqH	vpmovsxbq $	vpmovsxbqH$	vpmovsxbq!	vpmovsxbqH	vpmovsxbq!'	vpmovsxbqH'	vpmovsxbqH	vpmovsxbqH+vtestps/Packed Single-Precision Floating-Point Bit Testvtestps vtestps /vtestps vtestps 2pshufdShuffle Packed DoublewordspshufdPSHUFLpshufdPSHUFL/jnoJump if not overflow (OF == 0)jnoJOCNjnoJOCOvrsqrtssQCompute Reciprocal of Square Root of Scalar Single-Precision Floating-Point Valuevrsqrtss vrsqrtss 'shrx+Logical Shift Right Without Affecting Flagsshrxl5shrxl5'	vpmovzxdqHMove Packed Doubleword Integers to Quadword Integers with Zero Extension	vpmovzxdqH	vpmovzxdqH	vpmovzxdqH	vpmovzxdqH+	vpmovzxdqH/	vpmovzxdqH2	vpmovzxdq 	vpmovzxdqH	vpmovzxdq +	vpmovzxdqH+	vpmovzxdq!	vpmovzxdqH	vpmovzxdq!/	vpmovzxdqH/	vpmovzxdqH	vpmovzxdqH2vcvtneoph2ps9Convert Odd Elements of Packed FP16 Values to FP32 Valuesvcvtneoph2psZ/vcvtneoph2psZ2vpcmpud)Compare Packed Unsigned Doubleword ValuesvpcmpudH9vpcmpudH9vpcmpudHvpcmpudHvpcmpudH:vpcmpudH:vpcmpudHvpcmpudHvpcmpudH;vpcmpudH;vpcmpudHvpcmpudHvpmovqwBDown Convert Packed Quadword Values to Word Values with TruncationvpmovqwHvpmovqwH(vpmovqwHvpmovqwH,vpmovqwHvpmovqwH0vpmovqwHvpmovqwHvpmovqwHvpmovqwH'vpmovqwH+vpmovqwH/vrcp28sduApproximation to the Reciprocal of a Scalar Double-Precision Floating-Point Value with Less Than 2^-28 Relative Errorvrcp28sdMvrcp28sdM+vrcp28sdMvrcp28sdM+vrcp28sdMRvrcp28sdMRvminps<Return Minimum Packed Single-Precision Floating-Point ValuesvminpsH9vminpsHvminpsH:vminpsHvminpsH;vminpsHvminpsH9vminps vminpsHvminps /vminpsH:vminps vminpsHvminps 2vminpsH;vminpsHvminpsHRvminpsHRvrcp14pdPCompute Approximate Reciprocals of Packed Double-Precision Floating-Point Valuesvrcp14pdH=vrcp14pdH?vrcp14pdHAvrcp14pdHvrcp14pdHvrcp14pdHvrcp14pdH=vrcp14pdHvrcp14pdH?vrcp14pdHvrcp14pdHAvrcp14pdHdppd<Dot Product of Packed Double Precision Floating-Point Valuesdppddppd/kshiftlbShift Left 8-bit MaskskshiftlbJcwdeConvert Word to Doublewordcwtlpslldq)Shift Packed Double Quadword Left Logicalpslldqmovq2dq1Move Quadword from MMX Technology to XMM Registermovq2dq	vcvtdq2psBConvert Packed Dword Integers to Packed Single-Precision FP Values	vcvtdq2psH9	vcvtdq2psH:	vcvtdq2psH;	vcvtdq2psH	vcvtdq2psH	vcvtdq2psH	vcvtdq2psH9	vcvtdq2ps 	vcvtdq2psH	vcvtdq2ps /	vcvtdq2psH:	vcvtdq2ps 	vcvtdq2psH	vcvtdq2ps 2	vcvtdq2psH;	vcvtdq2psH	vcvtdq2psHQ	vcvtdq2psHQpf2iw0Packed Floating-Point to Integer Word Conversionpf2iwpf2iw+vfmsub213shFFused Multiply-Subtract of Scalar Half-Precision Floating-Point Valuesvfmsub213shRvfmsub213shR$vfmsub213shRvfmsub213shR$vfmsub213shRQvfmsub213shRQ	cvttps2piRConvert with Truncation Packed Single-Precision FP Values to Packed Dword Integers	cvttps2pi	CVTTPS2PL	cvttps2pi	CVTTPS2PL+vpaddwAdd Packed Word IntegersvpaddwIvpaddwI/vpaddwIvpaddwI2vpaddwIvpaddwI5vpaddw vpaddwIvpaddw /vpaddwI/vpaddw!vpaddwIvpaddw!2vpaddwI2vpaddwIvpaddwI5	vpdpwusdsXPacked Dot Product of Unsigned-by-Signed Word subvectors into Doubleword with Saturation	vpdpwusdsY	vpdpwusdsY/	vpdpwusdsY	vpdpwusdsY2	vreducephOPerform Reduction Transformation on Packed Half-Precision Floating-Point Values	vreducephK<	vreducephK>	vreducephR@	vreducephK	vreducephK	vreducephR	vreducephK<	vreducephK	vreducephK>	vreducephK	vreducephR@	vreducephR	vreducephRR	vreducephRRvrsqrtpsTCompute Reciprocals of Square Roots of Packed Single-Precision Floating-Point Valuesvrsqrtps vrsqrtps /vrsqrtps vrsqrtps 2vscatterqpdRScatter Packed Double-Precision Floating-Point Values with Signed Quadword IndicesvscatterqpdHEvscatterqpdHIvscatterqpdHM	vfcmulcphOFused Conjugate Multiply of Complex Packed Half-Precision Floating-Point Values	vfcmulcphK9	vfcmulcphK	vfcmulcphK:	vfcmulcphK	vfcmulcphR;	vfcmulcphR	vfcmulcphK9	vfcmulcphK	vfcmulcphK:	vfcmulcphK	vfcmulcphR;	vfcmulcphR	vfcmulcphRQ	vfcmulcphRQjnbJump if not below (CF == 0)jnbJCCNjnbJCCOjpeJump if parity even (PF == 1)jpeJPSNjpeJPSOaor
Atomically ORaor'maxps<Return Maximum Packed Single-Precision Floating-Point ValuesmaxpsMAXPSmaxpsMAXPS/movaps:Move Aligned Packed Single-Precision Floating-Point ValuesmovapsMOVAPSmovapsMOVAPS/movapsMOVAPS/vfmsub213phFFused Multiply-Subtract of Packed Half-Precision Floating-Point Valuesvfmsub213phK<vfmsub213phKvfmsub213phK>vfmsub213phKvfmsub213phR@vfmsub213phRvfmsub213phK<vfmsub213phKvfmsub213phK>vfmsub213phKvfmsub213phR@vfmsub213phRvfmsub213phRQvfmsub213phRQcmovge#Move if greater or equal (SF == OF)cmovgewcmovgew$cmovgelcmovgel'addsubpdPacked Double-FP Add/Subtractaddsubpdaddsubpd/
vcvttsh2siGConvert with Truncation Scalar Half-Precision FP Value to Dword Integer
vcvttsh2siR
vcvttsh2siR$
vcvttsh2siRRvfrczsd7Extract Fraction Scalar Double-Precision Floating-Pointvfrczsd"vfrczsd"+vpalignrPacked Align RightvpalignrIvpalignrI/vpalignrIvpalignrI2vpalignrIvpalignrI5vpalignr vpalignrIvpalignr /vpalignrI/vpalignr!vpalignrIvpalignr!2vpalignrI2vpalignrIvpalignrI5jnpJump if not parity (PF == 0)jnpJPCNjnpJPCO
vgetmantsdMExtract Normalized Mantissa from Scalar Double-Precision Floating-Point Value
vgetmantsdH
vgetmantsdH+
vgetmantsdH
vgetmantsdH+
vgetmantsdHR
vgetmantsdHR	vpmovusqwKDown Convert Packed Quadword Values to Word Values with Unsigned Saturation	vpmovusqwH	vpmovusqwH(	vpmovusqwH	vpmovusqwH,	vpmovusqwH	vpmovusqwH0	vpmovusqwH	vpmovusqwH	vpmovusqwH	vpmovusqwH'	vpmovusqwH+	vpmovusqwH/divsd4Divide Scalar Double-Precision Floating-Point ValuesdivsdDIVSDdivsdDIVSD+vgf2p8affineqb(Galois Field (2^8) Affine Transformationvgf2p8affineqb=vgf2p8affineqbvgf2p8affineqb?vgf2p8affineqbvgf2p8affineqbAvgf2p8affineqbvgf2p8affineqb=vgf2p8affineqbvgf2p8affineqbvgf2p8affineqb/vgf2p8affineqb?vgf2p8affineqbvgf2p8affineqbvgf2p8affineqb2vgf2p8affineqbAvgf2p8affineqbvptestmb6Logical AND of Packed Byte Integer Values and Set MaskvptestmbIvptestmbIvptestmbI/vptestmbI/vptestmbIvptestmbIvptestmbI2vptestmbI2vptestmbIvptestmbIvptestmbI5vptestmbI5
vpgatherqq;Gather Packed Quadword Values Using Signed Quadword Indices
vpgatherqqHD
vpgatherqqHH
vpgatherqqHL
vpgatherqq!D
vpgatherqq!Hvfmsubadd132pdXFused Multiply-Alternating Subtract/Add of Packed Double-Precision Floating-Point Valuesvfmsubadd132pdH=vfmsubadd132pdHvfmsubadd132pdH?vfmsubadd132pdHvfmsubadd132pdHAvfmsubadd132pdHvfmsubadd132pdH=vfmsubadd132pd#vfmsubadd132pdHvfmsubadd132pd#/vfmsubadd132pdH?vfmsubadd132pd#vfmsubadd132pdHvfmsubadd132pd#2vfmsubadd132pdHAvfmsubadd132pdHvfmsubadd132pdHQvfmsubadd132pdHQvbroadcasti64x2Broadcast Two Quadword Elementsvbroadcasti64x2J/vbroadcasti64x2J/vbroadcasti64x2J/vbroadcasti64x2J/vpdpwssdFPacked Dot Product of Signed-by-Signed Word subvectors into DoublewordvpdpwssdK9vpdpwssdKvpdpwssdK:vpdpwssdKvpdpwssdV;vpdpwssdVvpdpwssdK9vpdpwssdWvpdpwssdKvpdpwssdW/vpdpwssdK:vpdpwssdWvpdpwssdKvpdpwssdW2vpdpwssdV;vpdpwssdVvpcomw#Compare Packed Signed Word Integersvpcomw"vpcomw"/	vpmulhrswOPacked Multiply Signed Word Integers and Store High Result with Round and Scale	vpmulhrswI	vpmulhrswI/	vpmulhrswI	vpmulhrswI2	vpmulhrswI	vpmulhrswI5	vpmulhrsw 	vpmulhrswI	vpmulhrsw /	vpmulhrswI/	vpmulhrsw!	vpmulhrswI	vpmulhrsw!2	vpmulhrswI2	vpmulhrswI	vpmulhrswI5vpsllw#Shift Packed Word Data Left LogicalvpsllwIvpsllwIvpsllwI/vpsllwIvpsllwIvpsllwI/vpsllwIvpsllwIvpsllwI/vpsllwI/vpsllwI2vpsllwI5vpsllw vpsllwIvpsllw vpsllwIvpsllw /vpsllwI/vpsllwI/vpsllw!vpsllwIvpsllw!vpsllwIvpsllw!/vpsllwI/vpsllwI2vpsllwIvpsllwIvpsllwI/vpsllwI5movlps6Move Low Packed Single-Precision Floating-Point ValuesmovlpsMOVLPS+movlpsMOVLPS+aas!ASCII Adjust AL After SubtractionaasAASmaskmovq Store Selected Bytes of Quadwordmaskmovq
cmovncMove if not carry (CF == 0)cmovncwcmovncw$cmovnclcmovncl'	vfnmaddpsLFused Negative Multiply-Add of Packed Single-Precision Floating-Point Values	vfnmaddps$	vfnmaddps$/	vfnmaddps$/	vfnmaddps$	vfnmaddps$2	vfnmaddps$2
vfcmaddcshSFused Conjugate Multiply-Add of Complex Scalar Half-Precision Floating-Point Values
vfcmaddcshR
vfcmaddcshR'
vfcmaddcshR
vfcmaddcshR'
vfcmaddcshRQ
vfcmaddcshRQkshiftrdShift Right 32-bit MaskskshiftrdIvsqrtsdCCompute Square Root of Scalar Double-Precision Floating-Point ValuevsqrtsdHvsqrtsdH+vsqrtsd vsqrtsdHvsqrtsd +vsqrtsdH+vsqrtsdHQvsqrtsdHQcpuidCPU IdentificationcpuidCPUIDvfmadd213pdCFused Multiply-Add of Packed Double-Precision Floating-Point Valuesvfmadd213pdH=vfmadd213pdHvfmadd213pdH?vfmadd213pdHvfmadd213pdHAvfmadd213pdHvfmadd213pdH=vfmadd213pd#vfmadd213pdHvfmadd213pd#/vfmadd213pdH?vfmadd213pd#vfmadd213pdHvfmadd213pd#2vfmadd213pdHAvfmadd213pdHvfmadd213pdHQvfmadd213pdHQvfmaddsub213pdXFused Multiply-Alternating Add/Subtract of Packed Double-Precision Floating-Point Valuesvfmaddsub213pdH=vfmaddsub213pdHvfmaddsub213pdH?vfmaddsub213pdHvfmaddsub213pdHAvfmaddsub213pdHvfmaddsub213pdH=vfmaddsub213pd#vfmaddsub213pdHvfmaddsub213pd#/vfmaddsub213pdH?vfmaddsub213pd#vfmaddsub213pdHvfmaddsub213pd#2vfmaddsub213pdHAvfmaddsub213pdHvfmaddsub213pdHQvfmaddsub213pdHQvfnmsub213pdQFused Negative Multiply-Subtract of Packed Double-Precision Floating-Point Valuesvfnmsub213pdH=vfnmsub213pdHvfnmsub213pdH?vfnmsub213pdHvfnmsub213pdHAvfnmsub213pdHvfnmsub213pdH=vfnmsub213pd#vfnmsub213pdHvfnmsub213pd#/vfnmsub213pdH?vfnmsub213pd#vfnmsub213pdHvfnmsub213pd#2vfnmsub213pdHAvfnmsub213pdHvfnmsub213pdHQvfnmsub213pdHQvpshufhwShuffle Packed High WordsvpshufhwIvpshufhwIvpshufhwIvpshufhwI/vpshufhwI2vpshufhwI5vpshufhw vpshufhwIvpshufhw /vpshufhwI/vpshufhw!vpshufhwIvpshufhw!2vpshufhwI2vpshufhwIvpshufhwI5vpavgwAverage Packed Word IntegersvpavgwIvpavgwI/vpavgwIvpavgwI2vpavgwIvpavgwI5vpavgw vpavgwIvpavgw /vpavgwI/vpavgw!vpavgwIvpavgw!2vpavgwI2vpavgwIvpavgwI5sfenceStore FencesfenceSFENCE
testLogical ComparetestbTESTBtestbTESTB	testbTESTB		testwTESTWtestwTESTWtestwTESTWtestlTESTLtestlTESTLtestlTESTLtestbTESTB#testbTESTB#	testwTESTW$testwTESTW$testlTESTL'testlTESTL'vpmacsddAPacked Multiply Accumulate Signed Doubleword to Signed Doublewordvpmacsdd"vpmacsdd"/vpmovw2m3Move Signs of Packed Word Integers to Mask Registervpmovw2mIvpmovw2mIvpmovw2mIpfcmpge2Packed Floating-Point Compare for Greater or Equalpfcmpgepfcmpge+	pcmpistrm3Packed Compare Implicit Length Strings, Return Mask	pcmpistrm	pcmpistrm/cmpxchgCompare and ExchangecmpxchgbCMPXCHGB		cmpxchgwCMPXCHGWcmpxchglCMPXCHGLcmpxchgbCMPXCHGB#	cmpxchgwCMPXCHGW$cmpxchglCMPXCHGL'pabsw&Packed Absolute Value of Word Integerspabswpabsw+pabswpabsw/pmuldqDMultiply Packed Signed Doubleword Integers and Store Quadword Resultpmuldqpmuldq/psubsw;Subtract Packed Signed Word Integers with Signed SaturationpsubswPSUBSWpsubswPSUBSW+psubswPSUBSWpsubswPSUBSW/vfmsubpdHFused Multiply-Subtract of Packed Double-Precision Floating-Point Valuesvfmsubpd$vfmsubpd$/vfmsubpd$/vfmsubpd$vfmsubpd$2vfmsubpd$2vsubsh4Subtract Scalar Half-Precision Floating-Point ValuesvsubshRvsubshR$vsubshRvsubshR$vsubshRQvsubshRQvfmadd213ssCFused Multiply-Add of Scalar Single-Precision Floating-Point Valuesvfmadd213ssHvfmadd213ssH'vfmadd213ss#vfmadd213ssHvfmadd213ss#'vfmadd213ssH'vfmadd213ssHQvfmadd213ssHQpfsubPacked Floating-Point Subtractpfsubpfsub+rcpssOCompute Approximate Reciprocal of Scalar Single-Precision Floating-Point ValuesrcpssRCPSSrcpssRCPSS'vphaddbd6Packed Horizontal Add Signed Byte to Signed Doublewordvphaddbd"vphaddbd"/vphminposuw3Packed Horizontal Minimum of Unsigned Word Integersvphminposuw vphminposuw /vpminsb&Minimum of Packed Signed Byte IntegersvpminsbIvpminsbI/vpminsbIvpminsbI2vpminsbIvpminsbI5vpminsb vpminsbIvpminsb /vpminsbI/vpminsb!vpminsbIvpminsb!2vpminsbI2vpminsbIvpminsbI5andnpsHBitwise Logical AND NOT of Packed Single-Precision Floating-Point ValuesandnpsANDNPSandnpsANDNPS/shlLogical Shift LeftshlbSHLB	shlbSHLB	shlbSHLB	shlwSHLWshlwSHLWshlwSHLWshllSHLLshllSHLLshllSHLLshlbSHLB#shlbSHLB#shlbSHLB#shlwSHLW$shlwSHLW$shlwSHLW$shllSHLL'shllSHLL'shllSHLL'pblendvbVariable Blend Packed Bytespblendvbpblendvb/vscatterpf1qps�Sparse Prefetch Packed Single-Precision Floating-Point Data Values with Signed Quadword Indices Using T1 Hint with Intent to Writevscatterpf1qpsLMvfmsub231phFFused Multiply-Subtract of Packed Half-Precision Floating-Point Valuesvfmsub231phK<vfmsub231phKvfmsub231phK>vfmsub231phKvfmsub231phR@vfmsub231phRvfmsub231phK<vfmsub231phKvfmsub231phK>vfmsub231phKvfmsub231phR@vfmsub231phRvfmsub231phRQvfmsub231phRQvdivsd4Divide Scalar Double-Precision Floating-Point ValuesvdivsdHvdivsdH+vdivsd vdivsdHvdivsd +vdivsdH+vdivsdHQvdivsdHQcvtps2pdNConvert Packed Single-Precision FP Values to Packed Double-Precision FP Valuescvtps2pdCVTPS2PDcvtps2pdCVTPS2PD+ja#Jump if above (CF == 0 and ZF == 0)jaJHINjaJHIOvinsertf32x8@Insert 256 Bits of Packed Single-Precision Floating-Point Valuesvinsertf32x8Jvinsertf32x8J2vinsertf32x8Jvinsertf32x8J2vpshufbitqmb@Shuffle Bits From Quadword Elements Using Byte Indexes Into MaskvpshufbitqmbKvpshufbitqmbKvpshufbitqmbK/vpshufbitqmbK/vpshufbitqmbKvpshufbitqmbKvpshufbitqmbK2vpshufbitqmbK2vpshufbitqmbSvpshufbitqmbSvpshufbitqmbS5vpshufbitqmbS5movntsdKStore Scalar Double-Precision Floating-Point Values Using Non-Temporal Hintmovntsd+shufps5Shuffle Packed Single-Precision Floating-Point Valuesshufpsshufps/vpshrdd:Concatenate and Shift Packed Doubleword Data Right LogicalvpshrddK9vpshrddKvpshrddK:vpshrddKvpshrddU;vpshrddUvpshrddK9vpshrddKvpshrddK:vpshrddKvpshrddU;vpshrddU	vpmacsswdKPacked Multiply Accumulate with Saturation Signed Word to Signed Doubleword	vpmacsswd"	vpmacsswd"/movzxMove with Zero-ExtendmovzbwMOVBWZX	movzbwMOVBWZX#movzblMOVBLZX	movzwlMOVWLZXmovzblMOVBLZX#movzwlMOVWLZX$jae Jump if above or equal (CF == 0)jaeJCCNjaeJCCOpsignbPacked Sign of Byte Integerspsignbpsignb+psignbpsignb/
vpmacssdqhTPacked Multiply Accumulate with Saturation Signed High Doubleword to Signed Quadword
vpmacssdqh"
vpmacssdqh"/	extractps4Extract Packed Single Precision Floating-Point Value	extractps	extractps'phaddw#Packed Horizontal Add Word Integersphaddwphaddw+phaddwphaddw/vpshrdvdCConcatenate and Variable Shift Packed Doubleword Data Right LogicalvpshrdvdK9vpshrdvdKvpshrdvdK:vpshrdvdKvpshrdvdU;vpshrdvdUvpshrdvdK9vpshrdvdKvpshrdvdK:vpshrdvdKvpshrdvdU;vpshrdvdUvscatterpf0dps�Sparse Prefetch Packed Single-Precision Floating-Point Data Values with Signed Doubleword Indices Using T0 Hint with Intent to Writevscatterpf0dpsLKaaaASCII Adjust After AdditionaaaAAApaddusw:Add Packed Unsigned Word Integers with Unsigned Saturationpadduswpaddusw+padduswpaddusw/vpdpbssdFPacked Dot Product of Signed-by-Singed Byte subvectors into DoublewordvpdpbssdXvpdpbssdX/vpdpbssdXvpdpbssdX2vfnmadd231sdLFused Negative Multiply-Add of Scalar Double-Precision Floating-Point Valuesvfnmadd231sdHvfnmadd231sdH+vfnmadd231sd#vfnmadd231sdHvfnmadd231sd#+vfnmadd231sdH+vfnmadd231sdHQvfnmadd231sdHQ	vfnmsubpsQFused Negative Multiply-Subtract of Packed Single-Precision Floating-Point Values	vfnmsubps$	vfnmsubps$/	vfnmsubps$/	vfnmsubps$	vfnmsubps$2	vfnmsubps$2vpsravw0Variable Shift Packed Word Data Right ArithmeticvpsravwIvpsravwI/vpsravwIvpsravwI2vpsravwIvpsravwI5vpsravwIvpsravwI/vpsravwIvpsravwI2vpsravwIvpsravwI5pfcmpeq'Packed Floating-Point Compare for Equalpfcmpeqpfcmpeq+vpshaq!Packed Shift Arithmetic Quadwordsvpshaq"vpshaq"/vpshaq"/
vcvtudq2pd\Convert Packed Unsigned Doubleword Integers to Packed Double-Precision Floating-Point Values
vcvtudq2pdH8
vcvtudq2pdH9
vcvtudq2pdH:
vcvtudq2pdH
vcvtudq2pdH
vcvtudq2pdH
vcvtudq2pdH8
vcvtudq2pdH
vcvtudq2pdH9
vcvtudq2pdH
vcvtudq2pdH:
vcvtudq2pdHvcvttps2udqrConvert with Truncation Packed Single-Precision Floating-Point Values to Packed Unsigned Doubleword Integer Valuesvcvttps2udqH9vcvttps2udqH:vcvttps2udqH;vcvttps2udqHvcvttps2udqHvcvttps2udqHvcvttps2udqH9vcvttps2udqHvcvttps2udqH:vcvttps2udqHvcvttps2udqH;vcvttps2udqHvcvttps2udqHRvcvttps2udqHR	vfnmaddpdLFused Negative Multiply-Add of Packed Double-Precision Floating-Point Values	vfnmaddpd$	vfnmaddpd$/	vfnmaddpd$/	vfnmaddpd$	vfnmaddpd$2	vfnmaddpd$2vpandnq3Bitwise Logical AND NOT of Packed Quadword IntegersvpandnqH=vpandnqHvpandnqH?vpandnqHvpandnqHAvpandnqHvpandnqH=vpandnqHvpandnqH?vpandnqHvpandnqHAvpandnqHvpxord:Bitwise Logical Exclusive OR of Packed Doubleword IntegersvpxordH9vpxordHvpxordH:vpxordHvpxordH;vpxordHvpxordH9vpxordHvpxordH:vpxordHvpxordH;vpxordHmulx)Unsigned Multiply Without Affecting Flagsmulxl5mulxl5'packsswb,Pack Words into Bytes with Signed Saturationpacksswbpacksswb+packsswbpacksswb/vpcomuq)Compare Packed Unsigned Quadword Integersvpcomuq"vpcomuq"/	vmovntdqa.Load Double Quadword Non-Temporal Aligned Hint	vmovntdqa /	vmovntdqaH/	vmovntdqa!2	vmovntdqaH2	vmovntdqaH5	vcvtsi2ss9Convert Dword Integer to Scalar Single-Precision FP Value
vcvtsi2ssl 
vcvtsi2sslH
vcvtsi2ssl '
vcvtsi2sslH'
vcvtsi2sslHQblcsSet Lowest Clear Bitblcs6blcs6'vpmullq=Multiply Packed Signed Quadword Integers and Store Low ResultvpmullqJ=vpmullqJvpmullqJ?vpmullqJvpmullqJAvpmullqJvpmullqJ=vpmullqJvpmullqJ?vpmullqJvpmullqJAvpmullqJ
prefetcht1'Prefetch Data Into Caches using T1 Hint
prefetcht1
PREFETCHT1
#pextrdExtract Doublewordpextrdpextrd'korbBitwise Logical OR 8-bit MaskskorbJmovups<Move Unaligned Packed Single-Precision Floating-Point ValuesmovupsMOVUPSmovupsMOVUPS/movupsMOVUPS/
sha256msg1PPerform an Intermediate Calculation for the Next Four SHA256 Message Doublewords
sha256msg1(
sha256msg1(/vmovlpd5Move Low Packed Double-Precision Floating-Point Valuevmovlpd +vmovlpdH+vmovlpd +vmovlpdH+vmaxsh9Return Maximum Scalar Half-Precision Floating-Point ValuevmaxshRvmaxshR$vmaxshRvmaxshR$vmaxshRRvmaxshRRemmsExit MMX StateemmsEMMSvcvttsh2usiVConvert with Truncation Scalar Half-Precision Floating-Point Value to Unsigned Integervcvttsh2usiRvcvttsh2usiR$vcvttsh2usiRRvmovdMove Doublewordvmovd vmovdHvmovd vmovdHvmovd 'vmovdH'vmovd 'vmovdH'
vcvtsh2usiQConvert Scalar Half-Precision Floating-Point Value to Unsigned Doubleword Integer
vcvtsh2usiR
vcvtsh2usiR$
vcvtsh2usiRQvpsrlvq1Variable Shift Packed Quadword Data Right LogicalvpsrlvqH=vpsrlvqHvpsrlvqH?vpsrlvqHvpsrlvqHAvpsrlvqHvpsrlvqH=vpsrlvq!vpsrlvqHvpsrlvq!/vpsrlvqH?vpsrlvq!vpsrlvqHvpsrlvq!2vpsrlvqHAvpsrlvqHkxnorw!Bitwise Logical XNOR 16-bit MaskskxnorwHpfmulPacked Floating-Point Multiplypfmulpfmul+	vpmovsxbdDMove Packed Byte Integers to Doubleword Integers with Sign Extension	vpmovsxbdH	vpmovsxbdH	vpmovsxbdH	vpmovsxbdH'	vpmovsxbdH+	vpmovsxbdH/	vpmovsxbd 	vpmovsxbdH	vpmovsxbd '	vpmovsxbdH'	vpmovsxbd!	vpmovsxbdH	vpmovsxbd!+	vpmovsxbdH+	vpmovsxbdH	vpmovsxbdH/callCall ProcedurecallCALLOcalllcalll'kaddbADD Two 8-bit MaskskaddbJpminud.Minimum of Packed Unsigned Doubleword Integerspminudpminud/vscatterpf1dpd�Sparse Prefetch Packed Double-Precision Floating-Point Data Values with Signed Doubleword Indices Using T1 Hint with Intent to Writevscatterpf1dpdLGmovsxMove with Sign-ExtensionmovsbwMOVBWSX	movsbwMOVBWSX#movsblMOVBLSX	movswlMOVWLSXmovsblMOVBLSX#movswlMOVWLSX$pmaxud.Maximum of Packed Unsigned Doubleword Integerspmaxudpmaxud/
vextractf64x2AExtract 128 Bits of Packed Double-Precision Floating-Point Values
vextractf64x2J
vextractf64x2J0
vextractf64x2J
vextractf64x2J0
vextractf64x2J
vextractf64x2J
vextractf64x2J/
vextractf64x2J/ucomisdNUnordered Compare Scalar Double-Precision Floating-Point Values and Set EFLAGSucomisdUCOMISDucomisdUCOMISD+phsubd.Packed Horizontal Subtract Doubleword Integersphsubdphsubd+phsubdphsubd/	vcvtss2sdLConvert Scalar Single-Precision FP Value to Scalar Double-Precision FP Value	vcvtss2sdH	vcvtss2sdH'	vcvtss2sd 	vcvtss2sdH	vcvtss2sd '	vcvtss2sdH'	vcvtss2sdHR	vcvtss2sdHR
vfcmaddcphSFused Conjugate Multiply-Add of Complex Packed Half-Precision Floating-Point Values
vfcmaddcphK9
vfcmaddcphK
vfcmaddcphK:
vfcmaddcphK
vfcmaddcphR;
vfcmaddcphR
vfcmaddcphK9
vfcmaddcphK
vfcmaddcphK:
vfcmaddcphK
vfcmaddcphR;
vfcmaddcphR
vfcmaddcphRQ
vfcmaddcphRQ	vcvtsh2si7Convert Scalar Half-Precision FP Value to Dword Integer	vcvtsh2siR	vcvtsh2siR$	vcvtsh2siRQvmaxph:Return Maximum Packed Half-Precision Floating-Point ValuesvmaxphK<vmaxphKvmaxphK>vmaxphKvmaxphR@vmaxphRvmaxphK<vmaxphKvmaxphK>vmaxphKvmaxphR@vmaxphRvmaxphRRvmaxphRRvpcomd)Compare Packed Signed Doubleword Integersvpcomd"vpcomd"/	vpexpandqELoad Sparse Packed Quadword Integer Values from Dense Memory/Register	vpexpandqH	vpexpandqH	vpexpandqH	vpexpandqH/	vpexpandqH2	vpexpandqH5	vpexpandqH	vpexpandqH/	vpexpandqH	vpexpandqH2	vpexpandqH	vpexpandqH5vrangepdXRange Restriction Calculation For Packed Pairs of Double-Precision Floating-Point ValuesvrangepdJ=vrangepdJvrangepdJ?vrangepdJvrangepdJAvrangepdJvrangepdJ=vrangepdJvrangepdJ?vrangepdJvrangepdJAvrangepdJvrangepdJRvrangepdJRvmovlps6Move Low Packed Single-Precision Floating-Point Valuesvmovlps +vmovlpsH+vmovlps +vmovlpsH+monitorx+Monitor a Linear Address Range with TimeoutmonitorxE	vscalefps_Scale Packed Single-Precision Floating-Point Values With Single-Precision Floating-Point Values	vscalefpsH9	vscalefpsH	vscalefpsH:	vscalefpsH	vscalefpsH;	vscalefpsH	vscalefpsH9	vscalefpsH	vscalefpsH:	vscalefpsH	vscalefpsH;	vscalefpsH	vscalefpsHQ	vscalefpsHQcmovnle0Move if not less or equal (ZF == 0 and SF == OF)cmovnlewcmovnlew$cmovnlelcmovnlel'	vcvttph2waConvert with Truncation Packed Half-Precision Floating-Point Values to Packed Word Integer Values	vcvttph2wK<	vcvttph2wK>	vcvttph2wR@	vcvttph2wK	vcvttph2wK	vcvttph2wR	vcvttph2wK<	vcvttph2wK	vcvttph2wK>	vcvttph2wK	vcvttph2wR@	vcvttph2wR	vcvttph2wRR	vcvttph2wRRlahfLoad AH from FlagslahfLAHFvinsertf32x4@Insert 128 Bits of Packed Single-Precision Floating-Point Valuesvinsertf32x4Hvinsertf32x4H/vinsertf32x4Hvinsertf32x4H/vinsertf32x4Hvinsertf32x4H/vinsertf32x4Hvinsertf32x4H/sqrtpdECompute Square Roots of Packed Double-Precision Floating-Point ValuessqrtpdSQRTPDsqrtpdSQRTPD/cmovpeMove if parity even (PF == 1)cmovpewcmovpew$cmovpelcmovpel'divUnsigned DividedivbDIVB	divwDIVWdivlDIVLdivbDIVB#divwDIVW$divlDIVL'tpauseTimed PAUSEtpauseG
vfpclassss:Test Class of Scalar Single-Precision Floating-Point Value
vfpclassssJ
vfpclassssJ
vfpclassssJ'
vfpclassssJ'vpcmpq%Compare Packed Signed Quadword ValuesvpcmpqH=vpcmpqH=vpcmpqHvpcmpqHvpcmpqH?vpcmpqH?vpcmpqHvpcmpqHvpcmpqHAvpcmpqHAvpcmpqHvpcmpqHvpsrldq*Shift Packed Double Quadword Right Logicalvpsrldq vpsrldqIvpsrldqI/vpsrldq!vpsrldqIvpsrldqI2vpsrldqIvpsrldqI5cmoveMove if equal (ZF == 1)cmovewcmovew$cmovelcmovel'setzSet byte if zero (ZF == 1)setzSETEQ	setzSETEQ#vcvttps2uqqpConvert with Truncation Packed Single Precision Floating-Point Values to Packed Unsigned Quadword Integer Valuesvcvttps2uqqJ8vcvttps2uqqJ9vcvttps2uqqJ:vcvttps2uqqJvcvttps2uqqJvcvttps2uqqJvcvttps2uqqJ8vcvttps2uqqJvcvttps2uqqJ9vcvttps2uqqJvcvttps2uqqJ:vcvttps2uqqJvcvttps2uqqJRvcvttps2uqqJR
vpunpckhdq;Unpack and Interleave High-Order Doublewords into Quadwords
vpunpckhdqH9
vpunpckhdqH
vpunpckhdqH:
vpunpckhdqH
vpunpckhdqH;
vpunpckhdqH
vpunpckhdqH9
vpunpckhdq 
vpunpckhdqH
vpunpckhdq /
vpunpckhdqH:
vpunpckhdq!
vpunpckhdqH
vpunpckhdq!2
vpunpckhdqH;
vpunpckhdqHvrcp14psPCompute Approximate Reciprocals of Packed Single-Precision Floating-Point Valuesvrcp14psH9vrcp14psH:vrcp14psH;vrcp14psHvrcp14psHvrcp14psHvrcp14psH9vrcp14psHvrcp14psH:vrcp14psHvrcp14psH;vrcp14psHvrndscalesd]Round Scalar Double-Precision Floating-Point Value To Include A Given Number Of Fraction BitsvrndscalesdHvrndscalesdH+vrndscalesdHvrndscalesdH+vrndscalesdHRvrndscalesdHRvpcmpeqb%Compare Packed Byte Data for EqualityvpcmpeqbIvpcmpeqbIvpcmpeqbI/vpcmpeqbI/vpcmpeqbIvpcmpeqbIvpcmpeqbI2vpcmpeqbI2vpcmpeqbIvpcmpeqbIvpcmpeqbI5vpcmpeqbI5vpcmpeqb vpcmpeqb /vpcmpeqb!vpcmpeqb!2
vrsqrt14pd`Compute Approximate Reciprocals of Square Roots of Packed Double-Precision Floating-Point Values
vrsqrt14pdH=
vrsqrt14pdH?
vrsqrt14pdHA
vrsqrt14pdH
vrsqrt14pdH
vrsqrt14pdH
vrsqrt14pdH=
vrsqrt14pdH
vrsqrt14pdH?
vrsqrt14pdH
vrsqrt14pdHA
vrsqrt14pdHxlatbTable Look-up TranslationxlatXLATvmovdquMove Unaligned Double Quadwordvmovdqu vmovdqu /vmovdqu vmovdqu 2vmovdqu /vmovdqu 2	vpermi2pd\Full Permute of Double-Precision Floating-Point Values From Two Tables Overwriting the Index	vpermi2pdH=	vpermi2pdH	vpermi2pdH?	vpermi2pdH	vpermi2pdHA	vpermi2pdH	vpermi2pdH=	vpermi2pdH	vpermi2pdH?	vpermi2pdH	vpermi2pdHA	vpermi2pdHvpsllvw,Variable Shift Packed Word Data Left LogicalvpsllvwIvpsllvwI/vpsllvwIvpsllvwI2vpsllvwIvpsllvwI5vpsllvwIvpsllvwI/vpsllvwIvpsllvwI2vpsllvwIvpsllvwI5vpadddAdd Packed Doubleword IntegersvpadddH9vpadddHvpadddH:vpadddHvpadddH;vpadddHvpadddH9vpaddd vpadddHvpaddd /vpadddH:vpaddd!vpadddHvpaddd!2vpadddH;vpadddH
vgatherdpdTGather Packed Double-Precision Floating-Point Values Using Signed Doubleword Indices
vgatherdpdHB
vgatherdpdHB
vgatherdpdHF
vgatherdpd!B
vgatherdpd!BcmovzMove if zero (ZF == 1)cmovzwcmovzw$cmovzlcmovzl'vpmovsqwIDown Convert Packed Quadword Values to Word Values with Signed SaturationvpmovsqwHvpmovsqwH(vpmovsqwHvpmovsqwH,vpmovsqwHvpmovsqwH0vpmovsqwHvpmovsqwHvpmovsqwHvpmovsqwH'vpmovsqwH+vpmovsqwH/vfnmsub132phOFused Negative Multiply-Subtract of Packed Half-Precision Floating-Point Valuesvfnmsub132phK<vfnmsub132phKvfnmsub132phK>vfnmsub132phKvfnmsub132phR@vfnmsub132phRvfnmsub132phK<vfnmsub132phKvfnmsub132phK>vfnmsub132phKvfnmsub132phR@vfnmsub132phRvfnmsub132phRQvfnmsub132phRQandpsDBitwise Logical AND of Packed Single-Precision Floating-Point ValuesandpsANDPSandpsANDPS/cmpps5Compare Packed Single-Precision Floating-Point ValuescmppsCMPPScmppsCMPPS/setae$Set byte if above or equal (CF == 0)setaeSETCC	setaeSETCC#subps6Subtract Packed Single-Precision Floating-Point ValuessubpsSUBPSsubpsSUBPS/vpdpwuudJPacked Dot Product of Unsigned-by-Unsigned Word subvectors into DoublewordvpdpwuudYvpdpwuudY/vpdpwuudYvpdpwuudY2vsqrtshACompute Square Root of Scalar Half-Precision Floating-Point ValuevsqrtshRvsqrtshR$vsqrtshRvsqrtshR$vsqrtshRQvsqrtshRQpmaxsb&Maximum of Packed Signed Byte Integerspmaxsbpmaxsb/pmulhrswOPacked Multiply Signed Word Integers and Store High Result with Round and Scalepmulhrswpmulhrsw+pmulhrswpmulhrsw/
vgatherpf0qpsmSparse Prefetch Packed Single-Precision Floating-Point Data Values with Signed Quadword Indices Using T0 Hint
vgatherpf0qpsLMvpsignwPacked Sign of Word Integersvpsignw vpsignw /vpsignw!vpsignw!2cvtss2sdLConvert Scalar Single-Precision FP Value to Scalar Double-Precision FP Valuecvtss2sdCVTSS2SDcvtss2sdCVTSS2SD'pmovmskbMove Byte MaskpmovmskbPMOVMSKB
pmovmskbPMOVMSKBminpd<Return Minimum Packed Double-Precision Floating-Point ValuesminpdMINPDminpdMINPD/vfmsub213psHFused Multiply-Subtract of Packed Single-Precision Floating-Point Valuesvfmsub213psH9vfmsub213psHvfmsub213psH:vfmsub213psHvfmsub213psH;vfmsub213psHvfmsub213psH9vfmsub213ps#vfmsub213psHvfmsub213ps#/vfmsub213psH:vfmsub213ps#vfmsub213psHvfmsub213ps#2vfmsub213psH;vfmsub213psHvfmsub213psHQvfmsub213psHQ
vcvtneps2bf16YConvert with Nearest-Even rounding a Single-Precision FP vector into a BFloat16 FP vectorvcvtneps2bf16xK9vcvtneps2bf16yK:
vcvtneps2bf16Q;vcvtneps2bf16xKvcvtneps2bf16yK
vcvtneps2bf16Qvcvtneps2bf16xK9vcvtneps2bf16yK:vcvtneps2bf16xKvcvtneps2bf16xZvcvtneps2bf16yKvcvtneps2bf16yZvcvtneps2bf16xZ/vcvtneps2bf16yZ2
vcvtneps2bf16Q;
vcvtneps2bf16Qvmaxss;Return Maximum Scalar Single-Precision Floating-Point ValuevmaxssHvmaxssH'vmaxss vmaxssHvmaxss 'vmaxssH'vmaxssHRvmaxssHRvcmpsh3Compare Scalar Half-Precision Floating-Point ValuesvcmpshRvcmpshRvcmpshR$vcmpshR$vcmpshRRvcmpshRR	vpackusdw4Pack Doublewords into Words with Unsigned Saturation	vpackusdwI9	vpackusdwI	vpackusdwI:	vpackusdwI	vpackusdwI;	vpackusdwI	vpackusdwI9	vpackusdw 	vpackusdwI	vpackusdw /	vpackusdwI:	vpackusdw!	vpackusdwI	vpackusdw!2	vpackusdwI;	vpackusdwIvphaddbw0Packed Horizontal Add Signed Byte to Signed Wordvphaddbw"vphaddbw"/kortestbOR 8-bit Masks and Set FlagskortestbJvaddsd1Add Scalar Double-Precision Floating-Point ValuesvaddsdHvaddsdH+vaddsd vaddsdHvaddsd +vaddsdH+vaddsdHQvaddsdHQ	vphaddudq5Packed Horizontal Add Unsigned Doubleword to Quadword	vphaddudq"	vphaddudq"/pf2id5Packed Floating-Point to Integer Doubleword Conversonpf2idpf2id+padddAdd Packed Doubleword Integerspadddpaddd+padddpaddd/	vpmovusqdQDown Convert Packed Quadword Values to Doubleword Values with Unsigned Saturation	vpmovusqdH	vpmovusqdH,	vpmovusqdH	vpmovusqdH0	vpmovusqdH	vpmovusqdH3	vpmovusqdH	vpmovusqdH	vpmovusqdH	vpmovusqdH+	vpmovusqdH/	vpmovusqdH2vrcp14ssPCompute Approximate Reciprocal of a Scalar Single-Precision Floating-Point Valuevrcp14ssHvrcp14ssH'vrcp14ssHvrcp14ssH'vsubss6Subtract Scalar Single-Precision Floating-Point ValuesvsubssHvsubssH'vsubss vsubssHvsubss 'vsubssH'vsubssHQvsubssHQunpckhpsHUnpack and Interleave High Packed Single-Precision Floating-Point ValuesunpckhpsUNPCKHPSunpckhpsUNPCKHPS/mulUnsigned MultiplymulbMULB	mulwMULWmullMULLmulbMULB#mulwMULW$mullMULL'vpsrld*Shift Packed Doubleword Data Right LogicalvpsrldH9vpsrldH:vpsrldH;vpsrldHvpsrldHvpsrldH/vpsrldHvpsrldHvpsrldH/vpsrldHvpsrldHvpsrldH/vpsrldH9vpsrld vpsrldHvpsrld vpsrldHvpsrld /vpsrldH/vpsrldH:vpsrld!vpsrldHvpsrld!vpsrldHvpsrld!/vpsrldH/vpsrldH;vpsrldHvpsrldHvpsrldH/
vrsqrt14sdaCompute Approximate Reciprocal of a Square Root of a Scalar Double-Precision Floating-Point Value
vrsqrt14sdH
vrsqrt14sdH+
vrsqrt14sdH
vrsqrt14sdH+pcmpeqw%Compare Packed Word Data for Equalitypcmpeqwpcmpeqw+pcmpeqwpcmpeqw/knotbNOT 8-bit Mask RegisterknotbJporPacked Bitwise Logical ORporpor+porpor/vpblenddBlend Packed Doublewordsvpblendd!vpblendd!/vpblendd!vpblendd!2vprotqPacked Rotate Quadwordsvprotq"vprotq"vprotq"/vprotq"/vprotq"/	vcvtsd2shJConvert Scalar Double-Precision FP Value to Scalar Half-Precision FP Value	vcvtsd2shR	vcvtsd2shR+	vcvtsd2shR	vcvtsd2shR+	vcvtsd2shRQ	vcvtsd2shRQvpscatterqd=Scatter Packed Doubleword Values with Signed Quadword IndicesvpscatterqdHEvpscatterqdHIvpscatterqdHMvrsqrtphRCompute Reciprocals of Square Roots of Packed Half-Precision Floating-Point ValuesvrsqrtphK<vrsqrtphK>vrsqrtphR@vrsqrtphKvrsqrtphKvrsqrtphRvrsqrtphK<vrsqrtphKvrsqrtphK>vrsqrtphKvrsqrtphR@vrsqrtphRvpmovsdbKDown Convert Packed Doubleword Values to Byte Values with Signed SaturationvpmovsdbHvpmovsdbH(vpmovsdbHvpmovsdbH,vpmovsdbHvpmovsdbH0vpmovsdbHvpmovsdbHvpmovsdbHvpmovsdbH'vpmovsdbH+vpmovsdbH/kaddqADD Two 64-bit MaskskaddqIkshiftlqShift Left 64-bit MaskskshiftlqIcomissLCompare Scalar Ordered Single-Precision Floating-Point Values and Set EFLAGScomissCOMISScomissCOMISS'pfrcpv5Packed Floating-Point Reciprocal Approximation Vectorpfrcpvpfrcpv+bsrBit Scan ReversebsrwBSRWbsrwBSRW$bsrlBSRLbsrlBSRL'rdpidRead Processor IDrdpid,subss6Subtract Scalar Single-Precision Floating-Point ValuessubssSUBSSsubssSUBSS'umonitor(User mode Monitor a Linear Address RangeumonitorGvalignqAlign Quadword VectorsvalignqH=valignqHvalignqH?valignqHvalignqHAvalignqHvalignqH=valignqHvalignqH?valignqHvalignqHAvalignqH	vcvtsh2sdJConvert Scalar Half-Precision FP Value to Scalar Double-Precision FP Value	vcvtsh2sdR	vcvtsh2sdR$	vcvtsh2sdR	vcvtsh2sdR$	vcvtsh2sdRR	vcvtsh2sdRRvcvttph2udqpConvert with Truncation Packed Half-Precision Floating-Point Values to Packed Unsigned Doubleword Integer Valuesvcvttph2udqK.vcvttph2udqK<vcvttph2udqR>vcvttph2udqKvcvttph2udqKvcvttph2udqRvcvttph2udqK.vcvttph2udqKvcvttph2udqK<vcvttph2udqKvcvttph2udqR>vcvttph2udqRvcvttph2udqRRvcvttph2udqRRkmovwMove 16-bit MaskkmovwHkmovwHkmovwH$kmovwHkmovwH$vpsubd#Subtract Packed Doubleword IntegersvpsubdH9vpsubdHvpsubdH:vpsubdHvpsubdH;vpsubdHvpsubdH9vpsubd vpsubdHvpsubd /vpsubdH:vpsubd!vpsubdHvpsubd!2vpsubdH;vpsubdHvzeroallZero All YMM Registersvzeroall 
vcvtpd2uqqZConvert Packed Double-Precision Floating-Point Values to Packed Unsigned Quadword Integers
vcvtpd2uqqJ=
vcvtpd2uqqJ?
vcvtpd2uqqJA
vcvtpd2uqqJ
vcvtpd2uqqJ
vcvtpd2uqqJ
vcvtpd2uqqJ=
vcvtpd2uqqJ
vcvtpd2uqqJ?
vcvtpd2uqqJ
vcvtpd2uqqJA
vcvtpd2uqqJ
vcvtpd2uqqJQ
vcvtpd2uqqJQpandPacked Bitwise Logical ANDpandPANDpandPAND+pandPANDpandPAND/jnzJump if not zero (ZF == 0)jnzJNENjnzJNEOhsubps$Packed Single-FP Horizontal Subtracthsubpshsubps/vmovhpd6Move High Packed Double-Precision Floating-Point Valuevmovhpd +vmovhpdH+vmovhpd +vmovhpdH+idiv
Signed DivideidivbIDIVB	idivwIDIVWidivlIDIVLidivbIDIVB#idivwIDIVW$idivlIDIVL'orpd<Bitwise Logical OR of Double-Precision Floating-Point ValuesorpdORPDorpdORPD/subsd6Subtract Scalar Double-Precision Floating-Point ValuessubsdSUBSDsubsdSUBSD+vpmovm2q8Expand Bits of Mask Register to Packed Quadword Integersvpmovm2qJvpmovm2qJvpmovm2qJandnLogical AND NOTandnl4andnl4'vptestPacked Logical Comparevptest vptest /vptest vptest 2jle+Jump if less or equal (ZF == 1 or SF != OF)jleJLENjleJLEO	sha1nexte1Calculate SHA1 State Variable E after Four Rounds	sha1nexte(	sha1nexte(/