Available on x86 or x86-64 only.
Functions
- loaddqu8_128 🔒
- loaddqu8_256 🔒
- loaddqu8_512 🔒
- loaddqu16_128 🔒
- loaddqu16_256 🔒
- loaddqu16_512 🔒
- storedqu8_128 🔒
- storedqu8_256 🔒
- storedqu8_512 🔒
- storedqu16_128 🔒
- storedqu16_256 🔒
- storedqu16_512 🔒
- vdbpsadbw 🔒
- vdbpsadbw128 🔒
- vdbpsadbw256 🔒
- vpackssdw 🔒
- vpacksswb 🔒
- vpackusdw 🔒
- vpackuswb 🔒
- vpermi2w 🔒
- vpermi2w128 🔒
- vpermi2w256 🔒
- vpermw 🔒
- vpermw128 🔒
- vpermw256 🔒
- vpmaddubsw 🔒
- vpmaddwd 🔒
- vpmovswb 🔒
- vpmovswb128 🔒
- vpmovswb256 🔒
- vpmovswbmem 🔒
- vpmovswbmem128 🔒
- vpmovswbmem256 🔒
- vpmovuswb 🔒
- vpmovuswb128 🔒
- vpmovuswb256 🔒
- vpmovuswbmem 🔒
- vpmovuswbmem128 🔒
- vpmovuswbmem256 🔒
- vpmovwbmem 🔒
- vpmovwbmem128 🔒
- vpmovwbmem256 🔒
- vpmulhrsw 🔒
- vpsadbw 🔒
- vpshufb 🔒
- vpsllvw 🔒
- vpsllvw128 🔒
- vpsllvw256 🔒
- vpsllw 🔒
- vpsravw 🔒
- vpsravw128 🔒
- vpsravw256 🔒
- vpsraw 🔒
- vpsrlvw 🔒
- vpsrlvw128 🔒
- vpsrlvw256 🔒
- vpsrlw 🔒
- _cvtmask32_u32 Experimental avx512bw
- Convert 32-bit mask a into an integer value, and store the result in dst.
- _cvtu32_mask32 Experimental avx512bw
- Convert integer value a into a 32-bit mask, and store the result in k.
- _kadd_mask32 Experimental avx512bw
- Add 32-bit masks in a and b, and store the result in k.
- _kadd_mask64 Experimental avx512bw
- Add 64-bit masks in a and b, and store the result in k.
- _kand_mask32 Experimental avx512bw
- Compute the bitwise AND of 32-bit masks a and b, and store the result in k.
- _kand_mask64 Experimental avx512bw
- Compute the bitwise AND of 64-bit masks a and b, and store the result in k.
- _kandn_mask32 Experimental avx512bw
- Compute the bitwise NOT of 32-bit mask a and then AND with b, and store the result in k.
- _kandn_mask64 Experimental avx512bw
- Compute the bitwise NOT of 64-bit mask a and then AND with b, and store the result in k.
- _knot_mask32 Experimental avx512bw
- Compute the bitwise NOT of 32-bit mask a, and store the result in k.
- _knot_mask64 Experimental avx512bw
- Compute the bitwise NOT of 64-bit mask a, and store the result in k.
- _kor_mask32 Experimental avx512bw
- Compute the bitwise OR of 32-bit masks a and b, and store the result in k.
- _kor_mask64 Experimental avx512bw
- Compute the bitwise OR of 64-bit masks a and b, and store the result in k.
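The mask arithmetic entries above (`_kadd_*`, `_kand_*`, `_kandn_*`, `_knot_*`, `_kor_*`) can be illustrated with plain integers, since `__mmask32` is a type alias for `u32`. The following is a minimal scalar sketch of the documented semantics, not the intrinsics themselves; the `model_*` names are hypothetical, and wrapping on overflow in `model_kadd` is an assumption based on the fixed width of the mask.

```rust
// Scalar models of the 32-bit mask operations (hypothetical helper names).
// They run on any target and do not require AVX-512BW.

/// Bitwise AND of two masks.
fn model_kand(a: u32, b: u32) -> u32 {
    a & b
}

/// Bitwise NOT of `a`, then AND with `b`.
fn model_kandn(a: u32, b: u32) -> u32 {
    !a & b
}

/// Bitwise NOT of a mask.
fn model_knot(a: u32) -> u32 {
    !a
}

/// Bitwise OR of two masks.
fn model_kor(a: u32, b: u32) -> u32 {
    a | b
}

/// Integer addition of two masks (assumed to wrap at 32 bits).
fn model_kadd(a: u32, b: u32) -> u32 {
    a.wrapping_add(b)
}
```

For instance, `model_kandn(0b1100, 0b1010)` evaluates to `0b0010`: only the bits of `b` that are clear in `a` survive.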
- _kortest_mask32_u8 Experimental avx512bw
- Compute the bitwise OR of 32-bit masks a and b. If the result is all zeros, store 1 in dst, otherwise store 0 in dst. If the result is all ones, store 1 in all_ones, otherwise store 0 in all_ones.
- _kortest_mask64_u8 Experimental avx512bw
- Compute the bitwise OR of 64-bit masks a and b. If the result is all zeros, store 1 in dst, otherwise store 0 in dst. If the result is all ones, store 1 in all_ones, otherwise store 0 in all_ones.
- _kortestc_mask32_u8 Experimental avx512bw
- Compute the bitwise OR of 32-bit masks a and b. If the result is all ones, store 1 in dst, otherwise store 0 in dst.
- _kortestc_mask64_u8 Experimental avx512bw
- Compute the bitwise OR of 64-bit masks a and b. If the result is all ones, store 1 in dst, otherwise store 0 in dst.
- _kortestz_mask32_u8 Experimental avx512bw
- Compute the bitwise OR of 32-bit masks a and b. If the result is all zeros, store 1 in dst, otherwise store 0 in dst.
- _kortestz_mask64_u8 Experimental avx512bw
- Compute the bitwise OR of 64-bit masks a and b. If the result is all zeros, store 1 in dst, otherwise store 0 in dst.
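The three OR-test variants above differ only in which flag they report. A scalar sketch of that relationship follows; the `model_*` names are hypothetical, and the tuple return stands in for the `dst` / `all_ones` pair described above rather than mirroring the real signature.

```rust
/// Model of the OR-test on 32-bit masks.
/// Returns (zero_flag, all_ones_flag), each 0 or 1.
fn model_kortest(a: u32, b: u32) -> (u8, u8) {
    let or = a | b;
    let zero = (or == 0) as u8; // 1 when every bit of a | b is clear
    let all_ones = (or == u32::MAX) as u8; // 1 when every bit of a | b is set
    (zero, all_ones)
}

/// Only the "all zeros" flag.
fn model_kortestz(a: u32, b: u32) -> u8 {
    model_kortest(a, b).0
}

/// Only the "all ones" flag.
fn model_kortestc(a: u32, b: u32) -> u8 {
    model_kortest(a, b).1
}
```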
- _kshiftli_mask32 Experimental avx512bw
- Shift the bits of 32-bit mask a left by count while shifting in zeros, and store the least significant 32 bits of the result in k.
- _kshiftli_mask64 Experimental avx512bw
- Shift the bits of 64-bit mask a left by count while shifting in zeros, and store the least significant 64 bits of the result in k.
- _kshiftri_mask32 Experimental avx512bw
- Shift the bits of 32-bit mask a right by count while shifting in zeros, and store the least significant 32 bits of the result in k.
- _kshiftri_mask64 Experimental avx512bw
- Shift the bits of 64-bit mask a right by count while shifting in zeros, and store the least significant 64 bits of the result in k.
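The mask shifts above can be sketched on plain integers. This model assumes that a count at or beyond the mask width yields an all-zero mask (the behaviour of the underlying KSHIFT instructions as I understand it); the `model_*` names are hypothetical and the count is a runtime argument here purely for illustration.

```rust
/// Shift a 32-bit mask left, shifting in zeros.
/// Assumption: counts of 32 or more produce 0.
fn model_kshiftli32(a: u32, count: u32) -> u32 {
    a.checked_shl(count).unwrap_or(0)
}

/// Shift a 32-bit mask right, shifting in zeros.
fn model_kshiftri32(a: u32, count: u32) -> u32 {
    a.checked_shr(count).unwrap_or(0)
}

/// Shift a 64-bit mask left; the full 64-bit result is kept.
fn model_kshiftli64(a: u64, count: u32) -> u64 {
    a.checked_shl(count).unwrap_or(0)
}
```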
- _ktest_mask32_u8 Experimental avx512bw
- Compute the bitwise AND of 32-bit masks a and b, and if the result is all zeros, store 1 in dst, otherwise store 0 in dst. Compute the bitwise NOT of a and then AND with b, if the result is all zeros, store 1 in and_not, otherwise store 0 in and_not.
- _ktest_mask64_u8 Experimental avx512bw
- Compute the bitwise AND of 64-bit masks a and b, and if the result is all zeros, store 1 in dst, otherwise store 0 in dst. Compute the bitwise NOT of a and then AND with b, if the result is all zeros, store 1 in and_not, otherwise store 0 in and_not.
- _ktestc_mask32_u8 Experimental avx512bw
- Compute the bitwise NOT of 32-bit mask a and then AND with 32-bit mask b, if the result is all zeros, store 1 in dst, otherwise store 0 in dst.
- _ktestc_mask64_u8 Experimental avx512bw
- Compute the bitwise NOT of 64-bit mask a and then AND with 64-bit mask b, if the result is all zeros, store 1 in dst, otherwise store 0 in dst.
- _ktestz_mask32_u8 Experimental avx512bw
- Compute the bitwise AND of 32-bit masks a and b, if the result is all zeros, store 1 in dst, otherwise store 0 in dst.
- _ktestz_mask64_u8 Experimental avx512bw
- Compute the bitwise AND of 64-bit masks a and b, if the result is all zeros, store 1 in dst, otherwise store 0 in dst.
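The AND-test family above reports two independent conditions. A scalar sketch, with a hypothetical `model_ktest` returning both flags as a tuple instead of writing one of them through a pointer:

```rust
/// Model of the AND-test on 32-bit masks.
/// Returns (and_zero, andn_zero):
///   and_zero  = 1 when a & b has no bits set,
///   andn_zero = 1 when !a & b has no bits set (b is a subset of a).
fn model_ktest(a: u32, b: u32) -> (u8, u8) {
    let and_zero = ((a & b) == 0) as u8;
    let andn_zero = ((!a & b) == 0) as u8;
    (and_zero, andn_zero)
}
```

In this model the first flag corresponds to the `ktestz` variants and the second to the `ktestc` variants.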
- _kxnor_mask32 Experimental avx512bw
- Compute the bitwise XNOR of 32-bit masks a and b, and store the result in k.
- _kxnor_mask64 Experimental avx512bw
- Compute the bitwise XNOR of 64-bit masks a and b, and store the result in k.
- _kxor_mask32 Experimental avx512bw
- Compute the bitwise XOR of 32-bit masks a and b, and store the result in k.
- _kxor_mask64 Experimental avx512bw
- Compute the bitwise XOR of 64-bit masks a and b, and store the result in k.
- _load_mask32 Experimental avx512bw
- Load 32-bit mask from memory into k.
- _load_mask64 Experimental avx512bw
- Load 64-bit mask from memory into k.
- _mm256_cmp_epi8_mask Experimental avx512bw,avx512vl
- Compare packed signed 8-bit integers in a and b based on the comparison operand specified by imm8, and store the results in mask vector k.
- _mm256_cmp_epi16_mask Experimental avx512bw,avx512vl
- Compare packed signed 16-bit integers in a and b based on the comparison operand specified by imm8, and store the results in mask vector k.
- _mm256_cmp_epu8_mask Experimental avx512bw,avx512vl
- Compare packed unsigned 8-bit integers in a and b based on the comparison operand specified by imm8, and store the results in mask vector k.
- _mm256_cmp_epu16_mask Experimental avx512bw,avx512vl
- Compare packed unsigned 16-bit integers in a and b based on the comparison operand specified by imm8, and store the results in mask vector k.
- _mm256_cmpeq_epi8_mask Experimental avx512bw,avx512vl
- Compare packed signed 8-bit integers in a and b for equality, and store the results in mask vector k.
- _mm256_cmpeq_epi16_mask Experimental avx512bw,avx512vl
- Compare packed signed 16-bit integers in a and b for equality, and store the results in mask vector k.
- _mm256_cmpeq_epu8_mask Experimental avx512bw,avx512vl
- Compare packed unsigned 8-bit integers in a and b for equality, and store the results in mask vector k.
- _mm256_cmpeq_epu16_mask Experimental avx512bw,avx512vl
- Compare packed unsigned 16-bit integers in a and b for equality, and store the results in mask vector k.
- _mm256_cmpge_epi8_mask Experimental avx512bw,avx512vl
- Compare packed signed 8-bit integers in a and b for greater-than-or-equal, and store the results in mask vector k.
- _mm256_cmpge_epi16_mask Experimental avx512bw,avx512vl
- Compare packed signed 16-bit integers in a and b for greater-than-or-equal, and store the results in mask vector k.
- _mm256_cmpge_epu8_mask Experimental avx512bw,avx512vl
- Compare packed unsigned 8-bit integers in a and b for greater-than-or-equal, and store the results in mask vector k.
- _mm256_cmpge_epu16_mask Experimental avx512bw,avx512vl
- Compare packed unsigned 16-bit integers in a and b for greater-than-or-equal, and store the results in mask vector k.
- _mm256_cmpgt_epi8_mask Experimental avx512bw,avx512vl
- Compare packed signed 8-bit integers in a and b for greater-than, and store the results in mask vector k.
- _mm256_cmpgt_epi16_mask Experimental avx512bw,avx512vl
- Compare packed signed 16-bit integers in a and b for greater-than, and store the results in mask vector k.
- _mm256_cmpgt_epu8_mask Experimental avx512bw,avx512vl
- Compare packed unsigned 8-bit integers in a and b for greater-than, and store the results in mask vector k.
- _mm256_cmpgt_epu16_mask Experimental avx512bw,avx512vl
- Compare packed unsigned 16-bit integers in a and b for greater-than, and store the results in mask vector k.
- _mm256_cmple_epi8_mask Experimental avx512bw,avx512vl
- Compare packed signed 8-bit integers in a and b for less-than-or-equal, and store the results in mask vector k.
- _mm256_cmple_epi16_mask Experimental avx512bw,avx512vl
- Compare packed signed 16-bit integers in a and b for less-than-or-equal, and store the results in mask vector k.
- _mm256_cmple_epu8_mask Experimental avx512bw,avx512vl
- Compare packed unsigned 8-bit integers in a and b for less-than-or-equal, and store the results in mask vector k.
- _mm256_cmple_epu16_mask Experimental avx512bw,avx512vl
- Compare packed unsigned 16-bit integers in a and b for less-than-or-equal, and store the results in mask vector k.
- _mm256_cmplt_epi8_mask Experimental avx512bw,avx512vl
- Compare packed signed 8-bit integers in a and b for less-than, and store the results in mask vector k.
- _mm256_cmplt_epi16_mask Experimental avx512bw,avx512vl
- Compare packed signed 16-bit integers in a and b for less-than, and store the results in mask vector k.
- _mm256_cmplt_epu8_mask Experimental avx512bw,avx512vl
- Compare packed unsigned 8-bit integers in a and b for less-than, and store the results in mask vector k.
- _mm256_cmplt_epu16_mask Experimental avx512bw,avx512vl
- Compare packed unsigned 16-bit integers in a and b for less-than, and store the results in mask vector k.
- _mm256_cmpneq_epi8_mask Experimental avx512bw,avx512vl
- Compare packed signed 8-bit integers in a and b for not-equal, and store the results in mask vector k.
- _mm256_cmpneq_epi16_mask Experimental avx512bw,avx512vl
- Compare packed signed 16-bit integers in a and b for not-equal, and store the results in mask vector k.
- _mm256_cmpneq_epu8_mask Experimental avx512bw,avx512vl
- Compare packed unsigned 8-bit integers in a and b for not-equal, and store the results in mask vector k.
- _mm256_cmpneq_epu16_mask Experimental avx512bw,avx512vl
- Compare packed unsigned 16-bit integers in a and b for not-equal, and store the results in mask vector k.
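The compare intrinsics above produce one mask bit per element. The sketch below models the 256-bit signed 8-bit case on a plain array of 32 lanes; `model_cmp_epi8_mask` is a hypothetical name, the predicate is a runtime argument for illustration, and the `imm8` encoding follows Intel's `_MM_CMPINT_*` constants (0 EQ, 1 LT, 2 LE, 3 FALSE, 4 NE, 5 NLT, 6 NLE, 7 TRUE).

```rust
/// Scalar model of a signed 8-bit compare producing a 32-bit mask.
/// Bit i of the result is set when the predicate holds for lane i.
fn model_cmp_epi8_mask(a: &[i8; 32], b: &[i8; 32], imm8: u8) -> u32 {
    let mut k = 0u32;
    for i in 0..32 {
        let hit = match imm8 & 0b111 {
            0 => a[i] == b[i], // EQ
            1 => a[i] < b[i],  // LT
            2 => a[i] <= b[i], // LE
            3 => false,        // FALSE
            4 => a[i] != b[i], // NE
            5 => a[i] >= b[i], // NLT (greater-than-or-equal)
            6 => a[i] > b[i],  // NLE (greater-than)
            _ => true,         // TRUE
        };
        if hit {
            k |= 1u32 << i;
        }
    }
    k
}
```

The named variants (`cmpeq`, `cmplt`, `cmple`, `cmpneq`, `cmpge`, `cmpgt`) correspond to fixed predicate values in this model; the `epu8` forms would use `u8` lanes so that the ordering is unsigned.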
- _mm256_cvtepi16_epi8 Experimental avx512bw,avx512vl
- Convert packed 16-bit integers in a to packed 8-bit integers with truncation, and store the results in dst.
- _mm256_cvtsepi16_epi8 Experimental avx512bw,avx512vl
- Convert packed signed 16-bit integers in a to packed 8-bit integers with signed saturation, and store the results in dst.
- _mm256_cvtusepi16_epi8 Experimental avx512bw,avx512vl
- Convert packed unsigned 16-bit integers in a to packed unsigned 8-bit integers with unsigned saturation, and store the results in dst.
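The three narrowing conversions above differ in how out-of-range values are handled. A per-element sketch (hypothetical `model_*` names; the real intrinsics apply the same rule to every lane of the vector):

```rust
/// Truncation: keep the low 8 bits of the 16-bit value.
fn model_cvtepi16_epi8(x: i16) -> i8 {
    x as i8
}

/// Signed saturation: clamp to the i8 range before narrowing.
fn model_cvtsepi16_epi8(x: i16) -> i8 {
    x.clamp(i8::MIN as i16, i8::MAX as i16) as i8
}

/// Unsigned saturation: clamp to the u8 range before narrowing.
fn model_cvtusepi16_epi8(x: u16) -> u8 {
    x.min(u8::MAX as u16) as u8
}
```

With the input 300, truncation gives 44 (the low byte 0x2C), signed saturation gives 127, and unsigned saturation gives 255.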
- _mm256_dbsad_epu8 Experimental avx512bw,avx512vl
- Compute the sum of absolute differences (SADs) of quadruplets of unsigned 8-bit integers in a compared to those in b, and store the 16-bit results in dst. Four SADs are performed on four 8-bit quadruplets for each 64-bit lane. The first two SADs use the lower 8-bit quadruplet of the lane from a, and the last two SADs use the upper 8-bit quadruplet of the lane from a. Quadruplets from b are selected from within 128-bit lanes according to the control in imm8, and each SAD in each 64-bit lane uses the selected quadruplet at 8-bit offsets.
- _mm256_loadu_epi8 Experimental avx512bw,avx512vl
- Load 256 bits (composed of 32 packed 8-bit integers) from memory into dst. mem_addr does not need to be aligned on any particular boundary.
- _mm256_loadu_epi16 Experimental avx512bw,avx512vl
- Load 256 bits (composed of 16 packed 16-bit integers) from memory into dst. mem_addr does not need to be aligned on any particular boundary.
- _mm256_mask2_permutex2var_epi16 Experimental avx512bw,avx512vl
- Shuffle 16-bit integers in a and b across lanes using the corresponding selector and index in idx, and store the results in dst using writemask k (elements are copied from idx when the corresponding mask bit is not set).
- _mm256_mask_abs_epi8 Experimental avx512bw,avx512vl
- Compute the absolute value of packed signed 8-bit integers in a, and store the unsigned results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm256_mask_abs_epi16 Experimental avx512bw,avx512vl
- Compute the absolute value of packed signed 16-bit integers in a, and store the unsigned results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm256_mask_add_epi8 Experimental avx512bw,avx512vl
- Add packed 8-bit integers in a and b, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm256_mask_add_epi16 Experimental avx512bw,avx512vl
- Add packed 16-bit integers in a and b, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm256_mask_adds_epi8 Experimental avx512bw,avx512vl
- Add packed signed 8-bit integers in a and b using saturation, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm256_mask_adds_epi16 Experimental avx512bw,avx512vl
- Add packed signed 16-bit integers in a and b using saturation, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm256_mask_adds_epu8 Experimental avx512bw,avx512vl
- Add packed unsigned 8-bit integers in a and b using saturation, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm256_mask_adds_epu16 Experimental avx512bw,avx512vl
- Add packed unsigned 16-bit integers in a and b using saturation, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
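The writemask behaviour shared by the masked entries above ("elements are copied from src when the corresponding mask bit is not set") can be sketched on arrays. The helper and `model_*` names below are hypothetical; the sketch contrasts the wrapping add with the saturating add for signed 8-bit lanes.

```rust
/// Apply a per-lane result only where the mask bit is set;
/// all other lanes keep the value from `src`.
fn apply_writemask(src: &[i8; 32], k: u32, f: impl Fn(usize) -> i8) -> [i8; 32] {
    let mut dst = *src;
    for i in 0..32 {
        if (k >> i) & 1 == 1 {
            dst[i] = f(i);
        }
    }
    dst
}

/// Masked add: active lanes wrap on overflow.
fn model_mask_add_epi8(src: &[i8; 32], k: u32, a: &[i8; 32], b: &[i8; 32]) -> [i8; 32] {
    apply_writemask(src, k, |i| a[i].wrapping_add(b[i]))
}

/// Masked saturating add: active lanes clamp to the i8 range.
fn model_mask_adds_epi8(src: &[i8; 32], k: u32, a: &[i8; 32], b: &[i8; 32]) -> [i8; 32] {
    apply_writemask(src, k, |i| a[i].saturating_add(b[i]))
}
```

With every lane of `a` and `b` equal to 100 and `k = 0b101`, lanes 0 and 2 become -56 under the wrapping add and 127 under the saturating add, while every other lane keeps its `src` value.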
- _mm256_mask_alignr_epi8 Experimental avx512bw,avx512vl
- Concatenate pairs of 16-byte blocks in a and b into a 32-byte temporary result, shift the result right by imm8 bytes, and store the low 16 bytes in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm256_mask_avg_epu8 Experimental avx512bw,avx512vl
- Average packed unsigned 8-bit integers in a and b, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm256_mask_avg_epu16 Experimental avx512bw,avx512vl
- Average packed unsigned 16-bit integers in a and b, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm256_mask_blend_epi8 Experimental avx512bw,avx512vl
- Blend packed 8-bit integers from a and b using control mask k, and store the results in dst.
- _mm256_mask_blend_epi16 Experimental avx512bw,avx512vl
- Blend packed 16-bit integers from a and b using control mask k, and store the results in dst.
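A blend picks each lane from one of two inputs according to the control mask. The sketch below models the 16-bit case with a hypothetical `model_mask_blend_epi16`; it assumes, following Intel's pseudocode for this operation, that a set mask bit selects the lane from `b` and a clear bit selects it from `a`.

```rust
/// Scalar model of a masked blend over sixteen 16-bit lanes.
/// Bit i of `k` set   -> lane i comes from `b`.
/// Bit i of `k` clear -> lane i comes from `a`.
fn model_mask_blend_epi16(k: u16, a: &[i16; 16], b: &[i16; 16]) -> [i16; 16] {
    let mut dst = *a;
    for i in 0..16 {
        if (k >> i) & 1 == 1 {
            dst[i] = b[i];
        }
    }
    dst
}
```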
- _mm256_mask_broadcastb_epi8 Experimental avx512bw,avx512vl
- Broadcast the low packed 8-bit integer from a to all elements of dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm256_mask_broadcastw_epi16 Experimental avx512bw,avx512vl
- Broadcast the low packed 16-bit integer from a to all elements of dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm256_mask_cmp_epi8_mask Experimental avx512bw,avx512vl
- Compare packed signed 8-bit integers in a and b based on the comparison operand specified by imm8, and store the results in mask vector k using zeromask k1 (elements are zeroed out when the corresponding mask bit is not set).
- _mm256_mask_cmp_epi16_mask Experimental avx512bw,avx512vl
- Compare packed signed 16-bit integers in a and b based on the comparison operand specified by imm8, and store the results in mask vector k using zeromask k1 (elements are zeroed out when the corresponding mask bit is not set).
- _mm256_mask_cmp_epu8_mask Experimental avx512bw,avx512vl
- Compare packed unsigned 8-bit integers in a and b based on the comparison operand specified by imm8, and store the results in mask vector k using zeromask k1 (elements are zeroed out when the corresponding mask bit is not set).
- _mm256_mask_cmp_epu16_mask Experimental avx512bw,avx512vl
- Compare packed unsigned 16-bit integers in a and b based on the comparison operand specified by imm8, and store the results in mask vector k using zeromask k1 (elements are zeroed out when the corresponding mask bit is not set).
- _mm256_mask_cmpeq_epi8_mask Experimental avx512bw,avx512vl
- Compare packed signed 8-bit integers in a and b for equality, and store the results in mask vector k using zeromask k1 (elements are zeroed out when the corresponding mask bit is not set).
- _mm256_mask_cmpeq_epi16_mask Experimental avx512bw,avx512vl
- Compare packed signed 16-bit integers in a and b for equality, and store the results in mask vector k using zeromask k1 (elements are zeroed out when the corresponding mask bit is not set).
- _mm256_mask_cmpeq_epu8_mask Experimental avx512bw,avx512vl
- Compare packed unsigned 8-bit integers in a and b for equality, and store the results in mask vector k using zeromask k1 (elements are zeroed out when the corresponding mask bit is not set).
- _mm256_mask_cmpeq_epu16_mask Experimental avx512bw,avx512vl
- Compare packed unsigned 16-bit integers in a and b for equality, and store the results in mask vector k using zeromask k1 (elements are zeroed out when the corresponding mask bit is not set).
- _mm256_mask_cmpge_epi8_mask Experimental avx512bw,avx512vl
- Compare packed signed 8-bit integers in a and b for greater-than-or-equal, and store the results in mask vector k using zeromask k1 (elements are zeroed out when the corresponding mask bit is not set).
- _mm256_mask_cmpge_epi16_mask Experimental avx512bw,avx512vl
- Compare packed signed 16-bit integers in a and b for greater-than-or-equal, and store the results in mask vector k using zeromask k1 (elements are zeroed out when the corresponding mask bit is not set).
- _mm256_mask_cmpge_epu8_mask Experimental avx512bw,avx512vl
- Compare packed unsigned 8-bit integers in a and b for greater-than-or-equal, and store the results in mask vector k using zeromask k1 (elements are zeroed out when the corresponding mask bit is not set).
- _mm256_mask_cmpge_epu16_mask Experimental avx512bw,avx512vl
- Compare packed unsigned 16-bit integers in a and b for greater-than-or-equal, and store the results in mask vector k using zeromask k1 (elements are zeroed out when the corresponding mask bit is not set).
- _mm256_mask_cmpgt_epi8_mask Experimental avx512bw,avx512vl
- Compare packed signed 8-bit integers in a and b for greater-than, and store the results in mask vector k using zeromask k1 (elements are zeroed out when the corresponding mask bit is not set).
- _mm256_mask_cmpgt_epi16_mask Experimental avx512bw,avx512vl
- Compare packed signed 16-bit integers in a and b for greater-than, and store the results in mask vector k using zeromask k1 (elements are zeroed out when the corresponding mask bit is not set).
- _mm256_mask_cmpgt_epu8_mask Experimental avx512bw,avx512vl
- Compare packed unsigned 8-bit integers in a and b for greater-than, and store the results in mask vector k using zeromask k1 (elements are zeroed out when the corresponding mask bit is not set).
- _mm256_mask_cmpgt_epu16_mask Experimental avx512bw,avx512vl
- Compare packed unsigned 16-bit integers in a and b for greater-than, and store the results in mask vector k using zeromask k1 (elements are zeroed out when the corresponding mask bit is not set).
- _mm256_mask_cmple_epi8_mask Experimental avx512bw,avx512vl
- Compare packed signed 8-bit integers in a and b for less-than-or-equal, and store the results in mask vector k using zeromask k1 (elements are zeroed out when the corresponding mask bit is not set).
- _mm256_mask_cmple_epi16_mask Experimental avx512bw,avx512vl
- Compare packed signed 16-bit integers in a and b for less-than-or-equal, and store the results in mask vector k using zeromask k1 (elements are zeroed out when the corresponding mask bit is not set).
- _mm256_mask_cmple_epu8_mask Experimental avx512bw,avx512vl
- Compare packed unsigned 8-bit integers in a and b for less-than-or-equal, and store the results in mask vector k using zeromask k1 (elements are zeroed out when the corresponding mask bit is not set).
- _mm256_mask_cmple_epu16_mask Experimental avx512bw,avx512vl
- Compare packed unsigned 16-bit integers in a and b for less-than-or-equal, and store the results in mask vector k using zeromask k1 (elements are zeroed out when the corresponding mask bit is not set).
- _mm256_mask_cmplt_epi8_mask Experimental avx512bw,avx512vl
- Compare packed signed 8-bit integers in a and b for less-than, and store the results in mask vector k using zeromask k1 (elements are zeroed out when the corresponding mask bit is not set).
- _mm256_mask_cmplt_epi16_mask Experimental avx512bw,avx512vl
- Compare packed signed 16-bit integers in a and b for less-than, and store the results in mask vector k using zeromask k1 (elements are zeroed out when the corresponding mask bit is not set).
- _mm256_mask_cmplt_epu8_mask Experimental avx512bw,avx512vl
- Compare packed unsigned 8-bit integers in a and b for less-than, and store the results in mask vector k using zeromask k1 (elements are zeroed out when the corresponding mask bit is not set).
- _mm256_mask_cmplt_epu16_mask Experimental avx512bw,avx512vl
- Compare packed unsigned 16-bit integers in a and b for less-than, and store the results in mask vector k using zeromask k1 (elements are zeroed out when the corresponding mask bit is not set).
- _mm256_mask_cmpneq_epi8_mask Experimental avx512bw,avx512vl
- Compare packed signed 8-bit integers in a and b for not-equal, and store the results in mask vector k using zeromask k1 (elements are zeroed out when the corresponding mask bit is not set).
- _mm256_mask_cmpneq_epi16_mask Experimental avx512bw,avx512vl
- Compare packed signed 16-bit integers in a and b for not-equal, and store the results in mask vector k using zeromask k1 (elements are zeroed out when the corresponding mask bit is not set).
- _mm256_mask_cmpneq_epu8_mask Experimental avx512bw,avx512vl
- Compare packed unsigned 8-bit integers in a and b for not-equal, and store the results in mask vector k using zeromask k1 (elements are zeroed out when the corresponding mask bit is not set).
- _mm256_mask_cmpneq_epu16_mask Experimental avx512bw,avx512vl
- Compare packed unsigned 16-bit integers in a and b for not-equal, and store the results in mask vector k using zeromask k1 (elements are zeroed out when the corresponding mask bit is not set).
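The masked compares above add a zeromask `k1`: a result bit can only be set where the corresponding bit of `k1` is set. A scalar sketch for the unsigned 8-bit less-than case, using a hypothetical `model_mask_cmplt_epu8_mask`:

```rust
/// Scalar model of a zeromasked unsigned 8-bit less-than compare.
/// Bit i of the result is set only when bit i of `k1` is set
/// and a[i] < b[i] as unsigned values.
fn model_mask_cmplt_epu8_mask(k1: u32, a: &[u8; 32], b: &[u8; 32]) -> u32 {
    let mut k = 0u32;
    for i in 0..32 {
        if (k1 >> i) & 1 == 1 && a[i] < b[i] {
            k |= 1u32 << i;
        }
    }
    k
}
```

Equivalently, the masked result is the unmasked compare result ANDed with `k1`.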
- _mm256_mask_cvtepi8_epi16 Experimental avx512bw,avx512vl
- Sign extend packed 8-bit integers in a to packed 16-bit integers, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm256_mask_cvtepi16_epi8 Experimental avx512bw,avx512vl
- Convert packed 16-bit integers in a to packed 8-bit integers with truncation, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm256_mask_cvtepi16_storeu_epi8 Experimental avx512bw,avx512vl
- Convert packed 16-bit integers in a to packed 8-bit integers with truncation, and store the active results (those with their respective bit set in writemask k) to unaligned memory at base_addr.
- _mm256_mask_cvtepu8_epi16 Experimental avx512bw,avx512vl
- Zero extend packed unsigned 8-bit integers in a to packed 16-bit integers, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm256_mask_cvtsepi16_epi8 Experimental avx512bw,avx512vl
- Convert packed signed 16-bit integers in a to packed 8-bit integers with signed saturation, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm256_mask_cvtsepi16_storeu_epi8 Experimental avx512bw,avx512vl
- Convert packed signed 16-bit integers in a to packed 8-bit integers with signed saturation, and store the active results (those with their respective bit set in writemask k) to unaligned memory at base_addr.
- _mm256_mask_cvtusepi16_epi8 Experimental avx512bw,avx512vl
- Convert packed unsigned 16-bit integers in a to packed unsigned 8-bit integers with unsigned saturation, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm256_mask_cvtusepi16_storeu_epi8 Experimental avx512bw,avx512vl
- Convert packed unsigned 16-bit integers in a to packed unsigned 8-bit integers with unsigned saturation, and store the active results (those with their respective bit set in writemask k) to unaligned memory at base_addr.
- _mm256_mask_dbsad_epu8 Experimental avx512bw,avx512vl
- Compute the sum of absolute differences (SADs) of quadruplets of unsigned 8-bit integers in a compared to those in b, and store the 16-bit results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set). Four SADs are performed on four 8-bit quadruplets for each 64-bit lane. The first two SADs use the lower 8-bit quadruplet of the lane from a, and the last two SADs use the upper 8-bit quadruplet of the lane from a. Quadruplets from b are selected from within 128-bit lanes according to the control in imm8, and each SAD in each 64-bit lane uses the selected quadruplet at 8-bit offsets.
- _mm256_mask_loadu_epi8 Experimental avx512bw,avx512vl
- Load packed 8-bit integers from memory into dst using writemask k (elements are copied from src when the corresponding mask bit is not set). mem_addr does not need to be aligned on any particular boundary.
- _mm256_mask_loadu_epi16 Experimental avx512bw,avx512vl
- Load packed 16-bit integers from memory into dst using writemask k (elements are copied from src when the corresponding mask bit is not set). mem_addr does not need to be aligned on any particular boundary.
- _mm256_mask_madd_epi16 Experimental avx512bw,avx512vl
- Multiply packed signed 16-bit integers in a and b, producing intermediate signed 32-bit integers. Horizontally add adjacent pairs of intermediate 32-bit integers, and pack the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm256_mask_maddubs_epi16 Experimental avx512bw,avx512vl
- Multiply packed unsigned 8-bit integers in a by packed signed 8-bit integers in b, producing intermediate signed 16-bit integers. Horizontally add adjacent pairs of intermediate signed 16-bit integers, and pack the saturated results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
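The two multiply-add entries above both widen, multiply, and then add adjacent pairs, but they differ in element types and in overflow handling. The sketch below models only the arithmetic core, without the writemask step; the `model_*` names are hypothetical, and the wrapping behaviour of the 32-bit sum in `model_madd_epi16` is an assumption about the single overflowing input combination.

```rust
/// Multiply signed 16-bit lanes, then add adjacent 32-bit products.
/// Sixteen inputs produce eight 32-bit outputs.
fn model_madd_epi16(a: &[i16; 16], b: &[i16; 16]) -> [i32; 8] {
    let mut dst = [0i32; 8];
    for j in 0..8 {
        let lo = i32::from(a[2 * j]) * i32::from(b[2 * j]);
        let hi = i32::from(a[2 * j + 1]) * i32::from(b[2 * j + 1]);
        // Assumed to wrap; only reachable when all four inputs are i16::MIN.
        dst[j] = lo.wrapping_add(hi);
    }
    dst
}

/// Multiply unsigned 8-bit lanes of `a` by signed 8-bit lanes of `b`,
/// then add adjacent products with signed 16-bit saturation.
fn model_maddubs_epi16(a: &[u8; 32], b: &[i8; 32]) -> [i16; 16] {
    let mut dst = [0i16; 16];
    for j in 0..16 {
        let lo = i32::from(a[2 * j]) * i32::from(b[2 * j]);
        let hi = i32::from(a[2 * j + 1]) * i32::from(b[2 * j + 1]);
        dst[j] = (lo + hi).clamp(i32::from(i16::MIN), i32::from(i16::MAX)) as i16;
    }
    dst
}
```

In the masked forms, each output lane computed this way would be kept only where the corresponding writemask bit is set, and taken from `src` otherwise.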
- _mm256_mask_max_epi8 Experimental avx512bw,avx512vl
- Compare packed signed 8-bit integers in a and b, and store packed maximum values in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm256_mask_max_epi16 Experimental avx512bw,avx512vl
- Compare packed signed 16-bit integers in a and b, and store packed maximum values in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm256_mask_max_epu8 Experimental avx512bw,avx512vl
- Compare packed unsigned 8-bit integers in a and b, and store packed maximum values in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm256_mask_max_epu16 Experimental avx512bw,avx512vl
- Compare packed unsigned 16-bit integers in a and b, and store packed maximum values in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm256_mask_min_epi8 Experimental avx512bw,avx512vl
- Compare packed signed 8-bit integers in a and b, and store packed minimum values in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm256_mask_min_epi16 Experimental avx512bw,avx512vl
- Compare packed signed 16-bit integers in a and b, and store packed minimum values in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm256_mask_min_epu8 Experimental avx512bw,avx512vl
- Compare packed unsigned 8-bit integers in a and b, and store packed minimum values in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm256_mask_min_epu16 Experimental avx512bw,avx512vl
- Compare packed unsigned 16-bit integers in a and b, and store packed minimum values in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm256_mask_mov_epi8 Experimental avx512bw,avx512vl
- Move packed 8-bit integers from a into dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm256_mask_mov_epi16 Experimental avx512bw,avx512vl
- Move packed 16-bit integers from a into dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm256_mask_mulhi_epi16 Experimental avx512bw,avx512vl
- Multiply the packed signed 16-bit integers in a and b, producing intermediate 32-bit integers, and store the high 16 bits of the intermediate integers in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm256_mask_mulhi_epu16 Experimental avx512bw,avx512vl
- Multiply the packed unsigned 16-bit integers in a and b, producing intermediate 32-bit integers, and store the high 16 bits of the intermediate integers in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm256_mask_mulhrs_epi16 Experimental avx512bw,avx512vl
- Multiply packed signed 16-bit integers in a and b, producing intermediate signed 32-bit integers. Truncate each intermediate integer to the 18 most significant bits, round by adding 1, and store bits [16:1] to dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm256_mask_mullo_epi16 Experimental avx512bw,avx512vl
- Multiply the packed 16-bit integers in a and b, producing intermediate 32-bit integers, and store the low 16 bits of the intermediate integers in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
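The four 16-bit multiplies above all form a 32-bit product per lane and then keep a different slice of it. A per-element sketch with hypothetical `model_*` names; the rounding step in `model_mulhrs_epi16` follows the description above (drop the low 14 bits, add 1, then take bits [16:1]).

```rust
/// High 16 bits of the signed 32-bit product.
fn model_mulhi_epi16(a: i16, b: i16) -> i16 {
    ((i32::from(a) * i32::from(b)) >> 16) as i16
}

/// High 16 bits of the unsigned 32-bit product.
fn model_mulhi_epu16(a: u16, b: u16) -> u16 {
    ((u32::from(a) * u32::from(b)) >> 16) as u16
}

/// Low 16 bits of the product.
fn model_mullo_epi16(a: i16, b: i16) -> i16 {
    a.wrapping_mul(b)
}

/// Rounded high multiply: keep the 18 most significant bits of the
/// product, add 1 to round, then take bits [16:1].
fn model_mulhrs_epi16(a: i16, b: i16) -> i16 {
    let t = ((i32::from(a) * i32::from(b)) >> 14) + 1;
    (t >> 1) as i16
}
```

Reading the inputs as Q15 fixed-point values, `model_mulhrs_epi16(0x4000, 0x4000)` returns 8192, that is 0.5 × 0.5 = 0.25.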
- _mm256_mask_packs_epi16 Experimental avx512bw,avx512vl
- Convert packed signed 16-bit integers from a and b to packed 8-bit integers using signed saturation, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm256_mask_packs_epi32 Experimental avx512bw,avx512vl
- Convert packed signed 32-bit integers from a and b to packed 16-bit integers using signed saturation, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm256_mask_packus_epi16 Experimental avx512bw,avx512vl
- Convert packed signed 16-bit integers from a and b to packed 8-bit integers using unsigned saturation, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm256_mask_packus_epi32 Experimental avx512bw,avx512vl
- Convert packed signed 32-bit integers from a and b to packed 16-bit integers using unsigned saturation, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm256_mask_permutex2var_epi16 Experimental avx512bw,avx512vl
- Shuffle 16-bit integers in a and b across lanes using the corresponding selector and index in idx, and store the results in dst using writemask k (elements are copied from a when the corresponding mask bit is not set).
- _mm256_mask_permutexvar_epi16 Experimental avx512bw,avx512vl
- Shuffle 16-bit integers in a across lanes using the corresponding index in idx, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm256_mask_reduce_add_epi8 Experimental avx512bw,avx512vl
- Reduce the packed 8-bit integers in a by addition using mask k. Returns the sum of all active elements in a.
- _mm256_mask_reduce_add_epi16 Experimental avx512bw,avx512vl
- Reduce the packed 16-bit integers in a by addition using mask k. Returns the sum of all active elements in a.
- _mm256_mask_reduce_and_epi8 Experimental avx512bw,avx512vl
- Reduce the packed 8-bit integers in a by bitwise AND using mask k. Returns the bitwise AND of all active elements in a.
- _mm256_mask_reduce_and_epi16 Experimental avx512bw,avx512vl
- Reduce the packed 16-bit integers in a by bitwise AND using mask k. Returns the bitwise AND of all active elements in a.
- _mm256_mask_reduce_max_epi8 Experimental avx512bw,avx512vl
- Reduce the packed 8-bit integers in a by maximum using mask k. Returns the maximum of all active elements in a.
- _mm256_mask_reduce_max_epi16 Experimental avx512bw,avx512vl
- Reduce the packed 16-bit integers in a by maximum using mask k. Returns the maximum of all active elements in a.
- _mm256_mask_reduce_max_epu8 Experimental avx512bw,avx512vl
- Reduce the packed unsigned 8-bit integers in a by maximum using mask k. Returns the maximum of all active elements in a.
- _mm256_mask_reduce_max_epu16 Experimental avx512bw,avx512vl
- Reduce the packed unsigned 16-bit integers in a by maximum using mask k. Returns the maximum of all active elements in a.
- _mm256_mask_reduce_min_epi8 Experimental avx512bw,avx512vl
- Reduce the packed 8-bit integers in a by minimum using mask k. Returns the minimum of all active elements in a.
- _mm256_mask_reduce_min_epi16 Experimental avx512bw,avx512vl
- Reduce the packed 16-bit integers in a by minimum using mask k. Returns the minimum of all active elements in a.
- _mm256_mask_reduce_min_epu8 Experimental avx512bw,avx512vl
- Reduce the packed unsigned 8-bit integers in a by minimum using mask k. Returns the minimum of all active elements in a.
- _mm256_mask_reduce_min_epu16 Experimental avx512bw,avx512vl
- Reduce the packed unsigned 16-bit integers in a by minimum using mask k. Returns the minimum of all active elements in a.
- _mm256_mask_reduce_mul_epi8 Experimental avx512bw,avx512vl
- Reduce the packed 8-bit integers in a by multiplication using mask k. Returns the product of all active elements in a.
- _mm256_mask_reduce_mul_epi16 Experimental avx512bw,avx512vl
- Reduce the packed 16-bit integers in a by multiplication using mask k. Returns the product of all active elements in a.
- _mm256_mask_reduce_or_epi8 Experimental avx512bw,avx512vl
- Reduce the packed 8-bit integers in a by bitwise OR using mask k. Returns the bitwise OR of all active elements in a.
- _mm256_mask_reduce_or_epi16 Experimental avx512bw,avx512vl
- Reduce the packed 16-bit integers in a by bitwise OR using mask k. Returns the bitwise OR of all active elements in a.
- _mm256_mask_set1_epi8 Experimental avx512bw,avx512vl
- Broadcast 8-bit integer a to all elements of dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm256_mask_set1_epi16 Experimental avx512bw,avx512vl
- Broadcast 16-bit integer a to all elements of dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm256_mask_shuffle_epi8 Experimental avx512bw,avx512vl
- Shuffle 8-bit integers in a within 128-bit lanes using the control in the corresponding 8-bit element of b, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm256_mask_shufflehi_epi16 Experimental avx512bw,avx512vl
- Shuffle 16-bit integers in the high 64 bits of 128-bit lanes of a using the control in imm8. Store the results in the high 64 bits of 128-bit lanes of dst, with the low 64 bits of 128-bit lanes being copied from a to dst, using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm256_mask_shufflelo_epi16 Experimental avx512bw,avx512vl
- Shuffle 16-bit integers in the low 64 bits of 128-bit lanes of a using the control in imm8. Store the results in the low 64 bits of 128-bit lanes of dst, with the high 64 bits of 128-bit lanes being copied from a to dst, using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm256_mask_sll_epi16 Experimental avx512bw,avx512vl
- Shift packed 16-bit integers in a left by count while shifting in zeros, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm256_mask_slli_epi16 Experimental avx512bw,avx512vl
- Shift packed 16-bit integers in a left by imm8 while shifting in zeros, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm256_mask_sllv_epi16 Experimental avx512bw,avx512vl
- Shift packed 16-bit integers in a left by the amount specified by the corresponding element in count while shifting in zeros, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm256_mask_sra_epi16 Experimental avx512bw,avx512vl
- Shift packed 16-bit integers in a right by count while shifting in sign bits, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm256_mask_srai_epi16 Experimental avx512bw,avx512vl
- Shift packed 16-bit integers in a right by imm8 while shifting in sign bits, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm256_mask_srav_epi16 Experimental avx512bw,avx512vl
- Shift packed 16-bit integers in a right by the amount specified by the corresponding element in count while shifting in sign bits, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm256_mask_srl_epi16 Experimental avx512bw,avx512vl
- Shift packed 16-bit integers in a right by count while shifting in zeros, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm256_mask_srli_epi16 Experimental avx512bw,avx512vl
- Shift packed 16-bit integers in a right by imm8 while shifting in zeros, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm256_mask_srlv_epi16 Experimental avx512bw,avx512vl
- Shift packed 16-bit integers in a right by the amount specified by the corresponding element in count while shifting in zeros, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm256_mask_storeu_epi8 Experimental avx512bw,avx512vl
- Store packed 8-bit integers from a into memory using writemask k. mem_addr does not need to be aligned on any particular boundary.
- _mm256_mask_storeu_epi16 Experimental avx512bw,avx512vl
- Store packed 16-bit integers from a into memory using writemask k. mem_addr does not need to be aligned on any particular boundary.
- _mm256_mask_sub_epi8 Experimental avx512bw,avx512vl
- Subtract packed 8-bit integers in b from packed 8-bit integers in a, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm256_mask_sub_epi16 Experimental avx512bw,avx512vl
- Subtract packed 16-bit integers in b from packed 16-bit integers in a, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm256_mask_subs_epi8 Experimental avx512bw,avx512vl
- Subtract packed signed 8-bit integers in b from packed 8-bit integers in a using saturation, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm256_mask_subs_epi16 Experimental avx512bw,avx512vl
- Subtract packed signed 16-bit integers in b from packed 16-bit integers in a using saturation, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm256_mask_subs_epu8 Experimental avx512bw,avx512vl
- Subtract packed unsigned 8-bit integers in b from packed unsigned 8-bit integers in a using saturation, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm256_mask_subs_epu16 Experimental avx512bw,avx512vl
- Subtract packed unsigned 16-bit integers in b from packed unsigned 16-bit integers in a using saturation, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm256_mask_test_epi8_mask Experimental avx512bw,avx512vl
- Compute the bitwise AND of packed 8-bit integers in a and b, producing intermediate 8-bit values, and set the corresponding bit in result mask k (subject to writemask k) if the intermediate value is non-zero.
- _mm256_mask_test_epi16_mask Experimental avx512bw,avx512vl
- Compute the bitwise AND of packed 16-bit integers in a and b, producing intermediate 16-bit values, and set the corresponding bit in result mask k (subject to writemask k) if the intermediate value is non-zero.
- _mm256_mask_testn_epi8_mask Experimental avx512bw,avx512vl
- Compute the bitwise AND of packed 8-bit integers in a and b, producing intermediate 8-bit values, and set the corresponding bit in result mask k (subject to writemask k) if the intermediate value is zero.
- _mm256_mask_testn_epi16_mask Experimental avx512bw,avx512vl
- Compute the bitwise AND of packed 16-bit integers in a and b, producing intermediate 16-bit values, and set the corresponding bit in result mask k (subject to writemask k) if the intermediate value is zero.
- _mm256_mask_unpackhi_epi8 Experimental avx512bw,avx512vl
- Unpack and interleave 8-bit integers from the high half of each 128-bit lane in a and b, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm256_mask_unpackhi_epi16 Experimental avx512bw,avx512vl
- Unpack and interleave 16-bit integers from the high half of each 128-bit lane in a and b, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm256_mask_unpacklo_epi8 Experimental avx512bw,avx512vl
- Unpack and interleave 8-bit integers from the low half of each 128-bit lane in a and b, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm256_mask_unpacklo_epi16 Experimental avx512bw,avx512vl
- Unpack and interleave 16-bit integers from the low half of each 128-bit lane in a and b, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
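The `mask_` entries above all share the same writemask rule: a lane receives the computed result when its bit in k is set, and otherwise keeps the value from src. The following is a minimal scalar sketch of that rule in plain Rust, not the intrinsics themselves. The helper names `mask_op_epi8` and `mask_subs_epi8_model` are hypothetical, and the 256-bit vector is modelled as an array of 32 `i8` lanes with bit i of k controlling lane i.

```rust
/// Scalar model of a writemask operation on 32 lanes of i8 (hypothetical helper).
/// Lane i receives op(a[i], b[i]) when bit i of k is set, otherwise src[i].
fn mask_op_epi8(
    src: [i8; 32],
    k: u32,
    a: [i8; 32],
    b: [i8; 32],
    op: impl Fn(i8, i8) -> i8,
) -> [i8; 32] {
    // Start from src so that inactive lanes are simply left untouched.
    let mut dst = src;
    for i in 0..32 {
        if (k >> i) & 1 == 1 {
            dst[i] = op(a[i], b[i]);
        }
    }
    dst
}

/// Model of the `mask_subs_epi8` entry: signed saturating subtraction
/// of b from a under a writemask.
fn mask_subs_epi8_model(src: [i8; 32], k: u32, a: [i8; 32], b: [i8; 32]) -> [i8; 32] {
    mask_op_epi8(src, k, a, b, i8::saturating_sub)
}
```

Calling `mask_subs_epi8_model` with only a few bits of k set shows the saturated difference in those lanes and the src value everywhere else.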
- _mm256_maskz_abs_epi8 Experimental avx512bw,avx512vl
- Compute the absolute value of packed signed 8-bit integers in a, and store the unsigned results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm256_maskz_abs_epi16 Experimental avx512bw,avx512vl
- Compute the absolute value of packed signed 16-bit integers in a, and store the unsigned results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm256_maskz_add_epi8 Experimental avx512bw,avx512vl
- Add packed 8-bit integers in a and b, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm256_maskz_add_epi16 Experimental avx512bw,avx512vl
- Add packed 16-bit integers in a and b, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm256_maskz_adds_epi8 Experimental avx512bw,avx512vl
- Add packed signed 8-bit integers in a and b using saturation, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm256_maskz_adds_epi16 Experimental avx512bw,avx512vl
- Add packed signed 16-bit integers in a and b using saturation, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm256_maskz_adds_epu8 Experimental avx512bw,avx512vl
- Add packed unsigned 8-bit integers in a and b using saturation, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm256_maskz_adds_epu16 Experimental avx512bw,avx512vl
- Add packed unsigned 16-bit integers in a and b using saturation, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm256_maskz_alignr_epi8 Experimental avx512bw,avx512vl
- Concatenate pairs of 16-byte blocks in a and b into a 32-byte temporary result, shift the result right by imm8 bytes, and store the low 16 bytes in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm256_maskz_avg_epu8 Experimental avx512bw,avx512vl
- Average packed unsigned 8-bit integers in a and b, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm256_maskz_avg_epu16 Experimental avx512bw,avx512vl
- Average packed unsigned 16-bit integers in a and b, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm256_maskz_broadcastb_epi8 Experimental avx512bw,avx512vl
- Broadcast the low packed 8-bit integer from a to all elements of dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm256_maskz_broadcastw_epi16 Experimental avx512bw,avx512vl
- Broadcast the low packed 16-bit integer from a to all elements of dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm256_maskz_cvtepi8_epi16 Experimental avx512bw,avx512vl
- Sign extend packed 8-bit integers in a to packed 16-bit integers, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm256_maskz_cvtepi16_epi8 Experimental avx512bw,avx512vl
- Convert packed 16-bit integers in a to packed 8-bit integers with truncation, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm256_maskz_cvtepu8_epi16 Experimental avx512bw,avx512vl
- Zero extend packed unsigned 8-bit integers in a to packed 16-bit integers, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm256_maskz_cvtsepi16_epi8 Experimental avx512bw,avx512vl
- Convert packed signed 16-bit integers in a to packed 8-bit integers with signed saturation, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm256_maskz_cvtusepi16_epi8 Experimental avx512bw,avx512vl
- Convert packed unsigned 16-bit integers in a to packed unsigned 8-bit integers with unsigned saturation, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm256_maskz_dbsad_epu8 Experimental avx512bw,avx512vl
- Compute the sum of absolute differences (SADs) of quadruplets of unsigned 8-bit integers in a compared to those in b, and store the 16-bit results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set). Four SADs are performed on four 8-bit quadruplets for each 64-bit lane. The first two SADs use the lower 8-bit quadruplet of the lane from a, and the last two SADs use the upper 8-bit quadruplet of the lane from a. Quadruplets from b are selected from within 128-bit lanes according to the control in imm8, and each SAD in each 64-bit lane uses the selected quadruplet at 8-bit offsets.
- _mm256_maskz_loadu_epi8 Experimental avx512bw,avx512vl
- Load packed 8-bit integers from memory into dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set). mem_addr does not need to be aligned on any particular boundary.
- _mm256_maskz_loadu_epi16 Experimental avx512bw,avx512vl
- Load packed 16-bit integers from memory into dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set). mem_addr does not need to be aligned on any particular boundary.
- _mm256_maskz_madd_epi16 Experimental avx512bw,avx512vl
- Multiply packed signed 16-bit integers in a and b, producing intermediate signed 32-bit integers. Horizontally add adjacent pairs of intermediate 32-bit integers, and pack the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm256_maskz_maddubs_epi16 Experimental avx512bw,avx512vl
- Multiply packed unsigned 8-bit integers in a by packed signed 8-bit integers in b, producing intermediate signed 16-bit integers. Horizontally add adjacent pairs of intermediate signed 16-bit integers, and pack the saturated results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm256_maskz_max_epi8 Experimental avx512bw,avx512vl
- Compare packed signed 8-bit integers in a and b, and store packed maximum values in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm256_maskz_max_epi16 Experimental avx512bw,avx512vl
- Compare packed signed 16-bit integers in a and b, and store packed maximum values in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm256_maskz_max_epu8 Experimental avx512bw,avx512vl
- Compare packed unsigned 8-bit integers in a and b, and store packed maximum values in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm256_maskz_max_epu16 Experimental avx512bw,avx512vl
- Compare packed unsigned 16-bit integers in a and b, and store packed maximum values in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm256_maskz_min_epi8 Experimental avx512bw,avx512vl
- Compare packed signed 8-bit integers in a and b, and store packed minimum values in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm256_maskz_min_epi16 Experimental avx512bw,avx512vl
- Compare packed signed 16-bit integers in a and b, and store packed minimum values in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm256_maskz_min_epu8 Experimental avx512bw,avx512vl
- Compare packed unsigned 8-bit integers in a and b, and store packed minimum values in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm256_maskz_min_epu16 Experimental avx512bw,avx512vl
- Compare packed unsigned 16-bit integers in a and b, and store packed minimum values in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm256_maskz_mov_epi8 Experimental avx512bw,avx512vl
- Move packed 8-bit integers from a into dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm256_maskz_mov_epi16 Experimental avx512bw,avx512vl
- Move packed 16-bit integers from a into dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm256_maskz_mulhi_epi16 Experimental avx512bw,avx512vl
- Multiply the packed signed 16-bit integers in a and b, producing intermediate 32-bit integers, and store the high 16 bits of the intermediate integers in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm256_maskz_mulhi_epu16 Experimental avx512bw,avx512vl
- Multiply the packed unsigned 16-bit integers in a and b, producing intermediate 32-bit integers, and store the high 16 bits of the intermediate integers in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm256_maskz_mulhrs_epi16 Experimental avx512bw,avx512vl
- Multiply packed signed 16-bit integers in a and b, producing intermediate signed 32-bit integers. Truncate each intermediate integer to the 18 most significant bits, round by adding 1, and store bits [16:1] to dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm256_maskz_mullo_epi16 Experimental avx512bw,avx512vl
- Multiply the packed 16-bit integers in a and b, producing intermediate 32-bit integers, and store the low 16 bits of the intermediate integers in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm256_maskz_packs_epi16 Experimental avx512bw,avx512vl
- Convert packed signed 16-bit integers from a and b to packed 8-bit integers using signed saturation, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm256_maskz_packs_epi32 Experimental avx512bw,avx512vl
- Convert packed signed 32-bit integers from a and b to packed 16-bit integers using signed saturation, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm256_maskz_packus_epi16 Experimental avx512bw,avx512vl
- Convert packed signed 16-bit integers from a and b to packed 8-bit integers using unsigned saturation, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm256_maskz_packus_epi32 Experimental avx512bw,avx512vl
- Convert packed signed 32-bit integers from a and b to packed 16-bit integers using unsigned saturation, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm256_maskz_permutex2var_epi16 Experimental avx512bw,avx512vl
- Shuffle 16-bit integers in a and b across lanes using the corresponding selector and index in idx, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm256_maskz_permutexvar_epi16 Experimental avx512bw,avx512vl
- Shuffle 16-bit integers in a across lanes using the corresponding index in idx, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm256_maskz_set1_epi8 Experimental avx512bw,avx512vl
- Broadcast 8-bit integer a to all elements of dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm256_maskz_set1_epi16 Experimental avx512bw,avx512vl
- Broadcast 16-bit integer a to all elements of dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm256_maskz_shuffle_epi8 Experimental avx512bw,avx512vl
- Shuffle packed 8-bit integers in a according to shuffle control mask in the corresponding 8-bit element of b, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm256_maskz_shufflehi_epi16 Experimental avx512bw,avx512vl
- Shuffle 16-bit integers in the high 64 bits of 128-bit lanes of a using the control in imm8. Store the results in the high 64 bits of 128-bit lanes of dst, with the low 64 bits of 128-bit lanes being copied from a to dst, using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm256_maskz_shufflelo_epi16 Experimental avx512bw,avx512vl
- Shuffle 16-bit integers in the low 64 bits of 128-bit lanes of a using the control in imm8. Store the results in the low 64 bits of 128-bit lanes of dst, with the high 64 bits of 128-bit lanes being copied from a to dst, using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm256_maskz_sll_epi16 Experimental avx512bw,avx512vl
- Shift packed 16-bit integers in a left by count while shifting in zeros, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm256_maskz_slli_epi16 Experimental avx512bw,avx512vl
- Shift packed 16-bit integers in a left by imm8 while shifting in zeros, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm256_maskz_sllv_epi16 Experimental avx512bw,avx512vl
- Shift packed 16-bit integers in a left by the amount specified by the corresponding element in count while shifting in zeros, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm256_maskz_sra_epi16 Experimental avx512bw,avx512vl
- Shift packed 16-bit integers in a right by count while shifting in sign bits, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm256_maskz_srai_epi16 Experimental avx512bw,avx512vl
- Shift packed 16-bit integers in a right by imm8 while shifting in sign bits, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm256_maskz_srav_epi16 Experimental avx512bw,avx512vl
- Shift packed 16-bit integers in a right by the amount specified by the corresponding element in count while shifting in sign bits, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm256_maskz_srl_epi16 Experimental avx512bw,avx512vl
- Shift packed 16-bit integers in a right by count while shifting in zeros, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm256_maskz_srli_epi16 Experimental avx512bw,avx512vl
- Shift packed 16-bit integers in a right by imm8 while shifting in zeros, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm256_maskz_srlv_epi16 Experimental avx512bw,avx512vl
- Shift packed 16-bit integers in a right by the amount specified by the corresponding element in count while shifting in zeros, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm256_maskz_sub_epi8 Experimental avx512bw,avx512vl
- Subtract packed 8-bit integers in b from packed 8-bit integers in a, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm256_maskz_sub_epi16 Experimental avx512bw,avx512vl
- Subtract packed 16-bit integers in b from packed 16-bit integers in a, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm256_maskz_subs_epi8 Experimental avx512bw,avx512vl
- Subtract packed signed 8-bit integers in b from packed 8-bit integers in a using saturation, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm256_maskz_subs_epi16 Experimental avx512bw,avx512vl
- Subtract packed signed 16-bit integers in b from packed 16-bit integers in a using saturation, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm256_maskz_subs_epu8 Experimental avx512bw,avx512vl
- Subtract packed unsigned 8-bit integers in b from packed unsigned 8-bit integers in a using saturation, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm256_maskz_subs_epu16 Experimental avx512bw,avx512vl
- Subtract packed unsigned 16-bit integers in b from packed unsigned 16-bit integers in a using saturation, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm256_maskz_unpackhi_epi8 Experimental avx512bw,avx512vl
- Unpack and interleave 8-bit integers from the high half of each 128-bit lane in a and b, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm256_maskz_unpackhi_epi16 Experimental avx512bw,avx512vl
- Unpack and interleave 16-bit integers from the high half of each 128-bit lane in a and b, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm256_maskz_unpacklo_epi8 Experimental avx512bw,avx512vl
- Unpack and interleave 8-bit integers from the low half of each 128-bit lane in a and b, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm256_maskz_unpacklo_epi16 Experimental avx512bw,avx512vl
- Unpack and interleave 16-bit integers from the low half of each 128-bit lane in a and b, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
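The `maskz_` entries above zero every lane whose mask bit is clear instead of copying it from a src operand. As an illustration, here is a scalar sketch of the `maskz_mulhrs_epi16` description, which also spells out the "truncate to 18 bits, add 1, keep bits [16:1]" rounding as plain arithmetic. The function names `mulhrs_lane` and `maskz_mulhrs_epi16_model` are hypothetical, and the vector is modelled as 16 `i16` lanes with bit i of k controlling lane i. This models the documented behaviour and does not call the intrinsic.

```rust
/// One lane of the rounding high multiply described for `mulhrs`:
/// widen to 32 bits, keep the top 18 bits (shift right by 14),
/// add 1 to round, then drop the lowest bit and keep 16 bits.
fn mulhrs_lane(a: i16, b: i16) -> i16 {
    let product = i32::from(a) * i32::from(b);
    (((product >> 14) + 1) >> 1) as i16
}

/// Scalar model of the zeromask variant on 16 lanes of i16 (hypothetical helper).
/// Lanes whose bit in k is clear are zeroed rather than copied from anywhere.
fn maskz_mulhrs_epi16_model(k: u16, a: [i16; 16], b: [i16; 16]) -> [i16; 16] {
    let mut dst = [0i16; 16];
    for i in 0..16 {
        if (k >> i) & 1 == 1 {
            dst[i] = mulhrs_lane(a[i], b[i]);
        }
    }
    dst
}
```

Read as Q15 fixed point, `mulhrs_lane(16384, 16384)` multiplies 0.5 by 0.5 and yields 8192, which is 0.25.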
- _mm256_movepi8_mask Experimental avx512bw,avx512vl
- Set each bit of mask register k based on the most significant bit of the corresponding packed 8-bit integer in a.
- _mm256_movepi16_mask Experimental avx512bw,avx512vl
- Set each bit of mask register k based on the most significant bit of the corresponding packed 16-bit integer in a.
- _mm256_movm_epi8 Experimental avx512bw,avx512vl
- Set each packed 8-bit integer in dst to all ones or all zeros based on the value of the corresponding bit in k.
- _mm256_movm_epi16 Experimental avx512bw,avx512vl
- Set each packed 16-bit integer in dst to all ones or all zeros based on the value of the corresponding bit in k.
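The `movepi8_mask` and `movm_epi8` entries above describe conversions in opposite directions between a vector and a mask register. A minimal scalar sketch of the 8-bit pair follows, assuming 32 `i8` lanes and a `u32` mask in which bit i corresponds to lane i. The names `movepi8_mask_model` and `movm_epi8_model` are hypothetical stand-ins, not the intrinsics.

```rust
/// Model of `movepi8_mask`: bit i of the result is the most significant
/// (sign) bit of lane i.
fn movepi8_mask_model(a: [i8; 32]) -> u32 {
    let mut k = 0u32;
    for (i, &lane) in a.iter().enumerate() {
        if lane < 0 {
            k |= 1 << i;
        }
    }
    k
}

/// Model of `movm_epi8`: lane i becomes all ones (-1 as i8) when bit i
/// of k is set, and all zeros otherwise.
fn movm_epi8_model(k: u32) -> [i8; 32] {
    let mut dst = [0i8; 32];
    for (i, lane) in dst.iter_mut().enumerate() {
        if (k >> i) & 1 == 1 {
            *lane = -1;
        }
    }
    dst
}
```

In this model, expanding a mask with `movm_epi8_model` and converting it back with `movepi8_mask_model` returns the original mask, because an all-ones lane has its sign bit set.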
- _mm256_permutex2var_epi16 Experimental avx512bw,avx512vl
- Shuffle 16-bit integers in a and b across lanes using the corresponding selector and index in idx, and store the results in dst.
- _mm256_permutexvar_epi16 Experimental avx512bw,avx512vl
- Shuffle 16-bit integers in a across lanes using the corresponding index in idx, and store the results in dst.
- _mm256_reduce_add_epi8 Experimental avx512bw,avx512vl
- Reduce the packed 8-bit integers in a by addition. Returns the sum of all elements in a.
- _mm256_
reduce_ โadd_ epi16 Experimental avx512bw,avx512vl
- Reduce the packed 16-bit integers in a by addition. Returns the sum of all elements in a.
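A horizontal reduction can be pictured as a fold over the lanes. The sketch below is a scalar model under the assumption that the 16-bit sum wraps on overflow (the listing does not state the overflow behaviour); `reduce_add_epi16_model` is a hypothetical name.

```rust
/// Scalar model (hypothetical helper) of a horizontal add over 16 lanes.
/// Assumption: the result is a 16-bit value that wraps on overflow.
fn reduce_add_epi16_model(lanes: [i16; 16]) -> i16 {
    lanes.iter().fold(0i16, |acc, &x| acc.wrapping_add(x))
}
```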
- _mm256_
reduce_ โand_ epi8 Experimental avx512bw,avx512vl
- Reduce the packed 8-bit integers in a by bitwise AND. Returns the bitwise AND of all elements in a.
- _mm256_
reduce_ โand_ epi16 Experimental avx512bw,avx512vl
- Reduce the packed 16-bit integers in a by bitwise AND. Returns the bitwise AND of all elements in a.
- _mm256_
reduce_ โmax_ epi8 Experimental avx512bw,avx512vl
- Reduce the packed 8-bit integers in a by maximum. Returns the maximum of all elements in a.
- _mm256_
reduce_ โmax_ epi16 Experimental avx512bw,avx512vl
- Reduce the packed 16-bit integers in a by maximum. Returns the maximum of all elements in a.
- _mm256_
reduce_ โmax_ epu8 Experimental avx512bw,avx512vl
- Reduce the packed unsigned 8-bit integers in a by maximum. Returns the maximum of all elements in a.
- _mm256_
reduce_ โmax_ epu16 Experimental avx512bw,avx512vl
- Reduce the packed unsigned 16-bit integers in a by maximum. Returns the maximum of all elements in a.
- _mm256_
reduce_ โmin_ epi8 Experimental avx512bw,avx512vl
- Reduce the packed 8-bit integers in a by minimum. Returns the minimum of all elements in a.
- _mm256_
reduce_ โmin_ epi16 Experimental avx512bw,avx512vl
- Reduce the packed 16-bit integers in a by minimum. Returns the minimum of all elements in a.
- _mm256_
reduce_ โmin_ epu8 Experimental avx512bw,avx512vl
- Reduce the packed unsigned 8-bit integers in a by minimum. Returns the minimum of all elements in a.
- _mm256_
reduce_ โmin_ epu16 Experimental avx512bw,avx512vl
- Reduce the packed unsigned 16-bit integers in a by minimum. Returns the minimum of all elements in a.
- _mm256_
reduce_ โmul_ epi8 Experimental avx512bw,avx512vl
- Reduce the packed 8-bit integers in a by multiplication. Returns the product of all elements in a.
- _mm256_
reduce_ โmul_ epi16 Experimental avx512bw,avx512vl
- Reduce the packed 16-bit integers in a by multiplication. Returns the product of all elements in a.
- _mm256_
reduce_ โor_ epi8 Experimental avx512bw,avx512vl
- Reduce the packed 8-bit integers in a by bitwise OR. Returns the bitwise OR of all elements in a.
- _mm256_
reduce_ โor_ epi16 Experimental avx512bw,avx512vl
- Reduce the packed 16-bit integers in a by bitwise OR. Returns the bitwise OR of all elements in a.
- _mm256_
sllv_ โepi16 Experimental avx512bw,avx512vl
- Shift packed 16-bit integers in a left by the amount specified by the corresponding element in count while shifting in zeros, and store the results in dst.
- _mm256_
srav_ โepi16 Experimental avx512bw,avx512vl
- Shift packed 16-bit integers in a right by the amount specified by the corresponding element in count while shifting in sign bits, and store the results in dst.
- _mm256_
srlv_ โepi16 Experimental avx512bw,avx512vl
- Shift packed 16-bit integers in a right by the amount specified by the corresponding element in count while shifting in zeros, and store the results in dst.
- _mm256_
storeu_ โepi8 Experimental avx512bw,avx512vl
- Store 256-bits (composed of 32 packed 8-bit integers) from a into memory. mem_addr does not need to be aligned on any particular boundary.
- _mm256_
storeu_ โepi16 Experimental avx512bw,avx512vl
- Store 256-bits (composed of 16 packed 16-bit integers) from a into memory. mem_addr does not need to be aligned on any particular boundary.
- _mm256_
test_ โepi8_ mask Experimental avx512bw,avx512vl
- Compute the bitwise AND of packed 8-bit integers in a and b, producing intermediate 8-bit values, and set the corresponding bit in result mask k if the intermediate value is non-zero.
- _mm256_
test_ โepi16_ mask Experimental avx512bw,avx512vl
- Compute the bitwise AND of packed 16-bit integers in a and b, producing intermediate 16-bit values, and set the corresponding bit in result mask k if the intermediate value is non-zero.
- _mm256_
testn_ โepi8_ mask Experimental avx512bw,avx512vl
- Compute the bitwise AND of packed 8-bit integers in a and b, producing intermediate 8-bit values, and set the corresponding bit in result mask k if the intermediate value is zero.
- _mm256_
testn_ โepi16_ mask Experimental avx512bw,avx512vl
- Compute the bitwise AND of packed 16-bit integers in a and b, producing intermediate 16-bit values, and set the corresponding bit in result mask k if the intermediate value is zero.
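The difference between the test and testn families is easiest to see side by side. The following scalar sketch assumes the usual reading of the underlying instructions: both form the per-lane AND of a and b, and they differ only in whether a non-zero or a zero result sets the mask bit. The helper names are hypothetical.

```rust
/// Scalar model (hypothetical helper): bit i is set when (a[i] & b[i]) != 0.
fn test_epi8_mask_model(a: [u8; 32], b: [u8; 32]) -> u32 {
    let mut k = 0u32;
    for i in 0..32 {
        if (a[i] & b[i]) != 0 {
            k |= 1u32 << i;
        }
    }
    k
}

/// Scalar model (hypothetical helper): bit i is set when (a[i] & b[i]) == 0.
fn testn_epi8_mask_model(a: [u8; 32], b: [u8; 32]) -> u32 {
    let mut k = 0u32;
    for i in 0..32 {
        if (a[i] & b[i]) == 0 {
            k |= 1u32 << i;
        }
    }
    k
}
```

Under this model the two masks are always bitwise complements of each other.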
- _mm512_
abs_ โepi8 Experimental avx512bw
- Compute the absolute value of packed signed 8-bit integers in a, and store the unsigned results in dst.
- _mm512_
abs_ โepi16 Experimental avx512bw
- Compute the absolute value of packed signed 16-bit integers in a, and store the unsigned results in dst.
- _mm512_
add_ โepi8 Experimental avx512bw
- Add packed 8-bit integers in a and b, and store the results in dst.
- _mm512_
add_ โepi16 Experimental avx512bw
- Add packed 16-bit integers in a and b, and store the results in dst.
- _mm512_
adds_ โepi8 Experimental avx512bw
- Add packed signed 8-bit integers in a and b using saturation, and store the results in dst.
- _mm512_
adds_ โepi16 Experimental avx512bw
- Add packed signed 16-bit integers in a and b using saturation, and store the results in dst.
- _mm512_
adds_ โepu8 Experimental avx512bw
- Add packed unsigned 8-bit integers in a and b using saturation, and store the results in dst.
- _mm512_
adds_ โepu16 Experimental avx512bw
- Add packed unsigned 16-bit integers in a and b using saturation, and store the results in dst.
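To contrast the plain and saturating additions above, here is a per-lane scalar sketch using Rust's built-in integer methods. It assumes the non-saturating add wraps on overflow; the function names are hypothetical and each call models a single lane.

```rust
/// One lane of a plain add (assumed to wrap on overflow).
fn add_epi8_model(a: i8, b: i8) -> i8 {
    a.wrapping_add(b)
}

/// One lane of a signed saturating add: clamps to [-128, 127].
fn adds_epi8_model(a: i8, b: i8) -> i8 {
    a.saturating_add(b)
}

/// One lane of an unsigned saturating add: clamps to [0, 255].
fn adds_epu8_model(a: u8, b: u8) -> u8 {
    a.saturating_add(b)
}
```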
- _mm512_
alignr_ โepi8 Experimental avx512bw
- Concatenate pairs of 16-byte blocks in a and b into a 32-byte temporary result, shift the result right by imm8 bytes, and store the low 16 bytes in dst. Unlike the _mm_alignr_epi8 and _mm256_alignr_epi8 functions, where the entire input vectors are concatenated to the temporary result, this concatenation happens in 4 steps, where each step builds a 32-byte temporary result.
- _mm512_
avg_ โepu8 Experimental avx512bw
- Average packed unsigned 8-bit integers in a and b, and store the results in dst.
- _mm512_
avg_ โepu16 Experimental avx512bw
- Average packed unsigned 16-bit integers in a and b, and store the results in dst.
- _mm512_
broadcastb_ โepi8 Experimental avx512bw
- Broadcast the low packed 8-bit integer from a to all elements of dst.
- _mm512_
broadcastw_ โepi16 Experimental avx512bw
- Broadcast the low packed 16-bit integer from a to all elements of dst.
- _mm512_
bslli_ โepi128 Experimental avx512bw
- Shift 128-bit lanes in a left by imm8 bytes while shifting in zeros, and store the results in dst.
- _mm512_
bsrli_ โepi128 Experimental avx512bw
- Shift 128-bit lanes in a right by imm8 bytes while shifting in zeros, and store the results in dst.
- _mm512_
cmp_ โepi8_ mask Experimental avx512bw
- Compare packed signed 8-bit integers in a and b based on the comparison operand specified by imm8, and store the results in mask vector k.
- _mm512_
cmp_ โepi16_ mask Experimental avx512bw
- Compare packed signed 16-bit integers in a and b based on the comparison operand specified by imm8, and store the results in mask vector k.
- _mm512_
cmp_ โepu8_ mask Experimental avx512bw
- Compare packed unsigned 8-bit integers in a and b based on the comparison operand specified by imm8, and store the results in mask vector k.
- _mm512_
cmp_ โepu16_ mask Experimental avx512bw
- Compare packed unsigned 16-bit integers in a and b based on the comparison operand specified by IMM8, and store the results in mask vector k.
- _mm512_
cmpeq_ โepi8_ mask Experimental avx512bw
- Compare packed signed 8-bit integers in a and b for equality, and store the results in mask vector k.
- _mm512_
cmpeq_ โepi16_ mask Experimental avx512bw
- Compare packed signed 16-bit integers in a and b for equality, and store the results in mask vector k.
- _mm512_
cmpeq_ โepu8_ mask Experimental avx512bw
- Compare packed unsigned 8-bit integers in a and b for equality, and store the results in mask vector k.
- _mm512_
cmpeq_ โepu16_ mask Experimental avx512bw
- Compare packed unsigned 16-bit integers in a and b for equality, and store the results in mask vector k.
- _mm512_
cmpge_ โepi8_ mask Experimental avx512bw
- Compare packed signed 8-bit integers in a and b for greater-than-or-equal, and store the results in mask vector k.
- _mm512_
cmpge_ โepi16_ mask Experimental avx512bw
- Compare packed signed 16-bit integers in a and b for greater-than-or-equal, and store the results in mask vector k.
- _mm512_
cmpge_ โepu8_ mask Experimental avx512bw
- Compare packed unsigned 8-bit integers in a and b for greater-than-or-equal, and store the results in mask vector k.
- _mm512_
cmpge_ โepu16_ mask Experimental avx512bw
- Compare packed unsigned 16-bit integers in a and b for greater-than-or-equal, and store the results in mask vector k.
- _mm512_
cmpgt_ โepi8_ mask Experimental avx512bw
- Compare packed signed 8-bit integers in a and b for greater-than, and store the results in mask vector k.
- _mm512_
cmpgt_ โepi16_ mask Experimental avx512bw
- Compare packed signed 16-bit integers in a and b for greater-than, and store the results in mask vector k.
- _mm512_
cmpgt_ โepu8_ mask Experimental avx512bw
- Compare packed unsigned 8-bit integers in a and b for greater-than, and store the results in mask vector k.
- _mm512_
cmpgt_ โepu16_ mask Experimental avx512bw
- Compare packed unsigned 16-bit integers in a and b for greater-than, and store the results in mask vector k.
- _mm512_
cmple_ โepi8_ mask Experimental avx512bw
- Compare packed signed 8-bit integers in a and b for less-than-or-equal, and store the results in mask vector k.
- _mm512_
cmple_ โepi16_ mask Experimental avx512bw
- Compare packed signed 16-bit integers in a and b for less-than-or-equal, and store the results in mask vector k.
- _mm512_
cmple_ โepu8_ mask Experimental avx512bw
- Compare packed unsigned 8-bit integers in a and b for less-than-or-equal, and store the results in mask vector k.
- _mm512_
cmple_ โepu16_ mask Experimental avx512bw
- Compare packed unsigned 16-bit integers in a and b for less-than-or-equal, and store the results in mask vector k.
- _mm512_
cmplt_ โepi8_ mask Experimental avx512bw
- Compare packed signed 8-bit integers in a and b for less-than, and store the results in mask vector k.
- _mm512_
cmplt_ โepi16_ mask Experimental avx512bw
- Compare packed signed 16-bit integers in a and b for less-than, and store the results in mask vector k.
- _mm512_
cmplt_ โepu8_ mask Experimental avx512bw
- Compare packed unsigned 8-bit integers in a and b for less-than, and store the results in mask vector k.
- _mm512_
cmplt_ โepu16_ mask Experimental avx512bw
- Compare packed unsigned 16-bit integers in a and b for less-than, and store the results in mask vector k.
- _mm512_
cmpneq_ โepi8_ mask Experimental avx512bw
- Compare packed signed 8-bit integers in a and b for not-equal, and store the results in mask vector k.
- _mm512_
cmpneq_ โepi16_ mask Experimental avx512bw
- Compare packed signed 16-bit integers in a and b for not-equal, and store the results in mask vector k.
- _mm512_
cmpneq_ โepu8_ mask Experimental avx512bw
- Compare packed unsigned 8-bit integers in a and b for not-equal, and store the results in mask vector k.
- _mm512_
cmpneq_ โepu16_ mask Experimental avx512bw
- Compare packed unsigned 16-bit integers in a and b for not-equal, and store the results in mask vector k.
- _mm512_
cvtepi8_ โepi16 Experimental avx512bw
- Sign extend packed 8-bit integers in a to packed 16-bit integers, and store the results in dst.
- _mm512_
cvtepi16_ โepi8 Experimental avx512bw
- Convert packed 16-bit integers in a to packed 8-bit integers with truncation, and store the results in dst.
- _mm512_
cvtepu8_ โepi16 Experimental avx512bw
- Zero extend packed unsigned 8-bit integers in a to packed 16-bit integers, and store the results in dst.
- _mm512_
cvtsepi16_ โepi8 Experimental avx512bw
- Convert packed signed 16-bit integers in a to packed 8-bit integers with signed saturation, and store the results in dst.
- _mm512_
cvtusepi16_ โepi8 Experimental avx512bw
- Convert packed unsigned 16-bit integers in a to packed unsigned 8-bit integers with unsigned saturation, and store the results in dst.
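The three 16-bit to 8-bit narrowing conversions above differ only in how out-of-range values are handled. A per-lane scalar sketch, with hypothetical function names, might look like this:

```rust
/// One lane of the truncating conversion: keeps the low 8 bits.
fn cvtepi16_epi8_model(x: i16) -> i8 {
    x as i8
}

/// One lane of the signed-saturating conversion: clamps to [-128, 127].
fn cvtsepi16_epi8_model(x: i16) -> i8 {
    x.clamp(i8::MIN as i16, i8::MAX as i16) as i8
}

/// One lane of the unsigned-saturating conversion: clamps to [0, 255].
fn cvtusepi16_epi8_model(x: u16) -> u8 {
    x.min(u8::MAX as u16) as u8
}
```

For example, the input 300 becomes 44 under truncation, 127 under signed saturation, and 255 under unsigned saturation.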
- _mm512_
dbsad_ โepu8 Experimental avx512bw
- Compute the sum of absolute differences (SADs) of quadruplets of unsigned 8-bit integers in a compared to those in b, and store the 16-bit results in dst. Four SADs are performed on four 8-bit quadruplets for each 64-bit lane. The first two SADs use the lower 8-bit quadruplet of the lane from a, and the last two SADs use the upper 8-bit quadruplet of the lane from a. Quadruplets from b are selected from within 128-bit lanes according to the control in imm8, and each SAD in each 64-bit lane uses the selected quadruplet at 8-bit offsets.
- _mm512_
kunpackd โExperimental avx512bw
- Unpack and interleave 32 bits from masks a and b, and store the 64-bit result in k.
- _mm512_
kunpackw โExperimental avx512bw
- Unpack and interleave 16 bits from masks a and b, and store the 32-bit result in k.
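The mask unpack operations can be modelled with plain integer shifts. The sketch below assumes the layout given in the instruction pseudo-code for the underlying mask-unpack instructions: the low half of b fills the low half of the result and the low half of a fills the high half. The function names are hypothetical, and the operand order should be checked against the intrinsic's own documentation before relying on it.

```rust
/// Scalar model (hypothetical helper): result[15:0] = b[15:0],
/// result[31:16] = a[15:0]. Operand order is an assumption.
fn kunpackw_model(a: u32, b: u32) -> u32 {
    ((a & 0xFFFF) << 16) | (b & 0xFFFF)
}

/// Scalar model (hypothetical helper): result[31:0] = b[31:0],
/// result[63:32] = a[31:0]. Operand order is an assumption.
fn kunpackd_model(a: u64, b: u64) -> u64 {
    ((a & 0xFFFF_FFFF) << 32) | (b & 0xFFFF_FFFF)
}
```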
- _mm512_
loadu_ โepi8 Experimental avx512bw
- Load 512-bits (composed of 64 packed 8-bit integers) from memory into dst. mem_addr does not need to be aligned on any particular boundary.
- _mm512_
loadu_ โepi16 Experimental avx512bw
- Load 512-bits (composed of 32 packed 16-bit integers) from memory into dst. mem_addr does not need to be aligned on any particular boundary.
- _mm512_
madd_ โepi16 Experimental avx512bw
- Multiply packed signed 16-bit integers in a and b, producing intermediate signed 32-bit integers. Horizontally add adjacent pairs of intermediate 32-bit integers, and pack the results in dst.
- _mm512_
maddubs_ โepi16 Experimental avx512bw
- Vertically multiply each unsigned 8-bit integer from a with the corresponding signed 8-bit integer from b, producing intermediate signed 16-bit integers. Horizontally add adjacent pairs of intermediate signed 16-bit integers, and pack the saturated results in dst.
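The multiply-and-add-pairs operations above can be sketched for a single output element. This scalar model uses hypothetical names; it assumes that the 32-bit sum in the madd case wraps in the one overflowing corner case, while the maddubs sum saturates to 16 bits as the description states.

```rust
/// One output element of the 16-bit multiply-add: two signed 16-bit
/// products summed into a 32-bit result (assumed to wrap on overflow).
fn madd_pair_model(a: [i16; 2], b: [i16; 2]) -> i32 {
    let p0 = (a[0] as i32) * (b[0] as i32);
    let p1 = (a[1] as i32) * (b[1] as i32);
    p0.wrapping_add(p1)
}

/// One output element of the 8-bit multiply-add: unsigned bytes from a
/// times signed bytes from b, summed and saturated to 16 bits.
fn maddubs_pair_model(a: [u8; 2], b: [i8; 2]) -> i16 {
    let p0 = (a[0] as i32) * (b[0] as i32);
    let p1 = (a[1] as i32) * (b[1] as i32);
    (p0 + p1).clamp(i16::MIN as i32, i16::MAX as i32) as i16
}
```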
- _mm512_
mask2_ โpermutex2var_ epi16 Experimental avx512bw
- Shuffle 16-bit integers in a and b across lanes using the corresponding selector and index in idx, and store the results in dst using writemask k (elements are copied from idx when the corresponding mask bit is not set).
- _mm512_
mask_ โabs_ epi8 Experimental avx512bw
- Compute the absolute value of packed signed 8-bit integers in a, and store the unsigned results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm512_
mask_ โabs_ epi16 Experimental avx512bw
- Compute the absolute value of packed signed 16-bit integers in a, and store the unsigned results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm512_
mask_ โadd_ epi8 Experimental avx512bw
- Add packed 8-bit integers in a and b, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm512_
mask_ โadd_ epi16 Experimental avx512bw
- Add packed 16-bit integers in a and b, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
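The writemask convention used throughout these entries (copy from src when the mask bit is clear) differs from the zeromask convention (write zero when the mask bit is clear). A scalar sketch, shortened to 4 lanes for readability where the 512-bit intrinsic has 32, with hypothetical function names:

```rust
/// Writemask model: lanes whose mask bit is clear keep the value from src.
fn mask_add_epi16_model(src: [i16; 4], k: u8, a: [i16; 4], b: [i16; 4]) -> [i16; 4] {
    let mut dst = src;
    for i in 0..4 {
        if ((k >> i) & 1) == 1 {
            dst[i] = a[i].wrapping_add(b[i]);
        }
    }
    dst
}

/// Zeromask model: lanes whose mask bit is clear are set to zero.
fn maskz_add_epi16_model(k: u8, a: [i16; 4], b: [i16; 4]) -> [i16; 4] {
    // Equivalent to the writemask form with an all-zero src.
    mask_add_epi16_model([0; 4], k, a, b)
}
```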
- _mm512_
mask_ โadds_ epi8 Experimental avx512bw
- Add packed signed 8-bit integers in a and b using saturation, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm512_
mask_ โadds_ epi16 Experimental avx512bw
- Add packed signed 16-bit integers in a and b using saturation, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm512_
mask_ โadds_ epu8 Experimental avx512bw
- Add packed unsigned 8-bit integers in a and b using saturation, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm512_
mask_ โadds_ epu16 Experimental avx512bw
- Add packed unsigned 16-bit integers in a and b using saturation, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm512_
mask_ โalignr_ epi8 Experimental avx512bw
- Concatenate pairs of 16-byte blocks in a and b into a 32-byte temporary result, shift the result right by imm8 bytes, and store the low 16 bytes in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
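Because the 512-bit alignr works on each 128-bit lane independently, bytes never cross a lane boundary. The unmasked behaviour can be sketched in scalar Rust as below; the function names are hypothetical, and `imm8` is taken as a `usize` for convenience rather than as a compile-time immediate.

```rust
/// One 128-bit lane: concatenate a (high 16 bytes) and b (low 16 bytes),
/// shift right by imm8 bytes, keep the low 16 bytes. Bytes shifted in from
/// beyond the 32-byte temporary are zero.
fn alignr_lane_model(a: &[u8], b: &[u8], imm8: usize) -> [u8; 16] {
    let mut tmp = [0u8; 32];
    tmp[..16].copy_from_slice(b);
    tmp[16..].copy_from_slice(a);
    let mut out = [0u8; 16];
    for i in 0..16 {
        let j = i + imm8;
        out[i] = if j < 32 { tmp[j] } else { 0 };
    }
    out
}

/// Full 512-bit model: the lane operation applied to each of the 4 lanes.
fn alignr_epi8_512_model(a: [u8; 64], b: [u8; 64], imm8: usize) -> [u8; 64] {
    let mut out = [0u8; 64];
    for lane in 0..4 {
        let lo = lane * 16;
        let hi = lo + 16;
        let r = alignr_lane_model(&a[lo..hi], &b[lo..hi], imm8);
        out[lo..hi].copy_from_slice(&r);
    }
    out
}
```

With a shift of 4, byte 12 of the result comes from byte 0 of a in the same lane, not from byte 16 of b, which is the per-lane behaviour in question.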
- _mm512_
mask_ โavg_ epu8 Experimental avx512bw
- Average packed unsigned 8-bit integers in a and b, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm512_
mask_ โavg_ epu16 Experimental avx512bw
- Average packed unsigned 16-bit integers in a and b, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm512_
mask_ โblend_ epi8 Experimental avx512bw
- Blend packed 8-bit integers from a and b using control mask k, and store the results in dst.
- _mm512_
mask_ โblend_ epi16 Experimental avx512bw
- Blend packed 16-bit integers from a and b using control mask k, and store the results in dst.
- _mm512_
mask_ โbroadcastb_ epi8 Experimental avx512bw
- Broadcast the low packed 8-bit integer from a to all elements of dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm512_
mask_ โbroadcastw_ epi16 Experimental avx512bw
- Broadcast the low packed 16-bit integer from a to all elements of dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm512_
mask_ โcmp_ epi8_ mask Experimental avx512bw
- Compare packed signed 8-bit integers in a and b based on the comparison operand specified by imm8, and store the results in mask vector k using zeromask k1 (elements are zeroed out when the corresponding mask bit is not set).
- _mm512_
mask_ โcmp_ epi16_ mask Experimental avx512bw
- Compare packed signed 16-bit integers in a and b based on the comparison operand specified by imm8, and store the results in mask vector k using zeromask k1 (elements are zeroed out when the corresponding mask bit is not set).
- _mm512_
mask_ โcmp_ epu8_ mask Experimental avx512bw
- Compare packed unsigned 8-bit integers in a and b based on the comparison operand specified by imm8, and store the results in mask vector k using zeromask k1 (elements are zeroed out when the corresponding mask bit is not set).
- _mm512_
mask_ โcmp_ epu16_ mask Experimental avx512bw
- Compare packed unsigned 16-bit integers in a and b based on the comparison operand specified by imm8, and store the results in mask vector k using zeromask k1 (elements are zeroed out when the corresponding mask bit is not set).
- _mm512_
mask_ โcmpeq_ epi8_ mask Experimental avx512bw
- Compare packed signed 8-bit integers in a and b for equality, and store the results in mask vector k using zeromask k1 (elements are zeroed out when the corresponding mask bit is not set).
- _mm512_
mask_ โcmpeq_ epi16_ mask Experimental avx512bw
- Compare packed signed 16-bit integers in a and b for equality, and store the results in mask vector k using zeromask k1 (elements are zeroed out when the corresponding mask bit is not set).
- _mm512_
mask_ โcmpeq_ epu8_ mask Experimental avx512bw
- Compare packed unsigned 8-bit integers in a and b for equality, and store the results in mask vector k using zeromask k1 (elements are zeroed out when the corresponding mask bit is not set).
- _mm512_
mask_ โcmpeq_ epu16_ mask Experimental avx512bw
- Compare packed unsigned 16-bit integers in a and b for equality, and store the results in mask vector k using zeromask k1 (elements are zeroed out when the corresponding mask bit is not set).
- _mm512_
mask_ โcmpge_ epi8_ mask Experimental avx512bw
- Compare packed signed 8-bit integers in a and b for greater-than-or-equal, and store the results in mask vector k using zeromask k1 (elements are zeroed out when the corresponding mask bit is not set).
- _mm512_
mask_ โcmpge_ epi16_ mask Experimental avx512bw
- Compare packed signed 16-bit integers in a and b for greater-than-or-equal, and store the results in mask vector k using zeromask k1 (elements are zeroed out when the corresponding mask bit is not set).
- _mm512_
mask_ โcmpge_ epu8_ mask Experimental avx512bw
- Compare packed unsigned 8-bit integers in a and b for greater-than-or-equal, and store the results in mask vector k using zeromask k1 (elements are zeroed out when the corresponding mask bit is not set).
- _mm512_
mask_ โcmpge_ epu16_ mask Experimental avx512bw
- Compare packed unsigned 16-bit integers in a and b for greater-than-or-equal, and store the results in mask vector k using zeromask k1 (elements are zeroed out when the corresponding mask bit is not set).
- _mm512_
mask_ โcmpgt_ epi8_ mask Experimental avx512bw
- Compare packed signed 8-bit integers in a and b for greater-than, and store the results in mask vector k using zeromask k1 (elements are zeroed out when the corresponding mask bit is not set).
- _mm512_
mask_ โcmpgt_ epi16_ mask Experimental avx512bw
- Compare packed signed 16-bit integers in a and b for greater-than, and store the results in mask vector k using zeromask k1 (elements are zeroed out when the corresponding mask bit is not set).
- _mm512_
mask_ โcmpgt_ epu8_ mask Experimental avx512bw
- Compare packed unsigned 8-bit integers in a and b for greater-than, and store the results in mask vector k using zeromask k1 (elements are zeroed out when the corresponding mask bit is not set).
- _mm512_
mask_ โcmpgt_ epu16_ mask Experimental avx512bw
- Compare packed unsigned 16-bit integers in a and b for greater-than, and store the results in mask vector k using zeromask k1 (elements are zeroed out when the corresponding mask bit is not set).
- _mm512_
mask_ โcmple_ epi8_ mask Experimental avx512bw
- Compare packed signed 8-bit integers in a and b for less-than-or-equal, and store the results in mask vector k using zeromask k1 (elements are zeroed out when the corresponding mask bit is not set).
- _mm512_
mask_ โcmple_ epi16_ mask Experimental avx512bw
- Compare packed signed 16-bit integers in a and b for less-than-or-equal, and store the results in mask vector k using zeromask k1 (elements are zeroed out when the corresponding mask bit is not set).
- _mm512_
mask_ โcmple_ epu8_ mask Experimental avx512bw
- Compare packed unsigned 8-bit integers in a and b for less-than-or-equal, and store the results in mask vector k using zeromask k1 (elements are zeroed out when the corresponding mask bit is not set).
- _mm512_
mask_ โcmple_ epu16_ mask Experimental avx512bw
- Compare packed unsigned 16-bit integers in a and b for less-than-or-equal, and store the results in mask vector k using zeromask k1 (elements are zeroed out when the corresponding mask bit is not set).
- _mm512_
mask_ โcmplt_ epi8_ mask Experimental avx512bw
- Compare packed signed 8-bit integers in a and b for less-than, and store the results in mask vector k using zeromask k1 (elements are zeroed out when the corresponding mask bit is not set).
- _mm512_
mask_ โcmplt_ epi16_ mask Experimental avx512bw
- Compare packed signed 16-bit integers in a and b for less-than, and store the results in mask vector k using zeromask k1 (elements are zeroed out when the corresponding mask bit is not set).
- _mm512_
mask_ โcmplt_ epu8_ mask Experimental avx512bw
- Compare packed unsigned 8-bit integers in a and b for less-than, and store the results in mask vector k using zeromask k1 (elements are zeroed out when the corresponding mask bit is not set).
- _mm512_
mask_ โcmplt_ epu16_ mask Experimental avx512bw
- Compare packed unsigned 16-bit integers in a and b for less-than, and store the results in mask vector k using zeromask k1 (elements are zeroed out when the corresponding mask bit is not set).
- _mm512_
mask_ โcmpneq_ epi8_ mask Experimental avx512bw
- Compare packed signed 8-bit integers in a and b for not-equal, and store the results in mask vector k using zeromask k1 (elements are zeroed out when the corresponding mask bit is not set).
- _mm512_
mask_ โcmpneq_ epi16_ mask Experimental avx512bw
- Compare packed signed 16-bit integers in a and b for not-equal, and store the results in mask vector k using zeromask k1 (elements are zeroed out when the corresponding mask bit is not set).
- _mm512_
mask_ โcmpneq_ epu8_ mask Experimental avx512bw
- Compare packed unsigned 8-bit integers in a and b for not-equal, and store the results in mask vector k using zeromask k1 (elements are zeroed out when the corresponding mask bit is not set).
- _mm512_
mask_ โcmpneq_ epu16_ mask Experimental avx512bw
- Compare packed unsigned 16-bit integers in a and b for not-equal, and store the results in mask vector k using zeromask k1 (elements are zeroed out when the corresponding mask bit is not set).
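For the masked comparisons above, the zeromask k1 simply clears result bits for inactive lanes. A scalar sketch, shortened to 8 lanes where the 512-bit form has 32, with a hypothetical function name:

```rust
/// Masked not-equal compare: bit i of the result is set only when bit i of
/// k1 is set AND a[i] != b[i]. Lanes disabled by k1 always yield 0.
fn mask_cmpneq_epu16_mask_model(k1: u8, a: [u16; 8], b: [u16; 8]) -> u8 {
    let mut k = 0u8;
    for i in 0..8 {
        if ((k1 >> i) & 1) == 1 && a[i] != b[i] {
            k |= 1u8 << i;
        }
    }
    k
}
```

In this model the result equals the unmasked comparison ANDed with k1, so passing an all-ones k1 recovers the unmasked form.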
- _mm512_
mask_ โcvtepi8_ epi16 Experimental avx512bw
- Sign extend packed 8-bit integers in a to packed 16-bit integers, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm512_
mask_ โcvtepi16_ epi8 Experimental avx512bw
- Convert packed 16-bit integers in a to packed 8-bit integers with truncation, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm512_
mask_ โcvtepi16_ storeu_ epi8 Experimental avx512bw
- Convert packed 16-bit integers in a to packed 8-bit integers with truncation, and store the active results (those with their respective bit set in writemask k) to unaligned memory at base_addr.
- _mm512_
mask_ โcvtepu8_ epi16 Experimental avx512bw
- Zero extend packed unsigned 8-bit integers in a to packed 16-bit integers, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm512_
mask_ โcvtsepi16_ epi8 Experimental avx512bw
- Convert packed signed 16-bit integers in a to packed 8-bit integers with signed saturation, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm512_
mask_ โcvtsepi16_ storeu_ epi8 Experimental avx512bw
- Convert packed signed 16-bit integers in a to packed 8-bit integers with signed saturation, and store the active results (those with their respective bit set in writemask k) to unaligned memory at base_addr.
- _mm512_
mask_ โcvtusepi16_ epi8 Experimental avx512bw
- Convert packed unsigned 16-bit integers in a to packed unsigned 8-bit integers with unsigned saturation, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm512_
mask_ โcvtusepi16_ storeu_ epi8 Experimental avx512bw
- Convert packed unsigned 16-bit integers in a to packed unsigned 8-bit integers with unsigned saturation, and store the active results (those with their respective bit set in writemask k) to unaligned memory at base_addr.
- _mm512_
mask_ โdbsad_ epu8 Experimental avx512bw
- Compute the sum of absolute differences (SADs) of quadruplets of unsigned 8-bit integers in a compared to those in b, and store the 16-bit results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set). Four SADs are performed on four 8-bit quadruplets for each 64-bit lane. The first two SADs use the lower 8-bit quadruplet of the lane from a, and the last two SADs use the upper 8-bit quadruplet of the lane from a. Quadruplets from b are selected from within 128-bit lanes according to the control in imm8, and each SAD in each 64-bit lane uses the selected quadruplet at 8-bit offsets.
- _mm512_
mask_ โloadu_ epi8 Experimental avx512bw
- Load packed 8-bit integers from memory into dst using writemask k (elements are copied from src when the corresponding mask bit is not set). mem_addr does not need to be aligned on any particular boundary.
- _mm512_
mask_ โloadu_ epi16 Experimental avx512bw
- Load packed 16-bit integers from memory into dst using writemask k (elements are copied from src when the corresponding mask bit is not set). mem_addr does not need to be aligned on any particular boundary.
- _mm512_
mask_ โmadd_ epi16 Experimental avx512bw
- Multiply packed signed 16-bit integers in a and b, producing intermediate signed 32-bit integers. Horizontally add adjacent pairs of intermediate 32-bit integers, and pack the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm512_
mask_ โmaddubs_ epi16 Experimental avx512bw
- Multiply packed unsigned 8-bit integers in a by packed signed 8-bit integers in b, producing intermediate signed 16-bit integers. Horizontally add adjacent pairs of intermediate signed 16-bit integers, and pack the saturated results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm512_
mask_ โmax_ epi8 Experimental avx512bw
- Compare packed signed 8-bit integers in a and b, and store packed maximum values in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm512_
mask_ โmax_ epi16 Experimental avx512bw
- Compare packed signed 16-bit integers in a and b, and store packed maximum values in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm512_
mask_ โmax_ epu8 Experimental avx512bw
- Compare packed unsigned 8-bit integers in a and b, and store packed maximum values in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm512_
mask_ โmax_ epu16 Experimental avx512bw
- Compare packed unsigned 16-bit integers in a and b, and store packed maximum values in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm512_
mask_ โmin_ epi8 Experimental avx512bw
- Compare packed signed 8-bit integers in a and b, and store packed minimum values in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm512_
mask_ โmin_ epi16 Experimental avx512bw
- Compare packed signed 16-bit integers in a and b, and store packed minimum values in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm512_
mask_ โmin_ epu8 Experimental avx512bw
- Compare packed unsigned 8-bit integers in a and b, and store packed minimum values in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm512_
mask_ โmin_ epu16 Experimental avx512bw
- Compare packed unsigned 16-bit integers in a and b, and store packed minimum values in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm512_
mask_ โmov_ epi8 Experimental avx512bw
- Move packed 8-bit integers from a into dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm512_
mask_ โmov_ epi16 Experimental avx512bw
- Move packed 16-bit integers from a into dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm512_
mask_ โmulhi_ epi16 Experimental avx512bw
- Multiply the packed signed 16-bit integers in a and b, producing intermediate 32-bit integers, and store the high 16 bits of the intermediate integers in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm512_
mask_ โmulhi_ epu16 Experimental avx512bw
- Multiply the packed unsigned 16-bit integers in a and b, producing intermediate 32-bit integers, and store the high 16 bits of the intermediate integers in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm512_
mask_ โmulhrs_ epi16 Experimental avx512bw
- Multiply packed signed 16-bit integers in a and b, producing intermediate signed 32-bit integers. Truncate each intermediate integer to the 18 most significant bits, round by adding 1, and store bits [16:1] to dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
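The "truncate to 18 bits, add 1, take bits [16:1]" wording describes a rounded fixed-point (Q15) multiply. A per-lane scalar sketch with a hypothetical function name, ignoring the writemask:

```rust
/// One lane of the rounded high multiply.
/// product >> 14 keeps the 18 most significant bits of the 32-bit product;
/// adding 1 rounds; the final >> 1 with a cast to i16 selects bits [16:1].
fn mulhrs_epi16_model(a: i16, b: i16) -> i16 {
    let product = (a as i32) * (b as i32);
    (((product >> 14) + 1) >> 1) as i16
}
```

Reading the inputs as Q15 fractions, 16384 represents 0.5, and the model returns 8192 (0.25) for 0.5 times 0.5. The single overflowing case, `i16::MIN` times `i16::MIN`, wraps back to `i16::MIN` in this model.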
- _mm512_mask_mullo_epi16 Experimental avx512bw
- Multiply the packed 16-bit integers in a and b, producing intermediate 32-bit integers, and store the low 16 bits of the intermediate integers in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm512_mask_packs_epi16 Experimental avx512bw
- Convert packed signed 16-bit integers from a and b to packed 8-bit integers using signed saturation, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm512_mask_packs_epi32 Experimental avx512bw
- Convert packed signed 32-bit integers from a and b to packed 16-bit integers using signed saturation, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm512_mask_packus_epi16 Experimental avx512bw
- Convert packed signed 16-bit integers from a and b to packed 8-bit integers using unsigned saturation, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm512_mask_packus_epi32 Experimental avx512bw
- Convert packed signed 32-bit integers from a and b to packed 16-bit integers using unsigned saturation, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
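The difference between signed and unsigned saturation in the pack entries above can be sketched in scalar Rust. The helper names below are hypothetical, and the lane layout in `packs_epi16_model` (low 8 bytes of each 128-bit lane from `a`, high 8 bytes from `b`) reflects my understanding of the underlying pack instruction rather than anything stated on this page.

```rust
/// Signed saturation: clamp a 16-bit value into the i8 range.
fn saturate_i16_to_i8(x: i16) -> i8 {
    x.clamp(i8::MIN as i16, i8::MAX as i16) as i8
}

/// Unsigned saturation: clamp a signed 16-bit value into the u8 range.
fn saturate_i16_to_u8(x: i16) -> u8 {
    x.clamp(0, u8::MAX as i16) as u8
}

/// Unmasked model of a 512-bit signed pack, processed per 128-bit lane.
fn packs_epi16_model(a: &[i16; 32], b: &[i16; 32]) -> [i8; 64] {
    let mut dst = [0i8; 64];
    for lane in 0..4 {
        for j in 0..8 {
            // Low half of the output lane comes from `a`, high half from `b`.
            dst[lane * 16 + j] = saturate_i16_to_i8(a[lane * 8 + j]);
            dst[lane * 16 + 8 + j] = saturate_i16_to_i8(b[lane * 8 + j]);
        }
    }
    dst
}
```

A writemask or zeromask would then be applied per output byte, in the same way as for the other masked entries.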
- _mm512_mask_permutex2var_epi16 Experimental avx512bw
- Shuffle 16-bit integers in a and b across lanes using the corresponding selector and index in idx, and store the results in dst using writemask k (elements are copied from a when the corresponding mask bit is not set).
- _mm512_mask_permutexvar_epi16 Experimental avx512bw
- Shuffle 16-bit integers in a across lanes using the corresponding index in idx, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm512_mask_set1_epi8 Experimental avx512bw
- Broadcast 8-bit integer a to all elements of dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm512_mask_set1_epi16 Experimental avx512bw
- Broadcast 16-bit integer a to all elements of dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm512_mask_shuffle_epi8 Experimental avx512bw
- Shuffle 8-bit integers in a within 128-bit lanes using the control in the corresponding 8-bit element of b, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm512_mask_shufflehi_epi16 Experimental avx512bw
- Shuffle 16-bit integers in the high 64 bits of 128-bit lanes of a using the control in imm8. Store the results in the high 64 bits of 128-bit lanes of dst, with the low 64 bits of 128-bit lanes being copied from a to dst, using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm512_mask_shufflelo_epi16 Experimental avx512bw
- Shuffle 16-bit integers in the low 64 bits of 128-bit lanes of a using the control in imm8. Store the results in the low 64 bits of 128-bit lanes of dst, with the high 64 bits of 128-bit lanes being copied from a to dst, using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm512_mask_sll_epi16 Experimental avx512bw
- Shift packed 16-bit integers in a left by count while shifting in zeros, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm512_mask_slli_epi16 Experimental avx512bw
- Shift packed 16-bit integers in a left by imm8 while shifting in zeros, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm512_mask_sllv_epi16 Experimental avx512bw
- Shift packed 16-bit integers in a left by the amount specified by the corresponding element in count while shifting in zeros, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm512_mask_sra_epi16 Experimental avx512bw
- Shift packed 16-bit integers in a right by count while shifting in sign bits, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm512_mask_srai_epi16 Experimental avx512bw
- Shift packed 16-bit integers in a right by imm8 while shifting in sign bits, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm512_mask_srav_epi16 Experimental avx512bw
- Shift packed 16-bit integers in a right by the amount specified by the corresponding element in count while shifting in sign bits, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm512_mask_srl_epi16 Experimental avx512bw
- Shift packed 16-bit integers in a right by count while shifting in zeros, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm512_mask_srli_epi16 Experimental avx512bw
- Shift packed 16-bit integers in a right by imm8 while shifting in zeros, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm512_mask_srlv_epi16 Experimental avx512bw
- Shift packed 16-bit integers in a right by the amount specified by the corresponding element in count while shifting in zeros, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm512_mask_storeu_epi8 Experimental avx512bw
- Store packed 8-bit integers from a into memory using writemask k. mem_addr does not need to be aligned on any particular boundary.
- _mm512_mask_storeu_epi16 Experimental avx512bw
- Store packed 16-bit integers from a into memory using writemask k. mem_addr does not need to be aligned on any particular boundary.
- _mm512_mask_sub_epi8 Experimental avx512bw
- Subtract packed 8-bit integers in b from packed 8-bit integers in a, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm512_mask_sub_epi16 Experimental avx512bw
- Subtract packed 16-bit integers in b from packed 16-bit integers in a, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm512_mask_subs_epi8 Experimental avx512bw
- Subtract packed signed 8-bit integers in b from packed 8-bit integers in a using saturation, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm512_mask_subs_epi16 Experimental avx512bw
- Subtract packed signed 16-bit integers in b from packed 16-bit integers in a using saturation, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm512_mask_subs_epu8 Experimental avx512bw
- Subtract packed unsigned 8-bit integers in b from packed unsigned 8-bit integers in a using saturation, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm512_mask_subs_epu16 Experimental avx512bw
- Subtract packed unsigned 16-bit integers in b from packed unsigned 16-bit integers in a using saturation, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm512_mask_test_epi8_mask Experimental avx512bw
- Compute the bitwise AND of packed 8-bit integers in a and b, producing intermediate 8-bit values, and set the corresponding bit in result mask k (subject to writemask k) if the intermediate value is non-zero.
- _mm512_mask_test_epi16_mask Experimental avx512bw
- Compute the bitwise AND of packed 16-bit integers in a and b, producing intermediate 16-bit values, and set the corresponding bit in result mask k (subject to writemask k) if the intermediate value is non-zero.
- _mm512_mask_testn_epi8_mask Experimental avx512bw
- Compute the bitwise AND of packed 8-bit integers in a and b, producing intermediate 8-bit values, and set the corresponding bit in result mask k (subject to writemask k) if the intermediate value is zero.
- _mm512_mask_testn_epi16_mask Experimental avx512bw
- Compute the bitwise AND of packed 16-bit integers in a and b, producing intermediate 16-bit values, and set the corresponding bit in result mask k (subject to writemask k) if the intermediate value is zero.
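A scalar sketch may help separate the `test` and `testn` families. In this model both AND the two lanes; they differ only in whether a non-zero or a zero result sets the output bit. The function names are hypothetical stand-ins, and the writemask is modelled as forcing the output bit to 0 wherever its own bit is clear.

```rust
/// Model of a masked `test`: the bit is set when (a & b) is non-zero.
fn mask_test_epi16_model(k: u32, a: &[u16; 32], b: &[u16; 32]) -> u32 {
    let mut out = 0u32;
    for i in 0..32 {
        if (k >> i) & 1 == 1 && (a[i] & b[i]) != 0 {
            out |= 1u32 << i;
        }
    }
    out
}

/// Model of a masked `testn`: the bit is set when (a & b) is zero.
fn mask_testn_epi16_model(k: u32, a: &[u16; 32], b: &[u16; 32]) -> u32 {
    let mut out = 0u32;
    for i in 0..32 {
        if (k >> i) & 1 == 1 && (a[i] & b[i]) == 0 {
            out |= 1u32 << i;
        }
    }
    out
}
```

For any inputs, the two results are disjoint and their union equals `k`.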
- _mm512_mask_unpackhi_epi8 Experimental avx512bw
- Unpack and interleave 8-bit integers from the high half of each 128-bit lane in a and b, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm512_mask_unpackhi_epi16 Experimental avx512bw
- Unpack and interleave 16-bit integers from the high half of each 128-bit lane in a and b, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm512_mask_unpacklo_epi8 Experimental avx512bw
- Unpack and interleave 8-bit integers from the low half of each 128-bit lane in a and b, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm512_mask_unpacklo_epi16 Experimental avx512bw
- Unpack and interleave 16-bit integers from the low half of each 128-bit lane in a and b, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm512_maskz_abs_epi8 Experimental avx512bw
- Compute the absolute value of packed signed 8-bit integers in a, and store the unsigned results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm512_maskz_abs_epi16 Experimental avx512bw
- Compute the absolute value of packed signed 16-bit integers in a, and store the unsigned results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm512_maskz_add_epi8 Experimental avx512bw
- Add packed 8-bit integers in a and b, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm512_maskz_add_epi16 Experimental avx512bw
- Add packed 16-bit integers in a and b, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
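The `mask_` and `maskz_` prefixes recur throughout this list, so a scalar sketch of the two masking modes may be useful. The functions below are hypothetical models of a 16-bit add, not the intrinsics; wrapping addition is assumed for the non-saturating form.

```rust
/// Writemask model: lanes whose bit in `k` is clear are copied from `src`.
fn mask_add_epi16_model(src: &[i16; 32], k: u32, a: &[i16; 32], b: &[i16; 32]) -> [i16; 32] {
    let mut dst = *src;
    for i in 0..32 {
        if (k >> i) & 1 == 1 {
            dst[i] = a[i].wrapping_add(b[i]);
        }
    }
    dst
}

/// Zeromask model: the same operation, with an all-zero vector as the fallback.
fn maskz_add_epi16_model(k: u32, a: &[i16; 32], b: &[i16; 32]) -> [i16; 32] {
    mask_add_epi16_model(&[0; 32], k, a, b)
}
```

Seen this way, a zeromask operation is the writemask operation with `src` fixed to zero.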
- _mm512_maskz_adds_epi8 Experimental avx512bw
- Add packed signed 8-bit integers in a and b using saturation, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm512_maskz_adds_epi16 Experimental avx512bw
- Add packed signed 16-bit integers in a and b using saturation, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm512_maskz_adds_epu8 Experimental avx512bw
- Add packed unsigned 8-bit integers in a and b using saturation, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm512_maskz_adds_epu16 Experimental avx512bw
- Add packed unsigned 16-bit integers in a and b using saturation, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm512_maskz_alignr_epi8 Experimental avx512bw
- Concatenate pairs of 16-byte blocks in a and b into a 32-byte temporary result, shift the result right by imm8 bytes, and store the low 16 bytes in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
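The concatenate-and-shift step above works independently on each 128-bit lane. The following scalar sketch assumes `a` supplies the upper 16 bytes of each 32-byte temporary and `b` the lower 16; `maskz_alignr_epi8_model` is a hypothetical name, and `imm8` is passed as an ordinary argument here rather than as a compile-time constant.

```rust
/// Scalar model of a zero-masked, per-lane byte alignment.
fn maskz_alignr_epi8_model(k: u64, a: &[u8; 64], b: &[u8; 64], imm8: usize) -> [u8; 64] {
    let mut dst = [0u8; 64];
    for lane in 0..4 {
        let base = lane * 16;
        for j in 0..16 {
            // Position inside the 32-byte temporary after shifting right by imm8 bytes.
            let pos = j + imm8;
            let byte = if pos < 16 {
                b[base + pos]
            } else if pos < 32 {
                a[base + pos - 16]
            } else {
                0 // shifted past the temporary: zeros are shifted in
            };
            // Zeromask: bytes whose mask bit is clear stay zero.
            if (k >> (base + j)) & 1 == 1 {
                dst[base + j] = byte;
            }
        }
    }
    dst
}
```

With `imm8 = 4`, each output lane starts with bytes 4..16 of the `b` lane followed by bytes 0..4 of the `a` lane.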
- _mm512_maskz_avg_epu8 Experimental avx512bw
- Average packed unsigned 8-bit integers in a and b, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm512_maskz_avg_epu16 Experimental avx512bw
- Average packed unsigned 16-bit integers in a and b, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm512_maskz_broadcastb_epi8 Experimental avx512bw
- Broadcast the low packed 8-bit integer from a to all elements of dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm512_maskz_broadcastw_epi16 Experimental avx512bw
- Broadcast the low packed 16-bit integer from a to all elements of dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm512_maskz_cvtepi8_epi16 Experimental avx512bw
- Sign extend packed 8-bit integers in a to packed 16-bit integers, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm512_maskz_cvtepi16_epi8 Experimental avx512bw
- Convert packed 16-bit integers in a to packed 8-bit integers with truncation, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm512_maskz_cvtepu8_epi16 Experimental avx512bw
- Zero extend packed unsigned 8-bit integers in a to packed 16-bit integers, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm512_maskz_cvtsepi16_epi8 Experimental avx512bw
- Convert packed signed 16-bit integers in a to packed 8-bit integers with signed saturation, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm512_maskz_cvtusepi16_epi8 Experimental avx512bw
- Convert packed unsigned 16-bit integers in a to packed unsigned 8-bit integers with unsigned saturation, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm512_maskz_dbsad_epu8 Experimental avx512bw
- Compute the sum of absolute differences (SADs) of quadruplets of unsigned 8-bit integers in a compared to those in b, and store the 16-bit results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set). Four SADs are performed on four 8-bit quadruplets for each 64-bit lane. The first two SADs use the lower 8-bit quadruplet of the lane from a, and the last two SADs use the upper 8-bit quadruplet of the lane from a. Quadruplets from b are selected from within 128-bit lanes according to the control in imm8, and each SAD in each 64-bit lane uses the selected quadruplet at 8-bit offsets.
- _mm512_maskz_loadu_epi8 Experimental avx512bw
- Load packed 8-bit integers from memory into dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set). mem_addr does not need to be aligned on any particular boundary.
- _mm512_maskz_loadu_epi16 Experimental avx512bw
- Load packed 16-bit integers from memory into dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set). mem_addr does not need to be aligned on any particular boundary.
- _mm512_maskz_madd_epi16 Experimental avx512bw
- Multiply packed signed 16-bit integers in a and b, producing intermediate signed 32-bit integers. Horizontally add adjacent pairs of intermediate 32-bit integers, and pack the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm512_maskz_maddubs_epi16 Experimental avx512bw
- Multiply packed unsigned 8-bit integers in a by packed signed 8-bit integers in b, producing intermediate signed 16-bit integers. Horizontally add adjacent pairs of intermediate signed 16-bit integers, and pack the saturated results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm512_maskz_max_epi8 Experimental avx512bw
- Compare packed signed 8-bit integers in a and b, and store packed maximum values in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm512_maskz_max_epi16 Experimental avx512bw
- Compare packed signed 16-bit integers in a and b, and store packed maximum values in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm512_maskz_max_epu8 Experimental avx512bw
- Compare packed unsigned 8-bit integers in a and b, and store packed maximum values in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm512_maskz_max_epu16 Experimental avx512bw
- Compare packed unsigned 16-bit integers in a and b, and store packed maximum values in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm512_maskz_min_epi8 Experimental avx512bw
- Compare packed signed 8-bit integers in a and b, and store packed minimum values in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm512_maskz_min_epi16 Experimental avx512bw
- Compare packed signed 16-bit integers in a and b, and store packed minimum values in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm512_maskz_min_epu8 Experimental avx512bw
- Compare packed unsigned 8-bit integers in a and b, and store packed minimum values in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm512_maskz_min_epu16 Experimental avx512bw
- Compare packed unsigned 16-bit integers in a and b, and store packed minimum values in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm512_maskz_mov_epi8 Experimental avx512bw
- Move packed 8-bit integers from a into dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm512_maskz_mov_epi16 Experimental avx512bw
- Move packed 16-bit integers from a into dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm512_maskz_mulhi_epi16 Experimental avx512bw
- Multiply the packed signed 16-bit integers in a and b, producing intermediate 32-bit integers, and store the high 16 bits of the intermediate integers in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm512_maskz_mulhi_epu16 Experimental avx512bw
- Multiply the packed unsigned 16-bit integers in a and b, producing intermediate 32-bit integers, and store the high 16 bits of the intermediate integers in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm512_maskz_mulhrs_epi16 Experimental avx512bw
- Multiply packed signed 16-bit integers in a and b, producing intermediate signed 32-bit integers. Truncate each intermediate integer to the 18 most significant bits, round by adding 1, and store bits [16:1] to dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm512_maskz_mullo_epi16 Experimental avx512bw
- Multiply the packed 16-bit integers in a and b, producing intermediate 32-bit integers, and store the low 16 bits of the intermediate integers in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm512_maskz_packs_epi16 Experimental avx512bw
- Convert packed signed 16-bit integers from a and b to packed 8-bit integers using signed saturation, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm512_maskz_packs_epi32 Experimental avx512bw
- Convert packed signed 32-bit integers from a and b to packed 16-bit integers using signed saturation, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm512_maskz_packus_epi16 Experimental avx512bw
- Convert packed signed 16-bit integers from a and b to packed 8-bit integers using unsigned saturation, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm512_maskz_packus_epi32 Experimental avx512bw
- Convert packed signed 32-bit integers from a and b to packed 16-bit integers using unsigned saturation, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm512_maskz_permutex2var_epi16 Experimental avx512bw
- Shuffle 16-bit integers in a and b across lanes using the corresponding selector and index in idx, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm512_maskz_permutexvar_epi16 Experimental avx512bw
- Shuffle 16-bit integers in a across lanes using the corresponding index in idx, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm512_maskz_set1_epi8 Experimental avx512bw
- Broadcast 8-bit integer a to all elements of dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm512_maskz_set1_epi16 Experimental avx512bw
- Broadcast 16-bit integer a to all elements of dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm512_maskz_shuffle_epi8 Experimental avx512bw
- Shuffle 8-bit integers in a within 128-bit lanes using the control in the corresponding 8-bit element of b, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm512_maskz_shufflehi_epi16 Experimental avx512bw
- Shuffle 16-bit integers in the high 64 bits of 128-bit lanes of a using the control in imm8. Store the results in the high 64 bits of 128-bit lanes of dst, with the low 64 bits of 128-bit lanes being copied from a to dst, using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm512_maskz_shufflelo_epi16 Experimental avx512bw
- Shuffle 16-bit integers in the low 64 bits of 128-bit lanes of a using the control in imm8. Store the results in the low 64 bits of 128-bit lanes of dst, with the high 64 bits of 128-bit lanes being copied from a to dst, using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm512_maskz_sll_epi16 Experimental avx512bw
- Shift packed 16-bit integers in a left by count while shifting in zeros, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm512_maskz_slli_epi16 Experimental avx512bw
- Shift packed 16-bit integers in a left by imm8 while shifting in zeros, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm512_maskz_sllv_epi16 Experimental avx512bw
- Shift packed 16-bit integers in a left by the amount specified by the corresponding element in count while shifting in zeros, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm512_maskz_sra_epi16 Experimental avx512bw
- Shift packed 16-bit integers in a right by count while shifting in sign bits, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm512_maskz_srai_epi16 Experimental avx512bw
- Shift packed 16-bit integers in a right by imm8 while shifting in sign bits, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm512_maskz_srav_epi16 Experimental avx512bw
- Shift packed 16-bit integers in a right by the amount specified by the corresponding element in count while shifting in sign bits, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm512_maskz_srl_epi16 Experimental avx512bw
- Shift packed 16-bit integers in a right by count while shifting in zeros, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm512_maskz_srli_epi16 Experimental avx512bw
- Shift packed 16-bit integers in a right by imm8 while shifting in zeros, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm512_maskz_srlv_epi16 Experimental avx512bw
- Shift packed 16-bit integers in a right by the amount specified by the corresponding element in count while shifting in zeros, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm512_maskz_sub_epi8 Experimental avx512bw
- Subtract packed 8-bit integers in b from packed 8-bit integers in a, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm512_maskz_sub_epi16 Experimental avx512bw
- Subtract packed 16-bit integers in b from packed 16-bit integers in a, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm512_maskz_subs_epi8 Experimental avx512bw
- Subtract packed signed 8-bit integers in b from packed 8-bit integers in a using saturation, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm512_maskz_subs_epi16 Experimental avx512bw
- Subtract packed signed 16-bit integers in b from packed 16-bit integers in a using saturation, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm512_maskz_subs_epu8 Experimental avx512bw
- Subtract packed unsigned 8-bit integers in b from packed unsigned 8-bit integers in a using saturation, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm512_maskz_subs_epu16 Experimental avx512bw
- Subtract packed unsigned 16-bit integers in b from packed unsigned 16-bit integers in a using saturation, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm512_maskz_unpackhi_epi8 Experimental avx512bw
- Unpack and interleave 8-bit integers from the high half of each 128-bit lane in a and b, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm512_maskz_unpackhi_epi16 Experimental avx512bw
- Unpack and interleave 16-bit integers from the high half of each 128-bit lane in a and b, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm512_maskz_unpacklo_epi8 Experimental avx512bw
- Unpack and interleave 8-bit integers from the low half of each 128-bit lane in a and b, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm512_maskz_unpacklo_epi16 Experimental avx512bw
- Unpack and interleave 16-bit integers from the low half of each 128-bit lane in a and b, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm512_max_epi8 Experimental avx512bw
- Compare packed signed 8-bit integers in a and b, and store packed maximum values in dst.
- _mm512_max_epi16 Experimental avx512bw
- Compare packed signed 16-bit integers in a and b, and store packed maximum values in dst.
- _mm512_max_epu8 Experimental avx512bw
- Compare packed unsigned 8-bit integers in a and b, and store packed maximum values in dst.
- _mm512_max_epu16 Experimental avx512bw
- Compare packed unsigned 16-bit integers in a and b, and store packed maximum values in dst.
- _mm512_min_epi8 Experimental avx512bw
- Compare packed signed 8-bit integers in a and b, and store packed minimum values in dst.
- _mm512_min_epi16 Experimental avx512bw
- Compare packed signed 16-bit integers in a and b, and store packed minimum values in dst.
- _mm512_min_epu8 Experimental avx512bw
- Compare packed unsigned 8-bit integers in a and b, and store packed minimum values in dst.
- _mm512_min_epu16 Experimental avx512bw
- Compare packed unsigned 16-bit integers in a and b, and store packed minimum values in dst.
- _mm512_movepi8_mask Experimental avx512bw
- Set each bit of mask register k based on the most significant bit of the corresponding packed 8-bit integer in a.
- _mm512_movepi16_mask Experimental avx512bw
- Set each bit of mask register k based on the most significant bit of the corresponding packed 16-bit integer in a.
- _mm512_movm_epi8 Experimental avx512bw
- Set each packed 8-bit integer in dst to all ones or all zeros based on the value of the corresponding bit in k.
- _mm512_movm_epi16 Experimental avx512bw
- Set each packed 16-bit integer in dst to all ones or all zeros based on the value of the corresponding bit in k.
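The `movepi*_mask` and `movm_*` entries above convert between vectors and mask registers in opposite directions. A scalar sketch of the 8-bit pair, using hypothetical model functions over 64-element arrays, could look like this:

```rust
/// Model of vector-to-mask: bit i is the most significant (sign) bit of lane i.
fn movepi8_mask_model(a: &[i8; 64]) -> u64 {
    let mut k = 0u64;
    for i in 0..64 {
        if a[i] < 0 {
            k |= 1u64 << i;
        }
    }
    k
}

/// Model of mask-to-vector: lane i is all ones (-1) if bit i is set, else zero.
fn movm_epi8_model(k: u64) -> [i8; 64] {
    let mut dst = [0i8; 64];
    for i in 0..64 {
        if (k >> i) & 1 == 1 {
            dst[i] = -1;
        }
    }
    dst
}
```

In this model, expanding a mask and extracting it again returns the original mask; the reverse round trip only preserves the sign bit of each lane.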
- _mm512_mulhi_epi16 Experimental avx512bw
- Multiply the packed signed 16-bit integers in a and b, producing intermediate 32-bit integers, and store the high 16 bits of the intermediate integers in dst.
- _mm512_mulhi_epu16 Experimental avx512bw
- Multiply the packed unsigned 16-bit integers in a and b, producing intermediate 32-bit integers, and store the high 16 bits of the intermediate integers in dst.
- _mm512_mulhrs_epi16 Experimental avx512bw
- Multiply packed signed 16-bit integers in a and b, producing intermediate signed 32-bit integers. Truncate each intermediate integer to the 18 most significant bits, round by adding 1, and store bits [16:1] to dst.
- _mm512_mullo_epi16 Experimental avx512bw
- Multiply the packed 16-bit integers in a and b, producing intermediate 32-bit integers, and store the low 16 bits of the intermediate integers in dst.
- _mm512_packs_epi16 Experimental avx512bw
- Convert packed signed 16-bit integers from a and b to packed 8-bit integers using signed saturation, and store the results in dst.
- _mm512_packs_epi32 Experimental avx512bw
- Convert packed signed 32-bit integers from a and b to packed 16-bit integers using signed saturation, and store the results in dst.
- _mm512_packus_epi16 Experimental avx512bw
- Convert packed signed 16-bit integers from a and b to packed 8-bit integers using unsigned saturation, and store the results in dst.
- _mm512_packus_epi32 Experimental avx512bw
- Convert packed signed 32-bit integers from a and b to packed 16-bit integers using unsigned saturation, and store the results in dst.
- _mm512_permutex2var_epi16 Experimental avx512bw
- Shuffle 16-bit integers in a and b across lanes using the corresponding selector and index in idx, and store the results in dst.
- _mm512_permutexvar_epi16 Experimental avx512bw
- Shuffle 16-bit integers in a across lanes using the corresponding index in idx, and store the results in dst.
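The two cross-lane shuffles above differ in how many source vectors the index can reach. The sketch below models both in scalar Rust with hypothetical function names. It assumes that only the low 5 bits of each index select an element, and that in the two-source form bit 5 acts as the selector (clear for `a`, set for `b`); higher bits are ignored.

```rust
/// One-source model: dst[i] = a[idx[i] mod 32].
fn permutexvar_epi16_model(idx: &[u16; 32], a: &[u16; 32]) -> [u16; 32] {
    core::array::from_fn(|i| a[(idx[i] & 31) as usize])
}

/// Two-source model: bit 5 of each index picks the source, bits 0..5 the element.
fn permutex2var_epi16_model(a: &[u16; 32], idx: &[u16; 32], b: &[u16; 32]) -> [u16; 32] {
    core::array::from_fn(|i| {
        let sel = (idx[i] & 63) as usize;
        if sel < 32 { a[sel] } else { b[sel - 32] }
    })
}
```

An index vector of `31 - i` reverses `a` in the one-source model, which is a quick way to check that the shuffle is not confined to 128-bit lanes.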
- _mm512_sad_epu8 Experimental avx512bw
- Compute the absolute differences of packed unsigned 8-bit integers in a and b, then horizontally sum each consecutive 8 differences to produce eight unsigned 16-bit integers, and pack these unsigned 16-bit integers in the low 16 bits of 64-bit elements in dst.
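The sum-of-absolute-differences step can be written out as a scalar sketch. `sad_epu8_model` is a hypothetical name; each `u64` in the result stands for one 64-bit element of dst, with the sum occupying its low 16 bits and the remaining bits zero.

```rust
/// Scalar model: one sum per group of 8 consecutive bytes.
fn sad_epu8_model(a: &[u8; 64], b: &[u8; 64]) -> [u64; 8] {
    let mut dst = [0u64; 8];
    for group in 0..8 {
        for j in 0..8 {
            let i = group * 8 + j;
            // |a - b| on unsigned bytes; the largest possible sum is 8 * 255 = 2040.
            dst[group] += a[i].abs_diff(b[i]) as u64;
        }
    }
    dst
}
```

Because each sum is at most 2040, it always fits in the low 16 bits of its 64-bit element.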
- _mm512_shuffle_epi8 Experimental avx512bw
- Shuffle 8-bit integers in a within 128-bit lanes using the control in the corresponding 8-bit element of b, and store the results in dst.
- _mm512_shufflehi_epi16 Experimental avx512bw
- Shuffle 16-bit integers in the high 64 bits of 128-bit lanes of a using the control in imm8. Store the results in the high 64 bits of 128-bit lanes of dst, with the low 64 bits of 128-bit lanes being copied from a to dst.
- _mm512_shufflelo_epi16 Experimental avx512bw
- Shuffle 16-bit integers in the low 64 bits of 128-bit lanes of a using the control in imm8. Store the results in the low 64 bits of 128-bit lanes of dst, with the high 64 bits of 128-bit lanes being copied from a to dst.
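The imm8 control for the two half-lane shuffles is four 2-bit selectors, one per output word. The following scalar sketch models a single 128-bit lane; the helper names are hypothetical, and imm8 is taken as an ordinary argument here purely for simplicity.

```rust
/// Scalar model of one 128-bit lane: shuffle the low four words by `imm8`,
/// copy the high four words unchanged.
fn shufflelo_epi16_lane(a: [u16; 8], imm8: u8) -> [u16; 8] {
    let mut dst = a;
    for i in 0..4 {
        let sel = ((imm8 >> (2 * i)) & 0b11) as usize;
        dst[i] = a[sel];
    }
    dst
}

/// Scalar model of one 128-bit lane: shuffle the high four words by `imm8`,
/// copy the low four words unchanged.
fn shufflehi_epi16_lane(a: [u16; 8], imm8: u8) -> [u16; 8] {
    let mut dst = a;
    for i in 0..4 {
        let sel = ((imm8 >> (2 * i)) & 0b11) as usize;
        dst[4 + i] = a[4 + sel];
    }
    dst
}
```

With `imm8 = 0b00_01_10_11` the selected half is reversed, while `0b11_10_01_00` is the identity.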
- _mm512_sll_epi16 ⚠ Experimental avx512bw
- Shift packed 16-bit integers in a left by count while shifting in zeros, and store the results in dst.
- _mm512_slli_epi16 ⚠ Experimental avx512bw
- Shift packed 16-bit integers in a left by imm8 while shifting in zeros, and store the results in dst.
- _mm512_sllv_epi16 ⚠ Experimental avx512bw
- Shift packed 16-bit integers in a left by the amount specified by the corresponding element in count while shifting in zeros, and store the results in dst.
- _mm512_sra_epi16 ⚠ Experimental avx512bw
- Shift packed 16-bit integers in a right by count while shifting in sign bits, and store the results in dst.
- _mm512_srai_epi16 ⚠ Experimental avx512bw
- Shift packed 16-bit integers in a right by imm8 while shifting in sign bits, and store the results in dst.
- _mm512_srav_epi16 ⚠ Experimental avx512bw
- Shift packed 16-bit integers in a right by the amount specified by the corresponding element in count while shifting in sign bits, and store the results in dst.
- _mm512_srl_epi16 ⚠ Experimental avx512bw
- Shift packed 16-bit integers in a right by count while shifting in zeros, and store the results in dst.
- _mm512_srli_epi16 ⚠ Experimental avx512bw
- Shift packed 16-bit integers in a right by imm8 while shifting in zeros, and store the results in dst.
- _mm512_srlv_epi16 ⚠ Experimental avx512bw
- Shift packed 16-bit integers in a right by the amount specified by the corresponding element in count while shifting in zeros, and store the results in dst.
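The shift entries above differ in what is shifted in and in how oversized counts behave. The per-element sketch below models the variable-count forms under the assumption that a count above 15 clears the element for logical shifts and fills it with the sign bit for arithmetic shifts. The helper names are hypothetical.

```rust
/// Logical left shift of one 16-bit element; counts above 15 yield 0.
fn sllv_epi16_elem(a: u16, count: u16) -> u16 {
    if count > 15 { 0 } else { a << count }
}

/// Logical right shift of one 16-bit element; counts above 15 yield 0.
fn srlv_epi16_elem(a: u16, count: u16) -> u16 {
    if count > 15 { 0 } else { a >> count }
}

/// Arithmetic right shift of one 16-bit element; counts above 15 behave
/// like a shift by 15, leaving 0 or -1 depending on the sign.
fn srav_epi16_elem(a: i16, count: u16) -> i16 {
    a >> count.min(15)
}
```

This differs from Rust's own `<<` and `>>` operators, which reject shift amounts that are not smaller than the bit width.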
- _mm512_storeu_epi8 ⚠ Experimental avx512bw
- Store 512 bits (composed of 64 packed 8-bit integers) from a into memory. mem_addr does not need to be aligned on any particular boundary.
- _mm512_storeu_epi16 ⚠ Experimental avx512bw
- Store 512 bits (composed of 32 packed 16-bit integers) from a into memory. mem_addr does not need to be aligned on any particular boundary.
- _mm512_sub_epi8 ⚠ Experimental avx512bw
- Subtract packed 8-bit integers in b from packed 8-bit integers in a, and store the results in dst.
- _mm512_sub_epi16 ⚠ Experimental avx512bw
- Subtract packed 16-bit integers in b from packed 16-bit integers in a, and store the results in dst.
- _mm512_subs_epi8 ⚠ Experimental avx512bw
- Subtract packed signed 8-bit integers in b from packed 8-bit integers in a using saturation, and store the results in dst.
- _mm512_subs_epi16 ⚠ Experimental avx512bw
- Subtract packed signed 16-bit integers in b from packed 16-bit integers in a using saturation, and store the results in dst.
- _mm512_subs_epu8 ⚠ Experimental avx512bw
- Subtract packed unsigned 8-bit integers in b from packed unsigned 8-bit integers in a using saturation, and store the results in dst.
- _mm512_subs_epu16 ⚠ Experimental avx512bw
- Subtract packed unsigned 16-bit integers in b from packed unsigned 16-bit integers in a using saturation, and store the results in dst.
- _mm512_test_epi8_mask ⚠ Experimental avx512bw
- Compute the bitwise AND of packed 8-bit integers in a and b, producing intermediate 8-bit values, and set the corresponding bit in result mask k if the intermediate value is non-zero.
- _mm512_test_epi16_mask ⚠ Experimental avx512bw
- Compute the bitwise AND of packed 16-bit integers in a and b, producing intermediate 16-bit values, and set the corresponding bit in result mask k if the intermediate value is non-zero.
- _mm512_testn_epi8_mask ⚠ Experimental avx512bw
- Compute the bitwise AND of packed 8-bit integers in a and b, producing intermediate 8-bit values, and set the corresponding bit in result mask k if the intermediate value is zero.
- _mm512_testn_epi16_mask ⚠ Experimental avx512bw
- Compute the bitwise AND of packed 16-bit integers in a and b, producing intermediate 16-bit values, and set the corresponding bit in result mask k if the intermediate value is zero.
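To make the test and testn pair concrete, here is a scalar sketch of the 8-bit forms. It models testn as "the AND of the two elements is zero", which is how the underlying vptestnm instructions behave, so the two masks are always bitwise complements of each other. The function names are hypothetical stand-ins, not the intrinsics themselves.

```rust
/// Bit i of the result is set when a[i] & b[i] is non-zero.
fn test_epi8_mask(a: &[u8; 64], b: &[u8; 64]) -> u64 {
    (0..64).fold(0u64, |k, i| k | (((a[i] & b[i] != 0) as u64) << i))
}

/// Bit i of the result is set when a[i] & b[i] is zero.
fn testn_epi8_mask(a: &[u8; 64], b: &[u8; 64]) -> u64 {
    (0..64).fold(0u64, |k, i| k | (((a[i] & b[i] == 0) as u64) << i))
}
```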
- _mm512_unpackhi_epi8 ⚠ Experimental avx512bw
- Unpack and interleave 8-bit integers from the high half of each 128-bit lane in a and b, and store the results in dst.
- _mm512_unpackhi_epi16 ⚠ Experimental avx512bw
- Unpack and interleave 16-bit integers from the high half of each 128-bit lane in a and b, and store the results in dst.
- _mm512_unpacklo_epi8 ⚠ Experimental avx512bw
- Unpack and interleave 8-bit integers from the low half of each 128-bit lane in a and b, and store the results in dst.
- _mm512_unpacklo_epi16 ⚠ Experimental avx512bw
- Unpack and interleave 16-bit integers from the low half of each 128-bit lane in a and b, and store the results in dst.
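The interleaving done by the unpack entries can be sketched for one 128-bit lane of bytes as follows. The helper names are hypothetical; the wider intrinsics would repeat this independently in every 128-bit lane.

```rust
/// Interleave the low eight bytes of `a` and `b`: a0, b0, a1, b1, ...
fn unpacklo_epi8_lane(a: [u8; 16], b: [u8; 16]) -> [u8; 16] {
    std::array::from_fn(|i| if i % 2 == 0 { a[i / 2] } else { b[i / 2] })
}

/// Interleave the high eight bytes of `a` and `b`: a8, b8, a9, b9, ...
fn unpackhi_epi8_lane(a: [u8; 16], b: [u8; 16]) -> [u8; 16] {
    std::array::from_fn(|i| if i % 2 == 0 { a[8 + i / 2] } else { b[8 + i / 2] })
}
```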
- _mm_cmp_epi8_mask ⚠ Experimental avx512bw,avx512vl
- Compare packed signed 8-bit integers in a and b based on the comparison operand specified by imm8, and store the results in mask vector k.
- _mm_cmp_epi16_mask ⚠ Experimental avx512bw,avx512vl
- Compare packed signed 16-bit integers in a and b based on the comparison operand specified by imm8, and store the results in mask vector k.
- _mm_cmp_epu8_mask ⚠ Experimental avx512bw,avx512vl
- Compare packed unsigned 8-bit integers in a and b based on the comparison operand specified by imm8, and store the results in mask vector k.
- _mm_cmp_epu16_mask ⚠ Experimental avx512bw,avx512vl
- Compare packed unsigned 16-bit integers in a and b based on the comparison operand specified by imm8, and store the results in mask vector k.
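The generic cmp entries select one of eight predicates through the low three bits of imm8 (0 equal, 1 less-than, 2 less-or-equal, 3 always false, 4 not-equal, 5 not-less-than, 6 not-less-or-equal, 7 always true). The sketch below is a scalar model with hypothetical function names; imm8 is passed as a plain argument here for simplicity. It also shows why the signed and unsigned variants exist: the same bit pattern can order differently.

```rust
/// Evaluate the integer comparison predicate selected by the low 3 bits of `imm8`.
fn cmp_predicate<T: PartialOrd>(a: T, b: T, imm8: u8) -> bool {
    match imm8 & 0b111 {
        0 => a == b,
        1 => a < b,
        2 => a <= b,
        3 => false,
        4 => a != b,
        5 => a >= b, // not less-than
        6 => a > b,  // not less-than-or-equal
        _ => true,
    }
}

/// Signed byte compare: bit i of the mask holds the predicate for element i.
fn cmp_epi8_mask(a: [i8; 16], b: [i8; 16], imm8: u8) -> u16 {
    (0..16).fold(0u16, |k, i| k | ((cmp_predicate(a[i], b[i], imm8) as u16) << i))
}

/// Unsigned byte compare with the same predicate encoding.
fn cmp_epu8_mask(a: [u8; 16], b: [u8; 16], imm8: u8) -> u16 {
    (0..16).fold(0u16, |k, i| k | ((cmp_predicate(a[i], b[i], imm8) as u16) << i))
}
```

The byte 0xFF is -1 when read as signed and 255 when read as unsigned, so "less-than 1" holds only in the signed variant.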
- _mm_cmpeq_epi8_mask ⚠ Experimental avx512bw,avx512vl
- Compare packed signed 8-bit integers in a and b for equality, and store the results in mask vector k.
- _mm_cmpeq_epi16_mask ⚠ Experimental avx512bw,avx512vl
- Compare packed signed 16-bit integers in a and b for equality, and store the results in mask vector k.
- _mm_cmpeq_epu8_mask ⚠ Experimental avx512bw,avx512vl
- Compare packed unsigned 8-bit integers in a and b for equality, and store the results in mask vector k.
- _mm_cmpeq_epu16_mask ⚠ Experimental avx512bw,avx512vl
- Compare packed unsigned 16-bit integers in a and b for equality, and store the results in mask vector k.
- _mm_cmpge_epi8_mask ⚠ Experimental avx512bw,avx512vl
- Compare packed signed 8-bit integers in a and b for greater-than-or-equal, and store the results in mask vector k.
- _mm_cmpge_epi16_mask ⚠ Experimental avx512bw,avx512vl
- Compare packed signed 16-bit integers in a and b for greater-than-or-equal, and store the results in mask vector k.
- _mm_cmpge_epu8_mask ⚠ Experimental avx512bw,avx512vl
- Compare packed unsigned 8-bit integers in a and b for greater-than-or-equal, and store the results in mask vector k.
- _mm_cmpge_epu16_mask ⚠ Experimental avx512bw,avx512vl
- Compare packed unsigned 16-bit integers in a and b for greater-than-or-equal, and store the results in mask vector k.
- _mm_cmpgt_epi8_mask ⚠ Experimental avx512bw,avx512vl
- Compare packed signed 8-bit integers in a and b for greater-than, and store the results in mask vector k.
- _mm_cmpgt_epi16_mask ⚠ Experimental avx512bw,avx512vl
- Compare packed signed 16-bit integers in a and b for greater-than, and store the results in mask vector k.
- _mm_cmpgt_epu8_mask ⚠ Experimental avx512bw,avx512vl
- Compare packed unsigned 8-bit integers in a and b for greater-than, and store the results in mask vector k.
- _mm_cmpgt_epu16_mask ⚠ Experimental avx512bw,avx512vl
- Compare packed unsigned 16-bit integers in a and b for greater-than, and store the results in mask vector k.
- _mm_cmple_epi8_mask ⚠ Experimental avx512bw,avx512vl
- Compare packed signed 8-bit integers in a and b for less-than-or-equal, and store the results in mask vector k.
- _mm_cmple_epi16_mask ⚠ Experimental avx512bw,avx512vl
- Compare packed signed 16-bit integers in a and b for less-than-or-equal, and store the results in mask vector k.
- _mm_cmple_epu8_mask ⚠ Experimental avx512bw,avx512vl
- Compare packed unsigned 8-bit integers in a and b for less-than-or-equal, and store the results in mask vector k.
- _mm_cmple_epu16_mask ⚠ Experimental avx512bw,avx512vl
- Compare packed unsigned 16-bit integers in a and b for less-than-or-equal, and store the results in mask vector k.
- _mm_cmplt_epi8_mask ⚠ Experimental avx512bw,avx512vl
- Compare packed signed 8-bit integers in a and b for less-than, and store the results in mask vector k.
- _mm_cmplt_epi16_mask ⚠ Experimental avx512bw,avx512vl
- Compare packed signed 16-bit integers in a and b for less-than, and store the results in mask vector k.
- _mm_cmplt_epu8_mask ⚠ Experimental avx512bw,avx512vl
- Compare packed unsigned 8-bit integers in a and b for less-than, and store the results in mask vector k.
- _mm_cmplt_epu16_mask ⚠ Experimental avx512bw,avx512vl
- Compare packed unsigned 16-bit integers in a and b for less-than, and store the results in mask vector k.
- _mm_cmpneq_epi8_mask ⚠ Experimental avx512bw,avx512vl
- Compare packed signed 8-bit integers in a and b for not-equal, and store the results in mask vector k.
- _mm_cmpneq_epi16_mask ⚠ Experimental avx512bw,avx512vl
- Compare packed signed 16-bit integers in a and b for not-equal, and store the results in mask vector k.
- _mm_cmpneq_epu8_mask ⚠ Experimental avx512bw,avx512vl
- Compare packed unsigned 8-bit integers in a and b for not-equal, and store the results in mask vector k.
- _mm_cmpneq_epu16_mask ⚠ Experimental avx512bw,avx512vl
- Compare packed unsigned 16-bit integers in a and b for not-equal, and store the results in mask vector k.
- _mm_cvtepi16_epi8 ⚠ Experimental avx512bw,avx512vl
- Convert packed 16-bit integers in a to packed 8-bit integers with truncation, and store the results in dst.
- _mm_cvtsepi16_epi8 ⚠ Experimental avx512bw,avx512vl
- Convert packed signed 16-bit integers in a to packed 8-bit integers with signed saturation, and store the results in dst.
- _mm_cvtusepi16_epi8 ⚠ Experimental avx512bw,avx512vl
- Convert packed unsigned 16-bit integers in a to packed unsigned 8-bit integers with unsigned saturation, and store the results in dst.
- _mm_dbsad_epu8 ⚠ Experimental avx512bw,avx512vl
- Compute the sum of absolute differences (SADs) of quadruplets of unsigned 8-bit integers in a compared to those in b, and store the 16-bit results in dst. Four SADs are performed on four 8-bit quadruplets for each 64-bit lane. The first two SADs use the lower 8-bit quadruplet of the lane from a, and the last two SADs use the upper 8-bit quadruplet of the lane from a. Quadruplets from b are selected from within 128-bit lanes according to the control in imm8, and each SAD in each 64-bit lane uses the selected quadruplet at 8-bit offsets.
- _mm_loadu_epi8 ⚠ Experimental avx512bw,avx512vl
- Load 128 bits (composed of 16 packed 8-bit integers) from memory into dst. mem_addr does not need to be aligned on any particular boundary.
- _mm_loadu_epi16 ⚠ Experimental avx512bw,avx512vl
- Load 128 bits (composed of 8 packed 16-bit integers) from memory into dst. mem_addr does not need to be aligned on any particular boundary.
- _mm_mask2_permutex2var_epi16 ⚠ Experimental avx512bw,avx512vl
- Shuffle 16-bit integers in a and b across lanes using the corresponding selector and index in idx, and store the results in dst using writemask k (elements are copied from idx when the corresponding mask bit is not set).
- _mm_mask_abs_epi8 ⚠ Experimental avx512bw,avx512vl
- Compute the absolute value of packed signed 8-bit integers in a, and store the unsigned results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm_mask_abs_epi16 ⚠ Experimental avx512bw,avx512vl
- Compute the absolute value of packed signed 16-bit integers in a, and store the unsigned results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm_mask_add_epi8 ⚠ Experimental avx512bw,avx512vl
- Add packed 8-bit integers in a and b, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm_mask_add_epi16 ⚠ Experimental avx512bw,avx512vl
- Add packed 16-bit integers in a and b, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm_mask_adds_epi8 ⚠ Experimental avx512bw,avx512vl
- Add packed signed 8-bit integers in a and b using saturation, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm_mask_adds_epi16 ⚠ Experimental avx512bw,avx512vl
- Add packed signed 16-bit integers in a and b using saturation, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm_mask_adds_epu8 ⚠ Experimental avx512bw,avx512vl
- Add packed unsigned 8-bit integers in a and b using saturation, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm_mask_adds_epu16 ⚠ Experimental avx512bw,avx512vl
- Add packed unsigned 16-bit integers in a and b using saturation, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
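Most of the mask_ entries share the same writemask rule: bit i of k decides whether element i receives the computed value or keeps the value from src. A scalar sketch of that rule for the wrapping and the unsigned saturating byte adds follows; the function names are hypothetical models, not the intrinsics.

```rust
/// Writemask model: element i is a[i] + b[i] (wrapping) when bit i of `k`
/// is set, otherwise it is copied from `src`.
fn mask_add_epi8(src: [i8; 16], k: u16, a: [i8; 16], b: [i8; 16]) -> [i8; 16] {
    std::array::from_fn(|i| {
        if (k >> i) & 1 == 1 { a[i].wrapping_add(b[i]) } else { src[i] }
    })
}

/// Same writemask rule with unsigned saturating addition.
fn mask_adds_epu8(src: [u8; 16], k: u16, a: [u8; 16], b: [u8; 16]) -> [u8; 16] {
    std::array::from_fn(|i| {
        if (k >> i) & 1 == 1 { a[i].saturating_add(b[i]) } else { src[i] }
    })
}
```

The plain add wraps on overflow (100 + 100 becomes -56 as a signed byte), while the saturating form clamps at the type's limit.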
- _mm_mask_alignr_epi8 ⚠ Experimental avx512bw,avx512vl
- Concatenate pairs of 16-byte blocks in a and b into a 32-byte temporary result, shift the result right by imm8 bytes, and store the low 16 bytes in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
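The concatenate-and-shift step of alignr is easier to see in scalar form. The sketch below assumes a forms the high 16 bytes and b the low 16 bytes of the temporary, and takes imm8 as an ordinary argument; `mask_alignr_epi8` here is a hypothetical model of the 128-bit masked form.

```rust
/// Model of a masked 128-bit alignr: treat a:b as a 32-byte value (b in the
/// low half), shift right by `imm8` bytes, keep the low 16 bytes, then apply
/// the writemask against `src`.
fn mask_alignr_epi8(src: [u8; 16], k: u16, a: [u8; 16], b: [u8; 16], imm8: usize) -> [u8; 16] {
    std::array::from_fn(|i| {
        if (k >> i) & 1 == 0 {
            return src[i];
        }
        let j = i + imm8;
        if j < 16 {
            b[j]
        } else if j < 32 {
            a[j - 16]
        } else {
            0 // shifted entirely past the 32-byte temporary
        }
    })
}
```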
- _mm_mask_avg_epu8 ⚠ Experimental avx512bw,avx512vl
- Average packed unsigned 8-bit integers in a and b, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm_mask_avg_epu16 ⚠ Experimental avx512bw,avx512vl
- Average packed unsigned 16-bit integers in a and b, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm_mask_blend_epi8 ⚠ Experimental avx512bw,avx512vl
- Blend packed 8-bit integers from a and b using control mask k, and store the results in dst.
- _mm_mask_blend_epi16 ⚠ Experimental avx512bw,avx512vl
- Blend packed 16-bit integers from a and b using control mask k, and store the results in dst.
- _mm_mask_broadcastb_epi8 ⚠ Experimental avx512bw,avx512vl
- Broadcast the low packed 8-bit integer from a to all elements of dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm_mask_broadcastw_epi16 ⚠ Experimental avx512bw,avx512vl
- Broadcast the low packed 16-bit integer from a to all elements of dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm_mask_cmp_epi8_mask ⚠ Experimental avx512bw,avx512vl
- Compare packed signed 8-bit integers in a and b based on the comparison operand specified by imm8, and store the results in mask vector k using zeromask k1 (elements are zeroed out when the corresponding mask bit is not set).
- _mm_mask_cmp_epi16_mask ⚠ Experimental avx512bw,avx512vl
- Compare packed signed 16-bit integers in a and b based on the comparison operand specified by imm8, and store the results in mask vector k using zeromask k1 (elements are zeroed out when the corresponding mask bit is not set).
- _mm_mask_cmp_epu8_mask ⚠ Experimental avx512bw,avx512vl
- Compare packed unsigned 8-bit integers in a and b based on the comparison operand specified by imm8, and store the results in mask vector k using zeromask k1 (elements are zeroed out when the corresponding mask bit is not set).
- _mm_mask_cmp_epu16_mask ⚠ Experimental avx512bw,avx512vl
- Compare packed unsigned 16-bit integers in a and b based on the comparison operand specified by imm8, and store the results in mask vector k using zeromask k1 (elements are zeroed out when the corresponding mask bit is not set).
- _mm_mask_cmpeq_epi8_mask ⚠ Experimental avx512bw,avx512vl
- Compare packed signed 8-bit integers in a and b for equality, and store the results in mask vector k using zeromask k1 (elements are zeroed out when the corresponding mask bit is not set).
- _mm_mask_cmpeq_epi16_mask ⚠ Experimental avx512bw,avx512vl
- Compare packed signed 16-bit integers in a and b for equality, and store the results in mask vector k using zeromask k1 (elements are zeroed out when the corresponding mask bit is not set).
- _mm_mask_cmpeq_epu8_mask ⚠ Experimental avx512bw,avx512vl
- Compare packed unsigned 8-bit integers in a and b for equality, and store the results in mask vector k using zeromask k1 (elements are zeroed out when the corresponding mask bit is not set).
- _mm_mask_cmpeq_epu16_mask ⚠ Experimental avx512bw,avx512vl
- Compare packed unsigned 16-bit integers in a and b for equality, and store the results in mask vector k using zeromask k1 (elements are zeroed out when the corresponding mask bit is not set).
- _mm_mask_cmpge_epi8_mask ⚠ Experimental avx512bw,avx512vl
- Compare packed signed 8-bit integers in a and b for greater-than-or-equal, and store the results in mask vector k using zeromask k1 (elements are zeroed out when the corresponding mask bit is not set).
- _mm_mask_cmpge_epi16_mask ⚠ Experimental avx512bw,avx512vl
- Compare packed signed 16-bit integers in a and b for greater-than-or-equal, and store the results in mask vector k using zeromask k1 (elements are zeroed out when the corresponding mask bit is not set).
- _mm_mask_cmpge_epu8_mask ⚠ Experimental avx512bw,avx512vl
- Compare packed unsigned 8-bit integers in a and b for greater-than-or-equal, and store the results in mask vector k using zeromask k1 (elements are zeroed out when the corresponding mask bit is not set).
- _mm_mask_cmpge_epu16_mask ⚠ Experimental avx512bw,avx512vl
- Compare packed unsigned 16-bit integers in a and b for greater-than-or-equal, and store the results in mask vector k using zeromask k1 (elements are zeroed out when the corresponding mask bit is not set).
- _mm_mask_cmpgt_epi8_mask ⚠ Experimental avx512bw,avx512vl
- Compare packed signed 8-bit integers in a and b for greater-than, and store the results in mask vector k using zeromask k1 (elements are zeroed out when the corresponding mask bit is not set).
- _mm_mask_cmpgt_epi16_mask ⚠ Experimental avx512bw,avx512vl
- Compare packed signed 16-bit integers in a and b for greater-than, and store the results in mask vector k using zeromask k1 (elements are zeroed out when the corresponding mask bit is not set).
- _mm_mask_cmpgt_epu8_mask ⚠ Experimental avx512bw,avx512vl
- Compare packed unsigned 8-bit integers in a and b for greater-than, and store the results in mask vector k using zeromask k1 (elements are zeroed out when the corresponding mask bit is not set).
- _mm_mask_cmpgt_epu16_mask ⚠ Experimental avx512bw,avx512vl
- Compare packed unsigned 16-bit integers in a and b for greater-than, and store the results in mask vector k using zeromask k1 (elements are zeroed out when the corresponding mask bit is not set).
- _mm_mask_cmple_epi8_mask ⚠ Experimental avx512bw,avx512vl
- Compare packed signed 8-bit integers in a and b for less-than-or-equal, and store the results in mask vector k using zeromask k1 (elements are zeroed out when the corresponding mask bit is not set).
- _mm_mask_cmple_epi16_mask ⚠ Experimental avx512bw,avx512vl
- Compare packed signed 16-bit integers in a and b for less-than-or-equal, and store the results in mask vector k using zeromask k1 (elements are zeroed out when the corresponding mask bit is not set).
- _mm_mask_cmple_epu8_mask ⚠ Experimental avx512bw,avx512vl
- Compare packed unsigned 8-bit integers in a and b for less-than-or-equal, and store the results in mask vector k using zeromask k1 (elements are zeroed out when the corresponding mask bit is not set).
- _mm_mask_cmple_epu16_mask ⚠ Experimental avx512bw,avx512vl
- Compare packed unsigned 16-bit integers in a and b for less-than-or-equal, and store the results in mask vector k using zeromask k1 (elements are zeroed out when the corresponding mask bit is not set).
- _mm_mask_cmplt_epi8_mask ⚠ Experimental avx512bw,avx512vl
- Compare packed signed 8-bit integers in a and b for less-than, and store the results in mask vector k using zeromask k1 (elements are zeroed out when the corresponding mask bit is not set).
- _mm_mask_cmplt_epi16_mask ⚠ Experimental avx512bw,avx512vl
- Compare packed signed 16-bit integers in a and b for less-than, and store the results in mask vector k using zeromask k1 (elements are zeroed out when the corresponding mask bit is not set).
- _mm_mask_cmplt_epu8_mask ⚠ Experimental avx512bw,avx512vl
- Compare packed unsigned 8-bit integers in a and b for less-than, and store the results in mask vector k using zeromask k1 (elements are zeroed out when the corresponding mask bit is not set).
- _mm_mask_cmplt_epu16_mask ⚠ Experimental avx512bw,avx512vl
- Compare packed unsigned 16-bit integers in a and b for less-than, and store the results in mask vector k using zeromask k1 (elements are zeroed out when the corresponding mask bit is not set).
- _mm_mask_cmpneq_epi8_mask ⚠ Experimental avx512bw,avx512vl
- Compare packed signed 8-bit integers in a and b for not-equal, and store the results in mask vector k using zeromask k1 (elements are zeroed out when the corresponding mask bit is not set).
- _mm_mask_cmpneq_epi16_mask ⚠ Experimental avx512bw,avx512vl
- Compare packed signed 16-bit integers in a and b for not-equal, and store the results in mask vector k using zeromask k1 (elements are zeroed out when the corresponding mask bit is not set).
- _mm_mask_cmpneq_epu8_mask ⚠ Experimental avx512bw,avx512vl
- Compare packed unsigned 8-bit integers in a and b for not-equal, and store the results in mask vector k using zeromask k1 (elements are zeroed out when the corresponding mask bit is not set).
- _mm_mask_cmpneq_epu16_mask ⚠ Experimental avx512bw,avx512vl
- Compare packed unsigned 16-bit integers in a and b for not-equal, and store the results in mask vector k using zeromask k1 (elements are zeroed out when the corresponding mask bit is not set).
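For the masked comparisons, the zeromask k1 simply clears result bits: the output mask is the unmasked comparison result ANDed with k1. A scalar sketch using the unsigned less-than form, with hypothetical function names:

```rust
/// Unmasked model: bit i is set when a[i] < b[i] as unsigned bytes.
fn cmplt_epu8_mask(a: [u8; 16], b: [u8; 16]) -> u16 {
    (0..16).fold(0u16, |k, i| k | (((a[i] < b[i]) as u16) << i))
}

/// Zeromask model: result bits whose k1 bit is clear are forced to 0.
fn mask_cmplt_epu8_mask(k1: u16, a: [u8; 16], b: [u8; 16]) -> u16 {
    k1 & cmplt_epu8_mask(a, b)
}
```

Chaining compares this way (feeding one result in as the next k1) gives a logical AND of conditions without a separate mask operation.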
- _mm_mask_cvtepi8_epi16 ⚠ Experimental avx512bw,avx512vl
- Sign extend packed 8-bit integers in a to packed 16-bit integers, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm_mask_cvtepi16_epi8 ⚠ Experimental avx512bw,avx512vl
- Convert packed 16-bit integers in a to packed 8-bit integers with truncation, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm_mask_cvtepi16_storeu_epi8 ⚠ Experimental avx512bw,avx512vl
- Convert packed 16-bit integers in a to packed 8-bit integers with truncation, and store the active results (those with their respective bit set in writemask k) to unaligned memory at base_addr.
- _mm_mask_cvtepu8_epi16 ⚠ Experimental avx512bw,avx512vl
- Zero extend packed unsigned 8-bit integers in a to packed 16-bit integers, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm_mask_cvtsepi16_epi8 ⚠ Experimental avx512bw,avx512vl
- Convert packed signed 16-bit integers in a to packed 8-bit integers with signed saturation, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm_mask_cvtsepi16_storeu_epi8 ⚠ Experimental avx512bw,avx512vl
- Convert packed signed 16-bit integers in a to packed 8-bit integers with signed saturation, and store the active results (those with their respective bit set in writemask k) to unaligned memory at base_addr.
- _mm_mask_cvtusepi16_epi8 ⚠ Experimental avx512bw,avx512vl
- Convert packed unsigned 16-bit integers in a to packed unsigned 8-bit integers with unsigned saturation, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm_mask_cvtusepi16_storeu_epi8 ⚠ Experimental avx512bw,avx512vl
- Convert packed unsigned 16-bit integers in a to packed unsigned 8-bit integers with unsigned saturation, and store the active results (those with their respective bit set in writemask k) to unaligned memory at base_addr.
- _mm_mask_dbsad_epu8 ⚠ Experimental avx512bw,avx512vl
- Compute the sum of absolute differences (SADs) of quadruplets of unsigned 8-bit integers in a compared to those in b, and store the 16-bit results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set). Four SADs are performed on four 8-bit quadruplets for each 64-bit lane. The first two SADs use the lower 8-bit quadruplet of the lane from a, and the last two SADs use the upper 8-bit quadruplet of the lane from a. Quadruplets from b are selected from within 128-bit lanes according to the control in imm8, and each SAD in each 64-bit lane uses the selected quadruplet at 8-bit offsets.
- _mm_mask_loadu_epi8 ⚠ Experimental avx512bw,avx512vl
- Load packed 8-bit integers from memory into dst using writemask k (elements are copied from src when the corresponding mask bit is not set). mem_addr does not need to be aligned on any particular boundary.
- _mm_mask_loadu_epi16 ⚠ Experimental avx512bw,avx512vl
- Load packed 16-bit integers from memory into dst using writemask k (elements are copied from src when the corresponding mask bit is not set). mem_addr does not need to be aligned on any particular boundary.
- _mm_mask_madd_epi16 ⚠ Experimental avx512bw,avx512vl
- Multiply packed signed 16-bit integers in a and b, producing intermediate signed 32-bit integers. Horizontally add adjacent pairs of intermediate 32-bit integers, and pack the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm_mask_maddubs_epi16 ⚠ Experimental avx512bw,avx512vl
- Multiply packed unsigned 8-bit integers in a by packed signed 8-bit integers in b, producing intermediate signed 16-bit integers. Horizontally add adjacent pairs of intermediate signed 16-bit integers, and pack the saturated results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
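The two multiply-add entries above both halve the element count by summing adjacent products, but only maddubs saturates. The scalar sketch below models the 128-bit masked forms; the function names are hypothetical, and the mask has one bit per output element (four for madd, eight for maddubs).

```rust
/// Model of masked madd: each 32-bit result is the sum of two adjacent
/// signed 16-bit products; inactive results are copied from `src`.
fn mask_madd_epi16(src: [i32; 4], k: u8, a: [i16; 8], b: [i16; 8]) -> [i32; 4] {
    std::array::from_fn(|i| {
        if (k >> i) & 1 == 0 {
            return src[i];
        }
        let lo = a[2 * i] as i32 * b[2 * i] as i32;
        let hi = a[2 * i + 1] as i32 * b[2 * i + 1] as i32;
        lo.wrapping_add(hi)
    })
}

/// Model of masked maddubs: unsigned bytes of `a` times signed bytes of `b`,
/// adjacent products summed and saturated to the signed 16-bit range.
fn mask_maddubs_epi16(src: [i16; 8], k: u8, a: [u8; 16], b: [i8; 16]) -> [i16; 8] {
    std::array::from_fn(|i| {
        if (k >> i) & 1 == 0 {
            return src[i];
        }
        let lo = a[2 * i] as i32 * b[2 * i] as i32;
        let hi = a[2 * i + 1] as i32 * b[2 * i + 1] as i32;
        (lo + hi).clamp(i16::MIN as i32, i16::MAX as i32) as i16
    })
}
```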
- _mm_mask_max_epi8 ⚠ Experimental avx512bw,avx512vl
- Compare packed signed 8-bit integers in a and b, and store packed maximum values in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm_mask_max_epi16 ⚠ Experimental avx512bw,avx512vl
- Compare packed signed 16-bit integers in a and b, and store packed maximum values in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm_mask_max_epu8 ⚠ Experimental avx512bw,avx512vl
- Compare packed unsigned 8-bit integers in a and b, and store packed maximum values in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm_mask_max_epu16 ⚠ Experimental avx512bw,avx512vl
- Compare packed unsigned 16-bit integers in a and b, and store packed maximum values in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm_mask_min_epi8 ⚠ Experimental avx512bw,avx512vl
- Compare packed signed 8-bit integers in a and b, and store packed minimum values in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm_mask_min_epi16 ⚠ Experimental avx512bw,avx512vl
- Compare packed signed 16-bit integers in a and b, and store packed minimum values in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm_mask_min_epu8 ⚠ Experimental avx512bw,avx512vl
- Compare packed unsigned 8-bit integers in a and b, and store packed minimum values in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm_mask_min_epu16 ⚠ Experimental avx512bw,avx512vl
- Compare packed unsigned 16-bit integers in a and b, and store packed minimum values in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm_mask_mov_epi8 ⚠ Experimental avx512bw,avx512vl
- Move packed 8-bit integers from a into dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm_mask_mov_epi16 ⚠ Experimental avx512bw,avx512vl
- Move packed 16-bit integers from a into dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm_mask_mulhi_epi16 ⚠ Experimental avx512bw,avx512vl
- Multiply the packed signed 16-bit integers in a and b, producing intermediate 32-bit integers, and store the high 16 bits of the intermediate integers in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm_mask_mulhi_epu16 ⚠ Experimental avx512bw,avx512vl
- Multiply the packed unsigned 16-bit integers in a and b, producing intermediate 32-bit integers, and store the high 16 bits of the intermediate integers in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm_mask_mulhrs_epi16 ⚠ Experimental avx512bw,avx512vl
- Multiply packed signed 16-bit integers in a and b, producing intermediate signed 32-bit integers. Truncate each intermediate integer to the 18 most significant bits, round by adding 1, and store bits [16:1] to dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
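The "truncate to 18 bits, add 1, take bits [16:1]" wording above amounts to a rounded Q15 fixed-point multiply. A per-element scalar sketch, using a hypothetical helper name, is:

```rust
/// Model of one mulhrs element: take the 32-bit product, drop the low 14
/// bits, add 1 to round, then drop one more bit and keep 16 bits.
fn mulhrs_epi16_elem(a: i16, b: i16) -> i16 {
    let product = a as i32 * b as i32;
    (((product >> 14) + 1) >> 1) as i16
}
```

Reading the inputs as Q15 fractions, 0x4000 is 0.5, so 0.5 times 0.5 yields 0x2000 (0.25). The one overflow case is -32768 times -32768, which wraps back to -32768 in this model. In the masked form, the writemask then chooses per element between this result and src.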
- _mm_mask_mullo_epi16 ⚠ Experimental avx512bw,avx512vl
- Multiply the packed 16-bit integers in a and b, producing intermediate 32-bit integers, and store the low 16 bits of the intermediate integers in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm_mask_packs_epi16 ⚠ Experimental avx512bw,avx512vl
- Convert packed signed 16-bit integers from a and b to packed 8-bit integers using signed saturation, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm_mask_packs_epi32 ⚠ Experimental avx512bw,avx512vl
- Convert packed signed 32-bit integers from a and b to packed 16-bit integers using signed saturation, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm_mask_packus_epi16 ⚠ Experimental avx512bw,avx512vl
- Convert packed signed 16-bit integers from a and b to packed 8-bit integers using unsigned saturation, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm_mask_packus_epi32 ⚠ Experimental avx512bw,avx512vl
- Convert packed signed 32-bit integers from a and b to packed 16-bit integers using unsigned saturation, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm_mask_permutex2var_epi16 ⚠ Experimental avx512bw,avx512vl
- Shuffle 16-bit integers in a and b across lanes using the corresponding selector and index in idx, and store the results in dst using writemask k (elements are copied from a when the corresponding mask bit is not set).
- _mm_mask_permutexvar_epi16 ⚠ Experimental avx512bw,avx512vl
- Shuffle 16-bit integers in a across lanes using the corresponding index in idx, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm_mask_reduce_add_epi8 ⚠ Experimental avx512bw,avx512vl
- Reduce the packed 8-bit integers in a by addition using mask k. Returns the sum of all active elements in a.
- _mm_mask_reduce_add_epi16 ⚠ Experimental avx512bw,avx512vl
- Reduce the packed 16-bit integers in a by addition using mask k. Returns the sum of all active elements in a.
- _mm_mask_reduce_and_epi8 ⚠ Experimental avx512bw,avx512vl
- Reduce the packed 8-bit integers in a by bitwise AND using mask k. Returns the bitwise AND of all active elements in a.
- _mm_mask_reduce_and_epi16 ⚠ Experimental avx512bw,avx512vl
- Reduce the packed 16-bit integers in a by bitwise AND using mask k. Returns the bitwise AND of all active elements in a.
- _mm_mask_reduce_max_epi8 ⚠ Experimental avx512bw,avx512vl
- Reduce the packed 8-bit integers in a by maximum using mask k. Returns the maximum of all active elements in a.
- _mm_mask_reduce_max_epi16 ⚠ Experimental avx512bw,avx512vl
- Reduce the packed 16-bit integers in a by maximum using mask k. Returns the maximum of all active elements in a.
- _mm_mask_reduce_max_epu8 ⚠ Experimental avx512bw,avx512vl
- Reduce the packed unsigned 8-bit integers in a by maximum using mask k. Returns the maximum of all active elements in a.
- _mm_mask_reduce_max_epu16 ⚠ Experimental avx512bw,avx512vl
- Reduce the packed unsigned 16-bit integers in a by maximum using mask k. Returns the maximum of all active elements in a.
- _mm_mask_reduce_min_epi8 ⚠ Experimental avx512bw,avx512vl
- Reduce the packed 8-bit integers in a by minimum using mask k. Returns the minimum of all active elements in a.
- _mm_mask_reduce_min_epi16 ⚠ Experimental avx512bw,avx512vl
- Reduce the packed 16-bit integers in a by minimum using mask k. Returns the minimum of all active elements in a.
- _mm_mask_reduce_min_epu8 ⚠ Experimental avx512bw,avx512vl
- Reduce the packed unsigned 8-bit integers in a by minimum using mask k. Returns the minimum of all active elements in a.
- _mm_mask_reduce_min_epu16 ⚠ Experimental avx512bw,avx512vl
- Reduce the packed unsigned 16-bit integers in a by minimum using mask k. Returns the minimum of all active elements in a.
- _mm_mask_reduce_mul_epi8 ⚠ Experimental avx512bw,avx512vl
- Reduce the packed 8-bit integers in a by multiplication using mask k. Returns the product of all active elements in a.
- _mm_mask_reduce_mul_epi16 ⚠ Experimental avx512bw,avx512vl
- Reduce the packed 16-bit integers in a by multiplication using mask k. Returns the product of all active elements in a.
- _mm_mask_reduce_or_epi8 ⚠ Experimental avx512bw,avx512vl
- Reduce the packed 8-bit integers in a by bitwise OR using mask k. Returns the bitwise OR of all active elements in a.
- _mm_mask_reduce_or_epi16 ⚠ Experimental avx512bw,avx512vl
- Reduce the packed 16-bit integers in a by bitwise OR using mask k. Returns the bitwise OR of all active elements in a.
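The masked reductions above only combine "active" elements, meaning those whose mask bit is set. A minimal scalar sketch follows. It rests on two assumptions that the listing does not state: inactive elements are replaced by the identity of the operation (0 for add, all ones for AND, the smallest value for max), and the 8-bit sum wraps on overflow. All function names are hypothetical.

```rust
/// Generic scalar model: fold only the elements whose mask bit is set.
/// `identity` is what an empty mask yields (an assumption of this sketch).
fn mask_reduce_model(k: u16, a: [i8; 16], identity: i8, op: fn(i8, i8) -> i8) -> i8 {
    let mut acc = identity;
    for i in 0..16usize {
        if (k >> i) & 1 == 1 {
            acc = op(acc, a[i]);
        }
    }
    acc
}

/// Sum of active elements, wrapping in 8 bits (assumed behaviour).
fn mask_reduce_add_epi8_model(k: u16, a: [i8; 16]) -> i8 {
    mask_reduce_model(k, a, 0, |x, y| x.wrapping_add(y))
}

/// Bitwise AND of active elements; identity is all ones.
fn mask_reduce_and_epi8_model(k: u16, a: [i8; 16]) -> i8 {
    mask_reduce_model(k, a, -1, |x, y| x & y)
}

/// Signed maximum of active elements; identity is i8::MIN.
fn mask_reduce_max_epi8_model(k: u16, a: [i8; 16]) -> i8 {
    mask_reduce_model(k, a, i8::MIN, |x, y| x.max(y))
}
```

With the mask `0b1010` and the elements 1 through 16, only the values 2 and 4 take part, so the sum is 6 and the maximum is 4.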
- _mm_mask_set1_epi8 ⚠ Experimental avx512bw,avx512vl
- Broadcast 8-bit integer a to all elements of dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm_mask_set1_epi16 ⚠ Experimental avx512bw,avx512vl
- Broadcast 16-bit integer a to all elements of dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm_mask_shuffle_epi8 ⚠ Experimental avx512bw,avx512vl
- Shuffle 8-bit integers in a within 128-bit lanes using the control in the corresponding 8-bit element of b, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm_mask_shufflehi_epi16 ⚠ Experimental avx512bw,avx512vl
- Shuffle 16-bit integers in the high 64 bits of 128-bit lanes of a using the control in imm8. Store the results in the high 64 bits of 128-bit lanes of dst, with the low 64 bits of 128-bit lanes being copied from a to dst, using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm_mask_shufflelo_epi16 ⚠ Experimental avx512bw,avx512vl
- Shuffle 16-bit integers in the low 64 bits of 128-bit lanes of a using the control in imm8. Store the results in the low 64 bits of 128-bit lanes of dst, with the high 64 bits of 128-bit lanes being copied from a to dst, using writemask k (elements are copied from src when the corresponding mask bit is not set).
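The imm8 control used by the shufflehi and shufflelo entries above packs four 2-bit selectors, one per output element. The sketch below models a single 128-bit lane in scalar Rust; the function names are hypothetical and the arrays stand in for vector registers.

```rust
/// Scalar model of shufflehi on one 128-bit lane: elements 4..8 are
/// rearranged by the 2-bit selectors in `imm8`, elements 0..4 pass through.
fn shufflehi_epi16_model(a: [u16; 8], imm8: u8) -> [u16; 8] {
    let mut dst = a;
    for i in 0..4usize {
        let sel = ((imm8 >> (2 * i)) & 3) as usize;
        dst[4 + i] = a[4 + sel];
    }
    dst
}

/// Scalar model of shufflelo: same idea applied to elements 0..4.
fn shufflelo_epi16_model(a: [u16; 8], imm8: u8) -> [u16; 8] {
    let mut dst = a;
    for i in 0..4usize {
        let sel = ((imm8 >> (2 * i)) & 3) as usize;
        dst[i] = a[sel];
    }
    dst
}

/// Writemask step: take `res` where the mask bit is set, `src` otherwise.
fn mask_blend_epi16_model(src: [u16; 8], k: u8, res: [u16; 8]) -> [u16; 8] {
    let mut dst = src;
    for i in 0..8usize {
        if (k >> i) & 1 == 1 {
            dst[i] = res[i];
        }
    }
    dst
}
```

The control value `0x1B` (selectors 3, 2, 1, 0) reverses the four affected elements.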
- _mm_mask_sll_epi16 ⚠ Experimental avx512bw,avx512vl
- Shift packed 16-bit integers in a left by count while shifting in zeros, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm_mask_slli_epi16 ⚠ Experimental avx512bw,avx512vl
- Shift packed 16-bit integers in a left by imm8 while shifting in zeros, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm_mask_sllv_epi16 ⚠ Experimental avx512bw,avx512vl
- Shift packed 16-bit integers in a left by the amount specified by the corresponding element in count while shifting in zeros, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm_mask_sra_epi16 ⚠ Experimental avx512bw,avx512vl
- Shift packed 16-bit integers in a right by count while shifting in sign bits, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm_mask_srai_epi16 ⚠ Experimental avx512bw,avx512vl
- Shift packed 16-bit integers in a right by imm8 while shifting in sign bits, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm_mask_srav_epi16 ⚠ Experimental avx512bw,avx512vl
- Shift packed 16-bit integers in a right by the amount specified by the corresponding element in count while shifting in sign bits, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm_mask_srl_epi16 ⚠ Experimental avx512bw,avx512vl
- Shift packed 16-bit integers in a right by count while shifting in zeros, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm_mask_srli_epi16 ⚠ Experimental avx512bw,avx512vl
- Shift packed 16-bit integers in a right by imm8 while shifting in zeros, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm_mask_srlv_epi16 ⚠ Experimental avx512bw,avx512vl
- Shift packed 16-bit integers in a right by the amount specified by the corresponding element in count while shifting in zeros, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
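The per-element ("v") shifts above differ mainly in what they shift in and in how they treat counts of 16 or more. A scalar sketch of the unmasked operations, with hypothetical function names and arrays standing in for 128-bit vectors, might look like this:

```rust
/// Logical left shift per element; counts of 16 or more clear the element.
fn sllv_epi16_model(a: [u16; 8], count: [u16; 8]) -> [u16; 8] {
    let mut dst = [0u16; 8];
    for i in 0..8usize {
        dst[i] = if count[i] < 16 { a[i] << count[i] } else { 0 };
    }
    dst
}

/// Logical right shift per element; zeros are shifted in.
fn srlv_epi16_model(a: [u16; 8], count: [u16; 8]) -> [u16; 8] {
    let mut dst = [0u16; 8];
    for i in 0..8usize {
        dst[i] = if count[i] < 16 { a[i] >> count[i] } else { 0 };
    }
    dst
}

/// Arithmetic right shift per element; sign bits are shifted in, so a
/// count of 16 or more leaves 0 or -1 depending on the sign.
fn srav_epi16_model(a: [i16; 8], count: [u16; 8]) -> [i16; 8] {
    let mut dst = [0i16; 8];
    for i in 0..8usize {
        dst[i] = a[i] >> count[i].min(15);
    }
    dst
}
```

The writemask forms would then keep the element from src wherever the corresponding mask bit is clear.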
- _mm_mask_storeu_epi8 ⚠ Experimental avx512bw,avx512vl
- Store packed 8-bit integers from a into memory using writemask k. mem_addr does not need to be aligned on any particular boundary.
- _mm_mask_storeu_epi16 ⚠ Experimental avx512bw,avx512vl
- Store packed 16-bit integers from a into memory using writemask k. mem_addr does not need to be aligned on any particular boundary.
- _mm_mask_sub_epi8 ⚠ Experimental avx512bw,avx512vl
- Subtract packed 8-bit integers in b from packed 8-bit integers in a, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm_mask_sub_epi16 ⚠ Experimental avx512bw,avx512vl
- Subtract packed 16-bit integers in b from packed 16-bit integers in a, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm_mask_subs_epi8 ⚠ Experimental avx512bw,avx512vl
- Subtract packed signed 8-bit integers in b from packed 8-bit integers in a using saturation, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm_mask_subs_epi16 ⚠ Experimental avx512bw,avx512vl
- Subtract packed signed 16-bit integers in b from packed 16-bit integers in a using saturation, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm_mask_subs_epu8 ⚠ Experimental avx512bw,avx512vl
- Subtract packed unsigned 8-bit integers in b from packed unsigned 8-bit integers in a using saturation, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm_mask_subs_epu16 ⚠ Experimental avx512bw,avx512vl
- Subtract packed unsigned 16-bit integers in b from packed unsigned 16-bit integers in a using saturation, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm_mask_test_epi8_mask ⚠ Experimental avx512bw,avx512vl
- Compute the bitwise AND of packed 8-bit integers in a and b, producing intermediate 8-bit values, and set the corresponding bit in result mask k (subject to writemask k) if the intermediate value is non-zero.
- _mm_mask_test_epi16_mask ⚠ Experimental avx512bw,avx512vl
- Compute the bitwise AND of packed 16-bit integers in a and b, producing intermediate 16-bit values, and set the corresponding bit in result mask k (subject to writemask k) if the intermediate value is non-zero.
- _mm_mask_testn_epi8_mask ⚠ Experimental avx512bw,avx512vl
- Compute the bitwise AND of packed 8-bit integers in a and b, producing intermediate 8-bit values, and set the corresponding bit in result mask k (subject to writemask k) if the intermediate value is zero.
- _mm_mask_testn_epi16_mask ⚠ Experimental avx512bw,avx512vl
- Compute the bitwise AND of packed 16-bit integers in a and b, producing intermediate 16-bit values, and set the corresponding bit in result mask k (subject to writemask k) if the intermediate value is zero.
- _mm_mask_unpackhi_epi8 ⚠ Experimental avx512bw,avx512vl
- Unpack and interleave 8-bit integers from the high half of each 128-bit lane in a and b, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm_mask_unpackhi_epi16 ⚠ Experimental avx512bw,avx512vl
- Unpack and interleave 16-bit integers from the high half of each 128-bit lane in a and b, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm_mask_unpacklo_epi8 ⚠ Experimental avx512bw,avx512vl
- Unpack and interleave 8-bit integers from the low half of each 128-bit lane in a and b, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm_mask_unpacklo_epi16 ⚠ Experimental avx512bw,avx512vl
- Unpack and interleave 16-bit integers from the low half of each 128-bit lane in a and b, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
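"Unpack and interleave" in the entries above means alternating elements from a and b, starting with a. A scalar sketch for a single 128-bit lane of 8-bit elements follows; the function names are hypothetical and masking is left out for brevity.

```rust
/// Interleave the low eight bytes of `a` and `b`: a0, b0, a1, b1, ...
fn unpacklo_epi8_model(a: [u8; 16], b: [u8; 16]) -> [u8; 16] {
    let mut dst = [0u8; 16];
    for i in 0..8usize {
        dst[2 * i] = a[i];
        dst[2 * i + 1] = b[i];
    }
    dst
}

/// Interleave the high eight bytes of `a` and `b`: a8, b8, a9, b9, ...
fn unpackhi_epi8_model(a: [u8; 16], b: [u8; 16]) -> [u8; 16] {
    let mut dst = [0u8; 16];
    for i in 0..8usize {
        dst[2 * i] = a[8 + i];
        dst[2 * i + 1] = b[8 + i];
    }
    dst
}
```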
- _mm_maskz_abs_epi8 ⚠ Experimental avx512bw,avx512vl
- Compute the absolute value of packed signed 8-bit integers in a, and store the unsigned results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm_maskz_abs_epi16 ⚠ Experimental avx512bw,avx512vl
- Compute the absolute value of packed signed 16-bit integers in a, and store the unsigned results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm_maskz_add_epi8 ⚠ Experimental avx512bw,avx512vl
- Add packed 8-bit integers in a and b, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm_maskz_add_epi16 ⚠ Experimental avx512bw,avx512vl
- Add packed 16-bit integers in a and b, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm_maskz_adds_epi8 ⚠ Experimental avx512bw,avx512vl
- Add packed signed 8-bit integers in a and b using saturation, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm_maskz_adds_epi16 ⚠ Experimental avx512bw,avx512vl
- Add packed signed 16-bit integers in a and b using saturation, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm_maskz_adds_epu8 ⚠ Experimental avx512bw,avx512vl
- Add packed unsigned 8-bit integers in a and b using saturation, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm_maskz_adds_epu16 ⚠ Experimental avx512bw,avx512vl
- Add packed unsigned 16-bit integers in a and b using saturation, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm_maskz_alignr_epi8 ⚠ Experimental avx512bw,avx512vl
- Concatenate pairs of 16-byte blocks in a and b into a 32-byte temporary result, shift the result right by imm8 bytes, and store the low 16 bytes in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
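The alignr description above can be read as a byte-wise window over the concatenation of a (high half) and b (low half). The scalar sketch below models one 16-byte block and then applies the zeromask; the function names are hypothetical and the arrays stand in for vector registers.

```rust
/// Concatenate `a` (high) and `b` (low) into 32 bytes, shift right by
/// `imm8` bytes, and keep the low 16 bytes. Bytes shifted in are zero.
fn alignr_epi8_model(a: [u8; 16], b: [u8; 16], imm8: u8) -> [u8; 16] {
    let mut tmp = [0u8; 32];
    tmp[..16].copy_from_slice(&b);
    tmp[16..].copy_from_slice(&a);
    let mut dst = [0u8; 16];
    for i in 0..16usize {
        let j = i + imm8 as usize;
        dst[i] = if j < 32 { tmp[j] } else { 0 };
    }
    dst
}

/// Zeromask step: elements whose mask bit is clear become zero.
fn maskz_alignr_epi8_model(k: u16, a: [u8; 16], b: [u8; 16], imm8: u8) -> [u8; 16] {
    let mut dst = alignr_epi8_model(a, b, imm8);
    for i in 0..16usize {
        if (k >> i) & 1 == 0 {
            dst[i] = 0;
        }
    }
    dst
}
```

With b holding bytes 0 to 15 and a holding bytes 16 to 31, a shift of 4 yields bytes 4 to 19.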
- _mm_maskz_avg_epu8 ⚠ Experimental avx512bw,avx512vl
- Average packed unsigned 8-bit integers in a and b, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm_maskz_avg_epu16 ⚠ Experimental avx512bw,avx512vl
- Average packed unsigned 16-bit integers in a and b, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm_maskz_broadcastb_epi8 ⚠ Experimental avx512bw,avx512vl
- Broadcast the low packed 8-bit integer from a to all elements of dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm_maskz_broadcastw_epi16 ⚠ Experimental avx512bw,avx512vl
- Broadcast the low packed 16-bit integer from a to all elements of dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm_maskz_cvtepi8_epi16 ⚠ Experimental avx512bw,avx512vl
- Sign extend packed 8-bit integers in a to packed 16-bit integers, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm_maskz_cvtepi16_epi8 ⚠ Experimental avx512bw,avx512vl
- Convert packed 16-bit integers in a to packed 8-bit integers with truncation, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm_maskz_cvtepu8_epi16 ⚠ Experimental avx512bw,avx512vl
- Zero extend packed unsigned 8-bit integers in a to packed 16-bit integers, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm_maskz_cvtsepi16_epi8 ⚠ Experimental avx512bw,avx512vl
- Convert packed signed 16-bit integers in a to packed 8-bit integers with signed saturation, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm_maskz_cvtusepi16_epi8 ⚠ Experimental avx512bw,avx512vl
- Convert packed unsigned 16-bit integers in a to packed unsigned 8-bit integers with unsigned saturation, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm_maskz_dbsad_epu8 ⚠ Experimental avx512bw,avx512vl
- Compute the sum of absolute differences (SADs) of quadruplets of unsigned 8-bit integers in a compared to those in b, and store the 16-bit results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set). Four SADs are performed on four 8-bit quadruplets for each 64-bit lane. The first two SADs use the lower 8-bit quadruplet of the lane from a, and the last two SADs use the upper 8-bit quadruplet of the lane from a. Quadruplets from b are selected from within 128-bit lanes according to the control in imm8, and each SAD in each 64-bit lane uses the selected quadruplet at 8-bit offsets.
- _mm_maskz_loadu_epi8 ⚠ Experimental avx512bw,avx512vl
- Load packed 8-bit integers from memory into dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set). mem_addr does not need to be aligned on any particular boundary.
- _mm_maskz_loadu_epi16 ⚠ Experimental avx512bw,avx512vl
- Load packed 16-bit integers from memory into dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set). mem_addr does not need to be aligned on any particular boundary.
- _mm_maskz_madd_epi16 ⚠ Experimental avx512bw,avx512vl
- Multiply packed signed 16-bit integers in a and b, producing intermediate signed 32-bit integers. Horizontally add adjacent pairs of intermediate 32-bit integers, and pack the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm_maskz_maddubs_epi16 ⚠ Experimental avx512bw,avx512vl
- Multiply packed unsigned 8-bit integers in a by packed signed 8-bit integers in b, producing intermediate signed 16-bit integers. Horizontally add adjacent pairs of intermediate signed 16-bit integers, and pack the saturated results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
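The two multiply-add entries above both widen, multiply, and then add adjacent pairs, but only maddubs saturates the sum. A scalar sketch of the unmasked arithmetic is given below; the function names are hypothetical, arrays stand in for 128-bit vectors, and the zeromask step is omitted.

```rust
/// madd: multiply signed 16-bit pairs into 32-bit products and add
/// adjacent products. The sum wraps only when all four inputs are -32768.
fn madd_epi16_model(a: [i16; 8], b: [i16; 8]) -> [i32; 4] {
    let mut dst = [0i32; 4];
    for i in 0..4usize {
        let lo = a[2 * i] as i32 * b[2 * i] as i32;
        let hi = a[2 * i + 1] as i32 * b[2 * i + 1] as i32;
        dst[i] = lo.wrapping_add(hi);
    }
    dst
}

/// maddubs: multiply unsigned bytes of `a` by signed bytes of `b`, add
/// adjacent products, and saturate the sum to the signed 16-bit range.
fn maddubs_epi16_model(a: [u8; 16], b: [i8; 16]) -> [i16; 8] {
    let mut dst = [0i16; 8];
    for i in 0..8usize {
        let lo = a[2 * i] as i32 * b[2 * i] as i32;
        let hi = a[2 * i + 1] as i32 * b[2 * i + 1] as i32;
        dst[i] = (lo + hi).clamp(i16::MIN as i32, i16::MAX as i32) as i16;
    }
    dst
}
```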
- _mm_maskz_max_epi8 ⚠ Experimental avx512bw,avx512vl
- Compare packed signed 8-bit integers in a and b, and store packed maximum values in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm_maskz_max_epi16 ⚠ Experimental avx512bw,avx512vl
- Compare packed signed 16-bit integers in a and b, and store packed maximum values in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm_maskz_max_epu8 ⚠ Experimental avx512bw,avx512vl
- Compare packed unsigned 8-bit integers in a and b, and store packed maximum values in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm_maskz_max_epu16 ⚠ Experimental avx512bw,avx512vl
- Compare packed unsigned 16-bit integers in a and b, and store packed maximum values in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm_maskz_min_epi8 ⚠ Experimental avx512bw,avx512vl
- Compare packed signed 8-bit integers in a and b, and store packed minimum values in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm_maskz_min_epi16 ⚠ Experimental avx512bw,avx512vl
- Compare packed signed 16-bit integers in a and b, and store packed minimum values in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm_maskz_min_epu8 ⚠ Experimental avx512bw,avx512vl
- Compare packed unsigned 8-bit integers in a and b, and store packed minimum values in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm_maskz_min_epu16 ⚠ Experimental avx512bw,avx512vl
- Compare packed unsigned 16-bit integers in a and b, and store packed minimum values in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm_maskz_mov_epi8 ⚠ Experimental avx512bw,avx512vl
- Move packed 8-bit integers from a into dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm_maskz_mov_epi16 ⚠ Experimental avx512bw,avx512vl
- Move packed 16-bit integers from a into dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm_maskz_mulhi_epi16 ⚠ Experimental avx512bw,avx512vl
- Multiply the packed signed 16-bit integers in a and b, producing intermediate 32-bit integers, and store the high 16 bits of the intermediate integers in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm_maskz_mulhi_epu16 ⚠ Experimental avx512bw,avx512vl
- Multiply the packed unsigned 16-bit integers in a and b, producing intermediate 32-bit integers, and store the high 16 bits of the intermediate integers in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm_maskz_mulhrs_epi16 ⚠ Experimental avx512bw,avx512vl
- Multiply packed signed 16-bit integers in a and b, producing intermediate signed 32-bit integers. Truncate each intermediate integer to the 18 most significant bits, round by adding 1, and store bits [16:1] to dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm_maskz_mullo_epi16 ⚠ Experimental avx512bw,avx512vl
- Multiply the packed 16-bit integers in a and b, producing intermediate 32-bit integers, and store the low 16 bits of the intermediate integers in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
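The three 16-bit multiplies above differ only in which bits of the 32-bit product they keep. The per-element sketch below spells that out in scalar Rust; the function names are hypothetical, and the rounding formula for mulhrs is written as "shift right by 14, add 1, shift right by 1", which matches the "18 most significant bits, round, store bits [16:1]" wording.

```rust
/// mulhi (signed): keep the high 16 bits of the 32-bit product.
fn mulhi_epi16_model(a: i16, b: i16) -> i16 {
    ((a as i32 * b as i32) >> 16) as i16
}

/// mulhrs: keep the top 18 bits, add 1 to round, then drop the lowest bit.
/// For Q15 fixed-point inputs this is a rounded Q15 product.
fn mulhrs_epi16_model(a: i16, b: i16) -> i16 {
    ((((a as i32 * b as i32) >> 14) + 1) >> 1) as i16
}

/// mullo: keep the low 16 bits of the product (wrapping multiply).
fn mullo_epi16_model(a: i16, b: i16) -> i16 {
    a.wrapping_mul(b)
}
```

For example, treating 16384 as 0.5 in Q15, the mulhrs model returns 8192, which is 0.25.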
- _mm_maskz_packs_epi16 ⚠ Experimental avx512bw,avx512vl
- Convert packed signed 16-bit integers from a and b to packed 8-bit integers using signed saturation, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm_maskz_packs_epi32 ⚠ Experimental avx512bw,avx512vl
- Convert packed signed 32-bit integers from a and b to packed 16-bit integers using signed saturation, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm_maskz_packus_epi16 ⚠ Experimental avx512bw,avx512vl
- Convert packed signed 16-bit integers from a and b to packed 8-bit integers using unsigned saturation, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm_maskz_packus_epi32 ⚠ Experimental avx512bw,avx512vl
- Convert packed signed 32-bit integers from a and b to packed 16-bit integers using unsigned saturation, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm_maskz_permutex2var_epi16 ⚠ Experimental avx512bw,avx512vl
- Shuffle 16-bit integers in a and b across lanes using the corresponding selector and index in idx, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm_maskz_permutexvar_epi16 ⚠ Experimental avx512bw,avx512vl
- Shuffle 16-bit integers in a across lanes using the corresponding index in idx, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm_maskz_set1_epi8 ⚠ Experimental avx512bw,avx512vl
- Broadcast 8-bit integer a to all elements of dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm_maskz_set1_epi16 ⚠ Experimental avx512bw,avx512vl
- Broadcast 16-bit integer a to all elements of dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm_maskz_shuffle_epi8 ⚠ Experimental avx512bw,avx512vl
- Shuffle packed 8-bit integers in a according to shuffle control mask in the corresponding 8-bit element of b, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm_maskz_shufflehi_epi16 ⚠ Experimental avx512bw,avx512vl
- Shuffle 16-bit integers in the high 64 bits of 128-bit lanes of a using the control in imm8. Store the results in the high 64 bits of 128-bit lanes of dst, with the low 64 bits of 128-bit lanes being copied from a to dst, using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm_maskz_shufflelo_epi16 ⚠ Experimental avx512bw,avx512vl
- Shuffle 16-bit integers in the low 64 bits of 128-bit lanes of a using the control in imm8. Store the results in the low 64 bits of 128-bit lanes of dst, with the high 64 bits of 128-bit lanes being copied from a to dst, using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm_maskz_sll_epi16 ⚠ Experimental avx512bw,avx512vl
- Shift packed 16-bit integers in a left by count while shifting in zeros, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm_maskz_slli_epi16 ⚠ Experimental avx512bw,avx512vl
- Shift packed 16-bit integers in a left by imm8 while shifting in zeros, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm_maskz_sllv_epi16 ⚠ Experimental avx512bw,avx512vl
- Shift packed 16-bit integers in a left by the amount specified by the corresponding element in count while shifting in zeros, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm_maskz_sra_epi16 ⚠ Experimental avx512bw,avx512vl
- Shift packed 16-bit integers in a right by count while shifting in sign bits, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm_maskz_srai_epi16 ⚠ Experimental avx512bw,avx512vl
- Shift packed 16-bit integers in a right by imm8 while shifting in sign bits, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm_maskz_srav_epi16 ⚠ Experimental avx512bw,avx512vl
- Shift packed 16-bit integers in a right by the amount specified by the corresponding element in count while shifting in sign bits, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm_maskz_srl_epi16 ⚠ Experimental avx512bw,avx512vl
- Shift packed 16-bit integers in a right by count while shifting in zeros, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm_maskz_srli_epi16 ⚠ Experimental avx512bw,avx512vl
- Shift packed 16-bit integers in a right by imm8 while shifting in zeros, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm_maskz_srlv_epi16 ⚠ Experimental avx512bw,avx512vl
- Shift packed 16-bit integers in a right by the amount specified by the corresponding element in count while shifting in zeros, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm_maskz_sub_epi8 ⚠ Experimental avx512bw,avx512vl
- Subtract packed 8-bit integers in b from packed 8-bit integers in a, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm_maskz_sub_epi16 ⚠ Experimental avx512bw,avx512vl
- Subtract packed 16-bit integers in b from packed 16-bit integers in a, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm_maskz_subs_epi8 ⚠ Experimental avx512bw,avx512vl
- Subtract packed signed 8-bit integers in b from packed 8-bit integers in a using saturation, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm_maskz_subs_epi16 ⚠ Experimental avx512bw,avx512vl
- Subtract packed signed 16-bit integers in b from packed 16-bit integers in a using saturation, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm_maskz_subs_epu8 ⚠ Experimental avx512bw,avx512vl
- Subtract packed unsigned 8-bit integers in b from packed unsigned 8-bit integers in a using saturation, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm_maskz_subs_epu16 ⚠ Experimental avx512bw,avx512vl
- Subtract packed unsigned 16-bit integers in b from packed unsigned 16-bit integers in a using saturation, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm_maskz_unpackhi_epi8 ⚠ Experimental avx512bw,avx512vl
- Unpack and interleave 8-bit integers from the high half of each 128-bit lane in a and b, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm_maskz_unpackhi_epi16 ⚠ Experimental avx512bw,avx512vl
- Unpack and interleave 16-bit integers from the high half of each 128-bit lane in a and b, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm_maskz_unpacklo_epi8 ⚠ Experimental avx512bw,avx512vl
- Unpack and interleave 8-bit integers from the low half of each 128-bit lane in a and b, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm_maskz_unpacklo_epi16 ⚠ Experimental avx512bw,avx512vl
- Unpack and interleave 16-bit integers from the low half of each 128-bit lane in a and b, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm_movepi8_mask ⚠ Experimental avx512bw,avx512vl
- Set each bit of mask register k based on the most significant bit of the corresponding packed 8-bit integer in a.
- _mm_movepi16_mask ⚠ Experimental avx512bw,avx512vl
- Set each bit of mask register k based on the most significant bit of the corresponding packed 16-bit integer in a.
- _mm_movm_epi8 ⚠ Experimental avx512bw,avx512vl
- Set each packed 8-bit integer in dst to all ones or all zeros based on the value of the corresponding bit in k.
- _mm_movm_epi16 ⚠ Experimental avx512bw,avx512vl
- Set each packed 16-bit integer in dst to all ones or all zeros based on the value of the corresponding bit in k.
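The movepi and movm entries above convert between a vector and a bit mask in opposite directions. A scalar sketch for the 8-bit, 128-bit case, with hypothetical function names and arrays standing in for vector registers:

```rust
/// Vector to mask: bit i is the sign (most significant) bit of element i.
fn movepi8_mask_model(a: [i8; 16]) -> u16 {
    let mut k = 0u16;
    for i in 0..16usize {
        if a[i] < 0 {
            k |= 1 << i;
        }
    }
    k
}

/// Mask to vector: element i is all ones (-1) if bit i is set, else zero.
fn movm_epi8_model(k: u16) -> [i8; 16] {
    let mut dst = [0i8; 16];
    for i in 0..16usize {
        if (k >> i) & 1 == 1 {
            dst[i] = -1;
        }
    }
    dst
}
```

Converting a mask to a vector and back returns the original mask, since every all-ones element has its top bit set.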
- _mm_permutex2var_epi16 ⚠ Experimental avx512bw,avx512vl
- Shuffle 16-bit integers in a and b across lanes using the corresponding selector and index in idx, and store the results in dst.
- _mm_permutexvar_epi16 ⚠ Experimental avx512bw,avx512vl
- Shuffle 16-bit integers in a across lanes using the corresponding index in idx, and store the results in dst.
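For the two permute entries above, each element of idx picks a source element. The scalar sketch below assumes the 128-bit case with eight 16-bit elements, where the low three bits of an index give the position and, in the two-source form, bit 3 acts as the selector between a and b. The function names and parameter order are illustrative only.

```rust
/// Single-source permute: dst[i] = a[idx[i] mod 8].
fn permutexvar_epi16_model(idx: [u16; 8], a: [u16; 8]) -> [u16; 8] {
    let mut dst = [0u16; 8];
    for i in 0..8usize {
        dst[i] = a[(idx[i] & 7) as usize];
    }
    dst
}

/// Two-source permute: bit 3 of each index selects `b` over `a`,
/// the low three bits select the element within that source.
fn permutex2var_epi16_model(a: [u16; 8], idx: [u16; 8], b: [u16; 8]) -> [u16; 8] {
    let mut dst = [0u16; 8];
    for i in 0..8usize {
        let off = (idx[i] & 7) as usize;
        dst[i] = if idx[i] & 8 != 0 { b[off] } else { a[off] };
    }
    dst
}
```

Index bits above the selector are ignored in this model, so an index of 16 behaves like 0.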
- _mm_reduce_add_epi8 ⚠ Experimental avx512bw,avx512vl
- Reduce the packed 8-bit integers in a by addition. Returns the sum of all elements in a.
- _mm_reduce_add_epi16 ⚠ Experimental avx512bw,avx512vl
- Reduce the packed 16-bit integers in a by addition. Returns the sum of all elements in a.
- _mm_reduce_and_epi8 ⚠ Experimental avx512bw,avx512vl
- Reduce the packed 8-bit integers in a by bitwise AND. Returns the bitwise AND of all elements in a.
- _mm_reduce_and_epi16 ⚠ Experimental avx512bw,avx512vl
- Reduce the packed 16-bit integers in a by bitwise AND. Returns the bitwise AND of all elements in a.
- _mm_reduce_max_epi8 ⚠ Experimental avx512bw,avx512vl
- Reduce the packed 8-bit integers in a by maximum. Returns the maximum of all elements in a.
- _mm_reduce_max_epi16 ⚠ Experimental avx512bw,avx512vl
- Reduce the packed 16-bit integers in a by maximum. Returns the maximum of all elements in a.
- _mm_reduce_max_epu8 ⚠ Experimental avx512bw,avx512vl
- Reduce the packed unsigned 8-bit integers in a by maximum. Returns the maximum of all elements in a.
- _mm_reduce_max_epu16 ⚠ Experimental avx512bw,avx512vl
- Reduce the packed unsigned 16-bit integers in a by maximum. Returns the maximum of all elements in a.
- _mm_reduce_min_epi8 ⚠ Experimental avx512bw,avx512vl
- Reduce the packed 8-bit integers in a by minimum. Returns the minimum of all elements in a.
- _mm_reduce_min_epi16 ⚠ Experimental avx512bw,avx512vl
- Reduce the packed 16-bit integers in a by minimum. Returns the minimum of all elements in a.
- _mm_reduce_min_epu8 ⚠ Experimental avx512bw,avx512vl
- Reduce the packed unsigned 8-bit integers in a by minimum. Returns the minimum of all elements in a.
- _mm_reduce_min_epu16 ⚠ Experimental avx512bw,avx512vl
- Reduce the packed unsigned 16-bit integers in a by minimum. Returns the minimum of all elements in a.
- _mm_reduce_mul_epi8 ⚠ Experimental avx512bw,avx512vl
- Reduce the packed 8-bit integers in a by multiplication. Returns the product of all elements in a.
- _mm_reduce_mul_epi16 ⚠ Experimental avx512bw,avx512vl
- Reduce the packed 16-bit integers in a by multiplication. Returns the product of all elements in a.
- _mm_reduce_or_epi8 ⚠ Experimental avx512bw,avx512vl
- Reduce the packed 8-bit integers in a by bitwise OR. Returns the bitwise OR of all elements in a.
- _mm_reduce_or_epi16 Experimental avx512bw,avx512vl
- Reduce the packed 16-bit integers in a by bitwise OR. Returns the bitwise OR of all elements in a.
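The reductions above can be pictured with a scalar model. The sketch below is not the intrinsics themselves: the `Lanes8` alias and the un-prefixed function names are hypothetical stand-ins for a 128-bit vector of sixteen 8-bit lanes, and the wrapping behaviour of the product is an assumption about how overflow is handled.

```rust
/// Hypothetical stand-in for a 128-bit vector holding sixteen 8-bit lanes.
type Lanes8 = [i8; 16];

/// Models `_mm_reduce_max_epi8`: lanes are compared as signed values.
fn reduce_max_epi8(a: Lanes8) -> i8 {
    a.iter().copied().fold(i8::MIN, i8::max)
}

/// Models `_mm_reduce_max_epu8`: the same bits, compared as unsigned values.
fn reduce_max_epu8(a: Lanes8) -> u8 {
    a.iter().map(|&x| x as u8).fold(u8::MIN, u8::max)
}

/// Models `_mm_reduce_and_epi8`: bitwise AND of all lanes.
fn reduce_and_epi8(a: Lanes8) -> i8 {
    // -1 has every bit set, so it is the identity for AND.
    a.iter().fold(-1i8, |acc, &x| acc & x)
}

/// Models `_mm_reduce_mul_epi8`, ASSUMING the product wraps on overflow.
fn reduce_mul_epi8(a: Lanes8) -> i8 {
    a.iter().fold(1i8, |acc, &x| acc.wrapping_mul(x))
}
```

In this model a lane holding `-1` is the largest value for the `epu8` variant (it reads as 255) but not for the `epi8` variant, which is the practical difference between the signed and unsigned reductions.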
- _mm_sllv_epi16 Experimental avx512bw,avx512vl
- Shift packed 16-bit integers in a left by the amount specified by the corresponding element in count while shifting in zeros, and store the results in dst.
- _mm_srav_epi16 Experimental avx512bw,avx512vl
- Shift packed 16-bit integers in a right by the amount specified by the corresponding element in count while shifting in sign bits, and store the results in dst.
- _mm_srlv_epi16 Experimental avx512bw,avx512vl
- Shift packed 16-bit integers in a right by the amount specified by the corresponding element in count while shifting in zeros, and store the results in dst.
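The three per-lane variable shifts differ mainly in what they shift in and in how they treat counts of 16 or more. The following scalar sketch models that on plain `[i16; 8]` arrays; the function names are hypothetical, and the out-of-range handling (zero for the logical shifts, all sign bits for the arithmetic shift) follows Intel's published pseudocode for these instructions rather than anything stated in the list above.

```rust
/// Models `_mm_sllv_epi16`: logical left shift per lane, zeros shifted in.
fn sllv_epi16(a: [i16; 8], count: [i16; 8]) -> [i16; 8] {
    let mut dst = [0i16; 8];
    for i in 0..8 {
        let c = count[i] as u16; // counts are interpreted as unsigned
        dst[i] = if c < 16 { ((a[i] as u16) << c) as i16 } else { 0 };
    }
    dst
}

/// Models `_mm_srav_epi16`: arithmetic right shift per lane, sign bits shifted in.
fn srav_epi16(a: [i16; 8], count: [i16; 8]) -> [i16; 8] {
    let mut dst = [0i16; 8];
    for i in 0..8 {
        let c = count[i] as u16;
        // An out-of-range count leaves only copies of the sign bit.
        dst[i] = if c < 16 { a[i] >> c } else if a[i] < 0 { -1 } else { 0 };
    }
    dst
}

/// Models `_mm_srlv_epi16`: logical right shift per lane, zeros shifted in.
fn srlv_epi16(a: [i16; 8], count: [i16; 8]) -> [i16; 8] {
    let mut dst = [0i16; 8];
    for i in 0..8 {
        let c = count[i] as u16;
        dst[i] = if c < 16 { ((a[i] as u16) >> c) as i16 } else { 0 };
    }
    dst
}
```

Note that a negative count such as `-1` reads as 65535 in this model and is therefore out of range, not a shift in the opposite direction.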
- _mm_storeu_epi8 Experimental avx512bw,avx512vl
- Store 128 bits (composed of 16 packed 8-bit integers) from a into memory. mem_addr does not need to be aligned on any particular boundary.
- _mm_storeu_epi16 Experimental avx512bw,avx512vl
- Store 128 bits (composed of 8 packed 16-bit integers) from a into memory. mem_addr does not need to be aligned on any particular boundary.
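To illustrate what "does not need to be aligned" means for the unaligned stores, here is a rough scalar stand-in that writes 16 bytes at an arbitrary offset inside a buffer. It uses `std::ptr::write_unaligned` instead of the intrinsic; `storeu_epi8` and `store_at_offset` are hypothetical helper names, and a `[i8; 16]` array plays the role of the vector argument.

```rust
use std::ptr;

/// Scalar stand-in for `_mm_storeu_epi8`: writes 16 bytes to `mem_addr`
/// without requiring any particular alignment.
///
/// # Safety
/// `mem_addr` must be valid for writing 16 bytes.
unsafe fn storeu_epi8(mem_addr: *mut i8, a: [i8; 16]) {
    unsafe { ptr::write_unaligned(mem_addr as *mut [i8; 16], a) }
}

/// Stores `a` at `offset` inside `buf`; the offset may be odd.
fn store_at_offset(buf: &mut [i8; 32], offset: usize, a: [i8; 16]) {
    assert!(offset + 16 <= buf.len());
    // SAFETY: the bounds check above guarantees 16 writable bytes at `offset`.
    unsafe { storeu_epi8(buf.as_mut_ptr().add(offset), a) }
}
```

Calling `store_at_offset(&mut buf, 3, a)` writes the sixteen lanes to `buf[3..19]` and leaves the neighbouring bytes untouched.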
- _mm_test_epi8_mask Experimental avx512bw,avx512vl
- Compute the bitwise AND of packed 8-bit integers in a and b, producing intermediate 8-bit values, and set the corresponding bit in result mask k if the intermediate value is non-zero.
- _mm_test_epi16_mask Experimental avx512bw,avx512vl
- Compute the bitwise AND of packed 16-bit integers in a and b, producing intermediate 16-bit values, and set the corresponding bit in result mask k if the intermediate value is non-zero.
- _mm_testn_epi8_mask Experimental avx512bw,avx512vl
- Compute the bitwise AND of packed 8-bit integers in a and b, producing intermediate 8-bit values, and set the corresponding bit in result mask k if the intermediate value is zero (the complement of the test variant).
- _mm_testn_epi16_mask Experimental avx512bw,avx512vl
- Compute the bitwise AND of packed 16-bit integers in a and b, producing intermediate 16-bit values, and set the corresponding bit in result mask k if the intermediate value is zero (the complement of the test variant).
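The relationship between the `test` and `testn` mask intrinsics is easiest to see in a scalar model. The sketch below uses hypothetical function names and plain arrays; it assumes, following Intel's pseudocode for the underlying instructions, that both variants AND the lanes and differ only in whether a non-zero or a zero result sets the mask bit.

```rust
/// Models `_mm_test_epi8_mask`: bit i of the mask is set when
/// `a[i] & b[i]` is non-zero.
fn test_epi8_mask(a: [i8; 16], b: [i8; 16]) -> u16 {
    let mut k = 0u16;
    for i in 0..16 {
        if (a[i] & b[i]) != 0 {
            k |= 1 << i;
        }
    }
    k
}

/// Models `_mm_testn_epi8_mask`: bit i of the mask is set when
/// `a[i] & b[i]` is zero (assumed semantics, see the note above).
fn testn_epi8_mask(a: [i8; 16], b: [i8; 16]) -> u16 {
    let mut k = 0u16;
    for i in 0..16 {
        if (a[i] & b[i]) == 0 {
            k |= 1 << i;
        }
    }
    k
}
```

Under that assumption the two masks are always bitwise complements of each other, so `testn_epi8_mask(a, b) == !test_epi8_mask(a, b)` holds for every input in this model.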
- _store_mask32 Experimental avx512bw
- Store 32-bit mask from a into memory.
- _store_mask64 Experimental avx512bw
- Store 64-bit mask from a into memory.