Module avx512bw

Available on x86 or x86-64 only.

Functions

loaddqu8_128 🔒 ⚠
loaddqu8_256 🔒 ⚠
loaddqu8_512 🔒 ⚠
loaddqu16_128 🔒 ⚠
loaddqu16_256 🔒 ⚠
loaddqu16_512 🔒 ⚠
storedqu8_128 🔒 ⚠
storedqu8_256 🔒 ⚠
storedqu8_512 🔒 ⚠
storedqu16_128 🔒 ⚠
storedqu16_256 🔒 ⚠
storedqu16_512 🔒 ⚠
vdbpsadbw 🔒 ⚠
vdbpsadbw128 🔒 ⚠
vdbpsadbw256 🔒 ⚠
vpackssdw 🔒 ⚠
vpacksswb 🔒 ⚠
vpackusdw 🔒 ⚠
vpackuswb 🔒 ⚠
vpermi2w 🔒 ⚠
vpermi2w128 🔒 ⚠
vpermi2w256 🔒 ⚠
vpermw 🔒 ⚠
vpermw128 🔒 ⚠
vpermw256 🔒 ⚠
vpmaddubsw 🔒 ⚠
vpmaddwd 🔒 ⚠
vpmovswb 🔒 ⚠
vpmovswb128 🔒 ⚠
vpmovswb256 🔒 ⚠
vpmovswbmem 🔒 ⚠
vpmovswbmem128 🔒 ⚠
vpmovswbmem256 🔒 ⚠
vpmovuswb 🔒 ⚠
vpmovuswb128 🔒 ⚠
vpmovuswb256 🔒 ⚠
vpmovuswbmem 🔒 ⚠
vpmovuswbmem128 🔒 ⚠
vpmovuswbmem256 🔒 ⚠
vpmovwbmem 🔒 ⚠
vpmovwbmem128 🔒 ⚠
vpmovwbmem256 🔒 ⚠
vpmulhrsw 🔒 ⚠
vpsadbw 🔒 ⚠
vpshufb 🔒 ⚠
vpsllvw 🔒 ⚠
vpsllvw128 🔒 ⚠
vpsllvw256 🔒 ⚠
vpsllw 🔒 ⚠
vpsravw 🔒 ⚠
vpsravw128 🔒 ⚠
vpsravw256 🔒 ⚠
vpsraw 🔒 ⚠
vpsrlvw 🔒 ⚠
vpsrlvw128 🔒 ⚠
vpsrlvw256 🔒 ⚠
vpsrlw 🔒 ⚠
_cvtmask32_u32 ⚠ Experimental avx512bw
Convert 32-bit mask a into an integer value, and store the result in dst.
_cvtu32_mask32 ⚠ Experimental avx512bw
Convert integer value a into a 32-bit mask, and store the result in k.
_kadd_mask32 ⚠ Experimental avx512bw
Add 32-bit masks in a and b, and store the result in k.
_kadd_mask64 ⚠ Experimental avx512bw
Add 64-bit masks in a and b, and store the result in k.
_kand_mask32 ⚠ Experimental avx512bw
Compute the bitwise AND of 32-bit masks a and b, and store the result in k.
_kand_mask64 ⚠ Experimental avx512bw
Compute the bitwise AND of 64-bit masks a and b, and store the result in k.
_kandn_mask32 ⚠ Experimental avx512bw
Compute the bitwise NOT of 32-bit mask a and then AND with b, and store the result in k.
_kandn_mask64 ⚠ Experimental avx512bw
Compute the bitwise NOT of 64-bit mask a and then AND with b, and store the result in k.
_knot_mask32 ⚠ Experimental avx512bw
Compute the bitwise NOT of 32-bit mask a, and store the result in k.
_knot_mask64 ⚠ Experimental avx512bw
Compute the bitwise NOT of 64-bit mask a, and store the result in k.
_kor_mask32 ⚠ Experimental avx512bw
Compute the bitwise OR of 32-bit masks a and b, and store the result in k.
_kor_mask64 ⚠ Experimental avx512bw
Compute the bitwise OR of 64-bit masks a and b, and store the result in k.
_kortest_mask32_u8 ⚠ Experimental avx512bw
Compute the bitwise OR of 32-bit masks a and b. If the result is all zeros, store 1 in dst, otherwise store 0 in dst. If the result is all ones, store 1 in all_ones, otherwise store 0 in all_ones.
_kortest_mask64_u8 ⚠ Experimental avx512bw
Compute the bitwise OR of 64-bit masks a and b. If the result is all zeros, store 1 in dst, otherwise store 0 in dst. If the result is all ones, store 1 in all_ones, otherwise store 0 in all_ones.
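The two flags produced by the kortest operations can be illustrated with a portable scalar model. This is only a sketch of the semantics described above, not the intrinsic itself: the function name `kortest32` and the tuple return (zero flag first, all-ones flag second) are assumptions made for illustration.

```rust
/// Scalar model of a 32-bit "OR then test" on mask values.
/// Returns (dst, all_ones): dst is 1 when a | b is all zeros,
/// all_ones is 1 when a | b has every bit set.
fn kortest32(a: u32, b: u32) -> (u8, u8) {
    let or = a | b;
    ((or == 0) as u8, (or == u32::MAX) as u8)
}
```

Two masks that together cover every bit position, such as `0xFFFF_0000` and `0x0000_FFFF`, set only the all-ones flag.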
_kortestc_mask32_u8 ⚠ Experimental avx512bw
Compute the bitwise OR of 32-bit masks a and b. If the result is all ones, store 1 in dst, otherwise store 0 in dst.
_kortestc_mask64_u8 ⚠ Experimental avx512bw
Compute the bitwise OR of 64-bit masks a and b. If the result is all ones, store 1 in dst, otherwise store 0 in dst.
_kortestz_mask32_u8 ⚠ Experimental avx512bw
Compute the bitwise OR of 32-bit masks a and b. If the result is all zeros, store 1 in dst, otherwise store 0 in dst.
_kortestz_mask64_u8 ⚠ Experimental avx512bw
Compute the bitwise OR of 64-bit masks a and b. If the result is all zeros, store 1 in dst, otherwise store 0 in dst.
_kshiftli_mask32 ⚠ Experimental avx512bw
Shift the bits of 32-bit mask a left by count while shifting in zeros, and store the least significant 32 bits of the result in k.
_kshiftli_mask64 ⚠ Experimental avx512bw
Shift the bits of 64-bit mask a left by count while shifting in zeros, and store the least significant 64 bits of the result in k.
_kshiftri_mask32 ⚠ Experimental avx512bw
Shift the bits of 32-bit mask a right by count while shifting in zeros, and store the least significant 32 bits of the result in k.
_kshiftri_mask64 ⚠ Experimental avx512bw
Shift the bits of 64-bit mask a right by count while shifting in zeros, and store the least significant 64 bits of the result in k.
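A rough scalar sketch of the mask shifts follows. The function names and the runtime `count` parameter are hypothetical. The sketch also assumes that a count greater than or equal to the mask width yields zero, which is how the underlying KSHIFT instructions are understood to behave; treat that as an assumption rather than a documented guarantee of these functions.

```rust
/// Scalar model of a left shift on a 32-bit mask.
/// Assumption: counts of 32 or more produce 0 instead of wrapping.
fn kshiftli32(a: u32, count: u32) -> u32 {
    a.checked_shl(count).unwrap_or(0)
}

/// Scalar model of a right shift on a 64-bit mask.
/// Assumption: counts of 64 or more produce 0 instead of wrapping.
fn kshiftri64(a: u64, count: u32) -> u64 {
    a.checked_shr(count).unwrap_or(0)
}
```

Bits shifted past either end are discarded, so shifting `0x8000_0001` left by one leaves only bit 1 set.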
_ktest_mask32_u8 ⚠ Experimental avx512bw
Compute the bitwise AND of 32-bit masks a and b; if the result is all zeros, store 1 in dst, otherwise store 0 in dst. Then compute the bitwise NOT of a and AND it with b; if the result is all zeros, store 1 in and_not, otherwise store 0 in and_not.
_ktest_mask64_u8 ⚠ Experimental avx512bw
Compute the bitwise AND of 64-bit masks a and b; if the result is all zeros, store 1 in dst, otherwise store 0 in dst. Then compute the bitwise NOT of a and AND it with b; if the result is all zeros, store 1 in and_not, otherwise store 0 in and_not.
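The pair of results from the ktest operations can be modeled in scalar code as follows. This is a sketch of the described semantics only; `ktest32` and its tuple return are hypothetical names chosen for illustration.

```rust
/// Scalar model of the 32-bit mask test.
/// Returns (dst, and_not):
///   dst     = 1 when (a & b) is all zeros,
///   and_not = 1 when (!a & b) is all zeros, i.e. every bit set in b is also set in a.
fn ktest32(a: u32, b: u32) -> (u8, u8) {
    let and_zero = ((a & b) == 0) as u8;
    let andn_zero = ((!a & b) == 0) as u8;
    (and_zero, andn_zero)
}
```

Read this way, the first flag answers "are a and b disjoint?" and the second answers "is b a subset of a?".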
_ktestc_mask32_u8 ⚠ Experimental avx512bw
Compute the bitwise NOT of 32-bit mask a and then AND with 32-bit mask b; if the result is all zeros, store 1 in dst, otherwise store 0 in dst.
_ktestc_mask64_u8 ⚠ Experimental avx512bw
Compute the bitwise NOT of 64-bit mask a and then AND with 64-bit mask b; if the result is all zeros, store 1 in dst, otherwise store 0 in dst.
_ktestz_mask32_u8 ⚠ Experimental avx512bw
Compute the bitwise AND of 32-bit masks a and b; if the result is all zeros, store 1 in dst, otherwise store 0 in dst.
_ktestz_mask64_u8 ⚠ Experimental avx512bw
Compute the bitwise AND of 64-bit masks a and b; if the result is all zeros, store 1 in dst, otherwise store 0 in dst.
_kxnor_mask32 ⚠ Experimental avx512bw
Compute the bitwise XNOR of 32-bit masks a and b, and store the result in k.
_kxnor_mask64 ⚠ Experimental avx512bw
Compute the bitwise XNOR of 64-bit masks a and b, and store the result in k.
_kxor_mask32 ⚠ Experimental avx512bw
Compute the bitwise XOR of 32-bit masks a and b, and store the result in k.
_kxor_mask64 ⚠ Experimental avx512bw
Compute the bitwise XOR of 64-bit masks a and b, and store the result in k.
_load_mask32 ⚠ Experimental avx512bw
Load 32-bit mask from memory into k.
_load_mask64 ⚠ Experimental avx512bw
Load 64-bit mask from memory into k.
_mm256_cmp_epi8_mask ⚠ Experimental avx512bw,avx512vl
Compare packed signed 8-bit integers in a and b based on the comparison operand specified by imm8, and store the results in mask vector k.
_mm256_cmp_epi16_mask ⚠ Experimental avx512bw,avx512vl
Compare packed signed 16-bit integers in a and b based on the comparison operand specified by imm8, and store the results in mask vector k.
_mm256_cmp_epu8_mask ⚠ Experimental avx512bw,avx512vl
Compare packed unsigned 8-bit integers in a and b based on the comparison operand specified by imm8, and store the results in mask vector k.
_mm256_cmp_epu16_mask ⚠ Experimental avx512bw,avx512vl
Compare packed unsigned 16-bit integers in a and b based on the comparison operand specified by imm8, and store the results in mask vector k.
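To make the imm8-driven comparisons concrete, here is a scalar sketch over plain arrays. It assumes the usual integer predicate encoding (0 = EQ, 1 = LT, 2 = LE, 3 = FALSE, 4 = NE, 5 = NLT, 6 = NLE, 7 = TRUE) and takes imm8 as a runtime value for simplicity. The helper names are hypothetical, and the arrays stand in for 256-bit vectors of 32 bytes.

```rust
/// Evaluate the predicate selected by the low three bits of imm8.
fn cmp_pred<T: PartialOrd>(a: T, b: T, imm8: u8) -> bool {
    match imm8 & 0x7 {
        0 => a == b, // EQ
        1 => a < b,  // LT
        2 => a <= b, // LE
        3 => false,  // FALSE
        4 => a != b, // NE
        5 => a >= b, // NLT (not less than)
        6 => a > b,  // NLE (not less than or equal)
        _ => true,   // TRUE
    }
}

/// Signed 8-bit model: bit i of the result is set when the predicate holds for lane i.
fn cmp_epi8_mask(a: &[i8; 32], b: &[i8; 32], imm8: u8) -> u32 {
    let mut k = 0u32;
    for i in 0..32 {
        if cmp_pred(a[i], b[i], imm8) {
            k |= 1u32 << i;
        }
    }
    k
}

/// Unsigned 8-bit model: same bit layout, but lanes are compared as unsigned values.
fn cmp_epu8_mask(a: &[u8; 32], b: &[u8; 32], imm8: u8) -> u32 {
    let mut k = 0u32;
    for i in 0..32 {
        if cmp_pred(a[i], b[i], imm8) {
            k |= 1u32 << i;
        }
    }
    k
}
```

The same bit pattern gives opposite answers under the two interpretations: the byte `0xFF` is -1 when signed (less than 1) and 255 when unsigned (greater than 1).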
_mm256_cmpeq_epi8_mask ⚠ Experimental avx512bw,avx512vl
Compare packed signed 8-bit integers in a and b for equality, and store the results in mask vector k.
_mm256_cmpeq_epi16_mask ⚠ Experimental avx512bw,avx512vl
Compare packed signed 16-bit integers in a and b for equality, and store the results in mask vector k.
_mm256_cmpeq_epu8_mask ⚠ Experimental avx512bw,avx512vl
Compare packed unsigned 8-bit integers in a and b for equality, and store the results in mask vector k.
_mm256_cmpeq_epu16_mask ⚠ Experimental avx512bw,avx512vl
Compare packed unsigned 16-bit integers in a and b for equality, and store the results in mask vector k.
_mm256_cmpge_epi8_mask ⚠ Experimental avx512bw,avx512vl
Compare packed signed 8-bit integers in a and b for greater-than-or-equal, and store the results in mask vector k.
_mm256_cmpge_epi16_mask ⚠ Experimental avx512bw,avx512vl
Compare packed signed 16-bit integers in a and b for greater-than-or-equal, and store the results in mask vector k.
_mm256_cmpge_epu8_mask ⚠ Experimental avx512bw,avx512vl
Compare packed unsigned 8-bit integers in a and b for greater-than-or-equal, and store the results in mask vector k.
_mm256_cmpge_epu16_mask ⚠ Experimental avx512bw,avx512vl
Compare packed unsigned 16-bit integers in a and b for greater-than-or-equal, and store the results in mask vector k.
_mm256_cmpgt_epi8_mask ⚠ Experimental avx512bw,avx512vl
Compare packed signed 8-bit integers in a and b for greater-than, and store the results in mask vector k.
_mm256_cmpgt_epi16_mask ⚠ Experimental avx512bw,avx512vl
Compare packed signed 16-bit integers in a and b for greater-than, and store the results in mask vector k.
_mm256_cmpgt_epu8_mask ⚠ Experimental avx512bw,avx512vl
Compare packed unsigned 8-bit integers in a and b for greater-than, and store the results in mask vector k.
_mm256_cmpgt_epu16_mask ⚠ Experimental avx512bw,avx512vl
Compare packed unsigned 16-bit integers in a and b for greater-than, and store the results in mask vector k.
_mm256_cmple_epi8_mask ⚠ Experimental avx512bw,avx512vl
Compare packed signed 8-bit integers in a and b for less-than-or-equal, and store the results in mask vector k.
_mm256_cmple_epi16_mask ⚠ Experimental avx512bw,avx512vl
Compare packed signed 16-bit integers in a and b for less-than-or-equal, and store the results in mask vector k.
_mm256_cmple_epu8_mask ⚠ Experimental avx512bw,avx512vl
Compare packed unsigned 8-bit integers in a and b for less-than-or-equal, and store the results in mask vector k.
_mm256_cmple_epu16_mask ⚠ Experimental avx512bw,avx512vl
Compare packed unsigned 16-bit integers in a and b for less-than-or-equal, and store the results in mask vector k.
_mm256_cmplt_epi8_mask ⚠ Experimental avx512bw,avx512vl
Compare packed signed 8-bit integers in a and b for less-than, and store the results in mask vector k.
_mm256_cmplt_epi16_mask ⚠ Experimental avx512bw,avx512vl
Compare packed signed 16-bit integers in a and b for less-than, and store the results in mask vector k.
_mm256_cmplt_epu8_mask ⚠ Experimental avx512bw,avx512vl
Compare packed unsigned 8-bit integers in a and b for less-than, and store the results in mask vector k.
_mm256_cmplt_epu16_mask ⚠ Experimental avx512bw,avx512vl
Compare packed unsigned 16-bit integers in a and b for less-than, and store the results in mask vector k.
_mm256_cmpneq_epi8_mask ⚠ Experimental avx512bw,avx512vl
Compare packed signed 8-bit integers in a and b for not-equal, and store the results in mask vector k.
_mm256_cmpneq_epi16_mask ⚠ Experimental avx512bw,avx512vl
Compare packed signed 16-bit integers in a and b for not-equal, and store the results in mask vector k.
_mm256_cmpneq_epu8_mask ⚠ Experimental avx512bw,avx512vl
Compare packed unsigned 8-bit integers in a and b for not-equal, and store the results in mask vector k.
_mm256_cmpneq_epu16_mask ⚠ Experimental avx512bw,avx512vl
Compare packed unsigned 16-bit integers in a and b for not-equal, and store the results in mask vector k.
_mm256_cvtepi16_epi8 ⚠ Experimental avx512bw,avx512vl
Convert packed 16-bit integers in a to packed 8-bit integers with truncation, and store the results in dst.
_mm256_cvtsepi16_epi8 ⚠ Experimental avx512bw,avx512vl
Convert packed signed 16-bit integers in a to packed 8-bit integers with signed saturation, and store the results in dst.
_mm256_cvtusepi16_epi8 ⚠ Experimental avx512bw,avx512vl
Convert packed unsigned 16-bit integers in a to packed unsigned 8-bit integers with unsigned saturation, and store the results in dst.
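The three narrowing modes differ only in how out-of-range values are handled. A per-element scalar sketch (function names are hypothetical, and each function models one lane rather than a whole vector):

```rust
/// Truncation: keep the low 8 bits and discard the rest.
fn cvt_trunc(x: i16) -> i8 {
    x as i8
}

/// Signed saturation: clamp to the i8 range before narrowing.
fn cvt_sat_signed(x: i16) -> i8 {
    x.clamp(i8::MIN as i16, i8::MAX as i16) as i8
}

/// Unsigned saturation: clamp to the u8 range before narrowing.
fn cvt_sat_unsigned(x: u16) -> u8 {
    x.min(u8::MAX as u16) as u8
}
```

For the input 384 (0x0180), truncation keeps 0x80 and so yields -128, while signed saturation yields 127 and unsigned saturation yields 255.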
_mm256_dbsad_epu8 ⚠ Experimental avx512bw,avx512vl
Compute the sum of absolute differences (SADs) of quadruplets of unsigned 8-bit integers in a compared to those in b, and store the 16-bit results in dst. Four SADs are performed on four 8-bit quadruplets for each 64-bit lane. The first two SADs use the lower 8-bit quadruplet of the lane from a, and the last two SADs use the upper 8-bit quadruplet of the lane from a. Quadruplets from b are selected from within 128-bit lanes according to the control in imm8, and each SAD in each 64-bit lane uses the selected quadruplet at 8-bit offsets.
_mm256_loadu_epi8 ⚠ Experimental avx512bw,avx512vl
Load 256 bits (composed of 32 packed 8-bit integers) from memory into dst. mem_addr does not need to be aligned on any particular boundary.
_mm256_loadu_epi16 ⚠ Experimental avx512bw,avx512vl
Load 256 bits (composed of 16 packed 16-bit integers) from memory into dst. mem_addr does not need to be aligned on any particular boundary.
_mm256_mask2_permutex2var_epi16 ⚠ Experimental avx512bw,avx512vl
Shuffle 16-bit integers in a and b across lanes using the corresponding selector and index in idx, and store the results in dst using writemask k (elements are copied from idx when the corresponding mask bit is not set).
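The "selector and index" wording can be unpacked with a scalar sketch. For sixteen 16-bit lanes, it assumes the low four bits of each idx element pick the source position and bit 4 selects between a (0) and b (1). The function name is hypothetical and arrays stand in for 256-bit vectors.

```rust
/// Scalar model of the two-source 16-bit permute where inactive lanes keep idx.
fn mask2_permutex2var_epi16(
    a: &[u16; 16],
    idx: &[u16; 16],
    k: u16,
    b: &[u16; 16],
) -> [u16; 16] {
    let mut dst = *idx; // lanes whose mask bit is clear are copied from idx
    for i in 0..16 {
        if ((k >> i) & 1) == 1 {
            let off = (idx[i] & 0xF) as usize; // index within the chosen source
            let from_b = (idx[i] & 0x10) != 0; // selector bit
            dst[i] = if from_b { b[off] } else { a[off] };
        }
    }
    dst
}
```

Because the fallback is idx rather than a separate src operand, a lane with a clear mask bit simply returns its own index value.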
_mm256_mask_abs_epi8 ⚠ Experimental avx512bw,avx512vl
Compute the absolute value of packed signed 8-bit integers in a, and store the unsigned results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
_mm256_mask_abs_epi16 ⚠ Experimental avx512bw,avx512vl
Compute the absolute value of packed signed 16-bit integers in a, and store the unsigned results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
_mm256_mask_add_epi8 ⚠ Experimental avx512bw,avx512vl
Add packed 8-bit integers in a and b, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
_mm256_mask_add_epi16 ⚠ Experimental avx512bw,avx512vl
Add packed 16-bit integers in a and b, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
_mm256_mask_adds_epi8 ⚠ Experimental avx512bw,avx512vl
Add packed signed 8-bit integers in a and b using saturation, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
_mm256_mask_adds_epi16 ⚠ Experimental avx512bw,avx512vl
Add packed signed 16-bit integers in a and b using saturation, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
_mm256_mask_adds_epu8 ⚠ Experimental avx512bw,avx512vl
Add packed unsigned 8-bit integers in a and b using saturation, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
_mm256_mask_adds_epu16 ⚠ Experimental avx512bw,avx512vl
Add packed unsigned 16-bit integers in a and b using saturation, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
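The writemask pattern shared by the `mask_` operations (compute where the bit is set, copy from src where it is not) can be sketched in scalar form using saturating addition as the example. The function name is hypothetical and arrays stand in for 256-bit vectors of 32 signed bytes.

```rust
/// Scalar model of a merge-masked saturating add on 32 signed bytes.
fn mask_adds_epi8(src: &[i8; 32], k: u32, a: &[i8; 32], b: &[i8; 32]) -> [i8; 32] {
    let mut dst = *src; // inactive lanes keep the value from src
    for i in 0..32 {
        if ((k >> i) & 1) == 1 {
            dst[i] = a[i].saturating_add(b[i]); // clamps at i8::MIN / i8::MAX
        }
    }
    dst
}
```

With `a = b = 100` in every lane and only bit 0 of k set, lane 0 saturates to 127 while all other lanes keep their src value.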
_mm256_mask_alignr_epi8 ⚠ Experimental avx512bw,avx512vl
Concatenate pairs of 16-byte blocks in a and b into a 32-byte temporary result, shift the result right by imm8 bytes, and store the low 16 bytes in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
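The per-lane concatenate-and-shift can be sketched over byte arrays. In this model the block from b forms the low half of the 32-byte temporary and the block from a the high half, bytes shifted in from beyond the temporary are zero, and imm8 is passed as a runtime value. The function name and parameter types are assumptions for illustration.

```rust
/// Scalar model of the masked byte-align operation on two 16-byte lanes.
fn mask_alignr_epi8(
    src: &[u8; 32],
    k: u32,
    a: &[u8; 32],
    b: &[u8; 32],
    imm8: usize,
) -> [u8; 32] {
    let mut dst = *src; // inactive bytes keep the value from src
    for lane in 0..2 {
        let base = lane * 16;
        // 32-byte temporary for this lane: b in bytes 0..16, a in bytes 16..32.
        let mut tmp = [0u8; 32];
        tmp[..16].copy_from_slice(&b[base..base + 16]);
        tmp[16..].copy_from_slice(&a[base..base + 16]);
        for j in 0..16 {
            if ((k >> (base + j)) & 1) == 1 {
                let s = j + imm8; // byte position after shifting right by imm8
                dst[base + j] = if s < 32 { tmp[s] } else { 0 };
            }
        }
    }
    dst
}
```

Each 128-bit lane is processed independently, so bytes never cross from the lower lane into the upper one.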
_mm256_mask_avg_epu8 ⚠ Experimental avx512bw,avx512vl
Average packed unsigned 8-bit integers in a and b, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
_mm256_mask_avg_epu16 ⚠ Experimental avx512bw,avx512vl
Average packed unsigned 16-bit integers in a and b, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
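The packed average rounds halves upward rather than truncating. A one-lane scalar sketch, with a hypothetical function name and the writemask step left out:

```rust
/// Scalar model of the rounding average of two unsigned bytes: (a + b + 1) >> 1.
/// The sum is widened to u16 so it cannot overflow.
fn avg_epu8(a: u8, b: u8) -> u8 {
    ((a as u16 + b as u16 + 1) >> 1) as u8
}
```

For example, the average of 1 and 2 comes out as 2, not 1.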
_mm256_mask_blend_epi8 ⚠ Experimental avx512bw,avx512vl
Blend packed 8-bit integers from a and b using control mask k, and store the results in dst.
_mm256_mask_blend_epi16 ⚠ Experimental avx512bw,avx512vl
Blend packed 16-bit integers from a and b using control mask k, and store the results in dst.
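The blend description does not spell out which operand a set bit selects. In the sketch below a set bit takes the element from b and a clear bit takes it from a, which is the convention of the corresponding Intel intrinsic. The function name is hypothetical.

```rust
/// Scalar model of a mask-controlled blend of sixteen 16-bit lanes.
fn mask_blend_epi16(k: u16, a: &[i16; 16], b: &[i16; 16]) -> [i16; 16] {
    let mut dst = *a; // clear mask bit: element comes from a
    for i in 0..16 {
        if ((k >> i) & 1) == 1 {
            dst[i] = b[i]; // set mask bit: element comes from b
        }
    }
    dst
}
```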
_mm256_mask_broadcastb_epi8 ⚠ Experimental avx512bw,avx512vl
Broadcast the low packed 8-bit integer from a to all elements of dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
_mm256_mask_broadcastw_epi16 ⚠ Experimental avx512bw,avx512vl
Broadcast the low packed 16-bit integer from a to all elements of dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
_mm256_mask_cmp_epi8_mask ⚠ Experimental avx512bw,avx512vl
Compare packed signed 8-bit integers in a and b based on the comparison operand specified by imm8, and store the results in mask vector k using zeromask k1 (elements are zeroed out when the corresponding mask bit is not set).
_mm256_mask_cmp_epi16_mask ⚠ Experimental avx512bw,avx512vl
Compare packed signed 16-bit integers in a and b based on the comparison operand specified by imm8, and store the results in mask vector k using zeromask k1 (elements are zeroed out when the corresponding mask bit is not set).
_mm256_mask_cmp_epu8_mask ⚠ Experimental avx512bw,avx512vl
Compare packed unsigned 8-bit integers in a and b based on the comparison operand specified by imm8, and store the results in mask vector k using zeromask k1 (elements are zeroed out when the corresponding mask bit is not set).
_mm256_mask_cmp_epu16_mask ⚠ Experimental avx512bw,avx512vl
Compare packed unsigned 16-bit integers in a and b based on the comparison operand specified by imm8, and store the results in mask vector k using zeromask k1 (elements are zeroed out when the corresponding mask bit is not set).
_mm256_mask_cmpeq_epi8_mask ⚠ Experimental avx512bw,avx512vl
Compare packed signed 8-bit integers in a and b for equality, and store the results in mask vector k using zeromask k1 (elements are zeroed out when the corresponding mask bit is not set).
_mm256_mask_cmpeq_epi16_mask ⚠ Experimental avx512bw,avx512vl
Compare packed signed 16-bit integers in a and b for equality, and store the results in mask vector k using zeromask k1 (elements are zeroed out when the corresponding mask bit is not set).
_mm256_mask_cmpeq_epu8_mask ⚠ Experimental avx512bw,avx512vl
Compare packed unsigned 8-bit integers in a and b for equality, and store the results in mask vector k using zeromask k1 (elements are zeroed out when the corresponding mask bit is not set).
_mm256_mask_cmpeq_epu16_mask ⚠ Experimental avx512bw,avx512vl
Compare packed unsigned 16-bit integers in a and b for equality, and store the results in mask vector k using zeromask k1 (elements are zeroed out when the corresponding mask bit is not set).
_mm256_mask_cmpge_epi8_mask ⚠ Experimental avx512bw,avx512vl
Compare packed signed 8-bit integers in a and b for greater-than-or-equal, and store the results in mask vector k using zeromask k1 (elements are zeroed out when the corresponding mask bit is not set).
_mm256_mask_cmpge_epi16_mask ⚠ Experimental avx512bw,avx512vl
Compare packed signed 16-bit integers in a and b for greater-than-or-equal, and store the results in mask vector k using zeromask k1 (elements are zeroed out when the corresponding mask bit is not set).
_mm256_mask_cmpge_epu8_mask ⚠ Experimental avx512bw,avx512vl
Compare packed unsigned 8-bit integers in a and b for greater-than-or-equal, and store the results in mask vector k using zeromask k1 (elements are zeroed out when the corresponding mask bit is not set).
_mm256_mask_cmpge_epu16_mask ⚠ Experimental avx512bw,avx512vl
Compare packed unsigned 16-bit integers in a and b for greater-than-or-equal, and store the results in mask vector k using zeromask k1 (elements are zeroed out when the corresponding mask bit is not set).
_mm256_mask_cmpgt_epi8_mask ⚠ Experimental avx512bw,avx512vl
Compare packed signed 8-bit integers in a and b for greater-than, and store the results in mask vector k using zeromask k1 (elements are zeroed out when the corresponding mask bit is not set).
_mm256_mask_cmpgt_epi16_mask ⚠ Experimental avx512bw,avx512vl
Compare packed signed 16-bit integers in a and b for greater-than, and store the results in mask vector k using zeromask k1 (elements are zeroed out when the corresponding mask bit is not set).
_mm256_mask_cmpgt_epu8_mask ⚠ Experimental avx512bw,avx512vl
Compare packed unsigned 8-bit integers in a and b for greater-than, and store the results in mask vector k using zeromask k1 (elements are zeroed out when the corresponding mask bit is not set).
_mm256_mask_cmpgt_epu16_mask ⚠ Experimental avx512bw,avx512vl
Compare packed unsigned 16-bit integers in a and b for greater-than, and store the results in mask vector k using zeromask k1 (elements are zeroed out when the corresponding mask bit is not set).
_mm256_mask_cmple_epi8_mask ⚠ Experimental avx512bw,avx512vl
Compare packed signed 8-bit integers in a and b for less-than-or-equal, and store the results in mask vector k using zeromask k1 (elements are zeroed out when the corresponding mask bit is not set).
_mm256_mask_cmple_epi16_mask ⚠ Experimental avx512bw,avx512vl
Compare packed signed 16-bit integers in a and b for less-than-or-equal, and store the results in mask vector k using zeromask k1 (elements are zeroed out when the corresponding mask bit is not set).
_mm256_mask_cmple_epu8_mask ⚠ Experimental avx512bw,avx512vl
Compare packed unsigned 8-bit integers in a and b for less-than-or-equal, and store the results in mask vector k using zeromask k1 (elements are zeroed out when the corresponding mask bit is not set).
_mm256_mask_cmple_epu16_mask ⚠ Experimental avx512bw,avx512vl
Compare packed unsigned 16-bit integers in a and b for less-than-or-equal, and store the results in mask vector k using zeromask k1 (elements are zeroed out when the corresponding mask bit is not set).
_mm256_mask_cmplt_epi8_mask ⚠ Experimental avx512bw,avx512vl
Compare packed signed 8-bit integers in a and b for less-than, and store the results in mask vector k using zeromask k1 (elements are zeroed out when the corresponding mask bit is not set).
_mm256_mask_cmplt_epi16_mask ⚠ Experimental avx512bw,avx512vl
Compare packed signed 16-bit integers in a and b for less-than, and store the results in mask vector k using zeromask k1 (elements are zeroed out when the corresponding mask bit is not set).
_mm256_mask_cmplt_epu8_mask ⚠ Experimental avx512bw,avx512vl
Compare packed unsigned 8-bit integers in a and b for less-than, and store the results in mask vector k using zeromask k1 (elements are zeroed out when the corresponding mask bit is not set).
_mm256_mask_cmplt_epu16_mask ⚠ Experimental avx512bw,avx512vl
Compare packed unsigned 16-bit integers in a and b for less-than, and store the results in mask vector k using zeromask k1 (elements are zeroed out when the corresponding mask bit is not set).
_mm256_mask_cmpneq_epi8_mask ⚠ Experimental avx512bw,avx512vl
Compare packed signed 8-bit integers in a and b for not-equal, and store the results in mask vector k using zeromask k1 (elements are zeroed out when the corresponding mask bit is not set).
_mm256_mask_cmpneq_epi16_mask ⚠ Experimental avx512bw,avx512vl
Compare packed signed 16-bit integers in a and b for not-equal, and store the results in mask vector k using zeromask k1 (elements are zeroed out when the corresponding mask bit is not set).
_mm256_mask_cmpneq_epu8_mask ⚠ Experimental avx512bw,avx512vl
Compare packed unsigned 8-bit integers in a and b for not-equal, and store the results in mask vector k using zeromask k1 (elements are zeroed out when the corresponding mask bit is not set).
_mm256_mask_cmpneq_epu16_mask ⚠ Experimental avx512bw,avx512vl
Compare packed unsigned 16-bit integers in a and b for not-equal, and store the results in mask vector k using zeromask k1 (elements are zeroed out when the corresponding mask bit is not set).
_mm256_mask_cvtepi8_epi16 ⚠ Experimental avx512bw,avx512vl
Sign extend packed 8-bit integers in a to packed 16-bit integers, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
_mm256_mask_cvtepi16_epi8 ⚠ Experimental avx512bw,avx512vl
Convert packed 16-bit integers in a to packed 8-bit integers with truncation, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
_mm256_mask_cvtepi16_storeu_epi8 ⚠ Experimental avx512bw,avx512vl
Convert packed 16-bit integers in a to packed 8-bit integers with truncation, and store the active results (those with their respective bit set in writemask k) to unaligned memory at base_addr.
_mm256_mask_cvtepu8_epi16 ⚠ Experimental avx512bw,avx512vl
Zero extend packed unsigned 8-bit integers in a to packed 16-bit integers, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
_mm256_mask_cvtsepi16_epi8 ⚠ Experimental avx512bw,avx512vl
Convert packed signed 16-bit integers in a to packed 8-bit integers with signed saturation, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
_mm256_mask_cvtsepi16_storeu_epi8 ⚠ Experimental avx512bw,avx512vl
Convert packed signed 16-bit integers in a to packed 8-bit integers with signed saturation, and store the active results (those with their respective bit set in writemask k) to unaligned memory at base_addr.
_mm256_mask_cvtusepi16_epi8 ⚠ Experimental avx512bw,avx512vl
Convert packed unsigned 16-bit integers in a to packed unsigned 8-bit integers with unsigned saturation, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
_mm256_mask_cvtusepi16_storeu_epi8 ⚠ Experimental avx512bw,avx512vl
Convert packed unsigned 16-bit integers in a to packed unsigned 8-bit integers with unsigned saturation, and store the active results (those with their respective bit set in writemask k) to unaligned memory at base_addr.
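For the masked store variants, "active results" can be read as follows: each converted element is written at its own position, and memory at inactive positions is left untouched (the results are not compacted). A scalar sketch of the unsigned-saturating case, where the function name and the use of a fixed-size array for the destination memory are assumptions:

```rust
/// Scalar model of a masked narrowing store with unsigned saturation.
/// `mem` plays the role of the 16 bytes starting at base_addr.
fn mask_cvtusepi16_storeu_epi8(mem: &mut [u8; 16], k: u16, a: &[u16; 16]) {
    for i in 0..16 {
        if ((k >> i) & 1) == 1 {
            // Saturate to the u8 range, then write byte i only.
            mem[i] = a[i].min(u8::MAX as u16) as u8;
        }
    }
}
```

With k = 0b101, only bytes 0 and 2 of the destination change.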
_mm256_mask_dbsad_epu8 ⚠ Experimental avx512bw,avx512vl
Compute the sum of absolute differences (SADs) of quadruplets of unsigned 8-bit integers in a compared to those in b, and store the 16-bit results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set). Four SADs are performed on four 8-bit quadruplets for each 64-bit lane. The first two SADs use the lower 8-bit quadruplet of the lane from a, and the last two SADs use the upper 8-bit quadruplet of the lane from a. Quadruplets from b are selected from within 128-bit lanes according to the control in imm8, and each SAD in each 64-bit lane uses the selected quadruplet at 8-bit offsets.
_mm256_mask_loadu_epi8 ⚠ Experimental avx512bw,avx512vl
Load packed 8-bit integers from memory into dst using writemask k (elements are copied from src when the corresponding mask bit is not set). mem_addr does not need to be aligned on any particular boundary.
_mm256_mask_loadu_epi16 ⚠ Experimental avx512bw,avx512vl
Load packed 16-bit integers from memory into dst using writemask k (elements are copied from src when the corresponding mask bit is not set). mem_addr does not need to be aligned on any particular boundary.
_mm256_mask_madd_epi16 ⚠ Experimental avx512bw,avx512vl
Multiply packed signed 16-bit integers in a and b, producing intermediate signed 32-bit integers. Horizontally add adjacent pairs of intermediate 32-bit integers, and pack the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
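A scalar sketch of the multiply-and-add-pairs operation may help. Note that the output has half as many elements as the inputs (eight 32-bit results from sixteen 16-bit inputs), so the writemask here is modeled as 8 bits. The pair sum is modeled as wrapping, which only matters when both products are `i16::MIN * i16::MIN`. The function name is hypothetical.

```rust
/// Scalar model of the masked multiply-add of adjacent 16-bit pairs.
fn mask_madd_epi16(src: &[i32; 8], k: u8, a: &[i16; 16], b: &[i16; 16]) -> [i32; 8] {
    let mut dst = *src; // inactive 32-bit lanes keep the value from src
    for i in 0..8 {
        if ((k >> i) & 1) == 1 {
            let lo = a[2 * i] as i32 * b[2 * i] as i32;
            let hi = a[2 * i + 1] as i32 * b[2 * i + 1] as i32;
            dst[i] = lo.wrapping_add(hi); // sum of the adjacent pair of products
        }
    }
    dst
}
```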
_mm256_mask_maddubs_epi16 ⚠ Experimental avx512bw,avx512vl
Multiply packed unsigned 8-bit integers in a by packed signed 8-bit integers in b, producing intermediate signed 16-bit integers. Horizontally add adjacent pairs of intermediate signed 16-bit integers, and pack the saturated results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
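The mixed-signedness multiply can be sketched for a single output element. The helper below is hypothetical and models one adjacent pair only, leaving out the writemask step: the bytes from a are treated as unsigned, the bytes from b as signed, and the pair sum saturates to the i16 range.

```rust
/// Scalar model of one output lane: unsigned-by-signed byte products, summed with saturation.
fn maddubs_pair(a: [u8; 2], b: [i8; 2]) -> i16 {
    // Each product fits in i16: the extremes are 255 * 127 and 255 * -128.
    let p0 = a[0] as i16 * b[0] as i16;
    let p1 = a[1] as i16 * b[1] as i16;
    p0.saturating_add(p1)
}
```

Two products of 255 * 127 would sum to 64770, so the result saturates to 32767.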
_mm256_mask_max_epi8 ⚠ Experimental avx512bw,avx512vl
Compare packed signed 8-bit integers in a and b, and store packed maximum values in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
_mm256_mask_max_epi16 ⚠ Experimental avx512bw,avx512vl
Compare packed signed 16-bit integers in a and b, and store packed maximum values in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
_mm256_mask_max_epu8 ⚠ Experimental avx512bw,avx512vl
Compare packed unsigned 8-bit integers in a and b, and store packed maximum values in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
_mm256_mask_max_epu16 ⚠ Experimental avx512bw,avx512vl
Compare packed unsigned 16-bit integers in a and b, and store packed maximum values in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
_mm256_mask_min_epi8 ⚠ Experimental avx512bw,avx512vl
Compare packed signed 8-bit integers in a and b, and store packed minimum values in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
_mm256_mask_min_epi16 ⚠ Experimental avx512bw,avx512vl
Compare packed signed 16-bit integers in a and b, and store packed minimum values in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
_mm256_mask_min_epu8 ⚠ Experimental avx512bw,avx512vl
Compare packed unsigned 8-bit integers in a and b, and store packed minimum values in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
_mm256_mask_min_epu16 ⚠ Experimental avx512bw,avx512vl
Compare packed unsigned 16-bit integers in a and b, and store packed minimum values in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
_mm256_mask_mov_epi8 ⚠ Experimental avx512bw,avx512vl
Move packed 8-bit integers from a into dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
_mm256_mask_mov_epi16 ⚠ Experimental avx512bw,avx512vl
Move packed 16-bit integers from a into dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
_mm256_mask_mulhi_epi16 ⚠ Experimental avx512bw,avx512vl
Multiply the packed signed 16-bit integers in a and b, producing intermediate 32-bit integers, and store the high 16 bits of the intermediate integers in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
_mm256_mask_mulhi_epu16 ⚠ Experimental avx512bw,avx512vl
Multiply the packed unsigned 16-bit integers in a and b, producing intermediate 32-bit integers, and store the high 16 bits of the intermediate integers in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
_mm256_mask_mulhrs_epi16 ⚠ Experimental avx512bw,avx512vl
Multiply packed signed 16-bit integers in a and b, producing intermediate signed 32-bit integers. Truncate each intermediate integer to the 18 most significant bits, round by adding 1, and store bits [16:1] to dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
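The "18 most significant bits, add 1, take bits [16:1]" recipe is equivalent to a rounded Q15 fixed-point multiply. A one-lane scalar sketch, with a hypothetical function name and the writemask step left out:

```rust
/// Scalar model of the rounded high multiply on one pair of 16-bit lanes.
fn mulhrs_epi16(a: i16, b: i16) -> i16 {
    let prod = a as i32 * b as i32; // full 32-bit product
    let top18 = prod >> 14;         // keep the 18 most significant bits
    ((top18 + 1) >> 1) as i16       // round by adding 1, then take bits [16:1]
}
```

Interpreting 0x4000 as 0.5 in Q15, the product of 0.5 and 0.5 is 0x2000, that is 0.25. The one input pair whose rounded result does not fit is `i16::MIN * i16::MIN`, which wraps back to `i16::MIN` in this model.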
_mm256_mask_mullo_epi16 ⚠ Experimental avx512bw,avx512vl
Multiply the packed 16-bit integers in a and b, producing intermediate 32-bit integers, and store the low 16 bits of the intermediate integers in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
_mm256_mask_packs_epi16 ⚠ Experimental avx512bw,avx512vl
Convert packed signed 16-bit integers from a and b to packed 8-bit integers using signed saturation, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
_mm256_mask_packs_epi32 ⚠ Experimental avx512bw,avx512vl
Convert packed signed 32-bit integers from a and b to packed 16-bit integers using signed saturation, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
_mm256_mask_packus_epi16 ⚠ Experimental avx512bw,avx512vl
Convert packed signed 16-bit integers from a and b to packed 8-bit integers using unsigned saturation, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
_mm256_mask_packus_epi32 ⚠ Experimental avx512bw,avx512vl
Convert packed signed 32-bit integers from a and b to packed 16-bit integers using unsigned saturation, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
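The pack descriptions do not state the output ordering. The 256-bit pack operations work per 128-bit lane, so the results from a and b are interleaved lane by lane rather than placed one after the other. A scalar sketch of the unmasked signed 16-to-8 case, with a hypothetical function name:

```rust
/// Scalar model of the signed-saturating 16-bit to 8-bit pack for two 128-bit lanes.
fn packs_epi16(a: &[i16; 16], b: &[i16; 16]) -> [i8; 32] {
    let sat = |x: i16| x.clamp(-128, 127) as i8;
    let mut dst = [0i8; 32];
    for lane in 0..2 {
        for j in 0..8 {
            // Low 8 bytes of each output lane come from a, high 8 bytes from b.
            dst[lane * 16 + j] = sat(a[lane * 8 + j]);
            dst[lane * 16 + 8 + j] = sat(b[lane * 8 + j]);
        }
    }
    dst
}
```

Output byte 16 therefore holds the saturated value of `a[8]`, not `b[0]`.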
_mm256_mask_permutex2var_epi16 ⚠ Experimental avx512bw,avx512vl
Shuffle 16-bit integers in a and b across lanes using the corresponding selector and index in idx, and store the results in dst using writemask k (elements are copied from a when the corresponding mask bit is not set).
_mm256_mask_permutexvar_epi16 ⚠ Experimental avx512bw,avx512vl
Shuffle 16-bit integers in a across lanes using the corresponding index in idx, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
_mm256_mask_reduce_add_epi8 ⚠ Experimental avx512bw,avx512vl
Reduce the packed 8-bit integers in a by addition using mask k. Returns the sum of all active elements in a.
_mm256_mask_reduce_add_epi16 ⚠ Experimental avx512bw,avx512vl
Reduce the packed 16-bit integers in a by addition using mask k. Returns the sum of all active elements in a.
_mm256_mask_reduce_and_epi8 ⚠ Experimental avx512bw,avx512vl
Reduce the packed 8-bit integers in a by bitwise AND using mask k. Returns the bitwise AND of all active elements in a.
_mm256_mask_reduce_and_epi16 ⚠ Experimental avx512bw,avx512vl
Reduce the packed 16-bit integers in a by bitwise AND using mask k. Returns the bitwise AND of all active elements in a.
_mm256_mask_reduce_max_epi8 ⚠ Experimental avx512bw,avx512vl
Reduce the packed signed 8-bit integers in a by maximum using mask k. Returns the maximum of all active elements in a.
_mm256_mask_reduce_max_epi16 ⚠ Experimental avx512bw,avx512vl
Reduce the packed signed 16-bit integers in a by maximum using mask k. Returns the maximum of all active elements in a.
_mm256_mask_reduce_max_epu8 ⚠ Experimental avx512bw,avx512vl
Reduce the packed unsigned 8-bit integers in a by maximum using mask k. Returns the maximum of all active elements in a.
_mm256_mask_reduce_max_epu16 ⚠ Experimental avx512bw,avx512vl
Reduce the packed unsigned 16-bit integers in a by maximum using mask k. Returns the maximum of all active elements in a.
_mm256_mask_reduce_min_epi8 ⚠ Experimental avx512bw,avx512vl
Reduce the packed signed 8-bit integers in a by minimum using mask k. Returns the minimum of all active elements in a.
_mm256_mask_reduce_min_epi16 ⚠ Experimental avx512bw,avx512vl
Reduce the packed signed 16-bit integers in a by minimum using mask k. Returns the minimum of all active elements in a.
_mm256_mask_reduce_min_epu8 ⚠ Experimental avx512bw,avx512vl
Reduce the packed unsigned 8-bit integers in a by minimum using mask k. Returns the minimum of all active elements in a.
_mm256_mask_reduce_min_epu16 ⚠ Experimental avx512bw,avx512vl
Reduce the packed unsigned 16-bit integers in a by minimum using mask k. Returns the minimum of all active elements in a.
_mm256_mask_reduce_mul_epi8 ⚠ Experimental avx512bw,avx512vl
Reduce the packed 8-bit integers in a by multiplication using mask k. Returns the product of all active elements in a.
_mm256_mask_reduce_mul_epi16 ⚠ Experimental avx512bw,avx512vl
Reduce the packed 16-bit integers in a by multiplication using mask k. Returns the product of all active elements in a.
_mm256_mask_reduce_or_epi8 ⚠ Experimental avx512bw,avx512vl
Reduce the packed 8-bit integers in a by bitwise OR using mask k. Returns the bitwise OR of all active elements in a.
_mm256_mask_reduce_or_epi16 ⚠ Experimental avx512bw,avx512vl
Reduce the packed 16-bit integers in a by bitwise OR using mask k. Returns the bitwise OR of all active elements in a.
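One way to picture the masked reductions listed above is that inactive lanes are skipped, which is equivalent to replacing them with the identity element of the operation. The sketch below models this for addition and signed minimum on 16-bit lanes. The function names are hypothetical, and the identity values (0 for addition, `i16::MAX` for minimum) and the wrapping sum are assumptions of this model rather than statements taken from the descriptions above.

```rust
/// Scalar model of a masked add reduction over 16 lanes.
/// Assumption: inactive lanes contribute 0 and the sum wraps on overflow.
fn mask_reduce_add_model(k: u16, a: [i16; 16]) -> i16 {
    (0..16usize)
        .filter(|&i| (k >> i) & 1 == 1)
        .fold(0i16, |acc, i| acc.wrapping_add(a[i]))
}

/// Scalar model of a masked signed-minimum reduction over 16 lanes.
/// Assumption: inactive lanes behave like i16::MAX.
fn mask_reduce_min_model(k: u16, a: [i16; 16]) -> i16 {
    (0..16usize)
        .filter(|&i| (k >> i) & 1 == 1)
        .fold(i16::MAX, |acc, i| acc.min(a[i]))
}
```

Under these assumptions an all-zero mask returns the identity itself, so callers that need to tell "no active lanes" apart from a real result have to check the mask separately.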
_mm256_mask_set1_epi8 ⚠ Experimental avx512bw,avx512vl
Broadcast 8-bit integer a to all elements of dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
_mm256_mask_set1_epi16 ⚠ Experimental avx512bw,avx512vl
Broadcast 16-bit integer a to all elements of dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
_mm256_mask_shuffle_epi8 ⚠ Experimental avx512bw,avx512vl
Shuffle 8-bit integers in a within 128-bit lanes using the control in the corresponding 8-bit element of b, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
_mm256_mask_shufflehi_epi16 ⚠ Experimental avx512bw,avx512vl
Shuffle 16-bit integers in the high 64 bits of 128-bit lanes of a using the control in imm8. Store the results in the high 64 bits of 128-bit lanes of dst, with the low 64 bits of 128-bit lanes being copied from a to dst, using writemask k (elements are copied from src when the corresponding mask bit is not set).
_mm256_mask_shufflelo_epi16 ⚠ Experimental avx512bw,avx512vl
Shuffle 16-bit integers in the low 64 bits of 128-bit lanes of a using the control in imm8. Store the results in the low 64 bits of 128-bit lanes of dst, with the high 64 bits of 128-bit lanes being copied from a to dst, using writemask k (elements are copied from src when the corresponding mask bit is not set).
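The imm8 control for the low-half shuffle packs four 2-bit selectors, one per destination word. The sketch below models a single 128-bit lane (eight 16-bit words) before any mask is applied; `shufflelo_lane` is a hypothetical helper name, and the array stands in for one lane of the vector.

```rust
/// Scalar model of the low-half word shuffle on one 128-bit lane.
/// (Hypothetical helper; the mask blend is applied afterwards and is not modelled.)
fn shufflelo_lane(a: [i16; 8], imm8: u8) -> [i16; 8] {
    let mut dst = a; // the high four words pass through unchanged
    for j in 0..4usize {
        // Bits [2j+1 : 2j] of imm8 pick which low word feeds destination word j.
        let sel = ((imm8 >> (2 * j)) & 0b11) as usize;
        dst[j] = a[sel];
    }
    dst
}
```

With this model, `imm8 = 0x1B` reverses the low four words and `imm8 = 0xE4` leaves the lane unchanged. The high-half variant would be the mirror image, selecting among words 4 to 7.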
_mm256_mask_sll_epi16 ⚠ Experimental avx512bw,avx512vl
Shift packed 16-bit integers in a left by count while shifting in zeros, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
_mm256_mask_slli_epi16 ⚠ Experimental avx512bw,avx512vl
Shift packed 16-bit integers in a left by imm8 while shifting in zeros, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
_mm256_mask_sllv_epi16 ⚠ Experimental avx512bw,avx512vl
Shift packed 16-bit integers in a left by the amount specified by the corresponding element in count while shifting in zeros, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
_mm256_mask_sra_epi16 ⚠ Experimental avx512bw,avx512vl
Shift packed 16-bit integers in a right by count while shifting in sign bits, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
_mm256_mask_srai_epi16 ⚠ Experimental avx512bw,avx512vl
Shift packed 16-bit integers in a right by imm8 while shifting in sign bits, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
_mm256_mask_srav_epi16 ⚠ Experimental avx512bw,avx512vl
Shift packed 16-bit integers in a right by the amount specified by the corresponding element in count while shifting in sign bits, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
_mm256_mask_srl_epi16 ⚠ Experimental avx512bw,avx512vl
Shift packed 16-bit integers in a right by count while shifting in zeros, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
_mm256_mask_srli_epi16 ⚠ Experimental avx512bw,avx512vl
Shift packed 16-bit integers in a right by imm8 while shifting in zeros, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
_mm256_mask_srlv_epi16 ⚠ Experimental avx512bw,avx512vl
Shift packed 16-bit integers in a right by the amount specified by the corresponding element in count while shifting in zeros, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
_mm256_mask_storeu_epi8 ⚠ Experimental avx512bw,avx512vl
Store packed 8-bit integers from a into memory using writemask k. mem_addr does not need to be aligned on any particular boundary.
_mm256_mask_storeu_epi16 ⚠ Experimental avx512bw,avx512vl
Store packed 16-bit integers from a into memory using writemask k. mem_addr does not need to be aligned on any particular boundary.
_mm256_mask_sub_epi8 ⚠ Experimental avx512bw,avx512vl
Subtract packed 8-bit integers in b from packed 8-bit integers in a, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
_mm256_mask_sub_epi16 ⚠ Experimental avx512bw,avx512vl
Subtract packed 16-bit integers in b from packed 16-bit integers in a, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
_mm256_mask_subs_epi8 ⚠ Experimental avx512bw,avx512vl
Subtract packed signed 8-bit integers in b from packed 8-bit integers in a using saturation, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
_mm256_mask_subs_epi16 ⚠ Experimental avx512bw,avx512vl
Subtract packed signed 16-bit integers in b from packed 16-bit integers in a using saturation, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
_mm256_mask_subs_epu8 ⚠ Experimental avx512bw,avx512vl
Subtract packed unsigned 8-bit integers in b from packed unsigned 8-bit integers in a using saturation, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
_mm256_mask_subs_epu16 ⚠ Experimental avx512bw,avx512vl
Subtract packed unsigned 16-bit integers in b from packed unsigned 16-bit integers in a using saturation, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
_mm256_mask_test_epi8_mask ⚠ Experimental avx512bw,avx512vl
Compute the bitwise AND of packed 8-bit integers in a and b, producing intermediate 8-bit values, and set the corresponding bit in result mask k (subject to writemask k) if the intermediate value is non-zero.
_mm256_mask_test_epi16_mask ⚠ Experimental avx512bw,avx512vl
Compute the bitwise AND of packed 16-bit integers in a and b, producing intermediate 16-bit values, and set the corresponding bit in result mask k (subject to writemask k) if the intermediate value is non-zero.
_mm256_mask_testn_epi8_mask ⚠ Experimental avx512bw,avx512vl
Compute the bitwise AND of packed 8-bit integers in a and b, producing intermediate 8-bit values, and set the corresponding bit in result mask k (subject to writemask k) if the intermediate value is zero.
_mm256_mask_testn_epi16_mask ⚠ Experimental avx512bw,avx512vl
Compute the bitwise AND of packed 16-bit integers in a and b, producing intermediate 16-bit values, and set the corresponding bit in result mask k (subject to writemask k) if the intermediate value is zero.
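The test and testn variants above are complementary. A scalar sketch follows, assuming the usual definition in which the lane-wise AND of a and b is compared against zero; the function names are hypothetical, and a `u16` stands in for the 16-lane mask.

```rust
/// Scalar model: set bit i when lane i is active and (a[i] & b[i]) is non-zero.
fn mask_test_model(k: u16, a: [u16; 16], b: [u16; 16]) -> u16 {
    let mut out = 0u16;
    for i in 0..16 {
        if (k >> i) & 1 == 1 && (a[i] & b[i]) != 0 {
            out |= 1 << i;
        }
    }
    out
}

/// Scalar model: set bit i when lane i is active and (a[i] & b[i]) is zero.
fn mask_testn_model(k: u16, a: [u16; 16], b: [u16; 16]) -> u16 {
    let mut out = 0u16;
    for i in 0..16 {
        if (k >> i) & 1 == 1 && (a[i] & b[i]) == 0 {
            out |= 1 << i;
        }
    }
    out
}
```

In this model the two results never overlap, and their union is exactly the writemask k.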
_mm256_mask_unpackhi_epi8 ⚠ Experimental avx512bw,avx512vl
Unpack and interleave 8-bit integers from the high half of each 128-bit lane in a and b, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
_mm256_mask_unpackhi_epi16 ⚠ Experimental avx512bw,avx512vl
Unpack and interleave 16-bit integers from the high half of each 128-bit lane in a and b, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
_mm256_mask_unpacklo_epi8 ⚠ Experimental avx512bw,avx512vl
Unpack and interleave 8-bit integers from the low half of each 128-bit lane in a and b, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
_mm256_mask_unpacklo_epi16 ⚠ Experimental avx512bw,avx512vl
Unpack and interleave 16-bit integers from the low half of each 128-bit lane in a and b, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
_mm256_maskz_abs_epi8 ⚠ Experimental avx512bw,avx512vl
Compute the absolute value of packed signed 8-bit integers in a, and store the unsigned results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm256_maskz_abs_epi16 ⚠ Experimental avx512bw,avx512vl
Compute the absolute value of packed signed 16-bit integers in a, and store the unsigned results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm256_maskz_add_epi8 ⚠ Experimental avx512bw,avx512vl
Add packed 8-bit integers in a and b, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm256_maskz_add_epi16 ⚠ Experimental avx512bw,avx512vl
Add packed 16-bit integers in a and b, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm256_maskz_adds_epi8 ⚠ Experimental avx512bw,avx512vl
Add packed signed 8-bit integers in a and b using saturation, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm256_maskz_adds_epi16 ⚠ Experimental avx512bw,avx512vl
Add packed signed 16-bit integers in a and b using saturation, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm256_maskz_adds_epu8 ⚠ Experimental avx512bw,avx512vl
Add packed unsigned 8-bit integers in a and b using saturation, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm256_maskz_adds_epu16 ⚠ Experimental avx512bw,avx512vl
Add packed unsigned 16-bit integers in a and b using saturation, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
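The saturating adds above differ only in where results clamp, and the zeromask decides which lanes are written at all. A scalar sketch for the 8-bit variants follows, with hypothetical function names, plain arrays in place of `__m256i`, and a `u32` in place of the 32-lane mask.

```rust
/// Scalar model of a zeromasked signed saturating add over 32 byte lanes.
fn maskz_adds_epi8_model(k: u32, a: [i8; 32], b: [i8; 32]) -> [i8; 32] {
    let mut dst = [0i8; 32]; // lanes with a clear mask bit stay zero
    for i in 0..32 {
        if (k >> i) & 1 == 1 {
            dst[i] = a[i].saturating_add(b[i]); // clamps to [-128, 127]
        }
    }
    dst
}

/// Scalar model of a zeromasked unsigned saturating add over 32 byte lanes.
fn maskz_adds_epu8_model(k: u32, a: [u8; 32], b: [u8; 32]) -> [u8; 32] {
    let mut dst = [0u8; 32];
    for i in 0..32 {
        if (k >> i) & 1 == 1 {
            dst[i] = a[i].saturating_add(b[i]); // clamps to [0, 255]
        }
    }
    dst
}
```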
_mm256_maskz_alignr_epi8 ⚠ Experimental avx512bw,avx512vl
Concatenate pairs of 16-byte blocks in a and b into a 32-byte temporary result, shift the result right by imm8 bytes, and store the low 16 bytes in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
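The per-lane concatenate-and-shift described above can be sketched on plain byte arrays. This is a scalar model under stated assumptions, not the intrinsic: `maskz_alignr_model` is a hypothetical name, imm8 is passed as a runtime `usize` rather than a const generic, and the block from b is taken to occupy the low half of the temporary.

```rust
/// Scalar model of the zeromasked byte align over two 128-bit lanes.
fn maskz_alignr_model(k: u32, a: [u8; 32], b: [u8; 32], imm8: usize) -> [u8; 32] {
    let mut dst = [0u8; 32];
    for lane in 0..2 {
        let base = lane * 16;
        // 32-byte temporary for this lane: b in the low half, a in the high half.
        let mut tmp = [0u8; 32];
        tmp[..16].copy_from_slice(&b[base..base + 16]);
        tmp[16..].copy_from_slice(&a[base..base + 16]);
        for j in 0..16 {
            // Shifting right by imm8 bytes reads from position j + imm8;
            // positions past the temporary shift in zeros.
            let pos = j + imm8;
            let byte = if pos < 32 { tmp[pos] } else { 0 };
            if (k >> (base + j)) & 1 == 1 {
                dst[base + j] = byte;
            }
        }
    }
    dst
}
```

Each lane is processed independently, so bytes never cross the 128-bit boundary.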
_mm256_maskz_avg_epu8 ⚠ Experimental avx512bw,avx512vl
Average packed unsigned 8-bit integers in a and b, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm256_maskz_avg_epu16 ⚠ Experimental avx512bw,avx512vl
Average packed unsigned 16-bit integers in a and b, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm256_maskz_broadcastb_epi8 ⚠ Experimental avx512bw,avx512vl
Broadcast the low packed 8-bit integer from a to all elements of dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm256_maskz_broadcastw_epi16 ⚠ Experimental avx512bw,avx512vl
Broadcast the low packed 16-bit integer from a to all elements of dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm256_maskz_cvtepi8_epi16 ⚠ Experimental avx512bw,avx512vl
Sign extend packed 8-bit integers in a to packed 16-bit integers, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm256_maskz_cvtepi16_epi8 ⚠ Experimental avx512bw,avx512vl
Convert packed 16-bit integers in a to packed 8-bit integers with truncation, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm256_maskz_cvtepu8_epi16 ⚠ Experimental avx512bw,avx512vl
Zero extend packed unsigned 8-bit integers in a to packed 16-bit integers, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm256_maskz_cvtsepi16_epi8 ⚠ Experimental avx512bw,avx512vl
Convert packed signed 16-bit integers in a to packed 8-bit integers with signed saturation, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm256_maskz_cvtusepi16_epi8 ⚠ Experimental avx512bw,avx512vl
Convert packed unsigned 16-bit integers in a to packed unsigned 8-bit integers with unsigned saturation, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm256_maskz_dbsad_epu8 ⚠ Experimental avx512bw,avx512vl
Compute the sum of absolute differences (SADs) of quadruplets of unsigned 8-bit integers in a compared to those in b, and store the 16-bit results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set). Four SADs are performed on four 8-bit quadruplets for each 64-bit lane. The first two SADs use the lower 8-bit quadruplet of the lane from a, and the last two SADs use the upper 8-bit quadruplet of the lane from a. Quadruplets from b are selected from within 128-bit lanes according to the control in imm8, and each SAD in each 64-bit lane uses the selected quadruplet at 8-bit offsets.
_mm256_maskz_loadu_epi8 ⚠ Experimental avx512bw,avx512vl
Load packed 8-bit integers from memory into dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set). mem_addr does not need to be aligned on any particular boundary.
_mm256_maskz_loadu_epi16 ⚠ Experimental avx512bw,avx512vl
Load packed 16-bit integers from memory into dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set). mem_addr does not need to be aligned on any particular boundary.
_mm256_maskz_madd_epi16 ⚠ Experimental avx512bw,avx512vl
Multiply packed signed 16-bit integers in a and b, producing intermediate signed 32-bit integers. Horizontally add adjacent pairs of intermediate 32-bit integers, and pack the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm256_maskz_maddubs_epi16 ⚠ Experimental avx512bw,avx512vl
Multiply packed unsigned 8-bit integers in a by packed signed 8-bit integers in b, producing intermediate signed 16-bit integers. Horizontally add adjacent pairs of intermediate signed 16-bit integers, and pack the saturated results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
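The mixed-sign multiply-add above produces one saturated 16-bit result per pair of adjacent bytes. A scalar sketch of a single output lane follows; `maddubs_pair` is a hypothetical helper name, and the zeromask step is not modelled.

```rust
/// Scalar model of one 16-bit output lane: two unsigned bytes from `a`
/// times two signed bytes from `b`, summed with signed saturation.
fn maddubs_pair(a0: u8, b0: i8, a1: u8, b1: i8) -> i16 {
    // Each product fits in 16 bits, but their sum may not, so widen first.
    let sum = (a0 as i32) * (b0 as i32) + (a1 as i32) * (b1 as i32);
    sum.clamp(i16::MIN as i32, i16::MAX as i32) as i16
}
```

The operand order matters: the bytes of a are treated as unsigned and those of b as signed, so swapping the arguments generally changes the result.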
_mm256_maskz_max_epi8 ⚠ Experimental avx512bw,avx512vl
Compare packed signed 8-bit integers in a and b, and store packed maximum values in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm256_maskz_max_epi16 ⚠ Experimental avx512bw,avx512vl
Compare packed signed 16-bit integers in a and b, and store packed maximum values in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm256_maskz_max_epu8 ⚠ Experimental avx512bw,avx512vl
Compare packed unsigned 8-bit integers in a and b, and store packed maximum values in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm256_maskz_max_epu16 ⚠ Experimental avx512bw,avx512vl
Compare packed unsigned 16-bit integers in a and b, and store packed maximum values in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm256_maskz_min_epi8 ⚠ Experimental avx512bw,avx512vl
Compare packed signed 8-bit integers in a and b, and store packed minimum values in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm256_maskz_min_epi16 ⚠ Experimental avx512bw,avx512vl
Compare packed signed 16-bit integers in a and b, and store packed minimum values in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm256_maskz_min_epu8 ⚠ Experimental avx512bw,avx512vl
Compare packed unsigned 8-bit integers in a and b, and store packed minimum values in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm256_maskz_min_epu16 ⚠ Experimental avx512bw,avx512vl
Compare packed unsigned 16-bit integers in a and b, and store packed minimum values in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm256_maskz_mov_epi8 ⚠ Experimental avx512bw,avx512vl
Move packed 8-bit integers from a into dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm256_maskz_mov_epi16 ⚠ Experimental avx512bw,avx512vl
Move packed 16-bit integers from a into dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm256_maskz_mulhi_epi16 ⚠ Experimental avx512bw,avx512vl
Multiply the packed signed 16-bit integers in a and b, producing intermediate 32-bit integers, and store the high 16 bits of the intermediate integers in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm256_maskz_mulhi_epu16 ⚠ Experimental avx512bw,avx512vl
Multiply the packed unsigned 16-bit integers in a and b, producing intermediate 32-bit integers, and store the high 16 bits of the intermediate integers in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm256_maskz_mulhrs_epi16 ⚠ Experimental avx512bw,avx512vl
Multiply packed signed 16-bit integers in a and b, producing intermediate signed 32-bit integers. Truncate each intermediate integer to the 18 most significant bits, round by adding 1, and store bits [16:1] to dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm256_maskz_mullo_epi16 ⚠ Experimental avx512bw,avx512vl
Multiply the packed 16-bit integers in a and b, producing intermediate 32-bit integers, and store the low 16 bits of the intermediate integers in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm256_maskz_packs_epi16 ⚠ Experimental avx512bw,avx512vl
Convert packed signed 16-bit integers from a and b to packed 8-bit integers using signed saturation, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm256_maskz_packs_epi32 ⚠ Experimental avx512bw,avx512vl
Convert packed signed 32-bit integers from a and b to packed 16-bit integers using signed saturation, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm256_maskz_packus_epi16 ⚠ Experimental avx512bw,avx512vl
Convert packed signed 16-bit integers from a and b to packed 8-bit integers using unsigned saturation, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm256_maskz_packus_epi32 ⚠ Experimental avx512bw,avx512vl
Convert packed signed 32-bit integers from a and b to packed 16-bit integers using unsigned saturation, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm256_maskz_permutex2var_epi16 ⚠ Experimental avx512bw,avx512vl
Shuffle 16-bit integers in a and b across lanes using the corresponding selector and index in idx, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
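The "selector and index" wording above can be made concrete with a scalar sketch. For the 256-bit form with sixteen 16-bit lanes, this model assumes the low four bits of each idx element are the source position and bit 4 selects between a and b, with higher bits ignored. `maskz_permutex2var_model` is a hypothetical name, and arrays stand in for the vector types.

```rust
/// Scalar model of the zeromasked two-source word permute over 16 lanes.
fn maskz_permutex2var_model(
    k: u16,
    a: [i16; 16],
    idx: [u16; 16],
    b: [i16; 16],
) -> [i16; 16] {
    let mut dst = [0i16; 16]; // lanes with a clear mask bit stay zero
    for i in 0..16 {
        if (k >> i) & 1 == 1 {
            let pos = (idx[i] & 0xF) as usize; // index: bits [3:0]
            let from_b = (idx[i] & 0x10) != 0; // selector: bit 4
            dst[i] = if from_b { b[pos] } else { a[pos] };
        }
    }
    dst
}
```

Because any lane may read from any position in either source, this behaves like a 32-entry table lookup.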
_mm256_maskz_permutexvar_epi16 ⚠ Experimental avx512bw,avx512vl
Shuffle 16-bit integers in a across lanes using the corresponding index in idx, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm256_maskz_set1_epi8 ⚠ Experimental avx512bw,avx512vl
Broadcast 8-bit integer a to all elements of dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm256_maskz_set1_epi16 ⚠ Experimental avx512bw,avx512vl
Broadcast 16-bit integer a to all elements of dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm256_maskz_shuffle_epi8 ⚠ Experimental avx512bw,avx512vl
Shuffle 8-bit integers in a within 128-bit lanes using the control in the corresponding 8-bit element of b, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm256_maskz_shufflehi_epi16 ⚠ Experimental avx512bw,avx512vl
Shuffle 16-bit integers in the high 64 bits of 128-bit lanes of a using the control in imm8. Store the results in the high 64 bits of 128-bit lanes of dst, with the low 64 bits of 128-bit lanes being copied from a to dst, using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm256_maskz_shufflelo_epi16 ⚠ Experimental avx512bw,avx512vl
Shuffle 16-bit integers in the low 64 bits of 128-bit lanes of a using the control in imm8. Store the results in the low 64 bits of 128-bit lanes of dst, with the high 64 bits of 128-bit lanes being copied from a to dst, using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm256_maskz_sll_epi16 ⚠ Experimental avx512bw,avx512vl
Shift packed 16-bit integers in a left by count while shifting in zeros, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm256_maskz_slli_epi16 ⚠ Experimental avx512bw,avx512vl
Shift packed 16-bit integers in a left by imm8 while shifting in zeros, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm256_maskz_sllv_epi16 ⚠ Experimental avx512bw,avx512vl
Shift packed 16-bit integers in a left by the amount specified by the corresponding element in count while shifting in zeros, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm256_maskz_sra_epi16 ⚠ Experimental avx512bw,avx512vl
Shift packed 16-bit integers in a right by count while shifting in sign bits, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm256_maskz_srai_epi16 ⚠ Experimental avx512bw,avx512vl
Shift packed 16-bit integers in a right by imm8 while shifting in sign bits, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm256_maskz_srav_epi16 ⚠ Experimental avx512bw,avx512vl
Shift packed 16-bit integers in a right by the amount specified by the corresponding element in count while shifting in sign bits, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm256_maskz_srl_epi16 ⚠ Experimental avx512bw,avx512vl
Shift packed 16-bit integers in a right by count while shifting in zeros, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm256_maskz_srli_epi16 ⚠ Experimental avx512bw,avx512vl
Shift packed 16-bit integers in a right by imm8 while shifting in zeros, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm256_maskz_srlv_epi16 ⚠ Experimental avx512bw,avx512vl
Shift packed 16-bit integers in a right by the amount specified by the corresponding element in count while shifting in zeros, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm256_maskz_sub_epi8 ⚠ Experimental avx512bw,avx512vl
Subtract packed 8-bit integers in b from packed 8-bit integers in a, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm256_maskz_sub_epi16 ⚠ Experimental avx512bw,avx512vl
Subtract packed 16-bit integers in b from packed 16-bit integers in a, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm256_maskz_subs_epi8 ⚠ Experimental avx512bw,avx512vl
Subtract packed signed 8-bit integers in b from packed 8-bit integers in a using saturation, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm256_maskz_subs_epi16 ⚠ Experimental avx512bw,avx512vl
Subtract packed signed 16-bit integers in b from packed 16-bit integers in a using saturation, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm256_maskz_subs_epu8 ⚠ Experimental avx512bw,avx512vl
Subtract packed unsigned 8-bit integers in b from packed unsigned 8-bit integers in a using saturation, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm256_maskz_subs_epu16 ⚠ Experimental avx512bw,avx512vl
Subtract packed unsigned 16-bit integers in b from packed unsigned 16-bit integers in a using saturation, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm256_maskz_unpackhi_epi8 ⚠ Experimental avx512bw,avx512vl
Unpack and interleave 8-bit integers from the high half of each 128-bit lane in a and b, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm256_maskz_unpackhi_epi16 ⚠ Experimental avx512bw,avx512vl
Unpack and interleave 16-bit integers from the high half of each 128-bit lane in a and b, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm256_maskz_unpacklo_epi8 ⚠ Experimental avx512bw,avx512vl
Unpack and interleave 8-bit integers from the low half of each 128-bit lane in a and b, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm256_maskz_unpacklo_epi16 ⚠ Experimental avx512bw,avx512vl
Unpack and interleave 16-bit integers from the low half of each 128-bit lane in a and b, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm256_movepi8_mask ⚠ Experimental avx512bw,avx512vl
Set each bit of mask register k based on the most significant bit of the corresponding packed 8-bit integer in a.
_mm256_movepi16_mask ⚠ Experimental avx512bw,avx512vl
Set each bit of mask register k based on the most significant bit of the corresponding packed 16-bit integer in a.
_mm256_movm_epi8 ⚠ Experimental avx512bw,avx512vl
Set each packed 8-bit integer in dst to all ones or all zeros based on the value of the corresponding bit in k.
_mm256_movm_epi16 ⚠ Experimental avx512bw,avx512vl
Set each packed 16-bit integer in dst to all ones or all zeros based on the value of the corresponding bit in k.
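The movepi and movm conversions above are inverses in one direction: expanding a mask to a vector and then collecting the sign bits gives the mask back. A scalar sketch for the 16-bit forms follows, with hypothetical function names, an array in place of `__m256i`, and a `u16` in place of the 16-lane mask.

```rust
/// Scalar model: bit i of the result is the sign (most significant) bit of lane i.
fn movepi16_mask_model(a: [i16; 16]) -> u16 {
    let mut k = 0u16;
    for i in 0..16 {
        if a[i] < 0 {
            k |= 1 << i;
        }
    }
    k
}

/// Scalar model: lane i becomes all ones (-1) when bit i of k is set, else 0.
fn movm_epi16_model(k: u16) -> [i16; 16] {
    let mut dst = [0i16; 16];
    for i in 0..16 {
        if (k >> i) & 1 == 1 {
            dst[i] = -1;
        }
    }
    dst
}
```

The other direction is lossy: going from vector to mask and back keeps only the sign of each lane.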
_mm256_permutex2var_epi16 ⚠ Experimental avx512bw,avx512vl
Shuffle 16-bit integers in a and b across lanes using the corresponding selector and index in idx, and store the results in dst.
_mm256_permutexvar_epi16 ⚠ Experimental avx512bw,avx512vl
Shuffle 16-bit integers in a across lanes using the corresponding index in idx, and store the results in dst.
_mm256_reduce_add_epi8 ⚠ Experimental avx512bw,avx512vl
Reduce the packed 8-bit integers in a by addition. Returns the sum of all elements in a.
_mm256_reduce_add_epi16 ⚠ Experimental avx512bw,avx512vl
Reduce the packed 16-bit integers in a by addition. Returns the sum of all elements in a.
_mm256_reduce_and_epi8 ⚠ Experimental avx512bw,avx512vl
Reduce the packed 8-bit integers in a by bitwise AND. Returns the bitwise AND of all elements in a.
_mm256_reduce_and_epi16 ⚠ Experimental avx512bw,avx512vl
Reduce the packed 16-bit integers in a by bitwise AND. Returns the bitwise AND of all elements in a.
_mm256_reduce_max_epi8 ⚠ Experimental avx512bw,avx512vl
Reduce the packed signed 8-bit integers in a by maximum. Returns the maximum of all elements in a.
_mm256_reduce_max_epi16 ⚠ Experimental avx512bw,avx512vl
Reduce the packed signed 16-bit integers in a by maximum. Returns the maximum of all elements in a.
_mm256_reduce_max_epu8 ⚠ Experimental avx512bw,avx512vl
Reduce the packed unsigned 8-bit integers in a by maximum. Returns the maximum of all elements in a.
_mm256_reduce_max_epu16 ⚠ Experimental avx512bw,avx512vl
Reduce the packed unsigned 16-bit integers in a by maximum. Returns the maximum of all elements in a.
_mm256_reduce_min_epi8 ⚠ Experimental avx512bw,avx512vl
Reduce the packed signed 8-bit integers in a by minimum. Returns the minimum of all elements in a.
_mm256_reduce_min_epi16 ⚠ Experimental avx512bw,avx512vl
Reduce the packed signed 16-bit integers in a by minimum. Returns the minimum of all elements in a.
_mm256_reduce_min_epu8 ⚠ Experimental avx512bw,avx512vl
Reduce the packed unsigned 8-bit integers in a by minimum. Returns the minimum of all elements in a.
_mm256_reduce_min_epu16 ⚠ Experimental avx512bw,avx512vl
Reduce the packed unsigned 16-bit integers in a by minimum. Returns the minimum of all elements in a.
_mm256_reduce_mul_epi8 ⚠ Experimental avx512bw,avx512vl
Reduce the packed 8-bit integers in a by multiplication. Returns the product of all elements in a.
_mm256_reduce_mul_epi16 ⚠ Experimental avx512bw,avx512vl
Reduce the packed 16-bit integers in a by multiplication. Returns the product of all elements in a.
_mm256_reduce_or_epi8 ⚠ Experimental avx512bw,avx512vl
Reduce the packed 8-bit integers in a by bitwise OR. Returns the bitwise OR of all elements in a.
_mm256_reduce_or_epi16 ⚠ Experimental avx512bw,avx512vl
Reduce the packed 16-bit integers in a by bitwise OR. Returns the bitwise OR of all elements in a.
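The `_mm256_reduce_*` entries above fold every lane of one vector into a single scalar. The following is a minimal scalar sketch of two of them, not the module's implementation. The helper names are hypothetical, and the wrapping behaviour of the sum is an assumption, since the listing does not say how overflow is handled.

```rust
/// Hypothetical scalar model of an add reduction over 32 signed 8-bit lanes.
/// Assumption: the running sum wraps on overflow (two's complement).
fn reduce_add_epi8_model(a: [i8; 32]) -> i8 {
    a.iter().fold(0i8, |acc, &x| acc.wrapping_add(x))
}

/// Hypothetical scalar model of a signed maximum reduction over 32 lanes.
fn reduce_max_epi8_model(a: [i8; 32]) -> i8 {
    a.iter().copied().fold(i8::MIN, i8::max)
}
```

The other reductions (AND, OR, min, mul) follow the same fold shape with a different operator and identity element.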
_mm256_sllv_epi16 ⚠ Experimental avx512bw,avx512vl
Shift packed 16-bit integers in a left by the amount specified by the corresponding element in count while shifting in zeros, and store the results in dst.
_mm256_srav_epi16 ⚠ Experimental avx512bw,avx512vl
Shift packed 16-bit integers in a right by the amount specified by the corresponding element in count while shifting in sign bits, and store the results in dst.
_mm256_srlv_epi16 ⚠ Experimental avx512bw,avx512vl
Shift packed 16-bit integers in a right by the amount specified by the corresponding element in count while shifting in zeros, and store the results in dst.
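The three variable-shift entries above take a separate count per lane. A per-lane sketch follows, with hypothetical helper names. The handling of counts above 15 (zero for the logical shifts, sign fill for the arithmetic shift) is an assumption about the underlying instructions and is not stated in the listing.

```rust
/// Hypothetical per-lane model of a variable logical left shift.
/// Assumption: a count above 15 clears the lane.
fn sllv_lane(a: u16, count: u16) -> u16 {
    if count > 15 { 0 } else { a << count }
}

/// Hypothetical per-lane model of a variable logical right shift.
/// Assumption: a count above 15 clears the lane.
fn srlv_lane(a: u16, count: u16) -> u16 {
    if count > 15 { 0 } else { a >> count }
}

/// Hypothetical per-lane model of a variable arithmetic right shift.
/// Assumption: a count above 15 fills the lane with the sign bit.
fn srav_lane(a: i16, count: u16) -> i16 {
    a >> count.min(15)
}
```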
_mm256_storeu_epi8 ⚠ Experimental avx512bw,avx512vl
Store 256 bits (composed of 32 packed 8-bit integers) from a into memory. mem_addr does not need to be aligned on any particular boundary.
_mm256_storeu_epi16 ⚠ Experimental avx512bw,avx512vl
Store 256 bits (composed of 16 packed 16-bit integers) from a into memory. mem_addr does not need to be aligned on any particular boundary.
_mm256_test_epi8_mask ⚠ Experimental avx512bw,avx512vl
Compute the bitwise AND of packed 8-bit integers in a and b, producing intermediate 8-bit values, and set the corresponding bit in result mask k if the intermediate value is non-zero.
_mm256_test_epi16_mask ⚠ Experimental avx512bw,avx512vl
Compute the bitwise AND of packed 16-bit integers in a and b, producing intermediate 16-bit values, and set the corresponding bit in result mask k if the intermediate value is non-zero.
_mm256_testn_epi8_mask ⚠ Experimental avx512bw,avx512vl
Compute the bitwise AND of packed 8-bit integers in a and b, producing intermediate 8-bit values, and set the corresponding bit in result mask k if the intermediate value is zero.
_mm256_testn_epi16_mask ⚠ Experimental avx512bw,avx512vl
Compute the bitwise AND of packed 16-bit integers in a and b, producing intermediate 16-bit values, and set the corresponding bit in result mask k if the intermediate value is zero.
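The test and testn entries above turn a lane-wise bit test into one mask bit per lane. A scalar sketch for the 8-bit, 256-bit case follows. The helper names are hypothetical, and the testn variant is modelled as "the lane-wise AND is zero", which is an assumption about the underlying instruction.

```rust
/// Hypothetical model: bit i of the result is set when a[i] & b[i] is non-zero.
fn test_epi8_mask_model(a: [u8; 32], b: [u8; 32]) -> u32 {
    let mut k = 0u32;
    for i in 0..32 {
        if a[i] & b[i] != 0 {
            k |= 1 << i;
        }
    }
    k
}

/// Hypothetical model: bit i of the result is set when a[i] & b[i] is zero.
fn testn_epi8_mask_model(a: [u8; 32], b: [u8; 32]) -> u32 {
    !test_epi8_mask_model(a, b)
}
```

Under this model the two masks are exact complements of each other for the same inputs.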
_mm512_abs_epi8 ⚠ Experimental avx512bw
Compute the absolute value of packed signed 8-bit integers in a, and store the unsigned results in dst.
_mm512_abs_epi16 ⚠ Experimental avx512bw
Compute the absolute value of packed signed 16-bit integers in a, and store the unsigned results in dst.
_mm512_add_epi8 ⚠ Experimental avx512bw
Add packed 8-bit integers in a and b, and store the results in dst.
_mm512_add_epi16 ⚠ Experimental avx512bw
Add packed 16-bit integers in a and b, and store the results in dst.
_mm512_adds_epi8 ⚠ Experimental avx512bw
Add packed signed 8-bit integers in a and b using saturation, and store the results in dst.
_mm512_adds_epi16 ⚠ Experimental avx512bw
Add packed signed 16-bit integers in a and b using saturation, and store the results in dst.
_mm512_adds_epu8 ⚠ Experimental avx512bw
Add packed unsigned 8-bit integers in a and b using saturation, and store the results in dst.
_mm512_adds_epu16 ⚠ Experimental avx512bw
Add packed unsigned 16-bit integers in a and b using saturation, and store the results in dst.
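The add and adds entries above differ only in what happens on overflow. A per-lane sketch with hypothetical helper names follows; modelling the plain add as wrapping is an assumption, since the listing does not describe its overflow behaviour.

```rust
/// Hypothetical per-lane model of the plain add (assumed to wrap on overflow).
fn add_epi8_lane(a: i8, b: i8) -> i8 {
    a.wrapping_add(b)
}

/// Hypothetical per-lane model of the signed saturating add.
fn adds_epi8_lane(a: i8, b: i8) -> i8 {
    a.saturating_add(b)
}

/// Hypothetical per-lane model of the unsigned saturating add.
fn adds_epu8_lane(a: u8, b: u8) -> u8 {
    a.saturating_add(b)
}
```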
_mm512_alignr_epi8 ⚠ Experimental avx512bw
Concatenate pairs of 16-byte blocks in a and b into a 32-byte temporary result, shift the result right by imm8 bytes, and store the low 16 bytes in dst. Unlike _mm_alignr_epi8, where the entire input vectors are concatenated into a single temporary result, this concatenation happens in 4 steps, one per 128-bit lane, and each step builds its own 32-byte temporary result.
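The per-lane behaviour of the byte alignment can be sketched on plain arrays. This is a scalar model under stated assumptions, not the module's code: the helper name is hypothetical, the block from b is assumed to occupy the low half of each 32-byte temporary, and bytes shifted in from beyond the temporary are assumed to be zero.

```rust
/// Hypothetical scalar model of a 512-bit, lane-wise byte alignment.
fn alignr_epi8_model(a: [u8; 64], b: [u8; 64], imm8: usize) -> [u8; 64] {
    let mut dst = [0u8; 64];
    for lane in 0..4 {
        let base = lane * 16;
        // Build the 32-byte temporary for this lane:
        // b's block in the low half, a's block in the high half (assumed order).
        let mut tmp = [0u8; 32];
        tmp[..16].copy_from_slice(&b[base..base + 16]);
        tmp[16..].copy_from_slice(&a[base..base + 16]);
        // Shift right by imm8 bytes and keep the low 16 bytes.
        for i in 0..16 {
            let src = i + imm8;
            dst[base + i] = if src < 32 { tmp[src] } else { 0 };
        }
    }
    dst
}
```

Because each lane builds its own temporary, bytes never cross a 128-bit lane boundary in this model.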
_mm512_avg_epu8 ⚠ Experimental avx512bw
Average packed unsigned 8-bit integers in a and b, and store the results in dst.
_mm512_avg_epu16 ⚠ Experimental avx512bw
Average packed unsigned 16-bit integers in a and b, and store the results in dst.
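The listing does not say how the average is rounded. A per-lane sketch, assuming the usual round-half-up form `(a + b + 1) >> 1` computed in a wider type; the helper name is hypothetical.

```rust
/// Hypothetical per-lane model of the unsigned 8-bit average.
/// Assumption: the result rounds up, i.e. (a + b + 1) >> 1 without overflow.
fn avg_epu8_lane(a: u8, b: u8) -> u8 {
    ((a as u16 + b as u16 + 1) >> 1) as u8
}
```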
_mm512_broadcastb_epi8 ⚠ Experimental avx512bw
Broadcast the low packed 8-bit integer from a to all elements of dst.
_mm512_broadcastw_epi16 ⚠ Experimental avx512bw
Broadcast the low packed 16-bit integer from a to all elements of dst.
_mm512_bslli_epi128 ⚠ Experimental avx512bw
Shift 128-bit lanes in a left by imm8 bytes while shifting in zeros, and store the results in dst.
_mm512_bsrli_epi128 ⚠ Experimental avx512bw
Shift 128-bit lanes in a right by imm8 bytes while shifting in zeros, and store the results in dst.
_mm512_cmp_epi8_mask ⚠ Experimental avx512bw
Compare packed signed 8-bit integers in a and b based on the comparison operand specified by imm8, and store the results in mask vector k.
_mm512_cmp_epi16_mask ⚠ Experimental avx512bw
Compare packed signed 16-bit integers in a and b based on the comparison operand specified by imm8, and store the results in mask vector k.
_mm512_cmp_epu8_mask ⚠ Experimental avx512bw
Compare packed unsigned 8-bit integers in a and b based on the comparison operand specified by imm8, and store the results in mask vector k.
_mm512_cmp_epu16_mask ⚠ Experimental avx512bw
Compare packed unsigned 16-bit integers in a and b based on the comparison operand specified by imm8, and store the results in mask vector k.
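The generic compare entries above select the predicate through imm8. A scalar sketch for the signed 8-bit case follows. The helper name is hypothetical, and the predicate encoding (0 = equal, 1 = less-than, 2 = less-or-equal, 3 = always false, 4 = not-equal, 5 = not-less-than, 6 = not-less-or-equal, 7 = always true) is an assumption that is not given in this listing.

```rust
/// Hypothetical scalar model of a 64-lane signed 8-bit compare into a mask.
/// Assumption: the low three bits of imm8 select the predicate as listed below.
fn cmp_epi8_mask_model(a: [i8; 64], b: [i8; 64], imm8: u8) -> u64 {
    let mut k = 0u64;
    for i in 0..64 {
        let hit = match imm8 & 0x7 {
            0 => a[i] == b[i], // equal
            1 => a[i] < b[i],  // less-than
            2 => a[i] <= b[i], // less-than-or-equal
            3 => false,        // always false
            4 => a[i] != b[i], // not-equal
            5 => a[i] >= b[i], // not-less-than
            6 => a[i] > b[i],  // not-less-than-or-equal
            _ => true,         // always true
        };
        if hit {
            k |= 1u64 << i;
        }
    }
    k
}
```

The named variants (cmpeq, cmpge, cmpgt, cmple, cmplt, cmpneq) correspond to fixed choices of this predicate.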
_mm512_cmpeq_epi8_mask ⚠ Experimental avx512bw
Compare packed signed 8-bit integers in a and b for equality, and store the results in mask vector k.
_mm512_cmpeq_epi16_mask ⚠ Experimental avx512bw
Compare packed signed 16-bit integers in a and b for equality, and store the results in mask vector k.
_mm512_cmpeq_epu8_mask ⚠ Experimental avx512bw
Compare packed unsigned 8-bit integers in a and b for equality, and store the results in mask vector k.
_mm512_cmpeq_epu16_mask ⚠ Experimental avx512bw
Compare packed unsigned 16-bit integers in a and b for equality, and store the results in mask vector k.
_mm512_cmpge_epi8_mask ⚠ Experimental avx512bw
Compare packed signed 8-bit integers in a and b for greater-than-or-equal, and store the results in mask vector k.
_mm512_cmpge_epi16_mask ⚠ Experimental avx512bw
Compare packed signed 16-bit integers in a and b for greater-than-or-equal, and store the results in mask vector k.
_mm512_cmpge_epu8_mask ⚠ Experimental avx512bw
Compare packed unsigned 8-bit integers in a and b for greater-than-or-equal, and store the results in mask vector k.
_mm512_cmpge_epu16_mask ⚠ Experimental avx512bw
Compare packed unsigned 16-bit integers in a and b for greater-than-or-equal, and store the results in mask vector k.
_mm512_cmpgt_epi8_mask ⚠ Experimental avx512bw
Compare packed signed 8-bit integers in a and b for greater-than, and store the results in mask vector k.
_mm512_cmpgt_epi16_mask ⚠ Experimental avx512bw
Compare packed signed 16-bit integers in a and b for greater-than, and store the results in mask vector k.
_mm512_cmpgt_epu8_mask ⚠ Experimental avx512bw
Compare packed unsigned 8-bit integers in a and b for greater-than, and store the results in mask vector k.
_mm512_cmpgt_epu16_mask ⚠ Experimental avx512bw
Compare packed unsigned 16-bit integers in a and b for greater-than, and store the results in mask vector k.
_mm512_cmple_epi8_mask ⚠ Experimental avx512bw
Compare packed signed 8-bit integers in a and b for less-than-or-equal, and store the results in mask vector k.
_mm512_cmple_epi16_mask ⚠ Experimental avx512bw
Compare packed signed 16-bit integers in a and b for less-than-or-equal, and store the results in mask vector k.
_mm512_cmple_epu8_mask ⚠ Experimental avx512bw
Compare packed unsigned 8-bit integers in a and b for less-than-or-equal, and store the results in mask vector k.
_mm512_cmple_epu16_mask ⚠ Experimental avx512bw
Compare packed unsigned 16-bit integers in a and b for less-than-or-equal, and store the results in mask vector k.
_mm512_cmplt_epi8_mask ⚠ Experimental avx512bw
Compare packed signed 8-bit integers in a and b for less-than, and store the results in mask vector k.
_mm512_cmplt_epi16_mask ⚠ Experimental avx512bw
Compare packed signed 16-bit integers in a and b for less-than, and store the results in mask vector k.
_mm512_cmplt_epu8_mask ⚠ Experimental avx512bw
Compare packed unsigned 8-bit integers in a and b for less-than, and store the results in mask vector k.
_mm512_cmplt_epu16_mask ⚠ Experimental avx512bw
Compare packed unsigned 16-bit integers in a and b for less-than, and store the results in mask vector k.
_mm512_cmpneq_epi8_mask ⚠ Experimental avx512bw
Compare packed signed 8-bit integers in a and b for not-equal, and store the results in mask vector k.
_mm512_cmpneq_epi16_mask ⚠ Experimental avx512bw
Compare packed signed 16-bit integers in a and b for not-equal, and store the results in mask vector k.
_mm512_cmpneq_epu8_mask ⚠ Experimental avx512bw
Compare packed unsigned 8-bit integers in a and b for not-equal, and store the results in mask vector k.
_mm512_cmpneq_epu16_mask ⚠ Experimental avx512bw
Compare packed unsigned 16-bit integers in a and b for not-equal, and store the results in mask vector k.
_mm512_cvtepi8_epi16 ⚠ Experimental avx512bw
Sign extend packed 8-bit integers in a to packed 16-bit integers, and store the results in dst.
_mm512_cvtepi16_epi8 ⚠ Experimental avx512bw
Convert packed 16-bit integers in a to packed 8-bit integers with truncation, and store the results in dst.
_mm512_cvtepu8_epi16 ⚠ Experimental avx512bw
Zero extend packed unsigned 8-bit integers in a to packed 16-bit integers, and store the results in dst.
_mm512_cvtsepi16_epi8 ⚠ Experimental avx512bw
Convert packed signed 16-bit integers in a to packed 8-bit integers with signed saturation, and store the results in dst.
_mm512_cvtusepi16_epi8 ⚠ Experimental avx512bw
Convert packed unsigned 16-bit integers in a to packed unsigned 8-bit integers with unsigned saturation, and store the results in dst.
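The three 16-bit to 8-bit narrowing conversions above differ in how out-of-range values are handled. A per-lane sketch of truncation, signed saturation and unsigned saturation follows; the helper names are hypothetical.

```rust
/// Hypothetical per-lane model of narrowing with truncation (keeps the low 8 bits).
fn cvtepi16_epi8_lane(x: i16) -> i8 {
    x as i8
}

/// Hypothetical per-lane model of narrowing with signed saturation.
fn cvtsepi16_epi8_lane(x: i16) -> i8 {
    x.clamp(i8::MIN as i16, i8::MAX as i16) as i8
}

/// Hypothetical per-lane model of narrowing with unsigned saturation.
fn cvtusepi16_epi8_lane(x: u16) -> u8 {
    x.min(u8::MAX as u16) as u8
}
```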
_mm512_dbsad_epu8 ⚠ Experimental avx512bw
Compute the sum of absolute differences (SADs) of quadruplets of unsigned 8-bit integers in a compared to those in b, and store the 16-bit results in dst. Four SADs are performed on four 8-bit quadruplets for each 64-bit lane. The first two SADs use the lower 8-bit quadruplet of the lane from a, and the last two SADs use the upper 8-bit quadruplet of the lane from a. Quadruplets from b are selected from within 128-bit lanes according to the control in imm8, and each SAD in each 64-bit lane uses the selected quadruplet at 8-bit offsets.
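The double-block SAD is easier to follow on a single 128-bit lane. The sketch below is a scalar model under assumptions, not the module's implementation: the helper name is hypothetical, each 2-bit field of imm8 is assumed to pick one 4-byte quadruplet of b, and the byte offsets used for the four SADs per 64-bit half are taken from the usual description of the underlying instruction.

```rust
/// Hypothetical scalar model of the double-block SAD on one 128-bit lane.
fn dbsad_epu8_lane128(a: [u8; 16], b: [u8; 16], imm8: u8) -> [u16; 8] {
    // Select four quadruplets from b; field q of imm8 picks the source quadruplet.
    let mut tmp = [0u8; 16];
    for q in 0..4 {
        let sel = ((imm8 >> (2 * q)) & 0x3) as usize;
        tmp[q * 4..q * 4 + 4].copy_from_slice(&b[sel * 4..sel * 4 + 4]);
    }
    // Sum of absolute differences of 4 bytes of a against 4 bytes of tmp.
    let sad = |a_off: usize, t_off: usize| -> u16 {
        (0..4)
            .map(|j| (a[a_off + j] as i16 - tmp[t_off + j] as i16).unsigned_abs())
            .sum()
    };
    let mut dst = [0u16; 8];
    for half in 0..2 {
        let o = half * 8; // byte offset of this 64-bit half
        dst[half * 4] = sad(o, o); // lower quadruplet of a, offset 0
        dst[half * 4 + 1] = sad(o, o + 1); // lower quadruplet of a, offset 1
        dst[half * 4 + 2] = sad(o + 4, o + 2); // upper quadruplet of a, offset 2
        dst[half * 4 + 3] = sad(o + 4, o + 3); // upper quadruplet of a, offset 3
    }
    dst
}
```

With imm8 = 0xE4 each field selects its own quadruplet, so b is used unshuffled.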
_mm512_kunpackd ⚠ Experimental avx512bw
Unpack and interleave 32 bits from masks a and b, and store the 64-bit result in k.
_mm512_kunpackw ⚠ Experimental avx512bw
Unpack and interleave 16 bits from masks a and b, and store the 32-bit result in k.
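The two mask-unpack entries above can be modelled on plain integers. In this sketch the helper names are hypothetical, and the operand order (low half of b in the low half of the result, low half of a in the high half) is an assumption that the listing does not spell out.

```rust
/// Hypothetical model: combine the low 32 bits of a and b into a 64-bit mask.
/// Assumption: b supplies the low half, a supplies the high half.
fn kunpackd_model(a: u64, b: u64) -> u64 {
    ((a & 0xFFFF_FFFF) << 32) | (b & 0xFFFF_FFFF)
}

/// Hypothetical model: combine the low 16 bits of a and b into a 32-bit mask.
/// Assumption: b supplies the low half, a supplies the high half.
fn kunpackw_model(a: u32, b: u32) -> u32 {
    ((a & 0xFFFF) << 16) | (b & 0xFFFF)
}
```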
_mm512_loadu_epi8 ⚠ Experimental avx512bw
Load 512 bits (composed of 64 packed 8-bit integers) from memory into dst. mem_addr does not need to be aligned on any particular boundary.
_mm512_loadu_epi16 ⚠ Experimental avx512bw
Load 512 bits (composed of 32 packed 16-bit integers) from memory into dst. mem_addr does not need to be aligned on any particular boundary.
_mm512_madd_epi16 ⚠ Experimental avx512bw
Multiply packed signed 16-bit integers in a and b, producing intermediate signed 32-bit integers. Horizontally add adjacent pairs of intermediate 32-bit integers, and pack the results in dst.
_mm512_maddubs_epi16 ⚠ Experimental avx512bw
Vertically multiply each unsigned 8-bit integer from a with the corresponding signed 8-bit integer from b, producing intermediate signed 16-bit integers. Horizontally add adjacent pairs of intermediate signed 16-bit integers, and pack the saturated results in dst.
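Both multiply-add entries above produce one output element from a pair of adjacent input elements. A sketch of that single pair follows, with hypothetical helper names. Treating the 32-bit sum in the 16-bit variant as wrapping is an assumption; the saturation in the 8-bit variant is stated in the description.

```rust
/// Hypothetical model of one output lane of the 16-bit multiply-add.
/// Assumption: the 32-bit sum wraps in the single case where it can overflow.
fn madd_epi16_pair(a: [i16; 2], b: [i16; 2]) -> i32 {
    (a[0] as i32 * b[0] as i32).wrapping_add(a[1] as i32 * b[1] as i32)
}

/// Hypothetical model of one output lane of the unsigned-by-signed 8-bit multiply-add.
/// The sum of the two products is saturated to the signed 16-bit range.
fn maddubs_epi16_pair(a: [u8; 2], b: [i8; 2]) -> i16 {
    let sum = a[0] as i32 * b[0] as i32 + a[1] as i32 * b[1] as i32;
    sum.clamp(i16::MIN as i32, i16::MAX as i32) as i16
}
```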
_mm512_mask2_permutex2var_epi16 ⚠ Experimental avx512bw
Shuffle 16-bit integers in a and b across lanes using the corresponding selector and index in idx, and store the results in dst using writemask k (elements are copied from idx when the corresponding mask bit is not set).
_mm512_mask_abs_epi8 ⚠ Experimental avx512bw
Compute the absolute value of packed signed 8-bit integers in a, and store the unsigned results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
_mm512_mask_abs_epi16 ⚠ Experimental avx512bw
Compute the absolute value of packed signed 16-bit integers in a, and store the unsigned results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
_mm512_mask_add_epi8 ⚠ Experimental avx512bw
Add packed 8-bit integers in a and b, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
_mm512_mask_add_epi16 ⚠ Experimental avx512bw
Add packed 16-bit integers in a and b, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
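The writemask wording used throughout the `_mm512_mask_*` entries can be sketched on plain arrays: a lane receives the computed value when its mask bit is set and keeps the value from src otherwise. The helper name is hypothetical, and the wrapping add is an assumption about overflow.

```rust
/// Hypothetical scalar model of a writemasked 8-bit add over 64 lanes.
fn mask_add_epi8_model(src: [i8; 64], k: u64, a: [i8; 64], b: [i8; 64]) -> [i8; 64] {
    // Start from src so that lanes with a clear mask bit are copied through.
    let mut dst = src;
    for i in 0..64 {
        if (k >> i) & 1 == 1 {
            dst[i] = a[i].wrapping_add(b[i]);
        }
    }
    dst
}
```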
_mm512_mask_adds_epi8 ⚠ Experimental avx512bw
Add packed signed 8-bit integers in a and b using saturation, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
_mm512_mask_adds_epi16 ⚠ Experimental avx512bw
Add packed signed 16-bit integers in a and b using saturation, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
_mm512_mask_adds_epu8 ⚠ Experimental avx512bw
Add packed unsigned 8-bit integers in a and b using saturation, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
_mm512_mask_adds_epu16 ⚠ Experimental avx512bw
Add packed unsigned 16-bit integers in a and b using saturation, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
_mm512_mask_alignr_epi8 ⚠ Experimental avx512bw
Concatenate pairs of 16-byte blocks in a and b into a 32-byte temporary result, shift the result right by imm8 bytes, and store the low 16 bytes in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
_mm512_mask_avg_epu8 ⚠ Experimental avx512bw
Average packed unsigned 8-bit integers in a and b, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
_mm512_mask_avg_epu16 ⚠ Experimental avx512bw
Average packed unsigned 16-bit integers in a and b, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
_mm512_mask_blend_epi8 ⚠ Experimental avx512bw
Blend packed 8-bit integers from a and b using control mask k, and store the results in dst.
_mm512_mask_blend_epi16 ⚠ Experimental avx512bw
Blend packed 16-bit integers from a and b using control mask k, and store the results in dst.
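The blend entries above do not say which operand a set mask bit selects. A scalar sketch for the 16-bit case, assuming that a set bit takes the lane from b and a clear bit takes it from a; the helper name is hypothetical.

```rust
/// Hypothetical scalar model of a mask-controlled blend over 32 lanes.
/// Assumption: bit i set selects b[i], bit i clear selects a[i].
fn mask_blend_epi16_model(k: u32, a: [i16; 32], b: [i16; 32]) -> [i16; 32] {
    let mut dst = a;
    for i in 0..32 {
        if (k >> i) & 1 == 1 {
            dst[i] = b[i];
        }
    }
    dst
}
```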
_mm512_mask_broadcastb_epi8 ⚠ Experimental avx512bw
Broadcast the low packed 8-bit integer from a to all elements of dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
_mm512_mask_broadcastw_epi16 ⚠ Experimental avx512bw
Broadcast the low packed 16-bit integer from a to all elements of dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
_mm512_mask_cmp_epi8_mask ⚠ Experimental avx512bw
Compare packed signed 8-bit integers in a and b based on the comparison operand specified by imm8, and store the results in mask vector k using zeromask k1 (elements are zeroed out when the corresponding mask bit is not set).
_mm512_mask_cmp_epi16_mask ⚠ Experimental avx512bw
Compare packed signed 16-bit integers in a and b based on the comparison operand specified by imm8, and store the results in mask vector k using zeromask k1 (elements are zeroed out when the corresponding mask bit is not set).
_mm512_mask_cmp_epu8_mask ⚠ Experimental avx512bw
Compare packed unsigned 8-bit integers in a and b based on the comparison operand specified by imm8, and store the results in mask vector k using zeromask k1 (elements are zeroed out when the corresponding mask bit is not set).
_mm512_mask_cmp_epu16_mask ⚠ Experimental avx512bw
Compare packed unsigned 16-bit integers in a and b based on the comparison operand specified by imm8, and store the results in mask vector k using zeromask k1 (elements are zeroed out when the corresponding mask bit is not set).
_mm512_mask_cmpeq_epi8_mask ⚠ Experimental avx512bw
Compare packed signed 8-bit integers in a and b for equality, and store the results in mask vector k using zeromask k1 (elements are zeroed out when the corresponding mask bit is not set).
_mm512_mask_cmpeq_epi16_mask ⚠ Experimental avx512bw
Compare packed signed 16-bit integers in a and b for equality, and store the results in mask vector k using zeromask k1 (elements are zeroed out when the corresponding mask bit is not set).
_mm512_mask_cmpeq_epu8_mask ⚠ Experimental avx512bw
Compare packed unsigned 8-bit integers in a and b for equality, and store the results in mask vector k using zeromask k1 (elements are zeroed out when the corresponding mask bit is not set).
_mm512_mask_cmpeq_epu16_mask ⚠ Experimental avx512bw
Compare packed unsigned 16-bit integers in a and b for equality, and store the results in mask vector k using zeromask k1 (elements are zeroed out when the corresponding mask bit is not set).
_mm512_mask_cmpge_epi8_mask ⚠ Experimental avx512bw
Compare packed signed 8-bit integers in a and b for greater-than-or-equal, and store the results in mask vector k using zeromask k1 (elements are zeroed out when the corresponding mask bit is not set).
_mm512_mask_cmpge_epi16_mask ⚠ Experimental avx512bw
Compare packed signed 16-bit integers in a and b for greater-than-or-equal, and store the results in mask vector k using zeromask k1 (elements are zeroed out when the corresponding mask bit is not set).
_mm512_mask_cmpge_epu8_mask ⚠ Experimental avx512bw
Compare packed unsigned 8-bit integers in a and b for greater-than-or-equal, and store the results in mask vector k using zeromask k1 (elements are zeroed out when the corresponding mask bit is not set).
_mm512_mask_cmpge_epu16_mask ⚠ Experimental avx512bw
Compare packed unsigned 16-bit integers in a and b for greater-than-or-equal, and store the results in mask vector k using zeromask k1 (elements are zeroed out when the corresponding mask bit is not set).
_mm512_mask_cmpgt_epi8_mask ⚠ Experimental avx512bw
Compare packed signed 8-bit integers in a and b for greater-than, and store the results in mask vector k using zeromask k1 (elements are zeroed out when the corresponding mask bit is not set).
_mm512_mask_cmpgt_epi16_mask ⚠ Experimental avx512bw
Compare packed signed 16-bit integers in a and b for greater-than, and store the results in mask vector k using zeromask k1 (elements are zeroed out when the corresponding mask bit is not set).
_mm512_mask_cmpgt_epu8_mask ⚠ Experimental avx512bw
Compare packed unsigned 8-bit integers in a and b for greater-than, and store the results in mask vector k using zeromask k1 (elements are zeroed out when the corresponding mask bit is not set).
_mm512_mask_cmpgt_epu16_mask ⚠ Experimental avx512bw
Compare packed unsigned 16-bit integers in a and b for greater-than, and store the results in mask vector k using zeromask k1 (elements are zeroed out when the corresponding mask bit is not set).
_mm512_mask_cmple_epi8_mask ⚠ Experimental avx512bw
Compare packed signed 8-bit integers in a and b for less-than-or-equal, and store the results in mask vector k using zeromask k1 (elements are zeroed out when the corresponding mask bit is not set).
_mm512_mask_cmple_epi16_mask ⚠ Experimental avx512bw
Compare packed signed 16-bit integers in a and b for less-than-or-equal, and store the results in mask vector k using zeromask k1 (elements are zeroed out when the corresponding mask bit is not set).
_mm512_mask_cmple_epu8_mask ⚠ Experimental avx512bw
Compare packed unsigned 8-bit integers in a and b for less-than-or-equal, and store the results in mask vector k using zeromask k1 (elements are zeroed out when the corresponding mask bit is not set).
_mm512_mask_cmple_epu16_mask ⚠ Experimental avx512bw
Compare packed unsigned 16-bit integers in a and b for less-than-or-equal, and store the results in mask vector k using zeromask k1 (elements are zeroed out when the corresponding mask bit is not set).
_mm512_mask_cmplt_epi8_mask ⚠ Experimental avx512bw
Compare packed signed 8-bit integers in a and b for less-than, and store the results in mask vector k using zeromask k1 (elements are zeroed out when the corresponding mask bit is not set).
_mm512_mask_cmplt_epi16_mask ⚠ Experimental avx512bw
Compare packed signed 16-bit integers in a and b for less-than, and store the results in mask vector k using zeromask k1 (elements are zeroed out when the corresponding mask bit is not set).
_mm512_mask_cmplt_epu8_mask ⚠ Experimental avx512bw
Compare packed unsigned 8-bit integers in a and b for less-than, and store the results in mask vector k using zeromask k1 (elements are zeroed out when the corresponding mask bit is not set).
_mm512_mask_cmplt_epu16_mask ⚠ Experimental avx512bw
Compare packed unsigned 16-bit integers in a and b for less-than, and store the results in mask vector k using zeromask k1 (elements are zeroed out when the corresponding mask bit is not set).
_mm512_mask_cmpneq_epi8_mask ⚠ Experimental avx512bw
Compare packed signed 8-bit integers in a and b for not-equal, and store the results in mask vector k using zeromask k1 (elements are zeroed out when the corresponding mask bit is not set).
_mm512_mask_cmpneq_epi16_mask ⚠ Experimental avx512bw
Compare packed signed 16-bit integers in a and b for not-equal, and store the results in mask vector k using zeromask k1 (elements are zeroed out when the corresponding mask bit is not set).
_mm512_mask_cmpneq_epu8_mask ⚠ Experimental avx512bw
Compare packed unsigned 8-bit integers in a and b for not-equal, and store the results in mask vector k using zeromask k1 (elements are zeroed out when the corresponding mask bit is not set).
_mm512_mask_cmpneq_epu16_mask ⚠ Experimental avx512bw
Compare packed unsigned 16-bit integers in a and b for not-equal, and store the results in mask vector k using zeromask k1 (elements are zeroed out when the corresponding mask bit is not set).
_mm512_mask_cvtepi8_epi16 ⚠ Experimental avx512bw
Sign extend packed 8-bit integers in a to packed 16-bit integers, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
_mm512_mask_cvtepi16_epi8 ⚠ Experimental avx512bw
Convert packed 16-bit integers in a to packed 8-bit integers with truncation, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
_mm512_mask_cvtepi16_storeu_epi8 ⚠ Experimental avx512bw
Convert packed 16-bit integers in a to packed 8-bit integers with truncation, and store the active results (those with their respective bit set in writemask k) to unaligned memory at base_addr.
_mm512_mask_cvtepu8_epi16 ⚠ Experimental avx512bw
Zero extend packed unsigned 8-bit integers in a to packed 16-bit integers, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
_mm512_mask_cvtsepi16_epi8 ⚠ Experimental avx512bw
Convert packed signed 16-bit integers in a to packed 8-bit integers with signed saturation, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
_mm512_mask_cvtsepi16_storeu_epi8 ⚠ Experimental avx512bw
Convert packed signed 16-bit integers in a to packed 8-bit integers with signed saturation, and store the active results (those with their respective bit set in writemask k) to unaligned memory at base_addr.
_mm512_mask_cvtusepi16_epi8 ⚠ Experimental avx512bw
Convert packed unsigned 16-bit integers in a to packed unsigned 8-bit integers with unsigned saturation, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
_mm512_mask_cvtusepi16_storeu_epi8 ⚠ Experimental avx512bw
Convert packed unsigned 16-bit integers in a to packed unsigned 8-bit integers with unsigned saturation, and store the active results (those with their respective bit set in writemask k) to unaligned memory at base_addr.
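The masked narrowing stores above write only the active elements. A scalar sketch of the truncating variant follows, assuming that element i is written to byte offset i (no compaction) and that inactive bytes in memory are left untouched; the helper name and the fixed 32-byte buffer are hypothetical.

```rust
/// Hypothetical scalar model of a masked truncating 16-bit to 8-bit store.
/// Assumption: element i goes to mem[i]; bytes with a clear mask bit are not written.
fn mask_cvtepi16_storeu_epi8_model(mem: &mut [u8; 32], k: u32, a: [i16; 32]) {
    for i in 0..32 {
        if (k >> i) & 1 == 1 {
            mem[i] = a[i] as u8; // truncation keeps the low 8 bits
        }
    }
}
```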
_mm512_mask_dbsad_epu8 ⚠ Experimental avx512bw
Compute the sum of absolute differences (SADs) of quadruplets of unsigned 8-bit integers in a compared to those in b, and store the 16-bit results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set). Four SADs are performed on four 8-bit quadruplets for each 64-bit lane. The first two SADs use the lower 8-bit quadruplet of the lane from a, and the last two SADs use the upper 8-bit quadruplet of the lane from a. Quadruplets from b are selected from within 128-bit lanes according to the control in imm8, and each SAD in each 64-bit lane uses the selected quadruplet at 8-bit offsets.
_mm512_mask_loadu_epi8 ⚠ Experimental avx512bw
Load packed 8-bit integers from memory into dst using writemask k (elements are copied from src when the corresponding mask bit is not set). mem_addr does not need to be aligned on any particular boundary.
_mm512_mask_loadu_epi16 ⚠ Experimental avx512bw
Load packed 16-bit integers from memory into dst using writemask k (elements are copied from src when the corresponding mask bit is not set). mem_addr does not need to be aligned on any particular boundary.
_mm512_mask_madd_epi16 ⚠ Experimental avx512bw
Multiply packed signed 16-bit integers in a and b, producing intermediate signed 32-bit integers. Horizontally add adjacent pairs of intermediate 32-bit integers, and pack the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
_mm512_mask_maddubs_epi16 ⚠ Experimental avx512bw
Multiply packed unsigned 8-bit integers in a by packed signed 8-bit integers in b, producing intermediate signed 16-bit integers. Horizontally add adjacent pairs of intermediate signed 16-bit integers, and pack the saturated results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
_mm512_mask_max_epi8 ⚠ Experimental avx512bw
Compare packed signed 8-bit integers in a and b, and store packed maximum values in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
_mm512_mask_max_epi16 ⚠ Experimental avx512bw
Compare packed signed 16-bit integers in a and b, and store packed maximum values in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
_mm512_mask_max_epu8 ⚠ Experimental avx512bw
Compare packed unsigned 8-bit integers in a and b, and store packed maximum values in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
_mm512_mask_max_epu16 ⚠ Experimental avx512bw
Compare packed unsigned 16-bit integers in a and b, and store packed maximum values in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
_mm512_mask_min_epi8 ⚠ Experimental avx512bw
Compare packed signed 8-bit integers in a and b, and store packed minimum values in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
_mm512_mask_min_epi16 ⚠ Experimental avx512bw
Compare packed signed 16-bit integers in a and b, and store packed minimum values in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
_mm512_mask_min_epu8 ⚠ Experimental avx512bw
Compare packed unsigned 8-bit integers in a and b, and store packed minimum values in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
_mm512_mask_min_epu16 ⚠ Experimental avx512bw
Compare packed unsigned 16-bit integers in a and b, and store packed minimum values in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
_mm512_mask_mov_epi8 ⚠ Experimental avx512bw
Move packed 8-bit integers from a into dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
_mm512_mask_mov_epi16 ⚠ Experimental avx512bw
Move packed 16-bit integers from a into dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
_mm512_mask_mulhi_epi16 ⚠ Experimental avx512bw
Multiply the packed signed 16-bit integers in a and b, producing intermediate 32-bit integers, and store the high 16 bits of the intermediate integers in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
_mm512_mask_mulhi_epu16 ⚠ Experimental avx512bw
Multiply the packed unsigned 16-bit integers in a and b, producing intermediate 32-bit integers, and store the high 16 bits of the intermediate integers in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
_mm512_mask_mulhrs_epi16 ⚠ Experimental avx512bw
Multiply packed signed 16-bit integers in a and b, producing intermediate signed 32-bit integers. Truncate each intermediate integer to the 18 most significant bits, round by adding 1, and store bits [16:1] to dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
_mm512_mask_mullo_epi16 ⚠ Experimental avx512bw
Multiply the packed 16-bit integers in a and b, producing intermediate 32-bit integers, and store the low 16 bits of the intermediate integers in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
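The four 16-bit multiply flavours above all start from the same 32-bit product and keep a different slice of it. A per-lane sketch of the unmasked arithmetic follows, with hypothetical helper names; the rounding step follows the "truncate to 18 bits, add 1, take bits [16:1]" wording of the mulhrs description.

```rust
/// Hypothetical per-lane model: high 16 bits of the signed 32-bit product.
fn mulhi_epi16_lane(a: i16, b: i16) -> i16 {
    ((a as i32 * b as i32) >> 16) as i16
}

/// Hypothetical per-lane model: high 16 bits of the unsigned 32-bit product.
fn mulhi_epu16_lane(a: u16, b: u16) -> u16 {
    ((a as u32 * b as u32) >> 16) as u16
}

/// Hypothetical per-lane model: rounded high half, i.e. bits [16:1] of (product >> 14) + 1.
fn mulhrs_epi16_lane(a: i16, b: i16) -> i16 {
    ((((a as i32 * b as i32) >> 14) + 1) >> 1) as i16
}

/// Hypothetical per-lane model: low 16 bits of the 32-bit product.
fn mullo_epi16_lane(a: i16, b: i16) -> i16 {
    (a as i32 * b as i32) as i16
}
```

Read as Q15 fixed point, `mulhrs_epi16_lane(0x4000, 0x4000)` multiplies 0.5 by 0.5 and yields 0.25.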
_mm512_mask_packs_epi16 ⚠ Experimental avx512bw
Convert packed signed 16-bit integers from a and b to packed 8-bit integers using signed saturation, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
_mm512_mask_packs_epi32 ⚠ Experimental avx512bw
Convert packed signed 32-bit integers from a and b to packed 16-bit integers using signed saturation, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
_mm512_mask_packus_epi16 ⚠ Experimental avx512bw
Convert packed signed 16-bit integers from a and b to packed 8-bit integers using unsigned saturation, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
_mm512_mask_packus_epi32 ⚠ Experimental avx512bw
Convert packed signed 32-bit integers from a and b to packed 16-bit integers using unsigned saturation, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
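The pack entries above narrow two inputs into one output. A sketch of the unmasked 16-bit to 8-bit case on a single 128-bit lane follows. The helper names are hypothetical, and the output layout (the eight values from a first, then the eight from b, within each lane) is an assumption not spelled out in the listing.

```rust
/// Hypothetical model of a signed-saturating pack on one 128-bit lane.
/// Assumption: a fills the low 8 bytes of the lane, b fills the high 8 bytes.
fn packs_epi16_lane128(a: [i16; 8], b: [i16; 8]) -> [i8; 16] {
    let mut dst = [0i8; 16];
    for i in 0..8 {
        dst[i] = a[i].clamp(-128, 127) as i8;
        dst[i + 8] = b[i].clamp(-128, 127) as i8;
    }
    dst
}

/// Hypothetical model of an unsigned-saturating pack on one 128-bit lane.
/// Signed inputs are clamped to 0..=255.
fn packus_epi16_lane128(a: [i16; 8], b: [i16; 8]) -> [u8; 16] {
    let mut dst = [0u8; 16];
    for i in 0..8 {
        dst[i] = a[i].clamp(0, 255) as u8;
        dst[i + 8] = b[i].clamp(0, 255) as u8;
    }
    dst
}
```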
_mm512_mask_permutex2var_epi16 ⚠ Experimental avx512bw
Shuffle 16-bit integers in a and b across lanes using the corresponding selector and index in idx, and store the results in dst using writemask k (elements are copied from a when the corresponding mask bit is not set).
_mm512_mask_permutexvar_epi16 ⚠ Experimental avx512bw
Shuffle 16-bit integers in a across lanes using the corresponding index in idx, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
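The cross-lane permutes above index into whole vectors. A sketch of the unmasked 512-bit forms follows, with hypothetical helper names. It assumes that the low 5 bits of each index pick one of 32 elements and that, in the two-source form, bit 5 is the selector that switches from a to b.

```rust
/// Hypothetical scalar model of a single-source 16-bit permute over 32 lanes.
/// Assumption: only the low 5 bits of each index are used.
fn permutexvar_epi16_model(idx: [u16; 32], a: [i16; 32]) -> [i16; 32] {
    let mut dst = [0i16; 32];
    for i in 0..32 {
        dst[i] = a[(idx[i] & 0x1F) as usize];
    }
    dst
}

/// Hypothetical scalar model of a two-source 16-bit permute over 32 lanes.
/// Assumption: bit 5 of each index selects b when set and a when clear.
fn permutex2var_epi16_model(a: [i16; 32], idx: [u16; 32], b: [i16; 32]) -> [i16; 32] {
    let mut dst = [0i16; 32];
    for i in 0..32 {
        let off = (idx[i] & 0x1F) as usize;
        dst[i] = if idx[i] & 0x20 != 0 { b[off] } else { a[off] };
    }
    dst
}
```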
_mm512_mask_set1_epi8 ⚠ Experimental avx512bw
Broadcast 8-bit integer a to all elements of dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
_mm512_mask_set1_epi16 ⚠ Experimental avx512bw
Broadcast 16-bit integer a to all elements of dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
_mm512_mask_shuffle_epi8 ⚠ Experimental avx512bw
Shuffle 8-bit integers in a within 128-bit lanes using the control in the corresponding 8-bit element of b, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
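The in-lane byte shuffle can be sketched without the writemask. In this model the helper name is hypothetical; it assumes that the low 4 bits of each control byte index within the same 128-bit lane and that a control byte with its top bit set zeroes the output byte.

```rust
/// Hypothetical scalar model of the unmasked in-lane byte shuffle over 64 bytes.
/// Assumptions: control bit 7 zeroes the byte; bits 3:0 index within the lane.
fn shuffle_epi8_model(a: [u8; 64], b: [u8; 64]) -> [u8; 64] {
    let mut dst = [0u8; 64];
    for i in 0..64 {
        let lane_base = i & !0xF; // first byte of the 128-bit lane containing i
        dst[i] = if b[i] & 0x80 != 0 {
            0
        } else {
            a[lane_base + (b[i] & 0x0F) as usize]
        };
    }
    dst
}
```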
_mm512_mask_shufflehi_epi16 ⚠ Experimental avx512bw
Shuffle 16-bit integers in the high 64 bits of 128-bit lanes of a using the control in imm8. Store the results in the high 64 bits of 128-bit lanes of dst, with the low 64 bits of 128-bit lanes being copied from a to dst, using writemask k (elements are copied from src when the corresponding mask bit is not set).
_mm512_mask_shufflelo_epi16 ⚠ Experimental avx512bw
Shuffle 16-bit integers in the low 64 bits of 128-bit lanes of a using the control in imm8. Store the results in the low 64 bits of 128-bit lanes of dst, with the high 64 bits of 128-bit lanes being copied from a to dst, using writemask k (elements are copied from src when the corresponding mask bit is not set).
_mm512_mask_sll_epi16 ⚠ Experimental avx512bw
Shift packed 16-bit integers in a left by count while shifting in zeros, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
_mm512_mask_slli_epi16 ⚠ Experimental avx512bw
Shift packed 16-bit integers in a left by imm8 while shifting in zeros, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
_mm512_mask_sllv_epi16 ⚠ Experimental avx512bw
Shift packed 16-bit integers in a left by the amount specified by the corresponding element in count while shifting in zeros, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
_mm512_mask_sra_epi16 ⚠ Experimental avx512bw
Shift packed 16-bit integers in a right by count while shifting in sign bits, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
_mm512_mask_srai_epi16 ⚠ Experimental avx512bw
Shift packed 16-bit integers in a right by imm8 while shifting in sign bits, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
_mm512_mask_srav_epi16 ⚠ Experimental avx512bw
Shift packed 16-bit integers in a right by the amount specified by the corresponding element in count while shifting in sign bits, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
_mm512_mask_srl_epi16 ⚠ Experimental avx512bw
Shift packed 16-bit integers in a right by count while shifting in zeros, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
_mm512_mask_srli_epi16 ⚠ Experimental avx512bw
Shift packed 16-bit integers in a right by imm8 while shifting in zeros, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
_mm512_mask_srlv_epi16 ⚠ Experimental avx512bw
Shift packed 16-bit integers in a right by the amount specified by the corresponding element in count while shifting in zeros, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
_mm512_mask_storeu_epi8 ⚠ Experimental avx512bw
Store packed 8-bit integers from a into memory using writemask k. mem_addr does not need to be aligned on any particular boundary.
_mm512_mask_storeu_epi16 ⚠ Experimental avx512bw
Store packed 16-bit integers from a into memory using writemask k. mem_addr does not need to be aligned on any particular boundary.
_mm512_mask_sub_epi8 ⚠ Experimental avx512bw
Subtract packed 8-bit integers in b from packed 8-bit integers in a, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
_mm512_mask_sub_epi16 ⚠ Experimental avx512bw
Subtract packed 16-bit integers in b from packed 16-bit integers in a, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
_mm512_mask_subs_epi8 ⚠ Experimental avx512bw
Subtract packed signed 8-bit integers in b from packed signed 8-bit integers in a using signed saturation, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
_mm512_mask_subs_epi16 ⚠ Experimental avx512bw
Subtract packed signed 16-bit integers in b from packed signed 16-bit integers in a using signed saturation, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
_mm512_mask_subs_epu8 ⚠ Experimental avx512bw
Subtract packed unsigned 8-bit integers in b from packed unsigned 8-bit integers in a using saturation, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
_mm512_mask_subs_epu16 ⚠ Experimental avx512bw
Subtract packed unsigned 16-bit integers in b from packed unsigned 16-bit integers in a using saturation, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
_mm512_mask_test_epi8_mask ⚠ Experimental avx512bw
Compute the bitwise AND of packed 8-bit integers in a and b, producing intermediate 8-bit values, and set the corresponding bit in result mask k (subject to writemask k) if the intermediate value is non-zero.
_mm512_mask_test_epi16_mask ⚠ Experimental avx512bw
Compute the bitwise AND of packed 16-bit integers in a and b, producing intermediate 16-bit values, and set the corresponding bit in result mask k (subject to writemask k) if the intermediate value is non-zero.
_mm512_mask_testn_epi8_mask ⚠ Experimental avx512bw
Compute the bitwise AND of packed 8-bit integers in a and b, producing intermediate 8-bit values, and set the corresponding bit in result mask k (subject to writemask k) if the intermediate value is zero.
_mm512_mask_testn_epi16_mask ⚠ Experimental avx512bw
Compute the bitwise AND of packed 16-bit integers in a and b, producing intermediate 16-bit values, and set the corresponding bit in result mask k (subject to writemask k) if the intermediate value is zero.
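The difference between the test and testn entries can be sketched with a scalar model. This is only an illustration in plain Rust, not the intrinsics themselves: the function names are hypothetical, and `k1` stands in for the writemask argument. test sets a bit when the AND of the two elements is non-zero, testn when it is zero, and in both cases bits whose writemask bit is clear stay zero.

```rust
/// Scalar model of the masked `test` operation on 64 byte lanes.
/// Bit i of the result is set only if bit i of `k1` is set and a[i] & b[i] != 0.
pub fn mask_test_epi8_model(k1: u64, a: &[u8; 64], b: &[u8; 64]) -> u64 {
    let mut k = 0u64;
    for i in 0..64 {
        if (k1 >> i) & 1 == 1 && (a[i] & b[i]) != 0 {
            k |= 1u64 << i;
        }
    }
    k
}

/// Scalar model of the masked `testn` operation on 64 byte lanes.
/// Bit i of the result is set only if bit i of `k1` is set and a[i] & b[i] == 0.
pub fn mask_testn_epi8_model(k1: u64, a: &[u8; 64], b: &[u8; 64]) -> u64 {
    let mut k = 0u64;
    for i in 0..64 {
        if (k1 >> i) & 1 == 1 && (a[i] & b[i]) == 0 {
            k |= 1u64 << i;
        }
    }
    k
}
```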
_mm512_mask_unpackhi_epi8 ⚠ Experimental avx512bw
Unpack and interleave 8-bit integers from the high half of each 128-bit lane in a and b, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
_mm512_mask_unpackhi_epi16 ⚠ Experimental avx512bw
Unpack and interleave 16-bit integers from the high half of each 128-bit lane in a and b, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
_mm512_mask_unpacklo_epi8 ⚠ Experimental avx512bw
Unpack and interleave 8-bit integers from the low half of each 128-bit lane in a and b, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
_mm512_mask_unpacklo_epi16 ⚠ Experimental avx512bw
Unpack and interleave 16-bit integers from the low half of each 128-bit lane in a and b, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
_mm512_maskz_abs_epi8 ⚠ Experimental avx512bw
Compute the absolute value of packed signed 8-bit integers in a, and store the unsigned results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm512_maskz_abs_epi16 ⚠ Experimental avx512bw
Compute the absolute value of packed signed 16-bit integers in a, and store the unsigned results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm512_maskz_add_epi8 ⚠ Experimental avx512bw
Add packed 8-bit integers in a and b, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm512_maskz_add_epi16 ⚠ Experimental avx512bw
Add packed 16-bit integers in a and b, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm512_maskz_adds_epi8 ⚠ Experimental avx512bw
Add packed signed 8-bit integers in a and b using saturation, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm512_maskz_adds_epi16 ⚠ Experimental avx512bw
Add packed signed 16-bit integers in a and b using saturation, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm512_maskz_adds_epu8 ⚠ Experimental avx512bw
Add packed unsigned 8-bit integers in a and b using saturation, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm512_maskz_adds_epu16 ⚠ Experimental avx512bw
Add packed unsigned 16-bit integers in a and b using saturation, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm512_maskz_alignr_epi8 ⚠ Experimental avx512bw
Concatenate pairs of 16-byte blocks in a and b into a 32-byte temporary result, shift the result right by imm8 bytes, and store the low 16 bytes in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm512_maskz_avg_epu8 ⚠ Experimental avx512bw
Average packed unsigned 8-bit integers in a and b, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm512_maskz_avg_epu16 ⚠ Experimental avx512bw
Average packed unsigned 16-bit integers in a and b, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm512_maskz_broadcastb_epi8 ⚠ Experimental avx512bw
Broadcast the low packed 8-bit integer from a to all elements of dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm512_maskz_broadcastw_epi16 ⚠ Experimental avx512bw
Broadcast the low packed 16-bit integer from a to all elements of dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm512_maskz_cvtepi8_epi16 ⚠ Experimental avx512bw
Sign extend packed 8-bit integers in a to packed 16-bit integers, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm512_maskz_cvtepi16_epi8 ⚠ Experimental avx512bw
Convert packed 16-bit integers in a to packed 8-bit integers with truncation, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm512_maskz_cvtepu8_epi16 ⚠ Experimental avx512bw
Zero extend packed unsigned 8-bit integers in a to packed 16-bit integers, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm512_maskz_cvtsepi16_epi8 ⚠ Experimental avx512bw
Convert packed signed 16-bit integers in a to packed 8-bit integers with signed saturation, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm512_maskz_cvtusepi16_epi8 ⚠ Experimental avx512bw
Convert packed unsigned 16-bit integers in a to packed unsigned 8-bit integers with unsigned saturation, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
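The three 16-bit to 8-bit conversions above differ only in how an out-of-range element is narrowed. A per-element sketch in plain Rust (the helper names are hypothetical and this is not how the intrinsics are implemented) may make the distinction concrete:

```rust
/// Truncation, as in the `cvtepi16_epi8` entries: keep only the low 8 bits.
pub fn narrow_truncate(x: i16) -> i8 {
    x as i8
}

/// Signed saturation, as in the `cvtsepi16_epi8` entries: clamp to the i8 range.
pub fn narrow_signed_sat(x: i16) -> i8 {
    x.clamp(i8::MIN as i16, i8::MAX as i16) as i8
}

/// Unsigned saturation, as in the `cvtusepi16_epi8` entries:
/// the input is treated as unsigned and clamped to 255.
pub fn narrow_unsigned_sat(x: u16) -> u8 {
    x.min(u8::MAX as u16) as u8
}
```

For the input 384 (0x0180), truncation keeps the low byte 0x80, signed saturation yields 127, and unsigned saturation yields 255.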
_mm512_maskz_dbsad_epu8 ⚠ Experimental avx512bw
Compute the sum of absolute differences (SADs) of quadruplets of unsigned 8-bit integers in a compared to those in b, and store the 16-bit results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set). Four SADs are performed on four 8-bit quadruplets for each 64-bit lane. The first two SADs use the lower 8-bit quadruplet of the lane from a, and the last two SADs use the upper 8-bit quadruplet of the lane from a. Quadruplets from b are selected from within 128-bit lanes according to the control in imm8, and each SAD in each 64-bit lane uses the selected quadruplet at 8-bit offsets.
_mm512_maskz_loadu_epi8 ⚠ Experimental avx512bw
Load packed 8-bit integers from memory into dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set). mem_addr does not need to be aligned on any particular boundary.
_mm512_maskz_loadu_epi16 ⚠ Experimental avx512bw
Load packed 16-bit integers from memory into dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set). mem_addr does not need to be aligned on any particular boundary.
_mm512_maskz_madd_epi16 ⚠ Experimental avx512bw
Multiply packed signed 16-bit integers in a and b, producing intermediate signed 32-bit integers. Horizontally add adjacent pairs of intermediate 32-bit integers, and pack the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm512_maskz_maddubs_epi16 ⚠ Experimental avx512bw
Multiply packed unsigned 8-bit integers in a by packed signed 8-bit integers in b, producing intermediate signed 16-bit integers. Horizontally add adjacent pairs of intermediate signed 16-bit integers, and pack the saturated results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
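The multiply-and-add step described for maddubs can be modelled on slices. The sketch below is a scalar illustration only; the function name is hypothetical and the zeromask step is left out. Each pair of adjacent products is summed in a wider type and then saturated to the signed 16-bit range.

```rust
/// Scalar model of the unmasked maddubs arithmetic.
/// `a` holds unsigned bytes, `b` holds signed bytes; each adjacent pair of
/// products is summed and the sum is saturated to i16.
pub fn maddubs_epi16_model(a: &[u8], b: &[i8]) -> Vec<i16> {
    assert_eq!(a.len(), b.len());
    assert!(a.len() % 2 == 0);
    a.chunks_exact(2)
        .zip(b.chunks_exact(2))
        .map(|(x, y)| {
            let p0 = x[0] as i32 * y[0] as i32;
            let p1 = x[1] as i32 * y[1] as i32;
            (p0 + p1).clamp(i16::MIN as i32, i16::MAX as i32) as i16
        })
        .collect()
}
```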
_mm512_maskz_max_epi8 ⚠ Experimental avx512bw
Compare packed signed 8-bit integers in a and b, and store packed maximum values in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm512_maskz_max_epi16 ⚠ Experimental avx512bw
Compare packed signed 16-bit integers in a and b, and store packed maximum values in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm512_maskz_max_epu8 ⚠ Experimental avx512bw
Compare packed unsigned 8-bit integers in a and b, and store packed maximum values in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm512_maskz_max_epu16 ⚠ Experimental avx512bw
Compare packed unsigned 16-bit integers in a and b, and store packed maximum values in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm512_maskz_min_epi8 ⚠ Experimental avx512bw
Compare packed signed 8-bit integers in a and b, and store packed minimum values in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm512_maskz_min_epi16 ⚠ Experimental avx512bw
Compare packed signed 16-bit integers in a and b, and store packed minimum values in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm512_maskz_min_epu8 ⚠ Experimental avx512bw
Compare packed unsigned 8-bit integers in a and b, and store packed minimum values in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm512_maskz_min_epu16 ⚠ Experimental avx512bw
Compare packed unsigned 16-bit integers in a and b, and store packed minimum values in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm512_maskz_mov_epi8 ⚠ Experimental avx512bw
Move packed 8-bit integers from a into dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm512_maskz_mov_epi16 ⚠ Experimental avx512bw
Move packed 16-bit integers from a into dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm512_maskz_mulhi_epi16 ⚠ Experimental avx512bw
Multiply the packed signed 16-bit integers in a and b, producing intermediate 32-bit integers, and store the high 16 bits of the intermediate integers in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm512_maskz_mulhi_epu16 ⚠ Experimental avx512bw
Multiply the packed unsigned 16-bit integers in a and b, producing intermediate 32-bit integers, and store the high 16 bits of the intermediate integers in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm512_maskz_mulhrs_epi16 ⚠ Experimental avx512bw
Multiply packed signed 16-bit integers in a and b, producing intermediate signed 32-bit integers. Truncate each intermediate integer to the 18 most significant bits, round by adding 1, and store bits [16:1] to dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm512_maskz_mullo_epi16 ⚠ Experimental avx512bw
Multiply the packed 16-bit integers in a and b, producing intermediate 32-bit integers, and store the low 16 bits of the intermediate integers in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm512_maskz_packs_epi16 ⚠ Experimental avx512bw
Convert packed signed 16-bit integers from a and b to packed 8-bit integers using signed saturation, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm512_maskz_packs_epi32 ⚠ Experimental avx512bw
Convert packed signed 32-bit integers from a and b to packed 16-bit integers using signed saturation, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm512_maskz_packus_epi16 ⚠ Experimental avx512bw
Convert packed signed 16-bit integers from a and b to packed 8-bit integers using unsigned saturation, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm512_maskz_packus_epi32 ⚠ Experimental avx512bw
Convert packed signed 32-bit integers from a and b to packed 16-bit integers using unsigned saturation, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm512_maskz_permutex2var_epi16 ⚠ Experimental avx512bw
Shuffle 16-bit integers in a and b across lanes using the corresponding selector and index in idx, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm512_maskz_permutexvar_epi16 ⚠ Experimental avx512bw
Shuffle 16-bit integers in a across lanes using the corresponding index in idx, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm512_maskz_set1_epi8 ⚠ Experimental avx512bw
Broadcast 8-bit integer a to all elements of dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm512_maskz_set1_epi16 ⚠ Experimental avx512bw
Broadcast 16-bit integer a to all elements of dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm512_maskz_shuffle_epi8 ⚠ Experimental avx512bw
Shuffle 8-bit integers in a within 128-bit lanes using the control in the corresponding 8-bit element of b, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm512_maskz_shufflehi_epi16 ⚠ Experimental avx512bw
Shuffle 16-bit integers in the high 64 bits of 128-bit lanes of a using the control in imm8. Store the results in the high 64 bits of 128-bit lanes of dst, with the low 64 bits of 128-bit lanes being copied from a to dst, using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm512_maskz_shufflelo_epi16 ⚠ Experimental avx512bw
Shuffle 16-bit integers in the low 64 bits of 128-bit lanes of a using the control in imm8. Store the results in the low 64 bits of 128-bit lanes of dst, with the high 64 bits of 128-bit lanes being copied from a to dst, using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm512_maskz_sll_epi16 ⚠ Experimental avx512bw
Shift packed 16-bit integers in a left by count while shifting in zeros, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm512_maskz_slli_epi16 ⚠ Experimental avx512bw
Shift packed 16-bit integers in a left by imm8 while shifting in zeros, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm512_maskz_sllv_epi16 ⚠ Experimental avx512bw
Shift packed 16-bit integers in a left by the amount specified by the corresponding element in count while shifting in zeros, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm512_maskz_sra_epi16 ⚠ Experimental avx512bw
Shift packed 16-bit integers in a right by count while shifting in sign bits, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm512_maskz_srai_epi16 ⚠ Experimental avx512bw
Shift packed 16-bit integers in a right by imm8 while shifting in sign bits, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm512_maskz_srav_epi16 ⚠ Experimental avx512bw
Shift packed 16-bit integers in a right by the amount specified by the corresponding element in count while shifting in sign bits, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm512_maskz_srl_epi16 ⚠ Experimental avx512bw
Shift packed 16-bit integers in a right by count while shifting in zeros, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm512_maskz_srli_epi16 ⚠ Experimental avx512bw
Shift packed 16-bit integers in a right by imm8 while shifting in zeros, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm512_maskz_srlv_epi16 ⚠ Experimental avx512bw
Shift packed 16-bit integers in a right by the amount specified by the corresponding element in count while shifting in zeros, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm512_maskz_sub_epi8 ⚠ Experimental avx512bw
Subtract packed 8-bit integers in b from packed 8-bit integers in a, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm512_maskz_sub_epi16 ⚠ Experimental avx512bw
Subtract packed 16-bit integers in b from packed 16-bit integers in a, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm512_maskz_subs_epi8 ⚠ Experimental avx512bw
Subtract packed signed 8-bit integers in b from packed signed 8-bit integers in a using signed saturation, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm512_maskz_subs_epi16 ⚠ Experimental avx512bw
Subtract packed signed 16-bit integers in b from packed signed 16-bit integers in a using signed saturation, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm512_maskz_subs_epu8 ⚠ Experimental avx512bw
Subtract packed unsigned 8-bit integers in b from packed unsigned 8-bit integers in a using saturation, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm512_maskz_subs_epu16 ⚠ Experimental avx512bw
Subtract packed unsigned 16-bit integers in b from packed unsigned 16-bit integers in a using saturation, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm512_maskz_unpackhi_epi8 ⚠ Experimental avx512bw
Unpack and interleave 8-bit integers from the high half of each 128-bit lane in a and b, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm512_maskz_unpackhi_epi16 ⚠ Experimental avx512bw
Unpack and interleave 16-bit integers from the high half of each 128-bit lane in a and b, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm512_maskz_unpacklo_epi8 ⚠ Experimental avx512bw
Unpack and interleave 8-bit integers from the low half of each 128-bit lane in a and b, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm512_maskz_unpacklo_epi16 ⚠ Experimental avx512bw
Unpack and interleave 16-bit integers from the low half of each 128-bit lane in a and b, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
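The mask and maskz entries above differ only in what happens to lanes whose mask bit is clear: a writemask keeps the element from src, a zeromask writes zero. A scalar sketch of the two conventions, using wrapping 8-bit subtraction as the example operation, could look as follows. The function names are hypothetical, and slices stand in for the 512-bit vectors.

```rust
/// Writemask model: lane i is a[i] - b[i] if bit i of `k` is set, else src[i].
pub fn mask_sub_epi8_model(src: &[i8], k: u64, a: &[i8], b: &[i8]) -> Vec<i8> {
    assert!(a.len() <= 64 && a.len() == b.len() && a.len() == src.len());
    (0..a.len())
        .map(|i| if (k >> i) & 1 == 1 { a[i].wrapping_sub(b[i]) } else { src[i] })
        .collect()
}

/// Zeromask model: lane i is a[i] - b[i] if bit i of `k` is set, else 0.
pub fn maskz_sub_epi8_model(k: u64, a: &[i8], b: &[i8]) -> Vec<i8> {
    assert!(a.len() <= 64 && a.len() == b.len());
    (0..a.len())
        .map(|i| if (k >> i) & 1 == 1 { a[i].wrapping_sub(b[i]) } else { 0 })
        .collect()
}
```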
_mm512_max_epi8 ⚠ Experimental avx512bw
Compare packed signed 8-bit integers in a and b, and store packed maximum values in dst.
_mm512_max_epi16 ⚠ Experimental avx512bw
Compare packed signed 16-bit integers in a and b, and store packed maximum values in dst.
_mm512_max_epu8 ⚠ Experimental avx512bw
Compare packed unsigned 8-bit integers in a and b, and store packed maximum values in dst.
_mm512_max_epu16 ⚠ Experimental avx512bw
Compare packed unsigned 16-bit integers in a and b, and store packed maximum values in dst.
_mm512_min_epi8 ⚠ Experimental avx512bw
Compare packed signed 8-bit integers in a and b, and store packed minimum values in dst.
_mm512_min_epi16 ⚠ Experimental avx512bw
Compare packed signed 16-bit integers in a and b, and store packed minimum values in dst.
_mm512_min_epu8 ⚠ Experimental avx512bw
Compare packed unsigned 8-bit integers in a and b, and store packed minimum values in dst.
_mm512_min_epu16 ⚠ Experimental avx512bw
Compare packed unsigned 16-bit integers in a and b, and store packed minimum values in dst.
_mm512_movepi8_mask ⚠ Experimental avx512bw
Set each bit of mask register k based on the most significant bit of the corresponding packed 8-bit integer in a.
_mm512_movepi16_mask ⚠ Experimental avx512bw
Set each bit of mask register k based on the most significant bit of the corresponding packed 16-bit integer in a.
_mm512_movm_epi8 ⚠ Experimental avx512bw
Set each packed 8-bit integer in dst to all ones or all zeros based on the value of the corresponding bit in k.
_mm512_movm_epi16 ⚠ Experimental avx512bw
Set each packed 16-bit integer in dst to all ones or all zeros based on the value of the corresponding bit in k.
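The movepi8_mask and movm_epi8 descriptions are inverse in spirit: one collects sign bits into a mask, the other expands mask bits into all-ones or all-zero elements. A scalar sketch with hypothetical function names, where a `[i8; 64]` array stands in for the 512-bit vector:

```rust
/// Model of movepi8_mask: bit i of the result is the most significant
/// (sign) bit of element i.
pub fn movepi8_mask_model(a: &[i8; 64]) -> u64 {
    let mut k = 0u64;
    for (i, &x) in a.iter().enumerate() {
        if x < 0 {
            k |= 1u64 << i;
        }
    }
    k
}

/// Model of movm_epi8: element i is all ones (-1) if bit i of `k` is set,
/// otherwise all zeros.
pub fn movm_epi8_model(k: u64) -> [i8; 64] {
    let mut dst = [0i8; 64];
    for (i, lane) in dst.iter_mut().enumerate() {
        if (k >> i) & 1 == 1 {
            *lane = -1;
        }
    }
    dst
}
```

In this model, expanding a mask and collecting the sign bits again returns the original mask.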
_mm512_mulhi_epi16 ⚠ Experimental avx512bw
Multiply the packed signed 16-bit integers in a and b, producing intermediate 32-bit integers, and store the high 16 bits of the intermediate integers in dst.
_mm512_mulhi_epu16 ⚠ Experimental avx512bw
Multiply the packed unsigned 16-bit integers in a and b, producing intermediate 32-bit integers, and store the high 16 bits of the intermediate integers in dst.
_mm512_mulhrs_epi16 ⚠ Experimental avx512bw
Multiply packed signed 16-bit integers in a and b, producing intermediate signed 32-bit integers. Truncate each intermediate integer to the 18 most significant bits, round by adding 1, and store bits [16:1] to dst.
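The rounding step in the mulhrs description is easier to follow per element. The sketch below is a scalar model with a hypothetical function name: keeping the 18 most significant bits of the 32-bit product is a right shift by 14, adding 1 rounds, and bits [16:1] are obtained by one more right shift and a narrowing to 16 bits.

```rust
/// Scalar model of the mulhrs arithmetic for one pair of 16-bit elements.
pub fn mulhrs_epi16_model(a: i16, b: i16) -> i16 {
    let product = a as i32 * b as i32;
    // Keep the 18 most significant bits, add 1 to round, then take bits [16:1].
    (((product >> 14) + 1) >> 1) as i16
}
```

Read as Q15 fixed point, 0.5 times 0.5 gives `mulhrs_epi16_model(16384, 16384) == 8192`, that is 0.25. The input pair (-32768, -32768) wraps to -32768 in this model because the result does not fit in 16 bits.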
_mm512_mullo_epi16 ⚠ Experimental avx512bw
Multiply the packed 16-bit integers in a and b, producing intermediate 32-bit integers, and store the low 16 bits of the intermediate integers in dst.
_mm512_packs_epi16 ⚠ Experimental avx512bw
Convert packed signed 16-bit integers from a and b to packed 8-bit integers using signed saturation, and store the results in dst.
_mm512_packs_epi32 ⚠ Experimental avx512bw
Convert packed signed 32-bit integers from a and b to packed 16-bit integers using signed saturation, and store the results in dst.
_mm512_packus_epi16 ⚠ Experimental avx512bw
Convert packed signed 16-bit integers from a and b to packed 8-bit integers using unsigned saturation, and store the results in dst.
_mm512_packus_epi32 ⚠ Experimental avx512bw
Convert packed signed 32-bit integers from a and b to packed 16-bit integers using unsigned saturation, and store the results in dst.
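The pack descriptions do not spell out where the converted elements land. The following scalar sketch of packus_epi16 assumes the per-128-bit-lane layout documented for the underlying instruction (each output lane holds the eight narrowed words of a followed by the eight narrowed words of b). The function name is hypothetical, and arrays stand in for the vectors.

```rust
/// Scalar model of packus_epi16 on 512-bit inputs (32 words each).
/// Signed 16-bit inputs are clamped to 0..=255 (unsigned saturation).
pub fn packus_epi16_model(a: &[i16; 32], b: &[i16; 32]) -> [u8; 64] {
    let sat = |x: i16| x.clamp(0, 255) as u8;
    let mut dst = [0u8; 64];
    for lane in 0..4 {
        for i in 0..8 {
            // Low 8 bytes of the lane come from a, high 8 bytes from b.
            dst[lane * 16 + i] = sat(a[lane * 8 + i]);
            dst[lane * 16 + 8 + i] = sat(b[lane * 8 + i]);
        }
    }
    dst
}
```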
_mm512_permutex2var_epi16 ⚠ Experimental avx512bw
Shuffle 16-bit integers in a and b across lanes using the corresponding selector and index in idx, and store the results in dst.
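The "selector and index in idx" wording can be made concrete with a scalar model. It assumes the bit layout documented for the 512-bit, 16-bit-element form of the instruction: the low 5 bits of each idx element pick one of 32 elements, and bit 5 selects b instead of a. The function name is hypothetical.

```rust
/// Scalar model of permutex2var_epi16 for 32 elements of 16 bits.
pub fn permutex2var_epi16_model(a: &[i16; 32], idx: &[u16; 32], b: &[i16; 32]) -> [i16; 32] {
    core::array::from_fn(|i| {
        let off = (idx[i] & 0x1f) as usize; // index within the chosen source
        if idx[i] & 0x20 != 0 { b[off] } else { a[off] } // selector bit
    })
}
```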
_mm512_permutexvar_epi16 ⚠ Experimental avx512bw
Shuffle 16-bit integers in a across lanes using the corresponding index in idx, and store the results in dst.
_mm512_sad_epu8 ⚠ Experimental avx512bw
Compute the absolute differences of packed unsigned 8-bit integers in a and b, then horizontally sum each consecutive 8 differences to produce eight unsigned 16-bit integers, and pack these unsigned 16-bit integers in the low 16 bits of 64-bit elements in dst.
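As a sketch of the sum-of-absolute-differences step, the model below treats the 512-bit inputs as 64 bytes and returns the eight 64-bit elements of dst, each holding the sum for one group of eight bytes. The function name is hypothetical.

```rust
/// Scalar model of sad_epu8: one sum of absolute differences per 8-byte group.
/// Each sum is at most 8 * 255, so it fits in the low 16 bits of its element.
pub fn sad_epu8_model(a: &[u8; 64], b: &[u8; 64]) -> [u64; 8] {
    let mut dst = [0u64; 8];
    for (j, out) in dst.iter_mut().enumerate() {
        for i in 0..8 {
            *out += a[j * 8 + i].abs_diff(b[j * 8 + i]) as u64;
        }
    }
    dst
}
```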
_mm512_shuffle_epi8 ⚠ Experimental avx512bw
Shuffle 8-bit integers in a within 128-bit lanes using the control in the corresponding 8-bit element of b, and store the results in dst.
_mm512_shufflehi_epi16 ⚠ Experimental avx512bw
Shuffle 16-bit integers in the high 64 bits of 128-bit lanes of a using the control in imm8. Store the results in the high 64 bits of 128-bit lanes of dst, with the low 64 bits of 128-bit lanes being copied from a to dst.
_mm512_shufflelo_epi16 ⚠ Experimental avx512bw
Shuffle 16-bit integers in the low 64 bits of 128-bit lanes of a using the control in imm8. Store the results in the low 64 bits of 128-bit lanes of dst, with the high 64 bits of 128-bit lanes being copied from a to dst.
_mm512_sll_epi16 ⚠ Experimental avx512bw
Shift packed 16-bit integers in a left by count while shifting in zeros, and store the results in dst.
_mm512_slli_epi16 ⚠ Experimental avx512bw
Shift packed 16-bit integers in a left by imm8 while shifting in zeros, and store the results in dst.
_mm512_sllv_epi16 ⚠ Experimental avx512bw
Shift packed 16-bit integers in a left by the amount specified by the corresponding element in count while shifting in zeros, and store the results in dst.
_mm512_sra_epi16 ⚠ Experimental avx512bw
Shift packed 16-bit integers in a right by count while shifting in sign bits, and store the results in dst.
_mm512_srai_epi16 ⚠ Experimental avx512bw
Shift packed 16-bit integers in a right by imm8 while shifting in sign bits, and store the results in dst.
_mm512_srav_epi16 ⚠ Experimental avx512bw
Shift packed 16-bit integers in a right by the amount specified by the corresponding element in count while shifting in sign bits, and store the results in dst.
_mm512_srl_epi16 ⚠ Experimental avx512bw
Shift packed 16-bit integers in a right by count while shifting in zeros, and store the results in dst.
_mm512_srli_epi16 ⚠ Experimental avx512bw
Shift packed 16-bit integers in a right by imm8 while shifting in zeros, and store the results in dst.
_mm512_srlv_epi16 ⚠ Experimental avx512bw
Shift packed 16-bit integers in a right by the amount specified by the corresponding element in count while shifting in zeros, and store the results in dst.
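The per-element shift entries (sllv, srlv, srav) do not say what happens when a count is 16 or larger. The scalar sketch below assumes the behaviour given in the instruction pseudocode: logical shifts produce 0, and the arithmetic shift fills the element with the sign bit. The function names are hypothetical. Plain Rust shift operators would panic or wrap for such counts, so the out-of-range case is handled explicitly.

```rust
/// Model of one sllv_epi16 element: shift left, zeros shifted in.
pub fn sllv_epi16_model(a: u16, count: u16) -> u16 {
    if count < 16 { a << count } else { 0 }
}

/// Model of one srlv_epi16 element: logical shift right, zeros shifted in.
pub fn srlv_epi16_model(a: u16, count: u16) -> u16 {
    if count < 16 { a >> count } else { 0 }
}

/// Model of one srav_epi16 element: arithmetic shift right, sign bits shifted in.
pub fn srav_epi16_model(a: i16, count: u16) -> i16 {
    if count < 16 {
        a >> count
    } else if a < 0 {
        -1
    } else {
        0
    }
}
```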
_mm512_storeu_epi8 ⚠ Experimental avx512bw
Store 512 bits (composed of 64 packed 8-bit integers) from a into memory. mem_addr does not need to be aligned on any particular boundary.
_mm512_storeu_epi16 ⚠ Experimental avx512bw
Store 512 bits (composed of 32 packed 16-bit integers) from a into memory. mem_addr does not need to be aligned on any particular boundary.
_mm512_sub_epi8 ⚠ Experimental avx512bw
Subtract packed 8-bit integers in b from packed 8-bit integers in a, and store the results in dst.
_mm512_sub_epi16 ⚠ Experimental avx512bw
Subtract packed 16-bit integers in b from packed 16-bit integers in a, and store the results in dst.
_mm512_subs_epi8 ⚠ Experimental avx512bw
Subtract packed signed 8-bit integers in b from packed signed 8-bit integers in a using signed saturation, and store the results in dst.
_mm512_subs_epi16 ⚠ Experimental avx512bw
Subtract packed signed 16-bit integers in b from packed signed 16-bit integers in a using signed saturation, and store the results in dst.
_mm512_subs_epu8 ⚠ Experimental avx512bw
Subtract packed unsigned 8-bit integers in b from packed unsigned 8-bit integers in a using saturation, and store the results in dst.
_mm512_subs_epu16 ⚠ Experimental avx512bw
Subtract packed unsigned 16-bit integers in b from packed unsigned 16-bit integers in a using saturation, and store the results in dst.
_mm512_test_epi8_mask ⚠ Experimental avx512bw
Compute the bitwise AND of packed 8-bit integers in a and b, producing intermediate 8-bit values, and set the corresponding bit in result mask k if the intermediate value is non-zero.
_mm512_test_epi16_mask ⚠ Experimental avx512bw
Compute the bitwise AND of packed 16-bit integers in a and b, producing intermediate 16-bit values, and set the corresponding bit in result mask k if the intermediate value is non-zero.
_mm512_testn_epi8_mask ⚠ Experimental avx512bw
Compute the bitwise AND of packed 8-bit integers in a and b, producing intermediate 8-bit values, and set the corresponding bit in result mask k if the intermediate value is zero.
_mm512_testn_epi16_mask ⚠ Experimental avx512bw
Compute the bitwise AND of packed 16-bit integers in a and b, producing intermediate 16-bit values, and set the corresponding bit in result mask k if the intermediate value is zero.
_mm512_unpackhi_epi8 ⚠ Experimental avx512bw
Unpack and interleave 8-bit integers from the high half of each 128-bit lane in a and b, and store the results in dst.
_mm512_unpackhi_epi16 ⚠ Experimental avx512bw
Unpack and interleave 16-bit integers from the high half of each 128-bit lane in a and b, and store the results in dst.
_mm512_unpacklo_epi8 ⚠ Experimental avx512bw
Unpack and interleave 8-bit integers from the low half of each 128-bit lane in a and b, and store the results in dst.
_mm512_unpacklo_epi16 ⚠ Experimental avx512bw
Unpack and interleave 16-bit integers from the low half of each 128-bit lane in a and b, and store the results in dst.
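The per-lane interleave is easy to misread as operating on the whole 512-bit vector, so here is a scalar sketch for the 8-bit case. `unpack_epi8_model` and its `high` flag are hypothetical; the model treats the vector as four independent 16-byte lanes.

```rust
/// Hypothetical scalar model of `_mm512_unpacklo_epi8` (`high == false`)
/// and `_mm512_unpackhi_epi8` (`high == true`).
fn unpack_epi8_model(a: [u8; 64], b: [u8; 64], high: bool) -> [u8; 64] {
    let mut dst = [0u8; 64];
    // Offset of the source half inside each 16-byte lane.
    let half = if high { 8 } else { 0 };
    for lane in 0..4 {
        let base = lane * 16;
        for j in 0..8 {
            // Even output bytes come from a, odd output bytes from b.
            dst[base + 2 * j] = a[base + half + j];
            dst[base + 2 * j + 1] = b[base + half + j];
        }
    }
    dst
}
```

Elements never cross a 128-bit lane boundary in this model: output lane n is built only from input lane n of a and b.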
_mm_cmp_epi8_mask ⚠ Experimental avx512bw,avx512vl
Compare packed signed 8-bit integers in a and b based on the comparison operand specified by imm8, and store the results in mask vector k.
_mm_cmp_epi16_mask ⚠ Experimental avx512bw,avx512vl
Compare packed signed 16-bit integers in a and b based on the comparison operand specified by imm8, and store the results in mask vector k.
_mm_cmp_epu8_mask ⚠ Experimental avx512bw,avx512vl
Compare packed unsigned 8-bit integers in a and b based on the comparison operand specified by imm8, and store the results in mask vector k.
_mm_cmp_epu16_mask ⚠ Experimental avx512bw,avx512vl
Compare packed unsigned 16-bit integers in a and b based on the comparison operand specified by imm8, and store the results in mask vector k.
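The imm8 operand selects one of eight predicates. The sketch below assumes the usual integer-compare encoding (0 EQ, 1 LT, 2 LE, 3 FALSE, 4 NE, 5 NLT, 6 NLE, 7 TRUE); `cmp_mask_model` is a hypothetical 16-lane scalar model, and signedness is chosen through the element type rather than through separate epi/epu functions.

```rust
/// Hypothetical scalar model of the `_mm_cmp_ep{i,u}8_mask` family.
/// Bit i of the result is set when the predicate holds for lane i.
fn cmp_mask_model<T: PartialOrd + Copy>(a: [T; 16], b: [T; 16], imm8: u8) -> u16 {
    let mut k = 0u16;
    for i in 0..16 {
        // Only the low three bits of imm8 are assumed to matter.
        let hit = match imm8 & 0b111 {
            0 => a[i] == b[i], // EQ
            1 => a[i] < b[i],  // LT
            2 => a[i] <= b[i], // LE
            3 => false,        // FALSE
            4 => a[i] != b[i], // NE
            5 => a[i] >= b[i], // NLT (greater-than-or-equal)
            6 => a[i] > b[i],  // NLE (greater-than)
            _ => true,         // TRUE
        };
        if hit {
            k |= 1u16 << i;
        }
    }
    k
}
```

The same bit pattern compares differently depending on signedness: the byte `0xFF` is less than zero as `i8` (`-1`) but greater than zero as `u8` (`255`), which is why the signed and unsigned intrinsics are separate.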
_mm_cmpeq_epi8_mask ⚠ Experimental avx512bw,avx512vl
Compare packed signed 8-bit integers in a and b for equality, and store the results in mask vector k.
_mm_cmpeq_epi16_mask ⚠ Experimental avx512bw,avx512vl
Compare packed signed 16-bit integers in a and b for equality, and store the results in mask vector k.
_mm_cmpeq_epu8_mask ⚠ Experimental avx512bw,avx512vl
Compare packed unsigned 8-bit integers in a and b for equality, and store the results in mask vector k.
_mm_cmpeq_epu16_mask ⚠ Experimental avx512bw,avx512vl
Compare packed unsigned 16-bit integers in a and b for equality, and store the results in mask vector k.
_mm_cmpge_epi8_mask ⚠ Experimental avx512bw,avx512vl
Compare packed signed 8-bit integers in a and b for greater-than-or-equal, and store the results in mask vector k.
_mm_cmpge_epi16_mask ⚠ Experimental avx512bw,avx512vl
Compare packed signed 16-bit integers in a and b for greater-than-or-equal, and store the results in mask vector k.
_mm_cmpge_epu8_mask ⚠ Experimental avx512bw,avx512vl
Compare packed unsigned 8-bit integers in a and b for greater-than-or-equal, and store the results in mask vector k.
_mm_cmpge_epu16_mask ⚠ Experimental avx512bw,avx512vl
Compare packed unsigned 16-bit integers in a and b for greater-than-or-equal, and store the results in mask vector k.
_mm_cmpgt_epi8_mask ⚠ Experimental avx512bw,avx512vl
Compare packed signed 8-bit integers in a and b for greater-than, and store the results in mask vector k.
_mm_cmpgt_epi16_mask ⚠ Experimental avx512bw,avx512vl
Compare packed signed 16-bit integers in a and b for greater-than, and store the results in mask vector k.
_mm_cmpgt_epu8_mask ⚠ Experimental avx512bw,avx512vl
Compare packed unsigned 8-bit integers in a and b for greater-than, and store the results in mask vector k.
_mm_cmpgt_epu16_mask ⚠ Experimental avx512bw,avx512vl
Compare packed unsigned 16-bit integers in a and b for greater-than, and store the results in mask vector k.
_mm_cmple_epi8_mask ⚠ Experimental avx512bw,avx512vl
Compare packed signed 8-bit integers in a and b for less-than-or-equal, and store the results in mask vector k.
_mm_cmple_epi16_mask ⚠ Experimental avx512bw,avx512vl
Compare packed signed 16-bit integers in a and b for less-than-or-equal, and store the results in mask vector k.
_mm_cmple_epu8_mask ⚠ Experimental avx512bw,avx512vl
Compare packed unsigned 8-bit integers in a and b for less-than-or-equal, and store the results in mask vector k.
_mm_cmple_epu16_mask ⚠ Experimental avx512bw,avx512vl
Compare packed unsigned 16-bit integers in a and b for less-than-or-equal, and store the results in mask vector k.
_mm_cmplt_epi8_mask ⚠ Experimental avx512bw,avx512vl
Compare packed signed 8-bit integers in a and b for less-than, and store the results in mask vector k.
_mm_cmplt_epi16_mask ⚠ Experimental avx512bw,avx512vl
Compare packed signed 16-bit integers in a and b for less-than, and store the results in mask vector k.
_mm_cmplt_epu8_mask ⚠ Experimental avx512bw,avx512vl
Compare packed unsigned 8-bit integers in a and b for less-than, and store the results in mask vector k.
_mm_cmplt_epu16_mask ⚠ Experimental avx512bw,avx512vl
Compare packed unsigned 16-bit integers in a and b for less-than, and store the results in mask vector k.
_mm_cmpneq_epi8_mask ⚠ Experimental avx512bw,avx512vl
Compare packed signed 8-bit integers in a and b for not-equal, and store the results in mask vector k.
_mm_cmpneq_epi16_mask ⚠ Experimental avx512bw,avx512vl
Compare packed signed 16-bit integers in a and b for not-equal, and store the results in mask vector k.
_mm_cmpneq_epu8_mask ⚠ Experimental avx512bw,avx512vl
Compare packed unsigned 8-bit integers in a and b for not-equal, and store the results in mask vector k.
_mm_cmpneq_epu16_mask ⚠ Experimental avx512bw,avx512vl
Compare packed unsigned 16-bit integers in a and b for not-equal, and store the results in mask vector k.
_mm_cvtepi16_epi8 ⚠ Experimental avx512bw,avx512vl
Convert packed 16-bit integers in a to packed 8-bit integers with truncation, and store the results in dst.
_mm_cvtsepi16_epi8 ⚠ Experimental avx512bw,avx512vl
Convert packed signed 16-bit integers in a to packed 8-bit integers with signed saturation, and store the results in dst.
_mm_cvtusepi16_epi8 ⚠ Experimental avx512bw,avx512vl
Convert packed unsigned 16-bit integers in a to packed unsigned 8-bit integers with unsigned saturation, and store the results in dst.
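The three narrowing conversions differ only in how an out-of-range 16-bit value is mapped to 8 bits. A scalar sketch, using hypothetical function names and modelling only the eight converted elements (not the layout of the rest of the destination register):

```rust
/// Hypothetical model of `_mm_cvtepi16_epi8`: keep the low 8 bits of each element.
fn cvtepi16_epi8_model(a: [i16; 8]) -> [i8; 8] {
    a.map(|x| x as i8)
}

/// Hypothetical model of `_mm_cvtsepi16_epi8`: clamp to -128..=127 first.
fn cvtsepi16_epi8_model(a: [i16; 8]) -> [i8; 8] {
    a.map(|x| x.clamp(-128, 127) as i8)
}

/// Hypothetical model of `_mm_cvtusepi16_epi8`: clamp to 0..=255 first.
fn cvtusepi16_epi8_model(a: [u16; 8]) -> [u8; 8] {
    a.map(|x| x.min(255) as u8)
}
```

For the input `300`, truncation gives `44` (the low byte `0x2C`), signed saturation gives `127`, and unsigned saturation gives `255`.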
_mm_dbsad_epu8 ⚠ Experimental avx512bw,avx512vl
Compute the sum of absolute differences (SADs) of quadruplets of unsigned 8-bit integers in a compared to those in b, and store the 16-bit results in dst. Four SADs are performed on four 8-bit quadruplets for each 64-bit lane. The first two SADs use the lower 8-bit quadruplet of the lane from a, and the last two SADs use the upper 8-bit quadruplet of the lane from a. Quadruplets from b are selected from within 128-bit lanes according to the control in imm8, and each SAD in each 64-bit lane uses the selected quadruplet at 8-bit offsets.
_mm_loadu_epi8 ⚠ Experimental avx512bw,avx512vl
Load 128-bits (composed of 16 packed 8-bit integers) from memory into dst. mem_addr does not need to be aligned on any particular boundary.
_mm_loadu_epi16 ⚠ Experimental avx512bw,avx512vl
Load 128-bits (composed of 8 packed 16-bit integers) from memory into dst. mem_addr does not need to be aligned on any particular boundary.
_mm_mask2_permutex2var_epi16 ⚠ Experimental avx512bw,avx512vl
Shuffle 16-bit integers in a and b across lanes using the corresponding selector and index in idx, and store the results in dst using writemask k (elements are copied from idx when the corresponding mask bit is not set).
_mm_mask_abs_epi8 ⚠ Experimental avx512bw,avx512vl
Compute the absolute value of packed signed 8-bit integers in a, and store the unsigned results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
_mm_mask_abs_epi16 ⚠ Experimental avx512bw,avx512vl
Compute the absolute value of packed signed 16-bit integers in a, and store the unsigned results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
_mm_mask_add_epi8 ⚠ Experimental avx512bw,avx512vl
Add packed 8-bit integers in a and b, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
_mm_mask_add_epi16 ⚠ Experimental avx512bw,avx512vl
Add packed 16-bit integers in a and b, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
_mm_mask_adds_epi8 ⚠ Experimental avx512bw,avx512vl
Add packed signed 8-bit integers in a and b using saturation, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
_mm_mask_adds_epi16 ⚠ Experimental avx512bw,avx512vl
Add packed signed 16-bit integers in a and b using saturation, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
_mm_mask_adds_epu8 ⚠ Experimental avx512bw,avx512vl
Add packed unsigned 8-bit integers in a and b using saturation, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
_mm_mask_adds_epu16 ⚠ Experimental avx512bw,avx512vl
Add packed unsigned 16-bit integers in a and b using saturation, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
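The writemask wording used throughout these summaries ("elements are copied from src when the corresponding mask bit is not set") can be sketched in scalar form. The two functions below are hypothetical models of the 16-lane, 8-bit additions, not the intrinsics themselves.

```rust
/// Hypothetical scalar model of `_mm_mask_add_epi8`:
/// lane i is `a[i] + b[i]` (wrapping) when bit i of k is set, else `src[i]`.
fn mask_add_epi8_model(src: [i8; 16], k: u16, a: [i8; 16], b: [i8; 16]) -> [i8; 16] {
    std::array::from_fn(|i| {
        if (k >> i) & 1 == 1 {
            a[i].wrapping_add(b[i])
        } else {
            src[i]
        }
    })
}

/// Hypothetical scalar model of `_mm_mask_adds_epu8`:
/// same merge rule, but the sum clamps at 255 instead of wrapping.
fn mask_adds_epu8_model(src: [u8; 16], k: u16, a: [u8; 16], b: [u8; 16]) -> [u8; 16] {
    std::array::from_fn(|i| {
        if (k >> i) & 1 == 1 {
            a[i].saturating_add(b[i])
        } else {
            src[i]
        }
    })
}
```

With `k = 0x00FF`, only the low eight lanes receive the sum; the high eight lanes keep whatever `src` held.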
_mm_mask_alignr_epi8 ⚠ Experimental avx512bw,avx512vl
Concatenate pairs of 16-byte blocks in a and b into a 32-byte temporary result, shift the result right by imm8 bytes, and store the low 16 bytes in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
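The concatenate-and-shift step can be sketched as follows. This is a scalar model under two stated assumptions: b supplies the low 16 bytes of the 32-byte temporary and a the high 16 bytes (as in the unmasked `_mm_alignr_epi8`), and bytes shifted in from beyond the temporary are zero. The function name and the `usize` type of `imm8` are hypothetical.

```rust
/// Hypothetical scalar model of `_mm_mask_alignr_epi8`.
fn mask_alignr_epi8_model(
    src: [u8; 16],
    k: u16,
    a: [u8; 16],
    b: [u8; 16],
    imm8: usize,
) -> [u8; 16] {
    // 32-byte temporary: b in the low half, a in the high half (assumption).
    let mut tmp = [0u8; 32];
    tmp[..16].copy_from_slice(&b);
    tmp[16..].copy_from_slice(&a);

    let mut dst = src;
    for i in 0..16 {
        if (k >> i) & 1 == 1 {
            // Shifting right by imm8 bytes means output byte i reads tmp[i + imm8].
            dst[i] = if i + imm8 < 32 { tmp[i + imm8] } else { 0 };
        }
    }
    dst
}
```

With `imm8 = 4` and a full mask, the result is bytes 4..16 of b followed by bytes 0..4 of a.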
_mm_mask_avg_epu8 ⚠ Experimental avx512bw,avx512vl
Average packed unsigned 8-bit integers in a and b, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
_mm_mask_avg_epu16 ⚠ Experimental avx512bw,avx512vl
Average packed unsigned 16-bit integers in a and b, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
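The summary does not say how the average rounds. The sketch below assumes the rounding of the underlying byte-average instruction, `(a + b + 1) >> 1`, computed in a wider type so the sum cannot overflow. Both function names are hypothetical.

```rust
/// Hypothetical per-lane model: rounding average, halves round up (assumption).
fn avg_epu8_lane(a: u8, b: u8) -> u8 {
    ((a as u16 + b as u16 + 1) >> 1) as u8
}

/// Hypothetical scalar model of `_mm_mask_avg_epu8`.
fn mask_avg_epu8_model(src: [u8; 16], k: u16, a: [u8; 16], b: [u8; 16]) -> [u8; 16] {
    std::array::from_fn(|i| {
        if (k >> i) & 1 == 1 {
            avg_epu8_lane(a[i], b[i])
        } else {
            src[i]
        }
    })
}
```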
_mm_mask_blend_epi8 ⚠ Experimental avx512bw,avx512vl
Blend packed 8-bit integers from a and b using control mask k, and store the results in dst.
_mm_mask_blend_epi16 ⚠ Experimental avx512bw,avx512vl
Blend packed 16-bit integers from a and b using control mask k, and store the results in dst.
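The blend summaries do not spell out which operand a set mask bit selects. The scalar sketch below assumes a set bit picks the lane from b and a clear bit picks it from a; `mask_blend_epi8_model` is a hypothetical name.

```rust
/// Hypothetical scalar model of `_mm_mask_blend_epi8`:
/// bit i set selects `b[i]`, bit i clear selects `a[i]` (assumption).
fn mask_blend_epi8_model(k: u16, a: [i8; 16], b: [i8; 16]) -> [i8; 16] {
    std::array::from_fn(|i| if (k >> i) & 1 == 1 { b[i] } else { a[i] })
}
```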
_mm_mask_broadcastb_epi8 ⚠ Experimental avx512bw,avx512vl
Broadcast the low packed 8-bit integer from a to all elements of dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
_mm_mask_broadcastw_epi16 ⚠ Experimental avx512bw,avx512vl
Broadcast the low packed 16-bit integer from a to all elements of dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
_mm_mask_cmp_epi8_mask ⚠ Experimental avx512bw,avx512vl
Compare packed signed 8-bit integers in a and b based on the comparison operand specified by imm8, and store the results in mask vector k using zeromask k1 (elements are zeroed out when the corresponding mask bit is not set).
_mm_mask_cmp_epi16_mask ⚠ Experimental avx512bw,avx512vl
Compare packed signed 16-bit integers in a and b based on the comparison operand specified by imm8, and store the results in mask vector k using zeromask k1 (elements are zeroed out when the corresponding mask bit is not set).
_mm_mask_cmp_epu8_mask ⚠ Experimental avx512bw,avx512vl
Compare packed unsigned 8-bit integers in a and b based on the comparison operand specified by imm8, and store the results in mask vector k using zeromask k1 (elements are zeroed out when the corresponding mask bit is not set).
_mm_mask_cmp_epu16_mask ⚠ Experimental avx512bw,avx512vl
Compare packed unsigned 16-bit integers in a and b based on the comparison operand specified by imm8, and store the results in mask vector k using zeromask k1 (elements are zeroed out when the corresponding mask bit is not set).
_mm_mask_cmpeq_epi8_mask ⚠ Experimental avx512bw,avx512vl
Compare packed signed 8-bit integers in a and b for equality, and store the results in mask vector k using zeromask k1 (elements are zeroed out when the corresponding mask bit is not set).
_mm_mask_cmpeq_epi16_mask ⚠ Experimental avx512bw,avx512vl
Compare packed signed 16-bit integers in a and b for equality, and store the results in mask vector k using zeromask k1 (elements are zeroed out when the corresponding mask bit is not set).
_mm_mask_cmpeq_epu8_mask ⚠ Experimental avx512bw,avx512vl
Compare packed unsigned 8-bit integers in a and b for equality, and store the results in mask vector k using zeromask k1 (elements are zeroed out when the corresponding mask bit is not set).
_mm_mask_cmpeq_epu16_mask ⚠ Experimental avx512bw,avx512vl
Compare packed unsigned 16-bit integers in a and b for equality, and store the results in mask vector k using zeromask k1 (elements are zeroed out when the corresponding mask bit is not set).
_mm_mask_cmpge_epi8_mask ⚠ Experimental avx512bw,avx512vl
Compare packed signed 8-bit integers in a and b for greater-than-or-equal, and store the results in mask vector k using zeromask k1 (elements are zeroed out when the corresponding mask bit is not set).
_mm_mask_cmpge_epi16_mask ⚠ Experimental avx512bw,avx512vl
Compare packed signed 16-bit integers in a and b for greater-than-or-equal, and store the results in mask vector k using zeromask k1 (elements are zeroed out when the corresponding mask bit is not set).
_mm_mask_cmpge_epu8_mask ⚠ Experimental avx512bw,avx512vl
Compare packed unsigned 8-bit integers in a and b for greater-than-or-equal, and store the results in mask vector k using zeromask k1 (elements are zeroed out when the corresponding mask bit is not set).
_mm_mask_cmpge_epu16_mask ⚠ Experimental avx512bw,avx512vl
Compare packed unsigned 16-bit integers in a and b for greater-than-or-equal, and store the results in mask vector k using zeromask k1 (elements are zeroed out when the corresponding mask bit is not set).
_mm_mask_cmpgt_epi8_mask ⚠ Experimental avx512bw,avx512vl
Compare packed signed 8-bit integers in a and b for greater-than, and store the results in mask vector k using zeromask k1 (elements are zeroed out when the corresponding mask bit is not set).
_mm_mask_cmpgt_epi16_mask ⚠ Experimental avx512bw,avx512vl
Compare packed signed 16-bit integers in a and b for greater-than, and store the results in mask vector k using zeromask k1 (elements are zeroed out when the corresponding mask bit is not set).
_mm_mask_cmpgt_epu8_mask ⚠ Experimental avx512bw,avx512vl
Compare packed unsigned 8-bit integers in a and b for greater-than, and store the results in mask vector k using zeromask k1 (elements are zeroed out when the corresponding mask bit is not set).
_mm_mask_cmpgt_epu16_mask ⚠ Experimental avx512bw,avx512vl
Compare packed unsigned 16-bit integers in a and b for greater-than, and store the results in mask vector k using zeromask k1 (elements are zeroed out when the corresponding mask bit is not set).
_mm_mask_cmple_epi8_mask ⚠ Experimental avx512bw,avx512vl
Compare packed signed 8-bit integers in a and b for less-than-or-equal, and store the results in mask vector k using zeromask k1 (elements are zeroed out when the corresponding mask bit is not set).
_mm_mask_cmple_epi16_mask ⚠ Experimental avx512bw,avx512vl
Compare packed signed 16-bit integers in a and b for less-than-or-equal, and store the results in mask vector k using zeromask k1 (elements are zeroed out when the corresponding mask bit is not set).
_mm_mask_cmple_epu8_mask ⚠ Experimental avx512bw,avx512vl
Compare packed unsigned 8-bit integers in a and b for less-than-or-equal, and store the results in mask vector k using zeromask k1 (elements are zeroed out when the corresponding mask bit is not set).
_mm_mask_cmple_epu16_mask ⚠ Experimental avx512bw,avx512vl
Compare packed unsigned 16-bit integers in a and b for less-than-or-equal, and store the results in mask vector k using zeromask k1 (elements are zeroed out when the corresponding mask bit is not set).
_mm_mask_cmplt_epi8_mask ⚠ Experimental avx512bw,avx512vl
Compare packed signed 8-bit integers in a and b for less-than, and store the results in mask vector k using zeromask k1 (elements are zeroed out when the corresponding mask bit is not set).
_mm_mask_cmplt_epi16_mask ⚠ Experimental avx512bw,avx512vl
Compare packed signed 16-bit integers in a and b for less-than, and store the results in mask vector k using zeromask k1 (elements are zeroed out when the corresponding mask bit is not set).
_mm_mask_cmplt_epu8_mask ⚠ Experimental avx512bw,avx512vl
Compare packed unsigned 8-bit integers in a and b for less-than, and store the results in mask vector k using zeromask k1 (elements are zeroed out when the corresponding mask bit is not set).
_mm_mask_cmplt_epu16_mask ⚠ Experimental avx512bw,avx512vl
Compare packed unsigned 16-bit integers in a and b for less-than, and store the results in mask vector k using zeromask k1 (elements are zeroed out when the corresponding mask bit is not set).
_mm_mask_cmpneq_epi8_mask ⚠ Experimental avx512bw,avx512vl
Compare packed signed 8-bit integers in a and b for not-equal, and store the results in mask vector k using zeromask k1 (elements are zeroed out when the corresponding mask bit is not set).
_mm_mask_cmpneq_epi16_mask ⚠ Experimental avx512bw,avx512vl
Compare packed signed 16-bit integers in a and b for not-equal, and store the results in mask vector k using zeromask k1 (elements are zeroed out when the corresponding mask bit is not set).
_mm_mask_cmpneq_epu8_mask ⚠ Experimental avx512bw,avx512vl
Compare packed unsigned 8-bit integers in a and b for not-equal, and store the results in mask vector k using zeromask k1 (elements are zeroed out when the corresponding mask bit is not set).
_mm_mask_cmpneq_epu16_mask ⚠ Experimental avx512bw,avx512vl
Compare packed unsigned 16-bit integers in a and b for not-equal, and store the results in mask vector k using zeromask k1 (elements are zeroed out when the corresponding mask bit is not set).
_mm_mask_cvtepi8_epi16 ⚠ Experimental avx512bw,avx512vl
Sign extend packed 8-bit integers in a to packed 16-bit integers, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
_mm_mask_cvtepi16_epi8 ⚠ Experimental avx512bw,avx512vl
Convert packed 16-bit integers in a to packed 8-bit integers with truncation, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
_mm_mask_cvtepi16_storeu_epi8 ⚠ Experimental avx512bw,avx512vl
Convert packed 16-bit integers in a to packed 8-bit integers with truncation, and store the active results (those with their respective bit set in writemask k) to unaligned memory at base_addr.
_mm_mask_cvtepu8_epi16 ⚠ Experimental avx512bw,avx512vl
Zero extend packed unsigned 8-bit integers in a to packed 16-bit integers, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
_mm_mask_cvtsepi16_epi8 ⚠ Experimental avx512bw,avx512vl
Convert packed signed 16-bit integers in a to packed 8-bit integers with signed saturation, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
_mm_mask_cvtsepi16_storeu_epi8 ⚠ Experimental avx512bw,avx512vl
Convert packed signed 16-bit integers in a to packed 8-bit integers with signed saturation, and store the active results (those with their respective bit set in writemask k) to unaligned memory at base_addr.
_mm_mask_cvtusepi16_epi8 ⚠ Experimental avx512bw,avx512vl
Convert packed unsigned 16-bit integers in a to packed unsigned 8-bit integers with unsigned saturation, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
_mm_mask_cvtusepi16_storeu_epi8 ⚠ Experimental avx512bw,avx512vl
Convert packed unsigned 16-bit integers in a to packed unsigned 8-bit integers with unsigned saturation, and store the active results (those with their respective bit set in writemask k) to unaligned memory at base_addr.
_mm_mask_dbsad_epu8 ⚠ Experimental avx512bw,avx512vl
Compute the sum of absolute differences (SADs) of quadruplets of unsigned 8-bit integers in a compared to those in b, and store the 16-bit results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set). Four SADs are performed on four 8-bit quadruplets for each 64-bit lane. The first two SADs use the lower 8-bit quadruplet of the lane from a, and the last two SADs use the upper 8-bit quadruplet of the lane from a. Quadruplets from b are selected from within 128-bit lanes according to the control in imm8, and each SAD in each 64-bit lane uses the selected quadruplet at 8-bit offsets.
_mm_mask_loadu_epi8 ⚠ Experimental avx512bw,avx512vl
Load packed 8-bit integers from memory into dst using writemask k (elements are copied from src when the corresponding mask bit is not set). mem_addr does not need to be aligned on any particular boundary.
_mm_mask_loadu_epi16 ⚠ Experimental avx512bw,avx512vl
Load packed 16-bit integers from memory into dst using writemask k (elements are copied from src when the corresponding mask bit is not set). mem_addr does not need to be aligned on any particular boundary.
_mm_mask_madd_epi16 ⚠ Experimental avx512bw,avx512vl
Multiply packed signed 16-bit integers in a and b, producing intermediate signed 32-bit integers. Horizontally add adjacent pairs of intermediate 32-bit integers, and pack the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
_mm_mask_maddubs_epi16 ⚠ Experimental avx512bw,avx512vl
Multiply packed unsigned 8-bit integers in a by packed signed 8-bit integers in b, producing intermediate signed 16-bit integers. Horizontally add adjacent pairs of intermediate signed 16-bit integers, and pack the saturated results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
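The two multiply-and-add operations halve the lane count: each output element is built from a pair of adjacent inputs. A scalar sketch with hypothetical function names; the mask therefore has one bit per output element (four 32-bit results for madd, eight 16-bit results for maddubs). The wrapping add in the first model is an assumption about the single overflow case, where all four inputs are -32768.

```rust
/// Hypothetical scalar model of `_mm_mask_madd_epi16`.
fn mask_madd_epi16_model(src: [i32; 4], k: u8, a: [i16; 8], b: [i16; 8]) -> [i32; 4] {
    std::array::from_fn(|j| {
        if (k >> j) & 1 == 1 {
            let lo = a[2 * j] as i32 * b[2 * j] as i32;
            let hi = a[2 * j + 1] as i32 * b[2 * j + 1] as i32;
            // No saturation here; the pair sum is assumed to wrap.
            lo.wrapping_add(hi)
        } else {
            src[j]
        }
    })
}

/// Hypothetical scalar model of `_mm_mask_maddubs_epi16`:
/// a is unsigned, b is signed, and the pair sum saturates to i16.
fn mask_maddubs_epi16_model(src: [i16; 8], k: u8, a: [u8; 16], b: [i8; 16]) -> [i16; 8] {
    std::array::from_fn(|j| {
        if (k >> j) & 1 == 1 {
            let sum = a[2 * j] as i32 * b[2 * j] as i32
                + a[2 * j + 1] as i32 * b[2 * j + 1] as i32;
            sum.clamp(i16::MIN as i32, i16::MAX as i32) as i16
        } else {
            src[j]
        }
    })
}
```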
_mm_mask_max_epi8 ⚠ Experimental avx512bw,avx512vl
Compare packed signed 8-bit integers in a and b, and store packed maximum values in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
_mm_mask_max_epi16 ⚠ Experimental avx512bw,avx512vl
Compare packed signed 16-bit integers in a and b, and store packed maximum values in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
_mm_mask_max_epu8 ⚠ Experimental avx512bw,avx512vl
Compare packed unsigned 8-bit integers in a and b, and store packed maximum values in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
_mm_mask_max_epu16 ⚠ Experimental avx512bw,avx512vl
Compare packed unsigned 16-bit integers in a and b, and store packed maximum values in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
_mm_mask_min_epi8 ⚠ Experimental avx512bw,avx512vl
Compare packed signed 8-bit integers in a and b, and store packed minimum values in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
_mm_mask_min_epi16 ⚠ Experimental avx512bw,avx512vl
Compare packed signed 16-bit integers in a and b, and store packed minimum values in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
_mm_mask_min_epu8 ⚠ Experimental avx512bw,avx512vl
Compare packed unsigned 8-bit integers in a and b, and store packed minimum values in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
_mm_mask_min_epu16 ⚠ Experimental avx512bw,avx512vl
Compare packed unsigned 16-bit integers in a and b, and store packed minimum values in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
_mm_mask_mov_epi8 ⚠ Experimental avx512bw,avx512vl
Move packed 8-bit integers from a into dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
_mm_mask_mov_epi16 ⚠ Experimental avx512bw,avx512vl
Move packed 16-bit integers from a into dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
_mm_mask_mulhi_epi16 ⚠ Experimental avx512bw,avx512vl
Multiply the packed signed 16-bit integers in a and b, producing intermediate 32-bit integers, and store the high 16 bits of the intermediate integers in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
_mm_mask_mulhi_epu16 ⚠ Experimental avx512bw,avx512vl
Multiply the packed unsigned 16-bit integers in a and b, producing intermediate 32-bit integers, and store the high 16 bits of the intermediate integers in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
_mm_mask_mulhrs_epi16 ⚠ Experimental avx512bw,avx512vl
Multiply packed signed 16-bit integers in a and b, producing intermediate signed 32-bit integers. Truncate each intermediate integer to the 18 most significant bits, round by adding 1, and store bits [16:1] to dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
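The "truncate to 18 bits, add 1, take bits [16:1]" wording is a rounding multiply on Q15 fixed-point values. A per-lane scalar sketch, with `mulhrs_epi16_lane` as a hypothetical name:

```rust
/// Hypothetical per-lane model of the mulhrs step: a rounded Q15 product.
fn mulhrs_epi16_lane(a: i16, b: i16) -> i16 {
    let prod = a as i32 * b as i32;
    // `prod >> 14` keeps the 18 most significant bits of the 32-bit product;
    // adding 1 and shifting once more rounds to nearest and selects bits [16:1].
    (((prod >> 14) + 1) >> 1) as i16
}
```

Reading 16384 as 0.5 in Q15, `mulhrs_epi16_lane(16384, 16384)` gives 8192, that is 0.25. The one input pair that does not fit is -32768 times -32768, which comes back as -32768 in this model.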
_mm_mask_mullo_epi16 ⚠ Experimental avx512bw,avx512vl
Multiply the packed 16-bit integers in a and b, producing intermediate 32-bit integers, and store the low 16 bits of the intermediate integers in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
_mm_mask_packs_epi16 ⚠ Experimental avx512bw,avx512vl
Convert packed signed 16-bit integers from a and b to packed 8-bit integers using signed saturation, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
_mm_mask_packs_epi32 ⚠ Experimental avx512bw,avx512vl
Convert packed signed 32-bit integers from a and b to packed 16-bit integers using signed saturation, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
_mm_mask_packus_epi16 ⚠ Experimental avx512bw,avx512vl
Convert packed signed 16-bit integers from a and b to packed 8-bit integers using unsigned saturation, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
_mm_mask_packus_epi32 ⚠ Experimental avx512bw,avx512vl
Convert packed signed 32-bit integers from a and b to packed 16-bit integers using unsigned saturation, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
_mm_mask_permutex2var_epi16 ⚠ Experimental avx512bw,avx512vl
Shuffle 16-bit integers in a and b across lanes using the corresponding selector and index in idx, and store the results in dst using writemask k (elements are copied from a when the corresponding mask bit is not set).
_mm_mask_permutexvar_epi16 ⚠ Experimental avx512bw,avx512vl
Shuffle 16-bit integers in a across lanes using the corresponding index in idx, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
_mm_mask_reduce_add_epi8 ⚠ Experimental avx512bw,avx512vl
Reduce the packed 8-bit integers in a by addition using mask k. Returns the sum of all active elements in a.
_mm_mask_reduce_add_epi16 ⚠ Experimental avx512bw,avx512vl
Reduce the packed 16-bit integers in a by addition using mask k. Returns the sum of all active elements in a.
_mm_mask_reduce_and_epi8 ⚠ Experimental avx512bw,avx512vl
Reduce the packed 8-bit integers in a by bitwise AND using mask k. Returns the bitwise AND of all active elements in a.
_mm_mask_reduce_and_epi16 ⚠ Experimental avx512bw,avx512vl
Reduce the packed 16-bit integers in a by bitwise AND using mask k. Returns the bitwise AND of all active elements in a.
_mm_mask_reduce_max_epi8 ⚠ Experimental avx512bw,avx512vl
Reduce the packed 8-bit integers in a by maximum using mask k. Returns the maximum of all active elements in a.
_mm_mask_reduce_max_epi16 ⚠ Experimental avx512bw,avx512vl
Reduce the packed 16-bit integers in a by maximum using mask k. Returns the maximum of all active elements in a.
_mm_mask_reduce_max_epu8 ⚠ Experimental avx512bw,avx512vl
Reduce the packed unsigned 8-bit integers in a by maximum using mask k. Returns the maximum of all active elements in a.
_mm_mask_reduce_max_epu16 ⚠ Experimental avx512bw,avx512vl
Reduce the packed unsigned 16-bit integers in a by maximum using mask k. Returns the maximum of all active elements in a.
_mm_mask_reduce_min_epi8 ⚠ Experimental avx512bw,avx512vl
Reduce the packed 8-bit integers in a by minimum using mask k. Returns the minimum of all active elements in a.
_mm_mask_reduce_min_epi16 ⚠ Experimental avx512bw,avx512vl
Reduce the packed 16-bit integers in a by minimum using mask k. Returns the minimum of all active elements in a.
_mm_mask_reduce_min_epu8 ⚠ Experimental avx512bw,avx512vl
Reduce the packed unsigned 8-bit integers in a by minimum using mask k. Returns the minimum of all active elements in a.
_mm_mask_reduce_min_epu16 ⚠ Experimental avx512bw,avx512vl
Reduce the packed unsigned 16-bit integers in a by minimum using mask k. Returns the minimum of all active elements in a.
_mm_mask_reduce_mul_epi8 ⚠ Experimental avx512bw,avx512vl
Reduce the packed 8-bit integers in a by multiplication using mask k. Returns the product of all active elements in a.
_mm_mask_reduce_mul_epi16 ⚠ Experimental avx512bw,avx512vl
Reduce the packed 16-bit integers in a by multiplication using mask k. Returns the product of all active elements in a.
_mm_mask_reduce_or_epi8 ⚠ Experimental avx512bw,avx512vl
Reduce the packed 8-bit integers in a by bitwise OR using mask k. Returns the bitwise OR of all active elements in a.
_mm_mask_reduce_or_epi16 ⚠ Experimental avx512bw,avx512vl
Reduce the packed 16-bit integers in a by bitwise OR using mask k. Returns the bitwise OR of all active elements in a.
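All of the masked reductions follow one pattern: fold the active lanes with the operation. The sketch below assumes inactive lanes contribute the operation's identity (0 for add and OR, 1 for mul, all ones for AND, the type's maximum for min and its minimum for max), so an empty mask returns that identity. `mask_reduce_model` and its parameters are hypothetical, and wrapping arithmetic for add and mul is likewise an assumption.

```rust
/// Hypothetical scalar model of the `_mm_mask_reduce_*_epi8` family.
/// `identity` is the value an inactive lane is assumed to contribute.
fn mask_reduce_model(k: u16, a: [i8; 16], identity: i8, op: fn(i8, i8) -> i8) -> i8 {
    (0..16)
        .filter(|&i| (k >> i) & 1 == 1) // keep only lanes whose mask bit is set
        .fold(identity, |acc, i| op(acc, a[i]))
}
```

For example, with lanes holding 1, 2, 3, ... and `k = 0b0111`, `mask_reduce_model(k, a, 0, i8::wrapping_add)` sums the first three lanes.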
_mm_mask_set1_epi8 ⚠ Experimental avx512bw,avx512vl
Broadcast 8-bit integer a to all elements of dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
_mm_mask_set1_epi16 ⚠ Experimental avx512bw,avx512vl
Broadcast 16-bit integer a to all elements of dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
_mm_mask_shuffle_epi8 ⚠ Experimental avx512bw,avx512vl
Shuffle 8-bit integers in a within 128-bit lanes using the control in the corresponding 8-bit element of b, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
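The byte shuffle's control encoding is not spelled out in the summary. The sketch below assumes the usual byte-shuffle rule: the low four bits of each control byte index into a, and a set top bit zeroes the output byte instead. `mask_shuffle_epi8_model` is a hypothetical name, and the 128-bit form has a single lane.

```rust
/// Hypothetical scalar model of `_mm_mask_shuffle_epi8`.
fn mask_shuffle_epi8_model(src: [u8; 16], k: u16, a: [u8; 16], b: [u8; 16]) -> [u8; 16] {
    std::array::from_fn(|i| {
        if (k >> i) & 1 == 0 {
            src[i] // writemask bit clear: keep src
        } else if b[i] & 0x80 != 0 {
            0 // top control bit set: zero the byte (assumption)
        } else {
            a[(b[i] & 0x0F) as usize] // low four bits select a source byte
        }
    })
}
```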
_mm_mask_shufflehi_epi16 ⚠ Experimental avx512bw,avx512vl
Shuffle 16-bit integers in the high 64 bits of 128-bit lanes of a using the control in imm8. Store the results in the high 64 bits of 128-bit lanes of dst, with the low 64 bits of 128-bit lanes being copied from a to dst, using writemask k (elements are copied from src when the corresponding mask bit is not set).
_mm_mask_shufflelo_epi16 ⚠ Experimental avx512bw,avx512vl
Shuffle 16-bit integers in the low 64 bits of 128-bit lanes of a using the control in imm8. Store the results in the low 64 bits of 128-bit lanes of dst, with the high 64 bits of 128-bit lanes being copied from a to dst, using writemask k (elements are copied from src when the corresponding mask bit is not set).
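A scalar sketch of the two half-lane shuffles, assuming imm8 is read as four 2-bit selectors with output word j using bits [2j+1:2j]. The function names are hypothetical, and the models cover the single 128-bit lane of the `_mm_` forms.

```rust
/// Hypothetical scalar model of `_mm_mask_shufflelo_epi16`.
fn mask_shufflelo_epi16_model(src: [u16; 8], k: u8, a: [u16; 8], imm8: u8) -> [u16; 8] {
    let mut tmp = a; // the high four words pass through unchanged
    for j in 0..4 {
        tmp[j] = a[((imm8 >> (2 * j)) & 0b11) as usize];
    }
    std::array::from_fn(|i| if (k >> i) & 1 == 1 { tmp[i] } else { src[i] })
}

/// Hypothetical scalar model of `_mm_mask_shufflehi_epi16`.
fn mask_shufflehi_epi16_model(src: [u16; 8], k: u8, a: [u16; 8], imm8: u8) -> [u16; 8] {
    let mut tmp = a; // the low four words pass through unchanged
    for j in 0..4 {
        tmp[4 + j] = a[4 + ((imm8 >> (2 * j)) & 0b11) as usize];
    }
    std::array::from_fn(|i| if (k >> i) & 1 == 1 { tmp[i] } else { src[i] })
}
```

Under this reading, `imm8 = 0x1B` (selectors 3, 2, 1, 0) reverses the four words of the affected half.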
_mm_mask_sll_epi16 ⚠ Experimental avx512bw,avx512vl
Shift packed 16-bit integers in a left by count while shifting in zeros, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
_mm_mask_slli_epi16 ⚠ Experimental avx512bw,avx512vl
Shift packed 16-bit integers in a left by imm8 while shifting in zeros, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
_mm_mask_sllv_epi16 ⚠ Experimental avx512bw,avx512vl
Shift packed 16-bit integers in a left by the amount specified by the corresponding element in count while shifting in zeros, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
_mm_mask_sra_epi16 ⚠ Experimental avx512bw,avx512vl
Shift packed 16-bit integers in a right by count while shifting in sign bits, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
_mm_mask_srai_epi16 ⚠ Experimental avx512bw,avx512vl
Shift packed 16-bit integers in a right by imm8 while shifting in sign bits, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
_mm_mask_srav_epi16 ⚠ Experimental avx512bw,avx512vl
Shift packed 16-bit integers in a right by the amount specified by the corresponding element in count while shifting in sign bits, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
_mm_mask_srl_epi16 ⚠ Experimental avx512bw,avx512vl
Shift packed 16-bit integers in a right by count while shifting in zeros, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
_mm_mask_srli_epi16 ⚠ Experimental avx512bw,avx512vl
Shift packed 16-bit integers in a right by imm8 while shifting in zeros, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
_mm_mask_srlv_epi16 ⚠ Experimental avx512bw,avx512vl
Shift packed 16-bit integers in a right by the amount specified by the corresponding element in count while shifting in zeros, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
_mm_mask_storeu_epi8 ⚠ Experimental avx512bw,avx512vl
Store packed 8-bit integers from a into memory using writemask k. mem_addr does not need to be aligned on any particular boundary.
_mm_mask_storeu_epi16 ⚠ Experimental avx512bw,avx512vl
Store packed 16-bit integers from a into memory using writemask k. mem_addr does not need to be aligned on any particular boundary.
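A masked store only touches the memory elements whose mask bit is set. The following scalar sketch illustrates that behaviour for eight 16-bit elements; `mask_storeu_epi16_model` is a hypothetical helper and uses a fixed-size array in place of the raw `mem_addr` pointer.

```rust
/// Scalar model of a writemasked store of eight 16-bit elements.
/// `mask_storeu_epi16_model` is a hypothetical helper, not part of `core::arch`;
/// a `[i16; 8]` array stands in for the destination memory.
fn mask_storeu_epi16_model(mem: &mut [i16; 8], k: u8, a: [i16; 8]) {
    for i in 0..8 {
        // Elements whose mask bit is clear are left untouched in memory.
        if (k >> i) & 1 == 1 {
            mem[i] = a[i];
        }
    }
}
```

Storing `[1, 2, 3, 4, 5, 6, 7, 8]` with `k = 0b0000_0101` into memory filled with 9 overwrites only elements 0 and 2.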
_mm_mask_sub_epi8 ⚠ Experimental avx512bw,avx512vl
Subtract packed 8-bit integers in b from packed 8-bit integers in a, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
_mm_mask_sub_epi16 ⚠ Experimental avx512bw,avx512vl
Subtract packed 16-bit integers in b from packed 16-bit integers in a, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
_mm_mask_subs_epi8 ⚠ Experimental avx512bw,avx512vl
Subtract packed signed 8-bit integers in b from packed 8-bit integers in a using saturation, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
_mm_mask_subs_epi16 ⚠ Experimental avx512bw,avx512vl
Subtract packed signed 16-bit integers in b from packed 16-bit integers in a using saturation, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
_mm_mask_subs_epu8 ⚠ Experimental avx512bw,avx512vl
Subtract packed unsigned 8-bit integers in b from packed unsigned 8-bit integers in a using saturation, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
_mm_mask_subs_epu16 ⚠ Experimental avx512bw,avx512vl
Subtract packed unsigned 16-bit integers in b from packed unsigned 16-bit integers in a using saturation, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
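As a rough illustration of saturating subtraction combined with a writemask, the scalar sketch below models the unsigned and signed 16-bit cases. Both function names are hypothetical helpers, and the model covers a single 128-bit vector of eight elements.

```rust
/// Scalar model of writemasked unsigned saturating subtraction (eight u16 lanes).
/// Hypothetical helper, not part of `core::arch`.
fn mask_subs_epu16_model(src: [u16; 8], k: u8, a: [u16; 8], b: [u16; 8]) -> [u16; 8] {
    let mut dst = src;
    for i in 0..8 {
        if (k >> i) & 1 == 1 {
            // Unsigned saturation clamps at 0 instead of wrapping around.
            dst[i] = a[i].saturating_sub(b[i]);
        }
    }
    dst
}

/// Scalar model of writemasked signed saturating subtraction (eight i16 lanes).
/// Hypothetical helper, not part of `core::arch`.
fn mask_subs_epi16_model(src: [i16; 8], k: u8, a: [i16; 8], b: [i16; 8]) -> [i16; 8] {
    let mut dst = src;
    for i in 0..8 {
        if (k >> i) & 1 == 1 {
            // Signed saturation clamps to the range i16::MIN..=i16::MAX.
            dst[i] = a[i].saturating_sub(b[i]);
        }
    }
    dst
}
```

In the unsigned model, `10 - 20` yields 0; in the signed model, `i16::MIN - 1` stays at `i16::MIN`.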
_mm_mask_test_epi8_mask ⚠ Experimental avx512bw,avx512vl
Compute the bitwise AND of packed 8-bit integers in a and b, producing intermediate 8-bit values, and set the corresponding bit in result mask k (subject to writemask k) if the intermediate value is non-zero.
_mm_mask_test_epi16_mask ⚠ Experimental avx512bw,avx512vl
Compute the bitwise AND of packed 16-bit integers in a and b, producing intermediate 16-bit values, and set the corresponding bit in result mask k (subject to writemask k) if the intermediate value is non-zero.
_mm_mask_testn_epi8_mask ⚠ Experimental avx512bw,avx512vl
Compute the bitwise AND of packed 8-bit integers in a and b, producing intermediate 8-bit values, and set the corresponding bit in result mask k (subject to writemask k) if the intermediate value is zero.
_mm_mask_testn_epi16_mask ⚠ Experimental avx512bw,avx512vl
Compute the bitwise AND of packed 16-bit integers in a and b, producing intermediate 16-bit values, and set the corresponding bit in result mask k (subject to writemask k) if the intermediate value is zero.
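The masked test and testn operations can be sketched in scalar form as follows. This is a minimal model for eight 16-bit elements, assuming the result bit is the per-element check `(a & b) != 0` for test and `(a & b) == 0` for testn, gated by the input mask; the names `k1`, `mask_test_epi16_model` and `mask_testn_epi16_model` are hypothetical.

```rust
/// Scalar model of a masked "test": bit i is set when `k1` has bit i set
/// and `a[i] & b[i]` is non-zero. Hypothetical helper, not part of `core::arch`.
fn mask_test_epi16_model(k1: u8, a: [u16; 8], b: [u16; 8]) -> u8 {
    let mut k = 0u8;
    for i in 0..8 {
        if (k1 >> i) & 1 == 1 && (a[i] & b[i]) != 0 {
            k |= 1 << i;
        }
    }
    k
}

/// Scalar model of a masked "testn": bit i is set when `k1` has bit i set
/// and `a[i] & b[i]` is zero. Hypothetical helper, not part of `core::arch`.
fn mask_testn_epi16_model(k1: u8, a: [u16; 8], b: [u16; 8]) -> u8 {
    let mut k = 0u8;
    for i in 0..8 {
        if (k1 >> i) & 1 == 1 && (a[i] & b[i]) == 0 {
            k |= 1 << i;
        }
    }
    k
}
```

Under this model the two results are complementary within the input mask: their bitwise OR equals `k1` and their bitwise AND is zero.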
_mm_mask_unpackhi_epi8 ⚠ Experimental avx512bw,avx512vl
Unpack and interleave 8-bit integers from the high half of each 128-bit lane in a and b, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
_mm_mask_unpackhi_epi16 ⚠ Experimental avx512bw,avx512vl
Unpack and interleave 16-bit integers from the high half of each 128-bit lane in a and b, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
_mm_mask_unpacklo_epi8 ⚠ Experimental avx512bw,avx512vl
Unpack and interleave 8-bit integers from the low half of each 128-bit lane in a and b, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
_mm_mask_unpacklo_epi16 ⚠ Experimental avx512bw,avx512vl
Unpack and interleave 16-bit integers from the low half of each 128-bit lane in a and b, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
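The interleaving performed by the unpack operations can be illustrated with a scalar model of one 128-bit lane. The helper names below are hypothetical, and the sketch covers only the 16-bit element case with a writemask.

```rust
/// Applies a writemask: element i comes from `val` when bit i of `k` is set,
/// otherwise from `src`. Hypothetical helper, not part of `core::arch`.
fn blend_epi16(src: [i16; 8], k: u8, val: [i16; 8]) -> [i16; 8] {
    let mut dst = src;
    for i in 0..8 {
        if (k >> i) & 1 == 1 {
            dst[i] = val[i];
        }
    }
    dst
}

/// Scalar model of a masked low-half interleave: a0, b0, a1, b1, a2, b2, a3, b3.
fn mask_unpacklo_epi16_model(src: [i16; 8], k: u8, a: [i16; 8], b: [i16; 8]) -> [i16; 8] {
    let mut val = [0i16; 8];
    for i in 0..4 {
        val[2 * i] = a[i];
        val[2 * i + 1] = b[i];
    }
    blend_epi16(src, k, val)
}

/// Scalar model of a masked high-half interleave: a4, b4, a5, b5, a6, b6, a7, b7.
fn mask_unpackhi_epi16_model(src: [i16; 8], k: u8, a: [i16; 8], b: [i16; 8]) -> [i16; 8] {
    let mut val = [0i16; 8];
    for i in 0..4 {
        val[2 * i] = a[4 + i];
        val[2 * i + 1] = b[4 + i];
    }
    blend_epi16(src, k, val)
}
```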
_mm_maskz_abs_epi8 ⚠ Experimental avx512bw,avx512vl
Compute the absolute value of packed signed 8-bit integers in a, and store the unsigned results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm_maskz_abs_epi16 ⚠ Experimental avx512bw,avx512vl
Compute the absolute value of packed signed 16-bit integers in a, and store the unsigned results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
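The phrase "store the unsigned results" matters for the most negative input, whose absolute value does not fit in a signed element. A scalar sketch of the zeromasked 16-bit case, using the hypothetical helper `maskz_abs_epi16_model`:

```rust
/// Scalar model of zeromasked absolute value on eight 16-bit elements.
/// The result is returned as unsigned so that |i16::MIN| = 32768 is representable.
/// Hypothetical helper, not part of `core::arch`.
fn maskz_abs_epi16_model(k: u8, a: [i16; 8]) -> [u16; 8] {
    // Zeromask: elements whose mask bit is clear stay zero.
    let mut dst = [0u16; 8];
    for i in 0..8 {
        if (k >> i) & 1 == 1 {
            dst[i] = a[i].unsigned_abs();
        }
    }
    dst
}
```

Read back as a signed value, the bit pattern for 32768 (0x8000) is again `i16::MIN`, which is why the result is best interpreted as unsigned.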
_mm_maskz_add_epi8 ⚠ Experimental avx512bw,avx512vl
Add packed 8-bit integers in a and b, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm_maskz_add_epi16 ⚠ Experimental avx512bw,avx512vl
Add packed 16-bit integers in a and b, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm_maskz_adds_epi8 ⚠ Experimental avx512bw,avx512vl
Add packed signed 8-bit integers in a and b using saturation, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm_maskz_adds_epi16 ⚠ Experimental avx512bw,avx512vl
Add packed signed 16-bit integers in a and b using saturation, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm_maskz_adds_epu8 ⚠ Experimental avx512bw,avx512vl
Add packed unsigned 8-bit integers in a and b using saturation, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm_maskz_adds_epu16 ⚠ Experimental avx512bw,avx512vl
Add packed unsigned 16-bit integers in a and b using saturation, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm_maskz_alignr_epi8 ⚠ Experimental avx512bw,avx512vl
Concatenate pairs of 16-byte blocks in a and b into a 32-byte temporary result, shift the result right by imm8 bytes, and store the low 16 bytes in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
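The concatenate-and-shift step can be sketched as a scalar model for a single 128-bit vector. The sketch assumes that a forms the upper 16 bytes and b the lower 16 bytes of the 32-byte temporary, and that bytes shifted in from beyond the temporary are zero; `maskz_alignr_epi8_model` is a hypothetical helper that takes `imm8` at run time.

```rust
/// Scalar model of a zeromasked byte-wise align-right on one 128-bit vector.
/// Assumes `a` is the high half and `b` the low half of the 32-byte temporary.
/// Hypothetical helper, not part of `core::arch`.
fn maskz_alignr_epi8_model(k: u16, a: [u8; 16], b: [u8; 16], imm8: u8) -> [u8; 16] {
    let mut concat = [0u8; 32];
    concat[..16].copy_from_slice(&b);
    concat[16..].copy_from_slice(&a);

    let mut dst = [0u8; 16];
    for i in 0..16 {
        // Shifting right by imm8 bytes means byte i reads from position i + imm8.
        let j = i + imm8 as usize;
        if (k >> i) & 1 == 1 && j < 32 {
            dst[i] = concat[j];
        }
    }
    dst
}
```

With `imm8 = 4`, the result holds the top 12 bytes of b followed by the bottom 4 bytes of a.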
_mm_maskz_avg_epu8 ⚠ Experimental avx512bw,avx512vl
Average packed unsigned 8-bit integers in a and b, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm_maskz_avg_epu16 ⚠ Experimental avx512bw,avx512vl
Average packed unsigned 16-bit integers in a and b, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm_maskz_broadcastb_epi8 ⚠ Experimental avx512bw,avx512vl
Broadcast the low packed 8-bit integer from a to all elements of dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm_maskz_broadcastw_epi16 ⚠ Experimental avx512bw,avx512vl
Broadcast the low packed 16-bit integer from a to all elements of dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm_maskz_cvtepi8_epi16 ⚠ Experimental avx512bw,avx512vl
Sign extend packed 8-bit integers in a to packed 16-bit integers, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm_maskz_cvtepi16_epi8 ⚠ Experimental avx512bw,avx512vl
Convert packed 16-bit integers in a to packed 8-bit integers with truncation, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm_maskz_cvtepu8_epi16 ⚠ Experimental avx512bw,avx512vl
Zero extend packed unsigned 8-bit integers in a to packed 16-bit integers, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm_maskz_cvtsepi16_epi8 ⚠ Experimental avx512bw,avx512vl
Convert packed signed 16-bit integers in a to packed 8-bit integers with signed saturation, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm_maskz_cvtusepi16_epi8 ⚠ Experimental avx512bw,avx512vl
Convert packed unsigned 16-bit integers in a to packed unsigned 8-bit integers with unsigned saturation, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm_maskz_dbsad_epu8 ⚠ Experimental avx512bw,avx512vl
Compute the sum of absolute differences (SADs) of quadruplets of unsigned 8-bit integers in a compared to those in b, and store the 16-bit results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set). Four SADs are performed on four 8-bit quadruplets for each 64-bit lane. The first two SADs use the lower 8-bit quadruplet of the lane from a, and the last two SADs use the upper 8-bit quadruplet of the lane from a. Quadruplets from b are selected from within 128-bit lanes according to the control in imm8, and each SAD in each 64-bit lane uses the selected quadruplet at 8-bit offsets.
_mm_maskz_loadu_epi8 ⚠ Experimental avx512bw,avx512vl
Load packed 8-bit integers from memory into dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set). mem_addr does not need to be aligned on any particular boundary.
_mm_maskz_loadu_epi16 ⚠ Experimental avx512bw,avx512vl
Load packed 16-bit integers from memory into dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set). mem_addr does not need to be aligned on any particular boundary.
_mm_maskz_madd_epi16 ⚠ Experimental avx512bw,avx512vl
Multiply packed signed 16-bit integers in a and b, producing intermediate signed 32-bit integers. Horizontally add adjacent pairs of intermediate 32-bit integers, and pack the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
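The multiply-and-add-pairs step can be written out as a scalar model. Note that the zeromask here applies to the four 32-bit results rather than to the eight 16-bit inputs. `maskz_madd_epi16_model` is a hypothetical helper, and the sketch assumes the pairwise sum wraps on the one input combination that overflows 32 bits.

```rust
/// Scalar model of a zeromasked multiply-add of adjacent 16-bit pairs.
/// Eight i16 inputs per operand produce four i32 results.
/// Hypothetical helper, not part of `core::arch`.
fn maskz_madd_epi16_model(k: u8, a: [i16; 8], b: [i16; 8]) -> [i32; 4] {
    let mut dst = [0i32; 4];
    for i in 0..4 {
        if (k >> i) & 1 == 1 {
            // Each product fits in i32; only the sum of two products can overflow.
            let lo = a[2 * i] as i32 * b[2 * i] as i32;
            let hi = a[2 * i + 1] as i32 * b[2 * i + 1] as i32;
            // Assumption: the sum wraps (relevant only when all four inputs are i16::MIN).
            dst[i] = lo.wrapping_add(hi);
        }
    }
    dst
}
```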
_mm_maskz_maddubs_epi16 ⚠ Experimental avx512bw,avx512vl
Multiply packed unsigned 8-bit integers in a by packed signed 8-bit integers in b, producing intermediate signed 16-bit integers. Horizontally add adjacent pairs of intermediate signed 16-bit integers, and pack the saturated results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm_maskz_max_epi8 ⚠ Experimental avx512bw,avx512vl
Compare packed signed 8-bit integers in a and b, and store packed maximum values in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm_maskz_max_epi16 ⚠ Experimental avx512bw,avx512vl
Compare packed signed 16-bit integers in a and b, and store packed maximum values in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm_maskz_max_epu8 ⚠ Experimental avx512bw,avx512vl
Compare packed unsigned 8-bit integers in a and b, and store packed maximum values in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm_maskz_max_epu16 ⚠ Experimental avx512bw,avx512vl
Compare packed unsigned 16-bit integers in a and b, and store packed maximum values in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm_maskz_min_epi8 ⚠ Experimental avx512bw,avx512vl
Compare packed signed 8-bit integers in a and b, and store packed minimum values in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm_maskz_min_epi16 ⚠ Experimental avx512bw,avx512vl
Compare packed signed 16-bit integers in a and b, and store packed minimum values in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm_maskz_min_epu8 ⚠ Experimental avx512bw,avx512vl
Compare packed unsigned 8-bit integers in a and b, and store packed minimum values in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm_maskz_min_epu16 ⚠ Experimental avx512bw,avx512vl
Compare packed unsigned 16-bit integers in a and b, and store packed minimum values in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm_maskz_mov_epi8 ⚠ Experimental avx512bw,avx512vl
Move packed 8-bit integers from a into dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm_maskz_mov_epi16 ⚠ Experimental avx512bw,avx512vl
Move packed 16-bit integers from a into dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm_maskz_mulhi_epi16 ⚠ Experimental avx512bw,avx512vl
Multiply the packed signed 16-bit integers in a and b, producing intermediate 32-bit integers, and store the high 16 bits of the intermediate integers in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm_maskz_mulhi_epu16 ⚠ Experimental avx512bw,avx512vl
Multiply the packed unsigned 16-bit integers in a and b, producing intermediate 32-bit integers, and store the high 16 bits of the intermediate integers in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm_maskz_mulhrs_epi16 ⚠ Experimental avx512bw,avx512vl
Multiply packed signed 16-bit integers in a and b, producing intermediate signed 32-bit integers. Truncate each intermediate integer to the 18 most significant bits, round by adding 1, and store bits [16:1] to dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
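The "truncate to 18 bits, add 1, take bits [16:1]" recipe amounts to a rounded multiply of two Q15 fixed-point values. A scalar sketch, with hypothetical helper names:

```rust
/// Per-element model of the rounded high multiply described above.
/// Hypothetical helper, not part of `core::arch`.
fn mulhrs_element(a: i16, b: i16) -> i16 {
    let product = a as i32 * b as i32;
    // `>> 14` keeps the 18 most significant bits of the 32-bit product,
    // `+ 1` rounds, and the final `>> 1` with truncation selects bits [16:1].
    (((product >> 14) + 1) >> 1) as i16
}

/// Scalar model of the zeromasked form on eight 16-bit elements.
fn maskz_mulhrs_epi16_model(k: u8, a: [i16; 8], b: [i16; 8]) -> [i16; 8] {
    let mut dst = [0i16; 8];
    for i in 0..8 {
        if (k >> i) & 1 == 1 {
            dst[i] = mulhrs_element(a[i], b[i]);
        }
    }
    dst
}
```

Interpreting 0x4000 as 0.5 in Q15, `mulhrs_element(0x4000, 0x4000)` gives 0x2000, that is 0.25. The single overflowing case, `i16::MIN * i16::MIN`, wraps back to `i16::MIN` in this model.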
_mm_maskz_mullo_epi16 ⚠ Experimental avx512bw,avx512vl
Multiply the packed 16-bit integers in a and b, producing intermediate 32-bit integers, and store the low 16 bits of the intermediate integers in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm_maskz_packs_epi16 ⚠ Experimental avx512bw,avx512vl
Convert packed signed 16-bit integers from a and b to packed 8-bit integers using signed saturation, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm_maskz_packs_epi32 ⚠ Experimental avx512bw,avx512vl
Convert packed signed 32-bit integers from a and b to packed 16-bit integers using signed saturation, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm_maskz_packus_epi16 ⚠ Experimental avx512bw,avx512vl
Convert packed signed 16-bit integers from a and b to packed 8-bit integers using unsigned saturation, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm_maskz_packus_epi32 ⚠ Experimental avx512bw,avx512vl
Convert packed signed 32-bit integers from a and b to packed 16-bit integers using unsigned saturation, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
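The difference between signed and unsigned saturation in the pack operations can be seen in a scalar model of the 32-bit to 16-bit case. The sketch assumes one 128-bit vector, with the four results from a placed in the low half of dst and the four from b in the high half; both helper names are hypothetical.

```rust
/// Scalar model of zeromasked packing with signed saturation (i32 -> i16).
/// Hypothetical helper, not part of `core::arch`.
fn maskz_packs_epi32_model(k: u8, a: [i32; 4], b: [i32; 4]) -> [i16; 8] {
    let mut dst = [0i16; 8];
    for i in 0..8 {
        if (k >> i) & 1 == 1 {
            let v = if i < 4 { a[i] } else { b[i - 4] };
            // Signed saturation clamps to -32768..=32767.
            dst[i] = v.clamp(i16::MIN as i32, i16::MAX as i32) as i16;
        }
    }
    dst
}

/// Scalar model of zeromasked packing with unsigned saturation (i32 -> u16).
/// The inputs are still read as signed, so negative values clamp to 0.
fn maskz_packus_epi32_model(k: u8, a: [i32; 4], b: [i32; 4]) -> [u16; 8] {
    let mut dst = [0u16; 8];
    for i in 0..8 {
        if (k >> i) & 1 == 1 {
            let v = if i < 4 { a[i] } else { b[i - 4] };
            // Unsigned saturation clamps to 0..=65535.
            dst[i] = v.clamp(0, u16::MAX as i32) as u16;
        }
    }
    dst
}
```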
_mm_maskz_permutex2var_epi16 ⚠ Experimental avx512bw,avx512vl
Shuffle 16-bit integers in a and b across lanes using the corresponding selector and index in idx, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm_maskz_permutexvar_epi16 ⚠ Experimental avx512bw,avx512vl
Shuffle 16-bit integers in a across lanes using the corresponding index in idx, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm_maskz_set1_epi8 ⚠ Experimental avx512bw,avx512vl
Broadcast 8-bit integer a to all elements of dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm_maskz_set1_epi16 ⚠ Experimental avx512bw,avx512vl
Broadcast 16-bit integer a to all elements of dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm_maskz_shuffle_epi8 ⚠ Experimental avx512bw,avx512vl
Shuffle packed 8-bit integers in a according to shuffle control mask in the corresponding 8-bit element of b, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm_maskz_shufflehi_epi16 ⚠ Experimental avx512bw,avx512vl
Shuffle 16-bit integers in the high 64 bits of 128-bit lanes of a using the control in imm8. Store the results in the high 64 bits of 128-bit lanes of dst, with the low 64 bits of 128-bit lanes being copied from a to dst, using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm_maskz_shufflelo_epi16 ⚠ Experimental avx512bw,avx512vl
Shuffle 16-bit integers in the low 64 bits of 128-bit lanes of a using the control in imm8. Store the results in the low 64 bits of 128-bit lanes of dst, with the high 64 bits of 128-bit lanes being copied from a to dst, using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm_maskz_sll_epi16 ⚠ Experimental avx512bw,avx512vl
Shift packed 16-bit integers in a left by count while shifting in zeros, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm_maskz_slli_epi16 ⚠ Experimental avx512bw,avx512vl
Shift packed 16-bit integers in a left by imm8 while shifting in zeros, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm_maskz_sllv_epi16 ⚠ Experimental avx512bw,avx512vl
Shift packed 16-bit integers in a left by the amount specified by the corresponding element in count while shifting in zeros, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm_maskz_sra_epi16 ⚠ Experimental avx512bw,avx512vl
Shift packed 16-bit integers in a right by count while shifting in sign bits, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm_maskz_srai_epi16 ⚠ Experimental avx512bw,avx512vl
Shift packed 16-bit integers in a right by imm8 while shifting in sign bits, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm_maskz_srav_epi16 ⚠ Experimental avx512bw,avx512vl
Shift packed 16-bit integers in a right by the amount specified by the corresponding element in count while shifting in sign bits, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm_maskz_srl_epi16 ⚠ Experimental avx512bw,avx512vl
Shift packed 16-bit integers in a right by count while shifting in zeros, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm_maskz_srli_epi16 ⚠ Experimental avx512bw,avx512vl
Shift packed 16-bit integers in a right by imm8 while shifting in zeros, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm_maskz_srlv_epi16 ⚠ Experimental avx512bw,avx512vl
Shift packed 16-bit integers in a right by the amount specified by the corresponding element in count while shifting in zeros, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm_maskz_sub_epi8 ⚠ Experimental avx512bw,avx512vl
Subtract packed 8-bit integers in b from packed 8-bit integers in a, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm_maskz_sub_epi16 ⚠ Experimental avx512bw,avx512vl
Subtract packed 16-bit integers in b from packed 16-bit integers in a, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm_maskz_subs_epi8 ⚠ Experimental avx512bw,avx512vl
Subtract packed signed 8-bit integers in b from packed 8-bit integers in a using saturation, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm_maskz_subs_epi16 ⚠ Experimental avx512bw,avx512vl
Subtract packed signed 16-bit integers in b from packed 16-bit integers in a using saturation, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm_maskz_subs_epu8 ⚠ Experimental avx512bw,avx512vl
Subtract packed unsigned 8-bit integers in b from packed unsigned 8-bit integers in a using saturation, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm_maskz_subs_epu16 ⚠ Experimental avx512bw,avx512vl
Subtract packed unsigned 16-bit integers in b from packed unsigned 16-bit integers in a using saturation, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm_maskz_unpackhi_epi8 ⚠ Experimental avx512bw,avx512vl
Unpack and interleave 8-bit integers from the high half of each 128-bit lane in a and b, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm_maskz_unpackhi_epi16 ⚠ Experimental avx512bw,avx512vl
Unpack and interleave 16-bit integers from the high half of each 128-bit lane in a and b, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm_maskz_unpacklo_epi8 ⚠ Experimental avx512bw,avx512vl
Unpack and interleave 8-bit integers from the low half of each 128-bit lane in a and b, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm_maskz_unpacklo_epi16 ⚠ Experimental avx512bw,avx512vl
Unpack and interleave 16-bit integers from the low half of each 128-bit lane in a and b, and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm_movepi8_mask ⚠ Experimental avx512bw,avx512vl
Set each bit of mask register k based on the most significant bit of the corresponding packed 8-bit integer in a.
_mm_movepi16_mask ⚠ Experimental avx512bw,avx512vl
Set each bit of mask register k based on the most significant bit of the corresponding packed 16-bit integer in a.
_mm_movm_epi8 ⚠ Experimental avx512bw,avx512vl
Set each packed 8-bit integer in dst to all ones or all zeros based on the value of the corresponding bit in k.
_mm_movm_epi16 ⚠ Experimental avx512bw,avx512vl
Set each packed 16-bit integer in dst to all ones or all zeros based on the value of the corresponding bit in k.
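The vector-to-mask and mask-to-vector conversions are inverses of each other on the sign bit. A scalar sketch for eight 16-bit elements, with hypothetical helper names:

```rust
/// Scalar model of vector-to-mask: bit i is the most significant (sign) bit of a[i].
/// Hypothetical helper, not part of `core::arch`.
fn movepi16_mask_model(a: [i16; 8]) -> u8 {
    let mut k = 0u8;
    for i in 0..8 {
        if a[i] < 0 {
            k |= 1 << i;
        }
    }
    k
}

/// Scalar model of mask-to-vector: element i is all ones (-1) when bit i is set,
/// otherwise all zeros. Hypothetical helper, not part of `core::arch`.
fn movm_epi16_model(k: u8) -> [i16; 8] {
    let mut dst = [0i16; 8];
    for i in 0..8 {
        if (k >> i) & 1 == 1 {
            dst[i] = -1;
        }
    }
    dst
}
```

Converting any mask to a vector and back returns the original mask, since every all-ones element has its sign bit set.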
_mm_permutex2var_epi16 ⚠ Experimental avx512bw,avx512vl
Shuffle 16-bit integers in a and b across lanes using the corresponding selector and index in idx, and store the results in dst.
_mm_permutexvar_epi16 ⚠ Experimental avx512bw,avx512vl
Shuffle 16-bit integers in a across lanes using the corresponding index in idx, and store the results in dst.
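The "index" and "selector" wording can be made concrete with a scalar model of the 128-bit forms. The sketch assumes that, with eight elements per vector, the low three bits of each idx element are the index, that bit 3 is the selector between a and b in the two-source form, and that higher bits are ignored. The helper names are hypothetical.

```rust
/// Scalar model of a one-source permute: dst[i] = a[idx[i] mod 8].
/// Hypothetical helper, not part of `core::arch`.
fn permutexvar_epi16_model(idx: [u16; 8], a: [i16; 8]) -> [i16; 8] {
    core::array::from_fn(|i| a[(idx[i] & 0b111) as usize])
}

/// Scalar model of a two-source permute: bits [2:0] of idx[i] give the index,
/// bit 3 selects `b` (set) or `a` (clear). Hypothetical helper.
fn permutex2var_epi16_model(a: [i16; 8], idx: [u16; 8], b: [i16; 8]) -> [i16; 8] {
    core::array::from_fn(|i| {
        let off = (idx[i] & 0b111) as usize;
        if (idx[i] & 0b1000) != 0 { b[off] } else { a[off] }
    })
}
```

For example, `idx = [0, 8, 1, 9, 2, 10, 3, 11]` in the two-source model interleaves the low halves of a and b.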
_mm_reduce_add_epi8 ⚠ Experimental avx512bw,avx512vl
Reduce the packed 8-bit integers in a by addition. Returns the sum of all elements in a.
_mm_reduce_add_epi16 ⚠ Experimental avx512bw,avx512vl
Reduce the packed 16-bit integers in a by addition. Returns the sum of all elements in a.
_mm_reduce_and_epi8 ⚠ Experimental avx512bw,avx512vl
Reduce the packed 8-bit integers in a by bitwise AND. Returns the bitwise AND of all elements in a.
_mm_reduce_and_epi16 ⚠ Experimental avx512bw,avx512vl
Reduce the packed 16-bit integers in a by bitwise AND. Returns the bitwise AND of all elements in a.
_mm_reduce_max_epi8 ⚠ Experimental avx512bw,avx512vl
Reduce the packed signed 8-bit integers in a by maximum. Returns the maximum of all elements in a.
_mm_reduce_max_epi16 ⚠ Experimental avx512bw,avx512vl
Reduce the packed signed 16-bit integers in a by maximum. Returns the maximum of all elements in a.
_mm_reduce_max_epu8 ⚠ Experimental avx512bw,avx512vl
Reduce the packed unsigned 8-bit integers in a by maximum. Returns the maximum of all elements in a.
_mm_reduce_max_epu16 ⚠ Experimental avx512bw,avx512vl
Reduce the packed unsigned 16-bit integers in a by maximum. Returns the maximum of all elements in a.
_mm_reduce_min_epi8 ⚠ Experimental avx512bw,avx512vl
Reduce the packed signed 8-bit integers in a by minimum. Returns the minimum of all elements in a.
_mm_reduce_min_epi16 ⚠ Experimental avx512bw,avx512vl
Reduce the packed signed 16-bit integers in a by minimum. Returns the minimum of all elements in a.
_mm_reduce_min_epu8 ⚠ Experimental avx512bw,avx512vl
Reduce the packed unsigned 8-bit integers in a by minimum. Returns the minimum of all elements in a.
_mm_reduce_min_epu16 ⚠ Experimental avx512bw,avx512vl
Reduce the packed unsigned 16-bit integers in a by minimum. Returns the minimum of all elements in a.
_mm_reduce_mul_epi8 ⚠ Experimental avx512bw,avx512vl
Reduce the packed 8-bit integers in a by multiplication. Returns the product of all elements in a.
_mm_reduce_mul_epi16 ⚠ Experimental avx512bw,avx512vl
Reduce the packed 16-bit integers in a by multiplication. Returns the product of all elements in a.
_mm_reduce_or_epi8 ⚠ Experimental avx512bw,avx512vl
Reduce the packed 8-bit integers in a by bitwise OR. Returns the bitwise OR of all elements in a.
_mm_reduce_or_epi16 ⚠ Experimental avx512bw,avx512vl
Reduce the packed 16-bit integers in a by bitwise OR. Returns the bitwise OR of all elements in a.
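The horizontal reductions can be sketched as folds over the elements of one vector. The models below cover the 16-bit forms and use hypothetical helper names. One assumption is labeled explicitly: the sum is taken to wrap at the element width, which the descriptions above do not state.

```rust
/// Sum of all elements. Assumption: arithmetic wraps at 16 bits.
/// Hypothetical helper, not part of `core::arch`.
fn reduce_add_epi16_model(a: [i16; 8]) -> i16 {
    a.iter().fold(0i16, |acc, &x| acc.wrapping_add(x))
}

/// Signed maximum of all elements.
fn reduce_max_epi16_model(a: [i16; 8]) -> i16 {
    a.iter().fold(i16::MIN, |m, &x| m.max(x))
}

/// Unsigned minimum of all elements.
fn reduce_min_epu16_model(a: [u16; 8]) -> u16 {
    a.iter().fold(u16::MAX, |m, &x| m.min(x))
}

/// Bitwise AND of all elements (identity: all ones).
fn reduce_and_epi16_model(a: [i16; 8]) -> i16 {
    a.iter().fold(-1i16, |acc, &x| acc & x)
}

/// Bitwise OR of all elements (identity: zero).
fn reduce_or_epi16_model(a: [i16; 8]) -> i16 {
    a.iter().fold(0i16, |acc, &x| acc | x)
}
```

The signed and unsigned variants differ only in how the same bit patterns are ordered: 0xFFFF is the largest unsigned value but reads as -1 when signed.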
_mm_sllv_epi16 ⚠ Experimental avx512bw,avx512vl
Shift packed 16-bit integers in a left by the amount specified by the corresponding element in count while shifting in zeros, and store the results in dst.
_mm_srav_epi16 ⚠ Experimental avx512bw,avx512vl
Shift packed 16-bit integers in a right by the amount specified by the corresponding element in count while shifting in sign bits, and store the results in dst.
_mm_srlv_epi16 ⚠ Experimental avx512bw,avx512vl
Shift packed 16-bit integers in a right by the amount specified by the corresponding element in count while shifting in zeros, and store the results in dst.
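The per-element variable shifts differ from Rust's built-in shift operators in how they treat counts of 16 or more. The scalar sketch below assumes that such counts produce zero for the logical shifts and a fill of sign bits for the arithmetic shift; the helper names are hypothetical.

```rust
/// Scalar model of a per-element logical left shift; counts >= 16 give 0.
/// Hypothetical helper, not part of `core::arch`.
fn sllv_epi16_model(a: [u16; 8], count: [u16; 8]) -> [u16; 8] {
    core::array::from_fn(|i| if count[i] < 16 { a[i] << count[i] } else { 0 })
}

/// Scalar model of a per-element logical right shift; counts >= 16 give 0.
fn srlv_epi16_model(a: [u16; 8], count: [u16; 8]) -> [u16; 8] {
    core::array::from_fn(|i| if count[i] < 16 { a[i] >> count[i] } else { 0 })
}

/// Scalar model of a per-element arithmetic right shift; counts >= 16 fill
/// the element with its sign bit, which equals shifting by 15.
fn srav_epi16_model(a: [i16; 8], count: [u16; 8]) -> [i16; 8] {
    core::array::from_fn(|i| a[i] >> count[i].min(15))
}
```

The explicit range checks are needed because a plain `a << 16` on a `u16` is an overflow in Rust rather than a well-defined zero.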
_mm_storeu_epi8 ⚠ Experimental avx512bw,avx512vl
Store 128 bits (composed of 16 packed 8-bit integers) from a into memory. mem_addr does not need to be aligned on any particular boundary.
_mm_storeu_epi16 ⚠ Experimental avx512bw,avx512vl
Store 128 bits (composed of 8 packed 16-bit integers) from a into memory. mem_addr does not need to be aligned on any particular boundary.
_mm_test_epi8_mask ⚠ Experimental avx512bw,avx512vl
Compute the bitwise AND of packed 8-bit integers in a and b, producing intermediate 8-bit values, and set the corresponding bit in result mask k if the intermediate value is non-zero.
_mm_test_epi16_mask ⚠ Experimental avx512bw,avx512vl
Compute the bitwise AND of packed 16-bit integers in a and b, producing intermediate 16-bit values, and set the corresponding bit in result mask k if the intermediate value is non-zero.
_mm_testn_epi8_mask ⚠ Experimental avx512bw,avx512vl
Compute the bitwise AND of packed 8-bit integers in a and b, producing intermediate 8-bit values, and set the corresponding bit in result mask k if the intermediate value is zero.
_mm_testn_epi16_mask ⚠ Experimental avx512bw,avx512vl
Compute the bitwise AND of packed 16-bit integers in a and b, producing intermediate 16-bit values, and set the corresponding bit in result mask k if the intermediate value is zero.
_store_mask32 ⚠ Experimental avx512bw
Store 32-bit mask from a into memory.
_store_mask64 ⚠ Experimental avx512bw
Store 64-bit mask from a into memory.