Available on x86 or x86-64 only.
Functions
- expandloadb_128
- expandloadb_256
- expandloadb_512
- expandloadw_128
- expandloadw_256
- expandloadw_512
- vcompressstoreb
- vcompressstoreb128
- vcompressstoreb256
- vcompressstorew
- vcompressstorew128
- vcompressstorew256
- vpcompressb
- vpcompressb128
- vpcompressb256
- vpcompressw
- vpcompressw128
- vpcompressw256
- vpexpandb
- vpexpandb128
- vpexpandb256
- vpexpandw
- vpexpandw128
- vpexpandw256
- vpshldvd
- vpshldvd128
- vpshldvd256
- vpshldvq
- vpshldvq128
- vpshldvq256
- vpshldvw
- vpshldvw128
- vpshldvw256
- vpshrdvd
- vpshrdvd128
- vpshrdvd256
- vpshrdvq
- vpshrdvq128
- vpshrdvq256
- vpshrdvw
- vpshrdvw128
- vpshrdvw256
- _mm256_mask_compress_epi8 Experimental avx512vbmi2,avx512vl
- Contiguously store the active 8-bit integers in a (those with their respective bit set in writemask k) to dst, and pass through the remaining elements from src.
- _mm256_mask_compress_epi16 Experimental avx512vbmi2,avx512vl
- Contiguously store the active 16-bit integers in a (those with their respective bit set in writemask k) to dst, and pass through the remaining elements from src.
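As a rough illustration of the compress entries above, the following scalar Rust sketch models the writemask behavior on a plain array. The name `mask_compress_model`, the `u64` mask type, and the const-generic lane count are assumptions made for illustration; the sketch does not call the intrinsic and is not part of `core::arch`.

```rust
/// Scalar model of a writemask compress over `N` byte lanes.
/// Hypothetical helper, not a `core::arch` function.
/// Assumes `N <= 64` so that every lane has a bit in the `u64` mask.
fn mask_compress_model<const N: usize>(src: [u8; N], k: u64, a: [u8; N]) -> [u8; N] {
    // Start from `src`: lanes above the packed run pass through unchanged.
    let mut dst = src;
    let mut m = 0; // next free lane in the packed run
    for j in 0..N {
        if (k >> j) & 1 == 1 {
            dst[m] = a[j];
            m += 1;
        }
    }
    dst
}
```

With `k = 0b1010_0110` and eight lanes, lanes 1, 2, 5, and 7 of `a` are packed into lanes 0 to 3 of the result, and lanes 4 to 7 keep the values from `src`.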
- _mm256_mask_compressstoreu_epi8 Experimental avx512vbmi2,avx512vl
- Contiguously store the active 8-bit integers in a (those with their respective bit set in writemask k) to unaligned memory at base_addr.
- _mm256_mask_compressstoreu_epi16 Experimental avx512vbmi2,avx512vl
- Contiguously store the active 16-bit integers in a (those with their respective bit set in writemask k) to unaligned memory at base_addr.
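The compress-store entries above differ from the register form in that nothing is written past the packed run. A minimal scalar sketch, assuming a hypothetical helper `mask_compressstore_model` that writes into a slice standing in for `base_addr` and returns the number of elements stored:

```rust
/// Scalar model of a compress-store: active lanes of `a` are written
/// contiguously to the start of `base`; the rest of `base` is left untouched.
/// Hypothetical helper, not a `core::arch` function. Assumes `N <= 64` and
/// that `base` has room for one element per set mask bit.
fn mask_compressstore_model<const N: usize>(base: &mut [u8], k: u64, a: [u8; N]) -> usize {
    let mut m = 0;
    for j in 0..N {
        if (k >> j) & 1 == 1 {
            base[m] = a[j];
            m += 1;
        }
    }
    m // number of elements written, equal to the count of set mask bits
}
```

In this model the return value equals `k.count_ones()` restricted to the low `N` bits, which is the amount a caller would typically advance an output cursor by.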
- _mm256_mask_expand_epi8 Experimental avx512vbmi2,avx512vl
- Load contiguous active 8-bit integers from a (those with their respective bit set in mask k), and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm256_mask_expand_epi16 Experimental avx512vbmi2,avx512vl
- Load contiguous active 16-bit integers from a (those with their respective bit set in mask k), and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
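Expand is the mirror image of compress: consecutive low lanes of `a` are scattered to the lanes selected by the mask. The sketch below is a scalar model under the same assumptions as before (hypothetical name `mask_expand_model`, `u64` mask, const-generic lane count), not the intrinsic itself.

```rust
/// Scalar model of a writemask expand over `N` byte lanes.
/// Hypothetical helper, not a `core::arch` function. Assumes `N <= 64`.
fn mask_expand_model<const N: usize>(src: [u8; N], k: u64, a: [u8; N]) -> [u8; N] {
    // Lanes whose mask bit is clear keep the value from `src`.
    let mut dst = src;
    let mut m = 0; // next unread lane of `a`
    for j in 0..N {
        if (k >> j) & 1 == 1 {
            dst[j] = a[m];
            m += 1;
        }
    }
    dst
}
```

With `k = 0b1010_0110`, lanes 0 to 3 of `a` land in lanes 1, 2, 5, and 7 of the result.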
- _mm256_mask_expandloadu_epi8 Experimental avx512vbmi2,avx512vl
- Load contiguous active 8-bit integers from unaligned memory at mem_addr (those with their respective bit set in mask k), and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm256_mask_expandloadu_epi16 Experimental avx512vbmi2,avx512vl
- Load contiguous active 16-bit integers from unaligned memory at mem_addr (those with their respective bit set in mask k), and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
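The expand-load entries above take their input from memory instead of a register. In the scalar sketch below, a slice stands in for `mem_addr`, and the model reads only as many elements as there are set mask bits. The name `mask_expandload_model` and its signature are assumptions for illustration.

```rust
/// Scalar model of a writemask expand-load over `N` 16-bit lanes.
/// Hypothetical helper, not a `core::arch` function. Assumes `N <= 64` and
/// that `mem` holds at least one element per set mask bit.
fn mask_expandload_model<const N: usize>(src: [u16; N], k: u64, mem: &[u16]) -> [u16; N] {
    let mut dst = src;
    let mut m = 0; // next unread element of `mem`
    for j in 0..N {
        if (k >> j) & 1 == 1 {
            dst[j] = mem[m];
            m += 1;
        }
    }
    dst
}
```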
- _mm256_mask_shldi_epi16 Experimental avx512vbmi2,avx512vl
- Concatenate packed 16-bit integers in a and b producing an intermediate 32-bit result. Shift the result left by imm8 bits, and store the upper 16-bits in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm256_mask_shldi_epi32 Experimental avx512vbmi2,avx512vl
- Concatenate packed 32-bit integers in a and b producing an intermediate 64-bit result. Shift the result left by imm8 bits, and store the upper 32-bits in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm256_mask_shldi_epi64 Experimental avx512vbmi2,avx512vl
- Concatenate packed 64-bit integers in a and b producing an intermediate 128-bit result. Shift the result left by imm8 bits, and store the upper 64-bits in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
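The concatenate-and-shift-left description can be followed lane by lane in scalar code. The sketch below models the 16-bit form. The helper names `shld16` and `mask_shldi_epi16_model` are hypothetical, and reducing the shift count modulo 16 reflects my reading of Intel's pseudocode for this instruction family, so treat it as an assumption.

```rust
/// One lane of a 16-bit concatenate-and-shift-left: `a` forms the upper half
/// of the 32-bit intermediate and `b` the lower half.
/// The count is reduced modulo 16 (assumption, see the text above).
fn shld16(a: u16, b: u16, count: u32) -> u16 {
    let concat = ((a as u32) << 16) | (b as u32);
    ((concat << (count & 15)) >> 16) as u16
}

/// Scalar model of the writemask form over `N` lanes (hypothetical helper).
/// Assumes `N <= 64`.
fn mask_shldi_epi16_model<const N: usize>(
    src: [u16; N],
    k: u64,
    a: [u16; N],
    b: [u16; N],
    imm8: u32,
) -> [u16; N] {
    let mut dst = src; // unselected lanes are copied from `src`
    for j in 0..N {
        if (k >> j) & 1 == 1 {
            dst[j] = shld16(a[j], b[j], imm8);
        }
    }
    dst
}
```

For example, `shld16(0x1234, 0xABCD, 4)` shifts `0x1234ABCD` left by four bits and keeps the upper half, `0x234A`.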
- _mm256_mask_shldv_epi16 Experimental avx512vbmi2,avx512vl
- Concatenate packed 16-bit integers in a and b producing an intermediate 32-bit result. Shift the result left by the amount specified in the corresponding element of c, and store the upper 16-bits in dst using writemask k (elements are copied from a when the corresponding mask bit is not set).
- _mm256_mask_shldv_epi32 Experimental avx512vbmi2,avx512vl
- Concatenate packed 32-bit integers in a and b producing an intermediate 64-bit result. Shift the result left by the amount specified in the corresponding element of c, and store the upper 32-bits in dst using writemask k (elements are copied from a when the corresponding mask bit is not set).
- _mm256_mask_shldv_epi64 Experimental avx512vbmi2,avx512vl
- Concatenate packed 64-bit integers in a and b producing an intermediate 128-bit result. Shift the result left by the amount specified in the corresponding element of c, and store the upper 64-bits in dst using writemask k (elements are copied from a when the corresponding mask bit is not set).
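The variable-count forms differ from the immediate forms in two ways: each lane takes its shift count from the corresponding lane of `c`, and unselected lanes are copied from `a` because there is no separate `src` operand. A scalar sketch of the 32-bit case follows. The helper names are hypothetical and the modulo-32 count is an assumption based on Intel's pseudocode.

```rust
/// One lane of a 32-bit concatenate-and-shift-left.
/// The count is reduced modulo 32 (assumption, see the text above).
fn shld32(a: u32, b: u32, count: u32) -> u32 {
    let concat = ((a as u64) << 32) | (b as u64);
    ((concat << (count & 31)) >> 32) as u32
}

/// Scalar model of the writemask variable-count form (hypothetical helper).
/// Assumes `N <= 64`.
fn mask_shldv_epi32_model<const N: usize>(
    a: [u32; N],
    k: u64,
    b: [u32; N],
    c: [u32; N],
) -> [u32; N] {
    let mut dst = a; // unselected lanes are copied from `a`
    for j in 0..N {
        if (k >> j) & 1 == 1 {
            dst[j] = shld32(a[j], b[j], c[j]);
        }
    }
    dst
}
```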
- _mm256_mask_shrdi_epi16 Experimental avx512vbmi2,avx512vl
- Concatenate packed 16-bit integers in b and a producing an intermediate 32-bit result. Shift the result right by imm8 bits, and store the lower 16-bits in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm256_mask_shrdi_epi32 Experimental avx512vbmi2,avx512vl
- Concatenate packed 32-bit integers in b and a producing an intermediate 64-bit result. Shift the result right by imm8 bits, and store the lower 32-bits in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm256_mask_shrdi_epi64 Experimental avx512vbmi2,avx512vl
- Concatenate packed 64-bit integers in b and a producing an intermediate 128-bit result. Shift the result right by imm8 bits, and store the lower 64-bits in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
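For the right-shift forms the operand order in the concatenation is reversed: `b` supplies the upper half and `a` the lower half, and the lower half of the shifted value is kept. A scalar sketch of the 64-bit case, using `u128` for the intermediate; the helper names are hypothetical and the modulo-64 count is an assumption based on Intel's pseudocode.

```rust
/// One lane of a 64-bit concatenate-and-shift-right: `b` is the upper half
/// of the 128-bit intermediate and `a` the lower half.
/// The count is reduced modulo 64 (assumption, see the text above).
fn shrd64(a: u64, b: u64, count: u32) -> u64 {
    let concat = ((b as u128) << 64) | (a as u128);
    (concat >> (count & 63)) as u64
}

/// Scalar model of the writemask form over `N` lanes (hypothetical helper).
/// Assumes `N <= 64`.
fn mask_shrdi_epi64_model<const N: usize>(
    src: [u64; N],
    k: u64,
    a: [u64; N],
    b: [u64; N],
    imm8: u32,
) -> [u64; N] {
    let mut dst = src; // unselected lanes are copied from `src`
    for j in 0..N {
        if (k >> j) & 1 == 1 {
            dst[j] = shrd64(a[j], b[j], imm8);
        }
    }
    dst
}
```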
- _mm256_mask_shrdv_epi16 Experimental avx512vbmi2,avx512vl
- Concatenate packed 16-bit integers in b and a producing an intermediate 32-bit result. Shift the result right by the amount specified in the corresponding element of c, and store the lower 16-bits in dst using writemask k (elements are copied from a when the corresponding mask bit is not set).
- _mm256_mask_shrdv_epi32 Experimental avx512vbmi2,avx512vl
- Concatenate packed 32-bit integers in b and a producing an intermediate 64-bit result. Shift the result right by the amount specified in the corresponding element of c, and store the lower 32-bits in dst using writemask k (elements are copied from a when the corresponding mask bit is not set).
- _mm256_mask_shrdv_epi64 Experimental avx512vbmi2,avx512vl
- Concatenate packed 64-bit integers in b and a producing an intermediate 128-bit result. Shift the result right by the amount specified in the corresponding element of c, and store the lower 64-bits in dst using writemask k (elements are copied from a when the corresponding mask bit is not set).
- _mm256_maskz_compress_epi8 Experimental avx512vbmi2,avx512vl
- Contiguously store the active 8-bit integers in a (those with their respective bit set in zeromask k) to dst, and set the remaining elements to zero.
- _mm256_maskz_compress_epi16 Experimental avx512vbmi2,avx512vl
- Contiguously store the active 16-bit integers in a (those with their respective bit set in zeromask k) to dst, and set the remaining elements to zero.
- _mm256_maskz_expand_epi8 Experimental avx512vbmi2,avx512vl
- Load contiguous active 8-bit integers from a (those with their respective bit set in mask k), and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm256_maskz_expand_epi16 Experimental avx512vbmi2,avx512vl
- Load contiguous active 16-bit integers from a (those with their respective bit set in mask k), and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
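The zeromask compress and expand forms can be read as a pair: compressing with mask `k` and then expanding the packed result with the same `k` puts each active element back in its original lane and zeroes the rest. The scalar sketch below shows this round trip. Both helper names are hypothetical and neither calls the intrinsics.

```rust
/// Scalar model of a zeromask compress (hypothetical helper). Assumes `N <= 64`.
fn maskz_compress_model<const N: usize>(k: u64, a: [u8; N]) -> [u8; N] {
    let mut dst = [0u8; N]; // lanes above the packed run are zero
    let mut m = 0;
    for j in 0..N {
        if (k >> j) & 1 == 1 {
            dst[m] = a[j];
            m += 1;
        }
    }
    dst
}

/// Scalar model of a zeromask expand (hypothetical helper). Assumes `N <= 64`.
fn maskz_expand_model<const N: usize>(k: u64, a: [u8; N]) -> [u8; N] {
    let mut dst = [0u8; N]; // lanes with a clear mask bit are zero
    let mut m = 0;
    for j in 0..N {
        if (k >> j) & 1 == 1 {
            dst[j] = a[m];
            m += 1;
        }
    }
    dst
}
```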
- _mm256_maskz_expandloadu_epi8 Experimental avx512vbmi2,avx512vl
- Load contiguous active 8-bit integers from unaligned memory at mem_addr (those with their respective bit set in mask k), and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm256_maskz_expandloadu_epi16 Experimental avx512vbmi2,avx512vl
- Load contiguous active 16-bit integers from unaligned memory at mem_addr (those with their respective bit set in mask k), and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm256_maskz_shldi_epi16 Experimental avx512vbmi2,avx512vl
- Concatenate packed 16-bit integers in a and b producing an intermediate 32-bit result. Shift the result left by imm8 bits, and store the upper 16-bits in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm256_maskz_shldi_epi32 Experimental avx512vbmi2,avx512vl
- Concatenate packed 32-bit integers in a and b producing an intermediate 64-bit result. Shift the result left by imm8 bits, and store the upper 32-bits in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm256_maskz_shldi_epi64 Experimental avx512vbmi2,avx512vl
- Concatenate packed 64-bit integers in a and b producing an intermediate 128-bit result. Shift the result left by imm8 bits, and store the upper 64-bits in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm256_maskz_shldv_epi16 Experimental avx512vbmi2,avx512vl
- Concatenate packed 16-bit integers in a and b producing an intermediate 32-bit result. Shift the result left by the amount specified in the corresponding element of c, and store the upper 16-bits in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm256_maskz_shldv_epi32 Experimental avx512vbmi2,avx512vl
- Concatenate packed 32-bit integers in a and b producing an intermediate 64-bit result. Shift the result left by the amount specified in the corresponding element of c, and store the upper 32-bits in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm256_maskz_shldv_epi64 Experimental avx512vbmi2,avx512vl
- Concatenate packed 64-bit integers in a and b producing an intermediate 128-bit result. Shift the result left by the amount specified in the corresponding element of c, and store the upper 64-bits in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm256_maskz_shrdi_epi16 Experimental avx512vbmi2,avx512vl
- Concatenate packed 16-bit integers in b and a producing an intermediate 32-bit result. Shift the result right by imm8 bits, and store the lower 16-bits in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm256_maskz_shrdi_epi32 Experimental avx512vbmi2,avx512vl
- Concatenate packed 32-bit integers in b and a producing an intermediate 64-bit result. Shift the result right by imm8 bits, and store the lower 32-bits in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm256_maskz_shrdi_epi64 Experimental avx512vbmi2,avx512vl
- Concatenate packed 64-bit integers in b and a producing an intermediate 128-bit result. Shift the result right by imm8 bits, and store the lower 64-bits in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm256_maskz_shrdv_epi16 Experimental avx512vbmi2,avx512vl
- Concatenate packed 16-bit integers in b and a producing an intermediate 32-bit result. Shift the result right by the amount specified in the corresponding element of c, and store the lower 16-bits in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm256_maskz_shrdv_epi32 Experimental avx512vbmi2,avx512vl
- Concatenate packed 32-bit integers in b and a producing an intermediate 64-bit result. Shift the result right by the amount specified in the corresponding element of c, and store the lower 32-bits in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm256_maskz_shrdv_epi64 Experimental avx512vbmi2,avx512vl
- Concatenate packed 64-bit integers in b and a producing an intermediate 128-bit result. Shift the result right by the amount specified in the corresponding element of c, and store the lower 64-bits in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm256_shldi_epi16 Experimental avx512vbmi2,avx512vl
- Concatenate packed 16-bit integers in a and b producing an intermediate 32-bit result. Shift the result left by imm8 bits, and store the upper 16-bits in dst.
- _mm256_shldi_epi32 Experimental avx512vbmi2,avx512vl
- Concatenate packed 32-bit integers in a and b producing an intermediate 64-bit result. Shift the result left by imm8 bits, and store the upper 32-bits in dst.
- _mm256_shldi_epi64 Experimental avx512vbmi2,avx512vl
- Concatenate packed 64-bit integers in a and b producing an intermediate 128-bit result. Shift the result left by imm8 bits, and store the upper 64-bits in dst.
- _mm256_shldv_epi16 Experimental avx512vbmi2,avx512vl
- Concatenate packed 16-bit integers in a and b producing an intermediate 32-bit result. Shift the result left by the amount specified in the corresponding element of c, and store the upper 16-bits in dst.
- _mm256_shldv_epi32 Experimental avx512vbmi2,avx512vl
- Concatenate packed 32-bit integers in a and b producing an intermediate 64-bit result. Shift the result left by the amount specified in the corresponding element of c, and store the upper 32-bits in dst.
- _mm256_shldv_epi64 Experimental avx512vbmi2,avx512vl
- Concatenate packed 64-bit integers in a and b producing an intermediate 128-bit result. Shift the result left by the amount specified in the corresponding element of c, and store the upper 64-bits in dst.
- _mm256_shrdi_epi16 Experimental avx512vbmi2,avx512vl
- Concatenate packed 16-bit integers in b and a producing an intermediate 32-bit result. Shift the result right by imm8 bits, and store the lower 16-bits in dst.
- _mm256_shrdi_epi32 Experimental avx512vbmi2,avx512vl
- Concatenate packed 32-bit integers in b and a producing an intermediate 64-bit result. Shift the result right by imm8 bits, and store the lower 32-bits in dst.
- _mm256_shrdi_epi64 Experimental avx512vbmi2,avx512vl
- Concatenate packed 64-bit integers in b and a producing an intermediate 128-bit result. Shift the result right by imm8 bits, and store the lower 64-bits in dst.
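One common use of a concatenate-and-shift-right is shifting a multi-word integer, where each output word needs bits from its higher neighbor. The scalar sketch below applies the per-lane operation described above to an array of little-endian 64-bit limbs. The names `shrd64` and `shr_limbs` are hypothetical, the count is assumed to be below 64, and this is an illustration of the idea and not a vectorized implementation.

```rust
/// One lane of a 64-bit concatenate-and-shift-right: `b` is the upper half
/// of the 128-bit intermediate and `a` the lower half (hypothetical helper).
fn shrd64(a: u64, b: u64, count: u32) -> u64 {
    let concat = ((b as u128) << 64) | (a as u128);
    (concat >> (count & 63)) as u64
}

/// Shift a little-endian multi-limb integer right by `count` bits
/// (assumes `count < 64`). Each output limb combines the current limb
/// with its higher neighbor; the top limb is paired with zero.
fn shr_limbs<const N: usize>(limbs: [u64; N], count: u32) -> [u64; N] {
    let mut out = [0u64; N];
    for i in 0..N {
        let hi = if i + 1 < N { limbs[i + 1] } else { 0 };
        out[i] = shrd64(limbs[i], hi, count);
    }
    out
}
```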
- _mm256_shrdv_epi16 Experimental avx512vbmi2,avx512vl
- Concatenate packed 16-bit integers in b and a producing an intermediate 32-bit result. Shift the result right by the amount specified in the corresponding element of c, and store the lower 16-bits in dst.
- _mm256_shrdv_epi32 Experimental avx512vbmi2,avx512vl
- Concatenate packed 32-bit integers in b and a producing an intermediate 64-bit result. Shift the result right by the amount specified in the corresponding element of c, and store the lower 32-bits in dst.
- _mm256_shrdv_epi64 Experimental avx512vbmi2,avx512vl
- Concatenate packed 64-bit integers in b and a producing an intermediate 128-bit result. Shift the result right by the amount specified in the corresponding element of c, and store the lower 64-bits in dst.
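A useful special case of the unmasked shift forms above: when both inputs are the same value, the concatenate-and-shift reduces to a rotate. The scalar sketch below checks this for the 32-bit case against Rust's `rotate_left` and `rotate_right`. The helper names are hypothetical and the modulo-32 count is an assumption based on Intel's pseudocode.

```rust
/// One lane of a 32-bit concatenate-and-shift-left (`a` upper, `b` lower).
fn shld32(a: u32, b: u32, count: u32) -> u32 {
    let concat = ((a as u64) << 32) | (b as u64);
    ((concat << (count & 31)) >> 32) as u32
}

/// One lane of a 32-bit concatenate-and-shift-right (`b` upper, `a` lower).
fn shrd32(a: u32, b: u32, count: u32) -> u32 {
    let concat = ((b as u64) << 32) | (a as u64);
    (concat >> (count & 31)) as u32
}

/// Returns true when feeding the same value as both inputs matches the
/// standard library rotates for every count from 0 to 31.
fn funnel_shift_is_rotate(x: u32) -> bool {
    (0..32).all(|n| shld32(x, x, n) == x.rotate_left(n) && shrd32(x, x, n) == x.rotate_right(n))
}
```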
- _mm512_mask_compress_epi8 Experimental avx512vbmi2
- Contiguously store the active 8-bit integers in a (those with their respective bit set in writemask k) to dst, and pass through the remaining elements from src.
- _mm512_mask_compress_epi16 Experimental avx512vbmi2
- Contiguously store the active 16-bit integers in a (those with their respective bit set in writemask k) to dst, and pass through the remaining elements from src.
- _mm512_mask_compressstoreu_epi8 Experimental avx512vbmi2
- Contiguously store the active 8-bit integers in a (those with their respective bit set in writemask k) to unaligned memory at base_addr.
- _mm512_mask_compressstoreu_epi16 Experimental avx512vbmi2
- Contiguously store the active 16-bit integers in a (those with their respective bit set in writemask k) to unaligned memory at base_addr.
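A typical pattern built on compress-store is stream filtering: build a mask from a predicate, store the selected elements contiguously, and advance the output position by the number of set mask bits. The scalar sketch below walks the input in 64-byte chunks to mirror the lane count of a 512-bit vector of 8-bit integers. The function `filter_bytes` is a hypothetical illustration written in plain Rust; it does not use the intrinsics.

```rust
/// Scalar sketch of a compress-store filter loop (hypothetical helper).
/// Each 64-byte chunk plays the role of one 512-bit vector.
fn filter_bytes(input: &[u8], keep: impl Fn(u8) -> bool) -> Vec<u8> {
    let mut out = vec![0u8; input.len()];
    let mut written = 0;
    for chunk in input.chunks(64) {
        // Build the mask: bit j is set when lane j passes the predicate.
        let mut k: u64 = 0;
        for (j, &byte) in chunk.iter().enumerate() {
            if keep(byte) {
                k |= 1u64 << j;
            }
        }
        // Model of the compress-store: active lanes go out contiguously.
        for (j, &byte) in chunk.iter().enumerate() {
            if (k >> j) & 1 == 1 {
                out[written] = byte;
                written += 1;
            }
        }
    }
    out.truncate(written);
    out
}
```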
- _mm512_mask_expand_epi8 Experimental avx512vbmi2
- Load contiguous active 8-bit integers from a (those with their respective bit set in mask k), and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm512_mask_expand_epi16 Experimental avx512vbmi2
- Load contiguous active 16-bit integers from a (those with their respective bit set in mask k), and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm512_mask_expandloadu_epi8 Experimental avx512vbmi2
- Load contiguous active 8-bit integers from unaligned memory at mem_addr (those with their respective bit set in mask k), and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm512_mask_expandloadu_epi16 Experimental avx512vbmi2
- Load contiguous active 16-bit integers from unaligned memory at mem_addr (those with their respective bit set in mask k), and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm512_mask_shldi_epi16 Experimental avx512vbmi2
- Concatenate packed 16-bit integers in a and b producing an intermediate 32-bit result. Shift the result left by imm8 bits, and store the upper 16-bits in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm512_mask_shldi_epi32 Experimental avx512vbmi2
- Concatenate packed 32-bit integers in a and b producing an intermediate 64-bit result. Shift the result left by imm8 bits, and store the upper 32-bits in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm512_mask_shldi_epi64 Experimental avx512vbmi2
- Concatenate packed 64-bit integers in a and b producing an intermediate 128-bit result. Shift the result left by imm8 bits, and store the upper 64-bits in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm512_mask_shldv_epi16 Experimental avx512vbmi2
- Concatenate packed 16-bit integers in a and b producing an intermediate 32-bit result. Shift the result left by the amount specified in the corresponding element of c, and store the upper 16-bits in dst using writemask k (elements are copied from a when the corresponding mask bit is not set).
- _mm512_mask_shldv_epi32 Experimental avx512vbmi2
- Concatenate packed 32-bit integers in a and b producing an intermediate 64-bit result. Shift the result left by the amount specified in the corresponding element of c, and store the upper 32-bits in dst using writemask k (elements are copied from a when the corresponding mask bit is not set).
- _mm512_mask_shldv_epi64 Experimental avx512vbmi2
- Concatenate packed 64-bit integers in a and b producing an intermediate 128-bit result. Shift the result left by the amount specified in the corresponding element of c, and store the upper 64-bits in dst using writemask k (elements are copied from a when the corresponding mask bit is not set).
- _mm512_mask_shrdi_epi16 Experimental avx512vbmi2
- Concatenate packed 16-bit integers in b and a producing an intermediate 32-bit result. Shift the result right by imm8 bits, and store the lower 16-bits in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm512_mask_shrdi_epi32 Experimental avx512vbmi2
- Concatenate packed 32-bit integers in b and a producing an intermediate 64-bit result. Shift the result right by imm8 bits, and store the lower 32-bits in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm512_mask_shrdi_epi64 Experimental avx512vbmi2
- Concatenate packed 64-bit integers in b and a producing an intermediate 128-bit result. Shift the result right by imm8 bits, and store the lower 64-bits in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm512_mask_shrdv_epi16 Experimental avx512vbmi2
- Concatenate packed 16-bit integers in b and a producing an intermediate 32-bit result. Shift the result right by the amount specified in the corresponding element of c, and store the lower 16-bits in dst using writemask k (elements are copied from a when the corresponding mask bit is not set).
- _mm512_mask_shrdv_epi32 Experimental avx512vbmi2
- Concatenate packed 32-bit integers in b and a producing an intermediate 64-bit result. Shift the result right by the amount specified in the corresponding element of c, and store the lower 32-bits in dst using writemask k (elements are copied from a when the corresponding mask bit is not set).
- _mm512_mask_shrdv_epi64 Experimental avx512vbmi2
- Concatenate packed 64-bit integers in b and a producing an intermediate 128-bit result. Shift the result right by the amount specified in the corresponding element of c, and store the lower 64-bits in dst using writemask k (elements are copied from a when the corresponding mask bit is not set).
- _mm512_maskz_compress_epi8 Experimental avx512vbmi2
- Contiguously store the active 8-bit integers in a (those with their respective bit set in zeromask k) to dst, and set the remaining elements to zero.
- _mm512_maskz_compress_epi16 Experimental avx512vbmi2
- Contiguously store the active 16-bit integers in a (those with their respective bit set in zeromask k) to dst, and set the remaining elements to zero.
- _mm512_maskz_expand_epi8 Experimental avx512vbmi2
- Load contiguous active 8-bit integers from a (those with their respective bit set in mask k), and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm512_maskz_expand_epi16 Experimental avx512vbmi2
- Load contiguous active 16-bit integers from a (those with their respective bit set in mask k), and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm512_maskz_expandloadu_epi8 Experimental avx512vbmi2
- Load contiguous active 8-bit integers from unaligned memory at mem_addr (those with their respective bit set in mask k), and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm512_maskz_expandloadu_epi16 Experimental avx512vbmi2
- Load contiguous active 16-bit integers from unaligned memory at mem_addr (those with their respective bit set in mask k), and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm512_maskz_shldi_epi16 Experimental avx512vbmi2
- Concatenate packed 16-bit integers in a and b producing an intermediate 32-bit result. Shift the result left by imm8 bits, and store the upper 16-bits in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm512_maskz_shldi_epi32 Experimental avx512vbmi2
- Concatenate packed 32-bit integers in a and b producing an intermediate 64-bit result. Shift the result left by imm8 bits, and store the upper 32-bits in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm512_maskz_shldi_epi64 Experimental avx512vbmi2
- Concatenate packed 64-bit integers in a and b producing an intermediate 128-bit result. Shift the result left by imm8 bits, and store the upper 64-bits in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm512_maskz_shldv_epi16 Experimental avx512vbmi2
- Concatenate packed 16-bit integers in a and b producing an intermediate 32-bit result. Shift the result left by the amount specified in the corresponding element of c, and store the upper 16-bits in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm512_maskz_shldv_epi32 Experimental avx512vbmi2
- Concatenate packed 32-bit integers in a and b producing an intermediate 64-bit result. Shift the result left by the amount specified in the corresponding element of c, and store the upper 32-bits in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm512_maskz_shldv_epi64 Experimental avx512vbmi2
- Concatenate packed 64-bit integers in a and b producing an intermediate 128-bit result. Shift the result left by the amount specified in the corresponding element of c, and store the upper 64-bits in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm512_maskz_shrdi_epi16 Experimental avx512vbmi2
- Concatenate packed 16-bit integers in b and a producing an intermediate 32-bit result. Shift the result right by imm8 bits, and store the lower 16-bits in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm512_maskz_shrdi_epi32 Experimental avx512vbmi2
- Concatenate packed 32-bit integers in b and a producing an intermediate 64-bit result. Shift the result right by imm8 bits, and store the lower 32-bits in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm512_maskz_shrdi_epi64 Experimental avx512vbmi2
- Concatenate packed 64-bit integers in b and a producing an intermediate 128-bit result. Shift the result right by imm8 bits, and store the lower 64-bits in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm512_maskz_shrdv_epi16 Experimental avx512vbmi2
- Concatenate packed 16-bit integers in b and a producing an intermediate 32-bit result. Shift the result right by the amount specified in the corresponding element of c, and store the lower 16-bits in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm512_maskz_shrdv_epi32 Experimental avx512vbmi2
- Concatenate packed 32-bit integers in b and a producing an intermediate 64-bit result. Shift the result right by the amount specified in the corresponding element of c, and store the lower 32-bits in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm512_maskz_shrdv_epi64 Experimental avx512vbmi2
- Concatenate packed 64-bit integers in b and a producing an intermediate 128-bit result. Shift the result right by the amount specified in the corresponding element of c, and store the lower 64-bits in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm512_shldi_epi16 Experimental avx512vbmi2
- Concatenate packed 16-bit integers in a and b producing an intermediate 32-bit result. Shift the result left by imm8 bits, and store the upper 16-bits in dst.
- _mm512_shldi_epi32 Experimental avx512vbmi2
- Concatenate packed 32-bit integers in a and b producing an intermediate 64-bit result. Shift the result left by imm8 bits, and store the upper 32-bits in dst.
- _mm512_shldi_epi64 Experimental avx512vbmi2
- Concatenate packed 64-bit integers in a and b producing an intermediate 128-bit result. Shift the result left by imm8 bits, and store the upper 64-bits in dst.
- _mm512_shldv_epi16 Experimental avx512vbmi2
- Concatenate packed 16-bit integers in a and b producing an intermediate 32-bit result. Shift the result left by the amount specified in the corresponding element of c, and store the upper 16-bits in dst.
- _mm512_shldv_epi32 Experimental avx512vbmi2
- Concatenate packed 32-bit integers in a and b producing an intermediate 64-bit result. Shift the result left by the amount specified in the corresponding element of c, and store the upper 32-bits in dst.
- _mm512_shldv_epi64 Experimental avx512vbmi2
- Concatenate packed 64-bit integers in a and b producing an intermediate 128-bit result. Shift the result left by the amount specified in the corresponding element of c, and store the upper 64-bits in dst.
- _mm512_shrdi_epi16 Experimental avx512vbmi2
- Concatenate packed 16-bit integers in b and a producing an intermediate 32-bit result. Shift the result right by imm8 bits, and store the lower 16-bits in dst.
- _mm512_shrdi_epi32 Experimental avx512vbmi2
- Concatenate packed 32-bit integers in b and a producing an intermediate 64-bit result. Shift the result right by imm8 bits, and store the lower 32-bits in dst.
- _mm512_shrdi_epi64 Experimental avx512vbmi2
- Concatenate packed 64-bit integers in b and a producing an intermediate 128-bit result. Shift the result right by imm8 bits, and store the lower 64-bits in dst.
- _mm512_shrdv_epi16 Experimental avx512vbmi2
- Concatenate packed 16-bit integers in b and a producing an intermediate 32-bit result. Shift the result right by the amount specified in the corresponding element of c, and store the lower 16-bits in dst.
- _mm512_shrdv_epi32 Experimental avx512vbmi2
- Concatenate packed 32-bit integers in b and a producing an intermediate 64-bit result. Shift the result right by the amount specified in the corresponding element of c, and store the lower 32-bits in dst.
- _mm512_shrdv_epi64 Experimental avx512vbmi2
- Concatenate packed 64-bit integers in b and a producing an intermediate 128-bit result. Shift the result right by the amount specified in the corresponding element of c, and store the lower 64-bits in dst.
- _mm_mask_compress_epi8 Experimental avx512vbmi2,avx512vl
- Contiguously store the active 8-bit integers in a (those with their respective bit set in writemask k) to dst, and pass through the remaining elements from src.
- _mm_mask_compress_epi16 Experimental avx512vbmi2,avx512vl
- Contiguously store the active 16-bit integers in a (those with their respective bit set in writemask k) to dst, and pass through the remaining elements from src.
- _mm_mask_compressstoreu_epi8 Experimental avx512vbmi2,avx512vl
- Contiguously store the active 8-bit integers in a (those with their respective bit set in writemask k) to unaligned memory at base_addr.
- _mm_mask_compressstoreu_epi16 Experimental avx512vbmi2,avx512vl
- Contiguously store the active 16-bit integers in a (those with their respective bit set in writemask k) to unaligned memory at base_addr.
- _mm_mask_expand_epi8 Experimental avx512vbmi2,avx512vl
- Load contiguous active 8-bit integers from a (those with their respective bit set in mask k), and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm_mask_expand_epi16 Experimental avx512vbmi2,avx512vl
- Load contiguous active 16-bit integers from a (those with their respective bit set in mask k), and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm_mask_expandloadu_epi8 Experimental avx512vbmi2,avx512vl
- Load contiguous active 8-bit integers from unaligned memory at mem_addr (those with their respective bit set in mask k), and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm_mask_expandloadu_epi16 Experimental avx512vbmi2,avx512vl
- Load contiguous active 16-bit integers from unaligned memory at mem_addr (those with their respective bit set in mask k), and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm_mask_shldi_epi16 Experimental avx512vbmi2,avx512vl
- Concatenate packed 16-bit integers in a and b producing an intermediate 32-bit result. Shift the result left by imm8 bits, and store the upper 16-bits in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm_mask_shldi_epi32 Experimental avx512vbmi2,avx512vl
- Concatenate packed 32-bit integers in a and b producing an intermediate 64-bit result. Shift the result left by imm8 bits, and store the upper 32-bits in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm_mask_shldi_epi64 Experimental avx512vbmi2,avx512vl
- Concatenate packed 64-bit integers in a and b producing an intermediate 128-bit result. Shift the result left by imm8 bits, and store the upper 64-bits in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
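The concatenate-and-shift-left operation can be illustrated with a scalar model for 16-bit lanes. The helper names are hypothetical, and the sketch assumes (following Intel's published pseudocode) that only the low 4 bits of imm8 are used as the shift count for 16-bit elements.

```rust
/// Scalar model of one 16-bit lane of a left funnel shift.
/// Hypothetical helper, not part of `core::arch`.
fn shldi_u16(a: u16, b: u16, imm8: u32) -> u16 {
    // `a` forms the upper half and `b` the lower half of the 32-bit value.
    let concat = ((a as u32) << 16) | b as u32;
    // Assumption: the count is taken modulo the lane width (16).
    ((concat << (imm8 & 15)) >> 16) as u16
}

/// Writemask form over 8 lanes: inactive lanes are copied from `src`.
fn mask_shldi_u16(src: [u16; 8], k: u8, a: [u16; 8], b: [u16; 8], imm8: u32) -> [u16; 8] {
    let mut dst = src;
    for lane in 0..8 {
        if (k >> lane) & 1 == 1 {
            dst[lane] = shldi_u16(a[lane], b[lane], imm8);
        }
    }
    dst
}
```

For example, `shldi_u16(0x1234, 0xABCD, 4)` yields `0x234A`: the top nibble of `a` is shifted out and the top nibble of `b` is shifted in.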
- _mm_mask_shldv_epi16 Experimental avx512vbmi2,avx512vl
- Concatenate packed 16-bit integers in a and b producing an intermediate 32-bit result. Shift the result left by the amount specified in the corresponding element of c, and store the upper 16-bits in dst using writemask k (elements are copied from a when the corresponding mask bit is not set).
- _mm_mask_shldv_epi32 Experimental avx512vbmi2,avx512vl
- Concatenate packed 32-bit integers in a and b producing an intermediate 64-bit result. Shift the result left by the amount specified in the corresponding element of c, and store the upper 32-bits in dst using writemask k (elements are copied from a when the corresponding mask bit is not set).
- _mm_mask_shldv_epi64 Experimental avx512vbmi2,avx512vl
- Concatenate packed 64-bit integers in a and b producing an intermediate 128-bit result. Shift the result left by the amount specified in the corresponding element of c, and store the upper 64-bits in dst using writemask k (elements are copied from a when the corresponding mask bit is not set).
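The variable-count forms differ from the immediate forms in two ways: each lane takes its own count from c, and inactive lanes fall back to a rather than to a separate src. A scalar sketch for 32-bit lanes follows, with hypothetical helper names and the assumption that the count is reduced modulo 32.

```rust
/// Scalar model of one 32-bit lane of a variable left funnel shift.
/// Hypothetical helper, not part of `core::arch`.
fn shldv_u32(a: u32, b: u32, c: u32) -> u32 {
    let concat = ((a as u64) << 32) | b as u64;
    // Assumption: only the low 5 bits of the count are used.
    ((concat << (c & 31)) >> 32) as u32
}

/// Writemask form over 4 lanes: inactive lanes are copied from `a`.
fn mask_shldv_u32(a: [u32; 4], k: u8, b: [u32; 4], c: [u32; 4]) -> [u32; 4] {
    let mut dst = a;
    for lane in 0..4 {
        if (k >> lane) & 1 == 1 {
            dst[lane] = shldv_u32(a[lane], b[lane], c[lane]);
        }
    }
    dst
}
```

One consequence of the definition: when both inputs are the same value, the funnel shift is a rotate, so `shldv_u32(x, x, n)` equals `x.rotate_left(n & 31)` in this model.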
- _mm_mask_shrdi_epi16 Experimental avx512vbmi2,avx512vl
- Concatenate packed 16-bit integers in b and a producing an intermediate 32-bit result. Shift the result right by imm8 bits, and store the lower 16-bits in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm_mask_shrdi_epi32 Experimental avx512vbmi2,avx512vl
- Concatenate packed 32-bit integers in b and a producing an intermediate 64-bit result. Shift the result right by imm8 bits, and store the lower 32-bits in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
- _mm_mask_shrdi_epi64 Experimental avx512vbmi2,avx512vl
- Concatenate packed 64-bit integers in b and a producing an intermediate 128-bit result. Shift the result right by imm8 bits, and store the lower 64-bits in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).
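For the right-shift forms the operand order in the concatenation is reversed: b supplies the upper half and a the lower half, and the lower half of the shifted value is kept. A scalar sketch of one 64-bit lane, using `u128` for the intermediate, is shown below. `shrdi_u64` is a hypothetical name, and the model assumes the count is reduced modulo 64.

```rust
/// Scalar model of one 64-bit lane of a right funnel shift.
/// Hypothetical helper, not part of `core::arch`.
fn shrdi_u64(a: u64, b: u64, imm8: u32) -> u64 {
    // `b` forms the upper half and `a` the lower half of the 128-bit value.
    let concat = ((b as u128) << 64) | a as u128;
    // Assumption: only the low 6 bits of the count are used.
    // The cast keeps the lower 64 bits of the shifted value.
    (concat >> (imm8 & 63)) as u64
}
```

Shifting right by 4 with `a = 0xF0` and `b = 1` moves the low bit of `b` into bit 60 of the result, giving `0x1000_0000_0000_000F`.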
- _mm_mask_shrdv_epi16 Experimental avx512vbmi2,avx512vl
- Concatenate packed 16-bit integers in b and a producing an intermediate 32-bit result. Shift the result right by the amount specified in the corresponding element of c, and store the lower 16-bits in dst using writemask k (elements are copied from a when the corresponding mask bit is not set).
- _mm_mask_shrdv_epi32 Experimental avx512vbmi2,avx512vl
- Concatenate packed 32-bit integers in b and a producing an intermediate 64-bit result. Shift the result right by the amount specified in the corresponding element of c, and store the lower 32-bits in dst using writemask k (elements are copied from a when the corresponding mask bit is not set).
- _mm_mask_shrdv_epi64 Experimental avx512vbmi2,avx512vl
- Concatenate packed 64-bit integers in b and a producing an intermediate 128-bit result. Shift the result right by the amount specified in the corresponding element of c, and store the lower 64-bits in dst using writemask k (elements are copied from a when the corresponding mask bit is not set).
- _mm_maskz_compress_epi8 Experimental avx512vbmi2,avx512vl
- Contiguously store the active 8-bit integers in a (those with their respective bit set in zeromask k) to dst, and set the remaining elements to zero.
- _mm_maskz_compress_epi16 Experimental avx512vbmi2,avx512vl
- Contiguously store the active 16-bit integers in a (those with their respective bit set in zeromask k) to dst, and set the remaining elements to zero.
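The difference between the zeromask and writemask forms of compress lies only in what fills the lanes above the packed elements. The scalar sketch below contrasts the two; both helper names are hypothetical.

```rust
/// Scalar model of zeromask compress over 16 byte lanes:
/// lanes above the packed elements are zero. Hypothetical helper.
fn maskz_compress_u8(k: u16, a: [u8; 16]) -> [u8; 16] {
    mask_compress_u8([0; 16], k, a)
}

/// Scalar model of writemask compress: lanes above the packed
/// elements keep the value `src` has at the same position.
fn mask_compress_u8(src: [u8; 16], k: u16, a: [u8; 16]) -> [u8; 16] {
    let mut dst = src;
    let mut next = 0; // next free position at the low end of `dst`
    for lane in 0..16 {
        if (k >> lane) & 1 == 1 {
            dst[next] = a[lane];
            next += 1;
        }
    }
    dst
}
```

In this model the zeromask form is simply the writemask form with an all-zero `src`.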
- _mm_maskz_expand_epi8 Experimental avx512vbmi2,avx512vl
- Load contiguous active 8-bit integers from a (those with their respective bit set in mask k), and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm_maskz_expand_epi16 Experimental avx512vbmi2,avx512vl
- Load contiguous active 16-bit integers from a (those with their respective bit set in mask k), and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm_maskz_expandloadu_epi8 Experimental avx512vbmi2,avx512vl
- Load contiguous active 8-bit integers from unaligned memory at mem_addr (those with their respective bit set in mask k), and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm_maskz_expandloadu_epi16 Experimental avx512vbmi2,avx512vl
- Load contiguous active 16-bit integers from unaligned memory at mem_addr (those with their respective bit set in mask k), and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm_maskz_shldi_epi16 Experimental avx512vbmi2,avx512vl
- Concatenate packed 16-bit integers in a and b producing an intermediate 32-bit result. Shift the result left by imm8 bits, and store the upper 16-bits in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm_maskz_shldi_epi32 Experimental avx512vbmi2,avx512vl
- Concatenate packed 32-bit integers in a and b producing an intermediate 64-bit result. Shift the result left by imm8 bits, and store the upper 32-bits in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm_maskz_shldi_epi64 Experimental avx512vbmi2,avx512vl
- Concatenate packed 64-bit integers in a and b producing an intermediate 128-bit result. Shift the result left by imm8 bits, and store the upper 64-bits in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm_maskz_shldv_epi16 Experimental avx512vbmi2,avx512vl
- Concatenate packed 16-bit integers in a and b producing an intermediate 32-bit result. Shift the result left by the amount specified in the corresponding element of c, and store the upper 16-bits in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm_maskz_shldv_epi32 Experimental avx512vbmi2,avx512vl
- Concatenate packed 32-bit integers in a and b producing an intermediate 64-bit result. Shift the result left by the amount specified in the corresponding element of c, and store the upper 32-bits in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm_maskz_shldv_epi64 Experimental avx512vbmi2,avx512vl
- Concatenate packed 64-bit integers in a and b producing an intermediate 128-bit result. Shift the result left by the amount specified in the corresponding element of c, and store the upper 64-bits in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm_maskz_shrdi_epi16 Experimental avx512vbmi2,avx512vl
- Concatenate packed 16-bit integers in b and a producing an intermediate 32-bit result. Shift the result right by imm8 bits, and store the lower 16-bits in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm_maskz_shrdi_epi32 Experimental avx512vbmi2,avx512vl
- Concatenate packed 32-bit integers in b and a producing an intermediate 64-bit result. Shift the result right by imm8 bits, and store the lower 32-bits in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm_maskz_shrdi_epi64 Experimental avx512vbmi2,avx512vl
- Concatenate packed 64-bit integers in b and a producing an intermediate 128-bit result. Shift the result right by imm8 bits, and store the lower 64-bits in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm_maskz_shrdv_epi16 Experimental avx512vbmi2,avx512vl
- Concatenate packed 16-bit integers in b and a producing an intermediate 32-bit result. Shift the result right by the amount specified in the corresponding element of c, and store the lower 16-bits in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm_maskz_shrdv_epi32 Experimental avx512vbmi2,avx512vl
- Concatenate packed 32-bit integers in b and a producing an intermediate 64-bit result. Shift the result right by the amount specified in the corresponding element of c, and store the lower 32-bits in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm_maskz_shrdv_epi64 Experimental avx512vbmi2,avx512vl
- Concatenate packed 64-bit integers in b and a producing an intermediate 128-bit result. Shift the result right by the amount specified in the corresponding element of c, and store the lower 64-bits in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
- _mm_shldi_epi16 Experimental avx512vbmi2,avx512vl
- Concatenate packed 16-bit integers in a and b producing an intermediate 32-bit result. Shift the result left by imm8 bits, and store the upper 16-bits in dst.
- _mm_shldi_epi32 Experimental avx512vbmi2,avx512vl
- Concatenate packed 32-bit integers in a and b producing an intermediate 64-bit result. Shift the result left by imm8 bits, and store the upper 32-bits in dst.
- _mm_shldi_epi64 Experimental avx512vbmi2,avx512vl
- Concatenate packed 64-bit integers in a and b producing an intermediate 128-bit result. Shift the result left by imm8 bits, and store the upper 64-bits in dst.
- _mm_shldv_epi16 Experimental avx512vbmi2,avx512vl
- Concatenate packed 16-bit integers in a and b producing an intermediate 32-bit result. Shift the result left by the amount specified in the corresponding element of c, and store the upper 16-bits in dst.
- _mm_shldv_epi32 Experimental avx512vbmi2,avx512vl
- Concatenate packed 32-bit integers in a and b producing an intermediate 64-bit result. Shift the result left by the amount specified in the corresponding element of c, and store the upper 32-bits in dst.
- _mm_shldv_epi64 Experimental avx512vbmi2,avx512vl
- Concatenate packed 64-bit integers in a and b producing an intermediate 128-bit result. Shift the result left by the amount specified in the corresponding element of c, and store the upper 64-bits in dst.
- _mm_shrdi_epi16 Experimental avx512vbmi2,avx512vl
- Concatenate packed 16-bit integers in b and a producing an intermediate 32-bit result. Shift the result right by imm8 bits, and store the lower 16-bits in dst.
- _mm_shrdi_epi32 Experimental avx512vbmi2,avx512vl
- Concatenate packed 32-bit integers in b and a producing an intermediate 64-bit result. Shift the result right by imm8 bits, and store the lower 32-bits in dst.
- _mm_shrdi_epi64 Experimental avx512vbmi2,avx512vl
- Concatenate packed 64-bit integers in b and a producing an intermediate 128-bit result. Shift the result right by imm8 bits, and store the lower 64-bits in dst.
- _mm_shrdv_epi16 Experimental avx512vbmi2,avx512vl
- Concatenate packed 16-bit integers in b and a producing an intermediate 32-bit result. Shift the result right by the amount specified in the corresponding element of c, and store the lower 16-bits in dst.
- _mm_shrdv_epi32 Experimental avx512vbmi2,avx512vl
- Concatenate packed 32-bit integers in b and a producing an intermediate 64-bit result. Shift the result right by the amount specified in the corresponding element of c, and store the lower 32-bits in dst.
- _mm_shrdv_epi64 Experimental avx512vbmi2,avx512vl
- Concatenate packed 64-bit integers in b and a producing an intermediate 128-bit result. Shift the result right by the amount specified in the corresponding element of c, and store the lower 64-bits in dst.
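The unmasked, writemask and zeromask forms of each shift share the same per-lane computation and differ only in how inactive lanes are filled. The scalar sketch below shows this for the 16-bit variable right shift. All three function names are hypothetical, and the count is assumed to be reduced modulo 16.

```rust
/// Scalar model of one 16-bit lane of a variable right funnel shift.
/// Hypothetical helper, not part of `core::arch`.
fn shrdv_u16(a: u16, b: u16, c: u16) -> u16 {
    let concat = ((b as u32) << 16) | a as u32;
    // Assumption: only the low 4 bits of the count are used.
    (concat >> (c & 15)) as u16
}

/// Writemask form: inactive lanes are copied from `a`.
fn mask_shrdv_u16(a: [u16; 8], k: u8, b: [u16; 8], c: [u16; 8]) -> [u16; 8] {
    std::array::from_fn(|i| {
        if (k >> i) & 1 == 1 { shrdv_u16(a[i], b[i], c[i]) } else { a[i] }
    })
}

/// Zeromask form: inactive lanes are set to zero.
fn maskz_shrdv_u16(k: u8, a: [u16; 8], b: [u16; 8], c: [u16; 8]) -> [u16; 8] {
    std::array::from_fn(|i| {
        if (k >> i) & 1 == 1 { shrdv_u16(a[i], b[i], c[i]) } else { 0 }
    })
}
```

With an all-ones mask both masked models reduce to the unmasked per-lane result, which is a quick sanity check when porting code between the three forms.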