This documentation is archived and is not being maintained.


Visual Studio 2010

Visual Studio 2010 SP1 is required.

Microsoft Specific

Generates the XOP instruction vphaddbq to perform an integer horizontal add of its source.

__m128i _mm_haddq_epi8 (
   __m128i src

[in] src

A 128-bit parameter that contains sixteen 8-bit signed integers.

A 128-bit result r that contains two 64-bit signed integers.

r[i] := src[8*i] + src[8*i+1]) + src[8*i+2] + src[8*i+3] +
        src[8*i+4] + src[8*i+5] + src[8*i+6] + src[8*i+7];





Header file <intrin.h>

Each of the two groups of eight consecutive 8-bit signed integer values in src is added to produce a 64-bit signed integer result that is stored as the corresponding value in the destination. No overflow is possible in this instruction.

The vphaddbq instruction is part of the XOP family of instructions. Before you use this intrinsic, you must ensure that the processor supports this instruction. To determine hardware support for this instruction, call the __cpuid intrinsic with InfoType = 0x80000001 and check bit 11 of CPUInfo[2] (ECX). This bit is 1 when the instruction is supported, and 0 otherwise.

#include <stdio.h>
#include <intrin.h>
int main()
    __m128i a, d;
    int i;
    for (i = 0; i < 4; i++) {
        a.m128i_i8[i] = -128;
        a.m128i_i8[i+4] = i-128;
        a.m128i_i8[i+8] = 10*i;
        a.m128i_i8[i+12] = 127;
    d = _mm_haddq_epi8(a);
    for (i = 0; i < 2; i++) printf_s(" %I64d", d.m128i_i64[i]);
-1018 568