Concurrency namespace functions (AMP)

Article
08/03/2021

all_memory_fence

Blocks execution of all threads in a tile until all memory accesses have been completed. This ensures that all memory accesses are visible to other threads in the thread tile, and are executed in program order.

inline void all_memory_fence(const tile_barrier& _Barrier) restrict(amp);

Parameters

_Barrier
A tile_barrier object.

amp_uninitialize

Uninitializes the C++ AMP runtime. It is legal to call this function multiple times during an applications lifetime. Calling any C++ AMP API after calling this function will reinitialize the C++ AMP runtime. Note that it is illegal to use C++ AMP objects across calls to this function and doing so will result in undefined behavior. Also, concurrently calling this function and any other AMP APIs is illegal and would result in undefined behavior.

void __cdecl amp_uninitialize();

atomic_compare_exchange

Atomically compares the value stored at a memory location specified in the first argument for equality with the value of the second specified argument, and if the values are the same, the value at the memory location is changed to that of the third specified argument.

inline bool atomic_compare_exchange(
    _Inout_ int* _Dest,
    _Inout_ int* _Expected_value,
    int value
    ) restrict(amp)

inline bool atomic_compare_exchange(
    _Inout_ unsigned int* _Dest,
    _Inout_ unsigned int* _Expected_value,
    unsigned int value
    ) restrict(amp)

Parameters

_Dest
The location from which one of the values to be compared is read, and to which the new value, if any, is to be stored.

_Expected_value
The location from which the second value to be compared is read.

value
The value to be stored to the memory location specified in by _Dest if _Dest is equal to _Expected_value.

Return Value

true if the operation is successful; otherwise, false.

atomic_exchange Function (C++ AMP)

Sets the value of destination location as an atomic operation.

inline int atomic_exchange(
    _Inout_ int* _Dest,
    int value
    ) restrict(amp)

inline unsigned int atomic_exchange(
    _Inout_ unsigned int* _Dest,
    unsigned int value
    ) restrict(amp)

inline float atomic_exchange(
    _Inout_ float* _Dest,
    float value
    ) restrict(amp)

Parameters

_Dest
Pointer to the destination location.

value
The new value.

Return Value

The original value of the destination location.

atomic_fetch_add Function (C++ AMP)

Atomically add a value to the value of a memory location.

inline int atomic_fetch_add(
    _Inout_ int* _Dest,
    int value
    ) restrict(amp)

inline unsigned int atomic_fetch_add(
    _Inout_ unsigned int* _Dest,
    unsigned int value
    ) restrict(amp)

Parameters

_Dest
Pointer to the memory location.

value
The value to be added.

Return Value

The original value of the memory location.

atomic_fetch_and Function (C++ AMP)

Atomically performs a bitwise AND operation of a value and the value of a memory location.

inline int atomic_fetch_and(
    _Inout_ int* _Dest,
    int value
    ) restrict(amp)

inline unsigned int atomic_fetch_and(
    _Inout_ unsigned int* _Dest,
    unsigned int value
    ) restrict(amp)

Parameters

_Dest
Pointer to the memory location.

value
The value to use in the bitwise AND calculation.

Return Value

The original value of the memory location.

atomic_fetch_dec

Atomically decrements the value stored at the specified memory location.

inline int atomic_fetch_dec(_Inout_ int* _Dest
    ) restrict(amp)

inline unsigned int atomic_fetch_dec(_Inout_ unsigned int* _Dest) restrict(amp);

Parameters

_Dest
The location in memory of the value to be decremented.

Return Value

The original value stored at the memory location.

atomic_fetch_inc

Atomically increments the value stored at the specified memory location.

inline int atomic_fetch_inc(_Inout_ int* _Dest) restrict(amp);

inline unsigned int atomic_fetch_inc(_Inout_ unsigned int* _Dest) restrict(amp);

Parameters

_Dest
The location in memory of the value to be incremented.

Return Value

The original value stored at the memory location.

atomic_fetch_max

Atomically computes the maximum value between the value stored at the memory location specified in the first argument and the value specified in the second argument, and stores it at the same memory location.

inline int atomic_fetch_max(
    _Inout_ int* _Dest,
    int value
    ) restrict(amp)

inline unsigned int atomic_fetch_max(
    _Inout_ unsigned int* _Dest,
    unsigned int value
    ) restrict(amp)

Parameters

_Dest
The location from which one of the values to be compared is read, and to which the maximum of the two values is to be stored.

value
The value to be compared to the value at the specified location.

Return Value

The original value stored at the specified location location.

atomic_fetch_min

Atomically computes the minimum value between the value stored at the memory location specified in the first argument and the value specified in the second argument, and stores it at the same memory location.

inline int atomic_fetch_min(
    _Inout_ int* _Dest,
    int value
    ) restrict(amp)

inline unsigned int atomic_fetch_min(
    _Inout_ unsigned int* _Dest,
    unsigned int value
    ) restrict(amp)

Parameters

_Dest
The location from which one of the values to be compared is read, and to which the minimum of the two values is to be stored.

value
The value to be compared to the value at the specified location.

Return Value

The original value stored at the specified location location.

atomic_fetch_or Function (C++ AMP)

Atomically performs a bitwise OR operation with a value and the value of a memory location.

inline int atomic_fetch_or(
    _Inout_ int* _Dest,
    int value
    ) restrict(amp)

inline unsigned int atomic_fetch_or(
    _Inout_ unsigned int* _Dest,
    unsigned int value
    ) restrict(amp)

Parameters

_Dest
Pointer to the memory location.

value
The value to use in the bitwise OR calculation.

Return Value

The original value of the memory location.

atomic_fetch_sub Function (C++ AMP)

Atomically subtracts a value from a memory location.

inline int atomic_fetch_sub(
    _Inout_ int* _Dest,
    int value
    ) restrict(amp)

inline unsigned int atomic_fetch_sub(
    _Inout_ unsigned int* _Dest,
    unsigned int value
    ) restrict(amp)

Parameters

_Dest
Pointer to the destination location.

value
The value to be subtracted.

Return Value

The original value of the memory location.

atomic_fetch_xor Function (C++ AMP)

Atomically performs an bitwise XOR operation of a value and a memory location.

inline int atomic_fetch_xor(
    _Inout_ int* _Dest,
    int value
    ) restrict(amp)

inline unsigned int atomic_fetch_xor(
    _Inout_ unsigned int* _Dest,
    unsigned int value
    ) restrict(amp)

Parameters

_Dest
Pointer to the memory location.

value
The value to use in the XOR calculation.

Return Value

The original value of the memory location.

copy

Copies a C++ AMP object. All synchronous data transfer requirements are met. You can't copy data when running code on an accelerator. The general form of this function is copy(src, dest).

template <typename value_type, int _Rank>
void copy(
    const array<value_type, _Rank>& _Src,
    array<value_type, _Rank>& _Dest);

template <typename InputIterator, typename value_type, int _Rank>
void copy(
    InputIterator _SrcFirst,
    InputIterator _SrcLast,
    array<value_type, _Rank>& _Dest);

template <typename InputIterator, typename value_type, int _Rank>
void copy(
    InputIterator _SrcFirst,
    array<value_type, _Rank>& _Dest);

template <typename OutputIterator, typename value_type, int _Rank>
void copy(
    const array<value_type, _Rank>& _Src,
   OutputIterator _DestIter);

template <typename value_type, int _Rank>
void copy(
    const array<value_type, _Rank>& _Src,
    array_view<value_type, _Rank>& _Dest);

template <typename value_type, int _Rank>
void copy(
    const array_view<const value_type, _Rank>& _Src,
    array<value_type, _Rank>& _Dest);

template <typename value_type, int _Rank>
void copy(
    const array_view<value_type, _Rank>& _Src,
    array<value_type, _Rank>& _Dest);

template <typename value_type, int _Rank>
void copy(
    const array_view<const value_type, _Rank>& _Src,
    array_view<value_type, _Rank>& _Dest);

template <typename value_type, int _Rank>
void copy(
    const array_view<value_type, _Rank>& _Src,
    array_view<value_type, _Rank>& _Dest);

template <typename InputIterator, typename value_type, int _Rank>
void copy(
    InputIterator _SrcFirst,
    InputIterator _SrcLast,
    array_view<value_type, _Rank>& _Dest);

template <typename InputIterator, typename value_type, int _Rank>
void copy(
    InputIterator _SrcFirst,
    array_view<value_type, _Rank>& _Dest);

template <typename OutputIterator, typename value_type, int _Rank>
void copy(
    const array_view<value_type, _Rank>& _Src,
    OutputIterator _DestIter);

Parameters

_Dest
The object to copy to.

_DestIter
An output iterator to the beginning position at destination.

InputIterator
The type of the input iterator.

OutputIterator
The type of the output iterator.

_Rank
The rank of the object to copy from or the object to copy to.

_Src
To object to copy.

_SrcFirst
A beginning iterator into the source container.

_SrcLast
An ending iterator into the source container.

value_type
The data type of the elements that are copied.

copy_async

Copies a C++ AMP object and returns a completion_future object that can be waited on. You can't copy data when running code on an accelerator. The general form of this function is copy(src, dest).

template <typename value_type, int _Rank>
concurrency::completion_future copy_async(
    const array<value_type, _Rank>& _Src,
    array<value_type, _Rank>& _Dest);

template <typename InputIterator, typename value_type, int _Rank>
concurrency::completion_future copy_async(InputIterator _SrcFirst, InputIterator _SrcLast,
    array<value_type, _Rank>& _Dest);

template <typename InputIterator, typename value_type, int _Rank>
concurrency::completion_future copy_async(InputIterator _SrcFirst,
    array<value_type, _Rank>& _Dest);

template <typename OutputIterator, typename value_type, int _Rank>
concurrency::completion_future copy_async(
    const array<value_type, _Rank>& _Src, OutputIterator _DestIter);

template <typename value_type, int _Rank>
concurrency::completion_future copy_async(
    const array<value_type, _Rank>& _Src,
    array_view<value_type, _Rank>& _Dest);

template <typename value_type, int _Rank>
concurrency::completion_future copy_async(
    const array_view<const value_type, _Rank>& _Src,
    array<value_type, _Rank>& _Dest);

template <typename value_type, int _Rank>
concurrency::completion_future copy_async(
    const array_view<value_type, _Rank>& _Src,
    array<value_type, _Rank>& _Dest);

template <typename value_type, int _Rank>
concurrency::completion_future copy_async(
    const array_view<const value_type, _Rank>& _Src,
    array_view<value_type, _Rank>& _Dest);

template <typename value_type, int _Rank>
concurrency::completion_future copy_async(
    const array_view<value_type, _Rank>& _Src,
    array_view<value_type, _Rank>& _Dest);

template <typename InputIterator, typename value_type, int _Rank>
concurrency::completion_future copy_async(InputIterator _SrcFirst, InputIterator _SrcLast,
    array_view<value_type, _Rank>& _Dest);

template <typename InputIterator, typename value_type, int _Rank>
concurrency::completion_future copy_async(InputIterator _SrcFirst,
    array_view<value_type, _Rank>& _Dest);

template <typename OutputIterator, typename value_type, int _Rank>
concurrency::completion_future copy_async(
    const array_view<value_type, _Rank>& _Src, OutputIterator _DestIter);

Parameters

_Dest
The object to copy to.

_DestIter
An output iterator to the beginning position at destination.

InputIterator
The type of the input iterator.

OutputIterator
The type of the output iterator.

_Rank
The rank of the object to copy from or the object to copy to.

_Src
To object to copy.

_SrcFirst
A beginning iterator into the source container.

_SrcLast
An ending iterator into the source container.

value_type
The data type of the elements that are copied.

Return Value

A future<void> that can be waited on.

direct3d_abort

Aborts the execution of a function with the restrict(amp) restriction clause. When the AMP runtime detects the call, it raises a runtime_exception exception with the error message "Reference Rasterizer: Shader abort instruction hit".

void direct3d_abort() restrict(amp);

direct3d_errorf

Prints a formatted string to the Visual Studio output window. It is called from a function with the restrict(amp) restriction clause. When the AMP runtime detects the call, it raises a runtime_exception exception with the same formatting string.

void direct3d_errorf(
    const char *,
...) restrict(amp);

direct3d_printf

Prints a formatted string to the Visual Studio output window. It is called from a function with the restrict(amp) restriction clause.

void direct3d_printf(
    const char *,
...) restrict(amp);

global_memory_fence

Blocks execution of all threads in a tile until all global memory accesses have been completed. This ensures that global memory accesses are visible to other threads in the thread tile, and are executed in program order.

inline void global_memory_fence(const tile_barrier& _Barrier) restrict(amp);

Parameters

_Barrier
A tile_barrier object

parallel_for_each Function (C++ AMP)

Runs a function across the compute domain. For more information, see C++ AMP Overview.

template <int _Rank, typename _Kernel_type>
void parallel_for_each(
    const extent<_Rank>& _Compute_domain,
    const _Kernel_type& _Kernel);

template <int _Dim0, int _Dim1, int _Dim2, typename _Kernel_type>
void parallel_for_each(
    const tiled_extent<_Dim0, _Dim1, _Dim2>& _Compute_domain,
   const _Kernel_type& _Kernel);

template <int _Dim0, int _Dim1, typename _Kernel_type>
void parallel_for_each(
    const tiled_extent<_Dim0, _Dim1>& _Compute_domain,
    const _Kernel_type& _Kernel);

template <int _Dim0, typename _Kernel_type>
void parallel_for_each(
    const tiled_extent<_Dim0>& _Compute_domain,
    const _Kernel_type& _Kernel);

template <int _Rank, typename _Kernel_type>
void parallel_for_each(
    const accelerator_view& _Accl_view,
    const extent<_Rank>& _Compute_domain,
    const _Kernel_type& _Kernel);

template <int _Dim0, int _Dim1, int _Dim2, typename _Kernel_type>
void parallel_for_each(
    const accelerator_view& _Accl_view,
    const tiled_extent<_Dim0, _Dim1, _Dim2>& _Compute_domain,
    const _Kernel_type& _Kernel);

template <int _Dim0, int _Dim1, typename _Kernel_type>
void parallel_for_each(
    const accelerator_view& _Accl_view,
    const tiled_extent<_Dim0, _Dim1>& _Compute_domain,
    const _Kernel_type& _Kernel);

template <int _Dim0, typename _Kernel_type>
void parallel_for_each(
    const accelerator_view& _Accl_view,
    const tiled_extent<_Dim0>& _Compute_domain,
    const _Kernel_type& _Kernel);

Parameters

_Accl_view
The accelerator_view object to run the parallel computation on.

_Compute_domain
An extent object that contains the data for the computation.

_Dim0
The dimension of the tiled_extent object.

_Dim1
The dimension of the tiled_extent object.

_Dim2
The dimension of the tiled_extent object.

_Kernel
A lambda or function object that takes an argument of type "index<_Rank>" and performs the parallel computation.

_Kernel_type
A lambda or functor.

_Rank
The rank of the extent.

tile_static_memory_fence

Blocks execution of all threads in a tile until all outstanding tile_static memory accesses have been completed. This ensures that tile_static memory accesses are visible to other threads in the thread tile, and that accesses are executed in program order.

inline void tile_static_memory_fence(const tile_barrier& _Barrier) restrict(amp);

Parameters

_Barrier
A tile_barrier object.

Concurrency namespace functions (AMP)

all_memory_fence

Parameters

amp_uninitialize

atomic_compare_exchange

Parameters

Return Value

atomic_exchange Function (C++ AMP)

Parameters

Return Value

atomic_fetch_add Function (C++ AMP)

Parameters

Return Value

atomic_fetch_and Function (C++ AMP)

Parameters

Return Value

atomic_fetch_dec

Parameters

Return Value

atomic_fetch_inc

Parameters

Return Value

atomic_fetch_max

Parameters

Return Value

atomic_fetch_min

Parameters

Return Value

atomic_fetch_or Function (C++ AMP)

Parameters

Return Value

atomic_fetch_sub Function (C++ AMP)

Parameters

Return Value

atomic_fetch_xor Function (C++ AMP)

Parameters

Return Value

copy

Parameters

copy_async

Parameters

Return Value

direct3d_abort

direct3d_errorf

direct3d_printf

global_memory_fence

Parameters

parallel_for_each Function (C++ AMP)

Parameters

tile_static_memory_fence

Parameters

See also

Feedback

Feedback

Additional resources