Fetch-and-add


In computer science, the fetch-and-add CPU instruction atomically increments the contents of a memory location by a specified value.
That is, fetch-and-add performs the operation
in such a way that if this operation is executed by one process in a concurrent system, no other process will ever see an intermediate result.
Fetch-and-add can be used to implement concurrency control structures such as mutex locks and semaphores.

Overview

The motivation for having an atomic fetch-and-add is that operations that appear in programming languages as
are not safe in a concurrent system, where multiple processes or threads are running concurrently. The reason is that such an operation is actually implemented as multiple machine instructions:
  1. Fetch the value at the location, say, into a register;
  2. add to in the register;
  3. store the new value of the register back into.
When one process is doing and another is doing concurrently, there is a race condition. They might both fetch and operate on that, then both store their results with the effect that one overwrites the other and the stored value becomes either or, not as might be expected.
In uniprocessor systems with no kernel preemption supported, it is sufficient to disable interrupts before accessing a critical section.
However, in multiprocessor systems two or more processors could be attempting to access the same memory at the same time. The fetch-and-add instruction allows any processor to atomically increment a value in memory, preventing such multiple processor collisions.
Maurice Herlihy proved that fetch-and-add has a finite consensus number, in contrast to the compare-and-swap operation. The fetch-and-add operation can solve the wait-free consensus problem for no more than two concurrent processes.

Implementation

The fetch-and-add instruction behaves like the following function. Crucially, the entire function is executed atomically: no process can interrupt the function mid-execution and hence see a state that only exists during the execution of the function. This code only serves to help explain the behaviour of fetch-and-add; atomicity requires explicit hardware support and hence can not be implemented as a simple high level function.
<< atomic >>
function FetchAndAdd
To implement a mutual exclusion lock, we define the operation FetchAndIncrement, which is equivalent to FetchAndAdd with inc=1.
With this operation, a mutual exclusion lock can be implemented using the ticket lock algorithm as:
record locktype
procedure LockInit
procedure Lock
procedure UnLock
These routines provide a mutual-exclusion lock when following conditions are met:
An atomic function appears in the C++11 standard. It is available as a proprietary extension to C in the Itanium ABI specification, and in GCC.

x86 implementation

In the x86 architecture, the instruction ADD with the destination operand specifying a memory location is a fetch-and-add instruction that has been there since the 8086, and with the LOCK prefix, is atomic across multiple processors. However, it could not return the original value of the memory location until the 486 introduced the XADD instruction.
The following is a C implementation for the GCC compiler, for both 32 and 64 bit x86 Intel platforms, based on extended asm syntax:

static inline int fetch_and_add

History

fetch-and-add was introduced by the Ultracomputer project, which also produced a multiprocessor supporting fetch-and-add and containing custom vlsi switches that were able to combine concurrent memory references to prevent them from serializing at the memory module containing the destination operand.