Adaptive replacement cache


Adaptive Replacement Cache is a page replacement algorithm with
better performance than LRU. This is accomplished by keeping track of both frequently used and recently used pages plus a recent eviction history for both. The algorithm was developed at the IBM Almaden Research Center. In 2006, IBM was granted a .

Summary

Basic LRU maintains an ordered list of resource entries in the cache, with the sort order based on the time of most recent access. New entries are added at the top of the list, after the bottom entry has been evicted. Cache hits move to the top, pushing all other entries down.
ARC improves the basic LRU strategy by splitting the cache directory into two lists, T1 and T2, for recently and frequently referenced entries. In turn, each of these is extended with a ghost list, which is attached to the bottom of the two lists. These ghost lists act as scorecards by keeping track of the history of recently evicted cache entries, and the algorithm uses ghost hits to adapt to recent change in resource usage. Note that the ghost lists only contain metadata and not the resource data itself, i.e. as an entry is evicted into a ghost list its data is discarded. The combined cache directory is organised in four LRU lists:
  1. T1, for recent cache entries.
  2. T2, for frequent entries, referenced at least twice.
  3. B1, ghost entries recently evicted from the T1 cache, but are still tracked.
  4. B2, similar ghost entries, but evicted from T2.
T1 and B1 together are referred to as L1, a combined history of recent single references.
Similarly, L2 is the combination of T2 and B2.
The whole cache directory can be visualised in a single line:
... '-> B2 ]..
'.... ]
'
The inner
' brackets indicate actual cache, which although fixed in size, can move freely across the B1 and B2 history.
L1 is now displayed from right to left, starting at the top, indicated by the ! marker. ^ indicates the target size for T1, and may be equal to, smaller than, or larger than the actual size.
Entries entering the cache will cause ! to move towards the target marker ^. If no free space exists in the cache, this marker also determines whether either T1 or T2 will evict an entry.
ARC is currently deployed in IBM's DS6000/DS8000 storage controllers.
Sun Microsystems's scalable file system ZFS uses a variant of ARC as an alternative to the traditional Solaris filesystem page cache in virtual memory. It has been modified to allow for locked pages that are currently in use and cannot be vacated.
PostgreSQL used ARC in its buffer manager for a brief time, but quickly replaced it with another algorithm,
citing concerns over an IBM patent on ARC.
VMware's vSAN is a hyper-converged, software-defined storage product developed by VMware. It uses A variant of ARC in its Caching Algorithm.