Closure (computer programming)
In programming languages, a closure, also lexical closure or function closure, is a technique for implementing lexically scoped name binding in a language with first-class functions. Operationally, a closure is a record storing a function together with an environment. The environment is a mapping associating each free variable of the function with the value or reference to which the name was bound when the closure was created. Unlike a plain function, a closure allows the function to access those captured variables through the closure's copies of their values or references, even when the function is invoked outside their scope.
History and etymology
The concept of closures was developed in the 1960s for the mechanical evaluation of expressions in the λ-calculus and was first fully implemented in 1970 as a language feature in the PAL programming language to support lexically scoped first-class functions.Peter J. Landin defined the term closure in 1964 as having an environment part and a control part as used by his SECD machine for evaluating expressions. Joel Moses credits Landin with introducing the term closure to refer to a lambda expression whose open bindings have been closed by the [|lexical environment], resulting in a closed expression, or closure. This usage was subsequently adopted by Sussman and Steele when they defined Scheme in 1975, a lexically scoped variant of LISP, and became widespread.
Anonymous functions
The term closure is often used as a synonym for anonymous function, though strictly, an anonymous function is a function literal without a name, while a closure is an instance of a function, a value, whose non-local variables have been bound either to values or to storage locations.For example, in the following Python code:
def f:
def g:
return x + y
return g # Return a closure.
def h:
return lambda y: x + y # Return a closure.
- Assigning specific closures to variables.
b = h
- Using the closures stored in variables.
assert b 6
- Using closures without binding them to variables first.
assert h 6 # h is the closure.
the values of
a
and b
are closures, in both cases produced by returning a nested function with a free variable from the enclosing function, so that the free variable binds to the value of parameter x
of the enclosing function. The closures in a
and b
are functionally identical. The only difference in implementation is that in the first case we used a nested function with a name, g
, while in the second case we used an anonymous nested function. The original name, if any, used in defining them is irrelevant. A closure is a value like any other value. It does not need to be assigned to a variable and can instead be used directly, as shown in the last two lines of the example. This usage may be deemed an "anonymous closure".
The nested function definitions are not themselves closures: they have a free variable which is not yet bound. Only once the enclosing function is evaluated with a value for the parameter is the free variable of the nested function bound, creating a closure, which is then returned from the enclosing function.
Lastly, a closure is only distinct from a function with free variables when outside of the scope of the non-local variables, otherwise the defining environment and the execution environment coincide and there is nothing to distinguish these. For example, in the below program, functions with a free variable
x
are executed in the same environment where x
is defined, so it is immaterial whether these are actually closures:x = 1
nums =
def f:
return x + y
map
map
This is most often achieved by a function return, since the function must be defined within the scope of the non-local variables, in which case typically its own scope will be smaller.
This can also be achieved by variable shadowing, though this is less common in practice, as it is less useful and shadowing is discouraged. In this example
f
can be seen to be a closure because x
in the body of f
is bound to the x
in the global namespace, not the x
local to g
:x = 0
def f:
return x + y
def g:
x = 1 # local x shadows global x
return f
g # evaluates to 1, not 2
Applications
The use of closures is associated with languages where functions are first-class objects, in which functions can be returned as results from higher-order functions, or passed as arguments to other function calls; if functions with free variables are first-class, then returning one creates a closure. This includes functional programming languages such as Lisp and ML, as well as many modern, multi-paradigm languages, such as Python and Rust. Closures are also frequently used with callbacks, particularly for event handlers, such as in JavaScript, where they are used for interactions with a dynamic web page.Closures can also be used in a continuation-passing style to hide state. Constructs such as objects and control structures can thus be implemented with closures. In some languages, a closure may occur when a function is defined within another function, and the inner function refers to local variables of the outer function. At run-time, when the outer function executes, a closure is formed, consisting of the inner function's code and references to any variables of the outer function required by the closure.
First-class functions
Closures typically appear in languages with first-class functions—in other words, such languages enable functions to be passed as arguments, returned from function calls, bound to variable names, etc., just like simpler types such as strings and integers. For example, consider the following Scheme function:; Return a list of all books with at least THRESHOLD copies sold.
)
book-list))
In this example, the lambda expression
appears within the function best-selling-books
. When the lambda expression is evaluated, Scheme creates a closure consisting of the code for the lambda expression and a reference to the threshold
variable, which is a free variable inside the lambda expression.The closure is then passed to the
filter
function, which calls it repeatedly to determine which books are to be added to the result list and which are to be discarded. Because the closure itself has a reference to threshold
, it can use that variable each time filter
calls it. The function filter
itself might be defined in a completely separate file.Here is the same example rewritten in JavaScript, another popular language with support for closures:
// Return a list of all books with at least 'threshold' copies sold.
function bestSellingBooks
The
function
keyword is used here instead of lambda
, and an Array.filter
method instead of a global filter
function, but otherwise the structure and the effect of the code are the same.A function may create a closure and return it, as in the following example:
// Return a function that approximates the derivative of f
// using an interval of dx, which should be appropriately small.
function derivative
Because the closure in this case outlives the execution of the function that creates it, the variables
f
and dx
live on after the function derivative
returns, even though execution has left their scope and they are no longer visible. In languages without closures, the lifetime of an automatic local variable coincides with the execution of the stack frame where that variable is declared. In languages with closures, variables must continue to exist as long as any existing closures have references to them. This is most commonly implemented using some form of garbage collection.State representation
A closure can be used to associate a function with a set of "private" variables, which persist over several invocations of the function. The scope of the variable encompasses only the closed-over function, so it cannot be accessed from other program code. These are analogous to private variables in object-oriented programming, and in fact closures are analogous to a type of object, specifically function objects, with a single public method, and possible many private variables.In stateful languages, closures can thus be used to implement paradigms for state representation and information hiding, since the closure's upvalues are of indefinite extent, so a value established in one invocation remains available in the next. Closures used in this way no longer have referential transparency, and are thus no longer pure functions; nevertheless, they are commonly used in impure functional languages such as Scheme.
Other uses
Closures have many uses:- Because closures delay evaluation—i.e., they do not "do" anything until they are called—they can be used to define control structures. For example, all of Smalltalk's standard control structures, including branches and loops, are defined using objects whose methods accept closures. Users can easily define their own control structures also.
- In languages which implement assignment, multiple functions can be produced that close over the same environment, enabling them to communicate privately by altering that environment. In Scheme:
)
))
))
; prints "none"
; prints "meet me by the docks at midnight"
- Closures can be used to implement object systems.
Implementation and theory
Closures are typically implemented with a special data structure that contains a pointer to the function code, plus a representation of the function's lexical environment at the time when the closure was created. The referencing environment binds the non-local names to the corresponding variables in the lexical environment at the time the closure is created, additionally extending their lifetime to at least as long as the lifetime of the closure itself. When the closure is entered at a later time, possibly with a different lexical environment, the function is executed with its non-local variables referring to the ones captured by the closure, not the current environment.A language implementation cannot easily support full closures if its run-time memory model allocates all automatic variables on a linear stack. In such languages, a function's automatic local variables are deallocated when the function returns. However, a closure requires that the free variables it references survive the enclosing function's execution. Therefore, those variables must be allocated so that they persist until no longer needed, typically via heap allocation, rather than on the stack, and their lifetime must be managed so they survive until all closures referencing them are no longer in use.
This explains why, typically, languages that natively support closures also use garbage collection. The alternatives are manual memory management of non-local variables, or, if using stack allocation, for the language to accept that certain use cases will lead to undefined behaviour, due to dangling pointers to freed automatic variables, as in lambda expressions in C++11 or nested functions in GNU C. The funarg problem describes the difficulty of implementing functions as first class objects in a stack-based programming language such as C or C++. Similarly in D version 1, it is assumed that the programmer knows what to do with delegates and automatic local variables, as their references will be invalid after return from its definition scope – this still permits many useful functional patterns, but for complex cases needs explicit heap allocation for variables. D version 2 solved this by detecting which variables must be stored on the heap, and performs automatic allocation. Because D uses garbage collection, in both versions, there is no need to track usage of variables as they are passed.
In strict functional languages with immutable data, it is very easy to implement automatic memory management, as there are no possible cycles in variables' references. For example, in Erlang, all arguments and variables are allocated on the heap, but references to them are additionally stored on the stack. After a function returns, references are still valid. Heap cleaning is done by incremental garbage collector.
In ML, local variables are lexically scoped, and hence define a stack-like model, but since they are bound to values and not to objects, an implementation is free to copy these values into the closure's data structure in a way that is invisible to the programmer.
Scheme, which has an ALGOL-like lexical scope system with dynamic variables and garbage collection, lacks a stack programming model and does not suffer from the limitations of stack-based languages. Closures are expressed naturally in Scheme. The lambda form encloses the code, and the free variables of its environment persist within the program as long as they can possibly be accessed, and so they can be used as freely as any other Scheme expression.
Closures are closely related to Actors in the Actor model of concurrent computation where the values in the function's lexical environment are called acquaintances. An important issue for closures in concurrent programming languages is whether the variables in a closure can be updated and, if so, how these updates can be synchronized. Actors provide one solution.
Closures are closely related to function objects; the transformation from the former to the latter is known as defunctionalization or lambda lifting; see also closure conversion.
Differences in semantics
Lexical environment
As different languages do not always have a common definition of the lexical environment, their definitions of closure may vary also. The commonly held minimalist definition of the lexical environment defines it as a set of all bindings of variables in the scope, and that is also what closures in any language have to capture. However the meaning of a variable binding also differs. In imperative languages, variables bind to relative locations in memory that can store values. Although the relative location of a binding does not change at runtime, the value in the bound location can. In such languages, since closure captures the binding, any operation on the variable, whether done from the closure or not, are performed on the same relative memory location. This is often called capturing the variable "by reference". Here is an example illustrating the concept in ECMAScript, which is one such language:// ECMAScript
var f, g;
function foo
foo; // 2
alert: ' + g); // 1
alert: ' + g); // 0
alert: ' + f); // 1
alert: ' + f); // 2
Function
foo
and the closures referred to by variables f
and g
all use the same relative memory location signified by local variable x
.In some instances the above behaviour may be undesirable, and it is necessary to bind a different lexical closure. Again in ECMAScript, this would be done using the
Function.bind
.Example 2: Accidental reference to a bound variable
For this example the expected behaviour would be that each link should emit its id when clicked; but because the variable 'e' is bound the scope above, and lazy evaluated on click, what actually happens is that each on click event emits the id of the last element in 'elements' bound at the end of the for loop.var elements= document.getElementsByTagName;
//Incorrect: e is bound to the function containing the 'for' loop, not the closure of "handle"
for
Again here variable
e
would need to be bound by the scope of the block using handle.bind
or the let
keyword.On the other hand, many functional languages, such as ML, bind variables directly to values. In this case, since there is no way to change the value of the variable once it is bound, there is no need to share the state between closures—they just use the same values. This is often called capturing the variable "by value". Java's local and anonymous classes also fall into this category—they require captured local variables to be
final
, which also means there is no need to share state.Some languages enable you to choose between capturing the value of a variable or its location. For example, in C++11, captured variables are either declared with
, which means captured by reference, or with
, which means captured by value.Yet another subset, lazy functional languages such as Haskell, bind variables to results of future computations rather than values. Consider this example in Haskell:
-- Haskell
foo :: Fractional a => a -> a ->
foo x y =
where r = x / y
f :: Fractional a => a -> a
f = foo 1 0
main = print
The binding of
r
captured by the closure defined within function foo
is to the computation
—which in this case results in division by zero. However, since it is the computation that is captured, and not the value, the error only manifests itself when the closure is invoked, and actually attempts to use the captured binding.Closure leaving
Yet more differences manifest themselves in the behavior of other lexically scoped constructs, such asreturn
, break
and continue
statements. Such constructs can, in general, be considered in terms of invoking an escape continuation established by an enclosing control statement. In some languages, such as ECMAScript, return
refers to the continuation established by the closure lexically innermost with respect to the statement—thus, a return
within a closure transfers control to the code that called it. However, in Smalltalk, the superficially similar operator ^
invokes the escape continuation established for the method invocation, ignoring the escape continuations of any intervening nested closures. The escape continuation of a particular closure can only be invoked in Smalltalk implicitly by reaching the end of the closure's code. The following examples in ECMAScript and Smalltalk highlight the difference:"Smalltalk"
foo
| xs |
xs := #.
xs do: .
^0
bar
Transcript show: "prints 1"
// ECMAScript
function foo
alert); // prints 0
The above code snippets will behave differently because the Smalltalk
^
operator and the JavaScript return
operator are not analogous. In the ECMAScript example, return x
will leave the inner closure to begin a new iteration of the forEach
loop, whereas in the Smalltalk example, ^x
will abort the loop and return from the method foo
.Common Lisp provides a construct that can express either of the above actions: Lisp
behaves as Smalltalk ^x
, while Lisp
behaves as JavaScript return x
. Hence, Smalltalk makes it possible for a captured escape continuation to outlive the extent in which it can be successfully invoked. Consider:"Smalltalk"
foo
^
bar
| f |
f := self foo.
f value: 123 "error!"
When the closure returned by the method
foo
is invoked, it attempts to return a value from the invocation of foo
that created the closure. Since that call has already returned and the Smalltalk method invocation model does not follow the spaghetti stack discipline to facilitate multiple returns, this operation results in an error.Some languages, such as Ruby, enable the programmer to choose the way
return
is captured. An example in Ruby:- Ruby
- Closure using a Proc
f = Proc.new
f.call # control leaves foo here
return "return from foo"
end
- Closure using a lambda
f = lambda
f.call # control does not leave bar here
return "return from bar"
end
puts foo # prints "return from foo from inside proc"
puts bar # prints "return from bar"
Both
Proc.new
and lambda
in this example are ways to create a closure, but semantics of the closures thus created are different with respect to the return
statement.In Scheme, definition and scope of the
return
control statement is explicit. The following is a direct translation of the Ruby sample.; Scheme
)
; control leaves foo here
)))
)))
; control does not leave bar here
)))
; prints "return from foo from inside proc"
; prints "return from bar"
Closure-like constructs
Some languages have features which simulate the behavior of closures. In languages such as Java, C++, Objective-C, C#, VB.NET, and D, these features are the result of the language's object-oriented paradigm.Callbacks (C)
Some C libraries supportcallbacks. This is
sometimes implemented by providing two values when
registering the callback with the library: a function
pointer and a separate
void*
pointer to arbitrary data of the user's choice. When the library
executes the callback function, it passes along the data
pointer. This enables the callback to maintain state and
to refer to information captured at the time it was
registered with the library. The idiom is similar to
closures in functionality, but not in syntax. The
void*
pointer is not type safe so this Cidiom differs from type-safe closures in C#, Haskell or ML.
Callbacks are extensively used in GUI Widget toolkits to
implement Event-driven programming by associating general
functions of graphical widgets with application-specific functions
implementing the specific desired behavior for the application.
Nested function and function pointer(C)
With a gcc extension, a can be used and a function pointer can emulate closures, providing the containing function does not exit. The example below is invalid:- include
fn_int_to_int adder
int main
Local classes and lambda functions (Java)
enables classes to be defined inside methods. These are called local classes. When such classes are not named, they are known as anonymous classes. A local class may refer to names in lexically enclosing classes, or read-only variables in the lexically enclosing method.class CalculationWindow extends JFrame
The capturing of
final
variables enables you to capture variables by value. Even if the variable you want to capture is non-final
, you can always copy it to a temporary final
variable just before the class.Capturing of variables by reference can be emulated by using a
final
reference to a mutable container, for example, a single-element array. The local class will not be able to change the value of the container reference itself, but it will be able to change the contents of the container.With the advent of Java 8's lambda expressions, the closure causes the above code to be executed as:
class CalculationWindow extends JFrame
Local classes are one of the types of inner class that are declared within the body of a method. Java also supports inner classes that are declared as non-static members of an enclosing class. They are normally referred to just as "inner classes". These are defined in the body of the enclosing class and have full access to instance variables of the enclosing class. Due to their binding to these instance variables, an inner class may only be instantiated with an explicit binding to an instance of the enclosing class using a special syntax.
public class EnclosingClass
Upon execution, this will print the integers from 0 to 9. Beware to not confuse this type of class with the nested class, which is declared in the same way with an accompanied usage of the "static" modifier; those have not the desired effect but are instead just classes with no special binding defined in an enclosing class.
As of Java 8, Java supports functions as first class objects. Lambda expressions of this form are considered of type
Function
with T being the domain and U the image type. The expression can be called with its .apply
method, but not with a standard method call.public static void main
Blocks (C, C++, Objective-C 2.0)
introduced blocks, a form of closure, as a nonstandard extension into C, C++, Objective-C 2.0 and in Mac OS X 10.6 "Snow Leopard" and iOS 4.0. Apple made their implementation available for the GCC and clang compilers.Pointers to block and block literals are marked with
^
. Normal local variables are captured by value when the block is created, and are read-only inside the block. Variables to be captured by reference are marked with __block
. Blocks that need to persist outside of the scope they are created in may need to be copied.typedef int ;
IntBlock downCounter
IntBlock f = downCounter;
NSLog);
NSLog);
NSLog);
Delegates (C#, VB.NET, D)
anonymous methods and lambda expressions support closure:var data = new ;
var multiplier = 2;
var result = data.Select;
Visual Basic.NET, which has many language features similar to those of C#, also supports lambda expressions with closures:
Dim data =
Dim multiplier = 2
Dim result = data.Select
In D, closures are implemented by delegates, a function pointer paired with a context pointer.
auto test1
auto test2
void bar
D version 1, has limited closure support. For example, the above code will not work correctly, because the variable a is on the stack, and after returning from test, it is no longer valid to use it. This can be solved by explicitly allocating the variable 'a' on heap, or using structs or class to store all needed closed variables and construct a delegate from a method implementing the same code. Closures can be passed to other functions, as long as they are only used while the referenced values are still valid, and are useful for writing generic data processing code, so this limitation, in practice, is often not an issue.
This limitation was fixed in D version 2 - the variable 'a' will be automatically allocated on the heap because it is used in the inner function, and a delegate of that function can escape the current scope. Any other local variables that are not referenced by delegates or that are only referenced by delegates that do not escape the current scope, remain on the stack, which is simpler and faster than heap allocation. The same is true for inner's class methods that references a function's variables.
Function objects (C++)
enables defining function objects by overloadingoperator
. These objects behave somewhat like functions in a functional programming language. They may be created at runtime and may contain state, but they do not implicitly capture local variables as closures do. As of the 2011 revision, the C++ language also supports closures, which are a type of function object constructed automatically from a special language construct called lambda-expression. A C++ closure may capture its context either by storing copies of the accessed variables as members of the closure object or by reference. In the latter case, if the closure object escapes the scope of a referenced object, invoking its operator
causes undefined behavior since C++ closures do not extend the lifetime of their context.void foo
Inline agents (Eiffel)
includes inline agents defining closures. An inline agent is an object representing a routine, defined by giving the code of the routine in-line. For example, inok_button.click_event.subscribe do
map.country_at_coordinates.display
end
the argument to
subscribe
is an agent, representing a procedure with two arguments; the procedure finds the country at the corresponding coordinates and displays it. The whole agent is "subscribed" to the event type click_event
for acertain button, so that whenever an instance of the event type occurs on that button — because a user has clicked the button — the procedure will be executed with the mouse coordinates being passed as arguments for
x
and y
.The main limitation of Eiffel agents, which distinguishes them from closures in other languages, is that they cannot reference local variables from the enclosing scope. This design decision helps in avoiding ambiguity when talking about a local variable value in a closure - should it be the latest value of the variable or the value captured when the agent is created? Only
Current
, its features, and arguments of the agent itself can be accessed from within the agent body. The values of the outer local variables can be passed by providing additional closed operands to the agent.C++Builder __closure reserved word
Embarcadero C++Builder provides the reserve word __closure to provide a pointer to a method with a similar syntax to a function pointer.In standard C you could write a for a pointer to a function type using the following syntax:
typedef void ;
typedef void ;