Exponentiation by squaring

In mathematics and computer programming, exponentiating by squaring is a general method for fast computation of large positive integer powers of a number, or more generally of an element of a semigroup, like a polynomial or a square matrix. Some variants are commonly referred to as square-and-multiply algorithms or binary exponentiation. These can be of quite general use, for example in modular arithmetic or powering of matrices. For semigroups for which additive notation is commonly used, like elliptic curves used in cryptography, this method is also referred to as double-and-add.

Basic method

The method is based on the observation that, for a positive integer n, we have
This method uses the bits of the exponent to determine which powers are computed.
This example shows how to compute using this method.
The exponent, 13, is 1101 in binary. The bits are used in left to right order.
The exponent has 4 bits, so there are 4 iterations.
First, initialize the result to 1:.
If we write in binary as, then this is equivalent to defining a sequence by letting and then defining for, where will equal.
This may be implemented as the following recursive algorithm:

Function exp_by_squaring
if n < 0 then return exp_by_squaring;
else if n = 0 then return 1;
else if n = 1 then return x ;
else if n is even then return exp_by_squaring;
else if n is odd then return x * exp_by_squaring;

Although not tail-recursive, this algorithm may be rewritten into a tail recursive algorithm by introducing an auxiliary function:

Function exp_by_squaring
return exp_by_squaring2
Function exp_by_squaring2
if n < 0 then return exp_by_squaring2;
else if n = 0 then return y;
else if n = 1 then return x * y;
else if n is even then return exp_by_squaring2;
else if n is odd then return exp_by_squaring2.

A tail-recursive variant may also be constructed using a pair of accumulators instead of an auxiliary function as seen in the F# example below. The accumulators a1 and a2 can be thought of as storing the values and where i and j are initialized to 1 and 0 respectively. In the even case i is doubled, and in the odd case j is increased by i. The final result is where.

let exp_by_squaring x n =
let rec _exp x n' a1 a2 =
if n' = 0 then 1
elif n' = 1 then a1*a2
elif n'%2 = 0 then _exp x a2
else _exp x a1
_exp x n x 1

The iterative version of the algorithm also uses a bounded auxiliary space, and is given by

Function exp_by_squaring_iterative
if n < 0 then
x := 1 / x;
n := -n;
if n = 0 then return 1
y := 1;
while n > 1 do
if n is even then
x := x * x;
n := n / 2;
else
y := x * y;
x := x * x;
n := / 2;
return x * y

Computational complexity

A brief analysis shows that such an algorithm uses squarings and at most multiplications, where denotes the floor function. More precisely, the number of multiplications is one less than the number of ones present in the binary expansion of n. For n greater than about 4 this is computationally more efficient than naively multiplying the base with itself repeatedly.
Each squaring results in approximately double the number of digits of the previous, and so, if multiplication of two d-digit numbers is implemented in O operations for some fixed k, then the complexity of computing xⁿ is given by

2^''k''-ary method

This algorithm calculates the value of xⁿ after expanding the exponent in base 2^k. It was first proposed by Brauer in 1939. In the algorithm below we make use of the following function f = and f =, where m = u·2^s with u odd.
Algorithm:
;Input: An element x of G, a parameter k > 0, a non-negative integer and the precomputed values.
;Output: The element xⁿ in G
y := 1; i := l - 1
while i ≥ 0 do
:= f
for j := 1 to k - s do
y := y²
y := y * x^u
for j := 1 to s do
y := y²
i := i - 1
return y
For optimal efficiency, k should be the smallest integer satisfying

Sliding-window method

This method is an efficient variant of the 2^k-ary method. For example, to calculate the exponent 398, which has binary expansion ₂, we take a window of length 3 using the 2^k-ary method algorithm and calculate 1, x³, x⁶, x¹², x²⁴, x⁴⁸, x⁴⁹, x⁹⁸, x⁹⁹, x¹⁹⁸, x¹⁹⁹, x³⁹⁸.
But, we can also compute 1, x³, x⁶, x¹², x²⁴, x⁴⁸, x⁹⁶, x¹⁹², x¹⁹⁸, x¹⁹⁹, x³⁹⁸, which saves one multiplication and amounts to evaluating ₂
Here is the general algorithm:
Algorithm:
;Input: An element x of G, a non negative integer, a parameter k > 0 and the pre-computed values.
;Output: The element xⁿ ∈ G.
Algorithm:
y := 1; i := l - 1
while i > -1 do
if n_i = 0 then
y := y²' i := i - 1
else
s := max
while n_s = 0 do
s := s + 1
for h := 1 to i - s + 1 do
y := y²
u := ₂
y := y * x^u
i := s - 1
return y

Montgomery's ladder technique

Many algorithms for exponentiation do not provide defence against side-channel attacks. Namely, an attacker observing the sequence of squarings and multiplications can recover the exponent involved in the computation. This is a problem if the exponent should remain secret, as with many public-key cryptosystems. A technique called "Montgomery's ladder" addresses this concern.
Given the binary expansion of a positive, non-zero integer n = ₂ with n_k−1 = 1, we can compute xⁿ as follows:
x₁ = x; x₂ = x²
for i = k - 2 to 0 do
If n_i = 0 then
x₂ = x₁ * x₂; x₁ = x₁²
else
x₁ = x₁ * x₂; x₂ = x₂²
return x₁
The algorithm performs a fixed sequence of operations : a multiplication and squaring takes place for each bit in the exponent, regardless of the bit's specific value. A similar algorithm for multiplication by doubling exists.
This specific implementation of Montgomery's ladder is not yet protected against cache timing attacks: memory access latencies might still be observable to an attacker, as different variables are accessed depending on the value of bits of the secret exponent. Modern cryptographic implementations use a "scatter" technique to make sure the processor always misses the faster cache.

Fixed-base exponent

There are several methods which can be employed to calculate xⁿ when the base is fixed and the exponent varies. As one can see, precomputations play a key role in these algorithms.

Yao's method

Yao's method is orthogonal to the -ary method where the exponent is expanded in radix and the computation is as performed in the algorithm above. Let,,, and be integers.
Let the exponent be written as
where for all.
Let.
Then the algorithm uses the equality
Given the element of, and the exponent written in the above form, along with the precomputed values, the element is calculated using the algorithm below:
y = 1, u = 1, j = h - 1
while j > 0 do
for i = 0 to w - 1 do
if n_i = j then
u = u × x^b_i
y = y × u
j = j - 1
return y
If we set and, then the values are simply the digits of in base. Yao's method collects in u first those that appear to the highest power ; in the next round those with power are collected in as well etc. The variable y is multiplied times with the initial, times with the next highest powers, and so on.
The algorithm uses multiplications, and elements must be stored to compute.

Euclidean method

The Euclidean method was first introduced in Efficient exponentiation using precomputation and vector addition chains by P.D Rooij.
This method for computing in group, where is a natural integer, whose algorithm is given below, is using the following equality recursively:
where.
In other words, a Euclidean division of the exponent by is used to return a quotient and a rest.
Given the base element in group, and the exponent written as in Yao's method, the element is calculated using precomputed values and then the algorithm below.
Begin loop

Break loop

End loop;
The algorithm first finds the largest value among the and then the supremum within the set of.
Then it raises to the power, multiplies this value with, and then assigns the result of this computation and the value modulo.

Further applications

The same idea allows fast computation of large exponents modulo a number. Especially in cryptography, it is useful to compute powers in a ring of integers modulo q. It can also be used to compute integer powers in a group, using the rule
The method works in every semigroup and is often used to compute powers of matrices.
For example, the evaluation of
would take a very long time and lots of storage space if the naïve method were used: compute 13789⁷²²³⁴¹, then take the remainder when divided by 2345. Even using a more effective method will take a long time: square 13789, take the remainder when divided by 2345, multiply the result by 13789, and so on. This will take less than modular multiplications.
Applying above exp-by-squaring algorithm, with "*" interpreted as x * y = xy mod 2345 leads to only 27 multiplications and divisions of integers, which may all be stored in a single machine word.

Example implementations

Computation by powers of 2

This is a non-recursive implementation of the above algorithm in Ruby.
n = n - 1 is redundant when n = n / 2 implicitly rounds towards zero, as strongly-typed languages with integer division would do. n is the rightmost bit of the binary representation of n, so if it is 1, then the number is odd, and if it is zero, then the number is even. It is also n modulo 2.

def power
result = 1
while n.nonzero?
if n.nonzero?
result *= x
n -= 1
end
x *= x
n /= 2
end
return result
end

Runtime example: compute 3¹⁰

parameter x = 3
parameter n = 10
result := 1
Iteration 1
n = 10 -> n is even
x := x² = 3² = 9
n := n / 2 = 5
Iteration 2
n = 5 -> n is odd
-> result := result * x = 1 * x = 1 * 3² = 9
n := n - 1 = 4
x := x² = 9² = 3⁴ = 81
n := n / 2 = 2
Iteration 3
n = 2 -> n is even
x := x² = 81² = 3⁸ = 6561
n := n / 2 = 1
Iteration 4
n = 1 -> n is odd
-> result := result * x = 3² * 3⁸ = 3¹⁰ = 9 * 6561 = 59049
n := n - 1 = 0
return result

Runtime example: compute 3¹⁰

result := 3
bin := "1010"
Iteration for digit 4:
result := result² = 3² = 9
1010_bin - Digit equals "0"

Iteration for digit 3:
result := result² = ² = 3⁴ = 81
1010_bin - Digit equals "1" --> result := result*3 = ²*3 = 3⁵ = 243
Iteration for digit 2:
result := result² = ² = 3¹⁰ = 59049
1010_bin - Digit equals "0"
return result
This example is based on the algorithm above. If calculated by hand, should go from left to right. If the start number is 1, just ignore it. Then if the next is one, square and multiply. If the next is zero, only square.

Calculation of products of powers

Exponentiation by squaring may also be used to calculate the product of 2 or more powers. If the underlying group or semigroup is commutative, then it is often possible to reduce the number of multiplications by computing the product simultaneously.

Example

The formula a⁷×b⁵ may be calculated within 3 steps:
so one gets 8 multiplications in total.
A faster solution is to calculate both powers simultaneously:
which needs only 6 multiplications in total. Note that a×b is calculated twice; the result could be stored after the first calculation, which reduces the count of multiplication to 5.
Example with numbers:
Calculating the powers simultaneously instead of calculating them separately always reduces the count of multiplications if at least two of the exponents are greater than 1.

Using transformation

The example above a⁷×b⁵ may also be calculated with only 5 multiplications if the expression is transformed before calculation:
Generalization of transformation shows the following scheme:
For calculating a^A×b^B×...×m^M×n^N

Define ab := a×b, abc = ab×c,...
Calculate the transformed expression a^A−B×ab^B−C×...×abc..m^M−N×abc..mn^N.

Transformation before calculation often reduces the count of multiplications, but in some cases it also increases the count, so it may be a good idea to check the count of multiplications before using the transformed expression for calculation.

Examples

For the following expressions the count of multiplications is shown for calculating each power separately, calculating them simultaneously without transformation, and calculating them simultaneously after transformation.

Example	a⁷×b⁵×c³	a⁵×b⁵×c³	a⁷×b⁴×c¹
separate	× ×	× ×	× ×
simultaneous	²×a×b×c	²×a×b×c	²×a×c
transformation	a := a ab := a×b abc := ab×c	a := a ab := a×b abc := ab×c	a := a ab := a×b abc := ab×c
calculation after that	²×abc	²×abc	²×a×ab×abc

Signed-digit recoding

In certain computations it may be more efficient to allow negative coefficients and hence use the inverse of the base, provided inversion in is "fast" or has been precomputed. For example, when computing, the binary method requires multiplications and squarings. However, one could perform squarings to get and then multiply by to obtain.
To this end we define the signed-digit representation of an integer in radix as
Signed binary representation corresponds to the particular choice and. It is denoted by. There are several methods for computing this representation. The representation is not unique. For example, take : two distinct signed-binary representations are given by and, where is used to denote. Since the binary method computes a multiplication for every non-zero entry in the base-2 representation of, we are interested in finding the signed-binary representation with the smallest number of non-zero entries, that is, the one with minimal Hamming weight. One method of doing this is to compute the representation in non-adjacent form, or NAF for short, which is one that satisfies and denoted by. For example, the NAF representation of 478 is. This representation always has minimal Hamming weight. A simple algorithm to compute the NAF representation of a given integer with is the following:
for to do

Another algorithm by Koyama and Tsuruoka does not require the condition that ; it still minimizes the Hamming weight.

Alternatives and generalizations

Exponentiation by squaring can be viewed as a suboptimal addition-chain exponentiation algorithm: it computes the exponent by an addition chain consisting of repeated exponent doublings and/or incrementing exponents by one only. More generally, if one allows any previously computed exponents to be summed, one can sometimes perform the exponentiation using fewer multiplications. The smallest power where this occurs is for n = 15:
In general, finding the optimal addition chain for a given exponent is a hard problem, for which no efficient algorithms are known, so optimal chains are typically only used for small exponents. However, there are a number of heuristic algorithms that, while not being optimal, have fewer multiplications than exponentiation by squaring at the cost of additional bookkeeping work and memory usage. Regardless, the number of multiplications never grows more slowly than Θ, so these algorithms only improve asymptotically upon exponentiation by squaring by a constant factor at best.

Popular movies

The Hunger Games (film) - 2012 American dystopian action thriller science fiction-adventure film directed by Gary Ross and based on Suzanne Collins’s 2008 novel of the same name. It is the first insta...
untitled Captain Marvel sequel - part of Marvel Cinematic Universe....
Killers of the Flower Moon (film project) - Killers of the Flower Moon - film project in United States of America. It was presented as drama, detective fiction, thriller. The film project starred Leonardo Dicaprio, Robert De Niro. Director of...
Five Nights at Freddy's (film) - Five Nights at Freddy's - film published in 2017 in United States of America. Scenarist of the film - Scott Cawthon....

Popular books

Book of Revelation - The Book of Revelation is the final book of the New Testament, and consequently is also the final book of the Christian Bible. Its title is derived from the first word of the Koine Greek text: apok...
Book of Genesis - account of the creation of the world, the early history of humanity, Israel's ancestors and the origins...
Gospel of Matthew - The Gospel According to Matthew is the first book of the New Testament and one of the three synoptic gospels. It tells how Israel's Messiah, rejected and executed in Israel, pronounces judgement on ...
Michelin Guide - Michelin Guides are a series of guide books published by the French tyre company Michelin for more than a century. The term normally refers to the annually published Michelin Red Guide , the oldest...
Psalms - The Book of Psalms , commonly referred to simply as Psalms , the Psalter or "the Psalms", is the first book of the Ketuvim , the third section of the Hebrew Bible, and thus a book of th...
Ecclesiastes - Ecclesiastes is one of 24 books of the Tanakh , where it is classified as one of the Ketuvim . Originally written c. 450–200 BCE, it is also among the canonical Wisdom literature of the Old Tes...
The 48 Laws of Power - non-fiction book by American author Robert Greene. The book...

Popular television series

The Crown (TV series) - historical drama web television series about the reign of Queen Elizabeth II, created and principally written by Peter Morgan, and produced by Left Bank Pictures and Sony Pictures Tel...
Friends - American sitcom television series, created by David Crane and Marta Kauffman, which aired on NBC from September 22, 1994, to May 6, 2004, lasting ten seasons. With an ensemble cast sta...
Young Sheldon - spin-off prequel to The Big Bang Theory and begins with the character Sheldon...
Modern Family - American television mockumentary family sitcom created by Christopher Lloyd and Steven Levitan for the American Broadcasting Company. It ran for eleven seasons, from September 23...
Loki (TV series) - upcoming American web television miniseries created for Disney+ by Michael Waldron, based on the Marvel Comics character of the same name. It is set in the Marvel Cinematic Universe, shar...
Game of Thrones - American fantasy drama television series created by David Benioff and D. B. Weiss for HBO. It...
Shameless (American TV series) - American comedy-drama television series developed by John Wells which debuted on Showtime on January 9, 2011. It...

Exponentiation by squaring

Basic method

Computational complexity

2''k''-ary method

Sliding-window method

Montgomery's ladder technique

Fixed-base exponent

Yao's method

Euclidean method

Further applications

Example implementations

Computation by powers of 2

Runtime example: compute 310

Runtime example: compute 310

Calculation of products of powers

Example

Using transformation

Examples

Signed-digit recoding

Alternatives and generalizations

2^''k''-ary method

Runtime example: compute 3¹⁰

Runtime example: compute 3¹⁰