Automatic sequence
In mathematics and theoretical computer science, an automatic sequence is an infinite sequence of terms characterized by a finite automaton. The n-th term of an automatic sequence a is a mapping of the final state reached in a finite automaton accepting the digits of the number n in some fixed base k.
An automatic set is a set of non-negative integers S for which the sequence of values of its characteristic function χS is an automatic sequence; that is, S is k-automatic if χS is k-automatic, where χS = 1 if n S and 0 otherwise.
Definition
Automatic sequences may be defined in a number of ways, all of which are equivalent. Four common definitions are as follows.Automata-theoretic
Let k be a positive integer, and let D = be a deterministic finite automaton with output, where- Q is the finite set of states;
- the input alphabet Σk consists of the set of possible digits in base-k notation;
- δ : Q × Σk → Q is the transition function;
- q0 ∈ Q is the initial state;
- the output alphabet Δ is a finite set; and
- τ : Q → Δ is the output function mapping from the set of internal states to the output alphabet.
Define a function a from the set of positive integers to the output alphabet Δ as follows:
where s is n written in base k. Then the sequence a = a'a'a... is a k-automatic sequence.
An automaton reading the base k digits of s starting with the most significant digit is said to be direct reading, while an automaton starting with the least significant digit is reverse reading. The above definition holds whether s is direct or reverse reading.
Substitution
Let be a k-uniform morphism of a free monoid and let be a coding, as in the automata-theoretic case. If is a fixed point of —that is, if —then is a k-automatic sequence. Conversely, every k-automatic sequence is obtainable in this way. This result is due to Cobham, and it is referred to in the literature as Cobham's little theorem.''k''-kernel
Let k ≥ 2. The k-kernel of the sequence s is the set of subsequencesIn most cases, the k-kernel of a sequence is infinite. However, if the k-kernel is finite, then the sequence s is k-automatic, and the converse is also true. This is due to Eilenberg.
It follows that a k-automatic sequence is necessarily a sequence on a finite alphabet.
Formal power series
Let u be a sequence over an alphabet Σ and suppose that there is an injective function β from Σ to the finite field Fq, where q = pn for some prime p. The associated formal power series isThen the sequence u is q-automatic if and only if this formal power series is algebraic over Fq. This result is due to Christol, and it is referred to in the literature as Christol's theorem.
History
Automatic sequences were introduced by Büchi in 1960, although his paper took a more logico-theoretic approach to the matter and did not use the terminology found in this article. The notion of automatic sequences was further studied by Cobham in 1972, who called these sequences "uniform tag sequences".The term "automatic sequence" first appeared in a paper of Deshouillers.
Examples
The following sequences are automatic:Thue–Morse sequence
The Thue–Morse sequence t is the fixed point of the morphism 0 → 01, 1 → 10. Since the n-th term of the Thue–Morse sequence counts the number of ones modulo 2 in the base-2 representation of n, it is generated by the two-state deterministic finite automaton with output pictured here, where being in state q0 indicates there are an even number of ones in the representation of n and being in state q1 indicates there are an odd number of ones.Hence, the Thue–Morse sequence is 2-automatic.
Period-doubling sequence
The n-th term of the period-doubling sequence d is determined by the parity of the exponent of the highest power of 2 dividing n. It is also the fixed point of the morphism 0 → 01, 1 → 00. Starting with the initial term w = 0 and iterating the 2-uniform morphism φ on w where φ = 01 and φ = 00, it is evident that the period-doubling sequence is the fixed-point of φ and thus it is 2-automatic.Rudin–Shapiro sequence
The n-th term of the Rudin–Shapiro sequence r is determined by the number of consecutive ones in the base-2 representation of n. The 2-kernel of the Rudin–Shapiro sequence isSince the 2-kernel consists only of r, r, r, and r, it is finite and thus the Rudin–Shapiro sequence is 2-automatic.
Other sequences
Both the Baum–Sweet sequence and the regular paperfolding sequence are automatic. In addition, the general paperfolding sequence with a periodic sequence of folds is also automatic.Properties
Automatic sequences exhibit a number of interesting properties. A non-exhaustive list of these properties is presented below.- Every automatic sequence is a morphic word.
- For k ≥ 2 and r ≥ 1, a sequence is k-automatic if and only if it is kr-automatic. This result is due to Eilenberg.
- For h and k multiplicatively independent, a sequence is both h-automatic and k-automatic if and only if it is ultimately periodic. This result is due to Cobham, with a multidimensional generalisation due to Semenov.
- If u is a k-automatic sequence over an alphabet Σ and f is a uniform morphism from Σ∗ to another alphabet Δ∗, then f is a k-automatic sequence over Δ.
- If u is a k-automatic sequence, then the sequences u and u are ultimately periodic. Conversely, if u is an ultimately periodic sequence, then the sequence v defined by v = u and otherwise zero is k-automatic.
Proving and disproving automaticity
be the Thue–Morse word. Let be the word given by concatenating successive terms in the sequence of run-lengths of. Then begins
It is known that is the fixed point of the morphism
The word is not 2-automatic, but certain elements of its 2-kernel agree for many terms. For example,
but not for.
Given a sequence that is conjectured to be automatic, there are a few useful approaches to proving it actually is. One approach is to directly construct a deterministic automaton with output that gives the sequence. Let written in the alphabet, and let denote the base- expansion of. Then the sequence is -automatic if and only each of the fibres
is a regular language. Checking regularity of the fibres can often be done using the pumping lemma for regular languages.
If denotes the sum of the digits in the base- expansion of and is a polynomial with non-negative integer coefficients, and if, are integers, then the sequence
is -automatic if and only if or..
1-automatic sequences
k-automatic sequences are normally only defined for k ≥ 2. The concept can be extended to k = 1 by defining a 1-automatic sequence to be a sequence whose n-th term depends on the unary notation for n; that is, n. Since a finite state automaton must eventually return to a previously visited state, all 1-automatic sequences are ultimately periodic.Generalizations
Automatic sequences are robust against variations to either the definition or the input sequence. For instance, as noted in the automata-theoretic definition, a given sequence remains automatic under both direct and reverse reading of the input sequence. A sequence also remains automatic when an alternate set of digits is used or when the base is negated; that is, when the input sequence is represented in base −k instead of in base k. However, in contrast to using an alternate set of digits, a change of base may affect the automaticity of a sequence.The domain of an automatic sequence can be extended from the natural numbers to the integers via two-sided automatic sequences. This stems from the fact that, given k ≥ 2, every integer can be represented uniquely in the form where. Then a two-sided infinite sequence an is -automatic if and only if its subsequences an ≥ 0 and an ≥ 0 are k-automatic.
The alphabet of a k-automatic sequence can be extended from finite size to infinite size via k-regular sequences. The k-regular sequences can be characterized as those sequences whose k-kernel is finitely-generated. Every bounded k-regular sequence is automatic.
Logical approach
For many 2-automatic sequences, the map has the property that the first-order theory is decidable. Since many non-trivial properties of automatic sequences can be written in first-order logic, it is possible to prove these properties mechanically by executing the decision procedure.For example, the following properties of the Thue–Morse word can all be verified mechanically in this way:
- The Thue–Morse word is overlap-free, i.e., it does not contain a word of the form where is a single letter and is a possibly empty word.
- A non-empty word is bordered if there is a non-empty word and a possibly empty word with. The Thue–Morse word contains a bordered factor for each length greater than 1.
- There is an unbordered factor of length in the Thue–Morse word if and only if where denotes the binary representation of.