1 Introduction

Starting with the launch of Explorer 1, on January 31, 1958 of the first application of pseudorandom sequences in space communications pioneered by Solomon Golomb when he worked in the Communication Research Group at the Jet Propulsion Laboratory, sequence design has played an important role from radio communication to our current 5G cellular wireless communications for anti-noise and reducing multi-path and multiple access interference. The initial study of sequences has been presented in Golomb’s book: Shift Register Sequences [21], which leads the research in sequence design more than seven decades [27]. With respect to digital communications, it has found many applications described in the following incomplete list:

  • Radar distance ranging, signal processing, and multi-sensory sensing for higher resolution.

  • Interference rejection in multi-path interference, channel estimation.

  • Energy density reduction, anti-jaming, low probability interception, and multiple access.

  • Reduction of peak-to-average power ratio (PAPR) or peak-to-mean envelope power ratio (PMEPR) for orthogonal frequency division multiplexing (OFDM) systems.

  • It also can offer a certain degree of privacy.

Pseudorandom sequences also have many applications in cryptography, computer science, game theory, DNA processing, etc., to just list a few. In this survey, we focus on sequence design for wireless communications and some relations to cryptography. The current research on sequence design can be classified into three different categories: a) sequences with low periodic correlation; b) complementary sequence sets (CSS) and complete complementary codes (CCC), measured by aperiodic correlation; and c) sequence sets with low ambiguity functions for both periodic and aperiodic sequences.

It is almost impossible without a very voluminous space to survey recent developments of sequences. For sequences with low correlation, we have the following comments.

  1. (a)

    There is a large volume of research on the constructions of sequence sets with low correlation and large sizes for code-division multiple access (CDMA) applications. A number of survey papers, authored by renowned researchers have been published in the literature. For example, an excellent survey paper by Kumar and Helleseth [32], entitled as Sequences with Low Correlation, appeared in Handbook of Coding Theory (1998), has covered all the developments from the constructions to the bounds for optimality. A recent survey on sequences with optimal autocorrelation and the constructions for de Bruijn sequences is authored by Helleseth and Li [33], an earlier survey on binary and quaternary sequences by Lück, Schotten and Hadinejad-Mahram [44], and a survey on sequences with good aperiodic correlation by Katz [36], to just list a few. The reader is referring to the references therein for the original contributions.

  2. (b)

    Given a channel condition, the signal sets with low correlation or zero correlation confined in a certain interval instead of the entire period [62] are referred to as low or zero correlation zone sequences, which have stimulated large publications in the literature in theory, for example [25, 64, 65] and the references therein. However, it is problematic to be implemented in practical systems due to the heavy hardware cost for real-time channel estimation.

In this survey, we attempt to present some work and comments on the sequences with 2-level autocorrelation, sequence sets with low ambiguity, and CSSs and CCCs.

1.1 Sequences with 2-level autocorrelation and their applications

As we have mentioned above, the research on sequences with good correlation started with linear feedback shift register (LFSR) sequences [21], and first used in Explorer 1 to detect transmitted signals from the Explorer 1 back to the earth. This event was explained by Golomb in his paper [22] wonderfully. Successfully, Golomb applied it to estimate the distance from the earth to Venus [45]. Those two applications utilize the autocorrelation property of m-sequences, i.e., all the values of the out-of-phase autocorrelation are equal to − 1 (the so-called an (ideal) 2-level autocorrelation function). A sequence with 2-level autocorrelation resembles a white Gaussian noise random process, so it has the best capability to distinguish itself from interference caused by multi-path fading. All the known constructions on binary sequences with 2-level autocorrelation are collectively surveyed in Golomb and Gong’s book [19], published in 2005. Unfortunately, until now, there are no new constructions (even conjectures) on 2-level autocorrelation sequences, especially binary sequences. Nevertheless, there is a dramatical progress on proving conjectured ternary 2-level autocorrelation sequences [3, 35].

Impulse response behaviour and cryptographic connections

The investigation for the applications of 2-level autocorrelation sequences (behaving approximately an impulse response) for wireless transmission including radar distance range and for cryptography such as generating key stream for stream cipher encryption is always a twin-brain body, which is inseparable since the creation of this area by Golomb. Astonishingly, when he proposed to use maximal length LFSR sequences (i.e., m sequences) in 1957 for the application of the space navigation [21], Golomb [20] also published a paper, entitled as “On the Classification of Boolean Functions” in 1959, in which he has essentially proposed the concepts of correlation immunity and resiliency of Boolean functions in the form of the invariants of Boolean functions, and also studied the invariants using the Rademacher-Walsh expansion (which is equivalent to Hadamard transform or Walsh-Hadamard transform). Those concepts are currently in the core of modern cryptography and cryptanalysis.

One version of the Globe Position System (GPS) employs the LFSRs of degree 10 and degree 13 as general navigation codes (i.e., m-sequences with periods 210 − 1 = 1023 and 213 − 1 = 8191 respectively). However, those sequences are too short, attackers can spoof the signals sending from a satellite to a device which can compute its location. The spoofed signal can make a fault location for the victim device. Those attacks have been reported in some diplomatic cases. We understand that there are many other 2-level autocorrelation sequences in those lengths which can provide a stronger capability to resist the spoof attack. However, the problem is how to balance the hardware cost, performance and security in those systems. Unfortunately, in many real world applications, it will be in favor of reducing the cost first.

Indistinguishability from out-of-phase sequences

The Welch-Gong (WG) stream cipher was submitted to the eSTREAM Project of the eCRYPT network in 2004 [46]. This cipher is based on the WG sequences with 2-level autocorrelation (see [19]) which remains as secure up to the date. A new authenticated encryption (AE) algorithm, called WAGE (i.e., WG in AE mode), was submitted to the NIST Lightweight Cryptography Standardization Competition in 2019 [1, 2] and was advanced to the round 2 candidate. It differs from the other ciphers due to its capability to switch to the original WG stream cipher where the key stream has 2-level autocorrelation. This property provides the indistinguishability for any shifts of the key stream. (For the history of WG sequences, the reader is referring to [19] and the references therein.)

Actually, in cryptographic applications, key-stream sequences with low aperiodic autocorrelation are requested, since in general, attacker cannot get the entire space of ciphertexts by conducting chosen plaintext attacks. Nevertheless, it constitutes a challenge to construct binary sequences with low aperiodic autocorrelation. Thus, a thumb rule is that the key stream sequences should have good periodic correlation, because Boehmer [5] observed that having low periodic correlation at all shifts is a necessary but not sufficient condition for having low aperiodic correlation at all shifts.

Paths leading to the proofs of the conjectured ternary sequences

One of the most exciting results on 2-level autocorrelation sequences in recent 20 years is the construction of the families of binary 2-level autocorrelation sequences, discovered by Dillion and Dobbertin, published in 2004 [13], which includes the conjectured 3-term, 5-term and WG sequences as special cases of their construction. Their construction is fairly simple which can be easily explained as follows. Let d = 22k − 2k + 1, which is the so-called Kasami exponent in the power function xd for \(x\in \mathbb {F}_{2^{n}}\) where \(\gcd (k, n)=1\) and \(\mathbb {F}_{2^{n}}\) is a finite field with 2n elements. Let a difference set be

$${{\varDelta}} = \left\{(x+1)^{d} +x^{d}+1 | x\in {\mathbb{F}}_{2^{n}} \right\}.$$

Then a binary sequence f = {f(t)} is defined as: f(t) = 0 if αtΔ, f(t) = 1 otherwise where α is a primitive element in \(\mathbb {F}_{2^{n}}\), which has 2-level autocorrelation. However, the proof is deeply involved in very complicated manipulations of the difference set Δ. Also there are many remarkably hidden treasures in their paper, such as a set of new almost perfect nonlinear (APN) functions, new permutations of \(\mathbb {F}_{2^{n}}\), and Hadamard equivalent classes. Recently, Carlet [9] has revisited this construction and attempted to establish the result of his new notion, called component Walsh uniformity, over the functions given by the difference set Δ.

The method used by Dillion and Dobbertin is the Hadamard equivalence relation. This has been further explored by Golomb and Gong, which showed that all the known binary 2-level autocorrelation sequences can be obtained by repeating applying the decimation and Hadamard transform twice starting with an m-sequence (this transform is referred to as the second-order decimation-Hadamard transform (DHT), which will be defined in Section 2). Their work also shined a light along the path to settle the validity of Lin conjectured ternary 2-level autocorrelation sequences (1998) and to discover Ludkovski-Gong conjectured sequences (2000) which were obtained by applying the 2nd-order DHT to the Lin conjectured sequences. The proof given by Hu, Shao, Gong and Helleseth [35] in 2014 is through the 2nd-order DHT, and the second proof, given by Arasu, Dillion and Player [3] in 2015, is through character sum factorizations. Arasu-Dillion-Player also asserted the validity of Ludkovski-Gong conjectures.

So, we believe that it is time to revisit the research on 2-level autocorrelation sequences, since they not only play important roles in applications in 5G and beyond cellular communications, but also serve as building blocks for the constructions of sequences with optimal autocorrelation, low or zero correlation zone sequences as well as the salient applications in cryptography.

1.2 Sequences with low ambiguity

An auto ambiguity function of a one-dimensional sequence is a two-dimensional function in both time and frequency. The low values of an ambiguity function outside of the origin (0, 0) are required for determining the range (proportional to the time-delay) and for combating the Doppler shift (the velocity to or from the observer, proportional to the frequency shift, resulting in a phase shift) of a transmitted signal. Sequences with low ambiguity function can be achieved by Costas arrays, which yield the so-called ideal or thumb-tack ambiguity function (which only takes the values 0 or 1 at any shift not (0, 0)). But this is only applied to a single sequence, not a sequence set.

Heisenberg and Weil representation sequences

Heisenberg and Weil representation sequences have played a central role in the development of sequence sets with low ambiguity. In 2006, Calderbank’s group investigated sequences constructed from the Heisenberg representation [34]. Although the resulting sequences turned out to be the Frank-Chu sequences, this opens another direction of sequence design. In 2008, Gurevich, Hadani, and Sochen [28] introduced sequences constructed from the Weil representation. However, their sequences are not given in a mathematical analytical form, instead given in terms of an algorithm. Following their work, Wang and Gong [68] provided a new construction aiming for an explicit analytic formula for those sequences. Wang-Gong proved that their construction gives a simple elementary expression for the sequences from the Weil representation by Gurevich-Hadani-Sochen [28].

Interestingly, the sequences constructed by Wang and Gong are the element-wise product of Frank-Chu sequences and Legendre sequences, which can be respectively represented by additive characters and multiplicative characters of the finite field \(\mathbb {F}_{p}\). In their paper, they asked for a direct proof for the correlation, ambiguity, and discrete Fourier transform (DFT). Shortly, Schmidt [57] responded this call, and found that the bounds for those metrics can be obtained directly by applying the Weil bound on hybrid character sums without any complex machinery deviation, and the bounds are also improved, compared with those established through the Weil representation by Gurevich, Hadani, and Sochen. We call the class constructed by Wang-Gong the Weil representation sequences, since they can also be constructed through the Weil representation.

Wang, Gong and Yu [70] extended the construction of the Weil representation sequences while the bounds on correlation, ambiguity, and DFT remain unchanged. Up to now, those sequence sets are the best with respect to correlation, ambiguity, DFT, and the sizes. In addition of those classes, there is another sequence set, proposed by Ding et al. [14], in which each sequence is the sum of the reciprocal pairs of m-sequences (we may call it a Kloosterman sequence, since it corresponds to the Kloosterman’s sum). Ding et al. showed the bound on the ambiguity functions of the Kloosterman sequences using Li’s beautiful result [40] on the DFT of the Kloosterman sequences.

New wireless transmission systems call for new sequences

With rapid developments of 5G and beyond cellular communication systems, it calls new sequences for the applications in massive multi-input multi-output (MIMO), MIMO-OFDM, millimeter-wave (mm-wave) modulation systems, and the combinations of different modulation systems, such as orthogonal time frequency space (OTFS) modulation [29]. Basically, OTFS modulation first maps information symbols into a 2-dimensional signal which lies in a 2-dimensional time-delay and Doppler-shift space (i.e., a delay-Doppler space), then applies the so-called symplectic Fourier transform (i.e., a special form of 2-dimensional Fourier transform), finally transfers to a time-domain signal for antenna transmission. So, this can be considered as a pre-coded OFDM transmission system. For those combinations of transmission systems, it requests the sequence sets with the capability for PMEPR reduction, low ambiguity, and large sizes. Another class of transmission system is sparse code multiple access. The codebook for those schemes is the combination of spreading code and frequency division multiple access. So similarly, it demands new sequence sets with good correlation and extraordinary larger sizes.

Gong [23] has presented a survey with a lot of deeper results on the sequences with low ambiguity, and Ding et al. [14] has investigated some bounds on ambiguity functions together with some new constructions with low ambiguity. The reader is referring to these two papers and the references therein. In this survey, we revisit the existing work for constructing sequence sets with low ambiguity, and provide some parameterized examples.

1.3 Golay complementary sequence sets and complete complementary codes

There exists no sequences with zero aperiodic autocorrelation at each out-of-phase shift. Innovatively, Golay [18] (1961) proposed to use two sequences which have the zero sum of their autocorrelation values at each out-of-phase shift, i.e., the orthogonality of a shifted sequence against the sequence is established by two sequences when he studied infrared spectrometry [17]. Such two sequences are called a Golay pair. About ten years later, Liu and Tseng [66] extended the concept of Golay pairs to a set of the sequences with the zero sum of their out-of-phase autocorrelation values, called complementary sequence set (CSS) for the binary case. In a few years later, Sivaswamy [59] (1978) extended it to the polyphase case. The other important concept introduced by Liu and Tseng in the same paper is the mutually orthogonal CSS. A largest set consisting of multiple mutually orthogonal CSSs is referred to as a complete complementary code (CCC).

Golay pairs and CSSs have found many applications in physics, combinatorics and telecommunications such as channel measurement, synchronization and spread spectrum communications. In particular, it has been proposed for power reduction in OFDM transmission due to high PMEPR of uncoded OFDM signals [12, 50]. CCCs were proposed to serve as a certain orthogonal transform, similar as the DFT, but the entries in a CCC matrix are sequences and the operation is the computation of cross/autocorrelation [66]. It also found an application in constructing zero correlation zone sequences more recently [52, 64, 65].

How can one systematically generate Golay pairs and CSSs? Golay [18] first proposed two recursive algorithms to construct binary Golay pairs of length 2m. Davis and Jedwab realized that the algebraic structure of Golay sequences can be given by the specific second-order cosets of the first-order generalized Reed-Muller (GRM) code in their milestone work [12] in 1999. This is the so-called generalized Boolean function (GBF) method. Along this line, Paterson [50, 51], Schmidt [55, 56], and many excellent researchers contributed a large volume of works along this direction in the past years, including the constructions of the Golay pairs with low PMEPR over quadrature amplitude modulation (QAM) constellation which were first investigated by Tarokh et al. [53], just to list a few. For a more complete list of the references, please see [71].

The other approach is based on Hadamard matrices to construct CSSs and CCCs. In fact, the first recursive construction of CSSs and CCCs in 1972 [66] was obtained by Hadamard matrices. However, how to obtain the explicit function form of the sequences derived by Hadamard-matrix-based methods is a long-standing problem for several decades. So these methods are not as popular as the GBF approaches in the literature. Recently, Budišin [6, 7] and Wang [72] (2016) revisited this method, which has strongly influenced our efforts to follow this path. Recently, we have made a progress [71] for constructing a special type of Hadamard matrices, called parameter unitary (PU) matrices, so are CSSs and CCCs, for which we can systematically extract explicit GBFs for the corresponding sequences, constructed from the PU matrices. This work showed that all known constructions on CSSs and CCCs can be explained by our PU-matrix-based constructions and abundant new ones come out.

In the remaining of this survey, we will introduce some basic concepts and definitions on sequences in Section 2. We will provide a summary of the progress of the sequences with 2-level autocorrelation in Section 3, and the known constructions of sequence sets with low ambiguity and some specific parameters in Section 4. Then we will turn our attention to introduce our progress on constructing CSSs and CCCs by the PU-matrix-based approaches in Section 5. Section 6 contains some concluding remarks and a collection of some aforementioned open problems. We list all acronyms appeared in this survey in Appendix for easy reference.

2 Basic concepts and definitions

In this section, we introduce some basic concepts on sequences and notations which will be used throughout the paper.

  • \(\mathcal {C}\) represents the complex field.

  • For integer q, \({\mathbb {Z}}_{q}=\{0,1, \cdots , q-1\}\) is the residue class ring modulo q. \({\mathbb {Z}}^{*}_{q}\) denotes \(\mathbb {Z}_{q}\backslash \{0\}\).

  • \({\mathbb {F}}_{p^{n}}\) is a finite field where p is a prime and n is a positive integer, and \({\mathbb {F}}_{p^{n}}^{*}\) consists of nonzero elements of \({\mathbb {F}}_{p^{n}}\), and α is a primitive element in \(\mathbb {F}_{p^{n}}\). We also denote Q = pn.

  • The trace function from \({\mathbb {F}}_{p^{n}}\) to \({\mathbb {F}}_{p}\) is defined as

    $$Tr(x)=x+x^{p}+{\cdots} +x^{p^{n-1}}, x\in {\mathbb{F}}_{p^{n}}.$$
  • \(\omega _{k}=e^{\frac {2\pi \sqrt {-1}}{k}}=e^{\frac {2\pi i}{k}}\) (\(i=\sqrt {-1}\)) is a k th primitive root of unity.

  • For every element \(\beta \in {\mathbb {F}}_{p^{n}}^{*}\), there exists t with \(0\leqslant t\leqslant p^{n}-2\), such that β = αt. In other words, \(t=\log _{\alpha } \beta\), the discrete logarithm of β under the base α.

  • We assume that \(\log 0=0\) as a convention, and set \(\omega _{k}^{\log _{\alpha } 0}=1\).

We use notation s = {s(t)} to denote a complex valued sequence with infinite terms, i.e., \(s(t)\in \mathcal {C}\). If it is a periodic sequence with period L, i.e., s(t + L) = s(t), ∀t, we also denote it by a vector s = (s(0), ⋯ , s(T − 1)). A sequence f = {f(t)} with infinite terms is a q-ary sequence with period L if \(f(t)\in \mathbb {Z}_{q}\) where q is a positive integer, i.e.,

$${\textbf{f}}=(f(0), f(1), \cdots, f(L-1)), f(t)\in {\mathbb{Z}}_{q}.$$

In this survey, we associate a q-ary sequence with a function f, which is a map from \({\mathbb {Z}}_{L}\) to \({\mathbb {Z}}_{q}\). The representation of f depends on L, which will be introduced in more detail in the later sections. We always use f, the bold f to represent the sequence and f, the function.

2.1 Periodic correlation and perfect sequences

Definition 1

Given two complex valued sequences, say s0 and s1 of period L, their periodic crosscorrelation function is defined as

$$R_{{\textbf{s}}_{0}, {\textbf{s}}_{1}}(\tau)=\sum\limits_{t=0}^{L-1}s_{1}(t+\tau)\overline{s_{0}(t)}, \tau=0, \pm 1, \pm 2, \cdots$$

where \(\overline {s_{0}(t)}\) is the complex conjugate of s0(t), and t + τ is reduced by modulo L. It becomes the autocorrelation when s0 = s1 = s, denoted as Rs(τ), i.e.,

$$R_{{\textbf{s}}}(\tau)= \sum\limits_{t=0}^{L-1}s(t+\tau)\overline{s(t)}, \tau=0, \pm 1, \pm 2, \cdots,$$

or simply as R(τ) if the contest is clear. For two q-ary sequences f0 and f1, their crosscorrelation function is defined as the crosscorrelation of \(s_{0}(t)=\omega _{q}^{f_{0}(t)}\) and \(s_{1}(t)=\omega _{q}^{f_{1}(t)}\), and it defines the autocorrelation of the sequence when they are equal.

In digital communications, generally, we consider a q-ary sequence {f(t)} for representing information symbols, and its autocorrelation function is defined for the complex sequence \(\left \{ \omega _{q}^{f(t)} \right \}\) instead of {f(t)} due to modulation for transmission.

Note that a periodic cross/autocorrelation function is also periodic with period L. So, we only need to consider 0 ≤ τ < L. The autocorrelation R(τ) at τ is also referred to as an in-phase autocorrelation if τ ≡ 0 mod L, and an out-of-phase autocorrelation if τ ≢ 0 mod L.

Remark 1

The cross/auto correlation function can be defined for two sequences with different periods, i.e., the correlation window length is the \(\gcd\) of two periods.

Example 1

Let f = (1001011), a binary sequence with period 7. Then the autocorrelation of f is given by

$$R(\tau)=\sum\limits_{t=0}^{6} (-1)^{s(t+\tau)+s(t)}.$$

Since this is an m-sequence of period 7, we have

$$R(\tau) = \left\{\begin{array} {ll} 7, & {for } \ \tau \equiv 0 \bmod{7}\\ -1, & \text{otherwise}. \end{array} \right.$$

A q-ary sequence f is called a perfect sequence if all the out-of-phase (periodic) autocorrelation is equal to zero. Namely, the autocorrelation function is a delta function:

$$R(\tau) = \left\{ \begin{array} {ll} L, & {for }\ \tau \equiv 0 \bmod{L}\\ 0, & \text{otherwise}. \end{array} \right.$$

However, for the binary case, the only known perfect sequence, up to equivalence is a = (0111) (see [44] for details). A perfect sequence, treated as a random process, is a white Gaussian noise, i.e., autocorrelation is a delta function. For nonbinary sequences, a Frank-Chu sequence is an integer sequence of period L in \(\mathbb {Z}_{2L}\), the residue ring modular 2L, mapped to a phase-sequence with a 2L th primitive root of unity with perfect autocorrelation, i.e., the out-of-phase of autocorrelation is equal to zero or equivalently, any out-of-phase sequence is orthogonal to the sequence (see [48, 60, 61] for the most recent constructions on p-ary perfect sequences). Frank-Chu sequences are proposed for channel estimation in 4G-LTE systems due to this orthogonality.

For an m-sequence with period L = 2n − 1, we have the normalized autocorrelation:

$$\frac{R(\tau)}L=\left\{\begin{array}{ll}1,&\mathrm{for}\;\tau\equiv0\mod L\\-\frac1L\rightarrow0&(\tau\mod L)\neq0,(n\rightarrow\text{large}).\end{array}\right.$$

This property of m-sequences resembles a white Gaussian noise, which makes it so popular in digital communications after Golomb successfully used it for detecting returning signals from Explorer 1. Examples include GPS, radar distance range, spread spectrum, and hardware testing, etc., which utilize this property.

A q-ary sequence of period L with the same autocorrelation as an m-sequence, is called an (ideal) 2-level autocorrelation sequence. All known binary 2-level autocorrelation sequences are collected in Golomb and Gong’s book [19] which still holds the record up to the date, since there are no new 2-level autocorrelation sequences since then. For nonbinary cases, there is no discovery on new constructions for sequences with 2-level autocorrelation. However, there is a great advance for proving the conjectured ternary 2-level autocorrelation sequences, which will be presented in Section 3.

In addition of the effort for finding perfect nonbinary sequences for the applications in wireless communications, there are two major approaches to approximate perfect binary sequences: one is to use 2-level autocorrelation sequences, and the other is to use Golay pair or CSSs, which will be introduced in the next subsection.

Remark 2

For the optimal autocorrelation function of a binary sequence f of period L, it depends on the value of L:

L

R(τ)

L ≡ 0 mod 4

R(τ) ∈{L,0}

L ≡ 3 mod 4

R(τ) ∈{L,− 1}

L ≡ 1 mod 4

R(τ) ∈{L,1,− 3}

L ≡ 2 mod 4

R(τ) ∈{L,2,− 2}

The first two cases are the cases of perfect sequences and 2-level autocorrelation sequences respectively. For the last two cases, the constructions are largely from residue classes [4, 15]. In general, the last two cases are more costly in their implementations. This observation is solely from their mathematical constructions and computational complexity. Nevertheless, there has been no report in the literature about their implementations yet, which may be worth to do some research on this problem, although it leans to more engineering perspective. In Section 4, we will provide the power residue sequences which is optimal in Case 3.

2.2 Aperiodic correlation and Golay complementary pairs/sets

Given a complex valued sequence of length L, s = (s(0), s(1), ⋯ , s(L − 1)), we extend it to a sequence with an infinite length by setting

$$s(t)=0\; \text{for both}\; t<0\; \text{and}\; t\ge L.$$
(1)

The aperiodic correlation is defined for this aperiodic extension of s.

Definition 2

For two complex valued sequences s0 and s1 of length L, the aperiodic crosscorrelation function of s0 and s1 at shift τ is defined by

$$C_{{\textbf{s}}_{0},{\textbf{s}}_{1}}(\tau)= \sum\limits_{t=0}^{L-1}s_{1}(t+\tau)\overline{s_{0}(t)}, -L< \tau<L.$$

If s0 = s1 = s, it becomes the aperiodic autocorrelation of sequence s at shift τ, i.e.,

$$C_{{\textbf{s}}}(\tau)=C_{{\textbf{s}},{\textbf{s}}}(\tau) = \sum\limits_{t=0}^{L-1}s(t+\tau)\overline{s(t)}, -L<\tau<L$$

or simply as C(τ). For two q-ary sequences f0 and f1 of length L, the aperiodic crosscorrelation function of f0 and f1 at shift τ is defined as the aperiodic crosscorrelation of \(\textbf{s}_{j}=\left \{\omega _{q}^{f_{j}(t)}\right \}, j=0, 1\), the aperiodic autocorrelation of a q-ary sequence f is also defined in analogue.

In this survey, we use \(R_{{\textbf{f}}_{0}, {\textbf{f}}_{1}}(\tau )\) and R(τ) to represent periodic crosscorrelation and autocorrelation functions respectively, and \(C_{\textbf{f}_{0}, \textbf{f}_{1}}(\tau )\) and C(τ), aperiodic crosscorrelation and autocorrelation functions.

Aperiodic correlation functions correspond to the functionality of matched filters in wireless transmission.

Example 2

We take q = 4, L = 4, f = (0,1,0,3). In this case, we have \(\omega _{4} = i, i = \sqrt {-1}\). So, \(\left \{i^{k} | k=0, 1, 2, 3\right \}=\{1, i, -1, -i\}\). Aperiodic autocorrelation computation in the fashion of a matched filter detection is shown as follows.

figure d

Due to the following property, we only need to compute autocorrelation C(τ) for 0 < τ < L.

Proposition 1

$$C(\tau)=\overline{C(-\tau)}, \forall \tau.$$

A perfect sequence can also be defined by aperiodic autocorrelation in a similar way as the periodic case. Unfortunately, there is no single sequence with zero aperiodic autocorrelation at every shift. The best case is the so-called Barker sequences, i.e., the magnitude of the out-of-phase aperiodic autocorrelation values are bounded by 1, i.e., |C(τ)|≤ 1 for τ ≠ 0, −L < τ < L (see [19] for more details about Barker sequences).

Definition 3

A set of q-ary sequences S = {f0, f1, ⋯ , fN− 1} is called a complementary sequence set (CSS) of size N if

$$\sum\limits_{u=0}^{N-1}C_{\boldsymbol{f}_{u}}(\tau)=0\; \mathrm{for}\; \tau \not\equiv 0 (\bmod{ L}).$$
(2)

If the set size N = 2, such a set is called a Golay (complementary) pair. Each sequence in a Golay pair is called a Golay sequence.

Remark 3

In a wireless transmission system, a transmitter may use two sequences, i.e., a Golay pair for transmitting the same information symbols using each sequence. At the receiver side, it will employ two correlators to compute their respective autocorrelation functions of each received signal (i.e., the sequence) with the corresponding locally generated sequences. So, this resembles the space redundancy, i.e., a 2 × 2 MIMO transmission system.

Example 3

For binary sequences f0 = (0001) and f1 = (0010), we have

figure a

Thus, they form a Golay pair.

In the following definitions, the given sequences being complex sequences or q-ary sequences depend on the context. Two CSSs

$$\begin{array} {l} S_{0}=\{\boldsymbol{f}_{0,0}, \boldsymbol{f}_{0,1},\cdots, \boldsymbol{f}_{0, N-1}\}, \text{and} \\ S_{1}=\{\boldsymbol{f}_{1,0}, \boldsymbol{f}_{1,1},\cdots, \boldsymbol{f}_{1, N-1}\} \end{array}$$

are said to be mutually orthogonal if

$$\sum\limits_{v=0}^{N-1}C_{\boldsymbol{f}_{0, v}, \boldsymbol{f}_{1, v}}(\tau)=0 \ \text{for}\ \forall \tau.$$
(3)

Definition 4

Let Su = {fu,0,fu,1,⋯ ,fu,N− 1} be CSSs of size N for 0 ≤ u < N, which are pairwise mutually orthogonal. Such a collection of Su is called complete mutually orthogonal complementary sets (CMOCS) or complete complementary codes (CCC).

Example 4

Let N = 2 and let binary sequences sets be given as follows.

$$\begin{array} {l} S_{0}=\{{\textbf{f}}_{00} =(0001), {\textbf{f}}_{01} = (0010)\}\\ S_{1}=\{{\textbf{f}}_{10} =(0100), {\textbf{f}}_{11} = (0111)\}. \end{array}$$

From Example 3, the first set is a Golay pair. We can easily verify that S1 is also a Golay pair, and two sets are mutually orthogonal. So {S0,S1} forms a CCC.

For quaternary sequences of length 4, we have the following example for CCC:

$$S=\left[\begin{array}{lll} {\textbf{f}}_{0,0}=(0,1,0,3), & {\textbf{f}}_{0,1}=(0,1,2,1) \\ {\textbf{f}}_{1,0}=(0,3,0,1), & {\textbf{f}}_{1,1}=(0,3,2,3) \end{array}\right].$$

In other words, each row is a Golay pair and two rows are mutually orthogonal.

We have the following relation between the periodic and aperiodic correlation functions.

Proposition 2

$$R(\tau) =C(\tau) + \overline{C(L-\tau)} = C(\tau) + C(\tau-L), \forall \tau.$$

2.3 The discrete Fourier transform

The discrete Fourier transform (DFT) of a q-ary sequence f = {f(t)} of period L is defined as

$$\hat{f}(k) = \sum\limits_{t=0}^{L-1} \omega_{q}^{f(t)}e^{\frac{-\sqrt{-1} 2\pi kt}{L}} = \sum\limits_{t=0}^{L-1} \omega_{q}^{f(t)} \omega_{L}^{-kt} , 0\le k<L.$$

(Recall that \(\omega _{L} = e^{\frac {\sqrt {-1} 2\pi }{L}}\).) The inverse DFT (IDFT) is given by

$$\omega_{q}^{f(t)} =\frac{1}{L} \sum\limits_{k=0}^{L-1} \hat{f}(k) \omega_{L}^{tk}, 0\le t<L.$$

DFT can also be given in a matrix form:

$$\left( \begin{array}{ccccc} \hat{f}(0)\\ \hat{f}(1)\\ \hat{f}(2)\\ \vdots\\ \hat{f}(L-1) \end{array}\right) = \left( \begin{array}{ccccc} 1 & 1 & 1& {\cdots} & 1\\ 1 & \omega_{L}^{-1} & \omega_{L}^{-2} & {\cdots} & \omega_{L}^{-(L-1)}\\ 1 & \omega_{L}^{-2} & \omega_{L}^{-4} & {\cdots} & \omega_{L}^{-2(L-1)}\\ {\vdots} &&&&\\ 1 & \omega_{L}^{-(L-1)} & \omega_{L}^{-2(L-1)} & {\cdots} & \omega_{L}^{-(L-1)^{2}} \end{array}\right) \cdot \left( \begin{array}{ccccc} \omega_{q}^{f(0)}\\ \omega_{q}^{f(1)}\\ \omega_{q}^{f(2)}\\ {\vdots} \\ \omega_{q}^{f(L-1)} \end{array}\right)$$

where the exponents of ωL is reduced by modulo L, since \({\omega _{L}^{L}}=1\). The L × L matrix in the above identity is referred to as the DFT matrix.

Example 5

For the m-sequence of period 7, f = {f(t)} = (1110100), we have {(− 1)f(t)} = (− 1, − 1, − 1, 1, − 1, 1, 1). The DFT of f is given by

$$\left( \begin{array}{cccccccc} \hat{f}(0)\\ \hat{f}(1)\\ \hat{f}(2)\\ \hat{f}(3)\\ \hat{f}(4)\\ \hat{f}(5)\\ \hat{f}(6) \end{array}\right) = \left( \begin{array}{cccccccc} 1 & 1 & 1& 1 & 1 & 1& 1\\ 1 & {\omega_{7}^{6}} & {\omega_{7}^{5}} & {\omega_{7}^{4}} &{\omega_{7}^{3}} & {\omega_{7}^{2}} & \omega_{7}\\ 1 & {\omega_{7}^{5}} & {\omega_{7}^{3}} & \omega_{7} &{\omega_{7}^{6}} & {\omega_{7}^{4}} & {\omega_{7}^{2}}\\ 1 & {\omega_{7}^{4}} & \omega_{7} & {\omega_{7}^{5}} &{\omega_{7}^{2}} & {\omega_{7}^{6}} & {\omega_{7}^{3}}\\ 1 & {\omega_{7}^{3}} & {\omega_{7}^{6}} & {\omega_{7}^{2}} &{\omega_{7}^{5}} & \omega_{7} & {\omega_{7}^{4}}\\ 1 & {\omega_{7}^{2}} & {\omega_{7}^{4}} & {\omega_{7}^{6}} &\omega_{7} & {\omega_{7}^{3}} & {\omega_{7}^{5}}\\ 1 & \omega_{7} & {\omega_{7}^{2}} & {\omega_{7}^{3}} &{\omega_{7}^{4}} & {\omega_{7}^{5}} & {\omega_{7}^{6}} \end{array}\right) \cdot \left( \begin{array}{cccccccc} -1\\-1\\-1\\ 1\\-1\\1\\1 \end{array}\right)$$

where ω7 = ei2π/7 and \(i=\sqrt {-1}\). After the evaluation, we have

$$\left( \begin{array}{cc} \hat{f}(0) \\ \hat{f}(1)\\ \hat{f}(2)\\ \hat{f}(3)\\ \hat{f}(4)\\ \hat{f}(5)\\ \hat{f}(6) \end{array}\right) = 2\cdot\left( \begin{array}{ccc} &-1/2 &\\ {\omega_{7}^{4}} &+{\omega_{7}^{2}} &+ \omega_{7}\\ \omega_{7} &+ {\omega_{7}^{4}} &+ {\omega_{7}^{2}}\\ {\omega_{7}^{5}} &+ {\omega_{7}^{6}} &+ {\omega_{7}^{3}}\\ {\omega_{7}^{2}} &+ \omega_{7} &+ {\omega_{7}^{4}}\\ {\omega_{7}^{6}} &+ {\omega_{7}^{3}} &+ {\omega_{7}^{5}}\\ {\omega_{7}^{3}} &+ {\omega_{7}^{5}} &+ {\omega_{7}^{6}} \end{array}\right) = \left( \begin{array}{ccc} -1\\ -1+2.646i\\-1+2.646i\\ -1-2.646i\\-1+2.646i\\-1-2.646i\\-1-2.646i \end{array}\right).$$

2.4 Ambiguity functions

Given a complex sequence s = {s(t)} of period L, a phase-shift of s is given by \(\left \{s(t)\omega _{L}^{kt}\right \}\) for a fixed k : 0 ≤ k < L. A phase-shift of a q-ary sequence f is defined for \(\textbf{s}=\omega _{q}^{\textbf{f}}\), i.e., \(\left \{\omega _{q}^{f(t)} \omega _{L}^{kt}\right \}\).

Definition 5

Let s0 and s1 be two complex valued sequences. Then the periodic cross ambiguity function of s0 and s1 at (τ,k) is defined as

$$A_{{\textbf{s}}_{0}, {\textbf{s}}_{1}}(\tau, k) = \sum\limits_{t=0}^{L-1} s_{1}(t+\tau)\overline{s_{0}(t)} \omega_{L}^{-kt}, 0\le \tau, k < L,$$
(4)

which is a 2-dimensional function in variables τ, delayed time, and k, the Doppler variable where t + τ is reduced by modulo L. When s0 = s1 = s, it is referred to as the auto ambiguity function of the sequence s, or simply the ambiguity function, i.e.,

$$A_{{\textbf{s}}}(\tau, k) = \sum\limits_{t=0}^{L-1} s(t+\tau)\overline{s(t)} \omega_{L}^{-kt}, 0\le \tau, k < L,$$

or simply as A(τ,k). For two q-ary sequences f0 and f1, the cross ambiguity function of f0 and f1 is defined for \({\textbf{s}}_{0}=\left \{\omega _{q}^{f_{0}(t)}\right \}\) and \({\textbf{s}}_{1}=\left \{\omega _{q}^{f_{1}(t)}\right \}\). It becomes the auto ambiguity function of f when f0 = f1 = f.

Remark 4

For the concept of ambiguity functions, there are several different definitions: in signal processing, it uses \(\overline {A}(\tau , k)\); it may conjugate the term s(t + τ) instead of s(t) (e.g., in [68]). However, we adopt the above definition in order to keep the conventional notation in sequence design. This will be clearly seen in the following definition for q-ary sequences. But for all the different definitions, the set consisting all values of A(τ,k) is not changed.

Remark 5

From the definition of the ambiguity function, when k = 0, the cross/auto ambiguity function becomes the cross/auto correlation. Thus a bound on the ambiguity function is also the bound of correlation. However, in some cases, the correlation has a better bound than the ambiguity function.

We may write the auto ambiguity function as a 2-dimensional array:

$$A = \left( \begin{array}{ccccc} A(0, 0) & A(0, 1) & A(0,2) & {\cdots} & A(0, L-1)\\ A(1, 0) & A(1, 1) & A(1,2) & {\cdots} & A(1, L-1)\\ A(2, 0) & A(2, 1) & A(2,2) & {\cdots} & A(2, L-1)\\ {\vdots} &&&&\\ A(L-1, 0) & A(L-1, 1) & A(L-1,2) & {\cdots} & A(L-1, L-1) \end{array}\right).$$

We define a time-shift matrix of s as

$$T= \left( \begin{array}{cccccc} s(0) & s(1) & s(2) & {\cdots} & s(L-2) & s(L-1)\\ s(1) & s(2) & s(3) & {\cdots} & s(L-1) &s(0)\\ {\vdots} &&&&\\ s(L-1) & s(0) & s(1) & {\cdots} & s(L-3)& s(L-2) \end{array}\right)$$

and the phase-shift matrix of the sequence s is given by

$$P = \left( \begin{array}{cccccc} s(0) & s(0) & s(0)& {\cdots} & s(0)\\ s(1) & s(1){\omega_{L}^{1}} & s(1){\omega_{L}^{2}} & {\cdots} & s(1)\omega_{L}^{L-1}\\ s(2) & s(2){\omega_{L}^{2}} & s(2){\omega_{L}^{4}} & {\cdots} & s(2)\omega_{L}^{2(L-1)}\\ {\vdots} &&&&\\ s(L{-}1) & s(L{-}1)\omega_{L}^{L-1} & s(L{-}1)\omega_{L}^{2(L-1)} & {\cdots} & s(L{-}1)\omega_{L}^{(L-1)^{2}} \end{array}\right).$$

Then we have the following representation

$$A=T\overline{P}$$

where \(\overline {P}\) is the conjugate of the phase-shift matrix P by conjugating each entry in P. The cross ambiguity function also can be represented similarly.

Example 6

For the m-sequence of period 7, f = {f(t)} = (1110100), s(t) = (− 1)f(t). The time-shift matrix and the conjugate of the phase-shift matrix of the m-sequence are given as follows.

$$T = \left( \begin{array}{rrr rrrr} -1 & -1 & -1& 1 & -1 & 1& 1\\ -1 & -1& 1 & -1 & 1& 1 &-1\\ -1& 1 & -1 & 1& 1 &-1 & -1\\ 1 & -1 & 1& 1 &-1 & -1 & -1\\ -1 & 1& 1 &-1 & -1 & -1 & 1\\ 1& 1 &-1 & -1 & -1 & 1 & -1\\ 1 &-1 & -1 & -1 & 1 & -1 &1 \end{array}\right)$$

and

$$\overline{P} =\left( \begin{array}{rrr rrrr} -1 & -1 & -1 & -1 & -1 & -1 & -1 \\ -1 & -{\omega_{7}^{6}} & -{\omega_{7}^{5}} & -{\omega_{7}^{4}} & - {\omega_{7}^{3}} & -{\omega_{7}^{2}} & -\omega_{7} \\ -1 & -{\omega_{7}^{5}} & -{\omega_{7}^{3}} & -\omega_{7} & -{\omega_{7}^{6}} & -{\omega_{7}^{4}} & -{\omega_{7}^{2}} \\ 1 & {\omega_{7}^{4}} & \omega_{7} & {\omega_{7}^{5}} & {\omega_{7}^{2}} & {\omega_{7}^{6}} & {\omega_{7}^{3}} \\ -1 & -{\omega_{7}^{3}} & -{\omega_{7}^{6}} & -{\omega_{7}^{2}} & -{\omega_{7}^{5}} & -\omega_{7} & -{\omega_{7}^{4}} \\ 1 & {\omega_{7}^{2}} & {\omega_{7}^{4}} & {\omega_{7}^{6}} & \omega_{7} & {\omega_{7}^{3}} & {\omega_{7}^{5}} \\ 1 & \omega_{7} & {\omega_{7}^{2}} & {\omega_{7}^{3}} & {\omega_{7}^{4}} & {\omega_{7}^{5}} & {\omega_{7}^{6}} \end{array}\right).$$

Thus, the ambiguity function of the m-sequence f is given by \(A=T\overline {P}\), as shown below

$$\begin{array}{@{}rcl@{}} {A}&=&T\overline{P} = \left( \begin{array}{rrrrrrr} 7.0000 & 0.0000 & 0.0000 & 0.0000 & 0.0000 & 0.0000 & 0.0000 \\ -1.0000 & 2.8019 & -0.2470 & 1.4450 & 1.4450 & -0.2470 & 2.8019 \\ -1.0000 & -0.2470 & 1.4450 & 2.8019 & 2.8019 & 1.4450 & -0.2470 \\ -1.0000 & -2.3569 & 2.0489 & -2.6920 & -2.6920 & 2.0489 & -2.3569 \\ -1.0000 & 1.4450 & 2.8019 & -0.2470 & -0.2470 & 2.8019 & 1.4450 \\ -1.0000 & -2.6920 & -2.3569 & 2.0489 & 2.0489 & -2.3569 & -2.6920 \\ -1.0000 & 2.0489 & -2.6920 & -2.3569 & -2.3569 & -2.6920 & 2.0489 \end{array}\right)\\&&+\left( \begin{array}{rrrrrrr} 0.0000 & 0.0000 &0.0000 &0.0000 &0.0000 &0.0000 &0.0000\\ 0.0000 & 0.3862 &-2.8176 & -2.4314 & 2.4314 & 2.8176 & -0.3862\\ 0.0000 & -2.8176 & 2.4314 & -0.3862 & 0.3862 & -2.4314 & 2.8176\\ 0.0000 & -1.5637 & -1.9499 & -0.8678 & 0.8678 & 1.9499 & 1.5637\\ 0.0000 & 2.4314 & 0.3862 & 2.8176 & -2.8176 & -0.3862 & -2.4314\\ 0.0000 & 0.8678 &-1.5637 & 1.9499 & -1.9499 & 1.5637 & -0.8678\\ 0.0000 & -1.9499 & 0.8678 & 1.5637 & -1.5637 & -0.8678 & 1.9499 \end{array}\right)\cdot{i} \end{array}$$

and

$$|A(\tau,k)|=\left( \begin{array}{rrrrrrr} 7.0000 &0.0000 &0.0000 &0.0000 &0.0000 &0.0000 &0.0000\\ 1.0000 &2.8284 &2.8284 &2.8284 &2.8284 &2.8284 &2.8284\\ 1.0000 &2.8284 &2.8284 &2.8284 &2.8284 &2.8284 &2.8284\\ 1.0000 &2.8284 &2.8284 &2.8284 &2.8284 &2.8284 &2.8284\\ 1.0000 &2.8284 &2.8284 &2.8284 &2.8284 &2.8284 &2.8284\\ 1.0000 &2.8284 &2.8284 &2.8284 &2.8284 &2.8284 &2.8284\\ 1.0000 &2.8284 &2.8284 &2.8284 &2.8284 &2.8284 &2.8284 \end{array}\right)$$

which is shown in Fig. 1.

Fig. 1
figure 1

|A(τ,k)| for m-sequence f = (1110100)

For a sequence given by

$$\begin{array}{@{}rcl@{}} &&{\textbf{g}}=(1010110)\\ &&\left\{(-1)^{g(t)}\right\} = (-1, 1,-1, 1, -1, -1,1). \end{array}$$

Note this is the sum of two m-sequences of period 7, so we can compute its ambiguity function using the matrix method as follows. The time-shift matrix Tg is given by

$$T_{{\textbf{g}}} = \left( \begin{array}{rrr rrrr} -1 & 1 & -1 & 1 & -1 & -1 & 1 \\ 1 & -1 & 1 & -1 & -1 & 1 & -1 \\ -1 & 1 & -1 & -1 & 1 & -1 & 1 \\ 1 & -1 & -1 & 1 & -1 & 1 & -1 \\ -1 & -1 & 1 & -1 & 1 & -1 & 1 \\ -1 & 1 & -1 & 1 & -1 & 1 & -1 \\ 1 & -1 & 1 & -1 & 1 & -1 & -1 \end{array} \right)$$

and the conjugate of the phase shift matrix, Pg, as follows.

$$\overline{P_{{\textbf{g}}}} = \left( \begin{array}{rrrrrrr} -1 & -1 & -1 & -1 & -1 & -1 & -1 \\ 1 & {\omega_{7}^{6}} & {\omega_{7}^{5}} & {\omega_{7}^{4}} & {\omega_{7}^{3}} & {\omega_{7}^{2}} & \omega_{7} \\ -1 & -{\omega_{7}^{5}} & -{\omega_{7}^{3}} & -\omega_{7} & -{\omega_{7}^{6}} & -{\omega_{7}^{4}} & -{\omega_{7}^{2}} \\ 1 & {\omega_{7}^{4}} & \omega_{7} & {\omega_{7}^{5}} & {\omega_{7}^{2}} & {\omega_{7}^{6}} & {\omega_{7}^{3}} \\ -1 & -{\omega_{7}^{3}} & -{\omega_{7}^{6}} & -{\omega_{7}^{2}} & -{\omega_{7}^{5}} & -\omega_{7} & -{\omega_{7}^{4}} \\ -1 & -{\omega_{7}^{2}} & -{\omega_{7}^{4}} & -{\omega_{7}^{6}} & -\omega_{7} & -{\omega_{7}^{3}} & -{\omega_{7}^{5}} \\ 1 & \omega_{7} & {\omega_{7}^{2}} & {\omega_{7}^{3}} & {\omega_{7}^{4}} & {\omega_{7}^{5}} & {\omega_{7}^{6}} \end{array} \right).$$

Thus, the ambiguity function of the sequence g = (1010110) is

$$\begin{array}{@{}rcl@{}} {A_{{\textbf{g}}}}&=&T_{{\textbf{g}}}\overline{P_{{\textbf{g}}}}=\left( \begin{array}{rrrrrrr} 7.0000 & 0.0000 & 0.0000 & 0.0000 & 0.0000 & 0.0000 & 0.0000 \\ -5.0000 & -1.8019 & 1.2470 & -0.4450 & -0.4450 & 1.2470 & -1.8019 \\ 3.0000 & 3.6039 & -2.4940 & 0.8901 & 0.8901 & -2.4940 & 3.6039 \\ -1.0000 & -4.0489 & 0.6920 & 0.3569 & 0.3569 & 0.6920 & -4.0489 \\ -1.0000 & 2.8019 & -0.2470 & 1.4450 & 1.4450 & -0.2470 & 2.8019 \\ 3.0000 & -0.8019 & 2.2470 & 0.5550 & 0.5550 & 2.2470 & -0.8019 \\ -5.0000 & -0.4450 & -1.8019 & 1.2470 & 1.2470 & -1.8019 & -0.4450 \end{array}\right)\\&&+\left( \begin{array}{rrrrrrr} 0.0000 & 0.0000 & 0.0000 & 0.0000 & 0.0000 & 0.0000 & 0.0000 \\ 0.0000 & 0.8678 & -1.5637 & 1.9499 & -1.9499 & 1.5637 & -0.8678 \\ 0.0000 & 0.0000 & 0.0000 & 0.0000 & 0.0000 & 0.0000 & 0.0000 \\ 0.0000 & -1.9499 & 0.8678 & 1.5637 & -1.5637 & -0.8678 & 1.9499 \\ 0.0000 & 3.5135 & 1.0821 & -0.6959 & 0.6959 & -1.0821 & -3.5135 \\ 0.0000 & -3.5135 & -1.0821 & 0.6959 & -0.6959 & 1.0821 & 3.5135 \\ 0.0000 & 1.9499 & -0.8678 & -1.5637 & 1.5637 & 0.8678 & -1.9499 \end{array}\right)\cdot{i} \end{array}$$

and

$$|A_{{\textbf{g}}}(\tau,k)|=\left( \begin{array}{rrrrrrr} 7.0000 &0.0000 &0.0000 &0.0000 &0.0000 &0.0000 &0.0000\\ 5.0000 &2.0000 &2.0000 &2.0000 &2.0000 &2.0000 &2.0000\\ 3.0000 &3.6039 &2.4940 &0.8901 &0.8901 &2.4940 &3.6039\\ 1.0000 &4.4940 &1.1099 &1.6039 &1.6039 &1.1099 &4.4940\\ 1.0000 &4.4940 &1.1099 &1.6039 &1.6039 &1.1099 &4.4940\\ 3.0000 &3.6039 &2.4940 &0.8901 &0.8901 &2.4940 &3.6039\\ 5.0000 &2.0000 &2.0000 &2.0000 &2.0000 &2.0000 &2.0000 \end{array}\right)$$

which is shown in Fig. 2.

Fig. 2
figure 2

|Ag(τ,k)| for g = (1010110)

We can also define the aperiodic ambiguity functions of sequences as follows.

Definition 6

Let s0 and s1 be two complex-valued sequences of length L and extended into infinite length by (1), i.e., si(t) = 0,i = 0,1 for t < 0 and tL. The aperiodic cross ambiguity function of s0 and s1 at (τ,k) is defined as

$$A_{{\textbf{s}}_{0}, {\textbf{s}}_{1}}(\tau, k) = \sum\limits_{t=0}^{L-1}s_{1}(t+\tau)\overline{s_{0}(t)} \omega_{L}^{-kt},\ -L<\tau<L, 0\le k<L\\$$
(5)

and it defines the auto ambiguity function of the sequence s when s0 = s1 = s, i.e.,

$$A_{{\textbf{s}}}(\tau, k)= \sum\limits_{t=0}^{L-1}s(t+\tau)\overline{s(t)} \omega_{L}^{-kt}, -L< \tau<L, 0\le k<L.$$
(6)

The aperiodic cross ambiguity function of two q-ary sequences f0 and f1 is defined for \(\textbf{s}_{k}=\omega ^{\textbf{f}_{k}}, k=0, 1\) and it defines the aperiodic auto ambiguity function when f0 = f1.

Note that Turyn [67] defined the aperiodic cross ambiguity function of two sequences of length L using continuous phase shift, i.e., the phase shift term \(\omega _{L}^{-kt} = e^{\frac {-i2\pi kt}{L}}\) is replaced by eit𝜃 where 0 ≤ 𝜃 < 2π. The phase shift term in the ambiguity function in [23] also considered a more general case, i.e., it can be any discrete phase shift, not restricted to L phases. Currently, there are no constructions for a sequence (or signal) set (it may not be a CSS) even for low aperiodic correlation, not to mention low aperiodic ambiguity. But there are some constructions for the sequence sets with low periodic ambiguity. We will introduce those constructions in Section 4.

Remark 6

In fact, periodic and aperiodic correlation and ambiguity functions can be given by the same formulae. The difference is that an infinite extension of s = (s(0),s(1),⋯ ,s(L − 1)) of length L is a periodic extension for the periodic case, and the zero extension for the aperiodic case.

3 Two-level autocorrelation sequences

The research on constructions for 2-level autocorrelation sequences has been gloriously active for more than 7 decades. Those sequences correspond to the combinatorial design, i.e., cyclic Hadamard difference sets. The constructions are mainly based on finite fields except for the binary case. For the references, the reader is referring to [19].

3.1 The second-order decimation-Hadamard transform

Since one of the proofs of Lin conjecture and Ludkovski-Gong conjectured sequences with 2-level autocorrelation are obtained by applying the second-order decimation-Hadamard transform (DHT), which was introduced by Golomb and Gong in 2002. Below we introduce the definitions together with its extension of the multiplexing case.

Definition 7

Let f(x) be a function from \({\mathbb {F}}_{Q}\) to \({\mathbb {F}}_{p}\) (Q = pn). The Hadamard transform of f is defined as

$$\begin{array}{@{}rcl@{}} \widehat{f}(\lambda) & =& \sum\nolimits_{x\in {\mathbb{F}}_{q}} \omega_{p}^{Tr(\lambda x)-f(x)}, \lambda \in {\mathbb{F}}_{Q}, \text{and its inverse transform is }\\ \omega_{p}^{f(\lambda)} & =& \frac{1}{Q} \sum\nolimits_{x\in {\mathbb{F}}_{q}} \omega_{p}^{Tr(\lambda x)} \overline{ \widehat{f}(\lambda)} \end{array}$$

Definition 8

For integers 0 < v,t < Q − 1 with \(\gcd (v, Q-1)=1\) and \(\gcd (t, Q-1)=1\), the first-order DHT is defined as the Hadamard transform of f(xv), i.e.,

$$\widehat{f}(v) (\lambda) = \sum\limits_{x\in {\mathbb{F}}_{q}} \omega_{p}^{Tr(\lambda x)-f(x^{v})}, \lambda \in {\mathbb{F}}_{Q},$$

and the second-order DHT is defined as the Hadamard transform of \(\widehat {f} (v)(x^{t})\), i.e.,

$$\widehat{f}(v, t) (\lambda) = \sum\limits_{x\in {\mathbb{F}}_{q}} \omega_{p}^{Tr(\lambda x)} \overline{ \widehat{f} (v)(x^{t})}, \lambda \in \mathbb{F}_{Q}.$$

Definition 9

If

$$\widehat{f}(v, t) (\lambda)\in \left\{Q{\omega_{p}^{t}} | 0\le t<p\right\}, \lambda \in {\mathbb{F}}_{Q},$$

then we define a function \(g: {\mathbb {F}}_{Q} \mapsto {\mathbb {F}}_{p}\) as

$$\omega_{p}^{g(x)} = \widehat{f}(v, t) (\lambda), \lambda \in {\mathbb{F}}_{Q}$$

where (v,t) is called a realizable pair of f(x) and g(x), a realization of f(x).

From Parseval formula, the following result follows immediately (see [19]).

Proposition 3

For the pair f(x) and g(x) in Definition 9, the p-ary sequence {f(αt)} has 2-level autocorrelation if and only if the p-ary sequence {g(αt)} has 2-level autocorrelation.

If \(d=\gcd (v, Q-1)>1\), then xv is not a permutation of \({\mathbb {F}}_{Q}\). So we need multiple pieces of the first-order DHT in order to get g(x), which is called a multiplexing DHT, defined as follows.

Definition 10

For integers 0 < v,t < Q − 1 with \(d=\gcd (v, Q-1)>1\) and \(\gcd (t, Q-1)=1\), the first-order multiplexing DHT is defined as

$$\widehat{f}(v) (\lambda, \gamma) = \sum\limits_{x\in {\mathbb{F}}_{q}} \omega_{p}^{Tr(\lambda x)-f(\gamma x^{v})}, \lambda \in {\mathbb{F}}_{Q}, \gamma \in {\mathbb{F}}_{Q}^{*},$$

and the second-order multiplexing DHT is

$$\widehat{f}(v, t) (\lambda, \gamma) = \sum\limits_{x\in {\mathbb{F}}_{q}} \omega_{p}^{Tr(\lambda x)}\overline{ \widehat{f}(v)(x^{t}, \gamma)}, \lambda \in \mathbb{F}_{Q}, \gamma \in \mathbb{F}_{Q}^{*}.$$

Furthermore, if

$$\widehat{f}(v, t) (\lambda, \gamma)\in \left\{Q{\omega_{p}^{t}} | 0\le t<p\right\}, \lambda \in {\mathbb{F}}_{Q}, \gamma \in {\mathbb{F}}_{Q}^{*},$$

then we define a function \(g: {\mathbb {F}}_{Q} \mapsto {\mathbb {F}}_{p}\) as

$$\omega_{p}^{g\left( \alpha^{kv+j} \right)}= \widehat{f}(v, t) \left( \alpha^{k}, \alpha^{j}\right), 0\le k<s, 0\le j<d, \text{where} s=\frac{Q-1}{d}, g(0)=0$$

(recall that α is a primitive element of \({\mathbb {F}}_{Q}\)) and (v,t) is called a realizable pair of f(x), and g(x), a multiplexing realization of f(x).

Proposition 4

[35] With the notation in Definition 10, the p-ary sequence {f(αt)} has 2-level autocorrelation if and only if the multiplexing p-ary sequence {g(αt)} has 2-level autocorrelation.

The following concept was introduced by Golomb and Gong as an alternative way to define a Hadamard equivalent relation, based on the concept introduced by Dillon and Dobbertin.

Definition 11

If g(x) is a realization of f(x) by the second-order DHT or multiplexing DHT, then we say that g is Hadamard equivalent to f.

3.2 Known constructions

The known primary constructions of binary sequences of period L:

  1. 1.

    Number theory based construction: L is prime and L ≡ 3 mod 4, we have quadratic residue sequences or called Legendre sequences (1932); furthermore, if L = 4a2 + 27, we have Hall’s sextic residue sequences (Hall 1957).

  2. 2.

    Finite fields based constructions for L = 2n − 1 where n is a positive integer:

    • m-sequences by Singer (1938 in the form of cyclic Hadamard difference sets) and Golomb (1954).

    • For n ≥ 6, n composite, we have GMW sequences, constructed by Goldon, Mills and Welch in 1962 in the language of cyclic Hadamard difference sets, and Scholtz and Welch provided a sequence version in 1984).

    • Hyper-oval constructions by Maschietti (1998) for which there are three constructions using three types of monomial hyper-ovals, i.e., Segre, and Glynn I and II cases.

    • Dillon-Dobbertin’s Kasami power function construction (2004): Dillon and Dobbertin in their milestone work published in 2004 using the Kasami exponents to construct binary sequences with 2-level autocorrelation, which include the conjectured 3-term, 5-term sequences and Welch-Gong (WG) sequences as subclasses.

The known primary constructions for p > 2-ary case:

  1. 1.

    For p > 2, m-sequences (Zieler, 1959), GMW sequences (1962), and HG sequences [31] (2002) (also found by Dillion independently) including Helleseth-Kumar-Martin’s ternary sequences as a special case.

  2. 2.

    For p = 3, conjectured sequences:

    • Lin conjecture (1998): Let n = 2m + 1 and

      $$f(x) = Tr(x + x^{d}), \text{where} d = 2\cdot3^{m} +1.$$

      Lin, in his Ph.D thesis, conjectured that {f(αt)} has 2-level autocorrelation where α is a primitive element of \(\mathbb {F}_{3^{n}}\). The conjecture has been proved by two different research groups, one is given by Hu, Shao, Gong and Helleseth [35] in 2014, and the other, by Arasu, Dillon and Player [3] in 2015.

    • Ludkovski and Gong conjectures [42] (2001): Starting with a Lin conjectured sequence, applying the 2nd-order DHT, four classes of new ternary 2-level autocorrelation sequences are obtained, namely A, B, C and D where the class D contains classes B and C. Ludkovski and Gong differentiated them from the class D because of the similarity to their counter part p = 2. The validity of these two classes of the ternary sequences is also established by Arasu-Dillon-Player in the same paper [3].

The known secondary construction

For p-ary case (p prime), there is a secondary construction for p-ary 2-level autocorrelation sequences of period pn − 1, which is given by compositing a 2-level autocorrelation sequence with period pm − 1 where m is a proper factor of n and an m-sequence over \(\mathbb {F}_{p^{m}}\) of degree n/m or generalized GMW sequence over \(\mathbb {F}_{p^{m}}\). (See [19] for details.) Note that No [47] observed that it can composite with a sequence over \(\mathbb {F}_{p^{m}}\) with the uniform difference condition. However, the known sequences over \(\mathbb {F}_{p^{m}}\) having the uniform difference condition are only the aforementioned m-sequences and GMW sequences.

3.3 Some observations on the proofs for the conjectures

Ludkovski-Gong conjectured sequences are the realizations of f, the 2-term function in Lin conjecture, with the two different classes of the realizable pairs. The condition on k with \(\gcd (m-k, n)=1\), marked by in Table 1 was added by Arasu-Dillon-Player [3] in their proof.

Table 1 Ludkovski-Gong conjectured sequences realized from Lin conjecture

The validity of Lin conjecture and Ludkovski-Gong conjecture is confirmed after more than one and half decades later since the conjectures were formed. Both proofs for Lin conjecture are rather technical. The proof given by Hu-Shao-Gong-Helleseth is to apply the second-order multiplexing DHT, and the method used by Arasu-Dillon-Player is to use the character sum factorization approach, which is much more general and powerful. They not only proved the validity of Lin conjecture and Ludkovski-Gong conjectures, but also showed that the Dillon-Dobbertin construction for binary 2-level autocorrelation sequences and the Helleseth-Gong/Dillon construction for p-ary 2-level autocorrelation sequences can be represented by their character sum factorizations. This is extremely remarkable!

In fact, the character sum factorization approach is tightly related to the second-order DHT approach. The triple exponents involved in the character sum factorization in Arasu-Dillon-Player’s work for the Dillon-Dobbertin construction correspond to the realization pair in the second-order DHT. We observe that one of the triple exponents in the character sum factorization for each of Lin conjecture, Ludkovski-Gong conjectures, and Helleseth-Gong and Dillon construction is not coprime with the period L = pn − 1.

So we can show that all those three constructions can be realized from m-sequences by applying the second-order multiplexing DHT. This means that all known p-ary 2-level autocorrelation sequences are Hadamard equivalent through the second-order multiplexing DHT! For binary cases, all the known constructions through finite fields are Hadamard equivalent through the second-order DHT (see [19]) or the second-order multiplexing DHT for Dillon-Dobberin’s construction for n even. In other words, starting with an m-sequence, using the realizable pairs (multiplexing or not), we can obtain all the known 2-level autocorrelation sequences. This is rather surprising for p-ary (p > 2) cases. Those include the secondary constructions as well as Hadamard equivalence of quadratic residue sequences, which are investigated by Yu and Gong [74].

Observation 1

Except for the Hall sextic residue sequences, all the known p-ary (p prime) 2-level autocorrelation sequences can be realized by p-ary m-sequences by applying the second-order DHT or multiplexing DHT.

Remark 7

For n = 7, from Section 9.4.4 in [19], the Hall sextic residue sequence is also Hadamard equivalent to the m-sequences. However, there is no proof for the result in general n for period 2n − 1 being prime and n ≡ 3 mod 4. This is an unsolved problem.

In terms of the Hadamard equivalence (multiplexing or not), the above observation and Remark 7 imply that excluding the Hall sextic residue sequences, there is only one class of 2-level autocorrelation sequences, i.e., m-sequences.

3.4 Open problem on linear span of Ludkovski-Gong conjectured sequences by Gong and Helleseth

Nevertheless, all the currently conjectured 2-level autocorrelation sequences have been proved! However, we still have important unsolved problems for Ludkovski-Gong conjectured sequences, i.e., their trace representations (the sum of the monomial trace terms) and linear spans (the linear span of a q-ary sequence of period L is defined as the number of stages of the shortest LFSR which generates the sequence). In 2004, Gong and Helleseth conjectured the linear span of the class D sequences, presented below.

Observation 2

(Gong and Helleseth (2004) [26]) Let gk(x) be the realization of f(x) = Tr(x + xd), the 2-term function in Lin conjecture, with realization pair (1,tk) where \(\gcd (m-k, n)=1\) in Table 1. Then the linear span of gk, denoted as LS(gk), is given by

$$LS({\textbf{g}}_{k})= nl_{k}$$
(7)

where lk is the number of trace terms, determined by the Fibonacci sequence {Fk} as follows:

$$\left\{ \begin{array}{llll} l_{1}& = & F_{2(m-1)}& \\ l_{k} & = & F_{2k}, & k=2, 3, \cdots, m-2 \\ l_{m-1}& = & F_{2m} & \end{array} \right.$$

when n prime or n≢0 mod 3, and

$$\left\{ \begin{array}{cccc} l_{k} & = & F_{2k}, & k=2, 3, \cdots, m-2\\ l_{m-1}& = & F_{2m}& \end{array} \right.$$

where n ≡ 0 mod 3.

Since {Fk} is the Fibonacci sequence, we have the following linear recursive relation:

$$F_{2k}= 3F_{2(k-1)}- F_{2(k-2)}, k=2, 3, \cdots, m.$$
(8)

In other words, the number of trace terms of the sequence gk,k = 2,3,⋯ ,m − 1 are linearly recursively related by (8).

3.5 The exhaustive search approach

The exhaustive search for binary 2-level autocorrelation sequences have been done for L = 2n − 1 for n ≤ 10. So the next case will be n = 11. For n = 11, there are 187 coset leaders modulo L = 211 − 1 = 2047. Hence the exhaustive searching space is given by 2187 − 187 × 186/2 − 187, which is beyond current computing power. A succeeding case after that is n = 12. Since \(\mathbb {F}_{2^{12}}\) has subfields and several subgroups of the multiplicative group determined by the factors of 212 − 1, by applying Baumert’s tricks (see [4, Section III], for example [4, Lemma 3.8] on page 63), it may result in a smaller search complexity than the case for n = 11. We encourage anyone who is interested to attempt that.

The other direction would be for conducting partial searches for 2-level autocorrelation sequences. For example, a partial search for 10 < n < 20 has been done for Hadamard equivalent classes by applying the second-order DHT of Definition 8 and reported in [24] for n ≤ 17 and for n = 18,19 succeedingly done, but not including the cases by applying the second-order multiplexing DHT of Definition 10. Nevertheless, this partial search confirmed that all known 2-level autocorrelation sequences can be classified for 10 < n < 20 under the Hadamard equivalence.

4 Sequences with low ambiguity

In this section, we present a summary for the currently best-known designs for sequence sets with low ambiguity and large sizes. For two sequences, if one cannot be obtained by applying both the time shift operator and phase shift operator, then they are said to be time-phase shift distinct, a definition introduced by Ding et al. [14], otherwise, the time-phase shift equivalent.

For example, let S consist of the following five quaternary sequences of period 4:

$$\begin{array}{@{}rcl@{}} {\textbf{f}}_{0}&=(1, 2, 0, 3) \\ {\textbf{f}}_{1}&=(0, 3, 1, 2) \\ {\textbf{f}}_{2}& = (1, 3, 2, 2) \\ {\textbf{f}}_{3}& =(1, 0, 0, 1)\\ {\textbf{f}}_{4} & =(0, 0, 3, 1). \end{array}$$

By observation, since q = L = 4, we have f1 is a time-shift of f0 at shift τ = 2, f2 is a phase-shift of f0 with k = 1, and f3 is a phase-shift of f0 with k = 2, and f4 is a time-shift at τ = 2 and phase-shift at k = 1 from f0, i.e., at (τ,k) = (2,1) from f0, i.e.,

$$\begin{array}{@{}rcl@{}} {\textbf{f}}_{1} &=&\{f_{0}(t+2)\}_{t=0}^{3} \\ {\textbf{f}}_{2}&=&{\textbf{f}}_{0}+(0, 1, 2, 3)=(1, 2, 0, 3) +(0, 1, 2, 3)\\ {\textbf{f}}_{3}&=&{\textbf{f}}_{0}+(0, 2, 0, 2)=(1, 2, 0, 3) +(0, 2, 0, 2)\\ {\textbf{f}}_{4}&=& \{f_{0}(t+2)\}_{t=0}^{3} + (0, 1, 2, 3) = (0, 3, 1,2)+(0, 1, 2, 3). \end{array}$$

Thus, by the time-phase shift relation, S has only one sequence f0, and the others are time-phase shift equivalent to f0. (Note that in general, we cannot do the computation like the examples shown above when qL, since the phase-shift of f is given by \(\left \{\omega _{q}^{f(t)}\omega _{L}^{kt}\right \}\).)

We now assume that S = {s0,⋯ ,sN− 1} is a set consisting of N complex sequences of period L, which are time-phase shift distinct. We use the following notations for different bounds.

  • \(G_{\max \limits }\): the maximum magnitude of the auto ambiguity function of S, defined by

    $$G_{\max} = \underset{0\le j<N, 0\le \tau, k <L}{\max} \{ |A_{{\textbf{s}}_{j}}(\tau, k)| \}.$$
  • \(A_{\max \limits }\): the maximum magnitude of the cross ambiguity function \(A_{{\textbf{s}}_{k}, {\textbf{s}}_{j}}(\tau , k)\), i.e.,

    $$A_{\max} = \underset{0\le k, j<N, 0\le \tau, k <L}{\max} |A_{{\textbf{s}}_{k}, {\textbf{s}}_{j}}(\tau, k)|, (\tau, k)\ne (0, 0) \ {if} \ k=j.$$

We may extend S to T by including all time-shift distinct sequences from S and define

  • \(R_{\max \limits }\), the maximum out-of-phase autocorrelation of the sequences in T, i.e.,

    $$R_{\max} = \underset{{\textbf{s}}\in T, 0\le \tau <L}{\max} |C_{{\textbf{s}}}(\tau)|.$$
  • \(C_{\max \limits }\): the maximum magnitude of cross correlation of any two sequences in T, i.e.,

    $$C_{\max} =\underset{{\textbf{s}}_{k},{\textbf{s}}_{j}\in T , 0\le \tau <L}{\max} |C_{{\textbf{s}}_{k}, {\textbf{s}}_{j}}(\tau)|, \tau\ne 0 \ {if} \ k=j.$$

Ding et al. [14] have detailed discussions on how to get a time-shift distinct sequence set from a time-phase shift distinct sequence set with the following property.

Property 1

Let S = {s0,⋯ ,sN− 1} be a time-phase shift distinct sequence set. Then \(T=\left \{ \left \{\omega _{L}^{kt}\cdot s_{j}(t)\right \}_{0\leq t<L}: 0\leq k, j<N, \right \}\) is a time-shift distinct sequence set. Moreover, we have

$$S\subseteq T, \text{with} \ |T| = L\cdot |S|, R_{\max} = G_{\max}, \text{and}\ C_{\max}= A_{\max}.$$

In this section, the four classes of the sequence sets that we will introduce have their \(R_{\max \limits }\) and \(A_{\max \limits }\) bounded by directly applying the Weil bounds with only one exception. Now we will present the Weil bounds on character sums first.

4.1 The Weil bounds and W. Li bound

Restate that Q = pn.

Definition 12

For each j = 0,1,⋯ ,p − 1, we define an additive character of \({\mathbb {F}}_{Q}\) as the function ψj, given by

$$\psi_{j}(x)=e^{2\pi ij Tr(x)/p}=\omega_{p}^{jTr(x)}, x\in {\mathbb{F}}_{Q}.$$

We also denote it as ψ(x) when j = 1. Let M|(Q − 1). For each j = 0,1,⋯ ,M − 1, a multiplicative character χj of order \(M/\gcd (j, M)\) is defined by

$$\chi_{j}(\alpha^{k})=e^{2\pi ijk/M} = \omega_{M}^{jk}, \alpha^{k}\in {\mathbb{F}}_{Q}^{*}$$

or equivalently

$$\chi_{j}(x)= \omega_{M}^{(j\log_{\alpha} x) \bmod{M}}, x \in {\mathbb{F}}_{Q}^{*}.$$

The original definition in the Weil bounds is for χj(0) = 0 and we modify it as χj(0) = 1 for consistence with sequence design.

Let f(x) and g(x) be polynomials over \({\mathbb {F}}_{Q}[x]\) where f has degree d with \(\gcd (d, Q)=1\). Assume that s and e are the numbers of distinct roots of g(x) in the algebraic closure of \(\mathbb {F}_{Q}\) and \(\mathbb {F}_{Q}\) respectively. We assume that both f and g are nontrivial with respect to the additive character and multiplicative character respectively.

Fact 1

(The Weil bound (Weil 48, Delegne 74)) With the above notation, the following inequalities hold.

$$\begin{array}{@{}rcl@{}} &&{\text{The additive character sum:}} \left|\sum\nolimits_{x\in {\mathbb{F}}_{Q}} \psi(f(x)) \right|\le (d-1)\sqrt{Q}.\\ &&{\text{The multiplicative character sum:}} \left|\sum\nolimits_{x\in {\mathbb{F}}_{Q}}\chi(g(x))\right| \leqslant (s-1)\sqrt{Q}+e.\\ &&{\text{The hybrid character sum:}} \left|\sum\nolimits_{x\in {\mathbb{F}}_{Q}}\chi(g(x))\psi(f(x))\right|\leqslant (d+s-1)\sqrt{Q} +e. \end{array}$$

We also have the following bound for the exponential sum involved the function in Kloosterman’s sum, which is given by W. Li.

Fact 2

(W. Li [40] (1995)) For any 1 ≤ j < Q − 1, \(a, b \in {\mathbb {F}}_{Q}\),

$$\left | \sum\limits_{x\in {\mathbb{F}}_{Q}^{*}} \chi_{j}(x)\psi\left( a x+ b x^{-1}\right) \right |\le 2\sqrt{Q}.$$

4.2 Autocorrelation of power residue sequences and Sidel’nikov sequences

From the discussions on 2-level autocorrelation sequences in Section 3, it implies that for those sequences, there exist some polynomials of \(\mathbb {F}_{p^{n}}\), say f(x) such that the sum of additive character ψ(f(λx) − f(x)) is equal to zero for every \(\lambda \in \mathbb {F}_{Q} (Q=p^{n})\) with λ≠ 1, i.e.,

$$\sum\limits_{x\in {\mathbb{F}}_{Q}} \psi(f(\lambda x)-f(x)) =0, \lambda \in {\mathbb{F}}_{Q}, \lambda \ne 1.$$

However f(x) may have a high degree except for m-sequences given by f(x) = x. So, the Weil bound cannot be used in those cases. Two natural questions which we would like to ask: whether there are sequences constructed from multiplicative characters over \(\mathbb {F}_{p^{n}}\) (including n = 1), which are analogue to: Case 1) m-sequences; Case 2) 2-level autocorrelation sequences where f(x) has a high degree. We can confirm that there exist some solutions to the first question, but it is unknown whether there is a solution to the second question. In fact, for Case 1), there are two constructions for f(x) has degree 1: one is for \(\mathbb {F}_{p}\), called power residue sequences and the other is for \(\mathbb {F}_{p^{n}}, n>1\), Sidel’nikov sequences, which have optimal or low autocorrelation, which will be introduced as follows.

Definition 13

With the notation in Section 4.1, we define the following two sequences:

\({\mathbb {F}}_{p}\)

\({\mathbb {F}}_{Q}, Q=p^{n}\)

\(\chi _{j}(f(t)) = \omega _{M}^{ju(t)}\)

\(\chi _{j}(g(t)) = \omega _{M}^{jv(t)}\)

\(u(t)= \log _{\alpha } f(t) \bmod {M}\)

\(v(t) = \log _{\alpha } g(\alpha ^{t}) \bmod {M}\)

0 ≤ t < p,M|p − 1

0 ≤ t < Q − 1,M|Q − 1

f(x) = xpoweresidusequence

g(x) = x + 1, Sidelnikosequence

Note that the elements of a power residue sequence are ordered in the additive group of \({\mathbb {F}}_{p}\), while the elements of a Sidel’nikov sequence are ordered in the multiplicative group of \({\mathbb {F}}_{Q}\).

Theorem 1

(Sidel’nikov [58], Lempel et al. [39], Sarwate [54]) The autocorrelation function of a power residue sequence is bounded by

$$|C_{{\textbf{u}}}(\tau)|\le 3, \tau{ \not\equiv } 0 \bmod{p}.$$

In particular, for M = 2,

$$C_{\textbf{u}}(\tau) \in \left\{\begin{array}{lll} \{-1\} & \tau { \not\equiv } 0 \bmod{p} & \text{if}\ p\equiv 3 \bmod{4}\\ & & ({which \ gives \ quadratic \ residue \ sequences}\\ && {with \ 2-level \ autocorrelation})\\ \{1, -3\} &\tau { \not\equiv } 0 \bmod{p} & \text{if} \ p\equiv 1 \bmod{4}. \end{array} \right.$$

And the autocorrelation function of a Sidel’nikov sequence is bounded by

$$|C_{{\textbf{v}}}(\tau)|\le 4, \tau{ \not\equiv } 0 \bmod{(Q-1)}.$$

In particular, for M = 2,

$$C_{{\textbf{v}}}(\tau) \in \left\{\begin{array}{lll} \{-2, 2\} & \tau { \not\equiv } 0 \bmod{(Q-1)} & \text{if} \ Q \equiv 3 \bmod{4}\\ \{0, -4\} &\tau { \not\equiv } 0 \bmod{(Q-1)} & \text{if} \ Q \equiv 1 \bmod{4}. \end{array} \right.$$

Remark 8

The autocorrelation of the power residue sequences is optimal from Remark 2. Note that the ambiguity, crosscorrelation, and DFT of power residue sequences and Sidel’nikov sequences are also known. The reader is referring to [38] and [37] for crosscorrelation, and [23, Theorems 4.8 and 4.12] for the summary of the other properties of those sequence sets.

4.3 Best known constructions for sequence sets

As we have introduced the history of the Weil representation sequences in Section 1.3, in the following, we directly list the bounds on ambiguity and correlation given by Schmidt. Note that “best” means that those constructions produce “best known” sequence sets with both low ambiguity and correlation as well as large sizes. Those results can be obtained by applying the Weil bounds.

Construction 1

(Weil Representation Sequence Sets, Wang and Gong (2011) [68] and Schmidt (2011) [57]). Let α be a primitive element of finite field \(\mathbb {F}_{p}=\{0, 1, \cdots , p-1\}\) and \(\mathbb {F}_{p}^{*}=\{1, 2, \cdots , p-1\}\). Recall \(\omega _{k}=e^{\frac {i 2\pi }{k}}\), k a positive integer. Let s(x,y,z) with the elements given by

$$\begin{array}{@{}rcl@{}} s_{(x, y, z)}(t) &=& \omega_{p-1}^{x\cdot \log_{\alpha}t}\omega_{p}^{yt^{2}+zt}, \\ 0&\le& t<p, 1\le x < p-1, 0\le y, z<p. \end{array}$$
(9)

Let

$$\begin{array}{@{}rcl@{}} S_{1}& =&\{{\textbf{s}}_{(x, y, 0}) | 1\le x < p-1, 0\le y<p\}, \\ T_{1}& =& \{{\textbf{s}}_{(x, y, z)} | 1\le x < p-1, 0\le y, z<p\}. \end{array}$$
(10)

We then have

  • The sequence set S1 contains (p − 2)p time-phase shift distinct sequences, and its extension, T1 contains (p − 2)p2 time-shift distinct sequences.

  • Each of those sequences has length p and the alphabetic set has size (p − 1)p.

  • The correlation, ambiguity and DFT are bounded by

(11)

Remark 9

If we only consider correlation, then the number of time-shift distinct sequences in T1 is approximately a cubic function of the length p.

Construction 2

(Wang-Gong-Yu (2013 [70])) With the notation above, let M|p − 1 and

$$s_{(x, y, z)}(t) = \omega_{M}^{x\cdot \log_{\alpha} t}\omega_{p}^{yt^{2}+zt}, 0\le t<p, 1\le x < M, 0\le y, z<p.$$
(12)

We define S2 by varying (x,y) in (12) and setting z = 0 and T2 by varying all (x,y,z). Then we have the following results.

  • There are (M − 1)p time-phase shift distinct sequences in S2 and (M − 1)p2 time-shift distinct sequences in T2.

  • Each sequence has length p and the alphabetic set with size Mp.

  • The bounds are given by (11).

Note that Construction 1 is the case of Construction 2 for M = p − 1.

Construction 3

(Wang-Gong-Yu (2013), Gong (2012) [23]). For 1 ≤ x < M,1 ≤ y < L where L = pn − 1, α a primitive element of \(\mathbb {F}_{p^{n}}\) with h(α) = 0 where h(x) = xn + cn− 1xn− 1 + ⋯ + c1x + c0, \(c_{i}\in \mathbb {F}_{p}\), a primitive polynomial of degree n over \(\mathbb {F}_{p}\), M, a factor of L. The elements of s(x,y) are defined as follows.

(13)

We define S3 a set consisting of the sequences s(x,y) by varying all (x,y). Then

  • There are (M − 1)L time-phase shift distinct sequences in S3.

  • Each sequence has length L and the alphabetic set with size Mp.

  • The bounds on correlation, ambiguity function and DFT given by (11) where p is replaced by Q = pn.

Note that u is a p-ary m-sequence of period L, i.e., an m-sequence over a finite field \({\mathbb {F}}_{p}\) of degree n, with \(u(t)=Tr(\beta \alpha ^{t}), \beta \in \mathbb {F}_{p^{n}}\), the trace representation of u. Furthermore, \(v(t)=\log _{\alpha } (\alpha ^{t}+1) \bmod {M}\) is an M-ary Sidel’nikov sequence (Sidel’nikov 1969) of period L = pn − 1. So this design yields a sequence which is the term-wise product of a modulated p-ary m-sequence and a modulated M-ary Sidel’nikov sequence.

All three constructions have the same bounds on the autocorrelation and auto ambiguity function, crosscorrelation, cross ambiguity function and DFT.

We have the following construction, which is given by Ding et al. [14] using Fact 2.

Construction 4

(W. Li (1995) [40] and Ding et al. (2013) [14]) Let

$$s_{a}(t)=\omega_{p}^{Tr(a\alpha^{t}+\alpha^{-t})}, a\in {\mathbb{F}}_{Q}$$

and \(S_{4}=\{{\textbf{s}}_{a}, {\textbf{s}}_{-1}| a\in {\mathbb {F}}_{Q}\}\) where s− 1 = {Tr(αt)}. Then S4 has Q + 1 time-phase shift distinct sequences, and correlation, ambiguity, and DFT are bounded by \(2 \sqrt {Q}\), i.e.,

$$G_{\max}, A_{\max}, |\hat{{\textbf{s}}}_{a}(k)|\le 2 \sqrt{Q}.$$

Remark 10

Note the sequence s− 1 is not included in the construction given in [14]. But the bound will not be changed by adding this sequence according to Fact 2.

The choices for Constructions 2 and 3 lie in the trade-off for getting a small alphabetic size at the cost of the decreasing the size of the sequence set, which are listed in Table 2. Construction 4 gives a sequence set with the smallest alphabetic size p for period pn − 1. However, the sizes of the sequence set from each of the other three constructions can be in the order of the square of the size of Construction 4 at the price of increasing alphabetic sizes (Tables 3 and 4 ).

Table 2 Good parameters for sequence sets with low ambiguity and DFT

In the following, we show some examples from Construction 3, which have more efficient hardware implementation. We call the table with entries (t,v(t)) a trinomial table if v(t) is computed as follows

$$\alpha^{t}+1=\alpha^{v(t)}, 0\le t<p^{n}-1.$$

In other words, \(v(t)= \log _{\alpha }(\alpha ^{t}+1)\).

Example 7

Both the following two cases produce 6-ary sequences from Construction 3.

Case 1. For q = 24 − 1 = 15 and 3 is a divisor of 15: let α be a primitive element in \({\mathbb {F}}_{2^{4}}\) with α4 + α + 1 = 0. The sequences {v(t)} and {u(t)} are defined as follows.

By varying (x,y) in the range of 0 < x < 3,0 ≤ y < 15, S3 has 30 time-phase shift distinct 6-ary sequences of period 15.

Case 2. For q = 33 − 1, let α be a primitive element of \({\mathbb {F}}_{3^{3}}\) with α3 + 2α2 + 1 = 0. The sequences {v(t)} and {u(t)} are defined as follows.

Hence this produces a sequence set of the 6-ary sequences of period 26 with 27 time-phase shift distinct sequences.

Example 8

For Q = 28, we give an example for Construction 3. The trinomial table for n = 8 is a list of the v(t) values such that

$$\alpha^{t}+1=\alpha^{v(t)}, t=0, 1, \cdots, 254$$

which is given in Table 5. Let α be a primitive element in GF(28) with α8 + α4 + α3 + α2 + 1 = 0. Then {u(t)} is given by

$$u(t)=Tr(\gamma \alpha^{t}), \gamma\in {\mathbb{F}}_{2^{8}}$$

or equivalently,

$$\begin{array}{@{}rcl@{}} u(t+8)&=& u(t+4)+u(t+3)+u(t+2)+u(t), t=0, 1, \cdots, \\ &&(u(0), \cdots, u(7))=(0, 0, 0, 0, 0, 1, 0, 0, 0). \end{array}$$

Let S3 be the set consisting of the following sequences for all (x,y):

$$s_{(x, y)}(t) = \omega_{255}^{xv(t)} \times (-1)^{u(t+y)}, 0<x<255, 0\le y<255.$$

We now construct two sequences to illustrate their correlation and ambiguity.

$$\begin{array}{@{}rcl@{}} s_{0}(t)& =& \omega_{255}^{v(t)} \times (-1)^{u(t)}, \qquad (x, y)=(1, 0)\\ s_{1} (t)& =& \omega_{255}^{v(t)} \times (-1)^{u(t+1)}, \quad (x, y)=(1, 1)\\ \end{array}$$

where {u(t)} and {v(t)} are given in Tables 4 and 5 respectively. The properties of S3 is presented in Table 3. For comparisons, we also list the properties of the sequence set from Construction 4. Construction 3 produces 64770 time-phase shift distinct sequences, i.e., the product sequences of Sidel’nikov sequences with 254 phases and binary m-sequences. Construction 4 only contains 257 time-phase shift distinct sequences, but has a smaller maximal ambiguity value than that from Construction 3.

Note that if there are enough resources in memory, then the trinomial table can be stored. In this case, the implementation cost is similar as the case for Construction 4. The sketches of auto/crosscorrelation and ambiguity functions of s0 and s1 are given in Figs. 34 and 5.

Table 3 The sequence sets of \({\mathbb {F}}_{2^{8}}\)
Table 4 m-sequence \(\{u(t)\}_{t=0}^{254}\)
Table 5 The sequence \(\{v(t)\}_{t=0}^{254}\) where αt + 1 = αv(t)
Fig. 3
figure 3

Auto ambiguity function of sequence s0

Fig. 4
figure 4

Auto ambiguity function of sequence s1

Fig. 5
figure 5

Cross ambiguity function of the sequences s0 and s1

5 Aperiodic complementary sequences and arrays

In this section, we revisit a recent method to construct CSSs and CCCs by PU matrices and corresponding constructions [71]. We extend the size of generalized PU matrices from 2n to pn where p is an arbitrary integer in this survey. Moreover, we adopt the approach from [69] and show a sketch of the proof on extracting the function from a generalized seed PU matrix, which greatly simplifies the proof process in [71].

5.1 A viewpoint from arrays and PU matrices

A q-ary sequence f = (f(0),f(1),⋯ ,f(L − 1)) can be associated with its generating function given by

$$F(Z)=\sum\limits_{t=0}^{L-1}\omega_{q}^{f(t)}Z^{t}.$$
(14)

It is helpful to view the CSS and CCC by the generating functions of the sequences. The complex conjugate of F(Z) is denoted by \(\overline {F}(Z)={\sum }_{t=0}^{L-1}\omega ^{-f(t)}Z^{t}\).

Suppose that F0(Z) and F1(Z) are the generating functions of q-ary sequences f0 and f1, respectively. The polynomial \(F_{0}(Z)\overline {F}_{1}(Z^{-1})\) records all the aperiodic values:

$$F_{0}(Z)\overline{F}_{1}(Z^{-1})=\sum\limits_{\tau=1-L}^{L-1}C_{\boldsymbol{f}_{0},\boldsymbol{f}_{1}}(\tau)Z^{\tau}.$$
(15)

Then S = {f0,f1,⋯ ,fN− 1} forms a CSS if and only if their generating functions {F0(Z),F1(Z), ⋯ ,FN− 1(Z)} satisfy

$$\sum\limits_{j=0}^{N-1} F_{j}(Z)\overline{F}_{j}(Z^{-1})=NL.$$
(16)

Moreover, two CSSs S0 = {f0,0,f0,1,⋯ ,f0,N− 1} and S1 = {f1,0,f1,1,⋯ ,f1,N− 1} are mutually orthogonal if and only if their respective generating functions

$$\{F_{i,0}(Z), F_{i,1}(Z),\cdots, F_{i, N-1}(Z)\}, i=0,1$$

satisfy

$$\sum\limits_{j=0}^{N-1} F_{0,j}(Z)\overline{F}_{1,j}(Z^{-1})=0.$$
(17)

Complementary arrays

An m-dimensional q-ary array of size L0 × L1 ×⋯ × Lm− 1 can be represented by a function mapping from \(\mathbb {Z}_{L_{0}}\oplus \mathbb {Z}_{L_{1}}\oplus \cdots \oplus {\mathbb {Z}}_{L_{m-1}}\) to \({\mathbb {Z}}_{q}\): f(x) = f(x0,x1,⋯ ,xm− 1) for \(x_{j}\in {\mathbb {Z}}_{L_{j}}\). The (multivariate) generating function of array f(x) is defined by

$$F(\boldsymbol{z})=\sum\limits_{x_{0}=0}^{L_{0}-1}\sum\limits_{x_{1}=0}^{L_{1}-1}{\cdots} \sum\limits_{x_{m-1}=0}^{L_{m-1}-1}\omega^{f(\boldsymbol{x})}z_{0}^{x_{0}}z_{1}^{x_{1}}{\cdots} z_{m-1}^{x_{m-1}}$$
(18)

for z = (z0,z1,⋯ ,zm− 1). Denote \(\boldsymbol {z}^{-1}=(z_{0}^{-1},z_{1}^{-1},\cdots , z_{m-1}^{-1})\).

The concept of a Golay complementary pair was generalized to that of a Golay array pair (GAP) in [43], and the concepts of CSSs and CCCs were generalized from sequences to arrays in [49].

Definition 14

A set of arrays {f0(x),f1(x),⋯ ,fN− 1(x)} is called a complementary array set (CAS) of size N if their generating functions {F0(z),F1(z),⋯ ,FN− 1(z)} satisfy

$$\sum\limits_{u=0}^{N-1}F_{u}(\boldsymbol{z})\cdot \overline{F}_{u}(\boldsymbol{z}^{-1})=N\cdot p^{m}.$$
(19)

Two CASs

$$\begin{array}{@{}rcl@{}} S_{0}&=&\{f_{0,0}(\boldsymbol{x}), f_{0,1}(\boldsymbol{x}),\cdots, f_{0,N-1}(\boldsymbol{x})\}\\ S_{1}&=&\{f_{1,0}(\boldsymbol{y}), f_{1,1}(\boldsymbol{y}),\cdots, f_{1,N-1}(\boldsymbol{y})\} \end{array}$$

are said to be mutually orthogonal if their generating functions {Fu,0(z),Fu,1(z),⋯ ,Fu,N− 1(z)} (u = 0,1) satisfy

$$\sum\limits_{v=0}^{N-1} F_{0,v}(\boldsymbol{z})\overline{F}_{1,v}(\boldsymbol{z}^{-1})=0.$$
(20)

Definition 15

Let Su = {fu,0(x),fu,1(x),⋯ ,fu,N− 1(x)} (0 ≤ u < N) be CASs of size N, which are pairwise mutually orthogonal. We call such a collection of Su (0 ≤ u < N) a complete mutually orthogonal array set or a complete complementary arrays (CCA).

From arrays to sequences

For an array \(f(\boldsymbol {x}): {{\mathbb {Z}}_{p}^{m}} \rightarrow {\mathbb {Z}}_{q}\), we can specify a sequence \(\boldsymbol {f}: {\mathbb {Z}}_{p^{m}}\rightarrow {\mathbb {Z}}_{q}\) of length L = pm by listing the values of f(x) in lexicographic order of x:

$$f(t)=f(\boldsymbol{x})\ \text{for}\ t=\sum\limits_{k=0}^{m-1} x_{k}\cdot p^{k}, 0\le x_{k}<p.$$

We say that the sequence f is evaluated by the array f(x).

For an array \(f(\boldsymbol {x}): {{\mathbb {Z}}_{p}^{m}} \rightarrow {\mathbb {Z}}_{q}\) and its evaluated sequence \(\boldsymbol {f}: {\mathbb {Z}}_{p^{m}}\rightarrow {\mathbb {Z}}_{q}\), their generating functions satisfy F(z) = F(Z), if we set \(z_{k}=Z^{p^{k}}\). The relationship between sequence and array is summarized in Fig. 6.

Fig. 6
figure 6

Relationships between arrays, sequences, and their generating functions

Example 9

For an array from \({{\mathbb {Z}}_{2}^{2}}\) to \({\mathbb {Z}}_{4}\): f(x0,x1) = 2x0x1 + x0, its generating function is given by F(z0,z1) = 1 + iz0 + z1iz0z1. Sequence f, evaluated by the array f(x0,x1), is obtained by setting t = x0 + 2x1: f = {0,1,0,3}, which is equal to f0,0 given in Example 4 in Section 2.2. Then the generating function of sequence f is given by F(Z) = 1 + iZ + Z2iZ3.

An action of permutation π on the function f(x) from \({{\mathbb {Z}}_{p}^{m}}\) to \({\mathbb {Z}}_{q}\) is to permute the variables, i.e.,

$$\pi\cdot f(\boldsymbol{x})=f(\pi\cdot\boldsymbol{x})=f\left( x_{\pi(0)},x_{\pi(1)},\cdots, x_{\pi(m-1)}\right).$$

Fact 3

Let π be any permutation.

  • If a set of arrays {f0,f1,⋯ ,fN− 1} is a CAS, then the sequences evaluated by functions {πf0,πf1,⋯ ,πfN− 1} form a CSS.

  • If Si = {fi,0,fi,1,⋯ ,fi,N− 1} (0 ≤ i < N) is a CCA, the sequences evaluated by functions \(S^{\prime }_{i}=\{\pi \cdot f_{i,0}, \pi \cdot f_{i,1},\cdots , \pi \cdot f_{i,N-1}\}\) (0 ≤ i < N) form a CCC.

According to Fact 3, a larger number of CSSs and CCCs can be constructed from a single CCA by mapping arrays to sequences. For simplicity, we shall only show the construction of CCAs in this survey.

Remark 11

Note that the results in this subsection can be generalized to arrays of more flexible size L0 × L1 ×⋯ × Lm− 1 straightforwardly. Moreover, a larger number of lower dimensional CASs and CCAs can also be constructed from a high dimensional CCA.

PU matrices

For 0 ≤ u,vN − 1, let fu,v(x) be q-ary arrays of dimension m, and \(\widetilde {\boldsymbol {M}}(\boldsymbol {x})\) a matrix with entry fu,v(x), i.e.,

$$\boldsymbol{{\widetilde{M}}}({\boldsymbol{x}})= \left[\begin{array}{cccc} f_{0,0}({\boldsymbol{x}})&f_{0,1}({\boldsymbol{x}})&\dots&f_{0,N-1}({\boldsymbol{x}})\\ f_{1,0}({\boldsymbol{x}})&f_{1,1}({\boldsymbol{x}})&\dots&f_{1,N-1}({\boldsymbol{x}})\\ {\vdots} &{\vdots} &\ddots&{\vdots} \\ f_{N-1,0}({\boldsymbol{x}})&f_{N-1,1}({\boldsymbol{x}})&\dots&f_{N-1,N-1}({\boldsymbol{x}}) \end{array}\right].$$
(21)

The generating functions of the entries in matrix \(\widetilde {\boldsymbol {M}}({\boldsymbol {x}})\) can be presented by a matrix M(z) with each entry given by Mu,v(z) = Fu,v(z), the generating function of fu,v(x), i.e.,

$$\boldsymbol{M}({\boldsymbol{z}})= \left[\begin{array}{cccc} F_{0,0}({\boldsymbol{z}})&F_{0,1}({\boldsymbol{z}})&\dots&F_{0,N-1}({\boldsymbol{z}})\\ F_{1,0}({\boldsymbol{z}})&F_{1,1}({\boldsymbol{z}})&\dots&F_{1,N-1}({\boldsymbol{z}})\\ {\vdots} &{\vdots} &\ddots&{\vdots} \\ F_{N-1,0}({\boldsymbol{z}})&F_{N-1,1}({\boldsymbol{z}})&\dots&F_{N-1,N-1}({\boldsymbol{z}}) \end{array}\right].$$
(22)

M(z), given by (22) is called the generating matrix of \(\widetilde {\boldsymbol {M}}({\boldsymbol {x}})\), given by (21).

Definition 16

Let M(z) be a square multivariate polynomial matrix of order N. If

$$\boldsymbol{M}(\boldsymbol{z})\cdot\boldsymbol{M}^{\dagger}(\boldsymbol{z}^{-1})=c\cdot \boldsymbol{I}_{N},$$

where (⋅) denotes the Hermitian transpose, IN is the identity matrix of order N, and c is a real constant. We say that M(z) is a para-unitary (PU) matrix.

According to (19) and (20), it is necessary that M(z) is a PU matrix if \(\widetilde {\boldsymbol {M}}({\boldsymbol {x}})\) forms a CCA. However, the PU condition is not sufficient since the entries of a PU matrix may not be able to map to polyphase arrays. A PU matrix M(z) is called a desired PU matrix if it is the generating matrix of a function matrix \({\boldsymbol {\widetilde {M}}}(\boldsymbol {x})\), i.e., each entry is a function from \({\mathbb {Z}_{p}^{m}}\) to \(\mathbb {Z}_{q}\). Then the process to construct CCA can be divided into two steps:

  1. (1)

    Construct a desired PU matrix M(z).

  2. (2)

    Find the function matrix \(\boldsymbol {{\widetilde {M}}}(\boldsymbol {x})\) from its generating matrix M(z).

Example 10

It is easy to verify \(\boldsymbol {M}(z_{0}, z_{1})\cdot \boldsymbol {M}^{\dagger }(z_{0}^{-1}, z_{1}^{-1})=8\cdot \boldsymbol {I}_{2}\), where

$$\boldsymbol{M}(z_{0}, z_{1})=\left[\begin{array}{cc} 1+iz_{0}+z_{1}-iz_{0}z_{1} & 1+iz_{0}-z_{1}+iz_{0}z_{1} \\ 1-iz_{0}+z_{1}+iz_{0}z_{1} & 1-iz_{0}-z_{1}-iz_{0}z_{1} \end{array}\right].$$

Then M(z0,z1) is a desired PU matrix. We can extract its function matrix

$$\boldsymbol{{\widetilde{M}}}(x_{0}, x_{1})= \left[\begin{array}{cc} 2x_{0}x_{1}+x_{0} & 2x_{0}x_{1}+x_{0}+2x_{1} \\ 2x_{0}x_{1}+3x_{0} & 2x_{0}x_{1}+3x_{0}+2x_{1} \end{array}\right],$$

which forms a CCA. Then the sequences evaluated by the arrays in \(\boldsymbol {{\widetilde {M}}}(x_{0}, x_{1})\), presented by

$$S=\left[\begin{array}{cc} \boldsymbol{f}_{0,0}=(0,1,0,3) & \boldsymbol{f}_{0,1}=(0,1,2,1) \\ \boldsymbol{f}_{1,0}=(0,3,0,1) & \boldsymbol{f}_{1,1}=(0,3,2,3) \end{array}\right],$$

is a CCC, which is the quaternary CCC, given in Example 4 in Section 2.2. Moreover, we can construct many CCCs from the CCA \(\boldsymbol {{\widetilde {M}}}(x_{0}, x_{1})\) according to Fact 3.

5.2 Construction I: ingredients and basic recursive formula

Ingredients of desired PU matrices and function matrices

We first introduce two ingredients of PU matrices. One is Butson-type Hadamard matrices, which will be defined below, the simplest desired PU matrices, and the other is delay matrices.

A complex matrix H of order N is called a Butson-type Hadamard (BH) matrix [8] if \(\boldsymbol {H}\cdot \boldsymbol {H}^{\dagger }=N\cdot \boldsymbol {I}_{N}\) and all the entries of H are q th roots of unity.

For example, if N = q, DFT matrix H with entry Hu,v = ωuv is a BH matrix; if N = 2n and q = 2, Walsh Hadamard Transform (WHT) matrix H with entry Hu,v = ωTr(uv) for \(u,v\in {\mathbb {F}}_{2^{n}}\) is also a BH matrix.

For given N and q, the set of all BH matrices is denoted by H(q,N). BH matrices are the simplest desired PU matrices. Another type of desired PU matrices of dimension 1 and order 2 can be constructed from the so-called non-standard Golay pairs, introduced in [16, 41].

Now we introduce our second ingredient, i.e., delay matrices.

Definition 17

A delay matrix DN(z) of order N is defined by

$$\boldsymbol{D}_{N}(z)=diag\left\{z^{0}, z^{1}, \cdots, z^{N-1}\right\}.$$
(23)

If N = pn, the delay matrix can be presented as a multi-variate polynomial with multi-variables z = (z0,z1,⋯ ,zn− 1), obtained by the Kronecker tensor product:

$$\boldsymbol{D}_{N}(\boldsymbol{z})=\boldsymbol{D}_{p}(z_{n-1}) \otimes{\cdots} \otimes \boldsymbol{D}_{p}(z_{1})\otimes \boldsymbol{D}_{p}(z_{0}).$$
(24)

This is called a generalized delay matrix.

It is straightforward to show that the generalized delay matrix DN(z) can be explicitly expressed by a diagonal matrix

$$\boldsymbol{D}_{N}(\boldsymbol{z})=diag\{\phi_{0}(\boldsymbol{z}),\phi_{1}(\boldsymbol{z}), \cdots, \phi_{p^{n}-1}(\boldsymbol{z})\},$$

where (x0,x1⋯ ,xn− 1) is p-ary expansion of y and \(\phi _{y}(\boldsymbol {z})={\prod }_{j=0}^{n-1}z_{j}^{x_{j}}\).

For example, if N = 4, the delay matrix is given by

$$\boldsymbol{D}_{4}(z)=diag\left\{z^{0}, z^{1}, z^{2}, z^{3}\right\},$$

while the generalized delay matrix is represented by

$$\boldsymbol{D}_{4}(z_{0}, z_{1})=diag\left\{{z_{1}^{0}}{z_{0}^{0}}, {z_{1}^{0}}{z_{0}^{1}}, {z_{1}^{1}}{z_{0}^{0}}, {z_{1}^{1}}{z_{0}^{1}}\right\}=diag\left\{1, {z_{0}^{1}}, {z_{1}^{1}}, {z_{1}^{1}}{z_{0}^{1}}\right\}.$$

Next we introduce two ingredients for functions, which will be used to extract functions from the desired PU matrices. One is the phase matrix of a BH matrix, and the other is the Kronecker-delta function.

For a BH matrix HH(q,N), define its phase matrix \(\widetilde {\boldsymbol {H}}\) by \(\widetilde {H}_{u,v}=h(u, v)\) if Hu,v = ωh(u,v), where h(u,v) is a function from \({{\mathbb {Z}}_{N}^{2}}\) to \({\mathbb {Z}}_{q}\) for \(u,v\in {\mathbb {Z}}_{N}\). Actually, a BH matrix H can be treated as a desired PU matrix, and its phase matrix \(\widetilde {\boldsymbol {H}}\) is the corresponding function matrix.

In this survey, we adopt the approach from [69] where a phase matrix of a BH matrix is replaced by the function h(u,v) from \({\mathbb {Z}_{N}^{2}}\) to \(\mathbb {Z}_{q}\) to simplify the proof process. For example, h(u,v) = uv represents the phase matrix of the DFT matrix of order N for \(u,v\in \mathbb {Z}_{N}\), while h(u,v) = Tr(uv) is the phase matrix of the WHT matrix of order N = 2n for \(u,v\in \mathbb {F}_{2^{n}}\).

Definition 18

Let δu be a Kronecker-delta function from \(\mathbb {Z}_{N}\) to \(\mathbb {Z}_{q}\) given by

$$\delta_{u}(y)=\left\{\begin{array}{cc} &1, \text{if} \ y=u, \\ &0, \text{if} \ y\neq u. \end{array}\right.$$
(25)

Moreover, if N = pn, we have

$$\delta_{u}(y)=\prod\limits_{j=0}^{n-1}\delta_{u_{j}}({x_{j}}),$$

where x = (x0,x1,⋯ ,xn− 1) and (u0,u1,⋯ ,un− 1) are p-ary expansions of y and u respectively, and \(\delta _{u_{j}}\) is a Kronecker-delta function from \({\mathbb {Z}}_{p}\) to \({\mathbb {Z}}_{q}\) defined by (25). In this case, we use function δu(x) instead of δu(y) from now on.

Note that the set consisting of all the Kronecker-delta functions, i.e., \(\{\delta _{u} | u\in \mathbb {Z}_{N}\}\) forms a linear basis of all functions from \(\mathbb {Z}_{N}\) to \(\mathbb {Z}_{q}\). So, we also call it a Kronecker-delta (function) basis.

Example 11

For N = q = 3, the Kronecker-delta functions from \(\mathbb {Z}_{3}\) (or \({\mathbb {F}}_{3}\)) to \(\mathbb {Z}_{3}\) are given by

$$\left\{\begin{array}{ccc} \delta_{0}(y)&=2y^{2}+1,\\ \delta_{1}(y)&=2y^{2}+2y,\\ \delta_{2}(y)&=2y^{2}+y. \end{array}\right.$$

The monomial basis of all functions from \(\mathbb {Z}_{3}\) to \(\mathbb {Z}_{3}\) is {1,y,y2}, while the set of these Kronecker-delta functions is another basis.

Example 12

For N = 4, the Kronecker-delta functions from \(\mathbb {Z}_{4}\) (or \({{\mathbb {F}}_{2}^{2}}\)) to \(\mathbb {Z}_{q}\) are given by

$$\left\{\begin{array}{lll} \delta_{0}(y)=(1-x_{0})(1-x_{1}),\\ \delta_{1}(y)=x_{0}(1-x_{1}),\\ \delta_{2}(y)=(1-x_{0})x_{1},\\ \delta_{3}(y)=x_{0}x_{1}, \end{array}\right.$$

where (x1,x0) is the binary expansion of y. The monomial basis of all generalized Boolean functions from \(\mathbb {Z}_{4}\) to \(\mathbb {Z}_{q}\) is {1,x0,x1,x0x1}, while the set of the Kronecker-delta functions is another basis.

Definition 19

The Δ-matrix of order N is a diagonal matrix defined by

$$\boldsymbol{{{\varDelta}}}_{N}(y)=diag\{\delta_{0}(y), \delta_{1}(y), \cdots, \delta_{N-1}(y)\}.$$

If N = pn, δu(y) can be replaced by the function δu(x), where x = (x0,x1,⋯ ,xn− 1) which is the p-ary expansions of y. Then we use the matrix ΔN(x) instead of ΔN(y).

Example 13

According to Example 12, the Δ-matrix of order 4 can be represented by

$$\boldsymbol{{{\varDelta}}}_{4}(x_{0}, x_{1})= \left[\begin{array}{cccc} (1-x_{0})(1-x_{1}) & 0 &0&0\\ 0 & x_{0}(1-x_{1})&0&0\\ 0 &0&(1-x_{0})x_{1}&0\\ 0 &0&0& x_{0}x_{1} \end{array}\right].$$

In the rest of the paper, if the context is clear, we use D(z),Δ(x),I and J instead of DN(z),ΔN(x),IN and JN, respectively, where IN and JN represent the identity matrix and all 1 matrix of order N, respectively.

A basic recursive formula for desired PU matrices

Let A(z1) and B(z2) be the generating matrices of the function matrices \(\widetilde {\boldsymbol {A}}(\boldsymbol {x}_{1})=\{a_{i,j}(\boldsymbol {x}_{1})\}\) and \(\widetilde {\boldsymbol {B}}(\boldsymbol {x}_{2})=\{b_{i,j}(\boldsymbol {x}_{2})\}\) of order N = pn respectively, where ai,j(x1) and bi,j(x2) are functions from \(\mathbb {Z}_{p}^{m_{1}}\) to \({\mathbb {Z}}_{q}\) and \({\mathbb {Z}}_{p}^{m_{2}}\) to \({\mathbb {Z}}_{q}\) respectively. Note that A(z1) and B(z2) are not required to be PU matrices here.

A recursive formula for constructing desired PU matrices from lower dimensions to higher dimensions was shown in [71, Lemma 9]. We extend it to the form given by the following theorem.

Theorem 2

Let C(z0,z1,z2) be a multivariate polynomial matrix defined by

$$\boldsymbol{C}(\boldsymbol{z}_{0},\boldsymbol{z}_{1},\boldsymbol{z}_{2})=\boldsymbol{A}(\boldsymbol{z}_{1})\cdot\boldsymbol{D}(\boldsymbol{z}_{0})\cdot\boldsymbol{B}(\boldsymbol{z}_{2}),$$

where \(\boldsymbol {z}_{\boldsymbol {0}}=(z_{0}, z_{1},\dots , z_{n-1})\). Then C(z0,z1,z2) is the generating matrix of the function matrix \(\widetilde {\boldsymbol {C}}(\boldsymbol {x}_{0},\boldsymbol {x}_{1},\boldsymbol {x}_{2})\) given by

$$\widetilde{\boldsymbol{C}}(\boldsymbol{x}_{0},\boldsymbol{x}_{1},\boldsymbol{x}_{2})=\widetilde{\boldsymbol{A}}(\boldsymbol{x}_{1})\cdot \boldsymbol{{{\varDelta}}}(\boldsymbol{x}_{0})\cdot \boldsymbol{J}+\boldsymbol{J}\cdot \boldsymbol{{{\varDelta}}}(\boldsymbol{x}_{0})\cdot \widetilde{\boldsymbol{B}}(\boldsymbol{x}_{2}),$$
(26)

where \(\boldsymbol {x}_{\boldsymbol {0}}=(x_{0}, x_{1},\dots , x_{n-1})\in {{\mathbb {Z}}_{2}^{n}}\).

Moreover, if both A(z1) and B(z2) are desired PU matrices, then C(z0,z1,z2) is also a desired PU matrix.

Proof

For the case p = 2, it has been shown in [71, Lemma 9] that each entry of \(\widetilde {\boldsymbol {C}}(\boldsymbol {x}_{0},\boldsymbol {x}_{1},\boldsymbol {x}_{2})\) can be presented by

$$c_{u,v}(\boldsymbol{x}_{0},\boldsymbol{x}_{1},\boldsymbol{x}_{2}) =\sum\limits_{i=0}^{N-1}\left( a_{u,i}(\boldsymbol{x}_{1})+b_{i,v}(\boldsymbol{x}_{2})\right)\delta_{i}(\boldsymbol{x}_{0}),$$
(27)

for 0 ≤ u,v < N. It is straightforward to show that this result are valid for arbitrary integer p. In addition, one can verify that formulae (27) and (26) are equivalent, which completes the proof.

5.3 Construction II: generalized seed PU matrices and function matrices

In this subsection, we set N = pn, where p is not necessarily a prime.

Generalized seed PU matrices and function matrices

By recursively substituting the BH matrices as the desired PU matrices in Theorem 2, we can immediately obtain the following desired PU matrices, which are called generalized seed PU matrices.

Theorem 3

Let H{k} be arbitrary BH matrices chosen from H(q,N = pn) for 0 ≤ km and D(zk) be the generalized delay matrices of order N with zk = (zkn,zkn+ 1,⋯ ,zkn+n− 1) for 0 ≤ k < m. Then the following multivariate polynomial matrix

$$\boldsymbol{M}(\boldsymbol{z})=\boldsymbol{H}^{\{0\}}\cdot \boldsymbol{D}(\boldsymbol{z}_{0})\cdot \boldsymbol{H}^{\{1\}}\cdot\boldsymbol{D}(\boldsymbol{z}_{1}){\cdots} \boldsymbol{H}^{\{m-1\}}\cdot \boldsymbol{D}(\boldsymbol{z}_{m-1})\cdot \boldsymbol{H}^{\{m\}}$$
(28)

is a desired PU matrix of order N, where z = (z0,z1,⋯zm− 1). Moreover, the function matrix of M(z) is given by

$$\begin{array}{@{}rcl@{}} \widetilde{\boldsymbol{M}}(\boldsymbol{x})&=&\widetilde{\boldsymbol{H}}^{\{0\}}\cdot \boldsymbol{{{\varDelta}}}(\boldsymbol{x}_{0})\cdot \boldsymbol{J}+\boldsymbol{J}\cdot\boldsymbol{{{\varDelta}}}(\boldsymbol{x}_{m-1})\cdot \widetilde{\boldsymbol{H}}^{\{m\}}\\ &&+\sum\nolimits_{k=1}^{m-2}\boldsymbol{J}\cdot \boldsymbol{{{\varDelta}}}(\boldsymbol{x}_{k})\cdot\widetilde{\boldsymbol{H}}^{\{k\}}\cdot\boldsymbol{{{\varDelta}}}(\boldsymbol{x}_{k+1})\cdot\boldsymbol{J}, \end{array}$$
(29)

where \(\boldsymbol {x}_{\boldsymbol {k}}= (x_{kn}, x_{kn+1}, \cdots , x_{kn+n-1})\in {{\mathbb {Z}}_{p}^{n}}\), \(\boldsymbol {x}=(\boldsymbol {x}_{0},\boldsymbol {x}_{1},\cdots , \boldsymbol {x}_{m-1})\in \mathbb {Z}_{p}^{mn}\), and \(\widetilde {\boldsymbol {H}}^{\{k\}}\) is the phase matrix of H{k}.

Proof

For the case p = 2, it has been shown in [71, Theorem 6] that M(z) is a desired PU matrix of order N = 2n. It is straightforward show that this result is valid for arbitrary integer p.

In addition, formula (29) can be proved by iteratively applying formula (26) in Theorem 2.

Although PU matrices M(z) have been constructed in Theorem 3, it is difficult to give a general form of the function matrices \(\widetilde {\boldsymbol {M}}(\boldsymbol {x})\), since the BH matrices \(\widetilde {\boldsymbol {H}}^{\{k\}}\) can be arbitrarily chosen. To solve this problem, a very complicated method to extract the functions in \(\widetilde {\boldsymbol {M}}(\boldsymbol {x})\) was shown in [71]. In this paper, we provide a sketch of an alternative proof coming from the view in [69] where each BH matrix \(\widetilde {\boldsymbol {H}}^{\{k\}}\) is treated as a function from \({{\mathbb {Z}}_{N}^{2}}\) to \({\mathbb {Z}}_{q}\). In this way, the function matrices \(\widetilde {\boldsymbol {M}}(\boldsymbol {x})\) can be given by the following corollary.

Corollary 1

Let the phase matrix \(\widetilde {\boldsymbol {H}}^{\{k\}}\) be presented by the function h{k}(u,v) from \({\mathbb {Z}_{N}^{2}}\) to \(\mathbb {Z}_{q}\) for 0 ≤ km, then the phase matrix \(\widetilde {\boldsymbol {M}}(\boldsymbol {x})\) in Theorem 3 can be alternatively expressed by

$$\begin{array}{@{}rcl@{}} \widetilde{\boldsymbol{M}}(\boldsymbol{x})&=& \sum\limits_{k=1}^{m-2}h^{\{k\}}(\boldsymbol{x}_{k}, \boldsymbol{x}_{k+1} )\cdot\boldsymbol{J} +\left[\begin{array}{ccccc} h^{\{0\}}(0, \boldsymbol{x}_{\boldsymbol{0}})&0&\cdots&0\\ 0&h^{\{0\}}(1, \boldsymbol{x}_{\boldsymbol{0}})&\cdots&0\\ \vdots&\vdots&\ddots&\vdots\\ 0&0&\cdots&h^{\{0\}}(N-1, \boldsymbol{x}_{\boldsymbol{0}}) \end{array}\right]\cdot \boldsymbol{J}\\ &&+\boldsymbol{J}\cdot \left[\begin{array}{ccccc} h^{\{m\}}(\boldsymbol{x}_{\boldsymbol{m-1}}, 0)&0&\cdots&0\\ 0&h^{\{m\}}(\boldsymbol{x}_{\boldsymbol{m-1}}, 1)&\cdots&0\\ \vdots&\vdots&\ddots&\vdots\\ 0&0&\cdots&h^{\{m\}}(\boldsymbol{x}_{\boldsymbol{m-1}}, N-1) \end{array}\right]. \end{array}$$

Proof

Let the phase matrix \(\widetilde {\boldsymbol {H}}\) be presented by the function h(u,v) from \({{\mathbb {Z}}_{N}^{2}}\) to \({\mathbb {Z}}_{q}\). From the definition of the Kronecker-delta functions, we can easily verify the following equalities:

$$\begin{array}{@{}rcl@{}} \boldsymbol{J}\cdot \boldsymbol{{{\varDelta}}}(\boldsymbol{x}_{k})\cdot\widetilde{\boldsymbol{H}}\cdot\boldsymbol{{{\varDelta}}}(\boldsymbol{x}_{k+1})\cdot\boldsymbol{J}&=&h^{\{k\}}(\boldsymbol{x}_{k}, \boldsymbol{x}_{k+1} )\cdot\boldsymbol{J},\\ \widetilde{\boldsymbol{H}}\cdot \boldsymbol{{{\varDelta}}}(\boldsymbol{x}_{0})\cdot \boldsymbol{J}&=&diag\left\{h^{\{0\}}(0, \boldsymbol{x}_{\boldsymbol{0}}), h(1, \boldsymbol{x}_{\boldsymbol{0}}),\cdots, h(N-1, \boldsymbol{x}_{\boldsymbol{0}})\right\}\cdot\boldsymbol{J},\\ \boldsymbol{J}\cdot\boldsymbol{{{\varDelta}}}(\boldsymbol{x}_{m-1})\cdot \widetilde{\boldsymbol{H}}&=&\boldsymbol{J}\cdot diag\left\{h(\boldsymbol{x}_{\boldsymbol{m-1}}, 0), h(\boldsymbol{x}_{\boldsymbol{m-1}}, 1),\cdots, h(\boldsymbol{x}_{\boldsymbol{m-1}}, N-1)\right\}. \end{array}$$

Then the proof is completed by formula (29).

Example 14

For the case N = 4 = 22, q = 4, and m = 3, let H{0} and H{2} be the DFT matrix of order 4, and H{1} and H{3} be the WHT matrix of order 4, namely,

$$\boldsymbol{H}^{\{0\}}=\boldsymbol{H}^{\{2\}}= \left[\begin{array}{cccc} 1 & 1 &1&1\\1 &\sqrt{-1} &-1&-\sqrt{-1}\\ 1 &-1&1&-1\\ 1 &-\sqrt{-1}&-1& \sqrt{-1} \end{array}\right]$$

and

$$\boldsymbol{H}^{\{1\}}=\boldsymbol{H}^{\{3\}}= \left[\begin{array}{cccc} 1 & 1 &1&1\\1 & -1&1&-1\\ 1 &1&-1&-1\\ 1 &-1&-1& 1 \end{array}\right].$$

Then we have

$$\boldsymbol{M}(\boldsymbol{z})=\boldsymbol{H}^{\{0\}}\cdot \boldsymbol{D}(\boldsymbol{z}_{0})\cdot \boldsymbol{H}^{\{1\}}\cdot\boldsymbol{D}(\boldsymbol{z}_{1})\cdot \boldsymbol{H}^{\{2\}}\cdot \boldsymbol{D}(\boldsymbol{z}_{2})\cdot \boldsymbol{H}^{\{3\}},$$

which is a desired PU matrix of order 4.

Furthermore, the phase matrix of DFT matrix can be represented by the function h0(u,v) = uv from \({\mathbb {Z}}_{4}\) to \(\mathbb {Z}_{4}\) or alternatively in a multivariate polynomial, i.e., we have

$$h_{0}((u_{0}, u_{1}); (v_{0}, v_{1}))=(2u_{0}+u_{1})(2v_{0}+v_{1})=2u_{0}v_{1}+2u_{1}v_{0}+u_{1}v_{1},$$

which is a generalized Boolean function from \({\mathbb {Z}_{2}^{4}}\) to \(\mathbb {Z}_{4}\) where (u0,u1) and (v0,v1) are the binary expansion of u and v, respectively. The phase matrix of WHT matrix can be represented by the function h1(u,v) = 2Tr(uv) from \(\mathbb {F}_{2^{2}}\) to \(\mathbb {Z}_{4}\) or alternatively

$$h_{1}((u_{0}, u_{1}); (v_{0}, v_{1}))=2u_{0}v_{0}+2u_{1}v_{1},$$

which is a generalized Boolean function from \({\mathbb {Z}_{2}^{4}}\) to \(\mathbb {Z}_{4}\). Thus, these functions are given by h{0} = h{2} = h0 and h{1} = h{3} = h2. Thus we obtain the function matrix as follows.

$$\begin{array}{@{}rcl@{}} \widetilde{\boldsymbol{M}}(\boldsymbol{x})&=& (h_{1}(\boldsymbol{x}_{0}, \boldsymbol{x}_{1})+h_{0}(\boldsymbol{x}_{1}, \boldsymbol{x}_{2}))\cdot\boldsymbol{J} + \\ &&+ \left[\begin{array}{cccc} h_{0}((0, 0); \boldsymbol{x}_{\boldsymbol{0}})&0&0&0\\ 0& h_{0}((1, 0); \boldsymbol{x}_{\boldsymbol{0}})&0&0\\ 0&0& h_{0}((0, 1); \boldsymbol{x}_{\boldsymbol{0}})&0\\ 0&0&0& h_{0}((1, 1); \boldsymbol{x}_{\boldsymbol{0}}) \end{array}\right]\cdot \boldsymbol{J}\\ &&+ \left[\begin{array}{cccc} h_{1}(\boldsymbol{x}_{\boldsymbol{m-1}}; (0, 0))&0&0&0\\ 0& h_{1}(\boldsymbol{x}_{\boldsymbol{m-1}}; (1, 0))&0&0\\ 0&0& h_{1}(\boldsymbol{x}_{\boldsymbol{m-1}}; (0, 1))&0\\ 0&0&0& h_{1}(\boldsymbol{x}_{\boldsymbol{m-1}}; (1, 1)) \end{array}\right]\cdot \boldsymbol{J}. \end{array}$$

A general form of the functions

A general form of CCA \(\widetilde {\boldsymbol {M}}(\boldsymbol {x})\), extracted from the generalized seed PU matrix with form (28), has been recently obtained in [69, 71]. Below we will give an alternative proof for those results by Corollary 1, which are much simpler than those presented in [71].

Two BH matrices, say H1 and H are called equivalent, if there exist diagonal unitary matrices Q1 and Q2 where each diagonal entry is a q th root of unity and permutation matrices P1, P2 such that H1 = Q1P1HP2Q2.

Let h1(u,v) and h(u,v) be the phase matrices induced by BH matrices H1 and H, respectively. From the view in [69], there exist permutation functions g(⋅) and \(g^{\prime }(\cdot )\) of \({\mathbb {Z}}_{N}\), \(c_{y}, d_{y}\in {\mathbb {Z}}_{N}\), such that

$$h_{1}(u, v)=h(g(u), g^{\prime}(v))+\sum\limits_{y=0}^{N-1}c_{y}\delta_{y}(u)+\sum\limits_{y=0}^{N-1}d_{y}\delta_{y}(v).$$
(30)

Moreover, for a given BH matrix H, arbitrary permutation functions \(g(\cdot ), g^{\prime }(\cdot )\), and \(\forall c_{y}, d_{y}\in \mathbb {Z}_{N}\), there exists a BH matrix H1 satisfy the (30).

Recall that the Kronecker-delta functions form a linear basis of all functions from \({\mathbb {Z}}_{N}\) to Zq. Therefore the term \({\sum }_{y=0}^{N-1}c_{y}\delta _{y}(u)\) in formula (30) can produce arbitrary function l(y) from \(\mathbb {Z}_{N}\) to Zq. Thus the general form of CCAs \(\widetilde {\boldsymbol {M}}(\boldsymbol {x})\) extracted from generalized seed PU matrix in [71] can be obtained immediately. Before we show it, we introduce the definitions of two types of functions.

Definition 20

Let

$$\delta_{L}(q, N=p^{n})=\left\{\sum\limits_{k=0}^{m-1}l_{k}(\boldsymbol{x}_{\boldsymbol{k}}) \bigg| \forall l_{k}(\boldsymbol{x}_{\boldsymbol{k}}):{\mathbb{Z}_{p}^{n}}\rightarrow \mathbb{Z}_{q} \right\}$$

where each function in the set is referred to as a δ-linear term, and let

$$\delta_{Q}(q, N=p^{n}) = \{h(g(\boldsymbol{x}_{\boldsymbol{0}}), g^{\prime}(\boldsymbol{x}_{\boldsymbol{1}})) | g, g^{\prime}, h\},$$

where \(g(\cdot ), g^{\prime }(\cdot )\) are arbitrary permutation functions over \({{\mathbb {Z}}_{p}^{n}}\) and h(u,v) is any representative of a phase BH matrix with respect to the equivalence relationship. A function in this set is called a δ-quadratic term.

Remark 12

The functions in δL(q,N = pn) and δQ(q,N = pn) are called δ-linear terms and δ-quadratic terms, respectively, since these functions are linear and quadratic functions with respect to Kronecker-delta functions, respectively.

A general form of any entry of \(\widetilde {\boldsymbol {M}}(\boldsymbol {x})\), denoted by f(x), can be explicitly represented by a combination of these δ-linear terms and δ-quadratic terms.

Theorem 4

[71, Theorem 3] All the functions extracted from the generalized seed PU matrices with form (28) can be represented in a general form

$$f(\boldsymbol{x})=\sum\limits_{k=1}^{m-1}h_{k}(\boldsymbol{x}_{\boldsymbol{k-1}}, \boldsymbol{x}_{\boldsymbol{k}})+l(\boldsymbol{x}),$$
(31)

where hk(⋅,⋅) ∈ δQ(q,pn)(1 ≤ km − 1) and l(x) ∈ δL(q,pn).

The set of δ-linear terms is a free \({\mathbb {Z}}_{q}\)-module. To obtain all the functions with form (31), we only need to calculate those δ-quadratic terms \({\sum }_{k=1}^{m-1}h_{k}(\boldsymbol {x}_{\boldsymbol {k-1}}, \boldsymbol {x}_{\boldsymbol {k}})\), which are the coset representatives with respect to δL(q,pn). These coset representatives can be associated with a directed and weighted Hamilton path on m vertices, as shown in Fig. 7, which resembles the standard Golay sequences!

Fig. 7
figure 7

The graph of coset representatives

Theorem 5

[71, Theorem 5] Let f(x) be a function (or array) with the form (31) and h0(⋅,⋅),hm(⋅,⋅) ∈ δQ(q,pn). Then the entries of the function matrix of the generalized seed PU matrix with form (28) can be represented by

$$f_{u,v}(\boldsymbol{x})=f(\boldsymbol{x})+h_{0}(\boldsymbol{u},\boldsymbol{x}_{\boldsymbol{0}})+h_{m}(\boldsymbol{x}_{\boldsymbol{m-1}},\boldsymbol{v}), \ u, v\in {\mathbb{Z}}_{p^{n}},$$

where u and v are the p-ary expansion of u and v respectively.

We will show parameterized constructions of CAS and CCA of sizes 2, 3, 4 [71] by Theorems 4 and 5 as follows.

Constructions of q-ary Golay array pairs

We set N = p = 2 and q even in Theorems 4 and 5. Up to equivalence, there is only one BH matrix of order 2:

$$\left[\begin{array}{cc} 1 & 1 \\ 1 & -1 \end{array}\right],$$

namely, the WHT matrix of order 2, with phase matrix \(h(u,v)=\frac {q}{2}uv\) from \({{\mathbb {Z}}_{2}^{2}}\) to \({\mathbb {Z}}_{q}\).

Then the sets of δ-linear terms and δ-quadratic terms can be easily determined by

$$\begin{array}{@{}rcl@{}} \delta_{L}(q, 2)&=&\left\{\sum\limits_{k=0}^{m-1}c_{k}x_{k}+c^{\prime}\bigg| c_{k}, c^{\prime}\in {\mathbb{Z}}_{q} \right\},\\ \delta_{Q}(q, 2)&=&\left\{\frac{q}{2}x_{0}x_{1}\right\}. \end{array}$$

From Theorems 4 and 5, we obtain the function

$$f(\boldsymbol{x})=\frac{q}{2}\sum\limits_{k=1}^{m-1} x_{k-1}x_{k}+\sum\limits_{k=0}^{m-1}c_{k}x_{k}+c^{\prime}, \ \text{for} \ c_{k}, c^{\prime}\in {\mathbb{Z}}_{q},$$
(32)

and the corresponding function matrix \(\widetilde {\boldsymbol {M}}(\boldsymbol {x})\) (or CCA)

$$\widetilde{\boldsymbol{M}}(\boldsymbol{x})=f(\boldsymbol{x})\cdot \boldsymbol{J}_{2}+\frac{q}{2}\cdot \left[\begin{array}{cc} 0 & x_{m-1} \\ x_{0} & x_{0}+x_{m-1} \end{array}\right].$$

The construction, given above is identical to the construction of the standard Golay sequences given by Davis and Jedwab in their milestone work [12].

Constructions of ternary complementary array triads

We set N = p = q = 3 in Theorems 4 and 5. Up to equivalence, there exists only one BH matrix of order 3:

$$\left[\begin{array}{cccc} 1 & 1 & 1\\ 1 & \omega^{1} & \omega^{2}\\ 1 & \omega^{2} & \omega^{1} \end{array}\right],$$

where ω = ω3. This is the DFT matrix of order 3 with phase matrix h(u,v) = uv from \({{\mathbb {Z}}_{3}^{2}}\) to \(\mathbb {Z}_{3}\). The sets of δ-linear terms and δ-quadratic terms are determined by

$$\begin{array}{@{}rcl@{}} \delta_{L}(3, 3)&=&\left\{\sum\limits_{k=0}^{m-1}c_{k, 2}{x_{k}^{2}}+\sum\limits_{k=0}^{m-1}c_{k, 1}x_{k}+c^{\prime}\bigg| c_{k, 1},c_{k,2}, c^{\prime}\in \mathbb{Z}_{3} \right\}.\\ \delta_{Q}(3, 3)&=&\left\{x_{0}x_{1}, 2x_{0}x_{1}\right\}. \end{array}$$

From Theorem 4, we obtain the function

$$f(\boldsymbol{x})=\sum\limits_{k=1}^{m-1}d_{k} x_{k-1}x_{k}+\sum\limits_{k=0}^{m-1}c_{k, 2}{x_{k}^{2}}+\sum\limits_{k=0}^{m-1}c_{k, 1}x_{k}+c^{\prime},$$
(33)

for \(d_{k}\in {\mathbb {Z}}^{*}_{3}\) and \(c_{k, 1}, c_{k, 2}, c^{\prime }\in {\mathbb {Z}}_{3}\). Moreover, from Theorem 5, the corresponding function matrix \(\widetilde {\boldsymbol {M}}(\boldsymbol {x})\) (or CCA) is given by

$$\widetilde{\boldsymbol{M}}(\boldsymbol{x})=f\cdot \boldsymbol{J}_{3}+ \left[\begin{array}{ccc} 0 & x_{m-1} & 2x_{m-1} \\ x_{0} & x_{0}+x_{m-1} & x_{0}+2x_{m-1} \\ 2x_{0} & 2x_{0}+x_{m-1} & 2x_{0}+2x_{m-1} \end{array}\right].$$

This construction was first found in [71] using the methodology of the desired PU matrices.

Constructions of quaternary complementary arrays of size 4

We set p = n = 2, N = q = 4 in Theorems 4 and 5. The δ-linear terms is determined by

$$\delta_{L}(4, 4)=\left\{\sum\limits_{k=0}^{m-1}d_{k}x_{2k}x_{2k+1} +\sum\limits_{k=0}^{2m-1}c_{k}x_{k}+c^{\prime}\bigg| \forall d_{k}, c_{k}, c^{\prime}\in \mathbb{Z}_{4} \right\}.$$

Up to equivalence, there are only two quaternary BH matrices of order 4, one is the WHT matrix and the other, the DFT matrix of order 4, as shown in Example 14.

The WHT matrix of order 4 determines 6 δ-quadratic terms:

$$h(\boldsymbol{x}_{\boldsymbol{0}}, \boldsymbol{x}_{\boldsymbol{1}})=\left\{\begin{array}{ll} 2(x_{1}x_{3}+x_{0}x_{3}+x_{1}x_{2}),\\ 2(x_{0}x_{2}+x_{0}x_{3}+ x_{1}x_{2}),\\ 2(x_{1}x_{3}+x_{0}x_{2}),\\ 2(x_{0}x_{3}+x_{1}x_{2}),\\ 2(x_{0}x_{2}+x_{1}x_{3}+x_{1}x_{2}),\\ 2(x_{0}x_{2}+x_{0}x_{3}+ x_{1}x_{3}), \end{array}\right.$$

where x0 = (x0,x1) and x1 = (x2,x3). And the DFT matrix of order 4 determines 18 δ-quadratic terms:

$$h(\boldsymbol{x}_{\boldsymbol{0}}, \boldsymbol{x}_{\boldsymbol{1}})=dg(\boldsymbol{x}_{\boldsymbol{0}})g^{\prime}(\boldsymbol{x}_{\boldsymbol{1}}) \ \text{for}\ g, g^{\prime}\in \{g_{1}, g_{2}, g_{3}\}, d=1,3,$$

where

$$\left\{\begin{array}{lll} g_{1}(\boldsymbol{x}_{\boldsymbol{0}})=x_{0}+2x_{1},\\ g_{2}(\boldsymbol{x}_{\boldsymbol{0}})=2x_{0}+x_{1},\\ g_{3}(\boldsymbol{x}_{\boldsymbol{0}})=2x_{0}x_{1}+x_{0}+3x_{1}. \end{array}\right.$$

Those 24 δ-quadratic terms form δQ(4,4). From Theorem 4, we obtain the function

$$f(\boldsymbol{x})=\sum\limits_{k=1}^{m-1}h_{k}(\boldsymbol{x}_{\boldsymbol{k-1}}, \boldsymbol{x}_{\boldsymbol{k}})+l(\boldsymbol{x}),$$

for hk(⋅,⋅) ∈ δQ(4,4) and l(x) ∈ δL(4,4).

Moreover, from Theorem 5, the corresponding function matrix is given by

$$\widetilde{\boldsymbol{M}}(\boldsymbol{x})=f(\boldsymbol{x})\cdot \boldsymbol{J}_{4}+A(\boldsymbol{x}_{\boldsymbol{0}})\cdot \boldsymbol{J}_{4}+\boldsymbol{J}_{4} \cdot B(\boldsymbol{x}_{\boldsymbol{m-1}}),$$

where the diagonal matrices A(x0),B(x0) are arbitrarily chosen from the following set:

$$\begin{array}{@{}rcl@{}} &&diag(0, 2x_{0}, 2x_{1}, 2x_{0}+2x_{1}),\\ &&diag(0, 2x_{0}+x_{1}, 2x_{1}, 2x_{0}+3x_{1}),\\&& diag(0, x_{0}+3x_{1}+2x_{0}x_{1}, 2x_{0}+2x_{1}, 3x_{0}+x_{1}+2x_{0}x_{1}). \end{array}$$

Since every row (or column) of \(\widetilde {\boldsymbol {M}}(\boldsymbol {x})\) forms a CAS, the arrays in the following any set

$$\left\{\begin{array}{llll} f(\boldsymbol{x}),\\ f(\boldsymbol{x})+2x_{0},\\ f(\boldsymbol{x})+2x_{1},\\ f(\boldsymbol{x})+2x_{0}+2x_{1}, \end{array}\right. \left\{\begin{array}{llll} f(\boldsymbol{x}),\\ f(\boldsymbol{x})+2x_{0}+x_{1},\\ f(\boldsymbol{x})+2x_{1},\\ f(\boldsymbol{x})+2x_{0}+3x_{1}, \end{array}\right. \text{or}\ \left\{\begin{array}{llll} f(\boldsymbol{x}),\\ f(\boldsymbol{x})+3x_{0}+x_{1}+2x_{0}x_{1},\\ f(\boldsymbol{x})+2x_{0}+2x_{1},\\ f(\boldsymbol{x})+x_{0}+3x_{1}+2x_{0}x_{1}, \end{array}\right.$$

form a CAS of size 4.

The constructions of complementary arrays of size 4 have been studied for long time along the direction of determining aperiodic correlation of the generalized Boolean functions. Using that method, the recursive formulae for quaternary complementary arrays of size 4 are all generalized from those for binary complementary arrays of size 4. However, for the binary case, there is only one BH matrix, namely, the WHT matrix. In fact, the constructions of the quaternary complementary arrays involving by δ-quadratic terms determined by the DFT matrices have not been reported before the use of the desired PU matrices proposed in [71].

5.4 Other recursive constructions

The recursive construction in Theorem 2 constructs desired PU matrices from lower dimensions to higher dimensions, which can be generalized in the following form.

Theorem 6

Let V(z1) and \(\boldsymbol {U}^{\{\beta \}}(\boldsymbol {z}_{\boldsymbol {2}})\) (\(0\le \beta <p^{n^{\prime }}\)) be desired PU matrices of order \(p^{n+n^{\prime }}\) and pn, respectively. Then

$$\begin{array}{@{}rcl@{}} \boldsymbol{M}(\boldsymbol{z}_{\boldsymbol{0}}, \boldsymbol{z}_{\boldsymbol{1}}, \boldsymbol{z}_{\boldsymbol{2}})= \boldsymbol{V}(\boldsymbol{z}_{1})\cdot \left[\begin{array}{cccc} \boldsymbol{D}(\boldsymbol{z}_{0})&0&\cdots&0\\ 0&\boldsymbol{D}(\boldsymbol{z}_{0})&\cdots&0\\ \vdots&\vdots&\ddots&\vdots\\ 0&0&\cdots&\boldsymbol{D}(\boldsymbol{z}_{0}) \end{array}\right]\cdot \left[\begin{array}{cccc} \boldsymbol{U}^{\{0\}}(\boldsymbol{z}_{\boldsymbol{2}})&0&\cdots&0\\ 0&\boldsymbol{U}^{\{1\}}(\boldsymbol{z}_{\boldsymbol{2}})&\cdots&0\\ \vdots&\vdots&\ddots&\vdots\\ 0&0&\cdots&\boldsymbol{U}^{\{2^{n^{\prime}}-1\}}(\boldsymbol{z}_{\boldsymbol{2}}) \end{array}\right] \end{array}$$

is a desired PU matrix. Moreover, the function matrix of M(z0,z1,z2) is given by

$$\begin{array}{@{}rcl@{}} \widetilde{\boldsymbol{M}}(\boldsymbol{x}_{\boldsymbol{0}}, \boldsymbol{x}_{\boldsymbol{1}}, \boldsymbol{x}_{\boldsymbol{2}})&=& \widetilde{\boldsymbol{V}}(\boldsymbol{x}_{1})\cdot \left[\begin{array}{cccc} \boldsymbol{{{\varDelta}}}(\boldsymbol{x}_{0})&0&\cdots&0\\ 0&\boldsymbol{{{\varDelta}}}(\boldsymbol{x}_{0})&\cdots&0\\ \vdots&\vdots&\ddots&\vdots\\ 0&0&\cdots&\boldsymbol{{{\varDelta}}}(\boldsymbol{x}_{0}) \end{array}\right]\cdot \left[\begin{array}{cccc} \boldsymbol{J}&0&\cdots&0\\ 0&\boldsymbol{J}&\cdots&0\\ \vdots&\vdots&\ddots&\vdots\\ 0&0&\cdots&\boldsymbol{J} \end{array}\right]\\ &&+ \boldsymbol{J}_{p^{n+n^{\prime}}}\cdot \left[\begin{array}{cccc} \boldsymbol{{{\varDelta}}}(\boldsymbol{x}_{0})&0&\cdots&0\\ 0&\boldsymbol{{{\varDelta}}}(\boldsymbol{x}_{0})&\cdots&0\\ \vdots&\vdots&\ddots&\vdots\\ 0&0&\cdots&\boldsymbol{{{\varDelta}}}(\boldsymbol{x}_{0}) \end{array}\right]\cdot \left[\begin{array}{cccc} \widetilde{\boldsymbol{U}}^{\{0\}}(\boldsymbol{x}_{2})&0&\cdots&0\\ 0&\widetilde{\boldsymbol{U}}^{\{1\}}(\boldsymbol{x}_{2})&\cdots&0\\ \vdots&\vdots&\ddots&\vdots\\ 0&0&\cdots&\widetilde{\boldsymbol{U}}^{\{2^{n^{\prime}}-1\}}(\boldsymbol{x}_{2}) \end{array}\right] \end{array}$$

Proof

For the case p = 2, it has been shown in [71, Theorem 11] that M(z0,z1,z2) is a desired PU matrix. It is straightforward to show that this result is valid for arbitrary integer p.

In addition, It is not difficult to verify that the matrix form of \(\widetilde {\boldsymbol {M}}(\boldsymbol {x}_{\boldsymbol {0}}, \boldsymbol {x}_{\boldsymbol {1}}, \boldsymbol {x}_{\boldsymbol {2}})\) here is equivalent to the function from of entries in \(\widetilde {\boldsymbol {M}}(\boldsymbol {x}_{\boldsymbol {0}}, \boldsymbol {x}_{\boldsymbol {1}}, \boldsymbol {x}_{\boldsymbol {2}})\) in [71, Theorem 11]. □

Furthermore, we can construct desired PU matrices from lower dimensions and smaller orders to higher dimensions and larger orders as follows.

Theorem 7

Let \(\boldsymbol {U}^{\{j\}}(\boldsymbol {z}_{\boldsymbol {0}})\) (0 ≤ j < pn) and \(\boldsymbol {V}^{\{\alpha \}}(\boldsymbol {z}_{\boldsymbol {1}})\) (\(0\le \alpha <p^{n^{\prime }}\)) be desired PU matrices of order \(p^{n^{\prime }}\) and pn, respectively. And suppose that P is a permutation matrix of order \(p^{n+n^{\prime }}\) with each entry Pu,v = 1 if and only if \(v\equiv p^{n^{\prime }}u\) (mod \(p^{n+n^{\prime }}-1\)), \(\boldsymbol {U}^{\{j\}}(\boldsymbol {z}_{\boldsymbol {0}})\) (0 ≤ j < pn). Then

$$\boldsymbol{G}(\boldsymbol{z}_{\boldsymbol{0}}, \boldsymbol{z}_{\boldsymbol{1}})= \left[\begin{array}{cccc} \boldsymbol{V}^{\{0\}}(\boldsymbol{z}_{\boldsymbol{1}})&0&\cdots&0\\ 0&\boldsymbol{V}^{\{1\}}(\boldsymbol{z}_{\boldsymbol{1}})&\cdots&0\\ \vdots&\vdots&\ddots&\vdots\\ 0&0&\cdots&\boldsymbol{V}^{\{2^{n^{\prime}}-1\}}(\boldsymbol{z}_{\boldsymbol{1}}) \end{array}\right] \boldsymbol{P} \left[\begin{array}{cccc} \boldsymbol{U}^{\{0\}}(\boldsymbol{z}_{\boldsymbol{0}})&0&\cdots&0\\ 0&\boldsymbol{U}^{\{1\}}(\boldsymbol{z}_{\boldsymbol{0}})&\cdots&0\\ \vdots&\vdots&\ddots&\vdots\\ 0&0&\cdots&\boldsymbol{U}^{\{2^{n}-1\}}(\boldsymbol{z}_{\boldsymbol{0}}) \end{array}\right] \boldsymbol{P}^{T}.$$

is a desired PU matrix of order \(p^{n+n^{\prime }}\). Moreover, the function matrix of G(z0,z1) is given by

$$\begin{array}{@{}rcl@{}} \boldsymbol{G}(\boldsymbol{x}_{\boldsymbol{0}}, \boldsymbol{x}_{\boldsymbol{1}})&=& \left[\begin{array}{cccc} \widetilde{\boldsymbol{V}}^{\{0\}}(\boldsymbol{x}_{\boldsymbol{1}})&0&\cdots&0\\ 0&\widetilde{\boldsymbol{V}}^{\{1\}}(\boldsymbol{x}_{\boldsymbol{1}})&\cdots&0\\ \vdots&\vdots&\ddots&\vdots\\ 0&0&\cdots&\widetilde{\boldsymbol{V}}^{\{2^{n^{\prime}}-1\}}(\boldsymbol{x}_{\boldsymbol{1}}) \end{array}\right] \boldsymbol{P} \left[\begin{array}{cccc} \boldsymbol{J}&0&\cdots&0\\ 0&\boldsymbol{J}&\cdots&0\\ \vdots&\vdots&\ddots&\vdots\\ 0&0&\cdots&\boldsymbol{J} \end{array}\right]\boldsymbol{P}^{T} \\ &&+ \left[\begin{array}{cccc} \boldsymbol{J}&0&\cdots&0\\ 0&\boldsymbol{J}&\cdots&0\\ \vdots&\vdots&\ddots&\vdots\\ 0&0&\cdots&\boldsymbol{J} \end{array}\right] \boldsymbol{P} \left[\begin{array}{cccc} \widetilde{\boldsymbol{U}}^{\{0\}}(\boldsymbol{x}_{\boldsymbol{0}})&0&\cdots&0\\ 0&\widetilde{\boldsymbol{U}}^{\{1\}}(\boldsymbol{x}_{\boldsymbol{0}})&\cdots&0\\ \vdots&\vdots&\ddots&\vdots\\ 0&0&\cdots&\widetilde{\boldsymbol{U}}^{\{2^{n}-1\}}(\boldsymbol{x}_{\boldsymbol{0}}) \end{array}\right]\boldsymbol{P}^{T} \end{array}$$

Proof

For the case p = 2, it has been shown in [71, Theorem 10] that G(z0,z1) is a desired PU matrix. We can extend it to arbitrary integer p straightforwardly. Thus the equivalence of the matrix form G(x0,x1) and the function form of the entries in G(x0,x1) in [71, Theorem 10] follows immediately.

It has been shown in [71], the CSSs, constructed in [50, 56], and CCCs, constructed in [52] can be obtained by the above recursive formulae only involving the delay matrices and WHT matrices of order 2. Furthermore, the work in [71] has shown that all the known constructions of CSSs and CCCs [10, 11, 50, 52, 56, 73] (except non-standard Golay pair) can be obtained by these desired PU matrices involving only the delay matrices and WHT matrices of order 2 as well.

5.5 Discussions

Up to equivalence, for the binary case [30], there is a unique Hadamard matrix of orders 1,2,4,8, and 12. There are 5 inequivalent matrices of order 16, 3 of order 20, 60 of order 24, and 487 of order 28. Millions of inequivalent matrices are known for orders 32,36, and 40. And for quaternary case [63], there are 15 inequivalent BH matrices in H(4,8), and 319 inequivalent BH matrices in H(4,12). Combining with the results that inequivalent BH matrices yield inequivalent δ-quadratic terms with respect to the δ-linear terms, this makes it very interesting that the functions derived from desired PU matrices involving higher order (N > 2) BH matrices. This may significantly increase the number of the sequences with low PMEPR.

A recent process in [69] showed the CCAs and CASs extracted from the PU matrices with form (28) involving in DFT matrices, WHT matrices, BH matrices constructed from 2-level autocorrelation sequences, and BH matrices constructed from bent functions. This connects the research of CCAs and CASs with other separate objects in the literature, such as 2-level autocorrelation sequences, perfect sequences, bent functions and permutation polynomials over finite fields. From Proposition 2 in Section 2, we know that the value of the periodic correlation at shift τ can be determined by the sum of the aperiodic correlation at shifts τ and τL where L is the sequence length. It is surprising that the CCCs and CSSs defined by aperiodic correlation can be constructed by 2-level autocorrelation sequences or perfect sequences, which are defined by periodic correlation.

From the discussions in [71] and this section, the method to construct CSSs and CCCs by desired PU matrices explains all the known results in this area, except for the sporadic cases and produce tremendously many new constructions as well, which presents a very compelling tool.

If a desired PU matrix cannot be obtained by another desired PU matrix of higher dimension, we call it a primitive desired PU matrix. Currently, we only know two types of primitive desired PU matrices: one is of BH matrices and the other is of PU matrices constructed by non-standard Golay complementary pairs. We can easily show that all the known Golay complementary pairs can be constructed by these two types of primitive desired PU matrices and the recursive formula shown in Theorem 2.

It is natural to ask the following questions. Are there any other primitive desired PU matrices of order 2 or any higher order? Are there any other recursive formulae of desired PU matrices for producing new CASs?

6 Concluding remarks and open problems

In this survey, we have presented the current status of 2-level autocorrelation sequences, the sequence sets with low ambiguity and CSSs and CCCs. We have exhibited several unsolved problems or some inspiring remarks throughout the context. In the following, we summarize those problems as follows.

  1. 1.

    Sequences with 2-level autocorrelation:

    1. (a)

      Can we classify all sequences with 2-level autocorrelation? Can we confirm that any 2-level autocorrelation sequence is (multiplexing) Hadamard equivalent to an m-sequence? At least, for binary sequences, can we classify for n = 12?

    2. (b)

      Prove the observation of Gong-Helleseth on the linear span of Ludkovski-Gong conjectured ternary sequences of type D (or with some modification, since the experiments were done only up to n = 15).

    3. (c)

      Find new primary constructions (i.e., not composited constructions) for 2-level autocorrelation sequences.

    4. (d)

      Determine some 2-level autocorrelation sequences with good aperiodic autocorrelation, say the values of aperiodic autocorrelation are bounded by the square root of the period with some constant multiplier.

  2. 2.

    Sequence sets with low ambiguity:

    1. (a)

      Construct more time-phase shift distinct sequence sets with the same ambiguity and DFT bounds in Constructions 1-4, but with larger sizes, say the sizes are in the cubic order of the period in Constructions 1-3 and quadratic-order of the period in Construction 4. If we extend the elements of a sequence to complex values, can the sizes of sequence sets be increased greater than cubic order of the period? If so, find constructions.

    2. (b)

      Can we have the functions associated with additive characters with higher degrees than those in Constructions 1-3, but the bounds on ambiguity and DFT remain unchanged, also with good aperiodic correlation?

    3. (c)

      Open problems proposed by Gong presented in Sections 7.2 and 7.3 in [23], which are simplified and rephrased as follows according to the definitions in Sections 4.1 and 4.2.

      Problem 1. Find sequences constructed from multiplicative characters over \({\mathbb {F}}_{p^{n}}\) which have the same autocorrelation as the power residue sequences or Sidel’nikov sequences (i.e., analogue to 2-level autocorrelation sequences), and constructions for sequence sets based on those sequences with good ambiguity and large sizes.

      Problem 2. Note that the elements of a sequence in Sections 3 and 4 are ordered in terms of the multiplicative group of \(\mathbb {F}_{p^{n}}\) whenever an additive character or a multiplicative character is used, while the elements of a complementary sequence in Section 5 are ordered in terms of the additive group of \(\mathbb {F}_{p^{n}}\). So the open question would be to find aperiodic autocorrelation of a sequence ψ(f(t)),t = 0,1,⋯ ,pn − 1 where ψ(f(αt)) gives a 2-level autocorrelation, and reversely, find periodic autocorrelation of a sequence f(αt) where f(t) gives a complementary sequence. Find the constructions of sequence sets based on those sequences with good ambiguity and large sizes.

  3. 3.

    PU matrices based constructions on CSSs and CCCs:

    1. (a)

      Find new primitive desired PU matrices.

    2. (b)

      Explore other recursive formulae for desired PU matrices for producing new CASs.