<table>
<thead>
<tr>
<th><strong>Title</strong></th>
<th>Time-Interleaved Analog-to-Digital Converter (TIADC) Compensation Using Multichannel Filters</th>
</tr>
</thead>
<tbody>
<tr>
<td><strong>Author(s)</strong></td>
<td>Lim, YC; Zou, YX; Chan, JW; Chan, SC</td>
</tr>
<tr>
<td><strong>Citation</strong></td>
<td>IEEE Transactions on Circuits and Systems Part 1: Regular Papers, 2009, v. 56 n. 10, p. 2234-2247</td>
</tr>
<tr>
<td><strong>Issued Date</strong></td>
<td>2009</td>
</tr>
<tr>
<td><strong>URL</strong></td>
<td><a href="http://hdl.handle.net/10722/73667">http://hdl.handle.net/10722/73667</a></td>
</tr>
<tr>
<td><strong>Rights</strong></td>
<td>IEEE Transactions on Circuits and Systems Part 1: Regular Papers. Copyright © IEEE.; ©2009 IEEE. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE.; This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.</td>
</tr>
</tbody>
</table>
In many signal processing applications, the highest sampling speed is limited by the speed of the analog-to-digital converter (ADC). In order to achieve an analog-to-digital (AD) conversion time that is much shorter than what can be achieved with a single ADC, a bank of properly sequenced sub-ADCs is used. Each sub-ADC samples and converts at a rate much higher than the speed of each sub-ADC. For the same number of coefficients, the optimization of the coefficients reduces the aliasing artifacts. Each sub-ADC may also have a different dc offset and gain mismatch. Each sub-ADC may also have a different dc offset leading to an offset mismatch. Detailed discussions on offset and gain mismatches may be found in [7]–[13]. In Fig. 1, ADC-m is driven by clock signal CLK-m, m = 0, 1, 2, ..., M − 1, generated from a stable system clock CLK-S. Let the frequency of CLK-S be f_s. The sampling interval for the TIADC system is T, given by

$$T = 1/f_s.$$  

Table I shows a list of the symbols for easy reference.

In Fig. 1, each of the M sub-ADCs of the TIADC system is assumed to be linear but a different sub-ADC may have a slightly different constant of proportionality leading to a “gain mismatch” and a different bandwidth leading to a bandwidth mismatch. Each sub-ADC may also have a different dc offset leading to an offset mismatch. Detailed discussions on offset and gain mismatches may be found in [7]–[13]. In Fig. 1, ADC-m is driven by clock signal CLK-m, m = 0, 1, 2, ..., M − 1, generated from a stable system clock CLK-S. Let the frequency of CLK-S be f_s. The sampling interval for the TIADC system is T, given by

$$T = 1/f_s.$$  

We shall assume that the highest frequency component of the input signal is band limited to 0.5f_s.

Ideally, the sampling instant of each sub-ADC is spaced apart by T unit time. The conversion speed of each sub-ADC is f_s/M but the TIADC system conversion speed is f_s. The circuits generating CLK-m, m = 0, 1, 2, ..., M − 1, (including routing) introduce delay between the edges of CLK-S and CLK-m; the delay may be different for different clock signals and may drift with temperature. The difference in delay contributes to timing mismatch.

Many techniques have been published in literature for compensating bandwidth and timing mismatch [14]–[28]. Among these techniques, [14]–[25] use a bank of filters where each filter of the filter bank filters the input signal of each sub-ADC; the outputs of the filters are summed to form the final output, as shown in Fig. 2. A single-channel filter is a filter with one input port. Each filter in the filter bank is a single-channel filter producing output at a rate of f_s; the compensated output is obtained by summing all the outputs of the filters. In Fig. 2, H_{m}(\omega) represents the

---

**Abstract**—Published methods that employ a filter bank for compensating the timing and bandwidth mismatches of an M-channel time-interleaved analog-to-digital converter (TIADC) were developed based on the fact that each sub-ADC channel is a downsampled version of the analog input. The output of each sub-ADC is filtered in such a way that, when all the filter outputs are summed, the aliased components are minimized. If each channel of the filter bank has N coefficients, the optimization of the coefficients requires computing the inverse of an MN × MN matrix if the weighted least squares (WLS) technique is used as the optimization tool. In this paper, we present a multichannel filtering approach for TIADC mismatch compensation. We apply the generalized sampling theorem to directly estimate the ideal output of each sub-ADC using the outputs of all the sub-ADCs. If the WLS technique is used as the optimization tool, the dimension of the matrix to be inverted is MN × MN. The same spurious component performance given sufficient arithmetic precision, our technique is computationally less complex and more robust than the filter-bank approach. If mixed integer linear programming is used as the optimization tool to produce filters with coefficient values that are integer powers of two, our technique produces a saving in computing resources by a factor of approximately (10^{0.2(N(M−1))}/(M−1)) in the TIADC filter design.

**Index Terms**—Time-interleaved analog-to-digital converter (TIADC), TIADC mismatch compensation, multichannel filter.

---

**I. INTRODUCTION**

---

Yong Ching Lim, Fellow, IEEE, Yue-Xian Zou, Senior Member, IEEE, Jun Wei Lee, Member, IEEE, and Shing-Chow Chan, Senior Member, IEEE
frequency response of the \( m \)th sub-ADC channel. It includes the frequency responses of the sub-ADC, the waveguide transmitting the signal to the sub-ADC, and the photonic front end if the analog signal is sampled using a laser. It also includes timing mismatch, i.e., if the timing mismatch is \( \Delta T \), \( e^{j\omega \Delta T} \) will be a factor in \( H_{\text{ADC}}(\omega) \). It does not include dc bias, harmonic distortion, and noise. The \( e^{j\omega \Delta T} \) term in Fig. 2 represents the sampling clock displacement between sub-ADCs.

The detailed development for the methods reported in [14]–[25] may differ from one another. However, the basic principles are the same: 1) TIADC is a collection of many channels of under-sampled analog input, and 2) the goal is to design a filter bank in which each filter shapes the spectral characteristics of each sub-ADC output in such a way that, when all the filter outputs are summed, the aliasing terms are removed as much as possible and the “signal term” is as close as possible to a linear phase term. In order to facilitate comparison between our technique and existing techniques, we shall briefly describe the technique presented in [18].

The frequency spectrum of the compensated TIADC output is \( \tilde{Y}(f) \), where \( f = \omega/(2\pi) \). It is given in terms of the frequency spectrum of the analog input \( X(f) \) by (after [18, eq. (6), (7)])

\[
\tilde{Y}(f) = \sum \tilde{c}_m \left( f - \frac{m}{M} f_s \right) X \left( f - \frac{m}{M} f_s \right)
\]

where

\[
\tilde{c}_m(f) = \frac{1}{M} \sum_{k=0}^{M-1} \sum_{n=0}^{N-1} e^{-j2\pi k \frac{m}{M}} e^{-j2\pi n \frac{f}{f_s}} H_{A,k}(f) \tilde{h}_k(n)
\]

In (3), \( \tilde{h}_k(n) \) is the \( k \)th impulse response of FILTER-\( k \). The impulse response \( \tilde{h}_k(n) \) is optimized so that the aliasing gain \( \tilde{c}_m(f) \) for \( m \neq 0 \) is as close as possible to zero and \( \tilde{c}_m(f) \) for \( m = 0 \) is as close as possible to a linear phase term. The weighted least squares (WLS) solution is given by (after [18, eq. (10)])

\[
\tilde{h}_{\text{WLS}} = (A^HWA)^{-1}A^HW D.
\]

The derivation for (4) can be found in [18]. Let \( L \) be the number of evaluation frequencies and \( N \) be the length of FILTER-\( k \). The matrix \( A \) is \( LM \times NM \), \( W \) is \( LM \times LM \), \( D \) is \( NM \times 1 \), and \( D \) is \( LM \times 1 \) (after [18]). The solution requires the inversion of \( A^HWA \), which is an \( NM \times NM \) matrix.

In Section II, we present a new approach with reduced computational complexity in the design process. Our approach is a multichannel filtering approach. A multichannel filter is a filter with more than one input port. In our approach, each filter is an \( M \)–channel filter producing output at a rate of \( f_s/M \); the compensated output is obtained by selecting the outputs of the filters one at a time. The derivation of the coefficient values of the multichannel filter is cast into a WLS optimization problem in Section III. In our approach, the corresponding matrices in (4) have dimensions \( L \times N \), \( L \times L_n \times N \times 1 \), and \( L \times 1 \) respectively (instead of \( LM \times NM \), \( LM \times LM \), \( NM \times 1 \), and \( LM \times 1 \)). The corresponding matrix to be inverted has a dimension of \( N \times N \) (instead of \( NM \times NM \)). As can be seen in the computational complexity analysis presented in Section IV, the computational complexity of our approach is much lower than that of the filter-bank approach. As a result of the much smaller matrices, our approach is also numerically more robust. Another advantage of our approach is that the filters need not have the same number of coefficients. The optimization problem is cast into a mixed integer linear programming (MILP) problem in Section VII for the design of TIADC compensation filters whose coefficients are integer powers of two. An actual 14-bit 4 × 400 MS/s (megasamples per second) TIADC implemented with our technique is presented in Section VIII. A summary of the differences between our approach and the filter-bank approach and the advantages of our approach over the filter-bank approach is presented in Section IX.

Several examples are presented in Sections V–VII. In the examples, offset mismatch is not considered. Offset mismatch cannot be compensated by the filter-bank approach nor the
multichannel approach. Nevertheless, it can be easily estimated from the sub-ADC outputs with their inputs grounded. A constant bias may then be added to each sub-ADC output to remove the offset. Both the filter-bank and multichannel approaches are able to compensate for gain mismatch although they are not included in the examples presented in Sections V–VII. Gain mismatch can also be compensated easily by connecting all the sub-ADCs to a full-scale dc input and then scaling each of the sub-ADC outputs using a scaling amplifier such that all sub-ADCs give the same full-scale output. In Example-I and Example-IV, both timing and bandwidth mismatches are considered. In Example-II, only bandwidth mismatch is considered. In Example-III only timing mismatch is considered. Not both timing and bandwidth mismatches are considered in Example-II and Example-III simply to provide a variety of examples; timing and bandwidth mismatches can be added if desired. Example-V is an actual TIADC example with offset, gain, timing, bandwidth, and frequency-response mismatches.

Closely related to the techniques reported in [14]–[28] is another class of multiple sub-ADC systems which uses a bank of analog filters to split the input signal in the frequency domain into $M$ channels [29]–[31]. Each sub-ADC digitizes the output of each frequency channel at $1/M$ the TIADC sampling rate. A bank of digital filters is designed to combine all the frequency interleaved sub-ADC outputs to form the final system output in such a way that the aliasing terms are minimized and the output is a close approximation to a delayed version of the input.

Several authors have proposed blind equalization techniques [9], [32]–[36]. Blind techniques do not require using a known signal to estimate the mismatch parameters. In [9], the gain and timing mismatches are estimated using the fact that, if the mismatch compensation is perfect, the TIADC output would be wide sense stationary. When compared to offline calibration techniques, blind calibration techniques are, in general, less accurate and have much higher computational complexity.

II. TIADC MISMATCH COMPENSATION WITH REDUCED DESIGN COMPLEXITY

In this section, we present a new approach to TIADC compensation. This approach results in a reduction in the arithmetic complexity in the design process. In our approach, we make use of the fact that neighboring sub-ADCs sample a filtered version of the analog input signal nonuniformly at a time interval whose average value is $T$. Fig. 3 shows an example of such a scenario for $M = 3$. In Fig. 3, $x_m(n)$ is the $n$th output sample of ADC-$m$, and $v(n)$ is the $n$th sample of the analog input if it was sampled by an ideal ADC. It has been shown in [37] that a signal may be represented by a weighted combination of any irregularly spaced samples provided that the highest frequency component of the signal is less than 0.5 times the average sampling rate. (The result presented in [37] is known as the generalized sampling theorem. It is the general form of the sampling theorem in [38]–[40].) Let $\hat{x}_m(n)$ be the ideal value of $x_m(n)$. The ideal value of $x_m(n)$ is the value of $x_m(n)$ if ADC-$m$ is a perfect ADC, free from timing error and free from all forms of mismatch. Thus

$$v(nM + m) \approx \hat{x}_m(n).$$  (5)

Our approach is to estimate the value of $v(n)$ using a weighted combination of the outputs from all the sub-ADCs sampled consecutively in the neighborhood of $v(n)$. The weights also provide a filtering effect to compensate for frequency-response mismatch (including bandwidth mismatch) between the sub-ADCs. For example, $v(5)$ (or $\hat{x}_2(1)$) in Fig. 3 may be estimated using the seven consecutive samples $x_0(0), x_0(1), x_1(1), x_2(1), x_0(2), x_1(2), x_2(2)$; a better estimate may be obtained by using more consecutive samples.

As a result of the sampling structure of the sub-ADCs, the sampled data appear in $M$ channels of a multichannel time series [41]. Thus, we may write “estimated value of $\hat{x}_2(1)$ is $h_2(0)x_2(2) + h_2(1)x_1(2) + h_2(2)x_0(2) + h_2(3)x_2(1) + h_2(4)x_1(1) + h_2(5)x_0(1) + h_2(6)x_0(0)$ where $h_2(n), n = 0, 1, \ldots, 6$, is a constant. We may also write “estimated value of $\hat{x}_2(1)$ is $h_2(0)x_2(2) + h_2(1)x_1(2) + h_2(2)x_0(2) + h_2(3)x_2(1) + h_2(4)x_1(1) + h_2(5)x_0(1) + h_2(6)x_0(0)$ where $h_{m,k}(n)$ means the $n$th partial coefficient in estimating the ideal outputs of ADC-$m$ using outputs from ADC-$k$. Obviously, $h_{2,2}(0) = h_2(0), h_{2,1}(0) = h_2(1), \ldots$. The equation defines the input–output relationship of a multichannel filter with three input ports. We note from the equation that $x_2(2)$ is used to estimate $x_2(1)$. However, $x_2(2)$ is “future” to $x_2(1). Thus, in order to make the estimation process causal, we shall aim at estimating a past output of a sub-ADC instead of estimating its immediate output. Let $\hat{x}_m(n)$ denote the estimated value of $x_m(n - \eta)$ where $\eta$ represents a time delay. In general, $\eta$ need not be an integer. In order to simplify the notation, we shall write $\hat{x}_m(\eta)$ instead of “estimated value of $\hat{x}_m(n - \eta)$.” Thus, we write

$$\hat{x}_m(n) = [\mathbf{x}_m(n)]^T \mathbf{h}_m.$$  (6)

In (6), $\mathbf{x}_m(n)$ is the signal vector in the filter, and $\mathbf{h}_m$ is the coefficient vector. We shall illustrate the notation using $\hat{x}_2(2)$ as an example. In the aforementioned example, $\eta = 1$. $\hat{x}_2(2)$ is the estimated value of $\hat{x}_2(1), [\mathbf{x}_2(2)]^T = [x_2(2), x_1(2), x_0(2), x_2(1), x_1(1), x_0(1), x_2(0)]$, and $\mathbf{h}_2 = [h_{2,2}(0), h_{2,1}(0), h_{2,0}(0), h_{2,2}(1), h_{2,1}(1), h_{2,0}(1), h_{2,2}(2)]^T$. The symbol $[\mathbf{x}]^T$ denotes the transpose of $\mathbf{x}$. In order to fix the idea, the computation structure for $\hat{x}_2(2)$ for this particular example is shown in Fig. 4. Note that, in Fig. 4, the sub-ADC outputs $x_0(0), x_1(1), x_2(2)$ form a multichannel time series [41]. The filter shown in Fig. 4 is a multichannel filter since it has three input ports.
Let \( N_m \) denote the number of coefficients in \( h_m \). In general, \( N_i \) may be different from \( N_j \) for \( i \neq j \). If \( \text{ADC}_i \) has a more serious mismatch than \( \text{ADC}_j \), \( N_j \) may be chosen to be larger than \( N_i \). In order to facilitate a comparison with the filter-bank approach, we shall let \( N = N_i = N_j \) for all \( i \neq j \), i.e., all \( N_m \)'s are equal, if no specific value is given to each \( N_m \). Therefore, we have used the same notation \( * \) to denote the number of coefficients for each output channel in our multichannel approach and the number of coefficients for each filter in the filter-bank approach. For the same \( M \) and \( N \), both our multichannel approach and the filter-bank approach will have the same total number of coefficients \( MN \). Hence, we shall make a comparison based on the same total number of coefficients.

Let \( \hat{x}_m(z) \) denote the \( z \) transform of \( \hat{x}_m(n) \). The \( z \) factor in \( \tilde{x}_m(z) \) signifies that \( \hat{x}_m(n) \) is clocked with sampling interval \( MT \). We have

\[
\tilde{x}_m(z) = \sum_{n=0}^{\infty} \hat{x}_m(n) z^{-MN}.
\]

(7)

Let \( \tilde{x}_m(z) \) and \( H_m(z) \) denote the multichannel \( z \) transforms of \( \tilde{x}_m(n) \) and \( h_m(n) \), respectively. Similarly, the \( z \) factor in \( \tilde{x}_m(z) \) and \( H_m(z) \) signify that the clock period is \( MT \). We have

\[
\tilde{x}_m(z) = \sum_{n=0}^{\infty} [x_{M-1}(n), \ldots, x_1(n), x_0(n)]^T z^{-MN},
\]

(8)

\[
H_m(z) = \sum_{n=0}^{\infty} [h_{M-1}(n), \ldots, h_1(n), h_0(n)]^T z^{-MN},
\]

(9)

In (9), the summation \( \sum_{n=0}^{\infty} \) is summed over all \( n \), where at least one of the elements in \( [h_{M-1}(n), \ldots, h_1(n), h_0(n)] \) is nonzero. (The \( z \) transform of a multichannel system is the matrix/vector formed from the \( z \) transform of each of the separate responses [41].) For the purpose of clarifying the concept, in the aforementioned example, \( [\tilde{x}_m(z)]_{i} = \sum_{n=0}^{\infty} [x_{2}(n), x_1(n), x_0(n)] z^{-3n} \) and \( H_m(z)_{i} = \sum_{n=0}^{\infty} [h_{2}(n), h_1(n), h_0(n)] z^{-3n} \).

Taking the \( z \) transform of (6), we have

\[
\tilde{x}_m(z) = [\tilde{x}_m(z)]^{\top} H_m(z).
\]

(10)

Fig. 5 shows a schematic of our approach for TIADC compensation.

Fig. 6 shows an implementation structure for \( H_2(z) \) with \( M = 5 \) and \( N_2 = 15 \). Figs. 7 and 8 show two complete implementation structures for the case where \( M = 3 \) and \( N = 4 \). The differences in Figs. 7 and 8 lead to differences in the complexity of the optimization process and in the mismatch compensation performance. In Fig. 7, \( \tilde{x}_0(n) = h_{0,0}(0)x_2(n) + h_{0,0}(0)x_1(n) + h_{0,0}(0)x_0(n) + h_{1,0}(0)x_0(n)+h_{1,0}(1)x_0(n) \). \( \tilde{x}_1(n) = h_{1,0}(0)x_2(n) + h_{1,0}(0)x_1(n) + h_{1,0}(0)x_0(n) + h_{1,0}(1)x_1(n) + h_{1,0}(0)x_0(n) \). \( \tilde{x}_2(n) = h_{2,0}(0)x_2(n) + h_{2,0}(1)x_1(n) + h_{2,0}(0)x_0(n) + h_{2,0}(1)x_0(n) \). Specifically, \( \tilde{x}_0(n) \), \( \tilde{x}_1(n) \), and \( \tilde{x}_2(n) \) are computed in one batch using the same set of raw data \( x_2(n) \), \( x_1(n) \), \( x_0(n) \), and \( x_0(n-1) \). We shall call the structure in Fig. 7 the batch processing structure.

In Fig. 8, \( \tilde{x}_0(n) = h_{0,0}(0)x_2(n) + h_{0,0}(1)x_2(n-1) + h_{0,0}(1)x_0(n) + h_{0,0}(1)x_0(n-1) \). \( \tilde{x}_1(n) = h_{1,0}(0)x_2(n) + h_{1,0}(1)x_2(n-1) + h_{1,0}(0)x_0(n) + h_{1,0}(1)x_0(n) + h_{1,0}(1)x_0(n-1) + h_{1,0}(1)x_0(n-1) \). \( \tilde{x}_2(n) = h_{2,0}(0)x_2(n) + h_{2,0}(1)x_0(n) + h_{2,0}(0)x_0(n) + h_{2,0}(1)x_0(n) \).

The raw data used to compute \( \tilde{x}_0(n) \), \( \tilde{x}_1(n) \), and \( \tilde{x}_2(n) \) are \( x_2(n) \), \( x_1(n) \), \( x_0(n) \), \( x_0(n-1) \), and \( x_0(n-1) \). Since \( x_1(n) \), \( x_0(n) \), and \( x_0(n) \) are available sequentially one after the other, \( \tilde{x}_0(n) \), \( \tilde{x}_1(n) \), and \( \tilde{x}_2(n) \) are computed sequentially one after the other, i.e., the computation of \( \tilde{x}_2(n) \), \( \tilde{x}_1(n) \), and \( \tilde{x}_0(n) \) can be initiated once their respective data are available. We shall call the structure in Fig. 8 the online processing structure.

For the same number of coefficients, in general, Fig. 8 computes \( \tilde{x}_j(n) \) using a longer span of raw data \( x_j(n) \). For example, to compute \( \tilde{x}_0(n) \), \( \tilde{x}_1(n) \), and \( \tilde{x}_2(n) \) in Fig. 8, the raw data used are \( x_2(n) \), \( x_1(n) \), \( x_0(n) \), \( x_2(n-1) \), \( x_1(n-1) \), and \( x_0(n-1) \), whereas, in Fig. 7, the raw data used are \( x_2(n) \), \( x_1(n) \), \( x_0(n) \), and \( x_0(n-1) \). As a consequence, Fig. 8 has a better mismatch compensation performance than Fig. 7 when \( N \) is not significantly larger than \( M \).

We have shown that the online processing structure (Fig. 8) uses more raw data than the batch processing structure (Fig. 7) does for computing \( \tilde{x}_0(n) \), \( \tilde{x}_1(n) \), and \( \tilde{x}_2(n) \) if both the online processing and batch processing structures have the same number of coefficients. If the online processing structure is restricted to using the same number of data for computing
\( \hat{x}_0(n) \), \( \hat{x}_1(n) \), and \( \hat{x}_2(n) \) as the *batch processing* structure does, the *online processing* structure will have less coefficients. For example, if the *online processing* structure is restricted to using only \( x_2(n) \), \( x_1(n) \), \( x_0(n) \), and \( x_2(n-1) \) for computing \( \hat{x}_0(n) \), \( \hat{x}_1(n) \), and \( \hat{x}_2(n) \). \( N \) will be equal to two, and the structure is shown in Fig. 9. Fig. 9 has six coefficients but Fig. 7 has 12 coefficients. In this case, in general, Fig. 7 has a better mismatch compensation performance than Fig. 9. Nevertheless, the difference in performance is negligibly small if \( N \) is significantly larger than \( M \). We shall illustrate this by using an example in Section VI.

Let \( \tilde{X}_m(e^{jM\omega}) \) be the frequency spectrum of the signal vector whose \( z \) transform is \( \tilde{X}_m(z^M) \). Let \( V(\omega) \) be the frequency spectrum of the band-limited analog input signal shown in Fig. 5. From Fig. 5, we have

\[
\tilde{X}_m(e^{jM\omega}) = V(\omega) \begin{bmatrix} e^{j\omega(M-1)} & e^{j\omega} & \cdots & e^{j\omega M} \\ \vdots & \ddots & \ddots & \ddots \\ e^{j\omega(M-1)} & e^{j\omega} & \cdots & e^{j\omega M} \\ \end{bmatrix}^T.
\] (11a)

In order to simplify the notation, we shall normalize the sampling interval such that \( T = 1 \). The frequency axis for \( V(\omega) \), \( \tilde{X}_m(e^{jM\omega}) \), and \( H_{A_m}(\omega) \), \( m = 0, \ldots, M-1 \), will also be appropriately scaled by \( f_s \). Thus, we write

\[
\tilde{X}_m(e^{jM\omega}) = V(\omega) \begin{bmatrix} e^{j\omega(M-1)} & e^{j\omega} & \cdots & e^{j\omega M} \\ \vdots & \ddots & \ddots & \ddots \\ e^{j\omega(M-1)} & e^{j\omega} & \cdots & e^{j\omega M} \\ \end{bmatrix}^T.
\] (11b)

Let the frequency spectrum of \( \hat{x}_m(n) \) (whose \( z \) transform is \( \tilde{X}_m(z^M) \)) be \( \hat{X}_m(e^{jM\omega}) \) and let the frequency response of \( H_m(z^M) \) be \( H_m(e^{jM\omega}) \). Replacing \( z \) by \( e^{j\omega} \) in (10), we have

\[
\hat{X}_m(e^{jM\omega}) = \left[ \tilde{X}_m(e^{jM\omega}) \right]^T H_m(e^{jM\omega}).
\] (12)

From (11b) and (12), we have

\[
\hat{X}_m(e^{jM\omega}) = V(\omega) \begin{bmatrix} e^{j\omega(M-1)} & H_{A,M-1}(\omega), \cdots, H_{A,0}(\omega) \\ \end{bmatrix}
\times H_m(e^{jM\omega}).
\] (13)

We have defined \( \hat{x}_m(n) \) to be an estimate of a delayed version of \( v(n,M+n) \), i.e., \( \hat{x}_m(n-\eta) \). Thus, we wish to find a set of coefficients for \( H_m(e^{jM\omega}) \) such that

\[
\hat{X}_m(e^{jM\omega}) \approx V(\omega) e^{j\omega(M-Mn)}
\]

\[
\hat{X}_m(e^{jM\omega}) \approx V(\omega) e^{j\omega(M-Mn)} + E_m(e^{jM\omega}).
\] (14b)

In (14b), \( E_m(e^{jM\omega}) \) is the difference between \( \hat{X}_m(e^{jM\omega}) \) and \( V(\omega) e^{j\omega(M-Mn)} \). From (13) and (14b), we have

\[
E_m(e^{jM\omega}) = \hat{X}_m(e^{jM\omega}) - V(\omega) e^{j\omega(M-Mn)}
\]

\[
= V(\omega) \begin{bmatrix} e^{j\omega(M-1)} & H_{A,M-1}(\omega), \cdots, H_{A,0}(\omega) \\ \end{bmatrix}
\times H_m(e^{jM\omega}) - e^{j\omega(M-Mn)} \}.
\] (15)

In order to clarify the concept, we shall illustrate the term \( \{ e^{j\omega(M-1)} H_{A,M-1}(\omega), \cdots, H_{A,0}(\omega) \} H_m(e^{jM\omega}) \) using the example shown in Fig. 7 for \( m = 0 \). In this particular example, \( H_{0}(e^{j3\omega}) \approx [h_{0,0}(0), h_{0,1}(0), h_{0,0}(0)]^T \). Using the definition \( h_{0} = [h_{0,0}(0), h_{0,1}(0), h_{0,2}(1)]^T \), \( e^{j\omega} H_{A,2}(\omega), e^{j\omega} H_{A,1}(\omega), H_{A,0}(\omega) \) be written as \( e^{j\omega} H_{A,2}(\omega), e^{j\omega} H_{A,1}(\omega), H_{A,0}(\omega) \). Hence, (15) can be rewritten in terms of \( h_m \) as

\[
E_m(e^{jM\omega}) = V(\omega) \begin{bmatrix} e^{j\omega(M-1)} & H_{A,M-1}(\omega), \cdots, H_{A,0}(\omega) \\ \end{bmatrix}
\times H_m(e^{jM\omega}) - e^{j\omega(M-Mn)} \}.
\] (16)

In the example for Fig. 7, \( \{ e^{j\omega(M-1)} H_{A,2}(\omega), e^{j\omega} H_{A,1}(\omega), H_{A,0}(\omega) \} H_m(e^{jM\omega}) \)

### III. WLS Optimization Approach

In (16), the factor \( V(\omega) \) (which is the frequency spectrum of \( v(n) \)) corresponds to a weighting function on the optimization.
of $E_m(e^{jM\omega})$. The phase of $V(\omega)$ is unimportant but its magnitude may be of interest. For example, if the highest frequency component of the input signal is $\omega_c$, we may let $V(\omega) = 1$ for $\omega < \omega_c$ and $V(\omega) = 0$ for $\omega > \omega_c$. Let

$$w_{LS}(\omega) = |V(\omega)|^2.$$  

(17)

Substituting (17) into (16), we have

$$E_m(e^{jM\omega}) = (\xi_m(j\omega)^T h_m - e^{jM\omega} M \xi_m(j\omega)) \sqrt{w_{LS}(\omega)}.$$  

(18)

In the WLS approach, (18) is evaluated on a dense grid of frequencies $\omega_i$, $i = 1, 2, 3, \ldots, L$. Define the complex column vectors $E_m$ and $\Lambda_m$, the complex matrix $\Xi_m$, and the diagonal real matrix $W$ as follows:

$$E_m = [E_m(e^{jM\omega_1}), E_m(e^{jM\omega_2}), \ldots, E_m(e^{jM\omega_L})]^T$$  

(19a)

$$\Lambda_m = [e^{j(M-1)\omega_1}, e^{j(M-1)\omega_2}, \ldots, e^{j(M-1)\omega_L}]^T$$  

(19b)

$$\Xi_m = [\xi_m(j\omega_1), \xi_m(j\omega_2), \ldots, \xi_m(j\omega_L)]^T$$  

(19c)

$$W = \begin{bmatrix} 0 & 0 & \cdots & 0 \\ W_{LS}(\omega_2) & 0 & \cdots & 0 \\ \vdots & \vdots & \ddots & \vdots \\ 0 & 0 & \cdots & W_{LS}(\omega_L) \end{bmatrix}. $$  

(19d)

Evaluating (18) on a dense grid of $\omega_i$, $i = 1, 2, 3, \ldots, L$ and using the notation of (19), we have

$$E_m = \sqrt{W}(\Xi_m h_m - \Lambda_m).$$  

(20)

$E_m$ and $\Lambda_m$ are $L \times 1$ vectors. $\Xi_m$ and $W$ are $L \times N$ and $L \times L$ matrices, respectively. The WLS optimum solution of $h_m$ that minimizes $\sum_{l=1}^L |E_m(e^{jM\omega_l})|^2$ is given by [42]

$$h_m = (\Xi_m^H W \Xi_m)^{-1} (\Xi_m^H W \Lambda_m). $$  

(21)

In (21), $\Xi_m^H$ denotes the complex conjugate transpose of $\Xi_m$. Solving (21) requires the computation of the inverse of $\Xi_m^H W \Xi_m$, an $N \times N$ matrix. The filter-bank approach requires the inversion of an $MN \times MN$ matrix [18]. The difference in the size of the matrix to be inverted is significant when $M$ is large. Interestingly, the compensated digital outputs of the filter-bank approach and the multichannel approach are the same.

Note that, for the batch processing structure, $\xi_m(j\omega)$ is not a function of $m$. This can be easily verified by evaluating $[\xi_m(j\omega)]$ as in Fig. 7 for $m = 0, 1,$ and 2. For Fig. 7, $h_0 = [h_{0,0}(0), h_{0,1}(0), h_{0,2}(0), h_{0,2}(1)]^T$, $h_1 = [h_{1,0}(0), h_{1,1}(0), h_{1,2}(0), h_{1,2}(1)]^T$, and $h_2 = [h_{2,0}(0), h_{2,1}(0), h_{2,2}(0), h_{2,2}(1)]^T$. Thus, we have $\xi_0(j\omega)^T = [\xi_0(j\omega)]^T = [e^{j2\omega} H_{A,0}(\omega) e^{-j2\omega} H_{A,1}(\omega) H_{A,2}(\omega) H_{A,2}(\omega)]^T$. Since $\xi_m(j\omega)$ is not a function of $m$, $\Xi_m$ is also not a function of $m$. This means that $\Xi_m^H W \Xi_m$ is the same for all $m$. Thus, for the batch processing structure, evaluating $h_m$ for all $m$ using (21) requires only one matrix inversion.

For the online processing structure, $\xi_m(j\omega)$ is a function of $m$. This can be easily verified by evaluating $\xi_m(j\omega)$ as in Fig. 8 for $m = 0, 1,$ and 2. For Fig. 8, $h_0 = [h_{0,0}(0), h_{0,2}(1), h_{0,3}(1), h_{0,0}(1)]^T$, $h_1 = [h_{1,0}(0), h_{1,2}(1), h_{1,3}(1), h_{1,0}(1)]^T$, and $h_2 = [h_{2,0}(0), h_{2,2}(1), h_{2,3}(1), h_{2,0}(1)]^T$. Thus, we have $\xi_0(j\omega)^T = [\xi_0(j\omega)]^T = [e^{j2\omega} H_{A,0}(\omega) e^{-j2\omega} H_{A,1}(\omega) H_{A,2}(\omega) H_{A,2}(\omega)]^T$. Since $\xi_m(j\omega)$ is not a function of $m$, $\Xi_m$ is also not a function of $m$. This means that $\Xi_m^H W \Xi_m$ is the same for all $m$. Thus, for the batch processing structure, evaluating $h_m$ for all $m$ using (21) requires only one matrix inversion.

IV. COMPUTATIONAL COMPLEXITY OF FILTER DESIGN

We shall compare the computational complexity for evaluating (21) in our approach and that for evaluating (4) in the filter-bank approach. The comparison is based on the same total number of coefficients $N M$. Simulation results show that the two approaches gives the same TIADC spurious frequency performance when both (21) and (4) are evaluated with sufficient arithmetic precision.

In (21), $\Xi_m$ is $L \times N$. The corresponding matrix in (4) is $A$ which is $LM \times NM$. The computation of $\Xi_m^H W \Xi_m$ in (21) requires $2(LN^2 + LN)$ multiplications. $\Xi_m^H W \Lambda_m$ in our multichannel approach corresponds to $A^H W A$ in the filter-bank approach. The computation of the $A^H W A$ matrix in (4) requires $2(LN^2M + LN^2 M^2)$ multiplications. The computation of $\Xi_m^H W \Lambda_m$ in (21) requires $2(LN + L)$ multiplications. The corresponding computation in the filter-bank approach is the computation of $A^H W D$ in (4) and requires $2(LN M^2 + LN M^2)$ multiplications. $\Xi_m^H W \Xi_m$ in (21) is an $N \times N$ matrix whose inverse can be computed with complexity $O(N^3)$ (order $N^2$). The $A^H W A$ matrix in (4) is an $NM \times NM$ matrix whose inverse can be computed with complexity $O((NM)^2)$. If the implementation structure is to be in Fig. 7, $\Xi_m$ is not a function of $m$, and therefore, it is necessary to compute the inverse $(\Xi_m^H W \Xi_m)^{-1}$ only once; otherwise, it is necessary to compute the inverse $M$ times. The computational complexity is summarized in Table II. From the aforementioned analysis, it is clear that solving (21) is significantly less complex than solving (4).

V. COMPUTATIONAL ROBUSTNESS

We have shown the computational advantage of our multichannel technique over that of the filter-bank technique in terms of computational complexity in Section IV. In this section, we shall demonstrate, using an example, that the computational robustness of solving (21) is greater than the computational robustness of solving (4).
and are tabulated in Table III. In Table III, the $e^{j\omega t}$ factor corresponds to timing mismatch, and the $(1 + j\omega \bullet)$ factor corresponds to bandwidth mismatch. The multichannel filters and the filter bank were optimized in the WLS sense with uniform weight to compensate the mismatches for frequencies ranging from 0 to $0.45f_s$. The optimization processes were done using 32- and 64-bit floating point arithmetic for comparison. The timing of ADC-0 was taken as timing reference. Fig. 10 shows the spectral lines for the multichannel approach when the filters were optimized using 32-bit floating point arithmetic; the magnitudes of the spurious components (in decibels) are shown directly above the components. The largest spurious component has a magnitude of $-57.6$ dB. Comparing Fig. 10(c) and (f), it can be seen that the performance of the multichannel approach is, on average, better than the performance of the filter-bank approach. This is expected because the optimization of the filter-bank requires the inversion of a $410 \times 410$ matrix whereas the optimization of the multichannel filter requires the inversion of a $41 \times 41$ matrix; a larger matrix not only takes a longer time to invert but the computation is also more demanding in terms of arithmetic precision.

VI. EXAMPLES

Example-II: We shall compare the performance of the batch processing structure in Fig. 7 and that of the online processing structure in Fig. 8 using an example with $M = 5$ and $N = 41$. The bandwidth mismatch parameters $H_{A,m}(\omega)$, $m = 0, 1, \ldots, 4$, are shown in Table IV. The filters are optimized in the WLS sense with uniform weight over the frequency range from 0 to $0.4375f_s$. For the structure in Fig. 7, the optimization involves the inversion of one $41 \times 41$ matrix whereas the optimization of the filter-bank produces filters that have the same performance; the largest spurious component has a magnitude of $-73.2$ dB. Comparing Fig. 10(c) and (e), it can be seen that there is significant degradation in performance when the filters are optimized using 32-bit floating point arithmetic. Fig. 10(f) shows the spectral lines for the filter-bank approach when the filter bank was optimized using 32-bit floating point arithmetic; the magnitudes of the spurious components (in decibels) are shown directly above the components.

![Fig. 10](image1)

![Fig. 11](image2)

**TABLE III**

<table>
<thead>
<tr>
<th>Sub-ADC Mismatch Parameters for Example-I</th>
</tr>
</thead>
<tbody>
<tr>
<td>$H_{A,0}(\omega) = \frac{1}{1 + j\frac{0.03\omega f_s}{2\pi f_s}}$</td>
</tr>
<tr>
<td>$H_{A,3}(\omega) = \frac{e^{-j0.04\omega f_s}}{1 + j\frac{0.5\omega f_s}{2\pi f_s}}$</td>
</tr>
<tr>
<td>$H_{A,6}(\omega) = \frac{e^{j0.03\omega f_s}}{1 + j\frac{1.15\omega f_s}{2\pi f_s}}$</td>
</tr>
<tr>
<td>$H_{A,9}(\omega) = \frac{e^{-j0.02\omega f_s}}{1 + j\frac{0.75\omega f_s}{2\pi f_s}}$</td>
</tr>
</tbody>
</table>

**Example-I:** Consider an example with $M = 10$ and $N = 41$. The mismatch parameters $H_{A,m}(\omega)$, $m = 0, 1, \ldots, 9$, are tabulated in Table III. Fig. 10(a) shows the spectral lines of the input sinusoids. Fig. 10(b) shows the spectral lines of the uncompensated TIADC output. Fig. 10(c) and (d) shows the input and output using multichannel filters and filter banks, respectively, both optimized using 64-bit floating point arithmetic. It can be seen from Fig. 10(c) and (d) that both methods produce filters that have the same performance; the largest spurious component has a magnitude of $-107.0$ dB occurring at $0.3625f_s$. Fig. 10(e) shows the spectral lines for the multichannel approach when the filters were optimized using 32-bit floating point arithmetic; the magnitudes of the spurious components (in decibels) are shown directly above the components.
matrix since $\Xi_m$ is independent of $m$. For the structure in Fig. 8, the optimization involves the inversion of four $41 \times 41$ matrices since $\Xi_m$ is different for each $m$. The coefficient values for the multichannel filters synthesized using the batch matrix since $\Xi_m$ is independent of $m$. For the structure in Fig. 8, the optimization involves the inversion of four $41 \times 41$ matrices since $\Xi_m$ is different for each $m$. The coefficient values for the multichannel filters synthesized using the batch
0.0625 m f₁, m = 1, 2, ..., 7. Fig. 11(a) and (b) shows the spectral lines of the input sinusoids and the uncompensated TIADC output, respectively. Fig. 11(c) and (d) shows the spectral lines of the compensated outputs for the filter of Table V and that of Table VI, respectively. The magnitudes (in decibels) of the larger spurious components are also marked above them; the difference is insignificant. This is usually so for

Example-III: The multichannel approach has an advantage that Nₘ need not be the same for all m. If the mismatch of ADC-₁ is less serious than that of ADC-ᵢ, Nᵢ may be less than Nᵢ. For example, if ADC-0 is taken as reference, it is assumed to have no mismatch. Thus, we choose N₀ = 1. This can be seen in the coefficient values under the column k = 0 tabulated in Tables V and VI.

For the filter-bank approach, the length of FILTER-₀ cannot be reduced even though ADC-₀ is assumed to be perfect. Although dividing the transfer functions of FILTER-ᵢ by the transfer function of FILTER-₀ will make the transfer function of FILTER-₀ unity, it will make FILTER-ᵢ, i ̸= 0, become a recursive filter; the total number of distinct coefficients remains unchanged, and the filter becomes unstable if FILTER-₀ has a zero outside the unit circle.

We shall illustrate this advantage of the multichannel approach using an example with timing mismatch parameters as shown in Table VII. It can be seen from Table VII that the mismatch of ADC-₁ is significantly more serious than the others. Thus, we choose N₁ = 41, N₂ = N₃ = N₄ = 21. We choose N₀ = 1 since ADC-₀ is taken as reference. The coefficient values are shown in Table VIII, and the spectral plots are shown in Fig. 12. For comparison, the spectral plot for a design with N₁ = N₂ = N₃ = N₄ = 41 is shown in Fig. 12(d). Comparing Fig. 12(c) and (d), it can be seen that the degradation in performance is negligible for a large saving in the number of nontrivial coefficient values.

VII. MULTIPLIERLESS TIADC DESIGN USING MILP

A TIADC with photonic front end is able to sample at a very high sampling rate well above 100 GS/s (gigasamples per second) [43]. The analog input may be sampled using a modelocked laser and stretched in an optical fiber until it is slow enough for resampling using a high-speed electronic device. In the implementation of very high-sampling-rate TIADCs, it is necessary to reduce the computational complexity of the compensation filters. The building blocks of a compensation filter are the coefficient multiplier, adder, and delay. Among these building blocks, the coefficient multiplier is the one that consumes the most power, occupies the largest silicon area, and is slowest in speed. Thus, it is essential to reduce the complexity of the coefficient multiplier.

<table>
<thead>
<tr>
<th>k=0</th>
<th>k=1</th>
<th>k=2</th>
<th>k=3</th>
<th>k=4</th>
</tr>
</thead>
<tbody>
<tr>
<td>hₐ₀₀₀₀₆₃₆₂</td>
<td>0</td>
<td>0</td>
<td>0</td>
<td>0</td>
</tr>
<tr>
<td>hₐ₀₀₀₁₀₀₈₉</td>
<td>0</td>
<td>-0.000100₈₉</td>
<td>0</td>
<td>0</td>
</tr>
<tr>
<td>hₐ₀₀₀₂₂₀₅</td>
<td>0</td>
<td>0.00022₀₅</td>
<td>0</td>
<td>0</td>
</tr>
<tr>
<td>hₐ₀₀₀₄₄₀₂</td>
<td>0</td>
<td>-0.000₄₄₀₂</td>
<td>0</td>
<td>0</td>
</tr>
<tr>
<td>hₐ₀₀₀₆₆₀</td>
<td>0</td>
<td>0.000₆₆₀</td>
<td>0</td>
<td>0</td>
</tr>
<tr>
<td>hₐ₀₀₀₈₈₇₇</td>
<td>0</td>
<td>-0.000₈₈₇₇</td>
<td>0</td>
<td>0</td>
</tr>
<tr>
<td>hₐ₀₀₁₀₁₃</td>
<td>0</td>
<td>0.001₀₁₃</td>
<td>0</td>
<td>0</td>
</tr>
<tr>
<td>hₐ₀₀₁₂₁₈</td>
<td>0</td>
<td>-0.00₁₂₁₈</td>
<td>0</td>
<td>0</td>
</tr>
<tr>
<td>hₐ₀₀₁₄₂₄</td>
<td>0</td>
<td>0.00₁₄₂₄</td>
<td>0</td>
<td>0</td>
</tr>
<tr>
<td>hₐ₀₀₁₆₃₀</td>
<td>0</td>
<td>-0.00₁₆₃₀</td>
<td>0</td>
<td>0</td>
</tr>
<tr>
<td>hₐ₀₀₁₈₃₆</td>
<td>0</td>
<td>0.00₁₈₃₆</td>
<td>0</td>
<td>0</td>
</tr>
<tr>
<td>hₐ₀₀₂₀₄₂</td>
<td>0</td>
<td>-0.00₂₀₄₂</td>
<td>0</td>
<td>0</td>
</tr>
<tr>
<td>hₐ₀₀₂₂₅₀</td>
<td>0</td>
<td>0.00₂₂₅₀</td>
<td>0</td>
<td>0</td>
</tr>
<tr>
<td>hₐ₀₀₂₄₅₇</td>
<td>0</td>
<td>-0.00₂₄₅₇</td>
<td>0</td>
<td>0</td>
</tr>
<tr>
<td>hₐ₀₀₂₆₆₄</td>
<td>0</td>
<td>0.00₂₆₆₄</td>
<td>0</td>
<td>0</td>
</tr>
<tr>
<td>hₐ₀₀₂₈₇₈</td>
<td>0</td>
<td>-0.00₂₈₇₈</td>
<td>0</td>
<td>0</td>
</tr>
<tr>
<td>hₐ₀₀₃₀₈₅</td>
<td>0</td>
<td>0.00₃₀₈₅</td>
<td>0</td>
<td>0</td>
</tr>
<tr>
<td>hₐ₀₀₃₂₉₉</td>
<td>0</td>
<td>-0.00₃₂₉₉</td>
<td>0</td>
<td>0</td>
</tr>
<tr>
<td>hₐ₀₀₃₄₁₃</td>
<td>0</td>
<td>0.00₃₄₁₃</td>
<td>0</td>
<td>0</td>
</tr>
<tr>
<td>hₐ₀₀₃₆₂₇</td>
<td>0</td>
<td>-0.00₃₆₂₇</td>
<td>0</td>
<td>0</td>
</tr>
<tr>
<td>hₐ₀₀₃₈₄₁</td>
<td>0</td>
<td>0.00₃₈₄₁</td>
<td>0</td>
<td>0</td>
</tr>
<tr>
<td>hₐ₀₀₄₀₅₅</td>
<td>0</td>
<td>-0.00₄₀₅₅</td>
<td>0</td>
<td>0</td>
</tr>
<tr>
<td>hₐ₀₀₄₂₆₉</td>
<td>0</td>
<td>0.00₄₂₆₉</td>
<td>0</td>
<td>0</td>
</tr>
<tr>
<td>hₐ₀₀₄₄₈₃</td>
<td>0</td>
<td>-0.00₄₄₈₃</td>
<td>0</td>
<td>0</td>
</tr>
<tr>
<td>hₐ₀₀₄₆₉₇</td>
<td>0</td>
<td>0.00₄₆₉₇</td>
<td>0</td>
<td>0</td>
</tr>
<tr>
<td>hₐ₀₀₄₈₁₁</td>
<td>0</td>
<td>-0.00₄₈₁₁</td>
<td>0</td>
<td>0</td>
</tr>
<tr>
<td>hₐ₀₀₅₀₂₅</td>
<td>0</td>
<td>0.00₅₀₂₅</td>
<td>0</td>
<td>0</td>
</tr>
<tr>
<td>hₐ₀₀₅₂₃₉</td>
<td>0</td>
<td>-0.00₅₂₃₉</td>
<td>0</td>
<td>0</td>
</tr>
<tr>
<td>hₐ₀₀₅₄₄₃</td>
<td>0</td>
<td>0.00₅₄₄₃</td>
<td>0</td>
<td>0</td>
</tr>
<tr>
<td>hₐ₀₀₅₆₅₇</td>
<td>0</td>
<td>-0.00₅₆₅₇</td>
<td>0</td>
<td>0</td>
</tr>
<tr>
<td>hₐ₀₀₅₈₇₁</td>
<td>0</td>
<td>0.00₅₈₇₁</td>
<td>0</td>
<td>0</td>
</tr>
<tr>
<td>hₐ₀₀₆₀₈₅</td>
<td>0</td>
<td>-0.00₆₀₈₅</td>
<td>0</td>
<td>0</td>
</tr>
<tr>
<td>hₐ₀₀₆₂₉₉</td>
<td>0</td>
<td>0.00₆₂₉₉</td>
<td>0</td>
<td>0</td>
</tr>
<tr>
<td>hₐ₀₀₆₄₁₃</td>
<td>0</td>
<td>-0.00₆₄₁₃</td>
<td>0</td>
<td>0</td>
</tr>
<tr>
<td>hₐ₀₀₆₆₂₇</td>
<td>0</td>
<td>0.00₆₆₂₇</td>
<td>0</td>
<td>0</td>
</tr>
<tr>
<td>hₐ₀₀₆₈₄₁</td>
<td>0</td>
<td>-0.00₆₈₄₁</td>
<td>0</td>
<td>0</td>
</tr>
<tr>
<td>hₐ₀₀₇₀₅₅</td>
<td>0</td>
<td>0.00₇₀₅₅</td>
<td>0</td>
<td>0</td>
</tr>
<tr>
<td>hₐ₀₀₇₂₆₉</td>
<td>0</td>
<td>-0.00₇₂₆₹</td>
<td>0</td>
<td>0</td>
</tr>
<tr>
<td>hₐ₀₀₇₄₇₃</td>
<td>0</td>
<td>0.00₇₄₇₃</td>
<td>0</td>
<td>0</td>
</tr>
<tr>
<td>hₐ₀₀₇₆₈₇</td>
<td>0</td>
<td>-0.00₇₆₈₇</td>
<td>0</td>
<td>0</td>
</tr>
<tr>
<td>hₐ₀₀₇₈₉₁</td>
<td>0</td>
<td>0.00₇₈₉₁</td>
<td>0</td>
<td>0</td>
</tr>
<tr>
<td>hₐ₀₀₈₀₦₅</td>
<td>0</td>
<td>-0.00₈₀₦₅</td>
<td>0</td>
<td>0</td>
</tr>
<tr>
<td>hₐ₀₀₈₂₈₉</td>
<td>0</td>
<td>0.00₈₂₈₉</td>
<td>0</td>
<td>0</td>
</tr>
<tr>
<td>hₐ₀₀₈₄₉₃</td>
<td>0</td>
<td>-0.00₈₄₉₃</td>
<td>0</td>
<td>0</td>
</tr>
<tr>
<td>hₐ₀₀₈₆₉₇</td>
<td>0</td>
<td>0.00₈₆₉₇</td>
<td>0</td>
<td>0</td>
</tr>
<tr>
<td>hₐ₀₀₈₈₀₁</td>
<td>0</td>
<td>-0.00₈₈₀₁</td>
<td>0</td>
<td>0</td>
</tr>
<tr>
<td>hₐ₀₀₉₀₀₅</td>
<td>0</td>
<td>0.00₉₀₀₅</td>
<td>0</td>
<td>0</td>
</tr>
</tbody>
</table>
In binary arithmetic, multiplying a number by an integer power of two is a trivial process; it can be realized by wiring without using any active device in a full custom implementation in silicon. Consequently, if each coefficient value of a compensation filter is expressed as a sum of a limited number of signed power of two (SPT) terms [44]–[50], the resulting filter is essentially “multiplierless” from an implementation point of view. Various methods have been developed for optimizing the coefficient values in the SPT coefficient space for a digital filter. Among these techniques, MILP [51] is the only method that can guarantee the obtaining of the optimum solution for a given number of SPT terms allocated to each coefficient. Discrete space optimization is a time-consuming process. The computing time required increases exponentially with the number of discrete variables. From the statistics given in [49], the computer time required is approximately proportional to $10^{0.2P}$, where $P$ is the number of filter coefficients to be optimized in the discrete space. Note that the expression $10^{0.2P}$ gives an estimate on the order of magnitude of the computer time required. Individual runs may differ by over an order of magnitude from what is predicted by $10^{0.2P}$.

When using MILP, the error magnitude $|E_m(e^{jM_\omega})|$ is constrained to be not more than $w(\omega)\delta$, where $w(\omega)$ is a weighting function and $\delta$ is a variable to be minimized. Unfortunately, $E_m(e^{jM_\omega})$ is complex, and $|E_m(e^{jM_\omega})| = \sqrt{\text{Re}[E_m(e^{jM_\omega})]^2 + \text{Im}[E_m(e^{jM_\omega})]^2}$ is not a linear function of $h_m$ where $\text{Re}[x]$ and $\text{Im}[x]$ denote the real and imaginary parts of $x$, respectively. The optimum cannot be found using linear programming by directly minimizing $\sqrt{\text{Re}[E_m(e^{jM_\omega})]^2 + \text{Im}[E_m(e^{jM_\omega})]^2}$. However, the difficulty can be overcome as follows. By multiplying both sides of (16) by $e^{\theta \delta}$, we have

$$e^{\theta \delta}E_m(e^{jM_\omega}) = V(\omega) \left\{ e^{\theta \delta} \left[ h_m e^{jM_\omega} V(\omega) \right]^T \right\} - e^{\theta \delta} e^{j(M - M_n)\omega}.$$  

(22)

Our aim is to constrain the maximum value of $|E_m(e^{jM_\omega})|$ such that $|E_m(e^{jM_\omega})| \leq w(\omega)\delta$. Note that $|E_m(e^{jM_\omega})| = e^{\theta \delta} E_m(e^{jM_\omega}) = \text{Re}[e^{\theta \delta} E_m(e^{jM_\omega})] \text{Re}[e^{\theta \delta} E_m(e^{jM_\omega})]$ if $\theta$ is equal to the negative phase angle of $E_m(e^{jM_\omega})$. We do not know the phase angle of $E_m(e^{jM_\omega})$ but we can optimize $E_m(e^{jM_\omega})$ over a dense grid of possible phase angles. Thus, the constraint $|E_m(e^{jM_\omega})| \leq w(\omega)\delta$ can be imposed by imposing the set of constraints $|\text{Re}[e^{\theta \delta} E_m(e^{jM_\omega})]| \leq w(\omega)\delta$ on a dense grid of $\theta$ where $0 \leq \theta \leq \pi$. Thus, we have the MILP problem

$$\text{Re}\left[ e^{\theta \delta} \left[ h_m e^{jM_\omega} V(\omega) \right]^T \right] - w(\omega)\delta \leq \text{Re}\left[ e^{\theta \delta} e^{j(M - M_n)\omega} V(\omega) \right] \quad \text{over all } \omega \text{ and } \theta \quad (23a)$$

$$\text{Re}\left[ e^{\theta \delta} \left[ h_m e^{jM_\omega} V(\omega) \right]^T \right] + w(\omega)\delta \geq \text{Re}\left[ e^{\theta \delta} e^{j(M - M_n)\omega} V(\omega) \right] \quad \text{over all } \omega \text{ and } \theta \quad (23b)$$

Objective: minimize $\delta$ (23c)

Subject to: elements of $h_m$ are sum of a specified number of SPT terms, (23d)

| Example-IV: | We shall illustrate a multiplierless multichannel TIADC compensation filter using the mismatch parameters as shown in Table IX. The multichannel filters are optimized using MILP over the frequency ranging from 0 to 0.375$f_s$ with $\delta$ constrained to be less than 0.00001. Our objective is to minimize the total number of SPT terms. In the optimization, each coefficient is given a fixed number of SPT terms. The number of coefficients for the multichannel filters are $N_1 = 1$, $N_1 = N_2 = 20$, and $N_2 = N_3 = 21$. The coefficient values of our multichannel filters are shown in Table X. The last row in Table X shows the computer time required on an HPxw4400 PC. The spectrum plots are shown in Fig. 13. Fig. 13(a) shows the input spectral lines at $0.625 mf_s$, $m = 1, 2, \ldots, 6$. Fig. 13(b) and (c) shows the spectral lines of the uncompensated and compensated TIADC, respectively.

On the average, the computer time required to optimize each of the multichannel filters is $(42 + 412 + 80 + 122)/4 \approx 3$ min. If the filter-bank approach is used, the number of variables to be optimized in the discrete space would be about 100; using the estimation formula that the computer time required is proportional to $10^{0.2P}$, the computer time required would be approximately $3 \times 10^{16}$ min, $\approx 60$ billion years.

VIII. 14-BIT 1.6-GS/s TIADC

The spectral plots shown in the previous examples were simulated results where the sub-ADCs were assumed to be perfect. In an actual TIADC system, the sub-ADCs are not perfect, and the TIADC spectral plot contains spurs other than those due to sub-ADC mismatch.
Example-V: Our mismatch compensation technique was implemented on a system that employs four 14-bit 400-MS/s sub-ADCs to construct a 14-bit 1.6-GS/s TIADC. The mismatch between sub-ADCs was characterized by injecting sinusoids spaced 1 MHz apart one at a time. The phase and amplitude of the fundamental frequency of the sinusoid at the output of the sub-ADCs were used to determine the phase and amplitude of the mismatch.

Fig. 14 shows a spectral plot of the test result when the input frequency was 580.2 MHz. The spurious frequencies due to sub-ADC mismatch were 180.2, 219.8, and 619.8 MHz, respectively. The magnitude of the input sinusoid was about 82% of full scale and was normalized to 0 dB. Each spectral line in each of the plots was the root-mean-square (rms) value of 50 test runs; 8000 data samples were collected for computing the spectral lines in each test run. Since the ADC had 14 bits, if the TIADC was perfect, the expected value of each spectral line (other than those corresponding to the input frequencies) would be 0 dB.

It can be seen from Fig. 14 that the noise floor was significantly higher than 0 dB. Actual measurements showed that the noise floor was 25.2 dB; this corresponds to a degradation of 25.2 dB (or 4.2 bits). Thus, the effective precision of the ADC was about 9.8 bits (instead of 14 bits) at 580.2 MHz.

The top plot of Fig. 14 shows the spectral plot for the output of the uncompensated TIADC. The magnitudes of the spurious lines due to mismatch were 0, -2, and -17 dB, respectively. The other spurs were due to the nonlinearity of the sub-ADCs. The bottom plot of Fig. 14 shows the spectral lines for the output of the WLS-compensated TIADC where the length of each of the compensation filters was 7. It can be seen that the spurs due to sub-ADC mismatch were significantly reduced and the magnitudes were less than those of the spurs due to the nonlinearity of the sub-ADC. Thus, for this particular case, unless
the linearity of the individual sub-ADC was improved, there was little advantage in using compensation filters longer than 7.

Figs. 15 and 16 show the results when the 580.2-MHz sinusoid of Fig. 14 was replaced by 221.2- and 69.8-MHz sinusoids, respectively. In Fig. 15, the magnitude of the 221.2-MHz sinusoid was 85% of the full scale, and after normalizing the magnitude of the 221.2-MHz signal to 0 dB, the noise floor was 105.2 dB. Thus, the effective word length of the ADC at 221.2 MHz was

\[ 1 + \frac{\log_{10} \left( \frac{10^{105.2/10}}{0.5 \times 12 \times 8000} \right)}{0.85} \text{bits} = 10.9 \text{ bits,} \]

In Fig. 16, the magnitude of the 69.8-MHz sinusoid was 89% of the full scale, and after normalizing the magnitude of the 69.8-MHz signal to 0 dB, the noise floor was 107.5 dB. Thus, the effective word length of the ADC at 69.8 MHz was

\[ 1 + \frac{\log_{10} \left( \frac{10^{107.5/10}}{0.5 \times 12 \times 8000} \right)}{0.89} \text{bits} = 11.2 \text{ bits,} \]

IX. Conclusion

In this paper, we have presented a new multichannel approach for TIADC mismatch compensation. Comparing with the filter-bank approach, for the same number of coefficients \( NM \), the two approaches produce filters with the same performance if the filters were designed with sufficient arithmetic precision.

Our new multichannel approach has the following advantages over the filter-bank approach. First, for the same total number of coefficients \( NM \), each dimension of the matrices in our approach is a factor of \( M \) smaller than in the filter-bank approach. Second, in our approach, the matrix to be inverted has a dimension of \( N \times N \), whereas the matrix to be inverted in the filter-bank case has a dimension of \( NM \times NM \). This means that the computational complexity of the filter design in our approach is lower. Third, as a result of the smaller matrices, the filter design in our approach is computationally more robust than the filter design in the filter-bank approach, as illustrated in Example-I. Lastly, in our approach, each compensation filter may have a different number of coefficients; a less seriously mismatched sub-ADC may be compensated by using a filter with less number of coefficients.

Our multichannel approach and the filter-bank approach are developed based on different principles. In our approach, we make use of the fact that a signal may be represented by a weighted combination of any irregularly spaced samples provided that the highest frequency component of the signal is less than 0.5 times the average sampling speed (the generalized sampling theorem). We use a multichannel filter to estimate the ideal output of each sub-ADC. In the filter-bank approach, the frequency spectrum of each sub-ADC output is shaped by a filter in such a way that, when all the filter outputs are summed, the aliasing terms cancel each other, and the baseband term becomes the approximate to the input analog signal.

The response of a hardware 1.6-GS/s TIADC implemented using four 400-MS/s sub-ADCs has also been presented. It can be seen from the spurious frequency components before and after mismatch compensation that mismatch is not the only source of spurious frequency components. Spurious frequency components that are not caused by mismatch cannot be removed using the mismatch compensation technique.

X. Future Work

In some systems, the TIADC operates in a tightly controlled environment, and mismatch parameter drift is not a problem. In some systems, the luxury of a tightly controlled environment is not available. In such cases, the TIADC mismatch parameters may drift with environmental variables such as temperature, humidity, proximity to ferromagnetic materials, etc., depending on implementation. The design of a TIADC compensation filter subject to environmental variable drift is part of our work that will be reported in the future.

From 1982 to 2003, he was with the Department of Electrical Engineering, National University of Singapore, Singapore, Singapore. From 1980 to 1982, he was a National Research Council Research Associate with the Naval Postgraduate School, Monterey, CA. Since 2003, he has been with the School of Electrical and Electronic Engineering, Nanyang Technological University, Singapore, where he is currently a Professor. He has also served as an Associate Editor for Circuits, Systems and Signal Processing from 1993 to 2000. His research interests include digital signal processing and VLSI circuits and system design.

Dr. Lim was a recipient of the 1996 IEEE Circuits and Systems Society’s Guillemin–Cauer Award, the 1990 IREE (Australia) Norman Hayes Memorial Award, 1977 IEE (U.K.) Prize, and the 1974–1977 Siemens Memorial (Imperial College) Award. He served as a Lecturer for the IEEE Circuits and Systems Society under the distinguished lecturer program from 2001 to 2002 and as an Associate Editor for the IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS from 1993 to 2000.

He served as the Chairman of the DSP Technical Committee of the IEEE Circuits and Systems Society from 1998 to 2000. He served in the Technical Program Committee’s DSP Track as the Chairman in the IEEE International Symposium on Circuits and Systems (ISCAS) 1997 and 2000 and as a Cochairman in the IEEE ISCAS 1999. He is the General Chairman for the IEEE Asia Pacific Conference on Circuits and Systems 2006 and a General Cochair for IEEE ISCAS 2009. He is a member of Eta Kappa Nu.

Yue-Xian Zou received the Ph.D. degree from The University of Hong Kong, Hong Kong, Hong Kong, in 2000. She is an Associate Professor with Peking University, Beijing, China, and is the Director of the Advanced Digital Signal Processing Laboratory, Shenzhen Graduate School, Peking University, Shenzhen, China. She is currently working on a high-speed and high-resolution analog-to-digital converter project. Her research work includes digital filter design, adaptive signal processing, array signal processing, and video signal processing.

Jun Wei Lee (S’02–M’06) was born in Singapore on October 26, 1976. He received the B.Eng. degree in electrical engineering from the National University of Singapore, Singapore, Singapore, in 2001 and the Ph.D. degree in electrical engineering from both the National University of Singapore and the Technische Universiteit Eindhoven, Eindhoven, The Netherlands, in 2007.

Since 2005, he has been a Researcher with Temasek Laboratories, Nanyang Technological University, Singapore. His research interests include time-interleaved analog-to-digital conversion, design of digital filters and filter banks, and audio compression.

Shing-Chow Chan (S’87–M’92) received the B.Sc.(Eng.) and Ph.D. degrees in electrical engineering from The University of Hong Kong, Hong Kong, Hong Kong, in 1986 and 1992, respectively. He joined the City Polytechnic of Hong Kong, Kowloon, Hong Kong, in 1990 as an Assistant Lecturer and, later, as a University Lecturer. Since 1994, he has been with the Department of Electrical and Electronic Engineering, The University of Hong Kong, where he is currently a Professor. He holds visiting positions in Microsoft Corporation, Redmond, U.S., Microsoft Research Asia, University of Texas, Arlington, and Nanyang Technological University, Singapore, Singapore. He has published more than 200 journal and conference articles and is the coauthor of a book entitled Image-Based Rendering (Springer, 2006). He is an Associate Editor of the Journal of Signal Processing Systems. His research interests include fast transform algorithms, filter design and realization, multirate signal processing, communications signal processing, and image-based rendering.

Dr. Chan is currently a member of the Digital Signal Processing Technical Committee of the IEEE Circuits and Systems Society and an Associate Editor for the IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS—I: REGULAR PAPERS. He was the Chairman of the IEEE Hong Kong Chapter of Signal Processing from 2000 to 2002. He is an Associate Editor of the IEEE Transactions on Circuits and Systems—II: Express Briefs from 2002 to 2004 and the Technical Program Chair of the IEEE Asia Pacific Conference on Circuits and Systems 2006 and IEEE APCCAS 2008, IEEE APCCAS 2008, and IEEE Biomedical Circuits and Systems Conference 2006. He is also in the organizing committee of the IEEE International Conference on Image Processing 2010.