 Research
 Open access
 Published:
Robust design of Farrowstructurebased steerable broadband beamformers with sparse tap weights via convex optimization
EURASIP Journal on Audio, Speech, and Music Processing volumeÂ 2015, ArticleÂ number:Â 14 (2015)
Abstract
The Farrowstructurebased steerable broadband beamformer (FSBB) is particularly useful in the applications where sound source of interest may move around a wide angular range. However, in contrast with conventional filterandsum beamformer, the passband steerability of FSBB is achieved at the cost of high complexity in structure, i.e., highly increased number of tap weights. Moreover, it has been shown that the FSBB is sensitive to microphone mismatches, and robust FSBB design is of interest to practical applications. To deal with the aforementioned problems, this paper studies the robust design of the FSBB with sparse tap weights via convex optimization by considering some a priori knowledge of microphone mismatches. It is shown that although the worstcase performance (WCP) optimization has been successfully applied to the design of robust filterandsum beamformers with bounded microphone mismatches, it may become unapplicable to robust FSBB design due to its overconservativeness nature. When limited knowledge of mean and variance of microphone mismatches is available, a robust FSBB design approach based on the worstcase mean performance optimization with the passband response variance (PRV) constraint is devised. Unlike the WCP optimization design, this approach performs well with the capability of passband stability control of array response. Finally, the robust FSBB design with sparse tap weights has been studied. It is shown that there is redundancy in the tap weights of FSBB, i.e., robust FSBB design with sparse tap weights is viable, and thus leads to lowcomplexity FSBB.
1 Introduction
As one of the key technologies for microphone arrays, broadband beamforming has been used in a wide range of audio and speech processing applications, such as teleconferencing, hearing aids, and audio surveillance [1â€“6]. The most popular methods for broadband beamforming for microphone arrays are based on the wellknown filterandsum structure [2]. In practice, sound source of interest may move around some angular range. Accordingly, the passband width of a broadband beamformer usually needs to be designed to cover the whole angular range of movement of the sound source.^{1} It is known that, there is a tradeoff between passband width and stopband attenuation for a filterandsum broadband beamformer, i.e., the larger the passband width, the worse the stopband attenuation, and vice versa.^{2} As a result, the spatial filtering performance of filterandsum broadband beamformers will deteriorate greatly when the sound source is moving around a wide angular range. To combat this problem, one promising solution is to design steerable broadband beamformers, where their passbands can be adjusted dynamically with a simple scheme, with no need of redesign of the broadband beamformers.
Recent years have seen great interest in the design of steerable broadband beamformers for microphone arrays [7â€“14]. Among the proposed design approaches, some are tailored to specific array configurations, such as differential microphone arrays [7] and spherical microphone arrays [8]. Comparatively, the Farrowstructurebased steerable broadband beamformers (FSBBs), also known as the polynomial beamformers [12], are particularly interesting in some applications, since they are applicable to arbitrary array configurations, and moreover, their passbands can be steered online with just one single parameter [10]. In practice, there usually exist some mismatches among microphones, such as gain and phase errors [15]. Unfortunately, the FSBBs are highly sensitive to microphone mismatches. Therefore, the design of FSBBs robust against microphone mismatches has drawn attention recently.
Generally speaking, according to whether any priori knowledge of microphone characteristics is used or not, the existing design approaches for robust FSBBs can be classified into two categories. In [9, 12], white noise gain (WNG) constraint has been utilized to design robust FSBBs, where no knowledge of microphone characteristics is considered. However, the main problem with the WNG constraintbased approach is that it is unclear how to choose the WNG constraint level optimally. To get over the problem, a robust FSBB design approach based on the weighted least squares has been proposed in [11], which takes into account the probability density function (PDF) of microphone characteristics. By considering the knowledge of microphone characteristics, no usertuning parameters are required any more; thus, it can facilitate the FSBB design. Although this design approach has shown robust against microphone mismatches, the difficulty with the approach is that the PDF of microphone characteristics may not be easily accessible in practice. Instead, the bounds of uncertain microphone mismatches [16, 17] or the limited knowledge of mean and variance of microphone mismatches [18] may be practically available to a designer. Therefore, it is necessary to establish efficient design schemes for robust FSBBs by considering these types of knowledge of microphone characteristics. Besides the aforementioned robustness problem of the FSBB in the presence of microphone mismatches, another problem with the FSBB is that its computational complexity is particularly demanding in contrast with its counterpart based on filterandsum structure, which is the price it has paid for the passband steerability. However, the lowcomplexity FSBB design has not been addressed in the literature, which is of interest to practical applications.
Inspired by our previous work on robust filterandsum beamformer design [16, 18], the robust FSBB design using convex optimization with some priori knowledge of microphone mismatches is studied in this paper. Moreover, to reduce the computational complexity of the robust FSBB, the robust FSBB design with sparse tap weights has also been studied. To summarize, the contributions of the paper are threefold:

For bounded microphone mismatches, the robust FSBB design based on the worstcase performance (WCP) optimization criterion has been established. It is shown that although the WCP optimization has been successfully applied to the design of robust filterandsum beamformers as in [16, 17]; unfortunately, it may become unapplicable to robust FSBB design due to its overconservativeness nature as analyzed in the paper.

When limited knowledge of mean and variance of microphone mismatches is available to a designer, the robust FSBB design approach based on the worstcase mean performance (WCMP) optimization with the passband response variance (PRV) constraint is developed. Unlike the WCP optimizationbased design, the proposed approach performs well for robust FSBB design with the capability of passband stability control of array response. Moreover, some insights into the properties of the PRV of robust FSBB have also been revealed.

In contrast with filterandsum beamformer, the passband steerability of FSBB is achieved at the cost of high complexity in structure, i.e., highly increased number of tap weights. However, it is shown that there is redundancy in tap weights of FSBB, i.e., robust FSBB design with sparse tap weights is viable. To this end, a twostage approach for the design of robust FSBB with sparse tap weights using the reweighted l _{1}norm constraint optimization has been proposed, which leads to the design of lowcomplexity FSBB.
The rest of the paper is organized as follows. In Section 2, we formulate the problem of robust FSBB design. In Section 3, we present the robust FSBB design using the WCP optimization, when the bounds of microphone mismatches are known. In Section 4, we develop the robust FSBB design using the WCMP optimization with the PRV constraint, when the limited knowledge of mean and variance of microphone mismatches is available. In Section 5, the robust FSBB design with sparse tap weights is studied. Design examples are presented in Section 6 to illustrate the performance of the proposed approaches. Finally, Section 7 concludes the paper.
2 Problem formulation
Consider a Kelement linear microphone array in the farfield, where the distance the kth microphone and the center of the array is denoted by d _{ k }. The configuration of the FSBB is shown in Fig. 1. Unlike the wellknown filterandsum beamformers, herein, a Farrow structure consisting of M finiteimpulseresponse (FIR) subfilters is used behind each microphone, where the tap length of each FIR subfilter is N. The beampattern of the FSBB at frequency f and angle of arrival Î¸ (defined with respective to the array axis, Î¸âˆˆ(0,180Â°)) can be expressed as [11]
where \(A_{k}\left (\,f,\theta \right)=\left [1+a_{k}\left (\,f,\theta \right)\right ]e^{j\gamma _{k}\left (\,f,\theta \right)}\) with a _{ k }(f,Î¸) and Î³ _{ k }(f,Î¸) being the gain and phase errors of the kth microphone, w _{ k,n,m } denote the weights of the FSBB, c is the speed of sound, f _{ s } represents the sampling frequency, and D=(Ï• _{ d }âˆ’90Â°)/90Â° with Ï• _{ d } being the desired steering direction of the FSBB.
To simplify notation, (1) can be rewritten in the vector form
where (Â·)^{T} denotes the transpose, w=[w _{0,0,0},â‹¯,w _{0,0,Mâˆ’1},w _{0,Nâˆ’1,0},â‹¯,w _{0,Nâˆ’1,Mâˆ’1},â‹¯,w _{ Kâˆ’1,Nâˆ’1,0},â‹¯,w _{ Kâˆ’1,Nâˆ’1,Mâˆ’1}]^{T} is the weight vector of the FSBB, and \(\mathbf {\overline {g}}(\phi _{d},f,\theta)\) is the array steering vector, which is given by
with
where âŠ™ denotes the Hadamard product, and âŠ— denotes the Kronecker product.
Given some priori knowledge on microphone characteristics A(f,Î¸) and a desired response P _{ d }(Ï• _{ d },f,Î¸), our problem is to design an optimal robust beamformer weight vector w using some criterion such that the beamformer response P(Ï• _{ d },f,Î¸) can optimally fit P _{ d }(Ï• _{ d },f,Î¸) over the predefined frequencyangle range of interest (f,Î¸)âˆˆÎ© and the predefined steering direction range of interest Ï• _{ d }âˆˆÎ¨âŠ†(0,180Â°). The advantage of the FSBB is that its passband can be steered towards arbitrary directions with no need of redesign of beamformer weight vector.
3 Robust FSBB design using the WCP optimization
In this section, we study the robust FSBB design via convex optimization by using the WCP optimization in the case of bounded microphone mismatches. To proceed, first, we need to introduce a nonrobust design approach using minimax criterion when there are no microphone mismatches.
3.1 Nonrobust design
When there are no microphone mismatches, the microphone characteristics now become A _{ k }(f,Î¸)=1,(k=0,â‹¯,Kâˆ’1). Accordingly, (3) reduces to
where g(Ï• _{ d },f,Î¸) denotes the steering vector without microphone mismatches.
The problem for FSBB design using the minimax criterion can be formulated as
which can be recast as the following semiindefinite convex programming
The above problem can be further formulated as a secondorder cone programming (SOCP) problem and thus can be solved efficiently via the interior point methods [19, 20].
3.2 Robust design
Now, we consider the robust design of FSBB in the presence of bounded microphone mismatches by using the WCP optimizationbased criterion. Due to microphone mismatches, there will exist some perturbation in the steering vector of FSBB, i.e., \(\Delta \mathbf {g}(\phi _{d},f,\theta)=\mathbf {\overline {g}}(\phi _{d},f,\theta)\mathbf {g}(\phi _{d},f,\theta)\). Assume a _{ k }(f,Î¸)â‰¤Î´ _{ a }<1 and Î³ _{ k }(f,Î¸)â‰¤Î´ _{ Î³ }<Ï€/2, where Î´ _{ a } and Î´ _{ Î³ } are the known bounds. Regarding the perturbation of the steering vector of FSBB, we have the following proposition.
Proposition 1.
The perturbation of the steering vector of FSBB is bounded by
Proof.
Using (3) and (8), and noting that D<1, it holds that
The design of robust FSBB with the WCP optimization can be formulated as
With Proposition 1, the problem (12) can be reformulated as the following minimax problem
where Îµ is chosen as the lower bound of Î” g given by (11). By introducing some auxiliary variables, (13) can be recast as the following convex optimization problem
The procedures of the robust FSBB design using the WCP optimization are summarized in the following.
Remark 1.
As we know, although the WCP optimization approach has been successfully used in the design of robust broadband beamformers with the filterandsum structure, it is conservative because the worst scenario that all microphone mismatch errors simultaneously attain their maximal values rarely occurs in practice. In contrast, the robust design of FSBB using the WCP optimization is more conservative since it just considers the more rarely occurred worst case, which requires not only that all microphone mismatch errors simultaneously attain their maximal values but also that the steering direction of the FSBB is at the boundary of the steering direction range of interest (note that (1âˆ’D ^{2M})/(1âˆ’D ^{2}) in (11) achieves its maximal value when the steering direction is at the boundary of Î¨). As a result, the WCP optimizationbased design for robust FSBB suffers from outstanding overconstraint problem which may lead to poor design performance.
4 Robust FSBB design using the WCMP optimization with the PRV constraint
In this section, we study the robust FSBB design via convex optimization when the knowledge we have on microphone mismatches is only their bounded mean and variance.
4.1 Robust design using the WCMP optimization
Suppose the mean values of microphone gain and phase mismatches are imprecisely known and are bounded by some known small constants Î¼ _{ a } and Î¼ _{ Î³ } respectively, i.e., \(\mathbb {E}\{a_{k}(f,\theta)\}\leq \mu _{a}\), \(\mathbb {E}\{\gamma _{k}(f,\theta)\}\leq \mu _{\gamma }\), where \(\mathbb {E}\{\cdot \}\) denotes the mean value. Following the similar derivation as Proposition 1, it holds that the mean perturbation of the steering vector of the FSBB is bounded by
The robust design for the FSBB using the WCMP optimization can be cast as
Using (15), the WCMP optimization problem can be reformulated as
where \(\overline {\varepsilon } \) is chosen as the lower bound of \(\\mathbb {E}\{\Delta \mathbf {g}(\phi _{d},f,\theta)\}\\) given by (15). Alternatively, the optimization problem (17) can further be recast as the following SOCP problem
Remark 2.
Like the WCP optimizationbased design, the WCMP optimizationbased design also belongs to the class of white noise gain constraintbased approaches. Consider the fact that Î¼ _{ a }<Î´ _{ a } and Î¼ _{ Î³ }<Î´ _{ Î³ }, it follows from (15) and (11) that \(\overline {\varepsilon }<\varepsilon \). Therefore, the WCMP optimizationbased design is less conservative than the WCP optimizationbased design and hence is suitable for robust FSBB design as demonstrated by the simulation results in Section 6.
4.2 Robust design incorporating the PRV constraint
To enhance the robustness of the FSBB, i.e., to improve its stability of passband response and hence to reduce target signal distortion, we hereby consider to incorporate the PRV constraint into the design procedures by using the bounded variances of microphone mismatches. To proceed, we make the following assumptions [15]: 1) microphone gain and phase errors are uncorrelated; 2) all microphones have the same variances Var{a(f,Î¸)} and Var{Î³(f,Î¸)} for gain and phase errors, respectively. The only knowledge we have about Var{a(f,Î¸)} and Var{Î³(f,Î¸)} is that they are bounded by some known constants, i.e., \(\text {Var}\{a(f,\theta)\}{\leq \sigma ^{2}_{a}}\) and \(\text {Var}\{\gamma (f,\theta)\}\leq \sigma ^{2}_{\gamma }\).
Theorem 1.
The variance of the array response of the FSBB in the presence of microphone gain and phase mismatches is given by
where the (i,j)th element of Q(Ï• _{ d },f,Î¸) is
where n _{1}= mod (âŒˆi/MâŒ‰âˆ’1,N), k _{1}=âŒˆ(âŒˆi/MâŒ‰)/NâŒ‰âˆ’1, m _{1}= mod (iâˆ’1,M), n _{2}= mod (âŒˆj/MâŒ‰âˆ’1,N), k _{2}=âŒˆ(âŒˆj/MâŒ‰)/NâŒ‰âˆ’1, m _{2}= mod (jâˆ’1,M), where mod (iâˆ’1,M) is the remainder of (iâˆ’1)/M, and âŒˆi/MâŒ‰ denotes the smallest integer larger than or equal to i/M.
Proof.
With (2), (3), and (8), we have
where the superscript (Â·)^{H} represents the Hermitian transpose, and
with its (i,j)th element given by
where the superscript (Â·)^{âˆ—} denotes the complex conjugate. This completes the proof.
Regarding the properties of the PRV of the FSBB, we have the following remarks.
Remark 3.
Given a specific steering direction Ï• _{ d }, it is interesting to note that the PRV of the FSBB is independent of angle Î¸, i.e., the effect of microphone gain and phase mismatches on the PRV of the FSBB is angleinvariant. However, the PRV of the FSBB is steering direction variant. It has been found that the PRV of the FSBB tends to increase with the steering direction deviating from the array broadside direction as revealed in Section 6.
Based on (18) and Theorem 1, our proposed robust design criterion using the WCMP optimization with the PRV constraint can be formulated as
where Î²>0 is a tradeoff parameter between the mean deviation of the actual array response from the desired response and the PRV. It is noted that, when incorporating the PRV constraint directly from (19), the illconditioned matrix Q(Ï• _{ d },f,Î¸) may lead to numerical instability problem. To overcome this problem, the average PRV over the whole passband has been used instead in the third constraint \(\mathbf {w}^{T}\mathbf {\overline {Q}(\phi _{d},f,\theta)}\mathbf {w}\leq \lambda \), where \(\mathbf {\overline {Q}(\phi _{d},f,\theta)}\) denotes the average of Q ( Ï• _{ d } , f , Î¸ ) in the passband.
To summarize, the design approach for the robust FSBB using the WCMP optimization with the PRV constraint consists of the following steps.
5 Robust design of the FSBB with sparse tap weights
Although the FSBB can be flexibly steered towards any desired direction, it is at the cost of increased number of FIR filters in structure, and hence is more computationally demanding, compared with conventional filterandsum beamformers. An interesting problem now arises: Is there any redundancy in the tap weights of the FSBB by using the above design approaches? If so, the constraint on the sparseness of tap weights of the FSBB can be incorporated into the robust design approaches to reduce the computational complexity of the FSBB. To this end, a twostage approach for the design of robust FSBB with sparse tap weights via convex optimization is proposed in this section. Considering the WCMP optimizationbased design with the PRV constraint is more efficient than its counterpart based on the WCP optimization for robust design of the FSBB; as discussed in Section 6, hereafter, we will focus on the WCMP optimizationbased design by incorporating the sparsity constraint on tap weights.
The first stage of our proposed design approach is to find potential redundancy in tap weights of the FSBB using the WCMP optimizationbased design approach. Based on (17), the problem can be mathematically formulated as
where the l _{0}norm âˆ¥Â·âˆ¥_{0} is the count of the number of nonzero elements of its argument, and Î¼ denotes the user parameter to control the degree of sparsity of the tap weights. Unfortunately, (24) is a NPhard optimization problem due to the nonconvex l _{0}norm. As it is known, the l _{1}norm is the closest convex function to the l _{0} norm and the l _{1}norm is usually able to produce sparse solutions. To solve the difficult problem (24) efficiently, the iterative reweighted l _{1}norm constraint [21] is used instead to approximate the l _{0}norm constraint in (24). Explicitly, at the lth iteration, we solve the following constrained convex optimization problem
where âˆ¥Â·âˆ¥_{1} denotes the l _{1}norm, D ^{(l)}=diag{D _{1},D _{2},â‹¯,D _{ KNM }} with \(D_{i}=1/\left (w_{i}^{(l1)}+\epsilon \right)\) being the reweighting matrix and Îµ being a small positive value to provide numerical stability, \(w_{i}^{(l1)}\) represents the ith component of w at the (lâˆ’1)th iteration, and S ^{(l)} is the index set of the sparse tap weights for the lth iteration, which is obtained by comparing the tap weights w at the (lâˆ’1)th iteration with a predefined smallvalued threshold Î¾ _{ T }, in particular, when the weight \({w_{i}^{(l1)}}\leq \xi _{T}\), then \(w_{i}^{(l1)}\) should be reset to zero; otherwise, it will be kept unchanged. By using the reweighted l _{1}norm constraint, those tap weights whose magnitudes are small are imposed larger weightings in the next iteration and vice versa, and accordingly, the sparsity of the tap weights is enhanced. For initialization, D ^{(0)} is set to identity matrix and S ^{(0)} is set to the null set. The above convex optimization problem (25) is solved repeatedly until the preset maximum number of iterations L is achieved.
The second stage of the proposed approach is to incorporate the PRV constraint in the design procedures. Considering (23), the design problem can be finally formulated as the following convex optimization problem
In summary, the twostage design approach for the robust FSBB with sparse tap weights include the following steps.
6 Design examples
In this section, some design examples are presented to demonstrate the performance of the design approaches proposed above. The CVX convex optimization toolbox [22] has been used to solve all the convex optimization problems in the following.
Consider a tenelement uniform linear microphone array with the interelement spacing 5 cm. Behind each microphone, a Farrow structure consisting of five FIR filters is used, where the tap length of the FIR filters is 20 unless otherwise stated, i.e., K=10, M=5, and N=20. The steering direction range of interest is [40Â°,140Â°], the normalized frequency range of interest is [0.25Ï€,0.875Ï€], and the sampling frequency f _{ s } is 8000 Hz. The passband width, denoted as Ï–, is set to 20Â°, and for a specific steering direction Ï• _{ d }, the two stopband regions are \(\Phi ^{\phi _{d}}_{sl}=[0\circ,\phi _{d}\varpi /220\circ ]\), and \(\Phi ^{\phi _{d}}_{sr}=[\phi _{d}+\varpi /2+20\circ,180\circ ]\), where two transition bands each with a width of 20Â° has been considered. The desired response is defined as P _{ d }(Ï• _{ d },f,Î¸)=1 in the passband and P _{ d }(Ï• _{ d },f,Î¸)=0 in the stopbands. Suppose that all the microphone gain errors a _{ k }(f,Î¸) have a uniform distribution in [âˆ’0.05,0.05], and that all the microphone phase errors Î³ _{ k }(f,Î¸) have a uniform distribution in [âˆ’Ï€/36,Ï€/36], i.e., corresponding to \(\mathbb {E}\left \{a_{k}\left (\,f,\theta \right)\right \}=0\), \(\mathbb {E}\left \{\gamma _{k}\left (\,f,\theta \right)\right \}=0\), Var{a _{ k }(f,Î¸)}=8.333Ã—10^{âˆ’4}, and Var{Î³ _{ k }(f,Î¸)}=2.5Ã—10^{âˆ’3}.
6.1 Example 1: robust design using WCP optimization
First, we consider the case of no microphone mismatches when using the WCP optimization, i.e., corresponding to the nonrobust design. Figure 2 shows the array response of the FSBB using the nonrobust design, where the steering direction, i.e., the direction of arrival of the sound source of interest, is set to 60Â°. As can be seen, the FSBB design based on the WCP optimization performs well when there are no microphone mismatches, since its mainlobe can be steered to the desired direction with a stopband level below âˆ’13.7 dB for the stopband region [0Â°,30Â°]âˆª[90Â°,180Â°]. For comparison, the array response of the wellknown leastsquares (LS) design based on the conventional filterandsum structure [3] is also shown in Fig. 3, with the same number of microphones as for the FSBB. Note that the LS design based on the filterandsum structure is nonsteerable; therefore, its passband has to cover the whole direction range of interest where sound source may be present, i.e., [40Â°,140Â°]. Consequently, the passband region is too wide, which will lead to poor spatial filtering performance. For instance, when the sound source of interest is impinging on the array from the angle 60Â°, the undesired interference and noise signals within the angular region (90Â°,140Â°) can not be reduced anymore by the nonsteerable beamformer.
Next, we consider the FSBB design using the WCP optimization in the presence of microphone mismatches. Figure 4 shows the corresponding array response of the FSBB steered to 60Â°, where the userdefined parameter Îµ is set to 1.74 according to (11). The simulation result is the average over 100 Monte Carlo trials, i.e., by using 100 random samples of microphone mismatches. As we have discussed above, although the WCP optimizationbased criterion has been successfully applied to the robust design of filterandsum beamformers, it has failed to work for the design of robust FSBB due to its overconservativeness. Therefore, the WCP optimizationbased criterion may not be suitable for the design of robust FSBB. To justify the overconservativeness of WCP optimization for FSBB design, the array response of the FSBB designed by the lessconservative WCP optimization with the userdefined parameter Îµ reduced to 0.02 is shown in Fig. 5. Compared with Fig. 4, it can be seen clearly that the beamformer performance can be improved significantly through reducing the effect of conservativeness of WCP optimization.
6.2 Example 2: robust design using WCMP optimization with the PRV constraint
In the following, we assume that the mean and variance values of microphone gain and phase errors are all not precisely known due to practical measurement errors. That is, the gain and phase errors are not zeromean and instead bounded by some small values, i.e., \(\mathbb {E}\left \{a_{k}\left (\,f,\theta \right)\right \}\leq 5\times 10^{6}\), \(\mathbb {E}\left \{\gamma _{k}\left (\,f,\theta \right)\right \}\leq 8.73\times 10^{6}\); the variance values are also bounded by Var{a _{ k }(f,Î¸)}â‰¤4.2Ã—10^{âˆ’3} and Var{Î³ _{ k }(f,Î¸)}â‰¤1.27Ã—10^{âˆ’2}, respectively, i.e., each is around five times more than the actual variance of microphone gain/phase errors. All the results are the average over 100 Monte Carlo trials with random samples of microphone mismatches.
Figure 6a and e shows the array response and the PRV of the robust FSBB based on the WCMP optimization with the PRV constraint, respectively, where Î²=20. While Fig. 6b and f shows the array response and the PRV of the robust FSBB based on the WCMP optimization without the PRV constraint, respectively, i.e. Î²=0. The steering direction is set to 60Â°. To see the results more clearly, the associated side views are also presented in Fig. 6c, d, g, and h. Compared with the WCP optimizationbased design, i.e., Fig. 4, the design approach using the WCMP optimization performs well in the presence of microphone mismatches. Moreover, by imposing the PRV constraint, the variance of passband array response can be effectively reduced, especially in the lowfrequency region. Note also that the PRV of the robust FSBB is nearly invariant with angle Î¸ as demonstrated in Fig. 6g and h, which is consistent with the theoretical finding in Remark 3.
Now, we study the performance of the robust FSBB using WCMP optimization with PRV constraint in the presence of larger microphone mismatch errors. Here, we assume that all microphone gain errors a _{ k }(f,Î¸) have a uniform distribution in [âˆ’0.1,0.1], and all microphone phase errors Î³ _{ k }(f,Î¸) have a uniform distribution in [âˆ’Ï€/18,Ï€/18]. Figure 7a and e shows the array response and the PRV of the robust FSBB based on the WCMP optimization with the PRV constraint, respectively, where Î²=20. While Fig. 7b and f plots the array response and the PRV of the robust FSBB based on the WCMP optimization without the PRV constraint, respectively. The steering direction is set to 60Â°. For ease of comparison, the associated side views are also presented in Fig. 7c, d, g, and h. From the simulation results, we can see that the robust FSBB still shows satisfactory performance even in the presence of larger microphone mismatch errors. Similar to the above case with smaller microphone mismatches, by imposing the PRV constraint, the variance of passband array response of the FSBB beamformer can also be reduced.
To show the effect of the PRV constraint on the performance of robust FSBB, we first introduce the passband fluctuation [18], which is defined as the ratio of maximum mean magnitude response to the minimum one in the passband. Passband fluctuation is an indicator of the deviation of the actual mean passband response obtained from the desired flattop one. Figure 8a, b, and c shows the passband fluctuation, the stopband level, and the average PRV of the robust FSBB with various PRV constraints in the presence of microphone gain errors [âˆ’0.05,0.05] and microphone phases errors [âˆ’Ï€/36,Ï€/36], where two cases are considered, i.e., Ï• _{ d }=60Â° and 90Â°. As can be seen from Fig. 8a and c, with more stringent PRV constraint, i.e., increasing the tradeoff parameter Î², the PRV of the FSBB tends to decrease, while keeping the passband fluctuation at a lower level. However, this is at the cost of sacrificing the stopband level as shown in Fig. 8b. Therefore, a tradeoff between the performance of passband and that of the stopband should be considered during design of robust FSBB.
As analyzed above, the PRV of the FSBB is dependent on steering direction. Now, we study the effect of the steering direction on the PRV of the FSBB. Figure 9a and b shows the average PRV of the FSBB versus steering direction Ï• _{ d } with Î²=0 and Î²=20, respectively. Herein, four FSBBs with different number of microphones K and different FIR tap length N have been considered, i.e., K=7, N=20; K=7, N=30; K=10, N=20; and K=10, N=30. As expected, it can be seen from Fig. 9 that the PRV of the FSBB is varying with steering direction. Interestingly, the average PRV tends to increase with the steering direction deviating from the array broadside.
6.3 Example 3: robust design with sparse tap weights
Now, we study the performance of the robust FSBB design with sparse tap weights by using Algorithm III. The user parameters are set as: the tradeoff parameter Î¼=5Ã—10^{âˆ’7}, the threshold parameter Î¾ _{ T }=10^{âˆ’6}, the parameter for numerical stability Îµ _{ T }=10^{âˆ’6}, and the maximum number of iterations L=4. The remaining user parameters are set same as in Example 2. All the results are the average over 100 Monte Carlo trials with random samples of microphone mismatches.
First, we demonstrate the complexityreducing impact of the sparsity constraint on the robust FSBB. Figure 10 shows the performance comparison of the sparse FSBB and its nonsparse counterpart with N=30, where the steering direction is Ï• _{ d }=60Â°, and there is no PRV constraint (i.e., Î²=0). Herein, the nonsparse FSBB refers to the FSBB designed by Algorithm II, which has a full active tap weights, i.e., no zerovalued tap weights. For the sparse FSBB, the number of the active weights is reduced to 738, i.e., over 50 % tap weights of the nonsparse FSBB are nullified. The array response of the sparse and nonsparse FSBBs is shown in Fig. 10a and b, while the PRV of the sparse and nonsparse FSBBs is shown in Fig. 10e and f. In order to see the results more clearly, the corresponding side views are also presented in Fig. 10c and d and g and h, respectively. As can be seen from Fig. 10c and d, although over one half of tap weights are nullified, the beampattern of the resultant sparse FSBB has nearly unaffected compared with the beampattern of its nonsparse counterpart. Moreover, the variance of passband array response of the sparse FSBB has only varied slightly compared with that of its nonsparse counterpart, as shown in Fig. 10g and h. Therefore, it has justified our statement that there are redundancy in the tap weights of an FSBB, and a lowercomplexity FSBB can be designed via imposing the sparsity constraint without producing a significant degradation of performance.
Next, we show another advantage of the sparse FSBB over the nonsparse FSBB with similar computational complexity. Figure 11 shows the performance comparison of sparse and nonsparse FSBBs with a comparable amount of active tap weights, i.e., with a similar computational complexity, where the steering direction is Ï• _{ d }=60Â° and there is no PRV constraint. For the sparse FSBB, the number of the active weights is 738, with N=30. For the nonsparse FSBB, the number of the active weights is 750, with N=15. Note here that, for the purpose of ensuring a fair comparison, the active weights of the sparse FSBB is chosen slightly less than that of the nonsparse FSBB. The array response of the sparse and nonsparse FSBBs is shown in Fig. 11a and b, while the PRV of the sparse and nonsparse FSBBs is shown in Fig. 11e and f. To see the results more clearly, the corresponding side views are also plotted in Fig. 11c and d and g and h, respectively. For the spare FSBB, the stopband level and passband fluctuation are âˆ’7.988 and 2.043 dB, respectively, with the average PRV 0.005. For the nonsparse FSBB, the stopband level and passband fluctuation are âˆ’7.802 dB and 2.408 dB, respectively, with the average PRV 0.011. Comparatively, the sparse FSBB is superior to the nonsparse FSBB with a similar computational complexity.
Finally, we consider the effect of the PRV constraint on the robust FSBB with sparse tap weights. Figure 12 shows the performance of the sparse FSBB under various PRV constraints. For comparison, the performance of nonsparse FSBB with a comparable number of active weights is also shown in Fig. 12. Here, the simulation settings are same as in Fig. 11. Moreover, the case with the steering direction Ï• _{ d }=90Â° is also considered. As can be seen from Fig. 12, the sparse FSBB outperforms its nonsparse counterpart under various PRV constraints. Similar to the case of nonsparse FSBB shown in Fig. 8, the PRV of the sparse FSBB will decrease with a more stringent PRV constraint, and this is also at the cost of the stopband level increasing as shown in Fig. 11b.
7 Conclusions
In this paper, the study of robust FSBB design with sparse tap weights via convex optimization has been conducted by incorporating some priori knowledge of microphone mismatches. It has been shown that due to the overconservativeness of the WCP optimization criterion, it may become unapplicable to the robust FSBB design, though it has been successfully applied in the robust filterandsum beamformer design. When the limited knowledge of mean and variance of microphone mismatches is available, the robust FSBB design approach based on the WCMP optimization with the PRV constraint has been presented. Compared with the WCP optimizationbased design, it performs well in the presence of microphone mismatches; moreover, it has the capability of passband stability control of array response. Some insights into the PRV properties of FSBB are also revealed to better understand the robustness characteristic of FSBB. It was also shown in the paper that there exists redundancy in the tap weights of the robust FSBB. To further reduce the computational complexity of the robust FSBB, a twostage design approach based on the reweighted l _{1}norm constraint optimization has been proposed to sparsify the tap weights of robust FSBB. Several design examples have been presented to illustrate the performance of the presented approaches.
8 Endnotes
^{1} The passband is also known as the mainlobe of a beamformer.
^{2} The stopband is also known as the sidelobe of a beamformer.
Abbreviations
 FSBB:

Farrowstructurebased steerable broadband beamformer
 FIR:

finite impulse response filter
 WCP:

worstcase performance
 WCMP:

worstcase mean performance
 WNG:

white noise gain
 PRV:

passband response variance
 PDF:

probability density function
 SOCP:

secondorder cone programming
References
M Brandstein, D Ward (eds.), Microphone arrays: signal processing techniques and applications (Springer, Berlin, 2001).
BD Van Veen, KM Buckley, Beamforming: a versatile approach to spatial filtering. IEEE ASSP Mag. 5(2), 4â€“24 (1988).
S Doclo, M Moonen, Design of farfield and nearfield broadband beamformers using eigenfilters. Signal Process. 83(12), 2641â€“2673 (2003).
S Sriram, P Ashish, J Kees, Beamforming under quantization errors in wireless binaural hearing aids. EURASIP J. Audio Speech Music Process. 2008, 824797 (2008).
M Pirro, S Squartini, L Romoli, F Piazza, Stereophonic handsfree communication system based on microphone array fixed beamforming: realtime implementation and evaluation. EURASIP J. Audio Speech Music Process. 2012, 26 (2012).
YX Zou, P Wang, YQ Wang, CH Ritz, J Xi, Speech enhancement with an acoustic vector sensor: an effective adaptive beamforming and postfiltering approach. EURASIP J. Audio Speech Music Process. 2014, 17 (2014).
RMM Derkx, K Janse, Theoretical analysis of a firstorder azimuthsteerable superdirective microphone array. IEEE Trans. Audio Speech Lang. Process. 17(1), 150â€“162 (2009).
CC Lai, S Nordholm, YH Leung, Design of steerable spherical broadband beamformers with flexible sensor configurations. IEEE Trans. Audio Speech Lang. Process. 21(2), 427â€“438 (2013).
CC Lai, S Nordholm, YH Leung, in Proceedings of International Workshop on Acoustic Echo and Noise Control. Design of robust steerable broadband beamformers with spiral arrays and the farrow filter structure (Tel Aviv, 2010).
M Kajala, M Hamalainen, Filterandsum beamformer with adjustable filter characteristics. Acoustics, Speech, and Signal Processing 2001. Proc. IEEE Int. Conf. Acoust. Speech, Signal Process. 5, 2917â€“2920 (2001).
CC Lai, S Nordholm, YH Leung, in Proceedings IEEE International Conference on Acoustics, Speech, and Signal Processing. Design of robust steerable broadband beamformers incorporating microphone gain and phase error characteristics (Prague, 2011), pp. 101â€“104.
E Mabande, W Kellermann, in Proceedings of International Workshop on Acoustic Echo and Noise Control. Design of robust polynomial beamformers as a convex optimization problem (Tel Aviv, 2010).
E Mabande, M Buerger, W Kellermann, in Proceedings IEEE International Conference on Acoustics, Speech, and Signal Processing. Design of robust polynomial beamformers for symmetric arrays (Kyoto, 2012), pp. 1â€“4.
H Wang, H Chen, Y Bao, et al, in Proceedings of IEEE International Conference on Signal Processing, Communication and Computing. Design of steerable and frequency invariant beamformers robust against microphone mismatches (Kunming, 2013), pp. 1â€“6.
S Doclo, M Moonen, Design of broadband beamformers robust against gain and phase errors in the microphone array characteristics. IEEE Trans. Signal Process. 51(10), 2511â€“2526 (2003).
H Chen, W Ser, ZL Yu, Optimal design of nearfield wideband beamformers robust against errors in microphone array characteristics. IEEE Trans. Circuits Syst. I: Regular Papers. 54(9), 1950â€“1959 (2007).
RC Nongpiur, Design of minimax broadband beamformers that are robust to microphone gain, phase, and position errors. IEEE/ACM Trans. Audio Speech Lang. Process. 22(6), 1013â€“1022 (2014).
H Chen, W Ser, J Zhou, Robust nearfield wideband beamformer design using worst case mean performance optimization with passband response variance constraint. Audio Speech Lang. Process. IEEE Trans. 20(5), 1565â€“1572 (2012).
S Boyd, L Vandenberghe, Convex optimization (Cambridge university press, New York, 2009).
JF Sturm, Using SeDuMi 1.02, a MATLAB toolbox for optimization over symmetric cones. Optimization Methods Softw. 11(1â€“4), 625â€“653 (1999).
EJ Candes, MB Wakin, SP Boyd, Enhancing sparsity by reweighted l _{1} minimization. J. Fourier Anal. Appl. 14(5â€“6), 877â€“905 (2008).
M Grant, S Boyd, Y Ye, CVX: Matlab software for disciplined convex programming (2008). http://cvxr.com/cvx/. Accessed 28 May 2015.
Acknowledgements
This work was supported by the Fundamental Research Funds for the Central Universities under Grant NS2014041.
Author information
Authors and Affiliations
Corresponding author
Additional information
Competing interests
The authors declare that they have no competing interests.
Rights and permissions
Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (https://creativecommons.org/licenses/by/4.0), which permits use, duplication, adaptation, distribution, and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.
About this article
Cite this article
Wang, T., Chen, H. Robust design of Farrowstructurebased steerable broadband beamformers with sparse tap weights via convex optimization. J AUDIO SPEECH MUSIC PROC. 2015, 14 (2015). https://doi.org/10.1186/s136360150060y
Received:
Accepted:
Published:
DOI: https://doi.org/10.1186/s136360150060y