16
Ali et al. EURASIP Journal on Advances in Signal Processing 2014, 2014:160 http://asp.eurasipjournals.com/content/2014/1/160 RESEARCH Open Access Multi-antenna spectrum sensing by exploiting spatio-temporal correlation Sadiq Ali 1,2* , David Ramírez 3 , Magnus Jansson 4 , Gonzalo Seco-Granados 1 and José A López-Salcedo 1 Abstract In this paper, we propose a novel mechanism for spectrum sensing that leads us to exploit the spatio-temporal correlation present in the received signal at a multi-antenna receiver. For the proposed mechanism, we formulate the spectrum sensing scheme by adopting the generalized likelihood ratio test (GLRT). However, the GLRT degenerates in the case of limited sample support. To circumvent this problem, several extensions are proposed that bring robustness to the GLRT in the case of high dimensionality and small sample size. In order to achieve these sample-efficient detection schemes, we modify the GLRT-based detector by exploiting the covariance structure and factoring the large spatio-temporal covariance matrix into spatial and temporal covariance matrices. The performance of the proposed detectors is evaluated by means of numerical simulations, showing important advantages over existing detectors. Keywords: Cognitive radio; Generalized likelihood ratio test (GLRT); Multi-antenna; Spatio-temporal correlation 1 Introduction For a cognitive radio system, which opportunistically accesses the wireless channel, spectrum sensing becomes a crucial task for detecting the presence of primary user transmissions [1]. Recently, sensors with multi- ple antennas have become an integral part of many cognitive receivers [2,3], thus giving us the chance to consider multi-antenna techniques to improve the per- formance of spectrum sensing. Multiple antennas can offer spatial diversity and improve the spectrum sens- ing performance [4,5]. Intuitively, the presence of any primary signal should result in some spatial correla- tion in the observations received at the multi-antenna receivers [5] and thus, exploiting this correlation improves the primary user detection performance. In addition to being spatially correlated, the received signal sam- ples are usually correlated (wide-sense stationary) in time due to presence of a temporal dispersive chan- nel, oversampling of the received signals or just because the originally transmitted signals are correlated in time *Correspondence: [email protected] 1 SPCOMNAV, Universitat Autònoma de Barcelona, 08193 Bellaterra, Barcelona, Spain 2 Electrical Engineering, University of Engineering and Technology, Peshawar, 25000 Peshawar, Pakistan Full list of author information is available at the end of the article [5,6]. This spatio-temporal correlation is a feature that can be used for detection purposes, since the remaining (i.e. undesired) noise processes at different antennas can be safely assumed statistically independent, both in time and space. Spectrum sensing methods that only exploit the spa- tial structure of the received signal covariance matrix have been of great interest in the recent years [5]. The majority of these schemes are based on multivariate sta- tistical inference theory [7,8], and interested readers can find comprehensive details in ([8], Ch. 9-10), which dis- cusses multivariate detectors for testing the independence of random observations with the help of the generalized likelihood ratio test (GLRT). These GLRT-based detectors typically end up in a simple quotient between the determi- nant of the sample covariance matrix and the determinant of its diagonal version, and these tests have been widely applied to the detection of signals especially in the context of cognitive radios [2,9]. Through careful study of various existing spectrum sensing techniques, one can conclude that the signal’s temporal correlation is not fully exploited in most of these techniques. In fact, in most of them, temporal correlation is ignored or considered as a deleterious effect. In the very few works that exploit the temporal correlation, they usually assume some prior knowledge about it [10-14]. © 2014 Ali et al.; licensee Springer. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited.

RESEARCH OpenAccess Multi ... › assets › papers › R13_EJASP_GLRT_ST.pdf · cognitive receivers [2,3], thus giving us the chance to consider multi-antenna techniques to improve

  • Upload
    others

  • View
    4

  • Download
    0

Embed Size (px)

Citation preview

Page 1: RESEARCH OpenAccess Multi ... › assets › papers › R13_EJASP_GLRT_ST.pdf · cognitive receivers [2,3], thus giving us the chance to consider multi-antenna techniques to improve

Ali et al. EURASIP Journal on Advances in Signal Processing 2014, 2014:160http://asp.eurasipjournals.com/content/2014/1/160

RESEARCH Open Access

Multi-antenna spectrum sensing by exploitingspatio-temporal correlationSadiq Ali1,2*, David Ramírez3, Magnus Jansson4, Gonzalo Seco-Granados1 and José A López-Salcedo1

Abstract

In this paper, we propose a novel mechanism for spectrum sensing that leads us to exploit the spatio-temporalcorrelation present in the received signal at a multi-antenna receiver. For the proposed mechanism, we formulatethe spectrum sensing scheme by adopting the generalized likelihood ratio test (GLRT). However, the GLRTdegenerates in the case of limited sample support. To circumvent this problem, several extensions are proposedthat bring robustness to the GLRT in the case of high dimensionality and small sample size. In order to achieve thesesample-efficient detection schemes, we modify the GLRT-based detector by exploiting the covariance structure andfactoring the large spatio-temporal covariance matrix into spatial and temporal covariance matrices. The performanceof the proposed detectors is evaluated by means of numerical simulations, showing important advantages overexisting detectors.

Keywords: Cognitive radio; Generalized likelihood ratio test (GLRT); Multi-antenna; Spatio-temporal correlation

1 IntroductionFor a cognitive radio system, which opportunisticallyaccesses the wireless channel, spectrum sensing becomesa crucial task for detecting the presence of primaryuser transmissions [1]. Recently, sensors with multi-ple antennas have become an integral part of manycognitive receivers [2,3], thus giving us the chance toconsider multi-antenna techniques to improve the per-formance of spectrum sensing. Multiple antennas canoffer spatial diversity and improve the spectrum sens-ing performance [4,5]. Intuitively, the presence of anyprimary signal should result in some spatial correla-tion in the observations received at the multi-antennareceivers [5] and thus, exploiting this correlation improvesthe primary user detection performance. In additionto being spatially correlated, the received signal sam-ples are usually correlated (wide-sense stationary) intime due to presence of a temporal dispersive chan-nel, oversampling of the received signals or just becausethe originally transmitted signals are correlated in time

*Correspondence: [email protected], Universitat Autònoma de Barcelona, 08193 Bellaterra, Barcelona,Spain2Electrical Engineering, University of Engineering and Technology, Peshawar,25000 Peshawar, PakistanFull list of author information is available at the end of the article

[5,6]. This spatio-temporal correlation is a feature thatcan be used for detection purposes, since the remaining(i.e. undesired) noise processes at different antennas canbe safely assumed statistically independent, both in timeand space.

Spectrum sensing methods that only exploit the spa-tial structure of the received signal covariance matrixhave been of great interest in the recent years [5]. Themajority of these schemes are based on multivariate sta-tistical inference theory [7,8], and interested readers canfind comprehensive details in ([8], Ch. 9-10), which dis-cusses multivariate detectors for testing the independenceof random observations with the help of the generalizedlikelihood ratio test (GLRT). These GLRT-based detectorstypically end up in a simple quotient between the determi-nant of the sample covariance matrix and the determinantof its diagonal version, and these tests have been widelyapplied to the detection of signals especially in the contextof cognitive radios [2,9].

Through careful study of various existing spectrumsensing techniques, one can conclude that the signal’stemporal correlation is not fully exploited in most of thesetechniques. In fact, in most of them, temporal correlationis ignored or considered as a deleterious effect. In thevery few works that exploit the temporal correlation, theyusually assume some prior knowledge about it [10-14].

© 2014 Ali et al.; licensee Springer. This is an Open Access article distributed under the terms of the Creative CommonsAttribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproductionin any medium, provided the original work is properly credited.

Page 2: RESEARCH OpenAccess Multi ... › assets › papers › R13_EJASP_GLRT_ST.pdf · cognitive receivers [2,3], thus giving us the chance to consider multi-antenna techniques to improve

Ali et al. EURASIP Journal on Advances in Signal Processing 2014, 2014:160 Page 2 of 16http://asp.eurasipjournals.com/content/2014/1/160

One of the reasons for ignoring or only partially exploit-ing temporal correlation is that it often makes it difficultto achieve tractable solutions. However, the exploitationof temporal correlation jointly with spatial correlation canprovide us extra side information to enhance the detec-tion performance. Hence, it will be interesting to findways to devise detection mechanisms that exploit tempo-ral correlation with tractable solutions. Taking this intoaccount, the main focus of this work is to develop a multi-antenna detector that robustly exploits spatio-temporalcorrelation. In order to do so, we propose a signalmodel that leads us to tractable detection schemes whileexploiting the temporal correlation jointly with spatialcorrelation.

In order to achieve the proposed goal, we start with ourearlier work in [15], where we derived the GLRT. Thisdetector basically tests whether the covariance matrixis block diagonal or not. To make the discussion andnotation simple, for our case, we call this GLRT schemethe spatio-temporal GLRT (ST-GLRT). Compared to thetraditional spatial covariance-based GLRT, the ST-GLRTprovides some improved performance. The reason for thisis that the ST-GLRT scheme exploits temporal correla-tion as an additional feature on top of spatial correlationand energy. However, since any GLRT involves the estima-tion of unknown parameters (e.g., covariance matrix), itsperformance depends on the sample size and the dimen-sionality of the signal model. In practice, the GLRT isused based on the assumption that the sample size is largecompared to the model dimension. When this is not thecase, the performance of the GLRT degenerates becausethe sample covariance matrix becomes singular, and thewhole problem becomes ill-conditioned [16,17]. In thecase of the ST-GLRT, we have to deal with both the spa-tial and temporal dimensions, and hence, the overall datadimension is even larger. Thus, the ST-GLRT has somefurther limitations when the detection process requires aquick decision, as it is in the case of the detection of pri-mary signals in cognitive radio. Hence, although for thelarge sample support, the ST-GLRT can certainly achievean improved detection performance; for small samplesupport, it has some limitations that deserve a detailedstudy.

In order to reduce the demand for large sample sup-port and bring robustness to the ST-GLRT, one mayassume existence of some underlying structure based onthe spatial and temporal components of the covariancematrix. In our earlier work [15,18], by assuming widesense stationary (WSS), we exploited the Toeplitz struc-ture of the covariance matrix. Doing so, we proposed anapproximated GLRT in the frequency domain that leadsto robustness against the small sample support. Contraryto that work, in the present one, we rather focus onexploiting the covariance structures without the GLRT

approximation in the frequency domain. In particular, weapproximate the block-Toeplitz structure of a multivari-ate WSS process as persymmetric. Doing so, we will takeadvantage of the result in [19] which states that by exploit-ing the persymmetric structure, the number of indepen-dent vector measurements required for the covariancematrix estimator can be decreased by up to a factor of two.This will certainly bring down the demand for high sam-ple support required for the ST-GLRT to not degenerateand provide robustness against the repercussions of smallsample support and large dimensional data.

Moving one step forward, we also approximate theblock-Toeplitz structure of the spatio-temporal correlatedmulti-antenna measurements with a Kronecker productstructure [20,21]. Recently, exploitation of the Kroneckerstructure in covariance matrices has received a lot ofinterest in statistics [22,23]. Moreover, the maximum like-lihood (ML) method for estimating the covariance matrixbased on the Kronecker product has been previouslydiscussed in [22,23]. Similarly, in the cases where thecorrelation structure is not separable through the Kro-necker product, [24] discusses some insightful detailsabout the nearest Kronecker product approximations. Inthis paper, we use the Kronecker structure to reformu-late the ST-GLRT by taking advantage of the inherentspatio-temporal structure of the received observations. Inorder to do so, we adopt and extend our earlier work[25] by using the Kronecker product to efficiently exploitthe space-time correlation in a multi-antenna spectrumsensing scheme. In addition to the Kronecker product-based factorization, we also exploit the fact that the fac-tored matrices could have additional persymmetric struc-ture [21]. Therefore, by exploiting the Kronecker productstructure jointly with the persymmetric structure, the per-formance of the proposed detection scheme can furtherbe improved in terms of the required number of sam-ple to estimate the covariance matrix (i.e. the detectorefficiency).

To compare the proposed methods with traditionaltechniques, numerical simulations are conducted. Theseresults illustrate that the proposed detection schemesindeed outperform the traditional approaches, especiallyin the case of small sample support.

The remainder of the paper is organized as follows.Section 2 introduces the proposed methodology and thesignal model. In Section 3, we solve the problem by usingthe traditional GLRT formulations. The proposed detec-tion schemes are derived in Sections 4 and 5. Numericalresults are provided in Section 6. The conclusion is finallydrawn in Section 7.

2 Proposed methodology and signal modelHerein, we address the problem of detecting the pres-ence of a primary user by a single cognitive radio that

Page 3: RESEARCH OpenAccess Multi ... › assets › papers › R13_EJASP_GLRT_ST.pdf · cognitive receivers [2,3], thus giving us the chance to consider multi-antenna techniques to improve

Ali et al. EURASIP Journal on Advances in Signal Processing 2014, 2014:160 Page 3 of 16http://asp.eurasipjournals.com/content/2014/1/160

is equipped with L antennas as shown in Figure 1. Weassume no prior knowledge about the primary transmis-sion or the noise processes except that the noise is spatio-temporally independent. We focus on a practical scenariowhere due to the presence of a primary user (PU) signal,the received signals at the L antennas are correlated inspace as well as time. The spatial correlation is due to theproximity between the receiving antennas and the tempo-ral correlation is due to the presence of a temporal dis-persive channel, oversampling of the received signal or thetime correlation of the transmitted signal [6]. In this work,we consider the temporal correlation to be WSS. Hence,consecutive received samples of the (L × 1) vector x (n),n = 1, . . . , NT at L antennas of a user are temporally cor-related, where x (n) � [x1 (n) , x2 (n) , . . . , xL (n)]T and NTis the total number of received vector samples. Exploitingthe temporal correlation, jointly with spatial correlation,can provide us extra information to enhance the detectionperformance. In order to devise detection mechanismsthat exploit temporal correlation with tractable solutions,the proposed technique can be described as follows:

1. We split the received block of NT vectors x (n) intoM sub-blocks where each block contains N samplesof vector x (n), as shown in Figure 2.

2. We assume that the consecutive vector-valuedsamples within each sub-block are temporallycorrelated with N × N temporal correlationmatrix Ct .

3. Independence is assumed between consecutivesub-blocks.

It is clear that the proposed approach does not exploitthe correlation among different sub-blocks. However, asit will become clear in the following sections, exploit-ing such correlation would result in an ill-posed prob-lem. This would make it intractable to derive the MLestimators of the covariance matrices, required for thegeneralized likelihood ratio test (GLRT). The proposed

Figure 1 System model for a single secondary user equippedwith multiple antennas and multi-antenna primary user.

Figure 2 Schematic representation of the proposed methodologyfor slicing the observation block into M sub-blocks.

mechanism could also be motivated from the concept ofcorrelated block-fading channel, which presents correla-tion in each N-samples block, but independence betweenconsecutive blocks [6,26].

In order to proceed, we need the distribution of{x (n)}NT

n=1, and we assume it to be zero-mean complexGaussiana. This assumption is particularly reasonable ifthe primary network employs orthogonal frequency divi-sion multiplexing (OFDM) as modulation format [15,18].Similarly, we also assume that the SU collects NT consec-utive samples of vector x (n). Based on the proposed threestep approach discussed above, the received NT vectorsamples are divided into M blocks and each block consistsof N vector samples, such that MN = NT . The m-th blockis defined as:

X (m)

=

⎡⎢⎢⎣

x1(1+(m−1)N) x1(2+(m−1)N) · · · x1(N +(m−1)N)x2(1+(m−1)N) x2(2+(m−1)N) · · · x2(N +(m−1)N)

......

. . ....

xL(1+(m−1)N) xL(2+(m−1)N) · · · xL(N +(m−1)N)

⎤⎥⎥⎦

=

⎡⎢⎢⎢⎣

xT1 (m)

xT2 (m)

...

xTL (m)

⎤⎥⎥⎥⎦ ,

(1)

where the i-th row, xTi (m) for m = 1, . . . , M, contains N-

samples at the i-th antenna. Let us define a vector z (m) �vec

(XT (m)

). The covariance matrix of the LN × 1 vector

z (m) under hypothesis H1 is

� = E[

zzT]

=

⎡⎢⎢⎢⎣

�11 �12 · · · �1L�21 �22 · · · �2L...

.... . .

...

�L1 �L2 · · · �LL

⎤⎥⎥⎥⎦ ∈ C

LN×LN ,

(2)

Page 4: RESEARCH OpenAccess Multi ... › assets › papers › R13_EJASP_GLRT_ST.pdf · cognitive receivers [2,3], thus giving us the chance to consider multi-antenna techniques to improve

Ali et al. EURASIP Journal on Advances in Signal Processing 2014, 2014:160 Page 4 of 16http://asp.eurasipjournals.com/content/2014/1/160

where the sub-block covariance matrices �il = E[xixT

l] ∈

CN×N , 1 ≤ i, l ≤ L in � capture all space-time second-

order information about the random vector {xi}Li=1. Thus,

the hypothesis testing problem becomes

H0 :z ∼ CN (0, �0) ,H1 :z ∼ CN (0, �) .

(3)

Now, considering that the noise powers at the L anten-nas of the user are different and spatially and temporallyuncorrelated noise, the covariance matrix under noise-only hypothesis is �0 = �A,0 ⊗ IN , where

�A,0 =

⎡⎢⎢⎢⎢⎣

σ 21 0 · · · 0

0 σ 22

......

. . . 00 · · · 0 σ 2

L

⎤⎥⎥⎥⎥⎦ , (4)

and σ 2i , i = 1, 2, . . . , L is noise power at i-th antenna of the

array.

3 GLRT based on spatio-temporal correlationIn this section, by adopting the GLRT formulation, wereview a spectrum sensing scheme that exploits both thespatial and temporal correlation without exploiting anya-priori structure. This will be done just to provide areference benchmark to compare the proposed detectorsin Sections 4.2 and 5. Since, the parameters {�0, �} areunknown, we need to adopt the GLRT approach and thetest statistic of the ST-GLRT can be formulated as:

�ST (Z) =max�0

fz (Z; �0)

max�

fz (Z; �)≷H0

H1γ , (5)

where fz (Z, �0) and fz (Z; �) are the likelihood functionsunder hypothesis H0 and H1, respectively. We assumethat we have M independent blocks of the data X, orequivalently vector z, Z �

[z (1) z (2) , · · · , z (M)

]available. To solve the GLRT (5), we have to derive theML estimators of the parameters for each of the hypothe-ses. Note that under weak conditions, the ML estimatoris asymptotically an unbiased and efficient estimator [27].The expression for the likelihood function fz (Z; �) underhypothesis H1 can then be written as:

fz (Z; �) = 1(π)MLN |�|M exp

{−Mtr

(�−1�

)}. (6)

Taking into account that � has no further structurebeyond being positive semi-definite, it is easy to prove [7]that its ML estimate is given by the sample covariance

matrix i.e. � = 1M∑M

m=1 z(m)zH(m). Under the alter-native hypothesis, the expression for fz (Z, �0), ignoringconstant factors, is

fz (Z, �0) = fz(Z; �A,0 ⊗ IN

)= ∣∣�A,0 ⊗ IN

∣∣−M

× exp[−

M∑m=1

z(m)(�−1

A,0 ⊗ IN)

zH(m)

].

(7)

Now, taking into account the Kronecker structure of thecovariance matrix, (7) may be rewritten as [25]:

fz (Z, �0) = fx(XT; �A,0, IN

)= ∣∣�A,0

∣∣−MN |IN |−ML

× exp[−

M∑m=1

tr{�−1

A,0X (m) XH (m)}]

,

(8)

where XT �[

X (1) , X (2) , · · · , X (M)], as in Figure 2,

and the ML estimate of the covariance matrix �A,0

becomes �A,0 = diag(

1MN

∑Mm=1 X (m) XH (m)

)[7] and

thus �0 = �A,0 ⊗ IN . To solve (5), we have to replace�0 and � by �0 and �, respectively. Therefore, the finalexpression of the ST-GLRT becomes:

�ST (Z) =∣∣∣�∣∣∣∣∣∣�0

∣∣∣ ≷H0H1

γ (9)

Note that the detection scheme in (9) assumes no struc-ture for the covariance matrix, except that the covariancematrix is Hermitian and non-singular.

Before concluding the discussion in this section, as a ref-erence, we present the GLRT that only exploits the spatialcorrelation, ignoring the temporal correlation. Assumingthat the vector samples x (n) , n = 1, . . . , NT are tempo-rally uncorrelated, the GLRT scheme can be formulatedas:

�Tr (XT) =max�A,0

fx(XT; �A,0

)max�A,1

fx(XT; �A,1

) ≷H0H1

γ , (10)

where fx(XT; �A,0

)and fx

(XT; �A,1

)are the likelihood

functions, and �A,0 and �A,1 represent the covari-ance matrices under hypothesis H0 and H1, respectively.

Page 5: RESEARCH OpenAccess Multi ... › assets › papers › R13_EJASP_GLRT_ST.pdf · cognitive receivers [2,3], thus giving us the chance to consider multi-antenna techniques to improve

Ali et al. EURASIP Journal on Advances in Signal Processing 2014, 2014:160 Page 5 of 16http://asp.eurasipjournals.com/content/2014/1/160

Solving (10), the final expression of the detector that doesnot exploit the temporal structure is

�Tr (XT) =∣∣�A,1

∣∣∣∣�A,0∣∣ ≷H0

H1γ , (11)

where �A,1 = 1NT

∑NTn=1 x(n)xH(n) is a sample covariance

matrix. Under the hypothesis H0, as we have previouslyshown, the covariance matrix may be estimated as: �A,0 =diag

(�A,1

)[7]. The detector (11) only exploits the energy

and the spatial correlation across the L antennas of thereceiver and it wrongly assumes independence in timethe information provided by the temporal correlation.Compared to (11), the ST-GLRT (9) provides improveddetection performance since it uses temporal correlationas an additional source of information. However, in thecase when M < NL, the ST-GLRT may completely col-lapse due to the ill-conditioned sample covariance matrix[17]. In order to circumvent this limitation, in the fol-lowing sections, we propose some modifications in theST-GLRT by exploiting the presence of some inherentstructures in the space-time correlation.

4 Exploiting persymmetric structureIn order to solve the detection problem (3) with unknowncovariance matrices, a critical requirement for the detec-tors based on the GLRT is that the sample covariancematrices must be non-singular [23]. To this end, we haveto make sure that the number of available observationsgiven by M, is not smaller than LN (i.e. M ≥ LN).However, in quick spectrum sensing, a number of sam-ples greater than LN is a requirement difficult to fulfill inpractice [28]. Hence, the motivation of the remaining dis-cussion is to bring robustness against this small samplesupport. Note that in (9) we assume no prior knowl-edge about the spatio-temporal structure of the covari-ance matrix except that it is positive definite. One way toachieve the robustness against the small sample support isto look for possible a-priori known patterns/structures inthe large spatio-temporal covariance matrix.

4.1 Persymmetric structureIn this section, we use the fact that the spatio-temporalcovariance matrix � has a persymmetric structure. From[15,18], we know that the multivariate WSS time serieshas a block-Toeplitz covariance matrix. We remark herethat Toeplitz structured matrices belong to a subclassof the persymmetric matrices [19,21,29]. Furthermore,considering structured antenna array configurations (i.e.,uniform linear arrays) at the SU, the spatio-temporalcovariance matrix � can be modelled as persymmetric-block-Toeplitz, as shown in [30]. Therefore, we proposeto approximate the spatio-temporal covariance matrix �,

which is block-Toeplitz, as a persymmetric matrix. Obvi-ously, this approximation results in a suboptimal detectionscheme, but it is well known that it does not exist inclosed-form ML estimators for Toeplitz matrices. More-over, it provides better detection performance than theunconstrained ML estimate, particularly, in the case ofsmall sample support. Persymmetric covariance matricesfulfill [31]

� = JLN�T JLN , (12)

where the LN × LN counter-identity matrix JLN is

JLN =

⎡⎢⎢⎢⎢⎣

0 · · · 0 1... · · · 1 0

0 . ..

0...

1 0 · · · 0

⎤⎥⎥⎥⎥⎦ , (13)

also known as exchange matrix [19]. Based on these ideas,in Section 4.2, we present a modified GLRT that exploitsthe persymmetric property of the block-Toeplitz covari-ance matrix.

4.2 Persymmetric GLRT (P-GLRT)The difference in the formulation of the GLRT thatexploits the persymmetry comes due to the constraint(12). Therefore, the formulation of the GLRT based on thepersymmetric covariance matrix can be represented as:

�PS (Z) =max�0

fz (Z; �0)

max�

fz (Z; �)≷H0

H1γ .

s.t. � = JLN�T JLN

(14)

Comparing (14) to (5), we can see that the differ-ence lies only in the denominator. In order to exploitthe persymmetry of �, we need to use the forward-backward (FB) log likelihood [32], which is a combinationof the forward-looking and the backward-looking log-likelihood functions. The forward-looking log-likelihoodlog fz (Z; �) can be written as:

log fz (Z; �) = − log |�| − tr{�−1�

}, (15)

where we have ignored constant terms that do not dependon data. Similarly, using the constraint � = JLN�T JLN ,the backward-looking log-likelihood log f (B)

z (Z; �) can bewritten as [32]:

log f (B)z (Z; �) = − log |�|−tr

{�−1JLN �

TJLN

}. (16)

Adding (15) and (16), it gives us the forward-backwardlog likelihood as:

12

log f (FB)z (Z; �) = − log |�| − tr

{�−1�PS

}, (17)

Page 6: RESEARCH OpenAccess Multi ... › assets › papers › R13_EJASP_GLRT_ST.pdf · cognitive receivers [2,3], thus giving us the chance to consider multi-antenna techniques to improve

Ali et al. EURASIP Journal on Advances in Signal Processing 2014, 2014:160 Page 6 of 16http://asp.eurasipjournals.com/content/2014/1/160

where

�PS = 12

(� + JLN �

TJLN

). (18)

The covariance matrix estimator (18) is called forward-backward sample covariance matrix, which is the MLestimator of the persymmetric covariance matrix [32]. Anexact theory indicating the performance of the estima-tor as a function of number of independent vector z(m),m = 1, 2, . . . , M, is not available. However, a qualitativediscussion reported in [19,29] shows that the requirednumber of samples decreases by approximately a factorof two. Similarly, it is reported in [33] that �PS has con-sistently lower variance than the variance of �. Finally,solving (14) and using (18), the GLRT becomes

�PS (Z) =∣∣∣�PS

∣∣∣∣∣∣�0

∣∣∣ ≷H0H1

γ . (19)

where �0 = �A,0 ⊗ IN as before. Compared to the detec-tion scheme in (9), the new one in (19) offers improvedperformance at small sample support, as the numberof independent vector measurements required for thecovariance matrix estimator decreases by up to a factorof two [19]. Motivated by these facts, in Section 5, we goone step further and exploit the properties of the Kro-necker product to decompose the large covariance matrixinto smaller ones, reducing considerably the number ofunknown parameters.

5 GLRT based on Kronecker factorizationIn Section 3, we presented the ST-GLRT approach forthe detection problem in (3) and argued that it performspoorly for M < KN . In Section 4.1, we have (partially)exploited the temporal structure by imposing persym-metry on the covariance matrix. We have also discussedthat the spatio-temporal covariance matrix � has block-Toeplitz (with Toeplitz blocks) structure. In [34,35], it isreported that the block-Toeplitz structure can be approx-imated by the Kronecker product of two matrices. Takingthis into account, we therefore approximate the covari-ance matrix � into a purely spatial and a purely temporalcomponent as:

� = �A ⊗ �T. (20)

In (20), the matrix �A captures the spatial correlationbetween the observations received at different antennasand matrix �T captures the time correlation between Ncolumn vectors in X. Herein, we remark that the covari-ance structure in (20) makes the implicit assumption thatthe temporal correlation structure remains the same at all

spatial locations. Similarly, the spatial correlation struc-ture remains the same for the whole sub-block.

5.1 KR-GLRTIn this section, we consider the Kronecker product-basedfactorization and derive the GLRT that exploits this struc-ture, which is given by

�KR(Z) =max�0

fz (Z; �0)

max�T,�A

fz (Z; �A ⊗ �T)≷H0

H1γ . (21)

In order to solve (21), under hypothesis H1, we needto obtain the ML estimates of the unknown covariancematrices �A and �T. Under H1, the likelihood func-tion fz (Z; �A ⊗ �T), ignoring constant factors, can bewritten as:

fz (Z; �A ⊗ �T) = |�A ⊗ �T|−M

× exp[−

M∑m=1

zH(m)(�−1

A ⊗ �−1T

)z(m)

].

(22)

Now, we can write fz (Z; �A ⊗ �T) = fx (XT; �A, �T)

[36], which yields

fx (XT; �A, �T) = |�A|−MN |�T|−ML

× exp[−

M∑m=1

tr{�−1

T XH (m) �−1A X (m)

}],

(23)

To find the ML estimates of �A and �T, we need to takethe derivative of log fX (xT; �A, �T), given by

log fx (XT; �A, �T) = − N log |�A| − L log |�T|

− 1M

M∑m=1

tr{�−1

T XH(m)�−1A X(m)

}(24)

with respect to �A (�T), keeping �T (�A) fixed. Equat-ing the result of the derivative to zero and after simplemathematical operations, the estimators under H1 can bewritten as:

�T = 1LM

M∑m=1

XH (m) �−1A X (m) , (25)

�A = 1NM

M∑m=1

X (m) �−1T XH (m) . (26)

Equations (25) and (26) suggest that �T and �Acan be obtained using an iterative method such as theFlip-Flop algorithm. The Flip-Flop algorithm maximizeslog fx (XT; �A, �T) w.r.t. �A, keeping the last available

Page 7: RESEARCH OpenAccess Multi ... › assets › papers › R13_EJASP_GLRT_ST.pdf · cognitive receivers [2,3], thus giving us the chance to consider multi-antenna techniques to improve

Ali et al. EURASIP Journal on Advances in Signal Processing 2014, 2014:160 Page 7 of 16http://asp.eurasipjournals.com/content/2014/1/160

estimate of �T fixed and vice versa. In [23], numericalexperiments show that the Flip-Flop algorithm performsvery well and is much faster than a more general pur-pose optimization algorithm such as Newton–Raphson[23]. In [22], it has been reported that for the case of largeenough M, asymptotic efficiency of the ML approach canbe achieved without iterating. Moreover, it is reportedin [23,36] that if (M − 1) N ≥ L and (M − 1) L ≥ N ,or equivalently M ≥ max (N/L; L/N) + 1, then everyiterate in the Flip-Flop algorithm results in positive defi-nite �A ⊗ �T with probability one. Taking into accountthis fact, in order to get �, we adopt a non-iterativeFlip-Flop approach and only perform the steps given inAlgorithm 1, with an initial value of �

0A = IL×L. On the

other hand, underH0, we have the estimate of �0 as: �0 =�A,0 ⊗IN×N . Having all of the maximum likelihood-basedestimates, and solving (21), we can get the expression:

Algorithm 1 ML based Non-IterativeFlip-Flop

• Choose a starting value for �0A as

IL×L.

• Estimate �1T from (25) with �A = �

0A.• Find the following

1. Estimate �A from (26) with �1T.

2. Estimate �T from (25) with �Afrom previous step.

�KR(Z) =∣∣∣�T

∣∣∣L ∣∣∣�A

∣∣∣N∣∣∣�0

∣∣∣ ≷H0H1

γ . (27)

The main advantage of the proposed GLRT (27) overthe traditional is that under H1 instead of 1

2 LN(LN + 1)

parameters, it has only 12 L (L + 1) + 1

2 N (N + 1) parame-ters to estimate. Furthermore, the dimensions of these twocovariance matrices �T and �A are much smaller thanthe dimension of full covariance matrix �, that is whythe computations are much less demanding. Hence, theKronecker model is a good approximation that capturesimportant information about the correlations, while it ispositive definite for M ≥ max (N/L; L/N) + 1, i.e. a muchsmaller number of samples.

5.2 PK-GLRTIn Section 4, we have assumed that the covariance matrix� has a block-Toeplitz (with Toeplitz blocks) structure.Keeping this in mind, in this section, we assess thepossible improvement in the detection performance of

(27) by exploiting the fact that the factored matrices �Tand �A have persymmetric structures. We show that itis possible to account for the persymmetric structure bya simple modification of the Flip-Flop algorithm. Hence,as we did in Section 4.1, the persymmetric structures areexploited by imposing the constraints [21]:

�T = JN�TTJN, (28)

�A = JL�TAJL, (29)

where JL and JN are the reversal matrices of dimensions(L × L) and (N × N), respectively. The modified versionof KR-GLRT (21) that we denoted as PK-GLRT can bewritten as:

�PKR(Z) =max�0

fz (Z; �0)

max�T,�A

fz (Z; �A ⊗ �T)

s.t �A = JL�TAJL

�T = JN�TTJN

(30)

We see that in the expression (30), under hypothesis H0,the solution is the same as in (14) and (21). The differencelies in the case of hypothesis H1, where we need to solve

max�A,�T

log fZ (Z; �A ⊗ �T) ,

s.t �A = JL�TAJL

�T = JN�TTJN

(31)

As in Section 4.1, to obtain the ML estimate of apersymmetric covariance matrix, we need the forward-backward (FB) log likelihood, which is the combination ofthe forward-looking and the backward-looking log like-lihoods. In this case, the forward-looking log-likelihoodfunction for estimating �T can be written as:

log f Fz (Z; �A⊗�T) = −N log |�A| − L log |�T|

− 1M

M∑m=1

tr{�−1

T XH(m)�−1A X(m)

}.

(32)

Similarly, the backward-looking log likelihood is

log f Bz (Z; �A ⊗ �T) = − N log |�A| − L log |�T|

− 1M

tr{�−1

T JNQH JN}

,

(33)

where in (33), Q �∑M

m=1 XH (m)�−1A X (m). Adding (32)

and (33) gives us the Kronecker product-based forward-backward log-likelihood log f BF

z (Z; �A ⊗ �T), requiredfor estimating the persymmetric estimate of �T. Follow-ing similar steps as in (32) and (33), we can find forward-

Page 8: RESEARCH OpenAccess Multi ... › assets › papers › R13_EJASP_GLRT_ST.pdf · cognitive receivers [2,3], thus giving us the chance to consider multi-antenna techniques to improve

Ali et al. EURASIP Journal on Advances in Signal Processing 2014, 2014:160 Page 8 of 16http://asp.eurasipjournals.com/content/2014/1/160

and backward-looking log-likelihood functions for findingthe persymmetric estimate of �A. To find the ML esti-mates of the covariance matrices, in the first step, wefix �A and find �T that maximizes log f BF

z (Z; �A ⊗ �T).Taking into account these results, the estimator of thepersymmetric �T can be found as:

�PS,T = 12ML

M∑m=1

XH (m) �−1PS,AX (m)

+ 12ML

M∑m=1

JN(

XH (m) �−1PS,AX (m)

)TJN,

(34)

Similarly, by following the same process with fixed �T,the estimator of the persymmetric �A can be written as:

�PS,A = 12MN

M∑m=1

X (m) �−1PS,TXH (m)

+ 12MN

M∑m=1

JL(

X (m) �−1PS,TXH (m)

)TJL

(35)

As it was in the case of expressions (25) and (26), both ofthe expressions (34) and (35) suggest that �PS,T and �PS,Acan be estimated using an iterative method such as theFlip-Flop algorithm, as shown in Algorithm 2. Using (34)and (35), the final expression for the GLRT becomes

�PKR(Z) =∣∣∣�PS,T

∣∣∣L ∣∣∣�PS,A

∣∣∣N∣∣∣�0

∣∣∣ ≷H0H1

γ . (36)

Compared to (27), (36) provides better detection perfor-mance in the small sample support regime. It is because,finding the estimates of �T and �A, (34) and (35) requiresmaller M compared to (25) and (26). In conclusion,by exploiting the underlying structure of the covariancematrix � via the persymmetric ML estimates of thecovariance matrices �T and �A, it further increases therobustness of (27) at small sample support.

Algorithm 2 Non-Iterative Flip-Flop(persymmetric)

• Choose a starting value for �0A as

IL×L• Estimate �

1T from (25) with �

1A = �

0A.• Find the following

1. Estimate �PS,A from (35) with �1T.

2. Estimate �PS,T from (34) with �PS,Afrom step 1.

6 Numerical resultsIn this section, we present numerical results to evaluatethe performance of the proposed detection schemes, pre-sented in the preceding sections. For the analysis to beconducted herein, we use the receiver operating charac-teristic (ROC) curve and the area under the ROC curve(AUC), which varies between 0.5 (poor performance) and1 (good performance) [37], as the performance measures.We also evaluate the performance by the probability ofdetection vs the signal-to-noise ratio for a fixed falsealarm probability. For the evaluation, we performed thefollowing two experiments:

6.1 Experiment no. 1: detection of a WSS signal atmultiple antennas

In this experiment, we assume that the signal received atthe multi-antenna receiver is a vector WSS Gaussian pro-cess corrupted by uncorrelated noise. In order to assessthe detectors with this assumption, the SNRs (expected)κl, l = 1, 2, · · · , L are allocated differently, with averageSNR of all antennas is: κ = 1

L∑L

l=1 κl. For a speci-fied received signal power (equal at different antennas)and κ , we find Pn, the mean noise power. Noise powers(expectedb) at L different antennas σ 2

n,l, l = 1, 2, · · · , Lare kept different while Pn = 1

L∑L

i=1 σ 2n,l. Moreover, we

assume L = 4 antennas and a separable spatio-temporalcorrelation. The spatial covariance matrix is generatedas [�A]i,j = 0.3|i−j|, i, j = 1, . . . , L, and the temporalcovariance matrix is generated as [�T]i,j = 0.9|i−j|, i, j =1, . . . , NT . The remaining parameters for each experi-ment are described in the captions of the correspondingdiagrams.

In the first experiment, we obtain the ROC curvesusing Matlab to compare the performance of the pro-posed schemes to that of the traditional schemes (GLRTthat ignores temporal correlation �TR (XT), the energydetector �Eng (XT)) and the ST-GLRT and its frequency-domain approximation for WSS series. In particular, theenergy detector can be expressed as:

�Eng (XT) =M∑

m=1

N∑n=1

L∑l=1

∣∣xl,n (m)∣∣2 ≷H1

H0γ , (37)

and the frequency-domain detector is given by (39) inAppendix A. It is to be noted that the energy detectorassumes known noise power.

In this experiment, we plot ROC curves for two differ-ent cases. In the first case, we assume that the unknownnoise powers at different antennas are perfectly estimatedand there is no noise power uncertainty. The ROC plotsare given in Figure 3. In the second part of this experi-ment, shown in Figure 4, we repeat the same experimental

Page 9: RESEARCH OpenAccess Multi ... › assets › papers › R13_EJASP_GLRT_ST.pdf · cognitive receivers [2,3], thus giving us the chance to consider multi-antenna techniques to improve

Ali et al. EURASIP Journal on Advances in Signal Processing 2014, 2014:160 Page 9 of 16http://asp.eurasipjournals.com/content/2014/1/160

0 0.2 0.4 0.6 0.8 10

0.1

0.2

0.3

0.4

0.5

0.6

0.7

0.8

0.9

1

Probabilty of False Alarm, PFA

Pro

babi

lty o

f Det

ectio

n, P

D ΛST

(X)

ΛPST

(X)

ΛKR

(X)

ΛPKR

(X)

ΛTr

(X)

ΛFreq

(X)

ΛEng

(X)

Figure 3 ROC curves for comparison of detection schemes: sample support size M = 70, number of vector samples per sub-block N = 10,number of antennas L = 4, noise uncertainty αnu = 1, and average SNR κ = −10 dB.

setup for the case with noise power uncertainty. Note thatwe model the noise power uncertainty by generating the

noise power at the l-th antenna as σ 2w,l ∼ U

(σ 2

n,lαnu

, αnuσ 2n,l

),

where αnu ≥ 1, and αnu = 1 means no noise uncer-tainty [1]. From the results, it is clear that the proposed

schemes clearly outperform traditional schemes. More-over, from the experiment, we can also conclude thatthe noise power uncertainty slightly deteriorates the per-formance of all detectors. However, the energy detectorcompletely collapses at αnu = 2. To further analyze theeffects of the sample support, noise power uncertainty,

0 0.2 0.4 0.6 0.8 10

0.1

0.2

0.3

0.4

0.5

0.6

0.7

0.8

0.9

1

Probabilty of False Alarm, PFA

Pro

babi

lty o

f Det

ectio

n, P

D

ΛST

(X)

ΛPST

(X)

ΛKR

(X)

ΛPKR

(X)

ΛTr

(X)

ΛFreq

(X)

ΛEng

(X)

Figure 4 ROC curves for comparison of detection schemes: sample support size M = 70, number of vector samples per sub-block N = 10,number of antennas L = 4, noise uncertainty αnu = 2, and average SNR κ = −10 dB.

Page 10: RESEARCH OpenAccess Multi ... › assets › papers › R13_EJASP_GLRT_ST.pdf · cognitive receivers [2,3], thus giving us the chance to consider multi-antenna techniques to improve

Ali et al. EURASIP Journal on Advances in Signal Processing 2014, 2014:160 Page 10 of 16http://asp.eurasipjournals.com/content/2014/1/160

and shadowing parameters, we need to have a single andquantitative figure of merit. This metric is the area underthe ROC curve (AUC), which varies between 0.5 (poorperformance) and 1 (good performance). Hence, next,we use AUC curves to see effects of the sample sup-port, noise power uncertainty, and shadowing parameterson the detection performance of the spectrum sensingschemes. We model the shadowing effect by using log-normal random variable as: psh = pm10xσ /10, where pmis the expected signal power at the receiver (equal for allantennas), psh is the signal power after shadowing effect,and xσ is Gaussian random variable with 0 mean andstandard deviation σSh. The log-normal shadow fading isoften characterized by its dB spread, σdB, which has therelationship σSh = 0.1σdB log10 [38].

In Figure 5, we plot the AUC curves to analyze theeffects of sample size in the presence of shadowing, withand without the effects of noise power uncertainty. Inthis figure, the curves with dashed lines represent thecase where both shadowing and noise power uncertaintywith αnu = 1.5. On the other hand, the curves withsolid lines show the case with no noise power uncertainty(i.e. αnu = 1). From these plots, it can be concluded thatthe proposed schemes �KR(Z) and �PKR(Z) are robustagainst the small sample support both in the presence andabsence of noise power uncertainty. Particularly, in theregion 20 ≤ M ≤ 80, the detectors �KR(Z) and �PKR(Z)

clearly outperform the other detectors. As expected, wecan also see that in the small sample regime, �PKR(Z) per-forms better than �KR(Z). The obvious reason for this isthat underH1 instead of 1

2 L(L+1)+ 12 N(N+1) parameters

in the case of �KR(Z), the detection scheme �PKR(Z)

has approximately only (2L − 1) + (2N − 1) parametersto estimate. In order to confirm this, in Figure 6, weplot the normalized minimum square error (MSE) of theestimator of the covariance matrix under hypothesis H1,expressed as:

RNMSE =

√√√√√ 1Javg

Javg∑j=1

∥∥∥�j − �

∥∥∥2

F‖�‖2

F(38)

where � is the true spatio-temporal covariance matrix,�j is the estimated covariance for each estimator in theMonte Carlo simulation j, ‖.‖2

F is the Frobenius norm, andJavg is the number of Monte Carlo simulations. The resultsconfirm that for small sample support, estimators of �

used in the case �KR(Z) and �PKR(Z) have smaller error.In Figure 7, we show the AUC plots to analyze the

effect of shadowing (i.e. σdB) both in the presence andabsence of noise uncertainty, αnu = 1.5 and αnu = 1,respectively. It is clear from these results that the effectsof shadowing are very small on the performance of thedetection schemes. However, we can see that by incre-menting σdB, a slight improvement occurs in the perfor-mance of the detection schemes �TR (Z), �ST (Z), and�PS (Z). The most obvious reason for this interestingoutcome can be the heavy-tailed distribution of the pri-mary signal strength due to the log-normally-distributedshadow fading that behaves in such a way at lowerSNR [39].

20 40 60 80 100 120 140 160 180 2000.5

0.55

0.6

0.65

0.7

0.75

0.8

0.85

0.9

0.95

1

Number of Sample, M

Are

a un

der R

OC

cur

ve, A

UC

ΛST

(X)

ΛPST

(X)

ΛKR

(X)

ΛPKR

(X)

ΛTR

(X)

ΛFreq

(X)

ΛEng

(X)

Figure 5 AUC curves (solid lines for αnu = 1 and dashed lines for αnu = 2) to assess the effects of number of samples M: number of vectorsamples per sub-block N = 15, number of antennas L = 4, shadowing σdB−spread = 4, and average SNR κ = −12 dB.

Page 11: RESEARCH OpenAccess Multi ... › assets › papers › R13_EJASP_GLRT_ST.pdf · cognitive receivers [2,3], thus giving us the chance to consider multi-antenna techniques to improve

Ali et al. EURASIP Journal on Advances in Signal Processing 2014, 2014:160 Page 11 of 16http://asp.eurasipjournals.com/content/2014/1/160

50 100 150 200

3.2

3.4

3.6

3.8

4

4.2

4.4

Number of Samples, M

Nor

mal

ized

MS

E

MLEPersym−MLEKron−MLEPerKron−MLE

Figure 6 Normalized MSE of the estimator of covariance matrix under hypothesis H1.

In Figure 8, we show the AUC plots to analyze theeffects of noise power uncertainty. The results show arobust behavior for the detection schemes against thenoise power uncertainty. Once again, we observe thatthe performance of the proposed schemes �KR(Z) and�PKR(Z) is better than other schemes that do not exploitthe underlying structure of the received signal.

Finally, keeping the probability of false alarm PF fixed,we simulate the performance of the detection schemesby plotting PD against different values of average SNR κ .The simulation results are shown in Figure 9. The resultsclearly show that for different SNR values, the proposedschemes consistently perform better than the detectionschemes that do not exploit the covariance structure.

0 2 4 6 8 10 120.86

0.88

0.9

0.92

0.94

0.96

0.98

1

dB Spread σdB

Are

a un

der R

OC

cur

ve, A

UC

ΛST

(X)

ΛPST

(X)

ΛKR

(X)

ΛPKR

(X)

ΛTR

(X)

Figure 7 AUC curves (solid lines for αnu = 1.5 and dashed lines for αnu = 1) to assess the effects of shadowing σdB: sample size M = 80,number of vector samples per sub-block N = 15, number of antennas L = 4, and average SNR κ = −8 dB.

Page 12: RESEARCH OpenAccess Multi ... › assets › papers › R13_EJASP_GLRT_ST.pdf · cognitive receivers [2,3], thus giving us the chance to consider multi-antenna techniques to improve

Ali et al. EURASIP Journal on Advances in Signal Processing 2014, 2014:160 Page 12 of 16http://asp.eurasipjournals.com/content/2014/1/160

1 1.2 1.4 1.6 1.8 20.65

0.7

0.75

0.8

0.85

0.9

0.95

1

Noise uncertainty factor, αnu

Are

a un

der R

OC

cur

ve, A

UC

ΛST

(X)

ΛPST

(X)

ΛKR

(X)

ΛPKR

(X)

ΛTR

(X)

ΛFreq

(X)

ΛEng

(X)

Figure 8 AUC curves to assess the effects of noise uncertainty αnu: with sample size M = 80, number of vector samples per sub-blockN = 15, number of antennas L = 4, effect of shadowing σdB−spread = 4, and average SNR κ = −8 dB.

In conclusion, we can say that the exploitation of inher-ent structure of covariance matrix both in frequency andtime domain leads us to robustness against the smallsample support compared to the ST-GLRT in (9).

6.2 Experiment no. 2: cognitive radioIn the previous set of experiments, we analyzed the pro-posed schemes for detection of a Gaussian signal with

separable spatio-temporal covariance matrix in unknownadditive uncorrelated noise. In the present set of exper-iments, we perform simulations to illustrate the appli-cation of the proposed detection schemes in cognitiveradio, i.e. with an actual communication signal instead of aGaussian signal and with a non-separable spatio-temporalcovariance matrix. For the simulations, we have used anODFM-modulated DVB-T signal (8-K mode, 64 − QAM,

−20 −15 −10 −5 00

0.1

0.2

0.3

0.4

0.5

0.6

0.7

0.8

0.9

1

Average Signal to Noise Ratio (SNR)

Pro

babi

lty o

f Det

ectio

n, P

D

ΛST

(X)

ΛPST

(X)

ΛFreq

(X)

ΛKR

(X)

ΛPKR

(X)PFA=0.1

PFA=0.01

Figure 9 PD vs average SNR: number of vector samples per sub-block N = 15, number of antennas L = 4, M = 80.

Page 13: RESEARCH OpenAccess Multi ... › assets › papers › R13_EJASP_GLRT_ST.pdf · cognitive receivers [2,3], thus giving us the chance to consider multi-antenna techniques to improve

Ali et al. EURASIP Journal on Advances in Signal Processing 2014, 2014:160 Page 13 of 16http://asp.eurasipjournals.com/content/2014/1/160

0 0.2 0.4 0.6 0.8 10

0.2

0.4

0.6

0.8

1

Probability of False Alarm

Pro

babi

lity

of D

etec

tion

ΛST(X)

ΛPS(X)

ΛFreq(X)

ΛKR(X)

ΛPKR(X)

ΛTr(X)

Figure 10 ROC curves (solid lines for αnu = 2 and dashed lines for αnu = 1): sample support size M = 70, number of vector samples persub-block N = 15, number of antennas L = 4, channel delay spread 0.779 μs, and average SNR κ = −8 dB.

guard interval 1/4, and inner code rate 2/3) with a band-width of 7.61 MHz. We have considered a 4 × 4 Rayleighchannel with unit power and an exponential power delayprofile with length 64 samples (at a sampling frequencyof 7.61 MHz). The additive noises at each antenna aregenerated as a zero-mean complex Gaussian process. Weused a noise power different at each antenna with average

SNR κ = −8 dB. To analyze the schemes, we plot ROCcurves with N = 15 vector samples per sub-block. Therest of the parameters are given in the captions of thefigures.

In Figure 10, we plot the ROC curves for the casewhen the channel delay spread is 0.779 μs with andwithout noise power uncertainty. In this figure, we can

0 0.2 0.4 0.6 0.8 10

0.2

0.4

0.6

0.8

1

Probability of False Alarm

Pro

babi

lity

of D

etec

tion

ΛST(X)

ΛPS(X)

ΛFreq(X)

ΛKR(X)

ΛPKR(X)

ΛTr(X)

Figure 11 ROC curves (solid lines for αnu = 2 and dashed lines for αnu = 1): sample support size M = 70, number of vector samples persub-block N = 15, number of antennas L = 4, channel delay spread 0.097 μs, and average SNR κ = −8 dB.

Page 14: RESEARCH OpenAccess Multi ... › assets › papers › R13_EJASP_GLRT_ST.pdf · cognitive receivers [2,3], thus giving us the chance to consider multi-antenna techniques to improve

Ali et al. EURASIP Journal on Advances in Signal Processing 2014, 2014:160 Page 14 of 16http://asp.eurasipjournals.com/content/2014/1/160

easily see that the detectors �KR(Z) and �PKR(Z) clearlyoutperform the other detectors, even for this realisticspatio-temporal correlation. In Figure 11, we repeatthe experiment of Figure 10 for a channel delay spread0.097μs (almost flat fading channel). Even for channelswith low-frequency selectivity, we can see that the detec-tion schemes �KR(Z) and �PKR(Z) consistently performbetter compared to the rest of the detection schemes.However, compared to Figure 10, in Figure 11, we cansee interesting results that the detector which ignorestemporal correlation outperforms spatio-temporalcorrelation-based schemes due to the low selectivity of thechannel.

Before commenting on these interesting results, forfurther confirmation, we need to have a single andquantitative figure of merit, so we plot AUC curves inFigure 12.

The AUC curves in Figure 12 demonstrate that theproposed detectors have better performance comparedto the traditional detection schemes, for different val-ues of sample support. In order to see the effect ofdelay spreads, we consider two types of channel delayspreads (0.097 and 0.779 μs). The results in Figure 12confirm that the proposed schemes �KR(Z) and �PKR(Z)

consistently outperform other schemes in small samplesupport regime. We can also see that in general, the per-formance of all schemes is degraded for different valuesof delay spreads; since the larger the delay spread, thelarger the degradation incurred. However, we can see thatcompared to rest of the detection schemes, �KR(Z) and

�PKR(Z) show more robustness against changes in thedelay spread. We can further observe an interesting out-come in the case of the detector that ignores temporalcorrelation and the frequency-based approximate GLRT,where we can see that the performances of these twodetection schemes are quite different for the two chan-nel delay spreads. This is also evident from comparisonof ROC curves in Figures 10 and 11. The possible expla-nation for this can be that for a short channel delayspread (i.e. 0.097 μs), the multi-tap channel adds neg-ligible temporal correlation. On the other hand, in thecase of channel delay spread 0.779 μs, we have tempo-ral correlation imposed by the channel on the temporallyuncorrelated transmitted OFDM signal. Therefore, theinclusion of temporal correlation as an additional detec-tion metric is not helping in the case of delay spread(i.e. 0.097 μs).

In order to further analyze the detection performanceof the proposed schemes, in Figure 13, we compare theresults by plotting PD vs average SNR κ . Once again, wecan observe that the proposed schemes outperform theremaining detection schemes presented in this paper.

7 ConclusionIn this paper, we have proposed novel detection schemesthat exploit the spatio-temporal correlation present inthe received observations at a multi-antenna receiver.When exploiting the spatio-temporal correlation, we haveobserved that the GLRT performs poorly when the sam-ple support is small. To cope with this problem, we

50 100 150 2000.5

0.6

0.7

0.8

0.9

1

Number Sample M

Are

a un

der R

OC

cur

ve (A

UC

)

ΛST(X)

ΛPS(X)

ΛFreq(X)

ΛKR(X)

ΛPKR(X)

ΛTr(X)

Figure 12 AUC curves (solid lines for channel delay spread 0.779 μs and dashed lines for channel delay spread 0.097 μs) to assess theeffects of number of samples M: number of vector samples per sub-block N = 15, number of antennas L = 4, and average SNR κ = −8 dB.

Page 15: RESEARCH OpenAccess Multi ... › assets › papers › R13_EJASP_GLRT_ST.pdf · cognitive receivers [2,3], thus giving us the chance to consider multi-antenna techniques to improve

Ali et al. EURASIP Journal on Advances in Signal Processing 2014, 2014:160 Page 15 of 16http://asp.eurasipjournals.com/content/2014/1/160

−12 −10 −8 −6 −4 −2 00

0.1

0.2

0.3

0.4

0.5

0.6

0.7

0.8

0.9

1

Average Signal to Noise Ratio (SNR)

Pro

babi

lty o

f Det

ectio

n, P

D

ΛST

(Z)

ΛFreq

ΛKR

ΛPKR

PFA=0.1

PFA=0−01

Figure 13 PD vs average SNR: number of vector samples per sub-block N = 15, number of antennas L = 4, M = 80, channel delay spread0.097 μs.

have proposed detectors that are robust against thesample support. The proposed detectors (approximately)exploit the inherent spatio-temporal structure of thereceived covariance matrix by using the properties ofpersymmetric matrices and the Kronecker product ofthe spatial and temporal covariance matrices. The per-formance of the proposed detectors has been evaluatedwith the help of numerical simulations, which showimportant improvements compared to the traditionalschemes.

EndnotesaWe begin with the complex base-band signal sampled

at the specific Nyquist rate.bThese values are maintained during Monte-Carlo

process. In order to create noise power uncertainty, thesevalues are affected by random uncertainty at each run.

AppendixA Approximated GLRT in the frequency domainFor the interested readers, herein, we reproduce theapproximated GLRT in the frequency domain, originallypresented in [15]. While considering the case in whichthe received signals are jointly WSS, the limiting form(L fixed and M, N → ∞) of the the frequency domainGLRT statistic is written as [15]:

�ST(Zf

) −→M,N→∞ exp

⎧⎨⎩∫ π

−π

log

⎡⎣det

(S(e jθn

))∏L

i=1 sii(e jθn

)⎤⎦ dθn

⎫⎬⎭ ,

(39)

where[

S(e jθn

)]l,k

= fH (e jθn

)�l,kf

(e jθn

)is a quadratic

estimator and the Fourier vector is given by

f(e jθn

) =[1, e−jθn , e−j2θn , e−j3θn , . . . , e−j(N−1)θn

]T.

(40)

Competing interestsThe authors declare that they have no competing interests.

AcknowledgementsThis work was supported by the Catalan Government under the grantFIDGR- 2011-FIB0071.

Author details1SPCOMNAV, Universitat Autònoma de Barcelona, 08193 Bellaterra, Barcelona,Spain. 2Electrical Engineering, University of Engineering and Technology,Peshawar, 25000 Peshawar, Pakistan. 3Department of Electrical Engineeringand Information Technology (EIM-E), Universität Paderborn, Paderborn,Germany. 4Signal Processing School of Electrical Engineering, KTH RoyalInstitute of Technology, SE-100 44 Stockholm, Sweden.

Received: 1 July 2014 Accepted: 25 October 2014Published: 7 November 2014

References1. E Biglieri, AJ Goldsmith, LJ Greenstein, NB Mandayam, HV Poor, Principles

of Cognitive Radio (Cambridge University Press, Cambridge, 2012)2. AW Min, X Zhang, KG Shin, Detection of small-scale primary users in

cognitive radio networks. IEEE J. Selected Areas Commun. 29, 349–361(2011)

3. R Zhang, TJ Lim, Y-C Liang, Y Zeng, Multi-antenna based spectrumsensing for cognitive radios: a GLRT approach. IEEE Trans. Commun.58(1), 84–88 (2010)

4. E Axell, G Leus, EG Larsson, HV Poor, Spectrum sensing for cognitiveradio: state-of-the-art and recent advances. IEEE Signal Process. Mag.29(3), 101–116 (2012)

Page 16: RESEARCH OpenAccess Multi ... › assets › papers › R13_EJASP_GLRT_ST.pdf · cognitive receivers [2,3], thus giving us the chance to consider multi-antenna techniques to improve

Ali et al. EURASIP Journal on Advances in Signal Processing 2014, 2014:160 Page 16 of 16http://asp.eurasipjournals.com/content/2014/1/160

5. Y Zeng, Y-C Liang, AT Hoang, R Zhang, A review on spectrum sensing forcognitive radio: challenges and solutions. EURASIP J. Adv. Signal Process.(2010). doi:10.1155/2010/381465

6. W Yang, G Durisiand, VI Morgenshtern, E Riegler, in Proc. ISWCS. Capacitypre-log of SIMO correlated block-fading channels (Piscataway, 2011),pp. 869–873

7. KV Mardia, JT Kent, JM Bibby, Multivariate Analysis, 1st edn. (AcademicPress, Waltham, 1979), p. 536

8. TW Anderson, An Introduction to Multivariate Statistical Analysis(Wiley-Interscience, Hoboken, 2003)

9. Y Zeng, Y-C Liang, Eigenvalue-based spectrum sensing algorithms forcognitive radio. IEEE Trans. Commun. 57(6), 1784–1793 (2009)

10. C Vazquez-Vilar, R López-Valcarce, Spectrum sensing exploiting guardbands and weak channels. IEEE Trans. Signal Process. 59(12), 6045–6057(2011)

11. Z Quan, W Zhang, SJ Shellhammer, A Sayed, Optimal spectral featuredetection for spectrum sensing at very low SNR. IEEE Trans. Commun.59(1), 201–212 (2011)

12. J Sala-Alvarez, G Vazquez-Vilar, R Lopez-Valcarce, Multiantenna GLRdetection of rank-one signals with known power spectrum in whitenoise with unknown spatial correlation. IEEE Trans. Signal Process.60(6), 3065–3078 (2012)

13. A Perez-Neira, M-A Lagunas, MA Rojas, P Stoica, Correlation matchingapproach for spectrum sensing in open spectrum communications.IEEE Trans. Signal Process. 57(12), 4823–4836 (2009)

14. G Vazquez-Vilar, R Lopez-Valcarce, J Sala, Multiantenna spectrum sensingexploiting spectral a priori information. IEEE Trans. Wireless Commun.10(12), 4345–4355 (2011)

15. D Ramirez, J Via, I Santamaria, LL Scharf, Detection of Spatially CorrelatedGaussian Time Series. IEEE Trans. Signal Process. 58(10), 5006–5015 (2010)

16. TF Ayoub, AR Haimovich, Modified GLRT signal detection algorithm.IEEE Trans. Aerospace Electron. Syst. 36(3), 810–818 (2000)

17. Y Chen, A Wiesel, AO Hero, in Proc. of IEEE ICASSP. Shrinkage Estimation ofHigh Dimensional Covariance Matrices (Piscataway, 2009), pp. 2937–2940

18. D Ramírez, G Vazquez-Vilar, R López-Valcarce, J Vía, I Santamaría,Detection of rank-p signals in cognitive radio networks with uncalibratedmultiple antennas. IEEE Trans. Signal Process. 59(8), 3764–3774 (2011)

19. R Nitzberg, Application of maximum likelihood estimation ofpersymmetric covariance matrices to adaptive processing. IEEE Trans.Aerospace Electron. Syst. 16(1), 124–127 (1980)

20. D Akdemir, AK Gupta, Array variate random variables with multiwayKronecker delta covariance matrix structure. J. Algebraic Stat. 2(1), 98–113(2011)

21. P Wirfalt, M Jansson, in Proc. of Sensor Array and Multichannel SignalProcessing Workshop (SAM). On Toeplitz and Kronecker StructuredCovariance Matrix Estimation (Piscataway, 2010), pp. 185–188

22. K Werner, M Jansson, P Stoica, On estimation of covariance matrices withKronecker product structure. IEEE Trans. Signal Process. 56(2), 478–491(2008)

23. N Lu, DL Zimmerman, The likelihood ratio test for a separable covariancematrix. Stat. Probab. Lett. 73(5), 449–457 (2005)

24. MG Genton, Separable approximations of space-time covariancematrices. Environmetrics 18(7), 681–695 (2007)

25. S Ali, JA López-Salcedo, G Seco-Granados, in Proc. of 20th EuropeanSignal Processing Conference. Exploiting Structure of Spatio-temporalCorrelation for Detection in Wireless Sensor Networks (Piscataway, 2012),pp. 774–778

26. Y Liang, VV Veeravalli, Capacity of noncoherent time-selectiverayleigh-fading channels. IEEE Trans. Inform. Theory 50(12), 3095–3110(2004)

27. SM Kay, Fundamentals of Statistical Signal Processing, Volume 2: DetectionTheory (Prentice Hall, Upper Saddle River, 1998)

28. L Lu, H-C Wu, SS Iyengar, A novel robust detection algorithm forspectrum sensing. IEEE J. Selected Areas Commun. 29(2), 305–315 (2011)

29. M Wax, T Kailath, Efficient inversion of Toeplitz-block Toeplitz matrix.IEEE Trans. Acoustics, Speech Signal Process. 31(5), 1218–1221 (1983)

30. J Ward, Space-Time Adaptive Processing for Airborne Radar. TechnicalReport 1015, Lincoln Laboratory, MIT, (Lexington, December 1994)

31. TK Huckle, K Waldherr, T Schulte-Herbrüggen, Exploiting matrixsymmetries and physical symmetries in matrix product states and tensortrains. Linear Multilinear Algebra 61(1), 91–122 (2013)

32. M Jansson, P Stoica, Forward-only and forward-backward samplecovariances – a comparative study. Signal Process. 77(3), 235–245 (1999)

33. B Armour, Structured Covariance Autoregressive Parameter Estimation.PhD thesis, Department of Electrical Engineering, McGill Canada, 1989

34. J Kamm, J Nagy, Optimal Kronecker Product Approximation of BlockToeplitz Matrices. SIAM J. Matrix Anal. Appl. 22(1), 155–172 (2000)

35. MW Mitchell, MG Genton, ML Gumpertz, A likelihood ratio test forseparability of covariances. J. Multivar. Anal. 97(5), 1025–1043 (2006)

36. P Dutilleul, The MLE algorithm for the matrix normal distribution. J. Stat.Comput. Simul. 64, 105–123 (1999)

37. T Fawcett, ROC graphs: Notes and practical considerations for researchers.Technical Report HPL-2003-4, HP Laboratories, (Palo Alto, 2003)

38. A Ghasemi, ES Souca, Asymptotic performance of collaborative spectrumsensing under correlated log-normal shadowing. IEEE Commun. Lett.11(1), 34–36 (2007)

39. T Muetze, P Stuedi, F Kuhn, G Alonso, in Proc. of IEEE SECON, San Francisco.Understanding radio irregularity in wireless networks (Piscataway, 2008),pp. 82–90

doi:10.1186/1687-6180-2014-160Cite this article as: Ali et al.: Multi-antenna spectrum sensing by exploitingspatio-temporal correlation. EURASIP Journal on Advances in SignalProcessing 2014 2014:160.

Submit your manuscript to a journal and benefi t from:

7 Convenient online submission

7 Rigorous peer review

7 Immediate publication on acceptance

7 Open access: articles freely available online

7 High visibility within the fi eld

7 Retaining the copyright to your article

Submit your next manuscript at 7 springeropen.com