Non-Parametric Non-Line-of-Sight Identification 1

Similar documents
UWB Geolocation Techniques for IEEE a Personal Area Networks

SPECTRUM sensing is a core concept of cognitive radio

Feature Extraction Techniques

Experimental Design For Model Discrimination And Precise Parameter Estimation In WDS Analysis

Model Fitting. CURM Background Material, Fall 2014 Dr. Doreen De Leon

TEST OF HOMOGENEITY OF PARALLEL SAMPLES FROM LOGNORMAL POPULATIONS WITH UNEQUAL VARIANCES

Machine Learning Basics: Estimators, Bias and Variance

Symbolic Analysis as Universal Tool for Deriving Properties of Non-linear Algorithms Case study of EM Algorithm

This model assumes that the probability of a gap has size i is proportional to 1/i. i.e., i log m e. j=1. E[gap size] = i P r(i) = N f t.

Inspection; structural health monitoring; reliability; Bayesian analysis; updating; decision analysis; value of information

DEPARTMENT OF ECONOMETRICS AND BUSINESS STATISTICS

Support recovery in compressed sensing: An estimation theoretic approach

Proc. of the IEEE/OES Seventh Working Conference on Current Measurement Technology UNCERTAINTIES IN SEASONDE CURRENT VELOCITIES

Optimal Resource Allocation in Multicast Device-to-Device Communications Underlaying LTE Networks

Support Vector Machine Classification of Uncertain and Imbalanced data using Robust Optimization

Pattern Recognition and Machine Learning. Learning and Evaluation for Pattern Recognition

A Low-Complexity Congestion Control and Scheduling Algorithm for Multihop Wireless Networks with Order-Optimal Per-Flow Delay

Randomized Recovery for Boolean Compressed Sensing

Combining Classifiers

Probability Distributions

Ensemble Based on Data Envelopment Analysis

Upper bound on false alarm rate for landmine detection and classification using syntactic pattern recognition

Keywords: Estimator, Bias, Mean-squared error, normality, generalized Pareto distribution

Soft Computing Techniques Help Assign Weights to Different Factors in Vulnerability Analysis

e-companion ONLY AVAILABLE IN ELECTRONIC FORM

Intelligent Systems: Reasoning and Recognition. Perceptrons and Support Vector Machines

COS 424: Interacting with Data. Written Exercises

are equal to zero, where, q = p 1. For each gene j, the pairwise null and alternative hypotheses are,

Block designs and statistics

E0 370 Statistical Learning Theory Lecture 6 (Aug 30, 2011) Margin Analysis

A Simple Regression Problem

A Simplified Analytical Approach for Efficiency Evaluation of the Weaving Machines with Automatic Filling Repair

Effective joint probabilistic data association using maximum a posteriori estimates of target states

Kernel based Object Tracking using Color Histogram Technique

Ch 12: Variations on Backpropagation

Qualitative Modelling of Time Series Using Self-Organizing Maps: Application to Animal Science

An Improved Particle Filter with Applications in Ballistic Target Tracking

Convolutional Codes. Lecture Notes 8: Trellis Codes. Example: K=3,M=2, rate 1/2 code. Figure 95: Convolutional Encoder

On Constant Power Water-filling

Topic 5a Introduction to Curve Fitting & Linear Regression

Estimating Parameters for a Gaussian pdf

Detection and Estimation Theory

SHORT TIME FOURIER TRANSFORM PROBABILITY DISTRIBUTION FOR TIME-FREQUENCY SEGMENTATION

ANALYTICAL INVESTIGATION AND PARAMETRIC STUDY OF LATERAL IMPACT BEHAVIOR OF PRESSURIZED PIPELINES AND INFLUENCE OF INTERNAL PRESSURE

A method to determine relative stroke detection efficiencies from multiplicity distributions

Using a De-Convolution Window for Operating Modal Analysis

Energy Efficient Predictive Resource Allocation for VoD and Real-time Services

arxiv: v1 [cs.ds] 3 Feb 2014

MSEC MODELING OF DEGRADATION PROCESSES TO OBTAIN AN OPTIMAL SOLUTION FOR MAINTENANCE AND PERFORMANCE

Fairness via priority scheduling

Identical Maximum Likelihood State Estimation Based on Incremental Finite Mixture Model in PHD Filter

Fast Montgomery-like Square Root Computation over GF(2 m ) for All Trinomials

Extension of CSRSM for the Parametric Study of the Face Stability of Pressurized Tunnels

On Conditions for Linearity of Optimal Estimation

Compression and Predictive Distributions for Large Alphabet i.i.d and Markov models

Compressive Distilled Sensing: Sparse Recovery Using Adaptivity in Compressive Measurements

When Short Runs Beat Long Runs

List Scheduling and LPT Oliver Braun (09/05/2017)

Lower Bounds for Quantized Matrix Completion

HIGH RESOLUTION NEAR-FIELD MULTIPLE TARGET DETECTION AND LOCALIZATION USING SUPPORT VECTOR MACHINES

NUMERICAL MODELLING OF THE TYRE/ROAD CONTACT

OBJECTIVES INTRODUCTION

A remark on a success rate model for DPA and CPA

General Properties of Radiation Detectors Supplements

Pattern Recognition and Machine Learning. Artificial Neural networks

Computable Shell Decomposition Bounds

An Adaptive UKF Algorithm for the State and Parameter Estimations of a Mobile Robot

On the Communication Complexity of Lipschitzian Optimization for the Coordinated Model of Computation

Birthday Paradox Calculations and Approximation

Biostatistics Department Technical Report

Statistical Logic Cell Delay Analysis Using a Current-based Model

13.2 Fully Polynomial Randomized Approximation Scheme for Permanent of Random 0-1 Matrices

A note on the multiplication of sparse matrices

Sharp Time Data Tradeoffs for Linear Inverse Problems

Fixed-to-Variable Length Distribution Matching

NOTES AND CORRESPONDENCE. Two Extra Components in the Brier Score Decomposition

Best Linear Unbiased and Invariant Reconstructors for the Past Records

Warning System of Dangerous Chemical Gas in Factory Based on Wireless Sensor Network

In this chapter, we consider several graph-theoretic and probabilistic models

Testing equality of variances for multiple univariate normal populations

Interactive Markov Models of Evolutionary Algorithms

Chapter 6 1-D Continuous Groups

A Self-Organizing Model for Logical Regression Jerry Farlow 1 University of Maine. (1900 words)

Kernel Methods and Support Vector Machines

Determining OWA Operator Weights by Mean Absolute Deviation Minimization

Device-to-Device Collaboration through Distributed Storage

DISSIMILARITY MEASURES FOR ICA-BASED SOURCE NUMBER ESTIMATION. Seungchul Lee 2 2. University of Michigan. Ann Arbor, MI, USA.

Efficient Filter Banks And Interpolators

PULSE-TRAIN BASED TIME-DELAY ESTIMATION IMPROVES RESILIENCY TO NOISE

The proofs of Theorem 1-3 are along the lines of Wied and Galeano (2013).

A Better Algorithm For an Ancient Scheduling Problem. David R. Karger Steven J. Phillips Eric Torng. Department of Computer Science

Boosting with log-loss

On Poset Merging. 1 Introduction. Peter Chen Guoli Ding Steve Seiden. Keywords: Merging, Partial Order, Lower Bounds. AMS Classification: 68W40

Optimal Jamming Over Additive Noise: Vector Source-Channel Case

Bayesian Approach for Fatigue Life Prediction from Field Inspection

Course Notes for EE227C (Spring 2018): Convex Optimization and Approximation

An improved self-adaptive harmony search algorithm for joint replenishment problems

Department of Electronic and Optical Engineering, Ordnance Engineering College, Shijiazhuang, , China

A Note on the Applied Use of MDL Approximations

Meta-Analytic Interval Estimation for Bivariate Correlations

Transcription:

Non-Paraetric Non-Line-of-Sight Identification Sinan Gezici, Hisashi Kobayashi and H. Vincent Poor Departent of Electrical Engineering School of Engineering and Applied Science Princeton University, Princeton, NJ 08544 {sgezici,hisashi,poor}@princeton.edu Abstract Recently, there has been uch interest in accurate deterination of obile user locations in cellular environents. A general approach to this geolocation proble is to gather tie-of-arrival easureents fro a nuber of base stations (BSs and to estiate user locations using the traditional least square approach. However, in non-line-of-sight (NLOS situations, easureents are significantly biased. Hence, very large errors in location estiation ay be introduced when traditional techniques are adopted. For this reason, before eploying an algorith for location estiation, it is useful to know which BS s are in line-of-sight (LOS and which are in NLOS of the obile station. In this paper, a non-paraetric approach to this NLOS identification proble is proposed. Since the statistics of NLOS errors are usually unknown, a non-paraetric probability density estiation technique is eployed to approxiate the distribution of the easureents. Then, an appropriate etric is used to deterine the distance between the distribution of the easureents and the distribution of the easureent noise. Depending on the closeness of the distributions, the propagation environent is classified as LOS or NLOS. In a situation where reliability of easureents fro a BS is to be quantified, the distance can be used to represent the reliability of the easureents as well as to classify the station. I. INTRODUCTION Recently, the subject of obile positioning in wireless counication systes has drawn considerable attention. With accurate location estiation, a variety of new applications and services such as Enhanced-9, location sensitive billing, iproved fraud detection, intelligent transport syste (ITS and iproved traffic anageent will becoe feasible [2]. Mobile positioning using radiolocation techniques usually involves tie of arrival (TOA, tie difference of arrival (TDOA, angle of arrival (AOA, or signal strength (SS easureents, or soe cobination of these ethods. Multipath, non-line-of-sight (NLOS propagation and ultiple access interference are often the ain sources of errors in geolocation, and ake obile positioning challenging. Aong these error sources, NLOS is perhaps the ost crucial one. In an NLOS situation, TOA easureents that are used to estiate the distance between a obile station (MS and a base station (BS are severely biased. In this case, using traditional location algoriths ay result in large errors in location estiation []. However, if it is known that the MS is in NLOS with respect to a BS, then soe special ethods can be applied This research was supported in part by the U.S. National Science Foundation under Grant CCR-99-7936, and in part by the New Jersey Center for Wireless Telecounications. depending on the scenario. For exaple, if at least three BS s are in LOS of the MS, then the easureents fro the NLOS BS s can be discarded in obtaining a two-diensional location estiate. Alternatively, when the Recursive Weighting Algorith [2] is eployed to reduce NLOS propagation errors, knowledge of LOS/NLOS BS s becoes iportant. In other words, identification of NLOS BS s can help considerably to iprove the location estiation. The proble of NLOS identification is essentially a detection proble. It copares the LOS hypothesis to the NLOS hypothesis. The probability distribution of the easureents under the LOS hypothesis is usually known except for its ean. If the distribution under NLOS hypothesis is also assued to be known, then the proble can be solved by the conventional hypothesis testing ethod [3]. However, the probability distribution of NLOS errors, hence that of the easureents under the NLOS hypothesis, is usually unknown. Therefore, a technique which does not assue the knowledge of NLOS error statistics should be developed. In this paper, a non-paraetric NLOS identification technique is proposed. Since the statistics of TOA delays due to NLOS are not known exactly, a non-paraetric approach is adopted to approxiate the probability density function of the easureents. Then, a suitable distance etric between a known easureent error distribution and a nonparaetrically estiated distance easureent distribution is defined to deterine whether a given BS is within LOS or NLOS of the MS. The distance between these two distributions can also be used as a reliability easure for the easureents fro the given BS. The reainder of the paper is organized as follows. Section II forulates the proble of NLOS identification and describes the non-paraetric NLOS identification algorith. The perforance of the algorith is evaluated in Section III by siulation studies. Finally, soe concluding rearks are ade in Section IV. II. NON-PARAMETRIC NLOS IDENTIFICATION Consider a situation in which independent identically distributed (iid range easureents (obtained fro TOA easureents ultiplied by the speed of light between an MS and a BS are taken. Assue that the change in the location of the MS during these easureents can be ignored. Hence the distance between the MS and the BS can be considered approxiately constant for the geolocation purpose. Then, for 0-7803-7954-3/03/$7.00 2003 IEEE. 2544

the ith easureent, the hypotheses can be expressed as: : r i = d + n i H : r i = d + n i + e i, ( for i =,...,, where is the LOS hypothesis and H is the NLOS hypothesis. In the forer case, the easureent is odelled as the suation of the true distance d and a easureent noise, n i, while in the latter case, the NLOS error e i is also present, which is odelled by a positive rando variable. We assue that the easureent noise statistics are copletely known and is odelled by a zero ean Gaussian rando variable. However, neither d nor the probability density function of the NLOS error are known. Therefore, it is not possible to invoke conventional hypothesis testing techniques like generalized likelihood ratios. Let the probability density function (pdf of the easureent noise be p n (x, which is copletely known. Then, the pdf of the easureents in the LOS hypothesis case is given by p n (x d. Note that this distribution is copletely known except for one paraeter, d, which affects only the ean of the distribution. The ain idea in the non-paraetric NLOS identification test is to copare the closeness of this pdf to the pdf of range easureents. Thus we first approxiate the pdf of the range easureents non-paraetrically, copare the closeness of this pdf to the LOS pdf by defining a distance etric, and then decide LOS/NLOS after a threshold test. This test can be suarized as follows: Estiate the pdf of the distance fro iid range easureents. Let this estiate be denoted by ˆp r (x. 2 Calculate the distance between p n (x d and ˆp r (x for all possible d values and find the iniu distance. 3 Copare this iniu distance to a threshold: Decide if the iniu distance is saller than the threshold, and decide H otherwise. We will discuss these steps in ore details in the following subsections. A. Parzen Window Density Estiation In order to estiate the pdf of the distance, a non-paraetric density estiation technique, called Parzen window density estiation, is eployed, which approxiates the pdf using soe window functions around the saples. The reason for eploying this technique is its flexibility in choosing density estiation paraeters depending on the saple size. Given iid distance easureents r,..., r, the distance pdf can be estiated by the following forula [4] ˆp r (x = φ( x r i, (2 h h where φ(. is the window function and h is a scaling paraeter. The window function ust be a probability density function in order for ˆp r (x to be a valid pdf. In other words, it is always non-negative and integrates to one. Coonly used window functions include Gaussian and rectangular windows [4]. B. Distance Function After obtaining the approxiate pdf of the distance, our next step is to deterine whether these distance easureents are coing fro p n (x d or the pdf under the NLOS hypothesis. Since the pdf under the NLOS hypothesis is unknown, it is reasonable to copare the distance between p n (x d and ˆp r (x and accept the LOS hypothesis if the distance is saller than a threshold, that is, if the two distributions are sufficiently close. Since the true distance, d, is unknown, the iniu distance between p n (x d and ˆp r (x ust be calculated aong all possible d s. The Kullback-Leibler (KL distance [5] can be used to calculate the distance between two probability distributions. For given pdf s p and p 2, the KL distance between the is given by D(p p 2 = p (x log p (x dx. (3 p 2 (x C. Decision Criterion The decision criterion to deterine LOS or NLOS hypothesis becoes the following test: inf d {D(ˆp r (x p n (x d} δ, (4 H where δ is the threshold. If the value of the d iniizing the decision variable can be found, the test can be expressed siply as D(ˆp r (x p n (x ˆd δ. (5 H We assue that the easureent noise is a zero ean Gaussian rando variable, which is a valid approxiation when the TOA s are acquired with a atched filter approach at high signal-to-noise ratio (SNR [6]. Then, p n (x d is expressed as p n (x d = e (x d2 /(2σ 2. (6 2πσ In this case, the following result indicates the siplification of the decision test. Proposition 2. For a zero ean Gaussian easureent error and a syetric window function, i.e., φ(x =φ( x for all x, the value of d iniizing the distance function of (4 is the saple ean of the easureents, that is, ˆd = r i. Proof See Appendix A. Proposition 2. states that for a syetric window function, the iniu distance to be used in the decision criterion can be coputed by siply shifting the Gaussian easureent error pdf by the saple ean of the easureents and calculating the KL distance between this shifted pdf and the estiated pdf, ˆp r (x. 0-7803-7954-3/03/$7.00 2003 IEEE. 2545

Another iportant issue is the appropriate choice of the threshold value, δ. Since the pdf s are not known exactly under either hypothesis, it does not see possible to set the false alar (i.e. isinterpret a LOS situation as NLOS and iss detection (i.e. isinterpret an NLOS situation as LOS probabilities. However, the following result states that in soe situations the false alar probability can be set even though the true distance d is not known, that is, without any inforation about the ean of the rando variable under. Proposition 2.2 For a zero ean Gaussian easureent error and a syetric window function, the false alar probability can be set independently of the true distance between the obile and the base station. Proof See Appendix B Proposition 2.2 states that under suitable conditions the distance function is independent of the true distance under the LOS hypothesis and it is therefore theoretically possible to set the false alar rate. Under the conditions stated in the above two propositions, the decision test can be expressed as follows (Appendix C: ˆp r (x log( 2πσˆp r (xdx + ˆσ2 2σ 2 δ, (7 H where ˆσ 2 is the saple variance 2 of the range easureents, that is ˆσ 2 = (r i ˆd 2. Depending on the technique to locate the obile user, the classification of BS s ay not be necessary. Instead soe reliability inforation about the easureents fro each BS ight be required. In this case, the distance value between the LOS and NLOS pdf s can be used as a reliability inforation, which can help us to locate the obile ore accurately. III. SIMULATION RESULTS In this section, the perforance of the non-paraetric NLOS BS identification technique is evaluated. Figure shows the false alar probability, P FA, of the non-paraetric technique for different nubers of easureents ( =3, 5, 0, 5 when the iss detection probability, P MD, is set to 0.05. The easureent noise is odelled by N (0, 00 2 and the NLOS error is odelled by an exponential rando variable with ean 25. The unit variance Gaussian window is used and the scaling paraeter h is set to 20. In this scenario, we see that we do not need any saples to have a reliable decision. In order to evaluate the perforance of the technique for different NLOS errors, we plot the false alar probability for different NLOS errors. Specifically, we change the ean of the exponential rando variable representing the NLOS error and plot P FA when P MD =0.0 and the nuber of saples is 0. The Gaussian easureent noise and the paraeters of the Parzen window density estiation technique are kept the sae as in the previous case. Figure 2 shows the perforance of the 2 The saple variance is often defined as s 2 = (r i r 2, where r is the saple ean. This definition akes s 2 an unbiased estiate of the population variance. P FA 0 0 0 0 2 0 3 2 4 6 8 0 2 4 6 Nuber of Measureents Fig.. False alar probability versus the nuber of easureents when P MD =0.05. P FA 0 0 0 0 2 5 0 5 20 25 Mean of exponential NLOS error (. Fig. 2. False alar probability versus NLOS errors when P MD =0.0 and =0. technique for different NLOS noise levels. Obviously, the test becoes ore successful as the ean (hence the variance of the exponential rando variable increases since it becoes easier to distinguish between the two hypotheses. IV. CONCLUSION A non-paraetric test to deterine whether a given BS is in LOS or NLOS of the MS in question has been proposed. A suitable distance etric between a known easureent error distribution and a non-paraetrically estiated distance distribution has been defined to deterine whether a given BS is within LOS or NLOS of the MS. The perforance of the algorith has been evaluated by siulation experients. Future work includes ore detailed siulation studies under 0-7803-7954-3/03/$7.00 2003 IEEE. 2546

different NLOS scenarios and coparisons to other techniques [3]. REFERENCES [] J. Caffery, Jr. and G. Stuber, Subscriber location in CDMA cellular networks, IEEE Trans. Vehicular Technology, vol. 47, pp. 406-46, May 998. [2] J. Caffery, Jr., Wireless location in CDMA cellular radio systes, Kluwer Acadeic Publishers, Boston, 2000. [3] J. Borras, P. Hatrack and N. B. Mandaya, Decision theoretic fraework for NLOS identification, Proc. IEEE Vehicular Technology Conference, Spring 998, vol. 2, Ottawa, Canada, pp. 583-587. [4] R. O. Duda, P. E. Hart and D. D. Stork, Pattern classification, Wiley- Interscience, New York, 200. [5] S. Kullback, Inforation theory and statistics, Wiley, New York, 959. [6] C. E. Cook and M. Bernfeld, Radar signal: an introduction to theory and application, Acadeic Press, 970. APPENDIX A. Proof of Proposition 2. Using (3, the KL distance between ˆp r (x and p n (x d can be expressed as ˆp r (x D(ˆp r (x p n (x d = ˆp r (x log dx. (8 p n (x d After inserting (6 and soe anipulations, we get D(ˆp r (x p n (x d = ˆp r (x log( 2πσ ˆp r (x dx + 2σ 2 (x d 2 ˆp r (x dx. (9 When we differentiate (9 with respect to d and equate to zero, we get x ˆpr (x dx ˆd = ˆpr (x dx. (0 Since ˆp r (x is a probability density function, the denoinator is equal to unity. For the nuerator, if we consider a syetric window function, ( x ri x ˆp r (x dx = xφ dx h h = r i ( where we use the fact that window functions integrate to unity. So the optial value of d is the saple ean of the easureents, ˆd = r i, (2 which iniizes (8. B. Proof of Proposition 2.2 The false alar probability is the probability that the KL distance between ˆp r (x and p n (x d exceeds the threshold given that is the true hypothesis. That is, P FA = Pr{D(ˆp r (x p n (x ˆd >δ }. (3 Under, each easureent is equal to su of the true distance and easureent noise, which is a zero ean Gaussian rando variable. In other words, r i = d + n i for i =,...,. Fro Proposition 2., the optial value of d iniizing the KL distance of (4 is given by ˆd = r i. Inserting (2 and (6 in (5, we get D(ˆp r(x p n(x ˆd = ( φ( x ri log h h h φ( x ri log h h φ( x ri dx h dx.(4 ( 2πσ e (x ˆd/(2σ 2 Note that x ˆd = x r i = x d n i, since r i = d+n i under. So defining a new duy variable for the integrals as u = x d, we obtain D(ˆp r(x p n(x ˆd = ( φ( u ni log φ( u ni du h h h h φ( u ( ni log e (u n i /(2σ 2 du, h h 2πσ which solely depends on n,..., n. Therefore, it is possible to set the false alar rate without any inforation about the true distance, d. C. Derivation of Equation (7 The decision test in the Gaussian easureent noise case can be expressed as D(ˆp r (x p n (x ˆd δ, (5 H where ˆd = r i. Siilar to (9, the distance can be expressed as D(ˆp r (x p n (x ˆd = ˆp r (x log( 2πσ ˆp r (x dx + 2σ 2 (x ˆd 2 ˆp r (x dx. (6 The integral in the second ter can be expressed as follows: (x ˆd 2 ˆp r (x dx = (x 2 2 h ˆdx+ ˆd 2 φ( x r i dx, h (7 which can be shown equal to (x ˆd 2 ˆp r (x dx = {h [h 2 h σw+r 2 i 2 ] 2 ˆdr i h + ˆd 2 h }, (8 where σw 2 = x 2 φ(xdx. After cobining the ters we get (x ˆd 2 ˆp r (x dx = h 2 σw 2 +ˆσ 2, (9 0-7803-7954-3/03/$7.00 2003 IEEE. 2547

where ˆσ 2 = (r i ˆd 2 is the saple variance of the easureents. Inserting (9 in (6, the final decision test can be expressed as in (7 where δ = δ h 2 σ 2 w/(2σ 2. 0-7803-7954-3/03/$7.00 2003 IEEE. 2548