Losless compression of LIGO data.

Similar documents
LASER INTERFEROMETER GRAVITATIONAL WAVE OBSERVATORY - LIGO - CALIFORNIA INSTITUTE OF TECHNOLOGY MASSACHUSETTS INSTITUTE OF TECHNOLOGY

Non-Linear Response of Test Mass to External Forces and Arbitrary Motion of Suspension Point

Validation of the Source-Detector Simulation Formulation

( nonlinear constraints)

Module 2 LOSSLESS IMAGE COMPRESSION SYSTEMS. Version 2 ECE IIT, Kharagpur

Case Study: Faraday Rotators in LIGO

- An Image Coding Algorithm

ANALYSIS OF BURST SIGNALS IN LIGO DATA. Irena Zivkovic, Alan Weinstein

Digital Image Processing

<Outline> JPEG 2000 Standard - Overview. Modes of current JPEG. JPEG Part I. JPEG 2000 Standard

Beam Splitter Optical Surface Deformation due to Gravity

Multiresolution image processing

(z) UN-1(z) PN-1(z) Pn-1(z) - z k. 30(z) 30(z) (a) (b) (c) x y n ...

Drag Wiping with Methanol vs First Contact

PO Beam Waist Size and Location on the ISC Table

Wavelets Marialuce Graziadei

Image Compression. 1. Introduction. Greg Ames Dec 07, 2002

S5 V3 h(t) review and validation

+ (50% contribution by each member)

L. Yaroslavsky. Fundamentals of Digital Image Processing. Course

ECE533 Digital Image Processing. Embedded Zerotree Wavelet Image Codec

BASICS OF COMPRESSION THEORY

Implementing an Alignment Sensing and Control (ASC) System for the 40m Prototype Interferometer

Multiscale Image Transforms

Lifting Parameterisation of the 9/7 Wavelet Filter Bank and its Application in Lossless Image Compression

1 The Continuous Wavelet Transform The continuous wavelet transform (CWT) Discretisation of the CWT... 2

Installation and testing of L1 PSL table legs

Wavelets in Image Compression

LASER INTERFEROMETER GRAVITATIONAL WAVE OBSERVATORY - LIGO - CALIFORNIA INSTITUTE OF TECHNOLOGY MASSACHUSETTS INSTITUTE OF TECHNOLOGY

6WDWXVRI/,*2. Laser Interferometer Gravitational-wave Observatory. Nergis Mavalvala MIT IAU214, August 2002 LIGO-G D

LASER INTERFEROMETER GRAVITATIONAL WAVE OBSERVATORY - LIGO - CALIFORNIA INSTITUTE OF TECHNOLOGY MASSACHUSETTS INSTITUTE OF TECHNOLOGY

LASER INTERFEROMETER GRAVITATIONAL WAVE OBSERVATORY - LIGO - CALIFORNIA INSTITUTE OF TECHNOLOGY MASSACHUSETTS INSTITUTE OF TECHNOLOGY

Spatial Dependence of Force in the Initial LIGO OSEMs

Wavelet Filter Transforms in Detail

Number Representation and Waveform Quantization

Niklas Grip, Department of Mathematics, Luleå University of Technology. Last update:

Image and Multidimensional Signal Processing

4. Quantization and Data Compression. ECE 302 Spring 2012 Purdue University, School of ECE Prof. Ilya Pollak

SIGNAL COMPRESSION. 8. Lossy image compression: Principle of embedding

DISCRETE HAAR WAVELET TRANSFORMS

UNIT I INFORMATION THEORY. I k log 2

Discrete Wavelet Transform

Applied Cartography and Introduction to GIS GEOG 2017 EL. Lecture-2 Chapters 3 and 4

Wavelets and Signal Processing

Cosmography and Black Hole Spectroscopy by Coherent Synthesis of the Terrestrial and Space GW Antennae Network: Orbit Optimization

Moments conservation in adaptive Vlasov solver

Signal Processing With Wavelets

Proyecto final de carrera

LIGO I mirror scattering loss by non smooth surface structure

Lecture 7. Fourier Analysis

The LIGO Observatory Environment

INTEGER WAVELET TRANSFORM BASED LOSSLESS AUDIO COMPRESSION

Analog Digital Sampling & Discrete Time Discrete Values & Noise Digital-to-Analog Conversion Analog-to-Digital Conversion

PHYS Statistical Mechanics I Assignment 4 Solutions

Discrete Wavelet Transform: A Technique for Image Compression & Decompression

Module 5 EMBEDDED WAVELET CODING. Version 2 ECE IIT, Kharagpur

LIGO Laboratory / LIGO Scientific Collaboration LIGO. aligo BSC-ISI, Pre-integration Testing report, Phase II (before cartridge install) E V6

Analysis of Fractals, Image Compression and Entropy Encoding

Source Coding. Master Universitario en Ingeniería de Telecomunicación. I. Santamaría Universidad de Cantabria

V j+1 W j+1 Synthesis. ψ j. ω j. Ψ j. V j

Wavelet Transforms That Map Integers to Integers

Signal Processing With Wavelets

EFFICIENT IMAGE COMPRESSION ALGORITHMS USING EVOLVED WAVELETS

Run-length & Entropy Coding. Redundancy Removal. Sampling. Quantization. Perform inverse operations at the receiver EEE

Wavelets in Pattern Recognition

Vector Quantization Encoder Decoder Original Form image Minimize distortion Table Channel Image Vectors Look-up (X, X i ) X may be a block of l

Laser Interferometer Gravitational-Wave Observatory (LIGO)! A Brief Overview!

measuring GW polarizations beyond GR recent results and future prospects

Module 4. Multi-Resolution Analysis. Version 2 ECE IIT, Kharagpur

A WAVELET BASED CODING SCHEME VIA ATOMIC APPROXIMATION AND ADAPTIVE SAMPLING OF THE LOWEST FREQUENCY BAND

1. Write a program to calculate distance traveled by light

Fault Tolerance Technique in Huffman Coding applies to Baseline JPEG

Nontechnical introduction to wavelets Continuous wavelet transforms Fourier versus wavelets - examples

Haar wavelets. Set. 1 0 t < 1 0 otherwise. It is clear that {φ 0 (t n), n Z} is an orthobasis for V 0.

On Compression Encrypted Data part 2. Prof. Ja-Ling Wu The Graduate Institute of Networking and Multimedia National Taiwan University

Data Structures in Java

Fast Progressive Wavelet Coding

ETSF15 Analog/Digital. Stefan Höst

Examples MAT-INF1100. Øyvind Ryan

ISC In-vacuum Gouy phase telescopes

Let p 2 ( t), (2 t k), we have the scaling relation,

Nearest Neighbor Search with Keywords: Compression

Compression. What. Why. Reduce the amount of information (bits) needed to represent image Video: 720 x 480 res, 30 fps, color

Fundamentals of the DFT (fft) Algorithms

LIGO On-line Documents July 1997

ECE472/572 - Lecture 11. Roadmap. Roadmap. Image Compression Fundamentals and Lossless Compression Techniques 11/03/11.

The Discrete Kalman Filtering of a Class of Dynamic Multiscale Systems

LASER INTERFEROMETER GRAVITATIONAL WAVE OBSERVATORY - LIGO - CALIFORNIA INSTITUTE OF TECHNOLOGY MASSACHUSETTS INSTITUTE OF TECHNOLOGY

Module 3 LOSSY IMAGE COMPRESSION SYSTEMS. Version 2 ECE IIT, Kharagpur

Wavelets and Multiresolution Processing

The information loss in quantization

WAVELET TRANSFORMS THAT MAP INTEGERS TO INTEGERS. 1. Introduction

Arithmetic in Integer Rings and Prime Fields

Computing Infrastructure for Gravitational Wave Data Analysis

Image Compression Using the Haar Wavelet Transform

From Fourier Series to Analysis of Non-stationary Signals - II

Digital Image Processing

Coding for Discrete Source

EE67I Multimedia Communication Systems

1 Introduction to information theory

Transcription:

LASER INTERFEROMETER GRAVITATIONAL WAVE OBSERVATORY -LIGO- CALIFORNIA INSTITUTE OF TECHNOLOGY MASSACHUSETTS INSTITUTE OF TECHNOLOGY Technical Note LIGO-T000076-00- D 08/7/2000 Losless compression of LIGO data. S.Klimenko, G.Mitselmakher, A.Sazonov This is an internal working note of the LIGO Project. California Institute of Technology Massachusetts Institute of Technology LIGO Project - MS 18-34 LIGO Project - MS NW17-161 Pasadena CA 91125 Cambridge, MA 02139 Phone (626) 395-2129 Phone (617) 253-4824 Fax (626) 304-9834 Fax (617) 253-4824 E-mail: info@ligo.caltech.edu E-mail: info@ligo.mit.edu WWW: http://www.ligo.caltech.edu/ file /tex/ionote/t000076.ps

Abstract A method for lossless compressionof LIGO data based oninteger wavelet transforms is described. It works in combination with fast and simple data encoder optimized for compression of Gaussian random signals. The results of data compression for the engineering run data are presented for different compression algorithms. Contents 1 Introduction 2 2 Random Data Encoder. 2 3 Integer wavelet transform. 3 3.1 Liftin g Wavelet Tran sform.............................. 4 3.2 Wavelet Tran sform Bin ary Tree........................... 6 4 Compression of LIGO data. 6 5 Conclusion 7 6 Acknowledgments 7 1

1 Introduction LIGO is one of the most data-intensive projects. The expected total bit-rate is 15MB/s and the full 2 year LIGO data stream will yield about 1 Petabyte of data. The designof the data reduction procedures, that produce scientific data sets, is one of the important tasks of the LIGO data analysis. There are four levels of the data, ranging from the full interferometer data (Level 0) to whitened GW strain data (Level 3, 1/1000 of the full data stream) [1]. The full data stream will be available for about 16 hours after acquisition, but will not be archived. The data will be processed to form the Archived Reduced Data Set (Level 1) that will be about 1/10 of the full data stream. At this two stages of analysis it s important to save all useful data with minimal losses. Thus, fast and efficient lossless data compression can be essential for generation of the Archived Reduced Data Set Inthis paper we discuss lossless compressionof the LIGO data with wavelets. The LIGO data is mainly Gaussian noise with admixture of non-gaussian signals. The idea of using wavelets is to decompose data into components that can be fairly well described as a white Gaussiannoise. Inother words, wavelets are used to decorelate the data, that means the representation of data in wavelet domain is more compact then the original representation. We present a method for lossless data compression based on the lifting wavelet transform that maps integers to integers. The wavelet transform works in combination with the rdc (stands for random data compression) encoder that is optimized for compression of random Gaussian signals. For this analysis the engineering run data (Hanford, April, 2000) was used. The results of compression for different data channels are presented in comparison with other compression techniques. 2 Random Data Encoder. The output of LIGO data channels is digitized with 16 bit ADCs and sampled at the rate between1hz - 16kHz. The typical data set is a time series where each sample is represented with a 2 bytes word, so the total length of the data set with N samples is 2 N bytes. First, we consider the compression of white Gaussian noise (WGN) signal with rms σ n.the average number of bits needed to encode such a signal is proportional to log 2 (σ n ). Figure 2 shows how many bits (k) is needed to encode data samples of the WGN signal with σ n = 1000. Since there is no correlation between the samples, we could estimate the minimal length of the encoded signal, L min = i k i (assuming that we need 0 bits to encode zero). It is not achievable in practice, because of the overhead due to service records that define structure of the encoded data. However, different algorithms are possible, when the encoded data length L is quite close to the L min. The simplest one is to divide the data on short and long samples, which are encoded using words of k S and k L bits long respectively. The k L is just a maximum number of bits needed to encode samples for a given data set. The k S can be found from minimizing of the following equation: 2

600 events 500 400 300 200 100 0 0 2 4 6 8 10 12 14 16 bits Figure 1: Number of bits required to encode white Gaussian noise signal with rms=10 (left histogram) and rms=1000 (right histogram). L(k S )=N S (k S ) k S + N L (k S ) (k L + k o ), (1) where, N S and N L is the number of short and long samples respectively (N = N S +N L ), and k o is the overhead constant. The contiguous sequence of short samples ended with a long sample forms a data block, so the k o, actually, is the length of the block service word, that has the information about the number of short and long samples inthe block. Using this algorithm we developed a simple data encoder (rdc), that produces data with the structure showninfigure 2. We tested the rdc encoder along with the standard one (gzip) onthe WGN signals. The results are showninfigure 2. For the WGN signals the rdc encoder gives better compression factor then gzip 9 (best compression) encoder. Thought we didn t investigate the time efficiency of the rdc encoder accurately (currently we are running the rdc inthe ROOT CINT interpreter), it is at least 5 times faster thenthe gzip 1 (fastest) encoder. 3 Integer wavelet transform. For lossless compression an invertible wavelet transform that maps integers to integers is needed. Another requirement is to find the wavelet representation of the data quickly. For this reasons we use the biorthogonal lifting wavelet transform [3, 4]. It can map integers and it allows to switch between the original data and its wavelet representation in a time proportional to the size of the data. Below we describe the lifting wavelet transform that we have implemented for the purpose of lossless compressionof the LIGO data. 3

number of short words number of long words Block Service Word: kbsw-1 bit short words 1bit long word BSW (kbsw bit) ks bit ks bit kl bit Block: layer padded with zeroes Layer header Block Block Block 0 total # of 32-bit words in layer 32 bit # of uncompressed 16 bit words in layer shift & zero 32 bit 32 bit Layer service word (32 bit) compression opt ks kl kbsw... 8bit 4bit 4bit 4bit Layer service word Figure 2: Structure of the encoded data. 3.1 Lifting Wavelet Transform The basic idea behind lifting wavelets is that the transform can be done in three stages: split, predict and update. At the first stage the data x(t) is split into two subsets (or layers) x o and x e. We simply put odd samples inthe x o layer and even samples in the x e layer. Inthe second stage, based onthe correlationpresent inthe original data, the x e layer is used to predict the x o layer. Inpractice it might not be possible to predict the odd layer exactly, however, the prediction P (x e ) is likely to be close to the x o. Subtracting the prediction from the x o we can get the detail wavelet coefficients d = x o P (x e ), (2) that retainmuch less informationif the predictionis reasonable. The update stage is used to generate the approximation wavelet coefficients. It s necessary to preserve the mean of the data for the approximationlayer that represents the coarse structure of the signal: a = x e + U(d), ā = x. (3) For example, for the Haar wavelet the predict and update stages are P (x e )=x e, U(d) = 1 d. (4) 2 Figure 3.1(a) shows all three stages of the lifting wavelet transform. Since the initial data set is integer, we can easily get the integer wavelet transform by modifying the predict and 4

1 Ratio(compressed/uncompressed) 0.9 0.8 0.7 0.6 0.5 0.4 0.3 0.2 limit gzip rdc 0.1 0-2 -1 0 1 2 3 4 5 6 7 8 9 10 11 Log 2 (rms) Figure 3: Compressionof white Gaussiannoise with different rms. The gzip compressionflag is -9. update operators: P I = int(p ), U I = int(u). (5) The transform can be done in place (using the same space allocated for the data array x(t)) and it s easily reversible (Figure 3.1(b)). Currently we use a family of symmetric, biorthogonal wavelet transforms. It s predict and update operators can be build using the Lagrange interpolation formula: P = i x ei h i, U = 1 d i h i, h i = 2 i j (t t j ), i =1, 2,...NP, i j. (6) j (t i t j ) The coefficients h i for the first two wavelet transforms NP =2, 4 are respectively (1/2, 1/2), ( 1/16, 9/16, 9/16, 1/16). One lifting step produces the detail and approximation coefficients and then the next step canbe applied to the data inthe approximationlayer. Because of the specific select procedure, each next lifting step yields a detail layer which is twice shorter in length then the previous one. Thus the length of the initial data set x should be divisible by 2 m,wherem is the number of steps. 5

decomposition (a) Input x(t) Split even odd Predict Update approximation detail input to the next stage store as a result of decomposition reconstruction (b) input from the previous stage input of stored elements approximation detail Update Predict even odd Merge Figure 4: Lifting wavelet transform: a - decomposition, b - reconstruction 3.2 Wavelet Transform Binary Tree The conventional wavelet transform described above can be illustrated graphically with the wavelet decompositiontree (Fig. 3.2(a)). We canconsider the detail coefficients as time series and apply the wavelet transform to decompose the detail layers further. In this case the result of decompositionwill be a binary tree (Fig. 3.2(b)). If there is a complete decompositionof depth m, the lower layer of the tree has 2 m nodes, each N/2 m samples long, where N is the number of samples in the original data. Since each transformation step doubles the frequency resolution, the nodes represent data in equidistant frequency bands, that is different compare to the traditional transform where the wavelet layers have different frequency resolution (dyadic basis). 4 Compression of LIGO data. We used the engineering run data collected in April 2000 to test different compression methods. We compared three encoders: gzip (the ompressionflag is -9), eri [2] and the rdc encoder described above. The encoders were applied to the original data in time domain (TD) and the decorrelated data. To decorrelate data we used differentiation, wavelet transform (NP=6) and wavelet binary tree transform (NP=6). The results of compression are shown in Figure 4. The combinations wavelet + rdc shows better compressionratio thenthe other methods. Compare 6

d 0 d 0 d 1 d 2 d 1 a d 3 d 4 a a. wavelet transform tree b. binary wavelet transform tree d 2 Figure 5: Wavelet transform trees. to the traditional diffirentiation + gzip method, the average compressionratio for 16kHz channels is better by 20% and for the 2kHz channels the improvement is 30%. Note, that he combination of the rsd and gzip encoders (last column) doesn t show any improvement in the compressionfactor. 5 Conclusion The lossless compressionof LIGO data canbe a useful tool for generationof the Archived Data Set. The expected compression factor is around 2, that is essential for saving of the data storage space. We suggest to use wavelet transforms and the rdc encoder to decorrelate and compress the LIGO data. This method is fast, simple and offers better compression factor then the other methods. 6 Acknowledgments The authors would like to thank A.Studnik for help with the data processing. This work is supported by the NSF grant PHY-9722114. 7

Compression ratios for 16kH data channels and different compression methods Data type => Time domain data Time domain data with d Wavelet domein data WTree Compression method => gzip ERI RDC gzip ERI RDC gzip ERI RDC RDC RDC+ Channel name gzip H2:IOO-MC_F 1.48 2.30 1.55 1.88 2.06 2.33 1.87 1.98 2.40 2.35 2.36 H2:IOO-MC_I 1.41 1.68 1.75 1.38 1.63 1.65 1.39 1.61 1.75 1.78 1.78 H2:PSL-FSS_FAST_F 3.34 6.30 1.92 5.06 6.35 5.98 4.66 5.58 5.27 4.64 4.69 H2:PSL-FSS_MIXERM_F 1.22 1.35 1.34 1.24 1.35 1.40 1.24 1.35 1.42 1.44 1.44 H2:PSL-FSS_PCDRIVE_F 1.20 1.36 1.33 1.21 1.36 1.35 1.22 1.29 1.40 1.42 1.43 H2:PSL-ISS_ISERR_F 3.05 4.04 3.08 3.17 4.04 3.86 3.10 3.98 3.91 3.72 3.76 H2:LSC-AS_Q_TEMP 2.33 4.35 2.08 3.34 3.57 3.82 3.41 4.25 4.31 4.05 4.08 H2:LSC-AS_I_TEMP 1.13 1.24 1.22 1.16 1.26 1.27 1.17 1.29 1.30 1.43 1.44 H2:LSC-AS_DC_TEMP 3.64 6.01 2.46 4.96 6.23 6.00 4.61 5.68 5.48 4.81 4.86 Average ratio 1.72 2.19 1.71 1.89 2.13 2.18 1.88 2.11 2.23 2.24 2.25 Compression ratios for 2kH data channels and different compression methods Data type => Time domain data Time domain data with d Wavelet domain data WTree Compression method => gzip ERI RDC gzip ERI RDC gzip ERI RDC RDC RDC+ Channel name gzip H0:PEM-BSC7_ACCX 1.60 1.90 2.00 1.47 1.70 1.83 1.52 1.71 2.05 2.12 2.14 H0:PEM-BSC5_ACCZ 1.84 2.34 2.28 1.71 1.97 2.20 1.77 1.88 2.41 2.46 2.48 H0:PEM-BSC7_ACCZ 1.73 2.21 2.21 1.66 1.87 2.18 1.71 1.86 2.27 2.27 2.29 H2:ASC-ETMX_P 1.12 1.17 1.21 1.08 1.09 1.13 1.11 1.12 1.22 1.23 1.10 H2:ASC-BS_P 1.17 1.27 1.30 1.12 1.17 1.22 1.15 1.20 1.31 1.29 1.30 H0:PEM-HAM7_ACCX 1.64 2.20 2.06 1.48 1.86 1.92 1.56 1.77 2.14 2.24 2.25 H0:PEM-BSC5_ACCY 1.72 2.15 2.20 1.54 1.74 1.98 1.61 1.77 2.17 2.27 2.28 H2:SUS-EMTX_SENSOR_UL 1.87 2.09 2.40 1.67 1.94 2.17 1.78 1.88 2.47 2.50 2.52 Average ratio 1.53 1.80 1.84 1.43 1.59 1.72 1.48 1.59 1.88 1.91 1.88 Figure 6: Compression ratio for different LIGO channels. 8

References [1] LIGO-T990104-04-D, 1999 LSC Data Analysis White Paper. [2] http://www.geocities.com/siliconvalley/ [3] Wim Sweldens, Peter Schrder. Wavelets in Computer Graphics, ACM SIGGRAPH Course Notes, 1996. Building your own wavelets at home. [4] R. C. Calderbank, Ingrid Daubechies, Wim Sweldens, and Boon-Lock Yeo Applied and Computational Harmonic Analysis (ACHA), Vol. 5, Nr. 3, pp. 332-369, 1998. Wavelet Transforms that Map Integers to Integers 9