Comparing the processing efficiency of the parallel FDTD method on the Intel Phi processor and IBM s Bluegene/Q
|
|
- Preston Stevenson
- 5 years ago
- Views:
Transcription
1 Comparig the processig efficiecy of the parallel FDTD method o the Itel Phi processor ad IBM s Bluegee/Q R. Ilger, D. B. Davidso Computatioal Electromagetics Group Departmet of Electrical ad Electroic Egieerig Uiversity of Stellebosch
2 Outlie CHPC cof, Dec FDTD i the cotext of HPC 2 Parallelisatio of the FDTD 3 Efficiecy of the FDTD o the Itel Phi 4 Efficiecy of the FDTD o the Blue Gee/Q 5 Coclusios ad Recommedatios Compare FDTD o Phi ad BGQ 1/28
3 FDTD i the cotext of HPC Electromagetic modellig, 3 parts to applicatio: 1) Data iput facility such as Autocad or other GUI,Meshig 2) Computatioal model processig egie FDTD 3) Visualisatio egie & save results big data Remove variatio owig to differet algorithms, programmig styles, data models. Performace i terms of efficiecy Compare FDTD o Phi ad BGQ 2/ 28
4 FDTD as a process CHPC cof, Dec 2013 Total ru time of FDTD program 1520s 100% Time cosumed by FDTD differece equatios 1499s 98.62% Time cosumed by calculatig FDTD boudaries 20s 1.32% Start-up time before FDTD iteratios 1s 0.06% Embarrassigly parallel? Not really! Memory boud Compare FDTD o Phi ad BGQ 3/28
5 Maxwell s equatios ad FDTD Yee grid. cell sizig boudary coditios, source ad reflectors. FDTD computatioally sparse. eeds low latecy data supply. subject to data orderig ad coalescece. Compare FDTD o Phi ad BGQ 4/28
6 FDTD equatios CHPC cof, Dec H x(i,j,k) = D ax (i,j,k) H x i,j,k D bx (i,j,k) E z i,j,k + D bx (i,j,k) E y i,j,k E z(i,j 1,k) E y (i,j,k 1) H y (i,j,k) = D ay (i,j,k) H y i,j,k D by (i,j,k) E x i,j,k + D by (i,j,k) E z i,j,k E x(i,j,k 1) E y (i 1,j,k) H z(i,j,k) = D az (i,j,k) H z i,j,k D bz (i,j,k) E y i,j,k + D bz (i,j,k) E x i,j,k E y (i 1,j,k) E x(i,j 1,k) E x(i,j,k) = C ax (i,j,k) E x i,j,k + C bx (i,j,k) H z i,j +1,k H z(i,j,k) C bx (i,j,k) H y i,j,k H y i,j,k J source (x) E y (i,j,k) = C ay (i,j,k) E y i,j,k + C by (i,j,k) H x i,j,k+1 H x(i,j,k) C by (i,j,k) H z i+1,j, H z(i,j,k) J source (y ) Courat coditios: +1 E z(i,j,k) = C az (i,j,k) E z i,j,k C bz (i,j,k) H y i+1,j,k C bz (i,j,k) H x i,j +1,k H y (i,j,k) H x(i,j,k) J source (z) Δt 1 c 1 x y z 2 Compare FDTD o Phi ad BGQ 5/28
7 Outlie CHPC cof, Dec FDTD i the cotext of HPC 2 Parallelisatio of the FDTD 3 Efficiecy of the FDTD o the Itel Phi 4 Efficiecy of the FDTD o the Blue Gee/Q 5 Coclusios ad Recommedatios Compare FDTD o Phi ad BGQ 6/28
8 Programmig methods, task parallel MPI_Recv(Receive colum of magetic frige values from thread B to replace i thread A) MPI_Sed(Sed colum of electric frige values from thread A to replace i thread B) #pragma omp parallel for private(i,j) for (i = 0; i < ie; i++) { for (j = 1; j < je; j++) { ex[i][j] = caex[i][j]*ex[i][j]+cbex[i][j] * ( hz[i][j]- hz[i][j-1]); } } Compare FDTD o Phi ad BGQ 7/28
9 Programmig, vector registers Vector registers iheret i some cotemporary cores: Sigle float 32 Double float 64 SSE 128 AVX or QPX 256 AVX2 512 AVX specificatio up to 1024 bits Compare FDTD o Phi ad BGQ 8/28
10 DTD with AVX or QPX usig compiler itrisics AVX with 8 floats per operatio for (i = 1; i < ie; i++) { for (j = 0; j < je; j+=8) { aoh0=( m256 *)&ex1[i][j]; aoh1=( m256 *)&caex[i][j]; aoh2=( m256 *)&cbex[i][j]; aoh3=( m256 *)&hz1[i][j]; Outlie aoh5=( m256 *)&ey1[i][j]; aoh6=( m256 *)&caey[i][j]; aoh7=( m256 *)&cbey[i][j]; aoh8=( m256 *)&hz1[i-1][j]; hz_shift=&hz1[i][j-1]; amdst=( m256*)hz_shift amtar=_mm256_loadu_ps(amdst); am1 = _mm256_mul_ps(*aoh0,*aoh1); am2 = _mm256_sub_ps(*aoh3,amtar); am3 = _mm256_mul_ps(*aoh2,am2); am4 = _mm256_add_ps(am1,am3); am5 = _mm256_mul_ps(*aoh5,*aoh6); am6 = _mm256_sub_ps(*aoh8,*aoh3); am7 = _mm256_mul_ps(*aoh6,am6); am8 = _mm256_add_ps(am5,am7); _mm256_store_ps(&ey1[i][j],am8); _mm256_store_ps(&ex1[i][j],am4); QPX with 4 doubles per operatio for (i = 1; i < ie; i++) { for (j = 0; j < je; j+=4) { am1=vec_ld(0l,&hz1[i][j]); am2=vec_ld(0l,&dahz[i][j]); am3=vec_ld(0l,&dbhz[i][j]); am4=vec_ld(0l,&ex1[i][j]); am5=vec_ld(0l,&ey1[i][j]); am6=vec_ld(0l,&ex1[i][j+1]); am7=vec_ld(0l,&ey1[i+1][j]); am8=vec_mul(am1,am2); am9=vec_sub(am6,am4); am10=vec_sub(am7,am5); am11=vec_add(am9,am10); am12=vec_mul(am11,am3); am13=vec_add(am12,am8); vec_st(am13, 0, &hz1[i][j]);
11 Throughput scale with SSE or AVX Compare FDTD o Phi ad BGQ 10/28
12 Outlie CHPC cof, Dec FDTD i the cotext of HPC 2 Parallelisatio of the FDTD 3 Efficiecy of the FDTD o the Itel Phi 4 Efficiecy of the FDTD o the Blue Gee/Q 5 Coclusios ad Recommedatios Compare FDTD o Phi ad BGQ 11/28
13 Schematic of Itel Phi architecture 60 cores 4 hardware threads per core 8GB Ru i ative mode GDDR5 memory Fully coheret L2 cache bit VPU per core Symmetric desig, MC receive evely distributed & Compare FDTD o Phi ad BGQ 12/28
14 Efficiecy of FDTD o the Itel Phi Compare FDTD o Phi ad BGQ 13/28
15 DTD with loop combiatio Two loop structures: CHPC cof, Dec 2013 Loop i from 0 to m { Loop j from 0 to { ex[i][j] = caex[i][j] * ex[i][j] + cbex[i][j] * ( hz[i][j] - hz[i][j-1] ); } } Loop i from 0 to m { Loop j from 0 to { ey[i][j] = caey[i][j] * ey[i][j] + cbey[i][j] * ( hz[i-1][j] - hz[i][j] ); } } Sigle loop structure results i more commo data residet i cache: Loop i from 0 to m { Loop j from 0 to { ex[i][j] = caex[i][j] * ex[i][j] + cbex[i][j] * ( hz[i][j] - hz[i][j-1] ); ey[i][j] = caey[i][j] * ey[i][j] + cbey[i][j] * ( hz[i-1][j] - hz[i][j] ); } }
16 Code re-arragemet, NAG Compare FDTD o Phi ad BGQ 15/28
17 Outlie CHPC cof, Dec FDTD i the cotext of HPC 2 Parallelisatio of the FDTD 3 Efficiecy of the FDTD o the Itel Phi 4 Efficiecy of the FDTD o the Blue Gee/Q 5 Coclusios ad Recommedatios Compare FDTD o Phi ad BGQ 16/28
18 Blue Gee/Q A2 processor compoets CHPC cof, Dec PowerPC-A2 processor cores ruig at 1.6GHz. 16 of them are User cores, 1 is for system maagemet (hadles iterrupts message passig, etc) ad the 18th is a spare, for icreased fault tolerace. 4 chaels to 16GB DDR3 memory 55w per ode Compare FDTD o Phi ad BGQ 17/28
19 Blue Gee/Q CHPC cof, Dec 2013 Compare FDTD o Phi ad BGQ 18/28
20 FDTD throughput o Itel Phi, A2 processors Compare FDTD o Phi ad BGQ 19/28
21 Multiple threads per ode CHPC cof, Dec 2013 Compare FDTD o Phi ad BGQ 20/ 28
22 FDTD o previous versios of the Bluegee Compare FDTD o Phi ad BGQ 21/28
23 FDTD o GPU cluster ad Blue Gee/P FDTD computatioal performace o small GPU cluster ad Blue Gee/P is of similar order. Compare FDTD o Phi ad BGQ 22/28
24 Evolutio of FDTD processig o the Blue Gee Per rack BG/P 4096 cores BG/Q cores Compare FDTD o Phi ad BGQ 23/28
25 QPX o Blue Gee/Q CHPC cof, Dec 2013 Compare FDTD o Phi ad BGQ 24/28
26 Blue Gee/Q immese dimesios, 24 racks Compare FDTD o Phi ad BGQ 25/ 28
27 Outlie CHPC cof, Dec FDTD i the cotext of HPC 2 Parallelisatio of the FDTD 3 Efficiecy of the FDTD o the Itel Phi 4 Efficiecy of the FDTD o the Blue Gee/Q 5 Coclusios ad Recommedatios Compare FDTD o Phi ad BGQ 26/28
28 Icomplete developmet cycle To fully uderstad the parallel FDTD implemetatios: Imperative stage i HPC developmet cycle is the tuig/performace aalysis phase so as to uderstad ad optimise the icreasigly sophisticated hardware. Low level uderstadig resultig from aalysis will lead to higher level gais. Compare FDTD o Phi ad BGQ 27/28
29 Coclusios & Recommedatios Costatly chagig hardware evolutio makes compariso a time widowed evet. Best Performace from hybrid architecture ad hybrid parallelisatio methods Processor & Memory, latecy, computatioal sparsity, memory eeds to be related to processig core for FDTD efficiecy? Lessos leared from GPU data supply. Eve faster with curret hardware, ot always obvious. Parallel performace tools vital to uderstad process o more sophisticated hardware. Ackowledgmets NRF,CHPC,SKA, Itel User Forums, Juelich Computig cetre, NAG(UK), Chris Armstrog FDTD optimisatio. Compare FDTD o Phi ad BGQ 28/ 28
Image Processing on the extreme Processing Platform
PACT Muich, Germay August 2002 Image Processig o the extreme Processig Platform Robert Strzodka Numerical Aalysis ad Scietific Computig Uiversity of Duisburg http://www.umerik.math.ui-duisburg.de strzodka@math.ui-duisburg.de
More informationAlgorithm Analysis. Chapter 3
Data Structures Dr Ahmed Rafat Abas Computer Sciece Dept, Faculty of Computer ad Iformatio, Zagazig Uiversity arabas@zu.edu.eg http://www.arsaliem.faculty.zu.edu.eg/ Algorithm Aalysis Chapter 3 3. Itroductio
More informationECE 669 Parallel Computer Architecture
ECE 669 Parallel Computer Architecture Lecture 4 Parallel Applicatios Outlie Motivatig Problems (applicatio case studies) Classifyig problems Parallelizig applicatios Examiig tradeoffs Uderstadig commuicatio
More informationSliding Mode Indicator Model s at Thermal Control System for Furnace
Proceedigs of the 7th World Cogress The Iteratioal Federatio of Automatic Cotrol Slidig Mode Idicator Model s at Thermal Cotrol System for Furace Varda Mkrttchia*, Sargis Simoya**, Aa Khachaturova***,
More informationPipelined and Parallel Recursive and Adaptive Filters
VLSI Digital Sigal Processig Systems Pipelied ad Parallel Recursive ad Adaptive Filters La-Da Va 范倫達, Ph. D. Departmet of Computer Sciece Natioal Chiao ug Uiversity aiwa, R.O.C. Fall, 05 ldva@cs.ctu.edu.tw
More informationModel of Computation and Runtime Analysis
Model of Computatio ad Rutime Aalysis Model of Computatio Model of Computatio Specifies Set of operatios Cost of operatios (ot ecessarily time) Examples Turig Machie Radom Access Machie (RAM) PRAM Map
More informationApplications of Distributed Arithmetic to Digital Signal Processing: A Tutorial Review
pplicatios of Distriuted rithmetic to Digital Sigal Processig: Tutorial Review Ref: Staley. White, pplicatios of Distriuted rithmetic to Digital Sigal Processig: Tutorial Review, IEEE SSP Magazie, July,
More informationLive Line Measuring the Parameters of 220 kv Transmission Lines with Mutual Inductance in Hainan Power Grid
Egieerig, 213, 5, 146-151 doi:1.4236/eg.213.51b27 Published Olie Jauary 213 (http://www.scirp.org/joural/eg) Live Lie Measurig the Parameters of 22 kv Trasmissio Lies with Mutual Iductace i Haia Power
More informationExplicit Group Methods in the Solution of the 2-D Convection-Diffusion Equations
Proceedigs of the World Cogress o Egieerig 00 Vol III WCE 00 Jue 0 - July 00 Lodo U.K. Explicit Group Methods i the Solutio of the -D Covectio-Diffusio Equatios a Kah Bee orhashidah Hj. M. Ali ad Choi-Hog
More informationOlli Simula T / Chapter 1 3. Olli Simula T / Chapter 1 5
Sigals ad Systems Sigals ad Systems Sigals are variables that carry iformatio Systemstake sigals as iputs ad produce sigals as outputs The course deals with the passage of sigals through systems T-6.4
More informationQuantum Information & Quantum Computation
CS9A, Sprig 5: Quatum Iformatio & Quatum Computatio Wim va Dam Egieerig, Room 59 vadam@cs http://www.cs.ucsb.edu/~vadam/teachig/cs9/ Admiistrivia Do the exercises. Aswers will be posted at the ed of the
More informationPAijpam.eu ON TENSOR PRODUCT DECOMPOSITION
Iteratioal Joural of Pure ad Applied Mathematics Volume 103 No 3 2015, 537-545 ISSN: 1311-8080 (prited versio); ISSN: 1314-3395 (o-lie versio) url: http://wwwijpameu doi: http://dxdoiorg/1012732/ijpamv103i314
More informationMechanical Efficiency of Planetary Gear Trains: An Estimate
Mechaical Efficiecy of Plaetary Gear Trais: A Estimate Dr. A. Sriath Professor, Dept. of Mechaical Egieerig K L Uiversity, A.P, Idia E-mail: sriath_me@klce.ac.i G. Yedukodalu Assistat Professor, Dept.
More informationClassification of problem & problem solving strategies. classification of time complexities (linear, logarithmic etc)
Classificatio of problem & problem solvig strategies classificatio of time complexities (liear, arithmic etc) Problem subdivisio Divide ad Coquer strategy. Asymptotic otatios, lower boud ad upper boud:
More informationAdvanced Course of Algorithm Design and Analysis
Differet complexity measures Advaced Course of Algorithm Desig ad Aalysis Asymptotic complexity Big-Oh otatio Properties of O otatio Aalysis of simple algorithms A algorithm may may have differet executio
More informationDesign and Analysis of ALGORITHM (Topic 2)
DR. Gatot F. Hertoo, MSc. Desig ad Aalysis of ALGORITHM (Topic 2) Algorithms + Data Structures = Programs Lessos Leared 1 Our Machie Model: Assumptios Geeric Radom Access Machie (RAM) Executes operatios
More informationModel of Computation and Runtime Analysis
Model of Computatio ad Rutime Aalysis Model of Computatio Model of Computatio Specifies Set of operatios Cost of operatios (ot ecessarily time) Examples Turig Machie Radom Access Machie (RAM) PRAM Map
More informationRecursive Algorithms. Recurrences. Recursive Algorithms Analysis
Recursive Algorithms Recurreces Computer Sciece & Egieerig 35: Discrete Mathematics Christopher M Bourke cbourke@cseuledu A recursive algorithm is oe i which objects are defied i terms of other objects
More informationParallel Vector Algorithms David A. Padua
Parallel Vector Algorithms 1 of 32 Itroductio Next, we study several algorithms where parallelism ca be easily expressed i terms of array operatios. We will use Fortra 90 to represet these algorithms.
More informationComparison of Finite Difference and Finite Element (Telemac) Models of Dublin Bay
Dubli Istitute of Techology ARROW@DIT Presetatios Egieerig: Educatio ad Iovatio 2013-12-06 Compariso of Fiite Differece ad Fiite Elemet (Telemac) Models of Dubli Bay Garrett Keae Dubli Istitute of Techology,
More informationProbabilistic analyses in Phase 2 8.0
Probability desity Probabilistic aalyses i Phase 8.0 Beoît Valley & Damie Duff - CEMI - Ceter for Excellece i Miig Iovatio "Everythig must be made as simple as possible. But ot simpler." Albert Eistei
More informationApplications of Distributed Arithmetic to Digital Signal Processing: A Tutorial Review
pplicatios of Distriuted rithmetic to Digital Sigal Processig: Tutorial Review Ref: Staley. White, pplicatios of Distriuted rithmetic to Digital Sigal Processig: Tutorial Review, IEEE SSP Magazie, July,
More informationInternal Information Representation and Processing
Iteral Iformatio Represetatio ad Processig CSCE 16 - Fudametals of Computer Sciece Dr. Awad Khalil Computer Sciece & Egieerig Departmet The America Uiversity i Cairo Decimal Number System We are used to
More informationStudy on Coal Consumption Curve Fitting of the Thermal Power Based on Genetic Algorithm
Joural of ad Eergy Egieerig, 05, 3, 43-437 Published Olie April 05 i SciRes. http://www.scirp.org/joural/jpee http://dx.doi.org/0.436/jpee.05.34058 Study o Coal Cosumptio Curve Fittig of the Thermal Based
More informationResearch Article A Unified Weight Formula for Calculating the Sample Variance from Weighted Successive Differences
Discrete Dyamics i Nature ad Society Article ID 210761 4 pages http://dxdoiorg/101155/2014/210761 Research Article A Uified Weight Formula for Calculatig the Sample Variace from Weighted Successive Differeces
More informationEfficient Hashing using the AES Instruction Set
Efficiet Hashig usig the AES Istructio Set Joppe Bos 1 Our Öze 1 Martij Stam 2 1 Ecole Polytechique Fédérale de Lausae 2 Uiversity of Bristol Nara, 1 October 2011 Outlie 1 Itroductio AES ad Hash Fuctios
More informationCS161 Design and Analysis of Algorithms. Administrative
CS161 Desig ad Aalysis of Algorithms Da Boeh 1 Admiistrative Lecture 1, April 3, 1 Web page http://theory.staford.edu/~dabo/cs161» Hadouts» Aoucemets» Late breakig ews Gradig ad course requiremets» Midterm/fial/hw»
More informationReliability and Queueing
Copyright 999 Uiversity of Califoria Reliability ad Queueig by David G. Messerschmitt Supplemetary sectio for Uderstadig Networked Applicatios: A First Course, Morga Kaufma, 999. Copyright otice: Permissio
More informationFinite-Difference Time-Domain Method (FDTD)
Computatioal Photoics Semiar 05 Fiite-Differece Time-Domai Method (FDTD) Lear how to implemet a D versio of the FDTD method ted the code to 3D problems D FDTD: Yee Grid for & Compoets Chagig of ide otatio
More informationEfficient Reverse Converter Design for Five Moduli
Joural of Computatios & Modellig, vol., o., 0, 93-08 ISSN: 79-765 (prit), 79-8850 (olie) Iteratioal Scietific ress, 0 Efficiet Reverse Coverter Desig for Five Moduli Set,,,, MohammadReza Taheri, Elham
More informationAlgorithm Analysis. Algorithms that are equally correct can vary in their utilization of computational resources
Algorithm Aalysis Algorithms that are equally correct ca vary i their utilizatio of computatioal resources time ad memory a slow program it is likely ot to be used a program that demads too much memory
More informationOPTIMAL PIECEWISE UNIFORM VECTOR QUANTIZATION OF THE MEMORYLESS LAPLACIAN SOURCE
Joural of ELECTRICAL EGIEERIG, VOL. 56, O. 7-8, 2005, 200 204 OPTIMAL PIECEWISE UIFORM VECTOR QUATIZATIO OF THE MEMORYLESS LAPLACIA SOURCE Zora H. Perić Veljo Lj. Staović Alesadra Z. Jovaović Srdja M.
More informationPolynomial Multiplication and Fast Fourier Transform
Polyomial Multiplicatio ad Fast Fourier Trasform Com S 477/577 Notes Ya-Bi Jia Sep 19, 2017 I this lecture we will describe the famous algorithm of fast Fourier trasform FFT, which has revolutioized digital
More informationNUMERICAL METHOD FOR SINGULARLY PERTURBED DELAY PARABOLIC PARTIAL DIFFERENTIAL EQUATIONS
THERMAL SCIENCE, Year 07, Vol., No. 4, pp. 595-599 595 NUMERICAL METHOD FOR SINGULARLY PERTURBED DELAY PARABOLIC PARTIAL DIFFERENTIAL EQUATIONS by Yula WANG *, Da TIAN, ad Zhiyua LI Departmet of Mathematics,
More information( x) SUMMARY (2) α z. (3) Now approximating z derivative by forward difference
Depth etrapolatio of seismic wavefields usig cubic splie approimatio Suhas Phadke* ad Dheera Bhardwa Cetre for Developmet of Advaced Computig, Pue Uiversity Campus, GaeshKhid, Pue 4007, Idia SUMMARY Depth
More informationCS 270 Algorithms. Oliver Kullmann. Growth of Functions. Divide-and- Conquer Min-Max- Problem. Tutorial. Reading from CLRS for week 2
Geeral remarks Week 2 1 Divide ad First we cosider a importat tool for the aalysis of algorithms: Big-Oh. The we itroduce a importat algorithmic paradigm:. We coclude by presetig ad aalysig two examples.
More informationDesign and Analysis of Algorithms
Desig ad Aalysis of Algorithms Probabilistic aalysis ad Radomized algorithms Referece: CLRS Chapter 5 Topics: Hirig problem Idicatio radom variables Radomized algorithms Huo Hogwei 1 The hirig problem
More informationLinear Algebra Issues in Wireless Communications
Rome-Moscow school of Matrix Methods ad Applied Liear Algebra August 0 September 18, 016 Liear Algebra Issues i Wireless Commuicatios Russia Research Ceter [vladimir.lyashev@huawei.com] About me ead of
More informationElectricity consumption forecasting method based on MPSO-BP neural network model Youshan Zhang 1, 2,a, Liangdong Guo2, b,qi Li 3, c and Junhui Li2, d
4th Iteratioal Coferece o Electrical & Electroics Egieerig ad Computer Sciece (ICEEECS 2016) Electricity cosumptio forecastig method based o eural etwork model Yousha Zhag 1, 2,a, Liagdog Guo2, b,qi Li
More informationFFTs in Graphics and Vision. The Fast Fourier Transform
FFTs i Graphics ad Visio The Fast Fourier Trasform 1 Outlie The FFT Algorithm Applicatios i 1D Multi-Dimesioal FFTs More Applicatios Real FFTs 2 Computatioal Complexity To compute the movig dot-product
More informationPixel Recurrent Neural Networks
Pixel Recurret Neural Networks Aa ro va de Oord, Nal Kalchbreer, Koray Kavukcuoglu Google DeepMid August 2016 Preseter - Neha M Example problem (completig a image) Give the first half of the image, create
More informationIntermittent demand forecasting by using Neural Network with simulated data
Proceedigs of the 011 Iteratioal Coferece o Idustrial Egieerig ad Operatios Maagemet Kuala Lumpur, Malaysia, Jauary 4, 011 Itermittet demad forecastig by usig Neural Network with simulated data Nguye Khoa
More informationG. R. Pasha Department of Statistics Bahauddin Zakariya University Multan, Pakistan
Deviatio of the Variaces of Classical Estimators ad Negative Iteger Momet Estimator from Miimum Variace Boud with Referece to Maxwell Distributio G. R. Pasha Departmet of Statistics Bahauddi Zakariya Uiversity
More informationHigh Performance Linear Algebra on Data Parallel Co-Processors II
926535897932384626433832795028841971693993754918980183 592653589793238462643383279502884197169399375491898018 415926535897932384626433832795028841971693993754918980 592653589793238462643383279502884197169399375491898018
More informationChapter 13, Part A Analysis of Variance and Experimental Design
Slides Prepared by JOHN S. LOUCKS St. Edward s Uiversity Slide 1 Chapter 13, Part A Aalysis of Variace ad Eperimetal Desig Itroductio to Aalysis of Variace Aalysis of Variace: Testig for the Equality of
More informationDesign of distribution transformer loss meter considering the harmonic loss based on one side measurement method
3rd Iteratioal Coferece o Machiery, Materials ad Iformatio Techology Applicatios (ICMMITA 05) Desig of distributio trasformer loss meter cosiderig the harmoic loss based o oe side measuremet method HU
More informationSignals & Systems Chapter3
Sigals & Systems Chapter3 1.2 Discrete-Time (D-T) Sigals Electroic systems do most of the processig of a sigal usig a computer. A computer ca t directly process a C-T sigal but istead eeds a stream of
More informationEvapotranspiration Estimation Using Support Vector Machines and Hargreaves-Samani Equation for St. Johns, FL, USA
Evirometal Egieerig 0th Iteratioal Coferece eissn 2029-7092 / eisbn 978-609-476-044-0 Vilius Gedimias Techical Uiversity Lithuaia, 27 28 April 207 Article ID: eviro.207.094 http://eviro.vgtu.lt DOI: https://doi.org/0.3846/eviro.207.094
More informationCIS 121 Data Structures and Algorithms with Java Spring Code Snippets and Recurrences Monday, February 4/Tuesday, February 5
CIS 11 Data Structures ad Algorithms with Java Sprig 019 Code Sippets ad Recurreces Moday, February 4/Tuesday, February 5 Learig Goals Practice provig asymptotic bouds with code sippets Practice solvig
More informationThere is no straightforward approach for choosing the warmup period l.
B. Maddah INDE 504 Discrete-Evet Simulatio Output Aalysis () Statistical Aalysis for Steady-State Parameters I a otermiatig simulatio, the iterest is i estimatig the log ru steady state measures of performace.
More informationSchool of Electrical Engineering. EI2400 Applied Antenna Theory Lecture 9: Broadband antennas
School of Electrical Egieerig EI2400 Applied Atea Theory Lecture 9: Broadbad ateas Questio 1 What is the badwidth of a atea? It is the rage of frequecies withi which the performace of the atea with respect
More informationQuantum Computing Lecture 7. Quantum Factoring
Quatum Computig Lecture 7 Quatum Factorig Maris Ozols Quatum factorig A polyomial time quatum algorithm for factorig umbers was published by Peter Shor i 1994. Polyomial time meas that the umber of gates
More informationAnalysis of Antennas by FDTD Method Using Parallel Processing with MPI
Aalysis of Ateas by FDTD Method Usig Parallel Processig with MPI Josivaldo de S. Araújo, Roaldo O. Satos, Carlos L. da S. S. Sobriho, Johy M. Rocha, Lui A. Guedes ad Regiae Y. Kawasaki Federal Uiversity
More informationLecture 12: Spiral: Domain Specific HLS. James C. Hoe Department of ECE Carnegie Mellon University
8 643 Lecture : Spiral: Domai Specific HLS James C. Hoe Departmet of ECE Caregie Mello Uiversity 8 643 F7 L S, James C. Hoe, CMU/ECE/CALCM, 07 Houseeepig Your goal today: see a eample of really highlevel
More informationRobust Resource Allocation in Parallel and Distributed Computing Systems (tentative)
Robust Resource Allocatio i Parallel ad Distributed Computig Systems (tetative) Ph.D. cadidate V. Shestak Colorado State Uiversity Electrical ad Computer Egieerig Departmet Fort Collis, Colorado, USA shestak@colostate.edu
More informationElectromagnetic wave propagation in Particle-In-Cell codes
Electromagetic wave propagatio i Particle-I-Cell codes Remi Lehe Lawrece Berkeley Natioal Laboratory (LBNL) US Particle Accelerator School (USPAS) Summer Sessio Self-Cosistet Simulatios of Beam ad Plasma
More informationOutput Analysis (2, Chapters 10 &11 Law)
B. Maddah ENMG 6 Simulatio Output Aalysis (, Chapters 10 &11 Law) Comparig alterative system cofiguratio Sice the output of a simulatio is radom, the comparig differet systems via simulatio should be doe
More informationProceedings of the 2nd International Conference on Computer Science and Electronics Engineering (ICCSEE 2013)
Proceedigs of the 2d Iteratioal Coferece o Computer Sciece ad Electroics Egieerig (ICCE 20) GPU Implemetatio of the Electric Power System Model for Real-Time Simulatio of Electromagetic Trasiets Slawomir
More informationArithmetic Circuits. (Part I) Randy H. Katz University of California, Berkeley. Spring Time vs. Space Trade-offs. Arithmetic Logic Units
rithmetic rcuits (art I) Rady H. Katz Uiversity of Califoria, erkeley otivatio rithmetic circuits are excellet examples of comb. logic desig Time vs. pace Trade-offs Doig thigs fast requires more logic
More informationAN INTERIM REPORT ON SOFT SYSTEMS EVALUATION
AN INTERIM REPORT ON SOFT SYSTEMS EVALUATION Viljem Rupik INTERACTA, LTD, Busiess Iformatio Processig Parmova 53, Ljubljaa 386 01 4291809, e-mail: Viljem.Rupik@siol.et Abstract: As applicatio areas rapidly
More informationUsing Spreadsheets as a Computational Tool in Teaching Mechanical. Engineering
Proceedigs of the th WSEAS Iteratioal Coferece o COMPUTERS, Vouliagmei, Athes, Greece, July 3-5, 6 (pp35-3) Usig Spreadsheets as a Computatioal Tool i Teachig Mechaical Egieerig AHMADI-BROOGHANI, ZAHRA
More informationProbability of error for LDPC OC with one co-channel Interferer over i.i.d Rayleigh Fading
IOSR Joural of Electroics ad Commuicatio Egieerig (IOSR-JECE) e-issn: 2278-2834,p- ISSN: 2278-8735.Volume 9, Issue 4, Ver. III (Jul - Aug. 24), PP 59-63 Probability of error for LDPC OC with oe co-chael
More informationFinite-Difference Time-Domain Method (FDTD)
Computatioal Photoics, Summer Term 04, Abbe School of Photoics, FSU Jea, Prof. Thomas Pertsch Computatioal Photoics Semiar 05, 0 Jue 04 Fiite-Differece Time-Domai Method (FDTD) Lear how to implemet a D
More informationModule 5 EMBEDDED WAVELET CODING. Version 2 ECE IIT, Kharagpur
Module 5 EMBEDDED WAVELET CODING Versio ECE IIT, Kharagpur Lesso 4 SPIHT algorithm Versio ECE IIT, Kharagpur Istructioal Objectives At the ed of this lesso, the studets should be able to:. State the limitatios
More informationInteger Programming (IP)
Iteger Programmig (IP) The geeral liear mathematical programmig problem where Mied IP Problem - MIP ma c T + h Z T y A + G y + y b R p + vector of positive iteger variables y vector of positive real variables
More informationAnalysis of Deutsch-Jozsa Quantum Algorithm
Aalysis of Deutsch-Jozsa Quatum Algorithm Zhegju Cao Jeffrey Uhlma Lihua Liu 3 Abstract. Deutsch-Jozsa quatum algorithm is of great importace to quatum computatio. It directly ispired Shor s factorig algorithm.
More informationLearning Bounds for Support Vector Machines with Learned Kernels
Learig Bouds for Support Vector Machies with Leared Kerels Nati Srebro TTI-Chicago Shai Be-David Uiversity of Waterloo Mostly based o a paper preseted at COLT 06 Kerelized Large-Margi Liear Classificatio
More informationDESIGN AND IMPLEMENTATION OF IMPROVED AREA EFFICIENT WEIGHTED MODULO 2N+1 ADDER DESIGN
ARPN Joural of Egieerig ad Applied Scieces 2006-204 Asia Research Publishig Network (ARPN). All rights reserved. www.arpjourals.com DESIGN AND IMPLEMENTATION OF IMPROVED AREA EFFICIENT WEIGHTED MODULO
More informationFinite-Difference Time-Domain Method (FDTD)
Computatioal Photoics, Summer Term 0, Abbe School of Photoics, FSU Jea, Prof. Thomas Pertsch Computatioal Photoics Semiar 06, 8 Jue 0 Fiite-Differece Time-Domai Method (FDTD) Lear how to implemet a oe-dimesioal
More informationArithmetic Circuits. (Part I) Randy H. Katz University of California, Berkeley. Spring 2007
rithmetic Circuits (Part I) Rady H. Katz Uiversity of Califoria, erkeley prig 27 Lecture #23: rithmetic Circuits- Motivatio rithmetic circuits are excellet examples of comb. logic desig Time vs. pace Trade-offs
More informationBest Optimal Stable Matching
Applied Mathematical Scieces, Vol., 0, o. 7, 7-7 Best Optimal Stable Matchig T. Ramachadra Departmet of Mathematics Govermet Arts College(Autoomous) Karur-6900, Tamiladu, Idia yasrams@gmail.com K. Velusamy
More informationPerformance Analysis, Modeling and Prediction of a Parallel Multiblock Lattice Boltzmann Application Using Prophesy System
Performace Aalysis, Modelig ad Predictio of a Parallel Multiblock Lattice Boltzma Applicatio Usig Prophesy System Xigfu Wu, Valerie Taylor, Shae Garrick Departmet of Computer Sciece Texas A&M Uiversity,
More informationEE 505. Lecture 28. ADC Design SAR
EE 505 Lecture 28 ADC Desig SAR Review from Last Lecture Elimiatio of Iput S/H C LK X IN S/H Stage 1 r 1 Stage 2 r 2 Stage k r k Stage m r m 1 2 k m Pipelied Assembler (Shift Register
More informationECEN 655: Advanced Channel Coding Spring Lecture 7 02/04/14. Belief propagation is exact on tree-structured factor graphs.
ECEN 655: Advaced Chael Codig Sprig 014 Prof. Hery Pfister Lecture 7 0/04/14 Scribe: Megke Lia 1 4-Cycles i Gallager s Esemble What we already kow: Belief propagatio is exact o tree-structured factor graphs.
More informationLecture 3: Asymptotic Analysis + Recurrences
Lecture 3: Asymptotic Aalysis + Recurreces Data Structures ad Algorithms CSE 373 SU 18 BEN JONES 1 Warmup Write a model ad fid Big-O for (it i = 0; i < ; i++) { for (it j = 0; j < i; j++) { System.out.pritl(
More informationISSN: ISO 9001:2008 Certified International Journal of Engineering and Innovative Technology (IJEIT) Volume 2, Issue 5, November 2012
Iteratioal Joural of Egieerig ad Iovative Techology (IJEIT) Pre Improved Weighted Modulo 2 +1 Desig Based O Parallel Prefix Adder Dr.V.Vidya Devi, T.Veishkumar, T.Thomas Leoid PG Head/Professor, Graduate
More informationRecursive Algorithm for Generating Partitions of an Integer. 1 Preliminary
Recursive Algorithm for Geeratig Partitios of a Iteger Sug-Hyuk Cha Computer Sciece Departmet, Pace Uiversity 1 Pace Plaza, New York, NY 10038 USA scha@pace.edu Abstract. This article first reviews the
More informationCSI 5163 (95.573) ALGORITHM ANALYSIS AND DESIGN
CSI 5163 (95.573) ALGORITHM ANALYSIS AND DESIGN CSI 5163 (95.5703) ALGORITHM ANALYSIS AND DESIGN (3 cr.) (T) Topics of curret iterest i the desig ad aalysis of computer algorithms for graphtheoretical
More informationAnalysis of composites with multiple rigid-line reinforcements by the BEM
Aalysis of composites with multiple rigid-lie reiforcemets by the BEM Piotr Fedeliski* Departmet of Stregth of Materials ad Computatioal Mechaics, Silesia Uiversity of Techology ul. Koarskiego 18A, 44-100
More informationA STUDY ON MHD BOUNDARY LAYER FLOW OVER A NONLINEAR STRETCHING SHEET USING IMPLICIT FINITE DIFFERENCE METHOD
IRET: Iteratioal oural of Research i Egieerig ad Techology eissn: 39-63 pissn: 3-7308 A STUDY ON MHD BOUNDARY LAYER FLOW OVER A NONLINEAR STRETCHING SHEET USING IMPLICIT FINITE DIFFERENCE METHOD Satish
More informationChapter 9 - CD companion 1. A Generic Implementation; The Common-Merge Amplifier. 1 τ is. ω ch. τ io
Chapter 9 - CD compaio CHAPTER NINE CD-9.2 CD-9.2. Stages With Voltage ad Curret Gai A Geeric Implemetatio; The Commo-Merge Amplifier The advaced method preseted i the text for approximatig cutoff frequecies
More informationChandrasekhar Type Algorithms. for the Riccati Equation of Lainiotis Filter
Cotemporary Egieerig Scieces, Vol. 3, 00, o. 4, 9-00 Chadrasekhar ype Algorithms for the Riccati Equatio of Laiiotis Filter Nicholas Assimakis Departmet of Electroics echological Educatioal Istitute of
More informationProceedings of the 2018 Winter Simulation Conference M. Rabe, A. A. Juan, N. Mustafee, A. Skoogh, S. Jain, and B. Johansson, eds.
Proceedigs of the 2018 Witer Simulatio Coferece M. Rabe, A. A. Jua, N. Mustafee, A. Soogh, S. Jai, ad B. Johasso, eds. A NEW FRAMEWORK FOR PARALLEL RANKING & SELECTION USING AN ADAPTIVE STANDARD Lida Pei
More informationMixed Criticality Systems with Weakly-Hard Constraints
Mixed Criticality Systems with Weakly-Hard Costraits Oliver Gettigs Uiversity of York oliver@cs.york.ac.uk Sophie Quito INRIA Greoble sophie.quito@iria.fr Rob Davis Uiversity of York rob.davis@york.ac.uk
More informationData Structures and Algorithm. Xiaoqing Zheng
Data Structures ad Algorithm Xiaoqig Zheg zhegxq@fudaeduc What are algorithms? A sequece of computatioal steps that trasform the iput ito the output Sortig problem: Iput: A sequece of umbers
More informationDefinitions and Theorems. where x are the decision variables. c, b, and a are constant coefficients.
Defiitios ad Theorems Remember the scalar form of the liear programmig problem, Miimize, Subject to, f(x) = c i x i a 1i x i = b 1 a mi x i = b m x i 0 i = 1,2,, where x are the decisio variables. c, b,
More informationRun-length & Entropy Coding. Redundancy Removal. Sampling. Quantization. Perform inverse operations at the receiver EEE
Geeral e Image Coder Structure Motio Video (s 1,s 2,t) or (s 1,s 2 ) Natural Image Samplig A form of data compressio; usually lossless, but ca be lossy Redudacy Removal Lossless compressio: predictive
More informationERT 318 UNIT OPERATIONS
ERT 318 UNIT OPERATIONS DISTILLATION W. L. McCabe, J. C. Smith, P. Harriot, Uit Operatios of Chemical Egieerig, 7 th editio, 2005. 1 Outlie: Batch distillatio (pg. 724) Cotiuous distillatio with reflux
More informationADJOINT VARIABLE METHODS FOR DESIGN SENSITIVITY ANALYSIS WITH THE METHOD OF MOMENTS
ADJOINT VARIABLE METHODS FOR DESIGN SENSITIVITY ANALYSIS WITH THE METHOD OF MOMENTS N.K. Georgieva, S. Glavic, M.H. Bakr ad J.W. Badler, CRL223, 1280 Mai Street West, Hamilto, ON L8S 4K1, Caada e-mail:
More informationResearch Article Single-Machine Group Scheduling Problems with Deterioration to Minimize the Sum of Completion Times
Hidawi Publishig Corporatio Mathematical Problems i Egieerig Volume 2012, Article ID 275239, 9 pages doi:101155/2012/275239 Research Article Sigle-Machie Group Schedulig Problems with Deterioratio to Miimize
More informationComplexity Analysis of Highly Improved Hybrid Turbo Codes
I J C T A, 8(5, 015, pp. 433-439 Iteratioal Sciece Press Complexity Aalysis of Highly Improved Hybrid Turbo Codes M. Jose Ra* ad Sharmii Eoch** Abstract: Moder digital commuicatio systems eed efficiet
More informationAsymptotic Coupling and Its Applications in Information Theory
Asymptotic Couplig ad Its Applicatios i Iformatio Theory Vicet Y. F. Ta Joit Work with Lei Yu Departmet of Electrical ad Computer Egieerig, Departmet of Mathematics, Natioal Uiversity of Sigapore IMS-APRM
More informationMathematical Modeling of Optimum 3 Step Stress Accelerated Life Testing for Generalized Pareto Distribution
America Joural of Theoretical ad Applied Statistics 05; 4(: 6-69 Published olie May 8, 05 (http://www.sciecepublishiggroup.com/j/ajtas doi: 0.648/j.ajtas.05040. ISSN: 6-8999 (Prit; ISSN: 6-9006 (Olie Mathematical
More informationCache-Efficient Algorithms II
6.172 Performace Egieerig of Software Systems LECTURE 9 Cache-Efficiet Algorithms II Charles E. Leiserso October 7, 2010 2010 Charles E. Leiserso 1 Ideal-Cache Model Features Two-level hierarchy. Cache
More information1 Duality revisited. AM 221: Advanced Optimization Spring 2016
AM 22: Advaced Optimizatio Sprig 206 Prof. Yaro Siger Sectio 7 Wedesday, Mar. 9th Duality revisited I this sectio, we will give a slightly differet perspective o duality. optimizatio program: f(x) x R
More informationLecture 19. sup y 1,..., yn B d n
STAT 06A: Polyomials of adom Variables Lecture date: Nov Lecture 19 Grothedieck s Iequality Scribe: Be Hough The scribes are based o a guest lecture by ya O Doell. I this lecture we prove Grothedieck s
More informationEstimation of Population Mean Using Co-Efficient of Variation and Median of an Auxiliary Variable
Iteratioal Joural of Probability ad Statistics 01, 1(4: 111-118 DOI: 10.593/j.ijps.010104.04 Estimatio of Populatio Mea Usig Co-Efficiet of Variatio ad Media of a Auxiliary Variable J. Subramai *, G. Kumarapadiya
More informationThe prediction of cement energy demand based on support vector machine Xin Zhang a, *, Shaohong Jing b
4th Iteratioal Coferece o Computer, Mechatroics, Cotrol ad Electroic Egieerig (ICCMCEE 05) The predictio of cemet eergy demad based o support vector machie Xi Zhag a,, Shaohog Jig b School of Electrical
More informationBi-criteria Scheduling on Parallel Machines Under Fuzzy Processing Time
22d Iteratioal Cogress o Modellig ad Simulatio, Hobart, Tasmaia, Australia, 3 to 8 December 207 mssaz.org.au/modsim207 Bi-criteria Schedulig o Parallel Machies Uder Fuzzy Processig Time Sameer Sharma a,
More informationTMA4205 Numerical Linear Algebra. The Poisson problem in R 2 : diagonalization methods
TMA4205 Numerical Liear Algebra The Poisso problem i R 2 : diagoalizatio methods September 3, 2007 c Eiar M Røquist Departmet of Mathematical Scieces NTNU, N-749 Trodheim, Norway All rights reserved A
More information