Team 250 Page 2. I. Introduction
|
|
- Lorin Burns
- 6 years ago
- Views:
Transcription
1 Summary Gve the possble umber of geetc varatos, the probablty of havg a aturally occurrg Doppelgager s low. Ths s why DA evdece acqured at crme scees s such coclusve evdece whe preseted crmal trals. Though the process of DA fgerprtg s fallble, the probablty that two urelated people wth the same DA exst s mcroscopc. Barrg, the, that you have a detcal evl tw, the probablty that you wll be mstake for a crmal based o such evdece s low. Fgerprts, however, beg oly a porto of ths geetc detty, seem far less restrctg. It s the cocevably possble that oe could be mstake as the perpetrator of a crme based o fgerprt evdece. It s our goal to determe exactly how probable ths s. Oe of the progetors of the study of fgerprt detty was Sr Fracs Galto, who detfed characterstc rdge patters the sk that vary wdely amog a populato, but whch are costat over tme to a dvdual. I addto to these mutae, fgerprts also have a overall patter that early all cases falls to oe of three groups: loops, arches, ad whorls. Usg both the overall fgerprt patters, ad a set of the most commoly occurrg Galto Characterstcs (GCs), we created a model to test the dvdualty of fgerprts, based o a probablstc terpretato: hghly probable fgerprts are less dvdual, ad less probably fgerprts are more dvdual. I ths model, we frst dvded a deal rectagular thumbprt to squares of equal area, deoted as cells. Kowg that ay comparso betwee two fgerprts frst matches the geeral patter of a fgerprt ad the a certa umber of GCs, we calculated the fgerprt patters that have the maxmum probablty of occurrece. Ths was doe by usg fgures whch determed the relatve frequecy of occurrece of each of the patters ad GCs. To start, we assumed that from a deal thumbprt cotag total cells, we chose to cofrm the form ad placemet of GCs those cells. Our model proceeds stages, frst choosg the overall patter of the prt, ad the proceedg to choose locatos of GCs from the total placemets possble. Oce the patter ad placemet have bee determed, t remas oly to factor the relatve occurrece probabltes of each GC order to determe a measure of the dvdualty of the fgerprt. The model s costructed based o a umber of assumptos. To beg wth, we frst assume that the patters ad GCs occur depedetly; ether has a fluece o the other s probablty. I later stages of our aalyss, the, we accout for the fact that depedeces may exst, ad alter the selecto of GCs accordgly. Aother assumpto that our model makes s that the GCs occur depedetly; that s, the spaces whch we wsh to cofrm the presece of GCs, placemet has o effect o whch characterstc s selected. Sce there has bee o coclusve evdece that a partcular fgerprt patter has ay fluece o the mutae preset the fgerprt, ths seems to be a vald assumpto, ad hece o uecessary restrctos were placed o the form of the fgerprt. The costructo of the model allowed us to calculate the ablty to cofrm a fgerprt based o partal fgerprt evdece. I addto, we used populato fgures of may coutres ad the etre world to fd what the mmum umber of GCs commo betwee fgerprts should be before a match ca be sad to occur. I testg ths model, we dd ot calculate the probablty of occurrece for every dvdual patter ad placemet of GCs. Rather, we calculated oly the probablty of the most lkely occurrece. Also, the oretato of GCs was ot take to cosderato. Ths may at frst seem to be a weakess, but s fact a stregth, as requrg a fgerprt to occur wth GCs oreted a partcular drecto s strcter tha ot requrg ay partcular drecto for ther placemet. Thus, ay fgerprt occurrg ature s hypothetcally less lkely to occur tha our calculated maxmum. For a template fgerprt wth 12 detfed mutae, a reasoable requred umber gve ew advacemets laser recogto of fgerprts, the probablty fdg a match was calculated to be o the order of Ths fgure shows that eve the most lkely fgerprt s thus hghly dvdual, ad fgerprt detfcato s as relable o deal grouds as DA detfcato, whch has relablty o the order of
2 Team 250 Page 2 A IQUIRY ITO IDIVIDUALITY OF THUMBPRITS Asma Al-Raw, Steve Glbersto, Joatha Whtmer Kasas State Uversty Mathematcal Cotest Modelg 2004 I. Itroducto How ca you dsbeleve me whe I have created each oe of you dow to the prts o your fgers? --God (The Holy Qur a 75:3-4) [4] The above referece, depedg o oe s relgousess or secularsm, ether cofrms that fgerprts are dstct to dvduals, or at the very least, that kowledge of varato of fgerprts betwee persos, ad ts heret propertes detfcato, has exsted sce the 8 th cetury. I moder Wester culture, the dea of usg fgerprts as a meas of detfcato frst appeared a artcle wrtte by Hery Faulds 1880 the joural ature [3]. Hs terest was aroused by hs dscovery of rdged patter mprts hadmade pottery. After performg a seres of expermets to determe dfferece fgerprts amog dvduals as well as ther reslece, he recommeded that a prmary use of these rdged mprts could be used as evdece of crmal detty at the scee of the crme. At the root of ths asserto s the assumpto of uqueess each huma s fgerprt patters. There are several commoaltes the patters of rdged sk, however, whch allow fgerprts to be systematcally classfed. For example, the rdged les o fgers appear a umber of major patter types: loops, whch comprse the largest porto of all fgerprts ad occur two chraltes; whorls, whch are characterzed by the spralg patter of the rdges; ad the arches, whch comprse the smallest major group [1]. Other possble mafestatos exst; however ther occurrece s very rare. I addto to these major groups, the rdges of dfferet fgerprts show certa defg characterstcs. Ths dea was prevalet oe of the frst attempted quatfcatos of fgerprt dvdualty, whch was performed by Sr Fracs Galto 1892 [1]. The patters of fger rdge dvergeces ad combatos, termed mutae, are also detfed as Galto Characterstcs hs hoor. Later developmets have corporated hs deas alog wth other prt-determg factors to establsh more exactly each prt s uqueess [1,2,6]. Whether or ot each fgerprt patter s truly uque, ther use as a form of detfcato has foud much use foresc scece. Recetly, however, the valdty of fgerprt evdece has bee called to questo, as evdeced by the case Uted States v. Mtchell, whch preseted the US wth ts frst challege as to the admssblty of latet fgerprt evdece as a meas of detfcato [7]. Ths ecesstates a reevaluato of the valdty of fgerprt uqueess measuremet. Thus, we become faced wth the problem of determg the probablty that two people the world mght share the same fgerprts to measurable accuracy. Ths s qute a complex problem f oe allows t to be, as there seem at frst to be almost ftely may varatos wth rdge patters whose appearace ad terplay must be accouted for, ad yet t has a smple ad elegat soluto whch we wll show ths paper. I our study, we focus ot o each of the te fgers, but o oly the thumb, whch effectvely serves as a upper boud for the
3 Team 250 Page 3 multple occurrece probablty of all frcto rdged sk. Our calculatos have foud o the bass of a dscrete probablty model that t s extremely ulkely that two people wth the same thumbprts have ever exsted, wth the lmtatos of curret measuremet practces. II. Model The frst step devsg a model for thumbprt dvdualty s smply to uderstad what types of fgerprts exst. As metoed prevously, fgerprts occur what seems to be a fte umber of varatos, determed by both ther overall patter ad the dstrbuto of Galto Characterstcs (GCs). The patters fall to three ma categores: loops, arches, ad whorls. These ca be further dvded to over a thousad subcategores [1]. Fgure 1 shows the major types of prts. FIGURE 1. These are four most commo patters of fgerprt patters: Left ad rght loops, whorls, ad arches. From Prts whch fall to these categores ca, to the utraed eye, ad oftetmes eve the traed eye, appear very smlar. Whe the cotrbuto of GCs s factored, a partcular fgerprt s uque character starts to become apparet. The major types of GCs are llustrated Fgure 2. Whether the patter o the fger s a loop, arch, or whorl, GCs occur radomly throughout the etre prt. These occurreces gve dstct attrbutes to the prt that ca be systematcally classfed.
4 Team 250 Page 4 FIGURE 2. A chart showg the 10 most commo forms of Galto Characterstcs. (Osterburg??) The cetral problem, gve a kow classfcato of a fgerprt by ts patter ad GCs, becomes to calculate the probablty that a detcal fger exsts. Our model focuses specfcally o thumbprts, for a varety of reasos. For stace, a thumb s easy to dealze. I practce, whe fgerprts are take, the fger s rolled over early ts etre surface above the frst kuckle. Ths s smlar to the urollg of a ucapped cylder. The shape of ths prt o paper s approxmately rectagular. The thumbprt has the largest area, ad also the largest umber of defg qualtes, due to the radom dstrbuto of GCs. For a deal rectagular thumbprt, we partto the area to equally szed squares, wth a mmum sze o the order of oe square mllmeter, due to the mmum extet to whch a GC ca be detfed as occurrg oe of the squares. Sce oly a fte umber of vsble GCs ca occur o a sgle pattered fger, a dscrete probablty method s useful for determg the possblty of Doppelgager thumbs. It s the perfectly admssble to use a coutg argumet to fd approxmately the umber of possble arragemets of frcto rdges o the thumb, ad ther relatve occurreces based o the features they cota. It should be oted that deal fgerprts as descrbed above do ot usually occur actual feldwork. Usually oly portos of fgerprts are left by ols or other substaces o the fgers of the crmal; these are called latet prts. After these latet prts are developed ad brought to vsble form, they are descrbed as partal prts. These partal prts cota oly a fracto of the total surface of the frcto rdged sk o the thumb. Usg smlar deas to the oes above, we ca model partal prts smply
5 Team 250 Page 5 by decreasg ; that s, lmtg the umber of cells o whch the prts have to match up. Sce a partal prt caot possbly match the rest of the cells cotaed a deal prt, the characterstcs of those cells are rrelevat. Decreasg the gves a accurate model, as we ca say that the area we are samplg from s smaller. Accordgly, the probablty of matchg the prt amog people of a gve populato grows, as we show below. III. Probablty Algorthms Our frst step was to measure the dmesos of a dealzed thumb. Averagg over the three members our group, we foud the dmesos of a early rectagular prt, whe measured as descrbed above, to be approxmately 3 cm by 4 cm. Thus there are approxmately 1200 square mllmeters o two thumbs. We took each square mllmeter to be a cell, so that our deal thumb model, a full prt has a possblty of 1200 detfcato pots. I practce, a suspect s thumbprt ad the thumbprt foud at the scee of the crme are compared to each other o both the overall patter ad a certa umber of dstgushg characterstcs. The dstgushg factors ca correspod to ether scars o the suspect s thumbprt or GCs. Sce scars are the result of completely radom evets, ad thus are early mpossble to quatfy wthout exact persoal hstores, our model cosders oly the cases whch GCs occupy these detfyg pots. I prevous models [1,2], the relato betwee GCs ad the overall patter was ot cosdered; oly the occurrece of GCs was take to accout. I our model, varous degrees of patter ad GC depedece were cosdered. Ths accouts for the possblty that a certa percetage of the GCs are heret the overall patter. I the case where patter ad GC occurreces are completely depedet, oe ca separate the probablty of a fgerprt s occurrece to two factors: P P P (1). fp p I the above equato, P fp s the probablty a partcular fgerprt wll occur, P p s the probablty a partcular patter wll occur, some approxmate fgures for whch are gve Table 1, ad P GC s the probablty of a partcular combato of GCs. Class of Prt Probablty Rght Loop Left Loop Whorl 0.3 Arch 0.05 Total 1 TABLE 1: A lst of approxmate occurrece probabltes of the four most commo thumbprts from Osterburg, et. al. The loop category s determed there to have a 65% occurrece probablty, whch here s dvded to the two chraltes, whch are easly dstgushable ad occur at early the same rate overall. Our model treats o-measured GCs ad cells whch there are o GCs as equvalet empty cells. Thus, the case where GCs are depedet o whch patter a fgerprt has, we ca stll use ths depedece model, by otg that sce a partcular GC
6 Team 250 Page 6 percetage of the GCs are determed by the patter, we ca treat those as empty space whch o defg characterstc occurs. Suppose the, that we wsh to fd the probablty that a partcular dstrbuto of measured GCs occurs. To do ths, we ote that of the total cells the fgerprt, oly of these cells have ay sgfcace terms of GC measuremet. The umber of ways ths ca be dstrbuted s easy to compute. Placg all measured cells o the same level, we beg placg GC s ad empty cells o the surface of the thumbprt. At frst there are GCs to place wth the total area of the prt, ad total cells to place them. If the frst cell s empty space, we are left wth -1 cells whch to place characterstcs, ad characterstcs. If the frst cell cotas a characterstc, we have -1 empty cells whch to place characterstcs, ad -1 GCs. Iteratg ths choce process over all cells, we fd that the umber of ways we ca place the GCs s!!( )! (2). Ths leaves us to calculate the probablty that each GC cell cotas a partcular GC. Osterburg, et al, cotas relatve frequeces of occurrece for each characterstc averaged over 39 fgers. Table 2 gves these fgures. I our model, sce we dsregard empty spaces, we cosdered oly the relatve frequecy of the eleve most commo elemets. Double occurreces, or the evet that two GCs occur the same space, whle certaly possble, were gored ths model calculato, due to ther small frequecy. The umber the table s msleadg, as t accouts for all double occurreces, ot double occurreces of partcular types. Parameter Cell cofgurato Frequecy Probablty of Parameter 0 Empty 6, Islad Brdge Spur Dot Edg rdge Fork Lake Trfurcato Double bfurcato Delta Broke rdge Multple occuraces Total 8, TABLE 2. Expermetally determed Galto Characterstc probablty umbers. From Osterburg, et al. Our model dsregards multple occurreces, hece for our purposes, the characterstcs umbered 0 ad 12 are empty cells. Oly the characterstcs umbered 1-11 are relevat. The relatve probablty s a ecessary factor for determg whch characterstc s most lkely to occur the GC cells. The probablty of the th occurrece s gve by:
7 Team 250 Page 7 r = P( ) P( ) (3), where the elemets P() are determed from Table 1. The ths case rages from 1 to 11, as our model cosders oly sgle GC occurreces, ad treats the low probablty ad multple occurrece GCs as empty space. It should be oted that ther cluso would decrease the relatve probablty of the th term as defed above; hece, t would decrease the upper boud whch our calculato ams to set. Clearly, the sum of these relatve probablty quattes s 1, hece they are valdly defed as probabltes. For GCs, the probablty of each arragemet s gve by the relatve probablty of each GC to the power of the umber of tmes the GC s selected dvded by the umber of ways to dvde those elemets to groups categorzed by the eleve GCs cosdered. Though the dea s complex, the otato s rather mathematcally smple, ad correspods to the product of the selecto probabltes dvded by the multomal coeffcet correspodg to choosg 1 of GC umber 1, 2 of GC umber 2, etc. If we dvde ths quatty by the umber of ways each of the GCs cosdered, we obta the probablty of each arragemet of GC s, show equato (4a). P GC 1 11 = 1 r 11!! = 1! r!!( )! 11 (!) r! ( )! (4a) Oe should ote that the above, α (4b), hece there are oly as may stages cosdered the determato of GCs as there are GCs that are measured ad avalable to compare to. To reterate, our algorthm for calculatg Doppelgager thumb probabltes cosders separately the probabltes of both the geeral patter ad GC occurrece. The probablty of GC occurrece s determed by the umber of places whch GCs are observed, the relatve probablty of a GC occurrg there, ad the umber of ways these GC s ca the be ordered. The quatfcato of ths s the gve by equato (4a). ow, gve equatos (1) ad (4a), we ca calculate the probablty of ay partcular fgerprt matchg o both the patter ad ay GCs by usg the formato Tables 1 ad 2. Sce we wsh, the, to put a lmt o the umber of people the world who ca match fgerprts, gve these characterstcs, we calculated P max, the probablty of ay thumbprt matchg a template wth oly the most lkely characterstcs each of the GC places. Ths smplfes equato (4a), by restrctg choce to oly the GC wth maxmum probablty. Thus we have
8 Team 250 Page 8 P GC 1 11 = 1 r 11 0 r max 0 r max P max (5). Some plots of ths are gve Appedx A. These plots use the value of r max obtaed by computg the relatve probablty of edg rdges, ad cosder oly the rght ad left loop patters (occurrg equal supply) to costtute the maxmum patter probablty. To calculate the quattes determed equato (5), t becomes ecessary to calculate factorals of very large umbers to determe values of choose. Ths ca be approxmately doe by usg Sterlg s approxmato, whose formula s gve by Ths, tur, leads us to the approxmato 1 log( m!) mlog( m) m + log(2 m) (6). 2 log log(!) log ( )! log(!) (7), whch ca be utlzed to approxmate. If we suppose that a percetage of GCs are depedet o the overlyg patter, the our model chages very lttle. Assumg that l of the total GCs are depedet o a partcular patter, we ca essetally dsregard all patter-depedet GCs as empty cells, as they would be exactly what s expected the prt at that pot the patter. Hece, wth a slght modfcato from to l, where l deotes the umber of GCs depedet o the patter, equatos (4a) ad (4b) ca stll be utlzed. I the evet that the GCs are wholly determed by the overlyg patter, we ca dsregard the fluece of the patter our calculato of P fp, as we have more precse formato about GC form ad occurrece tha we do about patter ad sub-patter form ad occurrece. Also, our estmates for the lkelhood of a GC occurrg at a gve pot the -square array gve a more lmtg maxmum for the probablty tha do our fgures o geeral patter characterstcs. The omsso of the patter fluece o the fgerprt probablty s completely vald, sce total GC depedece o patter s equvalet to total patter depedece o GC; they smply become two dfferet types of taxoomy. IV. Data Returg to problem ow, we are specfcally asked to determe what the probablty s that a perso ca be msdetfed by fgerprt evdece; that s, we are to determe the probablty that two people share the same fgerprt characterstcs. For a template wth GCs, we are to calculate the probablty that two dstct people match the template. Ths s lmted by the square of P max for a gve, whch as graphed
9 Team 250 Page 9 Fgure 3 below, s see to be very low for all 10. For the value of = 12, take Osterburg, et al to be a meda value for what s requred for verfcato by varous teratoal law eforcemet ageces, we ca see that the probablty of fgerprt multplcty s 4.64 x These calculatos were smply performed usg a Mcrosoft Excel spreadsheet ad the formulas Secto III. Maxmum Probabltes at Varous Patter Depedeces 1.00E E E E-11 P_max 1.00E E E E umber of GCs o Depedece 25% Depedece 50% Depedece 75% Depedece 100% Depedece FIGURE 3: Plot of maxmum probablty as a fucto of the umber of GCs used the verfcato process. Here s allowed to rage from 1 to 30. Aother, drectly applcable, ad hghly terestg problem s the followg: What s the maxmum umber of GCs that a partcular coutry s law eforcemet ageces must use order to get the hghest probablty of a match usg the lowest umber of GCs per detfcato? Usg populato fgures Table 3, we ca determe ths. To do so, we multply the populato of a coutry by P max to fd the umber of people a coutry that are probable to match a gve GC template. The results are plotted Appedx A. The plots Appedx A all pot to ear certa detfcato for 12. Ths s true regardless of the coutry whch the detfcato s beg made. I fact, usg the world populato fgure, t s ear certa that o a thumb wth 1200 cells, a match s all but certa, ad deed, oly oe perso s lkely to have ever exsted wth such a prt. Coutry US World Cha umber of people 2.925E E E+09
10 Team 250 Page 10 Lchteste 3.284E+04 # People Ever 1.269E+10 Table 3: Populato fgures for the world ad some represetatve coutres. The umber of people ever was a fgure computed o the assumpto that roughly twce as may people have exsted the hstory of humaty tha exst at ths partcular pot tme. As was oted before, however, t mght be the case that a thumb wth 1200 cells s overly large, or that oly partal prts ca be obtaed for detfcato purposes. I ths case, we restrct the umber to a umber less tha For the plots Appedx B, we chaged the umber 1200 our calculato to values of = 600 ad = 300. Though ths creases the probablty of fdg multple matches, due to restrcto the umber of stes to place GCs. However, f as few as 12 GCs are matched, the fgerprt s uque detty s all but assured. V. Error Aalyss A prevous vestgato by Pakat, et. al. cluded the oretato of each muta the model for fgerprt dvdualty. We eglect to clude the factor of oretato of the characterstc for may reasos. Frstly, removg the factor of GC oretato ca oly decrease our estmate of the maxmum possble thumb Doppelgager probablty. Sce we are attemptg oly to fd a maxmum boud for ths probablty, removal of a factor whch ca oly decrease the probablty of a partcular prt, whle the same breath uecessarly complcates our soluto, does o damage to our model. Pakat, whom accouts for oretato hs model, arrved at a lower fgure for fgerprt dvdualty tha we dd. I accoutg for ths oretato, however, Pakat completely dsregards the dffereces mutae, oly cocetratg o locato ad oretato of defg features the fgerprt rdges. Some fgures doe o varous model calculatos that are cluded Pakat s paper are lsted Table 4, Appedx C. A secod reaso our model dsregards oretato s that our model reles o the assumpto that mutae occur ether depedetly or sem-depedetly. I accoutg for oretato, we would have to take to accout restrctos placed o the oretato of the GC by the overall patter. Ths s smple to see: persos wth loop patters have a hgher probablty upward ad dowward potg GCs tha do persos wth arches. Accoutg for oretato would make the patter ad mutae probabltes separable, ad aga harm the smplcty of our model whle offerg lttle mprovemet to our lmtg maxmum. Aother uavodable problem wth our model s the roughess of patter ad GC frequeces. Ufortuately, there are o good assessmets publshed o the percetages of the populato who patters that fall to the arch, loop, ad whorl categores. The frequecy of occurrece of GCs faces a smlar problem. I fact, the oly fgures we could fd were rough estmatos based o a small sample of people. Osterburg, whose fgures we used ths model, arrved at hs probablty parameters of GCs by samplg from 39 fgerprts. He dd break them to a total of 8,591 cells, but as we do ot kow whether or ot a sgle perso s more lkely to have a certa type of GC, these probabltes caot be take at face value [1]. Surely more recet fgures o these
11 Team 250 Page 11 parameters exst, but they aga do ot harm our model, oly the fgures whch t calculates. As metoed before, there s a possblty that there exsts depedece betwee GCs ad the overall patter of a fgerprt. I our model, we attempted accout for ths by decreasg the detfyg trats of a partcular muta by 25%, 50%, ad 100%. For the 100%, we smply calculated the probablty of a partcular GC occurrece ad dsregarded the patter, as ether ca be see to be the determg factor of the other. Ths s ot a exact model smply because ths assumes sem-depedece where complete depedece may occur. Wthout proper relatos that gve the depedece of mutae o the overall patter, however, we are uable to properly accout for ths. Iasmuch as we were able to adjust for these parameters, our model stll predcts that detfyg 12 or more mutae o a prt, whch s well wth curret techology, all but assures a postve match. Oe who pays astute atteto to our graph Fgure 3 otes that the graphs of 100% ad 0% depedece are actually the closest predcted probablty. Ths s because removal of the patter parameter the calculato of P max oly creases the overall maxmum probablty by a approxmate factor of 10. The other fgures suffer from exactess relatg the depedece betwee occurrece of patter ad mutae. I the fgures for our model, we have more precse kowledge of GC occurrece tha of patter occurrece. Hece, the plots whch we requre a percet depedece o patter suffer uecessarly from exact data. As we are creatg a somewhat dealstc model of fgerprts, scars were ot take to cosderato. As ca bee see Fgure 4, scars do have a effect o the appearace of fgerprts. Ths may create accuraces; however, there s o good way to model the formato of scars, as ths s completely due to persoal expereces. FIGURE 4: The effect of scars o fgerprt aalyss. From Cowger, p. 4. Our model also dffers o oe accout from most other models of fgerprts. Prevous artcles [3] publshed o fgerprt aalyss defe fgerprts oly as the porto the geeral vcty of the cetral patter. Our model actually takes the prt o the etre area above the upper jot of the thumb, whch would be the type of fgerprt o fle. Accordgly, our probabltes are sgfcatly lower tha those calculated by others. However, our model ca, as metoed before, be made to approxmate these the lmt where the umber of cells s at a value aroud 300 ad s aroud 12. The values we calculated ths method match up to other models accordgly, as see Table 4. The major problem whch our model suffers from s ts ablty to accout for huma error determg thumbprt probablty. Epste [7] otes that the major problem wth latet fgerprt evdece s the ablty of the humas whom exame the prts to dscer exact characterstcs. We ow have the ablty to use optcal scas to determe fgerprts of a dvdual exactly, as opposed to puttg k o fle. If the thumbprts matches were able to be tested by a computer, t would be hghly ulkely, gve our model, that ayoe would ever be msdetfed.
12 Team 250 Page 12 Comparg the output of our model wth the probabltes of error DA aalyss, we fd that fgerprts are a much more accurate method of detfcato. Though everyoe except detcal tws ad cloes has a uque sequece of DA, for crmology, the exact sequece s ot actually used as evdece. Istead, DA s cut up wth a ezyme to Restrcto fragmet legth polymorphsms (RFLPs). These peces of DA are the ru out o a gel, whch separates t out by the sze of the segmet [8]. Accordgly, f two or more people smply have restrcto stes approxmately the same area, or eve have the same amouts of DA betwee restrcto stes, they ca be mstake for oe aother. Ths s a much hgher probablty tha f the exact sequece were take to accout. Accordgly, though msdetfcato s rare, the probablty of msdetfcato DA aalyss s o the order of oe te bllo, whle accordg to our data that of fgerprt aalyss s much lower [5].
13 Team 250 Page 13 VI. Cocluso Itally, ths problem aroused us may cocers. What f oe of us really had a thumb Doppelgager? We could be covcted for crmes we had ever commtted! Ths stuato would be most ufortuate. However, after rug our model uder a case of maxmum probablty, we dscovered that there s a better chace of msdetfcato through DA proflg f the fgerprt aalyss s coducted wth mmal huma error. Ths s plaly evdet the fact that the odds of msdetfcato of DA evdece, regarded legal ad publc opo as early fallble, has a probablty of msdetfcato o the order of 10-10, whle the odds of fgerprt msdetfcato s four orders of magtude less, accordg to our model. eedless to say, t seems ureasoable to dey fgerprt proflg as evdece a crmal tral.
14 Team 250 Page 14 Appedx A: Shared Characterstcs of a Populato The followg plots were used to determe the optmum fgure for detfcato of crmals based o fgerprt evdece that s gve secto IV. umber of lke thumbprts, 0% depedece, = E+10 1.E+05 umber of people wth thumbprt 1.E+00 1.E-05 1.E-10 1.E-15 1.E-20 1.E umber of GCs US Most World Most Cha Most Lchteste Most Ever Most Fgure 5: Plot of the umber of probable lke thumbprts a gve coutry usg the model of zero percet patter depedece. Ths shows that f oly 10 mutae are requred to match, the t s lkely that o oe the hstory of the world has had a exactly matchg whole thumbprt. umber of lke thumbprts, 25% depedece, = E E+06 umber of people wth thumbprt 1.00E E E E E E umber of GCs US Most World Most Cha Most Lchteste Most Ever Most Fgure 6: Same as above, for 25% depedece model. Here, oly 10 mutae are requred for postve detfcato as well.
15 Team 250 Page 15 umber of lke thumbprts, 50% depedece, = E E+04 umber of people wth thumbprt 1.00E E E E E umber of GCs US Most World Most Cha Most Lchteste Most Ever Most Fgure 7: Same as above, for the 50% patter depedece model. Here, aroud 12 characterstcs are requred for a hghly probable detfcato. The dfferece here s lkely caused by error our kowledge of patter frequeces. umber of lke thumbprts, 100% depedece = E E+04 umber of people wth thumbprt 1.00E E E E E E umber of GCs US Most World Most Cha Most Lchteste Most Ever Most Fgure 8: Same as above, for the complete depedece model. Aga, oly about 10 characterstcs are requred for a postve detfcato.
16 Team 250 Page 16 Appedx B: Shared Partal Prt Characterstcs of a Populato The followg plots were used to determe the optmum umber of GCs to match up wth a gve populato f oly partal prts are avalable for comparso. umber of lke thumbprts, 0% depedece, = E E+08 umber of people wth thumbp 1.00E E E E E E umber of GCs US Most World Most Cha Most Lchteste Most Ever Most Fgure 9: A plot of the umber of possble lke half-thumbprts, gve zero depedece o fgerprt patter. umber of lke thumbprts, 100% depedece, = E E+08 umber of people wth thumbprt 1.00E E E E E E umber of GC's US Most World Most Cha Most Lchteste Ever Most Fgure 10: A plot of the umber of possble lke half-thumbprts, gve oe hudred percet depedece o fgerprt patter.
17 Team 250 Page 17 umber of lke thumbprts, 0% depedece, = E E+07 umber of people wth thumbpr 1.00E E E E E umber of GCs US Most World Most Cha Most Lchteste Most Ever Most Fgure 11: A plot of the umber of possble lke quarter-thumbprts, gve zero depedece o fgerprt patter. umber of lke thumbprts, 100% depedecy, = E+12 umber of people wth thumbprt 1.00E E E E E E umber of GCs US Most World Most Cha Most Lchteste Most Ever Most Fgure 12: A plot of the umber of possble lke quarter-thumbprts, gve oe hudred percet depedece o fgerprt patter.
18 Team 250 Page 18 Appedx C: Table of Calculated Probabltes These probabltes were calculated usg varous past models by Pakat [6]. As oted earler, our model, whch predcts a value less tha 4 x for the probablty of each dvdual fgerprt, s good agreemet wth these calculatos. Author P fp =36, R=24, M=72 =12, R=8, M=72 Galto (1892) R 1.45x x Pearso(1930) R 1.09x x Hery(1900) x x Balthazard(1911) x x10-8 Bose(1917) Wetworh & Wlder (1918) Cumms & Mdlo (1943) Gupta (1968) Roxburgh (1933) x x x x x x x x x x Traurg (1963) ( ) 2.47x x10-9 Osterburg et al. (1980) M ( 0.766) (0.234) 1.33x x10-15 Stoey (1985) ( ) 1 1.2x x TABLE 4: Calculated probabltes for varous models. Obtaed from Pakat, et. al. [6]. Here, R s the umber of regos of a fgerprt cosdered as defed by Galto, M s the umber of regos as defed by Osterburg.
19 Team 250 Page 19 Refereces [1] J.Osterburg, et al., Developmet of a Mathematcal Formula for the Calculato of Fgerprt Probabltes Based o Idvdual Characterstcs, Joural of the Amerca Statstcal Assocato, Vol. 72, o. 360, pg , 1977 [2] S. L. Sclove, The Occurrece of Fgerprt Characterstcs as a Two Dmesoal Process, Joural of Amerca Statstcal Assocato, Vol. 74, o. 367, pp , 1979 [3] James F. Cowger, Frcto Rdge Sk: Comparso ad Idetfcato of Fgerprts, Elsever Scece Publshg Co. Ic., ew York, ew York, [4] The oble Qur a: I the Eglsh Laguage, Dr. Muhammad Taq-u-D Al-Hlal. Ryadh, Housto, Lahore: Darussalam Publshers ad Dstrbutors, [5] DA Fgerprtg. The Columba Ecyclopeda, Sxth Edto. ew York: Columba Uversty Press, 2003 [6] Sharath Pakat, et al., O the Idvdualty of Fgerprts [7] Robert Epste, Fgerprts Meet Daubert: The Myth of Fgerprt Scece s Revealed, Souther Calfora Law Revew, Vol. 75, pp , 2002 [8] Athoy J. F. Grffths, Moder Geetc Aalyss, W. H. Freema ad Compay, ew York, Mew York, 2002.
CHAPTER VI Statistical Analysis of Experimental Data
Chapter VI Statstcal Aalyss of Expermetal Data CHAPTER VI Statstcal Aalyss of Expermetal Data Measuremets do ot lead to a uque value. Ths s a result of the multtude of errors (maly radom errors) that ca
More informationChapter 8. Inferences about More Than Two Population Central Values
Chapter 8. Ifereces about More Tha Two Populato Cetral Values Case tudy: Effect of Tmg of the Treatmet of Port-We tas wth Lasers ) To vestgate whether treatmet at a youg age would yeld better results tha
More informationLecture Notes Types of economic variables
Lecture Notes 3 1. Types of ecoomc varables () Cotuous varable takes o a cotuum the sample space, such as all pots o a le or all real umbers Example: GDP, Polluto cocetrato, etc. () Dscrete varables fte
More informationEconometric Methods. Review of Estimation
Ecoometrc Methods Revew of Estmato Estmatg the populato mea Radom samplg Pot ad terval estmators Lear estmators Ubased estmators Lear Ubased Estmators (LUEs) Effcecy (mmum varace) ad Best Lear Ubased Estmators
More informationIntroduction to local (nonparametric) density estimation. methods
Itroducto to local (oparametrc) desty estmato methods A slecture by Yu Lu for ECE 66 Sprg 014 1. Itroducto Ths slecture troduces two local desty estmato methods whch are Parze desty estmato ad k-earest
More informationLecture 9: Tolerant Testing
Lecture 9: Tolerat Testg Dael Kae Scrbe: Sakeerth Rao Aprl 4, 07 Abstract I ths lecture we prove a quas lear lower boud o the umber of samples eeded to do tolerat testg for L dstace. Tolerat Testg We have
More informationLecture 3. Sampling, sampling distributions, and parameter estimation
Lecture 3 Samplg, samplg dstrbutos, ad parameter estmato Samplg Defto Populato s defed as the collecto of all the possble observatos of terest. The collecto of observatos we take from the populato s called
More informationChapter Statistics Background of Regression Analysis
Chapter 06.0 Statstcs Backgroud of Regresso Aalyss After readg ths chapter, you should be able to:. revew the statstcs backgroud eeded for learg regresso, ad. kow a bref hstory of regresso. Revew of Statstcal
More informationSummary of the lecture in Biostatistics
Summary of the lecture Bostatstcs Probablty Desty Fucto For a cotuos radom varable, a probablty desty fucto s a fucto such that: 0 dx a b) b a dx A probablty desty fucto provdes a smple descrpto of the
More informationModule 7: Probability and Statistics
Lecture 4: Goodess of ft tests. Itroducto Module 7: Probablty ad Statstcs I the prevous two lectures, the cocepts, steps ad applcatos of Hypotheses testg were dscussed. Hypotheses testg may be used to
More informationbest estimate (mean) for X uncertainty or error in the measurement (systematic, random or statistical) best
Error Aalyss Preamble Wheever a measuremet s made, the result followg from that measuremet s always subject to ucertaty The ucertaty ca be reduced by makg several measuremets of the same quatty or by mprovg
More informationSimple Linear Regression
Statstcal Methods I (EST 75) Page 139 Smple Lear Regresso Smple regresso applcatos are used to ft a model descrbg a lear relatoshp betwee two varables. The aspects of least squares regresso ad correlato
More informationTo use adaptive cluster sampling we must first make some definitions of the sampling universe:
8.3 ADAPTIVE SAMPLING Most of the methods dscussed samplg theory are lmted to samplg desgs hch the selecto of the samples ca be doe before the survey, so that oe of the decsos about samplg deped ay ay
More informationFunctions of Random Variables
Fuctos of Radom Varables Chapter Fve Fuctos of Radom Varables 5. Itroducto A geeral egeerg aalyss model s show Fg. 5.. The model output (respose) cotas the performaces of a system or product, such as weght,
More informationOrdinary Least Squares Regression. Simple Regression. Algebra and Assumptions.
Ordary Least Squares egresso. Smple egresso. Algebra ad Assumptos. I ths part of the course we are gog to study a techque for aalysg the lear relatoshp betwee two varables Y ad X. We have pars of observatos
More informationEstimation of Stress- Strength Reliability model using finite mixture of exponential distributions
Iteratoal Joural of Computatoal Egeerg Research Vol, 0 Issue, Estmato of Stress- Stregth Relablty model usg fte mxture of expoetal dstrbutos K.Sadhya, T.S.Umamaheswar Departmet of Mathematcs, Lal Bhadur
More informationMEASURES OF DISPERSION
MEASURES OF DISPERSION Measure of Cetral Tedecy: Measures of Cetral Tedecy ad Dsperso ) Mathematcal Average: a) Arthmetc mea (A.M.) b) Geometrc mea (G.M.) c) Harmoc mea (H.M.) ) Averages of Posto: a) Meda
More informationENGI 4421 Joint Probability Distributions Page Joint Probability Distributions [Navidi sections 2.5 and 2.6; Devore sections
ENGI 441 Jot Probablty Dstrbutos Page 7-01 Jot Probablty Dstrbutos [Navd sectos.5 ad.6; Devore sectos 5.1-5.] The jot probablty mass fucto of two dscrete radom quattes, s, P ad p x y x y The margal probablty
More informationLecture 3 Probability review (cont d)
STATS 00: Itroducto to Statstcal Iferece Autum 06 Lecture 3 Probablty revew (cot d) 3. Jot dstrbutos If radom varables X,..., X k are depedet, the ther dstrbuto may be specfed by specfyg the dvdual dstrbuto
More information2.28 The Wall Street Journal is probably referring to the average number of cubes used per glass measured for some population that they have chosen.
.5 x 54.5 a. x 7. 786 7 b. The raked observatos are: 7.4, 7.5, 7.7, 7.8, 7.9, 8.0, 8.. Sce the sample sze 7 s odd, the meda s the (+)/ 4 th raked observato, or meda 7.8 c. The cosumer would more lkely
More informationhp calculators HP 30S Statistics Averages and Standard Deviations Average and Standard Deviation Practice Finding Averages and Standard Deviations
HP 30S Statstcs Averages ad Stadard Devatos Average ad Stadard Devato Practce Fdg Averages ad Stadard Devatos HP 30S Statstcs Averages ad Stadard Devatos Average ad stadard devato The HP 30S provdes several
More information{ }{ ( )} (, ) = ( ) ( ) ( ) Chapter 14 Exercises in Sampling Theory. Exercise 1 (Simple random sampling): Solution:
Chapter 4 Exercses Samplg Theory Exercse (Smple radom samplg: Let there be two correlated radom varables X ad A sample of sze s draw from a populato by smple radom samplg wthout replacemet The observed
More informationMultiple Linear Regression Analysis
LINEA EGESSION ANALYSIS MODULE III Lecture - 4 Multple Lear egresso Aalyss Dr. Shalabh Departmet of Mathematcs ad Statstcs Ida Isttute of Techology Kapur Cofdece terval estmato The cofdece tervals multple
More informationMean is only appropriate for interval or ratio scales, not ordinal or nominal.
Mea Same as ordary average Sum all the data values ad dvde by the sample sze. x = ( x + x +... + x Usg summato otato, we wrte ths as x = x = x = = ) x Mea s oly approprate for terval or rato scales, ot
More informationChapter 5 Properties of a Random Sample
Lecture 6 o BST 63: Statstcal Theory I Ku Zhag, /0/008 Revew for the prevous lecture Cocepts: t-dstrbuto, F-dstrbuto Theorems: Dstrbutos of sample mea ad sample varace, relatoshp betwee sample mea ad sample
More informationPROPERTIES OF GOOD ESTIMATORS
ESTIMATION INTRODUCTION Estmato s the statstcal process of fdg a appromate value for a populato parameter. A populato parameter s a characterstc of the dstrbuto of a populato such as the populato mea,
More informationPTAS for Bin-Packing
CS 663: Patter Matchg Algorthms Scrbe: Che Jag /9/00. Itroducto PTAS for B-Packg The B-Packg problem s NP-hard. If we use approxmato algorthms, the B-Packg problem could be solved polyomal tme. For example,
More informationBounds on the expected entropy and KL-divergence of sampled multinomial distributions. Brandon C. Roy
Bouds o the expected etropy ad KL-dvergece of sampled multomal dstrbutos Brado C. Roy bcroy@meda.mt.edu Orgal: May 18, 2011 Revsed: Jue 6, 2011 Abstract Iformato theoretc quattes calculated from a sampled
More informationThe Mathematical Appendix
The Mathematcal Appedx Defto A: If ( Λ, Ω, where ( λ λ λ whch the probablty dstrbutos,,..., Defto A. uppose that ( Λ,,..., s a expermet type, the σ-algebra o λ λ λ are defed s deoted by ( (,,...,, σ Ω.
More informationTHE ROYAL STATISTICAL SOCIETY HIGHER CERTIFICATE
THE ROYAL STATISTICAL SOCIETY 00 EXAMINATIONS SOLUTIONS HIGHER CERTIFICATE PAPER I STATISTICAL THEORY The Socety provdes these solutos to assst caddates preparg for the examatos future years ad for the
More informationLecture 1 Review of Fundamental Statistical Concepts
Lecture Revew of Fudametal Statstcal Cocepts Measures of Cetral Tedecy ad Dsperso A word about otato for ths class: Idvduals a populato are desgated, where the dex rages from to N, ad N s the total umber
More informationSome Notes on the Probability Space of Statistical Surveys
Metodološk zvezk, Vol. 7, No., 200, 7-2 ome Notes o the Probablty pace of tatstcal urveys George Petrakos Abstract Ths paper troduces a formal presetato of samplg process usg prcples ad cocepts from Probablty
More informationUNIVERSITY OF OSLO DEPARTMENT OF ECONOMICS
UNIVERSITY OF OSLO DEPARTMENT OF ECONOMICS Exam: ECON430 Statstcs Date of exam: Frday, December 8, 07 Grades are gve: Jauary 4, 08 Tme for exam: 0900 am 00 oo The problem set covers 5 pages Resources allowed:
More informationChapter 3 Sampling For Proportions and Percentages
Chapter 3 Samplg For Proportos ad Percetages I may stuatos, the characterstc uder study o whch the observatos are collected are qualtatve ature For example, the resposes of customers may marketg surveys
More informationThe number of observed cases The number of parameters. ith case of the dichotomous dependent variable. the ith case of the jth parameter
LOGISTIC REGRESSION Notato Model Logstc regresso regresses a dchotomous depedet varable o a set of depedet varables. Several methods are mplemeted for selectg the depedet varables. The followg otato s
More informationDimensionality Reduction and Learning
CMSC 35900 (Sprg 009) Large Scale Learg Lecture: 3 Dmesoalty Reducto ad Learg Istructors: Sham Kakade ad Greg Shakharovch L Supervsed Methods ad Dmesoalty Reducto The theme of these two lectures s that
More informationChapter 14 Logistic Regression Models
Chapter 4 Logstc Regresso Models I the lear regresso model X β + ε, there are two types of varables explaatory varables X, X,, X k ad study varable y These varables ca be measured o a cotuous scale as
More informationTHE ROYAL STATISTICAL SOCIETY GRADUATE DIPLOMA
THE ROYAL STATISTICAL SOCIETY EXAMINATIONS SOLUTIONS GRADUATE DIPLOMA PAPER II STATISTICAL THEORY & METHODS The Socety provdes these solutos to assst caddates preparg for the examatos future years ad for
More informationChapter 13 Student Lecture Notes 13-1
Chapter 3 Studet Lecture Notes 3- Basc Busess Statstcs (9 th Edto) Chapter 3 Smple Lear Regresso 4 Pretce-Hall, Ic. Chap 3- Chapter Topcs Types of Regresso Models Determg the Smple Lear Regresso Equato
More informationChapter 9 Jordan Block Matrices
Chapter 9 Jorda Block atrces I ths chapter we wll solve the followg problem. Gve a lear operator T fd a bass R of F such that the matrx R (T) s as smple as possble. f course smple s a matter of taste.
More informationStatistics Descriptive and Inferential Statistics. Instructor: Daisuke Nagakura
Statstcs Descrptve ad Iferetal Statstcs Istructor: Dasuke Nagakura (agakura@z7.keo.jp) 1 Today s topc Today, I talk about two categores of statstcal aalyses, descrptve statstcs ad feretal statstcs, ad
More informationObjectives of Multiple Regression
Obectves of Multple Regresso Establsh the lear equato that best predcts values of a depedet varable Y usg more tha oe eplaator varable from a large set of potetal predctors {,,... k }. Fd that subset of
More informationESS Line Fitting
ESS 5 014 17. Le Fttg A very commo problem data aalyss s lookg for relatoshpetwee dfferet parameters ad fttg les or surfaces to data. The smplest example s fttg a straght le ad we wll dscuss that here
More information1 Onto functions and bijections Applications to Counting
1 Oto fuctos ad bectos Applcatos to Coutg Now we move o to a ew topc. Defto 1.1 (Surecto. A fucto f : A B s sad to be surectve or oto f for each b B there s some a A so that f(a B. What are examples of
More informationBayes (Naïve or not) Classifiers: Generative Approach
Logstc regresso Bayes (Naïve or ot) Classfers: Geeratve Approach What do we mea by Geeratve approach: Lear p(y), p(x y) ad the apply bayes rule to compute p(y x) for makg predctos Ths s essetally makg
More informationf f... f 1 n n (ii) Median : It is the value of the middle-most observation(s).
CHAPTER STATISTICS Pots to Remember :. Facts or fgures, collected wth a defte pupose, are called Data.. Statstcs s the area of study dealg wth the collecto, presetato, aalyss ad terpretato of data.. The
More informationLecture 8: Linear Regression
Lecture 8: Lear egresso May 4, GENOME 56, Sprg Goals Develop basc cocepts of lear regresso from a probablstc framework Estmatg parameters ad hypothess testg wth lear models Lear regresso Su I Lee, CSE
More informationFeature Selection: Part 2. 1 Greedy Algorithms (continued from the last lecture)
CSE 546: Mache Learg Lecture 6 Feature Selecto: Part 2 Istructor: Sham Kakade Greedy Algorthms (cotued from the last lecture) There are varety of greedy algorthms ad umerous amg covetos for these algorthms.
More informationA Combination of Adaptive and Line Intercept Sampling Applicable in Agricultural and Environmental Studies
ISSN 1684-8403 Joural of Statstcs Volume 15, 008, pp. 44-53 Abstract A Combato of Adaptve ad Le Itercept Samplg Applcable Agrcultural ad Evrometal Studes Azmer Kha 1 A adaptve procedure s descrbed for
More information(Monte Carlo) Resampling Technique in Validity Testing and Reliability Testing
Iteratoal Joural of Computer Applcatos (0975 8887) (Mote Carlo) Resamplg Techque Valdty Testg ad Relablty Testg Ad Setawa Departmet of Mathematcs, Faculty of Scece ad Mathematcs, Satya Wacaa Chrsta Uversty
More informationFor combinatorial problems we might need to generate all permutations, combinations, or subsets of a set.
Addtoal Decrease ad Coquer Algorthms For combatoral problems we mght eed to geerate all permutatos, combatos, or subsets of a set. Geeratg Permutatos If we have a set f elemets: { a 1, a 2, a 3, a } the
More informationSPECIAL CONSIDERATIONS FOR VOLUMETRIC Z-TEST FOR PROPORTIONS
SPECIAL CONSIDERAIONS FOR VOLUMERIC Z-ES FOR PROPORIONS Oe s stctve reacto to the questo of whether two percetages are sgfcatly dfferet from each other s to treat them as f they were proportos whch the
More informationChapter 4 (Part 1): Non-Parametric Classification (Sections ) Pattern Classification 4.3) Announcements
Aoucemets No-Parametrc Desty Estmato Techques HW assged Most of ths lecture was o the blacboard. These sldes cover the same materal as preseted DHS Bometrcs CSE 90-a Lecture 7 CSE90a Fall 06 CSE90a Fall
More informationLecture 2 - What are component and system reliability and how it can be improved?
Lecture 2 - What are compoet ad system relablty ad how t ca be mproved? Relablty s a measure of the qualty of the product over the log ru. The cocept of relablty s a exteded tme perod over whch the expected
More informationIdeal multigrades with trigonometric coefficients
Ideal multgrades wth trgoometrc coeffcets Zarathustra Brady December 13, 010 1 The problem A (, k) multgrade s defed as a par of dstct sets of tegers such that (a 1,..., a ; b 1,..., b ) a j = =1 for all
More informationBootstrap Method for Testing of Equality of Several Coefficients of Variation
Cloud Publcatos Iteratoal Joural of Advaced Mathematcs ad Statstcs Volume, pp. -6, Artcle ID Sc- Research Artcle Ope Access Bootstrap Method for Testg of Equalty of Several Coeffcets of Varato Dr. Navee
More information5 Short Proofs of Simplified Stirling s Approximation
5 Short Proofs of Smplfed Strlg s Approxmato Ofr Gorodetsky, drtymaths.wordpress.com Jue, 20 0 Itroducto Strlg s approxmato s the followg (somewhat surprsg) approxmato of the factoral,, usg elemetary fuctos:
More informationChapter -2 Simple Random Sampling
Chapter - Smple Radom Samplg Smple radom samplg (SRS) s a method of selecto of a sample comprsg of umber of samplg uts out of the populato havg umber of samplg uts such that every samplg ut has a equal
More informationChapter -2 Simple Random Sampling
Chapter - Smple Radom Samplg Smple radom samplg (SRS) s a method of selecto of a sample comprsg of umber of samplg uts out of the populato havg umber of samplg uts such that every samplg ut has a equal
More informationRandom Variables and Probability Distributions
Radom Varables ad Probablty Dstrbutos * If X : S R s a dscrete radom varable wth rage {x, x, x 3,. } the r = P (X = xr ) = * Let X : S R be a dscrete radom varable wth rage {x, x, x 3,.}.If x r P(X = x
More informationis the score of the 1 st student, x
8 Chapter Collectg, Dsplayg, ad Aalyzg your Data. Descrptve Statstcs Sectos explaed how to choose a sample, how to collect ad orgaze data from the sample, ad how to dsplay your data. I ths secto, you wll
More informationCorrelation and Regression Analysis
Chapter V Correlato ad Regresso Aalss R. 5.. So far we have cosdered ol uvarate dstrbutos. Ma a tme, however, we come across problems whch volve two or more varables. Ths wll be the subject matter of the
More informationDiscrete Mathematics and Probability Theory Fall 2016 Seshia and Walrand DIS 10b
CS 70 Dscrete Mathematcs ad Probablty Theory Fall 206 Sesha ad Walrad DIS 0b. Wll I Get My Package? Seaky delvery guy of some compay s out delverg packages to customers. Not oly does he had a radom package
More informationMultiple Choice Test. Chapter Adequacy of Models for Regression
Multple Choce Test Chapter 06.0 Adequac of Models for Regresso. For a lear regresso model to be cosdered adequate, the percetage of scaled resduals that eed to be the rage [-,] s greater tha or equal to
More informationApplication of Calibration Approach for Regression Coefficient Estimation under Two-stage Sampling Design
Authors: Pradp Basak, Kaustav Adtya, Hukum Chadra ad U.C. Sud Applcato of Calbrato Approach for Regresso Coeffcet Estmato uder Two-stage Samplg Desg Pradp Basak, Kaustav Adtya, Hukum Chadra ad U.C. Sud
More informationLecture 7. Confidence Intervals and Hypothesis Tests in the Simple CLR Model
Lecture 7. Cofdece Itervals ad Hypothess Tests the Smple CLR Model I lecture 6 we troduced the Classcal Lear Regresso (CLR) model that s the radom expermet of whch the data Y,,, K, are the outcomes. The
More informationChapter 13, Part A Analysis of Variance and Experimental Design. Introduction to Analysis of Variance. Introduction to Analysis of Variance
Chapter, Part A Aalyss of Varace ad Epermetal Desg Itroducto to Aalyss of Varace Aalyss of Varace: Testg for the Equalty of Populato Meas Multple Comparso Procedures Itroducto to Aalyss of Varace Aalyss
More informationMedian as a Weighted Arithmetic Mean of All Sample Observations
Meda as a Weghted Arthmetc Mea of All Sample Observatos SK Mshra Dept. of Ecoomcs NEHU, Shllog (Ida). Itroducto: Iumerably may textbooks Statstcs explctly meto that oe of the weakesses (or propertes) of
More informationTESTS BASED ON MAXIMUM LIKELIHOOD
ESE 5 Toy E. Smth. The Basc Example. TESTS BASED ON MAXIMUM LIKELIHOOD To llustrate the propertes of maxmum lkelhood estmates ad tests, we cosder the smplest possble case of estmatg the mea of the ormal
More informationUNIVERSITY OF OSLO DEPARTMENT OF ECONOMICS
UNIVERSITY OF OSLO DEPARTMENT OF ECONOMICS Postpoed exam: ECON430 Statstcs Date of exam: Jauary 0, 0 Tme for exam: 09:00 a.m. :00 oo The problem set covers 5 pages Resources allowed: All wrtte ad prted
More informationDescriptive Statistics
Page Techcal Math II Descrptve Statstcs Descrptve Statstcs Descrptve statstcs s the body of methods used to represet ad summarze sets of data. A descrpto of how a set of measuremets (for eample, people
More informationCHAPTER 4 RADICAL EXPRESSIONS
6 CHAPTER RADICAL EXPRESSIONS. The th Root of a Real Number A real umber a s called the th root of a real umber b f Thus, for example: s a square root of sce. s also a square root of sce ( ). s a cube
More informationInvestigating Cellular Automata
Researcher: Taylor Dupuy Advsor: Aaro Wootto Semester: Fall 4 Ivestgatg Cellular Automata A Overvew of Cellular Automata: Cellular Automata are smple computer programs that geerate rows of black ad whte
More informationSTATISTICAL PROPERTIES OF LEAST SQUARES ESTIMATORS. x, where. = y - ˆ " 1
STATISTICAL PROPERTIES OF LEAST SQUARES ESTIMATORS Recall Assumpto E(Y x) η 0 + η x (lear codtoal mea fucto) Data (x, y ), (x 2, y 2 ),, (x, y ) Least squares estmator ˆ E (Y x) ˆ " 0 + ˆ " x, where ˆ
More informationCHAPTER 2. = y ˆ β x (.1022) So we can write
CHAPTER SOLUTIONS TO PROBLEMS. () Let y = GPA, x = ACT, ad = 8. The x = 5.875, y = 3.5, (x x )(y y ) = 5.85, ad (x x ) = 56.875. From equato (.9), we obta the slope as ˆβ = = 5.85/56.875., rouded to four
More informationECONOMETRIC THEORY. MODULE VIII Lecture - 26 Heteroskedasticity
ECONOMETRIC THEORY MODULE VIII Lecture - 6 Heteroskedastcty Dr. Shalabh Departmet of Mathematcs ad Statstcs Ida Isttute of Techology Kapur . Breusch Paga test Ths test ca be appled whe the replcated data
More informationOutline. Point Pattern Analysis Part I. Revisit IRP/CSR
Pot Patter Aalyss Part I Outle Revst IRP/CSR, frst- ad secod order effects What s pot patter aalyss (PPA)? Desty-based pot patter measures Dstace-based pot patter measures Revst IRP/CSR Equal probablty:
More informationPart 4b Asymptotic Results for MRR2 using PRESS. Recall that the PRESS statistic is a special type of cross validation procedure (see Allen (1971))
art 4b Asymptotc Results for MRR usg RESS Recall that the RESS statstc s a specal type of cross valdato procedure (see Alle (97)) partcular to the regresso problem ad volves fdg Y $,, the estmate at the
More informationPoint Estimation: definition of estimators
Pot Estmato: defto of estmators Pot estmator: ay fucto W (X,..., X ) of a data sample. The exercse of pot estmato s to use partcular fuctos of the data order to estmate certa ukow populato parameters.
More information12.2 Estimating Model parameters Assumptions: ox and y are related according to the simple linear regression model
1. Estmatg Model parameters Assumptos: ox ad y are related accordg to the smple lear regresso model (The lear regresso model s the model that says that x ad y are related a lear fasho, but the observed
More informationStatistics MINITAB - Lab 5
Statstcs 10010 MINITAB - Lab 5 PART I: The Correlato Coeffcet Qute ofte statstcs we are preseted wth data that suggests that a lear relatoshp exsts betwee two varables. For example the plot below s of
More informationMultiple Regression. More than 2 variables! Grade on Final. Multiple Regression 11/21/2012. Exam 2 Grades. Exam 2 Re-grades
STAT 101 Dr. Kar Lock Morga 11/20/12 Exam 2 Grades Multple Regresso SECTIONS 9.2, 10.1, 10.2 Multple explaatory varables (10.1) Parttog varablty R 2, ANOVA (9.2) Codtos resdual plot (10.2) Trasformatos
More informationThe equation is sometimes presented in form Y = a + b x. This is reasonable, but it s not the notation we use.
INTRODUCTORY NOTE ON LINEAR REGREION We have data of the form (x y ) (x y ) (x y ) These wll most ofte be preseted to us as two colum of a spreadsheet As the topc develops we wll see both upper case ad
More information1. Overview of basic probability
13.42 Desg Prcples for Ocea Vehcles Prof. A.H. Techet Sprg 2005 1. Overvew of basc probablty Emprcally, probablty ca be defed as the umber of favorable outcomes dvded by the total umber of outcomes, other
More informationClass 13,14 June 17, 19, 2015
Class 3,4 Jue 7, 9, 05 Pla for Class3,4:. Samplg dstrbuto of sample mea. The Cetral Lmt Theorem (CLT). Cofdece terval for ukow mea.. Samplg Dstrbuto for Sample mea. Methods used are based o CLT ( Cetral
More informationESTIMATION OF MISCLASSIFICATION ERROR USING BAYESIAN CLASSIFIERS
Producto Systems ad Iformato Egeerg Volume 5 (2009), pp. 4-50. ESTIMATION OF MISCLASSIFICATION ERROR USING BAYESIAN CLASSIFIERS PÉTER BARABÁS Uversty of Msolc, Hugary Departmet of Iformato Techology barabas@t.u-msolc.hu
More informationEvaluating Polynomials
Uverst of Nebraska - Lcol DgtalCommos@Uverst of Nebraska - Lcol MAT Exam Expostor Papers Math the Mddle Isttute Partershp 7-7 Evaluatg Polomals Thomas J. Harrgto Uverst of Nebraska-Lcol Follow ths ad addtoal
More informationAssignment 5/MATH 247/Winter Due: Friday, February 19 in class (!) (answers will be posted right after class)
Assgmet 5/MATH 7/Wter 00 Due: Frday, February 9 class (!) (aswers wll be posted rght after class) As usual, there are peces of text, before the questos [], [], themselves. Recall: For the quadratc form
More informationThe Selection Problem - Variable Size Decrease/Conquer (Practice with algorithm analysis)
We have covered: Selecto, Iserto, Mergesort, Bubblesort, Heapsort Next: Selecto the Qucksort The Selecto Problem - Varable Sze Decrease/Coquer (Practce wth algorthm aalyss) Cosder the problem of fdg the
More informationNon-uniform Turán-type problems
Joural of Combatoral Theory, Seres A 111 2005 106 110 wwwelsevercomlocatecta No-uform Turá-type problems DhruvMubay 1, Y Zhao 2 Departmet of Mathematcs, Statstcs, ad Computer Scece, Uversty of Illos at
More informationUNIVERSITY OF EAST ANGLIA. Main Series UG Examination
UNIVERSITY OF EAST ANGLIA School of Ecoomcs Ma Seres UG Examato 03-4 INTRODUCTORY MATHEMATICS AND STATISTICS FOR ECONOMISTS ECO-400Y Tme allowed: 3 hours Aswer ALL questos from both Sectos. Aswer EACH
More informationChapter Two. An Introduction to Regression ( )
ubject: A Itroducto to Regresso Frst tage Chapter Two A Itroducto to Regresso (018-019) 1 pg. ubject: A Itroducto to Regresso Frst tage A Itroducto to Regresso Regresso aalss s a statstcal tool for the
More informationENGI 3423 Simple Linear Regression Page 12-01
ENGI 343 mple Lear Regresso Page - mple Lear Regresso ometmes a expermet s set up where the expermeter has cotrol over the values of oe or more varables X ad measures the resultg values of aother varable
More informationAnalysis of Variance with Weibull Data
Aalyss of Varace wth Webull Data Lahaa Watthaacheewaul Abstract I statstcal data aalyss by aalyss of varace, the usual basc assumptos are that the model s addtve ad the errors are radomly, depedetly, ad
More informationv 1 -periodic 2-exponents of SU(2 e ) and SU(2 e + 1)
Joural of Pure ad Appled Algebra 216 (2012) 1268 1272 Cotets lsts avalable at ScVerse SceceDrect Joural of Pure ad Appled Algebra joural homepage: www.elsever.com/locate/jpaa v 1 -perodc 2-expoets of SU(2
More informationLikelihood Ratio, Wald, and Lagrange Multiplier (Score) Tests. Soccer Goals in European Premier Leagues
Lkelhood Rato, Wald, ad Lagrage Multpler (Score) Tests Soccer Goals Europea Premer Leagues - 4 Statstcal Testg Prcples Goal: Test a Hpothess cocerg parameter value(s) a larger populato (or ature), based
More informationChapter 11 Systematic Sampling
Chapter stematc amplg The sstematc samplg techue s operatoall more coveet tha the smple radom samplg. It also esures at the same tme that each ut has eual probablt of cluso the sample. I ths method of
More information= 1. UCLA STAT 13 Introduction to Statistical Methods for the Life and Health Sciences. Parameters and Statistics. Measures of Centrality
UCLA STAT Itroducto to Statstcal Methods for the Lfe ad Health Sceces Istructor: Ivo Dov, Asst. Prof. of Statstcs ad Neurology Teachg Assstats: Fred Phoa, Krste Johso, Mg Zheg & Matlda Hseh Uversty of
More informationA New Family of Transformations for Lifetime Data
Proceedgs of the World Cogress o Egeerg 4 Vol I, WCE 4, July - 4, 4, Lodo, U.K. A New Famly of Trasformatos for Lfetme Data Lakhaa Watthaacheewakul Abstract A famly of trasformatos s the oe of several
More informationUNIT 4 SOME OTHER SAMPLING SCHEMES
UIT 4 SOE OTHER SAPLIG SCHEES Some Other Samplg Schemes Structure 4. Itroducto Objectves 4. Itroducto to Systematc Samplg 4.3 ethods of Systematc Samplg Lear Systematc Samplg Crcular Systematc Samplg Advatages
More information