Dustin Fink (dsfink) and Katharina Roesler (roesler) 12/08/2015

Size: px
Start display at page:

Download "Dustin Fink (dsfink) and Katharina Roesler (roesler) 12/08/2015"

Transcription

1 Arrest harges (Non)-Independence in Black and White Men An Extension and Application of SimRank to include Network Structure when omputing Black and White Men s Arrest harge Similarities Dustin Fink (dsfink) and Katharina Roesler (roesler) 12/08/ Abstract When people are arrested, they are frequently charged with several offenses at once, yet most criminological research focuses on charges individually, as though they were independent of one another. This project examines how charges relate to each other within a charge-arrestee network, and how charge similarity differs across race. We extend the SimRank algorithm (Jeh and Widom 2002) to efficiently examine the similarity of charge pairs in a large dataset of men arrested in We compare computed SimRank scores of charge pairs within black and white arrestees and evaluate how these measures differ from the more traditional measure of similarity of hi-squared residuals, which does not account for network structure. Our procedure identified certain charges involving drugs,, theft, and assault to have a higher similarity in terms of the network than would be assumed based on their simple co-occurrences, and found that these measures were similar across black and white men. 2 Introduction Most criminological research examines arrest charges independently. That is, studies tend to examine individuals arrested for a certain type of charge, ignoring other charges in the same arrest. However, individuals are often charged with multiple offenses during a single arrest. It is of interest which charges co-occur frequently and whether co-occurrence patterns vary by arrestee race. For instance, certain patterns of co-occurrence and their interaction with arrestee race may indicate biased policing. That is, if stereotypically black charges are related to a more diverse set of other charges for black rather than white arrestees, it may be that Black individuals are under heightened police scrutiny for these offenses. In addition, if charges for which officers have relatively greater discretion to arrest individuals are related to a more diverse set of charges for black than for white arrestees, then officers may be over-arresting black men or under-arresting white men. The former was certainly the case in Ferguson, Missouri, where officers disproportionately charged black residents with a variety of misdemeanors, such as manner of walking and failure to comply (U.S. DOJ 2015, 67). One can obtain a baseline measure of similarity by examining how often two charges are present in a single arrest. However, we can extend this concept of similarity by considering

2 charges more similar when people arrested for both are more similar along other dimensions as well. This measure of similarity is given by the SimRank algorithm. We begin with a discussion of SimRank, extend it to run efficiently on our data, and give a procedure for determining which SimRank scores are significant due to network structure as opposed to the frequency of charges or simple co-occurrences. 3 SimRank: A Measure of Structural-ontext Similarity SimRank (Jeh and Widom 2002) provides an algorithm for assigning similarity between pairs of vertices in any given network based on their structural contexts. Two vertices are considered similar if they are connected to nodes which themselves are similar. This recursive definition of similarity improves upon previously used measures of similarity for networks (which only compared the shared number of neighbors between nodes) by allowing similarity to propagate through the network. SimRank strengthens measures of similarity on the assumption that similarity is a quality that can propagate through a network. If this assumption makes sense for the modeled data, then SimRank is an effective and meaningful measure of similarity based on complex contextual structures. SimRank is quadratic in the size of the graph (and for dense graphs, cubic), so the run-time is not ideal. Run-time improvements can be achieved by pruning and advanced disk-writing techniques, but these either reduce the quality of the results or require more complex engineering. 4 Problem Statement We develop and propose a strategy for identifying which offenses have higher similarity due to their network characteristics. We hypothesize that black and white male arrestees charges have different network structures. More precisely, we hypothesize that within each racial group, different charges are similar to one another, due to different patterns of criminal behavior and policing for each racial group. 5 Strategy 1. alculate SimRank scores between charges, made feasible by our ompressed SimRank algorithm. 2. Standardize the observed SimRank scores by subtracting the SimRank scores from a graph with the same single charge frequencies but randomized co-occurrences. 3. ompare these standardized SimRank scores with the baseline similarity of residuals of a chi-squared test done on co-occurrences to see which charges are more similar when considering network context. 4. Repeat the above restricted to black and white populations, and contrast the findings. 2

3 6 Data The Federal Bureau of Investigation collects data from law enforcement agencies on a monthly basis, producing an annual file titled the National Incident Based Reporting System (NIBRS; National Archive of riminal Justice Data 2013). In this project we examine NIBRS arrestee data from 2013, which contain records for 2,361,308 arrested men. Because NIBRS relies on agencies to voluntarily report their statistics, there may be considerable selection bias in these data. We merge this arrestee-level file with data on law enforcement agencies from the U.S. Department of Justices Law Enforcement Management and Administrative Statistics (LEMAS) survey (U.S. D.O.J. 2013). This dataset allows us to compare arrestees and their charges at the individual level while also incorporating information about their social context. Of the 2.4 million arrested men included in NIBRS data, we exclude records for which the charges are unknown. This results in a dataset of 1,124,876 arrested men (white: 744,889; black: 343,706). 7 Models We take advantage of three models: the arrestee, the profile model, and the random-profile model. 7.1 Arrestee Model We model our data as an undirected, bipartite graph G(O, H, E) where O = Node set of all arrestees H = Node set of all charges E = Edge set linking each arrestees with the corresponding attributes and charges. Each offender is charged with 1-3 charges. 7.2 Profile Model In the profile model, we profile arrestees by their set of charges, and collapse all arrestees with the same profile into one profile node. We represent our compressed network an undirected, bipartite graph G(P, H, E) where P = Node set, where each node represents a profile p with p offenders. H = Node set of all charges E = Edge set linking each profiles with the corresponding attributes and charges. 3

4 The number of observed profiles were 1,995 for the total data, 1,775 for the white subset, and 1,312 for the black subset, out of 16,261 profiles possible profiles. 7.3 Random-Profile Model We generate a random-profile graph that has the same counts of arrestees for each charge, but randomized the arrestees profiles. We generated this graph by iteratively selecting 1-3 charges and adding an arrestee to the corresponding profile while decreasing the corresponding charge counts. The algorithm ran until all charge counts were 0. This produced a graph with a similar number of profiles (total: 2,561 total; white: 2,320; black: 1,963), but a different distribution of charges. 8 Methods 8.1 SimRank Following the original methods for finding SimRank for a bipartite graph with = 1 = 2 = 0.9, for all a, b V, V = H, O, we start by setting { 1, if a = b s 0 (a, b) = 0, if a b We then iterate using the update equation: 1, if a = b s k (a, b) = s k 1 (c, d), if a b I(a) I(b) c I(a) d I(b) until convergence of a maximum number of iterations. We say the algorithm converges when all SimRank scores are within ɛ of the previous iteration. The runtime of this naive approach is is O(K( O 2 d O + H 2 d H )), where K is the number of iterations, and d V is the average value of I(a) I(b) for a, b V. 8.2 ompressed SimRank SimRank is prohibitively large given the number of offender in our dataset. The original SimRank paper suggests pruning as a way to increase computation, but this can decrease the quality of the results. Instead, we can use our profile model along with what we describe here as ompressed SimRank to achieve the correct SimRank scores in a feasible amount of time. ompressed SimRank relies on the fact that any pair of offenders from the same pair of profiles will have the same SimRank score 4

5 s(a, b) = s(a, b ) for a, b p, a, b q, a b and a b since they have the same neighbor sets. We can thus compute SimRank scores on the profile model instead of the offender model. We define the SimRank between two profiles to be the SimRank of two offenders taken from those profiles. The SimRank of a profile with itself is the SimRank of two different offenders from that profile (The SimRank of a profile that contains only one offender with itself does not affect the equations, and thus is calculated as if there were more than one offender). Using our compressed network, we change the initial equation for profiles to be: s 0 (p, q) = 0 and the initial equation for charges to be the same: { 1, if a = b s 0 (a, b) = 0, if a b The update equation for profiles changes to: s k (p, q) = I(p) I(q) and the update equation for charges changes to s k (a, b) = p I(a) q I(b) if a b and s k (a, b) = 1 if a = b. p I(a) q I(b) p q c I(p) d I(q) s k 1 (c, d) p q p q s k 1 (p, q) + ( p 2 p )s k 1 (p, q) + p p I(a) q I(b) p=q The resulting scores for pairs of charges is the same as SimRank. We can also recover the SimRank of a pair of two different offenders from the SimRank of their corresponding profiles. The derivation of these equations is given in Appendix A. The resulting runtime is O(K( P 2 d P + 2 d )) O(K( P 2 )). Since the number of profiles is about 2/1000ths the number of offenders, this runs in much faster time, and finished in 1-2 hours. ompressed SimRank was run on the original data, the black subset, and the white subset, as well as a random-profile graph based on the single charge frequencies of the original data, the black subset, and the white subset. 5

6 9 Evaluation We standardize computed SimRank scores for the observed data by subtracting the Sim- Rank scores for the random-profile graphs. The figures below show the distributions of SimRank scores for each offense for the observed data on the left and the random network s data on the right. SimRank scores for the observed data have greater variance than do those for the random network, indicating that charges are more related to one another than one would expect by chance. The random SimRank scores have a higher mean because the randomization resulted in more profiles with greater sizes, and thus more simulated offenders, but this difference does not affect our evaluation. SimRank Values for Observed Arrests SimRank Values for Random Graph SimRank SimRank operating_gambling aggravated_assault simple_assault credit_card wire_fraud sports_tampering _promotion _purchase stolen_property false_pretenses forcible_sodomy negligent_manslaughter welfare_fraud gambling_equipment Offense drugs_narcotics drug_equipment pocket_picking purse_snatching building_theft theft_from_machine theft_from_auto auto_parts_theft other_theft forcible_rape auto_theft rape_object forcible_fondling statutory_rape operating_gambling aggravated_assault simple_assault credit_card wire_fraud sports_tampering _promotion _purchase stolen_property false_pretenses forcible_sodomy negligent_manslaughter welfare_fraud gambling_equipment Offense In order to evaluate the general accuracy of the computed SimRank measures, we compare charge-pairs similarity scores and hi-squared residuals in the raw, co-occurrence data. For instance, to assess whether White men s relatively high standardized similarity score (0.121) for and drug charges is accurate, we calculate a hi-squared table for the two charges. Given their respective frequencies, their expected co-occurrence is 3,835, while their actual co-occurrence is 7,915. This results in a hi-squared residual of 65.9 for co-occurrence and a general p-value of , indicating that the two charges co-occur much more frequently than expected by chance. Black men have a relatively lower standardized SimRank score for and drug charges (0.080), although this is still well above black men s average standardized SimRank of all charge pairs (-0.149). For black arrested men, the expected co-occurrence of and drug charges is 4,155, while the observed co-occurrence is 7,919. This results in a hi-squared residual of 58.4 and a general p-value of , indicating that the two charges are related, but slightly less so than they are for white men. In order to compare all computed SimRank values to observed co-occurrences simultaneously, we plotted matrices of charge-pairs standardized SimRank scores and hi-squared drugs_narcotics drug_equipment pocket_picking purse_snatching building_theft theft_from_machine theft_from_auto auto_parts_theft other_theft forcible_rape auto_theft rape_object forcible_fondling statutory_rape 6

7 residuals. The figure below on the left below shows our standardized SimRank scores: the difference in SimRank scores between the observed and random networks. It seems that drugs/narcotics and drugs equipment are highly related to one another and many other charges, while most other charge pairs are less related than the random network predicts. Standardized SimRank Observed SimRank Random hi Squared Residuals for o Occurrences operating_gambling aggravated_assault simple_assault credit_card wire_fraud sports_tampering _promotion _purchase stolen_property false_pretenses forcible_sodomy negligent_manslaughter welfare_fraud gambling_equipment drugs_narcotics drug_equipment pocket_picking purse_snatching building_theft theft_from_machine theft_from_auto auto_parts_theft other_theft forcible_rape auto_theft rape_object forcible_fondling statutory_rape operating_gambling aggravated_assault simple_assault credit_card wire_fraud sports_tampering _promotion _purchase stolen_property false_pretenses forcible_sodomy negligent_manslaughter welfare_fraud gambling_equipment drugs_narcotics drug_equipment pocket_picking purse_snatching building_theft theft_from_machine theft_from_auto auto_parts_theft other_theft forcible_rape auto_theft rape_object forcible_fondling statutory_rape operating_gambling aggravated_assault simple_assault credit_card wire_fraud sports_tampering _promotion _purchase stolen_property false_pretenses forcible_sodomy negligent_manslaughter welfare_fraud gambling_equipment drugs_narcotics drug_equipment pocket_picking purse_snatching building_theft theft_from_machine theft_from_auto auto_parts_theft other_theft forcible_rape auto_theft rape_object forcible_fondling statutory_rape operating_gambling aggravated_assault simple_assault credit_card wire_fraud sports_tampering _promotion _purchase stolen_property false_pretenses forcible_sodomy negligent_manslaughter welfare_fraud gambling_equipment drugs_narcotics drug_equipment pocket_picking purse_snatching building_theft theft_from_machine theft_from_auto auto_parts_theft other_theft forcible_rape auto_theft rape_object forcible_fondling A similar pattern emerges with the hi-squared residuals, shown in the figure on the right. Drugs and narcotics charges are highly related to one another, while most other charges are largely unrelated to one another. statutory_rape The figure below on the left compares charge pairs standardized SimRank scores and hisquared residuals. Blue dots represent charge pairs whose hi-squared residual is between -1 and 1, while red dots represent charge pairs with relatively larger hi-squared residuals (in absolute terms). It seems that SimRank closely matches hi-squared residuals for charges that co-occur much more or less often than expected, but does not do so for charge pairs that are relatively unrelated. This makes sense, as network structure impacts SimRank scores more strongly when charges are unrelated to one another. SimRank is thus somewhat independent of charge co-occurrence. The figure below on the right confirms this independence, as charge pairs standardized SimRank scores are only moderately related to the log of their co-occurrences. 7

8 hi Squared Residual by Standardized SimRank Observed Random SimRank hi Squared Residual o Occurrences by Standardized SimRank Score Observed Random SimRank Log of harge o Occurrences In addition to performing these analyses, we examine our data visually to understand how black and white arrestees charges relate to one another. We create a charge-charge network with edges weighted by charges co-occurrence. That is, for each individual arrested for charge 1 and charge 2, the edge between charge 1 and 2 increases by 1 in weight. Please see the figures below, which represent the charge-charge networks for black and white arrested men, with Black men on the left. auto parts theft purse snatching forcible sodomy promotion trespass forcible fondling negligent manslaughter runaway other theft wire fraud drunk family nonv gambling equipment bad check aggravated assault simple assault peeping tom statutory rape theft from auto auto theft sex trafficking dui credit card drugs narcotics rape object other offense building theft pocket picking liquor law drug equipment false pretenses theft from machine operating gambling stolen property loitering forcible rape purchase sports tampering disorder welfare fraud auto parts theft purse snatching forcible sodomy promotion trespass forcible fondling negligent manslaughter runaway other theft wire fraud drunk family nonv gambling equipment bad check aggravated assault forcible simple assault peeping tom statutory rape theft from auto nonforcible auto theft sex trafficking dui credit card drugs narcotics rape object other offense building theft pocket picking liquor law drug equipment false pretenses theft from machine operating gambling stolen property loitering forcible rape purchase sports tampering disorder welfare fraud ommunities are detected by modularity and colored accordingly, and charges with greater 8

9 weighted PageRank have larger labels and are more central to the network. Both black and white arrestees networks have similar communities and charges high in centrality, mirroring their similar SimRank scores. 9.1 Final Results and Discussion We compare the distributions of standardized SimRank scores and hi-squared residuals and find that the following charge pairs are two standard deviations above the mean (are significantly higher in SimRank): drugs equipment, with each of, simple assault, aggravated assault,, other theft drugs narcotics, with each of, other theft, simple assault In addition, the following charge-pairs are six standard deviations below the mean (are relatively lower in SimRank): drugs/narcotics with drugs equipment with promotion gambling operation with forcible sodomy with forcible rape with other theft The implication here is that our procedure identifies an important contribution of the network structure to the similarity of, theft, assault, and drug crimes. The pairs below the mean difference have relatively large standardized SimRank scores due to their high count and/or simple co-occurrence. We present pairs six deviations below the mean due to space constraints, but the tails are similar and the large majority of pairs (96%) are within 2 standard deviations. We calculate the same distributions for subsets of black and white arrestees separately and find similar charge-pairs to be relatively high and low in SimRank as compared to hisquared residuals. Thus we find no support for our hypothesis that charges similarities vary by arrestee race. 10 onclusion We estimate a measure of charge similarity that is related to but also divergent from charges co-occurrences. We compute ompressed SimRank to enhance classical SimRank, and standardize these scores against those of a comparable random graph. High standardized Sim- Rank scores relative to co-occurrence indicate that a pair of charges share similar contexts. We compare these measures for black and white arrestees and find that black and white arrestees charges relate to one another in similar ways. 9

10 11 References Jeh, Glen, and Jennifer Widom. SimRank: a measure of structural-context similarity. Proceedings of the eighth AM SIGKDD international conference on Knowledge discovery and data mining. AM, National Archive of riminal Justice Data. National Incident-Based Reporting System, 2013: Extract Files. IPSR36121-v1. Ann Arbor, MI: Inter-university onsortium for Political and Social Research [distributor], United States Department of Justice (DOJ). Office of Justice Programs. Bureau of Justice Statistics. Law Enforcement Management and Administrative Statistics (LEMAS), 2013, IPSR Ann Arbor, MI: Inter-university onsortium for Political and Social Research (IPSR) [distributor], doi.org/ /ipsr36164.v1. United States Department of Justice (DOJ) Investigation of the Ferguson Police Department. Technical report, United States Department of Justice, ivil Rights Division. 12 Teammate ontributions Dustin: Models and Algorithms, ompressed SimRank, Random-profile generation, Final Results and Discussion Katharina: Data cleaning, Data visualization, Evaluations and evaluation visualization 10

11 The following pages are included for completeness sake and not intended to be graded. Appendix A - Derivation of ompressed SimRank Let G(O, H, E) be an undirected bipartite graph, and G(P, H, E) be the compressed profile version of the graph. Let SimRank for the original graph to be defined by the equations { 1, if a = b s 0 (a, b) = 0, if a b and We take it as fact that 1, if a = b s k (a, b) = s k 1 (c, d), if a b I(a) I(b) c I(a) d I(b) s(a, b) = s(a, b ) for a, b p, a, b q, a b and a b for profiles p, q P. Noting this, we can then define the SimRank between to profiles to be s(p, q) = s(a, b) for any a p, b q, a b. This definition is ill-defined if p = q and p = 1, but this will not end up mattering in our derivation. For the purposes of the rest of the derivation, we let a, b be taken from p, q with a b. Given this definition, our initial values for s(p, q) are 0, since Our update rule for profiles is s k (p, q) = s k (a, b) = s 0 (p, q) = s 0 (a, b) = 0 I(a) I(b) c I(a) d I(b) s k 1 (c, d) Let Î denote the neighbor set in our compressed graph. Because of our compression, we have that I(a) = Î(p) and I(b) = Î(q), so we can re-write this as s k (p, q) = Î(p) Î(q) c ˆ(p) d Î(q) s k 1 (c, d) 11

12 It remains to derive the update equation for charges in ompressed SimRank in terms of the SimRank scores of profiles from the original SimRank update equation. For a pair of charges c, d, if c = d, we still have s k (c, d) = 1. Otherwise, we have s k (c, d) = I(c) I(d) a I(c) b I(d) s k 1 (a, b) We first note that the neighbor set of a charge c is the union of all offenders of all profiles linked to c in the compressed graph: We thus get I(c) = p Î(c) a p a I(c) = p Î(c) a pa = a p a = p and similarly for d with q. The first equality works since the profiles partition the set of offenders. We next consider the sum c Î(p) s k 1 (c, d). get d Î(q) s k 1 (a, b) = s k 1 (a, b) + a p a I(c) b I(d) p q a p b q We can regroup the offenders by profile to p=q b q a b s k 1 (a, b) + a p b q a=b s k 1 (a, b) Since all SimRank scores are the same for a pair of offenders from the same pair of profiles, we can replace the inner sums with products of the SimRank of the profiles and the profile sizes. 12

13 a I(c) b I(d) s k 1 (a, b) = s k 1 (a, b) + a p p q p q a p b q p=q p=q b q a b s k 1 (a, b) + a p b q a=b s k 1 (a, b) = p q s k 1 (p, q) + [ ( p 2 p )s k 1 (p, q) + p 2 (1) ] We separate the terms like this to account for the case when p = q and a = b, since the value of s(a, b) in this case is 1 and not s(p, q). Note that if p = q and p = 1, p 2 p = 0 and so the undefined value of s(p, q) in this case does not matter. Putting it all together, we get s k (c, d) = I(c) I(d) a I(c) b I(d) s k 1 (a, b) = p q s p p Î(d) q k 1 (p, q) + [ ( p 2 p )s k 1 (p, q) + p 2 (1) ] p q p q p=q p=q = p q s k 1 (p, q) + [ ( p 2 p )s k 1 (p, q) + p 2 (1) ] p q 13

Belton Police Department

Belton Police Department Belton Police Department Uniform Crime Report for June 2013 RETURN-A - MONTHLY RETURN OF OFFENSES KNOWN TO THE POLICE 1 Classification of offenses 1 Criminal Homicide A Murder/nonnegligent homicide 2 Offenses

More information

LEWISVILLE POLICE DEPARTMENT

LEWISVILLE POLICE DEPARTMENT STATISTICAL REPORT FOR THE MONTH OF RUSSELL KERBOW CHIEF OF POLICE PREPARED BY: ROBIN BRIZENDINE PUBLIC SAFETY DATA TECHNICIAN DATE PREPARED: September 20, 2017 ORI: TX0610600 AGENCY: LEWISVILLE STATE:

More information

U.C. Davis FBI Part I & Part II Crime Offenses 2008 to 2010

U.C. Davis FBI Part I & Part II Crime Offenses 2008 to 2010 U.C. Davis FBI Part I & Part II Crime Offenses 2008 to 2010 2010 PART I OFFENSES 2008 Number of Actual Offenses 2009 Number of Actual Offenses 2010 Number of Actual Offenses 2009 to 2010 Percent Change

More information

FBI Part I & Part II Crime Offenses Arrests Miscellaneous Activity Value of Stolen Property Crime Pie Charts Crime Line Charts Crime Rate Charts

FBI Part I & Part II Crime Offenses Arrests Miscellaneous Activity Value of Stolen Property Crime Pie Charts Crime Line Charts Crime Rate Charts U.C. Davis Medical Center Crime Statistics (Medical Center) PDF Version FBI Part I & Part II Crime Offenses Arrests Miscellaneous Activity Value of Stolen Property Crime Pie Charts Crime Line Charts Crime

More information

UC POLICE DEPARTMENT REPORTS DASHBOARD

UC POLICE DEPARTMENT REPORTS DASHBOARD UC POLICE DEPARTMENT REPORTS DASHBOARD UC MERCED Annual 1. UC Merced FBI Part I Crime Offenses 2 2. UC Merced FBI Part II Crime Offenses 3 3. UC Merced Arrests - FBI Crime Offenses 4 4. UC Merced Value

More information

All data is subject to further review and change.

All data is subject to further review and change. All data is subject to further review and change. 5500 5000 4500 32% Increase Number or Incidents 4000 3500 3000 2500 2000 1500 21% Decrease 3742 4936 2008 2009 1000 500 0 1052 Part I 827 Part II 800 700

More information

Percent Change. Last YTD

Percent Change. Last YTD Part I L Month Month HOMICIDE 1 0 5 3 66.7% 1 4 3 100.0% 80.0% 100.0% 0 0 0 2 6 8 MURDER 0 0 4 1 300.0% 0 3 1-75.0% 100.0% 0 0 0 2 6 8 MANSLAUGHTER 0 0 0 2-100.0% 0 0 2 - - 100.0% 0 0 0 0 0 0 NEG. HOMICIDE

More information

Medford Police Department April 2018 Consolidated Incident Report(includes all nature of call codes)

Medford Police Department April 2018 Consolidated Incident Report(includes all nature of call codes) Part I L Month Month HOMICIDE 0 0 2 1 100.0% 0 3 1-150.0% 100.0% 0 0 0 0 5 5 MURDER 0 0 2 1 100.0% 0 2 1-100.0% 100.0% 0 0 0 0 2 2 MANSLAUGHTER 0 0 0 0 0% 0 0 0 - - - 0 0 0 0 0 0 NEG. HOMICIDE - TRAFFIC

More information

Percent Change. Last YTD MANSLAUGHTER % % 100.0% JUST. HOMICIDE %

Percent Change. Last YTD MANSLAUGHTER % % 100.0% JUST. HOMICIDE % Part I L Month Month HOMICIDE 1 0 3 1 200.0% 2 4 1 200.0% 133.3% 100.0% 0 2 2 0 4 4 MURDER 0 0 1 1-1 2 1-200.0% 100.0% 0 0 0 0 1 1 MANSLAUGHTER 1 0 2 0 200% 1 2 0 100.0% 100.0% - 0 2 2 0 3 3 JUST. HOMICIDE

More information

Crime and Fire Statistics

Crime and Fire Statistics Crime and Fire Statistics Monmouth University Police Department Crime Statistics Murder Negligent Manslaughter Forcible Sex Offenses Rape Criminal Sexual Contact Non-Forced Sex Offenses Incest Statutory

More information

Dispatched / Callback Calls 1025 MEET FELONY WARRANT

Dispatched / Callback Calls 1025 MEET FELONY WARRANT CITY OF Dispatched, Callback, and Self-Initiated for Service to 19th Ave & Northern Area (Based on Addresses of 19th Av & Northern, Northern & 19th Av, 1800-2000 W Northern, or 7800-8100 N 19th Av) 01/01/2014-06/30/2017

More information

City of Newcastle Police Services Report. Third Quarter 2017

City of Newcastle Police Services Report. Third Quarter 2017 City of Newcastle Police Services Report Third Quarter 2017 Prepared by: The Research, Planning and Information Services Unit of the King County Sheriff s Office CITY OF NEWCASTLE CHIEF OF POLICE SERVICES

More information

Crime and Fire Statistics

Crime and Fire Statistics Monmouth University Police Department Crime Statistics Murder Negligent Manslaughter Forcible Sex Offenses Rape Criminal Sexual Contact Non-Forced Sex Offenses Incest Statutory Rape Robbery Aggravated

More information

Policy and Procedures For Issuing Timely Warnings

Policy and Procedures For Issuing Timely Warnings The Pennsylvania State University Policy and Procedures For Issuing Timely Warnings I. Purpose The purpose of this Policy is to establish a process by which The Pennsylvania State University (hereinafter

More information

ITT Technical Institute, Salem, Oregon Safety and Security Policies with Crime Statistics Report CRIME STATISTICS

ITT Technical Institute, Salem, Oregon Safety and Security Policies with Crime Statistics Report CRIME STATISTICS ITT Technical Institute, Salem, Oregon Safety and Security Policies with Crime Statistics Report CRIME STATISTICS In each of the specified calendar years, the following number of crimes were reported to

More information

Communities Via Laplacian Matrices. Degree, Adjacency, and Laplacian Matrices Eigenvectors of Laplacian Matrices

Communities Via Laplacian Matrices. Degree, Adjacency, and Laplacian Matrices Eigenvectors of Laplacian Matrices Communities Via Laplacian Matrices Degree, Adjacency, and Laplacian Matrices Eigenvectors of Laplacian Matrices The Laplacian Approach As with betweenness approach, we want to divide a social graph into

More information

Montana Marijuana Arrests

Montana Marijuana Arrests Working to Reform Marijuana Laws The NORML Almanac of Marijuana Arrest Statistics Montana Marijuana Arrests Marijuana Arrests 1997-2002 (Summary) Marijuana Possession Arrests-2002 (Demographics) Marijuana

More information

Major Crime Map Help Documentation

Major Crime Map Help Documentation Major Crime Map Help Documentation This web application is designed to make it easier to visualize and understand crime trends in Overland Park. The data for this application are generally limited to relatively

More information

Environmental Criminology

Environmental Criminology ENVIRONMENTAL CRIMINOLOGY AND THE IMPACT OF PROBLEM BUSINESSES ON CRIME IN PORTLAND NEIGHBORHOODS Presented by Ryan Arnold Hector Osuna Allen Byrd Environmental Criminology Annually crime plays a large

More information

Chicago Crime Category Classification (CCCC)

Chicago Crime Category Classification (CCCC) Chicago Crime Category Classification (CCCC) Sahil Agarwal 1, Shalini Kedlaya 2 and Ujjwal Gulecha 3 Abstract It is very important and crucial to keep a city safe from crime. If the police could be given

More information

Texas Christian University

Texas Christian University Texas Christian University Annual Campus Security Report CRIME STATISTICS October 2018 Texas Christian University Police Department 3025 Lubbock Avenue, Fort Worth, Texas 76129 police.tcu.edu TCU CRIME

More information

Node similarity and classification

Node similarity and classification Node similarity and classification Davide Mottin, Anton Tsitsulin HassoPlattner Institute Graph Mining course Winter Semester 2017 Acknowledgements Some part of this lecture is taken from: http://web.eecs.umich.edu/~dkoutra/tut/icdm14.html

More information

Seasonality in recorded crime: preliminary findings

Seasonality in recorded crime: preliminary findings Seasonality in recorded crime: preliminary findings Celia Hird Chandni Ruparel Home Office Online Report /7 The views expressed in this report are those of the authors, not necessarily those of the Home

More information

Analysis of Violent Crime in Los Angeles County

Analysis of Violent Crime in Los Angeles County Analysis of Violent Crime in Los Angeles County Xiaohong Huang UID: 004693375 March 20, 2017 Abstract Violent crime can have a negative impact to the victims and the neighborhoods. It can affect people

More information

CS246 Final Exam, Winter 2011

CS246 Final Exam, Winter 2011 CS246 Final Exam, Winter 2011 1. Your name and student ID. Name:... Student ID:... 2. I agree to comply with Stanford Honor Code. Signature:... 3. There should be 17 numbered pages in this exam (including

More information

2013 PROBATION SURVEY

2013 PROBATION SURVEY This document is made available electronically by the Minnesota Legislative Reference Library as part of an ongoing digital archiving project. http://www.leg.state.mn.us/lrl/lrl.asp 2013 PROBATION SURVEY

More information

MEMPHIS POLICE DEPARTMENT

MEMPHIS POLICE DEPARTMENT MEMPHIS POLICE DEPARTMENT Sentinel Camera Project City Council Districts (Data Dates Jan. 1 Dec. 31, 2016) District 1 - Councilman Bill Morrison District 2 - Councilman Frank Colvett District 3 Councilwoman

More information

CRIME IN VIRGINIA. Virginia Uniform Crime Reporting Program JANUARY-DECEMBER, 2010

CRIME IN VIRGINIA. Virginia Uniform Crime Reporting Program JANUARY-DECEMBER, 2010 CRIME IN VIRGINIA JANUARY-DECEMBER, 2010 Virginia Uniform Crime Reporting Program DEPARTMENT OF STATE POLICE Colonel W. Steven Flaherty Superintendent RICHMOND i ii Copyright c 2011 Virginia Foreword Crime

More information

Date: Wed 12 July 2017 Reference Number: FOI Category: Stats - Crime

Date: Wed 12 July 2017 Reference Number: FOI Category: Stats - Crime Date: Wed 12 July 2017 Reference Number: 20171145 FOI Category: Stats - Crime Title: Hate Crimes Request Date: Friday, 23 June, 2017 Response Date: Wednesday, 12 July, 2017 Request Details: Q1: The total

More information

Driving Forces of Houston s Burglary Hotspots During Hurricane Rita

Driving Forces of Houston s Burglary Hotspots During Hurricane Rita Driving Forces of Houston s Burglary Hotspots During Hurricane Rita Marco Helbich Department of Geography University of Heidelberg Heidelberg, Germany & Michael Leitner Department of Geography and Anthropology

More information

Behavioral Data Mining. Lecture 2

Behavioral Data Mining. Lecture 2 Behavioral Data Mining Lecture 2 Autonomy Corp Bayes Theorem Bayes Theorem P(A B) = probability of A given that B is true. P(A B) = P(B A)P(A) P(B) In practice we are most interested in dealing with events

More information

TEXAS WESLEYAN UNIVERSITY CAMPUS SECURITY AND FIRE SAFETY ANNUAL REPORT January 1, 2010 through December 31, 2010

TEXAS WESLEYAN UNIVERSITY CAMPUS SECURITY AND FIRE SAFETY ANNUAL REPORT January 1, 2010 through December 31, 2010 TEXAS WESLEYAN UNIVERSITY CAMPUS SECURITY AND FIRE SAFETY ANNUAL REPORT January 1, 2010 through December 31, 2010 The following information is provided to better inform University students and others.

More information

October, 2017 Property Crimes *All Address Locations Are To The Nearest Block Number*

October, 2017 Property Crimes *All Address Locations Are To The Nearest Block Number* OWOSSO NEIGHBORHOOD CRIME REPORT October, 2017 Property Crimes *All Address Locations Are To The Nearest Block Number* 1200 PENBROOK DR ABANDONED VEHICLE 100 N LANSING ST ABANDONED VEHICLE 1700 E CORUNNA

More information

CRIME IN VIRGINIA. Compiled By Uniform Crime Reporting Section Department of State Police

CRIME IN VIRGINIA. Compiled By Uniform Crime Reporting Section Department of State Police CRIME IN VIRGINIA 2012 Compiled By Uniform Crime Reporting Section Department of CRIME IN VIRGINIA JANUARY-DECEMBER, 2012 Virginia Uniform Crime Reporting Program DEPARTMENT OF STATE POLICE Colonel W.

More information

The figures provided therefore, only stipulate where the requested keywords are present in a summary field of a crime report.

The figures provided therefore, only stipulate where the requested keywords are present in a summary field of a crime report. Freedom of Information Request Reference No: I note you seek access to the following information: Regarding crimes involving dating apps Figures for Facebook as well as Tinder and Grindr in each financial

More information

Crime Prediction using Businesses and Housing Values in San Francisco

Crime Prediction using Businesses and Housing Values in San Francisco Crime Prediction using Businesses and Housing Values in San Francisco James Jung Lee, Joel Kek, Yik Lun Lee Introduction Predictive policing is the idea of using technology and data analytics to proactively

More information

Curriculum Summary 8 th Grade Algebra I

Curriculum Summary 8 th Grade Algebra I Curriculum Summary 8 th Grade Algebra I Students should know and be able to demonstrate mastery in the following skills by the end of Eighth Grade: The Number System Extend the properties of exponents

More information

The CrimeStat Program: Characteristics, Use, and Audience

The CrimeStat Program: Characteristics, Use, and Audience The CrimeStat Program: Characteristics, Use, and Audience Ned Levine, PhD Ned Levine & Associates and Houston-Galveston Area Council Houston, TX In the paper and presentation, I will discuss the CrimeStat

More information

A Reservoir Sampling Algorithm with Adaptive Estimation of Conditional Expectation

A Reservoir Sampling Algorithm with Adaptive Estimation of Conditional Expectation A Reservoir Sampling Algorithm with Adaptive Estimation of Conditional Expectation Vu Malbasa and Slobodan Vucetic Abstract Resource-constrained data mining introduces many constraints when learning from

More information

n n P} is a bounded subset Proof. Let A be a nonempty subset of Z, bounded above. Define the set

n n P} is a bounded subset Proof. Let A be a nonempty subset of Z, bounded above. Define the set 1 Mathematical Induction We assume that the set Z of integers are well defined, and we are familiar with the addition, subtraction, multiplication, and division. In particular, we assume the following

More information

CRIME IN VIRGINIA. Compiled by Uniform Crime Reporting Section Department of State Police

CRIME IN VIRGINIA. Compiled by Uniform Crime Reporting Section Department of State Police CRIME IN VIRGINIA 2006 Compiled by Uniform Crime Reporting Section Department of CRIME IN VIRGINIA JANUARY-DECEMBER, 2006 Virginia Uniform Crime Reporting Program DEPARTMENT OF STATE POLICE Colonel W.

More information

CS6220: DATA MINING TECHNIQUES

CS6220: DATA MINING TECHNIQUES CS6220: DATA MINING TECHNIQUES Mining Graph/Network Data Instructor: Yizhou Sun yzsun@ccs.neu.edu March 16, 2016 Methods to Learn Classification Clustering Frequent Pattern Mining Matrix Data Decision

More information

Juvenile Justice Information System

Juvenile Justice Information System Juvenile Justice Information System Data & Evaluation Reports Dispositions () JJIS Steering Committee JJIS Data & Evaluation Sub-Committee JJIS A Shared Information System Oregon Youth Authority Oregon

More information

Problem Set 4. General Instructions

Problem Set 4. General Instructions CS224W: Analysis of Networks Fall 2017 Problem Set 4 General Instructions Due 11:59pm PDT November 30, 2017 These questions require thought, but do not require long answers. Please be as concise as possible.

More information

What is Statistics? Statistics is the science of understanding data and of making decisions in the face of variability and uncertainty.

What is Statistics? Statistics is the science of understanding data and of making decisions in the face of variability and uncertainty. What is Statistics? Statistics is the science of understanding data and of making decisions in the face of variability and uncertainty. Statistics is a field of study concerned with the data collection,

More information

CS570 Data Mining. Anomaly Detection. Li Xiong. Slide credits: Tan, Steinbach, Kumar Jiawei Han and Micheline Kamber.

CS570 Data Mining. Anomaly Detection. Li Xiong. Slide credits: Tan, Steinbach, Kumar Jiawei Han and Micheline Kamber. CS570 Data Mining Anomaly Detection Li Xiong Slide credits: Tan, Steinbach, Kumar Jiawei Han and Micheline Kamber April 3, 2011 1 Anomaly Detection Anomaly is a pattern in the data that does not conform

More information

Anomaly Detection. Jing Gao. SUNY Buffalo

Anomaly Detection. Jing Gao. SUNY Buffalo Anomaly Detection Jing Gao SUNY Buffalo 1 Anomaly Detection Anomalies the set of objects are considerably dissimilar from the remainder of the data occur relatively infrequently when they do occur, their

More information

Tools for Analyzing NIBRS Data and Tips for Merging NIBRS Data with Publicly Available Datasets

Tools for Analyzing NIBRS Data and Tips for Merging NIBRS Data with Publicly Available Datasets Tools for Analyzing NIBRS Data and Tips for Merging NIBRS Data with Publicly Available Datasets Scott Came Executive Director Becki Goggins Director, Law and Policy Program 2016 Association of State Uniform

More information

1995 Marijuana Sales Arrests by Race

1995 Marijuana Sales Arrests by Race Virginia Metropolitan Area Counties 1995 Marijuana Sales Arrests by Race (Data Source: Uniform Crime Reports. See below for notes.) ALL BLACK WHITE AM. INDIAN ASIAN-PAC. METROPOLITAN AREA JURISDICTION

More information

The Chi-Square Distributions

The Chi-Square Distributions MATH 03 The Chi-Square Distributions Dr. Neal, Spring 009 The chi-square distributions can be used in statistics to analyze the standard deviation of a normally distributed measurement and to test the

More information

CRISP: Capture-Recapture Interactive Simulation Package

CRISP: Capture-Recapture Interactive Simulation Package CRISP: Capture-Recapture Interactive Simulation Package George Volichenko Carnegie Mellon University Pittsburgh, PA gvoliche@andrew.cmu.edu December 17, 2012 Contents 1 Executive Summary 1 2 Introduction

More information

Finding the Gold in Your Data

Finding the Gold in Your Data Finding the Gold in Your Data An introduction to Data Mining Originally presented @ SAS Global Forum David A. Dickey North Carolina State University Decision Trees A divisive method (splits) Start with

More information

On Top-k Structural. Similarity Search. Pei Lee, Laks V.S. Lakshmanan University of British Columbia Vancouver, BC, Canada

On Top-k Structural. Similarity Search. Pei Lee, Laks V.S. Lakshmanan University of British Columbia Vancouver, BC, Canada On Top-k Structural 1 Similarity Search Pei Lee, Laks V.S. Lakshmanan University of British Columbia Vancouver, BC, Canada Jeffrey Xu Yu Chinese University of Hong Kong Hong Kong, China 2014/10/14 Pei

More information

*Final Exam* EECS 203 Spring 2015

*Final Exam* EECS 203 Spring 2015 *Final Exam* EECS 203 Spring 2015 Name (Print): uniqname (Print): Instructions. You have 120 minutes to complete this exam. You may have one page of notes (8.5x11.5 two-sided) but may not use any other

More information

Manual of Crime Analysis Map Production

Manual of Crime Analysis Map Production Manual of Crime Analysis Map Production Manual of Crime Analysis Map Production Written by: Mary Velasco Research Associate, Crime Mapping Laboratory and Rachel Boba, Ph.D. Director, Crime Mapping Laboratory

More information

2014 PROBATION SURVEY

2014 PROBATION SURVEY This document is made available electronically by the Minnesota Legislative Reference Library as part of an ongoing digital archiving project. http://www.leg.state.mn.us/lrl/lrl.asp 2014 PROBATION SURVEY

More information

An Optimized Interestingness Hotspot Discovery Framework for Large Gridded Spatio-temporal Datasets

An Optimized Interestingness Hotspot Discovery Framework for Large Gridded Spatio-temporal Datasets IEEE Big Data 2015 Big Data in Geosciences Workshop An Optimized Interestingness Hotspot Discovery Framework for Large Gridded Spatio-temporal Datasets Fatih Akdag and Christoph F. Eick Department of Computer

More information

CS6220: DATA MINING TECHNIQUES

CS6220: DATA MINING TECHNIQUES CS6220: DATA MINING TECHNIQUES Matrix Data: Prediction Instructor: Yizhou Sun yzsun@ccs.neu.edu September 21, 2015 Announcements TA Monisha s office hour has changed to Thursdays 10-12pm, 462WVH (the same

More information

Final Exam, Machine Learning, Spring 2009

Final Exam, Machine Learning, Spring 2009 Name: Andrew ID: Final Exam, 10701 Machine Learning, Spring 2009 - The exam is open-book, open-notes, no electronics other than calculators. - The maximum possible score on this exam is 100. You have 3

More information

The Chi-Square Distributions

The Chi-Square Distributions MATH 183 The Chi-Square Distributions Dr. Neal, WKU The chi-square distributions can be used in statistics to analyze the standard deviation σ of a normally distributed measurement and to test the goodness

More information

CME323 Distributed Algorithms and Optimization. GloVe on Spark. Alex Adamson SUNet ID: aadamson. June 6, 2016

CME323 Distributed Algorithms and Optimization. GloVe on Spark. Alex Adamson SUNet ID: aadamson. June 6, 2016 GloVe on Spark Alex Adamson SUNet ID: aadamson June 6, 2016 Introduction Pennington et al. proposes a novel word representation algorithm called GloVe (Global Vectors for Word Representation) that synthesizes

More information

Contents. Acknowledgments. xix

Contents. Acknowledgments. xix Table of Preface Acknowledgments page xv xix 1 Introduction 1 The Role of the Computer in Data Analysis 1 Statistics: Descriptive and Inferential 2 Variables and Constants 3 The Measurement of Variables

More information

Capacity Releasing Diffusion for Speed and Locality

Capacity Releasing Diffusion for Speed and Locality 000 00 002 003 004 005 006 007 008 009 00 0 02 03 04 05 06 07 08 09 020 02 022 023 024 025 026 027 028 029 030 03 032 033 034 035 036 037 038 039 040 04 042 043 044 045 046 047 048 049 050 05 052 053 054

More information

Table of Contents Introductory Letter to Mayor Holly C. Brinda Elyria Police Department 2011 Roster of Personnel Distribution of Personnel

Table of Contents Introductory Letter to Mayor Holly C. Brinda Elyria Police Department 2011 Roster of Personnel Distribution of Personnel Table of Contents Introductory Letter to Mayor Holly C. Brinda 1 Elyria Police Department 2011 Roster of Personnel 2 Distribution of Personnel 4 Organizational Structure 5 Comparison of Selected Categories

More information

Cutting Graphs, Personal PageRank and Spilling Paint

Cutting Graphs, Personal PageRank and Spilling Paint Graphs and Networks Lecture 11 Cutting Graphs, Personal PageRank and Spilling Paint Daniel A. Spielman October 3, 2013 11.1 Disclaimer These notes are not necessarily an accurate representation of what

More information

Introduction to Machine Learning CMU-10701

Introduction to Machine Learning CMU-10701 Introduction to Machine Learning CMU-10701 23. Decision Trees Barnabás Póczos Contents Decision Trees: Definition + Motivation Algorithm for Learning Decision Trees Entropy, Mutual Information, Information

More information

Style Insights DISC, English version 2006.g

Style Insights DISC, English version 2006.g To: From:. Style Insights DISC, English version 2006.g Bill Bonnstetter Target Training International, Ltd. www.documentingexcellence.com Date: 12 May 2006 www.documentingexcellence.com 445 S. Julian St,

More information

2.4. Conditional Probability

2.4. Conditional Probability 2.4. Conditional Probability Objectives. Definition of conditional probability and multiplication rule Total probability Bayes Theorem Example 2.4.1. (#46 p.80 textbook) Suppose an individual is randomly

More information

Data Mining Techniques

Data Mining Techniques Data Mining Techniques CS 622 - Section 2 - Spring 27 Pre-final Review Jan-Willem van de Meent Feedback Feedback https://goo.gl/er7eo8 (also posted on Piazza) Also, please fill out your TRACE evaluations!

More information

EICCARS WORKBOOK FOR INITIATING COMMUNITY PROFILES WITH STATISTICAL AND OTHER SECONDARY DATA

EICCARS WORKBOOK FOR INITIATING COMMUNITY PROFILES WITH STATISTICAL AND OTHER SECONDARY DATA The Cultural Ecology of Health and Change (CEHC) Working Papers Ethnographically Informed Community and Cultural Assessment Research Systems (EICCARS) Workbooks EICCARS WORKBOOK FOR INITIATING COMMUNITY

More information

Professionalism Integrity Respect Compassion

Professionalism Integrity Respect Compassion Police Department MONTHLY REPORT April 2016 Professionalism Integrity Respect Compassion Inside This Issue Department Highlights 1-3 Detective 4 At-a-Glance Statistics Statistics 5-11 1 Police DEPARTMENT

More information

Using GIS to Explore the Relationship between Socioeconomic Status and Demographic Variables and Crime in Pittsburgh, Pennsylvania

Using GIS to Explore the Relationship between Socioeconomic Status and Demographic Variables and Crime in Pittsburgh, Pennsylvania Using GIS to Explore the Relationship between Socioeconomic Status and Demographic Variables and Crime in Pittsburgh, Pennsylvania Stephen E. Mitchell Department of Resource Analysis, Saint Mary s University

More information

CRIME IN VIRGINIA. Compiled By UNIFORM CRIME REPORTING SECTION DEPARTMENT OF STATE POLICE

CRIME IN VIRGINIA. Compiled By UNIFORM CRIME REPORTING SECTION DEPARTMENT OF STATE POLICE 1 9 9 9 CRIME IN VIRGINIA Compiled By UNIFORM CRIME REPORTING SECTION DEPARTMENT OF STATE POLICE CRIME IN VIRGINIA JANUARY-DECEMBER, 1999 Virginia Uniform Crime Reporting Program DEPARTMENT OF STATE POLICE

More information

Supporting Statistical Hypothesis Testing Over Graphs

Supporting Statistical Hypothesis Testing Over Graphs Supporting Statistical Hypothesis Testing Over Graphs Jennifer Neville Departments of Computer Science and Statistics Purdue University (joint work with Tina Eliassi-Rad, Brian Gallagher, Sergey Kirshner,

More information

Indicator: Proportion of the rural population who live within 2 km of an all-season road

Indicator: Proportion of the rural population who live within 2 km of an all-season road Goal: 9 Build resilient infrastructure, promote inclusive and sustainable industrialization and foster innovation Target: 9.1 Develop quality, reliable, sustainable and resilient infrastructure, including

More information

Large-Scale Feature Learning with Spike-and-Slab Sparse Coding

Large-Scale Feature Learning with Spike-and-Slab Sparse Coding Large-Scale Feature Learning with Spike-and-Slab Sparse Coding Ian J. Goodfellow, Aaron Courville, Yoshua Bengio ICML 2012 Presented by Xin Yuan January 17, 2013 1 Outline Contributions Spike-and-Slab

More information

CSCE 750 Final Exam Answer Key Wednesday December 7, 2005

CSCE 750 Final Exam Answer Key Wednesday December 7, 2005 CSCE 750 Final Exam Answer Key Wednesday December 7, 2005 Do all problems. Put your answers on blank paper or in a test booklet. There are 00 points total in the exam. You have 80 minutes. Please note

More information

Crime Analysis. GIS Solutions for Intelligence-Led Policing

Crime Analysis. GIS Solutions for Intelligence-Led Policing Crime Analysis GIS Solutions for Intelligence-Led Policing Applying GIS Technology to Crime Analysis Know Your Community Analyze Your Crime Use Your Advantage GIS aids crime analysis by Identifying and

More information

Area-Specific Crime Prediction Models

Area-Specific Crime Prediction Models Area-Specific Crime Prediction Models Mohammad Al Boni and Matthew S. Gerber Department of Systems and Information Engineering, University of Virginia, Charlottesville, Virginia, USA Email: {ma2sm, msg8u}@virginia.edu

More information

AIM HIGH SCHOOL. Curriculum Map W. 12 Mile Road Farmington Hills, MI (248)

AIM HIGH SCHOOL. Curriculum Map W. 12 Mile Road Farmington Hills, MI (248) AIM HIGH SCHOOL Curriculum Map 2923 W. 12 Mile Road Farmington Hills, MI 48334 (248) 702-6922 www.aimhighschool.com COURSE TITLE: Statistics DESCRIPTION OF COURSE: PREREQUISITES: Algebra 2 Students will

More information

Encounters Leading to Detentions. September 2018

Encounters Leading to Detentions. September 2018 Council Briefing Encounters Leading to Detentions September 2018 Standardization Tracking all detentions regardless of outcome-previous standard were selfreported not leading to an arrest. Standardize

More information

The Central Limit Theorem

The Central Limit Theorem - The Central Limit Theorem Definition Sampling Distribution of the Mean the probability distribution of sample means, with all samples having the same sample size n. (In general, the sampling distribution

More information

SL - Binomial Questions

SL - Binomial Questions IB Questionbank Maths SL SL - Binomial Questions 262 min 244 marks 1. A random variable X is distributed normally with mean 450 and standard deviation 20. Find P(X 475). Given that P(X > a) = 0.27, find

More information

FINAL EXAM: FALL 2013 CS 6375 INSTRUCTOR: VIBHAV GOGATE

FINAL EXAM: FALL 2013 CS 6375 INSTRUCTOR: VIBHAV GOGATE FINAL EXAM: FALL 2013 CS 6375 INSTRUCTOR: VIBHAV GOGATE You are allowed a two-page cheat sheet. You are also allowed to use a calculator. Answer the questions in the spaces provided on the question sheets.

More information

Randomized Algorithms

Randomized Algorithms Randomized Algorithms Prof. Tapio Elomaa tapio.elomaa@tut.fi Course Basics A new 4 credit unit course Part of Theoretical Computer Science courses at the Department of Mathematics There will be 4 hours

More information

CS 70 Discrete Mathematics and Probability Theory Fall 2016 Seshia and Walrand Midterm 1 Solutions

CS 70 Discrete Mathematics and Probability Theory Fall 2016 Seshia and Walrand Midterm 1 Solutions CS 70 Discrete Mathematics and Probability Theory Fall 2016 Seshia and Walrand Midterm 1 Solutions PRINT Your Name: Answer: Oski Bear SIGN Your Name: PRINT Your Student ID: CIRCLE your exam room: Dwinelle

More information

An Analysis of College Algebra Exam Scores December 14, James D Jones Math Section 01

An Analysis of College Algebra Exam Scores December 14, James D Jones Math Section 01 An Analysis of College Algebra Exam s December, 000 James D Jones Math - Section 0 An Analysis of College Algebra Exam s Introduction Students often complain about a test being too difficult. Are there

More information

Chapter. Probability

Chapter. Probability Chapter 3 Probability Section 3.1 Basic Concepts of Probability Section 3.1 Objectives Identify the sample space of a probability experiment Identify simple events Use the Fundamental Counting Principle

More information

Data Mining. CS57300 Purdue University. Bruno Ribeiro. February 8, 2018

Data Mining. CS57300 Purdue University. Bruno Ribeiro. February 8, 2018 Data Mining CS57300 Purdue University Bruno Ribeiro February 8, 2018 Decision trees Why Trees? interpretable/intuitive, popular in medical applications because they mimic the way a doctor thinks model

More information

UVA CS 4501: Machine Learning

UVA CS 4501: Machine Learning UVA CS 4501: Machine Learning Lecture 21: Decision Tree / Random Forest / Ensemble Dr. Yanjun Qi University of Virginia Department of Computer Science Where are we? è Five major sections of this course

More information

Recommendation Systems

Recommendation Systems Recommendation Systems Pawan Goyal CSE, IITKGP October 21, 2014 Pawan Goyal (IIT Kharagpur) Recommendation Systems October 21, 2014 1 / 52 Recommendation System? Pawan Goyal (IIT Kharagpur) Recommendation

More information

Modularity and Graph Algorithms

Modularity and Graph Algorithms Modularity and Graph Algorithms David Bader Georgia Institute of Technology Joe McCloskey National Security Agency 12 July 2010 1 Outline Modularity Optimization and the Clauset, Newman, and Moore Algorithm

More information

i) 83 (including 2017) ii) 10 iii) 18 iv) Please see the table below

i) 83 (including 2017) ii) 10 iii) 18 iv) Please see the table below Uned Rhyddid Gwybodaeth / Freedom of Information Unit Response Date: 07/11/2017 2017/953 Juvenile arrests In response to your recent request for information, we have extracted the juveniles arrested in

More information

Text Mining. Dr. Yanjun Li. Associate Professor. Department of Computer and Information Sciences Fordham University

Text Mining. Dr. Yanjun Li. Associate Professor. Department of Computer and Information Sciences Fordham University Text Mining Dr. Yanjun Li Associate Professor Department of Computer and Information Sciences Fordham University Outline Introduction: Data Mining Part One: Text Mining Part Two: Preprocessing Text Data

More information

The University of Texas at Austin Department of Electrical and Computer Engineering. EE381V: Large Scale Learning Spring 2013.

The University of Texas at Austin Department of Electrical and Computer Engineering. EE381V: Large Scale Learning Spring 2013. The University of Texas at Austin Department of Electrical and Computer Engineering EE381V: Large Scale Learning Spring 2013 Assignment 1 Caramanis/Sanghavi Due: Thursday, Feb. 7, 2013. (Problems 1 and

More information

Mining of Massive Datasets Jure Leskovec, AnandRajaraman, Jeff Ullman Stanford University

Mining of Massive Datasets Jure Leskovec, AnandRajaraman, Jeff Ullman Stanford University Note to other teachers and users of these slides: We would be delighted if you found this our material useful in giving your own lectures. Feel free to use these slides verbatim, or to modify them to fit

More information

Midterm Exam, Spring 2005

Midterm Exam, Spring 2005 10-701 Midterm Exam, Spring 2005 1. Write your name and your email address below. Name: Email address: 2. There should be 15 numbered pages in this exam (including this cover sheet). 3. Write your name

More information

9/2/2010. Wildlife Management is a very quantitative field of study. throughout this course and throughout your career.

9/2/2010. Wildlife Management is a very quantitative field of study. throughout this course and throughout your career. Introduction to Data and Analysis Wildlife Management is a very quantitative field of study Results from studies will be used throughout this course and throughout your career. Sampling design influences

More information

Introduction to Machine Learning Midterm Exam Solutions

Introduction to Machine Learning Midterm Exam Solutions 10-701 Introduction to Machine Learning Midterm Exam Solutions Instructors: Eric Xing, Ziv Bar-Joseph 17 November, 2015 There are 11 questions, for a total of 100 points. This exam is open book, open notes,

More information

Appendix Table 1. Predictive Power of the Pre-Game Point Spread versus the Halftime Point Spread.

Appendix Table 1. Predictive Power of the Pre-Game Point Spread versus the Halftime Point Spread. Appendix Table 1. Predictive Power of the Pre-Game Point Spread versus the Halftime Point Spread. Probit Regression Dependent Variable = Win (1) (2) (3) (4) (5) (6) Spread -.081 -.057 (.004) (.004) [-.033]

More information