Creating Virtual User Populations by Analysis of Anthropometric Data

Size: px
Start display at page:

Download "Creating Virtual User Populations by Analysis of Anthropometric Data"

Transcription

1 Creating Virtual User Populations by Analysis of Anthropometric Data Matthew B. Parkinson Engineering Design and Mechanical Engineering, Penn State University, University Park, PA 16802, USA Matthew P. Reed University of Michigan Transportation Research Institute, Ann Arbor, MI 48109, USA Abstract Anthropometric measures that accurately represent the target user population are essential to effective designing for human variability, yet the available anthropometric data are drawn from populations that are substantially dissimilar from the target populations for most products and environments. Many designs for civilians are assessed using military anthropometric data, for example, because the only available data with sufficient detail were gathered from soldiers. Recent studies have collected detailed information from civilians, but these populations are also not representative of typical user populations. This paper presents a new statistical method for applying the available anthropometric data to estimate distributions of anthropometric data for a target population. For many design problems, the target population can be characterized by overall demographic information, such as the gender ratio, age distribution, nationality, and ethnicity. Principal component analysis and linear regression are used to create statistical models that can draw on information from two or more databases to estimate values for a large range of anthropometric variables. The approach matches the race/ethnicity distribution of the target user population and incorporates a stochastic component so that all relevant variance is retained. Finally, it generates populations of virtual users rather than summary statistics for individual variables. Summary statistics from the virtual populations are shown to be reasonable approximations for the actual measures. This work has application to a wide range of design problems, including the design of workspaces, vehicles, and medical devices. Key words: anthropometry, principal components analysis, human variability, spatial analyses Relevance to industry Accurate estimates of target user anthropometry are essential to spatial analyses in artifact and task design. This paper presents a methodology for using available data to synthesize new populations that better represent target users. Application of these methods in the design and assessment of products and environments will produce more accommodating designs that promote better performance and customer acceptance. Introduction Effective design for human variability requires estimates of the distribution of user characteristics. For many design problems, body dimensions are critically important determinants of the safety and performance of the design. Consequently, anthropometric data are referenced in the design and evaluation of many products, and anthropometric analyses are central to ergonomic analyses of workstations. Corresponding author addresses: parkinson@psu.edu (Matthew B. Parkinson), mreed@umich.edu (Matthew P. Reed) The most common methods for applying anthropometric data to assessment of a product or workspace design have remained unchanged for decades (Roebuck, 1995). The ergonomist, designer, or engineer identifies human body dimensions that are related to the design problem. For example, seated hip breadth is related to the required width for a chair. The designer will then search for information on the selected anthropometric measures for the user population in a reference text or human factors handbook (Jung et al., 2009; Salvendy, 2006; HFES 300 Committee, 2004; Peacock and Karwowski, 1993). For the chair design problem, the designer might identify 95 th - percentile hip breadth for U.S. adult female office workers as the desired value and attempt to find that value in a handbook such as Humanscale 123 (Diffrient et al., 1981). This look-up table approach to applying anthropometric data has many limitations. The most critical issue is the assumption that the underlying database accurately represents the target user population. Unfortunately this is typically not the case. The sources of tabulated data are often not thoroughly specified, the sampled population is different from the target user population, or the Preprint submitted to International Journal of Industrial Ergonomics June 13, 2009

2 probability density function ANSUR CAESAR NHANES BMI Figure 1: The body mass index (BMI) distributions for women in two commonly used databases (ANSUR and CAESAR) and the general U.S. population (NHANES, ). data were gathered decades prior to the time of application. Anthropometric design guides are often based on military studies, which have historically been the most detailed anthropometric studies. For example, the Business and Institutional Furniture Manufacturers Association (BIFMA) anthropometric design guidelines for office chairs and workstations (BIFMA, 2002) makes extensive use of data from the 1988 U.S. Army anthropometric survey known as ANSUR (Gordon et al., 1989). The published ANSUR data, with over 100 anthropometric variables measured on over 4000 subjects, provide an excellent reference on body dimensions for the U.S. Army as of the late 1980s, but the ethnicity, age, and fitness of the ANSUR population is quite different from the general U.S. population or the likely user population for any civilian product or workspace. The published ANSUR data are even inappropriate for current or future military vehicles or equipment, because the U.S. military population has changed since 1988 (Lockett et al., 2005). Figure 1 illustrates the differences in distributions of body mass index (BMI) a measure of weight-for-stature calculated as body weight in kg divided by stature in meters square in women within two commonly used databases and the general U.S. population. Anthropometric surveys are expensive and timeconsuming, so it is rare for a designer to have access to anthropometric data drawn from the specific user population of interest. However, other reasonably accurate descriptions of the user population might be available. For example, ergonomists designing assembly workstations may have good distributional information on the gender, age, and race/ethnicity of the worker population. Similar demographic data for the target users of a consumer product might also be available from market research. The two standard approaches for modifying an existing dataset to be representative of a desired population are downsampling and weighting. The published ANSUR data are from a subset of 1774 men and 2208 women down- 2 sampled from the nearly 9000 men and women originally measured (Gordon et al., 1989). The subset was chosen such that the composition of the subset matched the characteristics of the Army on age and race/ethnicity. This is a powerful approach, provided that a large enough raw sample is available for each of the demographic subgroups. The second major approach assumes that each individual in the dataset is randomly selected from among those individuals in the target population who have the same demographic characteristics. A sampling weight is assigned to each individual, representing the fraction of the target population matching this individual s characteristics. Methodologies for weighting survey data are well developed and are applied in many domains, from crash statistics (National Center for Statistics and Analysis, 2007) to health surveys (U.S. Centers for Disease Control and Prevention, 2008). The best available statistics on the basic anthropometric characteristics of people in the U.S. are gathered as part of the National Health and Nutrition Examination Survey (NHANES), which uses U.S. Census data to compute sampling weights for each sampled person (U.S. Centers for Disease Control and Prevention, 2008). The NHANES data have been used to weight detailed anthropometric data from the Civilian American and European Anthropometric Resource (CAESAR), a survey incorporating traditional and body-scanning techniques, to be better representative of the U.S. population (Harrison and Robinette, 2002; Robinette et al., 2002). One version of the weighting procedure involves the creation of regression models relating descriptors such as stature to specific measures of interest such as buttock-popliteal length or sitting height. Secular and other changes in the population which are captured in the descriptors can then be used to extrapolate new values for the specific anthropometry (Drillis and Contini, 1966; Kinghorn and Bittner, 1995; Fromuth and Parkinson, 2008). More complex models such as neural networks have also been used (Dursun Kaya et al., 2003). A parametric approach to population weighting has been applied by Flannagan et al. (1998) in the development of functional human performance models. A linear regression predicting the desired dependent measure from an overall human dimension (usually stature) is convolved with a normal distribution representing the single-gender stature distribution of the target population. This approach is generalized from the univariate case to multivariate prediction as part of the current work. This paper presents a new methodology for using available demographic information to re-analyze existing anthropometric datasets to produce a new, synthesized dataset that better represents the target user population. A two-phase analysis is conducted that combines data from large-scale surveys with only a few variables (e.g., NHANES) with data from detailed studies (e.g., ANSUR or CAESAR). The method is fundamentally a weighting procedure, but it exploits the correlations among anthropometric measures to produce better estimates of the

3 distributions of variables than are obtained using typical weighting procedures. Methods Overview The methodology (Figure 2) obtains accurate estimates of the distributions of anthropometric variables for a target user population by synthesizing a virtual user population, each member of which is defined by a vector of attributes (gender, race/ethnicity, age, etc.) and the anthropometric variables of interest. An important consideration is that the subgroups of the user population will tend to have different relationships between individual body dimensions and overall dimensions. For example, the relationship between hip breadth and body weight is different for men and women. Ideally, a separate analysis would be conducted for each cell in the demographic matrix, but, in practice, sufficient data are available only for separate analyses by gender. Consequently, a method is needed to make the aggregate detailed database as representative as possible of the target user population with respect to demographic variables. First, distributional information is obtained for the target user population on as many variables as possible. Usually these data are categorical (gender, race/ethnicity, geographic region, occupational category, etc.) or continuous variables binned to be discrete (e.g., decile age bands). Ideally these data are crossed (the percentage of women separately for each age group, for example), but the method can proceed with only the marginal distributions. A large database is used to characterize basic anthropometric measures for each user category. For products and workspaces intended for users in the U.S., NHANES provides the best available data on stature (erect standing height), age, and body weight. Separately, the covariance structure of at least one detailed database such as ANSUR is analyzed using principal component analysis (PCA). PCA expresses the anthropometric measures Y in a new orthonormal basis (Jolliffe, 2004). Linear regression analysis is applied to predict the principal component scores (coordinates relative to the new basis) as a function of the overall body dimensions obtained from the large database. The regression models are used in a stochastic convolution procedure to convert estimates of the distributions of overall anthropometric measures for subgroups into distributions of principal component scores. The principal component transformation is then inverted to obtain a complete set of predicted anthropometric variables for an individual in the target population. The stochastic sampling procedure is repeated until the desired number of individuals in each subgroup has been generated to achieve the appropriate distribution on the original categorical variables. 3 Building a PCA Model Principal components analysis is used to establish the relationships between the selected anthropometric variables from the detailed database and overall body dimensions. The number of basis components is equivalent to the number of factors (e.g., anthropometric measures such as stature and arm length), but the method is most effective when most of the variability in the data is accounted for by the first few components. These characteristics make it a useful technique for modeling anthropometric data where the many measures are strongly correlated with a few predictors such as stature and body mass index. For a review of PCA see Jolliffe (2004) or see Lockett et al. (2005); Nicolay and Walker (2005) for anthropometric applications. The method begins with the analysis of a detailed anthropometric database U containing detailed measures for N U individuals. For target populations within the U.S., the database might be ANSUR or CAESAR, for example. The chosen database should be as close as possible to the target population with respect to distributions of age, ethnicity, and other variables, but in practice only a few detailed databases are available. The matrix U consists of X U and Y U, matrices corresponding to the predictors and measures of interest within the database. Consider the scenario in which stature and BMI are predictors: the N U stature and BMI pairs within U and a column of ones would comprise the N U j (where j = 3) predictor matrix X U. Similarly, Y U is the N U k matrix of detailed anthropometry for the k measures of each individual in the database. The detailed anthropometric data are centered by subtracting their mean Ŷ U = Y U Y U (1) The rows of Ŷ U are sorted into bins defined by the demographic characterization of the target user population (for example, gender age race). The data within each bin are then upsampled (a randomly selected row is duplicated) until the percentages across the bins are equivalent to those in the target population. This strategy is adopted rather than a downsampling procedure in order to make use of all individuals in the detailed dataset. This step is important because the correlation among anthropometric variables differs with age, race/etnicity, and other demographic variables. For each gender, k (e.g., 40) anthropometric measures of interest within the upsampled detailed data Ŷ U are analyzed using PCA. This determines P, a k k matrix of coefficients or loadings and Q, the N U k matrix of principal component scores. Each of the columns of Q corresponds to a principal component and each of the components is orthogonal to the others. These columns are sorted in descending order by the amount of variance within the data that they explain. A regression analysis is then conducted to identify the components that are significantly related (p < 0.05) to the overall body dimensions in X U. Typically, the first several components are associated with stature and/or BMI. These (z) significant components are selected for the creation of the predictive model. A reduced matrix of scores Qr is taken to be the first z columns of Q.

4 Target Population demographic data (distributions of gender, age, and ethnicity) synthesized body dimensions Representative Database (e.g., NHANES) - compute distributions of overall dimensions (stature and BMI) by demographic group Detailed Anthropometric Database (e.g., ANSUR) - sort by demographic data - upsample to match target demographics - PCA and regression Predictive Model detailed body dimensions as a function of overall dimensions (stature and BMI) Figure 2: A flowchart depicting the methodology used to obtain the synthesized anthropometry. A transformation matrix M, which relates the predictors to the reduced set of PCA loadings is then M = Q r X U. (2) Because each of the principal components is orthogonal, this is equivalent to performing a least-squares fit on each of the individual components of Qr, i.e., the standard linear regression fit. The residual variance s 2 r in the regression is calculated for each variable. Using M, the Nnew z scoring matrix Qnew, corresponding to arbitrary values of the predictors Xnew, can be calculated: where Qnew = XnewM + E. (3) E = [ɛ 1, ɛ 2,..., ɛ N new ] (4) The error terms ɛ i are normally distributed independent random variables with variance s 2 i. Because the k columns of Qnew are independent, this is equivalent to performing k simple linear regressions. Incorporating the Residual Variance Two people with the same stature and BMI will usually have considerable differences in their other body dimensions. The difference between the actual measures and the deterministic values predicted with the regression equations is called residual variance. The magnitude of the the residual variance is an indicator of the degree to which the synthesized anthropometry of two individuals with the same predictors (e.g., stature and BMI) might be expected to differ. A stochastic component is added to each of the predicted measures to account for the residual variance using a methodology similar to that in (Flannagan et al., 1998; Parkinson and Reed, 2006; Parkinson et al., 2007). For the first z components the stochastic element was incorporated back into the calculation of the component scores Qnew by randomly sampling from a normal distribution with mean = 0 and a standard deviation equal to the square root of the residual variance for the regression fit on that component. The remaining y z principal components are not significantly associated with the predictor variables, yet they 4 represent meaningful variance in the detailed anthropometric data. Rather than approximating these with a multivariate normal distribution, variance on these components is incorporated into the > z columns of Qnew by sampling randomly from the columns i of Q for i > z. Note that these scores are independent. This non-parametric approach reduces reliance on the assumption that the PC scores are normally distributed and helps to avoid unrealistic values when creating a large virtual population. The mapping from PCA-space back to the actual anthropometric measures is performed by multiplying the PCA coefficients by Qnew (the PCA scores for the new predictors) and shifting the results by the means observed in the data Results Ynew = Y U + QnewP. (5) The method is demonstrated by synthesizing the female ANSUR anthropometry from the CAESAR data set. The ANSUR data were collected in the 1980s from a military population that differs in important ways from the CAESAR civilian data collected nearly twenty years later (Figure 1 with BMI comparison). Comparing a synthesized population with known data from a substantially different database provides an opportunity for validating the methodology and examining strengths and weaknesses. The results were compared to the actual ANSUR sample on a range of variables and the effects of variable selection on the outcomes were evaluated. Building the Model To provide a direct comparison to the ANSUR dataset, anthropometry for 2208 women were synthesized the same number as in the true ANSUR dataset. Both the CAESAR and ANSUR populations are young relative to the US population as a whole, so the distribution of age was assumed to be roughly equivalent. However, the distribution of race within the target user population (ANSUR) is substantially different from that found in CAESAR. As proposed in this new methodology, rather than building separate models or using race as a predictor, the CAE- SAR data were sorted by race and upsampled so that the

5 relative distribution across bins matches that in the target population: 51.6% White, 41.8% Black, 2.6% Hispanic, resulting in a population size of 1872 individuals. Following the methodology described above, a statistical analysis of the women in the U.S. CAESAR database U was used to determine the relationships between overall body size descriptors (stature and body mass index) X U and the specific body dimensions of interest, Y U. Body mass index was used in place of body weight as a predictor of overall body size and shape because it is less correlated with stature. The y = 14 measures of interest for the target population are identified in Table 1 and were selected to represent four types of measurements: lengths, widths, circumferences, and functional measures. Additional selection criteria included presence in both data sets and the similarity of measurement methods. Where measures of both the left and right side of the participant were taken for the CAESAR data set, only the value from the right side was used. Two additional measures from CAESAR, stature and mass, were also included stature is a predictor and mass is necessary to calculate the second predictor, BMI. One potential criticism of the use of PCA methods for this type of application is that the columns of the scoring matrix Q depend on the variables selected for Y U. In other words, a model derived from sitting height, knee height, and biacromial breadth data would produce different estimates of sitting height than one derived from sitting height, knee height, and chest circumference. To examine this effect, each of the 14 anthropometric measures was simulated by randomly selecting 7 other measures from the set and constructing Y U for each of 10,000 runs. In each run, PCA was performed on Y U to yield P and Q. The scoring matrix was reduced to Qr based on the number of components that explain a statistically significant (p < 0.05) amount of variance in the data. Across the 140,000 simulations, the number of components required to achieve this level varied from two to five, with z = {2, 3, 4, 5} in {1%, 39%, 56%, 4%} of the simulations, respectively. The residual variance from the least-squares fit of predictors to the two components was retained and added back into predicted scores as outlined above. Synthesizing the Population For this problem two predictors Xnew were considered: stature and BMI. In a typical design problem one might use a different database as the source for the stature and BMI distributions, selected to match the target user population (for example, using NHANES to represent U.S. civilians). For this example the stature and BMI values of the ANSUR population were used directly to focus on the ability of the new methodology to synthesize the relevant anthropometry. Using Eq. 3, the new scoring matrix Qnew, which includes residual variance, was calculated. The synthesized anthropometry, Ynew was calculated using Eq. 5. The synthesized population had Nnew = members, the same number as in the actual ANSUR female population. Each of the 14 measures was simulated 10,000 times using randomly selected sets of 8 total dimensions, producing a distribution of results for each dimension. The 5 th, 50 th, and 95 th -percentile values of each of the y measures were calculated for each of the 10,000 runs and averaged. These values are summarized in Table 1. For comparison the equivalent data from the original ANSUR data are listed along with the differences at each percentile. Table 2 shows the 90% error intervals as both the percent and size (mm) at three commonly used reference locations (5 th, 50 th, and 95 th -percentile). Values greater than an arbitrarily selected criterion of 3% are shown in bold. Discussion and Conclusions The model was accurate in estimating measures of length, breadth, circumference, and even the functional measure, thumbtip reach. Sitting height, which is known to vary as a function of stature across ethnicities, was estimated well using this methodology despite the substantially different ethnicity distributions of the ANSUR and CAESAR populations. This can be attributed to the upsampling procedure in which the model database (CAESAR) was essentially reweighted to match that of the ANSUR data. The hip breadth measure estimates were as a whole slightly too large. Although the procedure was able to account for the differences in the BMI distributions in the two populations, the differences in fitness level or muscle tone may have influenced the results for this variable. Consistent with this hypothesis, the bideltoid breadth measure, which is related to upper-body muscle development, was consistently underapproximated. One final noteworthy difference is in the acromion-radiale length, a measure that should not be affected by muscle development and which is consistently mm too short in the synthesized population. This bias error could be the result of measurement differences in the two data collection procedures from which the data were collected. From the data users perspective, the new method offers several advantages over previous approaches. First, it is explicitly multivariate, capable of synthesizing anthropometry for many individuals within a large population at a time rather than relying on look-up tables or static constants that provide univariate anthropometric information. Second, variability across race/ethnicity groups is incorporated by weighting to match the prescribed distributions in the target population. If there are concerns that other demographics such as age might affect anthropometric ratios, these factors can readily be considered in the same manner. Third, a virtual population of users, each with a complete set of the desired anthropometry, is created. This allows designers the opportunity to compute any statistic of interest, rather than being limited to

6 those in published look-up tables. More importantly, designers can now explore the correlations among measures to determine, for example, how shoulder breadth changes with sitting height in the user population. This can assist in identifying to what degree individuals disaccommodated on one metric are likely to be disaccommodated on another. In situations where the raw synthesized anthropometry might be overwhelming or burdensome, the results can readily be reduced to the traditional look-up table format, tailored to the target user population. New types of analyses can be performed using the methodology. For example, sensitivity analyses can be conducted to determine the effects of ethnicity or age distributions on population anthropometry. One might use the approach to answer questions such as How important is it to know what percentage of automobile assembly workers are Hispanic within 5% if one is interested in knowing the 5 th -percentile thumbtip reach within 10 mm? While the example in the paper is for the U.S. population, any data (e.g., CAESAR s European component) can be used to build the detailed database. This enables other applications such as weighting for national populations (Mokdad and Al-Ansari, 2009; Klamklay et al., 2008) for global products. The current approach differs substantially from previous applications of PCA for anthropometric data analysis, which have typically focused on the identification of a small number of boundary individuals (e.g., Lockett et al., 2005). The method uses PCA to facilitate the reweighting via regression of the detailed data by rendering it in an orthonormal basis. The result is a new population that preserves the covariance structure of the original while being shifted and scaled to represent the target distributions of key variables (stature and BMI in the example). The performance of the method rests on several assumptions. First, the correlation among measures is assumed to be the same by subgroup for the target population, representative population from which the predictors for the synthesized anthropometry are obtained (e.g., NHANES), and detailed population used for PCA (e.g., ANSUR or CAESAR). Similarly, the available anthro measures (typically stature and BMI) from the representative data are assumed to be sufficient. The methodology readily incorporates any available measures, but availability of data is often restricted to stature and body weight (from which BMI can be calculated). Fortunately these two measures have been shown to account for a large fraction of the variance in a typical diverse population (Manary et al., 1999). The PCA results are affected by the variables that are included, but the simulations presented here demonstrate that good accuracy can be obtained if the selected measures are reasonably chosen. For example, adding a large number of circumference measures to an analysis primarily focused on length dimensions would be inappropriate. Although the results for this case study show that the method produces accurate results for a realistic test case, 6 the complexity of the method and the numerous subjective choices to be made preclude a general error analysis. The overall accuracy rests on the reliability of a number of assumptions, including that the user population characteristics are accurately known. The PCA and regression models assume linear relationships in the underlying data, which is generally supported by anthropometric data. For some analyses focusing on measures related to body weight, modeling the square root of circumferences, or using the log transform of BMI, may produce slightly better results. The convolution process for incorporating residual variance in the first few principal component estimates assumes normality in the residuals, which is well supported by the data. Additional research is needed to determine when separate analyses should be conducted for subgroups that may differ in the relationships among anthropometric measures. The limiting factor is the number of representatives of the subgroup in the detailed database. Using ANSUR, a separate analysis for African-American women might be feasible (approximately 500 individuals), but insufficient data are available for Hispanic women. The overall importance of subgroup models is influenced by the representation of the subgroup in the target population. A more-detailed modeling of a subgroup that is only a small fraction of the target population will have a minimal effect on the resulting design. Acknowledgements The authors acknowledge Dr. Carol Flannagan of UMTRI for her insights into anthropometric analysis methods. This work was supported by the National Science Foundation under grant #

7 References BIFMA, Apr Ergonomics guideline for vdt (visual display terminal) furniture used in office work spaces. Clauser, C., McConville, J., Young, J., Weight, volume and center of mass of segments of the human body. AMRL-TR-69-70, Wright Patterson Air Force Base. Diffrient, N., Tilley, A., Bardagjy, J., Humanscale. The MIT Press. Drillis, R., Contini, R., Body segment parameters. Office of Vocational Rehabilitation Engineering & Science, New York, cited in Clauser et al. (1969). Dursun Kaya, M., Samet Hasiloglu, A., Bayramoglu, M., Yesilyurt, H., Fahri Ozok, A., A new approach to estimate anthropometric measurements by adaptive neuro-fuzzy inference system;. International Journal of Industrial Ergonomics 32 (2), Flannagan, C. A. C., Manary, M. A., Schneider, L. W., Reed, M. P., Improved seating accommodation model with application to different user populations. In: Proc. SAE International Congress & Exposition. Vol SAE, Warrendale, PA, USA, pp Fromuth, R., Parkinson, M., Predicting the 5th and 95th percentile anthropometric segment lengths from population stature. In: Proc. ASME International Design Engineering Technical Conferences. New York, NY. Gordon, C. C., Churchill, T., Clauser, C. E., Bradtmiller, B., Mc- Conville, J. T., Tebbetts, I., Walker, R. A., anthropometric survey of U.S. Army personnel: Methods and summary statistics. final report. (NATICK/TR-89/027). Harrison, C., Robinette, K., Jan Caesar: Summary statistics for the adult population (ages 18-65) of the united states of america. stinet.dtic.mil. HFES 300 Committee, Guidelines for Using Anthropometric Data in Product Design. Human Factors and Ergonomics Society, Santa Monica, CA. Jolliffe, I., Principal Component Analysis, 2nd Edition. Springer. Jung, K., Kwon, O., You, H., Development of a digital human model generation method for ergonomic design in virtual environment. International Journal of Industrial Ergonomics In Press, Corrected Proof,. Kinghorn, R. A., Bittner, A. C., Truck driver anthropometric data: Estimation of the current population;. International Journal of Industrial Ergonomics 15 (3), Klamklay, J., Sungkhapong, A., Yodpijit, N., E. Patterson, P., Anthropometry of the southern Thai population. International Journal of Industrial Ergonomics 38 (1), Lockett, J., Kozycki, R., Gordon, C., Bellandi, E., Proposed integrated human figure modeling analysis approach for the Army s future combat systems. In: Military Vehicle Technology (SP- 1962). SAE International, Warrendale, PA. Manary, M., Flannagan, C., Reed, M., Schneider, L., Human subject testing in support of ASPECT. Technical Paper SAE Transactions: Journal of Passenger Cars 108. Mokdad, M., Al-Ansari, M., Anthropometrics for the design of Bahraini school furniture. International Journal of Industrial Ergonomics In Press, Corrected Proof,. National Center for Statistics and Analysis, National Automotive Sampling System Crashworthiness Data System. Analytical User s Manual. U.S. Department of Transportation, Washington, DC. Nicolay, C. W., Walker, A. L., Grip strength and endurance: Influences of anthropometric variation, hand dominance, and gender;. International Journal of Industrial Ergonomics 35 (7), Parkinson, M., Reed, M., Kokkolaras, M., Papalambros, P., Optimizing truck cab layout for driver accommodation. ASME Journal of Mechanical Design 129 (11), Parkinson, M. B., Reed, M. P., Optimizing vehicle occupant packaging. SAE Technical Paper Peacock, B., Karwowski, W. (Eds.), Automotive Ergonomics. Taylor & Francis, London. 7 Robinette, K., Blackwell, S., Daanen, H., Boehmer, M., n Civilian american and european surface anthropometry resource (caesar), final report. volume 1. Tech. rep., United States Air Force Research Laboratory, Wright-Patterson Air Force Base, Ohio, U.S. Roebuck, J. A., Anthropometric methods: designing to fit the human body. HFES. Santa Monica, CA. Salvendy, G. (Ed.), Handbook of human factors and ergonomics, 3rd Edition. John Wiley & Sons. U.S. Centers for Disease Control and Prevention, National Health and Nutrition Examiniation Survey (NHANES). National Center for Health Statistics. URL List of Figures 1 The body mass index (BMI) distributions for women in two commonly used databases (ANSUR and CAESAR) and the general U.S. population (NHANES, ) A flowchart depicting the methodology used to obtain the synthesized anthropometry.. 4

8 Table 1: Comparison of the summary statistics for the true and synthesized populations of ANSUR women. ANSUR women synthesized 5 th 50 th 95 th 5 th 50 th 95 th stature BMI acromion height, seated acromion height, standing acromion - radiale length biacromial breadth bideltoid breadth buttock - knee length, seated chest circ., under bust head breadth head length hip breadth, seated knee height, seated sitting height thigh circumference thumbtip reach Table 2: The 90% error intervals for the distributions of each measure. Error percentages greater than an arbitrarily selected value of 3% are shown in bold. 90% error interval (%) 90% error interval (mm) 5 th 50 th 95 th 5 th 50 th 95 th acromion height, seated acromion height, standing acromion - radiale length biacromial breadth bideltoid breadth buttock - knee length, seated chest circ., under bust head breadth head length hip breadth, seated knee height, seated sitting height thigh circumference thumbtip reach

DETC A Z-SCORE-BASED METHOD TO SYNTHESIZE ANTHROPOMETRIC DATASETS FOR GLOBAL USER POPULATIONS

DETC A Z-SCORE-BASED METHOD TO SYNTHESIZE ANTHROPOMETRIC DATASETS FOR GLOBAL USER POPULATIONS Proceedings of the ASME 03 International Design Engineering Technical Conferences and Computers and Information in Engineering Conference IDETC/CIE 03 August 4-7, 03, Portland, Oregon, USA DETC03-845 A

More information

Postprint.

Postprint. http://www.diva-portal.org Postprint This is the accepted version of a paper presented at 19th Triennial Congress of the International Ergonomics Association, Melbourne, Australia, 9-14 August, 2015. Citation

More information

METHODOLOGY FOR ESTIMATING THORACIC IMPACT RESPONSE IN FRONTAL CRASH TESTS

METHODOLOGY FOR ESTIMATING THORACIC IMPACT RESPONSE IN FRONTAL CRASH TESTS METHODOLOGY FOR ESTIMATING THORACIC IMPACT RESPONSE IN FRONTAL CRASH TESTS Craig P. Thor, Hampton C. Gabler Virginia Tech-Wake Forest, Center for Injury Biomechanics ABSTRACT This study has investigated

More information

machine design, Vol.9(2017) No.3, ISSN pp

machine design, Vol.9(2017) No.3, ISSN pp machine design, Vol.9(2017) No.3, ISSN 1821-1259 pp. 81-86 DOI: 10.24867/MD.9.2017.3.81-86 Research paper GEOMETRY OF HUMAN BODY MODEL USED FOR SIMULATION OF THERMAL COMFORT IN AN AGRICULTURAL VEHICLE

More information

Anthropometry of Saudi Arabian male workers

Anthropometry of Saudi Arabian male workers Anthropometry of Saudi Arabian male workers Abdalla Alrashdan, Mohamed Albassam, Abdullah Alkohlani, and Meshal Alkadi Industrial Engineering Department Alfaisal University Riyadh, Saudi Arabia aalrshdan@alfaisal.edu,

More information

An Improved Seating Accommodation Model with Application to Different User Populations

An Improved Seating Accommodation Model with Application to Different User Populations SAE TECHNICAL PAPER SERIES 980651 An Improved Seating Accommodation Model with Application to Different User Populations Carol A. C. Flannagan, Miriam A. Manary, Lawrence W. Schneider and Matthew P. Reed

More information

CHOOSING THE RIGHT SAMPLING TECHNIQUE FOR YOUR RESEARCH. Awanis Ku Ishak, PhD SBM

CHOOSING THE RIGHT SAMPLING TECHNIQUE FOR YOUR RESEARCH. Awanis Ku Ishak, PhD SBM CHOOSING THE RIGHT SAMPLING TECHNIQUE FOR YOUR RESEARCH Awanis Ku Ishak, PhD SBM Sampling The process of selecting a number of individuals for a study in such a way that the individuals represent the larger

More information

Application of Indirect Race/ Ethnicity Data in Quality Metric Analyses

Application of Indirect Race/ Ethnicity Data in Quality Metric Analyses Background The fifteen wholly-owned health plans under WellPoint, Inc. (WellPoint) historically did not collect data in regard to the race/ethnicity of it members. In order to overcome this lack of data

More information

Data Preprocessing Tasks

Data Preprocessing Tasks Data Tasks 1 2 3 Data Reduction 4 We re here. 1 Dimensionality Reduction Dimensionality reduction is a commonly used approach for generating fewer features. Typically used because too many features can

More information

CRISP: Capture-Recapture Interactive Simulation Package

CRISP: Capture-Recapture Interactive Simulation Package CRISP: Capture-Recapture Interactive Simulation Package George Volichenko Carnegie Mellon University Pittsburgh, PA gvoliche@andrew.cmu.edu December 17, 2012 Contents 1 Executive Summary 1 2 Introduction

More information

SOUTH COAST COASTAL RECREATION METHODS

SOUTH COAST COASTAL RECREATION METHODS SOUTH COAST COASTAL RECREATION METHODS A customized, web-based survey instrument, which utilizes Point 97 s Viewpoint survey and mapping technology, was used to collect spatially explicit data on coastal

More information

This report details analyses and methodologies used to examine and visualize the spatial and nonspatial

This report details analyses and methodologies used to examine and visualize the spatial and nonspatial Analysis Summary: Acute Myocardial Infarction and Social Determinants of Health Acute Myocardial Infarction Study Summary March 2014 Project Summary :: Purpose This report details analyses and methodologies

More information

The Journal of Database Marketing, Vol. 6, No. 3, 1999, pp Retail Trade Area Analysis: Concepts and New Approaches

The Journal of Database Marketing, Vol. 6, No. 3, 1999, pp Retail Trade Area Analysis: Concepts and New Approaches Retail Trade Area Analysis: Concepts and New Approaches By Donald B. Segal Spatial Insights, Inc. 4938 Hampden Lane, PMB 338 Bethesda, MD 20814 Abstract: The process of estimating or measuring store trade

More information

104 Business Research Methods - MCQs

104 Business Research Methods - MCQs 104 Business Research Methods - MCQs 1) Process of obtaining a numerical description of the extent to which a person or object possesses some characteristics a) Measurement b) Scaling c) Questionnaire

More information

Neighborhood social characteristics and chronic disease outcomes: does the geographic scale of neighborhood matter? Malia Jones

Neighborhood social characteristics and chronic disease outcomes: does the geographic scale of neighborhood matter? Malia Jones Neighborhood social characteristics and chronic disease outcomes: does the geographic scale of neighborhood matter? Malia Jones Prepared for consideration for PAA 2013 Short Abstract Empirical research

More information

Assessing Social Vulnerability to Biophysical Hazards. Dr. Jasmine Waddell

Assessing Social Vulnerability to Biophysical Hazards. Dr. Jasmine Waddell Assessing Social Vulnerability to Biophysical Hazards Dr. Jasmine Waddell About the Project Built on a need to understand: The pre-disposition of the populations in the SE to adverse impacts from disaster

More information

Analysis of Bank Branches in the Greater Los Angeles Region

Analysis of Bank Branches in the Greater Los Angeles Region Analysis of Bank Branches in the Greater Los Angeles Region Brian Moore Introduction The Community Reinvestment Act, passed by Congress in 1977, was written to address redlining by financial institutions.

More information

Principal Components Analysis. Sargur Srihari University at Buffalo

Principal Components Analysis. Sargur Srihari University at Buffalo Principal Components Analysis Sargur Srihari University at Buffalo 1 Topics Projection Pursuit Methods Principal Components Examples of using PCA Graphical use of PCA Multidimensional Scaling Srihari 2

More information

Validation of Visual Statistical Inference, with Application to Linear Models

Validation of Visual Statistical Inference, with Application to Linear Models Validation of Visual Statistical Inference, with pplication to Linear Models Mahbubul Majumder, Heike Hofmann, Dianne Cook Department of Statistics, Iowa State University pril 2, 212 Statistical graphics

More information

Introduction to Course

Introduction to Course .. Introduction to Course Oran Kittithreerapronchai 1 1 Department of Industrial Engineering, Chulalongkorn University Bangkok 10330 THAILAND last updated: September 17, 2016 COMP METH v2.00: intro 1/

More information

ASA Section on Survey Research Methods

ASA Section on Survey Research Methods REGRESSION-BASED STATISTICAL MATCHING: RECENT DEVELOPMENTS Chris Moriarity, Fritz Scheuren Chris Moriarity, U.S. Government Accountability Office, 411 G Street NW, Washington, DC 20548 KEY WORDS: data

More information

DATA DISAGGREGATION BY GEOGRAPHIC

DATA DISAGGREGATION BY GEOGRAPHIC PROGRAM CYCLE ADS 201 Additional Help DATA DISAGGREGATION BY GEOGRAPHIC LOCATION Introduction This document provides supplemental guidance to ADS 201.3.5.7.G Indicator Disaggregation, and discusses concepts

More information

Data Analyses in Multivariate Regression Chii-Dean Joey Lin, SDSU, San Diego, CA

Data Analyses in Multivariate Regression Chii-Dean Joey Lin, SDSU, San Diego, CA Data Analyses in Multivariate Regression Chii-Dean Joey Lin, SDSU, San Diego, CA ABSTRACT Regression analysis is one of the most used statistical methodologies. It can be used to describe or predict causal

More information

EQ: What is a normal distribution?

EQ: What is a normal distribution? Unit 5 - Statistics What is the purpose EQ: What tools do we have to assess data? this unit? What vocab will I need? Vocabulary: normal distribution, standard, nonstandard, interquartile range, population

More information

ABSTRACT INTRODUCTION

ABSTRACT INTRODUCTION ABSTRACT Presented in this paper is an approach to fault diagnosis based on a unifying review of linear Gaussian models. The unifying review draws together different algorithms such as PCA, factor analysis,

More information

GEOGRAPHIC INFORMATION SYSTEMS Session 8

GEOGRAPHIC INFORMATION SYSTEMS Session 8 GEOGRAPHIC INFORMATION SYSTEMS Session 8 Introduction Geography underpins all activities associated with a census Census geography is essential to plan and manage fieldwork as well as to report results

More information

Planned Missingness Designs and the American Community Survey (ACS)

Planned Missingness Designs and the American Community Survey (ACS) Planned Missingness Designs and the American Community Survey (ACS) Steven G. Heeringa Institute for Social Research University of Michigan Presentation to the National Academies of Sciences Workshop on

More information

Ergonomic Assessment of Snow Shovels Using Digital Human Modeling

Ergonomic Assessment of Snow Shovels Using Digital Human Modeling Ergonomic Assessment of Snow Shovels Using Digital Human Modeling Carly Wolkiewicz, Katherine Collins and Faisal Aqlan* Industrial Engineering Department Penn State Erie, The Behrend College Erie, PA 16563,

More information

The Geospatial Census: plans and progress for the 2010 round of population censuses

The Geospatial Census: plans and progress for the 2010 round of population censuses The Geospatial Census: plans and progress for the 2010 round of population censuses David Rain, Assistant Professor of Geography & International Affairs, and Consultant to the United Nations/UNSD drain@gwu.edu

More information

TECHNIQUE FOR RANKING POTENTIAL PREDICTOR LAYERS FOR USE IN REMOTE SENSING ANALYSIS. Andrew Lister, Mike Hoppus, and Rachel Riemam

TECHNIQUE FOR RANKING POTENTIAL PREDICTOR LAYERS FOR USE IN REMOTE SENSING ANALYSIS. Andrew Lister, Mike Hoppus, and Rachel Riemam TECHNIQUE FOR RANKING POTENTIAL PREDICTOR LAYERS FOR USE IN REMOTE SENSING ANALYSIS Andrew Lister, Mike Hoppus, and Rachel Riemam ABSTRACT. Spatial modeling using GIS-based predictor layers often requires

More information

USA SWIMMING 2015 D&I SCORECARD

USA SWIMMING 2015 D&I SCORECARD D&I SCORECARD D&I SCORECARD How the D&I Scorecard is collected and prepared In accordance with the Ted Stevens Act, the USOC s Diversity and Inclusion department annually collects diversity data from the

More information

USA GOLF 2013 D&I SCORECARD

USA GOLF 2013 D&I SCORECARD D&I SCORECARD D&I SCORECARD How the D&I Scorecard is collected and prepared In accordance with the Ted Stevens Act, the USOC s Diversity and Inclusion department annually collects diversity data from the

More information

Basics of Multivariate Modelling and Data Analysis

Basics of Multivariate Modelling and Data Analysis Basics of Multivariate Modelling and Data Analysis Kurt-Erik Häggblom 2. Overview of multivariate techniques 2.1 Different approaches to multivariate data analysis 2.2 Classification of multivariate techniques

More information

Exploiting Multiple Mahalanobis Distance Metric to Screen Outliers from Analogue Product Manufacturing Test Responses

Exploiting Multiple Mahalanobis Distance Metric to Screen Outliers from Analogue Product Manufacturing Test Responses 1 Exploiting Multiple Mahalanobis Distance Metric to Screen Outliers from Analogue Product Manufacturing Test Responses Shaji Krishnan and Hans G. Kerkhoff Analytical Research Department, TNO, Zeist, The

More information

1- ia 33;14.iSU VALIDATION OF A STATISTICAL MATCHING PROCEDURE USED TO CREATE UNITED STATES MARINE CORPS ANTHROPOMETRIC DATABASES

1- ia 33;14.iSU VALIDATION OF A STATISTICAL MATCHING PROCEDURE USED TO CREATE UNITED STATES MARINE CORPS ANTHROPOMETRIC DATABASES *19 -''s-i 1- ia 33;14.iSU 49 A TECHNICAL REPORT NATICK/TR-96/035 AD VALIDATION OF A STATISTICAL MATCHING PROCEDURE USED TO CREATE UNITED STATES MARINE CORPS ANTHROPOMETRIC DATABASES By Sarah M. Donelson*

More information

FACTOR ANALYSIS AND MULTIDIMENSIONAL SCALING

FACTOR ANALYSIS AND MULTIDIMENSIONAL SCALING FACTOR ANALYSIS AND MULTIDIMENSIONAL SCALING Vishwanath Mantha Department for Electrical and Computer Engineering Mississippi State University, Mississippi State, MS 39762 mantha@isip.msstate.edu ABSTRACT

More information

SC705: Advanced Statistics Instructor: Natasha Sarkisian Class notes: Introduction to Structural Equation Modeling (SEM)

SC705: Advanced Statistics Instructor: Natasha Sarkisian Class notes: Introduction to Structural Equation Modeling (SEM) SC705: Advanced Statistics Instructor: Natasha Sarkisian Class notes: Introduction to Structural Equation Modeling (SEM) SEM is a family of statistical techniques which builds upon multiple regression,

More information

DEVELOPING DECISION SUPPORT TOOLS FOR THE IMPLEMENTATION OF BICYCLE AND PEDESTRIAN SAFETY STRATEGIES

DEVELOPING DECISION SUPPORT TOOLS FOR THE IMPLEMENTATION OF BICYCLE AND PEDESTRIAN SAFETY STRATEGIES DEVELOPING DECISION SUPPORT TOOLS FOR THE IMPLEMENTATION OF BICYCLE AND PEDESTRIAN SAFETY STRATEGIES Deo Chimba, PhD., P.E., PTOE Associate Professor Civil Engineering Department Tennessee State University

More information

Glossary. The ISI glossary of statistical terms provides definitions in a number of different languages:

Glossary. The ISI glossary of statistical terms provides definitions in a number of different languages: Glossary The ISI glossary of statistical terms provides definitions in a number of different languages: http://isi.cbs.nl/glossary/index.htm Adjusted r 2 Adjusted R squared measures the proportion of the

More information

U.S. SOCCER FEDERATION 2014 D&I SCORECARD

U.S. SOCCER FEDERATION 2014 D&I SCORECARD D&I SCORECARD D&I SCORECARD How the D&I Scorecard is collected and prepared How s in the USOC D&I Scorecard are derived In accordance with the Ted Stevens Act, the USOC s Diversity and Inclusion department

More information

CHAPTER 2 Estimating Probabilities

CHAPTER 2 Estimating Probabilities CHAPTER 2 Estimating Probabilities Machine Learning Copyright c 2017. Tom M. Mitchell. All rights reserved. *DRAFT OF September 16, 2017* *PLEASE DO NOT DISTRIBUTE WITHOUT AUTHOR S PERMISSION* This is

More information

Do not copy, post, or distribute

Do not copy, post, or distribute 14 CORRELATION ANALYSIS AND LINEAR REGRESSION Assessing the Covariability of Two Quantitative Properties 14.0 LEARNING OBJECTIVES In this chapter, we discuss two related techniques for assessing a possible

More information

A Modified Incremental Principal Component Analysis for On-line Learning of Feature Space and Classifier

A Modified Incremental Principal Component Analysis for On-line Learning of Feature Space and Classifier A Modified Incremental Principal Component Analysis for On-line Learning of Feature Space and Classifier Seiichi Ozawa, Shaoning Pang, and Nikola Kasabov Graduate School of Science and Technology, Kobe

More information

An Introduction to Mplus and Path Analysis

An Introduction to Mplus and Path Analysis An Introduction to Mplus and Path Analysis PSYC 943: Fundamentals of Multivariate Modeling Lecture 10: October 30, 2013 PSYC 943: Lecture 10 Today s Lecture Path analysis starting with multivariate regression

More information

Regression Models for Quantitative and Qualitative Predictors: An Overview

Regression Models for Quantitative and Qualitative Predictors: An Overview Regression Models for Quantitative and Qualitative Predictors: An Overview Polynomial regression models Interaction regression models Qualitative predictors Indicator variables Modeling interactions between

More information

PIRLS 2016 Achievement Scaling Methodology 1

PIRLS 2016 Achievement Scaling Methodology 1 CHAPTER 11 PIRLS 2016 Achievement Scaling Methodology 1 The PIRLS approach to scaling the achievement data, based on item response theory (IRT) scaling with marginal estimation, was developed originally

More information

Conservative variance estimation for sampling designs with zero pairwise inclusion probabilities

Conservative variance estimation for sampling designs with zero pairwise inclusion probabilities Conservative variance estimation for sampling designs with zero pairwise inclusion probabilities Peter M. Aronow and Cyrus Samii Forthcoming at Survey Methodology Abstract We consider conservative variance

More information

Advanced Quantitative Research Methodology Lecture Notes: January Ecological 28, 2012 Inference1 / 38

Advanced Quantitative Research Methodology Lecture Notes: January Ecological 28, 2012 Inference1 / 38 Advanced Quantitative Research Methodology Lecture Notes: Ecological Inference 1 Gary King http://gking.harvard.edu January 28, 2012 1 c Copyright 2008 Gary King, All Rights Reserved. Gary King http://gking.harvard.edu

More information

Inferences Based on Two Samples

Inferences Based on Two Samples Chapter 6 Inferences Based on Two Samples Frequently we want to use statistical techniques to compare two populations. For example, one might wish to compare the proportions of families with incomes below

More information

NEW CONCEPTS - SOIL SURVEY OF THE FUTURE

NEW CONCEPTS - SOIL SURVEY OF THE FUTURE NEW CONCEPTS - SOIL SURVEY OF THE FUTURE The new process of doing soil surveys by Major Land Resource Areas (MLRA) highlights this section. Special emphasis is given to an overview of the National Soil

More information

Dimension Reduction and Classification Using PCA and Factor. Overview

Dimension Reduction and Classification Using PCA and Factor. Overview Dimension Reduction and Classification Using PCA and - A Short Overview Laboratory for Interdisciplinary Statistical Analysis Department of Statistics Virginia Tech http://www.stat.vt.edu/consult/ March

More information

Discriminative Direction for Kernel Classifiers

Discriminative Direction for Kernel Classifiers Discriminative Direction for Kernel Classifiers Polina Golland Artificial Intelligence Lab Massachusetts Institute of Technology Cambridge, MA 02139 polina@ai.mit.edu Abstract In many scientific and engineering

More information

Non-parametric bootstrap and small area estimation to mitigate bias in crowdsourced data Simulation study and application to perceived safety

Non-parametric bootstrap and small area estimation to mitigate bias in crowdsourced data Simulation study and application to perceived safety Non-parametric bootstrap and small area estimation to mitigate bias in crowdsourced data Simulation study and application to perceived safety David Buil-Gil, Reka Solymosi Centre for Criminology and Criminal

More information

Applied Microeconometrics (L5): Panel Data-Basics

Applied Microeconometrics (L5): Panel Data-Basics Applied Microeconometrics (L5): Panel Data-Basics Nicholas Giannakopoulos University of Patras Department of Economics ngias@upatras.gr November 10, 2015 Nicholas Giannakopoulos (UPatras) MSc Applied Economics

More information

Measuring Social Influence Without Bias

Measuring Social Influence Without Bias Measuring Social Influence Without Bias Annie Franco Bobbie NJ Macdonald December 9, 2015 The Problem CS224W: Final Paper How well can statistical models disentangle the effects of social influence from

More information

An Introduction to Path Analysis

An Introduction to Path Analysis An Introduction to Path Analysis PRE 905: Multivariate Analysis Lecture 10: April 15, 2014 PRE 905: Lecture 10 Path Analysis Today s Lecture Path analysis starting with multivariate regression then arriving

More information

Unsupervised Learning Methods

Unsupervised Learning Methods Structural Health Monitoring Using Statistical Pattern Recognition Unsupervised Learning Methods Keith Worden and Graeme Manson Presented by Keith Worden The Structural Health Monitoring Process 1. Operational

More information

BNG 495 Capstone Design. Descriptive Statistics

BNG 495 Capstone Design. Descriptive Statistics BNG 495 Capstone Design Descriptive Statistics Overview The overall goal of this short course in statistics is to provide an introduction to descriptive and inferential statistical methods, with a focus

More information

What is Statistics? Statistics is the science of understanding data and of making decisions in the face of variability and uncertainty.

What is Statistics? Statistics is the science of understanding data and of making decisions in the face of variability and uncertainty. What is Statistics? Statistics is the science of understanding data and of making decisions in the face of variability and uncertainty. Statistics is a field of study concerned with the data collection,

More information

A multivariate multilevel model for the analysis of TIMMS & PIRLS data

A multivariate multilevel model for the analysis of TIMMS & PIRLS data A multivariate multilevel model for the analysis of TIMMS & PIRLS data European Congress of Methodology July 23-25, 2014 - Utrecht Leonardo Grilli 1, Fulvia Pennoni 2, Carla Rampichini 1, Isabella Romeo

More information

Dimensionality Reduction Techniques (DRT)

Dimensionality Reduction Techniques (DRT) Dimensionality Reduction Techniques (DRT) Introduction: Sometimes we have lot of variables in the data for analysis which create multidimensional matrix. To simplify calculation and to get appropriate,

More information

Noise & Data Reduction

Noise & Data Reduction Noise & Data Reduction Paired Sample t Test Data Transformation - Overview From Covariance Matrix to PCA and Dimension Reduction Fourier Analysis - Spectrum Dimension Reduction 1 Remember: Central Limit

More information

Lecture 2. Judging the Performance of Classifiers. Nitin R. Patel

Lecture 2. Judging the Performance of Classifiers. Nitin R. Patel Lecture 2 Judging the Performance of Classifiers Nitin R. Patel 1 In this note we will examine the question of how to udge the usefulness of a classifier and how to compare different classifiers. Not only

More information

A Driver Interface for a Road Hazard Warning System: Development and Preliminary Evaluation

A Driver Interface for a Road Hazard Warning System: Development and Preliminary Evaluation A Driver Interface for a Road Hazard Warning System: Development and Preliminary Evaluation Paul Green University of Michigan Transportation Research Institute Human Factors Division 2901 Baxter Road Ann

More information

Basics of Uncertainty Analysis

Basics of Uncertainty Analysis Basics of Uncertainty Analysis Chapter Six Basics of Uncertainty Analysis 6.1 Introduction As shown in Fig. 6.1, analysis models are used to predict the performances or behaviors of a product under design.

More information

The Scope and Growth of Spatial Analysis in the Social Sciences

The Scope and Growth of Spatial Analysis in the Social Sciences context. 2 We applied these search terms to six online bibliographic indexes of social science Completed as part of the CSISS literature search initiative on November 18, 2003 The Scope and Growth of Spatial

More information

Bivariate Relationships Between Variables

Bivariate Relationships Between Variables Bivariate Relationships Between Variables BUS 735: Business Decision Making and Research 1 Goals Specific goals: Detect relationships between variables. Be able to prescribe appropriate statistical methods

More information

In the Matter of the Study of State Police Stop Activity at the Southern end of the New Jersey Turnpike

In the Matter of the Study of State Police Stop Activity at the Southern end of the New Jersey Turnpike In the Matter of the Study of State Police Stop Activity at the Southern end of the New Jersey Turnpike We were retained by the ACLU of New Jersey to determine whether the New Jersey State Police (NJSP)

More information

One-stage dose-response meta-analysis

One-stage dose-response meta-analysis One-stage dose-response meta-analysis Nicola Orsini, Alessio Crippa Biostatistics Team Department of Public Health Sciences Karolinska Institutet http://ki.se/en/phs/biostatistics-team 2017 Nordic and

More information

AP Final Review II Exploring Data (20% 30%)

AP Final Review II Exploring Data (20% 30%) AP Final Review II Exploring Data (20% 30%) Quantitative vs Categorical Variables Quantitative variables are numerical values for which arithmetic operations such as means make sense. It is usually a measure

More information

STAT Section 2.1: Basic Inference. Basic Definitions

STAT Section 2.1: Basic Inference. Basic Definitions STAT 518 --- Section 2.1: Basic Inference Basic Definitions Population: The collection of all the individuals of interest. This collection may be or even. Sample: A collection of elements of the population.

More information

Introduction to Machine Learning

Introduction to Machine Learning 10-701 Introduction to Machine Learning PCA Slides based on 18-661 Fall 2018 PCA Raw data can be Complex, High-dimensional To understand a phenomenon we measure various related quantities If we knew what

More information

Understanding China Census Data with GIS By Shuming Bao and Susan Haynie China Data Center, University of Michigan

Understanding China Census Data with GIS By Shuming Bao and Susan Haynie China Data Center, University of Michigan Understanding China Census Data with GIS By Shuming Bao and Susan Haynie China Data Center, University of Michigan The Census data for China provides comprehensive demographic and business information

More information

Reducing Computation Time for the Analysis of Large Social Science Datasets

Reducing Computation Time for the Analysis of Large Social Science Datasets Reducing Computation Time for the Analysis of Large Social Science Datasets Douglas G. Bonett Center for Statistical Analysis in the Social Sciences University of California, Santa Cruz Jan 28, 2014 Overview

More information

Feature selection and classifier performance in computer-aided diagnosis: The effect of finite sample size

Feature selection and classifier performance in computer-aided diagnosis: The effect of finite sample size Feature selection and classifier performance in computer-aided diagnosis: The effect of finite sample size Berkman Sahiner, a) Heang-Ping Chan, Nicholas Petrick, Robert F. Wagner, b) and Lubomir Hadjiiski

More information

Model Estimation Example

Model Estimation Example Ronald H. Heck 1 EDEP 606: Multivariate Methods (S2013) April 7, 2013 Model Estimation Example As we have moved through the course this semester, we have encountered the concept of model estimation. Discussions

More information

Regression Models REVISED TEACHING SUGGESTIONS ALTERNATIVE EXAMPLES

Regression Models REVISED TEACHING SUGGESTIONS ALTERNATIVE EXAMPLES M04_REND6289_10_IM_C04.QXD 5/7/08 2:49 PM Page 46 4 C H A P T E R Regression Models TEACHING SUGGESTIONS Teaching Suggestion 4.1: Which Is the Independent Variable? We find that students are often confused

More information

Myoelectrical signal classification based on S transform and two-directional 2DPCA

Myoelectrical signal classification based on S transform and two-directional 2DPCA Myoelectrical signal classification based on S transform and two-directional 2DPCA Hong-Bo Xie1 * and Hui Liu2 1 ARC Centre of Excellence for Mathematical and Statistical Frontiers Queensland University

More information

FERMENTATION BATCH PROCESS MONITORING BY STEP-BY-STEP ADAPTIVE MPCA. Ning He, Lei Xie, Shu-qing Wang, Jian-ming Zhang

FERMENTATION BATCH PROCESS MONITORING BY STEP-BY-STEP ADAPTIVE MPCA. Ning He, Lei Xie, Shu-qing Wang, Jian-ming Zhang FERMENTATION BATCH PROCESS MONITORING BY STEP-BY-STEP ADAPTIVE MPCA Ning He Lei Xie Shu-qing Wang ian-ming Zhang National ey Laboratory of Industrial Control Technology Zhejiang University Hangzhou 3007

More information

BOOK REVIEW Sampling: Design and Analysis. Sharon L. Lohr. 2nd Edition, International Publication,

BOOK REVIEW Sampling: Design and Analysis. Sharon L. Lohr. 2nd Edition, International Publication, STATISTICS IN TRANSITION-new series, August 2011 223 STATISTICS IN TRANSITION-new series, August 2011 Vol. 12, No. 1, pp. 223 230 BOOK REVIEW Sampling: Design and Analysis. Sharon L. Lohr. 2nd Edition,

More information

Spatial Layout and the Promotion of Innovation in Organizations

Spatial Layout and the Promotion of Innovation in Organizations Spatial Layout and the Promotion of Innovation in Organizations Jean Wineman, Felichism Kabo, Jason Owen-Smith, Gerald Davis University of Michigan, Ann Arbor, Michigan ABSTRACT: Research on the enabling

More information

Making Our Cities Safer: A Study In Neighbhorhood Crime Patterns

Making Our Cities Safer: A Study In Neighbhorhood Crime Patterns Making Our Cities Safer: A Study In Neighbhorhood Crime Patterns Aly Kane alykane@stanford.edu Ariel Sagalovsky asagalov@stanford.edu Abstract Equipped with an understanding of the factors that influence

More information

Quantitative Trendspotting. Rex Yuxing Du and Wagner A. Kamakura. Web Appendix A Inferring and Projecting the Latent Dynamic Factors

Quantitative Trendspotting. Rex Yuxing Du and Wagner A. Kamakura. Web Appendix A Inferring and Projecting the Latent Dynamic Factors 1 Quantitative Trendspotting Rex Yuxing Du and Wagner A. Kamakura Web Appendix A Inferring and Projecting the Latent Dynamic Factors The procedure for inferring the latent state variables (i.e., [ ] ),

More information

A Modified Incremental Principal Component Analysis for On-Line Learning of Feature Space and Classifier

A Modified Incremental Principal Component Analysis for On-Line Learning of Feature Space and Classifier A Modified Incremental Principal Component Analysis for On-Line Learning of Feature Space and Classifier Seiichi Ozawa 1, Shaoning Pang 2, and Nikola Kasabov 2 1 Graduate School of Science and Technology,

More information

Core Courses for Students Who Enrolled Prior to Fall 2018

Core Courses for Students Who Enrolled Prior to Fall 2018 Biostatistics and Applied Data Analysis Students must take one of the following two sequences: Sequence 1 Biostatistics and Data Analysis I (PHP 2507) This course, the first in a year long, two-course

More information

Katherine J. Curtis 1, Heather O Connell 1, Perla E. Reyes 2, and Jun Zhu 1. University of Wisconsin-Madison 2. University of California-Santa Cruz

Katherine J. Curtis 1, Heather O Connell 1, Perla E. Reyes 2, and Jun Zhu 1. University of Wisconsin-Madison 2. University of California-Santa Cruz Disentangling the Spatial Concentration and Temporal Stickiness of Poverty: Industrial Structure, Racial/Ethnic Composition, and the Complex Links to Poverty Katherine J. Curtis 1, Heather O Connell 1,

More information

A Modular NMF Matching Algorithm for Radiation Spectra

A Modular NMF Matching Algorithm for Radiation Spectra A Modular NMF Matching Algorithm for Radiation Spectra Melissa L. Koudelka Sensor Exploitation Applications Sandia National Laboratories mlkoude@sandia.gov Daniel J. Dorsey Systems Technologies Sandia

More information

Ministry of Health and Long-Term Care Geographic Information System (GIS) Strategy An Overview of the Strategy Implementation Plan November 2009

Ministry of Health and Long-Term Care Geographic Information System (GIS) Strategy An Overview of the Strategy Implementation Plan November 2009 Ministry of Health and Long-Term Care Geographic Information System (GIS) Strategy An Overview of the Strategy Implementation Plan November 2009 John Hill, Health Analytics Branch Health System Information

More information

Sensitivity of FIA Volume Estimates to Changes in Stratum Weights and Number of Strata. Data. Methods. James A. Westfall and Michael Hoppus 1

Sensitivity of FIA Volume Estimates to Changes in Stratum Weights and Number of Strata. Data. Methods. James A. Westfall and Michael Hoppus 1 Sensitivity of FIA Volume Estimates to Changes in Stratum Weights and Number of Strata James A. Westfall and Michael Hoppus 1 Abstract. In the Northeast region, the USDA Forest Service Forest Inventory

More information

Passing-Bablok Regression for Method Comparison

Passing-Bablok Regression for Method Comparison Chapter 313 Passing-Bablok Regression for Method Comparison Introduction Passing-Bablok regression for method comparison is a robust, nonparametric method for fitting a straight line to two-dimensional

More information

Estimation and sample size calculations for correlated binary error rates of biometric identification devices

Estimation and sample size calculations for correlated binary error rates of biometric identification devices Estimation and sample size calculations for correlated binary error rates of biometric identification devices Michael E. Schuckers,11 Valentine Hall, Department of Mathematics Saint Lawrence University,

More information

peak half-hourly New South Wales

peak half-hourly New South Wales Forecasting long-term peak half-hourly electricity demand for New South Wales Dr Shu Fan B.S., M.S., Ph.D. Professor Rob J Hyndman B.Sc. (Hons), Ph.D., A.Stat. Business & Economic Forecasting Unit Report

More information

Models and Anthropometry

Models and Anthropometry Learning Objectives Models and Anthropometry Readings: some of Chapter 8 [in text] some of Chapter 11 [in text] By the end of this lecture, you should be able to: Describe common anthropometric measurements

More information

Sampling The World. presented by: Tim Haithcoat University of Missouri Columbia

Sampling The World. presented by: Tim Haithcoat University of Missouri Columbia Sampling The World presented by: Tim Haithcoat University of Missouri Columbia Compiled with materials from: Charles Parson, Bemidji State University and Timothy Nyerges, University of Washington Introduction

More information

Contextual Effects in Modeling for Small Domains

Contextual Effects in Modeling for Small Domains University of Wollongong Research Online Applied Statistics Education and Research Collaboration (ASEARC) - Conference Papers Faculty of Engineering and Information Sciences 2011 Contextual Effects in

More information

Institution: CUNY Hostos Community College (190585) User ID: 36C0029 Completions Overview distance education All Completers unduplicated count

Institution: CUNY Hostos Community College (190585) User ID: 36C0029 Completions Overview distance education All Completers unduplicated count Completions 204-5 Institution: CUNY Hostos Community College (90585) User ID: 36C0029 Completions Overview Welcome to the IPEDS Completions survey component. The Completions component is one of several

More information

CHAPTER 4 VARIABILITY ANALYSES. Chapter 3 introduced the mode, median, and mean as tools for summarizing the

CHAPTER 4 VARIABILITY ANALYSES. Chapter 3 introduced the mode, median, and mean as tools for summarizing the CHAPTER 4 VARIABILITY ANALYSES Chapter 3 introduced the mode, median, and mean as tools for summarizing the information provided in an distribution of data. Measures of central tendency are often useful

More information

A User s Guide to the Federal Statistical Research Data Centers

A User s Guide to the Federal Statistical Research Data Centers A User s Guide to the Federal Statistical Research Data Centers Mark Roberts Professor of Economics and Director PSU FSRDC September 2016 M. Roberts () RDC User s Guide September 2016 1 / 14 Outline Introduction

More information

Taking into account sampling design in DAD. Population SAMPLING DESIGN AND DAD

Taking into account sampling design in DAD. Population SAMPLING DESIGN AND DAD Taking into account sampling design in DAD SAMPLING DESIGN AND DAD With version 4.2 and higher of DAD, the Sampling Design (SD) of the database can be specified in order to calculate the correct asymptotic

More information

A SHORT INTRODUCTION TO PROBABILITY

A SHORT INTRODUCTION TO PROBABILITY A Lecture for B.Sc. 2 nd Semester, Statistics (General) A SHORT INTRODUCTION TO PROBABILITY By Dr. Ajit Goswami Dept. of Statistics MDKG College, Dibrugarh 19-Apr-18 1 Terminology The possible outcomes

More information