Rebecca G. Frederick L ouisiana State U niversity D epartm ent of E xperim ental Statistics

Similar documents
c. What is the average rate of change of f on the interval [, ]? Answer: d. What is a local minimum value of f? Answer: 5 e. On what interval(s) is f

Form and content. Iowa Research Online. University of Iowa. Ann A Rahim Khan University of Iowa. Theses and Dissertations

A L A BA M A L A W R E V IE W

Grain Reserves, Volatility and the WTO

Functional pottery [slide]

LSU Historical Dissertations and Theses

STEEL PIPE NIPPLE BLACK AND GALVANIZED

LU N C H IN C LU D E D

Class Diagrams. CSC 440/540: Software Engineering Slide #1

C o r p o r a t e l i f e i n A n c i e n t I n d i a e x p r e s s e d i t s e l f

EKOLOGIE EN SYSTEMATIEK. T h is p a p e r n o t to be c i t e d w ith o u t p r i o r r e f e r e n c e to th e a u th o r. PRIMARY PRODUCTIVITY.

REFUGEE AND FORCED MIGRATION STUDIES

B ooks Expans ion on S ciencedirect: 2007:

MAHARASHTRA STATE BOARD OF TECHNICAL EDUCATION

The Ability C ongress held at the Shoreham Hotel Decem ber 29 to 31, was a reco rd breaker for winter C ongresses.

gender mains treaming in Polis h practice

The Construction and Testing of a New Empathy Rating Scale

Lesson Ten. What role does energy play in chemical reactions? Grade 8. Science. 90 minutes ENGLISH LANGUAGE ARTS

Agenda Rationale for ETG S eek ing I d eas ETG fram ew ork and res u lts 2

MOLINA HEALTHCARE, INC. (Exact name of registrant as specified in its charter)

TTM TECHNOLOGIES, INC. (Exact Name of Registrant as Specified in Charter)

Table of C on t en t s Global Campus 21 in N umbe r s R e g ional Capac it y D e v e lopme nt in E-L e ar ning Structure a n d C o m p o n en ts R ea

TECHNICAL MANUAL OPTIMA PT/ST/VS

Texas Student Assessment Program. Student Data File Format for Student Registration and Precoding

600 Billy Smith Road, Athens, VT

F O R M T H R E E K enya C ertificate of Secondary E ducation

A Study of Attitude Changes of Selected Student- Teachers During the Student-Teaching Experience.

VERITAS L1 trigger Constant Fraction Discriminator. Vladimir Vassiliev Jeremy Smith David Kieda

M a n a g e m e n t o f H y d ra u lic F ra c tu rin g D a ta

Sodium-Initiated Polymerization of Alpha- Methylstyrene in the Vicinity of Its Reported Ceiling Temperature

University Microfilms

Software Architecture. CSC 440: Software Engineering Slide #1

UNITED STATES SECURITIES AND EXCHANGE COMMISSION Washington, D.C FORM 8-K

A new ThermicSol product

INCOME TAXES IN ALONG-TERMMACROECONOMETRIC FORECASTING MODEL. Stephen H. Pollock

THE BANK OF NEW YORK MELLON CORPORATION (Exact name of registrant as specified in its charter)

UNITED STATES SECURITIES AND EXCHANGE COMMISSION Washington, D.C Form 8-K/A (Amendment No. 2)

Information System Desig

F48T10VHO, F60T10VHO, F72T10VHO, F96T12HO (1 LAMP ONLY) ELECTRICAL DATA (120V APPLICATION)

T h e C S E T I P r o j e c t

The use and effectiveness of financial and physical reserves in Montana's dryland wheat area by Howard W Hjort

AGRICULTURE SYLLABUS

Comparative Analyses of Teacher Verbal and Nonverbal Behavior in a Traditional and an Openspace

Feasibility Analysis, Dynamics, and Control of Distillation Columns With Vapor Recompression.

BE640 Intermediate Biostatistics 2. Regression and Correlation. Simple Linear Regression Software: SAS. Emergency Calls to the New York Auto Club

A Comparison of Two Methods of Teaching Computer Programming to Secondary Mathematics Students.

Use precise language and domain-specific vocabulary to inform about or explain the topic. CCSS.ELA-LITERACY.WHST D

THE EFFECT Of SUSPENSION CASTING ON THE HOT WORKABILITY AND MECHANICAL PROPERTIES OF A IS I TYPE STAINLESS STEEL

Beechwood Music Department Staff

heliozoan Zoo flagellated holotrichs peritrichs hypotrichs Euplots, Aspidisca Amoeba Thecamoeba Pleuromonas Bodo, Monosiga

S ca le M o d e l o f th e S o la r Sy ste m

SPECIFICATION SHEET : WHSG4-UNV-T8-HB

Dentists incomes, fees, practice costs, and the Economic Stabilization Act: to 1976

SCHOOLS DIVISION OFFICE OF KABANKALAN CITY

M. H. DALAL & ASSOCIATES C H ARTERED ACCOUNTANTS

Instruction Sheet COOL SERIES DUCT COOL LISTED H NK O. PR D C FE - Re ove r fro e c sed rea. I Page 1 Rev A

Distributive Justice, Injustice and Beyond Justice: The Difference from Principle to Reality between Karl Marx and John Rawls

Model Checking. Automated Verification of Computational Systems

Breakup of weakly bound nuclei and its influence on fusion. Paulo R. S. Gomes Univ. Fed. Fluminense (UFF), Niteroi, Brazil

ANNUAL MONITORING REPORT 2000

UNITED STATES SECURITIES AND EXCHANGE COMMISSION FORM 8-K. Farmer Bros. Co.

Software Process Models there are many process model s in th e li t e ra t u re, s om e a r e prescriptions and some are descriptions you need to mode

Compulsory Continuing Education for Certified Public Accountants: a Model Program for the State of Louisiana.

EXST7015: Estimating tree weights from other morphometric variables Raw data print

DATA SHEET FOR COMPOUND TEV

Use precise language and domain-specific vocabulary to inform about or explain the topic. CCSS.ELA-LITERACY.WHST D

SPU TTERIN G F R O M A LIQ U ID -PH A SE G A -IN EUTECTIC ALLOY KEVIN M A R K H U B B A R D YALE UNIVER SITY M A Y

P a g e 5 1 of R e p o r t P B 4 / 0 9

Photo. EPRI s Power System and Railroad Electromagnetic Compatibility Handbook

The Effects of Apprehension, Conviction and Incarceration on Crime in New York State

Taiwan Radio Occultation Process System (TROPS)

Heider's Five Levels of Causality and Assignment of Responsibility by Actors and Observers.

Country Report Government (Part I) Due: November 14, 2017

UNITED STATES SECURITIES AND EXCHANGE COMMISSION WASHINGTON, D.C FORM 8-K

Joh n L a w r e n c e, w ho is on sta ff at S ain t H ill, w r ite s :

7.2 P rodu c t L oad/u nload Sy stem s

Status of industrial arts teaching in Montana high schools with enrollments of from forty to one hundred fifty students in 1950

What are S M U s? SMU = Software Maintenance Upgrade Software patch del iv ery u nit wh ich once ins tal l ed and activ ated prov ides a point-fix for

Dangote Flour Mills Plc

176 5 t h Fl oo r. 337 P o ly me r Ma te ri al s

Sub: Filing of Reconciliation of share capital for the quarter ended September 30, 2018

OH BOY! Story. N a r r a t iv e a n d o bj e c t s th ea t e r Fo r a l l a g e s, fr o m th e a ge of 9

1980 Annual Report / FEDERAL R ESER V E BA N K OF RICHMOND. Digitized for FRASER Federal Reserve Bank of St.

NORWEGIAN MARITIME DIRECTORATE

FKSZ2.E Drivers for Light-emitting-diode Arrays, Modules and Controllers - Component

UNITED STATES SECURITIES AND EXCHANGE COMMISSION Washington, DC FORM 8-K. Current Report

A Comparison of the Early Social Behavior of Twins and Singletons.

Obsidian hydration dating of naturally worked sediments in the Yellowstone region, Montana and Wyoming by Kenneth Donald Adams

Results as of 30 September 2018

Survey of the subjects taught in Lake County high schools with recommendations for curriculum revision

MONTHLY REVIEW. f C r e d i t a n d B u s i n e s s C o n d i t i o n s F E D E R A L R E S E R V E B A N K O F N E W Y O R K MONEY MARKET IN JUNE

NUMERICAL SIMULATION OF MHD-PROBLEMS ON THE BASIS OF VARIATIONAL APPROACH

Comparison of a Population Means

EXST 7015 Fall 2014 Lab 08: Polynomial Regression

A Followup Study of the Socioeconomic Status of Mildly Retarded Individuals in Selected Public School Systems in Louisiana.

Lecture 3. Experiments with a Single Factor: ANOVA Montgomery 3-1 through 3-3

The Measurement of Investment Center Managerial Performance Within Selected Diversified Industrial Firms: an Inquiry.

Lecture 3. Experiments with a Single Factor: ANOVA Montgomery 3.1 through 3.3

Imitative Aggression as a Function of Race of Model, Race of Target and Socioeconomic Status of Observer.


Transcription:

USING ODS W ITH PROC UNIVARIATE Rebecca G. Frederick L ouisiana State U niversity D epartm ent of E xperim ental Statistics South CentralS A S U sers G roup 1 ABSTRACT P ro c U n iv a ria te is u se d b y m a n y sta tistic ia n s to g e t a h a n d le o n th e ty p e o f d a ta th a t is to b e a n a ly z e d. R eg ard le ss o f w h eth er I am co n d u ctin g a R eg ressio n, G e n e ra l L in e a r M o d e l, o r a S u rv iv a l A n a ly s is, I h a v e to investigate the data. O utliers are a natural occurrence in m a n y d a tase ts. A n a ly sis w ith P ro c U n iv a ria te sh o u ld occur first on the data before any other procedure. H ypertext m arkup language or H T M L using the O D S statem ent p erm its th e resu lts to easily sh ared w ith c o lla b o ra to rs. 2 135

INTRO DUCTIO N Typically, a statistician will be handed data of a large number of observations that has been given by a client. The client has assured the statistician that all of the observations have been checked and re-checked for outliers and all problems have been eliminated. But, no data set has been cleaned-up until the data set has been analyzed through SAS Proc Univariate. Further, suppose that the client is located at another location and so, all information will be emailed between the client and the statistician. This is where the SAS ODS statements will come in handy for use. 3 INTRO DUCTIO N(C O N T IN U E ) A quantile is defined as the am ount of area under a density curve or the area to the left o f specified fraction of the total unit area. F or instance, a given value of p is th e pth percentile such that the area to the left of it is p. Q uan tile E stim ate 100% (M ax Q uantile) 94 95% 94 75% (T hird Q uantile) 79 50% (M edian Q uantile ) 77 25% (F irst Q uantile ) 58 0 % (M in Q uantile ) 52 T h e E stim a te s a re fro m th e d a ta se t, Students, and are the T est Scores from a particular section of class. 4 136

ADDITIO NS TO TH E PRO C UNIVARIATE: PROBPLOT, & QQPLOTS If th e d a ta d istrib u tio n m a tc h e s th e th e o retic a l d istrib u tio n, th e p o in ts o n th e p lo t fo rm a lin e a r p a ttern, y = x. T h u s, you can use a Q -Q plot or a probability plot to determ ine h o w w e ll a th e o re tic a l d istrib u tio n m o d e ls a se t o f m easurem ents. For exam ple, G ra p h A : N o L in ea r R e latio n sh ip G ra p h B : L in ea r R e latio n sh ip Y X Y =X y y x x 5 ADDITIONS TO THE PROC UNIVARIATE: PR O BPLO T, & Q Q PLO TS (CONTINUE) T he slope and intercept are visual estim ates of the scale and lo cation param eters of the theo retical distribution. Q -Q plots are m ore convenient than probability plots fo r graphical estim ation o f the location and scale param eters b e c a u se th e a x is o f a Q -Q p lo t is sc a le d lin e a rly. O n the other hand, pro bability plots are m ore convenient for estim ating percentiles or probabilities. 6 137

PR O BPLO T Probplot C reates a probability plot by using highresolution graphs, w hich co m pare ordered v a ria b le v a lu e s w ith p e rc e n tile s o f a sp e c ifie d theo retical d istributio n. S yntax of the statem ent: PR O B PLO T variable(s) / option(s); P ro b p lo t state m e n t o p tio n s c a n req u e st a d istrib u tio n (B eta, E x p o n e n tia l, G a m m a, L o g N o rm a l, N o rm a l, a n d W e ib u ll) and each of the distribution param eters (A lpha, B eta, C, M U, S igm a, S lo pe, T heta, and Z eta) 7 PR O BPLO T (co n tin u e) T he distribution param eters can com pute a m axim um likelihood estim ate by specifying: distribution _param eter= est. P robplot can control the appearance of distribution reference line, general plot layout, enhance the probability plot or com parative plot. IN S E T statem ent P laces a box or table of sum m ary sta tistic s d ire c tly in th e h ig h -re so lu tio n graphics. 8 138

PR O BPLO T (continue) PROGRAM: LIBNAME STUDENT C:\MYSASDIR\SESSION7'; PROC SORT DATA = STUDENT.STUDENTS OUT=SORTED; GOPTIONS HTITLE=2 HTEXT=1 FTEXT=SWISSB FTITLE=SWISSB; SYMBOL VALUE=STAR; PROC UNIVARIATE DATA=SORTED NOPRINT; VAR EXAM; PROBPLOT EXAM /NORMAL(MU=EST SIGMA=EST); INSET MEAN STD / HEADER='Normal Parameters' POSITION=(95,5) REFPOINT=BR; TITLE1 '100 Obs Sampled from a Normal Distribution'; TITLE2 'Normal Probability Plot'; 9 PROBPLOT OUTPUT: (continue) 10 139

QQPLOT Q Q plot C reates a quantile-quantile plot by using highre so lu tio n g ra p h s, w h ic h c o m p a re o rd e re d v a ria b le v a lu e s w ith q u a n tile s o f a sp e c ifie d th e o retic a l d istrib u tio n. Q Q plot statem ent optio ns can request a distribution (B eta, E x p o n e n tia l, G a m m a, L o g N o rm a l, N o rm a l, a n d W e ib u ll) a n d each o f th e d istrib u tio n s p ara m eters(a lp h a, B eta, C, M u, S ig m a, S lope, T heta, and Z eta). T he distribution param eters can com pute a m axim u m likelihood estim ate by specifying: distribution _param eter= est. Q Q plot can control the appearance of distributio n reference line, general plo t layo ut, enhance the probability plot or com parative plo t. 11 QQPLOT (co n tin u e) PR O G R A M : LIBNAME STUDENT C:\MYSASDIR\SESSION7'; PROC SORT DATA = STUDENT.STUDENTS OUT=SORTED; GOPTIONS HTITLE=2 HTEXT=1 FTEXT=SWISSB FTITLE=SWISSB; SYMBOL VALUE=STAR; PROC UNIVARIATE DATA=SORTED NOPRINT; VAR EXAM; QQPLOT EXAM /NORMAL(MU=EST SIGMA=EST); INSET MEAN STD / HEADER='Normal Parameters' POSITION=(95,5) REFPOINT=BR; TITLE1 '100 Obs Sampled from a Normal Distribution'; TITLE2 'Normal Probability Plot'; 12 140

Q Q PLO T O UTPUT (co n tin u e) 13 O UTPUT DELIVERY SYSTEM (O DS) O D S U ser can com bine raw data w ith one or m ore table definitions to produce output to a printer or form atted in H ypertext M arkup L anguage (H T M L ). 14 141

ODS (co n tin u e) O D S breaks dow n the procedures into separate pieces so that the user can print out only sections of the report. O D S, in version 9, can currently supports m any destinations but here are at least four destinatio ns: T he O utput destinatio n produces S A S dataset. T h e L istin g d e stin a tio n p ro d u c e s m o n sp a c e o u tp u t, w h ic h is fo rm a tte d lik e tra d itio n a l S A S p ro c e d u re o u tp u t. T he H T M L destinatio n produces o utput that is fo rm atted in H ypertext M arkup L anguage. T h e P rin te r d e stin a tio n p ro d u c e s o u tp u t th a t is fo rm a tte d fo r high-resolution printers. 15 ODS (co n tin u e) PROGRAM : LIBNAME STUDENT C:\MYSASDIR\SESSION7'; PROC SORT DATA = STUDENT.STUDENTS OUT=SORTED; GOPTIONS HTITLE=2 HTEXT=1 FTEXT=swissb FTITLE=SWISSB; SYMBOL VALUE=STAR; ODS HTML FILE='ODSHTML_BODY.HTM' CONTENTS='ODSHTML_CONTENTS.HTM' PAGE='ODSHTML_PAGE.HTM' FRAME='ODSHTML_FRAME.HTM'; South CentralS A S U sers G roup 16 142

ODS (co n tin u e) PR O G R A M : PROC UNIVARIATE DATA=SORTED NOPRINT; VAR EXAM; QQPLOT EXAM /NORMAL(MU=EST SIGMA=EST); INSET MEAN STD / HEADER='Normal Parameters' POSITION=(95,5) REFPOINT=BR; TITLE1 '100 Obs Sampled from a Normal Distribution'; TITLE2 'Normal Quantile-Quantile Plot'; ODS HTML CLOSE; 17 ODS OUTPUT (co n tin u e) 18 143

CONCLUSION: T h e O u tp u t D e liv e ry S y ste m (O D S ) is a n e x tre m e ly pow erful tool that statisticians can use to com m unicate re su lts o f a n a ly se s w ith c lie n ts. H ig h -re so lu tio n g ra p h ic s o f the probability plots and Q -Q plots display theoretical distributions relative to the actual data. Perform ance of these is displayed through a very sim ple procedure nam ed, SA S Proc U n iv a ria te. 19 REFERENCES: M eeker, W illiam Q. and L uis A. E scobar. (1998), Statistical M ethods For R eliability D ata, N ew Y ork: John W iley & Sons. SA S Institute, Inc (2000). SA S O nlined oc, V ersion 8. C ary, N C : SA S Institute, Inc. SA S Institute, Inc (). SA S H elp and D ocum entation, V ersio n 9. C ary, N C : S A S In stitu te, In c. S outh C entralsas Users G roup 20 144

TRADEM ARKS: S A S is a registered tradem ark or a tradem ark o f S A S Institute Inc. in the U S A and other countries. In d icates U S A reg istratio n. O th er b ran d an d p ro d u ct n am e s are registered tradem arks or tradem arks of their respective com panies. 21 CONTACT INFORM ATION: Y our com m ents and questions are valued and encouraged. C ontact the author at: R ebecca G. F rederick L o uisiana S tate U niversity A gricu ltural C enter D epartm ent of E xperim ental Statistics B aton R ouge, L A 70803-5606 W ork Phone: 225-578-8303 F a x : 2 2 5-5 7 8-8 3 4 4 E m ail: rfred eri@ lsu.ed u http://www.stat.lsu.edu/faculty/frederick South CentralS A S U sers G roup 22 145