Introductory Applied Bio-Statistics

Similar documents
Introductory Statistics

Outline for today s lecture (Ch. 14, Part I)

Sexual Reproduction and Genetics

Section 11 1 The Work of Gregor Mendel

Name Class Date. Pearson Education, Inc., publishing as Pearson Prentice Hall. 33

Biology 211 (1) Exam 4! Chapter 12!

Introduction to Genetics

Introduction to Genetics

Name Date Class CHAPTER 10. Section 1: Meiosis

Introduction to Genetics

Ch 11.Introduction to Genetics.Biology.Landis

Family resemblance can be striking!

Guided Notes Unit 6: Classical Genetics

Lesson 4: Understanding Genetics

I. GREGOR MENDEL - father of heredity

Table of Contents. Chapter Preview. 5.1 Mendel s Work. 5.2 Probability and Heredity. 5.3 The Cell and Inheritance. 5.4 Genes, DNA, and Proteins

Unit 2 Lesson 4 - Heredity. 7 th Grade Cells and Heredity (Mod A) Unit 2 Lesson 4 - Heredity

HEREDITY: Objective: I can describe what heredity is because I can identify traits and characteristics

Introduction to population genetics & evolution

Mendelian Genetics. Introduction to the principles of Mendelian Genetics

What is a sex cell? How are sex cells made? How does meiosis help explain Mendel s results?

Genetics_2011.notebook. May 13, Aim: What is heredity? Homework. Rd pp p.270 # 2,3,4. Feb 8 11:46 PM. Mar 25 1:15 PM.

Meiosis and Mendel. Chapter 6

Chapter 11 INTRODUCTION TO GENETICS

Interactive Biology Multimedia Courseware Mendel's Principles of Heredity. Copyright 1998 CyberEd Inc.

1. What is genetics and who was Gregor Mendel? 2. How are traits passed from one generation to the next?

Ch. 10 Sexual Reproduction and Genetics. p

UNIT 3: GENETICS 1. Inheritance and Reproduction Genetics inheritance Heredity parent to offspring chemical code genes specific order traits allele

UNIT 8 BIOLOGY: Meiosis and Heredity Page 148

Name Class Date. KEY CONCEPT Gametes have half the number of chromosomes that body cells have.

Interest Grabber. Analyzing Inheritance

11.1 Traits. Studying traits

Name Date Class. Meiosis I and Meiosis II

Heredity and Genetics WKSH

Chapter 6 Meiosis and Mendel

Chapter 5. Heredity. Table of Contents. Section 1 Mendel and His Peas. Section 2 Traits and Inheritance. Section 3 Meiosis

REVISION: GENETICS & EVOLUTION 20 MARCH 2013

Question: If mating occurs at random in the population, what will the frequencies of A 1 and A 2 be in the next generation?

Unit 8 Meiosis and Mendel. Genetics and Inheritance Quiz Date: Jan 14 Test Date: Jan. 22/23

2014 Pearson Education, Inc.

Homework Assignment, Evolutionary Systems Biology, Spring Homework Part I: Phylogenetics:

Genetics and Natural Selection

1 Mendel and His Peas

Mechanisms of Evolution

The Work of Gregor Mendel

Neutral Theory of Molecular Evolution

Reinforcement Unit 3 Resource Book. Meiosis and Mendel KEY CONCEPT Gametes have half the number of chromosomes that body cells have.

Keys to Success on the Quarter 3 EXAM

genome a specific characteristic that varies from one individual to another gene the passing of traits from one generation to the next

Darwin, Mendel, and Genetics

Yesterday s Picture UNIT 3D

Labs 7 and 8: Mitosis, Meiosis, Gametes and Genetics

Rebops. Your Rebop Traits Alternative forms. Procedure (work in pairs):

-Genetics- Guided Notes

Dropping Your Genes. A Simulation of Meiosis and Fertilization and An Introduction to Probability

Chapter 2: Extensions to Mendel: Complexities in Relating Genotype to Phenotype.

THE WORK OF GREGOR MENDEL

Guided Reading Chapter 1: The Science of Heredity

Essential Questions. Meiosis. Copyright McGraw-Hill Education

AEC 550 Conservation Genetics Lecture #2 Probability, Random mating, HW Expectations, & Genetic Diversity,

T TT Tt. T TT Tt. T = Tall t = Short. Figure 11 1

Cover Requirements: Name of Unit Colored picture representing something in the unit

Class Copy! Return to teacher at the end of class! Mendel's Genetics

MODULE NO.22: Probability

Natural Selection. Population Dynamics. The Origins of Genetic Variation. The Origins of Genetic Variation. Intergenerational Mutation Rate

Advance Organizer. Topic: Mendelian Genetics and Meiosis

BENCHMARK 1 STUDY GUIDE SPRING 2017

Chapter 4 Lesson 1 Heredity Notes

GENES, ALLELES, AND CHROMOSOMES All living things carry their genetic information in DNA Sections of DNA with instructions for making proteins are

is the scientific study of. Gregor Mendel was an Austrian monk. He is considered the of genetics. Mendel carried out his work with ordinary garden.

Heredity and Evolution

The Genetics of Natural Selection

Population Genetics I. Bio

Jeopardy. Evolution Q $100 Q $100 Q $100 Q $100 Q $100 Q $200 Q $200 Q $200 Q $200 Q $200 Q $300 Q $300 Q $300 Q $300 Q $300

VOCABULARY somatic cell autosome fertilization gamete sex chromosome diploid homologous chromosome sexual reproduction meiosis

NOTES CH 17 Evolution of. Populations

Unit 6 Reading Guide: PART I Biology Part I Due: Monday/Tuesday, February 5 th /6 th

Directed Reading B. Section: Traits and Inheritance A GREAT IDEA

Outline. Probability. Math 143. Department of Mathematics and Statistics Calvin College. Spring 2010

1 Errors in mitosis and meiosis can result in chromosomal abnormalities.

Chapter 10. Prof. Tesler. Math 186 Winter χ 2 tests for goodness of fit and independence

Unit 3 Life: Growth, Development, and Reproduction

Animal Genetics - MENDELU

PRINCIPLES OF MENDELIAN GENETICS APPLICABLE IN FORESTRY. by Erich Steiner 1/

Introduction to Genetics

Results. Experiment 1: Monohybrid Cross for Pea Color. Table 1.1: P 1 Cross Results for Pea Color. Parent Descriptions: 1 st Parent: 2 nd Parent:

Family Trees for all grades. Learning Objectives. Materials, Resources, and Preparation

Unit Test on Cell Biology please read carefully and double-check your work! LO: Describe and explain the Central Dogma. SLE: Meet NGSS.

Note: A file Algebra Unit 09 Practice X Patterns can be useful to prepare students to quickly find sum and product.

1 Mendel and His Peas

Hypothesis tests

Name Period. 2. Name the 3 parts of interphase AND briefly explain what happens in each:

B) Describe the structures and functions of a Paramecium. Draw a Paramecium.

Microevolution (Ch 16) Test Bank

Genetics (patterns of inheritance)

Chapter 1: Mendel s breakthrough: patterns, particles and principles of heredity

Problems for 3505 (2011)

Chromosome duplication and distribution during cell division

Science 7 Acceleration Study Guide

Survey of Physical Anthropology Exam 1

Transcription:

Introductory Applied Bio-Statistics IMPORTANT: you will need a calculator for this section of the course and for exam questions over this section! For the purposes of this section, a population is a group of inter-breeding people with/of equal species that live together. Frequency (f) What is the frequency of a genotype in a population if there are three (3) genotypes, C 1 C 1, C 2 C 2, C 1 C 2 (C 11, C 22, C 12, respectively) and there are C T people in the population (T means total)? 1

Frequency (f) The frequencies of each would be determined as shown right: For the frequency of C 11, it is equal to the number of homozygous genotypes divided by the total number of the people in the population with the C genotype. f f f 11 12 22 # C # C # C # C # C T # C 11 12 T 22 T f f f 11 12 22 1 The sum of the frequencies of these genotypes equals 1, i.e., CLUE: think per cent: if a you have a pie and it is cut into a number of pieces of different sizes, then the sum of the per cent of each piece of the pie is equal to 100%; alternatively, if you only use fractions rather than the per cent values, the sum of the fraction of each piece of pie is equal to unity (1). 2

How does one calculate the frequency of a specific allele? IMPORTANT: when ALL alleles are assumed to be observed in heterozygotes is our ASSUMPTION. Allele = one of two or more alternative forms of a gene that arise by mutation and are found at the same place on a chromosome. f C 1 f C 1 f C 11 + 1/2(f C 12 ) = C 11 /C T + 1/2(C 12 /C T ) = (C 11 + 1/2(C 12 ))/C T f C 2 f C 2 f C 22 + 1/2(f C 12 ) = C 22 /C T + 1/2(C 12 /C T ) = (C 22 + 1/2(C 12 ))/C T f "rule" f C 1 + f C 2 = 1 \ f C 1 = 1 f C 2 3

E.g., In 500 people, there are 268 with C 1 C 1 (Normal), 100 with C 1 C 2 (Carriers) and 132 with C 2 C 2 (Disease). Determine the frequencies of C 11, C 12, C 22, the frequency of C 1 and the frequency of C 2 : #C 11 #C 12 #C 22 268 100 132 f C 11 f C 12 f C 22 268/500 = 0.536 100/500 = 0.2 132/500 = 0.264 (53.6%) (20%) (26.4%) And f C 1 is: (268 + 1/2(100))/500 = 318/500 = 0.636 And f C 2 is: 1 f C 1 = f C 2 = 1-0.636 = 0.364 4

How does one use this information? By definition, if the frequency value is calculated to be greater than or equal to 0.99, then this is a monomorphic gene. If the frequency value is calculated to be less than 0.99, then this is a polymorphic gene. A monomorphic gene is a gene with not more than one allele in high frequency; A polymorphic gene is a gene that has 2 or more alleles in substantial frequency. From our example, above, both C 1 and C 2 are polymorphic (0.636 < 0.99 and 0.364 < 0.99, respectively). Although these alleles are not really frequent, both are polymorphic. 5

Hardy-Weinberg Principle The Hardy-Weinberg Principle shows a relationship between the frequency of the alleles and the frequency of the genotypes. The frequency of the genotypes for a gene with 2 different alleles are a binomial function of the frequency of the alleles. Four assumptions are inherent in this model: 1. The population is "breeding" for 2 alleles, C 1 and C 2 with frequencies of C 1 and C 2 such that their sum is unity. 2. A sperm and an ovum unite at random to produce a zygote. 3. No other processes effect the variation of the genotype. 4. The frequencies of all C's are identical in sperm and the ovum. 6

Let's use our above example in understanding the Hardy-Weinberg Principle. We can examine this graphically with a sort of modification/expansion of the Punnett square and Mendel's work: If we make the long lengths of a square and rectangle equal to the f C 1 and the short sides of a rectangle equal to the f C 2, then we may construct the diagram: Remember that the area of a square is the product of two of its sides. Since a square has equal sides, the Area is equal to one side squared. The area of a rectangle is the product of its long side and its short side. In the case of the graphic, the area of a square is equal to (f C 1 ) 2, (f C 2 ) 2 and the area of a rectangle is f C 1 f C 2. 7

If we plug in the numbers from our original example, we'll find that (fc 1 ) 2 equals 0.4045 (40.45% C 1 C 1 ) and (fc 2 ) 2 equals 0.132 (13.2% C 2 C 2 ) and 2(fC 1 fc 2 ) equals 0.464 (46.4% C 1 C 2 ). If we add up all the areas of the squares and rectangles, we obtain the following equation for the complete square: (fc 1 ) 2 + 2(fC 1 fc 2 ) + (fc 2 ) 2 = 1 Which ALSO equals: (fc 1 + fc 2 ) 2 8

ASIDE MATH Primer Reminder: Equations of the form (a + b) 2 are called BINOMIAL (has exactly 2 terms); using the FOIL method, (a + b) 2 expands to that found below: We then obtain: a 2 + ab + ba + b 2 which is equal to: a 2 + 2ab + b 2 which LOOKS just like: F means to multiply the First terms together in the two parenthetic phrases; O means to multiply the Outer terms; I means to multiply the inner terms together; L means to multiply the last two terms together. (fc 1 ) 2 + 2(fC 1 fc 2 ) + (fc 2 ) 2 Hence, the proportion of progeny are a function of binomial arithmetic END of ASIDE 9

This binomial relationship allows us to examine the relationships between the three alleles we've been working with: 1. The maximum frequency of C 11 occurs when the frequency of C 1 = 1 and the frequency of C 2 = 0. 2. The maximum frequency of C 22 occurs when the frequency of C 2 =1 and the frequency of C 1 = 0. 3. The maximum frequency of C 12 occurs when the frequency of C 1 is identical to the frequency of C 2 ; this value is 0.5, and marked with a star. The bottom line, here, is that as C 1 C 1 decreases, both C 1 C 2 and C 2 C 2 increase; conversely, as C 2 C 2 decreases, C 1 C 1 and C 1 C 2 increase. Not withstanding the numerical evaluation, the critical concept of the Hardy-Weinberg Principle is that genotypic frequencies stay in the EXACT proportions of binomial arithmetic, ad infinitum! 10

As an extension of the Hardy-Weinberg Principle, one may estimate the proportions of a carrier in a given, defined population, e.g.: (fc 2 ) 2 = #C 22 /#C T and (fc 2 ) 2 = (#C 22 /#C T ) = estimation of the frequency of C 2 ASSUMPTION: all alleles are NOT observed in the heterozygote, i.e., one dominant trait coded for and one recessive trait coded for: this is more REALISTIC! 11

The Estimation of a Carrier in A Population (E P ) This value for a given, defined population is: E PC2 = 2(#C 22 /#C T ) * [1 - (#C 22 /#C T )] OR E PC2 = 2 (fc 2 ) (1 - fc 2 ) 12

Example Clubfoot appears in one of every thousand births. Determine what per cent of the population are carriers for this allele. From the previous equations: f clubfoot = (1/1000) = 0.0316 and 1 - f clubfoot = 1-0.0316 = 0.9684 and E pclubfoot = (2)(0.0316)(0.9684) = 0.0612 % = 0.0612 * 100 = 6.12% of the population (1 in 16 people) are carriers of the clubfoot allele. Surprising?! 13

The Chi Squared ( 2 ) Test In some cases, it is necessary to use statistics to determine if the observed frequency of genotypes is close enough to Hardy-Weinberg expectations. The "big" test is the Chi squared ( 2 ) test. This statistical manipulation is equal to the following generic equation: 2 = [(Observed #'s - Expected #'s) 2 /Expected #'s] NOTE: NUMBERS NOT frequencies!!!!!! 2 is dependent on the degrees of freedom (dof); GENERALLY this equal to the number of characteristics less one. 14

One may use 2 2 ways: Confidence "Goodness of Fit" Confidence Level Error "Sloppy Sampling" p Level dof 99% 95% dof 0.05 + 0.01 * 1 6.63 3.84 1 0.0039 0.0002 2 9.21 5.99 2 0.103 0.0201 3 11.3 7.81 3 0.352 0.115 4 13.3 9.49 4 0.711 0.297 8 20.1 15.5 8 2.73 1.65 10 23.2 18.3 10 3.94 2.56 For confidence of "fit of observed data" to that expected + 5%, * 1% error; for error in data sampling -- examples coming up shortly 15

APPLICATION for both uses: 2 calculated < 2 table means: Confidence: GOOD Error: < 5% + or 1%* If p (probability) > 0.05, you probably do not have a correct hypothesis; If p < 0.05, this suggests < 5% + error in your hypothesis; If p < 0.01 suggests that there is less than 1%* error in your hypothesis. 16

Example Peas in Mendel's experiments exhibited 2 of 4 characteristics (NOTE: dof = 4-1 = 3): Round Yellow Wrinkled Green R Y W G Mendel s theory of heredity predicts peas in the following proportions: 9 RY: 3 RG: 3 WY: 1 WG In practice, he obtained as follows: 315 RY: 108 RG: 101 WY: 32 WG 1. Does the data (observed) fit with the expected data? A) @ 95% and B) @ 99% confidence? 2. Are the results subject to > 5% error? 3. Are the results subject to > 1% error? 17

FIRST: sum the number of peas: 315 + 108 + +101 + 32 = 556 SECOND: What is EXPECTED? The sum of 9 + 3 + 3 + 1 = 16 \ 9/16 are RY, 3/16 are RG, 3/16 are WY and 1/16 are WG We'd expect the following, then: 9/16 * 556 = 313 RY 3/16 * 556 = 104 RG 3/16 * 556 = 104 WY 1/16 * 556 = 35 WG 18

THIRD: calculate 2 : 2 = [(315-313) 2 /313] + [(108-104) 2 /104] + [(101-104) 2 /104] + [(32-35) 2 /35] = 4/313 + 16/104 + 9/104 + 9/35 = 0.510 FOURTH: calculate dof 4 characteristics, so 4-1 = 3 dof. 19

FIFTH: compare to table value: Step 1 Confidence level Calculated c 2 Table c 2 FIT A 95% 0.51 vs. 7.81 Good B 95% 0.51 Vs 11.3 Good 3 dof FIFTH: compare to table value: Step 2 Calculated c 2 Table c 2 < 5% error 0.51 Vs. 0.352 YES FIFTH: compare to table value: Step 3 Calculated c 2 Table c 2 < 1% error 0.51 Vs. 0.352 YES: Fit and error are good. 20

Student's Two-Tailed T-test for Significance Sometimes, though, you just want to compare values between two groups to determine if there is a significant difference. Student's Two-Tailed T-test for Significance does just that. The necessities for this test are listed and defined, below: Group 1 Data Group 2 Date Mean = the average value in each group Mean 1 Mean 2 s 1 s 2 N 1 N 1 s = the standard deviation in each group and equals a measure of the difference between a value and the average value "corrected" for the number of samples; determined on a calculator on the "sigma" () key. N = the number of samples 21

How to calculate p value: Step 1 : ( s1 N ) 1 2 ( s 2 N ) 2 2 Q Z Values p = 0.1 0.05 0.01 0.005 0.002 Step 2 : ( mean 1 Q mean 2 ) Z ( ) Z for 2-tail test ± 1.645 ± 1.96 ± 2.58 ± 2.81 ± 3.08 At infinite dof Step 3 : Look up Z in table, below 22

EXAMPLE In an experiment designed to study the effects of a DNA binding anti-cancer drug, ACS-19685, the drug was used to treat cancerous cells in culture. All cultures began with 50 cells per culture; 10 cultures were treated with ACS-19685 and ten were treated with the inert carrier solvent. The following data was obtained: Culture # ACS-19685 treated: cells remaining Control: cells remaining 1 4 45 2 5 40 3 10 42 4 7 47 5 12 40 6 8 39 7 2 41 8 20 43 9 1 44 10 6 48 Are these results statistically significantly different by the 2-tailed t-test? 23

SOLUTION: ACS-19685 treated Control AVG 7.5 42.9 s 5.26 2.91 N 10 10 Q Z (5.26 ) 10 \ 2 p (7.5 42.9) 4.303 (2.91) 10 0.002 2 8.227 4.303 This means that about 99.8% of the time, there was no error, therefore, % error possible is less than 0.2% and the difference is real between the two groups. YES, they are 24