Multidimensional heritability analysis of neuroanatomical shape. Jingwei Li
|
|
- Belinda Clark
- 5 years ago
- Views:
Transcription
1 Multidimensional heritability analysis of neuroanatomical shape Jingwei Li
2 Brain Imaging Genetics Genetic Variation Behavior Cognition Neuroanatomy
3 Brain Imaging Genetics Genetic Variation Neuroanatomy
4 Descriptors of Brain Structures One-dimensional descriptors (Hibar015; Stein01; Sabuncu01) Volume Surface area Drawbacks Limited when capturing the anatomical variation Same area
5 Descriptors of Brain Structures Multi-dimensional shape descriptor: truncated Laplace-Beltrami Spectrum (LBS) ψ: R n R n+k is the local parametrization of a submonifold M of R n+k g ij =< i ψ, j ψ >, G = g ij, W = det G, g ij = G 1 i, j n n If f and φ are real-valued functions defined on M, then f, φ = i,j g i,j i f j φ, Δf = 1 W i,j i g ij W j f where f, φ < grad f, grad φ > and Δf div grad f. Nabla operator Laplace-Beltrami operator Solve Laplacian eigenvalue problem: Δf = λf eigenfunction eigenvalue
6 Descriptors of Brain Structures Multi-dimensional shape descriptor: truncated Laplace-Beltrami Spectrum (LBS) Translate Laplacian eigenvalue problem: Δf = λf to a variational problem: φδf dσ = f, φ dσ Green formula Since f, φ = i,j g i,j i f j φ and φδf dσ = φ λf dσ = λ φfdσ i,j g i,j i f j φ dσ = λ φfdσ variational problem
7 Descriptors of Brain Structures Multi-dimensional shape descriptor: truncated Laplace-Beltrami Spectrum (LBS) Discretization of i,j g i,j i f j φ dσ = λ φfdσ: Choose n linearly independent form functions: φ 1 x, φ x,, φ n x as basis functions (e.g. x, x, x 3, ) defined on the parameter space. Any eigenfunction f can be approximately projected to the basis functions: f x F x = U 1 φ 1 x + + U n φ n x To solve U, substitute f and φ into the variational problem. Define A = a lm n n = j,k j F l k F m g jk dσ n n and B = b lm n n = F l F m dσ n n => AU = λbu General eigenvalue problem
8 Descriptors of Brain Structures Multi-dimensional shape descriptor: truncated Laplace-Beltrami Spectrum (LBS) Solve a Laplacian eigenvalue problem defined based on the brain region Obtain the first M eigenvalues Properties (Reuter 006): Isometric invariant For planar shapes and 3D-solids: isometry congruency (identical after rigid body transformation) For surface: isometry congruency
9 Descriptors of Brain Structures Multi-dimensional shape descriptor: truncated Laplace-Beltrami Spectrum (LBS) Solve a Laplacian eigenvalue problem defined based on the brain region Obtain the first M eigenvalues Properties (Reuter 006): Isometric invariant scaling a n-dimensional manifold by the factor a results in scaled eigenvalues by the factor 1 a In this paper, eigenvalues are scaled: λ i,m = λ i,m V i /3 i: subject; m: dimension
10 Heritability A phenotype/trait can be influenced by genetic and environmental effects. Heritability: how much of the variation in a phenotype/trait is due to variation in genetic factors.
11 Main Idea of This Paper Truncated LBS is more representative for a shape compared to volume. Use truncated LBS as descriptors for 1 brain regions to compute heritability. Compare that with volumebased heritability. To adapt truncated LBS into GCTA (Genome-wide Complex Trait Analysis) (Yang 011) heritability model, propose a multi-dimensional heritability model.
12 GCTA heritability model N 1 trait vector (N: #subjects) y = g + c + e g~n 0, σ A K c~n 0, σ C Λ e~n 0, σ E I Additive genetic component Common environmental component Unique environmental component
13 GCTA heritability model y = g + c + e g~n 0, σ A K c~n 0, σ C Λ e~n 0, σ E I K: genetic similarity matrix Familial study: K = Kinship Coefficients. E.g. parent-offspring (0.5), identical twins (1), full siblings (0.5), half siblings (0.5) Unrelated subjects study: genome-side single-nucleotide polymorphism (SNP) data
14 GCTA heritability model y = g + c + e g~n 0, σ A K c~n 0, σ C Λ e~n 0, σ E I What is Single-Nucleotide Polymorphism (SNP): Each locus on a DNA sequence is a single nucleotide adenine (A), thymine (T), cytosine (C), or guanine (G). SNP: a DNA sequence variation occurring when the types of single nucleotide in the genome (or other shared sequence) differs between individuals or paired chromosomes in one subject. E.g., AAGCCTA and AAGCTTA. SNP can leads to alleles (variants of a given gene). Each SNP can have 3 genotypes: AA, Aa, aa (denoted as 0-)
15 GCTA heritability model y = g + c + e g~n 0, σ A K c~n 0, σ C Λ e~n 0, σ E I How to compute genetic similarity from SNP: X(#subjects x #SNPs). Standardize each column of X (mean 0, variance 1) K = XXT #SNPs
16 GCTA heritability model y = g + c + e g~n 0, σ A K c~n 0, σ C Λ e~n 0, σ E I Λ: shared environment matrix between the subjects Familial study: e.g., twins & non-twin siblings (1) Unrelated subjects study: Λ vanishes
17 GCTA heritability model y = g + c + e g~n 0, σ A K c~n 0, σ C Λ e~n 0, σ E I Identical matrix
18 GCTA heritability model y = g + c + e g~n 0, σ A K c~n 0, σ C Λ e~n 0, σ E I heritability h = σ A σ A + σ C + σ E h : the variance in the trait explained by the variance in additive genetic component
19 Multi-dimensional traits heritability model N M trait matrix (N: #subjects) (M: #dimensions) Y = G + C + E vec G ~N 0, Σ A K, vec C ~N 0, Σ C Λ, vec E ~N 0, Σ E I Σ A = σ A rs M M : σ Ars is the genetic covariance between r-th and s-th dimensions in traits Σ C = σ C rs M M : σ Crs is the common environmental covariance between r-th and s-th dimensions in traits Σ E = σ E rs M M : σ Ers is the unique environmental covariance between r-th and s-th dimensions in traits
20 Multi-dimensional traits heritability model Y = G + C + E vec G ~N 0, Σ A K, vec C ~N 0, Σ C Λ, vec E ~N 0, Σ E I : Kronecker product Σ A rs K = σ A 11 K σ A1 K σ A1M K σ A 1 K σ A K σ A M K σ A M1 K σ AM K σ AMM K
21 Multi-dimensional traits heritability model Y = G + C + E vec G ~N 0, Σ A K, vec C ~N 0, Σ C Λ, vec E ~N 0, Σ E I vec a 1, a,, a k = a 1 a a k
22 Multi-dimensional traits heritability model Y = G + C + E vec G ~N 0, Σ A K, vec C ~N 0, Σ C Λ, vec E ~N 0, Σ E I heritability h = tr Σ A tr Σ A + tr Σ C + tr Σ E = M m=1 γ m h m where γ m = σ Amm + σ Cmm + σ Emm h m = M p=1 σ A pp + σ Cpp + σ Epp σ A mm σ A mm + σ Cmm + σ Emm The multi-dimensional trait heritability is a weighted average of the heritability of each dimension.
23 Multi-dimensional traits heritability model Properties Invariant to rotations of data Y = G + C + E (1) YT = GT + CT + ET () T T T = TT T = I h T = h heritability from model () heritability from model (1)
24 Consider covariates Sometimes, we want to study the effects after controlling some nuisance variables by regressing them out. E.g., age, gender, handness
25 Covariates (N q) Consider covariates Y = XB + G + C + E vec G ~N 0, Σ A K, vec C ~N 0, Σ C Λ, vec E ~N 0, Σ E I U: N N q Y = U T Y = U T G + U T C + U T E = G + C + E vec G ~N 0, Σ A U T KU, vec C ~N 0, Σ C U T ΛU, vec E ~N 0, Σ E I U T X = 0 U T U = I UU T = I X X T X 1 X T
26 Datasets: Analysis Genomics Superstruct Project (GSP; N = 130) unrelated subjects Human Connectome Project (HCP; N = 590) 7 monozygotic twin pairs 69 dizygotic twin pairs 53 full siblings of twins 55 singletons 1 brain structures Traits Volume Truncated LBS
27 Volume heritability (GSP data) Before multiple comparisons correction: 3/1 brain structures are significant After multiple comparisons correction: none is significant Most structures: parametric & nonparametric p values are similar => standard errors estimates are accurate
28 Volume heritability (GSP data) Test-retest reliability: Lin s concordance correlation coefficient correlation coefficient ρ c = ρσ x σ y σ x + σ y + μ x μ y variance mean x, y: use repeated runs on separate days of the same set of subjects
29 Truncated LBS heritability (GSP data) Before multiple comparisons correction: 7/1 brain structures are significant After multiple comparisons correction: 5/1 brain structures are significant Most structures: parametric & nonparametric p values are similar => standard errors estimates are accurate Smaller standard error than volume-based heritability
30 Truncated LBS heritability (GSP data) Test-retest reliability: Averaged Lin s concordance correlation coefficient across M dimensions correlation coefficient ρ c = ρσ x σ y σ x + σ y + μ x μ y variance mean x, y: use repeated runs on separate days of the same set of subjects
31 Truncated LBS heritability (GSP data)
32 Truncated LBS heritability (HCP data) Structure h Standard Error Accumbens area Caudate Cerebellum Corpus Callosum Hippocampus Third Ventricle Putamen Only significant brain structures results are shown Consistently higher than GSP dataset Possible reason: in unrelated subjects only the variation of some common SNPs are captured.
33 Visualizing principal mode of shape variation PCA is a kind of rotation of data. The first PC of LBS explains a large percentage of shape variation. Heritability model: (1) invariant to rotation; () heritability of multi-dimensional trait = weighted average of each dimension s heritability The heritability of truncated LBS is the weighted average of the first M PCs heritability.
34 Visualizing principal mode of shape variation Procedures (for one brain structure) 1. Register each subject s mask (1 in structure, 0 out of structure) to a common used template.. Create a population average of structure surface for plotting A weighted average of all subjects registered mask image Weight: Gaussian kernel center: average of first PC distance: subject-specific corresponding first PC <-> center Width: resulting 500 shapes have non-0 weights The isosurface with 0.5 in the averaged map 3. Use the same Gaussian kernel, generate averaged maps by including the shapes around + standard deviation of the first PC (- s.d. as well) 4. Plot the difference between the two maps in step 3 on the surface generated in step.
35 Visualizing principal mode of shape variation Red: shapes around + s.d. are larger than - s.d. Blue: shapes around - s.d. are larger than + s.d.
36 Strengths Use truncated LBS instead of volume as features Capture more shape variation Isometry invariance Does not require any registration or mapping (Reuter 006 & 009) Generalize the concept of heritability into multidimensional phenotypes Other applications (multi-tests of one behavior; disease study)
37 Strengths Variability of heritability estimation Multi-dimensional trait heritability model < original GCTA model (unrelated subject dataset) Heritability estimates are more accurate, more significant Propose a visualization method for shape variation Interpretation: shape variation along the first PC axis of the shape descriptor
38 Weakness Optimal number of eigenvalue may not be 50 Only 30, 50, 70 are tested Error bars for difference number of eigenvalues are not shown Other number except 50 (used in paper) could lead to higher heritability and smaller error bars
39 Weakness Optimal number of eigenvalue can be different for different brain structures Amygdala: heritability is similar for 30, 50, 70 eigenvalues (even decrease) 3 rd -ventricle: heritability increases from 0.4 to 0.6
40 Weakness Links between proposed visualization method and LBS heritability are not clear. Only volume-based GCTA heritability is compared to the new method and new model. More comparisons with the literature (e.g., Gilmore 010; Baare 001)
41 Backup: invariant to rotations of data cov vec GT = cov T T I vec G = T T I vec G T I = T T I Σ A K T I = T T Σ A T K Theorem: vec AXB = B T A vec X Here A = I, X = G, B = T A B T = A T B T cov AX = Acov X A T A B C D = AC BD Similarly, cov vec CT = T T Σ C T Λ, cov vec ET = T T Σ E T I h T = tr T T Σ A T tr T T Σ A T + tr T T Σ C T + tr T T Σ E T tr ABC = tr BCA = tr CAB Associative property of matrix multiplication = tr Σ A TT T tr Σ A TT T + tr Σ C TT T + tr Σ E (TT T ) = tr Σ A tr Σ A + tr Σ C + tr Σ E = h
42 Backup: multi-dimensional trait heritability is a weighted average of heritability of each dimension h = = tr Σ A tr Σ A + Σ C + Σ E p=1 M m=1 M M σ A pp + p=1 σ A mm σ C pp + p=1 M σ E pp = M m=1 σ A mm + σ Cmm + σ Emm M p=1 σ A pp + σ Cpp + σ Epp σ A mm σ A mm + σ Cmm + σ Emm = M m=1 γ m h m
43 Backup: moment-matching estimator for unrelated subjects (no shared environmental component) cov y r, y s = σ A rs K + σ Ers I y ry s T = σ A rs K + σ Ers I To estimate σ A rs, σ Ers, use a regression model: vec y r y T s = σ A rs vec K + σ Ersvec I y s y r = σ A rs vec K + σ Ersvec I vec K T y s y r vec I T y s y r = σ A rs vec K T vec K + σ E rs vec K T vec I = σ A rs vec I T vec K + σ E rs vec I T vec I y s y r T vec K = σ A rs vec K T vec K + σ E rs vec I T vec K y s y r T vec I = σ A rs vec K T vec I + σ E rs vec I T vec I y s T y r T vec K = σ A rs vec K T vec K + σ E rs vec I T vec K y s T y r T vec I = σ A rs vec K T vec I + σ E rs vec I T vec I y r T Ky s = σ A rstr K + y r T y s = σ A rs tr K + σ Ers σ E rstr K tr I
44 Backup: moment-matching estimator for unrelated subjects (no shared environmental component) σ Ars σ = tr K tr K E rs tr K tr I 1 yr T Ky s y r T y s σ A rs = y r T NK tr K I y s Ntr K tr K y r T K τi y s ν K σ E rs = y r T tr K I tr K K y s Ntr K tr [K] = y r T κi τk y s ν K where τ = tr K N, κ = tr K N, ν K = tr K Σ A = YT K τi Y ν K, Σ E = YT κi τk Y ν K tr K N = N κ τ
45 Backup: sampling variance of the point estimator Q A K τi ν K, Q E κi τk ν K t A tr Σ A = tr Y T Q A Y, t E = tr Σ E = tr Y T Q E Y, t = t A te The heritability is a function of t: f t = var h SNP = var f t f t t cov t t A t A +t E f t t T where f t t = f t t Define V rs = cov y r, y s, f t t = t E t A +t E, = σ A rs K + σ Ers I t A t A +t E
46 Backup: sampling variance of the point estimator = = cov tr Y T Q α Y, tr Y T Q β Y M r,s=1 M r,s=1 cov t = cov y r T Q α y r, y s T Q β y s tr Q α V rs Q β V rs M tr Q A V rs Q A V rs tr Q A V rs Q E V rs r,s=1 tr Q E V rs Q A V rs tr Q E V rs Q E V rs M r,s=1 σ A rs + σ Ers = tr Σ A + Σ E 1 τ ν K τ κ tr Σ A + Σ E 1 1 ν K 1 1 Quadratic form of statistics: cov ε T Λ 1 ε, ε T Λ ε = tr Λ 1 ΣΛ Σ + 4μ T Λ 1 ΣΛ μ Here μ = 0 tr Q A tr Q A Q E tr Q E Q A tr Q E K I τ 1, κ 1 V rs = σ A rs K + σ Ers I σ A rs I + σ Ers I
47 Backup: sampling variance of the point estimator tr Q A = tr K τi ν K = tr K tr K tr K N I tr K N = tr K tr K N tr K KI + tr K tr K N N I tr Q A Q E = tr K tr K N = tr = tr K tr K N K τi tr K N + tr K N = 1 κi τk ν K ν K tr K tr K N tr K tr K N ν K = tr κki τk τki + τ IK ν K + tr3 K N = tr K N tr K tr K tr K + tr K N tr K tr = τ K N ν K
48 Backup: sampling variance of the point estimator tr Q E = = = tr κi τk κ tr K N κ tr K = tr κ I κτk + τ K ν K ν K N tr K N tr K N + tr K ν K ν K tr K tr K N var h SNP = var f t f t t N + tr K N cov t f t N tr K = κ ν K t T tr Σ A + Σ E 1 1 t ν K t A + t 4 E, t A E 1 1 tr K = tr Σ A + Σ E ν K tr Σ A + tr Σ = tr Σ A + Σ E E ν K tr Σ A + Σ = E ν K t E t = tr Σ A + Σ E t A ν K t A + t 4 A + t E E tr Σ P tr Σ P
49 Backup: sampling variance of the point estimator For univariate trait, tr Σ P For multi-dimensional trait, = tr Σ P, var h SNP = ν K tr Σ P tr Σ P = i=1 M λ i i=1 M λ 1 var h SNP i ν K
... x. Variance NORMAL DISTRIBUTIONS OF PHENOTYPES. Mice. Fruit Flies CHARACTERIZING A NORMAL DISTRIBUTION MEAN VARIANCE
NORMAL DISTRIBUTIONS OF PHENOTYPES Mice Fruit Flies In:Introduction to Quantitative Genetics Falconer & Mackay 1996 CHARACTERIZING A NORMAL DISTRIBUTION MEAN VARIANCE Mean and variance are two quantities
More informationMethods for Cryptic Structure. Methods for Cryptic Structure
Case-Control Association Testing Review Consider testing for association between a disease and a genetic marker Idea is to look for an association by comparing allele/genotype frequencies between the cases
More informationVariance Component Models for Quantitative Traits. Biostatistics 666
Variance Component Models for Quantitative Traits Biostatistics 666 Today Analysis of quantitative traits Modeling covariance for pairs of individuals estimating heritability Extending the model beyond
More informationLesson 4: Understanding Genetics
Lesson 4: Understanding Genetics 1 Terms Alleles Chromosome Co dominance Crossover Deoxyribonucleic acid DNA Dominant Genetic code Genome Genotype Heredity Heritability Heritability estimate Heterozygous
More information(Genome-wide) association analysis
(Genome-wide) association analysis 1 Key concepts Mapping QTL by association relies on linkage disequilibrium in the population; LD can be caused by close linkage between a QTL and marker (= good) or by
More informationQuantitative characters II: heritability
Quantitative characters II: heritability The variance of a trait (x) is the average squared deviation of x from its mean: V P = (1/n)Σ(x-m x ) 2 This total phenotypic variance can be partitioned into components:
More informationResearch Statement on Statistics Jun Zhang
Research Statement on Statistics Jun Zhang (junzhang@galton.uchicago.edu) My interest on statistics generally includes machine learning and statistical genetics. My recent work focus on detection and interpretation
More informationShort Answers Worksheet Grade 6
Short Answers Worksheet Grade 6 Short Answer 1. What is the role of the nucleolus? 2. What are the two different kinds of endoplasmic reticulum? 3. Name three cell parts that help defend the cell against
More informationDNA polymorphisms such as SNP and familial effects (additive genetic, common environment) to
1 1 1 1 1 1 1 1 0 SUPPLEMENTARY MATERIALS, B. BIVARIATE PEDIGREE-BASED ASSOCIATION ANALYSIS Introduction We propose here a statistical method of bivariate genetic analysis, designed to evaluate contribution
More informationLecture WS Evolutionary Genetics Part I 1
Quantitative genetics Quantitative genetics is the study of the inheritance of quantitative/continuous phenotypic traits, like human height and body size, grain colour in winter wheat or beak depth in
More informationCase-Control Association Testing. Case-Control Association Testing
Introduction Association mapping is now routinely being used to identify loci that are involved with complex traits. Technological advances have made it feasible to perform case-control association studies
More informationAssociation Testing with Quantitative Traits: Common and Rare Variants. Summer Institute in Statistical Genetics 2014 Module 10 Lecture 5
Association Testing with Quantitative Traits: Common and Rare Variants Timothy Thornton and Katie Kerr Summer Institute in Statistical Genetics 2014 Module 10 Lecture 5 1 / 41 Introduction to Quantitative
More informationG E INTERACTION USING JMP: AN OVERVIEW
G E INTERACTION USING JMP: AN OVERVIEW Sukanta Dash I.A.S.R.I., Library Avenue, New Delhi-110012 sukanta@iasri.res.in 1. Introduction Genotype Environment interaction (G E) is a common phenomenon in agricultural
More informationLinear Regression (1/1/17)
STA613/CBB540: Statistical methods in computational biology Linear Regression (1/1/17) Lecturer: Barbara Engelhardt Scribe: Ethan Hada 1. Linear regression 1.1. Linear regression basics. Linear regression
More informationProportional Variance Explained by QLT and Statistical Power. Proportional Variance Explained by QTL and Statistical Power
Proportional Variance Explained by QTL and Statistical Power Partitioning the Genetic Variance We previously focused on obtaining variance components of a quantitative trait to determine the proportion
More informationHeritability estimation in modern genetics and connections to some new results for quadratic forms in statistics
Heritability estimation in modern genetics and connections to some new results for quadratic forms in statistics Lee H. Dicker Rutgers University and Amazon, NYC Based on joint work with Ruijun Ma (Rutgers),
More informationGenotype Imputation. Biostatistics 666
Genotype Imputation Biostatistics 666 Previously Hidden Markov Models for Relative Pairs Linkage analysis using affected sibling pairs Estimation of pairwise relationships Identity-by-Descent Relatives
More informationPCA and admixture models
PCA and admixture models CM226: Machine Learning for Bioinformatics. Fall 2016 Sriram Sankararaman Acknowledgments: Fei Sha, Ameet Talwalkar, Alkes Price PCA and admixture models 1 / 57 Announcements HW1
More informationAffected Sibling Pairs. Biostatistics 666
Affected Sibling airs Biostatistics 666 Today Discussion of linkage analysis using affected sibling pairs Our exploration will include several components we have seen before: A simple disease model IBD
More information1 Springer. Nan M. Laird Christoph Lange. The Fundamentals of Modern Statistical Genetics
1 Springer Nan M. Laird Christoph Lange The Fundamentals of Modern Statistical Genetics 1 Introduction to Statistical Genetics and Background in Molecular Genetics 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
More informationGENETICS - CLUTCH CH.1 INTRODUCTION TO GENETICS.
!! www.clutchprep.com CONCEPT: HISTORY OF GENETICS The earliest use of genetics was through of plants and animals (8000-1000 B.C.) Selective breeding (artificial selection) is the process of breeding organisms
More informationThe concept of breeding value. Gene251/351 Lecture 5
The concept of breeding value Gene251/351 Lecture 5 Key terms Estimated breeding value (EB) Heritability Contemporary groups Reading: No prescribed reading from Simm s book. Revision: Quantitative traits
More informationSTATISTICAL SHAPE MODELS (SSM)
STATISTICAL SHAPE MODELS (SSM) Medical Image Analysis Serena Bonaretti serena.bonaretti@istb.unibe.ch ISTB - Institute for Surgical Technology and Biomechanics University of Bern Overview > Introduction
More informationResemblance between relatives
Resemblance between relatives 1 Key concepts Model phenotypes by fixed effects and random effects including genetic value (additive, dominance, epistatic) Model covariance of genetic effects by relationship
More informationHERITABILITY ESTIMATION USING A REGULARIZED REGRESSION APPROACH (HERRA)
BIRS 016 1 HERITABILITY ESTIMATION USING A REGULARIZED REGRESSION APPROACH (HERRA) Malka Gorfine, Tel Aviv University, Israel Joint work with Li Hsu, FHCRC, Seattle, USA BIRS 016 The concept of heritability
More informationLecture 2: Genetic Association Testing with Quantitative Traits. Summer Institute in Statistical Genetics 2017
Lecture 2: Genetic Association Testing with Quantitative Traits Instructors: Timothy Thornton and Michael Wu Summer Institute in Statistical Genetics 2017 1 / 29 Introduction to Quantitative Trait Mapping
More informationAssociation studies and regression
Association studies and regression CM226: Machine Learning for Bioinformatics. Fall 2016 Sriram Sankararaman Acknowledgments: Fei Sha, Ameet Talwalkar Association studies and regression 1 / 104 Administration
More informationPower and sample size calculations for designing rare variant sequencing association studies.
Power and sample size calculations for designing rare variant sequencing association studies. Seunggeun Lee 1, Michael C. Wu 2, Tianxi Cai 1, Yun Li 2,3, Michael Boehnke 4 and Xihong Lin 1 1 Department
More informationObjective 3.01 (DNA, RNA and Protein Synthesis)
Objective 3.01 (DNA, RNA and Protein Synthesis) DNA Structure o Discovered by Watson and Crick o Double-stranded o Shape is a double helix (twisted ladder) o Made of chains of nucleotides: o Has four types
More informationMODEL-FREE LINKAGE AND ASSOCIATION MAPPING OF COMPLEX TRAITS USING QUANTITATIVE ENDOPHENOTYPES
MODEL-FREE LINKAGE AND ASSOCIATION MAPPING OF COMPLEX TRAITS USING QUANTITATIVE ENDOPHENOTYPES Saurabh Ghosh Human Genetics Unit Indian Statistical Institute, Kolkata Most common diseases are caused by
More informationThe Mystery of Missing Heritability: Genetic interactions create phantom heritability - Supplementary Information
The Mystery of Missing Heritability: Genetic interactions create phantom heritability - Supplementary Information 1 Contents List of Figures 4 List of Tables 5 List of Symbols 6 1 Calculating the top-down
More informationLecture 9: Kernel (Variance Component) Tests and Omnibus Tests for Rare Variants. Summer Institute in Statistical Genetics 2017
Lecture 9: Kernel (Variance Component) Tests and Omnibus Tests for Rare Variants Timothy Thornton and Michael Wu Summer Institute in Statistical Genetics 2017 1 / 46 Lecture Overview 1. Variance Component
More informationIntroduction to Machine Learning. Recitation 11
Introduction to Machine Learning Lecturer: Regev Schweiger Recitation Fall Seester Scribe: Regev Schweiger. Kernel Ridge Regression We now take on the task of kernel-izing ridge regression. Let x,...,
More informationVariance Components: Phenotypic, Environmental and Genetic
Variance Components: Phenotypic, Environmental and Genetic You should keep in mind that the Simplified Model for Polygenic Traits presented above is very simplified. In many cases, polygenic or quantitative
More informationUniversität Potsdam Institut für Informatik Lehrstuhl Maschinelles Lernen PCA. Tobias Scheffer
Universität Potsdam Institut für Informatik Lehrstuhl Maschinelles Lernen PCA Tobias Scheffer Overview Principal Component Analysis (PCA) Kernel-PCA Fisher Linear Discriminant Analysis t-sne 2 PCA: Motivation
More informationGaussian Processes. Le Song. Machine Learning II: Advanced Topics CSE 8803ML, Spring 2012
Gaussian Processes Le Song Machine Learning II: Advanced Topics CSE 8803ML, Spring 01 Pictorial view of embedding distribution Transform the entire distribution to expected features Feature space Feature
More informationRégression en grande dimension et épistasie par blocs pour les études d association
Régression en grande dimension et épistasie par blocs pour les études d association V. Stanislas, C. Dalmasso, C. Ambroise Laboratoire de Mathématiques et Modélisation d Évry "Statistique et Génome" 1
More informationDepartment of Forensic Psychiatry, School of Medicine & Forensics, Xi'an Jiaotong University, Xi'an, China;
Title: Evaluation of genetic susceptibility of common variants in CACNA1D with schizophrenia in Han Chinese Author names and affiliations: Fanglin Guan a,e, Lu Li b, Chuchu Qiao b, Gang Chen b, Tinglin
More informationLIFE SCIENCE CHAPTER 5 & 6 FLASHCARDS
LIFE SCIENCE CHAPTER 5 & 6 FLASHCARDS Why were ratios important in Mendel s work? A. They showed that heredity does not follow a set pattern. B. They showed that some traits are never passed on. C. They
More informationModeling IBD for Pairs of Relatives. Biostatistics 666 Lecture 17
Modeling IBD for Pairs of Relatives Biostatistics 666 Lecture 7 Previously Linkage Analysis of Relative Pairs IBS Methods Compare observed and expected sharing IBD Methods Account for frequency of shared
More informationLecture 9. QTL Mapping 2: Outbred Populations
Lecture 9 QTL Mapping 2: Outbred Populations Bruce Walsh. Aug 2004. Royal Veterinary and Agricultural University, Denmark The major difference between QTL analysis using inbred-line crosses vs. outbred
More informationCS 4491/CS 7990 SPECIAL TOPICS IN BIOINFORMATICS
CS 4491/CS 7990 SPECIAL TOPICS IN BIOINFORMATICS * Some contents are adapted from Dr. Hung Huang and Dr. Chengkai Li at UT Arlington Mingon Kang, Ph.D. Computer Science, Kennesaw State University Problems
More informationBare minimum on matrix algebra. Psychology 588: Covariance structure and factor models
Bare minimum on matrix algebra Psychology 588: Covariance structure and factor models Matrix multiplication 2 Consider three notations for linear combinations y11 y1 m x11 x 1p b11 b 1m y y x x b b n1
More informationUnconstrained Ordination
Unconstrained Ordination Sites Species A Species B Species C Species D Species E 1 0 (1) 5 (1) 1 (1) 10 (4) 10 (4) 2 2 (3) 8 (3) 4 (3) 12 (6) 20 (6) 3 8 (6) 20 (6) 10 (6) 1 (2) 3 (2) 4 4 (5) 11 (5) 8 (5)
More informationBreeding Values and Inbreeding. Breeding Values and Inbreeding
Breeding Values and Inbreeding Genotypic Values For the bi-allelic single locus case, we previously defined the mean genotypic (or equivalently the mean phenotypic values) to be a if genotype is A 2 A
More informationGenetics Studies of Multivariate Traits
Genetics Studies of Multivariate Traits Heping Zhang Department of Epidemiology and Public Health Yale University School of Medicine Presented at Southern Regional Council on Statistics Summer Research
More informationOverview. Background
Overview Implementation of robust methods for locating quantitative trait loci in R Introduction to QTL mapping Andreas Baierl and Andreas Futschik Institute of Statistics and Decision Support Systems
More informationSome models of genomic selection
Munich, December 2013 What is the talk about? Barley! Steptoe x Morex barley mapping population Steptoe x Morex barley mapping population genotyping from Close at al., 2009 and phenotyping from cite http://wheat.pw.usda.gov/ggpages/sxm/
More informationCovariance to PCA. CS 510 Lecture #14 February 23, 2018
Covariance to PCA CS 510 Lecture 14 February 23, 2018 Overview: Goal Assume you have a gallery (database) of images, and a probe (test) image. The goal is to find the database image that is most similar
More informationSecond-Order Inference for Gaussian Random Curves
Second-Order Inference for Gaussian Random Curves With Application to DNA Minicircles Victor Panaretos David Kraus John Maddocks Ecole Polytechnique Fédérale de Lausanne Panaretos, Kraus, Maddocks (EPFL)
More informationFundamental concepts of functional data analysis
Fundamental concepts of functional data analysis Department of Statistics, Colorado State University Examples of functional data 0 1440 2880 4320 5760 7200 8640 10080 Time in minutes The horizontal component
More informationNonlinear Dimensionality Reduction
Nonlinear Dimensionality Reduction Piyush Rai CS5350/6350: Machine Learning October 25, 2011 Recap: Linear Dimensionality Reduction Linear Dimensionality Reduction: Based on a linear projection of the
More informationDescriptive Statistics
Descriptive Statistics DS GA 1002 Probability and Statistics for Data Science http://www.cims.nyu.edu/~cfgranda/pages/dsga1002_fall17 Carlos Fernandez-Granda Descriptive statistics Techniques to visualize
More informationBig Idea 3: Living systems store, retrieve, transmit, and respond to information essential to life processes.
Big Idea 3: Living systems store, retrieve, transmit, and respond to information essential to life processes. Enduring understanding 3.A: Heritable information provides for continuity of life. Essential
More informationQuantitative characters - exercises
Quantitative characters - exercises 1. a) Calculate the genetic covariance between half sibs, expressed in the ij notation (Cockerham's notation), when up to loci are considered. b) Calculate the genetic
More informationIntroduction to Machine Learning. PCA and Spectral Clustering. Introduction to Machine Learning, Slides: Eran Halperin
1 Introduction to Machine Learning PCA and Spectral Clustering Introduction to Machine Learning, 2013-14 Slides: Eran Halperin Singular Value Decomposition (SVD) The singular value decomposition (SVD)
More informationThe Laplacian ( ) Matthias Vestner Dr. Emanuele Rodolà Room , Informatik IX
The Laplacian (26.05.2014) Matthias Vestner Dr. Emanuele Rodolà {vestner,rodola}@in.tum.de Room 02.09.058, Informatik IX Seminar «The metric approach to shape matching» Alfonso Ros Wednesday, May 28th
More informationINTRODUCTION TO ANIMAL BREEDING. Lecture Nr 3. The genetic evaluation (for a single trait) The Estimated Breeding Values (EBV) The accuracy of EBVs
INTRODUCTION TO ANIMAL BREEDING Lecture Nr 3 The genetic evaluation (for a single trait) The Estimated Breeding Values (EBV) The accuracy of EBVs Etienne Verrier INA Paris-Grignon, Animal Sciences Department
More informationProbability of Detecting Disease-Associated SNPs in Case-Control Genome-Wide Association Studies
Probability of Detecting Disease-Associated SNPs in Case-Control Genome-Wide Association Studies Ruth Pfeiffer, Ph.D. Mitchell Gail Biostatistics Branch Division of Cancer Epidemiology&Genetics National
More informationRegularization Parameter Selection for a Bayesian Multi-Level Group Lasso Regression Model with Application to Imaging Genomics
Regularization Parameter Selection for a Bayesian Multi-Level Group Lasso Regression Model with Application to Imaging Genomics arxiv:1603.08163v1 [stat.ml] 7 Mar 016 Farouk S. Nathoo, Keelin Greenlaw,
More informationLecture 1: Case-Control Association Testing. Summer Institute in Statistical Genetics 2015
Timothy Thornton and Michael Wu Summer Institute in Statistical Genetics 2015 1 / 1 Introduction Association mapping is now routinely being used to identify loci that are involved with complex traits.
More informationResemblance among relatives
Resemblance among relatives Introduction Just as individuals may differ from one another in phenotype because they have different genotypes, because they developed in different environments, or both, relatives
More informationOn Expected Gaussian Random Determinants
On Expected Gaussian Random Determinants Moo K. Chung 1 Department of Statistics University of Wisconsin-Madison 1210 West Dayton St. Madison, WI 53706 Abstract The expectation of random determinants whose
More informationCalculation of IBD probabilities
Calculation of IBD probabilities David Evans and Stacey Cherny University of Oxford Wellcome Trust Centre for Human Genetics This Session IBD vs IBS Why is IBD important? Calculating IBD probabilities
More informationNeuroimage Processing
Neuroimage Processing Instructor: Moo K. Chung mkchung@wisc.edu Lecture 10-11. Deformation-based morphometry (DBM) Tensor-based morphometry (TBM) November 13, 2009 Image Registration Process of transforming
More informationTAMS39 Lecture 2 Multivariate normal distribution
TAMS39 Lecture 2 Multivariate normal distribution Martin Singull Department of Mathematics Mathematical Statistics Linköping University, Sweden Content Lecture Random vectors Multivariate normal distribution
More informationEE16B Designing Information Devices and Systems II
EE16B Designing Information Devices and Systems II Lecture 9A Geometry of SVD, PCA Intro Last time: Described the SVD in Compact matrix form: U1SV1 T Full form: UΣV T Showed a procedure to SVD via A T
More informationBayesian Inference of Interactions and Associations
Bayesian Inference of Interactions and Associations Jun Liu Department of Statistics Harvard University http://www.fas.harvard.edu/~junliu Based on collaborations with Yu Zhang, Jing Zhang, Yuan Yuan,
More informationBTRY 7210: Topics in Quantitative Genomics and Genetics
BTRY 7210: Topics in Quantitative Genomics and Genetics Jason Mezey Biological Statistics and Computational Biology (BSCB) Department of Genetic Medicine jgm45@cornell.edu February 12, 2015 Lecture 3:
More informationDenisova cave. h,ps://
Denisova cave h,ps://www.flickr.com/photos/hmnh/3033749380/ Gene tree- species tree conflict can result from introgression or incomplete lineage sorcng Sous & Hey 2013 DisCnguishing incomplete lineage sorcng
More informationQuantitative Traits Modes of Selection
Quantitative Traits Modes of Selection Preservation of Favored Races in the Struggle for Life = Natural Selection 1. There is variation in morphology, function or behavior between individuals. 2. Some
More informationLecture 7 Correlated Characters
Lecture 7 Correlated Characters Bruce Walsh. Sept 2007. Summer Institute on Statistical Genetics, Liège Genetic and Environmental Correlations Many characters are positively or negatively correlated at
More informationLecture 24: Multivariate Response: Changes in G. Bruce Walsh lecture notes Synbreed course version 10 July 2013
Lecture 24: Multivariate Response: Changes in G Bruce Walsh lecture notes Synbreed course version 10 July 2013 1 Overview Changes in G from disequilibrium (generalized Bulmer Equation) Fragility of covariances
More informationMACHINE LEARNING. Methods for feature extraction and reduction of dimensionality: Probabilistic PCA and kernel PCA
1 MACHINE LEARNING Methods for feature extraction and reduction of dimensionality: Probabilistic PCA and kernel PCA 2 Practicals Next Week Next Week, Practical Session on Computer Takes Place in Room GR
More informationFocus was on solving matrix inversion problems Now we look at other properties of matrices Useful when A represents a transformations.
Previously Focus was on solving matrix inversion problems Now we look at other properties of matrices Useful when A represents a transformations y = Ax Or A simply represents data Notion of eigenvectors,
More informationSection 2. Basic formulas and identities in Riemannian geometry
Section 2. Basic formulas and identities in Riemannian geometry Weimin Sheng and 1. Bianchi identities The first and second Bianchi identities are R ijkl + R iklj + R iljk = 0 R ijkl,m + R ijlm,k + R ijmk,l
More informationGenetics Studies of Comorbidity
Genetics Studies of Comorbidity Heping Zhang Department of Epidemiology and Public Health Yale University School of Medicine Presented at Science at the Edge Michigan State University January 27, 2012
More informationLecture 32: Infinite-dimensional/Functionvalued. Functions and Random Regressions. Bruce Walsh lecture notes Synbreed course version 11 July 2013
Lecture 32: Infinite-dimensional/Functionvalued Traits: Covariance Functions and Random Regressions Bruce Walsh lecture notes Synbreed course version 11 July 2013 1 Longitudinal traits Many classic quantitative
More informationHeredity and Genetics WKSH
Chapter 6, Section 3 Heredity and Genetics WKSH KEY CONCEPT Mendel s research showed that traits are inherited as discrete units. Vocabulary trait purebred law of segregation genetics cross MAIN IDEA:
More informationLecture 9. Short-Term Selection Response: Breeder s equation. Bruce Walsh lecture notes Synbreed course version 3 July 2013
Lecture 9 Short-Term Selection Response: Breeder s equation Bruce Walsh lecture notes Synbreed course version 3 July 2013 1 Response to Selection Selection can change the distribution of phenotypes, and
More informationMultivariate analysis of genetic data: an introduction
Multivariate analysis of genetic data: an introduction Thibaut Jombart MRC Centre for Outbreak Analysis and Modelling Imperial College London XXIV Simposio Internacional De Estadística Bogotá, 25th July
More informationGrouped Network Vector Autoregression
Statistica Sinica: Supplement Grouped Networ Vector Autoregression Xuening Zhu 1 and Rui Pan 2 1 Fudan University, 2 Central University of Finance and Economics Supplementary Material We present here the
More informationTutorial Session 2. MCMC for the analysis of genetic data on pedigrees:
MCMC for the analysis of genetic data on pedigrees: Tutorial Session 2 Elizabeth Thompson University of Washington Genetic mapping and linkage lod scores Monte Carlo likelihood and likelihood ratio estimation
More informationRelationship between Genomic Distance-Based Regression and Kernel Machine Regression for Multi-marker Association Testing
Relationship between Genomic Distance-Based Regression and Kernel Machine Regression for Multi-marker Association Testing Wei Pan Division of Biostatistics, School of Public Health, University of Minnesota,
More informationQuantitative characters
Quantitative characters Joe Felsenstein GENOME 453, Autumn 015 Quantitative characters p.1/38 A random mating population with two genes having alleles each, at equal frequencies, symmetrically affecting
More informationA FAST, ACCURATE TWO-STEP LINEAR MIXED MODEL FOR GENETIC ANALYSIS APPLIED TO REPEAT MRI MEASUREMENTS
A FAST, ACCURATE TWO-STEP LINEAR MIXED MODEL FOR GENETIC ANALYSIS APPLIED TO REPEAT MRI MEASUREMENTS Qifan Yang 1,4, Gennady V. Roshchupkin 2, Wiro J. Niessen 2, Sarah E. Medland 3, Alyssa H. Zhu 1, Paul
More informationINTRODUCTION TO ANIMAL BREEDING. Lecture Nr 2. Genetics of quantitative (multifactorial) traits What is known about such traits How they are modeled
INTRODUCTION TO ANIMAL BREEDING Lecture Nr 2 Genetics of quantitative (multifactorial) traits What is known about such traits How they are modeled Etienne Verrier INA Paris-Grignon, Animal Sciences Department
More informationQuantitative Genomics and Genetics BTRY 4830/6830; PBSB
Quantitative Genomics and Genetics BTRY 4830/6830; PBSB.5201.01 Lecture16: Population structure and logistic regression I Jason Mezey jgm45@cornell.edu April 11, 2017 (T) 8:40-9:55 Announcements I April
More information25 : Graphical induced structured input/output models
10-708: Probabilistic Graphical Models 10-708, Spring 2016 25 : Graphical induced structured input/output models Lecturer: Eric P. Xing Scribes: Raied Aljadaany, Shi Zong, Chenchen Zhu Disclaimer: A large
More informationNotes on Twin Models
Notes on Twin Models Rodrigo Pinto University of Chicago HCEO Seminar April 19, 2014 This draft, April 19, 2014 8:17am Rodrigo Pinto Gene-environment Interaction and Causality, April 19, 2014 8:17am 1
More informationMIXED MODELS THE GENERAL MIXED MODEL
MIXED MODELS This chapter introduces best linear unbiased prediction (BLUP), a general method for predicting random effects, while Chapter 27 is concerned with the estimation of variances by restricted
More informationLecture 6: Introduction to Quantitative genetics. Bruce Walsh lecture notes Liege May 2011 course version 25 May 2011
Lecture 6: Introduction to Quantitative genetics Bruce Walsh lecture notes Liege May 2011 course version 25 May 2011 Quantitative Genetics The analysis of traits whose variation is determined by both a
More informationQuantitative characters
Quantitative characters Joe Felsenstein GENOME 453, Autumn 013 Quantitative characters p.1/38 A random mating population with two genes having alleles each, at equal frequencies, symmetrically affecting
More informationGenerative Models and Stochastic Algorithms for Population Average Estimation and Image Analysis
Generative Models and Stochastic Algorithms for Population Average Estimation and Image Analysis Stéphanie Allassonnière CIS, JHU July, 15th 28 Context : Computational Anatomy Context and motivations :
More informationCalculation of IBD probabilities
Calculation of IBD probabilities David Evans University of Bristol This Session Identity by Descent (IBD) vs Identity by state (IBS) Why is IBD important? Calculating IBD probabilities Lander-Green Algorithm
More informationReview We have covered so far: Single variant association analysis and effect size estimation GxE interaction and higher order >2 interaction Measurement error in dietary variables (nutritional epidemiology)
More informationPCA vignette Principal components analysis with snpstats
PCA vignette Principal components analysis with snpstats David Clayton October 30, 2018 Principal components analysis has been widely used in population genetics in order to study population structure
More informationGenetic Studies of Multivariate Traits
Genetic Studies of Multivariate Traits Heping Zhang Collaborative Center for Statistics in Science Yale University School of Public Health Presented at DIMACS, University of Rutgers May 17, 2013 Heping
More informationTheoretical and computational aspects of association tests: application in case-control genome-wide association studies.
Theoretical and computational aspects of association tests: application in case-control genome-wide association studies Mathieu Emily November 18, 2014 Caen mathieu.emily@agrocampus-ouest.fr - Agrocampus
More informationAdvances in Manifold Learning Presented by: Naku Nak l Verm r a June 10, 2008
Advances in Manifold Learning Presented by: Nakul Verma June 10, 008 Outline Motivation Manifolds Manifold Learning Random projection of manifolds for dimension reduction Introduction to random projections
More information