Chromatin Structure and Transcriptional Activity Shaping Genomic Landscapes in Eukaryotes

Similar documents
Biology. Biology. Slide 1 of 26. End Show. Copyright Pearson Prentice Hall

Chapter 15 Active Reading Guide Regulation of Gene Expression

The Eukaryotic Genome and Its Expression. The Eukaryotic Genome and Its Expression. A. The Eukaryotic Genome. Lecture Series 11

The nature of genomes. Viral genomes. Prokaryotic genome. Nonliving particle. DNA or RNA. Compact genomes with little spacer DNA

Introduction to Bioinformatics

Encoding and Decoding 3D Genome Organization

Introduction. Gene expression is the combined process of :

Complete all warm up questions Focus on operon functioning we will be creating operon models on Monday

Lecture 18 June 2 nd, Gene Expression Regulation Mutations

Transcription Factor Binding Site Positioning in Yeast: Proximal Promoter Motifs Characterize TATA-Less Promoters

Computational Biology: Basics & Interesting Problems

Eukaryotic Gene Expression

Предсказание и анализ промотерных последовательностей. Татьяна Татаринова

Big Idea 3: Living systems store, retrieve, transmit and respond to information essential to life processes. Tuesday, December 27, 16

Gene Regulation and Expression

REVIEW SESSION. Wednesday, September 15 5:30 PM SHANTZ 242 E

Regulation of Gene Expression

Measuring TF-DNA interactions

Intro Gene regulation Synteny The End. Today. Gene regulation Synteny Good bye!

Regulation of gene Expression in Prokaryotes & Eukaryotes

Whole-genome analysis of GCN4 binding in S.cerevisiae

12-5 Gene Regulation

Network Motifs of Pathogenic Genes in Human Regulatory Network

3.B.1 Gene Regulation. Gene regulation results in differential gene expression, leading to cell specialization.

Self Similar (Scale Free, Power Law) Networks (I)

Bio 119 Bacterial Genomics 6/26/10

Topic 4 - #14 The Lactose Operon

UE Praktikum Bioinformatik

Prokaryotic Regulation

6.047 / Computational Biology: Genomes, Networks, Evolution Fall 2008

Multiple Choice Review- Eukaryotic Gene Expression

Written Exam 15 December Course name: Introduction to Systems Biology Course no

(Lys), resulting in translation of a polypeptide without the Lys amino acid. resulting in translation of a polypeptide without the Lys amino acid.

A sequence-dependent physical model of nucleosome occupancy along the eukaryotic chromatin fiber. Alain Arneodo

Mole_Oce Lecture # 24: Introduction to genomics

16 CONTROL OF GENE EXPRESSION

A DNA Sequence 2017/12/6 1

INTERACTIVE CLUSTERING FOR EXPLORATION OF GENOMIC DATA

AP Bio Module 16: Bacterial Genetics and Operons, Student Learning Guide

Evaluation of the relative contribution of each STRING feature in the overall accuracy operon classification

Quantitative Test of the Barrier Nucleosome Model for Statistical Positioning of Nucleosomes Up- and Downstream of Transcription Start Sites

13.4 Gene Regulation and Expression

CHAPTER 12 - THE CELL CYCLE (pgs )

Newly made RNA is called primary transcript and is modified in three ways before leaving the nucleus:

How much non-coding DNA do eukaryotes require?

networks in molecular biology Wolfgang Huber

Regulation of Gene Expression

GENE REGULATION AND PROBLEMS OF DEVELOPMENT

Lineage specific conserved noncoding sequences in plants

Genomics and bioinformatics summary. Finding genes -- computer searches

AS A SERVICE TO THE RESEARCH COMMUNITY, GENOME BIOLOGY PROVIDES A 'PREPRINT' DEPOSITORY

Fuzzy Clustering of Gene Expression Data

A Study on chromatin structure via nucleosome positioning pattern classification using n-gram graphs

CS-E5880 Modeling biological networks Gene regulatory networks

15.2 Prokaryotic Transcription *

RNA Synthesis and Processing

Cell Division. OpenStax College. 1 Genomic DNA

Cell Growth and Reproduction Module B, Anchor 1

L3.1: Circuits: Introduction to Transcription Networks. Cellular Design Principles Prof. Jenna Rickus

Data visualization and clustering: an application to gene expression data

Co-ordination occurs in multiple layers Intracellular regulation: self-regulation Intercellular regulation: coordinated cell signalling e.g.

nucleus: DNA & chromosomes

Biology 105/Summer Bacterial Genetics 8/12/ Bacterial Genomes p Gene Transfer Mechanisms in Bacteria p.

Nucleosome Switching

An Information-Theoretic Approach to Methylation Data Analysis

Network motifs in the transcriptional regulation network (of Escherichia coli):

Discovering MultipleLevels of Regulatory Networks

Organization of Genes Differs in Prokaryotic and Eukaryotic DNA Chapter 10 p

Clustering of Pathogenic Genes in Human Co-regulatory Network. Michael Colavita Mentor: Soheil Feizi Fifth Annual MIT PRIMES Conference May 17, 2015

Honors Biology Reading Guide Chapter 11

Corresponding authors: Judith B Zaugg: +44 (0) Nicholas M Luscombe: +44 (0)

CHAPTER : Prokaryotic Genetics

INTEGRATING EPIGENETIC PRIORS FOR IMPROVING COMPUTATIONAL IDENTIFICATION OF TRANSCRIPTION FACTOR BINDING SITES AFFAN SHOUKAT

Networks. Can (John) Bruce Keck Founda7on Biotechnology Lab Bioinforma7cs Resource

Computational methods for the analysis of bacterial gene regulation Brouwer, Rutger Wubbe Willem

Genetic Variation: The genetic substrate for natural selection. Horizontal Gene Transfer. General Principles 10/2/17.

The Role of Nucleosome Positioning in the Evolution of Gene Regulation

Deciphering regulatory networks by promoter sequence analysis

Genome 541! Unit 4, lecture 3! Genomics assays

Supplementary Information

Small RNA in rice genome

Graph Alignment and Biological Networks

Chapter 18 Lecture. Concepts of Genetics. Tenth Edition. Developmental Genetics

Lesson Overview. Gene Regulation and Expression. Lesson Overview Gene Regulation and Expression

Nucleus. The nucleus is a membrane bound organelle that store, protect and express most of the genetic information(dna) found in the cell.

Control of Gene Expression

Stochastic simulations

Gene expression in prokaryotic and eukaryotic cells, Plasmids: types, maintenance and functions. Mitesh Shrestha

The Amoeba-Flagellate Transformation

Prokaryotic Gene Expression (Learning Objectives)

Clustering. Genome 559: Introduction to Statistical and Computational Genomics Elhanan Borenstein. Some slides adapted from Jacques van Helden

Clustering Lecture 1: Basics. Jing Gao SUNY Buffalo

Endless evolutionary paths to Virtual Microbes

Chapter 20. Initiation of transcription. Eukaryotic transcription initiation

Gene Regula*on, ChIP- X and DNA Mo*fs. Statistics in Genomics Hongkai Ji

10-810: Advanced Algorithms and Models for Computational Biology. microrna and Whole Genome Comparison

Computational methods for predicting protein-protein interactions

Meiosis produces haploid gametes.

Computational approaches for functional genomics

Name Period The Control of Gene Expression in Prokaryotes Notes

Transcription:

Invisible Cities: Segregated domains in eukaryote genomes Chromatin Structure and Transcriptional Activity Shaping Genomic Landscapes in Eukaryotes Christoforos Nikolaou Computational Genomics Group Dept. of Biology, University of Crete http://computational-genomics-uoc.weebly.com/

Genomes as Architectures Genomes as Architectures In Zoe the lack of Signs does not allow you to the understand the function of each building: You are lost in an indivisible environment. Italo Calvino, Invisible Cities Left: A 2D live cell DNA dstorm image, adapted from Benke A, Manley S. ChemBioChem. 2012 Right: Artistic representation of Calvino s city of Zoe by Karina Puente Frantzen

Structural Landmarks vs a U-topian Architecture? The lack of landmarks is a characteristic of ultra-designed architectural structures, in which the sense of space is by definition arbitrary. ( ου τόπος = no place) Most urban landscapes have, in contrast, developed through gradual aggregation of distinguishable architectural elements. Top left: Plan for the Ville Radieuse by Le Corbusier, Top Right: CDG International Airport, Paris Bottom Right: Aerial view of Paris

The multiple levels of genome organization Gene distribution Gene Regions/Gene Deserts Regulation of gene expression Eu/Hetero-chromatin Chromatin States Topologically Associated Domains (TAD) Transcription Factories Chromosomal Territories RidGE (Caron et al, 2001); Chromatin and Transcription (Jenuwein, 2001); Chromatin States (Ernst & Kellis, 2010); TAD (Bing Ren, 2011); 3D Genome (Fraser et al 2010); Chromosomal Territories (Dekker et al)

Can we deduce structural/functional domains in the genome? Structural Entities in the Eukaryotic Genome We are now in the position to dissect the eukaryotic genome in structurally coherent entities (TADs, chromosomal compartments, transcription factories etc) The question: Are there structural properties that distinguish between these entities or are we lost in an indivisible environment? Our goal: To analyze the eukaryote genome in linear space and see if there are attributes that can help us define genomic landmarks and thus grasp a hidden genome architecture. Hi-C map from the data of Duan et al (2009). Adapted from Tsochatzidou et al (2017)

Genome Architecture in one dimension Genome segmentation based on co-expression profiles Question: Do up/down-regulated genes tend to cluster in linear dimension? Answer: Yes. Clusters of consistent expression-change under topological stress are longer than expected by chance Tsochatzidou et al, Nucleic Acids Res. 2017

Topo-regulated gene clusters are also co-regulated Increased gene co-expression within the topologically defined clusters SPELL Database: >10000 gene expression profiles for yeast Question: Are the defined clusters also co-regulated? Answer: Yes. Topologically defined clusters show higher expression correlation that randomly selected consecutive gene clusters of the same size. Tsochatzidou et al, Nucleic Acids Res. 2017

Non-random genomic distribution of gene clusters Gene clusters of different types show clear positional preferences Question: Are gene clusters randomly positioned in the genome? Answer: No. Down-regulated clusters tend to be found close to the centromeres and away from the edges of chromosomes. The opposite is true for Up-regulated gene clusters. Tsochatzidou et al, Nucleic Acids Res. 2017

Gene Conservation and TFBS density Down-regulated clusters are more conserved. Up-regulated clusters have more complex regulation patterns. Down-regulated gene clusters are more conserved in terms of sequence than Upregulated ones, which, on the other hand contain more transcription factor binding sites in their promoters. Tsochatzidou et al, Nucleic Acids Res. 2017

Spatial preferences of gene clusters Up-regulated gene clusters are more distant from each other In general, expression values under topological stress, increase with increasing distance between adjacent genes. Distance between genes in Up-regulated clusters is significantly higher than the genome average, while Downregulated genes are placed closer than the expected mean. Distance preferences are not related to length of genes. Tsochatzidou et al, Nucleic Acids Res. 2017

Transcriptional directionality matters Patterns of transcriptional direction differ between clusters Down-regulated clusters have more common changes in transcriptional direction compared to Up-regulated ones where genes tend to be transcribed in the same direction. Tsochatzidou et al, Nucleic Acids Res. 2017

Genome Architecture in one dimension Co-directional transcription is buffering topological stress Genes transcribed in the same direction (and with greater intergenic space) can dissipate (or even harness) the accumulation of topological stress. Conservation of directionality is apparent in up-regulated genes even at low sequence conservation. Tsochatzidou et al, Nucleic Acids Res. 2017

Genome Urbanization Genome Organization reminiscent of urban landscapes Tsochatzidou et al, Nucleic Acids Res. 2017

The multiple levels of genome organization How can elements from one level associate/define the next one? Gene distribution Gene Regions/Gene Deserts Regulation of gene expression Eu/Hetero-chromatin Chromatin States? Topologically Associated Domains (TAD) Transcription Factories Chromosomal Territories RidGE (Caron et al, 2001); Chromatin and Transcription (Jenuwein, 2001); Chromatin States (Ernst & Kellis, 2010); TAD (Bing Ren, 2011); 3D Genome (Fraser et al 2010); Chromosomal Territories (Dekker et al)

Genome Architecture three dimensions Three-dimensional preferences for gene clusters. Up-regulated clusters tend to overlap TAD insulators Overlap with Insulators TADs defined on the basis of an insulation score. Insulator regions called on the basis of TAD separation. Up-regulated gene clusters are enriched in insulating regions. No apparent tendency was observed for down-regulated ones. Tsochatzidou et al, Nucleic Acids Res. 2017

From structure to function and back to structure How is the compartmentalization linked to the finer chromatin structure? Chromatin structure is stratified in hierarchical levels from local, nucleosomal patterns to global 3D topology. How are the two levels linked? Can broader genomic domains differ in their nucleosome positioning patterns? Left: Definition of TAD-like domains in yeast chriv. Nikolaou C. Current Genetics, 2017 Right: Different nucleosomal occupancy patterns for different gene groups. Nikolaou et al, NAR 2012

The Plan What we did We divided the yeast genome in TAD-like domains based on an insulation analysis of Hi-C maps We obtained the average nucleosome occupancy profile for the genes in each TAD-like domain We clustered the domains according to their average nucleosome occupancy pattern We wanted to see differences in the functional and structural properties of the different TAD-like clusters, such as: Genomic localization Gene conservation Transcription factor binding enrichment Chromosomal distribution

Nucleosome occupancy patterns delineate gene neighborhoods Strikingly different nucleosome occupancy profiles in the yeast genome Closed-chromatin pattern: Shallow Nucleosome Free Region (NFR) upstream of the TSS. Fuzzy nucleosomes upstream and downstream Open Chromatin pattern: A clear Nucleosome Free Region (NFR) upstream of the TSS. Periodically positioned nucleosomes downstream of the TSS. Very Open Chromatin pattern: Clear and very deep Nucleosome Free Region (NFR) upstream of the TSS. Periodically positioned nucleosomes downstream of the TSS. Wellpositioned -1 nucleosome upstream of the NFR. C. Nikolaou, Current Genetics, 2017

Nucleosome occupancy patterns delineate gene neighborhoods Open Chromatin Clusters. Deep NFR, Nucleosome Periodicity Closed Chromatin Clusters: Shallow or no NFR, lack of periodicity C. Nikolaou, Current Genetics, 2017

Some classes are different than others Different nucleosome occupancy is linked to various genomic aspects Different classes of TAD-like domains based on their nucleosome occupancy patterns also differ in a number of aspects such as size, sequence conservation and chromosomal localization. C. Nikolaou, Current Genetics, 2017

Regulatory compartmentalization between classes Strikingly different Regulatory potential among the TAD-like Classes Open/Closed Chromatin classes are recaptured at the level of transcription factor binding enrichment (Classes 1,2,4 vs 3,5 and 6). Differences between classes of the same type are also indicative of an additional level of complexity. (e.g. Class 2 shares a few factors with Class 5) Insulating Regions have predominantly characteristics of open chromatin. Cluster 3 appears to be the more depleted in transcription factor binding despite its apparently open nucleosome occupancy profile. C. Nikolaou, Current Genetics, 2017

Genome Compartmentalization in structural neighborhoods Uneven chromosomal distribution of TAD-like domain structural classes Unequal distribution of the different classes along the yeast genome and among chromosomes points towards a particular compartmentalization of structurally coherent genomic domains. C. Nikolaou, Current Genetics, 2017

Genome Segregation in non-interacting genomic neighborhoods Lack of inter-chromosomal interactions between classes Different classes tend to be isolated in terms of inter-chromosomal interactions. (Abundance of blue boxes) Overall lack of inter-chromosomal interactions is more pronounced between different classes than within them Classes that show increased interaction tendency belong to similar categories (red boxes) Insulating regions show the greatest tendency for inter-chromosomal interactions with themselves. C. Nikolaou, Current Genetics, 2017

Invisible Cities: Segregated domains in the yeast genome Part #1: Conclusions The architecture of the yeast genome contains structural landmarks instead of a uniform indivisible environment. The yeast genome is compartmentalized in TAD-like domains that differ in the nucleosome occupancy patterns of their contained genes. The different structural classes of genomic domains are associated with different chromatin structure (open/closed nucleosomal patterns), linear chromosomal localization (proximity to telomeres or ARS), sequence conservation and transcription factor binding enrichments. The yeast genome appears to be not only compartmentalized but also segregated in the sense that different classes tend to be isolated, lacking interactions in 3D space. Our observations support the evolution of the genome s architecture through self-organization of chromatin structural elements.

End credits Our collaborators Our Group Maria Tsochatzidou Analysis of GRO seq data Roderic Guigo, (CRG, Barcelona) Maria Malliarou Genome compartmentalization Antonis Klonizakis Transcription factor enrichment analysis Antonis Papadakis Regulatory Networks Vassilis Ntassis Genome Segregation Stelios Mavropoulos, Labrina Bondi Bipartite Networks Joaquim Roca, (CSIC, Barcelona) George Kollias (BSRC Alexander Fleming, Athens) Eleni Lianoudaki Position-dependent Gene Expression Nefely Paschou Nina Koukourikou http://computational-genomics-uoc.weebly.com/