BIOLOGY. Genomes and Their Evolution CAMPBELL. Reece Urry Cain Wasserman Minorsky Jackson

Similar documents
Genomes and Their Evolution

Campbell Biology 10. A Global Approach. Chapter 20 The Evolution of Genomes

Genomes and Their Evolution

Chapter 18 Active Reading Guide Genomes and Their Evolution

Genomes and Their Evolution

(b) α lactalbumin. (a) Lysozyme. Lysozyme R A T N Y NAGD R QA I V E NN E L L K D I DGY. Lysozyme DA V ACA K R V V. Lysozyme

METHODS FOR DETERMINING PHYLOGENY. In Chapter 11, we discovered that classifying organisms into groups was, and still is, a difficult task.

The Eukaryotic Genome and Its Expression. The Eukaryotic Genome and Its Expression. A. The Eukaryotic Genome. Lecture Series 11

Outline. Genome Evolution. Genome. Genome Architecture. Constraints on Genome Evolution. New Evolutionary Synthesis 11/8/16

Bio 1B Lecture Outline (please print and bring along) Fall, 2007

Organization of Genes Differs in Prokaryotic and Eukaryotic DNA Chapter 10 p

Outline. Genome Evolution. Genome. Genome Architecture. Constraints on Genome Evolution. New Evolutionary Synthesis 11/1/18

Full file at CHAPTER 2 Genetics

2012 Univ Aguilera Lecture. Introduction to Molecular and Cell Biology

Meiosis and Sexual Life Cycles

BIOLOGY. Meiosis and Sexual Life Cycles CAMPBELL. Reece Urry Cain Wasserman Minorsky Jackson

Introduction to Molecular and Cell Biology

Molecular evolution - Part 1. Pawan Dhar BII

Frequently Asked Questions (FAQs)

(Lys), resulting in translation of a polypeptide without the Lys amino acid. resulting in translation of a polypeptide without the Lys amino acid.

Multiple Choice Review- Eukaryotic Gene Expression

Molecular Evolution & the Origin of Variation

Molecular Evolution & the Origin of Variation

Computational Biology: Basics & Interesting Problems

UNIT 5. Protein Synthesis 11/22/16

18.4 Embryonic development involves cell division, cell differentiation, and morphogenesis

Eukaryotic vs. Prokaryotic genes

Principles of Genetics

Unit 5: Cell Division and Development Guided Reading Questions (45 pts total)

Meiosis and Sexual Life Cycles

Genetic Variation: The genetic substrate for natural selection. Horizontal Gene Transfer. General Principles 10/2/17.

Homeotic Genes and Body Patterns

GCD3033:Cell Biology. Transcription

TE content correlates positively with genome size

2. Cellular and Molecular Biology

From gene to protein. Premedical biology

Honors Biology Reading Guide Chapter 11

Biology 105/Summer Bacterial Genetics 8/12/ Bacterial Genomes p Gene Transfer Mechanisms in Bacteria p.

Related Courses He who asks is a fool for five minutes, but he who does not ask remains a fool forever.

Meiosis and Sexual Life Cycles

Chapter 18 Lecture. Concepts of Genetics. Tenth Edition. Developmental Genetics

ECOL/MCB 320 and 320H Genetics

Flow of Genetic Information

Biology 112 Practice Midterm Questions

Special Topics on Genetics

BME 5742 Biosystems Modeling and Control

3/8/ Complex adaptations. 2. often a novel trait

Reading Assignments. A. Systems of Cell Division. Lecture Series 5 Cell Cycle & Cell Division

Lecture Series 5 Cell Cycle & Cell Division

Chapter 15 Active Reading Guide Regulation of Gene Expression

Lesson Overview. Gene Regulation and Expression. Lesson Overview Gene Regulation and Expression

5/4/05 Biol 473 lecture

Lecture Series 5 Cell Cycle & Cell Division

Newly made RNA is called primary transcript and is modified in three ways before leaving the nucleus:

The Gene The gene; Genes Genes Allele;

Unit 5: Taxonomy. KEY CONCEPT Organisms can be classified based on physical similarities.

PROTEIN SYNTHESIS INTRO

DNA Structure and Function

A diploid somatic cell from a rat has a total of 42 chromosomes (2n = 42). As in humans, sex chromosomes determine sex: XX in females and XY in males.

Unit 2: Cellular Chemistry, Structure, and Physiology Module 5: Cellular Reproduction

Topic 8 Mitosis & Meiosis Ch.12 & 13. The Eukaryotic Genome. The Eukaryotic Genome. The Eukaryotic Genome

Bio 119 Bacterial Genomics 6/26/10

Algorithms in Computational Biology (236522) spring 2008 Lecture #1

GENETICS - CLUTCH CH.1 INTRODUCTION TO GENETICS.

Regulation of Gene Expression

Drosophila melanogaster and D. simulans, two fruit fly species that are nearly

1. In most cases, genes code for and it is that

Sperm & Eggs & Variation..OH MY!

Virginia Western Community College BIO 101 General Biology I

Bacterial Genetics & Operons

Chapters 12&13 Notes: DNA, RNA & Protein Synthesis

CHAPTER 23 THE EVOLUTIONS OF POPULATIONS. Section C: Genetic Variation, the Substrate for Natural Selection

Designer Genes C Test

Chapter 8 Lectures by Gregory Ahearn University of North Florida

Name: Class: Date: ID: A

GSBHSRSBRSRRk IZTI/^Q. LlML. I Iv^O IV I I I FROM GENES TO GENOMES ^^^H*" ^^^^J*^ ill! BQPIP. illt. goidbkc. itip31. li4»twlil FIFTH EDITION

Vital Statistics Derived from Complete Genome Sequencing (for E. coli MG1655)

CGS 5991 (2 Credits) Bioinformatics Tools

Chapter 19. History of Life on Earth

Cell Division: the process of copying and dividing entire cells The cell grows, prepares for division, and then divides to form new daughter cells.

Small RNA in rice genome

Biology. Biology. Slide 1 of 26. End Show. Copyright Pearson Prentice Hall

Chapter 10, 11, 14: Gene Expression, Regulation, and Development Exam

Introduction to molecular biology. Mitesh Shrestha

MEIOSIS C H A P T E R 1 3

Chapter 13 Meiosis and Sexual Reproduction

Valley Central School District 944 State Route 17K Montgomery, NY Telephone Number: (845) ext Fax Number: (845)

Lecture 20 DNA Repair and Genetic Recombination (Chapter 16 and Chapter 15 Genes X)

BIOINFORMATICS LAB AP BIOLOGY

Genetics 275 Notes Week 7

Biology I Level - 2nd Semester Final Review

Lecture Notes: BIOL2007 Molecular Evolution

How much non-coding DNA do eukaryotes require?

Revision Based on Chapter 19 Grade 11

PLNT2530 (2018) Unit 5 Genomes: Organization and Comparisons

Bioinformatics Exercises

Texas Biology Standards Review. Houghton Mifflin Harcourt Publishing Company 26 A T

STUDY UNIT 1 MITOSIS AND MEIOSIS. Klug, Cummings & Spencer Chapter 2. Morphology of eukaryotic metaphase chromosomes. Chromatids

Exploring Evolution & Bioinformatics

Big Idea 3: Living systems store, retrieve, transmit and respond to information essential to life processes. Tuesday, December 27, 16

Transcription:

CAMPBELL BIOLOGY TENTH EDITION Reece Urry Cain Wasserman Minorsky Jackson 21 Genomes and Their Evolution Lecture Presentation by Nicole Tunbridge and Kathleen Fitzpatrick

Reading the Leaves from the Tree of Life Complete genome sequences exist for a human, chimpanzee, E. coli, brewer s yeast, corn, fruit fly, house mouse, rhesus macaque, and many other organisms Comparisons of genomes among organisms provide insights into evolution and other biological processes

Genomics is the study of whole sets of genes and their interactions Bioinformatics is the application of computational methods to the storage and analysis of biological data

Figure 21.1

Figure 21.1a House mouse (Mus musculus)

Concept 21.1: The Human Genome Project fostered development of faster, less expensive sequencing techniques Officially begun as the Human Genome Project in 1990, the sequencing of the human genome was largely completed by 2003 The genome was completed using sequencing machines and the dideoxy chain termination method A major thrust of the project was development of technology for faster sequencing

Two approaches complemented each other in obtaining the complete sequence The initial approach built on an earlier storehouse of human genetic information Then J. Craig Venter set up a company to sequence the entire genome using an alternative whole-genome shotgun approach This used cloning and sequencing of fragments of randomly cut DNA followed by assembly into a single continuous sequence

Figure 21.2-1 1 Cut the DNA into overlapping fragments short enough for sequencing. 2 Clone the fragments in plasmid or other vectors.

Figure 21.2-2 1 Cut the DNA into overlapping fragments short enough for sequencing. 2 Clone the fragments in plasmid or other vectors. 3 Sequence each fragment. CGCCATCAGT AGTCCGCTATACGA ACGATACTGGT

Figure 21.2-3 1 Cut the DNA into overlapping fragments short enough for sequencing. 2 Clone the fragments in plasmid or other vectors. 3 Sequence each fragment. CGCCATCAGT AGTCCGCTATACGA ACGATACTGGT 4 Order the sequences into one overall sequence with computer software. CGCCATCAGT ACGATACTGGT AGTCCGCTATACGA CGCCATCAGTCCGCTATACGATACTGGT

Today the whole-genome shotgun approach is widely used, though newer techniques are contributing to the faster pace and lowered cost of genome sequencing These newer techniques do not require a cloning step These techniques have also facilitated a metagenomics approach in which DNA from a group of species in an environmental sample is sequenced

Concept 21.2: Scientists use bioinformatics to analyze genomes and their functions The Human Genome Project established databases and refined analytical software to make data available on the Internet This has accelerated progress in DNA sequence analysis

Centralized Resources for Analyzing Genome Sequences Bioinformatics resources are provided by a number of sources National Library of Medicine and the National Institutes of Health (NIH) created the National Center for Biotechnology Information (NCBI) European Molecular Biology Laboratory DNA Data Bank of Japan BGI in Shenzhen, China

Genbank, the NCBI database of sequences, doubles its data approximately every 18 months Software is available that allows online visitors to search Genbank for matches to A specific DNA sequence A predicted protein sequence Common stretches of amino acids in a protein The NCBI website also provides 3-D views of all protein structures that have been determined

Figure 21.3 WD40 - Sequence Alignment Viewer WD40 - Cn3D 4.1 Name: WD40 CDD Descriptive Items WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mrna processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40;

Identifying Protein-Coding Genes and Understanding Their Functions Using available DNA sequences, geneticists can study genes directly The identification of protein coding genes within DNA sequences in a database is called gene annotation

Gene annotation is largely an automated process Comparison of sequences of previously unknown genes with those of known genes in other species may help provide clues about their function

Understanding Genes and Gene Expression at the Systems Level Proteomics is the systematic study of full protein sets encoded by a genome Proteins, not genes, carry out most of the activities of the cell

How Systems Are Studied: An Example A systems biology approach can be applied to define gene circuits and protein interaction networks Researchers working on the yeast Saccharomyces cerevisiae used sophisticated techniques to disable pairs of genes one pair at a time, creating double mutants Computer software then mapped genes to produce a network-like functional map of their interactions The systems biology approach is possible because of advances in bioinformatics

Figure 21.4 Translation and ribosomal functions Mitochondrial functions RNA processing Transcription and chromatin-related functions Nuclearcytoplasmic transport Nuclear migration and protein degradation Mitosis DNA replication and repair Cell polarity and morphogenesis Peroxisomal functions Metabolism and amino acid biosynthesis Secretion and vesicle transport Protein folding and glycosylation; cell wall biosynthesis Vesicle fusion Glutamate biosynthesis Amino acid permease pathway Serinerelated biosynthesis

Figure 21.4a Translation and ribosomal functions Mitochondrial functions RNA processing Transcription and chromatin-related functions Nuclearcytoplasmic transport Nuclear migration and protein degradation Mitosis DNA replication and repair Cell polarity and morphogenesis Peroxisomal functions Metabolism and amino acid biosynthesis Secretion and vesicle transport Protein folding and glycosylation; cell wall biosynthesis

Figure 21.4b Glutamate biosynthesis Metabolism and amino acid biosynthesis Vesicle fusion Amino acid permease pathway Serinerelated biosynthesis

Application of Systems Biology to Medicine The Cancer Genome Atlas project, started in 2010, looked for all the common mutations in three types of cancer by comparing gene sequences and expression in cancer versus normal cells This was so fruitful, it has been extended to ten other common cancers Silicon and glass chips have been produced that hold a microarray of most known human genes These are used to study gene expression patterns in patients suffering from various cancers or other diseases

Figure 21.5

Concept 21.3: Genomes vary in size, number of genes, and gene density By early 2013, over 4,300 genomes were completely sequenced, including 4,000 bacteria, 186 archaea, and 183 eukaryotes Sequencing of over 9,600 genomes and over 370 metagenomes is currently in progress

Genome Size Genomes of most bacteria and archaea range from 1 to 6 million base pairs (Mb); genomes of eukaryotes are usually larger Most plants and animals have genomes greater than 100 Mb; humans have 3,000 Mb Within each domain there is no systematic relationship between genome size and phenotype

Table 21.1

Number of Genes Free-living bacteria and archaea have 1,500 to 7,500 genes Unicellular fungi have from about 5,000 genes and multicellular eukaryotes up to at least 40,000 genes

Number of genes is not correlated to genome size For example, it is estimated that the nematode C. elegans has 100 Mb and 20,100 genes, while Drosophila has 165 Mb and 14,000 genes Researchers predicted the human genome would contain about 50,000 to 100,000 genes; however the number is around 21,000 Vertebrate genomes can produce more than one polypeptide per gene because of alternative splicing of RNA transcripts

Gene Density and Noncoding DNA Humans and other mammals have the lowest gene density, or number of genes, in a given length of DNA Multicellular eukaryotes have many introns within genes and a large amount of noncoding DNA between genes

Concept 21.4: Multicellular eukaryotes have much noncoding DNA and many multigene families Sequencing of the human genome reveals that 98.5% does not code for proteins, rrnas, or trnas About a quarter of the human genome codes for introns and gene-related regulatory sequences

Intergenic DNA is noncoding DNA found between genes Pseudogenes are former genes that have accumulated mutations and are nonfunctional Repetitive DNA is present in multiple copies in the genome About three-fourths of repetitive DNA is made up of transposable elements and sequences related to them

Figure 21.6 Exons (1.5%) Regulatory sequences (5%) L1 sequences (17%) Repetitive DNA that includes transposable elements and related sequences (44%) Alu elements (10%) Simple sequence DNA (3%) Introns ( 20%) Repetitive DNA unrelated to transposable elements (14%) Unique noncoding DNA (15%) Large-segment duplications (5 6%)

Much evidence indicates that noncoding DNA (previously called junk DNA ) plays important roles in the cell For example, genomes of humans, rats, and mice show high sequence conservation for about 500 noncoding regions

Transposable Elements and Related Sequences The first evidence for mobile DNA segments came from geneticist Barbara McClintock s breeding experiments with Indian corn McClintock identified changes in the color of corn kernels that made sense only if some genetic elements move from other genome locations into the genes for kernel color These transposable elements move from one site to another in a cell s DNA; they are present in both prokaryotes and eukaryotes

Figure 21.7

Figure 21.7a

Figure 21.7b

Movement of Transposons and Retrotransposons Eukaryotic transposable elements are of two types Transposons, which move by means of a DNA intermediate and require a transposase enzyme Retrotransposons, which move by means of an RNA intermediate, using a reverse transcriptase

Figure 21.8 Transposon New copy of transposon DNA of genome Transposon is copied Insertion Mobile copy of transposon

Figure 21.9 Retrotransposon New copy of retrotransposon Synthesis of a single-stranded RNA intermediate RNA Reverse transcriptase Insertion DNA strand Mobile copy of retrotransposon

Sequences Related to Transposable Elements Multiple copies of transposable elements and related sequences are scattered throughout eukaryotic genomes In primates, a large portion of transposable element related DNA consists of a family of similar sequences called Alu elements Many Alu elements are transcribed into RNA molecules; some are thought to help regulate gene expression

The human genome also contains many sequences of a type of retrotransposon called LINE-1 (L1) L1 sequences have a low rate of transposition and may have effects on gene expression L1 transposons may play roles in the diversity of neuronal cell types

Other Repetitive DNA, Including Simple Sequence DNA About 15% of the human genome consists of duplication of long sequences of DNA from one location to another In contrast, simple sequence DNA contains many copies of tandemly repeated short sequences

A series of repeating units of 2 to 5 nucleotides is called a short tandem repeat (STR) The repeat number for STRs can vary among sites (within a genome) or individuals Simple sequence DNA is common in centromeres and telomeres, where it probably plays structural roles in the chromosome

Genes and Multigene Families Many eukaryotic genes are present in one copy per haploid set of chromosomes The rest of the genes occur in multigene families, collections of identical or very similar genes Some multigene families consist of identical DNA sequences, usually clustered tandemly, such as those that code for rrna products

Figure 21.10a DNA Direction of transcription RNA transcripts Nontranscribed spacer Transcription unit DNA 18S 5.8S 28S rrna 5.8S 28S 18S (a) Part of the ribosomal RNA gene family

The classic examples of multigene families of nonidentical genes are two related families of genes that encode globins -globins and -globins are polypeptides of hemoglobin and are coded by genes on different human chromosomes and are expressed at different times in development

Figure 21.10b β-globin α-globin α-globin β-globin Heme α-globin gene family β-globin gene family Chromosome 16 Chromosome 11 ζ ζ α 2 α 1 θ ϵ G A β β α 2 α 1 Fetus Embryo and adult Embryo Fetus Adult (b) The human α-globin and β-globin gene families

Concept 21.5: Duplication, rearrangement, and mutation of DNA contribute to genome evolution The basis of change at the genomic level is mutation, which underlies much of genome evolution The earliest forms of life likely had only those genes necessary for survival and reproduction The size of genomes has increased over evolutionary time, with the extra genetic material providing raw material for gene diversification

Duplication of Entire Chromosome Sets Accidents in meiosis can lead to one or more extra sets of chromosomes, a condition known as polyploidy The genes in one or more of the extra sets can diverge by accumulating mutations; these variations may persist if the organism carrying them survives and reproduces In this way genes with novel functions can evolve

Alterations of Chromosome Structure Humans have 23 pairs of chromosomes, while chimpanzees have 24 pairs Following the divergence of humans and chimpanzees from a common ancestor, two ancestral chromosomes fused in the human line Duplications and inversions result from mistakes during meiotic recombination Comparative analysis between chromosomes of humans and seven mammalian species paints a hypothetical chromosomal evolutionary history

Figure 21.11 Telomere sequences Human chromosome Chimpanzee chromosomes Centromere sequences Telomere-like sequences 12 Centromere-like sequences 2 13

Figure 21.12 Human chromosome Mouse chromosomes 16 7 8 16 17

The rate of duplications and inversions seems to have accelerated about 100 million years ago This coincides with when large dinosaurs went extinct and mammals diversified Chromosomal rearrangements are thought to contribute to the generation of new species

Duplication and Divergence of Gene-Sized Regions of DNA Unequal crossing over during prophase I of meiosis can result in one chromosome with a deletion and another with a duplication of a particular region Transposable elements can provide sites for crossover between nonsister chromatids

Figure 21.13 Nonsister chromatids Gene Transposable element Incorrect pairing of two homologs during meiosis Crossover point and

Evolution of Genes with Related Functions: The Human Globin Genes The genes encoding the various globin proteins evolved from one common ancestral globin gene, which duplicated and diverged about 450 500 million years ago After the duplication events, differences between the genes in the globin family arose from the accumulation of mutations

Figure 21.14 Evolutionary time Ancestral globin gene Duplication of ancestral gene Mutation in both copies α β Transposition to different chromosomes Further duplications and mutations ζ α α ϵ β β ζ ζ α α 2 α 1 β 2 α 1 y θ ϵ G β A α-globin gene family on chromosome 16 β-globin gene family on chromosome 11

Subsequent duplications of these genes and random mutations gave rise to the present globin genes, which code for oxygen-binding proteins The similarity in the amino acid sequences of the various globin proteins supports this model of gene duplication and mutation

Evolution of Genes with Novel Functions The copies of some duplicated genes have diverged so much in evolution that the functions of their encoded proteins are now very different For example the lysozyme gene was duplicated and evolved into the gene that encodes -lactalbumin in mammals Lysozyme is an enzyme that helps protect animals against bacterial infection -lactalbumin is a nonenzymatic protein that plays a role in milk production in mammals

Figure 21.15 (a) Lysozyme (b) α lactalbumin Lysozyme α lactalbumin Lysozyme α lactalbumin Lysozyme α lactalbumin 1 1 51 51 101 101 (c) Amino acid sequence alignments of lysozyme and α lactalbumin

Rearrangements of Parts of Genes: Exon Duplication and Exon Shuffling The duplication or repositioning of exons has contributed to genome evolution Errors in meiosis can result in an exon being duplicated on one chromosome and deleted from the homologous chromosome In exon shuffling, errors in meiotic recombination lead to some mixing and matching of exons, either within a gene or between two nonallelic genes

Figure 21.16 EGF EGF EGF EGF Epidermal growth factor gene with multiple EGF exons F F F F Fibronectin gene with multiple finger exons Exon shuffling F EGF Exon duplication K K K Plasminogen gene with a kringle exon Exon shuffling Portions of ancestral genes TPA gene as it exists today

How Transposable Elements Contribute to Genome Evolution Multiple copies of similar transposable elements may facilitate recombination, or crossing over, between different chromosomes Insertion of transposable elements within a protein-coding sequence may block protein production Insertion of transposable elements within a regulatory sequence may increase or decrease protein production

Transposable elements may carry a gene or groups of genes to a new position Transposable elements may also create new sites for alternative splicing in an RNA transcript In all cases, changes are usually detrimental but may on occasion prove advantageous to an organism

Concept 21.6: Comparing genome sequences provides clues to evolution and development Comparisons of genome sequences from different species reveal much about the evolutionary history of life Comparative studies of embryonic development are beginning to clarify the mechanisms that generated the diversity of life-forms present today

Comparing Genomes Genome comparisons of closely related species help us understand recent evolutionary events Relationships among species can be represented by a tree-shaped diagram

Figure 21.17 Most recent common ancestor of all living things 4 3 2 1 0 Billions of years ago Bacteria Eukarya Archaea Chimpanzee Human Mouse 70 60 50 40 30 20 10 0 Millions of years ago

Comparing Distantly Related Species Highly conserved genes have changed very little over time These help clarify relationships among species that diverged from each other long ago Bacteria, archaea, and eukaryotes diverged from each other between 2 and 4 billion years ago Highly conserved genes can be studied in one model organism, and the results applied to other organisms

Comparing Closely Related Species Genomes of closely related species are likely to be organized similarly For example, using the human genome sequence as a guide, researchers were quickly able to sequence the chimpanzee genome Analysis of the human and chimpanzee genomes reveals some general differences that underlie the differences between the two organisms

Human and chimpanzee genomes differ by 1.2% at single base-pairs, and by 2.7% because of insertions and deletions Sequencing of the bonobo genome in 2012 reveals that in some regions there is greater similarity between human and bonobo or chimpanzee sequences than between chimpanzee and bonobo

A number of genes are apparently evolving faster in the human than in the chimpanzee or mouse Among them are genes involved in defense against malaria and tuberculosis and one that regulates brain size

Humans and chimpanzees differ in the expression of the FOXP2 gene, whose product turns on genes involved in vocalization Differences in the FOXP2 gene may explain why humans but not chimpanzees communicate by speech The FOXP2 gene of Neanderthals is identical to that of humans, suggesting they may have been capable of speech

Number of whistles Figure 21.18 Experiment Wild type: two normal copies of FOXP2 Heterozygote: one copy of FOXP2 disrupted Homozygote: both copies of FOXP2 disrupted Experiment 1: Researchers cut thin sections of brain and stained them with reagents that allow visualization of brain anatomy in a UV fluorescence microscope. Results Experiment 1 Experiment 2: Researchers separated each newborn pup from its mother and recorded the number of ultrasonic whistles produced by the pup. Experiment 2 400 300 200 Wild type Heterozygote Homozygote 100 0 Wild type Heterozygote (No whistles) Homozygote

Figure 21.18a Experiment Wild type: two normal copies of FOXP2 Heterozygote: one copy of FOXP2 disrupted Homozygote: both copies of FOXP2 disrupted Experiment 1: Researchers cut thin sections of brain and stained them with reagents that allow visualization of brain anatomy in a UV fluorescence microscope. Results Experiment 1 Wild type Heterozygote Homozygote

Figure 21.18aa Wild type: two normal copies of FOXP2

Figure 21.18ab Heterozygote: one copy of FOXP2 disrupted

Figure 21.18ac Homozygote: both copies of FOXP2 disrupted

Number of whistles Figure 21.18b Experiment Wild type: two normal copies of FOXP2 Heterozygote: one copy of FOXP2 disrupted Homozygote: both copies of FOXP2 disrupted Experiment 2: Researchers separated each newborn pup from its mother and recorded the number of ultrasonic whistles produced by the pup. Results Experiment 2 400 300 200 100 0 Wild type Heterozygote (No whistles) Homozygote

Figure 21.18ba

Comparing Genomes Within a Species As a species, humans have only been around about 200,000 years and have low within-species genetic variation Variation within humans is due to single nucleotide polymorphisms, inversions, deletions, and duplications Most surprising is the large number of copynumber variants These variations are useful for studying human evolution and human health

Widespread Conservation of Developmental Genes Among Animals Evolutionary developmental biology, or evo-devo, is the study of the evolution of developmental processes in multicellular organisms Genomic information shows that minor differences in gene sequence or regulation can result in striking differences in form

Molecular analysis of the homeotic genes in Drosophila has shown that they all include a sequence called a homeobox An identical or very similar nucleotide sequence has been discovered in the homeotic genes of both vertebrates and invertebrates Homeobox genes code for a domain that allows a protein to bind to DNA and to function as a transcription regulator Homeotic genes in animals are called Hox genes

Figure 21.19 Adult fruit fly Fruit fly embryo (10 hours) Fruit fly chromosome Mouse chromosomes Mouse embryo (12 days) Adult mouse

Related homeobox sequences have been found in regulatory genes of yeasts, plants, and even prokaryotes In addition to homeotic genes, many other developmental genes are highly conserved from species to species

Sometimes small changes in regulatory sequences of certain genes lead to major changes in body form For example, variation in Hox gene expression controls variation in leg-bearing segments of crustaceans and insects In other cases, genes with conserved sequences play different roles in different species

Figure 21.20 Thorax Genital segments Abdomen (a) Expression of four Hox genes in the brine shrimp Artemia Thorax Abdomen (b) Expression of the grasshopper versions of the same four Hox genes

Figure 21.10 DNA Direction of transcription RNA transcripts β-globin Nontranscribed spacer DNA rrna 28S Transcription unit 18S 5.8S 28S 18S 5.8S (a) Part of the ribosomal RNA gene family α-globin β-globin α 2 α 1 α-globin Heme α-globin gene family β-globin gene family Chromosome 16 Chromosome 11 ζ ζ α 2 α 1 θ ϵ G A β β Fetus Embryo and adult Embryo Fetus Adult (b) The human α-globin and β-globin gene families

Figure 21.10c DNA Direction of transcription RNA transcripts Nontranscribed spacer Transcription unit

Figure 21.UN01a Globin Alignment of Globin Amino Acid Sequences α 1 ζ α 1 ζ α 1 ζ α 1 ζ α 1 ζ 1 1 31 31 61 61 91 91 121 121 MVLSPADKTNVKAAWGKVGAHAGEYGAEAL MSLTKTER T I I VSMWAK I S TQADT IGTETL ERMFLSFPTTKTYFPHFDLSH GSAQVKGH ERLFLSHPQTKTYFPHFDL HPGSAQLRAH GKKVADALTNAVAHVDDMPNALSALSDLHA GSKVVAAVGDAVKSIDDIGGALSKLSELHA HKLRVDPVNFKLLSHCLLVTLAAHL PAEFT YILRVDPVNFKLLSHCLLVTLAARFPADFT PAVHASLDKFLASVSTVLTSKYR AEAHAAWDKFLSVVSSVLTEKYR

Figure 21.UN01b β Family α Family Amino Acid Identity Table α Family β Family α 1 (alpha 1) α 2 (alpha 2) ζ (zeta) β (beta) (delta) ϵ (epsilon) A (gamma A) G (gamma G) α 1 ----- 100 61 45 44 39 42 42 α 2 ----- 61 45 44 39 42 42 ζ ----- 38 40 41 41 41 β ----- 93 76 73 73 ----- 73 71 72 ϵ ----- 80 80 A ----- 99 G -----

Figure 21.UN01c β α α β Hemoglobin

Figure 21.UN02

Figure 21.UN03 Bacteria Archaea Eukarya Genome size Most are 1 6 Mb Most are 10 4,000 Mb, but a few are much larger Number of genes Gene density 1,500 7,500 Higher than in eukaryotes 5,000 40,000 Lower than in prokaryotes (Within eukaryotes, lower density is correlated with larger genomes.) Introns None in protein-coding genes Present in some genes Present in most genes of multicellular eukaryotes, but only in some genes of unicellular eukaryotes Other noncoding DNA Very little Can exist in large amounts; generally more repetitive noncoding DNA in multicellular eukaryotes

Figure 21.UN04 α-globin gene family β-globin gene family Chromosome 16 Chromosome 11 ζ ζ α 2 α 1 θ ϵ β β α 2 α 1 G A

Figure 21.UN05

Figure 21.UN06