The translation machinery of the cell works with triples of types of RNA bases. Any triple of RNA bases is known as a codon. The set of codons is

Similar documents
Proteins: Characteristics and Properties of Amino Acids

Clustering and Model Integration under the Wasserstein Metric. Jia Li Department of Statistics Penn State University

SEQUENCE ALIGNMENT BACKGROUND: BIOINFORMATICS. Prokaryotes and Eukaryotes. DNA and RNA

Protein Fragment Search Program ver Overview: Contents:

Amino Acid Side Chain Induced Selectivity in the Hydrolysis of Peptides Catalyzed by a Zr(IV)-Substituted Wells-Dawson Type Polyoxometalate

Packing of Secondary Structures

Snork Synthesis Lab Lab Directions

Other Methods for Generating Ions 1. MALDI matrix assisted laser desorption ionization MS 2. Spray ionization techniques 3. Fast atom bombardment 4.

Structure and evolution of the spliceosomal peptidyl-prolyl cistrans isomerase Cwc27

C CH 3 N C COOH. Write the structural formulas of all of the dipeptides that they could form with each other.

Supplementary Figure 3 a. Structural comparison between the two determined structures for the IL 23:MA12 complex. The overall RMSD between the two

7.012 Problem Set 1 Solutions

Translation. A ribosome, mrna, and trna.

Lecture 15: Realities of Genome Assembly Protein Sequencing

UNIT TWELVE. a, I _,o "' I I I. I I.P. l'o. H-c-c. I ~o I ~ I / H HI oh H...- I II I II 'oh. HO\HO~ I "-oh

CHMI 2227 EL. Biochemistry I. Test January Prof : Eric R. Gauthier, Ph.D.

7.012 Problem Set 1. i) What are two main differences between prokaryotic cells and eukaryotic cells?

Ranjit P. Bahadur Assistant Professor Department of Biotechnology Indian Institute of Technology Kharagpur, India. 1 st November, 2013

Unraveling the degradation of artificial amide bonds in Nylon oligomer hydrolase: From induced-fit to acylation processes

Electronic Supplementary Information

Introduction to the Ribosome Overview of protein synthesis on the ribosome Prof. Anders Liljas

What makes a good graphene-binding peptide? Adsorption of amino acids and peptides at aqueous graphene interfaces: Electronic Supplementary

Supplemental Materials for. Structural Diversity of Protein Segments Follows a Power-law Distribution

Supplementary Information. Broad Spectrum Anti-Influenza Agents by Inhibiting Self- Association of Matrix Protein 1

Using Higher Calculus to Study Biologically Important Molecules Julie C. Mitchell

Massachusetts Institute of Technology Computational Evolutionary Biology, Fall, 2005 Notes for November 7: Molecular evolution

Peptide Syntheses. Illustrative Protection: BOC/ t Bu. A. Introduction. do not acid

C H E M I S T R Y N A T I O N A L Q U A L I F Y I N G E X A M I N A T I O N SOLUTIONS GUIDE

Advanced Topics in RNA and DNA. DNA Microarrays Aptamers

Supplementary Information Intrinsic Localized Modes in Proteins

Supplementary figure 1. Comparison of unbound ogm-csf and ogm-csf as captured in the GIF:GM-CSF complex. Alignment of two copies of unbound ovine

Properties of amino acids in proteins

Chapter 4: Amino Acids

Protein structure. Protein structure. Amino acid residue. Cell communication channel. Bioinformatics Methods

Desorption/Ionization Efficiency of Common Amino Acids in. Surface-assisted Laser Desorption/ionization Mass Spectrometry

Chemistry Chapter 22

Secondary Structure. Bioch/BIMS 503 Lecture 2. Structure and Function of Proteins. Further Reading. Φ, Ψ angles alone determine protein structure

Major Types of Association of Proteins with Cell Membranes. From Alberts et al

Viewing and Analyzing Proteins, Ligands and their Complexes 2

Ramachandran Plot. 4ysz Phi (degrees) Plot statistics

Exam III. Please read through each question carefully, and make sure you provide all of the requested information.

Peptides And Proteins

Could Genetic Code Be Understood Number Theoretically?

Energy and Cellular Metabolism

Read more about Pauling and more scientists at: Profiles in Science, The National Library of Medicine, profiles.nlm.nih.gov

Objective: You will be able to justify the claim that organisms share many conserved core processes and features.

Similarity or Identity? When are molecules similar?

Could Genetic Code Be Understood Number Theoretically?

Oxygen Binding in Hemocyanin

Amino Acids and Proteins at ZnO-water Interfaces in Molecular Dynamics Simulations: Electronic Supplementary Information

Amino acids: Optimization in underivatized formula and fragment patterns in LC/ESI-MS analysis. Yoshinori Takano (JAMSTEC)

Any protein that can be labelled by both procedures must be a transmembrane protein.

Potentiometric Titration of an Amino Acid. Introduction

A Plausible Model Correlates Prebiotic Peptide Synthesis with. Primordial Genetic Code

7.014 Problem Set 1. A nswers to this problem set are to be turned in. Problem sets will not be accepted late. Solutions will be posted on the web.

Physiochemical Properties of Residues

NMR Assignments using NMRView II: Sequential Assignments

Amino Acids and Peptides

Section Week 3. Junaid Malek, M.D.

Sequential resonance assignments in (small) proteins: homonuclear method 2º structure determination

Geometrical Concept-reduction in conformational space.and his Φ-ψ Map. G. N. Ramachandran

THE SEARCH FOR DESIGN:

CHEMISTRY ATAR COURSE DATA BOOKLET

A. Two of the common amino acids are analyzed. Amino acid X and amino acid Y both have an isoionic point in the range of

Diastereomeric resolution directed towards chirality. determination focussing on gas-phase energetics of coordinated. sodium dissociation

L L. Figure by MIT OCW.

Final Chem 4511/6501 Spring 2011 May 5, 2011 b Name

Solutions In each case, the chirality center has the R configuration

NMR study of complexes between low molecular mass inhibitors and the West Nile virus NS2B-NS3 protease

Meeting the challenges of representing large, modified biopolymers

Unification of Four Approaches to the Genetic Code

1. Contains the sugar ribose instead of deoxyribose. 2. Single-stranded instead of double stranded. 3. Contains uracil in place of thymine.

Studies Leading to the Development of a Highly Selective. Colorimetric and Fluorescent Chemosensor for Lysine

Bahnson Biochemistry Cume, April 8, 2006 The Structural Biology of Signal Transduction

PROTEIN SECONDARY STRUCTURE PREDICTION: AN APPLICATION OF CHOU-FASMAN ALGORITHM IN A HYPOTHETICAL PROTEIN OF SARS VIRUS

Exam I Answer Key: Summer 2006, Semester C

Aoife McLysaght Dept. of Genetics Trinity College Dublin

Resonance assignments in proteins. Christina Redfield

B O C 4 H 2 O O. NOTE: The reaction proceeds with a carbonium ion stabilized on the C 1 of sugar A.

7.014 Quiz I Handout

Get started on your Cornell notes right away

Deciphering the Rules for Amino Acid Co- Assembly Based on Interlayer Distances

April, The energy functions include:

Supporting information to: Time-resolved observation of protein allosteric communication. Sebastian Buchenberg, Florian Sittel and Gerhard Stock 1

Videos. Bozeman, transcription and translation: Crashcourse: Transcription and Translation -

Energy Minimization of Protein Tertiary Structure by Parallel Simulated Annealing using Genetic Crossover

Background BIOCHEMISTRY LAB CHE-554. Experiment #1 Spectrophotometry

Protein Data Bank Contents Guide: Atomic Coordinate Entry Format Description. Version Document Published by the wwpdb

HSQC spectra for three proteins

Problem Set 1

Protein Struktur. Biologen und Chemiker dürfen mit Handys spielen (leise) go home, go to sleep. wake up at slide 39

From Gene to Protein

So, what are special sciences? ones that are particularly dear to the author? ( Oh dear. I am touched. Psychology is just, so, well, special!

12/6/12. Dr. Sanjeeva Srivastava IIT Bombay. Primary Structure. Secondary Structure. Tertiary Structure. Quaternary Structure.

TRANSLATION: How to make proteins?

Sequence Alignments. Dynamic programming approaches, scoring, and significance. Lucy Skrabanek ICB, WMC January 31, 2013

Course Notes: Topics in Computational. Structural Biology.

Chem 250 Evening Exam 2

Chapter 4. Basic Set Theory. 4.1 The Language of Set Theory

Supporting information

Transcription:

Relations Supplement to Chapter 2 of Steinhart, E. (2009) More Precisely: The Math You Need to Do Philosophy. Broadview Press. Copyright (C) 2009 Eric Steinhart. Non-commercial educational use encouraged! All others uses prohibited. (Version 1) 1. The Genetic Code The machinery of the cell translates strands of RNA into proteins. A strand of RNA is a sequence of bases. There are four types of bases: Uracil, Cytosine, Adenosine, and Guanine. We can refer to each type by its first letter. Thus the set of base types is: B = {U, C, A, G}. The translation machinery of the cell works with triples of types of RNA bases. Any triple of RNA bases is known as a codon. The set of codons is Codons = B 3 = B B B. The translation machinery associates each triple of RNA base types with either a type of amino acid or with an instruction like Stop. There are 21 types of amino acids. It s standard to refer to amino acid types using three letter abbreviations. For example, Ile is isoleucine. We give the abbreviations but not the full names. If we include the Stop instruction in the set of amino acid types, we get Aminos = { Ala, Arg, Asn, Asp, Cys, Gln, Glu, Gly, His, Ile, Leu, Lys, Met, Phe, Pro, Ser, Thr, Trp, Tyr, Val, Stop}. The association from codons to amino acid types (including the Stop instruction) is known as the genetic code. It is a function of the form ƒ: Codons Aminos. The genetic code is displayed in Table 1. Each row is the first base in a codon; each column is the second base; each row in a cell is for the third base. 1

U C A G U C A G U Phe U Ser U Tyr U Cys C Phe C Ser C Tyr C Cys A Leu A Ser A Stop A Stop G Leu G Ser G Stop G Trp U Leu U Pro U His U Arg C Leu C Pro C His C Arg A Leu A Pro A Gln A Arg G Leu G Pro G Gln G Arg U Ile U Thr U Asn U Ser C Ile C Thr C Asn C Ser A Ile A Thr A Lys A Arg G Met G Thr G Lys G Arg U Val U Ala U Asp U Gly C Val C Ala C Asp C Gly A Val A Ala A Glu A Gly G Val G Ala G Glu G Gly Table 1. The genetic code. 2. Vagueness and Fuzzy Sets Given some universe of individuals U, we can define sets over that collection using characteristic functions. A characteristic function is a function from the universe U to the numbers in {0, 1}. Or, equivalently, from U to the truth-values in {F, T}. Suppose ƒ is a function from U to {0, 1}. The function ƒ defines a set over U. For any x in the universe U, we define the membership of x in ƒ like this: x is a member of ƒ if ƒ(x) = 1; x is not a member of ƒ if ƒ(x) = 0. A characteristic function expresses the idea that an object is either a member of a set or else it is not a member of a set. It expresses the idea that a set is formed by some definite yes or no decisions. There s nothing new in this way of thinking about sets. It may happen that we want to define degrees of membership in a set. A predicate like being a prime number is clear enough: either x is a prime number, or it is not. But in the world of ordinary language, predicates are often not so clear. Predicates are often vague. The most famous example of a vague predicate is baldness. It s clear that old Abe is bald. And Bob has a head covered with hair. Bob is definitely not bald. But what about 2

Charles? Most of his hair is gone. But he still has some. Is he bald or not? It s probably not worth fighting about it. It s almost certainly better to say that there are degrees of baldness. Let B measure the degree to which x is bald. Then: B(Abe) = 1 since Abe is bald; B(Bob) = 0 since Bob is not bald; B(Charles) = 1/2 since he s sort of bald. Fuzzy Sets. Given any universe of objects U, a fuzzy set over U is a function from U to some set of degrees. At one extreme, the degrees {0, 1} define just ordinary sets. At another extreme, we can use real numbers between 0 and 1 as degrees. The set of real numbers between 0 and 1 inclusive is denoted [0, 1]. If we re using these as degrees of membership, then a fuzzy set over U is a function from U to [0, 1]. Fuzzy sets can be used to analyze vague predicates (like bald or tall). And they can also be used to analyze vague relations. For example, van Inwagen (1995: 221 227) says that the parthood relation is vague, and he uses fuzzy sets to analyze that vagueness. 3. Quine s Democritean Worlds Quine defines a set of numerical structures that have been called Democritean worlds (Quine, 1969: 147-152). Cellular automata like the game of life (and lattice gasses and lattice Boltzmann structures) are Democritean worlds with finitely extended and divided space-time. A Democritean world is a "cosmic distribution of binary choices (occupied vs. empty) over the points of space-time" (Quine, 1969: 155). Occupied means occupied by matter or energy. In the game of life, occupied is the cell-value 1 while empty is 0. We cite Quine's development of Democritean worlds at some length: let us accept for a while an old-fashioned physics according to which, as Democritus held, all atoms are homogeneous in substance and differ only in size, shape, position, and motion. Let us suppose further that space is Euclidean.... there remain for each point in space just two possible states: the point may lie within some particle or it may be empty. Each distribution of these states over all the points of space may be seen... as a possible momentary world state.... The virtue of taking a possible world state as an exhaustive assignment of 'occupied' or 'empty,' 'yes' or 'no,' to points of space, is that the assignment... can simply be identified with the aggregate of the occupied points themselves. Each portion of space, big or little, compact or scattered, may thus be accounted a possible world state. Realization of that world state would consist in there being matter at each of those points of space and none elsewhere.... we can by-pass the points by adopting a system of coordinates and speaking of triples of real numbers. Our ontology then requires only portions of matter, as individuals, and the usual superstructure of classes of individuals, classes of classes, and so on.... a possible world state becomes simply any class of triples of real numbers. To any such class we equate what, intuitively, would be called the possible world state 3

that has matter at just the positions given by number triples in the class.... This explication of possible worlds is predicated on the view that every possible world has homogeneous matter, Euclidean space, and a time dimension independent of frame of reference. these traits, being then traits of all possible worlds, rate as necessary. This view is debatable, since the real world is believed to lack all three traits. One thing good about this version of possible worlds, nevertheless, is that it stays within a clear extensional ontology. I expect that while still staying within these terms we could complicate it to suit current physics. (Quine, 1969: 147-152) Quine starts out by saying that a momentary state of a Democritean world is a pair (P, ƒ) where P is a set of spatial points and ƒ is a function from P to {0, 1}. In physics, a field is a function that associates every point in space with an object. So the function ƒ is a matter field. For any point p in space, Quine says ƒ(p) = 0 if p does not lie within any material particle (it is empty); ƒ(p) = 1 if p lies within some material particle (it is occupied). The set of all possible momentary world states is the set of all functions from P to {0, 1}. If we use S to denote the set of momentary world states, then S = { ƒ ƒ: P {0, 1}}. Presumably, a Democritean world is temporally extended. It is a time-ordered series of momentary states. It is therefore a quadruple of the form (P, T, S, W) where P is a set of points of space; T is a set of moments of time; S is the set of possible momentary world states; and W is a function that associates each moment of time with a momentary world state. But Quine wants to take this further. Each matter field is a characteristic function over the points of space. It selects some points and rejects others. Every matter field corresponds to a region of space. A region of space is a subset of P. So every matter field corresponds to a subset of P. On the basis of this correspondence, Quine redefines momentary world states as regions of space. The set of all possible momentary world states is now the power set of P. He says S = pow P. Quine further says that points can be replaced by triples of real numbers. Assuming that R refers to the set of real numbers, the set of points is now redefined as P = R 3. Following this redefinition of spatial points, we can replace each moment of time by a real number. Thus 4

T = R. Given these replacements, a Democritean world is a quadruple of the form (R 3, R, pow R 3, W). If we follow the Quinean replacements, then we have redefined a Democritean world to be a purely set-theoretic structure over the real numbers. Of course, you might raise some objections to these Quinean replacements. The most obvious objection is that a matter field is not the same kind of thing as a region of space. It is true that there is a 1-1 correspondence between the occupied portions of matter fields and the regions of space. But that hardly entails that they are identical. The same sort of objection can be raised against the replacement of points with triples of real numbers. Do you think these objections are the same in both cases? What are your replies? 5

References Quine, W. V. (1969) Ontological Relativity and Other Essays. New York: Columbia University Press. van Inwagen, P. (1995) Material Beings. Ithaca, NY: Cornell University Press. 6