Inferring the in vivo looping properties of DNA

Similar documents
3.B.1 Gene Regulation. Gene regulation results in differential gene expression, leading to cell specialization.

Bi 8 Lecture 11. Quantitative aspects of transcription factor binding and gene regulatory circuit design. Ellen Rothenberg 9 February 2016

Biology. Biology. Slide 1 of 26. End Show. Copyright Pearson Prentice Hall

Supplemental Material

Prokaryotic Gene Expression (Learning Objectives)

Chapter 15 Active Reading Guide Regulation of Gene Expression

CHAPTER : Prokaryotic Genetics

DNA/RNA structure and packing

Control of Gene Expression in Prokaryotes

On the Use of the Hill Functions in Mathematical Models of Gene Regulatory Networks

Topic 4 - #14 The Lactose Operon

13.4 Gene Regulation and Expression

Villa et al. (2005) Structural dynamics of the lac repressor-dna complex revealed by a multiscale simulation. PNAS 102:

12-5 Gene Regulation

Genetic transcription and regulation

Complete all warm up questions Focus on operon functioning we will be creating operon models on Monday

Lecture 7: Simple genetic circuits I

Lecture 4: Transcription networks basic concepts

Regulation of Gene Expression

six lectures on systems biology

the noisy gene Biology of the Universidad Autónoma de Madrid Jan 2008 Juan F. Poyatos Spanish National Biotechnology Centre (CNB)

CS-E5880 Modeling biological networks Gene regulatory networks

REGULATION OF GENE EXPRESSION. Bacterial Genetics Lac and Trp Operon

Name Period The Control of Gene Expression in Prokaryotes Notes

UNIT 6 PART 3 *REGULATION USING OPERONS* Hillis Textbook, CH 11

Multistability in the lactose utilization network of Escherichia coli

Gene Regulation and Expression

56:198:582 Biological Networks Lecture 8

Basic modeling approaches for biological systems. Mahesh Bule

D. Combinatorial transcriptional control. Complex eukaryotic promoter

Biomolecular Feedback Systems

Prokaryotic Gene Expression (Learning Objectives)

Topic 4: Equilibrium binding and chemical kinetics

Unit 3: Control and regulation Higher Biology

Name: SBI 4U. Gene Expression Quiz. Overall Expectation:

Bacterial Genetics & Operons

Written Exam 15 December Course name: Introduction to Systems Biology Course no

Gene regulation I Biochemistry 302. Bob Kelm February 25, 2005

2. Mathematical descriptions. (i) the master equation (ii) Langevin theory. 3. Single cell measurements

Boolean models of gene regulatory networks. Matthew Macauley Math 4500: Mathematical Modeling Clemson University Spring 2016

Introduction. Gene expression is the combined process of :

Gene regulation II Biochemistry 302. February 27, 2006

Prokaryotic Regulation

Control of Prokaryotic (Bacterial) Gene Expression. AP Biology

GLOBEX Bioinformatics (Summer 2015) Genetic networks and gene expression data

Warm-Up. Explain how a secondary messenger is activated, and how this affects gene expression. (LO 3.22)

Computational Cell Biology Lecture 4

APGRU6L2. Control of Prokaryotic (Bacterial) Genes

Modeling Multiple Steady States in Genetic Regulatory Networks. Khang Tran. problem.

REVIEW SESSION. Wednesday, September 15 5:30 PM SHANTZ 242 E

AP Bio Module 16: Bacterial Genetics and Operons, Student Learning Guide

GENE REGULATION AND PROBLEMS OF DEVELOPMENT

L3.1: Circuits: Introduction to Transcription Networks. Cellular Design Principles Prof. Jenna Rickus

JCB. Modeling network dynamics: the lac operon, a case study. The Journal of Cell Biology. Mini-Review

Multistability in the lactose utilization network of E. coli. Lauren Nakonechny, Katherine Smith, Michael Volk, Robert Wallace Mentor: J.

Chapter 16 Lecture. Concepts Of Genetics. Tenth Edition. Regulation of Gene Expression in Prokaryotes

GENES AND CHROMOSOMES III. Lecture 5. Biology Department Concordia University. Dr. S. Azam BIOL 266/

THE EDIBLE OPERON David O. Freier Lynchburg College [BIOL 220W Cellular Diversity]

Welcome to Class 21!

Initiation of translation in eukaryotic cells:connecting the head and tail

Biological Pathways Representation by Petri Nets and extension

Lysogeny maintenance: a matter of looping. Laura Finzi

Bi 1x Spring 2014: LacI Titration

CHAPTER 13 PROKARYOTE GENES: E. COLI LAC OPERON

RNA Synthesis and Processing

Genetic transcription and regulation

Sparse, stable gene regulatory network recovery via convex optimization

Modeling network dynamics: the lac operon, a case study

4. Why not make all enzymes all the time (even if not needed)? Enzyme synthesis uses a lot of energy.

Gene Switches Teacher Information

Lecture 18 June 2 nd, Gene Expression Regulation Mutations

Tuning Transcriptional Regulation through Signaling: A Predictive Theory of Allosteric Induction

Regulation of Gene Expression

Hybrid Model of gene regulatory networks, the case of the lac-operon

A(γ A D A + γ R D R + γ C R + δ A )

Co-ordination occurs in multiple layers Intracellular regulation: self-regulation Intercellular regulation: coordinated cell signalling e.g.

Controlling Gene Expression

Network motifs in the transcriptional regulation network (of Escherichia coli):

Bioinformatics: Network Analysis

32 Gene regulation, continued Lecture Outline 11/21/05

PROTEIN SYNTHESIS INTRO

When do diffusion-limited trajectories become memoryless?

Molecular Biology, Genetic Engineering & Biotechnology Operons ???

+ regulation. ribosomes

Dynamics of the Mixed Feedback Loop Integrated with MicroRNA

Chapter 18: Control of Gene Expression

Quantitative Characterization of Combinatorial Transcriptional Control of the Lactose Operon of E. coli

Brief contents. Chapter 1 Virus Dynamics 33. Chapter 2 Physics and Biology 52. Randomness in Biology. Chapter 3 Discrete Randomness 59

Transport of single molecules along the periodic parallel lattices with coupling

Gene regulation II Biochemistry 302. Bob Kelm February 28, 2005

1 Periodic stimulations of the Incoherent Feedforward Loop network

Lesson Overview. Gene Regulation and Expression. Lesson Overview Gene Regulation and Expression

Big Idea 3: Living systems store, retrieve, transmit and respond to information essential to life processes. Tuesday, December 27, 16

Noise is often perceived as being undesirable and unpredictable;

1 Exploring the expression of an unregulated gene

Lecture 6: The feed-forward loop (FFL) network motif

Sequence Dependence of Transcription Factor-Mediated DNA Looping: Supplementary Material

Modeling DNA loops using the theory of elasticity

DNA Structure. Voet & Voet: Chapter 29 Pages Slide 1

Are DNA Transcription Factor Proteins Maxwellian Demons?

Transcription:

1 Inferring the in vivo looping properties of DNA Leonor Saiz, J. Miguel Rubi *, and Jose M. G. Vilar Integrative Biological Modeling Laboratory, Computational Biology Program, Memorial Sloan-Kettering Cancer Center, 307 East 63rd Street, New York, NY 10021, USA * On leave from: Departament de Fisica Fonamental, Universitat de Barcelona, Diagonal 647, 08028 Barcelona, Spain Classification: BIOLOGICAL SCIENCES: Biophysics Corresponding author: Jose M.G. Vilar Computational Biology Center Memorial Sloan-Kettering Cancer Center 307 East 63rd Street New York, NY 10021, USA Tel: 646-735-8085 Fax: 646-735-0021 E-mail: vilar@cbio.mskcc.org Manuscript information: 16 text pages and 4 Figures. Word and character counts: Abstract: 120 words. Total number of characters in paper: 26533 (18097 characters for text + 7596 characters for figures + 720 characters for allowance + 120 characters for equations)

2 ABSTRACT The free energy of looping DNA by proteins and protein complexes determines to what extent distal DNA sites can affect each other. We inferred its in vivo value through a combined computational-experimental approach for different lengths of the loop and found that, in addition to the intrinsic periodicity of the DNA double helix, the free energy has an oscillatory component with about half the helical period. Moreover, the oscillations have such an amplitude that the effects of regulatory molecules become strongly dependent on their precise DNA positioning and yet easily tunable by their cooperative interactions. These unexpected results can confer to the physical properties of DNA a more prominent role at shaping the properties of gene regulation than previously thought.

3 Introduction The cell is a densely packed dynamic structure made of thousands of different molecular species that orchestrate their interactions to form a functional unit. Such complexity poses a strong barrier for experimentally characterizing the cellular components: not only the properties of the components can change when studied in vitro outside the cell but also the in vivo probing of the cell can perturb the process under study (1). Here we use computational modeling to obtain the properties of the in vivo unperturbed components at the molecular level from physiological measurements at the cellular level. Explicitly, we infer the in vivo free energies of DNA looping from enzyme production in the lac operon (2). The formation of DNA loops by the binding of proteins at distal DNA sites plays a fundamental role in many cellular processes, such as transcription, recombination, and replication (3-5). In gene regulation, proteins bound far away from the genes they regulate can be brought to the initiation of transcription region by looping the intervening DNA. The free energy cost of this process determines how easily DNA can loop and therefore the extent to which distal DNA sites can affect each other (5). In the lac operon, there is a repressor molecule that regulates transcription by binding specifically to DNA sites known as operators and preventing the RNA polymerase from transcribing the genes. DNA looping allows the repressor to bind to two operators simultaneously, leading to an increase in repression of transcription. This increase, characterized by the repression level, can be connected to the free energy of looping DNA by a recent model for transcription regulation by the lac repressor (6). The distance between operators determines the length of the DNA loop and affects the repression level

4 through the changes in the free energy of looping. For inter-operator distances from 57.5 bp to 98.5 bp, Muller et al. (7) systematically varied the distance between two operators in increments of 1 bp and measured the in vivo repression levels under conditions similar to wild-type. These physiological measurements of enzyme production in E. Coli cell populations allowed us to infer here the in vivo looping properties of DNA with unprecedented accuracy. Background and Methods Transcription regulation in the lac operon The lac operon consists of a regulatory domain and three genes required for the uptake and catabolism of lactose. Binding of the lac repressor to the main operator O1 prevents the RNA polymerase from binding to the promoter and transcribing the genes. There are also two auxiliary operators outside the regulatory region, O2 and O3, to which the repressor can bind without preventing transcription. They enhance repression of transcription by increasing the ability of the repressor to bind O1. This effect is mediated by DNA looping, which allows the simultaneous binding of the repressor to the main and one auxiliary operator (2). To avoid the formation of multiple loops, experiments are typically performed with just the main and one auxiliary operator, as in Muller et al. (7). In such a case, there is transcription when the system in a state in which: (i) none of the operators is occupied,

5 (ii) a repressor is bound to just the auxiliary operator, and there is no transcription when: (iii) a repressor is bound to just the main operator, (iv) a repressor is bound to both the main and the auxiliary operators by looping the intervening DNA, and (v) one repressor is bound to the main operator and another repressor, to the auxiliary operator. In Fig. 1, we illustrate these five possible states for the lac constructs used in the experiments of Ref. (7), which consisted of the main operator O1 and just an upstream auxiliary operator, Oid, with the sequence of the ideal operator. Repression level The repression level, R loop, is a dimensionless quantity used to measure the extent of repression of a gene. It is defined as the ratio of the unrepressed transcription rate (t max ) to the actual transcription rate (t act ): R loop = t max t act. The actual transcription rate is the unrepressed transcription rate times the fraction of time that the main operator is free. Therefore, the repression level is the inverse of the probability that the system is in the states (i) and (ii): R loop = 1 P i + P ii, where P indicates the probability for the system to be in the state denoted by its subscript.

6 Statistical thermodynamics analysis Statistical thermodynamics connects the probability P k of a state k with its standard free energy ΔG k through P k = 1 Z [N]n k e ΔG k / RT, where [N] is the concentration of repressors expressed in moles; n k is the number of repressors considered in the state k; RT is the gas constant, R, times the absolute temperature, T; and Z = is the normalization factor (8). P k k This relationship between probabilities and free energies leads to R loop = 1 + [N]e ΔG O1 / RT + [N]e ΔG ( O1 +ΔG Oid +ΔG l )/ RT + [N] 2 e ΔG ( O1 +ΔG Oid )/ RT, where ΔG 1 + [N]e ΔG Oid / RT O1 and ΔG Oid are the standard free energies of binding of the repressor to the O1 and Oid operators, respectively, and ΔG l is the free energy of looping (6). The free energy of looping is defined from the free energy of the looped state (iv), ΔG iv, which can be decomposed in binding to O1 and Oid and looping contributions: ΔG iv =ΔG O1 +ΔG Oid +ΔG l (Figure 2). The free energies of the other states are: ΔG i = 0, ΔG ii =ΔG Oid, ΔG iii =ΔG O1, and ΔG v =ΔG O1 +ΔG Oid. For a strong auxiliary operator ([N]e ΔG Oid / RT >> 1), as in the experimental conditions analyzed (7), for which [N]e ΔG Oid / RT 135, the previous expression simplifies to R loop = 1 + e ΔG l / ( RT + [N])e ΔG O1 / RT. When the auxiliary operator is not present ( ΔG l = infinity ), the repression level is given by R noloop = 1+ [N]e ΔG O1 / RT.

7 Results and Discussion The mathematical expressions connecting the repression level with the molecular properties for a strong auxiliary operator, R loop = 1 + e ΔG l / ( RT + [N])e ΔG O1 / RT, and for no DNA looping, R noloop = 1 + [N]e ΔG O1 / RT, have the remarkable property that they can be combined to give the free energy of looping ΔG l as a function of the physiologically measurable quantities R loop and R noloop : ΔG l = RT ln R loop R noloop R noloop 1 [N]. (1) We used this expression with measured repression levels (7) (Figure 3a) to obtain the in vivo free energies of looping DNA by the lac repressor for different distances between operators (Figure 3b). This mathematical transformation reveals a wealth of details that were not evident in the repression level curves (7, 9). As expected, the free energy oscillates with the helical periodicity of DNA (7, 9-11) since the operators must have the right phase to bind simultaneously to the repressor. The free energy in a cycle, however, behaves asymmetrically, increasing much faster (~3 bp) than decreasing (~8 bp) as the length of the loop increases. A Fourier analysis of the oscillations (Figure 3c) indicates that, in addition to the component with the helical period (~10.9 bp), there is a second representative component with period ~5.6 bp, which leads to the observed asymmetry (Figure 3d). This second component has a strong statistical significance (see Appendix I) and is also present for other experimental data (see Appendix II). Previous in vivo experiments and analysis for longer loops (>100 bp) used continuum elastic models of DNA to fit the observed repression levels (12-14). Therefore, the conclusions that can be extracted from those studies are constrained by the properties of

8 the DNA models used, which are unable to account for the asymmetry we observe in the oscillations for short loops (Figure 3b). Our analysis, in contrast, does not depend on any underlying DNA model and is able to capture novel properties, not considered by current DNA looping models. Recent structural and computational studies on DNA (15, 16) indicate that the loop can be bent and twisted non-uniformly because of different contributions, as for instance, the anisotropic flexibility of DNA, local features resulting from the DNA sequence, and interactions with the lac repressor and other DNA binding proteins. Therefore, the behavior we observe might result from the detailed molecular structure of DNA (15, 16) or the intrinsic elasticity of the lac repressor (17). Another remarkable property of the free energy is that the amplitude of the oscillations is ~2.5 kcal/mol. Such an amplitude, which is much smaller than predicted from current DNA models (15, 18) and similar to the typical free energies of interaction between regulatory molecules (19), demonstrates that the effects of regulatory molecules are strongly dependent on their precise DNA positioning and at the same time easily tunable and modifiable by their cooperative interactions. Calibrated models, as the one we have used here, are widely used in physics and engineering as measuring tools. A well-known example is the measurement of the temperature from the height of the liquid column of a thermometer. In the prototypical model of a thermometer, the temperature in Celsius is given by T = h h freeze 100, h boil h freeze where h is the height of the column and the subscripts boil and freeze label the heights at which water boils and freezes, respectively (20). Our results show that analogous measuring tools can also be implemented in cellular systems to infer molecular properties from the observed physiological behavior. Explicitly, the lac operon can function as a sensor to measure in vivo free energies of DNA looping. The results we have obtained

9 with this combined computational-experimental approach challenge the universality of current continuum elastic DNA models at short distances and point to an active role of the physical properties of DNA at shaping the properties of gene regulation.

10 Appendix I: Statistical reliability of the half-helical-period component To study the effects of noise in the measurements we consider that the free energy Δ G l (l) for a loop length l is affected by a random quantity ξ(l) with zero mean: ΔG l (l) =Δ G l (l) + ξ(l). The Fourier transform of the deterministic and random components at the frequency v 12 of the half-helical-period component are defined as F(Δ G l,v 12 ) = 1 M Δ G l (l)exp(2πiv 12 l) and F(ξ,v 12 ) = 1 M ξ(l)exp(2πiv 1 2 l), l respectively, where M is the number of measurements. l The power spectrum is defined as the squared modulus of the Fourier transform. Therefore, the relative contribution of the measurement noise is NSR 2 = F(ξ,v 12 )F(ξ, v 12 ) F(Δ G l,v 12 )F(Δ G l, v 12 ) σ 2 MA, 2 12 where σ 2 = 1 ξ(l) 2 is the variance of the random contribution to the measured M 1 l free energies and 2A 12 is the amplitude of the half-helical-period component. From the variance of the free energies in Table 2 of reference (6) one can estimate σ = 0.15 kcal/mol. From Muller et al.'s data we obtain 2A 12 = 0.30 kcal/mol (Figure

11 3d), v 12 = 5.6 bp (Figure 3c), and M = 41 (Figure 3a). Therefore, the relative contribution of the measurement noise to the power spectrum at the 5.6 bp frequency is NSR = 0.16, which indicates that only 16% of amplitude of the half-helical-period component can be attributed to measurement errors and that the probability of obtaining the 5.6 bp peak of the power spectrum just by chance is effectively zero. The reliability of the half-helical-period component is much higher than that of single measurements because the evaluation of this component incorporates all the 41 measurements of the free energy for the different loop lengths. For large M, approximately M > 10, the error can be approximated by a Gaussian distribution and the probability of obtaining a periodic component as high as A 12 because of the measurement errors is given by P(A 12 ) = 2 2πσ 2 / M infinity e M x 2 2σ 2 dx. A 12 Only for high values of the measurement error does this probability become appreciable. For instance, in the previous case, even if the measurement errors had a standard deviation σ = 0.3 kcal/mol the probability of obtaining a periodic component with amplitude 0.30 kcal/mol or higher as a result of the measurement noise would be below 0.2% ( P(0.3) = 0.0014 ).

12 Appendix II: Corroboration of the main results The main results we have obtained based on the classical experiments of Muller et al (7) are corroborated by the application of our method to the recent experimental data of Becker et al. (21). These new experiments differ from Muller et al.'s mainly in quantitative details. Becker et al. used a weaker main operator, a lower number or repressors per cell, and measured the repression levels for fewer loop lengths. Because all these differences have a tendency to increase the statistical fluctuations, the results obtained with this new data (Figure 4) are expected to be slightly less precise than the ones we obtained in Figure 3. Similarly as we did in Figure 3, we used Equation 1 with the new measured repression levels (21) (Figure 4a) to obtain the in vivo free energies of looping DNA by the lac repressor for different distances between operators (Figure 4b). A Fourier analysis of the oscillations (Figure 4c) also reveals the component with the helical period (~11.6 bp) and a second representative component with period ~5.2 bp, which lead to the asymmetric behavior (Figure 4d).

13 References 1. Alberts, B. (2002) Molecular biology of the cell (Garland Science, New York). 2. Müller-Hill, B. (1996) The lac Operon: a short history of a genetic paradigm (Walter de Gruyter, Berlin; New York). 3. Schleif, R. (1992) Annu Rev Biochem 61, 199-223. 4. Adhya, S. (1989) Annu Rev Genet 23, 227-50. 5. Vilar, J. M. G. & Saiz, L. (2005) Curr Opin Genet Dev 15, 136-44. 6. Vilar, J. M. G. & Leibler, S. (2003) J Mol Biol 331, 981-9. 7. Muller, J., Oehler, S. & Muller-Hill, B. (1996) J Mol Biol 257, 21-9. 8. Hill, T. L. (1960) An introduction to statistical thermodynamics (Addison-Wesley Pub. Co., Reading, Mass.). 9. Lee, D. H. & Schleif, R. F. (1989) Proc Natl Acad Sci U S A 86, 476-80. 10. Dunn, T. M., Hahn, S., Ogden, S. & Schleif, R. F. (1984) Proc Natl Acad Sci U S A 81, 5017-20. 11. Hochschild, A. & Ptashne, M. (1986) Cell 44, 681-7. 12. Bellomy, G. R., Mossing, M. C. & Record, M. T., Jr. (1988) Biochemistry 27, 3900-6. 13. Bintu, L., Buchler, N. E., Garcia, H. G., Gerland, U., Hwa, T., Kondev, J. & Phillips, R. (2005) Curr Opin Genet Dev 15, 116-24. 14. Bintu, L., Buchler, N. E., Garcia, H. G., Gerland, U., Hwa, T., Kondev, J., Kuhlman, T. & Phillips, R. (2005) Curr Opin Genet Dev 15, 125-35.

14 15. Olson, W. K., Swigon, D. & Coleman, B. D. (2004) Philos Transact A Math Phys Eng Sci 362, 1403-22. 16. Richmond, T. J. & Davey, C. A. (2003) Nature 423, 145-50. 17. Villa, E., Balaeff, A. & Schulten, K. (2005) Proc Natl Acad Sci U S A 102, 6783-8. 18. Shimada, J. & Yamakawa, H. (1984) Macromolecules 17, 689-698. 19. Ptashne, M. & Gann, A. (2002) Genes & signals (Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y.). 20. Resnick, R., Halliday, D. & Krane, K. S. (1992) Physics (Wiley, New York). 21. Becker, N. A., Kahn, J. D. & Maher, L. J., 3rd (2005) J Mol Biol 349, 716-30.

15 Figure Legends Figure 1: Schematic representation of the five possible states for lac repressor binding to two operators for the lac constructs used in the analyzed experiments by Muller et al. (7). The lac Z gene, downstream the main operator, is repressed when the lac repressor (in gray with black contour line) is bound to the main operator: states (iii), (iv), and (v); and unrepressed when the main operator is unoccupied: states (i) and (ii). The thick black line represents the DNA and the two lac operators are shown as white (auxiliary operator) and highlighted (main operator) boxes. The dotted thick line indicates the different lengths of the spacer DNA between operators. Figure 2: Cartoon of the lac repressor (in gray with black contour line) bound simultaneously to the main (O1 ) and one auxiliary operator (Oid, with the sequence of the ideal operator), which are represented by white rectangular boxes. Binding of the repressor to O1 represses the lac Z gene in the experiments (7). The simultaneous binding to O1 and Oid with free energies ΔG O1 and ΔG Oid, respectively, competes with the unfavorable process of forming a loop ( ΔG l ). Using this type of lac constructs, Muller et al. (7) measured the in vivo repression of the lac Z gene as a function of the distance between the centers of the O1 and Oid operators. Figure 3: From enzyme content of E. coli populations to in vivo DNA looping free energies. (a) Measured repression levels, R loop, as a function of the distance between operators from Muller et al. (7) (b) Inferred free energies of looping using Equation 1 and R loop as in panel a, with R noloop = 135 and [N] = 75 nm (7) (c) Fourier analysis of the operator distance dependence of the free energy. (d) Behavior of the two main periodic components of the free energy of looping:

16 ΔG l (l) = 8.79 0.68sin(2πl /10.9 0.94) 0.30sin(2πl / 5.6 2.32), where l is the distance between operators. Figure 4: Corroboration of the main results. (a) Measured repression levels, R loop, as a function of the distance between operators from Figure 4a of Becker et al. (21). (b) Inferred free energies of looping using Equation 1 and R loop as in panel a, with R noloop = 5 and [N] = 15 nm. (c) Fourier analysis of the operator distance dependence of the free energy. (d) Behavior of the two main periodic components of the free energy of looping: ΔG l (l) = 9.58 0.72sin(2πl /11.6 3.08) 0.28sin(2πl / 5.2 3.31), where l is the distance between operators.

(i) (ii) (iii) (iv) (v) Figure 1. Saiz, Rubi, and Vilar

O id G Oid G l G O1 O 1 lac Z Figure 2. Saiz, Rubi, and Vilar

R loop 6000 4000 2000 (a) Repression level (physiology) G l (kcal/mol) 0 50 60 70 80 90 100 Operator distance (bp) 10 9 8 (b) 50 60 70 80 90 100 Operator distance (bp) Computational model Free energy of looping DNA Fourier analysis Amplitude 2 (a.u.) (c) 10.9 bp 5.6 bp 0 0.1 0.2 Frequency (1/bp) Power spectrum Feature extraction G l (kcal/mol) 10 9 8 (d) Two-component free energy 50 60 70 80 90 100 Operator distance (bp) Figure 3. Saiz, Rubi, and Vilar

R loop 150 100 50 (a) G l (kcal/mol) 0 50 60 70 80 90 100 Operator distance (bp) 11 10 9 (b) 8 50 60 70 80 90 100 Operator distance (bp) G l (kcal/mol) Amplitude 2 (a.u.) 11 10 9 (c) 0 0.1 0.2 Frequency (1/bp) (d) 11.6 bp 5.2 bp 8 50 60 70 80 90 100 Operator distance (bp) Figure 4. Saiz, Rubi, and Vilar