Supplementary Information

Size: px
Start display at page:

Download "Supplementary Information"

Transcription

1 Supplementary Information 1

2 List of Figures 1 Models of circular chromosomes. 2 Distribution of distances between core genes in Escherichia coli K12, arc based model. 3 Distribution of distances between core genes in Escherichia coli K12, circular model. 4 Periodograms without detrend: theoretical distribution and arc based model distribution. 5 Periodograms without symmetrization (with detrend), arc based model. 6 Effects of symmetrization and detrend on the theoretical distribution. 7 Periodograms computed on models displaying a signal of 117kb period. 8 Curves (non smoothed) tracing chromosomal distribution of genes along the left and right arcs in E. coli K12 with a period of 33kb. Core genes and genes in COGs classes are considered. 9 Curves (non smoothed) tracing chromosomal distribution of genes along the left and right arcs in E. coli K12 with a period of 117kb. Core genes and genes in COGs classes are considered. 10 Distribution of genes in a randomly generated genome. 11 Curves (non smoothed) tracing chromosomal distribution of genes along the left and right arcs in E. coli K12 with a period of 33kb. Transcription factors and all genes in COG classes are considered. 12 Curves (smoothed and non smoothed) tracing chromosomal distribution in all E. coli K12 genes along th left and the right arcs with a period of 33kb. 13 Curves tracing the number of non-empty periodic windows along the E. coli K12 chromosome and the average number of genes in these windows for the left and right arcs. 14 Pearson correlation curve and macrodomains. 15 Plot of transcription data obtained under two conditions of response to stress and of the SCCI* curve. 16 Sectors and Pearson correlation curve computed between the SCCI* curve and the log-phase Expression* curve, the acid-shock Expression* curve and the heat-shock Expression* curve. 17 Statistical analysis of sectors for 1000 randomly generated genomes. 18 Distribution of genes in Baba s dataset. 19 Functional distribution of genes on the chromosomal spiral of period 28571nt. List of Tables 1 Distribution of genes, core genes and ribosomal proteins located in leading and lagging strands. 2 Characteristics of the Escherichia coli K12 organism and genome. 3 Periods computed for the three arc based models and for the circular model on core genes. 4 Periods computed for the three arc based models and for the circular model, taking into account that a core gene (possibly several) might occur in an operon. Case I. 5 Periods computed for the three arc based models and for the circular model, taking into account that a core gene (possibly several) might occur in an operon. Case II. 6 Periodograms comparison between the Escherichia coli K12 genome and randomly generated genomes. 7 Sectors positions along the E. coli K12 chromosome. 8 SCCI* peaks coverage with a period of 117kb along the chromosome. 9 Coverage of Expression* peaks for periods of 33kb and 117kb. 10 Periods obtained (with c = 3) on several groups of genes for the two chromosomal models. 11 Description of Baba s and Gerdes datasets of essential genes. 12 Pearson correlation coefficients between pairs of curves in Fig.5A in the article. 2

3 Figure 1: Top: A circular chromosome is ideally split into two halves whose extremes correspond to the replication origin (ORI) and the (ideally well determined) termination site of replication (TER). The four strands are grouped into leading and lagging strands following the 5-3 DNA directionality (as indicated by the arrows in the figure). Left and right chromosome arcs are unambiguously determined by the orientation of the strand that has been sequenced. By definition, a leading strand is a DNA strand that is replicated continuously. In DNA replication, the strand that is made in the 5 to 3 direction by continuous polymerization at the 3 growing tip. The lagging strand is at the opposite side of the replication fork from the leading strand. Bottom: the four models considered in the article with the strands used to compute distances between genes located on them. 3

4 800 Escherichia coli str. K-12 substr. MG1655 occurences e+06 2e+06 distances Figure 2: Distribution of distances, along the two arcs of the E. coli K12 MG1655 chromosome for the full arc based model, between core genes with symmetrization. Lines in the grid are spaced every 33566nt. No smoothing has been applied to the data. 4

5 Figure 3: Circular model: histogram (top corner) and periodogram computed on the distribution of minimal distances between pairs of core genes in the E. coli K12 chromosome. The E. coli K12 histogram is truncated on the y-axis with an effect on the central column; the maximum y-value is at 706. Compare it to Figure 2B in the article. 5

6 Figure 4: Periodograms realized without detrend on the theoretical distribution and on E. coli K12 distance distribution with the full arc based model. Notice that for the E. coli K12 distance distribution, the maximum peak remains at with the same spectral value. Compare it to Figure 2 in the article (the curves are essentially the same). 6

7 Figure 5: Periodogram realized without symmetrization (with detrend) on the full arc based model. Comparison with Figure 2B in the article: the maximum peak remains at but the spectral values decrease considerably. 7

8 A B C D Figure 6: Effects of symmetrization and detrend on the theoretical distribution. A: No symmetrization is applied, detrend is realized. The curve is non linear and this is due to the artificial step induced by the triangle shape in the FFT analysis. B: No detrend and no symmetrization operations are applied. The curve appears identical to the one in Figure 2A in the article, but zooming on the curve displays a sinusoidal shape as shown in C. C: two zooms on the curve in B, where a sinusoidal shape is present. D: two zooms on the curve (obtained without detrend) in SI Fig. 4A, absence of the sinusoidal shape. Notice that zooms on the curve in Figure 2 in the article look essentially the same as the ones reported here for SI Fig. 4. 8

9 Figure 7: Periodograms computed on models displaying a signal at 117kb. A: COGs classes ABJKL extended with rrnas were analyzed for the circular model. B and C: The signal at 117kb is not found for the full set of genes in COGs classes ABJKL but it is recovered for COGs classes ABJKL where only genes with a codon bias above the mean are considered (that is, a gene g is considered in the set only if SCCI(g) µ SCCI ). Both, the circular and the full arc based models display the 117kb signal. For the circular model, the spectrum associated to 117kb grows from (for biased genes in ABJKL) to (for ABJKL+rRNAs) and this is due to the precise periodic location of rrnas along the chromosome. D and E: Baba s dataset of essential genes displays the 117kb signal for both models. Notice that the signal at 117kb for Baba s has Z=4.57 while for ABJKL+rRNAs it has Z=6.87; Z-scores represent here a good manner to discriminate a peak from the mean behavior. Compare P-values in Table 10. 9

10 Figure 8: Curves (non smoothed) tracing chromosomal distributions of genes along the left and right arcs in E. coli K12 with a period of 33566nt. Genes belong to different groups: core genes (red curve), genes involved in metabolism (violet), in information storage and processing (green) and in cellular processing and signaling (blue). Violet, blue and green curves are constructed from genes classified in the COGs classes listed in the figure legend. To compare with the corresponding smoothed curves in Figure 5A in the article and with SI Figure 9. 10

11 Figure 9: Same as in SI Figure 8 but with a period of 117kb. This period was determined as statistically significant in [Wright et al. 2007], and as one can notice, a regularity in the distribution of genes is much less evident here. 11

12 number of genes in window Left dist from origin (mod 12938) in kb Right dist from origin (mod 12938) in kb periods (kb) spectrum Figure 10: Distribution of genes in a random genome. Top. Curves (non smoothed) tracing chromosomal distribution of genes tagged as core, on a period of 33566nt (left) and on a period of 12938nt (right) along the left and right arcs. The period of 12938nt is the main period obtained through Fourier analysis of this random genome. The random generation of genomes with tagged genes is explained in Methods. Compare the plots on the left to Fig. 4CD in the article, where the signal is much sharper: the total number of genes tagged as core in a given position varies within the interval [30, 90] for the left arc of the E. coli K12 genome (Fig. 4C) against [40, 70] for the randomly generated genome. The curve is defined by one main maxima for E. coli K12 and by several comparable local maxima for the random genome. Similarly for the right arc (Fig. 4D and also Fig. 5B). A noisy signal is easily recognized on the plots on the right with the absence of a main local maxima for the left arc. Bottom. Periodogram resulting from the Fourier analysis of the random genome. 12

13 Figure 11: Curves (non smoothed) tracing chromosomal distribution of E. coli K12 genes along the left and right arcs with a period of 33566nt. Top: distribution of all genes regulated by known transcription factors; bottom: distribution of all genes classified in COGs classes. We observe that for both groups, the left arc presents a periodic colocalization of genes in a subinterval between 10kb and 20kb from the chromosomal origin (mod 33566nt), with an extra, less pronounced, colocalization for genes regulated by known transcription factors at around 30kb. The right arc displays a much less sharp distribution. The list of genes regulated by known transcription factors (version 6.2) has been downloaded at 13

14 number of genes in window Left dist from origin (mod 33566) in kb number of genes in window Right dist from origin (mod 33566) in kb Figure 12: Curves (smoothed and non smoothed) tracing chromosomal distribution of all E. coli K12 genes along the left and right arcs with a period of 33566nt. Even if the signal is less sharp here than for core genes (see Figures 4CD in the article), there is a clear tendency of all E. coli K12 genes to colocalize in specific periodic subintervals. Namely, on the left arc we find a periodic gene colocalization between 10kb and 20kb from the chromosomal origin (mod 33566nt), and on the right arc we observe the same subinterval but also another, less pronounced, at around 30kb from the origin. 14

15 2.4 Left arc 2 Right arc mean number of genes in windows mean number of genes in windows dist from origin (mod 33566) in kb dist from origin (mod 33566) in kb 50 Left arc 40 Right arc Number of non empty windows Number of non empty windows dist from origin (mod 33566) in kb dist from origin (mod 33566) in kb Figure 13: Curves tracing the number of non-empty periodic windows along the chromosome (bottom; windows should contain some E. coli K12 core gene) and the average number of genes in these windows (top) for the left and right arcs. Each point of the curves corresponds to the counting done on periodic windows (of 5500nt) centered at this point along the 33kb period. The shape of the curves is similar to the ones in Fig.4CD and Fig.5B. This means that within chromosomal facets displaying a higher number of genes, we observe a larger number of periodic windows containing core genes and an increasing number of genes within these windows. 15

16 Figure 14: Pearson correlation curve and macrodomains. A: E. coli chromosome containing macrodomains as defined in [Esnault et. al, 2007]. Four macrodomains are defined: one around the oric (Ori, green), one around the dif (Ter, clear blue) and two flanking the Ter region, named Left (blue) and Right (red). The figure was redrawn from the one in [Esnault et. al, 2007] B: Pearson correlation curve shown in Figure 3B in the article. The chromosomal segments corresponding to the four macrodomains are colored as in A. Both extremes of the Left macrodomain and one extreme of the Right macrodomain coincide with the boundaries of chromosomal sectors. Namely, sector C starts at position nt and the Left macrodomain starts at nt; sector E starts at position nt and the Left macrodomain ends at position nt; secteur A (on the right arc) starts at position nt and the Right macrodomain ends at position nt. 16

17 0.4 Origin Left Arc Terminus Right Arc Origin 0.3 SCCI* under acid-shock* SCCI* under heat-shock* Clockwise position from origin (kb) 0 Figure 15: Transcriptomic data obtained under two conditions of response to stress (top: acid shock; bottom: heat shock) are plot (blue curves) against SCCI* values of E. coli K12 genes (red curve). Curves are smoothed. 17

18 Figure 16: Sectors computed on the Pearson correlation curve between the SCCI* curve and the log-phase Expression* curve (top), the acid-shock Expression* curve (middle) and the heat-shock Expression* curve (bottom) are reported. While log-phase and acid-shock conditions determine essentially the same chromosomal sectors (with the exception of A that splits in three large sectors under stress conditions), the heat-shock chromosomal sectors are clearly redefined, with much smaller sectors and a number of sectors (18) which is higher than it is expected in a randomly generated genome (this value is estimated to be 15 as indicated in Figure 17). Sectors B, C, D are the only ones that are preserved under heat-shock conditions. Colors correspond to macrodomains as in SI Fig. 14A. 18

19 Number of random genomes Number of sectors 150 genomes with at most 15 sectors genomes with at most 11 sectors genomes with at most 8 sectors Number of random genomes Percentage of sectors Figure 17: Statistical analysis of sectors for 1000 randomly generated genomes. For each random genome, sectors are computed from the Pearson correlation curve obtained as in Fig. 3B. Top: distribution of the number of sectors determined for 1000 random genomes, with mean and standard deviation Bottom: for each random genome, we compute the percentage of sector strips that cover more than 50% of the SCCI* peaks falling in the sector. Percentages are computed by taking into account only sectors containing SCCI* peaks. The distribution of percentages for random genomes containing at most 8 (blue), 11 (green) and 15 (red) sectors is plotted. (Notice that µ = 15, µ σ = 11 and µ 2σ = 8, where µ, σ are mean and standard deviation of the distribution.) No random genome with at most 8 sectors displays all sectors where strips cover > 50% of the SCCI* peaks, as for the E. coli K12 genome. For the blue histogram, the number of genomes for each bin is reported. 19

20 Figure 18: Distribution of genes in Baba s dataset. Baba s dataset is plotted on the intervals of 33566nt (left) and nt (right) for the left and right arcs, respectively. Observe that for period 117kb, there is a region highlighting a concentration of genes along both arcs. 20

21 Figure 19: Functional distribution of genes on the chromosomal spiral of period 28571nt. Distribution of genes belonging to the three main COGs functional groups is computed by counting the number of genes located on the chromosomal spiral of period 28571nt (starting from the origin O) and on successive windows of 5500nt along chromosomal arcs. The three functional groups are: information storage and processing (based on COGs classes ABJKL), cellular processing and signaling (DMNOPTUVWYZ) and metabolism (CEFGHIQ). Original curves have been smoothed with a Gaussian smoothing window with a sd of 2000nt. Original and smoothed curves are plot. The complementarity of the violet and blue curves can be easily observed. Green and blue curves follow the same tendency. 21

22 Escherichia coli str. K12 substr. MG1655 chromosome Lagging Leading Total Number of genes Number of core genes Top 100 core genes Core genes near to ORI Core genes near to TER Core ribosomal proteins near to ORI 3(3) 46(46) 49(49) Core ribosomal proteins near to TER 1(2) 4(7) 5(9) Table 1: Distribution of genes, core genes and ribosomal proteins located in leading and lagging strands. Number of genes, number of core genes and number of core ribosomal proteins located on the leading and on the lagging strands of Escherichia coli str. K12 substr. MG1655 genome. Proximity of core genes to ORI and TER is considered. For ribosomal proteins, the number in parenthesis describes the total number of ribosomal proteins. 22

23 Group King Size GC Gr Shape Arran Mot Oxy Habitat Temp Pathogenic γ-proteobacteria Bact Neg Rod S, P Yes Fac Host-ass Meso No Table 2: Characteristics of the E. coli K12 organism and genome. Group, Kingdom [Bacteria (Bact)], Size, GC, Gram stain [Negative (Neg)], Shape, Arrangement [Singles (S), Pairs(P)], Motility, Oxygen Req. [Facultative (Fac)], Habitat [Host-associated (Host-ass)], Temperature Range [Mesophilic (Meso)], Pathogenic. 23

24 Arc based models On S all On S leading On S lagging On S circ c = (P < 10 4 ; Z=7.81) 33333(P = ; Z=7.13) (4.39) (2.99) (4.67) (2.59) 31373(2.15) 17266(2.53) 80000(2.33) 33566(P < 10 4 ; Z=7.81) (4.39) 33566(P < 10 4 ; Z=7.81) (4.39) 65753(P = ; Z=4.97) (3.82) 33566(2.89) (2.8) 58537(2.77) (2.41) (2.37) 65753(P = ; Z=4.97) (3.82) c = (P = ; Z=7.13) (4.67) c = (P = ; Z=4.97) 33333(P = ; Z=7.13) (4.67) Circular model 33566(P < 10 4 ; Z=8.19) (5.5) 64865(2.15) 33566(P < 10 4 ; Z=8.19) (5.5) 33566(P < 10 4 ; Z=8.19) (5.5) Table 3: Periods computed for the three arc based models and for the circular model on core genes. Periods are computed with FFT analysis based on S all, S leading, S lagging and S circ defined on core genes. P -values and Z-scores are reported in parenthesis for the first peak and only Z-scores are reported for subsequent peaks. Lists are ordered from the highest peak to the lowest, and all peaks are greater than a fixed threshold depending on c (as described in Methods). These models do not take into account the organization of genes into operons. 24

25 Arc based models Circular model On S all On S leading On S lagging On S circ c = (P = ; (P = ; Z=7.12) 33333(P = ; Z=8.66) (P = ; Z=7.59) 33333(P = ; e Z=6.7) Z=3.65) (P = ; Z=2.75) (P = ; Z=7.59) 33333(P = ; Z=3.65) (P = ; Z=7.59) (P = ; Z=7.12) c = (P = ; Z=8.66) (P = ; Z=6.7) c = (P = ; Z=7.12) 33333(P = ; Z=8.66) (P = ; Z=6.7) Table 4: Periods computed for the three arc based models and for the circular model, taking into account that a core gene (possibly several) might occur in an operon. Case I. This model, compared to the arc based model considered in the article, discards from the set of core genes, those genes that belong to an operon and that are not the first occurring core gene in the operon (notice that the condition does not require the first occurring core gene to be the very first gene of the operon). Periods are computed with the FFT analysis on S all, S leading, S lagging and S circ. P -values and Z-scores are reported in parenthesis for the first peak and the peaks at 33kb. Lists are ordered from the highest peak to the lowest, and all peaks are greater than a fixed threshold depending on c (as described in Methods). 25

26 Arc based models Circular model On S all On S leading On S lagging On S circ c = (P = ; (P = ; Z=6.28) 33333(P = ; Z=8.39) (P = ; Z=7.71) 33333(P = ; Z=3.98) Z=6.56) (P = ; Z=3.71) (P = ; Z=7.71) 33333(P = ; Z=3.98) (P = ; Z=7.71) (P = ; Z=6.28) (P = ; Z=6.28) c = 3 c = (P = ; Z=8.39) (P = ; Z=6.56) (P = ; Z=3.71) 33333(P = ; Z=8.39) (P = ; Z=6.56) Table 5: Periods computed for the three arc based models and for the circular model, taking into account that a core gene (possibly several) might occur in an operon. Case II. The model considers those genes which are either the first gene in the operon (not necessarily core, but such that the operon contains at least one core gene) or a core gene that does not occur in an operon. Periods are computed by the FFT analysis on S all, S leading, S lagging and S circ. P -values and Z-scores are reported in parenthesis for the first peak and for the peaks at 33kb. 26

27 P -value #M r > µ r + 3σ r Arc based models S all c = 2 < c = 3 < c = 4 < S leading c = c = c = S lagging c = c = c = Circular model c = 2 < c = 3 < c = 4 < Table 6: Periodograms comparison between the E. coli K12 genome and randomly generated genomes. The subscript r refers to randomly generated genomes and the subscript g to the reference genome. The P -value counts the number of randomly generated genomes displaying a maximal peak which is greater than the maximal peak for the reference genome and divides it by 10000, that is the number of random generations. Condition #M r > µ r + 3σ r counts, for each periodogram generated from random genomes, the number of peaks which are greater than µ r + 3σ r ; the number in the table corresponds to peaks validating the inequality for random genomes. Notice that from the data in the Table we can conclude that the expected number of peaks which are > µ r + 3σ r in a random genome equals the number of peaks > µ + 3σ in E. coli K12 and that the peaks for random genomes have usually a smaller spectral value. 27

28 Sector sizes along the E.coli chromosome Sectors on left arc Sectors on right arc A B C D E F G A Start position End position Sector size (kb) Table 7: Sector positions along the E. coli chromosome. Sector positions are computed from the origin. Note that sector A overlaps the right and the left chromosomal arcs and its total size is 1634kb. 28

29 Sectors - left arc Sectors - right arc A B C D E F G A All peaks on the SCCI* curve - 117kb period # peaks in strip # total peaks Peaks on the SCCI* curve with values > µ + σ # peaks in strip # total peaks Table 8: SCCI* peaks coverage with a period of 117kb along the chromosome. Peaks in the SCCI* curve are organized in sectors (see SI Table 7). Strips accomodating most peaks in each sector are computed with a periodicity of 117kb and a size of 39kb (that is one third of the period). Compare to Table 1 in the article: we find a much better peak coverage with the 33kb period. 29

30 Sectors - left arc Sectors - right arc A B C D E F G A All peaks on the Expression* curve - 33kb period # peaks in strip # total peaks Peaks on the Expression* curve with values > µ + σ - 33kb period # peaks in strip # total peaks All peaks on the Expression* curve - 117kb period # peaks in strip # total peaks Peaks on the Expression* curve with values > µ + σ - 117kb period # peaks in strip # total peaks Table 9: Coverage of Expression* peaks for periods of 33kb and 117kb. Peaks in the Expression* curve are thought to be organized in sectors and strips accomodating most peaks in each sector are computed with a periodicity of 33566nt (with strip size of 11189nt) and of 117kb (with strip size of 39kb) (that is one third of the period). A better peak coverage is found with the 33kb period. 30

31 Sets of genes Arc based models Circular model All genes (4295) (P = ; Z=10.03) 3e+05 Core genes (563) 33566(P < 10 4 ; Z=7.81) Core genes + rrna genes (585) (P = ; Z=6.66) Wright s genes (2247) (P = ; Z=8.21) Core genes in Wright s set (452) 33566(P = ; Z=5.94) ABJKL genes in Wright s set (396) ABJKL genes in Wright s set with SCCI-value > µscci(274) COG classes CEFGHIQ: metabolism (1365) COG classes DMNOPTU- VWYZ: cellular processing and signalling (965) COG classes ABJKL: information storage and processing (681) On Sall On Sleading On Slagging (P = ; Z=5.5) (P < 10 4 ; Z=5.84) e (P = ; Z=10.38) (P = ; Z=6.33) (P = ; Z=5.82) ABJKL + rrna genes (703) (P = ; Z=6.87) ABJKL genes with SCCI-value > µscci (274) All COG classes confounded (3533) (P < 10 4 ; Z=6.75) (P = ; Z=9.25) Core genes in COGs (542) 33566(P = ; Z=7.9) Gerdes essential genes (602) 94118(P = ; Z=5.471) Baba s essential genes (300) (P < 10 4 ; Z=4.57) e+05(P = ; Z=5.06) (P = ; Z=4.97) (P = ; Z=4.21) (P = ; Z=6.1) (P < 10 4 ; Z=4.62) (P < 10 4 ; Z=5.7) (P < 10 4 ; Z=5.68) e (P = ; Z=8.38) (P = ; Z=4.35) (P = ; Z=7.22) 8e (P = ; Z=5.16) 8e (P < 10 4 ; Z=5.62) 8e (P = ; Z=4.83) (P = ; Z=4.22) (P = 0.003; Z=5.51) 8e+05 4e+05 9e+05(P < 10 4 ; Z=5.66) (P = ; Z=8.15) (P = ; Z=7.13) (P = 0.012; Z=7.13) (P = ; Z=8.42) (P = ; Z=6.07) (P = ; Z=8.09) (P = ; Z=5.91) (P = ; Z=8.73) (P = ; Z=5.71) (P = ; Z=3.84) (P = ; Z=3.84) (P = ; Z=6) (P = 0.012; Z=8.76) 2e (P = ; Z=5.54) (P = ; Z=6.76) (P = ; Z=4.94) (P = ; Z=3.68) (P < 10 4 ; Z=8.19) (P < 10 4 ; Z=7.97) (P = ; Z=6.93) (P < 10 4 ; Z=7.63) (P < 10 4 ; Z=5.57) (P < 10 4 ; Z=5.81) e (P = ; Z=5.63) (P = ; Z=6.86) (P = 0.001; Z=6.37) (P = ; Z=5.26) (P < 10 4 ; Z=6.23) e (P = 0.007; Z=10.52) 33566(P < 10 4 ; Z=8.78) (P = ; Z=6.33) (P < 10 4 ; Z=6.66) Table 10: Periods obtained with c = 3 on several groups of genes for the two kinds of chromosomal models. Periods are detected with the same methods and parameterization which was used in the article to analyse core genes. Periods which are integer multiples of 33kb are highlighted in bold and periods which are integer multiples of 117kb are underlined. For the first peak in a model, we report P -value and Z-score (computed on generated random genomes) of the peak. The number of genes in a dataset is reported in parenthesis, after the dataset name. 31

32 Baba Gerdes COGs ABJKL A B J K L CEFGHIQ C E F G H I Q DMNOPTUVWYZ D M N O P T U V W Y Z R S Wright (2247) Wright ABJKL Wright CEFGHIQ Wright DMNOPTUVWYZ Core genes Total number of genes Table 11: Table describing the content of Baba s and Gerdes datasets of essential genes. The two datasets are decomposed in COGs functional classes. The total number of genes contained in COGs classes is also given for comparison. The decomposition of Baba s and Gerdes datasets of essential genes in terms of core genes or genes collected in Wright s dataset is also reported. 32

33 Left arc CEFGHIQ DMNOPTUVWYZ Core genes ABJKL 0.58 (p < 2.2e 16 ) -0.53(p < 2.2e 16 ) 0.97(p < 2.2e 16 ) CEFGHIQ -0.19(p < 2.2e 16 ) 0.65(p < 2.2e 16 ) DMNOPTUVWYZ -0.63(p < 2.2e 16 ) Right arc CEFGHIQ DMNOPTUVWYZ Core genes ABJKL (p < 2.2e 16 ) 0.89(p < 2.2e 16 ) -0.40(p < 2.2e 16 ) CEFGHIQ -0.70(p < 2.2e 16 ) 0.89(p < 2.2e 16 ) DMNOPTUVWYZ -0.34(p < 2.2e 16 ) Table 12: Pearson correlation coefficients between pairs of curves in Fig. 5A in the article. The Pearson correlation coefficient and the associated P -value computed using Student s t-test, have been calculated for each pair of curves in Fig. 5A. Pairs are represented in a matrix. Two matrices describe the left arc (top) and the right arc (bottom). ABJKL, CEFGHIQ, DMNOPTUVWYZ and Core genes refer to the four curves drawn in Fig. 5A. 33

Chromosomal periodicity and positional networks of genes in Escherichia coli

Chromosomal periodicity and positional networks of genes in Escherichia coli Molecular Systems Biology 6; Article number 366; doi:10.1038/msb.2010.21 Citation: Molecular Systems Biology 6:366 & 2010 EMBO and Macmillan Publishers Limited All rights reserved 1744-4292/10 www.molecularsystemsbiology.com

More information

Bio 119 Bacterial Genomics 6/26/10

Bio 119 Bacterial Genomics 6/26/10 BACTERIAL GENOMICS Reading in BOM-12: Sec. 11.1 Genetic Map of the E. coli Chromosome p. 279 Sec. 13.2 Prokaryotic Genomes: Sizes and ORF Contents p. 344 Sec. 13.3 Prokaryotic Genomes: Bioinformatic Analysis

More information

Essentiality in B. subtilis

Essentiality in B. subtilis Essentiality in B. subtilis 100% 75% Essential genes Non-essential genes Lagging 50% 25% Leading 0% non-highly expressed highly expressed non-highly expressed highly expressed 1 http://www.pasteur.fr/recherche/unites/reg/

More information

Vital Statistics Derived from Complete Genome Sequencing (for E. coli MG1655)

Vital Statistics Derived from Complete Genome Sequencing (for E. coli MG1655) We still consider the E. coli genome as a fairly typical bacterial genome, and given the extensive information available about this organism and it's lifestyle, the E. coli genome is a useful point of

More information

Computational Biology: Basics & Interesting Problems

Computational Biology: Basics & Interesting Problems Computational Biology: Basics & Interesting Problems Summary Sources of information Biological concepts: structure & terminology Sequencing Gene finding Protein structure prediction Sources of information

More information

Microbiology / Active Lecture Questions Chapter 10 Classification of Microorganisms 1 Chapter 10 Classification of Microorganisms

Microbiology / Active Lecture Questions Chapter 10 Classification of Microorganisms 1 Chapter 10 Classification of Microorganisms 1 2 Bergey s Manual of Systematic Bacteriology differs from Bergey s Manual of Determinative Bacteriology in that the former a. groups bacteria into species. b. groups bacteria according to phylogenetic

More information

Supplementary materials Quantitative assessment of ribosome drop-off in E. coli

Supplementary materials Quantitative assessment of ribosome drop-off in E. coli Supplementary materials Quantitative assessment of ribosome drop-off in E. coli Celine Sin, Davide Chiarugi, Angelo Valleriani 1 Downstream Analysis Supplementary Figure 1: Illustration of the core steps

More information

the noisy gene Biology of the Universidad Autónoma de Madrid Jan 2008 Juan F. Poyatos Spanish National Biotechnology Centre (CNB)

the noisy gene Biology of the Universidad Autónoma de Madrid Jan 2008 Juan F. Poyatos Spanish National Biotechnology Centre (CNB) Biology of the the noisy gene Universidad Autónoma de Madrid Jan 2008 Juan F. Poyatos Spanish National Biotechnology Centre (CNB) day III: noisy bacteria - Regulation of noise (B. subtilis) - Intrinsic/Extrinsic

More information

Part 2. The Basics of Biology:

Part 2. The Basics of Biology: Part 2 The Basics of Biology: An Engineer s Perspective Chapter 2 An Overview of Biological Basics 21 2.1 Cells 2.2 Cell Construction 2.3 Cell Nutrient 2.1 Are all cells the same? Cells Basic unit of living

More information

The percentage of bacterial genes on leading versus lagging strands is influenced by multiple balancing forces

The percentage of bacterial genes on leading versus lagging strands is influenced by multiple balancing forces The percentage of bacterial genes on leading versus lagging strands is influenced by multiple balancing forces Xizeng Mao 1, Han Zhang 1,4, Yanbin Yin 1, 2 1, 2, 3, Ying Xu 1 Computational Systems Biology

More information

Topology. 1 Introduction. 2 Chromosomes Topology & Counts. 3 Genome size. 4 Replichores and gene orientation. 5 Chirochores.

Topology. 1 Introduction. 2 Chromosomes Topology & Counts. 3 Genome size. 4 Replichores and gene orientation. 5 Chirochores. Topology 1 Introduction 2 3 Genome size 4 Replichores and gene orientation 5 Chirochores 6 G+C content 7 Codon usage 27 marc.bailly-bechet@univ-lyon1.fr The big picture Eukaryota Bacteria Many linear chromosomes

More information

Kingdom Monera(Archaebacteria & Eubacteria)

Kingdom Monera(Archaebacteria & Eubacteria) Kingdom Monera(Archaebacteria & All bacteria are prokaryotes Characteristics: 1. No nucleus Eubacteria) 2. No membrane bound organelles 3. Smaller & less ribosomes 4. Most are smaller than eukaryotes 5.

More information

Dynamic optimisation identifies optimal programs for pathway regulation in prokaryotes. - Supplementary Information -

Dynamic optimisation identifies optimal programs for pathway regulation in prokaryotes. - Supplementary Information - Dynamic optimisation identifies optimal programs for pathway regulation in prokaryotes - Supplementary Information - Martin Bartl a, Martin Kötzing a,b, Stefan Schuster c, Pu Li a, Christoph Kaleta b a

More information

Biology 112 Practice Midterm Questions

Biology 112 Practice Midterm Questions Biology 112 Practice Midterm Questions 1. Identify which statement is true or false I. Bacterial cell walls prevent osmotic lysis II. All bacterial cell walls contain an LPS layer III. In a Gram stain,

More information

Biology 105/Summer Bacterial Genetics 8/12/ Bacterial Genomes p Gene Transfer Mechanisms in Bacteria p.

Biology 105/Summer Bacterial Genetics 8/12/ Bacterial Genomes p Gene Transfer Mechanisms in Bacteria p. READING: 14.2 Bacterial Genomes p. 481 14.3 Gene Transfer Mechanisms in Bacteria p. 486 Suggested Problems: 1, 7, 13, 14, 15, 20, 22 BACTERIAL GENETICS AND GENOMICS We still consider the E. coli genome

More information

High-resolution 3D models of Caulobacter crescentus chromosome reveal genome structural variability and organization

High-resolution 3D models of Caulobacter crescentus chromosome reveal genome structural variability and organization Published online 26 February 2018 Nucleic Acids Research, 2018, Vol. 46, No. 8 3937 3952 doi: 10.1093/nar/gky141 High-resolution 3D models of Caulobacter crescentus chromosome reveal genome structural

More information

3.B.1 Gene Regulation. Gene regulation results in differential gene expression, leading to cell specialization.

3.B.1 Gene Regulation. Gene regulation results in differential gene expression, leading to cell specialization. 3.B.1 Gene Regulation Gene regulation results in differential gene expression, leading to cell specialization. We will focus on gene regulation in prokaryotes first. Gene regulation accounts for some of

More information

Genetic Variation: The genetic substrate for natural selection. Horizontal Gene Transfer. General Principles 10/2/17.

Genetic Variation: The genetic substrate for natural selection. Horizontal Gene Transfer. General Principles 10/2/17. Genetic Variation: The genetic substrate for natural selection What about organisms that do not have sexual reproduction? Horizontal Gene Transfer Dr. Carol E. Lee, University of Wisconsin In prokaryotes:

More information

RGP finder: prediction of Genomic Islands

RGP finder: prediction of Genomic Islands Training courses on MicroScope platform RGP finder: prediction of Genomic Islands Dynamics of bacterial genomes Gene gain Horizontal gene transfer Gene loss Deletion of one or several genes Duplication

More information

INTERACTIVE CLUSTERING FOR EXPLORATION OF GENOMIC DATA

INTERACTIVE CLUSTERING FOR EXPLORATION OF GENOMIC DATA INTERACTIVE CLUSTERING FOR EXPLORATION OF GENOMIC DATA XIUFENG WAN xw6@cs.msstate.edu Department of Computer Science Box 9637 JOHN A. BOYLE jab@ra.msstate.edu Department of Biochemistry and Molecular Biology

More information

CHAPTER : Prokaryotic Genetics

CHAPTER : Prokaryotic Genetics CHAPTER 13.3 13.5: Prokaryotic Genetics 1. Most bacteria are not pathogenic. Identify several important roles they play in the ecosystem and human culture. 2. How do variations arise in bacteria considering

More information

Computational Cell Biology Lecture 4

Computational Cell Biology Lecture 4 Computational Cell Biology Lecture 4 Case Study: Basic Modeling in Gene Expression Yang Cao Department of Computer Science DNA Structure and Base Pair Gene Expression Gene is just a small part of DNA.

More information

Computational methods for the analysis of bacterial gene regulation Brouwer, Rutger Wubbe Willem

Computational methods for the analysis of bacterial gene regulation Brouwer, Rutger Wubbe Willem University of Groningen Computational methods for the analysis of bacterial gene regulation Brouwer, Rutger Wubbe Willem IMPORTANT NOTE: You are advised to consult the publisher's version (publisher's

More information

Lecture 5: Processes and Timescales: Rates for the fundamental processes 5.1

Lecture 5: Processes and Timescales: Rates for the fundamental processes 5.1 Lecture 5: Processes and Timescales: Rates for the fundamental processes 5.1 Reading Assignment for Lectures 5-6: Phillips, Kondev, Theriot (PKT), Chapter 3 Life is not static. Organisms as a whole are

More information

BACTERIA. CLS 212: Medical Microbiology Miss Zeina Alkudmani

BACTERIA. CLS 212: Medical Microbiology Miss Zeina Alkudmani BACTERIA CLS 212: Medical Microbiology Miss Zeina Alkudmani Prokaryotes Prokaryotic cells possess simpler structures than eukaryotic cells, since they do not have a nucleus or a lot of cytoplasmic organelles.

More information

ADMM Fused Lasso for Copy Number Variation Detection in Human 3 March Genomes / 1

ADMM Fused Lasso for Copy Number Variation Detection in Human 3 March Genomes / 1 ADMM Fused Lasso for Copy Number Variation Detection in Human Genomes Yifei Chen and Jacob Biesinger 3 March 2011 ADMM Fused Lasso for Copy Number Variation Detection in Human 3 March Genomes 2011 1 /

More information

Topic 4 - #14 The Lactose Operon

Topic 4 - #14 The Lactose Operon Topic 4 - #14 The Lactose Operon The Lactose Operon The lactose operon is an operon which is responsible for the transport and metabolism of the sugar lactose in E. coli. - Lactose is one of many organic

More information

Bacterial Genetics & Operons

Bacterial Genetics & Operons Bacterial Genetics & Operons The Bacterial Genome Because bacteria have simple genomes, they are used most often in molecular genetics studies Most of what we know about bacterial genetics comes from the

More information

Section 19 1 Bacteria (pages )

Section 19 1 Bacteria (pages ) Chapter 19 Bacteria and Viruses Section 19 1 Bacteria (pages 471 477) How do the two groups of prokaryotes differ? What factors are used to identify prokaryotes? What is the importance of bacteria? 13.

More information

Chapter 27: Bacteria and Archaea

Chapter 27: Bacteria and Archaea Name Period Overview 1. The chapter opens with amazing tales of life at the extreme edge. What are the masters of adaptation? Describe the one case you thought most dramatic. Concept 27.1 Structural and

More information

MONTGOMERY COUNTY COMMUNITY COLLEGE BIO 140 CHAPTER 4. Functional Anatomy of Prokaryotic and Eukaryotic Cells

MONTGOMERY COUNTY COMMUNITY COLLEGE BIO 140 CHAPTER 4. Functional Anatomy of Prokaryotic and Eukaryotic Cells MONTGOMERY COUNTY COMMUNITY COLLEGE BIO 140 CHAPTER 4 Functional Anatomy of Prokaryotic and Eukaryotic Cells I. PROKARYOTES A. Structure Of The Cell: Chemical Composition And Function 1. Cell Wall a. composition

More information

Boolean models of gene regulatory networks. Matthew Macauley Math 4500: Mathematical Modeling Clemson University Spring 2016

Boolean models of gene regulatory networks. Matthew Macauley Math 4500: Mathematical Modeling Clemson University Spring 2016 Boolean models of gene regulatory networks Matthew Macauley Math 4500: Mathematical Modeling Clemson University Spring 2016 Gene expression Gene expression is a process that takes gene info and creates

More information

Computational approaches for functional genomics

Computational approaches for functional genomics Computational approaches for functional genomics Kalin Vetsigian October 31, 2001 The rapidly increasing number of completely sequenced genomes have stimulated the development of new methods for finding

More information

Evaluation of the relative contribution of each STRING feature in the overall accuracy operon classification

Evaluation of the relative contribution of each STRING feature in the overall accuracy operon classification Evaluation of the relative contribution of each STRING feature in the overall accuracy operon classification B. Taboada *, E. Merino 2, C. Verde 3 blanca.taboada@ccadet.unam.mx Centro de Ciencias Aplicadas

More information

(b) In 1977, Carl Woese suggested that there are three domains of living organisms: the Archaea, the Bacteria and the Eukaryota.

(b) In 1977, Carl Woese suggested that there are three domains of living organisms: the Archaea, the Bacteria and the Eukaryota. 1 Prokaryotic and eukaryotic organisms can be classified depending on their cellular structure. (a) Describe three structural differences between prokaryotic and eukaryotic cells. (b) In 1977, Carl Woese

More information

Supplemental Materials

Supplemental Materials JOURNAL OF MICROBIOLOGY & BIOLOGY EDUCATION, May 2013, p. 107-109 DOI: http://dx.doi.org/10.1128/jmbe.v14i1.496 Supplemental Materials for Engaging Students in a Bioinformatics Activity to Introduce Gene

More information

A genomic-scale search for regulatory binding sites in the integration host factor regulon of Escherichia coli K12

A genomic-scale search for regulatory binding sites in the integration host factor regulon of Escherichia coli K12 The integration host factor regulon of E. coli K12 genome 783 A genomic-scale search for regulatory binding sites in the integration host factor regulon of Escherichia coli K12 M. Trindade dos Santos and

More information

Biology. Biology. Slide 1 of 26. End Show. Copyright Pearson Prentice Hall

Biology. Biology. Slide 1 of 26. End Show. Copyright Pearson Prentice Hall Biology Biology 1 of 26 Fruit fly chromosome 12-5 Gene Regulation Mouse chromosomes Fruit fly embryo Mouse embryo Adult fruit fly Adult mouse 2 of 26 Gene Regulation: An Example Gene Regulation: An Example

More information

no.1 Raya Ayman Anas Abu-Humaidan

no.1 Raya Ayman Anas Abu-Humaidan no.1 Raya Ayman Anas Abu-Humaidan Introduction to microbiology Let's start! As you might have concluded, microbiology is the study of all organisms that are too small to be seen with the naked eye, Ex:

More information

Kingdom Bacteria Kingdom Archaea

Kingdom Bacteria Kingdom Archaea Section 5.1 Kingdom Bacteria Kingdom Archaea p. 132-139 Kingdom Bacteria General Characteristics: Cell Type: all are prokaryotic. Body Form: most are unicellular, some are colonial. Three main shapes are:

More information

Bioinformatics Chapter 1. Introduction

Bioinformatics Chapter 1. Introduction Bioinformatics Chapter 1. Introduction Outline! Biological Data in Digital Symbol Sequences! Genomes Diversity, Size, and Structure! Proteins and Proteomes! On the Information Content of Biological Sequences!

More information

Mining Infrequent Patterns of Two Frequent Substrings from a Single Set of Biological Sequences

Mining Infrequent Patterns of Two Frequent Substrings from a Single Set of Biological Sequences Mining Infrequent Patterns of Two Frequent Substrings from a Single Set of Biological Sequences Daisuke Ikeda Department of Informatics, Kyushu University 744 Moto-oka, Fukuoka 819-0395, Japan. daisuke@inf.kyushu-u.ac.jp

More information

The Gene The gene; Genes Genes Allele;

The Gene The gene; Genes Genes Allele; Gene, genetic code and regulation of the gene expression, Regulating the Metabolism, The Lac- Operon system,catabolic repression, The Trp Operon system: regulating the biosynthesis of the tryptophan. Mitesh

More information

Introduction to Microbiology BIOL 220 Summer Session I, 1996 Exam # 1

Introduction to Microbiology BIOL 220 Summer Session I, 1996 Exam # 1 Name I. Multiple Choice (1 point each) Introduction to Microbiology BIOL 220 Summer Session I, 1996 Exam # 1 B 1. Which is possessed by eukaryotes but not by prokaryotes? A. Cell wall B. Distinct nucleus

More information

Name: SAMPLE EOC PROBLEMS

Name: SAMPLE EOC PROBLEMS Name: SAMPLE EOC PROBLEMS 1.Bromothymol blue (BTB) is a ph indicator that is also used to detect carbon dioxide (CO2). BTB is blue when ph is basic and CO2 is low. BTB is yellow when ph is acidic and CO2

More information

Newly made RNA is called primary transcript and is modified in three ways before leaving the nucleus:

Newly made RNA is called primary transcript and is modified in three ways before leaving the nucleus: m Eukaryotic mrna processing Newly made RNA is called primary transcript and is modified in three ways before leaving the nucleus: Cap structure a modified guanine base is added to the 5 end. Poly-A tail

More information

Comparative genomics: Overview & Tools + MUMmer algorithm

Comparative genomics: Overview & Tools + MUMmer algorithm Comparative genomics: Overview & Tools + MUMmer algorithm Urmila Kulkarni-Kale Bioinformatics Centre University of Pune, Pune 411 007. urmila@bioinfo.ernet.in Genome sequence: Fact file 1995: The first

More information

The Prokaryotic World

The Prokaryotic World The Prokaryotic World A. An overview of prokaryotic life There is no doubt that prokaryotes are everywhere. By everywhere, I mean living in every geographic region, in extremes of environmental conditions,

More information

GENE REGULATION AND PROBLEMS OF DEVELOPMENT

GENE REGULATION AND PROBLEMS OF DEVELOPMENT GENE REGULATION AND PROBLEMS OF DEVELOPMENT By Surinder Kaur DIET Ropar Surinder_1998@ yahoo.in Mob No 9988530775 GENE REGULATION Gene is a segment of DNA that codes for a unit of function (polypeptide,

More information

KINGDOM MONERA. Bacterial Cell Shape 8/22/2010. The Prokaryotes: Archaebacteria and Eubacteria

KINGDOM MONERA. Bacterial Cell Shape 8/22/2010. The Prokaryotes: Archaebacteria and Eubacteria KINGDOM MONERA The Prokaryotes: Archaebacteria and Eubacteria Bacteria are the most organisms living on the Earth. (i.e. 10mL of soil contains 1 x 10 10 bacteria. They are found in nearly every habitat

More information

GENE ACTIVITY Gene structure Transcription Transcript processing mrna transport mrna stability Translation Posttranslational modifications

GENE ACTIVITY Gene structure Transcription Transcript processing mrna transport mrna stability Translation Posttranslational modifications 1 GENE ACTIVITY Gene structure Transcription Transcript processing mrna transport mrna stability Translation Posttranslational modifications 2 DNA Promoter Gene A Gene B Termination Signal Transcription

More information

2. Mathematical descriptions. (i) the master equation (ii) Langevin theory. 3. Single cell measurements

2. Mathematical descriptions. (i) the master equation (ii) Langevin theory. 3. Single cell measurements 1. Why stochastic?. Mathematical descriptions (i) the master equation (ii) Langevin theory 3. Single cell measurements 4. Consequences Any chemical reaction is stochastic. k P d φ dp dt = k d P deterministic

More information

Sequence analysis and comparison

Sequence analysis and comparison The aim with sequence identification: Sequence analysis and comparison Marjolein Thunnissen Lund September 2012 Is there any known protein sequence that is homologous to mine? Are there any other species

More information

12-5 Gene Regulation

12-5 Gene Regulation 12-5 Gene Regulation Fruit fly chromosome 12-5 Gene Regulation Mouse chromosomes Fruit fly embryo Mouse embryo Adult fruit fly Adult mouse 1 of 26 12-5 Gene Regulation Gene Regulation: An Example Gene

More information

University of Colorado at Boulder ECEN 4/5532. Lab 2 Lab report due on February 16, 2015

University of Colorado at Boulder ECEN 4/5532. Lab 2 Lab report due on February 16, 2015 University of Colorado at Boulder ECEN 4/5532 Lab 2 Lab report due on February 16, 2015 This is a MATLAB only lab, and therefore each student needs to turn in her/his own lab report and own programs. 1

More information

CHAPTER 13 PROKARYOTE GENES: E. COLI LAC OPERON

CHAPTER 13 PROKARYOTE GENES: E. COLI LAC OPERON PROKARYOTE GENES: E. COLI LAC OPERON CHAPTER 13 CHAPTER 13 PROKARYOTE GENES: E. COLI LAC OPERON Figure 1. Electron micrograph of growing E. coli. Some show the constriction at the location where daughter

More information

Supplementary Figure 1. Nature Genetics: doi: /ng.3848

Supplementary Figure 1. Nature Genetics: doi: /ng.3848 Supplementary Figure 1 Phenotypes and epigenetic properties of Fab2L flies. A- Phenotypic classification based on eye pigment levels in Fab2L male (orange bars) and female (yellow bars) flies (n>150).

More information

TRACING BACK TO THE BEGINNING

TRACING BACK TO THE BEGINNING BACTERIA! TRACING BACK TO THE BEGINNING PROKARYOTES KINGDOM EUBACTERIA KINGDOM ARCHAEBACTERIA CHARACTERISTICS: 1. NO NUCLEUS 2. NO MEMBRANE BOUND ORGANELLES 4. MOST ARE SMALLER THAN EUKARYOTES 5. ARE SINGLE-CELLED

More information

Gene regulation II Biochemistry 302. Bob Kelm February 28, 2005

Gene regulation II Biochemistry 302. Bob Kelm February 28, 2005 Gene regulation II Biochemistry 302 Bob Kelm February 28, 2005 Catabolic operons: Regulation by multiple signals targeting different TFs Catabolite repression: Activity of lac operon is restricted when

More information

56:198:582 Biological Networks Lecture 10

56:198:582 Biological Networks Lecture 10 56:198:582 Biological Networks Lecture 10 Temporal Programs and the Global Structure The single-input module (SIM) network motif The network motifs we have studied so far all had a defined number of nodes.

More information

Machine Learning Linear Classification. Prof. Matteo Matteucci

Machine Learning Linear Classification. Prof. Matteo Matteucci Machine Learning Linear Classification Prof. Matteo Matteucci Recall from the first lecture 2 X R p Regression Y R Continuous Output X R p Y {Ω 0, Ω 1,, Ω K } Classification Discrete Output X R p Y (X)

More information

Supplementary Information. Drought response transcriptomics are altered in poplar with reduced tonoplast sucrose transporter expression

Supplementary Information. Drought response transcriptomics are altered in poplar with reduced tonoplast sucrose transporter expression Supplementary Information Drought response transcriptomics are altered in poplar with reduced tonoplast sucrose transporter expression Liang Jiao Xue, Christopher J. Frost, Chung Jui Tsai, Scott A. Harding

More information

Name: SBI 4U. Gene Expression Quiz. Overall Expectation:

Name: SBI 4U. Gene Expression Quiz. Overall Expectation: Gene Expression Quiz Overall Expectation: - Demonstrate an understanding of concepts related to molecular genetics, and how genetic modification is applied in industry and agriculture Specific Expectation(s):

More information

Bergey s Manual Classification Scheme. Vertical inheritance and evolutionary mechanisms

Bergey s Manual Classification Scheme. Vertical inheritance and evolutionary mechanisms Bergey s Manual Classification Scheme Gram + Gram - No wall Funny wall Vertical inheritance and evolutionary mechanisms a b c d e * * a b c d e * a b c d e a b c d e * a b c d e Accumulation of neutral

More information

AP Bio Module 16: Bacterial Genetics and Operons, Student Learning Guide

AP Bio Module 16: Bacterial Genetics and Operons, Student Learning Guide Name: Period: Date: AP Bio Module 6: Bacterial Genetics and Operons, Student Learning Guide Getting started. Work in pairs (share a computer). Make sure that you log in for the first quiz so that you get

More information

MCB 110. "Molecular Biology: Macromolecular Synthesis and Cellular Function" Spring, 2018

MCB 110. Molecular Biology: Macromolecular Synthesis and Cellular Function Spring, 2018 MCB 110 "Molecular Biology: Macromolecular Synthesis and Cellular Function" Spring, 2018 Faculty Instructors: Prof. Jeremy Thorner Prof. Qiang Zhou Prof. Eva Nogales GSIs:!!!! Ms. Samantha Fernandez Mr.

More information

Microbial Diversity. Yuzhen Ye I609 Bioinformatics Seminar I (Spring 2010) School of Informatics and Computing Indiana University

Microbial Diversity. Yuzhen Ye I609 Bioinformatics Seminar I (Spring 2010) School of Informatics and Computing Indiana University Microbial Diversity Yuzhen Ye (yye@indiana.edu) I609 Bioinformatics Seminar I (Spring 2010) School of Informatics and Computing Indiana University Contents Microbial diversity Morphological, structural,

More information

Controlling Gene Expression

Controlling Gene Expression Controlling Gene Expression Control Mechanisms Gene regulation involves turning on or off specific genes as required by the cell Determine when to make more proteins and when to stop making more Housekeeping

More information

DNA Technology, Bacteria, Virus and Meiosis Test REVIEW

DNA Technology, Bacteria, Virus and Meiosis Test REVIEW Be prepared to turn in a completed test review before your test. In addition to the questions below you should be able to make and analyze a plasmid map. Prokaryotic Gene Regulation 1. What is meant by

More information

Topic 3: Genetics (Student) Essential Idea: Chromosomes carry genes in a linear sequence that is shared by members of a species.

Topic 3: Genetics (Student) Essential Idea: Chromosomes carry genes in a linear sequence that is shared by members of a species. Topic 3: Genetics (Student) 3.2 Essential Idea: Chromosomes carry genes in a linear sequence that is shared by members of a species. 3.2 Chromosomes 3.2.U1 Prokaryotes have one chromosome consisting of

More information

9/8/2017. Bacteria and Archaea. Three domain system: The present tree of life. Structural and functional adaptations contribute to prokaryotic success

9/8/2017. Bacteria and Archaea. Three domain system: The present tree of life. Structural and functional adaptations contribute to prokaryotic success 5 m 2 m 9/8/2017 Three domain system: The present tree of life Bacteria and Archaea Chapter 27 Structural and functional adaptations contribute to prokaryotic success Unicellular Small Variety of shapes

More information

Introduction to Microbiology. CLS 212: Medical Microbiology Miss Zeina Alkudmani

Introduction to Microbiology. CLS 212: Medical Microbiology Miss Zeina Alkudmani Introduction to Microbiology CLS 212: Medical Microbiology Miss Zeina Alkudmani Microbiology Micro- means very small (that needs a microscope to see). Microbiology is the study of very small living organisms.

More information

Shape, Arrangement, and Size. Cocci (s., coccus) bacillus (pl., bacilli) 9/21/2013

Shape, Arrangement, and Size. Cocci (s., coccus) bacillus (pl., bacilli) 9/21/2013 Shape, Arrangement, and Size Cocci (s., coccus) are roughly spherical cells. The other common shape is that of a rod, sometimes called a bacillus (pl., bacilli). Spiral-shaped procaryotes can be either

More information

Biology I Fall Semester Exam Review 2014

Biology I Fall Semester Exam Review 2014 Biology I Fall Semester Exam Review 2014 Biomolecules and Enzymes (Chapter 2) 8 questions Macromolecules, Biomolecules, Organic Compunds Elements *From the Periodic Table of Elements Subunits Monomers,

More information

7 th Grade MCA3 Standards, Benchmarks, Examples, Test Specifications & Sampler Questions

7 th Grade MCA3 Standards, Benchmarks, Examples, Test Specifications & Sampler Questions 7 th Grade 3 Standards, Benchmarks, Examples, Test Specifications & Sampler Questions Strand Standard No. Benchmark (7 th Grade) Sampler Item Number & Operation 12-16 Items Modified 7-9 Items Read, write,

More information

Number of questions TEK (Learning Target) Biomolecules & Enzymes

Number of questions TEK (Learning Target) Biomolecules & Enzymes Unit Biomolecules & Enzymes Number of questions TEK (Learning Target) on Exam 8 questions 9A I can compare and contrast the structure and function of biomolecules. 9C I know the role of enzymes and how

More information

Know how to read a balance, graduated cylinder, ruler. Know the SI unit of each measurement.

Know how to read a balance, graduated cylinder, ruler. Know the SI unit of each measurement. Biology I Fall Semester Exam Review 2012-2013 Due the day of your final for a maximum of 5 extra credit points. You will be able to use this review on your exam for 15 minutes! Safety and Lab Measurement:

More information

Chapter 1. Basics of Microbiology

Chapter 1. Basics of Microbiology Chapter 1 Basics of Microbiology Objectives How microorganisms are classified (taxonomy) What they look like (morphology) The major divisions among microorganisms based upon their function in the environment

More information

Introduction to Molecular and Cell Biology

Introduction to Molecular and Cell Biology Introduction to Molecular and Cell Biology Molecular biology seeks to understand the physical and chemical basis of life. and helps us answer the following? What is the molecular basis of disease? What

More information

Chapter 12. Genes: Expression and Regulation

Chapter 12. Genes: Expression and Regulation Chapter 12 Genes: Expression and Regulation 1 DNA Transcription or RNA Synthesis produces three types of RNA trna carries amino acids during protein synthesis rrna component of ribosomes mrna directs protein

More information

2012 Univ Aguilera Lecture. Introduction to Molecular and Cell Biology

2012 Univ Aguilera Lecture. Introduction to Molecular and Cell Biology 2012 Univ. 1301 Aguilera Lecture Introduction to Molecular and Cell Biology Molecular biology seeks to understand the physical and chemical basis of life. and helps us answer the following? What is the

More information

Types of RNA. 1. Messenger RNA(mRNA): 1. Represents only 5% of the total RNA in the cell.

Types of RNA. 1. Messenger RNA(mRNA): 1. Represents only 5% of the total RNA in the cell. RNAs L.Os. Know the different types of RNA & their relative concentration Know the structure of each RNA Understand their functions Know their locations in the cell Understand the differences between prokaryotic

More information

Lecture Notes 5: Multiresolution Analysis

Lecture Notes 5: Multiresolution Analysis Optimization-based data analysis Fall 2017 Lecture Notes 5: Multiresolution Analysis 1 Frames A frame is a generalization of an orthonormal basis. The inner products between the vectors in a frame and

More information

Revisiting the Central Dogma The role of Small RNA in Bacteria

Revisiting the Central Dogma The role of Small RNA in Bacteria Graduate Student Seminar Revisiting the Central Dogma The role of Small RNA in Bacteria The Chinese University of Hong Kong Supervisor : Prof. Margaret Ip Faculty of Medicine Student : Helen Ma (PhD student)

More information

Microbiome: 16S rrna Sequencing 3/30/2018

Microbiome: 16S rrna Sequencing 3/30/2018 Microbiome: 16S rrna Sequencing 3/30/2018 Skills from Previous Lectures Central Dogma of Biology Lecture 3: Genetics and Genomics Lecture 4: Microarrays Lecture 12: ChIP-Seq Phylogenetics Lecture 13: Phylogenetics

More information

Androgen-independent prostate cancer

Androgen-independent prostate cancer The following tutorial walks through the identification of biological themes in a microarray dataset examining androgen-independent. Visit the GeneSifter Data Center (www.genesifter.net/web/datacenter.html)

More information

chapter 5 the mammalian cell entry 1 (mce1) operon of Mycobacterium Ieprae and Mycobacterium tuberculosis

chapter 5 the mammalian cell entry 1 (mce1) operon of Mycobacterium Ieprae and Mycobacterium tuberculosis chapter 5 the mammalian cell entry 1 (mce1) operon of Mycobacterium Ieprae and Mycobacterium tuberculosis chapter 5 Harald G. Wiker, Eric Spierings, Marc A. B. Kolkman, Tom H. M. Ottenhoff, and Morten

More information

Classifying Prokaryotes: Eubacteria Plasma Membrane. Ribosomes. Plasmid (DNA) Capsule. Cytoplasm. Outer Membrane DNA. Flagellum.

Classifying Prokaryotes: Eubacteria Plasma Membrane. Ribosomes. Plasmid (DNA) Capsule. Cytoplasm. Outer Membrane DNA. Flagellum. Bacteria The yellow band surrounding this hot spring is sulfur, a waste product of extremophilic prokaryotes, probably of the Domain Archaea, Kingdom Archaebacteria. Bacteria are prokaryotic cells (no

More information

Chapter 19. Gene creatures, Part 1: viruses, viroids and plasmids. Prepared by Woojoo Choi

Chapter 19. Gene creatures, Part 1: viruses, viroids and plasmids. Prepared by Woojoo Choi Chapter 19. Gene creatures, Part 1: viruses, viroids and plasmids Prepared by Woojoo Choi Dead or alive? 1) In this chapter we will explore the twilight zone of biology and the gene creature who live there.

More information

Lecture 3: Linear Filters

Lecture 3: Linear Filters Lecture 3: Linear Filters Professor Fei Fei Li Stanford Vision Lab 1 What we will learn today? Images as functions Linear systems (filters) Convolution and correlation Discrete Fourier Transform (DFT)

More information

9th and 10th Grade Math Proficiency Objectives Strand One: Number Sense and Operations

9th and 10th Grade Math Proficiency Objectives Strand One: Number Sense and Operations Strand One: Number Sense and Operations Concept 1: Number Sense Understand and apply numbers, ways of representing numbers, the relationships among numbers, and different number systems. Justify with examples

More information

BME 5742 Biosystems Modeling and Control

BME 5742 Biosystems Modeling and Control BME 5742 Biosystems Modeling and Control Lecture 24 Unregulated Gene Expression Model Dr. Zvi Roth (FAU) 1 The genetic material inside a cell, encoded in its DNA, governs the response of a cell to various

More information

Genome Annotation Project Presentation

Genome Annotation Project Presentation Halogeometricum borinquense Genome Annotation Project Presentation Loci Hbor_05620 & Hbor_05470 Presented by: Mohammad Reza Najaf Tomaraei Hbor_05620 Basic Information DNA Coordinates: 527,512 528,261

More information

Written Exam 15 December Course name: Introduction to Systems Biology Course no

Written Exam 15 December Course name: Introduction to Systems Biology Course no Technical University of Denmark Written Exam 15 December 2008 Course name: Introduction to Systems Biology Course no. 27041 Aids allowed: Open book exam Provide your answers and calculations on separate

More information

Translation - Prokaryotes

Translation - Prokaryotes 1 Translation - Prokaryotes Shine-Dalgarno (SD) Sequence rrna 3 -GAUACCAUCCUCCUUA-5 mrna...ggagg..(5-7bp)...aug Influences: Secondary structure!! SD and AUG in unstructured region Start AUG 91% GUG 8 UUG

More information

Bacteria are very small

Bacteria are very small BACTERIA BACTERIA Bacteria are very small Bacteria are very small compared to cells with nuclei This is a pore in human skin and the yellow spheres are bacteria BACTERIA LIVE ALMOST EVERYWHERE Hot springs

More information

MODULE 9 NORMAL DISTRIBUTION

MODULE 9 NORMAL DISTRIBUTION MODULE 9 NORMAL DISTRIBUTION Contents 9.1 Characteristics of a Normal Distribution........................... 62 9.2 Simple Areas Under the Curve................................. 63 9.3 Forward Calculations......................................

More information

Quantitative Molecular Biology

Quantitative Molecular Biology Quantitative Molecular Biology PHYS 176/276 Instructor: Terry Hwa Winter 2018 What is quantitative biology? èquantitative biology biology + numbers/equations biology-inspired physics application of existing

More information

Big Idea 3: Living systems store, retrieve, transmit and respond to information essential to life processes. Tuesday, December 27, 16

Big Idea 3: Living systems store, retrieve, transmit and respond to information essential to life processes. Tuesday, December 27, 16 Big Idea 3: Living systems store, retrieve, transmit and respond to information essential to life processes. Enduring understanding 3.B: Expression of genetic information involves cellular and molecular

More information

Introduction to Bioinformatics Integrated Science, 11/9/05

Introduction to Bioinformatics Integrated Science, 11/9/05 1 Introduction to Bioinformatics Integrated Science, 11/9/05 Morris Levy Biological Sciences Research: Evolutionary Ecology, Plant- Fungal Pathogen Interactions Coordinator: BIOL 495S/CS490B/STAT490B Introduction

More information