Structural Basis for Methyl Transfer by a Radical SAM Enzyme

Similar documents
SUPPLEMENTARY INFORMATION

New Delhi Metallo-β-Lactamase: Structural Insights into β- Lactam Recognition and Inhibition

Nitrogenase MoFe protein from Clostridium pasteurianum at 1.08 Å resolution: comparison with the Azotobacter vinelandii MoFe protein

SUPPLEMENTARY INFORMATION

Electronic Supplementary Information (ESI) for Chem. Commun. Unveiling the three- dimensional structure of the green pigment of nitrite- cured meat

Supporting Information. Synthesis of Aspartame by Thermolysin : An X-ray Structural Study

Table 1. Crystallographic data collection, phasing and refinement statistics. Native Hg soaked Mn soaked 1 Mn soaked 2

Structure and evolution of the spliceosomal peptidyl-prolyl cistrans isomerase Cwc27

Supplementary materials. Crystal structure of the carboxyltransferase domain. of acetyl coenzyme A carboxylase. Department of Biological Sciences

Pathogenic C9ORF72 Antisense Repeat RNA Forms a Double Helix with Tandem C:C Mismatches

Table S1. Overview of used PDZK1 constructs and their binding affinities to peptides. Related to figure 1.

SUPPLEMENTARY INFORMATION

Supplementary Figure 1. Aligned sequences of yeast IDH1 (top) and IDH2 (bottom) with isocitrate

Supplemental Data. Structure of the Rb C-Terminal Domain. Bound to E2F1-DP1: A Mechanism. for Phosphorylation-Induced E2F Release

Supporting Information

Introduction to Comparative Protein Modeling. Chapter 4 Part I

SUPPLEMENTARY INFORMATION

Acta Crystallographica Section F

Model Mélange. Physical Models of Peptides and Proteins

SUPPLEMENTARY INFORMATION

Diphthamide biosynthesis requires a radical iron-sulfur enzyme. Pennsylvania State University, University Park, Pennsylvania 16802, USA

SUPPLEMENTARY INFORMATION

SUPPLEMENTARY FIGURES

Physiochemical Properties of Residues

Packing of Secondary Structures

High-resolution crystal structure of ERAP1 with bound phosphinic transition-state analogue inhibitor

Viewing and Analyzing Proteins, Ligands and their Complexes 2

SUPPLEMENTARY INFORMATION

SUPPLEMENTARY INFORMATION. doi: /nature07461

Full wwpdb X-ray Structure Validation Report i

Acta Crystallographica Section D

Supplementary figure 1. Comparison of unbound ogm-csf and ogm-csf as captured in the GIF:GM-CSF complex. Alignment of two copies of unbound ovine

Structurale, Université Grenoble Alpes, CNRS, CEA, Grenoble, France

Nature Structural and Molecular Biology: doi: /nsmb.2938

Introduction to" Protein Structure

Ranjit P. Bahadur Assistant Professor Department of Biotechnology Indian Institute of Technology Kharagpur, India. 1 st November, 2013

SUPPLEMENTARY INFORMATION

The structure of a nucleolytic ribozyme that employs a catalytic metal ion. Yijin Liu, Timothy J. Wilson and David M.J. Lilley

Protein Dynamics. The space-filling structures of myoglobin and hemoglobin show that there are no pathways for O 2 to reach the heme iron.

SUPPLEMENTARY INFORMATION

SUPPLEMENTARY INFORMATION

SUPPLEMENTARY INFORMATION

Supporting Information

Detailed description of overall and active site architecture of PPDC- 3dThDP, PPDC-2HE3dThDP, PPDC-3dThDP-PPA and PPDC- 3dThDP-POVA

IgE binds asymmetrically to its B cell receptor CD23

Supporting Information

Biomolecules: lecture 10

SUPPLEMENTARY INFORMATION

Full wwpdb X-ray Structure Validation Report i

Full wwpdb X-ray Structure Validation Report i

Full wwpdb X-ray Structure Validation Report i

Radical SAM enzyme QueE defines a new minimal core fold and metal-dependent mechanism

Supporting Information

SUPPLEMENTARY INFORMATION

Supersecondary Structures (structural motifs)

Catalytic Mechanism of the Glycyl Radical Enzyme 4-Hydroxyphenylacetate Decarboxylase from Continuum Electrostatic and QC/MM Calculations

Supplemental Information

Rational Design of Thermodynamic and Kinetic Binding Profiles by. Optimizing Surface Water Networks Coating Protein Bound Ligands

CHAPTER 29 HW: AMINO ACIDS + PROTEINS

Protein Structure. W. M. Grogan, Ph.D. OBJECTIVES

SUPPLEMENTARY INFORMATION

Supporting Information

Bacterial protease uses distinct thermodynamic signatures for substrate recognition

Supplementary Figure 3 a. Structural comparison between the two determined structures for the IL 23:MA12 complex. The overall RMSD between the two

Details of Protein Structure

Full wwpdb X-ray Structure Validation Report i

PAN-modular Structure of Parasite Sarcocystis muris Microneme Protein SML-2 at 1.95 Å Resolution and the Complex with 1-Thio-β-D-Galactose

Nature Structural & Molecular Biology: doi: /nsmb Supplementary Figure 1

Supporting Information

The structure of a nucleolytic ribozyme that employs a catalytic metal ion Liu, Yijin; Wilson, Timothy; Lilley, David

type GroEL-GroES complex. Crystals were grown in buffer D (100 mm HEPES, ph 7.5,

Programme Last week s quiz results + Summary Fold recognition Break Exercise: Modelling remote homologues

SUPPLEMENTARY INFORMATION

Sunhats for plants. How plants detect dangerous ultraviolet rays

Lecture 12. Metalloproteins - II

Supporting Information. UV-induced ligand exchange in MHC class I protein crystals

DATE A DAtabase of TIM Barrel Enzymes

Esser et al. Crystal Structures of R. sphaeroides bc 1

Prediction and refinement of NMR structures from sparse experimental data

Supplementary Information

SI Text S1 Solution Scattering Data Collection and Analysis. SI references

Procheck output. Bond angles (Procheck) Structure verification and validation Bond lengths (Procheck) Introduction to Bioinformatics.

It s the amino acids!

Bioinformatics. Macromolecular structure

Direct Method. Very few protein diffraction data meet the 2nd condition

Supplementary information

Full wwpdb X-ray Structure Validation Report i

Chapter 9 DNA recognition by eukaryotic transcription factors

High Pressure Freezing. Philippe Carpentier,

Structure, mechanism and ensemble formation of the Alkylhydroperoxide Reductase subunits. AhpC and AhpF from Escherichia coli

Protein Structures: Experiments and Modeling. Patrice Koehl

Supporting Information

for Molecular Biology and Neuroscience and Institute of Medical Microbiology, Rikshospitalet-Radiumhospitalet

Structural insights into WcbI, a novel polysaccharide-biosynthesis enzyme

Secondary Structure. Bioch/BIMS 503 Lecture 2. Structure and Function of Proteins. Further Reading. Φ, Ψ angles alone determine protein structure

The NRP peptidase ClbP as a target for the inhibition. of genotoxicity, cell proliferation and tumorogenesis. mediated by pks-harboring bacteria

Full-length GlpG sequence was generated by PCR from E. coli genomic DNA. (with two sequence variations, D51E/L52V, from the gene bank entry aac28166),

Gene regulation II Biochemistry 302. February 27, 2006

RNA protects a nucleoprotein complex against radiation damage

Final Chem 4511/6501 Spring 2011 May 5, 2011 b Name

Transcription:

www.sciencemag.org/cgi/content/full/science.1205358/dc1 Supporting Online Material for Structural Basis for Methyl Transfer by a Radical SAM Enzyme Amie K. Boal, Tyler L. Grove, Monica I. McLaughlin, Neela H. Yennawar, Squire J. Booker,* Amy C. Rosenzweig* *To whom correspondence should be addressed. E-mail: squire@psu.edu (S.J.B.); amyr@northwestern.edu (A.C.R.) This PDF file includes: Published 28 April 2011 on Science Express DOI: 10.1126/science.1205358 Materials and Methods Figs. S1 to S18 Table S1 References

Materials and Methods General crystallographic methods. All datasets were processed using the HKL2000 package (51) and solved by multiple anomalous dispersion (MAD) phasing using SHARP/autoSHARP (52, 53) or by molecular replacement using the program PHASER (54). Model building and refinement were performed with Coot and Refmac5, respectively (55, 56). Data collection and refinement statistics are shown in Table S1. Ramachandran plots were calculated with PROCHECK (57) and MolProbity (58). Diffraction-component precision index (DPI) errors were calculated with SFCHECK (59). Figures were prepared using PyMOL (60) and electrostatic surface potential calculations were performed with the PyMOL APBS plugin (49) at 150 mm monovalent ion concentration. All data were collected at the Life Sciences Collaborative Access Team (LS-CAT) and General Medicine and Cancer Institutes Collaborative Access Team (GM/CA-CAT) beamlines at the Advanced Photon Source. RlmN structure. RlmN from Escherichia coli was prepared as described previously (7) from a construct encoding a C-terminal hexahistidine affinity tag that remains uncleaved after purification. All manipulations were carried out in a Coy anaerobic chamber. Protein solutions (100 mg/ml in 10 mm HEPES ph 7.5, 500 mm KCl, 10% glycerol, 5 mm DTT) were diluted 1:10 in 20 mm HEPES ph 7.6, allowed to stand overnight, and centrifuged at 10,000 g prior to crystallization. Yellow-brown rectangular prism-shaped crystals were obtained using the hanging drop vapor diffusion method at 20 C with 10% (w/v) PEG 6000, 5% (v/v) 2-methyl-2,4-pentanediol (MPD), 0.1 M HEPES ph 7.5 as a precipitant equilibrated against a 0.25 M LiCl well solution. Crystals were soaked in cryoprotectant solution (30% (w/v) glycerol, 10% (w/v) PEG

6000, 5% (v/v) MPD, 0.1 M HEPES ph 7.5) for less than 5 min, mounted on rayon loops, and flash cooled in liquid nitrogen. Diffraction data for MAD phasing were collected at the Fe absorption peak and a remote wavelength (Table S1). A high-resolution native data set was also collected. The peak and remote data sets were subjected individually to an initial round of autosharp that yielded the location of the iron-sulfur cluster. High resolution phase information was obtained by placing individual iron atoms within the cluster density using the Fe anomalous Fourier map, a model cluster, and the locations of the three cysteinyl ligands (61). Phasing with SHARP (53) (peak 2 and remote 2 datasets) with the eight iron sites (four sites per cluster, one cluster in each of the two molecules in the ASU) determined as above yielded good quality electron density maps after solvent flattening with SOLOMON (62). An initial model was built automatically using Buccaneer (63). Further adjustment manually with Coot (55) was aided by use of additional data from a second crystal phased as described above, but with a higher resolution native dataset included (peak 1, remote 1, and native 1 datasets). This map showed improved side chain density and was used during manual model building. Non-crystallographic symmetry restraints were used during model building but removed in the later stages of refinement. The final model was refined against the native 1 dataset and consists of residues 10-350 and 361-374 for chain A, residues 14-350 and 361-374 for chain B, eight iron atoms, eight sulfur atoms, one MPD molecule, and 366 water molecules. Electron density was not observed for residues 350-360 in chains A and B. Ramachandran plots indicate that 99.9% of the residues are in the allowed and additionally allowed regions. The DPI error is 0.212 Å.

RlmN+SAM structure. RlmN crystals obtained as described above were soaked in mother liquor containing 1 mm S-adenosyl-L-methionine (SAM) for 45 min in a Coy anaerobic chamber. Crystals were briefly transferred to cryoprotectant solution (30% (v/v) PEG 400, 10% (w/v) PEG 6000, 5% (v/v) MPD, 0.1 M HEPES ph 7.5), then mounted on rayon loops and flash frozen in liquid nitrogen. The structure was solved by molecular replacement using the coordinates of RlmN as a starting model. The final model consists of residues 17-349 for chain A, residues 17-375 for chain B, eight iron atoms, eight sulfur atoms, two SAM molecules, and 181 water molecules. Electron density was not observed beyond residue 349 in chain A. Ramachandran plots indicate that 100% of the residues are in the allowed and additionally allowed regions. The DPI error is 0.173 Å.

Figure S1. The methylation reactions catalyzed by E. coli RlmN and S. aureus Cfr (7). RlmN methylates the C2 position of A2503. Cfr catalyzes methylation of C2 and C8, but C8 is the primary target.

Figure S2. The proposed mechanism for C2 methylation of A2503 by RlmN (7).

Figure S3. Secondary structure matching (SSM) superposition (PDBeFold server (64)) of the E. coli RlmN (purple) α 6 /β 6 core with the PFL-AE (PDB accession code 3CB8) core structure (30). Two views are shown, (A) and (B). The [4Fe-4S] cluster is shown as a space filling model in orange (iron) and yellow (sulfur) and the SAM cosubstrate is shown in stick format (green) and colored by atom type. The rmsd is 2.6 Å for 220 Cα atoms. The most significant differences between the two structures are found in the indicated loops connecting core secondary structure elements. The equivalent to loop C, located between β6 and α6, is shorter in RlmN and lacks the helical region. The loop between α1 and β2 is longer in RlmN and is located on the same side of the barrel as a loop in PFL-AE (not shown) that undergoes dramatic conformational change upon binding of a substrate analog (30). The structural differences in these loops may reflect differences in substrate specificity for the two enzymes.

Figure S4. Structure-based sequence alignment of the RlmN N-terminal domain with a representative HhH 2 structure (RuvA, PDB accession code 2C7Y). The fold is defined by a set of conserved hydrophobic residues contributed by each helix (bottom, black bars) and a GXG sequence conserved in the two hairpin regions (bottom, orange bars) (36). The paired GXG sequences typically confer sequence independent minor-groove recognition of B-form DNA through interaction with the phosphate backbone on both strands (36). In the RlmN domain, lack of conservation of the hairpin sequence and deviation from a pseudosymmetric arrangement of HhH motifs may result in specificity for an RNA structure. Similar deviations are observed in a domain found in DNA polymerase β which interacts with only one DNA strand (65).

Figure S5. The α1/β2 loop adopts different conformations and engages in crystal contacts. (A) Residues 160-170 shown in stick format (RlmN structure, chain B). The β7 extension is shown in blue, the α 6 /β 6 core in purple, the β 1-3 extension in pink, and the N-terminal domain in green. In this molecule, the loop packs against a portion of the second molecule in the ASU, placing it close to the β 1-3 extension. (B) and (C) show an overlay of the three conformations observed in the four molecules from the two structures.

Figure S6. The C-terminal helix in the RlmN+SAM structure. (A) 2F o -F c (1.3σ) map of residues 363-375 in chain B. (B) and (C) show interactions with a symmetry related molecule. The C- terminal helix also interacts in an intramolecular fashion with a short loop composed of residues 99-103. (D-F) show zoomed-in views of hydrophobic (D) and hydrogen bonding interactions (E) with the symmetry related molecule (D and E) and residues 99-103 from the same molecule (F). The side chain density is more pronounced for the residues that engage in specific hydrophobic or hydrogen bonding interactions. The intramolecular interactions shown in view (F) are not observed in the absence of SAM because the helix is shifted away from the core. Notably, the residues that comprise this helix are absent in Cfr.

Figure S7. The iron-sulfur cluster in RlmN and RlmN+SAM. 2F o -F c electron density map (1.5σ) for the side chains of residues 125, 129, and 132 within the CX 3 CX 2 C motif, the [4Fe-4S] cluster, and the SAM cofactor in the RlmN+SAM structure. The SAM methionine sulfur atom is located 3.7 Å from the two nearest sulfur atoms in the [4Fe-4S] cluster and 3.2 Å from the unique iron site. These distances are consistent with other structurally characterized radical SAM enzymes (39). In the inset, an Fe edge anomalous difference electron density map (9.0σ) for the RlmN structure shows full occupancy of all four iron atoms in the cluster.

Figure S8. A hydrogen bonding network between SAM (shown as sticks, green), the α 6 /β 6 core, residues 351-360 (colored as in Fig. 1), and ordered solvent (red spheres). Additionally, Glu 105 forms hydrogen bonds to the backbone of the linker region. These interactions may trigger ordering of residues 351-360 upon SAM binding. The use of water-mediated contacts may be an especially important feature of this network, allowing for controlled repositioning of the loop within the active site.

Figure S9. Overlay of the RlmN (light gray) and RlmN+SAM structures (colored as in Fig. 1). (A) A view of the entire structure. Areas that undergo large changes are indicated by black arrows. The C-terminal helix (blue) experiences a significant positional shift due to ordering of residues 351-360. (B) A zoomed-in view of the SAM binding site. Conserved residue Asn 312 shifts 2.5 Å (Cα) to fold over the face of the SAM adenine moiety. (C) Loops near the active site shift position or become ordered (black arrows) upon SAM binding resulting in a less solvent exposed active site at the C-terminal end of the barrel. Similar ordering phenomena and conformational changes are observed upon substrate binding in other RS enzymes (30) and are proposed to prevent unwanted chemistry such as abortive cleavage of SAM.

Figure S10. 2F o -F c electron density map (1.0σ) for residues 349-361 in the RlmN+SAM structure. Weak electron density is observed for two residues (Ala 353 and Gly 360), labeled with asterisks, suggesting conformational flexibility.

Figure S11. A stereoview of the active site of RlmN.

Figure S12. Active site overlay of RlmN+SAM with RlmN (white). Cys 118 changes rotamer upon SAM binding, inducing a flip in the Met 176-Gly 177 peptide bond (or vice versa). This may result in stabilization of the SAM-bound state and favorable positioning of the Met 176 backbone carbonyl and Cys 118 side chain for various steps in the reaction pathway. The location of the Met 176 side chain methyl group, 4 Å from the site of 5 -da formation, may be important in controlling access to the reactive site.

Figure S13. Methyl acquistion by Cys 355 may occur via SAM bound to the [4Fe-4S] cluster. (A) The mcys 355 sulfur atom is positioned favorably to be deprotonated by nearby strictly conserved residue Glu 105 (perhaps via a local water network) and to subsequently attack the carbon atom of the SAM methyl substituent. (B) Local hydrogen bonding interactions may be altered in the presence of unmethylated Cys 355 to further facilitate methyl transfer or downstream reaction steps.

Figure S14. Comparison of the RlmN+SAM structure (colored as in Fig. 1) and the PFL- AE+peptide substrate (yellow) structure (30) (PDB accession code 3CB8). (A) Side-by-side view showing that the general location of the peptide substrate in the PFL-AE complex structure is similar to that of residues 351-360 in RlmN+SAM. This observation is notable since these enzymes both form a protein centered radical in these regions upon 5 -da generation in the active site. (B) Side-by-side view showing that the PFL peptide glycyl radical site (Gly 734) (right) is shifted toward the 5 carbon (*) of the SAM cofactor compared to the position of the methyl substituent on mcys 355 in RlmN+SAM (left). An important structural feature of the loop may be the exposure of backbone carbonyls (such as that of Gly 734, shown) to interact with the SAM cosubstrate. Similar interactions may guide positioning of mcys 355 in RlmN upon substrate binding. (C) A manual alignment of the backbone atoms in the PFL-AE substrate and RlmN residues 351-360 shows a similar finger loop structure perhaps defined by a potential hydrogen bonding interaction indicated by the gray dashed line. The site of radical formation in RlmN (mcys 355) does not align here with the glycyl radical site in PFL (Gly 734). It aligns with an adjacent serine, suggesting the site of reactivity is not controlled by the sequence of the loop but instead by its precise location in the active site. In each system, distal structural elements in the enzyme/substrate complex may ultimately control the position of the loop.

Figure S15. An additional view in the comparison of the electrostatic surface potential (A) in RlmN to a sequence conservation map from pairwise alignment of E. coli RlmN and S. aureus Cfr (B) as shown in Figure 3.

Figure S16. Sequence conservation near the active site in a pairwise alignment of E. coli RlmN and S. aureus Cfr. (A) shows strictly conserved residues in orange, neutral substitutions in tan, and variable regions in white. The surface near mcys 355 and Cys 118, residues identified as mechanistically critical, is highly conserved in both enzymes. (B) is colored as in Fig. 1. Specific residues substituted in Cfr are shown in stick format and labeled in black. Several relatively conservative substitions (Thr, Val in RlmN to Ser in Cfr) may subtly alter hydrogen bonding patterns to promote substrate conformational changes. A salt bridge between conserved residue Arg 344 and Glu 278 (Ala in Cfr) could be an important factor in substrate orientation within the RlmN active site.

Figure S17. Surface map (ConSURF server) (66) of sequence conservation among the entire RlmN/Cfr family of enzymes (700 sequences) (8). Views in (A), (B), and (C) are as shown in Figs. 3A-D, and S15.

Figure S18. Overlay of a Cfr homology model (I-TASSER server) (67) (tan) onto the RlmN+SAM structure (colored as in Fig. 1). Brackets denote large structural insertions found in RlmN.

Table S1. Data collection and refinement statistics. Data collection Wavelength (Å) RlmN Native 1 1 RlmN Peak 1 1 RlmN Remote 1 1 RlmN Peak 2 2 RlmN RlmN+SAM 1 Remote 2 2 0.98 1.72 1.65 1.72 1.65 0.98 Space group C222 1 P2 1 2 1 2 1 Cell dimensions a, b, c (Å) 72.15, 80.41, 312.24 72.19, 80.46, 312.46 50.00-2.52 (2.56-2.52) 0.120 (0.412) 72.19, 80.42, 312.11 50.00-2.42 (2.46-2.42) 0.129 (0.488) 72.14, 80.33, 312.94 50.00-2.92 (2.97-2.92) 0.139 (0.645) 72.24, 80.43, 313.37 50.00-2.90 (2.95-2.90) 0.128 (0.688) 55.18, 55.62, 252.18 Resolution (Å) 50.00-2.20 (2.29-2.20) 30.00-2.05 (2.09-2.05) R sym or R merge 0.086 0.087 (0.529) (0.507) <I /σi> 23.9 (2.1) 26.0 (5.1) 20.3 (2.4) 32.2 (7.1) 23.6 (4.3) 16.5 (2.0) Completeness 99.3 (92.3) 100.0 (99.7) 98.1 (82.9) 100.0 100.0 96.4 (95.1) (%) (99.8) (99.9) Redundancy 7.0 (4.4) 13.2 (7.8) 12.3 (4.1) 28.5 (24.5) 14.1 (11.9) 4.9 (3.6) Refinement Resolution (Å) 50.0-2.20 50.0-2.05 No. reflections 43897 45458 R work / R free 0.213/0.267 0.202/0.242 No. atoms 5983 5714 Protein 5593 5463 Ligand/ion 24 70 Water 366 181 B-factors (Å 2 ) Protein 25.7 23.1 Ligand/ion 44.3 26.4 Water 31.7 27.1 R.m.s. deviations Bond 0.008 0.008 lengths (Å) Bond angles ( ) 1.012 1.069 1 Data collected at LS-CAT beamlines 21-ID-D, 21-ID-G, 21-ID-F 2 Data collected at GM/CA-CAT beamline 23-ID-D

References 51. Z. Otwinowski, W. Minor, Methods Enzymol. 276, 307 (1997). 52. C. Vonrhein, E. Blanc, P. Roversi, G. Bricogne, in Methods in Molecular Biology, S. Doublié, Ed. (Humana Press, Inc., Totowa, NJ, 2007), vol. 364, pp. 215-230. 53. E. de la Fortelle, G. Bricogne, Methods Enzymol. 276, 472 (1997). 54. A. J. McCoy, R. W. Grosse-Kunstleve, L. C. Storoni, R. J. Read, Acta Cryst. D61, 458 (2005). 55. P. Emsley, K. Cowtan, Acta Cryst. D60, 2126 (2004). 56. G. N. Murshudov, A. A. Vagin, E. J. Dodson, Acta Cryst. D53, 240 (1997). 57. R. A. Laskowski, J. Appl. Cryst. 26, 283 (1993). 58. V. B. Chen et al., Acta Cryst. D66, 12 (2010). 59. A. A. Vaguine, J. Richelle, S. J. Wodak, Acta Cryst. D55, 191 (1999). 60. W. L. Delano, The PyMOL molecular graphics system. (DeLano Scientific, San Carlos, CA, 2002). 61. J. W. Peters, H. D. Bellamy, J. Appl. Cryst. 32, 1180 (1999). 62. J. P. Abrahams, A. G. W. Leslie, Acta Cryst. D52, 30 (1996). 63. K. Cowtan, Acta Cryst. D62, 1002 (2006). 64. E. Krissinel, K. Henrick, Acta Cryst. D60, 2256 (2004). 65. M. R. Sawaya, R. Prasad, S. H. Wilson, J. Kraut, H. Pelletier, Biochemistry 36, 11205 (1997). 66. M. Landau et al., Nucleic Acids Res. 33, W299 (2005). 67. A. Roy, A. Kucukural, Y. Zhang, Nat. Protoc. 5, 725 (2010).