Structural and mechanistic insights into a Bacteroides vulgatus retaining N- acetyl-β-galactosaminidase that uses neighboring group participation

Similar documents
Acta Crystallographica Section D

Electronic Supplementary Information (ESI) for Chem. Commun. Unveiling the three- dimensional structure of the green pigment of nitrite- cured meat

Supporting Information. Full and Partial Agonism of a Designed Enzyme Switch

Table S1. Overview of used PDZK1 constructs and their binding affinities to peptides. Related to figure 1.

Supporting Information. Synthesis of Aspartame by Thermolysin : An X-ray Structural Study

Pathogenic C9ORF72 Antisense Repeat RNA Forms a Double Helix with Tandem C:C Mismatches

High-resolution crystal structure of ERAP1 with bound phosphinic transition-state analogue inhibitor

GC376 (compound 28). Compound 23 (GC373) (0.50 g, 1.24 mmol), sodium bisulfite (0.119 g,

type GroEL-GroES complex. Crystals were grown in buffer D (100 mm HEPES, ph 7.5,

Figure 2. Amino acid sequence alignment of L-carbamoylases. A BLAST search was conducted with BsLcar sequence, using the UNIREF100 sequence cluster

Supplementary materials. Crystal structure of the carboxyltransferase domain. of acetyl coenzyme A carboxylase. Department of Biological Sciences

Acta Crystallographica Section F

SUPPLEMENTARY INFORMATION

SUPPLEMENTARY FIGURES

Trapped intermediates in crystals of the FMN-dependent oxidase PhzG provide insight into the final steps of phenazine biosynthesis

SUPPLEMENTARY INFORMATION

Supplemental Data. Structure of the Rb C-Terminal Domain. Bound to E2F1-DP1: A Mechanism. for Phosphorylation-Induced E2F Release

Supporting Information

Supporting Information

Dynamics connect substrate recognition to catalysis in protein kinase A

Supporting Information

Exploiting Protein Conformational Change to Optimize Adenosine-Derived Inhibitors of HSP70

Supplementary Figures

SUPPLEMENTARY INFORMATION

Supporting information

SUPPORTING INFORMATION

Supplementary Materials for

Structurale, Université Grenoble Alpes, CNRS, CEA, Grenoble, France

Supplementary Information. Structural basis for precursor protein-directed ribosomal peptide macrocyclization

Supporting Information

Supporting Information

High Frequency sonoatrp of 2-Hydroxyethyl Acrylate in an Aqueous Medium

BA, BSc, and MSc Degree Examinations

Acta Crystallographica Section D

IgE binds asymmetrically to its B cell receptor CD23

Supporting Information

Supporting Information

Discovery of Decamidine as a New and Potent PRMT1 Inhibitor

Supplementary Figure 1. Structures of substrates tested with 1. Only one enantiomer is shown.

Supplementary information

BSc and MSc Degree Examinations

Structural basis for catalytically restrictive dynamics of a high-energy enzyme state

FRAGMENT SCREENING IN LEAD DISCOVERY BY WEAK AFFINITY CHROMATOGRAPHY (WAC )

SUPPLEMENTARY INFORMATION

A BODIPY-based fluorescent probe for the differential

Supporting Information for. Jesinghaus, Rachael Barry, Zemer Gitai, Justin Kollman and Enoch P. Baldwin

New Delhi Metallo-β-Lactamase: Structural Insights into β- Lactam Recognition and Inhibition

Supplementary Materials for

Redox-Responsive Complexation between a. Pillar[5]arene with Mono ethylene oxide Substituents. and Paraquat

Serine-7 but not serine-5 phosphorylation primes RNA polymerase II CTD for P-TEFb recognition

Supplementary Figure 1. Stability constants of metal monohydroxides. The log K values are summarized according to the atomic number of each element

SUPPLEMENTARY INFORMATION

Hyperbranched Poly(N-(2-Hydroxypropyl) Methacrylamide) via RAFT Self- Condensing Vinyl Polymerization

Supporting Information. Time-Resolved Botulinum Neurotoxin A Activity Monitored using. Peptide-Functionalized Au Nanoparticle Energy Transfer Sensors

SUPPLEMENTARY INFORMATION

Supplementary Figure 1 Crystal contacts in COP apo structure (PDB code 3S0R)

Structural Basis for Methyl Transfer by a Radical SAM Enzyme

Rational Design of Thermodynamic and Kinetic Binding Profiles by. Optimizing Surface Water Networks Coating Protein Bound Ligands

Structural insights into WcbI, a novel polysaccharide-biosynthesis enzyme

Enzyme Kinetics Using Isothermal Calorimetry. Malin Suurkuusk TA Instruments October 2014

CCP4 Diamond 2014 SHELXC/D/E. Andrea Thorn

Hydrophobicity-Induced Prestaining for Protein Detection in Polyacrylamide

Supplementary Figure 1. Biochemical and sequence alignment analyses the

Supporting Information. A fluorogenic assay for screening Sirt6 modulators

Title: A novel mechanism of protein thermostability: a unique N-terminal domain confers

MicroCal itc 200. System MicroCal Auto-iTC 200. System. GE Healthcare Life Sciences. System design and description. provide:

Electronic Supplementary Information

Analysis of nucleotide binding to p97 reveals the properties of a tandem AAA hexameric ATPase

SUPPLEMENTARY INFORMATION

In Crystallo Capture of a Covalent Intermediate in the UDP-Galactopyranose Mutase Reaction

Supporting Information

SUPPLEMENTARY INFORMATION

Table 1. Crystallographic data collection, phasing and refinement statistics. Native Hg soaked Mn soaked 1 Mn soaked 2

PAN-modular Structure of Parasite Sarcocystis muris Microneme Protein SML-2 at 1.95 Å Resolution and the Complex with 1-Thio-β-D-Galactose

Supplementary Information

Electronic Supplementary Information for: Gram-scale Synthesis of a Bench-Stable 5,5 -Unsubstituted Terpyrrole

Electronic Supplementary Information

Presentation Microcalorimetry for Life Science Research

Supplementary Materials: Localization and Spectroscopic Analysis of the Cu(I) Binding Site in Wheat Metallothionein Ec-1

THE CRYSTAL STRUCTURE OF THE SGT1-SKP1 COMPLEX: THE LINK BETWEEN

Full-length GlpG sequence was generated by PCR from E. coli genomic DNA. (with two sequence variations, D51E/L52V, from the gene bank entry aac28166),

Cks1 CDK1 CDK1 CDK1 CKS1. are ice- lobe. conserved. conserved

A Highly Selective Fluorescent Sensor for Glucosamine

Nature Structural & Molecular Biology: doi: /nsmb Supplementary Figure 1

Supplementary Figure 1 Pairing alignments, turns and extensions within the structure of the ribozyme-product complex. (a) The alignment of the G27

Supporting Information for: Using a Lipase as a High Throughput Screening Method for Measuring the Enantiomeric. Excess of Allylic Acetates

S2004 Methods for characterization of biomolecular interactions - classical versus modern

The structure of the deubiquitinase USP15 reveals a misaligned catalytic triad and an open ubiquitin-binding channel

Simple Solution-Phase Syntheses of Tetrahalodiboranes(4) and their Labile Dimethylsulfide Adducts

Techniques in Molecular Genetics Spectroscopy and Enzyme Assays

Structural Mechanism for the Fidelity Modulation of DNA Polymerase λ. 128 Academia Road Sec. 2, Nankang, Taipei, 115, Taiwan

Supporting Information. Chemo-enzymatic Synthesis of Isotopically Labeled Nicotinamide Ribose

Supporting Information. Structural Insights into Substrate Specificity and Solvent Tolerance in Alcohol

Supplemental Methods. Protein expression and purification

RNA protects a nucleoprotein complex against radiation damage

Supplementary Information

SUPPLEMENTARY INFORMATION

Sample preparation and characterization around SAXS

Preparation of 1:1 alternating, nucleobase-containing copolymers for use in sequence-controlled polymerization

TA Instruments Application Note

Transcription:

Electronic Supplementary Material (ESI) for ChemComm. This journal is The Royal Society of Chemistry 2016 SUPPORTING INFORMATION Structural and mechanistic insights into a Bacteroides vulgatus retaining N- acetyl-β-galactosaminidase that uses neighboring group participation C. Roth, a M. Petricevic, b A. John, c,d E.D. Goddard-Borger, c,d,* G.J. Davies a,* and S. J. Williams b,* Contents Supplementary figures...2 Figure S1: Sequence of BvGH123....2 Figure S2: ph activity and inhibitor binding to BvGH123....3 Fig. S3 Dimer of BvGH123 as observed in the crystal structure....4 Fig. S4 Overlay of BvGH123 in open and closed conformations....4 Experimental...5 Cloning and expression...5 Protein crystallography...5 Determination of stereochemistry of catalysis by 1 H NMR analysis...5 Kinetic analysis of BvGH123...5 Size exclusion chromatography-multi-angle laser light scattering...6 Isothermal titration calorimetry...6 Table S1: SsGH134 X-ray data collection, processing and refinement statistics...7 References...8 1

Supplementary figures (a) >ENA ABR39859 ABR39859.1 Bacteroides vulgatus ATCC 8482 conserved hypothetical protein ATGAAAAAACTCCTATTTCTAGGCGCTCTCCTGTTGTCAACAGTATGTATGAACGCCCAAACTTCCGAATATTATCAGGA AGCAGCCAATCCTATTGCCACCAACCCCGCTTTATGGGCGAAAGTTACAGCTCCACAAATCAGCTGGGGAAGCACAGACA TCCGCTATAAGAAAGAGGAACCGGCTCCCATCCATAGTGCGCAGAAAAGTATGAATCTTACTGCATGGAAAGGTGAGAAA ATTTCAGCCCAACTGGTAGTATGGACTCCCAAAGTGCTAAATGACTTGACTTTCATGGTCAGCGATTTAACCTCAGGCAG TGCAACCATCAGTAAAGAGAATATCCGAACAGGCTTTGTCCGTTACGTTATAACCGATGAACTGAACAAAGATGGTTTGG GCGCATGCGGCTATCGAAACAGTGCTGATTTTGACTCAACTTTAGTAGCAGATGTAATAGACCATATCACTCCTACTCTA ACCCTTCCCGCCAACTCCACCCAAGGGGGATGGATCAGCGTAAACATCCCTCAGGGCACTAAAGCCGGGAAATACACAGG AACCGTCACAGTGAAAGCCGACGGTATCACCTTGTCTGAATTAAAACTGAACCTCCAAGTGAAGAACCGTACTCTGCCTC CTCCTTCCGAATGGGCTTTCCATCTGGATTTATGGCAGAACCCTTATGCAGTATCCCGTTACTACAATGTGGAACCGTTC AGCAAAAAACATTTCGATTTGATGCGCCCATTGATGAAACTGTATGCCGATGCAGGTGGTAAGGTAATCACAGCCTCCAT TATGCACAAGCCTTGGAACGGACAGACCTATGATGCTTTTGAAAGCATGGTCACTTGGTTGAAAAAAGCGGATGGAACCT GGTATTTTGACTATACCGTATTCGACAAGTGGGTGGAATTTATGATGGATCTTGGTGTCAAAAAACAAATCAGTTGCTAT TCTATGGTTCCCTGGCGCCTTTCTTTCCAATATTTTGATCAGGCCAGCAACTCTTTCAAATTTTTGGACGCCAAACCGGG TGAAGTTGCTTATGAAGAATTTTGGATGAATATGCTGCAAGATTTCTCAAAGCATCTGAAAGCAAAAGGCTGGTTCGATA TCACTCACATTGCGATGGACGAACGCCCGATGAAGGACATGCAGGAAACACTGAAAGTGATCCGTAAGGCTGATAAAGAC TTTAAAGTTTCTTTGGCAGGAACTTATCACAAAGAACTATTGGATGATCTGAATGATTATTGTATCACCATTGCCGAGAA ATTTACTCCCGAAGAGATTGAAGCGCGCCGGAAGGCAGGCAAAGTAACTACTTATTATACCTGCTGCACAGAGCCCCGCC CCAACACTTTCACTTTCAGCGAACCTGCCGAAGCTGAATGGCTGGCATGGCATAGTGCGAAAGAAAATCTGGACGGATAT CTTCGTTGGGCTTTAAACAGCTGGGTGAAAAATCCCCTACAAGACAGCCGTTTCACAGCTTGGGCTGCCGGAGACACGTA TATGATTTATCCGGGCGCCCGTTCATCCATCCGGCTGGAACGCCTGACAGAAGGAATACAATTTTTTGAGAAAGTACGCA TTCTGAAAGAAGAATTTGAAGAAAAAGGCAATAAAGGAGCTATTAAGAATATAGACAAAACCTTGAAAATGTTTGATGAA TCAAGCATGGATAAGATTTCTCCTACCACTGCCGTAAACAAAGCAAAAAAAGTTATCAACCGATACTAG Predicted translation: MKKLLFLGALLLSTVCMNAQTSEYYQEAANPIATNPALWAKVTAPQISWGSTDIRYKKEE PAPIHSAQKSMNLTAWKGEKISAQLVVWTPKVLNDLTFMVSDLTSGSATISKENIRTGFV RYVITDELNKDGLGACGYRNSADFDSTLVADVIDHITPTLTLPANSTQGGWISVNIPQGT KAGKYTGTVTVKADGITLSELKLNLQVKNRTLPPPSEWAFHLDLWQNPYAVSRYYNVEPF SKKHFDLMRPLMKLYADAGGKVITASIMHKPWNGQTYDAFESMVTWLKKADGTWYFDYTV FDKWVEFMMDLGVKKQISCYSMVPWRLSFQYFDQASNSFKFLDAKPGEVAYEEFWMNMLQ DFSKHLKAKGWFDITHIAMDERPMKDMQETLKVIRKADKDFKVSLAGTYHKELLDDLNDY CITIAEKFTPEEIEARRKAGKVTTYYTCCTEPRPNTFTFSEPAEAEWLAWHSAKENLDGY LRWALNSWVKNPLQDSRFTAWAAGDTYMIYPGARSSIRLERLTEGIQFFEKVRILKEEFE EKGNKGAIKNIDKTLKMFDESSMDKISPTTAVNKAKKVINRY* underlined = signal peptide (b) >His6-BvGH123: MHHHHHHLEVLFQGPQTSEYYQEAANPIATNPALWAKVTAPQISWGSTDIRYKKEE PAPIHSAQKSMNLTAWKGEKISAQLVVWTPKVLNDLTFMVSDLTSGSATISKENIRTGFV RYVITDELNKDGLGACGYRNSADFDSTLVADVIDHITPTLTLPANSTQGGWISVNIPQGT KAGKYTGTVTVKADGITLSELKLNLQVKNRTLPPPSEWAFHLDLWQNPYAVSRYYNVEPF SKKHFDLMRPLMKLYADAGGKVITASIMHKPWNGQTYDAFESMVTWLKKADGTWYFDYTV FDKWVEFMMDLGVKKQISCYSMVPWRLSFQYFDQASNSFKFLDAKPGEVAYEEFWMNMLQ DFSKHLKAKGWFDITHIAMDERPMKDMQETLKVIRKADKDFKVSLAGTYHKELLDDLNDY CITIAEKFTPEEIEARRKAGKVTTYYTCCTEPRPNTFTFSEPAEAEWLAWHSAKENLDGY LRWALNSWVKNPLQDSRFTAWAAGDTYMIYPGARSSIRLERLTEGIQFFEKVRILKEEFE EKGNKGAIKNIDKTLKMFDESSMDKISPTTAVNKAKKVINRY Underlined = hexahistidine tag Figure S1: Sequence of BvGH123. (a) Predicted protein sequence of BvGH123. (b) Recombinant BvGH123 protein sequence, consisting of an N-terminal hexahistidine tag fused to the catalytic domain without signal peptide. The general acid/base and stabilizer residues are in bold. 2

Bv_GH123 1 MKKLL---------------------------FLGALLLSTVCMNAQTSEYYQEAANPIATNPALWAKVTAPQISWGSTD Cp_GH123 1 MKKDT---------------------------TL--------------------------------------GASIGSTD Psp_GH123 201 QLNAAVLPANASNQTIRWSTSDDQVAKIDAQGRLSAEAVGTVQVTATSEDGGFQATRKVVVAPASQY----LKGAVGTTD EEEE EEEEEEEEEE EEEE HHHEEEEEEE Bv_GH123 54 IRYKKEEP---------APIHSA-QKSMNLTAWKGEKISAQLVVWTPKV-LNDLTFMVSDLTSG-SATISKENIRTGFVR Cp_GH123 16 FHYLQKDY---------DEIKKLNLNTWNEVAWIGDELNSKIVMWTNSSPVNNVTLSSSDFINENGDLISSNNIKISWLK Psp_GH123 277 MHYVKESPFDDYKRIDYMEIAQMNEKVWTGNAWKGDRASAQFVLWTTNRKEDQVTLSIGDLVGEHNQSIASSNLSVHFVK EEEE HHH EEEEEEEE EEEEEEEEEE EEEEEEEEEE EEE-E Bv_GH123 122 YVITDELNKDGLGACGYRNSADFDSTLVADVIDHITPTLTLPANSTQGGWISVNIPQGTKAGKYTGTVTVKADGITLS-E Cp_GH123 87 ETLAN---------IGRS-NPSAPLEPFPDIIHN-SGSLNIEKNKIASAWINIKIPRNAKPGIYNGSIEVTADELEKSYT Psp_GH123 357 TTKAA---------RGNP-SQGKPQELIPDILGS-ADPVSIDKFGVQPIWVSIDVPKDAKAGNYSGQLTASSASGEEV-S EEEEEEE HHH EEEE HHHHHHH HHHHHHHHHHHHHHHH EEEEE Bv_GH123 201 LKLNLQVKNRTLPPPSEWAFHLDLWQNPYAVSRYYNV---EPFSKKHFDLMRPLMKLYADAGGKVITASIMHKPWNGQTY Cp_GH123 156 FDYSFEVLNLVQPLPSETNTQIEFWQHPYTIARYYKICKEDLFTEKHFKYLRGNLKEYRNMGGRGVIATIVHEAWNHQSY Psp_GH123 425 FTLQLEVLDLTLPDVKDWGFELDLWQNPYAVARVNGITQDQLWTKAHFDAMKPHYEMLAAAGQKVITTTVTYDPWNSQTY EEEE EEEE HHHHHHHHHHHH EEEEE EEEEEE HHH Bv_GH123 278 DAFESMVTWLKKADGTWYFDYTVFDKWVEFMMDLGVK------KQISCYSMVPWRLSFQYFDQASNSFKFLDAKPGEVAY Cp_GH123 236 DSDPSMIKWRKNSYGTFEFDYSHFDKWIQLNIDLGILDPEKGFGQIKCYSIVPWNNRIQYFNEATNKEEAINPTPGSDLW Psp_GH123 505 DVYDTMVKWTKKADGTYTFDFEIFDKWVQFMMDLGID------SQIDAYSMVSWASKIKYYDEVQKKDVIETVQTNNPKW HHHHHHHHHHHHHHHHHH HHHEEEEEE HHHHHHHHHHHHHH EEEEEE HHH EEEEE Bv_GH123 352 EEFWMNMLQDFSKHLKAKGWFDITHIAMDERPMKDMQETLKVIRKAD----KDFKVSLAGTYH----KELLDDLNDYCIT Cp_GH123 316 INIWTQFLTSFMSHLEEKGWFNITYISMDERSMDDLKACVDLIENITNNSYEHFKISSAMDYESGNDYSFLDRIDDISIG Psp_GH123 579 AVMWRAFLEAFVPHLEAKGWLDKTYMANDERALNDLLIAADLIEEVS---GGKLKIAAAMNVNSLSD-SRLDRLHKISVG HHHHHHHHH EEEEE HHHHHHHHHHHHH EEEE Bv_GH123 424 IAE-----KFTPEEIEARRKAGKVTTYYTCCTEPRPNTFTFSEPAEAEWLAWHSAKENLDGYLRWALNSWVKNPLQDSRF Cp_GH123 396 LSHINHNSDDMKNMATHRQELGLLTTIYTC-TGDYPSSFTISDPSEGAFTIWYSLYQNTNGFLRWSWDGWVENPLENVSY Psp_GH123 655 LYHVNHENKQLTNTSEHRRQLGLITTIYNC-VGHYPNSFARSNPAEPVWVIWDTMRHQTDGYLRWAFDSFVKDPYETTDF HHHHHHHHHHHHHHHHHHHHHHHHH HH-----HHHHHHHHHHH Bv_GH123 499 TAWAAGDTYMIYPGARSS--------IRLERLTEGIQFFEKVRILKEEFEEKGNKG-----AIKNIDKTLKMF------- Cp_GH123 475 KYWEPGDPFLIYPAEKDSIGKTFYSTPRLEKLKEGIRDINKAKYLMEKAPNLKNSI-------ENLIYSLKRPNKGENAY Psp_GH123 734 KTWESGDSAQVYPGAVSS--------VRFERMKEGIRDAEKVRYLTEHNAEIGEEIAAAAWQMQNVGYALDPY------- HHHHHHHHHHHHHHH Bv_GH123 559 D----ESSMDKISPT-TAVNKAKKVINRY------------- Cp_GH123 548 GSAVAASKEDRDLTI-SEANRIKNGINNFAREFISLTMETL- Psp_GH123 799 N----GVKDPGIVNIPQEVNRLKASLDQAARKYLDLIREEMK Figure S2: Sequence alignment of biochemically-characterized family GH123 sequences. The characterized GH123s from Bacteroides vulgatus ATCC 8482 (ABR39859.1), Clostridium perfringens ATCC 13124 (ABG82546.1) and Paenibacillus sp. TS12 (BAJ83606.1) where aligned using ClustalW. The secondary structure of the B. vulgatus enzyme, as determined by the structure reported in this work, is shown above the alignment. Residues interacting with the substrate are highlighted in red. 3

(a) (b) Figure S3: ph activity and inhibitor binding to BvGH123. (a) ph dependence of activity for BvGH123 using pnpgalnac as substrate. k cat /K M values were determined by the substrate depletion method. See the Experimental section for further details. (b) Isothermal titration calorimetry of Gal-thiazoline binding to BvGH123. 4

Fig. S4 Dimer of BvGH123 as observed in the crystal structure. (a) β-wing domain formed by insertion of a 4 stranded β-sheet between the β3 and 4 of the (βα) 8 -barrel. (b) Crystallographic dimer observed for BvGH123 Fig. S5 Overlays of BvGH123 in open and closed conformations, and with CpNga123. Overlay of open (a) and closed (b) conformations.of BvGH123 (wheat) and CpNga123 (slate gray). (c) Overlay of open (wheat) and closed (plum) form of BvGH123. The active site residues D361 and E362 are shown in stick representation. The overlay was based on the TIM-barrel domains. 5

Fig. S6 Zoom of overlay of BvGH123 and CpNga123 in open and closed conformations. Overlay of open (a) and closed (b) conformations.of BvGH123 (wheat) and CpNga123 (slate gray). (c) Overlay of BvGH123 in open(wheat) and closed (slate gray). 6

Experimental Cloning and expression The gene of BvGH123, encoding the mature protein sequence, was amplified from genomic DNA of B. vulgatus DSM 1447 and cloned in YSBL3CLIC creating a construct with an N- terminal cleavable His-Tag 1. Protein expression was carried out in TB-medium. For the production of selenomethionine containing protein the gene was transformed in E. coli B384 and the cells were grown in minimal medium, using autoinduction to induce protein synthesis 2,3. BvGH123 was purified in a two-step procedure using IMAC and then gel filtration. Fractions containing BvGH123, eluting as a dimer, were combined and concentrated to approx 25 mg/ml and stored at 80 C. Protein crystallography Initial crystallisation conditions were established using commercial sparse matrix screens from Hampton Research and Molecular Dimensions. Subsequently crystals were optimised in 48 well screen sitting drop with a protein concentration of 7.5 mg/ml. Data were collected at the Diamond Light source (Didcot UK). Data were indexed and integrated using XDS 4 as part of Xia2 5 and subsequently scaled using AIMLESS 6. The experimental phasing was done using the Crank2 pipeline within i2 of the CCP4 software package 7,8,9. The partial structure was then further refined against a higher resolution native dataset. The model was further improved by alternating cycles of manual model building in real space with Coot and reciprocal space refinement with Refmac5 10,11. The quality of the final models was judged using Molprobity 12. Determination of stereochemistry of catalysis by 1 H NMR analysis BvGH123 catalysed hydrolysis of pnpgalnac was monitored by 1 H NMR spectroscopy using a 500 MHz instrument. A solution of BvGH123 in buffered D 2 O (0.15 ml, 0.095 mm in 10 mm sodium phosphate, pd 7.4) was added to a solution of pnpgalnac (4.0 mg, 19.3 mmol) in buffered D 2 O (0.6 ml, 10 mm sodium phosphate, pd 7.4 ) at 25 C. 1 H NMR spectra were acquired at time points (t = 0, 3, 22 and 38 min, 20 h). Kinetic analysis of BvGH123 ph dependence of activity. k cat /K M values for BvGH123 were measured for pnpgalnac hydrolysis using the substrate depletion method in a stopped assay. Reactions (500 l) were performed in 50 mm citrate/phosphate buffer, 150 mm NaCl at a range of ph values (4.0, 4.5, 5.0, 5.5, 6.0, 7.0) at 37 C. Reactions were initiated by the addition of 10 l of 0.106 µm BvGH123 to pnpgalnac (0.4 mm) in buffer, and aliquots (25 l) were quenched at different time points into 75 l glycine buffer (1 M, ph 10.0) in a 96-well plate. Absorbances were measured using a UV/visible plate reader (λ = 405 nm). Data (absorbance at 400 nm against time) were fitted to a first order rate equation (A t = A 0 (1-e -kt )+offset, where A t is the absorbance at time t, A 0 is the absorbance at time 0 and k is the rate constant) using the Prism 6 software package (Graphpad Scientific Software), to give values of V max /K M, which was adjusted for the enzyme concentration to give k cat /K M. The k cat /K M values at different ph values were fitted to a bell-shaped ionisation curve (k cat /K M = (limit x 10 (ph-pka1) /(10 (2xpH-pKa-pKa2) + 10 (ph-pka1) +1) using Prism 6. Michaelis-Menten kinetics. Kinetic parameters for BvGH123 hydrolysis of pnpgalnac and pnpglcnac, were measured using a Varian Cary50 UV/visible spectrophotometer to measure the release of 4-nitrophenol at the isosbestic point (λ = 348 nm). Reactions were performed in 50 mm sodium phosphate, 150 mm NaCl, ph 5 at 37 C using a final concentration of 17.8 nm BvGH123 7

at substrate concentrations ranging from 0.25 mm to 1.25 mm. Above this concentration the signalto-noise deteriorated, suggesting the presence of incompletely dissolved substrate. The extinction coefficient for 4-nitrophenol under the assay conditions was determined to be 6172 M -1 cm -1. Kinetic parameters were calculated using the Prism 6 software package (Graphpad Scientific Software). Size exclusion chromatography-multi-angle laser light scattering Multi angle light scattering was used to assess the oligomeric state of BvGH123. A sample of BvGH123 was applied on a Superdex 200 10/300 (GE Healthcare), pre-equilibrated with 10 mm HEPES ph 7.5, 250 mm NaCl, 1mM DTT, with a flow rate of 0.5 ml/min, using a Shimadzu HPLC system (SPD-20A UV detector, LC20-AD isocratic pump system, DGU-20A3 degasser and SIL- 20A autosampler). This was linked to a Wyatt HELEOS-II multi-angle light scattering detector and a Wyatt rex refractive index detector. The column was calibrated using BSA as standard Isothermal titration calorimetry The determination of the binding constant of Gal-thiazoline was carried out with an Auto- ITC 200 system from Malvern. A final concentration of 17.5 µm BvGH123 in 50 mm MES ph 6, 150 mm NaCl was added to the calorimeter. Gal-thiazoline was dissolved at 147 µm and used to titrate the protein. Injections of 2 µl were used with a spacing of 180s. The K D - value was calculated using the Malvern software. The measurements were carried out at 25 C. The K d -value was calculated using the Malvern analysis software. The Free Gibbs energy was determined to -46.1 kj/mol with an enthalpic contribution of 67.8 kj/mol and an slightly unfavourable entropic contribution of -21.7 kj/mol 8

Table S1: BvGH123 X-ray data collection, processing and refinement statistics. BvGH123SeMet BvGH123 apo BvGH123 GalNAc BvGH123 Gal-thiazoline PDB-ID 5l7r 5l7u 5l7v X-ray source Diamond-IO3 Diamond-IO4 Diamond-IO2 Diamond-IO2 Wavelength [Å] 0.979420 0.979490 0.91841 0.979500 Space group P2 1 2 1 2 1 P2 1 2 1 2 1 P2 1 2 1 2 1 P2 1 2 1 2 1 Resolution limit [Å] 2.7 1.85 2.1 2.3 Unit cell parameters a [Å] 55.0 55.3 55.4 55.5 b [Å] 144.7 147.8 142.2 148.4 c [Å] 145.7 150.5 145.8 147.7 Solvent content [%] 51.2 53.9 50.7 53.5 Monomers per asu 2 2 2 2 Total reflections 213617(28161) 859298(42494) 445720(28436) 389448(17480) Unique reflections 32604(4215) 106036(5148) 68163(4539 51335(2915) I/σ(I)* 11.6(1.4) 9.9(1.2) 12.0 (1.2) 15.6(1.1) completeness [%] 99.5(98.2) 99.9(100) 100(100) 93.4(63.6) R merge 0.105(1.421) 0.141(1.930) 0.104(1.530) 0.063(1.552) R meas 0.124(1.674) 0.151(2.056) 0.114(1.670) 0.068(1.692) R pim 0.066(0.877) 0.053(0.703) 0.044(0.664) 0.024(0.651) CC(1/2) 0.997(0.561) 0.998(0.331) 0.999(0.434) Refinement R cryst 0.181 0.193 0.187 R free 0.219 0.227 0.221 Number of Protein residues 1122 1111 1121 Ligand 0 2 2 Water 644 179 27 r.m.s.d. Bond (Å) 0.016 0.014 0.013 Angle ( ) 1.701 1.592 1.626 Average B-factor (Å 2 ) Protein 35.5 52.4 81.1 Ligand 71.8 61.1 Water 37.3 41.5 60.4 Ramachandran plot (%) favored/ allowed/disallowed 98/1.9/0.1 97.6/2.2/0.2 97.8/2.2/0 Values in parentheses refer to the highest resolution shell 9

References 1. M. J. Fogg, A. J. Wilkinson, Biochem. Soc. Trans., 2008, 36, 771 775. 2. H. K. Sreenath, C. A. Bingman, B. W. Buchan, K. D. Seder, B. T. Burns, H. V. Geetha, W. B. Jeon, F. C. Vojtik, D. J. Aceti, R. O. Frederick, G. N. Phillips, Jr, B. G. Fox, Protein Expr. Purif., 2005, 40, 256 267. 3. F. W. Studier, Methods Mol. Biol., 2014, 1091, 17 32. 4. W. Kabsch, Acta Crystallogr. D Biol. Crystallogr., 2010, 66, 125 132. 5. G. Winter, C. M. C. Lobley, S. M. Prince, Acta Crystallogr. D Biol. Crystallogr., 2013, 69, 1260 1273. 6. P. R. Evans, G. N. Murshudov, Acta Crystallogr. D Biol. Crystallogr., 2013, 69, 1204 1214. 7. N.. Collaborative Computational Project, Acta Crystallogr. D Biol. Crystallogr., 1994, 50, 760 763. 8. P. Skubák, N. S. Pannu, Nat. Commun., 2013, 4, 2777. 9. G. M. Sheldrick, Acta Crystallogr. A, 2008, 64, 112 122. 10. P. Emsley, B. Lohkamp, W. G. Scott, K. Cowtan, Acta Crystallogr. D Biol. Crystallogr., 2010, 66, 486 501. 11. G. N. Murshudov, P. Skubák, A. A. Lebedev, N. S. Pannu, R. A. Steiner, R. A. Nicholls, M. D. Winn, F. Long, A. A. Vagin, Acta Crystallogr. D Biol. Crystallogr., 2011, 67, 355 367. 12. V. B. Chen, W. B. Arendall, J. J. Headd, D. A. Keedy, R. M. Immormino, G. J. Kapral, L. W. Murray, J. S. Richardson, D. C. Richardson, Acta Crystallogr. D Biol. Crystallogr., 2010, 66, 12 21. 10