SUPPLEMENTARY EXPERIMENTAL PROCEDURES

Yeast two-hybrid assay

The yeast two-hybrid assay was performed as described in Assmann et al. (2006) [01], with the baits SCOCO (2-82) and SCOCO (2-65) and the prey FEZ1 (221-392).

In silico modeling

In the YASARA homology modeling routine, the alignment with the target sequence is obtained from several sources of information, including sequence-based and structure-based profiles for both the target and the template. A structure-based alignment correction, based on SSALN matrices [02], uses structural information to avoid gaps in secondary structure elements [03]. In the presented alignment (Supplementary Figure 4), which displays 20 of the 47 template profile alignments, 227 of the 392 target residues (57.9% of the total sequence) were aligned to template residues. Among these aligned residues, sequence identity is 18.9% and 39.9% are similar (BLOSUM62 score > 0); 16 loops had to be modeled (Supplementary Table 1). The final model, based on the template with PDB ID 1XD4, had Z-scores of 0.366 for dihedrals, -3.052 for Packing 1D, -1.743 for Packing 3D and -1.947 overall, and was therefore considered a satisfactory model. Per-residue Z-scores are shown in Supplementary Figure 5.

SUPPLEMENTARY REFERENCES

[01] Assmann EM, Alborghetti MR, Camargo MER, Kobarg J (2006) FEZ1 dimerization and interaction with transcription regulatory proteins involves its coiled-coil region. J Biol Chem 281: 9869-9881. doi:10.1074/jbc.M513280200.
[02] Qiu J, Elber R (2006) SSALN: An alignment algorithm using structure-dependent substitution matrices and gap penalties learned from structurally aligned protein pairs. Proteins 62: 881-891.
[03] King RD, Sternberg MJE (1996) Identification and application of the concepts important for accurate and reliable protein secondary structure prediction. Protein Sci 5: 2298-2310.
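The overall Z-score reported under In silico modeling is numerically consistent with a weighted sum of the three component Z-scores. The sketch below illustrates that arithmetic. The weights (0.145, 0.390, 0.465) are an assumption: they are not stated in this document and are used here only because they reproduce the reported overall value to within rounding. The function and variable names are likewise illustrative.

```python
# ASSUMPTION: these weights are not given in this document. They are used
# because they reproduce the reported overall Z-score from its components.
ASSUMED_WEIGHTS = {"dihedrals": 0.145, "packing_1d": 0.390, "packing_3d": 0.465}


def overall_z_score(dihedrals: float, packing_1d: float, packing_3d: float) -> float:
    """Return the weighted sum of the three component Z-scores."""
    return (
        ASSUMED_WEIGHTS["dihedrals"] * dihedrals
        + ASSUMED_WEIGHTS["packing_1d"] * packing_1d
        + ASSUMED_WEIGHTS["packing_3d"] * packing_3d
    )


# Component Z-scores reported for the FEZ1 model built on template 1XD4.
reported = {"dihedrals": 0.366, "packing_1d": -3.052, "packing_3d": -1.743}
REPORTED_OVERALL = -1.947

computed = overall_z_score(**reported)
print(f"computed overall Z-score: {computed:.4f}")
print(f"difference from reported value: {abs(computed - REPORTED_OVERALL):.4f}")
```

Under these assumed weights the packing terms dominate, which is why the overall score stays close to -2 even though the dihedral Z-score is positive.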

SUPPLEMENTARY FIGURES

SUPPLEMENTARY FIGURE 1. Interaction between FEZ1 and SCOCO in an independent experiment. A) Purified recombinant proteins FEZ1 and SCOCO were incubated, chemically cross-linked, digested with trypsin, and analyzed by MS. MS/MS spectra were manually validated for b and y ion series of the α (peptide of FEZ1) and β (peptide of SCOCO) chains. The same peptides were found in two independent experiments.

SUPPLEMENTARY FIGURE 2. Assay for β-galactosidase activity in yeast cells shows that FEZ1 (221-392) interacts with the coiled-coil region of SCOCO (2-82 and 2-65); no interaction was detected with the empty vector pBTM116 (0). In the model, FEZ1 is shown in green and SCOCO in deep blue. The SCOCO region used in the two-hybrid assay is shown in light blue. The helix highlighted in cyan corresponds to the minimal region of FEZ1/UNC-76 that interacts with SCOCO/UNC-69.

SUPPLEMENTARY FIGURE 3. A) Relationship between binding energy and cross-linked lysine pair distance between FEZ1 and SCOCO. The docking routine yielded models that exhibit both an optimal Cα-Cα distance between lysine pairs (>20 Å) and a favorable binding energy. B) General model of FEZ1-SCOCO as a heterotetramer (2:2 ratio). FEZ1 is colored in green and SCOCO in deep blue.

Q59EU2 :...EEKLSLCFR...PPRTAVRPITERSLLQGDEIWNA (AA)
C3XZ65 : MAAPLAQIDDEWLD...VDNVDNCGFGVVSFKSMEDLVHEFDETLNICFRNYNAKTDSIAPV...EEIMENSELWQG (AB)
UPI00020:... (AC)
UPI00019:...IWNA (AD)
UPI00016:...AHVVALTCAEDAAKHLDETISVCFRSVHARTDTADPV...SLLEDDEVWIA (AE)
E0CYY9 : MEAPLVSLDEEFEDIRPSCTEEPEEKPQCLYGTSPHHLEDPSLSELENFSSEIISFKSMEDLVNEFDEKLNVCFRNYNAKTESLAPVKNQLQIQEEEETLRDEEVWDA (AF)
UPI00020:...FKSMEDLVNEFDEKLSVCFRNYSTDTGTIAPV...MKDDEIWNA (AG)
E1C355 : MEAPLVSLEEEF...GGVPRRTMDPALAELESFSTEMMSFKSMEDLVQEFDEKLTVCFRNYDATTEGLAPVRGRLQAQEEEEHLQDEEVWDA (AH)
Q6PBT3 : MEAPLVCLDEEFEDLRPCKMEELEEQPPC...FSELENF..EMMSFKSMEDLVNEFDEKLNVCFHNYNTKTEGLAPVRNQSHAEEDEERLQDEDVWDA (AI)
Q99689 : MEAPLVSLDEEFEDLRPSCSEDPEEKPQCFYGSSPHHLEDPSLSELENFSSEIISFKSMEDLVNEFDEKLNVCFRNYNAKTENLAPVKNQLQIQEEEETLQDEEVWDA (AJ)
SecStr : CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHCCCCCHHHHCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHH (AK)
Target : MEAPLVSLDEEFEDLRPSCSEDPEEKPQCFYGSSPHHLEDPSLSELENFSSEIISFKSMEDLVNEFDEKLNVCFRNYNAKTENLAPVKNQLQIQEEEETLQDEEVWDA
Match : L E:F L:D I:S KS E N : :: R : E D : :
Template:...EYRLKEKFF...ILKDE...VIFSAKSAEEKNNWMAALISLQYR...STLERMLDVTMLQE
SecStr :...EEEEEEEEC...ETTTT...EEEEECHHHHHHHHHHHHHHHHHC...HHHHHHHHHHHHHH (AL)
UPI00015:...EYRLKEKFF...ILKDE...VIFSAKSAEEKNNWMAALISLQYR...STLERMLDVTMLQE (AM)
UPI00004:...EYRLKEKFF...ILKDE...VIFAAKSAEEKNNWMAALISLQYR...STLERMLDVTMSQE (AN)
UPI00016:...EYRLKEKFF...ILKDG...VVFAAKSAEEKNSWMAALISLQYH...STLERMLDSALLRE (AO)
F1N8Q9 :...EYRLKEKII...VSKDE...ILFAAKSAEEKSNWMAALISLQYR...STLDRMLDSVLLQE (AP)
Q07890 :...EYRLKEKFV...VSKDE...IIFAAKSAEEKNNWMAALISLHYR...STLDRMLDSVLLKE (AQ)
UPI00020:...EYRLKEKIV...VPKDE...FIFAAKSAEEKNNWMAVLISLQYR...STLDRMLDAVLLQE (AR)
F1R019 :...EFRLKEKFV...VGREE...AVFSARSAEEKSAWMAALVTLQYR...PTLDRMLDTVLHRE (AS)
UPI00016:...EYRLKEKFV...IGKDE...AVFCARTAEEKAAWMAALVTLQYR...STLDRMLDTVLQHE (AT)
Q4T188 :...EYRLKEKFF...ILKDG...VVFAAKSAEEKNSWMAALISLQYH...STLERMLDSAMLRE (AU)
UPI0001E:...EYRLKEKFF...ILKDE...VIFSAKSAEEKNNWMAALISLQYR...STLERMLDVTMLQE (AV)
C3XZI5 :...EYRLKETFF...VARDQ...TIFFAKNEDEKNNWMAALITLHCR...STLERMLDSILLEE (AW)
UPI0000E:...EYRFKERYY...EVKDS...IVFFAKSAEEKTSWMEALITLLHR...STLERMLDTILQQE (AX)
F4WM45 :...ELRLKERLF...APRDH...VILVAKSVEDKNNWMADLVMLNTK...SMLERTLDSILLDE (AY)
E9H784 :...DYRLKEKFF...APRQQ...CVLFTRTAEEKANWMAALVMLNTK...SMLERTLDVILLDE (AZ)
D2A5R8 :...ECRLKERFF...APRHQ...VILCAKTAEEKNSWMADLVMLNTR...SMLERILDSILSDI (BA)
UPI00017:...KFKLKEKLF...FPRLE...ITLVARTFEDKANWMADLLMLTSK...SILDRTLNNILLDE (BB)
E2AK89 :...ELRLKERFF...APRDH...VILVAKSVEDKNNWMADLVMLNTK...SMLERTLDSILLDE (BC)
UPI0001C:...SVKDQ...TVFFAKSAEDKANWMAVLTALHYR...STLERMLDTILTDE (BD)
Q9N5D3 :...LKFKDRLA...ESHDK...LHFVCKNPEEKRQWMAVLVKVTTK...SVLDRILDNHEKEE (BE)
E5SJ49 :...RQRDQ...IVLLAADPDEKCTWLAALLMLQTR...SMLERMLDSHLEAE (BF)

Q59EU2 : LTDNYGNVMPVDW...NDEPLFTADQVIEEIEEMMQESPDPEDDE...TQSDRLSMLSQEIQTLKR... (AA)
C3XZ65 : LTDNFANVLPVDW...SLTQEPIFTAEEVIEEIDEIM...SVLSQEMLALRERSN (AB)
UPI00020:...VIEEIEEMMQESPDP...TQSDRLSMLSQEIQTLKRSST (AC)
UPI00019: LTDNYGNVMPVDWKSSHTRA...DEELREQ...NEEPLFTAEQVIEEIEEMMQESPDPED...TQSDRLSILSQEIQTLKRSST (AD)
UPI00016: LTSNYGLVRPVDWKQ...PSDEEELREQLDM...EEPLLTAEQVIEEIEEIMQDSP... (AE)
E0CYY9 : LTDNYIPSLSEDWRDPNIEALNGNSSDIEIHEKEEEEFNEKSENDSGINEEPLLTADQV... (AF)
UPI00020: LTDNYGNVMPVDWKTSHARAL..NLLDKAVNE.DDEELREQLD...NEEPLFTAEQVIEEIEEMMQESPDP...SMLSQEIQTL... (AG)
E1C355 : LTDGFVP...WMHPEAEAPDG...DPQLCEKEDEELAERSEHDSGINEEPLLTAEQVIEELEELMQSSPDP.DEEEEEDEEEDAEANAEGDS.VLHELHTFSPTFN (AH)
Q6PBT3 : LTDNYMPSSVSSWDDPTSETLNGNLSDQEIHEKDEDEMNEKNENANCLNEEPLITADQVIEEIVEMMENSPDPGETEE..EEDDIGVSSSKSSPSLLEEIRMLSQASN (AI)
Q99689 : LTDNYIPSLSEDWRDPNIEALNGNCSDTEIHEKEEEEFNEKSENDSGINEEPLLTADQVIEEIEEMMQNSPDPEEEEEVLEEEDGGETSSQADSVLLQEMQALTQTFN (AJ)
SecStr : HHCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCCCCHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHCCC (AK)
Target : LTDNYIPSLSEDWRDPNIEALNGNCSDTEIHEKEEEEFNEKSENDSGINEEPLLTADQVIEEIEEMMQNSPDPEEEEEVLEEEDGGETSSQADSVLLQEMQALTQTFN
Match : S :I :E P A VI: IE: DP: : L : : S: ::
Template:...DSEENIIFEE...IPIIKAGTVIKLIERL...ADPNFVRTFLTTYRSFCKPQELLSLIIERFE...
SecStr :...CTTTTCEECC...CCEEEECHHHHHHHHH...CCHHHHHHHHHTTTTCCHHHHHHHHHHHHHH... (AL)
UPI00015:...DSEENIVFEE...IPIIKAGTVIKLIERL...ADPNFVRTFLTTYRSFCKPQELLSLLIGRFE... (AM)
UPI00004:...DSEENIVFEE...IPIIKAGTVVKLIERL...ADPNFVRTFLTTYRSFCKPQELLSLLIERFE... (AN)
UPI00016:...DSEENVVFED...IPIIKAGTVLKLIERL...ADPNFVRTFLTTYRSFCKPQELLDLLMERFE... (AO)
F1N8Q9 :...DSEDNIVFED...IPIIKGGTVVKLIERL...ADPNFVRTFLTTYRSFCKPQELLSLLIERFE... (AP)
Q07890 :...DSEENIVFED...IPIIKGGTVVKLIERL...ADPNFVRTFLTTYRSFCKPQELLSLLIERFE... (AQ)
UPI00020:...DSEENIVFED...IPLIKGGTVVKLIERL...ADPNFVRTFLTTYRSFCKPQELLSLLIERFE... (AR)
F1R019 :...DSEENIVFEE...IPIIKAGTVVKLIERL...ADPNFVRTFLTTYRSFCKPQDLLTLLIERIE... (AS)
UPI00016:...DSEENIVFED...IPIIKAGTVVKLIERL...ADPNFVRTFLTTYRSFCKPQELLTLLIERIE... (AT)
Q4T188 :...DSEENVVFEE...IPIIKAGTVLKLIERL...ADPNFVRTFLTTYRSFCKPQELLELLMERFE... (AU)
UPI0001E:...DSEENIIFEE...IPIIKAGTVIKLIERL...ADPNFVRTFLTTYRSFCKPQELLSLIIERFE... (AV)
C3XZI5 :...DRPDNIVFED...VPVIKGGSLIKLIERL...ADPTFVRTFLTTYRSISKPCELLDLLIERFE... (AW)
UPI0000E:...DSPDNIVFEE...VPLIKGGTLFKLIERL...ADPSFVRTFLTTYRSFCSPQDLLSLLFKRFE... (AX)
F4WM45 :...DSPENIVLEA...VPLIKGATLIKLVERL...ADPAFVRTFLTTYRSFCSPQELLALLIERFD... (AY)
E9H784 :...DSETNLVLET...VPLIKGATLLKLIERL...ADPMLVRTFLTTYRSFCSPTELLELLIERFE... (AZ)
D2A5R8 :...DSKDNIILEQ...VPLIKGATLYKLVERL...ADPKFVRTFLTTYRSFCSPKELLDLLIERFQ... (BA)
UPI00017:...DSRSNIIFEE...IPLIKGAILLKLIERL...ADPKFVKTFLTIYRSFCSPHDLMDLLIERYK... (BB)
E2AK89 :...DSKENIVLEA...VPLIKGATLIKLVERL...ADPAFVRTFLTTYRSFCSPQELLTLLIERFD... (BC)
UPI0001C:...DSEDNIVFED...IQVIRGGSLLKLIERL...ANPSFVRTFLTTYRSFSSPYIMCNNSLRTFE... (BD)
Q9N5D3 :...DTEDNISFED...IPVIKCGTVLKLIERL...TDSKYILTFLISYRSFCTPNDLFSLLLERFN... (BE)
E5SJ49 :...DSEDNIVLED...TS..SSGIPVLLIERF...PDPS...AGTMAAALLS... (BF)

SUPPLEMENTARY FIGURE 4 (part 1/3). Profile alignment of 20 templates for FEZ1 protein.

Q59EU2 :...VKRLSVSELNEILEEIETAIKEYSEELVQQLALRDELEFEKEVKNSFISVLIEVQNKQKEHKETAKKKKKLKNGSSQNGKNER... (AA)
C3XZ65 : TVTSY..LKKLPLQTLNEVLEELEATIKEYSEILIQQLALRDELVYEKEVKNQFISALVAVQNRQREHKQNKDNKKKRK... (AB)
UPI00020: NNSYEERVKRLSVAELNELLEEIETAIKDYSEELVQQLALRDELEFEKEVKNSFISVLIEVQNKQREHKETAKKKKKLKNGSPQNGKQERG..MPGTRFSMEGISNVI (AC)
UPI00019: NNSYEERVKRLSVAELNELLEEIETAIKDYSEELVQQLALRDELEFEKEVKNSFISVLIEVQNKQREHKETAKKKKKLKNGSPQNGKQERG..MPGTRFSMEGISNVI (AD)
UPI00016:...LRALSVANLNEHLEGTETDIRRFSEELVQQLALRDELDFEKEVKNTFISTLIDVQNRQKEQRELMKRKRSDGATGASASHISSLSFSPPQRFSMEALSSAI (AE)
E0CYY9 :... (AF)
UPI00020:...VKMMSVAELNDLLEEIETAIKDYSEELIQQLALRDELEFEKEVKNSFISVLIEVQNKQKEHKENVKKKRRVKSANAQNGTKE...MPGTRFSMEGLSNVI (AG)
E1C355 : NNCSHEGLEQLTAGELAAAAGRAEAASRALSAELVAQLARRDELAFEKEVKTAFIGALLAVQGEQREQREAARRRRRDRGLSMQGGRPERGGHMPRKRFSMEGISSIL (AH)
Q6PBT3 : NNCSYEGLRLMPSSALLDILCRMEAAIREYSEELVAQLARRDELEFEKEVKNTFITALMEVQNRQKEQRELSKRRRKDKAMSL...RPEKTGSMPAKRFSMEGLSNIL (AI)
Q99689 : NNWSYEGLRHMSGSELTELLDQVEGAIRDFSEELVQQLARRDELEFEKEVKNSFITVLIEVQNKQKEQRELMKKRRKEKGLSLQSSRIEKGNQMPLKRFSMEGISNIL (AJ)
SecStr : CCCCHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCHHHH (AK)
Target : NNWSYEGLRHMSGSELTELLDQVEGAIRDFSEELVQQLARRDELEFEKEVKNSFITVLIEVQNKQKEQRELMKKRRKEKGLSLQSSRIEKGNQMPLKRFSMEGISNIL
Match : :: : : : ::EL :: : :: :: ::: FE : : : ::: :: : :: :QSS
Template: TEADRIAIE...SAELKRFRKEYIQPVQLRVLNVCRHWVEHHFYDFEAYLLQRMEEFIGTMKKWVESITKIIQRKKI...FQSSP...
SecStr : HHHHHHHHH...HHHHHHHHHCHHHHHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH...CCCCC... (AL)
UPI00015: TEADRRAME...SAELKRFRKEYIQPVQLRVLNVCRHWVEHHFYDFEADLLQRLEEFIGTMKKWVESITKIIQRKKQ...FESSP... (AM)
UPI00004: SEADRIAME...SAELKRFRKEYVQPVQLRVLNVCRHWVEHHFYDFESDLLIRLEEFIGTMKKWVESITKIINRKKP...FESLP... (AN)
UPI00016: SELDQL..E...SPEVKRFRKEFVQPVQLRVLNVCRHWVEHHFYDFEPQLLRTLEEFISSMRKWVESITKIIQRKKP...FQNSP... (AO)
F1N8Q9 : TEADRLAIE...SADLKRFRKEYVQPVQLRILNVFRHWVEHHFYDFELELLERLETFISSMKKWVESIAKIIKRKKI...FESPP... (AP)
Q07890 : TDADKLAIE...SADLKRFRKEYVQPVQLRILNVFRHWVEHHFYDFELELLERLESFISSMKKWVESIAKIIRRKKV...FESPP... (AQ)
UPI00020: TEADRLAIE...SADLKRFRKEYVQPVQLRILNVFRHWVEHHYYDFEQELLDRLETFISTMKKWVESIAKIIRRKKV...FESPP... (AR)
F1R019 : SEADREALW...AAELQRFRREYVQPVQLRVLNVFRQWVEHHFYDFEPELYERLEGYLTQMRKWVESISKIMRRKGI...FESPP... (AS)
UPI00016: TEEDRQALW...AAELQRFRKEYVQPVQLRVLNVFRQWVEHHFYDFEPELRSHLEEYITSMRKWVESINKIIKRKGV...FESPP... (AT)
Q4T188 : SELDQM..E...SPEVKRFRKEFVQPVQLRVLNVCRHWVEHHFYDFEPNLLRTLEEFIASMRKWVESITKIIQRKKP...FQNSP... (AU)
UPI0001E: TEADRIAIE...SAELKRFRKEYIQPVQLRPPNLCRH...PYYDFEADLLQQMEXFIVIMKKWVESITKIIQRKKI...FQSSP... (AV)
C3XZI5 : TPEDQAAID...REDLKRFRKEYAQPVQLRVLNVFRHWVDHHFYDFEKELLDKLMEFIEGMRKWVDSITKIVLRENV...FERTP... (AW)
UPI0000E: TEEDQASID...REDLKRFRKEYAQPIQLRVLNVFRHWVDQHFYDFESELLGSLEKFIDSMRKWVESIKKIIMRREQ...FEKDP... (AX)
F4WM45 : VYGEEEK.P...REDWKRYRKEFCQPVQFRVLNVLRHWVDHHFYDFERNLLERLQLFLDTMRKWVDSVIKIVQRKQR...FERSP... (AY)
E9H784 : DETSGLGVD...SRELKRFRKQYMQPVQFRVLNVLRHWVDHHFYDFEPGLLQRLQGFLGVVRKWVESIHKIVARREE...FDRSP... (AZ)
D2A5R8 : ECCDTDKMQ...REDWKKYRKEYVQPVQFRVLNVLRHWVDHHFYDFEPTLLVKLRDFLDAMRKWVDSVLKILHRKME...FDEPP... (BA)
UPI00017:...GIT...RDETKRFKKEYLIPVQFRVLNVFRHWVDYHYYDYQPYLLDKLYEFLDSIKKLTDSLIKIIHRKEA...FDSPP... (BB)
E2AK89 : VYGEEEKPS...REDWKRYRKEFCQPVQFRVLNVLRHWVDHHFYDFERNLLERLQLFLDTMRKWVDSVLKIVQRKQR...FERSP... (BC)
UPI0001C: TEEDESALR...REDLKKFRKEYAQPVQLRVLNVFRHWVDQHFYDFEPELLERLMEFIDNMRKWVESIKKIVQRRSL...FDQTP... (BD)
Q9N5D3 : LQQPKTVQS...SSSSQKFRKEFQQPIQLRVLSVINQWVKLHWYDFQPVLLDALELFLNRHKKFCKTILALIEKRDE...FQQHP... (BE)
E5SJ49 : ENKSNMDVS...RESLKRLRKEYIQPVRIRVLNIIRQWIDHHWYDFDAELLANLKHFLKRVVKWTRSLQNIIDRK... (BF)

Q59EU2 :... (AA)
C3XZ65 :...LYTVIPYEKKNSPMTAEDLQILTKILVAMSEDSDKVPSLLTDYILK... (AB)
UPI00020: QNGFRHTFGNSGGEKQYLTTVIPYEKKNGPPSVEDLQTLTKILHAMKEDSEKVPSLLTDYILK... (AC)
UPI00019: QNGFRHTFGNSSGEKQYLTTVIPYEKKNGPPSVEDLQTLTKILHAMKEDSEKVPSLLTDYILKVLCPT (AD)
UPI00016: QSSFRNTFGSSCSEPQYLTTVIPYEKKGRPPSLRDLQILTKILQAMRDDSDKVPGLLTDYILQVLCPT (AE)
E0CYY9 :... (AF)
UPI00020: QNSFRQTFGNPGGEKQYLTTVIPYEEKNGPPSIEDLQILTKILRAMKDDSDKVPSLLTDYILKVLCPT (AG)
E1C355 : HSGLRQTFGPAANEKQYLNTVIPYEKKGSPPSVEDLQMLTNILFAMKEGNEKVPTLLTDYILKVLCPT (AH)
Q6PBT3 : QTGIRQTFGSSGTEKQYLNTVIPYEKKGTPPSVDDLQMLTKILYAMKEDSEKVPTLLTDYILKVLCPT (AI)
Q99689 : QSGIRQTFGSSGTDKQYLNTVIPYEKKASPPSVEDLQMLTNILFAMKEDNEKVPTLLTDYILKVLCPT (AJ)
SecStr : HCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHCCCC (AK)
Target : QSGIRQTFGSSGTDKQYLNTVIPYEKKASPPSVEDLQMLTNILFAMKEDNEKVPTLLTDYILKVLCPT
Match : L T: P : L : : :E V LK: T
Template:...DLLTL...HPIEIARQLTLLESDLYRA..SELVG...PNLLKMIRHT
SecStr :...TTTTC...HHHHHHHHHHHHHHHHHHH..GGTTT...HHHHHHHHHH (AL)
UPI00015:...DLLTL...HPIEIARQLTLLESDLYRA..SELVG...PNLLKMIRHT (AM)
UPI00004:...DLLTL...HPIEIARQLTLLESDLYRA..SELVG...PNLLKMIRHT (AN)
UPI00016:...DLMTL...HPIEIARQLTLLESDFFRA..SELVG...PNLLRMIRHT (AO)
F1N8Q9 :...DLMTL...HPIEIARQLTLLESDLYRA..SELVG...PNLLKMIRHT (AP)
Q07890 :...DLMTL...HPIEIARQLTLLESDLYRK..SELVG...PNLLKMIRHT (AQ)
UPI00020:...DLLTL...HPIEIARQLTLLESDLYRA..SELVG...PNLLKMIRHT (AR)
F1R019 :...DLMTL...HPIEIARQLTLLESDLYRA..SELVG...PNLLRMIRHT (AS)
UPI00016:...DLMTL...HPIEIARQLTLLESELYRA..SELVG...PNLLRMIRHT (AT)
Q4T188 :...DLMTL...HPIEIARQLTLLESDFFRA..SELVG...PNLLRMIRHT (AU)
UPI0001E:...DLLTL...HPIEIARQLTLLESDLYRC..Q...ILHS (AV)
C3XZI5 :...ELLTL...HPIEIARQLTLLEFDLYRA..SELVG...PNLLKMIHHS (AW)
UPI0000E:...DIMTL...HPIEIARQLTLLESDLYRA..SELVG...PNLLKMIHHS (AX)
F4WM45 :...GILTL...HPIELARQLTLLEFELYKT..SELVG...PNLLKMIKHT (AY)
E9H784 :...DLMTL...HPIEIARQLTLLEFDLYRA..SELVG...PNLLRMIHHT (AZ)
D2A5R8 :...GILTL...HPVEIARQLTILEFDLYRM..SELVG...PNLLKMIKHT (BA)
UPI00017:...NILLV...HPLEFARQLTLIESENYRA..SELVG...PNLLRLIKRT (BB)
E2AK89 :...GILTI...HPIELARQLTLLEFELYRV..SELVG...PNLMKMIKHT (BC)
UPI0001C:...DLMTL...HPVEIARQLTLLEFDLFRA..SELVG...PNLLKMIQQT (BD)
Q9N5D3 :...DLLTL...HPIEIGRQLTLLHSDLYRA..IELVE...PQLLRLTDHS (BE)
E5SJ49 :...L...HPIEIARQITLLEFDLYRA..IELVG...PQLLKLIQHS (BF)

SUPPLEMENTARY FIGURE 4 (part 2/3). Profile alignment of 20 templates for FEZ1 protein.

SUBTITLES:
(AA) [uniref90 Q59EU2 Fasciculation and elongation protein zeta 2 variant (Fragment) n=5 Tax=Eutheria RepID=Q59EU2_HUMAN]
(AB) [uniref90 C3XZ65 Putative uncharacterized protein (Fragment) n=1 Tax=Branchiostoma floridae RepID=C3XZ65_BRAFL]
(AC) [uniref90 UPI0002034F43 PREDICTED: visual pigment-like receptor peropsin-like n=1 Tax=Meleagris gallopavo RepID=UPI0002034F43]
(AD) [uniref90 UPI000194C19E PREDICTED: similar to zygin 2 n=1 Tax=Taeniopygia guttata RepID=UPI000194C19E]
(AE) [uniref90 UPI00016E8843 UPI00016E8843 related cluster n=1 Tax=Takifugu rubripes RepID=UPI00016E8843]
(AF) [uniref90 E0CYY9 Uncharacterized protein n=3 Tax=Eutheria RepID=E0CYY9_MOUSE]
(AG) [uniref90 UPI0002065F2E UPI0002065F2E related cluster n=1 Tax=Xenopus (Silurana) tropicalis RepID=UPI0002065F2E]
(AH) [uniref90 E1C355 Uncharacterized protein n=2 Tax=Phasianidae RepID=E1C355_CHICK]
(AI) [uniref90 Q6PBT3 Fasciculation and elongation protein zeta 1 (Zygin I) n=1 Tax=Danio rerio RepID=Q6PBT3_DANRE]
(AJ) [uniref90 Q99689 Fasciculation and elongation protein zeta-1 n=18 Tax=Theria RepID=FEZ1_HUMAN]
(AK) [Secondary structure predicted by PsiPred]
(AL) [Secondary structure assigned by YASARA]
(AM) [uniref90 UPI0001554446 PREDICTED: similar to son of sevenless homolog 1 (Drosophila) n=1 Tax=Ornithorhynchus anatinus RepID=UPI0001554446]
(AN) [uniref90 UPI00004D0C97 Son of sevenless homolog 1 (SOS-1). n=2 Tax=Xenopus (Silurana) tropicalis RepID=UPI00004D0C97]
(AO) [uniref90 UPI00016E51B5 UPI00016E51B5 related cluster n=7 Tax=Tetraodontidae RepID=UPI00016E51B5]
(AP) [uniref90 F1N8Q9 Uncharacterized protein (Fragment) n=4 Tax=Amniota RepID=F1N8Q9_CHICK]
(AQ) [uniref90 Q07890 Son of sevenless homolog 2 n=25 Tax=Mammalia RepID=SOS2_HUMAN]
(AR) [uniref90 UPI000203AE7E PREDICTED: son of sevenless homolog 2-like n=1 Tax=Anolis carolinensis RepID=UPI000203AE7E]
(AS) [uniref90 F1R019 Uncharacterized protein (Fragment) n=5 Tax=Danio rerio RepID=F1R019_DANRE]
(AT) [uniref90 UPI00016E23BC UPI00016E23BC related cluster n=4 Tax=Tetraodontidae RepID=UPI00016E23BC]
(AU) [uniref90 Q4T188 Chromosome undetermined SCAF10698, whole genome shotgun sequence n=1 Tax=Tetraodon nigroviridis RepID=Q4T188_TETNG]
(AV) [uniref90 UPI0001E86FA2 PREDICTED: LOW QUALITY PROTEIN: son of sevenless homolog 1-like n=1 Tax=Sus scrofa RepID=UPI0001E86FA2]
(AW) [uniref90 C3XZI5 Putative uncharacterized protein (Fragment) n=1 Tax=Branchiostoma floridae RepID=C3XZI5_BRAFL]
(AX) [uniref90 UPI0000E471A9 PREDICTED: similar to Son of sevenless homolog 2 (Drosophila) n=1 Tax=Strongylocentrotus purpuratus RepID=UPI0000E471A9]
(AY) [uniref90 F4WM45 Protein son of sevenless n=1 Tax=Acromyrmex echinatior RepID=F4WM45_9HYME]
(AZ) [uniref90 E9H784 Putative uncharacterized protein n=1 Tax=Daphnia pulex RepID=E9H784_DAPPU]
(BA) [uniref90 D2A5R8 Histone H2A n=2 Tax=Tribolium castaneum RepID=D2A5R8_TRICA]
(BB) [uniref90 UPI0001791FA8 PREDICTED: protein son of sevenless-like n=1 Tax=Acyrthosiphon pisum RepID=UPI0001791FA8]
(BC) [uniref90 E2AK89 Protein son of sevenless n=1 Tax=Camponotus floridanus RepID=E2AK89_9HYME]
(BD) [uniref90 UPI0001CBABEA PREDICTED: son of sevenless homolog 1-like n=1 Tax=Saccoglossus kowalevskii RepID=UPI0001CBABEA]
(BE) [uniref90 Q9N5D3 Drosophila sos homolog protein 1 n=2 Tax=Caenorhabditis elegans RepID=Q9N5D3_CAEEL]
(BF) [uniref90 E5SJ49 Protein son of sevenless n=1 Tax=Trichinella spiralis RepID=E5SJ49_TRISP]

SUPPLEMENTARY FIGURE 4 (part 3/3). Profile alignment of 20 templates for FEZ1 protein.
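The coverage and identity figures quoted for this alignment (227 of 392 target residues aligned, 18.9% identity) are simple ratios over aligned columns. The sketch below shows one way such ratios could be computed from a pair of gapped strings. It is a minimal illustration and not the YASARA procedure: the function name, the gap characters and the toy sequences are all hypothetical, and the BLOSUM62-based similarity percentage is not reproduced because it needs a substitution matrix.

```python
GAP_CHARS = {"-", "."}  # assumed gap symbols for this illustration


def alignment_stats(target_aln: str, template_aln: str):
    """Return (aligned_residues, coverage_percent, identity_percent).

    Coverage is aligned residues over all target residues; identity is
    identical pairs over aligned residues.
    """
    if len(target_aln) != len(template_aln):
        raise ValueError("aligned strings must have equal length")
    # Count every target residue, aligned or not.
    target_len = sum(ch not in GAP_CHARS for ch in target_aln)
    # Keep only columns where both sequences have a residue.
    pairs = [
        (t, m)
        for t, m in zip(target_aln, template_aln)
        if t not in GAP_CHARS and m not in GAP_CHARS
    ]
    identical = sum(t == m for t, m in pairs)
    coverage = 100.0 * len(pairs) / target_len if target_len else 0.0
    identity = 100.0 * identical / len(pairs) if pairs else 0.0
    return len(pairs), coverage, identity


# Toy, hypothetical alignment (not taken from Supplementary Figure 4).
aligned, coverage, identity = alignment_stats("MEAPLVSL--DE", "ME--LVTLKKDE")
print(aligned, coverage, identity)

# Coverage reported in the text: 227 aligned residues out of 392.
reported_coverage = round(100.0 * 227 / 392, 1)
print(reported_coverage)
```

Identity is computed here over aligned columns only, which matches how the text reports it ("among these residues"); dividing by the full target length instead would give a much lower number.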

SUPPLEMENTARY FIGURE 5. Per-residue Z-score of the FEZ1 protein model (plot title: Model quality per residue).

SUPPLEMENTARY TABLE 1. Modeled loops in FEZ1 protein

Loop  N-terminal anchor  Loop sequence                          C-terminal anchor
1     LQSSR              IEKGNQMPLKRFSMEGISNILQSGIRQTFGSSGTDKQ  YLNTV
2     IHEKE              EEEFNEKSENDSGINE                       EPLLT
3     DNEKV              PTLLT                                  DYILK
4     LQEMQ              ALTQTFN                                NNWSY
5     HLEDP              SLSELENFSS                             EIISF
6     EIEEM              MQNS                                   PDPEE
7     ELMKK              RRKEKGLS                               LQSSR
8     EVWDA              LTDNYIPSLSEDWRDPNIEALNGN               CSDTE
9     LFAMK              EDN                                    EKVPT
10    NVCFR              NYNAKTENLAPVKNQLQIQ                    EEEET
11    ELEFE              KEV                                    KNSFI
12    ITVLI              EVQN                                   KQKEQ
13    WSYEG              LRHMS                                  GSELT
14    QYLNT              VIPYEKKA                               SPPSV
15    DEEFE              DLRPSCSEDPEEKPQCFYGSSPH                HLEDP
16    None               MEAP                                   LVSLD