sybil: strepneumo: protein SPD_0562 [db=strepneumo_v15]
PROTEIN PROPERTIES properties of SPD_0562
property value
organism Streptococcus pneumoniae D39
product name beta-galactosidase precursor, putative
sequence length 2228 aa
created 2010-09-03 11:12:00
last modified 2010-09-03 11:12:00
DATABASE REFERENCES database refs for SPD_0562
database accession version
PROTEIN CLUSTERS clusters of which SPD_0562 is a member
cluster program algorithm analysis description
strepneumo_v15.match.2047595087.1 j_ortholog_clusters j_ortholog_clusters Ortholog Clusters
GENOMIC CONTEXT genomic context of the gene
sequence location strand
pneumoniae D39 (2046115bp) (2.05 Mb) 579653-586340 +
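The record does not state whether the location uses 0-based interbase coordinates or 1-based inclusive coordinates. As a rough consistency check, the sketch below computes the span length under both conventions and compares it with the coding length implied by the 2228 aa sequence length, assuming a single stop codon. The function name `span_length` and the constants are illustrative and are not part of Sybil.

```python
def span_length(start: int, end: int, interbase: bool) -> int:
    """Return the length in bp of a feature located at start..end.

    interbase=True  : 0-based, half-open coordinates (length = end - start)
    interbase=False : 1-based, inclusive coordinates (length = end - start + 1)
    """
    return end - start if interbase else end - start + 1


# Values copied from the record above.
START, END = 579653, 586340
PROTEIN_LENGTH_AA = 2228

# Assumption: the gene is one uninterrupted CDS ending in a single stop codon.
expected_cds_bp = (PROTEIN_LENGTH_AA + 1) * 3

for interbase in (True, False):
    length = span_length(START, END, interbase)
    label = "interbase" if interbase else "1-based inclusive"
    print(f"{label}: {length} bp, matches expected CDS: {length == expected_cds_bp}")
```

Under the stated assumption, only one of the two conventions gives a span divisible by three that matches the protein length, which hints at how the coordinates should be read.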
SEQUENCE the amino acid and nucleotide sequences in FASTA format
>SPD_0562 polypeptide
MGKGHWNRKRVYSIRKFAVGACSVMIGTCAVLLGGNIAGESVVYADETLITHTAEKPKEEKMIVEEKADK
ALETKNVVERTEQSEPSSTEAIASEKKEDEAVTPKEEKVSAKPEEKAPRIESQASSQEKPLKEDAKAVTN
EEVNQMIENRKVDFNQNWYFKLNANSKEAIKPDADVSTWKKLDLPYDWSIFNDFDHESPAQNEGGQLNGG
EAWYRKTFKLDEKDLKKNVRLTFDGVYMDSQVYVNGQLVGHYPNGYNQFSYDITKYLYKDGRENVIAVHA
VNKQPSSRWYSGSGIYRDVTLQVTDKVHVEKNGTTILTPKLEEQQHGKVETHVTSKIVNTDDKDHELVAE
YQIVERGGHAVTGLVRTASRTLKAHESTSLDAILEVERPKLWTVLNDKPALYELITRVYRDGQLVDAKKD
LFGYRYYHWTPNEGFSLNGERIKFHGVSLHHDHGALGAEENYKAEYRRLKQMKEMGVNSIRTTHNPASEQ
TLQIAAELGLLVQEEAFDTWYGGKKPYDYGRFFEKDATHPEARKGEKWSDFDLRTMVERGKNNPAIFMWS
IGNEIGEANGDAHSLATVKRLVKVIKDVDKTRYVTMGADKFRFGNGSGGHEKIADELDAVGFNYSEDNYK
ALRAKHPKWLIYGSETSSATRTRGSYYRPERELKHSNGPERNYEQSDYGNDRVGWGKTATASWTFDRDNA
GYAGQFIWTGTDYIGEPTPWHNQNQTPVKSSYFGIVDTAGIPKHDFYLYQSQWVSVKKKPMVHLLPHWNW
ENKELASKVADSEGKIPVRAYSNASSVELFLNGKSLGLKTFNKKQTSDGRTYQEGANANELYLEWKVAYQ
PGTLEAIARDESGKEIARDKITTAGKPAAVRLIKEDHAIAADGKDLTYIYYEIVDSQGNVVPTANNLVRF
QLHGQGQLVGVDNGEQASRERYKAQADGSWIRKAFNGKGVAIVKSTEQAGKFTLTAHSDLLKSNQVTVFT
GKKEGQEKTVLGTEVPKVQTIIGEAPEMPTTVPFVYSDGSRAERPVTWSLVDVSKPGIVTVKGMADGREV
EARVEVIALKSELPVVKRIAPNTNLNSVDKSVSYVLTDGSVQEYEVDKWEIAEEDKAKLAIPGSRIQATG
YLEGQPIHATLVVEEGNPAAPVVPTVTVGGEAVTGLTSRQPMQYRTLSYGAQLPEVTASAENADVTVLQA
SAANGMRASIFIQPKDGGPLQTYAIQFLEEAPKIAHLSLQVEKADSLKEDQTVKLSVRAHYQDGTQAVLP
ADKVTFSTSGEGEVAIRKGMLELHKPGAVTLNAEYEGAKGQVELTIQANTEKKIAQSIRPVNVVTDLHQE
PSLPATVTVEYDKGFPKTHKVTWQAIPKEKLDSYQIFEVLGKVEGIDLEARAKVSVEGIVSVEEVSVTTP
IAEAPQLPESVRTYDSNGHVSSAKVAWDAIRPEQYAKEGVFTVNGRLEGTQLTTKLHVRVSAQTEQGANI
SDQWTGSELPLAFASDSNPSDPVSNVNDKLISYNNQPANRWTNWNRSNPEASVGVLFGDSGILSKRSVDN
LSVGFHEDHGVGAPKSYVIEYYVGKTVPTAPKNPSFVGNEDHVFNDSANWKPVTNLKAPAQLKAGEMNHF
SFDKVETYAIRIRMVKADNKRGTSITEVQIFAKQVAAAKQGQTRIQVDGKDLANFNPDLTDYYLESVDGK
VPAVTANVSNNGLATVVPSVREGEPVRVIAKAENGDILGEYRLHFTKDKNLLSHKPVAAVKQARLLQVGQ
ALELPTKVPVYFTGKDGYETKDLTVEWEEVPAENLTKAGQFTVRGRVLGSNLVAEVTVRVTDKLGETLSD
NPNYDENSNQAFASATNDIDKNSHDRVDYLNDGDHSENRRWTNWSPTPSSNPEVSAGVIFRENGKIVERT
VAQAKLHFFADSGTDAPSKLVLERYVGPGFEVPTYYSNYQAYESGHPFNNPENWEAVPYRADKDIAAGDE
INVTFKAVKAKVMRWRMERKADKSGVAMIEMTFLAPSELPQESTQSKILVDGKELADFAENRQDYQITYK
GQRPKVSVEENNQVASTVVDSGEDSLPVLVRLVSESGKQVKEYRIQLTKEKPVSAVQEDLPKLEFVEKDL
AYKTVEKKDSTLYLGETRVEQEGKVGKERIFTVINPDGSKEEKLREVVEVPTDRIVLVGTKPVAQEAKKP
QVSEKADTKPIDSSEADQTNKAQLPNTGSAASQAAVAAGLALLGLSAGLVVTKGKKED
>SPD_0562 nucleotide
ATGGGGAAAGGCCATTGGAATCGGAAAAGAGTTTATAGCATTCGTAAGTTTGCTGTGGGAGCTTGCTCAG
TAATGATTGGGACTTGTGCAGTTCTATTAGGAGGAAATATAGCTGGAGAATCTGTAGTTTATGCGGATGA
AACACTTATTACTCATACTGCTGAGAAACCTAAAGAGGAAAAAATGATAGTAGAAGAAAAGGCTGATAAA
GCTTTGGAAACTAAAAATGTAGTTGAAAGGACAGAACAAAGTGAACCTAGTTCAACTGAGGCTATTGCAT
CTGAGAAGAAAGAAGATGAAGCCGTAACTCCAAAAGAGGAAAAAGTGTCTGCTAAACCGGAAGAAAAAGC
TCCAAGGATAGAATCACAAGCTTCAAGTCAAGAAAAACCGCTCAAGGAAGATGCTAAAGCTGTAACAAAT
GAAGAAGTGAATCAAATGATTGAAAACAGGAAAGTGGATTTTAATCAAAATTGGTACTTTAAACTCAATG
CAAATTCTAAGGAAGCCATTAAACCTGATGCAGACGTATCTACGTGGAAAAAATTAGATTTACCGTATGA
CTGGAGTATCTTTAACGATTTCGATCATGAATCTCCTGCACAAAATGAAGGTGGACAGCTCAACGGTGGG
GAAGCTTGGTATCGCAAGACTTTCAAACTAGATGAAAAAGACCTCAAGAAAAATGTTCGCCTTACTTTTG
ATGGCGTCTACATGGATTCTCAAGTTTATGTCAATGGTCAGTTAGTGGGGCATTATCCAAATGGTTATAA
CCAGTTCTCATACGATATCACCAAATACCTTTACAAAGATGGTCGTGAGAATGTGATTGCTGTCCATGCA
GTCAACAAACAGCCAAGTAGCCGTTGGTATTCAGGAAGTGGTATCTATCGTGATGTGACTTTACAAGTGA
CAGATAAGGTGCATGTTGAGAAAAATGGGACAACTATTTTAACACCAAAACTTGAAGAACAACAACATGG
CAAGGTTGAAACTCATGTGACCAGCAAAATCGTCAATACGGACGACAAAGACCATGAACTTGTAGCCGAA
TATCAAATCGTTGAACGAGGTGGTCATGCTGTAACAGGCTTAGTTCGTACAGCGAGTCGTACCTTAAAAG
CACATGAATCAACAAGCCTAGATGCGATTTTAGAAGTTGAAAGACCAAAACTCTGGACCGTTTTAAATGA
CAAACCTGCCTTGTACGAATTGATTACGCGTGTTTACCGTGACGGTCAATTGGTTGATGCTAAGAAGGAT
TTGTTTGGTTACCGTTACTATCACTGGACTCCAAATGAAGGTTTCTCTTTGAATGGTGAACGTATTAAAT
TCCATGGAGTATCCTTGCACCACGACCATGGGGCGCTTGGAGCAGAAGAAAACTATAAAGCAGAATATCG
CCGTCTCAAACAAATGAAGGAGATGGGAGTTAACTCCATCCGTACAACCCACAACCCTGCTAGTGAGCAA
ACCTTGCAAATCGCAGCAGAACTAGGTTTACTCGTTCAGGAAGAGGCCTTTGATACTTGGTATGGTGGCA
AGAAACCTTATGACTATGGACGTTTCTTTGAAAAAGATGCCACTCACCCAGAAGCTCGAAAAGGTGAAAA
ATGGTCTGATTTTGACCTACGTACCATGGTCGAAAGAGGCAAAAACAACCCTGCTATCTTCATGTGGTCA
ATTGGTAATGAAATAGGTGAAGCTAATGGTGATGCCCACTCTTTAGCAACTGTTAAACGTTTGGTCAAGG
TTATCAAGGATGTTGATAAGACTCGCTATGTTACCATGGGAGCAGATAAATTCCGTTTCGGTAATGGTAG
CGGAGGGCATGAGAAAATTGCTGATGAACTCGATGCTGTTGGATTTAACTATTCTGAAGATAATTACAAA
GCCCTTAGAGCTAAGCATCCAAAATGGTTGATTTACGGTTCAGAAACATCATCAGCAACCCGTACACGAG
GAAGTTACTATCGCCCTGAACGTGAATTGAAACATAGCAATGGACCTGAGCGTAATTATGAACAGTCAGA
TTATGGAAATGATCGTGTGGGTTGGGGGAAAACAGCAACCGCTTCATGGACTTTTGACCGTGACAACGCT
GGCTATGCTGGACAGTTTATCTGGACAGGTACGGACTATATTGGTGAACCTACACCATGGCACAACCAAA
ATCAAACTCCCGTTAAGAGCTCTTACTTTGGTATCGTAGATACAGCCGGCATTCCAAAACATGACTTCTA
TCTCTACCAAAGCCAATGGGTTTCTGTTAAGAAGAAACCGATGGTACACCTTCTTCCTCACTGGAACTGG
GAAAACAAAGAATTAGCATCCAAAGTAGCTGACTCAGAAGGTAAGATTCCAGTTCGTGCTTATTCGAATG
CTTCTAGTGTAGAATTGTTCTTGAATGGAAAATCTCTTGGTCTTAAGACTTTCAATAAAAAACAAACCAG
CGATGGGCGGACTTACCAAGAAGGTGCAAATGCTAATGAACTTTATCTTGAATGGAAAGTTGCCTATCAA
CCAGGTACCTTGGAAGCAATTGCTCGTGATGAATCTGGCAAGGAAATTGCTCGAGATAAGATTACGACTG
CTGGTAAGCCAGCGGCAGTTCGTCTTATTAAGGAAGACCATGCGATTGCAGCAGATGGAAAAGACTTGAC
TTACATCTACTATGAAATTGTTGACAGCCAGGGGAATGTGGTTCCAACTGCTAATAATCTGGTTCGCTTC
CAATTGCATGGCCAAGGTCAACTGGTCGGTGTAGATAACGGAGAACAAGCCAGCCGTGAACGCTATAAGG
CGCAAGCAGATGGTTCTTGGATTCGTAAAGCATTTAATGGTAAAGGTGTTGCCATTGTCAAATCAACTGA
ACAAGCAGGGAAATTCACCCTTACTGCCCACTCTGATCTCTTGAAATCGAACCAAGTCACTGTCTTTACT
GGTAAGAAAGAAGGACAAGAAAAGACTGTTTTGGGGACAGAGGTGCCAAAAGTACAGACCATTATTGGAG
AGGCACCTGAAATGCCTACCACTGTTCCGTTTGTATACAGTGATGGTAGTCGTGCAGAACGTCCTGTAAC
CTGGTCTTTAGTAGATGTGAGCAAGCCTGGTATTGTAACGGTGAAAGGTATGGCTGACGGACGAGAAGTA
GAAGCTCGTGTAGAAGTGATTGCTCTTAAATCAGAGCTACCAGTTGTGAAACGTATTGCTCCAAATACTA
ACTTGAATTCTGTAGACAAATCTGTTTCCTATGTTTTGACTGATGGAAGTGTACAAGAGTATGAAGTGGA
CAAGTGGGAGATTGCCGAAGAAGATAAAGCTAAGTTAGCAATTCCAGGTTCTCGTATTCAAGCGACCGGT
TATTTAGAAGGTCAACCAATTCATGCAACCCTTGTGGTAGAAGAAGGCAATCCTGCAGCACCTGTAGTGC
CAACTGTTACTGTTGGAGGTGAAGCAGTAACAGGTCTTACTAGTCGACAACCAATGCAATATCGTACTCT
ATCTTATGGTGCCCAATTGCCAGAAGTCACAGCAAGTGCTGAAAATGCTGATGTGACAGTTCTTCAAGCA
AGCGCAGCAAACGGCATGCGTGCGAGCATCTTTATTCAGCCTAAAGATGGTGGCCCTCTTCAAACCTATG
CAATTCAATTCCTTGAAGAAGCGCCAAAAATTGCTCACTTGAGCTTGCAAGTGGAAAAAGCTGACAGTCT
CAAAGAAGACCAAACTGTCAAATTGTCGGTTCGAGCTCACTATCAAGATGGAACGCAAGCTGTATTACCA
GCTGATAAAGTAACCTTCTCTACAAGTGGTGAAGGGGAAGTCGCAATTCGTAAAGGAATGCTTGAGTTGC
ATAAGCCAGGAGCAGTCACTCTGAACGCTGAATATGAGGGAGCTAAAGGCCAAGTTGAACTCACTATCCA
AGCCAATACTGAGAAGAAGATTGCGCAATCTATCCGTCCTGTAAATGTAGTCACAGATTTGCATCAGGAA
CCAAGTCTTCCAGCAACAGTAACAGTTGAGTATGACAAAGGTTTCCCTAAAACTCATAAAGTCACTTGGC
AAGCTATTCCGAAAGAAAAACTAGACTCCTATCAAATATTTGAAGTACTAGGTAAAGTTGAAGGAATTGA
CCTTGAAGCGCGTGCAAAAGTCTCTGTAGAAGGTATCGTTTCAGTTGAAGAAGTCAGTGTGACAACTCCA
ATCGCAGAAGCACCACAATTACCAGAAAGCGTTCGGACATATGATTCAAATGGTCACGTTTCATCAGCTA
AGGTTGCATGGGATGCGATTCGTCCAGAGCAATACGCTAAGGAAGGTGTCTTTACAGTTAATGGTCGCTT
AGAAGGTACTCAATTAACAACTAAACTTCATGTTCGCGTATCTGCTCAAACTGAGCAAGGTGCAAACATT
TCTGACCAATGGACCGGTTCAGAATTGCCACTTGCCTTTGCTTCAGATTCAAATCCAAGCGACCCAGTTT
CAAATGTTAATGACAAGCTCATTTCCTACAATAACCAACCAGCCAATCGTTGGACAAACTGGAATCGTAG
TAATCCAGAAGCTTCAGTCGGTGTTCTGTTTGGAGATTCAGGTATCTTGAGCAAACGCTCCGTTGATAAT
CTAAGTGTCGGATTCCACGAAGACCATGGAGTTGGTGCACCGAAGTCTTATGTGATTGAGTATTATGTTG
GTAAGACTGTCCCAACAGCTCCTAAAAACCCTAGTTTTGTTGGTAATGAGGACCATGTCTTTAATGATTC
TGCCAACTGGAAACCAGTTACTAATCTAAAAGCCCCTGCTCAACTCAAGGCTGGAGAAATGAACCACTTT
AGCTTTGATAAAGTTGAAACCTATGCTATTCGTATTCGCATGGTTAAAGCAGATAACAAGCGTGGAACGT
CTATCACAGAGGTACAAATCTTTGCGAAACAAGTTGCGGCAGCCAAACAAGGACAAACAAGAATCCAAGT
TGACGGCAAAGACTTAGCAAACTTCAACCCTGATTTGACAGACTACTACCTTGAGTCTGTAGATGGAAAA
GTTCCGGCAGTCACAGCAAATGTTAGCAACAATGGTCTCGCTACCGTCGTTCCAAGCGTTCGTGAAGGTG
AGCCAGTTCGTGTCATCGCGAAAGCTGAAAATGGCGACATCTTAGGAGAATACCGTCTGCACTTCACTAA
GGATAAGAACTTACTTTCTCATAAACCAGTTGCTGCGGTTAAACAAGCTCGCTTGCTACAAGTAGGTCAA
GCACTTGAATTGCCGACTAAGGTTCCAGTTTACTTCACAGGTAAAGACGGCTACGAAACAAAAGACCTGA
CAGTTGAATGGGAAGAAGTTCCAGCGGAAAATCTGACAAAAGCAGGTCAATTTACTGTTCGAGGCCGTGT
CCTTGGTAGTAACCTTGTTGCTGAGGTCACTGTACGAGTGACAGACAAACTTGGTGAGACTCTTTCAGAT
AACCCTAACTATGATGAAAACAGTAACCAGGCCTTTGCTTCAGCAACCAATGATATTGACAAAAACTCTC
ATGACCGCGTTGACTATCTCAATGACGGAGATCATTCAGAAAATCGTCGTTGGACAAACTGGTCACCAAC
ACCATCTTCTAATCCAGAAGTATCAGCGGGTGTGATCTTCCGTGAAAATGGTAAGATTGTAGAACGGACT
GTTGCTCAAGCCAAACTTCACTTCTTTGCAGATAGTGGTACGGATGCACCATCTAAACTCGTTTTAGAAC
GCTATGTCGGCCCAGGCTTTGAAGTACCTACCTACTATTCAAACTACCAAGCCTACGAATCTGGACATCC
ATTTAACAATCCAGAAAATTGGGAAGCTGTGCCTTATCGTGCGGATAAAGACATCGCAGCTGGTGATGAA
ATCAACGTAACATTTAAAGCTGTCAAAGCCAAAGTCATGAGATGGCGTATGGAGCGTAAAGCTGACAAGA
GCGGTGTTGCGATGATTGAGATGACCTTCCTTGCACCAAGTGAATTGCCTCAAGAAAGCACTCAATCAAA
GATTCTTGTAGATGGAAAAGAACTTGCTGATTTCGCTGAAAATCGTCAAGACTATCAAATTACCTATAAA
GGTCAACGGCCAAAAGTCTCAGTTGAAGAAAACAATCAAGTAGCTTCAACTGTGGTAGATAGTGGAGAAG
ATAGCCTTCCAGTACTTGTTCGCCTCGTTTCAGAAAGTGGAAAACAAGTCAAGGAATACCGTATCCAGTT
GACTAAGGAAAAACCAGTTTCTGCTGTACAAGAAGATCTTCCAAAACTCGAATTTGTTGAAAAAGATTTG
GCCTACAAGACAGTTGAGAAAAAAGATTCAACACTGTATCTAGGTGAAACTCGTGTAGAACAAGAAGGAA
AAGTTGGAAAAGAACGTATCTTTACAGTGATTAATCCTGATGGAAGTAAGGAAGAAAAACTCCGTGAAGT
GGTAGAAGTTCCGACAGACCGCATCGTCTTGGTTGGAACCAAACCAGTAGCTCAAGAAGCTAAAAAACCA
CAAGTGTCAGAAAAAGCAGATACAAAACCAATTGATTCAAGTGAAGCTGATCAAACTAATAAAGCCCAGT
TACCAAATACAGGTAGTGCGGCAAGCCAAGCAGCAGTAGCAGCAGGTTTAGCTCTTCTAGGTTTGAGTGC
AGGATTAGTAGTTACTAAAGGTAAAAAAGAAGACTAG
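
The two FASTA records above can be cross-checked by translating the nucleotide record and comparing it with the polypeptide record. The following is a minimal sketch that assumes the standard genetic code (the page does not say which translation table was used) and uses only the Python standard library. `parse_fasta` and `translate` are illustrative helpers, and the demo input is a truncated excerpt containing only the first six codons of the record.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def parse_fasta(text: str) -> dict:
    """Parse FASTA text into {header: sequence}, joining wrapped lines."""
    records = {}
    header = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            header = line[1:]
            records[header] = []
        elif header is not None:
            records[header].append(line)
    return {h: "".join(parts) for h, parts in records.items()}


def translate(nt: str) -> str:
    """Translate a nucleotide string codon by codon; unknown codons become 'X'."""
    nt = nt.upper()
    return "".join(CODON_TABLE.get(nt[i:i + 3], "X") for i in range(0, len(nt) - 2, 3))


# Truncated excerpt of the record (first six codons only), for illustration.
demo = """>SPD_0562 polypeptide
MGKGHW
>SPD_0562 nucleotide
ATGGGGAAAGGC
CATTGG
"""

records = parse_fasta(demo)
protein = records["SPD_0562 polypeptide"]
translated = translate(records["SPD_0562 nucleotide"])
print(translated == protein)
```

When run against the full record, the translation carries a trailing `*` for the stop codon, so strip it (for example with `translated.rstrip("*")`) before comparing with the polypeptide.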
sybil web site: sybil.sourceforge.net e-mail: driley@som.umaryland.edu