sybil: strepneumo: protein SPCG_0603 [db=strepneumo_v15]
PROTEIN PROPERTIES properties of SPCG_0603
property value
organism Streptococcus pneumoniae CGSP14
product name beta-galactosidase
sequence length 2233 aa
created 2010-09-03 11:12:00
last modified 2010-09-03 11:12:00
DATABASE REFERENCES database refs for SPCG_0603
database accession version
PROTEIN CLUSTERS clusters of which SPCG_0603 is a member
cluster program algorithm analysis description
strepneumo_v15.match.2047595087.1 j_ortholog_clusters j_ortholog_clusters Ortholog Clusters
GENOMIC CONTEXT genomic context of the gene
sequence location strand
pneumoniae CGSP14 (2209198bp) (2.21 Mb) 604119-610821 +
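The location above can be cross-checked against the listed sequence length of 2233 aa. The sketch below assumes the coordinates are 0-based interbase positions (so the span is simply end minus start) and that the coding sequence ends in a single stop codon; the page does not state either convention, and the function name `cds_matches_protein` is hypothetical.

```python
def cds_matches_protein(start: int, end: int, protein_len: int) -> bool:
    """Check that a gene span encodes a protein of the given length.

    Assumes interbase coordinates (span = end - start) and one
    terminal stop codon that is not counted in the protein length.
    """
    span = end - start          # nucleotides covered by the gene
    if span % 3 != 0:           # a coding span must be whole codons
        return False
    codons = span // 3          # includes the stop codon
    return codons - 1 == protein_len


# Values taken from this record: location 604119-610821, length 2233 aa.
print(cds_matches_protein(604119, 610821, 2233))
```

Under these assumptions the span is 6702 nt, which is 2234 codons: 2233 amino acids plus the stop codon. If the coordinates were instead read as 1-based inclusive, the span would be 6703 nt, which is not a multiple of three.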
BLASTP HITS proteins with significant BLASTP matches
SEQUENCE the amino acid sequence in FASTA format
>SPCG_0603 polypeptide
MGKGHWNRKRVYSIRKFAVGACSVMIGTCAVLLGGNIAGESVVYADETLITHTAEKPKEEKMIVEEKADK
ALETKNVVERTEQSEPSSTEAIASEKKEDEAVTPKEEKVSAKPEEKAPRIESQASSQEKPLKEDAKAVTN
EEMNQMIEDRKVDFNQNWHFKLNANSKEAIKPDADVSTWKKLDLPYDWSIFNNFDHESPAQNEGGQLNGG
EAWYRKTFKLDEKDLKKNVRLTFDGVYMDSQVYVNGQLVGHYPNGYNQFSYDITKYLHKDGRENVIAVHA
VNKQPSSRWYSGSGIYRDVTLQVTDKVHVEKNGTTILTPKLEEQQHGKVETHVTSKIVNTDDKDHELVAE
YQIVERGGHAVTGLVRTASRTLKAHESTSLDAILEVERPKLWTVLNDKPALYELITRVYRDGQLVDAKKD
LFGYRYYHWTPNEGFSLNGERIKFHGVSLHHDHGALGAEENYKAEYRRLKQMKEMGVNSIRTTHNPASEQ
TLQIAAELGLLVQEEAFDTWYGGKKPYDYGRFFEKDATHPEARKGEKWSDFDLRTMVERGKNNPAIFMWS
IGNEIGEANGDAHSLATVKRLVKVIKDVDKTRYVTMGADKFRFGNGSGGHEKIADELDAVGFNYSEDNYK
ALRAKHPKWLIYGSETSSATRTRGSYYRPERELKHSNGPERNYEQSDYGNDRVGWGKTATASWTFDRDNA
GYAGQFIWTGTDYIGEPTPWHNQNQTPVKSSYFGIVDTAGIPKHDFYLYQSQWVSVKKKPMVHLLPHWNW
ENKELASKVADSEGKIPVRAYSNASSVELFLNGKSLGLKTFNKKQTSDGRTYQEGANANELYLEWKVAYQ
PGTLEAIARDESGKEIARDKITTAGKPAAVRLIKEDHAIAADGKDLTYIYYEIVDSQGNVVPTANNLVRF
QLHGQGQLVGVDNGEQASRERYKAQADGSWIRKAFNGKGVAIVKSTEQAGKFTLTAHSDLLKSSQVTVFT
GKKEGQEKTVLGTEVPKVQTIIGEAPEMPTTVPFLYSDGSRAERPVTWSSVDVSKPGIVTVKGMADGREV
EARVEVIALKSELPVVKRIAPNTDLNSVDKSVSYVLTDGSVQEYEVDSWEITEVDKAKLSVAGSRIQMTG
QLAGETIHATLVVEEGNAAAPVVPTVTVGGEAVTGLTSRQPMQYRTLSYGAQLPEVTASAENADVTVLQA
SAANGMRASIFVQPKDGGPLQTYAIQLLEEAPKIAHLSLQVEKADSLKEDQTVKLSVRAHYQDGTQAVLP
ADKVTFSTSGEGEVAIRKGMLELHKPGAVTLKAEYEGAKGQVELTIQANTEKKIAQSIRPVNVVTDLHQK
PTLPTTVTVEYDKGFPKAHKVTWQAIPKEKLDSYQTFEVLGKVEGIDLEARAKVSVEGIVSVEEVSVTTP
IAEAPQLPESVRTYDSNGHVSSAKVAWDAIRPEQYAKEGVFTVNGRLEGTQLTTKLHVRVSAQTEQGANI
SDQWTGSELPLAFASDSNPSDPVSNVNDKLISYNNQPANRWTNWNRTNPEASVGVLFGDSGILSKRSVDN
LSVGFHEDHGVGAPKSYVIEYYVGKTVPTAPKNPSFVGNEDHVFNDSANWKPVTNLKAPAQLKAGEMNHF
SFDKVETYAVRIRMVRADNKLGTSITEVQIFAKQVAAAKQGQTRIQVDGKDLANFNPDLTDYYLESVDGK
VTAVTASVSNNGLATVVPSVREGEPVRVIAKAENGDILGEYRLHFTKDKNLLSHKPVAAVKQARLLQVGQ
ALELPTKVPVYFTGKDGYETKNLTVEWEEVPAENLIKAGQFTVRGHVLGSNLVAEITVRVTDKLGETLSD
NPNYDENSNQAFASATNDIDKNSHDRVDYLNDGDHSENRRWTNWSPTPSSNPEVSAGVIFRENGKIVERT
VAQGKVQFFADSGTDAPSKLVLERYVGPEFEVPTYYSNYQAYDADHPFNNPENWEAVPYRADKDIAAGDE
INVTFKDIKAKAMRWRMERKADKSGVAMIEMTFLAPSELPQESTQSKILVDGKELADFAENRQDYQITYK
GQRPKVSVEENNQVASTVVDSGEDSLPVLVRLVSESGKQVKEYRIHLTKEKPVSDKTVAAVQEDLPKLEF
VEKDLAYKTVEKKDSTLYLGETRVEQEGKVGKERIFTAINPDGSKEEKLREVVEVPTDRIVLVGTKPVAQ
EAKKPQVSEKADTKPIDSSEASQTNKAQLPNTGSAAGQAAVAAGLALLGLSAGLVVTKGKKED
>SPCG_0603 nucleotide
ATGGGGAAAGGCCATTGGAATCGGAAAAGAGTTTATAGCATTCGTAAGTTTGCTGTGGGAGCTTGCTCAG
TAATGATTGGGACTTGTGCAGTTCTATTAGGAGGAAATATAGCTGGAGAATCTGTAGTTTATGCGGATGA
AACACTTATTACTCATACTGCTGAGAAACCTAAAGAGGAAAAAATGATAGTAGAAGAAAAGGCTGATAAA
GCTTTGGAAACTAAAAATGTAGTTGAAAGGACAGAACAAAGTGAACCTAGTTCAACTGAGGCTATTGCAT
CTGAGAAGAAAGAAGATGAAGCCGTAACTCCAAAAGAGGAAAAAGTGTCTGCTAAACCGGAAGAAAAAGC
TCCAAGGATAGAATCACAAGCTTCAAGTCAAGAAAAACCGCTCAAGGAAGATGCTAAAGCTGTAACAAAT
GAAGAAATGAATCAAATGATTGAAGACAGGAAAGTGGATTTTAATCAAAATTGGCACTTTAAACTCAATG
CAAATTCTAAGGAAGCCATTAAACCTGATGCAGACGTATCTACGTGGAAAAAATTAGATTTACCGTATGA
CTGGAGTATCTTTAACAATTTCGATCATGAATCTCCTGCACAAAATGAAGGTGGACAACTCAACGGTGGG
GAAGCTTGGTATCGCAAGACTTTCAAACTAGATGAAAAAGACCTCAAGAAAAATGTTCGCCTTACTTTTG
ATGGCGTCTACATGGATTCTCAAGTTTATGTCAATGGTCAGTTAGTGGGGCATTATCCAAATGGTTATAA
CCAGTTCTCATATGATATCACCAAATACCTTCACAAAGATGGTCGTGAGAATGTGATTGCTGTCCATGCA
GTCAACAAACAGCCAAGTAGCCGTTGGTATTCAGGAAGTGGTATCTATCGTGATGTGACTTTACAAGTGA
CAGATAAGGTGCATGTTGAGAAAAATGGGACAACTATTTTAACACCAAAACTTGAAGAACAACAACATGG
CAAGGTTGAAACTCATGTGACCAGCAAAATCGTCAATACGGACGACAAAGACCATGAACTTGTAGCCGAA
TATCAAATCGTTGAACGAGGTGGTCATGCTGTAACAGGCTTAGTTCGTACAGCGAGTCGTACCTTAAAAG
CACATGAATCAACAAGCCTAGATGCGATTTTAGAAGTTGAAAGACCAAAACTCTGGACCGTTTTAAATGA
CAAACCTGCCTTGTACGAATTGATTACGCGTGTTTACCGTGACGGTCAATTGGTTGATGCTAAGAAGGAT
TTGTTTGGTTACCGTTACTATCACTGGACTCCAAATGAAGGTTTCTCTTTGAATGGTGAACGTATTAAAT
TCCATGGAGTATCCTTGCACCACGACCATGGGGCGCTTGGAGCAGAAGAAAACTATAAAGCAGAATATCG
CCGTCTCAAACAAATGAAGGAGATGGGAGTTAACTCTATCCGTACAACCCACAACCCTGCTAGTGAGCAA
ACCTTGCAAATCGCAGCAGAACTAGGTTTACTCGTTCAGGAAGAGGCCTTTGATACGTGGTATGGTGGCA
AGAAACCTTATGACTATGGACGTTTCTTTGAAAAAGATGCCACTCACCCAGAAGCTCGAAAAGGTGAAAA
ATGGTCTGATTTTGACCTACGTACCATGGTCGAAAGAGGCAAAAACAACCCTGCTATCTTCATGTGGTCA
ATTGGTAATGAAATAGGTGAAGCTAATGGTGATGCCCACTCTTTAGCAACTGTTAAACGTTTGGTCAAGG
TTATCAAGGATGTTGATAAGACTCGCTATGTTACCATGGGAGCAGATAAATTCCGTTTCGGCAATGGTAG
CGGAGGGCATGAGAAAATTGCTGATGAACTCGATGCTGTTGGATTTAACTATTCTGAAGATAATTACAAA
GCCCTTAGAGCTAAGCATCCAAAATGGTTGATTTACGGTTCAGAAACATCATCAGCAACCCGTACACGAG
GAAGTTACTATCGCCCTGAACGTGAATTGAAACATAGCAATGGACCTGAGCGTAATTATGAACAGTCAGA
TTATGGAAATGATCGTGTGGGTTGGGGGAAAACAGCAACCGCTTCATGGACTTTTGACCGTGACAACGCT
GGCTATGCTGGACAGTTTATCTGGACAGGTACGGACTATATTGGTGAACCTACACCATGGCACAACCAAA
ATCAAACTCCTGTTAAGAGCTCTTACTTTGGTATCGTAGATACAGCCGGCATTCCAAAACATGACTTCTA
TCTCTACCAAAGCCAATGGGTTTCTGTTAAGAAGAAACCGATGGTACACCTTCTTCCTCACTGGAACTGG
GAAAACAAAGAATTAGCATCCAAAGTAGCTGACTCAGAAGGTAAGATTCCAGTTCGTGCTTATTCGAATG
CTTCTAGTGTAGAATTGTTCTTGAATGGAAAATCTCTTGGTCTTAAGACTTTCAATAAAAAACAAACCAG
CGATGGGCGGACTTACCAAGAAGGTGCAAATGCTAATGAACTTTATCTTGAATGGAAAGTTGCCTATCAA
CCAGGTACCTTGGAAGCAATTGCTCGTGATGAATCTGGCAAGGAAATTGCTCGAGATAAGATTACGACTG
CTGGTAAGCCAGCGGCAGTTCGTCTTATTAAGGAAGACCATGCGATTGCAGCAGATGGAAAAGACTTGAC
TTACATCTACTATGAAATTGTTGACAGCCAGGGGAATGTGGTTCCAACTGCTAATAATCTGGTTCGCTTC
CAATTGCATGGCCAAGGTCAACTGGTCGGTGTAGATAACGGAGAACAAGCCAGCCGTGAACGCTATAAGG
CGCAAGCAGATGGTTCTTGGATTCGTAAAGCATTTAATGGTAAAGGTGTTGCCATTGTCAAATCAACTGA
ACAAGCAGGGAAATTCACCCTTACTGCTCACTCTGATCTCTTGAAATCTAGTCAAGTCACTGTCTTTACT
GGTAAGAAAGAAGGACAAGAAAAGACTGTTTTGGGGACAGAGGTGCCAAAAGTACAGACCATTATTGGAG
AGGCACCTGAAATGCCTACCACTGTTCCGTTTCTATACAGTGATGGTAGTCGTGCAGAACGTCCTGTAAC
CTGGTCTTCAGTAGATGTGAGCAAGCCTGGTATTGTAACGGTGAAAGGTATGGCTGACGGACGAGAAGTA
GAAGCTCGTGTAGAAGTGATTGCTCTTAAATCAGAGCTACCAGTTGTGAAACGTATTGCTCCAAATACTG
ACTTGAATTCTGTAGACAAATCTGTTTCCTATGTTTTGACTGATGGAAGTGTACAAGAGTACGAAGTAGA
TAGCTGGGAGATTACGGAAGTAGATAAAGCCAAACTTTCAGTAGCTGGATCACGTATTCAAATGACTGGT
CAGTTAGCTGGAGAAACTATTCATGCAACCCTTGTGGTAGAAGAAGGAAATGCTGCAGCACCTGTAGTGC
CAACTGTTACTGTTGGAGGTGAAGCAGTAACAGGTCTTACTAGTCGACAACCAATGCAATATCGTACTCT
ATCTTATGGTGCCCAATTGCCAGAAGTCACAGCAAGTGCTGAAAATGCTGATGTGACAGTTCTTCAAGCA
AGCGCAGCAAACGGCATGCGTGCAAGCATATTTGTTCAGCCTAAAGATGGTGGCCCTCTTCAAACCTATG
CAATTCAATTACTAGAAGAAGCACCAAAAATTGCTCACTTGAGCTTGCAAGTGGAAAAAGCTGACAGTCT
CAAAGAAGACCAAACTGTCAAATTGTCGGTTCGAGCTCACTATCAAGATGGAACGCAAGCTGTATTACCA
GCTGATAAAGTAACCTTCTCTACAAGTGGTGAAGGGGAAGTCGCAATTCGTAAAGGAATGCTTGAGTTGC
ATAAGCCAGGAGCAGTCACTCTCAAAGCGGAATATGAGGGAGCTAAAGGCCAAGTTGAACTCACTATCCA
AGCCAATACTGAGAAGAAGATTGCGCAATCTATCCGTCCTGTAAATGTAGTCACAGATTTGCATCAAAAA
CCTACTCTTCCAACAACAGTAACGGTTGAGTATGACAAAGGTTTCCCTAAAGCTCACAAAGTCACTTGGC
AAGCTATTCCGAAAGAAAAACTAGACTCCTATCAAACATTTGAAGTACTAGGTAAAGTTGAAGGAATTGA
CCTTGAAGCGCGTGCAAAAGTCTCTGTAGAAGGTATCGTTTCAGTTGAAGAAGTCAGTGTGACAACTCCA
ATCGCAGAAGCACCACAATTACCAGAAAGTGTTCGGACATATGATTCAAATGGTCACGTTTCATCAGCTA
AGGTTGCATGGGATGCGATTCGTCCAGAGCAATACGCTAAGGAAGGTGTCTTTACAGTTAATGGTCGCTT
AGAAGGTACTCAATTAACAACTAAACTTCATGTTCGCGTATCTGCTCAAACTGAGCAAGGTGCAAACATT
TCTGACCAATGGACCGGTTCAGAATTGCCACTTGCCTTTGCTTCAGACTCAAATCCAAGCGACCCAGTTT
CAAATGTTAATGACAAGCTCATTTCCTACAATAACCAACCAGCCAATCGTTGGACAAACTGGAATCGTAC
TAATCCAGAAGCTTCAGTCGGTGTCCTATTCGGAGATTCAGGTATTTTGAGCAAACGTTCAGTTGATAAC
TTAAGCGTTGGCTTCCATGAAGACCATGGAGTTGGTGCACCGAAGTCTTATGTGATTGAGTATTATGTTG
GTAAGACTGTCCCAACAGCTCCTAAAAACCCTAGTTTTGTTGGTAATGAGGACCATGTCTTTAATGATTC
TGCCAACTGGAAACCAGTTACTAATCTAAAAGCCCCTGCTCAACTCAAGGCTGGAGAAATGAACCACTTT
AGCTTTGATAAAGTTGAAACCTATGCTGTTCGTATTCGCATGGTGAGAGCAGACAACAAACTAGGAACGT
CTATCACAGAGGTACAAATCTTTGCGAAACAAGTTGCGGCAGCCAAGCAAGGACAAACAAGAATCCAAGT
TGACGGTAAAGACTTAGCAAACTTCAACCCTGATTTGACAGACTACTACCTTGAGTCTGTAGATGGAAAA
GTTACGGCAGTCACAGCAAGTGTTAGCAACAATGGCCTCGCTACCGTCGTTCCAAGCGTTCGTGAAGGTG
AGCCGGTTCGTGTCATCGCGAAAGCTGAAAATGGCGACATCTTAGGAGAATACCGTCTGCACTTCACTAA
GGATAAGAACTTACTTTCTCATAAACCAGTTGCTGCGGTTAAACAAGCTCGCTTGCTACAAGTAGGTCAA
GCACTTGAATTGCCGACTAAGGTTCCAGTTTACTTCACAGGTAAAGACGGCTACGAAACAAAAAACCTGA
CAGTTGAATGGGAAGAAGTTCCAGCGGAAAATCTGATAAAAGCAGGTCAATTTACCGTTCGAGGCCATGT
CCTTGGTAGTAACCTTGTTGCTGAGATCACTGTACGAGTGACAGACAAACTTGGTGAGACTCTTTCAGAT
AACCCTAACTATGATGAAAACAGTAACCAGGCCTTTGCTTCAGCAACCAATGATATTGACAAAAACTCTC
ATGACCGCGTTGACTATCTCAATGACGGAGATCATTCAGAAAATCGTCGTTGGACAAACTGGTCACCAAC
ACCATCTTCTAATCCAGAAGTATCAGCGGGTGTGATTTTCCGTGAAAATGGTAAGATTGTAGAACGGACT
GTTGCACAAGGAAAAGTTCAGTTCTTTGCAGATAGTGGTACGGATGCACCATCTAAACTCGTTTTAGAAC
GCTATGTCGGTCCAGAGTTTGAAGTGCCAACCTACTATTCAAACTACCAAGCCTACGACGCAGACCATCC
ATTCAACAATCCAGAAAATTGGGAAGCTGTTCCTTATCGTGCGGATAAAGACATTGCAGCTGGTGATGAA
ATCAACGTAACATTTAAAGATATCAAAGCCAAAGCTATGAGATGGCGTATGGAGCGTAAAGCAGATAAGA
GCGGTGTTGCGATGATTGAGATGACCTTCCTTGCACCAAGTGAATTGCCTCAAGAAAGCACTCAATCAAA
GATTCTTGTAGATGGAAAAGAACTTGCTGATTTCGCTGAAAATCGTCAAGACTATCAAATTACCTATAAA
GGTCAACGGCCAAAAGTCTCAGTTGAAGAAAACAATCAAGTAGCTTCAACTGTGGTAGATAGTGGAGAAG
ATAGCCTTCCAGTACTTGTTCGCCTCGTTTCAGAAAGTGGAAAACAAGTCAAGGAATACCGTATCCACTT
GACTAAGGAAAAACCAGTTTCTGATAAGACAGTTGCTGCTGTACAAGAAGATCTTCCAAAACTCGAATTT
GTTGAAAAAGATTTGGCATACAAGACAGTTGAGAAAAAAGATTCAACACTGTATCTAGGTGAAACTCGTG
TAGAACAAGAAGGAAAAGTTGGAAAAGAACGTATCTTTACAGCGATTAATCCTGATGGAAGTAAGGAAGA
AAAACTCCGTGAAGTGGTAGAAGTTCCGACAGACCGCATCGTCTTGGTTGGAACCAAACCAGTAGCTCAA
GAAGCTAAAAAACCACAAGTGTCAGAAAAAGCAGATACAAAACCAATTGATTCAAGTGAAGCTAGTCAAA
CTAATAAAGCCCAGTTACCAAATACAGGTAGTGCGGCAGGCCAAGCAGCAGTAGCAGCAGGTTTAGCTCT
TCTAGGTTTGAGTGCAGGATTAGTAGTTACTAAAGGTAAAAAAGAAGACTAG
sybil web site: sybil.sourceforge.net e-mail: driley@som.umaryland.edu