sybil: strepneumo: protein SP_0648 [db=strepneumo_v15]
PROTEIN PROPERTIES properties of SP_0648
property value
organism
Streptococcus pneumoniae TIGR4
product name beta-galactosidase
sequence length 2233 aa
created 2010-09-03 11:12:00
last modified 2010-09-03 11:12:00
DATABASE REFERENCES database refs for SP_0648
database accession version
PROTEIN CLUSTERS clusters of which SP_0648 is a member
cluster program algorithm analysis description
strepneumo_v15.match.2047595087.1 j_ortholog_clusters j_ortholog_clusters Ortholog Clusters
GENOMIC CONTEXT genomic context of the gene show_protein_clusters
sequence location strand
pneumoniae TIGR4 (2160842bp) (2.16 Mb) 615441-622143 +
additional feature types: pmark spacer tRNA repeatmasker simple repeats homopolymeric tracts tandem repeats detected by phobos BoxA Repeat BoxB Repeat BoxC Repeat RUP Repeat Genomic Island (from IslandPath) Signal Peptide (Neural Network) Signal Peptide (HMM) B-cell epitopes from BepiPred antigenic regions from EMBOSS antigenic Lipoprotein Attachment Site LPxTG motif Membrane surface protein motif Bacteriocin protein motif Fibronectin binding protein motif Transmembrane Regions (tmhmm)
display at most: 5 kb 10 kb 15 kb 25 kb 50 kb on either side of SP_0648:
BLASTP HITS proteins with significant BLASTP matches
show top: 5 10 20 50 BLASTP matches
highlighted proteins: none strepneumo_v15.match.2047595087.1
SEQUENCE the amino acid sequence in FASTA format
>SP_0648 polypeptide
MGKGHWNRKRVYSIRKFAVGACSVMIGTCAVLLGGNIAGESVVYADETLITHTAEKPKEEKMIVEEKADK
ALETKNIVERTEQSEPSSTEAIASEKKEDEAVTPKEEKVSAKPEEKAPRIESQASNQEKPLKEDAKAVTN
EEVNQMIEDRKVDFNQNWYFKLNANSKEAIKPDADVSTWKKLDLPYDWSIFNDFDHESPAQNEGGQLNGG
EAWYRKTFKLDEKDLKKNVRLTFDGVYMDSQVYVNGQLVGHYPNGYNQFSYDITKYLQKDGRENVIAVHA
VNKQPSSRWYSGSGIYRDVTLQVTDKVHVEKNGTTILTPKLEEQQHGKVETHVTSKIVNTDDKDHELVAE
YQIVERGGHAVTGLVRTASRTLKAHESTSLDAILEVERPKLWTVLNDKPALYELITRVYRDGQLVDAKKD
LFGYRYYHWTPNEGFSLNGERIKFHGVSLHHDHGALGAEENYKAEYRRLKQMKEMGVNSIRTTHNPASEQ
TLQIAAELGLLVQEEAFDTWYGGKKPYDYGRFFEKDATHPEARKGEKWSDFDLRTMVERGKNNPAIFMWS
IGNEIGEANGDAHSLATVKRLVKVIKDVDKTRYVTMGADKFRFGNGSGGHEKIADELDAVGFNYSEDNYK
ALRAKHPKWLIYGSETSSATRTRGSYYRPERELKHSNGPERNYEQSDYGNDRVGWGKTATASWTFDRDNA
GYAGQFIWTGTDYIGEPTPWHNQNQTPVKSSYFGIVDTAGIPKHDFYLYQSQWVSVKKKPMVHLLPHWNW
ENKELASKVADSEGKIPVRAYSNASSVELFLNGKSLGLKTFNKKQTSDGRTYQEGANANELYLEWKVAYQ
PGTLEAIARDESGKEIARDKITTAGKPAAVRLIKEDHAIAADGKDLTYIYYEIVDSQGNVVPTANNLVRF
QLHGQGQLVGVDNGEQASRERYKAQADGSWIRKAFNGKGVAIVKSTEQAGKFTLTAHSDLLKSNQVTVFT
GKKEGQEKTVLGTEVPKVQTIIGEAPEMPTTVPFVYSDGSRAERPVTWSSVDVSKPGIVTVKGMADGREV
EARVEVIALKSELPVVKRIAPNTDLNSVDKSVSYVLIDGSVEEYEVDKWEIAEEDKAKLAIPGSRIQATG
YLEGQPIHATLVVEEGNPAAPAVPTVTVGGEAVTGLTSQKPMQYRTLAYGAKLPEVTASAKNAAVTVLQA
SAANGMRASIFIQPKDGGPLQTYAIQFLEEAPKIAHLSLQVEKADSLKEDQTVKLSVRAHYQDGTQAVLP
ADKVTFSTSGEGEVAIRKGMLELHKPGAVTLNAEYEGAKDQVELTIQANTEKKIAQSIRPVNVVTDLHQE
PSLPATVTVEYDKGFPKTHKVTWQAIPKEKLDSYQTFEVLGKVEGIDLEARAKVSVEGIVSVEEVSVTTP
IAEAPQLPESVRTYDSNGHVSSAKVAWDAIRPEQYAKEGVFTVNGRLEGTQLTTKLHVRVSAQTEQGANI
SDQWTGSELPLAFASDSNPSDPVSNVNDKLISYNNQPANRWTNWNRTNPEASVGVLFGDSGILSKRSVDN
LSVGFHEDHGVGVPKSYVIEYYVGKTVPTAPKNPSFVGNEDHVFNDSANWKPVTNLKAPAQLKAGEMNHF
SFDKVETYAVRIRMVKADNKRGTSITEVQIFAKQVAAAKQGQTRIQVDGKDLANFNPDLTDYYLESVDGK
VPAVTASVSNNGLATVVPSVREGEPVRVIAKAENGDILGEYRLHFTKDKSLLSHKPVAAVKQARLLQVGQ
ALELPTKVPVYFTGKDGYETKDLTVEWEEVPAENLTKAGQFTVRGRVLGSNLVAEITVRVTDKLGETLSD
NPNYDENSNQAFASATNDIDKNSHDRVDYLNDGDHSENRRWTNWSPTPSSNPEVSAGVIFRENGKIVERT
VTQGKVQFFADSGTDAPSKLVLERYVGPEFEVPTYYSNYQAYDADHPFNNPENWEAVPYRADKDIAAGDE
INVTFKAIKAKAMRWRMERKADKSGVAMIEMTFLAPSELPQESTQSKILVDGKELADFAENRQDYQITYK
GQRPKVSVEENNQVASTVVDSGEDSFPVLVRLVSESGKQVKEYRIHLTKEKPVSEKTVAAVQEDLPKIEF
VEKDLAYKTVEKKDSTLYLGETRVEQEGKVGKERIFTAINPDGSKEEKLREVVEVPTDRIVLVGTKPVAQ
EAKKPQVSEKADTKPIDSSEASQTNKAQLPSTGSAASQAAVAAGLTLLGLSAGLVVTKGKKED
>SP_0648 nucleotide
ATGGGGAAAGGCCATTGGAATCGGAAAAGAGTTTATAGCATTCGTAAGTTTGCTGTGGGAGCTTGCTCAG
TAATGATTGGGACTTGTGCAGTTTTATTAGGAGGAAATATAGCTGGAGAATCTGTAGTTTATGCGGATGA
AACACTTATTACTCATACTGCTGAGAAACCTAAAGAGGAAAAAATGATAGTAGAAGAAAAGGCTGATAAA
GCTTTGGAAACTAAAAATATAGTTGAAAGGACAGAACAAAGTGAACCTAGTTCAACTGAGGCTATTGCAT
CTGAGAAGAAAGAAGATGAAGCCGTAACTCCAAAAGAGGAAAAAGTGTCTGCTAAACCGGAAGAAAAAGC
TCCAAGGATAGAATCACAAGCTTCAAATCAAGAAAAACCGCTCAAGGAAGATGCTAAAGCTGTAACAAAT
GAAGAAGTGAATCAAATGATTGAAGACAGGAAAGTGGATTTTAATCAAAATTGGTACTTTAAACTCAATG
CAAATTCTAAGGAAGCCATTAAACCTGATGCAGACGTATCTACGTGGAAAAAATTAGATTTACCGTATGA
CTGGAGTATCTTTAACGATTTCGATCATGAATCTCCTGCACAAAATGAAGGTGGACAGCTCAACGGTGGG
GAAGCTTGGTATCGCAAGACTTTCAAACTAGATGAAAAAGACCTCAAGAAAAATGTTCGCCTTACTTTTG
ATGGCGTCTACATGGATTCTCAAGTTTATGTCAATGGTCAGTTAGTGGGGCATTATCCAAATGGTTATAA
CCAGTTCTCATATGATATCACCAAATACCTTCAAAAAGATGGTCGTGAGAATGTGATTGCTGTCCATGCA
GTCAACAAACAGCCAAGTAGCCGTTGGTATTCAGGAAGTGGTATCTATCGTGATGTGACTTTACAAGTGA
CAGATAAGGTGCATGTTGAGAAAAATGGGACAACTATTTTAACACCAAAACTTGAAGAACAACAACATGG
CAAGGTTGAAACTCATGTGACCAGCAAAATCGTCAATACGGACGACAAAGACCATGAACTTGTAGCCGAA
TATCAAATCGTTGAACGAGGTGGTCATGCTGTAACAGGCTTAGTTCGTACAGCGAGTCGTACCTTAAAAG
CACATGAATCAACAAGCCTAGATGCGATTTTAGAAGTTGAAAGACCAAAACTCTGGACTGTTTTAAATGA
CAAACCTGCCTTGTACGAATTGATTACGCGTGTTTACCGTGACGGTCAATTGGTTGATGCTAAGAAGGAT
TTGTTTGGTTACCGTTACTATCACTGGACTCCAAATGAAGGTTTCTCTTTGAATGGTGAACGTATTAAAT
TCCATGGAGTATCCTTGCACCACGACCATGGGGCGCTTGGAGCAGAAGAAAACTATAAAGCAGAATATCG
CCGTCTCAAACAAATGAAGGAGATGGGAGTTAACTCCATCCGTACAACCCACAACCCTGCTAGTGAGCAA
ACCTTGCAAATCGCAGCAGAACTAGGTTTACTCGTTCAGGAAGAGGCCTTTGATACGTGGTATGGTGGCA
AGAAACCTTATGACTATGGACGTTTCTTTGAAAAAGATGCCACTCACCCAGAAGCTCGAAAAGGTGAAAA
ATGGTCTGATTTTGACCTACGTACCATGGTCGAAAGAGGCAAAAACAACCCTGCTATCTTCATGTGGTCA
ATTGGTAATGAAATAGGTGAAGCTAATGGTGATGCCCACTCTTTAGCAACTGTTAAACGTTTGGTTAAGG
TTATCAAGGATGTTGATAAGACTCGCTATGTTACCATGGGAGCAGATAAATTCCGTTTCGGTAATGGTAG
CGGAGGGCATGAGAAAATTGCTGATGAACTCGATGCTGTTGGATTTAACTATTCTGAAGATAATTACAAA
GCCCTTAGAGCTAAGCATCCAAAATGGTTGATTTATGGATCAGAAACATCTTCAGCTACCCGTACACGTG
GAAGTTACTATCGCCCTGAACGTGAATTGAAACATAGCAATGGACCTGAGCGTAATTATGAACAGTCAGA
TTATGGAAATGATCGTGTGGGTTGGGGGAAAACAGCAACCGCTTCATGGACTTTTGACCGTGACAACGCT
GGCTATGCTGGACAGTTTATCTGGACAGGTACGGACTATATTGGTGAACCTACACCATGGCACAACCAAA
ATCAAACTCCTGTTAAGAGCTCTTACTTTGGTATCGTAGATACAGCCGGCATTCCAAAACATGACTTCTA
TCTCTACCAAAGCCAATGGGTTTCTGTTAAGAAGAAACCGATGGTACACCTTCTTCCTCACTGGAACTGG
GAAAACAAAGAATTAGCATCCAAAGTAGCTGACTCAGAAGGTAAGATTCCAGTTCGTGCTTATTCGAATG
CTTCTAGTGTAGAATTGTTCTTGAATGGAAAATCTCTTGGTCTTAAGACTTTCAATAAAAAACAAACCAG
CGATGGGCGGACTTACCAAGAAGGTGCAAATGCTAATGAACTTTATCTTGAATGGAAAGTTGCCTATCAA
CCAGGTACCTTGGAAGCAATTGCTCGTGATGAATCTGGCAAGGAAATTGCTCGAGATAAGATTACGACTG
CTGGTAAGCCAGCGGCAGTTCGTCTTATTAAGGAAGACCATGCGATTGCAGCAGATGGAAAAGACTTGAC
TTACATCTACTATGAAATTGTTGACAGCCAGGGGAATGTGGTTCCAACTGCTAATAATCTGGTTCGCTTC
CAATTGCATGGCCAAGGTCAACTGGTCGGTGTAGATAACGGAGAACAAGCCAGCCGTGAACGCTATAAGG
CGCAAGCAGATGGTTCTTGGATTCGTAAAGCATTTAATGGTAAAGGTGTTGCCATTGTCAAATCAACTGA
ACAAGCAGGGAAATTCACCCTGACTGCCCACTCTGATCTCTTGAAATCGAACCAAGTCACTGTCTTTACT
GGTAAGAAAGAAGGACAAGAGAAGACTGTTTTGGGGACAGAAGTGCCAAAAGTACAGACCATTATTGGAG
AGGCACCTGAAATGCCTACCACTGTTCCGTTTGTATACAGTGATGGTAGCCGTGCAGAACGTCCTGTAAC
CTGGTCTTCAGTAGATGTGAGCAAGCCTGGTATTGTAACGGTGAAAGGTATGGCTGACGGACGAGAAGTA
GAAGCTCGTGTAGAAGTGATTGCTCTTAAATCAGAGCTACCAGTTGTGAAACGTATTGCTCCAAATACTG
ACTTGAATTCTGTAGACAAATCTGTTTCCTATGTTTTGATTGATGGAAGTGTTGAAGAGTATGAAGTGGA
CAAGTGGGAGATTGCCGAAGAAGATAAAGCTAAGTTAGCAATTCCAGGTTCTCGTATTCAAGCGACCGGT
TATTTAGAAGGTCAACCAATTCATGCAACCCTTGTGGTAGAAGAAGGCAATCCTGCGGCACCTGCAGTAC
CAACTGTAACGGTTGGTGGTGAGGCAGTAACAGGTCTTACTAGTCAAAAACCAATGCAATACCGCACTCT
TGCTTATGGAGCTAAGTTGCCAGAAGTCACAGCAAGTGCTAAAAATGCAGCTGTTACAGTTCTTCAAGCA
AGCGCAGCAAACGGCATGCGTGCGAGCATCTTTATTCAGCCTAAAGATGGTGGCCCTCTTCAAACCTATG
CAATTCAATTCCTTGAAGAAGCGCCAAAAATTGCTCACTTGAGCTTGCAAGTGGAAAAAGCTGACAGTCT
CAAAGAAGACCAAACTGTCAAATTGTCGGTTCGAGCTCACTATCAAGATGGAACGCAAGCTGTATTACCA
GCTGATAAAGTAACCTTCTCTACAAGTGGTGAAGGGGAAGTCGCAATTCGTAAAGGAATGCTTGAGTTGC
ATAAGCCAGGAGCAGTCACTCTGAACGCTGAATATGAGGGAGCTAAAGACCAAGTTGAACTCACTATCCA
AGCCAATACTGAGAAGAAGATTGCGCAATCCATCCGTCCTGTAAATGTAGTGACAGATTTGCATCAGGAA
CCAAGTCTTCCAGCAACAGTAACAGTTGAGTATGACAAAGGTTTCCCTAAAACTCATAAAGTCACTTGGC
AAGCTATTCCGAAAGAAAAACTAGACTCCTATCAAACATTTGAAGTACTAGGTAAAGTTGAAGGAATTGA
CCTTGAAGCGCGTGCAAAAGTCTCTGTAGAAGGTATCGTTTCAGTTGAAGAAGTCAGTGTGACAACTCCA
ATCGCAGAAGCACCACAATTACCAGAAAGTGTTCGGACATATGATTCAAATGGTCACGTTTCATCAGCTA
AGGTTGCATGGGATGCGATTCGTCCAGAGCAATACGCTAAGGAAGGTGTCTTTACAGTTAATGGTCGCTT
AGAAGGTACGCAATTAACAACTAAACTTCATGTTCGCGTATCTGCTCAAACTGAGCAAGGTGCAAACATT
TCTGACCAATGGACCGGTTCAGAATTGCCACTTGCCTTTGCTTCAGACTCAAATCCAAGCGACCCAGTTT
CAAATGTTAATGACAAGCTCATTTCCTACAATAACCAACCAGCCAATCGTTGGACAAACTGGAATCGTAC
TAATCCAGAAGCTTCAGTCGGTGTTCTGTTTGGAGATTCAGGTATCTTGAGCAAACGCTCCGTTGATAAT
CTAAGTGTCGGATTCCATGAAGACCATGGAGTTGGTGTACCGAAGTCTTATGTGATTGAGTATTATGTTG
GTAAGACTGTCCCAACAGCTCCTAAAAACCCTAGTTTTGTTGGTAATGAGGACCATGTCTTTAATGATTC
TGCCAACTGGAAACCAGTTACTAATCTAAAAGCCCCTGCTCAACTCAAGGCTGGAGAAATGAACCACTTT
AGCTTTGATAAAGTTGAAACCTATGCTGTTCGTATTCGCATGGTTAAAGCAGATAACAAGCGTGGAACGT
CTATCACAGAGGTACAAATCTTTGCGAAACAAGTTGCGGCAGCCAAGCAAGGACAAACAAGAATCCAAGT
TGACGGCAAAGACTTAGCAAACTTCAACCCTGATTTGACAGACTACTACCTTGAGTCTGTAGATGGAAAA
GTTCCGGCAGTCACAGCAAGTGTTAGCAACAATGGTCTCGCTACCGTCGTTCCAAGCGTTCGTGAAGGTG
AGCCAGTTCGTGTCATCGCGAAAGCTGAAAATGGCGACATCTTAGGAGAATACCGTCTGCACTTCACTAA
GGATAAGAGCTTACTTTCTCATAAACCAGTTGCTGCGGTTAAACAAGCTCGCTTGCTACAAGTAGGTCAA
GCACTTGAATTGCCGACTAAGGTTCCAGTTTACTTCACAGGTAAAGACGGCTACGAAACAAAAGACCTGA
CAGTTGAATGGGAAGAAGTTCCAGCGGAAAATCTGACAAAAGCAGGTCAATTTACTGTTCGAGGCCGTGT
CCTTGGTAGTAACCTTGTTGCTGAGATCACTGTACGAGTGACAGACAAACTTGGTGAGACTCTTTCAGAT
AACCCTAACTATGATGAAAACAGTAACCAGGCCTTTGCTTCAGCAACCAATGATATTGACAAAAACTCTC
ATGACCGCGTTGACTATCTCAATGACGGAGATCATTCAGAAAATCGTCGTTGGACAAACTGGTCACCAAC
ACCATCTTCTAATCCAGAAGTATCAGCGGGTGTGATTTTCCGTGAAAATGGTAAGATTGTAGAACGGACT
GTTACACAAGGAAAAGTTCAGTTCTTTGCAGATAGTGGTACGGATGCACCATCTAAACTCGTTTTAGAAC
GCTATGTCGGTCCAGAGTTTGAAGTGCCAACCTACTATTCAAACTACCAAGCCTACGACGCAGACCATCC
ATTCAACAATCCAGAAAATTGGGAAGCTGTTCCTTATCGTGCGGATAAAGACATTGCAGCTGGTGATGAA
ATCAACGTAACATTTAAAGCTATCAAAGCCAAAGCTATGAGATGGCGTATGGAGCGTAAAGCAGATAAGA
GCGGTGTTGCGATGATTGAGATGACCTTCCTTGCACCAAGTGAATTGCCTCAAGAAAGCACTCAATCAAA
GATTCTTGTAGATGGAAAAGAACTTGCTGATTTCGCTGAAAATCGTCAAGACTATCAAATTACCTATAAA
GGTCAACGGCCAAAAGTCTCAGTTGAAGAAAACAATCAAGTAGCTTCAACTGTGGTAGATAGTGGAGAAG
ATAGCTTTCCAGTACTTGTTCGCCTCGTTTCAGAAAGTGGAAAACAAGTCAAGGAATACCGTATCCACTT
GACTAAGGAAAAACCAGTTTCTGAGAAGACAGTTGCTGCTGTACAAGAAGATCTTCCAAAAATCGAATTT
GTTGAAAAAGATTTGGCATACAAGACAGTTGAGAAAAAAGATTCAACACTGTATCTAGGTGAAACTCGTG
TAGAACAAGAAGGAAAAGTTGGAAAAGAACGTATCTTTACAGCGATTAATCCTGATGGAAGTAAGGAAGA
AAAACTCCGTGAAGTGGTAGAAGTTCCGACAGACCGCATCGTCTTGGTTGGAACCAAACCAGTAGCTCAA
GAAGCTAAAAAACCACAAGTGTCAGAAAAAGCAGATACAAAACCAATTGATTCAAGTGAAGCTAGTCAAA
CTAATAAAGCCCAGTTACCAAGTACAGGTAGTGCGGCAAGCCAAGCAGCAGTAGCAGCAGGTTTAACTCT
TCTAGGTTTGAGTGCAGGATTAGTAGTTACTAAAGGTAAAAAAGAAGACTAG
sybil web site: sybil.sourceforge.net e-mail: driley@som.umaryland.edu