sybil: strepneumo: protein SPP_0665 [db=strepneumo_v15]
PROTEIN PROPERTIES properties of SPP_0665
property value
organism
Streptococcus pneumoniae P1031
product name beta-galactosidase
sequence length 2233 aa
created 2010-09-03 11:12:00
last modified 2010-09-03 11:12:00
DATABASE REFERENCES database refs for SPP_0665
database accession version
PROTEIN CLUSTERS clusters of which SPP_0665 is a member
cluster program algorithm analysis description
strepneumo_v15.match.2047595087.1 j_ortholog_clusters j_ortholog_clusters Ortholog Clusters
GENOMIC CONTEXT genomic context of the gene show_protein_clusters
sequence location strand
pneumoniae P1031 (2111882bp) (2.11 Mb) 604720-611422 +
additional feature types: pmark spacer tRNA repeatmasker simple repeats homopolymeric tracts tandem repeats detected by phobos BoxA Repeat BoxB Repeat BoxC Repeat RUP Repeat Genomic Island (from IslandPath) Signal Peptide (Neural Network) Signal Peptide (HMM) B-cell epitopes from BepiPred antigenic regions from EMBOSS antigenic Lipoprotein Attachment Site LPxTG motif Membrane surface protein motif Bacteriocin protein motif Fibronectin binding protein motif Transmembrane Regions (tmhmm)
display at most: 5 kb 10 kb 15 kb 25 kb 50 kb on either side of SPP_0665:
BLASTP HITS proteins with significant BLASTP matches
show top: 5 10 20 50 BLASTP matches
highlighted proteins: none strepneumo_v15.match.2047595087.1
SEQUENCE the amino acid sequence in FASTA format
>SPP_0665 polypeptide
MGKGHWNRKRVYSIRKFAVGACSVMIGTCAVLLGGNIAGESVVYADETLITHTAEKPKEEKMIVEEKADK
ALETKNVVERTEQSEPSSTEAIASEKKEDEAVTPKEEKVSAKLEEKAPRIESQASSQEKPLKEDAKAVTN
EEMNQMIEDRKVDFNQNWYFKLNANSKEAIKPDVDVSTWKKLDLPYDWSIFNDFDHESPAQNEGGQLNGG
EAWYRKTFKLDEKDLKKNVRLTFDGVYMDSQVYVNGQLVGHYPNGYNQFSYDITKYLHKDGRENVIAVHA
VNKQPSSRWYSGSGIYRDVTLQVTDKVHVEKNGTTILTPKLEEQQHGKVETHVTSKIANTDDKDHELVAE
YQIVERGGHAVTGLVRTASRTLKAHESTSLDAILEVERPKLWTVLNDKPALYELITRVYRDGQLVDAKKD
LFGYRYYHWTPNEGFSLNGERIKFHGVSLHHDHGALGAEENYKAEYRRLKQMKEMGVNSIRTTHNPASEQ
TLQIAAELGLLVQEEAFDTWYGGKKPYDYGRFFEKDATHPEARKGEKWSDFDLRTMVERGKNNPAIFMWS
IGNEIGEANGDAHSLVTVKRLVKVIKDVDKTRYVTMGADKFRFGNGSGGHEKIADELDAVGFNYSEDNYK
ALRAKHPKWLIYGSETSSATRTRGSYYRPERELKHSNGPERNYEQSDYGNDRVGWGKTATASWTFDRDNA
GYAGQFIWTGTDYIGEPTPWHNQNQTPVKSSYFGIVDTAGIPKHDFYLYQSQWVSVKKKPMVHLLPHWNW
ENKELASKVADSEGKIPVRAYSNASSVELFLNGKSLGLKTFNKKQTSDGRTYQEGANANELYLEWKVAYQ
PGTLEAIARDESGKEIARDKITTAGKPAAVRLIKEDHAIAADGKDLTYIYYEIVDSQGNVVPTANNLVRF
QLHGQGQLVGVDNGEQASRERYKAQADGSWIRKAFNGKGVAIVKSTEQAGKFTLTAHSDLLKSNQVTVFT
GKKEGQEKTVLVTEVPKVQTIIGEAPEMPTTVPFVYSDGSRAERPVTWSSVDVSKPGIVTVKGMADGREV
EARVEVIALKSELPVVKRIAPNTDLNSVDKSVSYVLTDGSVQEYEVDKWEIAEEDKAKLAIPGSRIQATG
YLEGQPIHATLVVEEGNPAAPAVPTVTVGGEAVTGLTSQKPMQYRTLAYGAKLPEVTASAKNAAVTVLQA
SAANGMRASIFIQPKDGGPLQTYAIQFLEEAPKIAHLSLQVEKADSLKEDQTVKLSVRAHYQDGTQAVLP
ADKVTFSTSGEGEVAIRKGMLELHKPGAVTLKAEYEGAKGQVDLTIQANTEKKIAQSIRPVNVVTDLHQE
PTLPSTVTVEYDKGFPKAHKVTWQAIPKEKLDSYQTFEVLGKVEGIDLEARARVSVEGIVSVEEVSVTTP
IAEAPQLPESVRTYDSNGHVSSAKVAWDAIRIEQYAKEGVFTVNGRLEGTQLTTKLHVRVSAQTEQGANI
SDQWTGSELPLAFASDSNPSDPVSNVNDKLISYNNQPANRWTNWNRSNPEASVGVLFGDSGILSKRSVDN
LSVGFHEDHGVGAPKSYVIEYYVGKTVPTAPKNPSFVGNEDHVFNDSANWKPVTNLKAPAQLKAGEMNHF
SFDKVETYAVRIRMVKADNKRGTSITEVQIFAKQVAAAKQGQTRIQVDGKDLANFNPDLTDYYLESVDGK
VTAVTASVSNNGLATVVPSVREGEPVRVIAKAENGDILGEYRLHFTKDKSLLSHKPVAAVKQARLLQVGQ
ALELPTKVPVYFTGKDGYETKDLTVEWEEVPAENLTKAGQFTVRGRVLGSNLVAEITVRVTDKLGETLSD
NPNYDENSNQAFASATNDIDKNSHDRVDYLNDGDHSENRRWTNWSPTPSSNPEVSAGVIFRENGKIVERT
VAQGKVQFFADSGTDAPSKLVLERYVGPEFEVPTYYSNYQAYDADHPFNNPENWEAVPYRADKDIAAGDE
INVTFKDIKAKAMRWRMERKADKSGVAMIEMTFLAPSELPQESTQSKILVDGKELADFAENRQDYQITYK
GQRPKVSVEENNQVASTVVDSGEDSLPVLVRLVSESGKQVKEYRIHLTKEKPVSEKTVAAVQEDLPKLEF
VEKDLAYKTVEKKDSTLYLGETRVEQEGKTGKERIFTAINPDGSKEEKLREVVEAPTDRIVLVGTKPVAQ
EAKKPQVSEKADTKPIDSSEASQTNKAQLPNTGSASSQAAVAAGLALLGLSAGLVVTKGKKED
>SPP_0665 nucleotide
ATGGGGAAAGGCCATTGGAATCGGAAAAGAGTTTATAGCATTCGTAAGTTTGCTGTGGGAGCTTGCTCAG
TAATGATTGGGACTTGTGCAGTTCTATTAGGAGGAAATATAGCTGGAGAATCTGTAGTTTATGCGGATGA
AACACTTATTACTCATACTGCTGAGAAACCTAAAGAGGAAAAAATGATAGTAGAAGAAAAGGCTGATAAA
GCTTTGGAAACTAAAAATGTAGTTGAAAGGACAGAACAAAGTGAACCTAGTTCAACTGAGGCTATTGCAT
CTGAGAAGAAAGAAGATGAAGCCGTAACTCCAAAAGAGGAAAAAGTGTCTGCTAAACTGGAAGAAAAAGC
TCCAAGGATAGAATCACAAGCTTCAAGTCAAGAAAAACCGCTCAAGGAAGATGCTAAAGCTGTAACAAAT
GAAGAAATGAATCAAATGATTGAAGACAGGAAAGTGGATTTTAATCAAAATTGGTACTTTAAACTCAATG
CAAATTCTAAGGAAGCCATTAAACCTGATGTAGACGTATCTACGTGGAAAAAATTAGATTTACCGTATGA
CTGGAGTATCTTTAACGATTTCGATCATGAATCTCCTGCACAAAATGAAGGTGGACAGCTCAACGGTGGG
GAAGCTTGGTATCGCAAGACTTTCAAACTAGATGAAAAAGACCTCAAGAAAAATGTTCGCCTTACTTTTG
ATGGCGTCTACATGGATTCTCAAGTTTATGTCAATGGTCAGTTAGTGGGGCATTATCCAAATGGTTATAA
CCAGTTCTCATATGATATCACCAAATACCTTCACAAAGATGGTCGTGAGAATGTGATTGCTGTCCATGCA
GTCAACAAACAGCCAAGTAGCCGTTGGTATTCAGGAAGTGGTATCTATCGTGATGTGACTTTACAAGTGA
CAGATAAGGTGCATGTTGAGAAAAATGGGACAACTATTTTAACACCAAAACTTGAAGAACAACAACATGG
CAAGGTTGAAACTCATGTGACCAGCAAAATCGCCAATACGGACGACAAAGACCATGAACTTGTAGCCGAA
TATCAAATCGTTGAACGAGGTGGTCATGCTGTAACAGGCTTAGTTCGTACAGCGAGTCGTACCTTAAAAG
CACATGAATCAACAAGCCTAGATGCGATTTTAGAAGTTGAAAGACCAAAACTCTGGACCGTTTTAAATGA
CAAACCTGCCTTGTACGAATTGATTACGCGTGTTTACCGTGACGGTCAATTGGTTGATGCTAAGAAGGAT
TTGTTTGGTTACCGTTACTATCACTGGACTCCAAATGAAGGTTTCTCTTTGAATGGTGAACGTATTAAAT
TCCATGGAGTATCCTTGCACCACGACCATGGGGCGCTTGGAGCAGAAGAAAACTATAAAGCAGAATATCG
CCGTCTCAAACAAATGAAGGAGATGGGAGTTAACTCCATCCGTACAACCCACAACCCTGCTAGTGAGCAA
ACCTTGCAAATCGCAGCAGAACTAGGTTTACTCGTTCAGGAAGAGGCTTTTGATACTTGGTATGGTGGCA
AGAAACCTTATGACTATGGACGTTTCTTTGAAAAAGATGCCACTCACCCAGAAGCTCGAAAAGGTGAAAA
ATGGTCTGATTTTGACCTACGTACCATGGTCGAAAGAGGCAAAAACAACCCTGCTATCTTCATGTGGTCA
ATTGGTAATGAAATAGGTGAAGCTAATGGTGATGCCCACTCTTTAGTAACTGTTAAACGTTTGGTCAAGG
TTATCAAGGATGTTGATAAGACTCGCTATGTTACCATGGGAGCAGATAAATTCCGTTTCGGTAATGGTAG
CGGAGGGCATGAGAAAATTGCTGATGAACTCGATGCTGTTGGATTTAACTATTCTGAAGATAATTACAAA
GCCCTTAGAGCTAAGCATCCAAAATGGTTGATTTATGGATCAGAAACATCTTCAGCTACCCGTACACGTG
GAAGTTACTATCGCCCTGAACGTGAATTGAAACATAGCAATGGACCTGAGCGTAATTATGAACAGTCTGA
CTATGGGAATGATCGTGTGGGTTGGGGGAAAACAGCAACCGCTTCATGGACTTTTGACCGTGACAACGCT
GGCTATGCTGGACAGTTTATCTGGACAGGTACGGACTATATTGGTGAACCTACACCATGGCACAACCAAA
ATCAAACTCCTGTTAAGAGCTCTTACTTTGGTATCGTAGATACAGCCGGCATTCCAAAACATGACTTCTA
TCTCTACCAAAGCCAATGGGTTTCTGTTAAGAAGAAACCGATGGTACACCTTCTTCCTCACTGGAACTGG
GAAAACAAAGAATTAGCATCCAAAGTAGCTGACTCAGAAGGTAAGATTCCAGTTCGTGCTTATTCGAATG
CTTCTAGTGTAGAATTGTTCTTGAATGGAAAATCTCTTGGTCTTAAGACTTTCAATAAAAAACAAACCAG
CGATGGGCGGACTTACCAAGAAGGTGCAAATGCTAATGAACTTTATCTTGAATGGAAAGTTGCCTATCAA
CCAGGTACCTTGGAAGCAATTGCTCGTGATGAATCTGGCAAGGAAATTGCTCGAGATAAGATTACGACTG
CTGGTAAGCCAGCGGCAGTTCGTCTTATTAAGGAAGACCATGCGATTGCAGCAGATGGAAAAGACTTGAC
TTACATCTACTATGAAATTGTTGACAGCCAGGGGAATGTGGTTCCAACTGCTAATAATCTGGTTCGCTTC
CAATTGCATGGTCAAGGTCAACTGGTCGGTGTAGATAACGGAGAACAAGCCAGCCGTGAACGCTATAAGG
CGCAAGCAGATGGTTCTTGGATTCGTAAAGCATTTAATGGTAAAGGTGTTGCCATTGTCAAATCAACTGA
ACAAGCAGGGAAATTCACCCTTACTGCCCACTCTGATCTCTTGAAATCGAACCAAGTCACTGTCTTTACT
GGTAAGAAAGAAGGACAAGAGAAGACTGTTTTGGTGACAGAAGTGCCAAAAGTACAGACCATTATTGGAG
AGGCACCTGAAATGCCTACCACTGTTCCGTTTGTATACAGTGATGGTAGCCGTGCAGAACGTCCTGTAAC
CTGGTCTTCAGTAGATGTGAGCAAGCCTGGTATTGTAACGGTGAAAGGTATGGCTGACGGACGAGAAGTA
GAAGCTCGTGTAGAAGTGATTGCTCTTAAATCAGAGCTACCAGTTGTGAAACGTATTGCTCCAAATACTG
ACTTGAATTCTGTAGACAAATCTGTTTCCTATGTTTTGACTGATGGAAGTGTACAAGAGTATGAAGTGGA
CAAGTGGGAGATTGCCGAAGAAGATAAAGCTAAGTTAGCAATTCCAGGTTCTCGTATTCAAGCGACCGGT
TATTTAGAAGGTCAACCAATTCATGCAACCCTTGTGGTAGAAGAAGGCAATCCTGCGGCACCTGCAGTAC
CAACTGTAACGGTTGGTGGTGAAGCTGTCACAGGTCTTACTAGTCAAAAACCAATGCAATACCGCACTCT
TGCTTATGGAGCTAAGTTGCCAGAAGTCACAGCAAGTGCTAAAAATGCAGCTGTTACAGTTCTTCAAGCA
AGCGCAGCAAACGGCATGCGTGCGAGCATCTTTATTCAGCCTAAAGATGGTGGCCCTCTTCAAACCTATG
CAATTCAATTCCTTGAAGAAGCGCCAAAAATTGCTCACTTGAGCTTGCAAGTGGAAAAAGCTGACAGTCT
CAAAGAAGACCAAACTGTCAAATTGTCGGTTCGAGCTCACTATCAAGATGGAACGCAAGCTGTATTACCA
GCTGATAAAGTAACCTTCTCTACAAGTGGTGAAGGGGAAGTCGCAATTCGTAAAGGAATGCTTGAGTTAC
ATAAGCCAGGAGCAGTCACTCTCAAAGCGGAATATGAGGGAGCTAAAGGCCAAGTTGATCTCACTATCCA
GGCCAATACTGAGAAGAAGATTGCGCAATCTATCCGTCCAGTAAATGTAGTGACAGATTTACACCAAGAA
CCTACTCTTCCGTCAACAGTAACGGTTGAGTATGACAAAGGTTTCCCTAAAGCTCACAAAGTCACTTGGC
AAGCTATTCCGAAAGAAAAACTAGACTCCTATCAAACCTTTGAAGTACTAGGTAAAGTTGAAGGAATTGA
CCTTGAAGCGCGTGCAAGAGTCTCTGTAGAAGGTATCGTTTCAGTTGAAGAAGTCAGTGTGACAACTCCA
ATCGCAGAAGCACCACAATTACCAGAAAGCGTTCGGACATATGATTCAAATGGTCACGTTTCATCAGCTA
AGGTTGCATGGGATGCGATTCGTATAGAGCAATACGCTAAGGAAGGTGTCTTTACAGTTAATGGTCGCTT
AGAAGGTACTCAATTAACAACTAAACTTCATGTTCGCGTATCTGCTCAAACTGAGCAAGGTGCAAATATT
TCTGACCAATGGACCGGTTCAGAATTGCCACTTGCCTTTGCTTCAGACTCAAATCCAAGCGACCCAGTTT
CAAATGTTAATGACAAGCTCATTTCCTACAATAACCAACCAGCCAATCGTTGGACAAACTGGAATCGTAG
TAATCCAGAAGCTTCAGTCGGTGTTCTGTTTGGAGATTCAGGTATCTTGAGCAAACGCTCCGTTGATAAT
CTAAGTGTCGGATTCCACGAAGACCATGGAGTTGGTGCACCGAAGTCTTATGTGATTGAGTATTATGTTG
GTAAGACTGTCCCAACAGCTCCTAAAAACCCTAGTTTTGTTGGTAATGAGGACCATGTCTTTAATGATTC
TGCCAACTGGAAACCAGTTACTAATCTAAAAGCCCCTGCTCAACTCAAGGCTGGAGAAATGAACCACTTT
AGCTTTGATAAAGTTGAAACCTATGCTGTTCGTATTCGCATGGTTAAAGCAGATAACAAGCGTGGAACGT
CTATCACAGAGGTACAAATCTTTGCGAAACAAGTTGCGGCAGCCAAACAAGGACAAACAAGAATCCAAGT
TGACGGCAAAGACTTAGCAAACTTCAACCCTGATTTGACAGACTACTACCTTGAGTCTGTAGATGGAAAA
GTTACGGCGGTCACAGCAAGTGTTAGCAACAATGGTCTCGCTACCGTCGTTCCAAGCGTTCGTGAAGGTG
AGCCAGTTCGTGTCATCGCGAAAGCTGAAAATGGCGACATCTTAGGAGAATACCGTCTGCACTTCACTAA
GGATAAGAGCTTACTTTCTCATAAACCAGTTGCTGCGGTTAAACAAGCTCGCTTGCTACAAGTAGGTCAA
GCACTTGAATTGCCGACTAAGGTTCCAGTTTACTTCACAGGTAAAGACGGCTACGAAACAAAAGACCTGA
CAGTTGAATGGGAAGAAGTTCCAGCGGAAAATCTGACAAAAGCAGGTCAATTTACTGTTCGAGGCCGTGT
CCTTGGTAGTAACCTTGTTGCTGAGATCACTGTACGAGTGACAGACAAACTTGGTGAGACTCTTTCAGAT
AACCCTAACTATGATGAAAACAGTAACCAGGCCTTTGCTTCAGCAACCAATGATATTGACAAAAACTCTC
ATGACCGCGTTGACTATCTCAATGACGGAGATCATTCAGAAAATCGTCGTTGGACAAACTGGTCACCAAC
ACCATCTTCTAATCCAGAAGTATCAGCGGGTGTGATTTTCCGTGAAAATGGTAAGATTGTAGAACGGACT
GTTGCACAAGGAAAAGTTCAGTTCTTTGCAGATAGTGGTACGGATGCACCATCTAAACTCGTTTTAGAAC
GCTATGTCGGTCCAGAGTTTGAAGTGCCAACCTACTATTCAAACTACCAAGCCTACGACGCAGACCATCC
ATTCAACAATCCAGAAAATTGGGAAGCTGTTCCTTATCGTGCGGATAAAGACATTGCAGCTGGTGATGAA
ATCAACGTAACATTTAAAGATATCAAAGCCAAAGCTATGAGATGGCGTATGGAGCGTAAAGCAGATAAGA
GCGGTGTTGCGATGATTGAGATGACCTTCCTTGCACCAAGTGAATTGCCTCAAGAAAGCACTCAATCAAA
GATTCTTGTAGATGGAAAAGAACTTGCTGATTTCGCTGAAAATCGTCAAGACTATCAAATTACCTATAAA
GGTCAACGGCCAAAAGTCTCAGTTGAAGAAAACAATCAAGTAGCTTCAACTGTGGTAGATAGTGGAGAAG
ATAGCCTTCCAGTACTTGTTCGCCTCGTTTCAGAAAGTGGAAAACAAGTCAAGGAATACCGTATCCACTT
GACTAAGGAAAAACCAGTTTCTGAGAAGACAGTTGCTGCTGTACAAGAAGATCTTCCAAAACTCGAATTT
GTTGAAAAAGATTTGGCCTACAAGACAGTTGAGAAAAAAGATTCAACACTGTATCTAGGTGAAACTCGTG
TAGAACAAGAAGGAAAAACTGGTAAAGAACGTATCTTTACAGCGATTAATCCTGATGGAAGTAAGGAAGA
AAAACTCCGTGAAGTGGTAGAAGCTCCGACAGACCGCATCGTCTTGGTTGGAACCAAACCAGTAGCTCAA
GAAGCTAAAAAACCACAAGTGTCAGAAAAAGCAGATACAAAACCAATTGATTCAAGTGAAGCTAGTCAAA
CTAATAAAGCCCAGTTACCAAATACAGGTAGTGCGTCAAGCCAAGCAGCAGTAGCAGCAGGTTTAGCTCT
TCTAGGTTTGAGTGCAGGATTAGTAGTTACTAAAGGTAAAAAAGAAGACTAG
sybil web site: sybil.sourceforge.net e-mail: driley@som.umaryland.edu