sybil: strepneumo: protein HMPREF0837_10938 [db=strepneumo_v15]
PROTEIN PROPERTIES properties of HMPREF0837_10938
property         value
organism         Streptococcus pneumoniae TCH8431/19A
product name     beta-galactosidase
sequence length  2209 aa
created          2010-09-03 11:12:00
last modified    2010-09-03 11:12:00
DATABASE REFERENCES database refs for HMPREF0837_10938
database accession version
PROTEIN CLUSTERS clusters of which HMPREF0837_10938 is a member
cluster program algorithm analysis description
strepneumo_v15.match.2047595087.1 j_ortholog_clusters j_ortholog_clusters Ortholog Clusters
GENOMIC CONTEXT genomic context of the gene
sequence location strand
pneumoniae TCH8431/19A (2088772bp) (2.09 Mb) 859623-866253 +
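The location and the sequence length reported in this record can be cross-checked with simple arithmetic. The sketch below assumes the location 859623-866253 is given in interbase (0-based, half-open) coordinates; the page does not state its convention, so both readings are computed. The function and variable names are illustrative only.

```python
# Values copied from the record above.
start, end = 859623, 866253
protein_length_aa = 2209

def span_interbase(start: int, end: int) -> int:
    """Span length if coordinates are interbase (0-based, half-open). Assumed convention."""
    return end - start

def span_one_based_inclusive(start: int, end: int) -> int:
    """Span length if coordinates are 1-based and inclusive."""
    return end - start + 1

# A coding region for N residues needs 3 * N nucleotides plus one stop codon.
expected_cds_nt = 3 * (protein_length_aa + 1)

print(span_interbase(start, end), span_one_based_inclusive(start, end), expected_cds_nt)
```

Under the interbase reading the span equals the expected coding length (2209 codons plus a stop codon); under the 1-based inclusive reading it is one base longer.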
BLASTP HITS proteins with significant BLASTP matches
SEQUENCE the amino acid sequence in FASTA format
>HMPREF0837_10938 polypeptide
MIGTCAVLLGGNIAGESVVYADETLITHTAEKPKEEKMIVEEKADKALETKNVVERTEQSEPSSTEAIAS
EKKEDEAVTPKEEKVSAKPEEKAPRIESQASSQEKPLKEDAKAVTNEEMNQMIEDRKVDFNQNWHFKLNA
NSKEAIKPDADVSTWKKLDLPYDWSIFNNFDHESPAQNEGGQLNGGEAWYRKTFKLDEKDLKKNVRLTFD
GVYMDSQVYVNGQLVGHYPNGYNQFSYDITKYLHKDGRENVIAVHAVNKQPSSRWYSGSGIYRDVTLQVT
DKVHVEKNGTTILTPKLEEQQHGKVETHVTSKIVNTDDKDHELVAEYQIVERGGHAVTGLVRTASRTLKA
HESTSLDAILEVERPKLWTVLNDKPALYELITRVYRDGQLVDAKKDLFGYRYYHWTPNEGFSLNGERIKF
HGVSLHHDHGALGAEENYKAEYRRLKQMKEMGVNSIRTTHNPASEQTLQIAAELGLLVQEEAFDTWYGGK
KPYDYGRFFEKDATHPEARKGEKWSDFDLRTMVERGKNNPAIFMWSIGNEIGEANGDAHSLATVKRLVKV
IKDVDKTRYVTMGADKFRFGNGSGGHEKIADELDAVGFNYSEDNYKALRAKHPKWLIYGSETSSATRTRG
SYYRPERELKHSNGPERNYEQSDYGNDRVGWGKTATASWTFDRDNAGYAGQFIWTGTDYIGEPTPWHNQN
QTPVKSSYFGIVDTAGIPKHDFYLYQSQWVSVKKKPMVHLLPHWNWENKELASKVADSEGKIPVRAYSNA
SSVELFLNGKSLGLKTFNKKQTSDGRTYQEGANANELYLEWKVAYQPGTLEAIARDESGKEIARDKITTA
GKPAAVRLIKEDHAIAADGKDLTYIYYEIVDSQGNVVPTANNLVRFQLHGQGQLVGVDNGEQASRERYKA
QADGSWIRKAFNGKGVAIVKSTEQAGKFTLTAHSDLLKSNQVTVFTGKKEGQEKTVLGTEVPKVQTIIGE
APEMPTTVPFVYSDGSRAERPVTWSSVDVSKPGIVTVKGMADGREVEARVEVIALKSELPVVKRIAPNTD
LNSVDKSVSYVLTDGSVQEYEVDSWEIAEEDKAKLAIPGSRIQATGYLEGQPIHATLVVEEGNPAAPVVP
TVTVGGEAVTGLTSRQPMQYRTLSYGAQLPEVTASAENADVTVLQASAANGMRASIFIQPKDGGPLQTYA
IQFLEEAPKIAHLSLQVEKADSLKEDQTVKLSVRAHYQDGTQAVLPADKVTFSTSGEGEVAIRKGMLELH
KPGAVTLNAEYEGAKGQVELTIQANTEKKIAQSIRPVNVVTDLHQKPTLPTTVTVEYDKGFPKAHKVTWQ
AIPKEKLDSYQTFEVLGKVEGIDLEARAKVSVEGIVSVEEVSVTTPIAEAPQLPESVRTYDSNGHVSSAK
VAWDAIRIEQYAKEGVFTVNGRLEGTQLTTKLHVRVSAQTEQGANISDQWTGSELPLAFASDSNPSDPVS
NVNDKLISYNNQPANRWTNWNRSNPEASVGVLFGDSGILSKRSVDNLSVGFHEDHGVGAPKSYVIEYYVG
KTVPTAPKNPSFVGNEDHVFNDSANWKPVTNLKAPAQLKAGEMNHFSFDKVETYAVRIRMVKADNKRGTS
ITEVQIFAKQVAAAKQGQTRIQVDGKDLANFNPDLTDYYLESVDGKVPAVTANVSNNGLATVVPSVREGE
PVRVIAKAENGDILGEYRLHFTKDKNLLSHKPVAAVKQARLLQVGQALELPTKVPVYFTGKDGYETKDLT
VEWEEVPAENLTKAGQFTVRGRVLGSDLVAEVTVRVTDKLGEALSDNPNYDENSNQAFASATNDIDKNSH
DRVDYLNDGDHSENRRWTNWSPTPSSNPEVSAGVIFRENGKIVERTVAQAKLHFFADSGTDAPTKLVLER
YVGPEFEVPTYYSNYQAYDADHPFNNPENWEAVPYRADKDIEAGDEINVTFKAVKAKAMRWRMERKADKS
GVAMIEMTFLAPSELPQESTQSKILVDGKELADFAENRQDYQITYKGQRPKVSVEENNQVASTVVDSGED
SLPVLVRLVSESGKQVKEYRIQLTKEKPVSEKTVAAVQEDLPKLEFVEKDLAYKTVEKKDSTLYLGETRV
EQEGKTGKERIFTAINPDGSKEEKLREVVEAPTDRIVLVGTKPVAQEAKKPQVSEKADTKPIDSSEASQT
NKAQLPNTGSAASQAAVAAGLALLGLSAGLVVTKGKKED
>HMPREF0837_10938 nucleotide
ATGATTGGGACTTGTGCAGTTCTATTAGGAGGAAATATAGCTGGAGAATCTGTAGTTTATGCGGATGAAA
CACTTATTACTCATACTGCTGAGAAACCTAAAGAGGAAAAAATGATAGTAGAAGAAAAGGCTGATAAAGC
TTTGGAAACTAAAAATGTAGTTGAAAGGACAGAACAAAGTGAACCTAGTTCAACTGAGGCTATTGCATCT
GAGAAGAAAGAAGATGAAGCCGTAACTCCAAAAGAGGAAAAAGTGTCTGCTAAACCGGAAGAAAAAGCTC
CAAGGATAGAATCACAAGCTTCAAGTCAAGAAAAACCGCTCAAGGAAGATGCTAAAGCTGTAACAAATGA
AGAAATGAATCAAATGATTGAAGACAGGAAAGTGGATTTTAATCAAAATTGGCACTTTAAACTCAATGCA
AATTCTAAGGAAGCCATTAAACCTGATGCAGACGTATCTACGTGGAAAAAATTAGATTTACCGTATGACT
GGAGTATCTTTAACAATTTCGATCATGAATCTCCTGCACAAAATGAAGGTGGACAACTCAACGGTGGGGA
AGCTTGGTATCGCAAGACTTTCAAACTAGATGAAAAAGACCTCAAGAAAAATGTTCGCCTTACTTTTGAT
GGCGTCTACATGGATTCTCAAGTTTATGTCAATGGTCAGTTAGTGGGGCATTATCCAAATGGTTATAACC
AGTTCTCATATGATATCACCAAATACCTTCACAAAGATGGTCGTGAGAATGTGATTGCTGTCCATGCAGT
CAACAAACAGCCAAGTAGCCGTTGGTATTCAGGAAGTGGTATCTATCGTGATGTGACTTTACAAGTGACA
GATAAGGTGCATGTTGAGAAAAATGGGACAACTATTTTAACACCAAAACTTGAAGAACAACAACATGGCA
AGGTTGAAACTCATGTGACCAGCAAAATCGTCAATACGGACGACAAAGACCATGAACTTGTAGCCGAATA
TCAAATCGTTGAACGAGGTGGTCATGCTGTAACAGGCTTAGTTCGTACAGCGAGTCGTACCTTAAAAGCA
CATGAATCAACAAGCCTAGATGCGATTTTAGAAGTTGAAAGACCAAAACTCTGGACCGTTTTAAATGACA
AACCTGCCTTGTACGAATTGATTACGCGTGTTTACCGTGACGGTCAATTGGTTGATGCTAAGAAGGATTT
GTTTGGTTACCGTTACTATCACTGGACTCCAAATGAAGGTTTCTCTTTGAATGGTGAACGTATTAAATTC
CATGGAGTATCCTTGCACCACGACCATGGGGCGCTTGGAGCAGAAGAAAACTATAAAGCAGAATATCGCC
GTCTCAAACAAATGAAGGAGATGGGAGTTAACTCTATCCGTACAACCCACAACCCTGCTAGTGAGCAAAC
CTTGCAAATCGCAGCAGAACTAGGTTTACTCGTTCAGGAAGAGGCCTTTGATACGTGGTATGGTGGCAAG
AAACCTTATGACTATGGACGTTTCTTTGAAAAAGATGCCACTCACCCAGAAGCTCGAAAAGGTGAAAAAT
GGTCTGATTTTGACCTACGTACCATGGTCGAAAGAGGCAAAAACAACCCTGCTATCTTCATGTGGTCAAT
TGGTAATGAAATAGGTGAAGCTAATGGTGATGCCCACTCTTTAGCAACTGTTAAACGTTTGGTCAAGGTT
ATCAAGGATGTTGATAAGACTCGCTATGTTACCATGGGAGCAGATAAATTCCGTTTCGGCAATGGTAGCG
GAGGGCATGAGAAAATTGCTGATGAACTCGATGCTGTTGGATTTAACTATTCTGAAGATAATTACAAAGC
CCTTAGAGCTAAGCATCCAAAATGGTTGATTTACGGTTCAGAAACATCATCAGCAACCCGTACACGAGGA
AGTTACTATCGCCCTGAACGTGAATTGAAACATAGCAATGGACCTGAGCGTAATTATGAACAGTCAGATT
ATGGAAATGATCGTGTGGGTTGGGGGAAAACAGCAACCGCTTCATGGACTTTTGACCGTGACAACGCTGG
CTATGCTGGACAGTTTATCTGGACAGGTACGGACTATATTGGTGAACCTACACCATGGCACAACCAAAAT
CAAACTCCTGTTAAGAGCTCTTACTTTGGTATCGTAGATACAGCCGGCATTCCAAAACATGACTTCTATC
TCTACCAAAGCCAATGGGTTTCTGTTAAGAAGAAACCGATGGTACACCTTCTTCCTCACTGGAACTGGGA
AAACAAAGAATTAGCATCCAAAGTAGCTGACTCAGAAGGTAAGATTCCAGTTCGTGCTTATTCGAATGCT
TCTAGTGTAGAATTGTTCTTGAATGGAAAATCTCTTGGTCTTAAGACTTTCAATAAAAAACAAACCAGCG
ATGGGCGGACTTACCAAGAAGGTGCAAATGCTAATGAACTTTATCTTGAATGGAAAGTTGCCTATCAACC
AGGTACCTTGGAAGCAATTGCTCGTGATGAATCTGGCAAGGAAATTGCTCGAGATAAGATTACGACTGCT
GGTAAGCCAGCGGCAGTTCGTCTTATTAAGGAAGACCATGCGATTGCAGCAGATGGAAAAGACTTGACTT
ACATCTACTATGAAATTGTTGACAGCCAGGGGAATGTGGTTCCAACTGCTAATAATCTGGTTCGCTTCCA
ATTGCATGGCCAAGGTCAACTGGTCGGTGTAGATAACGGAGAACAAGCCAGCCGTGAACGCTATAAGGCG
CAAGCAGATGGTTCTTGGATTCGTAAAGCATTTAATGGTAAAGGTGTTGCCATTGTCAAATCAACTGAAC
AAGCAGGGAAATTCACACTGACAGCCCACTCTGATCTCTTGAAATCGAACCAAGTCACTGTCTTTACTGG
TAAGAAAGAAGGACAAGAGAAGACTGTTTTGGGGACAGAAGTGCCAAAAGTACAGACCATTATTGGAGAG
GCACCTGAAATGCCTACCACTGTTCCGTTTGTATACAGTGATGGTAGCCGTGCAGAACGTCCTGTAACCT
GGTCTTCAGTAGATGTGAGCAAGCCTGGTATTGTAACGGTGAAAGGTATGGCTGACGGACGAGAAGTAGA
AGCTCGTGTAGAAGTGATTGCTCTTAAATCAGAGCTACCAGTTGTGAAACGTATTGCTCCAAATACTGAC
TTGAATTCTGTAGACAAATCTGTTTCCTATGTTTTGACTGATGGAAGTGTACAAGAGTATGAAGTAGATA
GCTGGGAGATTGCCGAAGAAGATAAAGCTAAGTTAGCAATTCCAGGTTCTCGTATTCAAGCGACCGGTTA
TTTAGAAGGTCAACCAATTCATGCAACCCTTGTGGTAGAAGAAGGCAATCCTGCAGCACCTGTAGTGCCA
ACTGTTACTGTTGGAGGTGAAGCAGTAACAGGTCTTACTAGTCGACAACCAATGCAATATCGTACTCTAT
CTTATGGTGCCCAATTGCCAGAAGTCACAGCAAGTGCTGAAAATGCTGATGTGACAGTTCTTCAAGCAAG
CGCAGCAAACGGCATGCGTGCGAGCATCTTTATTCAGCCTAAAGATGGTGGCCCTCTTCAAACCTATGCA
ATTCAATTCCTTGAAGAAGCGCCAAAAATTGCTCACTTGAGTTTGCAAGTGGAAAAAGCTGACAGTCTCA
AAGAAGACCAAACTGTCAAATTGTCGGTTCGAGCTCACTATCAAGATGGAACGCAAGCTGTATTACCAGC
TGATAAAGTAACCTTCTCTACAAGTGGTGAAGGGGAAGTCGCAATTCGTAAAGGAATGCTTGAGTTGCAT
AAGCCAGGAGCAGTCACTCTGAACGCTGAATATGAGGGAGCTAAAGGCCAAGTTGAACTCACTATCCAAG
CCAATACTGAGAAGAAGATTGCGCAATCTATCCGTCCTGTAAATGTAGTCACAGATTTGCATCAAAAACC
TACTCTTCCAACAACAGTAACGGTTGAGTATGACAAAGGTTTCCCTAAAGCTCACAAAGTCACTTGGCAA
GCTATTCCGAAAGAAAAACTAGACTCCTATCAAACATTTGAAGTACTAGGTAAAGTTGAAGGAATTGACC
TTGAAGCGCGTGCAAAAGTCTCTGTAGAAGGTATCGTTTCAGTTGAAGAAGTCAGTGTGACAACTCCAAT
CGCAGAAGCACCACAATTACCAGAAAGCGTTCGGACATATGATTCAAATGGTCACGTTTCATCAGCTAAG
GTTGCATGGGATGCGATTCGTATAGAGCAATACGCTAAGGAAGGTGTCTTTACAGTTAATGGTCGCTTAG
AAGGTACTCAATTAACAACTAAACTTCATGTTCGCGTATCTGCTCAAACTGAGCAAGGTGCAAACATTTC
TGACCAATGGACCGGTTCAGAATTGCCACTTGCCTTTGCTTCAGATTCAAATCCAAGCGACCCAGTTTCA
AATGTTAATGACAAGCTCATTTCCTACAATAACCAACCAGCCAATCGTTGGACAAACTGGAATCGTAGTA
ATCCAGAAGCTTCAGTCGGTGTTCTGTTTGGAGATTCAGGTATCTTGAGCAAACGCTCCGTTGATAATCT
AAGTGTCGGATTCCACGAAGACCATGGAGTTGGTGCACCGAAGTCTTATGTGATTGAGTATTATGTTGGT
AAGACTGTCCCAACAGCTCCTAAAAACCCTAGTTTTGTTGGTAATGAGGACCATGTCTTTAATGATTCTG
CCAACTGGAAACCAGTTACTAATCTAAAAGCCCCTGCTCAACTCAAGGCTGGAGAAATGAACCACTTTAG
CTTTGATAAAGTTGAAACCTATGCTGTTCGTATTCGCATGGTTAAAGCAGATAACAAGCGTGGAACGTCT
ATCACAGAGGTACAAATCTTTGCGAAACAAGTTGCGGCAGCCAAACAAGGACAAACAAGAATCCAAGTTG
ACGGCAAAGACTTAGCAAACTTCAACCCTGATTTGACAGACTACTACCTTGAGTCTGTAGATGGAAAAGT
TCCGGCAGTCACAGCAAATGTTAGCAACAATGGTCTCGCTACCGTCGTTCCAAGCGTTCGTGAAGGTGAG
CCAGTTCGTGTCATCGCGAAAGCTGAAAATGGCGACATCTTAGGAGAATACCGTCTGCACTTCACTAAGG
ATAAGAACTTACTTTCTCATAAACCAGTTGCTGCGGTTAAACAAGCTCGCTTGCTACAAGTAGGTCAAGC
ACTTGAATTGCCGACTAAGGTTCCAGTTTACTTCACAGGTAAAGACGGCTACGAAACAAAAGACCTGACA
GTTGAATGGGAAGAAGTTCCAGCGGAAAATCTGACAAAAGCAGGTCAATTTACTGTTCGAGGCCGTGTCC
TTGGTAGTGACCTTGTTGCTGAGGTCACTGTACGAGTGACAGACAAACTTGGTGAGGCTCTTTCAGATAA
CCCTAACTATGATGAAAACAGTAACCAGGCCTTTGCTTCAGCAACCAATGATATTGACAAAAACTCTCAT
GACCGCGTTGACTATCTCAATGACGGAGATCATTCAGAAAATCGTCGTTGGACAAACTGGTCACCAACAC
CATCTTCTAATCCAGAAGTATCAGCGGGTGTGATCTTCCGTGAAAATGGTAAGATTGTAGAACGGACTGT
TGCGCAAGCCAAACTTCACTTCTTTGCAGATAGTGGTACGGATGCACCAACTAAACTCGTTTTAGAACGC
TATGTCGGTCCAGAGTTTGAAGTGCCAACCTACTATTCAAACTACCAAGCCTACGACGCAGACCATCCAT
TCAACAATCCAGAAAATTGGGAAGCTGTGCCTTATCGTGCGGATAAAGACATTGAAGCCGGTGATGAAAT
TAACGTTACCTTTAAAGCTGTCAAAGCCAAAGCCATGAGATGGCGTATGGAGCGTAAAGCAGATAAGAGC
GGTGTTGCGATGATTGAGATGACCTTCCTTGCACCGAGTGAATTGCCTCAAGAAAGCACTCAATCGAAGA
TTCTTGTAGATGGAAAAGAGCTTGCTGATTTCGCTGAAAATCGTCAAGACTATCAAATTACCTATAAAGG
TCAACGGCCAAAAGTCTCAGTTGAAGAAAATAATCAAGTAGCTTCAACTGTGGTAGATAGTGGAGAAGAT
AGCCTTCCAGTACTTGTTCGCCTCGTTTCAGAAAGTGGAAAACAAGTTAAGGAATACCGTATCCAGTTGA
CTAAGGAAAAACCAGTTTCTGAGAAGACAGTTGCTGCTGTACAAGAAGATCTTCCAAAACTCGAATTTGT
TGAAAAAGATTTGGCCTACAAGACAGTTGAGAAAAAAGATTCAACACTGTATCTAGGTGAAACTCGTGTA
GAACAAGAAGGAAAAACTGGTAAAGAACGTATCTTTACAGCGATTAATCCTGATGGAAGTAAGGAAGAAA
AACTCCGTGAAGTGGTAGAAGCTCCGACAGACCGCATCGTCTTGGTTGGAACCAAACCAGTAGCTCAAGA
AGCTAAAAAACCACAAGTGTCAGAAAAAGCAGATACAAAACCAATTGATTCAAGTGAAGCTAGTCAAACT
AATAAAGCCCAGTTACCAAATACAGGTAGTGCGGCAAGCCAAGCAGCAGTAGCAGCAGGTTTAGCTCTTC
TAGGTTTGAGTGCAGGATTAGTAGTTACTAAAGGTAAAAAAGAAGACTAG
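The two FASTA records above (polypeptide and nucleotide) can be read and compared with a few lines of Python. This is a minimal sketch, not part of the Sybil site: `parse_fasta` and `translate` are hypothetical helper names, the codon table is the standard genetic code (assumed appropriate here), and the example input is truncated to the first six residues and first 18 nucleotides of the record.

```python
# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def parse_fasta(text: str) -> dict:
    """Return {header: sequence} for every '>' record in a FASTA string."""
    records = {}
    header = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            header = line[1:]
            records[header] = []
        elif header is not None:
            records[header].append(line)
    return {h: "".join(parts) for h, parts in records.items()}

def translate(nt: str) -> str:
    """Translate in frame 0; unknown codons become 'X', stop codons '*'."""
    nt = nt.upper()
    usable = len(nt) - len(nt) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE.get(nt[i:i + 3], "X") for i in range(0, usable, 3))

# Truncated excerpt of the record, for illustration only.
example = """>HMPREF0837_10938 polypeptide
MIGTCA
>HMPREF0837_10938 nucleotide
ATGATTGGGACTTGTGCA
"""

records = parse_fasta(example)
protein = records["HMPREF0837_10938 polypeptide"]
translated = translate(records["HMPREF0837_10938 nucleotide"])
print(protein == translated)
```

Run against the full records, the same comparison should be made after stripping the trailing `*` that the final stop codon produces.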
sybil web site: sybil.sourceforge.net e-mail: driley@som.umaryland.edu