sybil: strepneumo: protein sp19_2251 [db=strepneumo_v15]
PROTEIN PROPERTIES properties of sp19_2251
property         value
organism         Streptococcus pneumoniae SP19-BS75
product name     conserved hypothetical protein
sequence length  2233 aa
created          2010-09-03 11:12:00
last modified    2010-09-03 11:12:00
DATABASE REFERENCES database refs for sp19_2251
database accession version
PROTEIN CLUSTERS clusters of which sp19_2251 is a member
cluster program algorithm analysis description
strepneumo_v15.match.2047595087.1 j_ortholog_clusters j_ortholog_clusters Ortholog Clusters
GENOMIC CONTEXT genomic context of the gene
sequence location strand
pneumoniae SP19-BS75 (2099971bp) (2.10 Mb) 595561-602263 +
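The reported location and sequence length can be cross-checked with a little arithmetic. The sketch below assumes (the page does not state it) that the gene span covers the coding sequence plus one stop codon. It compares two possible readings of the coordinates: interbase (0-based, end-exclusive) and 1-based inclusive. The function names are illustrative only.

```python
# Values copied from the record above.
START, END = 595561, 602263
PROTEIN_LENGTH_AA = 2233


def span_interbase(start, end):
    """Span length if coordinates are interbase (0-based, end-exclusive)."""
    return end - start


def span_inclusive(start, end):
    """Span length if coordinates are 1-based and inclusive."""
    return end - start + 1


# Assumption: the coding sequence is the protein residues plus one stop codon.
expected_cds_bp = (PROTEIN_LENGTH_AA + 1) * 3

print("expected CDS length:", expected_cds_bp)               # 6702
print("interbase reading:  ", span_interbase(START, END))    # 6702
print("inclusive reading:  ", span_inclusive(START, END))    # 6703
```

Under these assumptions only the interbase reading agrees with (2233 + 1) x 3 = 6702 nt, which suggests, but does not prove, that the location is given in interbase coordinates.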
BLASTP HITS proteins with significant BLASTP matches
SEQUENCE the amino acid sequence in FASTA format
>sp19_2251 polypeptide
MGKGHWNRKRVYSIRKFAVGACSVMIGTCAVLLGGNIAGESVVYADETLITHTTEKPKEEKMIVEEKADK
ALETKNVVERTEQSEPSSTEAIASEKKEDEAVTPKEEKVSAKPEEKAPRIESQASSQEKPLKEDAKAVTN
EEVNQMIENRKVDFNQNWYFKLNANSKEAIKPDADVSTWKKLDLPYDWSIFNDFDHESPAQNEGGQLNGG
EAWYRKTFKLDEKDLKKNVRLTFDGVYMDSQVYVNGQLVGHYPNGYNQFSYDITKYLYKDGRENVIAVHA
VNKQPSSRWYSGSGIYRDVTLQVTDKVHVEKNGTTILTPKLEEQQHGKVETHVTSKIVNTDDKDHELVAE
YQIVERGGHAVTGLVRTASRTLKAHESTSLDAILEVERPKLWTVLNDKPALYELITRVYRDGQLVDAKKD
LFGYRYYHWTPNEGFSLNGERIKFHGVSLHHDHGALGAEENYKAEYRRLKQMKEMGVNSIRTTHNPASEQ
TLQIAAELGLLVQEEAFDTWYGGKKPYDYGRFFEKDATHPEARKGEKWSDFDLRTMVERGKNNPAIFMWS
IGNEIGEANGDAHSLATVKRLVKVIKDVDKTRYVTMGADKFRFGNGSGGHEKIADELDAVGFNYSEDNYK
ALRAKHPKWLIYGSETSSATRTRGSYYRPERELKHSNGPERNYEQSDYGNDRVGWGKTATASWTFDRDNA
GYAGQFIWTGTDYIGEPTPWHNQNQTPVKSSYFGIVDTAGIPKHDFYLYQSQWVSVKKKPMVHLLPHWNW
ENKELASKVADSEGKIPVRAYSNASSVELFLNGKSLGLKTFNKKQTSDGRTYQEGANANELYLEWKVAYQ
PGTLEAIARDESGKEIARDKITTAGKPAAVRLIKEDHAIAADGKDLTYIYYEIVDSQGNVVPTANNLVRF
QLHGQGQLVGVDNGEQASRERYKAQADGSWIRKAFNGKGVAIVKSTEQAGKFTLTAHSDLLKSNQVTVFT
GKKEGQEKTVLGTEVPKVQTIIGEAPEMPTTVPFVYSDGSRAERPVTWSSVDVSKPGIVTVKGMADGREV
EARVEVIALKSELPVVKRIAPNTDLNSVDKSVSYVLTDGSVQEYEVDKWEIAEEDKAKLAIPGSRIQATG
YLEGQPIHATLVVEEGNPAAPVVPTVTVGGEAVTGLTSRQPMQYRTLSYGAQLPEVTASAKNAAVTVLQA
SAANGMRASIFIQPKDGGPLQTYAIQFLEEAPKIAHLSLQVEKADSLKEDQTVKLSVRAHYQDGTQAVLP
ADKVTFSTSGEGEVAIRKGMLELHKPGAVTLNAEYEGAKGQVELTIQANTEKKIAQSIRPVNVVTDLHQE
PSLPATVTVEYDKGFPKTHKVTWQAIPKEKLDSYQTFEVLGKVEGIDLEARAKVSVEGIVSVEEVSVTTP
IAEAPQLPESVRTYDSNGHVSSAKVAWDAIRPEQYAKEGVFTVNGRLEGTQLTTKLHVRVSAQTEQGANI
SDQWTGSELPLAFASDSNPSDPVSNVNDKLISYNNQPANRWTNWNRTNPEASVGVLFGDSGILSKRSVDN
LSVGFHEDHGVGAPKSYVIEYYVGKTVPTAPKNPSFVGNEDHVFNDSANWKPVTNLKAPAQLKAGEMNHF
SFDKVETYAVRIRMVKADNKRGTSITEVQIFAKQVAAAKQGQTRIQVDGKDLANFNPDLTDYYLESVDGK
VPAVTASVSNNGLATVVPSVREGEPVRVIAKAENGDILGEYRLHFTKDKSLLSHKPVAAVKQARLLQVGQ
ALELPTKVPVYFTGKDGYETKDLTVEWEEVPAENLTKAGQFTVRGRVLGSNLVAEVTVRVTDKLGETLSD
NPNYDENSNQAFASATNDIDKNSHDRVDYLNDGDYSENRRWTNWSPTPSSNPEVSAGVIFRENGKIVERT
VAQGKVQFFADSGTDAPSKLVLERYVGPEFEVPTYYSNYQAYDADHPFNNPENWEAVPYRADKDIAAGDE
INVTFKAIKAKAMRWRMERKADKSGVAMIEMTFLAPSELPQESTQSKILVDGKELADFAENRQDYQITYK
GQRPKVSVEENNQVASTVVDSGEDSFPVLVRLVSESGKQVKEYRIHLTKEKPVSEKTVAAVQEDLPKIEF
VEKDLAYKTVEKKDSTLYLGETRVEQEGKVGKERIFTAINPDGSKEEKLREVVEVPTDRIVLVGTKPVAQ
EAKKPQVSEKADTKPIDSSEASQTNKAQLPNTGSAASQAAVAAGLALLGLSAGLVVTKGKKED
>sp19_2251 nucleotide
ATGGGGAAAGGCCATTGGAATCGGAAAAGAGTTTATAGCATTCGTAAGTTTGCTGTGGGAGCTTGCTCAG
TAATGATTGGGACTTGTGCAGTTCTATTAGGAGGAAATATAGCTGGAGAATCTGTAGTTTATGCGGATGA
AACACTTATTACTCATACTACTGAGAAACCTAAAGAGGAAAAAATGATAGTAGAAGAAAAGGCTGATAAA
GCTTTGGAAACTAAAAATGTAGTTGAAAGGACAGAACAAAGTGAACCTAGTTCAACTGAGGCTATTGCAT
CTGAGAAGAAAGAAGATGAAGCCGTAACTCCAAAAGAGGAAAAAGTGTCTGCTAAACCGGAAGAAAAAGC
TCCAAGGATAGAATCACAAGCTTCAAGTCAAGAAAAACCGCTCAAGGAAGATGCTAAAGCTGTAACAAAT
GAAGAAGTGAATCAAATGATTGAAAACAGGAAAGTGGATTTTAATCAAAATTGGTACTTTAAACTCAATG
CAAATTCTAAGGAAGCCATTAAACCTGATGCAGACGTATCTACGTGGAAAAAATTAGATTTACCGTATGA
CTGGAGTATCTTTAACGATTTCGATCATGAATCTCCTGCACAAAATGAAGGTGGACAGCTCAACGGTGGG
GAAGCTTGGTATCGCAAGACTTTCAAACTAGATGAAAAAGACCTCAAGAAAAATGTTCGCCTTACTTTTG
ATGGCGTCTACATGGATTCTCAAGTTTATGTCAATGGTCAGTTAGTGGGGCATTATCCAAATGGTTATAA
CCAGTTCTCATACGATATCACCAAATACCTTTACAAAGATGGTCGTGAGAATGTGATTGCTGTCCATGCA
GTCAACAAACAGCCAAGTAGCCGTTGGTATTCAGGAAGTGGTATCTATCGTGATGTGACTTTACAAGTGA
CAGATAAGGTGCATGTTGAGAAAAATGGGACAACTATTTTAACACCAAAACTTGAAGAACAACAACATGG
CAAGGTTGAAACTCATGTGACCAGCAAAATCGTCAATACGGACGACAAAGACCATGAACTTGTAGCCGAA
TATCAAATCGTTGAACGAGGTGGTCATGCTGTAACAGGCTTAGTTCGTACAGCGAGTCGTACCTTAAAAG
CACATGAATCAACAAGCCTAGATGCGATTTTAGAAGTTGAAAGACCAAAACTCTGGACCGTTTTAAATGA
CAAACCTGCCTTGTACGAATTGATTACGCGTGTTTACCGTGACGGTCAATTGGTTGATGCTAAGAAGGAT
TTGTTTGGTTACCGTTACTATCACTGGACTCCAAATGAAGGTTTCTCTTTGAATGGTGAACGTATTAAAT
TCCATGGAGTATCCTTGCACCACGACCATGGGGCGCTTGGAGCAGAAGAAAACTATAAAGCAGAATATCG
CCGTCTCAAACAAATGAAGGAGATGGGAGTTAACTCCATCCGTACAACCCACAACCCTGCTAGTGAGCAA
ACCTTGCAAATCGCAGCAGAACTAGGTTTACTCGTTCAGGAAGAGGCCTTTGATACTTGGTATGGTGGCA
AGAAACCTTATGACTATGGACGTTTCTTTGAAAAAGATGCCACTCACCCAGAAGCTCGAAAAGGTGAAAA
ATGGTCTGATTTTGACCTACGTACCATGGTCGAAAGAGGCAAAAACAACCCTGCTATCTTCATGTGGTCA
ATTGGTAATGAAATAGGTGAAGCTAATGGTGATGCCCACTCTTTAGCAACTGTTAAACGTTTGGTCAAGG
TTATCAAGGATGTTGATAAGACTCGCTATGTTACCATGGGAGCAGATAAATTCCGTTTCGGTAATGGTAG
CGGAGGGCATGAGAAAATTGCTGATGAACTCGATGCTGTTGGATTTAACTATTCTGAAGATAATTACAAA
GCCCTTAGAGCTAAGCATCCAAAATGGTTGATTTACGGTTCAGAAACATCATCAGCAACCCGTACACGAG
GAAGTTACTATCGCCCTGAACGTGAATTGAAACATAGCAATGGACCTGAGCGTAATTATGAACAGTCAGA
TTATGGAAATGATCGTGTGGGTTGGGGGAAAACAGCAACCGCTTCATGGACTTTTGACCGTGACAACGCT
GGCTATGCTGGACAGTTTATCTGGACAGGTACGGACTATATTGGTGAACCTACACCATGGCACAACCAAA
ATCAAACTCCCGTTAAGAGCTCTTACTTTGGTATCGTAGATACAGCCGGCATTCCAAAACATGACTTCTA
TCTCTACCAAAGCCAATGGGTTTCTGTTAAGAAGAAACCGATGGTACACCTTCTTCCTCACTGGAACTGG
GAAAACAAAGAATTAGCATCCAAAGTAGCTGACTCAGAAGGTAAGATTCCAGTTCGTGCTTATTCGAATG
CTTCTAGTGTAGAATTGTTCTTGAATGGAAAATCTCTTGGTCTTAAGACTTTCAATAAAAAACAAACCAG
CGATGGGCGGACTTACCAAGAAGGTGCAAATGCTAATGAACTTTATCTTGAATGGAAAGTTGCCTATCAA
CCAGGTACCTTGGAAGCAATTGCTCGTGATGAATCTGGCAAGGAAATTGCTCGAGATAAGATTACGACTG
CTGGTAAGCCAGCGGCAGTTCGTCTTATTAAGGAAGATCATGCGATTGCAGCAGATGGAAAAGACTTGAC
TTACATCTACTATGAAATTGTTGACAGCCAGGGGAATGTGGTTCCAACTGCTAATAATCTGGTTCGCTTC
CAATTGCATGGTCAAGGTCAACTGGTCGGTGTAGATAACGGAGAACAAGCCAGCCGTGAACGCTATAAGG
CGCAAGCAGATGGTTCTTGGATTCGTAAAGCATTTAATGGTAAAGGTGTTGCCATTGTCAAATCAACTGA
ACAAGCAGGGAAATTCACCCTTACTGCCCACTCTGATCTCTTGAAATCGAACCAAGTCACTGTCTTTACT
GGTAAGAAAGAAGGACAAGAGAAGACTGTTTTGGGGACAGAAGTGCCAAAAGTACAGACCATTATTGGAG
AGGCACCTGAAATGCCTACCACTGTTCCGTTTGTATACAGTGATGGTAGCCGTGCAGAACGTCCTGTAAC
CTGGTCTTCAGTAGATGTGAGCAAGCCTGGTATTGTAACGGTGAAAGGTATGGCTGACGGACGAGAAGTA
GAAGCTCGTGTAGAAGTGATTGCTCTTAAATCAGAGCTACCAGTTGTGAAACGTATTGCTCCAAATACTG
ACTTGAATTCTGTAGACAAATCTGTTTCCTATGTTTTGACTGATGGAAGTGTACAAGAGTATGAAGTGGA
CAAGTGGGAGATTGCCGAAGAAGATAAAGCTAAGTTAGCAATTCCAGGTTCTCGTATTCAAGCGACCGGT
TATTTAGAAGGTCAACCAATTCATGCAACCCTTGTGGTAGAAGAAGGCAATCCTGCAGCACCTGTAGTGC
CAACTGTTACTGTTGGAGGTGAAGCAGTAACAGGTCTTACTAGTCGACAACCAATGCAATATCGTACTCT
ATCTTATGGTGCCCAATTGCCAGAAGTCACAGCAAGTGCTAAAAATGCAGCTGTTACAGTTCTTCAAGCA
AGCGCAGCAAACGGCATGCGTGCGAGCATCTTTATTCAGCCTAAAGATGGTGGCCCTCTTCAAACCTATG
CAATTCAATTCCTTGAAGAAGCGCCAAAAATTGCTCACTTGAGCTTGCAAGTGGAAAAAGCTGACAGTCT
CAAAGAAGACCAAACTGTCAAATTGTCGGTTCGAGCTCACTATCAAGATGGAACGCAAGCTGTATTACCA
GCTGATAAAGTAACCTTCTCTACAAGTGGTGAAGGGGAAGTCGCAATTCGTAAAGGAATGCTTGAGTTGC
ATAAGCCAGGAGCAGTCACTCTGAACGCTGAATATGAGGGAGCTAAAGGCCAAGTTGAACTCACTATCCA
AGCCAATACTGAGAAGAAGATTGCGCAATCCATCCGTCCTGTAAATGTAGTGACAGATTTGCATCAGGAA
CCAAGTCTTCCAGCAACAGTAACAGTTGAGTATGACAAAGGTTTCCCTAAAACTCATAAAGTCACTTGGC
AAGCTATTCCGAAAGAAAAACTAGACTCCTATCAAACATTTGAAGTACTAGGTAAAGTTGAAGGAATTGA
CCTTGAAGCGCGTGCAAAAGTCTCTGTAGAAGGTATCGTTTCAGTTGAAGAAGTCAGTGTGACAACTCCA
ATCGCAGAAGCACCACAATTACCAGAAAGTGTTCGGACATATGATTCAAATGGTCACGTTTCATCAGCTA
AGGTTGCATGGGATGCGATTCGTCCAGAGCAATACGCTAAGGAAGGTGTCTTTACAGTTAATGGTCGCTT
AGAAGGTACTCAATTAACAACTAAACTTCATGTTCGCGTATCTGCTCAAACTGAGCAAGGTGCAAACATT
TCTGACCAATGGACCGGTTCAGAATTGCCACTTGCCTTTGCTTCAGACTCAAATCCAAGCGACCCAGTTT
CAAATGTTAATGACAAGCTCATTTCCTACAATAACCAACCAGCCAATCGTTGGACAAACTGGAATCGTAC
TAATCCAGAAGCTTCAGTCGGTGTCCTATTCGGAGATTCAGGTATTTTGAGCAAACGCTCCGTTGATAAT
CTAAGTGTCGGATTCCACGAAGACCATGGAGTTGGTGCACCGAAGTCTTATGTGATTGAGTATTATGTTG
GTAAGACTGTCCCAACAGCTCCTAAAAACCCTAGTTTTGTTGGTAATGAGGACCATGTCTTTAATGATTC
TGCCAACTGGAAACCAGTTACTAATCTAAAAGCCCCTGCTCAACTCAAGGCTGGAGAAATGAACCACTTT
AGCTTTGATAAAGTTGAAACCTATGCTGTTCGTATTCGCATGGTTAAAGCAGATAACAAGCGTGGAACGT
CTATCACAGAGGTACAAATCTTTGCGAAACAAGTTGCGGCAGCCAAGCAAGGACAAACAAGAATCCAAGT
TGACGGCAAAGACTTAGCAAACTTCAACCCTGATTTGACAGACTACTACCTTGAGTCTGTAGATGGAAAA
GTTCCGGCAGTCACAGCAAGTGTTAGCAACAATGGTCTCGCTACCGTCGTTCCAAGCGTTCGTGAAGGTG
AGCCAGTTCGTGTCATCGCGAAAGCTGAAAATGGCGACATCTTAGGAGAATACCGTCTGCACTTCACTAA
GGATAAGAGCTTACTTTCTCATAAACCAGTTGCTGCGGTTAAACAAGCTCGCTTGCTACAAGTAGGTCAA
GCACTTGAATTGCCGACTAAGGTTCCAGTTTACTTCACAGGTAAAGACGGCTACGAAACAAAAGACCTGA
CAGTTGAATGGGAAGAAGTTCCAGCGGAAAATCTGACAAAAGCAGGTCAATTTACTGTTCGAGGCCGTGT
CCTTGGTAGTAACCTTGTTGCTGAGGTCACTGTACGAGTGACAGACAAACTTGGTGAGACTCTTTCAGAT
AACCCTAACTATGATGAAAACAGTAACCAGGCCTTTGCTTCAGCAACCAATGATATTGACAAAAACTCTC
ATGACCGCGTTGACTATCTCAATGACGGAGATTATTCAGAAAATCGTCGTTGGACAAACTGGTCACCAAC
ACCATCTTCTAATCCAGAAGTATCAGCGGGTGTGATTTTCCGTGAAAATGGTAAGATTGTAGAACGGACT
GTTGCACAAGGAAAAGTTCAGTTCTTTGCAGATAGTGGTACGGATGCACCATCTAAACTCGTTTTAGAAC
GCTATGTCGGTCCAGAGTTTGAAGTGCCAACCTACTATTCAAACTACCAAGCCTACGACGCAGACCATCC
ATTCAACAATCCAGAAAATTGGGAAGCTGTTCCTTATCGTGCGGATAAAGACATTGCAGCTGGTGATGAA
ATCAACGTAACATTTAAAGCTATCAAAGCCAAAGCTATGAGATGGCGTATGGAGCGTAAAGCAGATAAGA
GCGGTGTTGCGATGATTGAGATGACCTTCCTTGCACCAAGTGAATTGCCTCAAGAAAGCACTCAATCAAA
GATTCTTGTAGATGGAAAAGAACTTGCTGATTTCGCTGAAAATCGTCAAGACTATCAAATTACCTATAAA
GGTCAACGGCCAAAAGTCTCAGTTGAAGAAAACAATCAAGTAGCTTCAACTGTGGTAGATAGTGGAGAAG
ATAGCTTTCCAGTACTTGTTCGCCTCGTTTCAGAAAGTGGAAAACAAGTCAAGGAATACCGTATCCACTT
GACTAAGGAAAAACCAGTTTCTGAGAAGACAGTTGCTGCTGTACAAGAAGATCTTCCAAAAATCGAATTT
GTTGAAAAAGATTTGGCATACAAGACAGTTGAGAAAAAAGATTCAACACTGTATCTAGGTGAAACTCGTG
TAGAACAAGAAGGAAAAGTTGGAAAAGAACGTATCTTTACAGCGATTAATCCTGATGGAAGTAAGGAAGA
AAAACTCCGTGAAGTGGTAGAAGTTCCGACAGACCGCATCGTCTTGGTTGGAACCAAACCAGTAGCTCAA
GAAGCTAAAAAACCACAAGTGTCAGAAAAAGCAGATACAAAACCAATTGATTCAAGTGAAGCTAGTCAAA
CTAATAAAGCCCAGTTACCAAATACAGGTAGTGCGGCAAGCCAAGCAGCAGTAGCAGCAGGTTTAGCTCT
TCTAGGTTTGAGTGCAGGATTAGTAGTTACTAAAGGTAAAAAAGAAGACTAG
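
The first and last codons of the nucleotide record can be checked against the polypeptide record. This is a minimal sketch, assuming the standard codon assignments (NCBI translation tables 1 and 11 differ only in their start codons). `translate` is an illustrative helper, not part of Sybil.

```python
# Standard genetic code in NCBI order: bases T, C, A, G at each codon position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(nucleotides):
    """Translate an in-frame DNA string; '*' marks a stop codon."""
    seq = nucleotides.upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length must be a multiple of 3")
    return "".join(CODON_TABLE[seq[p:p + 3]] for p in range(0, len(seq), 3))


# First 24 and last 18 nucleotides of the sp19_2251 nucleotide record.
first_codons = "ATGGGGAAAGGCCATTGGAATCGG"
last_codons = "GGTAAAAAAGAAGACTAG"

print(translate(first_codons))  # MGKGHWNR
print(translate(last_codons))   # GKKED*
```

Both fragments agree with the start (MGKGHWNR...) and end (...GKKED) of the polypeptide record, with the final TAG read as the stop codon.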
sybil web site: sybil.sourceforge.net e-mail: driley@som.umaryland.edu