BLASTP 2.0.10 [Aug-26-1999]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= PAB0311 (pfpI) DE:intracellular protease (pfpI)
         (166 letters)

Database: ./suso.pep; /banques/blast2/nr.pep
           598,487 sequences; 189,106,746 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

pir||H75163 intracellular proteinase (pfpi) PAB0311 - Pyrococcus...   340  5e-93
sp|O59413|PFPI_PYRHO PROTEASE I >gi|7449372|pir||C71178 probable...   310  8e-84
sp|Q51732|PFPI_PYRFU PROTEASE I >gi|2129411|pir||JC6003 intracel...   305  3e-82
pdb|1G2I|A Chain A, Crystal Structure Of A Novel Intracellular P...   301  4e-81
pir||F72722 hypothetical protein APE0319 - Aeropyrum pernix (str...   221  4e-57
gi|11498880 intracellular protease (pfpI) [Archaeoglobus fulgidu...   186  1e-46
gb|AAK41330.1| Intracellular proteinase [Sulfolobus solfataricus]     171  3e-42
emb|CAC11607.1| (AL445064) probable intracellular proteinase I [...   153  8e-37
pir||A70355 proteinase I - Aquifex aeolicus >gi|2983230|gb|AAC06...   141  6e-33
pir||E65105 hypothetical 20.3 kD protein in sohA-mtr intergenic ...   134  8e-31
sp|P45470|YHBO_ECOLI HYPOTHETICAL 18.9 KDA PROTEIN IN SOHA-MTR I...   134  8e-31
sp|P80876|GS18_BACSU GENERAL STRESS PROTEIN 18 (GSP18) >gi|74493...   132  3e-30
dbj|BAB06744.1| (AP001517) general stress protein [Bacillus halo...   130  9e-30
pir||E83601 proteinase PfpI PA0355 [imported] - Pseudomonas aeru...   130  1e-29
sp|O06006|YRAA_BACSU HYPOTHETICAL 16.8 KD PROTEIN IN ADHA-SACC I...   128  3e-29
pir||T34745 probable proteinase pfpI - Streptomyces coelicolor >...   122  2e-27
sp|Q53719|YLY1_STAAU HYPOTHETICAL 18.6 KD PROTEIN IN LYTA 3'REGI...   120  7e-27
pir||F75423 proteinase I - Deinococcus radiodurans (strain R1) >...   112  2e-24
gb|AAF26988.1|AC018363_33 (AC018363) unknown protein [Arabidopsi...   106  2e-22
pir||A83103 conserved hypothetical protein PA4336 [imported] - P...   103  1e-21
gb|AAD56430.1|AF158699_2 (AF158699) unknown [Burkholderia cepacia]    102  3e-21
pir||D83125 probable proteinase PA4171 [imported] - Pseudomonas ...   100  1e-20
sp|Q58377|Y967_METJA HYPOTHETICAL PROTEIN MJ0967 >gi|2128604|pir...    92  4e-18
dbj|BAB16486.1| (AP002861) hypothetical protein [Oryza sativa]         80  2e-14
pir||T47619 hypothetical protein T14E10.170 - Arabidopsis thalia...    77  1e-13
gb|AAF57086.1| (AE003775) CG1349 gene product [Drosophila melano...    75  7e-13
gb|AAC79625.1| (AC005770) unknown protein [Arabidopsis thaliana]       74  1e-12
dbj|BAA97062.1| (AP000370) emb|CAA17570.1~gene_id:K15M2.13~simil...    72  4e-12
pir||D70177 4-methyl-5(b-hydroxyethyl)-thiazole monophosphate bi...    71  8e-12
pir||T03871 hypothetical protein C49G7.11 - Caenorhabditis elega...    70  1e-11
gb|AAF69547.1|AC008007_22 (AC008007) F12M16.18 [Arabidopsis thal...    70  2e-11

>pir||H75163 intracellular proteinase (pfpi) PAB0311 - Pyrococcus abyssi (strain Orsay)
           >gi|5457901|emb|CAB49391.1| (AJ248284) intracellular protease (pfpI) [Pyrococcus abyssi]
          Length = 166

 Score = 340 bits (863), Expect = 5e-93
 Identities = 166/166 (100%), Positives = 166/166 (100%)

Query: 1   MRVLILSADQFEDVELIYPYHRLKEEGHEVLVASFKRGVITGKHGYTVNVDLAFEEVNPD 60
           MRVLILSADQFEDVELIYPYHRLKEEGHEVLVASFKRGVITGKHGYTVNVDLAFEEVNPD
Sbjct: 1   MRVLILSADQFEDVELIYPYHRLKEEGHEVLVASFKRGVITGKHGYTVNVDLAFEEVNPD 60

Query: 61  EFDALVLPGGRAPERVRLNEKAVEIAKKMFSEGKPVASICHGPQILISAGVLRGRRGTSY 120
           EFDALVLPGGRAPERVRLNEKAVEIAKKMFSEGKPVASICHGPQILISAGVLRGRRGTSY
Sbjct: 61  EFDALVLPGGRAPERVRLNEKAVEIAKKMFSEGKPVASICHGPQILISAGVLRGRRGTSY 120

Query: 121 PGIKDDMINAGVDWVDAEVVVDGNWVSSRVPGDLYAWMREFVKLLK 166
           PGIKDDMINAGVDWVDAEVVVDGNWVSSRVPGDLYAWMREFVKLLK
Sbjct: 121 PGIKDDMINAGVDWVDAEVVVDGNWVSSRVPGDLYAWMREFVKLLK 166

>sp|O59413|PFPI_PYRHO PROTEASE I >gi|7449372|pir||C71178 probable intracellular proteinase - Pyrococcus horikoshii
           >gi|3258135|dbj|BAA30818.1| (AP000007) 166aa long hypothetical intracellular proteinase [Pyrococcus horikoshii]
          Length = 166

 Score = 310 bits (785), Expect = 8e-84
 Identities = 146/166 (87%), Positives = 159/166 (94%)

Query: 1   MRVLILSADQFEDVELIYPYHRLKEEGHEVLVASFKRGVITGKHGYTVNVDLAFEEVNPD 60
           M+VL L+A++FEDVELIYPYHRLKEEGHEV +ASF+RG ITGKHGY+V VDL F++VNP+
Sbjct: 1   MKVLFLTANEFEDVELIYPYHRLKEEGHEVYIASFERGTITGKHGYSVKVDLTFDKVNPE 60

Query: 61  EFDALVLPGGRAPERVRLNEKAVEIAKKMFSEGKPVASICHGPQILISAGVLRGRRGTSY 120
           EFDALVLPGGRAPERVRLNEKAV IA+KMFSEGKPVASICHGPQILISAGVLRGR+GTSY
Sbjct: 61  EFDALVLPGGRAPERVRLNEKAVSIARKMFSEGKPVASICHGPQILISAGVLRGRKGTSY 120

Query: 121 PGIKDDMINAGVDWVDAEVVVDGNWVSSRVPGDLYAWMREFVKLLK 166
           PGIKDDMINAGV+WVDAEVVVDGNWVSSRVP DLYAWMREFVKLLK
Sbjct: 121 PGIKDDMINAGVEWVDAEVVVDGNWVSSRVPADLYAWMREFVKLLK 166

>sp|Q51732|PFPI_PYRFU PROTEASE I >gi|2129411|pir||JC6003 intracellular proteinase I (EC 3.4.-.-) - Pyrococcus furiosus
           >gi|1373331|gb|AAB04694.1| (U57642) protease I [Pyrococcus furiosus]
          Length = 166

 Score = 305 bits (772), Expect = 3e-82
 Identities = 142/166 (85%), Positives = 159/166 (95%)

Query: 1   MRVLILSADQFEDVELIYPYHRLKEEGHEVLVASFKRGVITGKHGYTVNVDLAFEEVNPD 60
           M++L LSA++FEDVELIYPYHRLKEEGHEV +ASF++GVITGKHGY+V VDL F+EVNPD
Sbjct: 1   MKILFLSANEFEDVELIYPYHRLKEEGHEVYIASFEKGVITGKHGYSVKVDLTFDEVNPD 60

Query: 61  EFDALVLPGGRAPERVRLNEKAVEIAKKMFSEGKPVASICHGPQILISAGVLRGRRGTSY 120
           EFDALVLPGGRAPERVRLNEKAVEIA+KMF+EGKPVA+ICHGPQILISAGVL+GR+GTSY
Sbjct: 61  EFDALVLPGGRAPERVRLNEKAVEIARKMFTEGKPVATICHGPQILISAGVLKGRKGTSY 120

Query: 121 PGIKDDMINAGVDWVDAEVVVDGNWVSSRVPGDLYAWMREFVKLLK 166
            GI+DDMINAGV+W+D EVVVDGNWVSSR PGDLYAWMREFVKLLK
Sbjct: 121 IGIRDDMINAGVEWIDREVVVDGNWVSSRHPGDLYAWMREFVKLLK 166

>pdb|1G2I|A Chain A, Crystal Structure Of A Novel Intracellular Protease From Pyrococcus Horikoshii At 2 A Resolution
           >gi|11513903|pdb|1G2I|B Chain B, Crystal Structure Of A Novel Intracellular Protease From Pyrococcus Horikoshii At 2 A Resolution
           >gi|11513904|pdb|1G2I|C Chain C, Crystal Structure Of A Novel
           Intracellular Protease From Pyrococcus Horikoshii At 2 A Resolution
          Length = 166

 Score = 301 bits (762), Expect = 4e-81
 Identities = 142/165 (86%), Positives = 155/165 (93%)

Query: 2   RVLILSADQFEDVELIYPYHRLKEEGHEVLVASFKRGVITGKHGYTVNVDLAFEEVNPDE 61
           +VL L+A++FEDVELIYPYHRLKEEGHEV +ASF+RG ITGKHGY+V VDL F++VNP+E
Sbjct: 2   KVLFLTANEFEDVELIYPYHRLKEEGHEVYIASFERGTITGKHGYSVKVDLTFDKVNPEE 61

Query: 62  FDALVLPGGRAPERVRLNEKAVEIAKKMFSEGKPVASICHGPQILISAGVLRGRRGTSYP 121
           FDALVLPGGRAPERVRLNEKAV IA+K FSEGKPVASICHGPQILISAGVLRGR+GTSYP
Sbjct: 62  FDALVLPGGRAPERVRLNEKAVSIARKXFSEGKPVASICHGPQILISAGVLRGRKGTSYP 121

Query: 122 GIKDDMINAGVDWVDAEVVVDGNWVSSRVPGDLYAWMREFVKLLK 166
           GIKDD INAGV+WVDAEVVVDGNWVSSRVP DLYAW REFVKLLK
Sbjct: 122 GIKDDXINAGVEWVDAEVVVDGNWVSSRVPADLYAWXREFVKLLK 166

>pir||F72722 hypothetical protein APE0319 - Aeropyrum pernix (strain K1)
           >gi|5103958|dbj|BAA79274.1| (AP000059) 180aa long hypothetical proteinase I [Aeropyrum pernix]
          Length = 180

 Score = 221 bits (557), Expect = 4e-57
 Identities = 108/165 (65%), Positives = 130/165 (78%)

Query: 2   RVLILSADQFEDVELIYPYHRLKEEGHEVLVASFKRGVITGKHGYTVNVDLAFEEVNPDE 61
           R LI+SAD FEDVEL+YPY+RL E G E LVA+  RG I GK GY V   L+FEEV P+E
Sbjct: 3   RALIISADGFEDVELLYPYYRLVEAGFETLVAAPSRGEIKGKMGYKVEAKLSFEEVKPEE 62

Query: 62  FDALVLPGGRAPERVRLNEKAVEIAKKMFSEGKPVASICHGPQILISAGVLRGRRGTSYP 121
           FD LV+PGGRAPERVRL+E A+ I +  F + KPVA+ICHGPQ+LISAGV++GRR TSY
Sbjct: 63  FDVLVIPGGRAPERVRLHEAALNIVRHFFEKNKPVATICHGPQVLISAGVVKGRRLTSYW 122

Query: 122 GIKDDMINAGVDWVDAEVVVDGNWVSSRVPGDLYAWMREFVKLLK 166
           G+KDD+I AG +WVD  VVVDGN VSSR P D+  WMREF++LL+
Sbjct: 123 GVKDDVIAAGGNWVDEPVVVDGNLVSSRYPPDIPYWMREFMRLLE 167

>gi|11498880 intracellular protease (pfpI) [Archaeoglobus fulgidus]
           >gi|6136633|sp|O28987|YC81_ARCFU HYPOTHETICAL PROTEIN AF1281
           >gi|7449373|pir||H69409 intracellular proteinase I (EC 3.4.-.-) pfpI homolog - Archaeoglobus fulgidus
           >gi|2649300|gb|AAB89965.1| (AE001016) intracellular protease (pfpI) [Archaeoglobus fulgidus]
          Length = 168

 Score = 186 bits (468), Expect = 1e-46
 Identities
= 93/166 (56%), Positives = 123/166 (74%) Query: 1 MRVLILSADQFEDVELIYPYHRLKEEGHEVLVASFKRGVITGKHGYTVNVDLAFEEVNPD 60 MRVLIL+ ++FED+EL YP +RL+EEG EV VAS V GK GY V DL +E+V + Sbjct: 1 MRVLILAENEFEDLELFYPLYRLREEGLEVKVASSSLEVRVGKKGYQVRPDLTYEDVKVE 60 Query: 61 EFDALVLPGGRAPERVRLNEKAVEIAKKMFSEGKPVASICHGPQILISAGVLRGRRGTSY 120 ++ LV+PGG++PERVR+NE+AVEI K GKPVA+ICHGPQ+LISA ++GRR TS+ Sbjct: 61 DYAGLVIPGGKSPERVRINERAVEIVKDFLELGKPVAAICHGPQLLISAMAVKGRRMTSW 120 Query: 121 PGIKDDMINAGVDWVDAEVVVDGNWVSSRVPGDLYAWMREFVKLLK 166 GI+DD+I AG + D VVVDGN ++SR+P DL + E +K+LK Sbjct: 121 IGIRDDLIAAGALYEDRPVVVDGNVITSRMPDDLPYFCGELIKILK 166 >gb|AAK41330.1| Intracellular proteinase [Sulfolobus solfataricus] Length = 173 Score = 171 bits (430), Expect = 3e-42 Identities = 85/166 (51%), Positives = 118/166 (70%), Gaps = 1/166 (0%) Query: 2 RVLILSADQFEDVELIYPYHRLKEEGHEVLVA-SFKRGVITGKHGYTVNVDLAFEEVNPD 60 +VL L ++FED+EL+YP++R+ EEG + ++A + GKHGYTV D+AF++V P+ Sbjct: 5 KVLFLVGEEFEDIELLYPFYRVMEEGFKPVIAWKEANSKVIGKHGYTVISDIAFKDVRPE 64 Query: 61 EFDALVLPGGRAPERVRLNEKAVEIAKKMFSEGKPVASICHGPQILISAGVLRGRRGTSY 120 ++ ALV+PGGR PE +R E+ I +K F KPVA+ICHGPQILISA +++GR+ TS Sbjct: 65 DYIALVIPGGRGPEHIRTLEEVKNITRKFFELKKPVAAICHGPQILISANLVKGRKLTSV 124 Query: 121 PGIKDDMINAGVDWVDAEVVVDGNWVSSRVPGDLYAWMREFVKLLK 166 IKDD+I AG +VD +VVVD N +SSRVP DL A+ +K LK Sbjct: 125 NSIKDDVIAAGGIYVDNDVVVDENLISSRVPSDLPAFASTLIKALK 170 >emb|CAC11607.1| (AL445064) probable intracellular proteinase I [Thermoplasma acidophilum] Length = 186 Score = 153 bits (384), Expect = 8e-37 Identities = 81/180 (45%), Positives = 112/180 (62%), Gaps = 14/180 (7%) Query: 1 MRVLILSADQFEDVELIYPYHRLKEEGHEVLVAS------------FKRG--VITGKHGY 46 +RVLIL+ D E +E++YP R+KEEG EV VA+ F+ G T K GY Sbjct: 2 VRVLILTGDAGESLEVMYPLQRMKEEGFEVDVAAPEKKKIQLVVHDFEEGFDTYTEKPGY 61 Query: 47 TVNVDLAFEEVNPDEFDALVLPGGRAPERVRLNEKAVEIAKKMFSEGKPVASICHGPQIL 106 + DLAF++V+P ++DAL++PGGRAPE +R ++ + I K F + PVA +CH P L Sbjct: 62 KLQADLAFKDVDPSKYDALIIPGGRAPEYIRNDKDFIRIVKYFFEKHSPVAELCHAPLAL 121 Query: 107 
ISAGVLRGRRGTSYPGIKDDMINAGVDWVDAEVVVDGNWVSSRVPGDLYAWMREFVKLLK 166 +AGVL+GR +YP + D+ AG +VD V+DGN VS+R D WMR F+KLLK Sbjct: 122 AAAGVLKGRTTAAYPALAPDVAIAGGQFVDGAAVIDGNLVSARAWPDHPEWMRAFIKLLK 181 >pir||A70355 proteinase I - Aquifex aeolicus >gi|2983230|gb|AAC06827.1| (AE000698) protease I [Aquifex aeolicus] Length = 167 Score = 141 bits (351), Expect = 6e-33 Identities = 71/165 (43%), Positives = 101/165 (61%) Query: 2 RVLILSADQFEDVELIYPYHRLKEEGHEVLVASFKRGVITGKHGYTVNVDLAFEEVNPDE 61 +VLI + EDVE IYPY R KEEG EV+ A+ K G GK G T D ++V E Sbjct: 3 KVLIFLEELVEDVEFIYPYLRFKEEGFEVVSAAPKLGEYKGKKGMTFRPDKTIKDVYHQE 62 Query: 62 FDALVLPGGRAPERVRLNEKAVEIAKKMFSEGKPVASICHGPQILISAGVLRGRRGTSYP 121 FD + +PGG AP+R+R + + I KK + GK V ++CHGP +LISA V++G++ T + Sbjct: 63 FDCVFIPGGYAPDRLRRYPEVLHIVKKHYDSGKLVCAVCHGPWVLISAKVVKGKKVTGFF 122 Query: 122 GIKDDMINAGVDWVDAEVVVDGNWVSSRVPGDLYAWMREFVKLLK 166 IKDD+INAG ++ V VDGN +++ P + M+ + LK Sbjct: 123 AIKDDLINAGANYTGKPVEVDGNLITATDPKSMLEMMKVIISRLK 167 >pir||E65105 hypothetical 20.3 kD protein in sohA-mtr intergenic region - Escherichia coli (strain K-12) >gi|606093|gb|AAA57956.1| (U18997) ORF_o186 [Escherichia coli] >gi|1789543|gb|AAC76187.1| (AE000396) orf, hypothetical protein [Escherichia coli K12] Length = 186 Score = 134 bits (333), Expect = 8e-31 Identities = 73/167 (43%), Positives = 106/167 (62%), Gaps = 3/167 (1%) Query: 2 RVLILSADQFEDVELIYPYHRLKEEGHEVLVASFKRG-VITGKHGY-TVNVDLAFEEVNP 59 ++ +L D+FED E P ++ GHEV+ + G + GK G +V +D + +EV P Sbjct: 18 KIAVLITDEFEDSEFTSPADEFRKAGHEVITIEKQAGKTVKGKKGEASVTIDKSIDEVTP 77 Query: 60 DEFDALVLPGGRAPERVRLNEKAVEIAKKMFSEGKPVASICHGPQILISAGVLRGRRGTS 119 EFDAL+LPGG +P+ +R + + V + + GKPV +ICHGPQ+LISA V+RGR+ T+ Sbjct: 78 AEFDALLLPGGHSPDYLRGDNRFVTFTRDFVNSGKPVFAICHGPQLLISADVIRGRKLTA 137 Query: 120 YPGIKDDMINAGVDWVDAEVVVD-GNWVSSRVPGDLYAWMREFVKLL 165 I D+ NAG ++ D EVVVD V+SR P DL A+ RE ++LL Sbjct: 138 VKPIIIDVKNAGAEFYDQEVVVDKDQLVTSRTPDDLPAFNREALRLL 184 >sp|P45470|YHBO_ECOLI HYPOTHETICAL 18.9 KDA PROTEIN IN SOHA-MTR INTERGENIC REGION 
Length = 172 Score = 134 bits (333), Expect = 8e-31 Identities = 73/167 (43%), Positives = 106/167 (62%), Gaps = 3/167 (1%) Query: 2 RVLILSADQFEDVELIYPYHRLKEEGHEVLVASFKRG-VITGKHGY-TVNVDLAFEEVNP 59 ++ +L D+FED E P ++ GHEV+ + G + GK G +V +D + +EV P Sbjct: 4 KIAVLITDEFEDSEFTSPADEFRKAGHEVITIEKQAGKTVKGKKGEASVTIDKSIDEVTP 63 Query: 60 DEFDALVLPGGRAPERVRLNEKAVEIAKKMFSEGKPVASICHGPQILISAGVLRGRRGTS 119 EFDAL+LPGG +P+ +R + + V + + GKPV +ICHGPQ+LISA V+RGR+ T+ Sbjct: 64 AEFDALLLPGGHSPDYLRGDNRFVTFTRDFVNSGKPVFAICHGPQLLISADVIRGRKLTA 123 Query: 120 YPGIKDDMINAGVDWVDAEVVVD-GNWVSSRVPGDLYAWMREFVKLL 165 I D+ NAG ++ D EVVVD V+SR P DL A+ RE ++LL Sbjct: 124 VKPIIIDVKNAGAEFYDQEVVVDKDQLVTSRTPDDLPAFNREALRLL 170 >sp|P80876|GS18_BACSU GENERAL STRESS PROTEIN 18 (GSP18) >gi|7449376|pir||H69808 conserved hypothetical protein yfkM - Bacillus subtilis >gi|2626825|dbj|BAA23403.1| (D83967) YfkM [Bacillus subtilis] >gi|2633109|emb|CAB12614.1| (Z99108) similar to hypothetical proteins [Bacillus subtilis] Length = 172 Score = 132 bits (328), Expect = 3e-30 Identities = 71/168 (42%), Positives = 105/168 (62%), Gaps = 3/168 (1%) Query: 2 RVLILSADQFEDVELIYPYHRLKEEGHEVLVASFKRG-VITGKHGYT-VNVDLAFEEVNP 59 ++ ++ FED E P KE GHE+ V ++G + GK G V VD + ++VN Sbjct: 4 KIAVVLTYYFEDSEYTEPAKAFKEAGHELTVIEKEKGKTVKGKQGTAEVTVDASIDDVNS 63 Query: 60 DEFDALVLPGGRAPERVRLNEKAVEIAKKMFSEGKPVASICHGPQILISAGVLRGRRGTS 119 +FDAL++PGG +P+++R +++ V+ K ++ KPV +ICHGPQ+LI+A L GR+ T Sbjct: 64 SDFDALLIPGGFSPDQLRADDRFVQFTKAFMTDKKPVFAICHGPQLLINAKALDGRKATG 123 Query: 120 YPGIKDDMINAGVDWVDAEVVV-DGNWVSSRVPGDLYAWMREFVKLLK 166 Y I+ DM NAG D VD EVVV V+SR P D+ A+ RE + LL+ Sbjct: 124 YTSIRVDMENAGADVVDKEVVVCQDQLVTSRTPDDIPAFNRESLALLE 171 >dbj|BAB06744.1| (AP001517) general stress protein [Bacillus halodurans] Length = 171 Score = 130 bits (324), Expect = 9e-30 Identities = 69/167 (41%), Positives = 104/167 (61%), Gaps = 3/167 (1%) Query: 2 RVLILSADQFEDVELIYPYHRLKEEGHEVLVASFKRGV-ITGKHGY-TVNVDLAFEEVNP 59 ++ ++ + FED E P KE GH ++ ++G+ + GK G T+ +D + +EV 
Sbjct: 4 KIAVVVTNLFEDSEYTEPVKAFKEAGHSIVTIEKEKGIKVKGKQGESTITIDASIDEVTE 63 Query: 60 DEFDALVLPGGRAPERVRLNEKAVEIAKKMFSEGKPVASICHGPQILISAGVLRGRRGTS 119 D FDAL++PGG +P+ +R +++ V AK K + +ICHGPQ+LI+AGVL+GR T Sbjct: 64 DAFDALLIPGGFSPDILRADDRFVAFAKAFADAKKTIMAICHGPQLLINAGVLKGRDVTG 123 Query: 120 YPGIKDDMINAGVDWVDAEVVV-DGNWVSSRVPGDLYAWMREFVKLL 165 Y I D+ NAG ++ D EVVV GN V+SR P DL A+ RE + +L Sbjct: 124 YKSIAVDLRNAGANFYDQEVVVCGGNLVTSRTPDDLPAFNRESLNVL 170 >pir||E83601 proteinase PfpI PA0355 [imported] - Pseudomonas aeruginosa (strain PAO1) >gi|9946205|gb|AAG03744.1|AE004473_6 (AE004473) protease PfpI [Pseudomonas aeruginosa] Length = 179 Score = 130 bits (323), Expect = 1e-29 Identities = 75/168 (44%), Positives = 103/168 (60%), Gaps = 5/168 (2%) Query: 3 VLILSADQFEDVELIYPYHRLKEEGHEVLVASFKRGVITGKHGY----TVNVDLAFEEVN 58 V L D FE VEL P L++ G V + S K G + G + + VD FE+ + Sbjct: 10 VAALVTDGFEQVELTGPKKALEDAGATVRILSDKAGEVRGWNHHQPAEAFRVDGTFEDAS 69 Query: 59 PDEFDALVLPGGRA-PERVRLNEKAVEIAKKMFSEGKPVASICHGPQILISAGVLRGRRG 117 D++DAL+LPGG +++R KA E+A + KPVA ICHG +LISAG+++GR Sbjct: 70 LDDYDALLLPGGVINSDQIRSLAKAQELAIRAEQASKPVAVICHGAWLLISAGLVQGRTL 129 Query: 118 TSYPGIKDDMINAGVDWVDAEVVVDGNWVSSRVPGDLYAWMREFVKLL 165 TS+P +KDD+ NAG WVD EV VDG VSSR P D+ A+ R F+++L Sbjct: 130 TSWPSLKDDINNAGGHWVDQEVAVDGKLVSSRKPEDIPAFNRRFIEIL 177 >sp|O06006|YRAA_BACSU HYPOTHETICAL 16.8 KD PROTEIN IN ADHA-SACC INTERGENIC REGION >gi|7449374|pir||A69970 conserved hypothetical protein yraA - Bacillus subtilis >gi|2108267|emb|CAA63466.1| (X92868) yraA [Bacillus subtilis] >gi|2635148|emb|CAB14644.1| (Z99117) similar to hypothetical proteins [Bacillus subtilis] Length = 154 Score = 128 bits (319), Expect = 3e-29 Identities = 62/149 (41%), Positives = 93/149 (61%), Gaps = 1/149 (0%) Query: 2 RVLILSADQFEDVELIYPYHRLKEEGHEVLVASFKRGV-ITGKHGYTVNVDLAFEEVNPD 60 ++ +L DQFED+E P +E G+ V+ + G +TGKHG V +D A +V+ Sbjct: 4 KIAVLVTDQFEDIEYTSPVKAYEEAGYSVVAIDLEAGKEVTGKHGEKVKIDKAISDVDAS 63 Query: 61 
EFDALVLPGGRAPERVRLNEKAVEIAKKMFSEGKPVASICHGPQILISAGVLRGRRGTSY 120 +FDAL++PGG +P+ +R +++ E AK KPV +ICHGPQ+LI +L+G+ T Y Sbjct: 64 DFDALLIPGGFSPDLLRADDRPGEFAKAFVENKKPVFAICHGPQVLIDTDLLKGKDITGY 123 Query: 121 PGIKDDMINAGVDWVDAEVVVDGNWVSSR 149 I+ D+INAG ++ DAEVVV N V + Sbjct: 124 RSIRKDLINAGANYKDAEVVVSHNIVDKQ 152 >pir||T34745 probable proteinase pfpI - Streptomyces coelicolor >gi|3861447|emb|CAA22052.1| (AL033505) putative protease [Streptomyces coelicolor A3(2)] Length = 180 Score = 122 bits (304), Expect = 2e-27 Identities = 72/172 (41%), Positives = 101/172 (57%), Gaps = 10/172 (5%) Query: 1 MRVLILSADQ-FEDVELIYPYHRLKEEGHEVLVASFKRGVITG----KHGYTVNVDLAFE 55 MR+ L+A + E VEL P+ KE GH+ ++ S + G I G T VD Sbjct: 1 MRIAFLTAPEGVEQVELTEPWRAAKEAGHDPVLVSTQSGEIQGFDHLDKADTFPVDEVVG 60 Query: 56 EVNPDEFDALVLPGGRA-PERVRLNEKAVEIAKKMFSEGKPVASICHGPQILISAGVLRG 114 E++ D F LVLPGG A P+ +R++EKAV K F++G+PVA+ICH P L+ A V+RG Sbjct: 61 EISADAFGGLVLPGGVANPDFLRMDEKAVAFVKDFFTQGRPVAAICHAPWTLVEADVVRG 120 Query: 115 RRGTSYPGIKDDMINAGVDWVDAEVVV----DGNWVSSRVPGDLYAWMREFV 162 R TS+P ++ D+ NAG WVD +V V D V+SR P DL A+ ++ Sbjct: 121 RTMTSWPSLRTDLRNAGATWVDEQVQVCDAGDNVLVTSRKPDDLEAFCETYL 172 >sp|Q53719|YLY1_STAAU HYPOTHETICAL 18.6 KD PROTEIN IN LYTA 3'REGION (ORF1) >gi|310602|gb|AAA18514.1| (L19300) ORF1 [Staphylococcus aureus] Length = 171 Score = 120 bits (299), Expect = 7e-27 Identities = 67/168 (39%), Positives = 97/168 (56%), Gaps = 3/168 (1%) Query: 2 RVLILSADQFEDVELIYPYHRLKEEG-HEVLVASFKRGVITGKHGYTVNVDLAFEEVNPD 60 +V I+ A++FED+E P L+ G + V++ + GKHG V VD+ E P+ Sbjct: 4 KVAIILANEFEDIEYSSPKEALENAGFNTVVIGDTANSEVVGKHGEKVTVDVGIAEAKPE 63 Query: 61 EFDALVLPGGRAPERVRLNEKAV--EIAKKMFSEGKPVASICHGPQILISAGVLRGRRGT 118 ++DAL++PGG +P+ +R + + AK P +ICHGPQILI L+GR T Sbjct: 64 DYDALLIPGGFSPDHLRGDTEGRYGTFAKYFTKNDVPTFAICHGPQILIDTDDLKGRTLT 123 Query: 119 SYPGIKDDMINAGVDWVDAEVVVDGNWVSSRVPGDLYAWMREFVKLLK 166 + ++ D+ NAG VD VVVD N V+SRVP DL + RE VK L+ Sbjct: 124 AVLNVRKDLSNAGAHVVDESVVVDNNIVTSRVPDDLDDFNREIVKQLQ 171 >pir||F75423 
proteinase I - Deinococcus radiodurans (strain R1) >gi|6458942|gb|AAF10772.1|AE001969_1 (AE001969) protease I [Deinococcus radiodurans] Length = 190 Score = 112 bits (278), Expect = 2e-24 Identities = 61/168 (36%), Positives = 94/168 (55%), Gaps = 6/168 (3%) Query: 2 RVLILSADQFEDVELIYPYHRLKEEGHEVLVASFKRGVITGKHG-----YTVNVDLAFEE 56 ++ IL+AD E++EL P ++ G + S + G I G VD E Sbjct: 11 KIAILAADGVEEIELTSPRAAIEAAGGTTELISLEPGEIQSMKGDIEPQEKYRVDHVVSE 70 Query: 57 VNPDEFDALVLPGGRA-PERVRLNEKAVEIAKKMFSEGKPVASICHGPQILISAGVLRGR 115 V ++D L+LPGG P+++RL E A++ + M+ GKP+A+ICHGP L G+ +G Sbjct: 71 VQVSDYDGLLLPGGTVNPDKLRLEEGAMKFVRDMYDAGKPIAAICHGPWSLSETGIAQGL 130 Query: 116 RGTSYPGIKDDMINAGVDWVDAEVVVDGNWVSSRVPGDLYAWMREFVK 163 + TS+P +K ++ AG WVD E V D V+SR P DL A+ ++ V+ Sbjct: 131 KMTSWPSLKRELTLAGAQWVDEECVTDKGVVTSRKPDDLPAFNKKIVE 178 >gb|AAF26988.1|AC018363_33 (AC018363) unknown protein [Arabidopsis thaliana] Length = 388 Score = 106 bits (262), Expect = 2e-22 Identities = 61/166 (36%), Positives = 92/166 (54%), Gaps = 20/166 (12%) Query: 2 RVLILSADQFEDVELIYPYHRLKEEGHEV--LVASFKRG--------------VITGKHG 45 R+L L D ED E+ P+ L+ G +V + K G + K G Sbjct: 199 RILFLCGDYMEDYEVKVPFQSLQALGCQVDAVCPEKKAGDRCPTAIHDFEGDQTYSEKPG 258 Query: 46 YTVNVDLAFEEVNPDEFDALVLPGGRAPERVRLNEKAVEIAKKMFSEGKPVASICHGPQI 105 +T + F+++ +DALV+PGGRAPE + LNE + I K+ + KPVASICHG QI Sbjct: 259 HTFALTTNFDDLVSSSYDALVIPGGRAPEYLALNEHVLNIVKEFMNSEKPVASICHGQQI 318 Query: 106 LISAGVLRGRRGTSYPGIKDDMINAGVDWVDAEVV----VDGNWVS 147 L +AGVL+GR+ T+YP +K +++ G W++ + + DGN V+ Sbjct: 319 LAAAGVLKGRKCTAYPAVKLNVVLGGGTWLEPDPIDRCFTDGNLVT 364 Score = 96.7 bits (237), Expect = 1e-19 Identities = 62/183 (33%), Positives = 94/183 (50%), Gaps = 20/183 (10%) Query: 3 VLILSADQFEDVELIYPYHRLKEEGHEV--LVASFKRG--VITGKH------------GY 46 VLIL D ED E++ P+ L+ G V + K G T H G+ Sbjct: 7 VLILCGDYMEDYEVMVPFQALQAFGITVHTVCPGKKAGDSCPTAVHDFCGHQTYFESRGH 66 Query: 47 TVNVDLAFEEVNPDEFDALVLPGGRAPERVRLNEKAVEIAKKMFSEGKPVASICHGPQIL 106 ++ F+EV+ ++D LV+PGGRAPE + L VE+ K+ GKP+ASICHG IL 
Sbjct: 67 NFTLNATFDEVDLSKYDGLVIPGGRAPEYLALTASVVELVKEFSRSGKPIASICHGQLIL 126 Query: 107 ISAGVLRGRRGTSYPGIKDDMINAGVDWVDA----EVVVDGNWVSSRVPGDLYAWMREFV 162 +A + GR+ T+Y + ++ AG WV+ VVDG+ +++ +++ FV Sbjct: 127 AAADTVNGRKCTAYATVGPSLVAAGAKWVEPITPDVCVVDGSLITAATYEGHPEFIQLFV 186 Query: 163 KLL 165 K L Sbjct: 187 KAL 189 >pir||A83103 conserved hypothetical protein PA4336 [imported] - Pseudomonas aeruginosa (strain PAO1) >gi|9950562|gb|AAG07724.1|AE004850_2 (AE004850) conserved hypothetical protein [Pseudomonas aeruginosa] Length = 194 Score = 103 bits (255), Expect = 1e-21 Identities = 61/185 (32%), Positives = 103/185 (54%), Gaps = 23/185 (12%) Query: 2 RVLILSADQFEDVELIYPYHRLKEEGHEV-LVASFKRGVITGK----------------- 43 ++L+L D ED E + P+ L+ GH+V V KR + + Sbjct: 6 KILMLVGDYAEDYETMVPFQALQMVGHQVHAVCPDKRAGQSVRTAIHDFEGDQTYSEKPG 65 Query: 44 HGYTVNVDLAFEEVNPDEFDALVLPGGRAPERVRLNEKAVEIAKKMFSEGKPVASICHGP 103 H +T+N D F + +++DAL++PGGRAPE +RLNE+ + + + + KP+A++CHG Sbjct: 66 HNFTLNAD--FAQARAEDYDALLIPGGRAPEYLRLNEQVLALVRDFDAARKPIAAVCHGA 123 Query: 104 QILISAGVLRGRRGTSYPGIKDDMINAGVDWVDA---EVVVDGNWVSSRVPGDLYAWMRE 160 Q+L +AGVL+GR ++YP ++ AG ++VD + VDG+ V++ AW+ Sbjct: 124 QLLAAAGVLQGRACSAYPACAPEVRLAGGEYVDLPPDQAHVDGHLVTAPAWPAHPAWLAR 183 Query: 161 FVKLL 165 F++ L Sbjct: 184 FLEAL 188 >gb|AAD56430.1|AF158699_2 (AF158699) unknown [Burkholderia cepacia] Length = 197 Score = 102 bits (251), Expect = 3e-21 Identities = 59/183 (32%), Positives = 97/183 (52%), Gaps = 19/183 (10%) Query: 2 RVLILSADQFEDVELIYPYHRLKEEGHEV-LVASFKRG---------------VITGKHG 45 ++L L+ D ED E + P+ L+ GH V V KR T K G Sbjct: 9 KILFLTGDFAEDYETMVPFQALQAVGHHVDAVCPGKRAGDKVKTAIHDFEGDQTYTEKPG 68 Query: 46 YTVNVDLAFEEVNPDEFDALVLPGGRAPERVRLNEKAVEIAKKMFSEGKPVASICHGPQI 105 + ++ F++V+ +DAL + GGRAPE +RL+ + + + GKP+A+ICH Q+ Sbjct: 69 HQFTLNATFDDVDASRYDALAIAGGRAPEYLRLDPNVIALVRAFAEAGKPIAAICHAAQL 128 Query: 106 LISAGVLRGRRGTSYPGIKDDMINAGVDWVDAEV---VVDGNWVSSRVPGDLYAWMREFV 162 L +A V+RG+R ++YP ++ AG ++ D V V D +V++ V + AW+ +F+ Sbjct: 129 
LAAADVIRGKRISAYPACAPEVKLAGGEYADIPVDAAVTDAPFVNAPVWAENPAWVSQFL 188 Query: 163 KLL 165 LL Sbjct: 189 ALL 191 >pir||D83125 probable proteinase PA4171 [imported] - Pseudomonas aeruginosa (strain PAO1) >gi|9950379|gb|AAG07558.1|AE004833_9 (AE004833) probable protease [Pseudomonas aeruginosa] Length = 187 Score = 100 bits (246), Expect = 1e-20 Identities = 61/166 (36%), Positives = 94/166 (55%), Gaps = 11/166 (6%) Query: 3 VLILSADQ-FEDVELIYPYHRLKEEGHEVLVASFKRG-----VITGKHGYTVNVDLAFEE 56 VL+++A+ E EL+ P LK++G V A+ + G V + VN D + Sbjct: 10 VLVITANTGIERDELLEPLKTLKQQGATVTHATIEGGEAQTWVHDSEKDLIVNSDARLQG 69 Query: 57 VNPDEFDALVLPGGRA-PERVRLNEKAVEIAKKMFSEGKPVASICHGPQILISAGVLRGR 115 + +++D L++PGG + +R + +A + ++ GK VA+ICHGP +LI AGV RG+ Sbjct: 70 LRAEDYDLLLVPGGTVNADTLRQDSEARRLVREFSEAGKTVAAICHGPWLLIDAGVARGK 129 Query: 116 RGTSYPGIKDDMINAGVDWVDAEVVV--DGNW--VSSRVPGDLYAW 157 TSY ++ D+ NAG DWVD V V W ++SR PGDL A+ Sbjct: 130 TLTSYSSVRIDLTNAGADWVDTRVKVCPANGWTLITSRNPGDLQAF 175 >sp|Q58377|Y967_METJA HYPOTHETICAL PROTEIN MJ0967 >gi|2128604|pir||G64420 hypothetical protein MJ0967 - Methanococcus jannaschii >gi|1499805|gb|AAB98972.1| (U67540) intracellular protease (pfpI) [Methanococcus jannaschii] Length = 205 Score = 92.1 bits (225), Expect = 4e-18 Identities = 49/152 (32%), Positives = 88/152 (57%), Gaps = 3/152 (1%) Query: 3 VLILSADQFEDVELIYPYHRLKEEGHEVLVASFKRGVITGKHGYTVNVDLAFEEVNPDEF 62 +++++ F D EL P + G +V V S +G G G + V+ +VNPD++ Sbjct: 35 LMVIAPKDFRDEELFEPMAVFESNGLKVDVVSTTKGECVGMLGNKITVEKTIYDVNPDDY 94 Query: 63 DALVLPGG-RAPERVRLNEKAVEIAKKMFSEGKPVASICHGPQILISAGVLRGRRGTSYP 121 A+V+ GG + E + N K +E+ K+ +++ K V++IC P +L AG+L+G++ T YP Sbjct: 95 VAIVIVGGIGSKEYLWNNTKLIELVKEFYNKNKVVSAICLSPVVLARAGILKGKKATVYP 154 Query: 122 GIK--DDMINAGVDWVDAEVVVDGNWVSSRVP 151 + +++ AG + D VVVDGN ++++ P Sbjct: 155 APEAIEELKKAGAIYEDRGVVVDGNVITAKSP 186 >dbj|BAB16486.1| (AP002861) hypothetical protein [Oryza sativa] Length = 517 Score = 80.0 bits (194), Expect = 2e-14 Identities = 50/155 (32%), Positives = 84/155 
(53%), Gaps = 6/155 (3%) Query: 2 RVLILSADQFEDVELIYPYHRLKEEGHEVLVASF--KRGVITGKHGYTVNVDLAFEEVNP 59 +VL+ A+ E++E + L+ G V VAS K V+T +H + + D+ EE Sbjct: 336 QVLVPVANGSEEMEALNLIDILRRAGANVTVASVEDKLQVVTRRHKFNLIADIMVEEAAK 395 Query: 60 DEFDALVLPGG-RAPERVRLNEKAVEIAKKMFSEGKPVASICHGP-QILISAGVLRGRRG 117 EFD +V+PGG +++ + V++ KK KP +IC P +L G+L+G++ Sbjct: 396 REFDLIVMPGGLPGAQKLSSTKVLVDLLKKQAESNKPYGAICASPAYVLEPHGLLKGKKA 455 Query: 118 TSYPGIKDDMINAGVDWVDAEVVVDGNWVSSRVPG 152 TS+P + + + D+ VVVDGN ++S+ PG Sbjct: 456 TSFPPMAHLLTDQSA--CDSRVVVDGNLITSKAPG 488 >pir||T47619 hypothetical protein T14E10.170 - Arabidopsis thaliana >gi|7258363|emb|CAB77580.1| (AL138656) putative protein [Arabidopsis thaliana] Length = 399 Score = 76.9 bits (186), Expect = 1e-13 Identities = 55/174 (31%), Positives = 92/174 (52%), Gaps = 18/174 (10%) Query: 3 VLILSADQFEDVELIYPYHRLKEEGHEV--LVASFKRG---------------VITGKHG 45 +L L D ED + P+ + G +V + + KRG + T K G Sbjct: 213 LLFLIGDCVEDYSINVPFKAFQALGCKVDAVTPTKKRGEKCATIVHDLEDGRQLPTEKFG 272 Query: 46 YTVNVDLAFEEVNPDEFDALVLPGGRAPERVRLNEKAVEIAKKMFSEGKPVASICHGPQI 105 + V +A+++V+ D++D +V+PGGR+PE + +N KAVE+ +K +GK VA+I G + Sbjct: 273 HNFYVTVAWDDVSVDDYDCIVVPGGRSPELLVMNPKAVELVRKFVEKGKFVAAIGMGNWL 332 Query: 106 LISAGVLRGRRGTSYPGIKDDMINAGVDWVDAE-VVVDGNWVSSRVPGDLYAWM 158 L + G L+ +R S G K + AG + V++E V D V++ DL A++ Sbjct: 333 LAATGALKKKRCASSYGTKVAVKVAGGEIVESERCVTDDKLVTAASTSDLPAFL 386 >gb|AAF57086.1| (AE003775) CG1349 gene product [Drosophila melanogaster] Length = 187 Score = 74.5 bits (180), Expect = 7e-13 Identities = 49/154 (31%), Positives = 73/154 (46%), Gaps = 2/154 (1%) Query: 4 LILSADQFEDVELIYPYHRLKEEGHEVLVASFKRG-VITGKHGYTVNVDLAFEEVNPDEF 62 L++ A E++E I L+ G +V VA G + + D + +V D+F Sbjct: 6 LVILAPGAEEMEFIIAADVLRRAGIKVTVAGLNGGEAVKCSRDVQILPDTSLAQVASDKF 65 Query: 63 DALVLPGGRAPERVRLNEKAV-EIAKKMFSEGKPVASICHGPQILISAGVLRGRRGTSYP 121 D +VLPGG V ++ + S G +A+IC P +L GV G+ TSYP Sbjct: 66 DVVVLPGGLGGSNAMGESSLVGDLLRSQESGGGLIAAICAAPTVLAKHGVASGKSLTSYP 125 Query: 122 
GIKDDMINAGVDWVDAEVVVDGNWVSSRVPGDLY 155 +K ++N D VV DGN ++SR PG Y Sbjct: 126 SMKPQLVNNYSYVDDKTVVKDGNLITSRGPGTAY 159 >gb|AAC79625.1| (AC005770) unknown protein [Arabidopsis thaliana] Length = 398 Score = 73.7 bits (178), Expect = 1e-12 Identities = 53/174 (30%), Positives = 89/174 (50%), Gaps = 18/174 (10%) Query: 3 VLILSADQFEDVELIYPYHRLKEEGHEV--LVASFKRGVITG---------------KHG 45 VL L D ED + P+ L+ G +V + + K+G + K G Sbjct: 212 VLFLIGDYVEDYGINVPFRALQALGCKVDAVTPNKKKGEVCATAVYDLEDGRQIPAEKRG 271 Query: 46 YTVNVDLAFEEVNPDEFDALVLPGGRAPERVRLNEKAVEIAKKMFSEGKPVASICHGPQI 105 + V +++++ D++D +V+PGGR+PE + +NEKAV + K + K A+I G + Sbjct: 272 HNFFVTASWDDICVDDYDCVVVPGGRSPELLVMNEKAVALVKSFAEKDKVFAAIGQGKLL 331 Query: 106 LISAGVLRGRRGTSYPGIKDDMINAGVDWV-DAEVVVDGNWVSSRVPGDLYAWM 158 L + GVL+G+R S G+K + AG + V + V DG V++ DL A++ Sbjct: 332 LAATGVLKGKRCASGKGMKVMVKVAGGEAVMEKGCVTDGKVVTAASATDLPAFL 385 >dbj|BAA97062.1| (AP000370) emb|CAA17570.1~gene_id:K15M2.13~similar to unknown protein [Arabidopsis thaliana] Length = 369 Score = 71.8 bits (173), Expect = 4e-12 Identities = 44/154 (28%), Positives = 79/154 (50%), Gaps = 5/154 (3%) Query: 2 RVLILSADQFEDVELIYPYHRLKEEGHEVLVASFKRGV-ITGKHGYTVNVDLAFEEVNPD 60 ++L+ A++ E++E I L+ V++A+ + + G + ++ +EV Sbjct: 190 QILVPIAEESEEIEAIALVDILRRAKANVVIAAVGNSLEVEGSRKAKLVAEVLLDEVAEK 249 Query: 61 EFDALVLPGG-RAPERVRLNEKAVEIAKKMFSEGKPVASICHGPQILISA-GVLRGRRGT 118 FD +VLPGG +R EK V + +K KP IC P + G+L+G++ T Sbjct: 250 SFDLIVLPGGLNGAQRFASCEKLVNMLRKQAEANKPYGGICASPAYVFEPNGLLKGKKAT 309 Query: 119 SYPGIKDDMINAGVDWVDAEVVVDGNWVSSRVPG 152 ++P + D + + ++ VVVDGN ++SR PG Sbjct: 310 THPVVSDKLSDKS--HIEHRVVVDGNVITSRAPG 341 Score = 67.5 bits (162), Expect = 9e-11 Identities = 43/133 (32%), Positives = 68/133 (50%), Gaps = 3/133 (2%) Query: 23 LKEEGHEVLVASFKRGV-ITGKHGYTVNVDLAFEEVNPDEFDALVLPGGRAPERVRLNEK 81 L+ G +V VAS + V + HG + D ++ FD +VLPGG N K Sbjct: 5 LRRGGADVTVASVETQVGVDACHGIKMVADTLLSDITDSVFDLIVLPGGLPGGETLKNCK 64 Query: 82 AVE-IAKKMFSEGKPVASICHGPQILISA-GVLRGRRGTSYPGIKDDMINAGVDWVDAEV 
139 ++E + KK S+G+ A+IC P + + G+L G++ T YP + + V++ V Sbjct: 65 SLENMVKKQDSDGRLNAAICCAPALALGTWGLLEGKKATGYPVFMEKLAATCATAVESRV 124 Query: 140 VVDGNWVSSRVPG 152 +DG V+SR PG Sbjct: 125 QIDGRIVTSRGPG 137 >pir||D70177 4-methyl-5(b-hydroxyethyl)-thiazole monophosphate biosynthesis protein (thiJ) homolog - Lyme disease spirochete >gi|2688544|gb|AAC66975.1| (AE001163) 4-methyl-5(b-hydroxyethyl)-thiazole monophosphate biosynthesis protein (thiJ) [Borrelia burgdorferi] Length = 184 Score = 71.0 bits (171), Expect = 8e-12 Identities = 47/169 (27%), Positives = 90/169 (52%), Gaps = 5/169 (2%) Query: 1 MRVLILSADQFEDVELIYPYHRLKEEGHEV-LVASFKRGVITGKHGYTVNVDLAFEEVNP 59 M V I+ A+ FED+E I P L+ + ++++ V+ G + D Sbjct: 1 MVVGIILANGFEDIEAIIPIDILRRGNVNIQIISTNDSNVVISSKGVSFLADDIISNCKE 60 Query: 60 DEFDALVLPGGRAPERVRLNEKAVE-IAKKMFSEGKPVASICHGPQILISA-GVLRGRRG 117 + FD ++LPGG N K ++ I K M S+GK +A+IC P ++++A G+L + Sbjct: 61 NCFDLIILPGGMPGATNLFNSKELDLILKDMNSKGKFIAAICASPVVVLAAKGLLGFNKF 120 Query: 118 TSYPGIKDDMINAGVDWVDAEVVVDGNWVSSRVPGDLYAWMREFVKLLK 166 T YPG++ ++++ ++VD VV N+++S+ G + + ++++K Sbjct: 121 TCYPGLEKNVLDG--EFVDENVVRSNNFITSKGVGTSFEFAFTLLEMVK 167 >pir||T03871 hypothetical protein C49G7.11 - Caenorhabditis elegans >gi|2291156|gb|AAB65292.1| (AF016418) C49G7.11 gene product [Caenorhabditis elegans] Length = 186 Score = 70.2 bits (169), Expect = 1e-11 Identities = 41/164 (25%), Positives = 80/164 (48%), Gaps = 1/164 (0%) Query: 3 VLILSADQFEDVELIYPYHRLKEEGHEVLVASFKRGVITGKHGYTVNVDLAFEEVNPDEF 62 +++L + E++E+I L G +VL A + G + D+A ++V F Sbjct: 8 LILLPPEDAEEIEVIVTGDVLVRGGLQVLYAGSSTEPVKCAKGARIVPDVALKDVKNKTF 67 Query: 63 DALVLPGGRAPERVRLNEKAVEIAKKMFSEGKPVASICHGPQILISAGVLRGRRGTSYPG 122 D +++PGG ++ E+ K G + +IC GP +L++ G++ R T + Sbjct: 68 DIIIIPGGPGCSKLAECPVIGELLKTQVKSGGLIGAICAGPTVLLAHGIV-AERVTCHYT 126 Query: 123 IKDDMINAGVDWVDAEVVVDGNWVSSRVPGDLYAWMREFVKLLK 166 +KD M G ++D VV+ ++S+ PG + + + V+ L+ Sbjct: 127 VKDKMTEGGYKYLDDNVVISDRVITSKGPGTAFEFALKIVETLE 170 >gb|AAF69547.1|AC008007_22 (AC008007) F12M16.18 
           [Arabidopsis thaliana]
          Length = 438

 Score = 69.9 bits (168), Expect = 2e-11
 Identities = 43/154 (27%), Positives = 78/154 (49%), Gaps = 5/154 (3%)

Query: 2   RVLILSADQFEDVELIYPYHRLKEEGHEVLVASFKRGV-ITGKHGYTVNVDLAFEEVNPD 60
           ++L+  AD  E++E +     LK     V+VA+    + +       +  D+  +E   +
Sbjct: 259 QILVPIADGSEEMEAVAIIDVLKRAKANVVVAALGNSLEVVASRKVKLVADVLLDEAEKN 318

Query: 61  EFDALVLPGGRA-PERVRLNEKAVEIAKKMFSEGKPVASICHGPQILISA-GVLRGRRGT 118
            +D +VLPGG    E    +EK V + KK     KP  +IC  P ++    G+L+G++ T
Sbjct: 319 SYDLIVLPGGLGGAEAFASSEKLVNMLKKQAESNKPYGAICASPALVFEPHGLLKGKKAT 378

Query: 119 SYPGIKDDMINAGVDWVDAEVVVDGNWVSSRVPG 152
           ++P +   + +     ++  V+VDGN ++SR PG
Sbjct: 379 AFPAMCSKLTDQ--SHIEHRVLVDGNLITSRGPG 410

  Database: ./suso.pep
    Posted date: Jul 6, 2001 5:57 PM
  Number of letters in database: 840,471
  Number of sequences in database: 2977

  Database: /banques/blast2/nr.pep
    Posted date: Dec 14, 2000 12:46 PM
  Number of letters in database: 188,266,275
  Number of sequences in database: 595,510

Lambda     K      H
   0.320    0.140    0.419

Gapped
Lambda     K      H
   0.270   0.0470    0.230

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 65229477
Number of Sequences: 2977
Number of extensions: 2722644
Number of successful extensions: 6511
Number of sequences better than 1.0e-10: 31
Number of HSP's better than 0.0 without gapping: 23
Number of HSP's successfully gapped in prelim test: 8
Number of HSP's that attempted gapping in prelim test: 6449
Number of HSP's gapped (non-prelim): 36
length of query: 166
length of database: 189,106,746
effective HSP length: 51
effective length of query: 115
effective length of database: 158,583,909
effective search space: 18237149535
effective search space used: 18237149535
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.8 bits)
X3: 64 (24.9 bits)
S1: 41 (21.8 bits)
S2: 162 (67.5 bits)