BLASTP 2.0.10 [Aug-26-1999]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= PAB1662 (PAB1662) DE:CAROTENOID BIOSYNTHETIC GENE ERWCRTS related
         (370 letters)

Database: ./suso.pep; /banques/blast2/nr.pep
           598,487 sequences; 189,106,746 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

pir||D75084 carotenoid biosynthetic gene erwcrts related PAB1662...   739  0.0
pir||D71063 hypothetical protein PH1202 - Pyrococcus horikoshii ...   668  0.0
pir||H69162 conserved hypothetical protein MTH48 - Methanobacter...   333  1e-90
sp|Q58272|Y862_METJA HYPOTHETICAL PROTEIN MJ0862 >gi|2128035|pir...   276  3e-73
gi|10803607 ORF H0660 [Halobacterium sp. NRC-1] >gi|10803696|ref...   272  5e-72
gb|AAG20768.1| (AE005145) carotenoid biosynthetic protein; Crt_1...   272  5e-72
gi|11499868 carotenoid biosynthetic gene ERWCRTS, putative [Arch...   266  4e-70
gb|AAK40424.1| FMN-dependent dehydrogenase, conserved hypothetic...   256  3e-67
sp|P95997|YC08_SULSO HYPOTHETICAL PROTEIN C05008 >gi|7444263|pir...   256  3e-67
sp|P74287|YF56_SYNY3 HYPOTHETICAL 37.5 KD PROTEIN sll1556 >gi|74...   255  7e-67
pir||C72560 hypothetical protein APE1765 - Aeropyrum pernix (str...   249  3e-65
pir||C71704 hypothetical protein RP452 - Rickettsia prowazekii >...   243  2e-63
emb|CAC11250.1| (AL445063) conserved hypothetical protein [Therm...   240  3e-62
sp|Q01335|YCR6_ERWHE HYPOTHETICAL 37.2 KD PROTEIN IN CRTE-CRTX I...   230  2e-59
gb|AAF77222.1|AC009601_16 (AC009601) L165.10 [Leishmania major] ...   215  6e-55
pir||G75437 conserved hypothetical protein - Deinococcus radiodu...   206  2e-52
pir||C70185 carotenoid biosynthesis protein homolog - Lyme disea...   194  1e-48
dbj|BAB07793.1| (AB037666) hypothetical protein [Streptomyces sp...   193  2e-48
dbj|BAB07820.1| (AB037907) hypothetical protein [Streptomyces gr...   183  2e-45
sp|P50740|YPGA_BACSU HYPOTHETICAL 22.6 KD PROTEIN IN CMK-GPSA IN...   159  5e-38

>pir||D75084 carotenoid biosynthetic gene erwcrts related PAB1662 - Pyrococcus abyssi
           (strain Orsay) >gi|5458489|emb|CAB49977.1| (AJ248286) CAROTENOID
           BIOSYNTHETIC GENE ERWCRTS related [Pyrococcus abyssi]
          Length = 370

 Score = 739 bits (1886), Expect = 0.0
 Identities = 370/370 (100%), Positives = 370/370 (100%)

Query: 1   MEEQTILRKFEHIKHCLTKNVEAHVTNGFEDVHLIHKSLPEIDKDEIDLSVKFLGRKFDY 60
           MEEQTILRKFEHIKHCLTKNVEAHVTNGFEDVHLIHKSLPEIDKDEIDLSVKFLGRKFDY
Sbjct: 1   MEEQTILRKFEHIKHCLTKNVEAHVTNGFEDVHLIHKSLPEIDKDEIDLSVKFLGRKFDY 60

Query: 61  PIMITGMTGGTRKGEIAWRINRTLAQAAQELNIPLGLGSQRAMIEKPETWESYYVRDVAP 120
           PIMITGMTGGTRKGEIAWRINRTLAQAAQELNIPLGLGSQRAMIEKPETWESYYVRDVAP
Sbjct: 61  PIMITGMTGGTRKGEIAWRINRTLAQAAQELNIPLGLGSQRAMIEKPETWESYYVRDVAP 120

Query: 121 DVFLVGNLGAPQFGRNAKKRYSVDEVLYAIEKIEADAIAIHMNPLQESIQPEGDTTFSGV 180
           DVFLVGNLGAPQFGRNAKKRYSVDEVLYAIEKIEADAIAIHMNPLQESIQPEGDTTFSGV
Sbjct: 121 DVFLVGNLGAPQFGRNAKKRYSVDEVLYAIEKIEADAIAIHMNPLQESIQPEGDTTFSGV 180

Query: 181 LEALAEITSTIDYPVIAKETGAGVSKEVAVELEAVGVDAIDISGLGGTSWSAVEYYRTKD 240
           LEALAEITSTIDYPVIAKETGAGVSKEVAVELEAVGVDAIDISGLGGTSWSAVEYYRTKD
Sbjct: 181 LEALAEITSTIDYPVIAKETGAGVSKEVAVELEAVGVDAIDISGLGGTSWSAVEYYRTKD 240

Query: 241 GEKRNLALKFWDWGIKTAISLAEVRWATNLPIIASGGMRDGITMAKALAMGASMVGIALP 300
           GEKRNLALKFWDWGIKTAISLAEVRWATNLPIIASGGMRDGITMAKALAMGASMVGIALP
Sbjct: 241 GEKRNLALKFWDWGIKTAISLAEVRWATNLPIIASGGMRDGITMAKALAMGASMVGIALP 300

Query: 301 VLRPAAKGDVEGVIRIIKGYAEEIRNVMFLVGARNIKELRKVPLVITGFVREWLLQRIDL 360
           VLRPAAKGDVEGVIRIIKGYAEEIRNVMFLVGARNIKELRKVPLVITGFVREWLLQRIDL
Sbjct: 301 VLRPAAKGDVEGVIRIIKGYAEEIRNVMFLVGARNIKELRKVPLVITGFVREWLLQRIDL 360

Query: 361 NSYLRARFKM 370
           NSYLRARFKM
Sbjct: 361 NSYLRARFKM 370

>pir||D71063 hypothetical protein PH1202 - Pyrococcus horikoshii
           >gi|3257619|dbj|BAA30302.1| (AP000005) 371aa long hypothetical protein
           [Pyrococcus horikoshii]
          Length = 371

 Score = 668
bits (1705), Expect = 0.0 Identities = 323/368 (87%), Positives = 355/368 (95%) Query: 2 EEQTILRKFEHIKHCLTKNVEAHVTNGFEDVHLIHKSLPEIDKDEIDLSVKFLGRKFDYP 61 EE TILRKFEHI+HCL +NVEAHV+NGFEDV+ +HKSLPEIDKDEIDL+V+FLGRKFDYP Sbjct: 3 EELTILRKFEHIEHCLKRNVEAHVSNGFEDVYFVHKSLPEIDKDEIDLTVEFLGRKFDYP 62 Query: 62 IMITGMTGGTRKGEIAWRINRTLAQAAQELNIPLGLGSQRAMIEKPETWESYYVRDVAPD 121 IMITGMTGGTR+ EIA +INRTLA AA+ELNIP G+GSQRAMIEKPETWESYYVRDVAPD Sbjct: 63 IMITGMTGGTRREEIAGKINRTLAMAAEELNIPFGVGSQRAMIEKPETWESYYVRDVAPD 122 Query: 122 VFLVGNLGAPQFGRNAKKRYSVDEVLYAIEKIEADAIAIHMNPLQESIQPEGDTTFSGVL 181 +FL+GNLGAPQFG+NAKKRYSV EVLYAIEKIEADAIAIHMNPLQES+QPEGDTT++GVL Sbjct: 123 IFLIGNLGAPQFGKNAKKRYSVKEVLYAIEKIEADAIAIHMNPLQESVQPEGDTTYAGVL 182 Query: 182 EALAEITSTIDYPVIAKETGAGVSKEVAVELEAVGVDAIDISGLGGTSWSAVEYYRTKDG 241 EALAEI S+I+YPVIAKETGAGVSKEVA+ELE+VG+DAIDISGLGGTSWSAVEYYR KD Sbjct: 183 EALAEIKSSINYPVIAKETGAGVSKEVAIELESVGIDAIDISGLGGTSWSAVEYYRAKDS 242 Query: 242 EKRNLALKFWDWGIKTAISLAEVRWATNLPIIASGGMRDGITMAKALAMGASMVGIALPV 301 EKR +ALKFWDWGIKTAISLAEVRWATNLPIIASGGMRDG+ MAKALAMGAS+VGIALPV Sbjct: 243 EKRKIALKFWDWGIKTAISLAEVRWATNLPIIASGGMRDGVMMAKALAMGASLVGIALPV 302 Query: 302 LRPAAKGDVEGVIRIIKGYAEEIRNVMFLVGARNIKELRKVPLVITGFVREWLLQRIDLN 361 LRPAA+GDVEGV+RII+GYAEEI+NVMFLVGARNI+ELR+VPLVITGFVREWLLQRIDLN Sbjct: 303 LRPAARGDVEGVVRIIRGYAEEIKNVMFLVGARNIRELRRVPLVITGFVREWLLQRIDLN 362 Query: 362 SYLRARFK 369 SYLR+RFK Sbjct: 363 SYLRSRFK 370 >pir||H69162 conserved hypothetical protein MTH48 - Methanobacterium thermoautotrophicum (strain Delta H) >gi|2621084|gb|AAB84555.1| (AE000797) conserved protein [Methanobacterium thermoautotrophicum] Length = 349 Score = 333 bits (846), Expect = 1e-90 Identities = 172/351 (49%), Positives = 240/351 (68%), Gaps = 18/351 (5%) Query: 8 RKFEHIKHCLTKNVEAHVTNGFEDVHLIHKSLPEIDKDEIDLSVKFLGRKFDYPIMITGM 67 RK EH+ C + +VE GFED+ ++H+++PEI+K++ID+S+ FLGR+ P+MI+ + Sbjct: 5 RKLEHLILCASCDVEYRKKTGFEDIEIVHRAIPEINKEKIDISLDFLGRELSSPVMISAI 64 Query: 68 
TGGTRKGEIAWRINRTLAQAAQELNIPLGLGSQRAMIEKPETWESYYV-RDVAPDVFLVG 126 TGG + +INR LA+AA++L I LGLGSQRA +E PE +Y + R+ AP L+G Sbjct: 65 TGGH---PASMKINRELARAAEKLGIALGLGSQRAGVEHPELEGTYTIAREEAPSAMLIG 121 Query: 127 NLGAPQFGRNAKKRYSVDEVLYAIEKIEADAIAIHMNPLQESIQPEGDTTFSGVLEALAE 186 N+G+ ++ A+E I+ADA+A+H+NPLQESIQP GD SG LE+++ Sbjct: 122 NIGSSH----------IEYAERAVEMIDADALAVHLNPLQESIQPGGDVDSSGALESISA 171 Query: 187 ITSTIDYPVIAKETGAGVSKEVAVELEAVGVDAIDISGLGGTSWSAVEYYRTKDGEKRNL 246 I ++D PV+ KETGAG+ E A+ELE+ GV AID++G GGTSW+AVE YR D R L Sbjct: 172 IVESVDVPVMVKETGAGICSEDAIELESCGVSAIDVAGAGGTSWAAVETYRADD---RYL 228 Query: 247 ALKFWDWGIKTAISLAEVRWATNLPIIASGGMRDGITMAKALAMGASMVGIALPVLRPAA 306 FWDWGI TA S EV + ++P+IASGG+R GI AKA+++GA MVGIALPVL A Sbjct: 229 GELFWDWGIPTAASTVEVVESVSIPVIASGGIRSGIDAAKAISLGAEMVGIALPVLEAAG 288 Query: 307 KGDVEGVIRIIKGYAEEIRNVMFLVGARNIKELRKVPLVITGFVREWLLQR 357 G E VI++I+G+ E +R M+L GA + +L+K P++ITG EWL QR Sbjct: 289 HGYRE-VIKVIEGFNEALRTAMYLAGAETLDDLKKSPVIITGHTGEWLNQR 338 >sp|Q58272|Y862_METJA HYPOTHETICAL PROTEIN MJ0862 >gi|2128035|pir||F64407 carotenoid biosynthesis protein homolog - Methanococcus jannaschii >gi|1591547|gb|AAB98867.1| (U67530) carotenoid biosynthetic gene ERWCRTS isolog [Methanococcus jannaschii] Length = 359 Score = 276 bits (698), Expect = 3e-73 Identities = 156/358 (43%), Positives = 227/358 (62%), Gaps = 15/358 (4%) Query: 7 LRKFEHIKHCLTKNVEAHVTNGFEDVHLIHKSLPEIDKDEIDLSVKFLGRKFDYPIMITG 66 +RK EHI C NVE T ED+ LIHK I+ ++I+ ++ G+K PI+++G Sbjct: 10 VRKLEHIFLCSYCNVEYEKTTLLEDIELIHKGTCGINFNDIETEIELFGKKLSAPIIVSG 69 Query: 67 MTGGTRKGEIAWRINRTLAQAAQELNIPLGLGSQRAMIEKPETWESY-YVRDVAPDVFLV 125 MTGG K + IN+ +A+A +EL + +G+GSQRA I E ++Y VRD ++ ++ Sbjct: 70 MTGGHSKAK---EINKNIAKAVEELGLGMGVGSQRAAIVNDELIDTYSIVRDYTNNL-VI 125 Query: 126 GNLGAPQFGRNAKKRYSVDEVLYAIEKIEADAIAIHMNPLQESIQPEGDTTFSGVLEALA 185 GNLGA F + +D+ AIE I+ADAIAIH NPLQE IQPEGD F + + L Sbjct: 126 GNLGAVNFIVDDWDEEIIDK---AIEMIDADAIAIHFNPLQEIIQPEGDLNFKNLYK-LK 181 Query: 186 
EITSTI-----DYPVIAKETGAGVSKEVAVELEAVGVDAIDISGLGGTSWSAVEYYRTKD 240 EI S + P IAK+ G G SKE A+ L+ +G DAID+ G GGTSW+ VE YR K+ Sbjct: 182 EIISNYKKSYKNIPFIAKQVGEGFSKEDALILKDIGFDAIDVQGSGGTSWAKVEIYRVKE 241 Query: 241 GEKRNLALKFWDWGIKTAISLAEVRWATNLPIIASGGMRDGITMAKALAMGASMVGIALP 300 E + LA KF +WGI TA S+ EV+ + +I SGG+R G+ +AK +A+G +ALP Sbjct: 242 EEIKRLAEKFANWGIPTAASIFEVKSVYDGIVIGSGGIRGGLDIAKCIAIGCDCCSVALP 301 Query: 301 VLRPAAKGDVEGVIRIIKGYAEEIRNVMFLVGARNIKELRKVPLVITGFVREWLLQRI 358 +L+ + KG E V+++++ Y +E++ MFLVGA NI+EL+K ++ G ++EW+ QR+ Sbjct: 302 ILKASLKG-WEEVVKVLESYIKELKIAMFLVGAENIEELKKTSYIVKGTLKEWISQRL 358 >gi|10803607 ORF H0660 [Halobacterium sp. NRC-1] >gi|10803696|ref|NP_046094.1| ORF H1696 [Halobacterium sp. NRC-1] >gi|7444262|pir||T08277 carotenoid biosynthesis protein homolog H0660 - Halobacterium sp. (strain NRC-1) plasmid pNRC100 >gi|2822338|gb|AAC82844.1| (AF016485) ORF H0660 [Halobacterium sp. NRC-1] >gi|2822427|gb|AAC82933.1| (AF016485) ORF H1696 [Halobacterium sp. NRC-1] Length = 379 Score = 272 bits (688), Expect = 5e-72 Identities = 159/360 (44%), Positives = 219/360 (60%), Gaps = 17/360 (4%) Query: 4 QTILRKFEHIKHCLTKNVEAHVTNGFEDVHLIHKSLPEIDKDEIDLSVKFLGRKFDYPIM 63 QT RK +H++ ++VE T GF+DVHL+H +LPE+D D ID S+ FLG PI Sbjct: 27 QTEDRKDDHLQIVQERDVETTGT-GFDDVHLVHNALPELDYDAIDPSIDFLGHDLSAPIF 85 Query: 64 ITGMTGGTRKGEIAWRINRTLAQAAQELNIPLGLGSQRAMIEKPE--TWESY-YVRDVAP 120 I MTGG INR LA+AA E I +GLGSQRA +E + ESY VRD AP Sbjct: 86 IESMTGGHHN---TTEINRALARAASETGIAMGLGSQRAGLELDDERVLESYTVVRDAAP 142 Query: 121 DVFLVGNLGAPQFGRNAKKRYSVDEVLYAIEKIEADAIAIHMNPLQESIQPEGDTTFSGV 180 D F+ GNLGA Q + Y ++ V A+E I+ADA+A+H+N LQE+ QPEGD Sbjct: 143 DAFIYGNLGAAQL-----REYDIEMVEQAVEMIDADALAVHLNFLQEATQPEGDVDGRNC 197 Query: 181 LEALAEITSTIDYPVIAKETGAGVSKEVAVELEAVGVDAIDISGLGGTSWSAVEYYRTKD 240 + A+ ++ + P+I KETG G+S E A EL A GVDA+D++G GGT+WS +E YR Sbjct: 198 VAAIERVSEALSVPIIVKETGNGISGETARELTAAGVDALDVAGKGGTTWSGIEAYRAAA 257 Query: 241 G---EKRNLALKFWDWGIKTAISLAEVRWATNLPIIASGGMRDGITMAKALAMGASMVGI 297 ++ + F 
+WGI TA S E A + +IASGG+R G+ +AKA+A+GA G+ Sbjct: 258 ANAPRQKQIGTLFREWGIPTAASTIEC-VAEHDCVIASGGVRTGLDVAKAIALGARAGGL 316 Query: 298 ALPVLRPAAKGDVEGVIRIIKGYAEEIRNVMFLVGARNIKELRKVPLVITGFVREWLLQR 357 A P L+PA G + VI + E+R MF+ G+ +I EL++V V+ G RE++ QR Sbjct: 317 AKPFLKPATDGP-DAVIERVGDLIAELRTAMFVTGSGSIDELQQVEYVLHGKTREYVEQR 375 >gb|AAG20768.1| (AE005145) carotenoid biosynthetic protein; Crt_1 [Halobacterium sp. NRC-1] >gi|10584461|gb|AAG21040.1| (AE005169) carotenoid biosynthetic protein; Crt_2 [Halobacterium sp. NRC-1] Length = 360 Score = 272 bits (688), Expect = 5e-72 Identities = 159/360 (44%), Positives = 219/360 (60%), Gaps = 17/360 (4%) Query: 4 QTILRKFEHIKHCLTKNVEAHVTNGFEDVHLIHKSLPEIDKDEIDLSVKFLGRKFDYPIM 63 QT RK +H++ ++VE T GF+DVHL+H +LPE+D D ID S+ FLG PI Sbjct: 8 QTEDRKDDHLQIVQERDVETTGT-GFDDVHLVHNALPELDYDAIDPSIDFLGHDLSAPIF 66 Query: 64 ITGMTGGTRKGEIAWRINRTLAQAAQELNIPLGLGSQRAMIEKPE--TWESY-YVRDVAP 120 I MTGG INR LA+AA E I +GLGSQRA +E + ESY VRD AP Sbjct: 67 IESMTGGHHN---TTEINRALARAASETGIAMGLGSQRAGLELDDERVLESYTVVRDAAP 123 Query: 121 DVFLVGNLGAPQFGRNAKKRYSVDEVLYAIEKIEADAIAIHMNPLQESIQPEGDTTFSGV 180 D F+ GNLGA Q + Y ++ V A+E I+ADA+A+H+N LQE+ QPEGD Sbjct: 124 DAFIYGNLGAAQL-----REYDIEMVEQAVEMIDADALAVHLNFLQEATQPEGDVDGRNC 178 Query: 181 LEALAEITSTIDYPVIAKETGAGVSKEVAVELEAVGVDAIDISGLGGTSWSAVEYYRTKD 240 + A+ ++ + P+I KETG G+S E A EL A GVDA+D++G GGT+WS +E YR Sbjct: 179 VAAIERVSEALSVPIIVKETGNGISGETARELTAAGVDALDVAGKGGTTWSGIEAYRAAA 238 Query: 241 G---EKRNLALKFWDWGIKTAISLAEVRWATNLPIIASGGMRDGITMAKALAMGASMVGI 297 ++ + F +WGI TA S E A + +IASGG+R G+ +AKA+A+GA G+ Sbjct: 239 ANAPRQKQIGTLFREWGIPTAASTIEC-VAEHDCVIASGGVRTGLDVAKAIALGARAGGL 297 Query: 298 ALPVLRPAAKGDVEGVIRIIKGYAEEIRNVMFLVGARNIKELRKVPLVITGFVREWLLQR 357 A P L+PA G + VI + E+R MF+ G+ +I EL++V V+ G RE++ QR Sbjct: 298 AKPFLKPATDGP-DAVIERVGDLIAELRTAMFVTGSGSIDELQQVEYVLHGKTREYVEQR 356 >gi|11499868 carotenoid biosynthetic gene ERWCRTS, putative [Archaeoglobus fulgidus] >gi|7444261|pir||G69535 carotenoid biosynthetic protein ERWCRTS 
homolog - Archaeoglobus fulgidus >gi|2648236|gb|AAB88970.1| (AE000947) carotenoid biosynthetic gene ERWCRTS, putative [Archaeoglobus fulgidus] Length = 317 Score = 266 bits (672), Expect = 4e-70 Identities = 141/322 (43%), Positives = 208/322 (63%), Gaps = 12/322 (3%) Query: 34 LIHKSLPEIDKDEIDLSVKFLGRKFDYPIMITGMTGGTRKGEIAWRINRTLAQAAQELNI 93 LIHK+LPE+D +ID ++F G+K +P++I MTGG + + IN L +A +E I Sbjct: 3 LIHKALPEVDYWKIDTEIEFFGKKLSFPLLIASMTGGHPETK---EINARLGEAVEEAGI 59 Query: 94 PLGLGSQRAMIEKPETWESY-YVRDVAPDVFLVGNLGAPQFGRNAKKRYSVDEVLYAIEK 152 +G+GSQRA IE +S+ VR+ AP+ F+ N+G PQ V+ V A+E Sbjct: 60 GMGVGSQRAAIEDESLADSFTVVREKAPNAFVYANIGMPQVIERG-----VEIVDRAVEM 114 Query: 153 IEADAIAIHMNPLQESIQPEGDTTFSGVLEALAEITSTIDYPVIAKETGAGVSKEVAVEL 212 I+ADA+AIH+N LQE+IQPEGD LE L E+ ++ PVIAKETGAG+S+EVAV L Sbjct: 115 IDADAVAIHLNYLQEAIQPEGDLNAEKGLEVLEEVCRSVKVPVIAKETGAGISREVAVML 174 Query: 213 EAVGVDAIDISGLGGTSWSAVEYYRTKDGEKRNLALKFWDWGIKTAISLAEVRWATNLPI 272 + GV AID+ G GGT++S VE YR D +++ + FWDWG+ TA S+ + R LP+ Sbjct: 175 KRAGVSAIDVGGKGGTTFSGVEVYRVNDEVSKSVGIDFWDWGLPTAFSIVDCRGI--LPV 232 Query: 273 IASGGMRDGITMAKALAMGASMVGIALPVLRPAAKGDVEGVIRIIKGYAEEIRNVMFLVG 332 IA+GG+R G+ +AK++A+GA + ALP LR AA E V I+ + ++ MFL G Sbjct: 233 IATGGLRSGLDVAKSIAIGAELGSAALPFLR-AAVESAEKVREEIEYFRRGLKTAMFLTG 291 Query: 333 ARNIKELRKVPLVITGFVREWL 354 +N++EL+ + + ++G ++EW+ Sbjct: 292 CKNVEELKGLKVFVSGRLKEWI 313 >gb|AAK40424.1| FMN-dependent dehydrogenase, conserved hypothetical [Sulfolobus solfataricus] Length = 368 Score = 256 bits (647), Expect = 3e-67 Identities = 150/367 (40%), Positives = 223/367 (59%), Gaps = 15/367 (4%) Query: 8 RKFEHIKHCLTKNVEAHVTNGF-EDVHLIHKSLPEIDKDEIDLSVKFLGRKFDYPIMITG 66 RK EH++ +NV+ ++ F DV L+H+ P I EI+ KF ++ PIM+TG Sbjct: 7 RKVEHVEIAAFENVDGLSSSTFLNDVILVHQGFPGISFSEINTKTKFFRKEISAPIMVTG 66 Query: 67 MTGGTRKGEIAWRINRTLAQAAQELNIPLGLGSQRAMIEKPETWESY-YVRDVAPDVFLV 125 MTGG + E+ RINR +A+ A++ IP+G+GSQR IEK E ES+ VR VAP + ++ Sbjct: 67 MTGG--RNELG-RINRIIAEVAEKFGIPMGVGSQRVAIEKAEARESFTIVRKVAPTIPII 123 Query: 
126 GNLGAPQFGRNAKKRYSVDEVLYAIEKIEADAIAIHMNPLQESIQPEGDTTFS-GVLEAL 184 NLG PQ K Y + E AI+ IEADAIA+H+NP QE QPEG+ + LE L Sbjct: 124 ANLGMPQL----VKGYGLKEFQDAIQMIEADAIAVHLNPAQEVFQPEGEPEYQIYALERL 179 Query: 185 AEITSTIDYPVIAKETGAGVSKEVAVELEAVGVDAIDISGLGGTSWSAVEYYR--TKDGE 242 +I+ + P+I KE+G G+S E A L + G+ D SG GGT+W A+E R + Sbjct: 180 RDISKELSVPIIVKESGNGISMETAKLLYSYGIKNFDTSGQGGTNWIAIEMIRDIRRGNW 239 Query: 243 KRNLALKFWDWGIKTAISLAEVRWA-TNLPIIASGGMRDGITMAKALAMGASMVGIALPV 301 K A F DWG+ TA S+ EVR++ + ++ SGG+R G+ AKA+A+GA + G+ALPV Sbjct: 240 KAESAKNFLDWGVPTAASIIEVRYSIPDAFLVGSGGIRSGLDAAKAIALGADIAGMALPV 299 Query: 302 LRPAAKGDVEGVIRIIKGYAEEIRNVMFLVGARNIKELRKVPLVITGFVREWLLQR-IDL 360 L+ A +G E + + + E++ M L G++N++ L++ +VI G ++EW R I+L Sbjct: 300 LKSAIEGK-ESLEQFFRKIIFELKATMMLTGSKNVEALKRSSIVILGKLKEWAEYRGINL 358 Query: 361 NSYLRAR 367 + Y + R Sbjct: 359 SIYEKVR 365 >sp|P95997|YC08_SULSO HYPOTHETICAL PROTEIN C05008 >gi|7444263|pir||S75425 hypothetical protein c05008 - Sulfolobus solfataricus >gi|1707831|emb|CAA69539.1| (Y08257) carotenoid biosynthetic gene ERWCRTS homolog [Sulfolobus solfataricus] Length = 368 Score = 256 bits (647), Expect = 3e-67 Identities = 150/367 (40%), Positives = 223/367 (59%), Gaps = 15/367 (4%) Query: 8 RKFEHIKHCLTKNVEAHVTNGF-EDVHLIHKSLPEIDKDEIDLSVKFLGRKFDYPIMITG 66 RK EH++ +NV+ ++ F DV L+H+ P I EI+ KF ++ PIM+TG Sbjct: 7 RKVEHVEIAAFENVDGLSSSTFLNDVILVHQGFPGISFSEINTKTKFFRKEISAPIMVTG 66 Query: 67 MTGGTRKGEIAWRINRTLAQAAQELNIPLGLGSQRAMIEKPETWESY-YVRDVAPDVFLV 125 MTGG + E+ RINR +A+ A++ IP+G+GSQR IEK E ES+ VR VAP + ++ Sbjct: 67 MTGG--RNELG-RINRIIAEVAEKFGIPMGVGSQRVAIEKAEARESFTIVRKVAPTIPII 123 Query: 126 GNLGAPQFGRNAKKRYSVDEVLYAIEKIEADAIAIHMNPLQESIQPEGDTTFS-GVLEAL 184 NLG PQ K Y + E AI+ IEADAIA+H+NP QE QPEG+ + LE L Sbjct: 124 ANLGMPQL----VKGYGLKEFQDAIQMIEADAIAVHLNPAQEVFQPEGEPEYQIYALERL 179 Query: 185 AEITSTIDYPVIAKETGAGVSKEVAVELEAVGVDAIDISGLGGTSWSAVEYYR--TKDGE 242 +I+ + P+I KE+G G+S E A L + G+ D SG GGT+W A+E R + Sbjct: 180 
RDISKELSVPIIVKESGNGISMETAKLLYSYGIKNFDTSGQGGTNWIAIEMIRDIRRGNW 239 Query: 243 KRNLALKFWDWGIKTAISLAEVRWA-TNLPIIASGGMRDGITMAKALAMGASMVGIALPV 301 K A F DWG+ TA S+ EVR++ + ++ SGG+R G+ AKA+A+GA + G+ALPV Sbjct: 240 KAESAKNFLDWGVPTAASIIEVRYSIPDAFLVGSGGIRSGLDAAKAIALGADIAGMALPV 299 Query: 302 LRPAAKGDVEGVIRIIKGYAEEIRNVMFLVGARNIKELRKVPLVITGFVREWLLQR-IDL 360 L+ A +G E + + + E++ M L G++N++ L++ +VI G ++EW R I+L Sbjct: 300 LKSAIEGK-ESLEQFFRKIIFELKATMMLTGSKNVEALKRSSIVILGKLKEWAEYRGINL 358 Query: 361 NSYLRAR 367 + Y + R Sbjct: 359 SIYEKVR 365 >sp|P74287|YF56_SYNY3 HYPOTHETICAL 37.5 KD PROTEIN sll1556 >gi|7429104|pir||S75922 hypothetical protein sll1556 - Synechocystis sp. (strain PCC 6803) >gi|1653467|dbj|BAA18381.1| (D90913) hypothetical protein [Synechocystis sp.] Length = 349 Score = 255 bits (644), Expect = 7e-67 Identities = 145/344 (42%), Positives = 209/344 (60%), Gaps = 10/344 (2%) Query: 3 EQTILRKFEHIKHCLTKNVEAH-VTNGFEDVHLIHKSLPEIDKDEIDLSVKFLGRKFDYP 61 + T RK +HI+ L ++V ++ GFE + L H +LP +D D +DL + G+ YP Sbjct: 2 DSTPHRKSDHIRIVLEEDVVGKGISTGFERLMLEHCALPAVDLDAVDLGLTLWGKSLTYP 61 Query: 62 IMITGMTGGTRKGEIAWRINRTLAQAAQELNIPLGLGSQRAMIEKPETWESYYVRDVAPD 121 +I+ MTGGT + + +IN LA+ AQ L I +GLGSQRA IE P+ +Y VR VAPD Sbjct: 62 WLISSMTGGTPEAK---QINLFLAEVAQALGIAMGLGSQRAAIENPDLAFTYQVRSVAPD 118 Query: 122 VFLVGNLGAPQFGRNAKKRYSVDEVLYAIEKIEADAIAIHMNPLQESIQPEGDTTFSGVL 181 + L NLG Q Y +++ A++ IEADA+ +H+NPLQE++QP+GD +SG+ Sbjct: 119 ILLFANLGLVQLNYG----YGLEQAQRAVDMIEADALILHLNPLQEAVQPDGDRLWSGLW 174 Query: 182 EALAEITSTIDYPVIAKETGAGVSKEVAVELEAVGVDAIDISGLGGTSWSAVEYYRTKDG 241 L + ++ PVI KE G G+S VA L+ GV AID++G GGTSWS VE +R D Sbjct: 175 SKLEALVEALEVPVIVKEVGNGISGPVAKRLQECGVGAIDVAGAGGTSWSEVEAHRQTDR 234 Query: 242 EKRNLALKFWDWGIKTAISLAEVRWAT-NLPIIASGGMRDGITMAKALAMGASMVGIALP 300 + + +A F DWG+ TA SL +V T + + ASGG+R GI AKA+A+GA++VG A P Sbjct: 235 QAKEVAHNFADWGLPTAWSLQQVVQNTEQILVFASGGIRSGIDGAKAIALGATLVGSAAP 294 Query: 301 VLRPAAKGDVEGVIRIIKGYAEEIRNVMFLVGARNIKELRKVPL 344 VL AK + + V + E++ F A N+ +L +VPL Sbjct: 
295 VL-AEAKINAQRVYDHYQARLRELQIAAFCCDAANLTQLAQVPL 337 >pir||C72560 hypothetical protein APE1765 - Aeropyrum pernix (strain K1) >gi|5105455|dbj|BAA80768.1| (AP000062) 361aa long hypothetical protein [Aeropyrum pernix] Length = 361 Score = 249 bits (630), Expect = 3e-65 Identities = 136/339 (40%), Positives = 203/339 (59%), Gaps = 10/339 (2%) Query: 17 LTKNVEAHVTNGFEDVHLIHKSLPEIDKDEIDLSVKFLGRKFDYPIMITGMTGGTRKGEI 76 ++ VE+ + E V ++H PE++ ++ L + F G + P++ITGMTGG ++ Sbjct: 3 VSSKVESRESTLLEYVRIVHNPTPEVNLGDVSLEIDFCGGRLRAPLVITGMTGG--HPDV 60 Query: 77 AWRINRTLAQAAQELNIPLGLGSQRAMIEKPETWESYYV-RDVAPDVFLVGNLGAPQFGR 135 W INR LA A+EL I +G+GSQRA IE P ++ R+ AP+ FL+ NLGAPQ Sbjct: 61 EW-INRELASVAEELGIAIGVGSQRAAIEDPSLARTFRAAREAAPNAFLIANLGAPQLSL 119 Query: 136 NAKKRYSVDEVLYAIEKIEADAIAIHMNPLQESIQPEGDTTFSGVLEALAEITSTIDYPV 195 YSV EV A+E I+ADAIAIH+NP QE+ QPEGD + GV+ +AE PV Sbjct: 120 G----YSVREVRMAVEMIDADAIAIHLNPGQEAYQPEGDPFYRGVVGKIAEAAEAAGVPV 175 Query: 196 IAKETGAGVSKEVAVELEAVGVDAIDISGLGGTSWSAVEYYRTKDGEKRNLALKFWD-WG 254 I KETG G+S+E +L A+GV D++GLGGT+W +E R + A D WG Sbjct: 176 IVKETGNGLSREAVAQLRALGVRCFDVAGLGGTNWIKIEVLRGRKAGSPLEAGPLQDFWG 235 Query: 255 IKTAISLAEVRWAT-NLPIIASGGMRDGITMAKALAMGASMVGIALPVLRPAAKGDVEGV 313 TA +L E R A + IIASGG+R+G+ A+A+A+GA G+ALP +R G + Sbjct: 236 NPTAAALMEARTAAPDAYIIASGGVRNGLDAARAIALGADAAGVALPAIRSLLSGGRQAT 295 Query: 314 IRIIKGYAEEIRNVMFLVGARNIKELRKVPLVITGFVRE 352 ++++K +++ +++VG ++ L + P+V+ G + E Sbjct: 296 LKLLKAIEYQLKTAVYMVGETRVRGLWRAPIVVWGRLAE 334 >pir||C71704 hypothetical protein RP452 - Rickettsia prowazekii >gi|3861009|emb|CAA14909.1| (AJ235271) unknown [Rickettsia prowazekii] Length = 342 Score = 243 bits (614), Expect = 2e-63 Identities = 139/341 (40%), Positives = 209/341 (60%), Gaps = 9/341 (2%) Query: 6 ILRKFEHIKHCLTKNVEAHVTNGFEDVHLIHKSLPEIDKDEIDLSVKFLGRKFDYPIMIT 65 I RK EHI+ L +NV + + +G E + IH +LPEI+ D ID + FLG+ PI+I+ Sbjct: 9 IERKQEHIEINLKQNVNSTLKSGLESIKFIHNALPEINYDSIDTTTTFLGKDMKAPILIS 68 Query: 66 
GMTGGTRKGEIAWRINRTLAQAAQELNIPLGLGSQRAMIEKPETWESYYVRDVAPDVFLV 125 MTGGT + A IN LAQAAQ+ I +GLGS R ++ KP+T +++ VR VAPD+ L+ Sbjct: 69 SMTGGTAR---ARDINYRLAQAAQKSGIAMGLGSMRILLTKPDTIKTFTVRHVAPDIPLL 125 Query: 126 GNLGAPQFGRNAKKRYSVDEVLYAIEKIEADAIAIHMNPLQESIQPEGDTTFSGVLEALA 185 N+GA Q + E Y I+ I+ADA+ +H+N L E QPEG+ + +L + Sbjct: 126 ANIGAVQLNYGVTPK----ECQYLIDTIKADALILHLNVLHELTQPEGNKNWENLLPKIK 181 Query: 186 EITSTIDYPVIAKETGAGVSKEVAVELEAVGVDAIDISGLGGTSWSAVEYYRTKDGEKRN 245 E+ + + PVI KE G G+SK+VA +L GV +DI+G GGTSWS VE YR K+ + Sbjct: 182 EVINYLSVPVIVKEVGYGLSKQVAKKLIKAGVKVLDIAGSGGTSWSQVEAYRAKNSMQNR 241 Query: 246 LALKFWDWGIKTAISLAEVR-WATNLPIIASGGMRDGITMAKALAMGASMVGIALPVLRP 304 +A F +WGI T SL ++ + ++ IIASGG++ GI AKA+ MGA++ G+A +L+ Sbjct: 242 IASSFINWGITTLDSLKMLQEISKDITIIASGGLQSGIDGAKAIRMGANIFGLAGKLLKA 301 Query: 305 AAKGDVEGVIRIIKGYAEEIRNVMFLVGARNIKELRKVPLV 345 A + V+ I+ E+++ M G+ +K+L K ++ Sbjct: 302 ADIAE-SLVLEEIQVIIEQLKITMLCTGSCTLKDLAKAEIM 341 >emb|CAC11250.1| (AL445063) conserved hypothetical protein [Thermoplasma acidophilum] Length = 348 Score = 240 bits (605), Expect = 3e-62 Identities = 141/351 (40%), Positives = 216/351 (61%), Gaps = 14/351 (3%) Query: 8 RKFEHIKHCLTKNVEAHVTNGFEDVHLIHKSLPEIDKDEIDLSVKFLGRKFDYPIMITGM 67 RK EHI+ ++V + N ++D+ L+H++ PE++ DEID SV FLG+K +P++I+ M Sbjct: 5 RKEEHIRIAENEDVSSF-HNFWDDISLMHEADPEVNYDEIDTSVDFLGKKLKFPMIISSM 63 Query: 68 TGGTRKGEIAWRINRTLAQAAQELNIPLGLGSQRAMIEKPETWESYYVRDVAPDVFLVGN 127 TGG EIA INR LA AA+ I +G+GS RA I ++Y V + + + N Sbjct: 64 TGGA---EIAKNINRNLAVAAERFGIGMGVGSMRAAIVDRSIEDTYSVINESHVPLKIAN 120 Query: 128 LGAPQFGRNAKKRYSVDEVLYAIEKIEADAIAIHMNPLQESIQPEGDTTFSGVLEALAEI 187 +GAPQ R K S ++ Y + I+AD +A+H N LQE +QPEGD GV++ + ++ Sbjct: 121 IGAPQLVRQDKDAVSNRDIAYIYDLIKADFLAVHFNFLQEMVQPEGDRNSKGVIDRIKDL 180 Query: 188 TSTIDYPVIAKETGAGVSKEVAVELEAVGVDAIDISGLGGTSWSAVEYYRTK---DGEKR 244 + + + +IAKETG+G S+ A L GV AI++SG+ GT+++AVEYYR + + EK Sbjct: 181 SGS--FNIIAKETGSGFSRRTAERLIDAGVKAIEVSGVSGTTFAAVEYYRARKENNLEKM 238 Query: 245 
NLALKFWDWGIKTAISLAEVRWATNL-PIIASGGMRDGITMAKALAMGASMVGIALPVLR 303 + FW+WGI S A V + ++L P+I SGG+R+G+ +AKA+AMGA+ G A +L+ Sbjct: 239 RIGETFWNWGIP---SPASVYYCSDLAPVIGSGGLRNGLDLAKAIAMGATAGGFARSLLK 295 Query: 304 PAAKGDVEGVIRIIKGYAEEIRNVMFLVGARNIKELRKVPLVITGFVREWL 354 A D E +++ I+ E R +FL G +N+ EL+ VI +R WL Sbjct: 296 D-ADTDPEMLMKNIELIQREFRVALFLTGNKNVYELKFTKKVIVDPLRSWL 345 >sp|Q01335|YCR6_ERWHE HYPOTHETICAL 37.2 KD PROTEIN IN CRTE-CRTX INTERGENIC REGION (ORF6) >gi|1073298|pir||S52979 hypothetical protein 6 - Erwinia herbicola >gi|148409|gb|AAA64978.1| (M87280) gene not found in Erwinia uredovora crt gene cluster; ORF6 [Erwinia herbicola] Length = 347 Score = 230 bits (581), Expect = 2e-59 Identities = 137/346 (39%), Positives = 201/346 (57%), Gaps = 11/346 (3%) Query: 2 EEQTILRKFEHIKHCLT-KNVEAHVTNGFEDVHLIHKSLPEIDKDEIDLSVKFLGRKFDY 60 +E+ + RK +H+ L + + GFE H +LPE++ +I L FL R+ Sbjct: 3 DERLVQRKNDHLDIVLDPRRAVTQASAGFERWRFTHCALPELNFSDITLETTFLNRQLQA 62 Query: 61 PIMITGMTGGTRKGEIAWRINRTLAQAAQELNIPLGLGSQRAMIEKPETWE-SYYVRDVA 119 P++I+ MTGG + INR LA+AAQ L I +G+GSQR IE +R +A Sbjct: 63 PLLISSMTGGVERSR---HINRHLAEAAQVLKIAMGVGSQRVAIESDAGLGLDKTLRQLA 119 Query: 120 PDVFLVGNLGAPQFGRNAKKRYSVDEVLYAIEKIEADAIAIHMNPLQESIQPEGDTTFSG 179 PDV L+ NLGA Q R +D A+E IEADA+ +H+NPLQE++QP GD + G Sbjct: 120 PDVPLLANLGAAQL----TGRKGIDYARRAVEMIEADALIVHLNPLQEALQPGGDRDWRG 175 Query: 180 VLEALAEITSTIDYPVIAKETGAGVSKEVAVELEAVGVDAIDISGLGGTSWSAVEYYRTK 239 L A+ + + P++ KE GAG+S+ VA +L GV ID++G GGTSW+AVE R Sbjct: 176 RLAAIETLVRELPVPLVVKEVGAGISRTVAGQLIDAGVTVIDVAGAGGTSWAAVEGERAA 235 Query: 240 DGEKRNLALKFWDWGIKTAISLAEVRWA-TNLPIIASGGMRDGITMAKALAMGASMVGIA 298 ++R++A F DWGI TA +L ++ A +P+IASGG+++G+ AKAL +GA MVG A Sbjct: 236 TEQQRSVANVFADWGIPTAEALVDIAEAWPQMPLIASGGIKNGVDAAKALRLGACMVGQA 295 Query: 299 LPVLRPAAKGDVEGVIRIIKGYAEEIRNVMFLVGARNIKELRKVPL 344 VL A E VI E++R F G+R++ +L++ + Sbjct: 296 AAVLGSAGV-STEKVIDHFNVIIEQLRVACFCTGSRSLSDLKQADI 340 >gb|AAF77222.1|AC009601_16 (AC009601) L165.10 [Leishmania major] 
>gi|9864748|gb|AAG01358.1|AC068666_5 (AC068666) L165.10 [Leishmania major] Length = 357 Score = 215 bits (542), Expect = 6e-55 Identities = 128/347 (36%), Positives = 192/347 (54%), Gaps = 14/347 (4%) Query: 8 RKFEHIKHCLTKNVEAHV--TNGFEDVHLIHKSLPEIDKDEIDLSVKFLGRKFDYPIMIT 65 RK +HI CL ++VE H T+ + L +K+LPE+D +ID S +F+G++ +P I+ Sbjct: 17 RKKDHIDICLHQDVEPHKRRTSIWNKYTLPYKALPEVDLQKIDTSCEFMGKRISFPFFIS 76 Query: 66 GMTGGTRKGEIAWRINRTLAQAAQELNIPLGLGSQRAMIEKPETWESYYVRDVAPDVFLV 125 MTGG G + IN LA+A + IP GLGS R + ++ V++ P V ++ Sbjct: 77 SMTGGEAHGRV---INENLAKACEAEKIPFGLGSMRIINRYASAVHTFNVKEFCPSVPML 133 Query: 126 GNLGAPQFGRNAKKRYSVDEVLYAIEKIEADAIAIHMNPLQESIQPEGDTTFSGVLEALA 185 N+G Q + EV + + AD + IH+N QE QPEGDT F G++E L Sbjct: 134 ANIGLVQLNYG----FGPKEVNNLVNSVRADGLCIHLNHTQEVCQPEGDTNFEGLIEKLR 189 Query: 186 EITSTIDYPVIAKETGAGVSKEVAVELEAVGVDAIDISGLGGTSWSAVEYYRTK-DGEKR 244 ++ I PV+ K G G+ E V ++A GV +D+SG GGTSW+ +E R E+ Sbjct: 190 QLLPHIKVPVLVKGVGHGIDYESMVAIKASGVKYVDVSGCGGTSWAWIEGRRQPYKAEEE 249 Query: 245 NLALKFWDWGIKTAISLAEVRWAT---NLPIIASGGMRDGITMAKALAMGASMVGIALPV 301 N+ D G+ T + L E T +L +IA GG+R+G+ +AKAL MGA A+P Sbjct: 250 NIGYLLRDIGVPTDVCLRESAPLTVNGDLHLIAGGGIRNGMDVAKALMMGAEYATAAMPF 309 Query: 302 LRPAAKGDVEGVIRIIKGYAEEIRNVMFLVGARNIKELRKVPLVITG 348 L A + E V +I+ +E+R MF GARNI+ELR++ ++ G Sbjct: 310 LAAALESS-EAVRAVIQRMRQELRVSMFTCGARNIEELRRMKVIELG 355 >pir||G75437 conserved hypothetical protein - Deinococcus radiodurans (strain R1) >gi|6458821|gb|AAF10661.1|AE001959_1 (AE001959) conserved hypothetical protein [Deinococcus radiodurans] Length = 286 Score = 206 bits (520), Expect = 2e-52 Identities = 118/295 (40%), Positives = 177/295 (60%), Gaps = 16/295 (5%) Query: 49 LSVKFLGRKFDYPIMITGMTGGTRKGEIAWRINRTLAQAAQELNIPLGLGSQRAMIEKPE 108 L FLGR+ P++I MTGG K + INR LA AA+ L + + LGSQR M+E P+ Sbjct: 3 LDTVFLGRRLKAPVLIGAMTGGAEKAGV---INRNLATAARNLGLGMMLGSQRVMLEHPD 59 Query: 109 TWESYYVRDVAPDVFLVGNLGAPQFGRNAKKRYSVDEVLYAIEKIEADAIAIHMNPLQES 168 WES+ VR+VAP++ L+GNLGA QF Y ++ A++++ ADA+AIH+NPLQE+ 
Sbjct: 60 AWESFNVREVAPEILLIGNLGAAQFMLG----YGAEQARRAVDEVMADALAIHLNPLQEA 115 Query: 169 IQPEGDTTFSGVLEALAEITSTIDYPVIAKETGAGVSKEVAVELEAVGVDAIDISGLGGT 228 +Q GDT + GV L ++ +D+PVI KE G G+ L A D++G GGT Sbjct: 116 LQRGGDTRWQGVTYRLKQVARELDFPVIIKEVGHGLDAATLRALADGPFAAYDVAGAGGT 175 Query: 229 SWSAVEYYRTKDGEKRNLALKFWDWGIKTAISLAEVRWATNLP---IIASGGMRDGITMA 285 SW+ VE G+ + L + G+ TA +L + R LP +IASGG+R G+ A Sbjct: 176 SWARVEQL-VAHGQVHSPDL--CELGVPTAQALRQAR--KTLPGAQLIASGGIRSGLDAA 230 Query: 286 KALAMGASMVGIALPVLRPAAKGDVEGVIRIIKGYAEEIRNVMFLVGARNIKELR 340 +AL++GA +V +A P+L PA E ++ + +E+R +F+ G R+++E+R Sbjct: 231 RALSLGAEVVAVARPLLEPALDSS-EAAEAWLRNFIQELRVALFVGGYRDVREVR 284 >pir||C70185 carotenoid biosynthesis protein homolog - Lyme disease spirochete >gi|2688617|gb|AAC67033.1| (AE001169) carotenoid biosynthesis protein, putative [Borrelia burgdorferi] Length = 360 Score = 194 bits (488), Expect = 1e-48 Identities = 107/354 (30%), Positives = 192/354 (54%), Gaps = 11/354 (3%) Query: 1 MEEQTILRKFEHIKHCLTKNVEAHVTNGFEDVHLIHKSLPEIDKDEIDLSVKFLGRKFDY 60 +E + K HI+ CL KN N + + L H +L + + EI++ + G Sbjct: 9 IEPNILENKKRHIEICLNKNDVKGGCNFLKFIKLKHNALSDFNFSEINIKEEIFGYNISM 68 Query: 61 PIMITGMTGGTRKGEIAWRINRTLAQAAQELNIPLGLGSQRAMIEKPETWESYYVRDVAP 120 P+ I+ MTGG+++G N++L + A L IP+GLGS + + + PE + ++ A Sbjct: 69 PVFISSMTGGSKEGN---DFNKSLVRIANYLKIPIGLGSFKLLFKYPEYIRDFTLKRYAH 125 Query: 121 DVFLVGNLGAPQFGRNAKKRYSVDEVLYAIEKIEADAIAIHMNPLQESIQPEGDTTFSGV 180 ++ L N+GA Q + + ++ I+++E DAI +H+N QE ++ +GD F G+ Sbjct: 126 NIPLFANVGAVQI-----VEFGISKIAEMIKRLEVDAIIVHLNAGQELMKVDGDRNFKGI 180 Query: 181 LEALAEITSTIDYPVIAKETGAGVSKEVAVELEAVGVDAIDISGLGGTSWSAVEYYRTKD 240 E++A+++ + P+I KETG G+S + EL ++G +D++G GGT+W VE ++ + Sbjct: 181 RESIAKLSDFLSVPLIVKETGFGISPKDVKELFSLGASYVDLAGSGGTNWILVEGMKSNN 240 Query: 241 GEKRNLALKFWDWGIKTAISLAEVRWATNLPIIASGGMRDGITMAKALAMGASMVGIALP 300 N+A F DWGI + +L + + I ASGG G+ +AK +A+GA ++G+A Sbjct: 241 ---LNIASCFSDWGIPSVFTLLSIDDSLKANIFASGGYETGMDIAKGIALGARLIGVAAV 297 Query: 301 
VLRPAAKGDVEGVIRIIKGYAEEIRNVMFLVGARNIKELRKVPLVITGFVREWL 354 VLR + V + Y ++ MFL G++++ E R ++ ++ + L Sbjct: 298 VLRAFYDSGEDAVFGLFSDYEHILKMSMFLSGSKSLLEFRNNKYFLSSYLLDEL 351 >dbj|BAB07793.1| (AB037666) hypothetical protein [Streptomyces sp. CL190] Length = 363 Score = 193 bits (486), Expect = 2e-48 Identities = 115/351 (32%), Positives = 195/351 (54%), Gaps = 18/351 (5%) Query: 8 RKFEHIKHCLTKNVEAHVTNGFEDVHLIHKSLPEIDKDEIDLSVKFLGRKFDYPIMITGM 67 RK +H++ + ++ N F+DV +H +L ID+ ++ L+ F G + PI I M Sbjct: 6 RKDDHVRLAIEQHNAHSGRNQFDDVSFVHHALAGIDRPDVSLATSFAGISWQVPIYINAM 65 Query: 68 TGGTRKGEIAWRINRTLAQAAQELNIPLGLGSQRAMIEKPETWESYYV-RDVAPDVFLVG 126 TGG+ K + INR LA AA+E +P+ GS A I+ P +++ V RD P+ F++ Sbjct: 66 TGGSEKTGL---INRDLATAARETGVPIASGSMNAYIKDPSCADTFRVLRDENPNGFVIA 122 Query: 127 NLGAPQFGRNAKKRYSVDEVLYAIEKIEADAIAIHMNPLQESIQPEGDTTFSGVLEALAE 186 N+ A +VD AI+ IEA+A+ IH+N QE+ PEGD +F+ + + + Sbjct: 123 NINATT---------TVDNAQRAIDLIEANALQIHINTAQETPMPEGDRSFASWVPQIEK 173 Query: 187 ITSTIDYPVIAKETGAGVSKEVAVELEAVGVDAIDISGLGGTSWSAVEYYRTKDGEKRNL 246 I + +D PVI KE G G+S++ + L +GV A D+SG GGT ++ +E R + G+ L Sbjct: 174 IAAAVDIPVIVKEVGNGLSRQTILLLADLGVQAADVSGRGGTDFARIENGRRELGDYAFL 233 Query: 247 ALKFWDWGIKTAISLAEVRWATNLPIIASGGMRDGITMAKALAMGASMVGIALPVLRPAA 306 WG TA L + + +LP++ASGG+R + + +ALA+GA VG + LR Sbjct: 234 ----HGWGQSTAACLLDAQ-DISLPVLASGGVRHPLDVVRALALGARAVGSSAGFLRTLM 288 Query: 307 KGDVEGVIRIIKGYAEEIRNVMFLVGARNIKELRKVPLVITGFVREWLLQR 357 V+ +I + + +++ + ++GAR +L + +++ G +R++ R Sbjct: 289 DDGVDALITKLTTWLDQLAALQTMLGARTPADLTRCDVLLHGELRDFCADR 339 >dbj|BAB07820.1| (AB037907) hypothetical protein [Streptomyces griseolosporeus] Length = 364 Score = 183 bits (461), Expect = 2e-45 Identities = 120/361 (33%), Positives = 191/361 (52%), Gaps = 21/361 (5%) Query: 8 RKFEHIKHCLTKNVEAHV-TNGFEDVHLIHKSLPEIDKDEIDLSVKFLGRKFDYPIMITG 66 RK +H++ T+ AH N F+DV +H +L ID+ ++ L+ F G + P+ I Sbjct: 6 RKDDHVR-LATEQQRAHSGRNQFDDVSFVHHALAGIDRPDVRLATTFAGITWRLPLYINA 64 Query: 67 
MTGGTRKGEIAWRINRTLAQAAQELNIPLGLGSQRAMIEKPETWESYYV-RDVAPDVFLV 125 MTGG+ K INR LA AA+E + GS A P +++ V R PD F++ Sbjct: 65 MTGGSAK---TGAINRDLAVAARETGAAIASGSMHAFFRDPSCADTFRVLRTENPDGFVM 121 Query: 126 GNLGAPQFGRNAKKRYSVDEVLYAIEKIEADAIAIHMNPLQESIQPEGDTTFSGVLEALA 185 N+ A SVD A++ IEA+A+ IH+N QE+ PEGD +F +A Sbjct: 122 ANVNATA---------SVDNARRAVDLIEANALQIHLNTAQETPMPEGDRSFGSWPAQIA 172 Query: 186 EITSTIDYPVIAKETGAGVSKEVAVELEAVGVDAIDISGLGGTSWSAVEYYRTKDGEKRN 245 +IT+ +D PVI KE G G+S++ + L +GV D+SG GGT ++ +E R G+ Sbjct: 173 KITAAVDVPVIVKEVGNGLSRQTLLALPDLGVRVADVSGRGGTDFARIENSRRPLGDYAF 232 Query: 246 LALKFWDWGIKTAISLAEVRWATNLPIIASGGMRDGITMAKALAMGASMVGIALPVLRPA 305 L WG T L + + P++ASGG+R+ + +A+ALA+GA VG + LR Sbjct: 233 L----HGWGQSTPACLLDAQ-DVGFPLLASGGIRNPLDVARALALGAGAVGSSGVFLRTL 287 Query: 306 AKGDVEGVIRIIKGYAEEIRNVMFLVGARNIKELRKVPLVITGFVREWLLQR-IDLNSYL 364 G V ++ I + +++ + ++GAR +L + ++I G +R + R ID+ + Sbjct: 288 IDGGVSALVAQISTWLDQLAALQTMLGARTPADLTRCDVLIHGPLRSFCTDRGIDIGRFA 347 Query: 365 R 365 R Sbjct: 348 R 348 >sp|P50740|YPGA_BACSU HYPOTHETICAL 22.6 KD PROTEIN IN CMK-GPSA INTERGENIC REGION >gi|7474674|pir||D69935 conserved hypothetical protein ypgA - Bacillus subtilis >gi|1146216|gb|AAC83963.1| (L47648) similar to Erwinia herbicola carotenoid biosynthesis cluster; putative [Bacillus subtilis] >gi|2634705|emb|CAB14203.1| (Z99115) similar to hypothetical proteins [Bacillus subtilis] Length = 212 Score = 159 bits (398), Expect = 5e-38 Identities = 91/215 (42%), Positives = 135/215 (62%), Gaps = 11/215 (5%) Query: 153 IEADAIAIHMNPLQESIQPEGDTTFSGVLEALAEITSTIDYPVIAKETGAGVSKEVAVEL 212 I A+A+ IH+N +QE + PEGD +FSG L+ + +I S + PVI KE G G+SK A +L Sbjct: 2 IGANALQIHLNVIQEIVMPEGDRSFSGALKRIEQICSRVSVPVIVKEVGFGMSKASAGKL 61 Query: 213 EAVGVDAIDISGLGGTSWSAVEYYRTKDGEKRNLALKFWDWGIKTAISLAEVRWATNLP- 271 G A+DI G GGT++S +E R +R ++ F WGI TA SLAE+R + P Sbjct: 62 YEAGAAAVDIGGYGGTNFSKIENLR----RQRQISF-FNSWGISTAASLAEIR--SEFPA 114 Query: 272 --IIASGGMRDGITMAKALAMGASMVGIALPVLRPAAKGDVEGVIRIIKGYAEEIRNVMF 329 
+IASGG++D + +AKA+A+GAS G+A L+ EG++ I+ EE++ +M
Sbjct: 115 MIASGGLQDALDVAKAIALGASCTGMAGHFLKALTDSGEEGLLEEIQLILEELKLIMT 174

Query: 330 LVGARNIKELRKVPLVITGFVREWLLQR-IDLNSY 363
           ++GAR I +L+K PLVI G    WL +R ++ +SY
Sbjct: 175 VLGARTIADLQKAPLVIKGETHHWLTERGVNTSSY 209

  Database: ./suso.pep
    Posted date:  Jul 6, 2001 5:57 PM
  Number of letters in database: 840,471
  Number of sequences in database:  2977

  Database: /banques/blast2/nr.pep
    Posted date:  Dec 14, 2000 12:46 PM
  Number of letters in database: 188,266,275
  Number of sequences in database:  595,510

Lambda     K      H
   0.319    0.137    0.397

Gapped
Lambda     K      H
   0.270   0.0470    0.230

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 136273883
Number of Sequences: 2977
Number of extensions: 5649939
Number of successful extensions: 14129
Number of sequences better than 1.0e-10: 20
Number of HSP's better than 0.0 without gapping: 18
Number of HSP's successfully gapped in prelim test: 2
Number of HSP's that attempted gapping in prelim test: 14032
Number of HSP's gapped (non-prelim): 20
length of query: 370
length of database: 189,106,746
effective HSP length: 56
effective length of query: 314
effective length of database: 155,591,474
effective search space: 48855722836
effective search space used: 48855722836
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.8 bits)
X3: 64 (24.9 bits)
S1: 41 (21.7 bits)
S2: 166 (69.1 bits)
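
The footer statistics can be cross-checked against the scores reported for each hit. The sketch below is not part of the BLAST output; it is a minimal illustration that assumes the standard Karlin-Altschul relations for gapped alignments (bit score = (lambda * S - ln K) / ln 2, and E = K * m * n * exp(-lambda * S)) and uses the rounded gapped Lambda and K printed in this report. The function and constant names are hypothetical. The effective database length is assumed to be the total letters minus the number of sequences times the effective HSP length, which reproduces the footer value of 155,591,474.

```python
import math

# Values copied from the report header and footer.
GAPPED_LAMBDA = 0.270          # "Gapped Lambda"
GAPPED_K = 0.0470              # "Gapped K"
QUERY_LENGTH = 370             # "length of query"
DB_LETTERS = 189_106_746       # "length of database"
DB_SEQUENCES = 598_487         # sequences in both databases combined
EFFECTIVE_HSP_LENGTH = 56      # "effective HSP length"


def effective_search_space():
    """Effective query length times effective database length."""
    m = QUERY_LENGTH - EFFECTIVE_HSP_LENGTH
    n = DB_LETTERS - DB_SEQUENCES * EFFECTIVE_HSP_LENGTH
    return m * n


def bit_score(raw_score):
    """Convert a raw alignment score to a normalized bit score."""
    return (GAPPED_LAMBDA * raw_score - math.log(GAPPED_K)) / math.log(2)


def expect_value(raw_score):
    """Expected number of chance hits scoring at least raw_score."""
    return GAPPED_K * effective_search_space() * math.exp(-GAPPED_LAMBDA * raw_score)


if __name__ == "__main__":
    print("effective search space:", effective_search_space())
    # Raw scores of the first hit (1886) and the last hit (398) in the report.
    for raw in (1886, 398):
        print(raw, f"{bit_score(raw):.1f} bits", f"E = {expect_value(raw):.1e}")
```

With these inputs the effective search space comes out to 48855722836, as in the footer, and the raw scores 1886 and 398 map to roughly 739 and 159 bits. Because Lambda and K are printed to only three significant figures, recomputed values can differ slightly from the integers in the report.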