BLASTP 2.0.10 [Aug-26-1999]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= PAB1246 (PAB1246) DE:Hypothetical protein
         (447 letters)

Database: ./suso.pep; /banques/blast2/nr.pep
           598,487 sequences; 189,106,746 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

pir||F75016 hypothetical protein PAB1246 - Pyrococcus abyssi (st...   912  0.0
pir||D69769 cellulose synthase homolog ydaM - Bacillus subtilis ...   136  7e-31
gb|AAK43339.1| Glycosyltransferase, putative [Sulfolobus solfata...   108  1e-22
gb|AAD52055.1|AF086783_3 (AF086783) IcaA [Staphylococcus aureus]      104  2e-21
pir||S77608 probable intercellular adhesion protein A - Staphylo...   100  3e-20
sp|P75905|YCDQ_ECOLI HYPOTHETICAL 50.8 KDA PROTEIN IN PHOH-CSGG ...    96  8e-19
pir||T34632 probable bi-functional transferase/deacetylase - Str...    93  7e-18
gb|AAB66590.1| (U22837) HmsR [Yersinia pestis]                         88  2e-16
pir||T47005 hypothetical protein hmsR [imported] - Yersinia pest...    88  2e-16
gb|AAC98402.1| (L39794) WbbF [Plasmid pWQ799]                          87  3e-16
emb|CAB72208.1| (AL138851) putative bi-functional transferase/de...    85  1e-15
pir||T05111 hypothetical protein F28M20.220 - Arabidopsis thalia...    80  5e-14
gb|AAD23884.1|AC006954_5 (AC006954) putative glucosyltransferase...    79  9e-14
pir||S75693 hypothetical protein sll1377 - Synechocystis sp. (st...    78  3e-13
gb|AAF02144.1|AC009853_4 (AC009853) unknown protein [Arabidopsis...    77  3e-13
dbj|BAB11680.1| (AB006699) glucosyltransferase-like protein [Ara...    75  2e-12
dbj|BAB05950.1| (AP001514) unknown conserved protein in others [...    74  4e-12
gb|AAD15482.1| (AC006266) putative glucosyltransferase [Arabidop...    72  1e-11
sp|Q47536|YAIP_ECOLI HYPOTHETICAL 44.7 KD PROTEIN IN ADHC-TAUA I...    71  2e-11
pir||T48403 hypothetical protein F17C15.180 - Arabidopsis thalia...    71  2e-11
gb|AAK42537.1| Glucosaminyltransferase, intercellular adhesion p...    70  4e-11

>pir||F75016 hypothetical protein PAB1246 - Pyrococcus abyssi (strain Orsay)
           >gi|5459086|emb|CAB50572.1| (AJ248288) hypothetical
           protein [Pyrococcus abyssi]
           Length = 447
           
 Score =  912 bits (2332), Expect = 0.0
 Identities = 447/447 (100%), Positives = 447/447 (100%)

Query: 1   MMRASIKFQSALYLYILIIIGLALVIPPKYALEIVLIILFLMVSSGSIFYTLLMASLGKR 60
           MMRASIKFQSALYLYILIIIGLALVIPPKYALEIVLIILFLMVSSGSIFYTLLMASLGKR
Sbjct: 1   MMRASIKFQSALYLYILIIIGLALVIPPKYALEIVLIILFLMVSSGSIFYTLLMASLGKR 60

Query: 61  YPYDETGFNLEFLEPLVYVLIPAHNEERVIYKTVRSVLGQDYRNMKVILINDNSTDRTRD 120
           YPYDETGFNLEFLEPLVYVLIPAHNEERVIYKTVRSVLGQDYRNMKVILINDNSTDRTRD
Sbjct: 61  YPYDETGFNLEFLEPLVYVLIPAHNEERVIYKTVRSVLGQDYRNMKVILINDNSTDRTRD 120

Query: 121 IMEEINRKYPRKVVIIDVPPERGRSKPRALNYALEIIEKYMTHPNYVFILDADYLIPPNA 180
           IMEEINRKYPRKVVIIDVPPERGRSKPRALNYALEIIEKYMTHPNYVFILDADYLIPPNA
Sbjct: 121 IMEEINRKYPRKVVIIDVPPERGRSKPRALNYALEIIEKYMTHPNYVFILDADYLIPPNA 180

Query: 181 LKTLVSIMESAPQYVIGIQGNVRPRNFRKNFVTKFITLERLVGFNVAIEGDMKLNENGKY 240
           LKTLVSIMESAPQYVIGIQGNVRPRNFRKNFVTKFITLERLVGFNVAIEGDMKLNENGKY
Sbjct: 181 LKTLVSIMESAPQYVIGIQGNVRPRNFRKNFVTKFITLERLVGFNVAIEGDMKLNENGKY 240

Query: 241 GGTVALLRFPLLIRLGKFREDSVTEDTDLWARAMIAGYRFWYYHGVIGWEEAVETLRDYI 300
           GGTVALLRFPLLIRLGKFREDSVTEDTDLWARAMIAGYRFWYYHGVIGWEEAVETLRDYI
Sbjct: 241 GGTVALLRFPLLIRLGKFREDSVTEDTDLWARAMIAGYRFWYYHGVIGWEEAVETLRDYI 300

Query: 301 KQRSRWAQGHLQVMIDHYWPVMRSCSNIIESFIEHFYMMSYLVPVFWFLSVILNSYLIIT 360
           KQRSRWAQGHLQVMIDHYWPVMRSCSNIIESFIEHFYMMSYLVPVFWFLSVILNSYLIIT
Sbjct: 301 KQRSRWAQGHLQVMIDHYWPVMRSCSNIIESFIEHFYMMSYLVPVFWFLSVILNSYLIIT 360

Query: 361 GAPPLSFARPKLFLSVSIFTFLLFWFSVAYSNWVEKKRHNYYVPWSFVALYPLYFMVFVI 420
           GAPPLSFARPKLFLSVSIFTFLLFWFSVAYSNWVEKKRHNYYVPWSFVALYPLYFMVFVI
Sbjct: 361 GAPPLSFARPKLFLSVSIFTFLLFWFSVAYSNWVEKKRHNYYVPWSFVALYPLYFMVFVI 420

Query: 421 AGVIYTMRGLIRLLVGRLHWEKTKRFT 447
           AGVIYTMRGLIRLLVGRLHWEKTKRFT
Sbjct: 421 AGVIYTMRGLIRLLVGRLHWEKTKRFT 447


>pir||D69769 cellulose synthase homolog ydaM - Bacillus subtilis
           >gi|1881240|dbj|BAA19267.1| (AB001488) FUNCTION UNKNOWN,
           SIMILAR PRODUCT IN MANY BACTERIA. [Bacillus subtilis]
           >gi|2632730|emb|CAB12237.1| (Z99106) similar to
           cellulose synthase [Bacillus subtilis]
           Length = 420
           
 Score =  136 bits (338), Expect = 7e-31
 Identities = 103/377 (27%), Positives = 185/377 (48%), Gaps = 16/377 (4%)

Query: 75  PLVYVLIPAHNEERVIYKTVRSVLGQDYRN--MKVILINDNSTDRTRDIMEEINRKYPRK 132
           P V VLIPAHNEE VI +T+++++   Y    +++I++NDNS+DRT DI+ E + KY   
Sbjct: 49  PKVSVLIPAHNEEVVIRQTLKAMVNLYYPKDRLEIIVVNDNSSDRTGDIVNEFSEKYDFI 108

Query: 133 VVIIDVPPERGRSKPRALNYALEIIEKYMTHPNYVFILDADYLIPPNALKTLVSIMESAP 192
            ++I  PP  G+ K  ALN          ++ + + + DAD      A+  LV  + +  
Sbjct: 109 KMVITKPPNAGKGKSSALNSGFA-----ESNGDVICVYDADNTPEKMAVYYLVLGLMN-D 162

Query: 193 QYVIGIQGNVRPRNFRKNFVTKFITLERLVGFNVAIEGDMKLNENGKYGGTVALLRFPLL 252
           +    + G  R  N  K  +T+FI +E +    +A  G  K  +     GT   +R  ++
Sbjct: 163 EKAGAVVGKFRVINAAKTLLTRFINIETICFQWMAQGGRWKWFKIATIPGTNFAIRRSII 222

Query: 253 IRLGKFREDSVTEDTDLWARAMIAGYRFWYYHGVIGWEEAVETLRDYIKQRSRWAQGHLQ 312
            +LG + + ++ EDT+L  R    GY   ++   I WE+  ET + + +QR+RWA+G+  
Sbjct: 223 EKLGGWDDKALAEDTELTIRVYNLGYHIRFFPAAITWEQEPETWKVWWRQRTRWARGNQY 282

Query: 313 VMIDHYWPVMRSCSNIIESFIEHFYMMSYLVPVFWFLSVILNSYLIITGAPPLSFARPKL 372
           V++       +     I   + +F+   +L   F+F  ++ N+  ++     L  +   L
Sbjct: 283 VVLKFLAQFFKLKRKRIIFDLFYFFFTYFL---FFFGVIMSNAIFVVNLFYDLHLSVGFL 339

Query: 373 FLSVSIFTFLLFWFSVAYSNWVEK---KRHNYYVPWSFVALYPLYFMVFVIAGVIYTMRG 429
            + + I  F LF   V  +  +EK    + N+++ +     Y   ++V VI  +   ++ 
Sbjct: 340 AMILWILAFFLFMTEVMITLSIEKTEMNKQNFFIVFLMYFTYSQAWIVLVIYSLFVEIKH 399

Query: 430 LIRLLVGRLHWEKTKRF 446
             RL    + W KT+R+
Sbjct: 400 --RLFKQEVKWYKTERY 414


>gb|AAK43339.1| Glycosyltransferase, putative [Sulfolobus solfataricus]
           Length = 349
           
 Score =  108 bits (267), Expect = 1e-22
 Identities = 103/370 (27%), Positives = 177/370 (47%), Gaps = 34/370 (9%)

Query: 79  VLIPAHNEERVIYKTVRSVLGQDYRNMK--VILINDNSTDRTRDIMEEINRKYPRKVVII 136
           +++P  NEERV+ + +  ++  +Y   K  +I++ D STDRT  I +E   KY   +   
Sbjct: 1   MIVPVKNEERVLPRLLDRLVNLEYDKSKYEIIVVEDGSTDRTFQICKEYEIKYNNLIRCY 60

Query: 137 DVPPER-GRSKPRALNYALEIIEKYMTHPNYVFILDADYLIPPNALKTLVSIMESAPQYV 195
            +P       K RALN+AL I     +    + I D D +   + L+ +    E     V
Sbjct: 61  SLPRANVPNGKSRALNFALRI-----SKGEIIGIFDGDTVPRLDILEYVEPKFEDIT--V 113

Query: 196 IGIQGNVRPRNFRKNFVTKFITLERLVGFNVAIEGDMKLNENGKYGGTVALLRFPLLIRL 255
             +QG + P N R++  ++   +E L+ +  +I G  K+       GT + +R  +++ L
Sbjct: 114 GAVQGKLVPINVRESVTSRLAAIEELI-YEYSIAGRAKVGLFVPIEGTCSFIRKSIIMEL 172

Query: 256 GKFREDSVTEDTDLWARAMIAGYRFWYYHGVIGWEEAVETLRDYIKQRSRWAQGHLQVMI 315
           G + E S+TED D+  + +  G +  Y    I W E   +LR  I+QR RW +GHL+V +
Sbjct: 173 GGWNEYSLTEDLDISLKIVNKGCKIVYSPTTISWREVPVSLRVLIRQRLRWYRGHLEVQL 232

Query: 316 DHYWPVMRSCSNIIESFIEHFYMMSYLVPVFWFLSVILNSYLIITGAPPLSFARPKLFLS 375
                +     + I   +  F+M+  LV   + L ++ +S L I  A  +S A       
Sbjct: 233 GKLRKIDLRIIDGILIVLTPFFMVLNLVN--YSLVLVYSSSLYIVAASLVSLA------- 283

Query: 376 VSIFTFLLFWFSVAYSNWVEKKRHNYYVPWSFVALYPLYFMVFVIAGVIYTMRGLIRLLV 435
            S+ + LL    +A  + +E   + Y +P SFV      +M F++A + +T   L  +  
Sbjct: 284 -SLLSLLLI-ILIARRHMIE---YFYMIP-SFV------YMNFIVA-LNFTAIFLELIRA 330

Query: 436 GRLHWEKTKR 445
            R+ W KT+R
Sbjct: 331 PRV-WVKTER 339


>gb|AAD52055.1|AF086783_3 (AF086783) IcaA [Staphylococcus aureus]
           Length = 412
           
 Score =  104 bits (258), Expect = 2e-21
 Identities = 107/426 (25%), Positives = 186/426 (43%), Gaps = 45/426 (10%)

Query: 35  VLIILFLMVSSGSIFYTL-LMASLGKRYPYDETGFNLEFLEPLVYVLIPAHNEERVIYKT 93
           V + ++ +V S   ++T  +  SL K+        N++ LE + + L+  +NE   I  T
Sbjct: 12  VFMSIYWIVGSIYFYFTREIRYSLNKK-----PDINVDELEGITF-LLACYNESETIEDT 65

Query: 94  VRSVLGQDYRNMKVILINDNSTDRTRDIMEEINRKYPRKVVIIDVPPERGRSKPRALNYA 153
           + +VL   Y   ++I+IND S+D T +++ +I  K     + +D+   RG  K  ALN  
Sbjct: 66  LSNVLALKYEKKEIIIINDGSSDNTAELIYKI--KENNDFIFVDLQENRG--KANALNQG 121

Query: 154 LEIIEKYMTHPNYVFILDADYLIPPNALKTLVSIMESAPQYVIGIQGNVRPRNFRKNFVT 213
           ++         +YV  LDAD ++  +A   ++   +  P+ +  + GN R RN + + + 
Sbjct: 122 IK-----QASYDYVMCLDADTIVDQDAPYYMIENFKHDPK-LGAVTGNPRIRN-KSSILG 174

Query: 214 KFITLERLVGFNVAIEGDMKLNENGKYGGTVALLRFPLLIRLGKFREDSVTEDTDLWARA 273
           K  T+E                      G   L +   ++ +G +  D +TED  +  + 
Sbjct: 175 KIQTIEYASLIGCIKRSQTLAGAVNTISGVFTLFKKSAVVDVGYWDTDMITEDIAVSWKL 234

Query: 274 MIAGYRFWYYHGVIGWEEAVETLRDYIKQRSRWAQGHLQVMIDHYWPVMRSCSNIIESFI 333
            + GYR  Y    + W    ETL    KQR RWAQG  +V++  ++  M++     + F 
Sbjct: 235 HLRGYRIKYEPLAMCWMLVPETLGGLWKQRVRWAQGGHEVLLRDFFSTMKT-----KRFP 289

Query: 334 EHFYMMSYLVPVFWFLSVILN-SYLIITGAPPLSFARPKLFLSVSIFTFLLFWFSVAYSN 392
            +  M   ++ + W   V+L   YL IT A  L +     F++ S   FLL  F++ + N
Sbjct: 290 LYILMFEQIISILWVYIVLLYLGYLFIT-ANFLDYT----FMTYSFSIFLLSSFTMTFIN 344

Query: 393 WV------------EKKRHNYYVPWSFVALYPLYFMVFVIAGVIYTM-RGLIRLLVGRLH 439
            +            EKK     +   FV+ YP  + +   A V+    + L R   G   
Sbjct: 345 VIQFTVALFIDSRYEKKNMAGLI---FVSWYPTVYWIINAAVVLVAFPKALKRKRGGYAT 401

Query: 440 WEKTKR 445
           W    R
Sbjct: 402 WSSPDR 407


>pir||S77608 probable intercellular adhesion protein A - Staphylococcus
           epidermidis >gi|1161380|gb|AAC06117.1| (U43366) IcaA
           [Staphylococcus epidermidis]
           Length = 412
           
 Score =  100 bits (247), Expect = 3e-20
 Identities = 100/381 (26%), Positives = 170/381 (44%), Gaps = 40/381 (10%)

Query: 80  LIPAHNEERVIYKTVRSVLGQDYRNMKVILINDNSTDRTRDIMEEINRKYPRKVVIIDVP 139
           L+  +NE   +  T+ SVL  +Y   ++I+IND S+D T +I+ +  + +  K V ++V 
Sbjct: 52  LLACYNESETVQDTLSSVLSLEYPEKEIIIINDGSSDNTAEIIYDFKKNHDFKFVDLEV- 110

Query: 140 PERGRSKPRALNYALEIIEKYMTHPNYVFILDADYLIPPNALKTLVSIMESAPQYVIGIQ 199
               R K  ALN  ++          YV  LDAD +I  +A   ++   +  P+ +  + 
Sbjct: 111 ---NRGKANALNEGIK-----QASYEYVMCLDADTVIDDDAPFYMIEDFKKNPK-LGAVT 161

Query: 200 GNVRPRNFRKNFVTKFITLERLVGFNVAIEGDMKLNEN-----GKYGGTVALLRFPLLIR 254
           GN R RN + + + K  T+E       +I G +K +++         G   L +   L  
Sbjct: 162 GNPRIRN-KSSILGKIQTIEY-----ASIIGCIKRSQSLAGAINTISGVFTLFKKSALKD 215

Query: 255 LGKFREDSVTEDTDLWARAMIAGYRFWYYHGVIGWEEAVETLRDYIKQRSRWAQGHLQVM 314
           +G +  D +TED  +  +  +  Y   Y    + W    ET+    KQR RWAQG  +V+
Sbjct: 216 VGYWDTDMITEDIAVSWKLHLFDYEIKYEPRALCWMLVPETIGGLWKQRVRWAQGGHEVL 275

Query: 315 IDHYWPVMRSCSNIIESFIEHFYMMSYLVPVFW-FLSVILNSYLIITGAPPLSFARPKLF 373
           +  +WP +++     +    +  M   +  + W ++ +   S+L+IT A  L +   K  
Sbjct: 276 LRDFWPTIKT-----KKLSLYILMFEQIASITWVYIVLCYLSFLVIT-ANILDYTYLKYS 329

Query: 374 LSVSIF-----TFL-LFWFSVA--YSNWVEKKRHNYYVPWSFVALYPLYFMVFVIAGVIY 425
            S+  F     TF+ +  F+VA    +  EKK     V   F++ YP  + V   A VI 
Sbjct: 330 FSIFFFSSFTMTFINIIQFTVALFIDSRYEKKN---IVGLIFLSWYPTLYWVINAAVVIM 386

Query: 426 TM-RGLIRLLVGRLHWEKTKR 445
              + L R   G   W    R
Sbjct: 387 AFPKALKRKKGGYATWSSPDR 407


>sp|P75905|YCDQ_ECOLI HYPOTHETICAL 50.8 KDA PROTEIN IN PHOH-CSGG INTERGENIC REGION
           >gi|7451878|pir||D64844 probable glycosyltransferase
           ycdQ - Escherichia coli >gi|1787259|gb|AAC74107.1|
           (AE000204) orf, hypothetical protein [Escherichia coli
           K12] >gi|4062586|dbj|BAA35803.1| (D90739) Glycosyl
           transferase (lgtD) homolog [Escherichia coli]
           Length = 441
           
 Score = 96.0 bits (235), Expect = 8e-19
 Identities = 92/360 (25%), Positives = 159/360 (43%), Gaps = 36/360 (10%)

Query: 75  PLVYVLIPAHNEERVIYKTVRSVLGQDYRNMKVILINDNSTDRTRDIMEEINRKYPRKVV 134
           P + ++IP  NEE+ + +T+ + L Q Y N++VI +ND STD+TR I++ +  + P  + 
Sbjct: 75  PSISIIIPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAAQIPH-LR 133

Query: 135 IIDVPPERGRSKPRALNYALEIIEKYMTHPNYVFILDADYLIPPNALKTLVSIMESAPQY 194
           +I +   +G++       A    E       Y+  +D D L+  +A   +V  M   P+ 
Sbjct: 134 VIHLAQNQGKAIALKTGAAAAKSE-------YLVCIDGDALLDRDAAAYIVEPMLYNPR- 185

Query: 195 VIGIQGNVRPRNFRKNFVTKFITLERLVGFNVAIEGDMKLNENGKYG------GTVALLR 248
           V  + GN R R  R   V K       VG   +I G +K  +   YG      G +A  R
Sbjct: 186 VGAVTGNPRIRT-RSTLVGKI-----QVGEYSSIIGLIKRTQR-IYGNVFTVSGVIAAFR 238

Query: 249 FPLLIRLGKFREDSVTEDTDLWARAMIAGYRFWYYHGVIGWEEAVETLRDYIKQRSRWAQ 308
              L  +G + +D +TED D+  +  +  +  +Y    + W    ETL+   KQR RWAQ
Sbjct: 239 RSALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQ 298

Query: 309 GHLQVMIDHYWPVMRSCSNIIESFIEHFYMMSYLVPVFWFLSVILN--SYLIITGAPPLS 366
           G  +V + +   + R      E+F        Y +   W  + ++    Y +     PL+
Sbjct: 299 GGAEVFLKNMTRLWRK-----ENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLN 353

Query: 367 FARPKLFLS----VSIFTFLLFWFSVAYSNWVEKK-RHNYYVPWSFVALYPLYFMVFVIA 421
                +  +    + + T  L  F V  S  +E +  HN      ++  +P+ F +  +A
Sbjct: 354 IELTHIAATHTAGILLCTLCLLQFIV--SLMIENRYEHNLTSSLFWIIWFPVIFWMLSLA 411


>pir||T34632 probable bi-functional transferase/deacetylase - Streptomyces
           coelicolor >gi|5042285|emb|CAB44539.1| (AL078618)
           putative bi-functional transferase/deacetylase
           [Streptomyces coelicolor]
           Length = 743
           
 Score = 92.8 bits (227), Expect = 7e-18
 Identities = 99/381 (25%), Positives = 166/381 (42%), Gaps = 49/381 (12%)

Query: 77  VYVLIPAHNEERVIYKTVRSVLGQDYRNMKVILINDNSTDRTRDIMEEINRKYPRKVVII 136
           V V++PA+NE+  I  T+RS L +    +++I+++D STD T DI E +     R V   
Sbjct: 389 VSVIVPAYNEKECIEATLRS-LARSTHPVEIIVVDDGSTDGTADIAESLGLPGVRVV--- 444

Query: 137 DVPPERGRSKPRALNYALEIIEKYMTHPNY--VFILDADYLIPPNALKTLVSIMESAPQY 194
               +    KP ALN  +        H  Y  V ++D D +  P+ ++ LV     A   
Sbjct: 445 ---RQANAGKPAALNNGVR-------HARYDIVVMMDGDTVFEPDTVRHLVQPF--ADPS 492

Query: 195 VIGIQGNVRPRNFRKNFVTKFITLERLVGFNVAIEGDMKLNENGKYGGTVALLRFPLLIR 254
           V  + GN +  N R+  +  +  +E ++GFN+       L       G +   R   +++
Sbjct: 493 VGAVAGNAKVGN-RRTLIGAWQHIEYVMGFNLDRRMYDLLRCMPTIPGAIGAFRREAVLQ 551

Query: 255 LGKFREDSVTEDTDLWARAMIAGYRFWYYHGVIGWEEAVETLRDYIKQRSRWAQGHLQVM 314
            G   +D++ EDTD+      AG+R  Y      W EA  +L     QR RW+ G +Q +
Sbjct: 552 AGGMSDDTLAEDTDITIALHRAGWRVVYEEHARAWTEAPGSLGQLWSQRYRWSYGTMQAL 611

Query: 315 IDHYWPVMRSCSNIIESFIEHFYMMSYLVPVFWFLSVILNSYLIITGAPPLSFARPKLFL 374
               W   RS ++   S    F  +   +P+     V+L   +    AP +        L
Sbjct: 612 ----WKHRRSLTDKGPS--GRFGRVG--MPL-----VVLFQVVTPVFAPLIDVFTVYSML 658

Query: 375 SVSIFTFLLFWFSV--------AYSNWVEKKRHNY--YVPWSFVALYPLYFMVFVIAGVI 424
            V     LL W +V        AY+  ++++++ Y   +P   +A   + ++V + + V 
Sbjct: 659 FVDFRAALLAWLAVLGVQLVCAAYAFRLDREKYRYLLMMPLQQLAYRQMMYLVLIHSCV- 717

Query: 425 YTMRGLIRLLVGRLHWEKTKR 445
                   L  GRL W+K KR
Sbjct: 718 ------TALTGGRLRWQKLKR 732


>gb|AAB66590.1| (U22837) HmsR [Yersinia pestis]
           Length = 457
           
 Score = 87.8 bits (214), Expect = 2e-16
 Identities = 93/389 (23%), Positives = 171/389 (43%), Gaps = 44/389 (11%)

Query: 75  PLVYVLIPAHNEERVIYKTVRSVLGQDYRNMKVILINDNSTDRTRDIMEEINRKYPRKVV 134
           PLV +L+P  NE     +T+ + L Q Y N++VI IND S+D T  +++ +  + PR + 
Sbjct: 88  PLVSILVPCFNEGLNARETIHAALAQTYTNIEVIAINDGSSDDTAQVLDALLAEDPR-LR 146

Query: 135 IIDVPPERGRSKPRALNYALEIIEKYMTHPNYVFILDADYLIPPNALKTLVSIMESAPQY 194
           +I +   +G++    +  A            Y+  +D D L+  NA+  LV+ + + P+ 
Sbjct: 147 VIHLAHNQGKAIALRMGAA-------AARSEYLVCIDGDALLDKNAVPYLVAPLIANPR- 198

Query: 195 VIGIQGNVRPRNFRKNFVTKFITLERL-VGFNVAIEGDMKLNENGKYG------GTVALL 247
              + GN R R       T+   + R+ VG   +I G +K  +   YG      G VA  
Sbjct: 199 TGAVTGNPRIR-------TRSTLIGRVQVGEFSSIIGLIKRTQR-VYGQVFTVSGVVAAF 250

Query: 248 RFPLLIRLGKFREDSVTEDTDLWARAMIAGYRFWYYHGVIGWEEAVETLRDYIKQRSRWA 307
           R   L  +G +  D +TED D+  +  +  +  ++    + W    ETLR   KQR RWA
Sbjct: 251 RRRALADVGYWSPDMITEDIDISWKLQLKHWSVFFEPRGLCWILMPETLRGLWKQRLRWA 310

Query: 308 QGHLQVMIDHYWPVMRSCSNIIESFIEHFYMMSYLVPVFWFLSVILNSYLIITG----AP 363
           QG  +V + + + + R  +  +         + Y + + W  + + +  L + G     P
Sbjct: 311 QGGAEVFLKNMFKLWRWRNRRM-----WLLFLEYSLSITWAFTYLFSITLYLLGLVITLP 365

Query: 364 P---LSFARPKLFLSVSIFTFLLFWFSVAY---SNWVEKKRHNYYVPWSFVALYPL-YFM 416
           P   +    P  F  + +    L  F+++      +  K  H+ +    ++  YP+ Y+M
Sbjct: 366 PGIHVQSVFPPAFTGMVLALTCLLQFAISLVIERRYEPKLGHSLF----WIIWYPMVYWM 421

Query: 417 VFVIAGVIYTMRGLIRLLVGRLHWEKTKR 445
           + +   V+   + ++     R  W    R
Sbjct: 422 LNLFTTVVSFPKVMLITKRKRARWVSPDR 450


>pir||T47005 hypothetical protein hmsR [imported] - Yersinia pestis
           >gi|4106593|emb|CAA21348.1| (AL031866) ORF25, len: 457
           aa, hmsR, 99,8% identity with Yersinia pestis hemin
           binding protein Q56941, Fasta scores opt: 3002, E(): 0
           Length = 457
           
 Score = 87.8 bits (214), Expect = 2e-16
 Identities = 93/389 (23%), Positives = 171/389 (43%), Gaps = 44/389 (11%)

Query: 75  PLVYVLIPAHNEERVIYKTVRSVLGQDYRNMKVILINDNSTDRTRDIMEEINRKYPRKVV 134
           PLV +L+P  NE     +T+ + L Q Y N++VI IND S+D T  +++ +  + PR + 
Sbjct: 88  PLVSILVPCFNEGLNARETIHAALAQTYTNIEVIAINDGSSDDTAQVLDALLAEDPR-LR 146

Query: 135 IIDVPPERGRSKPRALNYALEIIEKYMTHPNYVFILDADYLIPPNALKTLVSIMESAPQY 194
           +I +   +G++    +  A            Y+  +D D L+  NA+  LV+ + + P+ 
Sbjct: 147 VIHLAHNQGKAIALRMGAA-------AARSEYLVCIDGDALLDKNAVPYLVAPLIANPR- 198

Query: 195 VIGIQGNVRPRNFRKNFVTKFITLERL-VGFNVAIEGDMKLNENGKYG------GTVALL 247
              + GN R R       T+   + R+ VG   +I G +K  +   YG      G VA  
Sbjct: 199 TGAVTGNPRIR-------TRSTLIGRVQVGEFSSIIGLIKRTQR-VYGQVFTVSGVVAAF 250

Query: 248 RFPLLIRLGKFREDSVTEDTDLWARAMIAGYRFWYYHGVIGWEEAVETLRDYIKQRSRWA 307
           R   L  +G +  D +TED D+  +  +  +  ++    + W    ETLR   KQR RWA
Sbjct: 251 RRRALADVGYWSPDMITEDIDISWKLQLKHWSVFFEPRGLCWILMPETLRGLWKQRLRWA 310

Query: 308 QGHLQVMIDHYWPVMRSCSNIIESFIEHFYMMSYLVPVFWFLSVILNSYLIITG----AP 363
           QG  +V + + + + R  +  +         + Y + + W  + + +  L + G     P
Sbjct: 311 QGGAEVFLKNMFKLWRWRNRRM-----WLLFLEYSLSITWAFTYLFSITLYLLGLVITLP 365

Query: 364 P---LSFARPKLFLSVSIFTFLLFWFSVAY---SNWVEKKRHNYYVPWSFVALYPL-YFM 416
           P   +    P  F  + +    L  F+++      +  K  H+ +    ++  YP+ Y+M
Sbjct: 366 PGIHVQSVFPPAFTGMVLALTCLLQFAISLVIERRYEPKLGHSLF----WIIWYPMVYWM 421

Query: 417 VFVIAGVIYTMRGLIRLLVGRLHWEKTKR 445
           + +   V+   + ++     R  W    R
Sbjct: 422 LNLFTTVVSFPKVMLITKRKRARWVSPDR 450


>gb|AAC98402.1| (L39794) WbbF [Plasmid pWQ799]
           Length = 459
           
 Score = 87.4 bits (213), Expect = 3e-16
 Identities = 89/315 (28%), Positives = 144/315 (45%), Gaps = 38/315 (12%)

Query: 30  YALEIVLIILFLMV----------SSGSIFYTLLMASLGKRYPYDETGFNLEFLEPLVYV 79
           Y ++IV  +L+++V          SS SIF ++   +  K+  Y +      FL     +
Sbjct: 4   YIIDIVEYVLYVLVTAMTWYLFALSSYSIFLSVFGFAKNKK-DYPDCPPEARFL-----I 57

Query: 80  LIPAHNEERVIYKTVRSV--LGQDYRNMKVILINDNSTDRTRDIMEEINRKYPRKVVIID 137
           L+ AHNEE VI  T+ ++  +  D +   ++++NDNSTDRT  I +    K+   V  I+
Sbjct: 58  LVAAHNEEAVIGSTLINLKNIQYDKKLFDIVVVNDNSTDRTGLICDSHEVKH---VDTIE 114

Query: 138 VPPER-GRSKPRALNYALEIIEKYMTHPNY--VFILDADYLIPPNALKTLVS--IMESAP 192
              ER G  KP  + YAL  +       NY  V +LDAD  +  N L  L S  I +  P
Sbjct: 115 GEFEREGVGKPAGIQYALRKLGFETVKENYDLVMVLDADNFVDANILTELNSQWISKDKP 174

Query: 193 QYVIGIQGNVRPRNFRK----NFVTKFITLERLVGFNVAIEGDMKLNENGKYGGTVALLR 248
           +    IQ  +  +N        + T +  + R    +       +L      GGT  ++ 
Sbjct: 175 E---AIQAYLDCKNSTSLLSFGYCTSYWMMNRFFQLS-----KYRLGLPNAIGGTGFVVS 226

Query: 249 FPLLIRLGKFREDSVTEDTDLWARAMIAGYRFWYYHGVIGWEEAVETLRDYIKQRSRWAQ 308
              LI  G F   S+TED +L    +    R  + H V  ++E  + LR  +KQR RW++
Sbjct: 227 SNFLINTGGFCFKSLTEDIELEIEIVRKRGRVLWNHNVRVYDEKPDNLRISLKQRYRWSK 286

Query: 309 GHLQVMIDHYWPVMR 323
           GH  V   +++ + +
Sbjct: 287 GHWYVAFTNFFNLFK 301


>emb|CAB72208.1| (AL138851) putative bi-functional transferase/deacetylase
           [Streptomyces coelicolor A3(2)]
           Length = 734
           
 Score = 85.4 bits (208), Expect = 1e-15
 Identities = 69/241 (28%), Positives = 113/241 (46%), Gaps = 15/241 (6%)

Query: 77  VYVLIPAHNEERVIYKTVRSVLGQDYRNMKVILINDNSTDRTRDIMEEINRKYPRKVVII 136
           V VL+PA+NE + I  TVRS++  D+  ++VI+I+D S+D T  I+E +     R +  +
Sbjct: 362 VTVLVPAYNEAKCIENTVRSLVASDHP-VEVIVIDDGSSDGTARIVEGLGLPGVRVIRQL 420

Query: 137 DVPPERGRSKPRALNYALEIIEKYMTHPNYVFILDADYLIPPNALKTLVSIMESAPQYVI 196
           +        KP ALN  L          + V ++D D +  P+ ++ LV         V 
Sbjct: 421 NA------GKPAALNRGLA-----NARYDIVVMMDGDTVFEPSTVRELVQPF--GDPRVG 467

Query: 197 GIQGNVRPRNFRKNFVTKFITLERLVGFNVAIEGDMKLNENGKYGGTVALLRFPLLIRLG 256
            + GN +  N + + +  +  +E ++GFN+       L       G V   R   L  +G
Sbjct: 468 AVAGNAKVGN-KDSLIGAWQHIEYVMGFNLDRRMYDVLGCMPTIPGAVGAFRRSALEPIG 526

Query: 257 KFREDSVTEDTDLWARAMIAGYRFWYYHGVIGWEEAVETLRDYIKQRSRWAQGHLQVMID 316
              +D++ EDTD+      AG+R  Y      W EA E++     QR RW+ G +Q +  
Sbjct: 527 GMSDDTLAEDTDVTMALHRAGWRVVYAENARAWTEAPESVGQLWSQRYRWSYGTMQAIWK 586

Query: 317 H 317
           H
Sbjct: 587 H 587


>pir||T05111 hypothetical protein F28M20.220 - Arabidopsis thaliana
           >gi|3281868|emb|CAA19764.1| (AL031004) putative protein
           [Arabidopsis thaliana] >gi|7270062|emb|CAB79877.1|
           (AL161579) putative protein [Arabidopsis thaliana]
           Length = 692
           
 Score = 80.0 bits (194), Expect = 5e-14
 Identities = 95/406 (23%), Positives = 175/406 (42%), Gaps = 37/406 (9%)

Query: 4   ASIKFQSALYLYIL--IIIGLALVIPPKYALEIVLIILFLMVSSGSIFYTL--LMASLGK 59
           ++++ QS  +L  +  + +    + PP  AL    I+LFL+ S   +   L        K
Sbjct: 145 STLEIQSLFHLVYVGWLTLRADYIAPPIKALSKFCIVLFLIQSVDRLVLCLGCFWIKYKK 204

Query: 60  RYP-YDETGFNLEFLE------PLVYVLIPAHNEERVIYKTVRSVLGQDYRNMKVIL--I 110
             P +DE  F  +  E      P+V V IP  NE  V  +++ +V   D+   ++++  +
Sbjct: 205 IKPRFDEEPFRNDDAEGSGSEYPMVLVQIPMCNEREVYEQSISAVCQLDWPKDRILVQVL 264

Query: 111 NDNSTDRTRDIMEEINRKYPRKVVIIDVPPERGRSKPRALNYALEIIEKYMTHPNYVFIL 170
           +D++ +  + +++    K+ +K V I       R+  +A N    +   Y+    YV I 
Sbjct: 265 DDSNDESIQQLIKAEVAKWSQKGVNIIYRHRLVRTGYKAGNLKSAMSCDYVEAYEYVAIF 324

Query: 171 DADYLIPPNALKTLVSIMESAPQYVIGIQGNVRPRNFRKNFVTKFITLERLVGFNVAIEG 230
           DAD+   P+ LK  V   +  P+  + +Q      N  +N +T      RL   N+    
Sbjct: 325 DADFQPTPDFLKLTVPHFKDNPELGL-VQARWTFVNKDENLLT------RLQNINLCFHF 377

Query: 231 DMKLNENGKY------GGTVALLRFPLLIRLGKFREDSVTEDTDLWARAMIAGYRFWYYH 284
           +++   NG +       GT  + R   L   G + E +  ED D+  RA + G++F Y +
Sbjct: 378 EVEQQVNGVFLNFFGFNGTAGVWRIKALEESGGWLERTTVEDMDIAVRAHLHGWKFIYLN 437

Query: 285 GVIGWEEAVETLRDYIKQRSRWAQGHLQVMIDHYWPVMRSCSNIIESFIEHFYMMSYLVP 344
            V    E  E+   Y KQ+ RW  G +Q        + R C   I +     +  + L+ 
Sbjct: 438 DVKVLCEVPESYEAYKKQQHRWHSGPMQ--------LFRLCLGSILTSKIAIWKKANLIL 489

Query: 345 VFWFLSVIL---NSYLIITGAPPLSFARPKLFLSVSIFTFLLFWFS 387
           +F+ L  ++    S+ +     PL+   P+  L V +  ++  + S
Sbjct: 490 LFFLLRKLILPFYSFTLFCIILPLTMFVPEAELPVWVICYIPVFMS 535


>gb|AAD23884.1|AC006954_5 (AC006954) putative glucosyltransferase [Arabidopsis thaliana]
           Length = 690
           
 Score = 79.2 bits (192), Expect = 9e-14
 Identities = 94/407 (23%), Positives = 176/407 (43%), Gaps = 37/407 (9%)

Query: 4   ASIKFQSALYLYILIIIGLAL--VIPPKYALEIVLIILFLMVSSGSIFYTL--LMASLGK 59
           + ++ QS L+L+ +  + L    + PP  AL    I+LFL+ S   +   L  L     K
Sbjct: 145 SKLEIQSLLHLFYVGWLSLRADYIAPPIKALSKFCIVLFLVQSVDRLILCLGCLWIKFKK 204

Query: 60  RYP-YDETGFNLEFLE------PLVYVLIPAHNEERVIYKTVRSVLGQDYRNMKVIL--I 110
             P  DE  F  +  E      P+V V IP  NE  V  +++ +V   D+   ++++  +
Sbjct: 205 IKPRIDEEHFRNDDFEGSGSEYPMVLVQIPMCNEREVYEQSISAVCQLDWPKDRLLVQVL 264

Query: 111 NDNSTDRTRDIMEEINRKYPRKVVIIDVPPERGRSKPRALNYALEIIEKYMTHPNYVFIL 170
           +D+  +  ++++ +   K+ +K V I       R+  +A N    +   Y+    +V I 
Sbjct: 265 DDSDDESIQELIRDEVTKWSQKGVNIIYRHRLVRTGYKAGNLKSAMSCDYVEAYEFVAIF 324

Query: 171 DADYLIPPNALKTLVSIMESAPQYVIGIQGNVRPRNFRKNFVTKFITLERLVGFNVAIEG 230
           DAD+    + LK  V   +  P+  + +Q      N  +N +T      RL   N+    
Sbjct: 325 DADFQPNSDFLKLTVPHFKEKPELGL-VQARWAFVNKDENLLT------RLQNINLCFHF 377

Query: 231 DMKLNENGKY------GGTVALLRFPLLIRLGKFREDSVTEDTDLWARAMIAGYRFWYYH 284
           +++   NG +       GT  + R   L   G + E +  ED D+  RA + G++F Y +
Sbjct: 378 EVEQQVNGVFLNFFGFNGTAGVWRIKALEESGGWLERTTVEDMDIAVRAHLHGWKFIYLN 437

Query: 285 GVIGWEEAVETLRDYIKQRSRWAQGHLQVMIDHYWPVMRSCSNIIESFIEHFYMMSYLVP 344
            V    E  E+   Y KQ+ RW  G +Q        + R C   I +     +  + L+ 
Sbjct: 438 DVKVLCEVPESYEAYKKQQHRWHSGPMQ--------LFRLCLRSILTSKIAMWKKANLIL 489

Query: 345 VFWFLSVIL---NSYLIITGAPPLSFARPKLFLSVSIFTFLLFWFSV 388
           +F+ L  ++    S+ +     P++   P+  L + +  ++  + S+
Sbjct: 490 LFFLLRKLILPFYSFTLFCVILPITMFVPEAELPIWVICYVPIFMSL 536


>pir||S75693 hypothetical protein sll1377 - Synechocystis sp. (strain PCC 6803)
           >gi|1653339|dbj|BAA18254.1| (D90912) hypothetical
           protein [Synechocystis sp.]
           Length = 479
           
 Score = 77.6 bits (188), Expect = 3e-13
 Identities = 75/278 (26%), Positives = 125/278 (43%), Gaps = 22/278 (7%)

Query: 75  PLVYVLIPAHNEERVIYKTVRSVLGQDYRNMK--VILINDNSTDRTRDIMEEINRKYPRK 132
           P V +++ A NEE VI K V+ +   DY   +  V +++DNSTDRT  I++++ ++YP+ 
Sbjct: 108 PQVCLMVAAKNEEAVIGKIVQQLCSLDYPGDRHEVWIVDDNSTDRTPAILDQLRQQYPQL 167

Query: 133 VVIIDVPPERGRSKPRALNYALEIIEKYMTHPNYVFILDADYLIPPNALKTLVSIMESAP 192
            V+       G  K  ALN  L       T  + V + DAD  +P + L+ +V      P
Sbjct: 168 KVVRRGAGASG-GKSGALNEVLA-----QTQGDIVGVFDADANVPKDLLRRVV------P 215

Query: 193 QYVIGIQGNVRPRNFRKNFVTKFITLERLVGFNVAIEGDMKLNE-----NGKYGGTVALL 247
            +     G ++ R    N    F T  R  G  +A++   +         G+  G    +
Sbjct: 216 YFASPTFGALQVRKAIANEAVNFWT--RGQGAEMALDAYFQQQRIVTGGIGELRGNGQFV 273

Query: 248 RFPLLIRLGKFREDSVTEDTDLWARAMIAGYRFWYYHGVIGWEEAVETLRDYIKQRSRWA 307
               L  +G + E ++T+D DL  R  +  ++          EE V T      QR+RWA
Sbjct: 274 ARQALDAVGGWNEQTITDDLDLTIRLHLHQWKVGILVNPPVEEEGVTTAIALWHQRNRWA 333

Query: 308 QGHLQVMIDHY-WPVMRSCSNIIESFIEHFYMMSYLVP 344
           +G  Q  +D++ W   +      +  +  F +M YL+P
Sbjct: 334 EGGYQRYLDYWRWICTQPMGWKKKLDLFSFLLMQYLLP 371


>gb|AAF02144.1|AC009853_4 (AC009853) unknown protein [Arabidopsis thaliana]
           Length = 682
           
 Score = 77.3 bits (187), Expect = 3e-13
 Identities = 83/348 (23%), Positives = 155/348 (43%), Gaps = 27/348 (7%)

Query: 20  IGLALVIPPKYALEIVLIILFLMVSSGSIFYTL---------LMASLGKRYPYDETGFNL 70
           I  + + PP  +L  V I+LFL+ S   +   L         +       YP    G  +
Sbjct: 156 IRASYLAPPLQSLTNVCIVLFLIQSVDRLVLVLGCFWIKLRRIKPVASMEYPTKLVGEGV 215

Query: 71  EFLE-PLVYVLIPAHNEERVIYKTVRSVLGQDY--RNMKVILINDNSTDRTRDIMEEINR 127
              + P+V V IP  NE+ V  +++ +V   D+    M V +++D+S    + +++   +
Sbjct: 216 RLEDYPMVIVQIPMCNEKEVYQQSIGAVCMLDWPRERMLVQVLDDSSELDVQQLIKAEVQ 275

Query: 128 KYPRKVVIIDVPPERGRSKPRALNYALEIIEKYMTHPNYVFILDADYLIPPNALKTLVSI 187
           K+ ++ V I       R+  +A N    +  +Y+    +V I DAD+  P + LK  V  
Sbjct: 276 KWQQRGVRIVYRHRLIRTGYKAGNLKAAMNCEYVKDYEFVAIFDADFQPPADFLKKTVPH 335

Query: 188 MESAPQYVIGIQGNVRPRNFRKNFVTKFITLERLVGFNVAIEGDMKLNENGKY------G 241
            +   +  + +Q      N  +N +T      RL   N++   +++   NG +       
Sbjct: 336 FKGNEELAL-VQTRWAFVNKDENLLT------RLQNINLSFHFEVEQQVNGVFINFFGFN 388

Query: 242 GTVALLRFPLLIRLGKFREDSVTEDTDLWARAMIAGYRFWYYHGVIGWEEAVETLRDYIK 301
           GT  + R   L   G + E +  ED D+  RA + G++F Y + V    E  E+   Y K
Sbjct: 389 GTAGVWRIKALEDCGGWLERTTVEDMDIAVRAHLCGWKFIYLNDVKCLCELPESYEAYKK 448

Query: 302 QRSRWAQGHLQVMIDHYWPVMRSCSNIIE--SFIEHFYMMSYLVPVFW 347
           Q+ RW  G +Q+    ++ ++RS  +  +  + I  F+++  L+  F+
Sbjct: 449 QQYRWHSGPMQLFRLCFFDILRSKVSAAKKANMIFLFFLLRKLILPFY 496


>dbj|BAB11680.1| (AB006699) glucosyltransferase-like protein [Arabidopsis thaliana]
           Length = 534
           
 Score = 74.9 bits (181), Expect = 2e-12
 Identities = 81/350 (23%), Positives = 148/350 (42%), Gaps = 31/350 (8%)

Query: 12  LYLYILIIIGLALVIPPKY-ALEIVLIILFLMVSSGSIFYTLLMASLGKRYPYDETGFNL 70
           L +YI +++ + L+    Y  + IVL+ LF                  KRY ++    + 
Sbjct: 43  LAVYICLLMSVMLLCERVYMGIVIVLVKLFWKKPD-------------KRYKFEPIHDDE 89

Query: 71  EFLE---PLVYVLIPAHNEERVIYKTVRSVLGQDYRNMKVIL-INDNSTDRTRDIMEEIN 126
           E      P+V V IP  NE  V   ++ +  G  + + ++++ + D+STD T   M E+ 
Sbjct: 90  ELGSSNFPVVLVQIPMFNEREVYKLSIGAACGLSWPSDRLVIQVLDDSTDPTVKQMVEVE 149

Query: 127 -RKYPRKVVII--DVPPERGRSKPRALNYALEIIEKYMTHPNYVFILDADYLIPPNALKT 183
            +++  K + I   +   R   K  AL   L+    Y+ H  YV I DAD+   P+ L+ 
Sbjct: 150 CQRWASKGINIRYQIRENRVGYKAGALKEGLK--RSYVKHCEYVVIFDADFQPEPDFLRR 207

Query: 184 LVSIMESAPQYVIGIQGNVRPRNFRKNFVTKFITLERLVGFNVAIEGDMKLNENGKYGGT 243
            +  +   P   + +Q   R  N  +  +T+   +     F V  E     +    + GT
Sbjct: 208 SIPFLMHNPNIAL-VQARWRFVNSDECLLTRMQEMSLDYHFTVEQEVGSSTHAFFGFNGT 266

Query: 244 VALLRFPLLIRLGKFREDSVTEDTDLWARAMIAGYRFWYYHGVIGWEEAVETLRDYIKQR 303
             + R   +   G +++ +  ED DL  RA + G++F Y   +    E   T R +  Q+
Sbjct: 267 AGIWRIAAINEAGGWKDRTTVEDMDLAVRASLRGWKFLYLGDLQVKSELPSTFRAFRFQQ 326

Query: 304 SRWAQGHLQVMIDHYWPVMRSCSNIIESFIEHFYMMSYLVPVFWFLSVIL 353
            RW+ G   +         +    I+ +    F+   Y++  F+F+  I+
Sbjct: 327 HRWSCGPANLF-------RKMVMEIVRNKKVRFWKKVYVIYSFFFVRKII 369


>dbj|BAB05950.1| (AP001514) unknown conserved protein in others [Bacillus
           halodurans]
           Length = 482
           
 Score = 73.7 bits (178), Expect = 4e-12
 Identities = 97/371 (26%), Positives = 156/371 (41%), Gaps = 59/371 (15%)

Query: 57  LGKRYPYDETGFNLEFLEPLVYVLIPAHNEERVIYKTVRSVLGQDYRNMKVILINDNSTD 116
           L K+  YD+    L + +P V +L+PA+NEE  I +TVRS+L   Y   +++++ND STD
Sbjct: 48  LNKQEVYDDY-LELTYTKP-VSILVPAYNEETGIIETVRSLLSLKYPQTEIVVVNDGSTD 105

Query: 117 RTRD-IMEEINRKYPRKVV--IIDVPPERG-----------------RSKPRALNYALEI 156
           +T + I+E        KV+   I+  P +G                   K  ALN  L  
Sbjct: 106 QTLEVIIEHFQMVKVGKVIRKQIETEPIKGVYQSTIFPHLLLVDKSNGGKADALNAGLN- 164

Query: 157 IEKYMTHPNYVFILDADYLIPPNA-LKTLVSIMESA--PQYVIGIQGNVRPRN------- 206
           + KY     Y   +D D ++  +A LK +  I+ S      VI   GNVR  N       
Sbjct: 165 VSKY----PYFCSIDGDSILETDALLKVMKPIVTSRDDEDEVIASGGNVRIANGSDIQMG 220

Query: 207 ------FRKNFVTKFITLERLVGFNVAIEGDMKLNENGKYGGTVALLRFPLLIRLGKFRE 260
                   KN +     +E L  F +   G  + N      G  ++     ++  G + +
Sbjct: 221 SVLSVQLAKNPLVVMQVIEYLRAFLMGRIGLSRHNMVLIISGAFSVFAKKWVMEAGGYSK 280

Query: 261 DSVTEDTDLWAR------AMIAGYRFWYYHGVIGWEEAVETLRDYIKQRSRWAQGHLQVM 314
            +V ED +L  R            R  +    + W EA  T R   +QRSRW +G ++ +
Sbjct: 281 KTVGEDMELVVRLHRLVKEKRLKKRITFVPDPVCWTEAPATFRVLQRQRSRWHRGLMESL 340

Query: 315 IDHYWPVMRSCSNII-ESFIEHFYMMSYLVPVFWFLSVILNSYLIITGAPPLSFARPKLF 373
             H          ++  + I +F+++ +  PV     V L  YL I      +F    L+
Sbjct: 341 WLHRGMTFNPKYGLVGTASIPYFWIVEFFGPV-----VELMGYLYIV----FAFFFGGLY 391

Query: 374 LSVSIFTFLLF 384
           +  ++  FLLF
Sbjct: 392 VEFALALFLLF 402


>gb|AAD15482.1| (AC006266) putative glucosyltransferase [Arabidopsis thaliana]
           >gi|7267435|emb|CAB77947.1| (AL161508) putative
           glucosyltransferase [Arabidopsis thaliana]
           Length = 699
           
 Score = 72.2 bits (174), Expect = 1e-11
 Identities = 81/344 (23%), Positives = 149/344 (42%), Gaps = 16/344 (4%)

Query: 18  IIIGLALVIPPKYALEIVLIILFLMVSSGSIFYTL--LMASLGKRYPYD--------ETG 67
           +++ +  + PP   L    I+LFL+ S   +   L        K  P          E+G
Sbjct: 175 VLLRVEYLAPPLQFLANGCIVLFLVQSLDRLILCLGCFWIRFKKIKPVPKPDSISDLESG 234

Query: 68  FNLEFLEPLVYVLIPAHNEERVIYKTVRSVLGQDYRNMKVIL--INDNSTDRTRDIMEEI 125
            N  FL P+V V IP  NE+ V  +++ +V   D+   K+++  ++D+    T+ +++E 
Sbjct: 235 DNGAFL-PMVLVQIPMCNEKEVYQQSIAAVCNLDWPKGKILIQILDDSDDPITQSLIKEE 293

Query: 126 NRKYPRKVVIIDVPPERGRSKPRALNYALEIIEKYMTHPNYVFILDADYLIPPNALKTLV 185
             K+ +    I       R   +A N    +   Y+    +V I DAD+   P+ LK  +
Sbjct: 294 VHKWQKLGARIVYRHRVNREGYKAGNLKSAMNCSYVKDYEFVAIFDADFQPLPDFLKKTI 353

Query: 186 SIMESAPQYVIGIQGNVRPRNFRKNFVTKFITLERLVGFNVAIEGDMKLNENGKYGGTVA 245
              +   +  + +Q      N  +N +T+   +     F V  + +        + GT  
Sbjct: 354 PHFKDNEEIGL-VQARWSFVNKEENLLTRLQNINLAFHFEVEQQVNSVFLNFFGFNGTAG 412

Query: 246 LLRFPLLIRLGKFREDSVTEDTDLWARAMIAGYRFWYYHGVIGWEEAVETLRDYIKQRSR 305
           + R   L   G + E +  ED D+  RA + G++F + + V    E  E+   Y KQ+ R
Sbjct: 413 VWRIKALEDSGGWLERTTVEDMDIAVRAHLHGWKFVFLNDVECQCELPESYEAYRKQQHR 472

Query: 306 WAQGHLQVMIDHYWPVMRSCSNIIESF--IEHFYMMSYLVPVFW 347
           W  G +Q+       V++S  +I + F  I  F+++  L+  F+
Sbjct: 473 WHSGPMQLFRLCLPAVIKSKISIGKKFNLIFLFFLLRKLILPFY 516


>sp|Q47536|YAIP_ECOLI HYPOTHETICAL 44.7 KD PROTEIN IN ADHC-TAUA INTERGENIC REGION
           >gi|7466632|pir||C64764 membrane protein yaiP -
           Escherichia coli >gi|1657558|gb|AAB18086.1| (U73857)
           44.8 kD hypothetical protein [Escherichia coli]
           >gi|1786560|gb|AAC73466.1| (AE000143) polysaccharide
           metabolism [Escherichia coli K12]
           Length = 398
           
 Score = 71.4 bits (172), Expect = 2e-11
 Identities = 90/377 (23%), Positives = 151/377 (39%), Gaps = 66/377 (17%)

Query: 80  LIPAHNEERVIYKTVRSVLGQDYRNMKVILINDNSTDRTRDIMEEINRKY-PRKVVIIDV 138
           +IPA+NE   + +++ ++L   Y   +VI +ND STD T  +M E+ RK+  R V +   
Sbjct: 35  IIPAYNEGPCLAQSLDNLLRNPYF-CRVICVNDGSTDNTEAVMAEVKRKWGDRFVAVTQK 93

Query: 139 PPERGRSKPRALNYALEIIEKYMTHPNYVFILDADYLIPP--NALKTLVSIMESAPQYVI 196
              +G +    LNYA           + VF+ DAD  +PP  + +  +++ +E     V 
Sbjct: 94  NTGKGGALMNGLNYAT---------CDQVFLSDADTYVPPDQDGMGYMLAEIERGADAVG 144

Query: 197 GIQGNVRP---------RNFRKNFVTKFITLERLVGFNVAIEGDMKLNENGKYGGTVALL 247
           GI                  +   +    TL++L+G    I             G   + 
Sbjct: 145 GIPSTALKGAGLLPHIRATVKLPMIVMKRTLQQLLGGAPFI-----------ISGACGMF 193

Query: 248 RFPLLIRLGKFREDSVTEDTDLWARAMIAGYRFWYYHGVIGWEEAVETLRDYIKQRSRWA 307
           R  +L + G F + +  ED DL    +  GYR    +  I + +   + R+  ++  RW 
Sbjct: 194 RTDVLRKFG-FSDRTKVEDLDLTWTLVANGYRIRQANRCIVYPQECNSPREEWRRWRRWI 252

Query: 308 QGHLQVMIDHYWPVMRSCSNIIESFIEHFYMMSYLVPVFWFLSVILNSYL--IITGAPPL 365
            G        Y   MR    ++ S    F +   L+ V + + + L ++    IT  P  
Sbjct: 253 VG--------YAVCMRLHKRLLFSRFGIFSIFPMLLVVLYGVGIYLTTWFNEFITTGPH- 303

Query: 366 SFARPKLFLSVSIFTFLLFWFSV-----AYSNWVEKKRHNYYVPWSFVALYPLYFMVFVI 420
                     V +  F L W  V     A+S W       ++  W  V L PL  +  ++
Sbjct: 304 ---------GVVLAMFPLIWVGVVCVIGAFSAW-------FHRCWLLVPLAPLSVVYVLL 347

Query: 421 AGVIYTMRGLIRLLVGR 437
           A  I+ + GLI    GR
Sbjct: 348 AYAIWIIYGLIAFFTGR 364


>pir||T48403 hypothetical protein F17C15.180 - Arabidopsis thaliana
           >gi|7340661|emb|CAB82941.1| (AL162506) putative protein
           [Arabidopsis thaliana] >gi|9758004|dbj|BAB08601.1|
           (AB005235) glucosyltransferase-like protein [Arabidopsis
           thaliana]
           Length = 533
           
 Score = 71.4 bits (172), Expect = 2e-11
 Identities = 73/315 (23%), Positives = 142/315 (44%), Gaps = 18/315 (5%)

Query: 59  KRYPYDETGFNLEF---LEPLVYVLIPAHNEERVIYKTVRSVLGQDYRNMKVIL-INDNS 114
           KR+ Y+    ++E      P+V + IP  NE  V   ++ +  G  + + ++++ + D+S
Sbjct: 78  KRFKYEPIKDDIELGNSAYPMVLIQIPMFNEREVYQLSIGAACGLSWPSDRIVIQVLDDS 137

Query: 115 TDRT-RDIMEEINRKYPRKVVII--DVPPERGRSKPRALNYALEIIEKYMTHPNYVFILD 171
           TD T +D++E    ++  K V I  ++   R   K  AL   ++  + Y+   +YV I D
Sbjct: 138 TDPTIKDLVEMECSRWASKGVNIKYEIRDNRNGYKAGALKEGMK--KSYVKSCDYVAIFD 195

Query: 172 ADYLIPPNALKTLVSIMESAPQYVIGIQGNVRPRNFRKNFVTKFITLERLVGFNVAIEGD 231
           AD+    + L   V  +   P+  + +Q   +  N  +  +T+   +     F V  E  
Sbjct: 196 ADFQPEADFLWRTVPYLLHNPKLAL-VQARWKFVNSDECLMTRMQEMSLDYHFTVEQEVG 254

Query: 232 MKLNENGKYGGTVALLRFPLLIRLGKFREDSVTEDTDLWARAMIAGYRFWYYHGVIGWEE 291
                   + GT  + R   L   G +++ +  ED DL  RA + G++F Y   +    E
Sbjct: 255 SSTYAFFGFNGTAGIWRISALNEAGGWKDRTTVEDMDLAVRASLKGWKFLYLGSLKVKNE 314

Query: 292 AVETLRDYIKQRSRWAQGHLQVMIDHYWPVMRSCS-------NIIESFIEHFYMMSYLVP 344
              T + Y  Q+ RW+ G   +     + +M + +       ++I SF     +++++V 
Sbjct: 315 LPSTFKAYRYQQHRWSCGPANLFRKMAFEIMTNKNVTLWKKVHVIYSFFVVRKLVAHIV- 373

Query: 345 VFWFLSVILNSYLII 359
            F F  VIL + +++
Sbjct: 374 TFIFYCVILPATVLV 388


>gb|AAK42537.1| Glucosaminyltransferase, intercellular adhesion protein A homolog,
           putative (icaA) [Sulfolobus solfataricus]
           Length = 426
           
 Score = 70.2 bits (169), Expect = 4e-11
 Identities = 69/287 (24%), Positives = 130/287 (45%), Gaps = 22/287 (7%)

Query: 86  EERVIYKTVRSVLGQDYRNMKVILINDNSTDRTRDIMEEINRKYPRKVVIIDVPPERGRS 145
           +E+ I + + ++ G DYR  KVI+++D++ +  + I+E ++ K P   VII  P  +GR 
Sbjct: 60  DEKTIKELINNLSGLDYRFYKVIIVSDDTEETFKKIIESLD-KLPDNFVIIRRPENKGR- 117

Query: 146 KPRALNYALEIIEKYMTHPNYVFILDADYLIPPNALKTLVSIMESAPQYVIGIQGNVRPR 205
           K  ALN+A  I +  M     +  LDA+  +  + L+ +  +   A    +  +  VR  
Sbjct: 118 KAGALNFATNISDAEM-----LVYLDAEARVEKDFLRKISQLDYDA----VAFRLKVRDV 168

Query: 206 NFRKNFVTKFITLERLVGFNVAIEGDMKLNENGKYGGTVALLRFPLLIRLGKFREDSVTE 265
           N +   V K  +       N   +   KL       G+   ++  +L ++G ++E+SV E
Sbjct: 169 NTQ---VQKIYSYTNEFVMNALFKARDKLGLIIFANGSAFGIKRDILRKIGGWKENSVAE 225

Query: 266 DTDLWARAMIAGYRFWYYHGVIGWEEAVETLRDYIKQRSRWAQGHLQVMIDHYWPVMRSC 325
           D +L  R  ++  +  Y   +  +  A  T  D   Q  RWA G  + +I +   + +  
Sbjct: 226 DLELGIRLALSNIKVKYVDDITVYTLAPYTHTDLYNQIKRWAYGSGE-LISYSMRLFKLG 284

Query: 326 SNIIESFIEH-------FYMMSYLVPVFWFLSVILNSYLIITGAPPL 365
              IE FI          Y++ +L+ +     + +N + + T   P+
Sbjct: 285 IRGIEGFIYSQQWGIYPLYLLLFLIIISIQFILNINYFYVFTSLIPI 331


  Database: ./suso.pep
    Posted date:  Jul 6, 2001  5:57 PM
  Number of letters in database: 840,471
  Number of sequences in database:  2977
  
  Database: /banques/blast2/nr.pep
    Posted date:  Dec 14, 2000 12:46 PM
  Number of letters in database: 188,266,275
  Number of sequences in database:  595,510
  
Lambda     K      H
   0.328    0.144    0.443 

Gapped
Lambda     K      H
   0.270   0.0470    0.230 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 168184974
Number of Sequences: 2977
Number of extensions: 7143614
Number of successful extensions: 23073
Number of sequences better than 1.0e-10: 21
Number of HSP's better than  0.0 without gapping: 1
Number of HSP's successfully gapped in prelim test: 20
Number of HSP's that attempted gapping in prelim test: 23036
Number of HSP's gapped (non-prelim): 24
length of query: 447
length of database: 189,106,746
effective HSP length: 51
effective length of query: 396
effective length of database: 158,583,909
effective search space: 62799227964
effective search space used: 62799227964
T: 11
A: 40
X1: 15 ( 7.1 bits)
X2: 38 (14.8 bits)
X3: 64 (24.9 bits)
S1: 40 (21.7 bits)
S2: 167 (69.5 bits)