ORF STATUS Function Best COG Functional category Pathways and functional systems
r_klactIV4272 good S KOG3914 Function unknown WD repeat protein WDR4
Only best alignment is shown:
BLASTP 2.2.3 [May-13-2002]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= r_klactIV4272 1451064 1452392 443
(443 letters)
Database: KOG eukaryal database 04/03
60,738 sequences; 30,389,216 total letters
Searching..................................................done
Color Key for Alignment Scores:
Score E
Sequences producing significant alignments: (bits) Value
YDR165w [S] KOG3914 WD repeat protein WDR4 368 e-101
SPCC18.13 [S] KOG3914 WD repeat protein WDR4 99 1e-20
7290698 [S] KOG3914 WD repeat protein WDR4 90 8e-18
Hs16445428 [S] KOG3914 WD repeat protein WDR4 84 6e-16
CE07574 [S] KOG3914 WD repeat protein WDR4 83 7e-16
Hs16445432 [S] KOG3914 WD repeat protein WDR4 61 4e-09
>YDR165w [S] KOG3914 WD repeat protein WDR4
Length = 444
Score = 368 bits (944), Expect = e-101
Identities = 200/451 (44%), Positives = 289/451 (63%), Gaps = 17/451 (3%)
Query: 1 MLHPFQSVLLNRDGSLLFCVVKNEIKAFKVEG-NGYVLRGEWVDDLDNTPLIKEKVLKEQ 59
++HP Q++L +RDGSL+F ++KN I +FK + N + G+W DD D + KEQ
Sbjct: 3 VIHPLQNLLTSRDGSLVFAIIKNCILSFKYQSPNHWEFAGKWSDDFDKIQESRNTTAKEQ 62
Query: 60 ARQLIENAS--KKLKTNDGEXXXXXXXXXXXXXXXXXXXXXYQYIRNLALSRDGKLLLAC 117
Q EN + KKLK+N G+ Y YIRNL L+ D L+AC
Sbjct: 63 QGQSSENENENKKLKSNKGDSIKRTAAKVPSPGLGAPPI--YSYIRNLRLTSDESRLIAC 120
Query: 118 TDSDKAAVIFNIDLDDKDNIFKLIKRQPYPKRPNAITTSVDDKDLILADKFGDVYSMPIQ 177
DSDK+ ++F++D K N+ KL KR + KRPNAI+ + DD +I+ADKFGDVYS+ I
Sbjct: 121 ADSDKSLLVFDVDKTSK-NVLKLRKRFCFSKRPNAISIAEDDTTVIIADKFGDVYSIDIN 179
Query: 178 NDVITSINAEKAPILGHVSMLTDVNMLTDSEGKQYIVTADRDEHIRISHYPQSFIVDKWL 237
+ I + PILGHVSMLTDV+++ DS+G Q+I+T+DRDEHI+ISHYPQ FIVDKWL
Sbjct: 180 S--IPEEKFTQEPILGHVSMLTDVHLIKDSDGHQFIITSDRDEHIKISHYPQCFIVDKWL 237
Query: 238 FGHEEFVSTICIPEWSDKLLFSAGGDKFVFSWNWKTGALLFKFDYTDLIQKYLTSDHLAP 297
FGH+ FVS+IC + D LL SAGGD +F+W+WKTG L FDY LI+ YL HLAP
Sbjct: 238 FGHKHFVSSICCGK--DYLLLSAGGDDKIFAWDWKTGKNLSTFDYNSLIKPYLNDQHLAP 295
Query: 298 ERFQNEKGDVIEYSVAKIVTLKDVPYIAFFVEATKVLFVLKVDEK-SGALSLHQTLEFDE 356
RFQNE D+IE++V+KI+ K++P++AFFVEATK + +L++ EK G L+L Q + F
Sbjct: 296 PRFQNENNDIIEFAVSKIIKSKNLPFVAFFVEATKCIIILEMSEKQKGDLALKQIITFPY 355
Query: 357 KIVSLTSALDVNTLCISLDNRDNQDC--DLVKLL--LLEGDVFIEQKDTNSQLMNTIRST 412
++SL++ D ++LDN+++ + K + L + F+ + +++ + I +
Sbjct: 356 NVISLSAHND--EFQVTLDNKESSGVQKNFAKFIEYNLNENSFVVNNEKSNEFDSAIIQS 413
Query: 413 LKSDLIANVEAGDVYPLYHNASLRKHGEHFS 443
++ D + ++YPLY+ +SLRKHGEH+S
Sbjct: 414 VQGDSNLVTKKEEIYPLYNVSSLRKHGEHYS 444
>SPCC18.13 [S] KOG3914 WD repeat protein WDR4
Length = 421
Score = 99.4 bits (246), Expect = 1e-20
Identities = 81/303 (26%), Positives = 138/303 (44%), Gaps = 50/303 (16%)
Query: 102 IRNLALSRDGKLLLACTDSDKAAVIFNIDLDDKDNIFKLIKRQPYPKRPNAITTSVDDKD 161
IR +A S+D + A DK +++ DK +L+ ++ PKR A +
Sbjct: 65 IRQVAFSKDYSRM-ATVSEDKCLRLWDSTQPDK---IELLYQKNIPKRC-ADLCFAGSNE 119
Query: 162 LILADKFGDVYSM------------------------PIQNDVITSINAEKA-PILGHVS 196
++ DKFGDVY + P+ ND + +K PI+GHVS
Sbjct: 120 IVFGDKFGDVYCVDENWFTTSEVTEEKKSNVVEGKQEPVNNDTLKDSKLQKLEPIMGHVS 179
Query: 197 MLTDVNMLTDSEG--KQYIVTADRDEHIRISHYPQSFIVDKWLFGHEEFVSTICIPEWSD 254
+LT + + + + ++ I+T+D+DEHIRIS +P +F+++ + GHE+FVS + + + +
Sbjct: 180 ILTQLIVAQNPQNSKEEIIITSDKDEHIRISRFPNAFVIEGFCLGHEDFVSRMSL--YDN 237
Query: 255 KLLFSAGGDKFVFSWNWKTGALLFKFDYTDLIQKYLTSDHLAPERFQNEKGDVIEYSVAK 314
+ L S GGD VF W+ + L FD YL+ + V+
Sbjct: 238 RTLISGGGDNHVFVWDLENFKCLDAFDLRSAFSTYLSLNQ--------------PMVVSV 283
Query: 315 IVTLKDVPYIAFFVEATKVLFVLKVDEKSGALSLHQTLEFDEKIV-SLTSALDVNTLCIS 373
I+ + +AF E L KV + L H L+ ++ ++ D + + IS
Sbjct: 284 ILPIFKRQLVAFACEGMAGLIFAKVTPEK-RLLFHSALKLSGPVLDAVLLDTDTDQILIS 342
Query: 374 LDN 376
LD+
Sbjct: 343 LDS 345
>7290698 [S] KOG3914 WD repeat protein WDR4
Length = 424
Score = 89.7 bits (221), Expect = 8e-18
Identities = 54/180 (30%), Positives = 98/180 (54%), Gaps = 13/180 (7%)
Query: 102 IRNLALSRDGKLLLACTDSDKAAVIFNIDLDDKDNIFKLIKRQPYPKRPNAITTSVDDKD 161
++N+A S DG+LL T + A++ + +L+ +P + +A+ D
Sbjct: 101 VQNVAYSPDGQLLAVTTSGKQKALLL---YRSRPENARLLSARPLARASSALRFCSDSSS 157
Query: 162 LILADKFGDVYSMPIQNDVITSINAEKAPILGHVSMLTDVNMLTDSEGKQYIVTADRDEH 221
+++ DK GD Y Q D + + A +LGH+S++ D+ SE +Q+I+T DRD+
Sbjct: 158 ILVTDKTGDCY----QYDCV-EVEAPPRLLLGHLSVVYDILW---SEDQQHIITCDRDDK 209
Query: 222 IRISHYPQSFIVDKWLFGHEEFVSTICIPEWSDKLLFSAGGDKFVFSWNWKTGALLFKFD 281
IR+++YP +F + + GH EFVS + + +++ + SA GDK + WN+ G L + +
Sbjct: 210 IRVTNYPATFDIHSYCLGHREFVSGLAL--LTEQHIASASGDKTLRVWNYIQGKELLQHE 267
>Hs16445428 [S] KOG3914 WD repeat protein WDR4
Length = 412
Score = 83.6 bits (205), Expect = 6e-16
Identities = 65/243 (26%), Positives = 112/243 (45%), Gaps = 27/243 (11%)
Query: 116 ACTDSDKAAVIFNIDLDDKDNIFKLIKRQPYPKRPNAITTSVDDKDLILADKFGDVYSMP 175
A TD K ++F + ++ + + +R A+T ++ +++ADK GDVYS
Sbjct: 77 ALTDDSKRLILF------RTKPWQCLSVRTVARRCTALTFIASEEKVLVADKSGDVYSF- 129
Query: 176 IQNDVITSINAEKAPILGHVSMLTDVNMLTDSEGKQYIVTADRDEHIRISHYPQSFIVDK 235
V+ + LGH+SML DV + D ++I+TADRDE IR+S ++
Sbjct: 130 ---SVLEPHGCGRLE-LGHLSMLLDVAVSPDD---RFILTADRDEKIRVSWAAAPHSIES 182
Query: 236 WLFGHEEFVSTICIPEWSDKLLFSAGGDKFVFSWNWKTGALLFKFDYTDLIQKYLTSDHL 295
+ GH EFVS I + LL S+ GD + W +++G L L + D
Sbjct: 183 FCLGHTEFVSRISVVPTQPGLLLSSSGDGTLRLWEYRSGRQLHCCHLASLQE---LVDPQ 239
Query: 296 APERFQNEKGDVIEYSVAKIVTLKDVPYIAFFVEATKVLFVLKVDEKSGALSLHQTLEFD 355
AP++F + ++I +A + T V+++ ++D + L Q L F
Sbjct: 240 APQKF----------AASRIAFWCQENCVALLCDGTPVVYIFQLDARRQQLVYRQQLAFQ 289
Query: 356 EKI 358
++
Sbjct: 290 HQV 292
>CE07574 [S] KOG3914 WD repeat protein WDR4
Length = 388
Score = 83.2 bits (204), Expect = 7e-16
Identities = 50/166 (30%), Positives = 89/166 (53%), Gaps = 14/166 (8%)
Query: 107 LSRDGKLLLACTDSDKAAVIFNIDLDDKDNIFKL--IKRQPYPKRPNAITTSVDDKDLIL 164
L+ G+ L+A ++K +F ++DK +I I PK P AI +D +++
Sbjct: 72 LTTHGRRLVAVGTNEKQIHVFEYFVNDKGDIVTAEHIVTSVVPKAPTAIVFDKEDAYVVV 131
Query: 165 ADKFGDVYSMPIQNDVITSINAEKAPILGHVSMLTDVNMLTDSEGKQYIVTADRDEHIRI 224
D+ GDV+ + +N + G +SM+ DV D GK+ ++ ADRDE +R
Sbjct: 132 GDRAGDVHRFSV-------LNGSAIEMAGAISMILDVAFSPD--GKRLLM-ADRDEKVRA 181
Query: 225 SHYPQSFIVDKWLFGHEEFVSTICIPEWSDKLLFSAGGDKFVFSWN 270
YP + ++D + GH E+V T+ + + + L+S+GGDK +++W+
Sbjct: 182 LRYPATSVIDSFFLGHTEYVKTLAVQD--NDSLWSSGGDKNLYNWS 225
>Hs16445432 [S] KOG3914 WD repeat protein WDR4
Length = 266
Score = 60.8 bits (146), Expect = 4e-09
Identities = 44/162 (27%), Positives = 74/162 (45%), Gaps = 16/162 (9%)
Query: 197 MLTDVNMLTDSEGKQYIVTADRDEHIRISHYPQSFIVDKWLFGHEEFVSTICIPEWSDKL 256
ML DV + D ++I+TADRDE IR+S ++ + GH EFVS I + L
Sbjct: 1 MLLDVAVSPDD---RFILTADRDEKIRVSWAAAPHSIESFCLGHTEFVSRISVVPTQPGL 57
Query: 257 LFSAGGDKFVFSWNWKTGALLFKFDYTDLIQKYLTSDHLAPERFQNEKGDVIEYSVAKIV 316
L S+ GD + W +++G L L + D AP++F + ++I
Sbjct: 58 LLSSSGDGTLRLWEYRSGRQLHCCHLASLQE---LVDPQAPQKF----------AASRIA 104
Query: 317 TLKDVPYIAFFVEATKVLFVLKVDEKSGALSLHQTLEFDEKI 358
+A + T V+++ ++D + L Q L F ++
Sbjct: 105 FWCQENCVALLCDGTPVVYIFQLDARRQQLVYRQQLAFQHQV 146
Database: KOG eukaryal database 04/03
Posted date: Apr 14, 2003 1:07 PM
Number of letters in database: 30,389,216
Number of sequences in database: 60,738
Lambda K H
0.319 0.137 0.397
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 25,286,326
Number of Sequences: 60738
Number of extensions: 1068177
Number of successful extensions: 2763
Number of sequences better than 1.0e-05: 6
Number of HSP's better than 0.0 without gapping: 4
Number of HSP's successfully gapped in prelim test: 2
Number of HSP's that attempted gapping in prelim test: 2741
Number of HSP's gapped (non-prelim): 6
length of query: 443
length of database: 30,389,216
effective HSP length: 109
effective length of query: 334
effective length of database: 23,768,774
effective search space: 7938770516
effective search space used: 7938770516
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)