ORF STATUS Function Best COG Functional category Pathways and functional systems
r_klactV2170 good R KOG1188 General function prediction only WD40 repeat protein
Only best alignment is shown:
BLASTP 2.2.3 [May-13-2002]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= r_klactV2170 759718 760983 422
(422 letters)
Database: KOG eukaryal database 04/03
60,738 sequences; 30,389,216 total letters
Searching..................................................done
Color Key for Alignment Scores:
Score E
Sequences producing significant alignments: (bits) Value
YNL035c [R] KOG1188 WD40 repeat protein 461 e-130
SPCC63.06 [R] KOG1188 WD40 repeat protein 119 7e-27
Hs18087841 [R] KOG1188 WD40 repeat protein 100 3e-21
7303819 [R] KOG1188 WD40 repeat protein 75 2e-13
At2g47790 [R] KOG1188 WD40 repeat protein 67 4e-11
YEL056w [B] KOG0264 Nucleosome remodeling factor subunit CAF1/NU... 58 3e-08
YMR131c [R] KOG0302 Ribosome Assembly protein 54 4e-07
7302720 [R] KOG0266 WD40 repeat-containing protein 53 7e-07
>YNL035c [R] KOG1188 WD40 repeat protein
Length = 389
Score = 461 bits (1186), Expect = e-130
Identities = 206/354 (58%), Positives = 270/354 (76%)
Query: 2 SFNKYKCRYFEENNWCLKLQPLYQHGLMTSISDGSVHLLDWGNLKTISSIQCHTTSINDM 61
S++ + F NWCLKLQP Y+HGL+T +S+G + LLDW K++ I+ T+INDM
Sbjct: 3 SYSLVESNSFGSENWCLKLQPSYKHGLLTGLSNGEIRLLDWSTGKSVQKIKASETAINDM 62
Query: 62 KVINSDFDTGAVFATAAEDGVKVWDIRARNNVASLQNDKASPFFSLDSRHNMLACGTELK 121
KVI SDF G + ++A+ D VKV+DIR + +A +QN+ SPF SLDSRH +LACGTEL+
Sbjct: 63 KVIGSDFSAGHLVSSASIDAVKVFDIRTNDRIAQIQNEANSPFISLDSRHGLLACGTELQ 122
Query: 122 DYDAELHIYDIRNWTKPVRSFVDSHHDDITDIKFHPCDSNLLMSGSTDGYVNIYDLTQDD 181
DA ++IYDIR W P+RS +DSHHDD+T IKFHP D N+L+SGSTDGY NIYDL QD+
Sbjct: 123 GIDAAVYIYDIRKWDTPLRSLIDSHHDDVTCIKFHPSDVNILLSGSTDGYTNIYDLKQDE 182
Query: 182 EEDALHQVINFASIHSCGWLGPKRIWSLSHMETFGIHELNDKSDEMIEPKPLEFGDVRDK 241
EEDALHQVIN+ASIHSCGWL PKRI++LSHMETF IHELNDKSDE+ EP+PL+FGDVR+
Sbjct: 183 EEDALHQVINYASIHSCGWLSPKRIFTLSHMETFAIHELNDKSDELKEPQPLDFGDVREI 242
Query: 242 WGCDYVIDIYPSFIATGKTHESQGELKIIPFQNEQVDVSSALVIPDAHGNEVIRDVLIPK 301
W CDYV+DIYP IATGKT E+ GEL ++PF++E+VD + +VIP AHG+EV+RD+ IP
Sbjct: 243 WNCDYVVDIYPGLIATGKTQENCGELCLLPFKDEKVDTENGIVIPHAHGDEVVRDIFIPA 302
Query: 302 NQTSLLYSCGEDGYVNVWKDTTNSLNVPHNFWDYTLPFTAFENSVREIEMDMGQ 355
+++LYSCGEDG V +W++ L++P NFWDY+ + RE +++ +
Sbjct: 303 QHSNMLYSCGEDGCVKIWENKQGPLDIPENFWDYSKKMNVLGDDDREGSINLDE 356
>SPCC63.06 [R] KOG1188 WD40 repeat protein
Length = 331
Score = 119 bits (299), Expect = 7e-27
Identities = 92/307 (29%), Positives = 151/307 (48%), Gaps = 18/307 (5%)
Query: 28 LMTSISDGSVHLLDWGNLKTISSIQCHTTSINDMKVINSDFDTGAVFATAAEDGVKVWDI 87
++ S S GS D G L I + T+I +I+ D G V +E + +WDI
Sbjct: 28 VVVSYSTGSWSCFDKGTLLEIFKVPKAHTNITG--IISCDQLNG-VITCGSEGEIHLWDI 84
Query: 88 RARNNVASLQ-NDKASPFFSLD-SRHNMLACGTELKDYDAELHIYDIRNWTKPVRSFVDS 145
R++ A +++PF + ++ N A G+EL A + ++D+R+ K +R + D+
Sbjct: 85 RSQAKSAVRSWTQQSTPFTCIALNKKNQFATGSELTRSLASVQLWDVRSEQKLIRQWNDA 144
Query: 146 HHDDITDIKFHPCDSNLLMSGSTDGYVNIYDLTQ-----DDEEDALHQVINF-ASIHSCG 199
H+DDIT ++FHP D+ LL++GS DG V++ D T+ D EED L VIN ASIH
Sbjct: 145 HNDDITHLQFHPKDNELLLTGSVDGLVSLLDTTKEEDSTDPEEDPLLHVINHGASIHLAK 204
Query: 200 WLGPKRIWSLSHMETFGIHELNDKSDEMI--EPKPLEFGDVRDKWGCDYVIDIYPS---- 253
++ KR+ LSHME++ +++L DE + D+R + C YVI+ +
Sbjct: 205 FVSKKRVMVLSHMESYAMYKLKRDKDEKTWSSNELFSIDDLRAELSCSYVINEVSTSDKQ 264
Query: 254 FIATGKTHESQGELKIIPFQNEQVDVSSALVIPDAHGNEVIRDVLIPKNQTSLLYSCGED 313
F A S E K + ++ + E+ R + + + YS GED
Sbjct: 265 FCALAFGDFSNHETKFVLVDTSTGELKKEPTKLERASEEICRAISFDV-KNDVYYSGGED 323
Query: 314 GYVNVWK 320
G + ++
Sbjct: 324 GLLQAFR 330
>Hs18087841 [R] KOG1188 WD40 repeat protein
Length = 387
Score = 100 bits (250), Expect = 3e-21
Identities = 92/316 (29%), Positives = 144/316 (45%), Gaps = 42/316 (13%)
Query: 33 SDGSVHLLDWGNLKTISSIQCHTTSINDMKVINSDFDTGAVFATAAEDGVKVWDIR-ARN 91
S+GS+ + D L + + +N ++ NS +V++ + VK WD R AR
Sbjct: 48 SNGSIRIYDKERLNVLREFSGYPGLLNGVRFANS---CDSVYSACTDGTVKCWDARVARE 104
Query: 92 NVASLQNDKASPFF---SLDSRHNMLACGTELKDYDAELHIYDIRNWTKPVRSFVDS--- 145
L S F ++ +++ GTE D DA L +D R ++ + + DS
Sbjct: 105 KPVQLFKGYPSNIFISFDINCNDHIICAGTEKVDDDALLVFWDARMNSQNLSTTKDSLGA 164
Query: 146 ----HHDDITDIKFHPCDSNLLMSGSTDGYVNIYDLTQDDEEDALHQVINFASIHSC-GW 200
H DD+T ++FHP + N+++SGS+DG VN++D+ D+EEDAL N S SC GW
Sbjct: 165 YSETHSDDVTQVRFHPSNPNMVVSGSSDGLVNVFDINIDNEEDALVTTCNSISSVSCIGW 224
Query: 201 LGP--KRIWSLSHMETFGIHELND-KSDEMIEPKPLEFGDVRD-----KWGCDYVIDIYP 252
G K+I+ ++H E F +LN +DE + L DVR+ + DY+I
Sbjct: 225 SGKGYKQIYCMTHDEGFYWWDLNHLDTDEPV--TRLNIQDVREVVNMKEDALDYLIG--- 279
Query: 253 SFIATGKTHESQGELKIIPFQNE------QVDVSSALVIPDAHGNE--VIRDVLIPKNQT 304
G HE L +I N+ +S + G +R
Sbjct: 280 -----GLYHEKTDTLHVIGGTNKGRIHLMNCSMSGLTHVTSLQGGHAATVRSFCWNVQDD 334
Query: 305 SLLYSCGEDGYVNVWK 320
SLL + GED + +WK
Sbjct: 335 SLL-TGGEDAQLLLWK 349
>7303819 [R] KOG1188 WD40 repeat protein
Length = 411
Score = 75.1 bits (183), Expect = 2e-13
Identities = 69/260 (26%), Positives = 130/260 (49%), Gaps = 29/260 (11%)
Query: 82 VKVWDIRARNNVASLQND---------KASPFFSLDSRHNMLACGTELKDYDAELHIYDI 132
V+++D+R R A + K+ F ++ ++ CGTE +A L +D+
Sbjct: 131 VRLYDLRLRGEQARFKYTQHPNVPPVPKSLSCFDRNANGRIICCGTEQFHSNAFLVFFDV 190
Query: 133 RNWTKPVRSFVDSHHDDITDIKFHPCDSNLLMSGSTDGYVNIYDLTQDDEEDALHQVINF 192
R + + + +SH DDIT ++FH + +LL +GS DG VN++D+ + DE++AL N
Sbjct: 191 RE-RQQMGVYFESHEDDITSLRFHAQNPDLLATGSVDGLVNVFDVKEPDEDEALLNTFNT 249
Query: 193 -ASIHSCGW---LGPKRIWS-LSHMETFGIHELNDKSDEMIEPKPLEFGDVRDKWGCDY- 246
+S+ W + K I S ++ F +E + + +P +R K ++
Sbjct: 250 ESSVARLAWHRNVYDKDIISCVTTTGDFKSYECEEGDEVASFERPDVTAAIRRKKAANFN 309
Query: 247 VIDIYPS-----FIATGKTHESQGELKIIPFQNEQVDVSSAL-VIPDAHGN-EVIRDVLI 299
+I+ + F+ G T+ ++GE+ ++ V ++L + + GN +++RD L
Sbjct: 310 LINAHNQEDGGVFLLAG-TNFNKGEI----LRSVSVTSKNSLQPLANFQGNKQIVRDSLF 364
Query: 300 PKNQTSLLYSCGEDGYVNVW 319
++ SLL++ GE G V VW
Sbjct: 365 -DSKRSLLFTGGESGIVTVW 383
>At2g47790 [R] KOG1188 WD40 repeat protein
Length = 425
Score = 67.4 bits (163), Expect = 4e-11
Identities = 82/350 (23%), Positives = 143/350 (40%), Gaps = 54/350 (15%)
Query: 28 LMTSISDGSVHLLDWGNLKTISSIQCHTTSINDMKVINSDFDTGAVFATAAEDG-VKVWD 86
+ S+S +V L + + H+ ++N + + + V + + DG ++ WD
Sbjct: 56 IAVSLSTNTVKLYSPVTGQYYGECKGHSDTVNQIAFSSDSAASPHVLHSCSSDGTIRSWD 115
Query: 87 IRARNNVASLQNDKASPFFSLD---SRHNMLA--CGTELKDYD-----------AELHIY 130
R+ V+ + FS + N+LA C +++ + ++ ++
Sbjct: 116 TRSFQQVSRIDTGNDQEIFSFSYGGAADNLLAGGCKEQVRHLNPVESLLQSEAMVQVLLW 175
Query: 131 DIRNWTKPVRSFVDSHHDDITDI---------------KFHPCDSNLLMSGSTDGYVNIY 175
D RN +K V +SH DD+T + F P N L+S S DG + ++
Sbjct: 176 DWRN-SKQVACLEESHMDDVTQVCCSLILSAQAGKELVHFVPNKPNKLLSASVDGLICLF 234
Query: 176 DLTQD-DEEDALHQVINFA-SIHSCGWLGP--KRIWSLSHMETFGIHELNDKSDEMIEPK 231
+ D +++D L VIN SI G+LG K++W L+H+ET I D S E+ K
Sbjct: 235 NTEGDINDDDHLESVINVGTSIGKIGFLGDGYKKLWCLTHIETLSIWNWEDGSCEVNLEK 294
Query: 232 PLEFGDVRDKW---GCDYVIDIY----PSFIATGKTHESQGELKIIPFQNEQV-DVSSAL 283
E D W DY +D + G T G + P +Q + +A
Sbjct: 295 ARELAS--DSWTQDNVDYFVDCHCPGGEDLWVIGGT--CAGTVGYFPVNYKQPGSIGTAE 350
Query: 284 VIPDAHGNEVIRDVLIPKNQTS-----LLYSCGEDGYVNVWKDTTNSLNV 328
I +V+R VL + ++ GEDG + WK ++ +
Sbjct: 351 AILGGGHIDVVRSVLQMPGEYGGAAGLFGWTGGEDGRLCCWKSDEDATEI 400
>YEL056w [B] KOG0264 Nucleosome remodeling factor subunit CAF1/NURF55/MSI1
Length = 401
Score = 57.8 bits (138), Expect = 3e-08
Identities = 51/200 (25%), Positives = 97/200 (48%), Gaps = 29/200 (14%)
Query: 9 RYFEENNWCLKLQPLYQHGLMTSISDGSVHLLDWGN-------LKTISSIQCHTTSINDM 61
++ ++N + L L + L++ D +V L + G+ ++T + + H+ IND
Sbjct: 157 KFHKDNGYALSFSTLVKGRLLSGSDDHTVALWEVGSGGDPTKPVRTWNDL--HSDIINDN 214
Query: 62 KVINSDFDTGAVFATAAEDGV-KVWDIRARNNVASLQNDKASPFFSLDSRH---NMLACG 117
K N + D +F T +ED + K+ D+RA N PF +L H N+LA
Sbjct: 215 KWHNFNKD---LFGTVSEDSLLKINDVRANNTTIDTVKCP-QPFNTLAFSHHSSNLLAAA 270
Query: 118 TELKDYDAELHIYDIRNWTKPVRSFVDSHHDDITDIKFHPCDSNLLMSGSTDGYVNIYDL 177
D+ +++YD+RN +P+ + H D + +++F +++S +D + ++DL
Sbjct: 271 ----GMDSYVYLYDLRNMKEPLHH-MSGHEDAVNNLEFSTHVDGVVVSSGSDNRLMMWDL 325
Query: 178 -------TQDDEEDALHQVI 190
T DD ED + ++I
Sbjct: 326 KQIGAEQTPDDAEDGVPELI 345
>YMR131c [R] KOG0302 Ribosome Assembly protein
Length = 511
Score = 53.9 bits (128), Expect = 4e-07
Identities = 43/181 (23%), Positives = 86/181 (46%), Gaps = 11/181 (6%)
Query: 18 LKLQPLYQHG-LMTSISDGSVHLLDWGNLKTISSIQCHTTSIN-DMKVINSDFDTGAVFA 75
L PL + G L++ G ++ + ++ Q T S N ++ I VFA
Sbjct: 279 LDWSPLIKTGALLSGDCSGQIYFTQRHTSRWVTDKQPFTVSNNKSIEDIQWSRTESTVFA 338
Query: 76 TAAEDG-VKVWDIRARNNVASLQNDKASPFFSLDSRHNMLACGTELKDYDAELHIYDIRN 134
TA DG +++WD R++ + ++ ++ ++ S + + D + ++D+R
Sbjct: 339 TAGCDGYIRIWDTRSKKHKPAISVKASNTDVNVISWSDKIGYLLASGDDNGTWGVWDLRQ 398
Query: 135 WT-------KPVRSFVDSHHDDITDIKFHPCDSNLLMSGSTDGYVNIYDLTQDDEEDALH 187
+T +PV + D H IT I F+P D +++ GS D V ++DL+ + +++ +
Sbjct: 399 FTPSNADAVQPVAQY-DFHKGAITSIAFNPLDESIVAVGSEDNTVTLWDLSVEADDEEIK 457
Query: 188 Q 188
Q
Sbjct: 458 Q 458
>7302720 [R] KOG0266 WD40 repeat-containing protein
Length = 307
Score = 53.1 bits (126), Expect = 7e-07
Identities = 40/151 (26%), Positives = 73/151 (47%), Gaps = 14/151 (9%)
Query: 28 LMTSISDGSVHLLDWGNLKTISSIQCHTTSINDMKVINSDFDTGAVFATAAED-GVKVWD 86
L++S D + L D + I S+ H +ND+ + + A+ ++D V++WD
Sbjct: 71 LVSSSGDRLLKLWDLSATRCIQSLAGHGDGVNDVA-----WSAAGLIASCSDDMTVRLWD 125
Query: 87 IRARNNVASLQNDKASPFFS-LDSRHNMLACGTELKDYDAELHIYDIRNWTKPVRSFVDS 145
R++ V L+ F + + N+LA + +D + ++D+R T V +
Sbjct: 126 ARSKLCVKVLEGHSRYSFSCCFNPQANLLASTS----FDETVRLWDVR--TGKTLKIVHA 179
Query: 146 HHDDITDIKFHPCDSNLLMSGSTDGYVNIYD 176
H D IT + FH D N+ ++ S DG V ++D
Sbjct: 180 HQDPITSVDFHR-DGNIFVTSSYDGLVRLWD 209
Database: KOG eukaryal database 04/03
Posted date: Apr 14, 2003 1:07 PM
Number of letters in database: 30,389,216
Number of sequences in database: 60,738
Lambda K H
0.318 0.136 0.425
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 23,926,752
Number of Sequences: 60738
Number of extensions: 1050307
Number of successful extensions: 3571
Number of sequences better than 1.0e-05: 8
Number of HSP's better than 0.0 without gapping: 4
Number of HSP's successfully gapped in prelim test: 4
Number of HSP's that attempted gapping in prelim test: 3556
Number of HSP's gapped (non-prelim): 17
length of query: 422
length of database: 30,389,216
effective HSP length: 109
effective length of query: 313
effective length of database: 23,768,774
effective search space: 7439626262
effective search space used: 7439626262
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)