ORF STATUS Function Best COG Functional category Pathways and functional systems
r_klactII2830 good R KOG1274 General function prediction only WD40 repeat protein
Only best alignment is shown:
BLASTP 2.2.3 [May-13-2002]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= r_klactII2830 975658 972833 -942
(942 letters)
Database: KOG eukaryal database 04/03
60,738 sequences; 30,389,216 total letters
Searching..................................................done
Color Key for Alignment Scores:
Score E
Sequences producing significant alignments: (bits) Value
YPR135w [R] KOG1274 WD40 repeat protein 701 0.0
SPAPB1E7.02c [R] KOG1274 WD40 repeat protein 195 3e-49
Hs5901892 [R] KOG1274 WD40 repeat protein 72 4e-12
7303244 [R] KOG1274 WD40 repeat protein 64 1e-09
>YPR135w [R] KOG1274 WD40 repeat protein
Length = 927
Score = 701 bits (1809), Expect = 0.0
Identities = 397/980 (40%), Positives = 584/980 (59%), Gaps = 104/980 (10%)
Query: 3 IKLTENRVFVAGGNTQARLSSDSSRLFTXXXXXXXXXXXXTDLEKEPDIIDICEEPSAFV 62
+ + + VF GG T L+ D++ L + E+EP+ +D + S+
Sbjct: 2 VSVIDKLVFDFGGKTLVSLAPDNNTLCVANKNGLTKILKTNNPEEEPETLDSSKLVSSIK 61
Query: 63 LSQNSADGLYVVSRNGDLYWYNVLENKNKLCFRSSLPLRDLALVHDDKVCVVGGDDLEMT 122
NS + + GD YN+ ++ +L R +LPLRD ++H K+ V GGDDLE+
Sbjct: 62 CYSNSH--FLMTTMQGDALRYNIDSSQEELLARFALPLRDCCVIHSGKMAVFGGDDLELI 119
Query: 123 LVALE-EGNSNVKLTLDEQLVSLSYNKQNNILAVSLSNGNVVFYSLSSTTPNRIHTLHDQ 181
L+ L+ E + + +DEQ+ +SYN Q NILAVS+ NG V +SL+ST PN++H L+D
Sbjct: 120 LLELDDETHKKHAIKIDEQVSQISYNSQMNILAVSMINGKVQIFSLTSTIPNKVHELNDY 179
Query: 182 L----------PKIFFN--DDLPADTENN-TENADSVAIDGLDPMLCEDNRTCTRVAWTN 228
+ KI N DD+ D +N+ +E AD + DP C NR CTRVAW
Sbjct: 180 IVANSYDDTHRDKILSNMMDDIDKDNDNDLSETADPDENNVADPEFCAANRICTRVAWHP 239
Query: 229 KGDQYAIPAKDSVIKLFKLSDHDLVTSFKPSVHVN--NWDALTIDQIHSNTIAAVGNSNQ 286
KG +A+P D +K+F + + L + ++ ++ L D + IAAV +
Sbjct: 240 KGLHFALPCADDTVKIFSIKGYSLQKTLSTNLSSTKAHFIDLQFDPLRGTYIAAV---DL 296
Query: 287 NSHLFIWNLSTGTQLVHESFRYSITSLSWRVNDQRTKLSLVAGTWSGDVITFNDIVSV-- 344
N+ L +WN T F+ IT+++W++ L LV GTWSG + ++
Sbjct: 297 NNKLTVWNWETSEIHYTREFKRKITNIAWKIQADSKTLDLVLGTWSGSIAIVQNLAESVV 356
Query: 345 ----NTTVAYSNTGGKLFV----------GSDD----EKLFDD----SDIDDLSKQTSNG 382
+ +VA S+T LFV G+DD +KLF D ++ +D+ QT +G
Sbjct: 357 SNIPDQSVAESSTKHGLFVDSESDLENLEGNDDINKSDKLFSDITQEANAEDVFTQTHDG 416
Query: 383 PEFYEPSKNDNNRRSKYENNDKADDKGEVSEDINSDNLFTDAEMDQHEPKRTYXXXXXXX 442
P + + + KY D+ D + +D + + ++H R +
Sbjct: 417 P-------SGLSEKRKYNFEDEEDF---IDDDDGAGYISGKKPHNEHSYSRVH------- 459
Query: 443 XXXXXXXXAAQYKRPKTHVQSIPSLTANHVNHRRIYQPPAQFQYRPFSTGATPFGNSDKR 502
KTH S P AN +F+Y PFS TPFG +D+R
Sbjct: 460 ---------------KTH--SFPISLAN----------TGKFRYMPFSPAGTPFGFTDRR 492
Query: 503 YLTINQLGHVWTTRNEGGSNSITVSFFDVSRFREYHFDDLFKYDLCSLTDEGILLGQSKM 562
YLT+N++G+V T +N SITVSFFDV RFREYHF+DLF YDLC L ++G L GQSK
Sbjct: 493 YLTMNEVGYVSTVKNSE-QYSITVSFFDVGRFREYHFEDLFGYDLCFLNEKGTLFGQSKT 551
Query: 563 GQIQYKPHSPSGDSWTKKIPLLSKERVTSIACTPKRVIVGTSLGYLRTYNDFGLPLAIEK 622
GQIQY+PH +WTK IPL + ER+TS+A TP RVIVGTSLGY R++N FG+P A+EK
Sbjct: 552 GQIQYRPHDSIHSNWTKIIPLQAGERITSVAATPVRVIVGTSLGYFRSFNQFGVPFAVEK 611
Query: 623 MSPIVALSAHEYKVFTIHYSQYHGITYSMFQQHPQTGNKYFQRESPLPITLPQQNQPGND 682
SPIVAL+A Y+VF++HYSQ+HG++YS+ + + +Y++RE PLP++LP N
Sbjct: 612 TSPIVALTAQNYRVFSVHYSQFHGLSYSL-SELGTSSKRYYKRECPLPMSLPNINSDMKK 670
Query: 683 DEDIEFLKYDDIFNSFNPLGIKSLFFSSFGDPCMFGHDNVLLILSKWRSGSDARWVPIVD 742
D ++++ +FNP+GIKSLFFSS+GDPC+FG DN LL+LSKWRS +++W+PI+D
Sbjct: 671 DANLDYY-------NFNPMGIKSLFFSSYGDPCIFGSDNTLLLLSKWRSPEESKWLPILD 723
Query: 743 TNLELWKMSGGKHPKNVHVWPLGLNFDTFSYLLLKGKNAWPDIPMTVPTEMEVRIPVVSK 802
+N+E+WKMSGGK ++HVWPL L +DT + +L+KGK+ WP+ P+ +P+EME+R+PV K
Sbjct: 724 SNMEIWKMSGGKETTDIHVWPLALAYDTLNCILVKGKHIWPEFPLPLPSEMEIRMPVFVK 783
Query: 803 SEL---QKTQDDNRND---SSDVEDIDNSNGVEIPMYLAAEEEFLRSKILSDLLKDTMDN 856
S+L K + +N+ ++ E+ + ++IP+ +AAEEE+LRSK+LS+LL DT++N
Sbjct: 784 SKLLEENKAILNKKNEIGADTEAEEGEEDKEIQIPVSMAAEEEYLRSKVLSELLTDTLEN 843
Query: 857 EGEIYGNETDLLQHLVATHDKSLLRLLAYVCSEQDVNKAVSIVNELKQDKALSAARKIAE 916
+GE+YGNE ++L L +DK+LLRL A CS+Q+V KA+S+ +ELKQD+AL+AA KI+E
Sbjct: 844 DGEMYGNENEVLAALNGAYDKALLRLFASACSDQNVEKALSLAHELKQDRALTAAVKISE 903
Query: 917 RAELLPLVRKINTIIEAKFE 936
RAEL LV+KIN I EA++E
Sbjct: 904 RAELPSLVKKINNIREARYE 923
>SPAPB1E7.02c [R] KOG1274 WD40 repeat protein
Length = 815
Score = 195 bits (495), Expect = 3e-49
Identities = 215/909 (23%), Positives = 379/909 (41%), Gaps = 164/909 (18%)
Query: 46 EKEPDIIDICEEP-SAFVLSQNSADGLYVVSRNGDLYWYNV-LENKNKLCFRSSLPLRDL 103
++EPD ID ++P + +++N S + + Y + ++ L R++LP+RD+
Sbjct: 45 DEEPDSIDNHQDPITGIAVAENY---FCTCSEDATVCVYPIDSPTEHTLLARTTLPIRDV 101
Query: 104 ALVHDDKVCVVGGDDLEMTLVALEEGNSNVKLT-LDEQLVSLSYNKQNNILAVSLSNGNV 162
A D + D+ + +V+ + + L ++Y+ N LAVS NG +
Sbjct: 102 AYSVDGNWIAIASDETAVKVVSSTDSSQIFSLRPAKASNKHVTYSPNGNFLAVSSCNGIL 161
Query: 163 VFYSLSSTTPNRIHTLHDQLPKIFFNDDLPADTENNTENADSVAIDGLDPMLCEDNRTCT 222
FY + +L K N + E+ C+
Sbjct: 162 YFYDTQTR----------ELIKFLTNTIASLEAESEI---------------------CS 190
Query: 223 RVAWTNKGDQYAIPAKDSVIKLFKLSDHDLVTSFKPSVHVNNWDALTIDQIHSNTIAAVG 282
+ AW K +A+ + D + + D + P N +T SN + +
Sbjct: 191 KAAWHPKNGTFAVASTDHFVSVISPDDWLPLYKLLPK---ENHSGVTDISWSSNGMY-IA 246
Query: 283 NSNQNSHLFIWNLSTGTQLVHESFRYSITSLSWRVNDQRTKLSLVAGTWSG--DVITF-- 338
S + + IW+ + +V + ++ +L+W+ + + G DVI
Sbjct: 247 ASFKKGGILIWDTQSHEVVVELPYS-TVVALAWQPFENVLSFTTNQGILYSCPDVIPKSI 305
Query: 339 ----NDIVSVNTTVAYSNTGGK----LFVGSDDEKLFDDSDIDDLSKQTSNGPEFYEPSK 390
ND T+ N K LF GSDDE+ + +D+D
Sbjct: 306 LKEENDPTKPLTSSKSKNRTSKELDDLF-GSDDEQSQNVNDLDG---------------- 348
Query: 391 NDNNRRSKYENNDKADDKGEVSEDINSDNLFTDAEMDQHEPKRTYXXXXXXXXXXXXXXX 450
N N +++ N+D D S D++ D+ D E D + K
Sbjct: 349 NSANEENEFINHDGLDS----SLDLDGDSYMVD-ENDLNLAK------------------ 385
Query: 451 AAQYKRPKTHVQSIPSLTANHVNHRRIYQPPAQFQYRPFSTGATPFGNSDKRYLTINQLG 510
KR + + + N + RR+ Q ++P TG+TP+ ++RYL +N +G
Sbjct: 386 ----KRKQKALIDRTTTIENGSSKRRLLQASI---HKPVHTGSTPW-QGNRRYLCLNLVG 437
Query: 511 HVWTTRNEGGSNSITVSFFDVSRFREYHFDDLFKYDLCSLTDEGILLG----QSKMGQIQ 566
+WT + + N+ITV F D + R+YHF D K+++ L EG L +S G I
Sbjct: 438 FIWTVQQDAEHNTITVEFHDETTHRKYHFVDDQKFEMACLDHEGALYASPATESSPGVIY 497
Query: 567 YKPHS--PSGDSWTKKIPLLSKERVTSIACTPKRVIVGTSLGYLRTYNDFGLPLAI--EK 622
YK H W +P+ ++ VT I+ + V+V TS GY+R ++ G P++I K
Sbjct: 498 YKAHVDWSRKSEWAMALPMENESPVT-ISLSSSVVLVCTSAGYVRVFSRQGFPISIHRSK 556
Query: 623 MSPIVALSAHEYKVFTIHYSQYHGITYSMFQQHPQTGNKYFQRESPLPITLPQQNQPGND 682
P VA S+ + + TI S + ++ ++ + LP Q
Sbjct: 557 HLPFVACSSFQDTIITIANDGLSSDGNSRLVYSIEDISRDEMLQTGDGVALPPQGT---- 612
Query: 683 DEDIEFLKYDDIFNSFNPLGIKSLFFSSFGDPCMFGHDNVLLILSKWRSGSDARWVPIVD 742
++S+FFS GDP ++ VLL+L WR A+W+P++D
Sbjct: 613 --------------------LESVFFSDVGDPYIYDSTGVLLVLMHWRIPGQAKWIPVLD 652
Query: 743 TNLELWKMSGGKHPKNVHVWPLGLNFDTFSYLLLKGKNAWPDIPMTVPTEMEVRIPVVSK 802
TN EL + + + WP+ + + F +LLKG + +P P + TE + RIP
Sbjct: 653 TN-ELER----RKSRQESYWPVTVADNQFHCILLKGASRYPYFPRPMFTEFDFRIPC--- 704
Query: 803 SELQKTQDDNRNDSSDVEDIDNSNGVEIPMYLAAEEEFLRSKILSDLLKDTMDNEGEIYG 862
+ N D+S +P+ EE LR+K+ LL+D++ +G++
Sbjct: 705 -------NTNNPDAS----------TSVPV---LEELQLRNKLFLTLLEDSI-GDGDVTE 743
Query: 863 NETDLLQHLVATHDKSLLRLLAYVCSEQDVNKAVSIVNELKQDKALSAARKIAERAELLP 922
+E + L A DK+LL+L+ C E+ + + + L++ +++AA+KIA L
Sbjct: 744 DEKISIARLEANIDKALLQLIQKACLEERIERVYELTKTLRRTTSIAAAQKIALHHSLTN 803
Query: 923 LVRKINTII 931
+ KI ++
Sbjct: 804 VAEKIGNLL 812
>Hs5901892 [R] KOG1274 WD40 repeat protein
Length = 1129
Score = 72.0 bits (175), Expect = 4e-12
Identities = 152/736 (20%), Positives = 269/736 (35%), Gaps = 123/736 (16%)
Query: 219 RTCTRVAWTNK-GDQYAIPAKDSVIKLFKLSD----HDLVTSF-KPSVHVNNWDALTIDQ 272
++ R+AW K G AIP + SV KL++ DL +F ++++ W
Sbjct: 187 KSICRLAWQPKSGKLLAIPVEKSV-KLYRRESWSHQFDLSDNFISQTLNIVTWSPC---- 241
Query: 273 IHSNTIAAVGNSNQNSHLFIWNLSTGT---QLVHESFRYSITSLSWRVNDQRTKLSLVAG 329
A G+ N + +WN+ T ++ HE Y+I L+W R +
Sbjct: 242 ---GQYLAAGSIN--GLIIVWNVETKDCMERVKHEK-GYAICGLAWHPTCGRISYTDA-- 293
Query: 330 TWSGDVITFNDIVSVNTTVAYSNTGGKLFVGSDDEKLFDDSDIDDLSKQTSNGPEFYEPS 389
G++ ++ + + S + V D LFD D+ SN +F +
Sbjct: 294 --EGNLGLLENVCDPSGKTSSSKVSSR--VEKDYNDLFDGDDM-------SNAGDFLNDN 342
Query: 390 KNDNNRRSKYENNDKADDKGEVSEDINSDNLFTDAEMDQHEPKRTYXXXXXXXXXXXXXX 449
+ SK ND DD+ + E D++
Sbjct: 343 AVEIPSFSKGIINDDEDDEDLMMASGRPRQRSHILEDDENS------VDISMLKTGSSLL 396
Query: 450 XAAQYKRPKTHVQSIPSLTANHVNHRRIYQPPAQFQYRPFSTGATPFGNSDKRYLTINQL 509
+ + + ++P +T+ + P Q +PF +G+TP + R++ N +
Sbjct: 397 KEEEEDGQEGSIHNLPLVTSQRPFYDGPMPTPRQ---KPFQSGSTPL-HLTHRFMVWNSI 452
Query: 510 GHVWTTRNEGGSNSITVSFFDVSRFREYHFDDLFKYDLCSLTDEGILLGQSKMGQIQYKP 569
G + N+ N+I V F D S H + Y + L+ E ILL ++ K
Sbjct: 453 GII-RCYNDEQDNAIDVEFHDTSIHHATHLSNTLNYTIADLSHEAILLACESTDELASKL 511
Query: 570 HSPSGDSWTKK----IPLLSKERVTSIACTPKRVIVGTSLGYLRTYNDFGLPLAIEKMS- 624
H SW I L E + +I TS LR + G+ + ++
Sbjct: 512 HCLHFSSWDSSKEWIIDLPQNEDIEAICLGQGWAAAATSALLLRLFTIGGVQKEVFSLAG 571
Query: 625 PIVALSAHEYKVFTIHYSQYHGITYSMFQQHPQTGNKYFQRESPLPITLPQQNQPGNDDE 684
P+V+++ H ++F ++ H TG F + L + L + + +
Sbjct: 572 PVVSMAGHGEQLFIVY--------------HRGTG---FDGDQCLGVQLLELGK-----K 609
Query: 685 DIEFLKYDDIFNSFNPLGIKS----LFFSSFGDPCMFGHDNVLLILSKWRSGSDARWVPI 740
+ L D + PL KS + FS+ G PC + ++ +L++ G W PI
Sbjct: 610 KKQILHGDPL-----PLTRKSYLAWIGFSAEGTPCYVDSEGIVRMLNR---GLGNTWTPI 661
Query: 741 VDTNLELWKMSGGKHPKNVHVWPLGL--NFDTFSYLLLKGKNAWPDIPMTVPTEMEVRIP 798
+T K+ H W +G+ N + KG P +P + ++P
Sbjct: 662 CNTREHC-------KGKSDHYWVVGIHENPQQLRCIPCKGSRFPPTLPRPAVAILSFKLP 714
Query: 799 VVSKSELQKTQDDNRNDSSDVEDIDNSNGVEIPMYLAAEEEFLRSKILSDLLKDTMDNEG 858
I G EE+F RS I + L D + G
Sbjct: 715 YC--------------------QIATEKG-------QMEEQFWRSVIFHNHL-DYLAKNG 746
Query: 859 EIYGNETDLLQHLVATHDKSLLRLLAYVCSEQDVNKAVSIVNELKQDKALSAARKIAERA 918
Y E + L+++LA C + + V + + + Q+ A++ A K A R+
Sbjct: 747 --YEYEESTKNQATKEQQELLMKMLALSCKLEREFRCVELADLMTQN-AVNLAIKYASRS 803
Query: 919 ELLPLVRKINTIIEAK 934
L L +K++ + K
Sbjct: 804 RKLILAQKLSELAVEK 819
>7303244 [R] KOG1274 WD40 repeat protein
Length = 895
Score = 63.5 bits (153), Expect = 1e-09
Identities = 81/334 (24%), Positives = 134/334 (39%), Gaps = 54/334 (16%)
Query: 485 QYRPFSTGATPFGNSDKRYLTINQLGHVWTTRNEGGSNSITVSFFDVSRFREYHFDDLFK 544
Q F ATP + + RY+ N +G V G +SI V F D S H + +
Sbjct: 390 QQTAFQPSATP-ADLEHRYMAWNDVGIVTAHVEPSGDSSIDVEFHDASIHHALHISNYNQ 448
Query: 545 YDLCSLTDEGILLGQSKMGQIQYKPHSPSGD-SWTKKIPLLSKERVTSIACTPKRVIVGT 603
++L S++ + L ++ ++ + +G+ W+ +P E ++ T V V T
Sbjct: 449 HNLASVSSGALALASNESSKLVVIALAAAGNKEWSLSLP--DCESAEAVVATRFLVAVAT 506
Query: 604 SLGYLRTYNDFGLPLAIEKM-SPIVALSAHEYKVFTIHYSQYHGITYSMFQQHPQ----- 657
S +LR + G + + P+VA++ HE+ + + YH S QQH
Sbjct: 507 SSSFLRIFTVMGTQREVLTIPGPMVAIAGHEHSLMVV----YHSSAPSQTQQHLAAMLVN 562
Query: 658 -TGNKYFQRESPLPITLPQQNQPGNDDEDIEFLKYDDIFNSFNPLGIKSLFFSSFGDPCM 716
G Q P+P+T PG + + Y D G P +
Sbjct: 563 INGLSLRQEYLPVPLT------PG---RQLTWFGYSDT-----------------GSPSI 596
Query: 717 FGHDNVLLILSKWRSGSDARWVPIVDTNLELWKMSGGKHPKNVHVWPLGLNFDTFSYLLL 776
DN+ L L +R S+A W PI DT + +S N V + +L
Sbjct: 597 --ADNMGL-LQLYRRSSNA-WFPICDTMKQSTSVS-----HNFFVVAVSEKRQIVQAVLC 647
Query: 777 KGKNAWPDIPMTVPTEMEVRIPV----VSKSELQ 806
+G + P + E+ ++IP+ V KSEL+
Sbjct: 648 RGTSYPMTNPRPMLQELRMQIPLCDVEVEKSELE 681
Database: KOG eukaryal database 04/03
Posted date: Apr 14, 2003 1:07 PM
Number of letters in database: 30,389,216
Number of sequences in database: 60,738
Lambda K H
0.315 0.133 0.391
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 58,955,239
Number of Sequences: 60738
Number of extensions: 2751604
Number of successful extensions: 8651
Number of sequences better than 1.0e-05: 4
Number of HSP's better than 0.0 without gapping: 2
Number of HSP's successfully gapped in prelim test: 2
Number of HSP's that attempted gapping in prelim test: 8627
Number of HSP's gapped (non-prelim): 8
length of query: 942
length of database: 30,389,216
effective HSP length: 116
effective length of query: 826
effective length of database: 23,343,608
effective search space: 19281820208
effective search space used: 19281820208
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)