ORF STATUS Function Best COG Functional category Pathways and functional systems
r_klactIII4103 good S KOG4328 Function unknown WD40 protein
Only best alignment is shown:
BLASTP 2.2.3 [May-13-2002]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= r_klactIII4103 1450404 1448869 -512
(512 letters)
Database: KOG eukaryal database 04/03
60,738 sequences; 30,389,216 total letters
Searching..................................................done
Color Key for Alignment Scores:
Score E
Sequences producing significant alignments: (bits) Value
YDL156w [S] KOG4328 WD40 protein 650 0.0
At1g80710 [S] KOG4328 WD40 protein 101 2e-21
Hs13376367 [S] KOG4328 WD40 protein 84 5e-16
At5g63010 [E] KOG0280 Uncharacterized conserved protein 58 4e-08
Hs7657439 [K] KOG0263 Transcription initiation factor TFIID subu... 53 1e-06
Hs22057346 [R] KOG1334 WD40 repeat protein 52 2e-06
SPBC1A4.07c [A] KOG0268 Sof1-like rRNA processing protein (conta... 51 4e-06
SPBC1711.16 [S] KOG0270 WD40 repeat-containing protein 50 6e-06
>YDL156w [S] KOG4328 WD40 protein
Length = 522
Score = 650 bits (1677), Expect = 0.0
Identities = 335/519 (64%), Positives = 389/519 (74%), Gaps = 14/519 (2%)
Query: 1 MGELTEFQKKRLEXXXXXXXXXXXXXXXXVSSQIKREAGVEDEHLDRXXXXXXXXXXXXX 60
M ELTEFQKKRLE V+SQIK EAGV ++ R
Sbjct: 1 MPELTEFQKKRLENIKRNNDLLKKLHLSGVASQIKHEAGVLEK--SRAPAKKKQKTTNTR 58
Query: 61 XXXXXXXXXXTRRSRRLRGENVDG-NGIPNVNDNQLLKMGQSDSTPELEAIDELKNTALS 119
TRRSRRLRGE+ D GIPNVNDNQLLKMG D + ID +K +
Sbjct: 59 ATKSASPTLPTRRSRRLRGESADDVKGIPNVNDNQLLKMGSPDGQDK-NFIDAIKEKPVI 117
Query: 120 GDVKLSDLIKSENEEELLDKFKSFANKNFSSGDFFKELQQQQVPTPEIKQLQEDFDLKLY 179
GDVKLSDLIK E+E LL+KFK F N NFSSGDFF+E++++Q + ++FDL LY
Sbjct: 118 GDVKLSDLIKDEDESALLEKFKRFNNGNFSSGDFFEEIKKRQGDVTGM----DEFDLDLY 173
Query: 180 DIFQPNEIKLTAERISATFFHPSVDKKLVICGDTAGNVGLWNVRET----QPEDELEEPD 235
D+FQPNEIK+T ERISAT+FHP+++KKL+I GDT+G VG WNVR+ ED +EEPD
Sbjct: 174 DVFQPNEIKITYERISATYFHPAMEKKLIIAGDTSGTVGFWNVRDEPLADSEEDRMEEPD 233
Query: 236 ITKVKLFTKNVGRIDTYATDSSRLLAASYDGYLRSINLQDMNSEEILVLKNEYDDPLGIS 295
IT+VKLFTKNVGRID + D+S++L SYDG +RS++L ++ SEE+L LKNEYDD LGIS
Sbjct: 234 ITRVKLFTKNVGRIDCFPADTSKILLTSYDGSIRSVHLNNLQSEEVLTLKNEYDDSLGIS 293
Query: 296 DFQFNYNDPNVLFMTTLSGEFTTFDVRTKPTEINLKRLSDKKIGSFSINPKRPYEIATGS 355
D QF+Y +PNVLF+TTL GEFTTFD R K +E NL+RL+DKKIGS +INP RPYEIATGS
Sbjct: 294 DCQFSYENPNVLFLTTLGGEFTTFDTRVKKSEYNLRRLADKKIGSMAINPMRPYEIATGS 353
Query: 356 LDRTLKIWDTRKIVNKPEWSQYEDFASHEIVATYDSRLSVSAVSYSPMDETLVCNGYDDT 415
LDRTLKIWDTR +V KPEWSQYED+ SHEIV+TYDSRLSVSAVSYSP D TLVCNGYDDT
Sbjct: 354 LDRTLKIWDTRNLVKKPEWSQYEDYPSHEIVSTYDSRLSVSAVSYSPTDGTLVCNGYDDT 413
Query: 416 IRLFDVSGT--LPEDLQPKLTLKHNCQTGRWTSILKARFKLNMDVFAIANMKRAIDIYTS 473
IRLFDV L L+PKLT++HNCQTGRWTSILKARFK N +VFAIANMKRAIDIY S
Sbjct: 414 IRLFDVKSRDHLSAKLEPKLTIQHNCQTGRWTSILKARFKPNKNVFAIANMKRAIDIYNS 473
Query: 474 SGVQLAHLPTATVPAVISWHPTQNWVVGGNSSGKAFLFT 512
G QLAHLPTATVPAVISWHP +NW+ GGNSSGK FLFT
Sbjct: 474 EGQQLAHLPTATVPAVISWHPLRNWIAGGNSSGKIFLFT 512
>At1g80710 [S] KOG4328 WD40 protein
Length = 516
Score = 101 bits (252), Expect = 2e-21
Identities = 101/387 (26%), Positives = 163/387 (42%), Gaps = 55/387 (14%)
Query: 128 IKSENEEELLDKFKSFANKNFSSGDFFKELQQQQVPTPEIKQLQEDFDLKLYDIFQPNEI 187
+K E E+ D F + NK FS +P +K + +FDL L + N
Sbjct: 168 VKKEEPED--DSFSDYVNKEFS------------IP---VKPEKIEFDLDLLTLEPQNVA 210
Query: 188 KLTAERISATFFHPSVDKKLVICGDTAGNVGLWNVRETQPEDELEEPDITKVKLFTKN-- 245
++ RI F P + K+V GD GNVG WN+ D E D + LFT +
Sbjct: 211 RVVPGRIFVVQFLPCENVKMVAAGDKLGNVGFWNL------DCGNEEDNDGIYLFTPHSA 264
Query: 246 -VGRIDTYATDSSRLLAASYDGYLRSINLQDMNSEEILVLKNEYDDPLGISDFQFNYNDP 304
V I SR++++SYDG +R +++ E V Y I ND
Sbjct: 265 PVSSIVFQQNSLSRVISSSYDGLIRLMDV------EKSVFDLVYSTDEAIFSLSQRPNDE 318
Query: 305 NVLFMTTLSGEFTTFDVRTKPTEINLKRLSDKKIGSFSINPKRPYEIATGSLDRTLKIWD 364
L+ G F +D+R + + + L +++I S NP+ P+ +AT S D T +WD
Sbjct: 319 QSLYFGQDYGVFNVWDLRAGKSVFHWE-LHERRINSIDFNPQNPHVMATSSTDGTACLWD 377
Query: 365 TRKI-VNKPEWSQYEDFASHEIVATYDSRLSVSAVSYSPMDETLVCNGYDDTIRLFDVSG 423
R + KP + ++T + +V + +SP +L D+ I + +SG
Sbjct: 378 LRSMGAKKP-----------KTLSTVNHSRAVHSAYFSPSGLSLATTSLDNYIGV--LSG 424
Query: 424 TLPEDLQPKLTLKHNCQTGRWTSILKARFKLNMDVFAIANMKRAIDIYT---SSGVQLAH 480
+ + + HN T RW S KA + + + N+ + ID+ V H
Sbjct: 425 A---NFENTCMIYHN-NTSRWISKFKAVWGWDDSYIYVGNLSKKIDVINPKLKRTVMELH 480
Query: 481 LP-TATVPAVISWHPTQNWVVGGNSSG 506
P +P I HP + G+++G
Sbjct: 481 NPLQRAIPCRIHCHPYNVGTLAGSTAG 507
>Hs13376367 [S] KOG4328 WD40 protein
Length = 626
Score = 84.0 bits (206), Expect = 5e-16
Identities = 97/391 (24%), Positives = 171/391 (42%), Gaps = 35/391 (8%)
Query: 130 SENEEELLDKFKSFANKNFSSGDFFKELQQQQVPTPEIKQLQEDFDLKLYDIFQPNEIKL 189
SEN+E+ ++FK F + +G + + IK + + + + I + K+
Sbjct: 256 SENQEDNNERFKGFLHT--WAGMSKPSSKNTEKGLSSIKSYKANLNGMV--ISEDTVYKV 311
Query: 190 TAERISATFFHPSVDKKLVICGDTAGNVGLWNVRETQPEDELEEPDITKVKLFTKNVGRI 249
T I + HPS + LV G G VGL ++ + ED + ++ V +
Sbjct: 312 TTGPIFSMALHPSETRTLVAVGAKFGQVGLCDLTQQPKED-----GVYVFHPHSQPVSCL 366
Query: 250 DTYATDSSRLLAASYDGYLRSINLQDMNSEEILVLKNEYDDPLGISDFQFNYNDPNVLFM 309
+ + +L+ SYDG LR + EE V +NE S F F D + L +
Sbjct: 367 YFSPANPAHILSLSYDGTLRCGDFSRAIFEE--VYRNERSS---FSSFDFLAEDASTLIV 421
Query: 310 TTLSGEFTTFDVRTKPTEIN-LKRLSDKKIGSFSINP-KRPYEIATGSLDRTLKIWDTRK 367
G + D RT T L S KI + ++P R Y I G D I+D R+
Sbjct: 422 GHWDGNMSLVDRRTPGTSYEKLTSSSMGKIRTVHVHPVHRQYFITAGLRDT--HIYDARR 479
Query: 368 IVNKPEWSQYEDFASHEIVATYDSRLSVSAVSYSPMD-ETLVCNGYDDTIRLFDVSGTLP 426
+ ++ S +++ + S+++ +SP+ +V D +R+FD S +
Sbjct: 480 LNSR---------RSQPLISLTEHTKSIASAYFSPLTGNRVVTTCADCNLRIFD-SSCIS 529
Query: 427 EDLQPKLTLKHNCQTGRWTSILKARFKLNM-DVFAIANMK--RAIDIYTSSGVQLAHLP- 482
+ T++HN TGRW + +A + D + +M R ++I+ +G ++
Sbjct: 530 SKIPLLTTIRHNTFTGRWLTRFQAMWDPKQEDCVIVGSMAHPRRVEIFHETGKRVHSFGG 589
Query: 483 --TATVPAVISWHPTQNWVVGGNSSGKAFLF 511
+V ++ + HPT+ + GGNSSGK +F
Sbjct: 590 EYLVSVCSINAMHPTRYILAGGNSSGKIHVF 620
>At5g63010 [E] KOG0280 Uncharacterized conserved protein
Length = 343
Score = 57.8 bits (138), Expect = 4e-08
Identities = 40/160 (25%), Positives = 75/160 (46%), Gaps = 9/160 (5%)
Query: 212 DTAGNVGLWNVRETQPEDELEEPDITKVKLFTKNVGRIDTYATDSSRLLAASYDGYLRSI 271
D G + ++ + ET+ + L E K + ++ + S+ ++ DG +
Sbjct: 90 DADGCLIVYKIDETESKGTLRE---VSGKRISSSMCLCLDWDPSSTSIVVGLSDGSASVV 146
Query: 272 NLQDMNSEEILVLKNEYDDPLGISDFQFNYNDPNVLFMTTLSGEFTTFDVRTKPTEINL- 330
+ D N E + K +D L + F N +PN+++ + +F+ +D+R P + +
Sbjct: 147 SFTDSNLETVQEWKG-HDFELWTASFDLN--NPNLVYTGSDDCKFSCWDIRDSPADNRVF 203
Query: 331 --KRLSDKKIGSFSINPKRPYEIATGSLDRTLKIWDTRKI 368
++ + S NP PY I TGS D TL++WDTR +
Sbjct: 204 QNSKVHTMGVCCISSNPSDPYSIFTGSYDETLRVWDTRSV 243
>Hs7657439 [K] KOG0263 Transcription initiation factor TFIID subunit TAF5
(also component of histone acetyltransferase SAGA)
Length = 589
Score = 52.8 bits (125), Expect = 1e-06
Identities = 71/298 (23%), Positives = 124/298 (40%), Gaps = 65/298 (21%)
Query: 202 SVDKKLVICGDTAGNVGLWNVRETQPEDELEEPDI---------------------TKVK 240
S D KL+ G + LW++R + + E + D+ T++K
Sbjct: 277 SPDSKLLAAGFDNSCIKLWSLRSKKLKSEPHQVDVSRIHLACDILEEEDDEDDNAGTEMK 336
Query: 241 LFTKNVGRI--DTYATDSSRLLAASYDGYLRSINLQDMNSEEILVLKNEYDDPLGISDFQ 298
+ + G + + DSS LL+ S D SI D+ S VL + P+ D
Sbjct: 337 ILRGHCGPVYSTRFLADSSGLLSCSED---MSIRYWDLGSFTNTVLYQGHAYPVWDLDI- 392
Query: 299 FNYNDPNVLFMTTLSGEFT----TFDVRTKPTEINLKRLSDKKIGSFSINPKRPYEIATG 354
P L+ + S + T +FD RT P I L+D + +P Y +ATG
Sbjct: 393 ----SPYSLYFASGSHDRTARLWSFD-RTYPLRIYAGHLAD--VDCVKFHPNSNY-LATG 444
Query: 355 SLDRTLKIWDTRKIVNKPEWSQYEDFASHEIVATYDSRLSVSAVSYSPMDETLVCNGYDD 414
S D+T+++W + + + F H R V ++++SP + L G D
Sbjct: 445 STDKTVRLWSAQ------QGNSVRLFTGH--------RGPVLSLAFSPNGKYLASAGEDQ 490
Query: 415 TIRLFDV-SGTLPEDLQPKLTLKHNCQTGRWTSILKARFKLNMDVFAIANMKRAIDIY 471
++L+D+ SGTL ++L+ G +I F + + A A+M ++ ++
Sbjct: 491 RLKLWDLASGTLYKELR-----------GHTDNITSLTFSPDSGLIASASMDNSVRVW 537
>Hs22057346 [R] KOG1334 WD40 repeat protein
Length = 779
Score = 52.0 bits (123), Expect = 2e-06
Identities = 39/127 (30%), Positives = 64/127 (49%), Gaps = 8/127 (6%)
Query: 303 DPNVLFMTTLSGEFT---TFDVRTK--PTEINLKRLSDKKIGSFSI--NPKRPYEIATGS 355
+P+ + SGE T D+R +++ + R +DKK+G ++I NP Y+ A G
Sbjct: 475 EPDSPYKFLTSGEDAVVFTIDLRQDRPASKVVVTRENDKKVGLYTITVNPANTYQFAVGG 534
Query: 356 LDRTLKIWDTRKIVNKPEWSQYEDFASHEIVATYDSRLSVSAVSYSPMDETLVCNGYDDT 415
D+ ++I+D RKI K + F H +V D +++ V YS L+ + DD
Sbjct: 535 QDQFVRIYDQRKIDKKENNGVLKKFTPHHLV-NCDFPTNITCVVYSHDGTELLASYNDDD 593
Query: 416 IRLFDVS 422
I LF+ S
Sbjct: 594 IYLFNSS 600
>SPBC1A4.07c [A] KOG0268 Sof1-like rRNA processing protein (contains WD40
repeats)
Length = 436
Score = 51.2 bits (121), Expect = 4e-06
Identities = 51/190 (26%), Positives = 86/190 (44%), Gaps = 24/190 (12%)
Query: 239 VKLFTKNVGRID-TYATDSSRL-LAASYDGYLRSINLQDMN----SEEILVLKNEYD-DP 291
V + K G++ +Y DSS L + S G L + + ++++ S + V K E+ D
Sbjct: 130 VYMLNKQDGKVKRSYLGDSSLLDIDTSKGGDLFATSGENVSIWDYSRDTPVTKFEWGADT 189
Query: 292 LGISDFQFNYNDPNVLFMTTLSGEFTTFDVRTKPTEINLKRLSDKKIGSFSINPKRPYEI 351
L + +FNY + +VL + +D+RT L ++ + S S NP +
Sbjct: 190 LPV--VKFNYTETSVLASAGMDRSIVIYDLRTSSPLTKL--ITKLRTNSISWNPMEAFNF 245
Query: 352 ATGSLDRTLKIWDTRKIVNKPEWSQYEDFASHEIVATYDSRLSVSAVSYSPMDETLVCNG 411
GS D L ++D R + K Y+D S +V +V +SP + V
Sbjct: 246 VAGSEDHNLYMYDMRNL--KRALHVYKDHVS-----------AVMSVDFSPTGQEFVSGS 292
Query: 412 YDDTIRLFDV 421
YD TIR+++V
Sbjct: 293 YDKTIRIYNV 302
>SPBC1711.16 [S] KOG0270 WD40 repeat-containing protein
Length = 516
Score = 50.4 bits (119), Expect = 6e-06
Identities = 37/130 (28%), Positives = 61/130 (46%), Gaps = 9/130 (6%)
Query: 239 VKLFTKN---VGRIDTYATDSSRLLAASYDGYLRSINLQDMNSEEILVLKNEYDDPLGIS 295
VK FT + V +D Y+ S LL+ SYD ++ + D+ EE + + +
Sbjct: 290 VKSFTYHSDKVSCLDWYSKAPSVLLSGSYD---KTAKIADLRLEEA---PSSFQVTSDVE 343
Query: 296 DFQFNYNDPNVLFMTTLSGEFTTFDVRTKPTEINLKRLSDKKIGSFSINPKRPYEIATGS 355
+ ++ + N F+ T +G D R + + D I S+NP P +ATGS
Sbjct: 344 NVAWDQHSENNFFIGTDNGIVYYCDARNLSKSVWQLQAHDGPISCLSVNPSVPSFVATGS 403
Query: 356 LDRTLKIWDT 365
DR +K+W+T
Sbjct: 404 TDRVVKLWNT 413
Database: KOG eukaryal database 04/03
Posted date: Apr 14, 2003 1:07 PM
Number of letters in database: 30,389,216
Number of sequences in database: 60,738
Lambda K H
0.315 0.133 0.381
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 29,206,887
Number of Sequences: 60738
Number of extensions: 1274905
Number of successful extensions: 4275
Number of sequences better than 1.0e-05: 8
Number of HSP's better than 0.0 without gapping: 1
Number of HSP's successfully gapped in prelim test: 7
Number of HSP's that attempted gapping in prelim test: 4250
Number of HSP's gapped (non-prelim): 15
length of query: 512
length of database: 30,389,216
effective HSP length: 111
effective length of query: 401
effective length of database: 23,647,298
effective search space: 9482566498
effective search space used: 9482566498
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (22.0 bits)