ORF           STATUS  Function  Best COG  Functional category              Pathways and functional systems
r_klactI2091  good    A         KOG1258   RNA processing and modification  mRNA processing protein
Only best alignment is shown:
BLASTP 2.2.3 [May-13-2002]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= r_klactI2091 714866 713022 -615
(615 letters)
Database: KOG eukaryal database 04/03
60,738 sequences; 30,389,216 total letters
Searching..................................................done
                                                                      Score  E
Sequences producing significant alignments:                          (bits)  Value
YML046w    [A] KOG1258 mRNA processing protein                          483  e-136
SPBC4B4.09 [A] KOG1258 mRNA processing protein                          164  5e-40
At1g04080  [A] KOG1258 mRNA processing protein                          119  1e-26
At5g46400  [A] KOG1258 mRNA processing protein                          112  2e-24
7301703    [A] KOG1258 mRNA processing protein                           79  3e-14
YDR235w    [A] KOG1258 mRNA processing protein                           66  2e-10
CE28000    [A] KOG1258 mRNA processing protein                           65  4e-10
At1g17760  [A] KOG1914 mRNA cleavage and polyadenylation factor I...     64  9e-10
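The hit summary lines above have a regular shape (subject ID, bracketed functional category, KOG number, description, bit score, E value), so they can be read mechanically. The following is a minimal Python sketch of such a reader, not part of BLAST itself. The names Hit, HIT_RE, parse_evalue and parse_hit are hypothetical, and the pattern assumes every line carries exactly one bracketed category and a KOG identifier, as in this report.

```python
import re
from typing import NamedTuple


class Hit(NamedTuple):
    """One row of the 'Sequences producing significant alignments' list."""
    subject: str
    category: str
    kog: str
    description: str
    bits: float
    evalue: float


# Assumed layout: <id> [<category>] <KOGnnnn> <description> <bits> <E value>
HIT_RE = re.compile(
    r"^(\S+)\s+\[([A-Z]+)\]\s+(KOG\d+)\s+(.*?)\s+(\d+(?:\.\d+)?)\s+(\S+)$"
)


def parse_evalue(text: str) -> float:
    """Convert an E value string to float.

    This report abbreviates 1e-136 as 'e-136', so a leading 'e'
    gets an implicit mantissa of 1.
    """
    return float("1" + text if text.startswith("e") else text)


def parse_hit(line: str) -> Hit:
    """Parse a single hit summary line; raise ValueError if it does not match."""
    match = HIT_RE.match(line.strip())
    if match is None:
        raise ValueError(f"not a hit summary line: {line!r}")
    subject, category, kog, description, bits, evalue = match.groups()
    return Hit(subject, category, kog, description, float(bits), parse_evalue(evalue))


if __name__ == "__main__":
    for raw in (
        "YML046w [A] KOG1258 mRNA processing protein 483 e-136",
        "At1g17760 [A] KOG1914 mRNA cleavage and polyadenylation factor I... 64 9e-10",
    ):
        print(parse_hit(raw))
```

The pattern splits on runs of whitespace, so it does not depend on column alignment. A truncated description such as "factor I..." is kept exactly as printed.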
>YML046w [A] KOG1258 mRNA processing protein
Length = 629
Score = 483 bits (1242), Expect = e-136
Identities = 245/617 (39%), Positives = 379/617 (60%), Gaps = 16/617 (2%)
Query: 4 GLDQTFLKNNDEWVQSYNSVDWNDITTIDKLVVGTEALVQKYNNPNEHVKSNIYKVFDEL 63
GLD FL++N VQ+Y +DW+DI+++ ++V E V KY NPN+ +K + + ++
Sbjct: 21 GLDTQFLQDNTALVQAYRGLDWSDISSLTQMVDVIEQTVVKYGNPNDSIKLALETILWQI 80
Query: 64 LGRYPLFFGYWKRYVAVKYQLDGLEGSISTLKASLHSFPTSIDLWIDMLNVYLTHNQNDS 123
L +YPL FG+WKR+ ++YQL GL+ SI+ L S+ FPTS++LW D LNV +N N++
Sbjct: 81 LRKYPLLFGFWKRFATIEYQLFGLKKSIAVLATSVKWFPTSLELWCDYLNVLCVNNPNET 140
Query: 124 ELIRNQFRKCESLVGSHFLSHDIWDKHIAYETRLQNWENVFEVYKQVMQQPLHQYARYYT 183
+ IRN F + L+G FLSH WDK I +E +NW NV +Y+ +++ PLHQYAR++T
Sbjct: 141 DFIRNNFEIAKDLIGKQFLSHPFWDKFIEFEVGQKNWHNVQRIYEYIIEVPLHQYARFFT 200
Query: 184 SFKEFLEYHPEFANRESSIHLDTTFISNQEKVNKIWTYESQIKQPFFNIPELPENEIQNW 243
S+K+FL N +++ ++D Q VN+IW +ES+IKQPFFN+ ++ ++++NW
Sbjct: 201 SYKKFLNE----KNLKTTRNIDIVLRKTQTTVNEIWQFESKIKQPFFNLGQVLNDDLENW 256
Query: 244 DAYLSFLIN-NTEFSTELIKSTFERCLIPCLRYEYFWGAYIDW-TERTFGPECTFPLFER 301
YL F+ + + E + S F+RCLIPCL +E W YI W T++ E ++++
Sbjct: 257 SRYLKFVTDPSKSLDKEFVMSVFDRCLIPCLYHENTWMMYIKWLTKKNISDEVVVDIYQK 316
Query: 302 ALRALPADNKSFKQKYIKYLESNMDPYNKLSSKHYMDALYTFQLKWPHDPSSTIKYLRFH 361
A LP D K+ + ++++L+ N L + + + + + WP+D +YL
Sbjct: 317 ANTFLPLDFKTLRYDFLRFLKRKYRSNNTLFNNIFNETVSRYLKIWPNDILLMTEYLCML 376
Query: 362 KRRYFAASLNDEDKKILEQQSKYATFLDKTIKAYLSGNADTENHDISQQLLAMLNDTTLP 421
KR F SL+ K+ILE+Q+ + L+ +I Y++ D + H L ++ND L
Sbjct: 377 KRHSFKNSLDQSPKEILEKQTSFTKILETSITNYINNQIDAKVH-----LQTLINDKNLS 431
Query: 422 ILVVELIKVHWLVLKNVIQCRKHFTYFAKLDPIKTSVLFWLTYYKFEKTQKNIARLTKFV 481
I+VVELIK WLVLKN +Q RK+F + K IK SV FWLTYYKFEK+ N +L KF+
Sbjct: 432 IVVVELIKTTWLVLKNNMQTRKYFNLYQKNILIKNSVPFWLTYYKFEKSNVNFTKLNKFI 491
Query: 482 DQLGTEIVLPTKVINDIVRDFQGFYLINADYSEYETNLSQSRFGFDPIVHNRFKINNPTW 541
+LG EI LPT V+NDI+ D++ FYL +++ YE+++ S FDPI++ K++NP +
Sbjct: 492 RELGVEIYLPTTVMNDILTDYKTFYLTHSNIVTYESSIIDSN-TFDPILYPELKMSNPKY 550
Query: 542 RP--NAKLTKDWYKSEKYRSNGHPGLFIDKPQIKNTIIETLASKQL-NPAPLPTFRNLEK 598
P N DW+K +++ GH G+ ++PQI N+IIE + + P LP FRNLEK
Sbjct: 551 DPVLNTTANVDWHKKTEWKEAGHIGITTERPQISNSIIECNSGTLIQKPISLPNFRNLEK 610
Query: 599 IHQKPKFDDYMSIDYLK 615
I+Q K +D + ++LK
Sbjct: 611 INQ-VKINDLYTEEFLK 626
>SPBC4B4.09 [A] KOG1258 mRNA processing protein
Length = 612
Score = 164 bits (414), Expect = 5e-40
Identities = 145/626 (23%), Positives = 261/626 (41%), Gaps = 108/626 (17%)
Query: 9 FLKNNDEWVQSYNSVDWN--DITTIDKLVVGTEALVQKYN-NPNEHVKSNIYKVFDELLG 65
++ EW + ++ N D + LV +E L N ++ + + V+D LG
Sbjct: 7 YIAEETEWDKYNRQINKNPDDFDAWEGLVRASEHLEGGVGRNSSKQAINTLRSVYDRFLG 66
Query: 66 RYPLFFGYWKRYVAVKYQLDGLEGSISTLKASLHSFPTSIDLWIDMLNVYLTHNQNDSEL 125
+YPL FGYWK+Y ++ + G E S + + P S+DLW + + N D+
Sbjct: 67 KYPLLFGYWKKYADFEFFVAGAEASEHIYERGIAGIPHSVDLWTNYCAFKMETN-GDANE 125
Query: 126 IRNQFRKCESLVGSHFLSHDIWDKHIAYETRLQNWENVFEVYKQVMQQPLHQYARYYTSF 185
+R F + ++VG FLSH WDK++ +E R + +NVF++ ++++ PLHQYARY+ F
Sbjct: 126 VRELFMQGANMVGLDFLSHPFWDKYLEFEERQERPDNVFQLLERLIHIPLHQYARYFERF 185
Query: 186 KEF-------------------------------------------LEYHPEFANRESSI 202
+ LE E R +I
Sbjct: 186 VQVSQSQPIQQLLPPDVLASIRADVTREPAKVVSAGSKQITVERGELEIEREMRARIYNI 245
Query: 203 HLDTTFISNQEKVNKIWTYESQIKQPFFNIPELPENEIQNWDAYLSFLINNTEFSTELIK 262
HL F Q + K WT+ES+IK+P+F++ EL E ++ NW YL F E + I
Sbjct: 246 HLQI-FQKVQLETAKRWTFESEIKRPYFHVKELDEAQLVNWRKYLDF--EEVEGDFQRIC 302
Query: 263 STFERCLIPCLRYEYFWGAYIDW-TERTFGPECTFPLFERALRALPADNK-SFKQKYIKY 320
+ERCLI C Y+ FW Y W + + ++ERA + ++ + +Y +
Sbjct: 303 HLYERCLITCALYDEFWFRYARWMSAQPDHLNDVSIIYERASCIFASISRPGIRVQYALF 362
Query: 321 LESNMDPYNKLSSKHYMDALYTFQLKWPHDPSSTIKYLRFHKRRYFAASLNDEDKKILEQ 380
ES N S+K ++ T + P + + + ++ +R
Sbjct: 363 EESQ---GNIASAKAIYQSILT---QLPGNLEAVLGWVGLERRN---------------- 400
Query: 381 QSKYATFLDKTIKAYLSGNADTEN-HDISQQLL--AMLNDTTLPILVVELIKVHWLVLKN 437
+ N D N H + + ++ N +L+ E IK+ W + +
Sbjct: 401 ----------------APNYDLTNAHAVLRSIINEGKCNTGITEVLITEDIKLVWKIEGD 444
Query: 438 VIQCRKHFTYFAKLDPIKTSVLFWLTYYKFEKTQ--------KNIARLTKFVDQLGTEIV 489
+ R F A + FW+++ +FE Q ++ AR++ ++ + +
Sbjct: 445 IELARNMFLQNA--PALLDCRHFWISFLRFELEQPLNSKNYTEHHARVSNVMEMIRNKTR 502
Query: 490 LPTKVINDIVRDFQGFYLINADYSEYETNLSQSRFGFDPIVHNRFKINNPTWRP-NAKLT 548
LP + I D+ + Y+ + + ++ Q D V F + W+ +
Sbjct: 503 LPPRTIMDLTK----LYMEYLCHQSNDPSVLQEYLLIDRDVFGPFSVRESHWKKLDEGQD 558
Query: 549 KDWYKSEKYRSNGHPGLFIDKPQIKN 574
+ +NGHPG+ +++ +IK+
Sbjct: 559 LKQVSTRLLSTNGHPGISVNEAKIKS 584
>At1g04080 [A] KOG1258 mRNA processing protein
Length = 768
Score = 119 bits (298), Expect = 1e-26
Identities = 76/316 (24%), Positives = 135/316 (42%), Gaps = 60/316 (18%)
Query: 25 WNDITTIDKLVVGTEALVQKYNNPNEHVKSNIYKVFDELLGRYPLFFGYWKRYVAVKYQL 84
WN + AL+ + + + I KV+D L +PL +GYWK++ + ++
Sbjct: 90 WNIVRANSLEFNAWTALIDETERIAQDNIAKIRKVYDAFLAEFPLCYGYWKKFADHEARV 149
Query: 85 DGLEGSISTLKASLHSFPTSIDLWIDMLNVYLTHNQNDSELIRNQFRKCESLVGSHFLSH 144
++ + + ++ S+D+W+ + + D E IR F + VG+ FLS
Sbjct: 150 GAMDKVVEVYERAVLGVTYSVDIWLHYCT-FAINTYGDPETIRRLFERALVYVGTDFLSS 208
Query: 145 DIWDKHIAYETRLQNWENVFEVYKQVMQQPLHQYARYYTSFKEFLEYHP----------- 193
+WDK+I YE Q+W V +Y ++++ P+ RY++SFKE E P
Sbjct: 209 PLWDKYIEYEYMQQDWSRVALIYTRILENPIQNLDRYFSSFKELAETRPLSELRSAEESA 268
Query: 194 -----------EFANRES-----------------SIHLDTTFISNQEKVNK-------- 217
E A ES S L++ + E++ K
Sbjct: 269 AAAVAVAGDASESAASESGEKADEGRSQVDGSTEQSPKLESASSTEPEELKKYVGIREAM 328
Query: 218 ----------IWTYESQIKQPFFNIPELPENEIQNWDAYLSFLINNTEFSTELIKSTFER 267
I YE I++P+F++ L E++NW YL F+ + +F+ + +ER
Sbjct: 329 YIKSKEFESKIIGYEMAIRRPYFHVRPLNVAELENWHNYLDFIERDGDFNK--VVKLYER 386
Query: 268 CLIPCLRYEYFWGAYI 283
C++ C Y +W Y+
Sbjct: 387 CVVTCANYPEYWIRYV 402
>At5g46400 [A] KOG1258 mRNA processing protein
Length = 1022
Score = 112 bits (279), Expect = 2e-24
Identities = 71/283 (25%), Positives = 124/283 (43%), Gaps = 51/283 (18%)
Query: 59 VFDELLGRYPLFFGYWKRYVAVKYQLDGLEGSISTLKASLHSFPTSIDLWIDMLNVYLTH 118
V+D L +PL GYW++Y K +L LE ++ + ++ + S+ +W+D +
Sbjct: 70 VYDAFLLEFPLCHGYWRKYAYHKIKLCTLEDAVEVFERAVQAATYSVAVWLDYC-AFAVA 128
Query: 119 NQNDSELIRNQFRKCESLVGSHFLSHDIWDKHIAYETRLQNWENVFEVYKQVMQQPLHQY 178
D + F + S +G + +WDK+I Y Q W ++ VY + ++ P +
Sbjct: 129 AYEDPHDVSRLFERGLSFIGKDYSCCTLWDKYIEYLLGQQQWSSLANVYLRTLKYPSKKL 188
Query: 179 ARYYTSFKEFLE-------------------------YHPEFANRESSIHLDT------- 206
YY +F++ H + E SI +
Sbjct: 189 DLYYKNFRKIAASLKEKIKCRIDVNGDLSSDPMEEDLVHTRHTDEEISIVVRELMGPSSS 248
Query: 207 --------TFIS--------NQEKVNKIWTYESQIKQPFFNIPELPENEIQNWDAYLSFL 250
T++S +++ + KI +E+QI++P+F++ L N++ NW AYLSF
Sbjct: 249 SAVSKALHTYLSIGEQFYQDSRQLMEKISCFETQIRRPYFHVKPLDTNQLDNWHAYLSFG 308
Query: 251 INNTEFSTELIKSTFERCLIPCLRYEYFWGAYIDWTERTFGPE 293
+F + + +ERCLIPC Y FW Y+D+ E G E
Sbjct: 309 ETYGDFDWAI--NLYERCLIPCANYTEFWFRYVDFVESKGGRE 349
>7301703 [A] KOG1258 mRNA processing protein
Length = 1009
Score = 78.6 bits (192), Expect = 3e-14
Identities = 34/144 (23%), Positives = 68/144 (46%)
Query: 49 NEHVKSNIYKVFDELLGRYPLFFGYWKRYVAVKYQLDGLEGSISTLKASLHSFPTSIDLW 108
NE + +D L YP +GYW++Y + + + L + P S+DLW
Sbjct: 334 NESDAEAAREAYDTFLSHYPYCYGYWRKYADYEKRKGIKANCYKVFERGLEAIPLSVDLW 393
Query: 109 IDMLNVYLTHNQNDSELIRNQFRKCESLVGSHFLSHDIWDKHIAYETRLQNWENVFEVYK 168
I L +++ +D +R+Q+ + G F S +WD +I +E + + V ++Y
Sbjct: 394 IHYLMHVKSNHGDDETFVRSQYERAVKACGLEFRSDKLWDAYIRWENESKRYHRVVQIYD 453
Query: 169 QVMQQPLHQYARYYTSFKEFLEYH 192
+++ P Y ++ +F++ + H
Sbjct: 454 RLLAIPTQGYNGHFDNFQDLINQH 477
Score = 60.5 bits (145), Expect = 7e-09
Identities = 41/142 (28%), Positives = 66/142 (45%), Gaps = 14/142 (9%)
Query: 197 NRESSIHLDTTFISNQEKVNKI--------WTYESQIKQPFFNIPELPENEIQNWDAYLS 248
N E + + IS + KV+K+ W++E IK+P+F++ L +++NW YL
Sbjct: 597 NDEEVVSIRDRAISARRKVHKLTVSAVTARWSFEEGIKRPYFHVKPLERAQLKNWKDYLD 656
Query: 249 FLINNTEFSTELIKSTFERCLIPCLRYEYFW---GAYIDWTERTFG-PECTFPLFERALR 304
F I + E + FERCLI C Y+ FW Y++ E G + ++ RA R
Sbjct: 657 FEIEKGD--RERVLVLFERCLIACALYDEFWLKMLRYLESLEDQSGVVDLVRDVYRRACR 714
Query: 305 ALPADNKSFKQKYIKYLESNMD 326
D S + + E M+
Sbjct: 715 IHHPDKPSLHLMWAAFEECQMN 736
>YDR235w [A] KOG1258 mRNA processing protein
Length = 544
Score = 65.9 bits (159), Expect = 2e-10
Identities = 54/263 (20%), Positives = 106/263 (39%), Gaps = 40/263 (15%)
Query: 60 FDELLGRYPLFFGYWKRYVAVKYQLDGLEGSISTLKASLHSF-PTSIDLWIDMLNVYLTH 118
+ +L +P Y+ + ++Y+L + S + L +F S+ LW L
Sbjct: 60 YSSMLNEFPYLENYYIDFALLEYKLGNVSMSHKIFQRGLQAFNQRSLLLWTSYLKFCNNV 119
Query: 119 NQNDSELIRNQFRKCESLVGSHFLSHDIWDKHI-AYETRLQNWENVFEVYKQVMQQPLHQ 177
+ +L + ++ E VG HF S + WD ++ +R + + + V +++++ PLH
Sbjct: 120 ISHQKQLFK-KYETAEEYVGLHFFSGEFWDLYLEQISSRCTSSKKYWNVLRKILEIPLHS 178
Query: 178 YARYY-------------------TSFKEFLE--------------YHPEFANRESSIHL 204
++++Y TS E L+ Y + + I
Sbjct: 179 FSKFYALWLQRIDDIMDLKQLSQLTSKDELLKKLKIDINYSGRKGPYLQDAKKKLKKITK 238
Query: 205 DTTFISNQEKVNKIWTYESQIKQPFFNIPE--LPENEIQNWDAYLSFLINNTEFSTELIK 262
+ + + + +ES+I ++ PE + +EI+ W YL + I T + L
Sbjct: 239 EMYMVVQYQVLEIYSIFESKIYINYYTSPETLVSSDEIETWIKYLDYTI--TLQTDSLTH 296
Query: 263 STFERCLIPCLRYEYFWGAYIDW 285
F+R L+P Y+ W Y W
Sbjct: 297 LNFQRALLPLAHYDLVWIKYSKW 319
>CE28000 [A] KOG1258 mRNA processing protein
Length = 710
Score = 64.7 bits (156), Expect = 4e-10
Identities = 30/70 (42%), Positives = 43/70 (60%), Gaps = 2/70 (2%)
Query: 221 YESQIKQPFFNIPELPENEIQNWDAYLSFLINNTEFSTELIKSTFERCLIPCLRYEYFWG 280
+E+ IK+P+F++ L ++ NW +YL F I E E +K F+RCLIPC YE FW
Sbjct: 365 FEANIKRPYFHVKPLDYPQLFNWMSYLDFEIK--EGHEERVKILFDRCLIPCSLYEEFWI 422
Query: 281 AYIDWTERTF 290
Y WT +T+
Sbjct: 423 KYARWTWKTY 432
>At1g17760 [A] KOG1914 mRNA cleavage and polyadenylation factor I complex
subunit RNA14
Length = 722
Score = 63.5 bits (153), Expect = 9e-10
Identities = 87/451 (19%), Positives = 167/451 (36%), Gaps = 70/451 (15%)
Query: 59 VFDELLGRYPLFFGYWKRYVAVKYQLDGLEGSISTLKASLHSFPTSIDLWIDMLN----V 114
++++LL YP +WK+YV + ++ + + L + + LW + V
Sbjct: 28 IYEQLLSLYPTSARFWKQYVEAQMAVNNDDATKQIFSRCLLTC-LQVPLWQCYIRFIRKV 86
Query: 115 YLTHNQNDSELIRNQFRKCESLVGSHFLSHDIWDKHIAY---------ETRLQNWENVFE 165
Y E F + +G+ S IW ++IA+ L + +
Sbjct: 87 YDKKGAEGQEETTKAFEFMLNYIGTDIASGPIWTEYIAFLKSLPALNLNEDLHRKTALRK 146
Query: 166 VYKQVMQQPLHQYARYYTSFKEFLEYHPEFANRESSIHLDTTFISNQEKVNKIWTYESQ- 224
VY + + P H + + ++ F NR+ + L + ++ +
Sbjct: 147 VYHRAILTPTHHVEQLWKDYENF----ENTVNRQLAKGLVNEYQPKFNSARAVYRERKKY 202
Query: 225 IKQPFFNIPELP-------ENEIQNWDAYLSFLINN-----TEFSTELIKSTFERCLIPC 272
I++ +N+ +P E + W +LSF N T ST+ I +E+CL+
Sbjct: 203 IEEIDWNMLAVPPTGTSKEETQWVAWKKFLSFEKGNPQRIDTASSTKRIIYAYEQCLMCL 262
Query: 273 LRYEYFWGAYIDWTERTFGPECTFPLFERALRALPADNKSFKQKYIKYLESNMDPYNKLS 332
Y W Y +W ++ + +F+RAL+A+P D++ K + + ES
Sbjct: 263 YHYPDVWYDYAEWHVKSGSTDAAIKVFQRALKAIP-DSEMLKYAFAEMEESR-------- 313
Query: 333 SKHYMDALYTFQLKWPHDPSSTIKYLRFHKRRYFAASLNDEDKKILEQQSK----YATFL 388
I+YLRF +R A + K L+ + Y ++
Sbjct: 314 --------------------GAIQYLRFLRR---AEGVEAARKYFLDARKSPSCTYHVYI 350
Query: 389 DKTIKAYLSGNADTENHDISQQLLAMLNDTTLPILVVELIKVHWLVLKNVIQCRKHFTYF 448
A+ H+I ++ L + + IL +N+ R F
Sbjct: 351 AFATMAFCIDKEPKVAHNIFEEGLKLYMSEPVYILKYADFLTRLNDDRNI---RALFERA 407
Query: 449 AKLDPIKTSVLFWLTYYKFEKTQKNIARLTK 479
P++ S W + +FE+T ++A + K
Sbjct: 408 LSTLPVEDSAEVWKRFIQFEQTYGDLASILK 438
Database: KOG eukaryal database 04/03
Posted date: Apr 14, 2003 1:07 PM
Number of letters in database: 30,389,216
Number of sequences in database: 60,738
Lambda K H
0.320 0.137 0.421
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 39,045,050
Number of Sequences: 60738
Number of extensions: 1733590
Number of successful extensions: 4672
Number of sequences better than 1.0e-05: 8
Number of HSP's better than 0.0 without gapping: 4
Number of HSP's successfully gapped in prelim test: 4
Number of HSP's that attempted gapping in prelim test: 4631
Number of HSP's gapped (non-prelim): 20
length of query: 615
length of database: 30,389,216
effective HSP length: 112
effective length of query: 503
effective length of database: 23,586,560
effective search space: 11864039680
effective search space used: 11864039680
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
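The statistics above are enough to re-derive the reported search space, bit scores and E values. The sketch below uses the standard Karlin-Altschul relations: effective lengths are the raw lengths minus the effective HSP length (once per database sequence), the bit score is (Lambda * S - ln K) / ln 2 with the gapped Lambda and K, and E is the search space times 2 to the minus bit score. The function and constant names are hypothetical, and the BLAST program itself may apply further corrections. With the values from this report the results agree with the printed figures to the displayed precision.

```python
import math

# Values copied from the report above.
LAMBDA_GAPPED = 0.267
K_GAPPED = 0.0410
QUERY_LENGTH = 615
DB_LETTERS = 30_389_216
DB_SEQUENCES = 60_738
EFFECTIVE_HSP_LENGTH = 112


def effective_search_space(query_len: int, db_letters: int,
                           db_seqs: int, hsp_len: int) -> int:
    """Effective query length times effective database length."""
    eff_query = query_len - hsp_len            # 615 - 112 = 503
    eff_db = db_letters - db_seqs * hsp_len    # one HSP length per sequence
    return eff_query * eff_db


def bit_score(raw_score: int, lam: float, k: float) -> float:
    """Normalize a raw alignment score to bits."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def expect_value(raw_score: int, lam: float, k: float, space: int) -> float:
    """Expected number of chance hits scoring at least raw_score."""
    return space * 2.0 ** (-bit_score(raw_score, lam, k))


if __name__ == "__main__":
    space = effective_search_space(
        QUERY_LENGTH, DB_LETTERS, DB_SEQUENCES, EFFECTIVE_HSP_LENGTH
    )
    print("effective search space:", space)
    # Raw scores are the parenthesized numbers on the "Score =" lines.
    for raw in (1242, 414, 298, 279):
        bits = bit_score(raw, LAMBDA_GAPPED, K_GAPPED)
        e = expect_value(raw, LAMBDA_GAPPED, K_GAPPED, space)
        print(f"raw {raw}: {bits:.1f} bits, E = {e:.1e}")
```

Only the gapped Lambda and K are used here, since all alignments shown in the report are gapped alignments scored with BLOSUM62 and the listed gap penalties.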