ORF STATUS Function Best COG Functional category Pathways and functional systems
r_klactIII2511 good A KOG1896 RNA processing and modification mRNA cleavage and polyadenylation factor II complex, subunit CFT1 (CPSF subunit)
Only best alignment is shown:
BLASTP 2.2.3 [May-13-2002]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= r_klactIII2511 883305 879406 -1300
(1300 letters)
Database: KOG eukaryal database 04/03
60,738 sequences; 30,389,216 total letters
Searching..................................................done
Color Key for Alignment Scores:
Score E
Sequences producing significant alignments: (bits) Value
YDR301w [A] KOG1896 mRNA cleavage and polyadenylation factor II ... 1301 0.0
SPBC1709.08 [A] KOG1896 mRNA cleavage and polyadenylation factor... 236 2e-61
7303176 [A] KOG1896 mRNA cleavage and polyadenylation factor II ... 163 1e-39
Hs9558725 [A] KOG1896 mRNA cleavage and polyadenylation factor I... 154 7e-37
Hs22047447 [A] KOG1896 mRNA cleavage and polyadenylation factor ... 148 6e-35
CE25609 [A] KOG1896 mRNA cleavage and polyadenylation factor II ... 114 1e-24
ECU11g0610 [A] KOG1896 mRNA cleavage and polyadenylation factor ... 100 2e-20
At5g51660 [A] KOG1896 mRNA cleavage and polyadenylation factor I... 99 3e-20
>YDR301w [A] KOG1896 mRNA cleavage and polyadenylation factor II complex
subunit CFT1 (CPSF subunit)
Length = 1357
Score = 1301 bits (3368), Expect = 0.0
Identities = 651/1354 (48%), Positives = 933/1354 (68%), Gaps = 59/1354 (4%)
Query: 1 MNVFDEILQPTVVNKCLHGNFTSAEREEYVVARTNVLSVFRVSRAQKLVLAYEWKLAGKI 60
MNV+D++L TVV+ L +FT+++ EE +V RTN+LSV+R +R KL L E+K G I
Sbjct: 1 MNVYDDVLDATVVSHSLATHFTTSDYEELLVVRTNILSVYRPTRDGKLYLTDEFKFHGLI 60
Query: 61 IDMQLLPQIGSPLKMLAILSSKSKVSLVRFDPVAESLETLSLHYYHDKFVNLSTSSLKTE 120
D+ L+PQ SPL L + + +K+S+++F+ + S++TLSLHYY KF S L
Sbjct: 61 TDIGLIPQKDSPLSCLLLCTGVAKISILKFNTLTNSIDTLSLHYYEGKFKGKSLVELAKI 120
Query: 121 SIMAVDPLFRCLLVFNEDVLAILPLKLNTED---MEIDEDENGIKEPMA----------- 166
S + +DP C L+FN D++A LP +N D E DEDEN +
Sbjct: 121 STLRMDPGSSCALLFNNDIIAFLPFHVNKNDDDEEEEDEDENIDDSELIHSMNQKSQGTN 180
Query: 167 -----KRLKRNQGITSDSIIMPISSLHKSLKHVYDIKWLNNFSKPTVGILYQPVLAWCGN 221
KR K T+ S+++ S L++ K++ DI++L NF+KPT+ +LYQP L W GN
Sbjct: 181 TFNKRKRTKLGDKFTAPSVVLVASELYEGAKNIIDIQFLKNFTKPTIALLYQPKLVWAGN 240
Query: 222 EKVLGNTMRYMVLSLDVEDEKT------TVIAELADLPNDLHTLVPLKRGYVLIGVNELL 275
+ +Y++L+L+++ ++ T IA + +LP DLHT+VP+ G +++G NEL
Sbjct: 241 TTISKLPTQYVILTLNIQPAESATKIESTTIAFVKELPWDLHTIVPVSNGAIIVGTNELA 300
Query: 276 YISASGALQSCIRLNTFATSSIN-TRITDNSDMNIFLSK---SSIYFYKALKRH------ 325
++ +G LQS + LN+FA + T+I +NS + I + +SI+ + ++
Sbjct: 301 FLDNTGVLQSTVLLNSFADKELQKTKIINNSSLEIMFREKNTTSIWIPSSKSKNGGSNND 360
Query: 326 DLLILIDENCRMYNIITESEGNLLTKFDCVQVPIVNEIFKNSRLPLSVCGDLNLETGR-- 383
+ L+L+D +Y I E+EG LL KFD ++PIVN++ K + P + LN
Sbjct: 361 ETLLLMDLKSNIYYIQMEAEGRLLIKFDIFKLPIVNDLLKENSNPKCITR-LNATNSNKN 419
Query: 384 --VLIGFLSGDAMFLQLKNLKVAFAAKR---------QLVETVDDDDDEYSALYGESQ-- 430
+ IGF SG+A+ L+L NLK + L++ DDDD+E LY +
Sbjct: 420 MDLFIGFGSGNALVLRLNNLKSTIETREAHNPSSGTNSLMDINDDDDEEMDDLYADEAPE 479
Query: 431 ----NNTHTRIVETQEPFDISLLDSIFNIGPLTSLTIGKVASVEPTIQRLPNPNKDEFSI 486
N VET +PFDI LL S+ N+GP+TSLT+GKV+S++ ++ LPNPNK+E+S+
Sbjct: 480 NGLTTNDSKGTVETVQPFDIELLSSLRNVGPITSLTVGKVSSIDDVVKGLPNPNKNEYSL 539
Query: 487 VATSGVGRGSHLTALHSTVQPHIEQALKFTSATRIWNLKIKGKDKYLVTTDADKEKSDVY 546
VATSG G GSHLT + ++VQP IE ALKF S T+IWNLKIKG+D+YL+TTD+ K +SD+Y
Sbjct: 540 VATSGNGSGSHLTVIQTSVQPEIELALKFISITQIWNLKIKGRDRYLITTDSTKSRSDIY 599
Query: 547 QIDRNFEPFRAQDFRKDSRTIGMETMDDDKRILQVTSGGLYLFDVDFKRLARLTIDIEIV 606
+ D NF+ + R+D+ T+ + ++KRI+QVT+ LYL+D F+RL + D E++
Sbjct: 600 ESDNNFKLHKGGRLRRDATTVYISMFGEEKRIIQVTTNHLYLYDTHFRRLTTIKFDYEVI 659
Query: 607 HACIIDPYILFTDARGNIKIYQLDSXXXXXXXXXXLPEALNEIIITSGSIFKSNICNKFL 666
H ++DPYIL T +RG+IKI++L+ LPE LNE++ITSG I KSN+CN+FL
Sbjct: 660 HVSVMDPYILVTVSRGDIKIFELEEKNKRKLLKVDLPEILNEMVITSGLILKSNMCNEFL 719
Query: 667 HGLENSSQEQLLFTFVTGDNQVIFFTEKHNDRIFQLNGVDQLEDMLFISTYQIPEEMNPD 726
GL S +EQLLFTFVT DNQ+IFFT+ HNDRIFQLNGVDQL + L+ISTYQ+ +E+ PD
Sbjct: 720 IGLSKSQEEQLLFTFVTADNQIIFFTKDHNDRIFQLNGVDQLNESLYISTYQLGDEIVPD 779
Query: 727 PSIKQIMLNRLGHHKKEEFLTILTFGGEIYQYKKSTKHSGKLLK--CKSHPLITGAPNNA 784
PSIKQ+M+N+LGH KEE+LTILTFGGEIYQY+K + + + ++ ITGAP+NA
Sbjct: 780 PSIKQVMINKLGHDNKEEYLTILTFGGEIYQYRKLPQRRSRFYRNVTRNDLAITGAPDNA 839
Query: 785 YPQGVNKIERVAHYFPNYNGYSVVFITGQVPYIIIKEDNSVCRIFRMTNIPIVTMARWGK 844
Y +GV+ IER+ HYFP+YNGYSV+F+TG VPYI+IKED+S +IF+ NIP+V++ W +
Sbjct: 840 YAKGVSSIERIMHYFPDYNGYSVIFVTGSVPYILIKEDDSTPKIFKFGNIPLVSVTPWSE 899
Query: 845 NSVMCVDNIKNARVMKLDPE-CYYGNTQILRKIIIEDVVEEFETLGNIAYHERTGMYIIS 903
SVMCVD+IKNARV L + YYGN L++I I +V+++++TL + YHER ++++S
Sbjct: 900 RSVMCVDDIKNARVYTLTTDNMYYGNKLPLKQIKISNVLDDYKTLQKLVYHERAQLFLVS 959
Query: 904 YTKFIEYQALSEDGEPLVGYDPSKPNSTGYKSGLLLINPLTWNIIDRLDLSENSMVNDIK 963
Y K + Y+AL EDGE ++GYD + P++ G++SG+LLINP +W +ID++D +NS+VN+++
Sbjct: 960 YCKRVPYEALGEDGEKVIGYDENVPHAEGFQSGILLINPKSWKVIDKIDFPKNSVVNEMR 1019
Query: 964 TMLIQLNSKTRRKRELVIIGSSFVKEEDQPSTGCLLVLDITEVVAEPGKPDSNFKFKQLF 1023
+ +IQ+NSKT+RKRE +I G + ED P TG + D+ EVV EPGKPD+N+K K++F
Sbjct: 1020 SSMIQINSKTKRKREYIIAGVANATTEDTPPTGAFHIYDVIEVVPEPGKPDTNYKLKEIF 1079
Query: 1024 EEEIRGSVNAVCEISGRFMIGQSSKALVRDMQEDNSAVPVAFLDMPVFITDAKSFSNLMI 1083
+EE+ G+V+ VCE+SGRFMI QS K LVRD+QEDNS +PVAFLD+PVF+TD+KSF NL+I
Sbjct: 1080 QEEVSGTVSTVCEVSGRFMISQSQKVLVRDIQEDNSVIPVAFLDIPVFVTDSKSFGNLLI 1139
Query: 1084 IGDSMQGFTFVGFDAEPYRMIVLGKSTSKFQVMNLEFLVNNGNINFIVTDRQNHLHVLRY 1143
IGD+MQGF F+GFDAEPYRMI LG+S SKFQ M+LEFLVN G++ F TD ++HVL+Y
Sbjct: 1140 IGDAMQGFQFIGFDAEPYRMISLGRSMSKFQTMSLEFLVNGGDMYFAATDADRNVHVLKY 1199
Query: 1144 APDEANSLSGQRLVHCNSFNMFTTNNYMKLVRKHVEFGS-KTSNYIALGCQTDGSIFRMI 1202
APDE NSLSGQRLVHC+SF + +TN+ M L+ ++ EFGS + ++ +G Q DGS+F+++
Sbjct: 1200 APDEPNSLSGQRLVHCSSFTLHSTNSCMMLLPRNEEFGSPQVPSFQNVGGQVDGSVFKIV 1259
Query: 1203 PLNEASYRRFYLVQQQLLDHEIPLAGFNTKMERLDNEYYHKGHSLRPTLDSQVLKKYIHL 1262
PL+E YRR Y++QQQ++D E+ L G N +MERL N++Y GHS+RP LD V++++ L
Sbjct: 1260 PLSEEKYRRLYVIQQQIIDRELQLGGLNPRMERLANDFYQMGHSMRPMLDFNVIRRFCGL 1319
Query: 1263 PITKRTTIENRVGRHASTELWHDLIDIEFSLRSL 1296
I +R +I + GRHA E W D+I+IEFS+RSL
Sbjct: 1320 AIDRRKSIAQKAGRHAHFEAWRDIINIEFSMRSL 1353
>SPBC1709.08 [A] KOG1896 mRNA cleavage and polyadenylation factor II complex
subunit CFT1 (CPSF subunit)
Length = 1441
Score = 236 bits (601), Expect = 2e-61
Identities = 142/501 (28%), Positives = 255/501 (50%), Gaps = 19/501 (3%)
Query: 805 YSVVFITGQVPYIIIKEDNSVCRIFRMT-NIPIVTMARW----GKNSVMCVDNIKNARVM 859
+S VF+TG+ P++I+ +S + F ++ NIPI+++A + + VD R+
Sbjct: 945 HSAVFVTGRKPFLILSTLHSNAKFFPISSNIPILSVAPFHAHHAPQGYIYVDENSFIRIC 1004
Query: 860 KLDPECYYGNTQILRKIIIEDVVEEFETLGNIAYHERTGMYIISYTKFIEYQALSEDG-E 918
K + Y N +K+ + + + IAYH +Y + IE++ EDG E
Sbjct: 1005 KFQEDFEYDNKWPYKKVSLG------KQINGIAYHPTKMVYAVGSAVPIEFKVTDEDGNE 1058
Query: 919 PLVGYDPSKPNSTGYKSGLLLINPLTWNIIDRLDLSENSMVNDIKTMLIQLNSKTRRKRE 978
P D + L L++PLTW +ID + + + + + ++++ T+ ++
Sbjct: 1059 PYAITDDNDYLPMANTGSLDLVSPLTWTVIDSYEFQQFEIPLSVALVNLEVSETTKLRKP 1118
Query: 979 LVIIGSSFVKEEDQPSTGCLLVLDITEVVAEPGKPDSNFKFKQLFEEEIRGSVNAVCEIS 1038
+ +G+S K ED G + +I +VV +PG+P++ K K + EEI+G+V VCE+
Sbjct: 1119 YIAVGTSITKGEDIAVRGSTYLFEIIDVVPQPGRPETRHKLKLVTREEIKGTVAVVCEVD 1178
Query: 1039 GRFMIGQSSKALVRDMQEDNSAVPVAFLDMPVFITDAKSFSNLMIIGDSMQGFTFVGFDA 1098
G + GQ K +VR +++++ V V+F+D+ + AK NL++ GD Q TFVGF
Sbjct: 1179 GYLLSGQGQKVIVRALEDEDHLVGVSFIDLGSYTLSAKCLRNLLLFGDVRQNVTFVGFAE 1238
Query: 1099 EPYRMIVLGKSTSKFQVMNLEFLVNNGNINFIVTDRQNHLHVLRYAPDEANSLSGQRLVH 1158
EPYRM + K V +FLV N+ F+V D +L +L Y P+ S SG+RLV
Sbjct: 1239 EPYRMTLFSKGQEALNVSAADFLVQGENLYFVVADTSGNLRLLAYDPENPESHSGERLVT 1298
Query: 1159 CNSF---NMFTTNNYMKLVRKH--VEFGSKT-SNYIALGCQTDGSIFRMIPLNEASYRRF 1212
F N+ T + +KH E+G T ++ + +DG + ++P+++ YRR
Sbjct: 1299 RGDFHIGNVITAMTILPKEKKHQNAEYGYDTGDDFSCVMVNSDGGLQMLVPISDRVYRRL 1358
Query: 1213 YLVQQQLLDHEIPLAGFNTKMERLDNEYYHKGHSLRPTLDSQVLKKYIHLPITKRTTIEN 1272
++Q L + + G N K RL + + R LD ++ + ++ + R + +
Sbjct: 1359 NIIQNYLANRVNTIGGLNPKSYRLITSPSNLTNPTRRILDGMLIDYFTYMSVAHRHEMAH 1418
Query: 1273 RVGRHASTELWHDLIDIEFSL 1293
+ G ST + +DL++++ +L
Sbjct: 1419 KCGVPVST-IMNDLVELDEAL 1438
Score = 171 bits (432), Expect = 9e-42
Identities = 164/707 (23%), Positives = 305/707 (42%), Gaps = 102/707 (14%)
Query: 3 VFDEILQPTVVNKCLHGNFTSAEREEYVVARTNVLSVFRVSRAQK--------------- 47
+F +++ TV+ + G FTS VV++ N L +F + + QK
Sbjct: 4 IFQDLVDSTVIKNAVQGQFTSLVSNNLVVSKVNSLHLFEIEKIQKDESSFPLDDSLQNEF 63
Query: 48 ----------------------------LVLAYEWKLAGKIIDMQLLPQIGSP-LKMLAI 78
L L + K+ G I ++ L GS +L +
Sbjct: 64 STSIIDESQAFMETNMHLIRTNEQTTYVLRLVSQVKVFGTITEISALKGKGSNGCDLLIM 123
Query: 79 LSSKSKVSLVRFDPVAESLETLSLHYYHD-KFVNLSTSSLKTESIMAVDPLFRC-LLVFN 136
L+ +KVS + +D ++S T SLHYY D K N+ +S T+ + VDP C LL F
Sbjct: 124 LTDYAKVSTLEWDMQSQSFVTNSLHYYEDVKSSNICSSHTPTQ--LLVDPDSDCCLLRFL 181
Query: 137 EDVLAILPLKLNTEDMEIDE---DENGIKEPMAKRLKRNQGITSDSIIMPISSLHKSLKH 193
D++AI+P N ED++++E + + I A + S ++ S L S+
Sbjct: 182 TDMMAIIPYPAN-EDLDMEEAAIENSKISSSYAYK---------PSFVLASSQLDASISR 231
Query: 194 VYDIKWLNNFSKPTVGILYQPVLAWCGNEKVLGNTMRYMVLSLDVEDEKTTVIAELADLP 253
+ D+K+L + +PT+ ILY P + +T+ + +++LD+E + VI + LP
Sbjct: 232 ILDVKFLYGYREPTLAILYSPEQTSTVTLPLRKDTVLFSLVTLDLEQRASAVITTIQSLP 291
Query: 254 NDLHTLVPLKR---GYVLIGVNELLYISASGALQSCIRLNTFATSSINTRITDNSDMNIF 310
D++ V + G +L+G NEL+Y+ ++G I +N++ + + + D SD N+
Sbjct: 292 YDIYASVSIPTPLGGSLLLGGNELIYVDSAGRTVG-IGVNSYYSKCTDFPLQDQSDFNLE 350
Query: 311 LSKS-SIYFYKALKRHDLLILIDENCRMYNIITESEGNLLTKFDCVQVPI-VNEIFKNSR 368
L + +I + ++L+ + + + + +G + + + +N+ F S
Sbjct: 351 LEGTIAIPLTSSKTETPFVVLVHTSGQFFYLDFLLDGKSVKGLSLQALDLEINDDFLKSG 410
Query: 369 LPLSVCGDLNLETGRVLIGFLSGDAMFL----QLKNLKVAF-AAKRQLVETVDDDDDEYS 423
+ +V NL V +G + D+ L + N +V L T D + D+
Sbjct: 411 ITCAVPAGENL----VFLGSQTTDSYLLRWSRRTTNEEVRLDEGDDTLYGTNDAEMDDML 466
Query: 424 ALYGESQN-NTHTRIVETQEPFDISLLDSIFNIGPLTSLTIGKVASVEPTIQRLPNPNKD 482
+Y ++ + +I P + + D + NIGP+T +GK S P N
Sbjct: 467 DIYETDESVGSKRKIAYENGPLRLEICDVLTNIGPITDFAVGKAGS----YSYFPQDNHG 522
Query: 483 EFSIVATSGVGRGSHLTALHSTVQPHIEQALKFTSATRIWNLKIKGK------------- 529
+V T+G L + P I +F +W + I GK
Sbjct: 523 PLELVGTAGADGAGGLVVFRRNIFPLIAGEFQFDGCEALWTVSISGKLRNMKSRIQAQYS 582
Query: 530 ----DKYLVTTDADKEKSDVYQIDRNFEPFRAQDFRKDSRTIGMETMDDDKRILQVTSGG 585
+ YLV + +++S ++ F+ + DF KDS+T+ + ++ R++Q+
Sbjct: 583 NPELETYLVL--SKEKESFIFLAGETFDEVQHSDFSKDSKTLNVGSLLSGMRMVQICPTS 640
Query: 586 LYLFDVDFK--RLARLTIDIEIVHACIIDPYILFTDARGNIKIYQLD 630
L ++D + + +L + +V I DP I+ G I +Y++D
Sbjct: 641 LRVYDSNLRLTQLFNFSKKQIVVSTSICDPCIIVVFLGGGIALYKMD 687
>7303176 [A] KOG1896 mRNA cleavage and polyadenylation factor II complex
subunit CFT1 (CPSF subunit)
Length = 1455
Score = 163 bits (413), Expect = 1e-39
Identities = 126/512 (24%), Positives = 239/512 (46%), Gaps = 29/512 (5%)
Query: 799 FPNYNGYSVVFITGQVPYIIIKEDNSVCRIFRMT-NIPIVTMARWGK----NSVMCVDNI 853
F N G S V + G P + RI R+ N + + A + N + D
Sbjct: 947 FANVGGLSGVMVCGVNPCFVFLTFRGELRIHRLLGNGDVRSFAAFNNVNIPNGFLYFDTT 1006
Query: 854 KNARVMKLDPECYYGNTQILRKIIIEDVVEEFETLGNIAYHERTGMY-IISYTKFIEYQA 912
++ L Y + +RK+ + + + YH +Y +I+ T+ +
Sbjct: 1007 YELKISVLPSYLSYDSVWPVRKVPLRCTPRQ------LVYHRENRVYCLITQTEEPMTKY 1060
Query: 913 LSEDGEPLVGYDPSKPNSTGYKSG----LLLINPLTWNIIDRLDLSENSMVNDIKTMLIQ 968
+GE + S+ Y G ++LI+P TW I+ ++ + +++
Sbjct: 1061 YRFNGEDKELSEESRGERFIYPIGSQFEMVLISPETWEIVPDASITFEPWEHVTAFKIVK 1120
Query: 969 LNSKTRRK--RELVIIGSSFVKEEDQPSTGCLLVLDITEVVAEPGKPDSNFKFKQLFEEE 1026
L+ + R +E + IG++F ED S G + + DI EVV EPGKP + FK K++F++E
Sbjct: 1121 LSYEGTRSGLKEYLCIGTNFNYSEDITSRGNIHIYDIIEVVPEPGKPMTKFKIKEIFKKE 1180
Query: 1027 IRGSVNAVCEISGRFMIGQSSKALVRDMQEDNSAVPVAFLDMPVFITDAKSFSNLMIIGD 1086
+G V+A+ ++ G + G K + ++ D + VAF+D +++ + +L+ I D
Sbjct: 1181 QKGPVSAISDVLGFLVTGLGQKIYIWQLR-DGDLIGVAFIDTNIYVHQIITVKSLIFIAD 1239
Query: 1087 SMQGFTFVGFDAEPYRMIVLGKSTSKFQVMNLEFLVNNGNINFIVTDRQNHLHVLRYAPD 1146
+ + + F E + + + + +V +EF+V+N N+ F+VTD + ++ V Y P+
Sbjct: 1240 VYKSISLLRFQEEYRTLSLASRDFNPLEVYGIEFMVDNSNLGFLVTDAERNIIVYMYQPE 1299
Query: 1147 EANSLSGQRLVHCNSFN-------MFTTNNYMKLVRKHVEFGSKTSNYIALGCQTDGSIF 1199
SL GQ+L+ ++ MF + K + + F + +++ G DG++
Sbjct: 1300 ARESLGGQKLLRKADYHLGQVVNTMFRVQCHQKGLHQRQPFLYENKHFVVYG-TLDGALG 1358
Query: 1200 RMIPLNEASYRRFYLVQQQLLDHEIPLAGFNTKMERLDNEYYHKG-HSLRPTLDSQVLKK 1258
+PL E YRRF ++Q LL ++ L G N K R +G + R +D ++
Sbjct: 1359 YCLPLPEKVYRRFLMLQNVLLSYQEHLCGLNPKEYRTLKSSKKQGINPSRCIIDGDLIWS 1418
Query: 1259 YIHLPITKRTTIENRVGRHASTELWHDLIDIE 1290
Y + ++R + ++G + E+ DL++IE
Sbjct: 1419 YRLMANSERNEVAKKIGTR-TEEILGDLLEIE 1449
Score = 113 bits (283), Expect = 2e-24
Identities = 149/662 (22%), Positives = 281/662 (41%), Gaps = 80/662 (12%)
Query: 27 EEYVVARTNVLSVFRV------SRAQKLVLAYEWKLA--------------GKIIDMQLL 66
E VVA NVL V+R+ S+ QKL + E +LA G ++ +Q +
Sbjct: 28 ENLVVAGANVLKVYRIAPNVEASQRQKLNPS-EMRLAPKMRLECLATYTLYGNVMSLQCV 86
Query: 67 PQIGSPLKMLAILSSKSKVSLVRFDPVAESLETLSLHYYHDKFVNLSTSSLKTESIMAVD 126
G+ L I +K+S+++ DP +L+TLSLHY+ + + + + VD
Sbjct: 87 SLAGAMRDALLISFKDAKLSVLQHDPDTFALKTLSLHYFEEDDIRGGWTGRYFVPTVRVD 146
Query: 127 PLFRC--LLVFNEDVLAILPLKLNTEDMEIDEDENGIKEPMAKRLKRNQGITS--DSIIM 182
P RC +LV+ + L +LP + +D +DE E +P+ K T S ++
Sbjct: 147 PDSRCAVMLVYGKR-LVVLPFR---KDNSLDEIELADVKPIKKAPTAMVSRTPIMASYLI 202
Query: 183 PISSLHKSLKHVYDIKWLNNFSKPTVGILYQPVLAWCGNEKVLGNTMRYMVLSLDVEDEK 242
+ L + + +V DI++L+ + +PT+ ILY+PV G KV +T + +SL+++
Sbjct: 203 ALRDLDEKIDNVLDIQFLHGYYEPTLLILYEPVRTCPGRIKVRSDTCVLVAISLNIQQRV 262
Query: 243 TTVIAELADLPNDLHTLVPLKR---GYVLIGVNELLYISASGALQSCIRLNTFATSSINT 299
+I + LP D + P+++ G +++ VN ++Y++ S + LN+ A +S
Sbjct: 263 HPIIWTVNSLPFDCLQVYPIQKPIGGCLVMTVNAVIYLNQSVPPYG-VSLNSSADNSTAF 321
Query: 300 RITDNSDMNIFLSKSSIYFYK------ALKRHDLLIL---IDENCRMYNI-ITESEGNLL 349
+ + I L ++ F +L+ DL +L +D + N ++ ++L
Sbjct: 322 PLKPQDGVRISLDCANFAFIDVDKLVISLRTGDLYVLTLCVDSMRTVRNFHFHKAAASVL 381
Query: 350 TKFDCVQVPIVNEIFKNSRLPLSVCGDLNLETGRVLIGFLSGDAMFLQLKNLKVAFAAKR 409
T C+ V IF SRL S+ E +I D + Q + + +
Sbjct: 382 T--SCICVLHSEYIFLGSRLGNSLLLHFTEEDQSTVITL---DEVEQQSEQQQRNLQDED 436
Query: 410 QLVETVDDDD---------------DEYSALYGESQNNTHTRIVETQEPFDISLLDSIFN 454
Q +E + D D DE +YG + ++ F + DS+ N
Sbjct: 437 QNLEEIFDVDQLEMAPTQAKSRRIEDEELEVYGSGAKASVLQL----RKFIFEVCDSLMN 492
Query: 455 IGPLTSLTIGKVASVEP---TIQRLPNPNKD-EFSIVATSGVGRGSHLTALHSTVQPHIE 510
+ P+ + G+ E T++ +D + +VA +G + L+ + + P I
Sbjct: 493 VAPINYMCAGERVEFEEDGVTLRPHAESLQDLKIELVAATGHSKNGALSVFVNCINPQII 552
Query: 511 QALKFTSATRIWNL------KIKGKDKYLVTTDADKEKSDVYQIDRNFEPFRAQDFRKDS 564
+ + +W + K D++ + + + V Q + F +
Sbjct: 553 TSFELDGCLDVWTVFDDATKKSSRNDQHDFMLLSQRNSTLVLQTGQEINEIENTGFTVNQ 612
Query: 565 RTIGMETMDDDKRILQVTSGGLYLFDVDFKRLARLTIDI--EIVHACIIDPYILFTDARG 622
TI + + + I+QVT+ + L + + + ID+ +V I DPY+ G
Sbjct: 613 PTIFVGNLGQQRFIVQVTTRHVRLLQ-GTRLIQNVPIDVGSPVVQVSIADPYVCLRVLNG 671
Query: 623 NI 624
+
Sbjct: 672 QV 673
>Hs9558725 [A] KOG1896 mRNA cleavage and polyadenylation factor II complex
subunit CFT1 (CPSF subunit)
Length = 1442
Score = 154 bits (390), Expect = 7e-37
Identities = 139/589 (23%), Positives = 258/589 (43%), Gaps = 50/589 (8%)
Query: 727 PSIKQIMLNRLGHHKKEEFLTI-----------------LTFGGEIYQYKKSTKHSGKLL 769
P +K+++L LG + +L + L G ++KK H+
Sbjct: 844 PLVKEVLLVALGSRQSRPYLLVHVDQELLIYEAFPHDSQLGQGNLKVRFKK-VPHNINFR 902
Query: 770 KCKSHPLITGAPNNAYPQGVNKIERVAH--YFPNYNGYSVVFITGQVPYIIIKEDNSVCR 827
+ K P A +G RVA YF + GYS VFI G P+ ++ R
Sbjct: 903 EKKPKPSKKKAEGGGAEEGAGARGRVARFRYFEDIYGYSGVFICGPSPHWLLVTGRGALR 962
Query: 828 IFRMT-NIPIVTMARWGKNSVMC------VDNIKNARVMKLDPECYYGNTQILRKIIIED 880
+ M + P+ + A + ++V C + R+ L Y +RKI +
Sbjct: 963 LHPMAIDGPVDSFAPF--HNVNCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRC 1020
Query: 881 VVEEFETLGNIAYHERTGMYIISYT-----KFIEYQALSEDGEPLVGYDPSKPNSTGYKS 935
T +AYH + +Y ++ + I E + D +
Sbjct: 1021 ------TAHYVAYHVESKVYAVATSTNTPCARIPRMTGEEKEFETIERDERYIHPQQEAF 1074
Query: 936 GLLLINPLTWNIID--RLDLSENSMVNDIKTMLIQLNSKTRRKRELVIIGSSFVKEEDQP 993
+ LI+P++W I R++L E V +KT+ ++ + V G+ ++ E+
Sbjct: 1075 SIQLISPVSWEAIPNARIELQEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVT 1134
Query: 994 STGCLLVLDITEVVAEPGKPDSNFKFKQLFEEEIRGSVNAVCEISGRFMIGQSSKALVRD 1053
G +L++D+ EVV EPG+P + KFK L+E+E +G V A+C +G + K +
Sbjct: 1135 CRGRILIMDVIEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKIFLWS 1194
Query: 1054 MQEDNSAVPVAFLDMPVFITDAKSFSNLMIIGDSMQGFTFVGFDAEPYRMIVLGKSTSKF 1113
++ + +AF+D ++I S N ++ D M+ + + + E + ++ +
Sbjct: 1195 LRA-SELTGMAFIDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPL 1253
Query: 1114 QVMNLEFLVNNGNINFIVTDRQNHLHVLRYAPDEANSLSGQRLVHCNSFNMFT-TNNYMK 1172
+V +++F+V+N + F+V+DR +L V Y P+ S G RL+ F++ N + +
Sbjct: 1254 EVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTFWR 1313
Query: 1173 LVRKHVEFGSKTS-----NYIALGCQTDGSIFRMIPLNEASYRRFYLVQQQLLDHEIPLA 1227
+ E SK S +I DG I ++P+ E +YRR ++Q L A
Sbjct: 1314 TPCRATEGLSKKSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHA 1373
Query: 1228 GFNTKMER-LDNEYYHKGHSLRPTLDSQVLKKYIHLPITKRTTIENRVG 1275
G N + R L + +++R LD ++L +Y++L +R+ + ++G
Sbjct: 1374 GLNPRAFRMLHVDRRTLQNAVRNVLDGELLNRYLYLSTMERSELAKKIG 1422
Score = 131 bits (330), Expect = 6e-30
Identities = 152/710 (21%), Positives = 290/710 (40%), Gaps = 122/710 (17%)
Query: 3 VFDEILQPTVVNKCLHGNFTSAEREEYVVARTNVLSVFRVSR-----------------A 45
V+ + P + ++ NF + VVA T+ L V+R++R
Sbjct: 4 VYKQAHPPPGLEFSMYCNFFNNSERNLVVAGTSQLYVYRLNRDAEALTKNDRSTEGKAHR 63
Query: 46 QKLVLAYEWKLAGKIIDMQLLPQIGSPLKMLAILSSKSKVSLVRFDPVAESLETLSLHYY 105
+KL LA + G ++ M + G+ L + +K+S+V +DP L+TLSLHY+
Sbjct: 64 EKLELAASFSFFGNVMSMASVQLAGAKRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYF 123
Query: 106 H-----DKFV-NLSTSSLKTESIMAVDPLFRC--LLVFNEDVLAILPLKLNTEDMEIDED 157
D FV N+ T ++ VDP RC +LV+ L +LP + + + E+
Sbjct: 124 EEPELRDGFVQNVHTPRVR------VDPDGRCAAMLVYGTR-LVVLPFRRES----LAEE 172
Query: 158 ENGIKEPMAKRLKRNQGITSDSIIMPISSLHKSLKHVYDIKWLNNFSKPTVGILYQPVLA 217
G+ + + S I+ + +L + L ++ D+++L+ + +PT+ IL++P
Sbjct: 173 HEGLVG------EGQRSSFLPSYIIDVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQT 226
Query: 218 WCGNEKVLGNTMRYMVLSLDVEDEKTTVIAELADLPNDLHTLVPLKR---GYVLIGVNEL 274
W G V +T + +SL++ + VI L LP D + + + G V+ VN L
Sbjct: 227 WPGRVAVRQDTCSIVAISLNITQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSL 286
Query: 275 LYISASGALQSCIRLNTFATSSINTRITDNSDMNIFLSKSSIYFYKALKRHDLLILIDEN 334
LY++ S + LN+ T + + + I L + F +D +++ +
Sbjct: 287 LYLNQSVPPYG-VALNSLTTGTTAFPLRTQEGVRITLDCAQATFIS----YDKMVISLKG 341
Query: 335 CRMYNIITESEG-NLLTKFDCVQVPIVNEIFKNSRLPLSVCGDLNLETGRVLIGFLSGDA 393
+Y + ++G + F F + + + +E G + +G G++
Sbjct: 342 GEIYVLTLITDGMRSVRAFH----------FDKAAASVLTTSMVTMEPGYLFLGSRLGNS 391
Query: 394 MFLQLKNL--------------KVAFAAKRQLVETV-----------DDDDDEYSALYGE 428
+ L+ K +K++ V+ D+ DE E
Sbjct: 392 LLLKYTEKLQEPPASAVREAADKEEPPSKKKRVDATAGWSAAGKSVPQDEVDEIEVYGSE 451
Query: 429 SQNNTHTRIVETQEPFDISLLDSIFNIGPLTSLTIGKVASVEPTIQRLPNPNKDEFSIVA 488
+Q+ T + + DSI NIGP + +G+ A + Q P P + IV
Sbjct: 452 AQSGTQL------ATYSFEVCDSILNIGPCANAAVGEPAFLSEEFQNSPEP---DLEIVV 502
Query: 489 TSGVGRGSHLTALHSTVQPHIEQALKFTSATRIW------------NLKIKGKDKYLVTT 536
SG G+ L+ L +++P + + +W N K +G ++ TT
Sbjct: 503 CSGHGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPVRKEEEDNPKGEGTEQEPSTT 562
Query: 537 -DAD-------------KEKSDVYQIDRNFEPFRAQDFRKDSRTIGMETMDDDKRILQVT 582
+AD ++ + + Q + F T+ + D++ I+QV+
Sbjct: 563 PEADDDGRRHGFLILSREDSTMILQTGQEIMELDTSGFATQGPTVFAGNIGDNRYIVQVS 622
Query: 583 SGGLYLFD-VDFKRLARLTIDIEIVHACIIDPYILFTDARGNIKIYQLDS 631
G+ L + V+ + + IV + DPY++ A G++ ++ L S
Sbjct: 623 PLGIRLLEGVNQLHFIPVDLGAPIVQCAVADPYVVIMSAEGHVTMFLLKS 672
>Hs22047447 [A] KOG1896 mRNA cleavage and polyadenylation factor II complex
subunit CFT1 (CPSF subunit)
Length = 1443
Score = 148 bits (373), Expect = 6e-35
Identities = 138/601 (22%), Positives = 261/601 (42%), Gaps = 73/601 (12%)
Query: 727 PSIKQIMLNRLGHHKKEEFLTI-----------------LTFGGEIYQYKKSTKHSGKLL 769
P +K+++L LG + +L + L G ++KK H+
Sbjct: 844 PLVKEVLLVALGSRQSRPYLLVHVDQELLIYEAFPHDSQLGQGNLKVRFKK-VPHNINFR 902
Query: 770 KCKSHPLITGAPNNAYPQGVNKIERVAH--YFPNYNGYSVVFITGQVPYIIIKEDNSVCR 827
+ K P A +G VA YF + GYS VFI G P+ ++ R
Sbjct: 903 EKKPKPSKKKAEGGGAEEGAGARGLVARFRYFEDIYGYSGVFICGPSPHWLLVTGRGALR 962
Query: 828 IFRMT-NIPIVTMARWGKNSVMC------VDNIKNARVMKLDPECYYGNTQILRKIIIED 880
+ M + P+ + A + ++V C + R+ L Y +RKI
Sbjct: 963 LHPMAIDGPVDSFAPF--HNVNCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKI---- 1016
Query: 881 VVEEFETLGNIAYHERTGMYIISYTKFI-------------EYQALSEDGEPLVGYDPSK 927
+ T +AYH +Y ++ + E++ + D +
Sbjct: 1017 --PQRCTAHYVAYHVEFKVYAVATSTNTPCARIPRMTGEEKEFETIERDERYI------H 1068
Query: 928 PNSTGYKSGLLLINPLTWNIID--RLDLSENSMVNDIKTMLIQLNSKTRRKRELVIIGSS 985
P + + LI+P++W I R++L E V +KT+ ++ + V G+
Sbjct: 1069 PQQEAFS--IQLISPVSWEAIPNARIELQEWEHVTCMKTVSLRSEETVSGLKGYVAAGTC 1126
Query: 986 FVKEEDQPSTGCLLVLDITEVVAEPGKPDSNFKFKQLFEEEIRGSVNAVCEISGRFMIGQ 1045
++ E+ G +L++D+ EVV EPG+P + KFK L+E+E +G V A+C +G +
Sbjct: 1127 LMQGEEVTCRGRILIMDVIEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAI 1186
Query: 1046 SSKALVRDMQEDNSAVPVAFLDMPVFITDAKSFSNLMIIGDSMQGFTFVGFDAEPYRMIV 1105
K + ++ + +AF+D ++I S N ++ D M+ + + + E + +
Sbjct: 1187 GQKIFLWSLRA-SELTGMAFIDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSL 1245
Query: 1106 LGKSTSKFQVMNLEFLVNNGNINFIVTDRQNHLHVLRYAPDEANSLSGQRLVHCNSFNMF 1165
+ + +V +++F+V+N + F+V+DR +L V Y P+ S G RL+ F++
Sbjct: 1246 VSRDAKPLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVG 1305
Query: 1166 T-TNNYMK---------LVRKHVEFGSKTSNYIALGCQTDGSIFRMIPLNEASYRRFYLV 1215
N + + L +K V + +K + A DG I ++P+ E +YRR ++
Sbjct: 1306 AHVNTFWRTPCRGATEGLSKKSVVWENKHITWFA---TLDGGIGLLLPMQEKTYRRLLML 1362
Query: 1216 QQQLLDHEIPLAGFNTKMER-LDNEYYHKGHSLRPTLDSQVLKKYIHLPITKRTTIENRV 1274
Q L AG N + R L + +++R LD ++L +Y++L +R+ + ++
Sbjct: 1363 QNALTTMLPHHAGLNPRAFRMLHVDRRTLQNAVRNVLDGELLNRYLYLSTMERSEVAKKI 1422
Query: 1275 G 1275
G
Sbjct: 1423 G 1423
Score = 134 bits (336), Expect = 1e-30
Identities = 153/710 (21%), Positives = 291/710 (40%), Gaps = 122/710 (17%)
Query: 3 VFDEILQPTVVNKCLHGNFTSAEREEYVVARTNVLSVFRVSR-----------------A 45
V+ + PT + ++ NF + VVA T+ L V+R++R
Sbjct: 4 VYKQAHPPTGLEFSMYCNFFNNSERNLVVAGTSQLYVYRLNRDAEALTKNDRSTEGKAHR 63
Query: 46 QKLVLAYEWKLAGKIIDMQLLPQIGSPLKMLAILSSKSKVSLVRFDPVAESLETLSLHYY 105
+KL LA + G ++ M + G+ L + +K+S+V +DP L+TLSLHY+
Sbjct: 64 EKLELAASFSFFGNVMSMASVQLAGAKRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYF 123
Query: 106 H-----DKFV-NLSTSSLKTESIMAVDPLFRC--LLVFNEDVLAILPLKLNTEDMEIDED 157
D FV N+ T ++ VDP RC +LV+ L +LP + + + E+
Sbjct: 124 EEPELRDGFVQNVHTPRVR------VDPDGRCAAMLVYGTR-LVVLPFRRES----LAEE 172
Query: 158 ENGIKEPMAKRLKRNQGITSDSIIMPISSLHKSLKHVYDIKWLNNFSKPTVGILYQPVLA 217
G+ + + S I+ + +L + L ++ D+++L+ + +PT+ IL++P
Sbjct: 173 HEGLVG------EGQRSSFLPSYIIDVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQT 226
Query: 218 WCGNEKVLGNTMRYMVLSLDVEDEKTTVIAELADLPNDLHTLVPLKR---GYVLIGVNEL 274
W G V +T + +SL++ + VI L LP D + + + G V+ VN L
Sbjct: 227 WPGRVAVRQDTCSIVAISLNITQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSL 286
Query: 275 LYISASGALQSCIRLNTFATSSINTRITDNSDMNIFLSKSSIYFYKALKRHDLLILIDEN 334
LY++ S + LN+ T + + + I L + F +D +++ +
Sbjct: 287 LYLNQSVPPYG-VALNSLTTGTTAFPLRTQEGVRITLDCAQATFIS----YDKMVISLKG 341
Query: 335 CRMYNIITESEG-NLLTKFDCVQVPIVNEIFKNSRLPLSVCGDLNLETGRVLIGFLSGDA 393
+Y + ++G + F F + + + +E G + +G G++
Sbjct: 342 GEIYVLTLITDGMRSVRAFH----------FDKAAASVLTTSMVTMEPGYLFLGSRLGNS 391
Query: 394 MFLQLKNL--------------KVAFAAKRQLVETV-----------DDDDDEYSALYGE 428
+ L+ K +K++ V+ D+ DE E
Sbjct: 392 LLLKYTEKLQEPPASAVREAADKEEPPSKKKRVDATAGWSAAGKSVPQDEVDEIEVYGSE 451
Query: 429 SQNNTHTRIVETQEPFDISLLDSIFNIGPLTSLTIGKVASVEPTIQRLPNPNKDEFSIVA 488
+Q+ T + + DSI NIGP + +G+ A + Q P P + IV
Sbjct: 452 AQSGTQL------ATYSFEVCDSILNIGPCANAAVGEPAFLSKEFQNSPEP---DLEIVV 502
Query: 489 TSGVGRGSHLTALHSTVQPHIEQALKFTSATRIW------------NLKIKGKDKYLVTT 536
SG G+ L+ L +++P + + +W N K +G ++ TT
Sbjct: 503 CSGHGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPVRKEEEDNPKGEGTEQEPSTT 562
Query: 537 -DAD-------------KEKSDVYQIDRNFEPFRAQDFRKDSRTIGMETMDDDKRILQVT 582
+AD ++ + + Q + F T+ + D++ I+QV+
Sbjct: 563 PEADDDGRRHGFLILSREDSTMILQTGQEIMELDTSGFATQGPTVFAGNIGDNRYIVQVS 622
Query: 583 SGGLYLFD-VDFKRLARLTIDIEIVHACIIDPYILFTDARGNIKIYQLDS 631
G+ L + V+ + + IV + DPY++ A G++ ++ L S
Sbjct: 623 PLGIRLLEGVNQLHFIPVDLGAPIVQCAVADPYVVIMSAEGHVTMFLLKS 672
>CE25609 [A] KOG1896 mRNA cleavage and polyadenylation factor II complex
subunit CFT1 (CPSF subunit)
Length = 1601
Score = 114 bits (285), Expect = 1e-24
Identities = 140/666 (21%), Positives = 279/666 (41%), Gaps = 101/666 (15%)
Query: 11 TVVNKCLHGNFTSAEREEY--VVARTNVLSVFRVS---------------RAQKLVLAYE 53
T +N +G F E + + + +FRV+ + KL +
Sbjct: 12 TAINFSAYGKFLPGENTGFQLLTIGAKFIRIFRVNPYVLKEPGEDNEEWQQKTKLECMFS 71
Query: 54 WKLAGKI--IDMQLLPQIGSPLKMLAILSSKSKVSLVRFDPVAESLETLSLHYYHDKFVN 111
+L K I + +PQ+ +L +K+S+V + +++T+SLH + ++++
Sbjct: 72 CRLLNKCHSIAVARVPQLPDQDSILMTFDD-AKLSIVSINEKERNMQTISLHAFENEYLR 130
Query: 112 LSTSSLKTESIMAVDPLFRCL--LVFNEDVLAILPLKLNTEDMEIDEDENGIKEPMAKRL 169
+ ++ DP RC LV+ + + AILP N++ +
Sbjct: 131 DGFINHFQPPLVRSDPSNRCAACLVYGKHI-AILPFHENSKRIH---------------- 173
Query: 170 KRNQGITSDSIIMPISSLHKSLKHVYDIKWLNNFSKPTVGILYQPVLAWCGNEKVLGNTM 229
S ++P+ + L ++ D+ +L+ + +PT+ LY+P+ G V +TM
Sbjct: 174 ---------SYVIPLKQIDPRLDNIADMVFLDGYYEPTILFLYEPIQTTPGRACVRYDTM 224
Query: 230 RYMVLSLDVEDEKTTVIAELADLPNDLHTLVPLKR---GYVLIGVNELLYISASGALQSC 286
M +S+++ D + V+ + A+LP D L+P+ + G ++ G N ++Y++ A+ C
Sbjct: 225 CIMGVSVNIVDRQFAVVWQTANLPMDCSQLLPIPKPLGGALVFGSNTVVYLNQ--AVPPC 282
Query: 287 -IRLNTFATSSINTRITDNSDMNIFLS-KSSIYFYKALKRHDLLILI---DENCRMYNII 341
+ LN+ + D + + L +S+Y D I + D + + ++
Sbjct: 283 GLVLNSCYDGFTKFPLKDLKHLKMTLDCSTSVYM------EDGRIAVGSRDGDLFLLRLM 336
Query: 342 TESEGNLLTKFDCVQVPIVNEIFKNS-RLPLSVCGDLNLETGRVLIGFLSGDAMFLQLKN 400
T S G + + +++++ S L+VC G + +G GD+ L+
Sbjct: 337 TSSGGGTVKSLE------FSKVYETSIAYSLTVCA-----PGHLFVGSRLGDSQLLEYTL 385
Query: 401 LKVA--FAAKRQLVETVD----------DDDDEYSALYGESQNNTHTRIVETQEPFDISL 448
LK A KR ++ D DD + Y E QN+ +I E
Sbjct: 386 LKTTRDCAVKRLKIDNKDPAAAEIELDEDDMELYGGAIEEQQNDDDEQI---DESLQFRE 442
Query: 449 LDSIFNIGPLTSLTIGKVASVEPTIQRLPNPNKDE-FSIVATSGVGRGSHLTALHSTVQP 507
LD + N+GP+ S+ +G+ + + + +D F +V SG G+ L +++P
Sbjct: 443 LDRLRNVGPVKSMCVGRPNYMSNDL--VDAKRRDPVFDLVTASGHGKNGALCVHQRSLRP 500
Query: 508 HIEQALKFTSATRIWNLKIKGKD--KYLVTTDADKEKSDVYQIDRNFEPFRAQDFRKDSR 565
I + A ++W + K + KYL+ + + + ++ Q F
Sbjct: 501 EIITSSLLEGAEQLWAVGRKENESHKYLIVSRV--RSTLILELGEELVELEEQLFVTGEP 558
Query: 566 TIGMETMDDDKRILQVTSGGLYLFDVDFKRLARLTID--IEIVHACIIDPYILFTDARGN 623
T+ + +QVTS + L D +++ + ID ++ A I+DPY+ G
Sbjct: 559 TVAAGELSQGALAVQVTSTCIALV-TDGQQMQEVHIDSNFPVIQASIVDPYVALLTQNGR 617
Query: 624 IKIYQL 629
+ +Y+L
Sbjct: 618 LLLYEL 623
Score = 90.5 bits (223), Expect = 2e-17
Identities = 63/255 (24%), Positives = 118/255 (45%), Gaps = 7/255 (2%)
Query: 934 KSGLLLINPLTWNIIDRLDLSENSM--VNDIKTMLIQLNSKTRRKRELVIIGSSFVKEED 991
K L L + W + ++S M V + + ++ S L+ +G+ E+
Sbjct: 1079 KYTLNLFSSQDWAAVPNTEISFEDMEAVTACEDVALKSESTISGLETLLAMGTVNNYGEE 1138
Query: 992 QPSTGCLLVLDITEVVAEPGKPDSNFKFKQLFEEEIRGSVNAVCEISGRFMIGQSSKALV 1051
G +++ ++ EVV EP +P SN K K LF++E +G V +C I+G + G K +
Sbjct: 1139 VLVRGRIILCEVIEVVPEPDQPTSNRKIKVLFDKEQKGPVTGLCAINGLLLCGMGQKVFI 1198
Query: 1052 RDMQEDNSAVPVAFLDMPVFITDAKSFSNLMIIGDSMQGFTFVGFDAEPYRMIVLGKSTS 1111
+DN + ++FLDM ++ S + I D+ + + + F + M + +
Sbjct: 1199 WQF-KDNDLMGISFLDMHYYVYQLHSLRTIAIACDARESMSLIRFQEDNKAMSIASRDDR 1257
Query: 1112 KF--QVMNLEFLVNNGNINFIVTDRQNHLHVLRYAPDEANSLSGQRLVHCNSFNMFTTNN 1169
K M + +V+ ++ F+++D ++ + YAP+ S G+RL + N+ T N
Sbjct: 1258 KCAQPPMASQLVVDGAHVGFLLSDETGNITMFNYAPEAPESNGGERLTVRAAINIGT--N 1315
Query: 1170 YMKLVRKHVEFGSKT 1184
VR F SK+
Sbjct: 1316 INAFVRLRGNFCSKS 1330
>ECU11g0610 [A] KOG1896 mRNA cleavage and polyadenylation factor II complex
subunit CFT1 (CPSF subunit)
Length = 1156
Score = 100 bits (248), Expect = 2e-20
Identities = 68/325 (20%), Positives = 147/325 (44%), Gaps = 9/325 (2%)
Query: 948 IDRLDLSENSMVNDIKTMLIQLNSKTRRKRELVIIGSSFVKEEDQPSTGCLLVLDITEVV 1007
ID +L EN V IK +++ K +++ ++F++ ED+P+ G L VL+I VV
Sbjct: 822 IDTYELDENEYVFHIKYLILDDMQGNYGKSPFLLVCTTFIEGEDRPARGRLHVLEIISVV 881
Query: 1008 AEPGKPDSNFKFKQLFEEEIRGSVNAVCEISGRFMIGQSSKALVRDMQEDNSAVPVAFLD 1067
P + K K L E+ +GS+ E+ G+ + +K ++ + + +P+ F D
Sbjct: 882 PSLESPFKDCKLKVLGIEKTKGSIVRCEEVRGKIALCLGTKIMIYKIDRSSGIIPIGFYD 941
Query: 1068 MPVFITDAKSFSNLMIIGDSMQGFTFVGFDAEPYRMIVLGKSTSKFQVMNLEFLVNNGNI 1127
+ +F + N ++ D +G +F F ++P R+ ++ S + E L +
Sbjct: 942 LHIFTSSISVVKNYILASDIYRGLSFFFFQSKPIRLHLISSSEPLRNATSTELLSTGNEL 1001
Query: 1128 NFIVTDRQNHLHVLRYAPDEANSLSGQRLVHCNSFNMFTTNNYMKLVRKHVEFGSKTSNY 1187
+ + D + +H Y+P+ S+ G RLV TN + + FG+
Sbjct: 1002 SMLCCDAKGTIHGYTYSPNNIISMDGARLVKRAEIK---TN-----LGRLSSFGAGFKKN 1053
Query: 1188 IALGCQTDGSIFRMIPLNEASYRRFYLVQQQLLDHEIPLAGFNTKMERLDNEYYHKGHSL 1247
+ + + +++A Y + VQ ++ H + G N + + L+++ + SL
Sbjct: 1054 SIMFYSRSNMLIHVSGIDDAHYLKLLGVQTAIMAHLKSVFGLNQR-DYLNSDIHLHSLSL 1112
Query: 1248 RPTLDSQVLKKYIHLPITKRTTIEN 1272
+ + +L + + ++ + ++ +
Sbjct: 1113 KSPIVLHILNLFSYFDMSTQESVSS 1137
>At5g51660 [A] KOG1896 mRNA cleavage and polyadenylation factor II complex
subunit CFT1 (CPSF subunit)
Length = 1448
Score = 99.4 bits (246), Expect = 3e-20
Identities = 81/348 (23%), Positives = 165/348 (47%), Gaps = 26/348 (7%)
Query: 951 LDLSENSMVNDIKTMLIQLNSKTRRKRELVIIGSSFVKEEDQPSTGCLLVLDITEVVAEP 1010
+ SE+++ + T+L N+ T L+ +G+++V+ ED + G +L+
Sbjct: 1111 MQTSEHALTVRVVTLL---NASTGENETLLAVGTAYVQGEDVAARGRVLLFSF------- 1160
Query: 1011 GKPDSNFK--FKQLFEEEIRGSVNAVCEISGRFMIGQSSKALVRDMQEDNSAVPVAFLDM 1068
GK N + +++ E++G+++AV I G +I K ++ VAF D
Sbjct: 1161 GKNGDNSQNVVTEVYSRELKGAISAVASIQGHLLISSGPKIILHKWN-GTELNGVAFFDA 1219
Query: 1069 P-VFITDAKSFSNLMIIGDSMQGFTFVGFDAEPYRMIVLGKSTSKFQVMNLEFLVNNGNI 1127
P +++ + +++GD + F+ + + ++ +L K EFL++ +
Sbjct: 1220 PPLYVVSMNVVKSFILLGDVHKSIYFLSWKEQGSQLSLLAKDFESLDCFATEFLIDGSTL 1279
Query: 1128 NFIVTDRQNHLHVLRYAPDEANSLSGQRLVHCNSFNMFT-TNNYMKLVRKHVEFGSKTSN 1186
+ V+D Q ++ V YAP S G +L+ F++ + +++L + V G+ N
Sbjct: 1280 SLAVSDEQKNIQVFYYAPKMIESWKGLKLLSRAEFHVGAHVSKFLRL--QMVSSGADKIN 1337
Query: 1187 YIALGCQT-DGSIFRMIPLNEASYRRFYLVQQQLLDHEIPLAGFNTKMERLDNEYYHKGH 1245
AL T DGS + PL+E ++RR +Q++L+D +AG N R ++ G
Sbjct: 1338 RFALLFGTLDGSFGCIAPLDEVTFRRLQSLQKKLVDAVPHVAGLNPLAFR---QFRSSGK 1394
Query: 1246 SLR----PTLDSQVLKKYIHLPITKRTTIENRVGRHASTELWHDLIDI 1289
+ R +D ++L Y LP+ ++ + +++G + DL+D+
Sbjct: 1395 ARRSGPDSIVDCELLCHYEMLPLEEQLELAHQIGT-TRYSILKDLVDL 1441
Score = 80.1 bits (196), Expect = 2e-14
Identities = 123/640 (19%), Positives = 255/640 (39%), Gaps = 87/640 (13%)
Query: 7 ILQPTVVNKCLHGNFTSAEREEYVVARTNVLS-VFRVSRAQKLVLAYEWKLAGKIIDMQL 65
IL+ +V GN T R + R V+ V+ VS L L ++L G + + +
Sbjct: 67 ILEVYIVRAQEEGN-TQELRNPKLAKRGGVMDGVYGVS----LELVCHYRLHGNVESIAV 121
Query: 66 LPQIGSPLKM----LAILSSKSKVSLVRFDPVAESLETLSLHYYHDK---FVNLSTSSLK 118
LP G + + +K+S++ FD SL S+H + + S
Sbjct: 122 LPMGGGNSSKGRDSIILTFRDAKISVLEFDDSIHSLRMTSMHCFEGPDWLHLKRGRESFP 181
Query: 119 TESIMAVDPLFRC--LLVFNEDVLAILPLKLNTEDMEIDEDENGIKEPMAKRLKRNQGIT 176
++ VDP RC +LV+ ++ + ++ + + D+D ++ R++
Sbjct: 182 RGPLVKVDPQGRCGGVLVYGLQMIILKTSQVGS-GLVGDDDAFSSGGTVSARVE------ 234
Query: 177 SDSIIMPISSLHKSLKHVYDIKWLNNFSKPTVGILYQPVLAWCGNEKVLGNTMRYMVLSL 236
S I+ + L +KHV D +L+ + +P + IL + W G +T LS+
Sbjct: 235 -SSYIINLRDLE--MKHVKDFVFLHGYIEPVIVILQEEEHTWAGRVSWKHHTCVLSALSI 291
Query: 237 DVEDEKTTVIAELADLPNDLHTLVPLKR---GYVLIGVNELLYISASGALQSCIRLNTFA 293
+ ++ VI +LP+D + L+ + G +++ N + Y S S + + LN +A
Sbjct: 292 NSTLKQHPVIWSAINLPHDAYKLLAVPSPIGGVLVLCANTIHYHSQSAS--CALALNNYA 349
Query: 294 TSSINTRITDNSDMNIFLSKSSIYFYKALKRHDLLILIDENCRMYNIITESEGNLLTKFD 353
+S+ +++ S+ ++ L + + +D+ +L ++ + + +G + + D
Sbjct: 350 SSADSSQELPASNFSVELDAA----HGTWISNDVALLSTKSGELLLLTLIYDGRAVQRLD 405
Query: 354 CVQVPIVNEIFKNSRLPLSVCGDLNLETGRVLIGFLSGDAMFLQLKNLKVAFAAKRQLVE 413
S+ + ++ +G GD++ +Q + + AA +
Sbjct: 406 ----------LSKSKASVLASDITSVGNSLFFLGSRLGDSLLVQF-SCRSGPAASLPGLR 454
Query: 414 TVDDD----------------------DDEYSALYGESQNNTHTRIVETQ--EPFDISLL 449
D+D +E +L+G + NN+ + V + + F ++
Sbjct: 455 DEDEDIEGEGHQAKRLRMTSDTFQDTIGNEELSLFGSTPNNSDSAQVTSSVLKSFSFAVR 514
Query: 450 DSIFNIGPLTSLTIGKVASVEPTIQRLPNPNKDEFSIVATSGVGRGSHLTALHSTVQPHI 509
DS+ N+GP+ G + + + + + +V SG G+ L L +++P +
Sbjct: 515 DSLVNVGPVKDFAYGLRINADANATGV--SKQSNYELVCCSGHGKNGALCVLRQSIRPEM 572
Query: 510 EQALKFTSATRIWNLKIKGKDKYLVTTD---ADKEKSDVYQI-------------DRNFE 553
++ IW + K + + AD+++ Y I D E
Sbjct: 573 ITEVELPGCKGIWTVYHKSSRGHNADSSKMAADEDEYHAYLIISLEARTMVLETADLLTE 632
Query: 554 PFRAQDFRKDSRTIGMETMDDDKRILQVTSGGLYLFDVDF 593
+ D+ RTI + +R++QV G + D F
Sbjct: 633 VTESVDYYVQGRTIAAGNLFGRRRVIQVFEHGARILDGSF 672
Database: KOG eukaryal database 04/03
Posted date: Apr 14, 2003 1:07 PM
Number of letters in database: 30,389,216
Number of sequences in database: 60,738
Lambda K H
0.320 0.137 0.394
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 75,789,634
Number of Sequences: 60738
Number of extensions: 3322900
Number of successful extensions: 8087
Number of sequences better than 1.0e-05: 8
Number of HSP's better than 0.0 without gapping: 6
Number of HSP's successfully gapped in prelim test: 2
Number of HSP's that attempted gapping in prelim test: 8012
Number of HSP's gapped (non-prelim): 25
length of query: 1300
length of database: 30,389,216
effective HSP length: 118
effective length of query: 1182
effective length of database: 23,222,132
effective search space: 27448560024
effective search space used: 27448560024
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)