ORF: r_klactIV3071
STATUS: good
Function: L
Best COG: KOG0442
Functional category: Replication, recombination and repair
Pathways and functional systems: Structure-specific endonuclease ERCC1-XPF, catalytic component XPF/ERCC4
Only best alignment is shown:
BLASTP 2.2.3 [May-13-2002]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= r_klactIV3071 1037870 1041037 1056
(1056 letters)
Database: KOG eukaryal database 04/03
60,738 sequences; 30,389,216 total letters
Searching..................................................done
                                                                        Score  E
Sequences producing significant alignments:                            (bits)  Value
YPL022w    [L] KOG0442 Structure-specific endonuclease ERCC1-XPF ca...   1013  0.0
SPCC970.01 [L] KOG0442 Structure-specific endonuclease ERCC1-XPF...       423  e-118
Hs4885217  [L] KOG0442 Structure-specific endonuclease ERCC1-XPF ...      382  e-105
At5g41150  [L] KOG0442 Structure-specific endonuclease ERCC1-XPF ...      352  2e-96
7290484    [L] KOG0442 Structure-specific endonuclease ERCC1-XPF ca...    332  2e-90
CE24855    [L] KOG0442 Structure-specific endonuclease ERCC1-XPF ca...    189  2e-47
ECU08g0760 [L] KOG0442 Structure-specific endonuclease ERCC1-XPF...       121  7e-27
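
The summary list above can be read programmatically. The following is a minimal Python sketch, not part of the BLAST output; the names HIT_RE, parse_evalue and parse_hits are hypothetical helpers introduced here for illustration. It assumes each hit line has the shape "ID [category] KOGnnnn description score E-value" and that an E-value printed as "e-118" stands for 1e-118.

```python
import re

# Hit lines copied from the summary list of this report.
SUMMARY = """\
YPL022w [L] KOG0442 Structure-specific endonuclease ERCC1-XPF ca... 1013 0.0
SPCC970.01 [L] KOG0442 Structure-specific endonuclease ERCC1-XPF... 423 e-118
Hs4885217 [L] KOG0442 Structure-specific endonuclease ERCC1-XPF ... 382 e-105
At5g41150 [L] KOG0442 Structure-specific endonuclease ERCC1-XPF ... 352 2e-96
7290484 [L] KOG0442 Structure-specific endonuclease ERCC1-XPF ca... 332 2e-90
CE24855 [L] KOG0442 Structure-specific endonuclease ERCC1-XPF ca... 189 2e-47
ECU08g0760 [L] KOG0442 Structure-specific endonuclease ERCC1-XPF... 121 7e-27
"""

# ID, [category letter], KOG number, description, bit score, E-value.
HIT_RE = re.compile(
    r"^(\S+)\s+\[(\w+)\]\s+(KOG\d+)\s+(.*?)\s+(\d+(?:\.\d+)?)\s+(\S+)$"
)


def parse_evalue(text):
    """Convert a BLAST E-value string to float ("e-118" means 1e-118)."""
    return float("1" + text if text.startswith("e") else text)


def parse_hits(block):
    """Return one dict per hit line; lines that do not match are skipped."""
    hits = []
    for line in block.splitlines():
        match = HIT_RE.match(line.strip())
        if match:
            seq_id, category, kog, desc, bits, evalue = match.groups()
            hits.append({
                "id": seq_id,
                "category": category,
                "kog": kog,
                "description": desc,
                "bits": float(bits),
                "evalue": parse_evalue(evalue),
            })
    return hits


if __name__ == "__main__":
    for hit in parse_hits(SUMMARY):
        print(hit["id"], hit["kog"], hit["bits"], hit["evalue"])
```

The truncated descriptions (ending in "...") are kept exactly as printed, since the summary list does not carry the full text.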
>YPL022w [L] KOG0442 Structure-specific endonuclease ERCC1-XPF catalytic
component XPF/ERCC4
Length = 1100
Score = 1013 bits (2620), Expect = 0.0
Identities = 537/1069 (50%), Positives = 734/1069 (68%), Gaps = 43/1069 (4%)
Query: 3 LFLQDDSDEEDLVIELSN-LSKNDETADVIVEDQPKETAESELSENPQSEVLGDTEDKVL 61
LF Q DSD+E L EL+ ++ +++ + ED+P ++ EN S+VL D D VL
Sbjct: 4 LFYQGDSDDE-LQEELTRQTTQASQSSKIKNEDEPDDSNHLNEVENEDSKVLDD--DAVL 60
Query: 62 YPLIPIDANT-ETLKPNIEEIRPVDVHLSLSLPFQHLILENLLVSENALLVIGKGLSVLS 120
YPLIP + + ET KPNI +IRPVD+ L+L LPFQ ++EN L++E+AL+++GKGL +L
Sbjct: 61 YPLIPNEPDDIETSKPNINDIRPVDIQLTLPLPFQQKVVENSLITEDALIIMGKGLGLLD 120
Query: 121 IVSNLLYTLSTPTRIDGTDKRSLVLVLNAXXXXXXXXXXXLMELQWTCN---EDDEEGQP 177
IV+NLL+ L+TPT I+G KR+LVLVLNA L EL W N +DD+
Sbjct: 121 IVANLLHVLATPTSINGQLKRALVLVLNAKPIDNVRIKEALEELSWFSNTGKDDDDTAVE 180
Query: 178 GD-----RPFTVISSDSFTVDQRSKRYQQGGIISVTSRILIVDLLSGIVHPKNITGLLIL 232
D RPF V+++DS ++++R K Y GGI+S+TSRILIVDLLSGIVHP +TG+L+L
Sbjct: 181 SDDELFERPFNVVTADSLSIEKRRKLYISGGILSITSRILIVDLLSGIVHPNRVTGMLVL 240
Query: 233 HAEKLDSMSIESFIVEIYRNSNKWGFIKAITDSAESMISEFAPLAKKMKDLLLKRILLWP 292
+A+ L S ESFI+EIYR+ N WGFIKA +++ E+ + EF+PL KMK+L LK +LLWP
Sbjct: 241 NADSLRHNSNESFILEIYRSKNTWGFIKAFSEAPETFVMEFSPLRTKMKELRLKNVLLWP 300
Query: 293 RFHADISSCLNTTSNT---KVIEIRVSLTDSMSKIQFGLYECLKKCIDELNRKNPELSTE 349
RF ++SSCLN T+ T KVIE++VSLT+SMS+IQFGL ECLKKCI EL+RKNPEL+ +
Sbjct: 301 RFRVEVSSCLNATNKTSHNKVIEVKVSLTNSMSQIQFGLMECLKKCIAELSRKNPELALD 360
Query: 350 YWSFENALDSNFLRIINGVLSPKWHRISYESKQLVKDXXXXXXXXXXXXXYDALDFYELI 409
+W+ EN LD NF+R I+ V+ P WHRISYESKQLVKD DA+DF+ I
Sbjct: 361 WWNMENVLDINFIRSIDSVMVPNWHRISYESKQLVKDIRFLRHLLKMLVTSDAVDFFGEI 420
Query: 410 QLILDANKPSVTRKYSESPWLLADESQLVISFAKKRVIDDGNYNLEPLPKWEQLCALMED 469
QL LDANKPSV+RKYSESPWLL DE+QLVIS+AKKR+ Y LE PKWEQL ++ D
Sbjct: 421 QLSLDANKPSVSRKYSESPWLLVDEAQLVISYAKKRIFYKNEYTLEENPKWEQLIHILHD 480
Query: 470 IEHESSKYPAGTQGSVLILCSDDRTSNQLRQVIRNMKSKHNGHKNIMLDKLDIYXXXXXX 529
I HE + QG L+ CSD+ T +L +V+ N +K G + ++L+KL Y
Sbjct: 481 ISHE--RMTNHLQGPTLVACSDNLTCLELAKVL-NASNKKRGVRQVLLNKLKWYRKQREE 537
Query: 530 XXXISKDIVEENEKYQNGGEMNVSRAFHKQEINTKRRRTRGASFVAAVDRLKNAQAGQGQ 589
+ K+ V+ + + +NVS F K+++ TKRRRTRGAS VAAV++L+NA G
Sbjct: 538 TKKLVKE-VQSQDTFPENATLNVSSTFSKEQVTTKRRRTRGASQVAAVEKLRNA--GTNV 594
Query: 590 DIDAMITSDSIKEE------RDRSLVSMETLGDEAAVKEDFDSEDELSIIE--------E 635
D++ + + EE D E +++ + E + E+E+ I + E
Sbjct: 595 DMEVVFEDHKLSEEIKKGSGDDLDDGQEENAANDSKIFEIQEQENEILIDDGDAEFDNGE 654
Query: 636 KLQDGSIPEYAMVQNWEEVWERRKTGYKYVDNDCKIVIETFSTTSDEQILHELMPSYIII 695
G +P++ +++W Y+YVD +I+I TF + +D L E+MPSYII+
Sbjct: 655 LEYVGDLPQHITTHFNKDLWAEHCNEYEYVDRQDEILISTFKSLNDNCSLQEMMPSYIIM 714
Query: 696 YEPNLAFVRKLEVYKAIHRHNPPKVYFMYYGDSVEEQSHLSSIKREKEAFTKLIREHSNM 755
+EP+++F+R++EVYKAI + PKVYFMYYG+S+EEQSHL++IKREK+AFTKLIRE++N+
Sbjct: 715 FEPDISFIRQIEVYKAIVKDLQPKVYFMYYGESIEEQSHLTAIKREKDAFTKLIRENANL 774
Query: 756 AQHFETDEDLSRYKNLAHRKMQLSRMK--NSRIAGGQDFLNPMTYDVVVVDMREFRAALP 813
+ HFET+EDLS YKNLA RK++LS+++ N+R AGGQ + +T DVV+VD REF A+LP
Sbjct: 775 SHHFETNEDLSHYKNLAERKLKLSKLRKSNTRNAGGQQGFHNLTQDVVIVDTREFNASLP 834
Query: 814 GLLYRYGVRVVPCMLTIGDYVITPDICIERKSIADLIGSFKNGRLDKQIRSLSRFYKYPT 873
GLLYRYG+RV+PCMLT+GDYVITPDIC+ERKSI+DLIGS +N RL Q + + ++Y YPT
Sbjct: 835 GLLYRYGIRVIPCMLTVGDYVITPDICLERKSISDLIGSLQNNRLANQCKKMLKYYAYPT 894
Query: 874 LLIEFDDSQSFSLEPFSERNVYASAASSTVHPISGKLMQEEIQRELSHLVMKYPSLKIVW 933
LLIEFD+ QSFSLEPFSER Y + STVHPIS KL Q+EIQ +L+ LV+++P+LKI+W
Sbjct: 895 LLIEFDEGQSFSLEPFSERRNYKNKDISTVHPISSKLSQDEIQLKLAKLVLRFPTLKIIW 954
Query: 934 SSSPLQTVNIFLDLKTNREQPDPVKCVQFGSTK-----KQTGKNKKDTESNNKFKNLLTI 988
SSSPLQTVNI L+LK REQPDP V G+ K T K KD ++ +KFK LL +
Sbjct: 955 SSSPLQTVNIILELKLGREQPDPSNAVILGTNKVRSDFNSTAKGLKDGDNESKFKRLLNV 1014
Query: 989 PGLSNVDYYNIKKRYKRYADLLNASVEDLKNIVTDPDLSERIQLSLQRQ 1037
PG+S +DY+N++K+ K + L S ++ ++ D DL++RI L+ +
Sbjct: 1015 PGVSKIDYFNLRKKIKSFNKLQKLSWNEINELINDEDLTDRIYYFLRTE 1063
>SPCC970.01 [L] KOG0442 Structure-specific endonuclease ERCC1-XPF catalytic
component XPF/ERCC4
Length = 892
Score = 423 bits (1087), Expect = e-118
Identities = 287/949 (30%), Positives = 468/949 (49%), Gaps = 133/949 (14%)
Query: 84 VDVHLSLSLPFQHLILENLLVSENALLVIGKGLSVLSIVSNLLYTLSTPTRIDGTDKRSL 143
++ + L L +Q + N L+ E+ L VI GLS+L I +N+L + P SL
Sbjct: 1 METKVHLPLAYQQQVF-NELIEEDGLCVIAPGLSLLQIAANVLSYFAVPG--------SL 51
Query: 144 VLVLNAXXXXXXXXXXXLMELQWTCNEDD------EEGQPGDRPFTVISSDSFTVDQRSK 197
+L++ A N DD E ++ +++++ +VD+R K
Sbjct: 52 LLLVGA-------------------NVDDIELIQHEMESHLEKKLITVNTETMSVDKREK 92
Query: 198 RYQQGGIISVTSRILIVDLLSGIVHPKNITGLLILHAEKLDSMSIESFIVEIYRNSNKWG 257
Y +GGI ++TSRIL++DLL+ I+ + ITG+++LHA+++ S +FI+ +YR +NK G
Sbjct: 93 SYLEGGIFAITSRILVMDLLTKIIPTEKITGIVLLHADRVVSTGTVAFIMRLYRETNKTG 152
Query: 258 FIKAITDSAESMISEFAPLAKKMKDLLLKRILLWPRFHADISSCLNTTSNTKVIEIRVSL 317
FIKA +D E + L+ ++ L L+ + ++PRFH ++ L S V+E+ V+L
Sbjct: 153 FIKAFSDDPEQFLMGINALSHCLRCLFLRHVFIYPRFHVVVAESLEK-SPANVVELNVNL 211
Query: 318 TDSMSKIQFGLYECLKKCIDELNRKNPE-LSTEYWSFENALDSNFLRIINGVLSPKWHRI 376
+DS IQ L C++ + EL R N L E W+ E+AL +F I+ L WHR+
Sbjct: 212 SDSQKTIQSCLLTCIESTMRELRRLNSAYLDMEDWNIESALHRSFDVIVRRQLDSVWHRV 271
Query: 377 SYESKQLVKDXXXXXXXXXXXXXYDALDFYELIQ-LILDANKPSVTRKYSESPWLLADES 435
S ++KQLV D YD + F +L+ L+L N S SPWL+ D +
Sbjct: 272 SPKTKQLVGDLSTLKFLLSALVCYDCVSFLKLLDTLVLSVNVSSYPSNAQPSPWLMLDAA 331
Query: 436 QLVISFAKKRVIDDGNYN-------LEPLPKWEQLCALMEDIEHESSKYPAGTQ---GSV 485
+I A+ RV + LE PKW L ++ ++ HE+ + S+
Sbjct: 332 NKMIRVARDRVYKESEGPNMDAIPILEEQPKWSVLQDVLNEVCHETMLADTDAETSNNSI 391
Query: 486 LILCSDDRTSNQLRQVIRNMKSKHNGHKNIMLDKLDIYXXXXXXXXXISKDIVEENEKYQ 545
+I+C+D+RT QLR + + + M KL Y +SK I + +
Sbjct: 392 MIMCADERTCLQLRDYLSTVTYDNKDSLKNMNSKLVDYFQWREQYRKMSKSIKKPEPSKE 451
Query: 546 NGGEMNVSRAFHKQEINTKRRRTRGASFVAAVDRLKNAQAGQGQDIDAMITSDSIKEERD 605
SR K +KRRR RG + + N A D + +
Sbjct: 452 REASNTTSR---KGVPPSKRRRVRGGNNATSRTTSDNTDANDSFSRDLRL---------E 499
Query: 606 RSLVSMETLGDEAAVKEDFDSEDELSIIEEKLQDGSIPEYAMVQNWEEVWERRKTGYKYV 665
+ L+S + E V D ++ +
Sbjct: 500 KILLSHLSKRYEPEVGND-------------------------------------AFEVI 522
Query: 666 DNDCKIVIETFSTTSDEQILHELMPSYIIIYEPNLAFVRKLEVYKAIHRHNPPKVYFMYY 725
D+ I I +++ DE +L+ L P Y+I+++ + F+R++EVYKA + +VYFMYY
Sbjct: 523 DDFNSIYIYSYNGERDELVLNNLRPRYVIMFDSDPNFIRRVEVYKATYPKRSLRVYFMYY 582
Query: 726 GDSVEEQSHLSSIKREKEAFTKLIREHSNMAQHFETDEDLSRYKNLAHRKMQLSRMKNSR 785
G S+EEQ +L S++REK++F++LI+E SNMA D + R+++ ++ + R N+R
Sbjct: 583 GGSIEEQKYLFSVRREKDSFSRLIKERSNMAIVLTADSE--RFES---QESKFLRNVNTR 637
Query: 786 IAGGQDFL----NPMTYDV-----------VVVDMREFRAALPGLLYRYGVRVVPCMLTI 830
IAGG P + V+VD+REFR++LP +L+ V+PC L +
Sbjct: 638 IAGGGQLSITNEKPRVRSLYLMFICIKTLKVIVDLREFRSSLPSILHGNNFSVIPCQLLV 697
Query: 831 GDYVITPDICIERKSIADLIGSFKNGRLDKQIRSLSRFYKYPTLLIEFDDSQSFSLEPFS 890
GDY+++P IC+ERKSI DLI S NGRL Q +++ +Y+ P LLIEF+ QSF+ PFS
Sbjct: 698 GDYILSPKICVERKSIRDLIQSLSNGRLYSQCEAMTEYYEIPVLLIEFEQHQSFTSPPFS 757
Query: 891 ERNVYASAASSTVHPISGKLMQEEIQRELSHLVMKYPSLKIVWSSSPLQTVNIFLDLKTN 950
+ +S ++ + ++Q +L L + +P+L+IVWSSS T IF DLK
Sbjct: 758 D--------------LSSEIGKNDVQSKLVLLTLSFPNLRIVWSSSAYVTSIIFQDLKAM 803
Query: 951 REQPDPVKCVQFGSTKKQTGKNKKDTESNNKFKNLLTIPGLSNVDYYNI 999
++PDP G + G++ +T + L+ +P ++ +Y N+
Sbjct: 804 EQEPDPASAASIG---LEAGQDSTNTYNQAPLDLLMGLPYITMKNYRNV 849
>Hs4885217 [L] KOG0442 Structure-specific endonuclease ERCC1-XPF catalytic
component XPF/ERCC4
Length = 916
Score = 382 bits (980), Expect = e-105
Identities = 289/959 (30%), Positives = 463/959 (48%), Gaps = 122/959 (12%)
Query: 95 QHLILENLLVSENALLVIGKGLSVLSIVSNLLYTLSTPTRIDGTDKRSLVLVLNAXXXXX 154
+ L+LE L+ + L+V +GL ++ + L P LVLVLN
Sbjct: 20 RQLVLE--LLDTDGLVVCARGLGADRLLYHFLQLHCHPA--------CLVLVLNTQPA-- 67
Query: 155 XXXXXXLMELQWTCNEDDEEGQPGDRPFTVISSDSFTVDQRSKRYQQGGIISVTSRILIV 214
E ++ N+ EG P V ++ T + R + Y QGG+I TSRIL+V
Sbjct: 68 --------EEEYFINQLKIEGVE-HLPRRV--TNEITSNSRYEVYTQGGVIFATSRILVV 116
Query: 215 DLLSGIVHPKNITGLLILHAEKLDSMSIESFIVEIYRNSNKWGFIKAITDSAESMISEFA 274
D L+ + ITG+L+ A ++ E+FI+ ++R NK GFIKA TD+A + + F
Sbjct: 117 DFLTDRIPSDLITGILVYRAHRIIESCQEAFILRLFRQKNKRGFIKAFTDNAVAFDTGFC 176
Query: 275 PLAKKMKDLLLKRILLWPRFHADISSCLNTTSNTKVIEIRVSLTDSMSKIQFGLYECLKK 334
+ + M++L ++++ LWPRFH ++S L +V+EI VS+T +M IQ + + L
Sbjct: 177 HVERVMRNLFVRKLYLWPRFHVAVNSFLEQ-HKPEVVEIHVSMTPTMLAIQTAILDILNA 235
Query: 335 CIDELNRKNPELSTEYWSFENALDSNFLRIINGVLSPKWHRISYESKQLVKDXXXXXXXX 394
C+ EL NP L E S ENA+ F + I L P WH++ ++K LV+D
Sbjct: 236 CLKELKCHNPSLEVEDLSLENAIGKPFDKTIRHYLDPLWHQLGAKTKSLVQDLKILRTLL 295
Query: 395 XXXXXYDALDFYELIQLILDANKPSVTRKYSESPWLLADESQLVISFAKKRV-------- 446
YD + F L++ + K S WL D S + A+ RV
Sbjct: 296 QYLSQYDCVTFLNLLESLRATEKAFG----QNSGWLFLDSSTSMFINARARVYHLPDAKM 351
Query: 447 -----------IDDGNYN-----LEPLPKWEQLCALMEDIEHESSKYPA-GTQGSVLILC 489
I +G LE PKWE L ++++IE E+ + A G G VLI
Sbjct: 352 SKKEKISEKMEIKEGEETKKELVLESNPKWEALTEVLKEIEAENKESEALGGPGQVLICA 411
Query: 490 SDDRTSNQLRQVIRNMKSKHNGHKNIMLDKLDIYXXXXXXXXXISKDIVEENEKYQNGGE 549
SDDRT +QLR I G + +L + + E++ K +
Sbjct: 412 SDDRTCSQLRDYITL------GAEAFLL--------------RLYRKTFEKDSKAEE--- 448
Query: 550 MNVSRAFHKQEINTKRRRTRGASFVAAVDRLKNAQAGQGQDIDAMITSDSIKEERDRSLV 609
V F K++ + + R++ + + Q+ + T + +++ R L
Sbjct: 449 --VWMKFRKEDSSKRIRKS-------------HKRPKDPQNKERASTKERTLKKKKRKLT 493
Query: 610 SMETLGDEAAVKEDFDSED----ELSIIEEKLQDGSIPEYAMVQNWEEVWERRKTGYKYV 665
+ +G ++E+ D E+ E+S E S PE + ++ V + +
Sbjct: 494 LTQMVGKPEELEEEGDVEEGYRREISSSPE-----SCPEEIKHEEFD-VNLSSDAAFGIL 547
Query: 666 DNDCKIVIETFSTTSD---EQILHELMPSYIIIYEPNLAFVRKLEVYKAIHRHNPPKVYF 722
I+ + ++LHE+ P Y+++Y+ L FVR+LE+Y+A P +VYF
Sbjct: 548 KEPLTIIHPLLGCSDPYALTRVLHEVEPRYVVLYDAELTFVRQLEIYRASRPGKPLRVYF 607
Query: 723 MYYGDSVEEQSHLSSIKREKEAFTKLIREHSNMAQHFETDEDLSRYKNLAHRKMQLSRMK 782
+ YG S EEQ +L+++++EKEAF KLIRE ++M E + +L
Sbjct: 608 LIYGGSTEEQRYLTALRKEKEAFEKLIREKASMVVPEEREGRDETNLDLVRGTASADVST 667
Query: 783 NSRIAGGQDFLNPMTYDVVVVDMREFRAALPGLLYRYGVRVVPCMLTIGDYVITPDICIE 842
++R AGGQ+ T +VVDMREFR+ LP L++R G+ + P L +GDY++TP++C+E
Sbjct: 668 DTRKAGGQE--QNGTQQSIVVDMREFRSELPSLIHRRGIDIEPVTLEVGDYILTPEMCVE 725
Query: 843 RKSIADLIGSFKNGRLDKQIRSLSRFYKYPTLLIEFDDSQSFSLEPFSERNVYASAASST 902
RKSI+DLIGS NGRL Q S+SR+YK P LLIEFD S+ FSL + R SS
Sbjct: 726 RKSISDLIGSLNNGRLYSQCISMSRYYKRPVLLIEFDPSKPFSL---TSRGALFQEISS- 781
Query: 903 VHPISGKLMQEEIQRELSHLVMKYPSLKIVWSSSPLQTVNIFLDLKTNREQPDPVKCVQF 962
+I +L+ L + +P L+I+W SP T +F +LK ++ QPD +
Sbjct: 782 ----------NDISSKLTLLTLHFPRLRILWCPSPHATAELFEELKQSKPQPDAATALAI 831
Query: 963 GSTKKQTGKNKKDTESNNKFKNLLTIPGLSNVDYYNIKKRYKRYADLLNASVEDLKNIV 1021
+ + +++K F LL +PG++ + ++ K A+L S ++L +I+
Sbjct: 832 TADSETLPESEKYNPGPQDF--LLKMPGVNAKNCRSLMHHVKNIAELAALSQDELTSIL 888
>At5g41150 [L] KOG0442 Structure-specific endonuclease ERCC1-XPF catalytic
component XPF/ERCC4
Length = 956
Score = 352 bits (903), Expect = 2e-96
Identities = 280/999 (28%), Positives = 469/999 (46%), Gaps = 131/999 (13%)
Query: 90 LSLPFQHLILENLLVSENA-LLVIGKGLSVLSIVSNLLYTLSTPTRIDGTDKRSLVLVLN 148
++L + I+ +LL N LL++ GLS+ ++++LL L +P++ GT L+L+
Sbjct: 1 MALKYHQQIISDLLEDSNGGLLILSSGLSLAKLIASLLI-LHSPSQ--GT---LLLLLSP 54
Query: 149 AXXXXXXXXXXXLMELQWTCNEDDEEGQPGDRPFTVISSDSFTVDQRSKRYQQGGIISVT 208
A + L D P + +QR Y G +T
Sbjct: 55 AAQSLKSRIIHYISSL--------------DSPTPTEITADLPANQRYSLYTSGSPFFIT 100
Query: 209 SRILIVDLLSGIVHPKNITGLLILHAEKLDSMSIESFIVEIYRNSNKWGFIKAITDSAES 268
RILIVDLL+ + ++ G+ IL+A + S E+FI+ I ++ N +I+A +D ++
Sbjct: 101 PRILIVDLLTQRIPVSSLAGIFILNAHSISETSTEAFIIRIVKSLNSSAYIRAFSDRPQA 160
Query: 269 MISEFAPLAKKMKDLLLKRILLWPRFHADISSCLNTTSNTKVIEIRVSLTDSMSKIQFGL 328
M+S FA + M+ L L++I LWPRF D+S L +V++IRVS+++ M IQ +
Sbjct: 161 MVSGFAKTERTMRALFLRKIHLWPRFQLDVSQELEREP-PEVVDIRVSMSNYMVGIQKAI 219
Query: 329 YECLKKCIDELNRKNPELSTEYWSFENALDSNFLRIINGVLSPKWHRISYESKQLVKDXX 388
E + C+ E+ + N ++ + + E+ L +F I+ L P WH + +KQLV D
Sbjct: 220 IEVMDACLKEMKKTN-KVDVDDLTVESGLFKSFDEIVRRQLDPIWHTLGKRTKQLVSDLK 278
Query: 389 XXXXXXXXXXXYDALDFYELIQLILDANKPSVTRKYSESPWLLADESQLVISFAKKRVI- 447
YDA+ F + + + V+ Y S WL A+ S + FAKKRV
Sbjct: 279 TLRKLLDYLVRYDAVSFLKFLDTL------RVSESY-RSVWLFAESSYKIFDFAKKRVYR 331
Query: 448 ----------------------DDGNYN----------------------LEPLPKWEQL 463
G + LE PKW+ L
Sbjct: 332 LVKASDVKSKEHVKNKSGKKRNSKGETDSVEAVGGETATNVATGVVVEEVLEEAPKWKVL 391
Query: 464 CALMEDIEHESSKYP------AGTQGSVLILCSDDRTSNQLRQVIRNMKSKHNGHKNIML 517
++E+ + E K + G VL+ C D+R+ QL I N K +M
Sbjct: 392 REILEETQEERLKQAFSEEDNSDNNGIVLVACKDERSCMQLEDCITNNPQK------VMR 445
Query: 518 DKLDIYXXXXXXXXXISKDIVEENEKYQNG-----GEMNVSRAFHKQEINTKRRRTRG-- 570
++ ++Y + ++ +K G G + V+ + + + R+
Sbjct: 446 EEWEMYLLSKIELRSMQTP-QKKKQKTPKGFGILDGVVPVTTIQNSEGSSVGRQEHEALM 504
Query: 571 --ASFVAAVDRLKNAQAGQGQDIDAMITSDSIKEERDRSLVSMETLGDEAAVKEDFDSED 628
AS + + + + +G + + K + + S+ + K+ +S+
Sbjct: 505 AAASSIRKLGKTTDMASGNNNPEPHVDKASCTKGKAKKDPTSLRR-SLRSCNKKTTNSKP 563
Query: 629 ELSIIEEKLQDGSIPEYAMVQNWEEVWERRKTGYKYVDNDCKIVIETFSTTSDEQILHEL 688
E+ E + + + Q V R +G K + + ++ SD+ IL L
Sbjct: 564 EILPGPENEEKANEASTSAPQEANAV---RPSGAKKLPP-----VHFYALESDQPILDIL 615
Query: 689 MPSYIIIYEPNLAFVRKLEVYKAIHRHNPPKVYFMYYGDSVEEQSHLSSIKREKEAFTKL 748
PS II+Y P++ FVR+LEVYKA + KVYF++Y +S E Q +SI+RE EAF L
Sbjct: 616 KPSVIIVYHPDMGFVRELEVYKAENPLRKLKVYFIFYDESTEVQKFEASIRRENEAFESL 675
Query: 749 IREHSNMAQHFETDEDLSRYKNLAHRKMQLSRMKNS--RIAGGQDFLNPMTYDVVVVDMR 806
IR+ S+M D+D + + + S +NS R AGG+ L T V+VDMR
Sbjct: 676 IRQKSSMI--IPVDQDGLCMGSNSSTEFPASSTQNSLTRKAGGRKELEKETQ--VIVDMR 731
Query: 807 EFRAALPGLLYRYGVRVVPCMLTIGDYVITPDICIERKSIADLIGSFKNGRLDKQIRSLS 866
EF ++LP +L++ G++++P L +GDY+++P IC+ERKSI DL SF +GRL Q+ +S
Sbjct: 732 EFMSSLPNVLHQKGMKIIPVTLEVGDYILSPSICVERKSIQDLFQSFTSGRLFHQVEMMS 791
Query: 867 RFYKYPTLLIEFDDSQSFSLEPFSERNVYASAASSTVHPISGKLMQEEIQRELSHLVMKY 926
R+Y+ P LLIEF +SFS + S+ IS + I +LS LV+ +
Sbjct: 792 RYYRIPVLLIEFSQDKSFSFQSSSD--------------ISDDVTPYNIISKLSLLVLHF 837
Query: 927 PSLKIVWSSSPLQTVNIFLDLKTNREQPDPVKCVQFGSTKKQTGKNKKDTESNN----KF 982
P L+++WS S T IF LK+N+++PD + ++ G + G + D + N
Sbjct: 838 PRLRLLWSRSLHATAEIFTTLKSNQDEPDETRAIRVG-VPSEEGIIENDIRAENYNTSAV 896
Query: 983 KNLLTIPGLSNVDYYNIKKRYKRYADLLNASVEDLKNIV 1021
+ L +PG+S+ +Y +I ++ K A+L + VE L ++
Sbjct: 897 EFLRRLPGVSDANYRSIMEKCKSLAELASLPVETLAELM 935
>7290484 [L] KOG0442 Structure-specific endonuclease ERCC1-XPF catalytic
component XPF/ERCC4
Length = 961
Score = 332 bits (850), Expect = 2e-90
Identities = 252/934 (26%), Positives = 441/934 (46%), Gaps = 101/934 (10%)
Query: 103 LVSENALLVIGKGLSVLSIVSNLLYTLSTPTRIDGTDKRSLVLVLNAXXXXXXXXXXXLM 162
LV + LLV KGLS +V ++L S D +LVLV+N+
Sbjct: 57 LVEADGLLVCAKGLSYDRVVISILKAYS--------DSGNLVLVINSS------------ 96
Query: 163 ELQWTCNEDDEEGQPGDRPFTVISSDSFTVDQRSKRYQQGGIISVTSRILIVDLLSGIVH 222
W + +P + + T +R + Y +GG+ +++RIL+VDLL +
Sbjct: 97 --DWEEQYYKSKIEP-----KYVHEVASTATERERVYLEGGLQFISTRILVVDLLKQRIP 149
Query: 223 PKNITGLLILHAEKLDSMSIESFIVEIYRNSNKWGFIKAITDSAESMISEFAPLAKKMKD 282
+ I+G+++L A + E+F + ++R NK GF+KA + S E+ ++ + + M++
Sbjct: 150 IELISGIIVLRAHTIIESCQEAFALRLFRQKNKTGFVKAFSSSPEAFTIGYSHVERTMRN 209
Query: 283 LLLKRILLWPRFHADISSCLNTTSNTKVIEIRVSLTDSMSKIQFGLYECLKKCIDELNRK 342
L +K + +WPRFH + + L + IE+ V ++ +++ IQ + E + + E+ R
Sbjct: 210 LFVKHLYIWPRFHESVRTVLQPWK-IQSIEMHVPISQNITSIQSHILEIMNFLVQEIKRI 268
Query: 343 NPELSTEYWSFENALDSNFLRIINGVLSPKWHRISYESKQLVKDXXXXXXXXXXXXXYDA 402
N + E + EN + +F +I+ L WH+++ ++K +V D +DA
Sbjct: 269 NRTVDMEAVTVENCVTKSFHKILQAQLDCIWHQLNSQTKLIVADLKILRSLMISTMYHDA 328
Query: 403 LDFYELIQLILDANKPSVTRKYSESPWLLADESQLVISFAKKRVID-DGNYNLEPLPKWE 461
+ Y ++ S S S W L D ++ + +++RV + + EP PKW+
Sbjct: 329 VSAYAFMK-----RYRSTEYALSNSGWTLLDAAEQIFKLSRQRVFNGQQEFEPEPCPKWQ 383
Query: 462 QLCALM-EDIEHESSKYPAGTQG-SVLILCSDDRTSNQLRQVIRNMKSKHNGHKNIMLDK 519
L L+ ++I + + Q VLILC D RT +QL+Q + G + ++
Sbjct: 384 TLTDLLTKEIPGDMRRSRRSEQQPKVLILCQDARTCHQLKQYLTQ-----GGPRFLLQQA 438
Query: 520 LDIYXXXXXXXXXISKDIVEENEKYQNGGEMNVSRAFHKQEINTKRRRTRGASFVAAVDR 579
L +K+ + +N ++ ++ ++E++ + G +A +
Sbjct: 439 LQHEVPVGKLSDNYAKESQTRSAPPKN---VSSNKELRREEVSGSQPPLAGMDELA---Q 492
Query: 580 LKNAQAGQGQDIDAMITSDSIKEERDRSLVSMETLGDEAAVKEDFDSEDELSIIEEKLQD 639
L + +GQ + + +++M + D + ++SI E
Sbjct: 493 LLSESETEGQHFE------------ESYMLTMTQPVEVGPAAIDIKPDPDVSIFE----- 535
Query: 640 GSIPEYAMVQNWEEVWERRKTGYKYVDNDCKIVIETFSTTSD-----EQILHELMPSYII 694
+IPE V + I ++TF T + E +L +L P Y++
Sbjct: 536 -TIPELEQFDV--------TAALASVPHQPYICLQTFKTEREGSMALEHMLEQLQPHYVV 586
Query: 695 IYEPNLAFVRKLEVYKAIHRHNPP---KVYFMYYGDSVEEQSHLSSIKREKEAFTKLIRE 751
+Y N+ +R+LEV++A R P KVYF+ + +VEEQ++L+S++REK AF +I
Sbjct: 587 MYNMNVTAIRQLEVFEARRRLPPADRMKVYFLIHARTVEEQAYLTSLRREKAAFEFIIDT 646
Query: 752 HSNMA----QHFETDEDLSRYKNLAHRKMQLSRMKNSRIAGGQDFLNPMTYDVVVVDMRE 807
S M Q +TDE K + SR AGGQ V+VDMRE
Sbjct: 647 KSKMVIPKYQDGKTDEAFLLLKT--YDDEPTDENAKSRQAGGQAPQATKETPKVIVDMRE 704
Query: 808 FRAALPGLLYRYGVRVVPCMLTIGDYVITPDICIERKSIADLIGSFKNGRLDKQIRSLSR 867
FR+ LP L+++ G+ V+P +TIGDY++TPDIC+ERKSI+DLIGS +GRL Q + R
Sbjct: 705 FRSDLPCLIHKRGLEVLPLTITIGDYILTPDICVERKSISDLIGSLNSGRLYNQCVQMQR 764
Query: 868 FYKYPTLLIEFDDSQSFSLEPFSERNVYASAASSTVHPISGKLMQEEIQRELSHLVMKYP 927
Y P LLIEFD ++ F L+ + S A++ +I ++L L + +P
Sbjct: 765 HYAKPILLIEFDQNKPFHLQGKFMLSQQTSMANA------------DIVQKLQLLTLHFP 812
Query: 928 SLKIVWSSSPLQTVNIFLDLKTNREQPDPVKCVQFGSTKKQTGKNKKDTESNNKFKNLLT 987
L+++WS SP T +F +LK + +PDP GS + G+ F LL
Sbjct: 813 KLRLIWSPSPYATAQLFEELKLGKPEPDPQTAAALGSDEPTAGEQLHFNSGIYDF--LLR 870
Query: 988 IPGLSNVDYYNIKKRYKRYADLLNASVEDLKNIV 1021
+PG+ + + + ++ LL S ++L+ ++
Sbjct: 871 LPGVHTRNIHGLLRKGGSLRQLLLRSQKELEELL 904
>CE24855 [L] KOG0442 Structure-specific endonuclease ERCC1-XPF catalytic
component XPF/ERCC4
Length = 935
Score = 189 bits (480), Expect = 2e-47
Identities = 232/975 (23%), Positives = 401/975 (40%), Gaps = 153/975 (15%)
Query: 102 LLVSENALLVIGKGLSVLSIVSNLL--YTLSTPTRIDGTDKRSLVLVLNAXXXXXXXXXX 159
LL E A L SVL +V+N L L I +D+R L LVLN
Sbjct: 24 LLEYERATLAKTLPASVLFVVANGLGLERLFLEHLILFSDRRLLALVLNTNEHDESYFVS 83
Query: 160 XLMELQWTCNEDDEEGQPGDRPFTVISSDSFTVDQRSKRYQQGGIISVTSRILIVDLLSG 219
L E C+ VI+S+ ++ R Y +GG+ +SR+L+VDLL
Sbjct: 84 KLKEHNVECDPK------------VINSE-VSIKDRQSIYLEGGVQFCSSRVLLVDLLQN 130
Query: 220 IVHPKNITGLLILHAEKLDSMSIESFIVEIYRNSNKWGFIKAITDSAESMISEFAPLAKK 279
+ I + + A + + +SFI+ +YR G +KA TD S+ S L +
Sbjct: 131 RIPTDRIAAIFVYRAHQTLNAFQDSFILRLYREKKPDGTVKAFTDFPNSL-SSLGQLQRL 189
Query: 280 MKDLLLKRILLWPRFHADISSCLNTTSNTKVIEIRVSLTDSMSKIQFGLYECLKKCIDEL 339
+ L ++ + L PRF + I S LN K V + + ++ + E +K C+ +L
Sbjct: 190 VDRLYIRHVELMPRFSSIIESELNRYQ-LKTAIFSVDVPTPLRRVHRTIIEFIKVCVRDL 248
Query: 340 ----------NRKNPELSTEYWSFENALDSNFLRIINGVLSPKWHRISYESKQLVKDXXX 389
+ +N E+ W+ + L + IS + ++L+ D
Sbjct: 249 RTCSTSGKQTDEQNEEMIHVPWAATR---------LEKRLHDRRGHISEKQQRLLNDVAS 299
Query: 390 XXXXXXXXXXYDALDFYELIQLILDANKPSVTRKYSESPWLLADE-SQLVISFAKKRVID 448
D +Q++ K T S WLL+ ++++ +
Sbjct: 300 LREILQLSENMDVATVLSRLQVL----KNDRTVLEEHSGWLLSPSFNRIMEDLLTIAGVT 355
Query: 449 DGNYNLEPLP---KWEQLCALMEDIEH--ESSKYPAGTQGSVLILCSDDRTSNQLRQVIR 503
+G + + KW L ++ +I+ K SVL++ S + S Q+ V+R
Sbjct: 356 NGKADYKKFATPAKWTVLSEILREIKMLPVEKKDRGNDSPSVLVITSSEDLSRQVTDVVR 415
Query: 504 N-------MKSKHNGHKNIMLDKLDIYXXXXXXXXXISKDIVEENEKYQNGGEMNVSRAF 556
M + G+K+ D D + + + GE
Sbjct: 416 YGINKMKWMTWRQLGYKSTQEMPED--------EPLWDPDTISQLMRSSVDGES------ 461
Query: 557 HKQEINTKRRRTRGASFVAAVDRLKNAQAGQGQDIDAMITSDSIK-----EERDRSLVSM 611
K E+ ++T+ + AA R K+A+ G D + ++ I+ +R +S
Sbjct: 462 -KSEVIANVQKTQKTTARAAQKRRKHAEELSGFSSDHRVQTNLIQFGILQYKRRKS---- 516
Query: 612 ETLGDEAAVKEDFDSEDELSIIEEKLQDGSIPEYAMVQNWEEVWERRKTGYKYVDNDCKI 671
G+EA+ ++ E+ + + EE+ E K D + ++
Sbjct: 517 ---GNEASTSQE------------------TTEWEVKEEMEEIEEITKN---IGDLEAEL 552
Query: 672 VIETFSTTSDEQ------ILHELMPSYIIIYEPNLAFVRKLEVYKAIHRHNPPKVYFMYY 725
V+ STT + + +L P I++Y +L +R++E+Y++ + + VY++ Y
Sbjct: 553 VV---STTRERERYTLLKLLETKKPRAIVLYTMSLQTLRQIEIYRSTNPNRSLHVYWLQY 609
Query: 726 GDSVEEQSHLSSIKREKEAFTKLIREHSNM--AQHFETD-EDLSRYKNLAHRKMQLSRMK 782
+S EE +L SI RE +F LIRE + ++ F D ED R K ++ R +R
Sbjct: 610 TESTEESRYLESINRETMSFELLIREQGTLLISREFNVDREDAPRLK-ISTRDGGGARRD 668
Query: 783 NSRIAGGQDFLNP---MTYDVVVVDMREFRAALPGLLYRYGVRVVPCMLTIGDYVITPDI 839
+ +D ++P + ++VDMREF + LP +LY G VV + IGDY+++P+I
Sbjct: 669 GA--VDPRDQMDPEEELERPKIIVDMREFNSELPTVLYTKGYNVVATTIEIGDYILSPNI 726
Query: 840 CIERKSIADLIGSFKNGRLDKQIRSLSRFYKYPTLLIEFDDSQSFSLEPFSERNVYASAA 899
IERK++ DL S ++GR+ KQI + Y LLIE S F + V
Sbjct: 727 AIERKALDDLTQSLQSGRVFKQIEQMLEHYDCTVLLIE-------SNRKFETKIVNGG-- 777
Query: 900 SSTVHPISGKLMQ--EEIQRELSHLVMKYPSLKIVWSSSPLQTVNIFLDLKTNREQPDPV 957
P G+L + EI+ L+ P ++ VW+ SP + F +LK + +PD
Sbjct: 778 -----PFQGELSRHCREIRSIFCSLIWANPKMRCVWTISPTNSAEFFSELKLSAPEPDVD 832
Query: 958 KCVQF----------------GSTKKQTGKNKKDTESNNKFKNLLTIPGLSNVDYYNI-- 999
+ + ST + K KK + + L I G+ + +N+
Sbjct: 833 RAISLKADQVECSSQELTDSEASTSTKAKKGKKWKPNPTVIRTLTQIFGIKASEAHNLLA 892
Query: 1000 KKRYKRYADLLNASV 1014
K ADL + ++
Sbjct: 893 NSSIKTLADLFSLNI 907
>ECU08g0760 [L] KOG0442 Structure-specific endonuclease ERCC1-XPF catalytic
component XPF/ERCC4
Length = 768
Score = 121 bits (303), Expect = 7e-27
Identities = 96/329 (29%), Positives = 150/329 (45%), Gaps = 61/329 (18%)
Query: 693 IIIYEPNLAFVRKLEVYKAIHRHNPPKVYFMYYGDSVEEQSHLSSIKREKEAFTKLIREH 752
+I E VRK+E Y H KV+F+ + S+EEQ +L+ I+REK +F KLI E
Sbjct: 463 VIFVESGQDSVRKIERYGVAHS---VKVFFLMHTGSLEEQRYLNEIRREKASFEKLIEER 519
Query: 753 SNMAQHFETDE---DLSRYKNLAHRKMQLSRMKNSRIAGGQDFLNPMTYDVVVVDMREFR 809
S + + + DL Y+ A R+ VVVVD RE R
Sbjct: 520 SRLPLRLDDVDDAIDLEEYEP-AEREY-----------------------VVVVDSRELR 555
Query: 810 AALPGLLYRYGVRVVPCMLTIGDYVITPDICIERKSIADLIGSFKNGRLDKQIRSLSRFY 869
A LP L+R R+ L +GDY+++P CIERKSI DL+ S +GRL Q L Y
Sbjct: 556 AELPFFLFRARNRICISTLPVGDYLVSPTTCIERKSIPDLVSSLNSGRLYLQASMLCHRY 615
Query: 870 KYPTLLIEFDDSQSFSLEPFSERNVYASAASSTVHPISGKLMQEEIQRELSHLVMKYPSL 929
P LL+EFD S + + + +LS L+ +L
Sbjct: 616 PRPVLLLEFD----------------GRPCLSDYYRYDQDTFKNSLVAKLSLLLFNLGAL 659
Query: 930 KIVWSSSPLQTVNIFLDLKTNREQPDPVKCVQFGSTKKQTGKNKKDTESNNKFKNLLTIP 989
+++WS S L + I DL+ + V+ +K D + + LL+IP
Sbjct: 660 RLIWSESRLFSTKIIRDLQRKEDVSSAVE------------GHKMDPVLH---EILLSIP 704
Query: 990 GLSNVDYYNIKKRYKRYADLLNASVEDLK 1018
G++ + +++ ++ DL+ +++E L+
Sbjct: 705 GITQFNISRVRRYFRSLKDLVFSTMERLE 733
Score = 84.3 bits (207), Expect = 9e-16
Identities = 39/139 (28%), Positives = 82/139 (58%), Gaps = 2/139 (1%)
Query: 194 QRSKRYQQGGIISVTSRILIVDLLSGIVHPKNITGLLILHAEKLDSMSIESFIVEIYRNS 253
+R ++Y GG+ ++R+ + D++ G + + I +L+ + E + S ESFI+ ++R+
Sbjct: 112 KRREKYLCGGVCIASNRVFLADMIDGTIDAEKIDCILVNNVETITETSSESFIIHVFRSR 171
Query: 254 NKWGFIKAITDSAESMISEFAPLAKKMKDLLLKRILLWPRFHADISSCLNTTSNTKVIEI 313
N+ G I+ ++S + F+PL +KM+ L + + + +PRFH+ + LN + V+EI
Sbjct: 172 NRTGLIRGFSESPVPLSLGFSPLDRKMRSLKVSKAVFFPRFHSLVEESLN--GDMDVVEI 229
Query: 314 RVSLTDSMSKIQFGLYECL 332
+ +++ S++Q L E +
Sbjct: 230 KFRMSERKSQLQVVLLEII 248
Database: KOG eukaryal database 04/03
Posted date: Apr 14, 2003 1:07 PM
Number of letters in database: 30,389,216
Number of sequences in database: 60,738
Lambda K H
0.316 0.134 0.375
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 59,978,052
Number of Sequences: 60738
Number of extensions: 2591187
Number of successful extensions: 8683
Number of sequences better than 1.0e-05: 7
Number of HSP's better than 0.0 without gapping: 7
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 8629
Number of HSP's gapped (non-prelim): 17
length of query: 1056
length of database: 30,389,216
effective HSP length: 117
effective length of query: 939
effective length of database: 23,282,870
effective search space: 21862614930
effective search space used: 21862614930
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
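
The figures in the statistics block above are mutually consistent, which can be checked with a short Python sketch. This is an illustration only, not part of the BLAST output; the function names are hypothetical. It assumes the usual Karlin-Altschul relations: bit score S' = (lambda * S - ln K) / ln 2 and E = (effective search space) * 2^(-S'), with the effective lengths obtained by subtracting the effective HSP length from the query and from every database sequence. Because lambda and K are printed with limited precision, the recomputed bit scores only approximate the reported ones.

```python
import math

# Values copied from the statistics block of this report.
QUERY_LEN = 1056
DB_LETTERS = 30_389_216
DB_SEQS = 60_738
HSP_LEN = 117          # effective HSP length
GAPPED_LAMBDA = 0.267  # gapped Lambda, as printed (rounded)
GAPPED_K = 0.0410      # gapped K, as printed (rounded)


def effective_search_space():
    """Effective query length times effective database length."""
    eff_query = QUERY_LEN - HSP_LEN            # 939
    eff_db = DB_LETTERS - DB_SEQS * HSP_LEN    # 23,282,870
    return eff_query * eff_db


def bit_score(raw_score):
    """Convert a raw alignment score to bits using the gapped parameters."""
    return (GAPPED_LAMBDA * raw_score - math.log(GAPPED_K)) / math.log(2)


def expect(bits):
    """Expected number of chance hits with at least this bit score."""
    return effective_search_space() * 2.0 ** (-bits)


if __name__ == "__main__":
    print("effective search space:", effective_search_space())
    # Second ECU08g0760 HSP: raw score 207, reported 84.3 bits, E = 9e-16.
    print("bits for raw 207:", round(bit_score(207), 1))
    print("E for 84.3 bits: %.1e" % expect(84.3))
```

For example, the second ECU08g0760 alignment (raw score 207) comes out close to the reported 84.3 bits and an Expect value near 9e-16.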