ORF            STATUS  Function  Best COG  Functional category                    Pathways and functional systems
r_klactIV3071  good    L         KOG0442   Replication, recombination and repair  Structure-specific endonuclease ERCC1-XPF, catalytic component XPF/ERCC4

Only best alignment is shown:
BLASTP 2.2.3 [May-13-2002]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= r_klactIV3071 1037870  1041037 1056 
         (1056 letters)

Database: KOG eukaryal database 04/03 
           60,738 sequences; 30,389,216 total letters

Searching..................................................done

                                                                       Score  E
Sequences producing significant alignments:                            (bits) Value

YPL022w    [L] KOG0442 Structure-specific endonuclease ERCC1-XPF ca...  1013  0.0
SPCC970.01 [L] KOG0442 Structure-specific endonuclease ERCC1-XPF...      423  e-118
Hs4885217  [L] KOG0442 Structure-specific endonuclease ERCC1-XPF ...     382  e-105
At5g41150  [L] KOG0442 Structure-specific endonuclease ERCC1-XPF ...     352  2e-96
7290484    [L] KOG0442 Structure-specific endonuclease ERCC1-XPF ca...   332  2e-90
CE24855    [L] KOG0442 Structure-specific endonuclease ERCC1-XPF ca...   189  2e-47
ECU08g0760 [L] KOG0442 Structure-specific endonuclease ERCC1-XPF...      121  7e-27

>YPL022w [L] KOG0442 Structure-specific endonuclease ERCC1-XPF catalytic component XPF/ERCC4
          Length = 1100

 Score = 1013 bits (2620), Expect = 0.0
 Identities = 537/1069 (50%), Positives = 734/1069 (68%), Gaps = 43/1069 (4%)

Query: 3 LFLQDDSDEEDLVIELSN-LSKNDETADVIVEDQPKETAESELSENPQSEVLGDTEDKVL 61 LF Q DSD+E L EL+ ++ +++ + ED+P ++ EN S+VL D D VL Sbjct: 4 LFYQGDSDDE-LQEELTRQTTQASQSSKIKNEDEPDDSNHLNEVENEDSKVLDD--DAVL 60 Query: 62 YPLIPIDANT-ETLKPNIEEIRPVDVHLSLSLPFQHLILENLLVSENALLVIGKGLSVLS 120 YPLIP + + ET KPNI +IRPVD+ L+L LPFQ ++EN L++E+AL+++GKGL +L Sbjct: 61 YPLIPNEPDDIETSKPNINDIRPVDIQLTLPLPFQQKVVENSLITEDALIIMGKGLGLLD 120 Query: 121 IVSNLLYTLSTPTRIDGTDKRSLVLVLNAXXXXXXXXXXXLMELQWTCN---EDDEEGQP 177 IV+NLL+ L+TPT I+G KR+LVLVLNA L EL W N +DD+ Sbjct: 121 IVANLLHVLATPTSINGQLKRALVLVLNAKPIDNVRIKEALEELSWFSNTGKDDDDTAVE 180 Query: 178 GD-----RPFTVISSDSFTVDQRSKRYQQGGIISVTSRILIVDLLSGIVHPKNITGLLIL 232 D RPF V+++DS ++++R K Y GGI+S+TSRILIVDLLSGIVHP +TG+L+L Sbjct: 181 SDDELFERPFNVVTADSLSIEKRRKLYISGGILSITSRILIVDLLSGIVHPNRVTGMLVL 240 Query: 233 HAEKLDSMSIESFIVEIYRNSNKWGFIKAITDSAESMISEFAPLAKKMKDLLLKRILLWP 292 +A+ L S ESFI+EIYR+ N WGFIKA +++ E+ + EF+PL KMK+L LK +LLWP Sbjct: 241 NADSLRHNSNESFILEIYRSKNTWGFIKAFSEAPETFVMEFSPLRTKMKELRLKNVLLWP 300 Query: 293 RFHADISSCLNTTSNT---KVIEIRVSLTDSMSKIQFGLYECLKKCIDELNRKNPELSTE 349 RF ++SSCLN T+ T KVIE++VSLT+SMS+IQFGL ECLKKCI EL+RKNPEL+ + Sbjct: 301
RFRVEVSSCLNATNKTSHNKVIEVKVSLTNSMSQIQFGLMECLKKCIAELSRKNPELALD 360 Query: 350 YWSFENALDSNFLRIINGVLSPKWHRISYESKQLVKDXXXXXXXXXXXXXYDALDFYELI 409 +W+ EN LD NF+R I+ V+ P WHRISYESKQLVKD DA+DF+ I Sbjct: 361 WWNMENVLDINFIRSIDSVMVPNWHRISYESKQLVKDIRFLRHLLKMLVTSDAVDFFGEI 420 Query: 410 QLILDANKPSVTRKYSESPWLLADESQLVISFAKKRVIDDGNYNLEPLPKWEQLCALMED 469 QL LDANKPSV+RKYSESPWLL DE+QLVIS+AKKR+ Y LE PKWEQL ++ D Sbjct: 421 QLSLDANKPSVSRKYSESPWLLVDEAQLVISYAKKRIFYKNEYTLEENPKWEQLIHILHD 480 Query: 470 IEHESSKYPAGTQGSVLILCSDDRTSNQLRQVIRNMKSKHNGHKNIMLDKLDIYXXXXXX 529 I HE + QG L+ CSD+ T +L +V+ N +K G + ++L+KL Y Sbjct: 481 ISHE--RMTNHLQGPTLVACSDNLTCLELAKVL-NASNKKRGVRQVLLNKLKWYRKQREE 537 Query: 530 XXXISKDIVEENEKYQNGGEMNVSRAFHKQEINTKRRRTRGASFVAAVDRLKNAQAGQGQ 589 + K+ V+ + + +NVS F K+++ TKRRRTRGAS VAAV++L+NA G Sbjct: 538 TKKLVKE-VQSQDTFPENATLNVSSTFSKEQVTTKRRRTRGASQVAAVEKLRNA--GTNV 594 Query: 590 DIDAMITSDSIKEE------RDRSLVSMETLGDEAAVKEDFDSEDELSIIE--------E 635 D++ + + EE D E +++ + E + E+E+ I + E Sbjct: 595 DMEVVFEDHKLSEEIKKGSGDDLDDGQEENAANDSKIFEIQEQENEILIDDGDAEFDNGE 654 Query: 636 KLQDGSIPEYAMVQNWEEVWERRKTGYKYVDNDCKIVIETFSTTSDEQILHELMPSYIII 695 G +P++ +++W Y+YVD +I+I TF + +D L E+MPSYII+ Sbjct: 655 LEYVGDLPQHITTHFNKDLWAEHCNEYEYVDRQDEILISTFKSLNDNCSLQEMMPSYIIM 714 Query: 696 YEPNLAFVRKLEVYKAIHRHNPPKVYFMYYGDSVEEQSHLSSIKREKEAFTKLIREHSNM 755 +EP+++F+R++EVYKAI + PKVYFMYYG+S+EEQSHL++IKREK+AFTKLIRE++N+ Sbjct: 715 FEPDISFIRQIEVYKAIVKDLQPKVYFMYYGESIEEQSHLTAIKREKDAFTKLIRENANL 774 Query: 756 AQHFETDEDLSRYKNLAHRKMQLSRMK--NSRIAGGQDFLNPMTYDVVVVDMREFRAALP 813 + HFET+EDLS YKNLA RK++LS+++ N+R AGGQ + +T DVV+VD REF A+LP Sbjct: 775 SHHFETNEDLSHYKNLAERKLKLSKLRKSNTRNAGGQQGFHNLTQDVVIVDTREFNASLP 834 Query: 814 GLLYRYGVRVVPCMLTIGDYVITPDICIERKSIADLIGSFKNGRLDKQIRSLSRFYKYPT 873 GLLYRYG+RV+PCMLT+GDYVITPDIC+ERKSI+DLIGS +N RL Q + + ++Y YPT Sbjct: 835 GLLYRYGIRVIPCMLTVGDYVITPDICLERKSISDLIGSLQNNRLANQCKKMLKYYAYPT 894 Query: 874 LLIEFDDSQSFSLEPFSERNVYASAASSTVHPISGKLMQEEIQRELSHLVMKYPSLKIVW 933 LLIEFD+ QSFSLEPFSER Y + STVHPIS KL 
Q+EIQ +L+ LV+++P+LKI+W Sbjct: 895 LLIEFDEGQSFSLEPFSERRNYKNKDISTVHPISSKLSQDEIQLKLAKLVLRFPTLKIIW 954 Query: 934 SSSPLQTVNIFLDLKTNREQPDPVKCVQFGSTK-----KQTGKNKKDTESNNKFKNLLTI 988 SSSPLQTVNI L+LK REQPDP V G+ K T K KD ++ +KFK LL + Sbjct: 955 SSSPLQTVNIILELKLGREQPDPSNAVILGTNKVRSDFNSTAKGLKDGDNESKFKRLLNV 1014 Query: 989 PGLSNVDYYNIKKRYKRYADLLNASVEDLKNIVTDPDLSERIQLSLQRQ 1037 PG+S +DY+N++K+ K + L S ++ ++ D DL++RI L+ + Sbjct: 1015 PGVSKIDYFNLRKKIKSFNKLQKLSWNEINELINDEDLTDRIYYFLRTE 1063 >SPCC970.01 [L] KOG0442 Structure-specific endonuclease ERCC1-XPF catalytic component XPF/ERCC4 Length = 892 Score = 423 bits (1087), Expect = e-118 Identities = 287/949 (30%), Positives = 468/949 (49%), Gaps = 133/949 (14%) Query: 84 VDVHLSLSLPFQHLILENLLVSENALLVIGKGLSVLSIVSNLLYTLSTPTRIDGTDKRSL 143 ++ + L L +Q + N L+ E+ L VI GLS+L I +N+L + P SL Sbjct: 1 METKVHLPLAYQQQVF-NELIEEDGLCVIAPGLSLLQIAANVLSYFAVPG--------SL 51 Query: 144 VLVLNAXXXXXXXXXXXLMELQWTCNEDD------EEGQPGDRPFTVISSDSFTVDQRSK 197 +L++ A N DD E ++ +++++ +VD+R K Sbjct: 52 LLLVGA-------------------NVDDIELIQHEMESHLEKKLITVNTETMSVDKREK 92 Query: 198 RYQQGGIISVTSRILIVDLLSGIVHPKNITGLLILHAEKLDSMSIESFIVEIYRNSNKWG 257 Y +GGI ++TSRIL++DLL+ I+ + ITG+++LHA+++ S +FI+ +YR +NK G Sbjct: 93 SYLEGGIFAITSRILVMDLLTKIIPTEKITGIVLLHADRVVSTGTVAFIMRLYRETNKTG 152 Query: 258 FIKAITDSAESMISEFAPLAKKMKDLLLKRILLWPRFHADISSCLNTTSNTKVIEIRVSL 317 FIKA +D E + L+ ++ L L+ + ++PRFH ++ L S V+E+ V+L Sbjct: 153 FIKAFSDDPEQFLMGINALSHCLRCLFLRHVFIYPRFHVVVAESLEK-SPANVVELNVNL 211 Query: 318 TDSMSKIQFGLYECLKKCIDELNRKNPE-LSTEYWSFENALDSNFLRIINGVLSPKWHRI 376 +DS IQ L C++ + EL R N L E W+ E+AL +F I+ L WHR+ Sbjct: 212 SDSQKTIQSCLLTCIESTMRELRRLNSAYLDMEDWNIESALHRSFDVIVRRQLDSVWHRV 271 Query: 377 SYESKQLVKDXXXXXXXXXXXXXYDALDFYELIQ-LILDANKPSVTRKYSESPWLLADES 435 S ++KQLV D YD + F +L+ L+L N S SPWL+ D + Sbjct: 272 SPKTKQLVGDLSTLKFLLSALVCYDCVSFLKLLDTLVLSVNVSSYPSNAQPSPWLMLDAA 331 Query: 436 QLVISFAKKRVIDDGNYN-------LEPLPKWEQLCALMEDIEHESSKYPAGTQ---GSV 485 +I A+ RV + LE PKW L ++ ++ HE+ + S+ Sbjct: 332 
NKMIRVARDRVYKESEGPNMDAIPILEEQPKWSVLQDVLNEVCHETMLADTDAETSNNSI 391 Query: 486 LILCSDDRTSNQLRQVIRNMKSKHNGHKNIMLDKLDIYXXXXXXXXXISKDIVEENEKYQ 545 +I+C+D+RT QLR + + + M KL Y +SK I + + Sbjct: 392 MIMCADERTCLQLRDYLSTVTYDNKDSLKNMNSKLVDYFQWREQYRKMSKSIKKPEPSKE 451 Query: 546 NGGEMNVSRAFHKQEINTKRRRTRGASFVAAVDRLKNAQAGQGQDIDAMITSDSIKEERD 605 SR K +KRRR RG + + N A D + + Sbjct: 452 REASNTTSR---KGVPPSKRRRVRGGNNATSRTTSDNTDANDSFSRDLRL---------E 499 Query: 606 RSLVSMETLGDEAAVKEDFDSEDELSIIEEKLQDGSIPEYAMVQNWEEVWERRKTGYKYV 665 + L+S + E V D ++ + Sbjct: 500 KILLSHLSKRYEPEVGND-------------------------------------AFEVI 522 Query: 666 DNDCKIVIETFSTTSDEQILHELMPSYIIIYEPNLAFVRKLEVYKAIHRHNPPKVYFMYY 725 D+ I I +++ DE +L+ L P Y+I+++ + F+R++EVYKA + +VYFMYY Sbjct: 523 DDFNSIYIYSYNGERDELVLNNLRPRYVIMFDSDPNFIRRVEVYKATYPKRSLRVYFMYY 582 Query: 726 GDSVEEQSHLSSIKREKEAFTKLIREHSNMAQHFETDEDLSRYKNLAHRKMQLSRMKNSR 785 G S+EEQ +L S++REK++F++LI+E SNMA D + R+++ ++ + R N+R Sbjct: 583 GGSIEEQKYLFSVRREKDSFSRLIKERSNMAIVLTADSE--RFES---QESKFLRNVNTR 637 Query: 786 IAGGQDFL----NPMTYDV-----------VVVDMREFRAALPGLLYRYGVRVVPCMLTI 830 IAGG P + V+VD+REFR++LP +L+ V+PC L + Sbjct: 638 IAGGGQLSITNEKPRVRSLYLMFICIKTLKVIVDLREFRSSLPSILHGNNFSVIPCQLLV 697 Query: 831 GDYVITPDICIERKSIADLIGSFKNGRLDKQIRSLSRFYKYPTLLIEFDDSQSFSLEPFS 890 GDY+++P IC+ERKSI DLI S NGRL Q +++ +Y+ P LLIEF+ QSF+ PFS Sbjct: 698 GDYILSPKICVERKSIRDLIQSLSNGRLYSQCEAMTEYYEIPVLLIEFEQHQSFTSPPFS 757 Query: 891 ERNVYASAASSTVHPISGKLMQEEIQRELSHLVMKYPSLKIVWSSSPLQTVNIFLDLKTN 950 + +S ++ + ++Q +L L + +P+L+IVWSSS T IF DLK Sbjct: 758 D--------------LSSEIGKNDVQSKLVLLTLSFPNLRIVWSSSAYVTSIIFQDLKAM 803 Query: 951 REQPDPVKCVQFGSTKKQTGKNKKDTESNNKFKNLLTIPGLSNVDYYNI 999 ++PDP G + G++ +T + L+ +P ++ +Y N+ Sbjct: 804 EQEPDPASAASIG---LEAGQDSTNTYNQAPLDLLMGLPYITMKNYRNV 849 >Hs4885217 [L] KOG0442 Structure-specific endonuclease ERCC1-XPF catalytic component XPF/ERCC4 Length = 916 Score = 382 bits (980), Expect = e-105 Identities = 289/959 (30%), Positives = 463/959 (48%), Gaps = 122/959 (12%) Query: 95 
QHLILENLLVSENALLVIGKGLSVLSIVSNLLYTLSTPTRIDGTDKRSLVLVLNAXXXXX 154 + L+LE L+ + L+V +GL ++ + L P LVLVLN Sbjct: 20 RQLVLE--LLDTDGLVVCARGLGADRLLYHFLQLHCHPA--------CLVLVLNTQPA-- 67 Query: 155 XXXXXXLMELQWTCNEDDEEGQPGDRPFTVISSDSFTVDQRSKRYQQGGIISVTSRILIV 214 E ++ N+ EG P V ++ T + R + Y QGG+I TSRIL+V Sbjct: 68 --------EEEYFINQLKIEGVE-HLPRRV--TNEITSNSRYEVYTQGGVIFATSRILVV 116 Query: 215 DLLSGIVHPKNITGLLILHAEKLDSMSIESFIVEIYRNSNKWGFIKAITDSAESMISEFA 274 D L+ + ITG+L+ A ++ E+FI+ ++R NK GFIKA TD+A + + F Sbjct: 117 DFLTDRIPSDLITGILVYRAHRIIESCQEAFILRLFRQKNKRGFIKAFTDNAVAFDTGFC 176 Query: 275 PLAKKMKDLLLKRILLWPRFHADISSCLNTTSNTKVIEIRVSLTDSMSKIQFGLYECLKK 334 + + M++L ++++ LWPRFH ++S L +V+EI VS+T +M IQ + + L Sbjct: 177 HVERVMRNLFVRKLYLWPRFHVAVNSFLEQ-HKPEVVEIHVSMTPTMLAIQTAILDILNA 235 Query: 335 CIDELNRKNPELSTEYWSFENALDSNFLRIINGVLSPKWHRISYESKQLVKDXXXXXXXX 394 C+ EL NP L E S ENA+ F + I L P WH++ ++K LV+D Sbjct: 236 CLKELKCHNPSLEVEDLSLENAIGKPFDKTIRHYLDPLWHQLGAKTKSLVQDLKILRTLL 295 Query: 395 XXXXXYDALDFYELIQLILDANKPSVTRKYSESPWLLADESQLVISFAKKRV-------- 446 YD + F L++ + K S WL D S + A+ RV Sbjct: 296 QYLSQYDCVTFLNLLESLRATEKAFG----QNSGWLFLDSSTSMFINARARVYHLPDAKM 351 Query: 447 -----------IDDGNYN-----LEPLPKWEQLCALMEDIEHESSKYPA-GTQGSVLILC 489 I +G LE PKWE L ++++IE E+ + A G G VLI Sbjct: 352 SKKEKISEKMEIKEGEETKKELVLESNPKWEALTEVLKEIEAENKESEALGGPGQVLICA 411 Query: 490 SDDRTSNQLRQVIRNMKSKHNGHKNIMLDKLDIYXXXXXXXXXISKDIVEENEKYQNGGE 549 SDDRT +QLR I G + +L + + E++ K + Sbjct: 412 SDDRTCSQLRDYITL------GAEAFLL--------------RLYRKTFEKDSKAEE--- 448 Query: 550 MNVSRAFHKQEINTKRRRTRGASFVAAVDRLKNAQAGQGQDIDAMITSDSIKEERDRSLV 609 V F K++ + + R++ + + Q+ + T + +++ R L Sbjct: 449 --VWMKFRKEDSSKRIRKS-------------HKRPKDPQNKERASTKERTLKKKKRKLT 493 Query: 610 SMETLGDEAAVKEDFDSED----ELSIIEEKLQDGSIPEYAMVQNWEEVWERRKTGYKYV 665 + +G ++E+ D E+ E+S E S PE + ++ V + + Sbjct: 494 LTQMVGKPEELEEEGDVEEGYRREISSSPE-----SCPEEIKHEEFD-VNLSSDAAFGIL 547 Query: 666 DNDCKIVIETFSTTSD---EQILHELMPSYIIIYEPNLAFVRKLEVYKAIHRHNPPKVYF 722 I+ + ++LHE+ P Y+++Y+ 
L FVR+LE+Y+A P +VYF Sbjct: 548 KEPLTIIHPLLGCSDPYALTRVLHEVEPRYVVLYDAELTFVRQLEIYRASRPGKPLRVYF 607 Query: 723 MYYGDSVEEQSHLSSIKREKEAFTKLIREHSNMAQHFETDEDLSRYKNLAHRKMQLSRMK 782 + YG S EEQ +L+++++EKEAF KLIRE ++M E + +L Sbjct: 608 LIYGGSTEEQRYLTALRKEKEAFEKLIREKASMVVPEEREGRDETNLDLVRGTASADVST 667 Query: 783 NSRIAGGQDFLNPMTYDVVVVDMREFRAALPGLLYRYGVRVVPCMLTIGDYVITPDICIE 842 ++R AGGQ+ T +VVDMREFR+ LP L++R G+ + P L +GDY++TP++C+E Sbjct: 668 DTRKAGGQE--QNGTQQSIVVDMREFRSELPSLIHRRGIDIEPVTLEVGDYILTPEMCVE 725 Query: 843 RKSIADLIGSFKNGRLDKQIRSLSRFYKYPTLLIEFDDSQSFSLEPFSERNVYASAASST 902 RKSI+DLIGS NGRL Q S+SR+YK P LLIEFD S+ FSL + R SS Sbjct: 726 RKSISDLIGSLNNGRLYSQCISMSRYYKRPVLLIEFDPSKPFSL---TSRGALFQEISS- 781 Query: 903 VHPISGKLMQEEIQRELSHLVMKYPSLKIVWSSSPLQTVNIFLDLKTNREQPDPVKCVQF 962 +I +L+ L + +P L+I+W SP T +F +LK ++ QPD + Sbjct: 782 ----------NDISSKLTLLTLHFPRLRILWCPSPHATAELFEELKQSKPQPDAATALAI 831 Query: 963 GSTKKQTGKNKKDTESNNKFKNLLTIPGLSNVDYYNIKKRYKRYADLLNASVEDLKNIV 1021 + + +++K F LL +PG++ + ++ K A+L S ++L +I+ Sbjct: 832 TADSETLPESEKYNPGPQDF--LLKMPGVNAKNCRSLMHHVKNIAELAALSQDELTSIL 888 >At5g41150 [L] KOG0442 Structure-specific endonuclease ERCC1-XPF catalytic component XPF/ERCC4 Length = 956 Score = 352 bits (903), Expect = 2e-96 Identities = 280/999 (28%), Positives = 469/999 (46%), Gaps = 131/999 (13%) Query: 90 LSLPFQHLILENLLVSENA-LLVIGKGLSVLSIVSNLLYTLSTPTRIDGTDKRSLVLVLN 148 ++L + I+ +LL N LL++ GLS+ ++++LL L +P++ GT L+L+ Sbjct: 1 MALKYHQQIISDLLEDSNGGLLILSSGLSLAKLIASLLI-LHSPSQ--GT---LLLLLSP 54 Query: 149 AXXXXXXXXXXXLMELQWTCNEDDEEGQPGDRPFTVISSDSFTVDQRSKRYQQGGIISVT 208 A + L D P + +QR Y G +T Sbjct: 55 AAQSLKSRIIHYISSL--------------DSPTPTEITADLPANQRYSLYTSGSPFFIT 100 Query: 209 SRILIVDLLSGIVHPKNITGLLILHAEKLDSMSIESFIVEIYRNSNKWGFIKAITDSAES 268 RILIVDLL+ + ++ G+ IL+A + S E+FI+ I ++ N +I+A +D ++ Sbjct: 101 PRILIVDLLTQRIPVSSLAGIFILNAHSISETSTEAFIIRIVKSLNSSAYIRAFSDRPQA 160 Query: 269 MISEFAPLAKKMKDLLLKRILLWPRFHADISSCLNTTSNTKVIEIRVSLTDSMSKIQFGL 328 M+S FA + M+ L L++I LWPRF D+S L +V++IRVS+++ M IQ 
+ Sbjct: 161 MVSGFAKTERTMRALFLRKIHLWPRFQLDVSQELEREP-PEVVDIRVSMSNYMVGIQKAI 219 Query: 329 YECLKKCIDELNRKNPELSTEYWSFENALDSNFLRIINGVLSPKWHRISYESKQLVKDXX 388 E + C+ E+ + N ++ + + E+ L +F I+ L P WH + +KQLV D Sbjct: 220 IEVMDACLKEMKKTN-KVDVDDLTVESGLFKSFDEIVRRQLDPIWHTLGKRTKQLVSDLK 278 Query: 389 XXXXXXXXXXXYDALDFYELIQLILDANKPSVTRKYSESPWLLADESQLVISFAKKRVI- 447 YDA+ F + + + V+ Y S WL A+ S + FAKKRV Sbjct: 279 TLRKLLDYLVRYDAVSFLKFLDTL------RVSESY-RSVWLFAESSYKIFDFAKKRVYR 331 Query: 448 ----------------------DDGNYN----------------------LEPLPKWEQL 463 G + LE PKW+ L Sbjct: 332 LVKASDVKSKEHVKNKSGKKRNSKGETDSVEAVGGETATNVATGVVVEEVLEEAPKWKVL 391 Query: 464 CALMEDIEHESSKYP------AGTQGSVLILCSDDRTSNQLRQVIRNMKSKHNGHKNIML 517 ++E+ + E K + G VL+ C D+R+ QL I N K +M Sbjct: 392 REILEETQEERLKQAFSEEDNSDNNGIVLVACKDERSCMQLEDCITNNPQK------VMR 445 Query: 518 DKLDIYXXXXXXXXXISKDIVEENEKYQNG-----GEMNVSRAFHKQEINTKRRRTRG-- 570 ++ ++Y + ++ +K G G + V+ + + + R+ Sbjct: 446 EEWEMYLLSKIELRSMQTP-QKKKQKTPKGFGILDGVVPVTTIQNSEGSSVGRQEHEALM 504 Query: 571 --ASFVAAVDRLKNAQAGQGQDIDAMITSDSIKEERDRSLVSMETLGDEAAVKEDFDSED 628 AS + + + + +G + + K + + S+ + K+ +S+ Sbjct: 505 AAASSIRKLGKTTDMASGNNNPEPHVDKASCTKGKAKKDPTSLRR-SLRSCNKKTTNSKP 563 Query: 629 ELSIIEEKLQDGSIPEYAMVQNWEEVWERRKTGYKYVDNDCKIVIETFSTTSDEQILHEL 688 E+ E + + + Q V R +G K + + ++ SD+ IL L Sbjct: 564 EILPGPENEEKANEASTSAPQEANAV---RPSGAKKLPP-----VHFYALESDQPILDIL 615 Query: 689 MPSYIIIYEPNLAFVRKLEVYKAIHRHNPPKVYFMYYGDSVEEQSHLSSIKREKEAFTKL 748 PS II+Y P++ FVR+LEVYKA + KVYF++Y +S E Q +SI+RE EAF L Sbjct: 616 KPSVIIVYHPDMGFVRELEVYKAENPLRKLKVYFIFYDESTEVQKFEASIRRENEAFESL 675 Query: 749 IREHSNMAQHFETDEDLSRYKNLAHRKMQLSRMKNS--RIAGGQDFLNPMTYDVVVVDMR 806 IR+ S+M D+D + + + S +NS R AGG+ L T V+VDMR Sbjct: 676 IRQKSSMI--IPVDQDGLCMGSNSSTEFPASSTQNSLTRKAGGRKELEKETQ--VIVDMR 731 Query: 807 EFRAALPGLLYRYGVRVVPCMLTIGDYVITPDICIERKSIADLIGSFKNGRLDKQIRSLS 866 EF ++LP +L++ G++++P L +GDY+++P IC+ERKSI DL SF +GRL Q+ +S Sbjct: 732 EFMSSLPNVLHQKGMKIIPVTLEVGDYILSPSICVERKSIQDLFQSFTSGRLFHQVEMMS 791 
Query: 867 RFYKYPTLLIEFDDSQSFSLEPFSERNVYASAASSTVHPISGKLMQEEIQRELSHLVMKY 926 R+Y+ P LLIEF +SFS + S+ IS + I +LS LV+ + Sbjct: 792 RYYRIPVLLIEFSQDKSFSFQSSSD--------------ISDDVTPYNIISKLSLLVLHF 837 Query: 927 PSLKIVWSSSPLQTVNIFLDLKTNREQPDPVKCVQFGSTKKQTGKNKKDTESNN----KF 982 P L+++WS S T IF LK+N+++PD + ++ G + G + D + N Sbjct: 838 PRLRLLWSRSLHATAEIFTTLKSNQDEPDETRAIRVG-VPSEEGIIENDIRAENYNTSAV 896 Query: 983 KNLLTIPGLSNVDYYNIKKRYKRYADLLNASVEDLKNIV 1021 + L +PG+S+ +Y +I ++ K A+L + VE L ++ Sbjct: 897 EFLRRLPGVSDANYRSIMEKCKSLAELASLPVETLAELM 935 >7290484 [L] KOG0442 Structure-specific endonuclease ERCC1-XPF catalytic component XPF/ERCC4 Length = 961 Score = 332 bits (850), Expect = 2e-90 Identities = 252/934 (26%), Positives = 441/934 (46%), Gaps = 101/934 (10%) Query: 103 LVSENALLVIGKGLSVLSIVSNLLYTLSTPTRIDGTDKRSLVLVLNAXXXXXXXXXXXLM 162 LV + LLV KGLS +V ++L S D +LVLV+N+ Sbjct: 57 LVEADGLLVCAKGLSYDRVVISILKAYS--------DSGNLVLVINSS------------ 96 Query: 163 ELQWTCNEDDEEGQPGDRPFTVISSDSFTVDQRSKRYQQGGIISVTSRILIVDLLSGIVH 222 W + +P + + T +R + Y +GG+ +++RIL+VDLL + Sbjct: 97 --DWEEQYYKSKIEP-----KYVHEVASTATERERVYLEGGLQFISTRILVVDLLKQRIP 149 Query: 223 PKNITGLLILHAEKLDSMSIESFIVEIYRNSNKWGFIKAITDSAESMISEFAPLAKKMKD 282 + I+G+++L A + E+F + ++R NK GF+KA + S E+ ++ + + M++ Sbjct: 150 IELISGIIVLRAHTIIESCQEAFALRLFRQKNKTGFVKAFSSSPEAFTIGYSHVERTMRN 209 Query: 283 LLLKRILLWPRFHADISSCLNTTSNTKVIEIRVSLTDSMSKIQFGLYECLKKCIDELNRK 342 L +K + +WPRFH + + L + IE+ V ++ +++ IQ + E + + E+ R Sbjct: 210 LFVKHLYIWPRFHESVRTVLQPWK-IQSIEMHVPISQNITSIQSHILEIMNFLVQEIKRI 268 Query: 343 NPELSTEYWSFENALDSNFLRIINGVLSPKWHRISYESKQLVKDXXXXXXXXXXXXXYDA 402 N + E + EN + +F +I+ L WH+++ ++K +V D +DA Sbjct: 269 NRTVDMEAVTVENCVTKSFHKILQAQLDCIWHQLNSQTKLIVADLKILRSLMISTMYHDA 328 Query: 403 LDFYELIQLILDANKPSVTRKYSESPWLLADESQLVISFAKKRVID-DGNYNLEPLPKWE 461 + Y ++ S S S W L D ++ + +++RV + + EP PKW+ Sbjct: 329 VSAYAFMK-----RYRSTEYALSNSGWTLLDAAEQIFKLSRQRVFNGQQEFEPEPCPKWQ 383 Query: 462 QLCALM-EDIEHESSKYPAGTQG-SVLILCSDDRTSNQLRQVIRNMKSKHNGHKNIMLDK 519 
L L+ ++I + + Q VLILC D RT +QL+Q + G + ++ Sbjct: 384 TLTDLLTKEIPGDMRRSRRSEQQPKVLILCQDARTCHQLKQYLTQ-----GGPRFLLQQA 438 Query: 520 LDIYXXXXXXXXXISKDIVEENEKYQNGGEMNVSRAFHKQEINTKRRRTRGASFVAAVDR 579 L +K+ + +N ++ ++ ++E++ + G +A + Sbjct: 439 LQHEVPVGKLSDNYAKESQTRSAPPKN---VSSNKELRREEVSGSQPPLAGMDELA---Q 492 Query: 580 LKNAQAGQGQDIDAMITSDSIKEERDRSLVSMETLGDEAAVKEDFDSEDELSIIEEKLQD 639 L + +GQ + + +++M + D + ++SI E Sbjct: 493 LLSESETEGQHFE------------ESYMLTMTQPVEVGPAAIDIKPDPDVSIFE----- 535 Query: 640 GSIPEYAMVQNWEEVWERRKTGYKYVDNDCKIVIETFSTTSD-----EQILHELMPSYII 694 +IPE V + I ++TF T + E +L +L P Y++ Sbjct: 536 -TIPELEQFDV--------TAALASVPHQPYICLQTFKTEREGSMALEHMLEQLQPHYVV 586 Query: 695 IYEPNLAFVRKLEVYKAIHRHNPP---KVYFMYYGDSVEEQSHLSSIKREKEAFTKLIRE 751 +Y N+ +R+LEV++A R P KVYF+ + +VEEQ++L+S++REK AF +I Sbjct: 587 MYNMNVTAIRQLEVFEARRRLPPADRMKVYFLIHARTVEEQAYLTSLRREKAAFEFIIDT 646 Query: 752 HSNMA----QHFETDEDLSRYKNLAHRKMQLSRMKNSRIAGGQDFLNPMTYDVVVVDMRE 807 S M Q +TDE K + SR AGGQ V+VDMRE Sbjct: 647 KSKMVIPKYQDGKTDEAFLLLKT--YDDEPTDENAKSRQAGGQAPQATKETPKVIVDMRE 704 Query: 808 FRAALPGLLYRYGVRVVPCMLTIGDYVITPDICIERKSIADLIGSFKNGRLDKQIRSLSR 867 FR+ LP L+++ G+ V+P +TIGDY++TPDIC+ERKSI+DLIGS +GRL Q + R Sbjct: 705 FRSDLPCLIHKRGLEVLPLTITIGDYILTPDICVERKSISDLIGSLNSGRLYNQCVQMQR 764 Query: 868 FYKYPTLLIEFDDSQSFSLEPFSERNVYASAASSTVHPISGKLMQEEIQRELSHLVMKYP 927 Y P LLIEFD ++ F L+ + S A++ +I ++L L + +P Sbjct: 765 HYAKPILLIEFDQNKPFHLQGKFMLSQQTSMANA------------DIVQKLQLLTLHFP 812 Query: 928 SLKIVWSSSPLQTVNIFLDLKTNREQPDPVKCVQFGSTKKQTGKNKKDTESNNKFKNLLT 987 L+++WS SP T +F +LK + +PDP GS + G+ F LL Sbjct: 813 KLRLIWSPSPYATAQLFEELKLGKPEPDPQTAAALGSDEPTAGEQLHFNSGIYDF--LLR 870 Query: 988 IPGLSNVDYYNIKKRYKRYADLLNASVEDLKNIV 1021 +PG+ + + + ++ LL S ++L+ ++ Sbjct: 871 LPGVHTRNIHGLLRKGGSLRQLLLRSQKELEELL 904 >CE24855 [L] KOG0442 Structure-specific endonuclease ERCC1-XPF catalytic component XPF/ERCC4 Length = 935 Score = 189 bits (480), Expect = 2e-47 Identities = 232/975 (23%), Positives = 401/975 (40%), Gaps = 153/975 (15%) 
Query: 102 LLVSENALLVIGKGLSVLSIVSNLL--YTLSTPTRIDGTDKRSLVLVLNAXXXXXXXXXX 159 LL E A L SVL +V+N L L I +D+R L LVLN Sbjct: 24 LLEYERATLAKTLPASVLFVVANGLGLERLFLEHLILFSDRRLLALVLNTNEHDESYFVS 83 Query: 160 XLMELQWTCNEDDEEGQPGDRPFTVISSDSFTVDQRSKRYQQGGIISVTSRILIVDLLSG 219 L E C+ VI+S+ ++ R Y +GG+ +SR+L+VDLL Sbjct: 84 KLKEHNVECDPK------------VINSE-VSIKDRQSIYLEGGVQFCSSRVLLVDLLQN 130 Query: 220 IVHPKNITGLLILHAEKLDSMSIESFIVEIYRNSNKWGFIKAITDSAESMISEFAPLAKK 279 + I + + A + + +SFI+ +YR G +KA TD S+ S L + Sbjct: 131 RIPTDRIAAIFVYRAHQTLNAFQDSFILRLYREKKPDGTVKAFTDFPNSL-SSLGQLQRL 189 Query: 280 MKDLLLKRILLWPRFHADISSCLNTTSNTKVIEIRVSLTDSMSKIQFGLYECLKKCIDEL 339 + L ++ + L PRF + I S LN K V + + ++ + E +K C+ +L Sbjct: 190 VDRLYIRHVELMPRFSSIIESELNRYQ-LKTAIFSVDVPTPLRRVHRTIIEFIKVCVRDL 248 Query: 340 ----------NRKNPELSTEYWSFENALDSNFLRIINGVLSPKWHRISYESKQLVKDXXX 389 + +N E+ W+ + L + IS + ++L+ D Sbjct: 249 RTCSTSGKQTDEQNEEMIHVPWAATR---------LEKRLHDRRGHISEKQQRLLNDVAS 299 Query: 390 XXXXXXXXXXYDALDFYELIQLILDANKPSVTRKYSESPWLLADE-SQLVISFAKKRVID 448 D +Q++ K T S WLL+ ++++ + Sbjct: 300 LREILQLSENMDVATVLSRLQVL----KNDRTVLEEHSGWLLSPSFNRIMEDLLTIAGVT 355 Query: 449 DGNYNLEPLP---KWEQLCALMEDIEH--ESSKYPAGTQGSVLILCSDDRTSNQLRQVIR 503 +G + + KW L ++ +I+ K SVL++ S + S Q+ V+R Sbjct: 356 NGKADYKKFATPAKWTVLSEILREIKMLPVEKKDRGNDSPSVLVITSSEDLSRQVTDVVR 415 Query: 504 N-------MKSKHNGHKNIMLDKLDIYXXXXXXXXXISKDIVEENEKYQNGGEMNVSRAF 556 M + G+K+ D D + + + GE Sbjct: 416 YGINKMKWMTWRQLGYKSTQEMPED--------EPLWDPDTISQLMRSSVDGES------ 461 Query: 557 HKQEINTKRRRTRGASFVAAVDRLKNAQAGQGQDIDAMITSDSIK-----EERDRSLVSM 611 K E+ ++T+ + AA R K+A+ G D + ++ I+ +R +S Sbjct: 462 -KSEVIANVQKTQKTTARAAQKRRKHAEELSGFSSDHRVQTNLIQFGILQYKRRKS---- 516 Query: 612 ETLGDEAAVKEDFDSEDELSIIEEKLQDGSIPEYAMVQNWEEVWERRKTGYKYVDNDCKI 671 G+EA+ ++ E+ + + EE+ E K D + ++ Sbjct: 517 ---GNEASTSQE------------------TTEWEVKEEMEEIEEITKN---IGDLEAEL 552 Query: 672 VIETFSTTSDEQ------ILHELMPSYIIIYEPNLAFVRKLEVYKAIHRHNPPKVYFMYY 725 V+ STT + + +L P I++Y +L +R++E+Y++ + + VY++ Y Sbjct: 553 
VV---STTRERERYTLLKLLETKKPRAIVLYTMSLQTLRQIEIYRSTNPNRSLHVYWLQY 609 Query: 726 GDSVEEQSHLSSIKREKEAFTKLIREHSNM--AQHFETD-EDLSRYKNLAHRKMQLSRMK 782 +S EE +L SI RE +F LIRE + ++ F D ED R K ++ R +R Sbjct: 610 TESTEESRYLESINRETMSFELLIREQGTLLISREFNVDREDAPRLK-ISTRDGGGARRD 668 Query: 783 NSRIAGGQDFLNP---MTYDVVVVDMREFRAALPGLLYRYGVRVVPCMLTIGDYVITPDI 839 + +D ++P + ++VDMREF + LP +LY G VV + IGDY+++P+I Sbjct: 669 GA--VDPRDQMDPEEELERPKIIVDMREFNSELPTVLYTKGYNVVATTIEIGDYILSPNI 726 Query: 840 CIERKSIADLIGSFKNGRLDKQIRSLSRFYKYPTLLIEFDDSQSFSLEPFSERNVYASAA 899 IERK++ DL S ++GR+ KQI + Y LLIE S F + V Sbjct: 727 AIERKALDDLTQSLQSGRVFKQIEQMLEHYDCTVLLIE-------SNRKFETKIVNGG-- 777 Query: 900 SSTVHPISGKLMQ--EEIQRELSHLVMKYPSLKIVWSSSPLQTVNIFLDLKTNREQPDPV 957 P G+L + EI+ L+ P ++ VW+ SP + F +LK + +PD Sbjct: 778 -----PFQGELSRHCREIRSIFCSLIWANPKMRCVWTISPTNSAEFFSELKLSAPEPDVD 832 Query: 958 KCVQF----------------GSTKKQTGKNKKDTESNNKFKNLLTIPGLSNVDYYNI-- 999 + + ST + K KK + + L I G+ + +N+ Sbjct: 833 RAISLKADQVECSSQELTDSEASTSTKAKKGKKWKPNPTVIRTLTQIFGIKASEAHNLLA 892 Query: 1000 KKRYKRYADLLNASV 1014 K ADL + ++ Sbjct: 893 NSSIKTLADLFSLNI 907 >ECU08g0760 [L] KOG0442 Structure-specific endonuclease ERCC1-XPF catalytic component XPF/ERCC4 Length = 768 Score = 121 bits (303), Expect = 7e-27 Identities = 96/329 (29%), Positives = 150/329 (45%), Gaps = 61/329 (18%) Query: 693 IIIYEPNLAFVRKLEVYKAIHRHNPPKVYFMYYGDSVEEQSHLSSIKREKEAFTKLIREH 752 +I E VRK+E Y H KV+F+ + S+EEQ +L+ I+REK +F KLI E Sbjct: 463 VIFVESGQDSVRKIERYGVAHS---VKVFFLMHTGSLEEQRYLNEIRREKASFEKLIEER 519 Query: 753 SNMAQHFETDE---DLSRYKNLAHRKMQLSRMKNSRIAGGQDFLNPMTYDVVVVDMREFR 809 S + + + DL Y+ A R+ VVVVD RE R Sbjct: 520 SRLPLRLDDVDDAIDLEEYEP-AEREY-----------------------VVVVDSRELR 555 Query: 810 AALPGLLYRYGVRVVPCMLTIGDYVITPDICIERKSIADLIGSFKNGRLDKQIRSLSRFY 869 A LP L+R R+ L +GDY+++P CIERKSI DL+ S +GRL Q L Y Sbjct: 556 AELPFFLFRARNRICISTLPVGDYLVSPTTCIERKSIPDLVSSLNSGRLYLQASMLCHRY 615 Query: 870 KYPTLLIEFDDSQSFSLEPFSERNVYASAASSTVHPISGKLMQEEIQRELSHLVMKYPSL 929 P LL+EFD S + + 
+ +LS L+ +L Sbjct: 616 ----------------GRPCLSDYYRYDQDTFKNSLVAKLSLLLFNLGAL 659 Query: 930 KIVWSSSPLQTVNIFLDLKTNREQPDPVKCVQFGSTKKQTGKNKKDTESNNKFKNLLTIP 989 +++WS S L + I DL+ + V+ +K D + + LL+IP Sbjct: 660 RLIWSESRLFSTKIIRDLQRKEDVSSAVE------------GHKMDPVLH---EILLSIP 704 Query: 990 GLSNVDYYNIKKRYKRYADLLNASVEDLK 1018 G++ + +++ ++ DL+ +++E L+ Sbjct: 705 GITQFNISRVRRYFRSLKDLVFSTMERLE 733

 Score = 84.3 bits (207), Expect = 9e-16
 Identities = 39/139 (28%), Positives = 82/139 (58%), Gaps = 2/139 (1%)

Query: 194 QRSKRYQQGGIISVTSRILIVDLLSGIVHPKNITGLLILHAEKLDSMSIESFIVEIYRNS 253 +R ++Y GG+ ++R+ + D++ G + + I +L+ + E + S ESFI+ ++R+ Sbjct: 112 KRREKYLCGGVCIASNRVFLADMIDGTIDAEKIDCILVNNVETITETSSESFIIHVFRSR 171 Query: 254 NKWGFIKAITDSAESMISEFAPLAKKMKDLLLKRILLWPRFHADISSCLNTTSNTKVIEI 313 N+ G I+ ++S + F+PL +KM+ L + + + +PRFH+ + LN + V+EI Sbjct: 172 NRTGLIRGFSESPVPLSLGFSPLDRKMRSLKVSKAVFFPRFHSLVEESLN--GDMDVVEI 229 Query: 314 RVSLTDSMSKIQFGLYECL 332 + +++ S++Q L E + Sbjct: 230 KFRMSERKSQLQVVLLEII 248

  Database: KOG eukaryal database 04/03
    Posted date:  Apr 14, 2003  1:07 PM
  Number of letters in database: 30,389,216
  Number of sequences in database:  60,738

Lambda     K      H
   0.316    0.134    0.375

Gapped
Lambda     K      H
   0.267   0.0410    0.140

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 59,978,052
Number of Sequences: 60738
Number of extensions: 2591187
Number of successful extensions: 8683
Number of sequences better than 1.0e-05: 7
Number of HSP's better than 0.0 without gapping: 7
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 8629
Number of HSP's gapped (non-prelim): 17
length of query: 1056
length of database: 30,389,216
effective HSP length: 117
effective length of query: 939
effective length of database: 23,282,870
effective search space: 21862614930
effective search space used: 21862614930
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
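
Note on the statistics above: the report does not state how raw scores (the numbers in parentheses, e.g. 2620 or 303) relate to bit scores and Expect values. A minimal sketch, assuming the standard Karlin-Altschul relations (bit score = (lambda*S - ln K) / ln 2, E = search space * 2^(-bit score)) and using the rounded gapped Lambda, K, and effective search space printed in this report. The function names are hypothetical and are not part of BLAST.

```python
import math

# Values copied from the report footer (gapped statistics, as printed/rounded).
GAPPED_LAMBDA = 0.267
GAPPED_K = 0.0410
EFFECTIVE_SEARCH_SPACE = 21862614930  # 939 * 23,282,870


def bit_score(raw_score, lam=GAPPED_LAMBDA, k=GAPPED_K):
    """Convert a raw alignment score S to bits: (lambda*S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def expect_value(raw_score, search_space=EFFECTIVE_SEARCH_SPACE):
    """Expected number of chance hits: search_space * 2 ** (-bit score)."""
    return search_space * 2.0 ** (-bit_score(raw_score))


if __name__ == "__main__":
    # Raw scores taken from the ECU08g0760 HSPs in this report.
    for raw in (303, 207):
        print(raw, round(bit_score(raw), 1), f"{expect_value(raw):.1e}")
```

For the two ECU08g0760 HSPs (raw scores 303 and 207) this gives roughly 121 and 84 bits, with E-values near 7e-27 and 9e-16, in line with the values printed above. Small differences are expected because the footer shows Lambda and K rounded to a few digits.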