>AT3G21350.1 |  RNA polymerase transcriptional regulation mediator-related 
MDSSLLSAATADTFNGNAADQIPPPLQPPGTDMTGISFRDQLWINSYPLDRNYIFDYFAL 
SPFYDTTCNNEILRRRSIHPLDLSHLSKMTGLEYMLTDATEPNLFVFRKQKRDGPEKVTP 
MLTYYILDGSIYQAPQLCSVFAARVSRTIYNISKAFTDAASKLETIRQVDTENQNEPAES 
KPASETVDLKEMKRVDVILTSLYRKLAPPPPPPPFPEGYVSQEALGEKEELGTQGGESQP 
PQVDPIIDQGPAKRMKF*
>AT3G21350.2 |  RNA polymerase transcriptional regulation mediator-related 
MDSSLLSAATADTFNGNAADQIPPPLQPPGTDMTGISFRDQLWINSYPLDRNYIFDYFAL 
SPFYDTTCNNEILRRRSIHPLDLSHLSKMTGLEYMLTDATEPNLFVFRKQKRDGPEKVTP 
MLTYYILDGSIYQAPQLCSVFAARVSRTIYNISKAFTDAASKLETIRQDLQVCLVAIVLS 
LSVNLGSYFLLIFKLMANGEQVYGFNKFLFDTENQNEPAESKPASETVDLKEMKRVDVIL 
TSLYRKLAPPPPPPPFPEGYVSQEALGEKEELGTQGGESQPPQVDPIIDQGPAKRMKF*
>AT5G28540.1 |  BIP1 ATP binding 
MARSFGANSTVVLAIIFFGCLFALSSAIEEATKLGSVIGIDLGTTYSCVGVYKNGHVEII 
ANDQGNRITPSWVGFTDSERLIGEAAKNQAAVNPERTVFDVKRLIGRKFEDKEVQKDRKL 
VPYQIVNKDGKPYIQVKIKDGETKVFSPEEISAMILTKMKETAEAYLGKKIKDAVVTVPA 
YFNDAQRQATKDAGVIAGLNVARIINEPTAAAIAYGLDKKGGEKNILVFDLGGGTFDVSV 
LTIDNGVFEVLSTNGDTHLGGEDFDHRVMEYFIKLIKKKHQKDISKDNKALGKLRRECER 
AKRALSSQHQVRVEIESLFDGVDFSEPLTRARFEELNNDLFRKTMGPVKKAMDDAGLQKS 
QIDEIVLVGGSTRIPKVQQLLKDFFEGKEPNKGVNPDEAVAYGAAVQGGILSGEGGDETK 
DILLLDVAPLTLGIETVGGVMTKLIPRNTVIPTKKSQVFTTYQDQQTTVSIQVFEGERSL 
TKDCRLLGKFDLNGIPPAPRGTPQIEVTFEVDANGILNVKAEDKASGKSEKITITNEKGR 
LSQEEIDRMVKEAEEFAEEDKKVKEKIDARNALETYVYNMKNQVNDKDKLADKLEGDEKE 
KIEAATKEALEWLDENQNSEKEEYDEKLKEVEAVCNPIITAVYQRSGGAPGGAGGESSTE 
EEDESHDEL*
>AT3G10690.1 |  DNA gyrase subunit A family protein 
MTPVLCHSTASIPNPNSLMSLSSTLRLSSSLLRRSFFRFPLTDPLCRLRRTEPSATRFFS 
SRTPRSGKFVVGAGKRGDEQVKEESGANNGGLVVSGDESRIVPFELHKEATESYMSYALS 
VLLGRALPDVRDGLKPVHRRILFAMHELGMSSKKPYKKCARVVGEVLGKFHPHGDTAVYD 
SLVRMAQSFSLRCPLIQGHGNFGSIDADPPAAMRYTECRLDPLAEAVLLSDLDQDTVDFV 
ANFDNSQKEPAVLPARLPALLLNGASGIAVGMATNIPPHNLGELVDVLCALIHNPEATLQ 
ELLEYMPAPDFPTGGIIMGNLGVLDAYRTGRGRVVVRGKAEVELLDPKTKRNAVIITEIP 
YQTNKATLVQKIAELVENKTLEGISDIRDESDRNGMRVVIELKRGGDPALVLNNLYRHTA 
LQSSFSCNMVGICDGEPKLMGLKELLQAFIDFRCSVVERRARFKLSHAQQRKHIIEGIVV 
GLDNVDEVIELITKASSHSSATAALQSEYGLSEKQAEAILEITLRRLTALERKKFTDESS 
SLTEQITKLEQLLSTRTNILKLIEQEAIELKDRFSSPRRSMLEDSDSGDLEDIDVIPNEE 
MLMAVSEKGYVKRMKADTFNLQHRGTIGKSVGKLRVDDAMSDFLVCHAHDHVLFFSDRGI 
VYSTRAYKIPECSRNAAGTPLVQILSMSEGERVTSIVPVSEFAEDRYLLMLTVNGCIKKV 
SLKLFSGIRSTGIIAIQLNSGDELKWVRCCSSDDLVAMASQNGMVALSTCDGVRTLSRNT 
KGVTAMRLKNEDKIASMDIIPASLRKDMEEKSEDASLVKQSTGPWLLFVCENGYGKRVPL 
SSFRRSRLNRVGLSGYKFAEDDRLAAVFVVGYSLAEDGESDEQVVLVSQSGTVNRIKVRD 
ISIQSRRARGVILMRLDHAGKIQSASLISAADEEETEGTLSNEAVEAVSL*
>AT4G25630.1 |  FIB2 (FIBRILLARIN 2) snoRNA binding 
MRPPLTGSGGGFSGGRGRGGYSGGRGDGGFSGGRGGGGRGGGRGFSDRGGRGRGRGPPRG 
GARGGRGPAGRGGMKGGSKVIVEPHRHAGVFIAKGKEDALVTKNLVPGEAVYNEKRISVQ 
NEDGTKTEYRVWNPFRSKLAAAILGGVDNIWIKPGAKVLYLGAASGTTVSHVSDLVGPEG 
CVYAVEFSHRSGRDLVNMAKKRTNVIPIIEDARHPAKYRMLVGMVDVIFSDVAQPDQARI 
LALNASYFLKSGGHFVISIKANCIDSTVPAEAVFQTEVKKLQQEQFKPAEQVTLEPFERD 
HACVVGGYRMPKKPKAATAA*
>AT2G38250.1 |  DNA-binding protein-related 
MDGHQHHHLHQLQYLNKHHLHTQSQTPEIASPVAVGDRFPQWSVEETKELIGIRGELDQT 
FMETKRNKLLWEVISNKMRDKSFPRSPEQCKCKWKNLVTRFKGCETMEAETARQQFPFYD 
DMQNIFTTRMQRMLWAESEGGGGGTSGAARKREYSSDEEEENVNEELVDVSNDPKILNPK 
KNIAKKRKGGSNSSNSNNGVREVLEEFMRHQVRMESEWREGWEAREKERAEKEEEWRRKM 
EELEKERLAMERMWRDREEQRRSREEMRAEKRDSLINALLAKLTRDGSL*
>AT3G57660.1 |  NRPA1 DNA binding / DNA-directed RNA polymerase/ zinc ion binding 
MAHAQTTEVCLSFHRSLLFPMGASQVVESVRFSFMTEQDVRKHSFLKVTSPILHDNVGNP 
FPGGLYDLKLGPKDDKQACNSCGQLKLACPGHCGHIELVFPIYHPLLFNLLFNFLQRACF 
FCHHFMAKPEDVERAVSQLKLIIKGDIVSAKQLESNTPTKSKSSDESCESVVTTDSSEEC 
EDSDVEDQRWTSLQFAEVTAVLKNFMRLSSKSCSRCKGINPKLEKPMFGWVRMRAMKDSD 
VGANVIRGLKLKKSTSSVENPDGFDDSGIDALSEVEDGDKETREKSTEVAAEFEEHNSKR 
DLLPSEVRNILKHLWQNEHEFCSFIGDLWQSGSEKIDYSMFFLESVLVPPTKFRPPTTGG 
DSVMEHPQTVGLNKVIESNNILGNACTNKLDQSKVIFRWRNLQESVNVLFDSKTATVQSQ 
RDSSGICQLLEKKEGLFRQKMMGKRVNHACRSVISPDPYIAVNDIGIPPCFALKLTYPER 
VTPWNVEKLREAIINGPDIHPGATHYSDKSSTMKLPSTEKARRAIARKLLSSRGATTELG 
KTCDINFEGKTVHRHMRDGDIVLVNRQPTLHKPSLMAHKVRVLKGEKTLRLHYANCSTYN 
ADFDGDEMNVHFPQDEISRAEAYNIVNANNQYARPSNGEPLRALIQDHIVSSVLLTKRDT 
FLDKDHFNQLLFSSGVTDMVLSTFSGRSGKKVMVSASDAELLTVTPAILKPVPLWTGKQV 
ITAVLNQITKGHPPFTVEKATKLPVDFFKCRSREVKPNSGDLTKKKEIDESWKQNLNEDK 
LHIRKNEFVCGVIDKAQFADYGLVHTVHELYGSNAAGNLLSVFSRLFTVFLQTHGFTCGV 
DDLIILKDMDEERTKQLQECENVGERVLRKTFGIDVDVQIDPQDMRSRIERILYEDGESA 
LASLDRSIVNYLNQCSSKGVMNDLLSDGLLKTPGRNCISLMTISGAKGSKVNFQQISSHL 
GQQDLEGKRVPRMVSGKTLPCFHPWDWSPRAGGFISDRFLSGLRPQEYYFHCMAGREGLV 
DTAVKTSRSGYLQRCLMKNLESLKVNYDCTVRDADGSIIQFQYGEDGVDVHRSSFIEKFK 
ELTINQDMVLQKCSEDMLSGASSYISDLPISLKKGAEKFVEAMPMNERIASKFVRQEELL 
KLVKSKFFASLAQPGEPVGVLAAQSVGEPSTQMTLNTFHLAGRGEMNVTLGIPRLQEILM 
TAAANIKTPIMTCPLLKGKTKEDANDITDRLRKITVADIIKSMELSVVPYTVYENEVCSI 
HKLKINLYKPEHYPKHTDITEEDWEETMRAVFLRKLEDAIETHMKMLHRIRGIHNDVTGP 
IAGNETDNDDSVSGKQNEDDGDDDGEGTEVDDLGSDAQKQKKQETDEMDYEENSEDETNE 
PSSISGVEDPEMDSENEDTEVSKEDTPEPQEESMEPQKEVKGVKNVKEQSKKKRRKFVRA 
KSDRHIFVKGEGEKFEVHFKFATDDPHILLAQIAQQTAQKVYIQNSGKIERCTVANCGDP 
QVIYHGDNPKERREISNDEKKASPALHASGVDFPALWEFQDKLDVRYLYSNSIHDMLNIF 
GVEAARETIIREINHVFKSYGISVSIRHLNLIADYMTFSGGYRPMSRMGGIAESTSPFCR 
MTFETATKFIVQAATYGEKDTLETPSARICLGLPALSGTGCFDLMQRVEL*
>AT5G64680.1 |  unknown protein 
MDFDFKVSGDFIVSGAEQLDDTDLTRSDEFWLIQAPLGQFPEIEENTLKIEPDKDGLFGE 
FKDSNGAKYDLASFHSQDAGAELIIPSEESMIVGKITRRVALVRYPEPNELLQKMKARTQ 
QKLVGSVTNSSKKSSNLTQSSRHKSGTRSSREKSMFSGFTETPKSPKRKNSESSSGKHRS 
STSTVSGSSERSAKSKKKVKKEE*
>AT5G64680.2 |  unknown protein 
MDFDFKVSGDFIVSGAEQLDDTDLTRSDEFWLIQAPLGQFPEIEENTLKIEPDKDGLFGE 
FKDSNGAKYDLASFHSQDAGAELIIPSEESMIVGKITRRVALVRYPEPNELLQKMKARTQ 
QKLVGSVTNSSKKSSNLTQSSRHKSGTRSSREKSMFSGFTETPKSPKRKNSESSSGKHRS 
STSTVSGSSERSAKSKKKVKKEE*
>AT5G64680.3 |  unknown protein 
MDFDFKVSGDFIVSGAEQLDDTDLTRSDEFWLIQAPLGQFPEIEENTLKIEPDKDGLFGE 
FKDSNAGSLHLCLGAKYDLASFHSQDAGAELIIPSEESMIVGKITRRVALVRYPEPNELL 
QKMKARTQQKLVGSVTNSSKKSSNLTQSSRHKSGTRSSREKSMFSGFTETPKSPKRKNSE 
SSSGKHRSSTSTVSGSSERSAKSKKKVKKEE*
>AT3G25940.1 |  transcription factor S-II (TFIIS) domain-containing protein 
MEKSRESEFLFCNLCGTMLVLKSTKYAECPHCKTTRNAKDIIDKEIAYTVSAEDIRRELG 
ISLFGEKTQAEAELPKIKKACEKCQHPELVYTTRQTRSADEGQTTYYTCPNCAHRFTEG*
>AT1G25540.1 |  PFT1 (PHYTOCHROME AND FLOWERING TIME 1) transcription coactivator 
MSSEVKQLIVVAEGTAALGPYWQTIVSDYLEKIIRSFCGSELNGERNPVSTVELSLVIFN 
SHGSYCACLVQRSGWTRDVDIFLHWLSSIQFGGGGFNEVATAEGLAEALMMFSPPSGQAQ 
PSNDLKRHCILITASNPHILPTPVYRPRLQNVERNENGDAQAESRLSDAETVASYFAKCS 
VSLSVVCPKQLPTIRALYNAGKPNQQSADLSIDTAKNTFYLVLISENFVEACAALSHSAT 
NLPQTQSPVKVDRATVAPSIPVTGQPPAPVSSANGPIQNRQPVSVGPVPTATVKVEPSTV 
TSMAPVPSFPHIPAVARPATQAIPSIQTSSASPVSQDMVSNAENAPDIKPVVVSGMTPPL 
RTGPPGGANVNLLNNLSQVRQVMSSAALAGAASSVGQSAVAMHMSNMISTGMATSLPPSQ 
TVFSTGQQGITSMAGSGALMGSAQTGQSPGPNNAFSPQTTSNVASNLGVSQPMQGMNQGS 
HSGAMMQGGISMNQNMMSGLGQGNVSSGTGGMMPTPGVGQQAQSGIQQLGGSNSSAPNMQ 
LSQPSSGAMQTSQSKYVKVWEGNLSGQRQGQPVLITRLEGYRSASASDSLAANWPPTMQI 
VRLISQDHMNNKQYVGKADFLVFRAMSQHGFLGQLQDKKLCAVIQLPSQTLLLSVSDKAC 
RLIGMLFPGDMVVFKPQIPNQQQQQQQQLHQQQQQQQQIQQQQQQQQHLQQQQMPQLQQQ 
QQQHQQQQQQQHQLSQLQHHQQQQQQQQQQQQQHQLTQLQHHHQQQQQASPLNQMQQQTS 
PLNQMQQQTSPLNQMQQQQQPQQMVMGGQAFAQAPGRSQQGGGGGQPNMPGAGFMG*
>AT1G25540.2 |  PFT1 (PHYTOCHROME AND FLOWERING TIME 1) transcription coactivator 
MMFSPPSGQAQPSNDLKRHCILITASNPHILPTPVYRPRLQNVERNENGDAQAESRLSDA 
ETVASYFAKCSVSLSVVCPKQLPTIRALYNAGKPNQQSADLSIDTAKNTFYLVLISENFV 
EACAALSHSATNLPQTQSPVKVDRATVAPSIPVTGQPPAPVSSANGPIQNRQPVSVGPVP 
TATVKVEPSTVTSMAPVPSFPHIPAVARPATQAIPSIQTSSASPVSQDMVSNAENAPDIK 
PVVVSGMTPPLRTGPPGGANVNLLNNLSQVRQVMSSAALAGAASSVGQSAVAMHMSNMIS 
TGMATSLPPSQTVFSTGQQGITSMAGSGALMGSAQTGQSPGPNNAFSPQTTSNVASNLGV 
SQPMQGMNQGSHSGAMMQGGISMNQNMMSGLGQGNVSSGTGGMMPTPGVGQQAQSGIQQL 
GGSNSSAPNMQLSQPSSGAMQTSQSKYVKVWEGNLSGQRQGQPVLITRLEGYRSASASDS 
LAANWPPTMQIVRLISQDHMNNKQYVGKADFLVFRAMSQHGFLGQLQDKKLCAVIQLPSQ 
TLLLSVSDKACRLIGMLFPGDMVVFKPQIPNQQQQQQQQLHQQQQQQQQIQQQQQQQQHL 
QQQQMPQLQQQQQQHQQQQQQQHQLSQLQHHQQQQQQQQQQQQQHQLTQLQHHHQQQQQA 
SPLNQMQQQTSPLNQMQQQTSPLNQMQQQQQPQQMVMGGQAFAQAPGRSQQGGGGGQPNM 
PGAGFMG*
>AT4G25210.1 |  transcription regulator 
MAPKKAEEVVESPPVSSEEEESGSSGEESESSAEVPKKVESSQKPESDSEGESESESSSG 
PEPESEPAKTIKLKPVGTKPIPETSGSAATVPESSTAKRPLKEAAPEAIKKQKTSDTEHV 
KKPITNDEVKKISSEDAKKMFQRLFSETDEIALLQGIIDFTSTKGDPYEDIDAFCIYVKK 
LIDFDATKNQIVTKLQRLKKKFNNAVKNSLKKGKTEDDIEFAKDLEQKGFELSRKIWGSN 
GVLVTGKSSRKKVGGTPAPKEMKLVAHSTPKKQQEEAKKPERTEAKVVNTGLSIGKEIAS 
FLNADNGSSCGLDESTLTAVWAKVADGAEKREVEEKWKKLKAKQFELCLQRSGLVNETAK 
MIFKAYES*
>AT1G60850.3 |  ATRPAC42 DNA binding / DNA-directed RNA polymerase/ protein dimerization 
MVTKAEKQFAKNFNIDDLPDVPAGLPPHLKAQQTRVVSKNNAPAHTASAIYSGTYVSSTE 
EDDNVKLGNFYDNFKVDVVSLTKTDMEFDMIGIDAAFANAFRRILIAEVPSMAIEKVLIA 
YNTSVIIDEVLAHRMGLIPIAADPRLFEYLSEHDQANEKNTIVFKLHVKCPKNRPRLKVL 
TSDLKWLPNGSELLRESENKTSKPKTYTSFSCSQDSLPEFANNPITPCDLDILIAKLAPG 
QEIELEAHAVKGIGKTHAKWSPVGTAWYRMHPEVRSIYINLSYALWETALCFVFPHPNIY 
VS*
>AT1G60850.2 |  ATRPAC42 DNA binding / DNA-directed RNA polymerase/ protein dimerization 
MVTKAEKQFAKNFNIDDLPDVPAGLPPHLKAQQTRVVSKNNAPAHTASAIYSGTYVSSTE 
EDDNVKLGNFYDNFKVDVVSLTKTDMEFDMIGIDAAFANAFRRILIAEVPSMAIEKVLIA 
YNTSVIIDEVLAHRMGLIPIAADPRLFEYLSEHDQANEKNTIVFKLHVKCPKNRPRLKVL 
TSDLKWLPNGSELLRESENKTSKPKTYTSFSCSQDSLPEFANNPITPCDLDILIAKLAPG 
QEIELEAHAVKGIGKTHAKWSPVGTAWYRMHPEVVLRGEVEDELAERLVNVCPQNVFDIE 
DMGKGKKRATVAQPRKCTLCKECVRDDDLVDHVDLGSVKNHFIFNIESTGSLPPEVLFTE 
AVKILEAKCEAITDF*
>AT1G60850.1 |  ATRPAC42 DNA binding / DNA-directed RNA polymerase/ protein dimerization 
MVTKAEKQFAKNFNIDDLPDVPAGLPPHLKAQQTRVVSKNNAPAHTASAIYSGTYVSSTE 
EDDNVKLGNFYDNFKVDVVSLTKTDMEFDMIGIDAAFANAFRRILIAEVPSMAIEKVLIA 
YNTSVIIDEVLAHRMGLIPIAADPRLFEYLSEHDQANEKNTIVFKLHVKCPKNRPRLKVL 
TSDLKWLPNGSELLRESENKTSKPKTYTSFSCSQDSLPEFANNPITPCDLDILIAKLAPG 
QEIELEAHAVKGIGKTHAKWSPVGTAWYRMHPEVVLRGEVEDELAERLVNVCPQNVFDIE 
DMGKGKKRATVAQPRKCTLCKECVRDDDLVDHVDLGSVKNHFIFNIESTGSLPPEVLFTE 
AVKILEAKCEAITDF*
>AT5G19910.1 |  SOH1 family protein 
MASPEEMGDDASEIPSPPKNTYKDPDGGRQRFLLELEFIQCLANPTYIHYLAQNRYFEDE 
AFIGYLKYLQYWQRPEYIKFIMYPHCLYFLELLQNPNFRTAMAHPANKELAHRQQFYYWK 
NYRNNRLKHILPRPLPEPVPPQPPVAPSTSLPPAPSATAALSPALSPMQYNNMLSKNDTR 
NMGATGIDRRKRKKGI*
>AT2G29540.1 |  ATRPC14 (RNA POLYMERASE 14 KDA SUBUNIT) DNA binding / DNA-directed RNA polymerase/ protein dimerization 
MEHGSFTNVSHASFTLSEEDHTLANAVRFVLNQDPRVTVAAYTIPHPSLEQVNIRVQTTG 
DPAREVFKDACQELMQMNRHVRSVFDKAVAEYKDEQKRKEEAEEEELKRQRDLFGSMDIE 
NN*
>AT2G29540.2 |  ATRPC14 (RNA POLYMERASE 14 KDA SUBUNIT) DNA binding / DNA-directed RNA polymerase/ protein dimerization 
MEHGSFTNVSHASFTLSEEDHTLANAVRFVLNQDPRVTVAAYTIPHPSLEQVNIRVQTTG 
DPAREVFKDACQELMQMNRHVRSVFDKAVAEYKDEQKRKEEAEEEELKRQRDLFGSMDIE 
NN*
>AT2G29540.3 |  ATRPC14 (RNA POLYMERASE 14 KDA SUBUNIT) DNA binding / DNA-directed RNA polymerase/ protein dimerization 
MLVYIHVMLLEILIVFNECHIILSLRLMELICWCVDNDDYVNNQYCFQFCSPRVTVAAYT 
IPHPSLEQVNIRVQTTGDPAREVFKDACQELMQMNRHVRSVFDKAVAEYKDEQKRKEEAE 
EEELKRQRDLFGSMDIENN*
>AT5G03220.1 |  transcriptional co-activator-related 
MATATYPPPPPYYRLYKDYSENPNSAPEPPPPIEGTYVCFGGNYTTEDVLPSLEEQGVPQ 
LYPKDSNLDYKNELRSLNRELQLHILELADVLVDRPSQYAKRIGEISSIFKNLHHLLNSL 
RPHQARATLIHIMELQIQQRKQAVEDIKRRREEAQRLLKDAYLTLDGQ*
>AT1G11760.1 |  unknown protein 
MDNIVDSLNKAYEKFVLASAGVLESKESAGGQKALLTDTALENFKEKWELFRVACDQAEE 
FVESVKQRIGSECLVDEATGLTTTAAGGQAPAAVTGAATSLPPISAVRLEQMSRAVRWLV 
LELQRGSGVAPGSVHSSSTGFDSRFSEDSTQ*
>AT5G42060.1 |  unknown protein 
MEEVAREEVEIDKDLRRKIKKTVKKILESSNLYKITEIKAREEASLKLDLDLSQDPYKVI 
VKEEVENFLEEAVKLIGNKLAMLPKRIESTSI*
>AT2G03070.1 |  unknown protein 
METQPQQPPPPPVAEKLNPKLEKELNLESLKTRAVSLAKAIARILEDFDAYGRTNTTPKW 
QDILGQYSMVNLELFNIVEEVKRVSNAFVVLPKNVNAMNAAILPVMLSSKLLPEMETDDN 
AKREQLLQGVQSLPIPMQIERLKARMDMIAAACENAERVLADTRKAYGFGTRQGPSMLPT 
MDKGQAAKIQEQEKMLRDAVNDGKGTQLPPDQRQITTALPPHLADVLIINDAGKIALPGQ 
SNNINNQGMMQVSGTQFVGRSAASPSGPNFDNTTSPLPYSNSPRATGMVNVPSPQHQIQQ 
QQFQQQQQRSKLMQLPQHQQQQLLAQQQQQLRQSSMQGLGQSQIPALHDMHGQAQQKFQT 
SHGQHQMPYSQPMGAHQQFQARQLSGGHIQHSMSQGQLNPAMNRHLNQFSGGANSALFTS 
AQGSPSSQMIPNMSSMQSQTLVPRMQQFGVSGTNPQRSHSSQMLGDQMFNTSGMMQTQQT 
QIQQSQQQQQQQQQGGYGNMQTNQQSLQPNNMMQNAQQRHQNPQ*
>AT3G01435.1 |  Expressed protein 
MDPQTQNTSLQRLQNVENRVVKVLELAGGVMEELASPSGPKKEFVNSHCREFMQSMKDIQ 
VTLREEIKSACEYRPFEKCDYNARIANEICFQKLEYVLTQLEDLKQTADRYPSSD*
>AT5G02850.1 |  hydroxyproline-rich glycoprotein family protein 
MLQHQIVQSPARLGLTGPGSPSVQNPTPTRHGHPTSSSSSQSQHQQIQQQPNLLPSSTVA 
AASSASASAAVSSSALLSLLPPLPRAQALLQQMAVLTSKLFDVSPNRAIWLSAFRGSLPS 
FLSSHSLPPPPPLENPSPSSTKEILSQFNSLQTQLFEAVTELQEILDLQDAKQKVAREIK 
SKDSSLLAFANKLKDAERVLDMLVDDYSDYRKPKRSKIEEDDEDNDNESSSSSTTVSSQL 
KLKDILAYAHKISYTTFAPPEFGAGQAPLRGALPPAPQDEQMRASQLYTFADLDIGLPKT 
VENMEKKVEALIEPPPPPEAMDISAIHNLLPPNIAVPSGWKPGMPVELPRDLPLPPPGWK 
PGDPVVLPPLESIAAPRAEDHQHMRPSQGLHRPPDVIQVRAVQLDILEDDDSSDYSSDDA 
SSDDED*
>AT5G41910.1 |  RNA polymerase II mediator complex protein-related 
MDQTQNTIAGIGEINGSIKTEINAIATVDDSNEKLNQIINSNQKILELLHQLKLTVSSFT 
PASQLHLLQRLNSLVMELDNMAKLSDKCNIQVPIEVLNLIDDGKNPDEFTKDVLNKNCIA 
KNQVTKGKSDAFKGLRKHLLEELEQAFPDEVDRYRDIRASYAAEAKRLAQTQSVLPNGDA 
KVKSEL*
>AT1G26665.2 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 24 plant structures EXPRESSED DURING 15 growth stages BEST Arabidopsis thaliana protein match is RNA polymerase II mediator complex protein-related (TAIRAT5G419101) Has 223 Blast hits to 223 proteins in 105 species Archae - 0 Bacteria - 0 Metazoa - 112 Fungi - 83 Plants - 24 Viruses - 0 Other Eukaryotes - 4 (source NCBI BLink) 
MDPTQNTSAGIGGSNGTIRYQTNDGTSTVTVADDSKENLSQVINSIEKTLGVLHQLHLTV 
TSFTPASQLHLLQRLNSLVMELDNMTKLSEKCNIQIPMEVLNLIDDGKNPDEFTKDVLNS 
CIARNQVTKGKTDAFKDLRKHILEELEQNFPDEVDMYREIRASSAAVTKRLAQSQSVLPN 
GDAKVKNEL*
>AT1G26665.1 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 24 plant structures EXPRESSED DURING 15 growth stages BEST Arabidopsis thaliana protein match is RNA polymerase II mediator complex protein-related (TAIRAT5G419101) Has 223 Blast hits to 223 proteins in 105 species Archae - 0 Bacteria - 0 Metazoa - 112 Fungi - 83 Plants - 24 Viruses - 0 Other Eukaryotes - 4 (source NCBI BLink) 
MDPTQNTSAGIGGSNGTIRYQTNDGTSTVTVADDSKENLSQVINSIEKTLGVLHQLHLTV 
TSFTPASQLHLLQRLNSLVMELDNMTKLSEKCNIQIPMEVLNLIDDGKNPDEFTKDVLNS 
CIARNQVTKGKTDAFKDLRKHILEELEQNFPDEVDMYREIRASSAAEAKRLAQSQSVLPN 
GDAKVKNEL*
>AT1G55080.1 |  unknown protein 
MDQFSGGGGNWSMIPNVQAQGNFGTPTNHDQQLFLQQQQLQQQQQQQQQQQFHLQQQQQT 
QQQQQQFQPQQQQEMQQYQQFQQQQHFIQQQQFQQQQRLLQSPPLQPQSLQSPPPQQTMV 
HTPQSMMHTPQQQQQLVQTPVQTPQQHQSLASHFHLYPMVEKLADVIENGTRDQNSDALV 
NELNSHFDKCQQLLNSISGSLGSKTMTVDGQKRNVEESEQLLQQRRDLIVEYRKSIEEIV 
TMEH*
>AT3G04740.1 |  SWP (STRUWWELPETER) 
MAELGQQTVDFSALVGRAAEESFLSFKELVDKSKSTELSDTEKKVSLLKYVAKTQQRMLR 
LNALAKWCKQVPLINYFQDLGSTLSAHDICFTQAADSLFFMHEGLQQARAPVYDVPSAVE 
ILLTGSYQRLPKCLDDVGMQSSLDEHQQKPALRKLEVLVRSKLLEITLPKEITEVKISKG 
TVTLSVDGEFKVLVTLGYRGHLSMWRILHLDLLVGERSGPIKLEVTRRHILGDDLERRMS 
VAENPFTILYAVLHELCVAIVMDTVIRQVRALLQGRWKDAIRFDLISDTGTTPANQEGEA 
DSVSLRTPGMKLFYWSDSDKNSGPFIKIEPGSDLQIKCSHSTFVIDPLTGKEAEFSLDQS 
CIDVEKLLLKAICCNRYTRLLEIQKELLRNTRICRTPSDVILQALLDEPGIEGDNMVDSK 
ERVEPEVLRVRAYGSSFFTLGINIRTGRFLLQSSKSILTSSILEEFEDALNQGSISAVDA 
FINLRSKSILHFFAAIGKFLGLEVYEHGFGINKVPKSLLDGSSILTLGFPDCESSHLLLM 
ELEKDFTPLFKLLETQMDGSGKPQSFNDPSNILRAKKIDIGQIRILEDDLNLITSDVVKF 
VSSFSDAEGINQASGHRQPGLVDEALTEMSGSQLSFSSVVDGVFGLQKVTSALMSIDGHG 
LVPKNLSAVTGHGKAPMLTSYHSDSLYNRQGPLQSSSYNMLSSPPGKGSAMKKIAISNSD 
QELSLILSPSLSTGNGVSESGSRLVTESSLSPLPLSQTADLATSSAGPLLRKDQKPRKRS 
ASDLLRLIPSLQVVEGVASPNKRRKTSELVQSELVKSWSPASQTLSTAVSTSTKTIGCSY 
GNLIAEANKGNAPSSVFVYALLHVVRHSSLSIKHAKLTSQMEALDIQYVEEMGLRDAFSD 
IWFRLPFAQNDSWQHICLQLGRPGSMCWDVKINDQHFRDLWELQKGSKTTPWGSGVHIAN 
SSDVDSHIRYDPEGVVLSYQSVEADSIKKLVADIQRLSNARMFSLGMRKLLGIKPDEKTE 
ECSANSTMKGSTGGKGSGEPVDRWRAFKIEAVGLTSLWFSFGSGVLARFVVEWESGKDGC 
TMHVSPDQLWPHTKFLEDFINGAEVESLLDCIRLTAGPLHALAAATRPARASTATGMPVV 
PATASSRQSNQIQQTQGIIAPSTLAAPNATGQSASATSGNTVASSAPSPLGGGFHGVAML 
AAAGRSGPGIVPSSLLPIDVSVVLRGPYWIRIIYRKRFAVDMRCFAGDQVWLQPATPPKG 
GASIGGSLPCPQFRPFIMEHVAQELNGLEPNLTGSQGATNPNSGNPTVNGVNRVNFSPSS 
ARAAMNRVASVASGSLVVSSGLPVRRTPGTAVPAHVRGELNTAIIGLGDDGGYGGGWVPL 
VALKKVLRGILKYLGVLWLFAQLPDLLREILGSILKDNEGALLNLDQEQPALRFFVGGYV 
FAVSVHRVQLLLQVLSVRRFHHQAQQNGSSAAAQEELTQSEIGEICDYFSRRVASEPYDA 
SRVASFITLLTLPISVLREFLKLIAWKKGLSQSQQAGEIAPAQRPRIELCLENHSGTDLD 
NNCAAKSNIHYDRPHNTVDFALTVVLDPVHIPHINAAGGAAWLPYCVSVRLRYTFGENPS 
VTFLGMEGSHGGRACWQRVDDWEKCKQRVSRTVEVNGSAAGDLTQGKLKLVADSVQRTLH 
LCLQGLREGGNNNNNTHQKEFTI*
>AT2G28230.1 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown CONTAINS InterPro DOMAIN/s TATA-binding related factor (InterProIPR013921) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT4G090701) Has 43 Blast hits to 43 proteins in 21 species Archae - 0 Bacteria - 0 Metazoa - 22 Fungi - 0 Plants - 18 Viruses - 0 Other Eukaryotes - 3 (source NCBI BLink) 
MPVKWVLHWQPNQGSTVSSQILNEATQCVESINGVKEGRWKATLNYYKPMLKDQANQLEF 
PRDFLGISLADQPNKYYFIIRTQRIVLEADSSIQLIMEKLQSYKSKVALYFDGFQYQLGD 
FRLRVGKVVPTHSENVRGIVMEVEYLPISSMEKAQKVMEEFLEIWNEALAKRSLPGKFVN 
IDLNFGEFGLGDIYTPQHTAVRYALVMAHMIATVQAVRG*
>AT2G22370.1 |  unknown protein 
MSMECVVQGIIETQHVEALEILLQGLCGVQRERLRVHELCLRSGPNLGVVSSEVRLLCDL 
DQPEPTWTVKHVGGAMRGAGADQISVLVRNMIESKVSKNALRMFYALGYKLDHELLKVGF 
AFHFQRTAHISVSVSSVNKMPKVHAIDEAVPVTPGMQIVDVTAPATSENYSEVAAAVSSF 
CEFLAPLVHLSKPSISTGVVPTAAAAAASLMSDGGGTTL*
>AT5G20170.1 |  unknown protein 
MDSDMEISLDRLPIKRLESIEENGAERFPSDVDYDDKRVSLIRRIDFAWALEEEDELKKK 
KQKKSSKDSVEQWKWKGMVENLQLAHQELTVIIDLIDTVQANDAVTVAGMTRPKPMPNEI 
LSDLAVSTATKLQGYRNLGNYFKQSAKALEQKINREARFYGALIRLQRNWKVKRQRMLAS 
NASNEGFTIDLSDSSLYDPTSGFRPSTLSTIRVDHDSAGMLAINVPQDSWYSLRFGYVGL 
NPIGNSNESDEHIDSTTGHDIPGTSEKLSASDDKYVKETHSLLREVHKSIFAEQLFDMLN 
REAFNEGVGFNISGLRENFMEMSIGQGASLFVSLHPSGKNPSIKKSESATLLIESSGRVE 
PAEGGDYRLKKLGFPNRTSYEIYLQQIFHEHAFGKAKDQLKSKSIRASNQTEKDSNSGLL 
DHFCLSLTHRIFSNRVLVHLESVVCKVPYLHLISHPTWNSRTSSWTVFMTVPPSIIPQGR 
SETQSPDGKRNLKTQFRTKVVVKDDCISVEAECTPNVVGLLKSSSCNLFSINKYECDVAD 
LPVMILQQVASQIVCWLLEEARTVGTKASREFLSLSLEIVEGERVSLVAHVNPEDAKGCI 
SWWLVMENGCTEEREGVSESRKLLGHLSLDVLYSVLMDLINLCGTGRNALERL*
>AT1G16430.1 |  surfeit locus protein 5 family protein / SURF5 family protein 
MMNKGGGSGGGSGPTAAAAAAALQKQKALLQRVDTDITSVVDNFNQIVNVARVSDPPMKN 
SQEAYMMEMRASRLVQAADSLLKLVSELKQTAIFSGFASLNDHVEQRIAEFDQEAEKTNR 
LLARIADDASASLKELESHYYSSAQRSTLD*
>AT4G04780.1 |  MED21 (MEDIATOR 21) 
MSTNSYYSSASSSGFRVCPPGVPSKCWCGEEIITFTSKTKENPYRRFYRCAIAMKRENEE 
HLFKWVDEALLDEIKMVNEKCKRVVENISDLRMNVMANMELLNKNAKQMEEELIKKMEGE 
LLTMKENVEELGHVMAKSALKTVGVAVVIVASIVWLWGRVRFKETTEIVPYEFYGIRAQK 
SKGKATLKAHFKNLRAQTQNAERSLLRTTKRPNSQRLSVVATPNRLCVAGEILSTGLLRF 
WKMDIISQLQEQVNTIAAITFNAFGTLQRDAPPVQLSPNYPEPPATTTVTDDATPFPEQP 
KQLSAGLVKAAKQFDALVAALPLSEGGEGAQLKRIAELQVENDLVGQELQKQLEAAEKEL 
KQVQELFGQAADNCLNMKKPE*
>AT4G04780.2 |  MED21 (MEDIATOR 21) 
MSTNSYYSSASSSGFRVCPPGVPSKCWCGEEIITFTSKTKENPYRRFYRCAIAMKRENEE 
HLFKWVDEALLDEIKMVNEKCKRVVENISDLRMNVMANMELLNKNAKQMEEELIKKMEGE 
LLTMKENVEELGHVMAKSALKTVGVAVVIVASIVWLWGRVRFKETTEIVPYEFYGIRAQK 
SKGKATLKAHFKNLRAQTQNAERSLLRTTKRPNSQRLSVVATPNRLCVAGEILSTGLLRF 
WKMDIISQLQEQVNTIAAITFNAFGTLQRDAPPVQLSPNYPEPPATTTVTDDATPFPEQP 
KQLSAGLVKAAKQFDALVAALPLSEGGEGAQLKRIAELQVENDLVGQELQKQLEAAGAKA 
STRAVWTSC*
>AT3G52860.1 |  unknown protein 
MDYQQKPPQSSDPSPSPPDRPPGIRSPETPSNNQNNDIEDIMACVTALEAALLPCLPARE 
LQAIDRSPHPSHQIDVERHARDFMEAAKKLQLYFMGLKREDRAPSRAESLKKDIAVMEEE 
LKTKDELIKKHMRLFQESQKLVKEQIEKHRDELEKV*
>AT5G41010.1 |  NRPB12 DNA binding / DNA-directed RNA polymerase 
MDPAPEPVTYVCGDCGQENTLKSGDVIQCRECGYRILYKKRTRRVVQYEAR*
>AT3G59600.1 |  NRPB8B DNA-directed RNA polymerase 
MASNIIMFEDIFVVDKLDPDGKKFDKVTRVEARSHNLEMFMHLDVNTEVYPLAVGDKFTL 
AMAPTLNLDGTPDTGYFTPGAKKTLADKYEYIMHGKLYKISERDGKTPKAELYVSFGGLL 
MLLQGDPAHISHFELDQRLFLLMRKL*
>AT5G12230.1 |  unknown protein 
MEPERLKFGGPRELCGAADLISQFKLVQHHEFFCKKSLPVSLSDSHYLHNVVGDTEIRKG 
EGMQLDQLIESISQSRETNIRIQPFDIDELQESFQLNDMTPVELPPAEKGAPTIPSKSKS 
ESKDRDRKHKKHKDRDKDKDREHKKHKHKHKDRSKDKDKDKDRDRKKDKNGHHDSGDHSK 
KHHDKKRKHDGDEDLNDVQRHKKNKHKSSKLDEVGAIRVAG*
>AT1G54250.1 |  NRPB8A DNA-directed RNA polymerase 
MASNIILFEDIFVVDQLDPDGKKFDKVTRVQATSHNLEMFMHLDVNTEVYPLAVGDKFTL 
ALAPTLNLDGTPDTGYFTPGAKKTLADKYEYIMHGKLYKISERDGKTPKAELYVSFGGLL 
MLLKGDPAHISHFELDQRLFLLMRKL*
>AT3G09180.1 |  unknown protein 
MQTLHQSQLLQNPAEAANNQSESDAPPKQVAQAMERLNQAARVIADIRLGADRILEAMFV 
ASQPRHTDMPLQLFLREDASMRQHLQDLRLIGKKLEESGVLTESLRSRSNSWGLHMPLVC 
PDGAVVAYAWKRQLAGQAGASAVDRTRLALKAFTDQKRRFFPHIDDGLKMEPSSKKHRAS 
HLLLENGREEPVDYKTLPDIQSRLEKLVPSVKVSTYGRLNWLKRANSLPGSGSDDPTEAS 
KPIFQSSSKLRSGLQTEVVDKIAVIELSFPSLFRAIVSLSPAGSVDPDAVAFFSPDEGGS 
YLHARGFSVYHVYKHITEHAATALQYFLGFGTGTALYSLLLWICSFESVFSKPCTKCGRL 
LAMDKKSALILPPLHRAYQELPLALNLDVCEAYHSSCSQDDT*
>AT3G23590.1 |  RFR1 (REF4-related 1) 
MVVPGRRTVWDCVIELTKMAQENCVDPRLWASQLSSNLKFFAVELPSTELAEVIVSYICW 
DNNVPIVWKFLERAMALKLVSPLVVLALLADRVVPTRSTQQAAYRIYLELLKRNMFTIKD 
HISGPHYQKVMISVSNILRLSELFDLDTSKPGVLLVEFVFKMVSQLLDAALSDEGLLELS 
QDSSSQWLVKSQDMEIDAPERYNEKTGSLEKLQSLNTIMAIELIAEFLRNTVIARLLYLV 
SSNRASKWHEFVQKVQLLGENSSALKHSKVLNSGDLLQLISNRRFGYSYDSKVTSARKSN 
AIVDFGSLSSYAGLCHGASLSSLWLPLDLVFEDAMDGYQVNPTSAIEIITGLAKTLKEIN 
GSTWHDTFLGLWIAALRLVQRERDPIEGPIPRLDTRLCMSLCIVPLVVANLIEEGKYESV 
MEKLRDDLVTSLQVLGDFPGLLAPPKCVVSAANKAATKAILFLSGGNVGKSCFDVINMKD 
MPVNCSGNMRHLIVEACIARNILDMSAYSWPGYVNGRINQIPQSLPNEVPCWSSFVKGAP 
LNAAMVNTLVSVPASSLAELEKLFEVAVKGSDDEKISAATVLCGASLTRGWNIQEHTVEY 
LTRLLSPPVPADYSRAENHLIGYACMLNVVIVGIGSVDSIQIFSLHGMVPQLACSLMPIC 
EEFGSYTPSVSWTLPSGEAISAYSVFSNAFTLLLKLWRFNHPPIEHGVGDVPTVGSQLTP 
EHLLSVRNSYLVSSEILDRDRNRKRLSEVARAASCQPVFVDSFPKLKVWYRQHQRCIAAT 
LSGLTHGSPVHQTVEALLNMTFGKVRGSQTLNPVNSGTSSSSGAASEDSNIRPEFPAWDI 
LKAVPYVVDAALTACTHGRLSPRQLATGLKDLADFLPASLATIVSYFSAEVSRGVWKPVF 
MNGVDWPSPATNLSTVEEYITKILATTGVDIPSLAPGGSSPATLPLPLAAFVSLTITYKI 
DKASERFLNLAGPALECLAAGCPWPCMPIVASLWTQKAKRWFDFLVFSASRTVFLHNQDA 
VIQLLRNCFSATLGLNAAPMSNDGGVGALLGHGFGSHFYGGISPVAPGILYLRMYRALRD 
TVSVSEEILSLLIHSVEDIAQNRLSKEKLEKLKTVKNGSRYGQSSLATAMTQVKLAASLS 
ASLVWLTGGLGVVHVLIKETIPSWFLSTDKSDREQGPSDLVAELRGHALAYFVVLCGALT 
WGVDSRSSASKRRRQAILGSHLEFIASALDGKISVGCETATWRTYISGLVSLMVSCLPLW 
VTEIDTEVLKSLSNGLRKWGKDELAIVLLSLGGLKTMDYAADFIIHLRS*
>AT1G31360.1 |  RECQL2 (ARABIDOPSIS RECQ HELICASE L2) 3-5 DNA helicase/ ATP-dependent helicase/ four-way junction helicase/ protein binding 
MESEAIQEDLQNLDVELKDVQGQISALIEHQDRLYERKSELKTLLKALAASGSPVASSGG 
SSAIENWSETFEWDSRADDVRFNVFGISKYRANQKEIINAIMTGRDVLVIMAAGGGKSLC 
YQLPAMLRGGTTLVVSPLLSLIQDQVMGLAALGISAYMLTSTSGKENEKFVYKALEKGED 
DLKILYVTPEKVSKSKRFMSKLEKCHNAGRLSLISIDEAHCCSQWGHDFRPDYKNLSILK 
TQFPKVPMVALTATATQKVQNDLIEMLHIPKCVKFVSSVNRPNLFYSVREKSAVGKLVVD 
EIAEFIRESYSNNESGIVYCFSRKECEQIAGDLRERGISADYYHADMDANMREKVHMRWS 
KNKLQVIVGTVAFGMGINKPDVRFVIHHSLSKSMETYYQESGRAGRDGLPSECILFFRSA 
DVPRQSSMVFYEYSGLQNLYDIVRYCQSKTKCRRSAFFRHFGEPSQDCNGMCDNCALSSE 
VKEVDVSDLSKLVVSMVQETQAKDQRVTMLQLGDKLRNKHKDLIAELKRDEVEHLVIKLI 
VDSVLKEEFQHTPYSTNAYVTMGPLANQLLQGRKTIKMETSSRQTKKLKRSITFSGLELK 
LDELRKEISAADGSILPHTVLSTQQIGSISSQKPVSLQELESIIGKLKTEKYGDRILEEV 
MRHEAVSEQLVEDPTKEETCKSRLRKRAKTQKDVVLVESSGEEEA*
>AT1G31360.2 |  RECQL2 (ARABIDOPSIS RECQ HELICASE L2) 3-5 DNA helicase/ ATP-dependent helicase/ four-way junction helicase/ protein binding 
MLRGGTTLVVSPLLSLIQDQVMGLAALGISAYMLTSTSGKENEKFVYKALEKGEDDLKIL 
YVTPEKVSKSKRFMSKLEKCHNAGRLSLISIDEAHCCSQWGHDFRPDYKNLSILKTQFPK 
VPMVALTATATQKVQNDLIEMLHIPKCVKFVSSVNRPNLFYSVREKSAVGKLVVDEIAEF 
IRESYSNNESGIVYCFSRKECEQIAGDLRERGISADYYHADMDANMREKVHMRWSKNKLQ 
VIVGTVAFGMGINKPDVRFVIHHSLSKSMETYYQESGRAGRDGLPSECILFFRSADVPRQ 
SSMVFYEYSGLQNLYDIVRYCQSKTKCRRSAFFRHFGEPSQDCNGMCDNCALSSEVKEVD 
VSDLSKLVVSMVQETQAKDQRVTMLQLGDKLRNKHKDLIAELKRDEVEHLVIKLIVDSVL 
KEEFQHTPYSTNAYVTMGPLANQLLQGRKTIKMETSSRQTKKLKRSITFSGLELKLDELR 
KEISAADGSILPHTVLSTQQIGSISSQKPVSLQELESIIGKLKTEKYGDRILEEVMRHEA 
VSEQLVEDPTKEETCKSRLRKRAKTQKDVVLVESSGEEEA*
>AT5G67240.1 |  SDN3 (SMALL RNA DEGRADING NUCLEASE 3) exonuclease/ nucleic acid binding 
MEHKLATAEKKLLVDLVKLVQKRGLEGENGGWKEFLNVYDKKLGSSLSDPARRSNDVLVA 
FLLTFKKKEDLQLIARVMQCGANRELIEKFKQETPDKETPEQRLVRLTITHDDYPGNYTF 
PSYAEDWYVTELGKKKSKVIKSTRMLSIDCEMVTCEDGSQALVRVGAVDRDLKVVLDKFV 
KPDKPVIDYKTDITGVTAEDLERATLSVADIQKKLRRFLSVGTILVGHGLHNDLQVLRID 
HARVIDTSYVFEFVDAPKTQRPSLNNLCKSVLGQEVRMDGAAHNCVHDAAAAMKLVLAAV 
EKGAATLIQPTEEMMVAEKRRQEARQEAGKAQLFLHKIPHDVPSEELHGVLSGNFTLVVK 
PPKTGGYSTAVVDFSSPEEANEAFENVEGDVAKDKSGLPQKKAVLKLSSGLAVSLFVRKM 
VQDDSPCEISTSERARAEENNVSSKRQKTEDETEETKEATVNQREADKTKLFLHKIPHDV 
PSQELHGVLNGDFTLDVKPPKRKGGYYNAVVDFNSPEEANEAFENVEGDVVKDKTGLPQK 
MVVFKLSSGSGVSLYVRKMVHDDSPGEISTTKRARTEESNMSSKRQKTEDESEETKEANA 
KQREADKTKLLLHKIPLNVPSQELKVVITGQFTLEVMPPKRKGRYYNAVVTFNSPEEANK 
AFEKVKGEAVKEKGGLAQKMVAFKLSSGSGACLYVRKMVQDESEETKEANANHCEDDHLK 
EMEELKEKLKAMEFAISCEGHSKEIEELKQKLNAKEHQIQAQDKIIANLKMKLEKKQSKS 
RS*
>AT5G63480.1 |  unknown protein 
MLQKQVSTTTTMTTQELAMEGEKQLEETIEAAFQIISAMNDELCNPSLWSTSATPSSAAT 
TTGSNGSALVSADAAAIDGTSHHSESAGGGGGGGSGNSVLDEASLRYKNSVTSLRAVLAA 
IPNSQKAKASEMQNGLGSPESEDEIEKLEEQALSLRMEIAKKNVHVKELIDKLRELIADI 
STWQSPCSV*
>AT4G35800.1 |  NRPB1 (RNA POLYMERASE II LARGE SUBUNIT) DNA binding / DNA-directed RNA polymerase 
MDTRFPFSPAEVSKVRVVQFGILSPDEIRQMSVIHVEHSETTEKGKPKVGGLSDTRLGTI 
DRKVKCETCMANMAECPGHFGYLELAKPMYHVGFMKTVLSIMRCVCFNCSKILADEVCRS 
LFRQAMKIKNPKNRLKKILDACKNKTKCDGGDDIDDVQSHSTDEPVKKSRGGCGAQQPKL 
TIEGMKMIAEYKIQRKKNDEPDQLPEPAERKQTLGADRVLSVLKRISDADCQLLGFNPKF 
ARPDWMILEVLPIPPPPVRPSVMMDATSRSEDDLTHQLAMIIRHNENLKRQEKNGAPAHI 
ISEFTQLLQFHIATYFDNELPGQPRATQKSGRPIKSICSRLKAKEGRIRGNLMGKRVDFS 
ARTVITPDPTINIDELGVPWSIALNLTYPETVTPYNIERLKELVDYGPHPPPGKTGAKYI 
IRDDGQRLDLRYLKKSSDQHLELGYKVERHLQDGDFVLFNRQPSLHKMSIMGHRIRIMPY 
STFRLNLSVTSPYNADFDGDEMNMHVPQSFETRAEVLELMMVPKCIVSPQANRPVMGIVQ 
DTLLGCRKITKRDTFIEKDVFMNTLMWWEDFDGKVPAPAILKPRPLWTGKQVFNLIIPKQ 
INLLRYSAWHADTETGFITPGDTQVRIERGELLAGTLCKKTLGTSNGSLVHVIWEEVGPD 
AARKFLGHTQWLVNYWLLQNGFTIGIGDTIADSSTMEKINETISNAKTAVKDLIRQFQGK 
ELDPEPGRTMRDTFENRVNQVLNKARDDAGSSAQKSLAETNNLKAMVTAGSKGSFINISQ 
MTACVGQQNVEGKRIPFGFDGRTLPHFTKDDYGPESRGFVENSYLRGLTPQEFFFHAMGG 
REGLIDTAVKTSETGYIQRRLVKAMEDIMVKYDGTVRNSLGDVIQFLYGEDGMDAVWIES 
QKLDSLKMKKSEFDRTFKYEIDDENWNPTYLSDEHLEDLKGIRELRDVFDAEYSKLETDR 
FQLGTEIATNGDSTWPLPVNIKRHIWNAQKTFKIDLRKISDMHPVEIVDAVDKLQERLLV 
VPGDDALSVEAQKNATLFFNILLRSTLASKRVLEEYKLSREAFEWVIGEIESRFLQSLVA 
PGEMIGCVAAQSIGEPATQMTLNTFHYAGVSAKNVTLGVPRLREIINVAKRIKTPSLSVY 
LTPEASKSKEGAKTVQCALEYTTLRSVTQATEVWYDPDPMSTIIEEDFEFVRSYYEMPDE 
DVSPDKISPWLLRIELNREMMVDKKLSMADIAEKINLEFDDDLTCIFNDDNAQKLILRIR 
IMNDEGPKGELQDESAEDDVFLKKIESNMLTEMALRGIPDINKVFIKQVRKSRFDEEGGF 
KTSEEWMLDTEGVNLLAVMCHEDVDPKRTTSNHLIEIIEVLGIEAVRRALLDELRVVISF 
DGSYVNYRHLAILCDTMTYRGHLMAITRHGINRNDTGPLMRCSFEETVDILLDAAAYAET 
DCLRGVTENIMLGQLAPIGTGDCELYLNDEMLKNAIELQLPSYMDGLEFGMTPARSPVSG 
TPYHEGMMSPNYLLSPNMRLSPMSDAQFSPYVGGMAFSPSSSPGYSPSSPGYSPTSPGYS 
PTSPGYSPTSPGYSPTSPTYSPSSPGYSPTSPAYSPTSPSYSPTSPSYSPTSPSYSPTSP 
SYSPTSPSYSPTSPSYSPTSPAYSPTSPAYSPTSPAYSPTSPSYSPTSPSYSPTSPSYSP 
TSPSYSPTSPSYSPTSPAYSPTSPGYSPTSPSYSPTSPSYGPTSPSYNPQSAKYSPSIAY 
SPSNARLSPASPYSPTSPNYSPTSPSYSPTSPSYSPSSPTYSPSSPYSSGASPDYSPSAG 
YSPTLPGYSPSSTGQYTPHEGDKKDKTGKKDASKDDKGNP*
>AT3G10330.1 |  transcription initiation factor IIB-2 / general transcription factor TFIIB-2 (TFIIB2) 
MSDAFCSDCKRHTEVVFDHSAGDTVCSECGLVLESHSIDETSEWRTFANESGDNDPVRVG 
GPTNPLLADGGLTTVISKPNGSSGDFLSSSLGRWQNRGSNPDRGLIVAFKTIATMADRLG 
LVATIKDRANEIYKRVEDQKSSRGRNQDALLAACLYIACRQEDKPRTVKEICSVANGATK 
KEIGRAKEYIVKQLGLETGQLVEMGTIHAGDFMRRFCSNLGMTNQTVKAAQESVQKSEEF 
DIRRSPISIAAAVIYIITQLSDEKKPLRDISVATGVAEGTIRNSYKDLYPHLSKIIPAWY 
AKEEDLKNLQSP*
>AT2G41630.1 |  TFIIB (TRANSCRIPTION FACTOR II B) RNA polymerase II transcription factor/ protein binding / transcription regulator/ translation initiation factor/ zinc ion binding 
MSDAYCTDCKKETELVVDHSAGDTLCSECGLVLESHSIDETSEWRTFANESSNSDPNRVG 
GPTNPLLADSALTTVIAKPNGSSGDFLSSSLGRWQNRNSNSDRGLIQAFKTIATMSERLG 
LVATIKDRANELYKRLEDQKSSRGRNQDALYAACLYIACRQEDKPRTIKEICVIANGATK 
KEIGRAKDYIVKTLGLEPGQSVDLGTIHAGDFMRRFCSNLAMSNHAVKAAQEAVQKSEEF 
DIRRSPISIAAVVIYIITQLSDDKKTLKDISHATGVAEGTIRNSYKDLYPHLSKIAPSWY 
AKEEDLKNLSSP*
>AT4G21710.1 |  NRPB2 DNA binding / DNA-directed RNA polymerase 
MEYNEYEPEPQYVEDDDDEEITQEDAWAVISAYFEEKGLVRQQLDSFDEFIQNTMQEIVD 
ESADIEIRPESQHNPGHQSDFAETIYKISFGQIYLSKPMMTESDGETATLFPKAARLRNL 
TYSAPLYVDVTKRVIKKGHDGEEVTETQDFTKVFIGKVPIMLRSSYCTLFQNSEKDLTEL 
GECPYDQGGYFIINGSEKVLIAQEKMSTNHVYVFKKRQPNKYAYVGEVRSMAENQNRPPS 
TMFVRMLARASAKGGSSGQYIRCTLPYIRTEIPIIIVFRALGFVADKDILEHICYDFADT 
QMMELLRPSLEEAFVIQNQLVALDYIGKRGATVGVTKEKRIKYARDILQKEMLPHVGIGE 
HCETKKAYYFGYIIHRLLLCALGRRPEDDRDHYGNKRLDLAGPLLGGLFRMLFRKLTRDV 
RSYVQKCVDNGKEVNLQFAIKAKTITSGLKYSLATGNWGQANAAGTRAGVSQVLNRLTYA 
STLSHLRRLNSPIGREGKLAKPRQLHNSQWGMMCPAETPEGQACGLVKNLALMVYITVGS 
AAYPILEFLEEWGTENFEEISPSVIPQATKIFVNGMWVGVHRDPDMLVKTLRRLRRRVDV 
NTEVGVVRDIRLKELRIYTDYGRCSRPLFIVDNQKLLIKKRDIYALQQRESAEEDGWHHL 
VAKGFIEYIDTEEEETTMISMTISDLVQARLRPEEAYTENYTHCEIHPSLILGVCASIIP 
FPDHNQSPRNTYQSAMGKQAMGIYVTNYQFRMDTLAYVLYYPQKPLVTTRAMEHLHFRQL 
PAGINAIVAISCYSGYNQEDSVIMNQSSIDRGFFRSLFFRSYRDEEKKMGTLVKEDFGRP 
DRGSTMGMRHGSYDKLDDDGLAPPGTRVSGEDVIIGKTTPISQDEAQGQSSRYTRRDHSI 
SLRHSETGMVDQVLLTTNADGLRFVKVRVRSVRIPQIGDKFSSRHGQKGTVGMTYTQEDM 
PWTIEGVTPDIIVNPHAIPSRMTIGQLIECIMGKVAAHMGKEGDATPFTDVTVDNISKAL 
HKCGYQMRGFERMYNGHTGRPLTAMIFLGPTYYQRLKHMVDDKIHSRGRGPVQILTRQPA 
EGRSRDGGLRFGEMERDCMIAHGAAHFLKERLFDQSDAYRVHVCEVCGLIAIANLKKNSF 
ECRGCKNKTDIVQVYIPYACKLLFQELMSMAIAPRMLTKHLKSAKGRQ*
>AT2G15400.1 |  NRPE3B DNA binding / DNA-directed RNA polymerase/ protein dimerization 
MDGVTYQRFPTVKIRELKDDYAKFELRETDVSMANALRRVMISEVPTMAIHLVKIEVNSS 
VLNDEFIAQRLSLIPLTSERAMSMRFCQDCEDCNGDEHCEFCSVEFPLSAKCVTDQTLDV 
TSRDLYSADPTVTPVDFTSNSSTSDSSEHKGIIIAKLRRGQELKLKALARKGIGKDHAKW 
SPAATVTYMYEPDIIINEEMMNTLTDEEKIDLIESSPTKVFGIDPVTGQVVVVDPEAYTY 
DEEVIKKAEAMGKPGLIEIHPKHDSFVFTVESTGALKASQLVLNAIDILKQKLDAIRLSD 
NTVEADDQFGELGAHMREG*
>AT2G15400.2 |  NRPE3B DNA binding / DNA-directed RNA polymerase/ protein dimerization 
MDGVTYQRFPTVKIRELKDDYAKFELRETDVSMANALRRVMISEVPTMAIHLVKIEVNSS 
VLNDEFIAQRLSLIPLTSERAMSMRFCQDCEDCNGDEHCEFCSVEFPLSAKCVTDQTLDV 
TSRDLYSADPTVTPVDFTSNSSTSDSSEHKGIIIAKLRRGQELKLKALARKGIGKDHAKW 
SPAATVTYMYEPDIIINEEMMNTLTDEEKIDLIESSPTKVFGIDPVTGQVNFDVL*
>AT5G20850.1 |  ATRAD51 ATP binding / DNA binding / DNA-dependent ATPase/ damaged DNA binding / nucleoside-triphosphatase/ nucleotide binding / protein binding / sequence-specific DNA binding 
MTTMEQRRNQNAVQQQDDEETQHGPFPVEQLQAAGIASVDVKKLRDAGLCTVEGVAYTPR 
KDLLQIKGISDAKVDKIVEAASKLVPLGFTSASQLHAQRQEIIQITSGSRELDKVLEGGI 
ETGSITELYGEFRSGKTQLCHTLCVTCQLPMDQGGGEGKAMYIDAEGTFRPQRLLQIADR 
FGLNGADVLENVAYARAYNTDHQSRLLLEAASMMIETRFALLIVDSATALYRTDFSGRGE 
LSARQMHLAKFLRSLQKLADEFGVAVVITNQVVAQVDGSALFAGPQFKPIGGNIMAHATT 
TRLALRKGRAEERICKVISSPCLPEAEARFQISTEGVTDCKD*
>AT1G07950.1 |  surfeit locus protein 5 family protein / SURF5 family protein 
MNKGGGSGGGSGGGSGPTAAAAAAALQKQKALLQRVETDITSVVDNFTQIVNVSRVSDPP 
VKNSQETYMMEMRASRMVQAADSLLKLVSELKQTAIFSGFASLNDHVEQRIEEFDQEAEK 
TNRLLARIADDASANLKELESHYYSSAQRLTLDI*
>AT5G48630.1 |  cyclin family protein 
MASNFWTSTHYKELKDPEEVNVVHPLDAQRGISVEDFRLIKLHMSNYISKLAQHIKIRQR 
VVATAVTYMRRVYTRKSLTEYEPRLVAPTCLYLACKAEESVVHAKLLVFYMKKLYADEKF 
RYEIKDILEMEMKVLEALNFYLVVFHPYRSLPEFLQDSGINDTSMTHLTWGLVNDTYRMD 
LILIHPPFLITLACIYIASVHKEKDIKTWFEELSVDMNIVKNIAMEILDFYENHRLFTEE 
RVHAAFNKLATNP*
>AT5G63610.1 |  CDKE1 (CYCLIN-DEPENDENT KINASE E1) ATP binding / kinase/ protein kinase/ protein serine/threonine kinase 
MGDGSSSRSNSSNSTSEKPEWLQQYNLVGKIGEGTYGLVFLARTKTPPKRPIAIKKFKQS 
KDGDGVSPTAIREIMLLREISHENVVKLVNVHINFADMSLYLAFDYAEYDLYEIIRHHRD 
KVGHSLNTYTVKSLLWQLLNGLNYLHSNWIIHRDLKPSNILVMGDAEEHGIVKIADFGLA 
RIYQAPLKPLSDNGVVVTIWYRAPELLLGSKHYTSAVDMWAVGCIFAELLTLKPLFQGAE 
AKSSQNPFQLDQLDKIFKILGHPTMDKWPTLVNLPHWQNDVQHIQAHKYDSVGLHNVVHL 
NQKSPAYDLLSKMLEYDPLKRITASQALEHEYFRMDPLPGRNAFVASQPMEKNVNYPTRP 
VDTNTDFEGTTSINPPQAVAAGNVAGNMAGAHGMGSRSMPRPMVAHNMQRMQQSQGMMAY 
NFPAQAGLNPSVPLQQQRGMAQPHQQQQLRRKDPGMGMSGYAPPNKSRRL*
>AT4G28300.1 |  hydroxyproline-rich glycoprotein family protein 
MASGSSGRVNSGSKGFDFGSDDILCSYDDYTNQDSSNGPHSDPAIAASNSNKEFHKTRMA 
RSSVFPTSSYSPPEDSLSQDITDTVERTMKMYADNMMRFLEGLSSRLSQLELYCYNLDKT 
IGEMRSELTHAHEDADVKLRSLDKHLQEVHRSVQILRDKQELADTQKELAKLQLVQKESS 
SSSHSQHGEDRVATPVPEPKKSENTSDAHNQQLALALPHQIAPQPQVQPQPQPQQHQYYM 
PPPPTQLQNTPAPVPVSTPPSQLQAPPAQSQFMPPPPAPSHPSSAQTQSFPQYQQNWPPQ 
PQARPQSSGGYPTYSPAPPGNQPPVESLPSSMQMQSPYSGPPQQSMQAYGYGAAPPPQAP 
PQQTKMSYSPQTGDGYLPSGPPPPSGYANAMYEGGRMQYPPPQPQQQQQQAHYLQGPQGG 
GYSPQPHQAGGGNIGAPPVLRSKYGELIEKLVSMGFRGDHVMAVIQRMEESGQPIDFNTL 
LDRLSGQSSGGPPRGW*
>AT4G28300.2 |  hydroxyproline-rich glycoprotein family protein 
MARSSVFPTSSYSPPEDSLSQDITDTVERTMKMYADNMMRFLEGLSSRLSQLELYCYNLD 
KTIGEMRSELTHAHEDADVKLRSLDKHLQEVHRSVQILRDKQELADTQKELAKLQLVQKE 
SSSSSHSQHGEDRVATPVPEPKKSENTSDAHNQQLALALPHQIAPQPQVQPQPQPQQHQY 
YMPPPPTQLQNTPAPVPVSTPPSQLQAPPAQSQFMPPPPAPSHPSSAQTQSFPQYQQNWP 
PQPQARPQSSGGYPTYSPAPPGNQPPVESLPSSMQMQSPYSGPPQQSMQAYGYGAAPPPQ 
APPQQTKMSYSPQTGDGYLPSGPPPPSGYANAMYEGGRMQYPPPQPQQQQQQAHYLQGPQ 
GGGYSPQPHQAGGGNIGAPPVLRSKYGELIEKLVSMGFRGDHVMAVIQRMEESGQPIDFN 
TLLDRLSGQSSGGPPRGW*
>AT5G01230.2 |  FtsJ-like methyltransferase family protein 
MGKASRDKRDIYYRKAKEEGWRARSAFKLLQIDEEFNIFEGVKRVVDLCAAPGSWSQSGR 
S*
>AT5G01230.1 |  FtsJ-like methyltransferase family protein 
MGKASRDKRDIYYRKAKEEGWRARSAFKLLQIDEEFNIFEGVKRVVDLCAAPGSWSQVLS 
RQLYLPAKSSAESKDGDLPLIVAIDLQPMAPIEGVIQVQGDITNARTAEVVIRHFDGCKA 
DLVVCDGAPDVTGLHDMDEFVQSQLILAGLTIVTHILKEGGKFIAKIFRGKDTSLLYCQL 
KLFFPTVTFAKPKSSRNSSIEAFAVCENYSPPEGFNPRDLHRLLEKVGSPSGGSDLDCSS 
GWLEGPNKVYIPFLACGDLTGYDSDRSYPLPREADGSSYQSLDPIQPPIAPPYKRALELK 
KASAQSFNS*
>AT2G44580.1 |  protein binding / zinc ion binding 
MEEIGGAEAVINLKSGYSLPISYHPCFGPHEDLLLLEADDKLVSDIFHQRVTLRGLPDED 
AVLCTKSKTYAIKFVGNSNSMFLIPPSIFPGDAQVSDTNNNVSVLKIAPGNMELVEASPR 
LDKLKQILLANPFGAGEVEAMMDVDNDDLDHSGKKDLALYTWSDLVNTVQASDEELRNGL 
QSLSAIEIDGFWRVIDENYLDVILRMLLHNCVLKDWSFDDLDEDEVVNALVADEFPSQLA 
SHCLRVFGSKVNETDKWKLEPRLVCLHFARQILREEKMRLESFMEEWKKKIPDGMEERFE 
MLEGEVLTEKIGIETRVYTFSVRSLPSTPEERFSVLFKHRSKWEWKDLEPYLRDLHVPRL 
SMEGLLLKYTRRAQPKADAPPVFSAR*