>AT1G02970.1 |  WEE1 (ARABIDOPSIS WEE1 KINASE HOMOLOG) kinase/ protein kinase 
MFEKNGRTLLAKRKTQGTIKTRASKKIRKMEGTLERHSLLQFGQLSKISFENRPSSNVAS 
SAFQGLLDSDSSELRNQLGSADSDANCGEKDFILSQDFFCTPDYITPDNQNLMSGLDISK 
DHSPCPRSPVKLNTVKSKRCRQESFTGNHSNSTWSSKHRVDEQENDDIDTDEVMGDKLQA 
NQTERTGYVSQAAVALRCRAMPPPCLKNPYVLNQSETATDPFGHQRSKCASFLPVSTSGD 
GLSRYLTDFHEIRQIGAGHFSRVFKVLKRMDGCLYAVKHSTRKLYLDSERRKAMMEVQAL 
AALGFHENIVGYYSSWFENEQLYIQLELCDHSLSALPKKSSLKVSEREILVIMHQIAKAL 
HFVHEKGIAHLDVKPDNIYIKNGVCKLGDFGCATRLDKSLPVEEGDARYMPQEILNEDYE 
HLDKVDIFSLGVTVYELIKGSPLTESRNQSLNIKEGKLPLLPGHSLQLQQLLKTMMDRDP 
KRRPSARELLDHPMFDRIRG*
>AT3G48750.1 |  CDC2 (CELL DIVISION CONTROL 2) cyclin-dependent protein kinase/ kinase/ protein binding / protein kinase 
MDQYEKVEKIGEGTYGVVYKARDKVTNETIALKKIRLEQEDEGVPSTAIREISLLKEMQH 
SNIVKLQDVVHSEKRLYLVFEYLDLDLKKHMDSTPDFSKDLHMIKTYLYQILRGIAYCHS 
HRVLHRDLKPQNLLIDRRTNSLKLADFGLARAFGIPVRTFTHEVVTLWYRAPEILLGSHH 
YSTPVDIWSVGCIFAEMISQKPLFPGDSEIDQLFKIFRIMGTPYEDTWRGVTSLPDYKSA 
FPKWKPTDLETFVPNLDPDGVDLLSKMLLMDPTKRINARAALEHEYFKDLGGMP*
>AT5G10450.1 |  GRF6 (G-box regulating factor 6) protein binding / protein phosphorylated amino acid binding 
MAATLGRDQYVYMAKLAEQAERYEEMVQFMEQLVTGATPAEELTVEERNLLSVAYKNVIG 
SLRAAWRIVSSIEQKEESRKNDEHVSLVKDYRSKVESELSSVCSGILKLLDSHLIPSAGA 
SESKVFYLKMKGDYHRYMAEFKSGDERKTAAEDTMLAYKAAQDIAAADMAPTHPIRLGLA 
LNFSVFYYEILNSSDKACNMAKQAFEEAIAELDTLGEESYKDSTLIMQLLRDNLTLWTSD 
MQEQMDEA*
>AT5G10450.1 |  GRF6 (G-box regulating factor 6) protein binding / protein phosphorylated amino acid binding 
MAATLGRDQYVYMAKLAEQAERYEEMVQFMEQLVTGATPAEELTVEERNLLSVAYKNVIG 
SLRAAWRIVSSIEQKEESRKNDEHVSLVKDYRSKVESELSSVCSGILKLLDSHLIPSAGA 
SESKVFYLKMKGDYHRYMAEFKSGDERKTAAEDTMLAYKAAQDIAAADMAPTHPIRLGLA 
LNFSVFYYEILNSSDKACNMAKQAFEEAIAELDTLGEESYKDSTLIMQLLRDNLTLWTSD 
MQEQMDEA*
>AT5G10450.2 |  GRF6 (G-box regulating factor 6) protein binding / protein phosphorylated amino acid binding 
MAATLGRDQYVYMAKLAEQAERYEEMVQFMEQLVTGATPAEELTVEERNLLSVAYKNVIG 
SLRAAWRIVSSIEQKEESRKNDEHVSLVKDYRSKVESELSSVCSGILKLLDSHLIPSAGA 
SESKVFYLKMKGDYHRYMAEFKSGDERKTAAEDTMLAYKAAQDIAAADMAPTHPIRLGLA 
LNFSVFYYEILNSSDKACNMAKQAFEEAIAELDTLGEESYKDSTLIMQLLRDNLTLWTSD 
MQMDEA*
>AT5G10450.2 |  GRF6 (G-box regulating factor 6) protein binding / protein phosphorylated amino acid binding 
MAATLGRDQYVYMAKLAEQAERYEEMVQFMEQLVTGATPAEELTVEERNLLSVAYKNVIG 
SLRAAWRIVSSIEQKEESRKNDEHVSLVKDYRSKVESELSSVCSGILKLLDSHLIPSAGA 
SESKVFYLKMKGDYHRYMAEFKSGDERKTAAEDTMLAYKAAQDIAAADMAPTHPIRLGLA 
LNFSVFYYEILNSSDKACNMAKQAFEEAIAELDTLGEESYKDSTLIMQLLRDNLTLWTSD 
MQMDEA*
>AT5G65430.1 |  GRF8 (GENERAL REGULATORY FACTOR 8) protein binding / protein phosphorylated amino acid binding 
MATTLSRDQYVYMAKLAEQAERYEEMVQFMEQLVSGATPAGELTVEERNLLSVAYKNVIG 
SLRAAWRIVSSIEQKEESRKNEEHVSLVKDYRSKVETELSSICSGILRLLDSHLIPSATA 
SESKVFYLKMKGDYHRYLAEFKSGDERKTAAEDTMIAYKAAQDVAVADLAPTHPIRLGLA 
LNFSVFYYEILNSSEKACSMAKQAFEEAIAELDTLGEESYKDSTLIMQLLRDNLTLWTSD 
MQEQMDEA*
>AT5G65430.1 |  GRF8 (GENERAL REGULATORY FACTOR 8) protein binding / protein phosphorylated amino acid binding 
MATTLSRDQYVYMAKLAEQAERYEEMVQFMEQLVSGATPAGELTVEERNLLSVAYKNVIG 
SLRAAWRIVSSIEQKEESRKNEEHVSLVKDYRSKVETELSSICSGILRLLDSHLIPSATA 
SESKVFYLKMKGDYHRYLAEFKSGDERKTAAEDTMIAYKAAQDVAVADLAPTHPIRLGLA 
LNFSVFYYEILNSSEKACSMAKQAFEEAIAELDTLGEESYKDSTLIMQLLRDNLTLWTSD 
MQEQMDEA*
>AT5G65430.2 |  GRF8 (GENERAL REGULATORY FACTOR 8) protein binding / protein phosphorylated amino acid binding 
MATTLSRDQYVYMAKLAEQAERYEEMVQFMEQLVSGATPAGELTVEERNLLSVAYKNVIG 
SLRAAWRIVSSIEQKEESRKNEEHVSLVKDYRSKVETELSSICSGILRLLDSHLIPSATA 
SESKVFYLKMKGDYHRYLAEFKSGDERKTAAEDTMIAYKAAQDVAVADLAPTHPIRLGLA 
LNFSVFYYEILNSSEKACSMAKQAFEEAIAELDTLGEESYKDSTLIMQLLRDNLTLWTSD 
MQMDEA*
>AT5G65430.2 |  GRF8 (GENERAL REGULATORY FACTOR 8) protein binding / protein phosphorylated amino acid binding 
MATTLSRDQYVYMAKLAEQAERYEEMVQFMEQLVSGATPAGELTVEERNLLSVAYKNVIG 
SLRAAWRIVSSIEQKEESRKNEEHVSLVKDYRSKVETELSSICSGILRLLDSHLIPSATA 
SESKVFYLKMKGDYHRYLAEFKSGDERKTAAEDTMIAYKAAQDVAVADLAPTHPIRLGLA 
LNFSVFYYEILNSSEKACSMAKQAFEEAIAELDTLGEESYKDSTLIMQLLRDNLTLWTSD 
MQMDEA*
>AT2G18040.1 |  PIN1AT (PEPTIDYLPROLYL CIS/TRANS ISOMERASE NIMA-INTERACTING 1) peptidyl-prolyl cis-trans isomerase 
MASRDQVKASHILIKHQGSRRKASWKDPEGKIILTTTREAAVEQLKSIREDIVSGKANFE 
EVATRVSDCSSAKRGGDLGSFGRGQMQKPFEEATYALKVGDISDIVDTDSGVHIIKRTA*
>AT5G06550.1 |  FUNCTIONS IN molecular_function unknown INVOLVED IN cell surface receptor linked signal transduction EXPRESSED IN 19 plant structures EXPRESSED DURING 12 growth stages CONTAINS InterPro DOMAIN/s Cyclin-like F-box (InterProIPR001810) Transcription factor jumonji/aspartyl beta-hydroxylase (InterProIPR003347) Transcription factor jumonji (InterProIPR013129) BEST Arabidopsis thaliana protein match is transferase transferring glycosyl groups (TAIRAT1G782801) Has 1299 Blast hits to 1289 proteins in 204 species Archae - 0 Bacteria - 195 Metazoa - 777 Fungi - 106 Plants - 94 Viruses - 0 Other Eukaryotes - 127 (source NCBI BLink) 
MPKCKNLLLTSKRRKSKSKRLKLHQHEPESLFPEKEVEEEDEDEGGFKLKIAAPSQEHGV 
QPLGNLYFNPGAVNVRNTGLGNLQILSDELVLDILGLLGANHLGVLATVTKSFYIFANHE 
PLWRNLVLEELKGDFLFNGSWRSTYVAAYHPKFKFAGDGESNLKIIDFYSDYLFQSWLCA 
NLEMKPKWLRRDNITRVRGISVEDFITKFEEPNKPVLLEGCLDGWPAIEKWSRDYLTKVV 
GDVEFAVGPVEMKLEKYFRYSDGAREERPLYLFDPKFAEKVPVLDSEYDVPVYFREDLFG 
VLGNERPDYRWIIIGPAGSGSSFHIDPNSTSAWNAVITGSKKWVLFPPDVVPPGVHPSPD 
GAEVACPVSIIEWFMNFYDDTKDWEKKPIECICKAGEVMFVPNGWWHLVINLEESIAITQ 
NYASRSNLLNVLEFLKKPNAKELVSGTTDRENLHDKFKKAIEEAYPGTIQELEKKAEEAK 
RAEEQRVSFWDSAKTDTFKFSF*
>AT1G21190.1 |  small nuclear ribonucleoprotein putative / snRNP putative / Sm protein putative 
MSVEEDATVREPLDLIRLSIEERIYVKLRSDRELRGKLHAFDQHLNMILGDVEEVITTIE 
IDDETYEEIVRTTKRTVPFLFVRGDGVILVSPPLRTT*
>AT1G29330.1 |  ERD2 (ENDOPLASMIC RETICULUM RETENTION DEFECTIVE 2) KDEL sequence binding / receptor 
MNIFRFAGDMSHLISVLILLLKIYATKSCAGISLKTQELYALVFLTRYLDLFTDYVSLYN 
SIMKIVFIASSLAIVWCMRRHPLVRRSYDKDLDTFRHQYVVLACFVLGLILNEKFTVQEV 
FWAFSIYLEAVAILPQLVLLQRSGNVDNLTGQYVVFLGAYRGLYIINWIYRYFTEDHFTR 
WIACVSGLVQTALYADFFYYYYISWKTNTKLKLPA*
>AT5G40820.1 |  ATRAD3 binding / inositol or phosphatidylinositol kinase/ phosphotransferase alcohol group as acceptor / protein serine/threonine kinase 
MAKDDNNLSSLVHELRERVAASASTPANNLRHSSGDEDALEIRFRAVIPNLLNTYVVPSL 
GNGREVTAVLKLVGHTARNIPGVFYHGTPSAILPVIARIIPFFAEPEFVPGHGVLLETVG 
SLLMLLRSNSRKAYRIFFHDALQAIQDMQPIASLHSIEPEVCESHIPFRCFCMSFSGIGG 
DLPDANKPRDGDGLVLNLLGANRWQPFATCILKLICKCLTEGTLYVQGLIHTSFFKAACS 
LVCCGGADVQMACFEFATLVGSILTFNILPHVALIQSIILLLSADEGLPVYRNTIYDSTI 
GRFLTAVYSSCSDAAVKLTAESLVLVLSHALQRTKSEELKASLCSAYVRIVKSCPPCIWK 
IHCLLELLHLPEPCFQLIECFKAVLIVLGPGCVRVETTKCGSHTSATSDRPVQGINAGKK 
RHIEDESTYKRKRQKVGDDIRRGVYFAPEFADETDGKDAASLREMLISTVESLKPPPAGP 
SLSQTESSIVALSMLTNAFCFCPWTDMTHRLFNQMYAWIPWIAGQVEETNPIMFDISIYL 
EGIHNLLLVGVDPQYEYTSKGNDLVAIQFLLKLPWTHYMLFKTPSSLVKSKCLSVGIWTK 
LGLQDGSDFDIFSWSLSDDFEQVQAVAAISMPLKVLFSGLGALLHMFPKLEHLLEEKELM 
IKKAIPQSLGFLSCLYGSSTTDSEKTACHLLLHEDLKKDETLNSLLQGFRCSKCDKFIER 
EDEKHFRIIETPEMVKLKMDHHRDYFNLQSLYFNLLYDESSEETQLACVEVIRRILGHTS 
PDILVRTRSQWIRCLQYLLVHVNTDVREAFCAQIGIFVQHPIVSCLFLSEDATEKSCERN 
FFNLIEHSLAAAKDLLVIQTLLETTAEVMVAVDVTSELFLICLFLLIDQLDHPNLIVRIN 
ASKLINRSCYIHVKGGFATLLSTASHIQNELFDNLSVRLTSRPNVVREFAEAVLGVETEE 
LVRKMVPAVLPKLLVYWQENAQAANTLNELAKLIDTDVVPLIVNWLPRVLAFALNQEEDK 
NLLSVLQLYHSQIGSDNQEIFAAALPALLDELVCFVDIADTPETDRRLQRLPDAIKKISK 
VLTNAEDLPGFLQNHFVGLLNSIDRKMLHADDIFLQKQALKRIKLLIEMMGHYLSTYVPK 
LMVLLMHAIEKDALQSEGLLVLHFFTRKLADVSPSSIKYVISQIFAALIPFLEKEKEGPH 
VYLDEVVKILEELVLKNRDIVKEHICEFPLLPSIPSLGELNNAIQEARGLMSLKDQLRDI 
VNGMKHENLNVRYMVACELSKLLYNRNEDVAALIAGELVSDMEILSSLITYLLQGCAEES 
RTTVGQRLKLVCADCLGAIGAIDPAKVRVASCSRFKIQCSDDDLIFELIHKHLARAFRAA 
QDTIIQDSAALAIQELLKIAGCEPSLAGNVVVLTPQEHVQVNVSGSRRCGGNNEVKDRGQ 
KLWDRFSNYVKELIAPCLTSRFQLPNVSDPGSAGPIYRPSMSFRRWLSYWIRKLTAFATG 
SRVSIFAACRGIVRHDMQTATYLLPYLVLDVVCHGTEAARLSISEEILSVLDAAASENSG 
VTINSFGVGQSEVCVQAVFTLLDNLGQWVDDVKQGVALSSSLQSSGGRQVAPKSKDQVSN 
STTEQDHLLVQCKYVLELLLAIPKVTLARASFRCQAYARSLMYLESHVRGKSGSLNPAAE 
KTGIFENADVSSLMGIYSCLDEPDGLSGFASLSKSLNLQDQLLINKKSGNWADVFTACEQ 
ALQMEPTSVQRHSDVLNCLLNMCHHQTMVTHVDGLISRVPEYKKTWCTQGVQAAWRLGKW 
DLMDEYLDGADAEGLLFSSSDSNASFDRDVAKILHAMMKKDQYSVAEGIAISKQALIAPL 
AAAGMDSYTRAYPFVVKLHLLRELEDFQAVLNGDSYLEKSFSTSDQVFSKAVDNWENRLR 
FTQSSLWTREPLLAFRRLVFGASGLGAQVGNCWLQYAKLCRLAGHYETAHRAILEAQASG 
APNVHMEKAKLLWITKRSDSAIIELQQSLLNMPEGVVDSTVISSINSLLMAPPNPEPTVR 
NTQSFKEKKDVAKTLLLYSKWIHHSGQKQKKDVLNLYTQVKELLPWEKGYFHLAKYYDEL 
YVDARKCQQESSVFSSAGSKKGSVSSNLSTEKAGWDYLFKGMYFYAKALHSGHKNLFQAL 
PRLLTLWFDFGTIYKTSGSAGNKELKSTHMKIMSLMRGCLKDLPTYQWLTVLPQLVSRIC 
HQNADTVLMVKNIITSVLHQFPQQGLWIMAAVSKSTVPARREAAAEIIQGARKGFNQSDR 
GHNLFIQFASLTDHFIKLCFHGGQPRSKVINIATEFSALKRMMPLDIIMPIQQSLTISLP 
AFHMNNNERHSASVFSGSDLPTISGIADEAEILSSLQRPKKIILLGNDGIEYPFLCKPKD 
DLRKDARMMEFTAMINRLLSKYPESRRRKLYIRTFAVAPLTEDCGLVEWVPHTRGLRHIL 
QDIYISCGKFDRQKTNPQIKRIYDQCAVKKEYEMLKTKILPMFPPVFHKWFLTTFSEPAA 
WFRSRVAYAHTTAVWSMVGHIVGLGDRHGENILFDSTSGDCVHVDFSCLFDKGLQLEKPE 
LVPFRLTQNMIDGLGITGYEGIFMRVCEITLTVLRTHRETLMSILETFIHDPLVEWTKSH 
KSSGVEVQNPHAQRAISSIEARLQGVVVGVPLPVEGQARRLIADAVSLENLGKMYIWWMP 
WF*
>AT2G22370.1 |  unknown protein 
MSMECVVQGIIETQHVEALEILLQGLCGVQRERLRVHELCLRSGPNLGVVSSEVRLLCDL 
DQPEPTWTVKHVGGAMRGAGADQISVLVRNMIESKVSKNALRMFYALGYKLDHELLKVGF 
AFHFQRTAHISVSVSSVNKMPKVHAIDEAVPVTPGMQIVDVTAPATSENYSEVAAAVSSF 
CEFLAPLVHLSKPSISTGVVPTAAAAAASLMSDGGGTTL*
>AT4G23930.2 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN endomembrane system EXPRESSED IN 11 plant structures EXPRESSED DURING 6 growth stages CONTAINS InterPro DOMAIN/s Harpin-induced 1 (InterProIPR010847) BEST Arabidopsis thaliana protein match is proline-rich family protein (TAIRAT1G644501) Has 80 Blast hits to 75 proteins in 8 species Archae - 0 Bacteria - 0 Metazoa - 0 Fungi - 0 Plants - 80 Viruses - 0 Other Eukaryotes - 0 (source NCBI BLink) 
MSKSCSNLASCAVATLFIVFLIIAALTVYLTVFRPRDPEISVTSVKVPSFSVANSSLFYY 
GNRIGYTFVPAGEIESGRTKRMLATFSVQSFPLAAASSSQISAAQFQNSDRSGSTVEIES 
KLEMAGRVRVLGLFTHRIAAKCNCRIAISSSDGSIVAVRC*
>AT4G23930.2 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN endomembrane system EXPRESSED IN 11 plant structures EXPRESSED DURING 6 growth stages CONTAINS InterPro DOMAIN/s Harpin-induced 1 (InterProIPR010847) BEST Arabidopsis thaliana protein match is proline-rich family protein (TAIRAT1G644501) Has 183 Blast hits to 177 proteins in 15 species Archae - 0 Bacteria - 0 Metazoa - 0 Fungi - 0 Plants - 183 Viruses - 0 Other Eukaryotes - 0 (source NCBI BLink) 
MSKSCSNLASCAVATLFIVFLIIAALTVYLTVFRPRDPEISVTSVKVPSFSVANSSLFYY 
GNRIGYTFVPAGEIESGRTKRMLATFSVQSFPLAAASSSQISAAQFQNSDRSGSTVEIES 
KLEMAGRVRVLGLFTHRIAAKCNCRIAISSSDGSIVAVRC*
>AT4G23930.1 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN endomembrane system EXPRESSED IN 11 plant structures EXPRESSED DURING 6 growth stages CONTAINS InterPro DOMAIN/s Harpin-induced 1 (InterProIPR010847) BEST Arabidopsis thaliana protein match is proline-rich family protein (TAIRAT1G644501) Has 80 Blast hits to 75 proteins in 8 species Archae - 0 Bacteria - 0 Metazoa - 0 Fungi - 0 Plants - 80 Viruses - 0 Other Eukaryotes - 0 (source NCBI BLink) 
MSKSCSNLASCAVATLFIVFLIIAALTVYLTVFRPRDPEISVTSVKVPSFSVANSSVSFT 
FSQFSAVRNPNRAAFSHYNNVIQLFYYGNRIGYTFVPAGEIESGRTKRMLATFSVQSFPL 
AAASSSQISAAQFQNSDRSGSTVEIESKLEMAGRVRVLGLFTHRIAAKCNCRIAISSSDG 
SIVAVRC*
>AT4G23930.1 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN endomembrane system EXPRESSED IN 11 plant structures EXPRESSED DURING 6 growth stages CONTAINS InterPro DOMAIN/s Harpin-induced 1 (InterProIPR010847) BEST Arabidopsis thaliana protein match is proline-rich family protein (TAIRAT1G644501) Has 183 Blast hits to 177 proteins in 15 species Archae - 0 Bacteria - 0 Metazoa - 0 Fungi - 0 Plants - 183 Viruses - 0 Other Eukaryotes - 0 (source NCBI BLink) 
MSKSCSNLASCAVATLFIVFLIIAALTVYLTVFRPRDPEISVTSVKVPSFSVANSSVSFT 
FSQFSAVRNPNRAAFSHYNNVIQLFYYGNRIGYTFVPAGEIESGRTKRMLATFSVQSFPL 
AAASSSQISAAQFQNSDRSGSTVEIESKLEMAGRVRVLGLFTHRIAAKCNCRIAISSSDG 
SIVAVRC*
>AT3G57870.1 |  SCE1 (SUMO CONJUGATION ENZYME 1) SUMO ligase 
MASGIARGRLAEERKSWRKNHPHGFVAKPETGQDGTVNLMVWHCTIPGKAGTDWEGGFFP 
LTMHFSEDYPSKPPKCKFPQGFFHPNVYPSGTVCLSILNEDYGWRPAITVKQILVGIQDL 
LDTPNPADPAQTDGYHLFCQDPVEYKKRVKLQSKQYPALV*
>AT4G11330.1 |  ATMPK5 (MAP KINASE 5) MAP kinase/ kinase 
MAKEIESATDLGDTNIKGVLVHGGRYFQYNVYGNLFEVSNKYVPPIRPIGRGAYGFVCAA 
VDSETHEEIAIKKIGKAFDNKVDAKRTLREIKLLRHLEHENVVVIKDIIRPPKKEDFVDV 
YIVFELMDTDLHQIIRSNQSLNDDHCQYFLYQILRGLKYIHSANVLHRDLKPSNLLLNSN 
CDLKITDFGLARTTSETEYMTEYVVTRWYRAPELLLNSSEYTSAIDVWSVGCIFAEIMTR 
EPLFPGKDYVHQLKLITELIGSPDGASLEFLRSANARKYVKELPKFPRQNFSARFPSMNS 
TAIDLLEKMLVFDPVKRITVEEALCYPYLSALHDLNDEPVCSNHFSFHFEDPSSTEEEIK 
ELVWLESVKFNPLPSI*
>AT3G09800.1 |  protein binding 
MSPDSCPLVKKILLLDSEGKRVAVKYYSDDWPTNAAKLSFEKYVFSKTSKTNARTEAEIT 
LLDSNIIVYKFAQDLHFFVTGGENENELILASVLQGFFDAVALLLRSNVEKMEALENLDL 
IFLCLDEMVDQGVVLETDPNVIAGKVAMQSTEASGSLSEQTLTQALATAREHLARSLLT*
>AT3G09800.1 |  protein binding 
MSPDSCPLVKKILLLDSEGKRVAVKYYSDDWPTNAAKLSFEKYVFSKTSKTNARTEAEIT 
LLDSNIIVYKFAQDLHFFVTGGENENELILASVLQGFFDAVALLLRSNVEKMEALENLDL 
IFLCLDEMVDQGVVLETDPNVIAGKVAMQSTEASGSLSEQTLTQALATAREHLARSLLT*
>AT3G09800.2 |  protein binding 
MSPDSCPLVKKILLLDSEGKRVAVKYYSDDWPTNAAKLSFEKYVFSKTSKTNARTEAEIT 
LLDSNIIVYKFAQDLHFFVTGGENENELILASVLQGFFDAVALLLRSNVEKMEALENLDL 
IFLCLDEMVDQGYASLIV*
>AT3G09800.2 |  protein binding 
MSPDSCPLVKKILLLDSEGKRVAVKYYSDDWPTNAAKLSFEKYVFSKTSKTNARTEAEIT 
LLDSNIIVYKFAQDLHFFVTGGENENELILASVLQGFFDAVALLLRSNVEKMEALENLDL 
IFLCLDEMVDQGYASLIV*
>AT3G16840.1 |  ATP binding / ATP-dependent helicase/ helicase/ nucleic acid binding 
MVTGDKESSLMKKRNKRSHKRKREEDFERIDSLPWSSSIPIGEDDEGESFSTLFSGSGQL 
DGGFLSLEEIDEADYHLTLPTIESEITERKQSPEDDDDTNETVDEMIEGEEAEEDGEGRD 
DEDDEDDEETRKKKEKKAKRNKEKKKEKKKKKQKKINEAAKNQDASAVSCDGDDTVEEQV 
EEEEIPPEFSAWSSMRLHPLLMKSIYRLDFKEPTKIQKACFNVAAYQGKDVIGAAETGSG 
KTLAFGLPILQRLLDEREKVGKLYALKGEEAQKYAADGYLRALIITPTRELALQVTEHLE 
NAAKNLSVKVVPIVGGMFSEKQERRLKEKPEIVVATPGRLWELMSAGEKHLVELHSLSFF 
VLDEADRMVERGHFRELQSILDLLPVTDKPNEGKTQTVKSNDTVLNVPKKKRQTFVFSAT 
IALSSDFRKKLKRGSSKSKQSSSGEVNSIEVLSERAGMRDNVAIIDLTTTSILAPKIEES 
FIKCEEKEKDAYLYYILSVHGQGRTIVFCTSVTDLRHISGLLKILGLDVCTLFSEMKQRA 
RLKSIDRFRASENGILIATDLVARGIDIKNVRTIIHYKLPHSAEVYVHRCGRTARAFADG 
CSIALIEPNETSKFYTLCKSFSMESVKIFPLDNSYMPAVRKRLYLARQIYEIERKGSREN 
ADRTWLKKHAESMELELDDEESEEERVDNVRQRKATSARLNKLKEELSTLLSHPMQPKKF 
SGRYFAGVGVSTLMQNQFVELKKQKQAQMQIGGDIKRRKLVVINQNCIEPLQALRAGGNE 
MLKMKGQSAEKRRDIASLKKKRKEEKIGRRDQRRNQKKQRKLMASS*
>AT5G44740.1 |  POLH (Y-FAMILT DNA POLYMERASE H) DNA-directed DNA polymerase 
MRLKLLVLRFNWFKFLWLVVVSILAKSGKCERASIDEVYLDLTDAAESMLADAPPESLEL 
IDEEVLKSHILGMNREDGDDFKESVRNWICREDADRRDKLLSCGIIIVAELRKQVLKETE 
FTCSAGIAHNKMLAKLASGMNKPAQQTVVPYAAVQELLSSLPIKKMKQLGGKLGTSLQTD 
LGVDTVGDLLQFSETKLQEHYGVNTGTWLWNIARGISGEEVQGRLLPKSHGSGKTFPGPR 
ALKSLSTVQHWLNQLSEELSERLGSDLEQNKRIASTLTLHASAFRSKDSDSHKKFPSKSC 
PMRYGVTKIQEDAFNLFQAALREYMGSFGIKPQGNKLETWRITGLSVSASKIVDIPSGTS 
SIMRYFQSQPTVPSRSADGCVQGNVAMTASASEGCSEQRSTETQAAMPEVDTGVTYTLPN 
FENQDKDIDLVSEKDVVSCPSNEATDVSTQSESNKGTQTKKIGRKMNNSKEKNRGMPSIV 
DIFKNYNATPPSKQETQEDSTVSSASKRAKLSSSSHNSQVNQEVEESRETDWGYKTDEID 
QSVFDELPVEIQRELRSFLRTNKQFNTGKSKGDGSTSSIAHYFPPLNR*
>AT5G44740.1 |  POLH (Y-FAMILT DNA POLYMERASE H) DNA-directed DNA polymerase 
MRLKLLVLRFNWFKFLWLVVVSILAKSGKCERASIDEVYLDLTDAAESMLADAPPESLEL 
IDEEVLKSHILGMNREDGDDFKESVRNWICREDADRRDKLLSCGIIIVAELRKQVLKETE 
FTCSAGIAHNKMLAKLASGMNKPAQQTVVPYAAVQELLSSLPIKKMKQLGGKLGTSLQTD 
LGVDTVGDLLQFSETKLQEHYGVNTGTWLWNIARGISGEEVQGRLLPKSHGSGKTFPGPR 
ALKSLSTVQHWLNQLSEELSERLGSDLEQNKRIASTLTLHASAFRSKDSDSHKKFPSKSC 
PMRYGVTKIQEDAFNLFQAALREYMGSFGIKPQGNKLETWRITGLSVSASKIVDIPSGTS 
SIMRYFQSQPTVPSRSADGCVQGNVAMTASASEGCSEQRSTETQAAMPEVDTGVTYTLPN 
FENQDKDIDLVSEKDVVSCPSNEATDVSTQSESNKGTQTKKIGRKMNNSKEKNRGMPSIV 
DIFKNYNATPPSKQETQEDSTVSSASKRAKLSSSSHNSQVNQEVEESRETDWGYKTDEID 
QSVFDELPVEIQRELRSFLRTNKQFNTGKSKGDGSTSSIAHYFPPLNR*
>AT5G44740.2 |  POLH (Y-FAMILT DNA POLYMERASE H) DNA-directed DNA polymerase 
MPVARPEASDARVIAHVDMDCFYVQVEQRKQPELRGLPSAVVQYNEWQGGGLIAVSYEAR 
KCGVKRSMRGDEAKAACPQIQLVQVPVARGKADLNLYRSAGSEVVSILAKSGKCERASID 
EVYLDLTDAAESMLADAPPESLELIDEEVLKSHILGMNREDGDDFKESVRNWICREDADR 
RDKLLSCGIIIVAELRKQVLKETEFTCSAGIAHNKMLAKLASGMNKPAQQTVVPYAAVQE 
LLSSLPIKKMKQLGGKLGTSLQTDLGVDTVGDLLQFSETKLQEHYGVNTGTWLWNIARGI 
SGEEVQGRLLPKSHGSGKTFPGPRALKSLSTVQHWLNQLSEELSERLGSDLEQNKRIAST 
LTLHASAFRSKDSDSHKKFPSKSCPMRYGVTKIQEDAFNLFQAALREYMGSFGIKPQGNK 
LETWRITGLSVSASKIVDIPSGTSSIMRYFQSQPTVPSRSADGCVQGNVAMTASASEGCS 
EQRSTETQAAMPEVDTGVTYTLPNFENQDKDIDLVSEKDVVSCPSNEATDVSTQSESNKG 
TQTKKIGRKMNNSKEKNRGMPSIVDIFKNYNATPPSKQETQEDSTVSSASKRAKLSSSSH 
NSQVNQEVEESRETDWGYKTDEIDQSVFDELPVEIQRELRSFLRTNKQFNTGKSKGDGST 
SSIAHYFPPLNR*
>AT5G44740.2 |  POLH (Y-FAMILT DNA POLYMERASE H) DNA-directed DNA polymerase 
MPVARPEASDARVIAHVDMDCFYVQVEQRKQPELRGLPSAVVQYNEWQGGGLIAVSYEAR 
KCGVKRSMRGDEAKAACPQIQLVQVPVARGKADLNLYRSAGSEVVSILAKSGKCERASID 
EVYLDLTDAAESMLADAPPESLELIDEEVLKSHILGMNREDGDDFKESVRNWICREDADR 
RDKLLSCGIIIVAELRKQVLKETEFTCSAGIAHNKMLAKLASGMNKPAQQTVVPYAAVQE 
LLSSLPIKKMKQLGGKLGTSLQTDLGVDTVGDLLQFSETKLQEHYGVNTGTWLWNIARGI 
SGEEVQGRLLPKSHGSGKTFPGPRALKSLSTVQHWLNQLSEELSERLGSDLEQNKRIAST 
LTLHASAFRSKDSDSHKKFPSKSCPMRYGVTKIQEDAFNLFQAALREYMGSFGIKPQGNK 
LETWRITGLSVSASKIVDIPSGTSSIMRYFQSQPTVPSRSADGCVQGNVAMTASASEGCS 
EQRSTETQAAMPEVDTGVTYTLPNFENQDKDIDLVSEKDVVSCPSNEATDVSTQSESNKG 
TQTKKIGRKMNNSKEKNRGMPSIVDIFKNYNATPPSKQETQEDSTVSSASKRAKLSSSSH 
NSQVNQEVEESRETDWGYKTDEIDQSVFDELPVEIQRELRSFLRTNKQFNTGKSKGDGST 
SSIAHYFPPLNR*
>AT5G53940.1 |  yippee family protein 
MGRIFTVELEGRSYRCRFCRTHLALPDDLVSRSFHCRRGKAYLFNRSVNISMGPLEERLM 
LSGMHTVADIFCCCCGQNVGWKYESAHEKAQKYKEGKFVLERGRIVDEIDLSTEVYIDTH 
GSTSDTEDS*
>AT2G37320.1 |  pentatricopeptide (PPR) repeat-containing protein 
MSCLRNYYCRAFGYKQSRSCYSRSLNREIANESSEVERRARSLRVLDIISSKSGGVSNRQ 
DHFGFVQEFRQTDSWRFRGQAISEDFDLSRTKNGVSSVLEEVMLEDSSSSVKRDGWSFDA 
YGLSSAVRSCGLNRDFRTGSGFHCLALKGGFISDVYLGSSLVVLYRDSGEVENAYKVFEE 
MPERNVVSWTAMISGFAQEWRVDICLKLYSKMRKSTSDPNDYTFTALLSACTGSGALGQG 
RSVHCQTLHMGLKSYLHISNSLISMYCKCGDLKDAFRIFDQFSNKDVVSWNSMIAGYAQH 
GLAMQAIELFELMMPKSGTKPDAITYLGVLSSCRHAGLVKEGRKFFNLMAEHGLKPELNH 
YSCLVDLLGRFGLLQEALELIENMPMKPNSVIWGSLLFSCRVHGDVWTGIRAAEERLMLE 
PDCAATHVQLANLYASVGYWKEAATVRKLMKDKGLKTNPGCSWIEINNYVFMFKAEDGSN 
CRMLEIVHVLHCLIDHMEFL*
>AT1G06580.1 |  pentatricopeptide (PPR) repeat-containing protein 
MRRSIVIVIALTAKGFLHRHLLEKGNLVTALSLRICNSRAFSGRSDYRERLRSGLHSIKF 
NDALTLFCDMAESHPLPSIVDFSRLLIAIAKLNKYEAVISLFRHLEMLGISHDLYSFTTL 
IDCFCRCARLSLALSCLGKMMKLGFEPSIVTFGSLVNGFCHVNRFYEAMSLVDQIVGLGY 
EPNVVIYNTIIDSLCEKGQVNTALDVLKHMKKMGIRPDVVTYNSLITRLFHSGTWGVSAR 
ILSDMMRMGISPDVITFSALIDVYGKEGQLLEAKKQYNEMIQRSVNPNIVTYNSLINGLC 
IHGLLDEAKKVLNVLVSKGFFPNAVTYNTLINGYCKAKRVDDGMKILCVMSRDGVDGDTF 
TYNTLYQGYCQAGKFSAAEKVLGRMVSCGVHPDMYTFNILLDGLCDHGKIGKALVRLEDL 
QKSKTVVGIITYNIIIKGLCKADKVEDAWYLFCSLALKGVSPDVITYITMMIGLRRKRLW 
REAHELYRKMQKEDGLMPIK*
>AT4G01030.1 |  pentatricopeptide (PPR) repeat-containing protein 
MYIKTGYLPYARMVFDMMDAKNIVAWNSLVSGLSYACLLKDAEALMIRMEKEGIKPDAIT 
WNSLASGYATLGKPEKALDVIGKMKEKGVAPNVVSWTAIFSGCSKNGNFRNALKVFIKMQ 
EEGVGPNAATMSTLLKILGCLSLLHSGKEVHGFCLRKNLICDAYVATALVDMYGKSGDLQ 
SAIEIFWGIKNKSLASWNCMLMGYAMFGRGEEGIAAFSVMLEAGMEPDAITFTSVLSVCK 
NSGLVQEGWKYFDLMRSRYGIIPTIEHCSCMVDLLGRSGYLDEAWDFIQTMSLKPDATIW 
GAFLSSCKIHRDLELAEIAWKRLQVLEPHNSANYMMMINLYSNLNRWEDVERIRNLMRNN 
RVRVQDLWSWIQIDQTVHIFYAEGKTHPDEGDIYFELYKLVSEMKKSGYVPDTSCIHQDI 
SDSEKEKLLMGHTEKLAMTYGLIKKKGLAPIRVVKNTNICSDSHTVAKYMSVLRNREIVL 
QEGARVHHFRDGKCSCNDSW*