>AT3G62870.1 |  60S ribosomal protein L7A (RPL7aB) 
MAPKKGVKVASKKKPEKVTNPLFERRPKQFGIGGALPPKKDLSRYIKWPKSIRLQRQKRI 
LKQRLKVPPALNQFTKTLDKNLATSLFKILLKYRPEDKAAKKERLLNKAQAEAEGKPAES 
KKPIVVKYGLNHVTYLIEQNKAQLVVIAHDVDPIELVVWLPALCRKMEVPYCIVKGKSRL 
GAVVHQKTAAALCLTTVKNEDKLEFSKILEAIKANFNDKYEEYRKKWGGGIMGSKSQAKT 
KAKERVIAKEAAQRMN*
>AT3G26690.1 |  ATNUDX13 (ARABIDOPSIS THALIANA NUDIX HYDROLASE HOMOLOG 13) bis(5-adenosyl)-pentaphosphatase/ hydrolase 
MSNLSARTGRDHQRYDNNFRLVSGCIPYRLVKDEEEDSTSVDFENKLQVLMISSPNRHDL 
VFPKGGWEDDETVLEAASREAMEEAGVKGILREDPLGVWEFRSKSSSVEADCCLGGGCKG 
YMFALEVKEELAIWPEQDDRERRWLNVKEALELCRYEWMQSALEEFLRVMAEEGSTKEDS 
LAISSISNRGERQIDPRYCFVV*
>AT3G26690.2 |  ATNUDX13 (ARABIDOPSIS THALIANA NUDIX HYDROLASE HOMOLOG 13) bis(5-adenosyl)-pentaphosphatase/ hydrolase 
MSNLSARTGRDHQRYDNNFRLVSGCIPYRLVKDEEEDSTSVDFENKLQVLMISSPNRHDL 
VFPKGGWEDDETVLEAASREAMEEAGVKGILREDPLGVWEFRSKSSSVEADCCLGGGCKG 
YMFALEVKEELAIWPEQDDRERRWLNVKEALELCRYEWMQSALEEFLRVMAEEGSTKEDS 
LAISSISNRGERQIDPRYCFVV*
>AT3G22890.1 |  APS1 (ATP SULFURYLASE 1) sulfate adenylyltransferase (ATP) 
MASMAAVLSKTPFLSQPLTKSSPNSDLPFAAVSFPSKSLRRRVGSIRAGLIAPDGGKLVE 
LIVEEPKRREKKHEAADLPRVELTAIDLQWMHVLSEGWASPLGGFMRESEFLQTLHFNSL 
RLDDGSVVNMSVPIVLAIDDEQKARIGESTRVALFNSDGNPVAILSDIEIYKHPKEERIA 
RTWGTTAPGLPYVDEAITNAGNWLIGGDLEVLEPVKYNDGLDRFRLSPAELRKELEKRNA 
DAVFAFQLRNPVHNGHALLMTDTRRRLLEMGYKNPILLLHPLGGFTKADDVPLDWRMKQH 
EKVLEDGVLDPETTVVSIFPSPMHYAGPTEVQWHAKARINAGANFYIVGRDPAGMGHPVE 
KRDLYDADHGKKVLSMAPGLERLNILPFRVAAYDKTQGKMAFFDPSRPQDFLFISGTKMR 
TLAKNNENPPDGFMCPGGWKVLVDYYESLTPAGNGRLPEVVPV*
>AT2G26080.1 |  AtGLDP2 (Arabidopsis thaliana glycine decarboxylase P-protein 2) ATP binding / glycine dehydrogenase (decarboxylating) 
MERARRLAYRGIVKRLVNETKRHRNGESSLLPTTTVTPSRYVSSVSSFLHRRRDVSGSAF 
TTSGRNQHQTRSISVDALKPSDTFPRRHNSATPDEQAQMANYCGFDNLNTLIDSTVPKSI 
RLDSMKFSGIFDEGLTESQMIEHMSDLASKNKVFKSFIGMGYYNTHVPPVILRNIMENPA 
WYTQYTPYQAEISQGRLESLLNYQTVITDLTGLPMSNASLLDEGTAAAEAMAMCNNILKG 
KKKTFVIASNCHPQTIDVCKTRADGFDLKVVTVDIKDVDYSSGDVCGVLVQYPGTEGEVL 
DYGEFVKNAHANGVKVVMATDLLALTMLKPPGEFGADIVVGSGQRFGVPMGYGGPHAAFL 
ATSQEYKRMMPGRIIGVSVDSSGKQALRMAMQTREQHIRRDKATSNICTAQALLANMTAM 
YAVYHGPEGLKSIAQRVHGLAGVFALGLKKLGTAQVQDLPFFDTVKVTCSDATAIFDVAA 
KKEINLRLVDSNTITVAFDETTTLDDVDKLFEVFASGKPVQFTAESLAPEFNNAIPSSLT 
RESPYLTHPIFNMYHTEHELLRYIHKLQNKDLSLCHSMIPLGSCTMKLNATTEMMPVTWP 
SFTNMHPFAPVEQAQGYQEMFTNLGELLCTITGFDSFSLQPNAGAAGEYAGLMVIRAYHM 
SRGDHHRNVCIIPVSAHGTNPASAAMCGMKIVAVGTDAKGNINIEELRNAAEANKDNLAA 
LMVTYPSTHGVYEEGIDEICNIIHENGGQVYMDGANMNAQVGLTSPGFIGADVCHLNLHK 
TFCIPHGGGGPGMGPIGVKQHLAPFLPSHPVIPTGGIPEPEQTSPLGTISAAPWGSALIL 
PISYTYIAMMGSGGLTDASKIAILNANYMAKRLESHYPVLFRGVNGTVAHEFIIDLRGFK 
NTAGIEPEDVAKRLMDYGFHGPTMSWPVPGTLMIEPTESESKAELDRFCDALISIREEIS 
QIEKGNADPNNNVLKGAPHPPSLLMADTWKKPYSREYAAFPAPWLRSSKFWPTTGRVDNV 
YGDRNLVCTLQPANEEQAAAAVSA*
>AT2G30390.1 |  FC2 (FERROCHELATASE 2) ferrochelatase 
MNCPAMTASPSSSSSSSYSTFRPPPPLLPQLSNDSQRSVVMHCTRLPFEAFAATSSNRLL 
GKHSLPLRAALVTSNPLNISSSSVISDAISSSSVITDDAKIGVLLLNLGGPETLDDVQPF 
LFNLFADPDIIRLPPVFQFLQKPLAQFISVARAPKSKEGYASIGGGSPLRHITDAQAEEL 
RKCLWEKNVPAKVYVGMRYWHPFTEEAIEQIKRDGITKLVVLPLYPQFSISTSGSSLRLL 
ERIFREDEYLVNMQHTVIPSWYQREGYIKAMANLIQSELGKFGSPNQVVIFFSAHGVPLA 
YVEEAGDPYKAEMEECVDLIMEELDKRKITNAYTLAYQSRVGPVEWLKPYTEEAITELGK 
KGVENLLAVPISFVSEHIETLEEIDVEYKELALKSGIKNWGRVPALGTEPMFISDLADAV 
VESLPYVGAMAVSNLEARQSLVPLGSVEELLATYDSQRRELPAPVTMWEWGWTKSAETWN 
GRAAMLAVLALLVLEVTTGKGFLHQWGILPSL*
>AT5G51820.1 |  PGM (PHOSPHOGLUCOMUTASE) phosphoglucomutase 
MTSTYTRFDTVFLFSRFAGAKYSPLLPSPSFTLSTSGIHIRTKPNSRFHSIIASSSSSSV 
VAGTDSIEIKSLPTKPIEGQKTGTSGLRKKVKVFMEDNYLANWIQALFNSLPLEDYKNAT 
LVLGGDGRYFNKEASQIIIKIAAGNGVGQILVGKEGILSTPAVSAVIRKRKANGGFIMSA 
SHNPGGPEYDWGIKFNYSSGQPAPETITDKIYGNTLSISEIKVAEIPDIDLSQVGVTKYG 
NFSVEVIDPVSDYLELMEDVFDFDLIRGLLSRSDFGFMFDAMHAVTGAYAKPIFVDNLGA 
KPDSISNGVPLEDFGHGHPDPNLTYAKDLVDVMYRDNGPDFGAASDGDGDRNMVLGNKFF 
VTPSDSVAIIAANAQEAIPYFRAGPKGLARSMPTSGALDRVAEKLKLPFFEVPTGWKFFG 
NLMDAGKLSICGEESFGTGSDHIREKDGIWAVLAWLSILAHRNKDTKPGDKLVSVADVVK 
EYWATYGRNFFSRYDYEECESEGANKMIEYLREILSKSKAGDVYGNYVLQFADDFSYTDP 
VDGSVASKQGVRFVFTDGSRIIFRLSGTGSAGATVRIYIEQFEPDVSKHDVDAQIALKPL 
IDLALSVSKLKDFTGREKPTVIT*
>AT3G25800.1 |  PP2AA2 (PROTEIN PHOSPHATASE 2A SUBUNIT A2) protein phosphatase type 2A regulator 
MSMIDEPLYPIAVLIDELKNDDIQLRLNSIRRLSTIARALGEERTRKELIPFLSENNDDD 
DEVLLAMAEELGVFIPYVGGVEYAHVLLPPLETLSTVEETCVREKAVESLCRVGSQMRES 
DLVDHFISLVKRLAAGEWFTARVSACGVFHIAYPSAPDMLKTELRSLYTQLCQDDMPMVR 
RAAATNLGKFAATVESAHLKTDVMSMFEDLTQDDQDSVRLLAVEGCAALGKLLEPQDCVQ 
HILPVIVNFSQDKSWRVRYMVANQLYELCEAVGPEPTRTELVPAYVRLLRDNEAEVRIAA 
AGKVTKFCRILNPEIAIQHILPCVKELSSDSSQHVRSALASVIMGMAPVLGKDATIEHLL 
PIFLSLLKDEFPDVRLNIISKLDQVNQVIGIDLLSQSLLPAIVELAEDRHWRVRLAIIEY 
IPLLASQLGVGFFDDKLGALCMQWLQDKVHSIRDAAANNLKRLAEEFGPEWAMQHIVPQV 
LEMVNNPHYLYRMTILRAVSLLAPVMGSEITCSKLLPVVMTASKDRVPNIKFNVAKVLQS 
LIPIVDQSVVEKTIRPGLVELSEDPDVDVRFFANQALQSIDNVMMSS*
>AT3G25800.2 |  PP2AA2 (PROTEIN PHOSPHATASE 2A SUBUNIT A2) protein phosphatase type 2A regulator 
MSMIDEPLYPIAVLIDELKNDDIQLRLNSIRRLSTIARALGEERTRKELIPFLSENNDDD 
DEVLLAMAEELGVFIPYVGGVEYAHVLLPPLETLSTVEETCVREKAVESLCRVGSQMRES 
DLVDHFISLVKRLAAGEWFTARVSACGVFHIAYPSAPDMLKTELRSLYTQLCQDDMPMVR 
RAAATNLGKFAATVESAHLKTDVMSMFEDLTQDDQDSVRLLAVEGCAALGKLLEPQDCVQ 
HILPVIVNFSQDKSWRVRYMVANQLYELCEAVGPEPTRTELVPAYVRLLRDNEAEVRIAA 
AGKVTKFCRILNPEIAIQHILPCVKELSSDSSQHVRSALASVIMGMAPVLGKDATIEHLL 
PIFLSLLKDEFPDVRLNIISKLDQVNQVIGIDLLSQSLLPAIVELAEDRHWRVRLAIIEY 
IPLLASQLGVGFFDDKLGALCMQWLQDKVHSIRDAAANNLKRLAEEFGPEWAMQHIVPQV 
LEMVNNPHYLYRMTILRAVSLLAPVMGSEITCSKLLPVVMTASKDRQFQTSNSTSLKYFN 
PSFQ*
>AT5G22780.1 |  adaptin family protein 
MTGMRGLSVFISDVRNCQNKEAERLRVDKELGNIRTCFKNEKVLTPYKKKKYVWKMLYIH 
MLGYDVDFGHMEAVSLISAPKYPEKQVGYIVTSCLLNENHDFLKLAINTVRNDIIGRNET 
FQCLALTLVGNIGGRDFAESLAPDVQKLLISSSCRPLVRKKAALCLLRLFRKNPDAVNVD 
GWADRMAQLLDERDLGVLTSSTSLLVALVSNNHEAYSSCLPKCVKILERLARNQDVPQEY 
TYYGIPSPWLQVKAMRALQYFPTIEDPSTRKALFEVLQRILMGTDVVKNVNKNNASHAVL 
FEALSLVMHLDAEKEMMSQCVALLGKFISVREPNIRYLGLENMTRMLMVTDVQDIIKKHQ 
SQIITSLKDPDISIRRRALDLLYGMCDVSNAKDIVEELLQYLSTAEFSMREELSLKAAIL 
AEKFAPDLSWYVDVILQLIDKAGDFVSDDIWFRVVQFVTNNEDLQPYAASKAREYMDKIA 
IHETMVKVSAYILGEYGHLLARQPGCSASELFSILHEKLPTVSTPTIPILLSTYAKLLMH 
AQPPDPELQKKVWAVFKKYESCIDVEIQQRAVEYFELSKKGPAFMDVLAEMPKFPERQSS 
LIKKAENVEDTADQSAIKLRAQQQPSNAIVLADPQPVNGAPPPLKVPILSGSTDPESVAR 
SLSHPNGTLSNIDPQTPSPDLLSDLLGPLAIEAPPGAVSYEQHGPVGAEGVPDEIDGSAI 
VPVEEQTNTVELIGNIAERFHALCLKDSGVLYEDPHIQIGIKAEWRGHHGRLVLFMGNKN 
TSPLTSVQALILPPAHLRLDLSPVPDTIPPRAQVQSPLEVMNIRPSRDVAVLDFSYKFGT 
NVVSAKLRIPATLNKFLQPLQLTSEEFFPQWRAISGPPLKLQEVVRGVRPLALPEMANLF 
NSFHVTICPGLDPNPNNLVASTTFYSETTGAMLCLARIETDPADRTQLRLTVGSGDPTLT 
FELKEFIKEQLITIPMGSRALVPAAGPAPSPAVQPPSPAALADDPGAMLAGLL*
>AT1G80050.1 |  APT2 (ADENINE PHOSPHORIBOSYL TRANSFERASE 2) adenine phosphoribosyltransferase/ phosphate transmembrane transporter 
MFAVENGLQGDPRLKAISDAIRVIPHFPKTGIMFQDITTLLLDPVAFKHVVDIFVDRYKH 
MNISLVAGVEARGFIFGPPIALAIGAKFVPLRKPGKLPGRVISEEYELEYGRDCLEMSVE 
AVKSEERALIIDDLVATGGTLSASINLLERAGAEVVECACVVGLPKFKGQCKLKGKPLYV 
LVEPNQFDELTL*
>AT2G34450.1 |  high mobility group (HMG1/2) family protein 
MTKRAPKSGPLSPSCSGGSSRNLELAVKSSEGARRSTRLRLQPLRKPKTSPKKKPVKLQT 
KMPKKPATAFFFFLDDFRKQYQEENPDVKSMREIGKTCGEKWKTMTYEEKVKYYDIATEK 
REEFHRAMTEYTKRMESGAHDESETDSDYSE*
>AT2G34450.2 |  high mobility group (HMG1/2) family protein 
MTKRAPKSGPLSPSCSGGSSRNLELAVKSSEGARRSTRLRLQPLRKPKTSPKKKPVKLQT 
KMPKKPATAFFFFLDDFRKQYQEENPDVKSMREVIGKTCGEKWKTMTYEEKVKYYDIATE 
KREEFHRAMTEYTKRMESGAHDESETDSDYSE*
>AT4G16420.3 |  ADA2B (HOMOLOG OF YEAST ADA2 2B) DNA binding / transcription coactivator/ transcription factor 
MGRSRGNFQNFEDPTQRTRKKKNAANVENFESTSLVPGAEGGGKYNCDYCQKDITGKIRI 
KCAVCPDFDLCIECMSVGAEITPHKCDHPYRVMGNLTFPLICPDWSADDEMLLLEGLEIY 
GLGNWAEVAEHVGTKSKEQCLEHYRNIYLNSPFFPLPDMSHVAGKNRKELQAMAKGRIDD 
KKEQNMKEEYPFSPPKVKVEDTQKESFVDRSFGGKKPVSTSVNNSLVELSNYNQKREEFD 
PEYDNDAEQLLAEMEFKENDTPEEHELKLRVLRIYSKRLDERKRRKEFIIERNLLYPNPF 
EKDLSQEEKVQCRRLDVFMRFHSKEEHDELLRNVVSEYRMVKRLKDLKEAQVAGCRSTAE 
AERYLGRKRKRENEEGMNRGKESGQFGQIAGEMGSRPPVQASSSYVNDLDLIGFTESQLL 
SESEKRLCSEVKLVPPVYLQMQQVMSHEIFKGNVTKKSDAYSLFKIDPTKVDRVYDMLVK 
KGIAQL*
>AT4G16420.2 |  ADA2B (HOMOLOG OF YEAST ADA2 2B) DNA binding / transcription coactivator/ transcription factor 
MGRSRGNFQNFEDPTQRTRKKKNAANVENFESTSLVPGAEGGGKYNCDYCQKDITGKIRI 
KCAVCPDFDLCIECMSVGAEITPHKCDHPYRVMGNLTFPLICPDWSADDEMLLLEGLEIY 
GLGNWAEVAEHVGTKSKEQCLEHYRNIYLNSPFFPLPDMSHVAGKNRKELQAMAKGRIDD 
KKAEQNMKEEYPFSPPKVKVEDTQKDRSFGGKKPVSTSVNNSLVELSNYNQKREEFDPEY 
DNDAEQLLAEMEFKENDTPEEHELKLRVLRIYSKRLDERKRRKEFIIERNLLYPNPFEKD 
LSQEEKVQCRRLDVFMRFHSKEEHDELLRNVVSEYRMVKRLKDLKEAQVAGCRSTAEAER 
YLGRKRKRENEEGMNRGKESGQFGQIAGEMGSRPPVQASSSYVNDLDLIGFTESQLLSES 
EKRLCSEVKLVPPVYLQMQQVMSHEIFKGNVTKKSDAYSLFKIDPTKVDRVYDMLVKKGI 
AQL*
>AT4G16420.1 |  ADA2B (HOMOLOG OF YEAST ADA2 2B) DNA binding / transcription coactivator/ transcription factor 
MGRSRGNFQNFEDPTQRTRKKKNAANVENFESTSLVPGAEGGGKYNCDYCQKDITGKIRI 
KCAVCPDFDLCIECMSVGAEITPHKCDHPYRVMGNLTFPLICPDWSADDEMLLLEGLEIY 
GLGNWAEVAEHVGTKSKEQCLEHYRNIYLNSPFFPLPDMSHVAGKNRKELQAMAKGRIDD 
KKAEQNMKEEYPFSPPKVKVEDTQKESFVDRSFGGKKPVSTSVNNSLVELSNYNQKREEF 
DPEYDNDAEQLLAEMEFKENDTPEEHELKLRVLRIYSKRLDERKRRKEFIIERNLLYPNP 
FEKDLSQEEKVQCRRLDVFMRFHSKEEHDELLRNVVSEYRMVKRLKDLKEAQVAGCRSTA 
EAERYLGRKRKRENEEGMNRGKESGQFGQIAGEMGSRPPVQASSSYVNDLDLIGFTESQL 
LSESEKRLCSEVKLVPPVYLQMQQVMSHEIFKGNVTKKSDAYSLFKIDPTKVDRVYDMLV 
KKGIAQL*
>AT4G21710.1 |  NRPB2 DNA binding / DNA-directed RNA polymerase 
MEYNEYEPEPQYVEDDDDEEITQEDAWAVISAYFEEKGLVRQQLDSFDEFIQNTMQEIVD 
ESADIEIRPESQHNPGHQSDFAETIYKISFGQIYLSKPMMTESDGETATLFPKAARLRNL 
TYSAPLYVDVTKRVIKKGHDGEEVTETQDFTKVFIGKVPIMLRSSYCTLFQNSEKDLTEL 
GECPYDQGGYFIINGSEKVLIAQEKMSTNHVYVFKKRQPNKYAYVGEVRSMAENQNRPPS 
TMFVRMLARASAKGGSSGQYIRCTLPYIRTEIPIIIVFRALGFVADKDILEHICYDFADT 
QMMELLRPSLEEAFVIQNQLVALDYIGKRGATVGVTKEKRIKYARDILQKEMLPHVGIGE 
HCETKKAYYFGYIIHRLLLCALGRRPEDDRDHYGNKRLDLAGPLLGGLFRMLFRKLTRDV 
RSYVQKCVDNGKEVNLQFAIKAKTITSGLKYSLATGNWGQANAAGTRAGVSQVLNRLTYA 
STLSHLRRLNSPIGREGKLAKPRQLHNSQWGMMCPAETPEGQACGLVKNLALMVYITVGS 
AAYPILEFLEEWGTENFEEISPSVIPQATKIFVNGMWVGVHRDPDMLVKTLRRLRRRVDV 
NTEVGVVRDIRLKELRIYTDYGRCSRPLFIVDNQKLLIKKRDIYALQQRESAEEDGWHHL 
VAKGFIEYIDTEEEETTMISMTISDLVQARLRPEEAYTENYTHCEIHPSLILGVCASIIP 
FPDHNQSPRNTYQSAMGKQAMGIYVTNYQFRMDTLAYVLYYPQKPLVTTRAMEHLHFRQL 
PAGINAIVAISCYSGYNQEDSVIMNQSSIDRGFFRSLFFRSYRDEEKKMGTLVKEDFGRP 
DRGSTMGMRHGSYDKLDDDGLAPPGTRVSGEDVIIGKTTPISQDEAQGQSSRYTRRDHSI 
SLRHSETGMVDQVLLTTNADGLRFVKVRVRSVRIPQIGDKFSSRHGQKGTVGMTYTQEDM 
PWTIEGVTPDIIVNPHAIPSRMTIGQLIECIMGKVAAHMGKEGDATPFTDVTVDNISKAL 
HKCGYQMRGFERMYNGHTGRPLTAMIFLGPTYYQRLKHMVDDKIHSRGRGPVQILTRQPA 
EGRSRDGGLRFGEMERDCMIAHGAAHFLKERLFDQSDAYRVHVCEVCGLIAIANLKKNSF 
ECRGCKNKTDIVQVYIPYACKLLFQELMSMAIAPRMLTKHLKSAKGRQ*
>AT4G17190.2 |  FPS2 (FARNESYL DIPHOSPHATE SYNTHASE 2) dimethylallyltranstransferase/ geranyltranstransferase 
MDNSVTRRGQPCWFRKPKVGMIAINDGILLRNHIHRILKKHFREMPYYVDLVDLFNEVEF 
QTACGQMIDLITTFDGEKDLSKYSLQIHRRIVEYKTAYYSFYLPVACALLMAGENLENHT 
DVKTVLVDMGIYFQVQDDYLDCFADPETLGKIGTDIEDFKCSWLVVKALERCSEEQTKIL 
YENYGKAEPSNVAKVKALYKELDLEGAFMEYEKESYEKLTKLIEAHQSKAIQAVLKSFLA 
KIYKRQK*
>AT4G17190.1 |  FPS2 (FARNESYL DIPHOSPHATE SYNTHASE 2) dimethylallyltranstransferase/ geranyltranstransferase 
MADLKSTFLDVYSVLKSDLLQDPSFEFTHESRQWLERMLDYNVRGGKLNRGLSVVDSYKL 
LKQGQDLTEKETFLSCALGWCIEWLQAYFLVLDDIMDNSVTRRGQPCWFRKPKVGMIAIN 
DGILLRNHIHRILKKHFREMPYYVDLVDLFNEVEFQTACGQMIDLITTFDGEKDLSKYSL 
QIHRRIVEYKTAYYSFYLPVACALLMAGENLENHTDVKTVLVDMGIYFQVQDDYLDCFAD 
PETLGKIGTDIEDFKCSWLVVKALERCSEEQTKILYENYGKAEPSNVAKVKALYKELDLE 
GAFMEYEKESYEKLTKLIEAHQSKAIQAVLKSFLAKIYKRQK*
>AT1G23460.1 |  polygalacturonase 
MMDKLFILSLLGLLLVTAYGAAGKMVYTDLDILDELENFDVLVDDDDDTKLLDWPSFTSR 
HSGKNLVNVDTFGAAGDGVSDDTQAFVSAWSKACSTSKSVFLVPEGRRYLVNATKFNGPC 
EQKLIIQIDGTIVAPDEPSNWDSKFQRIWLEFSKLKGVVFQGKGVIDGSGSKWWAASCKK 
NKSNPCKSAPTALTIESSSGVKVSGLTIQNSQQMNFIIARSDSVRVSKVMVSSPGDSPNT 
DGIHITGSTNVILQDCKIGTGDDCVSIVNASSNIKMKNIYCGPGHGISIGSLGKDNTTGI 
VTQVVLDTALLRETTNGLRIKTYQGGSGYVQGIRFTNVEMQDVANPILIDQFYCDSPTTC 
QNQTSAVKISQIMYRNITGTTKSAKAIKFACSDTVPCSHIVLNNVNLEGNDGQVEAYCNS 
AEGFGYGVIHPSADCLYSHDDKGLDQTHKSEEAETGHDEL*
>AT1G02730.1 |  ATCSLD5 14-beta-D-xylan synthase/ cellulose synthase 
MVKSAASQSPSPVTITVTPCKGSGDRSLGLTSPIPRASVITNQNSPLSSRATRRTSISSG 
NRRSNGDEGRYCSMSVEDLTAETTNSECVLSYTVHIPPTPDHQTVFASQESEEDEMLKGN 
SNQKSFLSGTIFTGGFKSVTRGHVIDCSMDRADPEKKSGQICWLKGCDEKVVHGRCECGF 
RICRDCYFDCITSGGGNCPGCKEPYRDINDDPETEEEDEEDEAKPLPQMGESKLDKRLSV 
VKSFKAQNQAGDFDHTRWLFETKGTYGYGNAVWPKDGYGIGSGGGGNGYETPPEFGERSK 
RPLTRKVSVSAAIISPYRLLIALRLVALGLFLTWRVRHPNREAMWLWGMSTTCELWFALS 
WLLDQLPKLCPVNRLTDLGVLKERFESPNLRNPKGRSDLPGIDVFVSTADPEKEPPLVTA 
NTILSILAVDYPVEKLACYLSDDGGALLTFEALAQTASFASTWVPFCRKHNIEPRNPEAY 
FGQKRNFLKNKVRLDFVRERRRVKREYDEFKVRINSLPEAIRRRSDAYNVHEELRAKKKQ 
MEMMMGNNPQETVIVPKATWMSDGSHWPGTWSSGETDNSRGDHAGIIQAMLAPPNAEPVY 
GAEADAENLIDTTDVDIRLPMLVYVSREKRPGYDHNKKAGAMNALVRTSAIMSNGPFILN 
LDCDHYIYNSMALREGMCFMLDRGGDRICYVQFPQRFEGIDPNDRYANHNTVFFDVSMRA 
LDGLQGPMYVGTGCIFRRTALYGFSPPRATEHHGWLGRRKVKISLRRPKAMMKKDDEVSL 
PINGEYNEEENDDGDIESLLLPKRFGNSNSFVASIPVAEYQGRLIQDLQGKGKNSRPAGS 
LAVPREPLDAATVAEAISVISCFYEDKTEWGKRVGWIYGSVTEDVVTGYRMHNRGWRSIY 
CVTKRDAFRGTAPINLTDRLHQVLRWATGSVEIFFSRNNAIFATRRMKFLQRVAYFNVGM 
YPFTSLFLIVYCILPAISLFSGQFIVQSLDITFLIYLLSITLTLCMLSLLEIKWSGITLH 
EWWRNEQFWVIGGTSAHPAAVLQGLLKVIAGVDISFTLTSKSSAPEDGDDEFADLYVVKW 
SFLMVPPLTIMMVNMIAIAVGLARTLYSPFPQWSKLVGGVFFSFWVLCHLYPFAKGLMGR 
RGRVPTIVFVWSGLLSIIVSLLWVYINPPSGKQDYMQFQFP*
>AT1G10600.1 |  FUNCTIONS IN molecular_function unknown INVOLVED IN ubiquitin-dependent protein catabolic process LOCATED IN chloroplast EXPRESSED IN cultured cell CONTAINS InterPro DOMAIN/s Mov34/MPN/PAD-1 (InterProIPR000555) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT4G161441) Has 753 Blast hits to 752 proteins in 167 species Archae - 0 Bacteria - 0 Metazoa - 359 Fungi - 199 Plants - 115 Viruses - 0 Other Eukaryotes - 80 (source NCBI BLink) 
MVTLSSPSPSLSCVENVTCKSSHVSRVLISGTDNINHGESSEAKILRDVHISERLLEDFT 
ELARENTEKDLETCGTLAAFLERGIFYVTTLIIPKQESTSNSCQAMNEVEVFSIQNEREL 
YPVGWIHTHPSQGCFMSSVDLHTHYSYQVMVPEAFAIVVAPTDSSKSYGIFKLTDPGGME 
VLRGCSETGFHPHKEPEDGNPVYEHCSNVYKNSNLRFEIFDLR*
>AT1G10600.1 |  FUNCTIONS IN molecular_function unknown INVOLVED IN ubiquitin-dependent protein catabolic process LOCATED IN mitochondrion EXPRESSED IN cultured cell CONTAINS InterPro DOMAIN/s Mov34/MPN/PAD-1 (InterProIPR000555) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT4G161441) Has 601 Blast hits to 601 proteins in 160 species Archae - 0 Bacteria - 0 Metazoa - 286 Fungi - 140 Plants - 102 Viruses - 0 Other Eukaryotes - 73 (source NCBI BLink) 
MVTLSSPSPSLSCVENVTCKSSHVSRVLISGTDNINHGESSEAKILRDVHISERLLEDFT 
ELARENTEKDLETCGTLAAFLERGIFYVTTLIIPKQESTSNSCQAMNEVEVFSIQNEREL 
YPVGWIHTHPSQGCFMSSVDLHTHYSYQVMVPEAFAIVVAPTDSSKSYGIFKLTDPGGME 
VLRGCSETGFHPHKEPEDGNPVYEHCSNVYKNSNLRFEIFDLR*
>AT1G10600.1 |  FUNCTIONS IN molecular_function unknown INVOLVED IN ubiquitin-dependent protein catabolic process LOCATED IN chloroplast EXPRESSED IN cultured cell CONTAINS InterPro DOMAIN/s Mov34/MPN/PAD-1 (InterProIPR000555) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT4G161441) Has 771 Blast hits to 770 proteins in 170 species Archae - 0 Bacteria - 0 Metazoa - 366 Fungi - 201 Plants - 115 Viruses - 0 Other Eukaryotes - 89 (source NCBI BLink) 
MVTLSSPSPSLSCVENVTCKSSHVSRVLISGTDNINHGESSEAKILRDVHISERLLEDFT 
ELARENTEKDLETCGTLAAFLERGIFYVTTLIIPKQESTSNSCQAMNEVEVFSIQNEREL 
YPVGWIHTHPSQGCFMSSVDLHTHYSYQVMVPEAFAIVVAPTDSSKSYGIFKLTDPGGME 
VLRGCSETGFHPHKEPEDGNPVYEHCSNVYKNSNLRFEIFDLR*
>AT1G10600.2 |  FUNCTIONS IN molecular_function unknown INVOLVED IN ubiquitin-dependent protein catabolic process LOCATED IN chloroplast EXPRESSED IN cultured cell CONTAINS InterPro DOMAIN/s Mov34/MPN/PAD-1 (InterProIPR000555) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT4G161441) Has 753 Blast hits to 752 proteins in 167 species Archae - 0 Bacteria - 0 Metazoa - 359 Fungi - 199 Plants - 115 Viruses - 0 Other Eukaryotes - 80 (source NCBI BLink) 
MFISQKGYWRISLSLQERTLRRTSRLVGLSLPFLVLRFSSFMNLMCQAMNEVEVFSIQNE 
RELYPVGWIHTHPSQGCFMSSVDLHTHYSYQVMVPEAFAIVVAPTDSSKSYGIFKLTDPG 
GMEVLRGCSETGFHPHKEPEDGNPVYEHCSNVYKNSNLRFEIFDLR*
>AT1G10600.2 |  FUNCTIONS IN molecular_function unknown INVOLVED IN ubiquitin-dependent protein catabolic process LOCATED IN mitochondrion EXPRESSED IN cultured cell CONTAINS InterPro DOMAIN/s Mov34/MPN/PAD-1 (InterProIPR000555) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT4G161441) Has 601 Blast hits to 601 proteins in 160 species Archae - 0 Bacteria - 0 Metazoa - 286 Fungi - 140 Plants - 102 Viruses - 0 Other Eukaryotes - 73 (source NCBI BLink) 
MFISQKGYWRISLSLQERTLRRTSRLVGLSLPFLVLRFSSFMNLMCQAMNEVEVFSIQNE 
RELYPVGWIHTHPSQGCFMSSVDLHTHYSYQVMVPEAFAIVVAPTDSSKSYGIFKLTDPG 
GMEVLRGCSETGFHPHKEPEDGNPVYEHCSNVYKNSNLRFEIFDLR*
>AT1G10600.2 |  FUNCTIONS IN molecular_function unknown INVOLVED IN ubiquitin-dependent protein catabolic process LOCATED IN chloroplast EXPRESSED IN cultured cell CONTAINS InterPro DOMAIN/s Mov34/MPN/PAD-1 (InterProIPR000555) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT4G161441) Has 771 Blast hits to 770 proteins in 170 species Archae - 0 Bacteria - 0 Metazoa - 366 Fungi - 201 Plants - 115 Viruses - 0 Other Eukaryotes - 89 (source NCBI BLink) 
MFISQKGYWRISLSLQERTLRRTSRLVGLSLPFLVLRFSSFMNLMCQAMNEVEVFSIQNE 
RELYPVGWIHTHPSQGCFMSSVDLHTHYSYQVMVPEAFAIVVAPTDSSKSYGIFKLTDPG 
GMEVLRGCSETGFHPHKEPEDGNPVYEHCSNVYKNSNLRFEIFDLR*
>AT1G10600.3 |  FUNCTIONS IN molecular_function unknown INVOLVED IN ubiquitin-dependent protein catabolic process LOCATED IN chloroplast EXPRESSED IN cultured cell CONTAINS InterPro DOMAIN/s Mov34/MPN/PAD-1 (InterProIPR000555) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT4G161441) Has 753 Blast hits to 752 proteins in 167 species Archae - 0 Bacteria - 0 Metazoa - 359 Fungi - 199 Plants - 115 Viruses - 0 Other Eukaryotes - 80 (source NCBI BLink) 
MVTLSSPSPSLSCVENVTCKSSHVSRVLISGTDNINHGESSEAKILRDVHISERLLEDFT 
ELARENTEKDLETCGTLAAFLERGIFYVTTLIIPKQESTSNSCQAMNEVEVFSIQNEREL 
YPVGWIHTHPSQGCFMSSVDLHTHYSYQVMVPEAFAIVVAPTDSSNYGIFKLTDPGGMEV 
LRGCSETGFHPHKEPEDGNPVYEHCSNVYKNSNLRFEIFDLR*
>AT1G10600.3 |  FUNCTIONS IN molecular_function unknown INVOLVED IN ubiquitin-dependent protein catabolic process LOCATED IN mitochondrion EXPRESSED IN cultured cell CONTAINS InterPro DOMAIN/s Mov34/MPN/PAD-1 (InterProIPR000555) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT4G161441) Has 601 Blast hits to 601 proteins in 160 species Archae - 0 Bacteria - 0 Metazoa - 286 Fungi - 140 Plants - 102 Viruses - 0 Other Eukaryotes - 73 (source NCBI BLink) 
MVTLSSPSPSLSCVENVTCKSSHVSRVLISGTDNINHGESSEAKILRDVHISERLLEDFT 
ELARENTEKDLETCGTLAAFLERGIFYVTTLIIPKQESTSNSCQAMNEVEVFSIQNEREL 
YPVGWIHTHPSQGCFMSSVDLHTHYSYQVMVPEAFAIVVAPTDSSNYGIFKLTDPGGMEV 
LRGCSETGFHPHKEPEDGNPVYEHCSNVYKNSNLRFEIFDLR*
>AT1G10600.3 |  FUNCTIONS IN molecular_function unknown INVOLVED IN ubiquitin-dependent protein catabolic process LOCATED IN chloroplast EXPRESSED IN cultured cell CONTAINS InterPro DOMAIN/s Mov34/MPN/PAD-1 (InterProIPR000555) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT4G161441) Has 771 Blast hits to 770 proteins in 170 species Archae - 0 Bacteria - 0 Metazoa - 366 Fungi - 201 Plants - 115 Viruses - 0 Other Eukaryotes - 89 (source NCBI BLink) 
MVTLSSPSPSLSCVENVTCKSSHVSRVLISGTDNINHGESSEAKILRDVHISERLLEDFT 
ELARENTEKDLETCGTLAAFLERGIFYVTTLIIPKQESTSNSCQAMNEVEVFSIQNEREL 
YPVGWIHTHPSQGCFMSSVDLHTHYSYQVMVPEAFAIVVAPTDSSNYGIFKLTDPGGMEV 
LRGCSETGFHPHKEPEDGNPVYEHCSNVYKNSNLRFEIFDLR*
>AT1G26320.1 |  NADP-dependent oxidoreductase putative 
MTNAEAVTVTNKQIIFPDYVTGFPKESDLKITTTTIDLRLPEGSTSVLVKNLYLSCDPYM 
RICMGKPDPLSSSLVPPYKTGVPIIGLGVSKVIDSGHPDYKKGDLLWGLVGWEEYSVITL 
TTYSHFKIEHTDVPLSYYTGLLGMPGMTAYAGFYEVCSPKKGETVFVSAASGAVGQLVGQ 
FAKLMGCYVVGSAGSKEKVYLLKTKFGFDDAFNYKEEKDFSAALKRYFPEGIDIYFENVG 
GKMLDAVLINMKLHGRVAVCGMISQYNLVDPEGVHNLPTILYKRIQLQGFGVCDFYDKYP 
KFLDFVLPYIREGKITYVEDIAEGFESGPSALLGLFEGKNVGKQLFVVARE*
>AT1G52500.2 |  ATMMH-1 (ARABIDOPSIS THALIANA MUTM HOMOLOG-1) DNA N-glycosylase 
MPELPEVEAARRAIEENCLGKKIKRVIIADDNKVIHGISPSDFQTSILGKTIISARRKGK 
NLWLELDSPPFPSFQFGMAGAIYIKGVAVTKYKRSAVKDSEEWPSKYSKFFVELDDGLEL 
SFTDKRRFAKVRLLANPTSVSPISELGPDALLEPMTVDEFAESLAKKKITIKPLLLDQGY 
ISGIGNWIADEVLYQARIHPLQTASSLSKEQCEALHTSIKEVIEKAVEVDADSSQFPSYW 
IFHNREKKPGKAFVDGKKIDFITAGGRTTAYVPELQKLYGKDAEKAAKVRPAKRGVKPKE 
DDGDGEEDEQETEKEDESAKSKKGQKPRGGRGKKPASKTKTEESDDDGDDSEAEEEVVKP 
KGRGTKPAIKRKSEEKATSQAGKKPKGRKS*
>AT1G52500.1 |  ATMMH-1 (ARABIDOPSIS THALIANA MUTM HOMOLOG-1) DNA N-glycosylase 
MPELPEVEAARRAIEENCLGKKIKRVIIADDNKVIHGISPSDFQTSILGKTIISARRKGK 
NLWLELDSPPFPSFQFGMAGAIYIKGVAVTKYKRSAVKDSEEWPSKYSKFFVELDDGLEL 
SFTDKRRFAKVRLLANPTSVSPISELGPDALLEPMTVDEFAESLAKKKITIKPLLLDQGY 
ISGIGNWIADEVLYQARIHPLQTASSLSKEQCEALHTSIKEVIQHAVQVNADSKEFPVEW 
LFHFRWGKKAGKVNGKLSHHLSINLMKQNLGFCR*
>AT3G24010.1 |  ING1 (INHIBITOR OF GROWTH 1) DNA binding / methylated histone residue binding 
MSFAEEFEANLVSLAHVLQKKYALLRDLDKSLQENQRQNEQRCEKEIEDIRRGRAGNITP 
NTSLTKFSEEALDEQKHSVRIADEKVTLAMQAYDLVDMHVQQLDQYMKKSDEVIRKEKEA 
AAATLELENNGKAGNAGEGGRGGRKKTRLATAASTAAASTGMTSSNMDLDLPVDPNEPTY 
CICNQVSFGEMVACDNNACKIEWFHFGCVGLKEQPKGKWYCPECATVKKSRKGR*
>AT3G53030.1 |  SRPK4 (Ser/Arg-rich protein kinase 4) kinase/ protein kinase 
MEAEKWNSDGGEYTSEDEGTEDYRRGGYHAVRIGDSFKTGRYVVQSKLGWGHFSTVWLSW 
DTQSSRYVALKVQKSAQHYTEAAMDEITILQQIAEGDTDDTKCVVKLLDHFKHSGPNGQH 
VCMVFEYLGDNLLTLIKYSDYRGLPIPMVKEICYHMLVGLDYLHKQLSIIHTDLKPENVL 
LPSTIDPSKDPRKSGAPLVLPTDKDNTVVDSNGDFVKNQKTGSHRKAKLSAQGHAENKGN 
TESDKVRGVGSPVNGKQCAAEKSVEEDCPSTSDAIELDGSEKGKQGGKKGSRSSRRHLVA 
SADLKCKLVDFGNACWTYKQFTSDIQTRQYRCPEVILGSKYSTSADLWSFACICFELVTG 
DVLFDPHSGDNYDRDEDHLALMMELLGMMPRKIALGGRYSRDFFNRHGDLRHIRRLRFWP 
MNKVLTEKYEFSEQDANDLSDFLVSILDFVPEKRPTAAQCLLHPWINSGPRSIKPSLKDE 
NSDKLDTEKNKRENEEQEAVEVKMGNVVISSLDSKPGMSQSSSTLKLAI*
>AT3G59540.1 |  60S ribosomal protein L38 (RPL38B) 
MPKQIHEIKDFLLTARRKDARSVKIKRSKDIVKFKVRCSRYLYTLCVFDQEKADKLKQSL 
PPGLSVQDL*
>AT3G62770.1 |  AtATG18a 
MATVSSSSWPNPNPNPDSTSASDSDSTFPSHRDRVDEPDSLDSFSSMSLNSDEPNQTSNQ 
SPLSPPTPNLPVMPPPSVLHLSFNQDHACFAVGTDRGFRILNCDPFREIFRRDFDRGGGV 
AVVEMLFRCNILALVGGGPDPQYPPNKVMIWDDHQGRCIGELSFRSDVRSVRLRRDRIIV 
VLEQKIFVYNFSDLKLMHQIETIANPKGLCAVSQGVGSMVLVCPGLQKGQVRIEHYASKR 
TKFVMAHDSRIACFALTQDGHLLATASSKGTLVRIFNTVDGTLRQEVRRGADRAEIYSLA 
FSSNAQWLAVSSDKGTVHVFGLKVNSGSQVKDSSRIAPDATPSSPSSSLSLFKGVLPRYF 
SSEWSVAQFRLVEGTQYIAAFGHQKNTVVILGMDGSFYRCQFDPVNGGEMSQLEYHNCLK 
PPSVF*
>AT3G62770.3 |  AtATG18a 
MATVSSSSWPNPNPNPDSTSASDSDSTFPSHRDRVDEPDSLDSFSSMSLNSDEPNQTSNQ 
SPLSPPTPNLPVMPPPSVLHLSFNQDHACFAVGTDRGFRILNCDPFREIFRRDFDRGGGV 
AVVEMLFRCNILALVGGGPDPQYPPNKVMIWDDHQGRCIGELSFRSDVRSVRLRRDRIIV 
VLEQKIFVYNFSDLKLMHQIETIANPKGLCAVSQGVGSMVLVCPGLQKGQVRIEHYASKR 
TKFVMAHDSRIACFALTQDGHLLATASSKGTLVRIFNTVDGTLRQEVRRGADRAEIYSLA 
FSSNAQWLAVSSDKGTVHVFGLKVNSGSQVKDSSRIAPDATPSSPSSSLSLFKGVLPRYF 
SSEWSVAQFRLVEGTQYIAAFGHQKNTVVILGMDGR*
>AT4G00810.1 |  60S acidic ribosomal protein P1 (RPP1B) 
MSTVGELACSYAVMILEDEGIAITSDKIATLVKAAGVEIESYWPMLFAKMAEKRNVTDLI 
MNVGAGGGGGGAPVSAAAPAAAGGAAAAAPAKEEKKDEPAEESDGDLGFGLFD*
>AT4G00810.2 |  60S acidic ribosomal protein P1 (RPP1B) 
MSTVGELACSYAVMILEDEGIAITSDKIATLVKAAGVEIESYWPMLFAKMAEKRNVTDLI 
MNVGAGGGGGGAPVSAAAPAAAGGAAAAAPAKEEKKDEPAEESDGDLGFGLFD*
>AT4G14000.1 |  unknown protein 
MATKPTFQLFSSSQGSGLGLGFLDSSEPALPPPPPPVEVLSFEVSSSTDFEVDKLTIGEI 
TLLKGRVSTKEVFGLPNSDLVPGVYEGGLKLWEGSIDLVKALEKESQTGNLSFSGKRVLE 
LGCGHALPGIYACLKGSDAVHFQDFNAEVLRCLTIPNLNANLSEKSSSVSVSETEVRFFA 
GEWSEVHQVLPLVNSDGETNKKGGYDIILMAETIYSISAQKSQYELIKRCLAYPDGAVYM 
AAKKYYFGVGGGTRQFLSMIEKDGALASTLVSQVTDGSSNVREVWKLSYK*
>AT4G19880.1 |  FUNCTIONS IN molecular_function unknown INVOLVED IN response to cadmium ion LOCATED IN chloroplast EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s Glutathione S-transferase predicted (InterProIPR016639) Glutathione S-transferase C-terminal (InterProIPR004046) Glutathione S-transferase C-terminal-like (InterProIPR010987) Glutathione S-transferase/chloride channel C-terminal (InterProIPR017933) Thioredoxin-like fold (InterProIPR012336) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT5G450201) Has 1635 Blast hits to 1635 proteins in 489 species Archae - 12 Bacteria - 905 Metazoa - 23 Fungi - 158 Plants - 57 Viruses - 0 Other Eukaryotes - 480 (source NCBI BLink) 
MSYSTIISNTSFLSLASKFTTRGSRLQCTVSMARSAVDETSDSGAFQRTASTFRNFVSKD 
SNSQFPAESGRYHLYISYACPWASRCLSYLKIKGLDDAISFSSVKPIWGRTKETDEHMGW 
VFPGSDTEVPGADPDHLNGAKSVRELYEIASPNYTGKYTVPVLWDKKLKTVVNNESAEII 
RMFNTEFNHIAGNPDLDLYPSHLQAKIDETNEWIYNGINNGVYRCGFAKKQGPYEEAVEQ 
VYEALDRCEEILGKHRYICGNTLTETDIRLFVTLIRFDEVYAVHFKCNKKLIREYPNLFN 
YTKDIFQIPGMSSTVNMNHIKQHYYGSHPSINPFGIIPHGPNIDYTSPHDRHRFSK*
>AT4G19880.1 |  FUNCTIONS IN molecular_function unknown INVOLVED IN response to cadmium ion LOCATED IN chloroplast EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s Glutathione S-transferase predicted (InterProIPR016639) Glutathione S-transferase C-terminal-like (InterProIPR010987) Glutathione S-transferase/chloride channel C-terminal (InterProIPR017933) Thioredoxin-like fold (InterProIPR012336) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT5G450201) Has 1621 Blast hits to 1621 proteins in 489 species Archae - 12 Bacteria - 905 Metazoa - 23 Fungi - 155 Plants - 57 Viruses - 0 Other Eukaryotes - 469 (source NCBI BLink) 
MSYSTIISNTSFLSLASKFTTRGSRLQCTVSMARSAVDETSDSGAFQRTASTFRNFVSKD 
SNSQFPAESGRYHLYISYACPWASRCLSYLKIKGLDDAISFSSVKPIWGRTKETDEHMGW 
VFPGSDTEVPGADPDHLNGAKSVRELYEIASPNYTGKYTVPVLWDKKLKTVVNNESAEII 
RMFNTEFNHIAGNPDLDLYPSHLQAKIDETNEWIYNGINNGVYRCGFAKKQGPYEEAVEQ 
VYEALDRCEEILGKHRYICGNTLTETDIRLFVTLIRFDEVYAVHFKCNKKLIREYPNLFN 
YTKDIFQIPGMSSTVNMNHIKQHYYGSHPSINPFGIIPHGPNIDYTSPHDRHRFSK*
>AT4G19880.2 |  FUNCTIONS IN molecular_function unknown INVOLVED IN response to cadmium ion LOCATED IN chloroplast EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s Glutathione S-transferase predicted (InterProIPR016639) Glutathione S-transferase C-terminal (InterProIPR004046) Glutathione S-transferase C-terminal-like (InterProIPR010987) Glutathione S-transferase/chloride channel C-terminal (InterProIPR017933) Thioredoxin-like fold (InterProIPR012336) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT5G450201) Has 1635 Blast hits to 1635 proteins in 489 species Archae - 12 Bacteria - 905 Metazoa - 23 Fungi - 158 Plants - 57 Viruses - 0 Other Eukaryotes - 480 (source NCBI BLink) 
MSYSTIISNTSFLSLASKFTTRGSRLQCTVSMARSAVDETSDSGAFQRTASTFRNFVSKD 
SNSQFPAESGRYHLYISYACPWASRCLSYLKIKGLDDAISFSSVKPIWGRTKETDEHMGW 
VFPGSDTEVPGADPDHLNGAKSVRELYEIASPNYTGKYTVPVLWDKKLKTVVNNESAEII 
RMFNTEFNHIAGNPDLDLYPSHLQAKIDETNEWIYNGINNGVYRCGFAKKQGPYEEAVEQ 
VYEALDRCEEILGKHRYICGNTLTETDIRLFVTLIRFDEVSSYFQSKKKKYTICERISNV 
ETLIQVYAVHFKCNKKLIREYPNLFNYTKDIFQIPGMSSTVNMNHIKQHYYGSHPSINPF 
GIIPHGPNIDYTSPHDRHRFSK*
>AT4G19880.2 |  FUNCTIONS IN molecular_function unknown INVOLVED IN response to cadmium ion LOCATED IN chloroplast EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s Glutathione S-transferase predicted (InterProIPR016639) Glutathione S-transferase C-terminal-like (InterProIPR010987) Glutathione S-transferase/chloride channel C-terminal (InterProIPR017933) Thioredoxin-like fold (InterProIPR012336) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT5G450201) Has 1621 Blast hits to 1621 proteins in 489 species Archae - 12 Bacteria - 905 Metazoa - 23 Fungi - 155 Plants - 57 Viruses - 0 Other Eukaryotes - 469 (source NCBI BLink) 
MSYSTIISNTSFLSLASKFTTRGSRLQCTVSMARSAVDETSDSGAFQRTASTFRNFVSKD 
SNSQFPAESGRYHLYISYACPWASRCLSYLKIKGLDDAISFSSVKPIWGRTKETDEHMGW 
VFPGSDTEVPGADPDHLNGAKSVRELYEIASPNYTGKYTVPVLWDKKLKTVVNNESAEII 
RMFNTEFNHIAGNPDLDLYPSHLQAKIDETNEWIYNGINNGVYRCGFAKKQGPYEEAVEQ 
VYEALDRCEEILGKHRYICGNTLTETDIRLFVTLIRFDEVSSYFQSKKKKYTICERISNV 
ETLIQVYAVHFKCNKKLIREYPNLFNYTKDIFQIPGMSSTVNMNHIKQHYYGSHPSINPF 
GIIPHGPNIDYTSPHDRHRFSK*
>AT4G19880.3 |  FUNCTIONS IN molecular_function unknown INVOLVED IN response to cadmium ion LOCATED IN chloroplast EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s Glutathione S-transferase predicted (InterProIPR016639) Glutathione S-transferase C-terminal (InterProIPR004046) Glutathione S-transferase C-terminal-like (InterProIPR010987) Glutathione S-transferase/chloride channel C-terminal (InterProIPR017933) Thioredoxin-like fold (InterProIPR012336) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT5G450201) Has 1635 Blast hits to 1635 proteins in 489 species Archae - 12 Bacteria - 905 Metazoa - 23 Fungi - 158 Plants - 57 Viruses - 0 Other Eukaryotes - 480 (source NCBI BLink) 
MSYSTIISNTSFLSLASKFTTRGSRLQCTVSMARSAVDETSDSGAFQRTASTFRNFVSKD 
SNSQFPAESGRYHLYISYACPWASRCLSYLKIKGLDDAISFSSVKPIWGRTKETDEHMGW 
VFPGSDTEVPGADPDHLNGAKSVRELYEIASPNYTGKYTVPVLWDKKLKTVVNNESAEII 
RMFNTEFNHIAGNPDLDLYPSHLQAKIDETNEWIYNGINNGVYRCGFAKKQGPYEEAVEQ 
VYEALDRCEEILGKHRYICGNTLTETDIRLFVTLIRFDEVSLCSPLQMQQETHKGVSEFV 
QLHERHFPDPRYE*
>AT4G19880.3 |  FUNCTIONS IN molecular_function unknown INVOLVED IN response to cadmium ion LOCATED IN chloroplast EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s Glutathione S-transferase predicted (InterProIPR016639) Glutathione S-transferase C-terminal-like (InterProIPR010987) Glutathione S-transferase/chloride channel C-terminal (InterProIPR017933) Thioredoxin-like fold (InterProIPR012336) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT5G450201) Has 1621 Blast hits to 1621 proteins in 489 species Archae - 12 Bacteria - 905 Metazoa - 23 Fungi - 155 Plants - 57 Viruses - 0 Other Eukaryotes - 469 (source NCBI BLink) 
MSYSTIISNTSFLSLASKFTTRGSRLQCTVSMARSAVDETSDSGAFQRTASTFRNFVSKD 
SNSQFPAESGRYHLYISYACPWASRCLSYLKIKGLDDAISFSSVKPIWGRTKETDEHMGW 
VFPGSDTEVPGADPDHLNGAKSVRELYEIASPNYTGKYTVPVLWDKKLKTVVNNESAEII 
RMFNTEFNHIAGNPDLDLYPSHLQAKIDETNEWIYNGINNGVYRCGFAKKQGPYEEAVEQ 
VYEALDRCEEILGKHRYICGNTLTETDIRLFVTLIRFDEVSLCSPLQMQQETHKGVSEFV 
QLHERHFPDPRYE*
>AT4G19880.1 |  FUNCTIONS IN molecular_function unknown INVOLVED IN response to cadmium ion LOCATED IN chloroplast EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s Glutathione S-transferase predicted (InterProIPR016639) Glutathione S-transferase C-terminal-like (InterProIPR010987) Thioredoxin-like fold (InterProIPR012336) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT5G450201) Has 1567 Blast hits to 1567 proteins in 487 species Archae - 12 Bacteria - 899 Metazoa - 23 Fungi - 155 Plants - 55 Viruses - 0 Other Eukaryotes - 423 (source NCBI BLink) 
MSYSTIISNTSFLSLASKFTTRGSRLQCTVSMARSAVDETSDSGAFQRTASTFRNFVSKD 
SNSQFPAESGRYHLYISYACPWASRCLSYLKIKGLDDAISFSSVKPIWGRTKETDEHMGW 
VFPGSDTEVPGADPDHLNGAKSVRELYEIASPNYTGKYTVPVLWDKKLKTVVNNESAEII 
RMFNTEFNHIAGNPDLDLYPSHLQAKIDETNEWIYNGINNGVYRCGFAKKQGPYEEAVEQ 
VYEALDRCEEILGKHRYICGNTLTETDIRLFVTLIRFDEVYAVHFKCNKKLIREYPNLFN 
YTKDIFQIPGMSSTVNMNHIKQHYYGSHPSINPFGIIPHGPNIDYTSPHDRHRFSK*
>AT4G19880.2 |  FUNCTIONS IN molecular_function unknown INVOLVED IN response to cadmium ion LOCATED IN chloroplast EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s Glutathione S-transferase predicted (InterProIPR016639) Glutathione S-transferase C-terminal-like (InterProIPR010987) Thioredoxin-like fold (InterProIPR012336) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT5G450201) Has 1567 Blast hits to 1567 proteins in 487 species Archae - 12 Bacteria - 899 Metazoa - 23 Fungi - 155 Plants - 55 Viruses - 0 Other Eukaryotes - 423 (source NCBI BLink) 
MSYSTIISNTSFLSLASKFTTRGSRLQCTVSMARSAVDETSDSGAFQRTASTFRNFVSKD 
SNSQFPAESGRYHLYISYACPWASRCLSYLKIKGLDDAISFSSVKPIWGRTKETDEHMGW 
VFPGSDTEVPGADPDHLNGAKSVRELYEIASPNYTGKYTVPVLWDKKLKTVVNNESAEII 
RMFNTEFNHIAGNPDLDLYPSHLQAKIDETNEWIYNGINNGVYRCGFAKKQGPYEEAVEQ 
VYEALDRCEEILGKHRYICGNTLTETDIRLFVTLIRFDEVSSYFQSKKKKYTICERISNV 
ETLIQVYAVHFKCNKKLIREYPNLFNYTKDIFQIPGMSSTVNMNHIKQHYYGSHPSINPF 
GIIPHGPNIDYTSPHDRHRFSK*
>AT4G19880.3 |  FUNCTIONS IN molecular_function unknown INVOLVED IN response to cadmium ion LOCATED IN chloroplast EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s Glutathione S-transferase predicted (InterProIPR016639) Glutathione S-transferase C-terminal-like (InterProIPR010987) Thioredoxin-like fold (InterProIPR012336) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT5G450201) Has 1567 Blast hits to 1567 proteins in 487 species Archae - 12 Bacteria - 899 Metazoa - 23 Fungi - 155 Plants - 55 Viruses - 0 Other Eukaryotes - 423 (source NCBI BLink) 
MSYSTIISNTSFLSLASKFTTRGSRLQCTVSMARSAVDETSDSGAFQRTASTFRNFVSKD 
SNSQFPAESGRYHLYISYACPWASRCLSYLKIKGLDDAISFSSVKPIWGRTKETDEHMGW 
VFPGSDTEVPGADPDHLNGAKSVRELYEIASPNYTGKYTVPVLWDKKLKTVVNNESAEII 
RMFNTEFNHIAGNPDLDLYPSHLQAKIDETNEWIYNGINNGVYRCGFAKKQGPYEEAVEQ 
VYEALDRCEEILGKHRYICGNTLTETDIRLFVTLIRFDEVSLCSPLQMQQETHKGVSEFV 
QLHERHFPDPRYE*
>AT5G03430.1 |  phosphoadenosine phosphosulfate (PAPS) reductase family protein 
MEIDKAIGESDDKRLKTKYNNAIFVIKRALALYSIEEVAFSFNGGKDSTVLLHLLRAGYF 
LHKKEQTCSNGGLSSFPVRTIYFESPSAFTEINAFTYDAAQTYNLQLDIIRQDFKSGLEA 
LLKANPIRAIFLGVRIGDPTAVGQEQFSPSSPGWPPFMRVNPILDWSYRDVWAFLLTCKV 
KYCSLYDQGYTSIGSIHDTVPNSLLSVNDTSSKEKFKPAYLLSDGRLERAGRVKKIASLK 
KDVDTESQKHEVLLASVIAVGDEILSGTVEDQLGLSLCKKLTSVGWSVQQTTVLRNDIDS 
VSEEVDRQRSTSDMVFIYGGVGPLHSDVTLAGVAKAFGVRLAPDEEFEEYLRHLISDQCT 
GDRNEMAQLPEGITELLHHEKLSVPLIKCRNVIVLAATNTEELEKEWECLTELTKLGGGS 
LIEYSSRRLMTSLTDVEVAEPLSKLGLEFPDIYLGCYRKSRQGPIIICLTGKDNARMDSA 
AQALRKKFKKDVFVEIK*
>AT5G39420.1 |  cdc2cAt (Arabidopsis thaliana cdc2c) ATP binding / kinase/ protein kinase/ protein serine/threonine kinase 
MGCISSKNVSCLTDQGDSPLPEPGLLSTSQQHRVLIDHSLEASHNSKRSRKSRRLGGSDL 
RIGVSLGSSHRNIEAEQAAAGWPAWLCSAAAEAVHGWVPLKAEAFQKLEKIGQGTYSSVF 
RAREVETGKMVALKKVKFDNLQPESIRFMAREILILRKLNHPNIMKLEGIVTSRASSSIY 
LVFEYMEHDLAGLSSNPDIRFTEPQIKCYMKQLLWGLEHCHMRGVIHRDIKASNILVNNK 
GVLKLGDFGLANVVTPSNKNQLTSRVVTLWYRAPELLMGSTSYGVSVDLWSVGCVFAEIL 
MGKPILKGRTEIEQLHKIYKLCGSPQDSFWKRTKLPHATSFKPQHTYEATLRERCKDLSA 
TGVYLLETLLSMEPDKRGTASSALNSEYFLTRPYACDPSSLPKYPPNKEMDAKYRDDMRR 
KRANLKLRDSGVGRKHKRPHRAEYDPKNYAKLPIRKDTLEVKNIPNEASRATTTTHGNYY 
KVSDLPMTTGPASGFAWAVKRRKDPDNISTLTYYQPSSKSQLSGTSVAFAKNTFGLNLKP 
DNDSVWEVQGNNYDDVIEEVPSHESKLSRIGERHGSLDGSGLDFSQREEDSPKKTLEHLQ 
FGKQSISGPLIFKSGKIDEILQRNESNIRQAVRKSHLQREQDDR*
>AT5G59850.1 |  40S ribosomal protein S15A (RPS15aF) 
MVRISVLNDALKSMYNAEKRGKRQVMIRPSSKVIIKFLIVMQKHGYIGEFEYVDDHRSGK 
IVVELNGRLNKCGVISPRFDVGVKEIEGWTARLLPSRQFGYIVLTTSAGIMDHEEARRKN 
VGGKVLGFFY*
>AT5G66620.1 |  DAR6 (DA1-RELATED PROTEIN 6) zinc ion binding 
MASDYYSSDDEGFGEKVGLIGEKDRFEAETIHVIEVSQHEADIQKAKQRSLATHEAEKLD 
LATHEAEQLDLAIQEFSRQEEEEERRRTRELENDAQIANVLQHEERERLINKKTALEDEE 
DELLARTLEESLKENNRRKMFEEQVNKDEQLALIVQESLNMEEYPIRLEEYKSISRRAPL 
DVDEQFAKAVKESLKNKGKGKQFEDEQVKKDEQLALIVQESLNMVESPPRLEENNNISTR 
APVDEDEQLAKAVEESLKGKGQIKQSKDEVEGDGMLLELNPPPSLCGGCNFAVEHGGSVN 
ILGVLWHPGCFCCRACHKPIAIHDIENHVSNSRGKFHKSCYERYCYVCKEKKMKTYNNHP 
FWEERYCPVHEADGTPKCCSCERLEPRESNYVMLADGRWLCLECMNSAVMDSDECQPLHF 
DMRDFFEGLNMKIEKEFPFLLVEKQALNKAEKEEKIDYQYEVVTRGICLSEEQIVDSVSQ 
RPVRGPNNKLVGMATESQKVTRECEVTAILILYGLPRLLTGYILAHEMMHAYLRLNGHRN 
LNNILEEGICQVLGHLWLDSQTYATADATADASSSASSSSRTPPAASASKKGEWSDFDKK 
LVEFCKNQIETDDSPVYGLGFRTVNEMVTNSSLQETLKEILRQR*
>AT1G12880.1 |  atnudt12 (Arabidopsis thaliana Nudix hydrolase homolog 12) hydrolase 
MSVLSSRTGRDRQRYDNNFRLVSGCIPYRLMKADETEEDSGVDFVNKLEVLMVSSPNRHD 
LVFPKGGWEDDETVLEAASREAIEEAGVKGILRELPLGVWEFRSKSSTVEDECLGGCKGY 
MFALKVTEELEDWPERKNRERRWLTVKEALELCRYEWMQRALEEFLRVMEDERRLRTEEE 
TVHDSSKLEEESQIDPWYCFVVN*