>AT3G16480.1 |  MPPalpha (mitochondrial processing peptidase alpha subunit) catalytic/ metal ion binding / metalloendopeptidase/ zinc ion binding 
MYRTAASRAKALKGILNHNFRASRYASSSAVATSSSSSSWLSGGYSSSLPSMNIPLAGVS 
LPPPLSDHVEPSKLKTTTLPNGLTIATEMSPNPAASIGLYVDCGSIYETPQFRGATHLLE 
RMAFKSTLNRSHFRLVREIEAIGGNTSASASREQMGYTIDALKTYVPEMVEVLIDSVRNP 
AFLDWEVNEELRKVKVEIGEFATNPMGFLLEAVHSAGYSGALANPLYAPESAITGLTGEV 
LENFVFENYTASRMVLAASGVDHEELLKVVEPLLSDLPNVPRPAEPKSQYVGGDFRQHTG 
GEATHFALAFEVPGWNNEKEAIIATVLQMLMGGGGSFSAGGPGKGMHSWLYLRLLNQHQQ 
FQSCTAFTSVFNNTGLFGIYGCTSPEFASQGIELVASEMNAVADGKVNQKHLDRAKAATK 
SAILMNLESRMIAAEDIGRQILTYGERKPVDQFLKTVDQLTLKDIADFTSKVITKPLTMA 
TFGDVLNVPSYDSVSKRFR*
>AT1G72320.2 |  APUM23 (Arabidopsis Pumilio 23) RNA binding / binding 
MGERGKSSNNHSERNKGMRRKDHKGNRGFDVDSSKKNQSGGAPNVKPASKKHSEFEHQNQ 
FVRKEIDPETSKYFSEIANLFDSNEVELEERSVICGNALEETRGREYEIATDYIISHVLQ 
TLLEGCELDQLCSFIRNSASVFPAIAMDRSGSHVAESALKSLATHLENPDAYSVIEEALH 
SICKVIVDNPLDMMCNCYGSHVLRRLLCLCKGVSLDSPELYGAKSSKALAKRLNLKMSQL 
DDNNLEIPHQGFPGMLTYLLSGLLSCSREDMKYLQVDQYSSLVLQTALRLMLKQDEQLLE 
IIPLILRCNSTNKKVEGFHIETNVAKEILESMKDNSFSHLVEVILEVAPESLYNEMFNKV 
FKNSLFELSVDRCANFVIQALISHARDQEQMGIMWEELAPRFKDLLEQGKSGVVASLIAV 
SQRLQSHENKCCEALVGAVCSTNESRISILPRLLFLDYYFGCRDKSTWEWAPGAKMHVMG 
CLILQGIFKFSSDHIQPYITSLTSMKAEYITETAKDSSGARVIEAFLASDAATKQKRRLI 
IKLRGHFGELSLHTSGSFTVEKCFDACNLTLREAIASELLDVKVDLSKTKQGPYLLRKLD 
IDGYASRPDQWKSRQEAKQSTYNEFCSAFGSNKSNFPKNTFVSDASEDAAQEIEVKNTRK 
EIDHHPTSGFKRHREKHAKDKDEPFAGEKRSKQKKNKTSEATDKPKLAGSKRPFLSGEMT 
GKNRHSNKMRI*
>AT1G72320.1 |  APUM23 (Arabidopsis Pumilio 23) RNA binding / binding 
MVSVGSKSLPSRRHRTIEEDSLMGERGKSSNNHSERNKGMRRKDHKGNRGFDVDSSKKNQ 
SGGAPNVKPASKKHSEFEHQNQFVRKEIDPETSKYFSEIANLFDSNEVELEERSVICGNA 
LEETRGREYEIATDYIISHVLQTLLEGCELDQLCSFIRNSASVFPAIAMDRSGSHVAESA 
LKSLATHLENPDAYSVIEEALHSICKVIVDNPLDMMCNCYGSHVLRRLLCLCKGVSLDSP 
ELYGAKSSKALAKRLNLKMSQLDDNNLEIPHQGFPGMLTYLLSGLLSCSREDMKYLQVDQ 
YSSLVLQTALRLMLKQDEQLLEIIPLILRCNSTNKKVEGFHIETNVAKEILESMKDNSFS 
HLVEVILEVAPESLYNEMFNKVFKNSLFELSVDRCANFVIQALISHARDQEQMGIMWEEL 
APRFKDLLEQGKSGVVASLIAVSQRLQSHENKCCEALVGAVCSTNESRISILPRLLFLDY 
YFGCRDKSTWEWAPGAKMHVMGCLILQGIFKFSSDHIQPYITSLTSMKAEYITETAKDSS 
GARVIEAFLASDAATKQKRRLIIKLRGHFGELSLHTSGSFTVEKCFDACNLTLREAIASE 
LLDVKVDLSKTKQGPYLLRKLDIDGYASRPDQWKSRQEAKQSTYNEFCSAFGSNKSNFPK 
NTFVSDASEDAAQEIEVKNTRKEIDHHPTSGFKRHREKHAKDKDEPFAGEKRSKQKKNKT 
SEATDKPKLAGSKRPFLSGEMTGKNRHSNKMRI*
>AT1G72320.3 |  APUM23 (Arabidopsis Pumilio 23) RNA binding / binding 
MGERGKSSNNHSERNKGMRRKDHKGNRGFDVDSSKKNQSGGAPNVKPASKKHSEFEHQNQ 
FVRKEIDPETSKYFSEIANLFDSNEVELEERSVICGNALEETRGREYEIATDYIISHVLQ 
TLLEGCELDQLCSFIRNSASVFPAIAMDRSGSHVAESALKSLATHLENPDAYSVIEEALH 
SICKVIVDNPLDMMCNCYGSHVLRRLLCLCKGVSLDSPELYGAKSSKALAKRLNLKMSQL 
DDNNLEIPHQGFPGMLTYLLSGLLSCSREDMKYLQVDQYSSLVLQTALRLMLKQDEQLLE 
IIPLILRCNSTNKKVEGFHIETNVAKEILESMKDNSFSHLVEVILEVAPESLYNEMFNKV 
FKNSLFELSVDRCANFVIQALISHARDQEQMGIMWEELAPRFKDLLEQGKSGVVASLIAV 
SQRLQSHENKCCEALVGAVCSTNESRISILPRLLFLDYYFGCRDKSTWEWAPGAKMHVMG 
CLILQGIFKFSSDHIQPYITSLTSMKAEYITETAKDSSGARVIEAFLASDAATKQKRRLI 
IKLRGHFGELSLHTSGSFTVEKCFDACNLTLREAIASELLDVKVDLSKTKQGPYLLRKLD 
IDGYASRPDQWKSRQEAKQSTYNEFCSAFGSNKSNFPKNTFVSDASEDAAQEIEVKNTRK 
EIDHHPTSGFKRHREKHAKDKDEPFAGEKRSKQKKNKTSEATDKPKLAGSKRPFLSGEMT 
GKNRHSNKMRI*
>AT3G62870.1 |  60S ribosomal protein L7A (RPL7aB) 
MAPKKGVKVASKKKPEKVTNPLFERRPKQFGIGGALPPKKDLSRYIKWPKSIRLQRQKRI 
LKQRLKVPPALNQFTKTLDKNLATSLFKILLKYRPEDKAAKKERLLNKAQAEAEGKPAES 
KKPIVVKYGLNHVTYLIEQNKAQLVVIAHDVDPIELVVWLPALCRKMEVPYCIVKGKSRL 
GAVVHQKTAAALCLTTVKNEDKLEFSKILEAIKANFNDKYEEYRKKWGGGIMGSKSQAKT 
KAKERVIAKEAAQRMN*
>AT3G22890.1 |  APS1 (ATP SULFURYLASE 1) sulfate adenylyltransferase (ATP) 
MASMAAVLSKTPFLSQPLTKSSPNSDLPFAAVSFPSKSLRRRVGSIRAGLIAPDGGKLVE 
LIVEEPKRREKKHEAADLPRVELTAIDLQWMHVLSEGWASPLGGFMRESEFLQTLHFNSL 
RLDDGSVVNMSVPIVLAIDDEQKARIGESTRVALFNSDGNPVAILSDIEIYKHPKEERIA 
RTWGTTAPGLPYVDEAITNAGNWLIGGDLEVLEPVKYNDGLDRFRLSPAELRKELEKRNA 
DAVFAFQLRNPVHNGHALLMTDTRRRLLEMGYKNPILLLHPLGGFTKADDVPLDWRMKQH 
EKVLEDGVLDPETTVVSIFPSPMHYAGPTEVQWHAKARINAGANFYIVGRDPAGMGHPVE 
KRDLYDADHGKKVLSMAPGLERLNILPFRVAAYDKTQGKMAFFDPSRPQDFLFISGTKMR 
TLAKNNENPPDGFMCPGGWKVLVDYYESLTPAGNGRLPEVVPV*
>AT3G03920.1 |  Gar1 RNA-binding region family protein 
MRPPMRGGGGFRGRGGRDGGGGGRFGGGGGRFGGGGGRFGGGGGRFGGFRDEGPPSEVVE 
VATFVHACEGDAVTKLSQEKIPHFNAPIYLENKTQIGKVDEIFGPINESLFSIKMMEGIV 
ATSYSPGDKFFIDPYKLLPLARFLPQPKGQSTGGRGGAGRGRGDSRGRGRGGSFSRGRGA 
PRGGRFPPRGGSRGSFRGRGRF*
>AT3G06530.1 |  binding 
MSSSIVSQLQALKSVLQADTEPSKRPFTRPSILFSPKEAADFDIESIYELGLKGLEVLGN 
KDERFKNYMNDLFSHKSKEIDRELLGKEENARIDSSISSYLRLLSGYLQFRASLETLEYL 
IRRYKIHIYNLEDVVLCALPYHDTHAFVRIVQLLSTGNSKWKFLDGVKNSGAPPPRSVIV 
QQCIRDKQVLEALCDYASRTKKYQPSKPVVSFSTAVVVGVLGSVPTVDGDIVKTILPFVD 
SGLQSGVKGCLDQQAGALMVVGMLANRAVLNTNLIKRLMRSIIDIGREHAKESSDPHSLR 
LSLMALINFVQLQSVDLIPRKALDLFNEIRDISGVLLGLSKEFNIKRFLAVLLDSLLFYS 
SSDDKCCEVLASIIETVPVSNLVDHLISKVFSLCMTQYQKNSDFRSSTSGSWAKKFLVVV 
SKKYPAELRAAVPKFLEATEVQSKKEDLKLEMLSCMLDGNSDMSHPFVDSKLWFRLHHPR 
AAVRCAALSSLNGVLKDDSSKAENLVTIQDAILRQLWDDDLAVVQAALSFDKLPNIITSS 
GLLDALLHVVKRCVGILVSGVSHNVQLAVDVVALSLKIAVSSFGNQTDSTEKVTSAMFPF 
LLIQPKTWNLNLLVLKLGKDVNWPLFKNLAADDGMKKLPDIMSTNLSSISMDIINDLGEA 
LSLDPDERRIELIERACNYKLSEVLETCSNIKCSEQDRNKLQKGLLIRESVSALNIDVIN 
KLVEAFMMHPADYIQWLTTEWEELEVEVDVSLKELSKSNCQELLYQLLDTSDFTALNSKV 
LICLFWKLGESFIKLEPAHDASVLNKRLSSGLEDLFFFFATTRLRHVFKEHLHFRVREAK 
VCPVLFLSRLISREDVPPLVQIESLRCFSYLCSSGNNEWLIQVFSSFPVLLVPMSSDNQD 
VKAAAINCIEALFNLRAAIYGSSFDELLGMIVQQRRLILSDNKFFASYLTSLLSSTTNDL 
LVPVGLQKRFDQSTKENILSVILLCAEDLPAYGKLRVLSLLKDLGIMLMRDEIVKLLSQL 
LDKRSQYYYKLDKTSQPLSDTEVDLLCLLLECSMMRTSSFKGQSLDDHILSALNVDCMAS 
ERPAVISPCLTILEKLSNRFYDELQTDVQIRFFHKLVSMFRSSNGSIQNGAKEAVLRLKL 
SSSTVVLALDRITQQDTLVIGSLSKKKKQKKNSKSCPEEDINSEEFRSGEKALSFIASLL 
DMLLLKKDLTHRESLIRPLFKLLQRSMSKEWVKIAFSIEETSLQPPQDVRETTPTFISSI 
QQTLLLILKDIFDSLNMNPLKAEVANEINVKMLVELAHSSNDGVTRNHIFSLFTAIVKFV 
PDKVLDHIISILTLVGESTVTQIDSHSKSIFEGFISMVIPFWLSKTKSEEQLLQIFVKVL 
PDIVEHRRRSIVAYLLGVIGERNGLPALLVLLFKSLISRKDSAWLGNANVSESFASIVKK 
EWEYSFAMEICEQYSSSTWLSSLVILLQTISKDSKQCFLQMRLVLEFVFQKLQDPEFAFA 
VSLEPRNNVSVGIQQELQELMKCCICLLQAIDAKKEKDVTSSVRNEIRMRIHDVLMTVTG 
AMDLSIYFRVVTSLLQQQTDYNGTKKVLGLISERAKDTSSSKMKHKRKISNQKGRNSWLN 
LDEVAVDSFGKMCEEIVHLINATDDESGVPVKRAAISTLEVLAGRFPSGHPIFRKCLAAV 
AECISSKNLGVSSSCLRTTGALINVLGPKALIELPCIMKNLVKQSLEVSFASQSGRNATA 
EEQLLMLSVLVTLEAVIDKLGGFLNPHLGDIMKIMVLHPEYVSDFDKNLKSKANAIRRLL 
TDKIPVRLTLQPLLRIYNEAVSSGNASLVIAFNMLEDLVVKMDRSSIVSSHGKIFDQCLV 
ALDIRRLNPAAIQNIDDAERSVTSAMVALTKKLTESEFRPLFIRSIDWAESDVVDGSGSE 
NKSIDRAISFYGLVDRLCESHRSIFVPYFKYVLDGIVAHLTTAEASVSTRKKKKAKIQQT 
SDSIQPKSWHLRALVLSCLKNCFLHDTGSLKFLDTNNFQVLLKPIVSQLVVEPPSSLKEH 
PHVPSVDEVDDLLVSCIGQMAVASGSDLLWKPLNHEVLMQTRSESVRSRMLSLRSVKQML 
DNLKEEYLVLLAETIPFLAELLEDVELSVKSLAQDIIKQMEEMSGESLAEYL*
>AT3G11964.1 |  RNA binding 
MVVPQKKFANGKRNDSTKSFKPMKKPFKKTKDDVAARSEAMALQLEDVPDFPRGGGTSLS 
KKEREKLYEEVDAEFDADERVSKKSKGGKSKKRIPSDLDDLGLLFGGGLHGKRPRYANKI 
TTKNISPGMKLLGVVTEVNQKDIVISLPGGLRGLVRASEVSDFTDRGIEDDENELLGDIF 
SVGQLVPCIVLELDDDKKEAGKRKIWLSLRLSLLHKGFSFDSFQLGMVFSANVKSIEDHG 
SILHFGLPSITGFIEISDDGNQESGMKTGQLIQGVVTKIDRDRKIVHLSSDPDSVAKCLT 
KDLSGMSFDLLIPGMMVNARVQSVLENGILFDFLTYFNGTVDLFHLKNPLSNKSWKDEYN 
QNKTVNARILFIDPSSRAVGLTLSPHVVCNKAPPLHVFSGDIFDEAKVVRIDKSGLLLEL 
PSKPTPTPAYVSFKEGNHIRVRVLGLKQMEGLAVGTLKESAFEGPVFTHSDVKPGMVTKA 
KVISVDTFGAIVQFSGGLKAMCPLRHMSEFEVTKPRKKFKVGAELVFRVLGCKSKRITVT 
YKKTLVKSKLPILSSYTDATEGLVTHGWITKIEKHGCFVRFYNGVQGFVPRFELGLEPGS 
DPDSVFHVGEVVKCRVTSAVHGTQRITLNDSIKLGSIVSGIIDTITSQAVIVRVKSKSVV 
KGTISAEHLADHHEQAKLIMSLLRPGYELDKLLVLDIEGNNMALSSKYSLIKLAEELPSD 
FNQLQPNSVVHGYVCNLIENGCFVRFLGRLTGFAPRSKAIDDPKADVSESFFVGQSVRAN 
IVDVNQEKSRITLSLKQSSCASVDASFVQEYFLMDEKISDLQSSDITKSDCSWVEKFSIG 
SLIKGTIQEQNDLGVVVNFDNINNVLGFIPQHHMGGATLVPGSVVNAVVLDISRAERLVD 
LSLRPELLNNLTKEVSNSSKKKRKRGISKELEVHQRVSAVVEIVKEQHLVLSIPEHGYTI 
GYASVSDYNTQKLPVKQFSTGQSVVASVKAVQNPLTSGRLLLLLDSVSGTSETSRSKRAK 
KKSSCEVGSVVHAEITEIKPFELRVNFGNSFRGRIHITEVLVNDASTSDEPFAKFRVGQS 
ISARVVAKPCHTDIKKTQLWELSVKPAMLKDSSEFNDTQESEQLEFAAGQCVIGYVYKVD 
KEWVWLAVSRNVTARIFILDTSCKAHELEEFERRFPIGKAVSGYVLTYNKEKKTLRLVQR 
PLLFIHKSIANGGGSKTDKPDSSIPGDDDTLFIHEGDILGGRISKILPGVGGLRVQLGPY 
VFGRVHFTEINDSWVPDPLDGFREGQFVKCKVLEISSSSKGTWQIELSLRTSLDGMSSAD 
HLSEDLKNNDNVCKRFERIEDLSPDMGVQGYVKNTMSKGCFIILSRTVEAKVRLSNLCDT 
FVKEPEKEFPVGKLVTGRVLNVEPLSKRIEVTLKTVNAGGRPKSESYDLKKLHVGDMISG 
RIRRVEPFGLFIDIDQTGMVGLCHISQLSDDRMENVQARYKAGESVRAKILKLDEEKKRI 
SLGMKSSYLMNGDDDKAQPLSEDNTSMECDPINDPKSEVLAAVDDFGFQETSGGTSLVLA 
QVESRASIPPLEVDLDDIEETDFDSSQNQEKLLGANKDEKSKRREKQKDKEEREKKIQAA 
EGRLLEHHAPENADEFEKLVRSSPNSSFVWIKYMAFMLSLADIEKARSIAERALRTINIR 
EEEEKLNIWVAYFNLENEHGNPPEESVKKVFERARQYCDPKKVYLALLGVYERTEQYKLA 
DKLLDEMIKKFKQSCKIWLRKIQSSLKQNEEAIQSVVNRALLCLPRHKHIKFISQTAILE 
FKCGVADRGRSLFEGVLREYPKRTDLWSVYLDQEIRLGEDDVIRSLFERAISLSLPPKKM 
KFLFKKFLEYEKSVGDEERVEYVKQRAMEYANSTLA*
>AT2G37790.1 |  aldo/keto reductase family protein 
MAEEIRFFELNTGAKIPSVGLGTWQADPGLVGNAVDAAVKIGYRHIDCAQIYGNEKEIGL 
VLKKLFDGGVVKREEMFITSKLWCTYHDPQEVPEALNRTLQDLQLDYVDLYLIHWPVSLK 
KGSTGFKPENILPTDIPSTWKAMESLFDSGKARAIGVSNFSSKKLADLLVVARVPPAVNQ 
VECHPSWQQNVLRDFCKSKGVHLSGYSPLGSPGTTWLTSDVLKNPILGGVAEKLGKTPAQ 
VALRWGLQMGQSVLPKSTHEDRIKQNFDVFNWSIPEDMLSKFSEIGQGRLVRGMSFVHET 
SPYKSLEELWDGEI*
>AT2G40360.1 |  transducin family protein / WD-40 repeat family protein 
MTKRSKGANEDKLIETKSKNVSGKSQKQKKPVEAESLKEEDLLQASGTDSDYDGDSLPGS 
LNSDDFDSDFSDSEDDGTHEGTEDGDVEFSDDDDVLEHDGSIDNEDDDGSEHVGSDNNEE 
HGSDEDSERGEAVEESDSSEDEVPSRNTVGNVPLKWYEDEKHIGYDLTGKKITKKEKQDK 
LDSFLATIDDSKTWRKIYDEYNDEDVELTKEESKIVQRILKGEAPHADFDPYAPYVEWFK 
HDDAIHPLSSAPEPKRRFIPSKWEAKKVVKIVRAIRKGWIKFDKPEEEPNVYLLWGDDST 
SDQKSKHLTYIPPPKLKLPGHDESYNPSLEYIPTEEEKASYELMFEEDRPKFIPTRFTSL 
RSIPAYENALKESFERCLDLYLCPRVRKKRINIDPESLKPKLPSRKDLRPYPNSCYLEYK 
GHTGAVTSISTDSSGEWIASGSTDGSVRMWEVETGRCLKVWQFDEAIMCVAWNPLSRLPV 
LAVAMGRDLFFLNTELGTDEEQEITKERLHSGNIPEPDASVAAIVTWLPDELYGGIKIRH 
FKSISSIDWHRKGDYLSTVMASGETRGVVLHQLSKQKTQRLPFKIRGLPVCTLFHPSLSY 
FFVATRKDVRVYNLLKPGEATKKLETGLREISSMAIHPGGDNLIVGSKEGKMCWFDMDLS 
SKPYKTLKNHPKDITNVAVHRSYPLFASCSEDSTAYVFHGMVYNDLNQNPLIVPLEILRG 
HSSKGGVLDCKFHPRQPWLFTAGADSIIKLYCH*
>AT1G63810.1 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 21 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s Nrap protein (InterProIPR005554) Has 268 Blast hits to 263 proteins in 124 species Archae - 3 Bacteria - 0 Metazoa - 116 Fungi - 89 Plants - 17 Viruses - 0 Other Eukaryotes - 43 (source NCBI BLink) 
MEADTKTDSRTLKVNDLLKDARLDYDSLRKLVDDTVSSIKEAIDGIPEKFQVTSELAPSF 
VEDIGADKEVEFSFKKPNGFNLCGSYSICGMAKPDTSVDLLVHLPKECFYEKDYMNHRYH 
AKRCLYLCVIEKHLLSSSSIEKVVWSTLHNEARKPVLVVFPAKKLDQFPGFSIRLIPSAT 
SLFSVAKLSISRNNVRSVTADGVPEPTPTYNSSILEDMFLEENSEFLKKTFSEWKELSDA 
LILLKIWARQRSSIYVHDCLNGFLISVILSYLATHSKINKALSALDIFRVTLDFIATSKL 
WERGLYLPPQSEIRVSKEEKMQFRELFPVVICDSSTFVNLAFRMTSVGFLELQDEASLTL 
KCMEKLRDGGFEEIFMTKIDYPVKYDHCIRLQLKGKTAVSLSGFCLDKECWRLYEQKVHS 
LLLEGLGDRAKSIRVVWRNTNQDWHVESGLSVLDREPLFIGISVSSTEKAYRTVDIGPDA 
ENKIEALRFRKFWGEKSDLRRFKDGRISESTVWETQQWTKHLIMKQIVEYILKRHLSLTS 
DDIVQLVDQLDFSLNYGGKDPISLSGNLVQAYEVLSKCLREIEGIPLKVSSVQSLDSALR 
FTSVFPPEPHPVACEKIDSRRLQKLIPSCIPAMEVMIQLEGSGNWPMDDLAVEKTKSAFL 
LKIAESLQNVKGIPCTATEDNVDVFIGGYAFRLRILHERGLSLVKREIGVDPVKHVSSTD 
KMLFIRSQHASMINGLQGRFPVYAPVARLAKRWVSAHLFSGCLAEEAIELLVAYLFLTPL 
PLGVPSSRINGFLRFLRLLADYEWMFYPLIVDINNDFGRNDEKEINDNFMSSRKGYEEDK 
QNISSAMFLAAPYDKASEAWTSTSPNLLEQKRLVAYARSSANVLSKMVLQEHNDSVQWEC 
LFRTPLNNYDAVILLHRDKLPYPRRLLFPSELNQGKHVARGKASRLFNPFMSPGDLKRSH 
EELKNKLMVDFEPTKCLLSGLQEEFGTLKPWYDHIGGDAIGLTWNKHNSKKRERDEEEEE 
EEESNPMEMLKAVGEMGKGLVRDIYLLKPPRFV*
>AT5G04600.1 |  RNA recognition motif (RRM)-containing protein 
MGAKAKKALKKNMKKVAASASSSQLPLPQNPKPSADFLPLEGGPARKAPVTTPPLQNKAT 
VLYIGRIPHGFYETEIEAFFSQFGTVKRVRVARNKKTGKSKHFGFIQFEDPEVAEIAAGA 
MNDYLLMEHMLKVHVIEPENVKPNLWRGFKCNFKPVDSVQIERRQLNKERTLEEHRKMLQ 
KIVKKDQKRRKRIEAAGIEYECPELVGNTQPVPKRIKFSEED*
>AT4G25630.1 |  FIB2 (FIBRILLARIN 2) snoRNA binding 
MRPPLTGSGGGFSGGRGRGGYSGGRGDGGFSGGRGGGGRGGGRGFSDRGGRGRGRGPPRG 
GARGGRGPAGRGGMKGGSKVIVEPHRHAGVFIAKGKEDALVTKNLVPGEAVYNEKRISVQ 
NEDGTKTEYRVWNPFRSKLAAAILGGVDNIWIKPGAKVLYLGAASGTTVSHVSDLVGPEG 
CVYAVEFSHRSGRDLVNMAKKRTNVIPIIEDARHPAKYRMLVGMVDVIFSDVAQPDQARI 
LALNASYFLKSGGHFVISIKANCIDSTVPAEAVFQTEVKKLQQEQFKPAEQVTLEPFERD 
HACVVGGYRMPKKPKAATAA*
>AT3G56990.1 |  EDA7 (embryo sac development arrest 7) 
MTSYGDRLKSTSINGVKLYNVSSAPNVPTWLNPKKQRALRKNPHYMQRVELIQELKFETA 
TTRIKATPDGEYLIASGIYPPQVKVYELNQLALKFERHLDSEIVDFEILDDDFSKLAFLC 
ADRSINLHAKYGKHHTLRIPRMGRDMTYDSWSCDLLCAASSPDLYRINLEQGRFLSPLST 
QSPALNVVSRSNLHGLVACGGEDGAVEFFDMRMKSSAARINAVTHGGDAAAEVTAIEFDD 
SEGLQVAVGSSAGKVFIYDLRTSTPIRVKDHMYESPILNIKWQRTLNTQQPKLITTDKHI 
VRIWDPNTGEGMTSIEPTQGGINDICVFRGSGLMLLALDSSLIPSYFIPELGPAPKWCSP 
LENLTEELEESAQTTIYDNYKFLAMEDLEKLQLTHLIGTDLLKASMHGYFINYHLYKKAL 
AVIEPFAFDDYLERRKQEKLEEQRTQRITKKRRLPKVNRDLAARLHGDESEEENKTAEDG 
EATKKVLKKKKPILTDEHFVDGRFGSMFQNPDFQIDKDSYEYGVLHPVASSKKQPSLLDE 
HFEAVSDDDENSDSDASQPSDDEADDGDATRPSKKARTPKLYEVKDERHAAAYHNRTSLA 
KEDSLPMGERVKAIENRRGNFGGSKDIKFGPGGSREFSFKARGSSKYKEDRDDEYEDGQR 
NKRRGVQSLGLKSTNIRGGFRGRGGGGFRGRGGGGSRGKGGRGGGRGRGRQ*
>AT4G13980.1 |  AT-HSFA5 DNA binding / transcription factor 
MNGALGNSSASVSGGEGAGGPAPFLVKTYEMVDDSSTDQIVSWSANNNSFIVWNHAEFSR 
LLLPTYFKHNNFSSFIRQLNTYGFRKIDPERWEFLNDDFIKDQKHLLKNIHRRKPIHSHS 
HPPASSTDQERAVLQEQMDKLSREKAAIEAKLLKFKQQKVVAKHQFEEMTEHVDDMENRQ 
KKLLNFLETAIRNPTFVKNFGKKVEQLDISAYNKKRRLPEVEQSKPPSEDSHLDNSSGSS 
RRESGNIFHQNFSNKLRLELSPADSDMNMVSHSIQSSNEEGASPKGILSGGDPNTTLTKR 
EGLPFAPEALELADTGTCPRRLLLNDNTRVETLQQRLTSSEETDGSFSCHLNLTLASAPL 
PDKTASQIAKTTLKSQELNFNSIETSASEKNRGRQEIAVGGSQANAAPPARVNDVFWEQF 
LTERPGSSDNEEASSTYRGNPYEEQEEKRNGSMMLRNTKNIEQLTL*
>AT1G72440.1 |  EDA25 (embryo sac development arrest 25) 
MSKIKPLSKSSQDLSLLTSDIASFASSIGLASALPSSGFNDTDFRKPAKSKTQKRKKPKK 
DQQHKDEDEEGEPKSNIGNEKGKDFGARKQNKDAPVKQTLQPKPKPGFLSIDDESTGYKK 
KRFDEFKSLPKLPLVKASLLSSEWYNDAAEFEEKVFGGRKVAVANKEDFKGVVEKKRELG 
ERLMWQYAEDFATSKGKGGDMKMVISAQKSGTVADKITAFEIMVGENPIANMRSLDALLG 
MVTSKVGKRFAFKGLKALSEILIRLLPDRKLKSLLQRPLNIIPENKDGYSLLLFWYWEDC 
LKQRYERFVTALDESSKDMLPELKDKALKTIYFMLTSKSEQERKLLVSLVNKLGDPQNKS 
ASNADYHLTNLLADHPNMKAVVIDEVDSFLFRPHLGLRAKYHAVNFLSQIRLSHKGEDPK 
VAKRLIDVYFALFKVLTTEANRKQGADDKGAADKKKSNPKDTKQEVSTDSPIELDSRILS 
ALLTGVNRAFPYVSTDEADDIIESQTPVLFKLVHSANFNVGVQSLMLLDKISSKNKIVSD 
RFYRALYSKLLLPSAMNSSKAEMFIGLLLRAMKNDINIKRVAAFSKRVLQVALQQPPQYA 
CGCLFLLSEVLKSRPPLWKMVVQRESVEEEEDIEHFEDVIEGDDVDPNKKAENDENVVEV 
DHDGVEKSSRDGDSSSDDEEALAIRLSDEEDDNASDDSEELIRNETPQLEEVMEVSNDME 
KRSQPPMRPSSLPGGYDPRHREPSYCNADRASWWELGVLSKHAHPSVATMAGTLLSGTNI 
VYNGNPLNDLSLTAFLDKFMEKKPKQNTWHGGSQIEPSKKLDMSNRVIGAEILSLAEGDV 
APEDLVFHKFYVNKMTSTKQSKKKKKKKLPEEEAAEELYDVNDGDGGENYDSDVEFEAGD 
ESDNEEIENMLDDVDDNAVEEEGGEYDYDDLDGVAGEDDEELVADVSDAEMDTDMDMDLI 
DDEDDNNVDDDGTGDGGDDDSDGDDGRSKKKKKEKRKRKSPFASLEEYKHLIDQDEKEDS 
KTKRKATSEPTKKKKKKKSKASE*
>AT3G57150.1 |  NAP57 (Arabidopsis thaliana homologue of NAP57) pseudouridine synthase 
MAEVDISHSKKKKQDKTENDAADTGDYMIKPQSFTPAIDTSQWPILLKNYDRLNVRTGHY 
TPISAGHSPLKRPLQEYIRYGVINLDKPANPSSHEVVAWIKRILRVEKTGHSGTLDPKVT 
GNLIVCIDRATRLVKSQQGAGKEYVCVARLHSAVPDVAKVARALESLTGAVFQRPPLISA 
VKRQLRIRTIYESKLLEYDADRHLVVFWVSCEAGTYIRTMCVHLGLLLGVGGHMQELRRV 
RSGILGENNNMVTMHDVMDAQFVYDNSRDESYLRRVIMPLEMILTSYKRLVVKDSAVNAI 
CYGAKLMIPGLLRFENDIDVGTEVVLMTTKGEAIAVGIAEMTTSVMATCDHGVVAKIKRV 
VMDRDTYPRKWGLGPRASMKKKLIADGKLDKHGKPNEKTPVEWSRNVVLPTGGDAIIAGA 
AAAPEEIKADAENGEAGEARKRKHDDSSDSPAPVTTKKSKTKEVEGEEAEEKVKSSKKKK 
KKDKEEEKEEEAGSEKKEKKKKKDKKEEVIEEVASPKSEKKKKKKSKDTEAAVDAEDESA 
AEKSEKKKKKKDKKKKNKDSEDDEE*
>AT4G19610.1 |  RNA binding / nucleic acid binding / nucleotide binding 
MSRICVKNLPKHVKEDQLRDHFSQKGEITDAKLMRSNDGKSRQFGFIGFRSAQEAQQAIK 
YFNNTYLGTSLIIVEIAHKVGDENAPRPWSRLSHKKEEEAKKSSSEGLKDGNAKGGKKRK 
AEVDDPEFQEFLEVHQRSKSKIWSNDMSIPPAPEETGKEKVLVKKADEQIVSNGVEPKKA 
KKSSDTEKTKKSKVVAASDDVSDMEYFKSRIKKNLSDSESDNESEDSSEDEAGDDDGKAE 
TDGQDADIRYFPIDGDVEAGGVGKDDDGDAMEVEGDGKVAQESKAVSDDVLDTGRLFVRN 
LPYTATEEELMEHFSTFGKISEVHLVLDKETKRSRGIAYILYLIPECAARAMEELDNSSF 
QGRLLHILPAKHRETSDKQVNDTSNLPKTFKQKREEQRKASEAGGDTKAWNSLFMRPDTI 
LENIVRVYGVSKSELLDREAEDPAVRLALGETKVIAETKEALAKAGVNVTSLEKFATRNG 
DEKNRSKHILLVKNLPFASTEKELAQMFGKFGSLDKIILPPTKTMALAVFLEPAEARAAL 
KGMAYKRYKDAPLYLEWAPGNILEPKNLPDTNEERSDIEENGVRRVNLEQQVEIDPDVTE 
SNVLNVKNLSFKTTDEGLKKHFTKLVKQGKILSVTIIKHKKNEKYLSSGYGFVEFDSVET 
ATSVYRDLQGTVLDGHALILRFCENKRSDKVGKDSNKDKPCTKLHVKNIAFEATKRELRQ 
LFSPFGQIKSMRLPKKNIGQYAGYAFVEFVTKQEALNAKKALASTHFYGRHLVLEWANDD 
NSMEAIRKRSAAKFDEENDNARKRKSSKAVEGKNEV*
>AT5G08180.1 |  ribosomal protein L7Ae/L30e/S12e/Gadd45 family protein 
MGSDTEAEKSIQKEKKKYAITLAPIAKPLAGKKLQKRTFKLIQKAAGKKCLKRGVKEVVK 
SIRRGQKGLCVIAGNISPIDVITHLPILCEEAGVPYVYVPSKEDLAQAGATKRPTCCVLV 
MLKPAKGDLTAEELAKLKTDYEQVSDDIKELATSVI*
>AT1G48920.1 |  ATNUC-L1 nucleic acid binding / nucleotide binding 
MGKSKSATKVVAEIKATKPLKKGKREPEDDIDTKVSLKKQKKDVIAAVQKEKAVKKVPKK 
VESSDDSDSESEEEEKAKKVPAKKAASSSDESSDDSSSDDEPAPKKAVAATNGTVAKKSK 
DDSSSSDDDSSDEEVAVTKKPAAAAKNGSVKAKKESSSEDDSSSEDEPAKKPAAKIAKPA 
AKDSSSSDDDSDEDSEDEKPATKKAAPAAAKAASSSDSSDEDSDEESEDEKPAQKKADTK 
ASKKSSSDESSESEEDESEDEEETPKKKSSDVEMVDAEKSSAKQPKTPSTPAAGGSKTLF 
AANLSFNIERADVENFFKEAGEVVDVRFSTNRDDGSFRGFGHVEFASSEEAQKALEFHGR 
PLLGREIRLDIAQERGERGERPAFTPQSGNFRSGGDGGDEKKIFVKGFDASLSEDDIKNT 
LREHFSSCGEIKNVSVPIDRDTGNSKGIAYLEFSEGKEKALELNGSDMGGGFYLVVDEPR 
PRGDSSGGGGFGRGNGRFGSGGGRGRDGGRGRFGSGGGRGRDGGRGRFGSGGGRGSDRGR 
GRPSFTPQGKKTTFGDE*
>AT3G05060.1 |  SAR DNA-binding protein putative 
MVLVLYETAAGFALFKVKDEGKMANVEDLCKEFDTPDSARKMVKLKAFEKFDNTSEALEA 
VAKLLEGAPSKGLRKFLKANCQGETLAVADSKLGNVIKEKLKIDCIHNNAVMELLRGVRS 
QFTELISGLGDQDLAPMSLGLSHSLARYKLKFSSDKVDTMIIQAIGLLDDLDKELNTYAM 
RVREWYGWHFPELAKIISDNILYAKSVKLMGNRVNAAKLDFSEILADEIEADLKDAAVIS 
MGTEVSDLDLLHIRELCDQVLSLSEYRAQLYDYLKSRMNTIAPNLTALVGELVGARLISH 
GGSLLNLSKQPGSTVQILGAEKALFRALKTKHATPKYGLIFHASLVGQAAPKHKGKISRS 
LAAKTVLAIRVDALGDSQDNTMGLENRAKLEARLRNLEGKDLGRLSGSSKGKPKIEVYNK 
DKKMGSGGLITPAKTYNTAADSLLGETSAKSEEPSKKKDKKKKKKVEEEKPEEEEPSEKK 
KKKKAEAETEAVVEVAKEEKKKNKKKRKHEEEETTETPAKKKDKKEKKKKSKD*
>AT3G12860.1 |  nucleolar protein Nop56 putative 
MKIYLLSESPSGYGLFEGHGSDEIGQNTEAVRSSVSDLSRFGRVVQLTAFHPFQSALDAL 
NQINAVSEGYMSDELRSFLELNLPKVKEGKKPKFSLGVSEPKIGSCIFEATKIPCQSNEF 
VHELLRGVRQHFDRFIKDLKPGDLEKAQLGLAHSYSRAKVKFNVNRVDNMVIQAIFMLDT 
LDKDINSFAMRVREWYSWHFPELVKIVNDNYLYAKVSKIIVDKSKLSEEHIPMLTEALGD 
EDKAREVIEAGKASMGQDLSPVDLINVQTFAQRVMDLADYRKKLYDYLVTKMSDIAPNLA 
ALIGEMVGARLISHAGSLTNLAKCPSSTLQILGAEKALFRALKTRGNTPKYGLIFHSSFI 
GRASAKNKGRIARFLANKCSIASRIDCFSDNSTTAFGEKLREQVEERLDFYDKGVAPRKN 
VDVMKEVLENLEKKDEGEKTVDASEKKKKRKTEEKEEEKEEEKSKKKKKKSKAVEGEELT 
ATDNGHSKKKKKTKSQDDE*
>AT3G62310.1 |  RNA helicase putative 
MGTERKRKISLFDVMDDPSAPAKNAKTSGLPDGGINSLINKWNGKPYSQRYYDILEKRRT 
LPVWLQKEEFLKTLNNNQTLILVGETGSGKTTQIPQFVIDAVDAETSDKRRKWLVGCTQP 
RRVAAMSVSRRVAEEMDVTIGEEVGYSIRFEDCSSPRTVLKYLTDGMLLREAMADPLLER 
YKVIILDEAHERTLATDVLFGLLKEVLKNRPDLKLVVMSATLEAEKFQDYFSGAPLMKVP 
GRLHPVEIFYTQEPERDYLEAAIRTVVQIHMCEPPGDILVFLTGEEEIEDACRKINKEVG 
NLGDQVGPIKVVPLYSTLPPAMQQKIFDPAPEPVTEGGPPGRKIVVSTNIAETSLTIDGI 
VYVIDPGFAKQKVYNPRIRVESLLVSPISKASAHQRSGRAGRTRPGKCFRLYTEKSFNND 
LQPQTYPEILRSNLANTVLTLKKLGIDDLVHFDFMDPPAPETLMRALEVLNYLGALDDDG 
NLTKTGEIMSEFPLDPQMAKMLIVSPEFNCSNEILSVSAMLSVPNCFIRPREAQKAADEA 
KARFGHIEGDHLTLLNVYHAFKQNNEDPNWCYENFINNRAMKSADNVRQQLVRIMSRFNL 
KMCSTDFNSRDYYINIRKAMLAGYFMQVAHLERTGHYLTVKDNQVVHLHPSNCLDHKPEW 
VIYNEYVLTSRNFIRTVTDIRGEWLVDVASHYYDLSNFPNCEAKRVIEKLYKKREREKEE 
SKKNRK*
>AT1G56110.1 |  NOP56 (Arabidopsis homolog of nucleolar protein Nop56) 
MAMYVIYESSSGYGLFEVHGLDEIGQNTEAVRTSVSDLSRFGRVVQLTAFHPFESALDAL 
NQVNAVSEGVMTDELRSFLELNLPKVKEGKKPKFSLGLAEPKLGSHIFEATKIPCQSNEF 
VLELLRGVRQHFDRFIKDLKPGDLEKSQLGLAHSYSRAKVKFNVNRVDNMVIQAIFMLDT 
LDKDINSFAMRVREWYSWHFPELVKIVNDNYLYARVSKMIDDKSKLTEDHIPMLTEVLGD 
EDKAKEVIEAGKASMGSDLSPLDLINVQTFAQKVMDLADYRKKLYDYLVTKMSDIAPNLA 
ALIGEMVGARLISHAGSLTNLAKCPSSTLQILGAEKALFRALKTRGNTPKYGLIFHSSFI 
GRASAKNKGRIARYLANKCSIASRIDCFADGATTAFGEKLREQVEERLEFYDKGVAPRKN 
VDVMKEVIENLKQEEEGKEPVDASVKKSKKKKAKGEEEEEVVAMEEDKSEKKKKKEKRKM 
ETAEENEKSEKKKTKKSKAGGEEETDDGHSTKKKKKKSKSAE*
>AT4G05410.1 |  transducin family protein / WD-40 repeat family protein 
MKYNNEKKKGGSFKRGGKKGSNERDPFFEEEPKKRRKVSYDDDDIESVDSDAEENGFTGG 
DEDGRRVDGEVEDEDEFADETAGEKRKRLAEEMLNRRREAMRREREEADNDDDDDEDDDE 
TIKKSLMQKQQEDSGRIRRLIASRVQEPLSTDGFSVIVKHRRSVVSVALSDDDSRGFSAS 
KDGTIMHWDVSSGKTDKYIWPSDEILKSHGMKLREPRNKNHSRESLALAVSSDGRYLATG 
GVDRHVHIWDVRTREHVQAFPGHRNTVSCLCFRYGTSELYSGSFDRTVKVWNVEDKAFIT 
ENHGHQGEILAIDALRKERALTVGRDRTMLYHKVPESTRMIYRAPASSLESCCFISDNEY 
LSGSDNGTVALWGMLKKKPVFVFKNAHQDIPDGITTNGILENGDHEPVNNNCSANSWVNA 
VATSRGSDLAASGAGNGFVRLWAVETNAIRPLYELPLTGFVNSLAFAKSGKFLIAGVGQE 
TRFGRWGCLKSAQNGVAIHPLRLA*
>AT5G66540.1 |  FUNCTIONS IN molecular_function unknown INVOLVED IN rRNA processing LOCATED IN cytosol nucleolus nucleus EXPRESSED IN 22 plant structures EXPRESSED DURING 12 growth stages CONTAINS InterPro DOMAIN/s U3 small nucleolar ribonucleoprotein complex subunit Mpp10p (InterProIPR012173) Mpp10 protein (InterProIPR007151) Has 76240 Blast hits to 38667 proteins in 1479 species Archae - 252 Bacteria - 6537 Metazoa - 31185 Fungi - 9935 Plants - 3937 Viruses - 750 Other Eukaryotes - 23644 (source NCBI BLink) 
MATVKDSGFEALEKLKATEPPVFLAPSSISEDARSASQYLFMKLKPHNPKCPFDQLSSDG 
FDAEQIWQQIDMQSQPLLTSLRQEVKRFAKNPEEIRKLGKLALKVSHEDDIDEMDMDGFD 
SDDVDDEDKEIESNDSEGEDEEEEEEDEEEEEEEEEEEEEEKDGDNEGIEDKFFKIKELE 
EFLEEGEAEEYGIDHKNKKGVAQRKKQNLSDDEDEEDDDDEEEDVEFDAFAGGDDEETDK 
LGKARYDDFFGGKKETKMKLKDLSEDEEAEIENKGNEKLSTHERARLKLQSKIEQMEKAN 
LDPKHWTMQGEITAAKRPMNSALEVDLDFEHNARPAPVITEEVTASLEDLIKSRIIEARF 
DDVQRAPRLPTKGKREAKELDESKSKKGLAEVYEAEYFQKANPAFAPTTHSDELKKEASM 
LFKKLCLKLDALSHFHFTPKPVIEEMSIPNVSAIAMEEVAPVAVSDAAMLAPEEIFSGKG 
DIKDESELTQEDRKRRRANKKRKFKAESANEPPKKALDTSTKNP*
>AT4G22380.1 |  ribosomal protein L7Ae/L30e/S12e/Gadd45 family protein 
MTGEVVNPKAYPLADSQLSITIMDLVQQATNYKQLKKGANEATKTLNRGISEFVVMAADA 
EPLEILLHLPLLAEDKNVPYVFVPSKQALGRACGVTRPVIACSVTSNEASQLKSQIQHLK 
DAIEKLLI*
>AT2G47990.1 |  SWA1 (SLOW WALKER1) nucleotide binding 
MEEELRVRLNDHQVSKVFPVKPKSTAKPVSESETPESRYWSSFKNHSTPNLVSSVAALAF 
SPVHPHSLAVAHSATVSLFSSQSLSSSRRFSFRDVVSSVCFRSDGALFAACDLSGVVQVF 
DIKERMALRTLRSHSAPARFVKYPVQDKLHLVSGGDDGVVKYWDVAGATVISDLLGHKDY 
VRCGDCSPVNDSMLVTGSYDHTVKVWDARVHTSNWIAEINHGLPVEDVVYLPSGGLIATA 
GGNSVKVWDLIGGGKMVCSMESHNKTVTSLRVARMESAESRLVSVALDGYMKVFDYGRAK 
VTYSMRFPAPLMSLGLSPDGSTRVIGGSNGMVFAGKKKVRDVVGGQKKSLNLWSLISDVD 
ESRRRALRPTYFRYFQRGQSEKPSKDDYLVKEKKGLKLTRHDKLLKKFRHKEALVSVLEE 
KKPANVVAVMEELVARRKLMKCVSNMEEGELGMLLGFLQRYCTVQRYSGLLMGLTKKVLE 
TRAEDIKGKNEFKGLLRNLKREVNQEIRIQQSLLEIQGVIAPLMRIAGRS*
>AT1G50920.1 |  GTP-binding protein-related 
MVQYNFKRITVVPNGKEFVDIILSRTQRQTPTVVHKGYKINRLRQFYMRKVKYTQTNFHA 
KLSAIIDEFPRLEQIHPFYGDLLHVLYNKDHYKLALGQVNTARNLISKISKDYVKLLKYG 
DSLYRCKCLKVAALGRMCTVLKRITPSLAYLEQIRQHMARLPSIDPNTRTVLICGYPNVG 
KSSFMNKVTRADVDVQPYAFTTKSLFVGHTDYKYLRYQVIDTPGILDRPFEDRNIIEMCS 
ITALAHLRAAVLFFLDISGSCGYTIAQQAALFHSIKSLFMNKPLVIVCNKTDLMPMENIS 
EEDRKLIEEMKSEAMKTAMGASEEQVLLKMSTLTDEGVMSVKNAACERLLDQRVEAKMKS 
KKINDHLNRFHVAIPKPRDSIERLPCIPQVVLEAKAKEAAAMEKRKTEKDLEEENGGAGV 
YSASLKKNYILQHDEWKEDIMPEILDGHNVADFIDPDILQRLAELEREEGIREAGVEEAD 
MEMDIEKLSDEQLKQLSEIRKKKAILIKNHRLKKTVAQNRSTVPRKFDKDKKYTTKRMGR 
ELSAMGLDPSSAMDRARSKSRGRKRDRSEDAGNDAMDVDDEQQSNKKQRVRSKSRAMSIS 
RSQSRPPAHEVVPGEGFKDSTQKLSAIKISNKSHKKRDKNARRGEADRVIPTLRPKHLFS 
GKRGKGKTDRR*
>AT1G06720.1 |  INVOLVED IN ribosome biogenesis LOCATED IN nucleus EXPRESSED IN 21 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s AARP2CN (InterProIPR012948) Protein of unknown function DUF663 (InterProIPR007034) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT1G424401) Has 7944 Blast hits to 5342 proteins in 373 species Archae - 33 Bacteria - 667 Metazoa - 2609 Fungi - 1045 Plants - 414 Viruses - 80 Other Eukaryotes - 3096 (source NCBI BLink) 
MAADELMPSHRSHRTPKSGPTARKKSELDKKKRGISVDKQKNLKAFGVKSVVHAKKAKHH 
AAEKEQKRLHLPKIDRNYGEAPPFVVVVQGPPGVGKSLVIKSLVKEFTKQNVPEVRGPIT 
IVQGKQRRFQFVECPNDINAMVDCAKVADLALLVVDGSYGFEMETFEFLNIMQVHGFPRV 
MGVLTHLDKFNDVKKLRKTKHHLKHRFWTEIYHGAKLFYLSGLIHGKYTPREVHNLARFV 
IVIKPQPLTWRTAHPYVLVDRLEDVTPPEKVQMDKKCDRNITVFGYLRGCNFKKRMKVHI 
AGVGDFIVAGVTALTDPCPLPSAGKKKGLRDRDKLFYAPMSGIGDLVYDKDAVYININSH 
QVQYSKTDDGKGEPTNKGKGRDVGEDLVKSLQNTKYSVDEKLDKTFINFFGKKTSASSET 
KLKAEDAYHSLPEGSDSESQSGDDEEDIVGNESEMKQETEIHGGRLRRKAIFKTDLNEDD 
FEEADDLELDSYDPDTYDFEEADDAESDDNEVEDGGDDSASDSADGEPGDYQIDDKDSGN 
ISQWKAPLKEIARKKNPNLMQIVYGASSLATPLINENHDISDDDESDDEDFFKPKGEQHK 
NLGGGLDVGYVNSEDCSKFVNYGYLKNWKEKEVCESIRDRFTTGDWSKAALRDKNLGTGG 
EGEDDELYGDFEDLETGEKHKSHENLESGANENEDEDAEVVERDGNNPRSQADEPGYADK 
LKEAQEITKQRNELEYNDLDEETRIELAGFRTGTYLRLEIHNVPYEMVEFFDPCHPILVG 
GIGFGEDNVGYMQARLKKHRWHKKVLKTRDPIIVSIGWRRYQTIPVFAIEDRNGRHRMLK 
YTPEHMHCLASFWGPLVPPNTGFVAFQNLSNNQAGFRITATSVVLEFNHQARIVKKIKLV 
GTPCKIKKKTAFIKDMFTSDLEIARFEGSSVRTVSGIRGQVKKAGKNMLDNKAEEGIARC 
TFEDQIHMSDMVFLRAWTTVEVPQFYNPLTTALQPRDKTWNGMKTFGELRRELNIPIPVN 
KDSLYKAIERKQKKFNPLQIPKRLEKDLPFMSKPKNIPKRKRPSLEDKRAVIMEPKERKE 
HTIIQQFQLLQHHTMKKKKATDQKKRKEYEAEKAKNEEINKKRRREERRDRYREEDKQKK 
KTRRSLD*
>AT1G15420.1 |  unknown protein 
MAKDKLKPLLSSDAAGDIADTPLREKKHKKKSKKRAEPEPDIPSTRDSGLDEDRDGVLVD 
DTLNEPTIGDKLESLDLLNGEKVNSEESNRDSAPGDDKPPTAASVNVLLRQALHADDRSL 
LLDCLYNRDEQVIANSVAKLNSAEVLKLLNALLPILQSRGAILACTIPWIKSLLLTHSSG 
IMSQESSLLALNTMYQLIESRVSTIHTAVEVSSGLDLIVDDLDEEEDEGPVIYEDKDSDE 
DEEEGIEEAMETDEEADDSADEAADGVNDFEGFDDMSD*
>AT1G15440.1 |  transducin family protein / WD-40 repeat family protein 
MEFRFENLLGAPYRGGNAVITKNTQLISPVGNRVSVTDLSKNHSVTLPLETSTNICRLAS 
SPDGTFLLAVDEQNRCLFINLPRRVVLHRITFKDKVGALKFSPNGKFIAVGIGKLVEIWR 
SPGFRRAVLPFERVRTFANSDDKVVSLEWSLDSDYLLVGSRDLAARLFCVRKLKGVLNKP 
FLFLGHRDSVVGCFFGVDKMTNKVNRAFTIARDGYIFSWGYTEKDVKMDESEDGHSEPPS 
PVTPDRADEVMVENGGGVGTELKKRKEYDGKGLESDEEGDDDDEEYMHRGKWVLLRKDGC 
NQASAKVTACDYHQGLDMVVVGFSNGVFGLYQMPDFICIHLLSISRQKLTTAVFNERGNW 
LTFGCAKLGQLLVWDWRTETYILKQQGHYFDVNCVTYSPDSQLLATGADDNKVKVWNVMS 
GTCFITFTEHTNAVTALHFMADNHSLLSASLDGTVRAWDFKRYKNYKTYTTPTPRQFVSL 
TADPSGDVVCAGTLDSFEIFVWSKKTGQIKDILSGHEAPVHGLMFSPLTQLLASSSWDYT 
VRLWDVFASKGTVETFRHNHDVLTVAFRPDGKQLASSTLDGQINFWDTIEGVLMYTIEGR 
RDIAGGRVMTDRRSAANSSSGKCFTTLCYSADGGYILAAGTSRYICMYDIADQVLLRRFQ 
ISHNLSLDGVLDFLHSKKMTEAGPIDLIDDDNSDEEGGIDKQSRGNLGYDLPGSRPNRGR 
PIIRTKSLSIAPTGRSFAAATTEGVLIFSIDDTFIFDPTDLDIDVTPEAVEAAIEEDEVS 
RALALSMRLNEDSLIKKCIFAVAPADIKAVAISVRQKYLERLMEALVDLLENCPHLEFIL 
HWCQEICKAHGSSIQRNYRTLLPALRSLQKAITRAHQDLADMCSSNEYTLRYLCSVPNNH 
*
>AT1G15440.2 |  transducin family protein / WD-40 repeat family protein 
MEFRFENLLGAPYRGGNAVITKNTQLISPVGNRVSVTDLSKNHSVTLPLETSTNICRLAS 
SPDGTFLLAVDEQNRCLFINLPRRVVLHRITFKDKVGALKFSPNGKFIAVGIGKLVEIWR 
SPGFRRAVLPFERVRTFANSDDKVVSLEWSLDSDYLLVGSRDLAARAFTIARDGYIFSWG 
YTEKDVKMDESEDGHSEPPSPVTPDRADEVMVENGGGVGTELKKRKEYDGKGLESDEEGD 
DDDEEYMHRGKWVLLRKDGCNQASAKVTACDYHQGLDMVVVGFSNGVFGLYQMPDFICIH 
LLSISRQKLTTAVFNERGNWLTFGCAKLGQLLVWDWRTETYILKQQGHYFDVNCVTYSPD 
SQLLATGADDNKVKVWNVMSGTCFITFTEHTNAVTALHFMADNHSLLSASLDGTVRAWDF 
KRYKNYKTYTTPTPRQFVSLTADPSGDVVCAGTLDSFEIFVWSKKTGQIKDILSGHEAPV 
HGLMFSPLTQLLASSSWDYTVRLWDVFASKGTVETFRHNHDVLTVAFRPDGKQLASSTLD 
GQINFWDTIEGVLMYTIEGRRDIAGGRVMTDRRSAANSSSGKCFTTLCYSADGGYILAAG 
TSRYICMYDIADQVLLRRFQISHNLSLDGVLDFLHSKKMTEAGPIDLIDDDNSDEEGGID 
KQSRGNLGYDLPGSRPNRGRPIIRTKSLSIAPTGRSFAAATTEGVLIFSIDDTFIFDPTD 
LDIDVTPEAVEAAIEEDEVSRALALSMRLNEDSLIKKCIFAVAPADIKAVAISVRQKYLE 
RLMEALVDLLENCPHLEFILHWCQEICKAHGSSIQRNYRTLLPALRSLQKAITRAHQDLA 
DMCSSNEYTLRYLCSVPNNH*
>AT1G31660.1 |  FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; EXPRESSED IN: 21 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: Bystin (InterPro:IPR007955); Has 370 Blast hits to 362 proteins in 156 species: Archaea - 0; Bacteria - 7; Metazoa - 139; Fungi - 93; Plants - 32; Viruses - 0; Other Eukaryotes - 99 (source: NCBI BLink)
MAKKRDRIVNTQPFISDDASVASSRKRSKVPKTHQKQEKLIEAGMSEKIMKQALAQQKEV 
ADEENAERNPSSAAFAVAGAATAGEEQKILEEEEDDIDDFDGTFENQSQFDKQEEINEDD 
EKLFESFLNKNAPPQRTLTDIIIKKLKDKDADLAEEERPDPKMDPAITKLYKGVGKFMSE 
YTVGKLPKAFKLVTSMEHWEDVLYLTEPEKWSPNALYQATRIFASNLKDRQVQRFYNYVL 
LPRVREDIRKHKKLHFALYQALKKSLYKPSAFNQGILFPLCKSGTCNLREAVIIGSILEK 
CSIPMLHSCVALNRLAEMDYCGTTSYFIKVLLEKKYCMPYRVLDALVAHFMRFVDDIRVM 
PVIWHQSLLTFVQRYKYEILKEDKEHLQTLLQRQKHHLVTPEILRELKDSRNRGEKEDPM 
VDNFAPVPAKEDRFDIPEVPMEED*
>AT1G42440.1 |  FUNCTIONS IN: molecular_function unknown; INVOLVED IN: ribosome biogenesis; LOCATED IN: nucleus; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: AARP2CN (InterPro:IPR012948), Protein of unknown function DUF663 (InterPro:IPR007034); BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G06720.1); Has 2447 Blast hits to 1812 proteins in 205 species: Archaea - 0; Bacteria - 115; Metazoa - 903; Fungi - 423; Plants - 100; Viruses - 53; Other Eukaryotes - 853 (source: NCBI BLink)
MGRSRVQVNKAHKTRFSSKSSRNLHRTNLQDSGRIGKSDSNYVKGAKAARVQRGKMLREQ 
KRAAVLKEKRASGGINSAPRVIVLFPLSASVELNSLGEDVLKLLSSDGSGIASSTVASSE 
YKLRATVLKAPHGDLLTCMEMAKVADLMAFVASASAPWEENSSNFIDSFGSQCLSVFRSI 
GLPSTTVLIRDLPSDVKKKNEMKKMCASQLASEFPEDCKFYPADTRDELHKFMWLFKAQR 
LTVPHWRSQRSYIVARKAGMLVDDESSGKCTLLLSGYLRARKLSVNQLVHVSGVGDFQFS 
KIEVLKDPFPLNERKNQNSMELDDSHDEEVLKSLVPDPMKQEPLVIENTPDPLAGEQTWP 
TEEEMAEADKNQKQGRLKKKTLPRGTSEYQAAWIVDETDEEDSDNGDSDDNGMVLDRGED 
SNQEGMYDQEFEDDGKSLNLRDIDTETQNESEMVDDEDLTEEQIKDEIKKIKEAYADDEE 
FPDEVETPIDVPARRRFAKYRGLKSFRTSSWDPNESLPQDYARIFAFDNVARTQKLVLKQ 
ALKMEEEDRDDCVPIGSYVRLHIKEVPLGAASKLSSLVNTTKPIIGFGLLQHESKMSVLH 
FSVKKYDGYEAPIKTKEELMFHVGFRQFIARPVFATDNFSSDKHKMERFLHPGCFSLASI 
YGPISFPPLPLVVLKISEGSDPPAIAALGSLKSVEPNKIILKKIILTGYPQRVSKMKASV 
RYMFHNPEDVKWFKPVEVWSKCGRRGRVKEPVGTHGAMKCIFNGVVQQHDVVCMNLYKRA 
YPKWPERLYPQLL*
>AT1G55150.1 |  DEAD box RNA helicase, putative (RH20)
MSRYDSRTGDSTSYRDRRSDSGFGGTSSYGSSGSHTSSKKDNDGNESPRKLDLDGLTPFE 
KNFYVESPAVAAMTDTEVEEYRKLREITVEGKDIPKPVKSFRDVGFPDYVLEEVKKAGFT 
EPTPIQSQGWPMAMKGRDLIGIAETGSGKTLSYLLPAIVHVNAQPMLAHGDGPIVLVLAP 
TRELAVQIQQEASKFGSSSKIKTTCIYGGVPKGPQVRDLQKGVEIVIATPGRLIDMMESN 
NTNLRRVTYLVLDEADRMLDMGFDPQIRKIVSHIRPDRQTLYWSATWPKEVEQLSKKFLY 
NPYKVIIGSSDLKANRAIRQIVDVISESQKYNKLVKLLEDIMDGSRILVFLDTKKGCDQI 
TRQLRMDGWPALSIHGDKSQAERDWVLSEFRSGKSPIMTATDVAARGLDVKDVKYVINYD 
FPGSLEDYVHRIGRTGRAGAKGTAYTFFTVANARFAKELTNILQEAGQKVSPELASMGRS 
TAPPPPGLGGFRDRGSRRGWS*
>AT5G44740.1 |  POLH (Y-FAMILY DNA POLYMERASE H); DNA-directed DNA polymerase
MRLKLLVLRFNWFKFLWLVVVSILAKSGKCERASIDEVYLDLTDAAESMLADAPPESLEL 
IDEEVLKSHILGMNREDGDDFKESVRNWICREDADRRDKLLSCGIIIVAELRKQVLKETE 
FTCSAGIAHNKMLAKLASGMNKPAQQTVVPYAAVQELLSSLPIKKMKQLGGKLGTSLQTD 
LGVDTVGDLLQFSETKLQEHYGVNTGTWLWNIARGISGEEVQGRLLPKSHGSGKTFPGPR 
ALKSLSTVQHWLNQLSEELSERLGSDLEQNKRIASTLTLHASAFRSKDSDSHKKFPSKSC 
PMRYGVTKIQEDAFNLFQAALREYMGSFGIKPQGNKLETWRITGLSVSASKIVDIPSGTS 
SIMRYFQSQPTVPSRSADGCVQGNVAMTASASEGCSEQRSTETQAAMPEVDTGVTYTLPN 
FENQDKDIDLVSEKDVVSCPSNEATDVSTQSESNKGTQTKKIGRKMNNSKEKNRGMPSIV 
DIFKNYNATPPSKQETQEDSTVSSASKRAKLSSSSHNSQVNQEVEESRETDWGYKTDEID 
QSVFDELPVEIQRELRSFLRTNKQFNTGKSKGDGSTSSIAHYFPPLNR*
>AT5G44740.2 |  POLH (Y-FAMILY DNA POLYMERASE H); DNA-directed DNA polymerase
MPVARPEASDARVIAHVDMDCFYVQVEQRKQPELRGLPSAVVQYNEWQGGGLIAVSYEAR 
KCGVKRSMRGDEAKAACPQIQLVQVPVARGKADLNLYRSAGSEVVSILAKSGKCERASID 
EVYLDLTDAAESMLADAPPESLELIDEEVLKSHILGMNREDGDDFKESVRNWICREDADR 
RDKLLSCGIIIVAELRKQVLKETEFTCSAGIAHNKMLAKLASGMNKPAQQTVVPYAAVQE 
LLSSLPIKKMKQLGGKLGTSLQTDLGVDTVGDLLQFSETKLQEHYGVNTGTWLWNIARGI 
SGEEVQGRLLPKSHGSGKTFPGPRALKSLSTVQHWLNQLSEELSERLGSDLEQNKRIAST 
LTLHASAFRSKDSDSHKKFPSKSCPMRYGVTKIQEDAFNLFQAALREYMGSFGIKPQGNK 
LETWRITGLSVSASKIVDIPSGTSSIMRYFQSQPTVPSRSADGCVQGNVAMTASASEGCS 
EQRSTETQAAMPEVDTGVTYTLPNFENQDKDIDLVSEKDVVSCPSNEATDVSTQSESNKG 
TQTKKIGRKMNNSKEKNRGMPSIVDIFKNYNATPPSKQETQEDSTVSSASKRAKLSSSSH 
NSQVNQEVEESRETDWGYKTDEIDQSVFDELPVEIQRELRSFLRTNKQFNTGKSKGDGST 
SSIAHYFPPLNR*
>AT2G21440.1 |  RNA recognition motif (RRM)-containing protein 
MGKNKRERKDGEEKSPHAAATVCVSGLPYSITNAQLEEAFSEVGPVRRCFLVTNKGSDEH 
RGFAFVKFALQEDVNRAIELKNGSTVGGRRITVKQAAHRPSLQERRTKAAEGISVPDNSQ 
GQSDKDTSIPETDEKVSPPEKKLEKPVERKKVEKPIERKQVEKPVERKKAEKPIELKQVE 
KPFERKQVEKPVERKQVEKPVERKQVEKPIERKRPTKLHVDLPDKETCSDKQRVARTVIF 
GGLANAEMAEVVHSRVKEIGTVCSVRYPLPKEELQQNGLTQDGCRAEASAVLFTSVKSAC 
AAVAKLHQTEVKGNLIWARQLGGEGSKAQKWKLIIRNLPFQAKPSDIKVVFSAVGFVWDV 
FIPKNFETGLPKGFAFVKFTCKKDAANAIKKFNGHMFGKRPIAVDWAVPKNIYNGAADAT 
TASADGDKEGSDGDSENSSVDLEEVDEAVESHPPPGDDTDDDEDGSNKLTESDALDKDVG 
TDMNFEDEADVARKVLKNLLASSKGSTATPEGETEESDKSKLKSSSTKPVADSSGVSEPL 
KSGKTKVVAPKETQDNDDFERTLFIRNLPFDVTKEEVKQRFTVFGEVESLSLVLHKVTKR 
PEGTAFVKFKTADASVAAISAADTASGVGVLLKGRQLNVMRAVGKKAAKDIELKKTEEKN 
VDHRNLYLAKEGQILDDTPAAEGVSAEDMDKRRRLHENKMKMLQSPNFHVSRTRLVIYNL 
PKSMNPKQLNRLLVDAVTSRATKQKPCIRQIKFLQNEKKGKVDTKNYSRGVAFVEFTEHE 
HALVALRVLNNNPETFGPQHRPVIEFAVDNVQKLKIREAKQQQFQQREKHNESDQQQANG 
EAQAPDNKYKRKTREGDNTGPRKENAARFKKGPREESKEEAKSNIAVKDNAAEKKRPIRT 
QEKPSSNKKGQLMRQKETTEKPDPKISKDLSEPRKRKFGEDRGEENRNGQRKRKKQGQGQ 
GGAEVVDKLDLLIEKYRSKFSQSSAKTGPQKQSSGQVRRWFES*
>AT2G34357.1 |  binding 
MELLCDDIGTSMCLTPSEPDLPVSEDFGEYMRSRLSQSKRPDHEHLCAVIEELSKTLAED 
NHRRTPVAYFACTCRSLDSLFSAHAEPPVDVVQPHIVILSLVFPKVSAGVLKRDGLALRL 
VLNVLRLKSATPECLISGLKCLVHLLTTVESIMVNEGSDSYNILLNFVTHSDGKVRKLAS 
SCLRDVLQKSHGTKAWQSVSGAITEMFQNYLDLAHKSEVGSTEGARGAKQVLYILSTLKE 
CLALMSKKHIATLIEGFKVLMILRDPYITRPVIDSLNAVCLNPTSEVPVEALLEVLSLAA 
GLFSGHETSADAMTFTARLLKVGMTRSFTLNRDLCVVKLPSVFNGLNDIIASEHEEAIFA 
ATDALKSLIFSCIDESLIREGVNEIRNSNLNVRKPSPTVIEKLCATVESLLDYKYHAVWD 
MAFQVVSAMFDKLGEHSAYFMRNTLQGLSDMQDLPDEGFPYRKQLHECVGSALGAMGPET 
FLSIVRLNLEANDLSEVKVWLFPILKQYTVGGRLSFFTEAIFSMVETMSHKAQKLKLQGL 
PVASRSVDSLVYSLWALLPSFCNYPVDTVESFADLGRILCGVLQTQAETHGIICASLNIL 
IQQNKEVVEGKEVPTNDASPAMQRATARYDSQHAAANLKVLRLCAPKLLDVLSRIFHECS 
KDDGGSLQSAIGNLASIAEKKTVSKLLFKTLQELLEATKTAIAQDESPVSGMDVDNTADK 
NSSSNLRARLFDLLVSLLPGLDGQEVDTIFSSLKPAMQDSKGLIQKKAYKVLSVILKSSD 
GFVSKNLEELLVLMHNICHVSAKRHKLDCLYFLLAHASRTDDLKERKDIVSSFLPEVILA 
LKEVNKKTRNRAYDVLVQIGHAYADEENGGDNEKLHGYFDMVVGCLAGEKPQMISAAVKG 
VARLTYEFSDLISSAYNLLPSTFLLLQRKNKEITKANLGLLKVLVAKSPVEGLHANLKSM 
VEGLLKWPEGTKNLFKAKVRLLLEMLIKKCGTEAVKSVMPEEHMKLLTNIRKIKERKEKK 
YAAGSDISKSQHSKDTSSKVSRWNDTKIFSDVYADSEDSDGDDMDAESHGRSKASSLLKS 
KASALRSKKSRNQSHLEVDESDDEPLDLMDQHKTRLALRSSELRKRKADSDEEAEFDVEG 
RLVIREGERSKRKELSDADSDAKSSKGSRFSGNSSKKNQKRMKTSESGYAYTGKEYASKK 
ASGDLKKKDKLEPYAYWPLDRKMMSRRPEQRAVAVRGMSSVVKMAKKMEGKSAAEALATT 
KFKKFKRSGQKKSAGKKKNK*
>AT2G34970.1 |  eIF4-gamma/eIF5/eIF2-epsilon domain-containing protein 
MGAQKKGGAAARVSEDAEVQSRHRLQAILLADSFATKFRPVTLERPKVLLPIVNVPMIDY 
TLAWLESAGIEEVFVFCCAHSMQVIEYLEKSEWYSHPNLLVRTIESHKSISAGDALRYMY 
EQQTETSQIQGDFVLVSGDTVSNMPLADLIQEHRERKKKDEKAIMTMVIKQSKSSPLTHQ 
SRLGTDQLFIAVDPLTKQLLHYEEDKIDHPSGSVCLEKSLLDTNPSVLVCNDMQDCYIDI 
CSPEVLSLFEDNFDYQHLRRHFVKGVLVDDIMGYKIFTHEIHSSYAGRIDNFRSYDTVSK 
DIIQRWTYPYVPDINFSGNRPLKLGRQGIYKASDVVQSRSADVGASTVIGYGTKIGHGDK 
IMNSVIGNGCSIGSNVVIEGSYIWNNVTIEDGCEIRNAIVCDGVKIRAGAVLQPGVVLSF 
NVVVGRDFVVPAYSKVSLLQQPTTEDSDEELEYADSSSGTADHLSGLNLQMESKASELGP 
DGAGYIWEVCEGAHDEEWKHSVAPIPKDKLSEITQAIDDDDTDDESVVPTSGELKSDADS 
INTDVNDPNDDYYYFEKEVEGTVLRAVEENIKVDLVTMEINGLRLSFNMESADCAGATFF 
SMIKLALDTPHNSGSELYKNAASIITKWKDLLGFYAKKIDEQIEVIMKFEEMCQESHKEL 
GPLFTQILHLLYDKDVLQEDAILRWEEEKAGADEADKVYLKQCDTFIQWLKEASEEEDED 
DEDEEEEEDN*
>AT2G47420.1 |  dimethyladenosine transferase, putative
MAGGKIRKEKPKASNRAPSNHYQGGISFHKSKGQHILKNPLLVDSIVQKAGIKSTDVILE 
IGPGTGNLTKKLLEAGKEVIAVELDSRMVLELQRRFQGTPFSNRLKVIQGDVLKTELPRF 
DICVANIPYQISSPLTFKLLFHPTSFRCAVIMYQREFAMRLVAQPGDNLYCRLSVNTQLY 
ARVSHLLKVGKNNFRPPPKVDSSVVRIEPRRPGPQVNKKEWDGFLRVCFIRKNKTLGSIF 
KQKSVLSMLEKNFKTLQAVLASLQNNGEPALNTTSMDLGDQSMGMEDDDNEMDDDDMEMD 
EGEGDGGETSEFKEKVMNVLKEGGFEEKRSSKLSQQEFLYLLSLFNKSGIHFT*
>AT3G01160.1 |  unknown protein 
MGSKNKKQRKGESIEEAKGSSGVAEEGNEMIKDPRFSSAHTDPKFRRMRRRDSKVAIDSR 
FQPMFNDKRFATGSAPVDKRGKRRTGGTGKDSLREFYRIEDEGKQKTEEESGDESGSETE 
INDLKSEKSSHVESEEESESELKVASLDDESDEKADSEELSSQEEEEEEDDTDEDDEAMY 
EDEGPEIPEENIPLIQEETHRLAIVNMDWRHVSAKDLYVVLNSFLPKDGRILSVAVYPSE 
FGLERMKEEEIHGPVIDGDKKNDASDDEDEEEEEDEDVINQKLRAYEISRLKYYFAVAEC 
DSSATADYLYKSCDGIEFERSSNKLDLRFIPDSMEFKHPPRDIASEAPAGYEGLDFQSRA 
LQMSKVNLSWDEDEPHRIKTLNQKFNPEQLANLEMKEFLASDESDSDEEDDLGNEVINQS 
KKKDKKKDKYRALIEAEDVDSDKDLEEENDQDMEVTFNTGLEDLSKEILKKKDNQSESVW 
ETYLRQRREKKRARKNKQKDDDSSPDDDDDYNIDRKAVKDDGDDDFFMEEPPLKKKKKEG 
KTKKEEVAAEEKSRAELELLLADENAGDGNGLKGYNIKRKAKKGKTDISEDKIPAAELDD 
PRFSALFSSPYYALDPTDPQFKRSATYARQLALKQKEDPKGHEDVKAPKEKQELNSDGNL 
GSKKERHELTSTVKSLKMKMMNKDSEKKKAGNPASSSTLAQRIKKKAKDLSKK*
>AT3G09720.1 |  DEAD/DEAH box helicase, putative
MEKSSYFLFGGTNFNKKKFAPDFAKFKNSTEDDDSNKKVNFFVEEEEDTEQPEAEKVIVS 
SKKRKRRSSNSVPVEGFDVFKSSKKARAKGKAEEQITKNEIVENPKKELNRQMERDALSR 
KQYSIHVSGNNIPPPLKSFAELSSRYGCEGYILRNLAELGFKEPTPIQRQAIPILLSGRE 
CFACAPTGSGKTFAFICPMLIKLKRPSTDGIRAVILSPARELAAQTAREGKKLIKGSNFH 
IRLMTKPLVKTADFSKLWCDVLISTPMRLKRAIKAKKIDLSKVEYLVLDESDKLFEQSLL 
KQIDCVVKACSNPSIIRSLFSATLPDSVEELARSIMHDAVRVIIGRKNTASETVKQKLVF 
AGSEEGKLLALRQSFAESLNPPVLIFVQSKERAKELYDELKCENIRAGVIHSDLPPGERE 
NAVDQFRAGEKWVLIATDVIARGMDFKGINCVINYDFPDSASAYIHRIGRSGRAGRSGEA 
ITFYTEQDVPFLRNIANTMMSSGCEVPSWIMSLKKKKWRKHRPRRDSISTKPKADKNDTD 
E*
>AT3G10530.1 |  transducin family protein / WD-40 repeat family protein 
MEISSEDNNLMEKVLPPVEQESDVELETKVKKYLRGEGANLETLKDKKLKTQLASREKLY 
GKSAKAAAKIEKWLLPAEAGYLETEGLEKTWRVKQTDIANEVDILSSRNQYDIVLPDFGP 
YKLDFTASGRHMLAGGRKGHLALLDMMNMSLIKEIQVRETVRDVAFLHNDQFFAAAQKKY 
AYIYGRDGTELHCLKERGPVARLRFLKNHFLLASVNMSGQLHYQDVTHGGMVASIRTGKG 
RTDVMEVNPYNSVVGLGHSGGTVTMWKPTSQAPLVQMQCHPGPVSSVAFHPNGHLMATSG 
KERKIKIWDLRKFEEVQTIHSFHAKTLSFSQKGLLAAGTGSFVQILGDSSGGSSHNYTRY 
MNHSMVKGYQIEKVMFRPYEDVIGIGHSMGWSSILIPGSGEPNFDSWVANPFETSKQRRE 
KEVHSLLDKLPPETIMLDPSKIGAMRPSRRKEKPSRGEIEAEKEVAIEAAKSTELKNKTK 
GRNKPSKRTKKKKEMVENAKRTFPEQEHNTAIKKRRIVEDAAAELPTSLKRFARKN*
>AT4G04940.1 |  transducin family protein / WD-40 repeat family protein 
MGIFEPFRAIGYITSTVPFSVQRLGTETFVTVSVGKAFQIYNCAKLNLVIISPQLPKKIR 
ALASYRDYTFVAFGNEIAVFRRAHQVATWSKHVAKVDLLLVFGEHVLSLDVEGNMFIWAF 
KGIEEHLAPIGNLQLTGKFTPTSIVHPDTYLNKVLVGSQEGPLQLWNINTKKMLYQFKGW 
GSSVTSCVSSPALDVVAIGCADGKIHVHNIKLDEEIVTFEHASRGAVTALSFSTDGRPLL 
ASGGSFGVISIWNLNKKRLQSVIRDAHDSSIISLNFLANEPVLMSASADNSLKMWIFDTN 
DGDPRLLRFRSGHSAPPLCIRFYSNGRHILSAGQDRAFRLFSVIQEQQSRELSQRHISRR 
AKKLRLKEEELKLKPVVSFDCAEIRERDWCNVVTCHMDTAEAYVWRLQNFVLGEHILKPC 
PENPTPIKACAISACGNFAVVGTAGGWIERFNLQSGISRGSYFDMSEKRRYAHDGEVIGV 
ACDSTNTLMISAGYHGDLKVWDFKKRELKSQWDVGCSLVKIVYHRVNGLLATVADDFVIR 
LYDVVTLKMVREFRGHTDRITDLCFSEDGKWVISSSMDGSLRIWDVILAKQIDGVHVDVP 
ITALSLSPNMDVLATAHSDQNGVYLWVNQSMFSGLPSVESYASGKDVVNVKLPSVSALTS 
SEADDDMDRQVLENSEALQASSFSISQKQIPELVTLSLLPKSQWQSLINLDIIKARNKPI 
EPPKKPEKAPFFLPSIPSLSGDILFKANDSEADGENEENNKKDQNSMKNFDALESPFSKH 
LKSSWDSKHFLDFTNYMKSLSPSALDMELRMLEIIDEDVEEELIKRPEFILIGQLLDYFI 
NEVSCKNDFEFMQAVVKLFLKIHGETIRCHPSLQEKAKKLLETQSLVWQKMEKLFQSTRC 
IVTFLSNSQF*
>AT4G07410.1 |  transducin family protein / WD-40 repeat family protein 
MLEYRCSSVDWKPSPVVALANSSDDSQVAAAREDGSLEIWLVSPGAVGWHCQLTIHGDPN 
SRISSLAWCCSPSIGLPSGRLFSSSIDGSISEWDLFDLKQKIVLESIGISIWQMALAPIS 
GFSSDVEGIKNGYLSEKSNDEEEIGSEEDGSDSDEFHEKSEEEIDRILAAACDDGCVRLY 
RISNLEKLTYYRSLPRVSGRALSVTWSPDAKRIFSGSSDGLIRCWDATSCHEVYRITAGL 
GGLGSSSEICVWSLLSLRCSVLVSGDSTGTVQFWDSEHGTLLEAHSNHKGDVNTLAAAPS 
HNRVFSAGADGQVILYKLSGSTNGSQDLKPSSSQKWDYIGYVKAHTHDIRALTVAVPISR 
EDPFPDDILPDKASRKHRKKGKPVDFTYHKWAHLGVPMLISAGDDAKLFAYSIQEFTKFS 
PHDICPAPQRIPMQMVHNSMFNKTSLLLVQGISTLDILRLNISSDSSGRASTKSLVRVKS 
RDARKIICSAISNTGSHFAYSDQIGPSLFELKKNEFTKCPWSVSRRRLPELPFAHSMIFS 
SDCSRLIIAGHDRRIYTIDISSLELVYAFTPSREEHEGEAPTPKEPPITKLFTSSDGQWL 
AAINCFGDIYVFNLETQRQHWFISRLDGASVTAAGFHPWNNNALVISTSSNQVFAFDVEA 
RQLGKWSLLNTYVLPKRYQEFPGEVLGLSFSPSPNSSSVIVYSSRAKCLIDFGKPVEEDE 
EYDLPNGNLSKTLEGKLVNLGLKKGKGTNRKRRLDEYQLEGKSNERKNFEILPSNHPVLF 
VGHLSKNSILVIEKPWMDVVKSLDNQPVDRHIFGT*
>AT4G07410.2 |  transducin family protein / WD-40 repeat family protein 
MALAPISGFSSDVEGIKNGYLSEKSNDEEEIGSEEDGSDSDEFHEKSEEEIDRILAAACD 
DGCVRLYRISNLEKLTYYRSLPRVSGRALSVTWSPDAKRIFSGSSDGLIRCWDATSCHEV 
YRITAGLGGLGSSSEICVWSLLSLRCSVLVSGDSTGTVQFWDSEHGTLLEAHSNHKGDVN 
TLAAAPSHNRVFSAGADGQVILYKLSGSTNGSQDLKPSSSQKWDYIGYVKAHTHDIRALT 
VAVPISREDPFPDDILPDKASRKHRKKGKPVDFTYHKWAHLGVPMLISAGDDAKLFAYSI 
QEFTKFSPHDICPAPQRIPMQMVHNSMFNKTSLLLVQGISTLDILRLNISSDSSGRASTK 
SLVRVKSRDARKIICSAISNTGSHFAYSDQIGPSLFELKKNEFTKCPWSVSRRRLPELPF 
AHSMIFSSDCSRLIIAGHDRRIYTIDISSLELVYAFTPSREEHEGEAPTPKEPPITKLFT 
SSDGQWLAAINCFGDIYVFNLETQRQHWFISRLDGASVTAAGFHPWNNNALVISTSSNQV 
FAFDVEARQLGKWSLLNTYVLPKRYQEFPGEVLGLSFSPSPNSSSVIVYSSRAKCLIDFG 
KPVEEDEEYDLPNGNLSKTLEGKLVNLGLKKGKGTNRKRRLDEYQLEGKSNERKNFEILP 
SNHPVLFVGHLSKNSILVIEKPWMDVVKSLDNQPVDRHIFGT*
>AT4G28200.1 |  FUNCTIONS IN: molecular_function unknown; INVOLVED IN: RNA processing; LOCATED IN: intracellular; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: RNA-processing protein, HAT helix (InterPro:IPR003107), U3 small nucleolar RNA-associated protein 6 (InterPro:IPR013949); Has 352 Blast hits to 342 proteins in 144 species: Archaea - 0; Bacteria - 0; Metazoa - 111; Fungi - 130; Plants - 24; Viruses - 0; Other Eukaryotes - 87 (source: NCBI BLink)
MADVVQYRLERMVDELDDLERREIFTRAEIAEIVKQRRKFEYRLKRPSPLKEDFIAYIDY 
EVKLDELRQLRRKSVARVTKKRKKKSVSDFAGVARIVEIYRLATMRYKGDINLWFRYLEF 
CKQKRHGRMKKALAQAIRFHPKVAGVWIYAASWEFDRNLNVTAARALMLNGLRVCSNSED 
LWVEYLRMELTFLNKLKARKVALGEDKGSLVRDTKTVEDEQWKDENKELFMSLDEKEGNE 
KEENDEDSIVEDVEDVTEKVDFLKEKGSNVLQTIYSGAVEAIPSSFDLRKRFLEILEATD 
LAHSDEMRNTILSDLKRDFCNEPEYWNWLARHEMSGCISNEAGLEFANPQMQKAIQVFEE 
GLQTVTSSSMFEIYINFLMEAIVQSNGDENEISSLSNPIISHIINVYQKADETGCLTEEL 
ADEYVSLYLKLEKTHEAQKLAEKLCSEKFAGSAKLWLSRVSIEIRSLSENSSPSKADFQT 
VFELLSNALRKVPISESESLWLMAFNFFAHQRTYLDKLVEMSILSATKSHGSDHVFSLAS 
TVVKFVLETKGAHSARKIYKRFLALPGPSLVLYKGCIEIETNLISVGDKDGLSNARKLYD 
SAVASYGQDVELWKNYYSLETKLGTSETANGVYWRARKTLNESADFIV*
>AT4G28450.1 |  nucleotide binding / protein binding 
MKIKTLSRSVDEYTRERSQDLQRVFHNFDPSLRPMEKAVEYQRALTAAKLEKIFARPFVG 
AMDGHRDGVSCMAKNPNYLKGIFSASMDGDIRLWDISSRRTVCQFPGHQGAVRGLTASTD 
GNVLVSCGTDCTVRLWNVPRPSLEDSSISSENFIEPSATYVWKNAFWAVDHQFEGDLFAT 
AGAQLDIWNHNRSQPVQSFQWGTDSVISVRFNPGEPNLLATSASDRSITIYDLRLSSAAR 
KIIMMTKTNSIAWNPMEPMNLTAANEDGSCYSFDGRKLDEAKCVHKDHVSAVMDIDFSPT 
GREFVTGSYDRSVRIFPYNGGHSREIYHTKRMQRVFCVKYSCDATYVISGSDDTNLRLWK 
AKASEQLGVILPREQKKHEYNEAVKNRYKHLSEVKRIVRHRHLPKPIYKAMGIIRTVNDS 
KRRKEARRKAHSAPGTVVTAPLRKRKIIKEVE*
>AT5G14050.1 |  transducin family protein / WD-40 repeat family protein 
MSLSQNAPKSKGIKREELKKQYEDVEDEEEIGSDDDLTRGKRRKTEKEKQKLEESELVEM 
KKLENLIFGSLYSPVTFGKEEEEDGSALFHVDRSAVRQIPDYEDDGDDDEELSDEENGQV 
VAIRKGEAAWEDEEEKQINVDIASVNRLRKLRKEENEGLISGSEYIARLRAHHAKLNPGT 
DWARPDSQIVDGESSDDDDTQDGGVDDILRTNEDLVVKSRGNKLCAGRLEYSKLVDANAA 
DPSNGPINSVHFHQNAQLLLTAGLDRRLRFFQIDGKRNTKIQSIFLEDCPIRKAAFLPNG 
SQVIVSGRRKFFYSFDLEKAKFDKIGPLVGREEKSLEYFEVSQDSNTIAFVGNEGYILLV 
STKTKELIGTLKMNGSVRSLAFSEDGKHLLSSGGDGQVYVWDLRTMKCLYKGVDEGSTCG 
TSLCSSLNGALFASGTDRGIVNIYKKSEFVGGKRKPIKTVDNLTSKIDFMKFNHDAQILA 
IVSTMNKNSVKLVHVPSLTVFSNWPPPNSTMHYPRCLDFSPGSGFMAMGNAAGKVLLYKL 
HHYQNA*
>AT5G16750.1 |  TOZ (TORMOZEMBRYO DEFECTIVE); nucleotide binding
MAPHSLKKNYRCSRSLKQFYGGGPFIVSSDGSFIACACGDVINIVDSTDSSVKSTIEGES 
DTLTALALSPDDKLLFSAGHSRQIRVWDLETLKCIRSWKGHEGPVMGMACHASGGLLATA 
GADRKVLVWDVDGGFCTHYFRGHKGVVSSILFHPDSNKNILISGSDDATVRVWDLNAKNT 
EKKCLAIMEKHFSAVTSIALSEDGLTLFSAGRDKVVNLWDLHDYSCKATVATYEVLEAVT 
TVSSGTPFASFVASLDQKKSKKKESDSQATYFITVGERGVVRIWKSEGSICLYEQKSSDI 
TVSSDDEESKRGFTAAAMLPSDHGLLCVTADQQFFFYSVVENVEETELVLSKRLVGYNEE 
IADMKFLGDEEQFLAVATNLEEVRVYDVATMSCSYVLAGHKEVVLSLDTCVSSSGNVLIV 
TGSKDKTVRLWNATSKSCIGVGTGHNGDILAVAFAKKSFSFFVSGSGDRTLKVWSLDGIS 
EDSEEPINLKTRSVVAAHDKDINSVAVARNDSLVCTGSEDRTASIWRLPDLVHVVTLKGH 
KRRIFSVEFSTVDQCVMTASGDKTVKIWAISDGSCLKTFEGHTSSVLRASFITDGTQFVS 
CGADGLLKLWNVNTSECIATYDQHEDKVWALAVGKKTEMIATGGGDAVINLWHDSTASDK 
EDDFRKEEEAILRGQELENAVLDAEYTKAIRLAFELCRPHKVFELFSGLCRKRDSDEQIV 
KALQGLEKEEFRLLFEYVREWNTKPKLCHIAQFVLYKTFNILPPTEIVQVKGIGELLEGL 
IPYSQRHFSRIDRFVRSSFLLDYTLGEMSVIDPETVETEYPKDEKKKEKDVIAAMEQDTD 
ELKQETPSRKRKSQKSKGKSNKKRLIAEAQGSVIAV*
>AT5G30495.2 |  FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; CONTAINS InterPro DOMAIN/s: Fcf2 pre-rRNA processing (InterPro:IPR014810); BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G54770.1); Has 218 Blast hits to 218 proteins in 114 species: Archaea - 0; Bacteria - 0; Metazoa - 88; Fungi - 76; Plants - 26; Viruses - 0; Other Eukaryotes - 28 (source: NCBI BLink)
MAETKPLIGLTWEPKLPGLSLDTKTCSTSSKRVESHESSSLWMSKSELVDGLCLPPNDPK 
KINKMIRKQIKDTTGSNWFDMPAPTMTPELKRDLQLLKLRTVMDPAVHYKKSVSRSKLAE 
KYFQIGTVIEPAEEFYGRLTKKNRKATLADELVSDPKVSQYRKRKVKEIEEKSRAVTNKK 
WKKKGNQTTNKKQRRN*
>AT5G30495.1 |  FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; CONTAINS InterPro DOMAIN/s: Fcf2 pre-rRNA processing (InterPro:IPR014810); BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G54770.1); Has 218 Blast hits to 218 proteins in 114 species: Archaea - 0; Bacteria - 0; Metazoa - 88; Fungi - 76; Plants - 26; Viruses - 0; Other Eukaryotes - 28 (source: NCBI BLink)
MAETKPLIGLTWEPKLPGLSLDTKTCSTSSKRVESHESSSLWMSKSELVDGLCLPPNDPK 
KINKMIRKQIKDTTGSNWFDMPAPTMTPELKRDLQLLKLRTVMDPAVHYKKSVSRSKLAE 
KYFQIGTVIEPAEEFYGRLTKKNRKATLADELVSDPKVSQYRKRKVKEIEEKSRAVTNKK 
WKKKGNQTTNKKQRRN*
>AT5G61330.1 |  rRNA processing protein-related 
MAGGSKRSKRARLDSESEDISDQENLKAESDNEDDQLPDGIEDDEVDSMEDDEGESEEDD 
EGDTEEDDEGDSEEDDEGENKEDEDGESEDFEDGNDKESESGDEGNDDNKDAQMEELEKE 
VKELRSQEQDILKNLKRDKGEDAVKGQAVKNQKALWDKILEFRFLLQKAFDRSNRLPQEP 
VKSLFCSEDEDVSTAYTDLVTSSKKTLDSLLELQEALFEKNPSVDQQVNATASEESNKSD 
AEDSDEWQRISDLQKRMSVFRNKAVDKWQRKTQVTTGAAAIKGKLHAFNQNVSEQVASYM 
RDPSRMIKQMQQSRSTVAVFGTVPQEAMEPNPEEKQEEGDPELVEDAEFYRQLLKEFLET 
IDPASSEAAFYEMKKFQTKKRKVVDRRASKSRKIRYNVHEKIVNFMAPRPAKIPPNTADL 
LKNLFGLKTRNVQSEA*
>AT5G65900.1 |  DEAD/DEAH box helicase, putative
MANLDMEQHSSENEEIKKKKHKKRARDEAKKLKQPAMEEEPDHEDGDAKENNALIDEEPK 
KKKKKKNKKRGDTDDGEDEAVAEEEPKKKKKKNKKLQQRGDTNDEEDEVIAEEEEPKKKK 
KKQRKDTEAKSEEEEVEDKEEEKKLEETSIMTNKTFESLSLSDNTYKSIKEMGFARMTQI 
QAKAIPPLMMGEDVLGAARTGSGKTLAFLIPAVELLYRVKFTPRNGTGVLVICPTRELAI 
QSYGVAKELLKYHSQTVGKVIGGEKRKTEAEILAKGVNLLVATPGRLLDHLENTNGFIFK 
NLKFLVMDEADRILEQNFEEDLKKILNLLPKTRQTSLFSATQSAKVEDLARVSLTSPVYI 
DVDEGRKEVTNEGLEQGYCVVPSAMRLLFLLTFLKRFQGKKKIMVFFSTCKSTKFHAELF 
RYIKFDCLEIRGGIDQNKRTPTFLQFIKAETGILLCTNVAARGLDFPHVDWIVQYDPPDN 
PTDYIHRVGRTARGEGAKGKALLVLTPQELKFIQYLKAAKIPVEEHEFEEKKLLDVKPFV 
ENLISENYALKESAKEAYKTYISGYDSHSMKDVFNVHQLNLTEVATSFGFSDPPKVALKI 
DRGGYRSKREPVNKFKRGRGGGRPGGKSKFERY*
>AT4G12600.1 |  ribosomal protein L7Ae/L30e/S12e/Gadd45 family protein 
MTVEGVNPKAYPLADSQLSITILDLVQQATNYKQLKKGANEATKTLNRGISEFIVMAADT 
EPLEILLHLPLLAEDKNVPYVFVPSKQALGRACDVTRPVIACSVTSNEASQLKSQIQHLK 
DAIEKLLI*