>AT3G16480.1 | MPPalpha (mitochondrial processing peptidase alpha subunit) catalytic/ metal ion binding / metalloendopeptidase/ zinc ion binding
MYRTAASRAKALKGILNHNFRASRYASSSAVATSSSSSSWLSGGYSSSLPSMNIPLAGVS
LPPPLSDHVEPSKLKTTTLPNGLTIATEMSPNPAASIGLYVDCGSIYETPQFRGATHLLE
RMAFKSTLNRSHFRLVREIEAIGGNTSASASREQMGYTIDALKTYVPEMVEVLIDSVRNP
AFLDWEVNEELRKVKVEIGEFATNPMGFLLEAVHSAGYSGALANPLYAPESAITGLTGEV
LENFVFENYTASRMVLAASGVDHEELLKVVEPLLSDLPNVPRPAEPKSQYVGGDFRQHTG
GEATHFALAFEVPGWNNEKEAIIATVLQMLMGGGGSFSAGGPGKGMHSWLYLRLLNQHQQ
FQSCTAFTSVFNNTGLFGIYGCTSPEFASQGIELVASEMNAVADGKVNQKHLDRAKAATK
SAILMNLESRMIAAEDIGRQILTYGERKPVDQFLKTVDQLTLKDIADFTSKVITKPLTMA
TFGDVLNVPSYDSVSKRFR*
>AT1G72320.2 | APUM23 (Arabidopsis Pumilio 23) RNA binding / binding
MGERGKSSNNHSERNKGMRRKDHKGNRGFDVDSSKKNQSGGAPNVKPASKKHSEFEHQNQ
FVRKEIDPETSKYFSEIANLFDSNEVELEERSVICGNALEETRGREYEIATDYIISHVLQ
TLLEGCELDQLCSFIRNSASVFPAIAMDRSGSHVAESALKSLATHLENPDAYSVIEEALH
SICKVIVDNPLDMMCNCYGSHVLRRLLCLCKGVSLDSPELYGAKSSKALAKRLNLKMSQL
DDNNLEIPHQGFPGMLTYLLSGLLSCSREDMKYLQVDQYSSLVLQTALRLMLKQDEQLLE
IIPLILRCNSTNKKVEGFHIETNVAKEILESMKDNSFSHLVEVILEVAPESLYNEMFNKV
FKNSLFELSVDRCANFVIQALISHARDQEQMGIMWEELAPRFKDLLEQGKSGVVASLIAV
SQRLQSHENKCCEALVGAVCSTNESRISILPRLLFLDYYFGCRDKSTWEWAPGAKMHVMG
CLILQGIFKFSSDHIQPYITSLTSMKAEYITETAKDSSGARVIEAFLASDAATKQKRRLI
IKLRGHFGELSLHTSGSFTVEKCFDACNLTLREAIASELLDVKVDLSKTKQGPYLLRKLD
IDGYASRPDQWKSRQEAKQSTYNEFCSAFGSNKSNFPKNTFVSDASEDAAQEIEVKNTRK
EIDHHPTSGFKRHREKHAKDKDEPFAGEKRSKQKKNKTSEATDKPKLAGSKRPFLSGEMT
GKNRHSNKMRI*
>AT1G72320.2 | APUM23 (Arabidopsis Pumilio 23) RNA binding / binding
MGERGKSSNNHSERNKGMRRKDHKGNRGFDVDSSKKNQSGGAPNVKPASKKHSEFEHQNQ
FVRKEIDPETSKYFSEIANLFDSNEVELEERSVICGNALEETRGREYEIATDYIISHVLQ
TLLEGCELDQLCSFIRNSASVFPAIAMDRSGSHVAESALKSLATHLENPDAYSVIEEALH
SICKVIVDNPLDMMCNCYGSHVLRRLLCLCKGVSLDSPELYGAKSSKALAKRLNLKMSQL
DDNNLEIPHQGFPGMLTYLLSGLLSCSREDMKYLQVDQYSSLVLQTALRLMLKQDEQLLE
IIPLILRCNSTNKKVEGFHIETNVAKEILESMKDNSFSHLVEVILEVAPESLYNEMFNKV
FKNSLFELSVDRCANFVIQALISHARDQEQMGIMWEELAPRFKDLLEQGKSGVVASLIAV
SQRLQSHENKCCEALVGAVCSTNESRISILPRLLFLDYYFGCRDKSTWEWAPGAKMHVMG
CLILQGIFKFSSDHIQPYITSLTSMKAEYITETAKDSSGARVIEAFLASDAATKQKRRLI
IKLRGHFGELSLHTSGSFTVEKCFDACNLTLREAIASELLDVKVDLSKTKQGPYLLRKLD
IDGYASRPDQWKSRQEAKQSTYNEFCSAFGSNKSNFPKNTFVSDASEDAAQEIEVKNTRK
EIDHHPTSGFKRHREKHAKDKDEPFAGEKRSKQKKNKTSEATDKPKLAGSKRPFLSGEMT
GKNRHSNKMRI*
>AT1G72320.2 | APUM23 (Arabidopsis Pumilio 23) RNA binding / binding
MGERGKSSNNHSERNKGMRRKDHKGNRGFDVDSSKKNQSGGAPNVKPASKKHSEFEHQNQ
FVRKEIDPETSKYFSEIANLFDSNEVELEERSVICGNALEETRGREYEIATDYIISHVLQ
TLLEGCELDQLCSFIRNSASVFPAIAMDRSGSHVAESALKSLATHLENPDAYSVIEEALH
SICKVIVDNPLDMMCNCYGSHVLRRLLCLCKGVSLDSPELYGAKSSKALAKRLNLKMSQL
DDNNLEIPHQGFPGMLTYLLSGLLSCSREDMKYLQVDQYSSLVLQTALRLMLKQDEQLLE
IIPLILRCNSTNKKVEGFHIETNVAKEILESMKDNSFSHLVEVILEVAPESLYNEMFNKV
FKNSLFELSVDRCANFVIQALISHARDQEQMGIMWEELAPRFKDLLEQGKSGVVASLIAV
SQRLQSHENKCCEALVGAVCSTNESRISILPRLLFLDYYFGCRDKSTWEWAPGAKMHVMG
CLILQGIFKFSSDHIQPYITSLTSMKAEYITETAKDSSGARVIEAFLASDAATKQKRRLI
IKLRGHFGELSLHTSGSFTVEKCFDACNLTLREAIASELLDVKVDLSKTKQGPYLLRKLD
IDGYASRPDQWKSRQEAKQSTYNEFCSAFGSNKSNFPKNTFVSDASEDAAQEIEVKNTRK
EIDHHPTSGFKRHREKHAKDKDEPFAGEKRSKQKKNKTSEATDKPKLAGSKRPFLSGEMT
GKNRHSNKMRI*
>AT1G72320.1 | APUM23 (Arabidopsis Pumilio 23) RNA binding / binding
MVSVGSKSLPSRRHRTIEEDSLMGERGKSSNNHSERNKGMRRKDHKGNRGFDVDSSKKNQ
SGGAPNVKPASKKHSEFEHQNQFVRKEIDPETSKYFSEIANLFDSNEVELEERSVICGNA
LEETRGREYEIATDYIISHVLQTLLEGCELDQLCSFIRNSASVFPAIAMDRSGSHVAESA
LKSLATHLENPDAYSVIEEALHSICKVIVDNPLDMMCNCYGSHVLRRLLCLCKGVSLDSP
ELYGAKSSKALAKRLNLKMSQLDDNNLEIPHQGFPGMLTYLLSGLLSCSREDMKYLQVDQ
YSSLVLQTALRLMLKQDEQLLEIIPLILRCNSTNKKVEGFHIETNVAKEILESMKDNSFS
HLVEVILEVAPESLYNEMFNKVFKNSLFELSVDRCANFVIQALISHARDQEQMGIMWEEL
APRFKDLLEQGKSGVVASLIAVSQRLQSHENKCCEALVGAVCSTNESRISILPRLLFLDY
YFGCRDKSTWEWAPGAKMHVMGCLILQGIFKFSSDHIQPYITSLTSMKAEYITETAKDSS
GARVIEAFLASDAATKQKRRLIIKLRGHFGELSLHTSGSFTVEKCFDACNLTLREAIASE
LLDVKVDLSKTKQGPYLLRKLDIDGYASRPDQWKSRQEAKQSTYNEFCSAFGSNKSNFPK
NTFVSDASEDAAQEIEVKNTRKEIDHHPTSGFKRHREKHAKDKDEPFAGEKRSKQKKNKT
SEATDKPKLAGSKRPFLSGEMTGKNRHSNKMRI*
>AT1G72320.1 | APUM23 (Arabidopsis Pumilio 23) RNA binding / binding
MVSVGSKSLPSRRHRTIEEDSLMGERGKSSNNHSERNKGMRRKDHKGNRGFDVDSSKKNQ
SGGAPNVKPASKKHSEFEHQNQFVRKEIDPETSKYFSEIANLFDSNEVELEERSVICGNA
LEETRGREYEIATDYIISHVLQTLLEGCELDQLCSFIRNSASVFPAIAMDRSGSHVAESA
LKSLATHLENPDAYSVIEEALHSICKVIVDNPLDMMCNCYGSHVLRRLLCLCKGVSLDSP
ELYGAKSSKALAKRLNLKMSQLDDNNLEIPHQGFPGMLTYLLSGLLSCSREDMKYLQVDQ
YSSLVLQTALRLMLKQDEQLLEIIPLILRCNSTNKKVEGFHIETNVAKEILESMKDNSFS
HLVEVILEVAPESLYNEMFNKVFKNSLFELSVDRCANFVIQALISHARDQEQMGIMWEEL
APRFKDLLEQGKSGVVASLIAVSQRLQSHENKCCEALVGAVCSTNESRISILPRLLFLDY
YFGCRDKSTWEWAPGAKMHVMGCLILQGIFKFSSDHIQPYITSLTSMKAEYITETAKDSS
GARVIEAFLASDAATKQKRRLIIKLRGHFGELSLHTSGSFTVEKCFDACNLTLREAIASE
LLDVKVDLSKTKQGPYLLRKLDIDGYASRPDQWKSRQEAKQSTYNEFCSAFGSNKSNFPK
NTFVSDASEDAAQEIEVKNTRKEIDHHPTSGFKRHREKHAKDKDEPFAGEKRSKQKKNKT
SEATDKPKLAGSKRPFLSGEMTGKNRHSNKMRI*
>AT1G72320.1 | APUM23 (Arabidopsis Pumilio 23) RNA binding / binding
MVSVGSKSLPSRRHRTIEEDSLMGERGKSSNNHSERNKGMRRKDHKGNRGFDVDSSKKNQ
SGGAPNVKPASKKHSEFEHQNQFVRKEIDPETSKYFSEIANLFDSNEVELEERSVICGNA
LEETRGREYEIATDYIISHVLQTLLEGCELDQLCSFIRNSASVFPAIAMDRSGSHVAESA
LKSLATHLENPDAYSVIEEALHSICKVIVDNPLDMMCNCYGSHVLRRLLCLCKGVSLDSP
ELYGAKSSKALAKRLNLKMSQLDDNNLEIPHQGFPGMLTYLLSGLLSCSREDMKYLQVDQ
YSSLVLQTALRLMLKQDEQLLEIIPLILRCNSTNKKVEGFHIETNVAKEILESMKDNSFS
HLVEVILEVAPESLYNEMFNKVFKNSLFELSVDRCANFVIQALISHARDQEQMGIMWEEL
APRFKDLLEQGKSGVVASLIAVSQRLQSHENKCCEALVGAVCSTNESRISILPRLLFLDY
YFGCRDKSTWEWAPGAKMHVMGCLILQGIFKFSSDHIQPYITSLTSMKAEYITETAKDSS
GARVIEAFLASDAATKQKRRLIIKLRGHFGELSLHTSGSFTVEKCFDACNLTLREAIASE
LLDVKVDLSKTKQGPYLLRKLDIDGYASRPDQWKSRQEAKQSTYNEFCSAFGSNKSNFPK
NTFVSDASEDAAQEIEVKNTRKEIDHHPTSGFKRHREKHAKDKDEPFAGEKRSKQKKNKT
SEATDKPKLAGSKRPFLSGEMTGKNRHSNKMRI*
>AT1G72320.3 | APUM23 (Arabidopsis Pumilio 23) RNA binding / binding
MGERGKSSNNHSERNKGMRRKDHKGNRGFDVDSSKKNQSGGAPNVKPASKKHSEFEHQNQ
FVRKEIDPETSKYFSEIANLFDSNEVELEERSVICGNALEETRGREYEIATDYIISHVLQ
TLLEGCELDQLCSFIRNSASVFPAIAMDRSGSHVAESALKSLATHLENPDAYSVIEEALH
SICKVIVDNPLDMMCNCYGSHVLRRLLCLCKGVSLDSPELYGAKSSKALAKRLNLKMSQL
DDNNLEIPHQGFPGMLTYLLSGLLSCSREDMKYLQVDQYSSLVLQTALRLMLKQDEQLLE
IIPLILRCNSTNKKVEGFHIETNVAKEILESMKDNSFSHLVEVILEVAPESLYNEMFNKV
FKNSLFELSVDRCANFVIQALISHARDQEQMGIMWEELAPRFKDLLEQGKSGVVASLIAV
SQRLQSHENKCCEALVGAVCSTNESRISILPRLLFLDYYFGCRDKSTWEWAPGAKMHVMG
CLILQGIFKFSSDHIQPYITSLTSMKAEYITETAKDSSGARVIEAFLASDAATKQKRRLI
IKLRGHFGELSLHTSGSFTVEKCFDACNLTLREAIASELLDVKVDLSKTKQGPYLLRKLD
IDGYASRPDQWKSRQEAKQSTYNEFCSAFGSNKSNFPKNTFVSDASEDAAQEIEVKNTRK
EIDHHPTSGFKRHREKHAKDKDEPFAGEKRSKQKKNKTSEATDKPKLAGSKRPFLSGEMT
GKNRHSNKMRI*
>AT1G72320.3 | APUM23 (Arabidopsis Pumilio 23) RNA binding / binding
MGERGKSSNNHSERNKGMRRKDHKGNRGFDVDSSKKNQSGGAPNVKPASKKHSEFEHQNQ
FVRKEIDPETSKYFSEIANLFDSNEVELEERSVICGNALEETRGREYEIATDYIISHVLQ
TLLEGCELDQLCSFIRNSASVFPAIAMDRSGSHVAESALKSLATHLENPDAYSVIEEALH
SICKVIVDNPLDMMCNCYGSHVLRRLLCLCKGVSLDSPELYGAKSSKALAKRLNLKMSQL
DDNNLEIPHQGFPGMLTYLLSGLLSCSREDMKYLQVDQYSSLVLQTALRLMLKQDEQLLE
IIPLILRCNSTNKKVEGFHIETNVAKEILESMKDNSFSHLVEVILEVAPESLYNEMFNKV
FKNSLFELSVDRCANFVIQALISHARDQEQMGIMWEELAPRFKDLLEQGKSGVVASLIAV
SQRLQSHENKCCEALVGAVCSTNESRISILPRLLFLDYYFGCRDKSTWEWAPGAKMHVMG
CLILQGIFKFSSDHIQPYITSLTSMKAEYITETAKDSSGARVIEAFLASDAATKQKRRLI
IKLRGHFGELSLHTSGSFTVEKCFDACNLTLREAIASELLDVKVDLSKTKQGPYLLRKLD
IDGYASRPDQWKSRQEAKQSTYNEFCSAFGSNKSNFPKNTFVSDASEDAAQEIEVKNTRK
EIDHHPTSGFKRHREKHAKDKDEPFAGEKRSKQKKNKTSEATDKPKLAGSKRPFLSGEMT
GKNRHSNKMRI*
>AT1G72320.3 | APUM23 (Arabidopsis Pumilio 23) RNA binding / binding
MGERGKSSNNHSERNKGMRRKDHKGNRGFDVDSSKKNQSGGAPNVKPASKKHSEFEHQNQ
FVRKEIDPETSKYFSEIANLFDSNEVELEERSVICGNALEETRGREYEIATDYIISHVLQ
TLLEGCELDQLCSFIRNSASVFPAIAMDRSGSHVAESALKSLATHLENPDAYSVIEEALH
SICKVIVDNPLDMMCNCYGSHVLRRLLCLCKGVSLDSPELYGAKSSKALAKRLNLKMSQL
DDNNLEIPHQGFPGMLTYLLSGLLSCSREDMKYLQVDQYSSLVLQTALRLMLKQDEQLLE
IIPLILRCNSTNKKVEGFHIETNVAKEILESMKDNSFSHLVEVILEVAPESLYNEMFNKV
FKNSLFELSVDRCANFVIQALISHARDQEQMGIMWEELAPRFKDLLEQGKSGVVASLIAV
SQRLQSHENKCCEALVGAVCSTNESRISILPRLLFLDYYFGCRDKSTWEWAPGAKMHVMG
CLILQGIFKFSSDHIQPYITSLTSMKAEYITETAKDSSGARVIEAFLASDAATKQKRRLI
IKLRGHFGELSLHTSGSFTVEKCFDACNLTLREAIASELLDVKVDLSKTKQGPYLLRKLD
IDGYASRPDQWKSRQEAKQSTYNEFCSAFGSNKSNFPKNTFVSDASEDAAQEIEVKNTRK
EIDHHPTSGFKRHREKHAKDKDEPFAGEKRSKQKKNKTSEATDKPKLAGSKRPFLSGEMT
GKNRHSNKMRI*
>AT3G62870.1 | 60S ribosomal protein L7A (RPL7aB)
MAPKKGVKVASKKKPEKVTNPLFERRPKQFGIGGALPPKKDLSRYIKWPKSIRLQRQKRI
LKQRLKVPPALNQFTKTLDKNLATSLFKILLKYRPEDKAAKKERLLNKAQAEAEGKPAES
KKPIVVKYGLNHVTYLIEQNKAQLVVIAHDVDPIELVVWLPALCRKMEVPYCIVKGKSRL
GAVVHQKTAAALCLTTVKNEDKLEFSKILEAIKANFNDKYEEYRKKWGGGIMGSKSQAKT
KAKERVIAKEAAQRMN*
>AT3G22890.1 | APS1 (ATP SULFURYLASE 1) sulfate adenylyltransferase (ATP)
MASMAAVLSKTPFLSQPLTKSSPNSDLPFAAVSFPSKSLRRRVGSIRAGLIAPDGGKLVE
LIVEEPKRREKKHEAADLPRVELTAIDLQWMHVLSEGWASPLGGFMRESEFLQTLHFNSL
RLDDGSVVNMSVPIVLAIDDEQKARIGESTRVALFNSDGNPVAILSDIEIYKHPKEERIA
RTWGTTAPGLPYVDEAITNAGNWLIGGDLEVLEPVKYNDGLDRFRLSPAELRKELEKRNA
DAVFAFQLRNPVHNGHALLMTDTRRRLLEMGYKNPILLLHPLGGFTKADDVPLDWRMKQH
EKVLEDGVLDPETTVVSIFPSPMHYAGPTEVQWHAKARINAGANFYIVGRDPAGMGHPVE
KRDLYDADHGKKVLSMAPGLERLNILPFRVAAYDKTQGKMAFFDPSRPQDFLFISGTKMR
TLAKNNENPPDGFMCPGGWKVLVDYYESLTPAGNGRLPEVVPV*
>AT3G03920.1 | Gar1 RNA-binding region family protein
MRPPMRGGGGFRGRGGRDGGGGGRFGGGGGRFGGGGGRFGGGGGRFGGFRDEGPPSEVVE
VATFVHACEGDAVTKLSQEKIPHFNAPIYLENKTQIGKVDEIFGPINESLFSIKMMEGIV
ATSYSPGDKFFIDPYKLLPLARFLPQPKGQSTGGRGGAGRGRGDSRGRGRGGSFSRGRGA
PRGGRFPPRGGSRGSFRGRGRF*
>AT3G06530.1 | binding
MSSSIVSQLQALKSVLQADTEPSKRPFTRPSILFSPKEAADFDIESIYELGLKGLEVLGN
KDERFKNYMNDLFSHKSKEIDRELLGKEENARIDSSISSYLRLLSGYLQFRASLETLEYL
IRRYKIHIYNLEDVVLCALPYHDTHAFVRIVQLLSTGNSKWKFLDGVKNSGAPPPRSVIV
QQCIRDKQVLEALCDYASRTKKYQPSKPVVSFSTAVVVGVLGSVPTVDGDIVKTILPFVD
SGLQSGVKGCLDQQAGALMVVGMLANRAVLNTNLIKRLMRSIIDIGREHAKESSDPHSLR
LSLMALINFVQLQSVDLIPRKALDLFNEIRDISGVLLGLSKEFNIKRFLAVLLDSLLFYS
SSDDKCCEVLASIIETVPVSNLVDHLISKVFSLCMTQYQKNSDFRSSTSGSWAKKFLVVV
SKKYPAELRAAVPKFLEATEVQSKKEDLKLEMLSCMLDGNSDMSHPFVDSKLWFRLHHPR
AAVRCAALSSLNGVLKDDSSKAENLVTIQDAILRQLWDDDLAVVQAALSFDKLPNIITSS
GLLDALLHVVKRCVGILVSGVSHNVQLAVDVVALSLKIAVSSFGNQTDSTEKVTSAMFPF
LLIQPKTWNLNLLVLKLGKDVNWPLFKNLAADDGMKKLPDIMSTNLSSISMDIINDLGEA
LSLDPDERRIELIERACNYKLSEVLETCSNIKCSEQDRNKLQKGLLIRESVSALNIDVIN
KLVEAFMMHPADYIQWLTTEWEELEVEVDVSLKELSKSNCQELLYQLLDTSDFTALNSKV
LICLFWKLGESFIKLEPAHDASVLNKRLSSGLEDLFFFFATTRLRHVFKEHLHFRVREAK
VCPVLFLSRLISREDVPPLVQIESLRCFSYLCSSGNNEWLIQVFSSFPVLLVPMSSDNQD
VKAAAINCIEALFNLRAAIYGSSFDELLGMIVQQRRLILSDNKFFASYLTSLLSSTTNDL
LVPVGLQKRFDQSTKENILSVILLCAEDLPAYGKLRVLSLLKDLGIMLMRDEIVKLLSQL
LDKRSQYYYKLDKTSQPLSDTEVDLLCLLLECSMMRTSSFKGQSLDDHILSALNVDCMAS
ERPAVISPCLTILEKLSNRFYDELQTDVQIRFFHKLVSMFRSSNGSIQNGAKEAVLRLKL
SSSTVVLALDRITQQDTLVIGSLSKKKKQKKNSKSCPEEDINSEEFRSGEKALSFIASLL
DMLLLKKDLTHRESLIRPLFKLLQRSMSKEWVKIAFSIEETSLQPPQDVRETTPTFISSI
QQTLLLILKDIFDSLNMNPLKAEVANEINVKMLVELAHSSNDGVTRNHIFSLFTAIVKFV
PDKVLDHIISILTLVGESTVTQIDSHSKSIFEGFISMVIPFWLSKTKSEEQLLQIFVKVL
PDIVEHRRRSIVAYLLGVIGERNGLPALLVLLFKSLISRKDSAWLGNANVSESFASIVKK
EWEYSFAMEICEQYSSSTWLSSLVILLQTISKDSKQCFLQMRLVLEFVFQKLQDPEFAFA
VSLEPRNNVSVGIQQELQELMKCCICLLQAIDAKKEKDVTSSVRNEIRMRIHDVLMTVTG
AMDLSIYFRVVTSLLQQQTDYNGTKKVLGLISERAKDTSSSKMKHKRKISNQKGRNSWLN
LDEVAVDSFGKMCEEIVHLINATDDESGVPVKRAAISTLEVLAGRFPSGHPIFRKCLAAV
AECISSKNLGVSSSCLRTTGALINVLGPKALIELPCIMKNLVKQSLEVSFASQSGRNATA
EEQLLMLSVLVTLEAVIDKLGGFLNPHLGDIMKIMVLHPEYVSDFDKNLKSKANAIRRLL
TDKIPVRLTLQPLLRIYNEAVSSGNASLVIAFNMLEDLVVKMDRSSIVSSHGKIFDQCLV
ALDIRRLNPAAIQNIDDAERSVTSAMVALTKKLTESEFRPLFIRSIDWAESDVVDGSGSE
NKSIDRAISFYGLVDRLCESHRSIFVPYFKYVLDGIVAHLTTAEASVSTRKKKKAKIQQT
SDSIQPKSWHLRALVLSCLKNCFLHDTGSLKFLDTNNFQVLLKPIVSQLVVEPPSSLKEH
PHVPSVDEVDDLLVSCIGQMAVASGSDLLWKPLNHEVLMQTRSESVRSRMLSLRSVKQML
DNLKEEYLVLLAETIPFLAELLEDVELSVKSLAQDIIKQMEEMSGESLAEYL*
>AT3G11964.1 | RNA binding
MVVPQKKFANGKRNDSTKSFKPMKKPFKKTKDDVAARSEAMALQLEDVPDFPRGGGTSLS
KKEREKLYEEVDAEFDADERVSKKSKGGKSKKRIPSDLDDLGLLFGGGLHGKRPRYANKI
TTKNISPGMKLLGVVTEVNQKDIVISLPGGLRGLVRASEVSDFTDRGIEDDENELLGDIF
SVGQLVPCIVLELDDDKKEAGKRKIWLSLRLSLLHKGFSFDSFQLGMVFSANVKSIEDHG
SILHFGLPSITGFIEISDDGNQESGMKTGQLIQGVVTKIDRDRKIVHLSSDPDSVAKCLT
KDLSGMSFDLLIPGMMVNARVQSVLENGILFDFLTYFNGTVDLFHLKNPLSNKSWKDEYN
QNKTVNARILFIDPSSRAVGLTLSPHVVCNKAPPLHVFSGDIFDEAKVVRIDKSGLLLEL
PSKPTPTPAYVSFKEGNHIRVRVLGLKQMEGLAVGTLKESAFEGPVFTHSDVKPGMVTKA
KVISVDTFGAIVQFSGGLKAMCPLRHMSEFEVTKPRKKFKVGAELVFRVLGCKSKRITVT
YKKTLVKSKLPILSSYTDATEGLVTHGWITKIEKHGCFVRFYNGVQGFVPRFELGLEPGS
DPDSVFHVGEVVKCRVTSAVHGTQRITLNDSIKLGSIVSGIIDTITSQAVIVRVKSKSVV
KGTISAEHLADHHEQAKLIMSLLRPGYELDKLLVLDIEGNNMALSSKYSLIKLAEELPSD
FNQLQPNSVVHGYVCNLIENGCFVRFLGRLTGFAPRSKAIDDPKADVSESFFVGQSVRAN
IVDVNQEKSRITLSLKQSSCASVDASFVQEYFLMDEKISDLQSSDITKSDCSWVEKFSIG
SLIKGTIQEQNDLGVVVNFDNINNVLGFIPQHHMGGATLVPGSVVNAVVLDISRAERLVD
LSLRPELLNNLTKEVSNSSKKKRKRGISKELEVHQRVSAVVEIVKEQHLVLSIPEHGYTI
GYASVSDYNTQKLPVKQFSTGQSVVASVKAVQNPLTSGRLLLLLDSVSGTSETSRSKRAK
KKSSCEVGSVVHAEITEIKPFELRVNFGNSFRGRIHITEVLVNDASTSDEPFAKFRVGQS
ISARVVAKPCHTDIKKTQLWELSVKPAMLKDSSEFNDTQESEQLEFAAGQCVIGYVYKVD
KEWVWLAVSRNVTARIFILDTSCKAHELEEFERRFPIGKAVSGYVLTYNKEKKTLRLVQR
PLLFIHKSIANGGGSKTDKPDSSIPGDDDTLFIHEGDILGGRISKILPGVGGLRVQLGPY
VFGRVHFTEINDSWVPDPLDGFREGQFVKCKVLEISSSSKGTWQIELSLRTSLDGMSSAD
HLSEDLKNNDNVCKRFERIEDLSPDMGVQGYVKNTMSKGCFIILSRTVEAKVRLSNLCDT
FVKEPEKEFPVGKLVTGRVLNVEPLSKRIEVTLKTVNAGGRPKSESYDLKKLHVGDMISG
RIRRVEPFGLFIDIDQTGMVGLCHISQLSDDRMENVQARYKAGESVRAKILKLDEEKKRI
SLGMKSSYLMNGDDDKAQPLSEDNTSMECDPINDPKSEVLAAVDDFGFQETSGGTSLVLA
QVESRASIPPLEVDLDDIEETDFDSSQNQEKLLGANKDEKSKRREKQKDKEEREKKIQAA
EGRLLEHHAPENADEFEKLVRSSPNSSFVWIKYMAFMLSLADIEKARSIAERALRTINIR
EEEEKLNIWVAYFNLENEHGNPPEESVKKVFERARQYCDPKKVYLALLGVYERTEQYKLA
DKLLDEMIKKFKQSCKIWLRKIQSSLKQNEEAIQSVVNRALLCLPRHKHIKFISQTAILE
FKCGVADRGRSLFEGVLREYPKRTDLWSVYLDQEIRLGEDDVIRSLFERAISLSLPPKKM
KFLFKKFLEYEKSVGDEERVEYVKQRAMEYANSTLA*
>AT2G37790.1 | aldo/keto reductase family protein
MAEEIRFFELNTGAKIPSVGLGTWQADPGLVGNAVDAAVKIGYRHIDCAQIYGNEKEIGL
VLKKLFDGGVVKREEMFITSKLWCTYHDPQEVPEALNRTLQDLQLDYVDLYLIHWPVSLK
KGSTGFKPENILPTDIPSTWKAMESLFDSGKARAIGVSNFSSKKLADLLVVARVPPAVNQ
VECHPSWQQNVLRDFCKSKGVHLSGYSPLGSPGTTWLTSDVLKNPILGGVAEKLGKTPAQ
VALRWGLQMGQSVLPKSTHEDRIKQNFDVFNWSIPEDMLSKFSEIGQGRLVRGMSFVHET
SPYKSLEELWDGEI*
>AT2G40360.1 | transducin family protein / WD-40 repeat family protein
MTKRSKGANEDKLIETKSKNVSGKSQKQKKPVEAESLKEEDLLQASGTDSDYDGDSLPGS
LNSDDFDSDFSDSEDDGTHEGTEDGDVEFSDDDDVLEHDGSIDNEDDDGSEHVGSDNNEE
HGSDEDSERGEAVEESDSSEDEVPSRNTVGNVPLKWYEDEKHIGYDLTGKKITKKEKQDK
LDSFLATIDDSKTWRKIYDEYNDEDVELTKEESKIVQRILKGEAPHADFDPYAPYVEWFK
HDDAIHPLSSAPEPKRRFIPSKWEAKKVVKIVRAIRKGWIKFDKPEEEPNVYLLWGDDST
SDQKSKHLTYIPPPKLKLPGHDESYNPSLEYIPTEEEKASYELMFEEDRPKFIPTRFTSL
RSIPAYENALKESFERCLDLYLCPRVRKKRINIDPESLKPKLPSRKDLRPYPNSCYLEYK
GHTGAVTSISTDSSGEWIASGSTDGSVRMWEVETGRCLKVWQFDEAIMCVAWNPLSRLPV
LAVAMGRDLFFLNTELGTDEEQEITKERLHSGNIPEPDASVAAIVTWLPDELYGGIKIRH
FKSISSIDWHRKGDYLSTVMASGETRGVVLHQLSKQKTQRLPFKIRGLPVCTLFHPSLSY
FFVATRKDVRVYNLLKPGEATKKLETGLREISSMAIHPGGDNLIVGSKEGKMCWFDMDLS
SKPYKTLKNHPKDITNVAVHRSYPLFASCSEDSTAYVFHGMVYNDLNQNPLIVPLEILRG
HSSKGGVLDCKFHPRQPWLFTAGADSIIKLYCH*
>AT1G63810.1 | FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 21 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s Nrap protein (InterProIPR005554) Has 268 Blast hits to 263 proteins in 124 species Archae - 3 Bacteria - 0 Metazoa - 116 Fungi - 89 Plants - 17 Viruses - 0 Other Eukaryotes - 43 (source NCBI BLink)
MEADTKTDSRTLKVNDLLKDARLDYDSLRKLVDDTVSSIKEAIDGIPEKFQVTSELAPSF
VEDIGADKEVEFSFKKPNGFNLCGSYSICGMAKPDTSVDLLVHLPKECFYEKDYMNHRYH
AKRCLYLCVIEKHLLSSSSIEKVVWSTLHNEARKPVLVVFPAKKLDQFPGFSIRLIPSAT
SLFSVAKLSISRNNVRSVTADGVPEPTPTYNSSILEDMFLEENSEFLKKTFSEWKELSDA
LILLKIWARQRSSIYVHDCLNGFLISVILSYLATHSKINKALSALDIFRVTLDFIATSKL
WERGLYLPPQSEIRVSKEEKMQFRELFPVVICDSSTFVNLAFRMTSVGFLELQDEASLTL
KCMEKLRDGGFEEIFMTKIDYPVKYDHCIRLQLKGKTAVSLSGFCLDKECWRLYEQKVHS
LLLEGLGDRAKSIRVVWRNTNQDWHVESGLSVLDREPLFIGISVSSTEKAYRTVDIGPDA
ENKIEALRFRKFWGEKSDLRRFKDGRISESTVWETQQWTKHLIMKQIVEYILKRHLSLTS
DDIVQLVDQLDFSLNYGGKDPISLSGNLVQAYEVLSKCLREIEGIPLKVSSVQSLDSALR
FTSVFPPEPHPVACEKIDSRRLQKLIPSCIPAMEVMIQLEGSGNWPMDDLAVEKTKSAFL
LKIAESLQNVKGIPCTATEDNVDVFIGGYAFRLRILHERGLSLVKREIGVDPVKHVSSTD
KMLFIRSQHASMINGLQGRFPVYAPVARLAKRWVSAHLFSGCLAEEAIELLVAYLFLTPL
PLGVPSSRINGFLRFLRLLADYEWMFYPLIVDINNDFGRNDEKEINDNFMSSRKGYEEDK
QNISSAMFLAAPYDKASEAWTSTSPNLLEQKRLVAYARSSANVLSKMVLQEHNDSVQWEC
LFRTPLNNYDAVILLHRDKLPYPRRLLFPSELNQGKHVARGKASRLFNPFMSPGDLKRSH
EELKNKLMVDFEPTKCLLSGLQEEFGTLKPWYDHIGGDAIGLTWNKHNSKKRERDEEEEE
EEESNPMEMLKAVGEMGKGLVRDIYLLKPPRFV*
>AT5G04600.1 | RNA recognition motif (RRM)-containing protein
MGAKAKKALKKNMKKVAASASSSQLPLPQNPKPSADFLPLEGGPARKAPVTTPPLQNKAT
VLYIGRIPHGFYETEIEAFFSQFGTVKRVRVARNKKTGKSKHFGFIQFEDPEVAEIAAGA
MNDYLLMEHMLKVHVIEPENVKPNLWRGFKCNFKPVDSVQIERRQLNKERTLEEHRKMLQ
KIVKKDQKRRKRIEAAGIEYECPELVGNTQPVPKRIKFSEED*
>AT4G25630.1 | FIB2 (FIBRILLARIN 2) snoRNA binding
MRPPLTGSGGGFSGGRGRGGYSGGRGDGGFSGGRGGGGRGGGRGFSDRGGRGRGRGPPRG
GARGGRGPAGRGGMKGGSKVIVEPHRHAGVFIAKGKEDALVTKNLVPGEAVYNEKRISVQ
NEDGTKTEYRVWNPFRSKLAAAILGGVDNIWIKPGAKVLYLGAASGTTVSHVSDLVGPEG
CVYAVEFSHRSGRDLVNMAKKRTNVIPIIEDARHPAKYRMLVGMVDVIFSDVAQPDQARI
LALNASYFLKSGGHFVISIKANCIDSTVPAEAVFQTEVKKLQQEQFKPAEQVTLEPFERD
HACVVGGYRMPKKPKAATAA*
>AT3G56990.1 | EDA7 (embryo sac development arrest 7)
MTSYGDRLKSTSINGVKLYNVSSAPNVPTWLNPKKQRALRKNPHYMQRVELIQELKFETA
TTRIKATPDGEYLIASGIYPPQVKVYELNQLALKFERHLDSEIVDFEILDDDFSKLAFLC
ADRSINLHAKYGKHHTLRIPRMGRDMTYDSWSCDLLCAASSPDLYRINLEQGRFLSPLST
QSPALNVVSRSNLHGLVACGGEDGAVEFFDMRMKSSAARINAVTHGGDAAAEVTAIEFDD
SEGLQVAVGSSAGKVFIYDLRTSTPIRVKDHMYESPILNIKWQRTLNTQQPKLITTDKHI
VRIWDPNTGEGMTSIEPTQGGINDICVFRGSGLMLLALDSSLIPSYFIPELGPAPKWCSP
LENLTEELEESAQTTIYDNYKFLAMEDLEKLQLTHLIGTDLLKASMHGYFINYHLYKKAL
AVIEPFAFDDYLERRKQEKLEEQRTQRITKKRRLPKVNRDLAARLHGDESEEENKTAEDG
EATKKVLKKKKPILTDEHFVDGRFGSMFQNPDFQIDKDSYEYGVLHPVASSKKQPSLLDE
HFEAVSDDDENSDSDASQPSDDEADDGDATRPSKKARTPKLYEVKDERHAAAYHNRTSLA
KEDSLPMGERVKAIENRRGNFGGSKDIKFGPGGSREFSFKARGSSKYKEDRDDEYEDGQR
NKRRGVQSLGLKSTNIRGGFRGRGGGGFRGRGGGGSRGKGGRGGGRGRGRQ*
>AT4G13980.1 | AT-HSFA5 DNA binding / transcription factor
MNGALGNSSASVSGGEGAGGPAPFLVKTYEMVDDSSTDQIVSWSANNNSFIVWNHAEFSR
LLLPTYFKHNNFSSFIRQLNTYGFRKIDPERWEFLNDDFIKDQKHLLKNIHRRKPIHSHS
HPPASSTDQERAVLQEQMDKLSREKAAIEAKLLKFKQQKVVAKHQFEEMTEHVDDMENRQ
KKLLNFLETAIRNPTFVKNFGKKVEQLDISAYNKKRRLPEVEQSKPPSEDSHLDNSSGSS
RRESGNIFHQNFSNKLRLELSPADSDMNMVSHSIQSSNEEGASPKGILSGGDPNTTLTKR
EGLPFAPEALELADTGTCPRRLLLNDNTRVETLQQRLTSSEETDGSFSCHLNLTLASAPL
PDKTASQIAKTTLKSQELNFNSIETSASEKNRGRQEIAVGGSQANAAPPARVNDVFWEQF
LTERPGSSDNEEASSTYRGNPYEEQEEKRNGSMMLRNTKNIEQLTL*
>AT1G72440.1 | EDA25 (embryo sac development arrest 25)
MSKIKPLSKSSQDLSLLTSDIASFASSIGLASALPSSGFNDTDFRKPAKSKTQKRKKPKK
DQQHKDEDEEGEPKSNIGNEKGKDFGARKQNKDAPVKQTLQPKPKPGFLSIDDESTGYKK
KRFDEFKSLPKLPLVKASLLSSEWYNDAAEFEEKVFGGRKVAVANKEDFKGVVEKKRELG
ERLMWQYAEDFATSKGKGGDMKMVISAQKSGTVADKITAFEIMVGENPIANMRSLDALLG
MVTSKVGKRFAFKGLKALSEILIRLLPDRKLKSLLQRPLNIIPENKDGYSLLLFWYWEDC
LKQRYERFVTALDESSKDMLPELKDKALKTIYFMLTSKSEQERKLLVSLVNKLGDPQNKS
ASNADYHLTNLLADHPNMKAVVIDEVDSFLFRPHLGLRAKYHAVNFLSQIRLSHKGEDPK
VAKRLIDVYFALFKVLTTEANRKQGADDKGAADKKKSNPKDTKQEVSTDSPIELDSRILS
ALLTGVNRAFPYVSTDEADDIIESQTPVLFKLVHSANFNVGVQSLMLLDKISSKNKIVSD
RFYRALYSKLLLPSAMNSSKAEMFIGLLLRAMKNDINIKRVAAFSKRVLQVALQQPPQYA
CGCLFLLSEVLKSRPPLWKMVVQRESVEEEEDIEHFEDVIEGDDVDPNKKAENDENVVEV
DHDGVEKSSRDGDSSSDDEEALAIRLSDEEDDNASDDSEELIRNETPQLEEVMEVSNDME
KRSQPPMRPSSLPGGYDPRHREPSYCNADRASWWELGVLSKHAHPSVATMAGTLLSGTNI
VYNGNPLNDLSLTAFLDKFMEKKPKQNTWHGGSQIEPSKKLDMSNRVIGAEILSLAEGDV
APEDLVFHKFYVNKMTSTKQSKKKKKKKLPEEEAAEELYDVNDGDGGENYDSDVEFEAGD
ESDNEEIENMLDDVDDNAVEEEGGEYDYDDLDGVAGEDDEELVADVSDAEMDTDMDMDLI
DDEDDNNVDDDGTGDGGDDDSDGDDGRSKKKKKEKRKRKSPFASLEEYKHLIDQDEKEDS
KTKRKATSEPTKKKKKKKSKASE*
>AT3G57150.1 | NAP57 (Arabidopsis thaliana homologue of NAP57) pseudouridine synthase
MAEVDISHSKKKKQDKTENDAADTGDYMIKPQSFTPAIDTSQWPILLKNYDRLNVRTGHY
TPISAGHSPLKRPLQEYIRYGVINLDKPANPSSHEVVAWIKRILRVEKTGHSGTLDPKVT
GNLIVCIDRATRLVKSQQGAGKEYVCVARLHSAVPDVAKVARALESLTGAVFQRPPLISA
VKRQLRIRTIYESKLLEYDADRHLVVFWVSCEAGTYIRTMCVHLGLLLGVGGHMQELRRV
RSGILGENNNMVTMHDVMDAQFVYDNSRDESYLRRVIMPLEMILTSYKRLVVKDSAVNAI
CYGAKLMIPGLLRFENDIDVGTEVVLMTTKGEAIAVGIAEMTTSVMATCDHGVVAKIKRV
VMDRDTYPRKWGLGPRASMKKKLIADGKLDKHGKPNEKTPVEWSRNVVLPTGGDAIIAGA
AAAPEEIKADAENGEAGEARKRKHDDSSDSPAPVTTKKSKTKEVEGEEAEEKVKSSKKKK
KKDKEEEKEEEAGSEKKEKKKKKDKKEEVIEEVASPKSEKKKKKKSKDTEAAVDAEDESA
AEKSEKKKKKKDKKKKNKDSEDDEE*
>AT4G19610.1 | RNA binding / nucleic acid binding / nucleotide binding
MSRICVKNLPKHVKEDQLRDHFSQKGEITDAKLMRSNDGKSRQFGFIGFRSAQEAQQAIK
YFNNTYLGTSLIIVEIAHKVGDENAPRPWSRLSHKKEEEAKKSSSEGLKDGNAKGGKKRK
AEVDDPEFQEFLEVHQRSKSKIWSNDMSIPPAPEETGKEKVLVKKADEQIVSNGVEPKKA
KKSSDTEKTKKSKVVAASDDVSDMEYFKSRIKKNLSDSESDNESEDSSEDEAGDDDGKAE
TDGQDADIRYFPIDGDVEAGGVGKDDDGDAMEVEGDGKVAQESKAVSDDVLDTGRLFVRN
LPYTATEEELMEHFSTFGKISEVHLVLDKETKRSRGIAYILYLIPECAARAMEELDNSSF
QGRLLHILPAKHRETSDKQVNDTSNLPKTFKQKREEQRKASEAGGDTKAWNSLFMRPDTI
LENIVRVYGVSKSELLDREAEDPAVRLALGETKVIAETKEALAKAGVNVTSLEKFATRNG
DEKNRSKHILLVKNLPFASTEKELAQMFGKFGSLDKIILPPTKTMALAVFLEPAEARAAL
KGMAYKRYKDAPLYLEWAPGNILEPKNLPDTNEERSDIEENGVRRVNLEQQVEIDPDVTE
SNVLNVKNLSFKTTDEGLKKHFTKLVKQGKILSVTIIKHKKNEKYLSSGYGFVEFDSVET
ATSVYRDLQGTVLDGHALILRFCENKRSDKVGKDSNKDKPCTKLHVKNIAFEATKRELRQ
LFSPFGQIKSMRLPKKNIGQYAGYAFVEFVTKQEALNAKKALASTHFYGRHLVLEWANDD
NSMEAIRKRSAAKFDEENDNARKRKSSKAVEGKNEV*
>AT5G08180.1 | ribosomal protein L7Ae/L30e/S12e/Gadd45 family protein
MGSDTEAEKSIQKEKKKYAITLAPIAKPLAGKKLQKRTFKLIQKAAGKKCLKRGVKEVVK
SIRRGQKGLCVIAGNISPIDVITHLPILCEEAGVPYVYVPSKEDLAQAGATKRPTCCVLV
MLKPAKGDLTAEELAKLKTDYEQVSDDIKELATSVI*
>AT1G48920.1 | ATNUC-L1 nucleic acid binding / nucleotide binding
MGKSKSATKVVAEIKATKPLKKGKREPEDDIDTKVSLKKQKKDVIAAVQKEKAVKKVPKK
VESSDDSDSESEEEEKAKKVPAKKAASSSDESSDDSSSDDEPAPKKAVAATNGTVAKKSK
DDSSSSDDDSSDEEVAVTKKPAAAAKNGSVKAKKESSSEDDSSSEDEPAKKPAAKIAKPA
AKDSSSSDDDSDEDSEDEKPATKKAAPAAAKAASSSDSSDEDSDEESEDEKPAQKKADTK
ASKKSSSDESSESEEDESEDEEETPKKKSSDVEMVDAEKSSAKQPKTPSTPAAGGSKTLF
AANLSFNIERADVENFFKEAGEVVDVRFSTNRDDGSFRGFGHVEFASSEEAQKALEFHGR
PLLGREIRLDIAQERGERGERPAFTPQSGNFRSGGDGGDEKKIFVKGFDASLSEDDIKNT
LREHFSSCGEIKNVSVPIDRDTGNSKGIAYLEFSEGKEKALELNGSDMGGGFYLVVDEPR
PRGDSSGGGGFGRGNGRFGSGGGRGRDGGRGRFGSGGGRGRDGGRGRFGSGGGRGSDRGR
GRPSFTPQGKKTTFGDE*
>AT3G05060.1 | SAR DNA-binding protein putative
MVLVLYETAAGFALFKVKDEGKMANVEDLCKEFDTPDSARKMVKLKAFEKFDNTSEALEA
VAKLLEGAPSKGLRKFLKANCQGETLAVADSKLGNVIKEKLKIDCIHNNAVMELLRGVRS
QFTELISGLGDQDLAPMSLGLSHSLARYKLKFSSDKVDTMIIQAIGLLDDLDKELNTYAM
RVREWYGWHFPELAKIISDNILYAKSVKLMGNRVNAAKLDFSEILADEIEADLKDAAVIS
MGTEVSDLDLLHIRELCDQVLSLSEYRAQLYDYLKSRMNTIAPNLTALVGELVGARLISH
GGSLLNLSKQPGSTVQILGAEKALFRALKTKHATPKYGLIFHASLVGQAAPKHKGKISRS
LAAKTVLAIRVDALGDSQDNTMGLENRAKLEARLRNLEGKDLGRLSGSSKGKPKIEVYNK
DKKMGSGGLITPAKTYNTAADSLLGETSAKSEEPSKKKDKKKKKKVEEEKPEEEEPSEKK
KKKKAEAETEAVVEVAKEEKKKNKKKRKHEEEETTETPAKKKDKKEKKKKSKD*
>AT3G12860.1 | nucleolar protein Nop56 putative
MKIYLLSESPSGYGLFEGHGSDEIGQNTEAVRSSVSDLSRFGRVVQLTAFHPFQSALDAL
NQINAVSEGYMSDELRSFLELNLPKVKEGKKPKFSLGVSEPKIGSCIFEATKIPCQSNEF
VHELLRGVRQHFDRFIKDLKPGDLEKAQLGLAHSYSRAKVKFNVNRVDNMVIQAIFMLDT
LDKDINSFAMRVREWYSWHFPELVKIVNDNYLYAKVSKIIVDKSKLSEEHIPMLTEALGD
EDKAREVIEAGKASMGQDLSPVDLINVQTFAQRVMDLADYRKKLYDYLVTKMSDIAPNLA
ALIGEMVGARLISHAGSLTNLAKCPSSTLQILGAEKALFRALKTRGNTPKYGLIFHSSFI
GRASAKNKGRIARFLANKCSIASRIDCFSDNSTTAFGEKLREQVEERLDFYDKGVAPRKN
VDVMKEVLENLEKKDEGEKTVDASEKKKKRKTEEKEEEKEEEKSKKKKKKSKAVEGEELT
ATDNGHSKKKKKTKSQDDE*
>AT3G62310.1 | RNA helicase putative
MGTERKRKISLFDVMDDPSAPAKNAKTSGLPDGGINSLINKWNGKPYSQRYYDILEKRRT
LPVWLQKEEFLKTLNNNQTLILVGETGSGKTTQIPQFVIDAVDAETSDKRRKWLVGCTQP
RRVAAMSVSRRVAEEMDVTIGEEVGYSIRFEDCSSPRTVLKYLTDGMLLREAMADPLLER
YKVIILDEAHERTLATDVLFGLLKEVLKNRPDLKLVVMSATLEAEKFQDYFSGAPLMKVP
GRLHPVEIFYTQEPERDYLEAAIRTVVQIHMCEPPGDILVFLTGEEEIEDACRKINKEVG
NLGDQVGPIKVVPLYSTLPPAMQQKIFDPAPEPVTEGGPPGRKIVVSTNIAETSLTIDGI
VYVIDPGFAKQKVYNPRIRVESLLVSPISKASAHQRSGRAGRTRPGKCFRLYTEKSFNND
LQPQTYPEILRSNLANTVLTLKKLGIDDLVHFDFMDPPAPETLMRALEVLNYLGALDDDG
NLTKTGEIMSEFPLDPQMAKMLIVSPEFNCSNEILSVSAMLSVPNCFIRPREAQKAADEA
KARFGHIEGDHLTLLNVYHAFKQNNEDPNWCYENFINNRAMKSADNVRQQLVRIMSRFNL
KMCSTDFNSRDYYINIRKAMLAGYFMQVAHLERTGHYLTVKDNQVVHLHPSNCLDHKPEW
VIYNEYVLTSRNFIRTVTDIRGEWLVDVASHYYDLSNFPNCEAKRVIEKLYKKREREKEE
SKKNRK*
>AT1G56110.1 | NOP56 (Arabidopsis homolog of nucleolar protein Nop56)
MAMYVIYESSSGYGLFEVHGLDEIGQNTEAVRTSVSDLSRFGRVVQLTAFHPFESALDAL
NQVNAVSEGVMTDELRSFLELNLPKVKEGKKPKFSLGLAEPKLGSHIFEATKIPCQSNEF
VLELLRGVRQHFDRFIKDLKPGDLEKSQLGLAHSYSRAKVKFNVNRVDNMVIQAIFMLDT
LDKDINSFAMRVREWYSWHFPELVKIVNDNYLYARVSKMIDDKSKLTEDHIPMLTEVLGD
EDKAKEVIEAGKASMGSDLSPLDLINVQTFAQKVMDLADYRKKLYDYLVTKMSDIAPNLA
ALIGEMVGARLISHAGSLTNLAKCPSSTLQILGAEKALFRALKTRGNTPKYGLIFHSSFI
GRASAKNKGRIARYLANKCSIASRIDCFADGATTAFGEKLREQVEERLEFYDKGVAPRKN
VDVMKEVIENLKQEEEGKEPVDASVKKSKKKKAKGEEEEEVVAMEEDKSEKKKKKEKRKM
ETAEENEKSEKKKTKKSKAGGEEETDDGHSTKKKKKKSKSAE*
>AT4G05410.1 | transducin family protein / WD-40 repeat family protein
MKYNNEKKKGGSFKRGGKKGSNERDPFFEEEPKKRRKVSYDDDDIESVDSDAEENGFTGG
DEDGRRVDGEVEDEDEFADETAGEKRKRLAEEMLNRRREAMRREREEADNDDDDDEDDDE
TIKKSLMQKQQEDSGRIRRLIASRVQEPLSTDGFSVIVKHRRSVVSVALSDDDSRGFSAS
KDGTIMHWDVSSGKTDKYIWPSDEILKSHGMKLREPRNKNHSRESLALAVSSDGRYLATG
GVDRHVHIWDVRTREHVQAFPGHRNTVSCLCFRYGTSELYSGSFDRTVKVWNVEDKAFIT
ENHGHQGEILAIDALRKERALTVGRDRTMLYHKVPESTRMIYRAPASSLESCCFISDNEY
LSGSDNGTVALWGMLKKKPVFVFKNAHQDIPDGITTNGILENGDHEPVNNNCSANSWVNA
VATSRGSDLAASGAGNGFVRLWAVETNAIRPLYELPLTGFVNSLAFAKSGKFLIAGVGQE
TRFGRWGCLKSAQNGVAIHPLRLA*
>AT5G66540.1 | FUNCTIONS IN molecular_function unknown INVOLVED IN rRNA processing LOCATED IN cytosol nucleolus nucleus EXPRESSED IN 22 plant structures EXPRESSED DURING 12 growth stages CONTAINS InterPro DOMAIN/s U3 small nucleolar ribonucleoprotein complex subunit Mpp10p (InterProIPR012173) Mpp10 protein (InterProIPR007151) Has 76240 Blast hits to 38667 proteins in 1479 species Archae - 252 Bacteria - 6537 Metazoa - 31185 Fungi - 9935 Plants - 3937 Viruses - 750 Other Eukaryotes - 23644 (source NCBI BLink)
MATVKDSGFEALEKLKATEPPVFLAPSSISEDARSASQYLFMKLKPHNPKCPFDQLSSDG
FDAEQIWQQIDMQSQPLLTSLRQEVKRFAKNPEEIRKLGKLALKVSHEDDIDEMDMDGFD
SDDVDDEDKEIESNDSEGEDEEEEEEDEEEEEEEEEEEEEEKDGDNEGIEDKFFKIKELE
EFLEEGEAEEYGIDHKNKKGVAQRKKQNLSDDEDEEDDDDEEEDVEFDAFAGGDDEETDK
LGKARYDDFFGGKKETKMKLKDLSEDEEAEIENKGNEKLSTHERARLKLQSKIEQMEKAN
LDPKHWTMQGEITAAKRPMNSALEVDLDFEHNARPAPVITEEVTASLEDLIKSRIIEARF
DDVQRAPRLPTKGKREAKELDESKSKKGLAEVYEAEYFQKANPAFAPTTHSDELKKEASM
LFKKLCLKLDALSHFHFTPKPVIEEMSIPNVSAIAMEEVAPVAVSDAAMLAPEEIFSGKG
DIKDESELTQEDRKRRRANKKRKFKAESANEPPKKALDTSTKNP*
>AT4G22380.1 | ribosomal protein L7Ae/L30e/S12e/Gadd45 family protein
MTGEVVNPKAYPLADSQLSITIMDLVQQATNYKQLKKGANEATKTLNRGISEFVVMAADA
EPLEILLHLPLLAEDKNVPYVFVPSKQALGRACGVTRPVIACSVTSNEASQLKSQIQHLK
DAIEKLLI*
>AT2G47990.1 | SWA1 (SLOW WALKER1) nucleotide binding
MEEELRVRLNDHQVSKVFPVKPKSTAKPVSESETPESRYWSSFKNHSTPNLVSSVAALAF
SPVHPHSLAVAHSATVSLFSSQSLSSSRRFSFRDVVSSVCFRSDGALFAACDLSGVVQVF
DIKERMALRTLRSHSAPARFVKYPVQDKLHLVSGGDDGVVKYWDVAGATVISDLLGHKDY
VRCGDCSPVNDSMLVTGSYDHTVKVWDARVHTSNWIAEINHGLPVEDVVYLPSGGLIATA
GGNSVKVWDLIGGGKMVCSMESHNKTVTSLRVARMESAESRLVSVALDGYMKVFDYGRAK
VTYSMRFPAPLMSLGLSPDGSTRVIGGSNGMVFAGKKKVRDVVGGQKKSLNLWSLISDVD
ESRRRALRPTYFRYFQRGQSEKPSKDDYLVKEKKGLKLTRHDKLLKKFRHKEALVSVLEE
KKPANVVAVMEELVARRKLMKCVSNMEEGELGMLLGFLQRYCTVQRYSGLLMGLTKKVLE
TRAEDIKGKNEFKGLLRNLKREVNQEIRIQQSLLEIQGVIAPLMRIAGRS*
>AT1G50920.1 | GTP-binding protein-related
MVQYNFKRITVVPNGKEFVDIILSRTQRQTPTVVHKGYKINRLRQFYMRKVKYTQTNFHA
KLSAIIDEFPRLEQIHPFYGDLLHVLYNKDHYKLALGQVNTARNLISKISKDYVKLLKYG
DSLYRCKCLKVAALGRMCTVLKRITPSLAYLEQIRQHMARLPSIDPNTRTVLICGYPNVG
KSSFMNKVTRADVDVQPYAFTTKSLFVGHTDYKYLRYQVIDTPGILDRPFEDRNIIEMCS
ITALAHLRAAVLFFLDISGSCGYTIAQQAALFHSIKSLFMNKPLVIVCNKTDLMPMENIS
EEDRKLIEEMKSEAMKTAMGASEEQVLLKMSTLTDEGVMSVKNAACERLLDQRVEAKMKS
KKINDHLNRFHVAIPKPRDSIERLPCIPQVVLEAKAKEAAAMEKRKTEKDLEEENGGAGV
YSASLKKNYILQHDEWKEDIMPEILDGHNVADFIDPDILQRLAELEREEGIREAGVEEAD
MEMDIEKLSDEQLKQLSEIRKKKAILIKNHRLKKTVAQNRSTVPRKFDKDKKYTTKRMGR
ELSAMGLDPSSAMDRARSKSRGRKRDRSEDAGNDAMDVDDEQQSNKKQRVRSKSRAMSIS
RSQSRPPAHEVVPGEGFKDSTQKLSAIKISNKSHKKRDKNARRGEADRVIPTLRPKHLFS
GKRGKGKTDRR*
>AT1G06720.1 | INVOLVED IN ribosome biogenesis LOCATED IN nucleus EXPRESSED IN 21 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s AARP2CN (InterProIPR012948) Protein of unknown function DUF663 (InterProIPR007034) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT1G424401) Has 7944 Blast hits to 5342 proteins in 373 species Archae - 33 Bacteria - 667 Metazoa - 2609 Fungi - 1045 Plants - 414 Viruses - 80 Other Eukaryotes - 3096 (source NCBI BLink)
MAADELMPSHRSHRTPKSGPTARKKSELDKKKRGISVDKQKNLKAFGVKSVVHAKKAKHH
AAEKEQKRLHLPKIDRNYGEAPPFVVVVQGPPGVGKSLVIKSLVKEFTKQNVPEVRGPIT
IVQGKQRRFQFVECPNDINAMVDCAKVADLALLVVDGSYGFEMETFEFLNIMQVHGFPRV
MGVLTHLDKFNDVKKLRKTKHHLKHRFWTEIYHGAKLFYLSGLIHGKYTPREVHNLARFV
IVIKPQPLTWRTAHPYVLVDRLEDVTPPEKVQMDKKCDRNITVFGYLRGCNFKKRMKVHI
AGVGDFIVAGVTALTDPCPLPSAGKKKGLRDRDKLFYAPMSGIGDLVYDKDAVYININSH
QVQYSKTDDGKGEPTNKGKGRDVGEDLVKSLQNTKYSVDEKLDKTFINFFGKKTSASSET
KLKAEDAYHSLPEGSDSESQSGDDEEDIVGNESEMKQETEIHGGRLRRKAIFKTDLNEDD
FEEADDLELDSYDPDTYDFEEADDAESDDNEVEDGGDDSASDSADGEPGDYQIDDKDSGN
ISQWKAPLKEIARKKNPNLMQIVYGASSLATPLINENHDISDDDESDDEDFFKPKGEQHK
NLGGGLDVGYVNSEDCSKFVNYGYLKNWKEKEVCESIRDRFTTGDWSKAALRDKNLGTGG
EGEDDELYGDFEDLETGEKHKSHENLESGANENEDEDAEVVERDGNNPRSQADEPGYADK
LKEAQEITKQRNELEYNDLDEETRIELAGFRTGTYLRLEIHNVPYEMVEFFDPCHPILVG
GIGFGEDNVGYMQARLKKHRWHKKVLKTRDPIIVSIGWRRYQTIPVFAIEDRNGRHRMLK
YTPEHMHCLASFWGPLVPPNTGFVAFQNLSNNQAGFRITATSVVLEFNHQARIVKKIKLV
GTPCKIKKKTAFIKDMFTSDLEIARFEGSSVRTVSGIRGQVKKAGKNMLDNKAEEGIARC
TFEDQIHMSDMVFLRAWTTVEVPQFYNPLTTALQPRDKTWNGMKTFGELRRELNIPIPVN
KDSLYKAIERKQKKFNPLQIPKRLEKDLPFMSKPKNIPKRKRPSLEDKRAVIMEPKERKE
HTIIQQFQLLQHHTMKKKKATDQKKRKEYEAEKAKNEEINKKRRREERRDRYREEDKQKK
KTRRSLD*
>AT1G15420.1 | unknown protein
MAKDKLKPLLSSDAAGDIADTPLREKKHKKKSKKRAEPEPDIPSTRDSGLDEDRDGVLVD
DTLNEPTIGDKLESLDLLNGEKVNSEESNRDSAPGDDKPPTAASVNVLLRQALHADDRSL
LLDCLYNRDEQVIANSVAKLNSAEVLKLLNALLPILQSRGAILACTIPWIKSLLLTHSSG
IMSQESSLLALNTMYQLIESRVSTIHTAVEVSSGLDLIVDDLDEEEDEGPVIYEDKDSDE
DEEEGIEEAMETDEEADDSADEAADGVNDFEGFDDMSD*
>AT1G15440.1 | transducin family protein / WD-40 repeat family protein
MEFRFENLLGAPYRGGNAVITKNTQLISPVGNRVSVTDLSKNHSVTLPLETSTNICRLAS
SPDGTFLLAVDEQNRCLFINLPRRVVLHRITFKDKVGALKFSPNGKFIAVGIGKLVEIWR
SPGFRRAVLPFERVRTFANSDDKVVSLEWSLDSDYLLVGSRDLAARLFCVRKLKGVLNKP
FLFLGHRDSVVGCFFGVDKMTNKVNRAFTIARDGYIFSWGYTEKDVKMDESEDGHSEPPS
PVTPDRADEVMVENGGGVGTELKKRKEYDGKGLESDEEGDDDDEEYMHRGKWVLLRKDGC
NQASAKVTACDYHQGLDMVVVGFSNGVFGLYQMPDFICIHLLSISRQKLTTAVFNERGNW
LTFGCAKLGQLLVWDWRTETYILKQQGHYFDVNCVTYSPDSQLLATGADDNKVKVWNVMS
GTCFITFTEHTNAVTALHFMADNHSLLSASLDGTVRAWDFKRYKNYKTYTTPTPRQFVSL
TADPSGDVVCAGTLDSFEIFVWSKKTGQIKDILSGHEAPVHGLMFSPLTQLLASSSWDYT
VRLWDVFASKGTVETFRHNHDVLTVAFRPDGKQLASSTLDGQINFWDTIEGVLMYTIEGR
RDIAGGRVMTDRRSAANSSSGKCFTTLCYSADGGYILAAGTSRYICMYDIADQVLLRRFQ
ISHNLSLDGVLDFLHSKKMTEAGPIDLIDDDNSDEEGGIDKQSRGNLGYDLPGSRPNRGR
PIIRTKSLSIAPTGRSFAAATTEGVLIFSIDDTFIFDPTDLDIDVTPEAVEAAIEEDEVS
RALALSMRLNEDSLIKKCIFAVAPADIKAVAISVRQKYLERLMEALVDLLENCPHLEFIL
HWCQEICKAHGSSIQRNYRTLLPALRSLQKAITRAHQDLADMCSSNEYTLRYLCSVPNNH
*
>AT1G15440.1 | transducin family protein / WD-40 repeat family protein
MEFRFENLLGAPYRGGNAVITKNTQLISPVGNRVSVTDLSKNHSVTLPLETSTNICRLAS
SPDGTFLLAVDEQNRCLFINLPRRVVLHRITFKDKVGALKFSPNGKFIAVGIGKLVEIWR
SPGFRRAVLPFERVRTFANSDDKVVSLEWSLDSDYLLVGSRDLAARLFCVRKLKGVLNKP
FLFLGHRDSVVGCFFGVDKMTNKVNRAFTIARDGYIFSWGYTEKDVKMDESEDGHSEPPS
PVTPDRADEVMVENGGGVGTELKKRKEYDGKGLESDEEGDDDDEEYMHRGKWVLLRKDGC
NQASAKVTACDYHQGLDMVVVGFSNGVFGLYQMPDFICIHLLSISRQKLTTAVFNERGNW
LTFGCAKLGQLLVWDWRTETYILKQQGHYFDVNCVTYSPDSQLLATGADDNKVKVWNVMS
GTCFITFTEHTNAVTALHFMADNHSLLSASLDGTVRAWDFKRYKNYKTYTTPTPRQFVSL
TADPSGDVVCAGTLDSFEIFVWSKKTGQIKDILSGHEAPVHGLMFSPLTQLLASSSWDYT
VRLWDVFASKGTVETFRHNHDVLTVAFRPDGKQLASSTLDGQINFWDTIEGVLMYTIEGR
RDIAGGRVMTDRRSAANSSSGKCFTTLCYSADGGYILAAGTSRYICMYDIADQVLLRRFQ
ISHNLSLDGVLDFLHSKKMTEAGPIDLIDDDNSDEEGGIDKQSRGNLGYDLPGSRPNRGR
PIIRTKSLSIAPTGRSFAAATTEGVLIFSIDDTFIFDPTDLDIDVTPEAVEAAIEEDEVS
RALALSMRLNEDSLIKKCIFAVAPADIKAVAISVRQKYLERLMEALVDLLENCPHLEFIL
HWCQEICKAHGSSIQRNYRTLLPALRSLQKAITRAHQDLADMCSSNEYTLRYLCSVPNNH
*
>AT1G15440.2 | transducin family protein / WD-40 repeat family protein
MEFRFENLLGAPYRGGNAVITKNTQLISPVGNRVSVTDLSKNHSVTLPLETSTNICRLAS
SPDGTFLLAVDEQNRCLFINLPRRVVLHRITFKDKVGALKFSPNGKFIAVGIGKLVEIWR
SPGFRRAVLPFERVRTFANSDDKVVSLEWSLDSDYLLVGSRDLAARAFTIARDGYIFSWG
YTEKDVKMDESEDGHSEPPSPVTPDRADEVMVENGGGVGTELKKRKEYDGKGLESDEEGD
DDDEEYMHRGKWVLLRKDGCNQASAKVTACDYHQGLDMVVVGFSNGVFGLYQMPDFICIH
LLSISRQKLTTAVFNERGNWLTFGCAKLGQLLVWDWRTETYILKQQGHYFDVNCVTYSPD
SQLLATGADDNKVKVWNVMSGTCFITFTEHTNAVTALHFMADNHSLLSASLDGTVRAWDF
KRYKNYKTYTTPTPRQFVSLTADPSGDVVCAGTLDSFEIFVWSKKTGQIKDILSGHEAPV
HGLMFSPLTQLLASSSWDYTVRLWDVFASKGTVETFRHNHDVLTVAFRPDGKQLASSTLD
GQINFWDTIEGVLMYTIEGRRDIAGGRVMTDRRSAANSSSGKCFTTLCYSADGGYILAAG
TSRYICMYDIADQVLLRRFQISHNLSLDGVLDFLHSKKMTEAGPIDLIDDDNSDEEGGID
KQSRGNLGYDLPGSRPNRGRPIIRTKSLSIAPTGRSFAAATTEGVLIFSIDDTFIFDPTD
LDIDVTPEAVEAAIEEDEVSRALALSMRLNEDSLIKKCIFAVAPADIKAVAISVRQKYLE
RLMEALVDLLENCPHLEFILHWCQEICKAHGSSIQRNYRTLLPALRSLQKAITRAHQDLA
DMCSSNEYTLRYLCSVPNNH*
>AT1G15440.2 | transducin family protein / WD-40 repeat family protein
MEFRFENLLGAPYRGGNAVITKNTQLISPVGNRVSVTDLSKNHSVTLPLETSTNICRLAS
SPDGTFLLAVDEQNRCLFINLPRRVVLHRITFKDKVGALKFSPNGKFIAVGIGKLVEIWR
SPGFRRAVLPFERVRTFANSDDKVVSLEWSLDSDYLLVGSRDLAARAFTIARDGYIFSWG
YTEKDVKMDESEDGHSEPPSPVTPDRADEVMVENGGGVGTELKKRKEYDGKGLESDEEGD
DDDEEYMHRGKWVLLRKDGCNQASAKVTACDYHQGLDMVVVGFSNGVFGLYQMPDFICIH
LLSISRQKLTTAVFNERGNWLTFGCAKLGQLLVWDWRTETYILKQQGHYFDVNCVTYSPD
SQLLATGADDNKVKVWNVMSGTCFITFTEHTNAVTALHFMADNHSLLSASLDGTVRAWDF
KRYKNYKTYTTPTPRQFVSLTADPSGDVVCAGTLDSFEIFVWSKKTGQIKDILSGHEAPV
HGLMFSPLTQLLASSSWDYTVRLWDVFASKGTVETFRHNHDVLTVAFRPDGKQLASSTLD
GQINFWDTIEGVLMYTIEGRRDIAGGRVMTDRRSAANSSSGKCFTTLCYSADGGYILAAG
TSRYICMYDIADQVLLRRFQISHNLSLDGVLDFLHSKKMTEAGPIDLIDDDNSDEEGGID
KQSRGNLGYDLPGSRPNRGRPIIRTKSLSIAPTGRSFAAATTEGVLIFSIDDTFIFDPTD
LDIDVTPEAVEAAIEEDEVSRALALSMRLNEDSLIKKCIFAVAPADIKAVAISVRQKYLE
RLMEALVDLLENCPHLEFILHWCQEICKAHGSSIQRNYRTLLPALRSLQKAITRAHQDLA
DMCSSNEYTLRYLCSVPNNH*
>AT1G31660.1 | FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 21 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s Bystin (InterProIPR007955) Has 370 Blast hits to 362 proteins in 156 species Archae - 0 Bacteria - 7 Metazoa - 139 Fungi - 93 Plants - 32 Viruses - 0 Other Eukaryotes - 99 (source NCBI BLink)
MAKKRDRIVNTQPFISDDASVASSRKRSKVPKTHQKQEKLIEAGMSEKIMKQALAQQKEV
ADEENAERNPSSAAFAVAGAATAGEEQKILEEEEDDIDDFDGTFENQSQFDKQEEINEDD
EKLFESFLNKNAPPQRTLTDIIIKKLKDKDADLAEEERPDPKMDPAITKLYKGVGKFMSE
YTVGKLPKAFKLVTSMEHWEDVLYLTEPEKWSPNALYQATRIFASNLKDRQVQRFYNYVL
LPRVREDIRKHKKLHFALYQALKKSLYKPSAFNQGILFPLCKSGTCNLREAVIIGSILEK
CSIPMLHSCVALNRLAEMDYCGTTSYFIKVLLEKKYCMPYRVLDALVAHFMRFVDDIRVM
PVIWHQSLLTFVQRYKYEILKEDKEHLQTLLQRQKHHLVTPEILRELKDSRNRGEKEDPM
VDNFAPVPAKEDRFDIPEVPMEED*
>AT1G42440.1 | FUNCTIONS IN molecular_function unknown INVOLVED IN ribosome biogenesis LOCATED IN nucleus EXPRESSED IN 22 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s AARP2CN (InterProIPR012948) Protein of unknown function DUF663 (InterProIPR007034) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT1G067201) Has 2447 Blast hits to 1812 proteins in 205 species Archae - 0 Bacteria - 115 Metazoa - 903 Fungi - 423 Plants - 100 Viruses - 53 Other Eukaryotes - 853 (source NCBI BLink)
MGRSRVQVNKAHKTRFSSKSSRNLHRTNLQDSGRIGKSDSNYVKGAKAARVQRGKMLREQ
KRAAVLKEKRASGGINSAPRVIVLFPLSASVELNSLGEDVLKLLSSDGSGIASSTVASSE
YKLRATVLKAPHGDLLTCMEMAKVADLMAFVASASAPWEENSSNFIDSFGSQCLSVFRSI
GLPSTTVLIRDLPSDVKKKNEMKKMCASQLASEFPEDCKFYPADTRDELHKFMWLFKAQR
LTVPHWRSQRSYIVARKAGMLVDDESSGKCTLLLSGYLRARKLSVNQLVHVSGVGDFQFS
KIEVLKDPFPLNERKNQNSMELDDSHDEEVLKSLVPDPMKQEPLVIENTPDPLAGEQTWP
TEEEMAEADKNQKQGRLKKKTLPRGTSEYQAAWIVDETDEEDSDNGDSDDNGMVLDRGED
SNQEGMYDQEFEDDGKSLNLRDIDTETQNESEMVDDEDLTEEQIKDEIKKIKEAYADDEE
FPDEVETPIDVPARRRFAKYRGLKSFRTSSWDPNESLPQDYARIFAFDNVARTQKLVLKQ
ALKMEEEDRDDCVPIGSYVRLHIKEVPLGAASKLSSLVNTTKPIIGFGLLQHESKMSVLH
FSVKKYDGYEAPIKTKEELMFHVGFRQFIARPVFATDNFSSDKHKMERFLHPGCFSLASI
YGPISFPPLPLVVLKISEGSDPPAIAALGSLKSVEPNKIILKKIILTGYPQRVSKMKASV
RYMFHNPEDVKWFKPVEVWSKCGRRGRVKEPVGTHGAMKCIFNGVVQQHDVVCMNLYKRA
YPKWPERLYPQLL*
>AT1G55150.1 | DEAD box RNA helicase putative (RH20)
MSRYDSRTGDSTSYRDRRSDSGFGGTSSYGSSGSHTSSKKDNDGNESPRKLDLDGLTPFE
KNFYVESPAVAAMTDTEVEEYRKLREITVEGKDIPKPVKSFRDVGFPDYVLEEVKKAGFT
EPTPIQSQGWPMAMKGRDLIGIAETGSGKTLSYLLPAIVHVNAQPMLAHGDGPIVLVLAP
TRELAVQIQQEASKFGSSSKIKTTCIYGGVPKGPQVRDLQKGVEIVIATPGRLIDMMESN
NTNLRRVTYLVLDEADRMLDMGFDPQIRKIVSHIRPDRQTLYWSATWPKEVEQLSKKFLY
NPYKVIIGSSDLKANRAIRQIVDVISESQKYNKLVKLLEDIMDGSRILVFLDTKKGCDQI
TRQLRMDGWPALSIHGDKSQAERDWVLSEFRSGKSPIMTATDVAARGLDVKDVKYVINYD
FPGSLEDYVHRIGRTGRAGAKGTAYTFFTVANARFAKELTNILQEAGQKVSPELASMGRS
TAPPPPGLGGFRDRGSRRGWS*
>AT5G44740.1 | POLH (Y-FAMILT DNA POLYMERASE H) DNA-directed DNA polymerase
MRLKLLVLRFNWFKFLWLVVVSILAKSGKCERASIDEVYLDLTDAAESMLADAPPESLEL
IDEEVLKSHILGMNREDGDDFKESVRNWICREDADRRDKLLSCGIIIVAELRKQVLKETE
FTCSAGIAHNKMLAKLASGMNKPAQQTVVPYAAVQELLSSLPIKKMKQLGGKLGTSLQTD
LGVDTVGDLLQFSETKLQEHYGVNTGTWLWNIARGISGEEVQGRLLPKSHGSGKTFPGPR
ALKSLSTVQHWLNQLSEELSERLGSDLEQNKRIASTLTLHASAFRSKDSDSHKKFPSKSC
PMRYGVTKIQEDAFNLFQAALREYMGSFGIKPQGNKLETWRITGLSVSASKIVDIPSGTS
SIMRYFQSQPTVPSRSADGCVQGNVAMTASASEGCSEQRSTETQAAMPEVDTGVTYTLPN
FENQDKDIDLVSEKDVVSCPSNEATDVSTQSESNKGTQTKKIGRKMNNSKEKNRGMPSIV
DIFKNYNATPPSKQETQEDSTVSSASKRAKLSSSSHNSQVNQEVEESRETDWGYKTDEID
QSVFDELPVEIQRELRSFLRTNKQFNTGKSKGDGSTSSIAHYFPPLNR*
>AT5G44740.1 | POLH (Y-FAMILT DNA POLYMERASE H) DNA-directed DNA polymerase
MRLKLLVLRFNWFKFLWLVVVSILAKSGKCERASIDEVYLDLTDAAESMLADAPPESLEL
IDEEVLKSHILGMNREDGDDFKESVRNWICREDADRRDKLLSCGIIIVAELRKQVLKETE
FTCSAGIAHNKMLAKLASGMNKPAQQTVVPYAAVQELLSSLPIKKMKQLGGKLGTSLQTD
LGVDTVGDLLQFSETKLQEHYGVNTGTWLWNIARGISGEEVQGRLLPKSHGSGKTFPGPR
ALKSLSTVQHWLNQLSEELSERLGSDLEQNKRIASTLTLHASAFRSKDSDSHKKFPSKSC
PMRYGVTKIQEDAFNLFQAALREYMGSFGIKPQGNKLETWRITGLSVSASKIVDIPSGTS
SIMRYFQSQPTVPSRSADGCVQGNVAMTASASEGCSEQRSTETQAAMPEVDTGVTYTLPN
FENQDKDIDLVSEKDVVSCPSNEATDVSTQSESNKGTQTKKIGRKMNNSKEKNRGMPSIV
DIFKNYNATPPSKQETQEDSTVSSASKRAKLSSSSHNSQVNQEVEESRETDWGYKTDEID
QSVFDELPVEIQRELRSFLRTNKQFNTGKSKGDGSTSSIAHYFPPLNR*
>AT5G44740.2 | POLH (Y-FAMILT DNA POLYMERASE H) DNA-directed DNA polymerase
MPVARPEASDARVIAHVDMDCFYVQVEQRKQPELRGLPSAVVQYNEWQGGGLIAVSYEAR
KCGVKRSMRGDEAKAACPQIQLVQVPVARGKADLNLYRSAGSEVVSILAKSGKCERASID
EVYLDLTDAAESMLADAPPESLELIDEEVLKSHILGMNREDGDDFKESVRNWICREDADR
RDKLLSCGIIIVAELRKQVLKETEFTCSAGIAHNKMLAKLASGMNKPAQQTVVPYAAVQE
LLSSLPIKKMKQLGGKLGTSLQTDLGVDTVGDLLQFSETKLQEHYGVNTGTWLWNIARGI
SGEEVQGRLLPKSHGSGKTFPGPRALKSLSTVQHWLNQLSEELSERLGSDLEQNKRIAST
LTLHASAFRSKDSDSHKKFPSKSCPMRYGVTKIQEDAFNLFQAALREYMGSFGIKPQGNK
LETWRITGLSVSASKIVDIPSGTSSIMRYFQSQPTVPSRSADGCVQGNVAMTASASEGCS
EQRSTETQAAMPEVDTGVTYTLPNFENQDKDIDLVSEKDVVSCPSNEATDVSTQSESNKG
TQTKKIGRKMNNSKEKNRGMPSIVDIFKNYNATPPSKQETQEDSTVSSASKRAKLSSSSH
NSQVNQEVEESRETDWGYKTDEIDQSVFDELPVEIQRELRSFLRTNKQFNTGKSKGDGST
SSIAHYFPPLNR*
>AT5G44740.2 | POLH (Y-FAMILT DNA POLYMERASE H) DNA-directed DNA polymerase
MPVARPEASDARVIAHVDMDCFYVQVEQRKQPELRGLPSAVVQYNEWQGGGLIAVSYEAR
KCGVKRSMRGDEAKAACPQIQLVQVPVARGKADLNLYRSAGSEVVSILAKSGKCERASID
EVYLDLTDAAESMLADAPPESLELIDEEVLKSHILGMNREDGDDFKESVRNWICREDADR
RDKLLSCGIIIVAELRKQVLKETEFTCSAGIAHNKMLAKLASGMNKPAQQTVVPYAAVQE
LLSSLPIKKMKQLGGKLGTSLQTDLGVDTVGDLLQFSETKLQEHYGVNTGTWLWNIARGI
SGEEVQGRLLPKSHGSGKTFPGPRALKSLSTVQHWLNQLSEELSERLGSDLEQNKRIAST
LTLHASAFRSKDSDSHKKFPSKSCPMRYGVTKIQEDAFNLFQAALREYMGSFGIKPQGNK
LETWRITGLSVSASKIVDIPSGTSSIMRYFQSQPTVPSRSADGCVQGNVAMTASASEGCS
EQRSTETQAAMPEVDTGVTYTLPNFENQDKDIDLVSEKDVVSCPSNEATDVSTQSESNKG
TQTKKIGRKMNNSKEKNRGMPSIVDIFKNYNATPPSKQETQEDSTVSSASKRAKLSSSSH
NSQVNQEVEESRETDWGYKTDEIDQSVFDELPVEIQRELRSFLRTNKQFNTGKSKGDGST
SSIAHYFPPLNR*
>AT2G21440.1 | RNA recognition motif (RRM)-containing protein
MGKNKRERKDGEEKSPHAAATVCVSGLPYSITNAQLEEAFSEVGPVRRCFLVTNKGSDEH
RGFAFVKFALQEDVNRAIELKNGSTVGGRRITVKQAAHRPSLQERRTKAAEGISVPDNSQ
GQSDKDTSIPETDEKVSPPEKKLEKPVERKKVEKPIERKQVEKPVERKKAEKPIELKQVE
KPFERKQVEKPVERKQVEKPVERKQVEKPIERKRPTKLHVDLPDKETCSDKQRVARTVIF
GGLANAEMAEVVHSRVKEIGTVCSVRYPLPKEELQQNGLTQDGCRAEASAVLFTSVKSAC
AAVAKLHQTEVKGNLIWARQLGGEGSKAQKWKLIIRNLPFQAKPSDIKVVFSAVGFVWDV
FIPKNFETGLPKGFAFVKFTCKKDAANAIKKFNGHMFGKRPIAVDWAVPKNIYNGAADAT
TASADGDKEGSDGDSENSSVDLEEVDEAVESHPPPGDDTDDDEDGSNKLTESDALDKDVG
TDMNFEDEADVARKVLKNLLASSKGSTATPEGETEESDKSKLKSSSTKPVADSSGVSEPL
KSGKTKVVAPKETQDNDDFERTLFIRNLPFDVTKEEVKQRFTVFGEVESLSLVLHKVTKR
PEGTAFVKFKTADASVAAISAADTASGVGVLLKGRQLNVMRAVGKKAAKDIELKKTEEKN
VDHRNLYLAKEGQILDDTPAAEGVSAEDMDKRRRLHENKMKMLQSPNFHVSRTRLVIYNL
PKSMNPKQLNRLLVDAVTSRATKQKPCIRQIKFLQNEKKGKVDTKNYSRGVAFVEFTEHE
HALVALRVLNNNPETFGPQHRPVIEFAVDNVQKLKIREAKQQQFQQREKHNESDQQQANG
EAQAPDNKYKRKTREGDNTGPRKENAARFKKGPREESKEEAKSNIAVKDNAAEKKRPIRT
QEKPSSNKKGQLMRQKETTEKPDPKISKDLSEPRKRKFGEDRGEENRNGQRKRKKQGQGQ
GGAEVVDKLDLLIEKYRSKFSQSSAKTGPQKQSSGQVRRWFES*
>AT2G34357.1 | binding
MELLCDDIGTSMCLTPSEPDLPVSEDFGEYMRSRLSQSKRPDHEHLCAVIEELSKTLAED
NHRRTPVAYFACTCRSLDSLFSAHAEPPVDVVQPHIVILSLVFPKVSAGVLKRDGLALRL
VLNVLRLKSATPECLISGLKCLVHLLTTVESIMVNEGSDSYNILLNFVTHSDGKVRKLAS
SCLRDVLQKSHGTKAWQSVSGAITEMFQNYLDLAHKSEVGSTEGARGAKQVLYILSTLKE
CLALMSKKHIATLIEGFKVLMILRDPYITRPVIDSLNAVCLNPTSEVPVEALLEVLSLAA
GLFSGHETSADAMTFTARLLKVGMTRSFTLNRDLCVVKLPSVFNGLNDIIASEHEEAIFA
ATDALKSLIFSCIDESLIREGVNEIRNSNLNVRKPSPTVIEKLCATVESLLDYKYHAVWD
MAFQVVSAMFDKLGEHSAYFMRNTLQGLSDMQDLPDEGFPYRKQLHECVGSALGAMGPET
FLSIVRLNLEANDLSEVKVWLFPILKQYTVGGRLSFFTEAIFSMVETMSHKAQKLKLQGL
PVASRSVDSLVYSLWALLPSFCNYPVDTVESFADLGRILCGVLQTQAETHGIICASLNIL
IQQNKEVVEGKEVPTNDASPAMQRATARYDSQHAAANLKVLRLCAPKLLDVLSRIFHECS
KDDGGSLQSAIGNLASIAEKKTVSKLLFKTLQELLEATKTAIAQDESPVSGMDVDNTADK
NSSSNLRARLFDLLVSLLPGLDGQEVDTIFSSLKPAMQDSKGLIQKKAYKVLSVILKSSD
GFVSKNLEELLVLMHNICHVSAKRHKLDCLYFLLAHASRTDDLKERKDIVSSFLPEVILA
LKEVNKKTRNRAYDVLVQIGHAYADEENGGDNEKLHGYFDMVVGCLAGEKPQMISAAVKG
VARLTYEFSDLISSAYNLLPSTFLLLQRKNKEITKANLGLLKVLVAKSPVEGLHANLKSM
VEGLLKWPEGTKNLFKAKVRLLLEMLIKKCGTEAVKSVMPEEHMKLLTNIRKIKERKEKK
YAAGSDISKSQHSKDTSSKVSRWNDTKIFSDVYADSEDSDGDDMDAESHGRSKASSLLKS
KASALRSKKSRNQSHLEVDESDDEPLDLMDQHKTRLALRSSELRKRKADSDEEAEFDVEG
RLVIREGERSKRKELSDADSDAKSSKGSRFSGNSSKKNQKRMKTSESGYAYTGKEYASKK
ASGDLKKKDKLEPYAYWPLDRKMMSRRPEQRAVAVRGMSSVVKMAKKMEGKSAAEALATT
KFKKFKRSGQKKSAGKKKNK*
>AT2G34970.1 | eIF4-gamma/eIF5/eIF2-epsilon domain-containing protein
MGAQKKGGAAARVSEDAEVQSRHRLQAILLADSFATKFRPVTLERPKVLLPIVNVPMIDY
TLAWLESAGIEEVFVFCCAHSMQVIEYLEKSEWYSHPNLLVRTIESHKSISAGDALRYMY
EQQTETSQIQGDFVLVSGDTVSNMPLADLIQEHRERKKKDEKAIMTMVIKQSKSSPLTHQ
SRLGTDQLFIAVDPLTKQLLHYEEDKIDHPSGSVCLEKSLLDTNPSVLVCNDMQDCYIDI
CSPEVLSLFEDNFDYQHLRRHFVKGVLVDDIMGYKIFTHEIHSSYAGRIDNFRSYDTVSK
DIIQRWTYPYVPDINFSGNRPLKLGRQGIYKASDVVQSRSADVGASTVIGYGTKIGHGDK
IMNSVIGNGCSIGSNVVIEGSYIWNNVTIEDGCEIRNAIVCDGVKIRAGAVLQPGVVLSF
NVVVGRDFVVPAYSKVSLLQQPTTEDSDEELEYADSSSGTADHLSGLNLQMESKASELGP
DGAGYIWEVCEGAHDEEWKHSVAPIPKDKLSEITQAIDDDDTDDESVVPTSGELKSDADS
INTDVNDPNDDYYYFEKEVEGTVLRAVEENIKVDLVTMEINGLRLSFNMESADCAGATFF
SMIKLALDTPHNSGSELYKNAASIITKWKDLLGFYAKKIDEQIEVIMKFEEMCQESHKEL
GPLFTQILHLLYDKDVLQEDAILRWEEEKAGADEADKVYLKQCDTFIQWLKEASEEEDED
DEDEEEEEDN*
>AT2G47420.1 | dimethyladenosine transferase putative
MAGGKIRKEKPKASNRAPSNHYQGGISFHKSKGQHILKNPLLVDSIVQKAGIKSTDVILE
IGPGTGNLTKKLLEAGKEVIAVELDSRMVLELQRRFQGTPFSNRLKVIQGDVLKTELPRF
DICVANIPYQISSPLTFKLLFHPTSFRCAVIMYQREFAMRLVAQPGDNLYCRLSVNTQLY
ARVSHLLKVGKNNFRPPPKVDSSVVRIEPRRPGPQVNKKEWDGFLRVCFIRKNKTLGSIF
KQKSVLSMLEKNFKTLQAVLASLQNNGEPALNTTSMDLGDQSMGMEDDDNEMDDDDMEMD
EGEGDGGETSEFKEKVMNVLKEGGFEEKRSSKLSQQEFLYLLSLFNKSGIHFT*
>AT3G01160.1 | unknown protein
MGSKNKKQRKGESIEEAKGSSGVAEEGNEMIKDPRFSSAHTDPKFRRMRRRDSKVAIDSR
FQPMFNDKRFATGSAPVDKRGKRRTGGTGKDSLREFYRIEDEGKQKTEEESGDESGSETE
INDLKSEKSSHVESEEESESELKVASLDDESDEKADSEELSSQEEEEEEDDTDEDDEAMY
EDEGPEIPEENIPLIQEETHRLAIVNMDWRHVSAKDLYVVLNSFLPKDGRILSVAVYPSE
FGLERMKEEEIHGPVIDGDKKNDASDDEDEEEEEDEDVINQKLRAYEISRLKYYFAVAEC
DSSATADYLYKSCDGIEFERSSNKLDLRFIPDSMEFKHPPRDIASEAPAGYEGLDFQSRA
LQMSKVNLSWDEDEPHRIKTLNQKFNPEQLANLEMKEFLASDESDSDEEDDLGNEVINQS
KKKDKKKDKYRALIEAEDVDSDKDLEEENDQDMEVTFNTGLEDLSKEILKKKDNQSESVW
ETYLRQRREKKRARKNKQKDDDSSPDDDDDYNIDRKAVKDDGDDDFFMEEPPLKKKKKEG
KTKKEEVAAEEKSRAELELLLADENAGDGNGLKGYNIKRKAKKGKTDISEDKIPAAELDD
PRFSALFSSPYYALDPTDPQFKRSATYARQLALKQKEDPKGHEDVKAPKEKQELNSDGNL
GSKKERHELTSTVKSLKMKMMNKDSEKKKAGNPASSSTLAQRIKKKAKDLSKK*
>AT3G09720.1 | DEAD/DEAH box helicase putative
MEKSSYFLFGGTNFNKKKFAPDFAKFKNSTEDDDSNKKVNFFVEEEEDTEQPEAEKVIVS
SKKRKRRSSNSVPVEGFDVFKSSKKARAKGKAEEQITKNEIVENPKKELNRQMERDALSR
KQYSIHVSGNNIPPPLKSFAELSSRYGCEGYILRNLAELGFKEPTPIQRQAIPILLSGRE
CFACAPTGSGKTFAFICPMLIKLKRPSTDGIRAVILSPARELAAQTAREGKKLIKGSNFH
IRLMTKPLVKTADFSKLWCDVLISTPMRLKRAIKAKKIDLSKVEYLVLDESDKLFEQSLL
KQIDCVVKACSNPSIIRSLFSATLPDSVEELARSIMHDAVRVIIGRKNTASETVKQKLVF
AGSEEGKLLALRQSFAESLNPPVLIFVQSKERAKELYDELKCENIRAGVIHSDLPPGERE
NAVDQFRAGEKWVLIATDVIARGMDFKGINCVINYDFPDSASAYIHRIGRSGRAGRSGEA
ITFYTEQDVPFLRNIANTMMSSGCEVPSWIMSLKKKKWRKHRPRRDSISTKPKADKNDTD
E*
>AT3G10530.1 | transducin family protein / WD-40 repeat family protein
MEISSEDNNLMEKVLPPVEQESDVELETKVKKYLRGEGANLETLKDKKLKTQLASREKLY
GKSAKAAAKIEKWLLPAEAGYLETEGLEKTWRVKQTDIANEVDILSSRNQYDIVLPDFGP
YKLDFTASGRHMLAGGRKGHLALLDMMNMSLIKEIQVRETVRDVAFLHNDQFFAAAQKKY
AYIYGRDGTELHCLKERGPVARLRFLKNHFLLASVNMSGQLHYQDVTHGGMVASIRTGKG
RTDVMEVNPYNSVVGLGHSGGTVTMWKPTSQAPLVQMQCHPGPVSSVAFHPNGHLMATSG
KERKIKIWDLRKFEEVQTIHSFHAKTLSFSQKGLLAAGTGSFVQILGDSSGGSSHNYTRY
MNHSMVKGYQIEKVMFRPYEDVIGIGHSMGWSSILIPGSGEPNFDSWVANPFETSKQRRE
KEVHSLLDKLPPETIMLDPSKIGAMRPSRRKEKPSRGEIEAEKEVAIEAAKSTELKNKTK
GRNKPSKRTKKKKEMVENAKRTFPEQEHNTAIKKRRIVEDAAAELPTSLKRFARKN*
>AT4G04940.1 | transducin family protein / WD-40 repeat family protein
MGIFEPFRAIGYITSTVPFSVQRLGTETFVTVSVGKAFQIYNCAKLNLVIISPQLPKKIR
ALASYRDYTFVAFGNEIAVFRRAHQVATWSKHVAKVDLLLVFGEHVLSLDVEGNMFIWAF
KGIEEHLAPIGNLQLTGKFTPTSIVHPDTYLNKVLVGSQEGPLQLWNINTKKMLYQFKGW
GSSVTSCVSSPALDVVAIGCADGKIHVHNIKLDEEIVTFEHASRGAVTALSFSTDGRPLL
ASGGSFGVISIWNLNKKRLQSVIRDAHDSSIISLNFLANEPVLMSASADNSLKMWIFDTN
DGDPRLLRFRSGHSAPPLCIRFYSNGRHILSAGQDRAFRLFSVIQEQQSRELSQRHISRR
AKKLRLKEEELKLKPVVSFDCAEIRERDWCNVVTCHMDTAEAYVWRLQNFVLGEHILKPC
PENPTPIKACAISACGNFAVVGTAGGWIERFNLQSGISRGSYFDMSEKRRYAHDGEVIGV
ACDSTNTLMISAGYHGDLKVWDFKKRELKSQWDVGCSLVKIVYHRVNGLLATVADDFVIR
LYDVVTLKMVREFRGHTDRITDLCFSEDGKWVISSSMDGSLRIWDVILAKQIDGVHVDVP
ITALSLSPNMDVLATAHSDQNGVYLWVNQSMFSGLPSVESYASGKDVVNVKLPSVSALTS
SEADDDMDRQVLENSEALQASSFSISQKQIPELVTLSLLPKSQWQSLINLDIIKARNKPI
EPPKKPEKAPFFLPSIPSLSGDILFKANDSEADGENEENNKKDQNSMKNFDALESPFSKH
LKSSWDSKHFLDFTNYMKSLSPSALDMELRMLEIIDEDVEEELIKRPEFILIGQLLDYFI
NEVSCKNDFEFMQAVVKLFLKIHGETIRCHPSLQEKAKKLLETQSLVWQKMEKLFQSTRC
IVTFLSNSQF*
>AT4G07410.1 | transducin family protein / WD-40 repeat family protein
MLEYRCSSVDWKPSPVVALANSSDDSQVAAAREDGSLEIWLVSPGAVGWHCQLTIHGDPN
SRISSLAWCCSPSIGLPSGRLFSSSIDGSISEWDLFDLKQKIVLESIGISIWQMALAPIS
GFSSDVEGIKNGYLSEKSNDEEEIGSEEDGSDSDEFHEKSEEEIDRILAAACDDGCVRLY
RISNLEKLTYYRSLPRVSGRALSVTWSPDAKRIFSGSSDGLIRCWDATSCHEVYRITAGL
GGLGSSSEICVWSLLSLRCSVLVSGDSTGTVQFWDSEHGTLLEAHSNHKGDVNTLAAAPS
HNRVFSAGADGQVILYKLSGSTNGSQDLKPSSSQKWDYIGYVKAHTHDIRALTVAVPISR
EDPFPDDILPDKASRKHRKKGKPVDFTYHKWAHLGVPMLISAGDDAKLFAYSIQEFTKFS
PHDICPAPQRIPMQMVHNSMFNKTSLLLVQGISTLDILRLNISSDSSGRASTKSLVRVKS
RDARKIICSAISNTGSHFAYSDQIGPSLFELKKNEFTKCPWSVSRRRLPELPFAHSMIFS
SDCSRLIIAGHDRRIYTIDISSLELVYAFTPSREEHEGEAPTPKEPPITKLFTSSDGQWL
AAINCFGDIYVFNLETQRQHWFISRLDGASVTAAGFHPWNNNALVISTSSNQVFAFDVEA
RQLGKWSLLNTYVLPKRYQEFPGEVLGLSFSPSPNSSSVIVYSSRAKCLIDFGKPVEEDE
EYDLPNGNLSKTLEGKLVNLGLKKGKGTNRKRRLDEYQLEGKSNERKNFEILPSNHPVLF
VGHLSKNSILVIEKPWMDVVKSLDNQPVDRHIFGT*
>AT4G07410.1 | transducin family protein / WD-40 repeat family protein
MLEYRCSSVDWKPSPVVALANSSDDSQVAAAREDGSLEIWLVSPGAVGWHCQLTIHGDPN
SRISSLAWCCSPSIGLPSGRLFSSSIDGSISEWDLFDLKQKIVLESIGISIWQMALAPIS
GFSSDVEGIKNGYLSEKSNDEEEIGSEEDGSDSDEFHEKSEEEIDRILAAACDDGCVRLY
RISNLEKLTYYRSLPRVSGRALSVTWSPDAKRIFSGSSDGLIRCWDATSCHEVYRITAGL
GGLGSSSEICVWSLLSLRCSVLVSGDSTGTVQFWDSEHGTLLEAHSNHKGDVNTLAAAPS
HNRVFSAGADGQVILYKLSGSTNGSQDLKPSSSQKWDYIGYVKAHTHDIRALTVAVPISR
EDPFPDDILPDKASRKHRKKGKPVDFTYHKWAHLGVPMLISAGDDAKLFAYSIQEFTKFS
PHDICPAPQRIPMQMVHNSMFNKTSLLLVQGISTLDILRLNISSDSSGRASTKSLVRVKS
RDARKIICSAISNTGSHFAYSDQIGPSLFELKKNEFTKCPWSVSRRRLPELPFAHSMIFS
SDCSRLIIAGHDRRIYTIDISSLELVYAFTPSREEHEGEAPTPKEPPITKLFTSSDGQWL
AAINCFGDIYVFNLETQRQHWFISRLDGASVTAAGFHPWNNNALVISTSSNQVFAFDVEA
RQLGKWSLLNTYVLPKRYQEFPGEVLGLSFSPSPNSSSVIVYSSRAKCLIDFGKPVEEDE
EYDLPNGNLSKTLEGKLVNLGLKKGKGTNRKRRLDEYQLEGKSNERKNFEILPSNHPVLF
VGHLSKNSILVIEKPWMDVVKSLDNQPVDRHIFGT*
>AT4G07410.2 | transducin family protein / WD-40 repeat family protein
MALAPISGFSSDVEGIKNGYLSEKSNDEEEIGSEEDGSDSDEFHEKSEEEIDRILAAACD
DGCVRLYRISNLEKLTYYRSLPRVSGRALSVTWSPDAKRIFSGSSDGLIRCWDATSCHEV
YRITAGLGGLGSSSEICVWSLLSLRCSVLVSGDSTGTVQFWDSEHGTLLEAHSNHKGDVN
TLAAAPSHNRVFSAGADGQVILYKLSGSTNGSQDLKPSSSQKWDYIGYVKAHTHDIRALT
VAVPISREDPFPDDILPDKASRKHRKKGKPVDFTYHKWAHLGVPMLISAGDDAKLFAYSI
QEFTKFSPHDICPAPQRIPMQMVHNSMFNKTSLLLVQGISTLDILRLNISSDSSGRASTK
SLVRVKSRDARKIICSAISNTGSHFAYSDQIGPSLFELKKNEFTKCPWSVSRRRLPELPF
AHSMIFSSDCSRLIIAGHDRRIYTIDISSLELVYAFTPSREEHEGEAPTPKEPPITKLFT
SSDGQWLAAINCFGDIYVFNLETQRQHWFISRLDGASVTAAGFHPWNNNALVISTSSNQV
FAFDVEARQLGKWSLLNTYVLPKRYQEFPGEVLGLSFSPSPNSSSVIVYSSRAKCLIDFG
KPVEEDEEYDLPNGNLSKTLEGKLVNLGLKKGKGTNRKRRLDEYQLEGKSNERKNFEILP
SNHPVLFVGHLSKNSILVIEKPWMDVVKSLDNQPVDRHIFGT*
>AT4G07410.2 | transducin family protein / WD-40 repeat family protein
MALAPISGFSSDVEGIKNGYLSEKSNDEEEIGSEEDGSDSDEFHEKSEEEIDRILAAACD
DGCVRLYRISNLEKLTYYRSLPRVSGRALSVTWSPDAKRIFSGSSDGLIRCWDATSCHEV
YRITAGLGGLGSSSEICVWSLLSLRCSVLVSGDSTGTVQFWDSEHGTLLEAHSNHKGDVN
TLAAAPSHNRVFSAGADGQVILYKLSGSTNGSQDLKPSSSQKWDYIGYVKAHTHDIRALT
VAVPISREDPFPDDILPDKASRKHRKKGKPVDFTYHKWAHLGVPMLISAGDDAKLFAYSI
QEFTKFSPHDICPAPQRIPMQMVHNSMFNKTSLLLVQGISTLDILRLNISSDSSGRASTK
SLVRVKSRDARKIICSAISNTGSHFAYSDQIGPSLFELKKNEFTKCPWSVSRRRLPELPF
AHSMIFSSDCSRLIIAGHDRRIYTIDISSLELVYAFTPSREEHEGEAPTPKEPPITKLFT
SSDGQWLAAINCFGDIYVFNLETQRQHWFISRLDGASVTAAGFHPWNNNALVISTSSNQV
FAFDVEARQLGKWSLLNTYVLPKRYQEFPGEVLGLSFSPSPNSSSVIVYSSRAKCLIDFG
KPVEEDEEYDLPNGNLSKTLEGKLVNLGLKKGKGTNRKRRLDEYQLEGKSNERKNFEILP
SNHPVLFVGHLSKNSILVIEKPWMDVVKSLDNQPVDRHIFGT*
>AT4G28200.1 | FUNCTIONS IN molecular_function unknown INVOLVED IN RNA processing LOCATED IN intracellular EXPRESSED IN 22 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s RNA-processing protein HAT helix (InterProIPR003107) U3 small nucleolar RNA-associated protein 6 (InterProIPR013949) Has 352 Blast hits to 342 proteins in 144 species Archae - 0 Bacteria - 0 Metazoa - 111 Fungi - 130 Plants - 24 Viruses - 0 Other Eukaryotes - 87 (source NCBI BLink)
MADVVQYRLERMVDELDDLERREIFTRAEIAEIVKQRRKFEYRLKRPSPLKEDFIAYIDY
EVKLDELRQLRRKSVARVTKKRKKKSVSDFAGVARIVEIYRLATMRYKGDINLWFRYLEF
CKQKRHGRMKKALAQAIRFHPKVAGVWIYAASWEFDRNLNVTAARALMLNGLRVCSNSED
LWVEYLRMELTFLNKLKARKVALGEDKGSLVRDTKTVEDEQWKDENKELFMSLDEKEGNE
KEENDEDSIVEDVEDVTEKVDFLKEKGSNVLQTIYSGAVEAIPSSFDLRKRFLEILEATD
LAHSDEMRNTILSDLKRDFCNEPEYWNWLARHEMSGCISNEAGLEFANPQMQKAIQVFEE
GLQTVTSSSMFEIYINFLMEAIVQSNGDENEISSLSNPIISHIINVYQKADETGCLTEEL
ADEYVSLYLKLEKTHEAQKLAEKLCSEKFAGSAKLWLSRVSIEIRSLSENSSPSKADFQT
VFELLSNALRKVPISESESLWLMAFNFFAHQRTYLDKLVEMSILSATKSHGSDHVFSLAS
TVVKFVLETKGAHSARKIYKRFLALPGPSLVLYKGCIEIETNLISVGDKDGLSNARKLYD
SAVASYGQDVELWKNYYSLETKLGTSETANGVYWRARKTLNESADFIV*
>AT4G28450.1 | nucleotide binding / protein binding
MKIKTLSRSVDEYTRERSQDLQRVFHNFDPSLRPMEKAVEYQRALTAAKLEKIFARPFVG
AMDGHRDGVSCMAKNPNYLKGIFSASMDGDIRLWDISSRRTVCQFPGHQGAVRGLTASTD
GNVLVSCGTDCTVRLWNVPRPSLEDSSISSENFIEPSATYVWKNAFWAVDHQFEGDLFAT
AGAQLDIWNHNRSQPVQSFQWGTDSVISVRFNPGEPNLLATSASDRSITIYDLRLSSAAR
KIIMMTKTNSIAWNPMEPMNLTAANEDGSCYSFDGRKLDEAKCVHKDHVSAVMDIDFSPT
GREFVTGSYDRSVRIFPYNGGHSREIYHTKRMQRVFCVKYSCDATYVISGSDDTNLRLWK
AKASEQLGVILPREQKKHEYNEAVKNRYKHLSEVKRIVRHRHLPKPIYKAMGIIRTVNDS
KRRKEARRKAHSAPGTVVTAPLRKRKIIKEVE*
>AT5G14050.1 | transducin family protein / WD-40 repeat family protein
MSLSQNAPKSKGIKREELKKQYEDVEDEEEIGSDDDLTRGKRRKTEKEKQKLEESELVEM
KKLENLIFGSLYSPVTFGKEEEEDGSALFHVDRSAVRQIPDYEDDGDDDEELSDEENGQV
VAIRKGEAAWEDEEEKQINVDIASVNRLRKLRKEENEGLISGSEYIARLRAHHAKLNPGT
DWARPDSQIVDGESSDDDDTQDGGVDDILRTNEDLVVKSRGNKLCAGRLEYSKLVDANAA
DPSNGPINSVHFHQNAQLLLTAGLDRRLRFFQIDGKRNTKIQSIFLEDCPIRKAAFLPNG
SQVIVSGRRKFFYSFDLEKAKFDKIGPLVGREEKSLEYFEVSQDSNTIAFVGNEGYILLV
STKTKELIGTLKMNGSVRSLAFSEDGKHLLSSGGDGQVYVWDLRTMKCLYKGVDEGSTCG
TSLCSSLNGALFASGTDRGIVNIYKKSEFVGGKRKPIKTVDNLTSKIDFMKFNHDAQILA
IVSTMNKNSVKLVHVPSLTVFSNWPPPNSTMHYPRCLDFSPGSGFMAMGNAAGKVLLYKL
HHYQNA*
>AT5G16750.1 | TOZ (TORMOZEMBRYO DEFECTIVE) nucleotide binding
MAPHSLKKNYRCSRSLKQFYGGGPFIVSSDGSFIACACGDVINIVDSTDSSVKSTIEGES
DTLTALALSPDDKLLFSAGHSRQIRVWDLETLKCIRSWKGHEGPVMGMACHASGGLLATA
GADRKVLVWDVDGGFCTHYFRGHKGVVSSILFHPDSNKNILISGSDDATVRVWDLNAKNT
EKKCLAIMEKHFSAVTSIALSEDGLTLFSAGRDKVVNLWDLHDYSCKATVATYEVLEAVT
TVSSGTPFASFVASLDQKKSKKKESDSQATYFITVGERGVVRIWKSEGSICLYEQKSSDI
TVSSDDEESKRGFTAAAMLPSDHGLLCVTADQQFFFYSVVENVEETELVLSKRLVGYNEE
IADMKFLGDEEQFLAVATNLEEVRVYDVATMSCSYVLAGHKEVVLSLDTCVSSSGNVLIV
TGSKDKTVRLWNATSKSCIGVGTGHNGDILAVAFAKKSFSFFVSGSGDRTLKVWSLDGIS
EDSEEPINLKTRSVVAAHDKDINSVAVARNDSLVCTGSEDRTASIWRLPDLVHVVTLKGH
KRRIFSVEFSTVDQCVMTASGDKTVKIWAISDGSCLKTFEGHTSSVLRASFITDGTQFVS
CGADGLLKLWNVNTSECIATYDQHEDKVWALAVGKKTEMIATGGGDAVINLWHDSTASDK
EDDFRKEEEAILRGQELENAVLDAEYTKAIRLAFELCRPHKVFELFSGLCRKRDSDEQIV
KALQGLEKEEFRLLFEYVREWNTKPKLCHIAQFVLYKTFNILPPTEIVQVKGIGELLEGL
IPYSQRHFSRIDRFVRSSFLLDYTLGEMSVIDPETVETEYPKDEKKKEKDVIAAMEQDTD
ELKQETPSRKRKSQKSKGKSNKKRLIAEAQGSVIAV*
>AT5G30495.2 | FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown CONTAINS InterPro DOMAIN/s Fcf2 pre-rRNA processing (InterProIPR014810) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT1G547701) Has 218 Blast hits to 218 proteins in 114 species Archae - 0 Bacteria - 0 Metazoa - 88 Fungi - 76 Plants - 26 Viruses - 0 Other Eukaryotes - 28 (source NCBI BLink)
MAETKPLIGLTWEPKLPGLSLDTKTCSTSSKRVESHESSSLWMSKSELVDGLCLPPNDPK
KINKMIRKQIKDTTGSNWFDMPAPTMTPELKRDLQLLKLRTVMDPAVHYKKSVSRSKLAE
KYFQIGTVIEPAEEFYGRLTKKNRKATLADELVSDPKVSQYRKRKVKEIEEKSRAVTNKK
WKKKGNQTTNKKQRRN*
>AT5G30495.2 | FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown CONTAINS InterPro DOMAIN/s Fcf2 pre-rRNA processing (InterProIPR014810) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT1G547701) Has 218 Blast hits to 218 proteins in 114 species Archae - 0 Bacteria - 0 Metazoa - 88 Fungi - 76 Plants - 26 Viruses - 0 Other Eukaryotes - 28 (source NCBI BLink)
MAETKPLIGLTWEPKLPGLSLDTKTCSTSSKRVESHESSSLWMSKSELVDGLCLPPNDPK
KINKMIRKQIKDTTGSNWFDMPAPTMTPELKRDLQLLKLRTVMDPAVHYKKSVSRSKLAE
KYFQIGTVIEPAEEFYGRLTKKNRKATLADELVSDPKVSQYRKRKVKEIEEKSRAVTNKK
WKKKGNQTTNKKQRRN*
>AT5G30495.1 | FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown CONTAINS InterPro DOMAIN/s Fcf2 pre-rRNA processing (InterProIPR014810) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT1G547701) Has 218 Blast hits to 218 proteins in 114 species Archae - 0 Bacteria - 0 Metazoa - 88 Fungi - 76 Plants - 26 Viruses - 0 Other Eukaryotes - 28 (source NCBI BLink)
MAETKPLIGLTWEPKLPGLSLDTKTCSTSSKRVESHESSSLWMSKSELVDGLCLPPNDPK
KINKMIRKQIKDTTGSNWFDMPAPTMTPELKRDLQLLKLRTVMDPAVHYKKSVSRSKLAE
KYFQIGTVIEPAEEFYGRLTKKNRKATLADELVSDPKVSQYRKRKVKEIEEKSRAVTNKK
WKKKGNQTTNKKQRRN*
>AT5G30495.1 | FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown CONTAINS InterPro DOMAIN/s Fcf2 pre-rRNA processing (InterProIPR014810) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT1G547701) Has 218 Blast hits to 218 proteins in 114 species Archae - 0 Bacteria - 0 Metazoa - 88 Fungi - 76 Plants - 26 Viruses - 0 Other Eukaryotes - 28 (source NCBI BLink)
MAETKPLIGLTWEPKLPGLSLDTKTCSTSSKRVESHESSSLWMSKSELVDGLCLPPNDPK
KINKMIRKQIKDTTGSNWFDMPAPTMTPELKRDLQLLKLRTVMDPAVHYKKSVSRSKLAE
KYFQIGTVIEPAEEFYGRLTKKNRKATLADELVSDPKVSQYRKRKVKEIEEKSRAVTNKK
WKKKGNQTTNKKQRRN*
>AT5G61330.1 | rRNA processing protein-related
MAGGSKRSKRARLDSESEDISDQENLKAESDNEDDQLPDGIEDDEVDSMEDDEGESEEDD
EGDTEEDDEGDSEEDDEGENKEDEDGESEDFEDGNDKESESGDEGNDDNKDAQMEELEKE
VKELRSQEQDILKNLKRDKGEDAVKGQAVKNQKALWDKILEFRFLLQKAFDRSNRLPQEP
VKSLFCSEDEDVSTAYTDLVTSSKKTLDSLLELQEALFEKNPSVDQQVNATASEESNKSD
AEDSDEWQRISDLQKRMSVFRNKAVDKWQRKTQVTTGAAAIKGKLHAFNQNVSEQVASYM
RDPSRMIKQMQQSRSTVAVFGTVPQEAMEPNPEEKQEEGDPELVEDAEFYRQLLKEFLET
IDPASSEAAFYEMKKFQTKKRKVVDRRASKSRKIRYNVHEKIVNFMAPRPAKIPPNTADL
LKNLFGLKTRNVQSEA*
>AT5G65900.1 | DEAD/DEAH box helicase putative
MANLDMEQHSSENEEIKKKKHKKRARDEAKKLKQPAMEEEPDHEDGDAKENNALIDEEPK
KKKKKKNKKRGDTDDGEDEAVAEEEPKKKKKKNKKLQQRGDTNDEEDEVIAEEEEPKKKK
KKQRKDTEAKSEEEEVEDKEEEKKLEETSIMTNKTFESLSLSDNTYKSIKEMGFARMTQI
QAKAIPPLMMGEDVLGAARTGSGKTLAFLIPAVELLYRVKFTPRNGTGVLVICPTRELAI
QSYGVAKELLKYHSQTVGKVIGGEKRKTEAEILAKGVNLLVATPGRLLDHLENTNGFIFK
NLKFLVMDEADRILEQNFEEDLKKILNLLPKTRQTSLFSATQSAKVEDLARVSLTSPVYI
DVDEGRKEVTNEGLEQGYCVVPSAMRLLFLLTFLKRFQGKKKIMVFFSTCKSTKFHAELF
RYIKFDCLEIRGGIDQNKRTPTFLQFIKAETGILLCTNVAARGLDFPHVDWIVQYDPPDN
PTDYIHRVGRTARGEGAKGKALLVLTPQELKFIQYLKAAKIPVEEHEFEEKKLLDVKPFV
ENLISENYALKESAKEAYKTYISGYDSHSMKDVFNVHQLNLTEVATSFGFSDPPKVALKI
DRGGYRSKREPVNKFKRGRGGGRPGGKSKFERY*
>AT4G12600.1 | ribosomal protein L7Ae/L30e/S12e/Gadd45 family protein
MTVEGVNPKAYPLADSQLSITILDLVQQATNYKQLKKGANEATKTLNRGISEFIVMAADT
EPLEILLHLPLLAEDKNVPYVFVPSKQALGRACDVTRPVIACSVTSNEASQLKSQIQHLK
DAIEKLLI*