>AT1G16610.1 | SR45 RNA binding / protein binding
MAKPSRGRRSPSVSGSSSRSSSRSRSGSSPSRSISRSRSRSRSLSSSSSPSRSVSSGSRS
PPRRGKSPAGPARRGRSPPPPPSKGASSPSKKAVQESLVLHVDSLSRNVNEAHLKEIFGN
FGEVIHVEIAMDRAVNLPRGHGYVEFKARADAEKAQLYMDGAQIDGKVVKATFTLPPRQK
VSSPPKPVSAAPKRDAPKSDNAAADAEKDGGPRRPRETSPQRKTGLSPRRRSPLPRRGLS
PRRRSPDSPHRRRPGSPIRRRGDTPPRRRPASPSRGRSPSSPPPRRYRSPPRGSPRRIRG
SPVRRRSPLPLRRRSPPPRRLRSPPRRSPIRRRSRSPIRRPGRSRSSSISPRKGRGPAGR
RGRSSSYSSSPSPRRIPRKISRSRSPKRPLRGKRSSSNSSSSSSPPPPPPPRKT*
>AT1G16610.2 | SR45 RNA binding / protein binding
MAKPSRGRRSPSVSGSSSRSSSRSRSGSSPSRSISRSRSRSRSLSSSSSPSRSVSSGSRS
PPRRGKSPAGPARRGRSPPPPPSKGASSPSKKAVQESLVLHVDSLSRNVNEAHLKEIFGN
FGEVIHVEIAMDRAVNLPRGHGYVEFKARADAEKAQLYMDGAQIDGKVVKATFTLPPRQK
VSSPPKPVSAAPKRDAPKSDNAAADAEKDGGPRRPRERLSPRRRSPLPRRGLSPRRRSPD
SPHRRRPGSPIRRRGDTPPRRRPASPSRGRSPSSPPPRRYRSPPRGSPRRIRGSPVRRRS
PLPLRRRSPPPRRLRSPPRRSPIRRRSRSPIRRPGRSRSSSISPRKGRGPAGRRGRSSSY
SSSPSPRRIPRKISRSRSPKRPLRGKRSSSNSSSSSSPPPPPPPRKT*
>AT1G55310.1 | SR33 RNA binding / protein binding
MRGRSYTPSPPRGYGRRGRSPSPRGRYGGRSRDLPTSLLVRNLRHDCRQEDLRKSFEQFG
PVKDIYLPRDYYTGDPRGFGFVQFMDPADAADAKHHMDGYLLLGRELTVVFAEENRKKPT
EMRARERGGGRFRDRRRTPPRYYSRSRSPPPRRGRSRSRSGDYYSPPPRRHHPRSISPRE
ERYDGRRSYSRSPASDGSRGRSLTPVRGKSRSLSPSPRRSISRSPRRSRSPSPKRNRSVS
PRRSISRSPRRSRSPRRSRRSYTPEPARSRSQSPHGGQYDEDRSPSQ*
>AT1G55310.2 | SR33 RNA binding / protein binding
MRGRSYTPSPPRGYGRRGRSPSPRGRYGGRSRDLPTSLLVRNLRHDCRQEDLRKSFEQFG
PVKDIYLPRDYYTGDPRGFGFVQFMDPADAADAKHHMDGYLLLGRELTVVFAEENRKKPT
EMRARERGGGRFRDRRRTPPRYYSRSRSPPPRRGRSRSRSGDYYSPPPRRHHPRSISPRE
ERYDGRRSYSRSPASDGSRGRSLTPVRGKSRSLSPTQEEA*
>AT3G63400.1 | peptidyl-prolyl cis-trans isomerase cyclophilin-type family protein
MTKKKNPNVFLDVSIGGDPVQRIVIELFADVVPKTAENFRALCTGEAGVGKSTGKPLHFK
GSSFHRVIKGFMAQGGDFSNGNGTGGESIYGGKFSDENFRLDHDGAGVLSMANCGPNTNG
SQFFILFKRQPHLDGKHVVFGKVVEGMAVIKKMELVGTSDGKPTSPVKIIDCGETSQIRA
HDAAEREKGKSKKSNKNFSPGDVSDREAKETRKKESNEKRIKRKRRYSSSDSYSSSSDSD
SDSESEAYSSSSYESSSSSDGKHRKRKSTTRHKGRRGERKSKGRSGKKKARPDRKPSTNS
SSDTESSSSSDDEKVGHKAIKSVKVDNADQHANLDDSVKSRSRSPIRRRNQNSRSKSPSR
SPVRVLGNGNRSPSRSPVRDLGNGSRSPREKPTEETVGKSFRSPSPSGVPKRIRKGRGFT
ERYSFARKYHTPSPERSPPRHWPDRRNFQDRNRDRYPSNRSYSERSPRGRFRSPPRRRSP
PRYNRRRRSTSRSPDGYRRRLRDGSRSQSPRHRSRSQSPRKRQPISQDLKSRLGPQRSPI
RGGRTSPAESLSPSHSPSPPGKRGLVSYAD*
>AT3G63400.2 | peptidyl-prolyl cis-trans isomerase cyclophilin-type family protein
MTKKKNPNVFLDVSIGGDPVQRIVIELFADVVPKTAENFRALCTGEAGVGKSTGKPLHFK
GSSFHRVIKGFMAQGGDFSNGNGTGGESIYGGKFSDENFRLDHDGAGVLSMANCGPNTNG
SQFFILFKRQPHLDGKHVVFGKVVEGMAVIKKMELVGTSDGKPTSPVKIIDCGETSQIRA
HDAAEREKGKSKKSNKNFSPGDVSDREAKETRKKESNEKRIKRKRRYSSSDSYSSSSDSD
SDSESEAYSSSSYESSSSSDGKHRKRKSTTRHKGRRGERKSKGRSGKKKARPDRKPSTNS
SSDTESSSSSDDGYRRRLRDGSRSQSPRHRSRSQSPRKRQPISQDLKSRLGPQRSPIRGG
RTSPAESLSPSHSPSPPGKRGLVSYAD*
>AT3G50670.1 | U1-70K (U1 SMALL NUCLEAR RIBONUCLEOPROTEIN-70K) RNA binding / nucleic acid binding / nucleotide binding
MGDSGDPFLRNPNAAVQARAKVQNRANVLQLKLMGQSHPTGLTNNLLKLFEPRPPLEYKP
PPEKRKCPPYTGMAQFVSNFAEPGDPEYAPPKPEVELPSQKRERIHKLRLEKGVEKAAED
LKKYDPNNDPNATGDPYKTLFVSRLNYESSESKIKREFESYGPIKRVHLVTDQLTNKPKG
YAFIEYMHTRDMKAAYKQADGQKIDGRRVLVDVERGRTVPNWRPRRLGGGLGTSRVGGGE
EIVGEQQPQGRTSQSEEPSRPREEREKSREKGKERERSRELSHEQPRERSRDRPREDKHH
RDRDQGGRDRDRDSRRDRDRTRDRGDRDRRDRDRGRDRTSRDHDRDRSRKKERDYEGGEY
EHEGGGRSRERDAEYRGEPEETRGYYEDDQGDTDRYSHRYDKMEEDDFRYEREYKRSKRS
ESREYVR*
>AT3G50670.2 | U1-70K (U1 SMALL NUCLEAR RIBONUCLEOPROTEIN-70K) RNA binding / nucleic acid binding / nucleotide binding
MGDSGDPFLRNPNAAVQARAKVQNRANVLQLKLMGQSHPTGLTNNLLKLFEPRPPLEYKP
PPEKRKCPPYTGMAQFVSNFAEPGDPEYAPPKPEVELPSQKRERIHKLRLEKGVEKAAED
LKKYDPNNDPNATGDPYKTLFVSRLNYESSESKIKREFESYGPIKRVGYSEHSLAGSVRI
CVMASLSRALCSICFILSTKVFQG*
>AT2G43370.1 | U1 small nuclear ribonucleoprotein 70 kDa putative
MSGGGNNVVNKVFYATSYHPIQAGSIDGTDVAPHDNGVRRALLCYNAGLYDPSGDSKAVG
DPYCTLFVGRLSHHTTEDTLREVMSKYGRIKNLRLVRHIVTGASRGYGFVEYETEKEMLR
AYEDAHHSLIDGREIIVDYNRQQLMPGWIPRRLGGGLGGRKESGQLRFGGRDRPFRAPLR
PIPHEDLKKLGIQLPPEGRYMSRTQIPSPPRRKGSVSDREEEYYREKSSVEREEEFKERS
SLRSYHSHRSSAHTHSSHRRRSKDREECSREESRSDRKERARGMEDRYGDNKGEVSGSKR
SKRSEEDRSRKRHKHLPSHHHRRSYSQDHHSSD*
>AT4G24740.2 | AFC2 (ARABIDOPSIS FUS3-COMPLEMENTING GENE 2) kinase/ protein kinase
MGEGTFGQVLECWDRERKEMVAVKIVRGVKKYREAAMIEIEMLQQLGKHDKGGNRCVQIR
NWFDYRNHICIVFEKLGSSLYDFLRKNNYRSFPIDLVREIGWQLLECVAFMHDLRMIHTD
LKPENILLVSSDYVKIPEYKGSRLQRDVCYKRVPKSSAIKVIDFGSTTYERQDQTYIVST
RHYRAPEVILGLGWSYPCDVWSVGCIIVELCTGEALFQTHENLEHLAMMERVLGPFPQQM
LKKVDRHSEKYVRRGRLDWPDGATSRDSLKAVLKLPRLQNLIMQHVDHSAGELINMVQGL
LRFDPSERITAREALRHPFFARRR*
>AT4G24740.1 | AFC2 (ARABIDOPSIS FUS3-COMPLEMENTING GENE 2) kinase/ protein kinase
MEMERVHEFPHTHMDRRPRKRARLGWDVLPQATKAQVGMFCGQEIGNISSFASSGAPSDN
SSSLCVKGVARNGSPPWREDDKDGHYIFELGDDLTPRYKIYSKMGEGTFGQVLECWDRER
KEMVAVKIVRGVKKYREAAMIEIEMLQQLGKHDKGGNRCVQIRNWFDYRNHICIVFEKLG
SSLYDFLRKNNYRSFPIDLVREIGWQLLECVAFMHDLRMIHTDLKPENILLVSSDYVKIP
EYKGSRLQRDVCYKRVPKSSAIKVIDFGSTTYERQDQTYIVSTRHYRAPEVILGLGWSYP
CDVWSVGCIIVELCTGEALFQTHENLEHLAMMERVLGPFPQQMLKKVDRHSEKYVRRGRL
DWPDGATSRDSLKAVLKLPRLQNLIMQHVDHSAGELINMVQGLLRFDPSERITAREALRH
PFFARRR*
>AT1G07350.1 | transformer serine/arginine-rich ribonucleoprotein putative
MGKREIHFTPVGRQVQRVLEYPLRLENRSPMSYSRRSRYSPSLSPYDKRRGRSVSRSLSR
SPTRSVSSDAENPGNSLYVTGLSHRVTERDLEDHFAKEGKVTDVHLVLDPWTRESRGFGF
ISMKSVGDANRCIRSLDHSVLQGRVITVEKARRRRGRTPTPGKYLGLRTARGRHKSPSYS
PRRSVSCSRSRSRSYSSDRGRSYSPSYGRRGRSSSYSPFYRRRRFYSPSRSPSPDDRYNR
RRDRSYSPYYRRRDRSRSYSRNCRARDRSPYYMRRYRSRSRSYSPRYRARDRSCSPYYRG
RDRSYSPHYQGRDRSYSPESRYYRRHRSVSGSVSPGGRSMSRSISPRKGRKESRSKSRRH
DRQSSMCHSRSARSSTSRSVSP*
>AT1G07350.2 | transformer serine/arginine-rich ribonucleoprotein putative
MSYSRRSRYSPSLSPYDKRRGRSVSRSLSRSPTRSVSSDAENPGNSLYVTGLSHRVTERD
LEDHFAKEGKVTDVHLVLDPWTRESRGFGFISMKSVGDANRCIRSLDHSVLQGRVITVEK
FLWQQVCCL*
>AT2G33730.1 | DEAD box RNA helicase putative
MKRTINEVVGSTSVNTLDSSIAKKPVFLTKKQREELALKRRQDQISEQRVRREQLGRPEP
ETEDVSNGDTNRDKDRDRDRDRDRERDRDRERDRGRDRDRDRDRDRDRDRERERDRERDR
RERDREPDRRNREKEREEEVKAREKARVEKLVEREKEKELDAIKEQYLGGKKPKKRVIRP
SEKFRFSFDWENTEDTSRDMNVLYQNPHEAQLLFGRGFRAGMDRREQKKQAAKHEKEMRD
EIRKKDGIVEKPEEAAAQRVREEAADTYDSFDMRVDRHWSDKRLEEMTERDWRIFREDFN
ISYKGSRIPRPMRSWEESKLTSELLKAVERAGYKKPSPIQMAAIPLGLQQRDVIGIAETG
SGKTAAFVLPMLAYISRLPPMSEENETEGPYAVVMAPTRELAQQIEEETVKFAHYLGFRV
TSIVGGQSIEEQGLKITQGCEIVIATPGRLIDCLERRYAVLNQCNYVVLDEADRMIDMGF
EPQVAGVLDAMPSSNLKPENEEEELDEKKIYRTTYMFSATMPPGVERLARKYLRNPVVVT
IGTAGKTTDLISQHVIMMKESEKFFRLQKLLDELGEKTAIVFVNTKKNCDSIAKNLDKAG
YRVTTLHGGKSQEQREISLEGFRAKRYNVLVATDVVGRGIDIPDVAHVINYDMPKHIEMY
THRIGRTGRAGKSGVATSFLTLHDTEVFYDLKQMLVQSNSAVPPELARHEASRFKPGTVP
DRPPRHSDTVYIN*
>AT1G20960.1 | emb1507 (embryo defective 1507) ATP binding / ATP-dependent helicase/ helicase/ nucleic acid binding / nucleoside-triphosphatase/ nucleotide binding
MANLGGGAEAHARFKQYEYRANSSLVLTTDNRPRDTHEPTGEPETLWGKIDPRSFGDRVA
KGRPQELEDKLKKSKKKERDVVDDMVNIRQSKRRRLREESVLTDTDDAVYQPKTKETRAA
YEAMLGLIQKQLGGQPPSIVSGAADEILAVLKNDAFRNPEKKMEIEKLLNKIENHEFDQL
VSIGKLITDFQEGGDSGGGRANDDEGLDDDLGVAVEFEENEEDDEESDPDMVEEDDDEED
DEPTRTGGMQVDAGINDEDAGDANEGTNLNVQDIDAYWLQRKISQAYEQQIDPQQCQVLA
EELLKILAEGDDRVVEDKLLMHLQYEKFSLVKFLLRNRLKVVWCTRLARAEDQEERNRIE
EEMRGLGPELTAIVEQLHATRATAKEREENLQKSINEEARRLKDETGGDGGRGRRDVADR
DSESGWVKGQRQMLDLESLAFDQGGLLMANKKCDLPPGSYRSHGKGYDEVHVPWVSKKVD
RNEKLVKITEMPDWAQPAFKGMQQLNRVQSKVYDTALFKAENILLCAPTGAGKTNVAMLT
ILQQLEMNRNTDGTYNHGDYKIVYVAPMKALVAEVVGNLSNRLKDYGVIVRELSGDQSLT
GREIEETQIIVTTPEKWDIITRKSGDRTYTQLVRLLIIDEIHLLHDNRGPVLESIVARTL
RQIETTKENIRLVGLSATLPNYEDVALFLRVDLKKGLFKFDRSYRPVPLHQQYIGISVKK
PLQRFQLMNDLCYQKVLAGAGKHQVLIFVHSRKETSKTARAIRDTAMANDTLSRFLKEDS
VTRDVLHSHEDIVKNSDLKDILPYGFAIHHAGLSRGDREIVETLFSQGHVQVLVSTATLA
WGVNLPAHTVIIKGTQVYNPEKGAWMELSPLDVMQMLGRAGRPQYDQHGEGIIITGYSEL
QYYLSLMNEQLPIESQFISKLADQLNAEIVLGTVQNAREACHWLGYTYLYIRMVRNPTLY
GLAPDALAKDVVLEERRADLIHSAATILDKNNLVKYDRKSGYFQVTDLGRIASYYYITHG
TIATYNEHLKPTMGDIDLYRLFSLSDEFKYVTVRQDEKMELAKLLDRVPIPIKETLEEPS
AKINVLLQAYISQLKLEGLSLTSDMVYITQSAGRLVRALYEIVLKRGWAQLAEKALNLSK
MVGKRMWSVQTPLRQFHGLSNDILMQLEKKDLVWERYYDLSAQELGELIRSPKMGKPLHK
FIHQFPKVTLSAHVQPITRTVLNVELTVTPDFLWDEKIHKYVEPFWIIVEDNDGEKILHH
EYFLLKKQYIDEDHTLHFTVPIFEPLPPQYFVRVVSDKWLGSETVLPVSFRHLILPEKYP
PPTELLDLQPLPVTALRNPNYEILYQDFKHFNPVQTQVFTVLYNTNDNVLVAAPTGSGKT
ICAEFAILRNHHEGPDATMRVVYIAPLEAIAKEQFRIWEGKFGKGLGLRVVELTGETALD
LKLLEKGQIIISTPEKWDALSRRWKQRKYVQQVSLFIVDELHLIGGQHGPVLEVIVSRMR
YISSQVINKIRIVALSTSLANAKDLGEWIGASSHGLFNFPPGVRPVPLEIHIQGVDISSF
EARMQAMTKPTYTAIVQHAKNKKPAIVFVPTRKHVRLTAVDLMAYSHMDNPQSPDFLLGK
LEELDPFVEQIREETLKETLCHGIGYLHEGLSSLDQEIVTQLFEAGRIQVCVMSSSLCWG
TPLTAHLVVVMGTQYYDGRENSHSDYPVPDLLQMMGRASRPLLDNAGKCVIFCHAPRKEY
YKKFLYEAFPVESQLQHFLHDNFNAEVVAGVIENKQDAVDYLTWTFMYRRLPQNPNYYNL
QGVSHRHLSDHLSELVENTLSDLEASKCIEVEDEMELSPLNLGMIASYYYISYTTIERFS
SLLSSKTKMKGLLEILTSASEYDMIPIRPGEEDTVRRLINHQRFSFENPKCTDPHVKANA
LLQAHFSRQNIGGNLAMDQRDVLLSATRLLQAMVDVISSNGWLNLALLAMEVSQMVTQGM
WERDSMLLQLPHFTKDLAKRCQENPGKNIETVFDLVEMEDEERQELLKMSDAQLLDIARF
CNRFPNIDLTYEIVGSEEVNPGKEVTLQVMLERDMEGRTEVGPVDSLRYPKTKEEGWWLV
VGDTKTNQLLAIKRVSLQRKVKVKLDFTAPSEPGEKSYTLYFMCDSYLGCDQEYSFSVDV
KGSGAGDRMEE*
>AT1G09760.1 | U2A (U2 small nuclear ribonucleoprotein A) protein binding
MVKLTADLIWKSPHFFNAIKERELDLRGNKIPVIENLGATEDQFDTIDLSDNEIVKLENF
PYLNRLGTLLINNNRITRINPNLGEFLPKLHSLVLTNNRLVNLVEIDPLASIPKLQYLSL
LDNNITKKANYRLYVIHKLKSLRVLDFIKIKAKERAEAASLFSSKEAEEEVKKVSREEVK
KVSETAENPETPKVVAPTAEQILAIKAAIINSQTIEEIARLEQALKFGQVPAGLIIPDPA
TNDSAPMEE*
>AT2G27100.1 | SE (SERRATE) DNA binding / transcription factor
MADVNLPPSDSVDNRLPEKSTSSSPPPPPPSSSLPQQEQEQDQQQLPLRRERDSRERRDE
RDIERPPPNRRERDRSPLPPPRRDYKRRPSLSPPPPYRDRRHSPPQRRSPPQKRYRRDDN
GYDGRRGSPRGGYGPPDRRFGYDHGGGYDREMGGRPGYGDERPHGRFMGRYQDWEGGRGG
YGDASNSGNPQRDGLMSYKQFIQELEDDILPSEAERRYQEYKSEYITTQKRAFFNTHKEE
DWLKNKYHPTNLLSVIERRNDLAQKVAKDFLLDLQSGTLDLGPAVTALNKSGRTSEPNSE
DEAAGVGKRKRHGMGGAKENELLSAAPKAPSFTSDPKRILTDVEQTQALVRKLDSEKKIE
ENVLQGSETEKSGREKLHSGSTGPVVIIRGLTSVKGLEGVELLDTLVTYLWRVHGLDYYG
KVETNEAKGLRHVRAEGKVSDAKGDENESKFDSHWQERLKGQDPLEVMAAKEKIDAAATE
ALDPHVRKIRDEKYGWKYGCGAKGCTKLFHAAEFVYKHLKLKHTELVTELTTKVREELYF
QNYMNDPNAPGGQPATQQSGPRDRPIRRKPSMENRLRDDRGGRRERDGRANGNDRNDRSE
DQQRGDNDGGNPGEVGYDAFGGQGGVHVPPFLSDINPPPMLMPVPGAGPLGPFVPAPPEV
AMQMFRDPSGPNPPFEGSGRGGPAPFLLSPAFRQDPRRLRSYQDLDAPEEEVTVIDYRSL
*
>AT5G64270.1 | splicing factor putative
MADLDPEIAKTQEERRKMEADLASLTSLTFDRDLYGGNDRASYSTSIAPNEEDDANLDTT
GSLVAQRLASYTAPRSILNDVARPHNEDDDVGFKPRQSIAEREGEYRNRRLNRVLSPDRV
DAFAMGDKTPDASVRTYTDHMRETALQREKEETMRLIAKKKKEEEEAAAKHQKDSAPPPP
ASSSSSSSKRRHRWDLPEEDGAAAKKAKAASSDWDLPDAAPGIGRWDAPTPGRVSDATPS
AGRRNRWDETPTPGRVTDSDATPGGGVTPGATPSGVTWDGLATPTPKRQRSRWDETPATM
GSATPMGGVTPGAAYTPGVTPIGGIDMATPTPGQLIFRGPMTPEQLNMQRWEKDIEERNR
PLSDEELDAMFPKDGYKVLDPPATYVPIRTPARKLQQTPTPMATPGYVIPEENRGQQYDV
PPEVPGGLPFMKPEDYQYFGSLLNEENEEELSPEEQKERKIMKLLLKVKNGTPPQRKTAL
RQLTDKARELGAGPLFNKILPLLMQPTLEDQERHLLVKVIDRILYKLDEMVRPYVHKILV
VIEPLLIDEDYYARVEGREIISNLSKAAGLASMIAAMRPDIDNIDEYVRNTTARAFSVVA
SALGIPALLPFLKAVCQSKRSWQARHTGIKIVQQIAILIGCAVLPHLRSLVEIIEHGLSD
ENQKVRTITALSLAALAEAAAPYGIESFDSVLKPLWKGIRSHRGKVLAAFLKAIGFIIPL
MDAIYASYYTKEVMVILIREFQSPDEEMKKIVLKVVKQCVSTEGVEPEYIRSDILPEFFR
NFWTRKMALERRNYKQLVETTVEVANKVGVADIVGRVVEDLKDESEQYRRMVMETIDKVV
TNLGASDIDARLEELLIDGILYAFQEQTSDDANVMLNGFGAVVNALGQRVKPYLPQICGT
IKWRLNNKSAKVRQQAADLISRIAVVMKQCGEEQLMGHLGVVLYEYLGEEYPEVLGSILG
ALKAIVNVIGMTKMTPPIKDLLPRLTPILKNRHEKVQENCIDLVGRIADRGAEFVPAREW
MRICFELLEMLKAHKKGIRRATVNTFGYIAKAIGPQDVLATLLNNLKVQERQNRVCTTVA
IAIVAETCSPFTVLPALMNEYRVPELNVQNGVLKSLSFLFEYIGEMGKDYIYAVTPLLED
ALMDRDLVHRQTAASAVKHMALGVAGLGCEDALVHLLNFIWPNIFETSPHVINAVMEAIE
GMRVALGAAVILNYCLQGLFHPARKVREVYWKIYNSLYIGAQDTLVAAYPVLEDEQNNVY
SRPELTMFV*
>AT2G44680.1 | CKB4 (CASEIN KINASE II BETA SUBUNIT 4) protein serine/threonine kinase
MYKDRSGGGIMGGGGSSRSEILGGAIDRKRINDALDKHLKKSSPSTSRVFTSKDKDSVPS
TSTAKSQLHSRSPDVESDTDSEGSDVSGSEGDDTSWISWFCNLRGNEFFCEVDEDYIQDD
FNLCGLSGQVPYYDYALDLILDVESSNGDMFTEEQHEMVESAAEMLYGLIHVRYILTTKG
MAAMMEKYKNYDFGRCPRVFCCGQSCLPVGQSDIPRSSTVKIYCPKCEDIYYPRSKYQGN
IDGAYFGTTFPHLFLMAYGNMKPQKPAQNYVPKIFGFKVHNKQ*
>AT2G44680.2 | CKB4 (CASEIN KINASE II BETA SUBUNIT 4) protein serine/threonine kinase
MYKDRSGGGIMGGGGSSRSEILGGAIDRKRINDALDKHLKKSSPSTSRVFTSKDKDSVPS
TSTAKSQLHSRSPDVESDTDSEGSDVSGSEGDDTSWISWFCNLRGNEFFCEVDEDYIQDD
FNLCGLSGQVPYYDYALDLILDVESSNGDMFTEEQHEMVESAAEMLYGLIHVRYILTTKG
MAAMMEKYKNYDFGRCPRVFCCGQSCLPVGQSDIPRSSTVKIYCPKCEDIYYPRSKYQDI
DGAYFGTTFPHLFLMAYGNMKPQKPAQNYVPKIFGFKVHNKQ*
>AT4G32420.1 | peptidyl-prolyl cis-trans isomerase cyclophilin-type family protein
MAKKKNPQVFMDVSIDGDPAETMVFELFPEVAPKTSENFRALCTGEKGIGPRSGKPLHYK
GSFFHRIMKGSSAQAGDFVNRNGTAGESIYAGKFPDESPKLRHEETGLLSMSIADRDKFG
SHFHITFRPNQQLDRNNVVFGKLIQGKEILKKIERVGDEEGKPTVSVKIIRCGEYSGDKK
KSDGKKNGKHKKSLRVRRKKRRRHSSSESESSSDSETDSSESDSESDSDLSSPSFLSSSS
HERQKKRKRSSKKDKHRRSKQRDKRHEKKRSMRDKRPKRKSRRSPDSLEDSNSGSEASLS
DVNVEIGAKKRKHRVSRRTGNSAPAVEKEAESLHQGKRKGPDLLENRGLRSNGISDAASE
QISDRQPDIVDDHPSKSRSRSLSPKRTVSKSTSVSPRRSQSKSPSSSPRWNGGRSPAKGS
RQVKNLTNSRRESPGSEEKGRHVRRSPTKSVSRSPVRVKKERDISRSPSKSLSRSPLRSP
KRVISRSPVRGRIARSPSRSPVRSASRGSLGRGPLRRSSRRSPSRSPVRSSRRSLSRSPI
QLSRRSLSRSPTRLSRRSLSRSPIRSPRKSVSRSPVRSSRKSVSRSPVRSSRRRISRSPV
RSSRKSVSRSPIRLSRRSISRSPIRLSRRSISRSPVRGRRRISRSPVPARRRSVRPRSPP
PDRRRSLSRSASPNGRIRRGRGFSQRFSYARRYRTSPSPDRSPYRFSDRSDRDRFRSRRR
FSPSRFRSPLRGRTPPRYRRRSRSVSPGLCYRNRRYSRSPIRSRSPPYRKRRSPSASHSL
SPSRSRSRSKSYSKSPIGTGKARSVSRSPSKARSPSKSDSTSSDNSPGGKKGLVAYD*
>AT4G32420.2 | peptidyl-prolyl cis-trans isomerase cyclophilin-type family protein
MSIADRDKFGSHFHITFRPNQQLDRNNVVFGKLIQGKEILKKIERVGDEEGKPTVSVKII
RCGEYSGDKKKSDGKKNGKHKKSLRVRRKKRRRHSSSESESSSDSETDSSESDSESDSDL
SSPSFLSSSSHERQKKRKRSSKKDKHRRSKQRDKRHEKKRSMRDKRPKRKSRRSPDSLED
SNSGSEASLSDVNVEIGAKKRKHRVSRRTGNSAPAVEKEAESLHQGKRKGPDLLENRGLR
SNGISDAASEQISDRQPDIVDDHPSKSRSRSLSPKRTVSKSTSVSPRRSQSKSPSSSPRW
NGGRSPAKGSRQVKNLTNSRRESPGSEEKGRHVRRSPTKSVSRSPVRVKKERDISRSPSK
SLSRSPLRSPKRVISRSPVRGRIARSPSRSPVRSASRGSLGRGPLRRSSRRSPSRSPVRS
SRRSLSRSPIQLSRRSLSRSPTRLSRRSLSRSPIRSPRKSVSRSPVRSSRKSVSRSPVRS
SRRRISRSPVRSSRKSVSRSPIRLSRRSISRSPIRLSRRSISRSPVRGRRRISRSPVPAR
RRSVRPRSPPPDRRRSLSRSASPNGRIRRGRGFSQRFSYARRYRTSPSPDRSPYRFSDRS
DRDRFRSRRRFSPSRFRSPLRGRTPPRYRRRSRSVSPGLCYRNRRYSRSPIRSRSPPYRK
RRSPSASHSLSPSRSRSRSKSYSKSPIGTGKARSVSRSPSKARSPSKSDSTSSDNSPGGK
KGLVAYD*
>AT5G64200.1 | ATSC35 RNA binding / nucleic acid binding / nucleotide binding
MSHFGRSGPPDISDTYSLLVLNITFRTTADDLYPLFAKYGKVVDVFIPRDRRTGDSRGFA
FVRYKYKDEAHKAVERLDGRVVDGREITVQFAKYGPNAEKISKGRVVEPPPKSRRSRSRS
PRRSRSPRRSRSPPRRRSPRRSRSPRRRSRDDYREKDYRKRSRSRSYDRRERHEEKDRDH
RRRTRSRSASPDEKRRVRGRYDNESRSHSRSLSASPARRSPRSSSPQKTSPAREVSPDKR
SNERSPSPRRSLSPRSPALQKASPSKEMSPERRSNERSPSPGSPAPLRKVDAASRSQSPY
AAE*
>AT5G64200.2 | ATSC35 RNA binding / nucleic acid binding / nucleotide binding
MSHFGRSGPPDISDTYSLLVLNITFRTTADDLYPLFAKYGKVVDVFIPRDRRTGDSRGFA
FVRYKYKDEAHKAVERLDGRVVDGREITVQFAKYGPNAEKISKGRVVEPPPKSRRSRSRS
PRRSRSPRRSRSPPRRRSPRRSRSPRRRSRDDYREKDYRKRSRSRSYDRRERHEEKDRDH
RRRTRSRSASPDEKRRVRGRYDNESRSHSRSLSASPARRSPRSSSPQKTSPAREVSPDKR
SNERSPSPRRSLSPRSPALQKASPSKEMSPERRSNERSPSPGSPAPLRKVDAASRSQSPY
AAE*
>AT2G47580.1 | U1A (SPLICEOSOMAL PROTEIN U1A) RNA binding / nucleic acid binding / nucleotide binding
MEMQEANQGGGSEVSPNQTIYINNLNEKVKLDELKKSLNAVFSQFGKILEILAFKTFKHK
GQAWVVFDNTESASTAIAKMNNFPFYDKEMRIQYAKTKSDVVAKADGTFVPREKRKRHEE
KGGGKKKKDQHHDSTQMGMPMNSAYPGVYGAAPPLSQVPYPGGMKPNMPEAPAPPNNILF
VQNLPHETTPMVLQMLFCQYQGFKEVRMIEAKPGIAFVEFADEMQSTVAMQGLQGFKIQQ
NQMLITYAKK*
>AT1G02840.1 | SR1 RNA binding / nucleic acid binding / nucleotide binding
MSSRSSRTVYVGNLPGDIREREVEDLFSKYGPVVQIDLKVPPRPPGYAFVEFDDARDAED
AIHGRDGYDFDGHRLRVELAHGGRRSSDDTRGSFNGGGRGGGRGRGDGGSRGPSRRSEFR
VLVTGLPSSASWQDLKDHMRKGGDVCFSQVYRDARGTTGVVDYTCYEDMKYALKKLDDTE
FRNAFSNGYVRVREYDSRKDSRSPSRGRSYSKSRSRSRGRSVSRSRSRSRSRSRSPKAKS
SRRSPAKSTSRSPGPRSKSRSPSPRRSRSRSRSPLPSVQKEGSKSPSKPSPAKSPIHTRS
PSR*
>AT1G02840.2 | SR1 RNA binding / nucleic acid binding / nucleotide binding
MSSRSSRTVYVGNLPGDIREREVEDLFSKYGPVVQIDLKVPPRPPGYAFVEFDDARDAED
AIHGRDGYDFDGHRLRVELAHGGRRSSDDTRGSFNGGGRGGGRGRGDGGSRGPSRRSEFR
VLVTGLPSSASWQDLKDHMRKGGDVCFSQVYRDARGTTGVVDYTCYEDMKYALKKLDDTE
FRNAFSNGYVRVREYDSRKDSRSPSRGRSYSKSRSRSRGRSVSRSRSRSRSRSRSPKAKS
SRRSPAKSTSRSPGPRSKSRSPSPRRWITVETLDHLDHNIISGFL*
>AT1G02840.3 | SR1 RNA binding / nucleic acid binding / nucleotide binding
MSSRSSRTVYVGNLPGDIREREVEDLFSKYGPVVQIDLKVPPRPPGYAFVEFDDARDAED
AIHGRDGYDFDGHRLRVELAHGGRRSSDDTRGSFNGGGRGGGRGRGDGGSRGPSRRSEFR
VLVTGLPSSASWQDLKDHMRKGGDVCFSQVYRDARGTTGVVDYTCYEDMKYALKKLDDTE
FRNAFSNGYVRVREYDSRKDSRSPSRGRSYSKSRSRSRGRSVSRSRSRSRSRSRSPKAKS
SRRSPAKSTSRSPGPRSKSRSPSPRRSRSRSRSPLPSVQKEGSKSPSKPSPAKSPIHTRS
PSR*
>AT1G28060.1 | small nuclear ribonucleoprotein family protein / snRNP family protein
MDKERYSRSHRDDRDRDSSPDHSPQREGGRRRDRDVDSKRRDSDHYRSSRRGDREDERDR
TKDRRGRSVERGEREGSRDREKHHHERSHEGSKEKESRSKRKDREEENGARDGKKKSRFA
DGNGERRSRFEDVAIEVENKDAQVSEGSGATNPTSGVTMGASTYSSIPSEASAAPSQTLL
TKVSSISTTDENKASVVRSHEVPGKSSTDGRPLSTAGKSSANLPLDSSALAAKARKALQL
QKGLADRLKNLPLLKKATKPTSEGSPHTRVPPSTTTPAVSTGTSFASTLPHTGLAGFGSI
ANIEAVKRAQELAANMGFHQDREFAPVINLFPGQAPSDMTVAQRPEKPPVLRVDALGREI
DEHGNVISVTKPSNLSTLKVNINKKKKDAFQILKPQLEADLKENPYFDTRMGIDEKKILR
PKRMSFQFVEEGKWTRDAENLKFKSHFGEAKAKELKVKQAQLAKANDDINPNLIEVSERV
PRKEKPKEPIPDVEWWDANVLTNGEYGEITDGTITESHLKIEKLTHYIEHPRPIEPPAEA
APPPPQPLKLTKKEQKKLRTQRRLAKEKEKQEMIRQGLLEPPKAKVKMSNLMKVLGSEAT
QDPTKLEKEIRTAAAEREQAHTDRNAARKLTPAEKREKKERKLFDDPTTVETIVSVYKIK
KLSHPKTRFKVEMNARENRLTGCSVMTDEMSVVVVEGKSKAIKRYGKLMMKRINWEEAER
KEGNEDEEEEVNGGNKCWLVWQGSIGKPSFHRFHVHECVTESTAKKVFMDAGVVHYWDLA
VNYSDD*
>AT2G37340.2 | RSZ33 nucleic acid binding / nucleotide binding / zinc ion binding
MWENTPCMWSLLSSHFRNQEFGDPRDADDARHYLDGRDFDGSRITVEFSRGAPRGSRDFD
SRGPPPGAGRCFNCGVDGHWARDCTAGDWKNKCYRCGERGHIERNCKNSPKKLRRSGSYS
RSPVRSRSPRRRRSPSRSLSRSRSYSRSRSPVRRRERSVEERSRSPKRMDDSLSPRARDR
SPVLDDEGSPKIIDGSPPPSPKLQKEVGSDRDGGSPQDNGRNSVVSPVVGAGGDSSKEDR
SPVDDDYEPNRTSPRGSESP*
>AT2G37340.3 | RSZ33 nucleic acid binding / nucleotide binding / zinc ion binding
MKRDYAFVEFGDPRDADDARHYLDGRDFDGSRITVEFSRGAPRGSRDFDSRGPPPGAGRC
FNCGVDGHWARDCTAGDWKNKCYRCGERGHIERNCKNSPKKLRRSGSYSRSPVRSRSPRR
RRSPSRSLSRSRSYSRSRSPVRRRERSVEERSRSPKRMDDSLSPRARDRSPVLDDEGSPK
IIDGSPPPSPKLQKEVGSDRDGGSPQDNGRNSVVSPVVGAGGDSSKEDRSPVDDDYEPNR
TSPRGSESP*
>AT2G37340.1 | RSZ33 nucleic acid binding / nucleotide binding / zinc ion binding
MPRYDDRYGNTRLYVGRLSSRTRTRDLERLFSRYGRVRDVDMKRDYAFVEFGDPRDADDA
RHYLDGRDFDGSRITVEFSRGAPRGSRDFDSRGPPPGAGRCFNCGVDGHWARDCTAGDWK
NKCYRCGERGHIERNCKNSPKKLRRSGSYSRSPVRSRSPRRRRSPSRSLSRSRSYSRSRS
PVRRRERSVEERSRSPKRMDDSLSPRARDRSPVLDDEGSPKIIDGSPPPSPKLQKEVGSD
RDGGSPQDNGRNSVVSPVVGAGGDSSKEDRSPVDDDYEPNRTSPRGSESP*
>AT1G80070.1 | SUS2 (ABNORMAL SUSPENSOR 2)
MWNNNDGMPLAPPGTGGSMMPPPPAAHPSYTALPPPSNPTPPVEPTPEEAEAKLEEKARK
WMQLNSKRYGDKRKFGFVETQKEDMPPEHVRKIIRLVFFSSFSTISKYSLLDNYFLARDH
GDMSSKKFRHDKRVYLGALKFVPHAVFKLLENMPMPWEQVRDVKVLYHITGAITFVNEIP
WVVEPIYMAQWGTMWIMMRREKRDRRHFKRMRFPPFDDEEPPLDYADNLLDVDPLEPIQL
ELDEEEDSAVHTWFYDHKPLVKTKLINGPSYRRWNLSLPIMATLHRLAGQLLSDLIDRNY
FYLFDMPSFFTAKALNMCIPGGPKFEPLYRDMEKGDEDWNEFNDINKLIIRSPLRTEYRI
AFPHLYNNRPRKVKLCVYHSPMIMYIKTEDPDLPAFYYDPLIHPISNTNKEKRERKVYDD
EDDFALPEGVEPLLRDTQLYTDTTAAGISLLFAPRPFNMRSGRTRRAEDIPLVSEWFKEH
CPPAYPVKVRVSYQKLLKCYVLNELHHRPPKAQKKKHLFRSLAATKFFQSTELDWVEVGL
QVCRQGYNMLNLLIHRKNLNYLHLDYNFNLKPVKTLTTKERKKSRFGNAFHLCREILRLT
KLVVDANVQFRLGNVDAFQLADGLQYIFSHVGQLTGMYRYKYRLMRQIRMCKDLKHLIYY
RFNTGPVGKGPGCGFWAPMWRVWLFFLRGIVPLLERWLGNLLARQFEGRHSKGVAKTVTK
QRVESHFDLELRAAVMHDVLDAMPEGIKQNKARTILQHLSEAWRCWKANIPWKVPGLPVP
IENMILRYVKSKADWWTNVAHYNRERIRRGATVDKTVCRKNLGRLTRLWLKAEQERQHNY
LKDGPYVTPEEALAIYTTTVHWLESRKFSPIPFPPLSYKHDTKLLILALERLKESYSVAV
RLNQQQREELGLIEQAYDNPHEALSRIKRHLLTQRGFKEVGIEFMDLYSYLIPVYEIEPL
EKITDAYLDQYLWYEGDKRHLFPNWIKPADSEPPPLLVYKWCQGINNLQGIWDTGDGQCV
VMLQTKFEKFFEKIDLTMLNRLLRLVLDHNIADYVSAKNNVVLSYKDMSHTNSYGLIRGL
QFASFVVQFYGLLLDLLLLGLTRASEIAGPPQMPNEFMTFWDTKVETRHPIRLYSRYIDK
VHIMFKFTHEEARDLIQRYLTEHPDPNNENMVGYNNKKCWPRDARMRLMKHDVNLGRSVF
WDMKNRLPRSITTLEWENGFVSVYSKDNPNLLFSMCGFEVRILPKIRMTQEAFSNTKDGV
WNLQNEQTKERTAVAFLRVDDEHMKVFENRVRQILMSSGSTTFTKIVNKWNTALIGLMTY
FREATVHTQELLDLLVKCENKIQTRIKIGLNSKMPSRFPPVIFYTPKEIGGLGMLSMGHI
LIPQSDLRYSKQTDVGVTHFRSGMSHEEDQLIPNLYRYIQPWESEFIDSQRVWAEYALKR
QEAQAQNRRLTLEDLEDSWDRGIPRINTLFQKDRHTLAYDKGWRVRTDFKQYQVLKQNPF
WWTHQRHDGKLWNLNNYRTDVIQALGGVEGILEHTLFKGTYFPTWEGLFWEKASGFEESM
KYKKLTNAQRSGLNQIPNRRFTLWWSPTINRANVYVGFQVQLDLTGIFMHGKIPTLKISL
IQIFRAHLWQKIHESVVMDLCQVLDQELDALEIETVQKETIHPRKSYKMNSSCADVLLFA
AHKWPMSKPSLVAESKDMFDQKASNKYWIDVQLRWGDYDSHDIERYTRAKFMDYTTDNMS
IYPSPTGVMIGLDLAYNLHSAFGNWFPGSKPLLAQAMNKIMKSNPALYVLRERIRKGLQL
YSSEPTEPYLSSQNYGEIFSNQIIWFVDDTNVYRVTIHKTFEGNLTTKPINGAIFIFNPR
TGQLFLKVIHTSVWAGQKRLGQLAKWKTAEEVAALVRSLPVEEQPKQIIVTRKGMLDPLE
VHLLDFPNIVIKGSELQLPFQACLKIEKFGDLILKATEPQMVLFNIYDDWLKSISSYTAF
SRLILILRALHVNNEKAKMLLKPDKSVVTEPHHIWPSLTDDQWMKVEVALRDLILSDYAK
KNNVNTSALTQSEIRDIILGAEITPPSQQRQQIAEIEKQAKEASQLTAVTTRTTNVHGDE
LIVTTTSPYEQSAFGSKTDWRVRAISATNLYLRVNHIYVNSDDIKETGYTYIMPKNILKK
FICVADLRTQIAGYLYGISPPDNPQVKEIRCVVMVPQWGNHQLVHLPSSLPEHDFLNDLE
PLGWLHTQPNELPQLSPQDVTSHSRILENNKQWDGEKCIILTCSFTPGSCSLTSYKLTQT
GYEWGRLNKDNGSNPHGYLPTHYEKVQMLLSDRFLGFYMVPESGPWNYSFTGVKHTLSMK
YSVKLGSPKEFYHEEHRPTHFLEFSNMEEADITEGDREDTFT*
>AT4G26300.1 | emb1027 (embryo defective 1027) ATP binding / aminoacyl-tRNA ligase/ arginine-tRNA ligase/ nucleotide binding
MFIFPKDENRRETLTTKLRFSADHLTFTTVTEKLRATAWRFAFSSRAKSVVAMAANEEFT
GNLKRQLAKLFDVSLKLTVPDEPSVEPLVAASALGKFGDYQCNNAMGLWSIIKGKGTQFK
GPPAVGQALVKSLPTSEMVESCSVAGPGFINVVLSAKWMAKSIENMLIDGVDTWAPTLSV
KRAVVDFSSPNIAKEMHVGHLRSTIIGDTLARMLEYSHVEVLRRNHVGDWGTQFGMLIEY
LFEKFPDTDSVTETAIGDLQVFYKASKHKFDLDEAFKEKAQQAVVRLQGGDPVYRKAWAK
ICDISRTEFAKVYQRLRVELEEKGESFYNPHIAKVIEELNSKGLVEESEGARVIFLEGFD
IPLMVVKSDGGFNYASTDLTALWYRLNEEKAEWIIYVTDVGQQQHFNMFFKAARKAGWLP
DNDKTYPRVNHVGFGLVLGEDGKRFRTRATDVVRLVDLLDEAKTRSKLALIERGKDKEWT
PEELDQTAEAVGYGAVKYADLKNNRLTNYTFSFDQMLNDKGNTAVYLLYAHARICSIIRK
SGKDIDELKKTGKLALDHADERALGLHLLRFAETVEEACTNLLPSVLCEYLYNLSEHFTR
FYSNCQVNGSPEETSRLLLCEATAIVMRKCFHLLGITPVYKI*
>AT4G28706.1 | pfkB-type carbohydrate kinase family protein
MLCFTHVPPLPLFSTSGIYLPASRRFTRIRMSSSSSIDSVPPPPDNAIVLGCGGIAVDFL
ATVDSYPQADDKIRSTSLKVQGGGNAANALTCAARLGLNSRLISKVANDSQGKGMLEELE
ADGVDTSFIVVSKEGNSPFTYILVDNQTKTRTCIHTPGDPPMLPTDLSQSSMLSALDRAS
IVYFDVRLHETALVIAKEASRKKIPILVDTEKKRDGLDDLLPFADYVVCPENFPQTWTEV
SSTPGALVSMLLRLPKLKFVIVTSGEHGCLMVQRASKEVFESQETDIESLLETLKHRKDS
TTTFPTCVSSETTKLKANGVGTMTGRLFLGTAEKIPPEELVDTTGAGDAFIGAVLYAICA
GMPPEKMLPFAAQVAGCSCRALGARTGLPHRTDPRLVPFLV*
>AT4G28706.3 | pfkB-type carbohydrate kinase family protein
MLCFTHVPPLPLFSTSGIYLPASRRFTSFRIRMSSSSSIDSVPPPPDNAIVLGCGGIAVD
FLATVDSYPQADDKIRSTSLKVQGGGNAANALTCAARLGLNSRLISKVANDSQGKGMLEE
LEADGVDTSFIVVSKEGNSPFTYILVDNQTKTRTCIHTPGDPPMLPTDLSQSSMLSALDR
ASIVYFDVRLHETALVIAKEASRKKIPILVDTEKKRDGLDDLLPFADYVVCPENFPQTWT
EVSSTPGALVSMLLRLPKLKFVIVTSGEHGCLMVQRASKEVFESQETDIESLLETLKHRK
DSTTTFPTCVSSETTKLKANGVGTMTGRLFLGTAEKIPPEELVDTTGAGDAFIGAVLYAI
CAGMPPEKMLPFAAQVAGCSCRALGARTGLPHRTDPRLVPFLV*
>AT4G28706.2 | pfkB-type carbohydrate kinase family protein
MLCFTHVPPLPLFSTSGIYLPASRRFTSFRIRMSSSSSIDSVPPPPDNAIVLGCGGIAVD
FLATVDSYPQADDKIRSTSLKVQGGGNAANALTCAARLGLNSRLISKVANDSQGKGMLEE
LEADGVDTSFIVVSKEGNSPFTYILVDNQTKTRTCIHTPGDPPMLPTDLSQSSMLSALDR
ASIVYFDVRLHETALVIAKEASRKKIPILVDTEKKRDGLDDLLPFADYVVCPENFPQTWT
EVSSTPGALVSMLLRLPKLKFVIVTSGEHGCLMVQRASKAEVFESQETDIESLLETLKHR
KDSTTTFPTCVSSETTKLKANGVGTMTGRLFLGTAEKIPPEELVDTTGAGDAFIGAVLYA
ICAGMPPEKMLPFAAQVAGCSCRALGARTGLPHRTDPRLVPFLV*
>AT1G06220.2 | MEE5 (MATERNAL EFFECT EMBRYO ARREST 5) GTP binding / GTPase/ translation elongation factor/ translation factor nucleic acid binding
MESSLYDEFGNYVGPEIESDRDSDDEVEDEDLQDKHLEENGSDGEQGPGGSNGWITTIND
VEMENQIVLPEDKKYYPTAEEVYGEDVETLVMDEDEQPLEQPIIKPVRDIRFEVGVKDQA
TYVSTQFLIGLMSNPALVRNVALVGHLQHGKTVFMDMLVEQTHHMSTFNAKNEKHMKYTD
TRVDEQERNISIKAVPMSLVLEDSRSKSYLCNIMDTPGHVNFSDEMTASLRLADGAVLIV
DAAEGVMVNTERAIRHAIQDHLPIVVVINKVDRLITELKLPPRDAYYKLRHTIEVINNHI
SAASTTAGDLPLIDPAAGNVCFASGTAGWSFTLQSFAKMYAKLHGVAMDVDKFASRLWGD
VYYHSDTRVFKRSPPVGGGERAFVQFILEPLYKIYSQVIGEHKKSVETTLAELGVTLSNS
AYKLNVRPLLRLACSSVFGSASGFTDMLVKHIPSPREAAARKVDHSYTGTKDSPIYESMV
ECDPSGPLMVNVTKLYPKSDTSVFDVFGRVYSGRLQTGQSVRVLGEGYSPEDEEDMTIKE
VTKLWIYQARYRIPVSSAPPGSWVLIEGVDASIMKTATLCNASYDEDVYIFRALQFNTLP
VVKTATEPLNPSELPKMVEGLRKISKSYPLAITKVEESGEHTILGTGELYLDSIMKDLRE
LYSEVEVKVADPVVSFCETVVESSSMKCFAETPNKKNKITMIAEPLDRGLAEDIENGVVS
IDWNRKQLGDFFRTKYDWDLLAARSIWAFGPDKQGPNILLDDTLPTEVDRNLMMAVKDSI
VQGFQWGAREGPLCDEPIRNVKFKIVDARIAPEPLHRGSGQMIPTARRVAYSAFLMATPR
LMEPVYYVEIQTPIDCVTAIYTVLSRRRGHVTSDVPQPGTPAYIVKAFLPVIESFGFETD
LRYHTQGQAFCLSVFDHWAIVPGDPLDKAIQLRPLEPAPIQHLAREFMVKTRRRKGMSED
VSGNKFFDEAMMVELAQQTGDLHLQMI*
>AT1G06220.1 | MEE5 (MATERNAL EFFECT EMBRYO ARREST 5) GTP binding / GTPase/ translation elongation factor/ translation factor nucleic acid binding
MESSLYDEFGNYVGPEIESDRDSDDEVEDEDLQDKHLEENGSDGEQGPGGSNGWITTIND
VEMENQIVLPEDKKYYPTAEEVYGEDVETLVMDEDEQPLEQPIIKPVRDIRFEVGVKDQA
TYVSTQFLIGLMSNPALVRNVALVGHLQHGKTVFMDMLVEQTHHMSTFNAKNEKHMKYTD
TRVDEQERNISIKAVPMSLVLEDSRSKSYLCNIMDTPGHVNFSDEMTASLRLADGAVLIV
DAAEGVMVNTERAIRHAIQDHLPIVVVINKVDRLITELKLPPRDAYYKLRHTIEVINNHI
SAASTTAGDLPLIDPAAGNVCFASGTAGWSFTLQSFAKMYAKLHGVAMDVDKFASRLWGD
VYYHSDTRVFKRSPPVGGGERAFVQFILEPLYKIYSQVIGEHKKSVETTLAELGVTLSNS
AYKLNVRPLLRLACSSVFGSASGFTDMLVKHIPSPREAAARKVDHSYTGTKDSPIYESMV
ECDPSGPLMVNVTKLYPKSDTSVFDVFGRVYSGRLQTGQSVRVLGEGYSPEDEEDMTIKE
VTKLWIYQARYRIPVSSAPPGSWVLIEGVDASIMKTATLCNASYDEDVYIFRALQFNTLP
VVKTATEPLNPSELPKMVEGLRKISKSYPLAITKVEESGEHTILGTGELYLDSIMKDLRE
LYSEVEVKVADPVVSFCETVVESSSMKCFAETPNKKNKITMIAEPLDRGLAEDIENGVVS
IDWNRKQLGDFFRTKYDWDLLAARSIWAFGPDKQGPNILLDDTLPTEVDRNLMMAVKDSI
VQGFQWGAREGPLCDEPIRNVKFKIVDARIAPEPLHRGSGQMIPTARRVAYSAFLMATPR
LMEPVYYVEIQTPIDCVTAIYTVLSRRRGHVTSDVPQPGTPAYIVKAFLPVIESFGFETD
LRYHTQGQAFCLSVFDHWAIVPGDPLDKAIQLRPLEPAPIQHLAREFMVKTRRRKGMSED
VSGNKFFDEAMMVELAQQTGDLHLQMI*
>AT5G23880.1 | CPSF100 (CLEAVAGE AND POLYADENYLATION SPECIFICITY FACTOR 100) DNA binding / protein binding
MGTSVQVTPLCGVYNENPLSYLVSIDGFNFLIDCGWNDLFDTSLLEPLSRVASTIDAVLL
SHPDTLHIGALPYAMKQLGLSAPVYATEPVHRLGLLTMYDQFLSRKQVSDFDLFTLDDID
SAFQNVIRLTYSQNYHLSGKGEGIVIAPHVAGHMLGGSIWRITKDGEDVIYAVDYNHRKE
RHLNGTVLQSFVRPAVLITDAYHALYTNQTARQQRDKEFLDTISKHLEVGGNVLLPVDTA
GRVLELLLILEQHWSQRGFSFPIYFLTYVSSSTIDYVKSFLEWMSDSISKSFETSRDNAF
LLRHVTLLINKTDLDNAPPGPKVVLASMASLEAGFAREIFVEWANDPRNLVLFTETGQFG
TLARMLQSAPPPKFVKVTMSKRVPLAGEELIAYEEEQNRLKREEALRASLVKEEETKASH
GSDDNSSEPMIIDTKTTHDVIGSHGPAYKDILIDGFVPPSSSVAPMFPYYDNTSEWDDFG
EIINPDDYVIKDEDMDRGAMHNGGDVDGRLDEATASLMLDTRPSKVMSNELIVTVSCSLV
KMDYEGRSDGRSIKSMIAHVSPLKLVLVHAIAEATEHLKQHCLNNICPHVYAPQIEETVD
VTSDLCAYKVQLSEKLMSNVIFKKLGDSEVAWVDSEVGKTERDMRSLLPMPGAASPHKPV
LVGDLKIADFKQFLSSKGVQVEFAGGGALRCGEYVTLRKVGPTGQKGGASGPQQILIEGP
LCEDYYKIRDYLYSQFYLL*
>AT1G03910.1 | EXPRESSED IN 25 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s Cactin central region (InterProIPR018816) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT2G368152) Has 11516 Blast hits to 6722 proteins in 356 species Archae - 23 Bacteria - 259 Metazoa - 6122 Fungi - 1009 Plants - 493 Viruses - 33 Other Eukaryotes - 3577 (source NCBI BLink)
MGSHGKGKRDRSGRQKKRRDESESGSESESYTSDSDGSDDLSPPRSSRRKKGSSSRRTRR
RSSSDDSSDSDGGRKSKKRSSSKDYSEEKVTEYMSKKAQKKALRAAKKLKTQSVSGYSND
SNPFGDSNLTETFVWRKKIEKDVHRGVPLEEFSVKAEKRRHRERMTEVEKVKKRREERAV
EKARHEEEMALLARERARAEFHDWEKKEEEFHFDQSKVRSEIRLREGRLKPIDVLCKHLD
GSDDLDIELSEPYMVFKGLTVKDMEELRDDIKMYLDLDRATPTRVQYWEALIVVCDWELA
EARKRDALDRARVRGEEPPAELLAQERGLHAGVEADVRKLLDGKTHAELVELQLDIESQL
RSGSAKVVEYWEAVLKRLEIYKAKACLKEIHAEMLRRHLHRLEQLSEGEDDVEVNPGLTR
VVEENEEEINDTNLSDAEEAFSPEPVAEEEEADEAAEAAGSFSPELMHGDDREEAIDPEE
DKKLLQMKRMIVLEKQKKRLKEAMDSKPAPVEDNLELKAMKAMGAMEEGDAIFGSNAEVN
LDSEVYWWHDKYRPRKPKYFNRVHTGYEWNKYNQTHYDHDNPPPKIVQGYKFNIFYPDLV
DKIKAPIYTIEKDGTSAETCMIRFHAGPPYEDIAFRIVNKEWEYSHKKGFKCTFERGILH
LYFNFKRHRYRR*
>AT1G15200.1 | protein-protein interaction regulator family protein
MGDTALEKTAEELRHEIDELHRQQREITERLRDPRGLRRGGFSNVAPRNQGRRGFPRPAE
RNDVEDEPPAKRRLSSAVVKVDGEDVSKDGEFPVDGNGTQVKVGENGTSDQSDKKQSGLH
RGSWSQRDAEQRRTNKRYEAFALPEPAPRVLPKNEDPKLVNRNRRMLGNLLGTLEKFRKE
DKQRSGTDAYARRTAALQRAEEKAREESERLRLQERENLTEKRRRDLTLRARVAAKAEQK
KLELLFLQWSEHQKKLSNFIRTKAEPRIYYAPVKPLEEDTSEVEQQKERTFLEWKAARRQ
EVSEYQKEIEEQCLGNVEKELERWQNARKARKANNEGMNLQETMDKELETHRMEHGPKKR
KIPGGGVGDEDEEDEVEDINGGEDEMIMDDLLEEGGDGTIKEEVATDTVKAEAVEEDIKH
EVL*
>AT1G15470.1 | transducin family protein / WD-40 repeat family protein
MGAPLVCHGHSRPVVDVAYSPVTPDGFFLISASKDSNPMLRNGETGDWIGTFEGHKGAVW
SCSLDKNAIRAASASADFTAKIWNALTGDELHSFEHKHIVRACAFSEDTHRLLTGGMEKI
LRIFDLNRPDAPPKEVGNSPGSIRTVEWLHSDNTILSSCTDTGDIRLWDIRSDKIVHTLE
TKSPVTSAEVSQDGRYITTADGSSVKFWDAKNFGLLKSYDMPCNVESASLEPKHGNTFIA
GGEDMWVHRFDFQTGEEIGCNKGHHGPVHCVRYAPGGESYTSGSEDGTVRIWVVGSVNHP
EESNLSGHVKLVAEEVVRKAESLRISEKAAEAK*
>AT1G44910.1 | protein binding
MANNPPQSSGTQFRPMVPGQQGQHFVPAASQPFHPYGHVPPNVQSQPPQYSQPIQQQQLF
PVRPGQPVHITSSSQAVSVPYIQTNKILTSGSTQPQPNAPPMTGFATSGPPFSSPYTFVP
SSYPQQQPTSLVQPNSQMHVAGVPPAANTWPVPVNQSTSLVSPVQQTGQQTPVAVSTDPG
NLTPQSASDWQEHTSADGRKYYYNKRTKQSNWEKPLELMTPLERADASTVWKEFTTPEGK
KYYYNKVTKESKWTIPEDLKLAREQAQLASEKTSLSEAGSTPLSHHAASSSDLAVSTVTS
VVPSTSSALTGHSSSPIQAGLAVPVTRPPSVAPVTPTSGAISDTEATTIKGDNLSSRGAD
DSNDGATAQNNEAENKEMSVNGKANLSPAGDKANVEEPMVYATKQEAKAAFKSLLESVNV
HSDWTWEQTLKEIVHDKRYGALRTLGERKQAFNEYLGQRKKVEAEERRRRQKKAREEFVK
MLEECEELSSSLKWSKAMSLFENDQRFKAVDRPRDREDLFDNYIVELERKEREKAAEEHR
QYMADYRKFLETCDYIKAGTQWRKIQDRLEDDDRCSCLEKIDRLIGFEEYILDLEKEEEE
LKRVEKEHVRRAERKNRDAFRTLLEEHVAAGILTAKTYWLDYCIELKDLPQYQAVASNTS
GSTPKDLFEDVTEELEKQYHEDKSYVKDAMKSRKISMVSSWLFEDFKSAISEDLSTQQIS
DINLKLIYDDLVGRVKEKEEKEARKLQRLAEEFTNLLHTFKEITVASNWEDSKQLVEESQ
EYRSIGDESVSQGLFEEYITSLQEKAKEKERKRDEEKVRKEKERDEKEKRKDKDKERREK
EREREKEKGKERSKREESDGETAMDVSEGHKDEKRKGKDRDRKHRRRHHNNSDEDVSSDR
DDRDESKKSSRKHGNDRKKSRKHANSPESESENRHKRQKKESSRRSGNDELEDGEVGE*
>AT1G44910.2 | protein binding
MANNPPQSSGTQFRPMVPGQQGQHFVPAASQPFHPYGHVPPNVQSQPPQYSQPIQQQQLF
PVRPGQPVHITSSSQAVSVPYIQTNKILTSGSTQPQPNAPPMTGFATSGPPFSSPYTFVP
SSYPQQQPTSLVQPNSQMHVAGVPPAANTWPVPVNQSTSLVSPVQQTGQQTPVAVSTDPG
NLTPQSASDWQEHTSADGRKYYYNKRTKQSNWEKPLELMTPLERADASTVWKEFTTPEGK
KYYYNKVTKESKWTIPEDLKLAREQAQLASEKTSLSEAGSTPLSHHAASSSDLAVSTVTS
VVPSTSSALTGHSSSPIQAGLAVPVTRPPSVAPVTPTSGAISDTEATTIKGDNLSSRGAD
DSNDGATAQNNEAENKEMSVNGKANLSPAGDKANVEEPMVYATKQEAKAAFKSLLESVNV
HSDWTWEQTLKEIVHDKRYGALRTLGERKQAFNEYLGQRKKVEAEERRRRQKKAREEFVK
MLEECEELSSSLKWSKAMSLFENDQRFKAVDRPRDREDLFDNYIVELERKEREKAAEEHR
QYMADYRKFLETCDYIKAGTQWRKIQDRLEDDDRCSCLEKIDRLIGFEEYILDLEKEEEE
LKRVEKEHVRRAERKNRDAFRTLLEEHVAAGILTAKTYWLDYCIELKDLPQYQAVASNTS
GSTPKDLFEDVTEELEKQYHEDKSYVKDAMKSRKISMVSSWLFEDFKSAISEDLSTQQIS
DINLKLIYDDLVGRVKEKEEKEARKLQRLAEEFTNLLHTFKEITVASNWEDSKQLVEESQ
EYRSIGDESVSQGLFEEYITSLQEKAKEKERKRDEEKVRKEKERDEKEKRKDKDKERREK
EREREKEKGKERSKREESDGETAMDVSEGHKDEKRKGKDRDRKHRRRHHNNSDEDVSSDR
DDRDESKKSSRKHGNDRKKSRKVGTP*
>AT1G54380.1 | spliceosome protein-related
MMTNSDSDHCTPAKTSISGDGLEKESDFKKQLNSEIDPQASSSQNDAITMEDGAVSVNNR
DLQEIKESSFSKGSEQYRVDGALEESLNFEEKEQESEAQRLLEAEKRRLLAEIELGSIFR
KSVDVDTLPKIEETMDNDVDKIELVDHTALVDVVHHPKRPGTAQNEKDTPRKLKKIGDKR
NVIEGSGVENNGKQFRRLYTRKQLESMRFAHIVNQKNLWSEMYSRILPEVVTEYESLVYV
KNYKSSKSNRVRGRTESGNEENLGTEEGTEDPEDYTDDNDDYNSILRPAFEVDGEPDFST
GPPEDGLEYLRRVRWEAKGIPNVRVAKIDESTYIKKEQSVYMPLIPEIPKCPEYLLPMKE
WEDSLLLDFVHLRQTLTQSANSCEDEIISSQCVEDLLVEMFNKHLHTEEDESFGEVVTDI
QGMDSVTRVSKLKKRICLVEKESGLQSSDCKWVVALCASLETPLDADTCACLRGLLRKCA
SVRAETSLEVGDEEVITMANMLITIAGRYFGQMGQ*
>AT1G59760.1 | ATP-dependent RNA helicase putative
MGSVKRKSVEESSDSAPPQKVQREDDSTQIINEELVGCVHDVSFPENYVPLAPSVHNKPP
AKDFPFTLDSFQSEAIKCLDNGESVMVSAHTSAGKTVVASYAIAMSLKENQRVIYTSPIK
ALSNQKYRDFKEEFSDVGLMTGDVTIDPNASCLVMTTEILRSMQYKGSEIMREVAWIIFD
EVHYMRDSERGVVWEESIVMAPKNSRFVFLSATVPNAKEFADWVAKVHQQPCHIVYTDYR
PTPLQHYVFPAGGNGLYLVVDEKSKFHEDSFQKSLNALVPTNESDKKRDNGKFQKGLVIG
KLGEESDIFKLVKMIIQRQYDPVILFSFSKKECEALAMQMSKMVLNSDDEKDAVETIFAS
AIDMLSDDDKKLPQVSNILPILKRGIGVHHSGLLPILKEVIEILFQEGLIKCLFATETFS
IGLNMPAKTVVFTNVRKFDGDKFRWLSSGEYIQMSGRAGRRGIDKRGICILMVDEKMEPA
VAKSMLKGSADSLNSAFHLSYNMLLNQLRCEEGDPENLLRNSFFQFQADRAIPDLEKQIK
SLEEERDSLVIEEEESLKNYYNLILQYKSLKKDIREIVFTPKYCLPFLLPNRAVCLDCTN
DDEEPQSFSIEDQDTWGVIMKFNKVKSLSEDDDSRRPEDANYTVDVLTRCMVSKDGVGKK
KVKAVPIKERGEPVVVTVPLSQIKSLSSAIMNIPKDLVPLEARENALKKVSELLSRHPDG
IPLDPEVDMKIKSSSYKKTVRRLEALENLFEKHKIAKSPLITEKLKVLQMKEELIAKIKS
LKKTVRSSTALAFKDELKARKRVLRRLGYITSDNVVELKGKVACEISSAEELTLTELMFS
GIFKDAKVEELVSLLSCFVWRERLPDAAKPREELDLLFIQLQDTARRVAEVQLDCKVEID
VESFVQSFRPDIMEAVYAWAKGSKFYEVMEIARVFEGSLIRAIRRMEEVLQQLIVAAKSI
GETQLEAKLEEAVSKIKRDIVFAASLYL*
>AT1G60200.1 | splicing factor PWI domain-containing protein / RNA recognition motif (RRM)-containing protein
MADESSSPATGDPNSQKPESTTPISIPNPNPNPSLTPPPPQQHSQPPVAPLVPPGPPYAP
PAQIPSSLLPTNLPPPPPFRPGMQFTPVANFQNPSSGVPPPGSMPQYQPQPGMRPFQPMA
NGYPGIHGVAPPGAMPPHGLLRYPSPYPTMVRPGFIMRPPGTIGAVQLAPRPLIPGMPGL
RPVMPPMVRPASLPFVTPAEKPQTTIYIGKIATVENDFMMSILEFCGHVKSCLRAEDPTT
KKPKGFGFYEFESAEGILRAIRLLTQRTIDGQELLVNVNQATKEYLLKYVEKKIETAKKA
KESQGTKENQAEGPESEQDKLESADNETGKDGESKIKENIDIANSAVLTDEEREADREAM
EKIETAIEERLKSNPLPPPPPPPADGSGMEFAFKSKDGDSNTDVARSDAAANDVETSGEH
NRPDTSSPDWSKRNDRRSRERGEKEQEMDRYEREAERERSRKEREQRRKLEDAERAYQTR
LRQWERREREKEKERQYEKEKEKEKERKRKKEIRYEEEEEEDDDDSRRRWHRAALDERRR
RQLREKEDDLADRLKEEEEVAEAKRSAEEQNLQQQQLDALRILSGQAAIGSETVQTSPIE
NDHKATLQTVGESANEHHAADFEENGSGNESMAIDNNSGSEAHAPSKKLGFGLVGSGKRT
SVPSVFYEEDEDEARKAKKMKPLVPIDYSTEEQEAVAHGGSGNTPPHLALAAEFAKRISS
TNPKEETIETEKQRSRRSHDKASHRDRERERERDRDRDRVRDRGDGHSGPTKDAKESGKA
KIIDTKFLDAKQLIDTIPKTKEDLFSYEINWAMYDKHQVHERMRPWISKKIMEFLGEEEA
TLVDFIVSNTQQHVQASQMLELLQSILDEEAEMFVLKMWRMLIFEIKRVEAGVPVKSKA*
>AT1G60900.1 | U2 snRNP auxiliary factor large subunit putative
MMSYEGNGDGVAISTENHNENYISLESSPFHEDSKSRESHDLKKDSSKISEKDNENGRDK
DGNKDRDREKDRDREKSRDRDREKSRDRDRDRERSKDRQRDRHHRDRHRDRSRERSEKRD
DLDDDHHRRSRDRDRRRSRDRDREVRHRRRSRSRSRSRSERRSRSEHRHKSEHRSRSRSR
SRSKSKRRSGFDMAPPDMLAATAVAAAGQVPSVPTTATIPGMFSNMFPMVPGQQLGALPV
LPVQAMTQQATRHARRVYVGGLPPTANEQSVSTFFSQVMSAIGGNTAGPGDAVVNVYINH
EKKFAFVEMRSVEEASNAMALDGIILEGVPVKVRRPTDYNPSLAATLGPSQPNPNLNLGA
VGLSSGSTGGLEGPDRIFVGGLPYYFTEVQIRELLESFGPLRGFNLVKDRETGNSKGYAF
CVYQDPSVTDIACAALNGIKMGDKTLTVRRAIQGAIQPKPEQEEVLLYAQQQIALQRLMF
QPGGTPTKIVCLTQVVTADDLRDDEEYAEIMEDMRQEGGKFGNLVNVVIPRPNPDHDPTP
GVGKVFLEYADVDGSSKARSGMNGRKFGGNQVVAVYYPEDKYAQGDYED*
>AT1G61150.1 | FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s LisH dimerisation motif subgroup (InterProIPR013720) CTLH C-terminal to LisH motif (InterProIPR006595) LisH dimerisation motif (InterProIPR006594) CT11-RanBPM (InterProIPR013144) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT1G111101) Has 740 Blast hits to 708 proteins in 137 species Archae - 0 Bacteria - 0 Metazoa - 401 Fungi - 144 Plants - 139 Viruses - 0 Other Eukaryotes - 56 (source NCBI BLink)
MSLFRIFINQLEEDDEDMATSKKMITREEWEKKLNAVKLRKEDMNTLVMNFLVTEGYVEA
AEKFQRESGTKPEIDLATITDRMAVKKAVQNGNVEDAIEKVNDLNPEILDTNPELFFHLQ
QQRLIELIRQGKTEEALEFAQEELAPRGEENQAFLEELEKTVALLVFDDASTCPVKELLD
LSHRLKTASEVNAAILTSQSHEKDPKLPSLLKMLIWAQTQLDEKAVYPHINDLSTGKLED
PSE*
>AT1G61150.1 | FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s CTLH C-terminal to LisH motif (InterProIPR006595) CT11-RanBPM (InterProIPR013144) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT4G093001) Has 486 Blast hits to 476 proteins in 122 species Archae - 0 Bacteria - 4 Metazoa - 235 Fungi - 76 Plants - 132 Viruses - 0 Other Eukaryotes - 39 (source NCBI BLink)
MSLFRIFINQLEEDDEDMATSKKMITREEWEKKLNAVKLRKEDMNTLVMNFLVTEGYVEA
AEKFQRESGTKPEIDLATITDRMAVKKAVQNGNVEDAIEKVNDLNPEILDTNPELFFHLQ
QQRLIELIRQGKTEEALEFAQEELAPRGEENQAFLEELEKTVALLVFDDASTCPVKELLD
LSHRLKTASEVNAAILTSQSHEKDPKLPSLLKMLIWAQTQLDEKAVYPHINDLSTGKLED
PSE*
>AT1G61150.1 | FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s LisH dimerisation motif subgroup (InterProIPR013720) CTLH C-terminal to LisH motif (InterProIPR006595) LisH dimerisation motif (InterProIPR006594) CT11-RanBPM (InterProIPR013144) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT4G093001) Has 726 Blast hits to 694 proteins in 137 species Archae - 0 Bacteria - 0 Metazoa - 389 Fungi - 144 Plants - 139 Viruses - 0 Other Eukaryotes - 54 (source NCBI BLink)
MSLFRIFINQLEEDDEDMATSKKMITREEWEKKLNAVKLRKEDMNTLVMNFLVTEGYVEA
AEKFQRESGTKPEIDLATITDRMAVKKAVQNGNVEDAIEKVNDLNPEILDTNPELFFHLQ
QQRLIELIRQGKTEEALEFAQEELAPRGEENQAFLEELEKTVALLVFDDASTCPVKELLD
LSHRLKTASEVNAAILTSQSHEKDPKLPSLLKMLIWAQTQLDEKAVYPHINDLSTGKLED
PSE*
>AT1G61150.3 | FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s LisH dimerisation motif subgroup (InterProIPR013720) CTLH C-terminal to LisH motif (InterProIPR006595) LisH dimerisation motif (InterProIPR006594) CT11-RanBPM (InterProIPR013144) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT1G111101) Has 740 Blast hits to 708 proteins in 137 species Archae - 0 Bacteria - 0 Metazoa - 401 Fungi - 144 Plants - 139 Viruses - 0 Other Eukaryotes - 56 (source NCBI BLink)
MELVCLMFSLSTPFELPYRETAEIDLATITDRMAVKKAVQNGNVEDAIEKVNDLNPEILD
TNPELFFHLQQQRLIELIRQGKTEEALEFAQEELAPRGEENQAFLEELEKTVALLVFDDA
STCPVKELLDLSHRLKTASEVNAAILTSQSHEKDPKLPSLLKMLIWAQTQLDEKAVYPHI
NDLSTGKLEDPSE*
>AT1G61150.3 | FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s CTLH C-terminal to LisH motif (InterProIPR006595) CT11-RanBPM (InterProIPR013144) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT4G093001) Has 486 Blast hits to 476 proteins in 122 species Archae - 0 Bacteria - 4 Metazoa - 235 Fungi - 76 Plants - 132 Viruses - 0 Other Eukaryotes - 39 (source NCBI BLink)
MELVCLMFSLSTPFELPYRETAEIDLATITDRMAVKKAVQNGNVEDAIEKVNDLNPEILD
TNPELFFHLQQQRLIELIRQGKTEEALEFAQEELAPRGEENQAFLEELEKTVALLVFDDA
STCPVKELLDLSHRLKTASEVNAAILTSQSHEKDPKLPSLLKMLIWAQTQLDEKAVYPHI
NDLSTGKLEDPSE*
>AT1G61150.3 | FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s LisH dimerisation motif subgroup (InterProIPR013720) CTLH C-terminal to LisH motif (InterProIPR006595) LisH dimerisation motif (InterProIPR006594) CT11-RanBPM (InterProIPR013144) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT4G093001) Has 726 Blast hits to 694 proteins in 137 species Archae - 0 Bacteria - 0 Metazoa - 389 Fungi - 144 Plants - 139 Viruses - 0 Other Eukaryotes - 54 (source NCBI BLink)
MELVCLMFSLSTPFELPYRETAEIDLATITDRMAVKKAVQNGNVEDAIEKVNDLNPEILD
TNPELFFHLQQQRLIELIRQGKTEEALEFAQEELAPRGEENQAFLEELEKTVALLVFDDA
STCPVKELLDLSHRLKTASEVNAAILTSQSHEKDPKLPSLLKMLIWAQTQLDEKAVYPHI
NDLSTGKLEDPSE*
>AT1G61150.2 | FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s LisH dimerisation motif subgroup (InterProIPR013720) CTLH C-terminal to LisH motif (InterProIPR006595) LisH dimerisation motif (InterProIPR006594) CT11-RanBPM (InterProIPR013144) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT1G111101) Has 740 Blast hits to 708 proteins in 137 species Archae - 0 Bacteria - 0 Metazoa - 401 Fungi - 144 Plants - 139 Viruses - 0 Other Eukaryotes - 56 (source NCBI BLink)
MATSKKMITREEWEKKLNAVKLRKEDMNTLVMNFLVTEGYVEAAEKFQRESGTKPEIDLA
TITDRMAVKKAVQNGNVEDAIEKVNDLNPEILDTNPELFFHLQQQRLIELIRQGKTEEAL
EFAQEELAPRGEENQAFLEELEKTVALLVFDDASTCPVKELLDLSHRLKTASEVNAAILT
SQSHEKDPKLPSLLKMLIWAQTQLDEKAVYPHINDLSTGKLEDPSE*
>AT1G61150.2 | FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s CTLH C-terminal to LisH motif (InterProIPR006595) CT11-RanBPM (InterProIPR013144) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT4G093001) Has 486 Blast hits to 476 proteins in 122 species Archae - 0 Bacteria - 4 Metazoa - 235 Fungi - 76 Plants - 132 Viruses - 0 Other Eukaryotes - 39 (source NCBI BLink)
MATSKKMITREEWEKKLNAVKLRKEDMNTLVMNFLVTEGYVEAAEKFQRESGTKPEIDLA
TITDRMAVKKAVQNGNVEDAIEKVNDLNPEILDTNPELFFHLQQQRLIELIRQGKTEEAL
EFAQEELAPRGEENQAFLEELEKTVALLVFDDASTCPVKELLDLSHRLKTASEVNAAILT
SQSHEKDPKLPSLLKMLIWAQTQLDEKAVYPHINDLSTGKLEDPSE*
>AT1G61150.2 | FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s LisH dimerisation motif subgroup (InterProIPR013720) CTLH C-terminal to LisH motif (InterProIPR006595) LisH dimerisation motif (InterProIPR006594) CT11-RanBPM (InterProIPR013144) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT4G093001) Has 726 Blast hits to 694 proteins in 137 species Archae - 0 Bacteria - 0 Metazoa - 389 Fungi - 144 Plants - 139 Viruses - 0 Other Eukaryotes - 54 (source NCBI BLink)
MATSKKMITREEWEKKLNAVKLRKEDMNTLVMNFLVTEGYVEAAEKFQRESGTKPEIDLA
TITDRMAVKKAVQNGNVEDAIEKVNDLNPEILDTNPELFFHLQQQRLIELIRQGKTEEAL
EFAQEELAPRGEENQAFLEELEKTVALLVFDDASTCPVKELLDLSHRLKTASEVNAAILT
SQSHEKDPKLPSLLKMLIWAQTQLDEKAVYPHINDLSTGKLEDPSE*
>AT1G61150.4 | FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s LisH dimerisation motif subgroup (InterProIPR013720) CTLH C-terminal to LisH motif (InterProIPR006595) LisH dimerisation motif (InterProIPR006594) CT11-RanBPM (InterProIPR013144) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT1G111101) Has 740 Blast hits to 708 proteins in 137 species Archae - 0 Bacteria - 0 Metazoa - 401 Fungi - 144 Plants - 139 Viruses - 0 Other Eukaryotes - 56 (source NCBI BLink)
MATSKKMITREEWEKKLNAVKLRKEDMNTLVMNFLVTEGYVEAAEKFQRESGTKPEIDLA
TITDRMAVKKAVQNGNVEDAIEKVNDLNPEILDTNPELFFHLQQQRLIELIRQGKTEEAL
EFAQEELAPRGEENQAFLEELEKTVALLVFDDASTCPVKELLDLSHRLKTASEVNAAILT
SQSHEKDPKLPSLLKMLIWAQTQLDEKAVYPHINDLSTGKLEDPSE*
>AT1G61150.4 | FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s CTLH C-terminal to LisH motif (InterProIPR006595) CT11-RanBPM (InterProIPR013144) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT4G093001) Has 486 Blast hits to 476 proteins in 122 species Archae - 0 Bacteria - 4 Metazoa - 235 Fungi - 76 Plants - 132 Viruses - 0 Other Eukaryotes - 39 (source NCBI BLink)
MATSKKMITREEWEKKLNAVKLRKEDMNTLVMNFLVTEGYVEAAEKFQRESGTKPEIDLA
TITDRMAVKKAVQNGNVEDAIEKVNDLNPEILDTNPELFFHLQQQRLIELIRQGKTEEAL
EFAQEELAPRGEENQAFLEELEKTVALLVFDDASTCPVKELLDLSHRLKTASEVNAAILT
SQSHEKDPKLPSLLKMLIWAQTQLDEKAVYPHINDLSTGKLEDPSE*
>AT1G61150.4 | FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s LisH dimerisation motif subgroup (InterProIPR013720) CTLH C-terminal to LisH motif (InterProIPR006595) LisH dimerisation motif (InterProIPR006594) CT11-RanBPM (InterProIPR013144) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT4G093001) Has 726 Blast hits to 694 proteins in 137 species Archae - 0 Bacteria - 0 Metazoa - 389 Fungi - 144 Plants - 139 Viruses - 0 Other Eukaryotes - 54 (source NCBI BLink)
MATSKKMITREEWEKKLNAVKLRKEDMNTLVMNFLVTEGYVEAAEKFQRESGTKPEIDLA
TITDRMAVKKAVQNGNVEDAIEKVNDLNPEILDTNPELFFHLQQQRLIELIRQGKTEEAL
EFAQEELAPRGEENQAFLEELEKTVALLVFDDASTCPVKELLDLSHRLKTASEVNAAILT
SQSHEKDPKLPSLLKMLIWAQTQLDEKAVYPHINDLSTGKLEDPSE*
>AT1G61150.5 | FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s LisH dimerisation motif subgroup (InterProIPR013720) CTLH C-terminal to LisH motif (InterProIPR006595) LisH dimerisation motif (InterProIPR006594) CT11-RanBPM (InterProIPR013144) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT1G111101) Has 740 Blast hits to 708 proteins in 137 species Archae - 0 Bacteria - 0 Metazoa - 401 Fungi - 144 Plants - 139 Viruses - 0 Other Eukaryotes - 56 (source NCBI BLink)
MITREEWEKKLNAVKLRKEDMNTLVMNFLVTEGYVEAAEKFQRESGTKPEIDLATITDRM
AVKKAVQNGNVEDAIEKVNDLNPEILDTNPELFFHLQQQRLIELIRQGKTEEALEFAQEE
LAPRGEENQAFLEELEKTVALLVFDDASTCPVKELLDLSHRLKTASEVNAAILTSQSHEK
DPKLPSLLKMLIWAQTQLDEKAVYPHINDLSTGKLEDPSE*
>AT1G61150.5 | FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s CTLH C-terminal to LisH motif (InterProIPR006595) CT11-RanBPM (InterProIPR013144) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT4G093001) Has 486 Blast hits to 476 proteins in 122 species Archae - 0 Bacteria - 4 Metazoa - 235 Fungi - 76 Plants - 132 Viruses - 0 Other Eukaryotes - 39 (source NCBI BLink)
MITREEWEKKLNAVKLRKEDMNTLVMNFLVTEGYVEAAEKFQRESGTKPEIDLATITDRM
AVKKAVQNGNVEDAIEKVNDLNPEILDTNPELFFHLQQQRLIELIRQGKTEEALEFAQEE
LAPRGEENQAFLEELEKTVALLVFDDASTCPVKELLDLSHRLKTASEVNAAILTSQSHEK
DPKLPSLLKMLIWAQTQLDEKAVYPHINDLSTGKLEDPSE*
>AT1G61150.5 | FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s LisH dimerisation motif subgroup (InterProIPR013720) CTLH C-terminal to LisH motif (InterProIPR006595) LisH dimerisation motif (InterProIPR006594) CT11-RanBPM (InterProIPR013144) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT4G093001) Has 726 Blast hits to 694 proteins in 137 species Archae - 0 Bacteria - 0 Metazoa - 389 Fungi - 144 Plants - 139 Viruses - 0 Other Eukaryotes - 54 (source NCBI BLink)
MITREEWEKKLNAVKLRKEDMNTLVMNFLVTEGYVEAAEKFQRESGTKPEIDLATITDRM
AVKKAVQNGNVEDAIEKVNDLNPEILDTNPELFFHLQQQRLIELIRQGKTEEALEFAQEE
LAPRGEENQAFLEELEKTVALLVFDDASTCPVKELLDLSHRLKTASEVNAAILTSQSHEK
DPKLPSLLKMLIWAQTQLDEKAVYPHINDLSTGKLEDPSE*
>AT1G61150.6 | FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s LisH dimerisation motif subgroup (InterProIPR013720) CTLH C-terminal to LisH motif (InterProIPR006595) LisH dimerisation motif (InterProIPR006594) CT11-RanBPM (InterProIPR013144) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT1G111101) Has 740 Blast hits to 708 proteins in 137 species Archae - 0 Bacteria - 0 Metazoa - 401 Fungi - 144 Plants - 139 Viruses - 0 Other Eukaryotes - 56 (source NCBI BLink)
MATSKKMITREEWEKKLNAVKLRKEDMNTLVMNFLVTEGYVEAAEKFQRESGTKPEIDLA
TITDRMAVKKAVQNGNVEDAIEKVNDLNPEILDTNPELFFHLQQQRLIELIRQGKTEEAL
EFAQEELAPRGEENQAFLEELEKTVALLVFDDASTCPVKELLDLSHRLKTASEVNAAILT
SQSHEKDPKLPSLLKMLIWAQTQLDEKAVYPHINDLSTGKLEDPSE*
>AT1G61150.6 | FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s CTLH C-terminal to LisH motif (InterProIPR006595) CT11-RanBPM (InterProIPR013144) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT4G093001) Has 486 Blast hits to 476 proteins in 122 species Archae - 0 Bacteria - 4 Metazoa - 235 Fungi - 76 Plants - 132 Viruses - 0 Other Eukaryotes - 39 (source NCBI BLink)
MATSKKMITREEWEKKLNAVKLRKEDMNTLVMNFLVTEGYVEAAEKFQRESGTKPEIDLA
TITDRMAVKKAVQNGNVEDAIEKVNDLNPEILDTNPELFFHLQQQRLIELIRQGKTEEAL
EFAQEELAPRGEENQAFLEELEKTVALLVFDDASTCPVKELLDLSHRLKTASEVNAAILT
SQSHEKDPKLPSLLKMLIWAQTQLDEKAVYPHINDLSTGKLEDPSE*
>AT1G61150.6 | FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s LisH dimerisation motif subgroup (InterProIPR013720) CTLH C-terminal to LisH motif (InterProIPR006595) LisH dimerisation motif (InterProIPR006594) CT11-RanBPM (InterProIPR013144) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT4G093001) Has 726 Blast hits to 694 proteins in 137 species Archae - 0 Bacteria - 0 Metazoa - 389 Fungi - 144 Plants - 139 Viruses - 0 Other Eukaryotes - 54 (source NCBI BLink)
MATSKKMITREEWEKKLNAVKLRKEDMNTLVMNFLVTEGYVEAAEKFQRESGTKPEIDLA
TITDRMAVKKAVQNGNVEDAIEKVNDLNPEILDTNPELFFHLQQQRLIELIRQGKTEEAL
EFAQEELAPRGEENQAFLEELEKTVALLVFDDASTCPVKELLDLSHRLKTASEVNAAILT
SQSHEKDPKLPSLLKMLIWAQTQLDEKAVYPHINDLSTGKLEDPSE*
>AT1G61150.7 | FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s LisH dimerisation motif subgroup (InterProIPR013720) CTLH C-terminal to LisH motif (InterProIPR006595) LisH dimerisation motif (InterProIPR006594) CT11-RanBPM (InterProIPR013144) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT1G111101) Has 740 Blast hits to 708 proteins in 137 species Archae - 0 Bacteria - 0 Metazoa - 401 Fungi - 144 Plants - 139 Viruses - 0 Other Eukaryotes - 56 (source NCBI BLink)
MELVCLMFSLSTPFELPYRETAEIDLATITDRMAVKKAVQNGNVEDAIEKVNDLNPEILD
TNPELFFHLQQQRLIELIRQGKTEEALEFAQEELAPRGEENQAFLEELEKTVALLVFDDA
STCPVKELLDLSHRLKTASEVNAAILTSQSHEKDPKLPSLLKMLIWAQTQLDEKAVYPHI
NDLSTGKLEDPSE*
>AT1G61150.7 | FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s CTLH C-terminal to LisH motif (InterProIPR006595) CT11-RanBPM (InterProIPR013144) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT4G093001) Has 486 Blast hits to 476 proteins in 122 species Archae - 0 Bacteria - 4 Metazoa - 235 Fungi - 76 Plants - 132 Viruses - 0 Other Eukaryotes - 39 (source NCBI BLink)
MELVCLMFSLSTPFELPYRETAEIDLATITDRMAVKKAVQNGNVEDAIEKVNDLNPEILD
TNPELFFHLQQQRLIELIRQGKTEEALEFAQEELAPRGEENQAFLEELEKTVALLVFDDA
STCPVKELLDLSHRLKTASEVNAAILTSQSHEKDPKLPSLLKMLIWAQTQLDEKAVYPHI
NDLSTGKLEDPSE*
>AT1G61150.7 | FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s LisH dimerisation motif subgroup (InterProIPR013720) CTLH C-terminal to LisH motif (InterProIPR006595) LisH dimerisation motif (InterProIPR006594) CT11-RanBPM (InterProIPR013144) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT4G093001) Has 726 Blast hits to 694 proteins in 137 species Archae - 0 Bacteria - 0 Metazoa - 389 Fungi - 144 Plants - 139 Viruses - 0 Other Eukaryotes - 54 (source NCBI BLink)
MELVCLMFSLSTPFELPYRETAEIDLATITDRMAVKKAVQNGNVEDAIEKVNDLNPEILD
TNPELFFHLQQQRLIELIRQGKTEEALEFAQEELAPRGEENQAFLEELEKTVALLVFDDA
STCPVKELLDLSHRLKTASEVNAAILTSQSHEKDPKLPSLLKMLIWAQTQLDEKAVYPHI
NDLSTGKLEDPSE*
>AT1G61150.1 | FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s LisH dimerisation motif subgroup (InterProIPR013720) CTLH C-terminal to LisH motif (InterProIPR006595) LisH dimerisation motif (InterProIPR006594) CT11-RanBPM (InterProIPR013144) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT4G093001) Has 726 Blast hits to 694 proteins in 137 species Archae - 0 Bacteria - 0 Metazoa - 389 Fungi - 144 Plants - 139 Viruses - 0 Other Eukaryotes - 54 (source NCBI BLink)
MSLFRIFINQLEEDDEDMATSKKMITREEWEKKLNAVKLRKEDMNTLVMNFLVTEGYVEA
AEKFQRESGTKPEIDLATITDRMAVKKAVQNGNVEDAIEKVNDLNPEILDTNPELFFHLQ
QQRLIELIRQGKTEEALEFAQEELAPRGEENQAFLEELEKTVALLVFDDASTCPVKELLD
LSHRLKTASEVNAAILTSQSHEKDPKLPSLLKMLIWAQTQLDEKAVYPHINDLSTGKLED
PSE*
>AT1G61150.1 | FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s LisH dimerisation motif subgroup (InterProIPR013720) CTLH C-terminal to LisH motif (InterProIPR006595) LisH dimerisation motif (InterProIPR006594) CT11-RanBPM (InterProIPR013144) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT4G093001) Has 726 Blast hits to 694 proteins in 137 species Archae - 0 Bacteria - 0 Metazoa - 389 Fungi - 144 Plants - 139 Viruses - 0 Other Eukaryotes - 54 (source NCBI BLink)
MSLFRIFINQLEEDDEDMATSKKMITREEWEKKLNAVKLRKEDMNTLVMNFLVTEGYVEA
AEKFQRESGTKPEIDLATITDRMAVKKAVQNGNVEDAIEKVNDLNPEILDTNPELFFHLQ
QQRLIELIRQGKTEEALEFAQEELAPRGEENQAFLEELEKTVALLVFDDASTCPVKELLD
LSHRLKTASEVNAAILTSQSHEKDPKLPSLLKMLIWAQTQLDEKAVYPHINDLSTGKLED
PSE*
>AT1G61150.3 | FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s LisH dimerisation motif subgroup (InterProIPR013720) CTLH C-terminal to LisH motif (InterProIPR006595) LisH dimerisation motif (InterProIPR006594) CT11-RanBPM (InterProIPR013144) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT4G093001) Has 726 Blast hits to 694 proteins in 137 species Archae - 0 Bacteria - 0 Metazoa - 389 Fungi - 144 Plants - 139 Viruses - 0 Other Eukaryotes - 54 (source NCBI BLink)
MELVCLMFSLSTPFELPYRETAEIDLATITDRMAVKKAVQNGNVEDAIEKVNDLNPEILD
TNPELFFHLQQQRLIELIRQGKTEEALEFAQEELAPRGEENQAFLEELEKTVALLVFDDA
STCPVKELLDLSHRLKTASEVNAAILTSQSHEKDPKLPSLLKMLIWAQTQLDEKAVYPHI
NDLSTGKLEDPSE*
>AT1G61150.3 | FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s LisH dimerisation motif subgroup (InterProIPR013720) CTLH C-terminal to LisH motif (InterProIPR006595) LisH dimerisation motif (InterProIPR006594) CT11-RanBPM (InterProIPR013144) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT4G093001) Has 726 Blast hits to 694 proteins in 137 species Archae - 0 Bacteria - 0 Metazoa - 389 Fungi - 144 Plants - 139 Viruses - 0 Other Eukaryotes - 54 (source NCBI BLink)
MELVCLMFSLSTPFELPYRETAEIDLATITDRMAVKKAVQNGNVEDAIEKVNDLNPEILD
TNPELFFHLQQQRLIELIRQGKTEEALEFAQEELAPRGEENQAFLEELEKTVALLVFDDA
STCPVKELLDLSHRLKTASEVNAAILTSQSHEKDPKLPSLLKMLIWAQTQLDEKAVYPHI
NDLSTGKLEDPSE*
>AT1G61150.2 | FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s LisH dimerisation motif subgroup (InterProIPR013720) CTLH C-terminal to LisH motif (InterProIPR006595) LisH dimerisation motif (InterProIPR006594) CT11-RanBPM (InterProIPR013144) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT4G093001) Has 726 Blast hits to 694 proteins in 137 species Archae - 0 Bacteria - 0 Metazoa - 389 Fungi - 144 Plants - 139 Viruses - 0 Other Eukaryotes - 54 (source NCBI BLink)
MATSKKMITREEWEKKLNAVKLRKEDMNTLVMNFLVTEGYVEAAEKFQRESGTKPEIDLA
TITDRMAVKKAVQNGNVEDAIEKVNDLNPEILDTNPELFFHLQQQRLIELIRQGKTEEAL
EFAQEELAPRGEENQAFLEELEKTVALLVFDDASTCPVKELLDLSHRLKTASEVNAAILT
SQSHEKDPKLPSLLKMLIWAQTQLDEKAVYPHINDLSTGKLEDPSE*
>AT1G61150.2 | FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s LisH dimerisation motif subgroup (InterProIPR013720) CTLH C-terminal to LisH motif (InterProIPR006595) LisH dimerisation motif (InterProIPR006594) CT11-RanBPM (InterProIPR013144) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT4G093001) Has 726 Blast hits to 694 proteins in 137 species Archae - 0 Bacteria - 0 Metazoa - 389 Fungi - 144 Plants - 139 Viruses - 0 Other Eukaryotes - 54 (source NCBI BLink)
MATSKKMITREEWEKKLNAVKLRKEDMNTLVMNFLVTEGYVEAAEKFQRESGTKPEIDLA
TITDRMAVKKAVQNGNVEDAIEKVNDLNPEILDTNPELFFHLQQQRLIELIRQGKTEEAL
EFAQEELAPRGEENQAFLEELEKTVALLVFDDASTCPVKELLDLSHRLKTASEVNAAILT
SQSHEKDPKLPSLLKMLIWAQTQLDEKAVYPHINDLSTGKLEDPSE*
>AT1G61150.4 | FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s LisH dimerisation motif subgroup (InterProIPR013720) CTLH C-terminal to LisH motif (InterProIPR006595) LisH dimerisation motif (InterProIPR006594) CT11-RanBPM (InterProIPR013144) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT4G093001) Has 726 Blast hits to 694 proteins in 137 species Archae - 0 Bacteria - 0 Metazoa - 389 Fungi - 144 Plants - 139 Viruses - 0 Other Eukaryotes - 54 (source NCBI BLink)
MATSKKMITREEWEKKLNAVKLRKEDMNTLVMNFLVTEGYVEAAEKFQRESGTKPEIDLA
TITDRMAVKKAVQNGNVEDAIEKVNDLNPEILDTNPELFFHLQQQRLIELIRQGKTEEAL
EFAQEELAPRGEENQAFLEELEKTVALLVFDDASTCPVKELLDLSHRLKTASEVNAAILT
SQSHEKDPKLPSLLKMLIWAQTQLDEKAVYPHINDLSTGKLEDPSE*
>AT1G61150.4 | FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s LisH dimerisation motif subgroup (InterProIPR013720) CTLH C-terminal to LisH motif (InterProIPR006595) LisH dimerisation motif (InterProIPR006594) CT11-RanBPM (InterProIPR013144) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT4G093001) Has 726 Blast hits to 694 proteins in 137 species Archae - 0 Bacteria - 0 Metazoa - 389 Fungi - 144 Plants - 139 Viruses - 0 Other Eukaryotes - 54 (source NCBI BLink)
MATSKKMITREEWEKKLNAVKLRKEDMNTLVMNFLVTEGYVEAAEKFQRESGTKPEIDLA
TITDRMAVKKAVQNGNVEDAIEKVNDLNPEILDTNPELFFHLQQQRLIELIRQGKTEEAL
EFAQEELAPRGEENQAFLEELEKTVALLVFDDASTCPVKELLDLSHRLKTASEVNAAILT
SQSHEKDPKLPSLLKMLIWAQTQLDEKAVYPHINDLSTGKLEDPSE*
>AT1G61150.5 | FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s LisH dimerisation motif subgroup (InterProIPR013720) CTLH C-terminal to LisH motif (InterProIPR006595) LisH dimerisation motif (InterProIPR006594) CT11-RanBPM (InterProIPR013144) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT4G093001) Has 726 Blast hits to 694 proteins in 137 species Archae - 0 Bacteria - 0 Metazoa - 389 Fungi - 144 Plants - 139 Viruses - 0 Other Eukaryotes - 54 (source NCBI BLink)
MITREEWEKKLNAVKLRKEDMNTLVMNFLVTEGYVEAAEKFQRESGTKPEIDLATITDRM
AVKKAVQNGNVEDAIEKVNDLNPEILDTNPELFFHLQQQRLIELIRQGKTEEALEFAQEE
LAPRGEENQAFLEELEKTVALLVFDDASTCPVKELLDLSHRLKTASEVNAAILTSQSHEK
DPKLPSLLKMLIWAQTQLDEKAVYPHINDLSTGKLEDPSE*
>AT1G61150.5 | FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s LisH dimerisation motif subgroup (InterProIPR013720) CTLH C-terminal to LisH motif (InterProIPR006595) LisH dimerisation motif (InterProIPR006594) CT11-RanBPM (InterProIPR013144) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT4G093001) Has 726 Blast hits to 694 proteins in 137 species Archae - 0 Bacteria - 0 Metazoa - 389 Fungi - 144 Plants - 139 Viruses - 0 Other Eukaryotes - 54 (source NCBI BLink)
MITREEWEKKLNAVKLRKEDMNTLVMNFLVTEGYVEAAEKFQRESGTKPEIDLATITDRM
AVKKAVQNGNVEDAIEKVNDLNPEILDTNPELFFHLQQQRLIELIRQGKTEEALEFAQEE
LAPRGEENQAFLEELEKTVALLVFDDASTCPVKELLDLSHRLKTASEVNAAILTSQSHEK
DPKLPSLLKMLIWAQTQLDEKAVYPHINDLSTGKLEDPSE*
>AT1G61150.6 | FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s LisH dimerisation motif subgroup (InterProIPR013720) CTLH C-terminal to LisH motif (InterProIPR006595) LisH dimerisation motif (InterProIPR006594) CT11-RanBPM (InterProIPR013144) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT4G093001) Has 726 Blast hits to 694 proteins in 137 species Archae - 0 Bacteria - 0 Metazoa - 389 Fungi - 144 Plants - 139 Viruses - 0 Other Eukaryotes - 54 (source NCBI BLink)
MATSKKMITREEWEKKLNAVKLRKEDMNTLVMNFLVTEGYVEAAEKFQRESGTKPEIDLA
TITDRMAVKKAVQNGNVEDAIEKVNDLNPEILDTNPELFFHLQQQRLIELIRQGKTEEAL
EFAQEELAPRGEENQAFLEELEKTVALLVFDDASTCPVKELLDLSHRLKTASEVNAAILT
SQSHEKDPKLPSLLKMLIWAQTQLDEKAVYPHINDLSTGKLEDPSE*
>AT1G61150.6 | FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s LisH dimerisation motif subgroup (InterProIPR013720) CTLH C-terminal to LisH motif (InterProIPR006595) LisH dimerisation motif (InterProIPR006594) CT11-RanBPM (InterProIPR013144) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT4G093001) Has 726 Blast hits to 694 proteins in 137 species Archae - 0 Bacteria - 0 Metazoa - 389 Fungi - 144 Plants - 139 Viruses - 0 Other Eukaryotes - 54 (source NCBI BLink)
MATSKKMITREEWEKKLNAVKLRKEDMNTLVMNFLVTEGYVEAAEKFQRESGTKPEIDLA
TITDRMAVKKAVQNGNVEDAIEKVNDLNPEILDTNPELFFHLQQQRLIELIRQGKTEEAL
EFAQEELAPRGEENQAFLEELEKTVALLVFDDASTCPVKELLDLSHRLKTASEVNAAILT
SQSHEKDPKLPSLLKMLIWAQTQLDEKAVYPHINDLSTGKLEDPSE*
>AT1G61150.7 | FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s LisH dimerisation motif subgroup (InterProIPR013720) CTLH C-terminal to LisH motif (InterProIPR006595) LisH dimerisation motif (InterProIPR006594) CT11-RanBPM (InterProIPR013144) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT4G093001) Has 726 Blast hits to 694 proteins in 137 species Archae - 0 Bacteria - 0 Metazoa - 389 Fungi - 144 Plants - 139 Viruses - 0 Other Eukaryotes - 54 (source NCBI BLink)
MELVCLMFSLSTPFELPYRETAEIDLATITDRMAVKKAVQNGNVEDAIEKVNDLNPEILD
TNPELFFHLQQQRLIELIRQGKTEEALEFAQEELAPRGEENQAFLEELEKTVALLVFDDA
STCPVKELLDLSHRLKTASEVNAAILTSQSHEKDPKLPSLLKMLIWAQTQLDEKAVYPHI
NDLSTGKLEDPSE*
>AT1G61150.7 | FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s LisH dimerisation motif subgroup (InterProIPR013720) CTLH C-terminal to LisH motif (InterProIPR006595) LisH dimerisation motif (InterProIPR006594) CT11-RanBPM (InterProIPR013144) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT4G093001) Has 726 Blast hits to 694 proteins in 137 species Archae - 0 Bacteria - 0 Metazoa - 389 Fungi - 144 Plants - 139 Viruses - 0 Other Eukaryotes - 54 (source NCBI BLink)
MELVCLMFSLSTPFELPYRETAEIDLATITDRMAVKKAVQNGNVEDAIEKVNDLNPEILD
TNPELFFHLQQQRLIELIRQGKTEEALEFAQEELAPRGEENQAFLEELEKTVALLVFDDA
STCPVKELLDLSHRLKTASEVNAAILTSQSHEKDPKLPSLLKMLIWAQTQLDEKAVYPHI
NDLSTGKLEDPSE*
>AT1G61150.1 | FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s LisH dimerisation motif subgroup (InterProIPR013720) CTLH C-terminal to LisH motif (InterProIPR006595) LisH dimerisation motif (InterProIPR006594) CT11-RanBPM (InterProIPR013144) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT4G093001) Has 726 Blast hits to 694 proteins in 137 species Archae - 0 Bacteria - 0 Metazoa - 389 Fungi - 144 Plants - 139 Viruses - 0 Other Eukaryotes - 54 (source NCBI BLink)
MSLFRIFINQLEEDDEDMATSKKMITREEWEKKLNAVKLRKEDMNTLVMNFLVTEGYVEA
AEKFQRESGTKPEIDLATITDRMAVKKAVQNGNVEDAIEKVNDLNPEILDTNPELFFHLQ
QQRLIELIRQGKTEEALEFAQEELAPRGEENQAFLEELEKTVALLVFDDASTCPVKELLD
LSHRLKTASEVNAAILTSQSHEKDPKLPSLLKMLIWAQTQLDEKAVYPHINDLSTGKLED
PSE*
>AT1G61150.1 | FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s CTLH C-terminal to LisH motif (InterProIPR006595) CT11-RanBPM (InterProIPR013144) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT4G093001) Has 486 Blast hits to 476 proteins in 122 species Archae - 0 Bacteria - 4 Metazoa - 235 Fungi - 76 Plants - 132 Viruses - 0 Other Eukaryotes - 39 (source NCBI BLink)
MSLFRIFINQLEEDDEDMATSKKMITREEWEKKLNAVKLRKEDMNTLVMNFLVTEGYVEA
AEKFQRESGTKPEIDLATITDRMAVKKAVQNGNVEDAIEKVNDLNPEILDTNPELFFHLQ
QQRLIELIRQGKTEEALEFAQEELAPRGEENQAFLEELEKTVALLVFDDASTCPVKELLD
LSHRLKTASEVNAAILTSQSHEKDPKLPSLLKMLIWAQTQLDEKAVYPHINDLSTGKLED
PSE*
>AT1G61150.3 | FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s LisH dimerisation motif subgroup (InterProIPR013720) CTLH C-terminal to LisH motif (InterProIPR006595) LisH dimerisation motif (InterProIPR006594) CT11-RanBPM (InterProIPR013144) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT4G093001) Has 726 Blast hits to 694 proteins in 137 species Archae - 0 Bacteria - 0 Metazoa - 389 Fungi - 144 Plants - 139 Viruses - 0 Other Eukaryotes - 54 (source NCBI BLink)
MELVCLMFSLSTPFELPYRETAEIDLATITDRMAVKKAVQNGNVEDAIEKVNDLNPEILD
TNPELFFHLQQQRLIELIRQGKTEEALEFAQEELAPRGEENQAFLEELEKTVALLVFDDA
STCPVKELLDLSHRLKTASEVNAAILTSQSHEKDPKLPSLLKMLIWAQTQLDEKAVYPHI
NDLSTGKLEDPSE*
>AT1G61150.3 | FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s CTLH C-terminal to LisH motif (InterProIPR006595) CT11-RanBPM (InterProIPR013144) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT4G093001) Has 486 Blast hits to 476 proteins in 122 species Archae - 0 Bacteria - 4 Metazoa - 235 Fungi - 76 Plants - 132 Viruses - 0 Other Eukaryotes - 39 (source NCBI BLink)
MELVCLMFSLSTPFELPYRETAEIDLATITDRMAVKKAVQNGNVEDAIEKVNDLNPEILD
TNPELFFHLQQQRLIELIRQGKTEEALEFAQEELAPRGEENQAFLEELEKTVALLVFDDA
STCPVKELLDLSHRLKTASEVNAAILTSQSHEKDPKLPSLLKMLIWAQTQLDEKAVYPHI
NDLSTGKLEDPSE*
>AT1G61150.2 | FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s LisH dimerisation motif subgroup (InterProIPR013720) CTLH C-terminal to LisH motif (InterProIPR006595) LisH dimerisation motif (InterProIPR006594) CT11-RanBPM (InterProIPR013144) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT4G093001) Has 726 Blast hits to 694 proteins in 137 species Archae - 0 Bacteria - 0 Metazoa - 389 Fungi - 144 Plants - 139 Viruses - 0 Other Eukaryotes - 54 (source NCBI BLink)
MATSKKMITREEWEKKLNAVKLRKEDMNTLVMNFLVTEGYVEAAEKFQRESGTKPEIDLA
TITDRMAVKKAVQNGNVEDAIEKVNDLNPEILDTNPELFFHLQQQRLIELIRQGKTEEAL
EFAQEELAPRGEENQAFLEELEKTVALLVFDDASTCPVKELLDLSHRLKTASEVNAAILT
SQSHEKDPKLPSLLKMLIWAQTQLDEKAVYPHINDLSTGKLEDPSE*
>AT1G61150.2 | FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s CTLH C-terminal to LisH motif (InterProIPR006595) CT11-RanBPM (InterProIPR013144) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT4G093001) Has 486 Blast hits to 476 proteins in 122 species Archae - 0 Bacteria - 4 Metazoa - 235 Fungi - 76 Plants - 132 Viruses - 0 Other Eukaryotes - 39 (source NCBI BLink)
MATSKKMITREEWEKKLNAVKLRKEDMNTLVMNFLVTEGYVEAAEKFQRESGTKPEIDLA
TITDRMAVKKAVQNGNVEDAIEKVNDLNPEILDTNPELFFHLQQQRLIELIRQGKTEEAL
EFAQEELAPRGEENQAFLEELEKTVALLVFDDASTCPVKELLDLSHRLKTASEVNAAILT
SQSHEKDPKLPSLLKMLIWAQTQLDEKAVYPHINDLSTGKLEDPSE*
>AT1G61150.4 | FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s LisH dimerisation motif subgroup (InterProIPR013720) CTLH C-terminal to LisH motif (InterProIPR006595) LisH dimerisation motif (InterProIPR006594) CT11-RanBPM (InterProIPR013144) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT4G093001) Has 726 Blast hits to 694 proteins in 137 species Archae - 0 Bacteria - 0 Metazoa - 389 Fungi - 144 Plants - 139 Viruses - 0 Other Eukaryotes - 54 (source NCBI BLink)
MATSKKMITREEWEKKLNAVKLRKEDMNTLVMNFLVTEGYVEAAEKFQRESGTKPEIDLA
TITDRMAVKKAVQNGNVEDAIEKVNDLNPEILDTNPELFFHLQQQRLIELIRQGKTEEAL
EFAQEELAPRGEENQAFLEELEKTVALLVFDDASTCPVKELLDLSHRLKTASEVNAAILT
SQSHEKDPKLPSLLKMLIWAQTQLDEKAVYPHINDLSTGKLEDPSE*
>AT1G61150.4 | FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s CTLH C-terminal to LisH motif (InterProIPR006595) CT11-RanBPM (InterProIPR013144) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT4G093001) Has 486 Blast hits to 476 proteins in 122 species Archae - 0 Bacteria - 4 Metazoa - 235 Fungi - 76 Plants - 132 Viruses - 0 Other Eukaryotes - 39 (source NCBI BLink)
MATSKKMITREEWEKKLNAVKLRKEDMNTLVMNFLVTEGYVEAAEKFQRESGTKPEIDLA
TITDRMAVKKAVQNGNVEDAIEKVNDLNPEILDTNPELFFHLQQQRLIELIRQGKTEEAL
EFAQEELAPRGEENQAFLEELEKTVALLVFDDASTCPVKELLDLSHRLKTASEVNAAILT
SQSHEKDPKLPSLLKMLIWAQTQLDEKAVYPHINDLSTGKLEDPSE*
>AT1G61150.5 | FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s LisH dimerisation motif subgroup (InterProIPR013720) CTLH C-terminal to LisH motif (InterProIPR006595) LisH dimerisation motif (InterProIPR006594) CT11-RanBPM (InterProIPR013144) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT4G093001) Has 726 Blast hits to 694 proteins in 137 species Archae - 0 Bacteria - 0 Metazoa - 389 Fungi - 144 Plants - 139 Viruses - 0 Other Eukaryotes - 54 (source NCBI BLink)
MITREEWEKKLNAVKLRKEDMNTLVMNFLVTEGYVEAAEKFQRESGTKPEIDLATITDRM
AVKKAVQNGNVEDAIEKVNDLNPEILDTNPELFFHLQQQRLIELIRQGKTEEALEFAQEE
LAPRGEENQAFLEELEKTVALLVFDDASTCPVKELLDLSHRLKTASEVNAAILTSQSHEK
DPKLPSLLKMLIWAQTQLDEKAVYPHINDLSTGKLEDPSE*
>AT1G61150.5 | FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s CTLH C-terminal to LisH motif (InterProIPR006595) CT11-RanBPM (InterProIPR013144) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT4G093001) Has 486 Blast hits to 476 proteins in 122 species Archae - 0 Bacteria - 4 Metazoa - 235 Fungi - 76 Plants - 132 Viruses - 0 Other Eukaryotes - 39 (source NCBI BLink)
MITREEWEKKLNAVKLRKEDMNTLVMNFLVTEGYVEAAEKFQRESGTKPEIDLATITDRM
AVKKAVQNGNVEDAIEKVNDLNPEILDTNPELFFHLQQQRLIELIRQGKTEEALEFAQEE
LAPRGEENQAFLEELEKTVALLVFDDASTCPVKELLDLSHRLKTASEVNAAILTSQSHEK
DPKLPSLLKMLIWAQTQLDEKAVYPHINDLSTGKLEDPSE*
>AT1G61150.6 | FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s LisH dimerisation motif subgroup (InterProIPR013720) CTLH C-terminal to LisH motif (InterProIPR006595) LisH dimerisation motif (InterProIPR006594) CT11-RanBPM (InterProIPR013144) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT4G093001) Has 726 Blast hits to 694 proteins in 137 species Archae - 0 Bacteria - 0 Metazoa - 389 Fungi - 144 Plants - 139 Viruses - 0 Other Eukaryotes - 54 (source NCBI BLink)
MATSKKMITREEWEKKLNAVKLRKEDMNTLVMNFLVTEGYVEAAEKFQRESGTKPEIDLA
TITDRMAVKKAVQNGNVEDAIEKVNDLNPEILDTNPELFFHLQQQRLIELIRQGKTEEAL
EFAQEELAPRGEENQAFLEELEKTVALLVFDDASTCPVKELLDLSHRLKTASEVNAAILT
SQSHEKDPKLPSLLKMLIWAQTQLDEKAVYPHINDLSTGKLEDPSE*
>AT1G61150.6 | FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s CTLH C-terminal to LisH motif (InterProIPR006595) CT11-RanBPM (InterProIPR013144) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT4G093001) Has 486 Blast hits to 476 proteins in 122 species Archae - 0 Bacteria - 4 Metazoa - 235 Fungi - 76 Plants - 132 Viruses - 0 Other Eukaryotes - 39 (source NCBI BLink)
MATSKKMITREEWEKKLNAVKLRKEDMNTLVMNFLVTEGYVEAAEKFQRESGTKPEIDLA
TITDRMAVKKAVQNGNVEDAIEKVNDLNPEILDTNPELFFHLQQQRLIELIRQGKTEEAL
EFAQEELAPRGEENQAFLEELEKTVALLVFDDASTCPVKELLDLSHRLKTASEVNAAILT
SQSHEKDPKLPSLLKMLIWAQTQLDEKAVYPHINDLSTGKLEDPSE*
>AT1G61150.7 | FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s LisH dimerisation motif subgroup (InterProIPR013720) CTLH C-terminal to LisH motif (InterProIPR006595) LisH dimerisation motif (InterProIPR006594) CT11-RanBPM (InterProIPR013144) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT4G093001) Has 726 Blast hits to 694 proteins in 137 species Archae - 0 Bacteria - 0 Metazoa - 389 Fungi - 144 Plants - 139 Viruses - 0 Other Eukaryotes - 54 (source NCBI BLink)
MELVCLMFSLSTPFELPYRETAEIDLATITDRMAVKKAVQNGNVEDAIEKVNDLNPEILD
TNPELFFHLQQQRLIELIRQGKTEEALEFAQEELAPRGEENQAFLEELEKTVALLVFDDA
STCPVKELLDLSHRLKTASEVNAAILTSQSHEKDPKLPSLLKMLIWAQTQLDEKAVYPHI
NDLSTGKLEDPSE*
>AT1G61150.7 | FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s CTLH C-terminal to LisH motif (InterProIPR006595) CT11-RanBPM (InterProIPR013144) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT4G093001) Has 486 Blast hits to 476 proteins in 122 species Archae - 0 Bacteria - 4 Metazoa - 235 Fungi - 76 Plants - 132 Viruses - 0 Other Eukaryotes - 39 (source NCBI BLink)
MELVCLMFSLSTPFELPYRETAEIDLATITDRMAVKKAVQNGNVEDAIEKVNDLNPEILD
TNPELFFHLQQQRLIELIRQGKTEEALEFAQEELAPRGEENQAFLEELEKTVALLVFDDA
STCPVKELLDLSHRLKTASEVNAAILTSQSHEKDPKLPSLLKMLIWAQTQLDEKAVYPHI
NDLSTGKLEDPSE*
>AT1G67580.1 | protein kinase family protein
MAAGRNIRYPDHELRDQESNSRFSRRDSAYANEDYNHVRNGAIDNGKGRVSNLRHGDRDR
IKSGARQEENKMVSSGFRLSKSNPGSREVFIDLGPKRCGFSARSVDREPGELSSESGSDD
LIESESLAKVNGVVKEVENRAQSPVEKKRKFSPIVWDRDDHERSNLSRNEKPVEVTPLPP
PPPLVKRSSQSPSVSCGGNSHYSPAKSDMHQDPVEVGVSAVSMPALSPSVEMSSLCVVEQ
SSNAEQDDKQEHATHLEEDENMPTRHISSSRWAAGNSSPTDEVEIVEEVGEKKRRKKPFP
VQGRFRNTSQTPEVGELVREGYRSSDSDERGHHSLPGSRDDFEERDAVKSDKMEIDEEEH
RRENSVDSLSETDSDDEYVRHETPEPASTPLRSINMLQGCRSVDEFERLNKIDEGTYGVV
YRAKDKKTGEIVALKKVKMEKEREGFPLTSLREINILLSFHHPSIVDVKEVVVGSSLDSI
FMVMEYMEHDLKALMETMKQRFSQSEVKCLMLQLLEGVKYLHDNWVLHRDLKTSNLLLNN
RGELKICDFGLARQYGSPLKPYTHLVVTLWYRAPELLLGAKQYSTAIDMWSLGCIMAELL
MKAPLFNGKTEFDQLDKIFRILGTPNESIWPGFSKLPGVKVNFVKHQYNLLRKKFPATSF
TGAPVLSDAGFDLLNKLLTYDPERRITVNEALKHDWFREVPLPKSKDFMPTFPAQHAQDR
RGRRMVKSPDPLEEQRRKELTQTELGSGGLFG*
>AT1G67580.1 | protein kinase family protein
MAAGRNIRYPDHELRDQESNSRFSRRDSAYANEDYNHVRNGAIDNGKGRVSNLRHGDRDR
IKSGARQEENKMVSSGFRLSKSNPGSREVFIDLGPKRCGFSARSVDREPGELSSESGSDD
LIESESLAKVNGVVKEVENRAQSPVEKKRKFSPIVWDRDDHERSNLSRNEKPVEVTPLPP
PPPLVKRSSQSPSVSCGGNSHYSPAKSDMHQDPVEVGVSAVSMPALSPSVEMSSLCVVEQ
SSNAEQDDKQEHATHLEEDENMPTRHISSSRWAAGNSSPTDEVEIVEEVGEKKRRKKPFP
VQGRFRNTSQTPEVGELVREGYRSSDSDERGHHSLPGSRDDFEERDAVKSDKMEIDEEEH
RRENSVDSLSETDSDDEYVRHETPEPASTPLRSINMLQGCRSVDEFERLNKIDEGTYGVV
YRAKDKKTGEIVALKKVKMEKEREGFPLTSLREINILLSFHHPSIVDVKEVVVGSSLDSI
FMVMEYMEHDLKALMETMKQRFSQSEVKCLMLQLLEGVKYLHDNWVLHRDLKTSNLLLNN
RGELKICDFGLARQYGSPLKPYTHLVVTLWYRAPELLLGAKQYSTAIDMWSLGCIMAELL
MKAPLFNGKTEFDQLDKIFRILGTPNESIWPGFSKLPGVKVNFVKHQYNLLRKKFPATSF
TGAPVLSDAGFDLLNKLLTYDPERRITVNEALKHDWFREVPLPKSKDFMPTFPAQHAQDR
RGRRMVKSPDPLEEQRRKELTQTELGSGGLFG*
>AT1G67580.2 | protein kinase family protein
MAAGRNIRYPDHELRDQESNSRFSRRDSAYANEDYNHVRNGAIDNGKGRVSNLRHGDRDR
IKSGARQEENKMVSSGFRLSKSNPGSREVFIDLGPKRCGFSARSVDREPGELSSESGSDD
LIESESLAKVNGVVKEVENRAQSPVEKKRKFSPIVWDRDDHERSNLSRNEKPVEVTPLPP
PPPLVKRSSQSPSVSCGGNSHYSPAKSDMHQDPVEVGVSAVSMPALSPSVEMSSLCVVEQ
SSNAEQDDKQEHATHLEEDENMPTRHISSSRWAAGNSSPTDEVEIVEEVGEKKRRKKPFP
VQGRFRNTSQTPEVGELVREGYRSSDSDERGHHSLPGSRDDFEERDAVKSDKMEIDEEEH
RRENSVDSLSETDSDDEYVRHETPEPASTPLRSINMLQGCRSVDEFERLNKIDEGTYGVV
YRAKDKKTGEIVALKKVKMEKEREGFPLTSLREINILLSFHHPSIVDVKEVVVGSSLDSI
FMVMEYMEHDLKALMETMKQRFSQSEVKCLMLQLLEGVKYLHDNWVLHRDLKTSNLLLNN
RGELKICDFGLARQYGSPLKPYTHLVVTLWYRAPELLLGAKQYSTAIDMWSLGCIMAELL
MKAPLFNGKTEFDQLDKIFRILGTPNESIWPGFSKLPGVKVNFVKHQYNLLRKKFPATSF
TGAPVLSDAGFDLLNKLLTYDPERRITVNEALKHDWFREVPLPKSKDFMPTFPAQHAQDR
RGRRMVKSPDPLEEQRRKELTQTELGSGGLFG*
>AT2G29210.1 | splicing factor PWI domain-containing protein
MSGGFFRGTSAEQDTRFSNKQAKLMKSQKFAPELENLVDITKVKMDVMKPWIATRVTELL
GIEDEVLINFIYGLLDGKVVNGKEIQITLTGFMEKNTGKFMKELWTLLLSAQNNPSGVPQ
QFLDARAAETKKKQEEANEIMKKREGDKKNIEHDILRKIDSGVEHKETNGMDAKPSRDRP
EDGRRADEKNGVKERRRDLIPPRRGDASRSPLRGSRSRSISKTNSGSKSYSGERKSRSTS
QSSDASISPRKRRLSNSRRRSRSRSVRRSLSPRRRRIHSPFRSRSRSPIRRHRRPTHEGR
RQSPAPSRRRRSPSPPARRRRSPSPPARRRRSPSPPARRHRSPTPPARQRRSPSPPARRH
RSPPPARRRRSPSPPARRRRSPSPPARRRRSPSPLYRRNRSPSPLYRRNRSRSPLAKRGR
SDSPGRSPSPVARLRDPTGARLPSPSIEQRLPSPPVAQRLPSPPPRRAGLPSPPPAQRLP
SPPPRRAGLPSPMRIGGSHAANHLESPSPSSLSPPGRKKVLPSPPVRRRRSLTPDEERVS
LSQGGRHTSPSHIKQDGSMSPVRGRGKSSPSSRHQKARSPVRRRSPTPVNRRSRRSSSAS
RSPDRRRRRSPSSSRSPSRSRSPPVLHRSPSPRGRKHQRERRSPGRLSEEQDRVQNSKLL
KRTSVPDTDKRKQLPEKLLEVGRVEHYKEQERKSDKLSEKRSVHRHHGSQMSPVENSEGR
SRPVSSKVKDSEQVEKEDNSDLDANLSCDSKDTIRHQIKDKNRRKNKRSSREEVSSDDNG
SSDSDVDDRKEAKRRRKEEKKTRKEEKKRRREERHRKREERRGGKEKHKKQELSDTSEGE
VEARPKIKKGEESDPKRLEIELRNKALESLKAKKGISH*
>AT3G11910.1 | UBP13 (UBIQUITIN-SPECIFIC PROTEASE 13) ubiquitin thiolesterase/ ubiquitin-specific protease
MTMMTPPPLDQQEDEEMLVPNPDLVEGPQPMEVAQTDPAATAVENPPPEDPPSLKFTWTI
PMFTRLNTRKHYSDVFVVGGYKWRILIFPKGNNVDHLSMYLDVADAANLPYGWSRYSQFS
LAVVNQVNNRYSIRKETQHQFNARESDWGFTSFMPLSELYEPTRGYLVNDTVLIEAEVAV
RKVLDYWSYDSKKETGFVGLKNQGATCYMNSLLQTLYHIPYFRKAVYHMPTTENDAPTAS
IPLALQSLFYKLQYNDTSVATKELTKSFGWDTYDSFMQHDVQELNRVLCEKLEDKMKGTV
VEGTIQKLFEGHHMNYIECINVDYKSTRKESFYDLQLDVKGCKDVYASFDKYVEVERLEG
DNKYHAEGHDLQDAKKGVLFIDFPPVLQLQLKRFEYDFMRDTMVKINDRYEFPLQLDLDR
EDGRYLSPDADKSVRNLYTLHSVLVHSGGVHGGHYYAFIRPTLSDQWYKFDDERVTKEDV
KRALEEQYGGEEELPQNNPGFNNPPFKFTKYSNAYMLVYIRESDKDKIICNVDEKDIAEH
LRVRLKKEQEEKEDKRKYKAQAHLFTTIKVARDDDITEQIGKNIYFDLVDHEKVRSFRIQ
KQTPFQQFKEEVAKEFGVPVQLQRFWIWAKRQNHTYRPNRPLSPNEELQTVGQIREASNK
ANNAELKLFLEIERGPDDLPIPPPEKTSEDILLFFKLYDPENAVLRYVGRLMVKSSSKPM
DIVGQLNKMAGFAPDEEIELFEEIKFEPCVMCEQIDKKTSFRLCQIEDGDIICYQKPLSI
EESEFRYPDVPSFLEYVQNRELVRFRTLEKPKEDEFTMELSKLHTYDDVVERVAEKLGLD
DPSKLRLTSHNCYSQQPKPQPIKYRGVDHLSDMLVHYNQTSDILYYEVLDIPLPELQGLK
TLKVAFHSATKDEVIIHNIRLPKQSTVGDVINELKTKVELSHQDAELRLLEVFFHKIYKI
FPSTERIENINDQYWTLRAEEIPEEEKNIGPNDRLIHVYHFTKEAGQNQQVQNFGEPFFL
VIHEGETLEEIKTRIQKKLHVPDEDFAKWKFASFSMGRPDYLLDTDVVYNRFQRRDVYGA
WEQYLGLEHIDNAPKRAYAANQNRHAYEKPVKIYN*
>AT3G17740.1 | unknown protein
MSNTPEDGDGAQVTTGGLFPVFPTSANSISAISNAPQWLRNASFTTDLSVINAAASTAPS
SSEVEAGDDEDEEGGADGNIGLANQARVYNLVEEEGSLESDDDKVKRKREKKKKRKSDNA
SDESRSRKSDEYYSKPVKDYYLDTRPDPDNLAYGSIYRMNVPRYKLDNSQRVPGSGSLRF
YLRNRRSSMLDTEIDIDSLEGRAKSDTRYWYAKHAAMERNKNFKRIRLSAASEAVDSSFD
NFIPLEEDVTVPESDEEDVLSKDSMIGASWEDEVLNKTREFNRVTRERPHDAKAWLAFAD
FQDKVSSMQSQKGVRLQTLEKKISILEKAFELNPDSEELLLALLKAYRSRDNADVLISRW
EKALMQNSASYKLWREFLCVVQGEFSRFKVSEVRRLYSYAIQALSSACSKRHRQVDTTSE
PLDSAAIQQELVLVDMLVSLCRFEWQAGYQELATALLQAEVEFSIFSPSLLLTEQSKLRL
FEHFWSSNGARVGEEGAFGWLLWLEKEEENRQKILKEESSDDNEVGGWTGWTEQVSGRNG
DDIASANTGEVDVDRKGLDEEMEDENSKPEDDTEAMLKLLGIDVNTAASDEVKDTSTWVK
WFEEEVSRDHSQWMPTRKAGEFSSVEGMGEGEDEEQLSSVVLYEDINGYLFSLRSKEARL
SLVYQFIDFFGAHISPWTSSNSLSWSEKISSLETFSDSMLENLRSVHECLSKSDSANCFS
LGSLLGGSCDLSMRTEMMKFLRNAILLCLNVFPRNYILEEAVLVAEELFVTNMKTCEVAT
MPCQALAKRLLKSDRQDLLLCGVYAQREAASGNMKHARRVFDMALTSICGLPKELQCNTP
LLCLWYAESEVANSSGSGRDTESSSRAMHILCYLGSGLAYSPYTSQSSSMQILRARQGFR
EKLKKIQSTWSHGVTDDQSAALVCSAALFEELTNDLPGALEILEHMFSSVLPGRKSQSHQ
LELLFNYYVRMLQRHQDDLTLSQLWKPISEGLQLYPLNPELYRALVDICNHRMTSHKLRM
MFDDYSRKNSSVVVWLFALSYELSKGGSSHRIRGLFERALAQDTQNNSVILWRCYIAYEI
DIADNPSAARRIYFRAINACPWSKKLWLDGFGKLGSVLTAKEMSDLQEVMRDKELNIRTD
IYEILLMQG*
>AT3G44850.1 | protein kinase-related
MAEDKNNGRVNDESEYSSEDEGTEDYKKGGYHTVRVGDTFKNGAYVIQSKLGWGHFSTVW
LAWDTQESRYVALKVQKSAQHYTEAAMDEIKILKQIAEGDSGDKKCVVKLLDHFKHTGPN
GKHVCMVFEYLGDNLLSVIKYSDYRGVPLHMVKELCFHILVGLDYLHRELSIIHTDLKPE
NVLLLSTIDPSRDVRRSGVPLVLPITKDKIVSESAVKPETKSYTYNGDLTKNQKKKIRKK
AKKVVAQDFGGEEALEESERDSNSEARINGNSTVERSEGSSTRLMEGEEAREKANKKNGR
GSRRGSRSTRQKLLSDIECKCKLVDFGNACWTYKQFTSDIQTRQYRCPEVVLGSKYSTSA
DMWSFACICFELATGDVLFDPHSGENYDRDEDHLALMMELLGMMPRKIALGGRYSRDFFN
RQGELRHIRRLRFWPISKVLKEKYDFSEQDAKDMSDFLVTILEFVPEKRPTAAQCLKHPW
FNPGPRLLEPSLKPQQPKGEEEGAANENIEKEKDEREAMEAGVGNIAIDVSEPK*
>AT3G55220.1 | splicing factor putative
MYLYSLTLQQATGIVCAINGNFSGGKTQEIAVARGKILDLLRPDENGKIQTIHSVEVFGA
IRSLAQFRLTGAQKDYIVVGSDSGRIVILEYNKEKNVFDKVHQETFGKSGCRRIVPGQYV
AVDPKGRAVMIGACEKQKLVYVLNRDTTARLTISSPLEAHKSHTICYSLCGVDCGFDNPI
FAAIELDYSEADQDPTGQAASEAQKHLTFYELDLGLNHVSRKWSNPVDNGANMLVTVPGG
ADGPSGVLVCAENFVIYMNQGHPDVRAVIPRRTDLPAERGVLVVSAAVHKQKTMFFFLIQ
TEYGDVFKVTLDHNGDHVSELKVKYFDTIPVASSICVLKLGFLFSASEFGNHGLYQFQAI
GEEPDVESSSSNLMETEEGFQPVFFQPRRLKNLVRIDQVESLMPLMDMKVLNIFEEETPQ
IFSLCGRGPRSSLRILRPGLAITEMAVSQLPGQPSAVWTVKKNVSDEFDAYIVVSFTNAT
LVLSIGEQVEEVNDSGFLDTTPSLAVSLIGDDSLMQVHPNGIRHIREDGRINEWRTPGKR
SIVKVGYNRLQVVIALSGGELIYFEADMTGQLMEVEKHEMSGDVACLDIAPVPEGRKRSR
FLAVGSYDNTVRILSLDPDDCLQILSVQSVSSAPESLLFLEVQASIGGDDGADHPANLFL
NSGLQNGVLFRTVVDMVTGQLSDSRSRFLGLKPPKLFSISVRGRSAMLCLSSRPWLGYIH
RGHFHLTPLSYETLEFAAPFSSDQCAEGVVSVAGDALRIFMIDRLGETFNETVVPLRYTP
RKFVLHPKRKLLVIIESDQGAFTAEEREAARKECFEAGGVGENGNGNADQMENGADDEDK
EDPLSDEQYGYPKAESEKWVSCIRVLDPKTATTTCLLELQDNEAAYSVCTVNFHDKEYGT
LLAVGTVKGMQFWPKKNLVAGFIHIYRFVEDGKSLELLHKTQVEGVPLALCQFQGRLLAG
IGPVLRLYDLGKKRLLRKCENKLFPNTIISIQTYRDRIYVGDIQESFHYCKYRRDENQLY
IFADDCVPRWLTASHHVDFDTMAGADKFGNVYFVRLPQDLSEEIEEDPTGGKIKWEQGKL
NGAPNKVDEIVQFHVGDVVTCLQKASMIPGGSESIMYGTVMGSIGALHAFTSRDDVDFFS
HLEMHMRQEYPPLCGRDHMAYRSAYFPVKDVIDGDLCEQFPTLPMDLQRKIADELDRTPA
EILKKLEDARNKII*
>AT4G03430.1 | EMB2770 (EMBRYO DEFECTIVE 2770) RNA splicing factor transesterification mechanism
MVFLSIPNGKTLSIDVNPNSTTISAFEQLAHQRSDVPQSFLRYSLRMRNPSRVFVDSKDS
DSILLSDLGVSRFSTVIIHVLLLGGMQAAPPKPRLDFLNSKPPSNYVAGLGRGATGFTTR
SDIGPARAAPDLPDRSALATAAAPGVGRGAGKPSEAEAEDDEEAEEKRYDENQTFDEFEG
NDVGLFANAEYDEDDKEADAIWESIDQRMDSRRKDRREAKLKEEIEKYRASNPKITEQFA
DLKRKLHTLSADEWDSIPEIGDYSLRNKKKKFESFVPIPDTLLEKAKKEKELVMALDPKS
RAAGGSETPWGQTPVTDLTAVGEGRGTVLSLKLDNLSDSVSGQTVVDPKGYLTDLKSMKR
TTDEEIYDRNRARLLYKSLTQSNPKNPNGWIAAARVEEVDGKIKAARFQIQRGCEECPKN
EDVWLEACRLANPEDAKGVIAKGVKLIPNSVKLWLEAAKLEHDVENKSRVLRKGLEHIPD
SVRLWKAVVELANEEDARILLHRAVECCPLHLELWVALARLETYAESKKVLNKAREKLPK
EPAIWITAAKLEEANGKLDEANDNTAMVGKIIDRGIKTLQREGVVIDRENWMSEAEACER
VGSVATCQAIIKNTIGIGVEEEDRKRTWVADADECKKRGSIETARAIYAHALSVFLTKKS
IWLKAAQLEKSHGSRESLDALLRKAVTYVPQAEVLWLMGAKEKWLAGDVPAARAILQEAY
AAIPNSEEIWLAAFKLEFENKEPERARMLLAKARERGGTERVWMKSAIVERELGNVEEER
RLLNEGLKQFPTFFKLWLMLGQLEERFKHLEQARKAYDTGLKHCPHCIPLWLSLADLEEK
VNGLNKARAILTTARKKNPGGAELWLAAIRAELRHDNKREAEHLMSKALQDCPKSGILWA
ADIEMAPRPRRKTKSIDAMKKCDRDPHVTIAVAKLFWQDKKVEKARAWFERAVTVGPDIG
DFWALFYKFELQHGSDEDRKEVVAKCVACEPKHGEKWQAISKAVENAHQPIEVILKRVVN
ALSKEENSA*
>AT4G24270.1 | RNA recognition motif (RRM)-containing protein
MMADDTELVSNTDQKMEDASAENPARADPPSSDDSGDSDSDSEDEAESNQQIVTLESELS
ANPYNYDAYVQYIKLLRKTANLEKLRQAREAMSAIFPLSPSLWLEWARDEASLAASENVP
EIVMLYERGLSDYQSVSLWCDYLSFMLEFDPSVRGYPSEGISKMRSLFERAIPAAGFHVT
EGNRIWEGYREFEQGVLATIDEADIEERNKQIQRIRSIFHRHLSVPLENLSSTLIAYKTW
ELEQGIDLDIGSDDLSKVSHQVAVANKKAQQMYSERAHLEENISKQDLSDTEKFQEFMNY
IKFEKTSGDPTRVQAIYERAVAEYPVSSDLWIDYTVYLDKTLKVGKAITHAYSRATRSCP
WTGDLWARYLLALERGSASEKEIYDVFEKSLQCTFSSFEEYLDLYLTRVDGLRRRMLSTR
MLEALDYSLIRETFQQASDYLTPHMQNTDSLLHLHTYWANLELNIGKDLAGARGVWDSFL
KKSGGMLAAWHAYIDMEVHLGHIKEARSIYRRCYTRKFDGTGSEDICKGWLRFEREHGDL
EHFDLAVQKVMPRLEELQLMRLQQESTPVKPSAGLKEHSSQKRKAEQNVEEESLAKRQKR
KSQKEVDLGGQSATVPATKNVKAENGKTADSDKEETEDAKPLKPKVYRDECTAFISNLSV
KAQEEDIRKFFGDDGGVDSIRILHHKDTGKPRGLAYADFVDDEHLAAAIAKNRKMFFGKK
ISIARSNPKKGKKEFTRRGNDGSGNSKDPSLISEKAKAPLGGETEGERKGNEVEVRGKNT
FAVPRNVKPLGYTTPKPSADETPKSNDEFRNMFLKK*
>AT4G24270.2 | RNA recognition motif (RRM)-containing protein
MMADDTELVSNTDQKMEDASAENPARADPPSSDDSGDSDSDSEDEAESNQQIVTLESELS
ANPYNYDAYVQYIKLLRKTANLEKLRQAREAMSAIFPLSPSLWLEWARDEASLAASENVP
EIVMLYERGLSDYQSVSLWCDYLSFMLEFDPSVRGYPSEGISKMRSLFERAIPAAGFHVT
EGNRIWEGYREFEQGVLATIDEADIEERNKQIQRIRSIFHRHLSVPLENLSSTLIAYKTW
ELEQGIDLDIGSDDLSKVSHQVAVANKKAQQMYSERAHLEENISKQDLSDTEKFQEFMNY
IKFEKTSGDPTRVQAIYERAVAEYPVSSDLWIDYTVYLDKTLKVGKAITHAYSRATRSCP
WTGDLWARYLLALERGSASEKEIYDVFEKSLQCTFSSFEEYLDLYLTRVDGLRRRMLSTR
MLEALDYSLIRETFQQASDYLTPHMQNTDSLLHLHTYWANLELNIGKDLAGARGVWDSFL
KKSGGMLAAWHAYIDMEVHLGHIKEARSIYRRCYTRKFDGTGSEDICKGWLRFEREHGDL
EHFDLAVQKVMPRLEELQLMRLQQESTPVKPSAGLKEHSSQKRKAEQNVEEESLAKRQKR
KSQKEVDLGGQSATVPATKNVKAENGKTADSDKEETEDAKPLKPKVYRDECTAFISNLSV
KAQEEDIRKFFGDDGGVDSIRILHHKDTGKPRGLAYADFVDDEHLAAAIAKNRKMFFGKK
ISIARSNPKKGKKEFTRRGNVDGSGNSKDPSLISEKAKAPLGGETEGERKGNEVEVRGKN
TFAVPRNVKPLGYTTPKPSADETPKSNDEFRNMFLKK*
>AT4G25550.1 | protein binding
MAMSQVVNTYPLSNYSFGTKEPKLEKDTSVADRLARMKINYMKEGMRTSVEGILLVQEHN
HPHILLLQIGNTFCKLPGGRLKPGENEADGLKRKLTSKLGGNSAALVPDWTVGECVATWW
RPNFETMMYPYCPPHITKPKECKRLYIVHLSEKEYFAVPKNLKLLAVPLFELYDNVQRYG
PVISTIPQQLSRFHFNMISS*
>AT5G13010.1 | EMB3011 (embryo defective 3011) ATP binding / RNA helicase/ helicase/ nucleic acid binding
MGVDPFKTTETLEADKETNGGVPVKDKLTFKAPERKSRLGLDARAIEKKDNAKTEGEFKV
PKKSAISVTSSLDEEDKSDVSGLDFGTENTRPVHSSRRYREKSSRSQSAQESTVTTENAG
TSDVVAIGIEKNIGVTEVKLRGQDRETLMMRWITTDGGNLIANLTETITEKSVGDTIAIG
GLQDEWERSPHGDRGSSYSRRPQPSPSPMLAAASPDARLASPWLDTPRSTMSSASPWDMG
APSPIPIRASGSSIRSSSSRYGGRSNQLAYSREGDLTNEGHSDEDRSQGAEEFKHEITET
MRVEMEYQSDRAWYDTDEGNSLFDADSASFFLGDDASLQKKETELAKRLVRRDGSKMSLA
QSKKYSQLNADNAQWEDRQLLRSGAVRGTEVQTEFDSEEERKAILLVHDTKPPFLDGRVV
YTKQAEPVMPVKDPTSDMAIISRKGSGLVKEIREKQSANKSRQRFWELAGSNLGNILGIE
KSAEQIDADTAVVGDDGEVDFKGEAKFAQHMKKGEAVSEFAMSKTMAEQRQYLPIFSVRD
ELLQVIRENQVIVVVGETGSGKTTQLTQDGYTINGIVGCTQPRRVAAMSVAKRVSEEMET
ELGDKIGYAIRFEDVTGPNTVIKYMTDGVLLRETLKDSDLDKYRVVVMDEAHERSLNTDV
LFGILKKVVARRRDFKLIVTSATLNAQKFSNFFGSVPIFNIPGRTFPVNILYSKTPCEDY
VEAAVKQAMTIHITSPPGDILIFMTGQDEIEAACFSLKERMEQLVSSSSREITNLLILPI
YSQLPADLQAKIFQKPEDGARKCIVATNIAETSLTVDGIYYVIDTGYGKMKVFNPRMGMD
ALQVFPISRAASDQRAGRAGRTGPGTCYRLYTESAYLNEMLPSPVPEIQRTNLGNVVLLL
KSLKIDNLLDFDFMDPPPQENILNSMYQLWVLGALNNVGGLTDLGWKMVEFPLDPPLAKM
LLMGERLDCIDEVLTIVSMLSVPSVFFRPKERAEESDAAREKFFVPESDHLTLLNVYQQW
KEHDYRGDWCNDHYLQVKGLRKAREVRSQLLDILKQLKIELRSCGPDWDIVRKAICSAYF
HNSARLKGVGEYVNCRTGMPCHLHPSSALYGLGYTPDYVVYHELILTTKEYMQCATSVEP
HWLAELGPMFFSVKDSDTSMLEHKKKQKEEKSGMEEEMEKLRRDQVESELRSKERERKKR
AKQQQQISGPGLKKGTTFLRPKKLGL*
>AT5G13480.1 | FY protein binding
MYAGGDMHRGSQMPQPPMMRQSSASSTNINPDYHHPSGPFDPNVDSFGAKRMRKHTQRRA
VDYTSTVVRYIQARTWQRDSRDRTTLQPTPAAAVDMLPTVAYSDNPSTSFAAKFVHASLN
KNRCSINRVLWTPSGRRLITGSQSGEFTLWNGQSFNFEMILQAHDQPIRSMVWSHNENYM
VSGDDGGTLKYWQNNMNNVKANKTAHKESIRDLSFCKTDLKFCSCSDDTTVKVWDFTKCV
DESSLTGHGWDVKSVDWHPTKSLLVSGGKDQLVKLWDTRSGRELCSLHGHKNIVLSVKWN
QNGNWLLTASKDQIIKLYDIRTMKELQSFRGHTKDVTSLAWHPCHEEYFVSGSSDGSICH
WIVGHENPQIEIPNAHDNSVWDLAWHPIGYLLCSGSNDHTTKFWCRNRPADNPRDVLMQN
QGYNEQGFGRQPDNFQPSEASPIPGAFVPGLTRNEGTIPGIGIAMPFDASSQGDHKQPLP
GSMALGAPPLPPGPHPSLLGSGQQQGYQQQQQHQGHPQQMLPMPNMPHHQLPPSSHMPLH
PHHLPRPMQMPPHGHMPPPSMPMSHQMPGSMGMQGGMNPQMSQSHFMGAPSGVFQGQPNS
GGPQMYPQGRGGFNRPQMIPGYNNPFQQQQQPPLPPGPPPNNNQQHQ*
>AT5G51280.1 | DEAD-box protein abstrakt putative
MESIMEEADSYIEYVSVAERRAIAAQKILQRKGKASELEEEADKEKLAEAKPSLLVQATQ
LKRDVPEVSATEQIILQEKEMMEHLSDKKTLMSVRELAKGITYTEPLLTGWKPPLHIRKM
SSKQRDLIRKQWHIIVNGDDIPPPIKNFKDMKFPRPVLDTLKEKGIVQPTPIQVQGLPVI
LAGRDMIGIAFTGSGKTLVFVLPMIMIALQEEMMMPIAAGEGPIGLIVCPSRELARQTYE
VVEQFVAPLVEAGYPPLRSLLCIGGIDMRSQLEVVKRGVHIVVATPGRLKDMLAKKKMSL
DACRYLTLDEADRLVDLGFEDDIREVFDHFKSQRQTLLFSATMPTKIQIFARSALVKPVT
VNVGRAGAANLDVIQEVEYVKQEAKIVYLLECLQKTSPPVLIFCENKADVDDIHEYLLLK
GVEAVAIHGGKDQEDREYAISSFKAGKKDVLVATDVASKGLDFPDIQHVINYDMPAEIEN
YVHRIGRTGRCGKTGIATTFINKNQSETTLLDLKHLLQEAKQRIPPVLAELNDPMEEAET
IANASGVKGCAYCGGLGHRIRDCPKLEHQKSVAISNSRKDYFGSGGYRGEI*
>AT5G51410.1 | LUC7 N_terminus domain-containing protein
MDAQRALLDELMGAARNLTDEERRGFKEVKWDDREVCAFYMVRFCPHDLFVNTKSDLGAC
SRIHDPKLKESFENSPRHDSYVPKFEAELAQFCEKLVNDLDRKVRRGRERLAQEVEPVPP
PSLSAEKAEQLSVLEEKVKNLLEQVEALGEEGKVDEAEALMRKVEGLNAEKTVLLQRPTD
KVLAMAQEKKMALCEVCGSFLVANDAVERTQSHVTGKQHVGYGLVRDFIAEQKAAKDKGK
EEERLVRGKEADDKRKPREKESESKRSGSSDRERYRDRDRNRDGDRHRDRGRDYRKPYDR
RSRSGREDRDRSRSRSPHGRSGHRRVSRSPIRQY*
>AT5G51410.2 | LUC7 N_terminus domain-containing protein
MDAQRALLDELMGAARNLTDEERRGFKEVKWDDREVCAFYMVRFCPHDLFVNTKSDLGAC
SRIHDPKLKESFENSPRHDSYVPKFEAELAQFCEKLVNDLDRKVRRGRERLAQEVEPVPP
PSLSAEKAEQLSVLEEKVKNLLEQVEALGEEGKVDEAEALMRKVEGLNAEKTVLLQRPTD
KVLAMAQEKKMALCEVCGSFLVANDAVERTQSHVTGKQHVGYGLVRDFIAEQKAAKDKGK
EEERLVRGKEADDKRKPREKESESKRSGSSDRERYRDRDRNRDGDRHRDRGRDYRKPYDR
RSRSGREDRDRSRSRSPHGRSGHRRVSRSPIRQY*
>AT5G55100.1 | SWAP (Suppressor-of-White-APricot)/surp domain-containing protein
MDLEIVGRHALFFDDDSMATFVNSPTALVDWNSLFIDRYDVRHLLSSPPPPRIKRRRPNS
NDADLESELDHERYLDLPSESPSPSDDDEHDMNEDSANTNADGYRAVSFSYGSSSDVNDQ
KNAADMESGFHPPFPVPDYLRQNLPPTEKLHQIITRTSSFVSKHGGQSEIVLRVKQGDNP
TFGFLMPDHHLHLYFRFLVDHQELLTGKSSVEEKKNESEKDGGALSLLGSVYGTVEDEDA
NEESANDSKTSESAKGDDGVKVTDSNGPEGSKGAAKIASKHSLPLNDHASFIKRNPSVSA
VNVVEKKQINTEKLVTSDKSQPKLELQIVEPTTEMKRVIDKIVDFIQKNGKELEATLVAQ
DVKYGMFPFLRPSSLYHAYYRKVLQEAEELKSGDKGVIIRKEDVKQEKMGNAVKDSKHGF
GSVLPDDSAKKEKLKMVSDKPKVELHNEPFKPVQPQMRVNVDANTAAAILQAARRGIRNP
QLGILTGKPMDETSQTLGNDVSYPSSKSPDLAKSTGQSLSGSTAASEADSSEAGLSKEQK
LKAERLKRAKMFVAKLKPDAQPVQQAEPSRSISVEPLDSGISGLGAKAAKERDSSSIPYV
AESKLADDGNSERRSKRNYRSRSQRDEDGKMEQGEEEESSMDEVTEETKTDKKHSCSRKR
HKHKTRYSSKDRHSRDKHKHESSSDDEYHSRSRHRHRHSKSSDRHELYDSSDNEGEHRHR
SSKHSKDVDYSKDKRSHHHRSRKHEKHRDSSDDEHHHHRHRSSRRKHEDSSDVEHGHRHK
SSKRIKKDEKTVEEETVSKSDQSDLKASPGDNIPYLQNEPSQVSDELRAKIRAMLADTLG
DGR*
>AT5G55100.2 | SWAP (Suppressor-of-White-APricot)/surp domain-containing protein
MDLEIVGRHALFFDDDSMATFVNSPTALVDWNSLFIDRYDVRHLLSSPPPPRIKRRRPNS
NDADLESELDHERYLDLPSESPSPSDDDEHDMNEDSANTNADGYRAVSFSYGSSSDVNDQ
KNAADMESGFHPPFPVPDYLRQNLPPTEKLHQIITRTSSFVSKHGGQSEIVLRVKQGDNP
TFGFLMPDHHLHLYFRFLVDHQELLTGKSSVEEKKNESEKDGGALSLLGSVYGTVEDEDA
NEESANDSKTSESAKGDDGVKVTDSNGPEGSKGAAKIASKHSLPLNDHASFIKRNPSVSA
VNVVEKKQINTEKLVTSDKSQPKLELQIVEPTTEMKRVIDKIVDFIQKNGKELEATLVAQ
DVKYGMFPFLRPSSLYHAYYRKVLQEAEELKSGDKGVIIRKEDVKQEKMGNAVKDSKHGF
GSVLPDDSAKKEKLKMVSDKPKVELHNEPFKPVQPQMRVNVDANTAAAILQAARRGIRNP
QLGILTGKPMDETSQTLGNDVSYPSSKSPDLAKSTGQSLSGSTAASEADSSEAGLSKEQK
LKAERLKRAKMFVAKLKPDAQPVQQAEPSRSISVEPLDSGISGLGAKAAKERDSSSIPYV
AESKLADDGNSERRSKRNYRSRSQRDEDGKMEQGEEEESSMDEVTEETKTDKKHSCSRKR
HKHKTRYSSKDRHSRDKHKHESSSDDEYHSRSRHRHRHSKSSDRHELYDSSDNEGEHRHR
SSKHSKDVDYSKDKRSHHHRSRKHEKHRDSSDDEHHHHRHRSSRRKHEDSSDVEHGHRHK
SSKRIKKDEKTVEEETVSKSDQSDLKASPGDNIPYLQNEPSQVSDELRAKIRAMLADTLY
VSSN*
>AT5G55670.1 | RNA recognition motif (RRM)-containing protein
MDEGDGRDQMDQFHQNEAISAVADDGFMAEEEDDDYEDLYNDVNVGEGFLQSMKKNDEAG
SRNEEKEKVNMEEEDRVEPVLGEAEVSISIPGLVGESVEKEAEAESGGGGSGSGTDVVVA
SSGYGAQEVKVSDVSQEIPGGIGTGTGGGLRVELGQASNRANDLEAPRGNNISQGLLPPP
PVLGNNENLMRPVMGNVNGGIPPGPGSNMVGNGANIAMPGVVGGGTGGGGGGGAFLFVGD
LHWWTTDAELEAELCKYGAVKEVKFFDEKASGKSKGYCQVEFYDPVAASACKDALNGYPF
NGRPCVVEYASPYSVKRMGEAQVNRTQQAQSVIAQAKRGGPADPPSKPLVANNNNNNNNN
AIGGNFQGGENRGFGRGNWGRGNAQGMGGRGPGGPMRNRPNGMGGRGLMGNGGFGQGMGT
GPPMNMMHQPMMGQGFEQAFGGPMARMGGYGGFPGAPGPQFPGLLSSFPPVGGVGLPGVA
PHVNPAFFGRGMPMNGMGMMPNAGVDGGHNMGMWDPNSGGWGAGEDLGSGRAAESSYGEE
AASDHQYGEVNHERGARPNPVKEKERASEREWSGSSDRRNREDKDAGYERDIPREKDVGH
GYDMPERRHRDDRDTGREREREHHHKDRERSREHVRDRERERERDRHREERERYGGDHRT
RHRDEPEHDEEWNRGRSSRGHNKSRLSREDNHRSKSRDTDYGKRRRLTTE*