>AT4G37880.1 |  protein binding / zinc ion binding 
MELKSIKDAFDRVATKQKLSYSKTNEIVHMLSQEIDKALSILEETPSSDTMLLDHRSILA 
DVKKVFMEIAPITQLEATEKELHAALTKYPKVLEKQLNPDISKAYRHNVEFDTHIVNQII 
ANFFYRQGMFDIGDCFVAETGESECSTRQSFVEMYRILEAMKRRDLEPALNWAVSNSDKL 
KEARSDLEMKLHSLHFLEIARGKNSKEAIDYARKHIATFADSCLPEIQKLMCSLLWNRKL 
DKSPYSEFLSPALWNNAVKELTRQYCNLLGESSESPLSITVTAGTQALPVLLKYMNVVMA 
NKKLDWQTMEQLPVDAQLSEEFQFHSVFVCPVSKEQSSDDNPPMMMSCGHVLCKQTINKM 
SKNGSKSSFKCPYCPTDVDISRCRQLHF*
>AT3G62030.1 |  peptidyl-prolyl cis-trans isomerase chloroplast / cyclophilin / rotamase / cyclosporin A-binding protein (ROC4) 
MASSSSMQMVHTSRSIAQIGFGVKSQLVSANRTTQSVCFGARSSGIALSSRLHYASPIKQ 
FSGVYATTKHQRTACVKSMAAEEEEVIEPQAKVTNKVYFDVEIGGEVAGRIVMGLFGEVV 
PKTVENFRALCTGEKKYGYKGSSFHRIIKDFMIQGGDFTEGNGTGGISIYGAKFEDENFT 
LKHTGPGILSMANAGPNTNGSQFFICTVKTSWLDNKHVVFGQVIEGMKLVRTLESQETRA 
FDVPKKGCRIYACGELPLDA*
>AT3G62030.2 |  peptidyl-prolyl cis-trans isomerase chloroplast / cyclophilin / rotamase / cyclosporin A-binding protein (ROC4) 
MFRLLLLPYAVGAQQKLLQTPRETKVADAWNIKCQNLLLSKANQQKVFIFNHSMASSSSM 
QMVHTSRSIAQIGFGVKSQLVSANRTTQSVCFGARSSGIALSSRLHYASPIKQFSGVYAT 
TKHQRTACVKSMAAEEEEVIEPQAKVTNKVYFDVEIGGEVAGRIVMGLFGEVVPKTVENF 
RALCTGEKKYGYKGSSFHRIIKDFMIQGGDFTEGNGTGGISIYGAKFEDENFTLKHTGPG 
ILSMANAGPNTNGSQFFICTVKTSWLDNKHVVFGQVIEGMKLVRTLESQETRAFDVPKKG 
CRIYACGELPLDA*
>AT4G16830.1 |  nuclear RNA-binding protein (RGGA) 
MATLNPFDLLDDDAEDPSQLAVAIEKIDKSKKSGQVSSLPAKSAPKLPSKPLPPAQAVRE 
ARSDAPRGGGGRGGFNRGRGGYNRDDGNNGYSGGYTKPSGEGDVSKSSYERRGGGGAPRG 
SFRGEGGGPGGGRRGGFSNEGGDGERPRRAFERRSGTGRGSDFKRDGSGRGNWGTPGEEI 
AAETEAVAGVETEKDVGEKPAVDDVAADANKEDTVVEEKEPEDKEMTLDEYEKILEEKKK 
ALQSLTTSERKVDTKVFESMQQLSNKKSNDEIFIKLGSDKDKRKDDKEEKAKKAVSINEF 
LKPAEGGNYYRGGRGGRGRGGRGRGGVSSGESGGYRNEAAPAIGDAAQFPSLGGK*
>AT4G16830.2 |  nuclear RNA-binding protein (RGGA) 
MLHVVVEAVEDLTVVVVVTTVMMVTMDIQGDTLNPQVKEMFQSLLTRGVAGGDGERPRRA 
FERRSGTGRGSDFKRDGSGRGNWGTPGEEIAAETEAVAGVETEKDVGEKPAVDDVAADAN 
KEDTVVEEKEPEDKEMTLDEYEKILEEKKKALQSLTTSERKVDTKVFESMQQLSNKKSND 
EIFIKLGSDKDKRKDDKEEKAKKAVSINEFLKPAEGGNYYRGGRGGRGRGGRGRGGVSSG 
ESGGYRNEAAPAIGDAAQFPSLGGK*
>AT4G16830.3 |  nuclear RNA-binding protein (RGGA) 
MMMLRIQASSLLPSRRLISPRNLDRFRACLLSQLLSFHRSHFLLLKPVREARSDAPRGGG 
GRGGFNRGRGGYNRDDGNNGYSGGYTKPSGEGDVSKSSYERRGGGGAPRGSFRGEGGGPG 
GGRRGGFSNEGGDGERPRRAFERRSGTGRGSDFKRDGSGRGNWGTPGEEIAAETEAVAGV 
ETEKDVGEKPAVDDVAADANKEDTVVEEKEPEDKEMTLDEYEKILEEKKKALQSLTTSER 
KVDTKVFESMQQLSNKKSNDEIFIKLGSDKDKRKDDKEEKAKKAVSINEFLKPAEGGNYY 
RGGRGGRGRGGRGRGGVSSGESGGYRNEAAPAIGDAAQFPSLGGK*
>AT3G14120.1 |  FUNCTIONS IN molecular_function unknown INVOLVED IN transport LOCATED IN nuclear pore EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s Nuclear pore protein 84/107 (InterProIPR007252) Has 207 Blast hits to 206 proteins in 78 species Archae - 0 Bacteria - 0 Metazoa - 123 Fungi - 57 Plants - 22 Viruses - 0 Other Eukaryotes - 5 (source NCBI BLink) 
MDMDMDTSPSYFDPEALSVRDQFRRYRKRHSTSPHEEMLSSNVSENRLLYDGHNIHSPTN 
TALLLENIKEEVDNFHTDHYEGTPTNPISASRRESVGILNDDDEALFRRVESQSLKACKI 
ENDELAESGDTTFALFASLFDSALQGLMSIPNLMLRLEESCRNVSQSIRYGSDIRHRAVE 
DKLMRQKAQLLLGEAASWSLLWNLYGKGTDEVPENLILIPSTSHLEACQFVLNDHTAQLC 
LRIVMWLEELASKSLDLERKVQGSHVGTYLPNAGVWHHTQRYLKKNGSNADTLHHLDFDA 
PTREHARLLPDDYKQDESVLEDVWTLIRAGRIEEACDLCRSAGQSWRAATLCPFSGMDMF 
PSIEALVKNGENRTLQAIEQESGFGNQLRLWKWASYCASEKIAEQDGGKHEVAVFATQCS 
NLNRMLPICTDWESACWAMAKSWLDVQVDLELAQSKPGLTERFKSCIDESPEATQNGCQA 
SFGPEDWPLHVLNQQPRDLPALLQKLHSGEMVHEAVVRGCKEQHRQIQMNLMLGDISHLL 
DIIWSWIAPLEDDQSNFRPHGDPHMIKFGAHMVLVLRLLFTDEINDSFKEKLNNVGDLIL 
HMYAMFLFSKQHEELVGIYASQLARHRCIELFVHMMELRMHSSVHVKYKIFLSAMEYLSF 
SPVDDLHGNFEEIVDRVLSRSREIKLAKYDPSIDVAEQHRQQSLQKAIAIQWLCFTPPST 
IKDVKDVTSKLLLRSLMHSNILFREFALIAMWRVPATPVGAHTLLSYLAEPLKQLSENPD 
TLEDYVSENLQEFQDWNEYYSCDAKYRNWLKFQLENAEVTELSEEENQKAVVAAKETLDS 
SLSLLLRQDNPWMTFLEDHVFESEEYLFLELHATAMLCLPSGECLRPDATVCAALMSALY 
SSVSEEVVLDRQLMVNVSISSRDSYCIEVVLRCLAIKGDGLGPHNANDGGILSAVAAAGF 
KGSDIYGTYFSFTYDLPPFSIEIWGCELTRFQAGVTMDISRLDAWYSSKEGSLETPATYI 
VRGLCRRCCLPELVLRSMQVSVSLMESGNPPEDHDELIELVASDETGFLSLFSRQQLQEF 
MLFEREYRMSQLELQEELSSP*
>AT3G14120.2 |  FUNCTIONS IN molecular_function unknown INVOLVED IN transport LOCATED IN nuclear pore EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s Nuclear pore protein 84/107 (InterProIPR007252) Has 207 Blast hits to 206 proteins in 78 species Archae - 0 Bacteria - 0 Metazoa - 123 Fungi - 57 Plants - 22 Viruses - 0 Other Eukaryotes - 5 (source NCBI BLink) 
MDMDMDTSPSYFDPEALSVRDQFRRYRKRHSTSPHEEMLSSNVSENRLLYDGHNIHSPTN 
TALLLENIKEEVDNFHTDHYEGTPTNPISASRRESVGILNDDDEALFRRVESQSLKACKI 
ENDELAESGDTTFALFASLFDSALQGLMSIPNLMLRLEESCRNVSQSIRYGSDIRHRAVE 
DKLMRQKAQLLLGEAASWSLLWNLYGKGTDEVPENLILIPSTSHLEACQFVLNDHTAQLC 
LRIVMWLEELASKSLDLERKVQGSHVGTYLPNAGVWHHTQRYLKKNGSNADTLHHLDFDA 
PTREHARLLPDDYKQDESVLEDVWTLIRAGRIEEACDLCRSAGQSWRAATLCPFSGMDMF 
PSIEALVKNGENRTLQAIEQESGFGNQLRLWKWASYCASEKIAEQDGGKHEVAVFATQCS 
NLNRMLPICTDWESACWAMAKSWLDVQVDLELAQSKPGLTERFKSCIDESPEATQNGCQA 
SFGPEDWPLHVLNQQPRDLPALLQKLHSGEMVHEAVVRGCKEQHRQIQMNLMLGDISHLL 
DIIWSWIAPLEDDQSNFRPHGDPHMIKFGAHMVLVLRLLFTDEINDSFKEKLNNVGDLIL 
HMYAMFLFSKQHEELVGIYASQLARHRCIELFVHMMELRMHSSVHVKYKIFLSAMEYLSF 
SPVDDLHGNFEEIVDRVLSRSREIKLAKYDPSIDVAEQHRQQSLQKAIAIQWLCFTPPST 
IKDVKDVTSKLLLRSLMHSNILFREFALIAMWRVPATPVGAHTLLSYLAEPLKQLSENPD 
TLEDYVSENLQEFQDWNEYYSCDAKYRNWLKFQLENAEVTELSEEENQKAVVAAKETLDS 
SLSLLLRQDNPWMTFLEDHVFESEEYLFLELHATAMLCLPSGECLRPDATVCAALMSALY 
SSVSEEVVLDRQLMVNVSISSRDSYCIEVVLRCLAIKGDGLGPHNANDGGILSAVAAAGF 
KGELTRFQAGVTMDISRLDAWYSSKEGSLETPATYIVRGLCRRCCLPELVLRSMQVSVSL 
MESGNPPEDHDELIELVASDETGFLSLFSRQQLQEFMLFEREYRMSQLELQEELSSP*
>AT1G10320.1 |  U2 snRNP auxiliary factor-related 
MEQANEKEEEERHEEAAGEKESFEESKEKAAEMSRKEKRKAMKKLKRKQVRKEIAAKERE 
EAKAKLNDPAEQERLKAIEEEDARRREKELKDFEESERAWREAMEIKRKKEEEEEAKREE 
EERRWKDLEELRKLEASGNDECGEDEDGEYEYIEEGPPEIIFQGNEIILKKNKVRVPKKS 
VVQVDGHESSNAEFVLQISDRPTSNPLPPGSEASANYQNVSSAQQILESVAQEVPNFGTE 
QDKAHCPFHLKTGACRFGQRCSRVHFYPNKSCTLLMKNMYNGPGITWEQDEGLEYTDEEA 
ELCYEEFYEDVHTEFLKYGELVNFKVCRNGSFHLKGNVYVHYRSLESAILAYQSINGRYF 
AGKQVNCEFVNISRWKVAICGEYMKSRLKTCSRGSACNFIHCFRNPGGDYEWADHDRPPP 
RFWIHKMTSLFGYSDEKHMEHESSGSLNDSISDLSTDSHRQPSRRSRSRDHDHANVGSTP 
SYRSRKYHGDTQDSTREDKLRRHAENCHDGDDSPSRDGSLEREMYKERRYAKDTLHRDSR 
WSEHSPGHRVGRKRIHGRYSDDDSADGDDYGRRGTGHKRKPRRGTDSGVQEQMDNEKDRK 
THRSSRKHSREGSSADKEEGHEHDRVHTVSDKSHRERSKHRHERSSSRYSHEEDSTESRH 
HQHKESDKKRSVETSPVGYQSDKDRDRSKQRQRYKSDDPESDQSRKGKRQSEENSDRETH 
KERRHRHRKRRRTQNSDDQNPKESEEVEEEIERWRPV*
>AT1G80830.1 |  NRAMP1 (NATURAL RESISTANCE-ASSOCIATED MACROPHAGE PROTEIN 1) inorganic anion transmembrane transporter/ manganese ion transmembrane transporter/ metal ion transmembrane transporter 
MAATGSGRSQFISSSGGNRSFSNSPLIENSDSNQIIVSEKKSWKNFFAYLGPGFLVSIAY 
IDPGNFETDLQAGAHYKYELLWIILVASCAALVIQSLAANLGVVTGKHLAEQCRAEYSKV 
PNFMLWVVAEIAVVACDIPEVIGTAFALNMLFSIPVWIGVLLTGLSTLILLALQKYGVRK 
LEFLIAFLVFTIAICFFVELHYSKPDPGEVLHGLFVPQLKGNGATGLAISLLGAMVMPHN 
LFLHSALVLSRKIPRSASGIKEACRFYLIESGLALMVAFLINVSVISVSGAVCNAPNLSP 
EDRANCEDLDLNKASFLLRNVVGKWSSKLFAIALLASGQSSTITGTYAGQYVMQGFLDLR 
LEPWLRNLLTRCLAIIPSLIVALIGGSAGAGKLIIIASMILSFELPFALVPLLKFTSCKT 
KMGSHVNPMAITALTWVIGGLIMGINIYYLVSSFIKLLIHSHMKLILVVFCGILGFAGIA 
LYLAAIAYLVFRKNRVATSLLISRDSQNVETLPRQDIVNMQLPCRVSTSDVD*
>AT1G61150.1 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s LisH dimerisation motif subgroup (InterProIPR013720) CTLH C-terminal to LisH motif (InterProIPR006595) LisH dimerisation motif (InterProIPR006594) CT11-RanBPM (InterProIPR013144) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT1G111101) Has 740 Blast hits to 708 proteins in 137 species Archae - 0 Bacteria - 0 Metazoa - 401 Fungi - 144 Plants - 139 Viruses - 0 Other Eukaryotes - 56 (source NCBI BLink) 
MSLFRIFINQLEEDDEDMATSKKMITREEWEKKLNAVKLRKEDMNTLVMNFLVTEGYVEA 
AEKFQRESGTKPEIDLATITDRMAVKKAVQNGNVEDAIEKVNDLNPEILDTNPELFFHLQ 
QQRLIELIRQGKTEEALEFAQEELAPRGEENQAFLEELEKTVALLVFDDASTCPVKELLD 
LSHRLKTASEVNAAILTSQSHEKDPKLPSLLKMLIWAQTQLDEKAVYPHINDLSTGKLED 
PSE*
>AT1G61150.1 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s CTLH C-terminal to LisH motif (InterProIPR006595) CT11-RanBPM (InterProIPR013144) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT4G093001) Has 486 Blast hits to 476 proteins in 122 species Archae - 0 Bacteria - 4 Metazoa - 235 Fungi - 76 Plants - 132 Viruses - 0 Other Eukaryotes - 39 (source NCBI BLink) 
MSLFRIFINQLEEDDEDMATSKKMITREEWEKKLNAVKLRKEDMNTLVMNFLVTEGYVEA 
AEKFQRESGTKPEIDLATITDRMAVKKAVQNGNVEDAIEKVNDLNPEILDTNPELFFHLQ 
QQRLIELIRQGKTEEALEFAQEELAPRGEENQAFLEELEKTVALLVFDDASTCPVKELLD 
LSHRLKTASEVNAAILTSQSHEKDPKLPSLLKMLIWAQTQLDEKAVYPHINDLSTGKLED 
PSE*
>AT1G61150.1 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s LisH dimerisation motif subgroup (InterProIPR013720) CTLH C-terminal to LisH motif (InterProIPR006595) LisH dimerisation motif (InterProIPR006594) CT11-RanBPM (InterProIPR013144) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT4G093001) Has 726 Blast hits to 694 proteins in 137 species Archae - 0 Bacteria - 0 Metazoa - 389 Fungi - 144 Plants - 139 Viruses - 0 Other Eukaryotes - 54 (source NCBI BLink) 
MSLFRIFINQLEEDDEDMATSKKMITREEWEKKLNAVKLRKEDMNTLVMNFLVTEGYVEA 
AEKFQRESGTKPEIDLATITDRMAVKKAVQNGNVEDAIEKVNDLNPEILDTNPELFFHLQ 
QQRLIELIRQGKTEEALEFAQEELAPRGEENQAFLEELEKTVALLVFDDASTCPVKELLD 
LSHRLKTASEVNAAILTSQSHEKDPKLPSLLKMLIWAQTQLDEKAVYPHINDLSTGKLED 
PSE*
>AT1G61150.3 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s LisH dimerisation motif subgroup (InterProIPR013720) CTLH C-terminal to LisH motif (InterProIPR006595) LisH dimerisation motif (InterProIPR006594) CT11-RanBPM (InterProIPR013144) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT1G111101) Has 740 Blast hits to 708 proteins in 137 species Archae - 0 Bacteria - 0 Metazoa - 401 Fungi - 144 Plants - 139 Viruses - 0 Other Eukaryotes - 56 (source NCBI BLink) 
MELVCLMFSLSTPFELPYRETAEIDLATITDRMAVKKAVQNGNVEDAIEKVNDLNPEILD 
TNPELFFHLQQQRLIELIRQGKTEEALEFAQEELAPRGEENQAFLEELEKTVALLVFDDA 
STCPVKELLDLSHRLKTASEVNAAILTSQSHEKDPKLPSLLKMLIWAQTQLDEKAVYPHI 
NDLSTGKLEDPSE*
>AT1G61150.3 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s CTLH C-terminal to LisH motif (InterProIPR006595) CT11-RanBPM (InterProIPR013144) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT4G093001) Has 486 Blast hits to 476 proteins in 122 species Archae - 0 Bacteria - 4 Metazoa - 235 Fungi - 76 Plants - 132 Viruses - 0 Other Eukaryotes - 39 (source NCBI BLink) 
MELVCLMFSLSTPFELPYRETAEIDLATITDRMAVKKAVQNGNVEDAIEKVNDLNPEILD 
TNPELFFHLQQQRLIELIRQGKTEEALEFAQEELAPRGEENQAFLEELEKTVALLVFDDA 
STCPVKELLDLSHRLKTASEVNAAILTSQSHEKDPKLPSLLKMLIWAQTQLDEKAVYPHI 
NDLSTGKLEDPSE*
>AT1G61150.3 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s LisH dimerisation motif subgroup (InterProIPR013720) CTLH C-terminal to LisH motif (InterProIPR006595) LisH dimerisation motif (InterProIPR006594) CT11-RanBPM (InterProIPR013144) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT4G093001) Has 726 Blast hits to 694 proteins in 137 species Archae - 0 Bacteria - 0 Metazoa - 389 Fungi - 144 Plants - 139 Viruses - 0 Other Eukaryotes - 54 (source NCBI BLink) 
MELVCLMFSLSTPFELPYRETAEIDLATITDRMAVKKAVQNGNVEDAIEKVNDLNPEILD 
TNPELFFHLQQQRLIELIRQGKTEEALEFAQEELAPRGEENQAFLEELEKTVALLVFDDA 
STCPVKELLDLSHRLKTASEVNAAILTSQSHEKDPKLPSLLKMLIWAQTQLDEKAVYPHI 
NDLSTGKLEDPSE*
>AT1G61150.2 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s LisH dimerisation motif subgroup (InterProIPR013720) CTLH C-terminal to LisH motif (InterProIPR006595) LisH dimerisation motif (InterProIPR006594) CT11-RanBPM (InterProIPR013144) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT1G111101) Has 740 Blast hits to 708 proteins in 137 species Archae - 0 Bacteria - 0 Metazoa - 401 Fungi - 144 Plants - 139 Viruses - 0 Other Eukaryotes - 56 (source NCBI BLink) 
MATSKKMITREEWEKKLNAVKLRKEDMNTLVMNFLVTEGYVEAAEKFQRESGTKPEIDLA 
TITDRMAVKKAVQNGNVEDAIEKVNDLNPEILDTNPELFFHLQQQRLIELIRQGKTEEAL 
EFAQEELAPRGEENQAFLEELEKTVALLVFDDASTCPVKELLDLSHRLKTASEVNAAILT 
SQSHEKDPKLPSLLKMLIWAQTQLDEKAVYPHINDLSTGKLEDPSE*
>AT1G61150.2 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s CTLH C-terminal to LisH motif (InterProIPR006595) CT11-RanBPM (InterProIPR013144) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT4G093001) Has 486 Blast hits to 476 proteins in 122 species Archae - 0 Bacteria - 4 Metazoa - 235 Fungi - 76 Plants - 132 Viruses - 0 Other Eukaryotes - 39 (source NCBI BLink) 
MATSKKMITREEWEKKLNAVKLRKEDMNTLVMNFLVTEGYVEAAEKFQRESGTKPEIDLA 
TITDRMAVKKAVQNGNVEDAIEKVNDLNPEILDTNPELFFHLQQQRLIELIRQGKTEEAL 
EFAQEELAPRGEENQAFLEELEKTVALLVFDDASTCPVKELLDLSHRLKTASEVNAAILT 
SQSHEKDPKLPSLLKMLIWAQTQLDEKAVYPHINDLSTGKLEDPSE*
>AT1G61150.2 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s LisH dimerisation motif subgroup (InterProIPR013720) CTLH C-terminal to LisH motif (InterProIPR006595) LisH dimerisation motif (InterProIPR006594) CT11-RanBPM (InterProIPR013144) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT4G093001) Has 726 Blast hits to 694 proteins in 137 species Archae - 0 Bacteria - 0 Metazoa - 389 Fungi - 144 Plants - 139 Viruses - 0 Other Eukaryotes - 54 (source NCBI BLink) 
MATSKKMITREEWEKKLNAVKLRKEDMNTLVMNFLVTEGYVEAAEKFQRESGTKPEIDLA 
TITDRMAVKKAVQNGNVEDAIEKVNDLNPEILDTNPELFFHLQQQRLIELIRQGKTEEAL 
EFAQEELAPRGEENQAFLEELEKTVALLVFDDASTCPVKELLDLSHRLKTASEVNAAILT 
SQSHEKDPKLPSLLKMLIWAQTQLDEKAVYPHINDLSTGKLEDPSE*
>AT1G61150.4 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s LisH dimerisation motif subgroup (InterProIPR013720) CTLH C-terminal to LisH motif (InterProIPR006595) LisH dimerisation motif (InterProIPR006594) CT11-RanBPM (InterProIPR013144) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT1G111101) Has 740 Blast hits to 708 proteins in 137 species Archae - 0 Bacteria - 0 Metazoa - 401 Fungi - 144 Plants - 139 Viruses - 0 Other Eukaryotes - 56 (source NCBI BLink) 
MATSKKMITREEWEKKLNAVKLRKEDMNTLVMNFLVTEGYVEAAEKFQRESGTKPEIDLA 
TITDRMAVKKAVQNGNVEDAIEKVNDLNPEILDTNPELFFHLQQQRLIELIRQGKTEEAL 
EFAQEELAPRGEENQAFLEELEKTVALLVFDDASTCPVKELLDLSHRLKTASEVNAAILT 
SQSHEKDPKLPSLLKMLIWAQTQLDEKAVYPHINDLSTGKLEDPSE*
>AT1G61150.4 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s CTLH C-terminal to LisH motif (InterProIPR006595) CT11-RanBPM (InterProIPR013144) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT4G093001) Has 486 Blast hits to 476 proteins in 122 species Archae - 0 Bacteria - 4 Metazoa - 235 Fungi - 76 Plants - 132 Viruses - 0 Other Eukaryotes - 39 (source NCBI BLink) 
MATSKKMITREEWEKKLNAVKLRKEDMNTLVMNFLVTEGYVEAAEKFQRESGTKPEIDLA 
TITDRMAVKKAVQNGNVEDAIEKVNDLNPEILDTNPELFFHLQQQRLIELIRQGKTEEAL 
EFAQEELAPRGEENQAFLEELEKTVALLVFDDASTCPVKELLDLSHRLKTASEVNAAILT 
SQSHEKDPKLPSLLKMLIWAQTQLDEKAVYPHINDLSTGKLEDPSE*
>AT1G61150.4 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s LisH dimerisation motif subgroup (InterProIPR013720) CTLH C-terminal to LisH motif (InterProIPR006595) LisH dimerisation motif (InterProIPR006594) CT11-RanBPM (InterProIPR013144) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT4G093001) Has 726 Blast hits to 694 proteins in 137 species Archae - 0 Bacteria - 0 Metazoa - 389 Fungi - 144 Plants - 139 Viruses - 0 Other Eukaryotes - 54 (source NCBI BLink) 
MATSKKMITREEWEKKLNAVKLRKEDMNTLVMNFLVTEGYVEAAEKFQRESGTKPEIDLA 
TITDRMAVKKAVQNGNVEDAIEKVNDLNPEILDTNPELFFHLQQQRLIELIRQGKTEEAL 
EFAQEELAPRGEENQAFLEELEKTVALLVFDDASTCPVKELLDLSHRLKTASEVNAAILT 
SQSHEKDPKLPSLLKMLIWAQTQLDEKAVYPHINDLSTGKLEDPSE*
>AT1G61150.5 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s LisH dimerisation motif subgroup (InterProIPR013720) CTLH C-terminal to LisH motif (InterProIPR006595) LisH dimerisation motif (InterProIPR006594) CT11-RanBPM (InterProIPR013144) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT1G111101) Has 740 Blast hits to 708 proteins in 137 species Archae - 0 Bacteria - 0 Metazoa - 401 Fungi - 144 Plants - 139 Viruses - 0 Other Eukaryotes - 56 (source NCBI BLink) 
MITREEWEKKLNAVKLRKEDMNTLVMNFLVTEGYVEAAEKFQRESGTKPEIDLATITDRM 
AVKKAVQNGNVEDAIEKVNDLNPEILDTNPELFFHLQQQRLIELIRQGKTEEALEFAQEE 
LAPRGEENQAFLEELEKTVALLVFDDASTCPVKELLDLSHRLKTASEVNAAILTSQSHEK 
DPKLPSLLKMLIWAQTQLDEKAVYPHINDLSTGKLEDPSE*
>AT1G61150.5 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s CTLH C-terminal to LisH motif (InterProIPR006595) CT11-RanBPM (InterProIPR013144) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT4G093001) Has 486 Blast hits to 476 proteins in 122 species Archae - 0 Bacteria - 4 Metazoa - 235 Fungi - 76 Plants - 132 Viruses - 0 Other Eukaryotes - 39 (source NCBI BLink) 
MITREEWEKKLNAVKLRKEDMNTLVMNFLVTEGYVEAAEKFQRESGTKPEIDLATITDRM 
AVKKAVQNGNVEDAIEKVNDLNPEILDTNPELFFHLQQQRLIELIRQGKTEEALEFAQEE 
LAPRGEENQAFLEELEKTVALLVFDDASTCPVKELLDLSHRLKTASEVNAAILTSQSHEK 
DPKLPSLLKMLIWAQTQLDEKAVYPHINDLSTGKLEDPSE*
>AT1G61150.5 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s LisH dimerisation motif subgroup (InterProIPR013720) CTLH C-terminal to LisH motif (InterProIPR006595) LisH dimerisation motif (InterProIPR006594) CT11-RanBPM (InterProIPR013144) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT4G093001) Has 726 Blast hits to 694 proteins in 137 species Archae - 0 Bacteria - 0 Metazoa - 389 Fungi - 144 Plants - 139 Viruses - 0 Other Eukaryotes - 54 (source NCBI BLink) 
MITREEWEKKLNAVKLRKEDMNTLVMNFLVTEGYVEAAEKFQRESGTKPEIDLATITDRM 
AVKKAVQNGNVEDAIEKVNDLNPEILDTNPELFFHLQQQRLIELIRQGKTEEALEFAQEE 
LAPRGEENQAFLEELEKTVALLVFDDASTCPVKELLDLSHRLKTASEVNAAILTSQSHEK 
DPKLPSLLKMLIWAQTQLDEKAVYPHINDLSTGKLEDPSE*
>AT1G61150.6 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s LisH dimerisation motif subgroup (InterProIPR013720) CTLH C-terminal to LisH motif (InterProIPR006595) LisH dimerisation motif (InterProIPR006594) CT11-RanBPM (InterProIPR013144) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT1G111101) Has 740 Blast hits to 708 proteins in 137 species Archae - 0 Bacteria - 0 Metazoa - 401 Fungi - 144 Plants - 139 Viruses - 0 Other Eukaryotes - 56 (source NCBI BLink) 
MATSKKMITREEWEKKLNAVKLRKEDMNTLVMNFLVTEGYVEAAEKFQRESGTKPEIDLA 
TITDRMAVKKAVQNGNVEDAIEKVNDLNPEILDTNPELFFHLQQQRLIELIRQGKTEEAL 
EFAQEELAPRGEENQAFLEELEKTVALLVFDDASTCPVKELLDLSHRLKTASEVNAAILT 
SQSHEKDPKLPSLLKMLIWAQTQLDEKAVYPHINDLSTGKLEDPSE*
>AT1G61150.6 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s CTLH C-terminal to LisH motif (InterProIPR006595) CT11-RanBPM (InterProIPR013144) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT4G093001) Has 486 Blast hits to 476 proteins in 122 species Archae - 0 Bacteria - 4 Metazoa - 235 Fungi - 76 Plants - 132 Viruses - 0 Other Eukaryotes - 39 (source NCBI BLink) 
MATSKKMITREEWEKKLNAVKLRKEDMNTLVMNFLVTEGYVEAAEKFQRESGTKPEIDLA 
TITDRMAVKKAVQNGNVEDAIEKVNDLNPEILDTNPELFFHLQQQRLIELIRQGKTEEAL 
EFAQEELAPRGEENQAFLEELEKTVALLVFDDASTCPVKELLDLSHRLKTASEVNAAILT 
SQSHEKDPKLPSLLKMLIWAQTQLDEKAVYPHINDLSTGKLEDPSE*
>AT1G61150.6 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s LisH dimerisation motif subgroup (InterProIPR013720) CTLH C-terminal to LisH motif (InterProIPR006595) LisH dimerisation motif (InterProIPR006594) CT11-RanBPM (InterProIPR013144) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT4G093001) Has 726 Blast hits to 694 proteins in 137 species Archae - 0 Bacteria - 0 Metazoa - 389 Fungi - 144 Plants - 139 Viruses - 0 Other Eukaryotes - 54 (source NCBI BLink) 
MATSKKMITREEWEKKLNAVKLRKEDMNTLVMNFLVTEGYVEAAEKFQRESGTKPEIDLA 
TITDRMAVKKAVQNGNVEDAIEKVNDLNPEILDTNPELFFHLQQQRLIELIRQGKTEEAL 
EFAQEELAPRGEENQAFLEELEKTVALLVFDDASTCPVKELLDLSHRLKTASEVNAAILT 
SQSHEKDPKLPSLLKMLIWAQTQLDEKAVYPHINDLSTGKLEDPSE*
>AT1G61150.7 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s LisH dimerisation motif subgroup (InterProIPR013720) CTLH C-terminal to LisH motif (InterProIPR006595) LisH dimerisation motif (InterProIPR006594) CT11-RanBPM (InterProIPR013144) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT1G111101) Has 740 Blast hits to 708 proteins in 137 species Archae - 0 Bacteria - 0 Metazoa - 401 Fungi - 144 Plants - 139 Viruses - 0 Other Eukaryotes - 56 (source NCBI BLink) 
MELVCLMFSLSTPFELPYRETAEIDLATITDRMAVKKAVQNGNVEDAIEKVNDLNPEILD 
TNPELFFHLQQQRLIELIRQGKTEEALEFAQEELAPRGEENQAFLEELEKTVALLVFDDA 
STCPVKELLDLSHRLKTASEVNAAILTSQSHEKDPKLPSLLKMLIWAQTQLDEKAVYPHI 
NDLSTGKLEDPSE*
>AT1G61150.7 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s CTLH C-terminal to LisH motif (InterProIPR006595) CT11-RanBPM (InterProIPR013144) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT4G093001) Has 486 Blast hits to 476 proteins in 122 species Archae - 0 Bacteria - 4 Metazoa - 235 Fungi - 76 Plants - 132 Viruses - 0 Other Eukaryotes - 39 (source NCBI BLink) 
MELVCLMFSLSTPFELPYRETAEIDLATITDRMAVKKAVQNGNVEDAIEKVNDLNPEILD 
TNPELFFHLQQQRLIELIRQGKTEEALEFAQEELAPRGEENQAFLEELEKTVALLVFDDA 
STCPVKELLDLSHRLKTASEVNAAILTSQSHEKDPKLPSLLKMLIWAQTQLDEKAVYPHI 
NDLSTGKLEDPSE*
>AT1G61150.7 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s LisH dimerisation motif subgroup (InterProIPR013720) CTLH C-terminal to LisH motif (InterProIPR006595) LisH dimerisation motif (InterProIPR006594) CT11-RanBPM (InterProIPR013144) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT4G093001) Has 726 Blast hits to 694 proteins in 137 species Archae - 0 Bacteria - 0 Metazoa - 389 Fungi - 144 Plants - 139 Viruses - 0 Other Eukaryotes - 54 (source NCBI BLink) 
MELVCLMFSLSTPFELPYRETAEIDLATITDRMAVKKAVQNGNVEDAIEKVNDLNPEILD 
TNPELFFHLQQQRLIELIRQGKTEEALEFAQEELAPRGEENQAFLEELEKTVALLVFDDA 
STCPVKELLDLSHRLKTASEVNAAILTSQSHEKDPKLPSLLKMLIWAQTQLDEKAVYPHI 
NDLSTGKLEDPSE*
>AT1G61150.3 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s LisH dimerisation motif subgroup (InterProIPR013720) CTLH C-terminal to LisH motif (InterProIPR006595) LisH dimerisation motif (InterProIPR006594) CT11-RanBPM (InterProIPR013144) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT4G093001) Has 726 Blast hits to 694 proteins in 137 species Archae - 0 Bacteria - 0 Metazoa - 389 Fungi - 144 Plants - 139 Viruses - 0 Other Eukaryotes - 54 (source NCBI BLink) 
MELVCLMFSLSTPFELPYRETAEIDLATITDRMAVKKAVQNGNVEDAIEKVNDLNPEILD 
TNPELFFHLQQQRLIELIRQGKTEEALEFAQEELAPRGEENQAFLEELEKTVALLVFDDA 
STCPVKELLDLSHRLKTASEVNAAILTSQSHEKDPKLPSLLKMLIWAQTQLDEKAVYPHI 
NDLSTGKLEDPSE*
>AT1G61150.2 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s LisH dimerisation motif subgroup (InterProIPR013720) CTLH C-terminal to LisH motif (InterProIPR006595) LisH dimerisation motif (InterProIPR006594) CT11-RanBPM (InterProIPR013144) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT4G093001) Has 726 Blast hits to 694 proteins in 137 species Archae - 0 Bacteria - 0 Metazoa - 389 Fungi - 144 Plants - 139 Viruses - 0 Other Eukaryotes - 54 (source NCBI BLink) 
MATSKKMITREEWEKKLNAVKLRKEDMNTLVMNFLVTEGYVEAAEKFQRESGTKPEIDLA 
TITDRMAVKKAVQNGNVEDAIEKVNDLNPEILDTNPELFFHLQQQRLIELIRQGKTEEAL 
EFAQEELAPRGEENQAFLEELEKTVALLVFDDASTCPVKELLDLSHRLKTASEVNAAILT 
SQSHEKDPKLPSLLKMLIWAQTQLDEKAVYPHINDLSTGKLEDPSE*
>AT1G61150.4 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s LisH dimerisation motif subgroup (InterProIPR013720) CTLH C-terminal to LisH motif (InterProIPR006595) LisH dimerisation motif (InterProIPR006594) CT11-RanBPM (InterProIPR013144) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT4G093001) Has 726 Blast hits to 694 proteins in 137 species Archae - 0 Bacteria - 0 Metazoa - 389 Fungi - 144 Plants - 139 Viruses - 0 Other Eukaryotes - 54 (source NCBI BLink) 
MATSKKMITREEWEKKLNAVKLRKEDMNTLVMNFLVTEGYVEAAEKFQRESGTKPEIDLA 
TITDRMAVKKAVQNGNVEDAIEKVNDLNPEILDTNPELFFHLQQQRLIELIRQGKTEEAL 
EFAQEELAPRGEENQAFLEELEKTVALLVFDDASTCPVKELLDLSHRLKTASEVNAAILT 
SQSHEKDPKLPSLLKMLIWAQTQLDEKAVYPHINDLSTGKLEDPSE*
>AT1G61150.5 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s LisH dimerisation motif subgroup (InterProIPR013720) CTLH C-terminal to LisH motif (InterProIPR006595) LisH dimerisation motif (InterProIPR006594) CT11-RanBPM (InterProIPR013144) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT4G093001) Has 726 Blast hits to 694 proteins in 137 species Archae - 0 Bacteria - 0 Metazoa - 389 Fungi - 144 Plants - 139 Viruses - 0 Other Eukaryotes - 54 (source NCBI BLink) 
MITREEWEKKLNAVKLRKEDMNTLVMNFLVTEGYVEAAEKFQRESGTKPEIDLATITDRM 
AVKKAVQNGNVEDAIEKVNDLNPEILDTNPELFFHLQQQRLIELIRQGKTEEALEFAQEE 
LAPRGEENQAFLEELEKTVALLVFDDASTCPVKELLDLSHRLKTASEVNAAILTSQSHEK 
DPKLPSLLKMLIWAQTQLDEKAVYPHINDLSTGKLEDPSE*
>AT1G61150.6 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s LisH dimerisation motif subgroup (InterProIPR013720) CTLH C-terminal to LisH motif (InterProIPR006595) LisH dimerisation motif (InterProIPR006594) CT11-RanBPM (InterProIPR013144) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT4G093001) Has 726 Blast hits to 694 proteins in 137 species Archae - 0 Bacteria - 0 Metazoa - 389 Fungi - 144 Plants - 139 Viruses - 0 Other Eukaryotes - 54 (source NCBI BLink) 
MATSKKMITREEWEKKLNAVKLRKEDMNTLVMNFLVTEGYVEAAEKFQRESGTKPEIDLA 
TITDRMAVKKAVQNGNVEDAIEKVNDLNPEILDTNPELFFHLQQQRLIELIRQGKTEEAL 
EFAQEELAPRGEENQAFLEELEKTVALLVFDDASTCPVKELLDLSHRLKTASEVNAAILT 
SQSHEKDPKLPSLLKMLIWAQTQLDEKAVYPHINDLSTGKLEDPSE*
>AT1G61150.7 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s LisH dimerisation motif subgroup (InterProIPR013720) CTLH C-terminal to LisH motif (InterProIPR006595) LisH dimerisation motif (InterProIPR006594) CT11-RanBPM (InterProIPR013144) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT4G093001) Has 726 Blast hits to 694 proteins in 137 species Archae - 0 Bacteria - 0 Metazoa - 389 Fungi - 144 Plants - 139 Viruses - 0 Other Eukaryotes - 54 (source NCBI BLink) 
MELVCLMFSLSTPFELPYRETAEIDLATITDRMAVKKAVQNGNVEDAIEKVNDLNPEILD 
TNPELFFHLQQQRLIELIRQGKTEEALEFAQEELAPRGEENQAFLEELEKTVALLVFDDA 
STCPVKELLDLSHRLKTASEVNAAILTSQSHEKDPKLPSLLKMLIWAQTQLDEKAVYPHI 
NDLSTGKLEDPSE*
>AT1G61150.1 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s LisH dimerisation motif subgroup (InterProIPR013720) CTLH C-terminal to LisH motif (InterProIPR006595) LisH dimerisation motif (InterProIPR006594) CT11-RanBPM (InterProIPR013144) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT4G093001) Has 726 Blast hits to 694 proteins in 137 species Archae - 0 Bacteria - 0 Metazoa - 389 Fungi - 144 Plants - 139 Viruses - 0 Other Eukaryotes - 54 (source NCBI BLink) 
MSLFRIFINQLEEDDEDMATSKKMITREEWEKKLNAVKLRKEDMNTLVMNFLVTEGYVEA 
AEKFQRESGTKPEIDLATITDRMAVKKAVQNGNVEDAIEKVNDLNPEILDTNPELFFHLQ 
QQRLIELIRQGKTEEALEFAQEELAPRGEENQAFLEELEKTVALLVFDDASTCPVKELLD 
LSHRLKTASEVNAAILTSQSHEKDPKLPSLLKMLIWAQTQLDEKAVYPHINDLSTGKLED 
PSE*
>AT1G61150.1 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s CTLH C-terminal to LisH motif (InterProIPR006595) CT11-RanBPM (InterProIPR013144) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT4G093001) Has 486 Blast hits to 476 proteins in 122 species Archae - 0 Bacteria - 4 Metazoa - 235 Fungi - 76 Plants - 132 Viruses - 0 Other Eukaryotes - 39 (source NCBI BLink) 
MSLFRIFINQLEEDDEDMATSKKMITREEWEKKLNAVKLRKEDMNTLVMNFLVTEGYVEA 
AEKFQRESGTKPEIDLATITDRMAVKKAVQNGNVEDAIEKVNDLNPEILDTNPELFFHLQ 
QQRLIELIRQGKTEEALEFAQEELAPRGEENQAFLEELEKTVALLVFDDASTCPVKELLD 
LSHRLKTASEVNAAILTSQSHEKDPKLPSLLKMLIWAQTQLDEKAVYPHINDLSTGKLED 
PSE*
>AT1G61150.3 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s LisH dimerisation motif subgroup (InterProIPR013720) CTLH C-terminal to LisH motif (InterProIPR006595) LisH dimerisation motif (InterProIPR006594) CT11-RanBPM (InterProIPR013144) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT4G093001) Has 726 Blast hits to 694 proteins in 137 species Archae - 0 Bacteria - 0 Metazoa - 389 Fungi - 144 Plants - 139 Viruses - 0 Other Eukaryotes - 54 (source NCBI BLink) 
MELVCLMFSLSTPFELPYRETAEIDLATITDRMAVKKAVQNGNVEDAIEKVNDLNPEILD 
TNPELFFHLQQQRLIELIRQGKTEEALEFAQEELAPRGEENQAFLEELEKTVALLVFDDA 
STCPVKELLDLSHRLKTASEVNAAILTSQSHEKDPKLPSLLKMLIWAQTQLDEKAVYPHI 
NDLSTGKLEDPSE*
>AT1G61150.3 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s CTLH C-terminal to LisH motif (InterProIPR006595) CT11-RanBPM (InterProIPR013144) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT4G093001) Has 486 Blast hits to 476 proteins in 122 species Archae - 0 Bacteria - 4 Metazoa - 235 Fungi - 76 Plants - 132 Viruses - 0 Other Eukaryotes - 39 (source NCBI BLink) 
MELVCLMFSLSTPFELPYRETAEIDLATITDRMAVKKAVQNGNVEDAIEKVNDLNPEILD 
TNPELFFHLQQQRLIELIRQGKTEEALEFAQEELAPRGEENQAFLEELEKTVALLVFDDA 
STCPVKELLDLSHRLKTASEVNAAILTSQSHEKDPKLPSLLKMLIWAQTQLDEKAVYPHI 
NDLSTGKLEDPSE*
>AT1G61150.2 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s LisH dimerisation motif subgroup (InterProIPR013720) CTLH C-terminal to LisH motif (InterProIPR006595) LisH dimerisation motif (InterProIPR006594) CT11-RanBPM (InterProIPR013144) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT4G093001) Has 726 Blast hits to 694 proteins in 137 species Archae - 0 Bacteria - 0 Metazoa - 389 Fungi - 144 Plants - 139 Viruses - 0 Other Eukaryotes - 54 (source NCBI BLink) 
MATSKKMITREEWEKKLNAVKLRKEDMNTLVMNFLVTEGYVEAAEKFQRESGTKPEIDLA 
TITDRMAVKKAVQNGNVEDAIEKVNDLNPEILDTNPELFFHLQQQRLIELIRQGKTEEAL 
EFAQEELAPRGEENQAFLEELEKTVALLVFDDASTCPVKELLDLSHRLKTASEVNAAILT 
SQSHEKDPKLPSLLKMLIWAQTQLDEKAVYPHINDLSTGKLEDPSE*
>AT1G61150.2 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s CTLH C-terminal to LisH motif (InterProIPR006595) CT11-RanBPM (InterProIPR013144) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT4G093001) Has 486 Blast hits to 476 proteins in 122 species Archae - 0 Bacteria - 4 Metazoa - 235 Fungi - 76 Plants - 132 Viruses - 0 Other Eukaryotes - 39 (source NCBI BLink) 
MATSKKMITREEWEKKLNAVKLRKEDMNTLVMNFLVTEGYVEAAEKFQRESGTKPEIDLA 
TITDRMAVKKAVQNGNVEDAIEKVNDLNPEILDTNPELFFHLQQQRLIELIRQGKTEEAL 
EFAQEELAPRGEENQAFLEELEKTVALLVFDDASTCPVKELLDLSHRLKTASEVNAAILT 
SQSHEKDPKLPSLLKMLIWAQTQLDEKAVYPHINDLSTGKLEDPSE*
>AT1G61150.4 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s LisH dimerisation motif subgroup (InterProIPR013720) CTLH C-terminal to LisH motif (InterProIPR006595) LisH dimerisation motif (InterProIPR006594) CT11-RanBPM (InterProIPR013144) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT4G093001) Has 726 Blast hits to 694 proteins in 137 species Archae - 0 Bacteria - 0 Metazoa - 389 Fungi - 144 Plants - 139 Viruses - 0 Other Eukaryotes - 54 (source NCBI BLink) 
MATSKKMITREEWEKKLNAVKLRKEDMNTLVMNFLVTEGYVEAAEKFQRESGTKPEIDLA 
TITDRMAVKKAVQNGNVEDAIEKVNDLNPEILDTNPELFFHLQQQRLIELIRQGKTEEAL 
EFAQEELAPRGEENQAFLEELEKTVALLVFDDASTCPVKELLDLSHRLKTASEVNAAILT 
SQSHEKDPKLPSLLKMLIWAQTQLDEKAVYPHINDLSTGKLEDPSE*
>AT1G61150.4 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s CTLH C-terminal to LisH motif (InterProIPR006595) CT11-RanBPM (InterProIPR013144) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT4G093001) Has 486 Blast hits to 476 proteins in 122 species Archae - 0 Bacteria - 4 Metazoa - 235 Fungi - 76 Plants - 132 Viruses - 0 Other Eukaryotes - 39 (source NCBI BLink) 
MATSKKMITREEWEKKLNAVKLRKEDMNTLVMNFLVTEGYVEAAEKFQRESGTKPEIDLA 
TITDRMAVKKAVQNGNVEDAIEKVNDLNPEILDTNPELFFHLQQQRLIELIRQGKTEEAL 
EFAQEELAPRGEENQAFLEELEKTVALLVFDDASTCPVKELLDLSHRLKTASEVNAAILT 
SQSHEKDPKLPSLLKMLIWAQTQLDEKAVYPHINDLSTGKLEDPSE*
>AT1G61150.5 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s LisH dimerisation motif subgroup (InterProIPR013720) CTLH C-terminal to LisH motif (InterProIPR006595) LisH dimerisation motif (InterProIPR006594) CT11-RanBPM (InterProIPR013144) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT4G093001) Has 726 Blast hits to 694 proteins in 137 species Archae - 0 Bacteria - 0 Metazoa - 389 Fungi - 144 Plants - 139 Viruses - 0 Other Eukaryotes - 54 (source NCBI BLink) 
MITREEWEKKLNAVKLRKEDMNTLVMNFLVTEGYVEAAEKFQRESGTKPEIDLATITDRM 
AVKKAVQNGNVEDAIEKVNDLNPEILDTNPELFFHLQQQRLIELIRQGKTEEALEFAQEE 
LAPRGEENQAFLEELEKTVALLVFDDASTCPVKELLDLSHRLKTASEVNAAILTSQSHEK 
DPKLPSLLKMLIWAQTQLDEKAVYPHINDLSTGKLEDPSE*
>AT1G61150.5 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s CTLH C-terminal to LisH motif (InterProIPR006595) CT11-RanBPM (InterProIPR013144) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT4G093001) Has 486 Blast hits to 476 proteins in 122 species Archae - 0 Bacteria - 4 Metazoa - 235 Fungi - 76 Plants - 132 Viruses - 0 Other Eukaryotes - 39 (source NCBI BLink) 
MITREEWEKKLNAVKLRKEDMNTLVMNFLVTEGYVEAAEKFQRESGTKPEIDLATITDRM 
AVKKAVQNGNVEDAIEKVNDLNPEILDTNPELFFHLQQQRLIELIRQGKTEEALEFAQEE 
LAPRGEENQAFLEELEKTVALLVFDDASTCPVKELLDLSHRLKTASEVNAAILTSQSHEK 
DPKLPSLLKMLIWAQTQLDEKAVYPHINDLSTGKLEDPSE*
>AT1G61150.6 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s LisH dimerisation motif subgroup (InterProIPR013720) CTLH C-terminal to LisH motif (InterProIPR006595) LisH dimerisation motif (InterProIPR006594) CT11-RanBPM (InterProIPR013144) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT4G093001) Has 726 Blast hits to 694 proteins in 137 species Archae - 0 Bacteria - 0 Metazoa - 389 Fungi - 144 Plants - 139 Viruses - 0 Other Eukaryotes - 54 (source NCBI BLink) 
MATSKKMITREEWEKKLNAVKLRKEDMNTLVMNFLVTEGYVEAAEKFQRESGTKPEIDLA 
TITDRMAVKKAVQNGNVEDAIEKVNDLNPEILDTNPELFFHLQQQRLIELIRQGKTEEAL 
EFAQEELAPRGEENQAFLEELEKTVALLVFDDASTCPVKELLDLSHRLKTASEVNAAILT 
SQSHEKDPKLPSLLKMLIWAQTQLDEKAVYPHINDLSTGKLEDPSE*
>AT1G61150.6 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s CTLH C-terminal to LisH motif (InterProIPR006595) CT11-RanBPM (InterProIPR013144) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT4G093001) Has 486 Blast hits to 476 proteins in 122 species Archae - 0 Bacteria - 4 Metazoa - 235 Fungi - 76 Plants - 132 Viruses - 0 Other Eukaryotes - 39 (source NCBI BLink) 
MATSKKMITREEWEKKLNAVKLRKEDMNTLVMNFLVTEGYVEAAEKFQRESGTKPEIDLA 
TITDRMAVKKAVQNGNVEDAIEKVNDLNPEILDTNPELFFHLQQQRLIELIRQGKTEEAL 
EFAQEELAPRGEENQAFLEELEKTVALLVFDDASTCPVKELLDLSHRLKTASEVNAAILT 
SQSHEKDPKLPSLLKMLIWAQTQLDEKAVYPHINDLSTGKLEDPSE*
>AT1G61150.7 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s LisH dimerisation motif subgroup (InterProIPR013720) CTLH C-terminal to LisH motif (InterProIPR006595) LisH dimerisation motif (InterProIPR006594) CT11-RanBPM (InterProIPR013144) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT4G093001) Has 726 Blast hits to 694 proteins in 137 species Archae - 0 Bacteria - 0 Metazoa - 389 Fungi - 144 Plants - 139 Viruses - 0 Other Eukaryotes - 54 (source NCBI BLink) 
MELVCLMFSLSTPFELPYRETAEIDLATITDRMAVKKAVQNGNVEDAIEKVNDLNPEILD 
TNPELFFHLQQQRLIELIRQGKTEEALEFAQEELAPRGEENQAFLEELEKTVALLVFDDA 
STCPVKELLDLSHRLKTASEVNAAILTSQSHEKDPKLPSLLKMLIWAQTQLDEKAVYPHI 
NDLSTGKLEDPSE*
>AT1G61150.7 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s CTLH C-terminal to LisH motif (InterProIPR006595) CT11-RanBPM (InterProIPR013144) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT4G093001) Has 486 Blast hits to 476 proteins in 122 species Archae - 0 Bacteria - 4 Metazoa - 235 Fungi - 76 Plants - 132 Viruses - 0 Other Eukaryotes - 39 (source NCBI BLink) 
MELVCLMFSLSTPFELPYRETAEIDLATITDRMAVKKAVQNGNVEDAIEKVNDLNPEILD 
TNPELFFHLQQQRLIELIRQGKTEEALEFAQEELAPRGEENQAFLEELEKTVALLVFDDA 
STCPVKELLDLSHRLKTASEVNAAILTSQSHEKDPKLPSLLKMLIWAQTQLDEKAVYPHI 
NDLSTGKLEDPSE*
>AT4G33200.1 |  XI-I motor/ protein binding 
MRNCLPMELNLRKGDKVWVEDKDLAWIAADVLDSFDNKLHVETSTGKKVFVSPEKLFRRD 
PDDEEHNGVDDMTKLTYLHEAGVLYNLQRRYALNDIYTYTGSILIAVNPFKKLPHLYNGH 
MMEQYMGAPFGELSPHVFAVSDVAYRAMIDDSRSQSILVSGESGAGKTETTKLIMQYLTF 
VGGRATDDDRSVEQQVLESNPLLEAFGNAKTVRNDNSSRFGKFVEIQFDTNGRISGAAIR 
TYLLERSRVVRITDPERNYHCFYQLCASGNDAEKYKLSNPRQFHYLNQSKTYELEGVSSA 
EEYKNTRRAMDIVGISQDEQEGIFRTLAAILHLGNVEFSSGREHDSSVVKDPESRHHLQM 
AADLFKCDANLLLASLCTRSILTREGIIIKALDPNAAVTSRDTLAKTVYAHLFDWLVDKI 
NKSVGQDPESRFQIGVLDIYGFECFKNNSFEQFCINFANEKLQQHFNEHVFKMEQDEYRK 
EEINWSYIEFIDNQDVLDLIEKKPIGVIALLDEACMFPRSTHESFSMKLFQNFRFHPRLE 
KPKFSETDFTLSHYAGKVTYQTEAFLDKNRDYTIVEHCNLLSSSKCPFVAGIFPSAPEES 
TRSSYKFSSVSSRFKQQLQALMETLSKTEPHYVRCVKPNSLNRPQKFESLSVLHQLRCGG 
VLEAVRISLAGYPTRRNYSDFVDRFGLLAPEFMDESNDEQALTEKILSKLGLGNYQLGRT 
KVFLRAGQIGILDSRRAEVLDASARLIQRRLRTFVTHQNFISARASAISIQAYCRGCLSR 
NAYATRRNAAAAVLVQKHVRRWLSRCAFVKLVSAAIVLQSCIRADSTRLKFSHQKEHRAA 
SLIQAHWRIHKFRSAFRHRQSSIIAIQCRWRQKLAKREFRKLKQVANEAGALRLAKTKLE 
KRLEDLEWRLQLEKRLRTSGEEAKSSEISKLQKTLESFSLKLDAARLATINECNKNAVLE 
KQLDISMKEKSAVERELNGMVELKKDNALLKNSMNSLEKKNRVLEKELLNAKTNCNNTLQ 
KLKEAEKRCSELQTSVQSLEEKLSHLENENQVLMQKTLITSPERIGQILGEKHSSAVVPA 
QNDRRSVFETPTPSKHIMPFSHSLSESRRSKLTAERNLENYELLSRCIKENLGFNDDKPL 
AACVIYKCLLHWRAFESESTAIFNIIIEGINEALKGGDENGVLPYWLSNASALLCLLQRN 
LRSNSFLNASAQRSGRAAYGVKSPFKLHGPDDGASHIEARYPALLFKQQLTACVEKIYGL 
IRDNLKKELSPLLGSCIQAPKASRGIAGKSRSPGGVPQQSPSSQWESILKFLDSLMSRLR 
ENHVPSFFIRKLVTQVFSFINLSLFNSLLLRRECCTFSNGEYVKSGISELEKWIANAKEE 
FAGTSWHELNYIRQAVGFLVIHQKKKKSLDEIRQDLCPVLTIRQIYRISTMYWDDKYGTQ 
SVSSEVVSQMRVLVDKDNQKQTSNSFLLDDDMSIPFSAEDIDKAIPVLDPSEIEPPKFVS 
EYTCAQSLVKKPSIASTSKQII*
>AT1G70290.1 |  ATTPS8 alphaalpha-trehalose-phosphate synthase (UDP-forming)/ transferase transferring glycosyl groups / trehalose-phosphatase 
MVSRSCANFLDLSSWDLLDFPQTPRTLPRVMTVPGIITDVDGDTTSEVTSTSGGSRERKI 
IVANMLPLQSKRDAETGKWCFNWDEDSLQLQLRDGFSSETEFLYVGSLNVDIETNEQEEV 
SQKLLEEFNCVATFLSQELQEMFYLGFCKHQLWPLFHYMLPMFPDHGDRFDRRLWQAYVS 
ANKIFSDRVMEVINPEDDYVWIQDYHLMVLPTFLRKRFNRIKLGFFLHSPFPSSEIYRTL 
PVRDEILRGLLNCDLIGFHTFDYARHFLSCCSRMLGLDYESKRGHIGLDYFGRTVYIKIL 
PVGVHMGRLESVLSLDSTAAKTKEIQEQFKGKKLVLGIDDMDIFKGISLKLIAMEHLFET 
YWHLKGKVVLVQIVNPARSSGKDVEEAKRETYETARRINERYGTSDYKPIVLIDRLVPRS 
EKTAYYAAADCCLVNAVRDGMNLVPYKYIVCRQGTRSNKAVVDSSPRTSTLVVSEFIGCS 
PSLSGAIRVNPWDVDAVAEAVNSALKMSETEKQLRHEKHYHYISTHDVGYWAKSFMQDLE 
RACRDHYSKRCWGIGFGLGFRVLSLSPSFRKLSVEHIVPVYRKTQRRAIFLDYDGTLVPE 
SSIVQDPSNEVVSVLKALCEDPNNTVFIVSGRGRESLSNWLSPCENLGIAAEHGYFIRWK 
SKDEWETCYSPTDTEWRSMVEPVMRSYMEATDGTSIEFKESALVWHHQDADPDFGSCQAK 
EMLDHLESVLANEPVVVKRGQHIVEVKPQGVSKGLAAEKVIREMVERGEPPEMVMCIGDD 
RSDEDMFESILSTVTNPELLVQPEVFACTVGRKPSKAKYFLDDEADVLKLLRGLGDSSSS 
LKPSSSHTQVAFESIV*
>AT5G48010.1 |  THAS1 (THALIANOL SYNTHASE 1) catalytic/ thalianol synthase 
MWRLRTGPKAGEDTHLFTTNNYAGRQIWEFDANAGSPQEIAEVEDARHKFSDNTSRFKTT 
ADLLWRMQFLREKKFEQKIPRVIIEDARKIKYEDAKTALKRGLLYFTALQADDGHWPAEN 
SGPNFYTPPFLICLYITGHLEKIFTPEHVKELLRHIYNMQNEDGGWGLHVESHSVMFCTV 
INYVCLRIVGEEVGHDDQRNGCAKAHKWIMDHGGATYTPLIGKALLSVLGVYDWSGCNPI 
PPEFWLLPSSFPVNGGTLWIYLRDTFMGLSYLYGKKFVAPPTPLILQLREELYPEPYAKI 
NWTQTRNRCGKEDLYYPRSFLQDLFWKSVHMFSESILDRWPLNKLIRQRALQSTMALIHY 
HDESTRYITGGCLPKAFHMLACWIEDPKSDYFKKHLARVREYIWIGEDGLKIQSFGSQLW 
DTALSLHALLDGIDDHDVDDEIKTTLVKGYDYLKKSQITENPRGDHFKMFRHKTKGGWTF 
SDQDQGWPVSDCTAESLECCLFFESMPSELIGKKMDVEKLYDAVDYLLYLQSDNGGIAAW 
QPVEGKAWLELLNIMIFRYVECTGSAIAALTQFNKQFPGYKNVEVKRFITKAAKYIEDMQ 
TVDGSWYGNWGVCFIYGTFFAVRGLVAAGKTYSNCEAIRKAVRFLLDTQNPEGGWGESFL 
SCPSKKYTPLKGNSTNVVQTAQALMVLIMGDQMERDPLPVHRAAQVLINSQLDNGDFPQQ 
EIMGTFMRTVMLHFPTYRNTFSLWALTHYTHALRRLLP*
>AT5G48010.2 |  THAS1 (THALIANOL SYNTHASE 1) catalytic/ thalianol synthase 
MWRLRTGPKAGEDTHLFTTNNYAGRQIWEFDANAGSPQEIAEVEDARHKFSDNTSRFKTT 
ADLLWRMQFLREKKFEQKIPRVIIEDARKIKYEDAKTALKRGLLYFTALQADDGHWPAEN 
SGPNFYTPPFLICLYITGHLEKIFTPEHVKELLRHIYNMQNEDGGWGLHVESHSVMFCTV 
INYVCLRIVGEEVGHDDQRNGCAKAHKWIMDHGGATYTPLIGKALLSVLGVYDWSGCNPI 
PPEFWLLPSSFPVNGGTLWIYLRDTFMGLSYLYGKKFVAPPTPLILQLREELYPEPYAKI 
NWTQTRNRCGKEDLYYPRSFLQDLFWKSVHMFSESILDRWPLNKLIRQRALQSTMALIHY 
HDESTRYITGGCLPKAFHMLACWIEDPKSDYFKKHLARVREYIWIGEDGLKIQSFGSQLW 
DTALSLHALLDGIDDHDVDDEIKTTLVKGYDYLKKSQITENPRGDHFKMFRHKTKGGWTF 
SDQDQGWPVSDCTAESLECCLFFESMPSELIGKKMDVEKLYDAVDYLLYLQSDNGGIAAW 
QPVEGKAWLEWLSPVEFLEDTIVEYEYVECTGSAIAALTQFNKQFPGYKNVEVKRFITKA 
AKYIEDMQTVDGSWYGNWGVCFIYGTFFAVRGLVAAGKTYSNCEAIRKAVRFLLDTQNPE 
GGWGESFLSCPSKKYTPLKGNSTNVVQTAQALMVLIMGDQMERDPLPVHRAAQVLINSQL 
DNGDFPQQEIMGTFMRTVMLHFPTYRNTFSLWALTHYTHALRRLLP*
>AT1G30090.1 |  kelch repeat-containing F-box family protein 
MQRVRVSSQRAVVHKLGDSQMTLSPKFRVAASIQSTLFDRSSELELSLRGEPLIPGLPDD 
VALNCLLRVPVQSHVSSKSVCKRWHLLFGTKETFFAKRKEFGFKDPWLFVVGFSRCTGKI 
QWKVLDLRNLTWHEIPAMPCRDKVCPHGFRSVSMPREGTMFVCGGMVSDSDCPLDLVLKY 
DMVKNHWTVTNKMITARSFFASGVIDGMIYAAGGNAADLYELDCAEVLNPLDGNWRPVSN 
MVAHMASYDTAVLNGKLLVTEGWLWPFFVSPRGQVYDPRTDQWETMSMGLREGWTGTSVV 
IYDRLFIVSELERMKMKVYDPVTDSWETINGPELPEQICRPFAVNCYGNRVYVVGRNLHL 
AVGNIWQSENKFAVRWEVVESPERYADITPSNSQILFA*
>AT2G22690.1 |  protein binding / zinc ion binding 
MELKNVKDAFDRVTKKQKLCYSKTHEVVDKMSQEIDKALKTIQEDNHESVVADLKKTFEE 
IAPINLLEASQKEINGVLTKYPKALDKTLNPDISTAYRNVKFDTHTVHQILAQFFYRQGM 
YDVGDCFISETGEVKPESSVTKAFMEMNMILEAMKERDLGPALKWVASNSDKLKEAKSDL 
ELKLHSLHFLEIAKDKTSEEAINYARKHFATYSADSCCFPEIQKLMCSLLWIRNLNKSPY 
SEFLSPVLWTNAAKELTRQYCILLGESPESPLSVTVAAGSQVLPTFLKYLNVLPEKRKEW 
QTMEQLLVPVELSEEYRFYSVFVCPVSKEHSSEDNPPMRLACGHVLCKQSINRMSRNGSR 
SFKCPYCPTDIDASQCKQLYF*
>AT2G22690.2 |  protein binding / zinc ion binding 
MELKNVKDAFDRVTKKQKLCYSKTHEVVDKMSQEIDKALKTIQEDNHESVVADLKKTFEE 
IAPINLLEASQKEINGVLTKYPKALDKTLNPDISTAYRNVKFDTHTVHQILAQFFYRQGM 
YDVGDCFISETGEVKPESSVTKAFMEMNMILEAMKERDLGPALKWVASNSDKLKEAKSDL 
ELKLHSLHFLEIAKDKTSEEAINYARKHFATYSADSCCFPEIQKLMCSLLWIRNLNKSPY 
SEFLSPVLWTNAAKELTRQYCILLGESPESPLSVTVAAGSQVLPTFLKYLNVLPEKRKEW 
QTMEQLLVPVELSEEYRFYSVFVCPVSKEHSSEDNPPMRLACGHVLCKQSINRMSRNGSR 
SFKCPYCPTDIDASQCKQLYF*
>AT5G09630.1 |  protein binding / zinc ion binding 
MDVTGTVTVRDAFDRVSKKQKLYHSVTQDVIDLVCDGIQDTLTRIQLGNDDGVEPESVLT 
ELRRKLDALLPIIQLQKSHKETKWSLSKLVKLLEVSYHPDISLACFSVDFDINLVNKILI 
HHCYREGLFDVGDCLVKEAGREEETEVRSQFLEFHQIVDSLKLRNIEPAMRWIFANRGKL 
KQKSSKLEFKLLSLKYCDILREGKSDDALEYARTHFTQYPLHFKEIQKLITCLLWIGNFE 
KSPYAEIVSPSCWDKVTKELIMEYHHLLDQPINSPLKVALSAGYESLPSLLKLVHLMALT 
KQEWQAMKQLPVPLELGNEYKFHSAFVCPVSRDQSSEENPPMQLPCGHVISKQSMMRLSK 
NCAFRTFKCPYCPAETLASACRQLYF*