>AT5G16300.1 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s Vps51/Vps67 (InterProIPR014812) Has 204 Blast hits to 183 proteins in 76 species Archae - 0 Bacteria - 0 Metazoa - 127 Fungi - 34 Plants - 24 Viruses - 0 Other Eukaryotes - 19 (source NCBI BLink) 
MRMSSASAGEYRPSAVSLSSNGGGQRDAESLFRTKPMSEIRIVESATRKNIEDKKEELRQ 
LVGTRYRDLIDSADSIVHMKSLCESISANISSIHGNIRSLSSSSVAETPKLASLNPVRVN 
VYGIACRVKYLVDTPENIWGCLDESMFLEAAGRYMRAQHVQQRLIKLEGCGGGVAEVDQS 
KLLANFPLLEHQWQIVESFKAQISQRSHERLLDPGLGLGAYVDALTAVAVVDELDPEQVL 
ELFLDSRKTWILQKLNACTGEDAGEVVLVFCDVLSVIQVTVGQVGELFLQALTDMPLFYK 
TILSTPPASQLFGGIPNPEEEVELWKSFRDKLESVMLILDKNDVSKSCLTWLRECGGQIV 
GKVSGKHLIEAIVTGAELGSAEKLIRETMDSKDVLRGSLDWLKSVFGSEVELPWNRIREL 
VLGDDLNLWDEIFEKAFVERMKSIIDSKFENLTKAVNVADSVHAYSEITGEKINFQAYLN 
RPSTGGGVWFIEPNSKKVGLISGNKSSPEESDFQSCLTAYFGPEVSQMRDAVDRRCHSVL 
EDLLSFFESEKAGPRLKDLAPYVQNKCYDSVSALLADVDKELEFLCAAVKKENKDSEAIP 
PAIIIEKSLFMGRLLFALLNHSKHVPLILGSPRLWCRETMTAVSDKLSSLLRQPRFSSNT 
PATADSPGKQLHTDLRKQTSLAVAALLGAEEKTSPKFEELNRTMRDLCIKAHTLWIKWLS 
DELSAILLRDLRSDDGLSATTPLRGWEETIVKQEQDESQSELKISLPSLPSLYMISFLCR 
ASEEIHRIGGHVLDRSILQKFASSLLEKITIIYEDFLSAREASEPQISEKGVLQILLDLR 
FAADVLSGGDTSTNVETPKSTINRSAYRRRQDQQKTKLVNRGRIDGVTSQLTQKLDPIDW 
LTYEPYLWENEKQSYLRHAVLFGFFVQLNRMYTDTAQKLSINIESNIMPCSTVPRFKYLP 
ISAPALSSRSTNKVSIPVTSNDASARNSWKAFTNGEQSQTSDLEENSNFGVAFKSFMQES 
TLKLGSILTDGQVGIFKDRSAAAMSTFGDILPAQAAGLLSSFTNTRSE*
>AT5G16300.1 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s Vps51/Vps67 (InterProIPR014812) Has 199 Blast hits to 182 proteins in 76 species Archae - 0 Bacteria - 0 Metazoa - 122 Fungi - 34 Plants - 24 Viruses - 0 Other Eukaryotes - 19 (source NCBI BLink) 
MRMSSASAGEYRPSAVSLSSNGGGQRDAESLFRTKPMSEIRIVESATRKNIEDKKEELRQ 
LVGTRYRDLIDSADSIVHMKSLCESISANISSIHGNIRSLSSSSVAETPKLASLNPVRVN 
VYGIACRVKYLVDTPENIWGCLDESMFLEAAGRYMRAQHVQQRLIKLEGCGGGVAEVDQS 
KLLANFPLLEHQWQIVESFKAQISQRSHERLLDPGLGLGAYVDALTAVAVVDELDPEQVL 
ELFLDSRKTWILQKLNACTGEDAGEVVLVFCDVLSVIQVTVGQVGELFLQALTDMPLFYK 
TILSTPPASQLFGGIPNPEEEVELWKSFRDKLESVMLILDKNDVSKSCLTWLRECGGQIV 
GKVSGKHLIEAIVTGAELGSAEKLIRETMDSKDVLRGSLDWLKSVFGSEVELPWNRIREL 
VLGDDLNLWDEIFEKAFVERMKSIIDSKFENLTKAVNVADSVHAYSEITGEKINFQAYLN 
RPSTGGGVWFIEPNSKKVGLISGNKSSPEESDFQSCLTAYFGPEVSQMRDAVDRRCHSVL 
EDLLSFFESEKAGPRLKDLAPYVQNKCYDSVSALLADVDKELEFLCAAVKKENKDSEAIP 
PAIIIEKSLFMGRLLFALLNHSKHVPLILGSPRLWCRETMTAVSDKLSSLLRQPRFSSNT 
PATADSPGKQLHTDLRKQTSLAVAALLGAEEKTSPKFEELNRTMRDLCIKAHTLWIKWLS 
DELSAILLRDLRSDDGLSATTPLRGWEETIVKQEQDESQSELKISLPSLPSLYMISFLCR 
ASEEIHRIGGHVLDRSILQKFASSLLEKITIIYEDFLSAREASEPQISEKGVLQILLDLR 
FAADVLSGGDTSTNVETPKSTINRSAYRRRQDQQKTKLVNRGRIDGVTSQLTQKLDPIDW 
LTYEPYLWENEKQSYLRHAVLFGFFVQLNRMYTDTAQKLSINIESNIMPCSTVPRFKYLP 
ISAPALSSRSTNKVSIPVTSNDASARNSWKAFTNGEQSQTSDLEENSNFGVAFKSFMQES 
TLKLGSILTDGQVGIFKDRSAAAMSTFGDILPAQAAGLLSSFTNTRSE*
>AT5G16300.1 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s Vps51/Vps67 (InterProIPR014812) Has 201 Blast hits to 182 proteins in 75 species Archae - 0 Bacteria - 0 Metazoa - 125 Fungi - 34 Plants - 23 Viruses - 0 Other Eukaryotes - 19 (source NCBI BLink) 
MRMSSASAGEYRPSAVSLSSNGGGQRDAESLFRTKPMSEIRIVESATRKNIEDKKEELRQ 
LVGTRYRDLIDSADSIVHMKSLCESISANISSIHGNIRSLSSSSVAETPKLASLNPVRVN 
VYGIACRVKYLVDTPENIWGCLDESMFLEAAGRYMRAQHVQQRLIKLEGCGGGVAEVDQS 
KLLANFPLLEHQWQIVESFKAQISQRSHERLLDPGLGLGAYVDALTAVAVVDELDPEQVL 
ELFLDSRKTWILQKLNACTGEDAGEVVLVFCDVLSVIQVTVGQVGELFLQALTDMPLFYK 
TILSTPPASQLFGGIPNPEEEVELWKSFRDKLESVMLILDKNDVSKSCLTWLRECGGQIV 
GKVSGKHLIEAIVTGAELGSAEKLIRETMDSKDVLRGSLDWLKSVFGSEVELPWNRIREL 
VLGDDLNLWDEIFEKAFVERMKSIIDSKFENLTKAVNVADSVHAYSEITGEKINFQAYLN 
RPSTGGGVWFIEPNSKKVGLISGNKSSPEESDFQSCLTAYFGPEVSQMRDAVDRRCHSVL 
EDLLSFFESEKAGPRLKDLAPYVQNKCYDSVSALLADVDKELEFLCAAVKKENKDSEAIP 
PAIIIEKSLFMGRLLFALLNHSKHVPLILGSPRLWCRETMTAVSDKLSSLLRQPRFSSNT 
PATADSPGKQLHTDLRKQTSLAVAALLGAEEKTSPKFEELNRTMRDLCIKAHTLWIKWLS 
DELSAILLRDLRSDDGLSATTPLRGWEETIVKQEQDESQSELKISLPSLPSLYMISFLCR 
ASEEIHRIGGHVLDRSILQKFASSLLEKITIIYEDFLSAREASEPQISEKGVLQILLDLR 
FAADVLSGGDTSTNVETPKSTINRSAYRRRQDQQKTKLVNRGRIDGVTSQLTQKLDPIDW 
LTYEPYLWENEKQSYLRHAVLFGFFVQLNRMYTDTAQKLSINIESNIMPCSTVPRFKYLP 
ISAPALSSRSTNKVSIPVTSNDASARNSWKAFTNGEQSQTSDLEENSNFGVAFKSFMQES 
TLKLGSILTDGQVGIFKDRSAAAMSTFGDILPAQAAGLLSSFTNTRSE*
>AT5G16300.2 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s Vps51/Vps67 (InterProIPR014812) Has 204 Blast hits to 183 proteins in 76 species Archae - 0 Bacteria - 0 Metazoa - 127 Fungi - 34 Plants - 24 Viruses - 0 Other Eukaryotes - 19 (source NCBI BLink) 
MRMSSASAGEYRPSAVSLSSNGGGQRDAESLFRTKPMSEIRIVESATRKNIEDKKEELRQ 
LVGTRYRDLIDSADSIVHMKSLCESISANISSIHGNIRSLSSSSVAETPKLASLNPVRVN 
VYGIACRVKYLVDTPENIWGCLDESMFLEAAGRYMRAQHVQQRLIKLEGCGGGVAEVDQS 
KLLANFPLLEHQWQIVESFKAQISQRSHERLLDPGLGLGAYVDALTAVAVVDELDPEQVL 
ELFLDSRKTWILQKLNACTGEDAGEVVLVFCDVLSVIQVTVGQVGELFLQALTDMPLFYK 
TILSTPPASQLFGGIPNPEEEVELWKSFRDKLESVMLILDKNDVSKSCLTWLRECGGQIV 
GKVSGKHLIEAIVTGAELGSAEKLIRETMDSKDVLRGSLDWLKSVFGSEVELPWNRIREL 
VLGDDLNLWDEIFEKAFVERMKSIIDSKFENLTKAVNVADSVHAYSEITGEKINFQAYLN 
RPSTGGGVWFIEPNSKKVGLISGNKSSPEESDFQSCLTAYFGPEVSQMRDAVDRRCHSVL 
EDLLSFFESEKAGPRLKDLAPYVQNKCYDSVSALLADVDKELEFLCAAVKKENKDSEAIP 
PAIIIEKSLFMGRLLFALLNHSKHVPLILGSPRLWCRETMTAVSDKLSSLLRQPRFSSNT 
PATADSPGKQLHTDLRKQTSLAVAALLGAEEKTSPKFEELNRTMRDLCIKAHTLWIKWLS 
DELSAILLRDLRSDDGLSATTPLRGWEETIVKQEQDESQSELKISLPSLPSLYMISFLCR 
ASEEIHRIGGHVLDRSILQKFASSLLEKITIIYEDFLSAREASEPQISEKGVLQILLDLR 
FAADVLSGGDTSTNVETPKSTINRSAYRRRQDQQKTKLVNRGRIDGVTSQLTQKLDPIDW 
LTYEPYLWENEKQSYLRHAVLFGFFVQLNRMYTDTAQKLSINIESNIMPCSTVPRFKYLP 
ISAPALSSRSTNKVSIPVTSNDASESTLKLGSILTDGQVGIFKDRSAAAMSTFGDILPAQ 
AAGLLSSFTNTRSE*
>AT5G16300.2 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s Vps51/Vps67 (InterProIPR014812) Has 199 Blast hits to 182 proteins in 76 species Archae - 0 Bacteria - 0 Metazoa - 122 Fungi - 34 Plants - 24 Viruses - 0 Other Eukaryotes - 19 (source NCBI BLink) 
MRMSSASAGEYRPSAVSLSSNGGGQRDAESLFRTKPMSEIRIVESATRKNIEDKKEELRQ 
LVGTRYRDLIDSADSIVHMKSLCESISANISSIHGNIRSLSSSSVAETPKLASLNPVRVN 
VYGIACRVKYLVDTPENIWGCLDESMFLEAAGRYMRAQHVQQRLIKLEGCGGGVAEVDQS 
KLLANFPLLEHQWQIVESFKAQISQRSHERLLDPGLGLGAYVDALTAVAVVDELDPEQVL 
ELFLDSRKTWILQKLNACTGEDAGEVVLVFCDVLSVIQVTVGQVGELFLQALTDMPLFYK 
TILSTPPASQLFGGIPNPEEEVELWKSFRDKLESVMLILDKNDVSKSCLTWLRECGGQIV 
GKVSGKHLIEAIVTGAELGSAEKLIRETMDSKDVLRGSLDWLKSVFGSEVELPWNRIREL 
VLGDDLNLWDEIFEKAFVERMKSIIDSKFENLTKAVNVADSVHAYSEITGEKINFQAYLN 
RPSTGGGVWFIEPNSKKVGLISGNKSSPEESDFQSCLTAYFGPEVSQMRDAVDRRCHSVL 
EDLLSFFESEKAGPRLKDLAPYVQNKCYDSVSALLADVDKELEFLCAAVKKENKDSEAIP 
PAIIIEKSLFMGRLLFALLNHSKHVPLILGSPRLWCRETMTAVSDKLSSLLRQPRFSSNT 
PATADSPGKQLHTDLRKQTSLAVAALLGAEEKTSPKFEELNRTMRDLCIKAHTLWIKWLS 
DELSAILLRDLRSDDGLSATTPLRGWEETIVKQEQDESQSELKISLPSLPSLYMISFLCR 
ASEEIHRIGGHVLDRSILQKFASSLLEKITIIYEDFLSAREASEPQISEKGVLQILLDLR 
FAADVLSGGDTSTNVETPKSTINRSAYRRRQDQQKTKLVNRGRIDGVTSQLTQKLDPIDW 
LTYEPYLWENEKQSYLRHAVLFGFFVQLNRMYTDTAQKLSINIESNIMPCSTVPRFKYLP 
ISAPALSSRSTNKVSIPVTSNDASESTLKLGSILTDGQVGIFKDRSAAAMSTFGDILPAQ 
AAGLLSSFTNTRSE*
>AT5G16300.2 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s Vps51/Vps67 (InterProIPR014812) Has 201 Blast hits to 182 proteins in 75 species Archae - 0 Bacteria - 0 Metazoa - 125 Fungi - 34 Plants - 23 Viruses - 0 Other Eukaryotes - 19 (source NCBI BLink) 
MRMSSASAGEYRPSAVSLSSNGGGQRDAESLFRTKPMSEIRIVESATRKNIEDKKEELRQ 
LVGTRYRDLIDSADSIVHMKSLCESISANISSIHGNIRSLSSSSVAETPKLASLNPVRVN 
VYGIACRVKYLVDTPENIWGCLDESMFLEAAGRYMRAQHVQQRLIKLEGCGGGVAEVDQS 
KLLANFPLLEHQWQIVESFKAQISQRSHERLLDPGLGLGAYVDALTAVAVVDELDPEQVL 
ELFLDSRKTWILQKLNACTGEDAGEVVLVFCDVLSVIQVTVGQVGELFLQALTDMPLFYK 
TILSTPPASQLFGGIPNPEEEVELWKSFRDKLESVMLILDKNDVSKSCLTWLRECGGQIV 
GKVSGKHLIEAIVTGAELGSAEKLIRETMDSKDVLRGSLDWLKSVFGSEVELPWNRIREL 
VLGDDLNLWDEIFEKAFVERMKSIIDSKFENLTKAVNVADSVHAYSEITGEKINFQAYLN 
RPSTGGGVWFIEPNSKKVGLISGNKSSPEESDFQSCLTAYFGPEVSQMRDAVDRRCHSVL 
EDLLSFFESEKAGPRLKDLAPYVQNKCYDSVSALLADVDKELEFLCAAVKKENKDSEAIP 
PAIIIEKSLFMGRLLFALLNHSKHVPLILGSPRLWCRETMTAVSDKLSSLLRQPRFSSNT 
PATADSPGKQLHTDLRKQTSLAVAALLGAEEKTSPKFEELNRTMRDLCIKAHTLWIKWLS 
DELSAILLRDLRSDDGLSATTPLRGWEETIVKQEQDESQSELKISLPSLPSLYMISFLCR 
ASEEIHRIGGHVLDRSILQKFASSLLEKITIIYEDFLSAREASEPQISEKGVLQILLDLR 
FAADVLSGGDTSTNVETPKSTINRSAYRRRQDQQKTKLVNRGRIDGVTSQLTQKLDPIDW 
LTYEPYLWENEKQSYLRHAVLFGFFVQLNRMYTDTAQKLSINIESNIMPCSTVPRFKYLP 
ISAPALSSRSTNKVSIPVTSNDASESTLKLGSILTDGQVGIFKDRSAAAMSTFGDILPAQ 
AAGLLSSFTNTRSE*
>AT5G16300.3 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s Vps51/Vps67 (InterProIPR014812) Has 204 Blast hits to 183 proteins in 76 species Archae - 0 Bacteria - 0 Metazoa - 127 Fungi - 34 Plants - 24 Viruses - 0 Other Eukaryotes - 19 (source NCBI BLink) 
MRMSSASAGEYRPSAVSLSSNGGGQRDAESLFRTKPMSEIRIVESATRKNIEDKKEELRQ 
LVGTRYRDLIDSADSIVHMKSLCESISANISSIHGNIRSLSSSSVAETPKLASLNPVRVN 
VYGIACRVKYLVDTPENIWGCLDESMFLEAAGRYMRAQHVQQRLIKLEGCGGGVAEVDQS 
KLLANFPLLEHQWQIVESFKAQISQRSHERLLDPGLGLGAYVDALTAVAVVDELDPEQVL 
ELFLDSRKTWILQKLNACTGEDAGEVVLVFCDVLSVIQVTVGQVGELFLQALTDMPLFYK 
TILSTPPASQLFGGIPNPEEEVELWKSFRDKLESVMLILDKNDVSKSCLTWLRECGGQIV 
GKVSGKHLIEAIVTGAELGSAEKLIRETMDSKDVLRGSLDWLKSVFGSEVELPWNRIREL 
VLGDDLNLWDEIFEKAFVERMKSIIDSKFENLTKAVNVADSVHAYSEITGEKINFQAYLN 
RPSTGGGVWFIEPNSKKVGLISGNKSSPEESDFQSCLTAYFGPEVSQMRDAVDRRCHSVL 
EDLLSFFESEKAGPRLKDLAPYVQNKCYDSVSALLADVDKELEFLCAAVKKENKDSEAIP 
PAIIIEKSLFMGRLLFALLNHSKHVPLILGSPRLWCRETMTAVSDKLSSLLRQPRFSSNT 
PATADSPGKQLHTDLRKQTSLAVAALLGAEEKTSPKFEELNRTMRDLCIKAHTLWIKWLS 
DELSAILLRDLRSDDGLSATTPLRGWEETIVKQEQDESQSELKISLPSLPSLYMISFLCR 
ASEEIHRIGGHVLDRSILQKFASSLLEKITIIYEDFLSAREASEPQISEKGVLQILLDLR 
FAADVLSGGDTSTNVETPKSTINRSAYRRRQDQQKTKLVNRGRIDGVTSQLTQKLDPIDW 
LTYEPYLWENEKQSYLRHAVLFGFFVQLNRMYTDTAQKLSINIESNIMPCSTVPRFKYLP 
ISAPALSSRSTNKVSIPVTSNDASARNSWKAFTNGEQSQTSDLEENSNFGVAFKSFMQVK 
HIETGININ*
>AT5G16300.3 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s Vps51/Vps67 (InterProIPR014812) Has 199 Blast hits to 182 proteins in 76 species Archae - 0 Bacteria - 0 Metazoa - 122 Fungi - 34 Plants - 24 Viruses - 0 Other Eukaryotes - 19 (source NCBI BLink) 
MRMSSASAGEYRPSAVSLSSNGGGQRDAESLFRTKPMSEIRIVESATRKNIEDKKEELRQ 
LVGTRYRDLIDSADSIVHMKSLCESISANISSIHGNIRSLSSSSVAETPKLASLNPVRVN 
VYGIACRVKYLVDTPENIWGCLDESMFLEAAGRYMRAQHVQQRLIKLEGCGGGVAEVDQS 
KLLANFPLLEHQWQIVESFKAQISQRSHERLLDPGLGLGAYVDALTAVAVVDELDPEQVL 
ELFLDSRKTWILQKLNACTGEDAGEVVLVFCDVLSVIQVTVGQVGELFLQALTDMPLFYK 
TILSTPPASQLFGGIPNPEEEVELWKSFRDKLESVMLILDKNDVSKSCLTWLRECGGQIV 
GKVSGKHLIEAIVTGAELGSAEKLIRETMDSKDVLRGSLDWLKSVFGSEVELPWNRIREL 
VLGDDLNLWDEIFEKAFVERMKSIIDSKFENLTKAVNVADSVHAYSEITGEKINFQAYLN 
RPSTGGGVWFIEPNSKKVGLISGNKSSPEESDFQSCLTAYFGPEVSQMRDAVDRRCHSVL 
EDLLSFFESEKAGPRLKDLAPYVQNKCYDSVSALLADVDKELEFLCAAVKKENKDSEAIP 
PAIIIEKSLFMGRLLFALLNHSKHVPLILGSPRLWCRETMTAVSDKLSSLLRQPRFSSNT 
PATADSPGKQLHTDLRKQTSLAVAALLGAEEKTSPKFEELNRTMRDLCIKAHTLWIKWLS 
DELSAILLRDLRSDDGLSATTPLRGWEETIVKQEQDESQSELKISLPSLPSLYMISFLCR 
ASEEIHRIGGHVLDRSILQKFASSLLEKITIIYEDFLSAREASEPQISEKGVLQILLDLR 
FAADVLSGGDTSTNVETPKSTINRSAYRRRQDQQKTKLVNRGRIDGVTSQLTQKLDPIDW 
LTYEPYLWENEKQSYLRHAVLFGFFVQLNRMYTDTAQKLSINIESNIMPCSTVPRFKYLP 
ISAPALSSRSTNKVSIPVTSNDASARNSWKAFTNGEQSQTSDLEENSNFGVAFKSFMQVK 
HIETGININ*
>AT5G16300.3 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 23 plant structures EXPRESSED DURING 13 growth stages CONTAINS InterPro DOMAIN/s Vps51/Vps67 (InterProIPR014812) Has 201 Blast hits to 182 proteins in 75 species Archae - 0 Bacteria - 0 Metazoa - 125 Fungi - 34 Plants - 23 Viruses - 0 Other Eukaryotes - 19 (source NCBI BLink) 
MRMSSASAGEYRPSAVSLSSNGGGQRDAESLFRTKPMSEIRIVESATRKNIEDKKEELRQ 
LVGTRYRDLIDSADSIVHMKSLCESISANISSIHGNIRSLSSSSVAETPKLASLNPVRVN 
VYGIACRVKYLVDTPENIWGCLDESMFLEAAGRYMRAQHVQQRLIKLEGCGGGVAEVDQS 
KLLANFPLLEHQWQIVESFKAQISQRSHERLLDPGLGLGAYVDALTAVAVVDELDPEQVL 
ELFLDSRKTWILQKLNACTGEDAGEVVLVFCDVLSVIQVTVGQVGELFLQALTDMPLFYK 
TILSTPPASQLFGGIPNPEEEVELWKSFRDKLESVMLILDKNDVSKSCLTWLRECGGQIV 
GKVSGKHLIEAIVTGAELGSAEKLIRETMDSKDVLRGSLDWLKSVFGSEVELPWNRIREL 
VLGDDLNLWDEIFEKAFVERMKSIIDSKFENLTKAVNVADSVHAYSEITGEKINFQAYLN 
RPSTGGGVWFIEPNSKKVGLISGNKSSPEESDFQSCLTAYFGPEVSQMRDAVDRRCHSVL 
EDLLSFFESEKAGPRLKDLAPYVQNKCYDSVSALLADVDKELEFLCAAVKKENKDSEAIP 
PAIIIEKSLFMGRLLFALLNHSKHVPLILGSPRLWCRETMTAVSDKLSSLLRQPRFSSNT 
PATADSPGKQLHTDLRKQTSLAVAALLGAEEKTSPKFEELNRTMRDLCIKAHTLWIKWLS 
DELSAILLRDLRSDDGLSATTPLRGWEETIVKQEQDESQSELKISLPSLPSLYMISFLCR 
ASEEIHRIGGHVLDRSILQKFASSLLEKITIIYEDFLSAREASEPQISEKGVLQILLDLR 
FAADVLSGGDTSTNVETPKSTINRSAYRRRQDQQKTKLVNRGRIDGVTSQLTQKLDPIDW 
LTYEPYLWENEKQSYLRHAVLFGFFVQLNRMYTDTAQKLSINIESNIMPCSTVPRFKYLP 
ISAPALSSRSTNKVSIPVTSNDASARNSWKAFTNGEQSQTSDLEENSNFGVAFKSFMQVK 
HIETGININ*
>AT1G73430.1 |  sec34-like family protein 
MATKAASSSSLPKSGAISKGYNFASTWEQSAPLTEQQQAAIVSLSHAVAERPFPANLVHE 
HVHRPENGLSVSVEDTHLGDSGAIEAVLVNTNQFYKWFTDLESAMKSETEEKYRHYVSTL 
TERIQTCDNILHQVDETLDLFNELQLQHQGVTTKTKTLHDACDRLLMEKQKLMEFAEALR 
SKLNYFDELENVSSNFYSPNMNVSNSNFLPLLKRLDECISYIEDNPQYAESSVYLLKFRQ 
LQSRALGMIRTYILAVLKTAASQVQAAFRGTGGNKTSVSEGVEASVIYVRFKAAANELKP 
VLEEIESRSARKEYVQILAECHRLYCEQRLSLVKGIVHQRVSDFAKKEALPSLTRSGCAY 
LMQVCHMEHQLFTHFFPASSEEVSSLAPLVDPLSTYLYDILRPKLIHEANIDLLCELVHI 
LKVEVLGDQSARQSEPLAGLRPTLQRILADVNERLTFRARTYIRDEIANYTPSDEDLDYP 
AKLEGSPNTTSETDLRDDENADVFKTWYPPLEKTLSCLSKLYRCLEQAVFTGLAQEAVEV 
CSLSIQKASKLIIKRSTTMDGQLFLIKHLLILREQIAPFDIEFSVTHKELDFSHLLEHLR 
RILRGQASLFDWSRSTSLARTLSPRVLESQIDAKKELEKCLKTTCEEFIMSVTKLVVDPM 
LSFVTKVTAIKVALSSGTQNHKVDSVMAKPLKEQAFATPDKVVELVQKVYAAIQQELLPI 
LAKMKLYLQNPSTRTILFKPIKTNIVEAHTQVESLLKAEYSAEEQANINMISIQDLQTQL 
DNFL*
>AT1G73430.1 |  sec34-like family protein 
MATKAASSSSLPKSGAISKGYNFASTWEQSAPLTEQQQAAIVSLSHAVAERPFPANLVHE 
HVHRPENGLSVSVEDTHLGDSGAIEAVLVNTNQFYKWFTDLESAMKSETEEKYRHYVSTL 
TERIQTCDNILHQVDETLDLFNELQLQHQGVTTKTKTLHDACDRLLMEKQKLMEFAEALR 
SKLNYFDELENVSSNFYSPNMNVSNSNFLPLLKRLDECISYIEDNPQYAESSVYLLKFRQ 
LQSRALGMIRTYILAVLKTAASQVQAAFRGTGGNKTSVSEGVEASVIYVRFKAAANELKP 
VLEEIESRSARKEYVQILAECHRLYCEQRLSLVKGIVHQRVSDFAKKEALPSLTRSGCAY 
LMQVCHMEHQLFTHFFPASSEEVSSLAPLVDPLSTYLYDILRPKLIHEANIDLLCELVHI 
LKVEVLGDQSARQSEPLAGLRPTLQRILADVNERLTFRARTYIRDEIANYTPSDEDLDYP 
AKLEGSPNTTSETDLRDDENADVFKTWYPPLEKTLSCLSKLYRCLEQAVFTGLAQEAVEV 
CSLSIQKASKLIIKRSTTMDGQLFLIKHLLILREQIAPFDIEFSVTHKELDFSHLLEHLR 
RILRGQASLFDWSRSTSLARTLSPRVLESQIDAKKELEKCLKTTCEEFIMSVTKLVVDPM 
LSFVTKVTAIKVALSSGTQNHKVDSVMAKPLKEQAFATPDKVVELVQKVYAAIQQELLPI 
LAKMKLYLQNPSTRTILFKPIKTNIVEAHTQVESLLKAEYSAEEQANINMISIQDLQTQL 
DNFL*
>AT1G73430.2 |  sec34-like family protein 
MATKAASSSSLPKSGAISKGYNFASTWEQSAPLTEQQQAAIVSLSHAVAERPFPANLVHE 
HVHRPENGLSVSVEDTHLGDSGAIEAVLVNTNQFYKWFTDLESAMKSETEEKYRHYVSTL 
TERIQTCDNILHQVDETLDLFNELQLQHQGVTTKTKTLHDACDRLLMEKQKLMEFAEALR 
SKLNYFDELENVSSNFYSPNMNVSNSNFLPLLKRLDECISYIEDNPQYAESSVYLLKFRQ 
LQSRALGMIRTYILAVLKTAASQVQAAFRGTGGNKTSVSEGVEASVIYVRFKAAANELKP 
VLEEIESRSARKEYVQILAECHRLYCEQRLSLVKGIVHQRVSDFAKKEALPSLTRSGCAY 
LMQVCHMEHQLFTHFFPASSEEVSSLAPLVDPLSTYLYDILRPKLIHEANIDLLCELVHI 
LKVEVLGDQSARQSEPLAGLRPTLQRILADVNERLTFRARTYIRDEIANYTPSDEDLDYP 
AKLEGSPNTTSETDLRDDENADVFKTWYPPLEKTLSCLSKLYRCLEQAVFTGLAQEAVEV 
CSLSIQKASKLIIKRSTTMDGQLFLIKHLLILREQIAPFDIEFSVTHKELDFSHLLEHLR 
RILRGQASLFDWSRSTSLARTLSPRVLESQIDAKKELEKCLKTTCEEFIMSVTKLVVDPM 
LSFVTKVTAIKVALSSGTQNHKVDSVMAKPLKEQAFATPDKVVELVQKVYAAIQQELLPI 
LAKMKLYLQNPSTRTILFKPIKTNIVEAHTQVESLLKAEYSAEEQANINMISIQDLQTQL 
DNFL*
>AT1G73430.2 |  sec34-like family protein 
MATKAASSSSLPKSGAISKGYNFASTWEQSAPLTEQQQAAIVSLSHAVAERPFPANLVHE 
HVHRPENGLSVSVEDTHLGDSGAIEAVLVNTNQFYKWFTDLESAMKSETEEKYRHYVSTL 
TERIQTCDNILHQVDETLDLFNELQLQHQGVTTKTKTLHDACDRLLMEKQKLMEFAEALR 
SKLNYFDELENVSSNFYSPNMNVSNSNFLPLLKRLDECISYIEDNPQYAESSVYLLKFRQ 
LQSRALGMIRTYILAVLKTAASQVQAAFRGTGGNKTSVSEGVEASVIYVRFKAAANELKP 
VLEEIESRSARKEYVQILAECHRLYCEQRLSLVKGIVHQRVSDFAKKEALPSLTRSGCAY 
LMQVCHMEHQLFTHFFPASSEEVSSLAPLVDPLSTYLYDILRPKLIHEANIDLLCELVHI 
LKVEVLGDQSARQSEPLAGLRPTLQRILADVNERLTFRARTYIRDEIANYTPSDEDLDYP 
AKLEGSPNTTSETDLRDDENADVFKTWYPPLEKTLSCLSKLYRCLEQAVFTGLAQEAVEV 
CSLSIQKASKLIIKRSTTMDGQLFLIKHLLILREQIAPFDIEFSVTHKELDFSHLLEHLR 
RILRGQASLFDWSRSTSLARTLSPRVLESQIDAKKELEKCLKTTCEEFIMSVTKLVVDPM 
LSFVTKVTAIKVALSSGTQNHKVDSVMAKPLKEQAFATPDKVVELVQKVYAAIQQELLPI 
LAKMKLYLQNPSTRTILFKPIKTNIVEAHTQVESLLKAEYSAEEQANINMISIQDLQTQL 
DNFL*
>AT5G11980.1 |  conserved oligomeric Golgi complex component-related / COG complex component-related 
MAMEVGEMSQPEATASLLSLASATQQPYVSELLSFTLDRLHKEPELLRVDAERIQRQMQE 
VAVGNYRAFITAADALLAIRQEVSSIDKHLESLIGEVPKLTSGCTEFIDSAENILEKRKM 
NQALLANHSTLLDLLEIPQLMDTCVRNGNFDEALDLEAFVSKLATLHPKLPVIQALAAEV 
RQTTQSLLSQLLQKLRSNIQLPECLRIIGYLRRIGVFGEYEMRLQFLRCREAWLTGILED 
LDQKNAYEYLKGMINCHRMHLFDVVNQYRAIFSDDTSGSEENYDGGLLFSWAMHQITSHL 
KTLKIMLPKITEGGSLSNILDQCMYCAMGLGGVGLDFRGLLPPLFEEAVLNLFSKNMSTA 
VENFQLVLDSHRWVPLPSVGFPSSGINEDSKDDVTPPSYLMEHPPLAVFINGVSSALNEL 
RPCAPLSLKNVVAHELIKGLQAVSDSLLRYNTTRMLRLSESNLFLSLCRAFVEVVFPHCA 
TCFGRCYPGGATIVMDAKSAYEGLGRILAASSSQEPSNKSPKVISTDTKDASENGVASQP 
EEKQAENPNAKEEDNSPIPLQTPEITPES*
>AT3G07180.1 |  GPI transamidase component PIG-S-related 
MEEISDRLNPGITPEFDPKTMRSTKPGLKRLFITSSVLFSFLLGVPFLWKSVEIYRSPLP 
FHDIDSLSDQLESTPLRFPCNFHAVFVGFRSTDPDNLRSQIQDGINQLTHQSSQCGSCDF 
SLSVTVQNREDQCSDTLAHSSTTCSYRCGVIKRNGFSVGLDDTVDESLNDVFSGCSENSG 
KMYSVVVVNKENANGGDEVKAVVGKRRHAWIVGNGLEERYGDIVARVSEIFVQVFMNGGR 
EEDSIQGEFMPVGSDGKIVLSFSLLNSNPRDWVYDWDFQRIDEALLAPVTKALAPIANIT 
VESQVLYHTPKSSFSSWDKKLQSYIFRTSDLPFFVNSNEWHLDTSAGASGRSKILQFVVY 
IPSGKECPLLLQLPNVEISKTNGFISPMWGGVIVWNPGNCDKDSESPSRNTISLQDLEQI 
VEIFLGQFRQLFGFKSEAKYTTGLGTYKILTSERGFTEWELDVLSRKHTCFNLHSCSTTL 
GSLSRLVRSLPRMIIKDEIGEQVKYSLKAAKLAQSNASLGGYSSSASSSREARSLAENAF 
FHPSIMSVSYFSYEHCFAVYSPFFLPVVGHVVLAAVREWKRYKQEKAKYLTWLTRKKTT*
>AT3G07180.1 |  GPI transamidase component PIG-S-related 
MEEISDRLNPGITPEFDPKTMRSTKPGLKRLFITSSVLFSFLLGVPFLWKSVEIYRSPLP 
FHDIDSLSDQLESTPLRFPCNFHAVFVGFRSTDPDNLRSQIQDGINQLTHQSSQCGSCDF 
SLSVTVQNREDQCSDTLAHSSTTCSYRCGVIKRNGFSVGLDDTVDESLNDVFSGCSENSG 
KMYSVVVVNKENANGGDEVKAVVGKRRHAWIVGNGLEERYGDIVARVSEIFVQVFMNGGR 
EEDSIQGEFMPVGSDGKIVLSFSLLNSNPRDWVYDWDFQRIDEALLAPVTKALAPIANIT 
VESQVLYHTPKSSFSSWDKKLQSYIFRTSDLPFFVNSNEWHLDTSAGASGRSKILQFVVY 
IPSGKECPLLLQLPNVEISKTNGFISPMWGGVIVWNPGNCDKDSESPSRNTISLQDLEQI 
VEIFLGQFRQLFGFKSEAKYTTGLGTYKILTSERGFTEWELDVLSRKHTCFNLHSCSTTL 
GSLSRLVRSLPRMIIKDEIGEQVKYSLKAAKLAQSNASLGGYSSSASSSREARSLAENAF 
FHPSIMSVSYFSYEHCFAVYSPFFLPVVGHVVLAAVREWKRYKQEKAKYLTWLTRKKTT*
>AT3G07180.2 |  GPI transamidase component PIG-S-related 
MEEISDRLNPGITPEFDPKTMRSTKPGLKRLFITSSVLFSFLLGVPFLWKSVEIYRSPLP 
FHDIDSLSDQLESTPLRFPCNFHAVFVGFRSTDPDNLRSQIQDGINQLTHQSSQCGSCDF 
SLSVTVQNREDQCSDTLAHSSTTCSYRCGVIKRNGFSVGLDDTVDESLNDVFSGCSENSG 
KMYSVVVVNKENANGGDEVKAVVGKRRHAWIVGNGLEERYGDIVARVSEIFVQVFMNGGR 
EEDSIQGEFMPVGSDGKIVLSFSLLNSNPRDWVYDWDFQRIDEALLAPVTKALAPIANIT 
VESQVLYHTPKSSFSSWDKKLQSYIFRTSDLPFFVNSNEWHLDTSAGASGRSKILQFVKY 
SLKAAKLAQSNASLGGYSSSASSSREARSLAENAFFHPSIMSVSYFSYEHCFAVYSPFFL 
PVVGHVVLAAVREWKRYKQEKAKYLTWLTRKKTT*
>AT3G07180.2 |  GPI transamidase component PIG-S-related 
MEEISDRLNPGITPEFDPKTMRSTKPGLKRLFITSSVLFSFLLGVPFLWKSVEIYRSPLP 
FHDIDSLSDQLESTPLRFPCNFHAVFVGFRSTDPDNLRSQIQDGINQLTHQSSQCGSCDF 
SLSVTVQNREDQCSDTLAHSSTTCSYRCGVIKRNGFSVGLDDTVDESLNDVFSGCSENSG 
KMYSVVVVNKENANGGDEVKAVVGKRRHAWIVGNGLEERYGDIVARVSEIFVQVFMNGGR 
EEDSIQGEFMPVGSDGKIVLSFSLLNSNPRDWVYDWDFQRIDEALLAPVTKALAPIANIT 
VESQVLYHTPKSSFSSWDKKLQSYIFRTSDLPFFVNSNEWHLDTSAGASGRSKILQFVKY 
SLKAAKLAQSNASLGGYSSSASSSREARSLAENAFFHPSIMSVSYFSYEHCFAVYSPFFL 
PVVGHVVLAAVREWKRYKQEKAKYLTWLTRKKTT*
>AT4G01400.1 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 24 plant structures EXPRESSED DURING 15 growth stages CONTAINS InterPro DOMAIN/s COG4 transport (InterProIPR013167) Pentatricopeptide repeat (InterProIPR002885) BEST Arabidopsis thaliana protein match is pentatricopeptide (PPR) repeat-containing protein (TAIRAT5G461001) Has 11639 Blast hits to 3936 proteins in 180 species Archae - 0 Bacteria - 2 Metazoa - 206 Fungi - 141 Plants - 11004 Viruses - 0 Other Eukaryotes - 286 (source NCBI BLink) 
MIRRPIYDFAAVFRHLTSPLSTSSRFLFYSSSEHEARKPIVSNPKSPIGSPTRVQKLIAS 
QSDPLLAKEIFDYASQQPNFRHSRSSHLILILKLGRGRYFNLIDDVLAKHRSSGYPLTGE 
IFTYLIKVYAEAKLPEKVLSTFYKMLEFNFTPQPKHLNRILDVLVSHRGYLQKAFELFKS 
SRLHGVMPNTRSYNLLMQAFCLNDDLSIAYQLFGKMLERDVVPDVDSYKILIQGFCRKGQ 
VNGAMELLDDMLNKGFVPDRTLIGGLCDQGMFDEGKKYLEEMISKGFSPHFSVSNCLVKG 
FCSFGKVEEACDVVEVVMKNGETLHSDTWEMVIPLICNEDESEKIKLFLEDAVKEEITGD 
TRIVDVGIENKKMPEIEQDDAAAETVDSSTVKFGTPEALEYVRSLTDVGAMTRLLHECIA 
YQRSLDSDLDTLLSQRTELDRNLVQLQRSAEILDIVKADADHMLGNVRSTCDLADQVSGK 
VRELDLAQSRVNVTLSRIDAIVERGNCIEGVKTALESEDYESAAKFVQRFLQIDLQYKDS 
GSDQSEQLHASKEQLEGIAKKKLLAAIDQRDHPTILRFVRLYSPLGMETEGLQLYVGYLK 
KVIALRGRMEYENVVELMEQGLGQVNFVGCLTNLFKDIVMAIEENDEILRGLCGEDGVAY 
AICELQEECDLRGSLILKKYMDFRKLAILASDINNSPNLNILPGGASEGPDPREVELYVE 
EILSLMQLGEDYTEFMVSKIKSLTSVDPELLPTATKAFRNKSFSKAIQDVTRYYVILEGF 
FMVENVRKAIRIDEHVPDSLTTSMVDDVFYVLQSCLRRAISTSNISSVIAVLSYAGSLLG 
NDYHEALQQKIREPNLGARLFLGGIGVENTGTEIATALNNMDVSCEYILKLKHEIEEQCT 
EVFPAPADRERIKSCLSELGELSSTFKQLLNSGMEQLVATVTPRIRPVLDTVATISYELT 
ETEYAENEVNDPWVQRLLHSVETNAAWLQPLMTSNNYDSFLHLIIDFIVKRLEVIMMQKR 
FSQLGGLQLDRDTRALVSHFSGMTQRTVRDKFARLTQMATILNLEKVSEILDFWGENSGP 
MTWRLTPAEVRRVLGLRVEFKPESIAALKL*
>AT4G01400.1 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 23 plant structures EXPRESSED DURING 15 growth stages CONTAINS InterPro DOMAIN/s COG4 transport (InterProIPR013167) 
MIRRPIYDFAAVFRHLTSPLSTSSRFLFYSSSEHEARKPIVSNPKSPIGSPTRVQKLIAS 
QSDPLLAKEIFDYASQQPNFRHSRSSHLILILKLGRGRYFNLIDDVLAKHRSSGYPLTGE 
IFTYLIKVYAEAKLPEKVLSTFYKMLEFNFTPQPKHLNRILDVLVSHRGYLQKAFELFKS 
SRLHGVMPNTRSYNLLMQAFCLNDDLSIAYQLFGKMLERDVVPDVDSYKILIQGFCRKGQ 
VNGAMELLDDMLNKGFVPDRTLIGGLCDQGMFDEGKKYLEEMISKGFSPHFSVSNCLVKG 
FCSFGKVEEACDVVEVVMKNGETLHSDTWEMVIPLICNEDESEKIKLFLEDAVKEEITGD 
TRIVDVGIENKKMPEIEQDDAAAETVDSSTVKFGTPEALEYVRSLTDVGAMTRLLHECIA 
YQRSLDSDLDTLLSQRTELDRNLVQLQRSAEILDIVKADADHMLGNVRSTCDLADQVSGK 
VRELDLAQSRVNVTLSRIDAIVERGNCIEGVKTALESEDYESAAKFVQRFLQIDLQYKDS 
GSDQSEQLHASKEQLEGIAKKKLLAAIDQRDHPTILRFVRLYSPLGMETEGLQLYVGYLK 
KVIALRGRMEYENVVELMEQGLGQVNFVGCLTNLFKDIVMAIEENDEILRGLCGEDGVAY 
AICELQEECDLRGSLILKKYMDFRKLAILASDINNSPNLNILPGGASEGPDPREVELYVE 
EILSLMQLGEDYTEFMVSKIKSLTSVDPELLPTATKAFRNKSFSKAIQDVTRYYVILEGF 
FMVENVRKAIRIDEHVPDSLTTSMVDDVFYVLQSCLRRAISTSNISSVIAVLSYAGSLLG 
NDYHEALQQKIREPNLGARLFLGGIGVENTGTEIATALNNMDVSCEYILKLKHEIEEQCT 
EVFPAPADRERIKSCLSELGELSSTFKQLLNSGMEQLVATVTPRIRPVLDTVATISYELT 
ETEYAENEVNDPWVQRLLHSVETNAAWLQPLMTSNNYDSFLHLIIDFIVKRLEVIMMQKR 
FSQLGGLQLDRDTRALVSHFSGMTQRTVRDKFARLTQMATILNLEKVSEILDFWGENSGP 
MTWRLTPAEVRRVLGLRVEFKPESIAALKL*
>AT4G01400.1 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 23 plant structures EXPRESSED DURING 15 growth stages CONTAINS InterPro DOMAIN/s Pentatricopeptide repeat (InterProIPR002885) BEST Arabidopsis thaliana protein match is pentatricopeptide (PPR) repeat-containing protein (TAIRAT5G461001) 
MIRRPIYDFAAVFRHLTSPLSTSSRFLFYSSSEHEARKPIVSNPKSPIGSPTRVQKLIAS 
QSDPLLAKEIFDYASQQPNFRHSRSSHLILILKLGRGRYFNLIDDVLAKHRSSGYPLTGE 
IFTYLIKVYAEAKLPEKVLSTFYKMLEFNFTPQPKHLNRILDVLVSHRGYLQKAFELFKS 
SRLHGVMPNTRSYNLLMQAFCLNDDLSIAYQLFGKMLERDVVPDVDSYKILIQGFCRKGQ 
VNGAMELLDDMLNKGFVPDRTLIGGLCDQGMFDEGKKYLEEMISKGFSPHFSVSNCLVKG 
FCSFGKVEEACDVVEVVMKNGETLHSDTWEMVIPLICNEDESEKIKLFLEDAVKEEITGD 
TRIVDVGIENKKMPEIEQDDAAAETVDSSTVKFGTPEALEYVRSLTDVGAMTRLLHECIA 
YQRSLDSDLDTLLSQRTELDRNLVQLQRSAEILDIVKADADHMLGNVRSTCDLADQVSGK 
VRELDLAQSRVNVTLSRIDAIVERGNCIEGVKTALESEDYESAAKFVQRFLQIDLQYKDS 
GSDQSEQLHASKEQLEGIAKKKLLAAIDQRDHPTILRFVRLYSPLGMETEGLQLYVGYLK 
KVIALRGRMEYENVVELMEQGLGQVNFVGCLTNLFKDIVMAIEENDEILRGLCGEDGVAY 
AICELQEECDLRGSLILKKYMDFRKLAILASDINNSPNLNILPGGASEGPDPREVELYVE 
EILSLMQLGEDYTEFMVSKIKSLTSVDPELLPTATKAFRNKSFSKAIQDVTRYYVILEGF 
FMVENVRKAIRIDEHVPDSLTTSMVDDVFYVLQSCLRRAISTSNISSVIAVLSYAGSLLG 
NDYHEALQQKIREPNLGARLFLGGIGVENTGTEIATALNNMDVSCEYILKLKHEIEEQCT 
EVFPAPADRERIKSCLSELGELSSTFKQLLNSGMEQLVATVTPRIRPVLDTVATISYELT 
ETEYAENEVNDPWVQRLLHSVETNAAWLQPLMTSNNYDSFLHLIIDFIVKRLEVIMMQKR 
FSQLGGLQLDRDTRALVSHFSGMTQRTVRDKFARLTQMATILNLEKVSEILDFWGENSGP 
MTWRLTPAEVRRVLGLRVEFKPESIAALKL*
>AT4G01400.2 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 24 plant structures EXPRESSED DURING 15 growth stages CONTAINS InterPro DOMAIN/s COG4 transport (InterProIPR013167) Pentatricopeptide repeat (InterProIPR002885) BEST Arabidopsis thaliana protein match is pentatricopeptide (PPR) repeat-containing protein (TAIRAT5G461001) Has 11639 Blast hits to 3936 proteins in 180 species Archae - 0 Bacteria - 2 Metazoa - 206 Fungi - 141 Plants - 11004 Viruses - 0 Other Eukaryotes - 286 (source NCBI BLink) 
MPEIEQDDAAAETVDSSTVKFGTPEALEYVRSLTDVGAMTRLLHECIAYQRSLDSDLDTL 
LSQRTELDRNLVQLQRSAEILDIVKADADHMLGNVRSTCDLADQVSGKVRELDLAQSRVN 
VTLSRIDAIVERGNCIEGVKTALESEDYESAAKFVQRFLQIDLQYKDSGSDQSEQLHASK 
EQLEGIAKKKLLAAIDQRDHPTILRFVRLYSPLGMETEGLQLYVGYLKKVIALRGRMEYE 
NVVELMEQGLGQVNFVGCLTNLFKDIVMAIEENDEILRGLCGEDGVAYAICELQEECDLR 
GSLILKKYMDFRKLAILASDINNSPNLNILPGGASEGPDPREVELYVEEILSLMQLGEDY 
TEFMVSKIKSLTSVDPELLPTATKAFRNKSFSKAIQDVTRYYVILEGFFMVENVRKAIRI 
DEHVPDSLTTSMVDDVFYVLQSCLRRAISTSNISSVIAVLSYAGSLLGNDYHEALQQKIR 
EPNLGARLFLGGIGVENTGTEIATALNNMDVSCEYILKLKHEIEEQCTEVFPAPADRERI 
KSCLSELGELSSTFKQLLNSGMEQLVATVTPRIRPVLDTVATISYELTETEYAENEVNDP 
WVQRLLHSVETNAAWLQPLMTSNNYDSFLHLIIDFIVKRLEVIMMQKRFSQLGGLQLDRD 
TRALVSHFSGMTQRTVRDKFARLTQMATILNLEKVSEILDFWGENSGPMTWRLTPAEVRR 
VLGLRVEFKPESIAALKL*
>AT4G01400.2 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 23 plant structures EXPRESSED DURING 15 growth stages CONTAINS InterPro DOMAIN/s COG4 transport (InterProIPR013167) 
MPEIEQDDAAAETVDSSTVKFGTPEALEYVRSLTDVGAMTRLLHECIAYQRSLDSDLDTL 
LSQRTELDRNLVQLQRSAEILDIVKADADHMLGNVRSTCDLADQVSGKVRELDLAQSRVN 
VTLSRIDAIVERGNCIEGVKTALESEDYESAAKFVQRFLQIDLQYKDSGSDQSEQLHASK 
EQLEGIAKKKLLAAIDQRDHPTILRFVRLYSPLGMETEGLQLYVGYLKKVIALRGRMEYE 
NVVELMEQGLGQVNFVGCLTNLFKDIVMAIEENDEILRGLCGEDGVAYAICELQEECDLR 
GSLILKKYMDFRKLAILASDINNSPNLNILPGGASEGPDPREVELYVEEILSLMQLGEDY 
TEFMVSKIKSLTSVDPELLPTATKAFRNKSFSKAIQDVTRYYVILEGFFMVENVRKAIRI 
DEHVPDSLTTSMVDDVFYVLQSCLRRAISTSNISSVIAVLSYAGSLLGNDYHEALQQKIR 
EPNLGARLFLGGIGVENTGTEIATALNNMDVSCEYILKLKHEIEEQCTEVFPAPADRERI 
KSCLSELGELSSTFKQLLNSGMEQLVATVTPRIRPVLDTVATISYELTETEYAENEVNDP 
WVQRLLHSVETNAAWLQPLMTSNNYDSFLHLIIDFIVKRLEVIMMQKRFSQLGGLQLDRD 
TRALVSHFSGMTQRTVRDKFARLTQMATILNLEKVSEILDFWGENSGPMTWRLTPAEVRR 
VLGLRVEFKPESIAALKL*
>AT4G01400.2 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 23 plant structures EXPRESSED DURING 15 growth stages CONTAINS InterPro DOMAIN/s Pentatricopeptide repeat (InterProIPR002885) BEST Arabidopsis thaliana protein match is pentatricopeptide (PPR) repeat-containing protein (TAIRAT5G461001) 
MPEIEQDDAAAETVDSSTVKFGTPEALEYVRSLTDVGAMTRLLHECIAYQRSLDSDLDTL 
LSQRTELDRNLVQLQRSAEILDIVKADADHMLGNVRSTCDLADQVSGKVRELDLAQSRVN 
VTLSRIDAIVERGNCIEGVKTALESEDYESAAKFVQRFLQIDLQYKDSGSDQSEQLHASK 
EQLEGIAKKKLLAAIDQRDHPTILRFVRLYSPLGMETEGLQLYVGYLKKVIALRGRMEYE 
NVVELMEQGLGQVNFVGCLTNLFKDIVMAIEENDEILRGLCGEDGVAYAICELQEECDLR 
GSLILKKYMDFRKLAILASDINNSPNLNILPGGASEGPDPREVELYVEEILSLMQLGEDY 
TEFMVSKIKSLTSVDPELLPTATKAFRNKSFSKAIQDVTRYYVILEGFFMVENVRKAIRI 
DEHVPDSLTTSMVDDVFYVLQSCLRRAISTSNISSVIAVLSYAGSLLGNDYHEALQQKIR 
EPNLGARLFLGGIGVENTGTEIATALNNMDVSCEYILKLKHEIEEQCTEVFPAPADRERI 
KSCLSELGELSSTFKQLLNSGMEQLVATVTPRIRPVLDTVATISYELTETEYAENEVNDP 
WVQRLLHSVETNAAWLQPLMTSNNYDSFLHLIIDFIVKRLEVIMMQKRFSQLGGLQLDRD 
TRALVSHFSGMTQRTVRDKFARLTQMATILNLEKVSEILDFWGENSGPMTWRLTPAEVRR 
VLGLRVEFKPESIAALKL*
>AT4G01400.3 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 24 plant structures EXPRESSED DURING 15 growth stages CONTAINS InterPro DOMAIN/s COG4 transport (InterProIPR013167) Pentatricopeptide repeat (InterProIPR002885) BEST Arabidopsis thaliana protein match is pentatricopeptide (PPR) repeat-containing protein (TAIRAT5G461001) Has 11639 Blast hits to 3936 proteins in 180 species Archae - 0 Bacteria - 2 Metazoa - 206 Fungi - 141 Plants - 11004 Viruses - 0 Other Eukaryotes - 286 (source NCBI BLink) 
MIRRPIYDFAAVFRHLTSPLSTSSRFLFYSSSEHEARKPIVSNPKSPIGSPTRVQKLIAS 
QSDPLLAKEIFDYASQQPNFRHSRSSHLILILKLGRGRYFNLIDDVLAKHRSSGYPLTGE 
IFTYLIKVYAEAKLPEKVLSTFYKMLEFNFTPQPKHLNRILDVLVSHRGYLQKAFELFKS 
SRLHGVMPNTRSYNLLMQAFCLNDDLSIAYQLFGKMLERDVVPDVDSYKILIQGFCRKGQ 
VNGAMELLDDMLNKGFVPDRLSYTTLLNSLCRKTQLREAYKLLCRMKLKGCNPDLVHYNT 
MILGFCREDRAMDARKVLDDMLSNGCSPNSVSYRTLIGGLCDQGMFDEGKKYLEEMISKG 
FSPHFSVSNCLVKGFCSFGKVEEACDVVEVVMKNGETLHSDTWEMVIPLICNEDESEKIK 
LFLEDAVKEEITGDTRIVDVGIGLGSYLSSKLQMKRKNARERRRHL*
>AT4G01400.3 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 23 plant structures EXPRESSED DURING 15 growth stages CONTAINS InterPro DOMAIN/s COG4 transport (InterProIPR013167) 
MIRRPIYDFAAVFRHLTSPLSTSSRFLFYSSSEHEARKPIVSNPKSPIGSPTRVQKLIAS 
QSDPLLAKEIFDYASQQPNFRHSRSSHLILILKLGRGRYFNLIDDVLAKHRSSGYPLTGE 
IFTYLIKVYAEAKLPEKVLSTFYKMLEFNFTPQPKHLNRILDVLVSHRGYLQKAFELFKS 
SRLHGVMPNTRSYNLLMQAFCLNDDLSIAYQLFGKMLERDVVPDVDSYKILIQGFCRKGQ 
VNGAMELLDDMLNKGFVPDRLSYTTLLNSLCRKTQLREAYKLLCRMKLKGCNPDLVHYNT 
MILGFCREDRAMDARKVLDDMLSNGCSPNSVSYRTLIGGLCDQGMFDEGKKYLEEMISKG 
FSPHFSVSNCLVKGFCSFGKVEEACDVVEVVMKNGETLHSDTWEMVIPLICNEDESEKIK 
LFLEDAVKEEITGDTRIVDVGIGLGSYLSSKLQMKRKNARERRRHL*
>AT4G01400.3 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 23 plant structures EXPRESSED DURING 15 growth stages CONTAINS InterPro DOMAIN/s Pentatricopeptide repeat (InterProIPR002885) BEST Arabidopsis thaliana protein match is pentatricopeptide (PPR) repeat-containing protein (TAIRAT5G461001) 
MIRRPIYDFAAVFRHLTSPLSTSSRFLFYSSSEHEARKPIVSNPKSPIGSPTRVQKLIAS 
QSDPLLAKEIFDYASQQPNFRHSRSSHLILILKLGRGRYFNLIDDVLAKHRSSGYPLTGE 
IFTYLIKVYAEAKLPEKVLSTFYKMLEFNFTPQPKHLNRILDVLVSHRGYLQKAFELFKS 
SRLHGVMPNTRSYNLLMQAFCLNDDLSIAYQLFGKMLERDVVPDVDSYKILIQGFCRKGQ 
VNGAMELLDDMLNKGFVPDRLSYTTLLNSLCRKTQLREAYKLLCRMKLKGCNPDLVHYNT 
MILGFCREDRAMDARKVLDDMLSNGCSPNSVSYRTLIGGLCDQGMFDEGKKYLEEMISKG 
FSPHFSVSNCLVKGFCSFGKVEEACDVVEVVMKNGETLHSDTWEMVIPLICNEDESEKIK 
LFLEDAVKEEITGDTRIVDVGIGLGSYLSSKLQMKRKNARERRRHL*