>AT1G47330.1 |  FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: plasma membrane; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF21 (InterPro:IPR002550), Cystathionine beta-synthase core (InterPro:IPR000644); BEST Arabidopsis thaliana protein match is: CBS domain-containing protein (TAIR:AT2G14520.1); Has 6970 Blast hits to 6717 proteins in 1361 species: Archae - 64; Bacteria - 4382; Metazoa - 390; Fungi - 183; Plants - 125; Viruses - 0; Other Eukaryotes - 1826 (source: NCBI BLink)
AAAACATCAAAGAGTCACTCTAAACTCATCTCTCTCGCCGAATTCTCCCCAACAATTTCC 
GCCGGAAACAAATTCTCAGATTCCGGTAACTCTAAAACTATGTCATCGGATATTCCGTGC 
TGCGGAACCACGTTCTCGCTCTACGTCGTGATAATCATAGCTCTTGTAGCGTTCGCTGGC 
TTGATGGCTGGTTTAACTCTTGGTCTCATGTCTCTCGGTTTAGTCGACCTCGAAGTTCTC 
ATCAAATCTGGCCGTCCTCAAGATCGAATCAACGCCGGTGAGTGCTTCAACTTGATGACG 
ATTCTGTTGAATTTTGAAGATTCTGTTCTTTAGCTGATTTTATGATTTTTAATTGTACAT 
AGCAATTGAAGAACATGTGTATTTGATTGTTAGGTAAGATATTTCCAGTAGTGAAGAATC 
AGCATCTGTTGCTATGTACACTTTTGATCGGAAACTCTATGGCTATGGAGGTGATACTTG 
ATTTTTTGGTTTCTATGTATTTGAAGCAGAGACTTGTATATATAATGGTGTTGATGATCT 
CTTTTTATTGTAGGCTTTACCAATATTTTTGGATAAAATTGTGCCTCCTTGGTTAGCTAT 
TCTTCTTTCAGTGACTCTTATACTGGTGTTTGGAGAGGTTTTATATCTGTTACCTATAGT 
TTTGATATACTTTTGTTTTCCATTTTTACAGTTTGTGATTATAGTTTCTCTGCTGTTGAG 
TATCTGCAGATAATGCCACAAGCAGTTTGTACTCGATACGGGCTTAAGGTTGGAGCGATA 
ATGGCTCCTTTTGTTCGTGTTCTTCTTGTACTGTTCTTCCCAATTTCATATCCAATCAGT 
AAGGTAGAGAGTCAGATTTTCTGTTTTGAAAAATCAGCTTGTGTAATCCCTTTTCTGGCT 
TGATTGAAATGCTCGGCTAAATCTTGATGTTTCTCTGTTTTGGTAAACATTGTTGCTTCA 
GGTTCTTGATTGGATGTTGGGTAAGGGACATGGAGTTCTTTTACGGAGAGCAGAGCTAAA 
GACATTTGTGAATTTCCATGGAAATGAGGTATGCAGCTAGTGATGTTTGCTATGACCTTG 
GAATTGAAAACTGAAGCTGTGTTGTCCTCCATATTAAATAGTTCCAGATGTGTCGCTGAA 
CCGTTGTGTCTCTTAGAGCCAACTTTTCTGACAGACTTTTGTATATAATTCTTGTTTACT 
AGGCTGGGAAAGGTGGAGATCTAACAACCGATGAGACTTCAATCATCACAGGTGCACTTG 
AATTAACTGAAAAGACGGCAAAAGATGCAATGACTCCCATATCGAATGCATTTTCCCTTG 
AGCTTGATACACCTCTTAATTTGTGGGTTCTTTTTTTTTTGTTTTGCTTCCATCATCTTC 
TATTTACTCTTATTCAGTTTACTTAAATATTTTGACCTTATAATCTGCTTTCATCCACAA 
TACAGGGAAACTTTAAATACAATTATGTCAGTTGGTCATAGCAGAGTTCCAGTTTATTTC 
AGAAATCCCACACATATAATTGGGCTCATTCTGGTACCTATCTTTCATATGGACTTTGTT 
TTTGTATTGTCTAGCTTCACAGCATGTAATGTCCCCTTTTGCTGATGCAGGTTAAAAATC 
TATTGGCAGTTGATGCCAGAAAGGAAGTTCCTCTGAGAAAAATGTCCATGAGAAAAATTC 
CACGGTAAATGACTCTCTAAAGCTAACTTTCTTTCTTTACGACCAGGATTTAATGCATAC 
ACAGAGTTCATGTTTATACTATTTATTTCCTCCAGTGTCTCTGAGACCATGCCACTATAT 
GATATCTTAAACGAGTTCCAGAAGGGTCACAGTCACATAGCAGTTGTTTACAAGGATCTT 
GACGAGCAGGAGCAATCTCCCGAAACAAGTGAAAATGGTATTGAGCGCAGAAAAAACAAG 
AAAACGAAAGACGAACTTTTTAAGGACAGTTGTAGGAAACCAAAAGCTCAGTTTGAGGTA 
TCGGAGAAGGAAGGTGTGTCTCGGCATAACTTAGCGACGTTCCATAAATAAACATGAGAA 
CACATGTTAGCCTAATTTTCTTTGGAATTGTAGTATTCAAAATTGAAACCGGAGATGCAA 
AATCCGGCAAGAGTGAGAACGGTGAGGAGCAGCAAGGGTCAGGGAAAACAAGTCTATTGG 
CTGCTCCAGCTAAGAAGCGACATAGAGGCTGTTCGTTCTGCATTTTGGACATTGAGAATA 
CTCCCATACCTGATTTTCCTACCAATGAGGAGGTTGTAGGAGTTATCACCATGGAGGATG 
TTATCGAAGAGCTTCTTCAGGTAACTCAAACCGAAGCATGAGCATCTGCTTTAGTTATTA 
GCATTAGAGAAGTAAAGAATCATAGGCATTCTTTTTTTGCTCTGATTTTTCAGGAAGAGA 
TCCTTGACGAAACCGATGAGTATGTGAACATCCACAACAGGATAAGAGTCAACATGCATG 
CTTCTCCAGAGAATCTACCAAGCGTGATAACCTCAATCACGCAATCATCATCGGGTTCCA 
CTAGTCCCAACCAAACATCTCATATGGCCACACCAGATTCAAGTCCTACAACAAAACCAT 
CAAATTCAAGTCCTACAAGAAAACCTTCAGTATCGAGTCCTACGAGAGAGCCTTCAGATT 
CTTCACACTCCATGGCTCCAAAACATGAAGAGTCCACCCAAACTTTATGATGATCATATA 
TGAGTGAAGATAGATTAGCTATGTATCCTTAAAAGATGACTTCTTTAACAGAATCGTTTG 
TGTGTTTGAGATTAAGACTGGTTAATAGTTTCTTTGTAATGAAAAGGATGGATCTTTTAC 
CAAAAG
>AT5G41170.1 |  pentatricopeptide (PPR) repeat-containing protein 
ATTTCATCTTCTTCGTATCTTCCAAGTAAACGAAAATTAGGCTTCTTAAATACAACGACC 
TTCGATCTCATCCAAACTTTTTTGTAGCTTTTGTAGTAAATCGATCAACGGCATAATCAA 
TTTCGTCTGGGATTTGCGTTAATGGCGATGAGATTTTTTCAACTTCACCGAAATCGTCTT 
GTGAAAGGTAATTCTGGAAAAGCTCTCTCCTTTAGCCGCCTTTTAGATCTTAGTTTCTGG 
GTTCGAGCTTTTTGTAATTACCGAGAGATTTTGAGAAATGGTCTTCACTCTCTTCAGTTT 
AATGAAGCTCTCGATTTGTTCACTCACATGGTTGAGTCTCGTCCTCTTCCTTCAATTATC 
GATTTCACTAAGTTATTGAATGTTATTGCCAAAATGAAGAAGTTTGATGTTGTGATCAAT 
CTCTGCGACCATCTGCAGATAATGGGAGTTTCACATGATCTCTATACTTGCAATCTTTTG 
ATGAATTGTTTCTGCCAATCTTCTCAGCCTTATCTTGCCTCATCTTTTCTTGGGAAGATG 
ATGAAACTTGGTTTTGAGCCTGATATTGTCACGTTTACTTCTCTGATCAATGGGTTCTGT 
CTCGGGAATAGAATGGAGGAGGCTATGTCTATGGTGAATCAGATGGTGGAGATGGGGATT 
AAACCTGATGTTGTAATGTATACAACAATCATTGATAGTCTTTGCAAAAACGGGCATGTG 
AATTACGCGTTGAGCCTTTTCGATCAAATGGAAAACTACGGGATTAGACCGGATGTTGTT 
ATGTACACCTCTCTCGTGAACGGTCTTTGTAACTCTGGTAGATGGAGAGATGCTGATTCA 
TTGCTGAGGGGTATGACGAAGAGGAAAATCAAACCTGATGTAATCACTTTCAATGCATTG 
ATCGATGCGTTTGTGAAAGAAGGAAAGTTTTTGGATGCTGAAGAATTGTACAATGAGATG 
ATTCGTATGTCTATAGCTCCTAATATTTTCACCTATACTTCATTGATCAATGGGTTTTGC 
ATGGAAGGTTGTGTAGATGAGGCCAGACAAATGTTTTATTTGATGGAAACCAAGGGTTGT 
TTTCCAGATGTAGTGGCTTATACTTCTCTCATAAACGGGTTTTGCAAGTGTAAGAAGGTA 
GATGATGCGATGAAAATCTTCTACGAGATGTCCCAAAAAGGATTGACTGGGAACACTATC 
ACTTACACTACTCTTATCCAAGGTTTTGGTCAAGTGGGCAAACCTAATGTCGCCCAAGAA 
GTTTTTAGTCATATGGTTTCTCGTGGCGTGCCTCCCAATATTAGGACCTATAATGTTTTG 
TTACATTGTCTATGTTATAACGGGAAGGTAAAGAAAGCGTTGATGATATTTGAGGATATG 
CAAAAGAGAGAAATGGATGGTGTCGCTCCCAATATTTGGACTTACAACGTCTTGTTACAT 
GGTCTATGTTATAATGGGAAGCTAGAGAAAGCCTTGATGGTATTCGAGGATATGCGAAAG 
AGAGAGATGGATATTGGAATTATTACATATACCATCATCATTCAAGGGATGTGCAAGGCT 
GGTAAAGTGAAAAATGCTGTTAATTTATTTTGTAGCCTTCCTTCAAAAGGAGTGAAGCCT 
AATGTTGTAACATATACTACAATGATATCAGGATTGTTTAGGGAAGGGTTAAAGCATGAA 
GCTCATGTGTTGTTTAGGAAAATGAAAGAAGATGGGGTTTCATAATTAGGTGACACCATA 
GCATAACTGATCATAGTGTTGAGGATTCTCTGATGATGCTTCTTCAACTTACAGAGAATG 
ATGGTGTAGTAGCAGTTTACACGGTCTTTGCGAAGCGAGGGTTTCGTGAAGCTGTGAGGT 
TGTTTTGTTAGTGAAAATGAAGAATGACTAATTGAATAGTGTTTATTCCAGACAGTGTAT 
TGAACAAAAAGACTTCGTAAAGAATGTGCAGAGAAGCAAATGAGTTGGTAGACCACAATT 
TGTAAGTTACAAATTCATAAGTTATTGAACAGTATAGGTTCCTTTAGACAGTGCACAAAG 
ATTGAACTGTATTACGTAACGACAAATTTAAGTATATTTTGGAGACATGCAACCGAGAA