>AT2G03880.1 |  pentatricopeptide (PPR) repeat-containing protein 
TTTCCATAGTTTTAGTGGAGACGATGAAGTCAGTAATGTCAAAGATCAAACTTTTTAGAC 
CAGTCGTCACGTTACGTTGTTCTTACAGCTCCACAGATCAAACTCTTCTTCTTAGCGAAT 
TTACCAGACTCTGTTACCAGAGAGACCTTCCCAGAGCTATGAAAGCAATGGATTCATTGC 
AAAGTCATGGTCTTTGGGCAGACTCTGCAACTTATTCTGAGCTTATCAAGTGTTGTATAT 
CTAATAGAGCTGTTCATGAAGGGAATCTCATTTGTCGTCATTTATACTTCAATGGTCATC 
GACCAATGATGTTTCTGGTTAATGTTCTGATCAATATGTATGTAAAATTCAATCTTTTGA 
ATGATGCTCACCAACTGTTCGATCAAATGCCTCAAAGGAATGTTATTTCTTGGACTACGA 
TGATTTCTGCTTATTCTAAGTGTAAGATTCATCAAAAGGCTTTGGAGCTTTTGGTTTTGA 
TGTTGAGAGATAATGTGAGGCCTAATGTGTATACTTACTCTTCTGTTTTGAGATCTTGCA 
ATGGTATGTCTGATGTTAGAATGCTTCATTGTGGGATTATTAAGGAGGGGTTGGAGTCTG 
ATGTTTTTGTTAGGAGTGCTTTGATTGATGTGTTTGCTAAACTGGGTGAGCCGGAGGATG 
CTCTTTCGGTTTTCGATGAAATGGTTACAGGTGATGCGATTGTTTGGAACTCGATCATTG 
GTGGGTTTGCTCAGAATAGTAGAAGCGATGTAGCTTTGGAGCTTTTTAAGAGGATGAAGA 
GAGCTGGTTTCATTGCGGAGCAGGCTACGTTGACTAGCGTATTGAGGGCTTGTACTGGTT 
TGGCTCTGTTGGAGCTTGGGATGCAAGCTCATGTTCATATCGTGAAATATGACCAAGACT 
TGATACTCAACAACGCACTCGTGGACATGTATTGCAAGTGTGGGAGTTTGGAGGATGCTC 
TCCGTGTGTTCAACCAGATGAAGGAGAGAGATGTGATTACATGGAGCACTATGATTTCAG 
GGTTGGCACAGAATGGTTATAGCCAGGAGGCACTGAAGTTGTTTGAGCGTATGAAATCTT 
CTGGGACGAAGCCAAACTACATTACGATTGTCGGGGTTCTCTTTGCTTGCAGCCACGCAG 
GGCTCCTGGAAGATGGTTGGTACTACTTCAGATCAATGAAGAAACTTTATGGAATCGATC 
CAGTGAGGGAACATTATGGTTGCATGATTGATCTGCTTGGAAAAGCTGGAAAGCTCGACG 
ATGCAGTGAAGCTGTTAAATGAAATGGAGTGTGAGCCAGATGCTGTGACATGGAGAACTT 
TGCTTGGTGCTTGTAGGGTTCAAAGAAACATGGTGCTAGCGGAGTATGCAGCTAAAAAGG 
TCATAGCACTCGATCCCGAGGATGCAGGAACTTACACACTATTGTCTAACATATACGCAA 
ACTCTCAGAAATGGGACAGTGTTGAAGAAATTCGGACACGAATGAGAGACAGAGGAATCA 
AGAAAGAACCTGGATGCAGCTGGATCGAAGTAAACAAACAGATTCATGCTTTTATAATTG 
GAGACAATTCTCACCCACAGATAGTTGAAGTCAGTAAGAAGCTGAACCAATTGATCCATA 
GACTCACTGGTATTGGCTATGTTCCTGAAACAAACTTTGTGCTACAAGATCTTGAAGGTG 
AACAAATGGAAGATTCACTTAGACATCACAGTGAGAAACTGGCTTTAGCCTTTGGCTTGA 
TGACATTACCTATTGAGAAAGTCATTAGGATAAGAAAGAATCTGAGAATATGTGGGGATT 
GTCATGTGTTTTGCAAGCTCGCATCGAAGCTTGAAATCCGCAGTATCGTTATACGAGATC 
CGATCCGTTACCATCATTTCCAGGATGGCAAATGTTCTTGTGGTGACTACTGGTAACAAT 
CAGATGACTTTAGATTACTATGAAACAGAGTATAAGATGGTTTAGAAGATTGTTTCTCTG 
CAAGTTTTCTTCATCAAATTTTTTGATTAGTGAATCATTTTTGCTTATGAAGAAGCTCAT 
TGATTTGCCAAATTTTCCTGATAGGATTTTGTAACATTTGCAAAATTGAATATGTATGAT 
AAAATTGGTCATTGGAGTATGGAGACTCACTAATTTAGTAATTTGTGATAAAAAAGAGTT 
TCTTCAG
>AT3G13770.1 |  pentatricopeptide (PPR) repeat-containing protein 
ATGTTTAACTTGATGAGACTAATTCATCGATCTTTCTCGTCGTCTCCAACAAACTATGTT 
CTTCAAACCATTCTTCCAATCTCTCAATTATGCTCCAACGGTCGTCTTCAAGAAGCTTTA 
CTGGAGATGGCGATGTTAGGTCCCGAGATGGGGTTCCATGGTTACGATGCTTTGTTGAAT 
GCGTGTTTGGACAAAAGGGCTCTTAGAGATGGTCAAAGAGTTCATGCCCACATGATCAAA 
ACCCGTTATTTACCCGCCACTTACCTCCGAACTCGTCTGCTTATCTTCTATGGTAAATGT 
GATTGTTTAGAGGATGCACGGAAAGTGCTCGACGAAATGCCTGAGAAGAATGTCGTTTCC 
TGGACTGCTATGATCTCGCGTTATTCTCAAACTGGGCATTCTTCTGAGGCTCTCACTGTT 
TTTGCAGAGATGATGAGATCAGATGGGAAACCGAATGAGTTCACTTTCGCTACTGTTCTT 
ACTTCTTGTATACGTGCTTCTGGGTTAGGTCTTGGTAAGCAAATCCATGGACTCATAGTT 
AAGTGGAATTACGATTCTCATATTTTTGTTGGAAGCTCTCTTCTTGACATGTATGCTAAA 
GCTGGTCAAATTAAAGAAGCTCGTGAGATTTTCGAGTGTTTGCCTGAGAGAGATGTTGTC 
TCATGTACTGCTATTATTGCTGGTTATGCCCAACTAGGTCTTGATGAAGAGGCGTTAGAG 
ATGTTTCATAGACTGCATAGTGAAGGAATGAGTCCCAATTATGTTACCTATGCTAGTCTT 
TTGACAGCATTATCTGGACTTGCTTTGCTAGATCATGGCAAGCAAGCTCATTGCCATGTC 
TTGAGACGTGAACTTCCGTTTTACGCGGTCCTTCAGAATTCTCTTATTGATATGTACTCA 
AAATGTGGGAATCTTTCTTACGCTAGAAGGCTTTTTGATAACATGCCTGAGAGGACAGCG 
ATTTCATGGAATGCGATGCTTGTGGGATATAGTAAACATGGGTTAGGAAGAGAAGTTCTT 
GAACTTTTTAGATTGATGAGAGATGAGAAGAGAGTAAAGCCTGATGCAGTTACCCTTTTG 
GCTGTTTTGTCTGGTTGCAGCCATGGGAGAATGGAAGACACAGGGTTGAATATATTTGAT 
GGCATGGTAGCGGGAGAATACGGGACTAAGCCTGGCACTGAGCATTATGGTTGCATCGTT 
GATATGCTTGGCCGTGCGGGTAGGATTGATGAGGCATTTGAATTCATCAAAAGAATGCCA 
TCTAAACCAACTGCTGGTGTCTTGGGTTCCCTTTTAGGAGCATGTAGGGTTCACTTATCT 
GTGGATATTGGCGAATCTGTAGGTCGCAGACTCATTGAGATTGAACCAGAGAATGCAGGG 
AACTATGTCATCCTCTCTAATTTGTATGCTTCGGCTGGAAGATGGGCAGATGTAAACAAT 
GTAAGAGCTATGATGATGCAGAAAGCTGTTACAAAAGAACCAGGAAGAAGCTGGATTCAA 
CACGAGCAAACCTTACATTACTTCCACGCCAATGATCGTACCCATCCAAGAAGGGAAGAA 
GTGTTAGCTAAAATGAAAGAAATATCGATAAAAATGAAGCAAGCTGGTTATGTTCCTGAT 
CTTAGCTGTGTGCTATATGATGTGGATGAAGAGCAGAAGGAGAAAATGCTACTTGGCCAC 
AGTGAGAAACTAGCCTTGACTTTTGGTTTGATTGCTACTGGTGAAGGGATTCCAATTAGA 
GTTTTCAAGAATCTACGTATATGCGTTGATTGTCATAACTTCGCCAAGATCTTCTCAAAG 
GTTTTTGAAAGAGAAGTGTCATTAAGGGATAAAAACAGGTTTCATCAAATTGTCGATGGA 
ATATGTTCATGCGGAGATTACTGGTGA
>AT1G34210.1 |  SERK2 (SOMATIC EMBRYOGENESIS RECEPTOR-LIKE KINASE 2) kinase 
ACTTTTGTAGTGACTAGTGAGTAGAGTAGGCTTTTAGAGAGAGAGAGAGAGAGACGGCTG 
TTGAAAGATAACCACAGAACACAAAAACTCATTCATTAAGAATGAGAAAGAAAGTCCCAA 
AAACCTTTTTTGCTCTGAAAAAGCAACGCAAAGTTTTGAAAAATCTCACACCTTTTTCAC 
TTCTCTGTTTGTAGCTGTTACCACTTGTGTTTCCCCTTTGGCATTTTTCTCGGTTGTCAT 
TAATGAGAGTAAAATCATCATCAAGTGTAAACTTCTCTCTCTCTTTCTCTATCTCTATCT 
CAAAGCTCTCAACTTTGGAGAGATCATGGTTTGTGTTTGATTTCTCAAGTTTTTTTTTTT 
TTACCCTCTTGGAGGATCTGGGAGGAGAAATTTGCTTTTTTTTGGTAAATGGGGAGAAAA 
AAGTTTGAAGCTTTTGGTTTTGTCTGCTTAATCTCACTGCTTCTTCTGTTTAATTCGTTA 
TGGCTTGCCTCTTCTAACATGGAAGGTTTGTTACTCTTCTCTTATCTTTCTTTATTATTT 
ATGTGACTACTTTTGCCTGTTTTAGTTGGTACATGAGAATTGAAATTTCCATCTGGCACT 
GTCTCTGTTCTTCAAAAAGGACTTTTTAAGAATCTGATTCTGTTTGAAAGCTTCAATTTC 
TATAGAGAATTTTGTTCATAATCTATGGTTAGGCTCCTTTAGAAAAATCACAATTGATTT 
ACTGTTTGATTCAATGCTATTTCTCTGATAACGACATGTAAAAGAAACTGTATTTATGTC 
ACTAGCAAACACATTAAGAATGAGGAACTGAAGAAGTATTCAATGCAAACTGTTTGGTTG 
TTTCATGTTTGATTTGTTTGGTAAGTATTTCCTCTTTGCGAAAGGCTTAGGCTTTTGTTT 
GGTATATTAACAGGTGATGCACTGCACAGTTTGAGAGCTAATCTAGTTGATCCAAATAAT 
GTCTTGCAAAGCTGGGATCCTACGCTTGTTAATCCGTGTACTTGGTTTCACGTAACGTGT 
AACAACGAGAACAGTGTTATAAGAGTGTGAGTGTCTCTCTGCTTATACCTGACCTGACCA 
ATGTTCTTTGTTTAGCTTATTTAATCTCTGTTTGTTGTTGCTTCTTCAGCGATCTTGGGA 
ATGCAGACTTGTCTGGTCAGTTGGTTCCTCAGCTAGGTCAGCTCAAGAACTTGCAGTACT 
TGTAAGTTTTTGCTTTGATATCTAAATGGAAGATGAATCGTTACTCTGTAGATAAACGAA 
AAGTTGATGGCATTGACTATGTGTAGCATAACTACTTTATTTCTAGAGACTTGTAGTTTC 
TCATACATGATATAGGCGAATATGGCGATTACACTGAATCTTGTTTTTTCATTTTATTAG 
GACTTATTCAATACCCATTTGTCATTGTACATTGTTAATTTTCTTCTGAGGGTGTGATAC 
TCTTAATGGCAGGGAGCTTTATAGTAATAACATAACCGGGCCGGTTCCAAGCGATCTTGG 
GAATCTGACAAACTTAGTGAGCTTGGATCTTTACTTGAACAGCTTCACTGGTCCAATTCC 
AGATTCTCTAGGAAAGCTATTCAAGCTTCGCTTTCTGTAAGAACACTTAGTCTGCTCTTC 
TCATCATGCACTGGCTAAATGTTTTGCTATATTAGAAATTTACTGAGGATTAAATTCTCT 
TTTGGCTAAATGTTATTACTTGTTGGACATTTGAATAGAAGTAAAAGAAATTGGCTTTGT 
CTTTCAAAACACTGAATAAACTTGTTTGATTGAACTCTGGTATGGGAAGATGGTAATGTG 
GTCTGAGTTTTTGTTGCAAGGATAGTCGGCTCAACAATAACAGTCTCACCGGACCAATTC 
CCATGTCATTGACTAATATCATGACCCTTCAAGTTTTGTGAGTATTACTTTTCAATTTTA 
CTTTCTGCTCTTCTGAACTTCTCACTCTTAATGGATCACCTGAGTTTTCCTTATTATCTT 
GCAAACTGCAGGGATCTGTCGAACAACCGATTATCCGGATCTGTTCCTGATAATGGTTCC 
TTCTCGCTCTTCACTCCCATCAGGTTTGATTGAATCCATACTATGCAAACATTAGTTACT 
CGCATTATGGGAACTGAGGCTATAAGTTAAGATTAAGCACAAGTGGCAATAAAATTTAAA 
GTCTGAACCTAAGTTTCTTTTGATCATCTTAACTCTTAAGTAAATTTACTCAATAGACAT 
ACTACATTTTCCAAATTCTATTACTTTAATTGGTGGATATGGAAATTATCAGCAGTTGAC 
CATTGTTCTTCTGCACAGTTTTGCTAACAACTTGGATCTATGCGGCCCAGTTACTAGCCG 
TCCTTGTCCTGGATCTCCCCCGTTTTCTCCTCCACCACCTTTTATACCACCTCCCATAGT 
TCCTACACCAGGTAATGAAATGAAGCAGACAGTAAAATTTAGATCTTTTACCTTCACTCA 
TGCTTGGAAAAACATATTGGATGGTTGTTATAGAGTTTTTTACTGGATTTTGACTGGAAA 
AAAGTCTATCTCACAGTTTAAGAAGCTTGTGATCAACTTCACTGGTAGTTAACTATTTTG 
GTAAAAATAAAATTTAAGGTGGGTATAGTGCTACTGGAGCCATTGCGGGAGGAGTTGCTG 
CTGGTGCTGCTTTACTATTTGCTGCCCCTGCTTTAGCTTTTGCTTGGTGGCGTAGAAGAA 
AACCTCAAGAATTCTTCTTTGATGTTCCTGGTAAGTCACTGAGTCTGCAATATCCAAGCT 
TTGTTTCATTTCAGAATTGGATTAGTATTTAGTACTTAATTTTTCAGTTCTGTGATGCAG 
CCGAAGAGGACCCTGAGGTTCACTTGGGGCAGCTTAAGCGGTTCTCTCTACGGGAACTTC 
AAGTAGCAACTGATAGCTTCAGCAACAAGAACATTTTGGGCCGAGGTGGGTTCGGAAAAG 
TCTACAAAGGCCGTCTTGCTGATGGAACACTTGTTGCAGTCAAACGGCTTAAAGAAGAGC 
GAACCCCAGGTGGCGAGCTCCAGTTTCAGACAGAAGTGGAGATGATAAGCATGGCCGTTC 
ACAGAAATCTCCTCAGGCTACGCGGTTTCTGTATGACCCCTACCGAGAGATTGCTTGTTT 
ATCCTTACATGGCTAATGGAAGTGTCGCTTCCTGTTTGAGAGGTAACCTTGGAATTTTAA 
CTGTTTGTATCATAAAGTAGAAAGACTCCCACAATGATGTATAAGTGTTGTTTTTGATCT 
TATCCATTTTTAAAACTTTCCAATACAATTGAGTGAGCTTTCTTGAAATGATTACAGAAC 
GTCCACCATCACAGTTGCCTCTAGCCTGGTCAATAAGACAGCAAATCGCGCTAGGATCAG 
CGAGGGGTTTGTCTTATCTTCATGATCATTGCGACCCCAAAATTATTCACCGTGATGTGA 
AAGCTGCTAATATTCTGTTGGACGAGGAATTTGAGGCGGTGGTAGGTGATTTCGGGTTAG 
CTAGACTTATGGACTATAAAGATACTCATGTCACAACGGCTGTGCGTGGGACTATTGGAC 
ACATTGCTCCTGAGTATCTCTCAACTGGAAAATCTTCAGAGAAAACTGATGTTTTTGGCT 
ACGGGATCATGCTTTTGGAACTGATTACAGGTCAGAGAGCTTTTGATCTTGCAAGACTGG 
CGAATGACGATGACGTTATGCTCCTAGATTGGGTATAACACAGATCTTTTAGCACATATC 
TGGCTATCTCTCAAAAAGCTGATTTATCTGTTCATTTGGTCTTCTCAGGTGAAAGGGCTT 
TTGAAGGAGAAGAAGCTGGAGATGCTTGTGGATCCTGACCTGCAAAGCAATTACACAGAA 
GCAGAAGTAGAACAGCTCATACAAGTGGCTCTTCTCTGCACACAGAGCTCACCTATGGAA 
CGACCTAAGATGTCTGAGGTTGTTCGAATGCTTGAAGGTGACGGTTTAGCGGAGAAATGG 
GACGAGTGGCAGAAAGTGGAAGTTCTCAGGCAAGAAGTGGAGCTCTCTTCTCACCCCACC 
TCTGACTGGATCCTTGATTCGACTGATAATCTTCATGCTATGGAGTTGTCTGGTCCAAGA 
TAAACGACATTGTAATTTGCCTAACAGAAAAGAGAAAGAACAGAGAAATATTAAGAGAAT 
CACTTCTCTGTATTCTTTATTTCTTTGGTAGAAAAATAATGTAGTCTCTAATCAAATCTT 
ATTCCATCTATCAGCATTCTTCATTCATTTCTTGTG
>AT5G27330.1 |  LOCATED IN endoplasmic reticulum EXPRESSED IN 22 plant structures EXPRESSED DURING 12 growth stages CONTAINS InterPro DOMAIN/s Prefoldin (InterProIPR009053) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT3G051301) Has 110457 Blast hits to 56483 proteins in 2051 species Archae - 1653 Bacteria - 15111 Metazoa - 52441 Fungi - 7858 Plants - 4309 Viruses - 584 Other Eukaryotes - 28501 (source NCBI BLink) 
GTTTAGTCTTCTCTCTCGCTTCACCGTTAAAAGCCGTAGCAACAAAAACCCAAACCCTAA 
CCAAAACCCTAAAATCTCAGCCGCCACTTCCGTTCAAATGGCGAAGAAGAAAGTGTCTCG 
TAACTCCAATGGAGCTTCCAATGAGCAGCAACAAATTCAGAATCAGTCAGTACCAGTCAC 
TTCCCAGAAATCGACGAAGCTAAGTCGTGAGTCATCAATGGAGGACCATGACTCCTCGGA 
GGAAAAGTTTCAGAATTTGAAGTCACTTAATGCGATTCTTCTGAAGCAGACTATGGAGAA 
GAGACAACAGATTGAATCTCTCTTCCAGGCGAAAGATTCGTTGGAGATCGAGTTGGTTCG 
TTCGGGAAAGGAGAAGACTTTGCTTCGTGAGGAGTTGTGTGGTTCGAGCGATGAGAATTT 
CATGTTGAAGATTGAAATGGATTTGTTAATGGGTTTTGTTGAGGGTCGTGTTAAGGAAAT 
GGGTGTTGAAGTAGATTGGTTATTTAAGGAGAAGAGTGATAGAGAGACTGAGATTAGGGA 
TTTGAAAAGAGAAGCTAATGGTTTGATTAGGAAGTTAGAGAGTGAGAGAGAAGAGTTTAG 
TAGGGTTTGTGATGAGAGGGATTTGGTTAAGAGTGGGTTTGATTTGCAGAGTGAGGAGAT 
GAATCTTTTGAAGGAGAGTGTGGTGAGGTTGGAGATGAGGGAAGTGAGTTTGGGAGAAGA 
AGTTGGGAGATTGAAATGTGAGAATGGTAGATTGGTGAAGGAGAGGAAGAAGAGAGAGGA 
GGTGATTGAGAGGGGTAATAGGGAGAGGAGTGAATTGGTGGAGAGCCTTGAGGAGAAGGT 
TAGGGAGATTGATGTTTTGAAGAGAGAAATTGAAGGAGTTGTGAAAGAGAAGATGGAGGT 
TGAGATGGTGAGACGTGATCAGAGGGAGATGATTGTGGAGTTGGAGAAAAAGCTGGGAGA 
TATGAATGAGATTGTGGAGAGTTTGACTAAGGAAAGGGAGGGGTTACGTGGTCAGGTTGT 
TGGGTTGGAGAAGAGTCTTGATGAGGTTACGGAGGAAGCGAAAGCGAGAGCTGAGCAGAT 
CAATGAACTTGTGAAAGAGAAAACTGTTAAGGAATCTGAGCTTGAGGGTTTAATGGTGGA 
GAACAATTCGATTAAGAAAGAGATTGAGATGGCTATGGTGCAATTCTCTGATAAGGAGAA 
ACTGGTTGAGCAATTGTTGCGTGAGAAGAATGAACTTGTGCAGCGTGTTGTTAACCAGGA 
GGCTGAGATAGTTGAGCTGAGTAAACTGGCGGGTGAGCAAAAGCATGCTGTGGCGCAGCT 
GCGAAAGGATTATAATGATCAAATCAAGAATGGTGAGAAGCTTAACTGCAATGTTAGCCA 
GCTTAAGGATGCTCTGGCGCTGGTGGAGGTCGAGAGAGACAATGCTGGAAAGGCTCTTGA 
TGAGGAGAAGAGAAACATGGTGGCTCTTAAGGAAAAAGTTGTAGCATTAGAGAAAACGAA 
TGAAGCTACTGGTAAAGAGCTTGAGAAGATTAAGGCTGAGCGAGGGAGATTGATTAAAGA 
GAAGAAAGAATTGGAGAATCGATCTGAGTCATTGAGAAACGAAAAGGCTATCCTTCAAAA 
GGATATCGTAGAGCTGAAAAGAGCTACGGGTGTTTTGAAAACAGAGCTAGAATCTGCTGG 
AACCAATGCAAAACAAAGTCTAACAATGCTGAAGAGTGTGTCCTCTCTAGTATGTGGAAT 
AGAAAACAAAAAAGACGAGAAGAAGCGTGGAAAAGGAATGGATTCCTACTCTGTGCAGCT 
AGAAGCTATCAAGAAAGCGTTCAAAAACAAAGAAAGCATGGTCGAGGAAATGAAGAAGGA 
ACTGGCGAAAATGAAGCATTCTGTGGAAGATGCACATAAGAAGAAGAGCTTTTGGACCCT 
TGTTTCATCTGTTACTTCTCTCCTCATGGCTGCTTCTGTTGCTTATGCTGCTTCTCTCAA 
GTGATCACCTGAAGCTGAAGATGGTACTACCCTTTAACTCATTCGAGTTCTCAATTTTCG 
CTGTTTAACTTATAAACCATTTATGTAGTTGCTTATGAGCTTTTTACTCATAACTCCTCT 
ATGTAGTTGGTTCAATAACTATCATGTGAACCTTTTTTCCTTTCGTATCTAGCTCTTCTA 
TGTCTGTGATTTCTGCTACTTTTCTCTCAGAACATGTAGTTGATTCCAGAAACTCCAATA 
AACTTGTTTGTTTCATATTCATATATTATGCAATATATATATGACATCTTTTGGTATC
>AT2G25490.1 |  EBF1 (EIN3-BINDING F BOX PROTEIN 1) protein binding / ubiquitin-protein ligase 
GATCATTTCGTTTTCCTGTTTTTTTTCTCCTTCCTCTCCTCTCTCTCTCTCTGGTTTTTT 
TTTGCCCTATCCATAGGGTTTTCTTCGCTCTCTATCTCCTCGATCTTCATGTGCATCTCT 
CTCTTTGGTATAATCGGAGCTAATTCGTTCGTTTTCTTCTTCATCTTCGTCGGATCTCTC 
TTTTTTGCTCTTTCCCACGGGTTTTCGCTTGAGAAATCAAGCGTTTTCGTTTTTCCTGGG 
AGTTTTGAGCTCAAATCATGTCTCAGATCTTTAGTTTTGCCGGTAAGCTTCCTGATCTCT 
GAATTTTTTTTCATCTTTGAATTTGATTTATCAGATTGAAATTTTTTATTTCTCTGTTAG 
ATTTAGACGTTTCCGTGTTTGAGGTGATTTACGTACGAGGAATAAGATTTGGCTTATAGG 
ATAGTGTTCACCGATGACTTGATTCTTCCTAGATCTATAGATATATATATATATATATAT 
ATATATATATAGATCTGAGTTTTAGTTTTGGCCGTTGACGAGCTATAGATCTTATCTGCA 
ATTTCTCTGTTTTGGTATATTGGATCTGAGAATCATTAGGTTAATTTACTGAATCTGATT 
TTGATTCAATGATCTCTTTAGCTCATAAGAATTTCTCCTTTGGTATTTTACAGGTGAAAA 
TGATTTTTACCGTCGTGGCGCAATATACCCAAACCCAAAGGATGCTAGTCTTTTGTTATC 
GCTTGGTAGTTTCGCTGATGTTTATTTCCCTCCAAGCAAGAGATCACGTGTTGTTGCACC 
TACGATCTTCAGTGCTTTCGAGAAAAAGCCAGTTTCCATTGATGTGCTACCAGATGAGTG 
TCTTTTTGAGATCTTTAGGCGTTTGTCTGGACCACAAGAGAGGAGTGCTTGCGCTTTTGT 
CTCCAAACAGTGGCTTACGCTTGTAAGTAGCATCCGTCAAAAGGAGATTGATGTTCCTTC 
CAAGATAACTGAAGATGGTGATGATTGTGAAGGGTGTTTGTCTAGGAGCTTAGATGGGAA 
GAAGGCAACAGATGTTAGATTGGCAGCAATTGCTGTTGGAACTGCTGGTCGTGGGGGACT 
TGGAAAATTGTCGATTCGAGGTAGCAACTCTGCTAAAGTTTCAGATCTTGGTCTTCGGTC 
TATTGGTCGTAGCTGCCCTTCTCTCGGGTCTCTTTCACTGTGGAACGTTTCTACCATTAC 
TGACAATGGACTTTTGGAGATTGCTGAGGGTTGTGCTCAACTTGAGAAGCTTGAGCTGAA 
CCGCTGCTCTACAATCACTGACAAGGGTTTGGTAGCTATTGCTAAGAGCTGCCCCAACTT 
GACTGAGCTGACATTGGAGGCTTGTTCAAGAATTGGAGATGAGGGTTTGCTAGCCATTGC 
AAGATCCTGCTCCAAGCTGAAGTCAGTCTCGATCAAGAACTGTCCTCTTGTCAGGGATCA 
AGGAATCGCCTCTCTACTGTCTAACACCACCTGTTCCTTGGCAAAACTTAAGCTTCAGAT 
GCTGAATGTCACTGATGTGTCTCTTGCTGTTGTGGGTCATTACGGCTTGTCGATCACTGA 
TCTTGTGCTCGCTGGATTATCACACGTGAGCGAGAAGGGATTCTGGGTCATGGGAAATGG 
TGTCGGGCTGCAAAAATTAAACTCTCTGACCATCACAGCCTGCCAAGGAGTGACTGACAT 
GGGGCTTGAATCTGTTGGAAAGGGCTGCCCGAACATGAAAAAGGCGATCATCAGTAAATC 
CCCTTTGTTATCTGACAACGGGTTGGTCTCTTTTGCAAAAGCTTCTTTATCACTTGAGAG 
TCTTCAGCTTGAAGAATGCCACAGGGTTACCCAATTTGGGTTTTTTGGTTCCCTTTTGAA 
CTGTGGTGAAAAGTTGAAGGCTTTCTCTCTGGTGAACTGTTTGAGTATTAGAGATCTCAC 
CACAGGATTGCCTGCTTCATCTCATTGCAGCGCTCTGCGCTCTTTGTCTATTCGTAACTG 
CCCTGGCTTTGGTGATGCAAATCTTGCAGCCATCGGGAAGTTGTGCCCTCAGCTCGAGGA 
TATTGATCTGTGTGGGCTCAAGGGGATAACAGAGTCTGGTTTCCTACATCTGATTCAGAG 
CTCTCTTGTGAAGATCAACTTCAGTGGTTGTTCCAATTTGACTGATAGAGTGATCTCTGC 
CATCACTGCTCGTAACGGGTGGACTCTTGAAGTCTTAAACATCGATGGATGTTCCAATAT 
CACTGACGCCAGCCTGGTCTCCATTGCAGCAAACTGCCAGATTCTCAGTGATTTGGATAT 
TTCGAAATGCGCAATCTCAGATTCAGGGATTCAAGCATTGGCCTCCTCTGATAAGCTCAA 
ACTGCAGATCCTATCAGTTGCAGGTTGCTCTATGGTTACAGACAAGAGCTTGCCAGCCAT 
CGTCGGGTTGGGTTCCACTCTATTGGGATTAAACCTCCAACAGTGTCGATCCATTTCCAA 
TTCCACTGTCGACTTCTTAGTCGAGCGTCTTTACAAATGTGACATCCTCTCCTGATCAAC 
AATTCCACTGTCGACCTCTCCACTTATAATGTAAGGTATTTTAGTCCGTCCAAGTTCGTT 
TATCAAGTCTCAACTCTTTTTCAGGCTCTGTGATCTACGCGTCCCTCTCGCTAGGCTCGG 
TACCCGGTTTTCCTTCTTTTTTCCAAGCAGTTCGCTTCCGGTTTCTTTTTTTCCAGTCAA 
TGGTTTTTTTGCAGGGTTTCCTTTTTATCAAGGAAACTCTAACTTATCTCGAAATCGGAA 
GGTTGTGTGTTGCTTGAGAATATATTTACCGGGTAGACGAGATTTTAGGGTTTATCATCT 
CCTGTTTCTTGCTGCTTTGGTGTCTTTTACGAGCCAGTTATCTATTTTATCCCCGTCGTT 
TGTCTTTGTTGGGTTTGGTTTTCCTCGGAAGGGTTTTGTATGATATGTACGGGTTCAGGT 
TTGGTTCAGGCCTTGGTTCTCTCAGAGGTTTGCTCTGGCTTTTGTTGCAACAACTCTATT 
AAAGAGTTTTTTTTGGGTTTTTTTTTTTGCCAGAGCCCTCGTTTTTTTTCCAGGCACTGT 
ATTACAATCTGTATGAACACTGTACTAAATTATGTAATGCCCTTTGGGATGGTTTAATGA 
GTATTGAATAAAATTCTCATTTCGTTGTATATTACAAA
>AT5G40410.1 |  INVOLVED IN biological_process unknown CONTAINS InterPro DOMAIN/s Pentatricopeptide repeat (InterProIPR002885) BEST Arabidopsis thaliana protein match is pentatricopeptide (PPR) repeat-containing protein (TAIRAT1G080701) Has 12213 Blast hits to 4641 proteins in 127 species Archae - 0 Bacteria - 0 Metazoa - 61 Fungi - 12 Plants - 12006 Viruses - 0 Other Eukaryotes - 134 (source NCBI BLink) 
AACAATGAGAGATGATCAAAGCAAATGTGTATTCATGTTCAAAGTTCAGATTTTTATATC 
GTCGGAGATTTTTGAGCCAATCTAGTTTTGTTCATAGTTTAGATGCAAATGTTTCATCAC 
TTATAGCTGCTGTAAAGTCTTGTGTCTCCATTGAACTGTGTCGTTTGCTTCATTGTAAAG 
TAGTGAAGAGTGTGAGTTACCGCCATGGATTTATTGGGGACCAGCTTGTTGGTTGTTACT 
TGAGATTGGGTCATGATGTTTGTGCAGAGAAGCTGTTCGATGAAATGCCTGAGAGAGATT 
TAGTCTCTTGGAACTCTTTAATATCTGGGTATTCAGGGAGAGGTTATTTGGGGAAATGTT 
TTGAGGTGTTGTCTAGGATGATGATATCTGAGGTGGGTTTTAGACCTAATGAAGTTACGT 
TTTTGTCGATGATCTCGGCTTGTGTTTACGGGGGAAGTAAGGAAGAAGGACGTTGTATTC 
ATGGGTTGGTGATGAAATTTGGTGTACTTGAGGAAGTTAAAGTTGTGAATGCTTTTATCA 
ATTGGTATGGAAAGACTGGAGATTTGACTTCATCCTGCAAATTGTTTGAGGATTTGAGTA 
TAAAGAATTTGGTTTCTTGGAATACGATGATTGTGATTCACTTGCAGAACGGTTTAGCTG 
AAAAAGGTCTGGCTTACTTTAACATGAGCAGAAGGGTTGGGCACGAGCCTGATCAAGCCA 
CTTTTCTGGCTGTTCTCCGTAGCTGTGAAGACATGGGTGTAGTGAGATTGGCGCAAGGAA 
TTCATGGGCTAATCATGTTTGGTGGGTTCAGTGGAAACAAGTGTATAACGACAGCGTTAT 
TAGACTTGTATTCGAAACTTGGGAGGTTGGAGGATTCGTCTACGGTTTTTCATGAGATTA 
CTTCTCCAGATAGTATGGCGTGGACTGCAATGCTTGCTGCTTATGCCACTCATGGGTTTG 
GAAGAGATGCAATCAAGCATTTTGAGCTCATGGTTCACTATGGTATTAGTCCTGATCATG 
TAACCTTCACTCACTTGTTAAATGCTTGTAGTCATTCTGGTCTTGTAGAAGAAGGGAAGC 
ATTATTTCGAAACAATGTCGAAAAGATACAGGATTGACCCGAGGTTAGATCACTATTCAT 
GTATGGTTGATCTACTGGGTCGATCCGGGCTTCTCCAAGATGCTTATGGATTGATCAAAG 
AAATGCCAATGGAGCCTAGTTCTGGTGTTTGGGGAGCTTTGCTTGGTGCTTGTAGGGTTT 
ACAAAGATACACAACTCGGAACAAAAGCCGCAGAGAGATTGTTCGAATTAGAGCCTCGTG 
ATGGTAGAAACTATGTAATGCTCTCAAACATCTATTCAGCTTCTGGTCTATGGAAAGATG 
CTTCAAGAATAAGGAATCTGATGAAACAGAAAGGTCTTGTAAGAGCTTCAGGGTGTAGCT 
ACATTGAACATGGTAACAAAATTCATAAGTTTGTTGTTGGAGATTGGTCTCATCCTGAAT 
CAGAGAAGATACAAAAGAAGCTGAAAGAGATTAGGAAGAAGATGAAGAGTGAAATGGGAT 
ATAAATCAAAAACAGAGTTTGTATTACATGATGTTGGTGAAGATGTTAAAGAGGAAATGA 
TCAATCAACATAGTGAGAAGATTGCGATGGCGTTTGGGCTTTTGGTGGTTAGTCCAATGG 
AGCCAATTATCATAAGGAAGAATCTTAGAATTTGTGGGGATTGTCATGAAACAGCAAAAG 
CAATATCTTTGATCGAGAAAAGGAGAATCATTATTAGAGATTCTAAGAGGTTTCATCATT 
TCTTAGACGGATCATGCTCTTGTAGCGATTATTGGTAGTATCTCAATTTCTCA
>AT2G41080.1 |  pentatricopeptide (PPR) repeat-containing protein 
TCTGATGAGCATGTACTCCAAGCTTGGAGATTTTCCTTCCGCTGTTGCAGTATATGGACG 
TATGCGTAAGAAGAATTATATGTCATCTAACATACTCATCAATGGGTATGTTCGTGCAGG 
CGATTTGGTAAATGCCCGGAAGGTGTTTGATGAAATGCCTGATAGAAAGCTCACAACTTG 
GAATGCTATGATTGCTGGTTTGATCCAGTTTGAGTTTAACGAAGAGGGTTTGAGTTTGTT 
TAGGGAAATGCACGGATTGGGGTTTTCGCCTGATGAATATACTCTTGGTAGTGTTTTTAG 
CGGATCTGCTGGGTTGAGGTCAGTGTCTATAGGGCAGCAGATTCATGGCTACACGATCAA 
ATATGGGCTTGAGTTGGATTTAGTTGTTAATAGTTCGTTGGCTCATATGTATATGAGAAA 
TGGGAAATTGCAAGATGGCGAGATTGTTATAAGATCAATGCCGGTTCGTAATTTGGTTGC 
ATGGAATACACTCATCATGGGGAACGCGCAAAACGGATGTCCTGAAACCGTGCTTTATCT 
GTATAAGATGATGAAAATCTCAGGTTGTAGGCCGAACAAGATCACATTTGTAACTGTGCT 
CAGTTCTTGTTCTGACTTAGCGATAAGAGGTCAAGGTCAGCAGATTCATGCGGAAGCTAT 
TAAAATTGGGGCTAGCTCTGTCGTAGCTGTAGTTAGTTCATTGATCAGTATGTATTCAAA 
ATGTGGGTGTCTTGGAGATGCAGCTAAAGCTTTCTCGGAACGTGAAGATGAAGATGAAGT 
GATGTGGAGCTCAATGATTTCTGCTTATGGGTTTCATGGACAAGGCGATGAGGCAATCGA 
GCTGTTTAATACTATGGCAGAGCAGACGAACATGGAGATAAACGAGGTCGCGTTTCTGAA 
TTTACTTTATGCGTGTAGTCATTCTGGCTTAAAAGACAAAGGGCTTGAGTTGTTTGACAT 
GATGGTGGAAAAATACGGGTTCAAGCCAGGTTTAAAGCACTACACTTGTGTGGTGGATTT 
GCTCGGTCGAGCAGGCTGTTTGGATCAAGCCGAGGCAATAATAAGGTCTATGCCAATAAA 
AACAGACATTGTTATATGGAAAACATTGTTATCTGCGTGTAACATACACAAAAACGCAGA 
AATGGCACAGAGAGTCTTTAAAGAGATTCTTCAGATTGACCCTAATGACTCTGCTTGTTA 
CGTCCTGCTTGCAAATGTTCACGCCTCAGCTAAGAGATGGCGCGATGTATCAGAGGTGAG 
AAAATCGATGAGAGATAAGAATGTGAAGAAAGAAGCGGGAATCAGCTGGTTTGAACACAA 
AGGTGAAGTTCATCAGTTTAAGATGGGAGATCGCTCTCAGTCTAAATCTAAAGAGATCTA 
TTCATACCTGAAAGAACTGACTCTGGAGATGAAGCTGAAGGGATACAAGCCTGATACAGC 
GTCAGTATTGCATGATATGGATGAGGAGGAGAAAGAATCAGACTTGGTGCAGCACAGCGA 
GAAACTAGCGGTTGCATTTGCGCTCATGATATTACCTGAAGGTGCACCGATTAGAATAAT 
CAAGAACTTGCGGGTTTGCAGTGATTGCCATGTTGCTTTCAAATATATATCGGTGATCAA 
GAACCGAGAGATCACGTTAAGAGATGGTAGTAGATTCCATCATTTCATTAACGGAAAGTG 
TTCTTGCGGCGATTATTGGTAGCTGGGCAGTTAAAGTATGCTATGACTCTCTGCTTCAGA 
G
>AT1G21810.1 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN cellular_component unknown EXPRESSED IN 14 plant structures EXPRESSED DURING 6 growth stages CONTAINS InterPro DOMAIN/s Protein of unknown function DUF869 plant (InterProIPR008587) BEST Arabidopsis thaliana protein match is myosin heavy chain-related (TAIRAT1G775802) Has 62951 Blast hits to 33517 proteins in 1593 species Archae - 800 Bacteria - 6354 Metazoa - 33836 Fungi - 4765 Plants - 2651 Viruses - 311 Other Eukaryotes - 14234 (source NCBI BLink) 
ATGACTGGTACTACTTTGATATTGGAACCAGTTATGGATTCCAAAGATGAGCTAGTGAAA 
CAACATGCCAAAGTTGCAGAAGATGCTGTTGCTGGTAATTAGTCACTCCTTTTCTGATCC 
TGCTTCTGGAGTTAGCATTTTATAGTGAAAGCATTTTCTCTTGTGGTTTTTTTTCAAGGG 
TGGGAGAAAGCTGAGAATGAAGTGGTTGAGCTGAAGCAAAAGCTCGAGGATGCGGCTGAT 
AAGAACATTGTCTTGGAAGATAGGGTAAGTCATCTTGATGGGGCGCTTAAGGAGTGTGTG 
AGACAACTAAGGCAGTTTAGAGATGAACAAGAGAAGAATATTCAAGCAGCTGTGACTGAG 
AGCACCAAGGAGTTGCATTCTGCTAACACTGGTCTAGAGAAGCGAGTTCTTGAGCTCCAG 
AAAGAAGCAGAAGCTGCTAAATCTGAAAACATGATGCTGAGACGCGAGTTTCTTACACAG 
CGTGAAGACCTTGAGATTGTGATGATTGAGAGGGACTTGAGCACTCAAGCAGCAGAAACT 
GCTAGTAAGCAGCATTTAGACATCATAAAGAAGTTGGCGAAACTCGAAGCTGAGTGCAGG 
AAGCTTAGGATTTTGGCTAAAACATCATCATCGCTATCATCTAATCAATCTGTGGATAGT 
CATTCAGATGGAGGAAGGGAGAGAGTTGAGGGGAGTTGCTCTGATTCATGGGCATCATCA 
GCATTTATTTCTGAGTTAGATCAAATCAAGAATGAAAAAGGCGGCAATAGAAGTCTTCAG 
GGCACTACTTCATCAACTGAAATTGATCTCATGGATGATTTCCTTGAGATGGAACGTCTT 
GTGGCTCTTCCTACTGAGACACAGGCTAAAAACTCCAAGGATGGATATGAATTGAGCTTA 
ATGGAGAAGTTGGAGAAGATACAAGCAGAAAAGGATGATCTTGAAAGAGAAGTTAAATGT 
TGTAGAGAAGCGGAAAAGAGATTGAGCTTAGAGATTGAAGCAGTTGTTGGTGACAAAATG 
GAGTTGGAAGATATGTTAAAGAGGGTGGAAGCTGAGAAAGCTGAGCTAAAGACATCTTTT 
GACGTGCTCAAAGATAAATATCAAGAGTCAAGAGTTTGTTTTCAAGAAGTTGACACAAAG 
TTGGAGAAGCTACAAGCAGAAAAGGATGAGCTTGACAGTGAAGTTATTTGTTGTAAAGAA 
GCAGAGAAGAGATTCAGCTTAGAACTCGAAGCTGTAGTTGGTGACAAAATTGAGATGGAA 
GATGAGTTGGAGAAGATGGAAGCTGAGAAAGCTGAGCTAAAGATATCTTTTGACGTAATT 
AAAGATCAATATCAAGAATCTAGAGTTTGTTTTCAAGAAGTTGAGATGAAGTTGGAAGCG 
ATGAAAAGGGAGCTTAAACTAGCTAATGAATCGAAAACACAAGCCGAATCTCGGGTGACC 
AGAATGGAAGCAGAGGTGAGAAAGGAGAGGATTGTCTCTGATGGGCTAAAGGAAAAGTGT 
GAGACATTTGAAGAAGAGCTTAGAAGAGAGATAGAAGAGAAGACAATGATCAAGAGAGAA 
AAAGTGGAACCAAAGATCAAACAGGTATAGCTTTTTATTTTTCTGATCACCAATTAGTCT 
TTTTTTCTTTTTCTTAATCCTTGTTTCACTAAAAGCTGATTCTATTGCAGGAAGACATAG 
CAACAGCTGCAGGAAAATTTGCAGATTGTCAGAAAACAATAGCATCACTTGGGAAACAGC 
TACAATCTCTTGCAACACTAGAAGAATTCTTGATCGATACAGCTAGCATTCCAGGTTCTG 
CAAGGTCAGTTCACAACAAGGAAGCTTTGTTAGGAAAAGATCCTCATGAGTGCATCAAAA 
CAATCAATGGAAGATCACTTGAGTTTCTTGCAATCCAGAACAGCAATAACAAGACCTCAC 
CTCCTTGTTCATCTTCTTCGGACTCAACAACAGTGTCATTAATTATGTCATCAAACCGAG 
GGAGTTCTGAGAAGAATCGCAACGGATTTGCCACGGTTTTCACTCGAAGTAGAAACTCAG 
TAAATTTGGGGATTTAGGAGAAGCAATAGGTTCGTTCTTGGAGTATAGCTAGAATCAATC 
TGTTGTTAAGCTGGATTTCTTACCATAATTTTCAGATTCTGACTTGTTAGTAGCAAAAAG 
ATCACCCTTGTTTCTTTTGCTACATAGTGTAATCTTCTTAACGTTTATGTCTTGTGAGAT 
TCATCTTTGTTATTTC