>AT3G10070.1 |  TAF12 (TBP-ASSOCIATED FACTOR 12) DNA binding / transcription initiation factor 
ACATTCACAACCTTTTCATGGATCAGCCACGGCAAAGCTCGACGGCGTCTCAGCCACCGG 
AAACACCTCCTCAGCCGTCGGATTCGAAACCATCGACTTTAACACAGATTCAACCAACGC 
CTTCCACAAATCCGAGTCCCTCCTCCGTTGTTTCCTCTATTCCCTCTTCCCCAGCTCCAC 
AATCACCATCGCTCAATCCAAACCCTAATCCTCCTCAATACACGCGTCCGGTGACATCTC 
CGGCGACGCAACAGCAGCAACATCTCTCCCAGCCTTTAGTACGGCCGCCTCCGCAGGCGT 
ACTCTAGACCATGGCAACAACACTCCTCTTACACTCACTTCTCCTCCGCTTCGTCGCCAT 
TGTTATCTTCCTCATCAGCGCCAGCCTCTTCGTCTTCGTCTTTACCTATTAGTGGGCAAC 
AAAGAGGTGGTATGGCGATTGGCGTTCCGGCTTCCCCAATTCCGAGTCCTTCTCCTACAC 
CTTCTCAACATTCGCCTTCTGCTTTTCCCGGCTCTTTTGGGCAACAATACGGCGGATTAG 
GTCGTGGAACTGTTGGAATGTCAGAAGCTACGTCCAATACAAGTTCCCCTCAGGTGGATT 
TTGTTCTCTCATATGTGCCACATTTCAAAGAATTGTGCTCTGTTACAAAAAAAATACCAA 
CTTTTTTCTGTATGTGTGTGTTCAGGTTAGAATGATGCAAGGAACTCAGGGAATTGGGAT 
GATGGGAACACTTGGCTCAGGTTCCCAAATAAGGCCGAGTGGGATGACTCAGCACCAACA 
GAGACCAACTCAATCGTCTCTAAGGCCAGCTTCCTCCACAAGTACCCAATCTCCTGTTGC 
TCAAGTATGTATTTTTTACCCTTCTGGCCTCTGTATCTTGACATAGGAACTCCAGCTTTG 
TGACTCCTTTGTTAACAACAACTTTGTTTGCACACTTTAATTCCTTATATTTTACTTTTG 
GCAGAACTTTCAAGGACATAGCCTTATGAGACCTTCACCTATTAGCTCTCCTAATGTCCA 
ATCTACTGGTGCATCACAACAAAGCTTGCAAGCTATCAATCAACCTTGGTTATCATCTAC 
TCCCCAAGGGAAGCCGCCTTTGCCACCTCCTTCGTATAGACCACAAGTAAATTCTCCCTC 
TATGCAACAAAGACCTCATATTCCTCAGCAACATATTTCTACGTCAGCAGCAACTCCACA 
GCCTCAGCAACAACAATCTCAACAACAACATCAGCCTCAAGAGCAACTTCAACAATTGAG 
ATCTCCGCAGCAGCCTTTAGCTCACCCACATCAACCAACCCGGGTCCAAGGTTTGGTAAA 
TCAGAAAGTTACTTCTCCGGTAATGCCGAGCCAGCCTCCTGTGGCTCAACCAGGAAACCA 
TGCTAAAACAGTTTCTGCAGAGACCGAGCCGTCTGATGATCGTATCCTGGGGAAACGAAG 
CATCCATGAGCTACTTCAACAGGTATCCACTGACCTCTCTTCATATTCTCAACTTTGGGT 
AGCCATTTTGCATATTGACCAGTATTCTCATTCTCACGCTTGATTCTTACTTAATGTGTG 
ATTGATTTCTTCTACTTGGATAGATCTCTTACAGATAAAGGGGAATGTATTTGATAAAAA 
TAAAATAGAAGAGATATTGAGTTATTTGCTAATAATACCTCCCATAGAAAGCCAAAAAGC 
ATGTATGATGACTATTGACAGATTCAGTAATAACATATATCGATAGTTTTAGGGGCTACT 
TTGATGGTGTTCAACACTGAACTCTCTGCATGATATCCATTGATTTTTTTCTCTAAATTT 
AAAATGCCTTGTCTATCCACAGATTGACCCGTCAGAGAAATTGGATCCAGAGGTTGAAGA 
CATCCTTTCTGATATTGCTGAAGATTTTGTGGAGTCAGTAAGTAGCCCACACCGCATTGG 
TTAAGAGTCAAACATTTCTGTCCTGTGATGAGTTTTTTTTTTCTAACTCCTGCTCGTTGT 
GTTTCCTTTAACCCAGATCACAACTTTTGGCTGTTCTTTAGCCAAGCATAGAAAGTCAGA 
TATTCTAGAGGCTAAGGACATCTTGCTCCATGTTGGTTCGTTCTGATCAATATTATCTAC 
CTGCCCGTTTTCTTTCAATTTCTCATTCCAATTACAATGAGTAATTTCTTTTTGGTATCT 
TCTCCTTAAATGTAGAAAGAAACTGGAATATTAGGCCTCCTGGTTTTAGCAGTGATGAGT 
TCAAGACATTTCGGAAGCCAGTAAGAATTAGAGATCTGCATTTTATCAATTCGTCTTATT 
TACAAAATAGTTCCTAACATCTTTGGTTGTTTCCTTGGATAATGTTTTGCAGTTGACAAC 
AGATATTCACAAAGAGCGACTTGCGGCTGTAAGTTTTAACACATGATTTTAAACATCCTA 
ATAGTATAGATTATGGTTCTGAGTAGAAGAGAAACTTATATATTGAAGATTTTCTTGTTG 
ATTTTGTATATACATTGTGGATTTTGAAGATAAAGAAGTCGGTCACGGCAACAGAAGCAG 
CAAACGCCAGGAATCAGTTTGGGCATGGGACTGCTAATGCAAGAGGCGGTCAGGCAAAGA 
CACCTTCTAATCCCATGGGCTCTACAACTTTCAATCACTGATGTATTAGATCAACCCTCC 
AATCGTGTTTGGTAGGAATTTTCCATTAAAAAGGTACGAAATTTGTATCACCCATTTTGT 
ACCCTATCTAGATCTCACAGTCCAAGATAACTACAGGCATGGAGGCTAGGCTAGATCTTA 
ATTTTAACCATTAGTAAAGTATCTTACCACCTCATGGGATTGAATTGATTCAATGTTTTG 
CAAAACGGCCAAAAGTGTATGAATTGAGTGATGAGACACAGTTATAAAGCGGTTACAAAA 
CCGGACTAGCCGGTATAATCCATTAAACAGATTCTTCTCATACAGTTGAATCTTCC
>AT1G74140.1 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN integral to membrane chloroplast CONTAINS InterPro DOMAIN/s Peptidase S54 rhomboid (InterProIPR002610) BEST Arabidopsis thaliana protein match is rhomboid family protein (TAIRAT1G741301) Has 484 Blast hits to 484 proteins in 189 species Archae - 2 Bacteria - 267 Metazoa - 30 Fungi - 47 Plants - 36 Viruses - 0 Other Eukaryotes - 102 (source NCBI BLink) 
ATTCCGAAGAAAAAGGAGAAAAAATGCATGCGATTTTCTCGAGCTTCTCTCGCAAGGTCG 
TCGTCAACGTTGGTGCTTCTTCGCAATCGCAGCTAACCAAAATGGTGAAGAAGAAGCCAA 
ACCAGAGTCGCCATCTTCTTCCTTCTCGCCTCTCATCTCCTTCTTCAGTACCCCACTTTG 
TTCCTTCTGCTGTATCGCGATCGGCGAAAGTTCATGGGTTCTTCGCTAGCAAGTTGGGAA 
ACACCAACCTAAAGCTGAAATTTGGAAATGTTATGGAATCTAGAGCTGGGTTCTTCAGTT 
CTGAGCTTCCTAGTCATGGATTCGAGTCTGGTGGTTTTACTGGGTTTCAAAAGCGGGGAT 
GGTATGGAATCGTTTCTTTTCGTCTCTCTTTTTGTGATTTTGCTGGGTTTCAAAGTAAAG 
TTCGAAGCTTTCCTAAGAAAAGTTCTGAAATTTCTGAGTTCCTTTTCTCTATGATCCTCT 
GTTTCTCGTTAGCTAAGTAAAAGTCTACATTTTTCTGAGATAAACTTGTCTTCCGGATGA 
AATTTCTGGTTTTGATTTAGGGTTTAAACAGAGTTTCTGTGGTAAATTATCGACCTTTTT 
AAGCTTTTCTGGTTAAGTACACGTTTGGCTCGTCCAAGACATACGACATGTATGAGAATT 
TCAGAGAAGAAGATGAGAAATTAGTAGGCTTTACTTGCTATTTTGCCATTTCTGTGTAGT 
TGTTGATTTGAGTAGCGCAACGTTCATTTGATGATTTTCTCTTTATAGATAGACTAGCAG 
CTCGACACTGTACACACCCTTCTCATTAAAACTTTGTTCTCCAGATATAGAAATAACATC 
AGAATCTTAAAATTCTTCACCAGCTCTAGTTGATTTTGGTTTGTGTTTTCCTCTGGCAGG 
AAGTCTTGGATCAATGGAGCTAATGGTGTAGTTTTTGGACTGGTAATAGCTAATGCTGCT 
GTATTTACAATGTGGCGAGTTTCAGACAGGAGCTGGATGGTGAAAAATTTTGTGGTCTAA 
CATATTACTGCTAAGCTCTTGTCTTGGATTTGGTTGATCTTTGATATGTTTGAACCTTGA 
TTTTTGAGCAGCTATCAACGTACAGTTTTACGAGTGGATATATACACACGCTGATAACTT 
CGGGTTTTAGTCATATTGGTACCAGTCAAATCATCTTGAACATGATTGGAATCTCCTACT 
TCGGCTCCAGAGTATGAATTATCTTTCCTTGTCGTTTCTCCATAGAAAAATGCTTCTCAA 
TATTCGATTTTTGTAAATGAGATGAGAAATGCTTTCATTTGTTTCCTATGAAGTGGCTAC 
TTTCACTTTATTGGAGACACATTCACTAATAGCTAATATACTTAAGAAAGAAAAGTCCTT 
AGCTAGATACTATTAAGTCGGTTCTTTTTGGGAAATTTGCTACCTGCTGATCTATCCATT 
GAATTAACTATTTATCTTCATCTCTAAAATCCATTGTTGCTATTAGATTGCAAGAACCTT 
GGGACCGCTCTACCTTTTGAAGCTGTACTTTGCTGGAGCACTTGGTGGCTCTGTTTGCTT 
CCTGAGTTATCACGCCCTCTTGGCTACACTCAAGGTATATAGCAAGTTTCCTTAAATCAT 
GCTATAAAAAAGCTTTTAAAAGAATAACTCTCTTTTATTGATGATTAGAAAACTCTCTCA 
AAAACTAAACACCAAAGCTCTTTTAATGTTTTCTCTTATGTTTGCCACAACTCAACAAAC 
ATATATATAGGGTTAGCACACACTGCAATTTCCTATTCCTAGTCCTACATGAATTAGGAA 
ACTCCTTTTTTCCTTTTCTTCTTGAACTAGATAATTACTCTTTTTGACATAACTTAAACC 
GACTTGTTTAAGTTGTCTTGGGCTTCAAGACTTGAAGCTTACCCAACAAGGTCCACAGAT 
TGATAGTTTCTTTCATTGATCATGTTGCATTTCATTTGCAGTACATGTTATATGTGCAAA 
ACGAAGTTTTGAGGTTTAGACTAAATGATTTTACCTCTGGAGATTAAACGAAATGCTTTA 
CCTTTGATATTGGCTGGTCAATCATACAGGGTGAAGGAGTTGTCATTAAGGATCATCAAT 
CAACAGCCCCCATTTCGCAGCTGCTGGTATGTTCGTACTCACCTAACTAAATGATTTGTC 
TGAGCATGAACAGACTTGGTTGATAGATATAGAGAGTGAGACGAAAATGCACATGCACTT 
GTTTTTGGATCACCAAATGCCGATGAGGTAAACTCTTTCCTATTTGTGTACAGGGTGCTG 
ATGGATCTATGTTTGCCATCGCGCTTCTCGATATGTTCATCTACCCAAAAGTTACAACAT 
ACTTTGCGTTGATGCTCCGAGTGCATGTAATGTTTGTAAGAACTCTCTACATACTCTTCT 
TATTGAGTGAAAGTCATGTGAACCAACTAAAAAACTCGCACTCTTTAATCTTCAGCGAAT 
CATAAACTTAGGAGTCGAGATTTTAAATATTCCAGAGGTGTGTATAATCTGAAAACCTAA 
TCCGCATTTGCTTTGAAACACTATATATGACTATCTCTTAACCATAACTTTTTTGGGAAT 
GCAGGGGGGACCGAACCATATCGCATCGAGTTCGGGTCAGTTGGGAGGAGTTGTGGTTGC 
TGCCATGGCGTGGGCACGGATAAAGAAAGGTCGATTTTGATACTGAACTGATTTGCTGCA 
AAAGATTCAGTGGACCTGATCATCATCATGCAAGTTATGTTAAACTTCGTGACAAATGAG 
ATTTTTGTCAACATTTTTTCGTTTGGGTTGGACTTTTGTTTTGTAAACTGAAACCAAGAG 
ATCAATTTGATGGGTTTGGACCGTTTGGTTACATTGGAAACTATTTTTTTTTC
>AT1G74140.1 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN integral to membrane chloroplast CONTAINS InterPro DOMAIN/s Peptidase S54 rhomboid (InterProIPR002610) BEST Arabidopsis thaliana protein match is rhomboid family protein (TAIRAT1G741301) Has 388 Blast hits to 388 proteins in 139 species Archae - 2 Bacteria - 244 Metazoa - 0 Fungi - 2 Plants - 36 Viruses - 0 Other Eukaryotes - 104 (source NCBI BLink) 
ATTCCGAAGAAAAAGGAGAAAAAATGCATGCGATTTTCTCGAGCTTCTCTCGCAAGGTCG 
TCGTCAACGTTGGTGCTTCTTCGCAATCGCAGCTAACCAAAATGGTGAAGAAGAAGCCAA 
ACCAGAGTCGCCATCTTCTTCCTTCTCGCCTCTCATCTCCTTCTTCAGTACCCCACTTTG 
TTCCTTCTGCTGTATCGCGATCGGCGAAAGTTCATGGGTTCTTCGCTAGCAAGTTGGGAA 
ACACCAACCTAAAGCTGAAATTTGGAAATGTTATGGAATCTAGAGCTGGGTTCTTCAGTT 
CTGAGCTTCCTAGTCATGGATTCGAGTCTGGTGGTTTTACTGGGTTTCAAAAGCGGGGAT 
GGTATGGAATCGTTTCTTTTCGTCTCTCTTTTTGTGATTTTGCTGGGTTTCAAAGTAAAG 
TTCGAAGCTTTCCTAAGAAAAGTTCTGAAATTTCTGAGTTCCTTTTCTCTATGATCCTCT 
GTTTCTCGTTAGCTAAGTAAAAGTCTACATTTTTCTGAGATAAACTTGTCTTCCGGATGA 
AATTTCTGGTTTTGATTTAGGGTTTAAACAGAGTTTCTGTGGTAAATTATCGACCTTTTT 
AAGCTTTTCTGGTTAAGTACACGTTTGGCTCGTCCAAGACATACGACATGTATGAGAATT 
TCAGAGAAGAAGATGAGAAATTAGTAGGCTTTACTTGCTATTTTGCCATTTCTGTGTAGT 
TGTTGATTTGAGTAGCGCAACGTTCATTTGATGATTTTCTCTTTATAGATAGACTAGCAG 
CTCGACACTGTACACACCCTTCTCATTAAAACTTTGTTCTCCAGATATAGAAATAACATC 
AGAATCTTAAAATTCTTCACCAGCTCTAGTTGATTTTGGTTTGTGTTTTCCTCTGGCAGG 
AAGTCTTGGATCAATGGAGCTAATGGTGTAGTTTTTGGACTGGTAATAGCTAATGCTGCT 
GTATTTACAATGTGGCGAGTTTCAGACAGGAGCTGGATGGTGAAAAATTTTGTGGTCTAA 
CATATTACTGCTAAGCTCTTGTCTTGGATTTGGTTGATCTTTGATATGTTTGAACCTTGA 
TTTTTGAGCAGCTATCAACGTACAGTTTTACGAGTGGATATATACACACGCTGATAACTT 
CGGGTTTTAGTCATATTGGTACCAGTCAAATCATCTTGAACATGATTGGAATCTCCTACT 
TCGGCTCCAGAGTATGAATTATCTTTCCTTGTCGTTTCTCCATAGAAAAATGCTTCTCAA 
TATTCGATTTTTGTAAATGAGATGAGAAATGCTTTCATTTGTTTCCTATGAAGTGGCTAC 
TTTCACTTTATTGGAGACACATTCACTAATAGCTAATATACTTAAGAAAGAAAAGTCCTT 
AGCTAGATACTATTAAGTCGGTTCTTTTTGGGAAATTTGCTACCTGCTGATCTATCCATT 
GAATTAACTATTTATCTTCATCTCTAAAATCCATTGTTGCTATTAGATTGCAAGAACCTT 
GGGACCGCTCTACCTTTTGAAGCTGTACTTTGCTGGAGCACTTGGTGGCTCTGTTTGCTT 
CCTGAGTTATCACGCCCTCTTGGCTACACTCAAGGTATATAGCAAGTTTCCTTAAATCAT 
GCTATAAAAAAGCTTTTAAAAGAATAACTCTCTTTTATTGATGATTAGAAAACTCTCTCA 
AAAACTAAACACCAAAGCTCTTTTAATGTTTTCTCTTATGTTTGCCACAACTCAACAAAC 
ATATATATAGGGTTAGCACACACTGCAATTTCCTATTCCTAGTCCTACATGAATTAGGAA 
ACTCCTTTTTTCCTTTTCTTCTTGAACTAGATAATTACTCTTTTTGACATAACTTAAACC 
GACTTGTTTAAGTTGTCTTGGGCTTCAAGACTTGAAGCTTACCCAACAAGGTCCACAGAT 
TGATAGTTTCTTTCATTGATCATGTTGCATTTCATTTGCAGTACATGTTATATGTGCAAA 
ACGAAGTTTTGAGGTTTAGACTAAATGATTTTACCTCTGGAGATTAAACGAAATGCTTTA 
CCTTTGATATTGGCTGGTCAATCATACAGGGTGAAGGAGTTGTCATTAAGGATCATCAAT 
CAACAGCCCCCATTTCGCAGCTGCTGGTATGTTCGTACTCACCTAACTAAATGATTTGTC 
TGAGCATGAACAGACTTGGTTGATAGATATAGAGAGTGAGACGAAAATGCACATGCACTT 
GTTTTTGGATCACCAAATGCCGATGAGGTAAACTCTTTCCTATTTGTGTACAGGGTGCTG 
ATGGATCTATGTTTGCCATCGCGCTTCTCGATATGTTCATCTACCCAAAAGTTACAACAT 
ACTTTGCGTTGATGCTCCGAGTGCATGTAATGTTTGTAAGAACTCTCTACATACTCTTCT 
TATTGAGTGAAAGTCATGTGAACCAACTAAAAAACTCGCACTCTTTAATCTTCAGCGAAT 
CATAAACTTAGGAGTCGAGATTTTAAATATTCCAGAGGTGTGTATAATCTGAAAACCTAA 
TCCGCATTTGCTTTGAAACACTATATATGACTATCTCTTAACCATAACTTTTTTGGGAAT 
GCAGGGGGGACCGAACCATATCGCATCGAGTTCGGGTCAGTTGGGAGGAGTTGTGGTTGC 
TGCCATGGCGTGGGCACGGATAAAGAAAGGTCGATTTTGATACTGAACTGATTTGCTGCA 
AAAGATTCAGTGGACCTGATCATCATCATGCAAGTTATGTTAAACTTCGTGACAAATGAG 
ATTTTTGTCAACATTTTTTCGTTTGGGTTGGACTTTTGTTTTGTAAACTGAAACCAAGAG 
ATCAATTTGATGGGTTTGGACCGTTTGGTTACATTGGAAACTATTTTTTTTTC
>AT1G74140.1 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN integral to membrane chloroplast CONTAINS InterPro DOMAIN/s Peptidase S54 rhomboid (InterProIPR002610) BEST Arabidopsis thaliana protein match is rhomboid family protein (TAIRAT1G741301) Has 428 Blast hits to 428 proteins in 155 species Archae - 4 Bacteria - 272 Metazoa - 0 Fungi - 4 Plants - 36 Viruses - 0 Other Eukaryotes - 112 (source NCBI BLink) 
ATTCCGAAGAAAAAGGAGAAAAAATGCATGCGATTTTCTCGAGCTTCTCTCGCAAGGTCG 
TCGTCAACGTTGGTGCTTCTTCGCAATCGCAGCTAACCAAAATGGTGAAGAAGAAGCCAA 
ACCAGAGTCGCCATCTTCTTCCTTCTCGCCTCTCATCTCCTTCTTCAGTACCCCACTTTG 
TTCCTTCTGCTGTATCGCGATCGGCGAAAGTTCATGGGTTCTTCGCTAGCAAGTTGGGAA 
ACACCAACCTAAAGCTGAAATTTGGAAATGTTATGGAATCTAGAGCTGGGTTCTTCAGTT 
CTGAGCTTCCTAGTCATGGATTCGAGTCTGGTGGTTTTACTGGGTTTCAAAAGCGGGGAT 
GGTATGGAATCGTTTCTTTTCGTCTCTCTTTTTGTGATTTTGCTGGGTTTCAAAGTAAAG 
TTCGAAGCTTTCCTAAGAAAAGTTCTGAAATTTCTGAGTTCCTTTTCTCTATGATCCTCT 
GTTTCTCGTTAGCTAAGTAAAAGTCTACATTTTTCTGAGATAAACTTGTCTTCCGGATGA 
AATTTCTGGTTTTGATTTAGGGTTTAAACAGAGTTTCTGTGGTAAATTATCGACCTTTTT 
AAGCTTTTCTGGTTAAGTACACGTTTGGCTCGTCCAAGACATACGACATGTATGAGAATT 
TCAGAGAAGAAGATGAGAAATTAGTAGGCTTTACTTGCTATTTTGCCATTTCTGTGTAGT 
TGTTGATTTGAGTAGCGCAACGTTCATTTGATGATTTTCTCTTTATAGATAGACTAGCAG 
CTCGACACTGTACACACCCTTCTCATTAAAACTTTGTTCTCCAGATATAGAAATAACATC 
AGAATCTTAAAATTCTTCACCAGCTCTAGTTGATTTTGGTTTGTGTTTTCCTCTGGCAGG 
AAGTCTTGGATCAATGGAGCTAATGGTGTAGTTTTTGGACTGGTAATAGCTAATGCTGCT 
GTATTTACAATGTGGCGAGTTTCAGACAGGAGCTGGATGGTGAAAAATTTTGTGGTCTAA 
CATATTACTGCTAAGCTCTTGTCTTGGATTTGGTTGATCTTTGATATGTTTGAACCTTGA 
TTTTTGAGCAGCTATCAACGTACAGTTTTACGAGTGGATATATACACACGCTGATAACTT 
CGGGTTTTAGTCATATTGGTACCAGTCAAATCATCTTGAACATGATTGGAATCTCCTACT 
TCGGCTCCAGAGTATGAATTATCTTTCCTTGTCGTTTCTCCATAGAAAAATGCTTCTCAA 
TATTCGATTTTTGTAAATGAGATGAGAAATGCTTTCATTTGTTTCCTATGAAGTGGCTAC 
TTTCACTTTATTGGAGACACATTCACTAATAGCTAATATACTTAAGAAAGAAAAGTCCTT 
AGCTAGATACTATTAAGTCGGTTCTTTTTGGGAAATTTGCTACCTGCTGATCTATCCATT 
GAATTAACTATTTATCTTCATCTCTAAAATCCATTGTTGCTATTAGATTGCAAGAACCTT 
GGGACCGCTCTACCTTTTGAAGCTGTACTTTGCTGGAGCACTTGGTGGCTCTGTTTGCTT 
CCTGAGTTATCACGCCCTCTTGGCTACACTCAAGGTATATAGCAAGTTTCCTTAAATCAT 
GCTATAAAAAAGCTTTTAAAAGAATAACTCTCTTTTATTGATGATTAGAAAACTCTCTCA 
AAAACTAAACACCAAAGCTCTTTTAATGTTTTCTCTTATGTTTGCCACAACTCAACAAAC 
ATATATATAGGGTTAGCACACACTGCAATTTCCTATTCCTAGTCCTACATGAATTAGGAA 
ACTCCTTTTTTCCTTTTCTTCTTGAACTAGATAATTACTCTTTTTGACATAACTTAAACC 
GACTTGTTTAAGTTGTCTTGGGCTTCAAGACTTGAAGCTTACCCAACAAGGTCCACAGAT 
TGATAGTTTCTTTCATTGATCATGTTGCATTTCATTTGCAGTACATGTTATATGTGCAAA 
ACGAAGTTTTGAGGTTTAGACTAAATGATTTTACCTCTGGAGATTAAACGAAATGCTTTA 
CCTTTGATATTGGCTGGTCAATCATACAGGGTGAAGGAGTTGTCATTAAGGATCATCAAT 
CAACAGCCCCCATTTCGCAGCTGCTGGTATGTTCGTACTCACCTAACTAAATGATTTGTC 
TGAGCATGAACAGACTTGGTTGATAGATATAGAGAGTGAGACGAAAATGCACATGCACTT 
GTTTTTGGATCACCAAATGCCGATGAGGTAAACTCTTTCCTATTTGTGTACAGGGTGCTG 
ATGGATCTATGTTTGCCATCGCGCTTCTCGATATGTTCATCTACCCAAAAGTTACAACAT 
ACTTTGCGTTGATGCTCCGAGTGCATGTAATGTTTGTAAGAACTCTCTACATACTCTTCT 
TATTGAGTGAAAGTCATGTGAACCAACTAAAAAACTCGCACTCTTTAATCTTCAGCGAAT 
CATAAACTTAGGAGTCGAGATTTTAAATATTCCAGAGGTGTGTATAATCTGAAAACCTAA 
TCCGCATTTGCTTTGAAACACTATATATGACTATCTCTTAACCATAACTTTTTTGGGAAT 
GCAGGGGGGACCGAACCATATCGCATCGAGTTCGGGTCAGTTGGGAGGAGTTGTGGTTGC 
TGCCATGGCGTGGGCACGGATAAAGAAAGGTCGATTTTGATACTGAACTGATTTGCTGCA 
AAAGATTCAGTGGACCTGATCATCATCATGCAAGTTATGTTAAACTTCGTGACAAATGAG 
ATTTTTGTCAACATTTTTTCGTTTGGGTTGGACTTTTGTTTTGTAAACTGAAACCAAGAG 
ATCAATTTGATGGGTTTGGACCGTTTGGTTACATTGGAAACTATTTTTTTTTC
>AT1G74140.2 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN integral to membrane chloroplast CONTAINS InterPro DOMAIN/s Peptidase S54 rhomboid (InterProIPR002610) BEST Arabidopsis thaliana protein match is rhomboid family protein (TAIRAT1G741301) Has 484 Blast hits to 484 proteins in 189 species Archae - 2 Bacteria - 267 Metazoa - 30 Fungi - 47 Plants - 36 Viruses - 0 Other Eukaryotes - 102 (source NCBI BLink) 
ATTCCGAAGAAAAAGGAGAAAAAATGCATGCGATTTTCTCGAGCTTCTCTCGCAAGGTCG 
TCGTCAACGTTGGTGCTTCTTCGCAATCGCAGCTAACCAAAATGGTGAAGAAGAAGCCAA 
ACCAGAGTCGCCATCTTCTTCCTTCTCGCCTCTCATCTCCTTCTTCAGTACCCCACTTTG 
TTCCTTCTGCTGTATCGCGATCGGCGAAAGTTCATGGGTTCTTCGCTAGCAAGTTGGGAA 
ACACCAACCTAAAGCTGAAATTTGGAAATGTTATGGAATCTAGAGCTGGGTTCTTCAGTT 
CTGAGCTTCCTAGTCATGGATTCGAGTCTGGTGGTTTTACTGGGTTTCAAAAGCGGGGAT 
GGTATGGAATCGTTTCTTTTCGTCTCTCTTTTTGTGATTTTGCTGGGTTTCAAAGTAAAG 
TTCGAAGCTTTCCTAAGAAAAGTTCTGAAATTTCTGAGTTCCTTTTCTCTATGATCCTCT 
GTTTCTCGTTAGCTAAGTAAAAGTCTACATTTTTCTGAGATAAACTTGTCTTCCGGATGA 
AATTTCTGGTTTTGATTTAGGGTTTAAACAGAGTTTCTGTGGTAAATTATCGACCTTTTT 
AAGCTTTTCTGGTTAAGTACACGTTTGGCTCGTCCAAGACATACGACATGTATGAGAATT 
TCAGAGAAGAAGATGAGAAATTAGTAGGCTTTACTTGCTATTTTGCCATTTCTGTGTAGT 
TGTTGATTTGAGTAGCGCAACGTTCATTTGATGATTTTCTCTTTATAGATAGACTAGCAG 
CTCGACACTGTACACACCCTTCTCATTAAAACTTTGTTCTCCAGATATAGAAATAACATC 
AGAATCTTAAAATTCTTCACCAGCTCTAGTTGATTTTGGTTTGTGTTTTCCTCTGGCAGG 
AAGTCTTGGATCAATGGAGCTAATGGTGTAGTTTTTGGACTGGTAATAGCTAATGCTGCT 
GTATTTACAATGTGGCGAGTTTCAGACAGGAGCTGGATGGTGAAAAATTTTGTGGTCTAA 
CATATTACTGCTAAGCTCTTGTCTTGGATTTGGTTGATCTTTGATATGTTTGAACCTTGA 
TTTTTGAGCAGCTATCAACGTACAGTTTTACGAGTGGATATATACACACGCTGATAACTT 
CGGGTTTTAGTCATATTGGTACCAGTCAAATCATCTTGAACATGATTGGAATCTCCTACT 
TCGGCTCCAGAGTATGAATTATCTTTCCTTGTCGTTTCTCCATAGAAAAATGCTTCTCAA 
TATTCGATTTTTGTAAATGAGATGAGAAATGCTTTCATTTGTTTCCTATGAAGTGGCTAC 
TTTCACTTTATTGGAGACACATTCACTAATAGCTAATATACTTAAGAAAGAAAAGTCCTT 
AGCTAGATACTATTAAGTCGGTTCTTTTTGGGAAATTTGCTACCTGCTGATCTATCCATT 
GAATTAACTATTTATCTTCATCTCTAAAATCCATTGTTGCTATTAGATTGCAAGAACCTT 
GGGACCGCTCTACCTTTTGAAGCTGTACTTTGCTGGAGCACTTGGTGGCTCTGTTTGCTT 
CCTGAGTTATCACGCCCTCTTGGCTACACTCAAGGTATATAGCAAGTTTCCTTAAATCAT 
GCTATAAAAAAGCTTTTAAAAGAATAACTCTCTTTTATTGATGATTAGAAAACTCTCTCA 
AAAACTAAACACCAAAGCTCTTTTAATGTTTTCTCTTATGTTTGCCACAACTCAACAAAC 
ATATATATAGGGTTAGCACACACTGCAATTTCCTATTCCTAGTCCTACATGAATTAGGAA 
ACTCCTTTTTTCCTTTTCTTCTTGAACTAGATAATTACTCTTTTTGACATAACTTAAACC 
GACTTGTTTAAGTTGTCTTGGGCTTCAAGACTTGAAGCTTACCCAACAAGGTCCACAGAT 
TGATAGTTTCTTTCATTGATCATGTTGCATTTCATTTGCAGTACATGTTATATGTGCAAA 
ACGAAGTTTTGAGGTTTAGACTAAATGATTTTACCTCTGGAGATTAAACGAAATGCTTTA 
CCTTTGATATTGGCTGGTCAATCATACAGGGTGAAGGAGTTGTCATTAAGGATCATCAAT 
CAACAGCCCCCATTTCGCAGCTGCTGGTATGTTCGTACTCACCTAACTAAATGATTTGTC 
TGAGCATGAACAGACTTGGTTGATAGATATAGAGAGTGAGACGAAAATGCACATGCACTT 
GTTTTTGGATCACCAAATGCCGATGAGGTAAACTCTTTCCTATTTGTGTACAGGGTGCTG 
ATGGATCTATGTTTGCCATCGCGCTTCTCGATATGTTCATCTACCCAAAAGTTACAACAT 
ACTTTGCGTTGATGCTCCGAGTGCATGTAATGTTTGTAAGAACTCTCTACATACTCTTCT 
TATTGAGTGAAAGTCATGTGAACCAACTAAAAAACTCGCACTCTTTAATCTTCAGCGAAT 
CATAAACTTAGGAGTCGAGATTTTAAATATTCCAGAGGTGTGTATAATCTGAAAACCTAA 
TCCGCATTTGCTTTGAAACACTATATATGACTATCTCTTAACCATAACTTTTTTGGGAAT 
GCAGGGGGGACCGAACCATATCGCATCGAGTTCGGGTCAGTTGGGAGGAGTTGTGGTTGC 
TGCCATGGCGTGGGCACGGATAAAGAAAGGTCGATTTTGATACTGAACTGATTTGCTGCA 
AAAGATTCAGTGGACCTGATCATCATCATGCAAGTTATGTTAAACTTCGTGACAAATGAG 
ATTTTTGTCAACATTTTTTCGTTTGGGTTGGACTTTTGTTTTGTAAACTGAAACCAAGAG 
ATCAATTTGATGGGTTTGGACCGTTTGGTTACATTGGAAACTATTTTTTTTTC
>AT1G74140.2 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN integral to membrane chloroplast CONTAINS InterPro DOMAIN/s Peptidase S54 rhomboid (InterProIPR002610) BEST Arabidopsis thaliana protein match is rhomboid family protein (TAIRAT1G741301) Has 388 Blast hits to 388 proteins in 139 species Archae - 2 Bacteria - 244 Metazoa - 0 Fungi - 2 Plants - 36 Viruses - 0 Other Eukaryotes - 104 (source NCBI BLink) 
ATTCCGAAGAAAAAGGAGAAAAAATGCATGCGATTTTCTCGAGCTTCTCTCGCAAGGTCG 
TCGTCAACGTTGGTGCTTCTTCGCAATCGCAGCTAACCAAAATGGTGAAGAAGAAGCCAA 
ACCAGAGTCGCCATCTTCTTCCTTCTCGCCTCTCATCTCCTTCTTCAGTACCCCACTTTG 
TTCCTTCTGCTGTATCGCGATCGGCGAAAGTTCATGGGTTCTTCGCTAGCAAGTTGGGAA 
ACACCAACCTAAAGCTGAAATTTGGAAATGTTATGGAATCTAGAGCTGGGTTCTTCAGTT 
CTGAGCTTCCTAGTCATGGATTCGAGTCTGGTGGTTTTACTGGGTTTCAAAAGCGGGGAT 
GGTATGGAATCGTTTCTTTTCGTCTCTCTTTTTGTGATTTTGCTGGGTTTCAAAGTAAAG 
TTCGAAGCTTTCCTAAGAAAAGTTCTGAAATTTCTGAGTTCCTTTTCTCTATGATCCTCT 
GTTTCTCGTTAGCTAAGTAAAAGTCTACATTTTTCTGAGATAAACTTGTCTTCCGGATGA 
AATTTCTGGTTTTGATTTAGGGTTTAAACAGAGTTTCTGTGGTAAATTATCGACCTTTTT 
AAGCTTTTCTGGTTAAGTACACGTTTGGCTCGTCCAAGACATACGACATGTATGAGAATT 
TCAGAGAAGAAGATGAGAAATTAGTAGGCTTTACTTGCTATTTTGCCATTTCTGTGTAGT 
TGTTGATTTGAGTAGCGCAACGTTCATTTGATGATTTTCTCTTTATAGATAGACTAGCAG 
CTCGACACTGTACACACCCTTCTCATTAAAACTTTGTTCTCCAGATATAGAAATAACATC 
AGAATCTTAAAATTCTTCACCAGCTCTAGTTGATTTTGGTTTGTGTTTTCCTCTGGCAGG 
AAGTCTTGGATCAATGGAGCTAATGGTGTAGTTTTTGGACTGGTAATAGCTAATGCTGCT 
GTATTTACAATGTGGCGAGTTTCAGACAGGAGCTGGATGGTGAAAAATTTTGTGGTCTAA 
CATATTACTGCTAAGCTCTTGTCTTGGATTTGGTTGATCTTTGATATGTTTGAACCTTGA 
TTTTTGAGCAGCTATCAACGTACAGTTTTACGAGTGGATATATACACACGCTGATAACTT 
CGGGTTTTAGTCATATTGGTACCAGTCAAATCATCTTGAACATGATTGGAATCTCCTACT 
TCGGCTCCAGAGTATGAATTATCTTTCCTTGTCGTTTCTCCATAGAAAAATGCTTCTCAA 
TATTCGATTTTTGTAAATGAGATGAGAAATGCTTTCATTTGTTTCCTATGAAGTGGCTAC 
TTTCACTTTATTGGAGACACATTCACTAATAGCTAATATACTTAAGAAAGAAAAGTCCTT 
AGCTAGATACTATTAAGTCGGTTCTTTTTGGGAAATTTGCTACCTGCTGATCTATCCATT 
GAATTAACTATTTATCTTCATCTCTAAAATCCATTGTTGCTATTAGATTGCAAGAACCTT 
GGGACCGCTCTACCTTTTGAAGCTGTACTTTGCTGGAGCACTTGGTGGCTCTGTTTGCTT 
CCTGAGTTATCACGCCCTCTTGGCTACACTCAAGGTATATAGCAAGTTTCCTTAAATCAT 
GCTATAAAAAAGCTTTTAAAAGAATAACTCTCTTTTATTGATGATTAGAAAACTCTCTCA 
AAAACTAAACACCAAAGCTCTTTTAATGTTTTCTCTTATGTTTGCCACAACTCAACAAAC 
ATATATATAGGGTTAGCACACACTGCAATTTCCTATTCCTAGTCCTACATGAATTAGGAA 
ACTCCTTTTTTCCTTTTCTTCTTGAACTAGATAATTACTCTTTTTGACATAACTTAAACC 
GACTTGTTTAAGTTGTCTTGGGCTTCAAGACTTGAAGCTTACCCAACAAGGTCCACAGAT 
TGATAGTTTCTTTCATTGATCATGTTGCATTTCATTTGCAGTACATGTTATATGTGCAAA 
ACGAAGTTTTGAGGTTTAGACTAAATGATTTTACCTCTGGAGATTAAACGAAATGCTTTA 
CCTTTGATATTGGCTGGTCAATCATACAGGGTGAAGGAGTTGTCATTAAGGATCATCAAT 
CAACAGCCCCCATTTCGCAGCTGCTGGTATGTTCGTACTCACCTAACTAAATGATTTGTC 
TGAGCATGAACAGACTTGGTTGATAGATATAGAGAGTGAGACGAAAATGCACATGCACTT 
GTTTTTGGATCACCAAATGCCGATGAGGTAAACTCTTTCCTATTTGTGTACAGGGTGCTG 
ATGGATCTATGTTTGCCATCGCGCTTCTCGATATGTTCATCTACCCAAAAGTTACAACAT 
ACTTTGCGTTGATGCTCCGAGTGCATGTAATGTTTGTAAGAACTCTCTACATACTCTTCT 
TATTGAGTGAAAGTCATGTGAACCAACTAAAAAACTCGCACTCTTTAATCTTCAGCGAAT 
CATAAACTTAGGAGTCGAGATTTTAAATATTCCAGAGGTGTGTATAATCTGAAAACCTAA 
TCCGCATTTGCTTTGAAACACTATATATGACTATCTCTTAACCATAACTTTTTTGGGAAT 
GCAGGGGGGACCGAACCATATCGCATCGAGTTCGGGTCAGTTGGGAGGAGTTGTGGTTGC 
TGCCATGGCGTGGGCACGGATAAAGAAAGGTCGATTTTGATACTGAACTGATTTGCTGCA 
AAAGATTCAGTGGACCTGATCATCATCATGCAAGTTATGTTAAACTTCGTGACAAATGAG 
ATTTTTGTCAACATTTTTTCGTTTGGGTTGGACTTTTGTTTTGTAAACTGAAACCAAGAG 
ATCAATTTGATGGGTTTGGACCGTTTGGTTACATTGGAAACTATTTTTTTTTC
>AT1G74140.2 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN integral to membrane chloroplast CONTAINS InterPro DOMAIN/s Peptidase S54 rhomboid (InterProIPR002610) BEST Arabidopsis thaliana protein match is rhomboid family protein (TAIRAT1G741301) Has 428 Blast hits to 428 proteins in 155 species Archae - 4 Bacteria - 272 Metazoa - 0 Fungi - 4 Plants - 36 Viruses - 0 Other Eukaryotes - 112 (source NCBI BLink) 
ATTCCGAAGAAAAAGGAGAAAAAATGCATGCGATTTTCTCGAGCTTCTCTCGCAAGGTCG 
TCGTCAACGTTGGTGCTTCTTCGCAATCGCAGCTAACCAAAATGGTGAAGAAGAAGCCAA 
ACCAGAGTCGCCATCTTCTTCCTTCTCGCCTCTCATCTCCTTCTTCAGTACCCCACTTTG 
TTCCTTCTGCTGTATCGCGATCGGCGAAAGTTCATGGGTTCTTCGCTAGCAAGTTGGGAA 
ACACCAACCTAAAGCTGAAATTTGGAAATGTTATGGAATCTAGAGCTGGGTTCTTCAGTT 
CTGAGCTTCCTAGTCATGGATTCGAGTCTGGTGGTTTTACTGGGTTTCAAAAGCGGGGAT 
GGTATGGAATCGTTTCTTTTCGTCTCTCTTTTTGTGATTTTGCTGGGTTTCAAAGTAAAG 
TTCGAAGCTTTCCTAAGAAAAGTTCTGAAATTTCTGAGTTCCTTTTCTCTATGATCCTCT 
GTTTCTCGTTAGCTAAGTAAAAGTCTACATTTTTCTGAGATAAACTTGTCTTCCGGATGA 
AATTTCTGGTTTTGATTTAGGGTTTAAACAGAGTTTCTGTGGTAAATTATCGACCTTTTT 
AAGCTTTTCTGGTTAAGTACACGTTTGGCTCGTCCAAGACATACGACATGTATGAGAATT 
TCAGAGAAGAAGATGAGAAATTAGTAGGCTTTACTTGCTATTTTGCCATTTCTGTGTAGT 
TGTTGATTTGAGTAGCGCAACGTTCATTTGATGATTTTCTCTTTATAGATAGACTAGCAG 
CTCGACACTGTACACACCCTTCTCATTAAAACTTTGTTCTCCAGATATAGAAATAACATC 
AGAATCTTAAAATTCTTCACCAGCTCTAGTTGATTTTGGTTTGTGTTTTCCTCTGGCAGG 
AAGTCTTGGATCAATGGAGCTAATGGTGTAGTTTTTGGACTGGTAATAGCTAATGCTGCT 
GTATTTACAATGTGGCGAGTTTCAGACAGGAGCTGGATGGTGAAAAATTTTGTGGTCTAA 
CATATTACTGCTAAGCTCTTGTCTTGGATTTGGTTGATCTTTGATATGTTTGAACCTTGA 
TTTTTGAGCAGCTATCAACGTACAGTTTTACGAGTGGATATATACACACGCTGATAACTT 
CGGGTTTTAGTCATATTGGTACCAGTCAAATCATCTTGAACATGATTGGAATCTCCTACT 
TCGGCTCCAGAGTATGAATTATCTTTCCTTGTCGTTTCTCCATAGAAAAATGCTTCTCAA 
TATTCGATTTTTGTAAATGAGATGAGAAATGCTTTCATTTGTTTCCTATGAAGTGGCTAC 
TTTCACTTTATTGGAGACACATTCACTAATAGCTAATATACTTAAGAAAGAAAAGTCCTT 
AGCTAGATACTATTAAGTCGGTTCTTTTTGGGAAATTTGCTACCTGCTGATCTATCCATT 
GAATTAACTATTTATCTTCATCTCTAAAATCCATTGTTGCTATTAGATTGCAAGAACCTT 
GGGACCGCTCTACCTTTTGAAGCTGTACTTTGCTGGAGCACTTGGTGGCTCTGTTTGCTT 
CCTGAGTTATCACGCCCTCTTGGCTACACTCAAGGTATATAGCAAGTTTCCTTAAATCAT 
GCTATAAAAAAGCTTTTAAAAGAATAACTCTCTTTTATTGATGATTAGAAAACTCTCTCA 
AAAACTAAACACCAAAGCTCTTTTAATGTTTTCTCTTATGTTTGCCACAACTCAACAAAC 
ATATATATAGGGTTAGCACACACTGCAATTTCCTATTCCTAGTCCTACATGAATTAGGAA 
ACTCCTTTTTTCCTTTTCTTCTTGAACTAGATAATTACTCTTTTTGACATAACTTAAACC 
GACTTGTTTAAGTTGTCTTGGGCTTCAAGACTTGAAGCTTACCCAACAAGGTCCACAGAT 
TGATAGTTTCTTTCATTGATCATGTTGCATTTCATTTGCAGTACATGTTATATGTGCAAA 
ACGAAGTTTTGAGGTTTAGACTAAATGATTTTACCTCTGGAGATTAAACGAAATGCTTTA 
CCTTTGATATTGGCTGGTCAATCATACAGGGTGAAGGAGTTGTCATTAAGGATCATCAAT 
CAACAGCCCCCATTTCGCAGCTGCTGGTATGTTCGTACTCACCTAACTAAATGATTTGTC 
TGAGCATGAACAGACTTGGTTGATAGATATAGAGAGTGAGACGAAAATGCACATGCACTT 
GTTTTTGGATCACCAAATGCCGATGAGGTAAACTCTTTCCTATTTGTGTACAGGGTGCTG 
ATGGATCTATGTTTGCCATCGCGCTTCTCGATATGTTCATCTACCCAAAAGTTACAACAT 
ACTTTGCGTTGATGCTCCGAGTGCATGTAATGTTTGTAAGAACTCTCTACATACTCTTCT 
TATTGAGTGAAAGTCATGTGAACCAACTAAAAAACTCGCACTCTTTAATCTTCAGCGAAT 
CATAAACTTAGGAGTCGAGATTTTAAATATTCCAGAGGTGTGTATAATCTGAAAACCTAA 
TCCGCATTTGCTTTGAAACACTATATATGACTATCTCTTAACCATAACTTTTTTGGGAAT 
GCAGGGGGGACCGAACCATATCGCATCGAGTTCGGGTCAGTTGGGAGGAGTTGTGGTTGC 
TGCCATGGCGTGGGCACGGATAAAGAAAGGTCGATTTTGATACTGAACTGATTTGCTGCA 
AAAGATTCAGTGGACCTGATCATCATCATGCAAGTTATGTTAAACTTCGTGACAAATGAG 
ATTTTTGTCAACATTTTTTCGTTTGGGTTGGACTTTTGTTTTGTAAACTGAAACCAAGAG 
ATCAATTTGATGGGTTTGGACCGTTTGGTTACATTGGAAACTATTTTTTTTTC
>AT1G74140.3 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN integral to membrane chloroplast CONTAINS InterPro DOMAIN/s Peptidase S54 rhomboid (InterProIPR002610) BEST Arabidopsis thaliana protein match is rhomboid family protein (TAIRAT1G741301) Has 484 Blast hits to 484 proteins in 189 species Archae - 2 Bacteria - 267 Metazoa - 30 Fungi - 47 Plants - 36 Viruses - 0 Other Eukaryotes - 102 (source NCBI BLink) 
ATTCCGAAGAAAAAGGAGAAAAAATGCATGCGATTTTCTCGAGCTTCTCTCGCAAGGTCG 
TCGTCAACGTTGGTGCTTCTTCGCAATCGCAGCTAACCAAAATGGTGAAGAAGAAGCCAA 
ACCAGAGTCGCCATCTTCTTCCTTCTCGCCTCTCATCTCCTTCTTCAGTACCCCACTTTG 
TTCCTTCTGCTGTATCGCGATCGGCGAAAGTTCATGGGTTCTTCGCTAGCAAGTTGGGAA 
ACACCAACCTAAAGCTGAAATTTGGAAATGTTATGGAATCTAGAGCTGGGTTCTTCAGTT 
CTGAGCTTCCTAGTCATGGATTCGAGTCTGGTGGTTTTACTGGGTTTCAAAAGCGGGGAT 
GGTATGGAATCGTTTCTTTTCGTCTCTCTTTTTGTGATTTTGCTGGGTTTCAAAGTAAAG 
TTCGAAGCTTTCCTAAGAAAAGTTCTGAAATTTCTGAGTTCCTTTTCTCTATGATCCTCT 
GTTTCTCGTTAGCTAAGTAAAAGTCTACATTTTTCTGAGATAAACTTGTCTTCCGGATGA 
AATTTCTGGTTTTGATTTAGGGTTTAAACAGAGTTTCTGTGGTAAATTATCGACCTTTTT 
AAGCTTTTCTGGTTAAGTACACGTTTGGCTCGTCCAAGACATACGACATGTATGAGAATT 
TCAGAGAAGAAGATGAGAAATTAGTAGGCTTTACTTGCTATTTTGCCATTTCTGTGTAGT 
TGTTGATTTGAGTAGCGCAACGTTCATTTGATGATTTTCTCTTTATAGATAGACTAGCAG 
CTCGACACTGTACACACCCTTCTCATTAAAACTTTGTTCTCCAGATATAGAAATAACATC 
AGAATCTTAAAATTCTTCACCAGCTCTAGTTGATTTTGGTTTGTGTTTTCCTCTGGCAGG 
AAGTCTTGGATCAATGGAGCTAATGGTGTAGTTTTTGGACTGGTAATAGCTAATGCTGCT 
GTATTTACAATGTGGCGAGTTTCAGACAGGAGCTGGATGGTGAAAAATTTTGTGGTCTAA 
CATATTACTGCTAAGCTCTTGTCTTGGATTTGGTTGATCTTTGATATGTTTGAACCTTGA 
TTTTTGAGCAGCTATCAACGTACAGTTTTACGAGTGGATATATACACACGCTGATAACTT 
CGGGTTTTAGTCATATTGGTACCAGTCAAATCATCTTGAACATGATTGGAATCTCCTACT 
TCGGCTCCAGAGTATGAATTATCTTTCCTTGTCGTTTCTCCATAGAAAAATGCTTCTCAA 
TATTCGATTTTTGTAAATGAGATGAGAAATGCTTTCATTTGTTTCCTATGAAGTGGCTAC 
TTTCACTTTATTGGAGACACATTCACTAATAGCTAATATACTTAAGAAAGAAAAGTCCTT 
AGCTAGATACTATTAAGTCGGTTCTTTTTGGGAAATTTGCTACCTGCTGATCTATCCATT 
GAATTAACTATTTATCTTCATCTCTAAAATCCATTGTTGCTATTAGATTGCAAGAACCTT 
GGGACCGCTCTACCTTTTGAAGCTGTACTTTGCTGGAGCACTTGGTGGCTCTGTTTGCTT 
CCTGAGTTATCACGCCCTCTTGGCTACACTCAAGGTATATAGCAAGTTTCCTTAAATCAT 
GCTATAAAAAAGCTTTTAAAAGAATAACTCTCTTTTATTGATGATTAGAAAACTCTCTCA 
AAAACTAAACACCAAAGCTCTTTTAATGTTTTCTCTTATGTTTGCCACAACTCAACAAAC 
ATATATATAGGGTTAGCACACACTGCAATTTCCTATTCCTAGTCCTACATGAATTAGGAA 
ACTCCTTTTTTCCTTTTCTTCTTGAACTAGATAATTACTCTTTTTGACATAACTTAAACC 
GACTTGTTTAAGTTGTCTTGGGCTTCAAGACTTGAAGCTTACCCAACAAGGTCCACAGAT 
TGATAGTTTCTTTCATTGATCATGTTGCATTTCATTTGCAGTACATGTTATATGTGCAAA 
ACGAAGTTTTGAGGTTTAGACTAAATGATTTTACCTCTGGAGATTAAACGAAATGCTTTA 
CCTTTGATATTGGCTGGTCAATCATACAGGGTGAAGGAGTTGTCATTAAGGATCATCAAT 
CAACAGCCCCCATTTCGCAGCTGCTGGTATGTTCGTACTCACCTAACTAAATGATTTGTC 
TGAGCATGAACAGACTTGGTTGATAGATATAGAGAGTGAGACGAAAATGCACATGCACTT 
GTTTTTGGATCACCAAATGCCGATGAGGTAAACTCTTTCCTATTTGTGTACAGGGTGCTG 
ATGGATCTATGTTTGCCATCGCGCTTCTCGATATGTTCATCTACCCAAAAGTTACAACAT 
ACTTTGCGTTGATGCTCCGAGTGCATGTAATGTTTGTAAGAACTCTCTACATACTCTTCT 
TATTGAGTGAAAGTCATGTGAACCAACTAAAAAACTCGCACTCTTTAATCTTCAGCGAAT 
CATAAACTTAGGAGTCGAGATTTTAAATATTCCAGAGGTGTGTATAATCTGAAAACCTAA 
TCCGCATTTGCTTTGAAACACTATATATGACTATCTCTTAACCATAACTTTTTTGGGAAT 
GCAGGGGGGACCGAACCATATCGCATCGAGTTCGGGTCAGTTGGGAGGAGTTGTGGTTGC 
TGCCATGGCGTGGGCACGGATAAAGAAAGGTCGATTTTGATACTGAACTGATTTGCTGCA 
AAAGATTCAGTGGACCTGATCATCATCATGCAAGTTATGTTAAACTTCGTGACAAATGAG 
ATTTTTGTCAACATTTTTTCGTTTGGGTTGGACTTTTGTTTTGTAAACTGAAACCAAGAG 
ATCAATTTGATGGGTTTGGACCGTTTGGTTACATTGGAAACTATTTTTTTTTCTCT
>AT1G74140.3 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN integral to membrane chloroplast CONTAINS InterPro DOMAIN/s Peptidase S54 rhomboid (InterProIPR002610) BEST Arabidopsis thaliana protein match is rhomboid family protein (TAIRAT1G741301) Has 388 Blast hits to 388 proteins in 139 species Archae - 2 Bacteria - 244 Metazoa - 0 Fungi - 2 Plants - 36 Viruses - 0 Other Eukaryotes - 104 (source NCBI BLink) 
ATTCCGAAGAAAAAGGAGAAAAAATGCATGCGATTTTCTCGAGCTTCTCTCGCAAGGTCG 
TCGTCAACGTTGGTGCTTCTTCGCAATCGCAGCTAACCAAAATGGTGAAGAAGAAGCCAA 
ACCAGAGTCGCCATCTTCTTCCTTCTCGCCTCTCATCTCCTTCTTCAGTACCCCACTTTG 
TTCCTTCTGCTGTATCGCGATCGGCGAAAGTTCATGGGTTCTTCGCTAGCAAGTTGGGAA 
ACACCAACCTAAAGCTGAAATTTGGAAATGTTATGGAATCTAGAGCTGGGTTCTTCAGTT 
CTGAGCTTCCTAGTCATGGATTCGAGTCTGGTGGTTTTACTGGGTTTCAAAAGCGGGGAT 
GGTATGGAATCGTTTCTTTTCGTCTCTCTTTTTGTGATTTTGCTGGGTTTCAAAGTAAAG 
TTCGAAGCTTTCCTAAGAAAAGTTCTGAAATTTCTGAGTTCCTTTTCTCTATGATCCTCT 
GTTTCTCGTTAGCTAAGTAAAAGTCTACATTTTTCTGAGATAAACTTGTCTTCCGGATGA 
AATTTCTGGTTTTGATTTAGGGTTTAAACAGAGTTTCTGTGGTAAATTATCGACCTTTTT 
AAGCTTTTCTGGTTAAGTACACGTTTGGCTCGTCCAAGACATACGACATGTATGAGAATT 
TCAGAGAAGAAGATGAGAAATTAGTAGGCTTTACTTGCTATTTTGCCATTTCTGTGTAGT 
TGTTGATTTGAGTAGCGCAACGTTCATTTGATGATTTTCTCTTTATAGATAGACTAGCAG 
CTCGACACTGTACACACCCTTCTCATTAAAACTTTGTTCTCCAGATATAGAAATAACATC 
AGAATCTTAAAATTCTTCACCAGCTCTAGTTGATTTTGGTTTGTGTTTTCCTCTGGCAGG 
AAGTCTTGGATCAATGGAGCTAATGGTGTAGTTTTTGGACTGGTAATAGCTAATGCTGCT 
GTATTTACAATGTGGCGAGTTTCAGACAGGAGCTGGATGGTGAAAAATTTTGTGGTCTAA 
CATATTACTGCTAAGCTCTTGTCTTGGATTTGGTTGATCTTTGATATGTTTGAACCTTGA 
TTTTTGAGCAGCTATCAACGTACAGTTTTACGAGTGGATATATACACACGCTGATAACTT 
CGGGTTTTAGTCATATTGGTACCAGTCAAATCATCTTGAACATGATTGGAATCTCCTACT 
TCGGCTCCAGAGTATGAATTATCTTTCCTTGTCGTTTCTCCATAGAAAAATGCTTCTCAA 
TATTCGATTTTTGTAAATGAGATGAGAAATGCTTTCATTTGTTTCCTATGAAGTGGCTAC 
TTTCACTTTATTGGAGACACATTCACTAATAGCTAATATACTTAAGAAAGAAAAGTCCTT 
AGCTAGATACTATTAAGTCGGTTCTTTTTGGGAAATTTGCTACCTGCTGATCTATCCATT 
GAATTAACTATTTATCTTCATCTCTAAAATCCATTGTTGCTATTAGATTGCAAGAACCTT 
GGGACCGCTCTACCTTTTGAAGCTGTACTTTGCTGGAGCACTTGGTGGCTCTGTTTGCTT 
CCTGAGTTATCACGCCCTCTTGGCTACACTCAAGGTATATAGCAAGTTTCCTTAAATCAT 
GCTATAAAAAAGCTTTTAAAAGAATAACTCTCTTTTATTGATGATTAGAAAACTCTCTCA 
AAAACTAAACACCAAAGCTCTTTTAATGTTTTCTCTTATGTTTGCCACAACTCAACAAAC 
ATATATATAGGGTTAGCACACACTGCAATTTCCTATTCCTAGTCCTACATGAATTAGGAA 
ACTCCTTTTTTCCTTTTCTTCTTGAACTAGATAATTACTCTTTTTGACATAACTTAAACC 
GACTTGTTTAAGTTGTCTTGGGCTTCAAGACTTGAAGCTTACCCAACAAGGTCCACAGAT 
TGATAGTTTCTTTCATTGATCATGTTGCATTTCATTTGCAGTACATGTTATATGTGCAAA 
ACGAAGTTTTGAGGTTTAGACTAAATGATTTTACCTCTGGAGATTAAACGAAATGCTTTA 
CCTTTGATATTGGCTGGTCAATCATACAGGGTGAAGGAGTTGTCATTAAGGATCATCAAT 
CAACAGCCCCCATTTCGCAGCTGCTGGTATGTTCGTACTCACCTAACTAAATGATTTGTC 
TGAGCATGAACAGACTTGGTTGATAGATATAGAGAGTGAGACGAAAATGCACATGCACTT 
GTTTTTGGATCACCAAATGCCGATGAGGTAAACTCTTTCCTATTTGTGTACAGGGTGCTG 
ATGGATCTATGTTTGCCATCGCGCTTCTCGATATGTTCATCTACCCAAAAGTTACAACAT 
ACTTTGCGTTGATGCTCCGAGTGCATGTAATGTTTGTAAGAACTCTCTACATACTCTTCT 
TATTGAGTGAAAGTCATGTGAACCAACTAAAAAACTCGCACTCTTTAATCTTCAGCGAAT 
CATAAACTTAGGAGTCGAGATTTTAAATATTCCAGAGGTGTGTATAATCTGAAAACCTAA 
TCCGCATTTGCTTTGAAACACTATATATGACTATCTCTTAACCATAACTTTTTTGGGAAT 
GCAGGGGGGACCGAACCATATCGCATCGAGTTCGGGTCAGTTGGGAGGAGTTGTGGTTGC 
TGCCATGGCGTGGGCACGGATAAAGAAAGGTCGATTTTGATACTGAACTGATTTGCTGCA 
AAAGATTCAGTGGACCTGATCATCATCATGCAAGTTATGTTAAACTTCGTGACAAATGAG 
ATTTTTGTCAACATTTTTTCGTTTGGGTTGGACTTTTGTTTTGTAAACTGAAACCAAGAG 
ATCAATTTGATGGGTTTGGACCGTTTGGTTACATTGGAAACTATTTTTTTTTCTCT
>AT1G74140.3 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN integral to membrane chloroplast CONTAINS InterPro DOMAIN/s Peptidase S54 rhomboid (InterProIPR002610) BEST Arabidopsis thaliana protein match is rhomboid family protein (TAIRAT1G741301) Has 428 Blast hits to 428 proteins in 155 species Archae - 4 Bacteria - 272 Metazoa - 0 Fungi - 4 Plants - 36 Viruses - 0 Other Eukaryotes - 112 (source NCBI BLink) 
ATTCCGAAGAAAAAGGAGAAAAAATGCATGCGATTTTCTCGAGCTTCTCTCGCAAGGTCG 
TCGTCAACGTTGGTGCTTCTTCGCAATCGCAGCTAACCAAAATGGTGAAGAAGAAGCCAA 
ACCAGAGTCGCCATCTTCTTCCTTCTCGCCTCTCATCTCCTTCTTCAGTACCCCACTTTG 
TTCCTTCTGCTGTATCGCGATCGGCGAAAGTTCATGGGTTCTTCGCTAGCAAGTTGGGAA 
ACACCAACCTAAAGCTGAAATTTGGAAATGTTATGGAATCTAGAGCTGGGTTCTTCAGTT 
CTGAGCTTCCTAGTCATGGATTCGAGTCTGGTGGTTTTACTGGGTTTCAAAAGCGGGGAT 
GGTATGGAATCGTTTCTTTTCGTCTCTCTTTTTGTGATTTTGCTGGGTTTCAAAGTAAAG 
TTCGAAGCTTTCCTAAGAAAAGTTCTGAAATTTCTGAGTTCCTTTTCTCTATGATCCTCT 
GTTTCTCGTTAGCTAAGTAAAAGTCTACATTTTTCTGAGATAAACTTGTCTTCCGGATGA 
AATTTCTGGTTTTGATTTAGGGTTTAAACAGAGTTTCTGTGGTAAATTATCGACCTTTTT 
AAGCTTTTCTGGTTAAGTACACGTTTGGCTCGTCCAAGACATACGACATGTATGAGAATT 
TCAGAGAAGAAGATGAGAAATTAGTAGGCTTTACTTGCTATTTTGCCATTTCTGTGTAGT 
TGTTGATTTGAGTAGCGCAACGTTCATTTGATGATTTTCTCTTTATAGATAGACTAGCAG 
CTCGACACTGTACACACCCTTCTCATTAAAACTTTGTTCTCCAGATATAGAAATAACATC 
AGAATCTTAAAATTCTTCACCAGCTCTAGTTGATTTTGGTTTGTGTTTTCCTCTGGCAGG 
AAGTCTTGGATCAATGGAGCTAATGGTGTAGTTTTTGGACTGGTAATAGCTAATGCTGCT 
GTATTTACAATGTGGCGAGTTTCAGACAGGAGCTGGATGGTGAAAAATTTTGTGGTCTAA 
CATATTACTGCTAAGCTCTTGTCTTGGATTTGGTTGATCTTTGATATGTTTGAACCTTGA 
TTTTTGAGCAGCTATCAACGTACAGTTTTACGAGTGGATATATACACACGCTGATAACTT 
CGGGTTTTAGTCATATTGGTACCAGTCAAATCATCTTGAACATGATTGGAATCTCCTACT 
TCGGCTCCAGAGTATGAATTATCTTTCCTTGTCGTTTCTCCATAGAAAAATGCTTCTCAA 
TATTCGATTTTTGTAAATGAGATGAGAAATGCTTTCATTTGTTTCCTATGAAGTGGCTAC 
TTTCACTTTATTGGAGACACATTCACTAATAGCTAATATACTTAAGAAAGAAAAGTCCTT 
AGCTAGATACTATTAAGTCGGTTCTTTTTGGGAAATTTGCTACCTGCTGATCTATCCATT 
GAATTAACTATTTATCTTCATCTCTAAAATCCATTGTTGCTATTAGATTGCAAGAACCTT 
GGGACCGCTCTACCTTTTGAAGCTGTACTTTGCTGGAGCACTTGGTGGCTCTGTTTGCTT 
CCTGAGTTATCACGCCCTCTTGGCTACACTCAAGGTATATAGCAAGTTTCCTTAAATCAT 
GCTATAAAAAAGCTTTTAAAAGAATAACTCTCTTTTATTGATGATTAGAAAACTCTCTCA 
AAAACTAAACACCAAAGCTCTTTTAATGTTTTCTCTTATGTTTGCCACAACTCAACAAAC 
ATATATATAGGGTTAGCACACACTGCAATTTCCTATTCCTAGTCCTACATGAATTAGGAA 
ACTCCTTTTTTCCTTTTCTTCTTGAACTAGATAATTACTCTTTTTGACATAACTTAAACC 
GACTTGTTTAAGTTGTCTTGGGCTTCAAGACTTGAAGCTTACCCAACAAGGTCCACAGAT 
TGATAGTTTCTTTCATTGATCATGTTGCATTTCATTTGCAGTACATGTTATATGTGCAAA 
ACGAAGTTTTGAGGTTTAGACTAAATGATTTTACCTCTGGAGATTAAACGAAATGCTTTA 
CCTTTGATATTGGCTGGTCAATCATACAGGGTGAAGGAGTTGTCATTAAGGATCATCAAT 
CAACAGCCCCCATTTCGCAGCTGCTGGTATGTTCGTACTCACCTAACTAAATGATTTGTC 
TGAGCATGAACAGACTTGGTTGATAGATATAGAGAGTGAGACGAAAATGCACATGCACTT 
GTTTTTGGATCACCAAATGCCGATGAGGTAAACTCTTTCCTATTTGTGTACAGGGTGCTG 
ATGGATCTATGTTTGCCATCGCGCTTCTCGATATGTTCATCTACCCAAAAGTTACAACAT 
ACTTTGCGTTGATGCTCCGAGTGCATGTAATGTTTGTAAGAACTCTCTACATACTCTTCT 
TATTGAGTGAAAGTCATGTGAACCAACTAAAAAACTCGCACTCTTTAATCTTCAGCGAAT 
CATAAACTTAGGAGTCGAGATTTTAAATATTCCAGAGGTGTGTATAATCTGAAAACCTAA 
TCCGCATTTGCTTTGAAACACTATATATGACTATCTCTTAACCATAACTTTTTTGGGAAT 
GCAGGGGGGACCGAACCATATCGCATCGAGTTCGGGTCAGTTGGGAGGAGTTGTGGTTGC 
TGCCATGGCGTGGGCACGGATAAAGAAAGGTCGATTTTGATACTGAACTGATTTGCTGCA 
AAAGATTCAGTGGACCTGATCATCATCATGCAAGTTATGTTAAACTTCGTGACAAATGAG 
ATTTTTGTCAACATTTTTTCGTTTGGGTTGGACTTTTGTTTTGTAAACTGAAACCAAGAG 
ATCAATTTGATGGGTTTGGACCGTTTGGTTACATTGGAAACTATTTTTTTTTCTCT
>AT1G74140.4 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN integral to membrane chloroplast CONTAINS InterPro DOMAIN/s Peptidase S54 rhomboid (InterProIPR002610) BEST Arabidopsis thaliana protein match is rhomboid family protein (TAIRAT1G741301) Has 484 Blast hits to 484 proteins in 189 species Archae - 2 Bacteria - 267 Metazoa - 30 Fungi - 47 Plants - 36 Viruses - 0 Other Eukaryotes - 102 (source NCBI BLink) 
AAAAATGCATGCGATTTTCTCGAGCTTCTCTCGCAAGGTCGTCGTCAACGTTGGTGCTTC 
TTCGCAATCGCAGCTAACCAAAATGGTGAAGAAGAAGCCAAACCAGAGTCGCCATCTTCT 
TCCTTCTCGCCTCTCATCTCCTTCTTCAGTACCCCACTTTGTTCCTTCTGCTGTATCGCG 
ATCGGCGAAAGTTCATGGGTTCTTCGCTAGCAAGTTGGGAAACACCAACCTAAAGCTGAA 
ATTTGGAAATGTTATGGAATCTAGAGCTGGGTTCTTCAGTTCTGAGCTTCCTAGTCATGG 
ATTCGAGTCTGGTGGTTTTACTGGGTTTCAAAAGCGGGGATGGTATGGAATCGTTTCTTT 
TCGTCTCTCTTTTTGTGATTTTGCTGGGTTTCAAAGTAAAGTTCGAAGCTTTCCTAAGAA 
AAGTTCTGAAATTTCTGAGTTCCTTTTCTCTATGATCCTCTGTTTCTCGTTAGCTAAGTA 
AAAGTCTACATTTTTCTGAGATAAACTTGTCTTCCGGATGAAATTTCTGGTTTTGATTTA 
GGGTTTAAACAGAGTTTCTGTGGTAAATTATCGACCTTTTTAAGCTTTTCTGGTTAAGTA 
CACGTTTGGCTCGTCCAAGACATACGACATGTATGAGAATTTCAGAGAAGAAGATGAGAA 
ATTAGTAGGCTTTACTTGCTATTTTGCCATTTCTGTGTAGTTGTTGATTTGAGTAGCGCA 
ACGTTCATTTGATGATTTTCTCTTTATAGATAGACTAGCAGCTCGACACTGTACACACCC 
TTCTCATTAAAACTTTGTTCTCCAGATATAGAAATAACATCAGAATCTTAAAATTCTTCA 
CCAGCTCTAGTTGATTTTGGTTTGTGTTTTCCTCTGGCAGGAAGTCTTGGATCAATGGAG 
CTAATGGTGTAGTTTTTGGACTGGTAATAGCTAATGCTGCTGTATTTACAATGTGGCGAG 
TTTCAGACAGGAGCTGGATGGTGAAAAATTTTGTGGTCTAACATATTACTGCTAAGCTCT 
TGTCTTGGATTTGGTTGATCTTTGATATGTTTGAACCTTGATTTTTGAGCAGCTATCAAC 
GTACAGTTTTACGAGTGGATATATACACACGCTGATAACTTCGGGTTTTAGTCATATTGG 
TACCAGTCAAATCATCTTGAACATGATTGGAATCTCCTACTTCGGCTCCAGAGTATGAAT 
TATCTTTCCTTGTCGTTTCTCCATAGAAAAATGCTTCTCAATATTCGATTTTTGTAAATG 
AGATGAGAAATGCTTTCATTTGTTTCCTATGAAGTGGCTACTTTCACTTTATTGGAGACA 
CATTCACTAATAGCTAATATACTTAAGAAAGAAAAGTCCTTAGCTAGATACTATTAAGTC 
GGTTCTTTTTGGGAAATTTGCTACCTGCTGATCTATCCATTGAATTAACTATTTATCTTC 
ATCTCTAAAATCCATTGTTGCTATTAGATTGCAAGAACCTTGGGACCGCTCTACCTTTTG 
AAGCTGTACTTTGCTGGAGCACTTGGTGGCTCTGTTTGCTTCCTGAGTTATCACGCCCTC 
TTGGCTACACTCAAGGTATATAGCAAGTTTCCTTAAATCATGCTATAAAAAAGCTTTTAA 
AAGAATAACTCTCTTTTATTGATGATTAGAAAACTCTCTCAAAAACTAAACACCAAAGCT 
CTTTTAATGTTTTCTCTTATGTTTGCCACAACTCAACAAACATATATATAGGGTTAGCAC 
ACACTGCAATTTCCTATTCCTAGTCCTACATGAATTAGGAAACTCCTTTTTTCCTTTTCT 
TCTTGAACTAGATAATTACTCTTTTTGACATAACTTAAACCGACTTGTTTAAGTTGTCTT 
GGGCTTCAAGACTTGAAGCTTACCCAACAAGGTCCACAGATTGATAGTTTCTTTCATTGA 
TCATGTTGCATTTCATTTGCAGTACATGTTATATGTGCAAAACGAAGTTTTGAGGTTTAG 
ACTAAATGATTTTACCTCTGGAGATTAAACGAAATGCTTTACCTTTGATATTGGCTGGTC 
AATCATACAGGGTGAAGGAGTTGTCATTAAGGATCATCAATCAACAGCCCCCATTTCGCA 
GCTGCTGGTATGTTCGTACTCACCTAACTAAATGATTTGTCTGAGCATGAACAGACTTGG 
TTGATAGATATAGAGAGTGAGACGAAAATGCACATGCACTTGTTTTTGGATCACCAAATG 
CCGATGAGGTAAACTCTTTCCTATTTGTGTACAGGGTGCTGATGGATCTATGTTTGCCAT 
CGCGCTTCTCGATATGTTCATCTACCCAAAAGTTACAACATACTTTGCGTTGATGCTCCG 
AGTGCATGTAATGTTTGTAAGAACTCTCTACATACTCTTCTTATTGAGTGAAAGTCATGT 
GAACCAACTAAAAAACTCGCACTCTTTAATCTTCAGCGAATCATAAACTTAGGAGTCGAG 
ATTTTAAATATTCCAGAGGTGTGTATAATCTGAAAACCTAATCCGCATTTGCTTTGAAAC 
ACTATATATGACTATCTCTTAACCATAACTTTTTTGGGAATGCAGGGGGGACCGAACCAT 
ATCGCATCGAGTTCGGGTCAGTTGGGAGGAGTTGTGGTTGCTGCCATGGCGTGGGCACGG 
ATAAAGAAAGGTCGATTTTGATACTGAACTGATTTGCTGCAAAAGATTCAGTGGACCTGA 
TCATCATCATGCAAGTTATGTTAAACTTCGTGACAAATGAGATTTTTGTCAACATTTTTT 
CGTTTGGGTTGGACTTTTGTTTTGTAAACTGAAACCAAGAGATCAATTTGATGGGTTTGG 
ACCGTTTGGTTACATTGGAAACTATTTTT
>AT1G74140.4 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN integral to membrane chloroplast CONTAINS InterPro DOMAIN/s Peptidase S54 rhomboid (InterProIPR002610) BEST Arabidopsis thaliana protein match is rhomboid family protein (TAIRAT1G741301) Has 388 Blast hits to 388 proteins in 139 species Archae - 2 Bacteria - 244 Metazoa - 0 Fungi - 2 Plants - 36 Viruses - 0 Other Eukaryotes - 104 (source NCBI BLink) 
AAAAATGCATGCGATTTTCTCGAGCTTCTCTCGCAAGGTCGTCGTCAACGTTGGTGCTTC 
TTCGCAATCGCAGCTAACCAAAATGGTGAAGAAGAAGCCAAACCAGAGTCGCCATCTTCT 
TCCTTCTCGCCTCTCATCTCCTTCTTCAGTACCCCACTTTGTTCCTTCTGCTGTATCGCG 
ATCGGCGAAAGTTCATGGGTTCTTCGCTAGCAAGTTGGGAAACACCAACCTAAAGCTGAA 
ATTTGGAAATGTTATGGAATCTAGAGCTGGGTTCTTCAGTTCTGAGCTTCCTAGTCATGG 
ATTCGAGTCTGGTGGTTTTACTGGGTTTCAAAAGCGGGGATGGTATGGAATCGTTTCTTT 
TCGTCTCTCTTTTTGTGATTTTGCTGGGTTTCAAAGTAAAGTTCGAAGCTTTCCTAAGAA 
AAGTTCTGAAATTTCTGAGTTCCTTTTCTCTATGATCCTCTGTTTCTCGTTAGCTAAGTA 
AAAGTCTACATTTTTCTGAGATAAACTTGTCTTCCGGATGAAATTTCTGGTTTTGATTTA 
GGGTTTAAACAGAGTTTCTGTGGTAAATTATCGACCTTTTTAAGCTTTTCTGGTTAAGTA 
CACGTTTGGCTCGTCCAAGACATACGACATGTATGAGAATTTCAGAGAAGAAGATGAGAA 
ATTAGTAGGCTTTACTTGCTATTTTGCCATTTCTGTGTAGTTGTTGATTTGAGTAGCGCA 
ACGTTCATTTGATGATTTTCTCTTTATAGATAGACTAGCAGCTCGACACTGTACACACCC 
TTCTCATTAAAACTTTGTTCTCCAGATATAGAAATAACATCAGAATCTTAAAATTCTTCA 
CCAGCTCTAGTTGATTTTGGTTTGTGTTTTCCTCTGGCAGGAAGTCTTGGATCAATGGAG 
CTAATGGTGTAGTTTTTGGACTGGTAATAGCTAATGCTGCTGTATTTACAATGTGGCGAG 
TTTCAGACAGGAGCTGGATGGTGAAAAATTTTGTGGTCTAACATATTACTGCTAAGCTCT 
TGTCTTGGATTTGGTTGATCTTTGATATGTTTGAACCTTGATTTTTGAGCAGCTATCAAC 
GTACAGTTTTACGAGTGGATATATACACACGCTGATAACTTCGGGTTTTAGTCATATTGG 
TACCAGTCAAATCATCTTGAACATGATTGGAATCTCCTACTTCGGCTCCAGAGTATGAAT 
TATCTTTCCTTGTCGTTTCTCCATAGAAAAATGCTTCTCAATATTCGATTTTTGTAAATG 
AGATGAGAAATGCTTTCATTTGTTTCCTATGAAGTGGCTACTTTCACTTTATTGGAGACA 
CATTCACTAATAGCTAATATACTTAAGAAAGAAAAGTCCTTAGCTAGATACTATTAAGTC 
GGTTCTTTTTGGGAAATTTGCTACCTGCTGATCTATCCATTGAATTAACTATTTATCTTC 
ATCTCTAAAATCCATTGTTGCTATTAGATTGCAAGAACCTTGGGACCGCTCTACCTTTTG 
AAGCTGTACTTTGCTGGAGCACTTGGTGGCTCTGTTTGCTTCCTGAGTTATCACGCCCTC 
TTGGCTACACTCAAGGTATATAGCAAGTTTCCTTAAATCATGCTATAAAAAAGCTTTTAA 
AAGAATAACTCTCTTTTATTGATGATTAGAAAACTCTCTCAAAAACTAAACACCAAAGCT 
CTTTTAATGTTTTCTCTTATGTTTGCCACAACTCAACAAACATATATATAGGGTTAGCAC 
ACACTGCAATTTCCTATTCCTAGTCCTACATGAATTAGGAAACTCCTTTTTTCCTTTTCT 
TCTTGAACTAGATAATTACTCTTTTTGACATAACTTAAACCGACTTGTTTAAGTTGTCTT 
GGGCTTCAAGACTTGAAGCTTACCCAACAAGGTCCACAGATTGATAGTTTCTTTCATTGA 
TCATGTTGCATTTCATTTGCAGTACATGTTATATGTGCAAAACGAAGTTTTGAGGTTTAG 
ACTAAATGATTTTACCTCTGGAGATTAAACGAAATGCTTTACCTTTGATATTGGCTGGTC 
AATCATACAGGGTGAAGGAGTTGTCATTAAGGATCATCAATCAACAGCCCCCATTTCGCA 
GCTGCTGGTATGTTCGTACTCACCTAACTAAATGATTTGTCTGAGCATGAACAGACTTGG 
TTGATAGATATAGAGAGTGAGACGAAAATGCACATGCACTTGTTTTTGGATCACCAAATG 
CCGATGAGGTAAACTCTTTCCTATTTGTGTACAGGGTGCTGATGGATCTATGTTTGCCAT 
CGCGCTTCTCGATATGTTCATCTACCCAAAAGTTACAACATACTTTGCGTTGATGCTCCG 
AGTGCATGTAATGTTTGTAAGAACTCTCTACATACTCTTCTTATTGAGTGAAAGTCATGT 
GAACCAACTAAAAAACTCGCACTCTTTAATCTTCAGCGAATCATAAACTTAGGAGTCGAG 
ATTTTAAATATTCCAGAGGTGTGTATAATCTGAAAACCTAATCCGCATTTGCTTTGAAAC 
ACTATATATGACTATCTCTTAACCATAACTTTTTTGGGAATGCAGGGGGGACCGAACCAT 
ATCGCATCGAGTTCGGGTCAGTTGGGAGGAGTTGTGGTTGCTGCCATGGCGTGGGCACGG 
ATAAAGAAAGGTCGATTTTGATACTGAACTGATTTGCTGCAAAAGATTCAGTGGACCTGA 
TCATCATCATGCAAGTTATGTTAAACTTCGTGACAAATGAGATTTTTGTCAACATTTTTT 
CGTTTGGGTTGGACTTTTGTTTTGTAAACTGAAACCAAGAGATCAATTTGATGGGTTTGG 
ACCGTTTGGTTACATTGGAAACTATTTTT
>AT1G74140.4 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN integral to membrane chloroplast CONTAINS InterPro DOMAIN/s Peptidase S54 rhomboid (InterProIPR002610) BEST Arabidopsis thaliana protein match is rhomboid family protein (TAIRAT1G741301) Has 428 Blast hits to 428 proteins in 155 species Archae - 4 Bacteria - 272 Metazoa - 0 Fungi - 4 Plants - 36 Viruses - 0 Other Eukaryotes - 112 (source NCBI BLink) 
AAAAATGCATGCGATTTTCTCGAGCTTCTCTCGCAAGGTCGTCGTCAACGTTGGTGCTTC 
TTCGCAATCGCAGCTAACCAAAATGGTGAAGAAGAAGCCAAACCAGAGTCGCCATCTTCT 
TCCTTCTCGCCTCTCATCTCCTTCTTCAGTACCCCACTTTGTTCCTTCTGCTGTATCGCG 
ATCGGCGAAAGTTCATGGGTTCTTCGCTAGCAAGTTGGGAAACACCAACCTAAAGCTGAA 
ATTTGGAAATGTTATGGAATCTAGAGCTGGGTTCTTCAGTTCTGAGCTTCCTAGTCATGG 
ATTCGAGTCTGGTGGTTTTACTGGGTTTCAAAAGCGGGGATGGTATGGAATCGTTTCTTT 
TCGTCTCTCTTTTTGTGATTTTGCTGGGTTTCAAAGTAAAGTTCGAAGCTTTCCTAAGAA 
AAGTTCTGAAATTTCTGAGTTCCTTTTCTCTATGATCCTCTGTTTCTCGTTAGCTAAGTA 
AAAGTCTACATTTTTCTGAGATAAACTTGTCTTCCGGATGAAATTTCTGGTTTTGATTTA 
GGGTTTAAACAGAGTTTCTGTGGTAAATTATCGACCTTTTTAAGCTTTTCTGGTTAAGTA 
CACGTTTGGCTCGTCCAAGACATACGACATGTATGAGAATTTCAGAGAAGAAGATGAGAA 
ATTAGTAGGCTTTACTTGCTATTTTGCCATTTCTGTGTAGTTGTTGATTTGAGTAGCGCA 
ACGTTCATTTGATGATTTTCTCTTTATAGATAGACTAGCAGCTCGACACTGTACACACCC 
TTCTCATTAAAACTTTGTTCTCCAGATATAGAAATAACATCAGAATCTTAAAATTCTTCA 
CCAGCTCTAGTTGATTTTGGTTTGTGTTTTCCTCTGGCAGGAAGTCTTGGATCAATGGAG 
CTAATGGTGTAGTTTTTGGACTGGTAATAGCTAATGCTGCTGTATTTACAATGTGGCGAG 
TTTCAGACAGGAGCTGGATGGTGAAAAATTTTGTGGTCTAACATATTACTGCTAAGCTCT 
TGTCTTGGATTTGGTTGATCTTTGATATGTTTGAACCTTGATTTTTGAGCAGCTATCAAC 
GTACAGTTTTACGAGTGGATATATACACACGCTGATAACTTCGGGTTTTAGTCATATTGG 
TACCAGTCAAATCATCTTGAACATGATTGGAATCTCCTACTTCGGCTCCAGAGTATGAAT 
TATCTTTCCTTGTCGTTTCTCCATAGAAAAATGCTTCTCAATATTCGATTTTTGTAAATG 
AGATGAGAAATGCTTTCATTTGTTTCCTATGAAGTGGCTACTTTCACTTTATTGGAGACA 
CATTCACTAATAGCTAATATACTTAAGAAAGAAAAGTCCTTAGCTAGATACTATTAAGTC 
GGTTCTTTTTGGGAAATTTGCTACCTGCTGATCTATCCATTGAATTAACTATTTATCTTC 
ATCTCTAAAATCCATTGTTGCTATTAGATTGCAAGAACCTTGGGACCGCTCTACCTTTTG 
AAGCTGTACTTTGCTGGAGCACTTGGTGGCTCTGTTTGCTTCCTGAGTTATCACGCCCTC 
TTGGCTACACTCAAGGTATATAGCAAGTTTCCTTAAATCATGCTATAAAAAAGCTTTTAA 
AAGAATAACTCTCTTTTATTGATGATTAGAAAACTCTCTCAAAAACTAAACACCAAAGCT 
CTTTTAATGTTTTCTCTTATGTTTGCCACAACTCAACAAACATATATATAGGGTTAGCAC 
ACACTGCAATTTCCTATTCCTAGTCCTACATGAATTAGGAAACTCCTTTTTTCCTTTTCT 
TCTTGAACTAGATAATTACTCTTTTTGACATAACTTAAACCGACTTGTTTAAGTTGTCTT 
GGGCTTCAAGACTTGAAGCTTACCCAACAAGGTCCACAGATTGATAGTTTCTTTCATTGA 
TCATGTTGCATTTCATTTGCAGTACATGTTATATGTGCAAAACGAAGTTTTGAGGTTTAG 
ACTAAATGATTTTACCTCTGGAGATTAAACGAAATGCTTTACCTTTGATATTGGCTGGTC 
AATCATACAGGGTGAAGGAGTTGTCATTAAGGATCATCAATCAACAGCCCCCATTTCGCA 
GCTGCTGGTATGTTCGTACTCACCTAACTAAATGATTTGTCTGAGCATGAACAGACTTGG 
TTGATAGATATAGAGAGTGAGACGAAAATGCACATGCACTTGTTTTTGGATCACCAAATG 
CCGATGAGGTAAACTCTTTCCTATTTGTGTACAGGGTGCTGATGGATCTATGTTTGCCAT 
CGCGCTTCTCGATATGTTCATCTACCCAAAAGTTACAACATACTTTGCGTTGATGCTCCG 
AGTGCATGTAATGTTTGTAAGAACTCTCTACATACTCTTCTTATTGAGTGAAAGTCATGT 
GAACCAACTAAAAAACTCGCACTCTTTAATCTTCAGCGAATCATAAACTTAGGAGTCGAG 
ATTTTAAATATTCCAGAGGTGTGTATAATCTGAAAACCTAATCCGCATTTGCTTTGAAAC 
ACTATATATGACTATCTCTTAACCATAACTTTTTTGGGAATGCAGGGGGGACCGAACCAT 
ATCGCATCGAGTTCGGGTCAGTTGGGAGGAGTTGTGGTTGCTGCCATGGCGTGGGCACGG 
ATAAAGAAAGGTCGATTTTGATACTGAACTGATTTGCTGCAAAAGATTCAGTGGACCTGA 
TCATCATCATGCAAGTTATGTTAAACTTCGTGACAAATGAGATTTTTGTCAACATTTTTT 
CGTTTGGGTTGGACTTTTGTTTTGTAAACTGAAACCAAGAGATCAATTTGATGGGTTTGG 
ACCGTTTGGTTACATTGGAAACTATTTTT
>AT1G74140.5 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN integral to membrane chloroplast CONTAINS InterPro DOMAIN/s Peptidase S54 rhomboid (InterProIPR002610) BEST Arabidopsis thaliana protein match is rhomboid family protein (TAIRAT1G741301) Has 484 Blast hits to 484 proteins in 189 species Archae - 2 Bacteria - 267 Metazoa - 30 Fungi - 47 Plants - 36 Viruses - 0 Other Eukaryotes - 102 (source NCBI BLink) 
ATTCCGAAGAAAAAGGAGAAAAAATGCATGCGATTTTCTCGAGCTTCTCTCGCAAGGTCG 
TCGTCAACGTTGGTGCTTCTTCGCAATCGCAGCTAACCAAAATGGTGAAGAAGAAGCCAA 
ACCAGAGTCGCCATCTTCTTCCTTCTCGCCTCTCATCTCCTTCTTCAGTACCCCACTTTG 
TTCCTTCTGCTGTATCGCGATCGGCGAAAGTTCATGGGTTCTTCGCTAGCAAGTTGGGAA 
ACACCAACCTAAAGCTGAAATTTGGAAATGTTATGGAATCTAGAGCTGGGTTCTTCAGTT 
CTGAGCTTCCTAGTCATGGATTCGAGTCTGGTGGTTTTACTGGGTTTCAAAAGCGGGGAT 
GGTATGGAATCGTTTCTTTTCGTCTCTCTTTTTGTGATTTTGCTGGGTTTCAAAGTAAAG 
TTCGAAGCTTTCCTAAGAAAAGTTCTGAAATTTCTGAGTTCCTTTTCTCTATGATCCTCT 
GTTTCTCGTTAGCTAAGTAAAAGTCTACATTTTTCTGAGATAAACTTGTCTTCCGGATGA 
AATTTCTGGTTTTGATTTAGGGTTTAAACAGAGTTTCTGTGGTAAATTATCGACCTTTTT 
AAGCTTTTCTGGTTAAGTACACGTTTGGCTCGTCCAAGACATACGACATGTATGAGAATT 
TCAGAGAAGAAGATGAGAAATTAGTAGGCTTTACTTGCTATTTTGCCATTTCTGTGTAGT 
TGTTGATTTGAGTAGCGCAACGTTCATTTGATGATTTTCTCTTTATAGATAGACTAGCAG 
CTCGACACTGTACACACCCTTCTCATTAAAACTTTGTTCTCCAGATATAGAAATAACATC 
AGAATCTTAAAATTCTTCACCAGCTCTAGTTGATTTTGGTTTGTGTTTTCCTCTGGCAGG 
AAGTCTTGGATCAATGGAGCTAATGGTGTAGTTTTTGGACTGGTAATAGCTAATGCTGCT 
GTATTTACAATGTGGCGAGTTTCAGACAGGAGCTGGATGGTGAAAAATTTTGTGGTCTAA 
CATATTACTGCTAAGCTCTTGTCTTGGATTTGGTTGATCTTTGATATGTTTGAACCTTGA 
TTTTTGAGCAGCTATCAACGTACAGTTTTACGAGTGGATATATACACACGCTGATAACTT 
CGGGTTTTAGTCATATTGGTACCAGTCAAATCATCTTGAACATGATTGGAATCTCCTACT 
TCGGCTCCAGAGTATGAATTATCTTTCCTTGTCGTTTCTCCATAGAAAAATGCTTCTCAA 
TATTCGATTTTTGTAAATGAGATGAGAAATGCTTTCATTTGTTTCCTATGAAGTGGCTAC 
TTTCACTTTATTGGAGACACATTCACTAATAGCTAATATACTTAAGAAAGAAAAGTCCTT 
AGCTAGATACTATTAAGTCGGTTCTTTTTGGGAAATTTGCTACCTGCTGATCTATCCATT 
GAATTAACTATTTATCTTCATCTCTAAAATCCATTGTTGCTATTAGATTGCAAGAACCTT 
GGGACCGCTCTACCTTTTGAAGCTGTACTTTGCTGGAGCACTTGGTGGCTCTGTTTGCTT 
CCTGAGTTATCACGCCCTCTTGGCTACACTCAAGGTATATAGCAAGTTTCCTTAAATCAT 
GCTATAAAAAAGCTTTTAAAAGAATAACTCTCTTTTATTGATGATTAGAAAACTCTCTCA 
AAAACTAAACACCAAAGCTCTTTTAATGTTTTCTCTTATGTTTGCCACAACTCAACAAAC 
ATATATATAGGGTTAGCACACACTGCAATTTCCTATTCCTAGTCCTACATGAATTAGGAA 
ACTCCTTTTTTCCTTTTCTTCTTGAACTAGATAATTACTCTTTTTGACATAACTTAAACC 
GACTTGTTTAAGTTGTCTTGGGCTTCAAGACTTGAAGCTTACCCAACAAGGTCCACAGAT 
TGATAGTTTCTTTCATTGATCATGTTGCATTTCATTTGCAGTACATGTTATATGTGCAAA 
ACGAAGTTTTGAGGTTTAGACTAAATGATTTTACCTCTGGAGATTAAACGAAATGCTTTA 
CCTTTGATATTGGCTGGTCAATCATACAGGGTGAAGGAGTTGTCATTAAGGATCATCAAT 
CAACAGCCCCCATTTCGCAGCTGCTGGTATGTTCGTACTCACCTAACTAAATGATTTGTC 
TGAGCATGAACAGACTTGGTTGATAGATATAGAGAGTGAGACGAAAATGCACATGCACTT 
GTTTTTGGATCACCAAATGCCGATGAGGTAAACTCTTTCCTATTTGTGTACAGGGTGCTG 
ATGGATCTATGTTTGCCATCGCGCTTCTCGATATGTTCATCTACCCAAAAGTTACAACAT 
ACTTTGCGTTGATGCTCCGAGTGCATGTAATGTTTGTAAGAACTCTCTACATACTCTTCT 
TATTGAGTGAAAGTCATGTGAACCAACTAAAAAACTCGCACTCTTTAATCTTCAGCGAAT 
CATAAACTTAGGAGTCGAGATTTTAAATATTCCAGAGGTGTGTATAATCTGAAAACCTAA 
TCCGCATTTGCTTTGAAACACTATATATGACTATCTCTTAACCATAACTTTTTTGGGAAT 
GCAGGGGGGACCGAACCATATCGCATCGAGTTCGGGTCAGTTGGGAGGAGTTGTGGTTGC 
TGCCATGGCGTGGGCACGGATAAAGAAAGGTCGATTTTGATACTGAACTGATTTGCTGCA 
AAAGATTCAGTGGACCTGATCATCATCATGCAAGTTATGTTAAACTTCGTGACAAATGAG 
ATTTTTGTCAACATTTTTTCGTTTGGGTTGGACTTTTGTTTTGTAAACTGAAACCAAGAG 
ATCAATTTGATGGGTTTGGACCGTTTGGTTACATTGGAAACTATTTTTTTTTCTCT
>AT1G74140.5 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN integral to membrane chloroplast CONTAINS InterPro DOMAIN/s Peptidase S54 rhomboid (InterProIPR002610) BEST Arabidopsis thaliana protein match is rhomboid family protein (TAIRAT1G741301) Has 388 Blast hits to 388 proteins in 139 species Archae - 2 Bacteria - 244 Metazoa - 0 Fungi - 2 Plants - 36 Viruses - 0 Other Eukaryotes - 104 (source NCBI BLink) 
ATTCCGAAGAAAAAGGAGAAAAAATGCATGCGATTTTCTCGAGCTTCTCTCGCAAGGTCG 
TCGTCAACGTTGGTGCTTCTTCGCAATCGCAGCTAACCAAAATGGTGAAGAAGAAGCCAA 
ACCAGAGTCGCCATCTTCTTCCTTCTCGCCTCTCATCTCCTTCTTCAGTACCCCACTTTG 
TTCCTTCTGCTGTATCGCGATCGGCGAAAGTTCATGGGTTCTTCGCTAGCAAGTTGGGAA 
ACACCAACCTAAAGCTGAAATTTGGAAATGTTATGGAATCTAGAGCTGGGTTCTTCAGTT 
CTGAGCTTCCTAGTCATGGATTCGAGTCTGGTGGTTTTACTGGGTTTCAAAAGCGGGGAT 
GGTATGGAATCGTTTCTTTTCGTCTCTCTTTTTGTGATTTTGCTGGGTTTCAAAGTAAAG 
TTCGAAGCTTTCCTAAGAAAAGTTCTGAAATTTCTGAGTTCCTTTTCTCTATGATCCTCT 
GTTTCTCGTTAGCTAAGTAAAAGTCTACATTTTTCTGAGATAAACTTGTCTTCCGGATGA 
AATTTCTGGTTTTGATTTAGGGTTTAAACAGAGTTTCTGTGGTAAATTATCGACCTTTTT 
AAGCTTTTCTGGTTAAGTACACGTTTGGCTCGTCCAAGACATACGACATGTATGAGAATT 
TCAGAGAAGAAGATGAGAAATTAGTAGGCTTTACTTGCTATTTTGCCATTTCTGTGTAGT 
TGTTGATTTGAGTAGCGCAACGTTCATTTGATGATTTTCTCTTTATAGATAGACTAGCAG 
CTCGACACTGTACACACCCTTCTCATTAAAACTTTGTTCTCCAGATATAGAAATAACATC 
AGAATCTTAAAATTCTTCACCAGCTCTAGTTGATTTTGGTTTGTGTTTTCCTCTGGCAGG 
AAGTCTTGGATCAATGGAGCTAATGGTGTAGTTTTTGGACTGGTAATAGCTAATGCTGCT 
GTATTTACAATGTGGCGAGTTTCAGACAGGAGCTGGATGGTGAAAAATTTTGTGGTCTAA 
CATATTACTGCTAAGCTCTTGTCTTGGATTTGGTTGATCTTTGATATGTTTGAACCTTGA 
TTTTTGAGCAGCTATCAACGTACAGTTTTACGAGTGGATATATACACACGCTGATAACTT 
CGGGTTTTAGTCATATTGGTACCAGTCAAATCATCTTGAACATGATTGGAATCTCCTACT 
TCGGCTCCAGAGTATGAATTATCTTTCCTTGTCGTTTCTCCATAGAAAAATGCTTCTCAA 
TATTCGATTTTTGTAAATGAGATGAGAAATGCTTTCATTTGTTTCCTATGAAGTGGCTAC 
TTTCACTTTATTGGAGACACATTCACTAATAGCTAATATACTTAAGAAAGAAAAGTCCTT 
AGCTAGATACTATTAAGTCGGTTCTTTTTGGGAAATTTGCTACCTGCTGATCTATCCATT 
GAATTAACTATTTATCTTCATCTCTAAAATCCATTGTTGCTATTAGATTGCAAGAACCTT 
GGGACCGCTCTACCTTTTGAAGCTGTACTTTGCTGGAGCACTTGGTGGCTCTGTTTGCTT 
CCTGAGTTATCACGCCCTCTTGGCTACACTCAAGGTATATAGCAAGTTTCCTTAAATCAT 
GCTATAAAAAAGCTTTTAAAAGAATAACTCTCTTTTATTGATGATTAGAAAACTCTCTCA 
AAAACTAAACACCAAAGCTCTTTTAATGTTTTCTCTTATGTTTGCCACAACTCAACAAAC 
ATATATATAGGGTTAGCACACACTGCAATTTCCTATTCCTAGTCCTACATGAATTAGGAA 
ACTCCTTTTTTCCTTTTCTTCTTGAACTAGATAATTACTCTTTTTGACATAACTTAAACC 
GACTTGTTTAAGTTGTCTTGGGCTTCAAGACTTGAAGCTTACCCAACAAGGTCCACAGAT 
TGATAGTTTCTTTCATTGATCATGTTGCATTTCATTTGCAGTACATGTTATATGTGCAAA 
ACGAAGTTTTGAGGTTTAGACTAAATGATTTTACCTCTGGAGATTAAACGAAATGCTTTA 
CCTTTGATATTGGCTGGTCAATCATACAGGGTGAAGGAGTTGTCATTAAGGATCATCAAT 
CAACAGCCCCCATTTCGCAGCTGCTGGTATGTTCGTACTCACCTAACTAAATGATTTGTC 
TGAGCATGAACAGACTTGGTTGATAGATATAGAGAGTGAGACGAAAATGCACATGCACTT 
GTTTTTGGATCACCAAATGCCGATGAGGTAAACTCTTTCCTATTTGTGTACAGGGTGCTG 
ATGGATCTATGTTTGCCATCGCGCTTCTCGATATGTTCATCTACCCAAAAGTTACAACAT 
ACTTTGCGTTGATGCTCCGAGTGCATGTAATGTTTGTAAGAACTCTCTACATACTCTTCT 
TATTGAGTGAAAGTCATGTGAACCAACTAAAAAACTCGCACTCTTTAATCTTCAGCGAAT 
CATAAACTTAGGAGTCGAGATTTTAAATATTCCAGAGGTGTGTATAATCTGAAAACCTAA 
TCCGCATTTGCTTTGAAACACTATATATGACTATCTCTTAACCATAACTTTTTTGGGAAT 
GCAGGGGGGACCGAACCATATCGCATCGAGTTCGGGTCAGTTGGGAGGAGTTGTGGTTGC 
TGCCATGGCGTGGGCACGGATAAAGAAAGGTCGATTTTGATACTGAACTGATTTGCTGCA 
AAAGATTCAGTGGACCTGATCATCATCATGCAAGTTATGTTAAACTTCGTGACAAATGAG 
ATTTTTGTCAACATTTTTTCGTTTGGGTTGGACTTTTGTTTTGTAAACTGAAACCAAGAG 
ATCAATTTGATGGGTTTGGACCGTTTGGTTACATTGGAAACTATTTTTTTTTCTCT
>AT1G74140.5 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN integral to membrane chloroplast CONTAINS InterPro DOMAIN/s Peptidase S54 rhomboid (InterProIPR002610) BEST Arabidopsis thaliana protein match is rhomboid family protein (TAIRAT1G741301) Has 428 Blast hits to 428 proteins in 155 species Archae - 4 Bacteria - 272 Metazoa - 0 Fungi - 4 Plants - 36 Viruses - 0 Other Eukaryotes - 112 (source NCBI BLink) 
ATTCCGAAGAAAAAGGAGAAAAAATGCATGCGATTTTCTCGAGCTTCTCTCGCAAGGTCG 
TCGTCAACGTTGGTGCTTCTTCGCAATCGCAGCTAACCAAAATGGTGAAGAAGAAGCCAA 
ACCAGAGTCGCCATCTTCTTCCTTCTCGCCTCTCATCTCCTTCTTCAGTACCCCACTTTG 
TTCCTTCTGCTGTATCGCGATCGGCGAAAGTTCATGGGTTCTTCGCTAGCAAGTTGGGAA 
ACACCAACCTAAAGCTGAAATTTGGAAATGTTATGGAATCTAGAGCTGGGTTCTTCAGTT 
CTGAGCTTCCTAGTCATGGATTCGAGTCTGGTGGTTTTACTGGGTTTCAAAAGCGGGGAT 
GGTATGGAATCGTTTCTTTTCGTCTCTCTTTTTGTGATTTTGCTGGGTTTCAAAGTAAAG 
TTCGAAGCTTTCCTAAGAAAAGTTCTGAAATTTCTGAGTTCCTTTTCTCTATGATCCTCT 
GTTTCTCGTTAGCTAAGTAAAAGTCTACATTTTTCTGAGATAAACTTGTCTTCCGGATGA 
AATTTCTGGTTTTGATTTAGGGTTTAAACAGAGTTTCTGTGGTAAATTATCGACCTTTTT 
AAGCTTTTCTGGTTAAGTACACGTTTGGCTCGTCCAAGACATACGACATGTATGAGAATT 
TCAGAGAAGAAGATGAGAAATTAGTAGGCTTTACTTGCTATTTTGCCATTTCTGTGTAGT 
TGTTGATTTGAGTAGCGCAACGTTCATTTGATGATTTTCTCTTTATAGATAGACTAGCAG 
CTCGACACTGTACACACCCTTCTCATTAAAACTTTGTTCTCCAGATATAGAAATAACATC 
AGAATCTTAAAATTCTTCACCAGCTCTAGTTGATTTTGGTTTGTGTTTTCCTCTGGCAGG 
AAGTCTTGGATCAATGGAGCTAATGGTGTAGTTTTTGGACTGGTAATAGCTAATGCTGCT 
GTATTTACAATGTGGCGAGTTTCAGACAGGAGCTGGATGGTGAAAAATTTTGTGGTCTAA 
CATATTACTGCTAAGCTCTTGTCTTGGATTTGGTTGATCTTTGATATGTTTGAACCTTGA 
TTTTTGAGCAGCTATCAACGTACAGTTTTACGAGTGGATATATACACACGCTGATAACTT 
CGGGTTTTAGTCATATTGGTACCAGTCAAATCATCTTGAACATGATTGGAATCTCCTACT 
TCGGCTCCAGAGTATGAATTATCTTTCCTTGTCGTTTCTCCATAGAAAAATGCTTCTCAA 
TATTCGATTTTTGTAAATGAGATGAGAAATGCTTTCATTTGTTTCCTATGAAGTGGCTAC 
TTTCACTTTATTGGAGACACATTCACTAATAGCTAATATACTTAAGAAAGAAAAGTCCTT 
AGCTAGATACTATTAAGTCGGTTCTTTTTGGGAAATTTGCTACCTGCTGATCTATCCATT 
GAATTAACTATTTATCTTCATCTCTAAAATCCATTGTTGCTATTAGATTGCAAGAACCTT 
GGGACCGCTCTACCTTTTGAAGCTGTACTTTGCTGGAGCACTTGGTGGCTCTGTTTGCTT 
CCTGAGTTATCACGCCCTCTTGGCTACACTCAAGGTATATAGCAAGTTTCCTTAAATCAT 
GCTATAAAAAAGCTTTTAAAAGAATAACTCTCTTTTATTGATGATTAGAAAACTCTCTCA 
AAAACTAAACACCAAAGCTCTTTTAATGTTTTCTCTTATGTTTGCCACAACTCAACAAAC 
ATATATATAGGGTTAGCACACACTGCAATTTCCTATTCCTAGTCCTACATGAATTAGGAA 
ACTCCTTTTTTCCTTTTCTTCTTGAACTAGATAATTACTCTTTTTGACATAACTTAAACC 
GACTTGTTTAAGTTGTCTTGGGCTTCAAGACTTGAAGCTTACCCAACAAGGTCCACAGAT 
TGATAGTTTCTTTCATTGATCATGTTGCATTTCATTTGCAGTACATGTTATATGTGCAAA 
ACGAAGTTTTGAGGTTTAGACTAAATGATTTTACCTCTGGAGATTAAACGAAATGCTTTA 
CCTTTGATATTGGCTGGTCAATCATACAGGGTGAAGGAGTTGTCATTAAGGATCATCAAT 
CAACAGCCCCCATTTCGCAGCTGCTGGTATGTTCGTACTCACCTAACTAAATGATTTGTC 
TGAGCATGAACAGACTTGGTTGATAGATATAGAGAGTGAGACGAAAATGCACATGCACTT 
GTTTTTGGATCACCAAATGCCGATGAGGTAAACTCTTTCCTATTTGTGTACAGGGTGCTG 
ATGGATCTATGTTTGCCATCGCGCTTCTCGATATGTTCATCTACCCAAAAGTTACAACAT 
ACTTTGCGTTGATGCTCCGAGTGCATGTAATGTTTGTAAGAACTCTCTACATACTCTTCT 
TATTGAGTGAAAGTCATGTGAACCAACTAAAAAACTCGCACTCTTTAATCTTCAGCGAAT 
CATAAACTTAGGAGTCGAGATTTTAAATATTCCAGAGGTGTGTATAATCTGAAAACCTAA 
TCCGCATTTGCTTTGAAACACTATATATGACTATCTCTTAACCATAACTTTTTTGGGAAT 
GCAGGGGGGACCGAACCATATCGCATCGAGTTCGGGTCAGTTGGGAGGAGTTGTGGTTGC 
TGCCATGGCGTGGGCACGGATAAAGAAAGGTCGATTTTGATACTGAACTGATTTGCTGCA 
AAAGATTCAGTGGACCTGATCATCATCATGCAAGTTATGTTAAACTTCGTGACAAATGAG 
ATTTTTGTCAACATTTTTTCGTTTGGGTTGGACTTTTGTTTTGTAAACTGAAACCAAGAG 
ATCAATTTGATGGGTTTGGACCGTTTGGTTACATTGGAAACTATTTTTTTTTCTCT
>AT1G74140.1 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN integral to membrane chloroplast CONTAINS InterPro DOMAIN/s Peptidase S54 rhomboid (InterProIPR002610) BEST Arabidopsis thaliana protein match is rhomboid family protein (TAIRAT1G741301) Has 394 Blast hits to 394 proteins in 153 species Archae - 6 Bacteria - 196 Metazoa - 30 Fungi - 47 Plants - 36 Viruses - 0 Other Eukaryotes - 79 (source NCBI BLink) 
ATTCCGAAGAAAAAGGAGAAAAAATGCATGCGATTTTCTCGAGCTTCTCTCGCAAGGTCG 
TCGTCAACGTTGGTGCTTCTTCGCAATCGCAGCTAACCAAAATGGTGAAGAAGAAGCCAA 
ACCAGAGTCGCCATCTTCTTCCTTCTCGCCTCTCATCTCCTTCTTCAGTACCCCACTTTG 
TTCCTTCTGCTGTATCGCGATCGGCGAAAGTTCATGGGTTCTTCGCTAGCAAGTTGGGAA 
ACACCAACCTAAAGCTGAAATTTGGAAATGTTATGGAATCTAGAGCTGGGTTCTTCAGTT 
CTGAGCTTCCTAGTCATGGATTCGAGTCTGGTGGTTTTACTGGGTTTCAAAAGCGGGGAT 
GGTATGGAATCGTTTCTTTTCGTCTCTCTTTTTGTGATTTTGCTGGGTTTCAAAGTAAAG 
TTCGAAGCTTTCCTAAGAAAAGTTCTGAAATTTCTGAGTTCCTTTTCTCTATGATCCTCT 
GTTTCTCGTTAGCTAAGTAAAAGTCTACATTTTTCTGAGATAAACTTGTCTTCCGGATGA 
AATTTCTGGTTTTGATTTAGGGTTTAAACAGAGTTTCTGTGGTAAATTATCGACCTTTTT 
AAGCTTTTCTGGTTAAGTACACGTTTGGCTCGTCCAAGACATACGACATGTATGAGAATT 
TCAGAGAAGAAGATGAGAAATTAGTAGGCTTTACTTGCTATTTTGCCATTTCTGTGTAGT 
TGTTGATTTGAGTAGCGCAACGTTCATTTGATGATTTTCTCTTTATAGATAGACTAGCAG 
CTCGACACTGTACACACCCTTCTCATTAAAACTTTGTTCTCCAGATATAGAAATAACATC 
AGAATCTTAAAATTCTTCACCAGCTCTAGTTGATTTTGGTTTGTGTTTTCCTCTGGCAGG 
AAGTCTTGGATCAATGGAGCTAATGGTGTAGTTTTTGGACTGGTAATAGCTAATGCTGCT 
GTATTTACAATGTGGCGAGTTTCAGACAGGAGCTGGATGGTGAAAAATTTTGTGGTCTAA 
CATATTACTGCTAAGCTCTTGTCTTGGATTTGGTTGATCTTTGATATGTTTGAACCTTGA 
TTTTTGAGCAGCTATCAACGTACAGTTTTACGAGTGGATATATACACACGCTGATAACTT 
CGGGTTTTAGTCATATTGGTACCAGTCAAATCATCTTGAACATGATTGGAATCTCCTACT 
TCGGCTCCAGAGTATGAATTATCTTTCCTTGTCGTTTCTCCATAGAAAAATGCTTCTCAA 
TATTCGATTTTTGTAAATGAGATGAGAAATGCTTTCATTTGTTTCCTATGAAGTGGCTAC 
TTTCACTTTATTGGAGACACATTCACTAATAGCTAATATACTTAAGAAAGAAAAGTCCTT 
AGCTAGATACTATTAAGTCGGTTCTTTTTGGGAAATTTGCTACCTGCTGATCTATCCATT 
GAATTAACTATTTATCTTCATCTCTAAAATCCATTGTTGCTATTAGATTGCAAGAACCTT 
GGGACCGCTCTACCTTTTGAAGCTGTACTTTGCTGGAGCACTTGGTGGCTCTGTTTGCTT 
CCTGAGTTATCACGCCCTCTTGGCTACACTCAAGGTATATAGCAAGTTTCCTTAAATCAT 
GCTATAAAAAAGCTTTTAAAAGAATAACTCTCTTTTATTGATGATTAGAAAACTCTCTCA 
AAAACTAAACACCAAAGCTCTTTTAATGTTTTCTCTTATGTTTGCCACAACTCAACAAAC 
ATATATATAGGGTTAGCACACACTGCAATTTCCTATTCCTAGTCCTACATGAATTAGGAA 
ACTCCTTTTTTCCTTTTCTTCTTGAACTAGATAATTACTCTTTTTGACATAACTTAAACC 
GACTTGTTTAAGTTGTCTTGGGCTTCAAGACTTGAAGCTTACCCAACAAGGTCCACAGAT 
TGATAGTTTCTTTCATTGATCATGTTGCATTTCATTTGCAGTACATGTTATATGTGCAAA 
ACGAAGTTTTGAGGTTTAGACTAAATGATTTTACCTCTGGAGATTAAACGAAATGCTTTA 
CCTTTGATATTGGCTGGTCAATCATACAGGGTGAAGGAGTTGTCATTAAGGATCATCAAT 
CAACAGCCCCCATTTCGCAGCTGCTGGTATGTTCGTACTCACCTAACTAAATGATTTGTC 
TGAGCATGAACAGACTTGGTTGATAGATATAGAGAGTGAGACGAAAATGCACATGCACTT 
GTTTTTGGATCACCAAATGCCGATGAGGTAAACTCTTTCCTATTTGTGTACAGGGTGCTG 
ATGGATCTATGTTTGCCATCGCGCTTCTCGATATGTTCATCTACCCAAAAGTTACAACAT 
ACTTTGCGTTGATGCTCCGAGTGCATGTAATGTTTGTAAGAACTCTCTACATACTCTTCT 
TATTGAGTGAAAGTCATGTGAACCAACTAAAAAACTCGCACTCTTTAATCTTCAGCGAAT 
CATAAACTTAGGAGTCGAGATTTTAAATATTCCAGAGGTGTGTATAATCTGAAAACCTAA 
TCCGCATTTGCTTTGAAACACTATATATGACTATCTCTTAACCATAACTTTTTTGGGAAT 
GCAGGGGGGACCGAACCATATCGCATCGAGTTCGGGTCAGTTGGGAGGAGTTGTGGTTGC 
TGCCATGGCGTGGGCACGGATAAAGAAAGGTCGATTTTGATACTGAACTGATTTGCTGCA 
AAAGATTCAGTGGACCTGATCATCATCATGCAAGTTATGTTAAACTTCGTGACAAATGAG 
ATTTTTGTCAACATTTTTTCGTTTGGGTTGGACTTTTGTTTTGTAAACTGAAACCAAGAG 
ATCAATTTGATGGGTTTGGACCGTTTGGTTACATTGGAAACTATTTTTTTTTC
>AT1G74140.1 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN integral to membrane chloroplast CONTAINS InterPro DOMAIN/s Peptidase S54 rhomboid (InterProIPR002610) BEST Arabidopsis thaliana protein match is rhomboid family protein (TAIRAT1G741301) Has 536 Blast hits to 536 proteins in 206 species Archae - 2 Bacteria - 298 Metazoa - 30 Fungi - 50 Plants - 36 Viruses - 0 Other Eukaryotes - 120 (source NCBI BLink) 
ATTCCGAAGAAAAAGGAGAAAAAATGCATGCGATTTTCTCGAGCTTCTCTCGCAAGGTCG 
TCGTCAACGTTGGTGCTTCTTCGCAATCGCAGCTAACCAAAATGGTGAAGAAGAAGCCAA 
ACCAGAGTCGCCATCTTCTTCCTTCTCGCCTCTCATCTCCTTCTTCAGTACCCCACTTTG 
TTCCTTCTGCTGTATCGCGATCGGCGAAAGTTCATGGGTTCTTCGCTAGCAAGTTGGGAA 
ACACCAACCTAAAGCTGAAATTTGGAAATGTTATGGAATCTAGAGCTGGGTTCTTCAGTT 
CTGAGCTTCCTAGTCATGGATTCGAGTCTGGTGGTTTTACTGGGTTTCAAAAGCGGGGAT 
GGTATGGAATCGTTTCTTTTCGTCTCTCTTTTTGTGATTTTGCTGGGTTTCAAAGTAAAG 
TTCGAAGCTTTCCTAAGAAAAGTTCTGAAATTTCTGAGTTCCTTTTCTCTATGATCCTCT 
GTTTCTCGTTAGCTAAGTAAAAGTCTACATTTTTCTGAGATAAACTTGTCTTCCGGATGA 
AATTTCTGGTTTTGATTTAGGGTTTAAACAGAGTTTCTGTGGTAAATTATCGACCTTTTT 
AAGCTTTTCTGGTTAAGTACACGTTTGGCTCGTCCAAGACATACGACATGTATGAGAATT 
TCAGAGAAGAAGATGAGAAATTAGTAGGCTTTACTTGCTATTTTGCCATTTCTGTGTAGT 
TGTTGATTTGAGTAGCGCAACGTTCATTTGATGATTTTCTCTTTATAGATAGACTAGCAG 
CTCGACACTGTACACACCCTTCTCATTAAAACTTTGTTCTCCAGATATAGAAATAACATC 
AGAATCTTAAAATTCTTCACCAGCTCTAGTTGATTTTGGTTTGTGTTTTCCTCTGGCAGG 
AAGTCTTGGATCAATGGAGCTAATGGTGTAGTTTTTGGACTGGTAATAGCTAATGCTGCT 
GTATTTACAATGTGGCGAGTTTCAGACAGGAGCTGGATGGTGAAAAATTTTGTGGTCTAA 
CATATTACTGCTAAGCTCTTGTCTTGGATTTGGTTGATCTTTGATATGTTTGAACCTTGA 
TTTTTGAGCAGCTATCAACGTACAGTTTTACGAGTGGATATATACACACGCTGATAACTT 
CGGGTTTTAGTCATATTGGTACCAGTCAAATCATCTTGAACATGATTGGAATCTCCTACT 
TCGGCTCCAGAGTATGAATTATCTTTCCTTGTCGTTTCTCCATAGAAAAATGCTTCTCAA 
TATTCGATTTTTGTAAATGAGATGAGAAATGCTTTCATTTGTTTCCTATGAAGTGGCTAC 
TTTCACTTTATTGGAGACACATTCACTAATAGCTAATATACTTAAGAAAGAAAAGTCCTT 
AGCTAGATACTATTAAGTCGGTTCTTTTTGGGAAATTTGCTACCTGCTGATCTATCCATT 
GAATTAACTATTTATCTTCATCTCTAAAATCCATTGTTGCTATTAGATTGCAAGAACCTT 
GGGACCGCTCTACCTTTTGAAGCTGTACTTTGCTGGAGCACTTGGTGGCTCTGTTTGCTT 
CCTGAGTTATCACGCCCTCTTGGCTACACTCAAGGTATATAGCAAGTTTCCTTAAATCAT 
GCTATAAAAAAGCTTTTAAAAGAATAACTCTCTTTTATTGATGATTAGAAAACTCTCTCA 
AAAACTAAACACCAAAGCTCTTTTAATGTTTTCTCTTATGTTTGCCACAACTCAACAAAC 
ATATATATAGGGTTAGCACACACTGCAATTTCCTATTCCTAGTCCTACATGAATTAGGAA 
ACTCCTTTTTTCCTTTTCTTCTTGAACTAGATAATTACTCTTTTTGACATAACTTAAACC 
GACTTGTTTAAGTTGTCTTGGGCTTCAAGACTTGAAGCTTACCCAACAAGGTCCACAGAT 
TGATAGTTTCTTTCATTGATCATGTTGCATTTCATTTGCAGTACATGTTATATGTGCAAA 
ACGAAGTTTTGAGGTTTAGACTAAATGATTTTACCTCTGGAGATTAAACGAAATGCTTTA 
CCTTTGATATTGGCTGGTCAATCATACAGGGTGAAGGAGTTGTCATTAAGGATCATCAAT 
CAACAGCCCCCATTTCGCAGCTGCTGGTATGTTCGTACTCACCTAACTAAATGATTTGTC 
TGAGCATGAACAGACTTGGTTGATAGATATAGAGAGTGAGACGAAAATGCACATGCACTT 
GTTTTTGGATCACCAAATGCCGATGAGGTAAACTCTTTCCTATTTGTGTACAGGGTGCTG 
ATGGATCTATGTTTGCCATCGCGCTTCTCGATATGTTCATCTACCCAAAAGTTACAACAT 
ACTTTGCGTTGATGCTCCGAGTGCATGTAATGTTTGTAAGAACTCTCTACATACTCTTCT 
TATTGAGTGAAAGTCATGTGAACCAACTAAAAAACTCGCACTCTTTAATCTTCAGCGAAT 
CATAAACTTAGGAGTCGAGATTTTAAATATTCCAGAGGTGTGTATAATCTGAAAACCTAA 
TCCGCATTTGCTTTGAAACACTATATATGACTATCTCTTAACCATAACTTTTTTGGGAAT 
GCAGGGGGGACCGAACCATATCGCATCGAGTTCGGGTCAGTTGGGAGGAGTTGTGGTTGC 
TGCCATGGCGTGGGCACGGATAAAGAAAGGTCGATTTTGATACTGAACTGATTTGCTGCA 
AAAGATTCAGTGGACCTGATCATCATCATGCAAGTTATGTTAAACTTCGTGACAAATGAG 
ATTTTTGTCAACATTTTTTCGTTTGGGTTGGACTTTTGTTTTGTAAACTGAAACCAAGAG 
ATCAATTTGATGGGTTTGGACCGTTTGGTTACATTGGAAACTATTTTTTTTTC
>AT1G74140.2 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN integral to membrane chloroplast CONTAINS InterPro DOMAIN/s Peptidase S54 rhomboid (InterProIPR002610) BEST Arabidopsis thaliana protein match is rhomboid family protein (TAIRAT1G741301) Has 394 Blast hits to 394 proteins in 153 species Archae - 6 Bacteria - 196 Metazoa - 30 Fungi - 47 Plants - 36 Viruses - 0 Other Eukaryotes - 79 (source NCBI BLink) 
ATTCCGAAGAAAAAGGAGAAAAAATGCATGCGATTTTCTCGAGCTTCTCTCGCAAGGTCG 
TCGTCAACGTTGGTGCTTCTTCGCAATCGCAGCTAACCAAAATGGTGAAGAAGAAGCCAA 
ACCAGAGTCGCCATCTTCTTCCTTCTCGCCTCTCATCTCCTTCTTCAGTACCCCACTTTG 
TTCCTTCTGCTGTATCGCGATCGGCGAAAGTTCATGGGTTCTTCGCTAGCAAGTTGGGAA 
ACACCAACCTAAAGCTGAAATTTGGAAATGTTATGGAATCTAGAGCTGGGTTCTTCAGTT 
CTGAGCTTCCTAGTCATGGATTCGAGTCTGGTGGTTTTACTGGGTTTCAAAAGCGGGGAT 
GGTATGGAATCGTTTCTTTTCGTCTCTCTTTTTGTGATTTTGCTGGGTTTCAAAGTAAAG 
TTCGAAGCTTTCCTAAGAAAAGTTCTGAAATTTCTGAGTTCCTTTTCTCTATGATCCTCT 
GTTTCTCGTTAGCTAAGTAAAAGTCTACATTTTTCTGAGATAAACTTGTCTTCCGGATGA 
AATTTCTGGTTTTGATTTAGGGTTTAAACAGAGTTTCTGTGGTAAATTATCGACCTTTTT 
AAGCTTTTCTGGTTAAGTACACGTTTGGCTCGTCCAAGACATACGACATGTATGAGAATT 
TCAGAGAAGAAGATGAGAAATTAGTAGGCTTTACTTGCTATTTTGCCATTTCTGTGTAGT 
TGTTGATTTGAGTAGCGCAACGTTCATTTGATGATTTTCTCTTTATAGATAGACTAGCAG 
CTCGACACTGTACACACCCTTCTCATTAAAACTTTGTTCTCCAGATATAGAAATAACATC 
AGAATCTTAAAATTCTTCACCAGCTCTAGTTGATTTTGGTTTGTGTTTTCCTCTGGCAGG 
AAGTCTTGGATCAATGGAGCTAATGGTGTAGTTTTTGGACTGGTAATAGCTAATGCTGCT 
GTATTTACAATGTGGCGAGTTTCAGACAGGAGCTGGATGGTGAAAAATTTTGTGGTCTAA 
CATATTACTGCTAAGCTCTTGTCTTGGATTTGGTTGATCTTTGATATGTTTGAACCTTGA 
TTTTTGAGCAGCTATCAACGTACAGTTTTACGAGTGGATATATACACACGCTGATAACTT 
CGGGTTTTAGTCATATTGGTACCAGTCAAATCATCTTGAACATGATTGGAATCTCCTACT 
TCGGCTCCAGAGTATGAATTATCTTTCCTTGTCGTTTCTCCATAGAAAAATGCTTCTCAA 
TATTCGATTTTTGTAAATGAGATGAGAAATGCTTTCATTTGTTTCCTATGAAGTGGCTAC 
TTTCACTTTATTGGAGACACATTCACTAATAGCTAATATACTTAAGAAAGAAAAGTCCTT 
AGCTAGATACTATTAAGTCGGTTCTTTTTGGGAAATTTGCTACCTGCTGATCTATCCATT 
GAATTAACTATTTATCTTCATCTCTAAAATCCATTGTTGCTATTAGATTGCAAGAACCTT 
GGGACCGCTCTACCTTTTGAAGCTGTACTTTGCTGGAGCACTTGGTGGCTCTGTTTGCTT 
CCTGAGTTATCACGCCCTCTTGGCTACACTCAAGGTATATAGCAAGTTTCCTTAAATCAT 
GCTATAAAAAAGCTTTTAAAAGAATAACTCTCTTTTATTGATGATTAGAAAACTCTCTCA 
AAAACTAAACACCAAAGCTCTTTTAATGTTTTCTCTTATGTTTGCCACAACTCAACAAAC 
ATATATATAGGGTTAGCACACACTGCAATTTCCTATTCCTAGTCCTACATGAATTAGGAA 
ACTCCTTTTTTCCTTTTCTTCTTGAACTAGATAATTACTCTTTTTGACATAACTTAAACC 
GACTTGTTTAAGTTGTCTTGGGCTTCAAGACTTGAAGCTTACCCAACAAGGTCCACAGAT 
TGATAGTTTCTTTCATTGATCATGTTGCATTTCATTTGCAGTACATGTTATATGTGCAAA 
ACGAAGTTTTGAGGTTTAGACTAAATGATTTTACCTCTGGAGATTAAACGAAATGCTTTA 
CCTTTGATATTGGCTGGTCAATCATACAGGGTGAAGGAGTTGTCATTAAGGATCATCAAT 
CAACAGCCCCCATTTCGCAGCTGCTGGTATGTTCGTACTCACCTAACTAAATGATTTGTC 
TGAGCATGAACAGACTTGGTTGATAGATATAGAGAGTGAGACGAAAATGCACATGCACTT 
GTTTTTGGATCACCAAATGCCGATGAGGTAAACTCTTTCCTATTTGTGTACAGGGTGCTG 
ATGGATCTATGTTTGCCATCGCGCTTCTCGATATGTTCATCTACCCAAAAGTTACAACAT 
ACTTTGCGTTGATGCTCCGAGTGCATGTAATGTTTGTAAGAACTCTCTACATACTCTTCT 
TATTGAGTGAAAGTCATGTGAACCAACTAAAAAACTCGCACTCTTTAATCTTCAGCGAAT 
CATAAACTTAGGAGTCGAGATTTTAAATATTCCAGAGGTGTGTATAATCTGAAAACCTAA 
TCCGCATTTGCTTTGAAACACTATATATGACTATCTCTTAACCATAACTTTTTTGGGAAT 
GCAGGGGGGACCGAACCATATCGCATCGAGTTCGGGTCAGTTGGGAGGAGTTGTGGTTGC 
TGCCATGGCGTGGGCACGGATAAAGAAAGGTCGATTTTGATACTGAACTGATTTGCTGCA 
AAAGATTCAGTGGACCTGATCATCATCATGCAAGTTATGTTAAACTTCGTGACAAATGAG 
ATTTTTGTCAACATTTTTTCGTTTGGGTTGGACTTTTGTTTTGTAAACTGAAACCAAGAG 
ATCAATTTGATGGGTTTGGACCGTTTGGTTACATTGGAAACTATTTTTTTTTC
>AT1G74140.2 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN integral to membrane chloroplast CONTAINS InterPro DOMAIN/s Peptidase S54 rhomboid (InterProIPR002610) BEST Arabidopsis thaliana protein match is rhomboid family protein (TAIRAT1G741301) Has 536 Blast hits to 536 proteins in 206 species Archae - 2 Bacteria - 298 Metazoa - 30 Fungi - 50 Plants - 36 Viruses - 0 Other Eukaryotes - 120 (source NCBI BLink) 
ATTCCGAAGAAAAAGGAGAAAAAATGCATGCGATTTTCTCGAGCTTCTCTCGCAAGGTCG 
TCGTCAACGTTGGTGCTTCTTCGCAATCGCAGCTAACCAAAATGGTGAAGAAGAAGCCAA 
ACCAGAGTCGCCATCTTCTTCCTTCTCGCCTCTCATCTCCTTCTTCAGTACCCCACTTTG 
TTCCTTCTGCTGTATCGCGATCGGCGAAAGTTCATGGGTTCTTCGCTAGCAAGTTGGGAA 
ACACCAACCTAAAGCTGAAATTTGGAAATGTTATGGAATCTAGAGCTGGGTTCTTCAGTT 
CTGAGCTTCCTAGTCATGGATTCGAGTCTGGTGGTTTTACTGGGTTTCAAAAGCGGGGAT 
GGTATGGAATCGTTTCTTTTCGTCTCTCTTTTTGTGATTTTGCTGGGTTTCAAAGTAAAG 
TTCGAAGCTTTCCTAAGAAAAGTTCTGAAATTTCTGAGTTCCTTTTCTCTATGATCCTCT 
GTTTCTCGTTAGCTAAGTAAAAGTCTACATTTTTCTGAGATAAACTTGTCTTCCGGATGA 
AATTTCTGGTTTTGATTTAGGGTTTAAACAGAGTTTCTGTGGTAAATTATCGACCTTTTT 
AAGCTTTTCTGGTTAAGTACACGTTTGGCTCGTCCAAGACATACGACATGTATGAGAATT 
TCAGAGAAGAAGATGAGAAATTAGTAGGCTTTACTTGCTATTTTGCCATTTCTGTGTAGT 
TGTTGATTTGAGTAGCGCAACGTTCATTTGATGATTTTCTCTTTATAGATAGACTAGCAG 
CTCGACACTGTACACACCCTTCTCATTAAAACTTTGTTCTCCAGATATAGAAATAACATC 
AGAATCTTAAAATTCTTCACCAGCTCTAGTTGATTTTGGTTTGTGTTTTCCTCTGGCAGG 
AAGTCTTGGATCAATGGAGCTAATGGTGTAGTTTTTGGACTGGTAATAGCTAATGCTGCT 
GTATTTACAATGTGGCGAGTTTCAGACAGGAGCTGGATGGTGAAAAATTTTGTGGTCTAA 
CATATTACTGCTAAGCTCTTGTCTTGGATTTGGTTGATCTTTGATATGTTTGAACCTTGA 
TTTTTGAGCAGCTATCAACGTACAGTTTTACGAGTGGATATATACACACGCTGATAACTT 
CGGGTTTTAGTCATATTGGTACCAGTCAAATCATCTTGAACATGATTGGAATCTCCTACT 
TCGGCTCCAGAGTATGAATTATCTTTCCTTGTCGTTTCTCCATAGAAAAATGCTTCTCAA 
TATTCGATTTTTGTAAATGAGATGAGAAATGCTTTCATTTGTTTCCTATGAAGTGGCTAC 
TTTCACTTTATTGGAGACACATTCACTAATAGCTAATATACTTAAGAAAGAAAAGTCCTT 
AGCTAGATACTATTAAGTCGGTTCTTTTTGGGAAATTTGCTACCTGCTGATCTATCCATT 
GAATTAACTATTTATCTTCATCTCTAAAATCCATTGTTGCTATTAGATTGCAAGAACCTT 
GGGACCGCTCTACCTTTTGAAGCTGTACTTTGCTGGAGCACTTGGTGGCTCTGTTTGCTT 
CCTGAGTTATCACGCCCTCTTGGCTACACTCAAGGTATATAGCAAGTTTCCTTAAATCAT 
GCTATAAAAAAGCTTTTAAAAGAATAACTCTCTTTTATTGATGATTAGAAAACTCTCTCA 
AAAACTAAACACCAAAGCTCTTTTAATGTTTTCTCTTATGTTTGCCACAACTCAACAAAC 
ATATATATAGGGTTAGCACACACTGCAATTTCCTATTCCTAGTCCTACATGAATTAGGAA 
ACTCCTTTTTTCCTTTTCTTCTTGAACTAGATAATTACTCTTTTTGACATAACTTAAACC 
GACTTGTTTAAGTTGTCTTGGGCTTCAAGACTTGAAGCTTACCCAACAAGGTCCACAGAT 
TGATAGTTTCTTTCATTGATCATGTTGCATTTCATTTGCAGTACATGTTATATGTGCAAA 
ACGAAGTTTTGAGGTTTAGACTAAATGATTTTACCTCTGGAGATTAAACGAAATGCTTTA 
CCTTTGATATTGGCTGGTCAATCATACAGGGTGAAGGAGTTGTCATTAAGGATCATCAAT 
CAACAGCCCCCATTTCGCAGCTGCTGGTATGTTCGTACTCACCTAACTAAATGATTTGTC 
TGAGCATGAACAGACTTGGTTGATAGATATAGAGAGTGAGACGAAAATGCACATGCACTT 
GTTTTTGGATCACCAAATGCCGATGAGGTAAACTCTTTCCTATTTGTGTACAGGGTGCTG 
ATGGATCTATGTTTGCCATCGCGCTTCTCGATATGTTCATCTACCCAAAAGTTACAACAT 
ACTTTGCGTTGATGCTCCGAGTGCATGTAATGTTTGTAAGAACTCTCTACATACTCTTCT 
TATTGAGTGAAAGTCATGTGAACCAACTAAAAAACTCGCACTCTTTAATCTTCAGCGAAT 
CATAAACTTAGGAGTCGAGATTTTAAATATTCCAGAGGTGTGTATAATCTGAAAACCTAA 
TCCGCATTTGCTTTGAAACACTATATATGACTATCTCTTAACCATAACTTTTTTGGGAAT 
GCAGGGGGGACCGAACCATATCGCATCGAGTTCGGGTCAGTTGGGAGGAGTTGTGGTTGC 
TGCCATGGCGTGGGCACGGATAAAGAAAGGTCGATTTTGATACTGAACTGATTTGCTGCA 
AAAGATTCAGTGGACCTGATCATCATCATGCAAGTTATGTTAAACTTCGTGACAAATGAG 
ATTTTTGTCAACATTTTTTCGTTTGGGTTGGACTTTTGTTTTGTAAACTGAAACCAAGAG 
ATCAATTTGATGGGTTTGGACCGTTTGGTTACATTGGAAACTATTTTTTTTTC
>AT1G74140.3 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN integral to membrane chloroplast CONTAINS InterPro DOMAIN/s Peptidase S54 rhomboid (InterProIPR002610) BEST Arabidopsis thaliana protein match is rhomboid family protein (TAIRAT1G741301) Has 394 Blast hits to 394 proteins in 153 species Archae - 6 Bacteria - 196 Metazoa - 30 Fungi - 47 Plants - 36 Viruses - 0 Other Eukaryotes - 79 (source NCBI BLink) 
ATTCCGAAGAAAAAGGAGAAAAAATGCATGCGATTTTCTCGAGCTTCTCTCGCAAGGTCG 
TCGTCAACGTTGGTGCTTCTTCGCAATCGCAGCTAACCAAAATGGTGAAGAAGAAGCCAA 
ACCAGAGTCGCCATCTTCTTCCTTCTCGCCTCTCATCTCCTTCTTCAGTACCCCACTTTG 
TTCCTTCTGCTGTATCGCGATCGGCGAAAGTTCATGGGTTCTTCGCTAGCAAGTTGGGAA 
ACACCAACCTAAAGCTGAAATTTGGAAATGTTATGGAATCTAGAGCTGGGTTCTTCAGTT 
CTGAGCTTCCTAGTCATGGATTCGAGTCTGGTGGTTTTACTGGGTTTCAAAAGCGGGGAT 
GGTATGGAATCGTTTCTTTTCGTCTCTCTTTTTGTGATTTTGCTGGGTTTCAAAGTAAAG 
TTCGAAGCTTTCCTAAGAAAAGTTCTGAAATTTCTGAGTTCCTTTTCTCTATGATCCTCT 
GTTTCTCGTTAGCTAAGTAAAAGTCTACATTTTTCTGAGATAAACTTGTCTTCCGGATGA 
AATTTCTGGTTTTGATTTAGGGTTTAAACAGAGTTTCTGTGGTAAATTATCGACCTTTTT 
AAGCTTTTCTGGTTAAGTACACGTTTGGCTCGTCCAAGACATACGACATGTATGAGAATT 
TCAGAGAAGAAGATGAGAAATTAGTAGGCTTTACTTGCTATTTTGCCATTTCTGTGTAGT 
TGTTGATTTGAGTAGCGCAACGTTCATTTGATGATTTTCTCTTTATAGATAGACTAGCAG 
CTCGACACTGTACACACCCTTCTCATTAAAACTTTGTTCTCCAGATATAGAAATAACATC 
AGAATCTTAAAATTCTTCACCAGCTCTAGTTGATTTTGGTTTGTGTTTTCCTCTGGCAGG 
AAGTCTTGGATCAATGGAGCTAATGGTGTAGTTTTTGGACTGGTAATAGCTAATGCTGCT 
GTATTTACAATGTGGCGAGTTTCAGACAGGAGCTGGATGGTGAAAAATTTTGTGGTCTAA 
CATATTACTGCTAAGCTCTTGTCTTGGATTTGGTTGATCTTTGATATGTTTGAACCTTGA 
TTTTTGAGCAGCTATCAACGTACAGTTTTACGAGTGGATATATACACACGCTGATAACTT 
CGGGTTTTAGTCATATTGGTACCAGTCAAATCATCTTGAACATGATTGGAATCTCCTACT 
TCGGCTCCAGAGTATGAATTATCTTTCCTTGTCGTTTCTCCATAGAAAAATGCTTCTCAA 
TATTCGATTTTTGTAAATGAGATGAGAAATGCTTTCATTTGTTTCCTATGAAGTGGCTAC 
TTTCACTTTATTGGAGACACATTCACTAATAGCTAATATACTTAAGAAAGAAAAGTCCTT 
AGCTAGATACTATTAAGTCGGTTCTTTTTGGGAAATTTGCTACCTGCTGATCTATCCATT 
GAATTAACTATTTATCTTCATCTCTAAAATCCATTGTTGCTATTAGATTGCAAGAACCTT 
GGGACCGCTCTACCTTTTGAAGCTGTACTTTGCTGGAGCACTTGGTGGCTCTGTTTGCTT 
CCTGAGTTATCACGCCCTCTTGGCTACACTCAAGGTATATAGCAAGTTTCCTTAAATCAT 
GCTATAAAAAAGCTTTTAAAAGAATAACTCTCTTTTATTGATGATTAGAAAACTCTCTCA 
AAAACTAAACACCAAAGCTCTTTTAATGTTTTCTCTTATGTTTGCCACAACTCAACAAAC 
ATATATATAGGGTTAGCACACACTGCAATTTCCTATTCCTAGTCCTACATGAATTAGGAA 
ACTCCTTTTTTCCTTTTCTTCTTGAACTAGATAATTACTCTTTTTGACATAACTTAAACC 
GACTTGTTTAAGTTGTCTTGGGCTTCAAGACTTGAAGCTTACCCAACAAGGTCCACAGAT 
TGATAGTTTCTTTCATTGATCATGTTGCATTTCATTTGCAGTACATGTTATATGTGCAAA 
ACGAAGTTTTGAGGTTTAGACTAAATGATTTTACCTCTGGAGATTAAACGAAATGCTTTA 
CCTTTGATATTGGCTGGTCAATCATACAGGGTGAAGGAGTTGTCATTAAGGATCATCAAT 
CAACAGCCCCCATTTCGCAGCTGCTGGTATGTTCGTACTCACCTAACTAAATGATTTGTC 
TGAGCATGAACAGACTTGGTTGATAGATATAGAGAGTGAGACGAAAATGCACATGCACTT 
GTTTTTGGATCACCAAATGCCGATGAGGTAAACTCTTTCCTATTTGTGTACAGGGTGCTG 
ATGGATCTATGTTTGCCATCGCGCTTCTCGATATGTTCATCTACCCAAAAGTTACAACAT 
ACTTTGCGTTGATGCTCCGAGTGCATGTAATGTTTGTAAGAACTCTCTACATACTCTTCT 
TATTGAGTGAAAGTCATGTGAACCAACTAAAAAACTCGCACTCTTTAATCTTCAGCGAAT 
CATAAACTTAGGAGTCGAGATTTTAAATATTCCAGAGGTGTGTATAATCTGAAAACCTAA 
TCCGCATTTGCTTTGAAACACTATATATGACTATCTCTTAACCATAACTTTTTTGGGAAT 
GCAGGGGGGACCGAACCATATCGCATCGAGTTCGGGTCAGTTGGGAGGAGTTGTGGTTGC 
TGCCATGGCGTGGGCACGGATAAAGAAAGGTCGATTTTGATACTGAACTGATTTGCTGCA 
AAAGATTCAGTGGACCTGATCATCATCATGCAAGTTATGTTAAACTTCGTGACAAATGAG 
ATTTTTGTCAACATTTTTTCGTTTGGGTTGGACTTTTGTTTTGTAAACTGAAACCAAGAG 
ATCAATTTGATGGGTTTGGACCGTTTGGTTACATTGGAAACTATTTTTTTTTCTCT
>AT1G74140.3 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN integral to membrane chloroplast CONTAINS InterPro DOMAIN/s Peptidase S54 rhomboid (InterProIPR002610) BEST Arabidopsis thaliana protein match is rhomboid family protein (TAIRAT1G741301) Has 536 Blast hits to 536 proteins in 206 species Archae - 2 Bacteria - 298 Metazoa - 30 Fungi - 50 Plants - 36 Viruses - 0 Other Eukaryotes - 120 (source NCBI BLink) 
ATTCCGAAGAAAAAGGAGAAAAAATGCATGCGATTTTCTCGAGCTTCTCTCGCAAGGTCG 
TCGTCAACGTTGGTGCTTCTTCGCAATCGCAGCTAACCAAAATGGTGAAGAAGAAGCCAA 
ACCAGAGTCGCCATCTTCTTCCTTCTCGCCTCTCATCTCCTTCTTCAGTACCCCACTTTG 
TTCCTTCTGCTGTATCGCGATCGGCGAAAGTTCATGGGTTCTTCGCTAGCAAGTTGGGAA 
ACACCAACCTAAAGCTGAAATTTGGAAATGTTATGGAATCTAGAGCTGGGTTCTTCAGTT 
CTGAGCTTCCTAGTCATGGATTCGAGTCTGGTGGTTTTACTGGGTTTCAAAAGCGGGGAT 
GGTATGGAATCGTTTCTTTTCGTCTCTCTTTTTGTGATTTTGCTGGGTTTCAAAGTAAAG 
TTCGAAGCTTTCCTAAGAAAAGTTCTGAAATTTCTGAGTTCCTTTTCTCTATGATCCTCT 
GTTTCTCGTTAGCTAAGTAAAAGTCTACATTTTTCTGAGATAAACTTGTCTTCCGGATGA 
AATTTCTGGTTTTGATTTAGGGTTTAAACAGAGTTTCTGTGGTAAATTATCGACCTTTTT 
AAGCTTTTCTGGTTAAGTACACGTTTGGCTCGTCCAAGACATACGACATGTATGAGAATT 
TCAGAGAAGAAGATGAGAAATTAGTAGGCTTTACTTGCTATTTTGCCATTTCTGTGTAGT 
TGTTGATTTGAGTAGCGCAACGTTCATTTGATGATTTTCTCTTTATAGATAGACTAGCAG 
CTCGACACTGTACACACCCTTCTCATTAAAACTTTGTTCTCCAGATATAGAAATAACATC 
AGAATCTTAAAATTCTTCACCAGCTCTAGTTGATTTTGGTTTGTGTTTTCCTCTGGCAGG 
AAGTCTTGGATCAATGGAGCTAATGGTGTAGTTTTTGGACTGGTAATAGCTAATGCTGCT 
GTATTTACAATGTGGCGAGTTTCAGACAGGAGCTGGATGGTGAAAAATTTTGTGGTCTAA 
CATATTACTGCTAAGCTCTTGTCTTGGATTTGGTTGATCTTTGATATGTTTGAACCTTGA 
TTTTTGAGCAGCTATCAACGTACAGTTTTACGAGTGGATATATACACACGCTGATAACTT 
CGGGTTTTAGTCATATTGGTACCAGTCAAATCATCTTGAACATGATTGGAATCTCCTACT 
TCGGCTCCAGAGTATGAATTATCTTTCCTTGTCGTTTCTCCATAGAAAAATGCTTCTCAA 
TATTCGATTTTTGTAAATGAGATGAGAAATGCTTTCATTTGTTTCCTATGAAGTGGCTAC 
TTTCACTTTATTGGAGACACATTCACTAATAGCTAATATACTTAAGAAAGAAAAGTCCTT 
AGCTAGATACTATTAAGTCGGTTCTTTTTGGGAAATTTGCTACCTGCTGATCTATCCATT 
GAATTAACTATTTATCTTCATCTCTAAAATCCATTGTTGCTATTAGATTGCAAGAACCTT 
GGGACCGCTCTACCTTTTGAAGCTGTACTTTGCTGGAGCACTTGGTGGCTCTGTTTGCTT 
CCTGAGTTATCACGCCCTCTTGGCTACACTCAAGGTATATAGCAAGTTTCCTTAAATCAT 
GCTATAAAAAAGCTTTTAAAAGAATAACTCTCTTTTATTGATGATTAGAAAACTCTCTCA 
AAAACTAAACACCAAAGCTCTTTTAATGTTTTCTCTTATGTTTGCCACAACTCAACAAAC 
ATATATATAGGGTTAGCACACACTGCAATTTCCTATTCCTAGTCCTACATGAATTAGGAA 
ACTCCTTTTTTCCTTTTCTTCTTGAACTAGATAATTACTCTTTTTGACATAACTTAAACC 
GACTTGTTTAAGTTGTCTTGGGCTTCAAGACTTGAAGCTTACCCAACAAGGTCCACAGAT 
TGATAGTTTCTTTCATTGATCATGTTGCATTTCATTTGCAGTACATGTTATATGTGCAAA 
ACGAAGTTTTGAGGTTTAGACTAAATGATTTTACCTCTGGAGATTAAACGAAATGCTTTA 
CCTTTGATATTGGCTGGTCAATCATACAGGGTGAAGGAGTTGTCATTAAGGATCATCAAT 
CAACAGCCCCCATTTCGCAGCTGCTGGTATGTTCGTACTCACCTAACTAAATGATTTGTC 
TGAGCATGAACAGACTTGGTTGATAGATATAGAGAGTGAGACGAAAATGCACATGCACTT 
GTTTTTGGATCACCAAATGCCGATGAGGTAAACTCTTTCCTATTTGTGTACAGGGTGCTG 
ATGGATCTATGTTTGCCATCGCGCTTCTCGATATGTTCATCTACCCAAAAGTTACAACAT 
ACTTTGCGTTGATGCTCCGAGTGCATGTAATGTTTGTAAGAACTCTCTACATACTCTTCT 
TATTGAGTGAAAGTCATGTGAACCAACTAAAAAACTCGCACTCTTTAATCTTCAGCGAAT 
CATAAACTTAGGAGTCGAGATTTTAAATATTCCAGAGGTGTGTATAATCTGAAAACCTAA 
TCCGCATTTGCTTTGAAACACTATATATGACTATCTCTTAACCATAACTTTTTTGGGAAT 
GCAGGGGGGACCGAACCATATCGCATCGAGTTCGGGTCAGTTGGGAGGAGTTGTGGTTGC 
TGCCATGGCGTGGGCACGGATAAAGAAAGGTCGATTTTGATACTGAACTGATTTGCTGCA 
AAAGATTCAGTGGACCTGATCATCATCATGCAAGTTATGTTAAACTTCGTGACAAATGAG 
ATTTTTGTCAACATTTTTTCGTTTGGGTTGGACTTTTGTTTTGTAAACTGAAACCAAGAG 
ATCAATTTGATGGGTTTGGACCGTTTGGTTACATTGGAAACTATTTTTTTTTCTCT
>AT1G74140.4 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN integral to membrane chloroplast CONTAINS InterPro DOMAIN/s Peptidase S54 rhomboid (InterProIPR002610) BEST Arabidopsis thaliana protein match is rhomboid family protein (TAIRAT1G741301) Has 394 Blast hits to 394 proteins in 153 species Archae - 6 Bacteria - 196 Metazoa - 30 Fungi - 47 Plants - 36 Viruses - 0 Other Eukaryotes - 79 (source NCBI BLink) 
AAAAATGCATGCGATTTTCTCGAGCTTCTCTCGCAAGGTCGTCGTCAACGTTGGTGCTTC 
TTCGCAATCGCAGCTAACCAAAATGGTGAAGAAGAAGCCAAACCAGAGTCGCCATCTTCT 
TCCTTCTCGCCTCTCATCTCCTTCTTCAGTACCCCACTTTGTTCCTTCTGCTGTATCGCG 
ATCGGCGAAAGTTCATGGGTTCTTCGCTAGCAAGTTGGGAAACACCAACCTAAAGCTGAA 
ATTTGGAAATGTTATGGAATCTAGAGCTGGGTTCTTCAGTTCTGAGCTTCCTAGTCATGG 
ATTCGAGTCTGGTGGTTTTACTGGGTTTCAAAAGCGGGGATGGTATGGAATCGTTTCTTT 
TCGTCTCTCTTTTTGTGATTTTGCTGGGTTTCAAAGTAAAGTTCGAAGCTTTCCTAAGAA 
AAGTTCTGAAATTTCTGAGTTCCTTTTCTCTATGATCCTCTGTTTCTCGTTAGCTAAGTA 
AAAGTCTACATTTTTCTGAGATAAACTTGTCTTCCGGATGAAATTTCTGGTTTTGATTTA 
GGGTTTAAACAGAGTTTCTGTGGTAAATTATCGACCTTTTTAAGCTTTTCTGGTTAAGTA 
CACGTTTGGCTCGTCCAAGACATACGACATGTATGAGAATTTCAGAGAAGAAGATGAGAA 
ATTAGTAGGCTTTACTTGCTATTTTGCCATTTCTGTGTAGTTGTTGATTTGAGTAGCGCA 
ACGTTCATTTGATGATTTTCTCTTTATAGATAGACTAGCAGCTCGACACTGTACACACCC 
TTCTCATTAAAACTTTGTTCTCCAGATATAGAAATAACATCAGAATCTTAAAATTCTTCA 
CCAGCTCTAGTTGATTTTGGTTTGTGTTTTCCTCTGGCAGGAAGTCTTGGATCAATGGAG 
CTAATGGTGTAGTTTTTGGACTGGTAATAGCTAATGCTGCTGTATTTACAATGTGGCGAG 
TTTCAGACAGGAGCTGGATGGTGAAAAATTTTGTGGTCTAACATATTACTGCTAAGCTCT 
TGTCTTGGATTTGGTTGATCTTTGATATGTTTGAACCTTGATTTTTGAGCAGCTATCAAC 
GTACAGTTTTACGAGTGGATATATACACACGCTGATAACTTCGGGTTTTAGTCATATTGG 
TACCAGTCAAATCATCTTGAACATGATTGGAATCTCCTACTTCGGCTCCAGAGTATGAAT 
TATCTTTCCTTGTCGTTTCTCCATAGAAAAATGCTTCTCAATATTCGATTTTTGTAAATG 
AGATGAGAAATGCTTTCATTTGTTTCCTATGAAGTGGCTACTTTCACTTTATTGGAGACA 
CATTCACTAATAGCTAATATACTTAAGAAAGAAAAGTCCTTAGCTAGATACTATTAAGTC 
GGTTCTTTTTGGGAAATTTGCTACCTGCTGATCTATCCATTGAATTAACTATTTATCTTC 
ATCTCTAAAATCCATTGTTGCTATTAGATTGCAAGAACCTTGGGACCGCTCTACCTTTTG 
AAGCTGTACTTTGCTGGAGCACTTGGTGGCTCTGTTTGCTTCCTGAGTTATCACGCCCTC 
TTGGCTACACTCAAGGTATATAGCAAGTTTCCTTAAATCATGCTATAAAAAAGCTTTTAA 
AAGAATAACTCTCTTTTATTGATGATTAGAAAACTCTCTCAAAAACTAAACACCAAAGCT 
CTTTTAATGTTTTCTCTTATGTTTGCCACAACTCAACAAACATATATATAGGGTTAGCAC 
ACACTGCAATTTCCTATTCCTAGTCCTACATGAATTAGGAAACTCCTTTTTTCCTTTTCT 
TCTTGAACTAGATAATTACTCTTTTTGACATAACTTAAACCGACTTGTTTAAGTTGTCTT 
GGGCTTCAAGACTTGAAGCTTACCCAACAAGGTCCACAGATTGATAGTTTCTTTCATTGA 
TCATGTTGCATTTCATTTGCAGTACATGTTATATGTGCAAAACGAAGTTTTGAGGTTTAG 
ACTAAATGATTTTACCTCTGGAGATTAAACGAAATGCTTTACCTTTGATATTGGCTGGTC 
AATCATACAGGGTGAAGGAGTTGTCATTAAGGATCATCAATCAACAGCCCCCATTTCGCA 
GCTGCTGGTATGTTCGTACTCACCTAACTAAATGATTTGTCTGAGCATGAACAGACTTGG 
TTGATAGATATAGAGAGTGAGACGAAAATGCACATGCACTTGTTTTTGGATCACCAAATG 
CCGATGAGGTAAACTCTTTCCTATTTGTGTACAGGGTGCTGATGGATCTATGTTTGCCAT 
CGCGCTTCTCGATATGTTCATCTACCCAAAAGTTACAACATACTTTGCGTTGATGCTCCG 
AGTGCATGTAATGTTTGTAAGAACTCTCTACATACTCTTCTTATTGAGTGAAAGTCATGT 
GAACCAACTAAAAAACTCGCACTCTTTAATCTTCAGCGAATCATAAACTTAGGAGTCGAG 
ATTTTAAATATTCCAGAGGTGTGTATAATCTGAAAACCTAATCCGCATTTGCTTTGAAAC 
ACTATATATGACTATCTCTTAACCATAACTTTTTTGGGAATGCAGGGGGGACCGAACCAT 
ATCGCATCGAGTTCGGGTCAGTTGGGAGGAGTTGTGGTTGCTGCCATGGCGTGGGCACGG 
ATAAAGAAAGGTCGATTTTGATACTGAACTGATTTGCTGCAAAAGATTCAGTGGACCTGA 
TCATCATCATGCAAGTTATGTTAAACTTCGTGACAAATGAGATTTTTGTCAACATTTTTT 
CGTTTGGGTTGGACTTTTGTTTTGTAAACTGAAACCAAGAGATCAATTTGATGGGTTTGG 
ACCGTTTGGTTACATTGGAAACTATTTTT
>AT1G74140.4 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN integral to membrane chloroplast CONTAINS InterPro DOMAIN/s Peptidase S54 rhomboid (InterProIPR002610) BEST Arabidopsis thaliana protein match is rhomboid family protein (TAIRAT1G741301) Has 536 Blast hits to 536 proteins in 206 species Archae - 2 Bacteria - 298 Metazoa - 30 Fungi - 50 Plants - 36 Viruses - 0 Other Eukaryotes - 120 (source NCBI BLink) 
AAAAATGCATGCGATTTTCTCGAGCTTCTCTCGCAAGGTCGTCGTCAACGTTGGTGCTTC 
TTCGCAATCGCAGCTAACCAAAATGGTGAAGAAGAAGCCAAACCAGAGTCGCCATCTTCT 
TCCTTCTCGCCTCTCATCTCCTTCTTCAGTACCCCACTTTGTTCCTTCTGCTGTATCGCG 
ATCGGCGAAAGTTCATGGGTTCTTCGCTAGCAAGTTGGGAAACACCAACCTAAAGCTGAA 
ATTTGGAAATGTTATGGAATCTAGAGCTGGGTTCTTCAGTTCTGAGCTTCCTAGTCATGG 
ATTCGAGTCTGGTGGTTTTACTGGGTTTCAAAAGCGGGGATGGTATGGAATCGTTTCTTT 
TCGTCTCTCTTTTTGTGATTTTGCTGGGTTTCAAAGTAAAGTTCGAAGCTTTCCTAAGAA 
AAGTTCTGAAATTTCTGAGTTCCTTTTCTCTATGATCCTCTGTTTCTCGTTAGCTAAGTA 
AAAGTCTACATTTTTCTGAGATAAACTTGTCTTCCGGATGAAATTTCTGGTTTTGATTTA 
GGGTTTAAACAGAGTTTCTGTGGTAAATTATCGACCTTTTTAAGCTTTTCTGGTTAAGTA 
CACGTTTGGCTCGTCCAAGACATACGACATGTATGAGAATTTCAGAGAAGAAGATGAGAA 
ATTAGTAGGCTTTACTTGCTATTTTGCCATTTCTGTGTAGTTGTTGATTTGAGTAGCGCA 
ACGTTCATTTGATGATTTTCTCTTTATAGATAGACTAGCAGCTCGACACTGTACACACCC 
TTCTCATTAAAACTTTGTTCTCCAGATATAGAAATAACATCAGAATCTTAAAATTCTTCA 
CCAGCTCTAGTTGATTTTGGTTTGTGTTTTCCTCTGGCAGGAAGTCTTGGATCAATGGAG 
CTAATGGTGTAGTTTTTGGACTGGTAATAGCTAATGCTGCTGTATTTACAATGTGGCGAG 
TTTCAGACAGGAGCTGGATGGTGAAAAATTTTGTGGTCTAACATATTACTGCTAAGCTCT 
TGTCTTGGATTTGGTTGATCTTTGATATGTTTGAACCTTGATTTTTGAGCAGCTATCAAC 
GTACAGTTTTACGAGTGGATATATACACACGCTGATAACTTCGGGTTTTAGTCATATTGG 
TACCAGTCAAATCATCTTGAACATGATTGGAATCTCCTACTTCGGCTCCAGAGTATGAAT 
TATCTTTCCTTGTCGTTTCTCCATAGAAAAATGCTTCTCAATATTCGATTTTTGTAAATG 
AGATGAGAAATGCTTTCATTTGTTTCCTATGAAGTGGCTACTTTCACTTTATTGGAGACA 
CATTCACTAATAGCTAATATACTTAAGAAAGAAAAGTCCTTAGCTAGATACTATTAAGTC 
GGTTCTTTTTGGGAAATTTGCTACCTGCTGATCTATCCATTGAATTAACTATTTATCTTC 
ATCTCTAAAATCCATTGTTGCTATTAGATTGCAAGAACCTTGGGACCGCTCTACCTTTTG 
AAGCTGTACTTTGCTGGAGCACTTGGTGGCTCTGTTTGCTTCCTGAGTTATCACGCCCTC 
TTGGCTACACTCAAGGTATATAGCAAGTTTCCTTAAATCATGCTATAAAAAAGCTTTTAA 
AAGAATAACTCTCTTTTATTGATGATTAGAAAACTCTCTCAAAAACTAAACACCAAAGCT 
CTTTTAATGTTTTCTCTTATGTTTGCCACAACTCAACAAACATATATATAGGGTTAGCAC 
ACACTGCAATTTCCTATTCCTAGTCCTACATGAATTAGGAAACTCCTTTTTTCCTTTTCT 
TCTTGAACTAGATAATTACTCTTTTTGACATAACTTAAACCGACTTGTTTAAGTTGTCTT 
GGGCTTCAAGACTTGAAGCTTACCCAACAAGGTCCACAGATTGATAGTTTCTTTCATTGA 
TCATGTTGCATTTCATTTGCAGTACATGTTATATGTGCAAAACGAAGTTTTGAGGTTTAG 
ACTAAATGATTTTACCTCTGGAGATTAAACGAAATGCTTTACCTTTGATATTGGCTGGTC 
AATCATACAGGGTGAAGGAGTTGTCATTAAGGATCATCAATCAACAGCCCCCATTTCGCA 
GCTGCTGGTATGTTCGTACTCACCTAACTAAATGATTTGTCTGAGCATGAACAGACTTGG 
TTGATAGATATAGAGAGTGAGACGAAAATGCACATGCACTTGTTTTTGGATCACCAAATG 
CCGATGAGGTAAACTCTTTCCTATTTGTGTACAGGGTGCTGATGGATCTATGTTTGCCAT 
CGCGCTTCTCGATATGTTCATCTACCCAAAAGTTACAACATACTTTGCGTTGATGCTCCG 
AGTGCATGTAATGTTTGTAAGAACTCTCTACATACTCTTCTTATTGAGTGAAAGTCATGT 
GAACCAACTAAAAAACTCGCACTCTTTAATCTTCAGCGAATCATAAACTTAGGAGTCGAG 
ATTTTAAATATTCCAGAGGTGTGTATAATCTGAAAACCTAATCCGCATTTGCTTTGAAAC 
ACTATATATGACTATCTCTTAACCATAACTTTTTTGGGAATGCAGGGGGGACCGAACCAT 
ATCGCATCGAGTTCGGGTCAGTTGGGAGGAGTTGTGGTTGCTGCCATGGCGTGGGCACGG 
ATAAAGAAAGGTCGATTTTGATACTGAACTGATTTGCTGCAAAAGATTCAGTGGACCTGA 
TCATCATCATGCAAGTTATGTTAAACTTCGTGACAAATGAGATTTTTGTCAACATTTTTT 
CGTTTGGGTTGGACTTTTGTTTTGTAAACTGAAACCAAGAGATCAATTTGATGGGTTTGG 
ACCGTTTGGTTACATTGGAAACTATTTTT
>AT1G74140.5 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN integral to membrane chloroplast CONTAINS InterPro DOMAIN/s Peptidase S54 rhomboid (InterProIPR002610) BEST Arabidopsis thaliana protein match is rhomboid family protein (TAIRAT1G741301) Has 394 Blast hits to 394 proteins in 153 species Archae - 6 Bacteria - 196 Metazoa - 30 Fungi - 47 Plants - 36 Viruses - 0 Other Eukaryotes - 79 (source NCBI BLink) 
ATTCCGAAGAAAAAGGAGAAAAAATGCATGCGATTTTCTCGAGCTTCTCTCGCAAGGTCG 
TCGTCAACGTTGGTGCTTCTTCGCAATCGCAGCTAACCAAAATGGTGAAGAAGAAGCCAA 
ACCAGAGTCGCCATCTTCTTCCTTCTCGCCTCTCATCTCCTTCTTCAGTACCCCACTTTG 
TTCCTTCTGCTGTATCGCGATCGGCGAAAGTTCATGGGTTCTTCGCTAGCAAGTTGGGAA 
ACACCAACCTAAAGCTGAAATTTGGAAATGTTATGGAATCTAGAGCTGGGTTCTTCAGTT 
CTGAGCTTCCTAGTCATGGATTCGAGTCTGGTGGTTTTACTGGGTTTCAAAAGCGGGGAT 
GGTATGGAATCGTTTCTTTTCGTCTCTCTTTTTGTGATTTTGCTGGGTTTCAAAGTAAAG 
TTCGAAGCTTTCCTAAGAAAAGTTCTGAAATTTCTGAGTTCCTTTTCTCTATGATCCTCT 
GTTTCTCGTTAGCTAAGTAAAAGTCTACATTTTTCTGAGATAAACTTGTCTTCCGGATGA 
AATTTCTGGTTTTGATTTAGGGTTTAAACAGAGTTTCTGTGGTAAATTATCGACCTTTTT 
AAGCTTTTCTGGTTAAGTACACGTTTGGCTCGTCCAAGACATACGACATGTATGAGAATT 
TCAGAGAAGAAGATGAGAAATTAGTAGGCTTTACTTGCTATTTTGCCATTTCTGTGTAGT 
TGTTGATTTGAGTAGCGCAACGTTCATTTGATGATTTTCTCTTTATAGATAGACTAGCAG 
CTCGACACTGTACACACCCTTCTCATTAAAACTTTGTTCTCCAGATATAGAAATAACATC 
AGAATCTTAAAATTCTTCACCAGCTCTAGTTGATTTTGGTTTGTGTTTTCCTCTGGCAGG 
AAGTCTTGGATCAATGGAGCTAATGGTGTAGTTTTTGGACTGGTAATAGCTAATGCTGCT 
GTATTTACAATGTGGCGAGTTTCAGACAGGAGCTGGATGGTGAAAAATTTTGTGGTCTAA 
CATATTACTGCTAAGCTCTTGTCTTGGATTTGGTTGATCTTTGATATGTTTGAACCTTGA 
TTTTTGAGCAGCTATCAACGTACAGTTTTACGAGTGGATATATACACACGCTGATAACTT 
CGGGTTTTAGTCATATTGGTACCAGTCAAATCATCTTGAACATGATTGGAATCTCCTACT 
TCGGCTCCAGAGTATGAATTATCTTTCCTTGTCGTTTCTCCATAGAAAAATGCTTCTCAA 
TATTCGATTTTTGTAAATGAGATGAGAAATGCTTTCATTTGTTTCCTATGAAGTGGCTAC 
TTTCACTTTATTGGAGACACATTCACTAATAGCTAATATACTTAAGAAAGAAAAGTCCTT 
AGCTAGATACTATTAAGTCGGTTCTTTTTGGGAAATTTGCTACCTGCTGATCTATCCATT 
GAATTAACTATTTATCTTCATCTCTAAAATCCATTGTTGCTATTAGATTGCAAGAACCTT 
GGGACCGCTCTACCTTTTGAAGCTGTACTTTGCTGGAGCACTTGGTGGCTCTGTTTGCTT 
CCTGAGTTATCACGCCCTCTTGGCTACACTCAAGGTATATAGCAAGTTTCCTTAAATCAT 
GCTATAAAAAAGCTTTTAAAAGAATAACTCTCTTTTATTGATGATTAGAAAACTCTCTCA 
AAAACTAAACACCAAAGCTCTTTTAATGTTTTCTCTTATGTTTGCCACAACTCAACAAAC 
ATATATATAGGGTTAGCACACACTGCAATTTCCTATTCCTAGTCCTACATGAATTAGGAA 
ACTCCTTTTTTCCTTTTCTTCTTGAACTAGATAATTACTCTTTTTGACATAACTTAAACC 
GACTTGTTTAAGTTGTCTTGGGCTTCAAGACTTGAAGCTTACCCAACAAGGTCCACAGAT 
TGATAGTTTCTTTCATTGATCATGTTGCATTTCATTTGCAGTACATGTTATATGTGCAAA 
ACGAAGTTTTGAGGTTTAGACTAAATGATTTTACCTCTGGAGATTAAACGAAATGCTTTA 
CCTTTGATATTGGCTGGTCAATCATACAGGGTGAAGGAGTTGTCATTAAGGATCATCAAT 
CAACAGCCCCCATTTCGCAGCTGCTGGTATGTTCGTACTCACCTAACTAAATGATTTGTC 
TGAGCATGAACAGACTTGGTTGATAGATATAGAGAGTGAGACGAAAATGCACATGCACTT 
GTTTTTGGATCACCAAATGCCGATGAGGTAAACTCTTTCCTATTTGTGTACAGGGTGCTG 
ATGGATCTATGTTTGCCATCGCGCTTCTCGATATGTTCATCTACCCAAAAGTTACAACAT 
ACTTTGCGTTGATGCTCCGAGTGCATGTAATGTTTGTAAGAACTCTCTACATACTCTTCT 
TATTGAGTGAAAGTCATGTGAACCAACTAAAAAACTCGCACTCTTTAATCTTCAGCGAAT 
CATAAACTTAGGAGTCGAGATTTTAAATATTCCAGAGGTGTGTATAATCTGAAAACCTAA 
TCCGCATTTGCTTTGAAACACTATATATGACTATCTCTTAACCATAACTTTTTTGGGAAT 
GCAGGGGGGACCGAACCATATCGCATCGAGTTCGGGTCAGTTGGGAGGAGTTGTGGTTGC 
TGCCATGGCGTGGGCACGGATAAAGAAAGGTCGATTTTGATACTGAACTGATTTGCTGCA 
AAAGATTCAGTGGACCTGATCATCATCATGCAAGTTATGTTAAACTTCGTGACAAATGAG 
ATTTTTGTCAACATTTTTTCGTTTGGGTTGGACTTTTGTTTTGTAAACTGAAACCAAGAG 
ATCAATTTGATGGGTTTGGACCGTTTGGTTACATTGGAAACTATTTTTTTTTCTCT
>AT1G74140.5 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN integral to membrane chloroplast CONTAINS InterPro DOMAIN/s Peptidase S54 rhomboid (InterProIPR002610) BEST Arabidopsis thaliana protein match is rhomboid family protein (TAIRAT1G741301) Has 536 Blast hits to 536 proteins in 206 species Archae - 2 Bacteria - 298 Metazoa - 30 Fungi - 50 Plants - 36 Viruses - 0 Other Eukaryotes - 120 (source NCBI BLink) 
ATTCCGAAGAAAAAGGAGAAAAAATGCATGCGATTTTCTCGAGCTTCTCTCGCAAGGTCG 
TCGTCAACGTTGGTGCTTCTTCGCAATCGCAGCTAACCAAAATGGTGAAGAAGAAGCCAA 
ACCAGAGTCGCCATCTTCTTCCTTCTCGCCTCTCATCTCCTTCTTCAGTACCCCACTTTG 
TTCCTTCTGCTGTATCGCGATCGGCGAAAGTTCATGGGTTCTTCGCTAGCAAGTTGGGAA 
ACACCAACCTAAAGCTGAAATTTGGAAATGTTATGGAATCTAGAGCTGGGTTCTTCAGTT 
CTGAGCTTCCTAGTCATGGATTCGAGTCTGGTGGTTTTACTGGGTTTCAAAAGCGGGGAT 
GGTATGGAATCGTTTCTTTTCGTCTCTCTTTTTGTGATTTTGCTGGGTTTCAAAGTAAAG 
TTCGAAGCTTTCCTAAGAAAAGTTCTGAAATTTCTGAGTTCCTTTTCTCTATGATCCTCT 
GTTTCTCGTTAGCTAAGTAAAAGTCTACATTTTTCTGAGATAAACTTGTCTTCCGGATGA 
AATTTCTGGTTTTGATTTAGGGTTTAAACAGAGTTTCTGTGGTAAATTATCGACCTTTTT 
AAGCTTTTCTGGTTAAGTACACGTTTGGCTCGTCCAAGACATACGACATGTATGAGAATT 
TCAGAGAAGAAGATGAGAAATTAGTAGGCTTTACTTGCTATTTTGCCATTTCTGTGTAGT 
TGTTGATTTGAGTAGCGCAACGTTCATTTGATGATTTTCTCTTTATAGATAGACTAGCAG 
CTCGACACTGTACACACCCTTCTCATTAAAACTTTGTTCTCCAGATATAGAAATAACATC 
AGAATCTTAAAATTCTTCACCAGCTCTAGTTGATTTTGGTTTGTGTTTTCCTCTGGCAGG 
AAGTCTTGGATCAATGGAGCTAATGGTGTAGTTTTTGGACTGGTAATAGCTAATGCTGCT 
GTATTTACAATGTGGCGAGTTTCAGACAGGAGCTGGATGGTGAAAAATTTTGTGGTCTAA 
CATATTACTGCTAAGCTCTTGTCTTGGATTTGGTTGATCTTTGATATGTTTGAACCTTGA 
TTTTTGAGCAGCTATCAACGTACAGTTTTACGAGTGGATATATACACACGCTGATAACTT 
CGGGTTTTAGTCATATTGGTACCAGTCAAATCATCTTGAACATGATTGGAATCTCCTACT 
TCGGCTCCAGAGTATGAATTATCTTTCCTTGTCGTTTCTCCATAGAAAAATGCTTCTCAA 
TATTCGATTTTTGTAAATGAGATGAGAAATGCTTTCATTTGTTTCCTATGAAGTGGCTAC 
TTTCACTTTATTGGAGACACATTCACTAATAGCTAATATACTTAAGAAAGAAAAGTCCTT 
AGCTAGATACTATTAAGTCGGTTCTTTTTGGGAAATTTGCTACCTGCTGATCTATCCATT 
GAATTAACTATTTATCTTCATCTCTAAAATCCATTGTTGCTATTAGATTGCAAGAACCTT 
GGGACCGCTCTACCTTTTGAAGCTGTACTTTGCTGGAGCACTTGGTGGCTCTGTTTGCTT 
CCTGAGTTATCACGCCCTCTTGGCTACACTCAAGGTATATAGCAAGTTTCCTTAAATCAT 
GCTATAAAAAAGCTTTTAAAAGAATAACTCTCTTTTATTGATGATTAGAAAACTCTCTCA 
AAAACTAAACACCAAAGCTCTTTTAATGTTTTCTCTTATGTTTGCCACAACTCAACAAAC 
ATATATATAGGGTTAGCACACACTGCAATTTCCTATTCCTAGTCCTACATGAATTAGGAA 
ACTCCTTTTTTCCTTTTCTTCTTGAACTAGATAATTACTCTTTTTGACATAACTTAAACC 
GACTTGTTTAAGTTGTCTTGGGCTTCAAGACTTGAAGCTTACCCAACAAGGTCCACAGAT 
TGATAGTTTCTTTCATTGATCATGTTGCATTTCATTTGCAGTACATGTTATATGTGCAAA 
ACGAAGTTTTGAGGTTTAGACTAAATGATTTTACCTCTGGAGATTAAACGAAATGCTTTA 
CCTTTGATATTGGCTGGTCAATCATACAGGGTGAAGGAGTTGTCATTAAGGATCATCAAT 
CAACAGCCCCCATTTCGCAGCTGCTGGTATGTTCGTACTCACCTAACTAAATGATTTGTC 
TGAGCATGAACAGACTTGGTTGATAGATATAGAGAGTGAGACGAAAATGCACATGCACTT 
GTTTTTGGATCACCAAATGCCGATGAGGTAAACTCTTTCCTATTTGTGTACAGGGTGCTG 
ATGGATCTATGTTTGCCATCGCGCTTCTCGATATGTTCATCTACCCAAAAGTTACAACAT 
ACTTTGCGTTGATGCTCCGAGTGCATGTAATGTTTGTAAGAACTCTCTACATACTCTTCT 
TATTGAGTGAAAGTCATGTGAACCAACTAAAAAACTCGCACTCTTTAATCTTCAGCGAAT 
CATAAACTTAGGAGTCGAGATTTTAAATATTCCAGAGGTGTGTATAATCTGAAAACCTAA 
TCCGCATTTGCTTTGAAACACTATATATGACTATCTCTTAACCATAACTTTTTTGGGAAT 
GCAGGGGGGACCGAACCATATCGCATCGAGTTCGGGTCAGTTGGGAGGAGTTGTGGTTGC 
TGCCATGGCGTGGGCACGGATAAAGAAAGGTCGATTTTGATACTGAACTGATTTGCTGCA 
AAAGATTCAGTGGACCTGATCATCATCATGCAAGTTATGTTAAACTTCGTGACAAATGAG 
ATTTTTGTCAACATTTTTTCGTTTGGGTTGGACTTTTGTTTTGTAAACTGAAACCAAGAG 
ATCAATTTGATGGGTTTGGACCGTTTGGTTACATTGGAAACTATTTTTTTTTCTCT
>AT1G59580.1 |  ATMPK2 (ARABIDOPSIS THALIANA MITOGEN-ACTIVATED PROTEIN KINASE HOMOLOG 2) MAP kinase/ kinase/ protein kinase 
AGCTCTAAAATGGTAATGAACGAGAAACTAGAGAGTCAACGGTCATAGCGTTGACCAAAT 
CTCTCACTCTCACCAGCCAAATCTCTCCAACTTCCTCTTTCTCTCCACCCAAATCTCGCC 
GGCGATCGTCACTGACGAATCTGATTCGTCCCCAACACTCAACGGGTAAGCTCTCTCTGC 
AAAATTCAGATCGTTCTAGCTCAATCTCAACATCGGATCAAGATTCAACTTTTGTTCTTT 
TCTCTTAAACAAAGCTTTTGATCATTAGTGAAATCTAAAATATTAACCTTTGAATCTGTA 
AACTTAAGCTTAGAGAACCCTAAGATCAAAACTAAAAATTTGATCTTTCATTGTCACTTG 
TTGGAAGCAATTCAAAATTCTGTGTTGTGTATATTTTCAAATTTGTGGATTGTTCCAATT 
TCGATAAAGTTTTGATTTTTATTCCTTTTGTTGTGTGTTTTTATAAGGAGGTAGAAGAAT 
GGCGACTCCTGTTGATCCACCTAATGGAATTAGGAATCAAGGGAAGCATTACTTCTCAAT 
GTGGCAAACACTTTTCGAGATCGATACCAAATACGTGCCTATCAAACCGATAGGCCGAGG 
CGCGTACGGTGTGGTTTGCTCTTCGGTTAACAGAGAGAGTAATGAGAGAGTGGCGATCAA 
GAAGATCCACAATGTGTTTGAGAATAGGATTGATGCGTTGAGGACTCTTAGGGAGCTCAA 
GCTTCTACGTCATCTTCGACATGAGAATGTGGTTGCTCTTAAAGATGTAATGATGGCTAA 
TCATAAGAGAAGCTTTAAGGATGTTTATCTTGTTTATGAGCTTATGGATACTGATCTTCA 
TCAGATTATTAAGTCTTCTCAAGTTCTAAGTAATGACCATTGCCAATACTTCTTGTTCCA 
GGTATGTGATTTGTTTGCATTTGCTTTTACCATTAGTGTGTGTTGTTTGCATTTGGTGTT 
TGCATTTACTTGCTTTTTATTTCTTTTGCATAGTTGCTTCGAGGGCTCAAGTATATTCAT 
TCAGCAAACATTCTCCATCGGGATCTGAAACCCGGTAACCTCCTTGTGAATGCAAACTGC 
GACTTAAAGATATGTGACTTTGGGCTAGCGAGGACGAGCAACACCAAAGGTCAGTTCATG 
ACTGAATATGTTGTGACTAGATGGTACCGAGCACCAGAGCTACTCCTCTGTTGTGACAAC 
TATGGAACCTCCATTGATGTCTGGTCAGTCGGTTGCATATTCGCCGAGCTTCTTGGAAGA 
AAACCAGTATTCCCGGGAACAGAATGTCTAAACCAGATTAAACTCATCATTAACATTTTG 
GGTAGCCAGAGAGAGGAAGATCTCGAGTTTATAGATAACCCAAAAGCCAAAAGATACATA 
GAATCACTCCCTTACTCACCAGGGATATCATTCTCTCGTCTTTACCCGGGTGCAAATGTT 
TTAGCCATTGATCTGCTTCAGAAAATGCTCGTTCTTGACCCTTCGAAAAGGATTAGTGTC 
ACGGAAGCGCTTCAACATCCTTACATGGCGCCTTTATATGACCCGAGTGCAAATCCTCCT 
GCTCAAGTTCCTATTGATCTCGATGTAGATGAAGACGAGGATTTGGGAGCAGAGATGATA 
AGAGAATTAATGTGGAAGGAAATGATTCATTATCATCCAGAAGCTGCTACCATAAACAAC 
AATGAGGTCTCTGAGTTTTGAGATCAAGTCTTTTGCAGGTACTGTTCAGAGAGATCTTTC 
AACAACTTAAACTTATTTATTTTCATGTTTGCTTCTTGAAATGTTGTTAATAATAGTGTC 
GAAGAAGAAGAACAATAACTTTAAATTATGTGATATGTTTGTGTCATGGAGCTTTGTTTT 
GTTTTTGGTTACAAGCTTAAGTGTCTGTAACGTTTGTACATAAGTGTATGTGTCCTTAAA 
ACTTCATACTCTGTCTCTGATCTTATTGTTGGTGATTTTTCTTATTGAAGATTTTCGAAT 
GTTCAAAGAGCAAACCATTGGATTAAATA
>AT1G59580.2 |  ATMPK2 (ARABIDOPSIS THALIANA MITOGEN-ACTIVATED PROTEIN KINASE HOMOLOG 2) MAP kinase/ kinase/ protein kinase 
AAATCTCTCACTCTCACCAGCCAAATCTCTCCAACTTCCTCTTTCTCTCCACCCAAATCT 
CGCCGGCGATCGTCACTGACGAATCTGATTCGTCCCCAACACTCAACGGGTAAGCTCTCT 
CTGCAAAATTCAGATCGTTCTAGCTCAATCTCAACATCGGATCAAGATTCAACTTTTGTT 
CTTTTCTCTTAAACAAAGCTTTTGATCATTAGTGAAATCTAAAATATTAACCTTTGAATC 
TGTAAACTTAAGCTTAGAGAACCCTAAGATCAAAACTAAAAATTTGATCTTTCATTGTCA 
CTTGTTGGAAGCAATTCAAAATTCTGTGTTGTGTATATTTTCAAATTTGTGGATTGTTCC 
AATTTCGATAAAGTTTTGATTTTTATTCCTTTTGTTGTGTGTTTTTATAAGGAGGTAGAA 
GAATGGCGACTCCTGTTGATCCACCTAATGGAATTAGGAATCAAGGGAAGCATTACTTCT 
CAATGTGGCAAACACTTTTCGAGATCGATACCAAATACGTGCCTATCAAACCGATAGGCC 
GAGGCGCGTACGGTGTGGTTTGCTCTTCGGTTAACAGAGAGAGTAATGAGAGAGTGGCGA 
TCAAGAAGATCCACAATGTGTTTGAGAATAGGATTGATGCGTTGAGGACTCTTAGGGAGC 
TCAAGCTTCTACGTCATCTTCGACATGAGAATGTGGTTGCTCTTAAAGATGTAATGATGG 
CTAATCATAAGAGAAGCTTTAAGGATGTTTATCTTGTTTATGAGCTTATGGATACTGATC 
TTCATCAGATTATTAAGTCTTCTCAAGTTCTAAGTAATGACCATTGCCAATACTTCTTGT 
TCCAGGTATGTGATTTGTTTGCATTTGCTTTTACCATTAGTGTGTGTTGTTTGCATTTGG 
TGTTTGCATTTACTTGCTTTTTATTTCTTTTGCATAGTTGCTTCGAGGGCTCAAGTATAT 
TCATTCAGCAAACATTCTCCATCGGGATCTGAAACCCGGTAACCTCCTTGTGAATGCAAA 
CTGCGACTTAAAGATATGTGACTTTGGGCTAGCGAGGACGAGCAACACCAAAGGTCAGTT 
CATGACTGAATATGTTGTGACTAGATGGTACCGAGCACCAGAGCTACTCCTCTGTTGTGA 
CAACTATGGAACCTCCATTGATGTCTGGTCAGTCGGTTGCATATTCGCCGAGCTTCTTGG 
AAGAAAACCAGTATTCCCGGGAACAGAATGTCTAAACCAGATTAAACTCATCATTAACAT 
TTTGGGTAGCCAGAGAGAGGAAGATCTCGAGTTTATAGATAACCCAAAAGCCAAAAGATA 
CATAGAATCACTCCCTTACTCACCAGGGATATCATTCTCTCGTCTTTACCCGGGTGCAAA 
TGTTTTAGCCATTGATCTGCTTCAGAAAATGCTCGTTCTTGACCCTTCGAAAAGGATTAG 
TGTCACGGAAGCGCTTCAACATCCTTACATGGCGCCTTTATATGACCCGAGTGCAAATCC 
TCCTGCTCAAGTTCCTATTGATCTCGATGTAGATGAAGACGAGGATTTGGGAGCAGAGAT 
GATAAGAGAATTAATGTGGAAGGAAATGATTCATTATCATCCAGAAGCTGCTACCATAAA 
CAACAATGAGGTCTCTGAGTTTTGAGATCAAGTCTTTTGCAGGTACTGTTCAGAGAGATC 
TTTCAACAACTTAAACTTATTTATTTTCATGTTTGCTTCTTGAAATGTTGTTAATAATAG 
TGTCGAAGAAGAAGAACAATAACTTTAAATTATGTGATATGTTTGTGTCATGGAGCTTTG 
TTTTGTTTTTGGTTACAAGCTTAAGTGTCTGTAACGTTTGTACATAAGTGTATGTGTCCT 
TAAAACTTCATACTCTGTCTCTGATCTTATTGTTGGTGATTTTTCTTATTGAAGATTTTC 
GAATGTTCAAAGAGC