>AT5G04510.2 |  PDK1 (3-PHOSPHOINOSITIDE-DEPENDENT PROTEIN KINASE 1) 3-phosphoinositide-dependent protein kinase/ kinase/ phosphoinositide binding / protein binding / protein kinase 
GAACGAATCATTGCAAAGAGCCAAAGATCCAAAATTTGCAACAAAAACAAAAACTCTACC 
AAGTGAAGAAGAAAGAGAAGTGTCACATCTCTCTCTCTATCTCTTTGATCCAAACAAATT 
CCCTCATATATCTCTGCTCTCTCTTCCCAATATTCTTCCTTTTTCTTCAATCGAGATTTT 
ATCGCTTTTTGATTCGAGAATTTCGAAATCGGAACCCACCACCACCACCCTCCATCGTCG 
CCAACAGATCCAACGGTTCTCTGATTATTTTTATTTGAAAATTGTCTTCTTTGTTCTCCC 
TCTCGCTGGGTTTGATTCTGAGATTGATTAATCTAAATGGGTCGCATCGAGATTTAGAAG 
AAAAGGGAAATGTTGGCAATGGAGAAAGAATTTGATTCAAAGCTTGTTCTTCAAGGGAAC 
TCATCCAACGGTGCTAATGTTTCTAGAAGCAAAAGCTTCTCCTTTAAAGCTCCTCAAGAA 
AATTTCACCAGCCATGATTTCGAATTTGGCAAGATCTATGGTGTTGGTTCTTACTCTAAG 
GTTTCTATTTTCATTACTATTTTCTATTGTGGCTTTGATTCATGAACCCAGCTCGATTCT 
CTGGTTGATTTTTTTGTTGCAATGTGGTGTTGATATTGGAAAGGATTGGTTTTTGACCTG 
TGGGTGATTCATATTAGGTTGTTAGGGCAAAGAAGAAGGAAACTGGAACTGTGTATGCTT 
TAAAGATTATGGACAAAAAGTTTATCACCAAGGAGAATAAAACTGCTTATGTGAAATTGG 
AAAGGATTGTTCTTGATCAACTTGAACATCCTGGGATCATTAAACTTTACTTCACGTTTC 
AAGACACATCCTCACTATGTAGGTATTTCTAATTCTAATGAACATTCCTGTGCTTGAACA 
TTCTGTAGTTGATTGATTTTTCTTATTTCTGCTCGGTATGCTTTTGGTAGATATGGCACT 
TGAATCTTGTGAGGGTGGCGAGCTTTTCGACCAAATAACCAGAGTAAGATGATCGCTGAA 
TCTTGGGAGATTGATTCATTCAATGTTGTCGGAATTTGACATAACATCATGGTTCCTTTT 
TTTGTGCAGAAAGGTCGGCTATCGGAGGATGAAGCTCGGTTCTACACTGCAGAAGTTGTG 
GATGCTCTTGAGTATATACATAGTATGGGACTGATTCATCGAGATATTAAGGTAGTTTCT 
CTTTCGAATAGCTTCTCTTTTATGGTCTTAACAATAGCTGTGCTAAGTGTTTTTCTTGAA 
ATGGTAGCCGGAGAATCTGTTGCTGACTTCAGATGGACACATTAAGATTGCGGATTTTGG 
AAGTGTAAAGCCGATGCAGGATAGCCAGATCACAGTTCTACCTAATGCAGCTTCTGGTAA 
CATGACCGTCCATAATCCTTTCTGGTATCTCTGAGCTTTAAGGTCTTATCAATAATTATT 
GTCAGATTCTGCTTTAAGCCTGTCTCTGTAAAACTGGTGACGTATACAGACGATAAGGCG 
TGCACTTTTGTCGGGACTGCTGCATATGTTCCTCCAGAAGTTCTCAACTCCTCTCCCGCA 
ACTTTCGGGTAAGAGTTTTCTGTTTCTGTAATTTGCGTTTTGGTTTTGTGGATTGCGATG 
CCTCTCTTACTTTGTACAATTTCTTTGGTACTCTGTGGAACACCAGGAATGATCTTTGGG 
CTCTCGGCTGCACTCTCTATCAGATGCTTTCGGGGACTTCCCCATTTAAAGATGCAAGTG 
AATGGCTGATTTTCCAAAGAATTATAGCCAGAGATATAAAGTTCCCAAATCATTTTTCAG 
AAGCAGCAAGAGACCTCATCGACCGGTTGCTGGTAAGATGCATCAAAAGCCGTTGTCACC 
AATCAAATGATATGGCAGTAACTCTTTAAATGTGCGATTCTCTCAGGATACCGAGCCAAG 
CAGAAGGCCAGGTGCTGGCTCAGAAGGTTATGTTGCTCTTAAGAGACATCCTTTCTTTAA 
TGGAGTTGACTGGAAGAATCTAAGGTCCCAGACTCCTCCAAAACTAGCTCCAGATCCTGC 
GGTATGTCTCCTCATGTACTTTGGAATTATGAGGCCCTTTATTATCATTATTGTTTTACT 
CGGTTATTAAGTCAATTTTTTTACATCTGATGCAGTCTCAGACAGCATCTCCCGAGAGGG 
ATGACACACATGGTTCTCCATGGAACCTGACACATATTGGAGATTCTTTAGCCACACAGA 
ACGAGGGGCACAGTGCTCCTCCTACATCTTCTGAATCATCGGGTTCCATAACTCGACTTG 
CTTCAATAGACTCTTTTGATTCAAGATGGTGGGGGCATTTAAGCTTCATTTGTTTTCTGG 
GCAGCAACCTTGTTCAAAGCTCCTAAAACTATACTTCAGTTAGACAATCTGCTAATTTTT 
TTCTTGTGAATATTTGATAAACAGGCAACAGTTTTTAGAGCCAGGAGAATCGGTTCTGAT 
GATATCAGCGGTGAAGAAGCTTCAGAAAATAACGAGCAAGAAGGTGCAGCTAATACTCAC 
CAACAAACCCAAGCTGATCTATGTCGACCCGTCAAAACTAGTTGTGAAAGGAAACATTAT 
ATGGTCTGATAACTCGAATGACCTCAACGTTGTAGTCACTAGCCCTTCACATTTCAAGAT 
TTGCACGGTACTCATCATTTCCCTATTCTTCTATCCACCAAAGATCAAAGATGATGAAAC 
TGAAATTGTGACCTCTTCTTGCATTTGTGCAGCCAAAGAAGGTTTTATCATTTGAAGACG 
CAAAACAGAGAGCTTCAGTGTGGAAAAAGGCAATCGAGACTCTTCAGAACCGCTGAGACA 
AACTCCACTGAAGAATGGTTCTGTTAATTTCCTCGTTGGTGTTTTGTCTTTCCTTCAAGA 
TTTTAAACTGACCAAACTCCATTCTTCATATGTTTTTTCTATTTCACGCATCCTTGAAAA 
GACCAGTTGGACACAAAACTGAAAAACTGCTTTTTGAATCCTTTGTATAAGTCAAACAAA 
TGTATTACAAGGTTAAAGTTACATGAGATACTGAATTCAAGC
>AT5G04510.2 |  PDK1 (3-PHOSPHOINOSITIDE-DEPENDENT PROTEIN KINASE 1) 3-phosphoinositide-dependent protein kinase/ kinase/ phosphoinositide binding / protein binding / protein kinase 
GAACGAATCATTGCAAAGAGCCAAAGATCCAAAATTTGCAACAAAAACAAAAACTCTACC 
AAGTGAAGAAGAAAGAGAAGTGTCACATCTCTCTCTCTATCTCTTTGATCCAAACAAATT 
CCCTCATATATCTCTGCTCTCTCTTCCCAATATTCTTCCTTTTTCTTCAATCGAGATTTT 
ATCGCTTTTTGATTCGAGAATTTCGAAATCGGAACCCACCACCACCACCCTCCATCGTCG 
CCAACAGATCCAACGGTTCTCTGATTATTTTTATTTGAAAATTGTCTTCTTTGTTCTCCC 
TCTCGCTGGGTTTGATTCTGAGATTGATTAATCTAAATGGGTCGCATCGAGATTTAGAAG 
AAAAGGGAAATGTTGGCAATGGAGAAAGAATTTGATTCAAAGCTTGTTCTTCAAGGGAAC 
TCATCCAACGGTGCTAATGTTTCTAGAAGCAAAAGCTTCTCCTTTAAAGCTCCTCAAGAA 
AATTTCACCAGCCATGATTTCGAATTTGGCAAGATCTATGGTGTTGGTTCTTACTCTAAG 
GTTTCTATTTTCATTACTATTTTCTATTGTGGCTTTGATTCATGAACCCAGCTCGATTCT 
CTGGTTGATTTTTTTGTTGCAATGTGGTGTTGATATTGGAAAGGATTGGTTTTTGACCTG 
TGGGTGATTCATATTAGGTTGTTAGGGCAAAGAAGAAGGAAACTGGAACTGTGTATGCTT 
TAAAGATTATGGACAAAAAGTTTATCACCAAGGAGAATAAAACTGCTTATGTGAAATTGG 
AAAGGATTGTTCTTGATCAACTTGAACATCCTGGGATCATTAAACTTTACTTCACGTTTC 
AAGACACATCCTCACTATGTAGGTATTTCTAATTCTAATGAACATTCCTGTGCTTGAACA 
TTCTGTAGTTGATTGATTTTTCTTATTTCTGCTCGGTATGCTTTTGGTAGATATGGCACT 
TGAATCTTGTGAGGGTGGCGAGCTTTTCGACCAAATAACCAGAGTAAGATGATCGCTGAA 
TCTTGGGAGATTGATTCATTCAATGTTGTCGGAATTTGACATAACATCATGGTTCCTTTT 
TTTGTGCAGAAAGGTCGGCTATCGGAGGATGAAGCTCGGTTCTACACTGCAGAAGTTGTG 
GATGCTCTTGAGTATATACATAGTATGGGACTGATTCATCGAGATATTAAGGTAGTTTCT 
CTTTCGAATAGCTTCTCTTTTATGGTCTTAACAATAGCTGTGCTAAGTGTTTTTCTTGAA 
ATGGTAGCCGGAGAATCTGTTGCTGACTTCAGATGGACACATTAAGATTGCGGATTTTGG 
AAGTGTAAAGCCGATGCAGGATAGCCAGATCACAGTTCTACCTAATGCAGCTTCTGGTAA 
CATGACCGTCCATAATCCTTTCTGGTATCTCTGAGCTTTAAGGTCTTATCAATAATTATT 
GTCAGATTCTGCTTTAAGCCTGTCTCTGTAAAACTGGTGACGTATACAGACGATAAGGCG 
TGCACTTTTGTCGGGACTGCTGCATATGTTCCTCCAGAAGTTCTCAACTCCTCTCCCGCA 
ACTTTCGGGTAAGAGTTTTCTGTTTCTGTAATTTGCGTTTTGGTTTTGTGGATTGCGATG 
CCTCTCTTACTTTGTACAATTTCTTTGGTACTCTGTGGAACACCAGGAATGATCTTTGGG 
CTCTCGGCTGCACTCTCTATCAGATGCTTTCGGGGACTTCCCCATTTAAAGATGCAAGTG 
AATGGCTGATTTTCCAAAGAATTATAGCCAGAGATATAAAGTTCCCAAATCATTTTTCAG 
AAGCAGCAAGAGACCTCATCGACCGGTTGCTGGTAAGATGCATCAAAAGCCGTTGTCACC 
AATCAAATGATATGGCAGTAACTCTTTAAATGTGCGATTCTCTCAGGATACCGAGCCAAG 
CAGAAGGCCAGGTGCTGGCTCAGAAGGTTATGTTGCTCTTAAGAGACATCCTTTCTTTAA 
TGGAGTTGACTGGAAGAATCTAAGGTCCCAGACTCCTCCAAAACTAGCTCCAGATCCTGC 
GGTATGTCTCCTCATGTACTTTGGAATTATGAGGCCCTTTATTATCATTATTGTTTTACT 
CGGTTATTAAGTCAATTTTTTTACATCTGATGCAGTCTCAGACAGCATCTCCCGAGAGGG 
ATGACACACATGGTTCTCCATGGAACCTGACACATATTGGAGATTCTTTAGCCACACAGA 
ACGAGGGGCACAGTGCTCCTCCTACATCTTCTGAATCATCGGGTTCCATAACTCGACTTG 
CTTCAATAGACTCTTTTGATTCAAGATGGTGGGGGCATTTAAGCTTCATTTGTTTTCTGG 
GCAGCAACCTTGTTCAAAGCTCCTAAAACTATACTTCAGTTAGACAATCTGCTAATTTTT 
TTCTTGTGAATATTTGATAAACAGGCAACAGTTTTTAGAGCCAGGAGAATCGGTTCTGAT 
GATATCAGCGGTGAAGAAGCTTCAGAAAATAACGAGCAAGAAGGTGCAGCTAATACTCAC 
CAACAAACCCAAGCTGATCTATGTCGACCCGTCAAAACTAGTTGTGAAAGGAAACATTAT 
ATGGTCTGATAACTCGAATGACCTCAACGTTGTAGTCACTAGCCCTTCACATTTCAAGAT 
TTGCACGGTACTCATCATTTCCCTATTCTTCTATCCACCAAAGATCAAAGATGATGAAAC 
TGAAATTGTGACCTCTTCTTGCATTTGTGCAGCCAAAGAAGGTTTTATCATTTGAAGACG 
CAAAACAGAGAGCTTCAGTGTGGAAAAAGGCAATCGAGACTCTTCAGAACCGCTGAGACA 
AACTCCACTGAAGAATGGTTCTGTTAATTTCCTCGTTGGTGTTTTGTCTTTCCTTCAAGA 
TTTTAAACTGACCAAACTCCATTCTTCATATGTTTTTTCTATTTCACGCATCCTTGAAAA 
GACCAGTTGGACACAAAACTGAAAAACTGCTTTTTGAATCCTTTGTATAAGTCAAACAAA 
TGTATTACAAGGTTAAAGTTACATGAGATACTGAATTCAAGC
>AT5G04510.1 |  PDK1 (3-PHOSPHOINOSITIDE-DEPENDENT PROTEIN KINASE 1) 3-phosphoinositide-dependent protein kinase/ kinase/ phosphoinositide binding / protein binding / protein kinase 
GAACGAATCATTGCAAAGAGCCAAAGATCCAAAATTTGCAACAAAAACAAAAACTCTACC 
AAGTGAAGAAGAAAGAGAAGTGTCACATCTCTCTCTCTATCTCTTTGATCCAAACAAATT 
CCCTCATATATCTCTGCTCTCTCTTCCCAATATTCTTCCTTTTTCTTCAATCGAGATTTT 
ATCGCTTTTTGATTCGAGAATTTCGAAATCGGAACCCACCACCACCACCCTCCATCGTCG 
CCAACAGATCCAACGGTTCTCTGATTATTTTTATTTGAAAATTGTCTTCTTTGTTCTCCC 
TCTCGCTGGGTTTGATTCTGAGATTGATTAATCTAAATGGGTCGCATCGAGATTTAGAAG 
AAAAGGGAAATGTTGGCAATGGAGAAAGAATTTGATTCAAAGCTTGTTCTTCAAGGGAAC 
TCATCCAACGGTGCTAATGTTTCTAGAAGCAAAAGCTTCTCCTTTAAAGCTCCTCAAGAA 
AATTTCACCAGCCATGATTTCGAATTTGGCAAGATCTATGGTGTTGGTTCTTACTCTAAG 
GTTTCTATTTTCATTACTATTTTCTATTGTGGCTTTGATTCATGAACCCAGCTCGATTCT 
CTGGTTGATTTTTTTGTTGCAATGTGGTGTTGATATTGGAAAGGATTGGTTTTTGACCTG 
TGGGTGATTCATATTAGGTTGTTAGGGCAAAGAAGAAGGAAACTGGAACTGTGTATGCTT 
TAAAGATTATGGACAAAAAGTTTATCACCAAGGAGAATAAAACTGCTTATGTGAAATTGG 
AAAGGATTGTTCTTGATCAACTTGAACATCCTGGGATCATTAAACTTTACTTCACGTTTC 
AAGACACATCCTCACTATGTAGGTATTTCTAATTCTAATGAACATTCCTGTGCTTGAACA 
TTCTGTAGTTGATTGATTTTTCTTATTTCTGCTCGGTATGCTTTTGGTAGATATGGCACT 
TGAATCTTGTGAGGGTGGCGAGCTTTTCGACCAAATAACCAGAGTAAGATGATCGCTGAA 
TCTTGGGAGATTGATTCATTCAATGTTGTCGGAATTTGACATAACATCATGGTTCCTTTT 
TTTGTGCAGAAAGGTCGGCTATCGGAGGATGAAGCTCGGTTCTACACTGCAGAAGTTGTG 
GATGCTCTTGAGTATATACATAGTATGGGACTGATTCATCGAGATATTAAGGTAGTTTCT 
CTTTCGAATAGCTTCTCTTTTATGGTCTTAACAATAGCTGTGCTAAGTGTTTTTCTTGAA 
ATGGTAGCCGGAGAATCTGTTGCTGACTTCAGATGGACACATTAAGATTGCGGATTTTGG 
AAGTGTAAAGCCGATGCAGGATAGCCAGATCACAGTTCTACCTAATGCAGCTTCTGGTAA 
CATGACCGTCCATAATCCTTTCTGGTATCTCTGAGCTTTAAGGTCTTATCAATAATTATT 
GTCAGATTCTGCTTTAAGCCTGTCTCTGTAAAACTGGTGACGTATACAGACGATAAGGCG 
TGCACTTTTGTCGGGACTGCTGCATATGTTCCTCCAGAAGTTCTCAACTCCTCTCCCGCA 
ACTTTCGGGTAAGAGTTTTCTGTTTCTGTAATTTGCGTTTTGGTTTTGTGGATTGCGATG 
CCTCTCTTACTTTGTACAATTTCTTTGGTACTCTGTGGAACACCAGGAATGATCTTTGGG 
CTCTCGGCTGCACTCTCTATCAGATGCTTTCGGGGACTTCCCCATTTAAAGATGCAAGTG 
AATGGCTGATTTTCCAAAGAATTATAGCCAGAGATATAAAGTTCCCAAATCATTTTTCAG 
AAGCAGCAAGAGACCTCATCGACCGGTTGCTGGTAAGATGCATCAAAAGCCGTTGTCACC 
AATCAAATGATATGGCAGTAACTCTTTAAATGTGCGATTCTCTCAGGATACCGAGCCAAG 
CAGAAGGCCAGGTGCTGGCTCAGAAGGTTATGTTGCTCTTAAGAGACATCCTTTCTTTAA 
TGGAGTTGACTGGAAGAATCTAAGGTCCCAGACTCCTCCAAAACTAGCTCCAGATCCTGC 
GGTATGTCTCCTCATGTACTTTGGAATTATGAGGCCCTTTATTATCATTATTGTTTTACT 
CGGTTATTAAGTCAATTTTTTTACATCTGATGCAGTCTCAGACAGCATCTCCCGAGAGGG 
ATGACACACATGGTTCTCCATGGAACCTGACACATATTGGAGATTCTTTAGCCACACAGA 
ACGAGGGGCACAGTGCTCCTCCTACATCTTCTGAATCATCGGGTTCCATAACTCGACTTG 
CTTCAATAGACTCTTTTGATTCAAGATGGTGGGGGCATTTAAGCTTCATTTGTTTTCTGG 
GCAGCAACCTTGTTCAAAGCTCCTAAAACTATACTTCAGTTAGACAATCTGCTAATTTTT 
TTCTTGTGAATATTTGATAAACAGGCAACAGTTTTTAGAGCCAGGAGAATCGGTTCTGAT 
GATATCAGCGGTGAAGAAGCTTCAGAAAATAACGAGCAAGAAGGTGCAGCTAATACTCAC 
CAACAAACCCAAGCTGATCTATGTCGACCCGTCAAAACTAGTTGTGAAAGGAAACATTAT 
ATGGTCTGATAACTCGAATGACCTCAACGTTGTAGTCACTAGCCCTTCACATTTCAAGAT 
TTGCACGGTACTCATCATTTCCCTATTCTTCTATCCACCAAAGATCAAAGATGATGAAAC 
TGAAATTGTGACCTCTTCTTGCATTTGTGCAGCCAAAGAAGGTTTTATCATTTGAAGACG 
CAAAACAGAGAGCTTCAGTGTGGAAAAAGGCAATCGAGACTCTTCAGAACCGCTGAGACA 
AACTCCACTGAAGAATGGTTCTGTTAATTTCCTCGTTGGTGTTTTGTCTTTCCTTCAAGA 
TTTTAAACTGACCAAACTCCATTCTTCATATGTTTTTTCTATTTCACGCATCCTTGAAAA 
GACCAGTTGGACACAAAACTGAAAAACTGCTTTTTGAATCCTTTGTATAAGTCAAACAAA 
TGTATTACAAGGTTAAAGTTACATGAGATACTGAATTCAAGCA
>AT5G04510.1 |  PDK1 (3-PHOSPHOINOSITIDE-DEPENDENT PROTEIN KINASE 1) 3-phosphoinositide-dependent protein kinase/ kinase/ phosphoinositide binding / protein binding / protein kinase 
GAACGAATCATTGCAAAGAGCCAAAGATCCAAAATTTGCAACAAAAACAAAAACTCTACC 
AAGTGAAGAAGAAAGAGAAGTGTCACATCTCTCTCTCTATCTCTTTGATCCAAACAAATT 
CCCTCATATATCTCTGCTCTCTCTTCCCAATATTCTTCCTTTTTCTTCAATCGAGATTTT 
ATCGCTTTTTGATTCGAGAATTTCGAAATCGGAACCCACCACCACCACCCTCCATCGTCG 
CCAACAGATCCAACGGTTCTCTGATTATTTTTATTTGAAAATTGTCTTCTTTGTTCTCCC 
TCTCGCTGGGTTTGATTCTGAGATTGATTAATCTAAATGGGTCGCATCGAGATTTAGAAG 
AAAAGGGAAATGTTGGCAATGGAGAAAGAATTTGATTCAAAGCTTGTTCTTCAAGGGAAC 
TCATCCAACGGTGCTAATGTTTCTAGAAGCAAAAGCTTCTCCTTTAAAGCTCCTCAAGAA 
AATTTCACCAGCCATGATTTCGAATTTGGCAAGATCTATGGTGTTGGTTCTTACTCTAAG 
GTTTCTATTTTCATTACTATTTTCTATTGTGGCTTTGATTCATGAACCCAGCTCGATTCT 
CTGGTTGATTTTTTTGTTGCAATGTGGTGTTGATATTGGAAAGGATTGGTTTTTGACCTG 
TGGGTGATTCATATTAGGTTGTTAGGGCAAAGAAGAAGGAAACTGGAACTGTGTATGCTT 
TAAAGATTATGGACAAAAAGTTTATCACCAAGGAGAATAAAACTGCTTATGTGAAATTGG 
AAAGGATTGTTCTTGATCAACTTGAACATCCTGGGATCATTAAACTTTACTTCACGTTTC 
AAGACACATCCTCACTATGTAGGTATTTCTAATTCTAATGAACATTCCTGTGCTTGAACA 
TTCTGTAGTTGATTGATTTTTCTTATTTCTGCTCGGTATGCTTTTGGTAGATATGGCACT 
TGAATCTTGTGAGGGTGGCGAGCTTTTCGACCAAATAACCAGAGTAAGATGATCGCTGAA 
TCTTGGGAGATTGATTCATTCAATGTTGTCGGAATTTGACATAACATCATGGTTCCTTTT 
TTTGTGCAGAAAGGTCGGCTATCGGAGGATGAAGCTCGGTTCTACACTGCAGAAGTTGTG 
GATGCTCTTGAGTATATACATAGTATGGGACTGATTCATCGAGATATTAAGGTAGTTTCT 
CTTTCGAATAGCTTCTCTTTTATGGTCTTAACAATAGCTGTGCTAAGTGTTTTTCTTGAA 
ATGGTAGCCGGAGAATCTGTTGCTGACTTCAGATGGACACATTAAGATTGCGGATTTTGG 
AAGTGTAAAGCCGATGCAGGATAGCCAGATCACAGTTCTACCTAATGCAGCTTCTGGTAA 
CATGACCGTCCATAATCCTTTCTGGTATCTCTGAGCTTTAAGGTCTTATCAATAATTATT 
GTCAGATTCTGCTTTAAGCCTGTCTCTGTAAAACTGGTGACGTATACAGACGATAAGGCG 
TGCACTTTTGTCGGGACTGCTGCATATGTTCCTCCAGAAGTTCTCAACTCCTCTCCCGCA 
ACTTTCGGGTAAGAGTTTTCTGTTTCTGTAATTTGCGTTTTGGTTTTGTGGATTGCGATG 
CCTCTCTTACTTTGTACAATTTCTTTGGTACTCTGTGGAACACCAGGAATGATCTTTGGG 
CTCTCGGCTGCACTCTCTATCAGATGCTTTCGGGGACTTCCCCATTTAAAGATGCAAGTG 
AATGGCTGATTTTCCAAAGAATTATAGCCAGAGATATAAAGTTCCCAAATCATTTTTCAG 
AAGCAGCAAGAGACCTCATCGACCGGTTGCTGGTAAGATGCATCAAAAGCCGTTGTCACC 
AATCAAATGATATGGCAGTAACTCTTTAAATGTGCGATTCTCTCAGGATACCGAGCCAAG 
CAGAAGGCCAGGTGCTGGCTCAGAAGGTTATGTTGCTCTTAAGAGACATCCTTTCTTTAA 
TGGAGTTGACTGGAAGAATCTAAGGTCCCAGACTCCTCCAAAACTAGCTCCAGATCCTGC 
GGTATGTCTCCTCATGTACTTTGGAATTATGAGGCCCTTTATTATCATTATTGTTTTACT 
CGGTTATTAAGTCAATTTTTTTACATCTGATGCAGTCTCAGACAGCATCTCCCGAGAGGG 
ATGACACACATGGTTCTCCATGGAACCTGACACATATTGGAGATTCTTTAGCCACACAGA 
ACGAGGGGCACAGTGCTCCTCCTACATCTTCTGAATCATCGGGTTCCATAACTCGACTTG 
CTTCAATAGACTCTTTTGATTCAAGATGGTGGGGGCATTTAAGCTTCATTTGTTTTCTGG 
GCAGCAACCTTGTTCAAAGCTCCTAAAACTATACTTCAGTTAGACAATCTGCTAATTTTT 
TTCTTGTGAATATTTGATAAACAGGCAACAGTTTTTAGAGCCAGGAGAATCGGTTCTGAT 
GATATCAGCGGTGAAGAAGCTTCAGAAAATAACGAGCAAGAAGGTGCAGCTAATACTCAC 
CAACAAACCCAAGCTGATCTATGTCGACCCGTCAAAACTAGTTGTGAAAGGAAACATTAT 
ATGGTCTGATAACTCGAATGACCTCAACGTTGTAGTCACTAGCCCTTCACATTTCAAGAT 
TTGCACGGTACTCATCATTTCCCTATTCTTCTATCCACCAAAGATCAAAGATGATGAAAC 
TGAAATTGTGACCTCTTCTTGCATTTGTGCAGCCAAAGAAGGTTTTATCATTTGAAGACG 
CAAAACAGAGAGCTTCAGTGTGGAAAAAGGCAATCGAGACTCTTCAGAACCGCTGAGACA 
AACTCCACTGAAGAATGGTTCTGTTAATTTCCTCGTTGGTGTTTTGTCTTTCCTTCAAGA 
TTTTAAACTGACCAAACTCCATTCTTCATATGTTTTTTCTATTTCACGCATCCTTGAAAA 
GACCAGTTGGACACAAAACTGAAAAACTGCTTTTTGAATCCTTTGTATAAGTCAAACAAA 
TGTATTACAAGGTTAAAGTTACATGAGATACTGAATTCAAGCA
>AT2G22720.2 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown EXPRESSED IN 24 plant structures EXPRESSED DURING 15 growth stages CONTAINS InterPro DOMAIN/s Chromatin SPT2 (InterProIPR013256) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT4G378601) Has 34422 Blast hits to 21553 proteins in 958 species Archae - 9 Bacteria - 2953 Metazoa - 16492 Fungi - 6405 Plants - 1061 Viruses - 320 Other Eukaryotes - 7182 (source NCBI BLink) 
ATGCAAGTGAGAACTTTTGAGAGGCGCAACAACAAGATGGCGCATAATAGCAAAAGACCT 
AATTTTGCTTCTGGGTCCTTGGATTTTCTTCCTCTTCCTCTTTATTCCCCTCAAAACCTA 
GGAGCCCAACGAGAGCACCCCTTTCCTCTTCTTCTATATCCCTTTTTATCTCTTCGCAGC 
TTTCTCTCCTCTCTCAGATCTATACTCTCTTCTCCTTCACAATCGTTCGTTTTATCCTCT 
CCATCGGATTCGTCAGATATCTTCGATTTCTCGTCGTTTCATCGGTTCCAGGTCAAATCC 
AGGTATGTCAATTGTAAAATTTCGATCAGATTCGTCGATTTCGATCAGCTTTTTCGATTT 
TGGATCTATTTTCTATGAAATATCAGATCTGGTGATTGTTTTACATATTTTTGGGTTGAA 
TTCACAAGATTTTCTGGAAACGAGATCGATTAATTGAGTTTTCTGTGTTTTTATCTTAAG 
CTAGATCTCGATTTCTATGTTTTTGGATTGATTTGATAAGATTTTCGAGAATTTTTTGTG 
TTTTTGTCAAAGTTCGATCTCGATTTCTATATTTTTGGTTGAATTCACAAGACTTTCTGG 
AAACGAGATCGATTTTGTGAGTTTTCTTTGTTTTTAATCTCGATTTTTGGATTGATTTGA 
GAAGATTTTCTGAAAGCGAGATCGATGTTTTTGGGGATTTTCTTTGTTTTGTTCAATAAT 
TCGGTCTCTGTTTTCTTATCAAAAAATTCGTTTTCCATCTCAAATCGATGTTCTTATTGA 
TTTAATTGAGTTTTAGTTTGCAGGGATTTGATCGTTGGTAAGCTATCTTTCAGCAAACAT 
GCATGGTTATGAAGATGTAAGCACGCTCATGAATTTTTGTTTTCAGTGATTTTGTCGAAT 
TCAATTTAAGGTAGATAGATTTGACATTGTTCGATAATGTTATATTGCAGGACCTTGATG 
AGGAAGCTGGGTATGATGACTATTACAGCGGTGATGAGGATGAGTATGAAGATGAGGAAG 
AGGAGGATGAAGAACCTCCTAAGGAAGAATTGGAATTTCTTGAGTCACGCCAAAAGTTGA 
AGGAATCAATTCGGAAGAAAATGGGAAATGGAAGTGCTAATGCTCAATCTTCACAAGAGA 
GAAGAAGAAAACTTCCTTATAACGAGTATGTGGTGGCTAAATCACATTTTCTAATTCATT 
ACAATGTCCTGGAATGTGTTTTGATGCTGAGCTTATTGATTTTTCTTAATGCAGCTTTGG 
TTCTTTCTTTGGTCCTTCACGGCCTGTTATTTCCTCAAGGGTTATACAAGAAAGCAAATC 
CTTGCTTGAAAACGAGCTACGTAAAATGTCGAATTCGAGCCAAACTGTATGTGCATTTGA 
TCTTTGTTACTCTTTGTATTTTTATCATTTAAGATGTTTTTGCTGATGGAATTGTTTTTT 
GGGGTGCAGAAGAAAAGACCAGTTCCGACGAATGGTTCAGGCTCTAAGAATGTGTCACAA 
GAGAAGCGACCTAAAGTTGTGAATGAGGTGAGAAGGAAAGTTGAGACTCTTAAGGATACA 
AGAGACTATTCGTTTTTGTTTTCCGATGACGCGGAGCTTCCTGTTCCGAAGAAGGAATCT 
CTTTCACGAAGTGGCTCTTTTCCTAATTCTGGTATGTTGTGTCTTTTGAAAAATCTTTTT 
CGCTATTTGTGATCTTTAAGCATACCATTTTCATGAAGATAACTTATACAGGTTTTTTGC 
TGATGTTCAAGAGGCTCGATCTGCTCAATTATCATCGAGGCCCAAACAATCATCAGGTAT 
CAATGGTAGAACTGCTCACAGTCCCCATCGTGAGGAGAAGAGACCTGTTTCAGCGAATGG 
ACATTCAAGACCGTCTTCCTCGGGCAGTCAAATGAATCATTCAAGACCGTCTTCCTCTGG 
CAGTAAAATGAATCATTCAAGACCGGCTACCTCGGGCAGCCAAATGCCAAATTCAAGACC 
AGCTTCCTCTGGCAGCCAAATGCAGTCGAGAGCTGTCTCAGGCTCAGGGCGACCTGCTTC 
CTCAGGCAGCCAGATGCAAAATTCAAGACCACAAAATTCAAGACCAGCTTCCGCTGGTAG 
CCAAATGCAGCAAAGGCCTGCGTCCTCAGGCAGCCAAAGGCCTGCGTCCTCAGGCAGCCA 
AAGGCCTGCGTCCTCAGGCAGCCAAAGGCCAGGTTCGTCGACAAACCGTCAAGCACCTAT 
GAGGCCACCAGGTTCAGGTTCCACAATGAATGGTCAATCAGCCAACCGGAATGGCCAACT 
GAATTCCAGATCAGATTCCCGAAGATCAGCTCCTGCTAAAGTGCCAGTGGATCATAGGAA 
ACAGATGAGCAGTAGCAATGGAGTTGGTCCTGGTCGGTCAGCGACCAATGCAAGACCTTT 
ACCTTCTAAGAGTTCATTGGAAAGAAAACCCTCAATCTCGGCGGGAAAGAGTTCTCTTCA 
AAGCCCTCAGAGACCGTCCTCATCAAGACCAATGTCATCTGATCCTAGGCAACGGGTAGT 
AGAACAGAGAAAGGTTTCTCGTGACATGGCCACACCCCGAATGATACCTAAACAATCAGC 
GCCTACCTCGAAACACCAGGTATCATGATCATGATCTTTCACATCTCTTTCTTTTGTCCT 
TCCTCTAGCCAAGGCACTAATTTGTCAAGTAATATTTACAGATGATGAGTAAACCAGCGC 
TCAAGAGACCTCCCTCGCGTGACATAGATCATGAAAGGAGGCTGTTGAAGAAGAAGAAGC 
CTGCAAGGTCAGAGGATCAAGAAGCATTCGATATGCTTAGACAGTTATTGTAAGTATTGC 
TCCAAACTTTCTTCCTACTCTCAAATTGTAAGTTACAATTTTCTAATTCTATTTTGTCTC 
CTGATACTTAAATGGGGGTTTGTGTATCAATTTTAGACCACCCAAGCGGTTTTCTCGGTA 
TGACGATGATGACATAAACATGGAAGCAGGCTTTGAAGATATCCAAAAGGAAGAGAGACG 
AAGGTACATGAGTATTTTTGTTATCACACGTTTCATTTATTTGTGTTTCTTGGATATTCC 
TTAACGATTGAATTGGTTGTTAAATGCAGTGCGAGAATCGCAAGGGAGGAAGATGAAAGA 
GAACTTAAGCTCTTAGAGGAAGAAGAAAGGAGAGAAAGACTGAAAAAGAATCGGAAGCTG 
AGCCGTTAGAAGAATCCTTTCTCCTTTGTGTCTTTGTCTTCTTTTAGGACTTTTTTAGTG 
TTTTCTCATTGAAATCTCTTTGGCCGCTTGAGGCAAAAAAGAGTTTGACCTTTTTTTTGT 
TTTGTGTTTTCAAATTAAGGATCTTTTTTTTGTTCATGGAAATTGTACAATTAGAAATAA 
TATCTTTTATTGGGGACACTTCAAGAAGAATCTGTTGGAAACCTTCCCAGTTAGTGAAAG 
CTTGATTCTCTTT
>AT2G22720.2 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN chloroplast EXPRESSED IN 24 plant structures EXPRESSED DURING 15 growth stages CONTAINS InterPro DOMAIN/s Chromatin SPT2 (InterProIPR013256) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT4G378601) Has 18810 Blast hits to 9739 proteins in 609 species Archae - 9 Bacteria - 1424 Metazoa - 8282 Fungi - 2696 Plants - 486 Viruses - 222 Other Eukaryotes - 5691 (source NCBI BLink) 
ATGCAAGTGAGAACTTTTGAGAGGCGCAACAACAAGATGGCGCATAATAGCAAAAGACCT 
AATTTTGCTTCTGGGTCCTTGGATTTTCTTCCTCTTCCTCTTTATTCCCCTCAAAACCTA 
GGAGCCCAACGAGAGCACCCCTTTCCTCTTCTTCTATATCCCTTTTTATCTCTTCGCAGC 
TTTCTCTCCTCTCTCAGATCTATACTCTCTTCTCCTTCACAATCGTTCGTTTTATCCTCT 
CCATCGGATTCGTCAGATATCTTCGATTTCTCGTCGTTTCATCGGTTCCAGGTCAAATCC 
AGGTATGTCAATTGTAAAATTTCGATCAGATTCGTCGATTTCGATCAGCTTTTTCGATTT 
TGGATCTATTTTCTATGAAATATCAGATCTGGTGATTGTTTTACATATTTTTGGGTTGAA 
TTCACAAGATTTTCTGGAAACGAGATCGATTAATTGAGTTTTCTGTGTTTTTATCTTAAG 
CTAGATCTCGATTTCTATGTTTTTGGATTGATTTGATAAGATTTTCGAGAATTTTTTGTG 
TTTTTGTCAAAGTTCGATCTCGATTTCTATATTTTTGGTTGAATTCACAAGACTTTCTGG 
AAACGAGATCGATTTTGTGAGTTTTCTTTGTTTTTAATCTCGATTTTTGGATTGATTTGA 
GAAGATTTTCTGAAAGCGAGATCGATGTTTTTGGGGATTTTCTTTGTTTTGTTCAATAAT 
TCGGTCTCTGTTTTCTTATCAAAAAATTCGTTTTCCATCTCAAATCGATGTTCTTATTGA 
TTTAATTGAGTTTTAGTTTGCAGGGATTTGATCGTTGGTAAGCTATCTTTCAGCAAACAT 
GCATGGTTATGAAGATGTAAGCACGCTCATGAATTTTTGTTTTCAGTGATTTTGTCGAAT 
TCAATTTAAGGTAGATAGATTTGACATTGTTCGATAATGTTATATTGCAGGACCTTGATG 
AGGAAGCTGGGTATGATGACTATTACAGCGGTGATGAGGATGAGTATGAAGATGAGGAAG 
AGGAGGATGAAGAACCTCCTAAGGAAGAATTGGAATTTCTTGAGTCACGCCAAAAGTTGA 
AGGAATCAATTCGGAAGAAAATGGGAAATGGAAGTGCTAATGCTCAATCTTCACAAGAGA 
GAAGAAGAAAACTTCCTTATAACGAGTATGTGGTGGCTAAATCACATTTTCTAATTCATT 
ACAATGTCCTGGAATGTGTTTTGATGCTGAGCTTATTGATTTTTCTTAATGCAGCTTTGG 
TTCTTTCTTTGGTCCTTCACGGCCTGTTATTTCCTCAAGGGTTATACAAGAAAGCAAATC 
CTTGCTTGAAAACGAGCTACGTAAAATGTCGAATTCGAGCCAAACTGTATGTGCATTTGA 
TCTTTGTTACTCTTTGTATTTTTATCATTTAAGATGTTTTTGCTGATGGAATTGTTTTTT 
GGGGTGCAGAAGAAAAGACCAGTTCCGACGAATGGTTCAGGCTCTAAGAATGTGTCACAA 
GAGAAGCGACCTAAAGTTGTGAATGAGGTGAGAAGGAAAGTTGAGACTCTTAAGGATACA 
AGAGACTATTCGTTTTTGTTTTCCGATGACGCGGAGCTTCCTGTTCCGAAGAAGGAATCT 
CTTTCACGAAGTGGCTCTTTTCCTAATTCTGGTATGTTGTGTCTTTTGAAAAATCTTTTT 
CGCTATTTGTGATCTTTAAGCATACCATTTTCATGAAGATAACTTATACAGGTTTTTTGC 
TGATGTTCAAGAGGCTCGATCTGCTCAATTATCATCGAGGCCCAAACAATCATCAGGTAT 
CAATGGTAGAACTGCTCACAGTCCCCATCGTGAGGAGAAGAGACCTGTTTCAGCGAATGG 
ACATTCAAGACCGTCTTCCTCGGGCAGTCAAATGAATCATTCAAGACCGTCTTCCTCTGG 
CAGTAAAATGAATCATTCAAGACCGGCTACCTCGGGCAGCCAAATGCCAAATTCAAGACC 
AGCTTCCTCTGGCAGCCAAATGCAGTCGAGAGCTGTCTCAGGCTCAGGGCGACCTGCTTC 
CTCAGGCAGCCAGATGCAAAATTCAAGACCACAAAATTCAAGACCAGCTTCCGCTGGTAG 
CCAAATGCAGCAAAGGCCTGCGTCCTCAGGCAGCCAAAGGCCTGCGTCCTCAGGCAGCCA 
AAGGCCTGCGTCCTCAGGCAGCCAAAGGCCAGGTTCGTCGACAAACCGTCAAGCACCTAT 
GAGGCCACCAGGTTCAGGTTCCACAATGAATGGTCAATCAGCCAACCGGAATGGCCAACT 
GAATTCCAGATCAGATTCCCGAAGATCAGCTCCTGCTAAAGTGCCAGTGGATCATAGGAA 
ACAGATGAGCAGTAGCAATGGAGTTGGTCCTGGTCGGTCAGCGACCAATGCAAGACCTTT 
ACCTTCTAAGAGTTCATTGGAAAGAAAACCCTCAATCTCGGCGGGAAAGAGTTCTCTTCA 
AAGCCCTCAGAGACCGTCCTCATCAAGACCAATGTCATCTGATCCTAGGCAACGGGTAGT 
AGAACAGAGAAAGGTTTCTCGTGACATGGCCACACCCCGAATGATACCTAAACAATCAGC 
GCCTACCTCGAAACACCAGGTATCATGATCATGATCTTTCACATCTCTTTCTTTTGTCCT 
TCCTCTAGCCAAGGCACTAATTTGTCAAGTAATATTTACAGATGATGAGTAAACCAGCGC 
TCAAGAGACCTCCCTCGCGTGACATAGATCATGAAAGGAGGCTGTTGAAGAAGAAGAAGC 
CTGCAAGGTCAGAGGATCAAGAAGCATTCGATATGCTTAGACAGTTATTGTAAGTATTGC 
TCCAAACTTTCTTCCTACTCTCAAATTGTAAGTTACAATTTTCTAATTCTATTTTGTCTC 
CTGATACTTAAATGGGGGTTTGTGTATCAATTTTAGACCACCCAAGCGGTTTTCTCGGTA 
TGACGATGATGACATAAACATGGAAGCAGGCTTTGAAGATATCCAAAAGGAAGAGAGACG 
AAGGTACATGAGTATTTTTGTTATCACACGTTTCATTTATTTGTGTTTCTTGGATATTCC 
TTAACGATTGAATTGGTTGTTAAATGCAGTGCGAGAATCGCAAGGGAGGAAGATGAAAGA 
GAACTTAAGCTCTTAGAGGAAGAAGAAAGGAGAGAAAGACTGAAAAAGAATCGGAAGCTG 
AGCCGTTAGAAGAATCCTTTCTCCTTTGTGTCTTTGTCTTCTTTTAGGACTTTTTTAGTG 
TTTTCTCATTGAAATCTCTTTGGCCGCTTGAGGCAAAAAAGAGTTTGACCTTTTTTTTGT 
TTTGTGTTTTCAAATTAAGGATCTTTTTTTTGTTCATGGAAATTGTACAATTAGAAATAA 
TATCTTTTATTGGGGACACTTCAAGAAGAATCTGTTGGAAACCTTCCCAGTTAGTGAAAG 
CTTGATTCTCTTT
>AT2G22720.2 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown EXPRESSED IN 24 plant structures EXPRESSED DURING 15 growth stages CONTAINS InterPro DOMAIN/s Chromatin SPT2 (InterProIPR013256) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT4G378601) Has 34349 Blast hits to 21327 proteins in 952 species Archae - 9 Bacteria - 3053 Metazoa - 16365 Fungi - 6337 Plants - 1046 Viruses - 325 Other Eukaryotes - 7214 (source NCBI BLink) 
ATGCAAGTGAGAACTTTTGAGAGGCGCAACAACAAGATGGCGCATAATAGCAAAAGACCT 
AATTTTGCTTCTGGGTCCTTGGATTTTCTTCCTCTTCCTCTTTATTCCCCTCAAAACCTA 
GGAGCCCAACGAGAGCACCCCTTTCCTCTTCTTCTATATCCCTTTTTATCTCTTCGCAGC 
TTTCTCTCCTCTCTCAGATCTATACTCTCTTCTCCTTCACAATCGTTCGTTTTATCCTCT 
CCATCGGATTCGTCAGATATCTTCGATTTCTCGTCGTTTCATCGGTTCCAGGTCAAATCC 
AGGTATGTCAATTGTAAAATTTCGATCAGATTCGTCGATTTCGATCAGCTTTTTCGATTT 
TGGATCTATTTTCTATGAAATATCAGATCTGGTGATTGTTTTACATATTTTTGGGTTGAA 
TTCACAAGATTTTCTGGAAACGAGATCGATTAATTGAGTTTTCTGTGTTTTTATCTTAAG 
CTAGATCTCGATTTCTATGTTTTTGGATTGATTTGATAAGATTTTCGAGAATTTTTTGTG 
TTTTTGTCAAAGTTCGATCTCGATTTCTATATTTTTGGTTGAATTCACAAGACTTTCTGG 
AAACGAGATCGATTTTGTGAGTTTTCTTTGTTTTTAATCTCGATTTTTGGATTGATTTGA 
GAAGATTTTCTGAAAGCGAGATCGATGTTTTTGGGGATTTTCTTTGTTTTGTTCAATAAT 
TCGGTCTCTGTTTTCTTATCAAAAAATTCGTTTTCCATCTCAAATCGATGTTCTTATTGA 
TTTAATTGAGTTTTAGTTTGCAGGGATTTGATCGTTGGTAAGCTATCTTTCAGCAAACAT 
GCATGGTTATGAAGATGTAAGCACGCTCATGAATTTTTGTTTTCAGTGATTTTGTCGAAT 
TCAATTTAAGGTAGATAGATTTGACATTGTTCGATAATGTTATATTGCAGGACCTTGATG 
AGGAAGCTGGGTATGATGACTATTACAGCGGTGATGAGGATGAGTATGAAGATGAGGAAG 
AGGAGGATGAAGAACCTCCTAAGGAAGAATTGGAATTTCTTGAGTCACGCCAAAAGTTGA 
AGGAATCAATTCGGAAGAAAATGGGAAATGGAAGTGCTAATGCTCAATCTTCACAAGAGA 
GAAGAAGAAAACTTCCTTATAACGAGTATGTGGTGGCTAAATCACATTTTCTAATTCATT 
ACAATGTCCTGGAATGTGTTTTGATGCTGAGCTTATTGATTTTTCTTAATGCAGCTTTGG 
TTCTTTCTTTGGTCCTTCACGGCCTGTTATTTCCTCAAGGGTTATACAAGAAAGCAAATC 
CTTGCTTGAAAACGAGCTACGTAAAATGTCGAATTCGAGCCAAACTGTATGTGCATTTGA 
TCTTTGTTACTCTTTGTATTTTTATCATTTAAGATGTTTTTGCTGATGGAATTGTTTTTT 
GGGGTGCAGAAGAAAAGACCAGTTCCGACGAATGGTTCAGGCTCTAAGAATGTGTCACAA 
GAGAAGCGACCTAAAGTTGTGAATGAGGTGAGAAGGAAAGTTGAGACTCTTAAGGATACA 
AGAGACTATTCGTTTTTGTTTTCCGATGACGCGGAGCTTCCTGTTCCGAAGAAGGAATCT 
CTTTCACGAAGTGGCTCTTTTCCTAATTCTGGTATGTTGTGTCTTTTGAAAAATCTTTTT 
CGCTATTTGTGATCTTTAAGCATACCATTTTCATGAAGATAACTTATACAGGTTTTTTGC 
TGATGTTCAAGAGGCTCGATCTGCTCAATTATCATCGAGGCCCAAACAATCATCAGGTAT 
CAATGGTAGAACTGCTCACAGTCCCCATCGTGAGGAGAAGAGACCTGTTTCAGCGAATGG 
ACATTCAAGACCGTCTTCCTCGGGCAGTCAAATGAATCATTCAAGACCGTCTTCCTCTGG 
CAGTAAAATGAATCATTCAAGACCGGCTACCTCGGGCAGCCAAATGCCAAATTCAAGACC 
AGCTTCCTCTGGCAGCCAAATGCAGTCGAGAGCTGTCTCAGGCTCAGGGCGACCTGCTTC 
CTCAGGCAGCCAGATGCAAAATTCAAGACCACAAAATTCAAGACCAGCTTCCGCTGGTAG 
CCAAATGCAGCAAAGGCCTGCGTCCTCAGGCAGCCAAAGGCCTGCGTCCTCAGGCAGCCA 
AAGGCCTGCGTCCTCAGGCAGCCAAAGGCCAGGTTCGTCGACAAACCGTCAAGCACCTAT 
GAGGCCACCAGGTTCAGGTTCCACAATGAATGGTCAATCAGCCAACCGGAATGGCCAACT 
GAATTCCAGATCAGATTCCCGAAGATCAGCTCCTGCTAAAGTGCCAGTGGATCATAGGAA 
ACAGATGAGCAGTAGCAATGGAGTTGGTCCTGGTCGGTCAGCGACCAATGCAAGACCTTT 
ACCTTCTAAGAGTTCATTGGAAAGAAAACCCTCAATCTCGGCGGGAAAGAGTTCTCTTCA 
AAGCCCTCAGAGACCGTCCTCATCAAGACCAATGTCATCTGATCCTAGGCAACGGGTAGT 
AGAACAGAGAAAGGTTTCTCGTGACATGGCCACACCCCGAATGATACCTAAACAATCAGC 
GCCTACCTCGAAACACCAGGTATCATGATCATGATCTTTCACATCTCTTTCTTTTGTCCT 
TCCTCTAGCCAAGGCACTAATTTGTCAAGTAATATTTACAGATGATGAGTAAACCAGCGC 
TCAAGAGACCTCCCTCGCGTGACATAGATCATGAAAGGAGGCTGTTGAAGAAGAAGAAGC 
CTGCAAGGTCAGAGGATCAAGAAGCATTCGATATGCTTAGACAGTTATTGTAAGTATTGC 
TCCAAACTTTCTTCCTACTCTCAAATTGTAAGTTACAATTTTCTAATTCTATTTTGTCTC 
CTGATACTTAAATGGGGGTTTGTGTATCAATTTTAGACCACCCAAGCGGTTTTCTCGGTA 
TGACGATGATGACATAAACATGGAAGCAGGCTTTGAAGATATCCAAAAGGAAGAGAGACG 
AAGGTACATGAGTATTTTTGTTATCACACGTTTCATTTATTTGTGTTTCTTGGATATTCC 
TTAACGATTGAATTGGTTGTTAAATGCAGTGCGAGAATCGCAAGGGAGGAAGATGAAAGA 
GAACTTAAGCTCTTAGAGGAAGAAGAAAGGAGAGAAAGACTGAAAAAGAATCGGAAGCTG 
AGCCGTTAGAAGAATCCTTTCTCCTTTGTGTCTTTGTCTTCTTTTAGGACTTTTTTAGTG 
TTTTCTCATTGAAATCTCTTTGGCCGCTTGAGGCAAAAAAGAGTTTGACCTTTTTTTTGT 
TTTGTGTTTTCAAATTAAGGATCTTTTTTTTGTTCATGGAAATTGTACAATTAGAAATAA 
TATCTTTTATTGGGGACACTTCAAGAAGAATCTGTTGGAAACCTTCCCAGTTAGTGAAAG 
CTTGATTCTCTTT
>AT2G22720.1 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown EXPRESSED IN 24 plant structures EXPRESSED DURING 15 growth stages CONTAINS InterPro DOMAIN/s Chromatin SPT2 (InterProIPR013256) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT4G378601) Has 34422 Blast hits to 21553 proteins in 958 species Archae - 9 Bacteria - 2953 Metazoa - 16492 Fungi - 6405 Plants - 1061 Viruses - 320 Other Eukaryotes - 7182 (source NCBI BLink) 
TCCTCTTCTTCTATATCCCTTTTTATCTCTTCGCAGCTTTCTCTCCTCTCTCAGATCTAT 
ACTCTCTTCTCCTTCACAATCGTTCGTTTTATCCTCTCCATCGGATTCGTCAGATATCTT 
CGATTTCTCGTCGTTTCATCGGTTCCAGGTCAAATCCAGGTATGTCAATTGTAAAATTTC 
GATCAGATTCGTCGATTTCGATCAGCTTTTTCGATTTTGGATCTATTTTCTATGAAATAT 
CAGATCTGGTGATTGTTTTACATATTTTTGGGTTGAATTCACAAGATTTTCTGGAAACGA 
GATCGATTAATTGAGTTTTCTGTGTTTTTATCTTAAGCTAGATCTCGATTTCTATGTTTT 
TGGATTGATTTGATAAGATTTTCGAGAATTTTTTGTGTTTTTGTCAAAGTTCGATCTCGA 
TTTCTATATTTTTGGTTGAATTCACAAGACTTTCTGGAAACGAGATCGATTTTGTGAGTT 
TTCTTTGTTTTTAATCTCGATTTTTGGATTGATTTGAGAAGATTTTCTGAAAGCGAGATC 
GATGTTTTTGGGGATTTTCTTTGTTTTGTTCAATAATTCGGTCTCTGTTTTCTTATCAAA 
AAATTCGTTTTCCATCTCAAATCGATGTTCTTATTGATTTAATTGAGTTTTAGTTTGCAG 
GGATTTGATCGTTGGTAAGCTATCTTTCAGCAAACATGCATGGTTATGAAGATGTAAGCA 
CGCTCATGAATTTTTGTTTTCAGTGATTTTGTCGAATTCAATTTAAGGTAGATAGATTTG 
ACATTGTTCGATAATGTTATATTGCAGGACCTTGATGAGGAAGCTGGGTATGATGACTAT 
TACAGCGGTGATGAGGATGAGTATGAAGATGAGGAAGAGGAGGATGAAGAACCTCCTAAG 
GAAGAATTGGAATTTCTTGAGTCACGCCAAAAGTTGAAGGAATCAATTCGGAAGAAAATG 
GGAAATGGAAGTGCTAATGCTCAATCTTCACAAGAGAGAAGAAGAAAACTTCCTTATAAC 
GAGTATGTGGTGGCTAAATCACATTTTCTAATTCATTACAATGTCCTGGAATGTGTTTTG 
ATGCTGAGCTTATTGATTTTTCTTAATGCAGCTTTGGTTCTTTCTTTGGTCCTTCACGGC 
CTGTTATTTCCTCAAGGGTTATACAAGAAAGCAAATCCTTGCTTGAAAACGAGCTACGTA 
AAATGTCGAATTCGAGCCAAACTGTATGTGCATTTGATCTTTGTTACTCTTTGTATTTTT 
ATCATTTAAGATGTTTTTGCTGATGGAATTGTTTTTTGGGGTGCAGAAGAAAAGACCAGT 
TCCGACGAATGGTTCAGGCTCTAAGAATGTGTCACAAGAGAAGCGACCTAAAGTTGTGAA 
TGAGGTGAGAAGGAAAGTTGAGACTCTTAAGGATACAAGAGACTATTCGTTTTTGTTTTC 
CGATGACGCGGAGCTTCCTGTTCCGAAGAAGGAATCTCTTTCACGAAGTGGCTCTTTTCC 
TAATTCTGGTATGTTGTGTCTTTTGAAAAATCTTTTTCGCTATTTGTGATCTTTAAGCAT 
ACCATTTTCATGAAGATAACTTATACAGGTTTTTTGCTGATGTTCAAGAGGCTCGATCTG 
CTCAATTATCATCGAGGCCCAAACAATCATCAGGTATCAATGGTAGAACTGCTCACAGTC 
CCCATCGTGAGGAGAAGAGACCTGTTTCAGCGAATGGACATTCAAGACCGTCTTCCTCGG 
GCAGTCAAATGAATCATTCAAGACCGTCTTCCTCTGGCAGTAAAATGAATCATTCAAGAC 
CGGCTACCTCGGGCAGCCAAATGCCAAATTCAAGACCAGCTTCCTCTGGCAGCCAAATGC 
AGTCGAGAGCTGTCTCAGGCTCAGGGCGACCTGCTTCCTCAGGCAGCCAGATGCAAAATT 
CAAGACCACAAAATTCAAGACCAGCTTCCGCTGGTAGCCAAATGCAGCAAAGGCCTGCGT 
CCTCAGGCAGCCAAAGGCCTGCGTCCTCAGGCAGCCAAAGGCCTGCGTCCTCAGGCAGCC 
AAAGGCCAGGTTCGTCGACAAACCGTCAAGCACCTATGAGGCCACCAGGTTCAGGTTCCA 
CAATGAATGGTCAATCAGCCAACCGGAATGGCCAACTGAATTCCAGATCAGATTCCCGAA 
GATCAGCTCCTGCTAAAGTGCCAGTGGATCATAGGAAACAGATGAGCAGTAGCAATGGAG 
TTGGTCCTGGTCGGTCAGCGACCAATGCAAGACCTTTACCTTCTAAGAGTTCATTGGAAA 
GAAAACCCTCAATCTCGGCGGGAAAGAGTTCTCTTCAAAGCCCTCAGAGACCGTCCTCAT 
CAAGACCAATGTCATCTGATCCTAGGCAACGGGTAGTAGAACAGAGAAAGGTTTCTCGTG 
ACATGGCCACACCCCGAATGATACCTAAACAATCAGCGCCTACCTCGAAACACCAGGTAT 
CATGATCATGATCTTTCACATCTCTTTCTTTTGTCCTTCCTCTAGCCAAGGCACTAATTT 
GTCAAGTAATATTTACAGATGATGAGTAAACCAGCGCTCAAGAGACCTCCCTCGCGTGAC 
ATAGATCATGAAAGGAGGCTGTTGAAGAAGAAGAAGCCTGCAAGGTCAGAGGATCAAGAA 
GCATTCGATATGCTTAGACAGTTATTGTAAGTATTGCTCCAAACTTTCTTCCTACTCTCA 
AATTGTAAGTTACAATTTTCTAATTCTATTTTGTCTCCTGATACTTAAATGGGGGTTTGT 
GTATCAATTTTAGACCACCCAAGCGGTTTTCTCGGTATGACGATGATGACATAAACATGG 
AAGCAGGCTTTGAAGATATCCAAAAGGAAGAGAGACGAAGGTACATGAGTATTTTTGTTA 
TCACACGTTTCATTTATTTGTGTTTCTTGGATATTCCTTAACGATTGAATTGGTTGTTAA 
ATGCAGTGCGAGAATCGCAAGGGAGGAAGATGAAAGAGAACTTAAGCTCTTAGAGGAAGA 
AGAAAGGAGAGAAAGACTGAAAAAGAATCGGAAGCTGAGCCGTTAGAAGAATCCTTTCTC 
CTTTGTGTCTTTGTCTTCTTTTAGGACTTTTTTAGTGTTTTCTCATTGAAATCTCTTTGG 
CCGCTTGAGGCAAAAAAGAGTTTGACCTTTTTTTTGTTTTGTGTTTTCAAATTAAGGATC 
TTTTTTTTGTTCATGGAAATTGTACAATTAGAAATAATATCTTTTATTGGGGAC
>AT2G22720.1 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN chloroplast EXPRESSED IN 24 plant structures EXPRESSED DURING 15 growth stages CONTAINS InterPro DOMAIN/s Chromatin SPT2 (InterProIPR013256) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT4G378601) Has 18810 Blast hits to 9739 proteins in 609 species Archae - 9 Bacteria - 1424 Metazoa - 8282 Fungi - 2696 Plants - 486 Viruses - 222 Other Eukaryotes - 5691 (source NCBI BLink) 
TCCTCTTCTTCTATATCCCTTTTTATCTCTTCGCAGCTTTCTCTCCTCTCTCAGATCTAT 
ACTCTCTTCTCCTTCACAATCGTTCGTTTTATCCTCTCCATCGGATTCGTCAGATATCTT 
CGATTTCTCGTCGTTTCATCGGTTCCAGGTCAAATCCAGGTATGTCAATTGTAAAATTTC 
GATCAGATTCGTCGATTTCGATCAGCTTTTTCGATTTTGGATCTATTTTCTATGAAATAT 
CAGATCTGGTGATTGTTTTACATATTTTTGGGTTGAATTCACAAGATTTTCTGGAAACGA 
GATCGATTAATTGAGTTTTCTGTGTTTTTATCTTAAGCTAGATCTCGATTTCTATGTTTT 
TGGATTGATTTGATAAGATTTTCGAGAATTTTTTGTGTTTTTGTCAAAGTTCGATCTCGA 
TTTCTATATTTTTGGTTGAATTCACAAGACTTTCTGGAAACGAGATCGATTTTGTGAGTT 
TTCTTTGTTTTTAATCTCGATTTTTGGATTGATTTGAGAAGATTTTCTGAAAGCGAGATC 
GATGTTTTTGGGGATTTTCTTTGTTTTGTTCAATAATTCGGTCTCTGTTTTCTTATCAAA 
AAATTCGTTTTCCATCTCAAATCGATGTTCTTATTGATTTAATTGAGTTTTAGTTTGCAG 
GGATTTGATCGTTGGTAAGCTATCTTTCAGCAAACATGCATGGTTATGAAGATGTAAGCA 
CGCTCATGAATTTTTGTTTTCAGTGATTTTGTCGAATTCAATTTAAGGTAGATAGATTTG 
ACATTGTTCGATAATGTTATATTGCAGGACCTTGATGAGGAAGCTGGGTATGATGACTAT 
TACAGCGGTGATGAGGATGAGTATGAAGATGAGGAAGAGGAGGATGAAGAACCTCCTAAG 
GAAGAATTGGAATTTCTTGAGTCACGCCAAAAGTTGAAGGAATCAATTCGGAAGAAAATG 
GGAAATGGAAGTGCTAATGCTCAATCTTCACAAGAGAGAAGAAGAAAACTTCCTTATAAC 
GAGTATGTGGTGGCTAAATCACATTTTCTAATTCATTACAATGTCCTGGAATGTGTTTTG 
ATGCTGAGCTTATTGATTTTTCTTAATGCAGCTTTGGTTCTTTCTTTGGTCCTTCACGGC 
CTGTTATTTCCTCAAGGGTTATACAAGAAAGCAAATCCTTGCTTGAAAACGAGCTACGTA 
AAATGTCGAATTCGAGCCAAACTGTATGTGCATTTGATCTTTGTTACTCTTTGTATTTTT 
ATCATTTAAGATGTTTTTGCTGATGGAATTGTTTTTTGGGGTGCAGAAGAAAAGACCAGT 
TCCGACGAATGGTTCAGGCTCTAAGAATGTGTCACAAGAGAAGCGACCTAAAGTTGTGAA 
TGAGGTGAGAAGGAAAGTTGAGACTCTTAAGGATACAAGAGACTATTCGTTTTTGTTTTC 
CGATGACGCGGAGCTTCCTGTTCCGAAGAAGGAATCTCTTTCACGAAGTGGCTCTTTTCC 
TAATTCTGGTATGTTGTGTCTTTTGAAAAATCTTTTTCGCTATTTGTGATCTTTAAGCAT 
ACCATTTTCATGAAGATAACTTATACAGGTTTTTTGCTGATGTTCAAGAGGCTCGATCTG 
CTCAATTATCATCGAGGCCCAAACAATCATCAGGTATCAATGGTAGAACTGCTCACAGTC 
CCCATCGTGAGGAGAAGAGACCTGTTTCAGCGAATGGACATTCAAGACCGTCTTCCTCGG 
GCAGTCAAATGAATCATTCAAGACCGTCTTCCTCTGGCAGTAAAATGAATCATTCAAGAC 
CGGCTACCTCGGGCAGCCAAATGCCAAATTCAAGACCAGCTTCCTCTGGCAGCCAAATGC 
AGTCGAGAGCTGTCTCAGGCTCAGGGCGACCTGCTTCCTCAGGCAGCCAGATGCAAAATT 
CAAGACCACAAAATTCAAGACCAGCTTCCGCTGGTAGCCAAATGCAGCAAAGGCCTGCGT 
CCTCAGGCAGCCAAAGGCCTGCGTCCTCAGGCAGCCAAAGGCCTGCGTCCTCAGGCAGCC 
AAAGGCCAGGTTCGTCGACAAACCGTCAAGCACCTATGAGGCCACCAGGTTCAGGTTCCA 
CAATGAATGGTCAATCAGCCAACCGGAATGGCCAACTGAATTCCAGATCAGATTCCCGAA 
GATCAGCTCCTGCTAAAGTGCCAGTGGATCATAGGAAACAGATGAGCAGTAGCAATGGAG 
TTGGTCCTGGTCGGTCAGCGACCAATGCAAGACCTTTACCTTCTAAGAGTTCATTGGAAA 
GAAAACCCTCAATCTCGGCGGGAAAGAGTTCTCTTCAAAGCCCTCAGAGACCGTCCTCAT 
CAAGACCAATGTCATCTGATCCTAGGCAACGGGTAGTAGAACAGAGAAAGGTTTCTCGTG 
ACATGGCCACACCCCGAATGATACCTAAACAATCAGCGCCTACCTCGAAACACCAGGTAT 
CATGATCATGATCTTTCACATCTCTTTCTTTTGTCCTTCCTCTAGCCAAGGCACTAATTT 
GTCAAGTAATATTTACAGATGATGAGTAAACCAGCGCTCAAGAGACCTCCCTCGCGTGAC 
ATAGATCATGAAAGGAGGCTGTTGAAGAAGAAGAAGCCTGCAAGGTCAGAGGATCAAGAA 
GCATTCGATATGCTTAGACAGTTATTGTAAGTATTGCTCCAAACTTTCTTCCTACTCTCA 
AATTGTAAGTTACAATTTTCTAATTCTATTTTGTCTCCTGATACTTAAATGGGGGTTTGT 
GTATCAATTTTAGACCACCCAAGCGGTTTTCTCGGTATGACGATGATGACATAAACATGG 
AAGCAGGCTTTGAAGATATCCAAAAGGAAGAGAGACGAAGGTACATGAGTATTTTTGTTA 
TCACACGTTTCATTTATTTGTGTTTCTTGGATATTCCTTAACGATTGAATTGGTTGTTAA 
ATGCAGTGCGAGAATCGCAAGGGAGGAAGATGAAAGAGAACTTAAGCTCTTAGAGGAAGA 
AGAAAGGAGAGAAAGACTGAAAAAGAATCGGAAGCTGAGCCGTTAGAAGAATCCTTTCTC 
CTTTGTGTCTTTGTCTTCTTTTAGGACTTTTTTAGTGTTTTCTCATTGAAATCTCTTTGG 
CCGCTTGAGGCAAAAAAGAGTTTGACCTTTTTTTTGTTTTGTGTTTTCAAATTAAGGATC 
TTTTTTTTGTTCATGGAAATTGTACAATTAGAAATAATATCTTTTATTGGGGAC
>AT2G22720.1 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown EXPRESSED IN 24 plant structures EXPRESSED DURING 15 growth stages CONTAINS InterPro DOMAIN/s Chromatin SPT2 (InterProIPR013256) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT4G378601) Has 34349 Blast hits to 21327 proteins in 952 species Archae - 9 Bacteria - 3053 Metazoa - 16365 Fungi - 6337 Plants - 1046 Viruses - 325 Other Eukaryotes - 7214 (source NCBI BLink) 
TCCTCTTCTTCTATATCCCTTTTTATCTCTTCGCAGCTTTCTCTCCTCTCTCAGATCTAT 
ACTCTCTTCTCCTTCACAATCGTTCGTTTTATCCTCTCCATCGGATTCGTCAGATATCTT 
CGATTTCTCGTCGTTTCATCGGTTCCAGGTCAAATCCAGGTATGTCAATTGTAAAATTTC 
GATCAGATTCGTCGATTTCGATCAGCTTTTTCGATTTTGGATCTATTTTCTATGAAATAT 
CAGATCTGGTGATTGTTTTACATATTTTTGGGTTGAATTCACAAGATTTTCTGGAAACGA 
GATCGATTAATTGAGTTTTCTGTGTTTTTATCTTAAGCTAGATCTCGATTTCTATGTTTT 
TGGATTGATTTGATAAGATTTTCGAGAATTTTTTGTGTTTTTGTCAAAGTTCGATCTCGA 
TTTCTATATTTTTGGTTGAATTCACAAGACTTTCTGGAAACGAGATCGATTTTGTGAGTT 
TTCTTTGTTTTTAATCTCGATTTTTGGATTGATTTGAGAAGATTTTCTGAAAGCGAGATC 
GATGTTTTTGGGGATTTTCTTTGTTTTGTTCAATAATTCGGTCTCTGTTTTCTTATCAAA 
AAATTCGTTTTCCATCTCAAATCGATGTTCTTATTGATTTAATTGAGTTTTAGTTTGCAG 
GGATTTGATCGTTGGTAAGCTATCTTTCAGCAAACATGCATGGTTATGAAGATGTAAGCA 
CGCTCATGAATTTTTGTTTTCAGTGATTTTGTCGAATTCAATTTAAGGTAGATAGATTTG 
ACATTGTTCGATAATGTTATATTGCAGGACCTTGATGAGGAAGCTGGGTATGATGACTAT 
TACAGCGGTGATGAGGATGAGTATGAAGATGAGGAAGAGGAGGATGAAGAACCTCCTAAG 
GAAGAATTGGAATTTCTTGAGTCACGCCAAAAGTTGAAGGAATCAATTCGGAAGAAAATG 
GGAAATGGAAGTGCTAATGCTCAATCTTCACAAGAGAGAAGAAGAAAACTTCCTTATAAC 
GAGTATGTGGTGGCTAAATCACATTTTCTAATTCATTACAATGTCCTGGAATGTGTTTTG 
ATGCTGAGCTTATTGATTTTTCTTAATGCAGCTTTGGTTCTTTCTTTGGTCCTTCACGGC 
CTGTTATTTCCTCAAGGGTTATACAAGAAAGCAAATCCTTGCTTGAAAACGAGCTACGTA 
AAATGTCGAATTCGAGCCAAACTGTATGTGCATTTGATCTTTGTTACTCTTTGTATTTTT 
ATCATTTAAGATGTTTTTGCTGATGGAATTGTTTTTTGGGGTGCAGAAGAAAAGACCAGT 
TCCGACGAATGGTTCAGGCTCTAAGAATGTGTCACAAGAGAAGCGACCTAAAGTTGTGAA 
TGAGGTGAGAAGGAAAGTTGAGACTCTTAAGGATACAAGAGACTATTCGTTTTTGTTTTC 
CGATGACGCGGAGCTTCCTGTTCCGAAGAAGGAATCTCTTTCACGAAGTGGCTCTTTTCC 
TAATTCTGGTATGTTGTGTCTTTTGAAAAATCTTTTTCGCTATTTGTGATCTTTAAGCAT 
ACCATTTTCATGAAGATAACTTATACAGGTTTTTTGCTGATGTTCAAGAGGCTCGATCTG 
CTCAATTATCATCGAGGCCCAAACAATCATCAGGTATCAATGGTAGAACTGCTCACAGTC 
CCCATCGTGAGGAGAAGAGACCTGTTTCAGCGAATGGACATTCAAGACCGTCTTCCTCGG 
GCAGTCAAATGAATCATTCAAGACCGTCTTCCTCTGGCAGTAAAATGAATCATTCAAGAC 
CGGCTACCTCGGGCAGCCAAATGCCAAATTCAAGACCAGCTTCCTCTGGCAGCCAAATGC 
AGTCGAGAGCTGTCTCAGGCTCAGGGCGACCTGCTTCCTCAGGCAGCCAGATGCAAAATT 
CAAGACCACAAAATTCAAGACCAGCTTCCGCTGGTAGCCAAATGCAGCAAAGGCCTGCGT 
CCTCAGGCAGCCAAAGGCCTGCGTCCTCAGGCAGCCAAAGGCCTGCGTCCTCAGGCAGCC 
AAAGGCCAGGTTCGTCGACAAACCGTCAAGCACCTATGAGGCCACCAGGTTCAGGTTCCA 
CAATGAATGGTCAATCAGCCAACCGGAATGGCCAACTGAATTCCAGATCAGATTCCCGAA 
GATCAGCTCCTGCTAAAGTGCCAGTGGATCATAGGAAACAGATGAGCAGTAGCAATGGAG 
TTGGTCCTGGTCGGTCAGCGACCAATGCAAGACCTTTACCTTCTAAGAGTTCATTGGAAA 
GAAAACCCTCAATCTCGGCGGGAAAGAGTTCTCTTCAAAGCCCTCAGAGACCGTCCTCAT 
CAAGACCAATGTCATCTGATCCTAGGCAACGGGTAGTAGAACAGAGAAAGGTTTCTCGTG 
ACATGGCCACACCCCGAATGATACCTAAACAATCAGCGCCTACCTCGAAACACCAGGTAT 
CATGATCATGATCTTTCACATCTCTTTCTTTTGTCCTTCCTCTAGCCAAGGCACTAATTT 
GTCAAGTAATATTTACAGATGATGAGTAAACCAGCGCTCAAGAGACCTCCCTCGCGTGAC 
ATAGATCATGAAAGGAGGCTGTTGAAGAAGAAGAAGCCTGCAAGGTCAGAGGATCAAGAA 
GCATTCGATATGCTTAGACAGTTATTGTAAGTATTGCTCCAAACTTTCTTCCTACTCTCA 
AATTGTAAGTTACAATTTTCTAATTCTATTTTGTCTCCTGATACTTAAATGGGGGTTTGT 
GTATCAATTTTAGACCACCCAAGCGGTTTTCTCGGTATGACGATGATGACATAAACATGG 
AAGCAGGCTTTGAAGATATCCAAAAGGAAGAGAGACGAAGGTACATGAGTATTTTTGTTA 
TCACACGTTTCATTTATTTGTGTTTCTTGGATATTCCTTAACGATTGAATTGGTTGTTAA 
ATGCAGTGCGAGAATCGCAAGGGAGGAAGATGAAAGAGAACTTAAGCTCTTAGAGGAAGA 
AGAAAGGAGAGAAAGACTGAAAAAGAATCGGAAGCTGAGCCGTTAGAAGAATCCTTTCTC 
CTTTGTGTCTTTGTCTTCTTTTAGGACTTTTTTAGTGTTTTCTCATTGAAATCTCTTTGG 
CCGCTTGAGGCAAAAAAGAGTTTGACCTTTTTTTTGTTTTGTGTTTTCAAATTAAGGATC 
TTTTTTTTGTTCATGGAAATTGTACAATTAGAAATAATATCTTTTATTGGGGAC
>AT2G22720.3 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown EXPRESSED IN 24 plant structures EXPRESSED DURING 15 growth stages CONTAINS InterPro DOMAIN/s Chromatin SPT2 (InterProIPR013256) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT4G378601) Has 34422 Blast hits to 21553 proteins in 958 species Archae - 9 Bacteria - 2953 Metazoa - 16492 Fungi - 6405 Plants - 1061 Viruses - 320 Other Eukaryotes - 7182 (source NCBI BLink) 
GAGAGCACCCCTTTCCTCTTCTTCTATATCCCTTTTTATCTCTTCGCAGCTTTCTCTCCT 
CTCTCAGATCTATACTCTCTTCTCCTTCACAATCGTTCGTTTTATCCTCTCCATCGGATT 
CGTCAGATATCTTCGATTTCTCGTCGTTTCATCGGTTCCAGGTCAAATCCAGGTATGTCA 
ATTGTAAAATTTCGATCAGATTCGTCGATTTCGATCAGCTTTTTCGATTTTGGATCTATT 
TTCTATGAAATATCAGATCTGGTGATTGTTTTACATATTTTTGGGTTGAATTCACAAGAT 
TTTCTGGAAACGAGATCGATTAATTGAGTTTTCTGTGTTTTTATCTTAAGCTAGATCTCG 
ATTTCTATGTTTTTGGATTGATTTGATAAGATTTTCGAGAATTTTTTGTGTTTTTGTCAA 
AGTTCGATCTCGATTTCTATATTTTTGGTTGAATTCACAAGACTTTCTGGAAACGAGATC 
GATTTTGTGAGTTTTCTTTGTTTTTAATCTCGATTTTTGGATTGATTTGAGAAGATTTTC 
TGAAAGCGAGATCGATGTTTTTGGGGATTTTCTTTGTTTTGTTCAATAATTCGGTCTCTG 
TTTTCTTATCAAAAAATTCGTTTTCCATCTCAAATCGATGTTCTTATTGATTTAATTGAG 
TTTTAGTTTGCAGGGATTTGATCGTTGGTAAGCTATCTTTCAGCAAACATGCATGGTTAT 
GAAGATGTAAGCACGCTCATGAATTTTTGTTTTCAGTGATTTTGTCGAATTCAATTTAAG 
GTAGATAGATTTGACATTGTTCGATAATGTTATATTGCAGGACCTTGATGAGGAAGCTGG 
GTATGATGACTATTACAGCGGTGATGAGGATGAGTATGAAGATGAGGAAGAGGAGGATGA 
AGAACCTCCTAAGGAAGAATTGGAATTTCTTGAGTCACGCCAAAAGTTGAAGGAATCAAT 
TCGGAAGAAAATGGGAAATGGAAGTGCTAATGCTCAATCTTCACAAGAGAGAAGAAGAAA 
ACTTCCTTATAACGAGTATGTGGTGGCTAAATCACATTTTCTAATTCATTACAATGTCCT 
GGAATGTGTTTTGATGCTGAGCTTATTGATTTTTCTTAATGCAGCTTTGGTTCTTTCTTT 
GGTCCTTCACGGCCTGTTATTTCCTCAAGGGTTATACAAGAAAGCAAATCCTTGCTTGAA 
AACGAGCTACGTAAAATGTCGAATTCGAGCCAAACTGTATGTGCATTTGATCTTTGTTAC 
TCTTTGTATTTTTATCATTTAAGATGTTTTTGCTGATGGAATTGTTTTTTGGGGTGCAGA 
AGAAAAGACCAGTTCCGACGAATGGTTCAGGCTCTAAGAATGTGTCACAAGAGAAGCGAC 
CTAAAGTTGTGAATGAGGTGAGAAGGAAAGTTGAGACTCTTAAGGATACAAGAGACTATT 
CGTTTTTGTTTTCCGATGACGCGGAGCTTCCTGTTCCGAAGAAGGAATCTCTTTCACGAA 
GTGGCTCTTTTCCTAATTCTGGTATGTTGTGTCTTTTGAAAAATCTTTTTCGCTATTTGT 
GATCTTTAAGCATACCATTTTCATGAAGATAACTTATACAGGTTTTTTGCTGATGTTCAA 
GAGGCTCGATCTGCTCAATTATCATCGAGGCCCAAACAATCATCAGGTATCAATGGTAGA 
ACTGCTCACAGTCCCCATCGTGAGGAGAAGAGACCTGTTTCAGCGAATGGACATTCAAGA 
CCGTCTTCCTCGGGCAGTCAAATGAATCATTCAAGACCGTCTTCCTCTGGCAGTAAAATG 
AATCATTCAAGACCGGCTACCTCGGGCAGCCAAATGCCAAATTCAAGACCAGCTTCCTCT 
GGCAGCCAAATGCAGTCGAGAGCTGTCTCAGGCTCAGGGCGACCTGCTTCCTCAGGCAGC 
CAGATGCAAAATTCAAGACCACAAAATTCAAGACCAGCTTCCGCTGGTAGCCAAATGCAG 
CAAAGGCCTGCGTCCTCAGGCAGCCAAAGGCCTGCGTCCTCAGGCAGCCAAAGGCCTGCG 
TCCTCAGGCAGCCAAAGGCCAGGTTCGTCGACAAACCGTCAAGCACCTATGAGGCCACCA 
GGTTCAGGTTCCACAATGAATGGTCAATCAGCCAACCGGAATGGCCAACTGAATTCCAGA 
TCAGATTCCCGAAGATCAGCTCCTGCTAAAGTGCCAGTGGATCATAGGAAACAGATGAGC 
AGTAGCAATGGAGTTGGTCCTGGTCGGTCAGCGACCAATGCAAGACCTTTACCTTCTAAG 
AGTTCATTGGAAAGAAAACCCTCAATCTCGGCGGGAAAGAGTTCTCTTCAAAGCCCTCAG 
AGACCGTCCTCATCAAGACCAATGTCATCTGATCCTAGGCAACGGGTAGTAGAACAGAGA 
AAGGTTTCTCGTGACATGGCCACACCCCGAATGATACCTAAACAATCAGCGCCTACCTCG 
AAACACCAGGTATCATGATCATGATCTTTCACATCTCTTTCTTTTGTCCTTCCTCTAGCC 
AAGGCACTAATTTGTCAAGTAATATTTACAGATGATGAGTAAACCAGCGCTCAAGAGACC 
TCCCTCGCGTGACATAGATCATGAAAGGAGGCTGTTGAAGAAGAAGAAGCCTGCAAGGTC 
AGAGGATCAAGAAGCATTCGATATGCTTAGACAGTTATTGTAAGTATTGCTCCAAACTTT 
CTTCCTACTCTCAAATTGTAAGTTACAATTTTCTAATTCTATTTTGTCTCCTGATACTTA 
AATGGGGGTTTGTGTATCAATTTTAGACCACCCAAGCGGTTTTCTCGGTATGACGATGAT 
GACATAAACATGGAAGCAGGCTTTGAAGATATCCAAAAGGAAGAGAGACGAAGGTACATG 
AGTATTTTTGTTATCACACGTTTCATTTATTTGTGTTTCTTGGATATTCCTTAACGATTG 
AATTGGTTGTTAAATGCAGTGCGAGAATCGCAAGGGAGGAAGATGAAAGAGAACTTAAGC 
TCTTAGAGGAAGAAGAAAGGAGAGAAAGACTGAAAAAGAATCGGAAGCTGAGCCGTTAGA 
AGAATCCTTTCTCCTTTGTGTCTTTGTCTTCTTTTAGGACTTTTTTAGTGTTTTCTCATT 
GAAATCTCTTTGGCCGCTTGAGGCAAAAAAGAGTTTGACCTTTTTTTTGTTTTGTGTTTT 
CAAATTAAGGATCTTTTTTTTGTTCATGGAAATTGTACAATTAGAAATAATATCTTTTAT 
TGGGGACACTTCAAGAAGAATCTGTTGGAAACCTTCCCAGTTAGTGAAAGCTTGATTCTC 
TTT
>AT2G22720.3 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown LOCATED IN chloroplast EXPRESSED IN 24 plant structures EXPRESSED DURING 15 growth stages CONTAINS InterPro DOMAIN/s Chromatin SPT2 (InterProIPR013256) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT4G378601) Has 18810 Blast hits to 9739 proteins in 609 species Archae - 9 Bacteria - 1424 Metazoa - 8282 Fungi - 2696 Plants - 486 Viruses - 222 Other Eukaryotes - 5691 (source NCBI BLink) 
GAGAGCACCCCTTTCCTCTTCTTCTATATCCCTTTTTATCTCTTCGCAGCTTTCTCTCCT 
CTCTCAGATCTATACTCTCTTCTCCTTCACAATCGTTCGTTTTATCCTCTCCATCGGATT 
CGTCAGATATCTTCGATTTCTCGTCGTTTCATCGGTTCCAGGTCAAATCCAGGTATGTCA 
ATTGTAAAATTTCGATCAGATTCGTCGATTTCGATCAGCTTTTTCGATTTTGGATCTATT 
TTCTATGAAATATCAGATCTGGTGATTGTTTTACATATTTTTGGGTTGAATTCACAAGAT 
TTTCTGGAAACGAGATCGATTAATTGAGTTTTCTGTGTTTTTATCTTAAGCTAGATCTCG 
ATTTCTATGTTTTTGGATTGATTTGATAAGATTTTCGAGAATTTTTTGTGTTTTTGTCAA 
AGTTCGATCTCGATTTCTATATTTTTGGTTGAATTCACAAGACTTTCTGGAAACGAGATC 
GATTTTGTGAGTTTTCTTTGTTTTTAATCTCGATTTTTGGATTGATTTGAGAAGATTTTC 
TGAAAGCGAGATCGATGTTTTTGGGGATTTTCTTTGTTTTGTTCAATAATTCGGTCTCTG 
TTTTCTTATCAAAAAATTCGTTTTCCATCTCAAATCGATGTTCTTATTGATTTAATTGAG 
TTTTAGTTTGCAGGGATTTGATCGTTGGTAAGCTATCTTTCAGCAAACATGCATGGTTAT 
GAAGATGTAAGCACGCTCATGAATTTTTGTTTTCAGTGATTTTGTCGAATTCAATTTAAG 
GTAGATAGATTTGACATTGTTCGATAATGTTATATTGCAGGACCTTGATGAGGAAGCTGG 
GTATGATGACTATTACAGCGGTGATGAGGATGAGTATGAAGATGAGGAAGAGGAGGATGA 
AGAACCTCCTAAGGAAGAATTGGAATTTCTTGAGTCACGCCAAAAGTTGAAGGAATCAAT 
TCGGAAGAAAATGGGAAATGGAAGTGCTAATGCTCAATCTTCACAAGAGAGAAGAAGAAA 
ACTTCCTTATAACGAGTATGTGGTGGCTAAATCACATTTTCTAATTCATTACAATGTCCT 
GGAATGTGTTTTGATGCTGAGCTTATTGATTTTTCTTAATGCAGCTTTGGTTCTTTCTTT 
GGTCCTTCACGGCCTGTTATTTCCTCAAGGGTTATACAAGAAAGCAAATCCTTGCTTGAA 
AACGAGCTACGTAAAATGTCGAATTCGAGCCAAACTGTATGTGCATTTGATCTTTGTTAC 
TCTTTGTATTTTTATCATTTAAGATGTTTTTGCTGATGGAATTGTTTTTTGGGGTGCAGA 
AGAAAAGACCAGTTCCGACGAATGGTTCAGGCTCTAAGAATGTGTCACAAGAGAAGCGAC 
CTAAAGTTGTGAATGAGGTGAGAAGGAAAGTTGAGACTCTTAAGGATACAAGAGACTATT 
CGTTTTTGTTTTCCGATGACGCGGAGCTTCCTGTTCCGAAGAAGGAATCTCTTTCACGAA 
GTGGCTCTTTTCCTAATTCTGGTATGTTGTGTCTTTTGAAAAATCTTTTTCGCTATTTGT 
GATCTTTAAGCATACCATTTTCATGAAGATAACTTATACAGGTTTTTTGCTGATGTTCAA 
GAGGCTCGATCTGCTCAATTATCATCGAGGCCCAAACAATCATCAGGTATCAATGGTAGA 
ACTGCTCACAGTCCCCATCGTGAGGAGAAGAGACCTGTTTCAGCGAATGGACATTCAAGA 
CCGTCTTCCTCGGGCAGTCAAATGAATCATTCAAGACCGTCTTCCTCTGGCAGTAAAATG 
AATCATTCAAGACCGGCTACCTCGGGCAGCCAAATGCCAAATTCAAGACCAGCTTCCTCT 
GGCAGCCAAATGCAGTCGAGAGCTGTCTCAGGCTCAGGGCGACCTGCTTCCTCAGGCAGC 
CAGATGCAAAATTCAAGACCACAAAATTCAAGACCAGCTTCCGCTGGTAGCCAAATGCAG 
CAAAGGCCTGCGTCCTCAGGCAGCCAAAGGCCTGCGTCCTCAGGCAGCCAAAGGCCTGCG 
TCCTCAGGCAGCCAAAGGCCAGGTTCGTCGACAAACCGTCAAGCACCTATGAGGCCACCA 
GGTTCAGGTTCCACAATGAATGGTCAATCAGCCAACCGGAATGGCCAACTGAATTCCAGA 
TCAGATTCCCGAAGATCAGCTCCTGCTAAAGTGCCAGTGGATCATAGGAAACAGATGAGC 
AGTAGCAATGGAGTTGGTCCTGGTCGGTCAGCGACCAATGCAAGACCTTTACCTTCTAAG 
AGTTCATTGGAAAGAAAACCCTCAATCTCGGCGGGAAAGAGTTCTCTTCAAAGCCCTCAG 
AGACCGTCCTCATCAAGACCAATGTCATCTGATCCTAGGCAACGGGTAGTAGAACAGAGA 
AAGGTTTCTCGTGACATGGCCACACCCCGAATGATACCTAAACAATCAGCGCCTACCTCG 
AAACACCAGGTATCATGATCATGATCTTTCACATCTCTTTCTTTTGTCCTTCCTCTAGCC 
AAGGCACTAATTTGTCAAGTAATATTTACAGATGATGAGTAAACCAGCGCTCAAGAGACC 
TCCCTCGCGTGACATAGATCATGAAAGGAGGCTGTTGAAGAAGAAGAAGCCTGCAAGGTC 
AGAGGATCAAGAAGCATTCGATATGCTTAGACAGTTATTGTAAGTATTGCTCCAAACTTT 
CTTCCTACTCTCAAATTGTAAGTTACAATTTTCTAATTCTATTTTGTCTCCTGATACTTA 
AATGGGGGTTTGTGTATCAATTTTAGACCACCCAAGCGGTTTTCTCGGTATGACGATGAT 
GACATAAACATGGAAGCAGGCTTTGAAGATATCCAAAAGGAAGAGAGACGAAGGTACATG 
AGTATTTTTGTTATCACACGTTTCATTTATTTGTGTTTCTTGGATATTCCTTAACGATTG 
AATTGGTTGTTAAATGCAGTGCGAGAATCGCAAGGGAGGAAGATGAAAGAGAACTTAAGC 
TCTTAGAGGAAGAAGAAAGGAGAGAAAGACTGAAAAAGAATCGGAAGCTGAGCCGTTAGA 
AGAATCCTTTCTCCTTTGTGTCTTTGTCTTCTTTTAGGACTTTTTTAGTGTTTTCTCATT 
GAAATCTCTTTGGCCGCTTGAGGCAAAAAAGAGTTTGACCTTTTTTTTGTTTTGTGTTTT 
CAAATTAAGGATCTTTTTTTTGTTCATGGAAATTGTACAATTAGAAATAATATCTTTTAT 
TGGGGACACTTCAAGAAGAATCTGTTGGAAACCTTCCCAGTTAGTGAAAGCTTGATTCTC 
TTT
>AT2G22720.3 |  FUNCTIONS IN molecular_function unknown INVOLVED IN biological_process unknown EXPRESSED IN 24 plant structures EXPRESSED DURING 15 growth stages CONTAINS InterPro DOMAIN/s Chromatin SPT2 (InterProIPR013256) BEST Arabidopsis thaliana protein match is unknown protein (TAIRAT4G378601) Has 34349 Blast hits to 21327 proteins in 952 species Archae - 9 Bacteria - 3053 Metazoa - 16365 Fungi - 6337 Plants - 1046 Viruses - 325 Other Eukaryotes - 7214 (source NCBI BLink) 
GAGAGCACCCCTTTCCTCTTCTTCTATATCCCTTTTTATCTCTTCGCAGCTTTCTCTCCT 
CTCTCAGATCTATACTCTCTTCTCCTTCACAATCGTTCGTTTTATCCTCTCCATCGGATT 
CGTCAGATATCTTCGATTTCTCGTCGTTTCATCGGTTCCAGGTCAAATCCAGGTATGTCA 
ATTGTAAAATTTCGATCAGATTCGTCGATTTCGATCAGCTTTTTCGATTTTGGATCTATT 
TTCTATGAAATATCAGATCTGGTGATTGTTTTACATATTTTTGGGTTGAATTCACAAGAT 
TTTCTGGAAACGAGATCGATTAATTGAGTTTTCTGTGTTTTTATCTTAAGCTAGATCTCG 
ATTTCTATGTTTTTGGATTGATTTGATAAGATTTTCGAGAATTTTTTGTGTTTTTGTCAA 
AGTTCGATCTCGATTTCTATATTTTTGGTTGAATTCACAAGACTTTCTGGAAACGAGATC 
GATTTTGTGAGTTTTCTTTGTTTTTAATCTCGATTTTTGGATTGATTTGAGAAGATTTTC 
TGAAAGCGAGATCGATGTTTTTGGGGATTTTCTTTGTTTTGTTCAATAATTCGGTCTCTG 
TTTTCTTATCAAAAAATTCGTTTTCCATCTCAAATCGATGTTCTTATTGATTTAATTGAG 
TTTTAGTTTGCAGGGATTTGATCGTTGGTAAGCTATCTTTCAGCAAACATGCATGGTTAT 
GAAGATGTAAGCACGCTCATGAATTTTTGTTTTCAGTGATTTTGTCGAATTCAATTTAAG 
GTAGATAGATTTGACATTGTTCGATAATGTTATATTGCAGGACCTTGATGAGGAAGCTGG 
GTATGATGACTATTACAGCGGTGATGAGGATGAGTATGAAGATGAGGAAGAGGAGGATGA 
AGAACCTCCTAAGGAAGAATTGGAATTTCTTGAGTCACGCCAAAAGTTGAAGGAATCAAT 
TCGGAAGAAAATGGGAAATGGAAGTGCTAATGCTCAATCTTCACAAGAGAGAAGAAGAAA 
ACTTCCTTATAACGAGTATGTGGTGGCTAAATCACATTTTCTAATTCATTACAATGTCCT 
GGAATGTGTTTTGATGCTGAGCTTATTGATTTTTCTTAATGCAGCTTTGGTTCTTTCTTT 
GGTCCTTCACGGCCTGTTATTTCCTCAAGGGTTATACAAGAAAGCAAATCCTTGCTTGAA 
AACGAGCTACGTAAAATGTCGAATTCGAGCCAAACTGTATGTGCATTTGATCTTTGTTAC 
TCTTTGTATTTTTATCATTTAAGATGTTTTTGCTGATGGAATTGTTTTTTGGGGTGCAGA 
AGAAAAGACCAGTTCCGACGAATGGTTCAGGCTCTAAGAATGTGTCACAAGAGAAGCGAC 
CTAAAGTTGTGAATGAGGTGAGAAGGAAAGTTGAGACTCTTAAGGATACAAGAGACTATT 
CGTTTTTGTTTTCCGATGACGCGGAGCTTCCTGTTCCGAAGAAGGAATCTCTTTCACGAA 
GTGGCTCTTTTCCTAATTCTGGTATGTTGTGTCTTTTGAAAAATCTTTTTCGCTATTTGT 
GATCTTTAAGCATACCATTTTCATGAAGATAACTTATACAGGTTTTTTGCTGATGTTCAA 
GAGGCTCGATCTGCTCAATTATCATCGAGGCCCAAACAATCATCAGGTATCAATGGTAGA 
ACTGCTCACAGTCCCCATCGTGAGGAGAAGAGACCTGTTTCAGCGAATGGACATTCAAGA 
CCGTCTTCCTCGGGCAGTCAAATGAATCATTCAAGACCGTCTTCCTCTGGCAGTAAAATG 
AATCATTCAAGACCGGCTACCTCGGGCAGCCAAATGCCAAATTCAAGACCAGCTTCCTCT 
GGCAGCCAAATGCAGTCGAGAGCTGTCTCAGGCTCAGGGCGACCTGCTTCCTCAGGCAGC 
CAGATGCAAAATTCAAGACCACAAAATTCAAGACCAGCTTCCGCTGGTAGCCAAATGCAG 
CAAAGGCCTGCGTCCTCAGGCAGCCAAAGGCCTGCGTCCTCAGGCAGCCAAAGGCCTGCG 
TCCTCAGGCAGCCAAAGGCCAGGTTCGTCGACAAACCGTCAAGCACCTATGAGGCCACCA 
GGTTCAGGTTCCACAATGAATGGTCAATCAGCCAACCGGAATGGCCAACTGAATTCCAGA 
TCAGATTCCCGAAGATCAGCTCCTGCTAAAGTGCCAGTGGATCATAGGAAACAGATGAGC 
AGTAGCAATGGAGTTGGTCCTGGTCGGTCAGCGACCAATGCAAGACCTTTACCTTCTAAG 
AGTTCATTGGAAAGAAAACCCTCAATCTCGGCGGGAAAGAGTTCTCTTCAAAGCCCTCAG 
AGACCGTCCTCATCAAGACCAATGTCATCTGATCCTAGGCAACGGGTAGTAGAACAGAGA 
AAGGTTTCTCGTGACATGGCCACACCCCGAATGATACCTAAACAATCAGCGCCTACCTCG 
AAACACCAGGTATCATGATCATGATCTTTCACATCTCTTTCTTTTGTCCTTCCTCTAGCC 
AAGGCACTAATTTGTCAAGTAATATTTACAGATGATGAGTAAACCAGCGCTCAAGAGACC 
TCCCTCGCGTGACATAGATCATGAAAGGAGGCTGTTGAAGAAGAAGAAGCCTGCAAGGTC 
AGAGGATCAAGAAGCATTCGATATGCTTAGACAGTTATTGTAAGTATTGCTCCAAACTTT 
CTTCCTACTCTCAAATTGTAAGTTACAATTTTCTAATTCTATTTTGTCTCCTGATACTTA 
AATGGGGGTTTGTGTATCAATTTTAGACCACCCAAGCGGTTTTCTCGGTATGACGATGAT 
GACATAAACATGGAAGCAGGCTTTGAAGATATCCAAAAGGAAGAGAGACGAAGGTACATG 
AGTATTTTTGTTATCACACGTTTCATTTATTTGTGTTTCTTGGATATTCCTTAACGATTG 
AATTGGTTGTTAAATGCAGTGCGAGAATCGCAAGGGAGGAAGATGAAAGAGAACTTAAGC 
TCTTAGAGGAAGAAGAAAGGAGAGAAAGACTGAAAAAGAATCGGAAGCTGAGCCGTTAGA 
AGAATCCTTTCTCCTTTGTGTCTTTGTCTTCTTTTAGGACTTTTTTAGTGTTTTCTCATT 
GAAATCTCTTTGGCCGCTTGAGGCAAAAAAGAGTTTGACCTTTTTTTTGTTTTGTGTTTT 
CAAATTAAGGATCTTTTTTTTGTTCATGGAAATTGTACAATTAGAAATAATATCTTTTAT 
TGGGGACACTTCAAGAAGAATCTGTTGGAAACCTTCCCAGTTAGTGAAAGCTTGATTCTC 
TTT