>AT2G37210.1 |  Encodes a protein of unknown function. It has been crystallized and shown to be structurally almost identical to the protein encoded by At5g11950.
MEIKGESMQKSKFRRICVFCGSSQGKKSSYQDAAVDLGNELVSRNIDLVYGGGSIGLMGL 
VSQAVHDGGRHVIGIIPKTLMPRELTGETVGEVRAVADMHQRKAEMAKHSDAFIALPGGY 
GTLEELLEVITWAQLGIHDKPVGLLNVDGYYNSLLSFIDKAVEEGFISPTAREIIVSAPT 
AKELVKKLEEYAPCHERVATKLCWEMERIGYSSEE*
>AT1G37130.1 |  NIA2 (NITRATE REDUCTASE 2) nitrate reductase (NADH)/ nitrate reductase 
MAASVDNRQYARLEPGLNGVVRSYKPPVPGRSDSPKAHQNQTTNQTVFLKPAKVHDDDED 
VSSEDENETHNSNAVYYKEMIRKSNAELEPSVLDPRDEYTADSWIERNPSMVRLTGKHPF 
NSEAPLNRLMHHGFITPVPLHYVRNHGHVPKAQWAEWTVEVTGFVKRPMKFTMDQLVSEF 
AYREFAATLVCAGNRRKEQNMVKKSKGFNWGSAGVSTSVWRGVPLCDVLRRCGIFSRKGG 
ALNVCFEGSEDLPGGAGTAGSKYGTSIKKEYAMDPSRDIILAYMQNGEYLTPDHGFPVRI 
IIPGFIGGRMVKWLKRIIVTTKESDNFYHFKDNRVLPSLVDAELADEEGWWYKPEYIINE 
LNINSVITTPCHEEILPINAFTTQRPYTLKGYAYSGGGKKVTRVEVTVDGGETWNVCALD 
HQEKPNKYGKFWCWCFWSLEVEVLDLLSAKEIAVRAWDETLNTQPEKMIWNLMGMMNNCW 
FRVKTNVCKPHKGEIGIVFEHPTLPGNESGGWMAKERHLEKSADAPPSLKKSVSTPFMNT 
TAKMYSMSEVKKHNSADSCWIIVHGHIYDCTRFLMDHPGGSDSILINAGTDCTEEFEAIH 
SDKAKKMLEDYRIGELITTGYSSDSSSPNNSVHGSSAVFSLLAPIGEATPVRNLALVNPR 
AKVPVQLVEKTSISHDVRKFRFALPVEDMVLGLPVGKHIFLCATINDKLCLRAYTPSSTV 
DVVGYFELVVKIYFGGVHPRFPNGGLMSQYLDSLPIGSTLEIKGPLGHVEYLGKGSFTVH 
GKPKFADKLAMLAGGTGITPVYQIIQAILKDPEDETEMYVIYANRTEEDILLREELDGWA 
EQYPDRLKVWYVVESAKEGWAYSTGFISEAIMREHIPDGLDGSALAMACGPPPMIQFAVQ 
PNLEKMQYNIKEDFLIF*
>AT5G20990.1 |  B73 molybdenum ion binding 
MEGQGCCGGGGGKTEMIPTEEALRIVFGVSKRLPPVIVSLYEALGKVLAEDIRAPDPLPP 
YPASVKDGYAVVASDGPGEYPVITESRAGNDGLGVTVTPGTVAYVTTGGPIPDGADAVVQ 
VEDTKVIGDVSTESKRVKILIQTKKGTDIRRVGCDIEKDATVLTTGERIGASEIGLLATA 
GVTMVKVYPMPIVAILSTGDELVEPTAGTLGRGQIRDSNRAMLVAAVMQQQCKVVDLGIV 
RDDRKELEKVLDEAVSSGVDIILTSGGVSMGDRDFVKPLLEEKGKVYFSKVLMKPGKPLT 
FAEIRAKPTESMLGKTVLAFGLPGNPVSCLVCFNIFVVPTIRQLAGWTSPHPLRVRLRLQ 
EPIKSDPIRPEFHRAIIKWKDNDGSGTPGFVAESTGHQMSSRLLSMRSANALLELPATGN 
VLSAGSSVSAIIVSDISAFSIDKKASLSEPGSIRKEKKYDEVPGPEYKVAILTVSDTVSA 
GAGPDRSGPRAVSVVDSSSEKLGGAKVVATAVVPDEVERIKDILQKWSDVDEMDLILTLG 
GTGFTPRDVTPEATKKVIERETPGLLFVMMQESLKITPFAMLSRSAAGIRGSTLIINMPG 
NPNAVAECMEALLPALKHALKQIKGDKREKHPKHIPHAEATLPTDTWDQSYKSAYETGEK 
KEEAGCSCTH*
>AT2G28305.1 |  unknown protein 
MEIESKFKRICVFCGSSAGNKVSYKDAAIELGTELVSRNIDLVYGGGSIGLMGLISQAVF 
NGGRHVIGVIPKTLMPREITGETVGEVKAVADMHQRKAEMAKHSDAFIALPGGYGTLEEL 
LEVITWAQLGIHDKPVGLLNVEGYYNSLLSFIDKAVEEGFISPTARHIIVSAPSAKELVK 
KLEDYVPRHEKVASKKSWEMEQIGLSPTCEISR*
>AT2G35990.1 |  FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; EXPRESSED IN: 15 plant structures; EXPRESSED DURING: 8 growth stages; CONTAINS InterPro DOMAIN/s: Conserved hypothetical protein CHP00730 (InterPro:IPR005269); BEST Arabidopsis thaliana protein match is: carboxy-lyase (TAIR:AT5G06300.1); Has 3002 Blast hits to 3001 proteins in 744 species: Archae - 8; Bacteria - 1757; Metazoa - 10; Fungi - 79; Plants - 185; Viruses - 0; Other Eukaryotes - 963 (source: NCBI BLink).
MEETKSRFRRICVFCGSSSGNKTTYHDAALQLAHQLVERNIDLVYGGGSVGLMGLISQAV 
HDGGRHVLGIIPKSLAPREITGESIGEVITVSTMHQRKAEMGRQADAFIALPGGYGTFEE 
LLEVITWSQLGIHTKPVGLLNVDGFYDSLLTFIDKAVDEGFVSSTARRIIVSAPNAPQLL 
QLLEEYVPKHDDFVSKMVWDNTTDAFTLEGDSF*
>AT2G35990.1 |  FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; EXPRESSED IN: 15 plant structures; EXPRESSED DURING: 8 growth stages; CONTAINS InterPro DOMAIN/s: Conserved hypothetical protein CHP00730 (InterPro:IPR005269); BEST Arabidopsis thaliana protein match is: carboxy-lyase (TAIR:AT5G06300.1); Has 2269 Blast hits to 2268 proteins in 637 species: Archae - 6; Bacteria - 1228; Metazoa - 8; Fungi - 85; Plants - 183; Viruses - 0; Other Eukaryotes - 759 (source: NCBI BLink).
MEETKSRFRRICVFCGSSSGNKTTYHDAALQLAHQLVERNIDLVYGGGSVGLMGLISQAV 
HDGGRHVLGIIPKSLAPREITGESIGEVITVSTMHQRKAEMGRQADAFIALPGGYGTFEE 
LLEVITWSQLGIHTKPVGLLNVDGFYDSLLTFIDKAVDEGFVSSTARRIIVSAPNAPQLL 
QLLEEYVPKHDDFVSKMVWDNTTDAFTLEGDSF*
>AT2G35990.1 |  FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; EXPRESSED IN: 15 plant structures; EXPRESSED DURING: 8 growth stages; CONTAINS InterPro DOMAIN/s: Conserved hypothetical protein CHP00730 (InterPro:IPR005269); BEST Arabidopsis thaliana protein match is: carboxy-lyase (TAIR:AT5G06300.1).
MEETKSRFRRICVFCGSSSGNKTTYHDAALQLAHQLVERNIDLVYGGGSVGLMGLISQAV 
HDGGRHVLGIIPKSLAPREITGESIGEVITVSTMHQRKAEMGRQADAFIALPGGYGTFEE 
LLEVITWSQLGIHTKPVGLLNVDGFYDSLLTFIDKAVDEGFVSSTARRIIVSAPNAPQLL 
QLLEEYVPKHDDFVSKMVWDNTTDAFTLEGDSF*
>AT2G35990.2 |  FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; EXPRESSED IN: 15 plant structures; EXPRESSED DURING: 8 growth stages; CONTAINS InterPro DOMAIN/s: Conserved hypothetical protein CHP00730 (InterPro:IPR005269); BEST Arabidopsis thaliana protein match is: carboxy-lyase (TAIR:AT5G06300.1); Has 3002 Blast hits to 3001 proteins in 744 species: Archae - 8; Bacteria - 1757; Metazoa - 10; Fungi - 79; Plants - 185; Viruses - 0; Other Eukaryotes - 963 (source: NCBI BLink).
MGLISQAVHDGGRHVLGIIPKSLAPREITGESIGEVITVSTMHQRKAEMGRQADAFIALP 
GGYGTFEELLEVITWSQLGIHTKPVGLLNVDGFYDSLLTFIDKAVDEGFVSSTARRIIVS 
APNAPQLLQLLEEYVPKHDDFVSKMVWDNTTDAFTLEGDSF*
>AT2G35990.2 |  FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; EXPRESSED IN: 15 plant structures; EXPRESSED DURING: 8 growth stages; CONTAINS InterPro DOMAIN/s: Conserved hypothetical protein CHP00730 (InterPro:IPR005269); BEST Arabidopsis thaliana protein match is: carboxy-lyase (TAIR:AT5G06300.1); Has 2269 Blast hits to 2268 proteins in 637 species: Archae - 6; Bacteria - 1228; Metazoa - 8; Fungi - 85; Plants - 183; Viruses - 0; Other Eukaryotes - 759 (source: NCBI BLink).
MGLISQAVHDGGRHVLGIIPKSLAPREITGESIGEVITVSTMHQRKAEMGRQADAFIALP 
GGYGTFEELLEVITWSQLGIHTKPVGLLNVDGFYDSLLTFIDKAVDEGFVSSTARRIIVS 
APNAPQLLQLLEEYVPKHDDFVSKMVWDNTTDAFTLEGDSF*
>AT2G35990.2 |  FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; EXPRESSED IN: 15 plant structures; EXPRESSED DURING: 8 growth stages; CONTAINS InterPro DOMAIN/s: Conserved hypothetical protein CHP00730 (InterPro:IPR005269); BEST Arabidopsis thaliana protein match is: carboxy-lyase (TAIR:AT5G06300.1).
MGLISQAVHDGGRHVLGIIPKSLAPREITGESIGEVITVSTMHQRKAEMGRQADAFIALP 
GGYGTFEELLEVITWSQLGIHTKPVGLLNVDGFYDSLLTFIDKAVDEGFVSSTARRIIVS 
APNAPQLLQLLEEYVPKHDDFVSKMVWDNTTDAFTLEGDSF*
>AT2G35990.3 |  FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; EXPRESSED IN: 15 plant structures; EXPRESSED DURING: 8 growth stages; CONTAINS InterPro DOMAIN/s: Conserved hypothetical protein CHP00730 (InterPro:IPR005269); BEST Arabidopsis thaliana protein match is: carboxy-lyase (TAIR:AT5G06300.1); Has 3002 Blast hits to 3001 proteins in 744 species: Archae - 8; Bacteria - 1757; Metazoa - 10; Fungi - 79; Plants - 185; Viruses - 0; Other Eukaryotes - 963 (source: NCBI BLink).
MGLISQAVHDGGRHVLGIIPKSLAPREITGESIGEVITVSTMHQRKAEMGRQADAFIALP 
GGYGTFEELLEVITWSQLGIHTKPVGLLNVDGFYDSLLTFIDKAVDEGFVSSTARRIIVS 
APNAPQLLQLLEEYVPKHDDFVSKMVWDNTTDAFTLEGDSF*
>AT2G35990.3 |  FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; EXPRESSED IN: 15 plant structures; EXPRESSED DURING: 8 growth stages; CONTAINS InterPro DOMAIN/s: Conserved hypothetical protein CHP00730 (InterPro:IPR005269); BEST Arabidopsis thaliana protein match is: carboxy-lyase (TAIR:AT5G06300.1); Has 2269 Blast hits to 2268 proteins in 637 species: Archae - 6; Bacteria - 1228; Metazoa - 8; Fungi - 85; Plants - 183; Viruses - 0; Other Eukaryotes - 759 (source: NCBI BLink).
MGLISQAVHDGGRHVLGIIPKSLAPREITGESIGEVITVSTMHQRKAEMGRQADAFIALP 
GGYGTFEELLEVITWSQLGIHTKPVGLLNVDGFYDSLLTFIDKAVDEGFVSSTARRIIVS 
APNAPQLLQLLEEYVPKHDDFVSKMVWDNTTDAFTLEGDSF*
>AT2G35990.3 |  FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; EXPRESSED IN: 15 plant structures; EXPRESSED DURING: 8 growth stages; CONTAINS InterPro DOMAIN/s: Conserved hypothetical protein CHP00730 (InterPro:IPR005269); BEST Arabidopsis thaliana protein match is: carboxy-lyase (TAIR:AT5G06300.1).
MGLISQAVHDGGRHVLGIIPKSLAPREITGESIGEVITVSTMHQRKAEMGRQADAFIALP 
GGYGTFEELLEVITWSQLGIHTKPVGLLNVDGFYDSLLTFIDKAVDEGFVSSTARRIIVS 
APNAPQLLQLLEEYVPKHDDFVSKMVWDNTTDAFTLEGDSF*
>AT5G11950.1 |  protein homodimerization 
MEDNQRSRFRKICVFCGSHSGHREVFSDAAIELGNELVKRKIDLVYGGGSVGLMGLISRR 
VYEGGLHVLGIIPKALMPIEISGETVGDVRVVADMHERKAAMAQEAEAFIALPGGYGTME 
ELLEMITWSQLGIHKKTVGLLNVDGYYNNLLALFDTGVEEGFIKPGARNIVVSAPTAKEL 
MEKMEEYTPSHMHVASHESWKVEELGDYPGQENKPQ*
>AT5G11950.2 |  protein homodimerization 
MEDNQRSRFRKICVFCGSHSGHREVFSDAAIELGNELVKRKIDLVYGGGSVGLMGLISRR 
VYEGGLHVLGIIPKALMPIEISGETVGDVRVVADMHERKAAMAQEAEAFIALPGGYGTME 
ELLEMITWSQLGIHKKTVGLLNVDGYYNNLLALFDTGVEEGFIKPGARNIVVSAPTAKEL 
MEKMEEYTPSHMHVASHESWKVEELGDYPGQENKPQ*
>AT5G06300.1 |  carboxy-lyase 
MEETKSRFKRICVFCGSSSGKKPSYQEAAIQLGNELVERRIDLVYGGGSVGLMGLVSQAV 
HHGGRHVLGVIPKTLMPREITGETIGEVKAVADMHQRKAEMARQADAFIALPGGYGTLEE 
LLEVITWAQLGIHRKPVGLLNVDGYYNSLLTFIDKAVDEGFISPMARRIIVSAPNAKELV 
RQLEEYEPEFDEITSKLVWDEVDRISYVPGSEVATAT*
>AT3G53450.1 |  unknown protein 
MEVNNETMQKSKFGRICVFCGSSQGKKSSYQDAAVDLGNELVLRNIDLVYGGGSIGLMGL 
VSQAVHDGGRHVIGVIPKTLMPRELTGETVGEVRAVADMHQRKAEMARHSDAFIALPGGY 
GTLEELLEVITWAQLGIHDKPVGLLNVDGYYNSLLSFIDKAVEEGFISTNARQIIISAPT 
AKELVKKLEEYSPCHESVATKLCWEIERIDYSSED*
>AT4G35190.1 |  unknown protein 
MEIVKSRFKRVCVFCGSSSGKRECYSDAATDLAQELVTRRLNLVYGGGSIGLMGLVSQAV 
HEAGGHVLGIIPRTLMDKEITGETYGEVIAVADMHERKAEMARHSDCFIALPGGYGTLEE 
LLEVIAWAQLGIHDKPVGLLNVDGYYNYLLTFIDKAVDDGFIKPSQRHIFVSAPNAKELV 
QKLEAYKPVNDGVIAKSRWEVEKKVQQPQQQQQVVFCSNTSMQTEIAL*
>AT5G03270.1 |  unknown protein 
MENEEGKREMTKKQSSRFKSICVFCGSSNGNKASYQDAAIDLAKELVMRKIDLVYGGGSI 
GLMGLVSQAVHDGGRHNNNNNGNDDALFCHSVNVSQTNSKLTGETVGEVKEVADMHQRKA 
VMAKHSDAFITLPGGYGTLEELLEVITWAQLGIHDKPVGLLNVDGYYDALLLFIDKAVEE 
GFILPTARHIIVSAPTARELFIKLELNMVSLDRISKHALSLFQVMSKAG*