>AT2G37210.1 | Encodes a protein of unknown function. It has been crystallized and shown to be structurally almost identical to the protein encoded by At5g11950.
MEIKGESMQKSKFRRICVFCGSSQGKKSSYQDAAVDLGNELVSRNIDLVYGGGSIGLMGL
VSQAVHDGGRHVIGIIPKTLMPRELTGETVGEVRAVADMHQRKAEMAKHSDAFIALPGGY
GTLEELLEVITWAQLGIHDKPVGLLNVDGYYNSLLSFIDKAVEEGFISPTAREIIVSAPT
AKELVKKLEEYAPCHERVATKLCWEMERIGYSSEE*
>AT1G37130.1 | NIA2 (NITRATE REDUCTASE 2) nitrate reductase (NADH)/ nitrate reductase
MAASVDNRQYARLEPGLNGVVRSYKPPVPGRSDSPKAHQNQTTNQTVFLKPAKVHDDDED
VSSEDENETHNSNAVYYKEMIRKSNAELEPSVLDPRDEYTADSWIERNPSMVRLTGKHPF
NSEAPLNRLMHHGFITPVPLHYVRNHGHVPKAQWAEWTVEVTGFVKRPMKFTMDQLVSEF
AYREFAATLVCAGNRRKEQNMVKKSKGFNWGSAGVSTSVWRGVPLCDVLRRCGIFSRKGG
ALNVCFEGSEDLPGGAGTAGSKYGTSIKKEYAMDPSRDIILAYMQNGEYLTPDHGFPVRI
IIPGFIGGRMVKWLKRIIVTTKESDNFYHFKDNRVLPSLVDAELADEEGWWYKPEYIINE
LNINSVITTPCHEEILPINAFTTQRPYTLKGYAYSGGGKKVTRVEVTVDGGETWNVCALD
HQEKPNKYGKFWCWCFWSLEVEVLDLLSAKEIAVRAWDETLNTQPEKMIWNLMGMMNNCW
FRVKTNVCKPHKGEIGIVFEHPTLPGNESGGWMAKERHLEKSADAPPSLKKSVSTPFMNT
TAKMYSMSEVKKHNSADSCWIIVHGHIYDCTRFLMDHPGGSDSILINAGTDCTEEFEAIH
SDKAKKMLEDYRIGELITTGYSSDSSSPNNSVHGSSAVFSLLAPIGEATPVRNLALVNPR
AKVPVQLVEKTSISHDVRKFRFALPVEDMVLGLPVGKHIFLCATINDKLCLRAYTPSSTV
DVVGYFELVVKIYFGGVHPRFPNGGLMSQYLDSLPIGSTLEIKGPLGHVEYLGKGSFTVH
GKPKFADKLAMLAGGTGITPVYQIIQAILKDPEDETEMYVIYANRTEEDILLREELDGWA
EQYPDRLKVWYVVESAKEGWAYSTGFISEAIMREHIPDGLDGSALAMACGPPPMIQFAVQ
PNLEKMQYNIKEDFLIF*
>AT5G20990.1 | B73 molybdenum ion binding
MEGQGCCGGGGGKTEMIPTEEALRIVFGVSKRLPPVIVSLYEALGKVLAEDIRAPDPLPP
YPASVKDGYAVVASDGPGEYPVITESRAGNDGLGVTVTPGTVAYVTTGGPIPDGADAVVQ
VEDTKVIGDVSTESKRVKILIQTKKGTDIRRVGCDIEKDATVLTTGERIGASEIGLLATA
GVTMVKVYPMPIVAILSTGDELVEPTAGTLGRGQIRDSNRAMLVAAVMQQQCKVVDLGIV
RDDRKELEKVLDEAVSSGVDIILTSGGVSMGDRDFVKPLLEEKGKVYFSKVLMKPGKPLT
FAEIRAKPTESMLGKTVLAFGLPGNPVSCLVCFNIFVVPTIRQLAGWTSPHPLRVRLRLQ
EPIKSDPIRPEFHRAIIKWKDNDGSGTPGFVAESTGHQMSSRLLSMRSANALLELPATGN
VLSAGSSVSAIIVSDISAFSIDKKASLSEPGSIRKEKKYDEVPGPEYKVAILTVSDTVSA
GAGPDRSGPRAVSVVDSSSEKLGGAKVVATAVVPDEVERIKDILQKWSDVDEMDLILTLG
GTGFTPRDVTPEATKKVIERETPGLLFVMMQESLKITPFAMLSRSAAGIRGSTLIINMPG
NPNAVAECMEALLPALKHALKQIKGDKREKHPKHIPHAEATLPTDTWDQSYKSAYETGEK
KEEAGCSCTH*
>AT2G28305.1 | unknown protein
MEIESKFKRICVFCGSSAGNKVSYKDAAIELGTELVSRNIDLVYGGGSIGLMGLISQAVF
NGGRHVIGVIPKTLMPREITGETVGEVKAVADMHQRKAEMAKHSDAFIALPGGYGTLEEL
LEVITWAQLGIHDKPVGLLNVEGYYNSLLSFIDKAVEEGFISPTARHIIVSAPSAKELVK
KLEDYVPRHEKVASKKSWEMEQIGLSPTCEISR*
>AT2G35990.1 | FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; EXPRESSED IN: 15 plant structures; EXPRESSED DURING: 8 growth stages; CONTAINS InterPro DOMAIN/s: Conserved hypothetical protein CHP00730 (InterPro:IPR005269); BEST Arabidopsis thaliana protein match is: carboxy-lyase (TAIR:AT5G06300.1); Has 3002 Blast hits to 3001 proteins in 744 species: Archae - 8; Bacteria - 1757; Metazoa - 10; Fungi - 79; Plants - 185; Viruses - 0; Other Eukaryotes - 963 (source: NCBI BLink).
MEETKSRFRRICVFCGSSSGNKTTYHDAALQLAHQLVERNIDLVYGGGSVGLMGLISQAV
HDGGRHVLGIIPKSLAPREITGESIGEVITVSTMHQRKAEMGRQADAFIALPGGYGTFEE
LLEVITWSQLGIHTKPVGLLNVDGFYDSLLTFIDKAVDEGFVSSTARRIIVSAPNAPQLL
QLLEEYVPKHDDFVSKMVWDNTTDAFTLEGDSF*
>AT2G35990.1 | FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; EXPRESSED IN: 15 plant structures; EXPRESSED DURING: 8 growth stages; CONTAINS InterPro DOMAIN/s: Conserved hypothetical protein CHP00730 (InterPro:IPR005269); BEST Arabidopsis thaliana protein match is: carboxy-lyase (TAIR:AT5G06300.1); Has 2269 Blast hits to 2268 proteins in 637 species: Archae - 6; Bacteria - 1228; Metazoa - 8; Fungi - 85; Plants - 183; Viruses - 0; Other Eukaryotes - 759 (source: NCBI BLink).
MEETKSRFRRICVFCGSSSGNKTTYHDAALQLAHQLVERNIDLVYGGGSVGLMGLISQAV
HDGGRHVLGIIPKSLAPREITGESIGEVITVSTMHQRKAEMGRQADAFIALPGGYGTFEE
LLEVITWSQLGIHTKPVGLLNVDGFYDSLLTFIDKAVDEGFVSSTARRIIVSAPNAPQLL
QLLEEYVPKHDDFVSKMVWDNTTDAFTLEGDSF*
>AT2G35990.2 | FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; EXPRESSED IN: 15 plant structures; EXPRESSED DURING: 8 growth stages; CONTAINS InterPro DOMAIN/s: Conserved hypothetical protein CHP00730 (InterPro:IPR005269); BEST Arabidopsis thaliana protein match is: carboxy-lyase (TAIR:AT5G06300.1); Has 3002 Blast hits to 3001 proteins in 744 species: Archae - 8; Bacteria - 1757; Metazoa - 10; Fungi - 79; Plants - 185; Viruses - 0; Other Eukaryotes - 963 (source: NCBI BLink).
MGLISQAVHDGGRHVLGIIPKSLAPREITGESIGEVITVSTMHQRKAEMGRQADAFIALP
GGYGTFEELLEVITWSQLGIHTKPVGLLNVDGFYDSLLTFIDKAVDEGFVSSTARRIIVS
APNAPQLLQLLEEYVPKHDDFVSKMVWDNTTDAFTLEGDSF*
>AT2G35990.2 | FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; EXPRESSED IN: 15 plant structures; EXPRESSED DURING: 8 growth stages; CONTAINS InterPro DOMAIN/s: Conserved hypothetical protein CHP00730 (InterPro:IPR005269); BEST Arabidopsis thaliana protein match is: carboxy-lyase (TAIR:AT5G06300.1); Has 2269 Blast hits to 2268 proteins in 637 species: Archae - 6; Bacteria - 1228; Metazoa - 8; Fungi - 85; Plants - 183; Viruses - 0; Other Eukaryotes - 759 (source: NCBI BLink).
MGLISQAVHDGGRHVLGIIPKSLAPREITGESIGEVITVSTMHQRKAEMGRQADAFIALP
GGYGTFEELLEVITWSQLGIHTKPVGLLNVDGFYDSLLTFIDKAVDEGFVSSTARRIIVS
APNAPQLLQLLEEYVPKHDDFVSKMVWDNTTDAFTLEGDSF*
>AT2G35990.3 | FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; EXPRESSED IN: 15 plant structures; EXPRESSED DURING: 8 growth stages; CONTAINS InterPro DOMAIN/s: Conserved hypothetical protein CHP00730 (InterPro:IPR005269); BEST Arabidopsis thaliana protein match is: carboxy-lyase (TAIR:AT5G06300.1); Has 3002 Blast hits to 3001 proteins in 744 species: Archae - 8; Bacteria - 1757; Metazoa - 10; Fungi - 79; Plants - 185; Viruses - 0; Other Eukaryotes - 963 (source: NCBI BLink).
MGLISQAVHDGGRHVLGIIPKSLAPREITGESIGEVITVSTMHQRKAEMGRQADAFIALP
GGYGTFEELLEVITWSQLGIHTKPVGLLNVDGFYDSLLTFIDKAVDEGFVSSTARRIIVS
APNAPQLLQLLEEYVPKHDDFVSKMVWDNTTDAFTLEGDSF*
>AT2G35990.3 | FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; EXPRESSED IN: 15 plant structures; EXPRESSED DURING: 8 growth stages; CONTAINS InterPro DOMAIN/s: Conserved hypothetical protein CHP00730 (InterPro:IPR005269); BEST Arabidopsis thaliana protein match is: carboxy-lyase (TAIR:AT5G06300.1); Has 2269 Blast hits to 2268 proteins in 637 species: Archae - 6; Bacteria - 1228; Metazoa - 8; Fungi - 85; Plants - 183; Viruses - 0; Other Eukaryotes - 759 (source: NCBI BLink).
MGLISQAVHDGGRHVLGIIPKSLAPREITGESIGEVITVSTMHQRKAEMGRQADAFIALP
GGYGTFEELLEVITWSQLGIHTKPVGLLNVDGFYDSLLTFIDKAVDEGFVSSTARRIIVS
APNAPQLLQLLEEYVPKHDDFVSKMVWDNTTDAFTLEGDSF*
>AT5G11950.1 | protein homodimerization
MEDNQRSRFRKICVFCGSHSGHREVFSDAAIELGNELVKRKIDLVYGGGSVGLMGLISRR
VYEGGLHVLGIIPKALMPIEISGETVGDVRVVADMHERKAAMAQEAEAFIALPGGYGTME
ELLEMITWSQLGIHKKTVGLLNVDGYYNNLLALFDTGVEEGFIKPGARNIVVSAPTAKEL
MEKMEEYTPSHMHVASHESWKVEELGDYPGQENKPQ*
>AT5G11950.2 | protein homodimerization
MEDNQRSRFRKICVFCGSHSGHREVFSDAAIELGNELVKRKIDLVYGGGSVGLMGLISRR
VYEGGLHVLGIIPKALMPIEISGETVGDVRVVADMHERKAAMAQEAEAFIALPGGYGTME
ELLEMITWSQLGIHKKTVGLLNVDGYYNNLLALFDTGVEEGFIKPGARNIVVSAPTAKEL
MEKMEEYTPSHMHVASHESWKVEELGDYPGQENKPQ*
>AT5G06300.1 | carboxy-lyase
MEETKSRFKRICVFCGSSSGKKPSYQEAAIQLGNELVERRIDLVYGGGSVGLMGLVSQAV
HHGGRHVLGVIPKTLMPREITGETIGEVKAVADMHQRKAEMARQADAFIALPGGYGTLEE
LLEVITWAQLGIHRKPVGLLNVDGYYNSLLTFIDKAVDEGFISPMARRIIVSAPNAKELV
RQLEEYEPEFDEITSKLVWDEVDRISYVPGSEVATAT*
>AT3G53450.1 | unknown protein
MEVNNETMQKSKFGRICVFCGSSQGKKSSYQDAAVDLGNELVLRNIDLVYGGGSIGLMGL
VSQAVHDGGRHVIGVIPKTLMPRELTGETVGEVRAVADMHQRKAEMARHSDAFIALPGGY
GTLEELLEVITWAQLGIHDKPVGLLNVDGYYNSLLSFIDKAVEEGFISTNARQIIISAPT
AKELVKKLEEYSPCHESVATKLCWEIERIDYSSED*
>AT4G35190.1 | unknown protein
MEIVKSRFKRVCVFCGSSSGKRECYSDAATDLAQELVTRRLNLVYGGGSIGLMGLVSQAV
HEAGGHVLGIIPRTLMDKEITGETYGEVIAVADMHERKAEMARHSDCFIALPGGYGTLEE
LLEVIAWAQLGIHDKPVGLLNVDGYYNYLLTFIDKAVDDGFIKPSQRHIFVSAPNAKELV
QKLEAYKPVNDGVIAKSRWEVEKKVQQPQQQQQVVFCSNTSMQTEIAL*
>AT5G03270.1 | unknown protein
MENEEGKREMTKKQSSRFKSICVFCGSSNGNKASYQDAAIDLAKELVMRKIDLVYGGGSI
GLMGLVSQAVHDGGRHNNNNNGNDDALFCHSVNVSQTNSKLTGETVGEVKEVADMHQRKA
VMAKHSDAFITLPGGYGTLEELLEVITWAQLGIHDKPVGLLNVDGYYDALLLFIDKAVEE
GFILPTARHIIVSAPTARELFIKLELNMVSLDRISKHALSLFQVMSKAG*