CN108611344A - The preparation and application of AtAGM2 and AtAGM3 encoding genes and enzyme - Google Patents
The preparation and application of AtAGM2 and AtAGM3 encoding genes and enzyme Download PDFInfo
- Publication number
- CN108611344A CN108611344A CN201611133399.6A CN201611133399A CN108611344A CN 108611344 A CN108611344 A CN 108611344A CN 201611133399 A CN201611133399 A CN 201611133399A CN 108611344 A CN108611344 A CN 108611344A
- Authority
- CN
- China
- Prior art keywords
- sequence
- acetylglucosamine
- seq
- phosphate
- leu
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 108090000623 proteins and genes Proteins 0.000 title claims abstract description 44
- 102000004190 Enzymes Human genes 0.000 title claims abstract description 41
- 108090000790 Enzymes Proteins 0.000 title claims abstract description 41
- 238000002360 preparation method Methods 0.000 title claims description 6
- 230000000694 effects Effects 0.000 claims abstract description 39
- 108010074307 Phosphoacetylglucosamine mutase Proteins 0.000 claims abstract description 13
- 102100024440 Phosphoacetylglucosamine mutase Human genes 0.000 claims abstract description 12
- 238000004519 manufacturing process Methods 0.000 claims abstract description 11
- 238000000034 method Methods 0.000 claims abstract description 8
- 241000588724 Escherichia coli Species 0.000 claims abstract description 7
- XCCTYIAWTASOJW-UHFFFAOYSA-N UDP-Glc Natural products OC1C(O)C(COP(O)(=O)OP(O)(O)=O)OC1N1C(=O)NC(=O)C=C1 XCCTYIAWTASOJW-UHFFFAOYSA-N 0.000 claims abstract description 5
- 108020004414 DNA Proteins 0.000 claims description 50
- 102000053602 DNA Human genes 0.000 claims description 50
- JHVGZSJOQBRMBQ-DBKUKYHUSA-N P(O)(O)(O)=O.C(C)(=O)C1(O)[C@H](N)[C@@H](O)[C@H](O)[C@H](O1)CO Chemical compound P(O)(O)(O)=O.C(C)(=O)C1(O)[C@H](N)[C@@H](O)[C@H](O)[C@H](O1)CO JHVGZSJOQBRMBQ-DBKUKYHUSA-N 0.000 claims description 48
- 239000002773 nucleotide Substances 0.000 claims description 34
- 125000003729 nucleotide group Chemical group 0.000 claims description 32
- 125000003275 alpha amino acid group Chemical group 0.000 claims description 24
- 239000013604 expression vector Substances 0.000 claims description 23
- 210000004027 cell Anatomy 0.000 claims description 21
- -1 hexose phosphates Chemical class 0.000 claims description 20
- 229910019142 PO4 Inorganic materials 0.000 claims description 13
- 235000021317 phosphate Nutrition 0.000 claims description 13
- BRGMHAYQAZFZDJ-UHFFFAOYSA-N (5-acetamido-3,4,6-trihydroxyoxan-2-yl)methyl dihydrogen phosphate Chemical compound CC(=O)NC1C(O)OC(COP(O)(O)=O)C(O)C1O BRGMHAYQAZFZDJ-UHFFFAOYSA-N 0.000 claims description 12
- JVTAAEKCZFNVCJ-UHFFFAOYSA-N lactic acid Chemical compound CC(O)C(O)=O JVTAAEKCZFNVCJ-UHFFFAOYSA-N 0.000 claims description 12
- 241000894006 Bacteria Species 0.000 claims description 10
- 239000010452 phosphate Substances 0.000 claims description 10
- 244000063299 Bacillus subtilis Species 0.000 claims description 8
- 235000014469 Bacillus subtilis Nutrition 0.000 claims description 8
- LFTYTUAZOPRMMI-CFRASDGPSA-N UDP-N-acetyl-alpha-D-glucosamine Chemical compound O1[C@H](CO)[C@@H](O)[C@H](O)[C@@H](NC(=O)C)[C@H]1OP(O)(=O)OP(O)(=O)OC[C@@H]1[C@@H](O)[C@@H](O)[C@H](N2C(NC(=O)C=C2)=O)O1 LFTYTUAZOPRMMI-CFRASDGPSA-N 0.000 claims description 8
- LFTYTUAZOPRMMI-UHFFFAOYSA-N UNPD164450 Natural products O1C(CO)C(O)C(O)C(NC(=O)C)C1OP(O)(=O)OP(O)(=O)OCC1C(O)C(O)C(N2C(NC(=O)C=C2)=O)O1 LFTYTUAZOPRMMI-UHFFFAOYSA-N 0.000 claims description 8
- 238000007792 addition Methods 0.000 claims description 8
- 238000012217 deletion Methods 0.000 claims description 8
- 230000037430 deletion Effects 0.000 claims description 8
- 238000003259 recombinant expression Methods 0.000 claims description 8
- 238000006467 substitution reaction Methods 0.000 claims description 8
- 240000004808 Saccharomyces cerevisiae Species 0.000 claims description 7
- 235000014680 Saccharomyces cerevisiae Nutrition 0.000 claims description 7
- 125000000539 amino acid group Chemical group 0.000 claims description 7
- 239000004310 lactic acid Substances 0.000 claims description 6
- 235000014655 lactic acid Nutrition 0.000 claims description 6
- 125000002924 primary amino group Chemical group [H]N([H])* 0.000 claims description 6
- 235000000346 sugar Nutrition 0.000 claims description 6
- VFRROHXSMXFLSN-UHFFFAOYSA-N Glc6P Natural products OP(=O)(O)OCC(O)C(O)C(O)C(O)C=O VFRROHXSMXFLSN-UHFFFAOYSA-N 0.000 claims description 5
- 241000238631 Hexapoda Species 0.000 claims description 4
- 241000187747 Streptomyces Species 0.000 claims description 4
- 210000004962 mammalian cell Anatomy 0.000 claims description 4
- 239000013598 vector Substances 0.000 claims description 3
- 241000351920 Aspergillus nidulans Species 0.000 claims description 2
- 241000228245 Aspergillus niger Species 0.000 claims description 2
- 241000255789 Bombyx mori Species 0.000 claims description 2
- 241000699800 Cricetinae Species 0.000 claims description 2
- 241000699802 Cricetulus griseus Species 0.000 claims description 2
- 241000672609 Escherichia coli BL21 Species 0.000 claims description 2
- 241000620209 Escherichia coli DH5[alpha] Species 0.000 claims description 2
- 241000233866 Fungi Species 0.000 claims description 2
- 241001138401 Kluyveromyces lactis Species 0.000 claims description 2
- 241000235058 Komagataella pastoris Species 0.000 claims description 2
- 241000223261 Trichoderma viride Species 0.000 claims description 2
- CYKLRRKFBPBYEI-NQQHDEILSA-N UDP-alpha-D-glucosamine Chemical compound O1[C@H](CO)[C@@H](O)[C@H](O)[C@@H](N)[C@H]1OP(O)(=O)OP(O)(=O)OC[C@@H]1[C@@H](O)[C@@H](O)[C@H](N2C(NC(=O)C=C2)=O)O1 CYKLRRKFBPBYEI-NQQHDEILSA-N 0.000 claims description 2
- HSCJRCZFDFQWRP-JZMIEXBBSA-N UDP-alpha-D-glucose Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CO)O[C@@H]1OP(O)(=O)OP(O)(=O)OC[C@@H]1[C@@H](O)[C@@H](O)[C@H](N2C(NC(=O)C=C2)=O)O1 HSCJRCZFDFQWRP-JZMIEXBBSA-N 0.000 claims description 2
- HSCJRCZFDFQWRP-UHFFFAOYSA-N Uridindiphosphoglukose Natural products OC1C(O)C(O)C(CO)OC1OP(O)(=O)OP(O)(=O)OCC1C(O)C(O)C(N2C(NC(=O)C=C2)=O)O1 HSCJRCZFDFQWRP-UHFFFAOYSA-N 0.000 claims description 2
- 210000004978 chinese hamster ovary cell Anatomy 0.000 claims description 2
- 230000002538 fungal effect Effects 0.000 claims description 2
- 210000003292 kidney cell Anatomy 0.000 claims description 2
- 210000005265 lung cell Anatomy 0.000 claims description 2
- 230000009466 transformation Effects 0.000 claims description 2
- 230000009261 transgenic effect Effects 0.000 claims description 2
- NBSCHQHZLSJFNQ-GASJEMHNSA-N D-Glucose 6-phosphate Chemical compound OC1O[C@H](COP(O)(O)=O)[C@@H](O)[C@H](O)[C@H]1O NBSCHQHZLSJFNQ-GASJEMHNSA-N 0.000 claims 2
- YMJBYRVFGYXULK-QZABAPFNSA-N alpha-D-glucosamine 1-phosphate Chemical compound N[C@@H]1[C@@H](O)[C@H](O)[C@@H](CO)O[C@@H]1OP(O)(O)=O YMJBYRVFGYXULK-QZABAPFNSA-N 0.000 claims 2
- HXXFSFRBOHSIMQ-VFUOTHLCSA-N alpha-D-glucose 1-phosphate Chemical compound OC[C@H]1O[C@H](OP(O)(O)=O)[C@H](O)[C@@H](O)[C@@H]1O HXXFSFRBOHSIMQ-VFUOTHLCSA-N 0.000 claims 2
- XHMJOUIAFHJHBW-VFUOTHLCSA-N glucosamine 6-phosphate Chemical compound N[C@H]1[C@H](O)O[C@H](COP(O)(O)=O)[C@H](O)[C@@H]1O XHMJOUIAFHJHBW-VFUOTHLCSA-N 0.000 claims 2
- 229950010772 glucose-1-phosphate Drugs 0.000 claims 2
- 241000588722 Escherichia Species 0.000 claims 1
- 241000223259 Trichoderma Species 0.000 claims 1
- UGEVDEPHPZWGPI-KEWYIRBNSA-N [(2R,3S,4R,5R)-6-acetyl-5-amino-3,4,6-trihydroxyoxan-2-yl]methyl dihydrogen phosphate Chemical compound P(=O)(O)(O)OC[C@@H]1[C@H]([C@@H]([C@H](C(O)(O1)C(C)=O)N)O)O UGEVDEPHPZWGPI-KEWYIRBNSA-N 0.000 claims 1
- VQRCPOPQRYDVRE-KEWYIRBNSA-N [(3R,4R,5S,6R)-2-acetyl-3-amino-4,5-dihydroxy-6-(hydroxymethyl)oxan-2-yl] dihydrogen phosphate Chemical compound P(=O)(O)(O)OC1([C@H](N)[C@@H](O)[C@H](O)[C@H](O1)CO)C(C)=O VQRCPOPQRYDVRE-KEWYIRBNSA-N 0.000 claims 1
- 230000003505 mutagenic effect Effects 0.000 claims 1
- 231100000243 mutagenic effect Toxicity 0.000 claims 1
- 241001446247 uncultured actinomycete Species 0.000 claims 1
- 241000219195 Arabidopsis thaliana Species 0.000 abstract description 18
- 230000014509 gene expression Effects 0.000 abstract description 9
- 238000006555 catalytic reaction Methods 0.000 abstract description 8
- 241000219194 Arabidopsis Species 0.000 abstract description 7
- 230000015572 biosynthetic process Effects 0.000 abstract description 7
- 238000003786 synthesis reaction Methods 0.000 abstract description 6
- OVRNDRQMDRJTHS-FMDGEEDCSA-N N-acetyl-beta-D-glucosamine Chemical compound CC(=O)N[C@H]1[C@H](O)O[C@H](CO)[C@@H](O)[C@@H]1O OVRNDRQMDRJTHS-FMDGEEDCSA-N 0.000 abstract description 4
- DUKURNFHYQXCJG-UHFFFAOYSA-N Lewis A pentasaccharide Natural products OC1C(O)C(O)C(C)OC1OC1C(OC2C(C(O)C(O)C(CO)O2)O)C(NC(C)=O)C(OC2C(C(OC3C(OC(O)C(O)C3O)CO)OC(CO)C2O)O)OC1CO DUKURNFHYQXCJG-UHFFFAOYSA-N 0.000 abstract description 3
- OVRNDRQMDRJTHS-UHFFFAOYSA-N N-acelyl-D-glucosamine Natural products CC(=O)NC1C(O)OC(CO)C(O)C1O OVRNDRQMDRJTHS-UHFFFAOYSA-N 0.000 abstract description 3
- OVRNDRQMDRJTHS-RTRLPJTCSA-N N-acetyl-D-glucosamine Chemical compound CC(=O)N[C@H]1C(O)O[C@H](CO)[C@@H](O)[C@@H]1O OVRNDRQMDRJTHS-RTRLPJTCSA-N 0.000 abstract description 3
- MBLBDJOUHNCFQT-LXGUWJNJSA-N N-acetylglucosamine Natural products CC(=O)N[C@@H](C=O)[C@@H](O)[C@H](O)[C@H](O)CO MBLBDJOUHNCFQT-LXGUWJNJSA-N 0.000 abstract description 3
- 229950006780 n-acetylglucosamine Drugs 0.000 abstract description 3
- KNEWRWNYNNYMKW-RTRLPJTCSA-N acetyloxy-N-[(3R,4R,5S,6R)-2,4,5-trihydroxy-6-(hydroxymethyl)oxan-3-yl]phosphonamidic acid Chemical compound CC(=O)OP(O)(=O)N[C@H]1C(O)O[C@H](CO)[C@@H](O)[C@@H]1O KNEWRWNYNNYMKW-RTRLPJTCSA-N 0.000 abstract description 2
- 230000001580 bacterial effect Effects 0.000 abstract description 2
- XCCTYIAWTASOJW-XVFCMESISA-N Uridine-5'-Diphosphate Chemical compound O[C@@H]1[C@H](O)[C@@H](COP(O)(=O)OP(O)(O)=O)O[C@H]1N1C(=O)NC(=O)C=C1 XCCTYIAWTASOJW-XVFCMESISA-N 0.000 abstract 2
- 108091000080 Phosphotransferase Proteins 0.000 abstract 1
- 238000012215 gene cloning Methods 0.000 abstract 1
- 102000020233 phosphotransferase Human genes 0.000 abstract 1
- 239000000758 substrate Substances 0.000 description 27
- 238000006243 chemical reaction Methods 0.000 description 26
- 102000004169 proteins and genes Human genes 0.000 description 15
- 235000018102 proteins Nutrition 0.000 description 13
- RAXXELZNTBOGNW-UHFFFAOYSA-N imidazole Natural products C1=CNC=N1 RAXXELZNTBOGNW-UHFFFAOYSA-N 0.000 description 12
- HEMHJVSKTPXQMS-UHFFFAOYSA-M Sodium hydroxide Chemical compound [OH-].[Na+] HEMHJVSKTPXQMS-UHFFFAOYSA-M 0.000 description 11
- 235000001014 amino acid Nutrition 0.000 description 10
- 150000001413 amino acids Chemical class 0.000 description 10
- 108010031311 Intramolecular Transferases Proteins 0.000 description 7
- 102000005385 Intramolecular Transferases Human genes 0.000 description 7
- 230000037361 pathway Effects 0.000 description 7
- 108010050848 glycylleucine Proteins 0.000 description 6
- 238000001514 detection method Methods 0.000 description 5
- 230000002255 enzymatic effect Effects 0.000 description 5
- 239000013612 plasmid Substances 0.000 description 5
- 238000000746 purification Methods 0.000 description 5
- MSWZFWKMSRAUBD-UHFFFAOYSA-N 2-Amino-2-Deoxy-Hexose Chemical compound NC1C(O)OC(CO)C(O)C1O MSWZFWKMSRAUBD-UHFFFAOYSA-N 0.000 description 4
- 241000196324 Embryophyta Species 0.000 description 4
- XHMJOUIAFHJHBW-UKFBFLRUSA-N alpha-D-glucosamine 6-phosphate Chemical compound N[C@H]1[C@@H](O)O[C@H](COP(O)(O)=O)[C@@H](O)[C@@H]1O XHMJOUIAFHJHBW-UKFBFLRUSA-N 0.000 description 4
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 4
- 238000010586 diagram Methods 0.000 description 4
- 239000000499 gel Substances 0.000 description 4
- 229910021645 metal ion Inorganic materials 0.000 description 4
- 108091026890 Coding region Proteins 0.000 description 3
- HXXFSFRBOHSIMQ-GASJEMHNSA-N D-glucopyranose 1-phosphate Chemical compound OC[C@H]1OC(OP(O)(O)=O)[C@H](O)[C@@H](O)[C@@H]1O HXXFSFRBOHSIMQ-GASJEMHNSA-N 0.000 description 3
- HXXFSFRBOHSIMQ-UHFFFAOYSA-N Di-K salt-alpha-D-Pyranose-Galactose 1-dihydrogen phosphate Natural products OCC1OC(OP(O)(O)=O)C(O)C(O)C1O HXXFSFRBOHSIMQ-UHFFFAOYSA-N 0.000 description 3
- WQZGKKKJIJFFOK-GASJEMHNSA-N Glucose Natural products OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-GASJEMHNSA-N 0.000 description 3
- 108010077245 asparaginyl-proline Proteins 0.000 description 3
- 210000000349 chromosome Anatomy 0.000 description 3
- 230000007935 neutral effect Effects 0.000 description 3
- 239000000047 product Substances 0.000 description 3
- 230000002441 reversible effect Effects 0.000 description 3
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 3
- QKNYBSVHEMOAJP-UHFFFAOYSA-N 2-amino-2-(hydroxymethyl)propane-1,3-diol;hydron;chloride Chemical compound Cl.OCC(N)(CO)CO QKNYBSVHEMOAJP-UHFFFAOYSA-N 0.000 description 2
- CCDFBRZVTDDJNM-GUBZILKMSA-N Ala-Leu-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O CCDFBRZVTDDJNM-GUBZILKMSA-N 0.000 description 2
- CPTXATAOUQJQRO-GUBZILKMSA-N Arg-Val-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O CPTXATAOUQJQRO-GUBZILKMSA-N 0.000 description 2
- FTCGGKNCJZOPNB-WHFBIAKZSA-N Asn-Gly-Ser Chemical compound NC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O FTCGGKNCJZOPNB-WHFBIAKZSA-N 0.000 description 2
- SGAUXNZEFIEAAI-GARJFASQSA-N Asn-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CC(=O)N)N)C(=O)O SGAUXNZEFIEAAI-GARJFASQSA-N 0.000 description 2
- GQRDIVQPSMPQME-ZPFDUUQYSA-N Asn-Ile-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O GQRDIVQPSMPQME-ZPFDUUQYSA-N 0.000 description 2
- 108020004705 Codon Proteins 0.000 description 2
- GSXOAOHZAIYLCY-UHFFFAOYSA-N D-F6P Natural products OCC(=O)C(O)C(O)C(O)COP(O)(O)=O GSXOAOHZAIYLCY-UHFFFAOYSA-N 0.000 description 2
- IXKRSKPKSLXIHN-YUMQZZPRSA-N Gly-Cys-Leu Chemical compound [H]NCC(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(O)=O IXKRSKPKSLXIHN-YUMQZZPRSA-N 0.000 description 2
- 108010093488 His-His-His-His-His-His Proteins 0.000 description 2
- HGCNKOLVKRAVHD-UHFFFAOYSA-N L-Met-L-Phe Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 HGCNKOLVKRAVHD-UHFFFAOYSA-N 0.000 description 2
- FADYJNXDPBKVCA-UHFFFAOYSA-N L-Phenylalanyl-L-lysin Natural products NCCCCC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FADYJNXDPBKVCA-UHFFFAOYSA-N 0.000 description 2
- OGCQGUIWMSBHRZ-CIUDSAMLSA-N Leu-Asn-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O OGCQGUIWMSBHRZ-CIUDSAMLSA-N 0.000 description 2
- DLFAACQHIRSQGG-CIUDSAMLSA-N Leu-Asp-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O DLFAACQHIRSQGG-CIUDSAMLSA-N 0.000 description 2
- DZQMXBALGUHGJT-GUBZILKMSA-N Leu-Glu-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O DZQMXBALGUHGJT-GUBZILKMSA-N 0.000 description 2
- OGUUKPXUTHOIAV-SDDRHHMPSA-N Leu-Glu-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N OGUUKPXUTHOIAV-SDDRHHMPSA-N 0.000 description 2
- HVHRPWQEQHIQJF-AVGNSLFASA-N Leu-Lys-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O HVHRPWQEQHIQJF-AVGNSLFASA-N 0.000 description 2
- AMSSKPUHBUQBOQ-SRVKXCTJSA-N Leu-Ser-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N AMSSKPUHBUQBOQ-SRVKXCTJSA-N 0.000 description 2
- GAOJCVKPIGHTGO-UWVGGRQHSA-N Lys-Arg-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O GAOJCVKPIGHTGO-UWVGGRQHSA-N 0.000 description 2
- HVAUKHLDSDDROB-KKUMJFAQSA-N Lys-Lys-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O HVAUKHLDSDDROB-KKUMJFAQSA-N 0.000 description 2
- WXHHTBVYQOSYSL-FXQIFTODSA-N Met-Ala-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O WXHHTBVYQOSYSL-FXQIFTODSA-N 0.000 description 2
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 2
- AJHCSUXXECOXOY-UHFFFAOYSA-N N-glycyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)CN)C(O)=O)=CNC2=C1 AJHCSUXXECOXOY-UHFFFAOYSA-N 0.000 description 2
- PXHVJJICTQNCMI-UHFFFAOYSA-N Nickel Chemical compound [Ni] PXHVJJICTQNCMI-UHFFFAOYSA-N 0.000 description 2
- 238000012300 Sequence Analysis Methods 0.000 description 2
- WTWGOQRNRFHFQD-JBDRJPRFSA-N Ser-Ala-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WTWGOQRNRFHFQD-JBDRJPRFSA-N 0.000 description 2
- GZFAWAQTEYDKII-YUMQZZPRSA-N Ser-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CO GZFAWAQTEYDKII-YUMQZZPRSA-N 0.000 description 2
- KCGIREHVWRXNDH-GARJFASQSA-N Ser-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N KCGIREHVWRXNDH-GARJFASQSA-N 0.000 description 2
- LKUDRJSNRWVGMS-QSFUFRPTSA-N Val-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N LKUDRJSNRWVGMS-QSFUFRPTSA-N 0.000 description 2
- 238000000246 agarose gel electrophoresis Methods 0.000 description 2
- 108010062796 arginyllysine Proteins 0.000 description 2
- 108010069205 aspartyl-phenylalanine Proteins 0.000 description 2
- 108010038633 aspartylglutamate Proteins 0.000 description 2
- 108010047857 aspartylglycine Proteins 0.000 description 2
- 108010092854 aspartyllysine Proteins 0.000 description 2
- 238000003149 assay kit Methods 0.000 description 2
- 238000002869 basic local alignment search tool Methods 0.000 description 2
- BGWGXPAPYGQALX-ARQDHWQXSA-N beta-D-fructofuranose 6-phosphate Chemical compound OC[C@@]1(O)O[C@H](COP(O)(O)=O)[C@@H](O)[C@@H]1O BGWGXPAPYGQALX-ARQDHWQXSA-N 0.000 description 2
- 230000003197 catalytic effect Effects 0.000 description 2
- 238000001962 electrophoresis Methods 0.000 description 2
- 239000003480 eluent Substances 0.000 description 2
- 238000010828 elution Methods 0.000 description 2
- 108010063718 gamma-glutamylaspartic acid Proteins 0.000 description 2
- 238000010353 genetic engineering Methods 0.000 description 2
- 239000008103 glucose Substances 0.000 description 2
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 2
- 108010033719 glycyl-histidyl-glycine Proteins 0.000 description 2
- 108010015792 glycyllysine Proteins 0.000 description 2
- 108010084389 glycyltryptophan Proteins 0.000 description 2
- 150000002402 hexoses Chemical class 0.000 description 2
- 108010028295 histidylhistidine Proteins 0.000 description 2
- 108010025306 histidylleucine Proteins 0.000 description 2
- 238000003780 insertion Methods 0.000 description 2
- 230000037431 insertion Effects 0.000 description 2
- 239000013067 intermediate product Substances 0.000 description 2
- 238000005342 ion exchange Methods 0.000 description 2
- 150000002500 ions Chemical class 0.000 description 2
- 108010068488 methionylphenylalanine Proteins 0.000 description 2
- 238000000926 separation method Methods 0.000 description 2
- 238000012163 sequencing technique Methods 0.000 description 2
- 108010071207 serylmethionine Proteins 0.000 description 2
- 238000002415 sodium dodecyl sulfate polyacrylamide gel electrophoresis Methods 0.000 description 2
- 239000000243 solution Substances 0.000 description 2
- 238000010561 standard procedure Methods 0.000 description 2
- CWFMWBHMIMNZLN-NAKRPEOUSA-N (2s)-1-[(2s)-2-[[(2s,3s)-2-amino-3-methylpentanoyl]amino]propanoyl]pyrrolidine-2-carboxylic acid Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O CWFMWBHMIMNZLN-NAKRPEOUSA-N 0.000 description 1
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 1
- 101150028074 2 gene Proteins 0.000 description 1
- 108010044087 AS-I toxin Proteins 0.000 description 1
- 241000186046 Actinomyces Species 0.000 description 1
- HHGYNJRJIINWAK-FXQIFTODSA-N Ala-Ala-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N HHGYNJRJIINWAK-FXQIFTODSA-N 0.000 description 1
- YWWATNIVMOCSAV-UBHSHLNASA-N Ala-Arg-Phe Chemical compound NC(=N)NCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 YWWATNIVMOCSAV-UBHSHLNASA-N 0.000 description 1
- GWFSQQNGMPGBEF-GHCJXIJMSA-N Ala-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C)N GWFSQQNGMPGBEF-GHCJXIJMSA-N 0.000 description 1
- BTYTYHBSJKQBQA-GCJQMDKQSA-N Ala-Asp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C)N)O BTYTYHBSJKQBQA-GCJQMDKQSA-N 0.000 description 1
- IYCZBJXFSZSHPN-DLOVCJGASA-N Ala-Cys-Phe Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O IYCZBJXFSZSHPN-DLOVCJGASA-N 0.000 description 1
- NHLAEBFGWPXFGI-WHFBIAKZSA-N Ala-Gly-Asn Chemical compound C[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)N)C(=O)O)N NHLAEBFGWPXFGI-WHFBIAKZSA-N 0.000 description 1
- MPLOSMWGDNJSEV-WHFBIAKZSA-N Ala-Gly-Asp Chemical compound [H]N[C@@H](C)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O MPLOSMWGDNJSEV-WHFBIAKZSA-N 0.000 description 1
- WMYJZJRILUVVRG-WDSKDSINSA-N Ala-Gly-Gln Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O WMYJZJRILUVVRG-WDSKDSINSA-N 0.000 description 1
- NJWJSLCQEDMGNC-MBLNEYKQSA-N Ala-His-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](C)N)O NJWJSLCQEDMGNC-MBLNEYKQSA-N 0.000 description 1
- NYDBKUNVSALYPX-NAKRPEOUSA-N Ala-Ile-Arg Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CCCN=C(N)N NYDBKUNVSALYPX-NAKRPEOUSA-N 0.000 description 1
- CKLDHDOIYBVUNP-KBIXCLLPSA-N Ala-Ile-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O CKLDHDOIYBVUNP-KBIXCLLPSA-N 0.000 description 1
- LNNSWWRRYJLGNI-NAKRPEOUSA-N Ala-Ile-Val Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O LNNSWWRRYJLGNI-NAKRPEOUSA-N 0.000 description 1
- HHRAXZAYZFFRAM-CIUDSAMLSA-N Ala-Leu-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O HHRAXZAYZFFRAM-CIUDSAMLSA-N 0.000 description 1
- MDNAVFBZPROEHO-DCAQKATOSA-N Ala-Lys-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O MDNAVFBZPROEHO-DCAQKATOSA-N 0.000 description 1
- MDNAVFBZPROEHO-UHFFFAOYSA-N Ala-Lys-Val Natural products CC(C)C(C(O)=O)NC(=O)C(NC(=O)C(C)N)CCCCN MDNAVFBZPROEHO-UHFFFAOYSA-N 0.000 description 1
- DHBKYZYFEXXUAK-ONGXEEELSA-N Ala-Phe-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 DHBKYZYFEXXUAK-ONGXEEELSA-N 0.000 description 1
- NHWYNIZWLJYZAG-XVYDVKMFSA-N Ala-Ser-His Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N NHWYNIZWLJYZAG-XVYDVKMFSA-N 0.000 description 1
- NZGRHTKZFSVPAN-BIIVOSGPSA-N Ala-Ser-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N NZGRHTKZFSVPAN-BIIVOSGPSA-N 0.000 description 1
- WQKAQKZRDIZYNV-VZFHVOOUSA-N Ala-Ser-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WQKAQKZRDIZYNV-VZFHVOOUSA-N 0.000 description 1
- KUFVXLQLDHJVOG-SHGPDSBTSA-N Ala-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C)N)O KUFVXLQLDHJVOG-SHGPDSBTSA-N 0.000 description 1
- XCIGOVDXZULBBV-DCAQKATOSA-N Ala-Val-Lys Chemical compound CC(C)[C@H](NC(=O)[C@H](C)N)C(=O)N[C@@H](CCCCN)C(O)=O XCIGOVDXZULBBV-DCAQKATOSA-N 0.000 description 1
- OMSKGWFGWCQFBD-KZVJFYERSA-N Ala-Val-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OMSKGWFGWCQFBD-KZVJFYERSA-N 0.000 description 1
- KWKQGHSSNHPGOW-BQBZGAKWSA-N Arg-Ala-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(=O)NCC(O)=O KWKQGHSSNHPGOW-BQBZGAKWSA-N 0.000 description 1
- NONSEUUPKITYQT-BQBZGAKWSA-N Arg-Asn-Gly Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)NCC(=O)O)N)CN=C(N)N NONSEUUPKITYQT-BQBZGAKWSA-N 0.000 description 1
- RWCLSUOSKWTXLA-FXQIFTODSA-N Arg-Asp-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O RWCLSUOSKWTXLA-FXQIFTODSA-N 0.000 description 1
- DXQIQUIQYAGRCC-CIUDSAMLSA-N Arg-Asp-Gln Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)CN=C(N)N DXQIQUIQYAGRCC-CIUDSAMLSA-N 0.000 description 1
- SNBHMYQRNCJSOJ-CIUDSAMLSA-N Arg-Gln-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O SNBHMYQRNCJSOJ-CIUDSAMLSA-N 0.000 description 1
- BEXGZLUHRXTZCC-CIUDSAMLSA-N Arg-Gln-Ser Chemical compound C(C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N)CN=C(N)N BEXGZLUHRXTZCC-CIUDSAMLSA-N 0.000 description 1
- OGUPCHKBOKJFMA-SRVKXCTJSA-N Arg-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N OGUPCHKBOKJFMA-SRVKXCTJSA-N 0.000 description 1
- UFBURHXMKFQVLM-CIUDSAMLSA-N Arg-Glu-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O UFBURHXMKFQVLM-CIUDSAMLSA-N 0.000 description 1
- NKNILFJYKKHBKE-WPRPVWTQSA-N Arg-Gly-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O NKNILFJYKKHBKE-WPRPVWTQSA-N 0.000 description 1
- OFIYLHVAAJYRBC-HJWJTTGWSA-N Arg-Ile-Phe Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](N)CCCNC(N)=N)C(=O)N[C@@H](Cc1ccccc1)C(O)=O OFIYLHVAAJYRBC-HJWJTTGWSA-N 0.000 description 1
- GMFAGHNRXPSSJS-SRVKXCTJSA-N Arg-Leu-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O GMFAGHNRXPSSJS-SRVKXCTJSA-N 0.000 description 1
- UZGFHWIJWPUPOH-IHRRRGAJSA-N Arg-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N UZGFHWIJWPUPOH-IHRRRGAJSA-N 0.000 description 1
- YVTHEZNOKSAWRW-DCAQKATOSA-N Arg-Lys-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O YVTHEZNOKSAWRW-DCAQKATOSA-N 0.000 description 1
- OMKZPCPZEFMBIT-SRVKXCTJSA-N Arg-Met-Arg Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O OMKZPCPZEFMBIT-SRVKXCTJSA-N 0.000 description 1
- NIELFHOLFTUZME-HJWJTTGWSA-N Arg-Phe-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NIELFHOLFTUZME-HJWJTTGWSA-N 0.000 description 1
- DNLQVHBBMPZUGJ-BQBZGAKWSA-N Arg-Ser-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O DNLQVHBBMPZUGJ-BQBZGAKWSA-N 0.000 description 1
- FRBAHXABMQXSJQ-FXQIFTODSA-N Arg-Ser-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O FRBAHXABMQXSJQ-FXQIFTODSA-N 0.000 description 1
- AIFHRTPABBBHKU-RCWTZXSCSA-N Arg-Thr-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O AIFHRTPABBBHKU-RCWTZXSCSA-N 0.000 description 1
- INOIAEUXVVNJKA-XGEHTFHBSA-N Arg-Thr-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O INOIAEUXVVNJKA-XGEHTFHBSA-N 0.000 description 1
- IARGXWMWRFOQPG-GCJQMDKQSA-N Asn-Ala-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IARGXWMWRFOQPG-GCJQMDKQSA-N 0.000 description 1
- MFFOYNGMOYFPBD-DCAQKATOSA-N Asn-Arg-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O MFFOYNGMOYFPBD-DCAQKATOSA-N 0.000 description 1
- DXVMJJNAOVECBA-WHFBIAKZSA-N Asn-Gly-Asn Chemical compound NC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O DXVMJJNAOVECBA-WHFBIAKZSA-N 0.000 description 1
- HYQYLOSCICEYTR-YUMQZZPRSA-N Asn-Gly-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O HYQYLOSCICEYTR-YUMQZZPRSA-N 0.000 description 1
- YGHCVNQOZZMHRZ-DJFWLOJKSA-N Asn-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC(=O)N)N YGHCVNQOZZMHRZ-DJFWLOJKSA-N 0.000 description 1
- JGIAYNNXZKKKOW-KKUMJFAQSA-N Asn-His-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CC(=O)N)N JGIAYNNXZKKKOW-KKUMJFAQSA-N 0.000 description 1
- JRCASHGTXZYSPW-XIRDDKMYSA-N Asn-His-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC3=CN=CN3)NC(=O)[C@H](CC(=O)N)N JRCASHGTXZYSPW-XIRDDKMYSA-N 0.000 description 1
- NLRJGXZWTKXRHP-DCAQKATOSA-N Asn-Leu-Arg Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NLRJGXZWTKXRHP-DCAQKATOSA-N 0.000 description 1
- FHETWELNCBMRMG-HJGDQZAQSA-N Asn-Leu-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FHETWELNCBMRMG-HJGDQZAQSA-N 0.000 description 1
- YRTOMUMWSTUQAX-FXQIFTODSA-N Asn-Pro-Asp Chemical compound NC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O YRTOMUMWSTUQAX-FXQIFTODSA-N 0.000 description 1
- YUOXLJYVSZYPBJ-CIUDSAMLSA-N Asn-Pro-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O YUOXLJYVSZYPBJ-CIUDSAMLSA-N 0.000 description 1
- NPZJLGMWMDNQDD-GHCJXIJMSA-N Asn-Ser-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NPZJLGMWMDNQDD-GHCJXIJMSA-N 0.000 description 1
- MKJBPDLENBUHQU-CIUDSAMLSA-N Asn-Ser-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O MKJBPDLENBUHQU-CIUDSAMLSA-N 0.000 description 1
- BEHQTVDBCLSCBY-CFMVVWHZSA-N Asn-Tyr-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BEHQTVDBCLSCBY-CFMVVWHZSA-N 0.000 description 1
- XOQYDFCQPWAMSA-KKHAAJSZSA-N Asn-Val-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XOQYDFCQPWAMSA-KKHAAJSZSA-N 0.000 description 1
- WQAOZCVOOYUWKG-LSJOCFKGSA-N Asn-Val-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CC(=O)N)N WQAOZCVOOYUWKG-LSJOCFKGSA-N 0.000 description 1
- SLHOOKXYTYAJGQ-XVYDVKMFSA-N Asp-Ala-His Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CNC=N1 SLHOOKXYTYAJGQ-XVYDVKMFSA-N 0.000 description 1
- NAPNAGZWHQHZLG-ZLUOBGJFSA-N Asp-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)O)N NAPNAGZWHQHZLG-ZLUOBGJFSA-N 0.000 description 1
- NZJDBCYBYCUEDC-UBHSHLNASA-N Asp-Cys-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC(=O)O)N NZJDBCYBYCUEDC-UBHSHLNASA-N 0.000 description 1
- PMEHKVHZQKJACS-PEFMBERDSA-N Asp-Gln-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O PMEHKVHZQKJACS-PEFMBERDSA-N 0.000 description 1
- GHODABZPVZMWCE-FXQIFTODSA-N Asp-Glu-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O GHODABZPVZMWCE-FXQIFTODSA-N 0.000 description 1
- OMMIEVATLAGRCK-BYPYZUCNSA-N Asp-Gly-Gly Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)NCC(O)=O OMMIEVATLAGRCK-BYPYZUCNSA-N 0.000 description 1
- RQYMKRMRZWJGHC-BQBZGAKWSA-N Asp-Gly-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)O)N RQYMKRMRZWJGHC-BQBZGAKWSA-N 0.000 description 1
- UBPMOJLRVMGTOQ-GARJFASQSA-N Asp-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CC(=O)O)N)C(=O)O UBPMOJLRVMGTOQ-GARJFASQSA-N 0.000 description 1
- YFSLJHLQOALGSY-ZPFDUUQYSA-N Asp-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N YFSLJHLQOALGSY-ZPFDUUQYSA-N 0.000 description 1
- DWOGMPWRQQWPPF-GUBZILKMSA-N Asp-Leu-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O DWOGMPWRQQWPPF-GUBZILKMSA-N 0.000 description 1
- AYFVRYXNDHBECD-YUMQZZPRSA-N Asp-Leu-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O AYFVRYXNDHBECD-YUMQZZPRSA-N 0.000 description 1
- GKWFMNNNYZHJHV-SRVKXCTJSA-N Asp-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC(O)=O GKWFMNNNYZHJHV-SRVKXCTJSA-N 0.000 description 1
- DINOVZWPTMGSRF-QXEWZRGKSA-N Asp-Pro-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O DINOVZWPTMGSRF-QXEWZRGKSA-N 0.000 description 1
- ZBYLEBZCVKLPCY-FXQIFTODSA-N Asp-Ser-Arg Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ZBYLEBZCVKLPCY-FXQIFTODSA-N 0.000 description 1
- XXAMCEGRCZQGEM-ZLUOBGJFSA-N Asp-Ser-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O XXAMCEGRCZQGEM-ZLUOBGJFSA-N 0.000 description 1
- MGSVBZIBCCKGCY-ZLUOBGJFSA-N Asp-Ser-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O MGSVBZIBCCKGCY-ZLUOBGJFSA-N 0.000 description 1
- IWLZBRTUIVXZJD-OLHMAJIHSA-N Asp-Thr-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O IWLZBRTUIVXZJD-OLHMAJIHSA-N 0.000 description 1
- WAEDSQFVZJUHLI-BYULHYEWSA-N Asp-Val-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O WAEDSQFVZJUHLI-BYULHYEWSA-N 0.000 description 1
- GIKOVDMXBAFXDF-NHCYSSNCSA-N Asp-Val-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O GIKOVDMXBAFXDF-NHCYSSNCSA-N 0.000 description 1
- ZUNMTUPRQMWMHX-LSJOCFKGSA-N Asp-Val-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O ZUNMTUPRQMWMHX-LSJOCFKGSA-N 0.000 description 1
- NOCCABSVTRONIN-CIUDSAMLSA-N Cys-Ala-Leu Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CS)N NOCCABSVTRONIN-CIUDSAMLSA-N 0.000 description 1
- XABFFGOGKOORCG-CIUDSAMLSA-N Cys-Asp-Leu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O XABFFGOGKOORCG-CIUDSAMLSA-N 0.000 description 1
- KXUKWRVYDYIPSQ-CIUDSAMLSA-N Cys-Leu-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O KXUKWRVYDYIPSQ-CIUDSAMLSA-N 0.000 description 1
- KCXVZYZYPLLWCC-UHFFFAOYSA-N EDTA Chemical compound OC(=O)CN(CC(O)=O)CCN(CC(O)=O)CC(O)=O KCXVZYZYPLLWCC-UHFFFAOYSA-N 0.000 description 1
- 241001198387 Escherichia coli BL21(DE3) Species 0.000 description 1
- 108010072062 GEKG peptide Proteins 0.000 description 1
- DTMLKCYOQKZXKZ-HJGDQZAQSA-N Gln-Arg-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O DTMLKCYOQKZXKZ-HJGDQZAQSA-N 0.000 description 1
- PONUFVLSGMQFAI-AVGNSLFASA-N Gln-Asn-Phe Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PONUFVLSGMQFAI-AVGNSLFASA-N 0.000 description 1
- GXMBDEGTXHQBAO-NKIYYHGXSA-N Gln-His-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCC(=O)N)N)O GXMBDEGTXHQBAO-NKIYYHGXSA-N 0.000 description 1
- FFVXLVGUJBCKRX-UKJIMTQDSA-N Gln-Ile-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CCC(=O)N)N FFVXLVGUJBCKRX-UKJIMTQDSA-N 0.000 description 1
- QBEWLBKBGXVVPD-RYUDHWBXSA-N Gln-Phe-Gly Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCC(=O)N)N QBEWLBKBGXVVPD-RYUDHWBXSA-N 0.000 description 1
- MFHVAWMMKZBSRQ-ACZMJKKPSA-N Gln-Ser-Cys Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N MFHVAWMMKZBSRQ-ACZMJKKPSA-N 0.000 description 1
- RWQCWSGOOOEGPB-FXQIFTODSA-N Gln-Ser-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O RWQCWSGOOOEGPB-FXQIFTODSA-N 0.000 description 1
- NHMRJKKAVMENKJ-WDCWCFNPSA-N Gln-Thr-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O NHMRJKKAVMENKJ-WDCWCFNPSA-N 0.000 description 1
- AVZHGSCDKIQZPQ-CIUDSAMLSA-N Glu-Arg-Ala Chemical compound C[C@H](NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H](N)CCC(O)=O)C(O)=O AVZHGSCDKIQZPQ-CIUDSAMLSA-N 0.000 description 1
- VTTSANCGJWLPNC-ZPFDUUQYSA-N Glu-Arg-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VTTSANCGJWLPNC-ZPFDUUQYSA-N 0.000 description 1
- RJONUNZIMUXUOI-GUBZILKMSA-N Glu-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N RJONUNZIMUXUOI-GUBZILKMSA-N 0.000 description 1
- XXCDTYBVGMPIOA-FXQIFTODSA-N Glu-Asp-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O XXCDTYBVGMPIOA-FXQIFTODSA-N 0.000 description 1
- GZWOBWMOMPFPCD-CIUDSAMLSA-N Glu-Asp-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)O)N GZWOBWMOMPFPCD-CIUDSAMLSA-N 0.000 description 1
- CUXJIASLBRJOFV-LAEOZQHASA-N Glu-Gly-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O CUXJIASLBRJOFV-LAEOZQHASA-N 0.000 description 1
- KRGZZKWSBGPLKL-IUCAKERBSA-N Glu-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCC(=O)O)N KRGZZKWSBGPLKL-IUCAKERBSA-N 0.000 description 1
- VGOFRWOTSXVPAU-SDDRHHMPSA-N Glu-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CCC(=O)O)N)C(=O)O VGOFRWOTSXVPAU-SDDRHHMPSA-N 0.000 description 1
- VMKCPNBBPGGQBJ-GUBZILKMSA-N Glu-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N VMKCPNBBPGGQBJ-GUBZILKMSA-N 0.000 description 1
- ATVYZJGOZLVXDK-IUCAKERBSA-N Glu-Leu-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O ATVYZJGOZLVXDK-IUCAKERBSA-N 0.000 description 1
- GMAGZGCAYLQBKF-NHCYSSNCSA-N Glu-Met-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(O)=O GMAGZGCAYLQBKF-NHCYSSNCSA-N 0.000 description 1
- UDEPRBFQTWGLCW-CIUDSAMLSA-N Glu-Pro-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O UDEPRBFQTWGLCW-CIUDSAMLSA-N 0.000 description 1
- NNQDRRUXFJYCCJ-NHCYSSNCSA-N Glu-Pro-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O NNQDRRUXFJYCCJ-NHCYSSNCSA-N 0.000 description 1
- ZAPFAWQHBOHWLL-GUBZILKMSA-N Glu-Ser-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)O)N ZAPFAWQHBOHWLL-GUBZILKMSA-N 0.000 description 1
- JVYNYWXHZWVJEF-NUMRIWBASA-N Glu-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O JVYNYWXHZWVJEF-NUMRIWBASA-N 0.000 description 1
- DDXZHOHEABQXSE-NKIYYHGXSA-N Glu-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O DDXZHOHEABQXSE-NKIYYHGXSA-N 0.000 description 1
- MXJYXYDREQWUMS-XKBZYTNZSA-N Glu-Thr-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O MXJYXYDREQWUMS-XKBZYTNZSA-N 0.000 description 1
- HAGKYCXGTRUUFI-RYUDHWBXSA-N Glu-Tyr-Gly Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCC(=O)O)N)O HAGKYCXGTRUUFI-RYUDHWBXSA-N 0.000 description 1
- WGYHAAXZWPEBDQ-IFFSRLJSSA-N Glu-Val-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WGYHAAXZWPEBDQ-IFFSRLJSSA-N 0.000 description 1
- 108010021582 Glucokinase Proteins 0.000 description 1
- 102000030595 Glucokinase Human genes 0.000 description 1
- UGVQELHRNUDMAA-BYPYZUCNSA-N Gly-Ala-Gly Chemical compound [NH3+]CC(=O)N[C@@H](C)C(=O)NCC([O-])=O UGVQELHRNUDMAA-BYPYZUCNSA-N 0.000 description 1
- YMUFWNJHVPQNQD-ZKWXMUAHSA-N Gly-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN YMUFWNJHVPQNQD-ZKWXMUAHSA-N 0.000 description 1
- QIZJOTQTCAGKPU-KWQFWETISA-N Gly-Ala-Tyr Chemical compound [NH3+]CC(=O)N[C@@H](C)C(=O)N[C@H](C([O-])=O)CC1=CC=C(O)C=C1 QIZJOTQTCAGKPU-KWQFWETISA-N 0.000 description 1
- OGCIHJPYKVSMTE-YUMQZZPRSA-N Gly-Arg-Glu Chemical compound [H]NCC(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O OGCIHJPYKVSMTE-YUMQZZPRSA-N 0.000 description 1
- MHHUEAIBJZWDBH-YUMQZZPRSA-N Gly-Asp-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)CN MHHUEAIBJZWDBH-YUMQZZPRSA-N 0.000 description 1
- BEQGFMIBZFNROK-JGVFFNPUSA-N Gly-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)CN)C(=O)O BEQGFMIBZFNROK-JGVFFNPUSA-N 0.000 description 1
- HQRHFUYMGCHHJS-LURJTMIESA-N Gly-Gly-Arg Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N HQRHFUYMGCHHJS-LURJTMIESA-N 0.000 description 1
- QPTNELDXWKRIFX-YFKPBYRVSA-N Gly-Gly-Gln Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O QPTNELDXWKRIFX-YFKPBYRVSA-N 0.000 description 1
- KAJAOGBVWCYGHZ-JTQLQIEISA-N Gly-Gly-Phe Chemical compound [NH3+]CC(=O)NCC(=O)N[C@H](C([O-])=O)CC1=CC=CC=C1 KAJAOGBVWCYGHZ-JTQLQIEISA-N 0.000 description 1
- ADZGCWWDPFDHCY-ZETCQYMHSA-N Gly-His-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CC1=CN=CN1 ADZGCWWDPFDHCY-ZETCQYMHSA-N 0.000 description 1
- HKSNHPVETYYJBK-LAEOZQHASA-N Gly-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)CN HKSNHPVETYYJBK-LAEOZQHASA-N 0.000 description 1
- HMHRTKOWRUPPNU-RCOVLWMOSA-N Gly-Ile-Gly Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O HMHRTKOWRUPPNU-RCOVLWMOSA-N 0.000 description 1
- COVXELOAORHTND-LSJOCFKGSA-N Gly-Ile-Val Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O COVXELOAORHTND-LSJOCFKGSA-N 0.000 description 1
- PAWIVEIWWYGBAM-YUMQZZPRSA-N Gly-Leu-Ala Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O PAWIVEIWWYGBAM-YUMQZZPRSA-N 0.000 description 1
- LHYJCVCQPWRMKZ-WEDXCCLWSA-N Gly-Leu-Thr Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LHYJCVCQPWRMKZ-WEDXCCLWSA-N 0.000 description 1
- MIIVFRCYJABHTQ-ONGXEEELSA-N Gly-Leu-Val Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O MIIVFRCYJABHTQ-ONGXEEELSA-N 0.000 description 1
- IUKIDFVOUHZRAK-QWRGUYRKSA-N Gly-Lys-His Chemical compound NCCCC[C@H](NC(=O)CN)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 IUKIDFVOUHZRAK-QWRGUYRKSA-N 0.000 description 1
- OQQKUTVULYLCDG-ONGXEEELSA-N Gly-Lys-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCCCN)NC(=O)CN)C(O)=O OQQKUTVULYLCDG-ONGXEEELSA-N 0.000 description 1
- QGDOOCIPHSSADO-STQMWFEESA-N Gly-Met-Phe Chemical compound [H]NCC(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QGDOOCIPHSSADO-STQMWFEESA-N 0.000 description 1
- IGOYNRWLWHWAQO-JTQLQIEISA-N Gly-Phe-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 IGOYNRWLWHWAQO-JTQLQIEISA-N 0.000 description 1
- FEUPVVCGQLNXNP-IRXDYDNUSA-N Gly-Phe-Phe Chemical compound C([C@H](NC(=O)CN)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 FEUPVVCGQLNXNP-IRXDYDNUSA-N 0.000 description 1
- OHUKZZYSJBKFRR-WHFBIAKZSA-N Gly-Ser-Asp Chemical compound [H]NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O OHUKZZYSJBKFRR-WHFBIAKZSA-N 0.000 description 1
- CSMYMGFCEJWALV-WDSKDSINSA-N Gly-Ser-Gln Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(N)=O CSMYMGFCEJWALV-WDSKDSINSA-N 0.000 description 1
- SOEGEPHNZOISMT-BYPYZUCNSA-N Gly-Ser-Gly Chemical compound NCC(=O)N[C@@H](CO)C(=O)NCC(O)=O SOEGEPHNZOISMT-BYPYZUCNSA-N 0.000 description 1
- TVTZEOHWHUVYCG-KYNKHSRBSA-N Gly-Thr-Thr Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O TVTZEOHWHUVYCG-KYNKHSRBSA-N 0.000 description 1
- GWCJMBNBFYBQCV-XPUUQOCRSA-N Gly-Val-Ala Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O GWCJMBNBFYBQCV-XPUUQOCRSA-N 0.000 description 1
- SYOJVRNQCXYEOV-XVKPBYJWSA-N Gly-Val-Glu Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SYOJVRNQCXYEOV-XVKPBYJWSA-N 0.000 description 1
- ZVXMEWXHFBYJPI-LSJOCFKGSA-N Gly-Val-Ile Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ZVXMEWXHFBYJPI-LSJOCFKGSA-N 0.000 description 1
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 1
- CJGDTAHEMXLRMB-ULQDDVLXSA-N His-Arg-Phe Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O CJGDTAHEMXLRMB-ULQDDVLXSA-N 0.000 description 1
- VLPMGIJPAWENQB-SRVKXCTJSA-N His-Cys-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(O)=O VLPMGIJPAWENQB-SRVKXCTJSA-N 0.000 description 1
- YADRBUZBKHHDAO-XPUUQOCRSA-N His-Gly-Ala Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H](C)C(O)=O YADRBUZBKHHDAO-XPUUQOCRSA-N 0.000 description 1
- DYKZGTLPSNOFHU-DEQVHRJGSA-N His-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N DYKZGTLPSNOFHU-DEQVHRJGSA-N 0.000 description 1
- SKOKHBGDXGTDDP-MELADBBJSA-N His-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N SKOKHBGDXGTDDP-MELADBBJSA-N 0.000 description 1
- UMBKDWGQESDCTO-KKUMJFAQSA-N His-Lys-Lys Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O UMBKDWGQESDCTO-KKUMJFAQSA-N 0.000 description 1
- VIJMRAIWYWRXSR-CIUDSAMLSA-N His-Ser-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CN=CN1 VIJMRAIWYWRXSR-CIUDSAMLSA-N 0.000 description 1
- LPBWRHRHEIYAIP-KKUMJFAQSA-N His-Tyr-Asp Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O LPBWRHRHEIYAIP-KKUMJFAQSA-N 0.000 description 1
- XGBVLRJLHUVCNK-DCAQKATOSA-N His-Val-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O XGBVLRJLHUVCNK-DCAQKATOSA-N 0.000 description 1
- NKVZTQVGUNLLQW-JBDRJPRFSA-N Ile-Ala-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(=O)O)N NKVZTQVGUNLLQW-JBDRJPRFSA-N 0.000 description 1
- VAXBXNPRXPHGHG-BJDJZHNGSA-N Ile-Ala-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)O)N VAXBXNPRXPHGHG-BJDJZHNGSA-N 0.000 description 1
- WUEIUSDAECDLQO-NAKRPEOUSA-N Ile-Ala-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)O)N WUEIUSDAECDLQO-NAKRPEOUSA-N 0.000 description 1
- QSPLUJGYOPZINY-ZPFDUUQYSA-N Ile-Asp-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N QSPLUJGYOPZINY-ZPFDUUQYSA-N 0.000 description 1
- WZDCVAWMBUNDDY-KBIXCLLPSA-N Ile-Glu-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](C)C(=O)O)N WZDCVAWMBUNDDY-KBIXCLLPSA-N 0.000 description 1
- PNDMHTTXXPUQJH-RWRJDSDZSA-N Ile-Glu-Thr Chemical compound N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H]([C@H](O)C)C(=O)O PNDMHTTXXPUQJH-RWRJDSDZSA-N 0.000 description 1
- ZXIGYKICRDFISM-DJFWLOJKSA-N Ile-His-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC(=O)N)C(=O)O)N ZXIGYKICRDFISM-DJFWLOJKSA-N 0.000 description 1
- CSQNHSGHAPRGPQ-YTFOTSKYSA-N Ile-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCCN)C(=O)O)N CSQNHSGHAPRGPQ-YTFOTSKYSA-N 0.000 description 1
- DMSVBUWGDLYNLC-IAVJCBSLSA-N Ile-Ile-Phe Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 DMSVBUWGDLYNLC-IAVJCBSLSA-N 0.000 description 1
- AYLAAGNJNVZDPY-CYDGBPFRSA-N Ile-Met-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCSC)C(=O)O)N AYLAAGNJNVZDPY-CYDGBPFRSA-N 0.000 description 1
- YKZAMJXNJUWFIK-JBDRJPRFSA-N Ile-Ser-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(=O)O)N YKZAMJXNJUWFIK-JBDRJPRFSA-N 0.000 description 1
- WCNWGAUZWWSYDG-SVSWQMSJSA-N Ile-Thr-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)O)N WCNWGAUZWWSYDG-SVSWQMSJSA-N 0.000 description 1
- APQYGMBHIVXFML-OSUNSFLBSA-N Ile-Val-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N APQYGMBHIVXFML-OSUNSFLBSA-N 0.000 description 1
- 108010065920 Insulin Lispro Proteins 0.000 description 1
- RCFDOSNHHZGBOY-UHFFFAOYSA-N L-isoleucyl-L-alanine Natural products CCC(C)C(N)C(=O)NC(C)C(O)=O RCFDOSNHHZGBOY-UHFFFAOYSA-N 0.000 description 1
- 241000880493 Leptailurus serval Species 0.000 description 1
- WNGVUZWBXZKQES-YUMQZZPRSA-N Leu-Ala-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O WNGVUZWBXZKQES-YUMQZZPRSA-N 0.000 description 1
- HASRFYOMVPJRPU-SRVKXCTJSA-N Leu-Arg-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(O)=O)C(O)=O HASRFYOMVPJRPU-SRVKXCTJSA-N 0.000 description 1
- QUAAUWNLWMLERT-IHRRRGAJSA-N Leu-Arg-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(C)C)C(O)=O QUAAUWNLWMLERT-IHRRRGAJSA-N 0.000 description 1
- DUBAVOVZNZKEQQ-AVGNSLFASA-N Leu-Arg-Val Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CCCN=C(N)N DUBAVOVZNZKEQQ-AVGNSLFASA-N 0.000 description 1
- WGNOPSQMIQERPK-UHFFFAOYSA-N Leu-Asn-Pro Natural products CC(C)CC(N)C(=O)NC(CC(=O)N)C(=O)N1CCCC1C(=O)O WGNOPSQMIQERPK-UHFFFAOYSA-N 0.000 description 1
- KTFHTMHHKXUYPW-ZPFDUUQYSA-N Leu-Asp-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KTFHTMHHKXUYPW-ZPFDUUQYSA-N 0.000 description 1
- CLVUXCBGKUECIT-HJGDQZAQSA-N Leu-Asp-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CLVUXCBGKUECIT-HJGDQZAQSA-N 0.000 description 1
- DLCXCECTCPKKCD-GUBZILKMSA-N Leu-Gln-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O DLCXCECTCPKKCD-GUBZILKMSA-N 0.000 description 1
- WIDZHJTYKYBLSR-DCAQKATOSA-N Leu-Glu-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WIDZHJTYKYBLSR-DCAQKATOSA-N 0.000 description 1
- NEEOBPIXKWSBRF-IUCAKERBSA-N Leu-Glu-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O NEEOBPIXKWSBRF-IUCAKERBSA-N 0.000 description 1
- HQUXQAMSWFIRET-AVGNSLFASA-N Leu-Glu-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN HQUXQAMSWFIRET-AVGNSLFASA-N 0.000 description 1
- OXRLYTYUXAQTHP-YUMQZZPRSA-N Leu-Gly-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](C)C(O)=O OXRLYTYUXAQTHP-YUMQZZPRSA-N 0.000 description 1
- BABSVXFGKFLIGW-UWVGGRQHSA-N Leu-Gly-Arg Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCNC(N)=N BABSVXFGKFLIGW-UWVGGRQHSA-N 0.000 description 1
- POZULHZYLPGXMR-ONGXEEELSA-N Leu-Gly-Val Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O POZULHZYLPGXMR-ONGXEEELSA-N 0.000 description 1
- DBSLVQBXKVKDKJ-BJDJZHNGSA-N Leu-Ile-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O DBSLVQBXKVKDKJ-BJDJZHNGSA-N 0.000 description 1
- XVZCXCTYGHPNEM-UHFFFAOYSA-N Leu-Leu-Pro Natural products CC(C)CC(N)C(=O)NC(CC(C)C)C(=O)N1CCCC1C(O)=O XVZCXCTYGHPNEM-UHFFFAOYSA-N 0.000 description 1
- RXGLHDWAZQECBI-SRVKXCTJSA-N Leu-Leu-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O RXGLHDWAZQECBI-SRVKXCTJSA-N 0.000 description 1
- LVTJJOJKDCVZGP-QWRGUYRKSA-N Leu-Lys-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O LVTJJOJKDCVZGP-QWRGUYRKSA-N 0.000 description 1
- FZMNAYBEFGZEIF-AVGNSLFASA-N Leu-Met-Met Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCSC)C(=O)O)N FZMNAYBEFGZEIF-AVGNSLFASA-N 0.000 description 1
- HDHQQEDVWQGBEE-DCAQKATOSA-N Leu-Met-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(O)=O HDHQQEDVWQGBEE-DCAQKATOSA-N 0.000 description 1
- PTRKPHUGYULXPU-KKUMJFAQSA-N Leu-Phe-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O PTRKPHUGYULXPU-KKUMJFAQSA-N 0.000 description 1
- UCBPDSYUVAAHCD-UWVGGRQHSA-N Leu-Pro-Gly Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O UCBPDSYUVAAHCD-UWVGGRQHSA-N 0.000 description 1
- KWLWZYMNUZJKMZ-IHRRRGAJSA-N Leu-Pro-Leu Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O KWLWZYMNUZJKMZ-IHRRRGAJSA-N 0.000 description 1
- SBANPBVRHYIMRR-UHFFFAOYSA-N Leu-Ser-Pro Natural products CC(C)CC(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O SBANPBVRHYIMRR-UHFFFAOYSA-N 0.000 description 1
- ICYRCNICGBJLGM-HJGDQZAQSA-N Leu-Thr-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(O)=O ICYRCNICGBJLGM-HJGDQZAQSA-N 0.000 description 1
- KLSUAWUZBMAZCL-RHYQMDGZSA-N Leu-Thr-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(O)=O KLSUAWUZBMAZCL-RHYQMDGZSA-N 0.000 description 1
- ILDSIMPXNFWKLH-KATARQTJSA-N Leu-Thr-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O ILDSIMPXNFWKLH-KATARQTJSA-N 0.000 description 1
- JGKHAFUAPZCCDU-BZSNNMDCSA-N Leu-Tyr-Leu Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C([O-])=O)CC1=CC=C(O)C=C1 JGKHAFUAPZCCDU-BZSNNMDCSA-N 0.000 description 1
- AIMGJYMCTAABEN-GVXVVHGQSA-N Leu-Val-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O AIMGJYMCTAABEN-GVXVVHGQSA-N 0.000 description 1
- PNPYKQFJGRFYJE-GUBZILKMSA-N Lys-Ala-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O PNPYKQFJGRFYJE-GUBZILKMSA-N 0.000 description 1
- WSXTWLJHTLRFLW-SRVKXCTJSA-N Lys-Ala-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(O)=O WSXTWLJHTLRFLW-SRVKXCTJSA-N 0.000 description 1
- IRNSXVOWSXSULE-DCAQKATOSA-N Lys-Ala-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN IRNSXVOWSXSULE-DCAQKATOSA-N 0.000 description 1
- BYPMOIFBQPEWOH-CIUDSAMLSA-N Lys-Asn-Asp Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N BYPMOIFBQPEWOH-CIUDSAMLSA-N 0.000 description 1
- AAORVPFVUIHEAB-YUMQZZPRSA-N Lys-Asp-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O AAORVPFVUIHEAB-YUMQZZPRSA-N 0.000 description 1
- PGBPWPTUOSCNLE-JYJNAYRXSA-N Lys-Gln-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCCN)N PGBPWPTUOSCNLE-JYJNAYRXSA-N 0.000 description 1
- GQZMPWBZQALKJO-UWVGGRQHSA-N Lys-Gly-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O GQZMPWBZQALKJO-UWVGGRQHSA-N 0.000 description 1
- QZONCCHVHCOBSK-YUMQZZPRSA-N Lys-Gly-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O QZONCCHVHCOBSK-YUMQZZPRSA-N 0.000 description 1
- PBLLTSKBTAHDNA-KBPBESRZSA-N Lys-Gly-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PBLLTSKBTAHDNA-KBPBESRZSA-N 0.000 description 1
- HAUUXTXKJNVIFY-ONGXEEELSA-N Lys-Gly-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O HAUUXTXKJNVIFY-ONGXEEELSA-N 0.000 description 1
- JYXBNQOKPRQNQS-YTFOTSKYSA-N Lys-Ile-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JYXBNQOKPRQNQS-YTFOTSKYSA-N 0.000 description 1
- PRSBSVAVOQOAMI-BJDJZHNGSA-N Lys-Ile-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCCCN PRSBSVAVOQOAMI-BJDJZHNGSA-N 0.000 description 1
- WRODMZBHNNPRLN-SRVKXCTJSA-N Lys-Leu-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O WRODMZBHNNPRLN-SRVKXCTJSA-N 0.000 description 1
- TWPCWKVOZDUYAA-KKUMJFAQSA-N Lys-Phe-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O TWPCWKVOZDUYAA-KKUMJFAQSA-N 0.000 description 1
- DNWBUCHHMRQWCZ-GUBZILKMSA-N Lys-Ser-Gln Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(N)=O DNWBUCHHMRQWCZ-GUBZILKMSA-N 0.000 description 1
- TVHCDSBMFQYPNA-RHYQMDGZSA-N Lys-Thr-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O TVHCDSBMFQYPNA-RHYQMDGZSA-N 0.000 description 1
- VHTOGMKQXXJOHG-RHYQMDGZSA-N Lys-Thr-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O VHTOGMKQXXJOHG-RHYQMDGZSA-N 0.000 description 1
- SUZVLFWOCKHWET-CQDKDKBSSA-N Lys-Tyr-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O SUZVLFWOCKHWET-CQDKDKBSSA-N 0.000 description 1
- QLFAPXUXEBAWEK-NHCYSSNCSA-N Lys-Val-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O QLFAPXUXEBAWEK-NHCYSSNCSA-N 0.000 description 1
- TXTZMVNJIRZABH-ULQDDVLXSA-N Lys-Val-Phe Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 TXTZMVNJIRZABH-ULQDDVLXSA-N 0.000 description 1
- WYEXWKAWMNJKPN-UBHSHLNASA-N Met-Ala-Phe Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CCSC)N WYEXWKAWMNJKPN-UBHSHLNASA-N 0.000 description 1
- RZJOHSFAEZBWLK-CIUDSAMLSA-N Met-Gln-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N RZJOHSFAEZBWLK-CIUDSAMLSA-N 0.000 description 1
- YORIKIDJCPKBON-YUMQZZPRSA-N Met-Glu-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O YORIKIDJCPKBON-YUMQZZPRSA-N 0.000 description 1
- MYAPQOBHGWJZOM-UWVGGRQHSA-N Met-Gly-Leu Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(C)C MYAPQOBHGWJZOM-UWVGGRQHSA-N 0.000 description 1
- HWROAFGWPQUPTE-OSUNSFLBSA-N Met-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CCSC)N HWROAFGWPQUPTE-OSUNSFLBSA-N 0.000 description 1
- CIDICGYKRUTYLE-FXQIFTODSA-N Met-Ser-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O CIDICGYKRUTYLE-FXQIFTODSA-N 0.000 description 1
- DBMLDOWSVHMQQN-XGEHTFHBSA-N Met-Ser-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O DBMLDOWSVHMQQN-XGEHTFHBSA-N 0.000 description 1
- VOAKKHOIAFKOQZ-JYJNAYRXSA-N Met-Tyr-Arg Chemical compound NC(=N)NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CCSC)CC1=CC=C(O)C=C1 VOAKKHOIAFKOQZ-JYJNAYRXSA-N 0.000 description 1
- IQJMEDDVOGMTKT-SRVKXCTJSA-N Met-Val-Val Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O IQJMEDDVOGMTKT-SRVKXCTJSA-N 0.000 description 1
- 241001291091 Mimivirus Species 0.000 description 1
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 1
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 1
- 238000012408 PCR amplification Methods 0.000 description 1
- FPTXMUIBLMGTQH-ONGXEEELSA-N Phe-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 FPTXMUIBLMGTQH-ONGXEEELSA-N 0.000 description 1
- QMMRHASQEVCJGR-UBHSHLNASA-N Phe-Ala-Pro Chemical compound C([C@H](N)C(=O)N[C@@H](C)C(=O)N1[C@@H](CCC1)C(O)=O)C1=CC=CC=C1 QMMRHASQEVCJGR-UBHSHLNASA-N 0.000 description 1
- XWBJLKDCHJVKAK-KKUMJFAQSA-N Phe-Arg-Gln Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N XWBJLKDCHJVKAK-KKUMJFAQSA-N 0.000 description 1
- MPGJIHFJCXTVEX-KKUMJFAQSA-N Phe-Arg-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O MPGJIHFJCXTVEX-KKUMJFAQSA-N 0.000 description 1
- QCHNRQQVLJYDSI-DLOVCJGASA-N Phe-Asn-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 QCHNRQQVLJYDSI-DLOVCJGASA-N 0.000 description 1
- HXSUFWQYLPKEHF-IHRRRGAJSA-N Phe-Asn-Arg Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N HXSUFWQYLPKEHF-IHRRRGAJSA-N 0.000 description 1
- CDNPIRSCAFMMBE-SRVKXCTJSA-N Phe-Asn-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O CDNPIRSCAFMMBE-SRVKXCTJSA-N 0.000 description 1
- IQXOZIDWLZYYAW-IHRRRGAJSA-N Phe-Asp-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N IQXOZIDWLZYYAW-IHRRRGAJSA-N 0.000 description 1
- BIYWZVCPZIFGPY-QWRGUYRKSA-N Phe-Gly-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](CO)C(O)=O BIYWZVCPZIFGPY-QWRGUYRKSA-N 0.000 description 1
- ZKSLXIGKRJMALF-MGHWNKPDSA-N Phe-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC2=CC=CC=C2)N ZKSLXIGKRJMALF-MGHWNKPDSA-N 0.000 description 1
- KRYSMKKRRRWOCZ-QEWYBTABSA-N Phe-Ile-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O KRYSMKKRRRWOCZ-QEWYBTABSA-N 0.000 description 1
- TXKWKTWYTIAZSV-KKUMJFAQSA-N Phe-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N TXKWKTWYTIAZSV-KKUMJFAQSA-N 0.000 description 1
- LRBSWBVUCLLRLU-BZSNNMDCSA-N Phe-Leu-Lys Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)Cc1ccccc1)C(=O)N[C@@H](CCCCN)C(O)=O LRBSWBVUCLLRLU-BZSNNMDCSA-N 0.000 description 1
- CMHTUJQZQXFNTQ-OEAJRASXSA-N Phe-Leu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC1=CC=CC=C1)N)O CMHTUJQZQXFNTQ-OEAJRASXSA-N 0.000 description 1
- KLXQWABNAWDRAY-ACRUOGEOSA-N Phe-Lys-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 KLXQWABNAWDRAY-ACRUOGEOSA-N 0.000 description 1
- FQUUYTNBMIBOHS-IHRRRGAJSA-N Phe-Met-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N FQUUYTNBMIBOHS-IHRRRGAJSA-N 0.000 description 1
- MGLBSROLWAWCKN-FCLVOEFKSA-N Phe-Phe-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MGLBSROLWAWCKN-FCLVOEFKSA-N 0.000 description 1
- RVEVENLSADZUMS-IHRRRGAJSA-N Phe-Pro-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O RVEVENLSADZUMS-IHRRRGAJSA-N 0.000 description 1
- GNRMAQSIROFNMI-IXOXFDKPSA-N Phe-Thr-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O GNRMAQSIROFNMI-IXOXFDKPSA-N 0.000 description 1
- ABEFOXGAIIJDCL-SFJXLCSZSA-N Phe-Thr-Trp Chemical compound C([C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(O)=O)C1=CC=CC=C1 ABEFOXGAIIJDCL-SFJXLCSZSA-N 0.000 description 1
- CVAUVSOFHJKCHN-BZSNNMDCSA-N Phe-Tyr-Cys Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CS)C(O)=O)C1=CC=CC=C1 CVAUVSOFHJKCHN-BZSNNMDCSA-N 0.000 description 1
- APKRGYLBSCWJJP-FXQIFTODSA-N Pro-Ala-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O APKRGYLBSCWJJP-FXQIFTODSA-N 0.000 description 1
- DRVIASBABBMZTF-GUBZILKMSA-N Pro-Ala-Met Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@@H]1CCCN1 DRVIASBABBMZTF-GUBZILKMSA-N 0.000 description 1
- ORPZXBQTEHINPB-SRVKXCTJSA-N Pro-Arg-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H]1CCCN1)C(O)=O ORPZXBQTEHINPB-SRVKXCTJSA-N 0.000 description 1
- XWYXZPHPYKRYPA-GMOBBJLQSA-N Pro-Asn-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O XWYXZPHPYKRYPA-GMOBBJLQSA-N 0.000 description 1
- AMBLXEMWFARNNQ-DCAQKATOSA-N Pro-Asn-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@@H]1CCCN1 AMBLXEMWFARNNQ-DCAQKATOSA-N 0.000 description 1
- TXPUNZXZDVJUJQ-LPEHRKFASA-N Pro-Asn-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)N)C(=O)N2CCC[C@@H]2C(=O)O TXPUNZXZDVJUJQ-LPEHRKFASA-N 0.000 description 1
- TUYWCHPXKQTISF-LPEHRKFASA-N Pro-Cys-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CS)C(=O)N2CCC[C@@H]2C(=O)O TUYWCHPXKQTISF-LPEHRKFASA-N 0.000 description 1
- FISHYTLIMUYTQY-GUBZILKMSA-N Pro-Gln-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H]1CCCN1 FISHYTLIMUYTQY-GUBZILKMSA-N 0.000 description 1
- DXTOOBDIIAJZBJ-BQBZGAKWSA-N Pro-Gly-Ser Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CO)C(O)=O DXTOOBDIIAJZBJ-BQBZGAKWSA-N 0.000 description 1
- BBFRBZYKHIKFBX-GMOBBJLQSA-N Pro-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@@H]1CCCN1 BBFRBZYKHIKFBX-GMOBBJLQSA-N 0.000 description 1
- YXHYJEPDKSYPSQ-AVGNSLFASA-N Pro-Leu-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 YXHYJEPDKSYPSQ-AVGNSLFASA-N 0.000 description 1
- HFNPOYOKIPGAEI-SRVKXCTJSA-N Pro-Leu-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 HFNPOYOKIPGAEI-SRVKXCTJSA-N 0.000 description 1
- WOIFYRZPIORBRY-AVGNSLFASA-N Pro-Lys-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O WOIFYRZPIORBRY-AVGNSLFASA-N 0.000 description 1
- JIWJRKNYLSHONY-KKUMJFAQSA-N Pro-Phe-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O JIWJRKNYLSHONY-KKUMJFAQSA-N 0.000 description 1
- QHSSUIHLAIWXEE-IHRRRGAJSA-N Pro-Tyr-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O QHSSUIHLAIWXEE-IHRRRGAJSA-N 0.000 description 1
- 238000010802 RNA extraction kit Methods 0.000 description 1
- ZUGXSSFMTXKHJS-ZLUOBGJFSA-N Ser-Ala-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O ZUGXSSFMTXKHJS-ZLUOBGJFSA-N 0.000 description 1
- NLQUOHDCLSFABG-GUBZILKMSA-N Ser-Arg-Arg Chemical compound NC(N)=NCCC[C@H](NC(=O)[C@H](CO)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O NLQUOHDCLSFABG-GUBZILKMSA-N 0.000 description 1
- HQTKVSCNCDLXSX-BQBZGAKWSA-N Ser-Arg-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O HQTKVSCNCDLXSX-BQBZGAKWSA-N 0.000 description 1
- QVOGDCQNGLBNCR-FXQIFTODSA-N Ser-Arg-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O QVOGDCQNGLBNCR-FXQIFTODSA-N 0.000 description 1
- ZXLUWXWISXIFIX-ACZMJKKPSA-N Ser-Asn-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZXLUWXWISXIFIX-ACZMJKKPSA-N 0.000 description 1
- FTVRVZNYIYWJGB-ACZMJKKPSA-N Ser-Asp-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O FTVRVZNYIYWJGB-ACZMJKKPSA-N 0.000 description 1
- BNFVPSRLHHPQKS-WHFBIAKZSA-N Ser-Asp-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O BNFVPSRLHHPQKS-WHFBIAKZSA-N 0.000 description 1
- QPFJSHSJFIYDJZ-GHCJXIJMSA-N Ser-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CO QPFJSHSJFIYDJZ-GHCJXIJMSA-N 0.000 description 1
- OLIJLNWFEQEFDM-SRVKXCTJSA-N Ser-Asp-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 OLIJLNWFEQEFDM-SRVKXCTJSA-N 0.000 description 1
- UFKPDBLKLOBMRH-XHNCKOQMSA-N Ser-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CO)N)C(=O)O UFKPDBLKLOBMRH-XHNCKOQMSA-N 0.000 description 1
- MIJWOJAXARLEHA-WDSKDSINSA-N Ser-Gly-Glu Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O MIJWOJAXARLEHA-WDSKDSINSA-N 0.000 description 1
- YMTLKLXDFCSCNX-BYPYZUCNSA-N Ser-Gly-Gly Chemical compound OC[C@H](N)C(=O)NCC(=O)NCC(O)=O YMTLKLXDFCSCNX-BYPYZUCNSA-N 0.000 description 1
- XXXAXOWMBOKTRN-XPUUQOCRSA-N Ser-Gly-Val Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O XXXAXOWMBOKTRN-XPUUQOCRSA-N 0.000 description 1
- ZOPISOXXPQNOCO-SVSWQMSJSA-N Ser-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CO)N ZOPISOXXPQNOCO-SVSWQMSJSA-N 0.000 description 1
- NLOAIFSWUUFQFR-CIUDSAMLSA-N Ser-Leu-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O NLOAIFSWUUFQFR-CIUDSAMLSA-N 0.000 description 1
- ZIFYDQAFEMIZII-GUBZILKMSA-N Ser-Leu-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZIFYDQAFEMIZII-GUBZILKMSA-N 0.000 description 1
- IUXGJEIKJBYKOO-SRVKXCTJSA-N Ser-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CO)N IUXGJEIKJBYKOO-SRVKXCTJSA-N 0.000 description 1
- SRKMDKACHDVPMD-SRVKXCTJSA-N Ser-Lys-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CO)N SRKMDKACHDVPMD-SRVKXCTJSA-N 0.000 description 1
- LRZLZIUXQBIWTB-KATARQTJSA-N Ser-Lys-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LRZLZIUXQBIWTB-KATARQTJSA-N 0.000 description 1
- PMCMLDNPAZUYGI-DCAQKATOSA-N Ser-Lys-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O PMCMLDNPAZUYGI-DCAQKATOSA-N 0.000 description 1
- RQXDSYQXBCRXBT-GUBZILKMSA-N Ser-Met-Arg Chemical compound OC[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@H](C(O)=O)CCCN=C(N)N RQXDSYQXBCRXBT-GUBZILKMSA-N 0.000 description 1
- RXSWQCATLWVDLI-XGEHTFHBSA-N Ser-Met-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O RXSWQCATLWVDLI-XGEHTFHBSA-N 0.000 description 1
- BSXKBOUZDAZXHE-CIUDSAMLSA-N Ser-Pro-Glu Chemical compound [H]N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O BSXKBOUZDAZXHE-CIUDSAMLSA-N 0.000 description 1
- BMKNXTJLHFIAAH-CIUDSAMLSA-N Ser-Ser-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O BMKNXTJLHFIAAH-CIUDSAMLSA-N 0.000 description 1
- JURQXQBJKUHGJS-UHFFFAOYSA-N Ser-Ser-Ser-Ser Chemical compound OCC(N)C(=O)NC(CO)C(=O)NC(CO)C(=O)NC(CO)C(O)=O JURQXQBJKUHGJS-UHFFFAOYSA-N 0.000 description 1
- LGIMRDKGABDMBN-DCAQKATOSA-N Ser-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N LGIMRDKGABDMBN-DCAQKATOSA-N 0.000 description 1
- SIEBDTCABMZCLF-XGEHTFHBSA-N Ser-Val-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SIEBDTCABMZCLF-XGEHTFHBSA-N 0.000 description 1
- 229910021607 Silver chloride Inorganic materials 0.000 description 1
- VMHLLURERBWHNL-UHFFFAOYSA-M Sodium acetate Chemical compound [Na+].CC([O-])=O VMHLLURERBWHNL-UHFFFAOYSA-M 0.000 description 1
- DWYAUVCQDTZIJI-VZFHVOOUSA-N Thr-Ala-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O DWYAUVCQDTZIJI-VZFHVOOUSA-N 0.000 description 1
- XSLXHSYIVPGEER-KZVJFYERSA-N Thr-Ala-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O XSLXHSYIVPGEER-KZVJFYERSA-N 0.000 description 1
- LHUBVKCLOVALIA-HJGDQZAQSA-N Thr-Arg-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O LHUBVKCLOVALIA-HJGDQZAQSA-N 0.000 description 1
- LMMDEZPNUTZJAY-GCJQMDKQSA-N Thr-Asp-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O LMMDEZPNUTZJAY-GCJQMDKQSA-N 0.000 description 1
- LYGKYFKSZTUXGZ-ZDLURKLDSA-N Thr-Cys-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)NCC(O)=O LYGKYFKSZTUXGZ-ZDLURKLDSA-N 0.000 description 1
- VGYBYGQXZJDZJU-XQXXSGGOSA-N Thr-Glu-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O VGYBYGQXZJDZJU-XQXXSGGOSA-N 0.000 description 1
- FHDLKMFZKRUQCE-HJGDQZAQSA-N Thr-Glu-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FHDLKMFZKRUQCE-HJGDQZAQSA-N 0.000 description 1
- ONNSECRQFSTMCC-XKBZYTNZSA-N Thr-Glu-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O ONNSECRQFSTMCC-XKBZYTNZSA-N 0.000 description 1
- FLPZMPOZGYPBEN-PPCPHDFISA-N Thr-Leu-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FLPZMPOZGYPBEN-PPCPHDFISA-N 0.000 description 1
- MEJHFIOYJHTWMK-VOAKCMCISA-N Thr-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)[C@@H](C)O MEJHFIOYJHTWMK-VOAKCMCISA-N 0.000 description 1
- VRUFCJZQDACGLH-UVOCVTCTSA-N Thr-Leu-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VRUFCJZQDACGLH-UVOCVTCTSA-N 0.000 description 1
- DXPURPNJDFCKKO-RHYQMDGZSA-N Thr-Lys-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)[C@@H](C)O)C(O)=O DXPURPNJDFCKKO-RHYQMDGZSA-N 0.000 description 1
- DEGCBBCMYWNJNA-RHYQMDGZSA-N Thr-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)[C@@H](C)O DEGCBBCMYWNJNA-RHYQMDGZSA-N 0.000 description 1
- SGAOHNPSEPVAFP-ZDLURKLDSA-N Thr-Ser-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SGAOHNPSEPVAFP-ZDLURKLDSA-N 0.000 description 1
- RVMNUBQWPVOUKH-HEIBUPTGSA-N Thr-Ser-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O RVMNUBQWPVOUKH-HEIBUPTGSA-N 0.000 description 1
- FYBFTPLPAXZBOY-KKHAAJSZSA-N Thr-Val-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O FYBFTPLPAXZBOY-KKHAAJSZSA-N 0.000 description 1
- QNXZCKMXHPULME-ZNSHCXBVSA-N Thr-Val-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N)O QNXZCKMXHPULME-ZNSHCXBVSA-N 0.000 description 1
- VYVBSMCZNHOZGD-RCWTZXSCSA-N Thr-Val-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O VYVBSMCZNHOZGD-RCWTZXSCSA-N 0.000 description 1
- 102000004357 Transferases Human genes 0.000 description 1
- 108090000992 Transferases Proteins 0.000 description 1
- 241000499912 Trichoderma reesei Species 0.000 description 1
- CCZXBOFIBYQLEV-IHPCNDPISA-N Trp-Leu-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)Cc1c[nH]c2ccccc12)C(O)=O CCZXBOFIBYQLEV-IHPCNDPISA-N 0.000 description 1
- PWPJLBWYRTVYQS-PMVMPFDFSA-N Trp-Phe-Leu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O PWPJLBWYRTVYQS-PMVMPFDFSA-N 0.000 description 1
- VNRTXOUAOUZCFW-WDSOQIARSA-N Trp-Val-His Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)Cc1c[nH]c2ccccc12)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O VNRTXOUAOUZCFW-WDSOQIARSA-N 0.000 description 1
- AKFLVKKWVZMFOT-IHRRRGAJSA-N Tyr-Arg-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O AKFLVKKWVZMFOT-IHRRRGAJSA-N 0.000 description 1
- QYSBJAUCUKHSLU-JYJNAYRXSA-N Tyr-Arg-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O QYSBJAUCUKHSLU-JYJNAYRXSA-N 0.000 description 1
- GAYLGYUVTDMLKC-UWJYBYFXSA-N Tyr-Asp-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 GAYLGYUVTDMLKC-UWJYBYFXSA-N 0.000 description 1
- ARPONUQDNWLXOZ-KKUMJFAQSA-N Tyr-Gln-Arg Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ARPONUQDNWLXOZ-KKUMJFAQSA-N 0.000 description 1
- FXYOYUMPUJONGW-FHWLQOOXSA-N Tyr-Gln-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 FXYOYUMPUJONGW-FHWLQOOXSA-N 0.000 description 1
- GIOBXJSONRQHKQ-RYUDHWBXSA-N Tyr-Gly-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O GIOBXJSONRQHKQ-RYUDHWBXSA-N 0.000 description 1
- ILTXFANLDMJWPR-SIUGBPQLSA-N Tyr-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N ILTXFANLDMJWPR-SIUGBPQLSA-N 0.000 description 1
- GITNQBVCEQBDQC-KKUMJFAQSA-N Tyr-Lys-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O GITNQBVCEQBDQC-KKUMJFAQSA-N 0.000 description 1
- OGPKMBOPMDTEDM-IHRRRGAJSA-N Tyr-Met-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N OGPKMBOPMDTEDM-IHRRRGAJSA-N 0.000 description 1
- ITDWWLTTWRRLCC-KJEVXHAQSA-N Tyr-Thr-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 ITDWWLTTWRRLCC-KJEVXHAQSA-N 0.000 description 1
- DRTQHJPVMGBUCF-XVFCMESISA-N Uridine Natural products O[C@@H]1[C@H](O)[C@@H](CO)O[C@H]1N1C(=O)NC(=O)C=C1 DRTQHJPVMGBUCF-XVFCMESISA-N 0.000 description 1
- QPZMOUMNTGTEFR-ZKWXMUAHSA-N Val-Asn-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](C(C)C)N QPZMOUMNTGTEFR-ZKWXMUAHSA-N 0.000 description 1
- IQQYYFPCWKWUHW-YDHLFZDLSA-N Val-Asn-Tyr Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N IQQYYFPCWKWUHW-YDHLFZDLSA-N 0.000 description 1
- XQVRMLRMTAGSFJ-QXEWZRGKSA-N Val-Asp-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N XQVRMLRMTAGSFJ-QXEWZRGKSA-N 0.000 description 1
- CGGVNFJRZJUVAE-BYULHYEWSA-N Val-Asp-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N CGGVNFJRZJUVAE-BYULHYEWSA-N 0.000 description 1
- HHSILIQTHXABKM-YDHLFZDLSA-N Val-Asp-Phe Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](Cc1ccccc1)C(O)=O HHSILIQTHXABKM-YDHLFZDLSA-N 0.000 description 1
- ZXAGTABZUOMUDO-GVXVVHGQSA-N Val-Glu-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N ZXAGTABZUOMUDO-GVXVVHGQSA-N 0.000 description 1
- GMOLURHJBLOBFW-ONGXEEELSA-N Val-Gly-His Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N GMOLURHJBLOBFW-ONGXEEELSA-N 0.000 description 1
- HGJRMXOWUWVUOA-GVXVVHGQSA-N Val-Leu-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N HGJRMXOWUWVUOA-GVXVVHGQSA-N 0.000 description 1
- LYERIXUFCYVFFX-GVXVVHGQSA-N Val-Leu-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N LYERIXUFCYVFFX-GVXVVHGQSA-N 0.000 description 1
- IJGPOONOTBNTFS-GVXVVHGQSA-N Val-Lys-Glu Chemical compound [H]N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O IJGPOONOTBNTFS-GVXVVHGQSA-N 0.000 description 1
- ZRSZTKTVPNSUNA-IHRRRGAJSA-N Val-Lys-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)C(C)C)C(O)=O ZRSZTKTVPNSUNA-IHRRRGAJSA-N 0.000 description 1
- UVHFONIHVHLDDQ-IFFSRLJSSA-N Val-Thr-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N)O UVHFONIHVHLDDQ-IFFSRLJSSA-N 0.000 description 1
- BGTDGENDNWGMDQ-KJEVXHAQSA-N Val-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](C(C)C)N)O BGTDGENDNWGMDQ-KJEVXHAQSA-N 0.000 description 1
- ZLNYBMWGPOKSLW-LSJOCFKGSA-N Val-Val-Asp Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O ZLNYBMWGPOKSLW-LSJOCFKGSA-N 0.000 description 1
- 230000002378 acidificating effect Effects 0.000 description 1
- 230000004913 activation Effects 0.000 description 1
- 108010024078 alanyl-glycyl-serine Proteins 0.000 description 1
- 108010044940 alanylglutamine Proteins 0.000 description 1
- 108010047495 alanylglycine Proteins 0.000 description 1
- 108010087924 alanylproline Proteins 0.000 description 1
- 238000004458 analytical method Methods 0.000 description 1
- 108010013835 arginine glutamate Proteins 0.000 description 1
- 108010018691 arginyl-threonyl-arginine Proteins 0.000 description 1
- 108010059459 arginyl-threonyl-phenylalanine Proteins 0.000 description 1
- 108010068265 aspartyltyrosine Proteins 0.000 description 1
- DRTQHJPVMGBUCF-PSQAKQOGSA-N beta-L-uridine Natural products O[C@H]1[C@@H](O)[C@H](CO)O[C@@H]1N1C(=O)NC(=O)C=C1 DRTQHJPVMGBUCF-PSQAKQOGSA-N 0.000 description 1
- 230000006696 biosynthetic metabolic pathway Effects 0.000 description 1
- 230000008859 change Effects 0.000 description 1
- 238000010367 cloning Methods 0.000 description 1
- 239000002299 complementary DNA Substances 0.000 description 1
- 238000005520 cutting process Methods 0.000 description 1
- 108010060199 cysteinylproline Proteins 0.000 description 1
- 108010069495 cysteinyltyrosine Proteins 0.000 description 1
- RWYFURDDADFSHT-RBBHPAOJSA-N diane Chemical compound OC1=CC=C2[C@H]3CC[C@](C)([C@](CC4)(O)C#C)[C@@H]4[C@@H]3CCC2=C1.C1=C(Cl)C2=CC(=O)[C@@H]3CC3[C@]2(C)[C@@H]2[C@@H]1[C@@H]1CC[C@@](C(C)=O)(OC(=O)C)[C@@]1(C)CC2 RWYFURDDADFSHT-RBBHPAOJSA-N 0.000 description 1
- 239000003814 drug Substances 0.000 description 1
- 238000001976 enzyme digestion Methods 0.000 description 1
- 239000013613 expression plasmid Substances 0.000 description 1
- 239000012634 fragment Substances 0.000 description 1
- 108010078144 glutaminyl-glycine Proteins 0.000 description 1
- 108010049041 glutamylalanine Proteins 0.000 description 1
- 230000034659 glycolysis Effects 0.000 description 1
- 150000002336 glycosamine derivatives Chemical class 0.000 description 1
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 1
- 108010019832 glycyl-asparaginyl-glycine Proteins 0.000 description 1
- 108010062266 glycyl-glycyl-argininal Proteins 0.000 description 1
- 108010084264 glycyl-glycyl-cysteine Proteins 0.000 description 1
- 108010026364 glycyl-glycyl-leucine Proteins 0.000 description 1
- 108010077435 glycyl-phenylalanyl-glycine Proteins 0.000 description 1
- 108010059898 glycyl-tyrosyl-lysine Proteins 0.000 description 1
- 108010010147 glycylglutamine Proteins 0.000 description 1
- 108010081551 glycylphenylalanine Proteins 0.000 description 1
- 108010037850 glycylvaline Proteins 0.000 description 1
- 238000000703 high-speed centrifugation Methods 0.000 description 1
- 210000000987 immune system Anatomy 0.000 description 1
- 238000000338 in vitro Methods 0.000 description 1
- 238000001727 in vivo Methods 0.000 description 1
- 238000002347 injection Methods 0.000 description 1
- 239000007924 injection Substances 0.000 description 1
- 239000002054 inoculum Substances 0.000 description 1
- 108010044374 isoleucyl-tyrosine Proteins 0.000 description 1
- 108010078274 isoleucylvaline Proteins 0.000 description 1
- 238000006317 isomerization reaction Methods 0.000 description 1
- BPHPUYQFMNQIOC-NXRLNHOXSA-N isopropyl beta-D-thiogalactopyranoside Chemical compound CC(C)S[C@@H]1O[C@H](CO)[C@H](O)[C@H](O)[C@H]1O BPHPUYQFMNQIOC-NXRLNHOXSA-N 0.000 description 1
- 108010076756 leucyl-alanyl-phenylalanine Proteins 0.000 description 1
- 108010091871 leucylmethionine Proteins 0.000 description 1
- 108010009298 lysylglutamic acid Proteins 0.000 description 1
- 108010064235 lysylglycine Proteins 0.000 description 1
- 239000008204 material by function Substances 0.000 description 1
- 108020004999 messenger RNA Proteins 0.000 description 1
- 230000037353 metabolic pathway Effects 0.000 description 1
- 108010016686 methionyl-alanyl-serine Proteins 0.000 description 1
- 108010005942 methionylglycine Proteins 0.000 description 1
- 238000002156 mixing Methods 0.000 description 1
- 239000000203 mixture Substances 0.000 description 1
- 150000002772 monosaccharides Chemical group 0.000 description 1
- 238000002887 multiple sequence alignment Methods 0.000 description 1
- 210000000653 nervous system Anatomy 0.000 description 1
- 229910052759 nickel Inorganic materials 0.000 description 1
- 239000002777 nucleoside Substances 0.000 description 1
- 150000003833 nucleoside derivatives Chemical class 0.000 description 1
- 229920001542 oligosaccharide Polymers 0.000 description 1
- 150000002482 oligosaccharides Chemical class 0.000 description 1
- 108010018625 phenylalanylarginine Proteins 0.000 description 1
- 238000002264 polyacrylamide gel electrophoresis Methods 0.000 description 1
- 230000008569 process Effects 0.000 description 1
- 230000009465 prokaryotic expression Effects 0.000 description 1
- 108010025826 prolyl-leucyl-arginine Proteins 0.000 description 1
- 108700042769 prolyl-leucyl-glycine Proteins 0.000 description 1
- 108010031719 prolyl-serine Proteins 0.000 description 1
- 108010004914 prolylarginine Proteins 0.000 description 1
- 108010053725 prolylvaline Proteins 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 108010048818 seryl-histidine Proteins 0.000 description 1
- HKZLPVFGJNLROG-UHFFFAOYSA-M silver monochloride Chemical compound [Cl-].[Ag+] HKZLPVFGJNLROG-UHFFFAOYSA-M 0.000 description 1
- 239000001632 sodium acetate Substances 0.000 description 1
- 235000017281 sodium acetate Nutrition 0.000 description 1
- 239000000126 substance Substances 0.000 description 1
- 239000006228 supernatant Substances 0.000 description 1
- 230000002195 synergetic effect Effects 0.000 description 1
- 238000001308 synthesis method Methods 0.000 description 1
- 238000012360 testing method Methods 0.000 description 1
- 108010061238 threonyl-glycine Proteins 0.000 description 1
- 108010080629 tryptophan-leucine Proteins 0.000 description 1
- DRTQHJPVMGBUCF-UHFFFAOYSA-N uracil arabinoside Natural products OC1C(O)C(CO)OC1N1C(=O)NC(=O)C=C1 DRTQHJPVMGBUCF-UHFFFAOYSA-N 0.000 description 1
- 229940045145 uridine Drugs 0.000 description 1
- IBIDRSSEHFLGSD-UHFFFAOYSA-N valinyl-arginine Natural products CC(C)C(N)C(=O)NC(C(O)=O)CCCN=C(N)N IBIDRSSEHFLGSD-UHFFFAOYSA-N 0.000 description 1
- 108010073969 valyllysine Proteins 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/90—Isomerases (5.)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P19/00—Preparation of compounds containing saccharide radicals
- C12P19/02—Monosaccharides
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P19/00—Preparation of compounds containing saccharide radicals
- C12P19/26—Preparation of nitrogen-containing carbohydrates
- C12P19/28—N-glycosides
- C12P19/38—Nucleosides
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y504/00—Intramolecular transferases (5.4)
- C12Y504/02—Phosphotransferases (phosphomutases) (5.4.2)
Landscapes
- Chemical & Material Sciences (AREA)
- Organic Chemistry (AREA)
- Life Sciences & Earth Sciences (AREA)
- Engineering & Computer Science (AREA)
- Zoology (AREA)
- Health & Medical Sciences (AREA)
- Wood Science & Technology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Genetics & Genomics (AREA)
- General Health & Medical Sciences (AREA)
- General Engineering & Computer Science (AREA)
- Biochemistry (AREA)
- Biotechnology (AREA)
- Microbiology (AREA)
- Molecular Biology (AREA)
- Chemical Kinetics & Catalysis (AREA)
- General Chemical & Material Sciences (AREA)
- Medicinal Chemistry (AREA)
- Biomedical Technology (AREA)
- Enzymes And Modification Thereof (AREA)
Abstract
Description
技术领域technical field
本发明涉及两种乙酰葡萄糖胺磷酸变位酶AtAGM2与AtAGM3的基因序列及其制备方法,尤其涉及这两种酶在核苷酸糖及磷酸己糖异构体生产中的应用。本发明还提供了这两种乙酰葡萄糖胺磷酸变位酶的重组质粒和重组基因工程菌株,以及两种酶的酶学性质。The invention relates to gene sequences of two acetylglucosamine phosphate mutases AtAGM2 and AtAGM3 and a preparation method thereof, in particular to the application of these two enzymes in the production of nucleotide sugar and hexose phosphate isomers. The invention also provides the recombinant plasmids and recombinant genetic engineering strains of the two acetylglucosamine phosphate mutases, as well as the enzymatic properties of the two enzymes.
背景技术Background technique
乙酰葡萄糖胺(GlcNAc,N-乙酰-β-D-氨基葡萄糖)是葡萄糖的一种氨基糖衍生物。在生物体内,是仅次于葡萄糖的一种具有重要生理生化功能的单糖。常以UDP-GlcNAc形式参与主要包括神经系统和免疫系统在内的诸多生物反应过程和生物体内多种糖缀合物的结构组成。在生物体内,生成UDP-GlcNAc的主要途径是氨基己糖途径(Hexosaminebiosynthesis pathway)。这一途径是糖酵解的分支,都以己糖代谢途径的中间产物果糖-6-磷酸(Fructose-6-P)为起始底物,在多种酶的协同催化作用下,最终合成UDP-GlcNAc。根据合成过程中催化反应的顺序及合成途径所涉及酶的来源不同,分为真核,原核以及拟菌病毒的UDP-GlcNAc合成途径。目前,氨基己糖途径已经被广泛的研究,但是植物中的氨基己糖途径中的第三个酶乙酰葡萄糖胺磷酸变位酶至今尚未研究和应用,所以植物中的氨基己糖途径仍未被打通。Acetylglucosamine (GlcNAc, N-acetyl-β-D-glucosamine) is an amino sugar derivative of glucose. In organisms, it is a monosaccharide with important physiological and biochemical functions after glucose. Often in the form of UDP-GlcNAc, it participates in many biological reaction processes including the nervous system and immune system and the structural composition of various glycoconjugates in organisms. In organisms, the main pathway for generating UDP-GlcNAc is the Hexosamine biosynthesis pathway. This pathway is a branch of glycolysis, which uses the intermediate product of hexose metabolism pathway, fructose-6-phosphate (Fructose-6-P) as the initial substrate, and finally synthesizes UDP under the synergistic catalysis of various enzymes. -GlcNAc. According to the sequence of catalytic reactions in the synthesis process and the sources of enzymes involved in the synthesis pathway, it can be divided into eukaryotic, prokaryotic and mimivirus UDP-GlcNAc synthesis pathways. At present, the hexosamine pathway has been extensively studied, but the third enzyme acetylglucosamine phosphate mutase in the plant hexosamine pathway has not been studied and applied so far, so the hexosamine pathway in plants has not yet been studied. get through.
另外UDP-GlcNAc作为生物体内重要的活性核苷糖,可被大量用于寡糖的生产,进而被开发成医药品或功能性材料。但由于其生产方法的限制,目前其供应量仍较小,价格也很高。目前已经有研究使用无细胞催化的多步酶催化反应体系,偶联了葡萄糖激酶,AGM与GlcNAc-1-P尿苷转移酶(GlmU)三个酶,以GlcNAc作为底物生产UDP-GlcNAc。无论是体内表达体系还是体外表达体系,来自酵母的AGM l的可溶表达水平均很低,影响了产物合成的效率和成本。所以提供一种高活性、高表达的AGM具有十分重要的意义。In addition, UDP-GlcNAc, as an important active nucleoside sugar in the body, can be used in large quantities for the production of oligosaccharides, and then be developed into pharmaceuticals or functional materials. However, due to the limitations of its production methods, its supply is still small and its price is high. At present, there have been studies using a cell-free catalyzed multi-step enzyme-catalyzed reaction system, coupled with three enzymes, glucokinase, AGM and GlcNAc-1-P uridine transferase (GlmU), to produce UDP-GlcNAc with GlcNAc as a substrate. Regardless of the expression system in vivo or in vitro, the soluble expression level of AGM 1 from yeast is very low, which affects the efficiency and cost of product synthesis. Therefore, it is of great significance to provide a highly active and highly expressed AGM.
磷酸己糖的价格十分的昂贵,目前主要是化学方法合成,生产步骤繁琐。Sigma以及Santa Cruz等公司的储量很少甚至断货。由于合成方法的限制,导致磷酸己糖不同的异构体之间价格差异巨大(几十甚至上百倍);如GlcN-6-P的价格29元/mg,而GlcN-1-P的价格则是460/mg。所以探索新的磷酸己糖的制备方法,特别是价格昂贵的磷酸己糖的制备方法势在必行。The price of hexose phosphate is very expensive. At present, it is mainly synthesized by chemical methods, and the production steps are cumbersome. Companies such as Sigma and Santa Cruz have little or no stock. Due to the limitation of the synthesis method, the price difference between the different isomers of hexose phosphate is huge (tens or even hundreds of times); for example, the price of GlcN-6-P is 29 yuan/mg, while the price of GlcN-1-P is 29 yuan/mg. It is 460/mg. Therefore, it is imperative to explore new preparation methods of hexose phosphates, especially expensive hexose phosphates.
针对上述乙酰葡萄糖胺磷酸变位酶的研究现状及其在应用中存在的问题,本发明公开了两个来源于拟南芥(Arabidopsis thaliana)的乙酰葡萄糖胺磷酸变位酶(Arabidopsis thaliana N-acetylpHospHoglucosamine mutase,AtAGM)的基因序列及其制备方法。这两种酶均具有催化不同己糖磷酸异构体的活性,与另外一个来与拟南芥中的乙酰葡萄糖胺磷酸变位酶AtAGM1相比,三者催化反应的效率相差不大,并且具有类似的酶学性质,均可以应用于无细胞酶法催化合成UDP-GlcNAc,或己糖磷酸异构体的生产中。In view of the research status of the above-mentioned acetylglucosamine phosphate mutase and the problems existing in its application, the present invention discloses two acetylglucosamine phosphate mutases (Arabidopsis thaliana N-acetylpHospHoglucosamine) derived from Arabidopsis thaliana (Arabidopsis thaliana) mutase, AtAGM) gene sequence and preparation method thereof. These two enzymes both have the activity of catalyzing different hexose phosphate isomers. Compared with the other one and the acetylglucosamine phosphate mutase AtAGM1 in Arabidopsis thaliana, the efficiency of the three catalytic reactions is not much different, and they have Similar enzymatic properties can be applied to the cell-free enzymatic synthesis of UDP-GlcNAc, or the production of hexose phosphate isomers.
发明内容Contents of the invention
本发明的第一个目的是提供两个植物来源的乙酰葡萄糖胺磷酸变位酶AtAGM2与AtAGM3及其编码基因。The first object of the present invention is to provide two plant-derived acetylglucosamine phosphate mutases AtAGM2 and AtAGM3 and their coding genes.
本发明的第二个目的是提供一种制备乙酰葡萄糖胺磷酸变位酶AtAGM2与AtAGM3的方法。The second object of the present invention is to provide a method for preparing acetylglucosamine phosphomutases AtAGM2 and AtAGM3.
本发明的第三个目的是提供含有所述的乙酰葡萄糖胺磷酸变位酶AtAGM2与AtAGM3的重组表达质粒和重组基因工程菌株。The third object of the present invention is to provide recombinant expression plasmids and recombinant genetic engineering strains containing the acetylglucosamine phosphate mutases AtAGM2 and AtAGM3.
本发明所提供的乙酰葡萄糖胺磷酸变位酶AtAGM2,来源于拟南芥(Arabidopsisthaliana),其氨基酸序列具有如下特征中的一种或二种:The acetylglucosamine phosphate mutase AtAGM2 provided by the present invention is derived from Arabidopsis thaliana (Arabidopsisthaliana), and its amino acid sequence has one or both of the following characteristics:
1)序列表中的SEQ ID NO.2从氨基端开始的第1-625氨基酸残基序列,为有活性的乙酰葡萄糖胺磷酸变位酶AtAGM的氨基酸序列,620-625为His-Tag的氨基酸序列。1) The 1-625th amino acid residue sequence of SEQ ID NO.2 in the sequence listing starting from the amino terminal is the amino acid sequence of the active acetylglucosamine phosphomutase AtAGM, and 620-625 is the amino acid residue of His-Tag sequence.
2)将序列表中的SEQ ID NO.2从氨基端开始的第1-625位氨基酸残基进行一个或两个以上氨基酸取代、缺失或添加而形成具有乙酰葡萄糖胺磷酸变位酶活性不变的氨基酸序列。2) Substituting, deleting or adding one or more amino acids to amino acid residues 1-625 of SEQ ID NO.2 starting from the amino terminal of SEQ ID NO.2 in the sequence listing to form an acetylglucosamine phosphomutase activity unchanged amino acid sequence.
本发明还提供了上述乙酰葡萄糖胺磷酸变位酶AtAGM2的编码基因,来源于拟南芥(Arabidopsis thaliana),其核苷酸序列具有如下特征中的一种或二种以上:The present invention also provides a gene encoding the above-mentioned acetylglucosamine phosphate mutase AtAGM2, which is derived from Arabidopsis thaliana, and its nucleotide sequence has one or more than two of the following characteristics:
1)序列表中SEQ ID NO.1的脱氧核糖核酸(DNA)序列;1) the deoxyribonucleic acid (DNA) sequence of SEQ ID NO.1 in the sequence listing;
2)编码序列表中SEQ ID NO.2氨基酸序列的脱氧核糖核酸(DNA)序列;2) the deoxyribonucleic acid (DNA) sequence encoding the amino acid sequence of SEQ ID NO.2 in the sequence listing;
3)对序列表中SEQ ID NO.1的脱氧核糖核酸(DNA)序列进行一个或两个以上核苷酸取代、缺失或添加而得到的编码具有乙酰葡萄糖胺磷酸变位酶活性的核苷酸序列。3) A nucleotide encoding acetylglucosamine phosphate mutase activity obtained by performing one or more than two nucleotide substitutions, deletions or additions to the deoxyribonucleic acid (DNA) sequence of SEQ ID NO.1 in the sequence listing sequence.
发明所提供的乙酰葡萄糖胺磷酸变位酶AtAGM3,来源于拟南芥(Arabidopsisthaliana),其氨基酸序列具有如下特征中的一种或二种:The acetylglucosamine phosphate mutase AtAGM3 provided by the invention is derived from Arabidopsis thaliana (Arabidopsisthaliana), and its amino acid sequence has one or both of the following characteristics:
1)序列表中的SEQ ID NO.4从氨基端开始的第1-623氨基酸残基序列,为有活性的乙酰葡萄糖胺磷酸变位酶AtAGM的氨基酸序列,518-623为His-Tag的氨基酸序列。1) The sequence of amino acid residues 1-623 from the amino terminal of SEQ ID NO.4 in the sequence listing is the amino acid sequence of the active acetylglucosamine phosphomutase AtAGM, and 518-623 is the amino acid of His-Tag sequence.
2)将序列表中的SEQ ID NO.4从氨基端开始的第1-623位氨基酸残基进行一个或两个以上氨基酸取代、缺失或添加而形成具有乙酰葡萄糖胺磷酸变位酶活性不变的氨基酸序列。2) Substituting, deleting or adding one or more amino acids to amino acid residues 1-623 of SEQ ID NO.4 starting from the amino terminus in the sequence listing to form an acetylglucosamine phosphomutase activity unchanged amino acid sequence.
本发明还提供了上述乙酰葡萄糖胺磷酸变位酶AtAGM3的编码基因,来源于拟南芥(Arabidopsis thaliana),其核苷酸序列具有如下特征中的一种或二种以上:The present invention also provides a gene encoding the above-mentioned acetylglucosamine phosphate mutase AtAGM3, which is derived from Arabidopsis thaliana, and its nucleotide sequence has one or more than two of the following characteristics:
1)序列表中SEQ ID NO.3的脱氧核糖核酸(DNA)序列;1) the deoxyribonucleic acid (DNA) sequence of SEQ ID NO.3 in the sequence listing;
2)编码序列表中SEQ ID NO.4氨基酸序列的脱氧核糖核酸(DNA)序列;2) the deoxyribonucleic acid (DNA) sequence encoding the amino acid sequence of SEQ ID NO.4 in the sequence listing;
3)对序列表中SEQ ID NO.3的脱氧核糖核酸(DNA)序列进行一个或两个以上核苷酸取代、缺失或添加而得到的编码具有乙酰葡萄糖胺磷酸变位酶活性的核苷酸序列。3) The nucleotide encoding the acetylglucosamine phosphate mutase activity obtained by performing one or more than two nucleotide substitutions, deletions or additions to the deoxyribonucleic acid (DNA) sequence of SEQ ID NO.3 in the sequence listing sequence.
本发明的乙酰葡萄糖胺磷酸变位酶AtAGM2与AtAGM3的氨基酸序列及其核苷酸编码序列也可以根据预测的AtAGM2与AtAGM3的氨基酸序列及其核苷酸编码序列人工合成获得。The amino acid sequences and nucleotide coding sequences of the acetylglucosamine phosphate mutases AtAGM2 and AtAGM3 of the present invention can also be artificially synthesized according to the predicted amino acid sequences and nucleotide coding sequences of AtAGM2 and AtAGM3.
制备重组酶AtAGM2与AtAGM3的方法,是将乙酰葡萄糖胺磷酸变位酶AtAGM2或AtAGM3的编码基因克隆入重组表达载体,导入宿主细胞,获得重组表达的乙酰葡萄糖胺磷酸变位酶The method for preparing recombinant enzymes AtAGM2 and AtAGM3 is to clone the coding gene of acetylglucosamine phosphate mutase AtAGM2 or AtAGM3 into a recombinant expression vector, introduce it into host cells, and obtain recombinantly expressed acetylglucosamine phosphate mutase
上述乙酰葡萄糖胺磷酸变位酶AtAGM2或AtAGM3的编码基因,其核苷酸序列具有如下特征中的一种或二种以上:The nucleotide sequence of the gene encoding the above-mentioned acetylglucosamine phosphate mutase AtAGM2 or AtAGM3 has one or more than two of the following characteristics:
1)具有序列表中SEQ ID NO.1或SEQ ID NO.3的脱氧核糖核酸(DNA)序列,2)编码SEQ ID NO.2或SEQ ID NO.4氨基酸序列的脱氧核糖核酸(DNA)序列,1) have the deoxyribonucleic acid (DNA) sequence of SEQ ID NO.1 or SEQ ID NO.3 in the sequence listing, 2) the deoxyribonucleic acid (DNA) sequence of encoding SEQ ID NO.2 or SEQ ID NO.4 aminoacid sequence ,
3)对序列表中SEQ ID NO.1或SEQ ID NO.3的脱氧核糖核酸(DNA)序列进行一个或两个以上核苷酸取代、缺失或添加而得到的编码具有乙酰葡萄糖胺磷酸变位酶活性的核苷酸序列。3) The code obtained by performing one or more than two nucleotide substitutions, deletions or additions to the deoxyribonucleic acid (DNA) sequence of SEQ ID NO.1 or SEQ ID NO.3 in the sequence listing has acetylglucosamine phosphate shift Nucleotide sequence for enzymatic activity.
所述的重组表达乙酰葡萄糖胺磷酸变位酶或AtAGM2或AtAGM3的表达载体可以是大肠杆菌表达载体、酵母表达载体、枯草杆菌表达载体、乳酸菌表达载体、链霉菌表达载体、噬菌体载体、丝状真菌表达载体、植物表达载体、昆虫表达载体、或哺乳动物细胞表达载体等。The expression vector for the recombinant expression of acetylglucosamine phosphate mutase or AtAGM2 or AtAGM3 can be Escherichia coli expression vector, yeast expression vector, Bacillus subtilis expression vector, lactic acid bacteria expression vector, streptomyces expression vector, phage vector, filamentous fungus Expression vectors, plant expression vectors, insect expression vectors, or mammalian cell expression vectors, etc.
用于重组表达乙酰葡萄糖胺磷酸变位酶AtAGM2或AtAGM3的重组菌或转基因细胞系,可以是大肠杆菌宿主细胞(如Escherichia coli BL21、Escherichia coli JM109、Escherichia coli DH5α等)、酵母菌宿主细胞(如Saccharomyces cerevisiae、Pichiapastoris、Kluyveromyces lactis等)、枯草杆菌宿主细胞(如Bacillus subtilis R25、Bacillus subtilis 9920等)、乳酸菌宿主细胞(如Lactic acid bacteria COCC101等)、放线菌宿主细胞(如Streptomyces spp.等)、丝状真菌宿主细胞(如Trichoderma viride,Trichoderma reesei,Aspergillus niger、Aspergillus nidulans等)、昆虫细胞(如Bombyx mori,Antharaea eucalypti等)或哺乳动物细胞(如中国仓鼠卵巢细胞CHO,幼小仓鼠肾脏细胞BHK、中国仓鼠肺细胞CHL等)。Recombinant bacteria or transgenic cell lines for recombinant expression of acetylglucosamine phosphate mutase AtAGM2 or AtAGM3 can be Escherichia coli host cells (such as Escherichia coli BL21, Escherichia coli JM109, Escherichia coli DH5α, etc.), yeast host cells (such as Saccharomyces cerevisiae, Pichiapastoris, Kluyveromyces lactis, etc.), Bacillus subtilis host cells (such as Bacillus subtilis R25, Bacillus subtilis 9920, etc.), lactic acid bacteria host cells (such as Lactic acid bacteria COCC101, etc.), actinomyces host cells (such as Streptomyces spp., etc.) , filamentous fungal host cells (such as Trichoderma viride, Trichoderma reesei, Aspergillus niger, Aspergillus nidulans, etc.), insect cells (such as Bombyx mori, Antharaea eucalypti, etc.) or mammalian cells (such as Chinese hamster ovary cells CHO, young hamster kidney cells BHK , Chinese hamster lung cells CHL, etc.).
上述的乙酰葡萄糖胺磷酸变位酶在磷酸己糖与核苷酸糖生产中可以应用,包括以下应用中的一或二种以上:The above-mentioned acetylglucosamine phosphate mutase can be applied in the production of hexose phosphate and nucleotide sugar, including one or more of the following applications:
1)在实现磷酸己糖1,6位异构体的转变,生产相应同分异构体中的应用;1) Application in realizing the transformation of the 1,6-position isomer of hexose phosphate and producing the corresponding isomer;
2)在UDP-GlcNAc(UDP-GlcN;UDP-Glc)等核苷酸糖的生产中的应用;2) Application in the production of nucleotide sugars such as UDP-GlcNAc (UDP-GlcN; UDP-Glc);
附图说明Description of drawings
图1:N-乙酰葡萄糖胺磷酸变位酶基因Atagm2与Atagm3琼脂糖凝胶电泳检测图。Figure 1: Agarose gel electrophoresis detection map of N-acetylglucosamine phosphate mutase gene Atagm2 and Atagm3.
图2:乙酰葡萄糖胺磷酸变位酶AtAGM2表达及纯化的SDS-PAGE图。各加入的样品分别是:1-E.coli BL21(DE3)/pET28a-AtAGM2诱导菌体;2-E.coli BL21(DE3)/pET28a-AtAGM2未诱导菌体;M-预染蛋白分子量标准;其余为不同浓度的咪唑洗脱液。箭头所指的条带为AtAGM2蛋白条带。Figure 2: SDS-PAGE diagram of the expression and purification of acetylglucosamine phosphate mutase AtAGM2. The added samples are: 1-E.coli BL21(DE3)/pET28a-AtAGM2 induced bacteria; 2-E.coli BL21(DE3)/pET28a-AtAGM2 uninduced bacteria; M-pre-stained protein molecular weight standard; The rest are eluents of different concentrations of imidazole. The band indicated by the arrow is the AtAGM2 protein band.
图3:乙酰葡萄糖胺磷酸变位酶AtAGM3表达及纯化的SDS-PAGE图。各加入的样品分别是:1-E.coli BL21(DE3)/pET28a-AtAGM3未诱导菌体;2,3-E.coli BL21(DE3)/pET28a-AtAGM3诱导后菌体;M-预染蛋白分子量标准;其余为不同浓度的咪唑洗脱液。箭头所指的条带为AtAGM3蛋白条带。Figure 3: SDS-PAGE diagram of the expression and purification of acetylglucosamine phosphomutase AtAGM3. The added samples are: 1-E.coli BL21(DE3)/pET28a-AtAGM3 uninduced cells; 2,3-E.coli BL21(DE3)/pET28a-AtAGM3 induced cells; M-prestained protein Molecular weight standards; the rest are different concentrations of imidazole eluents. The band indicated by the arrow is the AtAGM3 protein band.
图4:AtAGM2酶在不同反应体系pH下的相对活性图。Figure 4: The relative activity of AtAGM2 enzyme at different reaction system pH.
图5:AtAGM2酶在不同反应温度下的相对活性折线图。Figure 5: Line graph of relative activity of AtAGM2 enzyme at different reaction temperatures.
图6:AtAGM3酶在不同反应体系pH下的相对活性图。Figure 6: The relative activity of AtAGM3 enzyme at different reaction system pH.
图7:AtAGM3酶在不同反应温度下的相对活性折线图。Figure 7: Line graph of relative activity of AtAGM3 enzyme at different reaction temperatures.
图8:AtAGM2酶在不同金属离子催化下的相对活性图。Figure 8: Relative activity diagram of AtAGM2 enzyme under the catalysis of different metal ions.
图9:AtAGM3酶在不同金属离子催化下的相对活性图。Figure 9: Relative activity diagram of AtAGM3 enzyme under the catalysis of different metal ions.
具体实施方式Detailed ways
SEQ ID NO.1的信息Information on SEQ ID NO.1
(a)序列特征(a) Sequential features
长度:1878个核苷酸Length: 1878 nucleotides
类型:核苷酸Type: Nucleotide
链型:单链Chain type: single chain
(b)分子类型:DNA(b) Molecule type: DNA
序列描述:SEQ ID NO.1Sequence description: SEQ ID NO.1
ATGGAAGGAAAGGTTTTCCAAAACTTTAATGTAGTACAGAGCTGCTACCGACAAAATAAGCAGTTTAAGACACGATACCAAAGAGAACCTGACCTGTTCATGTCTACTTTACTTCCCTGTCCAAGAGAGAAGATGGCATTTAACCTCAACTCTTCCATGCGTGCCCACACTTTGTCTAAATACCAGTTTGTTCTTTCAAAGCAAAGAACTTTTTACTGCAATGCTACTTCGTCAAGTGCTACTGTGCCATCTCTTGACAAAAATGATTTTCTGAAGCTCCAAAACGGCAGTGATATTCGGGGTGTAGCGGTCACTGGGGTTGAGGGGGAACCTGTAAGCCTTCCTGAACCAGTGACTGAAGCCATAGCTGCTGCTTTTGGGCAATGGCTGTTACACAAGAAGAAGGCTGAATCCCGGCGTTTGAGAGTATCTGTTGGCCATGACTCTCGCATCTCTGCACAAACTTTGCTGGAGGCGGTTTCTCGAGGTCTTGGTGTTTCTGGATTAGATGTTGTTCAGTTTGGATTAGCATCAACACCAGCAATGTTTAATAGCACATTGACTGAAGATGAGTCATTCTTGTGCCCAGCTGATGGGGCTATTATGATAACAGCAAGCCATCTTCCTTACAACAGGAACGGTTTCAAGTTCTTTACCAGTGATGGAGGACTTGGGAAGGTTGATATCAAGAACATTTTGGAGCGAGCTGCAGATATTTACAAGAAGCTTTCTGATGAAAATTTGAGGAAATCACAAAGAGAAAGTTCTTCTATTACAAAGGTTGACTACATGTCAGTATACACCTCTGGTCTTGTAAAGGCAGTCCGGAAAGCAGCAGGAGATTTGGAGAAGCCTCTAGAGGGATTTCATATAGTTGTTGATGCTGGAAATGGAGCTGGAGGATTTTTTGCTGCCAAGGTGCTTGAGCCTTTAGGAGCAATTACTTCTGGCAGTCAATTTCTGGAACCAGATGGTATGTTCCCAAATCATATCCCTAATCCGGAAGATAAGGCGGCAATGGAAGCTATAACCAAGGCTGTTCTTGATAATAAGGCTGATTTGGGTATCATCTTTGATACTGATGTTGATAGGTCTGCTGCTGTGGATTCATCTGGCCGTGAATTCAACCGTAATCGTCTTATTGCCTTGCTATCAGCCATTGTTCTAGAGGAACACCCTGGCACAACTATAGTTACGGATAGTGTCACTTCGGACGGTCTGACCTCATTTATTGAGAAGAAGCTTGGCGGAAAGCATCACAGGTTCAAAAGAGGTTACAAGAATGTCATTGACGAAGCTATTCGCTTGAACTCGGTTGGGGAAGAATCACATCTGGCTATAGAAACCAGTGGTCATGGAGCTCTAAAGGAAAACCATTGGCTCGACGATGGGGCCTATCTCATGGTAAAAATCCTGAACAAACTAGCTGCGGCCCGAGCTGCTGGTCAAGGGAGTGGCAGCAAAGTTTTAACAGATCTTGTTGAAGGTCTGGAAGAGCCCAAAGTGGCTTTAGAACTGAGGCTTAAAATCGACAAGAATCACCCTGACCTTGAAGGAAGTGATTTCCGGGAGTATGGAGAGAAGGTCCTGCAACACGTGTCGAACTCAATAGAAACAAATCCAAATCTTATAATAGCTCCAGTTAACTACGAAGGGATCCGCGTTTCGGGCTTTGGTGGATGGTTTCTTCTCAGACTTTCTCTCCATGATCCTGTTCTTCCCCTTAACATCGAGGCACAGAGTGAGGATGATGCTGTGAAATTAGGCCTTGTGGTTGCTACGACAGTGAAGGAGTTCAATGCTTTGGACACCTGTGCCTTGTCCAACCTCACTCACTCCTCCGCGGCCGCACTCGAGCACCACCACCACCACCACTGAATGGAAGGAAAGGTTTTCCAAAACTTTAATGTAGTACAGAGCTGCTACCGACAAAATAAGCAGTTTAAGACACGATACCAAAGAGAACCTGACCTGTTCATGTCTACTTTACTTCCCTGTCCAAGAGAGAAGATGGCATTTAACCTCAACTCTTCCATGCGTGCCCACACTTTGTCTAAATACCAGTTTGTTCTTTCAAAGCAAAGAACTTTTTACTGCAATGCTACTTCGTCAAGTGCTACTGTGCCATCTCTTGACAAAAATGATTTTCTGAAGCTCCAAAACGGCAGTGATATTCGGGGTGTAGCGGTCACTGGGGTTGAGGGGGAACCTGTAAGCCTTCCTGAACCAGTGACTGAAGCCATAGCTGCTGCTTTTGGGCAATGGCTGTTACACAAGAAGAAGGCTGAATCCCGGCGTTTGAGAGTATCTGTTGGCCATGACTCTCGCATCTCTGCACAAACTTTGCTGGAGGCGGTTTCTCGAGGTCTTGGTGTTTCTGGATTAGATGTTGTTCAGTTTGGATTAGCATCAACACCAGCAATGTTTAATAGCACATTGACTGAAGATGAGTCATTCTTGTGCCCAGCTGATGGGGCTATTATGATAACAGCAAGCCATCTTCCTTACAACAGGAACGGTTTCAAGTTCTTTACCAGTGATGGAGGACTTGGGAAGGTTGATATCAAGAACATTTTGGAGCGAGCTGCAGATATTTACAAGAAGCTTTCTGATGAAAATTTGAGGAAATCACAAAGAGAAAGTTCTTCTATTACAAAGGTTGACTACATGTCAGTATACACCTCTGGTCTTGTAAAGGCAGTCCGGAAAGCAGCAGGAGATTTGGAGAAGCCTCTAGAGGGATTTCATATAGTTGTTGATGCTGGAAATGGAGCTGGAGGATTTTTTGCTGCCAAGGTGCTTGAGCCTTTAGGAGCAATTACTTCTGGCAGTCAATTTCTGGAACCAGATGGTATGTTCCCAAATCATATCCCTAATC CGGAAGATAAGGCGGCAATGGAAGCTATAACCAAGGCTGTTCTTGATAATAAGGCTGATTTGGGTATCATCTTTGATACTGATGTTGATAGGTCTGCTGCTGTGGATTCATCTGGCCGTGAATTCAACCGTAATCGTCTTATTGCCTTGCTATCAGCCATTGTTCTAGAGGAACACCCTGGCACAACTATAGTTACGGATAGTGTCACTTCGGACGGTCTGACCTCATTTATTGAGAAGAAGCTTGGCGGAAAGCATCACAGGTTCAAAAGAGGTTACAAGAATGTCATTGACGAAGCTATTCGCTTGAACTCGGTTGGGGAAGAATCACATCTGGCTATAGAAACCAGTGGTCATGGAGCTCTAAAGGAAAACCATTGGCTCGACGATGGGGCCTATCTCATGGTAAAAATCCTGAACAAACTAGCTGCGGCCCGAGCTGCTGGTCAAGGGAGTGGCAGCAAAGTTTTAACAGATCTTGTTGAAGGTCTGGAAGAGCCCAAAGTGGCTTTAGAACTGAGGCTTAAAATCGACAAGAATCACCCTGACCTTGAAGGAAGTGATTTCCGGGAGTATGGAGAGAAGGTCCTGCAACACGTGTCGAACTCAATAGAAACAAATCCAAATCTTATAATAGCTCCAGTTAACTACGAAGGGATCCGCGTTTCGGGCTTTGGTGGATGGTTTCTTCTCAGACTTTCTCTCCATGATCCTGTTCTTCCCCTTAACATCGAGGCACAGAGTGAGGATGATGCTGTGAAATTAGGCCTTGTGGTTGCTACGACAGTGAAGGAGTTCAATGCTTTGGACACCTGTGCCTTGTCCAACCTCACTCACTCCTCCGCGGCCGCACTCGAGCACCACCACCACCACCACTGA
SEQ ID NO.2的信息Information on SEQ ID NO.2
(a)序列特征(a) Sequential features
长度:625个氨基酸Length: 625 amino acids
类型:氨基酸Type: amino acid
链型:单链Chain type: single chain
(b)分子类型:蛋白(b) Molecule type: protein
序列描述:SEQ ID NO.2Sequence description: SEQ ID NO.2
MEGKVFQNFNVVQSCYRQNKQFKTRYQREPDLFMSTLLPCPREKMAFNLNSSMRAHTLSKYQFVLSKQRTFYCNATSSSATVPSLDKNDFLKLQNGSDIRGVAVTGVEGEPVSLPEPVTEAIAAAFGQWLLHKKKAESRRLRVSVGHDSRISAQTLLEAVSRGLGVSGLDVVQFGLASTPAMFNSTLTEDESFLCPADGAIMITASHLPYNRNGFKFFTSDGGLGKVDIKNILERAADIYKKLSDENLRKSQRESSSITKVDYMSVYTSGLVKAVRKAAGDLEKPLEGFHIVVDAGNGAGGFFAAKVLEPLGAITSGSQFLEPDGMFPNHIPNPEDKAAMEAITKAVLDNKADLGIIFDTDVDRSAAVDSSGREFNRNRLIALLSAIVLEEHPGTTIVTDSVTSDGLTSFIEKKLGGKHHRFKRGYKNVIDEAIRLNSVGEESHLAIETSGHGALKENHWLDDGAYLMVKILNKLAAARAAGQGSGSKVLTDLVEGLEEPKVALELRLKIDKNHPDLEGSDFREYGEKVLQHVSNSIETNPNLIIAPVNYEGIRVSGFGGWFLLRLSLHDPVLPLNIEAQSEDDAVKLGLVVATTVKEFNALDTCALSNLTHSSAAALEHHHHHH-MEGKVFQNFNVVQSCYRQNKQFKTRYQREPDLFMSTLLPCPREKMAFNLNSSMRAHTLSKYQFVLSKQRTFYCNATSSSATVPSLDKNDFLKLQNGSDIRGVAVTGVEGEPVSLPEPVTEAIAAAFGQWLLHKKKAESRRLRVSVGHDSRISAQTLLEAVSRGLGVSGLDVVQFGLASTPAMFNSTLTEDESFLCPADGAIMITASHLPYNRNGFKFFTSDGGLGKVDIKNILERAADIYKKLSDENLRKSQRESSSITKVDYMSVYTSGLVKAVRKAAGDLEKPLEGFHIVVDAGNGAGGFFAAKVLEPLGAITSGSQFLEPDGMFPNHIPNPEDKAAMEAITKAVLDNKADLGIIFDTDVDRSAAVDSSGREFNRNRLIALLSAIVLEEHPGTTIVTDSVTSDGLTSFIEKKLGGKHHRFKRGYKNVIDEAIRLNSVGEESHLAIETSGHGALKENHWLDDGAYLMVKILNKLAAARAAGQGSGSKVLTDLVEGLEEPKVALELRLKIDKNHPDLEGSDFREYGEKVLQHVSNSIETNPNLIIAPVNYEGIRVSGFGGWFLLRLSLHDPVLPLNIEAQSEDDAVKLGLVVATTVKEFNALDTCALSNLTHSSAAALEHHHHHH-
SEQ ID NO.3的信息Information on SEQ ID NO.3
(a)序列特征(a) Sequential features
长度:1872个核苷酸Length: 1872 nucleotides
类型:核苷酸Type: Nucleotide
链型:单链Chain type: single chain
(b)分子类型:DNA(b) Molecule type: DNA
序列描述:SEQ ID NO.3Sequence description: SEQ ID NO.3
ATGGCGTCGACTTCAACATCATCTTTAATGGCTTCTAAAACTGTAATCTCCAAAACAGCTCTGTTTTCTTCCTTACCGGGAATAGTCAGCCGGAGTTTTTTAACATTCGCACCGGCTTCTCCTTCCGTTAAACCCCTTAGGATAAGATCTTCAAATGTTACTAAGTTCGACGAAGTAACCAACAGTCTTGACGAAGACATGGACCAGATTCGACGGTTACAAAACGGTTCTGACGTGAGAGGAGTCGCATTGGAAGGAGAGAAAGGTCGAACAGTTGACCTAACGCCTGCAGCTGTTGAAGCAATCGCAGAGAGCTTTGGAGAATGGGTTGCAGCAACGGAGAGTAACGGAAACGGCGTCATTAAGATTTCTCTCGGACGGGATCCACGTGTTTCCGGTGGGAAGCTAAGCACGGCAGTGTTTGCCGGCTTAGCTCGTGCAGGCTGTTTAGCTTTTGACATGGGTTTAGCTACAGCGCCAGCTTGCTTCATGAGCACGTTACTCTCTCCATTCGAATACGACGCTTCAATTATGATGACAGCTTCTCATTTACCGTATACAAGAAACGGACTCAAGTTCTTTACCAAGAGAGGAGGATTAACGTCTCCTGAAGTGGAGAAGATATGCGATTTAGCTGCGCGAAAGTACGCTACTAGGCAGACTAAAGTCTCTACATTGATCAGAACGCGACCGCAGCAAGTTGATTTTATGAGCGCTTACTCTAAGCACCTTAGAGAAATCATTAAAGAGAGAATCAATCACCCTGAACACTATGACACTCCTCTCAAAGGATTTCAGATAGTTGTGAATGCGGGTAATGGGTCAGGAGGCTTCTTTACGTGGGACGTTCTAGACAAGTTAGGAGCCGATACATTCGGTTCGCTCTATCTAAACCCTGACGGGATGTTCCCTAATCACATTCCTAATCCGGAAAACAAAATCGCAATGCAACACACCCGAGCCGCGGTTCTTGAGAACTCAGCAGATCTCGGGGTTGTGTTTGATACGGATGTTGACAGGAGTGGAGTGGTGGATAACAAAGGAAATCCTATCAACGGAGATAAGCTTATTGCGCTTATGTCAGCTATAGTGCTTAAAGAACATCCAGGAAGTACAGTAGTGACTGACGCAAGAACGAGTATGGGGCTAACTAGGTTTATAACGGAGCGAGGAGGGAGGCATTGTTTGTATAGAGTAGGGTATAGAAACGTGATTGACAAGGGAGTAGAGCTGAACAAAGACGGCATCGAGACTCATCTCATGATGGAAACTTCAGGACATGGTGCGGTTAAGGAGAATCACTTCTTGGATGATGGTGCATACATGGTGGTGAAGATCATAATTGAAATGGTGAGAATGAGACTCGCGGGATCAAATGAAGGTATCGGTAGTTTGATCGAAGATCTTGAGGAGCCGTTAGAAGCGGTTGAGCTTCGGTTGAATATTTTATCAGAGCCAAGAGATGCCAAAGCAAAAGGCATTGAAGCCATTGAGACTTTCAGGCAATACATTGAGGAAGGAAAACTGAAAGGGTGGGAATTGGGCACGTGTGGGGATTGTTGGGTTACTGAAGGTTGCTTGGTGGACTCAAATGATCATCCATCTGCTATTGATGCTCACATGTACAGGGCAAGAGTGAGTGATGAAGAGAGTGGGGAAGAGTATGGTTGGGTGCATATGAGGCAGAGTATTCATAACCCTAACATCGCACTTAATATGCAATCAATGCTTCCTGGTGGATGTCTCTCCATGACAAGAATCTTCAGAGACCAGTTTCTTGAAGCTAGTGGGGTGGCTAGATTCCTGGATATAAGTGACTTCGACAATTACATCGGAGGTCAATCTCTCGAGCACCACCACCACCACCACTGAATGGCGTCGACTTCAACATCATCTTTAATGGCTTCTAAAACTGTAATCTCCAAAACAGCTCTGTTTTCTTCCTTACCGGGAATAGTCAGCCGGAGTTTTTTAACATTCGCACCGGCTTCTCCTTCCGTTAAACCCCTTAGGATAAGATCTTCAAATGTTACTAAGTTCGACGAAGTAACCAACAGTCTTGACGAAGACATGGACCAGATTCGACGGTTACAAAACGGTTCTGACGTGAGAGGAGTCGCATTGGAAGGAGAGAAAGGTCGAACAGTTGACCTAACGCCTGCAGCTGTTGAAGCAATCGCAGAGAGCTTTGGAGAATGGGTTGCAGCAACGGAGAGTAACGGAAACGGCGTCATTAAGATTTCTCTCGGACGGGATCCACGTGTTTCCGGTGGGAAGCTAAGCACGGCAGTGTTTGCCGGCTTAGCTCGTGCAGGCTGTTTAGCTTTTGACATGGGTTTAGCTACAGCGCCAGCTTGCTTCATGAGCACGTTACTCTCTCCATTCGAATACGACGCTTCAATTATGATGACAGCTTCTCATTTACCGTATACAAGAAACGGACTCAAGTTCTTTACCAAGAGAGGAGGATTAACGTCTCCTGAAGTGGAGAAGATATGCGATTTAGCTGCGCGAAAGTACGCTACTAGGCAGACTAAAGTCTCTACATTGATCAGAACGCGACCGCAGCAAGTTGATTTTATGAGCGCTTACTCTAAGCACCTTAGAGAAATCATTAAAGAGAGAATCAATCACCCTGAACACTATGACACTCCTCTCAAAGGATTTCAGATAGTTGTGAATGCGGGTAATGGGTCAGGAGGCTTCTTTACGTGGGACGTTCTAGACAAGTTAGGAGCCGATACATTCGGTTCGCTCTATCTAAACCCTGACGGGATGTTCCCTAATCACATTCCTAATCCGGAAAACAAAATCGCAATGCAACACACCCGAGCCGCGGTTCTTGAGAACTCAGCAGATCTCGGGGTTGTGT TTGATACGGATGTTGACAGGAGTGGAGTGGTGGATAACAAAGGAAATCCTATCAACGGAGATAAGCTTATTGCGCTTATGTCAGCTATAGTGCTTAAAGAACATCCAGGAAGTACAGTAGTGACTGACGCAAGAACGAGTATGGGGCTAACTAGGTTTATAACGGAGCGAGGAGGGAGGCATTGTTTGTATAGAGTAGGGTATAGAAACGTGATTGACAAGGGAGTAGAGCTGAACAAAGACGGCATCGAGACTCATCTCATGATGGAAACTTCAGGACATGGTGCGGTTAAGGAGAATCACTTCTTGGATGATGGTGCATACATGGTGGTGAAGATCATAATTGAAATGGTGAGAATGAGACTCGCGGGATCAAATGAAGGTATCGGTAGTTTGATCGAAGATCTTGAGGAGCCGTTAGAAGCGGTTGAGCTTCGGTTGAATATTTTATCAGAGCCAAGAGATGCCAAAGCAAAAGGCATTGAAGCCATTGAGACTTTCAGGCAATACATTGAGGAAGGAAAACTGAAAGGGTGGGAATTGGGCACGTGTGGGGATTGTTGGGTTACTGAAGGTTGCTTGGTGGACTCAAATGATCATCCATCTGCTATTGATGCTCACATGTACAGGGCAAGAGTGAGTGATGAAGAGAGTGGGGAAGAGTATGGTTGGGTGCATATGAGGCAGAGTATTCATAACCCTAACATCGCACTTAATATGCAATCAATGCTTCCTGGTGGATGTCTCTCCATGACAAGAATCTTCAGAGACCAGTTTCTTGAAGCTAGTGGGGTGGCTAGATTCCTGGATATAAGTGACTTCGACAATTACATCGGAGGTCAATCTCTCGAGCACCACCACCACCACCACTGA
SEQ ID NO.4的信息Information on SEQ ID NO.4
(a)序列特征(a) Sequential features
长度:623个氨基酸Length: 623 amino acids
类型:氨基酸Type: amino acid
链型:单链Chain type: single chain
(b)分子类型:蛋白(b) Molecule type: protein
序列描述:SEQ ID NO.4Sequence description: SEQ ID NO.4
MASTSTSSLMASKTVISKTALFSSLPGIVSRSFLTFAPASPSVKPLRIRSSNVTKFDEVTNSLDEDMDQIRRLQNGSDVRGVALEGEKGRTVDLTPAAVEAIAESFGEWVAATESNGNGVIKISLGRDPRVSGGKLSTAVFAGLARAGCLAFDMGLATAPMASTSTSSLMASKTVISKTALFSSLPGIVSRSFLTFAPASSPVKPLRIRSSNVTKFDEVTNSLDEDMDQIRRLQNGSDVRGVALEGEKGRTVDLTPAAVEAIAESFGEWVAATESNGNGVIKISLGRDPRVSGGKLSTAVFAGLARAGCLAFDMGLATAP
ACFMSTLLSPFEYDASIMMTASHLPYTRNGLKFFTKRGGLTSPEVEKICDLAARKYATRQTKVSTLIRTRPQQVDFMSAYSKHLREIIKERINHPEHYDTPLKGFQIVVNAGNGSGGFFTWDVLDKLGADTFGSLYLNPDGMFPNHIPNPENKIAMQHTRAAVLENSADLGVVFDTDVDRSGVVDNKGNPINGDKLIALMSAIVLKEHPGSTVVTDARTSMGLTRFITERGGRHCLYRVGYRNVIDKGVELNKDGIETHLMMETSGHGAVKENHFLDDGAYMVVKIIIEMVRMRLAGSNEGIGSLIEDLEEPLEAVELRLNILSEPRDAKAKGIEAIETFRQYIEEGKLKGWELGTCGDCWVTEGCLVDSNDHPSAIDAHMYRARVSDEESGEEYGWVHMRQSIHNPNIALNMQSMLPGGCLSMTRIFRDQFLEASGVARFLDISDFDNYIGGQSLEHHHHHH-ACFMSTLLSPFEYDASIMMTASHLPYTRNGLKFFTKRGGLTSPEVEKICDLAARKYATRQTKVSTLIRTRPQQVDFMSAYSKHLREIIKERINHPEHYDTPLKGFQIVVNAGNGSGGFFTWDVLDKLGADTFGSLYLNPDGMFPNHIPNPENKIAMQHTRAAVLENSADLGVVFDTDVDRSGVVDNKGNPINGDKLIALMSAIVLKEHPGSTVVTDARTSMGLTRFITERGGRHCLYRVGYRNVIDKGVELNKDGIETHLMMETSGHGAVKENHFLDDGAYMVVKIIIEMVRMRLAGSNEGIGSLIEDLEEPLEAVELRLNILSEPRDAKAKGIEAIETFRQYIEEGKLKGWELGTCGDCWVTEGCLVDSNDHPSAIDAHMYRARVSDEESGEEYGWVHMRQSIHNPNIALNMQSMLPGGCLSMTRIFRDQFLEASGVARFLDISDFDNYIGGQSLEHHHHHH-
实施例1 乙酰葡萄糖胺磷酸变位酶AtAGM2与AtAGM3全长基因的克隆Example 1 Cloning of full-length genes of acetylglucosamine phosphate mutase AtAGM2 and AtAGM3
对The National Center for Biotechnology Information(NCBI)数据库中拟南芥的己糖磷酸变位酶基因进行分析后,选定了己糖变位酶家族中两个尚未明确功能的基因。经过序列分析后,设计引物如下:After analyzing the hexose phosphate mutase genes of Arabidopsis in The National Center for Biotechnology Information (NCBI) database, two genes in the hexose mutase family whose functions have not been clarified were selected. After sequence analysis, the primers were designed as follows:
Agm2-F:5’-GCGTCCATGGAAGGAAAGGTTTTCCAAAAC-3’;Agm2-R:5’-ATATATGCGGCCGCGGAGGAGTGAGTG-3’来扩增Atagm2的基因序列。Agm3-F:5’-AACTCCATGGCGTCGACTTCAACATCATC-3’;Agm3-R:5’-ACTGCTCGAGAGATTGACCTCCGATGTAA-3’来扩增Atagm3的基因序列。Agm2-F: 5'-GCGTCCATGGAAGGAAAGGTTTTCCAAAAAC-3'; Agm2-R: 5'-ATATATGCGGCCGCGGAGGAGTGAGTG-3' to amplify the gene sequence of Atagm2. Agm3-F: 5'-AACTCCATGGCGTCGACTTCAACATCATC-3'; Agm3-R: 5'-ACTGCTCGAGAGATTGACCTCCGATGTAA-3' to amplify the gene sequence of Atagm3.
参照RNA提取试剂盒(博迈德生物,货号RN0112)操作步骤提取拟南芥叶片的mRNA。以提取的拟南芥的RNA反转的cDNA为模板进行PCR扩增。PCR反应条件为:94℃2min,1个循环;94℃30s,55℃30s,72℃2min,30个循环;72℃5min,1个循环。PCR产物进行琼脂糖凝胶电泳分析后,对目的片段进行切胶回收(如图1所示),经双酶切后连接到原核表达载体pET28a上后测序。The mRNA of Arabidopsis leaves was extracted according to the operation steps of the RNA extraction kit (Bomad Biotechnology, Cat. No. RN0112). PCR amplification was carried out using the reversed cDNA of the extracted Arabidopsis RNA as a template. The PCR reaction conditions were: 94°C for 2 min, 1 cycle; 94°C for 30 s, 55°C for 30 s, 72°C for 2 min, 30 cycles; 72°C for 5 min, 1 cycle. After the PCR product was analyzed by agarose gel electrophoresis, the target fragment was recovered by gel cutting (as shown in Figure 1), and after double enzyme digestion, it was connected to the prokaryotic expression vector pET28a and then sequenced.
实施例2 乙酰葡萄糖胺磷酸变位酶AtAGM2与AtAGM3基因序列分析Example 2 Gene sequence analysis of acetylglucosamine phosphate mutase AtAGM2 and AtAGM3
测序的结果采用GenBank数据库中的Basic Local Alignment Search Tool(BLAST)分析,Vector NTI Suite 8.0软件进行多序列比对,分析序列信息。The results of the sequencing were analyzed using the Basic Local Alignment Search Tool (BLAST) in the GenBank database, and the Vector NTI Suite 8.0 software was used for multiple sequence alignment and sequence information analysis.
获得的乙酰葡萄糖胺磷酸变位酶基因2(命名为AtAGM2)编码区长1878bp,其核苷酸序列如SEQ ID NO 1所示。AtAGM2编码625个氨基酸和一个终止密码子,其氨基酸序列如SEQ ID NO 3所示,蛋白质理论分子量为67.0kDa,预测等电点为6.1。AtAGM2的核苷酸序列在拟南芥的基因组中位于拟南芥的五号染色体上(locus-tag=″AT5G17530″)。The coding region of the obtained acetylglucosamine phosphate mutase gene 2 (named AtAGM2) is 1878 bp long, and its nucleotide sequence is shown in SEQ ID NO 1. AtAGM2 encodes 625 amino acids and a stop codon, its amino acid sequence is shown in SEQ ID NO 3, the theoretical protein molecular weight is 67.0kDa, and the predicted isoelectric point is 6.1. The nucleotide sequence of AtAGM2 is located on chromosome 5 of Arabidopsis thaliana (locus-tag="AT5G17530") in the Arabidopsis genome.
获得的乙酰葡萄糖胺磷酸变位酶基因3(命名为AtAGM3)编码区长1872bp,其核苷酸序列如SEQ ID NO 2所示。AtAGM编码623个氨基酸和一个终止密码子,其氨基酸序列如SEQ ID NO 4所示,蛋白质理论分子量为67.3kDa,预测等电点为5.62。AtAGM3的核苷酸序列在拟南芥的基因组中位于拟南芥的一号染色体上(locus-tag=″AT1G70820″)。The coding region of the obtained acetylglucosamine phosphate mutase gene 3 (named AtAGM3) is 1872 bp long, and its nucleotide sequence is shown in SEQ ID NO 2. AtAGM encodes 623 amino acids and a stop codon, its amino acid sequence is shown in SEQ ID NO 4, the theoretical protein molecular weight is 67.3kDa, and the predicted isoelectric point is 5.62. The nucleotide sequence of AtAGM3 is located on chromosome 1 of Arabidopsis thaliana (locus-tag="AT1G70820") in the Arabidopsis genome.
AtAGM2与AtAGM3均是假定的己糖磷酸变位酶基因,属于己糖磷酸变位酶(α-D-phosphohexomutases)家族的成员。Both AtAGM2 and AtAGM3 are putative hexose phosphate mutase genes, which belong to the family of α-D-phosphohexomutases.
拟南芥还存在另外一个乙酰葡萄糖胺磷酸变位酶AtAGM1,该酶的编码基因(AT5G18070)的编码区长1710bp,编码556个氨基酸,大肠杆菌重组表达获得的AtAGM1,蛋白分子量在61.5KDa,预测等电点为5.35.AtAGM1的核苷酸序列位于拟南芥的五号染色体上(locus-tag=″AT5G18070″),其活性已经被鉴定。There is another acetylglucosamine phosphate mutase AtAGM1 in Arabidopsis thaliana. The coding region of the coding gene (AT5G18070) of this enzyme is 1710bp long and encodes 556 amino acids. The isoelectric point is 5.35. The nucleotide sequence of AtAGM1 is located on chromosome 5 of Arabidopsis (locus-tag="AT5G18070"), and its activity has been identified.
AtAGM2与AtAGM3的基因相似性高达36.39%,而它们与AtAGM1的序列相似性较低,仅为15.45%和12.28%,但是一些序列是高度保守的。并且三者的蛋白分子量及等电点相差不大。The gene similarity between AtAGM2 and AtAGM3 is as high as 36.39%, while their sequence similarity with AtAGM1 is lower, only 15.45% and 12.28%, but some sequences are highly conserved. And the protein molecular weight and isoelectric point of the three are not much different.
实施例3 AtAGM2与AtAGM3基因在大肠杆菌中的重组表达及纯化Example 3 Recombinant expression and purification of AtAGM2 and AtAGM3 genes in Escherichia coli
测序结果表明,在pET28a上插入SEQ ID NO 1所示的AtAGM2基因,且插入方向正确,证明了构建的重组质粒正确,将该重组质粒命名为pET28a-AtAGM2。Sequencing results showed that the AtAGM2 gene shown in SEQ ID NO 1 was inserted into pET28a, and the insertion direction was correct, which proved that the constructed recombinant plasmid was correct, and the recombinant plasmid was named pET28a-AtAGM2.
同时,AtAGM3的重组质粒也构建成功,pET28a上插入了SEQ ID NO 3所示的AtAGM3基因,且插入方向正确,将该重组质粒命名为pET28a-AtAGM3。At the same time, the recombinant plasmid of AtAGM3 was also constructed successfully. The AtAGM3 gene shown in SEQ ID NO 3 was inserted into pET28a, and the insertion direction was correct. The recombinant plasmid was named pET28a-AtAGM3.
将pET28a-AtAGM2(或者pET28a-AtAGM3)转化大肠杆菌菌株BL21(DE3)进行诱导表达。以1%的接种量将过夜培养的种子液加入新鲜的LB培养基中于37℃进行扩大培养,当菌液的OD600nm=0.6-0.8时加入IPTG,使其终浓度为0.5mM,16℃进行过夜诱导。菌体破碎后高速离心取上清进行镍柱纯化,梯度咪唑洗脱(20-500mM咪唑,20mM Tris-HCl,PH7.6)。用聚丙烯酰胺凝胶电泳检测乙酰葡萄糖胺磷酸变位酶AtAGM2(或AtAGM3)的表达及纯化情况,电泳使用12%的分离胶,80V电压压平后,换120V电压跑完分离胶。结果如图2(或3)所示,纯化后的乙酰葡萄糖胺磷酸变位酶AtAGM2(或AtAGM3)在电泳胶上的位置与预测的分子量相吻合。Transform pET28a-AtAGM2 (or pET28a-AtAGM3) into Escherichia coli strain BL21(DE3) for induced expression. Add the overnight cultured seed solution to fresh LB medium with 1% inoculum size for expansion at 37°C, add IPTG when the OD600nm of the bacterial solution is 0.6-0.8, make the final concentration 0.5mM, and carry out at 16°C Induce overnight. After the cells were broken, the supernatant was collected by high-speed centrifugation for nickel column purification, and gradient imidazole elution (20-500mM imidazole, 20mM Tris-HCl, pH7.6). Polyacrylamide gel electrophoresis was used to detect the expression and purification of acetylglucosamine phosphate mutase AtAGM2 (or AtAGM3). The electrophoresis used 12% separation gel, and after the 80V voltage was flattened, the separation gel was run with 120V voltage. The results are shown in Figure 2 (or 3), the position of the purified acetylglucosamine phosphate mutase AtAGM2 (or AtAGM3) on the electrophoresis gel coincides with the predicted molecular weight.
实施例4 乙酰葡萄糖胺磷酸变位酶AtAGM2与AtAGM3的酶学性质Example 4 Enzymatic properties of acetylglucosamine phosphate mutases AtAGM2 and AtAGM3
(1)乙酰葡萄糖胺磷酸变位酶AtAGM的活力测定(1) Activity determination of acetylglucosamine phosphate mutase AtAGM
测定AtAGM酶活的通用体系如下:不同的磷酸己糖底物(GlcNAc-1-P,GlcNAc-6-P,GlcN-1-P,GlcN-6-P,Glc-1-P,Glc-6-P)作为底物,每组底物加入反应体系(300μL):20mMPBS,pH 7.6,5mM MgSO4,10μM Glc-1,6-2P,加入适量的重组酶AtAGM,反应10min。将反应体系立即煮沸后,除蛋白,每组底物加入200μL 200mM NaOH。用HAPEC-PAD方法检测体系中底物的消耗与产物的生成情况。由于AtAGM催化的反应是可逆反应,并且还有中间产物己糖-1,6-2P的产生,所以酶活(nmol/min/mg)用1mg蛋白每分钟消耗的底物量(nmol)来表示。蛋白浓度使用碧云天BCA蛋白浓度测定试剂盒进行测定。The general system for the determination of AtAGM enzyme activity is as follows: different hexose phosphate substrates (GlcNAc-1-P, GlcNAc-6-P, GlcN-1-P, GlcN-6-P, Glc-1-P, Glc-6 -P) As substrates, each group of substrates was added to the reaction system (300 μL): 20 mMPBS, pH 7.6, 5 mM MgSO 4 , 10 μM Glc-1,6-2P, an appropriate amount of recombinant enzyme AtAGM was added, and reacted for 10 min. After the reaction system was boiled immediately, protein was removed, and 200 μL of 200 mM NaOH was added to each group of substrates. The consumption of substrate and the formation of product in the system were detected by HAPEC-PAD method. Since the reaction catalyzed by AtAGM is a reversible reaction, and there is also the production of the intermediate product hexose-1,6-2P, the enzyme activity (nmol/min/mg) is expressed by the amount of substrate consumed by 1 mg protein per minute (nmol) . Protein concentration was determined using Beyontian BCA Protein Concentration Assay Kit.
HAPEC-PAD法酶活检测所用的离子交换色谱系统包括戴安Bio-LC梯度混匀泵,GM-3(4mm)梯度混匀器,离子交换色谱柱CarboPac PA-100column(4×250mm),AgCl参比电极与电化学检测器。用于检测的脉冲电位变化为:t=0s,E=0.10v;t=0.20s,E=0.10v;t=0.40s,E=0.10v;t=0.41s,E=-2.00v;t=0.42s,E=-2.00v;t=0.43s,E=0.60v;t=0.44s,E=-0.10v;t=0.50s,E=-0.10v.所用的流动相:A,100mM NaOH水溶液;B,800mM乙酸钠与100mM NaOH水溶液。流动相洗脱条件是0-5min,90%A+10%B;6-15min,10%-90%B;16-18min,10%A+90%B;19-20min,90%A+10%B.总流速是0.5ml/min,检测柱温是30℃,进样体积是20μLThe ion exchange chromatographic system used in HAPEC-PAD enzyme activity detection includes Diane Bio-LC gradient mixing pump, GM-3 (4mm) gradient mixer, ion exchange column CarboPac PA-100column (4×250mm), AgCl Reference electrode and electrochemical detector. The pulse potential change used for detection is: t=0s, E=0.10v; t=0.20s, E=0.10v; t=0.40s, E=0.10v; t=0.41s, E=-2.00v; t =0.42s, E=-2.00v; t=0.43s, E=0.60v; t=0.44s, E=-0.10v; t=0.50s, E=-0.10v. Mobile phase used: A, 100mM NaOH in water; B, 800 mM sodium acetate with 100 mM NaOH in water. The mobile phase elution conditions are 0-5min, 90%A+10%B; 6-15min, 10%-90%B; 16-18min, 10%A+90%B; 19-20min, 90%A+10 %B. The total flow rate is 0.5ml/min, the detection column temperature is 30°C, and the injection volume is 20μL
(2)乙酰葡萄糖胺磷酸变位酶AtAGM底物特异性(2) Substrate specificity of acetylglucosamine phosphomutase AtAGM
由于AtAGM底物选择性较广,可以催化GlcNAc-6-P与GlcNAc-1-P;GlcN-6-P与GlcN-1-P;Glc-6-P与Glc-1-P几对异构体之间的转换。并且AtAGM催化的是可逆反应,所以采用以上六种底物来检测AtAGM的底物特异性。采用实施例4(1)中提到的通用体系,以0.1mM不同的磷酸己糖底物(GlcNAc-1-P,GlcNAc-6-P,GlcN-1-P,GlcN-6-P,Glc-1-P,Glc-6-P)作为底物,每组底物反应加入0.5μg AtAGM2(或AtAGM3或AtAGM1),于30℃水浴中反应10min。蛋白浓度使用碧云天BCA蛋白浓度测定试剂盒进行测定。Due to the wide substrate selectivity of AtAGM, it can catalyze several pairs of isomerization of GlcNAc-6-P and GlcNAc-1-P; GlcN-6-P and GlcN-1-P; Glc-6-P and Glc-1-P conversion between entities. And AtAGM catalyzed a reversible reaction, so the above six substrates were used to test the substrate specificity of AtAGM. Using the general system mentioned in Example 4(1), different hexose phosphate substrates (GlcNAc-1-P, GlcNAc-6-P, GlcN-1-P, GlcN-6-P, Glc -1-P, Glc-6-P) as substrates, add 0.5 μg AtAGM2 (or AtAGM3 or AtAGM1) to each group of substrates, and react in a water bath at 30°C for 10 minutes. Protein concentration was determined using Beyontian BCA Protein Concentration Assay Kit.
结果如表1所示,在如上的反应条件下,相同的底物浓度,AtAGM对不同底物的活力不同。AtAGM2与AtAGM3与之前鉴定出来的AtAGM1相比,虽然三种AGM酶对六种底物表现出不同的催化速率,但是仍然存在一定的共性。一般规律是正反应的活力大于逆反应(己糖-6-P的活性高于相应的己糖-1-P)。但是总的来说AtAGM2与AtAGM3的活性要低于AtAGM1,尤其是在其主要的作用底物GlcNAc-1-P与GlcNAc-6-P上面。并且AtAGM1与AtAGM3对Glc-1-P的催化速率要高于Glc-6-P,这与AtAGM2并不相同。The results are shown in Table 1. Under the above reaction conditions and the same substrate concentration, AtAGM has different activities to different substrates. Compared with the previously identified AtAGM1, AtAGM2 and AtAGM3, although the three AGM enzymes showed different catalytic rates for the six substrates, there were still some commonalities. The general rule is that the activity of the forward reaction is greater than that of the reverse reaction (hexose-6-P is more active than the corresponding hexose-1-P). But in general, the activity of AtAGM2 and AtAGM3 is lower than that of AtAGM1, especially on its main substrates GlcNAc-1-P and GlcNAc-6-P. And the catalytic rate of AtAGM1 and AtAGM3 to Glc-1-P is higher than that of Glc-6-P, which is different from AtAGM2.
表1:乙酰葡萄糖胺磷酸变位酶AtAGM2与AtAGM3的底物特异性。Table 1: Substrate specificity of acetylglucosamine phosphomutases AtAGM2 and AtAGM3.
(3)pH对重组酶AtAGM的影响(3) Effect of pH on the recombinant enzyme AtAGM
采用实施例4(1)中提到的通用体系,在30℃的条件下,分别以30μM的GlcNAc-6-P为底物,在pH 3.6-10.6(pH 3.6-5.6HAc-NaAc,Ph 6.6-7.6Na2HPO4-NaH2PO4,pH 8.6Tris-HCl,pH 9.6-10.6Gly-NaOH)反应体系中,5μg酶按照标准检测方法测定其活性。根据酶在不同pH下的相对活性绘制柱状图,以及折线图确定酶的最适反应pH。以灭活的酶作为对照,以活性最高的值为100%,测定在各个反应pH下酶的相对活性。Using the general system mentioned in Example 4(1), under the condition of 30°C, 30 μM GlcNAc-6-P was used as the substrate respectively, at pH 3.6-10.6 (pH 3.6-5.6 HAc-NaAc, Ph 6.6 -7.6Na 2 HPO 4 -NaH 2 PO 4 , pH 8.6 Tris-HCl, pH 9.6-10.6 Gly-NaOH) reaction system, the activity of 5 μg enzyme was determined according to the standard detection method. According to the relative activity of the enzyme at different pH, the histogram and the line graph were drawn to determine the optimum reaction pH of the enzyme. The relative activity of the enzyme at each reaction pH was measured with the inactivated enzyme as a control, with the highest activity being 100%.
结果如图4所示,AtAGM2以GlcNAc-6-P为底物时的最适pH范围在中性偏酸性范围内,最适pH为5.6。如图6所示,AtAGM3以GlcNAc-6-P为底物时的最适pH范围在中性内,最适pH为7.0。与AtAGM1的最适pH为7.6相比,三者最适pH相差不大,均处于中性范围内。The results are shown in Figure 4. When AtAGM2 uses GlcNAc-6-P as the substrate, the optimum pH range is in the neutral to acidic range, and the optimum pH is 5.6. As shown in Figure 6, when AtAGM3 uses GlcNAc-6-P as a substrate, the optimum pH range is within neutral, and the optimum pH is 7.0. Compared with the optimum pH of AtAGM1 which was 7.6, the optimum pH of the three were not much different, and they were all in the neutral range.
(4)温度对重组酶AtAGM的影响(4) The influence of temperature on the recombinant enzyme AtAGM
采用实施例4(1)中提到的通用体系,以30μM的GlcNAc-1-P或GlcNAc-6-P为底物,分别在4-80℃下按照标准方法测定重组酶的活性,5μg酶在不同温度下的相对活性如图5,7所示。以灭活的酶为对照,以反应最高酶活力为100%计算相对酶活。AtAGM2以GlcNAc-6-P作为底物的最适反应温度为30℃(图5),AtAGM2以以GlcNAc-6-P作为底物的最适反应温度是20℃(图7)。与AtAGM1的最适温度范围为25-30℃相比,三者对反应温度的要求也大致相同。Using the general system mentioned in Example 4(1), using 30 μM GlcNAc-1-P or GlcNAc-6-P as a substrate, the activity of the recombinant enzyme was determined according to the standard method at 4-80° C., 5 μg enzyme The relative activities at different temperatures are shown in Figures 5 and 7. Taking the inactivated enzyme as a control, the relative enzyme activity was calculated with the highest enzyme activity as 100%. The optimum reaction temperature of AtAGM2 with GlcNAc-6-P as substrate is 30°C (Figure 5), and the optimum reaction temperature of AtAGM2 with GlcNAc-6-P as substrate is 20°C (Figure 7). Compared with AtAGM1, which has an optimum temperature range of 25-30°C, the requirements for the reaction temperature of the three are roughly the same.
(5)EDTA、SDS及金属离子等对AtAGM活性的影响(5) Effects of EDTA, SDS and metal ions on the activity of AtAGM
采用实施例4(1)中提到的通用体系,以30μM的GlcNAc-6-P作为底物,反应体系中设定各种金属离子浓度在10mM,按照标准方法检测酶活力。以灭活的酶作为对照,以活性最高的值为100%。结果如图8所示,Mg2+对AtAGM2的酶活具有明显的提高作用,一些离子如Ca2 +,Zn2+等对反应有明显的抑制作用。如图9所示,Mg2+对AtAGM3的酶活也具有明显的提高作用,一些离子如Fe2+,等对反应有明显的抑制作用。对于AtAGM1来说,Mg2+的存在也可以促进反应的进行。说明Mg2+激活反应的现象,是所有的AtAGM的共性。The general system mentioned in Example 4 (1) was adopted, 30 μM GlcNAc-6-P was used as the substrate, the concentration of various metal ions was set at 10 mM in the reaction system, and the enzyme activity was detected according to standard methods. The inactivated enzyme was used as a control, and the value with the highest activity was 100%. The results are shown in Figure 8, Mg 2+ can obviously improve the enzyme activity of AtAGM2, and some ions such as Ca 2 + and Zn 2+ can obviously inhibit the reaction. As shown in Figure 9, Mg 2+ can also significantly improve the enzyme activity of AtAGM3, and some ions such as Fe 2+ can obviously inhibit the reaction. For AtAGM1, the presence of Mg 2+ can also promote the reaction. It shows that the phenomenon of Mg 2+ activation reaction is common to all AtAGMs.
SEQUENCE LISTINGSEQUENCE LISTING
<110> 中国科学院大连化学物理研究所<110> Dalian Institute of Chemical Physics, Chinese Academy of Sciences
<120> AtAGM2与AtAGM3编码基因及酶的制备与应用<120> Preparation and application of AtAGM2 and AtAGM3 coding genes and enzymes
<130><130>
<160> 8<160> 8
<170> PatentIn version 3.5<170> PatentIn version 3.5
<210> 1<210> 1
<211> 1878<211> 1878
<212> DNA<212>DNA
<213> 拟南芥(Arabidopsis thaliana)<213> Arabidopsis thaliana
<220><220>
<221> DNA<221> DNA
<222> (1)..(1878)<222> (1)..(1878)
<400> 1<400> 1
atggaaggaa aggttttcca aaactttaat gtagtacaga gctgctaccg acaaaataag 60atggaaggaa aggttttcca aaactttaat gtagtacaga gctgctaccg acaaaataag 60
cagtttaaga cacgatacca aagagaacct gacctgttca tgtctacttt acttccctgt 120cagtttaaga cacgatacca aagagaacct gacctgttca tgtctacttt acttccctgt 120
ccaagagaga agatggcatt taacctcaac tcttccatgc gtgcccacac tttgtctaaa 180ccaagagaga agatggcatt taacctcaac tcttccatgc gtgccccacac tttgtctaaa 180
taccagtttg ttctttcaaa gcaaagaact ttttactgca atgctacttc gtcaagtgct 240taccagtttg ttctttcaaa gcaaagaact ttttactgca atgctacttc gtcaagtgct 240
actgtgccat ctcttgacaa aaatgatttt ctgaagctcc aaaacggcag tgatattcgg 300actgtgccat ctcttgacaa aaatgatttt ctgaagctcc aaaacggcag tgatattcgg 300
ggtgtagcgg tcactggggt tgagggggaa cctgtaagcc ttcctgaacc agtgactgaa 360ggtgtagcgg tcactggggt tgagggggaa cctgtaagcc ttcctgaacc agtgactgaa 360
gccatagctg ctgcttttgg gcaatggctg ttacacaaga agaaggctga atcccggcgt 420gccatagctg ctgcttttgg gcaatggctg ttacacaaga agaaggctga atcccggcgt 420
ttgagagtat ctgttggcca tgactctcgc atctctgcac aaactttgct ggaggcggtt 480ttgagagtat ctgttggcca tgactctcgc atctctgcac aaactttgct ggaggcggtt 480
tctcgaggtc ttggtgtttc tggattagat gttgttcagt ttggattagc atcaacacca 540tctcgaggtc ttggtgtttc tggattagat gttgttcagt ttggattagc atcaacacca 540
gcaatgttta atagcacatt gactgaagat gagtcattct tgtgcccagc tgatggggct 600gcaatgttta atagcacatt gactgaagat gagtcattct tgtgcccagc tgatggggct 600
attatgataa cagcaagcca tcttccttac aacaggaacg gtttcaagtt ctttaccagt 660attatgataa cagcaagcca tcttccttac aacaggaacg gtttcaagtt ctttaccagt 660
gatggaggac ttgggaaggt tgatatcaag aacattttgg agcgagctgc agatatttac 720gatggaggac ttgggaaggt tgatatcaag aacattttgg agcgagctgc agatattac 720
aagaagcttt ctgatgaaaa tttgaggaaa tcacaaagag aaagttcttc tattacaaag 780aagaagcttt ctgatgaaaa tttgaggaaa tcacaaagag aaagttcttc tattacaaag 780
gttgactaca tgtcagtata cacctctggt cttgtaaagg cagtccggaa agcagcagga 840gttgactaca tgtcagtata cacctctggt cttgtaaagg cagtccggaa agcagcagga 840
gatttggaga agcctctaga gggatttcat atagttgttg atgctggaaa tggagctgga 900gatttggaga agcctctaga gggatttcat atagttgttg atgctggaaa tggagctgga 900
ggattttttg ctgccaaggt gcttgagcct ttaggagcaa ttacttctgg cagtcaattt 960ggattttttg ctgccaaggt gcttgagcct ttaggagcaa ttacttctgg cagtcaattt 960
ctggaaccag atggtatgtt cccaaatcat atccctaatc cggaagataa ggcggcaatg 1020ctggaaccag atggtatgtt cccaaatcat atccctaatc cggaagataa ggcggcaatg 1020
gaagctataa ccaaggctgt tcttgataat aaggctgatt tgggtatcat ctttgatact 1080gaagctataa ccaaggctgt tcttgataat aaggctgatt tgggtatcat ctttgatact 1080
gatgttgata ggtctgctgc tgtggattca tctggccgtg aattcaaccg taatcgtctt 1140gatgttgata ggtctgctgc tgtggattca tctggccgtg aattcaaccg taatcgtctt 1140
attgccttgc tatcagccat tgttctagag gaacaccctg gcacaactat agttacggat 1200attgccttgc tatcagccat tgttctagag gaacaccctg gcacaactat agttacggat 1200
agtgtcactt cggacggtct gacctcattt attgagaaga agcttggcgg aaagcatcac 1260agtgtcactt cggacggtct gacctcattt attgagaaga agcttggcgg aaagcatcac 1260
aggttcaaaa gaggttacaa gaatgtcatt gacgaagcta ttcgcttgaa ctcggttggg 1320aggttcaaaa gaggttacaa gaatgtcatt gacgaagcta ttcgcttgaa ctcggttggg 1320
gaagaatcac atctggctat agaaaccagt ggtcatggag ctctaaagga aaaccattgg 1380gaagaatcac atctggctat agaaaccagt ggtcatggag ctctaaagga aaaccattgg 1380
ctcgacgatg gggcctatct catggtaaaa atcctgaaca aactagctgc ggcccgagct 1440ctcgacgatg gggcctatct catggtaaaa atcctgaaca aactagctgc ggcccgagct 1440
gctggtcaag ggagtggcag caaagtttta acagatcttg ttgaaggtct ggaagagccc 1500gctggtcaag ggagtggcag caaagtttta acagatcttg ttgaaggtct ggaagagccc 1500
aaagtggctt tagaactgag gcttaaaatc gacaagaatc accctgacct tgaaggaagt 1560aaagtggctt tagaactgag gcttaaaatc gacaagaatc accctgacct tgaaggaagt 1560
gatttccggg agtatggaga gaaggtcctg caacacgtgt cgaactcaat agaaacaaat 1620gatttccggg agtatggaga gaaggtcctg caacacgtgt cgaactcaat agaaacaaat 1620
ccaaatctta taatagctcc agttaactac gaagggatcc gcgtttcggg ctttggtgga 1680ccaaatctta taatagctcc agttaactac gaagggatcc gcgtttcggg ctttggtgga 1680
tggtttcttc tcagactttc tctccatgat cctgttcttc cccttaacat cgaggcacag 1740tggtttcttc tcagactttc tctccatgat cctgttcttc cccttaacat cgaggcacag 1740
agtgaggatg atgctgtgaa attaggcctt gtggttgcta cgacagtgaa ggagttcaat 1800agtgaggatg atgctgtgaa attaggcctt gtggttgcta cgacagtgaa ggagttcaat 1800
gctttggaca cctgtgcctt gtccaacctc actcactcct ccgcggccgc actcgagcac 1860gctttggaca cctgtgcctt gtccaacctc actcactcct ccgcggccgc actcgagcac 1860
caccaccacc accactga 1878caccaccacc accactga 1878
<210> 2<210> 2
<211> 625<211> 625
<212> PRT<212> PRT
<213> 拟南芥(Arabidopsis thaliana)<213> Arabidopsis thaliana
<220><220>
<221> PRT<221> PRT
<222> (1)..(625)<222> (1)..(625)
<400> 2<400> 2
Met Glu Gly Lys Val Phe Gln Asn Phe Asn Val Val Gln Ser Cys TyrMet Glu Gly Lys Val Phe Gln Asn Phe Asn Val Val Gln Ser Cys Tyr
1 5 10 151 5 10 15
Arg Gln Asn Lys Gln Phe Lys Thr Arg Tyr Gln Arg Glu Pro Asp LeuArg Gln Asn Lys Gln Phe Lys Thr Arg Tyr Gln Arg Glu Pro Asp Leu
20 25 30 20 25 30
Phe Met Ser Thr Leu Leu Pro Cys Pro Arg Glu Lys Met Ala Phe AsnPhe Met Ser Thr Leu Leu Pro Cys Pro Arg Glu Lys Met Ala Phe Asn
35 40 45 35 40 45
Leu Asn Ser Ser Met Arg Ala His Thr Leu Ser Lys Tyr Gln Phe ValLeu Asn Ser Ser Met Arg Ala His Thr Leu Ser Lys Tyr Gln Phe Val
50 55 60 50 55 60
Leu Ser Lys Gln Arg Thr Phe Tyr Cys Asn Ala Thr Ser Ser Ser AlaLeu Ser Lys Gln Arg Thr Phe Tyr Cys Asn Ala Thr Ser Ser Ser Ser Ala
65 70 75 8065 70 75 80
Thr Val Pro Ser Leu Asp Lys Asn Asp Phe Leu Lys Leu Gln Asn GlyThr Val Pro Ser Leu Asp Lys Asn Asp Phe Leu Lys Leu Gln Asn Gly
85 90 95 85 90 95
Ser Asp Ile Arg Gly Val Ala Val Thr Gly Val Glu Gly Glu Pro ValSer Asp Ile Arg Gly Val Ala Val Thr Gly Val Glu Gly Glu Pro Val
100 105 110 100 105 110
Ser Leu Pro Glu Pro Val Thr Glu Ala Ile Ala Ala Ala Phe Gly GlnSer Leu Pro Glu Pro Val Thr Glu Ala Ile Ala Ala Ala Phe Gly Gln
115 120 125 115 120 125
Trp Leu Leu His Lys Lys Lys Ala Glu Ser Arg Arg Leu Arg Val SerTrp Leu Leu His Lys Lys Lys Ala Glu Ser Arg Arg Leu Arg Val Ser
130 135 140 130 135 140
Val Gly His Asp Ser Arg Ile Ser Ala Gln Thr Leu Leu Glu Ala ValVal Gly His Asp Ser Arg Ile Ser Ala Gln Thr Leu Leu Glu Ala Val
145 150 155 160145 150 155 160
Ser Arg Gly Leu Gly Val Ser Gly Leu Asp Val Val Gln Phe Gly LeuSer Arg Gly Leu Gly Val Ser Gly Leu Asp Val Val Gln Phe Gly Leu
165 170 175 165 170 175
Ala Ser Thr Pro Ala Met Phe Asn Ser Thr Leu Thr Glu Asp Glu SerAla Ser Thr Pro Ala Met Phe Asn Ser Thr Leu Thr Glu Asp Glu Ser
180 185 190 180 185 190
Phe Leu Cys Pro Ala Asp Gly Ala Ile Met Ile Thr Ala Ser His LeuPhe Leu Cys Pro Ala Asp Gly Ala Ile Met Ile Thr Ala Ser His Leu
195 200 205 195 200 205
Pro Tyr Asn Arg Asn Gly Phe Lys Phe Phe Thr Ser Asp Gly Gly LeuPro Tyr Asn Arg Asn Gly Phe Lys Phe Phe Thr Ser Asp Gly Gly Leu
210 215 220 210 215 220
Gly Lys Val Asp Ile Lys Asn Ile Leu Glu Arg Ala Ala Asp Ile TyrGly Lys Val Asp Ile Lys Asn Ile Leu Glu Arg Ala Ala Asp Ile Tyr
225 230 235 240225 230 235 240
Lys Lys Leu Ser Asp Glu Asn Leu Arg Lys Ser Gln Arg Glu Ser SerLys Lys Leu Ser Asp Glu Asn Leu Arg Lys Ser Gln Arg Glu Ser Ser
245 250 255 245 250 255
Ser Ile Thr Lys Val Asp Tyr Met Ser Val Tyr Thr Ser Gly Leu ValSer Ile Thr Lys Val Asp Tyr Met Ser Val Tyr Thr Ser Gly Leu Val
260 265 270 260 265 270
Lys Ala Val Arg Lys Ala Ala Gly Asp Leu Glu Lys Pro Leu Glu GlyLys Ala Val Arg Lys Ala Ala Gly Asp Leu Glu Lys Pro Leu Glu Gly
275 280 285 275 280 285
Phe His Ile Val Val Asp Ala Gly Asn Gly Ala Gly Gly Phe Phe AlaPhe His Ile Val Val Asp Ala Gly Asn Gly Ala Gly Gly Phe Phe Ala
290 295 300 290 295 300
Ala Lys Val Leu Glu Pro Leu Gly Ala Ile Thr Ser Gly Ser Gln PheAla Lys Val Leu Glu Pro Leu Gly Ala Ile Thr Ser Gly Ser Gln Phe
305 310 315 320305 310 315 320
Leu Glu Pro Asp Gly Met Phe Pro Asn His Ile Pro Asn Pro Glu AspLeu Glu Pro Asp Gly Met Phe Pro Asn His Ile Pro Asn Pro Glu Asp
325 330 335 325 330 335
Lys Ala Ala Met Glu Ala Ile Thr Lys Ala Val Leu Asp Asn Lys AlaLys Ala Ala Met Glu Ala Ile Thr Lys Ala Val Leu Asp Asn Lys Ala
340 345 350 340 345 350
Asp Leu Gly Ile Ile Phe Asp Thr Asp Val Asp Arg Ser Ala Ala ValAsp Leu Gly Ile Ile Phe Asp Thr Asp Val Asp Arg Ser Ala Ala Val
355 360 365 355 360 365
Asp Ser Ser Gly Arg Glu Phe Asn Arg Asn Arg Leu Ile Ala Leu LeuAsp Ser Ser Gly Arg Glu Phe Asn Arg Asn Arg Leu Ile Ala Leu Leu
370 375 380 370 375 380
Ser Ala Ile Val Leu Glu Glu His Pro Gly Thr Thr Ile Val Thr AspSer Ala Ile Val Leu Glu Glu His Pro Gly Thr Thr Ile Val Thr Asp
385 390 395 400385 390 395 400
Ser Val Thr Ser Asp Gly Leu Thr Ser Phe Ile Glu Lys Lys Leu GlySer Val Thr Ser Asp Gly Leu Thr Ser Phe Ile Glu Lys Lys Leu Gly
405 410 415 405 410 415
Gly Lys His His Arg Phe Lys Arg Gly Tyr Lys Asn Val Ile Asp GluGly Lys His His Arg Phe Lys Arg Gly Tyr Lys Asn Val Ile Asp Glu
420 425 430 420 425 430
Ala Ile Arg Leu Asn Ser Val Gly Glu Glu Ser His Leu Ala Ile GluAla Ile Arg Leu Asn Ser Val Gly Glu Glu Ser His Leu Ala Ile Glu
435 440 445 435 440 445
Thr Ser Gly His Gly Ala Leu Lys Glu Asn His Trp Leu Asp Asp GlyThr Ser Gly His Gly Ala Leu Lys Glu Asn His Trp Leu Asp Asp Gly
450 455 460 450 455 460
Ala Tyr Leu Met Val Lys Ile Leu Asn Lys Leu Ala Ala Ala Arg AlaAla Tyr Leu Met Val Lys Ile Leu Asn Lys Leu Ala Ala Ala Arg Ala
465 470 475 480465 470 475 480
Ala Gly Gln Gly Ser Gly Ser Lys Val Leu Thr Asp Leu Val Glu GlyAla Gly Gln Gly Ser Gly Ser Lys Val Leu Thr Asp Leu Val Glu Gly
485 490 495 485 490 495
Leu Glu Glu Pro Lys Val Ala Leu Glu Leu Arg Leu Lys Ile Asp LysLeu Glu Glu Pro Lys Val Ala Leu Glu Leu Arg Leu Lys Ile Asp Lys
500 505 510 500 505 510
Asn His Pro Asp Leu Glu Gly Ser Asp Phe Arg Glu Tyr Gly Glu LysAsn His Pro Asp Leu Glu Gly Ser Asp Phe Arg Glu Tyr Gly Glu Lys
515 520 525 515 520 525
Val Leu Gln His Val Ser Asn Ser Ile Glu Thr Asn Pro Asn Leu IleVal Leu Gln His Val Ser Asn Ser Ile Glu Thr Asn Pro Asn Leu Ile
530 535 540 530 535 540
Ile Ala Pro Val Asn Tyr Glu Gly Ile Arg Val Ser Gly Phe Gly GlyIle Ala Pro Val Asn Tyr Glu Gly Ile Arg Val Ser Gly Phe Gly Gly
545 550 555 560545 550 555 560
Trp Phe Leu Leu Arg Leu Ser Leu His Asp Pro Val Leu Pro Leu AsnTrp Phe Leu Leu Arg Leu Ser Leu His Asp Pro Val Leu Pro Leu Asn
565 570 575 565 570 575
Ile Glu Ala Gln Ser Glu Asp Asp Ala Val Lys Leu Gly Leu Val ValIle Glu Ala Gln Ser Glu Asp Asp Ala Val Lys Leu Gly Leu Val Val
580 585 590 580 585 590
Ala Thr Thr Val Lys Glu Phe Asn Ala Leu Asp Thr Cys Ala Leu SerAla Thr Thr Val Lys Glu Phe Asn Ala Leu Asp Thr Cys Ala Leu Ser
595 600 605 595 600 605
Asn Leu Thr His Ser Ser Ala Ala Ala Leu Glu His His His His HisAsn Leu Thr His Ser Ser Ala Ala Ala Leu Glu His His His His His His His
610 615 620 610 615 620
HisHis
625625
<210> 3<210> 3
<211> 1872<211> 1872
<212> DNA<212>DNA
<213> 拟南芥(Arabidopsis thaliana)<213> Arabidopsis thaliana
<220><220>
<221> DNA<221> DNA
<222> (1)..(1872)<222> (1)..(1872)
<400> 3<400> 3
atggcgtcga cttcaacatc atctttaatg gcttctaaaa ctgtaatctc caaaacagct 60atggcgtcga cttcaacatc atctttaatg gcttctaaaa ctgtaatctc caaaacagct 60
ctgttttctt ccttaccggg aatagtcagc cggagttttt taacattcgc accggcttct 120ctgttttctt ccttaccgggg aatagtcagc cggagttttt taacattcgc accggcttct 120
ccttccgtta aaccccttag gataagatct tcaaatgtta ctaagttcga cgaagtaacc 180ccttccgtta aaccccttag gataagatct tcaaatgtta ctaagttcga cgaagtaacc 180
aacagtcttg acgaagacat ggaccagatt cgacggttac aaaacggttc tgacgtgaga 240aacagtcttg acgaagacat ggaccagatt cgacggttac aaaacggttc tgacgtgaga 240
ggagtcgcat tggaaggaga gaaaggtcga acagttgacc taacgcctgc agctgttgaa 300ggagtcgcat tggaaggaga gaaaggtcga acagttgacc taacgcctgc agctgttgaa 300
gcaatcgcag agagctttgg agaatgggtt gcagcaacgg agagtaacgg aaacggcgtc 360gcaatcgcag agagctttgg agaatgggtt gcagcaacgg agagtaacgg aaacggcgtc 360
attaagattt ctctcggacg ggatccacgt gtttccggtg ggaagctaag cacggcagtg 420attaagattt ctctcggacg ggatccacgt gtttccggtg ggaagctaag cacggcagtg 420
tttgccggct tagctcgtgc aggctgttta gcttttgaca tgggtttagc tacagcgcca 480tttgccggct tagctcgtgc aggctgttta gcttttgaca tgggtttagc tacagcgcca 480
gcttgcttca tgagcacgtt actctctcca ttcgaatacg acgcttcaat tatgatgaca 540gcttgcttca tgagcacgtt actctctcca ttcgaatacg acgcttcaat tatgatgaca 540
gcttctcatt taccgtatac aagaaacgga ctcaagttct ttaccaagag aggaggatta 600gcttctcatt taccgtatac aagaaacgga ctcaagttct ttaccaagag aggaggatta 600
acgtctcctg aagtggagaa gatatgcgat ttagctgcgc gaaagtacgc tactaggcag 660acgtctcctg aagtggagaa gatatgcgat ttagctgcgc gaaagtacgc tactaggcag 660
actaaagtct ctacattgat cagaacgcga ccgcagcaag ttgattttat gagcgcttac 720actaaagtct ctacattgat cagaacgcga ccgcagcaag ttgattttat gagcgcttac 720
tctaagcacc ttagagaaat cattaaagag agaatcaatc accctgaaca ctatgacact 780tctaagcacc ttagagaaat cattaaagag agaatcaatc accctgaaca ctatgacact 780
cctctcaaag gatttcagat agttgtgaat gcgggtaatg ggtcaggagg cttctttacg 840cctctcaaag gatttcagat agttgtgaat gcgggtaatg ggtcaggagg cttctttacg 840
tgggacgttc tagacaagtt aggagccgat acattcggtt cgctctatct aaaccctgac 900tgggacgttc tagacaagtt aggagccgat acattcggtt cgctctatct aaaccctgac 900
gggatgttcc ctaatcacat tcctaatccg gaaaacaaaa tcgcaatgca acacacccga 960gggatgttcc ctaatcacat tcctaatccg gaaaacaaaa tcgcaatgca acacacccga 960
gccgcggttc ttgagaactc agcagatctc ggggttgtgt ttgatacgga tgttgacagg 1020gccgcggttc ttgagaactc agcagatctc ggggttgtgt ttgatacgga tgttgacagg 1020
agtggagtgg tggataacaa aggaaatcct atcaacggag ataagcttat tgcgcttatg 1080agtggagtgg tggataacaa aggaaatcct atcaacggag ataagcttat tgcgcttatg 1080
tcagctatag tgcttaaaga acatccagga agtacagtag tgactgacgc aagaacgagt 1140tcagctatag tgcttaaaga acatccagga agtacagtag tgactgacgc aagaacgagt 1140
atggggctaa ctaggtttat aacggagcga ggagggaggc attgtttgta tagagtaggg 1200atggggctaa ctaggtttat aacggagcga ggagggaggc attgtttgta tagagtaggg 1200
tatagaaacg tgattgacaa gggagtagag ctgaacaaag acggcatcga gactcatctc 1260tatagaaacg tgattgacaa gggagtagag ctgaacaaag acggcatcga gactcatctc 1260
atgatggaaa cttcaggaca tggtgcggtt aaggagaatc acttcttgga tgatggtgca 1320atgatggaaa cttcaggaca tggtgcggtt aaggagaatc acttcttgga tgatggtgca 1320
tacatggtgg tgaagatcat aattgaaatg gtgagaatga gactcgcggg atcaaatgaa 1380tacatggtgg tgaagatcat aattgaaatg gtgagaatga gactcgcggg atcaaatgaa 1380
ggtatcggta gtttgatcga agatcttgag gagccgttag aagcggttga gcttcggttg 1440ggtatcggta gtttgatcga agatcttgag gagccgttag aagcggttga gcttcggttg 1440
aatattttat cagagccaag agatgccaaa gcaaaaggca ttgaagccat tgagactttc 1500aatattttat cagagccaag agatgccaaa gcaaaaggca ttgaagccat tgagactttc 1500
aggcaataca ttgaggaagg aaaactgaaa gggtgggaat tgggcacgtg tggggattgt 1560aggcaataca ttgaggaagg aaaactgaaa gggtgggaat tgggcacgtg tggggattgt 1560
tgggttactg aaggttgctt ggtggactca aatgatcatc catctgctat tgatgctcac 1620tgggttactg aaggttgctt ggtggactca aatgatcatc catctgctat tgatgctcac 1620
atgtacaggg caagagtgag tgatgaagag agtggggaag agtatggttg ggtgcatatg 1680atgtacaggg caagagtgag tgatgaagag agtggggaag agtatggttg ggtgcatatg 1680
aggcagagta ttcataaccc taacatcgca cttaatatgc aatcaatgct tcctggtgga 1740aggcagagta ttcataaccc taacatcgca cttaatatgc aatcaatgct tcctggtgga 1740
tgtctctcca tgacaagaat cttcagagac cagtttcttg aagctagtgg ggtggctaga 1800tgtctctcca tgacaagaat cttcagagac cagtttcttg aagctagtgg ggtggctaga 1800
ttcctggata taagtgactt cgacaattac atcggaggtc aatctctcga gcaccaccac 1860ttcctggata taagtgactt cgacaattac atcggaggtc aatctctcga gcaccaccac 1860
caccaccact ga 1872caccaccact ga 1872
<210> 4<210> 4
<211> 623<211>623
<212> PRT<212> PRT
<213> MASTSTSSLMASKTVISKTALFSSLPGIVSRSFLTFAPASPSVKPLRIRSSNVTKFDEVTNSLDEDMDQIRRLQNGSDVRGVALEGEKGRTVDLTPAAVEAIAESFGEWVAATESNGNGVIKISLGRDPRVSGGKLSTAVFAGLARAGCLAFDMGLATAP<213> MASTSTSSLMASKTVISKTALFSSLPGIVSRSFLTFAPASSPVKPLRIRSSNVTKFDEVTNSLDEDMDQIRRLQNGSDVRGVALEGEKGRTVDLTPAAVEAIAESFGEWVAATESNGNGVIKISLGRDPRVSGGKLSTAVFAGLARAGCLAFDMGLATAP
<220><220>
<221> PRT<221> PRT
<222> (1)..(623)<222> (1)..(623)
<400> 4<400> 4
Met Ala Ser Thr Ser Thr Ser Ser Leu Met Ala Ser Lys Thr Val IleMet Ala Ser Thr Ser Thr Ser Ser Leu Met Ala Ser Lys Thr Val Ile
1 5 10 151 5 10 15
Ser Lys Thr Ala Leu Phe Ser Ser Leu Pro Gly Ile Val Ser Arg SerSer Lys Thr Ala Leu Phe Ser Ser Leu Pro Gly Ile Val Ser Arg Ser
20 25 30 20 25 30
Phe Leu Thr Phe Ala Pro Ala Ser Pro Ser Val Lys Pro Leu Arg IlePhe Leu Thr Phe Ala Pro Ala Ser Pro Ser Val Lys Pro Leu Arg Ile
35 40 45 35 40 45
Arg Ser Ser Asn Val Thr Lys Phe Asp Glu Val Thr Asn Ser Leu AspArg Ser Ser Asn Val Thr Lys Phe Asp Glu Val Thr Asn Ser Leu Asp
50 55 60 50 55 60
Glu Asp Met Asp Gln Ile Arg Arg Leu Gln Asn Gly Ser Asp Val ArgGlu Asp Met Asp Gln Ile Arg Arg Leu Gln Asn Gly Ser Asp Val Arg
65 70 75 8065 70 75 80
Gly Val Ala Leu Glu Gly Glu Lys Gly Arg Thr Val Asp Leu Thr ProGly Val Ala Leu Glu Gly Glu Lys Gly Arg Thr Val Asp Leu Thr Pro
85 90 95 85 90 95
Ala Ala Val Glu Ala Ile Ala Glu Ser Phe Gly Glu Trp Val Ala AlaAla Ala Val Glu Ala Ile Ala Glu Ser Phe Gly Glu Trp Val Ala Ala
100 105 110 100 105 110
Thr Glu Ser Asn Gly Asn Gly Val Ile Lys Ile Ser Leu Gly Arg AspThr Glu Ser Asn Gly Asn Gly Val Ile Lys Ile Ser Leu Gly Arg Asp
115 120 125 115 120 125
Pro Arg Val Ser Gly Gly Lys Leu Ser Thr Ala Val Phe Ala Gly LeuPro Arg Val Ser Gly Gly Lys Leu Ser Thr Ala Val Phe Ala Gly Leu
130 135 140 130 135 140
Ala Arg Ala Gly Cys Leu Ala Phe Asp Met Gly Leu Ala Thr Ala ProAla Arg Ala Gly Cys Leu Ala Phe Asp Met Gly Leu Ala Thr Ala Pro
145 150 155 160145 150 155 160
Ala Cys Phe Met Ser Thr Leu Leu Ser Pro Phe Glu Tyr Asp Ala SerAla Cys Phe Met Ser Thr Leu Leu Ser Pro Phe Glu Tyr Asp Ala Ser
165 170 175 165 170 175
Ile Met Met Thr Ala Ser His Leu Pro Tyr Thr Arg Asn Gly Leu LysIle Met Met Thr Ala Ser His Leu Pro Tyr Thr Arg Asn Gly Leu Lys
180 185 190 180 185 190
Phe Phe Thr Lys Arg Gly Gly Leu Thr Ser Pro Glu Val Glu Lys IlePhe Phe Thr Lys Arg Gly Gly Leu Thr Ser Pro Glu Val Glu Lys Ile
195 200 205 195 200 205
Cys Asp Leu Ala Ala Arg Lys Tyr Ala Thr Arg Gln Thr Lys Val SerCys Asp Leu Ala Ala Arg Lys Tyr Ala Thr Arg Gln Thr Lys Val Ser
210 215 220 210 215 220
Thr Leu Ile Arg Thr Arg Pro Gln Gln Val Asp Phe Met Ser Ala TyrThr Leu Ile Arg Thr Arg Pro Gln Gln Val Asp Phe Met Ser Ala Tyr
225 230 235 240225 230 235 240
Ser Lys His Leu Arg Glu Ile Ile Lys Glu Arg Ile Asn His Pro GluSer Lys His Leu Arg Glu Ile Ile Lys Glu Arg Ile Asn His Pro Glu
245 250 255 245 250 255
His Tyr Asp Thr Pro Leu Lys Gly Phe Gln Ile Val Val Asn Ala GlyHis Tyr Asp Thr Pro Leu Lys Gly Phe Gln Ile Val Val Asn Ala Gly
260 265 270 260 265 270
Asn Gly Ser Gly Gly Phe Phe Thr Trp Asp Val Leu Asp Lys Leu GlyAsn Gly Ser Gly Gly Phe Phe Thr Trp Asp Val Leu Asp Lys Leu Gly
275 280 285 275 280 285
Ala Asp Thr Phe Gly Ser Leu Tyr Leu Asn Pro Asp Gly Met Phe ProAla Asp Thr Phe Gly Ser Leu Tyr Leu Asn Pro Asp Gly Met Phe Pro
290 295 300 290 295 300
Asn His Ile Pro Asn Pro Glu Asn Lys Ile Ala Met Gln His Thr ArgAsn His Ile Pro Asn Pro Glu Asn Lys Ile Ala Met Gln His Thr Arg
305 310 315 320305 310 315 320
Ala Ala Val Leu Glu Asn Ser Ala Asp Leu Gly Val Val Phe Asp ThrAla Ala Val Leu Glu Asn Ser Ala Asp Leu Gly Val Val Phe Asp Thr
325 330 335 325 330 335
Asp Val Asp Arg Ser Gly Val Val Asp Asn Lys Gly Asn Pro Ile AsnAsp Val Asp Arg Ser Gly Val Val Asp Asn Lys Gly Asn Pro Ile Asn
340 345 350 340 345 350
Gly Asp Lys Leu Ile Ala Leu Met Ser Ala Ile Val Leu Lys Glu HisGly Asp Lys Leu Ile Ala Leu Met Ser Ala Ile Val Leu Lys Glu His
355 360 365 355 360 365
Pro Gly Ser Thr Val Val Thr Asp Ala Arg Thr Ser Met Gly Leu ThrPro Gly Ser Thr Val Val Thr Asp Ala Arg Thr Ser Met Gly Leu Thr
370 375 380 370 375 380
Arg Phe Ile Thr Glu Arg Gly Gly Arg His Cys Leu Tyr Arg Val GlyArg Phe Ile Thr Glu Arg Gly Gly Arg His Cys Leu Tyr Arg Val Gly
385 390 395 400385 390 395 400
Tyr Arg Asn Val Ile Asp Lys Gly Val Glu Leu Asn Lys Asp Gly IleTyr Arg Asn Val Ile Asp Lys Gly Val Glu Leu Asn Lys Asp Gly Ile
405 410 415 405 410 415
Glu Thr His Leu Met Met Glu Thr Ser Gly His Gly Ala Val Lys GluGlu Thr His Leu Met Met Glu Thr Ser Gly His Gly Ala Val Lys Glu
420 425 430 420 425 430
Asn His Phe Leu Asp Asp Gly Ala Tyr Met Val Val Lys Ile Ile IleAsn His Phe Leu Asp Asp Gly Ala Tyr Met Val Val Lys Ile Ile Ile
435 440 445 435 440 445
Glu Met Val Arg Met Arg Leu Ala Gly Ser Asn Glu Gly Ile Gly SerGlu Met Val Arg Met Arg Leu Ala Gly Ser Asn Glu Gly Ile Gly Ser
450 455 460 450 455 460
Leu Ile Glu Asp Leu Glu Glu Pro Leu Glu Ala Val Glu Leu Arg LeuLeu Ile Glu Asp Leu Glu Glu Pro Leu Glu Ala Val Glu Leu Arg Leu
465 470 475 480465 470 475 480
Asn Ile Leu Ser Glu Pro Arg Asp Ala Lys Ala Lys Gly Ile Glu AlaAsn Ile Leu Ser Glu Pro Arg Asp Ala Lys Ala Lys Gly Ile Glu Ala
485 490 495 485 490 495
Ile Glu Thr Phe Arg Gln Tyr Ile Glu Glu Gly Lys Leu Lys Gly TrpIle Glu Thr Phe Arg Gln Tyr Ile Glu Glu Gly Lys Leu Lys Gly Trp
500 505 510 500 505 510
Glu Leu Gly Thr Cys Gly Asp Cys Trp Val Thr Glu Gly Cys Leu ValGlu Leu Gly Thr Cys Gly Asp Cys Trp Val Thr Glu Gly Cys Leu Val
515 520 525 515 520 525
Asp Ser Asn Asp His Pro Ser Ala Ile Asp Ala His Met Tyr Arg AlaAsp Ser Asn Asp His Pro Ser Ala Ile Asp Ala His Met Tyr Arg Ala
530 535 540 530 535 540
Arg Val Ser Asp Glu Glu Ser Gly Glu Glu Tyr Gly Trp Val His MetArg Val Ser Asp Glu Glu Ser Gly Glu Glu Tyr Gly Trp Val His Met
545 550 555 560545 550 555 560
Arg Gln Ser Ile His Asn Pro Asn Ile Ala Leu Asn Met Gln Ser MetArg Gln Ser Ile His Asn Pro Asn Ile Ala Leu Asn Met Gln Ser Met
565 570 575 565 570 575
Leu Pro Gly Gly Cys Leu Ser Met Thr Arg Ile Phe Arg Asp Gln PheLeu Pro Gly Gly Cys Leu Ser Met Thr Arg Ile Phe Arg Asp Gln Phe
580 585 590 580 585 590
Leu Glu Ala Ser Gly Val Ala Arg Phe Leu Asp Ile Ser Asp Phe AspLeu Glu Ala Ser Gly Val Ala Arg Phe Leu Asp Ile Ser Asp Phe Asp
595 600 605 595 600 605
Asn Tyr Ile Gly Gly Gln Ser Leu Glu His His His His His HisAsn Tyr Ile Gly Gly Gln Ser Leu Glu His His His His His His His His
610 615 620 610 615 620
<210> 5<210> 5
<211> 30<211> 30
<212> DNA<212>DNA
<213> 人工合成<213> Synthetic
<220><220>
<221> DNA<221> DNA
<222> (1)..(30)<222> (1)..(30)
<400> 5<400> 5
gcgtccatgg aaggaaaggt tttccaaaac 30gcgtccatgg aaggaaaggt tttccaaaac 30
<210> 6<210> 6
<211> 27<211> 27
<212> DNA<212>DNA
<213> 人工合成<213> Synthetic
<220><220>
<221> DNA<221> DNA
<222> (1)..(27)<222> (1)..(27)
<400> 6<400> 6
atatatgcgg ccgcggagga gtgagtg 27atatatgcgg ccgcggagga gtgagtg 27
<210> 7<210> 7
<211> 29<211> 29
<212> DNA<212>DNA
<213> 人工合成<213> Synthetic
<220><220>
<221> DNA<221>DNA
<222> (1)..(29)<222> (1)..(29)
<400> 7<400> 7
aactccatgg cgtcgacttc aacatcatc 29aactccatgg cgtcgacttc aacatcatc 29
<210> 8<210> 8
<211> 29<211> 29
<212> DNA<212>DNA
<213> 人工合成<213> Synthetic
<220><220>
<221> DNA<221> DNA
<222> (1)..(29)<222> (1)..(29)
<400> 8<400> 8
actgctcgag agattgacct ccgatgtaa 29actgctcgag agattgacct ccgatgtaa 29
Claims (8)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201611133399.6A CN108611344A (en) | 2016-12-10 | 2016-12-10 | The preparation and application of AtAGM2 and AtAGM3 encoding genes and enzyme |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201611133399.6A CN108611344A (en) | 2016-12-10 | 2016-12-10 | The preparation and application of AtAGM2 and AtAGM3 encoding genes and enzyme |
Publications (1)
Publication Number | Publication Date |
---|---|
CN108611344A true CN108611344A (en) | 2018-10-02 |
Family
ID=63657478
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201611133399.6A Pending CN108611344A (en) | 2016-12-10 | 2016-12-10 | The preparation and application of AtAGM2 and AtAGM3 encoding genes and enzyme |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN108611344A (en) |
Citations (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1314950A (en) * | 1998-06-08 | 2001-09-26 | 以生物技术资源的名义经营的Dcv公司 | Vitamin C production in microorganisms and plants |
WO2003025007A2 (en) * | 2001-09-21 | 2003-03-27 | Affinium Pharmaceuticals, Inc. | Purified polypeptides involved in membrane biosynthesis |
CN1533439A (en) * | 2001-07-18 | 2004-09-29 | �������¹ɷ�����˾ | Process for preparing L-amino acids using strains of enterobascteriaceae family which contain an atteuated aspa gene |
US20050214773A1 (en) * | 2001-09-21 | 2005-09-29 | Affinium Pharmaceuticals, Inc. | Novel purified polypeptides from bacteria |
WO2014068238A1 (en) * | 2012-10-30 | 2014-05-08 | Ghislaine Tissot-Lecuelle | Production of hyaluronic acid in algae |
EP2927316A1 (en) * | 2014-03-31 | 2015-10-07 | Jennewein Biotechnologie GmbH | Total fermentation of oligosaccharides |
CN105821063A (en) * | 2015-01-05 | 2016-08-03 | 中国科学院大连化学物理研究所 | Incision alginate lyase Alg2B and coding gene, preparation and application thereof |
-
2016
- 2016-12-10 CN CN201611133399.6A patent/CN108611344A/en active Pending
Patent Citations (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1314950A (en) * | 1998-06-08 | 2001-09-26 | 以生物技术资源的名义经营的Dcv公司 | Vitamin C production in microorganisms and plants |
CN1533439A (en) * | 2001-07-18 | 2004-09-29 | �������¹ɷ�����˾ | Process for preparing L-amino acids using strains of enterobascteriaceae family which contain an atteuated aspa gene |
WO2003025007A2 (en) * | 2001-09-21 | 2003-03-27 | Affinium Pharmaceuticals, Inc. | Purified polypeptides involved in membrane biosynthesis |
US20050214773A1 (en) * | 2001-09-21 | 2005-09-29 | Affinium Pharmaceuticals, Inc. | Novel purified polypeptides from bacteria |
WO2014068238A1 (en) * | 2012-10-30 | 2014-05-08 | Ghislaine Tissot-Lecuelle | Production of hyaluronic acid in algae |
EP2927316A1 (en) * | 2014-03-31 | 2015-10-07 | Jennewein Biotechnologie GmbH | Total fermentation of oligosaccharides |
CN105821063A (en) * | 2015-01-05 | 2016-08-03 | 中国科学院大连化学物理研究所 | Incision alginate lyase Alg2B and coding gene, preparation and application thereof |
Non-Patent Citations (3)
Title |
---|
TABATA,S.ET AL.,: "Arabidopsis thaliana phosphoglucosamine mutase family protein mRNA", 《GENBANK》 * |
THEOLOGIS A. ET AL.,: "Arabidopsis thaliana phosphoglucomutase, putative / glucose phosphomutase (AT1G70820), mRNA", 《GENBANK》 * |
王伟继等: "五种海洋生物葡萄糖磷酸变位酶和磷酸葡萄糖异构酶的同工酶分析", 《水产学报》 * |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US10501813B2 (en) | Enzyme preparation containing thermostable DNA polymerase, method for producing same, and method for detecting subject organism to be detected | |
Girod et al. | Homologs of the essential ubiquitin conjugating enzymes UBC1, 4, and 5 in yeast are encoded by a multigene family in Arabidopsis thaliana | |
CN113302299B (en) | Psicose epimerase variants, methods of producing the same, and methods of producing psicose using the same | |
CN113736763B (en) | Myrosinase Rmmr and application thereof in preparation of sulforaphane and sulforaphane | |
CN113316636B (en) | DNA polymerase with improved enzymatic activity and use thereof | |
KR20170074120A (en) | Allose producing-strain using the fructose and method for producing allose using the same | |
Ooi et al. | RNA lariat debranching enzyme | |
CN105087516B (en) | ADP-glucose pyrophosphorylase mutant and screening method and application thereof | |
CN108531470B (en) | A kind of sulfate fucoidan lyase TFLFM and its preparation method and application | |
CN112852774A (en) | Phosphofructosyl aminotransferase encoding gene and preparation and application of enzyme | |
CN106755022B (en) | Acetylglucosamine phosphoglucomutase AtAGM coding gene, enzyme, preparation and application thereof, and enzyme activity detection method | |
CN108611344A (en) | The preparation and application of AtAGM2 and AtAGM3 encoding genes and enzyme | |
CN110144340B (en) | Chitosanase CsnQ and application thereof | |
GB2620254A (en) | Method for preparing multisubunit SCF E3 ligase with fusion protein through in vitro reconstitution, and use of multisubunit SCF E3 ligase | |
CN110195047B (en) | Chitosanase CsnT and application thereof | |
CN108374002B (en) | A kind of Gracilaria gracilis phosphoglucomutase protein and its encoding gene and application | |
WO2017070164A2 (en) | A thermostable haloarchaeal inorganic pyrophosphatase | |
CN105039281A (en) | UDPG pyrophosphatase produced from aureobasidium pullulans, coding gene, carrier and application | |
KR101895914B1 (en) | Preparation method of D-allose using a novel L-rhamnose isomerase | |
CN110272888B (en) | Novel chondrosulphatase CslF and application thereof | |
CN107164354A (en) | A kind of UDP xylose epimerase from Ornithogalum caudatum, its nucleotide sequence and application | |
WO2023082266A1 (en) | Chimeric dna polymerase and use thereof | |
JP4346449B2 (en) | Production of UPPase | |
KR20180007725A (en) | Mass production method of beta agarase | |
CN106191019B (en) | A kind of truncated uridine-5'-xylose diphosphate synthase, its nucleotide sequence and application |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
WD01 | Invention patent application deemed withdrawn after publication |
Application publication date: 20181002 |
|
WD01 | Invention patent application deemed withdrawn after publication |