CN109825538B - A method for synthesizing chiral 2-amino-1-butanol - Google Patents
- Publication number
- CN109825538B (CN201711181396.4A)
- Authority
- CN
- China
- Prior art keywords
- enzyme
- gly
- ala
- val
- seq
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- JCBPETKZIGVZRE-UHFFFAOYSA-N 2-aminobutan-1-ol Chemical compound CCC(N)CO JCBPETKZIGVZRE-UHFFFAOYSA-N 0.000 title claims abstract description 33
- 238000010189 synthetic method Methods 0.000 title claims description 4
- 102000004190 Enzymes Human genes 0.000 claims abstract description 239
- 108090000790 Enzymes Proteins 0.000 claims abstract description 239
- 102000007698 Alcohol dehydrogenase Human genes 0.000 claims abstract description 82
- 108010021809 Alcohol dehydrogenase Proteins 0.000 claims abstract description 82
- 108090000340 Transaminases Proteins 0.000 claims abstract description 76
- 102000003929 Transaminases Human genes 0.000 claims abstract description 76
- 238000000034 method Methods 0.000 claims abstract description 45
- 238000006555 catalytic reaction Methods 0.000 claims abstract description 36
- JCBPETKZIGVZRE-BYPYZUCNSA-N (2s)-2-aminobutan-1-ol Chemical compound CC[C@H](N)CO JCBPETKZIGVZRE-BYPYZUCNSA-N 0.000 claims abstract description 30
- 102000005751 Alcohol Oxidoreductases Human genes 0.000 claims abstract description 24
- 108010031132 Alcohol Oxidoreductases Proteins 0.000 claims abstract description 24
- 101710088194 Dehydrogenase Proteins 0.000 claims abstract description 20
- JCBPETKZIGVZRE-SCSAIBSYSA-N (2r)-2-aminobutan-1-ol Chemical compound CC[C@@H](N)CO JCBPETKZIGVZRE-SCSAIBSYSA-N 0.000 claims abstract description 18
- 239000000758 substrate Substances 0.000 claims abstract description 18
- 239000005515 coenzyme Substances 0.000 claims abstract description 16
- GFAZHVHNLUBROE-UHFFFAOYSA-N 1-hydroxybutan-2-one Chemical compound CCC(=O)CO GFAZHVHNLUBROE-UHFFFAOYSA-N 0.000 claims abstract description 15
- BMRWNKZVCUKKSR-UHFFFAOYSA-N butane-1,2-diol Chemical compound CCC(O)CO BMRWNKZVCUKKSR-UHFFFAOYSA-N 0.000 claims abstract description 15
- 230000015572 biosynthetic process Effects 0.000 claims abstract description 12
- 125000003275 alpha amino acid group Chemical group 0.000 claims abstract 29
- 108090000623 proteins and genes Proteins 0.000 claims description 90
- 238000006243 chemical reaction Methods 0.000 claims description 76
- 210000004027 cell Anatomy 0.000 claims description 49
- 150000001413 amino acids Chemical class 0.000 claims description 44
- 230000004927 fusion Effects 0.000 claims description 38
- 241000193385 Geobacillus stearothermophilus Species 0.000 claims description 36
- 102000004169 proteins and genes Human genes 0.000 claims description 35
- 239000007788 liquid Substances 0.000 claims description 20
- 239000013598 vector Substances 0.000 claims description 20
- 108091026890 Coding region Proteins 0.000 claims description 18
- 239000008176 lyophilized powder Substances 0.000 claims description 17
- CSCPPACGZOOCGX-UHFFFAOYSA-N Acetone Chemical compound CC(C)=O CSCPPACGZOOCGX-UHFFFAOYSA-N 0.000 claims description 16
- 230000035772 mutation Effects 0.000 claims description 16
- 108010028658 Leucine Dehydrogenase Proteins 0.000 claims description 14
- 239000008363 phosphate buffer Substances 0.000 claims description 13
- 241000894006 Bacteria Species 0.000 claims description 12
- 240000001929 Lactobacillus brevis Species 0.000 claims description 12
- 235000013957 Lactobacillus brevis Nutrition 0.000 claims description 12
- 108020001507 fusion proteins Proteins 0.000 claims description 12
- 102000037865 fusion proteins Human genes 0.000 claims description 12
- 241001112741 Bacillaceae Species 0.000 claims description 11
- 241000194107 Bacillus megaterium Species 0.000 claims description 11
- 238000003786 synthesis reaction Methods 0.000 claims description 11
- 241001225321 Aspergillus fumigatus Species 0.000 claims description 10
- 241001465318 Aspergillus terreus Species 0.000 claims description 10
- 241001468191 Lactobacillus kefiri Species 0.000 claims description 10
- 229940091771 aspergillus fumigatus Drugs 0.000 claims description 10
- 108020004707 nucleic acids Proteins 0.000 claims description 10
- 102000039446 nucleic acids Human genes 0.000 claims description 10
- 150000007523 nucleic acids Chemical class 0.000 claims description 10
- 241001507865 Aspergillus fischeri Species 0.000 claims description 9
- 241000223195 Fusarium graminearum Species 0.000 claims description 9
- WQZGKKKJIJFFOK-GASJEMHNSA-N Glucose Natural products OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-GASJEMHNSA-N 0.000 claims description 9
- 241000064289 Leifsonia sp. Species 0.000 claims description 9
- 241001147775 Thermoanaerobacter brockii Species 0.000 claims description 9
- 241000202694 Thermoanaerobacter wiegelii Species 0.000 claims description 9
- 239000000872 buffer Substances 0.000 claims description 9
- 239000008103 glucose Substances 0.000 claims description 9
- 239000000843 powder Substances 0.000 claims description 9
- 241000142559 Mycobacterium vanbaalenii Species 0.000 claims description 8
- 230000014509 gene expression Effects 0.000 claims description 8
- 239000013612 plasmid Substances 0.000 claims description 8
- 241000588724 Escherichia coli Species 0.000 claims description 7
- 108010050375 Glucose 1-Dehydrogenase Proteins 0.000 claims description 7
- JJWLVOIRVHMVIS-UHFFFAOYSA-N isopropylamine Chemical compound CC(C)N JJWLVOIRVHMVIS-UHFFFAOYSA-N 0.000 claims description 7
- 241000589517 Pseudomonas aeruginosa Species 0.000 claims description 6
- 102220348655 c.31A>G Human genes 0.000 claims description 5
- 229910052760 oxygen Inorganic materials 0.000 claims description 5
- 102220097971 rs764750609 Human genes 0.000 claims description 5
- 210000003527 eukaryotic cell Anatomy 0.000 claims description 4
- 210000001236 prokaryotic cell Anatomy 0.000 claims description 4
- 102220162436 rs886047270 Human genes 0.000 claims description 4
- 241000193830 Bacillus <bacterium> Species 0.000 claims description 3
- 230000001580 bacterial effect Effects 0.000 claims description 3
- 230000009261 transgenic effect Effects 0.000 claims description 3
- 102220565767 Carboxylesterase 4A_K68Y_mutation Human genes 0.000 claims description 2
- 241000186359 Mycobacterium Species 0.000 claims description 2
- 102220554267 Protein dispatched homolog 1_K68S_mutation Human genes 0.000 claims description 2
- 240000004808 Saccharomyces cerevisiae Species 0.000 claims description 2
- 230000006698 induction Effects 0.000 claims description 2
- 238000004806 packaging method and process Methods 0.000 claims description 2
- 102220237748 rs281865531 Human genes 0.000 claims description 2
- 210000005253 yeast cell Anatomy 0.000 claims description 2
- 241000186660 Lactobacillus Species 0.000 claims 6
- 229940039696 lactobacillus Drugs 0.000 claims 6
- 241000635201 Pumilus Species 0.000 claims 5
- 230000008676 import Effects 0.000 claims 1
- 230000001177 retroviral effect Effects 0.000 claims 1
- 241001515965 unidentified phage Species 0.000 claims 1
- 235000013618 yogurt Nutrition 0.000 claims 1
- 229940083957 1,2-butanediol Drugs 0.000 abstract description 13
- 230000004186 co-expression Effects 0.000 abstract description 7
- 230000002194 synthesizing effect Effects 0.000 abstract description 6
- 239000002994 raw material Substances 0.000 abstract description 3
- LRHPLDYGYMQRHN-UHFFFAOYSA-N 1-butanol Substances CCCCO LRHPLDYGYMQRHN-UHFFFAOYSA-N 0.000 abstract 1
- 235000001014 amino acid Nutrition 0.000 description 28
- 239000000243 solution Substances 0.000 description 27
- 229940024606 amino acid Drugs 0.000 description 26
- 235000018102 proteins Nutrition 0.000 description 26
- 238000002741 site-directed mutagenesis Methods 0.000 description 20
- 108020004414 DNA Proteins 0.000 description 19
- 238000002360 preparation method Methods 0.000 description 18
- NGVDGCNFYWLIFO-UHFFFAOYSA-N pyridoxal 5'-phosphate Chemical compound CC1=NC=C(COP(O)(O)=O)C(C=O)=C1O NGVDGCNFYWLIFO-UHFFFAOYSA-N 0.000 description 16
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 15
- 108010050848 glycylleucine Proteins 0.000 description 15
- 238000004817 gas chromatography Methods 0.000 description 14
- 239000013604 expression vector Substances 0.000 description 13
- 108010047857 aspartylglycine Proteins 0.000 description 12
- 238000001514 detection method Methods 0.000 description 12
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 11
- 108010037850 glycylvaline Proteins 0.000 description 11
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 9
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 9
- 108010038633 aspartylglutamate Proteins 0.000 description 9
- 238000004128 high performance liquid chromatography Methods 0.000 description 9
- 238000004811 liquid chromatography Methods 0.000 description 9
- 238000003259 recombinant expression Methods 0.000 description 9
- 108010076324 alanyl-glycyl-glycine Proteins 0.000 description 8
- 108010047495 alanylglycine Proteins 0.000 description 8
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 8
- 108010049041 glutamylalanine Proteins 0.000 description 8
- 108010089804 glycyl-threonine Proteins 0.000 description 8
- 244000005700 microbiome Species 0.000 description 8
- 235000007682 pyridoxal 5'-phosphate Nutrition 0.000 description 8
- 239000011589 pyridoxal 5'-phosphate Substances 0.000 description 8
- 229960001327 pyridoxal phosphate Drugs 0.000 description 8
- SWQALSGKVLYKDT-UHFFFAOYSA-N Gly-Ile-Ala Natural products NCC(=O)NC(C(C)CC)C(=O)NC(C)C(O)=O SWQALSGKVLYKDT-UHFFFAOYSA-N 0.000 description 7
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 7
- 108010079364 N-glycylalanine Proteins 0.000 description 7
- 108010062796 arginyllysine Proteins 0.000 description 7
- 108010016616 cysteinylglycine Proteins 0.000 description 7
- 238000006467 substitution reaction Methods 0.000 description 7
- KRXIWXCXOARFNT-ZLUOBGJFSA-N Asp-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC(O)=O KRXIWXCXOARFNT-ZLUOBGJFSA-N 0.000 description 6
- VTYYLEPIZMXCLO-UHFFFAOYSA-L Calcium carbonate Chemical compound [Ca+2].[O-]C([O-])=O VTYYLEPIZMXCLO-UHFFFAOYSA-L 0.000 description 6
- OQQKUTVULYLCDG-ONGXEEELSA-N Gly-Lys-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCCCN)NC(=O)CN)C(O)=O OQQKUTVULYLCDG-ONGXEEELSA-N 0.000 description 6
- LZDNBBYBDGBADK-UHFFFAOYSA-N L-valyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C(C)C)C(O)=O)=CNC2=C1 LZDNBBYBDGBADK-UHFFFAOYSA-N 0.000 description 6
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 6
- 108010066427 N-valyltryptophan Proteins 0.000 description 6
- 241000228393 Sporidiobolus salmonicolor Species 0.000 description 6
- 108010087924 alanylproline Proteins 0.000 description 6
- 108010068265 aspartyltyrosine Proteins 0.000 description 6
- 108010064235 lysylglycine Proteins 0.000 description 6
- 230000003647 oxidation Effects 0.000 description 6
- 238000007254 oxidation reaction Methods 0.000 description 6
- 108010026333 seryl-proline Proteins 0.000 description 6
- 238000011426 transformation method Methods 0.000 description 6
- 108010073969 valyllysine Proteins 0.000 description 6
- AQPVUEJJARLJHB-BQBZGAKWSA-N Arg-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CCCN=C(N)N AQPVUEJJARLJHB-BQBZGAKWSA-N 0.000 description 5
- 241001198387 Escherichia coli BL21(DE3) Species 0.000 description 5
- CDGLBYSAZFIIJO-RCOVLWMOSA-N Ile-Gly-Gly Chemical compound CC[C@H](C)[C@H]([NH3+])C(=O)NCC(=O)NCC([O-])=O CDGLBYSAZFIIJO-RCOVLWMOSA-N 0.000 description 5
- 108010065920 Insulin Lispro Proteins 0.000 description 5
- RCFDOSNHHZGBOY-UHFFFAOYSA-N L-isoleucyl-L-alanine Natural products CCC(C)C(N)C(=O)NC(C)C(O)=O RCFDOSNHHZGBOY-UHFFFAOYSA-N 0.000 description 5
- LUTDBHBIHHREDC-IHRRRGAJSA-N Lys-Pro-Lys Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(O)=O LUTDBHBIHHREDC-IHRRRGAJSA-N 0.000 description 5
- XJLXINKUBYWONI-NNYOXOHSSA-O NADP(+) Chemical compound NC(=O)C1=CC=C[N+]([C@H]2[C@@H]([C@H](O)[C@@H](COP(O)(=O)OP(O)(=O)OC[C@@H]3[C@H]([C@@H](OP(O)(O)=O)[C@@H](O3)N3C4=NC=NC(N)=C4N=C3)O)O2)O)=C1 XJLXINKUBYWONI-NNYOXOHSSA-O 0.000 description 5
- YUPRIZTWANWWHK-DZKIICNBSA-N Phe-Val-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N YUPRIZTWANWWHK-DZKIICNBSA-N 0.000 description 5
- TWLMXDWFVNEFFK-FJXKBIBVSA-N Thr-Arg-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O TWLMXDWFVNEFFK-FJXKBIBVSA-N 0.000 description 5
- AHOLTQCAVBSUDP-PPCPHDFISA-N Thr-Ile-Lys Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](N)[C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O AHOLTQCAVBSUDP-PPCPHDFISA-N 0.000 description 5
- NDZYTIMDOZMECO-SHGPDSBTSA-N Thr-Thr-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O NDZYTIMDOZMECO-SHGPDSBTSA-N 0.000 description 5
- NLNCNKIVJPEFBC-DLOVCJGASA-N Val-Val-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O NLNCNKIVJPEFBC-DLOVCJGASA-N 0.000 description 5
- 108010005233 alanylglutamic acid Proteins 0.000 description 5
- 108010093581 aspartyl-proline Proteins 0.000 description 5
- 108010078144 glutaminyl-glycine Proteins 0.000 description 5
- 108010003700 lysyl aspartic acid Proteins 0.000 description 5
- 239000000203 mixture Substances 0.000 description 5
- 229940054441 o-phthalaldehyde Drugs 0.000 description 5
- 108010012581 phenylalanylglutamate Proteins 0.000 description 5
- ZWLUXSQADUDCSB-UHFFFAOYSA-N phthalaldehyde Chemical compound O=CC1=CC=CC=C1C=O ZWLUXSQADUDCSB-UHFFFAOYSA-N 0.000 description 5
- QDRGPQWIVZNJQD-CIUDSAMLSA-N Ala-Arg-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O QDRGPQWIVZNJQD-CIUDSAMLSA-N 0.000 description 4
- VHAQSYHSDKERBS-XPUUQOCRSA-N Ala-Val-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O VHAQSYHSDKERBS-XPUUQOCRSA-N 0.000 description 4
- ZZZWQALDSQQBEW-STQMWFEESA-N Arg-Gly-Tyr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ZZZWQALDSQQBEW-STQMWFEESA-N 0.000 description 4
- ORRJQLIATJDMQM-HJGDQZAQSA-N Asp-Leu-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O ORRJQLIATJDMQM-HJGDQZAQSA-N 0.000 description 4
- CGOHAEBMDSEKFB-FXQIFTODSA-N Glu-Glu-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O CGOHAEBMDSEKFB-FXQIFTODSA-N 0.000 description 4
- XMPXVJIDADUOQB-RCOVLWMOSA-N Gly-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C([O-])=O)NC(=O)CNC(=O)C[NH3+] XMPXVJIDADUOQB-RCOVLWMOSA-N 0.000 description 4
- MTBIKIMYHUWBRX-QWRGUYRKSA-N Gly-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)CN MTBIKIMYHUWBRX-QWRGUYRKSA-N 0.000 description 4
- SSFWXSNOKDZNHY-QXEWZRGKSA-N Gly-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN SSFWXSNOKDZNHY-QXEWZRGKSA-N 0.000 description 4
- GBYYQVBXFVDJPJ-WLTAIBSBSA-N Gly-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)CN)O GBYYQVBXFVDJPJ-WLTAIBSBSA-N 0.000 description 4
- RENBRDSDKPSRIH-HJWJTTGWSA-N Ile-Phe-Met Chemical compound N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(=O)O RENBRDSDKPSRIH-HJWJTTGWSA-N 0.000 description 4
- PZWBBXHHUSIGKH-OSUNSFLBSA-N Ile-Thr-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N PZWBBXHHUSIGKH-OSUNSFLBSA-N 0.000 description 4
- KFZMGEQAYNKOFK-UHFFFAOYSA-N Isopropanol Chemical compound CC(C)O KFZMGEQAYNKOFK-UHFFFAOYSA-N 0.000 description 4
- LZHJZLHSRGWBBE-IHRRRGAJSA-N Leu-Lys-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O LZHJZLHSRGWBBE-IHRRRGAJSA-N 0.000 description 4
- PDIDTSZKKFEDMB-UWVGGRQHSA-N Lys-Pro-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O PDIDTSZKKFEDMB-UWVGGRQHSA-N 0.000 description 4
- WGBMNLCRYKSWAR-DCAQKATOSA-N Met-Asp-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN WGBMNLCRYKSWAR-DCAQKATOSA-N 0.000 description 4
- BQVUABVGYYSDCJ-UHFFFAOYSA-N Nalpha-L-Leucyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)CC(C)C)C(O)=O)=CNC2=C1 BQVUABVGYYSDCJ-UHFFFAOYSA-N 0.000 description 4
- PXHVJJICTQNCMI-UHFFFAOYSA-N Nickel Chemical compound [Ni] PXHVJJICTQNCMI-UHFFFAOYSA-N 0.000 description 4
- MFQMZDPAZRZAPV-NAKRPEOUSA-N Ser-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CO)N MFQMZDPAZRZAPV-NAKRPEOUSA-N 0.000 description 4
- ZTPXSEUVYNNZRB-CDMKHQONSA-N Thr-Gly-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ZTPXSEUVYNNZRB-CDMKHQONSA-N 0.000 description 4
- UABYBEBXFFNCIR-YDHLFZDLSA-N Tyr-Asp-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O UABYBEBXFFNCIR-YDHLFZDLSA-N 0.000 description 4
- KTEZUXISLQTDDQ-NHCYSSNCSA-N Val-Lys-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)O)C(=O)O)N KTEZUXISLQTDDQ-NHCYSSNCSA-N 0.000 description 4
- 108010043240 arginyl-leucyl-glycine Proteins 0.000 description 4
- 108010069205 aspartyl-phenylalanine Proteins 0.000 description 4
- 239000002585 base Substances 0.000 description 4
- 108010078326 glycyl-glycyl-valine Proteins 0.000 description 4
- 108010081551 glycylphenylalanine Proteins 0.000 description 4
- 108010025306 histidylleucine Proteins 0.000 description 4
- 108010092114 histidylphenylalanine Proteins 0.000 description 4
- 108010018006 histidylserine Proteins 0.000 description 4
- 108010025153 lysyl-alanyl-alanine Proteins 0.000 description 4
- 108010009298 lysylglutamic acid Proteins 0.000 description 4
- 108010016686 methionyl-alanyl-serine Proteins 0.000 description 4
- 239000013642 negative control Substances 0.000 description 4
- 108010053725 prolylvaline Proteins 0.000 description 4
- 230000009467 reduction Effects 0.000 description 4
- 239000000126 substance Substances 0.000 description 4
- 108010061238 threonyl-glycine Proteins 0.000 description 4
- RWHQMRRVZJSKGX-UHFFFAOYSA-N 2-oxobutanal Chemical compound CCC(=O)C=O RWHQMRRVZJSKGX-UHFFFAOYSA-N 0.000 description 3
- YNOCMHZSWJMGBB-GCJQMDKQSA-N Ala-Thr-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O YNOCMHZSWJMGBB-GCJQMDKQSA-N 0.000 description 3
- NKNILFJYKKHBKE-WPRPVWTQSA-N Arg-Gly-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O NKNILFJYKKHBKE-WPRPVWTQSA-N 0.000 description 3
- GNYUVVJYGJFKHN-RVMXOQNASA-N Arg-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N GNYUVVJYGJFKHN-RVMXOQNASA-N 0.000 description 3
- XRNXPIGJPQHCPC-RCWTZXSCSA-N Arg-Thr-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CCCNC(N)=N)[C@@H](C)O)C(O)=O XRNXPIGJPQHCPC-RCWTZXSCSA-N 0.000 description 3
- HFPXZWPUVFVNLL-GUBZILKMSA-N Asn-Leu-Gln Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O HFPXZWPUVFVNLL-GUBZILKMSA-N 0.000 description 3
- CBWCQCANJSGUOH-ZKWXMUAHSA-N Asn-Val-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O CBWCQCANJSGUOH-ZKWXMUAHSA-N 0.000 description 3
- YDJVIBMKAMQPPP-LAEOZQHASA-N Asp-Glu-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O YDJVIBMKAMQPPP-LAEOZQHASA-N 0.000 description 3
- YNCHFVRXEQFPBY-BQBZGAKWSA-N Asp-Gly-Arg Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N YNCHFVRXEQFPBY-BQBZGAKWSA-N 0.000 description 3
- WBDWQKRLTVCDSY-WHFBIAKZSA-N Asp-Gly-Asp Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O WBDWQKRLTVCDSY-WHFBIAKZSA-N 0.000 description 3
- WSGVTKZFVJSJOG-RCOVLWMOSA-N Asp-Gly-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O WSGVTKZFVJSJOG-RCOVLWMOSA-N 0.000 description 3
- CZIVKMOEXPILDK-SRVKXCTJSA-N Asp-Tyr-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O CZIVKMOEXPILDK-SRVKXCTJSA-N 0.000 description 3
- MKRDNSWGJWTBKZ-GVXVVHGQSA-N Gln-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N MKRDNSWGJWTBKZ-GVXVVHGQSA-N 0.000 description 3
- AIGROOHQXCACHL-WDSKDSINSA-N Glu-Gly-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](C)C(O)=O AIGROOHQXCACHL-WDSKDSINSA-N 0.000 description 3
- ZWQVYZXPYSYPJD-RYUDHWBXSA-N Glu-Gly-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ZWQVYZXPYSYPJD-RYUDHWBXSA-N 0.000 description 3
- RAUDKMVXNOWDLS-WDSKDSINSA-N Glu-Gly-Ser Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O RAUDKMVXNOWDLS-WDSKDSINSA-N 0.000 description 3
- HVYWQYLBVXMXSV-GUBZILKMSA-N Glu-Leu-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O HVYWQYLBVXMXSV-GUBZILKMSA-N 0.000 description 3
- HJTSRYLPAYGEEC-SIUGBPQLSA-N Glu-Tyr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CCC(=O)O)N HJTSRYLPAYGEEC-SIUGBPQLSA-N 0.000 description 3
- JRDYDYXZKFNNRQ-XPUUQOCRSA-N Gly-Ala-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN JRDYDYXZKFNNRQ-XPUUQOCRSA-N 0.000 description 3
- OVSKVOOUFAKODB-UWVGGRQHSA-N Gly-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N OVSKVOOUFAKODB-UWVGGRQHSA-N 0.000 description 3
- OLPPXYMMIARYAL-QMMMGPOBSA-N Gly-Gly-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CNC(=O)CN OLPPXYMMIARYAL-QMMMGPOBSA-N 0.000 description 3
- SOEGEPHNZOISMT-BYPYZUCNSA-N Gly-Ser-Gly Chemical compound NCC(=O)N[C@@H](CO)C(=O)NCC(O)=O SOEGEPHNZOISMT-BYPYZUCNSA-N 0.000 description 3
- YXTFLTJYLIAZQG-FJXKBIBVSA-N Gly-Thr-Arg Chemical compound NCC(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YXTFLTJYLIAZQG-FJXKBIBVSA-N 0.000 description 3
- KFVUBLZRFSVDGO-BYULHYEWSA-N Ile-Gly-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O KFVUBLZRFSVDGO-BYULHYEWSA-N 0.000 description 3
- AXNGDPAKKCEKGY-QPHKQPEJSA-N Ile-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N AXNGDPAKKCEKGY-QPHKQPEJSA-N 0.000 description 3
- KBAPKNDWAGVGTH-IGISWZIWSA-N Ile-Ile-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 KBAPKNDWAGVGTH-IGISWZIWSA-N 0.000 description 3
- KCTIFOCXAIUQQK-QXEWZRGKSA-N Ile-Pro-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O KCTIFOCXAIUQQK-QXEWZRGKSA-N 0.000 description 3
- HZVRQFKRALAMQS-SLBDDTMCSA-N Ile-Trp-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC(=O)O)C(=O)O)N HZVRQFKRALAMQS-SLBDDTMCSA-N 0.000 description 3
- ZYVTXBXHIKGZMD-QSFUFRPTSA-N Ile-Val-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(=O)N)C(=O)O)N ZYVTXBXHIKGZMD-QSFUFRPTSA-N 0.000 description 3
- SITWEMZOJNKJCH-UHFFFAOYSA-N L-alanine-L-arginine Natural products CC(N)C(=O)NC(C(O)=O)CCCNC(N)=N SITWEMZOJNKJCH-UHFFFAOYSA-N 0.000 description 3
- DLFAACQHIRSQGG-CIUDSAMLSA-N Leu-Asp-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O DLFAACQHIRSQGG-CIUDSAMLSA-N 0.000 description 3
- ULXYQAJWJGLCNR-YUMQZZPRSA-N Leu-Asp-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O ULXYQAJWJGLCNR-YUMQZZPRSA-N 0.000 description 3
- KWLWZYMNUZJKMZ-IHRRRGAJSA-N Leu-Pro-Leu Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O KWLWZYMNUZJKMZ-IHRRRGAJSA-N 0.000 description 3
- FBNPMTNBFFAMMH-UHFFFAOYSA-N Leu-Val-Arg Natural products CC(C)CC(N)C(=O)NC(C(C)C)C(=O)NC(C(O)=O)CCCN=C(N)N FBNPMTNBFFAMMH-UHFFFAOYSA-N 0.000 description 3
- LPAJOCKCPRZEAG-MNXVOIDGSA-N Lys-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCCCN LPAJOCKCPRZEAG-MNXVOIDGSA-N 0.000 description 3
- IMAKMJCBYCSMHM-AVGNSLFASA-N Lys-Glu-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN IMAKMJCBYCSMHM-AVGNSLFASA-N 0.000 description 3
- HAUUXTXKJNVIFY-ONGXEEELSA-N Lys-Gly-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O HAUUXTXKJNVIFY-ONGXEEELSA-N 0.000 description 3
- URBJRJKWSUFCKS-AVGNSLFASA-N Lys-Met-Arg Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CCCCN)N URBJRJKWSUFCKS-AVGNSLFASA-N 0.000 description 3
- WXHHTBVYQOSYSL-FXQIFTODSA-N Met-Ala-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O WXHHTBVYQOSYSL-FXQIFTODSA-N 0.000 description 3
- QLESZRANMSYLCZ-CYDGBPFRSA-N Met-Pro-Ile Chemical compound [H]N[C@@H](CCSC)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O QLESZRANMSYLCZ-CYDGBPFRSA-N 0.000 description 3
- CMHTUJQZQXFNTQ-OEAJRASXSA-N Phe-Leu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC1=CC=CC=C1)N)O CMHTUJQZQXFNTQ-OEAJRASXSA-N 0.000 description 3
- OXKJSGGTHFMGDT-UFYCRDLUSA-N Phe-Phe-Arg Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O)C1=CC=CC=C1 OXKJSGGTHFMGDT-UFYCRDLUSA-N 0.000 description 3
- VYWNORHENYEQDW-YUMQZZPRSA-N Pro-Gly-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 VYWNORHENYEQDW-YUMQZZPRSA-N 0.000 description 3
- FMLRRBDLBJLJIK-DCAQKATOSA-N Pro-Leu-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 FMLRRBDLBJLJIK-DCAQKATOSA-N 0.000 description 3
- FUOGXAQMNJMBFG-WPRPVWTQSA-N Pro-Val-Gly Chemical compound OC(=O)CNC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 FUOGXAQMNJMBFG-WPRPVWTQSA-N 0.000 description 3
- 108010003201 RGH 0205 Proteins 0.000 description 3
- FTVRVZNYIYWJGB-ACZMJKKPSA-N Ser-Asp-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O FTVRVZNYIYWJGB-ACZMJKKPSA-N 0.000 description 3
- XNCUYZKGQOCOQH-YUMQZZPRSA-N Ser-Leu-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O XNCUYZKGQOCOQH-YUMQZZPRSA-N 0.000 description 3
- SHOMROOOQBDGRL-JHEQGTHGSA-N Thr-Glu-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O SHOMROOOQBDGRL-JHEQGTHGSA-N 0.000 description 3
- MGJLBZFUXUGMML-VOAKCMCISA-N Thr-Lys-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)O)N)O MGJLBZFUXUGMML-VOAKCMCISA-N 0.000 description 3
- NDXSOKGYKCGYKT-VEVYYDQMSA-N Thr-Pro-Asp Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O NDXSOKGYKCGYKT-VEVYYDQMSA-N 0.000 description 3
- SPIFGZFZMVLPHN-UNQGMJICSA-N Thr-Val-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SPIFGZFZMVLPHN-UNQGMJICSA-N 0.000 description 3
- BEWOXKJJMBKRQL-AAEUAGOBSA-N Trp-Gly-Asp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)O)C(=O)O)N BEWOXKJJMBKRQL-AAEUAGOBSA-N 0.000 description 3
- JAGGEZACYAAMIL-CQDKDKBSSA-N Tyr-Lys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC1=CC=C(C=C1)O)N JAGGEZACYAAMIL-CQDKDKBSSA-N 0.000 description 3
- XTOCLOATLKOZAU-JBACZVJFSA-N Tyr-Trp-Glu Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC3=CC=C(C=C3)O)N XTOCLOATLKOZAU-JBACZVJFSA-N 0.000 description 3
- ZLFHAAGHGQBQQN-AEJSXWLSSA-N Val-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N ZLFHAAGHGQBQQN-AEJSXWLSSA-N 0.000 description 3
- ZLFHAAGHGQBQQN-GUBZILKMSA-N Val-Ala-Pro Natural products CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O ZLFHAAGHGQBQQN-GUBZILKMSA-N 0.000 description 3
- COYSIHFOCOMGCF-WPRPVWTQSA-N Val-Arg-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CCCN=C(N)N COYSIHFOCOMGCF-WPRPVWTQSA-N 0.000 description 3
- COYSIHFOCOMGCF-UHFFFAOYSA-N Val-Arg-Gly Natural products CC(C)C(N)C(=O)NC(C(=O)NCC(O)=O)CCCN=C(N)N COYSIHFOCOMGCF-UHFFFAOYSA-N 0.000 description 3
- QHDXUYOYTPWCSK-RCOVLWMOSA-N Val-Asp-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)NCC(=O)O)N QHDXUYOYTPWCSK-RCOVLWMOSA-N 0.000 description 3
- VLDMQVZZWDOKQF-AUTRQRHGSA-N Val-Glu-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N VLDMQVZZWDOKQF-AUTRQRHGSA-N 0.000 description 3
- PIFJAFRUVWZRKR-QMMMGPOBSA-N Val-Gly-Gly Chemical compound CC(C)[C@H]([NH3+])C(=O)NCC(=O)NCC([O-])=O PIFJAFRUVWZRKR-QMMMGPOBSA-N 0.000 description 3
- UMPVMAYCLYMYGA-ONGXEEELSA-N Val-Leu-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O UMPVMAYCLYMYGA-ONGXEEELSA-N 0.000 description 3
- XXWBHOWRARMUOC-NHCYSSNCSA-N Val-Lys-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)N)C(=O)O)N XXWBHOWRARMUOC-NHCYSSNCSA-N 0.000 description 3
- QRVPEKJBBRYISE-XUXIUFHCSA-N Val-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C(C)C)N QRVPEKJBBRYISE-XUXIUFHCSA-N 0.000 description 3
- JAIZPWVHPQRYOU-ZJDVBMNYSA-N Val-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C(C)C)N)O JAIZPWVHPQRYOU-ZJDVBMNYSA-N 0.000 description 3
- DFQZDQPLWBSFEJ-LSJOCFKGSA-N Val-Val-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(=O)N)C(=O)O)N DFQZDQPLWBSFEJ-LSJOCFKGSA-N 0.000 description 3
- JSOXWWFKRJKTMT-WOPDTQHZSA-N Val-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N JSOXWWFKRJKTMT-WOPDTQHZSA-N 0.000 description 3
- 108010070944 alanylhistidine Proteins 0.000 description 3
- 108010010430 asparagine-proline-alanine Proteins 0.000 description 3
- 108010092854 aspartyllysine Proteins 0.000 description 3
- -1 azodicarboxylate Benzyl ester Chemical class 0.000 description 3
- 229910000019 calcium carbonate Inorganic materials 0.000 description 3
- 239000012634 fragment Substances 0.000 description 3
- 108010079547 glutamylmethionine Proteins 0.000 description 3
- 108010044374 isoleucyl-tyrosine Proteins 0.000 description 3
- BPHPUYQFMNQIOC-NXRLNHOXSA-N isopropyl beta-D-thiogalactopyranoside Chemical compound CC(C)S[C@@H]1O[C@H](CO)[C@H](O)[C@H](O)[C@H]1O BPHPUYQFMNQIOC-NXRLNHOXSA-N 0.000 description 3
- 108010044311 leucyl-glycyl-glycine Proteins 0.000 description 3
- 108010034529 leucyl-lysine Proteins 0.000 description 3
- 108010054155 lysyllysine Proteins 0.000 description 3
- 108010038320 lysylphenylalanine Proteins 0.000 description 3
- 108010017391 lysylvaline Proteins 0.000 description 3
- 238000004519 manufacturing process Methods 0.000 description 3
- BOPGDPNILDQYTO-NNYOXOHSSA-N nicotinamide-adenine dinucleotide Chemical compound C1=CCC(C(=O)N)=CN1[C@H]1[C@H](O)[C@H](O)[C@@H](COP(O)(=O)OP(O)(=O)OC[C@@H]2[C@H]([C@@H](O)[C@@H](O2)N2C3=NC=NC(N)=C3N=C2)O)O1 BOPGDPNILDQYTO-NNYOXOHSSA-N 0.000 description 3
- 229930027945 nicotinamide-adenine dinucleotide Natural products 0.000 description 3
- 239000002773 nucleotide Substances 0.000 description 3
- 125000003729 nucleotide group Chemical group 0.000 description 3
- 108010051242 phenylalanylserine Proteins 0.000 description 3
- 239000008055 phosphate buffer solution Substances 0.000 description 3
- 108010029020 prolylglycine Proteins 0.000 description 3
- 108010015796 prolylisoleucine Proteins 0.000 description 3
- 230000008929 regeneration Effects 0.000 description 3
- 238000011069 regeneration method Methods 0.000 description 3
- 108010020532 tyrosyl-proline Proteins 0.000 description 3
- JNTMAZFVYNDPLB-PEDHHIEDSA-N (2S,3S)-2-[[[(2S)-1-[(2S,3S)-2-amino-3-methyl-1-oxopentyl]-2-pyrrolidinyl]-oxomethyl]amino]-3-methylpentanoic acid Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JNTMAZFVYNDPLB-PEDHHIEDSA-N 0.000 description 2
- KDZIGQIDPXKMBA-UHFFFAOYSA-N 2-[[2-[[2-[(2-amino-3-methylbutanoyl)amino]acetyl]amino]-3-hydroxypropanoyl]amino]pentanedioic acid Chemical compound CC(C)C(N)C(=O)NCC(=O)NC(CO)C(=O)NC(C(O)=O)CCC(O)=O KDZIGQIDPXKMBA-UHFFFAOYSA-N 0.000 description 2
- FJVAQLJNTSUQPY-CIUDSAMLSA-N Ala-Ala-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN FJVAQLJNTSUQPY-CIUDSAMLSA-N 0.000 description 2
- CVGNCMIULZNYES-WHFBIAKZSA-N Ala-Asn-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O CVGNCMIULZNYES-WHFBIAKZSA-N 0.000 description 2
- ZIBWKCRKNFYTPT-ZKWXMUAHSA-N Ala-Asn-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O ZIBWKCRKNFYTPT-ZKWXMUAHSA-N 0.000 description 2
- WDIYWDJLXOCGRW-ACZMJKKPSA-N Ala-Asp-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WDIYWDJLXOCGRW-ACZMJKKPSA-N 0.000 description 2
- ZIWWTZWAKYBUOB-CIUDSAMLSA-N Ala-Asp-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O ZIWWTZWAKYBUOB-CIUDSAMLSA-N 0.000 description 2
- FOWHQTWRLFTELJ-FXQIFTODSA-N Ala-Asp-Met Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCSC)C(=O)O)N FOWHQTWRLFTELJ-FXQIFTODSA-N 0.000 description 2
- KXEVYGKATAMXJJ-ACZMJKKPSA-N Ala-Glu-Asp Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O KXEVYGKATAMXJJ-ACZMJKKPSA-N 0.000 description 2
- VGPWRRFOPXVGOH-BYPYZUCNSA-N Ala-Gly-Gly Chemical compound C[C@H](N)C(=O)NCC(=O)NCC(O)=O VGPWRRFOPXVGOH-BYPYZUCNSA-N 0.000 description 2
- OKEWAFFWMHBGPT-XPUUQOCRSA-N Ala-His-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CN=CN1 OKEWAFFWMHBGPT-XPUUQOCRSA-N 0.000 description 2
- NYDBKUNVSALYPX-NAKRPEOUSA-N Ala-Ile-Arg Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CCCN=C(N)N NYDBKUNVSALYPX-NAKRPEOUSA-N 0.000 description 2
- RZZMZYZXNJRPOJ-BJDJZHNGSA-N Ala-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](C)N RZZMZYZXNJRPOJ-BJDJZHNGSA-N 0.000 description 2
- LXAARTARZJJCMB-CIQUZCHMSA-N Ala-Ile-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LXAARTARZJJCMB-CIQUZCHMSA-N 0.000 description 2
- LNNSWWRRYJLGNI-NAKRPEOUSA-N Ala-Ile-Val Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O LNNSWWRRYJLGNI-NAKRPEOUSA-N 0.000 description 2
- SUMYEVXWCAYLLJ-GUBZILKMSA-N Ala-Leu-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O SUMYEVXWCAYLLJ-GUBZILKMSA-N 0.000 description 2
- CCDFBRZVTDDJNM-GUBZILKMSA-N Ala-Leu-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O CCDFBRZVTDDJNM-GUBZILKMSA-N 0.000 description 2
- MNZHHDPWDWQJCQ-YUMQZZPRSA-N Ala-Leu-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O MNZHHDPWDWQJCQ-YUMQZZPRSA-N 0.000 description 2
- OYJCVIGKMXUVKB-GARJFASQSA-N Ala-Leu-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N OYJCVIGKMXUVKB-GARJFASQSA-N 0.000 description 2
- PMQXMXAASGFUDX-SRVKXCTJSA-N Ala-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CCCCN PMQXMXAASGFUDX-SRVKXCTJSA-N 0.000 description 2
- VHEVVUZDDUCAKU-FXQIFTODSA-N Ala-Met-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(O)=O VHEVVUZDDUCAKU-FXQIFTODSA-N 0.000 description 2
- BFMIRJBURUXDRG-DLOVCJGASA-N Ala-Phe-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 BFMIRJBURUXDRG-DLOVCJGASA-N 0.000 description 2
- RMAWDDRDTRSZIR-ZLUOBGJFSA-N Ala-Ser-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O RMAWDDRDTRSZIR-ZLUOBGJFSA-N 0.000 description 2
- AOAKQKVICDWCLB-UWJYBYFXSA-N Ala-Tyr-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N AOAKQKVICDWCLB-UWJYBYFXSA-N 0.000 description 2
- YCTIYBUTCKNOTI-UWJYBYFXSA-N Ala-Tyr-Asp Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N YCTIYBUTCKNOTI-UWJYBYFXSA-N 0.000 description 2
- XSLGWYYNOSUMRM-ZKWXMUAHSA-N Ala-Val-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O XSLGWYYNOSUMRM-ZKWXMUAHSA-N 0.000 description 2
- BVLPIIBTWIYOML-ZKWXMUAHSA-N Ala-Val-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O BVLPIIBTWIYOML-ZKWXMUAHSA-N 0.000 description 2
- YJHKTAMKPGFJCT-NRPADANISA-N Ala-Val-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O YJHKTAMKPGFJCT-NRPADANISA-N 0.000 description 2
- XKHLBBQNPSOGPI-GUBZILKMSA-N Ala-Val-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](C)N XKHLBBQNPSOGPI-GUBZILKMSA-N 0.000 description 2
- DFCIPNHFKOQAME-FXQIFTODSA-N Arg-Ala-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O DFCIPNHFKOQAME-FXQIFTODSA-N 0.000 description 2
- NABSCJGZKWSNHX-RCWTZXSCSA-N Arg-Arg-Thr Chemical compound NC(N)=NCCC[C@@H](C(=O)N[C@@H]([C@H](O)C)C(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N NABSCJGZKWSNHX-RCWTZXSCSA-N 0.000 description 2
- FFEUXEAKYRCACT-PEDHHIEDSA-N Arg-Ile-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CCCNC(N)=N)[C@@H](C)CC)C(O)=O FFEUXEAKYRCACT-PEDHHIEDSA-N 0.000 description 2
- OOIMKQRCPJBGPD-XUXIUFHCSA-N Arg-Ile-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O OOIMKQRCPJBGPD-XUXIUFHCSA-N 0.000 description 2
- LVMUGODRNHFGRA-AVGNSLFASA-N Arg-Leu-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O LVMUGODRNHFGRA-AVGNSLFASA-N 0.000 description 2
- YBZMTKUDWXZLIX-UWVGGRQHSA-N Arg-Leu-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O YBZMTKUDWXZLIX-UWVGGRQHSA-N 0.000 description 2
- UZGFHWIJWPUPOH-IHRRRGAJSA-N Arg-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N UZGFHWIJWPUPOH-IHRRRGAJSA-N 0.000 description 2
- MJINRRBEMOLJAK-DCAQKATOSA-N Arg-Lys-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCN=C(N)N MJINRRBEMOLJAK-DCAQKATOSA-N 0.000 description 2
- VIINVRPKMUZYOI-DCAQKATOSA-N Arg-Met-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(O)=O VIINVRPKMUZYOI-DCAQKATOSA-N 0.000 description 2
- SLQQPJBDBVPVQV-JYJNAYRXSA-N Arg-Phe-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O SLQQPJBDBVPVQV-JYJNAYRXSA-N 0.000 description 2
- ADPACBMPYWJJCE-FXQIFTODSA-N Arg-Ser-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O ADPACBMPYWJJCE-FXQIFTODSA-N 0.000 description 2
- HZPSDHRYYIORKR-WHFBIAKZSA-N Asn-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CC(N)=O HZPSDHRYYIORKR-WHFBIAKZSA-N 0.000 description 2
- ACRYGQFHAQHDSF-ZLUOBGJFSA-N Asn-Asn-Asn Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ACRYGQFHAQHDSF-ZLUOBGJFSA-N 0.000 description 2
- PNHQRQTVBRDIEF-CIUDSAMLSA-N Asn-Leu-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC(=O)N)N PNHQRQTVBRDIEF-CIUDSAMLSA-N 0.000 description 2
- FHETWELNCBMRMG-HJGDQZAQSA-N Asn-Leu-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FHETWELNCBMRMG-HJGDQZAQSA-N 0.000 description 2
- PLTGTJAZQRGMPP-FXQIFTODSA-N Asn-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC(N)=O PLTGTJAZQRGMPP-FXQIFTODSA-N 0.000 description 2
- SZNGQSBRHFMZLT-IHRRRGAJSA-N Asn-Pro-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SZNGQSBRHFMZLT-IHRRRGAJSA-N 0.000 description 2
- QUCCLIXMVPIVOB-BZSNNMDCSA-N Asn-Tyr-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CC(=O)N)N QUCCLIXMVPIVOB-BZSNNMDCSA-N 0.000 description 2
- XBQSLMACWDXWLJ-GHCJXIJMSA-N Asp-Ala-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O XBQSLMACWDXWLJ-GHCJXIJMSA-N 0.000 description 2
- AXXCUABIFZPKPM-BQBZGAKWSA-N Asp-Arg-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O AXXCUABIFZPKPM-BQBZGAKWSA-N 0.000 description 2
- XYBJLTKSGFBLCS-QXEWZRGKSA-N Asp-Arg-Val Chemical compound NC(N)=NCCC[C@@H](C(=O)N[C@@H](C(C)C)C(O)=O)NC(=O)[C@@H](N)CC(O)=O XYBJLTKSGFBLCS-QXEWZRGKSA-N 0.000 description 2
- HAFCJCDJGIOYPW-WDSKDSINSA-N Asp-Gly-Gln Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O HAFCJCDJGIOYPW-WDSKDSINSA-N 0.000 description 2
- MFTVXYMXSAQZNL-DJFWLOJKSA-N Asp-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC(=O)O)N MFTVXYMXSAQZNL-DJFWLOJKSA-N 0.000 description 2
- RQHLMGCXCZUOGT-ZPFDUUQYSA-N Asp-Leu-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O RQHLMGCXCZUOGT-ZPFDUUQYSA-N 0.000 description 2
- MYOHQBFRJQFIDZ-KKUMJFAQSA-N Asp-Leu-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MYOHQBFRJQFIDZ-KKUMJFAQSA-N 0.000 description 2
- KPSHWSWFPUDEGF-FXQIFTODSA-N Asp-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC(O)=O KPSHWSWFPUDEGF-FXQIFTODSA-N 0.000 description 2
- ITGFVUYOLWBPQW-KKHAAJSZSA-N Asp-Thr-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O ITGFVUYOLWBPQW-KKHAAJSZSA-N 0.000 description 2
- BYLPQJAWXJWUCJ-YDHLFZDLSA-N Asp-Tyr-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O BYLPQJAWXJWUCJ-YDHLFZDLSA-N 0.000 description 2
- XWKBWZXGNXTDKY-ZKWXMUAHSA-N Asp-Val-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC(O)=O XWKBWZXGNXTDKY-ZKWXMUAHSA-N 0.000 description 2
- RKXVTTIQNKPCHU-KKHAAJSZSA-N Asp-Val-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC(O)=O RKXVTTIQNKPCHU-KKHAAJSZSA-N 0.000 description 2
- ZTQSAGDEMFDKMZ-UHFFFAOYSA-N Butyraldehyde Chemical compound CCCC=O ZTQSAGDEMFDKMZ-UHFFFAOYSA-N 0.000 description 2
- 108020004705 Codon Proteins 0.000 description 2
- AEJSNWMRPXAKCW-WHFBIAKZSA-N Cys-Ala-Gly Chemical compound SC[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O AEJSNWMRPXAKCW-WHFBIAKZSA-N 0.000 description 2
- NITLUESFANGEIW-BQBZGAKWSA-N Cys-Pro-Gly Chemical compound [H]N[C@@H](CS)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O NITLUESFANGEIW-BQBZGAKWSA-N 0.000 description 2
- KZZYVYWSXMFYEC-DCAQKATOSA-N Cys-Val-Leu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O KZZYVYWSXMFYEC-DCAQKATOSA-N 0.000 description 2
- 108020005199 Dehydrogenases Proteins 0.000 description 2
- IULKWYSYZSURJK-AVGNSLFASA-N Gln-Leu-Lys Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O IULKWYSYZSURJK-AVGNSLFASA-N 0.000 description 2
- JUUNNOLZGVYCJT-JYJNAYRXSA-N Gln-Phe-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N JUUNNOLZGVYCJT-JYJNAYRXSA-N 0.000 description 2
- XQDGOJPVMSWZSO-SRVKXCTJSA-N Gln-Pro-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CCC(=O)N)N XQDGOJPVMSWZSO-SRVKXCTJSA-N 0.000 description 2
- RUFHOVYUYSNDNY-ACZMJKKPSA-N Glu-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O RUFHOVYUYSNDNY-ACZMJKKPSA-N 0.000 description 2
- ITYRYNUZHPNCIK-GUBZILKMSA-N Glu-Ala-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O ITYRYNUZHPNCIK-GUBZILKMSA-N 0.000 description 2
- SBCYJMOOHUDWDA-NUMRIWBASA-N Glu-Asp-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SBCYJMOOHUDWDA-NUMRIWBASA-N 0.000 description 2
- QJCKNLPMTPXXEM-AUTRQRHGSA-N Glu-Glu-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O QJCKNLPMTPXXEM-AUTRQRHGSA-N 0.000 description 2
- KRGZZKWSBGPLKL-IUCAKERBSA-N Glu-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCC(=O)O)N KRGZZKWSBGPLKL-IUCAKERBSA-N 0.000 description 2
- MWMJCGBSIORNCD-AVGNSLFASA-N Glu-Leu-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O MWMJCGBSIORNCD-AVGNSLFASA-N 0.000 description 2
- OQXDUSZKISQQSS-GUBZILKMSA-N Glu-Lys-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O OQXDUSZKISQQSS-GUBZILKMSA-N 0.000 description 2
- ILWHFUZZCFYSKT-AVGNSLFASA-N Glu-Lys-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O ILWHFUZZCFYSKT-AVGNSLFASA-N 0.000 description 2
- KXTAGESXNQEZKB-DZKIICNBSA-N Glu-Phe-Val Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CC1=CC=CC=C1 KXTAGESXNQEZKB-DZKIICNBSA-N 0.000 description 2
- YQAQQKPWFOBSMU-WDCWCFNPSA-N Glu-Thr-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O YQAQQKPWFOBSMU-WDCWCFNPSA-N 0.000 description 2
- LZEUDRYSAZAJIO-AUTRQRHGSA-N Glu-Val-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O LZEUDRYSAZAJIO-AUTRQRHGSA-N 0.000 description 2
- QXUPRMQJDWJDFR-NRPADANISA-N Glu-Val-Ser Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O QXUPRMQJDWJDFR-NRPADANISA-N 0.000 description 2
- JBRBACJPBZNFMF-YUMQZZPRSA-N Gly-Ala-Lys Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN JBRBACJPBZNFMF-YUMQZZPRSA-N 0.000 description 2
- QSDKBRMVXSWAQE-BFHQHQDPSA-N Gly-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN QSDKBRMVXSWAQE-BFHQHQDPSA-N 0.000 description 2
- CEXINUGNTZFNRY-BYPYZUCNSA-N Gly-Cys-Gly Chemical compound [NH3+]CC(=O)N[C@@H](CS)C(=O)NCC([O-])=O CEXINUGNTZFNRY-BYPYZUCNSA-N 0.000 description 2
- QPDUVFSVVAOUHE-XVKPBYJWSA-N Gly-Gln-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCC(N)=O)NC(=O)CN)C(O)=O QPDUVFSVVAOUHE-XVKPBYJWSA-N 0.000 description 2
- XTQFHTHIAKKCTM-YFKPBYRVSA-N Gly-Glu-Gly Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O XTQFHTHIAKKCTM-YFKPBYRVSA-N 0.000 description 2
- YYPFZVIXAVDHIK-IUCAKERBSA-N Gly-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)CN YYPFZVIXAVDHIK-IUCAKERBSA-N 0.000 description 2
- JSNNHGHYGYMVCK-XVKPBYJWSA-N Gly-Glu-Val Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O JSNNHGHYGYMVCK-XVKPBYJWSA-N 0.000 description 2
- KMSGYZQRXPUKGI-BYPYZUCNSA-N Gly-Gly-Asn Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC(N)=O KMSGYZQRXPUKGI-BYPYZUCNSA-N 0.000 description 2
- XPJBQTCXPJNIFE-ZETCQYMHSA-N Gly-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)CN XPJBQTCXPJNIFE-ZETCQYMHSA-N 0.000 description 2
- UQJNXZSSGQIPIQ-FBCQKBJTSA-N Gly-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)CN UQJNXZSSGQIPIQ-FBCQKBJTSA-N 0.000 description 2
- IVSWQHKONQIOHA-YUMQZZPRSA-N Gly-His-Cys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)CN IVSWQHKONQIOHA-YUMQZZPRSA-N 0.000 description 2
- ORXZVPZCPMKHNR-IUCAKERBSA-N Gly-His-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CNC=N1 ORXZVPZCPMKHNR-IUCAKERBSA-N 0.000 description 2
- BHPQOIPBLYJNAW-NGZCFLSTSA-N Gly-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN BHPQOIPBLYJNAW-NGZCFLSTSA-N 0.000 description 2
- PAWIVEIWWYGBAM-YUMQZZPRSA-N Gly-Leu-Ala Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O PAWIVEIWWYGBAM-YUMQZZPRSA-N 0.000 description 2
- TVUWMSBGMVAHSJ-KBPBESRZSA-N Gly-Leu-Phe Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 TVUWMSBGMVAHSJ-KBPBESRZSA-N 0.000 description 2
- NSVOVKWEKGEOQB-LURJTMIESA-N Gly-Pro-Gly Chemical compound NCC(=O)N1CCC[C@H]1C(=O)NCC(O)=O NSVOVKWEKGEOQB-LURJTMIESA-N 0.000 description 2
- IXHQLZIWBCQBLQ-STQMWFEESA-N Gly-Pro-Phe Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 IXHQLZIWBCQBLQ-STQMWFEESA-N 0.000 description 2
- BMWFDYIYBAFROD-WPRPVWTQSA-N Gly-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN BMWFDYIYBAFROD-WPRPVWTQSA-N 0.000 description 2
- HUFUVTYGPOUCBN-MBLNEYKQSA-N Gly-Thr-Ile Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HUFUVTYGPOUCBN-MBLNEYKQSA-N 0.000 description 2
- WTUSRDZLLWGYAT-KCTSRDHCSA-N Gly-Trp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)CN WTUSRDZLLWGYAT-KCTSRDHCSA-N 0.000 description 2
- PNUFMLXHOLFRLD-KBPBESRZSA-N Gly-Tyr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 PNUFMLXHOLFRLD-KBPBESRZSA-N 0.000 description 2
- DKJWUIYLMLUBDX-XPUUQOCRSA-N Gly-Val-Cys Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CS)C(=O)O DKJWUIYLMLUBDX-XPUUQOCRSA-N 0.000 description 2
- BAYQNCWLXIDLHX-ONGXEEELSA-N Gly-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)CN BAYQNCWLXIDLHX-ONGXEEELSA-N 0.000 description 2
- HIAHVKLTHNOENC-HGNGGELXSA-N His-Glu-Ala Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O HIAHVKLTHNOENC-HGNGGELXSA-N 0.000 description 2
- AKEDPWJFQULLPE-IUCAKERBSA-N His-Glu-Gly Chemical compound N[C@@H](Cc1cnc[nH]1)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O AKEDPWJFQULLPE-IUCAKERBSA-N 0.000 description 2
- YADRBUZBKHHDAO-XPUUQOCRSA-N His-Gly-Ala Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H](C)C(O)=O YADRBUZBKHHDAO-XPUUQOCRSA-N 0.000 description 2
- SKOKHBGDXGTDDP-MELADBBJSA-N His-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N SKOKHBGDXGTDDP-MELADBBJSA-N 0.000 description 2
- STGQSBKUYSPPIG-CIUDSAMLSA-N His-Ser-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CN=CN1 STGQSBKUYSPPIG-CIUDSAMLSA-N 0.000 description 2
- XHQYFGPIRUHQIB-PBCZWWQYSA-N His-Thr-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CC1=CN=CN1 XHQYFGPIRUHQIB-PBCZWWQYSA-N 0.000 description 2
- HERITAGIPLEJMT-GVARAGBVSA-N Ile-Ala-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HERITAGIPLEJMT-GVARAGBVSA-N 0.000 description 2
- WECYRWOMWSCWNX-XUXIUFHCSA-N Ile-Arg-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(C)C)C(O)=O WECYRWOMWSCWNX-XUXIUFHCSA-N 0.000 description 2
- LPXHYGGZJOCAFR-MNXVOIDGSA-N Ile-Glu-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N LPXHYGGZJOCAFR-MNXVOIDGSA-N 0.000 description 2
- DFJJAVZIHDFOGQ-MNXVOIDGSA-N Ile-Glu-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N DFJJAVZIHDFOGQ-MNXVOIDGSA-N 0.000 description 2
- WUKLZPHVWAMZQV-UKJIMTQDSA-N Ile-Glu-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](C(C)C)C(=O)O)N WUKLZPHVWAMZQV-UKJIMTQDSA-N 0.000 description 2
- LBRCLQMZAHRTLV-ZKWXMUAHSA-N Ile-Gly-Ser Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O LBRCLQMZAHRTLV-ZKWXMUAHSA-N 0.000 description 2
- GLYJPWIRLBAIJH-UHFFFAOYSA-N Ile-Lys-Pro Natural products CCC(C)C(N)C(=O)NC(CCCCN)C(=O)N1CCCC1C(O)=O GLYJPWIRLBAIJH-UHFFFAOYSA-N 0.000 description 2
- FTUZWJVSNZMLPI-RVMXOQNASA-N Ile-Met-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N1CCC[C@@H]1C(=O)O)N FTUZWJVSNZMLPI-RVMXOQNASA-N 0.000 description 2
- IIWQTXMUALXGOV-PCBIJLKTSA-N Ile-Phe-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)O)C(=O)O)N IIWQTXMUALXGOV-PCBIJLKTSA-N 0.000 description 2
- WLRJHVNFGAOYPS-HJPIBITLSA-N Ile-Ser-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N WLRJHVNFGAOYPS-HJPIBITLSA-N 0.000 description 2
- PRTZQMBYUZFSFA-XEGUGMAKSA-N Ile-Tyr-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)NCC(=O)O)N PRTZQMBYUZFSFA-XEGUGMAKSA-N 0.000 description 2
- GVEODXUBBFDBPW-MGHWNKPDSA-N Ile-Tyr-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)CC1=CC=C(O)C=C1 GVEODXUBBFDBPW-MGHWNKPDSA-N 0.000 description 2
- KXUKTDGKLAOCQK-LSJOCFKGSA-N Ile-Val-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O KXUKTDGKLAOCQK-LSJOCFKGSA-N 0.000 description 2
- UYODHPPSCXBNCS-XUXIUFHCSA-N Ile-Val-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC(C)C UYODHPPSCXBNCS-XUXIUFHCSA-N 0.000 description 2
- HGCNKOLVKRAVHD-UHFFFAOYSA-N L-Met-L-Phe Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 HGCNKOLVKRAVHD-UHFFFAOYSA-N 0.000 description 2
- AGPKZVBTJJNPAG-WHFBIAKZSA-N L-isoleucine Chemical compound CC[C@H](C)[C@H](N)C(O)=O AGPKZVBTJJNPAG-WHFBIAKZSA-N 0.000 description 2
- SENJXOPIZNYLHU-UHFFFAOYSA-N L-leucyl-L-arginine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CCCN=C(N)N SENJXOPIZNYLHU-UHFFFAOYSA-N 0.000 description 2
- KZSNJWFQEVHDMF-BYPYZUCNSA-N L-valine Chemical compound CC(C)[C@H](N)C(O)=O KZSNJWFQEVHDMF-BYPYZUCNSA-N 0.000 description 2
- 241000880493 Leptailurus serval Species 0.000 description 2
- ZRLUISBDKUWAIZ-CIUDSAMLSA-N Leu-Ala-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O ZRLUISBDKUWAIZ-CIUDSAMLSA-N 0.000 description 2
- CQQGCWPXDHTTNF-GUBZILKMSA-N Leu-Ala-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O CQQGCWPXDHTTNF-GUBZILKMSA-N 0.000 description 2
- KSZCCRIGNVSHFH-UWVGGRQHSA-N Leu-Arg-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O KSZCCRIGNVSHFH-UWVGGRQHSA-N 0.000 description 2
- OIARJGNVARWKFP-YUMQZZPRSA-N Leu-Asn-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O OIARJGNVARWKFP-YUMQZZPRSA-N 0.000 description 2
- YKNBJXOJTURHCU-DCAQKATOSA-N Leu-Asp-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YKNBJXOJTURHCU-DCAQKATOSA-N 0.000 description 2
- FGNQZXKVAZIMCI-CIUDSAMLSA-N Leu-Asp-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N FGNQZXKVAZIMCI-CIUDSAMLSA-N 0.000 description 2
- QLQHWWCSCLZUMA-KKUMJFAQSA-N Leu-Asp-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 QLQHWWCSCLZUMA-KKUMJFAQSA-N 0.000 description 2
- ZTLGVASZOIKNIX-DCAQKATOSA-N Leu-Gln-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N ZTLGVASZOIKNIX-DCAQKATOSA-N 0.000 description 2
- FEHQLKKBVJHSEC-SZMVWBNQSA-N Leu-Glu-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC(C)C)C(O)=O)=CNC2=C1 FEHQLKKBVJHSEC-SZMVWBNQSA-N 0.000 description 2
- LAPSXOAUPNOINL-YUMQZZPRSA-N Leu-Gly-Asp Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O LAPSXOAUPNOINL-YUMQZZPRSA-N 0.000 description 2
- PBGDOSARRIJMEV-DLOVCJGASA-N Leu-His-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(O)=O PBGDOSARRIJMEV-DLOVCJGASA-N 0.000 description 2
- JKSIBWITFMQTOA-XUXIUFHCSA-N Leu-Ile-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O JKSIBWITFMQTOA-XUXIUFHCSA-N 0.000 description 2
- QNBVTHNJGCOVFA-AVGNSLFASA-N Leu-Leu-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O QNBVTHNJGCOVFA-AVGNSLFASA-N 0.000 description 2
- QNTJIDXQHWUBKC-BZSNNMDCSA-N Leu-Lys-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QNTJIDXQHWUBKC-BZSNNMDCSA-N 0.000 description 2
- CPONGMJGVIAWEH-DCAQKATOSA-N Leu-Met-Ala Chemical compound CSCC[C@H](NC(=O)[C@@H](N)CC(C)C)C(=O)N[C@@H](C)C(O)=O CPONGMJGVIAWEH-DCAQKATOSA-N 0.000 description 2
- WXZOHBVPVKABQN-DCAQKATOSA-N Leu-Met-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(=O)O)C(=O)O)N WXZOHBVPVKABQN-DCAQKATOSA-N 0.000 description 2
- MVHXGBZUJLWZOH-BJDJZHNGSA-N Leu-Ser-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MVHXGBZUJLWZOH-BJDJZHNGSA-N 0.000 description 2
- IWMJFLJQHIDZQW-KKUMJFAQSA-N Leu-Ser-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 IWMJFLJQHIDZQW-KKUMJFAQSA-N 0.000 description 2
- VDIARPPNADFEAV-WEDXCCLWSA-N Leu-Thr-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O VDIARPPNADFEAV-WEDXCCLWSA-N 0.000 description 2
- DAYQSYGBCUKVKT-VOAKCMCISA-N Leu-Thr-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O DAYQSYGBCUKVKT-VOAKCMCISA-N 0.000 description 2
- FBNPMTNBFFAMMH-AVGNSLFASA-N Leu-Val-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N FBNPMTNBFFAMMH-AVGNSLFASA-N 0.000 description 2
- FDBTVENULFNTAL-XQQFMLRXSA-N Leu-Val-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N FDBTVENULFNTAL-XQQFMLRXSA-N 0.000 description 2
- VHFFQUSNFFIZBT-CIUDSAMLSA-N Lys-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCCN)N VHFFQUSNFFIZBT-CIUDSAMLSA-N 0.000 description 2
- DCRWPTBMWMGADO-AVGNSLFASA-N Lys-Glu-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O DCRWPTBMWMGADO-AVGNSLFASA-N 0.000 description 2
- GQFDWEDHOQRNLC-QWRGUYRKSA-N Lys-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCCCN GQFDWEDHOQRNLC-QWRGUYRKSA-N 0.000 description 2
- CNGOEHJCLVCJHN-SRVKXCTJSA-N Lys-Pro-Glu Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O CNGOEHJCLVCJHN-SRVKXCTJSA-N 0.000 description 2
- YSPZCHGIWAQVKQ-AVGNSLFASA-N Lys-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCCCN YSPZCHGIWAQVKQ-AVGNSLFASA-N 0.000 description 2
- DLCAXBGXGOVUCD-PPCPHDFISA-N Lys-Thr-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O DLCAXBGXGOVUCD-PPCPHDFISA-N 0.000 description 2
- KUQWVNFMZLHAPA-CIUDSAMLSA-N Met-Ala-Gln Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O KUQWVNFMZLHAPA-CIUDSAMLSA-N 0.000 description 2
- MUYQDMBLDFEVRJ-LSJOCFKGSA-N Met-Ala-His Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CNC=N1 MUYQDMBLDFEVRJ-LSJOCFKGSA-N 0.000 description 2
- NSGXXVIHCIAISP-CIUDSAMLSA-N Met-Asn-Gln Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O NSGXXVIHCIAISP-CIUDSAMLSA-N 0.000 description 2
- SLQDSYZHHOKQSR-QXEWZRGKSA-N Met-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCSC SLQDSYZHHOKQSR-QXEWZRGKSA-N 0.000 description 2
- WXJXYMFUTRXRGO-UWVGGRQHSA-N Met-His-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CNC=N1 WXJXYMFUTRXRGO-UWVGGRQHSA-N 0.000 description 2
- MXEASDMFHUKOGE-ULQDDVLXSA-N Met-His-Tyr Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)N MXEASDMFHUKOGE-ULQDDVLXSA-N 0.000 description 2
- AFFKUNVPPLQUGA-DCAQKATOSA-N Met-Leu-Ala Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O AFFKUNVPPLQUGA-DCAQKATOSA-N 0.000 description 2
- UNPGTBHYKJOCCZ-DCAQKATOSA-N Met-Lys-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O UNPGTBHYKJOCCZ-DCAQKATOSA-N 0.000 description 2
- YYEIFXZOBZVDPH-DCAQKATOSA-N Met-Lys-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O YYEIFXZOBZVDPH-DCAQKATOSA-N 0.000 description 2
- AXHNAGAYRGCDLG-UWVGGRQHSA-N Met-Lys-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O AXHNAGAYRGCDLG-UWVGGRQHSA-N 0.000 description 2
- ILKCLLLOGPDNIP-RCWTZXSCSA-N Met-Met-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ILKCLLLOGPDNIP-RCWTZXSCSA-N 0.000 description 2
- HLZORBMOISUNIV-DCAQKATOSA-N Met-Ser-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(C)C HLZORBMOISUNIV-DCAQKATOSA-N 0.000 description 2
- WYBVBIHNJWOLCJ-UHFFFAOYSA-N N-L-arginyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCCN=C(N)N WYBVBIHNJWOLCJ-UHFFFAOYSA-N 0.000 description 2
- AJHCSUXXECOXOY-UHFFFAOYSA-N N-glycyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)CN)C(O)=O)=CNC2=C1 AJHCSUXXECOXOY-UHFFFAOYSA-N 0.000 description 2
- JGSARLDLIJGVTE-MBNYWOFBSA-N Penicillin G Chemical compound N([C@H]1[C@H]2SC([C@@H](N2C1=O)C(O)=O)(C)C)C(=O)CC1=CC=CC=C1 JGSARLDLIJGVTE-MBNYWOFBSA-N 0.000 description 2
- BJEYSVHMGIJORT-NHCYSSNCSA-N Phe-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 BJEYSVHMGIJORT-NHCYSSNCSA-N 0.000 description 2
- JNRFYJZCMHHGMH-UBHSHLNASA-N Phe-Ala-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 JNRFYJZCMHHGMH-UBHSHLNASA-N 0.000 description 2
- WMGVYPPIMZPWPN-SRVKXCTJSA-N Phe-Asp-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N WMGVYPPIMZPWPN-SRVKXCTJSA-N 0.000 description 2
- BNRFQGLWLQESBG-YESZJQIVSA-N Phe-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O BNRFQGLWLQESBG-YESZJQIVSA-N 0.000 description 2
- CJZTUKSFZUSNCC-FXQIFTODSA-N Pro-Asp-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H]1CCCN1 CJZTUKSFZUSNCC-FXQIFTODSA-N 0.000 description 2
- ZYBUKTMPPFQSHL-JYJNAYRXSA-N Pro-Asp-Trp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O ZYBUKTMPPFQSHL-JYJNAYRXSA-N 0.000 description 2
- KIPIKSXPPLABPN-CIUDSAMLSA-N Pro-Glu-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1 KIPIKSXPPLABPN-CIUDSAMLSA-N 0.000 description 2
- DMKWYMWNEKIPFC-IUCAKERBSA-N Pro-Gly-Arg Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O DMKWYMWNEKIPFC-IUCAKERBSA-N 0.000 description 2
- BCNRNJWSRFDPTQ-HJWJTTGWSA-N Pro-Ile-Phe Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O BCNRNJWSRFDPTQ-HJWJTTGWSA-N 0.000 description 2
- RFWXYTJSVDUBBZ-DCAQKATOSA-N Pro-Pro-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 RFWXYTJSVDUBBZ-DCAQKATOSA-N 0.000 description 2
- LEIKGVHQTKHOLM-IUCAKERBSA-N Pro-Pro-Gly Chemical compound OC(=O)CNC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 LEIKGVHQTKHOLM-IUCAKERBSA-N 0.000 description 2
- OOZJHTXCLJUODH-QXEWZRGKSA-N Pro-Val-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 OOZJHTXCLJUODH-QXEWZRGKSA-N 0.000 description 2
- YDTUEBLEAVANFH-RCWTZXSCSA-N Pro-Val-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 YDTUEBLEAVANFH-RCWTZXSCSA-N 0.000 description 2
- BKOKTRCZXRIQPX-ZLUOBGJFSA-N Ser-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CO)N BKOKTRCZXRIQPX-ZLUOBGJFSA-N 0.000 description 2
- NRCJWSGXMAPYQX-LPEHRKFASA-N Ser-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CO)N)C(=O)O NRCJWSGXMAPYQX-LPEHRKFASA-N 0.000 description 2
- KNCJWSPMTFFJII-ZLUOBGJFSA-N Ser-Cys-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(O)=O)C(O)=O KNCJWSPMTFFJII-ZLUOBGJFSA-N 0.000 description 2
- YPUSXTWURJANKF-KBIXCLLPSA-N Ser-Gln-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O YPUSXTWURJANKF-KBIXCLLPSA-N 0.000 description 2
- SMIDBHKWSYUBRZ-ACZMJKKPSA-N Ser-Glu-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O SMIDBHKWSYUBRZ-ACZMJKKPSA-N 0.000 description 2
- SNVIOQXAHVORQM-WDSKDSINSA-N Ser-Gly-Gln Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(O)=O SNVIOQXAHVORQM-WDSKDSINSA-N 0.000 description 2
- YMTLKLXDFCSCNX-BYPYZUCNSA-N Ser-Gly-Gly Chemical compound OC[C@H](N)C(=O)NCC(=O)NCC(O)=O YMTLKLXDFCSCNX-BYPYZUCNSA-N 0.000 description 2
- GZFAWAQTEYDKII-YUMQZZPRSA-N Ser-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CO GZFAWAQTEYDKII-YUMQZZPRSA-N 0.000 description 2
- ZIFYDQAFEMIZII-GUBZILKMSA-N Ser-Leu-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZIFYDQAFEMIZII-GUBZILKMSA-N 0.000 description 2
- OWCVUSJMEBGMOK-YUMQZZPRSA-N Ser-Lys-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O OWCVUSJMEBGMOK-YUMQZZPRSA-N 0.000 description 2
- WGDYNRCOQRERLZ-KKUMJFAQSA-N Ser-Lys-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CO)N WGDYNRCOQRERLZ-KKUMJFAQSA-N 0.000 description 2
- PYTKULIABVRXSC-BWBBJGPYSA-N Ser-Ser-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PYTKULIABVRXSC-BWBBJGPYSA-N 0.000 description 2
- JZRYFUGREMECBH-XPUUQOCRSA-N Ser-Val-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O JZRYFUGREMECBH-XPUUQOCRSA-N 0.000 description 2
- LGIMRDKGABDMBN-DCAQKATOSA-N Ser-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N LGIMRDKGABDMBN-DCAQKATOSA-N 0.000 description 2
- VFEHSAJCWWHDBH-RHYQMDGZSA-N Thr-Arg-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O VFEHSAJCWWHDBH-RHYQMDGZSA-N 0.000 description 2
- XPNSAQMEAVSQRD-FBCQKBJTSA-N Thr-Gly-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)NCC(O)=O XPNSAQMEAVSQRD-FBCQKBJTSA-N 0.000 description 2
- IGGFFPOIFHZYKC-PBCZWWQYSA-N Thr-His-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O IGGFFPOIFHZYKC-PBCZWWQYSA-N 0.000 description 2
- YUPVPKZBKCLFLT-QTKMDUPCSA-N Thr-His-Val Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](C(C)C)C(=O)O)N)O YUPVPKZBKCLFLT-QTKMDUPCSA-N 0.000 description 2
- MEJHFIOYJHTWMK-VOAKCMCISA-N Thr-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)[C@@H](C)O MEJHFIOYJHTWMK-VOAKCMCISA-N 0.000 description 2
- JWQNAFHCXKVZKZ-UVOCVTCTSA-N Thr-Lys-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JWQNAFHCXKVZKZ-UVOCVTCTSA-N 0.000 description 2
- DIHPMRTXPYMDJZ-KAOXEZKKSA-N Thr-Tyr-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N2CCC[C@@H]2C(=O)O)N)O DIHPMRTXPYMDJZ-KAOXEZKKSA-N 0.000 description 2
- MEZCXKYMMQJRDE-PMVMPFDFSA-N Trp-Leu-Tyr Chemical compound C([C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CC=1C2=CC=CC=C2NC=1)CC(C)C)C(O)=O)C1=CC=C(O)C=C1 MEZCXKYMMQJRDE-PMVMPFDFSA-N 0.000 description 2
- JEYRCNVVYHTZMY-SZMVWBNQSA-N Trp-Pro-Val Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O JEYRCNVVYHTZMY-SZMVWBNQSA-N 0.000 description 2
- UGFOSENEZHEQKX-PJODQICGSA-N Trp-Val-Ala Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)Cc1c[nH]c2ccccc12)C(=O)N[C@@H](C)C(O)=O UGFOSENEZHEQKX-PJODQICGSA-N 0.000 description 2
- YLRLHDFMMWDYTK-KKUMJFAQSA-N Tyr-Cys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CS)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 YLRLHDFMMWDYTK-KKUMJFAQSA-N 0.000 description 2
- FBHBVXUBTYVCRU-BZSNNMDCSA-N Tyr-His-Leu Chemical compound C([C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC(O)=CC=1)C1=CN=CN1 FBHBVXUBTYVCRU-BZSNNMDCSA-N 0.000 description 2
- BXPOOVDVGWEXDU-WZLNRYEVSA-N Tyr-Ile-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BXPOOVDVGWEXDU-WZLNRYEVSA-N 0.000 description 2
- HRHYJNLMIJWGLF-BZSNNMDCSA-N Tyr-Ser-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 HRHYJNLMIJWGLF-BZSNNMDCSA-N 0.000 description 2
- WYOBRXPIZVKNMF-IRXDYDNUSA-N Tyr-Tyr-Gly Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)NCC(O)=O)C1=CC=C(O)C=C1 WYOBRXPIZVKNMF-IRXDYDNUSA-N 0.000 description 2
- ABSXSJZNRAQDDI-KJEVXHAQSA-N Tyr-Val-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ABSXSJZNRAQDDI-KJEVXHAQSA-N 0.000 description 2
- CCEVJBJLPRNAFH-BVSLBCMMSA-N Tyr-Val-Trp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CC3=CC=C(C=C3)O)N CCEVJBJLPRNAFH-BVSLBCMMSA-N 0.000 description 2
- ASQFIHTXXMFENG-XPUUQOCRSA-N Val-Ala-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O ASQFIHTXXMFENG-XPUUQOCRSA-N 0.000 description 2
- RUCNAYOMFXRIKJ-DCAQKATOSA-N Val-Ala-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN RUCNAYOMFXRIKJ-DCAQKATOSA-N 0.000 description 2
- CVUDMNSZAIZFAE-TUAOUCFPSA-N Val-Arg-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1CCC[C@@H]1C(=O)O)N CVUDMNSZAIZFAE-TUAOUCFPSA-N 0.000 description 2
- CVUDMNSZAIZFAE-UHFFFAOYSA-N Val-Arg-Pro Natural products NC(N)=NCCCC(NC(=O)C(N)C(C)C)C(=O)N1CCCC1C(O)=O CVUDMNSZAIZFAE-UHFFFAOYSA-N 0.000 description 2
- AUMNPAUHKUNHHN-BYULHYEWSA-N Val-Asn-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N AUMNPAUHKUNHHN-BYULHYEWSA-N 0.000 description 2
- XJFXZQKJQGYFMM-GUBZILKMSA-N Val-Cys-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(=O)O)N XJFXZQKJQGYFMM-GUBZILKMSA-N 0.000 description 2
- MHAHQDBEIDPFQS-NHCYSSNCSA-N Val-Glu-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)C(C)C MHAHQDBEIDPFQS-NHCYSSNCSA-N 0.000 description 2
- WFENBJPLZMPVAX-XVKPBYJWSA-N Val-Gly-Glu Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O WFENBJPLZMPVAX-XVKPBYJWSA-N 0.000 description 2
- URIRWLJVWHYLET-ONGXEEELSA-N Val-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)C(C)C URIRWLJVWHYLET-ONGXEEELSA-N 0.000 description 2
- FEFZWCSXEMVSPO-LSJOCFKGSA-N Val-His-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](Cc1cnc[nH]1)C(=O)N[C@@H](C)C(O)=O FEFZWCSXEMVSPO-LSJOCFKGSA-N 0.000 description 2
- FTKXYXACXYOHND-XUXIUFHCSA-N Val-Ile-Leu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O FTKXYXACXYOHND-XUXIUFHCSA-N 0.000 description 2
- DJQIUOKSNRBTSV-CYDGBPFRSA-N Val-Ile-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](C(C)C)N DJQIUOKSNRBTSV-CYDGBPFRSA-N 0.000 description 2
- ZHQWPWQNVRCXAX-XQQFMLRXSA-N Val-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N ZHQWPWQNVRCXAX-XQQFMLRXSA-N 0.000 description 2
- MBGFDZDWMDLXHQ-GUBZILKMSA-N Val-Met-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](C(C)C)N MBGFDZDWMDLXHQ-GUBZILKMSA-N 0.000 description 2
- LJSZPMSUYKKKCP-UBHSHLNASA-N Val-Phe-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=CC=C1 LJSZPMSUYKKKCP-UBHSHLNASA-N 0.000 description 2
- KISFXYYRKKNLOP-IHRRRGAJSA-N Val-Phe-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)O)N KISFXYYRKKNLOP-IHRRRGAJSA-N 0.000 description 2
- YTNGABPUXFEOGU-SRVKXCTJSA-N Val-Pro-Arg Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O YTNGABPUXFEOGU-SRVKXCTJSA-N 0.000 description 2
- YQYFYUSYEDNLSD-YEPSODPASA-N Val-Thr-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O YQYFYUSYEDNLSD-YEPSODPASA-N 0.000 description 2
- TVGWMCTYUFBXAP-QTKMDUPCSA-N Val-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](C(C)C)N)O TVGWMCTYUFBXAP-QTKMDUPCSA-N 0.000 description 2
- JVGDAEKKZKKZFO-RCWTZXSCSA-N Val-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](C(C)C)N)O JVGDAEKKZKKZFO-RCWTZXSCSA-N 0.000 description 2
- KZSNJWFQEVHDMF-UHFFFAOYSA-N Valine Natural products CC(C)C(N)C(O)=O KZSNJWFQEVHDMF-UHFFFAOYSA-N 0.000 description 2
- 108010008685 alanyl-glutamyl-aspartic acid Proteins 0.000 description 2
- 108010024078 alanyl-glycyl-serine Proteins 0.000 description 2
- 108010044940 alanylglutamine Proteins 0.000 description 2
- QWCKQJZIFLGMSD-UHFFFAOYSA-N alpha-aminobutyric acid Chemical compound CCC(N)C(O)=O QWCKQJZIFLGMSD-UHFFFAOYSA-N 0.000 description 2
- 239000003242 anti bacterial agent Substances 0.000 description 2
- 229940088710 antibiotic agent Drugs 0.000 description 2
- 108010013835 arginine glutamate Proteins 0.000 description 2
- 108010069926 arginyl-glycyl-serine Proteins 0.000 description 2
- 108010077245 asparaginyl-proline Proteins 0.000 description 2
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 2
- 230000037429 base substitution Effects 0.000 description 2
- 238000004364 calculation method Methods 0.000 description 2
- 238000005119 centrifugation Methods 0.000 description 2
- 239000012295 chemical reaction liquid Substances 0.000 description 2
- 239000003153 chemical reaction reagent Substances 0.000 description 2
- 230000008878 coupling Effects 0.000 description 2
- 238000010168 coupling process Methods 0.000 description 2
- 238000005859 coupling reaction Methods 0.000 description 2
- 238000010586 diagram Methods 0.000 description 2
- 108010054813 diprotin B Proteins 0.000 description 2
- 230000002255 enzymatic effect Effects 0.000 description 2
- 239000013613 expression plasmid Substances 0.000 description 2
- 235000013305 food Nutrition 0.000 description 2
- 108010057083 glutamyl-aspartyl-leucine Proteins 0.000 description 2
- 108010008237 glutamyl-valyl-glycine Proteins 0.000 description 2
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 2
- 108010027668 glycyl-alanyl-valine Proteins 0.000 description 2
- 108010062266 glycyl-glycyl-argininal Proteins 0.000 description 2
- XKUKSGPZAADMRA-UHFFFAOYSA-N glycyl-glycyl-glycine Chemical compound NCC(=O)NCC(=O)NCC(O)=O XKUKSGPZAADMRA-UHFFFAOYSA-N 0.000 description 2
- 108010026364 glycyl-glycyl-leucine Proteins 0.000 description 2
- 108010082286 glycyl-seryl-alanine Proteins 0.000 description 2
- 108010040030 histidinoalanine Proteins 0.000 description 2
- 108010036413 histidylglycine Proteins 0.000 description 2
- 239000000543 intermediate Substances 0.000 description 2
- 229960000310 isoleucine Drugs 0.000 description 2
- AGPKZVBTJJNPAG-UHFFFAOYSA-N isoleucine Natural products CCC(C)C(N)C(O)=O AGPKZVBTJJNPAG-UHFFFAOYSA-N 0.000 description 2
- 108010031424 isoleucyl-prolyl-proline Proteins 0.000 description 2
- 108010060857 isoleucyl-valyl-tyrosine Proteins 0.000 description 2
- 108010000761 leucylarginine Proteins 0.000 description 2
- 108010057821 leucylproline Proteins 0.000 description 2
- 238000004460 liquid liquid chromatography Methods 0.000 description 2
- 108010005942 methionylglycine Proteins 0.000 description 2
- 108010068488 methionylphenylalanine Proteins 0.000 description 2
- 230000000813 microbial effect Effects 0.000 description 2
- 229910052759 nickel Inorganic materials 0.000 description 2
- 238000005457 optimization Methods 0.000 description 2
- 108010084525 phenylalanyl-phenylalanyl-glycine Proteins 0.000 description 2
- 108010073025 phenylalanylphenylalanine Proteins 0.000 description 2
- 108010087846 prolyl-prolyl-glycine Proteins 0.000 description 2
- 108010004914 prolylarginine Proteins 0.000 description 2
- 108010090894 prolylleucine Proteins 0.000 description 2
- 108010048397 seryl-lysyl-leucine Proteins 0.000 description 2
- 229910052717 sulfur Inorganic materials 0.000 description 2
- 108010033670 threonyl-aspartyl-tyrosine Proteins 0.000 description 2
- 108700004896 tripeptide FEG Proteins 0.000 description 2
- 239000004474 valine Substances 0.000 description 2
- 108010034462 valyl-leucyl-prolyl-valyl-proline Proteins 0.000 description 2
- 108010015385 valyl-prolyl-proline Proteins 0.000 description 2
- 108010009962 valyltyrosine Proteins 0.000 description 2
- 108010000998 wheylin-2 peptide Proteins 0.000 description 2
- XVZCXCTYGHPNEM-IHRRRGAJSA-N (2s)-1-[(2s)-2-[[(2s)-2-amino-4-methylpentanoyl]amino]-4-methylpentanoyl]pyrrolidine-2-carboxylic acid Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(O)=O XVZCXCTYGHPNEM-IHRRRGAJSA-N 0.000 description 1
- JBFQOLHAGBKPTP-NZATWWQASA-N (2s)-2-[[(2s)-4-carboxy-2-[[3-carboxy-2-[[(2s)-2,6-diaminohexanoyl]amino]propanoyl]amino]butanoyl]amino]-4-methylpentanoic acid Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)C(CC(O)=O)NC(=O)[C@@H](N)CCCCN JBFQOLHAGBKPTP-NZATWWQASA-N 0.000 description 1
- SCAKQYSGEIHPLV-IUCAKERBSA-N (4S)-4-[(2-aminoacetyl)amino]-5-[(2S)-2-(carboxymethylcarbamoyl)pyrrolidin-1-yl]-5-oxopentanoic acid Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O SCAKQYSGEIHPLV-IUCAKERBSA-N 0.000 description 1
- AAQGRPOPTAUUBM-ZLUOBGJFSA-N Ala-Ala-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O AAQGRPOPTAUUBM-ZLUOBGJFSA-N 0.000 description 1
- YLTKNGYYPIWKHZ-ACZMJKKPSA-N Ala-Ala-Glu Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O YLTKNGYYPIWKHZ-ACZMJKKPSA-N 0.000 description 1
- RLMISHABBKUNFO-WHFBIAKZSA-N Ala-Ala-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O RLMISHABBKUNFO-WHFBIAKZSA-N 0.000 description 1
- PIPTUBPKYFRLCP-NHCYSSNCSA-N Ala-Ala-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 PIPTUBPKYFRLCP-NHCYSSNCSA-N 0.000 description 1
- JBVSSSZFNTXJDX-YTLHQDLWSA-N Ala-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@H](C)N JBVSSSZFNTXJDX-YTLHQDLWSA-N 0.000 description 1
- KQFRUSHJPKXBMB-BHDSKKPTSA-N Ala-Ala-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](C)NC(=O)[C@@H](N)C)C(O)=O)=CNC2=C1 KQFRUSHJPKXBMB-BHDSKKPTSA-N 0.000 description 1
- JBGSZRYCXBPWGX-BQBZGAKWSA-N Ala-Arg-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)[C@@H](N)C)CCCN=C(N)N JBGSZRYCXBPWGX-BQBZGAKWSA-N 0.000 description 1
- LBJYAILUMSUTAM-ZLUOBGJFSA-N Ala-Asn-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O LBJYAILUMSUTAM-ZLUOBGJFSA-N 0.000 description 1
- PXKLCFFSVLKOJM-ACZMJKKPSA-N Ala-Asn-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O PXKLCFFSVLKOJM-ACZMJKKPSA-N 0.000 description 1
- NXSFUECZFORGOG-CIUDSAMLSA-N Ala-Asn-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NXSFUECZFORGOG-CIUDSAMLSA-N 0.000 description 1
- PBAMJJXWDQXOJA-FXQIFTODSA-N Ala-Asp-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N PBAMJJXWDQXOJA-FXQIFTODSA-N 0.000 description 1
- 108010040956 Ala-Asp-Glu-Leu Proteins 0.000 description 1
- GWFSQQNGMPGBEF-GHCJXIJMSA-N Ala-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C)N GWFSQQNGMPGBEF-GHCJXIJMSA-N 0.000 description 1
- MKZCBYZBCINNJN-DLOVCJGASA-N Ala-Asp-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MKZCBYZBCINNJN-DLOVCJGASA-N 0.000 description 1
- JPGBXANAQYHTLA-DRZSPHRISA-N Ala-Gln-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 JPGBXANAQYHTLA-DRZSPHRISA-N 0.000 description 1
- PWYFCPCBOYMOGB-LKTVYLICSA-N Ala-Gln-Trp Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N PWYFCPCBOYMOGB-LKTVYLICSA-N 0.000 description 1
- FUSPCLTUKXQREV-ACZMJKKPSA-N Ala-Glu-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O FUSPCLTUKXQREV-ACZMJKKPSA-N 0.000 description 1
- WKOBSJOZRJJVRZ-FXQIFTODSA-N Ala-Glu-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WKOBSJOZRJJVRZ-FXQIFTODSA-N 0.000 description 1
- HXNNRBHASOSVPG-GUBZILKMSA-N Ala-Glu-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O HXNNRBHASOSVPG-GUBZILKMSA-N 0.000 description 1
- HMRWQTHUDVXMGH-GUBZILKMSA-N Ala-Glu-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN HMRWQTHUDVXMGH-GUBZILKMSA-N 0.000 description 1
- FBHOPGDGELNWRH-DRZSPHRISA-N Ala-Glu-Phe Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O FBHOPGDGELNWRH-DRZSPHRISA-N 0.000 description 1
- ZVFVBBGVOILKPO-WHFBIAKZSA-N Ala-Gly-Ala Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O ZVFVBBGVOILKPO-WHFBIAKZSA-N 0.000 description 1
- BTBUEVAGZCKULD-XPUUQOCRSA-N Ala-Gly-His Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CN=CN1 BTBUEVAGZCKULD-XPUUQOCRSA-N 0.000 description 1
- LMFXXZPPZDCPTA-ZKWXMUAHSA-N Ala-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N LMFXXZPPZDCPTA-ZKWXMUAHSA-N 0.000 description 1
- NBTGEURICRTMGL-WHFBIAKZSA-N Ala-Gly-Ser Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O NBTGEURICRTMGL-WHFBIAKZSA-N 0.000 description 1
- SIGTYDNEPYEXGK-ZANVPECISA-N Ala-Gly-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)CNC(=O)[C@@H](N)C)C(O)=O)=CNC2=C1 SIGTYDNEPYEXGK-ZANVPECISA-N 0.000 description 1
- KMGOBAQSCKTBGD-DLOVCJGASA-N Ala-His-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CC1=CN=CN1 KMGOBAQSCKTBGD-DLOVCJGASA-N 0.000 description 1
- HUUOZYZWNCXTFK-INTQDDNPSA-N Ala-His-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N2CCC[C@@H]2C(=O)O)N HUUOZYZWNCXTFK-INTQDDNPSA-N 0.000 description 1
- HJGZVLLLBJLXFC-LSJOCFKGSA-N Ala-His-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C(C)C)C(O)=O HJGZVLLLBJLXFC-LSJOCFKGSA-N 0.000 description 1
- RUQBGIMJQUWXPP-CYDGBPFRSA-N Ala-Leu-Ala-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O RUQBGIMJQUWXPP-CYDGBPFRSA-N 0.000 description 1
- YHKANGMVQWRMAP-DCAQKATOSA-N Ala-Leu-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YHKANGMVQWRMAP-DCAQKATOSA-N 0.000 description 1
- AWZKCUCQJNTBAD-SRVKXCTJSA-N Ala-Leu-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCCN AWZKCUCQJNTBAD-SRVKXCTJSA-N 0.000 description 1
- OPZJWMJPCNNZNT-DCAQKATOSA-N Ala-Leu-Met Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)O)N OPZJWMJPCNNZNT-DCAQKATOSA-N 0.000 description 1
- QUIGLPSHIFPEOV-CIUDSAMLSA-N Ala-Lys-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O QUIGLPSHIFPEOV-CIUDSAMLSA-N 0.000 description 1
- XHNLCGXYBXNRIS-BJDJZHNGSA-N Ala-Lys-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O XHNLCGXYBXNRIS-BJDJZHNGSA-N 0.000 description 1
- KQESEZXHYOUIIM-CQDKDKBSSA-N Ala-Lys-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KQESEZXHYOUIIM-CQDKDKBSSA-N 0.000 description 1
- MDNAVFBZPROEHO-DCAQKATOSA-N Ala-Lys-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O MDNAVFBZPROEHO-DCAQKATOSA-N 0.000 description 1
- MDNAVFBZPROEHO-UHFFFAOYSA-N Ala-Lys-Val Natural products CC(C)C(C(O)=O)NC(=O)C(NC(=O)C(C)N)CCCCN MDNAVFBZPROEHO-UHFFFAOYSA-N 0.000 description 1
- XUCHENWTTBFODJ-FXQIFTODSA-N Ala-Met-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O XUCHENWTTBFODJ-FXQIFTODSA-N 0.000 description 1
- MAEQBGQTDWDSJQ-LSJOCFKGSA-N Ala-Met-His Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N MAEQBGQTDWDSJQ-LSJOCFKGSA-N 0.000 description 1
- GKAZXNDATBWNBI-DCAQKATOSA-N Ala-Met-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)O)N GKAZXNDATBWNBI-DCAQKATOSA-N 0.000 description 1
- PEIBBAXIKUAYGN-UBHSHLNASA-N Ala-Phe-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 PEIBBAXIKUAYGN-UBHSHLNASA-N 0.000 description 1
- DHBKYZYFEXXUAK-ONGXEEELSA-N Ala-Phe-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 DHBKYZYFEXXUAK-ONGXEEELSA-N 0.000 description 1
- RNHKOQHGYMTHFR-UBHSHLNASA-N Ala-Phe-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CC1=CC=CC=C1 RNHKOQHGYMTHFR-UBHSHLNASA-N 0.000 description 1
- OSRZOHXQCUFIQG-FPMFFAJLSA-N Ala-Phe-Pro Chemical compound C([C@H](NC(=O)[C@@H]([NH3+])C)C(=O)N1[C@H](CCC1)C([O-])=O)C1=CC=CC=C1 OSRZOHXQCUFIQG-FPMFFAJLSA-N 0.000 description 1
- ADSGHMXEAZJJNF-DCAQKATOSA-N Ala-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C)N ADSGHMXEAZJJNF-DCAQKATOSA-N 0.000 description 1
- DYXOFPBJBAHWFY-JBDRJPRFSA-N Ala-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@H](C)N DYXOFPBJBAHWFY-JBDRJPRFSA-N 0.000 description 1
- HOVPGJUNRLMIOZ-CIUDSAMLSA-N Ala-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@H](C)N HOVPGJUNRLMIOZ-CIUDSAMLSA-N 0.000 description 1
- MMLHRUJLOUSRJX-CIUDSAMLSA-N Ala-Ser-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN MMLHRUJLOUSRJX-CIUDSAMLSA-N 0.000 description 1
- PEEYDECOOVQKRZ-DLOVCJGASA-N Ala-Ser-Phe Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PEEYDECOOVQKRZ-DLOVCJGASA-N 0.000 description 1
- ARHJJAAWNWOACN-FXQIFTODSA-N Ala-Ser-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O ARHJJAAWNWOACN-FXQIFTODSA-N 0.000 description 1
- WNHNMKOFKCHKKD-BFHQHQDPSA-N Ala-Thr-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O WNHNMKOFKCHKKD-BFHQHQDPSA-N 0.000 description 1
- SAHQGRZIQVEJPF-JXUBOQSCSA-N Ala-Thr-Lys Chemical compound C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CCCCN SAHQGRZIQVEJPF-JXUBOQSCSA-N 0.000 description 1
- JJHBEVZAZXZREW-LFSVMHDDSA-N Ala-Thr-Phe Chemical compound C[C@@H](O)[C@H](NC(=O)[C@H](C)N)C(=O)N[C@@H](Cc1ccccc1)C(O)=O JJHBEVZAZXZREW-LFSVMHDDSA-N 0.000 description 1
- CREYEAPXISDKSB-FQPOAREZSA-N Ala-Thr-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CREYEAPXISDKSB-FQPOAREZSA-N 0.000 description 1
- WZGZDOXCDLLTHE-SYWGBEHUSA-N Ala-Trp-Ile Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(O)=O)NC(=O)[C@H](C)N)=CNC2=C1 WZGZDOXCDLLTHE-SYWGBEHUSA-N 0.000 description 1
- MTDDMSUUXNQMKK-BPNCWPANSA-N Ala-Tyr-Arg Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N MTDDMSUUXNQMKK-BPNCWPANSA-N 0.000 description 1
- BGGAIXWIZCIFSG-XDTLVQLUSA-N Ala-Tyr-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O BGGAIXWIZCIFSG-XDTLVQLUSA-N 0.000 description 1
- PGNNQOJOEGFAOR-KWQFWETISA-N Ala-Tyr-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=C(O)C=C1 PGNNQOJOEGFAOR-KWQFWETISA-N 0.000 description 1
- VYMJAWXRWHJIMS-LKTVYLICSA-N Ala-Tyr-His Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N VYMJAWXRWHJIMS-LKTVYLICSA-N 0.000 description 1
- QRIYOHQJRDHFKF-UWJYBYFXSA-N Ala-Tyr-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=C(O)C=C1 QRIYOHQJRDHFKF-UWJYBYFXSA-N 0.000 description 1
- DEAGTWNKODHUIY-MRFFXTKBSA-N Ala-Tyr-Trp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O DEAGTWNKODHUIY-MRFFXTKBSA-N 0.000 description 1
- IYKVSFNGSWTTNZ-GUBZILKMSA-N Ala-Val-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IYKVSFNGSWTTNZ-GUBZILKMSA-N 0.000 description 1
- BOKLLPVAQDSLHC-FXQIFTODSA-N Ala-Val-Cys Chemical compound C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CS)C(=O)O)N BOKLLPVAQDSLHC-FXQIFTODSA-N 0.000 description 1
- ZCUFMRIQCPNOHZ-NRPADANISA-N Ala-Val-Gln Chemical compound C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N ZCUFMRIQCPNOHZ-NRPADANISA-N 0.000 description 1
- DHONNEYAZPNGSG-UBHSHLNASA-N Ala-Val-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 DHONNEYAZPNGSG-UBHSHLNASA-N 0.000 description 1
- NLYYHIKRBRMAJV-AEJSXWLSSA-N Ala-Val-Pro Chemical compound C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N NLYYHIKRBRMAJV-AEJSXWLSSA-N 0.000 description 1
- REWSWYIDQIELBE-FXQIFTODSA-N Ala-Val-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O REWSWYIDQIELBE-FXQIFTODSA-N 0.000 description 1
- 101100407030 Arabidopsis thaliana PAO2 gene Proteins 0.000 description 1
- YFWTXMRJJDNTLM-LSJOCFKGSA-N Arg-Ala-His Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N YFWTXMRJJDNTLM-LSJOCFKGSA-N 0.000 description 1
- DCGLNNVKIZXQOJ-FXQIFTODSA-N Arg-Asn-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N DCGLNNVKIZXQOJ-FXQIFTODSA-N 0.000 description 1
- RWWPBOUMKFBHAL-FXQIFTODSA-N Arg-Asn-Cys Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CS)C(O)=O RWWPBOUMKFBHAL-FXQIFTODSA-N 0.000 description 1
- OZNSCVPYWZRQPY-CIUDSAMLSA-N Arg-Asp-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O OZNSCVPYWZRQPY-CIUDSAMLSA-N 0.000 description 1
- JSHVMZANPXCDTL-GMOBBJLQSA-N Arg-Asp-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JSHVMZANPXCDTL-GMOBBJLQSA-N 0.000 description 1
- HPKSHFSEXICTLI-CIUDSAMLSA-N Arg-Glu-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O HPKSHFSEXICTLI-CIUDSAMLSA-N 0.000 description 1
- CYXCAHZVPFREJD-LURJTMIESA-N Arg-Gly-Gly Chemical compound NC(=N)NCCC[C@H](N)C(=O)NCC(=O)NCC(O)=O CYXCAHZVPFREJD-LURJTMIESA-N 0.000 description 1
- SYAUZLVLXCDRSH-IUCAKERBSA-N Arg-Gly-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCCN=C(N)N)N SYAUZLVLXCDRSH-IUCAKERBSA-N 0.000 description 1
- HAVKMRGWNXMCDR-STQMWFEESA-N Arg-Gly-Phe Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O HAVKMRGWNXMCDR-STQMWFEESA-N 0.000 description 1
- SLNCSSWAIDUUGF-LSJOCFKGSA-N Arg-His-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(O)=O SLNCSSWAIDUUGF-LSJOCFKGSA-N 0.000 description 1
- QEHMMRSQJMOYNO-DCAQKATOSA-N Arg-His-Asn Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N QEHMMRSQJMOYNO-DCAQKATOSA-N 0.000 description 1
- UAOSDDXCTBIPCA-QXEWZRGKSA-N Arg-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCCN=C(N)N)N UAOSDDXCTBIPCA-QXEWZRGKSA-N 0.000 description 1
- OKKMBOSPBDASEP-CYDGBPFRSA-N Arg-Ile-Met Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCSC)C(O)=O OKKMBOSPBDASEP-CYDGBPFRSA-N 0.000 description 1
- GXXWTNKNFFKTJB-NAKRPEOUSA-N Arg-Ile-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O GXXWTNKNFFKTJB-NAKRPEOUSA-N 0.000 description 1
- YKZJPIPFKGYHKY-DCAQKATOSA-N Arg-Leu-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O YKZJPIPFKGYHKY-DCAQKATOSA-N 0.000 description 1
- OTZMRMHZCMZOJZ-SRVKXCTJSA-N Arg-Leu-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O OTZMRMHZCMZOJZ-SRVKXCTJSA-N 0.000 description 1
- NGTYEHIRESTSRX-UWVGGRQHSA-N Arg-Lys-Gly Chemical compound NCCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N NGTYEHIRESTSRX-UWVGGRQHSA-N 0.000 description 1
- ZDSVOHFDQKSTAY-UHFFFAOYSA-N Arg-Lys-Met-Phe Chemical compound NC(N)=NCCCC(N)C(=O)NC(CCCCN)C(=O)NC(CCSC)C(=O)NC(C(O)=O)CC1=CC=CC=C1 ZDSVOHFDQKSTAY-UHFFFAOYSA-N 0.000 description 1
- RIQBRKVTFBWEDY-RHYQMDGZSA-N Arg-Lys-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O RIQBRKVTFBWEDY-RHYQMDGZSA-N 0.000 description 1
- JBIRFLWXWDSDTR-CYDGBPFRSA-N Arg-Met-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CCCN=C(N)N)N JBIRFLWXWDSDTR-CYDGBPFRSA-N 0.000 description 1
- GSUFZRURORXYTM-STQMWFEESA-N Arg-Phe-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=CC=C1 GSUFZRURORXYTM-STQMWFEESA-N 0.000 description 1
- NIELFHOLFTUZME-HJWJTTGWSA-N Arg-Phe-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NIELFHOLFTUZME-HJWJTTGWSA-N 0.000 description 1
- XSPKAHFVDKRGRL-DCAQKATOSA-N Arg-Pro-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O XSPKAHFVDKRGRL-DCAQKATOSA-N 0.000 description 1
- HGKHPCFTRQDHCU-IUCAKERBSA-N Arg-Pro-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O HGKHPCFTRQDHCU-IUCAKERBSA-N 0.000 description 1
- JOTRDIXZHNQYGP-DCAQKATOSA-N Arg-Ser-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCCN=C(N)N)N JOTRDIXZHNQYGP-DCAQKATOSA-N 0.000 description 1
- FRBAHXABMQXSJQ-FXQIFTODSA-N Arg-Ser-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O FRBAHXABMQXSJQ-FXQIFTODSA-N 0.000 description 1
- RYQSYXFGFOTJDJ-RHYQMDGZSA-N Arg-Thr-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O RYQSYXFGFOTJDJ-RHYQMDGZSA-N 0.000 description 1
- DDBMKOCQWNFDBH-RHYQMDGZSA-N Arg-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)O DDBMKOCQWNFDBH-RHYQMDGZSA-N 0.000 description 1
- MOGMYRUNTKYZFB-UNQGMJICSA-N Arg-Thr-Phe Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MOGMYRUNTKYZFB-UNQGMJICSA-N 0.000 description 1
- ZJBUILVYSXQNSW-YTWAJWBKSA-N Arg-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)O ZJBUILVYSXQNSW-YTWAJWBKSA-N 0.000 description 1
- INOIAEUXVVNJKA-XGEHTFHBSA-N Arg-Thr-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O INOIAEUXVVNJKA-XGEHTFHBSA-N 0.000 description 1
- ZUVDFJXRAICIAJ-BPUTZDHNSA-N Arg-Trp-Asp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCCN=C(N)N)N)C(=O)N[C@@H](CC(O)=O)C(O)=O)=CNC2=C1 ZUVDFJXRAICIAJ-BPUTZDHNSA-N 0.000 description 1
- XOZYYXMHMIEJET-XIRDDKMYSA-N Arg-Trp-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(O)=O)C(O)=O XOZYYXMHMIEJET-XIRDDKMYSA-N 0.000 description 1
- UVTGNSWSRSCPLP-UHFFFAOYSA-N Arg-Tyr Natural products NC(CCNC(=N)N)C(=O)NC(Cc1ccc(O)cc1)C(=O)O UVTGNSWSRSCPLP-UHFFFAOYSA-N 0.000 description 1
- VLIJAPRTSXSGFY-STQMWFEESA-N Arg-Tyr-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=C(O)C=C1 VLIJAPRTSXSGFY-STQMWFEESA-N 0.000 description 1
- PSUXEQYPYZLNER-QXEWZRGKSA-N Arg-Val-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O PSUXEQYPYZLNER-QXEWZRGKSA-N 0.000 description 1
- ULBHWNVWSCJLCO-NHCYSSNCSA-N Arg-Val-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCN=C(N)N ULBHWNVWSCJLCO-NHCYSSNCSA-N 0.000 description 1
- LEFKSBYHUGUWLP-ACZMJKKPSA-N Asn-Ala-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O LEFKSBYHUGUWLP-ACZMJKKPSA-N 0.000 description 1
- SLKLLQWZQHXYSV-CIUDSAMLSA-N Asn-Ala-Lys Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(O)=O SLKLLQWZQHXYSV-CIUDSAMLSA-N 0.000 description 1
- XWGJDUSDTRPQRK-ZLUOBGJFSA-N Asn-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC(N)=O XWGJDUSDTRPQRK-ZLUOBGJFSA-N 0.000 description 1
- QEYJFBMTSMLPKZ-ZKWXMUAHSA-N Asn-Ala-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O QEYJFBMTSMLPKZ-ZKWXMUAHSA-N 0.000 description 1
- XHFXZQHTLJVZBN-FXQIFTODSA-N Asn-Arg-Asn Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N)CN=C(N)N XHFXZQHTLJVZBN-FXQIFTODSA-N 0.000 description 1
- JJGRJMKUOYXZRA-LPEHRKFASA-N Asn-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC(=O)N)N)C(=O)O JJGRJMKUOYXZRA-LPEHRKFASA-N 0.000 description 1
- AYZAWXAPBAYCHO-CIUDSAMLSA-N Asn-Asn-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)N)N AYZAWXAPBAYCHO-CIUDSAMLSA-N 0.000 description 1
- CUQUEHYSSFETRD-ACZMJKKPSA-N Asn-Asp-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)N)N CUQUEHYSSFETRD-ACZMJKKPSA-N 0.000 description 1
- ZWASIOHRQWRWAS-UGYAYLCHSA-N Asn-Asp-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ZWASIOHRQWRWAS-UGYAYLCHSA-N 0.000 description 1
- IYVSIZAXNLOKFQ-BYULHYEWSA-N Asn-Asp-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O IYVSIZAXNLOKFQ-BYULHYEWSA-N 0.000 description 1
- QNJIRRVTOXNGMH-GUBZILKMSA-N Asn-Gln-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CC(N)=O QNJIRRVTOXNGMH-GUBZILKMSA-N 0.000 description 1
- JZDZLBJVYWIIQU-AVGNSLFASA-N Asn-Glu-Tyr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JZDZLBJVYWIIQU-AVGNSLFASA-N 0.000 description 1
- DMLSCRJBWUEALP-LAEOZQHASA-N Asn-Glu-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O DMLSCRJBWUEALP-LAEOZQHASA-N 0.000 description 1
- CTQIOCMSIJATNX-WHFBIAKZSA-N Asn-Gly-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](C)C(O)=O CTQIOCMSIJATNX-WHFBIAKZSA-N 0.000 description 1
- DDPXDCKYWDGZAL-BQBZGAKWSA-N Asn-Gly-Arg Chemical compound NC(=O)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N DDPXDCKYWDGZAL-BQBZGAKWSA-N 0.000 description 1
- DXVMJJNAOVECBA-WHFBIAKZSA-N Asn-Gly-Asn Chemical compound NC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O DXVMJJNAOVECBA-WHFBIAKZSA-N 0.000 description 1
- PBSQFBAJKPLRJY-BYULHYEWSA-N Asn-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N PBSQFBAJKPLRJY-BYULHYEWSA-N 0.000 description 1
- ANPFQTJEPONRPL-UGYAYLCHSA-N Asn-Ile-Asp Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O ANPFQTJEPONRPL-UGYAYLCHSA-N 0.000 description 1
- GQRDIVQPSMPQME-ZPFDUUQYSA-N Asn-Ile-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O GQRDIVQPSMPQME-ZPFDUUQYSA-N 0.000 description 1
- XLZCLJRGGMBKLR-PCBIJLKTSA-N Asn-Ile-Phe Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 XLZCLJRGGMBKLR-PCBIJLKTSA-N 0.000 description 1
- IBLAOXSULLECQZ-IUKAMOBKSA-N Asn-Ile-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC(N)=O IBLAOXSULLECQZ-IUKAMOBKSA-N 0.000 description 1
- SPCONPVIDFMDJI-QSFUFRPTSA-N Asn-Ile-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O SPCONPVIDFMDJI-QSFUFRPTSA-N 0.000 description 1
- ZMUQQMGITUJQTI-CIUDSAMLSA-N Asn-Leu-Asn Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O ZMUQQMGITUJQTI-CIUDSAMLSA-N 0.000 description 1
- BXUHCIXDSWRSBS-CIUDSAMLSA-N Asn-Leu-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O BXUHCIXDSWRSBS-CIUDSAMLSA-N 0.000 description 1
- TZFQICWZWFNIKU-KKUMJFAQSA-N Asn-Leu-Tyr Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 TZFQICWZWFNIKU-KKUMJFAQSA-N 0.000 description 1
- FBODFHMLALOPHP-GUBZILKMSA-N Asn-Lys-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O FBODFHMLALOPHP-GUBZILKMSA-N 0.000 description 1
- QDXQWFBLUVTOFL-FXQIFTODSA-N Asn-Met-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CC(=O)N)N QDXQWFBLUVTOFL-FXQIFTODSA-N 0.000 description 1
- RLHANKIRBONJBK-IHRRRGAJSA-N Asn-Met-Phe Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CC(=O)N)N RLHANKIRBONJBK-IHRRRGAJSA-N 0.000 description 1
- ZVUMKOMKQCANOM-AVGNSLFASA-N Asn-Phe-Gln Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZVUMKOMKQCANOM-AVGNSLFASA-N 0.000 description 1
- BKFXFUPYETWGGA-XVSYOHENSA-N Asn-Phe-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BKFXFUPYETWGGA-XVSYOHENSA-N 0.000 description 1
- JTXVXGXTRXMOFJ-FXQIFTODSA-N Asn-Pro-Asn Chemical compound NC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O JTXVXGXTRXMOFJ-FXQIFTODSA-N 0.000 description 1
- GMUOCGCDOYYWPD-FXQIFTODSA-N Asn-Pro-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O GMUOCGCDOYYWPD-FXQIFTODSA-N 0.000 description 1
- XTMZYFMTYJNABC-ZLUOBGJFSA-N Asn-Ser-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(=O)N)N XTMZYFMTYJNABC-ZLUOBGJFSA-N 0.000 description 1
- GZXOUBTUAUAVHD-ACZMJKKPSA-N Asn-Ser-Glu Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O GZXOUBTUAUAVHD-ACZMJKKPSA-N 0.000 description 1
- JBDLMLZNDRLDIX-HJGDQZAQSA-N Asn-Thr-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O JBDLMLZNDRLDIX-HJGDQZAQSA-N 0.000 description 1
- XIDSGDJNUJRUHE-VEVYYDQMSA-N Asn-Thr-Met Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(O)=O XIDSGDJNUJRUHE-VEVYYDQMSA-N 0.000 description 1
- KSZHWTRZPOTIGY-AVGNSLFASA-N Asn-Tyr-Gln Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O KSZHWTRZPOTIGY-AVGNSLFASA-N 0.000 description 1
- YSYTWUMRHSFODC-QWRGUYRKSA-N Asn-Tyr-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(O)=O YSYTWUMRHSFODC-QWRGUYRKSA-N 0.000 description 1
- KVMPVNGOKHTUHZ-GCJQMDKQSA-N Asp-Ala-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KVMPVNGOKHTUHZ-GCJQMDKQSA-N 0.000 description 1
- SDHFVYLZFBDSQT-DCAQKATOSA-N Asp-Arg-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC(=O)O)N SDHFVYLZFBDSQT-DCAQKATOSA-N 0.000 description 1
- BUVNWKQBMZLCDW-UGYAYLCHSA-N Asp-Asn-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BUVNWKQBMZLCDW-UGYAYLCHSA-N 0.000 description 1
- NAPNAGZWHQHZLG-ZLUOBGJFSA-N Asp-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)O)N NAPNAGZWHQHZLG-ZLUOBGJFSA-N 0.000 description 1
- ZCKYZTGLXIEOKS-CIUDSAMLSA-N Asp-Asp-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)O)N ZCKYZTGLXIEOKS-CIUDSAMLSA-N 0.000 description 1
- SBHUBSDEZQFJHJ-CIUDSAMLSA-N Asp-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(O)=O SBHUBSDEZQFJHJ-CIUDSAMLSA-N 0.000 description 1
- YBMUFUWSMIKJQA-GUBZILKMSA-N Asp-Gln-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)O)N YBMUFUWSMIKJQA-GUBZILKMSA-N 0.000 description 1
- XJQRWGXKUSDEFI-ACZMJKKPSA-N Asp-Glu-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O XJQRWGXKUSDEFI-ACZMJKKPSA-N 0.000 description 1
- XAJRHVUUVUPFQL-ACZMJKKPSA-N Asp-Glu-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O XAJRHVUUVUPFQL-ACZMJKKPSA-N 0.000 description 1
- JUWZKMBALYLZCK-WHFBIAKZSA-N Asp-Gly-Asn Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O JUWZKMBALYLZCK-WHFBIAKZSA-N 0.000 description 1
- PSLSTUMPZILTAH-BYULHYEWSA-N Asp-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(O)=O PSLSTUMPZILTAH-BYULHYEWSA-N 0.000 description 1
- QCVXMEHGFUMKCO-YUMQZZPRSA-N Asp-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(O)=O QCVXMEHGFUMKCO-YUMQZZPRSA-N 0.000 description 1
- PZXPWHFYZXTFBI-YUMQZZPRSA-N Asp-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(O)=O PZXPWHFYZXTFBI-YUMQZZPRSA-N 0.000 description 1
- TVIZQBFURPLQDV-DJFWLOJKSA-N Asp-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC(=O)O)N TVIZQBFURPLQDV-DJFWLOJKSA-N 0.000 description 1
- WSXDIZFNQYTUJB-SRVKXCTJSA-N Asp-His-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(O)=O WSXDIZFNQYTUJB-SRVKXCTJSA-N 0.000 description 1
- KTTCQQNRRLCIBC-GHCJXIJMSA-N Asp-Ile-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O KTTCQQNRRLCIBC-GHCJXIJMSA-N 0.000 description 1
- NHSDEZURHWEZPN-SXTJYALSSA-N Asp-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)NC(=O)[C@H](CC(=O)O)N NHSDEZURHWEZPN-SXTJYALSSA-N 0.000 description 1
- RTXQQDVBACBSCW-CFMVVWHZSA-N Asp-Ile-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O RTXQQDVBACBSCW-CFMVVWHZSA-N 0.000 description 1
- KYQNAIMCTRZLNP-QSFUFRPTSA-N Asp-Ile-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O KYQNAIMCTRZLNP-QSFUFRPTSA-N 0.000 description 1
- KFAFUJMGHVVYRC-DCAQKATOSA-N Asp-Leu-Met Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(O)=O KFAFUJMGHVVYRC-DCAQKATOSA-N 0.000 description 1
- QNMKWNONJGKJJC-NHCYSSNCSA-N Asp-Leu-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O QNMKWNONJGKJJC-NHCYSSNCSA-N 0.000 description 1
- LIVXPXUVXFRWNY-CIUDSAMLSA-N Asp-Lys-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O LIVXPXUVXFRWNY-CIUDSAMLSA-N 0.000 description 1
- XWSIYTYNLKCLJB-CIUDSAMLSA-N Asp-Lys-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O XWSIYTYNLKCLJB-CIUDSAMLSA-N 0.000 description 1
- VSMYBNPOHYAXSD-GUBZILKMSA-N Asp-Lys-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O VSMYBNPOHYAXSD-GUBZILKMSA-N 0.000 description 1
- AKKUDRZKFZWPBH-SRVKXCTJSA-N Asp-Lys-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)O)N AKKUDRZKFZWPBH-SRVKXCTJSA-N 0.000 description 1
- WWOYXVBGHAHQBG-FXQIFTODSA-N Asp-Met-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(O)=O WWOYXVBGHAHQBG-FXQIFTODSA-N 0.000 description 1
- HXVILZUZXFLVEN-DCAQKATOSA-N Asp-Met-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O HXVILZUZXFLVEN-DCAQKATOSA-N 0.000 description 1
- ZKAOJVJQGVUIIU-GUBZILKMSA-N Asp-Pro-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ZKAOJVJQGVUIIU-GUBZILKMSA-N 0.000 description 1
- BWJZSLQJNBSUPM-FXQIFTODSA-N Asp-Pro-Asn Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O BWJZSLQJNBSUPM-FXQIFTODSA-N 0.000 description 1
- AHWRSSLYSGLBGD-CIUDSAMLSA-N Asp-Pro-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O AHWRSSLYSGLBGD-CIUDSAMLSA-N 0.000 description 1
- KGHLGJAXYSVNJP-WHFBIAKZSA-N Asp-Ser-Gly Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O KGHLGJAXYSVNJP-WHFBIAKZSA-N 0.000 description 1
- QSFHZPQUAAQHAQ-CIUDSAMLSA-N Asp-Ser-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O QSFHZPQUAAQHAQ-CIUDSAMLSA-N 0.000 description 1
- MNQMTYSEKZHIDF-GCJQMDKQSA-N Asp-Thr-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O MNQMTYSEKZHIDF-GCJQMDKQSA-N 0.000 description 1
- JJQGZGOEDSSHTE-FOHZUACHSA-N Asp-Thr-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O JJQGZGOEDSSHTE-FOHZUACHSA-N 0.000 description 1
- GXHDGYOXPNQCKM-XVSYOHENSA-N Asp-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O GXHDGYOXPNQCKM-XVSYOHENSA-N 0.000 description 1
- GCACQYDBDHRVGE-LKXGYXEUSA-N Asp-Thr-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CC(O)=O GCACQYDBDHRVGE-LKXGYXEUSA-N 0.000 description 1
- RSMZEHCMIOKNMW-GSSVUCPTSA-N Asp-Thr-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O RSMZEHCMIOKNMW-GSSVUCPTSA-N 0.000 description 1
- JDDYEZGPYBBPBN-JRQIVUDYSA-N Asp-Thr-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JDDYEZGPYBBPBN-JRQIVUDYSA-N 0.000 description 1
- NVXLFIPTHPKSKL-UBHSHLNASA-N Asp-Trp-Asn Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CC(O)=O)N)C(=O)N[C@@H](CC(N)=O)C(O)=O)=CNC2=C1 NVXLFIPTHPKSKL-UBHSHLNASA-N 0.000 description 1
- USENATHVGFXRNO-SRVKXCTJSA-N Asp-Tyr-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(O)=O)C(O)=O)CC1=CC=C(O)C=C1 USENATHVGFXRNO-SRVKXCTJSA-N 0.000 description 1
- NWAHPBGBDIFUFD-KKUMJFAQSA-N Asp-Tyr-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O NWAHPBGBDIFUFD-KKUMJFAQSA-N 0.000 description 1
- GFYOIYJJMSHLSN-QXEWZRGKSA-N Asp-Val-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O GFYOIYJJMSHLSN-QXEWZRGKSA-N 0.000 description 1
- MFDPBZAFCRKYEY-LAEOZQHASA-N Asp-Val-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O MFDPBZAFCRKYEY-LAEOZQHASA-N 0.000 description 1
- XWKPSMRPIKKDDU-RCOVLWMOSA-N Asp-Val-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O XWKPSMRPIKKDDU-RCOVLWMOSA-N 0.000 description 1
- UXRVDHVARNBOIO-QSFUFRPTSA-N Asp-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CC(=O)O)N UXRVDHVARNBOIO-QSFUFRPTSA-N 0.000 description 1
- QPDUWAUSSWGJSB-NGZCFLSTSA-N Asp-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N QPDUWAUSSWGJSB-NGZCFLSTSA-N 0.000 description 1
- NOCCABSVTRONIN-CIUDSAMLSA-N Cys-Ala-Leu Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CS)N NOCCABSVTRONIN-CIUDSAMLSA-N 0.000 description 1
- LRZPRGJXAZFXCR-DCAQKATOSA-N Cys-Arg-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CS)N LRZPRGJXAZFXCR-DCAQKATOSA-N 0.000 description 1
- GSNRZJNHMVMOFV-ACZMJKKPSA-N Cys-Asp-Glu Chemical compound C(CC(=O)O)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CS)N GSNRZJNHMVMOFV-ACZMJKKPSA-N 0.000 description 1
- WDQXKVCQXRNOSI-GHCJXIJMSA-N Cys-Asp-Ile Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WDQXKVCQXRNOSI-GHCJXIJMSA-N 0.000 description 1
- FIADUEYFRSCCIK-CIUDSAMLSA-N Cys-Glu-Arg Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FIADUEYFRSCCIK-CIUDSAMLSA-N 0.000 description 1
- UUOYKFNULIOCGJ-GUBZILKMSA-N Cys-Glu-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CS)N UUOYKFNULIOCGJ-GUBZILKMSA-N 0.000 description 1
- YNJBLTDKTMKEET-ZLUOBGJFSA-N Cys-Ser-Ser Chemical compound SC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O YNJBLTDKTMKEET-ZLUOBGJFSA-N 0.000 description 1
- IRKLTAKLAFUTLA-KATARQTJSA-N Cys-Thr-Lys Chemical compound C[C@@H](O)[C@H](NC(=O)[C@@H](N)CS)C(=O)N[C@@H](CCCCN)C(O)=O IRKLTAKLAFUTLA-KATARQTJSA-N 0.000 description 1
- WTXCNOPZMQRTNN-BWBBJGPYSA-N Cys-Thr-Ser Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CS)N)O WTXCNOPZMQRTNN-BWBBJGPYSA-N 0.000 description 1
- NAPULYCVEVVFRB-HEIBUPTGSA-N Cys-Thr-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@@H](N)CS NAPULYCVEVVFRB-HEIBUPTGSA-N 0.000 description 1
- FCXJJTRGVAZDER-FXQIFTODSA-N Cys-Val-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O FCXJJTRGVAZDER-FXQIFTODSA-N 0.000 description 1
- JRZMCSIUYGSJKP-ZKWXMUAHSA-N Cys-Val-Asn Chemical compound SC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O JRZMCSIUYGSJKP-ZKWXMUAHSA-N 0.000 description 1
- LPBUBIHAVKXUOT-FXQIFTODSA-N Cys-Val-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CS)N LPBUBIHAVKXUOT-FXQIFTODSA-N 0.000 description 1
- ONIBWKKTOPOVIA-SCSAIBSYSA-N D-Proline Chemical compound OC(=O)[C@H]1CCCN1 ONIBWKKTOPOVIA-SCSAIBSYSA-N 0.000 description 1
- RGHNJXZEOKUKBD-UHFFFAOYSA-N D-gluconic acid Natural products OCC(O)C(O)C(O)C(O)C(O)=O RGHNJXZEOKUKBD-UHFFFAOYSA-N 0.000 description 1
- 229930182820 D-proline Natural products 0.000 description 1
- FEWJPZIEWOKRBE-JCYAYHJZSA-N Dextrotartaric acid Chemical compound OC(=O)[C@H](O)[C@@H](O)C(O)=O FEWJPZIEWOKRBE-JCYAYHJZSA-N 0.000 description 1
- 241000620209 Escherichia coli DH5α Species 0.000 description 1
- MLZRSFQRBDNJON-GUBZILKMSA-N Gln-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N MLZRSFQRBDNJON-GUBZILKMSA-N 0.000 description 1
- WOACHWLUOFZLGJ-GUBZILKMSA-N Gln-Arg-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O WOACHWLUOFZLGJ-GUBZILKMSA-N 0.000 description 1
- XEYMBRRKIFYQMF-GUBZILKMSA-N Gln-Asp-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O XEYMBRRKIFYQMF-GUBZILKMSA-N 0.000 description 1
- WLODHVXYKYHLJD-ACZMJKKPSA-N Gln-Asp-Ser Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CO)C(=O)O)N WLODHVXYKYHLJD-ACZMJKKPSA-N 0.000 description 1
- CITDWMLWXNUQKD-FXQIFTODSA-N Gln-Gln-Asn Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N CITDWMLWXNUQKD-FXQIFTODSA-N 0.000 description 1
- RBWKVOSARCFSQQ-FXQIFTODSA-N Gln-Gln-Ser Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O RBWKVOSARCFSQQ-FXQIFTODSA-N 0.000 description 1
- LWDGZZGWDMHBOF-FXQIFTODSA-N Gln-Glu-Asn Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O LWDGZZGWDMHBOF-FXQIFTODSA-N 0.000 description 1
- MAGNEQBFSBREJL-DCAQKATOSA-N Gln-Glu-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)N)N MAGNEQBFSBREJL-DCAQKATOSA-N 0.000 description 1
- VSXBYIJUAXPAAL-WDSKDSINSA-N Gln-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CCC(N)=O VSXBYIJUAXPAAL-WDSKDSINSA-N 0.000 description 1
- MFJAPSYJQJCQDN-BQBZGAKWSA-N Gln-Gly-Glu Chemical compound NC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O MFJAPSYJQJCQDN-BQBZGAKWSA-N 0.000 description 1
- FGYPOQPQTUNESW-IUCAKERBSA-N Gln-Gly-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCC(=O)N)N FGYPOQPQTUNESW-IUCAKERBSA-N 0.000 description 1
- VGTDBGYFVWOQTI-RYUDHWBXSA-N Gln-Gly-Phe Chemical compound NC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 VGTDBGYFVWOQTI-RYUDHWBXSA-N 0.000 description 1
- YXQCLIVLWCKCRS-RYUDHWBXSA-N Gln-Gly-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCC(=O)N)N)O YXQCLIVLWCKCRS-RYUDHWBXSA-N 0.000 description 1
- LTXLIIZACMCQTO-GUBZILKMSA-N Gln-His-Asp Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N LTXLIIZACMCQTO-GUBZILKMSA-N 0.000 description 1
- OOLCSQQPSLIETN-JYJNAYRXSA-N Gln-His-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CCC(=O)N)N)O OOLCSQQPSLIETN-JYJNAYRXSA-N 0.000 description 1
- GQZDDFRXSDGUNG-YVNDNENWSA-N Gln-Ile-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O GQZDDFRXSDGUNG-YVNDNENWSA-N 0.000 description 1
- RGAOLBZBLOJUTP-GRLWGSQLSA-N Gln-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)NC(=O)[C@H](CCC(=O)N)N RGAOLBZBLOJUTP-GRLWGSQLSA-N 0.000 description 1
- FTIJVMLAGRAYMJ-MNXVOIDGSA-N Gln-Ile-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(N)=O FTIJVMLAGRAYMJ-MNXVOIDGSA-N 0.000 description 1
- MWERYIXRDZDXOA-QEWYBTABSA-N Gln-Ile-Phe Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O MWERYIXRDZDXOA-QEWYBTABSA-N 0.000 description 1
- JKGHMESJHRTHIC-SIUGBPQLSA-N Gln-Ile-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N JKGHMESJHRTHIC-SIUGBPQLSA-N 0.000 description 1
- VUVKKXPCKILIBD-AVGNSLFASA-N Gln-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCC(=O)N)N VUVKKXPCKILIBD-AVGNSLFASA-N 0.000 description 1
- XFAUJGNLHIGXET-AVGNSLFASA-N Gln-Leu-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O XFAUJGNLHIGXET-AVGNSLFASA-N 0.000 description 1
- HSHCEAUPUPJPTE-JYJNAYRXSA-N Gln-Leu-Tyr Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N HSHCEAUPUPJPTE-JYJNAYRXSA-N 0.000 description 1
- FKXCBKCOSVIGCT-AVGNSLFASA-N Gln-Lys-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O FKXCBKCOSVIGCT-AVGNSLFASA-N 0.000 description 1
- CELXWPDNIGWCJN-WDCWCFNPSA-N Gln-Lys-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CELXWPDNIGWCJN-WDCWCFNPSA-N 0.000 description 1
- PBYFVIQRFLNQCO-GUBZILKMSA-N Gln-Pro-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O PBYFVIQRFLNQCO-GUBZILKMSA-N 0.000 description 1
- KUBFPYIMAGXGBT-ACZMJKKPSA-N Gln-Ser-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O KUBFPYIMAGXGBT-ACZMJKKPSA-N 0.000 description 1
- SXFPZRRVWSUYII-KBIXCLLPSA-N Gln-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)N)N SXFPZRRVWSUYII-KBIXCLLPSA-N 0.000 description 1
- RGNMNWULPAYDAH-JSGCOSHPSA-N Gln-Trp-Gly Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCC(=O)N)N RGNMNWULPAYDAH-JSGCOSHPSA-N 0.000 description 1
- SGVGIVDZLSHSEN-RYUDHWBXSA-N Gln-Tyr-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(O)=O SGVGIVDZLSHSEN-RYUDHWBXSA-N 0.000 description 1
- JTWZNMUVQWWGOX-SOUVJXGZSA-N Gln-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CCC(=O)N)N)C(=O)O JTWZNMUVQWWGOX-SOUVJXGZSA-N 0.000 description 1
- UBRQJXFDVZNYJP-AVGNSLFASA-N Gln-Tyr-Ser Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O UBRQJXFDVZNYJP-AVGNSLFASA-N 0.000 description 1
- SJMJMEWQMBJYPR-DZKIICNBSA-N Gln-Tyr-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CCC(=O)N)N SJMJMEWQMBJYPR-DZKIICNBSA-N 0.000 description 1
- HNAUFGBKJLTWQE-IFFSRLJSSA-N Gln-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CCC(=O)N)N)O HNAUFGBKJLTWQE-IFFSRLJSSA-N 0.000 description 1
- LKDIBBOKUAASNP-FXQIFTODSA-N Glu-Ala-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O LKDIBBOKUAASNP-FXQIFTODSA-N 0.000 description 1
- UTKUTMJSWKKHEM-WDSKDSINSA-N Glu-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O UTKUTMJSWKKHEM-WDSKDSINSA-N 0.000 description 1
- RLZBLVSJDFHDBL-KBIXCLLPSA-N Glu-Ala-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O RLZBLVSJDFHDBL-KBIXCLLPSA-N 0.000 description 1
- HUWSBFYAGXCXKC-CIUDSAMLSA-N Glu-Ala-Met Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCSC)C(O)=O HUWSBFYAGXCXKC-CIUDSAMLSA-N 0.000 description 1
- ATRHMOJQJWPVBQ-DRZSPHRISA-N Glu-Ala-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ATRHMOJQJWPVBQ-DRZSPHRISA-N 0.000 description 1
- MXOODARRORARSU-ACZMJKKPSA-N Glu-Ala-Ser Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)O)N MXOODARRORARSU-ACZMJKKPSA-N 0.000 description 1
- NCWOMXABNYEPLY-NRPADANISA-N Glu-Ala-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O NCWOMXABNYEPLY-NRPADANISA-N 0.000 description 1
- AKJRHDMTEJXTPV-ACZMJKKPSA-N Glu-Asn-Ala Chemical compound C[C@H](NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CCC(O)=O)C(O)=O AKJRHDMTEJXTPV-ACZMJKKPSA-N 0.000 description 1
- GLWXKFRTOHKGIT-ACZMJKKPSA-N Glu-Asn-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O GLWXKFRTOHKGIT-ACZMJKKPSA-N 0.000 description 1
- CKRUHITYRFNUKW-WDSKDSINSA-N Glu-Asn-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O CKRUHITYRFNUKW-WDSKDSINSA-N 0.000 description 1
- AFODTOLGSZQDSL-PEFMBERDSA-N Glu-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N AFODTOLGSZQDSL-PEFMBERDSA-N 0.000 description 1
- XXCDTYBVGMPIOA-FXQIFTODSA-N Glu-Asp-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O XXCDTYBVGMPIOA-FXQIFTODSA-N 0.000 description 1
- IESFZVCAVACGPH-PEFMBERDSA-N Glu-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCC(O)=O IESFZVCAVACGPH-PEFMBERDSA-N 0.000 description 1
- PBFGQTGPSKWHJA-QEJZJMRPSA-N Glu-Asp-Trp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O PBFGQTGPSKWHJA-QEJZJMRPSA-N 0.000 description 1
- SAEBUDRWKUXLOM-ACZMJKKPSA-N Glu-Cys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CS)NC(=O)[C@@H](N)CCC(O)=O SAEBUDRWKUXLOM-ACZMJKKPSA-N 0.000 description 1
- HTTSBEBKVNEDFE-AUTRQRHGSA-N Glu-Gln-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)O)N HTTSBEBKVNEDFE-AUTRQRHGSA-N 0.000 description 1
- SJPMNHCEWPTRBR-BQBZGAKWSA-N Glu-Glu-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O SJPMNHCEWPTRBR-BQBZGAKWSA-N 0.000 description 1
- OGNJZUXUTPQVBR-BQBZGAKWSA-N Glu-Gly-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O OGNJZUXUTPQVBR-BQBZGAKWSA-N 0.000 description 1
- OAGVHWYIBZMWLA-YFKPBYRVSA-N Glu-Gly-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)NCC(O)=O OAGVHWYIBZMWLA-YFKPBYRVSA-N 0.000 description 1
- CUXJIASLBRJOFV-LAEOZQHASA-N Glu-Gly-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O CUXJIASLBRJOFV-LAEOZQHASA-N 0.000 description 1
- OPAINBJQDQTGJY-JGVFFNPUSA-N Glu-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CCC(=O)O)N)C(=O)O OPAINBJQDQTGJY-JGVFFNPUSA-N 0.000 description 1
- VXQOONWNIWFOCS-HGNGGELXSA-N Glu-His-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCC(=O)O)N VXQOONWNIWFOCS-HGNGGELXSA-N 0.000 description 1
- ZJFNRQHUIHKZJF-GUBZILKMSA-N Glu-His-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(O)=O ZJFNRQHUIHKZJF-GUBZILKMSA-N 0.000 description 1
- YVYVMJNUENBOOL-KBIXCLLPSA-N Glu-Ile-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)O)N YVYVMJNUENBOOL-KBIXCLLPSA-N 0.000 description 1
- WVYJNPCWJYBHJG-YVNDNENWSA-N Glu-Ile-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O WVYJNPCWJYBHJG-YVNDNENWSA-N 0.000 description 1
- QXDXIXFSFHUYAX-MNXVOIDGSA-N Glu-Ile-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(O)=O QXDXIXFSFHUYAX-MNXVOIDGSA-N 0.000 description 1
- GXMXPCXXKVWOSM-KQXIARHKSA-N Glu-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N GXMXPCXXKVWOSM-KQXIARHKSA-N 0.000 description 1
- KRRFFAHEAOCBCQ-SIUGBPQLSA-N Glu-Ile-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KRRFFAHEAOCBCQ-SIUGBPQLSA-N 0.000 description 1
- OCJRHJZKGGSPRW-IUCAKERBSA-N Glu-Lys-Gly Chemical compound NCCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O OCJRHJZKGGSPRW-IUCAKERBSA-N 0.000 description 1
- YKBUCXNNBYZYAY-MNXVOIDGSA-N Glu-Lys-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O YKBUCXNNBYZYAY-MNXVOIDGSA-N 0.000 description 1
- RBXSZQRSEGYDFG-GUBZILKMSA-N Glu-Lys-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O RBXSZQRSEGYDFG-GUBZILKMSA-N 0.000 description 1
- SUIAHERNFYRBDZ-GVXVVHGQSA-N Glu-Lys-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O SUIAHERNFYRBDZ-GVXVVHGQSA-N 0.000 description 1
- UMHRCVCZUPBBQW-GARJFASQSA-N Glu-Met-Pro Chemical compound CSCC[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N UMHRCVCZUPBBQW-GARJFASQSA-N 0.000 description 1
- YRMZCZIRHYCNHX-RYUDHWBXSA-N Glu-Phe-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(O)=O YRMZCZIRHYCNHX-RYUDHWBXSA-N 0.000 description 1
- YTRBQAQSUDSIQE-FHWLQOOXSA-N Glu-Phe-Phe Chemical compound C([C@H](NC(=O)[C@H](CCC(O)=O)N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 YTRBQAQSUDSIQE-FHWLQOOXSA-N 0.000 description 1
- FGSGPLRPQCZBSQ-AVGNSLFASA-N Glu-Phe-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O FGSGPLRPQCZBSQ-AVGNSLFASA-N 0.000 description 1
- TWYFJOHWGCCRIR-DCAQKATOSA-N Glu-Pro-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O TWYFJOHWGCCRIR-DCAQKATOSA-N 0.000 description 1
- CBOVGULVQSVMPT-CIUDSAMLSA-N Glu-Pro-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O CBOVGULVQSVMPT-CIUDSAMLSA-N 0.000 description 1
- UDEPRBFQTWGLCW-CIUDSAMLSA-N Glu-Pro-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O UDEPRBFQTWGLCW-CIUDSAMLSA-N 0.000 description 1
- DXVOKNVIKORTHQ-GUBZILKMSA-N Glu-Pro-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O DXVOKNVIKORTHQ-GUBZILKMSA-N 0.000 description 1
- CQAHWYDHKUWYIX-YUMQZZPRSA-N Glu-Pro-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O CQAHWYDHKUWYIX-YUMQZZPRSA-N 0.000 description 1
- SYWCGQOIIARSIX-SRVKXCTJSA-N Glu-Pro-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O SYWCGQOIIARSIX-SRVKXCTJSA-N 0.000 description 1
- MRWYPDWDZSLWJM-ACZMJKKPSA-N Glu-Ser-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O MRWYPDWDZSLWJM-ACZMJKKPSA-N 0.000 description 1
- SYAYROHMAIHWFB-KBIXCLLPSA-N Glu-Ser-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O SYAYROHMAIHWFB-KBIXCLLPSA-N 0.000 description 1
- QOXDAWODGSIDDI-GUBZILKMSA-N Glu-Ser-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)O)N QOXDAWODGSIDDI-GUBZILKMSA-N 0.000 description 1
- HZISRJBYZAODRV-XQXXSGGOSA-N Glu-Thr-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O HZISRJBYZAODRV-XQXXSGGOSA-N 0.000 description 1
- BDISFWMLMNBTGP-NUMRIWBASA-N Glu-Thr-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O BDISFWMLMNBTGP-NUMRIWBASA-N 0.000 description 1
- CAQXJMUDOLSBPF-SUSMZKCASA-N Glu-Thr-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CAQXJMUDOLSBPF-SUSMZKCASA-N 0.000 description 1
- JDAYMLXPUJRSDJ-XIRDDKMYSA-N Glu-Trp-Arg Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCC(O)=O)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)=CNC2=C1 JDAYMLXPUJRSDJ-XIRDDKMYSA-N 0.000 description 1
- KIEICAOUSNYOLM-NRPADANISA-N Glu-Val-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O KIEICAOUSNYOLM-NRPADANISA-N 0.000 description 1
- YQPFCZVKMUVZIN-AUTRQRHGSA-N Glu-Val-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O YQPFCZVKMUVZIN-AUTRQRHGSA-N 0.000 description 1
- FGGKGJHCVMYGCD-UKJIMTQDSA-N Glu-Val-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FGGKGJHCVMYGCD-UKJIMTQDSA-N 0.000 description 1
- VIPDPMHGICREIS-GVXVVHGQSA-N Glu-Val-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O VIPDPMHGICREIS-GVXVVHGQSA-N 0.000 description 1
- NTNUEBVGKMVANB-NHCYSSNCSA-N Glu-Val-Met Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCSC)C(O)=O NTNUEBVGKMVANB-NHCYSSNCSA-N 0.000 description 1
- SOYWRINXUSUWEQ-DLOVCJGASA-N Glu-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCC(O)=O SOYWRINXUSUWEQ-DLOVCJGASA-N 0.000 description 1
- RGHNJXZEOKUKBD-SQOUGZDYSA-N Gluconic acid Natural products OC[C@@H](O)[C@@H](O)[C@H](O)[C@@H](O)C(O)=O RGHNJXZEOKUKBD-SQOUGZDYSA-N 0.000 description 1
- PYTZFYUXZZHOAD-WHFBIAKZSA-N Gly-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)CN PYTZFYUXZZHOAD-WHFBIAKZSA-N 0.000 description 1
- PUUYVMYCMIWHFE-BQBZGAKWSA-N Gly-Ala-Arg Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N PUUYVMYCMIWHFE-BQBZGAKWSA-N 0.000 description 1
- MFVQGXGQRIXBPK-WDSKDSINSA-N Gly-Ala-Glu Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O MFVQGXGQRIXBPK-WDSKDSINSA-N 0.000 description 1
- UGVQELHRNUDMAA-BYPYZUCNSA-N Gly-Ala-Gly Chemical compound [NH3+]CC(=O)N[C@@H](C)C(=O)NCC([O-])=O UGVQELHRNUDMAA-BYPYZUCNSA-N 0.000 description 1
- LJPIRKICOISLKN-WHFBIAKZSA-N Gly-Ala-Ser Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O LJPIRKICOISLKN-WHFBIAKZSA-N 0.000 description 1
- QIZJOTQTCAGKPU-KWQFWETISA-N Gly-Ala-Tyr Chemical compound [NH3+]CC(=O)N[C@@H](C)C(=O)N[C@H](C([O-])=O)CC1=CC=C(O)C=C1 QIZJOTQTCAGKPU-KWQFWETISA-N 0.000 description 1
- JXYMPBCYRKWJEE-BQBZGAKWSA-N Gly-Arg-Ala Chemical compound [H]NCC(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O JXYMPBCYRKWJEE-BQBZGAKWSA-N 0.000 description 1
- KRRMJKMGWWXWDW-STQMWFEESA-N Gly-Arg-Phe Chemical compound NC(=N)NCCC[C@H](NC(=O)CN)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KRRMJKMGWWXWDW-STQMWFEESA-N 0.000 description 1
- KKBWDNZXYLGJEY-UHFFFAOYSA-N Gly-Arg-Pro Natural products NCC(=O)NC(CCNC(=N)N)C(=O)N1CCCC1C(=O)O KKBWDNZXYLGJEY-UHFFFAOYSA-N 0.000 description 1
- DTPOVRRYXPJJAZ-FJXKBIBVSA-N Gly-Arg-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N DTPOVRRYXPJJAZ-FJXKBIBVSA-N 0.000 description 1
- XRTDOIOIBMAXCT-NKWVEPMBSA-N Gly-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)CN)C(=O)O XRTDOIOIBMAXCT-NKWVEPMBSA-N 0.000 description 1
- IWAXHBCACVWNHT-BQBZGAKWSA-N Gly-Asp-Arg Chemical compound NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IWAXHBCACVWNHT-BQBZGAKWSA-N 0.000 description 1
- XBWMTPAIUQIWKA-BYULHYEWSA-N Gly-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)CN XBWMTPAIUQIWKA-BYULHYEWSA-N 0.000 description 1
- FZQLXNIMCPJVJE-YUMQZZPRSA-N Gly-Asp-Leu Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O FZQLXNIMCPJVJE-YUMQZZPRSA-N 0.000 description 1
- MHHUEAIBJZWDBH-YUMQZZPRSA-N Gly-Asp-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)CN MHHUEAIBJZWDBH-YUMQZZPRSA-N 0.000 description 1
- LXXLEUBUOMCAMR-NKWVEPMBSA-N Gly-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)CN)C(=O)O LXXLEUBUOMCAMR-NKWVEPMBSA-N 0.000 description 1
- PMNHJLASAAWELO-FOHZUACHSA-N Gly-Asp-Thr Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PMNHJLASAAWELO-FOHZUACHSA-N 0.000 description 1
- LXXANCRPFBSSKS-IUCAKERBSA-N Gly-Gln-Leu Chemical compound [H]NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LXXANCRPFBSSKS-IUCAKERBSA-N 0.000 description 1
- AQLHORCVPGXDJW-IUCAKERBSA-N Gly-Gln-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)CN AQLHORCVPGXDJW-IUCAKERBSA-N 0.000 description 1
- PABFFPWEJMEVEC-JGVFFNPUSA-N Gly-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)CN)C(=O)O PABFFPWEJMEVEC-JGVFFNPUSA-N 0.000 description 1
- HDNXXTBKOJKWNN-WDSKDSINSA-N Gly-Glu-Asn Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O HDNXXTBKOJKWNN-WDSKDSINSA-N 0.000 description 1
- BEQGFMIBZFNROK-JGVFFNPUSA-N Gly-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)CN)C(=O)O BEQGFMIBZFNROK-JGVFFNPUSA-N 0.000 description 1
- JNGJGFMFXREJNF-KBPBESRZSA-N Gly-Glu-Trp Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O JNGJGFMFXREJNF-KBPBESRZSA-N 0.000 description 1
- CCQOOWAONKGYKQ-BYPYZUCNSA-N Gly-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)CN CCQOOWAONKGYKQ-BYPYZUCNSA-N 0.000 description 1
- QPTNELDXWKRIFX-YFKPBYRVSA-N Gly-Gly-Gln Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O QPTNELDXWKRIFX-YFKPBYRVSA-N 0.000 description 1
- GDOZQTNZPCUARW-YFKPBYRVSA-N Gly-Gly-Glu Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O GDOZQTNZPCUARW-YFKPBYRVSA-N 0.000 description 1
- QITBQGJOXQYMOA-ZETCQYMHSA-N Gly-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)CN QITBQGJOXQYMOA-ZETCQYMHSA-N 0.000 description 1
- YWAQATDNEKZFFK-BYPYZUCNSA-N Gly-Gly-Ser Chemical compound NCC(=O)NCC(=O)N[C@@H](CO)C(O)=O YWAQATDNEKZFFK-BYPYZUCNSA-N 0.000 description 1
- FQKKPCWTZZEDIC-XPUUQOCRSA-N Gly-His-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)CN)CC1=CN=CN1 FQKKPCWTZZEDIC-XPUUQOCRSA-N 0.000 description 1
- HHSOPSCKAZKQHQ-PEXQALLHSA-N Gly-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)CN HHSOPSCKAZKQHQ-PEXQALLHSA-N 0.000 description 1
- YFGONBOFGGWKKY-VHSXEESVSA-N Gly-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)CN)C(=O)O YFGONBOFGGWKKY-VHSXEESVSA-N 0.000 description 1
- LPCKHUXOGVNZRS-YUMQZZPRSA-N Gly-His-Ser Chemical compound [H]NCC(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O LPCKHUXOGVNZRS-YUMQZZPRSA-N 0.000 description 1
- VIIBEIQMLJEUJG-LAEOZQHASA-N Gly-Ile-Gln Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O VIIBEIQMLJEUJG-LAEOZQHASA-N 0.000 description 1
- HMHRTKOWRUPPNU-RCOVLWMOSA-N Gly-Ile-Gly Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O HMHRTKOWRUPPNU-RCOVLWMOSA-N 0.000 description 1
- AAHSHTLISQUZJL-QSFUFRPTSA-N Gly-Ile-Ile Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O AAHSHTLISQUZJL-QSFUFRPTSA-N 0.000 description 1
- HAXARWKYFIIHKD-ZKWXMUAHSA-N Gly-Ile-Ser Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O HAXARWKYFIIHKD-ZKWXMUAHSA-N 0.000 description 1
- SCWYHUQOOFRVHP-MBLNEYKQSA-N Gly-Ile-Thr Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SCWYHUQOOFRVHP-MBLNEYKQSA-N 0.000 description 1
- COVXELOAORHTND-LSJOCFKGSA-N Gly-Ile-Val Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O COVXELOAORHTND-LSJOCFKGSA-N 0.000 description 1
- YTSVAIMKVLZUDU-YUMQZZPRSA-N Gly-Leu-Asp Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O YTSVAIMKVLZUDU-YUMQZZPRSA-N 0.000 description 1
- CCBIBMKQNXHNIN-ZETCQYMHSA-N Gly-Leu-Gly Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O CCBIBMKQNXHNIN-ZETCQYMHSA-N 0.000 description 1
- UHPAZODVFFYEEL-QWRGUYRKSA-N Gly-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)CN UHPAZODVFFYEEL-QWRGUYRKSA-N 0.000 description 1
- MIIVFRCYJABHTQ-ONGXEEELSA-N Gly-Leu-Val Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O MIIVFRCYJABHTQ-ONGXEEELSA-N 0.000 description 1
- IUKIDFVOUHZRAK-QWRGUYRKSA-N Gly-Lys-His Chemical compound NCCCC[C@H](NC(=O)CN)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 IUKIDFVOUHZRAK-QWRGUYRKSA-N 0.000 description 1
- YKJUITHASJAGHO-HOTGVXAUSA-N Gly-Lys-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)CN YKJUITHASJAGHO-HOTGVXAUSA-N 0.000 description 1
- DBJYVKDPGIFXFO-BQBZGAKWSA-N Gly-Met-Ala Chemical compound [H]NCC(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O DBJYVKDPGIFXFO-BQBZGAKWSA-N 0.000 description 1
- ZWRDOVYMQAAISL-UWVGGRQHSA-N Gly-Met-Lys Chemical compound CSCC[C@H](NC(=O)CN)C(=O)N[C@H](C(O)=O)CCCCN ZWRDOVYMQAAISL-UWVGGRQHSA-N 0.000 description 1
- QGDOOCIPHSSADO-STQMWFEESA-N Gly-Met-Phe Chemical compound [H]NCC(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QGDOOCIPHSSADO-STQMWFEESA-N 0.000 description 1
- MXIULRKNFSCJHT-STQMWFEESA-N Gly-Phe-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 MXIULRKNFSCJHT-STQMWFEESA-N 0.000 description 1
- QAMMIGULQSIRCD-IRXDYDNUSA-N Gly-Phe-Tyr Chemical compound C([C@H](NC(=O)C[NH3+])C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C([O-])=O)C1=CC=CC=C1 QAMMIGULQSIRCD-IRXDYDNUSA-N 0.000 description 1
- VDCRBJACQKOSMS-JSGCOSHPSA-N Gly-Phe-Val Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O VDCRBJACQKOSMS-JSGCOSHPSA-N 0.000 description 1
- GLACUWHUYFBSPJ-FJXKBIBVSA-N Gly-Pro-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN GLACUWHUYFBSPJ-FJXKBIBVSA-N 0.000 description 1
- OHUKZZYSJBKFRR-WHFBIAKZSA-N Gly-Ser-Asp Chemical compound [H]NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O OHUKZZYSJBKFRR-WHFBIAKZSA-N 0.000 description 1
- WCORRBXVISTKQL-WHFBIAKZSA-N Gly-Ser-Ser Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O WCORRBXVISTKQL-WHFBIAKZSA-N 0.000 description 1
- LCRDMSSAKLTKBU-ZDLURKLDSA-N Gly-Ser-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)CN LCRDMSSAKLTKBU-ZDLURKLDSA-N 0.000 description 1
- IMRNSEPSPFQNHF-STQMWFEESA-N Gly-Ser-Trp Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CNC2=CC=CC=C12)C(=O)O IMRNSEPSPFQNHF-STQMWFEESA-N 0.000 description 1
- FFJQHWKSGAWSTJ-BFHQHQDPSA-N Gly-Thr-Ala Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O FFJQHWKSGAWSTJ-BFHQHQDPSA-N 0.000 description 1
- ZZWUYQXMIFTIIY-WEDXCCLWSA-N Gly-Thr-Leu Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O ZZWUYQXMIFTIIY-WEDXCCLWSA-N 0.000 description 1
- XHVONGZZVUUORG-WEDXCCLWSA-N Gly-Thr-Lys Chemical compound NCC(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CCCCN XHVONGZZVUUORG-WEDXCCLWSA-N 0.000 description 1
- LLWQVJNHMYBLLK-CDMKHQONSA-N Gly-Thr-Phe Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LLWQVJNHMYBLLK-CDMKHQONSA-N 0.000 description 1
- TVTZEOHWHUVYCG-KYNKHSRBSA-N Gly-Thr-Thr Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O TVTZEOHWHUVYCG-KYNKHSRBSA-N 0.000 description 1
- FOKISINOENBSDM-WLTAIBSBSA-N Gly-Thr-Tyr Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O FOKISINOENBSDM-WLTAIBSBSA-N 0.000 description 1
- LKJCZEPXHOIAIW-HOTGVXAUSA-N Gly-Trp-Lys Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)CN LKJCZEPXHOIAIW-HOTGVXAUSA-N 0.000 description 1
- RJVZMGQMJOQIAX-GJZGRUSLSA-N Gly-Trp-Met Chemical compound [H]NCC(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCSC)C(O)=O RJVZMGQMJOQIAX-GJZGRUSLSA-N 0.000 description 1
- UMBDRSMLCUYIRI-DVJZZOLTSA-N Gly-Trp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)CN)O UMBDRSMLCUYIRI-DVJZZOLTSA-N 0.000 description 1
- UIQGJYUEQDOODF-KWQFWETISA-N Gly-Tyr-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 UIQGJYUEQDOODF-KWQFWETISA-N 0.000 description 1
- KBBFOULZCHWGJX-KBPBESRZSA-N Gly-Tyr-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)CN)O KBBFOULZCHWGJX-KBPBESRZSA-N 0.000 description 1
- KOYUSMBPJOVSOO-XEGUGMAKSA-N Gly-Tyr-Ile Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KOYUSMBPJOVSOO-XEGUGMAKSA-N 0.000 description 1
- JYGYNWYVKXENNE-OALUTQOASA-N Gly-Tyr-Trp Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O JYGYNWYVKXENNE-OALUTQOASA-N 0.000 description 1
- NGBGZCUWFVVJKC-IRXDYDNUSA-N Gly-Tyr-Tyr Chemical compound C([C@H](NC(=O)CN)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 NGBGZCUWFVVJKC-IRXDYDNUSA-N 0.000 description 1
- SYOJVRNQCXYEOV-XVKPBYJWSA-N Gly-Val-Glu Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SYOJVRNQCXYEOV-XVKPBYJWSA-N 0.000 description 1
- RYAOJUMWLWUGNW-QMMMGPOBSA-N Gly-Val-Gly Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O RYAOJUMWLWUGNW-QMMMGPOBSA-N 0.000 description 1
- ZVXMEWXHFBYJPI-LSJOCFKGSA-N Gly-Val-Ile Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ZVXMEWXHFBYJPI-LSJOCFKGSA-N 0.000 description 1
- BNMRSWQOHIQTFL-JSGCOSHPSA-N Gly-Val-Phe Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 BNMRSWQOHIQTFL-JSGCOSHPSA-N 0.000 description 1
- YGHSQRJSHKYUJY-SCZZXKLOSA-N Gly-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN YGHSQRJSHKYUJY-SCZZXKLOSA-N 0.000 description 1
- AFMOTCMSEBITOE-YEPSODPASA-N Gly-Val-Thr Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O AFMOTCMSEBITOE-YEPSODPASA-N 0.000 description 1
- ZZLWLWSUIBSMNP-CIUDSAMLSA-N His-Asp-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O ZZLWLWSUIBSMNP-CIUDSAMLSA-N 0.000 description 1
- ZNNNYCXPCKACHX-DCAQKATOSA-N His-Gln-Gln Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZNNNYCXPCKACHX-DCAQKATOSA-N 0.000 description 1
- LCNNHVQNFNJLGK-AVGNSLFASA-N His-Gln-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N LCNNHVQNFNJLGK-AVGNSLFASA-N 0.000 description 1
- TXLQHACKRLWYCM-DCAQKATOSA-N His-Glu-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O TXLQHACKRLWYCM-DCAQKATOSA-N 0.000 description 1
- OEROYDLRVAYIMQ-YUMQZZPRSA-N His-Gly-Asp Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O OEROYDLRVAYIMQ-YUMQZZPRSA-N 0.000 description 1
- QAMFAYSMNZBNCA-UWVGGRQHSA-N His-Gly-Met Chemical compound CSCC[C@H](NC(=O)CNC(=O)[C@@H](N)Cc1cnc[nH]1)C(O)=O QAMFAYSMNZBNCA-UWVGGRQHSA-N 0.000 description 1
- FZKFYOXDVWDELO-KBPBESRZSA-N His-Gly-Tyr Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O FZKFYOXDVWDELO-KBPBESRZSA-N 0.000 description 1
- NTXIJPDAHXSHNL-ONGXEEELSA-N His-Gly-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O NTXIJPDAHXSHNL-ONGXEEELSA-N 0.000 description 1
- NDKSHNQINMRKHT-PEXQALLHSA-N His-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC1=CN=CN1)N NDKSHNQINMRKHT-PEXQALLHSA-N 0.000 description 1
- VFBZWZXKCVBTJR-SRVKXCTJSA-N His-Leu-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N VFBZWZXKCVBTJR-SRVKXCTJSA-N 0.000 description 1
- UROVZOUMHNXPLZ-AVGNSLFASA-N His-Leu-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CN=CN1 UROVZOUMHNXPLZ-AVGNSLFASA-N 0.000 description 1
- CKRJBQJIGOEKMC-SRVKXCTJSA-N His-Lys-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O CKRJBQJIGOEKMC-SRVKXCTJSA-N 0.000 description 1
- NKRWVZQTPXPNRZ-SRVKXCTJSA-N His-Met-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCSC)NC(=O)[C@@H](N)CC1=CN=CN1 NKRWVZQTPXPNRZ-SRVKXCTJSA-N 0.000 description 1
- VCBWXASUBZIFLQ-IHRRRGAJSA-N His-Pro-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O VCBWXASUBZIFLQ-IHRRRGAJSA-N 0.000 description 1
- GYXDQXPCPASCNR-NHCYSSNCSA-N His-Val-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N GYXDQXPCPASCNR-NHCYSSNCSA-N 0.000 description 1
- GGXUJBKENKVYNV-ULQDDVLXSA-N His-Val-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N GGXUJBKENKVYNV-ULQDDVLXSA-N 0.000 description 1
- CMPHFUWXKBPNRS-WDSOQIARSA-N His-Val-Trp Chemical compound C([C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(O)=O)C1=CNC=N1 CMPHFUWXKBPNRS-WDSOQIARSA-N 0.000 description 1
- VSZALHITQINTGC-GHCJXIJMSA-N Ile-Ala-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CC(=O)O)C(=O)O)N VSZALHITQINTGC-GHCJXIJMSA-N 0.000 description 1
- JRHFQUPIZOYKQP-KBIXCLLPSA-N Ile-Ala-Glu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O JRHFQUPIZOYKQP-KBIXCLLPSA-N 0.000 description 1
- AQCUAZTZSPQJFF-ZKWXMUAHSA-N Ile-Ala-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O AQCUAZTZSPQJFF-ZKWXMUAHSA-N 0.000 description 1
- QICVAHODWHIWIS-HTFCKZLJSA-N Ile-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N QICVAHODWHIWIS-HTFCKZLJSA-N 0.000 description 1
- HDOYNXLPTRQLAD-JBDRJPRFSA-N Ile-Ala-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(=O)O)N HDOYNXLPTRQLAD-JBDRJPRFSA-N 0.000 description 1
- WZPIKDWQVRTATP-SYWGBEHUSA-N Ile-Ala-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](C)NC(=O)[C@@H](N)[C@@H](C)CC)C(O)=O)=CNC2=C1 WZPIKDWQVRTATP-SYWGBEHUSA-N 0.000 description 1
- MKWSZEHGHSLNPF-NAKRPEOUSA-N Ile-Ala-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)O)N MKWSZEHGHSLNPF-NAKRPEOUSA-N 0.000 description 1
- TZCGZYWNIDZZMR-NAKRPEOUSA-N Ile-Arg-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](C)C(=O)O)N TZCGZYWNIDZZMR-NAKRPEOUSA-N 0.000 description 1
- TZCGZYWNIDZZMR-UHFFFAOYSA-N Ile-Arg-Ala Natural products CCC(C)C(N)C(=O)NC(C(=O)NC(C)C(O)=O)CCCN=C(N)N TZCGZYWNIDZZMR-UHFFFAOYSA-N 0.000 description 1
- HLYBGMZJVDHJEO-CYDGBPFRSA-N Ile-Arg-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N HLYBGMZJVDHJEO-CYDGBPFRSA-N 0.000 description 1
- IPYVXYDYLHVWHU-GMOBBJLQSA-N Ile-Asn-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCSC)C(=O)O)N IPYVXYDYLHVWHU-GMOBBJLQSA-N 0.000 description 1
- QIHJTGSVGIPHIW-QSFUFRPTSA-N Ile-Asn-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C(C)C)C(=O)O)N QIHJTGSVGIPHIW-QSFUFRPTSA-N 0.000 description 1
- BGZIJZJBXRVBGJ-SXTJYALSSA-N Ile-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N BGZIJZJBXRVBGJ-SXTJYALSSA-N 0.000 description 1
- NPROWIBAWYMPAZ-GUDRVLHUSA-N Ile-Asp-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N NPROWIBAWYMPAZ-GUDRVLHUSA-N 0.000 description 1
- JHCVYQKVKOLAIU-NAKRPEOUSA-N Ile-Cys-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(=O)O)N JHCVYQKVKOLAIU-NAKRPEOUSA-N 0.000 description 1
- BSWLQVGEVFYGIM-ZPFDUUQYSA-N Ile-Gln-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N BSWLQVGEVFYGIM-ZPFDUUQYSA-N 0.000 description 1
- JDAWAWXGAUZPNJ-ZPFDUUQYSA-N Ile-Glu-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N JDAWAWXGAUZPNJ-ZPFDUUQYSA-N 0.000 description 1
- PHIXPNQDGGILMP-YVNDNENWSA-N Ile-Glu-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N PHIXPNQDGGILMP-YVNDNENWSA-N 0.000 description 1
- SPQWWEZBHXHUJN-KBIXCLLPSA-N Ile-Glu-Ser Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O SPQWWEZBHXHUJN-KBIXCLLPSA-N 0.000 description 1
- SLQVFYWBGNNOTK-BYULHYEWSA-N Ile-Gly-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)N)C(=O)O)N SLQVFYWBGNNOTK-BYULHYEWSA-N 0.000 description 1
- LPFBXFILACZHIB-LAEOZQHASA-N Ile-Gly-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CCC(=O)O)C(=O)O)N LPFBXFILACZHIB-LAEOZQHASA-N 0.000 description 1
- GQKSJYINYYWPMR-NGZCFLSTSA-N Ile-Gly-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N1CCC[C@@H]1C(=O)O)N GQKSJYINYYWPMR-NGZCFLSTSA-N 0.000 description 1
- YKLOMBNBQUTJDT-HVTMNAMFSA-N Ile-His-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N YKLOMBNBQUTJDT-HVTMNAMFSA-N 0.000 description 1
- SVBAHOMTJRFSIC-SXTJYALSSA-N Ile-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(=O)N)C(=O)O)N SVBAHOMTJRFSIC-SXTJYALSSA-N 0.000 description 1
- BBQABUDWDUKJMB-LZXPERKUSA-N Ile-Ile-Ile Chemical compound CC[C@H](C)[C@H]([NH3+])C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C([O-])=O BBQABUDWDUKJMB-LZXPERKUSA-N 0.000 description 1
- QZZIBQZLWBOOJH-PEDHHIEDSA-N Ile-Ile-Val Chemical compound N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(=O)O QZZIBQZLWBOOJH-PEDHHIEDSA-N 0.000 description 1
- HUORUFRRJHELPD-MNXVOIDGSA-N Ile-Leu-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N HUORUFRRJHELPD-MNXVOIDGSA-N 0.000 description 1
- FZWVCYCYWCLQDH-NHCYSSNCSA-N Ile-Leu-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)NCC(=O)O)N FZWVCYCYWCLQDH-NHCYSSNCSA-N 0.000 description 1
- FFAUOCITXBMRBT-YTFOTSKYSA-N Ile-Lys-Ile Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FFAUOCITXBMRBT-YTFOTSKYSA-N 0.000 description 1
- CKRFDMPBSWYOBT-PPCPHDFISA-N Ile-Lys-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N CKRFDMPBSWYOBT-PPCPHDFISA-N 0.000 description 1
- MASWXTFJVNRZPT-NAKRPEOUSA-N Ile-Met-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(=O)O)N MASWXTFJVNRZPT-NAKRPEOUSA-N 0.000 description 1
- NPAYJTAXWXJKLO-NAKRPEOUSA-N Ile-Met-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)O)N NPAYJTAXWXJKLO-NAKRPEOUSA-N 0.000 description 1
- HQEPKOFULQTSFV-JURCDPSOSA-N Ile-Phe-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)O)N HQEPKOFULQTSFV-JURCDPSOSA-N 0.000 description 1
- OWSWUWDMSNXTNE-GMOBBJLQSA-N Ile-Pro-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)O)C(=O)O)N OWSWUWDMSNXTNE-GMOBBJLQSA-N 0.000 description 1
- FQYQMFCIJNWDQZ-CYDGBPFRSA-N Ile-Pro-Pro Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 FQYQMFCIJNWDQZ-CYDGBPFRSA-N 0.000 description 1
- KTNGVMMGIQWIDV-OSUNSFLBSA-N Ile-Pro-Thr Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O KTNGVMMGIQWIDV-OSUNSFLBSA-N 0.000 description 1
- JZNVOBUNTWNZPW-GHCJXIJMSA-N Ile-Ser-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)O)C(=O)O)N JZNVOBUNTWNZPW-GHCJXIJMSA-N 0.000 description 1
- HXIDVIFHRYRXLZ-NAKRPEOUSA-N Ile-Ser-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)O)N HXIDVIFHRYRXLZ-NAKRPEOUSA-N 0.000 description 1
- YCKPUHHMCFSUMD-IUKAMOBKSA-N Ile-Thr-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N YCKPUHHMCFSUMD-IUKAMOBKSA-N 0.000 description 1
- YBKKLDBBPFIXBQ-MBLNEYKQSA-N Ile-Thr-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(=O)O)N YBKKLDBBPFIXBQ-MBLNEYKQSA-N 0.000 description 1
- KBDIBHQICWDGDL-PPCPHDFISA-N Ile-Thr-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N KBDIBHQICWDGDL-PPCPHDFISA-N 0.000 description 1
- QGXQHJQPAPMACW-PPCPHDFISA-N Ile-Thr-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)O)N QGXQHJQPAPMACW-PPCPHDFISA-N 0.000 description 1
- NURNJECQNNCRBK-FLBSBUHZSA-N Ile-Thr-Thr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NURNJECQNNCRBK-FLBSBUHZSA-N 0.000 description 1
- AUIYHFRUOOKTGX-UKJIMTQDSA-N Ile-Val-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N AUIYHFRUOOKTGX-UKJIMTQDSA-N 0.000 description 1
- RQZFWBLDTBDEOF-RNJOBUHISA-N Ile-Val-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N RQZFWBLDTBDEOF-RNJOBUHISA-N 0.000 description 1
- QSXSHZIRKTUXNG-STECZYCISA-N Ile-Val-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 QSXSHZIRKTUXNG-STECZYCISA-N 0.000 description 1
- PMGDADKJMCOXHX-UHFFFAOYSA-N L-Arginyl-L-glutamin-acetat Natural products NC(=N)NCCCC(N)C(=O)NC(CCC(N)=O)C(O)=O PMGDADKJMCOXHX-UHFFFAOYSA-N 0.000 description 1
- FADYJNXDPBKVCA-UHFFFAOYSA-N L-Phenylalanyl-L-lysin Natural products NCCCCC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FADYJNXDPBKVCA-UHFFFAOYSA-N 0.000 description 1
- QWCKQJZIFLGMSD-VKHMYHEASA-N L-alpha-aminobutyric acid Chemical compound CC[C@H](N)C(O)=O QWCKQJZIFLGMSD-VKHMYHEASA-N 0.000 description 1
- KFKWRHQBZQICHA-STQMWFEESA-N L-leucyl-L-phenylalanine Natural products CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KFKWRHQBZQICHA-STQMWFEESA-N 0.000 description 1
- CZCSUZMIRKFFFA-CIUDSAMLSA-N Leu-Ala-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O CZCSUZMIRKFFFA-CIUDSAMLSA-N 0.000 description 1
- QPRQGENIBFLVEB-BJDJZHNGSA-N Leu-Ala-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O QPRQGENIBFLVEB-BJDJZHNGSA-N 0.000 description 1
- BQSLGJHIAGOZCD-CIUDSAMLSA-N Leu-Ala-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O BQSLGJHIAGOZCD-CIUDSAMLSA-N 0.000 description 1
- JUWJEAPUNARGCF-DCAQKATOSA-N Leu-Arg-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O JUWJEAPUNARGCF-DCAQKATOSA-N 0.000 description 1
- HASRFYOMVPJRPU-SRVKXCTJSA-N Leu-Arg-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(O)=O)C(O)=O HASRFYOMVPJRPU-SRVKXCTJSA-N 0.000 description 1
- UCOCBWDBHCUPQP-DCAQKATOSA-N Leu-Arg-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O UCOCBWDBHCUPQP-DCAQKATOSA-N 0.000 description 1
- WUFYAPWIHCUMLL-CIUDSAMLSA-N Leu-Asn-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O WUFYAPWIHCUMLL-CIUDSAMLSA-N 0.000 description 1
- KKXDHFKZWKLYGB-GUBZILKMSA-N Leu-Asn-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N KKXDHFKZWKLYGB-GUBZILKMSA-N 0.000 description 1
- OXKYZSRZKBTVEY-ZPFDUUQYSA-N Leu-Asn-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O OXKYZSRZKBTVEY-ZPFDUUQYSA-N 0.000 description 1
- JKGHDYGZRDWHGA-SRVKXCTJSA-N Leu-Asn-Leu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O JKGHDYGZRDWHGA-SRVKXCTJSA-N 0.000 description 1
- TWQIYNGNYNJUFM-NHCYSSNCSA-N Leu-Asn-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O TWQIYNGNYNJUFM-NHCYSSNCSA-N 0.000 description 1
- BPANDPNDMJHFEV-CIUDSAMLSA-N Leu-Asp-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O BPANDPNDMJHFEV-CIUDSAMLSA-N 0.000 description 1
- PJYSOYLLTJKZHC-GUBZILKMSA-N Leu-Asp-Gln Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(N)=O PJYSOYLLTJKZHC-GUBZILKMSA-N 0.000 description 1
- PVMPDMIKUVNOBD-CIUDSAMLSA-N Leu-Asp-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O PVMPDMIKUVNOBD-CIUDSAMLSA-N 0.000 description 1
- DPWGZWUMUUJQDT-IUCAKERBSA-N Leu-Gln-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O DPWGZWUMUUJQDT-IUCAKERBSA-N 0.000 description 1
- FQZPTCNSNPWHLJ-AVGNSLFASA-N Leu-Gln-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(O)=O FQZPTCNSNPWHLJ-AVGNSLFASA-N 0.000 description 1
- GLBNEGIOFRVRHO-JYJNAYRXSA-N Leu-Gln-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O GLBNEGIOFRVRHO-JYJNAYRXSA-N 0.000 description 1
- CQGSYZCULZMEDE-SRVKXCTJSA-N Leu-Gln-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(O)=O CQGSYZCULZMEDE-SRVKXCTJSA-N 0.000 description 1
- CQGSYZCULZMEDE-UHFFFAOYSA-N Leu-Gln-Pro Natural products CC(C)CC(N)C(=O)NC(CCC(N)=O)C(=O)N1CCCC1C(O)=O CQGSYZCULZMEDE-UHFFFAOYSA-N 0.000 description 1
- HFBCHNRFRYLZNV-GUBZILKMSA-N Leu-Glu-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O HFBCHNRFRYLZNV-GUBZILKMSA-N 0.000 description 1
- NEEOBPIXKWSBRF-IUCAKERBSA-N Leu-Glu-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O NEEOBPIXKWSBRF-IUCAKERBSA-N 0.000 description 1
- IWTBYNQNAPECCS-AVGNSLFASA-N Leu-Glu-His Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 IWTBYNQNAPECCS-AVGNSLFASA-N 0.000 description 1
- WQWSMEOYXJTFRU-GUBZILKMSA-N Leu-Glu-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O WQWSMEOYXJTFRU-GUBZILKMSA-N 0.000 description 1
- QJUWBDPGGYVRHY-YUMQZZPRSA-N Leu-Gly-Cys Chemical compound CC(C)C[C@@H](C(=O)NCC(=O)N[C@@H](CS)C(=O)O)N QJUWBDPGGYVRHY-YUMQZZPRSA-N 0.000 description 1
- KGCLIYGPQXUNLO-IUCAKERBSA-N Leu-Gly-Glu Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O KGCLIYGPQXUNLO-IUCAKERBSA-N 0.000 description 1
- CCQLQKZTXZBXTN-NHCYSSNCSA-N Leu-Gly-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O CCQLQKZTXZBXTN-NHCYSSNCSA-N 0.000 description 1
- YFBBUHJJUXXZOF-UWVGGRQHSA-N Leu-Gly-Pro Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N1CCC[C@H]1C(O)=O YFBBUHJJUXXZOF-UWVGGRQHSA-N 0.000 description 1
- BTNXKBVLWJBTNR-SRVKXCTJSA-N Leu-His-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(N)=O)C(O)=O BTNXKBVLWJBTNR-SRVKXCTJSA-N 0.000 description 1
- CFZZDVMBRYFFNU-QWRGUYRKSA-N Leu-His-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)NCC(O)=O CFZZDVMBRYFFNU-QWRGUYRKSA-N 0.000 description 1
- OYQUOLRTJHWVSQ-SRVKXCTJSA-N Leu-His-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O OYQUOLRTJHWVSQ-SRVKXCTJSA-N 0.000 description 1
- OMHLATXVNQSALM-FQUUOJAGSA-N Leu-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(C)C)N OMHLATXVNQSALM-FQUUOJAGSA-N 0.000 description 1
- PDQDCFBVYXEFSD-SRVKXCTJSA-N Leu-Leu-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O PDQDCFBVYXEFSD-SRVKXCTJSA-N 0.000 description 1
- LXKNSJLSGPNHSK-KKUMJFAQSA-N Leu-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N LXKNSJLSGPNHSK-KKUMJFAQSA-N 0.000 description 1
- XVZCXCTYGHPNEM-UHFFFAOYSA-N Leu-Leu-Pro Natural products CC(C)CC(N)C(=O)NC(CC(C)C)C(=O)N1CCCC1C(O)=O XVZCXCTYGHPNEM-UHFFFAOYSA-N 0.000 description 1
- IEWBEPKLKUXQBU-VOAKCMCISA-N Leu-Leu-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IEWBEPKLKUXQBU-VOAKCMCISA-N 0.000 description 1
- FKQPWMZLIIATBA-AJNGGQMLSA-N Leu-Lys-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FKQPWMZLIIATBA-AJNGGQMLSA-N 0.000 description 1
- RTIRBWJPYJYTLO-MELADBBJSA-N Leu-Lys-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(=O)O)N RTIRBWJPYJYTLO-MELADBBJSA-N 0.000 description 1
- ZAVCJRJOQKIOJW-KKUMJFAQSA-N Leu-Phe-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(O)=O)C(O)=O)CC1=CC=CC=C1 ZAVCJRJOQKIOJW-KKUMJFAQSA-N 0.000 description 1
- PTRKPHUGYULXPU-KKUMJFAQSA-N Leu-Phe-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O PTRKPHUGYULXPU-KKUMJFAQSA-N 0.000 description 1
- WMIOEVKKYIMVKI-DCAQKATOSA-N Leu-Pro-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O WMIOEVKKYIMVKI-DCAQKATOSA-N 0.000 description 1
- RRVCZCNFXIFGRA-DCAQKATOSA-N Leu-Pro-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O RRVCZCNFXIFGRA-DCAQKATOSA-N 0.000 description 1
- UCBPDSYUVAAHCD-UWVGGRQHSA-N Leu-Pro-Gly Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O UCBPDSYUVAAHCD-UWVGGRQHSA-N 0.000 description 1
- AKVBOOKXVAMKSS-GUBZILKMSA-N Leu-Ser-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O AKVBOOKXVAMKSS-GUBZILKMSA-N 0.000 description 1
- BRTVHXHCUSXYRI-CIUDSAMLSA-N Leu-Ser-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O BRTVHXHCUSXYRI-CIUDSAMLSA-N 0.000 description 1
- ZJZNLRVCZWUONM-JXUBOQSCSA-N Leu-Thr-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O ZJZNLRVCZWUONM-JXUBOQSCSA-N 0.000 description 1
- ICYRCNICGBJLGM-HJGDQZAQSA-N Leu-Thr-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(O)=O ICYRCNICGBJLGM-HJGDQZAQSA-N 0.000 description 1
- LCNASHSOFMRYFO-WDCWCFNPSA-N Leu-Thr-Gln Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCC(N)=O LCNASHSOFMRYFO-WDCWCFNPSA-N 0.000 description 1
- LJBVRCDPWOJOEK-PPCPHDFISA-N Leu-Thr-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LJBVRCDPWOJOEK-PPCPHDFISA-N 0.000 description 1
- HGLKOTPFWOMPOB-MEYUZBJRSA-N Leu-Thr-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HGLKOTPFWOMPOB-MEYUZBJRSA-N 0.000 description 1
- ISSAURVGLGAPDK-KKUMJFAQSA-N Leu-Tyr-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O ISSAURVGLGAPDK-KKUMJFAQSA-N 0.000 description 1
- WFCKERTZVCQXKH-KBPBESRZSA-N Leu-Tyr-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(O)=O WFCKERTZVCQXKH-KBPBESRZSA-N 0.000 description 1
- JGKHAFUAPZCCDU-BZSNNMDCSA-N Leu-Tyr-Leu Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C([O-])=O)CC1=CC=C(O)C=C1 JGKHAFUAPZCCDU-BZSNNMDCSA-N 0.000 description 1
- BGGTYDNTOYRTTR-MEYUZBJRSA-N Leu-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC(C)C)N)O BGGTYDNTOYRTTR-MEYUZBJRSA-N 0.000 description 1
- AAKRWBIIGKPOKQ-ONGXEEELSA-N Leu-Val-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O AAKRWBIIGKPOKQ-ONGXEEELSA-N 0.000 description 1
- YQFZRHYZLARWDY-IHRRRGAJSA-N Leu-Val-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN YQFZRHYZLARWDY-IHRRRGAJSA-N 0.000 description 1
- VKVDRTGWLVZJOM-DCAQKATOSA-N Leu-Val-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O VKVDRTGWLVZJOM-DCAQKATOSA-N 0.000 description 1
- 102000004882 Lipase Human genes 0.000 description 1
- 108090001060 Lipase Proteins 0.000 description 1
- 239000004367 Lipase Substances 0.000 description 1
- FZIJIFCXUCZHOL-CIUDSAMLSA-N Lys-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN FZIJIFCXUCZHOL-CIUDSAMLSA-N 0.000 description 1
- NFLFJGGKOHYZJF-BJDJZHNGSA-N Lys-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN NFLFJGGKOHYZJF-BJDJZHNGSA-N 0.000 description 1
- WSXTWLJHTLRFLW-SRVKXCTJSA-N Lys-Ala-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(O)=O WSXTWLJHTLRFLW-SRVKXCTJSA-N 0.000 description 1
- WQWZXKWOEVSGQM-DCAQKATOSA-N Lys-Ala-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN WQWZXKWOEVSGQM-DCAQKATOSA-N 0.000 description 1
- WALVCOOOKULCQM-ULQDDVLXSA-N Lys-Arg-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O WALVCOOOKULCQM-ULQDDVLXSA-N 0.000 description 1
- DGAAQRAUOFHBFJ-CIUDSAMLSA-N Lys-Asn-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O DGAAQRAUOFHBFJ-CIUDSAMLSA-N 0.000 description 1
- FACUGMGEFUEBTI-SRVKXCTJSA-N Lys-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CCCCN FACUGMGEFUEBTI-SRVKXCTJSA-N 0.000 description 1
- ZQCVMVCVPFYXHZ-SRVKXCTJSA-N Lys-Asn-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCCN ZQCVMVCVPFYXHZ-SRVKXCTJSA-N 0.000 description 1
- LZWNAOIMTLNMDW-NHCYSSNCSA-N Lys-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCCN)N LZWNAOIMTLNMDW-NHCYSSNCSA-N 0.000 description 1
- AAORVPFVUIHEAB-YUMQZZPRSA-N Lys-Asp-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O AAORVPFVUIHEAB-YUMQZZPRSA-N 0.000 description 1
- IWWMPCPLFXFBAF-SRVKXCTJSA-N Lys-Asp-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O IWWMPCPLFXFBAF-SRVKXCTJSA-N 0.000 description 1
- NTBFKPBULZGXQL-KKUMJFAQSA-N Lys-Asp-Tyr Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 NTBFKPBULZGXQL-KKUMJFAQSA-N 0.000 description 1
- VSRXPEHZMHSFKU-IUCAKERBSA-N Lys-Gln-Gly Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O VSRXPEHZMHSFKU-IUCAKERBSA-N 0.000 description 1
- RZHLIPMZXOEJTL-AVGNSLFASA-N Lys-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCCN)N RZHLIPMZXOEJTL-AVGNSLFASA-N 0.000 description 1
- GQZMPWBZQALKJO-UWVGGRQHSA-N Lys-Gly-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O GQZMPWBZQALKJO-UWVGGRQHSA-N 0.000 description 1
- DTUZCYRNEJDKSR-NHCYSSNCSA-N Lys-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCCCN DTUZCYRNEJDKSR-NHCYSSNCSA-N 0.000 description 1
- KYNNSEJZFVCDIV-ZPFDUUQYSA-N Lys-Ile-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O KYNNSEJZFVCDIV-ZPFDUUQYSA-N 0.000 description 1
- IUWMQCZOTYRXPL-ZPFDUUQYSA-N Lys-Ile-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O IUWMQCZOTYRXPL-ZPFDUUQYSA-N 0.000 description 1
- QBEPTBMRQALPEV-MNXVOIDGSA-N Lys-Ile-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCCCN QBEPTBMRQALPEV-MNXVOIDGSA-N 0.000 description 1
- QOJDBRUCOXQSSK-AJNGGQMLSA-N Lys-Ile-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCCN)C(O)=O QOJDBRUCOXQSSK-AJNGGQMLSA-N 0.000 description 1
- GFWLIJDQILOEPP-HSCHXYMDSA-N Lys-Ile-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CCCCN)N GFWLIJDQILOEPP-HSCHXYMDSA-N 0.000 description 1
- MYZMQWHPDAYKIE-SRVKXCTJSA-N Lys-Leu-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O MYZMQWHPDAYKIE-SRVKXCTJSA-N 0.000 description 1
- OVAOHZIOUBEQCJ-IHRRRGAJSA-N Lys-Leu-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O OVAOHZIOUBEQCJ-IHRRRGAJSA-N 0.000 description 1
- WVJNGSFKBKOKRV-AJNGGQMLSA-N Lys-Leu-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WVJNGSFKBKOKRV-AJNGGQMLSA-N 0.000 description 1
- AIRZWUMAHCDDHR-KKUMJFAQSA-N Lys-Leu-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O AIRZWUMAHCDDHR-KKUMJFAQSA-N 0.000 description 1
- XIZQPFCRXLUNMK-BZSNNMDCSA-N Lys-Leu-Phe Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CCCCN)N XIZQPFCRXLUNMK-BZSNNMDCSA-N 0.000 description 1
- VUTWYNQUSJWBHO-BZSNNMDCSA-N Lys-Leu-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O VUTWYNQUSJWBHO-BZSNNMDCSA-N 0.000 description 1
- URGPVYGVWLIRGT-DCAQKATOSA-N Lys-Met-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O URGPVYGVWLIRGT-DCAQKATOSA-N 0.000 description 1
- IPSDPDAOSAEWCN-RHYQMDGZSA-N Lys-Met-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IPSDPDAOSAEWCN-RHYQMDGZSA-N 0.000 description 1
- CENKQZWVYMLRAX-ULQDDVLXSA-N Lys-Phe-Met Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(O)=O CENKQZWVYMLRAX-ULQDDVLXSA-N 0.000 description 1
- WLXGMVVHTIUPHE-ULQDDVLXSA-N Lys-Phe-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O WLXGMVVHTIUPHE-ULQDDVLXSA-N 0.000 description 1
- BOJYMMBYBNOOGG-DCAQKATOSA-N Lys-Pro-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O BOJYMMBYBNOOGG-DCAQKATOSA-N 0.000 description 1
- OBZHNHBAAVEWKI-DCAQKATOSA-N Lys-Pro-Asn Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O OBZHNHBAAVEWKI-DCAQKATOSA-N 0.000 description 1
- MSSABBQOBUZFKZ-IHRRRGAJSA-N Lys-Pro-His Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CCCCN)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O MSSABBQOBUZFKZ-IHRRRGAJSA-N 0.000 description 1
- HYSVGEAWTGPMOA-IHRRRGAJSA-N Lys-Pro-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O HYSVGEAWTGPMOA-IHRRRGAJSA-N 0.000 description 1
- YTJFXEDRUOQGSP-DCAQKATOSA-N Lys-Pro-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O YTJFXEDRUOQGSP-DCAQKATOSA-N 0.000 description 1
- HKXSZKJMDBHOTG-CIUDSAMLSA-N Lys-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CCCCN HKXSZKJMDBHOTG-CIUDSAMLSA-N 0.000 description 1
- JOSAKOKSPXROGQ-BJDJZHNGSA-N Lys-Ser-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JOSAKOKSPXROGQ-BJDJZHNGSA-N 0.000 description 1
- MEQLGHAMAUPOSJ-DCAQKATOSA-N Lys-Ser-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O MEQLGHAMAUPOSJ-DCAQKATOSA-N 0.000 description 1
- OKCJTECLRDARDZ-XIRDDKMYSA-N Lys-Trp-Cys Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)CCCCN)C(=O)N[C@@H](CS)C(O)=O)=CNC2=C1 OKCJTECLRDARDZ-XIRDDKMYSA-N 0.000 description 1
- RMKJOQSYLQQRFN-KKUMJFAQSA-N Lys-Tyr-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O RMKJOQSYLQQRFN-KKUMJFAQSA-N 0.000 description 1
- VWPJQIHBBOJWDN-DCAQKATOSA-N Lys-Val-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O VWPJQIHBBOJWDN-DCAQKATOSA-N 0.000 description 1
- HMZPYMSEAALNAE-ULQDDVLXSA-N Lys-Val-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O HMZPYMSEAALNAE-ULQDDVLXSA-N 0.000 description 1
- IKXQOBUBZSOWDY-AVGNSLFASA-N Lys-Val-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CCCCN)N IKXQOBUBZSOWDY-AVGNSLFASA-N 0.000 description 1
- WYEXWKAWMNJKPN-UBHSHLNASA-N Met-Ala-Phe Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CCSC)N WYEXWKAWMNJKPN-UBHSHLNASA-N 0.000 description 1
- YNOVBMBQSQTLFM-DCAQKATOSA-N Met-Asn-Leu Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O YNOVBMBQSQTLFM-DCAQKATOSA-N 0.000 description 1
- UZVWDRPUTHXQAM-FXQIFTODSA-N Met-Asp-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O UZVWDRPUTHXQAM-FXQIFTODSA-N 0.000 description 1
- OSOLWRWQADPDIQ-DCAQKATOSA-N Met-Asp-Leu Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O OSOLWRWQADPDIQ-DCAQKATOSA-N 0.000 description 1
- DNDVVILEHVMWIS-LPEHRKFASA-N Met-Asp-Pro Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N DNDVVILEHVMWIS-LPEHRKFASA-N 0.000 description 1
- FVKRQMQQFGBXHV-QXEWZRGKSA-N Met-Asp-Val Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O FVKRQMQQFGBXHV-QXEWZRGKSA-N 0.000 description 1
- OXHSZBRPUGNMKW-DCAQKATOSA-N Met-Gln-Arg Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O OXHSZBRPUGNMKW-DCAQKATOSA-N 0.000 description 1
- GPVLSVCBKUCEBI-KKUMJFAQSA-N Met-Gln-Phe Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O GPVLSVCBKUCEBI-KKUMJFAQSA-N 0.000 description 1
- GPAHWYRSHCKICP-GUBZILKMSA-N Met-Glu-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O GPAHWYRSHCKICP-GUBZILKMSA-N 0.000 description 1
- SJDQOYTYNGZZJX-SRVKXCTJSA-N Met-Glu-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O SJDQOYTYNGZZJX-SRVKXCTJSA-N 0.000 description 1
- JPCHYAUKOUGOIB-HJGDQZAQSA-N Met-Glu-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JPCHYAUKOUGOIB-HJGDQZAQSA-N 0.000 description 1
- BKIFWLQFOOKUCA-DCAQKATOSA-N Met-His-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CO)C(=O)O)N BKIFWLQFOOKUCA-DCAQKATOSA-N 0.000 description 1
- ABHVWYPPHDYFNY-WDSOQIARSA-N Met-His-Trp Chemical compound C([C@H](NC(=O)[C@@H](N)CCSC)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(O)=O)C1=CN=CN1 ABHVWYPPHDYFNY-WDSOQIARSA-N 0.000 description 1
- WPTDJKDGICUFCP-XUXIUFHCSA-N Met-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CCSC)N WPTDJKDGICUFCP-XUXIUFHCSA-N 0.000 description 1
- JCMMNFZUKMMECJ-DCAQKATOSA-N Met-Lys-Asn Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O JCMMNFZUKMMECJ-DCAQKATOSA-N 0.000 description 1
- HSJIGJRZYUADSS-IHRRRGAJSA-N Met-Lys-Leu Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O HSJIGJRZYUADSS-IHRRRGAJSA-N 0.000 description 1
- HAQLBBVZAGMESV-IHRRRGAJSA-N Met-Lys-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O HAQLBBVZAGMESV-IHRRRGAJSA-N 0.000 description 1
- XOFDBXYPKZUAAM-GUBZILKMSA-N Met-Met-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)O)N XOFDBXYPKZUAAM-GUBZILKMSA-N 0.000 description 1
- HUURTRNKPBHHKZ-JYJNAYRXSA-N Met-Phe-Val Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CC1=CC=CC=C1 HUURTRNKPBHHKZ-JYJNAYRXSA-N 0.000 description 1
- ZDJICAUBMUKVEJ-CIUDSAMLSA-N Met-Ser-Gln Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(N)=O ZDJICAUBMUKVEJ-CIUDSAMLSA-N 0.000 description 1
- LXCSZPUQKMTXNW-BQBZGAKWSA-N Met-Ser-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O LXCSZPUQKMTXNW-BQBZGAKWSA-N 0.000 description 1
- MIXPUVSPPOWTCR-FXQIFTODSA-N Met-Ser-Ser Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O MIXPUVSPPOWTCR-FXQIFTODSA-N 0.000 description 1
- DBMLDOWSVHMQQN-XGEHTFHBSA-N Met-Ser-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O DBMLDOWSVHMQQN-XGEHTFHBSA-N 0.000 description 1
- KYXDADPHSNFWQX-VEVYYDQMSA-N Met-Thr-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(O)=O KYXDADPHSNFWQX-VEVYYDQMSA-N 0.000 description 1
- NSMXRFMGZYTFEX-KJEVXHAQSA-N Met-Thr-Tyr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CCSC)N)O NSMXRFMGZYTFEX-KJEVXHAQSA-N 0.000 description 1
- DOQXHOUYYSPISL-SZMVWBNQSA-N Met-Trp-Met Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CCSC)C(=O)O)N DOQXHOUYYSPISL-SZMVWBNQSA-N 0.000 description 1
- PESQCPHRXOFIPX-UHFFFAOYSA-N N-L-methionyl-L-tyrosine Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 PESQCPHRXOFIPX-UHFFFAOYSA-N 0.000 description 1
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 1
- 108010047562 NGR peptide Proteins 0.000 description 1
- FXBNBKPUONJTTP-RZVRUWJTSA-N N[C@H](CO)CC.N[C@H](CO)CC Chemical compound N[C@H](CO)CC.N[C@H](CO)CC FXBNBKPUONJTTP-RZVRUWJTSA-N 0.000 description 1
- 108010073038 Penicillin Amidase Proteins 0.000 description 1
- WSXKXSBOJXEZDV-DLOVCJGASA-N Phe-Ala-Asn Chemical compound NC(=O)C[C@@H](C([O-])=O)NC(=O)[C@H](C)NC(=O)[C@@H]([NH3+])CC1=CC=CC=C1 WSXKXSBOJXEZDV-DLOVCJGASA-N 0.000 description 1
- QMMRHASQEVCJGR-UBHSHLNASA-N Phe-Ala-Pro Chemical compound C([C@H](N)C(=O)N[C@@H](C)C(=O)N1[C@@H](CCC1)C(O)=O)C1=CC=CC=C1 QMMRHASQEVCJGR-UBHSHLNASA-N 0.000 description 1
- BKWJQWJPZMUWEG-LFSVMHDDSA-N Phe-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 BKWJQWJPZMUWEG-LFSVMHDDSA-N 0.000 description 1
- CGOMLCQJEMWMCE-STQMWFEESA-N Phe-Arg-Gly Chemical compound NC(N)=NCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 CGOMLCQJEMWMCE-STQMWFEESA-N 0.000 description 1
- QCHNRQQVLJYDSI-DLOVCJGASA-N Phe-Asn-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 QCHNRQQVLJYDSI-DLOVCJGASA-N 0.000 description 1
- HHOOEUSPFGPZFP-QWRGUYRKSA-N Phe-Asn-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O HHOOEUSPFGPZFP-QWRGUYRKSA-N 0.000 description 1
- JIYJYFIXQTYDNF-YDHLFZDLSA-N Phe-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CC=CC=C1)N JIYJYFIXQTYDNF-YDHLFZDLSA-N 0.000 description 1
- CSYVXYQDIVCQNU-QWRGUYRKSA-N Phe-Asp-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O CSYVXYQDIVCQNU-QWRGUYRKSA-N 0.000 description 1
- SWZKMTDPQXLQRD-XVSYOHENSA-N Phe-Asp-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SWZKMTDPQXLQRD-XVSYOHENSA-N 0.000 description 1
- FSPGBMWPNMRWDB-AVGNSLFASA-N Phe-Cys-Gln Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N FSPGBMWPNMRWDB-AVGNSLFASA-N 0.000 description 1
- LLGTYVHITPVGKR-RYUDHWBXSA-N Phe-Gln-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O LLGTYVHITPVGKR-RYUDHWBXSA-N 0.000 description 1
- KAGCQPSEVAETCA-JYJNAYRXSA-N Phe-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CC=CC=C1)N KAGCQPSEVAETCA-JYJNAYRXSA-N 0.000 description 1
- WYPVCIACUMJRIB-JYJNAYRXSA-N Phe-Gln-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N WYPVCIACUMJRIB-JYJNAYRXSA-N 0.000 description 1
- OPEVYHFJXLCCRT-AVGNSLFASA-N Phe-Gln-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O OPEVYHFJXLCCRT-AVGNSLFASA-N 0.000 description 1
- MGECUMGTSHYHEJ-QEWYBTABSA-N Phe-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 MGECUMGTSHYHEJ-QEWYBTABSA-N 0.000 description 1
- ZLGQEBCCANLYRA-RYUDHWBXSA-N Phe-Gly-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O ZLGQEBCCANLYRA-RYUDHWBXSA-N 0.000 description 1
- NPLGQVKZFGJWAI-QWHCGFSZSA-N Phe-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O NPLGQVKZFGJWAI-QWHCGFSZSA-N 0.000 description 1
- QPVFUAUFEBPIPT-CDMKHQONSA-N Phe-Gly-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O QPVFUAUFEBPIPT-CDMKHQONSA-N 0.000 description 1
- WFHRXJOZEXUKLV-IRXDYDNUSA-N Phe-Gly-Tyr Chemical compound C([C@H](N)C(=O)NCC(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 WFHRXJOZEXUKLV-IRXDYDNUSA-N 0.000 description 1
- BSHMIVKDJQGLNT-ACRUOGEOSA-N Phe-Lys-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 BSHMIVKDJQGLNT-ACRUOGEOSA-N 0.000 description 1
- OWSLLRKCHLTUND-BZSNNMDCSA-N Phe-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)N[C@@H](CC(=O)N)C(=O)O)N OWSLLRKCHLTUND-BZSNNMDCSA-N 0.000 description 1
- ROOQMPCUFLDOSB-FHWLQOOXSA-N Phe-Phe-Gln Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CCC(N)=O)C(O)=O)C1=CC=CC=C1 ROOQMPCUFLDOSB-FHWLQOOXSA-N 0.000 description 1
- IWZRODDWOSIXPZ-IRXDYDNUSA-N Phe-Phe-Gly Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)NCC(O)=O)C1=CC=CC=C1 IWZRODDWOSIXPZ-IRXDYDNUSA-N 0.000 description 1
- AFNJAQVMTIQTCB-DLOVCJGASA-N Phe-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=CC=C1 AFNJAQVMTIQTCB-DLOVCJGASA-N 0.000 description 1
- YMIZSYUAZJSOFL-SRVKXCTJSA-N Phe-Ser-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O YMIZSYUAZJSOFL-SRVKXCTJSA-N 0.000 description 1
- GLJZDMZJHFXJQG-BZSNNMDCSA-N Phe-Ser-Phe Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O GLJZDMZJHFXJQG-BZSNNMDCSA-N 0.000 description 1
- LTAWNJXSRUCFAN-UNQGMJICSA-N Phe-Thr-Arg Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O LTAWNJXSRUCFAN-UNQGMJICSA-N 0.000 description 1
- BSKMOCNNLNDIMU-CDMKHQONSA-N Phe-Thr-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O BSKMOCNNLNDIMU-CDMKHQONSA-N 0.000 description 1
- KLYYKKGCPOGDPE-OEAJRASXSA-N Phe-Thr-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O KLYYKKGCPOGDPE-OEAJRASXSA-N 0.000 description 1
- BPIFSOUEUYDJRM-DCPHZVHLSA-N Phe-Trp-Ala Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](C)C(O)=O)C1=CC=CC=C1 BPIFSOUEUYDJRM-DCPHZVHLSA-N 0.000 description 1
- CDHURCQGUDNBMA-UBHSHLNASA-N Phe-Val-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 CDHURCQGUDNBMA-UBHSHLNASA-N 0.000 description 1
- GOUWCZRDTWTODO-YDHLFZDLSA-N Phe-Val-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O GOUWCZRDTWTODO-YDHLFZDLSA-N 0.000 description 1
- VIIRRNQMMIHYHQ-XHSDSOJGSA-N Phe-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N VIIRRNQMMIHYHQ-XHSDSOJGSA-N 0.000 description 1
- IEIFEYBAYFSRBQ-IHRRRGAJSA-N Phe-Val-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N IEIFEYBAYFSRBQ-IHRRRGAJSA-N 0.000 description 1
- XBCOOBCTVMMQSC-BVSLBCMMSA-N Phe-Val-Trp Chemical compound C([C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(O)=O)C1=CC=CC=C1 XBCOOBCTVMMQSC-BVSLBCMMSA-N 0.000 description 1
- IWNOFCGBMSFTBC-CIUDSAMLSA-N Pro-Ala-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O IWNOFCGBMSFTBC-CIUDSAMLSA-N 0.000 description 1
- KIZQGKLMXKGDIV-BQBZGAKWSA-N Pro-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 KIZQGKLMXKGDIV-BQBZGAKWSA-N 0.000 description 1
- IFMDQWDAJUMMJC-DCAQKATOSA-N Pro-Ala-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O IFMDQWDAJUMMJC-DCAQKATOSA-N 0.000 description 1
- FYQSMXKJYTZYRP-DCAQKATOSA-N Pro-Ala-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 FYQSMXKJYTZYRP-DCAQKATOSA-N 0.000 description 1
- CGBYDGAJHSOGFQ-LPEHRKFASA-N Pro-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 CGBYDGAJHSOGFQ-LPEHRKFASA-N 0.000 description 1
- HFZNNDWPHBRNPV-KZVJFYERSA-N Pro-Ala-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O HFZNNDWPHBRNPV-KZVJFYERSA-N 0.000 description 1
- OOLOTUZJUBOMAX-GUBZILKMSA-N Pro-Ala-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O OOLOTUZJUBOMAX-GUBZILKMSA-N 0.000 description 1
- SSSFPISOZOLQNP-GUBZILKMSA-N Pro-Arg-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O SSSFPISOZOLQNP-GUBZILKMSA-N 0.000 description 1
- QSKCKTUQPICLSO-AVGNSLFASA-N Pro-Arg-Lys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O QSKCKTUQPICLSO-AVGNSLFASA-N 0.000 description 1
- TXPUNZXZDVJUJQ-LPEHRKFASA-N Pro-Asn-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)N)C(=O)N2CCC[C@@H]2C(=O)O TXPUNZXZDVJUJQ-LPEHRKFASA-N 0.000 description 1
- WPQKSRHDTMRSJM-CIUDSAMLSA-N Pro-Asp-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H]1CCCN1 WPQKSRHDTMRSJM-CIUDSAMLSA-N 0.000 description 1
- DEDANIDYQAPTFI-IHRRRGAJSA-N Pro-Asp-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O DEDANIDYQAPTFI-IHRRRGAJSA-N 0.000 description 1
- WGAQWMRJUFQXMF-ZPFDUUQYSA-N Pro-Gln-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WGAQWMRJUFQXMF-ZPFDUUQYSA-N 0.000 description 1
- LANQLYHLMYDWJP-SRVKXCTJSA-N Pro-Gln-Lys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O LANQLYHLMYDWJP-SRVKXCTJSA-N 0.000 description 1
- UAYHMOIGIQZLFR-NHCYSSNCSA-N Pro-Gln-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O UAYHMOIGIQZLFR-NHCYSSNCSA-N 0.000 description 1
- LHALYDBUDCWMDY-CIUDSAMLSA-N Pro-Glu-Ala Chemical compound C[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1)C(O)=O LHALYDBUDCWMDY-CIUDSAMLSA-N 0.000 description 1
- MGDFPGCFVJFITQ-CIUDSAMLSA-N Pro-Glu-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O MGDFPGCFVJFITQ-CIUDSAMLSA-N 0.000 description 1
- VDGTVWFMRXVQCT-GUBZILKMSA-N Pro-Glu-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1 VDGTVWFMRXVQCT-GUBZILKMSA-N 0.000 description 1
- WFHYFCWBLSKEMS-KKUMJFAQSA-N Pro-Glu-Phe Chemical compound N([C@@H](CCC(=O)O)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C(=O)[C@@H]1CCCN1 WFHYFCWBLSKEMS-KKUMJFAQSA-N 0.000 description 1
- CLNJSLSHKJECME-BQBZGAKWSA-N Pro-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H]1CCCN1 CLNJSLSHKJECME-BQBZGAKWSA-N 0.000 description 1
- XQSREVQDGCPFRJ-STQMWFEESA-N Pro-Gly-Phe Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O XQSREVQDGCPFRJ-STQMWFEESA-N 0.000 description 1
- DXTOOBDIIAJZBJ-BQBZGAKWSA-N Pro-Gly-Ser Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CO)C(O)=O DXTOOBDIIAJZBJ-BQBZGAKWSA-N 0.000 description 1
- QEWBZBLXDKIQPS-STQMWFEESA-N Pro-Gly-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O QEWBZBLXDKIQPS-STQMWFEESA-N 0.000 description 1
- XQHGISDMVBTGAL-ULQDDVLXSA-N Pro-His-Phe Chemical compound C([C@@H](C(=O)[O-])NC(=O)[C@H](CC=1NC=NC=1)NC(=O)[C@H]1[NH2+]CCC1)C1=CC=CC=C1 XQHGISDMVBTGAL-ULQDDVLXSA-N 0.000 description 1
- BODDREDDDRZUCF-QTKMDUPCSA-N Pro-His-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@@H]2CCCN2)O BODDREDDDRZUCF-QTKMDUPCSA-N 0.000 description 1
- FKVNLUZHSFCNGY-RVMXOQNASA-N Pro-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 FKVNLUZHSFCNGY-RVMXOQNASA-N 0.000 description 1
- RUDOLGWDSKQQFF-DCAQKATOSA-N Pro-Leu-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O RUDOLGWDSKQQFF-DCAQKATOSA-N 0.000 description 1
- HFNPOYOKIPGAEI-SRVKXCTJSA-N Pro-Leu-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 HFNPOYOKIPGAEI-SRVKXCTJSA-N 0.000 description 1
- BRJGUPWVFXKBQI-XUXIUFHCSA-N Pro-Leu-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BRJGUPWVFXKBQI-XUXIUFHCSA-N 0.000 description 1
- SUENWIFTSTWUKD-AVGNSLFASA-N Pro-Leu-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O SUENWIFTSTWUKD-AVGNSLFASA-N 0.000 description 1
- CDGABSWLRMECHC-IHRRRGAJSA-N Pro-Lys-His Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O CDGABSWLRMECHC-IHRRRGAJSA-N 0.000 description 1
- WIPAMEKBSHNFQE-IUCAKERBSA-N Pro-Met-Gly Chemical compound CSCC[C@@H](C(=O)NCC(=O)O)NC(=O)[C@@H]1CCCN1 WIPAMEKBSHNFQE-IUCAKERBSA-N 0.000 description 1
- ZUZINZIJHJFJRN-UBHSHLNASA-N Pro-Phe-Ala Chemical compound C([C@@H](C(=O)N[C@@H](C)C(O)=O)NC(=O)[C@H]1NCCC1)C1=CC=CC=C1 ZUZINZIJHJFJRN-UBHSHLNASA-N 0.000 description 1
- WHNJMTHJGCEKGA-ULQDDVLXSA-N Pro-Phe-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O WHNJMTHJGCEKGA-ULQDDVLXSA-N 0.000 description 1
- XYAFCOJKICBRDU-JYJNAYRXSA-N Pro-Phe-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O XYAFCOJKICBRDU-JYJNAYRXSA-N 0.000 description 1
- AJNGQVUFQUVRQT-JYJNAYRXSA-N Pro-Pro-Tyr Chemical compound C([C@@H](C(=O)O)NC(=O)[C@H]1N(CCC1)C(=O)[C@H]1NCCC1)C1=CC=C(O)C=C1 AJNGQVUFQUVRQT-JYJNAYRXSA-N 0.000 description 1
- LNICFEXCAHIJOR-DCAQKATOSA-N Pro-Ser-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O LNICFEXCAHIJOR-DCAQKATOSA-N 0.000 description 1
- SXJOPONICMGFCR-DCAQKATOSA-N Pro-Ser-Lys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O SXJOPONICMGFCR-DCAQKATOSA-N 0.000 description 1
- SNGZLPOXVRTNMB-LPEHRKFASA-N Pro-Ser-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CO)C(=O)N2CCC[C@@H]2C(=O)O SNGZLPOXVRTNMB-LPEHRKFASA-N 0.000 description 1
- KWMZPPWYBVZIER-XGEHTFHBSA-N Pro-Ser-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KWMZPPWYBVZIER-XGEHTFHBSA-N 0.000 description 1
- PRKWBYCXBBSLSK-GUBZILKMSA-N Pro-Ser-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O PRKWBYCXBBSLSK-GUBZILKMSA-N 0.000 description 1
- FDMCIBSQRKFSTJ-RHYQMDGZSA-N Pro-Thr-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O FDMCIBSQRKFSTJ-RHYQMDGZSA-N 0.000 description 1
- CXGLFEOYCJFKPR-RCWTZXSCSA-N Pro-Thr-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O CXGLFEOYCJFKPR-RCWTZXSCSA-N 0.000 description 1
- FIDNSJUXESUDOV-JYJNAYRXSA-N Pro-Tyr-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O FIDNSJUXESUDOV-JYJNAYRXSA-N 0.000 description 1
- ZAUHSLVPDLNTRZ-QXEWZRGKSA-N Pro-Val-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O ZAUHSLVPDLNTRZ-QXEWZRGKSA-N 0.000 description 1
- JXVXYRZQIUPYSA-NHCYSSNCSA-N Pro-Val-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O JXVXYRZQIUPYSA-NHCYSSNCSA-N 0.000 description 1
- IMNVAOPEMFDAQD-NHCYSSNCSA-N Pro-Val-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O IMNVAOPEMFDAQD-NHCYSSNCSA-N 0.000 description 1
- ZMLRZBWCXPQADC-TUAOUCFPSA-N Pro-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 ZMLRZBWCXPQADC-TUAOUCFPSA-N 0.000 description 1
- FIODMZKLZFLYQP-GUBZILKMSA-N Pro-Val-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O FIODMZKLZFLYQP-GUBZILKMSA-N 0.000 description 1
- LVVBAKCGXXUHFO-ZLUOBGJFSA-N Ser-Ala-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O LVVBAKCGXXUHFO-ZLUOBGJFSA-N 0.000 description 1
- MMGJPDWSIOAGTH-ACZMJKKPSA-N Ser-Ala-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O MMGJPDWSIOAGTH-ACZMJKKPSA-N 0.000 description 1
- BTKUIVBNGBFTTP-WHFBIAKZSA-N Ser-Ala-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)NCC(O)=O BTKUIVBNGBFTTP-WHFBIAKZSA-N 0.000 description 1
- WTWGOQRNRFHFQD-JBDRJPRFSA-N Ser-Ala-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WTWGOQRNRFHFQD-JBDRJPRFSA-N 0.000 description 1
- IYCBDVBJWDXQRR-FXQIFTODSA-N Ser-Ala-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CCSC)C(O)=O IYCBDVBJWDXQRR-FXQIFTODSA-N 0.000 description 1
- YQHZVYJAGWMHES-ZLUOBGJFSA-N Ser-Ala-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YQHZVYJAGWMHES-ZLUOBGJFSA-N 0.000 description 1
- QFBNNYNWKYKVJO-DCAQKATOSA-N Ser-Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CCCN=C(N)N QFBNNYNWKYKVJO-DCAQKATOSA-N 0.000 description 1
- OBXVZEAMXFSGPU-FXQIFTODSA-N Ser-Asn-Arg Chemical compound C(C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CO)N)CN=C(N)N OBXVZEAMXFSGPU-FXQIFTODSA-N 0.000 description 1
- ZXLUWXWISXIFIX-ACZMJKKPSA-N Ser-Asn-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZXLUWXWISXIFIX-ACZMJKKPSA-N 0.000 description 1
- TYYBJUYSTWJHGO-ZKWXMUAHSA-N Ser-Asn-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O TYYBJUYSTWJHGO-ZKWXMUAHSA-N 0.000 description 1
- LALNXSXEYFUUDD-GUBZILKMSA-N Ser-Glu-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LALNXSXEYFUUDD-GUBZILKMSA-N 0.000 description 1
- VQBCMLMPEWPUTB-ACZMJKKPSA-N Ser-Glu-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O VQBCMLMPEWPUTB-ACZMJKKPSA-N 0.000 description 1
- GZBKRJVCRMZAST-XKBZYTNZSA-N Ser-Glu-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GZBKRJVCRMZAST-XKBZYTNZSA-N 0.000 description 1
- IOVHBRCQOGWAQH-ZKWXMUAHSA-N Ser-Gly-Ile Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O IOVHBRCQOGWAQH-ZKWXMUAHSA-N 0.000 description 1
- JFWDJFULOLKQFY-QWRGUYRKSA-N Ser-Gly-Phe Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JFWDJFULOLKQFY-QWRGUYRKSA-N 0.000 description 1
- UIGMAMGZOJVTDN-WHFBIAKZSA-N Ser-Gly-Ser Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O UIGMAMGZOJVTDN-WHFBIAKZSA-N 0.000 description 1
- HMRAQFJFTOLDKW-GUBZILKMSA-N Ser-His-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(O)=O HMRAQFJFTOLDKW-GUBZILKMSA-N 0.000 description 1
- ZUDXUJSYCCNZQJ-DCAQKATOSA-N Ser-His-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CO)N ZUDXUJSYCCNZQJ-DCAQKATOSA-N 0.000 description 1
- YIUWWXVTYLANCJ-NAKRPEOUSA-N Ser-Ile-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O YIUWWXVTYLANCJ-NAKRPEOUSA-N 0.000 description 1
- BKZYBLLIBOBOOW-GHCJXIJMSA-N Ser-Ile-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O BKZYBLLIBOBOOW-GHCJXIJMSA-N 0.000 description 1
- JIPVNVNKXJLFJF-BJDJZHNGSA-N Ser-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N JIPVNVNKXJLFJF-BJDJZHNGSA-N 0.000 description 1
- LWMQRHDTXHQQOV-MXAVVETBSA-N Ser-Ile-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LWMQRHDTXHQQOV-MXAVVETBSA-N 0.000 description 1
- GVIGVIOEYBOTCB-XIRDDKMYSA-N Ser-Leu-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CC(C)C)C(O)=O)=CNC2=C1 GVIGVIOEYBOTCB-XIRDDKMYSA-N 0.000 description 1
- XUDRHBPSPAPDJP-SRVKXCTJSA-N Ser-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CO XUDRHBPSPAPDJP-SRVKXCTJSA-N 0.000 description 1
- FPCGZYMRFFIYIH-CIUDSAMLSA-N Ser-Lys-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O FPCGZYMRFFIYIH-CIUDSAMLSA-N 0.000 description 1
- LRZLZIUXQBIWTB-KATARQTJSA-N Ser-Lys-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LRZLZIUXQBIWTB-KATARQTJSA-N 0.000 description 1
- RQXDSYQXBCRXBT-GUBZILKMSA-N Ser-Met-Arg Chemical compound OC[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@H](C(O)=O)CCCN=C(N)N RQXDSYQXBCRXBT-GUBZILKMSA-N 0.000 description 1
- KZPRPBLHYMZIMH-MXAVVETBSA-N Ser-Phe-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KZPRPBLHYMZIMH-MXAVVETBSA-N 0.000 description 1
- QUGRFWPMPVIAPW-IHRRRGAJSA-N Ser-Pro-Phe Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 QUGRFWPMPVIAPW-IHRRRGAJSA-N 0.000 description 1
- AZWNCEBQZXELEZ-FXQIFTODSA-N Ser-Pro-Ser Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O AZWNCEBQZXELEZ-FXQIFTODSA-N 0.000 description 1
- HHJFMHQYEAAOBM-ZLUOBGJFSA-N Ser-Ser-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O HHJFMHQYEAAOBM-ZLUOBGJFSA-N 0.000 description 1
- JCLAFVNDBJMLBC-JBDRJPRFSA-N Ser-Ser-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JCLAFVNDBJMLBC-JBDRJPRFSA-N 0.000 description 1
- XQJCEKXQUJQNNK-ZLUOBGJFSA-N Ser-Ser-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O XQJCEKXQUJQNNK-ZLUOBGJFSA-N 0.000 description 1
- JURQXQBJKUHGJS-UHFFFAOYSA-N Ser-Ser-Ser-Ser Chemical compound OCC(N)C(=O)NC(CO)C(=O)NC(CO)C(=O)NC(CO)C(O)=O JURQXQBJKUHGJS-UHFFFAOYSA-N 0.000 description 1
- OLKICIBQRVSQMA-SRVKXCTJSA-N Ser-Ser-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O OLKICIBQRVSQMA-SRVKXCTJSA-N 0.000 description 1
- SZRNDHWMVSFPSP-XKBZYTNZSA-N Ser-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CO)N)O SZRNDHWMVSFPSP-XKBZYTNZSA-N 0.000 description 1
- KKKVOZNCLALMPV-XKBZYTNZSA-N Ser-Thr-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O KKKVOZNCLALMPV-XKBZYTNZSA-N 0.000 description 1
- PURRNJBBXDDWLX-ZDLURKLDSA-N Ser-Thr-Gly Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CO)N)O PURRNJBBXDDWLX-ZDLURKLDSA-N 0.000 description 1
- NADLKBTYNKUJEP-KATARQTJSA-N Ser-Thr-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O NADLKBTYNKUJEP-KATARQTJSA-N 0.000 description 1
- UKKROEYWYIHWBD-ZKWXMUAHSA-N Ser-Val-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O UKKROEYWYIHWBD-ZKWXMUAHSA-N 0.000 description 1
- BEBVVQPDSHHWQL-NRPADANISA-N Ser-Val-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O BEBVVQPDSHHWQL-NRPADANISA-N 0.000 description 1
- JGUWRQWULDWNCM-FXQIFTODSA-N Ser-Val-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O JGUWRQWULDWNCM-FXQIFTODSA-N 0.000 description 1
- FEWJPZIEWOKRBE-UHFFFAOYSA-N Tartaric acid Natural products [H+].[H+].[O-]C(=O)C(O)C(O)C([O-])=O FEWJPZIEWOKRBE-UHFFFAOYSA-N 0.000 description 1
- MQCPGOZXFSYJPS-KZVJFYERSA-N Thr-Ala-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O MQCPGOZXFSYJPS-KZVJFYERSA-N 0.000 description 1
- DDPVJPIGACCMEH-XQXXSGGOSA-N Thr-Ala-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O DDPVJPIGACCMEH-XQXXSGGOSA-N 0.000 description 1
- FQPQPTHMHZKGFM-XQXXSGGOSA-N Thr-Ala-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O FQPQPTHMHZKGFM-XQXXSGGOSA-N 0.000 description 1
- KEGBFULVYKYJRD-LFSVMHDDSA-N Thr-Ala-Phe Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KEGBFULVYKYJRD-LFSVMHDDSA-N 0.000 description 1
- XSLXHSYIVPGEER-KZVJFYERSA-N Thr-Ala-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O XSLXHSYIVPGEER-KZVJFYERSA-N 0.000 description 1
- JEDIEMIJYSRUBB-FOHZUACHSA-N Thr-Asp-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O JEDIEMIJYSRUBB-FOHZUACHSA-N 0.000 description 1
- GKMYGVQDGVYCPC-IUKAMOBKSA-N Thr-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H]([C@@H](C)O)N GKMYGVQDGVYCPC-IUKAMOBKSA-N 0.000 description 1
- LIXBDERDAGNVAV-XKBZYTNZSA-N Thr-Gln-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O LIXBDERDAGNVAV-XKBZYTNZSA-N 0.000 description 1
- UDQBCBUXAQIZAK-GLLZPBPUSA-N Thr-Glu-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O UDQBCBUXAQIZAK-GLLZPBPUSA-N 0.000 description 1
- LHEZGZQRLDBSRR-WDCWCFNPSA-N Thr-Glu-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LHEZGZQRLDBSRR-WDCWCFNPSA-N 0.000 description 1
- KCRQEJSKXAIULJ-FJXKBIBVSA-N Thr-Gly-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O KCRQEJSKXAIULJ-FJXKBIBVSA-N 0.000 description 1
- DJDSEDOKJTZBAR-ZDLURKLDSA-N Thr-Gly-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O DJDSEDOKJTZBAR-ZDLURKLDSA-N 0.000 description 1
- CYVQBKQYQGEELV-NKIYYHGXSA-N Thr-His-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O CYVQBKQYQGEELV-NKIYYHGXSA-N 0.000 description 1
- URPSJRMWHQTARR-MBLNEYKQSA-N Thr-Ile-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O URPSJRMWHQTARR-MBLNEYKQSA-N 0.000 description 1
- UYTYTDMCDBPDSC-URLPEUOOSA-N Thr-Ile-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N UYTYTDMCDBPDSC-URLPEUOOSA-N 0.000 description 1
- YJCVECXVYHZOBK-KNZXXDILSA-N Thr-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H]([C@@H](C)O)N YJCVECXVYHZOBK-KNZXXDILSA-N 0.000 description 1
- NCXVJIQMWSGRHY-KXNHARMFSA-N Thr-Leu-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N)O NCXVJIQMWSGRHY-KXNHARMFSA-N 0.000 description 1
- KZSYAEWQMJEGRZ-RHYQMDGZSA-N Thr-Leu-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O KZSYAEWQMJEGRZ-RHYQMDGZSA-N 0.000 description 1
- KRDSCBLRHORMRK-JXUBOQSCSA-N Thr-Lys-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O KRDSCBLRHORMRK-JXUBOQSCSA-N 0.000 description 1
- JLNMFGCJODTXDH-WEDXCCLWSA-N Thr-Lys-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O JLNMFGCJODTXDH-WEDXCCLWSA-N 0.000 description 1
- ZXIHABSKUITPTN-IXOXFDKPSA-N Thr-Lys-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N)O ZXIHABSKUITPTN-IXOXFDKPSA-N 0.000 description 1
- SPVHQURZJCUDQC-VOAKCMCISA-N Thr-Lys-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O SPVHQURZJCUDQC-VOAKCMCISA-N 0.000 description 1
- QHUWWSQZTFLXPQ-FJXKBIBVSA-N Thr-Met-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(=O)NCC(O)=O QHUWWSQZTFLXPQ-FJXKBIBVSA-N 0.000 description 1
- JMBRNXUOLJFURW-BEAPCOKYSA-N Thr-Phe-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N2CCC[C@@H]2C(=O)O)N)O JMBRNXUOLJFURW-BEAPCOKYSA-N 0.000 description 1
- WTMPKZWHRCMMMT-KZVJFYERSA-N Thr-Pro-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O WTMPKZWHRCMMMT-KZVJFYERSA-N 0.000 description 1
- NYQIZWROIMIQSL-VEVYYDQMSA-N Thr-Pro-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O NYQIZWROIMIQSL-VEVYYDQMSA-N 0.000 description 1
- VTMGKRABARCZAX-OSUNSFLBSA-N Thr-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)[C@@H](C)O VTMGKRABARCZAX-OSUNSFLBSA-N 0.000 description 1
- DEGCBBCMYWNJNA-RHYQMDGZSA-N Thr-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)[C@@H](C)O DEGCBBCMYWNJNA-RHYQMDGZSA-N 0.000 description 1
- BDENGIGFTNYZSJ-RCWTZXSCSA-N Thr-Pro-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCSC)C(O)=O BDENGIGFTNYZSJ-RCWTZXSCSA-N 0.000 description 1
- STUAPCLEDMKXKL-LKXGYXEUSA-N Thr-Ser-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O STUAPCLEDMKXKL-LKXGYXEUSA-N 0.000 description 1
- XHWCDRUPDNSDAZ-XKBZYTNZSA-N Thr-Ser-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N)O XHWCDRUPDNSDAZ-XKBZYTNZSA-N 0.000 description 1
- AHERARIZBPOMNU-KATARQTJSA-N Thr-Ser-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O AHERARIZBPOMNU-KATARQTJSA-N 0.000 description 1
- AAZOYLQUEQRUMZ-GSSVUCPTSA-N Thr-Thr-Asn Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(N)=O AAZOYLQUEQRUMZ-GSSVUCPTSA-N 0.000 description 1
- VBMOVTMNHWPZJR-SUSMZKCASA-N Thr-Thr-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O VBMOVTMNHWPZJR-SUSMZKCASA-N 0.000 description 1
- PJCYRZVSACOYSN-ZJDVBMNYSA-N Thr-Thr-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(O)=O PJCYRZVSACOYSN-ZJDVBMNYSA-N 0.000 description 1
- LECUEEHKUFYOOV-ZJDVBMNYSA-N Thr-Thr-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@@H](N)[C@@H](C)O LECUEEHKUFYOOV-ZJDVBMNYSA-N 0.000 description 1
- LXXCHJKHJYRMIY-FQPOAREZSA-N Thr-Tyr-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O LXXCHJKHJYRMIY-FQPOAREZSA-N 0.000 description 1
- CYCGARJWIQWPQM-YJRXYDGGSA-N Thr-Tyr-Ser Chemical compound C[C@@H](O)[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CO)C([O-])=O)CC1=CC=C(O)C=C1 CYCGARJWIQWPQM-YJRXYDGGSA-N 0.000 description 1
- OGOYMQWIWHGTGH-KZVJFYERSA-N Thr-Val-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O OGOYMQWIWHGTGH-KZVJFYERSA-N 0.000 description 1
- QGVBFDIREUUSHX-IFFSRLJSSA-N Thr-Val-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O QGVBFDIREUUSHX-IFFSRLJSSA-N 0.000 description 1
- AKHDFZHUPGVFEJ-YEPSODPASA-N Thr-Val-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O AKHDFZHUPGVFEJ-YEPSODPASA-N 0.000 description 1
- CURFABYITJVKEW-QTKMDUPCSA-N Thr-Val-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N)O CURFABYITJVKEW-QTKMDUPCSA-N 0.000 description 1
- ILUOMMDDGREELW-OSUNSFLBSA-N Thr-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)[C@@H](C)O ILUOMMDDGREELW-OSUNSFLBSA-N 0.000 description 1
- BKVICMPZWRNWOC-RHYQMDGZSA-N Thr-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)[C@@H](C)O BKVICMPZWRNWOC-RHYQMDGZSA-N 0.000 description 1
- KZTLZZQTJMCGIP-ZJDVBMNYSA-N Thr-Val-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KZTLZZQTJMCGIP-ZJDVBMNYSA-N 0.000 description 1
- VYVBSMCZNHOZGD-RCWTZXSCSA-N Thr-Val-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O VYVBSMCZNHOZGD-RCWTZXSCSA-N 0.000 description 1
- MVHHTXAUJCIOMZ-WDSOQIARSA-N Trp-Arg-Lys Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O)N MVHHTXAUJCIOMZ-WDSOQIARSA-N 0.000 description 1
- TWJDQTTXXZDJKV-BPUTZDHNSA-N Trp-Arg-Ser Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O TWJDQTTXXZDJKV-BPUTZDHNSA-N 0.000 description 1
- PNHABSVRPFBUJY-UMPQAUOISA-N Trp-Arg-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N)O PNHABSVRPFBUJY-UMPQAUOISA-N 0.000 description 1
- ADBFWLXCCKIXBQ-XIRDDKMYSA-N Trp-Asn-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N ADBFWLXCCKIXBQ-XIRDDKMYSA-N 0.000 description 1
- DQDXHYIEITXNJY-BPUTZDHNSA-N Trp-Gln-Gln Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N DQDXHYIEITXNJY-BPUTZDHNSA-N 0.000 description 1
- PHNBFZBKLWEBJN-BPUTZDHNSA-N Trp-Glu-Gln Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N PHNBFZBKLWEBJN-BPUTZDHNSA-N 0.000 description 1
- AZBIIKDSDLVJAK-VHWLVUOQSA-N Trp-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N AZBIIKDSDLVJAK-VHWLVUOQSA-N 0.000 description 1
- BONYBFXWMXBAND-GQGQLFGLSA-N Trp-Ile-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N BONYBFXWMXBAND-GQGQLFGLSA-N 0.000 description 1
- CSRCUZAVBSEDMB-FDARSICLSA-N Trp-Ile-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N CSRCUZAVBSEDMB-FDARSICLSA-N 0.000 description 1
- BOESUSAIMQGVJD-RYQLBKOJSA-N Trp-Met-Pro Chemical compound CSCC[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N BOESUSAIMQGVJD-RYQLBKOJSA-N 0.000 description 1
- RNDWCRUOGGQDKN-UBHSHLNASA-N Trp-Ser-Asp Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O RNDWCRUOGGQDKN-UBHSHLNASA-N 0.000 description 1
- NJNCVQYFNKZMAH-JYBASQMISA-N Trp-Thr-Cys Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CS)C(O)=O)=CNC2=C1 NJNCVQYFNKZMAH-JYBASQMISA-N 0.000 description 1
- SEXRBCGSZRCIPE-LYSGOOTNSA-N Trp-Thr-Gly Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N)O SEXRBCGSZRCIPE-LYSGOOTNSA-N 0.000 description 1
- LNGFWVPNKLWATF-ZVZYQTTQSA-N Trp-Val-Glu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O LNGFWVPNKLWATF-ZVZYQTTQSA-N 0.000 description 1
- KPEVFMGKBCMTJF-SZMVWBNQSA-N Trp-Val-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N KPEVFMGKBCMTJF-SZMVWBNQSA-N 0.000 description 1
- VCXWRWYFJLXITF-AUTRQRHGSA-N Tyr-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 VCXWRWYFJLXITF-AUTRQRHGSA-N 0.000 description 1
- NSOMQRHZMJMZIE-GVARAGBVSA-N Tyr-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 NSOMQRHZMJMZIE-GVARAGBVSA-N 0.000 description 1
- IIJWXEUNETVJPV-IHRRRGAJSA-N Tyr-Arg-Ser Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CO)C(=O)O)N)O IIJWXEUNETVJPV-IHRRRGAJSA-N 0.000 description 1
- CKKFTIQYURNSEI-IHRRRGAJSA-N Tyr-Asn-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 CKKFTIQYURNSEI-IHRRRGAJSA-N 0.000 description 1
- PEVVXUGSAKEPEN-AVGNSLFASA-N Tyr-Asn-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O PEVVXUGSAKEPEN-AVGNSLFASA-N 0.000 description 1
- GAYLGYUVTDMLKC-UWJYBYFXSA-N Tyr-Asp-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 GAYLGYUVTDMLKC-UWJYBYFXSA-N 0.000 description 1
- TZXFLDNBYYGLKA-BZSNNMDCSA-N Tyr-Asp-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 TZXFLDNBYYGLKA-BZSNNMDCSA-N 0.000 description 1
- PDKILSUYSUGCAO-JBACZVJFSA-N Tyr-Gln-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC3=CC=C(C=C3)O)N PDKILSUYSUGCAO-JBACZVJFSA-N 0.000 description 1
- IMXAAEFAIBRCQF-SIUGBPQLSA-N Tyr-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N IMXAAEFAIBRCQF-SIUGBPQLSA-N 0.000 description 1
- NZFCWALTLNFHHC-JYJNAYRXSA-N Tyr-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 NZFCWALTLNFHHC-JYJNAYRXSA-N 0.000 description 1
- JKUZFODWJGEQAP-KBPBESRZSA-N Tyr-Gly-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)O)N)O JKUZFODWJGEQAP-KBPBESRZSA-N 0.000 description 1
- OLWFDNLLBWQWCP-STQMWFEESA-N Tyr-Gly-Met Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H](CCSC)C(O)=O OLWFDNLLBWQWCP-STQMWFEESA-N 0.000 description 1
- MVYRJYISVJWKSX-KBPBESRZSA-N Tyr-His-Gly Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)NCC(=O)O)N)O MVYRJYISVJWKSX-KBPBESRZSA-N 0.000 description 1
- LQGDFDYGDQEMGA-PXDAIIFMSA-N Tyr-Ile-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CC3=CC=C(C=C3)O)N LQGDFDYGDQEMGA-PXDAIIFMSA-N 0.000 description 1
- GULIUBBXCYPDJU-CQDKDKBSSA-N Tyr-Leu-Ala Chemical compound [O-]C(=O)[C@H](C)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]([NH3+])CC1=CC=C(O)C=C1 GULIUBBXCYPDJU-CQDKDKBSSA-N 0.000 description 1
- BSCBBPKDVOZICB-KKUMJFAQSA-N Tyr-Leu-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O BSCBBPKDVOZICB-KKUMJFAQSA-N 0.000 description 1
- NKUGCYDFQKFVOJ-JYJNAYRXSA-N Tyr-Leu-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 NKUGCYDFQKFVOJ-JYJNAYRXSA-N 0.000 description 1
- DAOREBHZAKCOEN-ULQDDVLXSA-N Tyr-Leu-Met Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(O)=O DAOREBHZAKCOEN-ULQDDVLXSA-N 0.000 description 1
- ARJASMXQBRNAGI-YESZJQIVSA-N Tyr-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N ARJASMXQBRNAGI-YESZJQIVSA-N 0.000 description 1
- BJCILVZEZRDIDR-PMVMPFDFSA-N Tyr-Leu-Trp Chemical compound C([C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(O)=O)C1=CC=C(O)C=C1 BJCILVZEZRDIDR-PMVMPFDFSA-N 0.000 description 1
- GITNQBVCEQBDQC-KKUMJFAQSA-N Tyr-Lys-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O GITNQBVCEQBDQC-KKUMJFAQSA-N 0.000 description 1
- HSBZWINKRYZCSQ-KKUMJFAQSA-N Tyr-Lys-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O HSBZWINKRYZCSQ-KKUMJFAQSA-N 0.000 description 1
- KZOZXAYPVKKDIO-UFYCRDLUSA-N Tyr-Met-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 KZOZXAYPVKKDIO-UFYCRDLUSA-N 0.000 description 1
- OKDNSNWJEXAMSU-IRXDYDNUSA-N Tyr-Phe-Gly Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)NCC(O)=O)C1=CC=C(O)C=C1 OKDNSNWJEXAMSU-IRXDYDNUSA-N 0.000 description 1
- GQVZBMROTPEPIF-SRVKXCTJSA-N Tyr-Ser-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O GQVZBMROTPEPIF-SRVKXCTJSA-N 0.000 description 1
- ITDWWLTTWRRLCC-KJEVXHAQSA-N Tyr-Thr-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 ITDWWLTTWRRLCC-KJEVXHAQSA-N 0.000 description 1
- AOIZTZRWMSPPAY-KAOXEZKKSA-N Tyr-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)O AOIZTZRWMSPPAY-KAOXEZKKSA-N 0.000 description 1
- JHDZONWZTCKTJR-KJEVXHAQSA-N Tyr-Thr-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 JHDZONWZTCKTJR-KJEVXHAQSA-N 0.000 description 1
- ZYVAAYAOTVJBSS-GMVOTWDCSA-N Tyr-Trp-Ala Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C)C(O)=O ZYVAAYAOTVJBSS-GMVOTWDCSA-N 0.000 description 1
- AEOFMCAKYIQQFY-YDHLFZDLSA-N Tyr-Val-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 AEOFMCAKYIQQFY-YDHLFZDLSA-N 0.000 description 1
- GOPQNCQSXBJAII-ULQDDVLXSA-N Tyr-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N GOPQNCQSXBJAII-ULQDDVLXSA-N 0.000 description 1
- OBKOPLHSRDATFO-XHSDSOJGSA-N Tyr-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N OBKOPLHSRDATFO-XHSDSOJGSA-N 0.000 description 1
- 108010064997 VPY tripeptide Proteins 0.000 description 1
- DDRBQONWVBDQOY-GUBZILKMSA-N Val-Ala-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O DDRBQONWVBDQOY-GUBZILKMSA-N 0.000 description 1
- WOCYUGQDXPTQPY-FXQIFTODSA-N Val-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](C(C)C)N WOCYUGQDXPTQPY-FXQIFTODSA-N 0.000 description 1
- JFAWZADYPRMRCO-UBHSHLNASA-N Val-Ala-Phe Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 JFAWZADYPRMRCO-UBHSHLNASA-N 0.000 description 1
- VJOWWOGRNXRQMF-UVBJJODRSA-N Val-Ala-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](C)NC(=O)[C@@H](N)C(C)C)C(O)=O)=CNC2=C1 VJOWWOGRNXRQMF-UVBJJODRSA-N 0.000 description 1
- KKHRWGYHBZORMQ-NHCYSSNCSA-N Val-Arg-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N KKHRWGYHBZORMQ-NHCYSSNCSA-N 0.000 description 1
- VMRFIKXKOFNMHW-GUBZILKMSA-N Val-Arg-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CO)C(=O)O)N VMRFIKXKOFNMHW-GUBZILKMSA-N 0.000 description 1
- QPZMOUMNTGTEFR-ZKWXMUAHSA-N Val-Asn-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](C(C)C)N QPZMOUMNTGTEFR-ZKWXMUAHSA-N 0.000 description 1
- BYOHPUZJVXWHAE-BYULHYEWSA-N Val-Asn-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N BYOHPUZJVXWHAE-BYULHYEWSA-N 0.000 description 1
- LNYOXPDEIZJDEI-NHCYSSNCSA-N Val-Asn-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](C(C)C)N LNYOXPDEIZJDEI-NHCYSSNCSA-N 0.000 description 1
- PVPAOIGJYHVWBT-KKHAAJSZSA-N Val-Asn-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](C(C)C)N)O PVPAOIGJYHVWBT-KKHAAJSZSA-N 0.000 description 1
- HZYOWMGWKKRMBZ-BYULHYEWSA-N Val-Asp-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N HZYOWMGWKKRMBZ-BYULHYEWSA-N 0.000 description 1
- FBVUOEYVGNMRMD-NAKRPEOUSA-N Val-Cys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](C(C)C)N FBVUOEYVGNMRMD-NAKRPEOUSA-N 0.000 description 1
- QHFQQRKNGCXTHL-AUTRQRHGSA-N Val-Gln-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QHFQQRKNGCXTHL-AUTRQRHGSA-N 0.000 description 1
- AGKDVLSDNSTLFA-UMNHJUIQSA-N Val-Gln-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N AGKDVLSDNSTLFA-UMNHJUIQSA-N 0.000 description 1
- ROLGIBMFNMZANA-GVXVVHGQSA-N Val-Glu-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C(C)C)N ROLGIBMFNMZANA-GVXVVHGQSA-N 0.000 description 1
- ZXAGTABZUOMUDO-GVXVVHGQSA-N Val-Glu-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N ZXAGTABZUOMUDO-GVXVVHGQSA-N 0.000 description 1
- WDIGUPHXPBMODF-UMNHJUIQSA-N Val-Glu-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N WDIGUPHXPBMODF-UMNHJUIQSA-N 0.000 description 1
- XWYUBUYQMOUFRQ-IFFSRLJSSA-N Val-Glu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C(C)C)N)O XWYUBUYQMOUFRQ-IFFSRLJSSA-N 0.000 description 1
- UEHRGZCNLSWGHK-DLOVCJGASA-N Val-Glu-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O UEHRGZCNLSWGHK-DLOVCJGASA-N 0.000 description 1
- DJEVQCWNMQOABE-RCOVLWMOSA-N Val-Gly-Asp Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)O)C(=O)O)N DJEVQCWNMQOABE-RCOVLWMOSA-N 0.000 description 1
- KZKMBGXCNLPYKD-YEPSODPASA-N Val-Gly-Thr Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O KZKMBGXCNLPYKD-YEPSODPASA-N 0.000 description 1
- YTUABZMPYKCWCQ-XQQFMLRXSA-N Val-His-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N2CCC[C@@H]2C(=O)O)N YTUABZMPYKCWCQ-XQQFMLRXSA-N 0.000 description 1
- CPGJELLYDQEDRK-NAKRPEOUSA-N Val-Ile-Ala Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](C)C(O)=O CPGJELLYDQEDRK-NAKRPEOUSA-N 0.000 description 1
- BZMIYHIJVVJPCK-QSFUFRPTSA-N Val-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N BZMIYHIJVVJPCK-QSFUFRPTSA-N 0.000 description 1
- SDUBQHUJJWQTEU-XUXIUFHCSA-N Val-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](C(C)C)N SDUBQHUJJWQTEU-XUXIUFHCSA-N 0.000 description 1
- JZWZACGUZVCQPS-RNJOBUHISA-N Val-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N JZWZACGUZVCQPS-RNJOBUHISA-N 0.000 description 1
- APQIVBCUIUDSMB-OSUNSFLBSA-N Val-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C(C)C)N APQIVBCUIUDSMB-OSUNSFLBSA-N 0.000 description 1
- BZWUSZGQOILYEU-STECZYCISA-N Val-Ile-Tyr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 BZWUSZGQOILYEU-STECZYCISA-N 0.000 description 1
- HGJRMXOWUWVUOA-GVXVVHGQSA-N Val-Leu-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N HGJRMXOWUWVUOA-GVXVVHGQSA-N 0.000 description 1
- LYERIXUFCYVFFX-GVXVVHGQSA-N Val-Leu-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N LYERIXUFCYVFFX-GVXVVHGQSA-N 0.000 description 1
- BZOSBRIDWSSTFN-AVGNSLFASA-N Val-Leu-Met Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](C(C)C)N BZOSBRIDWSSTFN-AVGNSLFASA-N 0.000 description 1
- SYSWVVCYSXBVJG-RHYQMDGZSA-N Val-Leu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](C(C)C)N)O SYSWVVCYSXBVJG-RHYQMDGZSA-N 0.000 description 1
- IJGPOONOTBNTFS-GVXVVHGQSA-N Val-Lys-Glu Chemical compound [H]N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O IJGPOONOTBNTFS-GVXVVHGQSA-N 0.000 description 1
- MGVYZTPLGXPVQB-CYDGBPFRSA-N Val-Met-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](C(C)C)N MGVYZTPLGXPVQB-CYDGBPFRSA-N 0.000 description 1
- WSUWDIVCPOJFCX-TUAOUCFPSA-N Val-Met-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N1CCC[C@@H]1C(=O)O)N WSUWDIVCPOJFCX-TUAOUCFPSA-N 0.000 description 1
- YDVDTCJGBBJGRT-GUBZILKMSA-N Val-Met-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)O)N YDVDTCJGBBJGRT-GUBZILKMSA-N 0.000 description 1
- CKTMJBPRVQWPHU-JSGCOSHPSA-N Val-Phe-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)O)N CKTMJBPRVQWPHU-JSGCOSHPSA-N 0.000 description 1
- NHXZRXLFOBFMDM-AVGNSLFASA-N Val-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)C(C)C NHXZRXLFOBFMDM-AVGNSLFASA-N 0.000 description 1
- WANVRBAZGSICCP-SRVKXCTJSA-N Val-Pro-Met Chemical compound CSCC[C@H](NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)C(C)C)C(O)=O WANVRBAZGSICCP-SRVKXCTJSA-N 0.000 description 1
- DEGUERSKQBRZMZ-FXQIFTODSA-N Val-Ser-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O DEGUERSKQBRZMZ-FXQIFTODSA-N 0.000 description 1
- AJNUKMZFHXUBMK-GUBZILKMSA-N Val-Ser-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N AJNUKMZFHXUBMK-GUBZILKMSA-N 0.000 description 1
- PGQUDQYHWICSAB-NAKRPEOUSA-N Val-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](C(C)C)N PGQUDQYHWICSAB-NAKRPEOUSA-N 0.000 description 1
- DVLWZWNAQUBZBC-ZNSHCXBVSA-N Val-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N)O DVLWZWNAQUBZBC-ZNSHCXBVSA-N 0.000 description 1
- YLBNZCJFSVJDRJ-KJEVXHAQSA-N Val-Thr-Tyr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](Cc1ccc(O)cc1)C(O)=O YLBNZCJFSVJDRJ-KJEVXHAQSA-N 0.000 description 1
- SUGRIIAOLCDLBD-ZOBUZTSGSA-N Val-Trp-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC(=O)O)C(=O)O)N SUGRIIAOLCDLBD-ZOBUZTSGSA-N 0.000 description 1
- QPJSIBAOZBVELU-BPNCWPANSA-N Val-Tyr-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](C(C)C)N QPJSIBAOZBVELU-BPNCWPANSA-N 0.000 description 1
- JXCOEPXCBVCTRD-JYJNAYRXSA-N Val-Tyr-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N JXCOEPXCBVCTRD-JYJNAYRXSA-N 0.000 description 1
- JPBGMZDTPVGGMQ-ULQDDVLXSA-N Val-Tyr-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N JPBGMZDTPVGGMQ-ULQDDVLXSA-N 0.000 description 1
- GTACFKZDQFTVAI-STECZYCISA-N Val-Tyr-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC1=CC=C(O)C=C1 GTACFKZDQFTVAI-STECZYCISA-N 0.000 description 1
- JXWGBRRVTRAZQA-ULQDDVLXSA-N Val-Tyr-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](C(C)C)N JXWGBRRVTRAZQA-ULQDDVLXSA-N 0.000 description 1
- XNLUVJPMPAZHCY-JYJNAYRXSA-N Val-Val-Phe Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C([O-])=O)CC1=CC=CC=C1 XNLUVJPMPAZHCY-JYJNAYRXSA-N 0.000 description 1
- 239000002253 acid Substances 0.000 description 1
- 230000009471 action Effects 0.000 description 1
- 108010069020 alanyl-prolyl-glycine Proteins 0.000 description 1
- 108010041407 alanylaspartic acid Proteins 0.000 description 1
- 108010011559 alanylphenylalanine Proteins 0.000 description 1
- 108010070783 alanyltyrosine Proteins 0.000 description 1
- 239000003513 alkali Substances 0.000 description 1
- 150000001370 alpha-amino acid derivatives Chemical class 0.000 description 1
- 235000008206 alpha-amino acids Nutrition 0.000 description 1
- 108010050025 alpha-glutamyltryptophan Proteins 0.000 description 1
- 150000001414 amino alcohols Chemical class 0.000 description 1
- 125000003277 amino group Chemical group 0.000 description 1
- 238000004458 analytical method Methods 0.000 description 1
- 108010008355 arginyl-glutamine Proteins 0.000 description 1
- 108010072041 arginyl-glycyl-aspartic acid Proteins 0.000 description 1
- 108010066119 arginyl-leucyl-aspartyl-serine Proteins 0.000 description 1
- 108010059459 arginyl-threonyl-phenylalanine Proteins 0.000 description 1
- 108010068380 arginylarginine Proteins 0.000 description 1
- 108010060035 arginylproline Proteins 0.000 description 1
- 239000011942 biocatalyst Substances 0.000 description 1
- 239000007853 buffer solution Substances 0.000 description 1
- 238000010523 cascade reaction Methods 0.000 description 1
- 239000003054 catalyst Substances 0.000 description 1
- 230000003197 catalytic effect Effects 0.000 description 1
- 238000001311 chemical methods and process Methods 0.000 description 1
- 238000003776 cleavage reaction Methods 0.000 description 1
- 150000001875 compounds Chemical class 0.000 description 1
- 238000007796 conventional method Methods 0.000 description 1
- 238000012258 culturing Methods 0.000 description 1
- 238000001212 derivatisation Methods 0.000 description 1
- FSXRLASFHBWESK-UHFFFAOYSA-N dipeptide phenylalanyl-tyrosine Natural products C=1C=C(O)C=CC=1CC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FSXRLASFHBWESK-UHFFFAOYSA-N 0.000 description 1
- 108010054812 diprotin A Proteins 0.000 description 1
- 230000000694 effects Effects 0.000 description 1
- 238000010828 elution Methods 0.000 description 1
- 238000002474 experimental method Methods 0.000 description 1
- 238000000855 fermentation Methods 0.000 description 1
- 230000004151 fermentation Effects 0.000 description 1
- 108010063718 gamma-glutamylaspartic acid Proteins 0.000 description 1
- 238000010353 genetic engineering Methods 0.000 description 1
- 235000012208 gluconic acid Nutrition 0.000 description 1
- 239000000174 gluconic acid Substances 0.000 description 1
- 108010080575 glutamyl-aspartyl-alanine Proteins 0.000 description 1
- 108010040856 glutamyl-cysteinyl-alanine Proteins 0.000 description 1
- 108010073628 glutamyl-valyl-phenylalanine Proteins 0.000 description 1
- 108010075431 glycyl-alanyl-phenylalanine Proteins 0.000 description 1
- 108010072405 glycyl-aspartyl-glycine Proteins 0.000 description 1
- 108010067216 glycyl-glycyl-glycine Proteins 0.000 description 1
- 108010028188 glycyl-histidyl-serine Proteins 0.000 description 1
- 108010079413 glycyl-prolyl-glutamic acid Proteins 0.000 description 1
- 108010008671 glycyl-tryptophyl-methionine Proteins 0.000 description 1
- 108010045126 glycyl-tyrosyl-glycine Proteins 0.000 description 1
- 108010020688 glycylhistidine Proteins 0.000 description 1
- 108010015792 glycyllysine Proteins 0.000 description 1
- 108010084389 glycyltryptophan Proteins 0.000 description 1
- 108010087823 glycyltyrosine Proteins 0.000 description 1
- 125000000623 heterocyclic group Chemical group 0.000 description 1
- 108010050343 histidyl-alanyl-glutamine Proteins 0.000 description 1
- 108010085325 histidylproline Proteins 0.000 description 1
- 238000005984 hydrogenation reaction Methods 0.000 description 1
- 230000007062 hydrolysis Effects 0.000 description 1
- 238000006460 hydrolysis reaction Methods 0.000 description 1
- 125000002887 hydroxy group Chemical group [H]O* 0.000 description 1
- 230000001939 inductive effect Effects 0.000 description 1
- 238000002347 injection Methods 0.000 description 1
- 239000007924 injection Substances 0.000 description 1
- 108010083708 leucyl-aspartyl-valine Proteins 0.000 description 1
- 108010073093 leucyl-glycyl-glycyl-glycine Proteins 0.000 description 1
- 108010044056 leucyl-phenylalanine Proteins 0.000 description 1
- 235000019421 lipase Nutrition 0.000 description 1
- 239000012280 lithium aluminium hydride Substances 0.000 description 1
- 108010089256 lysyl-aspartyl-glutamyl-leucine Proteins 0.000 description 1
- 239000000463 material Substances 0.000 description 1
- 229910052751 metal Inorganic materials 0.000 description 1
- 239000002184 metal Substances 0.000 description 1
- 108010063431 methionyl-aspartyl-glycine Proteins 0.000 description 1
- 108010056582 methionylglutamic acid Proteins 0.000 description 1
- 108010085203 methionylmethionine Proteins 0.000 description 1
- 239000000178 monomer Substances 0.000 description 1
- 230000001590 oxidative effect Effects 0.000 description 1
- 230000037361 pathway Effects 0.000 description 1
- 229940056360 penicillin g Drugs 0.000 description 1
- 239000012450 pharmaceutical intermediate Substances 0.000 description 1
- 108010074082 phenylalanyl-alanyl-lysine Proteins 0.000 description 1
- 108010024654 phenylalanyl-prolyl-alanine Proteins 0.000 description 1
- 108010084572 phenylalanyl-valine Proteins 0.000 description 1
- 230000008569 process Effects 0.000 description 1
- 108010014614 prolyl-glycyl-proline Proteins 0.000 description 1
- 108010007513 prolyl-glycyl-prolyl-leucine Proteins 0.000 description 1
- 108010093296 prolyl-prolyl-alanine Proteins 0.000 description 1
- 108010031719 prolyl-serine Proteins 0.000 description 1
- 108091008146 restriction endonucleases Proteins 0.000 description 1
- 230000007017 scission Effects 0.000 description 1
- 238000000926 separation method Methods 0.000 description 1
- 108010071207 serylmethionine Proteins 0.000 description 1
- 108010005652 splenotritin Proteins 0.000 description 1
- 235000002906 tartaric acid Nutrition 0.000 description 1
- 239000011975 tartaric acid Substances 0.000 description 1
- 108010031491 threonyl-lysyl-glutamic acid Proteins 0.000 description 1
- 108010080629 tryptophan-leucine Proteins 0.000 description 1
- 108010084932 tryptophyl-proline Proteins 0.000 description 1
- 108010005834 tyrosyl-alanyl-glycine Proteins 0.000 description 1
- 108010051110 tyrosyl-lysine Proteins 0.000 description 1
- 241001430294 unidentified retrovirus Species 0.000 description 1
- HGBOYTHUEUWSSQ-UHFFFAOYSA-N valeric aldehyde Natural products CCCCC=O HGBOYTHUEUWSSQ-UHFFFAOYSA-N 0.000 description 1
- IBIDRSSEHFLGSD-UHFFFAOYSA-N valinyl-arginine Natural products CC(C)C(N)C(=O)NC(C(O)=O)CCCN=C(N)N IBIDRSSEHFLGSD-UHFFFAOYSA-N 0.000 description 1
- 108010027345 wheylin-1 peptide Proteins 0.000 description 1
Classifications
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y02—TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
- Y02E—REDUCTION OF GREENHOUSE GAS [GHG] EMISSIONS, RELATED TO ENERGY GENERATION, TRANSMISSION OR DISTRIBUTION
- Y02E50/00—Technologies for the production of fuel of non-fossil origin
- Y02E50/10—Biofuels, e.g. bio-diesel
Landscapes
- Enzymes And Modification Thereof (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
- Preparation Of Compounds By Using Micro-Organisms (AREA)
Abstract
Description
Technical Field
The invention belongs to the field of biotechnology and relates to a method for synthesizing chiral 2-amino-1-butanol, in particular to a method for synthesizing chiral 2-amino-1-butanol by enzymatic catalysis.
Background Art
In recent years, green chemical processes built around biocatalysts have attracted increasing attention. In particular, cascade reactions that combine natural or newly engineered enzymes into multi-enzyme molecular machines are favored by researchers and industry (Fischereder et al., ACS Catal., 2016, 6(1):23-30; France et al., ACS Catal., 2016, 6(6):3753-3759; Li et al., J Agr Food Chem., 2016, 64(46):8927-8934). New cascade routes built from multi-enzyme molecular machines can reduce the loss of intermediates, improve conversion efficiency and greatly reduce production cost (Schrittwieser et al., Curr Opin Chem Biol., 2011, 15(2):249-256; Wheeldon et al., Nat Chem., 2016, 8(4):299-309).
Chiral 2-amino-1-butanol, especially (S)-2-amino-1-butanol, is widely used in organic synthesis and pharmaceutical production and serves as an important intermediate for preparing optically active compounds. Its structural formula is shown in Figure 1. (S)-2-Amino-1-butanol is an important pharmaceutical intermediate. Current production methods include chemical synthesis, for example: synthesis of (S)-2-aminobutanol from butyraldehyde, dibenzyl azodicarboxylate and D-proline as raw materials, using NaBH4, H2 and nickel (Kotkar & Sudalai, Tetrahedron: Asymmetry, 2006, 17(11):1738-1742); reduction of 2-aminobutyric acid with lithium aluminum hydride to give the corresponding chiral amino alcohol (Doherty & Shapira, J Org Chem, 1963, 28(5):1339-1342); reduction of L-2-aminobutyric acid with H2 and nickel under high pressure to give (S)-2-aminobutanol (CN105481703A); and reduction of the corresponding α-amino acid with NaBH4 under H2SO4/THF conditions (Li et al., J Agr Food Chem, 2016, 64(46):8927-8934).
Chemical resolution of aminobutanol has also been used, for example with dibenzoyl tartaric acid (Periasamy et al., Synthesis-Stuttgart, 2003(13):1965-1967) or with L-tartaric acid (Zhao et al., J Heterocyclic Chem, 2012, 49(4):943-946). Enzymatic resolutions have been reported as well: with the amino group protected, a lipase selectively hydrolyzes the ester bond formed at the hydroxyl group to give the chiral monomer (Francalanci et al., J Org Chem, 1987, 52(23):5079-5082); immobilized penicillin G acylase selectively hydrolyzes the (S)-enantiomer in racemic N-phenylacetyl 2-aminobutanol to give (S)-2-aminobutanol with an ee of up to 99% (Fadnavis et al., Tetrahedron: Asymmetry, 1999, 10(23):4495-4500); and a more recent report achieved continuous resolution by packing immobilized penicillin G acylase in a fixed bed, with a conversion of 39.3% and an ee of 98.2%.
Among the synthetic methods for (S)-2-amino-1-butanol described above, the chemical methods rely on hydrogenation under high temperature and high pressure with metal catalysts, which causes heavy pollution and poor safety. Chemical resolution also consumes large amounts of acid, base and other chemical reagents. Enzymatic resolution offers mild reaction conditions and good stereoselectivity, but the conversion of a resolution is at most 50%, and the yield is low.
Summary of the Invention
To solve the above problems, the present invention provides a method for synthesizing chiral 2-amino-1-butanol by enzymatic catalysis, which may comprise the following steps (Figure 2):
(A) using 1,2-butanediol as the substrate, generating 2-keto-1-butanol through a reaction catalyzed by enzyme A and its coenzyme;
(B) using the 2-keto-1-butanol generated in step (A) as the substrate, generating chiral 2-amino-1-butanol through a reaction catalyzed by enzyme B and its coenzyme;
The enzyme A may be selected from any of the following: an alcohol dehydrogenase, a carbonyl reductase, a mutant of the alcohol dehydrogenase, a mutant of the carbonyl reductase;
The enzyme B may be selected from any of the following: an amino acid dehydrogenase, a transaminase, a mutant of the amino acid dehydrogenase, a mutant of the transaminase.
The method provided by the present invention is carried out by multi-enzyme co-expression, by cascade catalysis, or by stepwise catalysis.
Further, the alcohol dehydrogenase and the carbonyl reductase may each be derived from any of the following microorganisms: Geobacillus stearothermophilus, Lactobacillus kefiri, Lactobacillus brevis, Bacillaceae, Thermoanaerobacter brockii, Leifsonia sp., Thermoanaerobacter wiegelii and Sporobolomyces salmonicolor.
Preferred are the alcohol dehydrogenase LBADH from Lactobacillus brevis, the alcohol dehydrogenase TbSADH from Thermoanaerobacter brockii, the alcohol dehydrogenases from Bacillaceae, Geobacillus stearothermophilus, Lactobacillus kefiri, Leifsonia sp. and Thermoanaerobacter wiegelii, and the carbonyl reductase from Sporobolomyces salmonicolor.
Further, the amino acid dehydrogenase may be derived from Geobacillus stearothermophilus.
Further, the transaminase may be derived from any of the following microorganisms: Bacillus megaterium, Pseudomonas aeruginosa, Aspergillus terreus, Aspergillus fumigatus, Neosartorya fischeri, Gibberella zeae and Mycobacterium vanbaalenii, or may be the ATA-117 transaminase from Codexis.
Still further, the alcohol dehydrogenase may specifically be any of the following (a1)-(a8):
(a1) the alcohol dehydrogenase from Lactobacillus brevis, the amino acid sequence of which is SEQ ID No. 2;
(a2) the alcohol dehydrogenase from Geobacillus stearothermophilus, the amino acid sequence of which is SEQ ID No. 4;
(a3) the alcohol dehydrogenase from Thermoanaerobacter brockii, the amino acid sequence of which is SEQ ID No. 6;
(a4) the alcohol dehydrogenase from Lactobacillus kefiri, the amino acid sequence of which is SEQ ID No. 8;
(a5) the alcohol dehydrogenase from Bacillaceae, the amino acid sequence of which is SEQ ID No. 10;
(a6) the alcohol dehydrogenase from Leifsonia sp., the amino acid sequence of which is SEQ ID No. 12;
(a7) the alcohol dehydrogenase from Thermoanaerobacter wiegelii, the amino acid sequence of which is SEQ ID No. 14;
(a8) a fusion protein obtained by attaching a tag to the N-terminus and/or C-terminus of the protein defined in any of (a1)-(a7).
Still further, the carbonyl reductase may specifically be the following (b1) or (b2):
(b1) the carbonyl reductase from Sporobolomyces salmonicolor, the amino acid sequence of which is SEQ ID No. 16;
(b2) a fusion protein obtained by attaching a tag to the N-terminus and/or C-terminus of the protein defined in (b1).
Still further, the amino acid dehydrogenase may specifically be the following (c1) or (c2):
(c1) the leucine dehydrogenase from Geobacillus stearothermophilus, the amino acid sequence of which is SEQ ID No. 18;
(c2) a fusion protein obtained by attaching a tag to the N-terminus and/or C-terminus of the protein defined in (c1).
Still further, the transaminase may specifically be any of the following (d1)-(d9):
(d1) the ATA-117 transaminase from Codexis, the amino acid sequence of which is SEQ ID No. 20;
(d2) the transaminase from Aspergillus terreus, the amino acid sequence of which is SEQ ID No. 22;
(d3) the transaminase from Aspergillus fumigatus, the amino acid sequence of which is SEQ ID No. 24;
(d4) the transaminase from Neosartorya fischeri, the amino acid sequence of which is SEQ ID No. 26;
(d5) the transaminase from Gibberella zeae, the amino acid sequence of which is SEQ ID No. 28;
(d6) the transaminase from Mycobacterium vanbaalenii, the amino acid sequence of which is SEQ ID No. 30;
(d7) the transaminase from Bacillus megaterium, the amino acid sequence of which is SEQ ID No. 32;
(d8) the transaminase from Pseudomonas aeruginosa, the amino acid sequence of which is SEQ ID No. 34;
(d9) a fusion protein obtained by attaching a tag to the N-terminus and/or C-terminus of the protein defined in any of (d1)-(d8).
Further, the mutant of the alcohol dehydrogenase may specifically be the following (e1) or (e2):
(e1) compared with the alcohol dehydrogenase from Lactobacillus brevis shown in SEQ ID No. 2, contains, or contains only, at least one of the following mutations: I11V, G37D;
(e2) a fusion protein obtained by attaching a tag to the N-terminus and/or C-terminus of the protein defined in (e1).
Further, the mutant of the amino acid dehydrogenase may specifically be the following (f1) or (f2):
(f1) compared with the leucine dehydrogenase from Geobacillus stearothermophilus shown in SEQ ID No. 18, contains, or contains only, at least one of the following mutations: K68X, N261X, where X represents any of the 19 amino acids other than the wild-type amino acid (given in the accepted one-letter codes, including F, L, I, V, S, P, T, A, Y, H, Q, N, D, E, C, R, G, M, W). Preferably, X is a polar amino acid (e.g. S, T, Y, H, Q, N, D, E, C, R). Further preferably, X is S, Y, L or C. Still further preferably, the mutant of the amino acid dehydrogenase, compared with the leucine dehydrogenase from Geobacillus stearothermophilus shown in SEQ ID No. 18, contains, or contains only, the following mutations: K68S/N261L or K68Y/N261C.
(f2) a fusion protein obtained by attaching a tag to the N-terminus and/or C-terminus of the protein defined in (f1).
In the present invention, amino acid substitutions are named as follows: original amino acid, position, substituted amino acid. For example, replacing the original isoleucine (I) at position 11 of SEQ ID No. 2 with valine (V) is named "I11V". Variants containing multiple changes are separated by a slash ("/").
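The substitution nomenclature above can be read mechanically. The following is a minimal Python sketch, not part of the patent: the names `Substitution`, `parse_variant` and `apply_variant` are hypothetical, and the four-residue sequence in the usage note is a toy placeholder rather than any SEQ ID sequence.

```python
import re
from typing import NamedTuple


class Substitution(NamedTuple):
    original: str     # one-letter code of the wild-type residue
    position: int     # 1-based position in the reference sequence
    replacement: str  # one-letter code of the new residue


# One substitution: original residue, position, substituted residue (e.g. "I11V").
_SUBSTITUTION = re.compile(r"^([A-Z])(\d+)([A-Z])$")


def parse_variant(label: str) -> list[Substitution]:
    """Split a label such as 'K68S/N261L' into its individual substitutions."""
    substitutions = []
    for part in label.split("/"):
        match = _SUBSTITUTION.match(part.strip())
        if match is None:
            raise ValueError(f"not a valid substitution: {part!r}")
        substitutions.append(
            Substitution(match.group(1), int(match.group(2)), match.group(3))
        )
    return substitutions


def apply_variant(sequence: str, label: str) -> str:
    """Apply every substitution in `label` to a protein sequence (1-based positions)."""
    residues = list(sequence)
    for sub in parse_variant(label):
        index = sub.position - 1
        if not 0 <= index < len(residues):
            raise ValueError(f"position {sub.position} is outside the sequence")
        if residues[index] != sub.original:
            raise ValueError(
                f"expected {sub.original} at position {sub.position}, "
                f"found {residues[index]}"
            )
        residues[index] = sub.replacement
    return "".join(residues)
```

For instance, `parse_variant("K68S/N261L")` yields two substitutions, at positions 68 and 261, and `apply_variant("MAKN", "K3S/N4L")` applies two substitutions to the toy sequence. Checking the wild-type residue before replacing it guards against applying a label to the wrong reference sequence.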
Further, in the method, enzyme A and enzyme B may each act as a catalyst in the form of a crude enzyme solution, a lyophilized powder of the crude enzyme solution, a pure enzyme, or whole cells.
Further, the crude enzyme solution, the lyophilized powder of the crude enzyme solution and the pure enzyme may be prepared by a method comprising the following steps: expressing enzyme A and/or enzyme B in a host cell to obtain recombinant cells; and lysing the recombinant cells to obtain the crude enzyme solution, the lyophilized powder of the crude enzyme solution, or the pure enzyme of enzyme A and/or enzyme B. The whole cells may be prepared by a method comprising the following step: expressing enzyme A and/or enzyme B in a host cell, the resulting recombinant cells being the whole cells of enzyme A and/or enzyme B.
Still further, the recombinant cells may specifically be prepared by a method comprising the following steps: introducing into the host cell a nucleic acid molecule capable of expressing enzyme A and/or enzyme B, and obtaining, after induction culture, the recombinant cells expressing enzyme A and/or enzyme B.
Still further, the "nucleic acid molecule capable of expressing enzyme A and/or enzyme B" may be introduced into the host cell in the form of a recombinant vector. The recombinant vector may be a bacterial plasmid carrying the gene encoding enzyme A and/or enzyme B (such as a T7 promoter-based expression vector for expression in bacteria, e.g. pET-28a), a phage, a yeast plasmid (such as a YEp-series vector) or a retroviral packaging plasmid.
In one embodiment of the present invention, the recombinant vector is specifically a recombinant plasmid obtained by replacing the small fragment between the NdeI and XhoI restriction sites of the pET22b vector with the gene encoding enzyme A or enzyme B.
In another embodiment of the present invention, the recombinant vector is a recombinant plasmid obtained by inserting the gene encoding enzyme A between the EcoRI and HindIII restriction sites of the pETDuet-1 vector and inserting the gene encoding enzyme B between the NdeI and XhoI restriction sites of the same pETDuet-1 vector.
Further, the host cell may be a prokaryotic cell or a lower eukaryotic cell.
Still further, the prokaryotic cell may specifically be a bacterium, and the lower eukaryotic cell may specifically be a yeast cell.
In one embodiment of the present invention, the host cell is specifically Escherichia coli, more specifically E. coli BL21(DE3). Correspondingly, the induction culture consists of adding IPTG to the culture to a final concentration of 0.1-0.5 mM (specifically, e.g., 0.1 mM) and culturing at 20-37°C for 12-24 h (specifically, e.g., 16 h).
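As a small worked example of the induction step, the volume of IPTG stock needed to reach a given final concentration follows from C1·V1 = C2·V2. The sketch below is illustrative only: the 1 M stock concentration and the 1 L culture volume are assumptions not stated in the source, and the function name is hypothetical.

```python
def iptg_stock_volume_ml(final_mM: float, culture_ml: float,
                         stock_mM: float = 1000.0) -> float:
    """Volume of IPTG stock (mL) to add to a culture.

    Uses C1*V1 = C2*V2 and neglects the small volume of stock added.
    The default stock concentration (1 M = 1000 mM) is an assumption.
    """
    if not 0.1 <= final_mM <= 0.5:
        # Final concentration range given in the text: 0.1-0.5 mM.
        raise ValueError("final IPTG concentration outside 0.1-0.5 mM")
    if culture_ml <= 0 or stock_mM <= 0:
        raise ValueError("volumes and concentrations must be positive")
    return final_mM * culture_ml / stock_mM


# Hypothetical 1 L culture induced at 0.1 mM from a 1 M stock.
volume_ml = iptg_stock_volume_ml(0.1, 1000.0)
print(f"add {volume_ml * 1000:.0f} uL of stock")
```

Under these assumptions the culture needs 0.1 mL (100 µL) of stock; other stock concentrations scale the result inversely.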
The gene encoding the alcohol dehydrogenase from Lactobacillus brevis has the sequence SEQ ID No. 1, or is a fusion sequence obtained by attaching a tag-coding sequence to its 5' end and/or 3' end, or is a random and/or site-directed mutagenesis sequence that retains function and encodes the same protein.
The gene encoding the alcohol dehydrogenase from Geobacillus stearothermophilus has the sequence SEQ ID No. 3, or is a fusion sequence obtained by attaching a tag-coding sequence to its 5' end and/or 3' end, or is a random and/or site-directed mutagenesis sequence that retains function and encodes the same protein.
The gene encoding the alcohol dehydrogenase from Thermoanaerobacter brockii has the sequence SEQ ID No. 5, or is a fusion sequence obtained by attaching a tag-coding sequence to its 5' end and/or 3' end, or is a random and/or site-directed mutagenesis sequence that retains function and encodes the same protein.
The gene encoding the alcohol dehydrogenase from Lactobacillus kefiri has the sequence SEQ ID No. 7, or is a fusion sequence obtained by attaching a tag-coding sequence to its 5' end and/or 3' end, or is a random and/or site-directed mutagenesis sequence that retains function and encodes the same protein.
The gene encoding the alcohol dehydrogenase from Bacillaceae has the sequence SEQ ID No. 9, or is a fusion sequence obtained by attaching a tag-coding sequence to its 5' end and/or 3' end, or is a random and/or site-directed mutagenesis sequence that retains function and encodes the same protein.
The gene encoding the alcohol dehydrogenase from Leifsonia sp. has the sequence SEQ ID No. 11, or is a fusion sequence obtained by attaching a tag-coding sequence to its 5' end and/or 3' end, or is a random and/or site-directed mutagenesis sequence that retains function and encodes the same protein.
The gene encoding the alcohol dehydrogenase from Thermoanaerobacter wiegelii has the sequence SEQ ID No. 13, or is a fusion sequence obtained by attaching a tag-coding sequence to its 5' end and/or 3' end, or is a random and/or site-directed mutagenesis sequence that retains function and encodes the same protein.
The gene encoding the carbonyl reductase from Sporobolomyces salmonicolor has the sequence SEQ ID No. 15, or is a fusion sequence obtained by attaching a tag-coding sequence to its 5' end and/or 3' end, or is a random and/or site-directed mutagenesis sequence that retains function and encodes the same protein.
The gene encoding the leucine dehydrogenase from Geobacillus stearothermophilus has the sequence SEQ ID No. 17, or is a fusion sequence obtained by attaching a tag-coding sequence to its 5' end and/or 3' end, or is a random and/or site-directed mutagenesis sequence that retains function and encodes the same protein.
The gene encoding the ATA-117 transaminase from Codexis has the sequence SEQ ID No. 19, or is a fusion sequence obtained by attaching a tag-coding sequence to its 5' end and/or 3' end, or is a random and/or site-directed mutagenesis sequence that retains function and encodes the same protein.
The gene encoding the transaminase from Aspergillus terreus has the sequence SEQ ID No. 21, or is a fusion sequence obtained by attaching a tag-coding sequence to its 5' end and/or 3' end, or is a random and/or site-directed mutagenesis sequence that retains function and encodes the same protein.
The gene encoding the transaminase from Aspergillus fumigatus has the sequence SEQ ID No. 23, or is a fusion sequence obtained by attaching a tag-coding sequence to its 5' end and/or 3' end, or is a random and/or site-directed mutagenesis sequence that retains function and encodes the same protein.
The gene encoding the transaminase from Neosartorya fischeri has the sequence SEQ ID No. 25, or is a fusion sequence obtained by attaching a tag-coding sequence to its 5' end and/or 3' end, or is a random and/or site-directed mutagenesis sequence that retains function and encodes the same protein.
The gene encoding the transaminase from Gibberella zeae has the sequence SEQ ID No. 27, or is a fusion sequence obtained by attaching a tag-coding sequence to its 5' end and/or 3' end, or is a random and/or site-directed mutagenesis sequence that retains function and encodes the same protein.
The gene encoding the transaminase from Mycobacterium vanbaalenii has the sequence SEQ ID No. 29, or is a fusion sequence obtained by attaching a tag-coding sequence to its 5' end and/or 3' end, or is a random and/or site-directed mutagenesis sequence that retains function and encodes the same protein.
The gene encoding the transaminase from Bacillus megaterium has the sequence SEQ ID No. 31, or is a fusion sequence obtained by attaching a tag-coding sequence to its 5' end and/or 3' end, or is a random and/or site-directed mutagenesis sequence that retains function and encodes the same protein.
The gene encoding the transaminase from Pseudomonas aeruginosa has the sequence SEQ ID No. 33, or is a fusion sequence obtained by attaching a tag-coding sequence to its 5' end and/or 3' end, or is a random and/or site-directed mutagenesis sequence that retains function and encodes the same protein.
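The sequence numbering used in this description follows a regular pattern: each protein has an even SEQ ID number (2 to 34) and its coding gene has the preceding odd number (1 to 33). The Python sketch below simply tabulates that pairing as stated in the text; the dictionary keys and the function name are hypothetical labels chosen for illustration.

```python
# Protein SEQ ID numbers as listed in the description (labels are illustrative).
PROTEIN_SEQ_IDS = {
    "ADH, Lactobacillus brevis": 2,
    "ADH, Geobacillus stearothermophilus": 4,
    "ADH, Thermoanaerobacter brockii": 6,
    "ADH, Lactobacillus kefiri": 8,
    "ADH, Bacillaceae": 10,
    "ADH, Leifsonia sp.": 12,
    "ADH, Thermoanaerobacter wiegelii": 14,
    "Carbonyl reductase, Sporobolomyces salmonicolor": 16,
    "Leucine dehydrogenase, Geobacillus stearothermophilus": 18,
    "Transaminase ATA-117 (Codexis)": 20,
    "Transaminase, Aspergillus terreus": 22,
    "Transaminase, Aspergillus fumigatus": 24,
    "Transaminase, Neosartorya fischeri": 26,
    "Transaminase, Gibberella zeae": 28,
    "Transaminase, Mycobacterium vanbaalenii": 30,
    "Transaminase, Bacillus megaterium": 32,
    "Transaminase, Pseudomonas aeruginosa": 34,
}


def gene_seq_id(protein_seq_id: int) -> int:
    """Return the SEQ ID number of the gene encoding a given protein SEQ ID."""
    if protein_seq_id % 2 != 0 or not 2 <= protein_seq_id <= 34:
        raise ValueError("protein SEQ ID numbers are the even numbers 2-34")
    return protein_seq_id - 1


for name, protein_id in PROTEIN_SEQ_IDS.items():
    print(f"{name}: protein SEQ ID No. {protein_id}, "
          f"gene SEQ ID No. {gene_seq_id(protein_id)}")
```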
The gene encoding the mutant of the alcohol dehydrogenase from Lactobacillus brevis has a sequence that is any of the following (g1)-(g3): (g1) compared with SEQ ID No. 1, contains, or contains only, at least one of the following mutations: A31G/T33G, G110A/C111T; (g2) a fusion sequence obtained by attaching a tag-coding sequence to the 5' end and/or 3' end of the sequence defined in (g1); (g3) a random and/or site-directed mutagenesis sequence that, compared with the sequence defined in (g1) or (g2), retains function and encodes the same protein.
The gene encoding the mutant of the leucine dehydrogenase from Geobacillus stearothermophilus has a sequence that is any of the following (h1)-(h3): (h1) compared with SEQ ID No. 17, contains, or contains only, at least one of the following mutations: A203G/A204C, A202T/A204T, A781C/A782T/C783G, A781T/A782G; (h2) a fusion sequence obtained by attaching a tag-coding sequence to the 5' end and/or 3' end of the sequence defined in (h1); (h3) a random and/or site-directed mutagenesis sequence that, compared with the sequence defined in (h1) or (h2), retains function and encodes the same protein.
Further, (h1) is: compared with SEQ ID No. 17, contains, or contains only, either of the following mutations: A203G/A204C/A781C/A782T/C783G, A202T/A204T/A781T/A782G.
In the present invention, base substitutions are named as follows: original base, position (i.e., the position in the W1, W2 or W3 nucleotide sequence), substituted base. Accordingly, replacing the original A at position 31 of SEQ ID No. 1 with G is named "A31G". Variants containing multiple changes are separated by a slash ("/").
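The base substitutions listed for (g1) and (h1) can be cross-checked against the amino acid substitutions named earlier (I11V, G37D, K68S, K68Y, N261L, N261C). The following Python sketch does this under stated assumptions: the coding sequence starts at position 1 of the SEQ ID sequence, so residue n is encoded by bases 3n-2 to 3n; the standard genetic code applies; and the wild-type codons (ATT, GGC, AAA, AAC) are inferred from the mutation labels and the residue identities, not read from the sequence listing. The function name is hypothetical.

```python
from itertools import product

# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def amino_acid_change(residue: int, wild_codon: str, base_label: str) -> str:
    """Translate a base-substitution label into an amino acid label.

    `residue` is the 1-based residue number; its codon is assumed to occupy
    nucleotide positions 3*residue-2 .. 3*residue of the coding sequence.
    """
    first_base = 3 * residue - 2
    codon = list(wild_codon)
    for part in base_label.split("/"):
        original, position, new = part[0], int(part[1:-1]), part[-1]
        offset = position - first_base
        if not 0 <= offset < 3:
            raise ValueError(f"{part} lies outside the codon of residue {residue}")
        if codon[offset] != original:
            raise ValueError(f"{part} does not match wild-type codon {wild_codon}")
        codon[offset] = new
    mutant_codon = "".join(codon)
    return f"{CODON_TABLE[wild_codon]}{residue}{CODON_TABLE[mutant_codon]}"


# (residue, inferred wild-type codon, base label from the text)
CASES = [
    (11, "ATT", "A31G/T33G"),
    (37, "GGC", "G110A/C111T"),
    (68, "AAA", "A203G/A204C"),
    (68, "AAA", "A202T/A204T"),
    (261, "AAC", "A781C/A782T/C783G"),
    (261, "AAC", "A781T/A782G"),
]

for residue, codon, label in CASES:
    print(label, "->", amino_acid_change(residue, codon, label))
```

With the inferred codons, each base label maps onto one of the amino acid substitutions named in (e1) and (f1), which supports reading "A31G" as "original A at position 31 replaced by G".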
在步骤(A)和步骤(B)中,所述催化反应的温度均可为25~37℃,如30~37℃,具体如30℃或37℃。In step (A) and step (B), the temperature of the catalytic reaction can be both 25-37°C, such as 30-37°C, specifically 30°C or 37°C.
在步骤(A)和步骤(B)中,所述催化反应的时间均可为4~48h,如24h。In step (A) and step (B), the time of the catalytic reaction can be 4-48h, such as 24h.
当所述酶A和所述酶B是以粗酶液、粗酶液冻干粉或纯酶的形式发生催化作用时,步骤(A)中,所述催化反应在如下(k1)所示缓冲液中进行;步骤(B)中,所述催化反应均在如下(k2)-(k3)任一所示缓冲液中进行;当所述酶A和所述酶B以共表达所述酶A和所述酶B的全细胞的形式发生催化作用时,步骤(A)和步骤(B)的所述催化反应均是在如下(k1)所示缓冲液中进行;When the enzyme A and the enzyme B are in the form of crude enzyme solution, lyophilized powder of crude enzyme solution or pure enzyme, catalysis occurs, in step (A), the catalytic reaction is buffered as shown in the following (k1) In step (B), the catalytic reaction is carried out in the buffer shown in any of the following (k2)-(k3); when the enzyme A and the enzyme B co-express the enzyme A When catalyzing with the whole cell form of the enzyme B, the catalytic reactions of step (A) and step (B) are both carried out in the buffer shown in the following (k1);
(k1)浓度为50~100mM,pH值为6.5~8.0的磷酸盐缓冲液;(k1) Phosphate buffer with a concentration of 50 to 100 mM and a pH of 6.5 to 8.0;
具体如:浓度为100mM,pH值为8.0的磷酸盐缓冲液。For example: phosphate buffer with a concentration of 100 mM and a pH of 8.0.
(k2)浓度为100mM~2M,pH为8.0~9.6的NH3·H2O或NH4Cl缓冲液;(k2) NH 3 ·H 2 O or NH 4 Cl buffer with a concentration of 100mM to 2M and a pH of 8.0 to 9.6;
具体如:浓度为1M,pH为8.7的NH3·H2O或NH4Cl缓冲液。For example: NH 3 ·H 2 O or NH 4 Cl buffer with a concentration of 1M and a pH of 8.7.
(k3)浓度为50~100mM,异丙胺的浓度为250mM-1M,pH值为7.5~8.5的磷酸盐缓冲液;(k3) Phosphate buffer with a concentration of 50-100 mM, a concentration of isopropylamine of 250 mM-1 M, and a pH of 7.5-8.5;
具体如:浓度为100mM、异丙胺的浓度为500mM、pH值为8.0的磷酸盐缓冲液。Specifically, a phosphate buffer with a concentration of 100 mM, a concentration of isopropylamine of 500 mM, and a pH of 8.0.
When enzyme A and enzyme B act as catalysts in the form of a crude enzyme solution, a lyophilized powder of the crude enzyme solution, or a pure enzyme, the concentration of enzyme A and of enzyme B in their respective reaction systems in step (A) and step (B) may be 0.1-10 g/L, for example 10 g/L. When enzyme A and enzyme B act as catalysts in the form of whole cells co-expressing enzyme A and enzyme B, step (A) and step (B) are completed in a single reaction system, and the concentration of the whole cells (co-expressing enzyme A and enzyme B) in that reaction system is 100 g/L (i.e., each liter of reaction system contains 100 g wet weight of the whole cells).
In the present invention, the coenzyme of enzyme A is specifically oxidized cofactor II (NADP+) or oxidized cofactor I (NAD+). When enzyme B is an amino acid dehydrogenase or a mutant thereof, the coenzyme of enzyme B is specifically oxidized cofactor I (NAD+); when enzyme B is a transaminase or a mutant thereof, the coenzyme of enzyme B is specifically pyridoxal phosphate (PLP).
In the present invention, when enzyme A and enzyme B act as catalysts in the form of a crude enzyme solution, a lyophilized powder of the crude enzyme solution, or a pure enzyme, the concentration of the coenzyme of enzyme A and of enzyme B in their respective reaction systems may be 0.5-3 mM (specifically, for example, 1 mM).
When enzyme A and enzyme B act as catalysts in the form of a crude enzyme solution, a lyophilized powder of the crude enzyme solution, or a pure enzyme, the reaction system of the catalytic reaction in step (A) contains acetone in addition to 1,2-butanediol, enzyme A, and its coenzyme.
Specifically, in step (A), the reaction system of the catalytic reaction is composed as follows: phosphate buffer with a concentration of 100 mM and a pH of 8.0; 1,2-butanediol at a final concentration of 20 mM; oxidized cofactor II (NADP+) or oxidized cofactor I (NAD+) at a final concentration of 1 mM; acetone at a final concentration of 5% (v/v); and the crude enzyme solution, lyophilized powder of the crude enzyme solution, or pure enzyme of enzyme A at a final concentration of 10 g/L.
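The step (A) composition above can be scaled to any reaction volume with simple arithmetic. The following is a minimal sketch, not part of the described method: the function name, the dictionary keys, and the 10 mL example volume are assumptions made for illustration, and the molar mass used for 1,2-butanediol (C4H10O2, about 90.12 g/mol) is a general chemical constant rather than a value stated in this document.

```python
# Illustrative helper (hypothetical names): scale the step (A) recipe
# (20 mM 1,2-butanediol, 1 mM cofactor, 5% v/v acetone, 10 g/L enzyme A)
# to a chosen reaction volume.

BUTANEDIOL_MOLAR_MASS = 90.12  # g/mol for C4H10O2 (not stated in the source)


def step_a_amounts(volume_ml: float) -> dict:
    """Return the amount of each component for a reaction of volume_ml."""
    if volume_ml <= 0:
        raise ValueError("volume_ml must be positive")
    diol_umol = 20.0 * volume_ml  # 20 mM equals 20 umol per mL
    return {
        "1,2-butanediol (umol)": diol_umol,
        "1,2-butanediol (mg)": diol_umol * BUTANEDIOL_MOLAR_MASS / 1000.0,
        "cofactor NADP+ or NAD+ (umol)": 1.0 * volume_ml,  # 1 mM
        "acetone (mL)": 0.05 * volume_ml,                  # 5% (v/v)
        "enzyme A preparation (mg)": 10.0 * volume_ml,     # 10 g/L equals 10 mg/mL
    }


if __name__ == "__main__":
    for name, amount in step_a_amounts(10.0).items():
        print(f"{name}: {amount:.3f}")
```

The remaining volume is made up with the 100 mM, pH 8.0 phosphate buffer; the mass of cofactor to weigh depends on the salt form used, so it is left in micromoles here.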
When enzyme A and enzyme B act as catalysts in the form of a crude enzyme solution, a lyophilized powder of the crude enzyme solution, or a pure enzyme, and enzyme B is an amino acid dehydrogenase or a mutant thereof, the reaction system of the catalytic reaction in step (B) contains glucose dehydrogenase and glucose in addition to 2-keto-1-butanol, enzyme B, and its coenzyme. Further, the reaction system also contains calcium carbonate.
Specifically, in step (B), when enzyme B is an amino acid dehydrogenase or a mutant thereof, the reaction system of the catalytic reaction may be composed as follows: NH3·H2O or NH4Cl buffer with a concentration of 1 M and a pH of 8.7; oxidized cofactor I (NAD+) at a final concentration of 1 mM; glucose dehydrogenase at a final concentration of 1 g/L; glucose at a final concentration of 100 mM; the crude enzyme solution, lyophilized powder of the crude enzyme solution, or pure enzyme of enzyme B at a final concentration of 10 g/L; and calcium carbonate at a final concentration of 5 g/L. When enzyme B is a transaminase or a mutant thereof, the reaction system of the catalytic reaction may be composed as follows: phosphate buffer with a phosphate concentration of 50-100 mM, an isopropylamine concentration of 500 mM, and a pH of 8.0; pyridoxal phosphate (PLP) at a final concentration of 1 mM; and the crude enzyme solution, lyophilized powder of the crude enzyme solution, or pure enzyme of enzyme B at a final concentration of 10 g/L.
When enzyme A and enzyme B act as catalysts in the form of whole cells co-expressing enzyme A and enzyme B, no coenzyme needs to be added to the reaction system; further, the reaction system also contains glucose.
Specifically, the reaction system may be composed as follows: phosphate buffer with a concentration of 100 mM and a pH of 8.0; 1,2-butanediol at a final concentration of 50 mM; 100 mM glucose; and the whole cells at a final concentration of 100 g/L (i.e., each liter of reaction system contains 100 g wet weight of the whole cells).
In the method, the chiral 2-amino-1-butanol is (S)-2-amino-1-butanol and/or (R)-2-amino-1-butanol.
The amino acid dehydrogenase and its mutants described above, as well as the ATA-117 transaminase from Codexis, the transaminase from Aspergillus terreus, the transaminase from Aspergillus fumigatus, the transaminase from Neosartorya fischeri, the transaminase from Gibberella zeae, and the transaminase from Mycobacterium vanbaalenii, are used for the synthesis of (S)-2-amino-1-butanol; the transaminase from Bacillus megaterium and the transaminase from Pseudomonas aeruginosa (P. aeruginosa) described above are used for the synthesis of (R)-2-amino-1-butanol.
The present invention also provides an enzyme system and related products.
The enzyme system provided by the present invention comprises enzyme A and enzyme B as described above. It may also comprise the respective coenzymes of enzyme A and enzyme B.
The related product is a nucleic acid molecule capable of expressing each enzyme in the enzyme system, or an expression cassette, recombinant vector, recombinant bacterium, or transgenic cell line containing the nucleic acid molecule.
The use of the enzyme system or the related product in the synthesis of chiral 2-amino-1-butanol also falls within the scope of protection of the present invention.
In the method for synthesizing chiral 2-amino-1-butanol provided by the present invention, a cofactor regeneration system is present. The cofactor regeneration system consists of glucose dehydrogenase catalyzing the oxidation of glucose, or alcohol dehydrogenase catalyzing the reduction of acetone or the oxidation of isopropanol, to regenerate the cofactor. In the method of the present invention, alcohol dehydrogenase or carbonyl reductase catalyzes the oxidation of 1,2-butanediol to 2-keto-1-butanol while NAD(P)+ is reduced to NAD(P)H; at the same time, the alcohol dehydrogenase catalyzes the reduction of acetone to isopropanol, NAD(P)H is reoxidized to NAD(P)+, and the regenerated NAD(P)+ re-enters the oxidation of 1,2-butanediol to 2-keto-1-butanol. The amino acid dehydrogenase or its mutant catalyzes the reductive amination of 2-keto-1-butanol to (S)-2-amino-1-butanol while NADH is oxidized to NAD+; at the same time, glucose dehydrogenase (GDH) catalyzes the oxidation of glucose to gluconic acid, NAD+ is reduced back to NADH, and the regenerated NADH re-enters the reductive amination of 2-keto-1-butanol to (S)-2-amino-1-butanol.
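The point of the regeneration system is that the nicotinamide cofactor is turned over rather than consumed. The sketch below illustrates this by summing the coupled half-reactions and checking that the cofactor terms cancel. It is a bookkeeping illustration only: the species names, the function name, and the exact stoichiometry (in particular writing the NADH-dependent conversion of 2-keto-1-butanol to (S)-2-amino-1-butanol as a reductive amination with NH3, and writing the glucose dehydrogenase product as gluconic acid after hydrolysis of the lactone) are assumptions made for the example.

```python
from collections import defaultdict
from typing import Dict

# A reaction maps species name to coefficient: negative means consumed,
# positive means produced.
Reaction = Dict[str, int]


def net_reaction(*reactions: Reaction) -> Reaction:
    """Sum several reactions and drop species whose coefficients cancel."""
    totals = defaultdict(int)
    for reaction in reactions:
        for species, coeff in reaction.items():
            totals[species] += coeff
    return {species: coeff for species, coeff in totals.items() if coeff != 0}


# Step (A): diol oxidation coupled to acetone reduction (assumed stoichiometry).
DIOL_OXIDATION = {"1,2-butanediol": -1, "NADP+": -1,
                  "2-keto-1-butanol": 1, "NADPH": 1, "H+": 1}
ACETONE_REDUCTION = {"acetone": -1, "NADPH": -1, "H+": -1,
                     "isopropanol": 1, "NADP+": 1}

# Step (B): reductive amination coupled to glucose oxidation (assumed stoichiometry).
REDUCTIVE_AMINATION = {"2-keto-1-butanol": -1, "NH3": -1, "NADH": -1, "H+": -1,
                       "(S)-2-amino-1-butanol": 1, "NAD+": 1, "H2O": 1}
GLUCOSE_OXIDATION = {"glucose": -1, "NAD+": -1, "H2O": -1,
                     "gluconic acid": 1, "NADH": 1, "H+": 1}

STEP_A_NET = net_reaction(DIOL_OXIDATION, ACETONE_REDUCTION)
STEP_B_NET = net_reaction(REDUCTIVE_AMINATION, GLUCOSE_OXIDATION)

if __name__ == "__main__":
    print("Step (A) net:", STEP_A_NET)
    print("Step (B) net:", STEP_B_NET)
```

In both net reactions the cofactor pair disappears, which is why a catalytic amount (1 mM) suffices against 20 mM substrate, while acetone and glucose are supplied in excess as the sacrificial co-substrates.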
The present invention provides a new green biosynthetic route that uses inexpensive 1,2-butanediol as the starting material and synthesizes chiral 2-amino-1-butanol, namely (S)-2-amino-1-butanol and/or (R)-2-amino-1-butanol, through multi-enzyme co-expression, cascade catalysis, or stepwise catalysis.
Description of the Drawings
Figure 1 shows the structural formulas of (S)-2-amino-1-butanol and (R)-2-amino-1-butanol.
Figure 2 is a reaction scheme of the biological preparation method of chiral 2-amino-1-butanol of the present invention. A: alcohol dehydrogenase or carbonyl reductase coupled with amino acid dehydrogenase to prepare (S)-2-amino-1-butanol. B: alcohol dehydrogenase or carbonyl reductase coupled with transaminase to prepare (S)-2-amino-1-butanol or (R)-2-amino-1-butanol.
Figure 3 shows the gas chromatography (GC) identification result for 2-keto-1-butanol.
Figure 4 shows the separation of 2-amino-1-butanol standards after derivatization with o-phthalaldehyde. A: racemic 2-amino-1-butanol; B: (S)-2-amino-1-butanol standard; C: (R)-2-amino-1-butanol standard.
Figure 5 shows liquid chromatograms of the reaction solutions. A: reaction solution of the negative control group; B: negative control + (S)-2-amino-1-butanol standard; C: negative control + (R)-2-amino-1-butanol standard; D: reaction solution of experimental group 1 (the product is (S)-2-amino-1-butanol); E: reaction solution of experimental group 2 (the product is (R)-2-amino-1-butanol). The negative control reaction system contains only the crude enzyme powder, enzyme solution, or whole cells of the expression host carrying the empty expression vector; all other components are the same as in the experimental groups.
Detailed Description
Unless otherwise specified, the experimental methods used in the following examples are conventional methods.
Unless otherwise specified, the materials, reagents, and the like used in the following examples are commercially available.
Example 1. Preparation of (S)-2-amino-1-butanol by alcohol dehydrogenase or carbonyl reductase coupled with the leucine dehydrogenase of Geobacillus stearothermophilus or its mutant
1. Preparation of engineered strains expressing recombinant alcohol dehydrogenase or its mutants and amino acid dehydrogenase or its mutants
The genes encoding the relevant enzymes were each obtained by whole-gene synthesis (codon-optimized for Escherichia coli as the host where necessary), and the synthesized genes were ligated into expression vectors. The expression vector may be any vector conventional in the art. The vector used in the present invention is specifically pET22b(+): the synthesized gene encoding the relevant enzyme was inserted between the NdeI and XhoI restriction sites of pET22b(+), and the recombinant vector was obtained after verification by sequencing. The corresponding gene mutants were obtained by site-directed mutagenesis.
The sequence-verified recombinant expression vector was transformed into a suitable microbial host. The host microorganism may be any host microorganism conventional in the art, provided that the recombinant expression vector replicates stably in it and the alcohol dehydrogenase and amino acid dehydrogenase genes it carries are expressed effectively. The preferred host microorganism is Escherichia coli, preferably E. coli BL21(DE3); transforming the recombinant expression plasmid into E. coli BL21(DE3) gives the genetically engineered strain of the present invention. The transformation method is a conventional method in the art, preferably electroporation or chemical transformation.
The enzymes and mutants involved in this example are listed in Table 1.
Table 1. Enzymes and mutants involved in Example 1
Note: W1 denotes the alcohol dehydrogenase LBADH from Lactobacillus brevis; W2 denotes the alcohol dehydrogenase from Geobacillus stearothermophilus; W3 denotes the alcohol dehydrogenase TbSADH from Thermoanaerobacter brockii; W4 denotes the alcohol dehydrogenase from Lactobacillus kefir; W5 denotes the alcohol dehydrogenase from Bacillaceae; W6 denotes the alcohol dehydrogenase from Leifsonia sp.; W7 denotes the alcohol dehydrogenase from Thermoanaerobacter wiegelii; W8 denotes the carbonyl reductase SSCR from Sporobolomyces salmonicolor; W9 denotes the leucine dehydrogenase from Geobacillus stearothermophilus. Wn-Mn denotes a mutant of Wn (n is a natural number). Protein substitutions are numbered from the N-terminus of the wild-type amino acid sequence of Wn; gene substitutions are numbered from the 5' end of the wild-type nucleotide sequence of Wn. In the table, amino acid substitutions are named as follows: original amino acid, position (i.e., the position in the Wn amino acid sequence), substituted amino acid.
Accordingly, the replacement of the original isoleucine (I) by valine (V) at position 11 of SEQ ID No. 2 is named "I11V". Base substitutions are named as follows: original base, position (i.e., the position in the Wn nucleotide sequence), substituted base. Accordingly, the replacement of the original G by A at position 31 of SEQ ID No. 1 is named "A31G". Variants containing multiple changes are separated by a slash ("/").
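The substitution nomenclature described in the note (original residue, 1-based position, replacement residue, with multiple changes joined by "/") is regular enough to parse mechanically. The following is a minimal sketch of such a parser; the function names, the toy 11-residue sequence, and the combined label "K2R/I11V" are hypothetical and are used only to illustrate the notation, not taken from the sequences or tables of this document.

```python
import re
from typing import List, NamedTuple


class Substitution(NamedTuple):
    original: str     # original amino acid or base (one-letter code)
    position: int     # 1-based position in the Wn sequence
    replacement: str  # substituted amino acid or base


_LABEL = re.compile(r"^([A-Z])(\d+)([A-Z])$")


def parse_variant(label: str) -> List[Substitution]:
    """Parse a label such as 'I11V' or 'K2R/I11V' into its substitutions."""
    substitutions = []
    for token in label.split("/"):
        match = _LABEL.match(token.strip())
        if match is None:
            raise ValueError(f"not a substitution label: {token!r}")
        substitutions.append(
            Substitution(match.group(1), int(match.group(2)), match.group(3))
        )
    return substitutions


def apply_variant(sequence: str, label: str) -> str:
    """Apply the substitutions to a sequence, checking each original residue."""
    residues = list(sequence)
    for sub in parse_variant(label):
        index = sub.position - 1  # the nomenclature counts from 1
        if index >= len(residues) or residues[index] != sub.original:
            raise ValueError(f"sequence does not have {sub.original} at {sub.position}")
        residues[index] = sub.replacement
    return "".join(residues)


if __name__ == "__main__":
    toy = "MKTAYIAKQRI"  # hypothetical sequence with I at position 11
    print(parse_variant("I11V"))
    print(apply_variant(toy, "K2R/I11V"))
```

Checking the original residue before substituting catches labels that were numbered against the wrong reference sequence, which matters here because protein labels count from the N-terminus and gene labels from the 5' end.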
2. Expression of recombinant alcohol dehydrogenase or its mutants and amino acid dehydrogenase or its mutants, and preparation of crude enzyme
The recombinant expression vector constructed in step 1 was transformed into E. coli BL21(DE3) competent cells and cultured at 37°C for 12-16 h. Once transformants had grown, a single transformant was inoculated into 5 mL of LB medium containing the corresponding antibiotic and cultured overnight (12-16 h) at 37°C and 220 rpm. The culture was then inoculated at 1% (v/v) into TB medium and grown at 37°C and 220 rpm to an OD600 of about 0.6; IPTG was added to a final concentration of 0.1 mM, and expression was induced at 20-37°C for 16 h. The cells were collected by centrifugation at 4000 rpm and 4°C for 10 min, resuspended and washed once with 100 mM phosphate buffer (pH 8.0), and then disrupted by sonication to prepare the lyophilized enzyme powder.
3. Preparation of chiral 2-amino-1-butanol
First-step reaction: the reaction system was assembled by adding, in order, phosphate buffer (100 mM, pH 8.0); 1,2-butanediol at a final concentration of 20 mM; oxidized cofactor II (NADP+) or oxidized cofactor I (NAD+) at a final concentration of 1 mM; acetone at a final concentration of 5% (v/v); and lyophilized powder or enzyme solution of the alcohol dehydrogenase or carbonyl reductase at a final concentration of 10 g/L. After the reaction system had reacted at 30°C for 24 h, the product was analyzed by gas chromatography (GC).
The gas chromatography (GC) conditions were as follows: injection volume: 2 μL; column: HP-5; split ratio: 20:1; split flow: 40 mL/min; temperature program: 40°C held for 5 min; ramp at 5°C/min to 60°C, held for 2 min; ramp at 30°C/min to 200°C, held for 2.333 min. Run time: 18 min.
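The stated 18-minute run time follows directly from the temperature program. As a quick consistency check, the sketch below adds up the hold and ramp segments; the function name and the tuple layout are illustrative assumptions, while the numbers are the ones given in the conditions above.

```python
from typing import Iterable, Tuple


def gc_run_time(initial_temp_c: float,
                initial_hold_min: float,
                ramps: Iterable[Tuple[float, float, float]]) -> float:
    """Total run time in minutes for an oven program.

    Each ramp is (rate in degC/min, target temperature in degC, hold in min).
    """
    total = initial_hold_min
    current = initial_temp_c
    for rate, target, hold in ramps:
        total += (target - current) / rate + hold  # ramp duration plus hold
        current = target
    return total


if __name__ == "__main__":
    # 40 degC for 5 min; 5 degC/min to 60 degC, hold 2 min;
    # 30 degC/min to 200 degC, hold 2.333 min.
    minutes = gc_run_time(40.0, 5.0, [(5.0, 60.0, 2.0), (30.0, 200.0, 2.333)])
    print(f"run time: {minutes:.2f} min")
```

The segments are 5 + 4 + 2 + 4.667 + 2.333 min, which agrees with the stated run time.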
The result is shown in Figure 3 and confirms that this step yields 2-keto-1-butanol.
Second-step reaction: with the 2-keto-1-butanol produced in the previous step as the substrate, the reaction system was assembled by adding, in order, NH3·H2O or NH4Cl buffer (pH 8.7, final concentration 1 M); oxidized cofactor I (NAD+) at a final concentration of 1 mM; glucose dehydrogenase at a final concentration of 1 g/L; glucose at a final concentration of 100 mM; lyophilized powder or enzyme solution of the Geobacillus stearothermophilus leucine dehydrogenase mutant at a final concentration of 10 g/L; and calcium carbonate at a final concentration of 5 g/L. The reaction system was reacted at 30°C for 24 h to give (S)-2-amino-1-butanol. The reaction solution was derivatized with o-phthalaldehyde and then analyzed by liquid chromatography.
The HPLC conditions were as follows: Agilent SB-Aq C18 column (4.6 mm × 250 mm, 5 μm); detection wavelength: 334 nm; column temperature: 35°C; flow rate: 1 mL/min; the gradient elution program is shown in Table 2.
Table 2. HPLC gradient elution program
Note: (1) % in the table denotes percentage by volume.
ee = (A_S - A_R)/(A_S + A_R) × 100%, where A_S is the peak area of (S)-2-amino-1-butanol obtained by liquid chromatography and A_R is the peak area of (R)-2-amino-1-butanol obtained by liquid chromatography.
Substrate conversion efficiency = C_converted/C_total × 100%, where C_converted is the number of moles of substrate converted into (S)- or (R)-2-amino-1-butanol in the reaction system and C_total is the total number of moles of substrate in the reaction system.
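The two formulas above translate directly into code. The following is a minimal sketch; the function names and the example peak areas and mole amounts are hypothetical values chosen only to show the arithmetic, not measurements from this document.

```python
def enantiomeric_excess(area_s: float, area_r: float) -> float:
    """ee in percent from the (S) and (R) peak areas; positive means (S) excess."""
    total = area_s + area_r
    if total <= 0:
        raise ValueError("peak areas must sum to a positive value")
    return (area_s - area_r) / total * 100.0


def conversion_efficiency(moles_converted: float, moles_total: float) -> float:
    """Substrate conversion efficiency in percent."""
    if moles_total <= 0:
        raise ValueError("moles_total must be positive")
    return moles_converted / moles_total * 100.0


if __name__ == "__main__":
    # Hypothetical peak areas: 995 for (S), 5 for (R).
    print(f"ee = {enantiomeric_excess(995.0, 5.0):.1f}%")
    # Hypothetical amounts: 10 umol product formed from 20 umol substrate.
    print(f"conversion = {conversion_efficiency(10.0, 20.0):.1f}%")
```

As written, the formula gives a positive ee for an (S)-enriched product; for an (R)-selective reaction the same function returns a negative value, so its magnitude is what is compared against the reported ee.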
The HPLC results are shown in Figure 4 and in panel D of Figure 5: the substrate conversion efficiency was 45-55% and the ee value was greater than 99%. Detailed results are given in Table 3.
Table 3. Results of preparing (S)-2-amino-1-butanol by alcohol dehydrogenase or carbonyl reductase coupled with the leucine dehydrogenase of Geobacillus stearothermophilus or its mutant
Note: Wn and Wn-Mn (n is a natural number) in the first-step and second-step reactions in the table have the same meanings as in Table 1.
Example 2. Preparation of (S)-2-amino-1-butanol by alcohol dehydrogenase or carbonyl reductase coupled with transaminase
1. Preparation of engineered strains expressing recombinant alcohol dehydrogenase or carbonyl reductase and transaminase
Carried out as described in step 1 of Example 1.
The enzymes and mutants involved in this example are listed in Table 4.
Table 4. Enzymes and mutants involved in Example 2
Note: W1-W8 have the same meanings as in Table 1. W10 denotes the ATA-117 transaminase from Codexis; W11 denotes the transaminase from Aspergillus terreus; W12 denotes the transaminase from Aspergillus fumigatus; W13 denotes the transaminase from Neosartorya fischeri; W14 denotes the transaminase from Gibberella zeae; W15 denotes the transaminase from Mycobacterium vanbaalenii. Wn-Mn denotes a mutant of Wn (n is a natural number). The numbering of protein and gene substitutions and the nomenclature are the same as in Table 1.
2. Expression of recombinant alcohol dehydrogenase or carbonyl reductase and transaminase, and preparation of crude enzyme
Carried out as described in step 2 of Example 1.
3. Preparation of chiral 2-amino-1-butanol
First-step reaction: the reaction system was assembled by adding, in order, phosphate buffer (100 mM, pH 8.0); 1,2-butanediol at a final concentration of 20 mM; oxidized cofactor II (NADP+) or oxidized cofactor I (NAD+) at a final concentration of 1 mM; acetone at a final concentration of 5% (v/v); and lyophilized powder or enzyme solution of the alcohol dehydrogenase or carbonyl reductase at a final concentration of 10 g/L. After the reaction system had reacted at 30°C for 24 h, the product was analyzed by gas chromatography (GC) under the conditions given in step 3 of Example 1. The result is shown in Figure 3 and confirms that this step yields 2-keto-1-butanol.
Second-step reaction: with the 2-keto-1-butanol produced in the previous step as the substrate, the reaction system was assembled by adding, in order, phosphate buffer with a phosphate concentration of 50-100 mM, an isopropylamine concentration of 500 mM, and a pH of 8.0; pyridoxal phosphate (PLP) at a final concentration of 1 mM; and, at a final concentration of 10 g/L, lyophilized powder or enzyme solution of the ATA-117 transaminase from Codexis, the transaminase from Aspergillus terreus, the transaminase from Aspergillus fumigatus, the transaminase from Neosartorya fischeri, the transaminase from Gibberella zeae, or the transaminase from Mycobacterium vanbaalenii. The reaction system was reacted at 30°C for 24 h to give (S)-2-amino-1-butanol. The reaction solution was derivatized with o-phthalaldehyde and then analyzed by liquid chromatography under the HPLC conditions given in step 3 of Example 1.
The ee value and the substrate conversion efficiency were calculated as described in step 3 of Example 1.
The HPLC results are shown in Figure 4 and in panel D of Figure 5: the substrate conversion efficiency was 45-55% and the ee value was greater than 99%. Detailed results are given in Table 5.
Table 5. Results of preparing (S)-2-amino-1-butanol by alcohol dehydrogenase or its mutant coupled with transaminase
Note: Wn and Wn-Mn (n is a natural number) in the first-step and second-step reactions in the table have the same meanings as in Table 4.
Example 3. Preparation of (R)-2-amino-1-butanol by alcohol dehydrogenase or carbonyl reductase coupled with transaminase
1. Preparation of engineered strains expressing recombinant alcohol dehydrogenase and transaminase
Carried out as described in step 1 of Example 1.
The enzymes and mutants involved in this example are listed in Table 6.
Table 6. Enzymes and mutants involved in Example 3
Note: W1-W8 have the same meanings as in Table 1. W16 denotes the transaminase from Bacillus megaterium; W17 denotes the transaminase from Pseudomonas aeruginosa (P. aeruginosa). Wn-Mn denotes a mutant of Wn (n is a natural number). The numbering of protein and gene substitutions and the nomenclature are the same as in Table 1.
2. Expression of recombinant alcohol dehydrogenase and transaminase, and preparation of crude enzyme
Carried out as described in step 2 of Example 1.
3. Preparation of chiral 2-amino-1-butanol
First-step reaction: the reaction system was assembled by adding, in order, phosphate buffer (100 mM, pH 8.0); 1,2-butanediol at a final concentration of 20 mM; oxidized cofactor II (NADP+) or oxidized cofactor I (NAD+) at a final concentration of 1 mM; acetone at a final concentration of 5% (v/v); and lyophilized powder or enzyme solution of the alcohol dehydrogenase or carbonyl reductase at a final concentration of 10 g/L. After the reaction system had reacted at 30°C for 24 h, the product was analyzed by gas chromatography (GC) under the conditions given in step 3 of Example 1. The result is shown in Figure 3 and confirms that this step yields 2-keto-1-butanol.
Second-step reaction: with the 2-keto-1-butanol produced in the previous step as the substrate, the reaction system was assembled by adding, in order, phosphate buffer with a phosphate concentration of 100 mM, an isopropylamine concentration of 500 mM, and a pH of 8.0; pyridoxal phosphate (PLP) at a final concentration of 1 mM; and, at a final concentration of 10 g/L, lyophilized powder or enzyme solution of the transaminase from Bacillus megaterium or the transaminase from Pseudomonas aeruginosa (P. aeruginosa PAO2). The reaction system was reacted at 30°C for 24 h to give (R)-2-amino-1-butanol. The reaction solution was derivatized with o-phthalaldehyde and then analyzed by liquid chromatography under the HPLC conditions given in step 3 of Example 1.
The HPLC results are shown in Figure 4 and in panel E of Figure 5: the substrate conversion efficiency was 45-50% and the ee value was greater than 99%. Detailed results are given in Table 7.
Table 7. Results of preparing (R)-2-amino-1-butanol by alcohol dehydrogenase coupled with transaminase
Note: Wn and Wn-Mn (n is a natural number) in the first-step and second-step reactions in the table have the same meanings as in Table 6.
实施例4、酶A和酶B共表达全细胞制备(S)-2-氨基-1-丁醇和(R)-2-氨基-1-丁醇Example 4. Co-expression of enzyme A and enzyme B Whole cell preparation of (S)-2-amino-1-butanol and (R)-2-amino-1-butanol
一、酶A和酶B共表达工程菌的制备1. Preparation of co-expression engineering bacteria of enzyme A and enzyme B
将相关酶的编码基因分别进行全基因合成(根据需要以大肠杆菌为宿主进行密码子优化),将合成的基因连接于各种表达载体上构建而成。所述的表达载体为本领域常规的各种载体。本发明所述载体具体pETDuet-1,将全基因合成后酶A的DNA片段插入到pETDuet-1的酶切位点EcoRⅠ和HindⅢ之间,将全基因合成后酶B的DNA片段插入到pETDuet-1的酶切位点NdeⅠ和XhoⅠ之间。将重组载体转入大肠杆菌DH5α感受态细胞;挑取阳性转化子并测序鉴定后,得到正确的重组表达载体。The encoding genes of the relevant enzymes are separately synthesized by total gene synthesis (codon optimization is carried out using Escherichia coli as the host as required), and the synthesized genes are connected to various expression vectors to construct. The expression vectors are various conventional vectors in the field. The vector of the present invention is specifically pETDuet-1. The DNA fragment of enzyme A after full gene synthesis is inserted between the restriction sites EcoRI and HindIII of pETDuet-1, and the DNA fragment of enzyme B after full gene synthesis is inserted into pETDuet- 1 between the enzyme cleavage sites NdeI and XhoI. The recombinant vector was transferred into Escherichia coli DH5α competent cells; positive transformants were picked and sequenced and identified, and the correct recombinant expression vector was obtained.
The sequence-verified recombinant expression vector was then transformed into a suitable microbial host. The host may be any host microorganism conventional in the art, provided that the recombinant expression vector replicates stably in it and the alcohol dehydrogenase and amino acid dehydrogenase genes it carries are expressed effectively. The host microorganism is preferably Escherichia coli, more preferably E. coli BL21(DE3); transforming the recombinant expression plasmid into E. coli BL21(DE3) gives the genetically engineered strain of the present invention. The transformation method is any method conventional in the art, preferably electroporation or chemical transformation.
The enzymes and mutants used in this example are listed in Table 8.
Table 8. Enzymes and mutants used in Example 4
Note: W1-W17 have the same meanings as in Tables 1, 4 and 6. Wn-Mn denotes a mutant of Wn (n is a natural number). The numbering of protein and gene substitutions and the nomenclature are the same as in Table 1.
2. Co-expression of enzyme A and enzyme B
The recombinant expression vector constructed in step 1 was transformed into competent E. coli BL21(DE3) cells, which were cultured at 37 °C for 12-16 h. Once transformants had grown, a single transformant was inoculated into 5 mL of LB medium containing the appropriate antibiotic and cultured overnight (12-16 h) at 37 °C and 220 rpm. The overnight culture was then inoculated at 1% (v/v) into TB medium and grown at 37 °C and 220 rpm to an OD600 of about 0.6. IPTG was added to a final concentration of 0.1 mM, expression was induced at 20-37 °C for 16 h, and the cells were collected by centrifugation at 4000 rpm and 4 °C for 10 min.
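The inoculation ratio and inducer concentration above translate into pipetting volumes once a culture volume is chosen. The sketch below illustrates the arithmetic only; the 500 mL culture volume and the 100 mM IPTG stock are assumptions for the example and are not stated in the source.

```python
def inoculum_volume_ml(culture_volume_ml, ratio_percent=1.0):
    """Volume of overnight culture for a given inoculation ratio (% v/v)."""
    return culture_volume_ml * ratio_percent / 100.0


def stock_volume_ml(final_conc_mM, stock_conc_mM, culture_volume_ml):
    """Volume of stock solution giving the desired final concentration (C1*V1 = C2*V2).

    The small volume added by the stock itself is neglected.
    """
    return final_conc_mM * culture_volume_ml / stock_conc_mM


# Assumed values (not from the source): 500 mL of TB medium, 100 mM IPTG stock.
tb_volume_ml = 500.0
iptg_stock_mM = 100.0

print(f"Overnight culture to add: {inoculum_volume_ml(tb_volume_ml):.2f} mL")
print(f"IPTG stock to add: {stock_volume_ml(0.1, iptg_stock_mM, tb_volume_ml):.2f} mL")
```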
3. Preparation of chiral 2-amino-1-butanol
The following were added to the reaction system in sequence: phosphate buffer (100 mM, pH 8.0), 1,2-butanediol (final concentration 50 mM), glucose (100 mM), and whole cells co-expressing enzyme A and enzyme B (final concentration 100 g/L, wet cell weight). The reaction system was incubated at 30 °C for 24 h to give (R)- or (S)-2-amino-1-butanol. The broth was derivatized with o-phthalaldehyde and analyzed by liquid chromatography. The HPLC conditions are as described in step 3 of Example 1.
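As an illustration of how the stated final concentrations scale to a bench reaction, the sketch below computes the amount of each component for a chosen volume. The 10 mL reaction volume is an assumption for the example; the molar masses are standard values for 1,2-butanediol and anhydrous glucose, not figures taken from the source.

```python
# Standard molar masses in g/mol (not stated in the source).
MOLAR_MASS = {
    "1,2-butanediol": 90.12,
    "glucose": 180.16,
}


def mass_mg(final_conc_mM, volume_ml, molar_mass_g_per_mol):
    """Mass in mg needed to reach a final molar concentration in a given volume."""
    moles = final_conc_mM / 1000.0 * volume_ml / 1000.0
    return moles * molar_mass_g_per_mol * 1000.0


def wet_cells_g(cell_load_g_per_l, volume_ml):
    """Wet cell mass in g for a given cell loading (g/L)."""
    return cell_load_g_per_l * volume_ml / 1000.0


# Assumed reaction volume for the example.
reaction_volume_ml = 10.0

print(f"1,2-butanediol: {mass_mg(50, reaction_volume_ml, MOLAR_MASS['1,2-butanediol']):.2f} mg")
print(f"glucose: {mass_mg(100, reaction_volume_ml, MOLAR_MASS['glucose']):.2f} mg")
print(f"wet cells: {wet_cells_g(100, reaction_volume_ml):.2f} g")
```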
The ee value and the substrate conversion efficiency were calculated as described in step 3 of Example 1.
For the HPLC results, see traces D-E in Figures 4 and 5. The substrate conversion efficiency was 15-30% and the ee value was greater than 99%. Detailed results are given in Table 9.
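The exact formulas are defined in Example 1, which is outside this section. The sketch below therefore uses the conventional definitions, which are an assumption here: ee as the difference between the two enantiomer peak areas divided by their sum, and conversion as product formed relative to the initial substrate. The peak areas and concentrations are made-up illustration values, not data from the patent.

```python
def enantiomeric_excess(area_major, area_minor):
    """Conventional ee (%) from the HPLC peak areas of the two enantiomers."""
    total = area_major + area_minor
    if total <= 0:
        raise ValueError("peak areas must sum to a positive value")
    return 100.0 * (area_major - area_minor) / total


def conversion_percent(product_mM, initial_substrate_mM):
    """Substrate conversion (%) as product formed over initial substrate."""
    if initial_substrate_mM <= 0:
        raise ValueError("initial substrate concentration must be positive")
    return 100.0 * product_mM / initial_substrate_mM


# Illustration values only (hypothetical peak areas and concentrations).
print(f"ee = {enantiomeric_excess(99.6, 0.4):.1f}%")
print(f"conversion = {conversion_percent(12.5, 50.0):.1f}%")
```

With a 50 mM substrate loading, the reported 15-30% conversion range corresponds to 7.5-15 mM of product under this definition.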
Table 9. Results of whole-cell catalytic preparation of chiral 2-amino-1-butanol with co-expressed enzymes A and B
Note: In the table, Wn and Wn-Mn (n is a natural number) have the same meanings as in Table 8.
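The sequence listing that follows gives each enzyme as a DNA entry and a protein entry. A reader who wants to confirm that a pair is consistent can translate the DNA with the standard genetic code and compare the result with the protein entry. The sketch below is one way to do that; the function names are hypothetical, and the example input is the first line of SEQ ID NO: 1, whose translation should match the start of SEQ ID NO: 2.

```python
# Standard genetic code, codons ordered with bases T, C, A, G at each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(dna):
    """Keep only A/C/G/T so that listing spaces and position numbers are dropped."""
    return "".join(ch for ch in dna.upper() if ch in "ACGT")


def translate(dna):
    """Translate from the first base in frame 1, stopping at the first stop codon."""
    seq = clean(dna)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First line of SEQ ID NO: 1 as printed in the listing (60 nucleotides).
seq1_line1 = "agcaatcgcc tggatggcaa agtggcgatt attaccggcg gtaccctggg tattggctta 60"
print(translate(seq1_line1))
```

The declared lengths can be checked the same way: an entry of 756 nucleotides holds 252 codons, which is 251 residues plus a stop codon, matching the 251-residue protein entry.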
<110> Tianjin Institute of Industrial Biotechnology, Chinese Academy of Sciences
<120> A synthetic method of chiral 2-amino-1-butanol
<130> GNCLN171921
<160> 34
<170> PatentIn version 3.5
<210> 1
<211> 756
<212> DNA
<213> Artificial sequence
<220>
<223>
<400> 1
agcaatcgcc tggatggcaa agtggcgatt attaccggcg gtaccctggg tattggctta 60
gcgattgcga ccaaatttgt ggaagaaggc gcgaaagtga tgattaccgg ccgccatagc 120
gatgttggcg aaaaagcggc gaaaagcgtt ggtaccccgg atcagattca gttttttcag 180
cacgatagca gcgatgaaga tggctggacc aaactgtttg atgcgaccga aaaagcgttt 240
ggcccggtga gcaccttagt taacaatgcg ggcatcgcgg tgaacaaaag cgtggaagaa 300
accaccacag cggaatggcg caaattactg gcggtgaacc tggatggcgt gttttttggt 360
acccgcctgg gcattcagcg catgaaaaac aaaggcctgg gcgcgagcat tattaacatg 420
agcagcattg aaggctttgt gggcgatcct agcttaggtg cgtataacgc gagcaaaggc 480
gcggttcgca ttatgagcaa aagcgcggcg ttagattgtg cgctgaagga ttatgatgtg 540
cgcgtgaaca ctgttcatcc gggctatatt aaaaccccgc tggtggatga tttaccgggt 600
gcggaagaag ctatgagcca gcgtaccaaa accccgatgg gccatattgg cgaaccgaac 660
gatattgcgt atatctgcgt gtatctggcg agcaacgaaa gcaaatttgc gaccggcagc 720
gaatttgttg tggatggcgg ctataccgcg caataa 756
<210> 2
<211> 251
<212> PRT
<213> Lactobacillus brevis
<400> 2
Ser Asn Arg Leu Asp Gly Lys Val Ala Ile Ile Thr Gly Gly Thr Leu
1 5 10 15
Gly Ile Gly Leu Ala Ile Ala Thr Lys Phe Val Glu Glu Gly Ala Lys
20 25 30
Val Met Ile Thr Gly Arg His Ser Asp Val Gly Glu Lys Ala Ala Lys
35 40 45
Ser Val Gly Thr Pro Asp Gln Ile Gln Phe Phe Gln His Asp Ser Ser
50 55 60
Asp Glu Asp Gly Trp Thr Lys Leu Phe Asp Ala Thr Glu Lys Ala Phe
65 70 75 80
Gly Pro Val Ser Thr Leu Val Asn Asn Ala Gly Ile Ala Val Asn Lys
85 90 95
Ser Val Glu Glu Thr Thr Thr Ala Glu Trp Arg Lys Leu Leu Ala Val
100 105 110
Asn Leu Asp Gly Val Phe Phe Gly Thr Arg Leu Gly Ile Gln Arg Met
115 120 125
Lys Asn Lys Gly Leu Gly Ala Ser Ile Ile Asn Met Ser Ser Ile Glu
130 135 140
Gly Phe Val Gly Asp Pro Ser Leu Gly Ala Tyr Asn Ala Ser Lys Gly
145 150 155 160
Ala Val Arg Ile Met Ser Lys Ser Ala Ala Leu Asp Cys Ala Leu Lys
165 170 175
Asp Tyr Asp Val Arg Val Asn Thr Val His Pro Gly Tyr Ile Lys Thr
180 185 190
Pro Leu Val Asp Asp Leu Pro Gly Ala Glu Glu Ala Met Ser Gln Arg
195 200 205
Thr Lys Thr Pro Met Gly His Ile Gly Glu Pro Asn Asp Ile Ala Tyr
210 215 220
Ile Cys Val Tyr Leu Ala Ser Asn Glu Ser Lys Phe Ala Thr Gly Ser
225 230 235 240
Glu Phe Val Val Asp Gly Gly Tyr Thr Ala Gln
245 250
<210> 3
<211> 1014
<212> DNA
<213> Geobacillus stearothermophilus
<400> 3
atgaaagctg cagttgtgga acaatttaaa aagccgttac aagtgaaaga agtggaaaaa 60
cctaagatct catacgggga agtattagtg cgcatcaaag cgtgtggggt atgccataca 120
gacttgcatg ccgcacatgg cgactggcct gtaaagccta aactgcctct cattcctggc 180
catgaaggcg tcggtgtaat tgaagaagta ggtcctgggg taacacattt aaaagttgga 240
gatcgcgtag gtatcccttg gctttattcg gcgtgcggtc attgtgacta ttgcttaagc 300
ggacaagaaa cattatgcga acgtcaacaa aacgctggct attccgtcga tggtggttat 360
gctgaatatt gccgtgctgc agccgattat gtcgtaaaaa ttcctgataa cttatcgttt 420
gaagaagccg ctccaatctt ttgcgctggt gtaacaacat ataaagcgct caaagtaaca 480
ggcgcaaaac caggtgaatg ggtagccatt tacggtatcg gcgggcttgg acatgtcgca 540
gtccaatacg caaaggcgat ggggttaaac gtcgttgctg tcgatttagg tgatgaaaaa 600
cttgagcttg ctaaacaact tggtgcagat cttgtcgtca atccgaaaca tgatgatgca 660
gcacaatgga taaaagaaaa agtgggcggt gtgcatgcga ctgtcgtcac agctgtttca 720
aaagccgcgt tcgaatcagc ctacaaatcc attcgtcgcg gtggtgcttg cgtactcgtc 780
ggattaccgc cggaagaaat acctattcca attttcgata cagtattaaa tggagtaaaa 840
attattggtt ctatcgttgg tacgcgcaaa gacttacaag aggcacttca atttgcagca 900
gaaggaaaag taaaaacaat tgtcgaagtg caaccgcttg aaaacattaa cgacgtattc 960
gatcgtatgt taaaagggca aattaacggc cgcgtcgtgt taaaagtaga ttaa 1014
<210> 4
<211> 337
<212> PRT
<213> Geobacillus stearothermophilus
<400> 4
Met Lys Ala Ala Val Val Glu Gln Phe Lys Lys Pro Leu Gln Val Lys
1 5 10 15
Glu Val Glu Lys Pro Lys Ile Ser Tyr Gly Glu Val Leu Val Arg Ile
20 25 30
Lys Ala Cys Gly Val Cys His Thr Asp Leu His Ala Ala His Gly Asp
35 40 45
Trp Pro Val Lys Pro Lys Leu Pro Leu Ile Pro Gly His Glu Gly Val
50 55 60
Gly Val Ile Glu Glu Val Gly Pro Gly Val Thr His Leu Lys Val Gly
65 70 75 80
Asp Arg Val Gly Ile Pro Trp Leu Tyr Ser Ala Cys Gly His Cys Asp
85 90 95
Tyr Cys Leu Ser Gly Gln Glu Thr Leu Cys Glu Arg Gln Gln Asn Ala
100 105 110
Gly Tyr Ser Val Asp Gly Gly Tyr Ala Glu Tyr Cys Arg Ala Ala Ala
115 120 125
Asp Tyr Val Val Lys Ile Pro Asp Asn Leu Ser Phe Glu Glu Ala Ala
130 135 140
Pro Ile Phe Cys Ala Gly Val Thr Thr Tyr Lys Ala Leu Lys Val Thr
145 150 155 160
Gly Ala Lys Pro Gly Glu Trp Val Ala Ile Tyr Gly Ile Gly Gly Leu
165 170 175
Gly His Val Ala Val Gln Tyr Ala Lys Ala Met Gly Leu Asn Val Val
180 185 190
Ala Val Asp Leu Gly Asp Glu Lys Leu Glu Leu Ala Lys Gln Leu Gly
195 200 205
Ala Asp Leu Val Val Asn Pro Lys His Asp Asp Ala Ala Gln Trp Ile
210 215 220
Lys Glu Lys Val Gly Gly Val His Ala Thr Val Val Thr Ala Val Ser
225 230 235 240
Lys Ala Ala Phe Glu Ser Ala Tyr Lys Ser Ile Arg Arg Gly Gly Ala
245 250 255
Cys Val Leu Val Gly Leu Pro Pro Glu Glu Ile Pro Ile Pro Ile Phe
260 265 270
Asp Thr Val Leu Asn Gly Val Lys Ile Ile Gly Ser Ile Val Gly Thr
275 280 285
Arg Lys Asp Leu Gln Glu Ala Leu Gln Phe Ala Ala Glu Gly Lys Val
290 295 300
Lys Thr Ile Val Glu Val Gln Pro Leu Glu Asn Ile Asn Asp Val Phe
305 310 315 320
Asp Arg Met Leu Lys Gly Gln Ile Asn Gly Arg Val Val Leu Lys Val
325 330 335
Asp
<210> 5
<211> 1059
<212> DNA
<213> Thermoanaerobacter brockii
<400> 5
atgaaaggtt ttgcaatgct cagtatcggt aaagttggct ggattgagaa ggaaaagcct 60
gctcctggcc catttgatgc tattgtaaga cctctagctg tggccccttg cacttcggac 120
attcataccg tttttgaagg agccattggc gaaagacata acatgatact cggtcacgaa 180
gctgtaggtg aagtagttga agtaggtagt gaggtaaaag attttaaacc tggtgatcgc 240
gttgttgtgc cagctattac ccctgattgg cggacctctg aagtacaaag aggatatcac 300
cagcactccg gtggaatgct ggcaggctgg aaattttcga atgtaaaaga tggtgttttt 360
ggtgaatttt ttcatgtgaa tgatgctgat atgaatttag cacatctgcc taaagaaatt 420
ccattggaag ctgcagttat gattcccgat atgatgacca ctggttttca cggagctgaa 480
ctggcagata tagaattagg tgcgacggta gcagttttgg gtattggccc agtaggtctt 540
atggcagtcg ctggtgccaa attgcgtgga gccggaagaa ttattgccgt aggcagtaga 600
ccagtttgtg tagatgctgc aaaatactat ggagctactg atattgtaaa ctataaagat 660
ggtcctatcg aaagtcagat tatgaatcta actgaaggca aaggtgtcga tgctgccatc 720
atcgctggag gaaatgctga cattatggct acagcagtta agattgttaa acctggtggc 780
accatcgcta atgtaaatta ttttggcgaa ggagaggttt tgcctgttcc tcgtcttgaa 840
tggggttgcg gcatggctca taaaactata aaaggcgggc tatgccccgg tggacgtcta 900
agaatggaaa gactgattga ccttgttttt tataagcgtg tcgatccttc taagctcgtc 960
actcacgttt tccggggatt tgacaatatt gaaaaagcct ttatgttgat gaaagacaaa 1020
ccaaaagacc taatcaaacc tgttgtaata ttagcataa 1059
<210> 6
<211> 352
<212> PRT
<213> Thermoanaerobacter brockii
<400> 6
Met Lys Gly Phe Ala Met Leu Ser Ile Gly Lys Val Gly Trp Ile Glu
1 5 10 15
Lys Glu Lys Pro Ala Pro Gly Pro Phe Asp Ala Ile Val Arg Pro Leu
20 25 30
Ala Val Ala Pro Cys Thr Ser Asp Ile His Thr Val Phe Glu Gly Ala
35 40 45
Ile Gly Glu Arg His Asn Met Ile Leu Gly His Glu Ala Val Gly Glu
50 55 60
Val Val Glu Val Gly Ser Glu Val Lys Asp Phe Lys Pro Gly Asp Arg
65 70 75 80
Val Val Val Pro Ala Ile Thr Pro Asp Trp Arg Thr Ser Glu Val Gln
85 90 95
Arg Gly Tyr His Gln His Ser Gly Gly Met Leu Ala Gly Trp Lys Phe
100 105 110
Ser Asn Val Lys Asp Gly Val Phe Gly Glu Phe Phe His Val Asn Asp
115 120 125
Ala Asp Met Asn Leu Ala His Leu Pro Lys Glu Ile Pro Leu Glu Ala
130 135 140
Ala Val Met Ile Pro Asp Met Met Thr Thr Gly Phe His Gly Ala Glu
145 150 155 160
Leu Ala Asp Ile Glu Leu Gly Ala Thr Val Ala Val Leu Gly Ile Gly
165 170 175
Pro Val Gly Leu Met Ala Val Ala Gly Ala Lys Leu Arg Gly Ala Gly
180 185 190
Arg Ile Ile Ala Val Gly Ser Arg Pro Val Cys Val Asp Ala Ala Lys
195 200 205
Tyr Tyr Gly Ala Thr Asp Ile Val Asn Tyr Lys Asp Gly Pro Ile Glu
210 215 220
Ser Gln Ile Met Asn Leu Thr Glu Gly Lys Gly Val Asp Ala Ala Ile
225 230 235 240
Ile Ala Gly Gly Asn Ala Asp Ile Met Ala Thr Ala Val Lys Ile Val
245 250 255
Lys Pro Gly Gly Thr Ile Ala Asn Val Asn Tyr Phe Gly Glu Gly Glu
260 265 270
Val Leu Pro Val Pro Arg Leu Glu Trp Gly Cys Gly Met Ala His Lys
275 280 285
Thr Ile Lys Gly Gly Leu Cys Pro Gly Gly Arg Leu Arg Met Glu Arg
290 295 300
Leu Ile Asp Leu Val Phe Tyr Lys Arg Val Asp Pro Ser Lys Leu Val
305 310 315 320
Thr His Val Phe Arg Gly Phe Asp Asn Ile Glu Lys Ala Phe Met Leu
325 330 335
Met Lys Asp Lys Pro Lys Asp Leu Ile Lys Pro Val Val Ile Leu Ala
340 345 350
<210> 7
<211> 759
<212> DNA
<213> Lactobacillus kefiri
<400> 7
atgactgatc gtttaaaagg caaagtagca attgtaactg gcggtacctt gggaattggc 60
ttggcaatcg ctgataagtt tgttgaagaa ggcgcaaagg ttgttattac cggccgtcac 120
gctgatgtag gtgaaaaagc tgccaaatca atcggcggca cagacgttat ccgttttgtc 180
caacacgatg cttctgatga agccggctgg actaagttgt ttgatacgac tgaagaagca 240
tttggcccag ttaccacggt tgtcaacaat gccggaattg cggtcagcaa gagtgttgaa 300
gataccacaa ctgaagaatg gcgcaagctg ctctcagtta acttggatgg tgtcttcttc 360
ggtacccgtc ttggaatcca acgtatgaag aataaaggac tcggagcatc aatcatcaat 420
atgtcatcta tcgaaggttt tgttggtgat ccaactctgg gtgcatacaa cgcttcaaaa 480
ggtgctgtca gaattatgtc taaatcagct gccttggatt gcgctttgaa ggactacgat 540
gttcgggtta acactgttca tccaggttat atcaagacac cattggttga cgatcttgaa 600
ggggcagaag aaatgatgtc acagcggacc aagacaccaa tgggtcatat cggtgaacct 660
aacgatatcg cttggatctg tgtttacctg gcatctgacg aatctaaatt tgccactggt 720
gcagaattcg ttgtcgatgg tggatacact gctcaataa 759
<210> 8
<211> 252
<212> PRT
<213> Lactobacillus kefiri
<400> 8
Met Thr Asp Arg Leu Lys Gly Lys Val Ala Ile Val Thr Gly Gly Thr
1 5 10 15
Leu Gly Ile Gly Leu Ala Ile Ala Asp Lys Phe Val Glu Glu Gly Ala
20 25 30
Lys Val Val Ile Thr Gly Arg His Ala Asp Val Gly Glu Lys Ala Ala
35 40 45
Lys Ser Ile Gly Gly Thr Asp Val Ile Arg Phe Val Gln His Asp Ala
50 55 60
Ser Asp Glu Ala Gly Trp Thr Lys Leu Phe Asp Thr Thr Glu Glu Ala
65 70 75 80
Phe Gly Pro Val Thr Thr Val Val Asn Asn Ala Gly Ile Ala Val Ser
85 90 95
Lys Ser Val Glu Asp Thr Thr Thr Glu Glu Trp Arg Lys Leu Leu Ser
100 105 110
Val Asn Leu Asp Gly Val Phe Phe Gly Thr Arg Leu Gly Ile Gln Arg
115 120 125
Met Lys Asn Lys Gly Leu Gly Ala Ser Ile Ile Asn Met Ser Ser Ile
130 135 140
Glu Gly Phe Val Gly Asp Pro Thr Leu Gly Ala Tyr Asn Ala Ser Lys
145 150 155 160
Gly Ala Val Arg Ile Met Ser Lys Ser Ala Ala Leu Asp Cys Ala Leu
165 170 175
Lys Asp Tyr Asp Val Arg Val Asn Thr Val His Pro Gly Tyr Ile Lys
180 185 190
Thr Pro Leu Val Asp Asp Leu Glu Gly Ala Glu Glu Met Met Ser Gln
195 200 205
Arg Thr Lys Thr Pro Met Gly His Ile Gly Glu Pro Asn Asp Ile Ala
210 215 220
Trp Ile Cys Val Tyr Leu Ala Ser Asp Glu Ser Lys Phe Ala Thr Gly
225 230 235 240
Ala Glu Phe Val Val Asp Gly Gly Tyr Thr Ala Gln
245 250
<210> 9
<211> 1023
<212> DNA
<213> Bacillaceae
<400> 9
atgaaagctg cagtagtaga gcaatttaag gaaccattaa aaattaaaga agtggaaaag 60
ccatccattt catatggcga agtattagtc cgcattaaag catgcggtgt atgccatacg 120
gacttgcacg ccgctcatgg cgattggcca gtaaaaccaa aacttccttt aatccctggc 180
catgaaggag tcggaattgt tgaagaagtc ggtccggggg taacccattt aaaagtggga 240
gaccgcgttg gaattccttg gttatattct gcttgcggcc attgcgaata ttgtttaagc 300
ggacaagaga cattatgtga acatcaagaa aacgccggct actcagtcga cgggggttat 360
gcagaatatt gcagagctgc ggcagactat gtggtgaaaa ttcctgacaa cttgtcgttt 420
gaagaagctg ctcctatttt ctgcgccgga gttactactt ataaagcgtt aaaagtcaca 480
ggtacaaaac cgggagaatg ggtagcgatc tatggcatcg gcggccttgg acatgttgcc 540
gtccagtatg cgaaagcgat ggggcttcat gttgttgcag tggatatcgg cgatgagaaa 600
ctggaacttg caaaagagct tggcgccgat cttgttgtaa atcctgcaaa agaaaatgcg 660
gcacaattta tgaaagagaa agtcggcgga gtacacgcgg ctgttgtgac agctgtatct 720
aaacctgctt ttcaatctgc gtacaattct atccgcagag gcggcacgtg cgtgcttgtc 780
ggattaccgc cggaagaaat gcctattcca atctttgata cggtattaaa cggaattaaa 840
attatcggtt ccattgtcgg cacgcggaaa gacttgcaag aagcgcttca gttcgctgca 900
gaaggtaaag taaaaaccat tattgaagtg caacctcttg aaaaaattaa cgaagtattt 960
gacagaatgc taaaaggaga aattaacgga cgggttgttt taacgttaga aaataataat 1020
taa 1023
<210> 10
<211> 340
<212> PRT
<213> Bacillaceae
<400> 10
Met Lys Ala Ala Val Val Glu Gln Phe Lys Glu Pro Leu Lys Ile Lys
1 5 10 15
Glu Val Glu Lys Pro Ser Ile Ser Tyr Gly Glu Val Leu Val Arg Ile
20 25 30
Lys Ala Cys Gly Val Cys His Thr Asp Leu His Ala Ala His Gly Asp
35 40 45
Trp Pro Val Lys Pro Lys Leu Pro Leu Ile Pro Gly His Glu Gly Val
50 55 60
Gly Ile Val Glu Glu Val Gly Pro Gly Val Thr His Leu Lys Val Gly
65 70 75 80
Asp Arg Val Gly Ile Pro Trp Leu Tyr Ser Ala Cys Gly His Cys Glu
85 90 95
Tyr Cys Leu Ser Gly Gln Glu Thr Leu Cys Glu His Gln Glu Asn Ala
100 105 110
Gly Tyr Ser Val Asp Gly Gly Tyr Ala Glu Tyr Cys Arg Ala Ala Ala
115 120 125
Asp Tyr Val Val Lys Ile Pro Asp Asn Leu Ser Phe Glu Glu Ala Ala
130 135 140
Pro Ile Phe Cys Ala Gly Val Thr Thr Tyr Lys Ala Leu Lys Val Thr
145 150 155 160
Gly Thr Lys Pro Gly Glu Trp Val Ala Ile Tyr Gly Ile Gly Gly Leu
165 170 175
Gly His Val Ala Val Gln Tyr Ala Lys Ala Met Gly Leu His Val Val
180 185 190
Ala Val Asp Ile Gly Asp Glu Lys Leu Glu Leu Ala Lys Glu Leu Gly
195 200 205
Ala Asp Leu Val Val Asn Pro Ala Lys Glu Asn Ala Ala Gln Phe Met
210 215 220
Lys Glu Lys Val Gly Gly Val His Ala Ala Val Val Thr Ala Val Ser
225 230 235 240
Lys Pro Ala Phe Gln Ser Ala Tyr Asn Ser Ile Arg Arg Gly Gly Thr
245 250 255
Cys Val Leu Val Gly Leu Pro Pro Glu Glu Met Pro Ile Pro Ile Phe
260 265 270
Asp Thr Val Leu Asn Gly Ile Lys Ile Ile Gly Ser Ile Val Gly Thr
275 280 285
Arg Lys Asp Leu Gln Glu Ala Leu Gln Phe Ala Ala Glu Gly Lys Val
290 295 300
Lys Thr Ile Ile Glu Val Gln Pro Leu Glu Lys Ile Asn Glu Val Phe
305 310 315 320
Asp Arg Met Leu Lys Gly Glu Ile Asn Gly Arg Val Val Leu Thr Leu
325 330 335
Glu Asn Asn Asn
340
<210> 11
<211> 747
<212> DNA
<213> Leifsonia sp.
<400> 11
atggctcagt acgacgtcgc cgaccggtcc gcgatcgtga ccggaggcgg ctcgggcatc 60
gggcgcgccg tggcgctcac tctcgcggcg agcggcgcag ccgtcctcgt caccgacctg 120
aacgaggagc acgcgcaggc cgtcgtggcc gagatcgagg ccgcgggcgg taaggccgcc 180
gcgctcgcgg gcgacgtgac cgaccccgcg ttcggcgagg cgagcgtcgc cggggcgaac 240
gctctcgcgc ccctcaagat cgcggtcaac aacgcgggca tcggcggcga ggccgccacg 300
gtcggcgact actcgctcga cagctggcgc acggtgatcg aggtcaacct caacgccgtg 360
ttctacggga tgcagccgca gctgaaggcc atggccgcca acggcggcgg tgcgatcgtc 420
aacatggcgt ccatcctggg aagcgtcggc ttcgccaact cgtcggccta cgtcacggcc 480
aagcacgcgc tgctcggtct cacccagaac gccgcgctcg agtacgccgc cgacaaggtg 540
cgcgtcgtcg cggtcggccc cggcttcatc cgcaccccgc tcgtggaggc caacctctcc 600
gccgacgcgc tggcgttcct cgagggcaag cacgccctcg gccgcctggg cgagccggaa 660
gaggtcgcct cgctggtcgc gttcctcgcc tccgacgccg cgagcttcat caccggcagc 720
taccacctgg tggacggcgg ctacacc 747
<210> 12<210> 12
<211> 249<211> 249
<212> PRT<212> PRT
<213> 赖氏菌属(Leifsonia sp.)<213> Leifsonia sp.
<400> 12<400> 12
Met Ala Gln Tyr Asp Val Ala Asp Arg Ser Ala Ile Val Thr Gly Gly
1 5 10 15
Gly Ser Gly Ile Gly Arg Ala Val Ala Leu Thr Leu Ala Ala Ser Gly
20 25 30
Ala Ala Val Leu Val Thr Asp Leu Asn Glu Glu His Ala Gln Ala Val
35 40 45
Val Ala Glu Ile Glu Ala Ala Gly Gly Lys Ala Ala Ala Leu Ala Gly
50 55 60
Asp Val Thr Asp Pro Ala Phe Gly Glu Ala Ser Val Ala Gly Ala Asn
65 70 75 80
Ala Leu Ala Pro Leu Lys Ile Ala Val Asn Asn Ala Gly Ile Gly Gly
85 90 95
Glu Ala Ala Thr Val Gly Asp Tyr Ser Leu Asp Ser Trp Arg Thr Val
100 105 110
Ile Glu Val Asn Leu Asn Ala Val Phe Tyr Gly Met Gln Pro Gln Leu
115 120 125
Lys Ala Met Ala Ala Asn Gly Gly Gly Ala Ile Val Asn Met Ala Ser
130 135 140
Ile Leu Gly Ser Val Gly Phe Ala Asn Ser Ser Ala Tyr Val Thr Ala
145 150 155 160
Lys His Ala Leu Leu Gly Leu Thr Gln Asn Ala Ala Leu Glu Tyr Ala
165 170 175
Ala Asp Lys Val Arg Val Val Ala Val Gly Pro Gly Phe Ile Arg Thr
180 185 190
Pro Leu Val Glu Ala Asn Leu Ser Ala Asp Ala Leu Ala Phe Leu Glu
195 200 205
Gly Lys His Ala Leu Gly Arg Leu Gly Glu Pro Glu Glu Val Ala Ser
210 215 220
Leu Val Ala Phe Leu Ala Ser Asp Ala Ala Ser Phe Ile Thr Gly Ser
225 230 235 240
Tyr His Leu Val Asp Gly Gly Tyr Thr
245
<210> 13
<211> 1059
<212> DNA
<213> Thermoanaerobacter wiegelii
<400> 13
atgaaaggtt ttgcaatgct cagtatcggt aaggttggct ggattgaggt agaaaagcct 60
aatccaggac cctttgatgc tatcgtaaga cccctagctg tggccccttg ctcttcggac 120
attcacactg tttttgaagg aggccttggt gaacttcaca acgcagtgct aggtcacgaa 180
gctgtaggtg aagtagtcga agtcggtagt gaagtaaaag actttaaacc tggtgataag 240
gtggtcattc ctgctatcac tcctgattgg agaacgttag atgttcaacg tggttatcat 300
cagcagtccg gaggtatgct tgctggttac aagttcacag cccagaaacc tggtgtgttc 360
gccgagtaca tctacgttaa cgatgcagac atgaatcttg ctcatttacc tgacggcatc 420
tctttagaag cggccgtcat gatcacagat atgatgacta ccggttttca cggagccgaa 480
ctggcagaaa tagaattagg tgcaacagta gcggttttgg gtattggtcc agtaggtctt 540
atggcagtcg ctggtgccaa attgcggggt gctggaagaa ttattgcagt aggcagtaga 600
cctgtttgtg tagatgctgc aaaatactat ggagctactg atattgtaaa ctataaaaat 660
ggtcctatcg acagtcagat tatggattta acgaaaggca aaggtgttga tgctgccatc 720
atcgctggag gaaatgttga catcatggct acagcagtta agattgttaa acctggtggc 780
accattgcta atgtaaatta ctttggcgaa ggagatgttt tgcctgttcc tcgtcttgaa 840
tggggttgcg gcatggctca taaagctata aaaggcggtt tatgccctgg tggacgtcta 900
agaatggaaa gactgattga ccttgttttt tataagcgtg tcgatccttc caaactcgtc 960
actcatgttt ttcaaggatt tgataatatt gaaaaagctc taatgctgat gaaagataaa 1020
ccaaaggacc taatcaaacc tgttgtaata ttagcataa 1059
<210> 14
<211> 352
<212> PRT
<213> Thermoanaerobacter wiegelii
<400> 14
Met Lys Gly Phe Ala Met Leu Ser Ile Gly Lys Val Gly Trp Ile Glu
1 5 10 15
Val Glu Lys Pro Asn Pro Gly Pro Phe Asp Ala Ile Val Arg Pro Leu
20 25 30
Ala Val Ala Pro Cys Ser Ser Asp Ile His Thr Val Phe Glu Gly Gly
35 40 45
Leu Gly Glu Leu His Asn Ala Val Leu Gly His Glu Ala Val Gly Glu
50 55 60
Val Val Glu Val Gly Ser Glu Val Lys Asp Phe Lys Pro Gly Asp Lys
65 70 75 80
Val Val Ile Pro Ala Ile Thr Pro Asp Trp Arg Thr Leu Asp Val Gln
85 90 95
Arg Gly Tyr His Gln Gln Ser Gly Gly Met Leu Ala Gly Tyr Lys Phe
100 105 110
Thr Ala Gln Lys Pro Gly Val Phe Ala Glu Tyr Ile Tyr Val Asn Asp
115 120 125
Ala Asp Met Asn Leu Ala His Leu Pro Asp Gly Ile Ser Leu Glu Ala
130 135 140
Ala Val Met Ile Thr Asp Met Met Thr Thr Gly Phe His Gly Ala Glu
145 150 155 160
Leu Ala Glu Ile Glu Leu Gly Ala Thr Val Ala Val Leu Gly Ile Gly
165 170 175
Pro Val Gly Leu Met Ala Val Ala Gly Ala Lys Leu Arg Gly Ala Gly
180 185 190
Arg Ile Ile Ala Val Gly Ser Arg Pro Val Cys Val Asp Ala Ala Lys
195 200 205
Tyr Tyr Gly Ala Thr Asp Ile Val Asn Tyr Lys Asn Gly Pro Ile Asp
210 215 220
Ser Gln Ile Met Asp Leu Thr Lys Gly Lys Gly Val Asp Ala Ala Ile
225 230 235 240
Ile Ala Gly Gly Asn Val Asp Ile Met Ala Thr Ala Val Lys Ile Val
245 250 255
Lys Pro Gly Gly Thr Ile Ala Asn Val Asn Tyr Phe Gly Glu Gly Asp
260 265 270
Val Leu Pro Val Pro Arg Leu Glu Trp Gly Cys Gly Met Ala His Lys
275 280 285
Ala Ile Lys Gly Gly Leu Cys Pro Gly Gly Arg Leu Arg Met Glu Arg
290 295 300
Leu Ile Asp Leu Val Phe Tyr Lys Arg Val Asp Pro Ser Lys Leu Val
305 310 315 320
Thr His Val Phe Gln Gly Phe Asp Asn Ile Glu Lys Ala Leu Met Leu
325 330 335
Met Lys Asp Lys Pro Lys Asp Leu Ile Lys Pro Val Val Ile Leu Ala
340 345 350
<210> 15
<211> 1035
<212> DNA
<213> Artificial Sequence
<220>
<223>
<400> 15
atggcgaaga tcgacaacgc ggtgctgccg gaaggtagcc tggtgctggt taccggtgcg 60
aacggtttcg tggcgagcca cgtggttgag cagctgctgg aacacggtta caaggttcgt 120
ggcaccgcgc gtagcgcgag caaactggcg aacctgcaaa agcgttggga cgcgaaatac 180
ccgggtcgtt ttgagaccgc ggtggttgaa gacatgctga agcagggcgc gtatgatgaa 240
gtgatcaagg gtgcggcggg cgttgcgcac attgcgagcg tggttagctt cagcaacaag 300
tatgatgagg tggttacccc ggcgatcggt ggcaccctga acgcgctgcg tgctgcggcg 360
gcgaccccga gcgtgaaacg ttttgttctg accagcagca ccgtgagcgc gctgatcccg 420
aagccgaacg ttgagggtat ttacctggat gagaagagct ggaacctgga gagcattgac 480
aaggcgaaaa ccctgccgga aagcgatccg caaaagagcc tgtgggtgta tgcggcgagc 540
aaaaccgagg cggaactggc ggcgtggaag ttcatggacg agaacaaacc gcactttacc 600
ctgaacgcgg ttctgccgaa ctacaccatc ggcaccattt tcgatccgga aacccagagc 660
ggtagcacca gcggctggat gatgagcctg tttaacggcg aggtgtctcc ggcgctggcg 720
ctgatgccgc cgcagtacta tgtgagcgcg gttgacatcg gtctgctgca cctgggttgc 780
ctggtgctgc cgcaaattga gcgtcgtcgt gtttacggca ccgcgggcac cttcgattgg 840
aacaccgttc tggcgacctt tcgtaagctg tatccgagca aaaccttccc ggcggacttt 900
ccggatcagg gtcaagacct gagcaagttc gataccgcgc cgagcctgga aattctgaaa 960
agcctgggtc gtccgggttg gcgtagcatc gaggaaagca ttaaagacct ggttggcagc 1020
gagaccgcgc actaa 1035
<210> 16
<211> 344
<212> PRT
<213> Sporobolomyces salmonicolor
<400> 16
Met Ala Lys Ile Asp Asn Ala Val Leu Pro Glu Gly Ser Leu Val Leu
1 5 10 15
Val Thr Gly Ala Asn Gly Phe Val Ala Ser His Val Val Glu Gln Leu
20 25 30
Leu Glu His Gly Tyr Lys Val Arg Gly Thr Ala Arg Ser Ala Ser Lys
35 40 45
Leu Ala Asn Leu Gln Lys Arg Trp Asp Ala Lys Tyr Pro Gly Arg Phe
50 55 60
Glu Thr Ala Val Val Glu Asp Met Leu Lys Gln Gly Ala Tyr Asp Glu
65 70 75 80
Val Ile Lys Gly Ala Ala Gly Val Ala His Ile Ala Ser Val Val Ser
85 90 95
Phe Ser Asn Lys Tyr Asp Glu Val Val Thr Pro Ala Ile Gly Gly Thr
100 105 110
Leu Asn Ala Leu Arg Ala Ala Ala Ala Thr Pro Ser Val Lys Arg Phe
115 120 125
Val Leu Thr Ser Ser Thr Val Ser Ala Leu Ile Pro Lys Pro Asn Val
130 135 140
Glu Gly Ile Tyr Leu Asp Glu Lys Ser Trp Asn Leu Glu Ser Ile Asp
145 150 155 160
Lys Ala Lys Thr Leu Pro Glu Ser Asp Pro Gln Lys Ser Leu Trp Val
165 170 175
Tyr Ala Ala Ser Lys Thr Glu Ala Glu Leu Ala Ala Trp Lys Phe Met
180 185 190
Asp Glu Asn Lys Pro His Phe Thr Leu Asn Ala Val Leu Pro Asn Tyr
195 200 205
Thr Ile Gly Thr Ile Phe Asp Pro Glu Thr Gln Ser Gly Ser Thr Ser
210 215 220
Gly Trp Met Met Ser Leu Phe Asn Gly Glu Val Ser Pro Ala Leu Ala
225 230 235 240
Leu Met Pro Pro Gln Tyr Tyr Val Ser Ala Val Asp Ile Gly Leu Leu
245 250 255
His Leu Gly Cys Leu Val Leu Pro Gln Ile Glu Arg Arg Arg Val Tyr
260 265 270
Gly Thr Ala Gly Thr Phe Asp Trp Asn Thr Val Leu Ala Thr Phe Arg
275 280 285
Lys Leu Tyr Pro Ser Lys Thr Phe Pro Ala Asp Phe Pro Asp Gln Gly
290 295 300
Gln Asp Leu Ser Lys Phe Asp Thr Ala Pro Ser Leu Glu Ile Leu Lys
305 310 315 320
Ser Leu Gly Arg Pro Gly Trp Arg Ser Ile Glu Glu Ser Ile Lys Asp
325 330 335
Leu Val Gly Ser Glu Thr Ala His
340
<210> 17
<211> 1104
<212> DNA
<213> Artificial Sequence
<220>
<223>
<400> 17
atggaactgt ttaagtatat ggaaacctat gattatgaac aagtgctgtt ttgccaggac 60
aaggaaagcg gtctgaaggc gattattgcc attcatgata ccacgctggg tccggcactg 120
ggcggtaccc gtatgtggat gtataacagc gaagaagaag cgctggaaga tgccctgcgt 180
ctggcacgcg gtatgaccta caaaaacgca gcagcaggtc tgaatctggg cggtggcaag 240
acggtgatta tcggtgatcc gcgcaaagac aagaacgaag ctatgtttcg tgcgttcggc 300
cgctttatcc agggtctgaa tggccgttat attaccgcgg aagatgtggg taccacggtt 360
gccgatatgg acattatcta tcaagaaacc gactacgtga cgggcatttc accggaattt 420
ggtagctctg gtaacccgtc gccggcaacg gcctatggtg tttaccgtgg catgaaagct 480
gcggccaagg aagcatttgg tagtgattcc ctggaaggca aagtggttgc agttcagggt 540
gtcggcaatg tggcttatca cctgtgccgc catctgcacg aagaaggcgc caaactgatt 600
gtgaccgata tcaacaagga agtcgtggca cgtgctgttg aagaatttgg tgcgaaagcc 660
gtcgatccga atgacatcta cggcgtggaa tgcgatattt ttgcgccgtg tgccctgggt 720
ggcattatca acgaccagac catcccgcaa ctgaaagcga aggttattgc aggtagtgct 780
aacaatcagc tgaaagaacc gcgtcatggc gatattatcc acgaaatggg catcgtctat 840
gccccggact acgtgattaa cgcaggtggc gttatcaatg tcgctgatga actgtatggc 900
tacaatcgtg aacgcgcgat gaaaaagatt gaacaaatct atgacaacat cgaaaaagtt 960
ttcgcaatcg ctaagcgtga taatattccg acctacgtcg cagctgaccg tatggccgaa 1020
gaacgcattg aaacgatgcg taaagcccgt tcccagttcc tgcaaaatgg tcatcatatt 1080
ctgagccgcc gtcgtgcccg ctaa 1104
<210> 18
<211> 367
<212> PRT
<213> Geobacillus stearothermophilus
<400> 18
Met Glu Leu Phe Lys Tyr Met Glu Thr Tyr Asp Tyr Glu Gln Val Leu
1 5 10 15
Phe Cys Gln Asp Lys Glu Ser Gly Leu Lys Ala Ile Ile Ala Ile His
20 25 30
Asp Thr Thr Leu Gly Pro Ala Leu Gly Gly Thr Arg Met Trp Met Tyr
35 40 45
Asn Ser Glu Glu Glu Ala Leu Glu Asp Ala Leu Arg Leu Ala Arg Gly
50 55 60
Met Thr Tyr Lys Asn Ala Ala Ala Gly Leu Asn Leu Gly Gly Gly Lys
65 70 75 80
Thr Val Ile Ile Gly Asp Pro Arg Lys Asp Lys Asn Glu Ala Met Phe
85 90 95
Arg Ala Phe Gly Arg Phe Ile Gln Gly Leu Asn Gly Arg Tyr Ile Thr
100 105 110
Ala Glu Asp Val Gly Thr Thr Val Ala Asp Met Asp Ile Ile Tyr Gln
115 120 125
Glu Thr Asp Tyr Val Thr Gly Ile Ser Pro Glu Phe Gly Ser Ser Gly
130 135 140
Asn Pro Ser Pro Ala Thr Ala Tyr Gly Val Tyr Arg Gly Met Lys Ala
145 150 155 160
Ala Ala Lys Glu Ala Phe Gly Ser Asp Ser Leu Glu Gly Lys Val Val
165 170 175
Ala Val Gln Gly Val Gly Asn Val Ala Tyr His Leu Cys Arg His Leu
180 185 190
His Glu Glu Gly Ala Lys Leu Ile Val Thr Asp Ile Asn Lys Glu Val
195 200 205
Val Ala Arg Ala Val Glu Glu Phe Gly Ala Lys Ala Val Asp Pro Asn
210 215 220
Asp Ile Tyr Gly Val Glu Cys Asp Ile Phe Ala Pro Cys Ala Leu Gly
225 230 235 240
Gly Ile Ile Asn Asp Gln Thr Ile Pro Gln Leu Lys Ala Lys Val Ile
245 250 255
Ala Gly Ser Ala Asn Asn Gln Leu Lys Glu Pro Arg His Gly Asp Ile
260 265 270
Ile His Glu Met Gly Ile Val Tyr Ala Pro Asp Tyr Val Ile Asn Ala
275 280 285
Gly Gly Val Ile Asn Val Ala Asp Glu Leu Tyr Gly Tyr Asn Arg Glu
290 295 300
Arg Ala Met Lys Lys Ile Glu Gln Ile Tyr Asp Asn Ile Glu Lys Val
305 310 315 320
Phe Ala Ile Ala Lys Arg Asp Asn Ile Pro Thr Tyr Val Ala Ala Asp
325 330 335
Arg Met Ala Glu Glu Arg Ile Glu Thr Met Arg Lys Ala Arg Ser Gln
340 345 350
Phe Leu Gln Asn Gly His His Ile Leu Ser Arg Arg Arg Ala Arg
355 360 365
<210> 19
<211> 993
<212> DNA
<213> Artificial Sequence
<220>
<223>
<400> 19
atggcatttt ctgcagatac cagcgaaatt gtttataccc atgataccgg tctggattat 60
attacctata gcgattatga actggaccct gccaatccgc tggccggcgg tgctgcttgg 120
attgaaggtg cctttgtgcc gccgagtgaa gcacgcatta gtatttttga tcagggttat 180
ctgcatagtg atgtgaccta taccgtgttt catgtttgga atggtaatgc ctttcgtctg 240
gatgatcata ttgaacgtct gtttagcaat gccgaaagca tgcgcattat tccgccgctg 300
acccaggatg aagttaaaga aattgcactg gaactggttg caaaaaccga actgcgtgaa 360
gcatttgtta gtgttagcat tacccgtggc tatagcagca ccccgggtga acgtgatatt 420
accaaacatc gcccgcaggt ttatatgtat gcagttccgt atcagtggat tgttccgttt 480
gatcgcattc gtgatggtgt gcatgcaatg gttgcacaga gtgttcgtcg caccccgcgt 540
agcagcattg atccgcaggt taaaaatttt cagtggggcg atctgattcg cgccgttcag 600
gaaacccatg atcgcggctt tgaagcaccg ctgctgctgg atggcgatgg tctgctggcc 660
gaaggtagcg gctttaatgt ggtggttatt aaggatggcg ttgttcgcag cccgggtcgt 720
gcagcactgc cgggtattac ccgcaaaacc gtgctggaaa ttgcagaaag cctgggccat 780
gaagccattc tggcagatat taccctggca gaactgctgg atgcagatga agttctgggt 840
tgtaccaccg ccggtggcgt gtggccgttt gttagtgtgg atggtaatcc gattagcgat 900
ggcgtgccgg gtccggttac ccagagtatt attcgccgtt attgggaact gaatgtggaa 960
agcagcagtc tgctgacccc ggttcagtat taa 993
<210> 20
<211> 330
<212> PRT
<213> ATA-117
<400> 20
Met Ala Phe Ser Ala Asp Thr Ser Glu Ile Val Tyr Thr His Asp Thr
1 5 10 15
Gly Leu Asp Tyr Ile Thr Tyr Ser Asp Tyr Glu Leu Asp Pro Ala Asn
20 25 30
Pro Leu Ala Gly Gly Ala Ala Trp Ile Glu Gly Ala Phe Val Pro Pro
35 40 45
Ser Glu Ala Arg Ile Ser Ile Phe Asp Gln Gly Tyr Leu His Ser Asp
50 55 60
Val Thr Tyr Thr Val Phe His Val Trp Asn Gly Asn Ala Phe Arg Leu
65 70 75 80
Asp Asp His Ile Glu Arg Leu Phe Ser Asn Ala Glu Ser Met Arg Ile
85 90 95
Ile Pro Pro Leu Thr Gln Asp Glu Val Lys Glu Ile Ala Leu Glu Leu
100 105 110
Val Ala Lys Thr Glu Leu Arg Glu Ala Phe Val Ser Val Ser Ile Thr
115 120 125
Arg Gly Tyr Ser Ser Thr Pro Gly Glu Arg Asp Ile Thr Lys His Arg
130 135 140
Pro Gln Val Tyr Met Tyr Ala Val Pro Tyr Gln Trp Ile Val Pro Phe
145 150 155 160
Asp Arg Ile Arg Asp Gly Val His Ala Met Val Ala Gln Ser Val Arg
165 170 175
Arg Thr Pro Arg Ser Ser Ile Asp Pro Gln Val Lys Asn Phe Gln Trp
180 185 190
Gly Asp Leu Ile Arg Ala Val Gln Glu Thr His Asp Arg Gly Phe Glu
195 200 205
Ala Pro Leu Leu Leu Asp Gly Asp Gly Leu Leu Ala Glu Gly Ser Gly
210 215 220
Phe Asn Val Val Val Ile Lys Asp Gly Val Val Arg Ser Pro Gly Arg
225 230 235 240
Ala Ala Leu Pro Gly Ile Thr Arg Lys Thr Val Leu Glu Ile Ala Glu
245 250 255
Ser Leu Gly His Glu Ala Ile Leu Ala Asp Ile Thr Leu Ala Glu Leu
260 265 270
Leu Asp Ala Asp Glu Val Leu Gly Cys Thr Thr Ala Gly Gly Val Trp
275 280 285
Pro Phe Val Ser Val Asp Gly Asn Pro Ile Ser Asp Gly Val Pro Gly
290 295 300
Pro Val Thr Gln Ser Ile Ile Arg Arg Tyr Trp Glu Leu Asn Val Glu
305 310 315 320
Ser Ser Ser Leu Leu Thr Pro Val Gln Tyr
325 330
<210> 21
<211> 975
<212> DNA
<213> Artificial Sequence
<220>
<223>
<400> 21
atggctagta tggataaggt gttcgccggc tatgccgcac gtcaagcaat tctggaaagt 60
accgaaacca ccaatccgtt tgcaaaaggt attgcctggg ttgaaggtga actggtgccg 120
ttagccgaag cacgtattcc gctgctggat cagggcttta tgcatagtga tctgacctat 180
gatgtgccga gtgtttggga tggtcgcttt ttccgcctgg atgatcatat tacccgcctg 240
gaagcaagct gcaccaaact gcgtctgcgc ttaccgctgc ctcgtgacca ggtgaaacag 300
attctggtgg aaatggttgc aaagagcggt attcgcgatg cctttgtgga actgattgtt 360
acccgcggcc tgaaaggtgt gcgcggtacg cgtcctgaag atattgtgaa taatctgtac 420
atgttcgtgc agccgtatgt ttgggttatg gaaccggata tgcagcgtgt gggtggcagt 480
gcagtggttg caagaaccgt gcgtcgcgtt cctcctggtg caattgatcc gaccgttaaa 540
aatctgcagt ggggtgacct ggttcgcggt atgtttgaag ccgccgatcg tggtgccacc 600
tatccttttc tgaccgatgg tgacgcacat ctgaccgaag gcagcggttt taatattgtg 660
ctggttaaag acggcgtgct gtataccccg gatcgcggtg tgttacaggg cgtgaccaga 720
aaaagtgtta ttaatgcagc cgaggccttt ggcattgaag tgcgtgtgga atttgtgccg 780
gtggaactgg cctatcgctg cgacgagatt tttatgtgca ccaccgccgg tggtattatg 840
ccgattacca ccctggatgg tatgccggtg aatggcggtc agattggccc tattaccaaa 900
aagatttggg acggttactg ggccatgcat tatgatgcag catatagctt tgagatcgac 960
tataacgagc gtaat 975
<210> 22
<211> 325
<212> PRT
<213> Aspergillus terreus
<400> 22
Met Ala Ser Met Asp Lys Val Phe Ala Gly Tyr Ala Ala Arg Gln AlaMet Ala Ser Met Asp Lys Val Phe Ala Gly Tyr Ala Ala Arg Gln Ala
1 5 10 151 5 10 15
Ile Leu Glu Ser Thr Glu Thr Thr Asn Pro Phe Ala Lys Gly Ile AlaIle Leu Glu Ser Thr Glu Thr Thr Asn Pro Phe Ala Lys Gly Ile Ala
20 25 30 20 25 30
Trp Val Glu Gly Glu Leu Val Pro Leu Ala Glu Ala Arg Ile Pro LeuTrp Val Glu Gly Glu Leu Val Pro Leu Ala Glu Ala Arg Ile Pro Leu
35 40 45 35 40 45
Leu Asp Gln Gly Phe Met His Ser Asp Leu Thr Tyr Asp Val Pro SerLeu Asp Gln Gly Phe Met His Ser Asp Leu Thr Tyr Asp Val Pro Ser
50 55 60 50 55 60
Val Trp Asp Gly Arg Phe Phe Arg Leu Asp Asp His Ile Thr Arg Leu
65 70 75 80
Glu Ala Ser Cys Thr Lys Leu Arg Leu Arg Leu Pro Leu Pro Arg Asp
85 90 95
Gln Val Lys Gln Ile Leu Val Glu Met Val Ala Lys Ser Gly Ile Arg
100 105 110
Asp Ala Phe Val Glu Leu Ile Val Thr Arg Gly Leu Lys Gly Val Arg
115 120 125
Gly Thr Arg Pro Glu Asp Ile Val Asn Asn Leu Tyr Met Phe Val Gln
130 135 140
Pro Tyr Val Trp Val Met Glu Pro Asp Met Gln Arg Val Gly Gly Ser
145 150 155 160
Ala Val Val Ala Arg Thr Val Arg Arg Val Pro Pro Gly Ala Ile Asp
165 170 175
Pro Thr Val Lys Asn Leu Gln Trp Gly Asp Leu Val Arg Gly Met Phe
180 185 190
Glu Ala Ala Asp Arg Gly Ala Thr Tyr Pro Phe Leu Thr Asp Gly Asp
195 200 205
Ala His Leu Thr Glu Gly Ser Gly Phe Asn Ile Val Leu Val Lys Asp
210 215 220
Gly Val Leu Tyr Thr Pro Asp Arg Gly Val Leu Gln Gly Val Thr Arg
225 230 235 240
Lys Ser Val Ile Asn Ala Ala Glu Ala Phe Gly Ile Glu Val Arg Val
245 250 255
Glu Phe Val Pro Val Glu Leu Ala Tyr Arg Cys Asp Glu Ile Phe Met
260 265 270
Cys Thr Thr Ala Gly Gly Ile Met Pro Ile Thr Thr Leu Asp Gly Met
275 280 285
Pro Val Asn Gly Gly Gln Ile Gly Pro Ile Thr Lys Lys Ile Trp Asp
290 295 300
Gly Tyr Trp Ala Met His Tyr Asp Ala Ala Tyr Ser Phe Glu Ile Asp
305 310 315 320
Tyr Asn Glu Arg Asn
325
<210> 23
<211> 969
<212> DNA
<213> Artificial sequence
<220>
<223>
<400> 23
atggcaagta tggataaggt gttcagtggt tattacgccc gccagaaact gctggaacgt 60
agcgataatc cgtttagcaa aggtattgca tacgtggaag gcaaactggt tctgccgagc 120
gatgcacgta ttccgctgtt agatgaaggt tttatgcaca gtgacctgac ctatgatgtt 180
attagcgtgt gggatggtcg ctttttccgc ctggatgatc atctgcagcg tattctggaa 240
agttgtgata aaatgcgcct gaaattcccg ctggccctgt caagtgttaa aaatattctg 300
gcagagatgg tggccaaaag tggcattcgt gatgcatttg ttgaggttat tgtgacccgc 360
ggtctgaccg gtgtgagagg tagcaaaccg gaagatttgt ataacaacaa catctacctg 420
ctggtgctgc cgtatatttg ggttatggca ccggaaaatc agctgcatgg tggtgaagcc 480
attattaccc gcaccgtgcg ccgtacaccg cctggtgcat tcgaccctac aattaagaat 540
ctgcagtggg gtgacctgac caaaggtctg tttgaagcaa tggatcgcgg cgccacctat 600
ccttttctga ccgatggtga caccaatctg accgaaggta gcggttttaa tatcgttctg 660
gttaagaacg gcatcatcta taccccggat cgtggtgtgc tgcgtggtat tacccgcaaa 720
agtgtgattg atgttgcacg tgcaaacagt attgacattc gtctggaagt tgtgccggtt 780
gaacaggcct atcatagcga tgaaattttc atgtgcacca ccgccggtgg cattatgcct 840
attaccctgc tggatggcca gccggttaat gatggtcagg tgggtccgat taccaaaaag 900
atttgggatg gttactggga aatgcattac aatccggcat atagcttccc ggtggattat 960
ggtagtggt 969
<210> 24
<211> 323
<212> PRT
<213> Aspergillus fumigatus
<400> 24
Met Ala Ser Met Asp Lys Val Phe Ser Gly Tyr Tyr Ala Arg Gln Lys
1 5 10 15
Leu Leu Glu Arg Ser Asp Asn Pro Phe Ser Lys Gly Ile Ala Tyr Val
20 25 30
Glu Gly Lys Leu Val Leu Pro Ser Asp Ala Arg Ile Pro Leu Leu Asp
35 40 45
Glu Gly Phe Met His Ser Asp Leu Thr Tyr Asp Val Ile Ser Val Trp
50 55 60
Asp Gly Arg Phe Phe Arg Leu Asp Asp His Leu Gln Arg Ile Leu Glu
65 70 75 80
Ser Cys Asp Lys Met Arg Leu Lys Phe Pro Leu Ala Leu Ser Ser Val
85 90 95
Lys Asn Ile Leu Ala Glu Met Val Ala Lys Ser Gly Ile Arg Asp Ala
100 105 110
Phe Val Glu Val Ile Val Thr Arg Gly Leu Thr Gly Val Arg Gly Ser
115 120 125
Lys Pro Glu Asp Leu Tyr Asn Asn Asn Ile Tyr Leu Leu Val Leu Pro
130 135 140
Tyr Ile Trp Val Met Ala Pro Glu Asn Gln Leu His Gly Gly Glu Ala
145 150 155 160
Ile Ile Thr Arg Thr Val Arg Arg Thr Pro Pro Gly Ala Phe Asp Pro
165 170 175
Thr Ile Lys Asn Leu Gln Trp Gly Asp Leu Thr Lys Gly Leu Phe Glu
180 185 190
Ala Met Asp Arg Gly Ala Thr Tyr Pro Phe Leu Thr Asp Gly Asp Thr
195 200 205
Asn Leu Thr Glu Gly Ser Gly Phe Asn Ile Val Leu Val Lys Asn Gly
210 215 220
Ile Ile Tyr Thr Pro Asp Arg Gly Val Leu Arg Gly Ile Thr Arg Lys
225 230 235 240
Ser Val Ile Asp Val Ala Arg Ala Asn Ser Ile Asp Ile Arg Leu Glu
245 250 255
Val Val Pro Val Glu Gln Ala Tyr His Ser Asp Glu Ile Phe Met Cys
260 265 270
Thr Thr Ala Gly Gly Ile Met Pro Ile Thr Leu Leu Asp Gly Gln Pro
275 280 285
Val Asn Asp Gly Gln Val Gly Pro Ile Thr Lys Lys Ile Trp Asp Gly
290 295 300
Tyr Trp Glu Met His Tyr Asn Pro Ala Tyr Ser Phe Pro Val Asp Tyr
305 310 315 320
Gly Ser Gly
<210> 25
<211> 969
<212> DNA
<213> Artificial sequence
<220>
<223>
<400> 25
atggctagta tggataaggt gttcagcggc tatcatgccc gccagaaact gctggaacgt 60
agtgataatc cgtttagtaa gggcattgcc tatgtggaag gtaaactggt gctgccgagt 120
gatgcccgta ttcctctgct ggatgaaggc tttatgcatg gtgacctgac ctatgatgtt 180
accaccgtgt gggatggtcg ctttttccgt ctggatgatc acatgcagcg tattctggaa 240
agctgcgata aaatgcgtct gaaattcccg ctggccccga gtacagttaa aaatattctg 300
gcagagatgg tggcaaagag cggcattcgc gatgcctttg ttgaagtgat tgttacccgt 360
ggtctgaccg gtgttcgtgg tagtaaaccg gaagatttgt ataacaacaa catctacctg 420
ctggtgctgc cttatgtgtg ggttatggca ccggaaaatc agctgctggg cggttcagca 480
attattaccc gcaccgtgcg ccgtacccct cctggtgcat tcgaccctac aattaagaat 540
ctgcagtggg gcgatctgac caaaggctta tttgaagcaa tggatcgcgg cgccacctat 600
ccttttctga ccgatggtga caccaatctg accgaaggta gcggctttaa tattgttctg 660
gtgaaaaacg gcatcatcta caccccggat cgcggtgttc tgcgtggtat tacccgcaaa 720
agtgttattg atgtggcccg cgcaaataat attgatattc gtctggaggt ggtgccggtt 780
gaacaggttt atcatagtga tgaaatcttc atgtgcacca ccgccggcgg tattatgcct 840
attaccctgc tggatggtca gccggttaat gatggtcagg ttggcccgat taccaaaaag 900
atttgggatg gctattggga aatgcattac aatccggcat acagctttcc ggttgattat 960
ggtagcggc 969
<210> 26
<211> 323
<212> PRT
<213> Neosartorya fischeri
<400> 26
Met Ala Ser Met Asp Lys Val Phe Ser Gly Tyr His Ala Arg Gln Lys
1 5 10 15
Leu Leu Glu Arg Ser Asp Asn Pro Phe Ser Lys Gly Ile Ala Tyr Val
20 25 30
Glu Gly Lys Leu Val Leu Pro Ser Asp Ala Arg Ile Pro Leu Leu Asp
35 40 45
Glu Gly Phe Met His Gly Asp Leu Thr Tyr Asp Val Thr Thr Val Trp
50 55 60
Asp Gly Arg Phe Phe Arg Leu Asp Asp His Met Gln Arg Ile Leu Glu
65 70 75 80
Ser Cys Asp Lys Met Arg Leu Lys Phe Pro Leu Ala Pro Ser Thr Val
85 90 95
Lys Asn Ile Leu Ala Glu Met Val Ala Lys Ser Gly Ile Arg Asp Ala
100 105 110
Phe Val Glu Val Ile Val Thr Arg Gly Leu Thr Gly Val Arg Gly Ser
115 120 125
Lys Pro Glu Asp Leu Tyr Asn Asn Asn Ile Tyr Leu Leu Val Leu Pro
130 135 140
Tyr Val Trp Val Met Ala Pro Glu Asn Gln Leu Leu Gly Gly Ser Ala
145 150 155 160
Ile Ile Thr Arg Thr Val Arg Arg Thr Pro Pro Gly Ala Phe Asp Pro
165 170 175
Thr Ile Lys Asn Leu Gln Trp Gly Asp Leu Thr Lys Gly Leu Phe Glu
180 185 190
Ala Met Asp Arg Gly Ala Thr Tyr Pro Phe Leu Thr Asp Gly Asp Thr
195 200 205
Asn Leu Thr Glu Gly Ser Gly Phe Asn Ile Val Leu Val Lys Asn Gly
210 215 220
Ile Ile Tyr Thr Pro Asp Arg Gly Val Leu Arg Gly Ile Thr Arg Lys
225 230 235 240
Ser Val Ile Asp Val Ala Arg Ala Asn Asn Ile Asp Ile Arg Leu Glu
245 250 255
Val Val Pro Val Glu Gln Val Tyr His Ser Asp Glu Ile Phe Met Cys
260 265 270
Thr Thr Ala Gly Gly Ile Met Pro Ile Thr Leu Leu Asp Gly Gln Pro
275 280 285
Val Asn Asp Gly Gln Val Gly Pro Ile Thr Lys Lys Ile Trp Asp Gly
290 295 300
Tyr Trp Glu Met His Tyr Asn Pro Ala Tyr Ser Phe Pro Val Asp Tyr
305 310 315 320
Gly Ser Gly
<210> 27
<211> 975
<212> DNA
<213> Artificial sequence
<220>
<223>
<400> 27
atgagcacaa tggataagat ttttgcaggc catgcacagc gccaggcaac attagtggcc 60
agtgataata ttttcgcgaa cggcattgca tggattcagg gtgaactggt tccgctgaat 120
gaagcacgta ttccgctgat ggatcagggc tttatgcatg gtgacctgac ctatgatgtt 180
ccggcagttt gggatggccg ctttttccgt ctggatgatc atctggatcg cctggaagca 240
agtgttaaaa agatgcgtat gcagttcccg attccgcgtg atgaaattcg catgaccctg 300
ctggatatgc tggcaaaaag tggcattaag gatgcctttg tggaactgat tgtgacccgc 360
ggtctgaaac cggtgcgtga ggcaaaaccg ggtgaagttc tgaataatca tctgtatctg 420
atcgtgcagc cgtatgtttg ggttatgagc ccggaagccc agtatgttgg cggtaatgcc 480
gtgattgcac gcaccgttcg tcgtattccg ccgggtagca tggaccctac aattaagaat 540
ctgcagtgga gcgatttcac ccgcggcatg tttgaagcct atgatcgcgg cgcccagtat 600
ccttttctga ccgatggcga taccaatatt accgaaggta gcggttttaa cgtggttttt 660
gttaagaaca acgtgatcta caccccgaat cgtggtgttc tgcagggtat tacccgtaaa 720
agtgttattg acgccgcaaa atggtgtggt catgaagtgc gtgtggaata tgttccggtt 780
gaaatggcct atgaagcaga tgaaatcttc atgtgcacca ccgcaggcgg cattatgcct 840
attaccacaa tggatggtaa accggtgaaa gatggtaaag tgggtccggt taccaaagca 900
atttgggatc gttattgggc catgcattgg gaagatgaat tttcattcaa gatcgactac 960
cagaagctga aactg 975
<210> 28
<211> 325
<212> PRT
<213> Gibberella zeae
<400> 28
Met Ser Thr Met Asp Lys Ile Phe Ala Gly His Ala Gln Arg Gln Ala
1 5 10 15
Thr Leu Val Ala Ser Asp Asn Ile Phe Ala Asn Gly Ile Ala Trp Ile
20 25 30
Gln Gly Glu Leu Val Pro Leu Asn Glu Ala Arg Ile Pro Leu Met Asp
35 40 45
Gln Gly Phe Met His Gly Asp Leu Thr Tyr Asp Val Pro Ala Val Trp
50 55 60
Asp Gly Arg Phe Phe Arg Leu Asp Asp His Leu Asp Arg Leu Glu Ala
65 70 75 80
Ser Val Lys Lys Met Arg Met Gln Phe Pro Ile Pro Arg Asp Glu Ile
85 90 95
Arg Met Thr Leu Leu Asp Met Leu Ala Lys Ser Gly Ile Lys Asp Ala
100 105 110
Phe Val Glu Leu Ile Val Thr Arg Gly Leu Lys Pro Val Arg Glu Ala
115 120 125
Lys Pro Gly Glu Val Leu Asn Asn His Leu Tyr Leu Ile Val Gln Pro
130 135 140
Tyr Val Trp Val Met Ser Pro Glu Ala Gln Tyr Val Gly Gly Asn Ala
145 150 155 160
Val Ile Ala Arg Thr Val Arg Arg Ile Pro Pro Gly Ser Met Asp Pro
165 170 175
Thr Ile Lys Asn Leu Gln Trp Ser Asp Phe Thr Arg Gly Met Phe Glu
180 185 190
Ala Tyr Asp Arg Gly Ala Gln Tyr Pro Phe Leu Thr Asp Gly Asp Thr
195 200 205
Asn Ile Thr Glu Gly Ser Gly Phe Asn Val Val Phe Val Lys Asn Asn
210 215 220
Val Ile Tyr Thr Pro Asn Arg Gly Val Leu Gln Gly Ile Thr Arg Lys
225 230 235 240
Ser Val Ile Asp Ala Ala Lys Trp Cys Gly His Glu Val Arg Val Glu
245 250 255
Tyr Val Pro Val Glu Met Ala Tyr Glu Ala Asp Glu Ile Phe Met Cys
260 265 270
Thr Thr Ala Gly Gly Ile Met Pro Ile Thr Thr Met Asp Gly Lys Pro
275 280 285
Val Lys Asp Gly Lys Val Gly Pro Val Thr Lys Ala Ile Trp Asp Arg
290 295 300
Tyr Trp Ala Met His Trp Glu Asp Glu Phe Ser Phe Lys Ile Asp Tyr
305 310 315 320
Gln Lys Leu Lys Leu
325
<210> 29
<211> 1011
<212> DNA
<213> Artificial sequence
<220>
<223>
<400> 29
atgggtatcg acaccggtac aagcaatctg gtggccgtgg aaccgggtgc aattagagaa 60
gataccccgg ccggtagcgt gattcagtat agcgattatg aaatcgacta cagcagcccg 120
tttgcaggtg gtgtggcttg gattgaaggc gaatatctgc cggccgaaga tgccaaaatt 180
agcatttttg acaccggttt cggccatagc gatctgacct ataccgttgc acatgtttgg 240
catggcaata ttttccgcct gggcgatcat ctggatcgtc tgttagatgg cgcacgtaaa 300
ctgcgtctgg atagtggcta taccaaagat gaactggcag atattaccaa gaagtgcgtg 360
agcctgagcc agctgcgtga atcatttgtg aatctgacca ttacccgcgg ttatggtaaa 420
cgcaaaggtg aaaaagacct gagtaagctg acccatcagg tgtatatcta tgccattccg 480
tatctgtggg cctttccgcc tgccgagcaa atttttggca ccaccgccgt tgtgccgcgt 540
cacgtgcgtc gtgcaggtcg taacacagtt gatccgacca ttaagaatta ccagtggggt 600
gacctgaccg cagccagctt cgaggcaaaa gatcgcggtg ctcgcaccgc aattctgatg 660
gatgccgata attgtgtggc agaaggcccg ggttttaatg tgtgcattgt taaagacggc 720
aagctggcaa gcccgagtcg taatgcactg cctggtatta cccgtaaaac cgtgtttgaa 780
atcgccggtg caatgggcat tgaagccgca ttacgtgatg ttaccagtca tgaactgtac 840
gatgcagatg aaatcatggc agttaccacc gccggcggtg ttacacctat taataccctg 900
gatggcgttc cgattggtga cggtgaaccg ggtcctgtta ccgttgctat tcgtgatcgc 960
ttttgggcac tgatggatga accgggtccg ttaattgaag ccattcagta t 1011
<210> 30
<211> 337
<212> PRT
<213> Mycobacterium vanbaalenii
<400> 30
Met Gly Ile Asp Thr Gly Thr Ser Asn Leu Val Ala Val Glu Pro Gly
1 5 10 15
Ala Ile Arg Glu Asp Thr Pro Ala Gly Ser Val Ile Gln Tyr Ser Asp
20 25 30
Tyr Glu Ile Asp Tyr Ser Ser Pro Phe Ala Gly Gly Val Ala Trp Ile
35 40 45
Glu Gly Glu Tyr Leu Pro Ala Glu Asp Ala Lys Ile Ser Ile Phe Asp
50 55 60
Thr Gly Phe Gly His Ser Asp Leu Thr Tyr Thr Val Ala His Val Trp
65 70 75 80
His Gly Asn Ile Phe Arg Leu Gly Asp His Leu Asp Arg Leu Leu Asp
85 90 95
Gly Ala Arg Lys Leu Arg Leu Asp Ser Gly Tyr Thr Lys Asp Glu Leu
100 105 110
Ala Asp Ile Thr Lys Lys Cys Val Ser Leu Ser Gln Leu Arg Glu Ser
115 120 125
Phe Val Asn Leu Thr Ile Thr Arg Gly Tyr Gly Lys Arg Lys Gly Glu
130 135 140
Lys Asp Leu Ser Lys Leu Thr His Gln Val Tyr Ile Tyr Ala Ile Pro
145 150 155 160
Tyr Leu Trp Ala Phe Pro Pro Ala Glu Gln Ile Phe Gly Thr Thr Ala
165 170 175
Val Val Pro Arg His Val Arg Arg Ala Gly Arg Asn Thr Val Asp Pro
180 185 190
Thr Ile Lys Asn Tyr Gln Trp Gly Asp Leu Thr Ala Ala Ser Phe Glu
195 200 205
Ala Lys Asp Arg Gly Ala Arg Thr Ala Ile Leu Met Asp Ala Asp Asn
210 215 220
Cys Val Ala Glu Gly Pro Gly Phe Asn Val Cys Ile Val Lys Asp Gly
225 230 235 240
Lys Leu Ala Ser Pro Ser Arg Asn Ala Leu Pro Gly Ile Thr Arg Lys
245 250 255
Thr Val Phe Glu Ile Ala Gly Ala Met Gly Ile Glu Ala Ala Leu Arg
260 265 270
Asp Val Thr Ser His Glu Leu Tyr Asp Ala Asp Glu Ile Met Ala Val
275 280 285
Thr Thr Ala Gly Gly Val Thr Pro Ile Asn Thr Leu Asp Gly Val Pro
290 295 300
Ile Gly Asp Gly Glu Pro Gly Pro Val Thr Val Ala Ile Arg Asp Arg
305 310 315 320
Phe Trp Ala Leu Met Asp Glu Pro Gly Pro Leu Ile Glu Ala Ile Gln
325 330 335
Tyr
<210> 31
<211> 1428
<212> DNA
<213> Artificial sequence
<220>
<223>
<400> 31
atgagcctga ccgtgcaaaa aattaactgg gaacaggtta aggagtggga tcgtaaatat 60
ctgatgcgta cctttagcac ccagaatgaa tatcagccgg ttccgattga aagtaccgaa 120
ggcgattatc tgatcatgcc ggatggtaca cgcctgctgg atttctttaa tcagctgtat 180
tgcgtgaacc tgggtcagaa aaatcagaaa gttaacgcag ccatcaagga agcactggat 240
cgctatggct ttgtttggga tacctatgcc accgattata aagccaaagc agcaaaaatc 300
atcatcgagg atattctggg tgacgaagat tggccgggca aagtgcgttt tgtgagtacc 360
ggcagcgaag ccgtggaaac agctttaaat attgcacgcc tgtacaccaa tcgcccgctg 420
gtggtgacac gtgaacatga ttatcatggc tggaccggcg gcgcagcaac cgtgacccgt 480
ctgcgtagct atcgtagcgg tctggtgggt gaaaatagcg aaagttttag tgcccagatc 540
ccgggcagta gctataatag cgcagtgctg atggccccga gccctaacat gtttcaggat 600
agcgatggta atctgctgaa agatgaaaac ggcgaactgc tgagcgttaa atatacccgc 660
cgcatgattg aaaactacgg tccggaacag gtggcagcag ttattaccga agttagccag 720
ggtgccggta gtgctatgcc tccttatgaa tatatcccgc agattcgcaa aatgaccaaa 780
gaactgggcg tgctgtggat taatgatgaa gtgctgaccg gttttggccg caccggtaaa 840
tggtttggtt atcagcatta cggtgtgcag ccggatatta ttacaatggg taaaggtctg 900
agcagcagca gtctgccggc tggtgcagtg ttagtgagca aagaaattgc agcattcatg 960
gataagcacc gttgggaaag cgtgagtacc tatgccggtc atccggttgc aatggctgcc 1020
gtgtgtgcaa atctggaagt gatgatggaa gaaaacttcg ttgagcaggc aaaagatagt 1080
ggtgaatata tccgtagcaa gctggaactg ctgcaggaaa aacataaaag catcggtaac 1140
ttcgacggct atggcctgct gtggattgtt gatattgtta atgccaagac caagaccccg 1200
tatgttaaac tggatcgcaa ttttacccac ggtatgaatc cgaatcagat tccgacccag 1260
attattatga agaaggccct ggaaaagggc gtgctgattg gtggtgtgat gccgaatacc 1320
atgcgcattg gtgcaagcct gaatgtgagt cgcggcgata ttgataaagc aatggatgca 1380
ctggactacg ccctggatta tctggaaagt ggtgaatggc agcagagc 1428
<210> 32
<211> 476
<212> PRT
<213> Bacillus megaterium
<400> 32
Met Ser Leu Thr Val Gln Lys Ile Asn Trp Glu Gln Val Lys Glu Trp
1 5 10 15
Asp Arg Lys Tyr Leu Met Arg Thr Phe Ser Thr Gln Asn Glu Tyr Gln
20 25 30
Pro Val Pro Ile Glu Ser Thr Glu Gly Asp Tyr Leu Ile Met Pro Asp
35 40 45
Gly Thr Arg Leu Leu Asp Phe Phe Asn Gln Leu Tyr Cys Val Asn Leu
50 55 60
Gly Gln Lys Asn Gln Lys Val Asn Ala Ala Ile Lys Glu Ala Leu Asp
65 70 75 80
Arg Tyr Gly Phe Val Trp Asp Thr Tyr Ala Thr Asp Tyr Lys Ala Lys
85 90 95
Ala Ala Lys Ile Ile Ile Glu Asp Ile Leu Gly Asp Glu Asp Trp Pro
100 105 110
Gly Lys Val Arg Phe Val Ser Thr Gly Ser Glu Ala Val Glu Thr Ala
115 120 125
Leu Asn Ile Ala Arg Leu Tyr Thr Asn Arg Pro Leu Val Val Thr Arg
130 135 140
Glu His Asp Tyr His Gly Trp Thr Gly Gly Ala Ala Thr Val Thr Arg
145 150 155 160
Leu Arg Ser Tyr Arg Ser Gly Leu Val Gly Glu Asn Ser Glu Ser Phe
165 170 175
Ser Ala Gln Ile Pro Gly Ser Ser Tyr Asn Ser Ala Val Leu Met Ala
180 185 190
Pro Ser Pro Asn Met Phe Gln Asp Ser Asp Gly Asn Leu Leu Lys Asp
195 200 205
Glu Asn Gly Glu Leu Leu Ser Val Lys Tyr Thr Arg Arg Met Ile Glu
210 215 220
Asn Tyr Gly Pro Glu Gln Val Ala Ala Val Ile Thr Glu Val Ser Gln
225 230 235 240
Gly Ala Gly Ser Ala Met Pro Pro Tyr Glu Tyr Ile Pro Gln Ile Arg
245 250 255
Lys Met Thr Lys Glu Leu Gly Val Leu Trp Ile Asn Asp Glu Val Leu
260 265 270
Thr Gly Phe Gly Arg Thr Gly Lys Trp Phe Gly Tyr Gln His Tyr Gly
275 280 285
Val Gln Pro Asp Ile Ile Thr Met Gly Lys Gly Leu Ser Ser Ser Ser
290 295 300
Leu Pro Ala Gly Ala Val Leu Val Ser Lys Glu Ile Ala Ala Phe Met
305 310 315 320
Asp Lys His Arg Trp Glu Ser Val Ser Thr Tyr Ala Gly His Pro Val
325 330 335
Ala Met Ala Ala Val Cys Ala Asn Leu Glu Val Met Met Glu Glu Asn
340 345 350
Phe Val Glu Gln Ala Lys Asp Ser Gly Glu Tyr Ile Arg Ser Lys Leu
355 360 365
Glu Leu Leu Gln Glu Lys His Lys Ser Ile Gly Asn Phe Asp Gly Tyr
370 375 380
Gly Leu Leu Trp Ile Val Asp Ile Val Asn Ala Lys Thr Lys Thr Pro
385 390 395 400
Tyr Val Lys Leu Asp Arg Asn Phe Thr His Gly Met Asn Pro Asn Gln
405 410 415
Ile Pro Thr Gln Ile Ile Met Lys Lys Ala Leu Glu Lys Gly Val Leu
420 425 430
Ile Gly Gly Val Met Pro Asn Thr Met Arg Ile Gly Ala Ser Leu Asn
435 440 445
Val Ser Arg Gly Asp Ile Asp Lys Ala Met Asp Ala Leu Asp Tyr Ala
450 455 460
Leu Asp Tyr Leu Glu Ser Gly Glu Trp Gln Gln Ser
465 470 475
<210> 33<210> 33
<211> 1344<211> 1344
<212> DNA<212> DNA
<213> 人工序列<213> Artificial sequences
<220><220>
<223><223>
<400> 33<400> 33
atgaaccagc cgctgaatgt ggccccgccg gttagcagcg aactgaatct gcgtgcccat 60
tggatgccgt ttagcgcaaa tcgtaatttt cagaaagatc cgcgtattat tgttgccgca 120
gaaggtagtt ggctgaccga tgataaaggc cgcaaagtgt atgatagtct gagtggcctg 180
tggacctgcg gtgcaggcca tagccgtaaa gaaattcagg aagcagtggc acgccagctg 240
ggcaccctgg attatagccc gggttttcag tatggccatc cgctgagttt tcagctggca 300
gaaaaaattg ccggtctgct gccgggtgaa ctgaatcatg ttttctttac cggtagtggc 360
agcgaatgcg ccgataccag cattaagatg gcccgtgcat attggcgcct gaaaggtcag 420
ccgcagaaaa ccaaactgat tggccgtgca cgcggttatc atggcgtgaa tgttgccggc 480
accagcctgg gcggcattgg tggtaatcgc aaaatgtttg gtcagctgat ggatgtggat 540
catctgccgc ataccctgca gccgggcatg gcattcactc gtggtatggc acagaccggc 600
ggcgttgaac tggcaaatga actgctgaaa ctgattgaac tgcatgatgc cagtaatatt 660
gccgcagtga ttgtggaacc gatgagtggc agtgcaggtg ttctggtgcc gccggtgggt 720
tatctgcagc gtctgcgtga aatttgtgat cagcataata ttctgctgat ttttgatgaa 780
gtgatcaccg catttggccg tctgggtaca tatagcggtg ccgaatattt tggtgtgacc 840
ccggatctga tgaatgtggc aaaacaggtg accaatggtg ccgtgccgat gggcgcagtt 900
attgcaagca gcgaaatcta tgataccttt atgaatcagg ccctgccgga acatgccgtg 960
gaattttctc atggttatac ctatagtgca catccggttg cctgtgccgc cggcctggca 1020
gcactggata ttctggcccg tgataatctg gtgcagcaga gtgcagaact ggcaccgcat 1080
tttgaaaaag gtctgcatgg tctgcagggc gccaaaaatg ttattgatat tcgtaattgc 1140
ggcctggccg gcgccattca gattgcaccg cgtgatggtg acccgaccgt tcgcccgttt 1200
gaagccggca tgaaactgtg gcagcagggt ttttatgtgc gctttggcgg cgataccctg 1260
cagtttggtc cgacctttaa tgcacgcccg gaagaactgg atcgcctgtt tgatgcagtg 1320
ggtgaagcac tgaatggtat tgcc 1344
<210> 34
<211> 448
<212> PRT
<213> Pseudomonas aeruginosa (P. aeruginosa)
<400> 34
Met Asn Gln Pro Leu Asn Val Ala Pro Pro Val Ser Ser Glu Leu Asn
1 5 10 15
Leu Arg Ala His Trp Met Pro Phe Ser Ala Asn Arg Asn Phe Gln Lys
20 25 30
Asp Pro Arg Ile Ile Val Ala Ala Glu Gly Ser Trp Leu Thr Asp Asp
35 40 45
Lys Gly Arg Lys Val Tyr Asp Ser Leu Ser Gly Leu Trp Thr Cys Gly
50 55 60
Ala Gly His Ser Arg Lys Glu Ile Gln Glu Ala Val Ala Arg Gln Leu
65 70 75 80
Gly Thr Leu Asp Tyr Ser Pro Gly Phe Gln Tyr Gly His Pro Leu Ser
85 90 95
Phe Gln Leu Ala Glu Lys Ile Ala Gly Leu Leu Pro Gly Glu Leu Asn
100 105 110
His Val Phe Phe Thr Gly Ser Gly Ser Glu Cys Ala Asp Thr Ser Ile
115 120 125
Lys Met Ala Arg Ala Tyr Trp Arg Leu Lys Gly Gln Pro Gln Lys Thr
130 135 140
Lys Leu Ile Gly Arg Ala Arg Gly Tyr His Gly Val Asn Val Ala Gly
145 150 155 160
Thr Ser Leu Gly Gly Ile Gly Gly Asn Arg Lys Met Phe Gly Gln Leu
165 170 175
Met Asp Val Asp His Leu Pro His Thr Leu Gln Pro Gly Met Ala Phe
180 185 190
Thr Arg Gly Met Ala Gln Thr Gly Gly Val Glu Leu Ala Asn Glu Leu
195 200 205
Leu Lys Leu Ile Glu Leu His Asp Ala Ser Asn Ile Ala Ala Val Ile
210 215 220
Val Glu Pro Met Ser Gly Ser Ala Gly Val Leu Val Pro Pro Val Gly
225 230 235 240
Tyr Leu Gln Arg Leu Arg Glu Ile Cys Asp Gln His Asn Ile Leu Leu
245 250 255
Ile Phe Asp Glu Val Ile Thr Ala Phe Gly Arg Leu Gly Thr Tyr Ser
260 265 270
Gly Ala Glu Tyr Phe Gly Val Thr Pro Asp Leu Met Asn Val Ala Lys
275 280 285
Gln Val Thr Asn Gly Ala Val Pro Met Gly Ala Val Ile Ala Ser Ser
290 295 300
Glu Ile Tyr Asp Thr Phe Met Asn Gln Ala Leu Pro Glu His Ala Val
305 310 315 320
Glu Phe Ser His Gly Tyr Thr Tyr Ser Ala His Pro Val Ala Cys Ala
325 330 335
Ala Gly Leu Ala Ala Leu Asp Ile Leu Ala Arg Asp Asn Leu Val Gln
340 345 350
Gln Ser Ala Glu Leu Ala Pro His Phe Glu Lys Gly Leu His Gly Leu
355 360 365
Gln Gly Ala Lys Asn Val Ile Asp Ile Arg Asn Cys Gly Leu Ala Gly
370 375 380
Ala Ile Gln Ile Ala Pro Arg Asp Gly Asp Pro Thr Val Arg Pro Phe
385 390 395 400
Glu Ala Gly Met Lys Leu Trp Gln Gln Gly Phe Tyr Val Arg Phe Gly
405 410 415
Gly Asp Thr Leu Gln Phe Gly Pro Thr Phe Asn Ala Arg Pro Glu Glu
420 425 430
Leu Asp Arg Leu Phe Asp Ala Val Gly Glu Ala Leu Asn Gly Ile Ala
435 440 445
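The listing does not state how SEQ ID NO: 33 and SEQ ID NO: 34 relate, but the lengths are consistent with one encoding the other (1344 nt / 3 = 448 residues), and the first codons appear to match the first residues. The following is a minimal sketch of that check, assuming the standard genetic code; the helper name `translate` and the one-letter output are illustrative choices and are not part of the patent.

```python
from itertools import product

# Standard genetic code, codons enumerated in t, c, a, g order
# (assumption: no alternative translation table is intended).
BASES = "tcag"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str) -> str:
    """Translate a DNA string (whitespace ignored) into one-letter residues."""
    dna = "".join(dna.split()).lower()
    usable = len(dna) - len(dna) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First 60 nucleotides of SEQ ID NO: 33, copied from the listing.
first_line = (
    "atgaaccagc cgctgaatgt ggccccgccg gttagcagcg aactgaatct gcgtgcccat"
)

print(translate(first_line))  # MNQPLNVAPPVSSELNLRAH
print(1344 // 3)              # 448, the length given for SEQ ID NO: 34
```

The printed residues correspond to Met Asn Gln Pro Leu Asn Val Ala Pro Pro Val Ser Ser Glu Leu Asn Leu Arg Ala His, the first 20 residues of SEQ ID NO: 34. Running the full 1344-nt sequence through the same function would extend the comparison to all 448 positions.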
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201711181396.4A CN109825538B (en) | 2017-11-23 | 2017-11-23 | A kind of synthetic method of chiral 2-amino-1-butanol |
Publications (2)
Publication Number | Publication Date |
---|---|
CN109825538A (en) | 2019-05-31 |
CN109825538B (en) | 2022-05-24 |
Family
ID=66858565
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201711181396.4A Active CN109825538B (en) | 2017-11-23 | 2017-11-23 | A kind of synthetic method of chiral 2-amino-1-butanol |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN109825538B (en) |
Families Citing this family (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110951706B (en) * | 2019-09-16 | 2021-08-27 | Zhejiang University of Technology | Recombinant R-omega-transaminase, mutant and application in asymmetric synthesis of sitagliptin |
CN112779232B (en) * | 2019-11-05 | 2021-12-28 | Tianjin Institute of Industrial Biotechnology, Chinese Academy of Sciences | A kind of synthetic method of chiral amine alcohol compound |
JP7420944B2 (en) * | 2019-12-02 | 2024-01-23 | Jilin Asymchem Laboratories Co., Ltd. | Co-immobilized enzyme, its production method and its use |
CN112852894B (en) * | 2020-06-04 | 2021-11-09 | Tianjin Institute of Industrial Biotechnology, Chinese Academy of Sciences | Amine dehydrogenase mutant and application thereof in synthesis of chiral amine alcohol compound |
CN111996176B (en) * | 2020-10-29 | 2021-01-15 | Tianjin Institute of Industrial Biotechnology, Chinese Academy of Sciences | Carbonyl reductase mutant and application thereof |
CN114875084B (en) * | 2021-02-05 | 2023-10-20 | Shanghai Jiao Tong University | Method for synthesizing (1R, 2R) -AMPP by utilizing enzyme cascade reaction |
CN112941115B (en) * | 2021-03-30 | 2024-08-16 | Jiangsu Alpha Group Shengji Pharmaceutical (Suqian) Co., Ltd. | Chiral preparation method of ticagrelor intermediate |
CN116875577B (en) * | 2023-06-08 | 2024-05-14 | Hebei University of Technology | Amine dehydrogenase mutant, nucleic acid, expression vector, preparation method and application |
Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102086462A (en) * | 2009-12-02 | 2011-06-08 | Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences | Method for preparing chiral monomer mandelic acid |
CN102140431A (en) * | 2010-12-21 | 2011-08-03 | Dacheng Biochemical Technology (Songyuan) Co., Ltd. | L-tryptophan gene engineering bacterium, method for constructing same and method for fermenting and producing L-tryptophan by using same |
CN102191227A (en) * | 2010-03-04 | 2011-09-21 | Shanghai Research and Development Center of Industrial Biotechnology | A kind of dihydrodipicolinate synthase |
CN102308001A (en) * | 2009-02-04 | 2012-01-04 | Evonik Degussa GmbH | Method for producing multicyclical ring systems carrying amino groups |
CN106497996A (en) * | 2016-10-11 | 2017-03-15 | Asymchem Laboratories (Tianjin) Co., Ltd. | Enzyme-catalyzed preparation method of chiral alcohol |
CN106574281A (en) * | 2014-07-03 | 2017-04-19 | BASF SE | Redox self-sufficient biocatalytic amination of alcohols |
Non-Patent Citations (2)
Title |
---|
Development of an amine dehydrogenase for synthesis of chiral amines; Michael J Abrahamson et al.; Angew Chem Int Ed Engl.; 20120306; Vol. 51 (No. 16); full text * |
Recent advances in directed evolution; Qu Ge et al.; Chinese Journal of Biotechnology; 20171010; Vol. 34 (No. 1); full text * |
Also Published As
Publication number | Publication date |
---|---|
CN109825538A (en) | 2019-05-31 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN109825538B (en) | A kind of synthetic method of chiral 2-amino-1-butanol | |
US20140134689A1 (en) | Host Cells and Methods for Oxidizing Aromatic Amino Acids | |
CN110551771B (en) | Synthesis method of chiral 3-amino-1-butanol | |
CN113774036B (en) | Imine reductase mutant and application thereof | |
CN109055324B (en) | Improved ketoreductase and application thereof | |
US20090203096A1 (en) | Process for Production of Optically Active Alcohol | |
SG192706A1 (en) | Cells and methods for producing isobutyric acid | |
CN112852895B (en) | Method for synthesizing (R) -3-amino-1-butanol by double-enzyme cascade catalysis | |
CN112143764B (en) | A kind of method for preparing brivaracetam intermediate compound catalyzed by biological enzyme | |
CN108728501A (en) | Enzymatic method for producing 2-hydroxy-4-methylthiobutyric acid (MHA) | |
CN112779232B (en) | A kind of synthetic method of chiral amine alcohol compound | |
CN113355299B (en) | Ketoacid reductase, genes, engineered bacteria and their application in the synthesis of chiral aromatic 2-hydroxy acids | |
CN113444702A (en) | Enone reductase mutant and application thereof | |
CN113293151A (en) | Short-chain dehydrogenase mutants and uses thereof | |
CN107937364A (en) | A kidney bean epoxide hydrolase mutant with improved enantioselectivity | |
CN114908129B (en) | Dehydrogenase for the preparation of (R) -4-chloro-3-hydroxybutyric acid ethyl ester | |
CN114540318B (en) | Enzyme with glycolaldehyde synthesis catalyzing function and application thereof | |
JP2005198655A (en) | Cephalosporin C acylase | |
CN110358751A (en) | A kind of recombinant lipase mutant, encoding gene, recombination engineering and application | |
CN112921021B (en) | An aldolase mutant and its application in the production of 1,3-propanediol | |
CN110607335A (en) | A kind of nicotinamide adenine dinucleotide compound biosynthesis method | |
CN111254170B (en) | Method for preparing (S) -1,2,3, 4-tetrahydroisoquinoline-3-formic acid by multienzyme coupling | |
CN115873816A (en) | Transaminase mutant for synthesizing sitagliptin | |
CN110835639B (en) | Method for preparing (S) -1,2,3, 4-tetrahydroisoquinoline-1-formic acid and derivatives thereof | |
JP2008212144A (en) | Alcohol dehydrogenase, gene encoding the same, and method for producing optically active (R) -3-quinuclidinol using the same |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||