CN101842479A - Signal sequences and co-expressed molecular chaperones for improving protein production in host cells - Google Patents
Signal sequences and co-expressed molecular chaperones for improving protein production in host cells
- Publication number
- CN101842479A CN200880114429A
- Authority
- CN
- China
- Prior art keywords
- sequence
- protein
- laccase
- expression
- trichoderma
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
- 108010076504 Protein Sorting Signals Proteins 0.000 title claims abstract description 117
- 108010006519 Molecular Chaperones Proteins 0.000 title claims abstract description 88
- 102000005431 Molecular Chaperones Human genes 0.000 title claims description 56
- 230000014616 translation Effects 0.000 title abstract description 9
- 108090000623 proteins and genes Proteins 0.000 claims abstract description 293
- 102000004169 proteins and genes Human genes 0.000 claims abstract description 242
- 238000000034 method Methods 0.000 claims abstract description 79
- 102000004190 Enzymes Human genes 0.000 claims abstract description 56
- 108090000790 Enzymes Proteins 0.000 claims abstract description 56
- -1 prp1 Proteins 0.000 claims abstract description 38
- 101150047030 ERO1 gene Proteins 0.000 claims abstract description 16
- 101100407358 Aspergillus niger pdiA gene Proteins 0.000 claims abstract description 10
- 101150031304 ppi1 gene Proteins 0.000 claims abstract description 6
- 108010056891 Calnexin Proteins 0.000 claims abstract description 4
- 101100046193 Desulfitobacterium hafniense (strain Y51) tig1 gene Proteins 0.000 claims abstract description 4
- 101100084053 Neurospora crassa (strain ATCC 24698 / 74-OR23-1A / CBS 708.71 / DSM 1257 / FGSC 987) ppi-2 gene Proteins 0.000 claims abstract description 4
- 101100222355 Schizosaccharomyces pombe (strain 972 / ATCC 24843) cwf2 gene Proteins 0.000 claims abstract description 3
- 102000034342 Calnexin Human genes 0.000 claims abstract 2
- 108010029541 Laccase Proteins 0.000 claims description 103
- 241000223259 Trichoderma Species 0.000 claims description 67
- 101150052795 cbh-1 gene Proteins 0.000 claims description 61
- 241000235349 Ascomycota Species 0.000 claims description 37
- 230000003197 catalytic effect Effects 0.000 claims description 31
- 108010073178 Glucan 1,4-alpha-Glucosidase Proteins 0.000 claims description 29
- 230000004927 fusion Effects 0.000 claims description 29
- 102100022624 Glucoamylase Human genes 0.000 claims description 26
- 241000233866 Fungi Species 0.000 claims description 23
- 239000002773 nucleotide Substances 0.000 claims description 16
- 125000003729 nucleotide group Chemical group 0.000 claims description 16
- 108010008885 Cellulose 1,4-beta-Cellobiosidase Proteins 0.000 claims description 15
- 230000008569 process Effects 0.000 claims description 11
- 241000228212 Aspergillus Species 0.000 claims description 10
- 230000003248 secreting effect Effects 0.000 claims description 9
- 241000894006 Bacteria Species 0.000 claims description 8
- 240000004808 Saccharomyces cerevisiae Species 0.000 claims description 8
- 102100021971 Bcl-2-interacting killer Human genes 0.000 claims description 6
- 101000970576 Homo sapiens Bcl-2-interacting killer Proteins 0.000 claims description 6
- 241000221960 Neurospora Species 0.000 claims description 6
- 241000293772 Cerrena Species 0.000 claims description 4
- 241000222511 Coprinus Species 0.000 claims description 4
- 241000222354 Trametes Species 0.000 claims description 4
- 241000222356 Coriolus Species 0.000 claims description 3
- 241000222418 Lentinus Species 0.000 claims description 3
- 241000226677 Myceliophthora Species 0.000 claims description 3
- 241000222395 Phlebia Species 0.000 claims description 3
- 241000222350 Pleurotus Species 0.000 claims description 3
- 241000222640 Polyporus Species 0.000 claims description 3
- 241001361634 Rhizoctonia Species 0.000 claims description 3
- 241001279361 Stachybotrys Species 0.000 claims description 3
- 238000011144 upstream manufacturing Methods 0.000 claims description 3
- 241000222680 Collybia Species 0.000 claims description 2
- 241000123326 Fomes Species 0.000 claims description 2
- 241000127897 Ganoderma tsunodae Species 0.000 claims description 2
- 241000146398 Gelatoporia subvermispora Species 0.000 claims description 2
- 241001536563 Panus Species 0.000 claims description 2
- 241000221945 Podospora Species 0.000 claims description 2
- 241000222361 Spongipellis Species 0.000 claims description 2
- 241001494489 Thielavia Species 0.000 claims description 2
- 239000010985 leather Substances 0.000 claims description 2
- 210000003462 vein Anatomy 0.000 claims description 2
- 241000589516 Pseudomonas Species 0.000 claims 3
- 235000001674 Agaricus brunnescens Nutrition 0.000 claims 1
- 241001465180 Botrytis Species 0.000 claims 1
- 241000222336 Ganoderma Species 0.000 claims 1
- 241001149422 Ganoderma applanatum Species 0.000 claims 1
- 235000009754 Vitis X bourquina Nutrition 0.000 claims 1
- 235000012333 Vitis X labruscana Nutrition 0.000 claims 1
- 240000006365 Vitis vinifera Species 0.000 claims 1
- 235000014787 Vitis vinifera Nutrition 0.000 claims 1
- 210000001367 artery Anatomy 0.000 claims 1
- 150000007523 nucleic acids Chemical group 0.000 abstract description 40
- 108091028043 Nucleic acid sequence Proteins 0.000 abstract description 36
- 108091005749 foldases Proteins 0.000 abstract description 28
- 102000035175 foldases Human genes 0.000 abstract description 28
- 239000000203 mixture Substances 0.000 abstract description 14
- 230000001976 improved effect Effects 0.000 abstract description 3
- 235000018102 proteins Nutrition 0.000 description 222
- 210000004027 cell Anatomy 0.000 description 124
- 230000014509 gene expression Effects 0.000 description 84
- 241000499912 Trichoderma reesei Species 0.000 description 81
- 108020004414 DNA Proteins 0.000 description 50
- 229940088598 enzyme Drugs 0.000 description 46
- 239000013612 plasmid Substances 0.000 description 46
- 108090000765 processed proteins & peptides Proteins 0.000 description 44
- 102000004196 processed proteins & peptides Human genes 0.000 description 42
- 229920001184 polypeptide Polymers 0.000 description 40
- 102000040430 polynucleotide Human genes 0.000 description 37
- 108091033319 polynucleotide Proteins 0.000 description 37
- 239000002157 polynucleotide Substances 0.000 description 37
- 230000028327 secretion Effects 0.000 description 29
- 238000004519 manufacturing process Methods 0.000 description 27
- 235000001014 amino acid Nutrition 0.000 description 25
- 239000013604 expression vector Substances 0.000 description 24
- 230000002538 fungal effect Effects 0.000 description 22
- 108010050848 glycylleucine Proteins 0.000 description 22
- 239000013598 vector Substances 0.000 description 22
- KDXKERNSBIXSRK-UHFFFAOYSA-N Lysine Natural products NCCCCC(N)C(O)=O KDXKERNSBIXSRK-UHFFFAOYSA-N 0.000 description 20
- 241000221198 Basidiomycota Species 0.000 description 19
- WHUUTDBJXJRKMK-UHFFFAOYSA-N Glutamic acid Natural products OC(=O)C(N)CCC(O)=O WHUUTDBJXJRKMK-UHFFFAOYSA-N 0.000 description 19
- 150000001413 amino acids Chemical class 0.000 description 19
- 229940024606 amino acid Drugs 0.000 description 18
- 108091005804 Peptidases Proteins 0.000 description 17
- 239000004365 Protease Substances 0.000 description 17
- 125000003275 alpha amino acid group Chemical group 0.000 description 17
- 239000013613 expression plasmid Substances 0.000 description 17
- 102000039446 nucleic acids Human genes 0.000 description 17
- 108020004707 nucleic acids Proteins 0.000 description 17
- 108010092854 aspartyllysine Proteins 0.000 description 16
- 108010054155 lysyllysine Proteins 0.000 description 15
- 238000006467 substitution reaction Methods 0.000 description 15
- 108010034529 leucyl-lysine Proteins 0.000 description 14
- 102100037486 Reverse transcriptase/ribonuclease H Human genes 0.000 description 13
- 230000000694 effects Effects 0.000 description 13
- 230000001965 increasing effect Effects 0.000 description 13
- 108010005233 alanylglutamic acid Proteins 0.000 description 12
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 12
- 230000004186 co-expression Effects 0.000 description 12
- FFEARJCKVFRZRR-SCSAIBSYSA-N D-methionine Chemical compound CSCC[C@@H](N)C(O)=O FFEARJCKVFRZRR-SCSAIBSYSA-N 0.000 description 11
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 11
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 11
- 108090000637 alpha-Amylases Proteins 0.000 description 11
- 108010038633 aspartylglutamate Proteins 0.000 description 11
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 11
- 241000894007 species Species 0.000 description 11
- 108010073969 valyllysine Proteins 0.000 description 11
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 10
- OHDRQQURAXLVGJ-HLVWOLMTSA-N azane;(2e)-3-ethyl-2-[(e)-(3-ethyl-6-sulfo-1,3-benzothiazol-2-ylidene)hydrazinylidene]-1,3-benzothiazole-6-sulfonic acid Chemical compound [NH4+].[NH4+].S/1C2=CC(S([O-])(=O)=O)=CC=C2N(CC)C\1=N/N=C1/SC2=CC(S([O-])(=O)=O)=CC=C2N1CC OHDRQQURAXLVGJ-HLVWOLMTSA-N 0.000 description 10
- 108010015792 glycyllysine Proteins 0.000 description 10
- 108010047495 alanylglycine Proteins 0.000 description 9
- 102000004139 alpha-Amylases Human genes 0.000 description 9
- 108010069205 aspartyl-phenylalanine Proteins 0.000 description 9
- 108010047857 aspartylglycine Proteins 0.000 description 9
- 230000001580 bacterial effect Effects 0.000 description 9
- 238000000855 fermentation Methods 0.000 description 9
- 230000004151 fermentation Effects 0.000 description 9
- 230000006870 function Effects 0.000 description 9
- 108010063718 gamma-glutamylaspartic acid Proteins 0.000 description 9
- 108010049041 glutamylalanine Proteins 0.000 description 9
- 239000003550 marker Substances 0.000 description 9
- 239000002609 medium Substances 0.000 description 9
- 108010051242 phenylalanylserine Proteins 0.000 description 9
- PMGDADKJMCOXHX-UHFFFAOYSA-N L-Arginyl-L-glutamin-acetat Natural products NC(=N)NCCCC(N)C(=O)NC(CCC(N)=O)C(O)=O PMGDADKJMCOXHX-UHFFFAOYSA-N 0.000 description 8
- 241000880493 Leptailurus serval Species 0.000 description 8
- 229940024171 alpha-amylase Drugs 0.000 description 8
- 238000004458 analytical method Methods 0.000 description 8
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 8
- 238000010276 construction Methods 0.000 description 8
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 8
- 108010037850 glycylvaline Proteins 0.000 description 8
- 230000006872 improvement Effects 0.000 description 8
- 108010064235 lysylglycine Proteins 0.000 description 8
- 238000000746 purification Methods 0.000 description 8
- 108010079364 N-glycylalanine Proteins 0.000 description 7
- 235000014680 Saccharomyces cerevisiae Nutrition 0.000 description 7
- 108010044940 alanylglutamine Proteins 0.000 description 7
- 108010087924 alanylproline Proteins 0.000 description 7
- 125000000539 amino acid group Chemical group 0.000 description 7
- 108010077245 asparaginyl-proline Proteins 0.000 description 7
- 238000010586 diagram Methods 0.000 description 7
- 238000004520 electroporation Methods 0.000 description 7
- 210000002472 endoplasmic reticulum Anatomy 0.000 description 7
- 108010089804 glycyl-threonine Proteins 0.000 description 7
- 108010003700 lysyl aspartic acid Proteins 0.000 description 7
- 108010070643 prolylglutamic acid Proteins 0.000 description 7
- 239000006228 supernatant Substances 0.000 description 7
- 108010061238 threonyl-glycine Proteins 0.000 description 7
- 230000009466 transformation Effects 0.000 description 7
- 241000588724 Escherichia coli Species 0.000 description 6
- 108010065920 Insulin Lispro Proteins 0.000 description 6
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 6
- 108010076324 alanyl-glycyl-glycine Proteins 0.000 description 6
- 101150069003 amdS gene Proteins 0.000 description 6
- 108010008355 arginyl-glutamine Proteins 0.000 description 6
- 108010093581 aspartyl-proline Proteins 0.000 description 6
- 239000012634 fragment Substances 0.000 description 6
- 108010077515 glycylproline Proteins 0.000 description 6
- 230000010354 integration Effects 0.000 description 6
- 230000012846 protein folding Effects 0.000 description 6
- 230000001105 regulatory effect Effects 0.000 description 6
- 239000000523 sample Substances 0.000 description 6
- MDNAVFBZPROEHO-UHFFFAOYSA-N Ala-Lys-Val Natural products CC(C)C(C(O)=O)NC(=O)C(NC(=O)C(C)N)CCCCN MDNAVFBZPROEHO-UHFFFAOYSA-N 0.000 description 5
- 108010059892 Cellulase Proteins 0.000 description 5
- AGPKZVBTJJNPAG-RFZPGFLSSA-N D-Isoleucine Chemical compound CC[C@@H](C)[C@@H](N)C(O)=O AGPKZVBTJJNPAG-RFZPGFLSSA-N 0.000 description 5
- WHUUTDBJXJRKMK-GSVOUGTGSA-N D-glutamic acid Chemical compound OC(=O)[C@H](N)CCC(O)=O WHUUTDBJXJRKMK-GSVOUGTGSA-N 0.000 description 5
- ROHFNLRQFUQHCH-RXMQYKEDSA-N D-leucine Chemical compound CC(C)C[C@@H](N)C(O)=O ROHFNLRQFUQHCH-RXMQYKEDSA-N 0.000 description 5
- KZSNJWFQEVHDMF-SCSAIBSYSA-N D-valine Chemical compound CC(C)[C@@H](N)C(O)=O KZSNJWFQEVHDMF-SCSAIBSYSA-N 0.000 description 5
- 108050001049 Extracellular proteins Proteins 0.000 description 5
- OQXDUSZKISQQSS-GUBZILKMSA-N Glu-Lys-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O OQXDUSZKISQQSS-GUBZILKMSA-N 0.000 description 5
- KKBWDNZXYLGJEY-UHFFFAOYSA-N Gly-Arg-Pro Natural products NCC(=O)NC(CCNC(=N)N)C(=O)N1CCCC1C(=O)O KKBWDNZXYLGJEY-UHFFFAOYSA-N 0.000 description 5
- YYPFZVIXAVDHIK-IUCAKERBSA-N Gly-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)CN YYPFZVIXAVDHIK-IUCAKERBSA-N 0.000 description 5
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 5
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 5
- 108090000854 Oxidoreductases Proteins 0.000 description 5
- 102000004316 Oxidoreductases Human genes 0.000 description 5
- 229920002472 Starch Polymers 0.000 description 5
- ZSPQUTWLWGWTPS-HJGDQZAQSA-N Thr-Lys-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O ZSPQUTWLWGWTPS-HJGDQZAQSA-N 0.000 description 5
- 108010024078 alanyl-glycyl-serine Proteins 0.000 description 5
- 238000003556 assay Methods 0.000 description 5
- 229940106157 cellulase Drugs 0.000 description 5
- 238000012258 culturing Methods 0.000 description 5
- 239000001963 growth medium Substances 0.000 description 5
- 108010041601 histidyl-aspartyl-glutamyl-leucine Proteins 0.000 description 5
- 230000001939 inductive effect Effects 0.000 description 5
- 108010057821 leucylproline Proteins 0.000 description 5
- 108010025153 lysyl-alanyl-alanine Proteins 0.000 description 5
- 108010009298 lysylglutamic acid Proteins 0.000 description 5
- 108010012581 phenylalanylglutamate Proteins 0.000 description 5
- 108010029020 prolylglycine Proteins 0.000 description 5
- 108010026333 seryl-proline Proteins 0.000 description 5
- 239000008107 starch Substances 0.000 description 5
- 235000019698 starch Nutrition 0.000 description 5
- 108010003137 tyrosyltyrosine Proteins 0.000 description 5
- 238000010269 ABTS assay Methods 0.000 description 4
- DLFVBJFMPXGRIB-UHFFFAOYSA-N Acetamide Chemical compound CC(N)=O DLFVBJFMPXGRIB-UHFFFAOYSA-N 0.000 description 4
- YYSWCHMLFJLLBJ-ZLUOBGJFSA-N Ala-Ala-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YYSWCHMLFJLLBJ-ZLUOBGJFSA-N 0.000 description 4
- WKOBSJOZRJJVRZ-FXQIFTODSA-N Ala-Glu-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WKOBSJOZRJJVRZ-FXQIFTODSA-N 0.000 description 4
- AWZKCUCQJNTBAD-SRVKXCTJSA-N Ala-Leu-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCCN AWZKCUCQJNTBAD-SRVKXCTJSA-N 0.000 description 4
- YRTOMUMWSTUQAX-FXQIFTODSA-N Asn-Pro-Asp Chemical compound NC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O YRTOMUMWSTUQAX-FXQIFTODSA-N 0.000 description 4
- JUWISGAGWSDGDH-KKUMJFAQSA-N Asp-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(O)=O)CC1=CC=CC=C1 JUWISGAGWSDGDH-KKUMJFAQSA-N 0.000 description 4
- 241000351920 Aspergillus nidulans Species 0.000 description 4
- 241000228245 Aspergillus niger Species 0.000 description 4
- 108010084185 Cellulases Proteins 0.000 description 4
- 102000005575 Cellulases Human genes 0.000 description 4
- 101150097493 D gene Proteins 0.000 description 4
- DCXYFEDJOCDNAF-UWTATZPHSA-N D-Asparagine Chemical compound OC(=O)[C@H](N)CC(N)=O DCXYFEDJOCDNAF-UWTATZPHSA-N 0.000 description 4
- FBPFZTCFMRRESA-FSIIMWSLSA-N D-Glucitol Natural products OC[C@H](O)[C@H](O)[C@@H](O)[C@H](O)CO FBPFZTCFMRRESA-FSIIMWSLSA-N 0.000 description 4
- CKLJMWTZIZZHCS-UHFFFAOYSA-N D-OH-Asp Natural products OC(=O)C(N)CC(O)=O CKLJMWTZIZZHCS-UHFFFAOYSA-N 0.000 description 4
- CKLJMWTZIZZHCS-UWTATZPHSA-N D-aspartic acid Chemical compound OC(=O)[C@H](N)CC(O)=O CKLJMWTZIZZHCS-UWTATZPHSA-N 0.000 description 4
- ZDXPYRJPNDTMRX-GSVOUGTGSA-N D-glutamine Chemical compound OC(=O)[C@H](N)CCC(N)=O ZDXPYRJPNDTMRX-GSVOUGTGSA-N 0.000 description 4
- AYFVYJQAPQTCCC-STHAYSLISA-N D-threonine Chemical compound C[C@H](O)[C@@H](N)C(O)=O AYFVYJQAPQTCCC-STHAYSLISA-N 0.000 description 4
- CWYNVVGOOAEACU-UHFFFAOYSA-N Fe2+ Chemical compound [Fe+2] CWYNVVGOOAEACU-UHFFFAOYSA-N 0.000 description 4
- RUFHOVYUYSNDNY-ACZMJKKPSA-N Glu-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O RUFHOVYUYSNDNY-ACZMJKKPSA-N 0.000 description 4
- CGOHAEBMDSEKFB-FXQIFTODSA-N Glu-Glu-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O CGOHAEBMDSEKFB-FXQIFTODSA-N 0.000 description 4
- LGYZYFFDELZWRS-DCAQKATOSA-N Glu-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O LGYZYFFDELZWRS-DCAQKATOSA-N 0.000 description 4
- AIGROOHQXCACHL-WDSKDSINSA-N Glu-Gly-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](C)C(O)=O AIGROOHQXCACHL-WDSKDSINSA-N 0.000 description 4
- WGYHAAXZWPEBDQ-IFFSRLJSSA-N Glu-Val-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WGYHAAXZWPEBDQ-IFFSRLJSSA-N 0.000 description 4
- KQDMENMTYNBWMR-WHFBIAKZSA-N Gly-Asp-Ala Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O KQDMENMTYNBWMR-WHFBIAKZSA-N 0.000 description 4
- WUKLZPHVWAMZQV-UKJIMTQDSA-N Ile-Glu-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](C(C)C)C(=O)O)N WUKLZPHVWAMZQV-UKJIMTQDSA-N 0.000 description 4
- DCXYFEDJOCDNAF-REOHCLBHSA-N L-asparagine Chemical compound OC(=O)[C@@H](N)CC(N)=O DCXYFEDJOCDNAF-REOHCLBHSA-N 0.000 description 4
- AYFVYJQAPQTCCC-GBXIJSLDSA-N L-threonine Chemical compound C[C@@H](O)[C@H](N)C(O)=O AYFVYJQAPQTCCC-GBXIJSLDSA-N 0.000 description 4
- CQQGCWPXDHTTNF-GUBZILKMSA-N Leu-Ala-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O CQQGCWPXDHTTNF-GUBZILKMSA-N 0.000 description 4
- UWKNTTJNVSYXPC-CIUDSAMLSA-N Lys-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN UWKNTTJNVSYXPC-CIUDSAMLSA-N 0.000 description 4
- BQVUABVGYYSDCJ-UHFFFAOYSA-N Nalpha-L-Leucyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)CC(C)C)C(O)=O)=CNC2=C1 BQVUABVGYYSDCJ-UHFFFAOYSA-N 0.000 description 4
- 108700026244 Open Reading Frames Proteins 0.000 description 4
- 102000035195 Peptidases Human genes 0.000 description 4
- JPIDMRXXNMIVKY-VZFHVOOUSA-N Ser-Ala-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JPIDMRXXNMIVKY-VZFHVOOUSA-N 0.000 description 4
- MUJQWSAWLLRJCE-KATARQTJSA-N Ser-Leu-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MUJQWSAWLLRJCE-KATARQTJSA-N 0.000 description 4
- DWYAUVCQDTZIJI-VZFHVOOUSA-N Thr-Ala-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O DWYAUVCQDTZIJI-VZFHVOOUSA-N 0.000 description 4
- MECLEFZMPPOEAC-VOAKCMCISA-N Thr-Leu-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N)O MECLEFZMPPOEAC-VOAKCMCISA-N 0.000 description 4
- YFOCMOVJBQDBCE-NRPADANISA-N Val-Ala-Glu Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N YFOCMOVJBQDBCE-NRPADANISA-N 0.000 description 4
- 108010062796 arginyllysine Proteins 0.000 description 4
- 108010060035 arginylproline Proteins 0.000 description 4
- 101150114858 cbh2 gene Proteins 0.000 description 4
- 238000005119 centrifugation Methods 0.000 description 4
- 230000008859 change Effects 0.000 description 4
- 238000006243 chemical reaction Methods 0.000 description 4
- 238000012217 deletion Methods 0.000 description 4
- 230000037430 deletion Effects 0.000 description 4
- 101150066032 egl-1 gene Proteins 0.000 description 4
- 101150003727 egl2 gene Proteins 0.000 description 4
- 230000002068 genetic effect Effects 0.000 description 4
- 108010079547 glutamylmethionine Proteins 0.000 description 4
- 108010002430 hemicellulase Proteins 0.000 description 4
- 108010085325 histidylproline Proteins 0.000 description 4
- 108010031424 isoleucyl-prolyl-proline Proteins 0.000 description 4
- 108010090333 leucyl-lysyl-proline Proteins 0.000 description 4
- 108010005942 methionylglycine Proteins 0.000 description 4
- 230000000813 microbial effect Effects 0.000 description 4
- 244000005700 microbiome Species 0.000 description 4
- 230000037361 pathway Effects 0.000 description 4
- 108010024654 phenylalanyl-prolyl-alanine Proteins 0.000 description 4
- 108010018625 phenylalanylarginine Proteins 0.000 description 4
- 108010048397 seryl-lysyl-leucine Proteins 0.000 description 4
- 239000007787 solid Substances 0.000 description 4
- 239000000600 sorbitol Substances 0.000 description 4
- 108010072986 threonyl-seryl-lysine Proteins 0.000 description 4
- 238000013518 transcription Methods 0.000 description 4
- 230000035897 transcription Effects 0.000 description 4
- 108010051110 tyrosyl-lysine Proteins 0.000 description 4
- 108010020532 tyrosyl-proline Proteins 0.000 description 4
- 108010015385 valyl-prolyl-proline Proteins 0.000 description 4
- INOZZBHURUDQQR-AJNGGQMLSA-N (2s)-2-[[(2s)-2-[[(2s)-2-[[(2s)-2-amino-3-(1h-imidazol-5-yl)propanoyl]amino]-3-carboxypropanoyl]amino]-4-carboxybutanoyl]amino]-4-methylpentanoic acid Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CN=CN1 INOZZBHURUDQQR-AJNGGQMLSA-N 0.000 description 3
- UGLPMYSCWHTZQU-AUTRQRHGSA-N Ala-Ala-Tyr Chemical compound C[C@H]([NH3+])C(=O)N[C@@H](C)C(=O)N[C@H](C([O-])=O)CC1=CC=C(O)C=C1 UGLPMYSCWHTZQU-AUTRQRHGSA-N 0.000 description 3
- WDIYWDJLXOCGRW-ACZMJKKPSA-N Ala-Asp-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WDIYWDJLXOCGRW-ACZMJKKPSA-N 0.000 description 3
- ZIWWTZWAKYBUOB-CIUDSAMLSA-N Ala-Asp-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O ZIWWTZWAKYBUOB-CIUDSAMLSA-N 0.000 description 3
- MPLOSMWGDNJSEV-WHFBIAKZSA-N Ala-Gly-Asp Chemical compound [H]N[C@@H](C)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O MPLOSMWGDNJSEV-WHFBIAKZSA-N 0.000 description 3
- OBVSBEYOMDWLRJ-BFHQHQDPSA-N Ala-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N OBVSBEYOMDWLRJ-BFHQHQDPSA-N 0.000 description 3
- VCSABYLVNWQYQE-UHFFFAOYSA-N Ala-Lys-Lys Natural products NCCCCC(NC(=O)C(N)C)C(=O)NC(CCCCN)C(O)=O VCSABYLVNWQYQE-UHFFFAOYSA-N 0.000 description 3
- 108010011667 Ala-Phe-Ala Proteins 0.000 description 3
- WEZNQZHACPSMEF-QEJZJMRPSA-N Ala-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 WEZNQZHACPSMEF-QEJZJMRPSA-N 0.000 description 3
- CYBJZLQSUJEMAS-LFSVMHDDSA-N Ala-Phe-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](C)N)O CYBJZLQSUJEMAS-LFSVMHDDSA-N 0.000 description 3
- RMAWDDRDTRSZIR-ZLUOBGJFSA-N Ala-Ser-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O RMAWDDRDTRSZIR-ZLUOBGJFSA-N 0.000 description 3
- PEEYDECOOVQKRZ-DLOVCJGASA-N Ala-Ser-Phe Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PEEYDECOOVQKRZ-DLOVCJGASA-N 0.000 description 3
- QOIGKCBMXUCDQU-KDXUFGMBSA-N Ala-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C)N)O QOIGKCBMXUCDQU-KDXUFGMBSA-N 0.000 description 3
- KUFVXLQLDHJVOG-SHGPDSBTSA-N Ala-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C)N)O KUFVXLQLDHJVOG-SHGPDSBTSA-N 0.000 description 3
- 101100389688 Arabidopsis thaliana AERO1 gene Proteins 0.000 description 3
- GFFRWIJAFFMQGM-NUMRIWBASA-N Asn-Glu-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GFFRWIJAFFMQGM-NUMRIWBASA-N 0.000 description 3
- MKJBPDLENBUHQU-CIUDSAMLSA-N Asn-Ser-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O MKJBPDLENBUHQU-CIUDSAMLSA-N 0.000 description 3
- VAWNQIGQPUOPQW-ACZMJKKPSA-N Asp-Glu-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O VAWNQIGQPUOPQW-ACZMJKKPSA-N 0.000 description 3
- XAJRHVUUVUPFQL-ACZMJKKPSA-N Asp-Glu-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O XAJRHVUUVUPFQL-ACZMJKKPSA-N 0.000 description 3
- 241001513093 Aspergillus awamori Species 0.000 description 3
- 240000006439 Aspergillus oryzae Species 0.000 description 3
- 101100026178 Caenorhabditis elegans egl-3 gene Proteins 0.000 description 3
- XUJNEKJLAYXESH-UWTATZPHSA-N D-Cysteine Chemical compound SC[C@@H](N)C(O)=O XUJNEKJLAYXESH-UWTATZPHSA-N 0.000 description 3
- 101000609814 Dictyostelium discoideum Protein disulfide-isomerase 1 Proteins 0.000 description 3
- 101710121765 Endo-1,4-beta-xylanase Proteins 0.000 description 3
- 108010058643 Fungal Proteins Proteins 0.000 description 3
- 241000223218 Fusarium Species 0.000 description 3
- ININBLZFFVOQIO-JHEQGTHGSA-N Gln-Thr-Gly Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCC(=O)N)N)O ININBLZFFVOQIO-JHEQGTHGSA-N 0.000 description 3
- VEYGCDYMOXHJLS-GVXVVHGQSA-N Gln-Val-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O VEYGCDYMOXHJLS-GVXVVHGQSA-N 0.000 description 3
- PBFGQTGPSKWHJA-QEJZJMRPSA-N Glu-Asp-Trp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O PBFGQTGPSKWHJA-QEJZJMRPSA-N 0.000 description 3
- WATXSTJXNBOHKD-LAEOZQHASA-N Glu-Asp-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O WATXSTJXNBOHKD-LAEOZQHASA-N 0.000 description 3
- MUSGDMDGNGXULI-DCAQKATOSA-N Glu-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O MUSGDMDGNGXULI-DCAQKATOSA-N 0.000 description 3
- HVYWQYLBVXMXSV-GUBZILKMSA-N Glu-Leu-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O HVYWQYLBVXMXSV-GUBZILKMSA-N 0.000 description 3
- HZISRJBYZAODRV-XQXXSGGOSA-N Glu-Thr-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O HZISRJBYZAODRV-XQXXSGGOSA-N 0.000 description 3
- WQZGKKKJIJFFOK-GASJEMHNSA-N Glucose Natural products OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-GASJEMHNSA-N 0.000 description 3
- UGVQELHRNUDMAA-BYPYZUCNSA-N Gly-Ala-Gly Chemical compound [NH3+]CC(=O)N[C@@H](C)C(=O)NCC([O-])=O UGVQELHRNUDMAA-BYPYZUCNSA-N 0.000 description 3
- GWCRIHNSVMOBEQ-BQBZGAKWSA-N Gly-Arg-Ser Chemical compound [H]NCC(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O GWCRIHNSVMOBEQ-BQBZGAKWSA-N 0.000 description 3
- ZQIMMEYPEXIYBB-IUCAKERBSA-N Gly-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)CN ZQIMMEYPEXIYBB-IUCAKERBSA-N 0.000 description 3
- ADZGCWWDPFDHCY-ZETCQYMHSA-N Gly-His-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CC1=CN=CN1 ADZGCWWDPFDHCY-ZETCQYMHSA-N 0.000 description 3
- PTIIBFKSLCYQBO-NHCYSSNCSA-N Gly-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)CN PTIIBFKSLCYQBO-NHCYSSNCSA-N 0.000 description 3
- MKIAPEZXQDILRR-YUMQZZPRSA-N Gly-Ser-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)CN MKIAPEZXQDILRR-YUMQZZPRSA-N 0.000 description 3
- LCRDMSSAKLTKBU-ZDLURKLDSA-N Gly-Ser-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)CN LCRDMSSAKLTKBU-ZDLURKLDSA-N 0.000 description 3
- JQFILXICXLDTRR-FBCQKBJTSA-N Gly-Thr-Gly Chemical compound NCC(=O)N[C@@H]([C@H](O)C)C(=O)NCC(O)=O JQFILXICXLDTRR-FBCQKBJTSA-N 0.000 description 3
- FNXSYBOHALPRHV-ONGXEEELSA-N Gly-Val-Lys Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN FNXSYBOHALPRHV-ONGXEEELSA-N 0.000 description 3
- DHMQDGOQFOQNFH-UHFFFAOYSA-N Glycine Chemical compound NCC(O)=O DHMQDGOQFOQNFH-UHFFFAOYSA-N 0.000 description 3
- 101001114059 Homo sapiens Protein-arginine deiminase type-1 Proteins 0.000 description 3
- 102000004157 Hydrolases Human genes 0.000 description 3
- 108090000604 Hydrolases Proteins 0.000 description 3
- 101100506045 Hypocrea jecorina egl5 gene Proteins 0.000 description 3
- RPZFUIQVAPZLRH-GHCJXIJMSA-N Ile-Asp-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](C)C(=O)O)N RPZFUIQVAPZLRH-GHCJXIJMSA-N 0.000 description 3
- IOVUXUSIGXCREV-DKIMLUQUSA-N Ile-Leu-Phe Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 IOVUXUSIGXCREV-DKIMLUQUSA-N 0.000 description 3
- XUJNEKJLAYXESH-REOHCLBHSA-N L-Cysteine Chemical compound SC[C@H](N)C(O)=O XUJNEKJLAYXESH-REOHCLBHSA-N 0.000 description 3
- CKLJMWTZIZZHCS-REOHCLBHSA-N L-aspartic acid Chemical compound OC(=O)[C@@H](N)CC(O)=O CKLJMWTZIZZHCS-REOHCLBHSA-N 0.000 description 3
- WHUUTDBJXJRKMK-VKHMYHEASA-N L-glutamic acid Chemical compound OC(=O)[C@@H](N)CCC(O)=O WHUUTDBJXJRKMK-VKHMYHEASA-N 0.000 description 3
- ZDXPYRJPNDTMRX-VKHMYHEASA-N L-glutamine Chemical compound OC(=O)[C@@H](N)CCC(N)=O ZDXPYRJPNDTMRX-VKHMYHEASA-N 0.000 description 3
- ROHFNLRQFUQHCH-YFKPBYRVSA-N L-leucine Chemical compound CC(C)C[C@H](N)C(O)=O ROHFNLRQFUQHCH-YFKPBYRVSA-N 0.000 description 3
- KFKWRHQBZQICHA-STQMWFEESA-N L-leucyl-L-phenylalanine Natural products CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KFKWRHQBZQICHA-STQMWFEESA-N 0.000 description 3
- WNGVUZWBXZKQES-YUMQZZPRSA-N Leu-Ala-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O WNGVUZWBXZKQES-YUMQZZPRSA-N 0.000 description 3
- KWTVLKBOQATPHJ-SRVKXCTJSA-N Leu-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(C)C)N KWTVLKBOQATPHJ-SRVKXCTJSA-N 0.000 description 3
- PVMPDMIKUVNOBD-CIUDSAMLSA-N Leu-Asp-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O PVMPDMIKUVNOBD-CIUDSAMLSA-N 0.000 description 3
- CLVUXCBGKUECIT-HJGDQZAQSA-N Leu-Asp-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CLVUXCBGKUECIT-HJGDQZAQSA-N 0.000 description 3
- VCHVSKNMTXWIIP-SRVKXCTJSA-N Leu-Lys-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O VCHVSKNMTXWIIP-SRVKXCTJSA-N 0.000 description 3
- KWLWZYMNUZJKMZ-IHRRRGAJSA-N Leu-Pro-Leu Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O KWLWZYMNUZJKMZ-IHRRRGAJSA-N 0.000 description 3
- RGUXWMDNCPMQFB-YUMQZZPRSA-N Leu-Ser-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RGUXWMDNCPMQFB-YUMQZZPRSA-N 0.000 description 3
- BRTVHXHCUSXYRI-CIUDSAMLSA-N Leu-Ser-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O BRTVHXHCUSXYRI-CIUDSAMLSA-N 0.000 description 3
- YQFZRHYZLARWDY-IHRRRGAJSA-N Leu-Val-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN YQFZRHYZLARWDY-IHRRRGAJSA-N 0.000 description 3
- 108090001060 Lipase Proteins 0.000 description 3
- 102000004882 Lipase Human genes 0.000 description 3
- 239000004367 Lipase Substances 0.000 description 3
- AEIIJFBQVGYVEV-YESZJQIVSA-N Lys-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CCCCN)N)C(=O)O AEIIJFBQVGYVEV-YESZJQIVSA-N 0.000 description 3
- CNGOEHJCLVCJHN-SRVKXCTJSA-N Lys-Pro-Glu Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O CNGOEHJCLVCJHN-SRVKXCTJSA-N 0.000 description 3
- YSPZCHGIWAQVKQ-AVGNSLFASA-N Lys-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCCCN YSPZCHGIWAQVKQ-AVGNSLFASA-N 0.000 description 3
- RPWTZTBIFGENIA-VOAKCMCISA-N Lys-Thr-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O RPWTZTBIFGENIA-VOAKCMCISA-N 0.000 description 3
- QLFAPXUXEBAWEK-NHCYSSNCSA-N Lys-Val-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O QLFAPXUXEBAWEK-NHCYSSNCSA-N 0.000 description 3
- 241001465754 Metazoa Species 0.000 description 3
- 241000221961 Neurospora crassa Species 0.000 description 3
- BFYHIHGIHGROAT-HTUGSXCWSA-N Phe-Glu-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BFYHIHGIHGROAT-HTUGSXCWSA-N 0.000 description 3
- YMIZSYUAZJSOFL-SRVKXCTJSA-N Phe-Ser-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O YMIZSYUAZJSOFL-SRVKXCTJSA-N 0.000 description 3
- BAONJAHBAUDJKA-BZSNNMDCSA-N Phe-Tyr-Asp Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CC(O)=O)C(O)=O)C1=CC=CC=C1 BAONJAHBAUDJKA-BZSNNMDCSA-N 0.000 description 3
- GOUWCZRDTWTODO-YDHLFZDLSA-N Phe-Val-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O GOUWCZRDTWTODO-YDHLFZDLSA-N 0.000 description 3
- IWNOFCGBMSFTBC-CIUDSAMLSA-N Pro-Ala-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O IWNOFCGBMSFTBC-CIUDSAMLSA-N 0.000 description 3
- CGBYDGAJHSOGFQ-LPEHRKFASA-N Pro-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 CGBYDGAJHSOGFQ-LPEHRKFASA-N 0.000 description 3
- JFNPBBOGGNMSRX-CIUDSAMLSA-N Pro-Gln-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O JFNPBBOGGNMSRX-CIUDSAMLSA-N 0.000 description 3
- PULPZRAHVFBVTO-DCAQKATOSA-N Pro-Glu-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O PULPZRAHVFBVTO-DCAQKATOSA-N 0.000 description 3
- DMKWYMWNEKIPFC-IUCAKERBSA-N Pro-Gly-Arg Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O DMKWYMWNEKIPFC-IUCAKERBSA-N 0.000 description 3
- ULIWFCCJIOEHMU-BQBZGAKWSA-N Pro-Gly-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 ULIWFCCJIOEHMU-BQBZGAKWSA-N 0.000 description 3
- MHHQQZIFLWFZGR-DCAQKATOSA-N Pro-Lys-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O MHHQQZIFLWFZGR-DCAQKATOSA-N 0.000 description 3
- CGSOWZUPLOKYOR-AVGNSLFASA-N Pro-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 CGSOWZUPLOKYOR-AVGNSLFASA-N 0.000 description 3
- SBVPYBFMIGDIDX-SRVKXCTJSA-N Pro-Pro-Pro Chemical compound OC(=O)[C@@H]1CCCN1C(=O)[C@H]1N(C(=O)[C@H]2NCCC2)CCC1 SBVPYBFMIGDIDX-SRVKXCTJSA-N 0.000 description 3
- CXGLFEOYCJFKPR-RCWTZXSCSA-N Pro-Thr-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O CXGLFEOYCJFKPR-RCWTZXSCSA-N 0.000 description 3
- 102100023222 Protein-arginine deiminase type-1 Human genes 0.000 description 3
- 108010079005 RDV peptide Proteins 0.000 description 3
- 108700008625 Reporter Genes Proteins 0.000 description 3
- UIGMAMGZOJVTDN-WHFBIAKZSA-N Ser-Gly-Ser Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O UIGMAMGZOJVTDN-WHFBIAKZSA-N 0.000 description 3
- UGTZYIPOBYXWRW-SRVKXCTJSA-N Ser-Phe-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O UGTZYIPOBYXWRW-SRVKXCTJSA-N 0.000 description 3
- ILZAUMFXKSIUEF-SRVKXCTJSA-N Ser-Ser-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ILZAUMFXKSIUEF-SRVKXCTJSA-N 0.000 description 3
- XQJCEKXQUJQNNK-ZLUOBGJFSA-N Ser-Ser-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O XQJCEKXQUJQNNK-ZLUOBGJFSA-N 0.000 description 3
- JURQXQBJKUHGJS-UHFFFAOYSA-N Ser-Ser-Ser-Ser Chemical compound OCC(N)C(=O)NC(CO)C(=O)NC(CO)C(=O)NC(CO)C(O)=O JURQXQBJKUHGJS-UHFFFAOYSA-N 0.000 description 3
- VMHLLURERBWHNL-UHFFFAOYSA-M Sodium acetate Chemical compound [Na+].CC([O-])=O VMHLLURERBWHNL-UHFFFAOYSA-M 0.000 description 3
- VYEHBMMAJFVTOI-JHEQGTHGSA-N Thr-Gly-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(O)=O VYEHBMMAJFVTOI-JHEQGTHGSA-N 0.000 description 3
- JQAWYCUUFIMTHE-WLTAIBSBSA-N Thr-Gly-Tyr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JQAWYCUUFIMTHE-WLTAIBSBSA-N 0.000 description 3
- XSEPSRUDSPHMPX-KATARQTJSA-N Thr-Lys-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O XSEPSRUDSPHMPX-KATARQTJSA-N 0.000 description 3
- PRTHQBSMXILLPC-XGEHTFHBSA-N Thr-Ser-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O PRTHQBSMXILLPC-XGEHTFHBSA-N 0.000 description 3
- 241000223261 Trichoderma viride Species 0.000 description 3
- ZLFHAAGHGQBQQN-GUBZILKMSA-N Val-Ala-Pro Natural products CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O ZLFHAAGHGQBQQN-GUBZILKMSA-N 0.000 description 3
- CXWJFWAZIVWBOS-XQQFMLRXSA-N Val-Lys-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(=O)O)N CXWJFWAZIVWBOS-XQQFMLRXSA-N 0.000 description 3
- YLRAFVVWZRSZQC-DZKIICNBSA-N Val-Phe-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N YLRAFVVWZRSZQC-DZKIICNBSA-N 0.000 description 3
- NLNCNKIVJPEFBC-DLOVCJGASA-N Val-Val-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O NLNCNKIVJPEFBC-DLOVCJGASA-N 0.000 description 3
- LLJLBRRXKZTTRD-GUBZILKMSA-N Val-Val-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(=O)O)N LLJLBRRXKZTTRD-GUBZILKMSA-N 0.000 description 3
- 238000007792 addition Methods 0.000 description 3
- 108010008685 alanyl-glutamyl-aspartic acid Proteins 0.000 description 3
- 108010086434 alanyl-seryl-glycine Proteins 0.000 description 3
- 108010011559 alanylphenylalanine Proteins 0.000 description 3
- 108010010430 asparagine-proline-alanine Proteins 0.000 description 3
- 230000015572 biosynthetic process Effects 0.000 description 3
- 238000004422 calculation algorithm Methods 0.000 description 3
- 230000001413 cellular effect Effects 0.000 description 3
- 230000002759 chromosomal effect Effects 0.000 description 3
- 238000007796 conventional method Methods 0.000 description 3
- 108010005400 cutinase Proteins 0.000 description 3
- 108010016616 cysteinylglycine Proteins 0.000 description 3
- 239000003623 enhancer Substances 0.000 description 3
- 230000007613 environmental effect Effects 0.000 description 3
- 239000008103 glucose Substances 0.000 description 3
- 108010078144 glutaminyl-glycine Proteins 0.000 description 3
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 3
- XKUKSGPZAADMRA-UHFFFAOYSA-N glycyl-glycyl-glycine Chemical compound NCC(=O)NCC(=O)NCC(O)=O XKUKSGPZAADMRA-UHFFFAOYSA-N 0.000 description 3
- 108010033719 glycyl-histidyl-glycine Proteins 0.000 description 3
- 108010082286 glycyl-seryl-alanine Proteins 0.000 description 3
- 108010010147 glycylglutamine Proteins 0.000 description 3
- 108010020688 glycylhistidine Proteins 0.000 description 3
- 108010081551 glycylphenylalanine Proteins 0.000 description 3
- 230000012010 growth Effects 0.000 description 3
- 108010040030 histidinoalanine Proteins 0.000 description 3
- 108010092114 histidylphenylalanine Proteins 0.000 description 3
- 238000009396 hybridization Methods 0.000 description 3
- 230000003993 interaction Effects 0.000 description 3
- 108010078274 isoleucylvaline Proteins 0.000 description 3
- 108010044056 leucyl-phenylalanine Proteins 0.000 description 3
- 235000019421 lipase Nutrition 0.000 description 3
- 108010038320 lysylphenylalanine Proteins 0.000 description 3
- 230000004048 modification Effects 0.000 description 3
- 238000012986 modification Methods 0.000 description 3
- 108010084572 phenylalanyl-valine Proteins 0.000 description 3
- 229920001282 polysaccharide Polymers 0.000 description 3
- 239000005017 polysaccharide Substances 0.000 description 3
- 150000004804 polysaccharides Chemical class 0.000 description 3
- 108010031719 prolyl-serine Proteins 0.000 description 3
- 108010004914 prolylarginine Proteins 0.000 description 3
- 108010048818 seryl-histidine Proteins 0.000 description 3
- 239000000758 substrate Substances 0.000 description 3
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 2
- UHPMCKVQTMMPCG-UHFFFAOYSA-N 5,8-dihydroxy-2-methoxy-6-methyl-7-(2-oxopropyl)naphthalene-1,4-dione Chemical compound CC1=C(CC(C)=O)C(O)=C2C(=O)C(OC)=CC(=O)C2=C1O UHPMCKVQTMMPCG-UHFFFAOYSA-N 0.000 description 2
- 108091005508 Acid proteases Proteins 0.000 description 2
- 241001019659 Acremonium <Plectosphaerellaceae> Species 0.000 description 2
- HHGYNJRJIINWAK-FXQIFTODSA-N Ala-Ala-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N HHGYNJRJIINWAK-FXQIFTODSA-N 0.000 description 2
- YLTKNGYYPIWKHZ-ACZMJKKPSA-N Ala-Ala-Glu Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O YLTKNGYYPIWKHZ-ACZMJKKPSA-N 0.000 description 2
- FJVAQLJNTSUQPY-CIUDSAMLSA-N Ala-Ala-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN FJVAQLJNTSUQPY-CIUDSAMLSA-N 0.000 description 2
- KVWLTGNCJYDJET-LSJOCFKGSA-N Ala-Arg-His Chemical compound C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N KVWLTGNCJYDJET-LSJOCFKGSA-N 0.000 description 2
- GFBLJMHGHAXGNY-ZLUOBGJFSA-N Ala-Asn-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O GFBLJMHGHAXGNY-ZLUOBGJFSA-N 0.000 description 2
- ZEXDYVGDZJBRMO-ACZMJKKPSA-N Ala-Asn-Gln Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N ZEXDYVGDZJBRMO-ACZMJKKPSA-N 0.000 description 2
- NJPMYXWVWQWCSR-ACZMJKKPSA-N Ala-Glu-Asn Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O NJPMYXWVWQWCSR-ACZMJKKPSA-N 0.000 description 2
- PAIHPOGPJVUFJY-WDSKDSINSA-N Ala-Glu-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O PAIHPOGPJVUFJY-WDSKDSINSA-N 0.000 description 2
- HXNNRBHASOSVPG-GUBZILKMSA-N Ala-Glu-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O HXNNRBHASOSVPG-GUBZILKMSA-N 0.000 description 2
- HMRWQTHUDVXMGH-GUBZILKMSA-N Ala-Glu-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN HMRWQTHUDVXMGH-GUBZILKMSA-N 0.000 description 2
- VGPWRRFOPXVGOH-BYPYZUCNSA-N Ala-Gly-Gly Chemical compound C[C@H](N)C(=O)NCC(=O)NCC(O)=O VGPWRRFOPXVGOH-BYPYZUCNSA-N 0.000 description 2
- NBTGEURICRTMGL-WHFBIAKZSA-N Ala-Gly-Ser Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O NBTGEURICRTMGL-WHFBIAKZSA-N 0.000 description 2
- QUIGLPSHIFPEOV-CIUDSAMLSA-N Ala-Lys-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O QUIGLPSHIFPEOV-CIUDSAMLSA-N 0.000 description 2
- PMQXMXAASGFUDX-SRVKXCTJSA-N Ala-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CCCCN PMQXMXAASGFUDX-SRVKXCTJSA-N 0.000 description 2
- NINQYGGNRIBFSC-CIUDSAMLSA-N Ala-Lys-Ser Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CO)C(O)=O NINQYGGNRIBFSC-CIUDSAMLSA-N 0.000 description 2
- XRUJOVRWNMBAAA-NHCYSSNCSA-N Ala-Phe-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 XRUJOVRWNMBAAA-NHCYSSNCSA-N 0.000 description 2
- DHBKYZYFEXXUAK-ONGXEEELSA-N Ala-Phe-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 DHBKYZYFEXXUAK-ONGXEEELSA-N 0.000 description 2
- IORKCNUBHNIMKY-CIUDSAMLSA-N Ala-Pro-Glu Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O IORKCNUBHNIMKY-CIUDSAMLSA-N 0.000 description 2
- DCVYRWFAMZFSDA-ZLUOBGJFSA-N Ala-Ser-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O DCVYRWFAMZFSDA-ZLUOBGJFSA-N 0.000 description 2
- DYXOFPBJBAHWFY-JBDRJPRFSA-N Ala-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@H](C)N DYXOFPBJBAHWFY-JBDRJPRFSA-N 0.000 description 2
- MMLHRUJLOUSRJX-CIUDSAMLSA-N Ala-Ser-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN MMLHRUJLOUSRJX-CIUDSAMLSA-N 0.000 description 2
- NCQMBSJGJMYKCK-ZLUOBGJFSA-N Ala-Ser-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O NCQMBSJGJMYKCK-ZLUOBGJFSA-N 0.000 description 2
- ARHJJAAWNWOACN-FXQIFTODSA-N Ala-Ser-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O ARHJJAAWNWOACN-FXQIFTODSA-N 0.000 description 2
- XSLGWYYNOSUMRM-ZKWXMUAHSA-N Ala-Val-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O XSLGWYYNOSUMRM-ZKWXMUAHSA-N 0.000 description 2
- BVLPIIBTWIYOML-ZKWXMUAHSA-N Ala-Val-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O BVLPIIBTWIYOML-ZKWXMUAHSA-N 0.000 description 2
- ZCUFMRIQCPNOHZ-NRPADANISA-N Ala-Val-Gln Chemical compound C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N ZCUFMRIQCPNOHZ-NRPADANISA-N 0.000 description 2
- REWSWYIDQIELBE-FXQIFTODSA-N Ala-Val-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O REWSWYIDQIELBE-FXQIFTODSA-N 0.000 description 2
- 101100137363 Arabidopsis thaliana PPI2 gene Proteins 0.000 description 2
- 101100536527 Arabidopsis thaliana TOC159 gene Proteins 0.000 description 2
- 101100481793 Arabidopsis thaliana TOC33 gene Proteins 0.000 description 2
- GHNDBBVSWOWYII-LPEHRKFASA-N Arg-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O GHNDBBVSWOWYII-LPEHRKFASA-N 0.000 description 2
- RWCLSUOSKWTXLA-FXQIFTODSA-N Arg-Asp-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O RWCLSUOSKWTXLA-FXQIFTODSA-N 0.000 description 2
- OTCJMMRQBVDQRK-DCAQKATOSA-N Arg-Asp-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O OTCJMMRQBVDQRK-DCAQKATOSA-N 0.000 description 2
- HPKSHFSEXICTLI-CIUDSAMLSA-N Arg-Glu-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O HPKSHFSEXICTLI-CIUDSAMLSA-N 0.000 description 2
- AGVNTAUPLWIQEN-ZPFDUUQYSA-N Arg-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N AGVNTAUPLWIQEN-ZPFDUUQYSA-N 0.000 description 2
- GNYUVVJYGJFKHN-RVMXOQNASA-N Arg-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N GNYUVVJYGJFKHN-RVMXOQNASA-N 0.000 description 2
- HJDNZFIYILEIKR-OSUNSFLBSA-N Arg-Ile-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O HJDNZFIYILEIKR-OSUNSFLBSA-N 0.000 description 2
- OTZMRMHZCMZOJZ-SRVKXCTJSA-N Arg-Leu-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O OTZMRMHZCMZOJZ-SRVKXCTJSA-N 0.000 description 2
- KXOPYFNQLVUOAQ-FXQIFTODSA-N Arg-Ser-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O KXOPYFNQLVUOAQ-FXQIFTODSA-N 0.000 description 2
- FXGMURPOWCKNAZ-JYJNAYRXSA-N Arg-Val-Phe Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 FXGMURPOWCKNAZ-JYJNAYRXSA-N 0.000 description 2
- YNDLOUMBVDVALC-ZLUOBGJFSA-N Asn-Ala-Ala Chemical compound C[C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](CC(=O)N)N YNDLOUMBVDVALC-ZLUOBGJFSA-N 0.000 description 2
- BHQQRVARKXWXPP-ACZMJKKPSA-N Asn-Asp-Glu Chemical compound C(CC(=O)O)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)N)N BHQQRVARKXWXPP-ACZMJKKPSA-N 0.000 description 2
- ZWASIOHRQWRWAS-UGYAYLCHSA-N Asn-Asp-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ZWASIOHRQWRWAS-UGYAYLCHSA-N 0.000 description 2
- SRUUBQBAVNQZGJ-LAEOZQHASA-N Asn-Gln-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)N)N SRUUBQBAVNQZGJ-LAEOZQHASA-N 0.000 description 2
- OOWSBIOUKIUWLO-RCOVLWMOSA-N Asn-Gly-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O OOWSBIOUKIUWLO-RCOVLWMOSA-N 0.000 description 2
- NVWJMQNYLYWVNQ-BYULHYEWSA-N Asn-Ile-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O NVWJMQNYLYWVNQ-BYULHYEWSA-N 0.000 description 2
- FHETWELNCBMRMG-HJGDQZAQSA-N Asn-Leu-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FHETWELNCBMRMG-HJGDQZAQSA-N 0.000 description 2
- ORJQQZIXTOYGGH-SRVKXCTJSA-N Asn-Lys-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O ORJQQZIXTOYGGH-SRVKXCTJSA-N 0.000 description 2
- ZYPWIUFLYMQZBS-SRVKXCTJSA-N Asn-Lys-Lys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N ZYPWIUFLYMQZBS-SRVKXCTJSA-N 0.000 description 2
- KRXIWXCXOARFNT-ZLUOBGJFSA-N Asp-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC(O)=O KRXIWXCXOARFNT-ZLUOBGJFSA-N 0.000 description 2
- NJIKKGUVGUBICV-ZLUOBGJFSA-N Asp-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC(O)=O NJIKKGUVGUBICV-ZLUOBGJFSA-N 0.000 description 2
- CNKAZIGBGQIHLL-GUBZILKMSA-N Asp-Arg-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC(=O)O)N CNKAZIGBGQIHLL-GUBZILKMSA-N 0.000 description 2
- JDHOJQJMWBKHDB-CIUDSAMLSA-N Asp-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)O)N JDHOJQJMWBKHDB-CIUDSAMLSA-N 0.000 description 2
- HOQGTAIGQSDCHR-SRVKXCTJSA-N Asp-Asn-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O HOQGTAIGQSDCHR-SRVKXCTJSA-N 0.000 description 2
- NAPNAGZWHQHZLG-ZLUOBGJFSA-N Asp-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)O)N NAPNAGZWHQHZLG-ZLUOBGJFSA-N 0.000 description 2
- VPSHHQXIWLGVDD-ZLUOBGJFSA-N Asp-Asp-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O VPSHHQXIWLGVDD-ZLUOBGJFSA-N 0.000 description 2
- FANQWNCPNFEPGZ-WHFBIAKZSA-N Asp-Asp-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O FANQWNCPNFEPGZ-WHFBIAKZSA-N 0.000 description 2
- XJQRWGXKUSDEFI-ACZMJKKPSA-N Asp-Glu-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O XJQRWGXKUSDEFI-ACZMJKKPSA-N 0.000 description 2
- RATOMFTUDRYMKX-ACZMJKKPSA-N Asp-Glu-Cys Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)O)N RATOMFTUDRYMKX-ACZMJKKPSA-N 0.000 description 2
- KHBLRHKVXICFMY-GUBZILKMSA-N Asp-Glu-Lys Chemical compound N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O KHBLRHKVXICFMY-GUBZILKMSA-N 0.000 description 2
- WBDWQKRLTVCDSY-WHFBIAKZSA-N Asp-Gly-Asp Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O WBDWQKRLTVCDSY-WHFBIAKZSA-N 0.000 description 2
- BIVYLQMZPHDUIH-WHFBIAKZSA-N Asp-Gly-Cys Chemical compound C([C@@H](C(=O)NCC(=O)N[C@@H](CS)C(=O)O)N)C(=O)O BIVYLQMZPHDUIH-WHFBIAKZSA-N 0.000 description 2
- RQYMKRMRZWJGHC-BQBZGAKWSA-N Asp-Gly-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)O)N RQYMKRMRZWJGHC-BQBZGAKWSA-N 0.000 description 2
- WSGVTKZFVJSJOG-RCOVLWMOSA-N Asp-Gly-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O WSGVTKZFVJSJOG-RCOVLWMOSA-N 0.000 description 2
- MYOHQBFRJQFIDZ-KKUMJFAQSA-N Asp-Leu-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MYOHQBFRJQFIDZ-KKUMJFAQSA-N 0.000 description 2
- XWSIYTYNLKCLJB-CIUDSAMLSA-N Asp-Lys-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O XWSIYTYNLKCLJB-CIUDSAMLSA-N 0.000 description 2
- LBOVBQONZJRWPV-YUMQZZPRSA-N Asp-Lys-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O LBOVBQONZJRWPV-YUMQZZPRSA-N 0.000 description 2
- MYLZFUMPZCPJCJ-NHCYSSNCSA-N Asp-Lys-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O MYLZFUMPZCPJCJ-NHCYSSNCSA-N 0.000 description 2
- GWIJZUVQVDJHDI-AVGNSLFASA-N Asp-Phe-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O GWIJZUVQVDJHDI-AVGNSLFASA-N 0.000 description 2
- LTCKTLYKRMCFOC-KKUMJFAQSA-N Asp-Phe-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O LTCKTLYKRMCFOC-KKUMJFAQSA-N 0.000 description 2
- DRCOAZZDQRCGGP-GHCJXIJMSA-N Asp-Ser-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O DRCOAZZDQRCGGP-GHCJXIJMSA-N 0.000 description 2
- ITGFVUYOLWBPQW-KKHAAJSZSA-N Asp-Thr-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O ITGFVUYOLWBPQW-KKHAAJSZSA-N 0.000 description 2
- 241000228197 Aspergillus flavus Species 0.000 description 2
- 101100049989 Aspergillus niger xlnB gene Proteins 0.000 description 2
- 235000002247 Aspergillus oryzae Nutrition 0.000 description 2
- IJGRMHOSHXDMSA-UHFFFAOYSA-N Atomic nitrogen Chemical compound N#N IJGRMHOSHXDMSA-UHFFFAOYSA-N 0.000 description 2
- 241000193830 Bacillus <bacterium> Species 0.000 description 2
- 235000014469 Bacillus subtilis Nutrition 0.000 description 2
- 102100021868 Calnexin Human genes 0.000 description 2
- OKTJSMMVPCPJKN-UHFFFAOYSA-N Carbon Chemical compound [C] OKTJSMMVPCPJKN-UHFFFAOYSA-N 0.000 description 2
- 241000293770 Cerrena unicolor Species 0.000 description 2
- YMBAVNPKBWHDAW-CIUDSAMLSA-N Cys-Asp-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CS)N YMBAVNPKBWHDAW-CIUDSAMLSA-N 0.000 description 2
- MBILEVLLOHJZMG-FXQIFTODSA-N Cys-Gln-Glu Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CS)N MBILEVLLOHJZMG-FXQIFTODSA-N 0.000 description 2
- DZLQXIFVQFTFJY-BYPYZUCNSA-N Cys-Gly-Gly Chemical compound SC[C@H](N)C(=O)NCC(=O)NCC(O)=O DZLQXIFVQFTFJY-BYPYZUCNSA-N 0.000 description 2
- RWAZRMXTVSIVJR-YUMQZZPRSA-N Cys-Gly-His Chemical compound [H]N[C@@H](CS)C(=O)NCC(=O)N[C@@H](CC1=CNC=N1)C(O)=O RWAZRMXTVSIVJR-YUMQZZPRSA-N 0.000 description 2
- LHMSYHSAAJOEBL-CIUDSAMLSA-N Cys-Lys-Asn Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O LHMSYHSAAJOEBL-CIUDSAMLSA-N 0.000 description 2
- AZDQAZRURQMSQD-XPUUQOCRSA-N Cys-Val-Gly Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O AZDQAZRURQMSQD-XPUUQOCRSA-N 0.000 description 2
- AHLPHDHHMVZTML-SCSAIBSYSA-N D-Ornithine Chemical compound NCCC[C@@H](N)C(O)=O AHLPHDHHMVZTML-SCSAIBSYSA-N 0.000 description 2
- ONIBWKKTOPOVIA-SCSAIBSYSA-N D-Proline Chemical compound OC(=O)[C@H]1CCCN1 ONIBWKKTOPOVIA-SCSAIBSYSA-N 0.000 description 2
- MTCFGRXMJLQNBG-UWTATZPHSA-N D-Serine Chemical compound OC[C@@H](N)C(O)=O MTCFGRXMJLQNBG-UWTATZPHSA-N 0.000 description 2
- QNAYBMKLOCPYGJ-UWTATZPHSA-N D-alanine Chemical compound C[C@@H](N)C(O)=O QNAYBMKLOCPYGJ-UWTATZPHSA-N 0.000 description 2
- ODKSFYDXXFIFQN-SCSAIBSYSA-N D-arginine Chemical compound OC(=O)[C@H](N)CCCNC(N)=N ODKSFYDXXFIFQN-SCSAIBSYSA-N 0.000 description 2
- HNDVDQJCIGZPNO-RXMQYKEDSA-N D-histidine Chemical compound OC(=O)[C@H](N)CC1=CN=CN1 HNDVDQJCIGZPNO-RXMQYKEDSA-N 0.000 description 2
- KDXKERNSBIXSRK-RXMQYKEDSA-N D-lysine Chemical compound NCCCC[C@@H](N)C(O)=O KDXKERNSBIXSRK-RXMQYKEDSA-N 0.000 description 2
- COLNVLDHVKWLRT-MRVPVSSYSA-N D-phenylalanine Chemical compound OC(=O)[C@H](N)CC1=CC=CC=C1 COLNVLDHVKWLRT-MRVPVSSYSA-N 0.000 description 2
- 108090000204 Dipeptidase 1 Proteins 0.000 description 2
- 241000196324 Embryophyta Species 0.000 description 2
- 241000701959 Escherichia virus Lambda Species 0.000 description 2
- 241000223221 Fusarium oxysporum Species 0.000 description 2
- 101150108358 GLAA gene Proteins 0.000 description 2
- REJJNXODKSHOKA-ACZMJKKPSA-N Gln-Ala-Asp Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N REJJNXODKSHOKA-ACZMJKKPSA-N 0.000 description 2
- PRBLYKYHAJEABA-SRVKXCTJSA-N Gln-Arg-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O PRBLYKYHAJEABA-SRVKXCTJSA-N 0.000 description 2
- WQWMZOIPXWSZNE-WDSKDSINSA-N Gln-Asp-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O WQWMZOIPXWSZNE-WDSKDSINSA-N 0.000 description 2
- DRDSQGHKTLSNEA-GLLZPBPUSA-N Gln-Glu-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O DRDSQGHKTLSNEA-GLLZPBPUSA-N 0.000 description 2
- JHPFPROFOAJRFN-IHRRRGAJSA-N Gln-Glu-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)N)N)O JHPFPROFOAJRFN-IHRRRGAJSA-N 0.000 description 2
- VGTDBGYFVWOQTI-RYUDHWBXSA-N Gln-Gly-Phe Chemical compound NC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 VGTDBGYFVWOQTI-RYUDHWBXSA-N 0.000 description 2
- MWERYIXRDZDXOA-QEWYBTABSA-N Gln-Ile-Phe Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O MWERYIXRDZDXOA-QEWYBTABSA-N 0.000 description 2
- HYPVLWGNBIYTNA-GUBZILKMSA-N Gln-Leu-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O HYPVLWGNBIYTNA-GUBZILKMSA-N 0.000 description 2
- IOFDDSNZJDIGPB-GVXVVHGQSA-N Gln-Leu-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O IOFDDSNZJDIGPB-GVXVVHGQSA-N 0.000 description 2
- XZLLTYBONVKGLO-SDDRHHMPSA-N Gln-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CCC(=O)N)N)C(=O)O XZLLTYBONVKGLO-SDDRHHMPSA-N 0.000 description 2
- LUGUNEGJNDEBLU-DCAQKATOSA-N Gln-Met-Arg Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N LUGUNEGJNDEBLU-DCAQKATOSA-N 0.000 description 2
- PIUPHASDUFSHTF-CIUDSAMLSA-N Gln-Pro-Asn Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CCC(=O)N)N)C(=O)N[C@@H](CC(=O)N)C(=O)O PIUPHASDUFSHTF-CIUDSAMLSA-N 0.000 description 2
- SYZZMPFLOLSMHL-XHNCKOQMSA-N Gln-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)N)N)C(=O)O SYZZMPFLOLSMHL-XHNCKOQMSA-N 0.000 description 2
- ARYKRXHBIPLULY-XKBZYTNZSA-N Gln-Thr-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O ARYKRXHBIPLULY-XKBZYTNZSA-N 0.000 description 2
- LKDIBBOKUAASNP-FXQIFTODSA-N Glu-Ala-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O LKDIBBOKUAASNP-FXQIFTODSA-N 0.000 description 2
- RLZBLVSJDFHDBL-KBIXCLLPSA-N Glu-Ala-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O RLZBLVSJDFHDBL-KBIXCLLPSA-N 0.000 description 2
- RCCDHXSRMWCOOY-GUBZILKMSA-N Glu-Arg-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O RCCDHXSRMWCOOY-GUBZILKMSA-N 0.000 description 2
- OJGLIOXAKGFFDW-SRVKXCTJSA-N Glu-Arg-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCC(=O)O)N OJGLIOXAKGFFDW-SRVKXCTJSA-N 0.000 description 2
- CKRUHITYRFNUKW-WDSKDSINSA-N Glu-Asn-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O CKRUHITYRFNUKW-WDSKDSINSA-N 0.000 description 2
- PAQUJCSYVIBPLC-AVGNSLFASA-N Glu-Asp-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 PAQUJCSYVIBPLC-AVGNSLFASA-N 0.000 description 2
- NKLRYVLERDYDBI-FXQIFTODSA-N Glu-Glu-Asp Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O NKLRYVLERDYDBI-FXQIFTODSA-N 0.000 description 2
- HNVFSTLPVJWIDV-CIUDSAMLSA-N Glu-Glu-Gln Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O HNVFSTLPVJWIDV-CIUDSAMLSA-N 0.000 description 2
- BUZMZDDKFCSKOT-CIUDSAMLSA-N Glu-Glu-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O BUZMZDDKFCSKOT-CIUDSAMLSA-N 0.000 description 2
- IQACOVZVOMVILH-FXQIFTODSA-N Glu-Glu-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O IQACOVZVOMVILH-FXQIFTODSA-N 0.000 description 2
- PHONAZGUEGIOEM-GLLZPBPUSA-N Glu-Glu-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PHONAZGUEGIOEM-GLLZPBPUSA-N 0.000 description 2
- BUAKRRKDHSSIKK-IHRRRGAJSA-N Glu-Glu-Tyr Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 BUAKRRKDHSSIKK-IHRRRGAJSA-N 0.000 description 2
- PXXGVUVQWQGGIG-YUMQZZPRSA-N Glu-Gly-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N PXXGVUVQWQGGIG-YUMQZZPRSA-N 0.000 description 2
- OGNJZUXUTPQVBR-BQBZGAKWSA-N Glu-Gly-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O OGNJZUXUTPQVBR-BQBZGAKWSA-N 0.000 description 2
- HILMIYALTUQTRC-XVKPBYJWSA-N Glu-Gly-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O HILMIYALTUQTRC-XVKPBYJWSA-N 0.000 description 2
- VSRCAOIHMGCIJK-SRVKXCTJSA-N Glu-Leu-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O VSRCAOIHMGCIJK-SRVKXCTJSA-N 0.000 description 2
- ATVYZJGOZLVXDK-IUCAKERBSA-N Glu-Leu-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O ATVYZJGOZLVXDK-IUCAKERBSA-N 0.000 description 2
- MWMJCGBSIORNCD-AVGNSLFASA-N Glu-Leu-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O MWMJCGBSIORNCD-AVGNSLFASA-N 0.000 description 2
- BCYGDJXHAGZNPQ-DCAQKATOSA-N Glu-Lys-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O BCYGDJXHAGZNPQ-DCAQKATOSA-N 0.000 description 2
- MRWYPDWDZSLWJM-ACZMJKKPSA-N Glu-Ser-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O MRWYPDWDZSLWJM-ACZMJKKPSA-N 0.000 description 2
- DDXZHOHEABQXSE-NKIYYHGXSA-N Glu-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O DDXZHOHEABQXSE-NKIYYHGXSA-N 0.000 description 2
- CQGBSALYGOXQPE-HTUGSXCWSA-N Glu-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O CQGBSALYGOXQPE-HTUGSXCWSA-N 0.000 description 2
- HGJREIGJLUQBTJ-SZMVWBNQSA-N Glu-Trp-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(C)C)C(O)=O HGJREIGJLUQBTJ-SZMVWBNQSA-N 0.000 description 2
- HVKAAUOFFTUSAA-XDTLVQLUSA-N Glu-Tyr-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O HVKAAUOFFTUSAA-XDTLVQLUSA-N 0.000 description 2
- KIEICAOUSNYOLM-NRPADANISA-N Glu-Val-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O KIEICAOUSNYOLM-NRPADANISA-N 0.000 description 2
- ZYRXTRTUCAVNBQ-GVXVVHGQSA-N Glu-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N ZYRXTRTUCAVNBQ-GVXVVHGQSA-N 0.000 description 2
- 229920001503 Glucan Polymers 0.000 description 2
- RLFSBAPJTYKSLG-WHFBIAKZSA-N Gly-Ala-Asp Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O RLFSBAPJTYKSLG-WHFBIAKZSA-N 0.000 description 2
- VSVZIEVNUYDAFR-YUMQZZPRSA-N Gly-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN VSVZIEVNUYDAFR-YUMQZZPRSA-N 0.000 description 2
- NZAFOTBEULLEQB-WDSKDSINSA-N Gly-Asn-Glu Chemical compound C(CC(=O)O)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)CN NZAFOTBEULLEQB-WDSKDSINSA-N 0.000 description 2
- MHHUEAIBJZWDBH-YUMQZZPRSA-N Gly-Asp-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)CN MHHUEAIBJZWDBH-YUMQZZPRSA-N 0.000 description 2
- LCNXZQROPKFGQK-WHFBIAKZSA-N Gly-Asp-Ser Chemical compound NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O LCNXZQROPKFGQK-WHFBIAKZSA-N 0.000 description 2
- KTSZUNRRYXPZTK-BQBZGAKWSA-N Gly-Gln-Glu Chemical compound NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O KTSZUNRRYXPZTK-BQBZGAKWSA-N 0.000 description 2
- QPDUVFSVVAOUHE-XVKPBYJWSA-N Gly-Gln-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCC(N)=O)NC(=O)CN)C(O)=O QPDUVFSVVAOUHE-XVKPBYJWSA-N 0.000 description 2
- DHDOADIPGZTAHT-YUMQZZPRSA-N Gly-Glu-Arg Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N DHDOADIPGZTAHT-YUMQZZPRSA-N 0.000 description 2
- XTQFHTHIAKKCTM-YFKPBYRVSA-N Gly-Glu-Gly Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O XTQFHTHIAKKCTM-YFKPBYRVSA-N 0.000 description 2
- JSNNHGHYGYMVCK-XVKPBYJWSA-N Gly-Glu-Val Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O JSNNHGHYGYMVCK-XVKPBYJWSA-N 0.000 description 2
- UFPXDFOYHVEIPI-BYPYZUCNSA-N Gly-Gly-Asp Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O UFPXDFOYHVEIPI-BYPYZUCNSA-N 0.000 description 2
- QPTNELDXWKRIFX-YFKPBYRVSA-N Gly-Gly-Gln Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O QPTNELDXWKRIFX-YFKPBYRVSA-N 0.000 description 2
- GDOZQTNZPCUARW-YFKPBYRVSA-N Gly-Gly-Glu Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O GDOZQTNZPCUARW-YFKPBYRVSA-N 0.000 description 2
- OLPPXYMMIARYAL-QMMMGPOBSA-N Gly-Gly-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CNC(=O)CN OLPPXYMMIARYAL-QMMMGPOBSA-N 0.000 description 2
- VAXIVIPMCTYSHI-YUMQZZPRSA-N Gly-His-Asp Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)CN VAXIVIPMCTYSHI-YUMQZZPRSA-N 0.000 description 2
- SWQALSGKVLYKDT-UHFFFAOYSA-N Gly-Ile-Ala Natural products NCC(=O)NC(C(C)CC)C(=O)NC(C)C(O)=O SWQALSGKVLYKDT-UHFFFAOYSA-N 0.000 description 2
- HMHRTKOWRUPPNU-RCOVLWMOSA-N Gly-Ile-Gly Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O HMHRTKOWRUPPNU-RCOVLWMOSA-N 0.000 description 2
- NSTUFLGQJCOCDL-UWVGGRQHSA-N Gly-Leu-Arg Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N NSTUFLGQJCOCDL-UWVGGRQHSA-N 0.000 description 2
- YTSVAIMKVLZUDU-YUMQZZPRSA-N Gly-Leu-Asp Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O YTSVAIMKVLZUDU-YUMQZZPRSA-N 0.000 description 2
- ULZCYBYDTUMHNF-IUCAKERBSA-N Gly-Leu-Glu Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O ULZCYBYDTUMHNF-IUCAKERBSA-N 0.000 description 2
- LRQXRHGQEVWGPV-NHCYSSNCSA-N Gly-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)CN LRQXRHGQEVWGPV-NHCYSSNCSA-N 0.000 description 2
- UHPAZODVFFYEEL-QWRGUYRKSA-N Gly-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)CN UHPAZODVFFYEEL-QWRGUYRKSA-N 0.000 description 2
- NNCSJUBVFBDDLC-YUMQZZPRSA-N Gly-Leu-Ser Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O NNCSJUBVFBDDLC-YUMQZZPRSA-N 0.000 description 2
- MIIVFRCYJABHTQ-ONGXEEELSA-N Gly-Leu-Val Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O MIIVFRCYJABHTQ-ONGXEEELSA-N 0.000 description 2
- MHXKHKWHPNETGG-QWRGUYRKSA-N Gly-Lys-Leu Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O MHXKHKWHPNETGG-QWRGUYRKSA-N 0.000 description 2
- NTBOEZICHOSJEE-YUMQZZPRSA-N Gly-Lys-Ser Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O NTBOEZICHOSJEE-YUMQZZPRSA-N 0.000 description 2
- OQQKUTVULYLCDG-ONGXEEELSA-N Gly-Lys-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCCCN)NC(=O)CN)C(O)=O OQQKUTVULYLCDG-ONGXEEELSA-N 0.000 description 2
- WDXLKVQATNEAJQ-BQBZGAKWSA-N Gly-Pro-Asp Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O WDXLKVQATNEAJQ-BQBZGAKWSA-N 0.000 description 2
- IALQAMYQJBZNSK-WHFBIAKZSA-N Gly-Ser-Asn Chemical compound [H]NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O IALQAMYQJBZNSK-WHFBIAKZSA-N 0.000 description 2
- WNGHUXFWEWTKAO-YUMQZZPRSA-N Gly-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)CN WNGHUXFWEWTKAO-YUMQZZPRSA-N 0.000 description 2
- LLWQVJNHMYBLLK-CDMKHQONSA-N Gly-Thr-Phe Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LLWQVJNHMYBLLK-CDMKHQONSA-N 0.000 description 2
- YDIDLLVFCYSXNY-RCOVLWMOSA-N Gly-Val-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)CN YDIDLLVFCYSXNY-RCOVLWMOSA-N 0.000 description 2
- AFMOTCMSEBITOE-YEPSODPASA-N Gly-Val-Thr Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O AFMOTCMSEBITOE-YEPSODPASA-N 0.000 description 2
- VCBWXASUBZIFLQ-IHRRRGAJSA-N His-Pro-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O VCBWXASUBZIFLQ-IHRRRGAJSA-N 0.000 description 2
- 101000651309 Homo sapiens Retinoic acid receptor responder protein 1 Proteins 0.000 description 2
- 101001100101 Homo sapiens Retinoic acid-induced protein 3 Proteins 0.000 description 2
- 101000577652 Homo sapiens Serine/threonine-protein kinase PRP4 homolog Proteins 0.000 description 2
- 101000610640 Homo sapiens U4/U6 small nuclear ribonucleoprotein Prp3 Proteins 0.000 description 2
- 101000577737 Homo sapiens U4/U6 small nuclear ribonucleoprotein Prp4 Proteins 0.000 description 2
- 241000223198 Humicola Species 0.000 description 2
- 241000223199 Humicola grisea Species 0.000 description 2
- 241001480714 Humicola insolens Species 0.000 description 2
- TZCGZYWNIDZZMR-UHFFFAOYSA-N Ile-Arg-Ala Natural products CCC(C)C(N)C(=O)NC(C(=O)NC(C)C(O)=O)CCCN=C(N)N TZCGZYWNIDZZMR-UHFFFAOYSA-N 0.000 description 2
- QYZYJFXHXYUZMZ-UGYAYLCHSA-N Ile-Asn-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N QYZYJFXHXYUZMZ-UGYAYLCHSA-N 0.000 description 2
- HVWXAQVMRBKKFE-UGYAYLCHSA-N Ile-Asp-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N HVWXAQVMRBKKFE-UGYAYLCHSA-N 0.000 description 2
- WZDCVAWMBUNDDY-KBIXCLLPSA-N Ile-Glu-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](C)C(=O)O)N WZDCVAWMBUNDDY-KBIXCLLPSA-N 0.000 description 2
- GVKKVHNRTUFCCE-BJDJZHNGSA-N Ile-Leu-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)O)N GVKKVHNRTUFCCE-BJDJZHNGSA-N 0.000 description 2
- UIEZQYNXCYHMQS-BJDJZHNGSA-N Ile-Lys-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)O)N UIEZQYNXCYHMQS-BJDJZHNGSA-N 0.000 description 2
- ADDYYRVQQZFIMW-MNXVOIDGSA-N Ile-Lys-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N ADDYYRVQQZFIMW-MNXVOIDGSA-N 0.000 description 2
- IIWQTXMUALXGOV-PCBIJLKTSA-N Ile-Phe-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)O)C(=O)O)N IIWQTXMUALXGOV-PCBIJLKTSA-N 0.000 description 2
- FQYQMFCIJNWDQZ-CYDGBPFRSA-N Ile-Pro-Pro Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 FQYQMFCIJNWDQZ-CYDGBPFRSA-N 0.000 description 2
- CAHCWMVNBZJVAW-NAKRPEOUSA-N Ile-Pro-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)O)N CAHCWMVNBZJVAW-NAKRPEOUSA-N 0.000 description 2
- YKZAMJXNJUWFIK-JBDRJPRFSA-N Ile-Ser-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(=O)O)N YKZAMJXNJUWFIK-JBDRJPRFSA-N 0.000 description 2
- PELCGFMHLZXWBQ-BJDJZHNGSA-N Ile-Ser-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)O)N PELCGFMHLZXWBQ-BJDJZHNGSA-N 0.000 description 2
- WTDRDQBEARUVNC-LURJTMIESA-N L-DOPA Chemical compound OC(=O)[C@@H](N)CC1=CC=C(O)C(O)=C1 WTDRDQBEARUVNC-LURJTMIESA-N 0.000 description 2
- WTDRDQBEARUVNC-UHFFFAOYSA-N L-Dopa Natural products OC(=O)C(N)CC1=CC=C(O)C(O)=C1 WTDRDQBEARUVNC-UHFFFAOYSA-N 0.000 description 2
- HGCNKOLVKRAVHD-UHFFFAOYSA-N L-Met-L-Phe Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 HGCNKOLVKRAVHD-UHFFFAOYSA-N 0.000 description 2
- SENJXOPIZNYLHU-UHFFFAOYSA-N L-leucyl-L-arginine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CCCN=C(N)N SENJXOPIZNYLHU-UHFFFAOYSA-N 0.000 description 2
- FFEARJCKVFRZRR-BYPYZUCNSA-N L-methionine Chemical compound CSCC[C@H](N)C(O)=O FFEARJCKVFRZRR-BYPYZUCNSA-N 0.000 description 2
- QEFRNWWLZKMPFJ-YGVKFDHGSA-N L-methionine S-oxide Chemical compound CS(=O)CC[C@H](N)C(O)=O QEFRNWWLZKMPFJ-YGVKFDHGSA-N 0.000 description 2
- 125000002707 L-tryptophyl group Chemical group [H]C1=C([H])C([H])=C2C(C([C@](N([H])[H])(C(=O)[*])[H])([H])[H])=C([H])N([H])C2=C1[H] 0.000 description 2
- OUYCCCASQSFEME-QMMMGPOBSA-N L-tyrosine Chemical compound OC(=O)[C@@H](N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-QMMMGPOBSA-N 0.000 description 2
- LZDNBBYBDGBADK-UHFFFAOYSA-N L-valyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C(C)C)C(O)=O)=CNC2=C1 LZDNBBYBDGBADK-UHFFFAOYSA-N 0.000 description 2
- 101150096459 LHS1 gene Proteins 0.000 description 2
- ZRLUISBDKUWAIZ-CIUDSAMLSA-N Leu-Ala-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O ZRLUISBDKUWAIZ-CIUDSAMLSA-N 0.000 description 2
- WSGXUIQTEZDVHJ-GARJFASQSA-N Leu-Ala-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@@H]1C(O)=O WSGXUIQTEZDVHJ-GARJFASQSA-N 0.000 description 2
- XBBKIIGCUMBKCO-JXUBOQSCSA-N Leu-Ala-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XBBKIIGCUMBKCO-JXUBOQSCSA-N 0.000 description 2
- HBJZFCIVFIBNSV-DCAQKATOSA-N Leu-Arg-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(N)=O)C(O)=O HBJZFCIVFIBNSV-DCAQKATOSA-N 0.000 description 2
- ZURHXHNAEJJRNU-CIUDSAMLSA-N Leu-Asp-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ZURHXHNAEJJRNU-CIUDSAMLSA-N 0.000 description 2
- HUEBCHPSXSQUGN-GARJFASQSA-N Leu-Cys-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)N1CCC[C@@H]1C(=O)O)N HUEBCHPSXSQUGN-GARJFASQSA-N 0.000 description 2
- ZYLJULGXQDNXDK-GUBZILKMSA-N Leu-Gln-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O ZYLJULGXQDNXDK-GUBZILKMSA-N 0.000 description 2
- FQZPTCNSNPWHLJ-AVGNSLFASA-N Leu-Gln-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(O)=O FQZPTCNSNPWHLJ-AVGNSLFASA-N 0.000 description 2
- YVKSMSDXKMSIRX-GUBZILKMSA-N Leu-Glu-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O YVKSMSDXKMSIRX-GUBZILKMSA-N 0.000 description 2
- HFBCHNRFRYLZNV-GUBZILKMSA-N Leu-Glu-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O HFBCHNRFRYLZNV-GUBZILKMSA-N 0.000 description 2
- HQUXQAMSWFIRET-AVGNSLFASA-N Leu-Glu-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN HQUXQAMSWFIRET-AVGNSLFASA-N 0.000 description 2
- OXRLYTYUXAQTHP-YUMQZZPRSA-N Leu-Gly-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](C)C(O)=O OXRLYTYUXAQTHP-YUMQZZPRSA-N 0.000 description 2
- BABSVXFGKFLIGW-UWVGGRQHSA-N Leu-Gly-Arg Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCNC(N)=N BABSVXFGKFLIGW-UWVGGRQHSA-N 0.000 description 2
- KGCLIYGPQXUNLO-IUCAKERBSA-N Leu-Gly-Glu Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O KGCLIYGPQXUNLO-IUCAKERBSA-N 0.000 description 2
- VGPCJSXPPOQPBK-YUMQZZPRSA-N Leu-Gly-Ser Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O VGPCJSXPPOQPBK-YUMQZZPRSA-N 0.000 description 2
- POZULHZYLPGXMR-ONGXEEELSA-N Leu-Gly-Val Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O POZULHZYLPGXMR-ONGXEEELSA-N 0.000 description 2
- ZRHDPZAAWLXXIR-SRVKXCTJSA-N Leu-Lys-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O ZRHDPZAAWLXXIR-SRVKXCTJSA-N 0.000 description 2
- INCJJHQRZGQLFC-KBPBESRZSA-N Leu-Phe-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(O)=O INCJJHQRZGQLFC-KBPBESRZSA-N 0.000 description 2
- MJWVXZABPOKJJF-ACRUOGEOSA-N Leu-Phe-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O MJWVXZABPOKJJF-ACRUOGEOSA-N 0.000 description 2
- PTRKPHUGYULXPU-KKUMJFAQSA-N Leu-Phe-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O PTRKPHUGYULXPU-KKUMJFAQSA-N 0.000 description 2
- VULJUQZPSOASBZ-SRVKXCTJSA-N Leu-Pro-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O VULJUQZPSOASBZ-SRVKXCTJSA-N 0.000 description 2
- AMSSKPUHBUQBOQ-SRVKXCTJSA-N Leu-Ser-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N AMSSKPUHBUQBOQ-SRVKXCTJSA-N 0.000 description 2
- GOFJOGXGMPHOGL-DCAQKATOSA-N Leu-Ser-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC(C)C GOFJOGXGMPHOGL-DCAQKATOSA-N 0.000 description 2
- SBANPBVRHYIMRR-UHFFFAOYSA-N Leu-Ser-Pro Natural products CC(C)CC(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O SBANPBVRHYIMRR-UHFFFAOYSA-N 0.000 description 2
- AIQWYVFNBNNOLU-RHYQMDGZSA-N Leu-Thr-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O AIQWYVFNBNNOLU-RHYQMDGZSA-N 0.000 description 2
- BTEMNFBEAAOGBR-BZSNNMDCSA-N Leu-Tyr-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCCN)C(=O)O)N BTEMNFBEAAOGBR-BZSNNMDCSA-N 0.000 description 2
- AAKRWBIIGKPOKQ-ONGXEEELSA-N Leu-Val-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O AAKRWBIIGKPOKQ-ONGXEEELSA-N 0.000 description 2
- VKVDRTGWLVZJOM-DCAQKATOSA-N Leu-Val-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O VKVDRTGWLVZJOM-DCAQKATOSA-N 0.000 description 2
- PNPYKQFJGRFYJE-GUBZILKMSA-N Lys-Ala-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O PNPYKQFJGRFYJE-GUBZILKMSA-N 0.000 description 2
- KCXUCYYZNZFGLL-SRVKXCTJSA-N Lys-Ala-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O KCXUCYYZNZFGLL-SRVKXCTJSA-N 0.000 description 2
- WSXTWLJHTLRFLW-SRVKXCTJSA-N Lys-Ala-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(O)=O WSXTWLJHTLRFLW-SRVKXCTJSA-N 0.000 description 2
- SWWCDAGDQHTKIE-RHYQMDGZSA-N Lys-Arg-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SWWCDAGDQHTKIE-RHYQMDGZSA-N 0.000 description 2
- HKCCVDWHHTVVPN-CIUDSAMLSA-N Lys-Asp-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O HKCCVDWHHTVVPN-CIUDSAMLSA-N 0.000 description 2
- AAORVPFVUIHEAB-YUMQZZPRSA-N Lys-Asp-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O AAORVPFVUIHEAB-YUMQZZPRSA-N 0.000 description 2
- WGCKDDHUFPQSMZ-ZPFDUUQYSA-N Lys-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCCCN WGCKDDHUFPQSMZ-ZPFDUUQYSA-N 0.000 description 2
- ZAENPHCEQXALHO-GUBZILKMSA-N Lys-Cys-Glu Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZAENPHCEQXALHO-GUBZILKMSA-N 0.000 description 2
- NDORZBUHCOJQDO-GVXVVHGQSA-N Lys-Gln-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O NDORZBUHCOJQDO-GVXVVHGQSA-N 0.000 description 2
- GJJQCBVRWDGLMQ-GUBZILKMSA-N Lys-Glu-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O GJJQCBVRWDGLMQ-GUBZILKMSA-N 0.000 description 2
- LPAJOCKCPRZEAG-MNXVOIDGSA-N Lys-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCCCN LPAJOCKCPRZEAG-MNXVOIDGSA-N 0.000 description 2
- PAMDBWYMLWOELY-SDDRHHMPSA-N Lys-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCCCN)N)C(=O)O PAMDBWYMLWOELY-SDDRHHMPSA-N 0.000 description 2
- WGLAORUKDGRINI-WDCWCFNPSA-N Lys-Glu-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WGLAORUKDGRINI-WDCWCFNPSA-N 0.000 description 2
- ITWQLSZTLBKWJM-YUMQZZPRSA-N Lys-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CCCCN ITWQLSZTLBKWJM-YUMQZZPRSA-N 0.000 description 2
- KYNNSEJZFVCDIV-ZPFDUUQYSA-N Lys-Ile-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O KYNNSEJZFVCDIV-ZPFDUUQYSA-N 0.000 description 2
- IUWMQCZOTYRXPL-ZPFDUUQYSA-N Lys-Ile-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O IUWMQCZOTYRXPL-ZPFDUUQYSA-N 0.000 description 2
- QOJDBRUCOXQSSK-AJNGGQMLSA-N Lys-Ile-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCCN)C(O)=O QOJDBRUCOXQSSK-AJNGGQMLSA-N 0.000 description 2
- MYZMQWHPDAYKIE-SRVKXCTJSA-N Lys-Leu-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O MYZMQWHPDAYKIE-SRVKXCTJSA-N 0.000 description 2
- OIQSIMFSVLLWBX-VOAKCMCISA-N Lys-Leu-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OIQSIMFSVLLWBX-VOAKCMCISA-N 0.000 description 2
- ALGGDNMLQNFVIZ-SRVKXCTJSA-N Lys-Lys-Asp Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)O)C(=O)O)N ALGGDNMLQNFVIZ-SRVKXCTJSA-N 0.000 description 2
- YUAXTFMFMOIMAM-QWRGUYRKSA-N Lys-Lys-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O YUAXTFMFMOIMAM-QWRGUYRKSA-N 0.000 description 2
- WBSCNDJQPKSPII-KKUMJFAQSA-N Lys-Lys-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O WBSCNDJQPKSPII-KKUMJFAQSA-N 0.000 description 2
- BOJYMMBYBNOOGG-DCAQKATOSA-N Lys-Pro-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O BOJYMMBYBNOOGG-DCAQKATOSA-N 0.000 description 2
- WGILOYIKJVQUPT-DCAQKATOSA-N Lys-Pro-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O WGILOYIKJVQUPT-DCAQKATOSA-N 0.000 description 2
- LUTDBHBIHHREDC-IHRRRGAJSA-N Lys-Pro-Lys Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(O)=O LUTDBHBIHHREDC-IHRRRGAJSA-N 0.000 description 2
- JMNRXRPBHFGXQX-GUBZILKMSA-N Lys-Ser-Glu Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O JMNRXRPBHFGXQX-GUBZILKMSA-N 0.000 description 2
- ZUGVARDEGWMMLK-SRVKXCTJSA-N Lys-Ser-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN ZUGVARDEGWMMLK-SRVKXCTJSA-N 0.000 description 2
- GIKFNMZSGYAPEJ-HJGDQZAQSA-N Lys-Thr-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O GIKFNMZSGYAPEJ-HJGDQZAQSA-N 0.000 description 2
- VHTOGMKQXXJOHG-RHYQMDGZSA-N Lys-Thr-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O VHTOGMKQXXJOHG-RHYQMDGZSA-N 0.000 description 2
- RMKJOQSYLQQRFN-KKUMJFAQSA-N Lys-Tyr-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O RMKJOQSYLQQRFN-KKUMJFAQSA-N 0.000 description 2
- VWPJQIHBBOJWDN-DCAQKATOSA-N Lys-Val-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O VWPJQIHBBOJWDN-DCAQKATOSA-N 0.000 description 2
- XABXVVSWUVCZST-GVXVVHGQSA-N Lys-Val-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN XABXVVSWUVCZST-GVXVVHGQSA-N 0.000 description 2
- HMZPYMSEAALNAE-ULQDDVLXSA-N Lys-Val-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O HMZPYMSEAALNAE-ULQDDVLXSA-N 0.000 description 2
- IKXQOBUBZSOWDY-AVGNSLFASA-N Lys-Val-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CCCCN)N IKXQOBUBZSOWDY-AVGNSLFASA-N 0.000 description 2
- 101150014737 MADS1 gene Proteins 0.000 description 2
- HLQWFLJOJRFXHO-CIUDSAMLSA-N Met-Glu-Ser Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O HLQWFLJOJRFXHO-CIUDSAMLSA-N 0.000 description 2
- CGUYGMFQZCYJSG-DCAQKATOSA-N Met-Lys-Ser Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O CGUYGMFQZCYJSG-DCAQKATOSA-N 0.000 description 2
- IHRFZLQEQVHXFA-RHYQMDGZSA-N Met-Thr-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCCN IHRFZLQEQVHXFA-RHYQMDGZSA-N 0.000 description 2
- 241000235395 Mucor Species 0.000 description 2
- PESQCPHRXOFIPX-UHFFFAOYSA-N N-L-methionyl-L-tyrosine Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 PESQCPHRXOFIPX-UHFFFAOYSA-N 0.000 description 2
- 101150101414 PRP1 gene Proteins 0.000 description 2
- 241000228143 Penicillium Species 0.000 description 2
- 241000228150 Penicillium chrysogenum Species 0.000 description 2
- AYPMIIKUMNADSU-IHRRRGAJSA-N Phe-Arg-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O AYPMIIKUMNADSU-IHRRRGAJSA-N 0.000 description 2
- DDYIRGBOZVKRFR-AVGNSLFASA-N Phe-Asp-Glu Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N DDYIRGBOZVKRFR-AVGNSLFASA-N 0.000 description 2
- WPTYDQPGBMDUBI-QWRGUYRKSA-N Phe-Gly-Asn Chemical compound N[C@@H](Cc1ccccc1)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O WPTYDQPGBMDUBI-QWRGUYRKSA-N 0.000 description 2
- WKTSCAXSYITIJJ-PCBIJLKTSA-N Phe-Ile-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O WKTSCAXSYITIJJ-PCBIJLKTSA-N 0.000 description 2
- RORUIHAWOLADSH-HJWJTTGWSA-N Phe-Ile-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC1=CC=CC=C1 RORUIHAWOLADSH-HJWJTTGWSA-N 0.000 description 2
- PEFJUUYFEGBXFA-BZSNNMDCSA-N Phe-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=CC=C1 PEFJUUYFEGBXFA-BZSNNMDCSA-N 0.000 description 2
- GRVMHFCZUIYNKQ-UFYCRDLUSA-N Phe-Phe-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O GRVMHFCZUIYNKQ-UFYCRDLUSA-N 0.000 description 2
- QARPMYDMYVLFMW-KKUMJFAQSA-N Phe-Pro-Glu Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CCC(O)=O)C(O)=O)C1=CC=CC=C1 QARPMYDMYVLFMW-KKUMJFAQSA-N 0.000 description 2
- QUUCAHIYARMNBL-FHWLQOOXSA-N Phe-Tyr-Gln Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N QUUCAHIYARMNBL-FHWLQOOXSA-N 0.000 description 2
- FYQSMXKJYTZYRP-DCAQKATOSA-N Pro-Ala-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 FYQSMXKJYTZYRP-DCAQKATOSA-N 0.000 description 2
- DRVIASBABBMZTF-GUBZILKMSA-N Pro-Ala-Met Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@@H]1CCCN1 DRVIASBABBMZTF-GUBZILKMSA-N 0.000 description 2
- QSKCKTUQPICLSO-AVGNSLFASA-N Pro-Arg-Lys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O QSKCKTUQPICLSO-AVGNSLFASA-N 0.000 description 2
- UVKNEILZSJMKSR-FXQIFTODSA-N Pro-Asn-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H]1CCCN1 UVKNEILZSJMKSR-FXQIFTODSA-N 0.000 description 2
- MGDFPGCFVJFITQ-CIUDSAMLSA-N Pro-Glu-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O MGDFPGCFVJFITQ-CIUDSAMLSA-N 0.000 description 2
- NMELOOXSGDRBRU-YUMQZZPRSA-N Pro-Glu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCC(=O)O)NC(=O)[C@@H]1CCCN1 NMELOOXSGDRBRU-YUMQZZPRSA-N 0.000 description 2
- UEHYFUCOGHWASA-HJGDQZAQSA-N Pro-Glu-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1 UEHYFUCOGHWASA-HJGDQZAQSA-N 0.000 description 2
- UUHXBJHVTVGSKM-BQBZGAKWSA-N Pro-Gly-Asn Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O UUHXBJHVTVGSKM-BQBZGAKWSA-N 0.000 description 2
- JMVQDLDPDBXAAX-YUMQZZPRSA-N Pro-Gly-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 JMVQDLDPDBXAAX-YUMQZZPRSA-N 0.000 description 2
- FKVNLUZHSFCNGY-RVMXOQNASA-N Pro-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 FKVNLUZHSFCNGY-RVMXOQNASA-N 0.000 description 2
- HFNPOYOKIPGAEI-SRVKXCTJSA-N Pro-Leu-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 HFNPOYOKIPGAEI-SRVKXCTJSA-N 0.000 description 2
- VTFXTWDFPTWNJY-RHYQMDGZSA-N Pro-Leu-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VTFXTWDFPTWNJY-RHYQMDGZSA-N 0.000 description 2
- AWQGDZBKQTYNMN-IHRRRGAJSA-N Pro-Phe-Asp Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)N[C@@H](CC(=O)O)C(=O)O AWQGDZBKQTYNMN-IHRRRGAJSA-N 0.000 description 2
- HOTVCUAVDQHUDB-UFYCRDLUSA-N Pro-Phe-Tyr Chemical compound C([C@@H](C(=O)O)NC(=O)[C@H](CC=1C=CC=CC=1)NC(=O)[C@H]1NCCC1)C1=CC=C(O)C=C1 HOTVCUAVDQHUDB-UFYCRDLUSA-N 0.000 description 2
- FDMKYQQYJKYCLV-GUBZILKMSA-N Pro-Pro-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 FDMKYQQYJKYCLV-GUBZILKMSA-N 0.000 description 2
- POQFNPILEQEODH-FXQIFTODSA-N Pro-Ser-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O POQFNPILEQEODH-FXQIFTODSA-N 0.000 description 2
- SNGZLPOXVRTNMB-LPEHRKFASA-N Pro-Ser-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CO)C(=O)N2CCC[C@@H]2C(=O)O SNGZLPOXVRTNMB-LPEHRKFASA-N 0.000 description 2
- GZNYIXWOIUFLGO-ZJDVBMNYSA-N Pro-Thr-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GZNYIXWOIUFLGO-ZJDVBMNYSA-N 0.000 description 2
- LEBTWGWVUVJNTA-FKBYEOEOSA-N Pro-Trp-Phe Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)N[C@@H](CC4=CC=CC=C4)C(=O)O LEBTWGWVUVJNTA-FKBYEOEOSA-N 0.000 description 2
- 238000010240 RT-PCR analysis Methods 0.000 description 2
- 108010025216 RVF peptide Proteins 0.000 description 2
- 101100368710 Rattus norvegicus Tacstd2 gene Proteins 0.000 description 2
- 102100027682 Retinoic acid receptor responder protein 1 Human genes 0.000 description 2
- 241000235403 Rhizomucor miehei Species 0.000 description 2
- 241000235527 Rhizopus Species 0.000 description 2
- 101100342406 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) PRS1 gene Proteins 0.000 description 2
- 241000222480 Schizophyllum Species 0.000 description 2
- BTKUIVBNGBFTTP-WHFBIAKZSA-N Ser-Ala-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)NCC(O)=O BTKUIVBNGBFTTP-WHFBIAKZSA-N 0.000 description 2
- HRNQLKCLPVKZNE-CIUDSAMLSA-N Ser-Ala-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O HRNQLKCLPVKZNE-CIUDSAMLSA-N 0.000 description 2
- VAUMZJHYZQXZBQ-WHFBIAKZSA-N Ser-Asn-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O VAUMZJHYZQXZBQ-WHFBIAKZSA-N 0.000 description 2
- CNIIKZQXBBQHCX-FXQIFTODSA-N Ser-Asp-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O CNIIKZQXBBQHCX-FXQIFTODSA-N 0.000 description 2
- GWMXFEMMBHOKDX-AVGNSLFASA-N Ser-Gln-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 GWMXFEMMBHOKDX-AVGNSLFASA-N 0.000 description 2
- YRBGKVIWMNEVCZ-WDSKDSINSA-N Ser-Glu-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O YRBGKVIWMNEVCZ-WDSKDSINSA-N 0.000 description 2
- UQFYNFTYDHUIMI-WHFBIAKZSA-N Ser-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CO UQFYNFTYDHUIMI-WHFBIAKZSA-N 0.000 description 2
- AEGUWTFAQQWVLC-BQBZGAKWSA-N Ser-Gly-Arg Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O AEGUWTFAQQWVLC-BQBZGAKWSA-N 0.000 description 2
- MIJWOJAXARLEHA-WDSKDSINSA-N Ser-Gly-Glu Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O MIJWOJAXARLEHA-WDSKDSINSA-N 0.000 description 2
- UIPXCLNLUUAMJU-JBDRJPRFSA-N Ser-Ile-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O UIPXCLNLUUAMJU-JBDRJPRFSA-N 0.000 description 2
- ZOPISOXXPQNOCO-SVSWQMSJSA-N Ser-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CO)N ZOPISOXXPQNOCO-SVSWQMSJSA-N 0.000 description 2
- HEUVHBXOVZONPU-BJDJZHNGSA-N Ser-Leu-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HEUVHBXOVZONPU-BJDJZHNGSA-N 0.000 description 2
- HDBOEVPDIDDEPC-CIUDSAMLSA-N Ser-Lys-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O HDBOEVPDIDDEPC-CIUDSAMLSA-N 0.000 description 2
- GVMUJUPXFQFBBZ-GUBZILKMSA-N Ser-Lys-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O GVMUJUPXFQFBBZ-GUBZILKMSA-N 0.000 description 2
- XUDRHBPSPAPDJP-SRVKXCTJSA-N Ser-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CO XUDRHBPSPAPDJP-SRVKXCTJSA-N 0.000 description 2
- MQUZANJDFOQOBX-SRVKXCTJSA-N Ser-Phe-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O MQUZANJDFOQOBX-SRVKXCTJSA-N 0.000 description 2
- FBLNYDYPCLFTSP-IXOXFDKPSA-N Ser-Phe-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FBLNYDYPCLFTSP-IXOXFDKPSA-N 0.000 description 2
- AZWNCEBQZXELEZ-FXQIFTODSA-N Ser-Pro-Ser Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O AZWNCEBQZXELEZ-FXQIFTODSA-N 0.000 description 2
- WLJPJRGQRNCIQS-ZLUOBGJFSA-N Ser-Ser-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O WLJPJRGQRNCIQS-ZLUOBGJFSA-N 0.000 description 2
- SRSPTFBENMJHMR-WHFBIAKZSA-N Ser-Ser-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SRSPTFBENMJHMR-WHFBIAKZSA-N 0.000 description 2
- NADLKBTYNKUJEP-KATARQTJSA-N Ser-Thr-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O NADLKBTYNKUJEP-KATARQTJSA-N 0.000 description 2
- UKKROEYWYIHWBD-ZKWXMUAHSA-N Ser-Val-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O UKKROEYWYIHWBD-ZKWXMUAHSA-N 0.000 description 2
- SYCFMSYTIFXWAJ-DCAQKATOSA-N Ser-Val-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CO)N SYCFMSYTIFXWAJ-DCAQKATOSA-N 0.000 description 2
- ANOQEBQWIAYIMV-AEJSXWLSSA-N Ser-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N ANOQEBQWIAYIMV-AEJSXWLSSA-N 0.000 description 2
- 102100028868 Serine/threonine-protein kinase PRP4 homolog Human genes 0.000 description 2
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 2
- IGROJMCBGRFRGI-YTLHQDLWSA-N Thr-Ala-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O IGROJMCBGRFRGI-YTLHQDLWSA-N 0.000 description 2
- FQPQPTHMHZKGFM-XQXXSGGOSA-N Thr-Ala-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O FQPQPTHMHZKGFM-XQXXSGGOSA-N 0.000 description 2
- UKBSDLHIKIXJKH-HJGDQZAQSA-N Thr-Arg-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O UKBSDLHIKIXJKH-HJGDQZAQSA-N 0.000 description 2
- YLXAMFZYJTZXFH-OLHMAJIHSA-N Thr-Asn-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O YLXAMFZYJTZXFH-OLHMAJIHSA-N 0.000 description 2
- YOSLMIPKOUAHKI-OLHMAJIHSA-N Thr-Asp-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O YOSLMIPKOUAHKI-OLHMAJIHSA-N 0.000 description 2
- UDQBCBUXAQIZAK-GLLZPBPUSA-N Thr-Glu-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O UDQBCBUXAQIZAK-GLLZPBPUSA-N 0.000 description 2
- HJOSVGCWOTYJFG-WDCWCFNPSA-N Thr-Glu-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N)O HJOSVGCWOTYJFG-WDCWCFNPSA-N 0.000 description 2
- XOTBWOCSLMBGMF-SUSMZKCASA-N Thr-Glu-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XOTBWOCSLMBGMF-SUSMZKCASA-N 0.000 description 2
- AQAMPXBRJJWPNI-JHEQGTHGSA-N Thr-Gly-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O AQAMPXBRJJWPNI-JHEQGTHGSA-N 0.000 description 2
- IMULJHHGAUZZFE-MBLNEYKQSA-N Thr-Gly-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O IMULJHHGAUZZFE-MBLNEYKQSA-N 0.000 description 2
- DJDSEDOKJTZBAR-ZDLURKLDSA-N Thr-Gly-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O DJDSEDOKJTZBAR-ZDLURKLDSA-N 0.000 description 2
- CRZNCABIJLRFKZ-IUKAMOBKSA-N Thr-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N CRZNCABIJLRFKZ-IUKAMOBKSA-N 0.000 description 2
- ADPHPKGWVDHWML-PPCPHDFISA-N Thr-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N ADPHPKGWVDHWML-PPCPHDFISA-N 0.000 description 2
- BVOVIGCHYNFJBZ-JXUBOQSCSA-N Thr-Leu-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O BVOVIGCHYNFJBZ-JXUBOQSCSA-N 0.000 description 2
- VTVVYQOXJCZVEB-WDCWCFNPSA-N Thr-Leu-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O VTVVYQOXJCZVEB-WDCWCFNPSA-N 0.000 description 2
- VRUFCJZQDACGLH-UVOCVTCTSA-N Thr-Leu-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VRUFCJZQDACGLH-UVOCVTCTSA-N 0.000 description 2
- ABWNZPOIUJMNKT-IXOXFDKPSA-N Thr-Phe-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O ABWNZPOIUJMNKT-IXOXFDKPSA-N 0.000 description 2
- MXNAOGFNFNKUPD-JHYOHUSXSA-N Thr-Phe-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MXNAOGFNFNKUPD-JHYOHUSXSA-N 0.000 description 2
- GVMXJJAJLIEASL-ZJDVBMNYSA-N Thr-Pro-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O GVMXJJAJLIEASL-ZJDVBMNYSA-N 0.000 description 2
- NQQMWWVVGIXUOX-SVSWQMSJSA-N Thr-Ser-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NQQMWWVVGIXUOX-SVSWQMSJSA-N 0.000 description 2
- IQPWNQRRAJHOKV-KATARQTJSA-N Thr-Ser-Lys Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN IQPWNQRRAJHOKV-KATARQTJSA-N 0.000 description 2
- VBMOVTMNHWPZJR-SUSMZKCASA-N Thr-Thr-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O VBMOVTMNHWPZJR-SUSMZKCASA-N 0.000 description 2
- CYCGARJWIQWPQM-YJRXYDGGSA-N Thr-Tyr-Ser Chemical compound C[C@@H](O)[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CO)C([O-])=O)CC1=CC=C(O)C=C1 CYCGARJWIQWPQM-YJRXYDGGSA-N 0.000 description 2
- WQYPAGQDXAJNED-AAEUAGOBSA-N Trp-Cys-Gly Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CS)C(=O)NCC(=O)O)N WQYPAGQDXAJNED-AAEUAGOBSA-N 0.000 description 2
- DYIXEGROAOVQPK-VFAJRCTISA-N Trp-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N)O DYIXEGROAOVQPK-VFAJRCTISA-N 0.000 description 2
- XGEUYEOEZYFHRL-KKXDTOCCSA-N Tyr-Ala-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 XGEUYEOEZYFHRL-KKXDTOCCSA-N 0.000 description 2
- OOEUVMFKKZYSRX-LEWSCRJBSA-N Tyr-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N OOEUVMFKKZYSRX-LEWSCRJBSA-N 0.000 description 2
- WVRUKYLYMFGKAN-IHRRRGAJSA-N Tyr-Glu-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 WVRUKYLYMFGKAN-IHRRRGAJSA-N 0.000 description 2
- GGXUDPQWAWRINY-XEGUGMAKSA-N Tyr-Ile-Gly Chemical compound OC(=O)CNC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 GGXUDPQWAWRINY-XEGUGMAKSA-N 0.000 description 2
- HRHYJNLMIJWGLF-BZSNNMDCSA-N Tyr-Ser-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 HRHYJNLMIJWGLF-BZSNNMDCSA-N 0.000 description 2
- PWKMJDQXKCENMF-MEYUZBJRSA-N Tyr-Thr-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O PWKMJDQXKCENMF-MEYUZBJRSA-N 0.000 description 2
- 102100040374 U4/U6 small nuclear ribonucleoprotein Prp3 Human genes 0.000 description 2
- RUCNAYOMFXRIKJ-DCAQKATOSA-N Val-Ala-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN RUCNAYOMFXRIKJ-DCAQKATOSA-N 0.000 description 2
- ZLFHAAGHGQBQQN-AEJSXWLSSA-N Val-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N ZLFHAAGHGQBQQN-AEJSXWLSSA-N 0.000 description 2
- KKHRWGYHBZORMQ-NHCYSSNCSA-N Val-Arg-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N KKHRWGYHBZORMQ-NHCYSSNCSA-N 0.000 description 2
- VUTHNLMCXKLLFI-LAEOZQHASA-N Val-Asp-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N VUTHNLMCXKLLFI-LAEOZQHASA-N 0.000 description 2
- QHDXUYOYTPWCSK-RCOVLWMOSA-N Val-Asp-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)NCC(=O)O)N QHDXUYOYTPWCSK-RCOVLWMOSA-N 0.000 description 2
- BMGOFDMKDVVGJG-NHCYSSNCSA-N Val-Asp-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N BMGOFDMKDVVGJG-NHCYSSNCSA-N 0.000 description 2
- SZTTYWIUCGSURQ-AUTRQRHGSA-N Val-Glu-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O SZTTYWIUCGSURQ-AUTRQRHGSA-N 0.000 description 2
- VCAWFLIWYNMHQP-UKJIMTQDSA-N Val-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C(C)C)N VCAWFLIWYNMHQP-UKJIMTQDSA-N 0.000 description 2
- ROLGIBMFNMZANA-GVXVVHGQSA-N Val-Glu-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C(C)C)N ROLGIBMFNMZANA-GVXVVHGQSA-N 0.000 description 2
- OQWNEUXPKHIEJO-NRPADANISA-N Val-Glu-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CO)C(=O)O)N OQWNEUXPKHIEJO-NRPADANISA-N 0.000 description 2
- UMPVMAYCLYMYGA-ONGXEEELSA-N Val-Leu-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O UMPVMAYCLYMYGA-ONGXEEELSA-N 0.000 description 2
- BTWMICVCQLKKNR-DCAQKATOSA-N Val-Leu-Ser Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C([O-])=O BTWMICVCQLKKNR-DCAQKATOSA-N 0.000 description 2
- KTEZUXISLQTDDQ-NHCYSSNCSA-N Val-Lys-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)O)C(=O)O)N KTEZUXISLQTDDQ-NHCYSSNCSA-N 0.000 description 2
- ZRSZTKTVPNSUNA-IHRRRGAJSA-N Val-Lys-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)C(C)C)C(O)=O ZRSZTKTVPNSUNA-IHRRRGAJSA-N 0.000 description 2
- KISFXYYRKKNLOP-IHRRRGAJSA-N Val-Phe-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)O)N KISFXYYRKKNLOP-IHRRRGAJSA-N 0.000 description 2
- XBJKAZATRJBDCU-GUBZILKMSA-N Val-Pro-Ala Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O XBJKAZATRJBDCU-GUBZILKMSA-N 0.000 description 2
- ZXYPHBKIZLAQTL-QXEWZRGKSA-N Val-Pro-Asp Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)O)C(=O)O)N ZXYPHBKIZLAQTL-QXEWZRGKSA-N 0.000 description 2
- GQMNEJMFMCJJTD-NHCYSSNCSA-N Val-Pro-Gln Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O GQMNEJMFMCJJTD-NHCYSSNCSA-N 0.000 description 2
- RYQUMYBMOJYYDK-NHCYSSNCSA-N Val-Pro-Glu Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(=O)O)C(=O)O)N RYQUMYBMOJYYDK-NHCYSSNCSA-N 0.000 description 2
- VHIZXDZMTDVFGX-DCAQKATOSA-N Val-Ser-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](C(C)C)N VHIZXDZMTDVFGX-DCAQKATOSA-N 0.000 description 2
- UQMPYVLTQCGRSK-IFFSRLJSSA-N Val-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N)O UQMPYVLTQCGRSK-IFFSRLJSSA-N 0.000 description 2
- JVGDAEKKZKKZFO-RCWTZXSCSA-N Val-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](C(C)C)N)O JVGDAEKKZKKZFO-RCWTZXSCSA-N 0.000 description 2
- 101150075580 Xyn1 gene Proteins 0.000 description 2
- NREIOERVEJDBJP-KFDLCVIWSA-N [3)-beta-D-ribosyl-(1->1)-D-ribitol-5-P-(O->]3 Chemical compound O[C@@H]1[C@H](O)[C@@H](CO)O[C@H]1OC[C@H](O)[C@H](O)[C@H](O)COP(O)(=O)O[C@@H]1[C@@H](CO)O[C@@H](OC[C@H](O)[C@H](O)[C@H](O)COP(O)(=O)O[C@@H]2[C@H](O[C@@H](OC[C@H](O)[C@H](O)[C@H](O)COP(O)(O)=O)[C@@H]2O)CO)[C@@H]1O NREIOERVEJDBJP-KFDLCVIWSA-N 0.000 description 2
- 108010048241 acetamidase Proteins 0.000 description 2
- 108010066875 alanyl-prolyl-tryptophyl-cysteine Proteins 0.000 description 2
- 108010045023 alanyl-prolyl-tyrosine Proteins 0.000 description 2
- 108010045350 alanyl-tyrosyl-alanine Proteins 0.000 description 2
- 108010041407 alanylaspartic acid Proteins 0.000 description 2
- 108010070944 alanylhistidine Proteins 0.000 description 2
- 108010070783 alanyltyrosine Proteins 0.000 description 2
- 108010084650 alpha-N-arabinofuranosidase Proteins 0.000 description 2
- 108010001271 arginyl-glutamyl-arginine Proteins 0.000 description 2
- 108010069926 arginyl-glycyl-serine Proteins 0.000 description 2
- 108010043240 arginyl-leucyl-glycine Proteins 0.000 description 2
- 108010068380 arginylarginine Proteins 0.000 description 2
- 108010036533 arginylvaline Proteins 0.000 description 2
- 108010068265 aspartyltyrosine Proteins 0.000 description 2
- 230000001651 autotrophic effect Effects 0.000 description 2
- 230000003115 biocidal effect Effects 0.000 description 2
- 238000004061 bleaching Methods 0.000 description 2
- 238000004364 calculation method Methods 0.000 description 2
- 229910052799 carbon Inorganic materials 0.000 description 2
- 108020001778 catalytic domains Proteins 0.000 description 2
- 238000004113 cell culture Methods 0.000 description 2
- 239000001913 cellulose Substances 0.000 description 2
- 229920002678 cellulose Polymers 0.000 description 2
- 238000004587 chromatography analysis Methods 0.000 description 2
- 238000004140 cleaning Methods 0.000 description 2
- 238000003776 cleavage reaction Methods 0.000 description 2
- 238000004590 computer program Methods 0.000 description 2
- 108010060199 cysteinylproline Proteins 0.000 description 2
- 108010069495 cysteinyltyrosine Proteins 0.000 description 2
- 239000010432 diamond Substances 0.000 description 2
- 238000010790 dilution Methods 0.000 description 2
- 239000012895 dilution Substances 0.000 description 2
- FSXRLASFHBWESK-UHFFFAOYSA-N dipeptide phenylalanyl-tyrosine Natural products C=1C=C(O)C=CC=1CC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FSXRLASFHBWESK-UHFFFAOYSA-N 0.000 description 2
- 108010054813 diprotin B Proteins 0.000 description 2
- 238000005516 engineering process Methods 0.000 description 2
- 210000003527 eukaryotic cell Anatomy 0.000 description 2
- 238000001914 filtration Methods 0.000 description 2
- 108020001507 fusion proteins Proteins 0.000 description 2
- 102000037865 fusion proteins Human genes 0.000 description 2
- 108010085059 glutamyl-arginyl-proline Proteins 0.000 description 2
- 108010057083 glutamyl-aspartyl-leucine Proteins 0.000 description 2
- HPAIKDPJURGQLN-UHFFFAOYSA-N glycyl-L-histidyl-L-phenylalanine Natural products C=1C=CC=CC=1CC(C(O)=O)NC(=O)C(NC(=O)CN)CC1=CN=CN1 HPAIKDPJURGQLN-UHFFFAOYSA-N 0.000 description 2
- JYPCXBJRLBHWME-UHFFFAOYSA-N glycyl-L-prolyl-L-arginine Natural products NCC(=O)N1CCCC1C(=O)NC(CCCN=C(N)N)C(O)=O JYPCXBJRLBHWME-UHFFFAOYSA-N 0.000 description 2
- 108010090037 glycyl-alanyl-isoleucine Proteins 0.000 description 2
- 108010075431 glycyl-alanyl-phenylalanine Proteins 0.000 description 2
- 108010051307 glycyl-glycyl-proline Proteins 0.000 description 2
- 108010025801 glycyl-prolyl-arginine Proteins 0.000 description 2
- 108010087823 glycyltyrosine Proteins 0.000 description 2
- 229940059442 hemicellulase Drugs 0.000 description 2
- 108010036413 histidylglycine Proteins 0.000 description 2
- 108010025306 histidylleucine Proteins 0.000 description 2
- 238000006460 hydrolysis reaction Methods 0.000 description 2
- 230000002209 hydrophobic effect Effects 0.000 description 2
- 230000001900 immune effect Effects 0.000 description 2
- 238000003018 immunoassay Methods 0.000 description 2
- 238000003780 insertion Methods 0.000 description 2
- 230000037431 insertion Effects 0.000 description 2
- 230000003834 intracellular effect Effects 0.000 description 2
- 238000002955 isolation Methods 0.000 description 2
- 108010053037 kyotorphin Proteins 0.000 description 2
- 108010000761 leucylarginine Proteins 0.000 description 2
- 108010045397 lysyl-tyrosyl-lysine Proteins 0.000 description 2
- 108010017391 lysylvaline Proteins 0.000 description 2
- 230000001404 mediated effect Effects 0.000 description 2
- 239000012528 membrane Substances 0.000 description 2
- 229930182817 methionine Natural products 0.000 description 2
- 108010068488 methionylphenylalanine Proteins 0.000 description 2
- 235000015097 nutrients Nutrition 0.000 description 2
- 150000002894 organic compounds Chemical class 0.000 description 2
- 230000002018 overexpression Effects 0.000 description 2
- 239000002245 particle Substances 0.000 description 2
- 239000008188 pellet Substances 0.000 description 2
- 230000035699 permeability Effects 0.000 description 2
- 108010070409 phenylalanyl-glycyl-glycine Proteins 0.000 description 2
- 108010073025 phenylalanylphenylalanine Proteins 0.000 description 2
- 238000002360 preparation method Methods 0.000 description 2
- 239000000047 product Substances 0.000 description 2
- 210000001236 prokaryotic cell Anatomy 0.000 description 2
- 108010020755 prolyl-glycyl-glycine Proteins 0.000 description 2
- 108010077112 prolyl-proline Proteins 0.000 description 2
- 108010079317 prolyl-tyrosine Proteins 0.000 description 2
- 108010015796 prolylisoleucine Proteins 0.000 description 2
- 108010090894 prolylleucine Proteins 0.000 description 2
- 238000001742 protein purification Methods 0.000 description 2
- 230000007017 scission Effects 0.000 description 2
- 238000012163 sequencing technique Methods 0.000 description 2
- 239000013605 shuttle vector Substances 0.000 description 2
- 239000000243 solution Substances 0.000 description 2
- 239000011550 stock solution Substances 0.000 description 2
- 230000002103 transcriptional effect Effects 0.000 description 2
- 238000001890 transfection Methods 0.000 description 2
- 238000003151 transfection method Methods 0.000 description 2
- 238000011426 transformation method Methods 0.000 description 2
- 230000009261 transgenic effect Effects 0.000 description 2
- 230000005945 translocation Effects 0.000 description 2
- 101150016309 trpC gene Proteins 0.000 description 2
- 108010084932 tryptophyl-proline Proteins 0.000 description 2
- IBIDRSSEHFLGSD-UHFFFAOYSA-N valinyl-arginine Natural products CC(C)C(N)C(=O)NC(C(O)=O)CCCN=C(N)N IBIDRSSEHFLGSD-UHFFFAOYSA-N 0.000 description 2
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 2
- 101150041186 xyn2 gene Proteins 0.000 description 2
- MTCFGRXMJLQNBG-REOHCLBHSA-N (2S)-2-Amino-3-hydroxypropanoic acid Chemical compound OC[C@H](N)C(O)=O MTCFGRXMJLQNBG-REOHCLBHSA-N 0.000 description 1
- CWFMWBHMIMNZLN-NAKRPEOUSA-N (2s)-1-[(2s)-2-[[(2s,3s)-2-amino-3-methylpentanoyl]amino]propanoyl]pyrrolidine-2-carboxylic acid Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O CWFMWBHMIMNZLN-NAKRPEOUSA-N 0.000 description 1
- AXFMEGAFCUULFV-BLFANLJRSA-N (2s)-2-[[(2s)-1-[(2s,3r)-2-amino-3-methylpentanoyl]pyrrolidine-2-carbonyl]amino]pentanedioic acid Chemical compound CC[C@@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O AXFMEGAFCUULFV-BLFANLJRSA-N 0.000 description 1
- OGILYBDMVOATLU-CQJMVLFOSA-N (2s)-2-[[(2s)-2-amino-3-(4-hydroxyphenyl)propanoyl]amino]-n-[(2s)-1-[[(2s)-1-amino-1-oxo-3-phenylpropan-2-yl]amino]-4-methyl-1-oxopentan-2-yl]-4-methylpentanamide Chemical compound C([C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(N)=O)C1=CC=C(O)C=C1 OGILYBDMVOATLU-CQJMVLFOSA-N 0.000 description 1
- JBFQOLHAGBKPTP-NZATWWQASA-N (2s)-2-[[(2s)-4-carboxy-2-[[3-carboxy-2-[[(2s)-2,6-diaminohexanoyl]amino]propanoyl]amino]butanoyl]amino]-4-methylpentanoic acid Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)C(CC(O)=O)NC(=O)[C@@H](N)CCCCN JBFQOLHAGBKPTP-NZATWWQASA-N 0.000 description 1
- SMWADGDVGCZIGK-AXDSSHIGSA-N (2s)-5-phenylpyrrolidine-2-carboxylic acid Chemical compound N1[C@H](C(=O)O)CCC1C1=CC=CC=C1 SMWADGDVGCZIGK-AXDSSHIGSA-N 0.000 description 1
- JWBYADXJYCNKIE-SYKZBELTSA-N (2s)-5-phenylpyrrolidine-2-carboxylic acid;(2s)-pyrrolidine-2-carboxylic acid Chemical compound OC(=O)[C@@H]1CCCN1.N1[C@H](C(=O)O)CCC1C1=CC=CC=C1 JWBYADXJYCNKIE-SYKZBELTSA-N 0.000 description 1
- HZKLCOYAVAAQRD-VGMNWLOBSA-N (3s)-3-[[2-[[(2s)-2-amino-5-(diaminomethylideneamino)pentanoyl]amino]acetyl]amino]-4-[[(1r)-1-carboxyethyl]amino]-4-oxobutanoic acid Chemical compound OC(=O)[C@@H](C)NC(=O)[C@H](CC(O)=O)NC(=O)CNC(=O)[C@@H](N)CCCN=C(N)N HZKLCOYAVAAQRD-VGMNWLOBSA-N 0.000 description 1
- CUVSTAMIHSSVKL-UWVGGRQHSA-N (4s)-4-[(2-aminoacetyl)amino]-5-[[(2s)-6-amino-1-(carboxymethylamino)-1-oxohexan-2-yl]amino]-5-oxopentanoic acid Chemical compound NCCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)CN CUVSTAMIHSSVKL-UWVGGRQHSA-N 0.000 description 1
- PXFBZOLANLWPMH-UHFFFAOYSA-N 16-Epiaffinine Natural products C1C(C2=CC=CC=C2N2)=C2C(=O)CC2C(=CC)CN(C)C1C2CO PXFBZOLANLWPMH-UHFFFAOYSA-N 0.000 description 1
- WEZDRVHTDXTVLT-GJZGRUSLSA-N 2-[[(2s)-2-[[(2s)-2-[(2-aminoacetyl)amino]-3-phenylpropanoyl]amino]-4-methylpentanoyl]amino]acetic acid Chemical compound OC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 WEZDRVHTDXTVLT-GJZGRUSLSA-N 0.000 description 1
- QVOBNSFUVPLVPE-ROUUACIJSA-N 2-[[(2s)-2-[[2-[[(2s)-2-amino-3-phenylpropanoyl]amino]acetyl]amino]-3-phenylpropanoyl]amino]acetic acid Chemical compound C([C@H](N)C(=O)NCC(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)NCC(O)=O)C1=CC=CC=C1 QVOBNSFUVPLVPE-ROUUACIJSA-N 0.000 description 1
- OTEWWRBKGONZBW-UHFFFAOYSA-N 2-[[2-[[2-[(2-azaniumylacetyl)amino]-4-methylpentanoyl]amino]acetyl]amino]acetate Chemical compound NCC(=O)NC(CC(C)C)C(=O)NCC(=O)NCC(O)=O OTEWWRBKGONZBW-UHFFFAOYSA-N 0.000 description 1
- XWTNPSHCJMZAHQ-QMMMGPOBSA-N 2-[[2-[[2-[[(2s)-2-amino-4-methylpentanoyl]amino]acetyl]amino]acetyl]amino]acetic acid Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)NCC(=O)NCC(O)=O XWTNPSHCJMZAHQ-QMMMGPOBSA-N 0.000 description 1
- XJFPXLWGZWAWRQ-UHFFFAOYSA-N 2-[[2-[[2-[[2-[[2-[(2-azaniumylacetyl)amino]acetyl]amino]acetyl]amino]acetyl]amino]acetyl]amino]acetate Chemical compound NCC(=O)NCC(=O)NCC(=O)NCC(=O)NCC(=O)NCC(O)=O XJFPXLWGZWAWRQ-UHFFFAOYSA-N 0.000 description 1
- 108010011619 6-Phytase Proteins 0.000 description 1
- 241000235389 Absidia Species 0.000 description 1
- 108700016155 Acyl transferases Proteins 0.000 description 1
- 102000057234 Acyl transferases Human genes 0.000 description 1
- 229930024421 Adenine Natural products 0.000 description 1
- GFFGJBXGBJISGV-UHFFFAOYSA-N Adenine Chemical compound NC1=NC=NC2=C1N=CN2 GFFGJBXGBJISGV-UHFFFAOYSA-N 0.000 description 1
- 229920001817 Agar Polymers 0.000 description 1
- 241000222518 Agaricus Species 0.000 description 1
- 241000589158 Agrobacterium Species 0.000 description 1
- RLMISHABBKUNFO-WHFBIAKZSA-N Ala-Ala-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O RLMISHABBKUNFO-WHFBIAKZSA-N 0.000 description 1
- PIPTUBPKYFRLCP-NHCYSSNCSA-N Ala-Ala-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 PIPTUBPKYFRLCP-NHCYSSNCSA-N 0.000 description 1
- CXRCVCURMBFFOL-FXQIFTODSA-N Ala-Ala-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O CXRCVCURMBFFOL-FXQIFTODSA-N 0.000 description 1
- JBVSSSZFNTXJDX-YTLHQDLWSA-N Ala-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@H](C)N JBVSSSZFNTXJDX-YTLHQDLWSA-N 0.000 description 1
- DVWVZSJAYIJZFI-FXQIFTODSA-N Ala-Arg-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O DVWVZSJAYIJZFI-FXQIFTODSA-N 0.000 description 1
- IMMKUCQIKKXKNP-DCAQKATOSA-N Ala-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CCCN=C(N)N IMMKUCQIKKXKNP-DCAQKATOSA-N 0.000 description 1
- YAXNATKKPOWVCP-ZLUOBGJFSA-N Ala-Asn-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O YAXNATKKPOWVCP-ZLUOBGJFSA-N 0.000 description 1
- CVGNCMIULZNYES-WHFBIAKZSA-N Ala-Asn-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O CVGNCMIULZNYES-WHFBIAKZSA-N 0.000 description 1
- NXSFUECZFORGOG-CIUDSAMLSA-N Ala-Asn-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NXSFUECZFORGOG-CIUDSAMLSA-N 0.000 description 1
- XQGIRPGAVLFKBJ-CIUDSAMLSA-N Ala-Asn-Lys Chemical compound N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)O XQGIRPGAVLFKBJ-CIUDSAMLSA-N 0.000 description 1
- SHYYAQLDNVHPFT-DLOVCJGASA-N Ala-Asn-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 SHYYAQLDNVHPFT-DLOVCJGASA-N 0.000 description 1
- XCVRVWZTXPCYJT-BIIVOSGPSA-N Ala-Asn-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N XCVRVWZTXPCYJT-BIIVOSGPSA-N 0.000 description 1
- NHCPCLJZRSIDHS-ZLUOBGJFSA-N Ala-Asp-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O NHCPCLJZRSIDHS-ZLUOBGJFSA-N 0.000 description 1
- PBAMJJXWDQXOJA-FXQIFTODSA-N Ala-Asp-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N PBAMJJXWDQXOJA-FXQIFTODSA-N 0.000 description 1
- KIUYPHAMDKDICO-WHFBIAKZSA-N Ala-Asp-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O KIUYPHAMDKDICO-WHFBIAKZSA-N 0.000 description 1
- LSLIRHLIUDVNBN-CIUDSAMLSA-N Ala-Asp-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN LSLIRHLIUDVNBN-CIUDSAMLSA-N 0.000 description 1
- BUDNAJYVCUHLSV-ZLUOBGJFSA-N Ala-Asp-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O BUDNAJYVCUHLSV-ZLUOBGJFSA-N 0.000 description 1
- WCBVQNZTOKJWJS-ACZMJKKPSA-N Ala-Cys-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(O)=O WCBVQNZTOKJWJS-ACZMJKKPSA-N 0.000 description 1
- KRHRBKYBJXMYBB-WHFBIAKZSA-N Ala-Cys-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CS)C(=O)NCC(O)=O KRHRBKYBJXMYBB-WHFBIAKZSA-N 0.000 description 1
- UQJUGHFKNKGHFQ-VZFHVOOUSA-N Ala-Cys-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)O)C(O)=O UQJUGHFKNKGHFQ-VZFHVOOUSA-N 0.000 description 1
- LGFCAXJBAZESCF-ACZMJKKPSA-N Ala-Gln-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O LGFCAXJBAZESCF-ACZMJKKPSA-N 0.000 description 1
- CSAHOYQKNHGDHX-ACZMJKKPSA-N Ala-Gln-Asn Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O CSAHOYQKNHGDHX-ACZMJKKPSA-N 0.000 description 1
- CZPAHAKGPDUIPJ-CIUDSAMLSA-N Ala-Gln-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(O)=O CZPAHAKGPDUIPJ-CIUDSAMLSA-N 0.000 description 1
- MVBWLRJESQOQTM-ACZMJKKPSA-N Ala-Gln-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O MVBWLRJESQOQTM-ACZMJKKPSA-N 0.000 description 1
- SFNFGFDRYJKZKN-XQXXSGGOSA-N Ala-Gln-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](C)N)O SFNFGFDRYJKZKN-XQXXSGGOSA-N 0.000 description 1
- FUSPCLTUKXQREV-ACZMJKKPSA-N Ala-Glu-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O FUSPCLTUKXQREV-ACZMJKKPSA-N 0.000 description 1
- KXEVYGKATAMXJJ-ACZMJKKPSA-N Ala-Glu-Asp Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O KXEVYGKATAMXJJ-ACZMJKKPSA-N 0.000 description 1
- BGNLUHXLSAQYRQ-FXQIFTODSA-N Ala-Glu-Gln Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O BGNLUHXLSAQYRQ-FXQIFTODSA-N 0.000 description 1
- FBHOPGDGELNWRH-DRZSPHRISA-N Ala-Glu-Phe Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O FBHOPGDGELNWRH-DRZSPHRISA-N 0.000 description 1
- VBRDBGCROKWTPV-XHNCKOQMSA-N Ala-Glu-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N VBRDBGCROKWTPV-XHNCKOQMSA-N 0.000 description 1
- PUBLUECXJRHTBK-ACZMJKKPSA-N Ala-Glu-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O PUBLUECXJRHTBK-ACZMJKKPSA-N 0.000 description 1
- XYTNPQNAZREREP-XQXXSGGOSA-N Ala-Glu-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XYTNPQNAZREREP-XQXXSGGOSA-N 0.000 description 1
- ROLXPVQSRCPVGK-XDTLVQLUSA-N Ala-Glu-Tyr Chemical compound N[C@@H](C)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O ROLXPVQSRCPVGK-XDTLVQLUSA-N 0.000 description 1
- ZVFVBBGVOILKPO-WHFBIAKZSA-N Ala-Gly-Ala Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O ZVFVBBGVOILKPO-WHFBIAKZSA-N 0.000 description 1
- WGDNWOMKBUXFHR-BQBZGAKWSA-N Ala-Gly-Arg Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N WGDNWOMKBUXFHR-BQBZGAKWSA-N 0.000 description 1
- NHLAEBFGWPXFGI-WHFBIAKZSA-N Ala-Gly-Asn Chemical compound C[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)N)C(=O)O)N NHLAEBFGWPXFGI-WHFBIAKZSA-N 0.000 description 1
- WMYJZJRILUVVRG-WDSKDSINSA-N Ala-Gly-Gln Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O WMYJZJRILUVVRG-WDSKDSINSA-N 0.000 description 1
- LMFXXZPPZDCPTA-ZKWXMUAHSA-N Ala-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N LMFXXZPPZDCPTA-ZKWXMUAHSA-N 0.000 description 1
- PCIFXPRIFWKWLK-YUMQZZPRSA-N Ala-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N PCIFXPRIFWKWLK-YUMQZZPRSA-N 0.000 description 1
- CWEAKSWWKHGTRJ-BQBZGAKWSA-N Ala-Gly-Met Chemical compound [H]N[C@@H](C)C(=O)NCC(=O)N[C@@H](CCSC)C(O)=O CWEAKSWWKHGTRJ-BQBZGAKWSA-N 0.000 description 1
- MQIGTEQXYCRLGK-BQBZGAKWSA-N Ala-Gly-Pro Chemical compound C[C@H](N)C(=O)NCC(=O)N1CCC[C@H]1C(O)=O MQIGTEQXYCRLGK-BQBZGAKWSA-N 0.000 description 1
- SMCGQGDVTPFXKB-XPUUQOCRSA-N Ala-Gly-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N SMCGQGDVTPFXKB-XPUUQOCRSA-N 0.000 description 1
- FDAZDMAFZYTHGS-XVYDVKMFSA-N Ala-His-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(O)=O FDAZDMAFZYTHGS-XVYDVKMFSA-N 0.000 description 1
- 108010076441 Ala-His-His Proteins 0.000 description 1
- HJGZVLLLBJLXFC-LSJOCFKGSA-N Ala-His-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C(C)C)C(O)=O HJGZVLLLBJLXFC-LSJOCFKGSA-N 0.000 description 1
- PNALXAODQKTNLV-JBDRJPRFSA-N Ala-Ile-Ala Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O PNALXAODQKTNLV-JBDRJPRFSA-N 0.000 description 1
- NYDBKUNVSALYPX-NAKRPEOUSA-N Ala-Ile-Arg Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CCCN=C(N)N NYDBKUNVSALYPX-NAKRPEOUSA-N 0.000 description 1
- IFKQPMZRDQZSHI-GHCJXIJMSA-N Ala-Ile-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O IFKQPMZRDQZSHI-GHCJXIJMSA-N 0.000 description 1
- VNYMOTCMNHJGTG-JBDRJPRFSA-N Ala-Ile-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O VNYMOTCMNHJGTG-JBDRJPRFSA-N 0.000 description 1
- QQACQIHVWCVBBR-GVARAGBVSA-N Ala-Ile-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O QQACQIHVWCVBBR-GVARAGBVSA-N 0.000 description 1
- LNNSWWRRYJLGNI-NAKRPEOUSA-N Ala-Ile-Val Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O LNNSWWRRYJLGNI-NAKRPEOUSA-N 0.000 description 1
- RUQBGIMJQUWXPP-CYDGBPFRSA-N Ala-Leu-Ala-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O RUQBGIMJQUWXPP-CYDGBPFRSA-N 0.000 description 1
- HHRAXZAYZFFRAM-CIUDSAMLSA-N Ala-Leu-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O HHRAXZAYZFFRAM-CIUDSAMLSA-N 0.000 description 1
- LBYMZCVBOKYZNS-CIUDSAMLSA-N Ala-Leu-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O LBYMZCVBOKYZNS-CIUDSAMLSA-N 0.000 description 1
- NOGFDULFCFXBHB-CIUDSAMLSA-N Ala-Leu-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)O)N NOGFDULFCFXBHB-CIUDSAMLSA-N 0.000 description 1
- SUMYEVXWCAYLLJ-GUBZILKMSA-N Ala-Leu-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O SUMYEVXWCAYLLJ-GUBZILKMSA-N 0.000 description 1
- CCDFBRZVTDDJNM-GUBZILKMSA-N Ala-Leu-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O CCDFBRZVTDDJNM-GUBZILKMSA-N 0.000 description 1
- MNZHHDPWDWQJCQ-YUMQZZPRSA-N Ala-Leu-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O MNZHHDPWDWQJCQ-YUMQZZPRSA-N 0.000 description 1
- DPNZTBKGAUAZQU-DLOVCJGASA-N Ala-Leu-His Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N DPNZTBKGAUAZQU-DLOVCJGASA-N 0.000 description 1
- OYJCVIGKMXUVKB-GARJFASQSA-N Ala-Leu-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N OYJCVIGKMXUVKB-GARJFASQSA-N 0.000 description 1
- MEFILNJXAVSUTO-JXUBOQSCSA-N Ala-Leu-Thr Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MEFILNJXAVSUTO-JXUBOQSCSA-N 0.000 description 1
- FCXAUASCMJOFEY-NDKCEZKHSA-N Ala-Leu-Thr-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(O)=O FCXAUASCMJOFEY-NDKCEZKHSA-N 0.000 description 1
- AJBVYEYZVYPFCF-CIUDSAMLSA-N Ala-Lys-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O AJBVYEYZVYPFCF-CIUDSAMLSA-N 0.000 description 1
- SDZRIBWEVVRDQI-CIUDSAMLSA-N Ala-Lys-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O SDZRIBWEVVRDQI-CIUDSAMLSA-N 0.000 description 1
- PIXQDIGKDNNOOV-GUBZILKMSA-N Ala-Lys-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O PIXQDIGKDNNOOV-GUBZILKMSA-N 0.000 description 1
- XHNLCGXYBXNRIS-BJDJZHNGSA-N Ala-Lys-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O XHNLCGXYBXNRIS-BJDJZHNGSA-N 0.000 description 1
- BLTRAARCJYVJKV-QEJZJMRPSA-N Ala-Lys-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](Cc1ccccc1)C(O)=O BLTRAARCJYVJKV-QEJZJMRPSA-N 0.000 description 1
- NLOMBWNGESDVJU-GUBZILKMSA-N Ala-Met-Arg Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NLOMBWNGESDVJU-GUBZILKMSA-N 0.000 description 1
- DEWWPUNXRNGMQN-LPEHRKFASA-N Ala-Met-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N1CCC[C@@H]1C(=O)O)N DEWWPUNXRNGMQN-LPEHRKFASA-N 0.000 description 1
- YCRAFFCYWOUEOF-DLOVCJGASA-N Ala-Phe-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 YCRAFFCYWOUEOF-DLOVCJGASA-N 0.000 description 1
- DXTYEWAQOXYRHZ-KKXDTOCCSA-N Ala-Phe-Tyr Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)N DXTYEWAQOXYRHZ-KKXDTOCCSA-N 0.000 description 1
- IHMCQESUJVZTKW-UBHSHLNASA-N Ala-Phe-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CC1=CC=CC=C1 IHMCQESUJVZTKW-UBHSHLNASA-N 0.000 description 1
- IPZQNYYAYVRKKK-FXQIFTODSA-N Ala-Pro-Ala Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O IPZQNYYAYVRKKK-FXQIFTODSA-N 0.000 description 1
- MAZZQZWCCYJQGZ-GUBZILKMSA-N Ala-Pro-Arg Chemical compound [H]N[C@@H](C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O MAZZQZWCCYJQGZ-GUBZILKMSA-N 0.000 description 1
- BTRULDJUUVGRNE-DCAQKATOSA-N Ala-Pro-Lys Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(O)=O BTRULDJUUVGRNE-DCAQKATOSA-N 0.000 description 1
- GMGWOTQMUKYZIE-UBHSHLNASA-N Ala-Pro-Phe Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 GMGWOTQMUKYZIE-UBHSHLNASA-N 0.000 description 1
- FFZJHQODAYHGPO-KZVJFYERSA-N Ala-Pro-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C)N FFZJHQODAYHGPO-KZVJFYERSA-N 0.000 description 1
- JNLDTVRGXMSYJC-UVBJJODRSA-N Ala-Pro-Trp Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(O)=O JNLDTVRGXMSYJC-UVBJJODRSA-N 0.000 description 1
- CQJHFKKGZXKZBC-BPNCWPANSA-N Ala-Pro-Tyr Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 CQJHFKKGZXKZBC-BPNCWPANSA-N 0.000 description 1
- YHBDGLZYNIARKJ-GUBZILKMSA-N Ala-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C)N YHBDGLZYNIARKJ-GUBZILKMSA-N 0.000 description 1
- KLALXKYLOMZDQT-ZLUOBGJFSA-N Ala-Ser-Asn Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(N)=O KLALXKYLOMZDQT-ZLUOBGJFSA-N 0.000 description 1
- YYAVDNKUWLAFCV-ACZMJKKPSA-N Ala-Ser-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O YYAVDNKUWLAFCV-ACZMJKKPSA-N 0.000 description 1
- MSWSRLGNLKHDEI-ACZMJKKPSA-N Ala-Ser-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O MSWSRLGNLKHDEI-ACZMJKKPSA-N 0.000 description 1
- RTZCUEHYUQZIDE-WHFBIAKZSA-N Ala-Ser-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RTZCUEHYUQZIDE-WHFBIAKZSA-N 0.000 description 1
- HOVPGJUNRLMIOZ-CIUDSAMLSA-N Ala-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@H](C)N HOVPGJUNRLMIOZ-CIUDSAMLSA-N 0.000 description 1
- OEVCHROQUIVQFZ-YTLHQDLWSA-N Ala-Thr-Ala Chemical compound C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](C)C(O)=O OEVCHROQUIVQFZ-YTLHQDLWSA-N 0.000 description 1
- VRTOMXFZHGWHIJ-KZVJFYERSA-N Ala-Thr-Arg Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O VRTOMXFZHGWHIJ-KZVJFYERSA-N 0.000 description 1
- LSMDIAAALJJLRO-XQXXSGGOSA-N Ala-Thr-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O LSMDIAAALJJLRO-XQXXSGGOSA-N 0.000 description 1
- VNFSAYFQLXPHPY-CIQUZCHMSA-N Ala-Thr-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VNFSAYFQLXPHPY-CIQUZCHMSA-N 0.000 description 1
- IOFVWPYSRSCWHI-JXUBOQSCSA-N Ala-Thr-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](C)N IOFVWPYSRSCWHI-JXUBOQSCSA-N 0.000 description 1
- SAHQGRZIQVEJPF-JXUBOQSCSA-N Ala-Thr-Lys Chemical compound C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CCCCN SAHQGRZIQVEJPF-JXUBOQSCSA-N 0.000 description 1
- KTXKIYXZQFWJKB-VZFHVOOUSA-N Ala-Thr-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O KTXKIYXZQFWJKB-VZFHVOOUSA-N 0.000 description 1
- LTTLSZVJTDSACD-OWLDWWDNSA-N Ala-Thr-Trp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O LTTLSZVJTDSACD-OWLDWWDNSA-N 0.000 description 1
- IETUUAHKCHOQHP-KZVJFYERSA-N Ala-Thr-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@H](C)N)[C@@H](C)O)C(O)=O IETUUAHKCHOQHP-KZVJFYERSA-N 0.000 description 1
- ZVWXMTTZJKBJCI-BHDSKKPTSA-N Ala-Trp-Ala Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](C)C(O)=O)=CNC2=C1 ZVWXMTTZJKBJCI-BHDSKKPTSA-N 0.000 description 1
- IEAUDUOCWNPZBR-LKTVYLICSA-N Ala-Trp-Gln Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N IEAUDUOCWNPZBR-LKTVYLICSA-N 0.000 description 1
- UBTKNYUAMYRMKE-GOPGUHFVSA-N Ala-Trp-His Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC3=CN=CN3)C(=O)O)N UBTKNYUAMYRMKE-GOPGUHFVSA-N 0.000 description 1
- CWRBRVZBMVJENN-UVBJJODRSA-N Ala-Trp-Met Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CCSC)C(=O)O)N CWRBRVZBMVJENN-UVBJJODRSA-N 0.000 description 1
- AENHOIXXHKNIQL-AUTRQRHGSA-N Ala-Tyr-Ala Chemical compound [O-]C(=O)[C@H](C)NC(=O)[C@@H](NC(=O)[C@@H]([NH3+])C)CC1=CC=C(O)C=C1 AENHOIXXHKNIQL-AUTRQRHGSA-N 0.000 description 1
- GCTANJIJJROSLH-GVARAGBVSA-N Ala-Tyr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](C)N GCTANJIJJROSLH-GVARAGBVSA-N 0.000 description 1
- XKXAZPSREVUCRT-BPNCWPANSA-N Ala-Tyr-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CC1=CC=C(O)C=C1 XKXAZPSREVUCRT-BPNCWPANSA-N 0.000 description 1
- YEBZNKPPOHFZJM-BPNCWPANSA-N Ala-Tyr-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O YEBZNKPPOHFZJM-BPNCWPANSA-N 0.000 description 1
- IYKVSFNGSWTTNZ-GUBZILKMSA-N Ala-Val-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IYKVSFNGSWTTNZ-GUBZILKMSA-N 0.000 description 1
- BOKLLPVAQDSLHC-FXQIFTODSA-N Ala-Val-Cys Chemical compound C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CS)C(=O)O)N BOKLLPVAQDSLHC-FXQIFTODSA-N 0.000 description 1
- VHAQSYHSDKERBS-XPUUQOCRSA-N Ala-Val-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O VHAQSYHSDKERBS-XPUUQOCRSA-N 0.000 description 1
- XCIGOVDXZULBBV-DCAQKATOSA-N Ala-Val-Lys Chemical compound CC(C)[C@H](NC(=O)[C@H](C)N)C(=O)N[C@@H](CCCCN)C(O)=O XCIGOVDXZULBBV-DCAQKATOSA-N 0.000 description 1
- NLYYHIKRBRMAJV-AEJSXWLSSA-N Ala-Val-Pro Chemical compound C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N NLYYHIKRBRMAJV-AEJSXWLSSA-N 0.000 description 1
- OMSKGWFGWCQFBD-KZVJFYERSA-N Ala-Val-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OMSKGWFGWCQFBD-KZVJFYERSA-N 0.000 description 1
- 102100033770 Alpha-amylase 1C Human genes 0.000 description 1
- 108010065511 Amylases Proteins 0.000 description 1
- 229920000945 Amylopectin Polymers 0.000 description 1
- 229920000856 Amylose Polymers 0.000 description 1
- 241001510441 Anaeromyces Species 0.000 description 1
- SGYSTDWPNPKJPP-GUBZILKMSA-N Arg-Ala-Arg Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O SGYSTDWPNPKJPP-GUBZILKMSA-N 0.000 description 1
- VBFJESQBIWCWRL-DCAQKATOSA-N Arg-Ala-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCNC(N)=N VBFJESQBIWCWRL-DCAQKATOSA-N 0.000 description 1
- OLDOLPWZEMHNIA-PJODQICGSA-N Arg-Ala-Trp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O OLDOLPWZEMHNIA-PJODQICGSA-N 0.000 description 1
- BIOCIVSVEDFKDJ-GUBZILKMSA-N Arg-Arg-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O BIOCIVSVEDFKDJ-GUBZILKMSA-N 0.000 description 1
- HJVGMOYJDDXLMI-AVGNSLFASA-N Arg-Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H](N)CCCNC(N)=N HJVGMOYJDDXLMI-AVGNSLFASA-N 0.000 description 1
- OVVUNXXROOFSIM-SDDRHHMPSA-N Arg-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O OVVUNXXROOFSIM-SDDRHHMPSA-N 0.000 description 1
- RVDVDRUZWZIBJQ-CIUDSAMLSA-N Arg-Asn-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O RVDVDRUZWZIBJQ-CIUDSAMLSA-N 0.000 description 1
- MAISCYVJLBBRNU-DCAQKATOSA-N Arg-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N MAISCYVJLBBRNU-DCAQKATOSA-N 0.000 description 1
- DPNHSNLIULPOBH-GUBZILKMSA-N Arg-Asn-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N DPNHSNLIULPOBH-GUBZILKMSA-N 0.000 description 1
- IIABBYGHLYWVOS-FXQIFTODSA-N Arg-Asn-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O IIABBYGHLYWVOS-FXQIFTODSA-N 0.000 description 1
- ZTKHZAXGTFXUDD-VEVYYDQMSA-N Arg-Asn-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZTKHZAXGTFXUDD-VEVYYDQMSA-N 0.000 description 1
- PQWTZSNVWSOFFK-FXQIFTODSA-N Arg-Asp-Asn Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)CN=C(N)N PQWTZSNVWSOFFK-FXQIFTODSA-N 0.000 description 1
- DXQIQUIQYAGRCC-CIUDSAMLSA-N Arg-Asp-Gln Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)CN=C(N)N DXQIQUIQYAGRCC-CIUDSAMLSA-N 0.000 description 1
- OZNSCVPYWZRQPY-CIUDSAMLSA-N Arg-Asp-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O OZNSCVPYWZRQPY-CIUDSAMLSA-N 0.000 description 1
- KMSHNDWHPWXPEC-BQBZGAKWSA-N Arg-Asp-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O KMSHNDWHPWXPEC-BQBZGAKWSA-N 0.000 description 1
- SQKPKIJVWHAWNF-DCAQKATOSA-N Arg-Asp-Lys Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(O)=O SQKPKIJVWHAWNF-DCAQKATOSA-N 0.000 description 1
- TTXYKSADPSNOIF-IHRRRGAJSA-N Arg-Asp-Phe Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O TTXYKSADPSNOIF-IHRRRGAJSA-N 0.000 description 1
- FEZJJKXNPSEYEV-CIUDSAMLSA-N Arg-Gln-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O FEZJJKXNPSEYEV-CIUDSAMLSA-N 0.000 description 1
- SNBHMYQRNCJSOJ-CIUDSAMLSA-N Arg-Gln-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O SNBHMYQRNCJSOJ-CIUDSAMLSA-N 0.000 description 1
- PTVGLOCPAVYPFG-CIUDSAMLSA-N Arg-Gln-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O PTVGLOCPAVYPFG-CIUDSAMLSA-N 0.000 description 1
- BEXGZLUHRXTZCC-CIUDSAMLSA-N Arg-Gln-Ser Chemical compound C(C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N)CN=C(N)N BEXGZLUHRXTZCC-CIUDSAMLSA-N 0.000 description 1
- QAODJPUKWNNNRP-DCAQKATOSA-N Arg-Glu-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O QAODJPUKWNNNRP-DCAQKATOSA-N 0.000 description 1
- RKRSYHCNPFGMTA-CIUDSAMLSA-N Arg-Glu-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O RKRSYHCNPFGMTA-CIUDSAMLSA-N 0.000 description 1
- PNQWAUXQDBIJDY-GUBZILKMSA-N Arg-Glu-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O PNQWAUXQDBIJDY-GUBZILKMSA-N 0.000 description 1
- DJAIOAKQIOGULM-DCAQKATOSA-N Arg-Glu-Met Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O DJAIOAKQIOGULM-DCAQKATOSA-N 0.000 description 1
- HPSVTWMFWCHKFN-GARJFASQSA-N Arg-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O HPSVTWMFWCHKFN-GARJFASQSA-N 0.000 description 1
- HQIZDMIGUJOSNI-IUCAKERBSA-N Arg-Gly-Arg Chemical compound N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O HQIZDMIGUJOSNI-IUCAKERBSA-N 0.000 description 1
- CYXCAHZVPFREJD-LURJTMIESA-N Arg-Gly-Gly Chemical compound NC(=N)NCCC[C@H](N)C(=O)NCC(=O)NCC(O)=O CYXCAHZVPFREJD-LURJTMIESA-N 0.000 description 1
- RFXXUWGNVRJTNQ-QXEWZRGKSA-N Arg-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCCN=C(N)N)N RFXXUWGNVRJTNQ-QXEWZRGKSA-N 0.000 description 1
- OQCWXQJLCDPRHV-UWVGGRQHSA-N Arg-Gly-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O OQCWXQJLCDPRHV-UWVGGRQHSA-N 0.000 description 1
- PHHRSPBBQUFULD-UWVGGRQHSA-N Arg-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCCN=C(N)N)N PHHRSPBBQUFULD-UWVGGRQHSA-N 0.000 description 1
- HAVKMRGWNXMCDR-STQMWFEESA-N Arg-Gly-Phe Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O HAVKMRGWNXMCDR-STQMWFEESA-N 0.000 description 1
- WVNFNPGXYADPPO-BQBZGAKWSA-N Arg-Gly-Ser Chemical compound NC(N)=NCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O WVNFNPGXYADPPO-BQBZGAKWSA-N 0.000 description 1
- KRQSPVKUISQQFS-FJXKBIBVSA-N Arg-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCCN=C(N)N KRQSPVKUISQQFS-FJXKBIBVSA-N 0.000 description 1
- NKNILFJYKKHBKE-WPRPVWTQSA-N Arg-Gly-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O NKNILFJYKKHBKE-WPRPVWTQSA-N 0.000 description 1
- NVCIXQYNWYTLDO-IHRRRGAJSA-N Arg-His-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCCN=C(N)N)N NVCIXQYNWYTLDO-IHRRRGAJSA-N 0.000 description 1
- IRRMIGDCPOPZJW-ULQDDVLXSA-N Arg-His-Phe Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O IRRMIGDCPOPZJW-ULQDDVLXSA-N 0.000 description 1
- FFEUXEAKYRCACT-PEDHHIEDSA-N Arg-Ile-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CCCNC(N)=N)[C@@H](C)CC)C(O)=O FFEUXEAKYRCACT-PEDHHIEDSA-N 0.000 description 1
- GXXWTNKNFFKTJB-NAKRPEOUSA-N Arg-Ile-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O GXXWTNKNFFKTJB-NAKRPEOUSA-N 0.000 description 1
- UHFUZWSZQKMDSX-DCAQKATOSA-N Arg-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N UHFUZWSZQKMDSX-DCAQKATOSA-N 0.000 description 1
- GMFAGHNRXPSSJS-SRVKXCTJSA-N Arg-Leu-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O GMFAGHNRXPSSJS-SRVKXCTJSA-N 0.000 description 1
- YBZMTKUDWXZLIX-UWVGGRQHSA-N Arg-Leu-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O YBZMTKUDWXZLIX-UWVGGRQHSA-N 0.000 description 1
- WMEVEPXNCMKNGH-IHRRRGAJSA-N Arg-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N WMEVEPXNCMKNGH-IHRRRGAJSA-N 0.000 description 1
- UZGFHWIJWPUPOH-IHRRRGAJSA-N Arg-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N UZGFHWIJWPUPOH-IHRRRGAJSA-N 0.000 description 1
- COXMUHNBYCVVRG-DCAQKATOSA-N Arg-Leu-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O COXMUHNBYCVVRG-DCAQKATOSA-N 0.000 description 1
- OGSQONVYSTZIJB-WDSOQIARSA-N Arg-Leu-Trp Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)CCCN=C(N)N)C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(O)=O OGSQONVYSTZIJB-WDSOQIARSA-N 0.000 description 1
- YVTHEZNOKSAWRW-DCAQKATOSA-N Arg-Lys-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O YVTHEZNOKSAWRW-DCAQKATOSA-N 0.000 description 1
- NPAVRDPEFVKELR-DCAQKATOSA-N Arg-Lys-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O NPAVRDPEFVKELR-DCAQKATOSA-N 0.000 description 1
- OISWSORSLQOGFV-AVGNSLFASA-N Arg-Met-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCSC)NC(=O)[C@@H](N)CCCN=C(N)N OISWSORSLQOGFV-AVGNSLFASA-N 0.000 description 1
- VEAIMHJZTIDCIH-KKUMJFAQSA-N Arg-Phe-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O VEAIMHJZTIDCIH-KKUMJFAQSA-N 0.000 description 1
- RATVAFHGEFAWDH-JYJNAYRXSA-N Arg-Phe-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CCCN=C(N)N)N RATVAFHGEFAWDH-JYJNAYRXSA-N 0.000 description 1
- PRLPSDIHSRITSF-UNQGMJICSA-N Arg-Phe-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PRLPSDIHSRITSF-UNQGMJICSA-N 0.000 description 1
- HGKHPCFTRQDHCU-IUCAKERBSA-N Arg-Pro-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O HGKHPCFTRQDHCU-IUCAKERBSA-N 0.000 description 1
- NGYHSXDNNOFHNE-AVGNSLFASA-N Arg-Pro-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O NGYHSXDNNOFHNE-AVGNSLFASA-N 0.000 description 1
- AWMAZIIEFPFHCP-RCWTZXSCSA-N Arg-Pro-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O AWMAZIIEFPFHCP-RCWTZXSCSA-N 0.000 description 1
- VUGWHBXPMAHEGZ-SRVKXCTJSA-N Arg-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCCN=C(N)N VUGWHBXPMAHEGZ-SRVKXCTJSA-N 0.000 description 1
- ISJWBVIYRBAXEB-CIUDSAMLSA-N Arg-Ser-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O ISJWBVIYRBAXEB-CIUDSAMLSA-N 0.000 description 1
- DNLQVHBBMPZUGJ-BQBZGAKWSA-N Arg-Ser-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O DNLQVHBBMPZUGJ-BQBZGAKWSA-N 0.000 description 1
- LFAUVOXPCGJKTB-DCAQKATOSA-N Arg-Ser-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCCN=C(N)N)N LFAUVOXPCGJKTB-DCAQKATOSA-N 0.000 description 1
- URAUIUGLHBRPMF-NAKRPEOUSA-N Arg-Ser-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O URAUIUGLHBRPMF-NAKRPEOUSA-N 0.000 description 1
- AUZAXCPWMDBWEE-HJGDQZAQSA-N Arg-Thr-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O AUZAXCPWMDBWEE-HJGDQZAQSA-N 0.000 description 1
- RYQSYXFGFOTJDJ-RHYQMDGZSA-N Arg-Thr-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O RYQSYXFGFOTJDJ-RHYQMDGZSA-N 0.000 description 1
- MOGMYRUNTKYZFB-UNQGMJICSA-N Arg-Thr-Phe Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MOGMYRUNTKYZFB-UNQGMJICSA-N 0.000 description 1
- ZJBUILVYSXQNSW-YTWAJWBKSA-N Arg-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)O ZJBUILVYSXQNSW-YTWAJWBKSA-N 0.000 description 1
- ZPWMEWYQBWSGAO-ZJDVBMNYSA-N Arg-Thr-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZPWMEWYQBWSGAO-ZJDVBMNYSA-N 0.000 description 1
- XRNXPIGJPQHCPC-RCWTZXSCSA-N Arg-Thr-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CCCNC(N)=N)[C@@H](C)O)C(O)=O XRNXPIGJPQHCPC-RCWTZXSCSA-N 0.000 description 1
- POZKLUIXMHIULG-FDARSICLSA-N Arg-Trp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CCCN=C(N)N)N POZKLUIXMHIULG-FDARSICLSA-N 0.000 description 1
- XMGVWQWEWWULNS-BPUTZDHNSA-N Arg-Trp-Ser Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N XMGVWQWEWWULNS-BPUTZDHNSA-N 0.000 description 1
- UVTGNSWSRSCPLP-UHFFFAOYSA-N Arg-Tyr Natural products NC(CCNC(=N)N)C(=O)NC(Cc1ccc(O)cc1)C(=O)O UVTGNSWSRSCPLP-UHFFFAOYSA-N 0.000 description 1
- QMQZYILAWUOLPV-JYJNAYRXSA-N Arg-Tyr-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)CC1=CC=C(O)C=C1 QMQZYILAWUOLPV-JYJNAYRXSA-N 0.000 description 1
- CGWVCWFQGXOUSJ-ULQDDVLXSA-N Arg-Tyr-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O CGWVCWFQGXOUSJ-ULQDDVLXSA-N 0.000 description 1
- QHUOOCKNNURZSL-IHRRRGAJSA-N Arg-Tyr-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O QHUOOCKNNURZSL-IHRRRGAJSA-N 0.000 description 1
- XEOXPCNONWHHSW-AVGNSLFASA-N Arg-Val-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N XEOXPCNONWHHSW-AVGNSLFASA-N 0.000 description 1
- WTUZDHWWGUQEKN-SRVKXCTJSA-N Arg-Val-Met Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCSC)C(O)=O WTUZDHWWGUQEKN-SRVKXCTJSA-N 0.000 description 1
- 239000004475 Arginine Substances 0.000 description 1
- PFOYSEIHFVKHNF-FXQIFTODSA-N Asn-Ala-Arg Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O PFOYSEIHFVKHNF-FXQIFTODSA-N 0.000 description 1
- SWLOHUMCUDRTCL-ZLUOBGJFSA-N Asn-Ala-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N SWLOHUMCUDRTCL-ZLUOBGJFSA-N 0.000 description 1
- PDQBXRSOSCTGKY-ACZMJKKPSA-N Asn-Ala-Gln Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N PDQBXRSOSCTGKY-ACZMJKKPSA-N 0.000 description 1
- XWGJDUSDTRPQRK-ZLUOBGJFSA-N Asn-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC(N)=O XWGJDUSDTRPQRK-ZLUOBGJFSA-N 0.000 description 1
- IARGXWMWRFOQPG-GCJQMDKQSA-N Asn-Ala-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IARGXWMWRFOQPG-GCJQMDKQSA-N 0.000 description 1
- QEYJFBMTSMLPKZ-ZKWXMUAHSA-N Asn-Ala-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O QEYJFBMTSMLPKZ-ZKWXMUAHSA-N 0.000 description 1
- BDMIFVIWCNLDCT-CIUDSAMLSA-N Asn-Arg-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O BDMIFVIWCNLDCT-CIUDSAMLSA-N 0.000 description 1
- GXMSVVBIAMWMKO-BQBZGAKWSA-N Asn-Arg-Gly Chemical compound NC(=O)C[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CCCN=C(N)N GXMSVVBIAMWMKO-BQBZGAKWSA-N 0.000 description 1
- ACRYGQFHAQHDSF-ZLUOBGJFSA-N Asn-Asn-Asn Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ACRYGQFHAQHDSF-ZLUOBGJFSA-N 0.000 description 1
- KXFCBAHYSLJCCY-ZLUOBGJFSA-N Asn-Asn-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O KXFCBAHYSLJCCY-ZLUOBGJFSA-N 0.000 description 1
- KXEGPPNPXOKKHK-ZLUOBGJFSA-N Asn-Asp-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O KXEGPPNPXOKKHK-ZLUOBGJFSA-N 0.000 description 1
- CUQUEHYSSFETRD-ACZMJKKPSA-N Asn-Asp-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)N)N CUQUEHYSSFETRD-ACZMJKKPSA-N 0.000 description 1
- UGXVKHRDGLYFKR-CIUDSAMLSA-N Asn-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(N)=O UGXVKHRDGLYFKR-CIUDSAMLSA-N 0.000 description 1
- JZRLLSOWDYUKOK-SRVKXCTJSA-N Asn-Asp-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)N)N JZRLLSOWDYUKOK-SRVKXCTJSA-N 0.000 description 1
- ZDOQDYFZNGASEY-BIIVOSGPSA-N Asn-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)N)N)C(=O)O ZDOQDYFZNGASEY-BIIVOSGPSA-N 0.000 description 1
- RFLVTVBAESPKKR-ZLUOBGJFSA-N Asn-Cys-Cys Chemical compound N[C@@H](CC(N)=O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CS)C(O)=O RFLVTVBAESPKKR-ZLUOBGJFSA-N 0.000 description 1
- KUYKVGODHGHFDI-ACZMJKKPSA-N Asn-Gln-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O KUYKVGODHGHFDI-ACZMJKKPSA-N 0.000 description 1
- ULRPXVNMIIYDDJ-ACZMJKKPSA-N Asn-Glu-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)N)N ULRPXVNMIIYDDJ-ACZMJKKPSA-N 0.000 description 1
- PPMTUXJSQDNUDE-CIUDSAMLSA-N Asn-Glu-Arg Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N PPMTUXJSQDNUDE-CIUDSAMLSA-N 0.000 description 1
- COUZKSSMBFADSB-AVGNSLFASA-N Asn-Glu-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)N)N COUZKSSMBFADSB-AVGNSLFASA-N 0.000 description 1
- GYOHQKJEQQJBOY-QEJZJMRPSA-N Asn-Glu-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)N)N GYOHQKJEQQJBOY-QEJZJMRPSA-N 0.000 description 1
- CTQIOCMSIJATNX-WHFBIAKZSA-N Asn-Gly-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](C)C(O)=O CTQIOCMSIJATNX-WHFBIAKZSA-N 0.000 description 1
- IICZCLFBILYRCU-WHFBIAKZSA-N Asn-Gly-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O IICZCLFBILYRCU-WHFBIAKZSA-N 0.000 description 1
- GWNMUVANAWDZTI-YUMQZZPRSA-N Asn-Gly-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N GWNMUVANAWDZTI-YUMQZZPRSA-N 0.000 description 1
- HYQYLOSCICEYTR-YUMQZZPRSA-N Asn-Gly-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O HYQYLOSCICEYTR-YUMQZZPRSA-N 0.000 description 1
- OLVIPTLKNSAYRJ-YUMQZZPRSA-N Asn-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N OLVIPTLKNSAYRJ-YUMQZZPRSA-N 0.000 description 1
- GJFYPBDMUGGLFR-NKWVEPMBSA-N Asn-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CC(=O)N)N)C(=O)O GJFYPBDMUGGLFR-NKWVEPMBSA-N 0.000 description 1
- RAQMSGVCGSJKCL-FOHZUACHSA-N Asn-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(N)=O RAQMSGVCGSJKCL-FOHZUACHSA-N 0.000 description 1
- QUAWOKPCAKCHQL-SRVKXCTJSA-N Asn-His-Lys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N QUAWOKPCAKCHQL-SRVKXCTJSA-N 0.000 description 1
- PTSDPWIHOYMRGR-UGYAYLCHSA-N Asn-Ile-Asn Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O PTSDPWIHOYMRGR-UGYAYLCHSA-N 0.000 description 1
- IBLAOXSULLECQZ-IUKAMOBKSA-N Asn-Ile-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC(N)=O IBLAOXSULLECQZ-IUKAMOBKSA-N 0.000 description 1
- GOKCTAJWRPSCHP-VHWLVUOQSA-N Asn-Ile-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CC(=O)N)N GOKCTAJWRPSCHP-VHWLVUOQSA-N 0.000 description 1
- UHGUKCOQUNPSKK-CIUDSAMLSA-N Asn-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)N)N UHGUKCOQUNPSKK-CIUDSAMLSA-N 0.000 description 1
- GLWFAWNYGWBMOC-SRVKXCTJSA-N Asn-Leu-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O GLWFAWNYGWBMOC-SRVKXCTJSA-N 0.000 description 1
- YVXRYLVELQYAEQ-SRVKXCTJSA-N Asn-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N YVXRYLVELQYAEQ-SRVKXCTJSA-N 0.000 description 1
- JEEFEQCRXKPQHC-KKUMJFAQSA-N Asn-Leu-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JEEFEQCRXKPQHC-KKUMJFAQSA-N 0.000 description 1
- NCFJQJRLQJEECD-NHCYSSNCSA-N Asn-Leu-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O NCFJQJRLQJEECD-NHCYSSNCSA-N 0.000 description 1
- ALHMNHZJBYBYHS-DCAQKATOSA-N Asn-Lys-Arg Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ALHMNHZJBYBYHS-DCAQKATOSA-N 0.000 description 1
- NYGILGUOUOXGMJ-YUMQZZPRSA-N Asn-Lys-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O NYGILGUOUOXGMJ-YUMQZZPRSA-N 0.000 description 1
- MDDXKBHIMYYJLW-FXQIFTODSA-N Asn-Met-Asp Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N MDDXKBHIMYYJLW-FXQIFTODSA-N 0.000 description 1
- AEZCCDMZZJOGII-DCAQKATOSA-N Asn-Met-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O AEZCCDMZZJOGII-DCAQKATOSA-N 0.000 description 1
- QGABLMITFKUQDF-DCAQKATOSA-N Asn-Met-Lys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N QGABLMITFKUQDF-DCAQKATOSA-N 0.000 description 1
- CDGHMJJJHYKMPA-DLOVCJGASA-N Asn-Phe-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CC(=O)N)N CDGHMJJJHYKMPA-DLOVCJGASA-N 0.000 description 1
- OROMFUQQTSWUTI-IHRRRGAJSA-N Asn-Phe-Arg Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N OROMFUQQTSWUTI-IHRRRGAJSA-N 0.000 description 1
- LSJQOMAZIKQMTJ-SRVKXCTJSA-N Asn-Phe-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O LSJQOMAZIKQMTJ-SRVKXCTJSA-N 0.000 description 1
- MVXJBVVLACEGCG-PCBIJLKTSA-N Asn-Phe-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MVXJBVVLACEGCG-PCBIJLKTSA-N 0.000 description 1
- RVHGJNGNKGDCPX-KKUMJFAQSA-N Asn-Phe-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N RVHGJNGNKGDCPX-KKUMJFAQSA-N 0.000 description 1
- ZJIFRAPZHAGLGR-MELADBBJSA-N Asn-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CC(=O)N)N)C(=O)O ZJIFRAPZHAGLGR-MELADBBJSA-N 0.000 description 1
- PLTGTJAZQRGMPP-FXQIFTODSA-N Asn-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC(N)=O PLTGTJAZQRGMPP-FXQIFTODSA-N 0.000 description 1
- GKKUBLFXKRDMFC-BQBZGAKWSA-N Asn-Pro-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O GKKUBLFXKRDMFC-BQBZGAKWSA-N 0.000 description 1
- AWXDRZJQCVHCIT-DCAQKATOSA-N Asn-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC(N)=O AWXDRZJQCVHCIT-DCAQKATOSA-N 0.000 description 1
- SUIJFTJDTJKSRK-IHRRRGAJSA-N Asn-Pro-Tyr Chemical compound NC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 SUIJFTJDTJKSRK-IHRRRGAJSA-N 0.000 description 1
- KYQJHBWHRASMKG-ZLUOBGJFSA-N Asn-Ser-Cys Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(O)=O KYQJHBWHRASMKG-ZLUOBGJFSA-N 0.000 description 1
- UGXYFDQFLVCDFC-CIUDSAMLSA-N Asn-Ser-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC(N)=O UGXYFDQFLVCDFC-CIUDSAMLSA-N 0.000 description 1
- QUMKPKWYDVMGNT-NUMRIWBASA-N Asn-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O QUMKPKWYDVMGNT-NUMRIWBASA-N 0.000 description 1
- JBDLMLZNDRLDIX-HJGDQZAQSA-N Asn-Thr-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O JBDLMLZNDRLDIX-HJGDQZAQSA-N 0.000 description 1
- WUQXMTITJLFXAU-JIOCBJNQSA-N Asn-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)N)N)O WUQXMTITJLFXAU-JIOCBJNQSA-N 0.000 description 1
- XCBKBPRFACFFOO-AQZXSJQPSA-N Asn-Thr-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O XCBKBPRFACFFOO-AQZXSJQPSA-N 0.000 description 1
- BEHQTVDBCLSCBY-CFMVVWHZSA-N Asn-Tyr-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BEHQTVDBCLSCBY-CFMVVWHZSA-N 0.000 description 1
- DATSKXOXPUAOLK-KKUMJFAQSA-N Asn-Tyr-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O DATSKXOXPUAOLK-KKUMJFAQSA-N 0.000 description 1
- NJPLPRFQLBZAMH-IHRRRGAJSA-N Asn-Tyr-Met Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCSC)C(O)=O NJPLPRFQLBZAMH-IHRRRGAJSA-N 0.000 description 1
- JZLFYAAGGYMRIK-BYULHYEWSA-N Asn-Val-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O JZLFYAAGGYMRIK-BYULHYEWSA-N 0.000 description 1
- MYRLSKYSMXNLLA-LAEOZQHASA-N Asn-Val-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O MYRLSKYSMXNLLA-LAEOZQHASA-N 0.000 description 1
- ZAESWDKAMDVHLL-RCOVLWMOSA-N Asn-Val-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O ZAESWDKAMDVHLL-RCOVLWMOSA-N 0.000 description 1
- XZFONYMRYTVLPL-NHCYSSNCSA-N Asn-Val-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC(=O)N)N XZFONYMRYTVLPL-NHCYSSNCSA-N 0.000 description 1
- LMIWYCWRJVMAIQ-NHCYSSNCSA-N Asn-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N LMIWYCWRJVMAIQ-NHCYSSNCSA-N 0.000 description 1
- KDFQZBWWPYQBEN-ZLUOBGJFSA-N Asp-Ala-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)O)N KDFQZBWWPYQBEN-ZLUOBGJFSA-N 0.000 description 1
- WSWYMRLTJVKRCE-ZLUOBGJFSA-N Asp-Ala-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O WSWYMRLTJVKRCE-ZLUOBGJFSA-N 0.000 description 1
- XEDQMTWEYFBOIK-ACZMJKKPSA-N Asp-Ala-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O XEDQMTWEYFBOIK-ACZMJKKPSA-N 0.000 description 1
- VPPXTHJNTYDNFJ-CIUDSAMLSA-N Asp-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N VPPXTHJNTYDNFJ-CIUDSAMLSA-N 0.000 description 1
- NECWUSYTYSIFNC-DLOVCJGASA-N Asp-Ala-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 NECWUSYTYSIFNC-DLOVCJGASA-N 0.000 description 1
- RGKKALNPOYURGE-ZKWXMUAHSA-N Asp-Ala-Val Chemical compound N[C@@H](CC(=O)O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)O RGKKALNPOYURGE-ZKWXMUAHSA-N 0.000 description 1
- QHAJMRDEWNAIBQ-FXQIFTODSA-N Asp-Arg-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O QHAJMRDEWNAIBQ-FXQIFTODSA-N 0.000 description 1
- ZLGKHJHFYSRUBH-FXQIFTODSA-N Asp-Arg-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O ZLGKHJHFYSRUBH-FXQIFTODSA-N 0.000 description 1
- WSOKZUVWBXVJHX-CIUDSAMLSA-N Asp-Arg-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O WSOKZUVWBXVJHX-CIUDSAMLSA-N 0.000 description 1
- SDHFVYLZFBDSQT-DCAQKATOSA-N Asp-Arg-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC(=O)O)N SDHFVYLZFBDSQT-DCAQKATOSA-N 0.000 description 1
- ILJQISGMGXRZQQ-IHRRRGAJSA-N Asp-Arg-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ILJQISGMGXRZQQ-IHRRRGAJSA-N 0.000 description 1
- CASGONAXMZPHCK-FXQIFTODSA-N Asp-Asn-Arg Chemical compound C(C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)O)N)CN=C(N)N CASGONAXMZPHCK-FXQIFTODSA-N 0.000 description 1
- UQBGYPFHWFZMCD-ZLUOBGJFSA-N Asp-Asn-Asn Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O UQBGYPFHWFZMCD-ZLUOBGJFSA-N 0.000 description 1
- ATYWBXGNXZYZGI-ACZMJKKPSA-N Asp-Asn-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O ATYWBXGNXZYZGI-ACZMJKKPSA-N 0.000 description 1
- GWTLRDMPMJCNMH-WHFBIAKZSA-N Asp-Asn-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O GWTLRDMPMJCNMH-WHFBIAKZSA-N 0.000 description 1
- BUVNWKQBMZLCDW-UGYAYLCHSA-N Asp-Asn-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BUVNWKQBMZLCDW-UGYAYLCHSA-N 0.000 description 1
- KNMRXHIAVXHCLW-ZLUOBGJFSA-N Asp-Asn-Ser Chemical compound C([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N)C(=O)O KNMRXHIAVXHCLW-ZLUOBGJFSA-N 0.000 description 1
- RYEWQKQXRJCHIO-SRVKXCTJSA-N Asp-Asn-Tyr Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 RYEWQKQXRJCHIO-SRVKXCTJSA-N 0.000 description 1
- RDRMWJBLOSRRAW-BYULHYEWSA-N Asp-Asn-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O RDRMWJBLOSRRAW-BYULHYEWSA-N 0.000 description 1
- FRSGNOZCTWDVFZ-ACZMJKKPSA-N Asp-Asp-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O FRSGNOZCTWDVFZ-ACZMJKKPSA-N 0.000 description 1
- WCFCYFDBMNFSPA-ACZMJKKPSA-N Asp-Asp-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(O)=O WCFCYFDBMNFSPA-ACZMJKKPSA-N 0.000 description 1
- TVVYVAUGRHNTGT-UGYAYLCHSA-N Asp-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(O)=O TVVYVAUGRHNTGT-UGYAYLCHSA-N 0.000 description 1
- VHWNKSJHQFZJTH-FXQIFTODSA-N Asp-Asp-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)O)N VHWNKSJHQFZJTH-FXQIFTODSA-N 0.000 description 1
- LKIYSIYBKYLKPU-BIIVOSGPSA-N Asp-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)O)N)C(=O)O LKIYSIYBKYLKPU-BIIVOSGPSA-N 0.000 description 1
- KGAJCJXBEWLQDZ-UBHSHLNASA-N Asp-Asp-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)O)N KGAJCJXBEWLQDZ-UBHSHLNASA-N 0.000 description 1
- VZNOVQKGJQJOCS-SRVKXCTJSA-N Asp-Asp-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O VZNOVQKGJQJOCS-SRVKXCTJSA-N 0.000 description 1
- RYKWOUUZJFSJOH-FXQIFTODSA-N Asp-Gln-Glu Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)O)N RYKWOUUZJFSJOH-FXQIFTODSA-N 0.000 description 1
- VHQOCWWKXIOAQI-WDSKDSINSA-N Asp-Gln-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O VHQOCWWKXIOAQI-WDSKDSINSA-N 0.000 description 1
- SPKRHJOVRVDJGG-CIUDSAMLSA-N Asp-Gln-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)O)N SPKRHJOVRVDJGG-CIUDSAMLSA-N 0.000 description 1
- GHODABZPVZMWCE-FXQIFTODSA-N Asp-Glu-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O GHODABZPVZMWCE-FXQIFTODSA-N 0.000 description 1
- VILLWIDTHYPSLC-PEFMBERDSA-N Asp-Glu-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VILLWIDTHYPSLC-PEFMBERDSA-N 0.000 description 1
- PDECQIHABNQRHN-GUBZILKMSA-N Asp-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC(O)=O PDECQIHABNQRHN-GUBZILKMSA-N 0.000 description 1
- GISFCCXBVJKGEO-QEJZJMRPSA-N Asp-Glu-Trp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O GISFCCXBVJKGEO-QEJZJMRPSA-N 0.000 description 1
- YNCHFVRXEQFPBY-BQBZGAKWSA-N Asp-Gly-Arg Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N YNCHFVRXEQFPBY-BQBZGAKWSA-N 0.000 description 1
- HAFCJCDJGIOYPW-WDSKDSINSA-N Asp-Gly-Gln Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O HAFCJCDJGIOYPW-WDSKDSINSA-N 0.000 description 1
- OMMIEVATLAGRCK-BYPYZUCNSA-N Asp-Gly-Gly Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)NCC(O)=O OMMIEVATLAGRCK-BYPYZUCNSA-N 0.000 description 1
- PZXPWHFYZXTFBI-YUMQZZPRSA-N Asp-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(O)=O PZXPWHFYZXTFBI-YUMQZZPRSA-N 0.000 description 1
- POTCZYQVVNXUIG-BQBZGAKWSA-N Asp-Gly-Pro Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N1CCC[C@H]1C(O)=O POTCZYQVVNXUIG-BQBZGAKWSA-N 0.000 description 1
- CYCKJEFVFNRWEZ-UGYAYLCHSA-N Asp-Ile-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O CYCKJEFVFNRWEZ-UGYAYLCHSA-N 0.000 description 1
- KLYPOCBLKMPBIQ-GHCJXIJMSA-N Asp-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC(=O)O)N KLYPOCBLKMPBIQ-GHCJXIJMSA-N 0.000 description 1
- UZNSWMFLKVKJLI-VHWLVUOQSA-N Asp-Ile-Trp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O UZNSWMFLKVKJLI-VHWLVUOQSA-N 0.000 description 1
- RTXQQDVBACBSCW-CFMVVWHZSA-N Asp-Ile-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O RTXQQDVBACBSCW-CFMVVWHZSA-N 0.000 description 1
- JNNVNVRBYUJYGS-CIUDSAMLSA-N Asp-Leu-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O JNNVNVRBYUJYGS-CIUDSAMLSA-N 0.000 description 1
- PAYPSKIBMDHZPI-CIUDSAMLSA-N Asp-Leu-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O PAYPSKIBMDHZPI-CIUDSAMLSA-N 0.000 description 1
- AYFVRYXNDHBECD-YUMQZZPRSA-N Asp-Leu-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O AYFVRYXNDHBECD-YUMQZZPRSA-N 0.000 description 1
- RQHLMGCXCZUOGT-ZPFDUUQYSA-N Asp-Leu-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O RQHLMGCXCZUOGT-ZPFDUUQYSA-N 0.000 description 1
- UJGRZQYSNYTCAX-SRVKXCTJSA-N Asp-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O UJGRZQYSNYTCAX-SRVKXCTJSA-N 0.000 description 1
- CJUKAWUWBZCTDQ-SRVKXCTJSA-N Asp-Leu-Lys Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O CJUKAWUWBZCTDQ-SRVKXCTJSA-N 0.000 description 1
- HKEZZWQWXWGASX-KKUMJFAQSA-N Asp-Leu-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 HKEZZWQWXWGASX-KKUMJFAQSA-N 0.000 description 1
- ZRUBWRCKIVDCFS-XPCJQDJLSA-N Asp-Leu-Thr-Ser Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O ZRUBWRCKIVDCFS-XPCJQDJLSA-N 0.000 description 1
- LIVXPXUVXFRWNY-CIUDSAMLSA-N Asp-Lys-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O LIVXPXUVXFRWNY-CIUDSAMLSA-N 0.000 description 1
- CTWCFPWFIGRAEP-CIUDSAMLSA-N Asp-Lys-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O CTWCFPWFIGRAEP-CIUDSAMLSA-N 0.000 description 1
- GKWFMNNNYZHJHV-SRVKXCTJSA-N Asp-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC(O)=O GKWFMNNNYZHJHV-SRVKXCTJSA-N 0.000 description 1
- NVFSJIXJZCDICF-SRVKXCTJSA-N Asp-Lys-Lys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N NVFSJIXJZCDICF-SRVKXCTJSA-N 0.000 description 1
- YWLDTBBUHZJQHW-KKUMJFAQSA-N Asp-Lys-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)O)N YWLDTBBUHZJQHW-KKUMJFAQSA-N 0.000 description 1
- ZXRQJQCXPSMNMR-XIRDDKMYSA-N Asp-Lys-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)O)N ZXRQJQCXPSMNMR-XIRDDKMYSA-N 0.000 description 1
- DJCAHYVLMSRBFR-QXEWZRGKSA-N Asp-Met-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCSC)NC(=O)[C@@H](N)CC(O)=O DJCAHYVLMSRBFR-QXEWZRGKSA-N 0.000 description 1
- GYWQGGUCMDCUJE-DLOVCJGASA-N Asp-Phe-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(O)=O GYWQGGUCMDCUJE-DLOVCJGASA-N 0.000 description 1
- APXVSPGLYXFFSD-AJNGGQMLSA-N Asp-Phe-Asp-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](NC(=O)[C@H](CC(O)=O)N)CC1=CC=CC=C1 APXVSPGLYXFFSD-AJNGGQMLSA-N 0.000 description 1
- USNJAPJZSGTTPX-XVSYOHENSA-N Asp-Phe-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O USNJAPJZSGTTPX-XVSYOHENSA-N 0.000 description 1
- GPPIDDWYKJPRES-YDHLFZDLSA-N Asp-Phe-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O GPPIDDWYKJPRES-YDHLFZDLSA-N 0.000 description 1
- KPSHWSWFPUDEGF-FXQIFTODSA-N Asp-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC(O)=O KPSHWSWFPUDEGF-FXQIFTODSA-N 0.000 description 1
- BWJZSLQJNBSUPM-FXQIFTODSA-N Asp-Pro-Asn Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O BWJZSLQJNBSUPM-FXQIFTODSA-N 0.000 description 1
- KESWRFKUZRUTAH-FXQIFTODSA-N Asp-Pro-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O KESWRFKUZRUTAH-FXQIFTODSA-N 0.000 description 1
- AHWRSSLYSGLBGD-CIUDSAMLSA-N Asp-Pro-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O AHWRSSLYSGLBGD-CIUDSAMLSA-N 0.000 description 1
- YFGUZQQCSDZRBN-DCAQKATOSA-N Asp-Pro-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O YFGUZQQCSDZRBN-DCAQKATOSA-N 0.000 description 1
- DINOVZWPTMGSRF-QXEWZRGKSA-N Asp-Pro-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O DINOVZWPTMGSRF-QXEWZRGKSA-N 0.000 description 1
- WMLFFCRUSPNENW-ZLUOBGJFSA-N Asp-Ser-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O WMLFFCRUSPNENW-ZLUOBGJFSA-N 0.000 description 1
- BRRPVTUFESPTCP-ACZMJKKPSA-N Asp-Ser-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O BRRPVTUFESPTCP-ACZMJKKPSA-N 0.000 description 1
- QSFHZPQUAAQHAQ-CIUDSAMLSA-N Asp-Ser-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O QSFHZPQUAAQHAQ-CIUDSAMLSA-N 0.000 description 1
- HRVQDZOWMLFAOD-BIIVOSGPSA-N Asp-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CC(=O)O)N)C(=O)O HRVQDZOWMLFAOD-BIIVOSGPSA-N 0.000 description 1
- YIDFBWRHIYOYAA-LKXGYXEUSA-N Asp-Ser-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O YIDFBWRHIYOYAA-LKXGYXEUSA-N 0.000 description 1
- JSHWXQIZOCVWIA-ZKWXMUAHSA-N Asp-Ser-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O JSHWXQIZOCVWIA-ZKWXMUAHSA-N 0.000 description 1
- GWWSUMLEWKQHLR-NUMRIWBASA-N Asp-Thr-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O GWWSUMLEWKQHLR-NUMRIWBASA-N 0.000 description 1
- XAPPCWUWHNWCPQ-PBCZWWQYSA-N Asp-Thr-His Chemical compound N[C@@H](CC(=O)O)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)O XAPPCWUWHNWCPQ-PBCZWWQYSA-N 0.000 description 1
- PDIYGFYAMZZFCW-JIOCBJNQSA-N Asp-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N)O PDIYGFYAMZZFCW-JIOCBJNQSA-N 0.000 description 1
- GCACQYDBDHRVGE-LKXGYXEUSA-N Asp-Thr-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CC(O)=O GCACQYDBDHRVGE-LKXGYXEUSA-N 0.000 description 1
- JDDYEZGPYBBPBN-JRQIVUDYSA-N Asp-Thr-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JDDYEZGPYBBPBN-JRQIVUDYSA-N 0.000 description 1
- NVXLFIPTHPKSKL-UBHSHLNASA-N Asp-Trp-Asn Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CC(O)=O)N)C(=O)N[C@@H](CC(N)=O)C(O)=O)=CNC2=C1 NVXLFIPTHPKSKL-UBHSHLNASA-N 0.000 description 1
- YUELDQUPTAYEGM-XIRDDKMYSA-N Asp-Trp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CC(=O)O)N YUELDQUPTAYEGM-XIRDDKMYSA-N 0.000 description 1
- KNOGLZBISUBTFW-QRTARXTBSA-N Asp-Trp-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C(C)C)C(O)=O KNOGLZBISUBTFW-QRTARXTBSA-N 0.000 description 1
- UXIPUCUHQBIQOS-SRVKXCTJSA-N Asp-Tyr-Cys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O UXIPUCUHQBIQOS-SRVKXCTJSA-N 0.000 description 1
- OYSYWMMZGJSQRB-AVGNSLFASA-N Asp-Tyr-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O OYSYWMMZGJSQRB-AVGNSLFASA-N 0.000 description 1
- KNDCWFXCFKSEBM-AVGNSLFASA-N Asp-Tyr-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O KNDCWFXCFKSEBM-AVGNSLFASA-N 0.000 description 1
- OTKUAVXGMREHRX-CFMVVWHZSA-N Asp-Tyr-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(O)=O)CC1=CC=C(O)C=C1 OTKUAVXGMREHRX-CFMVVWHZSA-N 0.000 description 1
- BPAUXFVCSYQDQX-JRQIVUDYSA-N Asp-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC(=O)O)N)O BPAUXFVCSYQDQX-JRQIVUDYSA-N 0.000 description 1
- XWKBWZXGNXTDKY-ZKWXMUAHSA-N Asp-Val-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC(O)=O XWKBWZXGNXTDKY-ZKWXMUAHSA-N 0.000 description 1
- PLOKOIJSGCISHE-BYULHYEWSA-N Asp-Val-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O PLOKOIJSGCISHE-BYULHYEWSA-N 0.000 description 1
- WAEDSQFVZJUHLI-BYULHYEWSA-N Asp-Val-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O WAEDSQFVZJUHLI-BYULHYEWSA-N 0.000 description 1
- UXRVDHVARNBOIO-QSFUFRPTSA-N Asp-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CC(=O)O)N UXRVDHVARNBOIO-QSFUFRPTSA-N 0.000 description 1
- GGBQDSHTXKQSLP-NHCYSSNCSA-N Asp-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N GGBQDSHTXKQSLP-NHCYSSNCSA-N 0.000 description 1
- QPDUWAUSSWGJSB-NGZCFLSTSA-N Asp-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N QPDUWAUSSWGJSB-NGZCFLSTSA-N 0.000 description 1
- QOJJMJKTMKNFEF-ZKWXMUAHSA-N Asp-Val-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC(O)=O QOJJMJKTMKNFEF-ZKWXMUAHSA-N 0.000 description 1
- DCXYFEDJOCDNAF-UHFFFAOYSA-N Asparagine Natural products OC(=O)C(N)CC(N)=O DCXYFEDJOCDNAF-UHFFFAOYSA-N 0.000 description 1
- 241000636501 Asparagus aspergillus Species 0.000 description 1
- 102000035101 Aspartic proteases Human genes 0.000 description 1
- 108091005502 Aspartic proteases Proteins 0.000 description 1
- 241000228215 Aspergillus aculeatus Species 0.000 description 1
- 241000892910 Aspergillus foetidus Species 0.000 description 1
- 241001225321 Aspergillus fumigatus Species 0.000 description 1
- 101000757144 Aspergillus niger Glucoamylase Proteins 0.000 description 1
- 241001465318 Aspergillus terreus Species 0.000 description 1
- 241000203233 Aspergillus versicolor Species 0.000 description 1
- 241000193422 Bacillus lentus Species 0.000 description 1
- 241000194108 Bacillus licheniformis Species 0.000 description 1
- 244000063299 Bacillus subtilis Species 0.000 description 1
- 108010023063 Bacto-peptone Proteins 0.000 description 1
- 108700038091 Beta-glucanases Proteins 0.000 description 1
- 102100032487 Beta-mannosidase Human genes 0.000 description 1
- 241000149420 Bothrometopus brevis Species 0.000 description 1
- 241001619326 Cephalosporium Species 0.000 description 1
- 241000462056 Cestraeus plicatilis Species 0.000 description 1
- 241000221955 Chaetomium Species 0.000 description 1
- 108050001186 Chaperonin Cpn60 Proteins 0.000 description 1
- 102000052603 Chaperonins Human genes 0.000 description 1
- 229920002101 Chitin Polymers 0.000 description 1
- VEXZGXHMUGYJMC-UHFFFAOYSA-M Chloride anion Chemical compound [Cl-] VEXZGXHMUGYJMC-UHFFFAOYSA-M 0.000 description 1
- 102000003813 Cis-trans-isomerases Human genes 0.000 description 1
- 108090000175 Cis-trans-isomerases Proteins 0.000 description 1
- 108020004705 Codon Proteins 0.000 description 1
- 108700010070 Codon Usage Proteins 0.000 description 1
- RYGMFSIKBFXOCR-UHFFFAOYSA-N Copper Chemical compound [Cu] RYGMFSIKBFXOCR-UHFFFAOYSA-N 0.000 description 1
- 244000251987 Coprinus macrorhizus Species 0.000 description 1
- 241001362614 Crassa Species 0.000 description 1
- TVYMKYUSZSVOAG-ZLUOBGJFSA-N Cys-Ala-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O TVYMKYUSZSVOAG-ZLUOBGJFSA-N 0.000 description 1
- DCJNIJAWIRPPBB-CIUDSAMLSA-N Cys-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CS)N DCJNIJAWIRPPBB-CIUDSAMLSA-N 0.000 description 1
- UKVGHFORADMBEN-GUBZILKMSA-N Cys-Arg-Arg Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O UKVGHFORADMBEN-GUBZILKMSA-N 0.000 description 1
- KIQKJXYVGSYDFS-ZLUOBGJFSA-N Cys-Asn-Asn Chemical compound SC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O KIQKJXYVGSYDFS-ZLUOBGJFSA-N 0.000 description 1
- NIPJKKSXHSBEMX-CIUDSAMLSA-N Cys-Asp-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CS)N NIPJKKSXHSBEMX-CIUDSAMLSA-N 0.000 description 1
- BIVLWXQGXJLGKG-BIIVOSGPSA-N Cys-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CS)N)C(=O)O BIVLWXQGXJLGKG-BIIVOSGPSA-N 0.000 description 1
- YZFCGHIBLBDZDA-ZLUOBGJFSA-N Cys-Asp-Ser Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O YZFCGHIBLBDZDA-ZLUOBGJFSA-N 0.000 description 1
- MGAWEOHYNIMOQJ-ACZMJKKPSA-N Cys-Gln-Asp Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CS)N MGAWEOHYNIMOQJ-ACZMJKKPSA-N 0.000 description 1
- BSFFNUBDVYTDMV-WHFBIAKZSA-N Cys-Gly-Asn Chemical compound [H]N[C@@H](CS)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O BSFFNUBDVYTDMV-WHFBIAKZSA-N 0.000 description 1
- URDUGPGPLNXXES-WHFBIAKZSA-N Cys-Gly-Cys Chemical compound SC[C@H](N)C(=O)NCC(=O)N[C@@H](CS)C(O)=O URDUGPGPLNXXES-WHFBIAKZSA-N 0.000 description 1
- SKSJPIBFNFPTJB-NKWVEPMBSA-N Cys-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CS)N)C(=O)O SKSJPIBFNFPTJB-NKWVEPMBSA-N 0.000 description 1
- LYSHSHHDBVKJRN-JBDRJPRFSA-N Cys-Ile-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](CS)N LYSHSHHDBVKJRN-JBDRJPRFSA-N 0.000 description 1
- KXUKWRVYDYIPSQ-CIUDSAMLSA-N Cys-Leu-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O KXUKWRVYDYIPSQ-CIUDSAMLSA-N 0.000 description 1
- BNCKELUXXUYRNY-GUBZILKMSA-N Cys-Lys-Glu Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CS)N BNCKELUXXUYRNY-GUBZILKMSA-N 0.000 description 1
- ZOKPRHVIFAUJPV-GUBZILKMSA-N Cys-Pro-Arg Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CS)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O ZOKPRHVIFAUJPV-GUBZILKMSA-N 0.000 description 1
- UBHPUQAWSSNQLQ-DCAQKATOSA-N Cys-Pro-His Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CS)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O UBHPUQAWSSNQLQ-DCAQKATOSA-N 0.000 description 1
- SWJYSDXMTPMBHO-FXQIFTODSA-N Cys-Pro-Ser Chemical compound [H]N[C@@H](CS)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O SWJYSDXMTPMBHO-FXQIFTODSA-N 0.000 description 1
- HJXSYJVCMUOUNY-SRVKXCTJSA-N Cys-Ser-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CS)N HJXSYJVCMUOUNY-SRVKXCTJSA-N 0.000 description 1
- YNJBLTDKTMKEET-ZLUOBGJFSA-N Cys-Ser-Ser Chemical compound SC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O YNJBLTDKTMKEET-ZLUOBGJFSA-N 0.000 description 1
- MWVDDZUTWXFYHL-XKBZYTNZSA-N Cys-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CS)N)O MWVDDZUTWXFYHL-XKBZYTNZSA-N 0.000 description 1
- IQXSTXKVEMRMMB-XAVMHZPKSA-N Cys-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CS)N)O IQXSTXKVEMRMMB-XAVMHZPKSA-N 0.000 description 1
- XSELZJJGSKZZDO-UBHSHLNASA-N Cys-Trp-Asp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CS)N XSELZJJGSKZZDO-UBHSHLNASA-N 0.000 description 1
- CLEFUAZULXANBU-MELADBBJSA-N Cys-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CS)N)C(=O)O CLEFUAZULXANBU-MELADBBJSA-N 0.000 description 1
- KZZYVYWSXMFYEC-DCAQKATOSA-N Cys-Val-Leu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O KZZYVYWSXMFYEC-DCAQKATOSA-N 0.000 description 1
- ALTQTAKGRFLRLR-GUBZILKMSA-N Cys-Val-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CS)N ALTQTAKGRFLRLR-GUBZILKMSA-N 0.000 description 1
- 206010011732 Cyst Diseases 0.000 description 1
- QNAYBMKLOCPYGJ-UHFFFAOYSA-N D-alpha-Ala Natural products CC([NH3+])C([O-])=O QNAYBMKLOCPYGJ-UHFFFAOYSA-N 0.000 description 1
- 125000002353 D-glucosyl group Chemical group C1([C@H](O)[C@@H](O)[C@H](O)[C@H](O1)CO)* 0.000 description 1
- QIVBCDIJIAJPQS-SECBINFHSA-N D-tryptophane Chemical compound C1=CC=C2C(C[C@@H](N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-SECBINFHSA-N 0.000 description 1
- OUYCCCASQSFEME-MRVPVSSYSA-N D-tyrosine Chemical compound OC(=O)[C@H](N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-MRVPVSSYSA-N 0.000 description 1
- 102000053602 DNA Human genes 0.000 description 1
- 230000004543 DNA replication Effects 0.000 description 1
- 238000001712 DNA sequencing Methods 0.000 description 1
- 102000004163 DNA-directed RNA polymerases Human genes 0.000 description 1
- 108090000626 DNA-directed RNA polymerases Proteins 0.000 description 1
- 241001255637 Dactylium Species 0.000 description 1
- 238000002965 ELISA Methods 0.000 description 1
- 102100021807 ER degradation-enhancing alpha-mannosidase-like protein 1 Human genes 0.000 description 1
- 101150050883 ERV2 gene Proteins 0.000 description 1
- 108700041152 Endoplasmic Reticulum Chaperone BiP Proteins 0.000 description 1
- 102100021451 Endoplasmic reticulum chaperone BiP Human genes 0.000 description 1
- 102100039328 Endoplasmin Human genes 0.000 description 1
- 241000702224 Enterobacteria phage M13 Species 0.000 description 1
- YQYJSBFKSSDGFO-UHFFFAOYSA-N Epihygromycin Natural products OC1C(O)C(C(=O)C)OC1OC(C(=C1)O)=CC=C1C=C(C)C(=O)NC1C(O)C(O)C2OCOC2C1O YQYJSBFKSSDGFO-UHFFFAOYSA-N 0.000 description 1
- 108090000371 Esterases Proteins 0.000 description 1
- 239000005715 Fructose Substances 0.000 description 1
- 241000577870 Fusarium decemcellulare Species 0.000 description 1
- 241000146406 Fusarium heterosporum Species 0.000 description 1
- 241000427940 Fusarium solani Species 0.000 description 1
- 241000577872 Fusarium striatum Species 0.000 description 1
- 108010092526 GKPV peptide Proteins 0.000 description 1
- 241000193385 Geobacillus stearothermophilus Species 0.000 description 1
- 241000896533 Gliocladium Species 0.000 description 1
- YJIUYQKQBBQYHZ-ACZMJKKPSA-N Gln-Ala-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O YJIUYQKQBBQYHZ-ACZMJKKPSA-N 0.000 description 1
- NNQHEEQNPQYPGL-FXQIFTODSA-N Gln-Ala-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O NNQHEEQNPQYPGL-FXQIFTODSA-N 0.000 description 1
- RZSLYUUFFVHFRQ-FXQIFTODSA-N Gln-Ala-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O RZSLYUUFFVHFRQ-FXQIFTODSA-N 0.000 description 1
- UWZLBXOBVKRUFE-HGNGGELXSA-N Gln-Ala-His Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCC(=O)N)N UWZLBXOBVKRUFE-HGNGGELXSA-N 0.000 description 1
- WOACHWLUOFZLGJ-GUBZILKMSA-N Gln-Arg-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O WOACHWLUOFZLGJ-GUBZILKMSA-N 0.000 description 1
- OETQLUYCMBARHJ-CIUDSAMLSA-N Gln-Asn-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O OETQLUYCMBARHJ-CIUDSAMLSA-N 0.000 description 1
- AAOBFSKXAVIORT-GUBZILKMSA-N Gln-Asn-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O AAOBFSKXAVIORT-GUBZILKMSA-N 0.000 description 1
- LMPBBFWHCRURJD-LAEOZQHASA-N Gln-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)N)N LMPBBFWHCRURJD-LAEOZQHASA-N 0.000 description 1
- BTSPOOHJBYJRKO-CIUDSAMLSA-N Gln-Asp-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O BTSPOOHJBYJRKO-CIUDSAMLSA-N 0.000 description 1
- IKDOHQHEFPPGJG-FXQIFTODSA-N Gln-Asp-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O IKDOHQHEFPPGJG-FXQIFTODSA-N 0.000 description 1
- XEYMBRRKIFYQMF-GUBZILKMSA-N Gln-Asp-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O XEYMBRRKIFYQMF-GUBZILKMSA-N 0.000 description 1
- ALUBSZXSNSPDQV-WDSKDSINSA-N Gln-Cys-Gly Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CS)C(=O)NCC(O)=O ALUBSZXSNSPDQV-WDSKDSINSA-N 0.000 description 1
- QFTRCUPCARNIPZ-XHNCKOQMSA-N Gln-Cys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CS)NC(=O)[C@H](CCC(=O)N)N)C(=O)O QFTRCUPCARNIPZ-XHNCKOQMSA-N 0.000 description 1
- GHYJGDCPHMSFEJ-GUBZILKMSA-N Gln-Gln-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)N)N GHYJGDCPHMSFEJ-GUBZILKMSA-N 0.000 description 1
- ZQPOVSJFBBETHQ-CIUDSAMLSA-N Gln-Glu-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZQPOVSJFBBETHQ-CIUDSAMLSA-N 0.000 description 1
- KCJJFESQRXGTGC-BQBZGAKWSA-N Gln-Glu-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O KCJJFESQRXGTGC-BQBZGAKWSA-N 0.000 description 1
- LFIVHGMKWFGUGK-IHRRRGAJSA-N Gln-Glu-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)N)N LFIVHGMKWFGUGK-IHRRRGAJSA-N 0.000 description 1
- XKBASPWPBXNVLQ-WDSKDSINSA-N Gln-Gly-Asn Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O XKBASPWPBXNVLQ-WDSKDSINSA-N 0.000 description 1
- CLPQUWHBWXFJOX-BQBZGAKWSA-N Gln-Gly-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(O)=O CLPQUWHBWXFJOX-BQBZGAKWSA-N 0.000 description 1
- GNMQDOGFWYWPNM-LAEOZQHASA-N Gln-Gly-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)CNC(=O)[C@@H](N)CCC(N)=O)C(O)=O GNMQDOGFWYWPNM-LAEOZQHASA-N 0.000 description 1
- FGYPOQPQTUNESW-IUCAKERBSA-N Gln-Gly-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCC(=O)N)N FGYPOQPQTUNESW-IUCAKERBSA-N 0.000 description 1
- ICDIMQAMJGDHSE-GUBZILKMSA-N Gln-His-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O ICDIMQAMJGDHSE-GUBZILKMSA-N 0.000 description 1
- HDUDGCZEOZEFOA-KBIXCLLPSA-N Gln-Ile-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](CCC(=O)N)N HDUDGCZEOZEFOA-KBIXCLLPSA-N 0.000 description 1
- GQZDDFRXSDGUNG-YVNDNENWSA-N Gln-Ile-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O GQZDDFRXSDGUNG-YVNDNENWSA-N 0.000 description 1
- RGAOLBZBLOJUTP-GRLWGSQLSA-N Gln-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)NC(=O)[C@H](CCC(=O)N)N RGAOLBZBLOJUTP-GRLWGSQLSA-N 0.000 description 1
- FYAULIGIFPPOAA-ZPFDUUQYSA-N Gln-Ile-Met Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCSC)C(O)=O FYAULIGIFPPOAA-ZPFDUUQYSA-N 0.000 description 1
- KKCJHBXMYYVWMX-KQXIARHKSA-N Gln-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)N)N KKCJHBXMYYVWMX-KQXIARHKSA-N 0.000 description 1
- VZRAXPGTUNDIDK-GUBZILKMSA-N Gln-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N VZRAXPGTUNDIDK-GUBZILKMSA-N 0.000 description 1
- QKCZZAZNMMVICF-DCAQKATOSA-N Gln-Leu-Glu Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O QKCZZAZNMMVICF-DCAQKATOSA-N 0.000 description 1
- CAXXTYYGFYTBPV-IUCAKERBSA-N Gln-Leu-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O CAXXTYYGFYTBPV-IUCAKERBSA-N 0.000 description 1
- VUVKKXPCKILIBD-AVGNSLFASA-N Gln-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCC(=O)N)N VUVKKXPCKILIBD-AVGNSLFASA-N 0.000 description 1
- XFAUJGNLHIGXET-AVGNSLFASA-N Gln-Leu-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O XFAUJGNLHIGXET-AVGNSLFASA-N 0.000 description 1
- ZBKUIQNCRIYVGH-SDDRHHMPSA-N Gln-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)N)N ZBKUIQNCRIYVGH-SDDRHHMPSA-N 0.000 description 1
- HPCOBEHVEHWREJ-DCAQKATOSA-N Gln-Lys-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O HPCOBEHVEHWREJ-DCAQKATOSA-N 0.000 description 1
- SXGMGNZEHFORAV-IUCAKERBSA-N Gln-Lys-Gly Chemical compound C(CCN)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCC(=O)N)N SXGMGNZEHFORAV-IUCAKERBSA-N 0.000 description 1
- LURQDGKYBFWWJA-MNXVOIDGSA-N Gln-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCC(=O)N)N LURQDGKYBFWWJA-MNXVOIDGSA-N 0.000 description 1
- JRHPEMVLTRADLJ-AVGNSLFASA-N Gln-Lys-Lys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N JRHPEMVLTRADLJ-AVGNSLFASA-N 0.000 description 1
- LHMWTCWZARHLPV-CIUDSAMLSA-N Gln-Met-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)N)N LHMWTCWZARHLPV-CIUDSAMLSA-N 0.000 description 1
- JUUNNOLZGVYCJT-JYJNAYRXSA-N Gln-Phe-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N JUUNNOLZGVYCJT-JYJNAYRXSA-N 0.000 description 1
- DBNLXHGDGBUCDV-KKUMJFAQSA-N Gln-Phe-Met Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(O)=O DBNLXHGDGBUCDV-KKUMJFAQSA-N 0.000 description 1
- OZEQPCDLCDRCGY-SOUVJXGZSA-N Gln-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CCC(=O)N)N)C(=O)O OZEQPCDLCDRCGY-SOUVJXGZSA-N 0.000 description 1
- DRNMNLKUUKKPIA-HTUGSXCWSA-N Gln-Phe-Thr Chemical compound C[C@@H](O)[C@H](NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@@H](N)CCC(N)=O)C(O)=O DRNMNLKUUKKPIA-HTUGSXCWSA-N 0.000 description 1
- MQJDLNRXBOELJW-KKUMJFAQSA-N Gln-Pro-Phe Chemical compound N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1ccccc1)C(O)=O MQJDLNRXBOELJW-KKUMJFAQSA-N 0.000 description 1
- KVQOVQVGVKDZNW-GUBZILKMSA-N Gln-Ser-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)N)N KVQOVQVGVKDZNW-GUBZILKMSA-N 0.000 description 1
- STHSGOZLFLFGSS-SUSMZKCASA-N Gln-Thr-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O STHSGOZLFLFGSS-SUSMZKCASA-N 0.000 description 1
- YJCZUTXLPXBNIO-BHYGNILZSA-N Gln-Trp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CNC3=CC=CC=C32)NC(=O)[C@H](CCC(=O)N)N)C(=O)O YJCZUTXLPXBNIO-BHYGNILZSA-N 0.000 description 1
- VDMABHYXBULDGN-LAEOZQHASA-N Gln-Val-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O VDMABHYXBULDGN-LAEOZQHASA-N 0.000 description 1
- BBFCMGBMYIAGRS-AUTRQRHGSA-N Gln-Val-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O BBFCMGBMYIAGRS-AUTRQRHGSA-N 0.000 description 1
- QGWXAMDECCKGRU-XVKPBYJWSA-N Gln-Val-Gly Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)CCC(N)=O)C(=O)NCC(O)=O QGWXAMDECCKGRU-XVKPBYJWSA-N 0.000 description 1
- FITIQFSXXBKFFM-NRPADANISA-N Gln-Val-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O FITIQFSXXBKFFM-NRPADANISA-N 0.000 description 1
- SZXSSXUNOALWCH-ACZMJKKPSA-N Glu-Ala-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O SZXSSXUNOALWCH-ACZMJKKPSA-N 0.000 description 1
- WZZSKAJIHTUUSG-ACZMJKKPSA-N Glu-Ala-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O WZZSKAJIHTUUSG-ACZMJKKPSA-N 0.000 description 1
- ITYRYNUZHPNCIK-GUBZILKMSA-N Glu-Ala-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O ITYRYNUZHPNCIK-GUBZILKMSA-N 0.000 description 1
- JJKKWYQVHRUSDG-GUBZILKMSA-N Glu-Ala-Lys Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(O)=O JJKKWYQVHRUSDG-GUBZILKMSA-N 0.000 description 1
- ATRHMOJQJWPVBQ-DRZSPHRISA-N Glu-Ala-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ATRHMOJQJWPVBQ-DRZSPHRISA-N 0.000 description 1
- MXOODARRORARSU-ACZMJKKPSA-N Glu-Ala-Ser Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)O)N MXOODARRORARSU-ACZMJKKPSA-N 0.000 description 1
- FYBSCGZLICNOBA-XQXXSGGOSA-N Glu-Ala-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FYBSCGZLICNOBA-XQXXSGGOSA-N 0.000 description 1
- NCWOMXABNYEPLY-NRPADANISA-N Glu-Ala-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O NCWOMXABNYEPLY-NRPADANISA-N 0.000 description 1
- LTUVYLVIZHJCOQ-KKUMJFAQSA-N Glu-Arg-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LTUVYLVIZHJCOQ-KKUMJFAQSA-N 0.000 description 1
- WOSRKEJQESVHGA-CIUDSAMLSA-N Glu-Arg-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O WOSRKEJQESVHGA-CIUDSAMLSA-N 0.000 description 1
- DYFJZDDQPNIPAB-NHCYSSNCSA-N Glu-Arg-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O DYFJZDDQPNIPAB-NHCYSSNCSA-N 0.000 description 1
- SBYVDRJAXWSXQL-AVGNSLFASA-N Glu-Asn-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SBYVDRJAXWSXQL-AVGNSLFASA-N 0.000 description 1
- VAIWPXWHWAPYDF-FXQIFTODSA-N Glu-Asp-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O VAIWPXWHWAPYDF-FXQIFTODSA-N 0.000 description 1
- DSPQRJXOIXHOHK-WDSKDSINSA-N Glu-Asp-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O DSPQRJXOIXHOHK-WDSKDSINSA-N 0.000 description 1
- JVSBYEDSSRZQGV-GUBZILKMSA-N Glu-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCC(O)=O JVSBYEDSSRZQGV-GUBZILKMSA-N 0.000 description 1
- HJIFPJUEOGZWRI-GUBZILKMSA-N Glu-Asp-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)O)N HJIFPJUEOGZWRI-GUBZILKMSA-N 0.000 description 1
- CYHBMLHCQXXCCT-AVGNSLFASA-N Glu-Asp-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CYHBMLHCQXXCCT-AVGNSLFASA-N 0.000 description 1
- LVCHEMOPBORRLB-DCAQKATOSA-N Glu-Gln-Lys Chemical compound NCCCC[C@H](NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CCC(O)=O)C(O)=O LVCHEMOPBORRLB-DCAQKATOSA-N 0.000 description 1
- RFDHKPSHTXZKLL-IHRRRGAJSA-N Glu-Gln-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)O)N RFDHKPSHTXZKLL-IHRRRGAJSA-N 0.000 description 1
- WLIPTFCZLHCNFD-LPEHRKFASA-N Glu-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)O)N)C(=O)O WLIPTFCZLHCNFD-LPEHRKFASA-N 0.000 description 1
- VFZIDQZAEBORGY-GLLZPBPUSA-N Glu-Gln-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VFZIDQZAEBORGY-GLLZPBPUSA-N 0.000 description 1
- ILGFBUGLBSAQQB-GUBZILKMSA-N Glu-Glu-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ILGFBUGLBSAQQB-GUBZILKMSA-N 0.000 description 1
- QQLBPVKLJBAXBS-FXQIFTODSA-N Glu-Glu-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O QQLBPVKLJBAXBS-FXQIFTODSA-N 0.000 description 1
- SJPMNHCEWPTRBR-BQBZGAKWSA-N Glu-Glu-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O SJPMNHCEWPTRBR-BQBZGAKWSA-N 0.000 description 1
- AUTNXSQEVVHSJK-YVNDNENWSA-N Glu-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O AUTNXSQEVVHSJK-YVNDNENWSA-N 0.000 description 1
- KUTPGXNAAOQSPD-LPEHRKFASA-N Glu-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)O)N)C(=O)O KUTPGXNAAOQSPD-LPEHRKFASA-N 0.000 description 1
- WRNAXCVRSBBKGS-BQBZGAKWSA-N Glu-Gly-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(O)=O WRNAXCVRSBBKGS-BQBZGAKWSA-N 0.000 description 1
- KRGZZKWSBGPLKL-IUCAKERBSA-N Glu-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCC(=O)O)N KRGZZKWSBGPLKL-IUCAKERBSA-N 0.000 description 1
- OPAINBJQDQTGJY-JGVFFNPUSA-N Glu-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CCC(=O)O)N)C(=O)O OPAINBJQDQTGJY-JGVFFNPUSA-N 0.000 description 1
- XMPAXPSENRSOSV-RYUDHWBXSA-N Glu-Gly-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O XMPAXPSENRSOSV-RYUDHWBXSA-N 0.000 description 1
- COSBSYQVPSODFX-GUBZILKMSA-N Glu-His-Cys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)O)N COSBSYQVPSODFX-GUBZILKMSA-N 0.000 description 1
- XIKYNVKEUINBGL-IUCAKERBSA-N Glu-His-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)NCC(O)=O XIKYNVKEUINBGL-IUCAKERBSA-N 0.000 description 1
- XOFYVODYSNKPDK-AVGNSLFASA-N Glu-His-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CCC(=O)O)N XOFYVODYSNKPDK-AVGNSLFASA-N 0.000 description 1
- VGOFRWOTSXVPAU-SDDRHHMPSA-N Glu-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CCC(=O)O)N)C(=O)O VGOFRWOTSXVPAU-SDDRHHMPSA-N 0.000 description 1
- ZPASCJBSSCRWMC-GVXVVHGQSA-N Glu-His-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCC(=O)O)N ZPASCJBSSCRWMC-GVXVVHGQSA-N 0.000 description 1
- LGYCLOCORAEQSZ-PEFMBERDSA-N Glu-Ile-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O LGYCLOCORAEQSZ-PEFMBERDSA-N 0.000 description 1
- ITBHUUMCJJQUSC-LAEOZQHASA-N Glu-Ile-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O ITBHUUMCJJQUSC-LAEOZQHASA-N 0.000 description 1
- ZCOJVESMNGBGLF-GRLWGSQLSA-N Glu-Ile-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ZCOJVESMNGBGLF-GRLWGSQLSA-N 0.000 description 1
- QXDXIXFSFHUYAX-MNXVOIDGSA-N Glu-Ile-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(O)=O QXDXIXFSFHUYAX-MNXVOIDGSA-N 0.000 description 1
- INGJLBQKTRJLFO-UKJIMTQDSA-N Glu-Ile-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(O)=O INGJLBQKTRJLFO-UKJIMTQDSA-N 0.000 description 1
- DNPCBMNFQVTHMA-DCAQKATOSA-N Glu-Leu-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O DNPCBMNFQVTHMA-DCAQKATOSA-N 0.000 description 1
- IRXNJYPKBVERCW-DCAQKATOSA-N Glu-Leu-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O IRXNJYPKBVERCW-DCAQKATOSA-N 0.000 description 1
- IVGJYOOGJLFKQE-AVGNSLFASA-N Glu-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N IVGJYOOGJLFKQE-AVGNSLFASA-N 0.000 description 1
- NJCALAAIGREHDR-WDCWCFNPSA-N Glu-Leu-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NJCALAAIGREHDR-WDCWCFNPSA-N 0.000 description 1
- SJJHXJDSNQJMMW-SRVKXCTJSA-N Glu-Lys-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O SJJHXJDSNQJMMW-SRVKXCTJSA-N 0.000 description 1
- SWRVAQHFBRZVNX-GUBZILKMSA-N Glu-Lys-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O SWRVAQHFBRZVNX-GUBZILKMSA-N 0.000 description 1
- UJMNFCAHLYKWOZ-DCAQKATOSA-N Glu-Lys-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O UJMNFCAHLYKWOZ-DCAQKATOSA-N 0.000 description 1
- ILWHFUZZCFYSKT-AVGNSLFASA-N Glu-Lys-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O ILWHFUZZCFYSKT-AVGNSLFASA-N 0.000 description 1
- FMBWLLMUPXTXFC-SDDRHHMPSA-N Glu-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CCC(=O)O)N)C(=O)O FMBWLLMUPXTXFC-SDDRHHMPSA-N 0.000 description 1
- RBXSZQRSEGYDFG-GUBZILKMSA-N Glu-Lys-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O RBXSZQRSEGYDFG-GUBZILKMSA-N 0.000 description 1
- AQNYKMCFCCZEEL-JYJNAYRXSA-N Glu-Lys-Tyr Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 AQNYKMCFCCZEEL-JYJNAYRXSA-N 0.000 description 1
- SUIAHERNFYRBDZ-GVXVVHGQSA-N Glu-Lys-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O SUIAHERNFYRBDZ-GVXVVHGQSA-N 0.000 description 1
- BHXSLRDWXIFKTP-SRVKXCTJSA-N Glu-Met-His Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCC(=O)O)N BHXSLRDWXIFKTP-SRVKXCTJSA-N 0.000 description 1
- UERORLSAFUHDGU-AVGNSLFASA-N Glu-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N UERORLSAFUHDGU-AVGNSLFASA-N 0.000 description 1
- FQFWFZWOHOEVMZ-IHRRRGAJSA-N Glu-Phe-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O FQFWFZWOHOEVMZ-IHRRRGAJSA-N 0.000 description 1
- QNJNPKSWAHPYGI-JYJNAYRXSA-N Glu-Phe-Leu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)CC1=CC=CC=C1 QNJNPKSWAHPYGI-JYJNAYRXSA-N 0.000 description 1
- YTRBQAQSUDSIQE-FHWLQOOXSA-N Glu-Phe-Phe Chemical compound C([C@H](NC(=O)[C@H](CCC(O)=O)N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 YTRBQAQSUDSIQE-FHWLQOOXSA-N 0.000 description 1
- YUXIEONARHPUTK-JBACZVJFSA-N Glu-Phe-Trp Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)O)NC(=O)[C@H](CCC(=O)O)N YUXIEONARHPUTK-JBACZVJFSA-N 0.000 description 1
- QJVZSVUYZFYLFQ-CIUDSAMLSA-N Glu-Pro-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O QJVZSVUYZFYLFQ-CIUDSAMLSA-N 0.000 description 1
- DXVOKNVIKORTHQ-GUBZILKMSA-N Glu-Pro-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O DXVOKNVIKORTHQ-GUBZILKMSA-N 0.000 description 1
- BFEZQZKEPRKKHV-SRVKXCTJSA-N Glu-Pro-Lys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CCC(=O)O)N)C(=O)N[C@@H](CCCCN)C(=O)O BFEZQZKEPRKKHV-SRVKXCTJSA-N 0.000 description 1
- ZKONLKQGTNVAPR-DCAQKATOSA-N Glu-Pro-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CCC(=O)O)N ZKONLKQGTNVAPR-DCAQKATOSA-N 0.000 description 1
- SWDNPSMMEWRNOH-HJGDQZAQSA-N Glu-Pro-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O SWDNPSMMEWRNOH-HJGDQZAQSA-N 0.000 description 1
- WIKMTDVSCUJIPJ-CIUDSAMLSA-N Glu-Ser-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N WIKMTDVSCUJIPJ-CIUDSAMLSA-N 0.000 description 1
- GMVCSRBOSIUTFC-FXQIFTODSA-N Glu-Ser-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O GMVCSRBOSIUTFC-FXQIFTODSA-N 0.000 description 1
- JWNZHMSRZXXGTM-XKBZYTNZSA-N Glu-Ser-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JWNZHMSRZXXGTM-XKBZYTNZSA-N 0.000 description 1
- TZXOPHFCAATANZ-QEJZJMRPSA-N Glu-Ser-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)O)N TZXOPHFCAATANZ-QEJZJMRPSA-N 0.000 description 1
- WXONSNSSBYQGNN-AVGNSLFASA-N Glu-Ser-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O WXONSNSSBYQGNN-AVGNSLFASA-N 0.000 description 1
- LWYUQLZOIORFFJ-XKBZYTNZSA-N Glu-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O LWYUQLZOIORFFJ-XKBZYTNZSA-N 0.000 description 1
- YQAQQKPWFOBSMU-WDCWCFNPSA-N Glu-Thr-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O YQAQQKPWFOBSMU-WDCWCFNPSA-N 0.000 description 1
- DTLLNDVORUEOTM-WDCWCFNPSA-N Glu-Thr-Lys Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O DTLLNDVORUEOTM-WDCWCFNPSA-N 0.000 description 1
- MXJYXYDREQWUMS-XKBZYTNZSA-N Glu-Thr-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O MXJYXYDREQWUMS-XKBZYTNZSA-N 0.000 description 1
- CAQXJMUDOLSBPF-SUSMZKCASA-N Glu-Thr-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CAQXJMUDOLSBPF-SUSMZKCASA-N 0.000 description 1
- DLISPGXMKZTWQG-IFFSRLJSSA-N Glu-Thr-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O DLISPGXMKZTWQG-IFFSRLJSSA-N 0.000 description 1
- ZQNCUVODKOBSSO-XEGUGMAKSA-N Glu-Trp-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C)C(O)=O ZQNCUVODKOBSSO-XEGUGMAKSA-N 0.000 description 1
- ZNOHKCPYDAYYDA-BPUTZDHNSA-N Glu-Trp-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZNOHKCPYDAYYDA-BPUTZDHNSA-N 0.000 description 1
- UCZXXMREFIETQW-AVGNSLFASA-N Glu-Tyr-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O UCZXXMREFIETQW-AVGNSLFASA-N 0.000 description 1
- HHSKZJZWQFPSKN-AVGNSLFASA-N Glu-Tyr-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O HHSKZJZWQFPSKN-AVGNSLFASA-N 0.000 description 1
- UZWUBBRJWFTHTD-LAEOZQHASA-N Glu-Val-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCC(O)=O UZWUBBRJWFTHTD-LAEOZQHASA-N 0.000 description 1
- YPHPEHMXOYTEQG-LAEOZQHASA-N Glu-Val-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCC(O)=O YPHPEHMXOYTEQG-LAEOZQHASA-N 0.000 description 1
- LZEUDRYSAZAJIO-AUTRQRHGSA-N Glu-Val-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O LZEUDRYSAZAJIO-AUTRQRHGSA-N 0.000 description 1
- 108050008938 Glucoamylases Proteins 0.000 description 1
- PYTZFYUXZZHOAD-WHFBIAKZSA-N Gly-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)CN PYTZFYUXZZHOAD-WHFBIAKZSA-N 0.000 description 1
- PUUYVMYCMIWHFE-BQBZGAKWSA-N Gly-Ala-Arg Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N PUUYVMYCMIWHFE-BQBZGAKWSA-N 0.000 description 1
- GQGAFTPXAPKSCF-WHFBIAKZSA-N Gly-Ala-Cys Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CS)C(=O)O GQGAFTPXAPKSCF-WHFBIAKZSA-N 0.000 description 1
- GZUKEVBTYNNUQF-WDSKDSINSA-N Gly-Ala-Gln Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O GZUKEVBTYNNUQF-WDSKDSINSA-N 0.000 description 1
- YMUFWNJHVPQNQD-ZKWXMUAHSA-N Gly-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN YMUFWNJHVPQNQD-ZKWXMUAHSA-N 0.000 description 1
- MZZSCEANQDPJER-ONGXEEELSA-N Gly-Ala-Phe Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MZZSCEANQDPJER-ONGXEEELSA-N 0.000 description 1
- QIZJOTQTCAGKPU-KWQFWETISA-N Gly-Ala-Tyr Chemical compound [NH3+]CC(=O)N[C@@H](C)C(=O)N[C@H](C([O-])=O)CC1=CC=C(O)C=C1 QIZJOTQTCAGKPU-KWQFWETISA-N 0.000 description 1
- JXYMPBCYRKWJEE-BQBZGAKWSA-N Gly-Arg-Ala Chemical compound [H]NCC(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O JXYMPBCYRKWJEE-BQBZGAKWSA-N 0.000 description 1
- UPOJUWHGMDJUQZ-IUCAKERBSA-N Gly-Arg-Arg Chemical compound NC(=N)NCCC[C@H](NC(=O)CN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O UPOJUWHGMDJUQZ-IUCAKERBSA-N 0.000 description 1
- JPXNYFOHTHSREU-UWVGGRQHSA-N Gly-Arg-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)CN JPXNYFOHTHSREU-UWVGGRQHSA-N 0.000 description 1
- OCQUNKSFDYDXBG-QXEWZRGKSA-N Gly-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N OCQUNKSFDYDXBG-QXEWZRGKSA-N 0.000 description 1
- OVSKVOOUFAKODB-UWVGGRQHSA-N Gly-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N OVSKVOOUFAKODB-UWVGGRQHSA-N 0.000 description 1
- KFMBRBPXHVMDFN-UWVGGRQHSA-N Gly-Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCNC(N)=N KFMBRBPXHVMDFN-UWVGGRQHSA-N 0.000 description 1
- AIJAPFVDBFYNKN-WHFBIAKZSA-N Gly-Asn-Asp Chemical compound C([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)CN)C(=O)N AIJAPFVDBFYNKN-WHFBIAKZSA-N 0.000 description 1
- WJZLEENECIOOSA-WDSKDSINSA-N Gly-Asn-Gln Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)O WJZLEENECIOOSA-WDSKDSINSA-N 0.000 description 1
- GGEJHJIXRBTJPD-BYPYZUCNSA-N Gly-Asn-Gly Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O GGEJHJIXRBTJPD-BYPYZUCNSA-N 0.000 description 1
- XRTDOIOIBMAXCT-NKWVEPMBSA-N Gly-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)CN)C(=O)O XRTDOIOIBMAXCT-NKWVEPMBSA-N 0.000 description 1
- FMNHBTKMRFVGRO-FOHZUACHSA-N Gly-Asn-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)CN FMNHBTKMRFVGRO-FOHZUACHSA-N 0.000 description 1
- LURCIJSJAKFCRO-QWRGUYRKSA-N Gly-Asn-Tyr Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O LURCIJSJAKFCRO-QWRGUYRKSA-N 0.000 description 1
- GRIRDMVMJJDZKV-RCOVLWMOSA-N Gly-Asn-Val Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O GRIRDMVMJJDZKV-RCOVLWMOSA-N 0.000 description 1
- SUDUYJOBLHQAMI-WHFBIAKZSA-N Gly-Asp-Cys Chemical compound NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CS)C(O)=O SUDUYJOBLHQAMI-WHFBIAKZSA-N 0.000 description 1
- FZQLXNIMCPJVJE-YUMQZZPRSA-N Gly-Asp-Leu Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O FZQLXNIMCPJVJE-YUMQZZPRSA-N 0.000 description 1
- RPLLQZBOVIVGMX-QWRGUYRKSA-N Gly-Asp-Phe Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O RPLLQZBOVIVGMX-QWRGUYRKSA-N 0.000 description 1
- LXXLEUBUOMCAMR-NKWVEPMBSA-N Gly-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)CN)C(=O)O LXXLEUBUOMCAMR-NKWVEPMBSA-N 0.000 description 1
- PMNHJLASAAWELO-FOHZUACHSA-N Gly-Asp-Thr Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PMNHJLASAAWELO-FOHZUACHSA-N 0.000 description 1
- CEXINUGNTZFNRY-BYPYZUCNSA-N Gly-Cys-Gly Chemical compound [NH3+]CC(=O)N[C@@H](CS)C(=O)NCC([O-])=O CEXINUGNTZFNRY-BYPYZUCNSA-N 0.000 description 1
- CQZDZKRHFWJXDF-WDSKDSINSA-N Gly-Gln-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCC(N)=O)NC(=O)CN CQZDZKRHFWJXDF-WDSKDSINSA-N 0.000 description 1
- DTRUBYPMMVPQPD-YUMQZZPRSA-N Gly-Gln-Arg Chemical compound [H]NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O DTRUBYPMMVPQPD-YUMQZZPRSA-N 0.000 description 1
- BPQYBFAXRGMGGY-LAEOZQHASA-N Gly-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)CN BPQYBFAXRGMGGY-LAEOZQHASA-N 0.000 description 1
- GNPVTZJUUBPZKW-WDSKDSINSA-N Gly-Gln-Ser Chemical compound [H]NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O GNPVTZJUUBPZKW-WDSKDSINSA-N 0.000 description 1
- MOJKRXIRAZPZLW-WDSKDSINSA-N Gly-Glu-Ala Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O MOJKRXIRAZPZLW-WDSKDSINSA-N 0.000 description 1
- LHRXAHLCRMQBGJ-RYUDHWBXSA-N Gly-Glu-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)CN LHRXAHLCRMQBGJ-RYUDHWBXSA-N 0.000 description 1
- QSVCIFZPGLOZGH-WDSKDSINSA-N Gly-Glu-Ser Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O QSVCIFZPGLOZGH-WDSKDSINSA-N 0.000 description 1
- CCQOOWAONKGYKQ-BYPYZUCNSA-N Gly-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)CN CCQOOWAONKGYKQ-BYPYZUCNSA-N 0.000 description 1
- HQRHFUYMGCHHJS-LURJTMIESA-N Gly-Gly-Arg Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N HQRHFUYMGCHHJS-LURJTMIESA-N 0.000 description 1
- PDAWDNVHMUKWJR-ZETCQYMHSA-N Gly-Gly-His Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC1=CNC=N1 PDAWDNVHMUKWJR-ZETCQYMHSA-N 0.000 description 1
- XPJBQTCXPJNIFE-ZETCQYMHSA-N Gly-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)CN XPJBQTCXPJNIFE-ZETCQYMHSA-N 0.000 description 1
- QPCVIQJVRGXUSA-LURJTMIESA-N Gly-Gly-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)CNC(=O)CN QPCVIQJVRGXUSA-LURJTMIESA-N 0.000 description 1
- YWAQATDNEKZFFK-BYPYZUCNSA-N Gly-Gly-Ser Chemical compound NCC(=O)NCC(=O)N[C@@H](CO)C(O)=O YWAQATDNEKZFFK-BYPYZUCNSA-N 0.000 description 1
- UQJNXZSSGQIPIQ-FBCQKBJTSA-N Gly-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)CN UQJNXZSSGQIPIQ-FBCQKBJTSA-N 0.000 description 1
- FQKKPCWTZZEDIC-XPUUQOCRSA-N Gly-His-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)CN)CC1=CN=CN1 FQKKPCWTZZEDIC-XPUUQOCRSA-N 0.000 description 1
- AYBKPDHHVADEDA-YUMQZZPRSA-N Gly-His-Asn Chemical compound [H]NCC(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(N)=O)C(O)=O AYBKPDHHVADEDA-YUMQZZPRSA-N 0.000 description 1
- IVSWQHKONQIOHA-YUMQZZPRSA-N Gly-His-Cys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)CN IVSWQHKONQIOHA-YUMQZZPRSA-N 0.000 description 1
- ORXZVPZCPMKHNR-IUCAKERBSA-N Gly-His-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CNC=N1 ORXZVPZCPMKHNR-IUCAKERBSA-N 0.000 description 1
- HPAIKDPJURGQLN-KBPBESRZSA-N Gly-His-Phe Chemical compound C([C@H](NC(=O)CN)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CNC=N1 HPAIKDPJURGQLN-KBPBESRZSA-N 0.000 description 1
- DENRBIYENOKSEX-PEXQALLHSA-N Gly-Ile-His Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 DENRBIYENOKSEX-PEXQALLHSA-N 0.000 description 1
- BHPQOIPBLYJNAW-NGZCFLSTSA-N Gly-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN BHPQOIPBLYJNAW-NGZCFLSTSA-N 0.000 description 1
- SCWYHUQOOFRVHP-MBLNEYKQSA-N Gly-Ile-Thr Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SCWYHUQOOFRVHP-MBLNEYKQSA-N 0.000 description 1
- PAWIVEIWWYGBAM-YUMQZZPRSA-N Gly-Leu-Ala Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O PAWIVEIWWYGBAM-YUMQZZPRSA-N 0.000 description 1
- UUYBFNKHOCJCHT-VHSXEESVSA-N Gly-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN UUYBFNKHOCJCHT-VHSXEESVSA-N 0.000 description 1
- LHYJCVCQPWRMKZ-WEDXCCLWSA-N Gly-Leu-Thr Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LHYJCVCQPWRMKZ-WEDXCCLWSA-N 0.000 description 1
- AFWYPMDMDYCKMD-KBPBESRZSA-N Gly-Leu-Tyr Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 AFWYPMDMDYCKMD-KBPBESRZSA-N 0.000 description 1
- CLNSYANKYVMZNM-UWVGGRQHSA-N Gly-Lys-Arg Chemical compound NCCCC[C@H](NC(=O)CN)C(=O)N[C@H](C(O)=O)CCCN=C(N)N CLNSYANKYVMZNM-UWVGGRQHSA-N 0.000 description 1
- VEPBEGNDJYANCF-QWRGUYRKSA-N Gly-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCCN VEPBEGNDJYANCF-QWRGUYRKSA-N 0.000 description 1
- WDEHMRNSGHVNOH-VHSXEESVSA-N Gly-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)CN)C(=O)O WDEHMRNSGHVNOH-VHSXEESVSA-N 0.000 description 1
- FXGRXIATVXUAHO-WEDXCCLWSA-N Gly-Lys-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCCN FXGRXIATVXUAHO-WEDXCCLWSA-N 0.000 description 1
- CVFOYJJOZYYEPE-KBPBESRZSA-N Gly-Lys-Tyr Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CVFOYJJOZYYEPE-KBPBESRZSA-N 0.000 description 1
- RVGMVLVBDRQVKB-UWVGGRQHSA-N Gly-Met-His Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)CN RVGMVLVBDRQVKB-UWVGGRQHSA-N 0.000 description 1
- ZWRDOVYMQAAISL-UWVGGRQHSA-N Gly-Met-Lys Chemical compound CSCC[C@H](NC(=O)CN)C(=O)N[C@H](C(O)=O)CCCCN ZWRDOVYMQAAISL-UWVGGRQHSA-N 0.000 description 1
- QGDOOCIPHSSADO-STQMWFEESA-N Gly-Met-Phe Chemical compound [H]NCC(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QGDOOCIPHSSADO-STQMWFEESA-N 0.000 description 1
- LXTRSHQLGYINON-DTWKUNHWSA-N Gly-Met-Pro Chemical compound CSCC[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN LXTRSHQLGYINON-DTWKUNHWSA-N 0.000 description 1
- WZSHYFGOLPXPLL-RYUDHWBXSA-N Gly-Phe-Glu Chemical compound NCC(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@@H](CCC(O)=O)C(O)=O WZSHYFGOLPXPLL-RYUDHWBXSA-N 0.000 description 1
- 108010009504 Gly-Phe-Leu-Gly Proteins 0.000 description 1
- YLEIWGJJBFBFHC-KBPBESRZSA-N Gly-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 YLEIWGJJBFBFHC-KBPBESRZSA-N 0.000 description 1
- IEGFSKKANYKBDU-QWHCGFSZSA-N Gly-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)CN)C(=O)O IEGFSKKANYKBDU-QWHCGFSZSA-N 0.000 description 1
- QAMMIGULQSIRCD-IRXDYDNUSA-N Gly-Phe-Tyr Chemical compound C([C@H](NC(=O)C[NH3+])C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C([O-])=O)C1=CC=CC=C1 QAMMIGULQSIRCD-IRXDYDNUSA-N 0.000 description 1
- VDCRBJACQKOSMS-JSGCOSHPSA-N Gly-Phe-Val Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O VDCRBJACQKOSMS-JSGCOSHPSA-N 0.000 description 1
- GGLIDLCEPDHEJO-BQBZGAKWSA-N Gly-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)CN GGLIDLCEPDHEJO-BQBZGAKWSA-N 0.000 description 1
- JYPCXBJRLBHWME-IUCAKERBSA-N Gly-Pro-Arg Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O JYPCXBJRLBHWME-IUCAKERBSA-N 0.000 description 1
- HJARVELKOSZUEW-YUMQZZPRSA-N Gly-Pro-Gln Chemical compound [H]NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O HJARVELKOSZUEW-YUMQZZPRSA-N 0.000 description 1
- JJGBXTYGTKWGAT-YUMQZZPRSA-N Gly-Pro-Glu Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O JJGBXTYGTKWGAT-YUMQZZPRSA-N 0.000 description 1
- NSVOVKWEKGEOQB-LURJTMIESA-N Gly-Pro-Gly Chemical compound NCC(=O)N1CCC[C@H]1C(=O)NCC(O)=O NSVOVKWEKGEOQB-LURJTMIESA-N 0.000 description 1
- ZZJVYSAQQMDIRD-UWVGGRQHSA-N Gly-Pro-His Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O ZZJVYSAQQMDIRD-UWVGGRQHSA-N 0.000 description 1
- HAOUOFNNJJLVNS-BQBZGAKWSA-N Gly-Pro-Ser Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O HAOUOFNNJJLVNS-BQBZGAKWSA-N 0.000 description 1
- JNGHLWWFPGIJER-STQMWFEESA-N Gly-Pro-Tyr Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 JNGHLWWFPGIJER-STQMWFEESA-N 0.000 description 1
- BMWFDYIYBAFROD-WPRPVWTQSA-N Gly-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN BMWFDYIYBAFROD-WPRPVWTQSA-N 0.000 description 1
- IRJWAYCXIYUHQE-WHFBIAKZSA-N Gly-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)CN IRJWAYCXIYUHQE-WHFBIAKZSA-N 0.000 description 1
- YOBGUCWZPXJHTN-BQBZGAKWSA-N Gly-Ser-Arg Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YOBGUCWZPXJHTN-BQBZGAKWSA-N 0.000 description 1
- OHUKZZYSJBKFRR-WHFBIAKZSA-N Gly-Ser-Asp Chemical compound [H]NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O OHUKZZYSJBKFRR-WHFBIAKZSA-N 0.000 description 1
- LBDXVCBAJJNJNN-WHFBIAKZSA-N Gly-Ser-Cys Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(O)=O LBDXVCBAJJNJNN-WHFBIAKZSA-N 0.000 description 1
- FGPLUIQCSKGLTI-WDSKDSINSA-N Gly-Ser-Glu Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O FGPLUIQCSKGLTI-WDSKDSINSA-N 0.000 description 1
- FFJQHWKSGAWSTJ-BFHQHQDPSA-N Gly-Thr-Ala Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O FFJQHWKSGAWSTJ-BFHQHQDPSA-N 0.000 description 1
- NVTPVQLIZCOJFK-FOHZUACHSA-N Gly-Thr-Asp Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O NVTPVQLIZCOJFK-FOHZUACHSA-N 0.000 description 1
- ZZWUYQXMIFTIIY-WEDXCCLWSA-N Gly-Thr-Leu Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O ZZWUYQXMIFTIIY-WEDXCCLWSA-N 0.000 description 1
- XHVONGZZVUUORG-WEDXCCLWSA-N Gly-Thr-Lys Chemical compound NCC(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CCCCN XHVONGZZVUUORG-WEDXCCLWSA-N 0.000 description 1
- TVTZEOHWHUVYCG-KYNKHSRBSA-N Gly-Thr-Thr Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O TVTZEOHWHUVYCG-KYNKHSRBSA-N 0.000 description 1
- CUVBTVWFVIIDOC-YEPSODPASA-N Gly-Thr-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)CN CUVBTVWFVIIDOC-YEPSODPASA-N 0.000 description 1
- GULGDABMYTYMJZ-STQMWFEESA-N Gly-Trp-Asp Chemical compound [H]NCC(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(O)=O)C(O)=O GULGDABMYTYMJZ-STQMWFEESA-N 0.000 description 1
- WSLHFAFASQFMSK-SFTDATJTSA-N Gly-Trp-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CC=3C4=CC=CC=C4NC=3)NC(=O)CN)C(O)=O)=CNC2=C1 WSLHFAFASQFMSK-SFTDATJTSA-N 0.000 description 1
- HQSKKSLNLSTONK-JTQLQIEISA-N Gly-Tyr-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 HQSKKSLNLSTONK-JTQLQIEISA-N 0.000 description 1
- IHDKKJVBLGXLEL-STQMWFEESA-N Gly-Tyr-Met Chemical compound CSCC[C@H](NC(=O)[C@H](Cc1ccc(O)cc1)NC(=O)CN)C(O)=O IHDKKJVBLGXLEL-STQMWFEESA-N 0.000 description 1
- LYZYGGWCBLBDMC-QWHCGFSZSA-N Gly-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)CN)C(=O)O LYZYGGWCBLBDMC-QWHCGFSZSA-N 0.000 description 1
- DNAZKGFYFRGZIH-QWRGUYRKSA-N Gly-Tyr-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 DNAZKGFYFRGZIH-QWRGUYRKSA-N 0.000 description 1
- DUAWRXXTOQOECJ-JSGCOSHPSA-N Gly-Tyr-Val Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O DUAWRXXTOQOECJ-JSGCOSHPSA-N 0.000 description 1
- DNVDEMWIYLVIQU-RCOVLWMOSA-N Gly-Val-Asp Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O DNVDEMWIYLVIQU-RCOVLWMOSA-N 0.000 description 1
- DKJWUIYLMLUBDX-XPUUQOCRSA-N Gly-Val-Cys Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CS)C(=O)O DKJWUIYLMLUBDX-XPUUQOCRSA-N 0.000 description 1
- SYOJVRNQCXYEOV-XVKPBYJWSA-N Gly-Val-Glu Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SYOJVRNQCXYEOV-XVKPBYJWSA-N 0.000 description 1
- RYAOJUMWLWUGNW-QMMMGPOBSA-N Gly-Val-Gly Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O RYAOJUMWLWUGNW-QMMMGPOBSA-N 0.000 description 1
- SBVMXEZQJVUARN-XPUUQOCRSA-N Gly-Val-Ser Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O SBVMXEZQJVUARN-XPUUQOCRSA-N 0.000 description 1
- COZMNNJEGNPDED-HOCLYGCPSA-N Gly-Val-Trp Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O COZMNNJEGNPDED-HOCLYGCPSA-N 0.000 description 1
- 239000004471 Glycine Substances 0.000 description 1
- 101150112743 HSPA5 gene Proteins 0.000 description 1
- 101100295959 Halobacterium salinarum (strain ATCC 700922 / JCM 11081 / NRC-1) arcB gene Proteins 0.000 description 1
- 241000055915 Heterocoma lanuginosa Species 0.000 description 1
- 229920000209 Hexadimethrine bromide Polymers 0.000 description 1
- BIAKMWKJMQLZOJ-ZKWXMUAHSA-N His-Ala-Ala Chemical compound C[C@H](NC(=O)[C@H](C)NC(=O)[C@@H](N)Cc1cnc[nH]1)C(O)=O BIAKMWKJMQLZOJ-ZKWXMUAHSA-N 0.000 description 1
- XINDHUAGVGCNSF-QSFUFRPTSA-N His-Ala-Ile Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O XINDHUAGVGCNSF-QSFUFRPTSA-N 0.000 description 1
- VCDNHBNNPCDBKV-DLOVCJGASA-N His-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N VCDNHBNNPCDBKV-DLOVCJGASA-N 0.000 description 1
- FLUVGKKRRMLNPU-CQDKDKBSSA-N His-Ala-Phe Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O FLUVGKKRRMLNPU-CQDKDKBSSA-N 0.000 description 1
- AWASVTXPTOLPPP-MBLNEYKQSA-N His-Ala-Thr Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O AWASVTXPTOLPPP-MBLNEYKQSA-N 0.000 description 1
- TVQGUFGDVODUIF-LSJOCFKGSA-N His-Arg-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC1=CN=CN1)N TVQGUFGDVODUIF-LSJOCFKGSA-N 0.000 description 1
- PDSUIXMZYNURGI-AVGNSLFASA-N His-Arg-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CC1=CN=CN1 PDSUIXMZYNURGI-AVGNSLFASA-N 0.000 description 1
- SYMSVYVUSPSAAO-IHRRRGAJSA-N His-Arg-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O SYMSVYVUSPSAAO-IHRRRGAJSA-N 0.000 description 1
- AVQOSMRPITVTRB-CIUDSAMLSA-N His-Asn-Asn Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N AVQOSMRPITVTRB-CIUDSAMLSA-N 0.000 description 1
- XJQDHFMUUBRCGA-KKUMJFAQSA-N His-Asn-Phe Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O XJQDHFMUUBRCGA-KKUMJFAQSA-N 0.000 description 1
- HRGGKHFHRSFSDE-CIUDSAMLSA-N His-Asn-Ser Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N HRGGKHFHRSFSDE-CIUDSAMLSA-N 0.000 description 1
- ZJSMFRTVYSLKQU-DJFWLOJKSA-N His-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC1=CN=CN1)N ZJSMFRTVYSLKQU-DJFWLOJKSA-N 0.000 description 1
- YOSQCYUFZGPIPC-PBCZWWQYSA-N His-Asp-Thr Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O YOSQCYUFZGPIPC-PBCZWWQYSA-N 0.000 description 1
- IDQKGZWUPVOGPZ-GUBZILKMSA-N His-Cys-Gln Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N IDQKGZWUPVOGPZ-GUBZILKMSA-N 0.000 description 1
- AASLOGQZZKZWKH-SRVKXCTJSA-N His-Cys-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N AASLOGQZZKZWKH-SRVKXCTJSA-N 0.000 description 1
- QSLKWWDKIXMWJV-SRVKXCTJSA-N His-Cys-Lys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)O)N QSLKWWDKIXMWJV-SRVKXCTJSA-N 0.000 description 1
- IIVZNQCUUMBBKF-GVXVVHGQSA-N His-Gln-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CC1=CN=CN1 IIVZNQCUUMBBKF-GVXVVHGQSA-N 0.000 description 1
- FIMNVXRZGUAGBI-AVGNSLFASA-N His-Glu-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O FIMNVXRZGUAGBI-AVGNSLFASA-N 0.000 description 1
- JCOSMKPAOYDKRO-AVGNSLFASA-N His-Glu-Lys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N JCOSMKPAOYDKRO-AVGNSLFASA-N 0.000 description 1
- WEIYKCOEVBUJQC-JYJNAYRXSA-N His-Glu-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC2=CN=CN2)N WEIYKCOEVBUJQC-JYJNAYRXSA-N 0.000 description 1
- FDQYIRHBVVUTJF-ZETCQYMHSA-N His-Gly-Gly Chemical compound [O-]C(=O)CNC(=O)CNC(=O)[C@@H]([NH3+])CC1=CN=CN1 FDQYIRHBVVUTJF-ZETCQYMHSA-N 0.000 description 1
- NQKRILCJYCASDV-QWRGUYRKSA-N His-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CN=CN1 NQKRILCJYCASDV-QWRGUYRKSA-N 0.000 description 1
- BDFCIKANUNMFGB-PMVVWTBXSA-N His-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CN=CN1 BDFCIKANUNMFGB-PMVVWTBXSA-N 0.000 description 1
- NTXIJPDAHXSHNL-ONGXEEELSA-N His-Gly-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O NTXIJPDAHXSHNL-ONGXEEELSA-N 0.000 description 1
- KWBISLAEQZUYIC-UWJYBYFXSA-N His-His-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC2=CN=CN2)N KWBISLAEQZUYIC-UWJYBYFXSA-N 0.000 description 1
- UVDDTHLDZBMBAV-SRVKXCTJSA-N His-His-Cys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)N[C@@H](CS)C(=O)O)N UVDDTHLDZBMBAV-SRVKXCTJSA-N 0.000 description 1
- BRZQWIIFIKTJDH-VGDYDELISA-N His-Ile-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N BRZQWIIFIKTJDH-VGDYDELISA-N 0.000 description 1
- MPXGJGBXCRQQJE-MXAVVETBSA-N His-Ile-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O MPXGJGBXCRQQJE-MXAVVETBSA-N 0.000 description 1
- JENKOCSDMSVWPY-SRVKXCTJSA-N His-Leu-Asn Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O JENKOCSDMSVWPY-SRVKXCTJSA-N 0.000 description 1
- VFBZWZXKCVBTJR-SRVKXCTJSA-N His-Leu-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N VFBZWZXKCVBTJR-SRVKXCTJSA-N 0.000 description 1
- OQDLKDUVMTUPPG-AVGNSLFASA-N His-Leu-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O OQDLKDUVMTUPPG-AVGNSLFASA-N 0.000 description 1
- LVXFNTIIGOQBMD-SRVKXCTJSA-N His-Leu-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O LVXFNTIIGOQBMD-SRVKXCTJSA-N 0.000 description 1
- TWROVBNEHJSXDG-IHRRRGAJSA-N His-Leu-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O TWROVBNEHJSXDG-IHRRRGAJSA-N 0.000 description 1
- BSVLMPMIXPQNKC-KBPBESRZSA-N His-Phe-Gly Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(O)=O BSVLMPMIXPQNKC-KBPBESRZSA-N 0.000 description 1
- HBGKOLSGLYMWSW-DCAQKATOSA-N His-Pro-Cys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC2=CN=CN2)N)C(=O)N[C@@H](CS)C(=O)O HBGKOLSGLYMWSW-DCAQKATOSA-N 0.000 description 1
- QCBYAHHNOHBXIH-UWVGGRQHSA-N His-Pro-Gly Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)NCC(O)=O)C1=CN=CN1 QCBYAHHNOHBXIH-UWVGGRQHSA-N 0.000 description 1
- WCHONUZTYDQMBY-PYJNHQTQSA-N His-Pro-Ile Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WCHONUZTYDQMBY-PYJNHQTQSA-N 0.000 description 1
- LNDVNHOSZQPJGI-AVGNSLFASA-N His-Pro-Pro Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N1[C@@H](CCC1)C(O)=O)C1=CN=CN1 LNDVNHOSZQPJGI-AVGNSLFASA-N 0.000 description 1
- OWYIDJCNRWRSJY-QTKMDUPCSA-N His-Pro-Thr Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O OWYIDJCNRWRSJY-QTKMDUPCSA-N 0.000 description 1
- DGLAHESNTJWGDO-SRVKXCTJSA-N His-Ser-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N DGLAHESNTJWGDO-SRVKXCTJSA-N 0.000 description 1
- XVZJRZQIHJMUBG-TUBUOCAGSA-N His-Thr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](CC1=CN=CN1)N XVZJRZQIHJMUBG-TUBUOCAGSA-N 0.000 description 1
- AHEBIAHEZWQVHB-QTKMDUPCSA-N His-Thr-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N)O AHEBIAHEZWQVHB-QTKMDUPCSA-N 0.000 description 1
- UWSMZKRTOZEGDD-CUJWVEQBSA-N His-Thr-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O UWSMZKRTOZEGDD-CUJWVEQBSA-N 0.000 description 1
- FRDFAWHTPDKRHG-ULQDDVLXSA-N His-Tyr-Arg Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)C1=CN=CN1 FRDFAWHTPDKRHG-ULQDDVLXSA-N 0.000 description 1
- PZUZIHRPOVVHOT-KBPBESRZSA-N His-Tyr-Gly Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)NCC(O)=O)C1=CN=CN1 PZUZIHRPOVVHOT-KBPBESRZSA-N 0.000 description 1
- JATYGDHMDRAISQ-KKUMJFAQSA-N His-Tyr-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O JATYGDHMDRAISQ-KKUMJFAQSA-N 0.000 description 1
- KFQDSSNYWKZFOO-LSJOCFKGSA-N His-Val-Ala Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O KFQDSSNYWKZFOO-LSJOCFKGSA-N 0.000 description 1
- WYKXJGWSJUULSL-AVGNSLFASA-N His-Val-Arg Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)Cc1cnc[nH]1)C(=O)N[C@@H](CCCNC(=N)N)C(=O)O WYKXJGWSJUULSL-AVGNSLFASA-N 0.000 description 1
- DRKZDEFADVYTLU-AVGNSLFASA-N His-Val-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O DRKZDEFADVYTLU-AVGNSLFASA-N 0.000 description 1
- 101000779870 Homo sapiens Alpha-amylase 1B Proteins 0.000 description 1
- 101000779869 Homo sapiens Alpha-amylase 1C Proteins 0.000 description 1
- 101000895701 Homo sapiens ER degradation-enhancing alpha-mannosidase-like protein 1 Proteins 0.000 description 1
- 101000836873 Homo sapiens Nucleotide exchange factor SIL1 Proteins 0.000 description 1
- 101000734572 Homo sapiens Phosphoenolpyruvate carboxykinase, cytosolic [GTP] Proteins 0.000 description 1
- 101000579123 Homo sapiens Phosphoglycerate kinase 1 Proteins 0.000 description 1
- 108050009363 Hyaluronidases Proteins 0.000 description 1
- 102000001974 Hyaluronidases Human genes 0.000 description 1
- 101100429194 Hypocrea jecorina (strain QM6a) xyn2 gene Proteins 0.000 description 1
- 101100506040 Hypocrea jecorina cel61a gene Proteins 0.000 description 1
- 241000257176 Hypoderma <fly> Species 0.000 description 1
- NKVZTQVGUNLLQW-JBDRJPRFSA-N Ile-Ala-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(=O)O)N NKVZTQVGUNLLQW-JBDRJPRFSA-N 0.000 description 1
- CISBRYJZMFWOHJ-JBDRJPRFSA-N Ile-Ala-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CS)C(=O)O)N CISBRYJZMFWOHJ-JBDRJPRFSA-N 0.000 description 1
- YKRYHWJRQUSTKG-KBIXCLLPSA-N Ile-Ala-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N YKRYHWJRQUSTKG-KBIXCLLPSA-N 0.000 description 1
- AQCUAZTZSPQJFF-ZKWXMUAHSA-N Ile-Ala-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O AQCUAZTZSPQJFF-ZKWXMUAHSA-N 0.000 description 1
- RWIKBYVJQAJYDP-BJDJZHNGSA-N Ile-Ala-Lys Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN RWIKBYVJQAJYDP-BJDJZHNGSA-N 0.000 description 1
- TZCGZYWNIDZZMR-NAKRPEOUSA-N Ile-Arg-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](C)C(=O)O)N TZCGZYWNIDZZMR-NAKRPEOUSA-N 0.000 description 1
- VZIFYHYNQDIPLI-HJWJTTGWSA-N Ile-Arg-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N VZIFYHYNQDIPLI-HJWJTTGWSA-N 0.000 description 1
- AZEYWPUCOYXFOE-CYDGBPFRSA-N Ile-Arg-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](C(C)C)C(=O)O)N AZEYWPUCOYXFOE-CYDGBPFRSA-N 0.000 description 1
- YKRIXHPEIZUDDY-GMOBBJLQSA-N Ile-Asn-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YKRIXHPEIZUDDY-GMOBBJLQSA-N 0.000 description 1
- PJLLMGWWINYQPB-PEFMBERDSA-N Ile-Asn-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N PJLLMGWWINYQPB-PEFMBERDSA-N 0.000 description 1
- UAVQIQOOBXFKRC-BYULHYEWSA-N Ile-Asn-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O UAVQIQOOBXFKRC-BYULHYEWSA-N 0.000 description 1
- XENGULNPUDGALZ-ZPFDUUQYSA-N Ile-Asn-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(C)C)C(=O)O)N XENGULNPUDGALZ-ZPFDUUQYSA-N 0.000 description 1
- UKTUOMWSJPXODT-GUDRVLHUSA-N Ile-Asn-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N UKTUOMWSJPXODT-GUDRVLHUSA-N 0.000 description 1
- HDODQNPMSHDXJT-GHCJXIJMSA-N Ile-Asn-Ser Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O HDODQNPMSHDXJT-GHCJXIJMSA-N 0.000 description 1
- NBJAAWYRLGCJOF-UGYAYLCHSA-N Ile-Asp-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N NBJAAWYRLGCJOF-UGYAYLCHSA-N 0.000 description 1
- IDAHFEPYTJJZFD-PEFMBERDSA-N Ile-Asp-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N IDAHFEPYTJJZFD-PEFMBERDSA-N 0.000 description 1
- BGZIJZJBXRVBGJ-SXTJYALSSA-N Ile-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N BGZIJZJBXRVBGJ-SXTJYALSSA-N 0.000 description 1
- RGSOCXHDOPQREB-ZPFDUUQYSA-N Ile-Asp-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N RGSOCXHDOPQREB-ZPFDUUQYSA-N 0.000 description 1
- CCHSQWLCOOZREA-GMOBBJLQSA-N Ile-Asp-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCSC)C(=O)O)N CCHSQWLCOOZREA-GMOBBJLQSA-N 0.000 description 1
- REJKOQYVFDEZHA-SLBDDTMCSA-N Ile-Asp-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N REJKOQYVFDEZHA-SLBDDTMCSA-N 0.000 description 1
- LDRALPZEVHVXEK-KBIXCLLPSA-N Ile-Cys-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N LDRALPZEVHVXEK-KBIXCLLPSA-N 0.000 description 1
- CYHJCEKUMCNDFG-LAEOZQHASA-N Ile-Gln-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)NCC(=O)O)N CYHJCEKUMCNDFG-LAEOZQHASA-N 0.000 description 1
- DMZOUKXXHJQPTL-GRLWGSQLSA-N Ile-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N DMZOUKXXHJQPTL-GRLWGSQLSA-N 0.000 description 1
- DVRDRICMWUSCBN-UKJIMTQDSA-N Ile-Gln-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](C(C)C)C(=O)O)N DVRDRICMWUSCBN-UKJIMTQDSA-N 0.000 description 1
- BEWFWZRGBDVXRP-PEFMBERDSA-N Ile-Glu-Asn Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O BEWFWZRGBDVXRP-PEFMBERDSA-N 0.000 description 1
- PHIXPNQDGGILMP-YVNDNENWSA-N Ile-Glu-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N PHIXPNQDGGILMP-YVNDNENWSA-N 0.000 description 1
- MTFVYKQRLXYAQN-LAEOZQHASA-N Ile-Glu-Gly Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O MTFVYKQRLXYAQN-LAEOZQHASA-N 0.000 description 1
- DFJJAVZIHDFOGQ-MNXVOIDGSA-N Ile-Glu-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N DFJJAVZIHDFOGQ-MNXVOIDGSA-N 0.000 description 1
- FUOYNOXRWPJPAN-QEWYBTABSA-N Ile-Glu-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N FUOYNOXRWPJPAN-QEWYBTABSA-N 0.000 description 1
- SPQWWEZBHXHUJN-KBIXCLLPSA-N Ile-Glu-Ser Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O SPQWWEZBHXHUJN-KBIXCLLPSA-N 0.000 description 1
- NZOCIWKZUVUNDW-ZKWXMUAHSA-N Ile-Gly-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O NZOCIWKZUVUNDW-ZKWXMUAHSA-N 0.000 description 1
- SLQVFYWBGNNOTK-BYULHYEWSA-N Ile-Gly-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)N)C(=O)O)N SLQVFYWBGNNOTK-BYULHYEWSA-N 0.000 description 1
- KFVUBLZRFSVDGO-BYULHYEWSA-N Ile-Gly-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O KFVUBLZRFSVDGO-BYULHYEWSA-N 0.000 description 1
- LPFBXFILACZHIB-LAEOZQHASA-N Ile-Gly-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CCC(=O)O)C(=O)O)N LPFBXFILACZHIB-LAEOZQHASA-N 0.000 description 1
- MQFGXJNSUJTXDT-QSFUFRPTSA-N Ile-Gly-Ile Chemical compound N[C@@H]([C@@H](C)CC)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)O MQFGXJNSUJTXDT-QSFUFRPTSA-N 0.000 description 1
- WIZPFZKOFZXDQG-HTFCKZLJSA-N Ile-Ile-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O WIZPFZKOFZXDQG-HTFCKZLJSA-N 0.000 description 1
- PWDSHAAAFXISLE-SXTJYALSSA-N Ile-Ile-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O PWDSHAAAFXISLE-SXTJYALSSA-N 0.000 description 1
- UWLHDGMRWXHFFY-HPCHECBXSA-N Ile-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N1CCC[C@@H]1C(=O)O)N UWLHDGMRWXHFFY-HPCHECBXSA-N 0.000 description 1
- PFPUFNLHBXKPHY-HTFCKZLJSA-N Ile-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(=O)O)N PFPUFNLHBXKPHY-HTFCKZLJSA-N 0.000 description 1
- AXNGDPAKKCEKGY-QPHKQPEJSA-N Ile-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N AXNGDPAKKCEKGY-QPHKQPEJSA-N 0.000 description 1
- KLBVGHCGHUNHEA-BJDJZHNGSA-N Ile-Leu-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)O)N KLBVGHCGHUNHEA-BJDJZHNGSA-N 0.000 description 1
- PKGGWLOLRLOPGK-XUXIUFHCSA-N Ile-Leu-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N PKGGWLOLRLOPGK-XUXIUFHCSA-N 0.000 description 1
- TWYOYAKMLHWMOJ-ZPFDUUQYSA-N Ile-Leu-Asn Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O TWYOYAKMLHWMOJ-ZPFDUUQYSA-N 0.000 description 1
- OUUCIIJSBIBCHB-ZPFDUUQYSA-N Ile-Leu-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O OUUCIIJSBIBCHB-ZPFDUUQYSA-N 0.000 description 1
- HUORUFRRJHELPD-MNXVOIDGSA-N Ile-Leu-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N HUORUFRRJHELPD-MNXVOIDGSA-N 0.000 description 1
- FZWVCYCYWCLQDH-NHCYSSNCSA-N Ile-Leu-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)NCC(=O)O)N FZWVCYCYWCLQDH-NHCYSSNCSA-N 0.000 description 1
- PMMMQRVUMVURGJ-XUXIUFHCSA-N Ile-Leu-Pro Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(O)=O PMMMQRVUMVURGJ-XUXIUFHCSA-N 0.000 description 1
- OVDKXUDMKXAZIV-ZPFDUUQYSA-N Ile-Lys-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)N)C(=O)O)N OVDKXUDMKXAZIV-ZPFDUUQYSA-N 0.000 description 1
- XDUVMJCBYUKNFJ-MXAVVETBSA-N Ile-Lys-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N XDUVMJCBYUKNFJ-MXAVVETBSA-N 0.000 description 1
- IDMNOFVUXYYZPF-DKIMLUQUSA-N Ile-Lys-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N IDMNOFVUXYYZPF-DKIMLUQUSA-N 0.000 description 1
- FFJQAEYLAQMGDL-MGHWNKPDSA-N Ile-Lys-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 FFJQAEYLAQMGDL-MGHWNKPDSA-N 0.000 description 1
- CIDLJWVDMNDKPT-FIRPJDEBSA-N Ile-Phe-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)O)N CIDLJWVDMNDKPT-FIRPJDEBSA-N 0.000 description 1
- SVZFKLBRCYCIIY-CYDGBPFRSA-N Ile-Pro-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O SVZFKLBRCYCIIY-CYDGBPFRSA-N 0.000 description 1
- OWSWUWDMSNXTNE-GMOBBJLQSA-N Ile-Pro-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)O)C(=O)O)N OWSWUWDMSNXTNE-GMOBBJLQSA-N 0.000 description 1
- IVXJIMGDOYRLQU-XUXIUFHCSA-N Ile-Pro-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O IVXJIMGDOYRLQU-XUXIUFHCSA-N 0.000 description 1
- CIJLNXXMDUOFPH-HJWJTTGWSA-N Ile-Pro-Phe Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 CIJLNXXMDUOFPH-HJWJTTGWSA-N 0.000 description 1
- KTNGVMMGIQWIDV-OSUNSFLBSA-N Ile-Pro-Thr Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O KTNGVMMGIQWIDV-OSUNSFLBSA-N 0.000 description 1
- AKQFLPNANHNTLP-VKOGCVSHSA-N Ile-Pro-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)O)N AKQFLPNANHNTLP-VKOGCVSHSA-N 0.000 description 1
- MLSUZXHSNRBDCI-CYDGBPFRSA-N Ile-Pro-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)O)N MLSUZXHSNRBDCI-CYDGBPFRSA-N 0.000 description 1
- JHNJNTMTZHEDLJ-NAKRPEOUSA-N Ile-Ser-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O JHNJNTMTZHEDLJ-NAKRPEOUSA-N 0.000 description 1
- FBGXMKUWQFPHFB-JBDRJPRFSA-N Ile-Ser-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N FBGXMKUWQFPHFB-JBDRJPRFSA-N 0.000 description 1
- PXKACEXYLPBMAD-JBDRJPRFSA-N Ile-Ser-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)O)N PXKACEXYLPBMAD-JBDRJPRFSA-N 0.000 description 1
- WLRJHVNFGAOYPS-HJPIBITLSA-N Ile-Ser-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N WLRJHVNFGAOYPS-HJPIBITLSA-N 0.000 description 1
- SAEWJTCJQVZQNZ-IUKAMOBKSA-N Ile-Thr-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N SAEWJTCJQVZQNZ-IUKAMOBKSA-N 0.000 description 1
- COWHUQXTSYTKQC-RWRJDSDZSA-N Ile-Thr-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N COWHUQXTSYTKQC-RWRJDSDZSA-N 0.000 description 1
- YBKKLDBBPFIXBQ-MBLNEYKQSA-N Ile-Thr-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(=O)O)N YBKKLDBBPFIXBQ-MBLNEYKQSA-N 0.000 description 1
- GMUYXHHJAGQHGB-TUBUOCAGSA-N Ile-Thr-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N GMUYXHHJAGQHGB-TUBUOCAGSA-N 0.000 description 1
- WXLYNEHOGRYNFU-URLPEUOOSA-N Ile-Thr-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N WXLYNEHOGRYNFU-URLPEUOOSA-N 0.000 description 1
- WCNWGAUZWWSYDG-SVSWQMSJSA-N Ile-Thr-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)O)N WCNWGAUZWWSYDG-SVSWQMSJSA-N 0.000 description 1
- NURNJECQNNCRBK-FLBSBUHZSA-N Ile-Thr-Thr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NURNJECQNNCRBK-FLBSBUHZSA-N 0.000 description 1
- BLFXHAFTNYZEQE-VKOGCVSHSA-N Ile-Trp-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N BLFXHAFTNYZEQE-VKOGCVSHSA-N 0.000 description 1
- PBWMCUAFLPMYPF-ZQINRCPSSA-N Ile-Trp-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N PBWMCUAFLPMYPF-ZQINRCPSSA-N 0.000 description 1
- PRTZQMBYUZFSFA-XEGUGMAKSA-N Ile-Tyr-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)NCC(=O)O)N PRTZQMBYUZFSFA-XEGUGMAKSA-N 0.000 description 1
- GVEODXUBBFDBPW-MGHWNKPDSA-N Ile-Tyr-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)CC1=CC=C(O)C=C1 GVEODXUBBFDBPW-MGHWNKPDSA-N 0.000 description 1
- ZYVTXBXHIKGZMD-QSFUFRPTSA-N Ile-Val-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(=O)N)C(=O)O)N ZYVTXBXHIKGZMD-QSFUFRPTSA-N 0.000 description 1
- AUIYHFRUOOKTGX-UKJIMTQDSA-N Ile-Val-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N AUIYHFRUOOKTGX-UKJIMTQDSA-N 0.000 description 1
- YWCJXQKATPNPOE-UKJIMTQDSA-N Ile-Val-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N YWCJXQKATPNPOE-UKJIMTQDSA-N 0.000 description 1
- JCGMFFQQHJQASB-PYJNHQTQSA-N Ile-Val-His Chemical compound N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)O JCGMFFQQHJQASB-PYJNHQTQSA-N 0.000 description 1
- UYODHPPSCXBNCS-XUXIUFHCSA-N Ile-Val-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC(C)C UYODHPPSCXBNCS-XUXIUFHCSA-N 0.000 description 1
- NJGXXYLPDMMFJB-XUXIUFHCSA-N Ile-Val-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N NJGXXYLPDMMFJB-XUXIUFHCSA-N 0.000 description 1
- ZSESFIFAYQEKRD-CYDGBPFRSA-N Ile-Val-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCSC)C(=O)O)N ZSESFIFAYQEKRD-CYDGBPFRSA-N 0.000 description 1
- JZBVBOKASHNXAD-NAKRPEOUSA-N Ile-Val-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(=O)O)N JZBVBOKASHNXAD-NAKRPEOUSA-N 0.000 description 1
- YHFPHRUWZMEOIX-CYDGBPFRSA-N Ile-Val-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(=O)O)N YHFPHRUWZMEOIX-CYDGBPFRSA-N 0.000 description 1
- 108091092195 Intron Proteins 0.000 description 1
- IBMVEYRWAWIOTN-UHFFFAOYSA-N L-Leucyl-L-Arginyl-L-Proline Natural products CC(C)CC(N)C(=O)NC(CCCN=C(N)N)C(=O)N1CCCC1C(O)=O IBMVEYRWAWIOTN-UHFFFAOYSA-N 0.000 description 1
- QNAYBMKLOCPYGJ-REOHCLBHSA-N L-alanine Chemical compound C[C@H](N)C(O)=O QNAYBMKLOCPYGJ-REOHCLBHSA-N 0.000 description 1
- SITWEMZOJNKJCH-UHFFFAOYSA-N L-alanine-L-arginine Natural products CC(N)C(=O)NC(C(O)=O)CCCNC(N)=N SITWEMZOJNKJCH-UHFFFAOYSA-N 0.000 description 1
- 125000000899 L-alpha-glutamyl group Chemical group [H]N([H])[C@]([H])(C(=O)[*])C([H])([H])C([H])([H])C(O[H])=O 0.000 description 1
- 150000008575 L-amino acids Chemical class 0.000 description 1
- ODKSFYDXXFIFQN-BYPYZUCNSA-N L-arginine Chemical compound OC(=O)[C@@H](N)CCCN=C(N)N ODKSFYDXXFIFQN-BYPYZUCNSA-N 0.000 description 1
- 125000000010 L-asparaginyl group Chemical group O=C([*])[C@](N([H])[H])([H])C([H])([H])C(=O)N([H])[H] 0.000 description 1
- 125000000415 L-cysteinyl group Chemical group O=C([*])[C@@](N([H])[H])([H])C([H])([H])S[H] 0.000 description 1
- AGPKZVBTJJNPAG-WHFBIAKZSA-N L-isoleucine Chemical compound CC[C@H](C)[C@H](N)C(O)=O AGPKZVBTJJNPAG-WHFBIAKZSA-N 0.000 description 1
- 125000002061 L-isoleucyl group Chemical group [H]N([H])[C@]([H])(C(=O)[*])[C@](C([H])([H])[H])([H])C(C([H])([H])[H])([H])[H] 0.000 description 1
- KDXKERNSBIXSRK-YFKPBYRVSA-N L-lysine Chemical compound NCCCC[C@H](N)C(O)=O KDXKERNSBIXSRK-YFKPBYRVSA-N 0.000 description 1
- 125000001176 L-lysyl group Chemical group [H]N([H])[C@]([H])(C(=O)[*])C([H])([H])C([H])([H])C([H])([H])C(N([H])[H])([H])[H] 0.000 description 1
- 125000000393 L-methionino group Chemical group [H]OC(=O)[C@@]([H])(N([H])[*])C([H])([H])C(SC([H])([H])[H])([H])[H] 0.000 description 1
- COLNVLDHVKWLRT-QMMMGPOBSA-N L-phenylalanine Chemical compound OC(=O)[C@@H](N)CC1=CC=CC=C1 COLNVLDHVKWLRT-QMMMGPOBSA-N 0.000 description 1
- 125000002435 L-phenylalanyl group Chemical group O=C([*])[C@](N([H])[H])([H])C([H])([H])C1=C([H])C([H])=C([H])C([H])=C1[H] 0.000 description 1
- 125000000174 L-prolyl group Chemical group [H]N1C([H])([H])C([H])([H])C([H])([H])[C@@]1([H])C(*)=O 0.000 description 1
- 125000002842 L-seryl group Chemical group O=C([*])[C@](N([H])[H])([H])C([H])([H])O[H] 0.000 description 1
- QIVBCDIJIAJPQS-VIFPVBQESA-N L-tryptophane Chemical compound C1=CC=C2C(C[C@H](N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-VIFPVBQESA-N 0.000 description 1
- 125000003580 L-valyl group Chemical group [H]N([H])[C@]([H])(C(=O)[*])C(C([H])([H])[H])(C([H])([H])[H])[H] 0.000 description 1
- GUBGYTABKSRVRQ-QKKXKWKRSA-N Lactose Natural products OC[C@H]1O[C@@H](O[C@H]2[C@H](O)[C@@H](O)C(O)O[C@@H]2CO)[C@H](O)[C@@H](O)[C@H]1O GUBGYTABKSRVRQ-QKKXKWKRSA-N 0.000 description 1
- CZCSUZMIRKFFFA-CIUDSAMLSA-N Leu-Ala-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O CZCSUZMIRKFFFA-CIUDSAMLSA-N 0.000 description 1
- BQSLGJHIAGOZCD-CIUDSAMLSA-N Leu-Ala-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O BQSLGJHIAGOZCD-CIUDSAMLSA-N 0.000 description 1
- HXWALXSAVBLTPK-NUTKFTJISA-N Leu-Ala-Trp Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CC(C)C)N HXWALXSAVBLTPK-NUTKFTJISA-N 0.000 description 1
- SUPVSFFZWVOEOI-UHFFFAOYSA-N Leu-Ala-Tyr Natural products CC(C)CC(N)C(=O)NC(C)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 SUPVSFFZWVOEOI-UHFFFAOYSA-N 0.000 description 1
- REPPKAMYTOJTFC-DCAQKATOSA-N Leu-Arg-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O REPPKAMYTOJTFC-DCAQKATOSA-N 0.000 description 1
- KSZCCRIGNVSHFH-UWVGGRQHSA-N Leu-Arg-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O KSZCCRIGNVSHFH-UWVGGRQHSA-N 0.000 description 1
- FJUKMPUELVROGK-IHRRRGAJSA-N Leu-Arg-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N FJUKMPUELVROGK-IHRRRGAJSA-N 0.000 description 1
- YOZCKMXHBYKOMQ-IHRRRGAJSA-N Leu-Arg-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O)N YOZCKMXHBYKOMQ-IHRRRGAJSA-N 0.000 description 1
- IBMVEYRWAWIOTN-RWMBFGLXSA-N Leu-Arg-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1CCC[C@@H]1C(O)=O IBMVEYRWAWIOTN-RWMBFGLXSA-N 0.000 description 1
- STAVRDQLZOTNKJ-RHYQMDGZSA-N Leu-Arg-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O STAVRDQLZOTNKJ-RHYQMDGZSA-N 0.000 description 1
- IGUOAYLTQJLPPD-DCAQKATOSA-N Leu-Asn-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IGUOAYLTQJLPPD-DCAQKATOSA-N 0.000 description 1
- VCSBGUACOYUIGD-CIUDSAMLSA-N Leu-Asn-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O VCSBGUACOYUIGD-CIUDSAMLSA-N 0.000 description 1
- KKXDHFKZWKLYGB-GUBZILKMSA-N Leu-Asn-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N KKXDHFKZWKLYGB-GUBZILKMSA-N 0.000 description 1
- OIARJGNVARWKFP-YUMQZZPRSA-N Leu-Asn-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O OIARJGNVARWKFP-YUMQZZPRSA-N 0.000 description 1
- WGNOPSQMIQERPK-UHFFFAOYSA-N Leu-Asn-Pro Natural products CC(C)CC(N)C(=O)NC(CC(=O)N)C(=O)N1CCCC1C(=O)O WGNOPSQMIQERPK-UHFFFAOYSA-N 0.000 description 1
- OGCQGUIWMSBHRZ-CIUDSAMLSA-N Leu-Asn-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O OGCQGUIWMSBHRZ-CIUDSAMLSA-N 0.000 description 1
- FIJMQLGQLBLBOL-HJGDQZAQSA-N Leu-Asn-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FIJMQLGQLBLBOL-HJGDQZAQSA-N 0.000 description 1
- TWQIYNGNYNJUFM-NHCYSSNCSA-N Leu-Asn-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O TWQIYNGNYNJUFM-NHCYSSNCSA-N 0.000 description 1
- FGNQZXKVAZIMCI-CIUDSAMLSA-N Leu-Asp-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N FGNQZXKVAZIMCI-CIUDSAMLSA-N 0.000 description 1
- ILJREDZFPHTUIE-GUBZILKMSA-N Leu-Asp-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ILJREDZFPHTUIE-GUBZILKMSA-N 0.000 description 1
- ULXYQAJWJGLCNR-YUMQZZPRSA-N Leu-Asp-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O ULXYQAJWJGLCNR-YUMQZZPRSA-N 0.000 description 1
- MYGQXVYRZMKRDB-SRVKXCTJSA-N Leu-Asp-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN MYGQXVYRZMKRDB-SRVKXCTJSA-N 0.000 description 1
- MMEDVBWCMGRKKC-GARJFASQSA-N Leu-Asp-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N MMEDVBWCMGRKKC-GARJFASQSA-N 0.000 description 1
- PNUCWVAGVNLUMW-CIUDSAMLSA-N Leu-Cys-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O PNUCWVAGVNLUMW-CIUDSAMLSA-N 0.000 description 1
- HQPHMEPBNUHPKD-XIRDDKMYSA-N Leu-Cys-Trp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N HQPHMEPBNUHPKD-XIRDDKMYSA-N 0.000 description 1
- WCTCIIAGNMFYAO-DCAQKATOSA-N Leu-Cys-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(O)=O WCTCIIAGNMFYAO-DCAQKATOSA-N 0.000 description 1
- VQPPIMUZCZCOIL-GUBZILKMSA-N Leu-Gln-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O VQPPIMUZCZCOIL-GUBZILKMSA-N 0.000 description 1
- DPWGZWUMUUJQDT-IUCAKERBSA-N Leu-Gln-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O DPWGZWUMUUJQDT-IUCAKERBSA-N 0.000 description 1
- LOLUPZNNADDTAA-AVGNSLFASA-N Leu-Gln-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LOLUPZNNADDTAA-AVGNSLFASA-N 0.000 description 1
- CQGSYZCULZMEDE-SRVKXCTJSA-N Leu-Gln-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(O)=O CQGSYZCULZMEDE-SRVKXCTJSA-N 0.000 description 1
- CQGSYZCULZMEDE-UHFFFAOYSA-N Leu-Gln-Pro Natural products CC(C)CC(N)C(=O)NC(CCC(N)=O)C(=O)N1CCCC1C(O)=O CQGSYZCULZMEDE-UHFFFAOYSA-N 0.000 description 1
- GPICTNQYKHHHTH-GUBZILKMSA-N Leu-Gln-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O GPICTNQYKHHHTH-GUBZILKMSA-N 0.000 description 1
- RVVBWTWPNFDYBE-SRVKXCTJSA-N Leu-Glu-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O RVVBWTWPNFDYBE-SRVKXCTJSA-N 0.000 description 1
- KVMULWOHPPMHHE-DCAQKATOSA-N Leu-Glu-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O KVMULWOHPPMHHE-DCAQKATOSA-N 0.000 description 1
- WIDZHJTYKYBLSR-DCAQKATOSA-N Leu-Glu-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WIDZHJTYKYBLSR-DCAQKATOSA-N 0.000 description 1
- NEEOBPIXKWSBRF-IUCAKERBSA-N Leu-Glu-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O NEEOBPIXKWSBRF-IUCAKERBSA-N 0.000 description 1
- QVFGXCVIXXBFHO-AVGNSLFASA-N Leu-Glu-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O QVFGXCVIXXBFHO-AVGNSLFASA-N 0.000 description 1
- ZFNLIDNJUWNIJL-WDCWCFNPSA-N Leu-Glu-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZFNLIDNJUWNIJL-WDCWCFNPSA-N 0.000 description 1
- HVJVUYQWFYMGJS-GVXVVHGQSA-N Leu-Glu-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O HVJVUYQWFYMGJS-GVXVVHGQSA-N 0.000 description 1
- FMEICTQWUKNAGC-YUMQZZPRSA-N Leu-Gly-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O FMEICTQWUKNAGC-YUMQZZPRSA-N 0.000 description 1
- QJUWBDPGGYVRHY-YUMQZZPRSA-N Leu-Gly-Cys Chemical compound CC(C)C[C@@H](C(=O)NCC(=O)N[C@@H](CS)C(=O)O)N QJUWBDPGGYVRHY-YUMQZZPRSA-N 0.000 description 1
- VWHGTYCRDRBSFI-ZETCQYMHSA-N Leu-Gly-Gly Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)NCC(O)=O VWHGTYCRDRBSFI-ZETCQYMHSA-N 0.000 description 1
- CCQLQKZTXZBXTN-NHCYSSNCSA-N Leu-Gly-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O CCQLQKZTXZBXTN-NHCYSSNCSA-N 0.000 description 1
- APFJUBGRZGMQFF-QWRGUYRKSA-N Leu-Gly-Lys Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCCN APFJUBGRZGMQFF-QWRGUYRKSA-N 0.000 description 1
- YFBBUHJJUXXZOF-UWVGGRQHSA-N Leu-Gly-Pro Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N1CCC[C@H]1C(O)=O YFBBUHJJUXXZOF-UWVGGRQHSA-N 0.000 description 1
- HYMLKESRWLZDBR-WEDXCCLWSA-N Leu-Gly-Thr Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O HYMLKESRWLZDBR-WEDXCCLWSA-N 0.000 description 1
- CFZZDVMBRYFFNU-QWRGUYRKSA-N Leu-His-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)NCC(O)=O CFZZDVMBRYFFNU-QWRGUYRKSA-N 0.000 description 1
- DBSLVQBXKVKDKJ-BJDJZHNGSA-N Leu-Ile-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O DBSLVQBXKVKDKJ-BJDJZHNGSA-N 0.000 description 1
- KOSWSHVQIVTVQF-ZPFDUUQYSA-N Leu-Ile-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O KOSWSHVQIVTVQF-ZPFDUUQYSA-N 0.000 description 1
- ORWTWZXGDBYVCP-BJDJZHNGSA-N Leu-Ile-Cys Chemical compound SC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC(C)C ORWTWZXGDBYVCP-BJDJZHNGSA-N 0.000 description 1
- HGFGEMSVBMCFKK-MNXVOIDGSA-N Leu-Ile-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O HGFGEMSVBMCFKK-MNXVOIDGSA-N 0.000 description 1
- QJXHMYMRGDOHRU-NHCYSSNCSA-N Leu-Ile-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O QJXHMYMRGDOHRU-NHCYSSNCSA-N 0.000 description 1
- SEMUSFOBZGKBGW-YTFOTSKYSA-N Leu-Ile-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O SEMUSFOBZGKBGW-YTFOTSKYSA-N 0.000 description 1
- OMHLATXVNQSALM-FQUUOJAGSA-N Leu-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(C)C)N OMHLATXVNQSALM-FQUUOJAGSA-N 0.000 description 1
- HRTRLSRYZZKPCO-BJDJZHNGSA-N Leu-Ile-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O HRTRLSRYZZKPCO-BJDJZHNGSA-N 0.000 description 1
- LIINDKYIGYTDLG-PPCPHDFISA-N Leu-Ile-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LIINDKYIGYTDLG-PPCPHDFISA-N 0.000 description 1
- DSFYPIUSAMSERP-IHRRRGAJSA-N Leu-Leu-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N DSFYPIUSAMSERP-IHRRRGAJSA-N 0.000 description 1
- PDQDCFBVYXEFSD-SRVKXCTJSA-N Leu-Leu-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O PDQDCFBVYXEFSD-SRVKXCTJSA-N 0.000 description 1
- JNDYEOUZBLOVOF-AVGNSLFASA-N Leu-Leu-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O JNDYEOUZBLOVOF-AVGNSLFASA-N 0.000 description 1
- QNBVTHNJGCOVFA-AVGNSLFASA-N Leu-Leu-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O QNBVTHNJGCOVFA-AVGNSLFASA-N 0.000 description 1
- YOKVEHGYYQEQOP-QWRGUYRKSA-N Leu-Leu-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O YOKVEHGYYQEQOP-QWRGUYRKSA-N 0.000 description 1
- XVZCXCTYGHPNEM-UHFFFAOYSA-N Leu-Leu-Pro Natural products CC(C)CC(N)C(=O)NC(CC(C)C)C(=O)N1CCCC1C(O)=O XVZCXCTYGHPNEM-UHFFFAOYSA-N 0.000 description 1
- IEWBEPKLKUXQBU-VOAKCMCISA-N Leu-Leu-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IEWBEPKLKUXQBU-VOAKCMCISA-N 0.000 description 1
- RZXLZBIUTDQHJQ-SRVKXCTJSA-N Leu-Lys-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O RZXLZBIUTDQHJQ-SRVKXCTJSA-N 0.000 description 1
- BGZCJDGBBUUBHA-KKUMJFAQSA-N Leu-Lys-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O BGZCJDGBBUUBHA-KKUMJFAQSA-N 0.000 description 1
- KPYAOIVPJKPIOU-KKUMJFAQSA-N Leu-Lys-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O KPYAOIVPJKPIOU-KKUMJFAQSA-N 0.000 description 1
- QNTJIDXQHWUBKC-BZSNNMDCSA-N Leu-Lys-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QNTJIDXQHWUBKC-BZSNNMDCSA-N 0.000 description 1
- RTIRBWJPYJYTLO-MELADBBJSA-N Leu-Lys-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(=O)O)N RTIRBWJPYJYTLO-MELADBBJSA-N 0.000 description 1
- ZDBMWELMUCLUPL-QEJZJMRPSA-N Leu-Phe-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=CC=C1 ZDBMWELMUCLUPL-QEJZJMRPSA-N 0.000 description 1
- PJWOOBTYQNNRBF-BZSNNMDCSA-N Leu-Phe-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)O)N PJWOOBTYQNNRBF-BZSNNMDCSA-N 0.000 description 1
- RRVCZCNFXIFGRA-DCAQKATOSA-N Leu-Pro-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O RRVCZCNFXIFGRA-DCAQKATOSA-N 0.000 description 1
- YUTNOGOMBNYPFH-XUXIUFHCSA-N Leu-Pro-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O YUTNOGOMBNYPFH-XUXIUFHCSA-N 0.000 description 1
- XXXXOVFBXRERQL-ULQDDVLXSA-N Leu-Pro-Phe Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 XXXXOVFBXRERQL-ULQDDVLXSA-N 0.000 description 1
- DPURXCQCHSQPAN-AVGNSLFASA-N Leu-Pro-Pro Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DPURXCQCHSQPAN-AVGNSLFASA-N 0.000 description 1
- IDGZVZJLYFTXSL-DCAQKATOSA-N Leu-Ser-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IDGZVZJLYFTXSL-DCAQKATOSA-N 0.000 description 1
- ADJWHHZETYAAAX-SRVKXCTJSA-N Leu-Ser-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N ADJWHHZETYAAAX-SRVKXCTJSA-N 0.000 description 1
- MVHXGBZUJLWZOH-BJDJZHNGSA-N Leu-Ser-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MVHXGBZUJLWZOH-BJDJZHNGSA-N 0.000 description 1
- XOWMDXHFSBCAKQ-SRVKXCTJSA-N Leu-Ser-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(C)C XOWMDXHFSBCAKQ-SRVKXCTJSA-N 0.000 description 1
- SBANPBVRHYIMRR-GARJFASQSA-N Leu-Ser-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N SBANPBVRHYIMRR-GARJFASQSA-N 0.000 description 1
- SVBJIZVVYJYGLA-DCAQKATOSA-N Leu-Ser-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O SVBJIZVVYJYGLA-DCAQKATOSA-N 0.000 description 1
- ZDJQVSIPFLMNOX-RHYQMDGZSA-N Leu-Thr-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N ZDJQVSIPFLMNOX-RHYQMDGZSA-N 0.000 description 1
- ICYRCNICGBJLGM-HJGDQZAQSA-N Leu-Thr-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(O)=O ICYRCNICGBJLGM-HJGDQZAQSA-N 0.000 description 1
- LCNASHSOFMRYFO-WDCWCFNPSA-N Leu-Thr-Gln Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCC(N)=O LCNASHSOFMRYFO-WDCWCFNPSA-N 0.000 description 1
- VDIARPPNADFEAV-WEDXCCLWSA-N Leu-Thr-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O VDIARPPNADFEAV-WEDXCCLWSA-N 0.000 description 1
- WGAZVKFCPHXZLO-SZMVWBNQSA-N Leu-Trp-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N WGAZVKFCPHXZLO-SZMVWBNQSA-N 0.000 description 1
- SUYRAPCRSCCPAK-VFAJRCTISA-N Leu-Trp-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SUYRAPCRSCCPAK-VFAJRCTISA-N 0.000 description 1
- UCRJTSIIAYHOHE-ULQDDVLXSA-N Leu-Tyr-Arg Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N UCRJTSIIAYHOHE-ULQDDVLXSA-N 0.000 description 1
- VJGQRELPQWNURN-JYJNAYRXSA-N Leu-Tyr-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O VJGQRELPQWNURN-JYJNAYRXSA-N 0.000 description 1
- WFCKERTZVCQXKH-KBPBESRZSA-N Leu-Tyr-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(O)=O WFCKERTZVCQXKH-KBPBESRZSA-N 0.000 description 1
- JGKHAFUAPZCCDU-BZSNNMDCSA-N Leu-Tyr-Leu Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C([O-])=O)CC1=CC=C(O)C=C1 JGKHAFUAPZCCDU-BZSNNMDCSA-N 0.000 description 1
- RDFIVFHPOSOXMW-ACRUOGEOSA-N Leu-Tyr-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O RDFIVFHPOSOXMW-ACRUOGEOSA-N 0.000 description 1
- AXVIGSRGTMNSJU-YESZJQIVSA-N Leu-Tyr-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N2CCC[C@@H]2C(=O)O)N AXVIGSRGTMNSJU-YESZJQIVSA-N 0.000 description 1
- SEOXPEFQEOYURL-PMVMPFDFSA-N Leu-Tyr-Trp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O SEOXPEFQEOYURL-PMVMPFDFSA-N 0.000 description 1
- VQHUBNVKFFLWRP-ULQDDVLXSA-N Leu-Tyr-Val Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CC1=CC=C(O)C=C1 VQHUBNVKFFLWRP-ULQDDVLXSA-N 0.000 description 1
- MVJRBCJCRYGCKV-GVXVVHGQSA-N Leu-Val-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O MVJRBCJCRYGCKV-GVXVVHGQSA-N 0.000 description 1
- FDBTVENULFNTAL-XQQFMLRXSA-N Leu-Val-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N FDBTVENULFNTAL-XQQFMLRXSA-N 0.000 description 1
- QESXLSQLQHHTIX-RHYQMDGZSA-N Leu-Val-Thr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QESXLSQLQHHTIX-RHYQMDGZSA-N 0.000 description 1
- ROHFNLRQFUQHCH-UHFFFAOYSA-N Leucine Natural products CC(C)CC(N)C(O)=O ROHFNLRQFUQHCH-UHFFFAOYSA-N 0.000 description 1
- 102000003820 Lipoxygenases Human genes 0.000 description 1
- 108090000128 Lipoxygenases Proteins 0.000 description 1
- FZIJIFCXUCZHOL-CIUDSAMLSA-N Lys-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN FZIJIFCXUCZHOL-CIUDSAMLSA-N 0.000 description 1
- MPGHETGWWWUHPY-CIUDSAMLSA-N Lys-Ala-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN MPGHETGWWWUHPY-CIUDSAMLSA-N 0.000 description 1
- XFIHDSBIPWEYJJ-YUMQZZPRSA-N Lys-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN XFIHDSBIPWEYJJ-YUMQZZPRSA-N 0.000 description 1
- WQWZXKWOEVSGQM-DCAQKATOSA-N Lys-Ala-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN WQWZXKWOEVSGQM-DCAQKATOSA-N 0.000 description 1
- IXHKPDJKKCUKHS-GARJFASQSA-N Lys-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N IXHKPDJKKCUKHS-GARJFASQSA-N 0.000 description 1
- IRNSXVOWSXSULE-DCAQKATOSA-N Lys-Ala-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN IRNSXVOWSXSULE-DCAQKATOSA-N 0.000 description 1
- JGAMUXDWYSXYLM-SRVKXCTJSA-N Lys-Arg-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O JGAMUXDWYSXYLM-SRVKXCTJSA-N 0.000 description 1
- DGAAQRAUOFHBFJ-CIUDSAMLSA-N Lys-Asn-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O DGAAQRAUOFHBFJ-CIUDSAMLSA-N 0.000 description 1
- ABHIXYDMILIUKV-CIUDSAMLSA-N Lys-Asn-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ABHIXYDMILIUKV-CIUDSAMLSA-N 0.000 description 1
- YKIRNDPUWONXQN-GUBZILKMSA-N Lys-Asn-Gln Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N YKIRNDPUWONXQN-GUBZILKMSA-N 0.000 description 1
- HQVDJTYKCMIWJP-YUMQZZPRSA-N Lys-Asn-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O HQVDJTYKCMIWJP-YUMQZZPRSA-N 0.000 description 1
- FACUGMGEFUEBTI-SRVKXCTJSA-N Lys-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CCCCN FACUGMGEFUEBTI-SRVKXCTJSA-N 0.000 description 1
- LZWNAOIMTLNMDW-NHCYSSNCSA-N Lys-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCCN)N LZWNAOIMTLNMDW-NHCYSSNCSA-N 0.000 description 1
- OVIVOCSURJYCTM-GUBZILKMSA-N Lys-Asp-Glu Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(O)=O OVIVOCSURJYCTM-GUBZILKMSA-N 0.000 description 1
- IWWMPCPLFXFBAF-SRVKXCTJSA-N Lys-Asp-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O IWWMPCPLFXFBAF-SRVKXCTJSA-N 0.000 description 1
- NRQRKMYZONPCTM-CIUDSAMLSA-N Lys-Asp-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O NRQRKMYZONPCTM-CIUDSAMLSA-N 0.000 description 1
- KWUKZRFFKPLUPE-HJGDQZAQSA-N Lys-Asp-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KWUKZRFFKPLUPE-HJGDQZAQSA-N 0.000 description 1
- GKFNXYMAMKJSKD-NHCYSSNCSA-N Lys-Asp-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O GKFNXYMAMKJSKD-NHCYSSNCSA-N 0.000 description 1
- RLZDUFRBMQNYIJ-YUMQZZPRSA-N Lys-Cys-Gly Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CS)C(=O)NCC(=O)O)N RLZDUFRBMQNYIJ-YUMQZZPRSA-N 0.000 description 1
- BYEBKXRNDLTGFW-CIUDSAMLSA-N Lys-Cys-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O BYEBKXRNDLTGFW-CIUDSAMLSA-N 0.000 description 1
- DFXQCCBKGUNYGG-GUBZILKMSA-N Lys-Gln-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CCCCN DFXQCCBKGUNYGG-GUBZILKMSA-N 0.000 description 1
- YFGWNAROEYWGNL-GUBZILKMSA-N Lys-Gln-Asn Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O YFGWNAROEYWGNL-GUBZILKMSA-N 0.000 description 1
- GGNOBVSOZPHLCE-GUBZILKMSA-N Lys-Gln-Asp Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O GGNOBVSOZPHLCE-GUBZILKMSA-N 0.000 description 1
- HWMZUBUEOYAQSC-DCAQKATOSA-N Lys-Gln-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O HWMZUBUEOYAQSC-DCAQKATOSA-N 0.000 description 1
- RZHLIPMZXOEJTL-AVGNSLFASA-N Lys-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCCN)N RZHLIPMZXOEJTL-AVGNSLFASA-N 0.000 description 1
- PGBPWPTUOSCNLE-JYJNAYRXSA-N Lys-Gln-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCCN)N PGBPWPTUOSCNLE-JYJNAYRXSA-N 0.000 description 1
- NNCDAORZCMPZPX-GUBZILKMSA-N Lys-Gln-Ser Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N NNCDAORZCMPZPX-GUBZILKMSA-N 0.000 description 1
- DRCILAJNUJKAHC-SRVKXCTJSA-N Lys-Glu-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O DRCILAJNUJKAHC-SRVKXCTJSA-N 0.000 description 1
- ZXEUFAVXODIPHC-GUBZILKMSA-N Lys-Glu-Asn Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ZXEUFAVXODIPHC-GUBZILKMSA-N 0.000 description 1
- LLSUNJYOSCOOEB-GUBZILKMSA-N Lys-Glu-Asp Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O LLSUNJYOSCOOEB-GUBZILKMSA-N 0.000 description 1
- DCRWPTBMWMGADO-AVGNSLFASA-N Lys-Glu-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O DCRWPTBMWMGADO-AVGNSLFASA-N 0.000 description 1
- ULUQBUKAPDUKOC-GVXVVHGQSA-N Lys-Glu-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O ULUQBUKAPDUKOC-GVXVVHGQSA-N 0.000 description 1
- GPJGFSFYBJGYRX-YUMQZZPRSA-N Lys-Gly-Asp Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O GPJGFSFYBJGYRX-YUMQZZPRSA-N 0.000 description 1
- DTUZCYRNEJDKSR-NHCYSSNCSA-N Lys-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCCCN DTUZCYRNEJDKSR-NHCYSSNCSA-N 0.000 description 1
- NKKFVJRLCCUJNA-QWRGUYRKSA-N Lys-Gly-Lys Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCCN NKKFVJRLCCUJNA-QWRGUYRKSA-N 0.000 description 1
- RFQATBGBLDAKGI-VHSXEESVSA-N Lys-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CCCCN)N)C(=O)O RFQATBGBLDAKGI-VHSXEESVSA-N 0.000 description 1
- CANPXOLVTMKURR-WEDXCCLWSA-N Lys-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCCCN CANPXOLVTMKURR-WEDXCCLWSA-N 0.000 description 1
- ZASPELYMPSACER-HOCLYGCPSA-N Lys-Gly-Trp Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O ZASPELYMPSACER-HOCLYGCPSA-N 0.000 description 1
- OWRUUFUVXFREBD-KKUMJFAQSA-N Lys-His-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(O)=O OWRUUFUVXFREBD-KKUMJFAQSA-N 0.000 description 1
- HQXSFFSLXFHWOX-IXOXFDKPSA-N Lys-His-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCCCN)N)O HQXSFFSLXFHWOX-IXOXFDKPSA-N 0.000 description 1
- SPCHLZUWJTYZFC-IHRRRGAJSA-N Lys-His-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C(C)C)C(O)=O SPCHLZUWJTYZFC-IHRRRGAJSA-N 0.000 description 1
- SLQJJFAVWSZLBL-BJDJZHNGSA-N Lys-Ile-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCCCN SLQJJFAVWSZLBL-BJDJZHNGSA-N 0.000 description 1
- MXMDJEJWERYPMO-XUXIUFHCSA-N Lys-Ile-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O MXMDJEJWERYPMO-XUXIUFHCSA-N 0.000 description 1
- QBEPTBMRQALPEV-MNXVOIDGSA-N Lys-Ile-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCCCN QBEPTBMRQALPEV-MNXVOIDGSA-N 0.000 description 1
- NCZIQZYZPUPMKY-PPCPHDFISA-N Lys-Ile-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NCZIQZYZPUPMKY-PPCPHDFISA-N 0.000 description 1
- WAIHHELKYSFIQN-XUXIUFHCSA-N Lys-Ile-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O WAIHHELKYSFIQN-XUXIUFHCSA-N 0.000 description 1
- OVAOHZIOUBEQCJ-IHRRRGAJSA-N Lys-Leu-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O OVAOHZIOUBEQCJ-IHRRRGAJSA-N 0.000 description 1
- SKRGVGLIRUGANF-AVGNSLFASA-N Lys-Leu-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SKRGVGLIRUGANF-AVGNSLFASA-N 0.000 description 1
- AIRZWUMAHCDDHR-KKUMJFAQSA-N Lys-Leu-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O AIRZWUMAHCDDHR-KKUMJFAQSA-N 0.000 description 1
- XOQMURBBIXRRCR-SRVKXCTJSA-N Lys-Lys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCCN XOQMURBBIXRRCR-SRVKXCTJSA-N 0.000 description 1
- RIJCHEVHFWMDKD-SRVKXCTJSA-N Lys-Lys-Asn Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O RIJCHEVHFWMDKD-SRVKXCTJSA-N 0.000 description 1
- HVAUKHLDSDDROB-KKUMJFAQSA-N Lys-Lys-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O HVAUKHLDSDDROB-KKUMJFAQSA-N 0.000 description 1
- MTBLFIQZECOEBY-IHRRRGAJSA-N Lys-Met-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(O)=O MTBLFIQZECOEBY-IHRRRGAJSA-N 0.000 description 1
- ZJSZPXISKMDJKQ-JYJNAYRXSA-N Lys-Phe-Glu Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCC(O)=O)C(O)=O)CC1=CC=CC=C1 ZJSZPXISKMDJKQ-JYJNAYRXSA-N 0.000 description 1
- MSSJJDVQTFTLIF-KBPBESRZSA-N Lys-Phe-Gly Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](Cc1ccccc1)C(=O)NCC(O)=O MSSJJDVQTFTLIF-KBPBESRZSA-N 0.000 description 1
- LMGNWHDWJDIOPK-DKIMLUQUSA-N Lys-Phe-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LMGNWHDWJDIOPK-DKIMLUQUSA-N 0.000 description 1
- LNMKRJJLEFASGA-BZSNNMDCSA-N Lys-Phe-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O LNMKRJJLEFASGA-BZSNNMDCSA-N 0.000 description 1
- AZOFEHCPMBRNFD-BZSNNMDCSA-N Lys-Phe-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCCCN)C(O)=O)CC1=CC=CC=C1 AZOFEHCPMBRNFD-BZSNNMDCSA-N 0.000 description 1
- CENKQZWVYMLRAX-ULQDDVLXSA-N Lys-Phe-Met Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(O)=O CENKQZWVYMLRAX-ULQDDVLXSA-N 0.000 description 1
- WLXGMVVHTIUPHE-ULQDDVLXSA-N Lys-Phe-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O WLXGMVVHTIUPHE-ULQDDVLXSA-N 0.000 description 1
- PDIDTSZKKFEDMB-UWVGGRQHSA-N Lys-Pro-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O PDIDTSZKKFEDMB-UWVGGRQHSA-N 0.000 description 1
- UQJOKDAYFULYIX-AVGNSLFASA-N Lys-Pro-Pro Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 UQJOKDAYFULYIX-AVGNSLFASA-N 0.000 description 1
- LOGFVTREOLYCPF-RHYQMDGZSA-N Lys-Pro-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCCCN LOGFVTREOLYCPF-RHYQMDGZSA-N 0.000 description 1
- WQDKIVRHTQYJSN-DCAQKATOSA-N Lys-Ser-Arg Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N WQDKIVRHTQYJSN-DCAQKATOSA-N 0.000 description 1
- GHKXHCMRAUYLBS-CIUDSAMLSA-N Lys-Ser-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O GHKXHCMRAUYLBS-CIUDSAMLSA-N 0.000 description 1
- JOSAKOKSPXROGQ-BJDJZHNGSA-N Lys-Ser-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JOSAKOKSPXROGQ-BJDJZHNGSA-N 0.000 description 1
- IOQWIOPSKJOEKI-SRVKXCTJSA-N Lys-Ser-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O IOQWIOPSKJOEKI-SRVKXCTJSA-N 0.000 description 1
- WZVSHTFTCYOFPL-GARJFASQSA-N Lys-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CCCCN)N)C(=O)O WZVSHTFTCYOFPL-GARJFASQSA-N 0.000 description 1
- DIBZLYZXTSVGLN-CIUDSAMLSA-N Lys-Ser-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O DIBZLYZXTSVGLN-CIUDSAMLSA-N 0.000 description 1
- MIFFFXHMAHFACR-KATARQTJSA-N Lys-Ser-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CCCCN MIFFFXHMAHFACR-KATARQTJSA-N 0.000 description 1
- PLOUVAYOMTYJRG-JXUBOQSCSA-N Lys-Thr-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O PLOUVAYOMTYJRG-JXUBOQSCSA-N 0.000 description 1
- JHNOXVASMSXSNB-WEDXCCLWSA-N Lys-Thr-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O JHNOXVASMSXSNB-WEDXCCLWSA-N 0.000 description 1
- BDFHWFUAQLIMJO-KXNHARMFSA-N Lys-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N)O BDFHWFUAQLIMJO-KXNHARMFSA-N 0.000 description 1
- RYOLKFYZBHMYFW-WDSOQIARSA-N Lys-Trp-Arg Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)CCCCN)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)=CNC2=C1 RYOLKFYZBHMYFW-WDSOQIARSA-N 0.000 description 1
- BIWVMACFGZFIEB-VFAJRCTISA-N Lys-Trp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CCCCN)N)O BIWVMACFGZFIEB-VFAJRCTISA-N 0.000 description 1
- ZVZRQKJOQQAFCF-ULQDDVLXSA-N Lys-Tyr-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ZVZRQKJOQQAFCF-ULQDDVLXSA-N 0.000 description 1
- PELXPRPDQRFBGQ-KKUMJFAQSA-N Lys-Tyr-Asn Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCCN)N)O PELXPRPDQRFBGQ-KKUMJFAQSA-N 0.000 description 1
- RQILLQOQXLZTCK-KBPBESRZSA-N Lys-Tyr-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(O)=O RQILLQOQXLZTCK-KBPBESRZSA-N 0.000 description 1
- WINFHLHJTRGLCV-BZSNNMDCSA-N Lys-Tyr-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCCCN)C(O)=O)CC1=CC=C(O)C=C1 WINFHLHJTRGLCV-BZSNNMDCSA-N 0.000 description 1
- NQOQDINRVQCAKD-ULQDDVLXSA-N Lys-Tyr-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CCCCN)N NQOQDINRVQCAKD-ULQDDVLXSA-N 0.000 description 1
- USPJSTBDIGJPFK-PMVMPFDFSA-N Lys-Tyr-Trp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O USPJSTBDIGJPFK-PMVMPFDFSA-N 0.000 description 1
- UGCIQUYEJIEHKX-GVXVVHGQSA-N Lys-Val-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O UGCIQUYEJIEHKX-GVXVVHGQSA-N 0.000 description 1
- BWECSLVQIWEMSC-IHRRRGAJSA-N Lys-Val-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCCCN)N BWECSLVQIWEMSC-IHRRRGAJSA-N 0.000 description 1
- NYTDJEZBAAFLLG-IHRRRGAJSA-N Lys-Val-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(O)=O NYTDJEZBAAFLLG-IHRRRGAJSA-N 0.000 description 1
- TXTZMVNJIRZABH-ULQDDVLXSA-N Lys-Val-Phe Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 TXTZMVNJIRZABH-ULQDDVLXSA-N 0.000 description 1
- GILLQRYAWOMHED-DCAQKATOSA-N Lys-Val-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN GILLQRYAWOMHED-DCAQKATOSA-N 0.000 description 1
- 239000004472 Lysine Substances 0.000 description 1
- 101150068888 MET3 gene Proteins 0.000 description 1
- YRAWWKUTNBILNT-FXQIFTODSA-N Met-Ala-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O YRAWWKUTNBILNT-FXQIFTODSA-N 0.000 description 1
- QEVRUYFHWJJUHZ-DCAQKATOSA-N Met-Ala-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(C)C QEVRUYFHWJJUHZ-DCAQKATOSA-N 0.000 description 1
- QAHFGYLFLVGBNW-DCAQKATOSA-N Met-Ala-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN QAHFGYLFLVGBNW-DCAQKATOSA-N 0.000 description 1
- ULNXMMYXQKGNPG-LPEHRKFASA-N Met-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCSC)N ULNXMMYXQKGNPG-LPEHRKFASA-N 0.000 description 1
- WXHHTBVYQOSYSL-FXQIFTODSA-N Met-Ala-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O WXHHTBVYQOSYSL-FXQIFTODSA-N 0.000 description 1
- DTICLBJHRYSJLH-GUBZILKMSA-N Met-Ala-Val Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O DTICLBJHRYSJLH-GUBZILKMSA-N 0.000 description 1
- CWFYZYQMUDWGTI-GUBZILKMSA-N Met-Arg-Asp Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O CWFYZYQMUDWGTI-GUBZILKMSA-N 0.000 description 1
- SQUTUWHAAWJYES-GUBZILKMSA-N Met-Asp-Arg Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O SQUTUWHAAWJYES-GUBZILKMSA-N 0.000 description 1
- GODBLDDYHFTUAH-CIUDSAMLSA-N Met-Asp-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(O)=O GODBLDDYHFTUAH-CIUDSAMLSA-N 0.000 description 1
- FVKRQMQQFGBXHV-QXEWZRGKSA-N Met-Asp-Val Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O FVKRQMQQFGBXHV-QXEWZRGKSA-N 0.000 description 1
- GTRWUQSSISWRTL-NAKRPEOUSA-N Met-Cys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCSC)N GTRWUQSSISWRTL-NAKRPEOUSA-N 0.000 description 1
- MYKLINMAGAIRPJ-CIUDSAMLSA-N Met-Gln-Asn Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O MYKLINMAGAIRPJ-CIUDSAMLSA-N 0.000 description 1
- VOOINLQYUZOREH-SRVKXCTJSA-N Met-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCSC)N VOOINLQYUZOREH-SRVKXCTJSA-N 0.000 description 1
- IUYCGMNKIZDRQI-BQBZGAKWSA-N Met-Gly-Ala Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O IUYCGMNKIZDRQI-BQBZGAKWSA-N 0.000 description 1
- YCUSPBPZVJDMII-YUMQZZPRSA-N Met-Gly-Glu Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O YCUSPBPZVJDMII-YUMQZZPRSA-N 0.000 description 1
- UZWMJZSOXGOVIN-LURJTMIESA-N Met-Gly-Gly Chemical compound CSCC[C@H](N)C(=O)NCC(=O)NCC(O)=O UZWMJZSOXGOVIN-LURJTMIESA-N 0.000 description 1
- OCRSGGIJBDUXHU-WDSOQIARSA-N Met-Leu-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CCSC)C(O)=O)=CNC2=C1 OCRSGGIJBDUXHU-WDSOQIARSA-N 0.000 description 1
- WPTHAGXMYDRPFD-SRVKXCTJSA-N Met-Lys-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O WPTHAGXMYDRPFD-SRVKXCTJSA-N 0.000 description 1
- VQILILSLEFDECU-GUBZILKMSA-N Met-Pro-Ala Chemical compound [H]N[C@@H](CCSC)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O VQILILSLEFDECU-GUBZILKMSA-N 0.000 description 1
- QLESZRANMSYLCZ-CYDGBPFRSA-N Met-Pro-Ile Chemical compound [H]N[C@@H](CCSC)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O QLESZRANMSYLCZ-CYDGBPFRSA-N 0.000 description 1
- NSMXRFMGZYTFEX-KJEVXHAQSA-N Met-Thr-Tyr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CCSC)N)O NSMXRFMGZYTFEX-KJEVXHAQSA-N 0.000 description 1
- ZBLSZPYQQRIHQU-RCWTZXSCSA-N Met-Thr-Val Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O ZBLSZPYQQRIHQU-RCWTZXSCSA-N 0.000 description 1
- FAKYXUOUQCRGMO-FDARSICLSA-N Met-Trp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CCSC)N FAKYXUOUQCRGMO-FDARSICLSA-N 0.000 description 1
- NBEFNGUZUOUGFG-KKUMJFAQSA-N Met-Tyr-Gln Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N NBEFNGUZUOUGFG-KKUMJFAQSA-N 0.000 description 1
- YGNUDKAPJARTEM-GUBZILKMSA-N Met-Val-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O YGNUDKAPJARTEM-GUBZILKMSA-N 0.000 description 1
- VYDLZDRMOFYOGV-TUAOUCFPSA-N Met-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCSC)N VYDLZDRMOFYOGV-TUAOUCFPSA-N 0.000 description 1
- 108020005196 Mitochondrial DNA Proteins 0.000 description 1
- 102100021339 Multidrug resistance-associated protein 1 Human genes 0.000 description 1
- 241000699670 Mus sp. Species 0.000 description 1
- 101710135898 Myc proto-oncogene protein Proteins 0.000 description 1
- 102100038895 Myc proto-oncogene protein Human genes 0.000 description 1
- WUGMRIBZSVSJNP-UHFFFAOYSA-N N-L-alanyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C)C(O)=O)=CNC2=C1 WUGMRIBZSVSJNP-UHFFFAOYSA-N 0.000 description 1
- XZFYRXDAULDNFX-UHFFFAOYSA-N N-L-cysteinyl-L-phenylalanine Natural products SCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XZFYRXDAULDNFX-UHFFFAOYSA-N 0.000 description 1
- AJHCSUXXECOXOY-UHFFFAOYSA-N N-glycyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)CN)C(O)=O)=CNC2=C1 AJHCSUXXECOXOY-UHFFFAOYSA-N 0.000 description 1
- 108010066427 N-valyltryptophan Proteins 0.000 description 1
- 108010047562 NGR peptide Proteins 0.000 description 1
- 241000233892 Neocallimastix Species 0.000 description 1
- 101100007538 Neurospora crassa (strain ATCC 24698 / 74-OR23-1A / CBS 708.71 / DSM 1257 / FGSC 987) cpc-1 gene Proteins 0.000 description 1
- 101100022915 Neurospora crassa (strain ATCC 24698 / 74-OR23-1A / CBS 708.71 / DSM 1257 / FGSC 987) cys-11 gene Proteins 0.000 description 1
- 244000070804 Neurospora sitophila Species 0.000 description 1
- 238000000636 Northern blotting Methods 0.000 description 1
- 102100027096 Nucleotide exchange factor SIL1 Human genes 0.000 description 1
- 241001502335 Orpinomyces Species 0.000 description 1
- 241000283973 Oryctolagus cuniculus Species 0.000 description 1
- 125000002288 PGK1 group Chemical group 0.000 description 1
- 241000212370 Panus rudis Species 0.000 description 1
- 102000003992 Peroxidases Human genes 0.000 description 1
- 241000222385 Phanerochaete Species 0.000 description 1
- LSXGADJXBDFXQU-DLOVCJGASA-N Phe-Ala-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 LSXGADJXBDFXQU-DLOVCJGASA-N 0.000 description 1
- CYZBFPYMSJGBRL-DRZSPHRISA-N Phe-Ala-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O CYZBFPYMSJGBRL-DRZSPHRISA-N 0.000 description 1
- FPTXMUIBLMGTQH-ONGXEEELSA-N Phe-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 FPTXMUIBLMGTQH-ONGXEEELSA-N 0.000 description 1
- DFEVBOYEUQJGER-JURCDPSOSA-N Phe-Ala-Ile Chemical compound N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O DFEVBOYEUQJGER-JURCDPSOSA-N 0.000 description 1
- NEHSHYOUIWBYSA-DCPHZVHLSA-N Phe-Ala-Trp Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CC3=CC=CC=C3)N NEHSHYOUIWBYSA-DCPHZVHLSA-N 0.000 description 1
- CGOMLCQJEMWMCE-STQMWFEESA-N Phe-Arg-Gly Chemical compound NC(N)=NCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 CGOMLCQJEMWMCE-STQMWFEESA-N 0.000 description 1
- IWRZUGHCHFZYQZ-UFYCRDLUSA-N Phe-Arg-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 IWRZUGHCHFZYQZ-UFYCRDLUSA-N 0.000 description 1
- LJUUGSWZPQOJKD-JYJNAYRXSA-N Phe-Arg-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H](N)Cc1ccccc1)C(O)=O LJUUGSWZPQOJKD-JYJNAYRXSA-N 0.000 description 1
- QCHNRQQVLJYDSI-DLOVCJGASA-N Phe-Asn-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 QCHNRQQVLJYDSI-DLOVCJGASA-N 0.000 description 1
- HHOOEUSPFGPZFP-QWRGUYRKSA-N Phe-Asn-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O HHOOEUSPFGPZFP-QWRGUYRKSA-N 0.000 description 1
- KAHUBGWSIQNZQQ-KKUMJFAQSA-N Phe-Asn-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 KAHUBGWSIQNZQQ-KKUMJFAQSA-N 0.000 description 1
- WMGVYPPIMZPWPN-SRVKXCTJSA-N Phe-Asp-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N WMGVYPPIMZPWPN-SRVKXCTJSA-N 0.000 description 1
- CSYVXYQDIVCQNU-QWRGUYRKSA-N Phe-Asp-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O CSYVXYQDIVCQNU-QWRGUYRKSA-N 0.000 description 1
- IUVYJBMTHARMIP-PCBIJLKTSA-N Phe-Asp-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O IUVYJBMTHARMIP-PCBIJLKTSA-N 0.000 description 1
- IQXOZIDWLZYYAW-IHRRRGAJSA-N Phe-Asp-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N IQXOZIDWLZYYAW-IHRRRGAJSA-N 0.000 description 1
- FRPVPGRXUKFEQE-YDHLFZDLSA-N Phe-Asp-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O FRPVPGRXUKFEQE-YDHLFZDLSA-N 0.000 description 1
- KOUUGTKGEQZRHV-KKUMJFAQSA-N Phe-Gln-Arg Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O KOUUGTKGEQZRHV-KKUMJFAQSA-N 0.000 description 1
- OPEVYHFJXLCCRT-AVGNSLFASA-N Phe-Gln-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O OPEVYHFJXLCCRT-AVGNSLFASA-N 0.000 description 1
- RLUMIJXNHJVUCO-JBACZVJFSA-N Phe-Gln-Trp Chemical compound C([C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(O)=O)C1=CC=CC=C1 RLUMIJXNHJVUCO-JBACZVJFSA-N 0.000 description 1
- FMMIYCMOVGXZIP-AVGNSLFASA-N Phe-Glu-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O FMMIYCMOVGXZIP-AVGNSLFASA-N 0.000 description 1
- PSKRILMFHNIUAO-JYJNAYRXSA-N Phe-Glu-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N PSKRILMFHNIUAO-JYJNAYRXSA-N 0.000 description 1
- HGNGAMWHGGANAU-WHOFXGATSA-N Phe-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 HGNGAMWHGGANAU-WHOFXGATSA-N 0.000 description 1
- WFHRXJOZEXUKLV-IRXDYDNUSA-N Phe-Gly-Tyr Chemical compound C([C@H](N)C(=O)NCC(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 WFHRXJOZEXUKLV-IRXDYDNUSA-N 0.000 description 1
- SWCOXQLDICUYOL-ULQDDVLXSA-N Phe-His-Arg Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O SWCOXQLDICUYOL-ULQDDVLXSA-N 0.000 description 1
- FXYXBEZMRACDDR-KKUMJFAQSA-N Phe-His-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(O)=O FXYXBEZMRACDDR-KKUMJFAQSA-N 0.000 description 1
- BEEVXUYVEHXWRQ-YESZJQIVSA-N Phe-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CC3=CC=CC=C3)N)C(=O)O BEEVXUYVEHXWRQ-YESZJQIVSA-N 0.000 description 1
- FXPZZKBHNOMLGA-HJWJTTGWSA-N Phe-Ile-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N FXPZZKBHNOMLGA-HJWJTTGWSA-N 0.000 description 1
- RSPUIENXSJYZQO-JYJNAYRXSA-N Phe-Leu-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 RSPUIENXSJYZQO-JYJNAYRXSA-N 0.000 description 1
- SMFGCTXUBWEPKM-KBPBESRZSA-N Phe-Leu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 SMFGCTXUBWEPKM-KBPBESRZSA-N 0.000 description 1
- YCCUXNNKXDGMAM-KKUMJFAQSA-N Phe-Leu-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YCCUXNNKXDGMAM-KKUMJFAQSA-N 0.000 description 1
- KNYPNEYICHHLQL-ACRUOGEOSA-N Phe-Leu-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 KNYPNEYICHHLQL-ACRUOGEOSA-N 0.000 description 1
- OQTDZEJJWWAGJT-KKUMJFAQSA-N Phe-Lys-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O OQTDZEJJWWAGJT-KKUMJFAQSA-N 0.000 description 1
- MJAYDXWQQUOURZ-JYJNAYRXSA-N Phe-Lys-Gln Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O MJAYDXWQQUOURZ-JYJNAYRXSA-N 0.000 description 1
- ZUQACJLOHYRVPJ-DKIMLUQUSA-N Phe-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=CC=C1 ZUQACJLOHYRVPJ-DKIMLUQUSA-N 0.000 description 1
- DOXQMJCSSYZSNM-BZSNNMDCSA-N Phe-Lys-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O DOXQMJCSSYZSNM-BZSNNMDCSA-N 0.000 description 1
- KLXQWABNAWDRAY-ACRUOGEOSA-N Phe-Lys-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 KLXQWABNAWDRAY-ACRUOGEOSA-N 0.000 description 1
- BNRFQGLWLQESBG-YESZJQIVSA-N Phe-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O BNRFQGLWLQESBG-YESZJQIVSA-N 0.000 description 1
- JKJSIYKSGIDHPM-WBAXXEDZSA-N Phe-Phe-Ala Chemical compound C[C@H](NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@@H](N)Cc1ccccc1)C(O)=O JKJSIYKSGIDHPM-WBAXXEDZSA-N 0.000 description 1
- KAJLHCWRWDSROH-BZSNNMDCSA-N Phe-Phe-Asp Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CC(O)=O)C(O)=O)C1=CC=CC=C1 KAJLHCWRWDSROH-BZSNNMDCSA-N 0.000 description 1
- ROOQMPCUFLDOSB-FHWLQOOXSA-N Phe-Phe-Gln Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CCC(N)=O)C(O)=O)C1=CC=CC=C1 ROOQMPCUFLDOSB-FHWLQOOXSA-N 0.000 description 1
- RYQWALWYQWBUKN-FHWLQOOXSA-N Phe-Phe-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O RYQWALWYQWBUKN-FHWLQOOXSA-N 0.000 description 1
- IWZRODDWOSIXPZ-IRXDYDNUSA-N Phe-Phe-Gly Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)NCC(O)=O)C1=CC=CC=C1 IWZRODDWOSIXPZ-IRXDYDNUSA-N 0.000 description 1
- TXJJXEXCZBHDNA-ACRUOGEOSA-N Phe-Phe-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)N[C@@H](CC3=CN=CN3)C(=O)O)N TXJJXEXCZBHDNA-ACRUOGEOSA-N 0.000 description 1
- RBRNEFJTEHPDSL-ACRUOGEOSA-N Phe-Phe-Lys Chemical compound C([C@@H](C(=O)N[C@@H](CCCCN)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 RBRNEFJTEHPDSL-ACRUOGEOSA-N 0.000 description 1
- WKLMCMXFMQEKCX-SLFFLAALSA-N Phe-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CC3=CC=CC=C3)N)C(=O)O WKLMCMXFMQEKCX-SLFFLAALSA-N 0.000 description 1
- JLLJTMHNXQTMCK-UBHSHLNASA-N Phe-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC1=CC=CC=C1 JLLJTMHNXQTMCK-UBHSHLNASA-N 0.000 description 1
- ZVRJWDUPIDMHDN-ULQDDVLXSA-N Phe-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC1=CC=CC=C1 ZVRJWDUPIDMHDN-ULQDDVLXSA-N 0.000 description 1
- CKJACGQPCPMWIT-UFYCRDLUSA-N Phe-Pro-Phe Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 CKJACGQPCPMWIT-UFYCRDLUSA-N 0.000 description 1
- BSJCSHIAMSGQGN-BVSLBCMMSA-N Phe-Pro-Trp Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)N)C(=O)N[C@@H](CC3=CNC4=CC=CC=C43)C(=O)O BSJCSHIAMSGQGN-BVSLBCMMSA-N 0.000 description 1
- ODGNUUUDJONJSC-UFYCRDLUSA-N Phe-Pro-Tyr Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)N)C(=O)N[C@@H](CC3=CC=C(C=C3)O)C(=O)O ODGNUUUDJONJSC-UFYCRDLUSA-N 0.000 description 1
- BONHGTUEEPIMPM-AVGNSLFASA-N Phe-Ser-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O BONHGTUEEPIMPM-AVGNSLFASA-N 0.000 description 1
- ILGCZYGFYQLSDZ-KKUMJFAQSA-N Phe-Ser-His Chemical compound N[C@@H](Cc1ccccc1)C(=O)N[C@@H](CO)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O ILGCZYGFYQLSDZ-KKUMJFAQSA-N 0.000 description 1
- UNBFGVQVQGXXCK-KKUMJFAQSA-N Phe-Ser-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O UNBFGVQVQGXXCK-KKUMJFAQSA-N 0.000 description 1
- RAGOJJCBGXARPO-XVSYOHENSA-N Phe-Thr-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 RAGOJJCBGXARPO-XVSYOHENSA-N 0.000 description 1
- BPIMVBKDLSBKIJ-FCLVOEFKSA-N Phe-Thr-Phe Chemical compound C([C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 BPIMVBKDLSBKIJ-FCLVOEFKSA-N 0.000 description 1
- SHUFSZDAIPLZLF-BEAPCOKYSA-N Phe-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N)O SHUFSZDAIPLZLF-BEAPCOKYSA-N 0.000 description 1
- MSSXKZBDKZAHCX-UNQGMJICSA-N Phe-Thr-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O MSSXKZBDKZAHCX-UNQGMJICSA-N 0.000 description 1
- APECKGGXAXNFLL-RNXOBYDBSA-N Phe-Trp-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 APECKGGXAXNFLL-RNXOBYDBSA-N 0.000 description 1
- NHHZWPNMYQUNEH-ACRUOGEOSA-N Phe-Tyr-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CC3=CN=CN3)C(=O)O)N NHHZWPNMYQUNEH-ACRUOGEOSA-N 0.000 description 1
- DBNGDEAQXGFGRA-ACRUOGEOSA-N Phe-Tyr-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CCCCN)C(=O)O)N DBNGDEAQXGFGRA-ACRUOGEOSA-N 0.000 description 1
- MHNBYYFXWDUGBW-RPTUDFQQSA-N Phe-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC2=CC=CC=C2)N)O MHNBYYFXWDUGBW-RPTUDFQQSA-N 0.000 description 1
- CDHURCQGUDNBMA-UBHSHLNASA-N Phe-Val-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 CDHURCQGUDNBMA-UBHSHLNASA-N 0.000 description 1
- FXEKNHAJIMHRFJ-ULQDDVLXSA-N Phe-Val-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N FXEKNHAJIMHRFJ-ULQDDVLXSA-N 0.000 description 1
- RGMLUHANLDVMPB-ULQDDVLXSA-N Phe-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N RGMLUHANLDVMPB-ULQDDVLXSA-N 0.000 description 1
- MWQXFDIQXIXPMS-UNQGMJICSA-N Phe-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CC1=CC=CC=C1)N)O MWQXFDIQXIXPMS-UNQGMJICSA-N 0.000 description 1
- GAMLAXHLYGLQBJ-UFYCRDLUSA-N Phe-Val-Tyr Chemical compound N[C@H](C(=O)N[C@H](C(=O)N[C@H](C(=O)O)CC1=CC=C(C=C1)O)C(C)C)CC1=CC=CC=C1 GAMLAXHLYGLQBJ-UFYCRDLUSA-N 0.000 description 1
- 102100034796 Phosphoenolpyruvate carboxykinase, cytosolic [GTP] Human genes 0.000 description 1
- 102100028251 Phosphoglycerate kinase 1 Human genes 0.000 description 1
- 108010064785 Phospholipases Proteins 0.000 description 1
- 102000015439 Phospholipases Human genes 0.000 description 1
- 241000235379 Piromyces Species 0.000 description 1
- 108010059820 Polygalacturonase Proteins 0.000 description 1
- DZZCICYRSZASNF-FXQIFTODSA-N Pro-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 DZZCICYRSZASNF-FXQIFTODSA-N 0.000 description 1
- VXCHGLYSIOOZIS-GUBZILKMSA-N Pro-Ala-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 VXCHGLYSIOOZIS-GUBZILKMSA-N 0.000 description 1
- APKRGYLBSCWJJP-FXQIFTODSA-N Pro-Ala-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O APKRGYLBSCWJJP-FXQIFTODSA-N 0.000 description 1
- FZHBZMDRDASUHN-NAKRPEOUSA-N Pro-Ala-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1)C(O)=O FZHBZMDRDASUHN-NAKRPEOUSA-N 0.000 description 1
- IFMDQWDAJUMMJC-DCAQKATOSA-N Pro-Ala-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O IFMDQWDAJUMMJC-DCAQKATOSA-N 0.000 description 1
- XQLBWXHVZVBNJM-FXQIFTODSA-N Pro-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 XQLBWXHVZVBNJM-FXQIFTODSA-N 0.000 description 1
- LCRSGSIRKLXZMZ-BPNCWPANSA-N Pro-Ala-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O LCRSGSIRKLXZMZ-BPNCWPANSA-N 0.000 description 1
- OOLOTUZJUBOMAX-GUBZILKMSA-N Pro-Ala-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O OOLOTUZJUBOMAX-GUBZILKMSA-N 0.000 description 1
- LNLNHXIQPGKRJQ-SRVKXCTJSA-N Pro-Arg-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H]1CCCN1 LNLNHXIQPGKRJQ-SRVKXCTJSA-N 0.000 description 1
- IHCXPSYCHXFXKT-DCAQKATOSA-N Pro-Arg-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O IHCXPSYCHXFXKT-DCAQKATOSA-N 0.000 description 1
- BNBBNGZZKQUWCD-IUCAKERBSA-N Pro-Arg-Gly Chemical compound NC(N)=NCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H]1CCCN1 BNBBNGZZKQUWCD-IUCAKERBSA-N 0.000 description 1
- ICTZKEXYDDZZFP-SRVKXCTJSA-N Pro-Arg-Pro Chemical compound N([C@@H](CCCN=C(N)N)C(=O)N1[C@@H](CCC1)C(O)=O)C(=O)[C@@H]1CCCN1 ICTZKEXYDDZZFP-SRVKXCTJSA-N 0.000 description 1
- WWAQEUOYCYMGHB-FXQIFTODSA-N Pro-Asn-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H]1CCCN1 WWAQEUOYCYMGHB-FXQIFTODSA-N 0.000 description 1
- SMCHPSMKAFIERP-FXQIFTODSA-N Pro-Asn-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@@H]1CCCN1 SMCHPSMKAFIERP-FXQIFTODSA-N 0.000 description 1
- XROLYVMNVIKVEM-BQBZGAKWSA-N Pro-Asn-Gly Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O XROLYVMNVIKVEM-BQBZGAKWSA-N 0.000 description 1
- TXPUNZXZDVJUJQ-LPEHRKFASA-N Pro-Asn-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)N)C(=O)N2CCC[C@@H]2C(=O)O TXPUNZXZDVJUJQ-LPEHRKFASA-N 0.000 description 1
- WPQKSRHDTMRSJM-CIUDSAMLSA-N Pro-Asp-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H]1CCCN1 WPQKSRHDTMRSJM-CIUDSAMLSA-N 0.000 description 1
- VJLJGKQAOQJXJG-CIUDSAMLSA-N Pro-Asp-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O VJLJGKQAOQJXJG-CIUDSAMLSA-N 0.000 description 1
- SGCZFWSQERRKBD-BQBZGAKWSA-N Pro-Asp-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(O)=O)NC(=O)[C@@H]1CCCN1 SGCZFWSQERRKBD-BQBZGAKWSA-N 0.000 description 1
- KIGGUSRFHJCIEJ-DCAQKATOSA-N Pro-Asp-His Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O KIGGUSRFHJCIEJ-DCAQKATOSA-N 0.000 description 1
- SZZBUDVXWZZPDH-BQBZGAKWSA-N Pro-Cys-Gly Chemical compound OC(=O)CNC(=O)[C@H](CS)NC(=O)[C@@H]1CCCN1 SZZBUDVXWZZPDH-BQBZGAKWSA-N 0.000 description 1
- HJSCRFZVGXAGNG-SRVKXCTJSA-N Pro-Gln-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H]1CCCN1 HJSCRFZVGXAGNG-SRVKXCTJSA-N 0.000 description 1
- LANQLYHLMYDWJP-SRVKXCTJSA-N Pro-Gln-Lys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O LANQLYHLMYDWJP-SRVKXCTJSA-N 0.000 description 1
- KTFZQPLSPLWLKN-KKUMJFAQSA-N Pro-Gln-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KTFZQPLSPLWLKN-KKUMJFAQSA-N 0.000 description 1
- LHALYDBUDCWMDY-CIUDSAMLSA-N Pro-Glu-Ala Chemical compound C[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1)C(O)=O LHALYDBUDCWMDY-CIUDSAMLSA-N 0.000 description 1
- FRKBNXCFJBPJOL-GUBZILKMSA-N Pro-Glu-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O FRKBNXCFJBPJOL-GUBZILKMSA-N 0.000 description 1
- WVOXLKUUVCCCSU-ZPFDUUQYSA-N Pro-Glu-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WVOXLKUUVCCCSU-ZPFDUUQYSA-N 0.000 description 1
- NXEYSLRNNPWCRN-SRVKXCTJSA-N Pro-Glu-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NXEYSLRNNPWCRN-SRVKXCTJSA-N 0.000 description 1
- VOZIBWWZSBIXQN-SRVKXCTJSA-N Pro-Glu-Lys Chemical compound NCCCC[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1)C(O)=O VOZIBWWZSBIXQN-SRVKXCTJSA-N 0.000 description 1
- WFHYFCWBLSKEMS-KKUMJFAQSA-N Pro-Glu-Phe Chemical compound N([C@@H](CCC(=O)O)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C(=O)[C@@H]1CCCN1 WFHYFCWBLSKEMS-KKUMJFAQSA-N 0.000 description 1
- LGSANCBHSMDFDY-GARJFASQSA-N Pro-Glu-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCC(=O)O)C(=O)N2CCC[C@@H]2C(=O)O LGSANCBHSMDFDY-GARJFASQSA-N 0.000 description 1
- VPEVBAUSTBWQHN-NHCYSSNCSA-N Pro-Glu-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O VPEVBAUSTBWQHN-NHCYSSNCSA-N 0.000 description 1
- VYWNORHENYEQDW-YUMQZZPRSA-N Pro-Gly-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 VYWNORHENYEQDW-YUMQZZPRSA-N 0.000 description 1
- HAAQQNHQZBOWFO-LURJTMIESA-N Pro-Gly-Gly Chemical compound OC(=O)CNC(=O)CNC(=O)[C@@H]1CCCN1 HAAQQNHQZBOWFO-LURJTMIESA-N 0.000 description 1
- FEPSEIDIPBMIOS-QXEWZRGKSA-N Pro-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 FEPSEIDIPBMIOS-QXEWZRGKSA-N 0.000 description 1
- FKLSMYYLJHYPHH-UWVGGRQHSA-N Pro-Gly-Leu Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O FKLSMYYLJHYPHH-UWVGGRQHSA-N 0.000 description 1
- PEYNRYREGPAOAK-LSJOCFKGSA-N Pro-His-Ala Chemical compound C([C@@H](C(=O)N[C@@H](C)C([O-])=O)NC(=O)[C@H]1[NH2+]CCC1)C1=CN=CN1 PEYNRYREGPAOAK-LSJOCFKGSA-N 0.000 description 1
- BAKAHWWRCCUDAF-IHRRRGAJSA-N Pro-His-Lys Chemical compound C([C@@H](C(=O)N[C@@H](CCCCN)C(O)=O)NC(=O)[C@H]1NCCC1)C1=CN=CN1 BAKAHWWRCCUDAF-IHRRRGAJSA-N 0.000 description 1
- TYMBHHITTMGGPI-NAKRPEOUSA-N Pro-Ile-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@@H]1CCCN1 TYMBHHITTMGGPI-NAKRPEOUSA-N 0.000 description 1
- UREQLMJCKFLLHM-NAKRPEOUSA-N Pro-Ile-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O UREQLMJCKFLLHM-NAKRPEOUSA-N 0.000 description 1
- GURGCNUWVSDYTP-SRVKXCTJSA-N Pro-Leu-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O GURGCNUWVSDYTP-SRVKXCTJSA-N 0.000 description 1
- FYPGHGXAOZTOBO-IHRRRGAJSA-N Pro-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@@H]2CCCN2 FYPGHGXAOZTOBO-IHRRRGAJSA-N 0.000 description 1
- XYSXOCIWCPFOCG-IHRRRGAJSA-N Pro-Leu-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O XYSXOCIWCPFOCG-IHRRRGAJSA-N 0.000 description 1
- MRYUJHGPZQNOAD-IHRRRGAJSA-N Pro-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@@H]1CCCN1 MRYUJHGPZQNOAD-IHRRRGAJSA-N 0.000 description 1
- MCWHYUWXVNRXFV-RWMBFGLXSA-N Pro-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 MCWHYUWXVNRXFV-RWMBFGLXSA-N 0.000 description 1
- SUENWIFTSTWUKD-AVGNSLFASA-N Pro-Leu-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O SUENWIFTSTWUKD-AVGNSLFASA-N 0.000 description 1
- OFGUOWQVEGTVNU-DCAQKATOSA-N Pro-Lys-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O OFGUOWQVEGTVNU-DCAQKATOSA-N 0.000 description 1
- ZLXKLMHAMDENIO-DCAQKATOSA-N Pro-Lys-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O ZLXKLMHAMDENIO-DCAQKATOSA-N 0.000 description 1
- XQPHBAKJJJZOBX-SRVKXCTJSA-N Pro-Lys-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O XQPHBAKJJJZOBX-SRVKXCTJSA-N 0.000 description 1
- RMODQFBNDDENCP-IHRRRGAJSA-N Pro-Lys-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O RMODQFBNDDENCP-IHRRRGAJSA-N 0.000 description 1
- DWGFLKQSGRUQTI-IHRRRGAJSA-N Pro-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H]1CCCN1 DWGFLKQSGRUQTI-IHRRRGAJSA-N 0.000 description 1
- PUQRDHNIOONJJN-AVGNSLFASA-N Pro-Lys-Met Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(O)=O PUQRDHNIOONJJN-AVGNSLFASA-N 0.000 description 1
- SMFQZMGHCODUPQ-ULQDDVLXSA-N Pro-Lys-Phe Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SMFQZMGHCODUPQ-ULQDDVLXSA-N 0.000 description 1
- ANESFYPBAJPYNJ-SDDRHHMPSA-N Pro-Met-Pro Chemical compound CSCC[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 ANESFYPBAJPYNJ-SDDRHHMPSA-N 0.000 description 1
- AUYKOPJPKUCYHE-SRVKXCTJSA-N Pro-Met-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@@H]1CCCN1 AUYKOPJPKUCYHE-SRVKXCTJSA-N 0.000 description 1
- GNADVDLLGVSXLS-ULQDDVLXSA-N Pro-Phe-His Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CNC=N1)C(O)=O GNADVDLLGVSXLS-ULQDDVLXSA-N 0.000 description 1
- BUEIYHBJHCDAMI-UFYCRDLUSA-N Pro-Phe-Phe Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O BUEIYHBJHCDAMI-UFYCRDLUSA-N 0.000 description 1
- YYARMJSFDLIDFS-FKBYEOEOSA-N Pro-Phe-Trp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O YYARMJSFDLIDFS-FKBYEOEOSA-N 0.000 description 1
- KDBHVPXBQADZKY-GUBZILKMSA-N Pro-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 KDBHVPXBQADZKY-GUBZILKMSA-N 0.000 description 1
- RFWXYTJSVDUBBZ-DCAQKATOSA-N Pro-Pro-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 RFWXYTJSVDUBBZ-DCAQKATOSA-N 0.000 description 1
- GMJDSFYVTAMIBF-FXQIFTODSA-N Pro-Ser-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O GMJDSFYVTAMIBF-FXQIFTODSA-N 0.000 description 1
- BGWKULMLUIUPKY-BQBZGAKWSA-N Pro-Ser-Gly Chemical compound OC(=O)CNC(=O)[C@H](CO)NC(=O)[C@@H]1CCCN1 BGWKULMLUIUPKY-BQBZGAKWSA-N 0.000 description 1
- LNICFEXCAHIJOR-DCAQKATOSA-N Pro-Ser-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O LNICFEXCAHIJOR-DCAQKATOSA-N 0.000 description 1
- SXJOPONICMGFCR-DCAQKATOSA-N Pro-Ser-Lys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O SXJOPONICMGFCR-DCAQKATOSA-N 0.000 description 1
- QKDIHFHGHBYTKB-IHRRRGAJSA-N Pro-Ser-Phe Chemical compound N([C@@H](CO)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C(=O)[C@@H]1CCCN1 QKDIHFHGHBYTKB-IHRRRGAJSA-N 0.000 description 1
- MKGIILKDUGDRRO-FXQIFTODSA-N Pro-Ser-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H]1CCCN1 MKGIILKDUGDRRO-FXQIFTODSA-N 0.000 description 1
- UGDMQJSXSSZUKL-IHRRRGAJSA-N Pro-Ser-Tyr Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O UGDMQJSXSSZUKL-IHRRRGAJSA-N 0.000 description 1
- PRKWBYCXBBSLSK-GUBZILKMSA-N Pro-Ser-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O PRKWBYCXBBSLSK-GUBZILKMSA-N 0.000 description 1
- MDAWMJUZHBQTBO-XGEHTFHBSA-N Pro-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@@H]1CCCN1)O MDAWMJUZHBQTBO-XGEHTFHBSA-N 0.000 description 1
- HRIXMVRZRGFKNQ-HJGDQZAQSA-N Pro-Thr-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(O)=O HRIXMVRZRGFKNQ-HJGDQZAQSA-N 0.000 description 1
- IURWWZYKYPEANQ-HJGDQZAQSA-N Pro-Thr-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O IURWWZYKYPEANQ-HJGDQZAQSA-N 0.000 description 1
- RMJZWERKFFNNNS-XGEHTFHBSA-N Pro-Thr-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O RMJZWERKFFNNNS-XGEHTFHBSA-N 0.000 description 1
- CNUIHOAISPKQPY-HSHDSVGOSA-N Pro-Thr-Trp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O CNUIHOAISPKQPY-HSHDSVGOSA-N 0.000 description 1
- VVAWNPIOYXAMAL-KJEVXHAQSA-N Pro-Thr-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O VVAWNPIOYXAMAL-KJEVXHAQSA-N 0.000 description 1
- JRBWMRUPXWPEID-JYJNAYRXSA-N Pro-Trp-Cys Chemical compound N([C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CS)C(=O)O)C(=O)[C@@H]1CCCN1 JRBWMRUPXWPEID-JYJNAYRXSA-N 0.000 description 1
- VPBQDHMASPJHGY-JYJNAYRXSA-N Pro-Trp-Ser Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)N[C@@H](CO)C(=O)O VPBQDHMASPJHGY-JYJNAYRXSA-N 0.000 description 1
- DLZBBDSPTJBOOD-BPNCWPANSA-N Pro-Tyr-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O DLZBBDSPTJBOOD-BPNCWPANSA-N 0.000 description 1
- ZYJMLBCDFPIGNL-JYJNAYRXSA-N Pro-Tyr-Arg Chemical compound NC(=N)NCCC[C@H](NC(=O)[C@H](Cc1ccc(O)cc1)NC(=O)[C@@H]1CCCN1)C(O)=O ZYJMLBCDFPIGNL-JYJNAYRXSA-N 0.000 description 1
- BVTYXOFTHDXSNI-IHRRRGAJSA-N Pro-Tyr-Cys Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H]1NCCC1)C1=CC=C(O)C=C1 BVTYXOFTHDXSNI-IHRRRGAJSA-N 0.000 description 1
- UIUWGMRJTWHIJZ-ULQDDVLXSA-N Pro-Tyr-Lys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CCCCN)C(=O)O UIUWGMRJTWHIJZ-ULQDDVLXSA-N 0.000 description 1
- QKWYXRPICJEQAJ-KJEVXHAQSA-N Pro-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@@H]2CCCN2)O QKWYXRPICJEQAJ-KJEVXHAQSA-N 0.000 description 1
- XDKKMRPRRCOELJ-GUBZILKMSA-N Pro-Val-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 XDKKMRPRRCOELJ-GUBZILKMSA-N 0.000 description 1
- IMNVAOPEMFDAQD-NHCYSSNCSA-N Pro-Val-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O IMNVAOPEMFDAQD-NHCYSSNCSA-N 0.000 description 1
- DGDCSVGVWWAJRS-AVGNSLFASA-N Pro-Val-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@@H]2CCCN2 DGDCSVGVWWAJRS-AVGNSLFASA-N 0.000 description 1
- IIRBTQHFVNGPMQ-AVGNSLFASA-N Pro-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@@H]1CCCN1 IIRBTQHFVNGPMQ-AVGNSLFASA-N 0.000 description 1
- YDTUEBLEAVANFH-RCWTZXSCSA-N Pro-Val-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 YDTUEBLEAVANFH-RCWTZXSCSA-N 0.000 description 1
- FHJQROWZEJFZPO-SRVKXCTJSA-N Pro-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 FHJQROWZEJFZPO-SRVKXCTJSA-N 0.000 description 1
- 102000006010 Protein Disulfide-Isomerase Human genes 0.000 description 1
- 108010009736 Protein Hydrolysates Proteins 0.000 description 1
- 108010001267 Protein Subunits Proteins 0.000 description 1
- 102000002067 Protein Subunits Human genes 0.000 description 1
- 101100084022 Pseudomonas aeruginosa (strain ATCC 15692 / DSM 22644 / CIP 104116 / JCM 14847 / LMG 12228 / 1C / PRS 101 / PAO1) lapA gene Proteins 0.000 description 1
- 102000013009 Pyruvate Kinase Human genes 0.000 description 1
- 108020005115 Pyruvate Kinase Proteins 0.000 description 1
- 108020004511 Recombinant DNA Proteins 0.000 description 1
- 102000007056 Recombinant Fusion Proteins Human genes 0.000 description 1
- 108010008281 Recombinant Fusion Proteins Proteins 0.000 description 1
- 101710167911 Repressible acid phosphatase Proteins 0.000 description 1
- 241000813090 Rhizoctonia solani Species 0.000 description 1
- 101100111629 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) KAR2 gene Proteins 0.000 description 1
- 101100420794 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) SCJ1 gene Proteins 0.000 description 1
- 101100022918 Schizosaccharomyces pombe (strain 972 / ATCC 24843) sua1 gene Proteins 0.000 description 1
- 238000012300 Sequence Analysis Methods 0.000 description 1
- ZUGXSSFMTXKHJS-ZLUOBGJFSA-N Ser-Ala-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O ZUGXSSFMTXKHJS-ZLUOBGJFSA-N 0.000 description 1
- LVVBAKCGXXUHFO-ZLUOBGJFSA-N Ser-Ala-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O LVVBAKCGXXUHFO-ZLUOBGJFSA-N 0.000 description 1
- MMGJPDWSIOAGTH-ACZMJKKPSA-N Ser-Ala-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O MMGJPDWSIOAGTH-ACZMJKKPSA-N 0.000 description 1
- SRTCFKGBYBZRHA-ACZMJKKPSA-N Ser-Ala-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SRTCFKGBYBZRHA-ACZMJKKPSA-N 0.000 description 1
- DWUIECHTAMYEFL-XVYDVKMFSA-N Ser-Ala-His Chemical compound OC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 DWUIECHTAMYEFL-XVYDVKMFSA-N 0.000 description 1
- BRKHVZNDAOMAHX-BIIVOSGPSA-N Ser-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N BRKHVZNDAOMAHX-BIIVOSGPSA-N 0.000 description 1
- YQHZVYJAGWMHES-ZLUOBGJFSA-N Ser-Ala-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YQHZVYJAGWMHES-ZLUOBGJFSA-N 0.000 description 1
- HBZBPFLJNDXRAY-FXQIFTODSA-N Ser-Ala-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O HBZBPFLJNDXRAY-FXQIFTODSA-N 0.000 description 1
- GXXTUIUYTWGPMV-FXQIFTODSA-N Ser-Arg-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O GXXTUIUYTWGPMV-FXQIFTODSA-N 0.000 description 1
- NLQUOHDCLSFABG-GUBZILKMSA-N Ser-Arg-Arg Chemical compound NC(N)=NCCC[C@H](NC(=O)[C@H](CO)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O NLQUOHDCLSFABG-GUBZILKMSA-N 0.000 description 1
- FCRMLGJMPXCAHD-FXQIFTODSA-N Ser-Arg-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O FCRMLGJMPXCAHD-FXQIFTODSA-N 0.000 description 1
- WDXYVIIVDIDOSX-DCAQKATOSA-N Ser-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CCCN=C(N)N WDXYVIIVDIDOSX-DCAQKATOSA-N 0.000 description 1
- QFBNNYNWKYKVJO-DCAQKATOSA-N Ser-Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CCCN=C(N)N QFBNNYNWKYKVJO-DCAQKATOSA-N 0.000 description 1
- NRCJWSGXMAPYQX-LPEHRKFASA-N Ser-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CO)N)C(=O)O NRCJWSGXMAPYQX-LPEHRKFASA-N 0.000 description 1
- OYEDZGNMSBZCIM-XGEHTFHBSA-N Ser-Arg-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OYEDZGNMSBZCIM-XGEHTFHBSA-N 0.000 description 1
- OOKCGAYXSNJBGQ-ZLUOBGJFSA-N Ser-Asn-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O OOKCGAYXSNJBGQ-ZLUOBGJFSA-N 0.000 description 1
- ZXLUWXWISXIFIX-ACZMJKKPSA-N Ser-Asn-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZXLUWXWISXIFIX-ACZMJKKPSA-N 0.000 description 1
- YMEXHZTVKDAKIY-GHCJXIJMSA-N Ser-Asn-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CO)C(O)=O YMEXHZTVKDAKIY-GHCJXIJMSA-N 0.000 description 1
- DKKGAAJTDKHWOD-BIIVOSGPSA-N Ser-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CO)N)C(=O)O DKKGAAJTDKHWOD-BIIVOSGPSA-N 0.000 description 1
- TYYBJUYSTWJHGO-ZKWXMUAHSA-N Ser-Asn-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O TYYBJUYSTWJHGO-ZKWXMUAHSA-N 0.000 description 1
- KNZQGAUEYZJUSQ-ZLUOBGJFSA-N Ser-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CO)N KNZQGAUEYZJUSQ-ZLUOBGJFSA-N 0.000 description 1
- BGOWRLSWJCVYAQ-CIUDSAMLSA-N Ser-Asp-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O BGOWRLSWJCVYAQ-CIUDSAMLSA-N 0.000 description 1
- BYIROAKULFFTEK-CIUDSAMLSA-N Ser-Asp-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CO BYIROAKULFFTEK-CIUDSAMLSA-N 0.000 description 1
- OLIJLNWFEQEFDM-SRVKXCTJSA-N Ser-Asp-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 OLIJLNWFEQEFDM-SRVKXCTJSA-N 0.000 description 1
- GHPQVUYZQQGEDA-BIIVOSGPSA-N Ser-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CO)N)C(=O)O GHPQVUYZQQGEDA-BIIVOSGPSA-N 0.000 description 1
- SWSRFJZZMNLMLY-ZKWXMUAHSA-N Ser-Asp-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O SWSRFJZZMNLMLY-ZKWXMUAHSA-N 0.000 description 1
- SNNSYBWPPVAXQW-ZLUOBGJFSA-N Ser-Cys-Cys Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CS)C(=O)O)N)O SNNSYBWPPVAXQW-ZLUOBGJFSA-N 0.000 description 1
- RFBKULCUBJAQFT-BIIVOSGPSA-N Ser-Cys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CS)NC(=O)[C@H](CO)N)C(=O)O RFBKULCUBJAQFT-BIIVOSGPSA-N 0.000 description 1
- OJPHFSOMBZKQKQ-GUBZILKMSA-N Ser-Gln-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CO OJPHFSOMBZKQKQ-GUBZILKMSA-N 0.000 description 1
- FMDHKPRACUXATF-ACZMJKKPSA-N Ser-Gln-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O FMDHKPRACUXATF-ACZMJKKPSA-N 0.000 description 1
- KJMOINFQVCCSDX-XKBZYTNZSA-N Ser-Gln-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KJMOINFQVCCSDX-XKBZYTNZSA-N 0.000 description 1
- SMIDBHKWSYUBRZ-ACZMJKKPSA-N Ser-Glu-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O SMIDBHKWSYUBRZ-ACZMJKKPSA-N 0.000 description 1
- SQBLRDDJTUJDMV-ACZMJKKPSA-N Ser-Glu-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O SQBLRDDJTUJDMV-ACZMJKKPSA-N 0.000 description 1
- BRGQQXQKPUCUJQ-KBIXCLLPSA-N Ser-Glu-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BRGQQXQKPUCUJQ-KBIXCLLPSA-N 0.000 description 1
- GRSLLFZTTLBOQX-CIUDSAMLSA-N Ser-Glu-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CO)N GRSLLFZTTLBOQX-CIUDSAMLSA-N 0.000 description 1
- VQBCMLMPEWPUTB-ACZMJKKPSA-N Ser-Glu-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O VQBCMLMPEWPUTB-ACZMJKKPSA-N 0.000 description 1
- GZBKRJVCRMZAST-XKBZYTNZSA-N Ser-Glu-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GZBKRJVCRMZAST-XKBZYTNZSA-N 0.000 description 1
- MUARUIBTKQJKFY-WHFBIAKZSA-N Ser-Gly-Asp Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O MUARUIBTKQJKFY-WHFBIAKZSA-N 0.000 description 1
- IOVHBRCQOGWAQH-ZKWXMUAHSA-N Ser-Gly-Ile Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O IOVHBRCQOGWAQH-ZKWXMUAHSA-N 0.000 description 1
- WSTIOCFMWXNOCX-YUMQZZPRSA-N Ser-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CO)N WSTIOCFMWXNOCX-YUMQZZPRSA-N 0.000 description 1
- UAJAYRMZGNQILN-BQBZGAKWSA-N Ser-Gly-Met Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CCSC)C(O)=O UAJAYRMZGNQILN-BQBZGAKWSA-N 0.000 description 1
- LRHBBGDMBLFYGL-FHWLQOOXSA-N Tyr-Phe-Glu Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CCC(O)=O)C(O)=O)C1=CC=C(O)C=C1 LRHBBGDMBLFYGL-FHWLQOOXSA-N 0.000 description 1
- AUZADXNWQMBZOO-JYJNAYRXSA-N Tyr-Pro-Arg Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)C1=CC=C(O)C=C1 AUZADXNWQMBZOO-JYJNAYRXSA-N 0.000 description 1
- XJPXTYLVMUZGNW-IHRRRGAJSA-N Tyr-Pro-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O XJPXTYLVMUZGNW-IHRRRGAJSA-N 0.000 description 1
- VYTUETMEZZLJFU-IHRRRGAJSA-N Tyr-Pro-Cys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)N)C(=O)N[C@@H](CS)C(=O)O VYTUETMEZZLJFU-IHRRRGAJSA-N 0.000 description 1
- SOEGLGLDSUHWTI-STECZYCISA-N Tyr-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC1=CC=C(O)C=C1 SOEGLGLDSUHWTI-STECZYCISA-N 0.000 description 1
- VXFXIBCCVLJCJT-JYJNAYRXSA-N Tyr-Pro-Pro Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)N1CCC[C@H]1C(O)=O VXFXIBCCVLJCJT-JYJNAYRXSA-N 0.000 description 1
- SOAUMCDLIUGXJJ-SRVKXCTJSA-N Tyr-Ser-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O SOAUMCDLIUGXJJ-SRVKXCTJSA-N 0.000 description 1
- GQVZBMROTPEPIF-SRVKXCTJSA-N Tyr-Ser-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O GQVZBMROTPEPIF-SRVKXCTJSA-N 0.000 description 1
- IEWKKXZRJLTIOV-AVGNSLFASA-N Tyr-Ser-Gln Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O IEWKKXZRJLTIOV-AVGNSLFASA-N 0.000 description 1
- ZPFLBLFITJCBTP-QWRGUYRKSA-N Tyr-Ser-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)NCC(O)=O ZPFLBLFITJCBTP-QWRGUYRKSA-N 0.000 description 1
- XUIOBCQESNDTDE-FQPOAREZSA-N Tyr-Thr-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N)O XUIOBCQESNDTDE-FQPOAREZSA-N 0.000 description 1
- UUBKSZNKJUJQEJ-JRQIVUDYSA-N Tyr-Thr-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N)O UUBKSZNKJUJQEJ-JRQIVUDYSA-N 0.000 description 1
- RIVVDNTUSRVTQT-IRIUXVKKSA-N Tyr-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N)O RIVVDNTUSRVTQT-IRIUXVKKSA-N 0.000 description 1
- ZZDYJFVIKVSUFA-WLTAIBSBSA-N Tyr-Thr-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O ZZDYJFVIKVSUFA-WLTAIBSBSA-N 0.000 description 1
- WQOHKVRQDLNDIL-YJRXYDGGSA-N Tyr-Thr-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O WQOHKVRQDLNDIL-YJRXYDGGSA-N 0.000 description 1
- GAKBTSMAPGLQFA-JNPHEJMOSA-N Tyr-Thr-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 GAKBTSMAPGLQFA-JNPHEJMOSA-N 0.000 description 1
- KUXCBJFJURINGF-PXDAIIFMSA-N Tyr-Trp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CC3=CC=C(C=C3)O)N KUXCBJFJURINGF-PXDAIIFMSA-N 0.000 description 1
- KSGKJSFPWSMJHK-JNPHEJMOSA-N Tyr-Tyr-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KSGKJSFPWSMJHK-JNPHEJMOSA-N 0.000 description 1
- KLOZTPOXVVRVAQ-DZKIICNBSA-N Tyr-Val-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 KLOZTPOXVVRVAQ-DZKIICNBSA-N 0.000 description 1
- NWEGIYMHTZXVBP-JSGCOSHPSA-N Tyr-Val-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O NWEGIYMHTZXVBP-JSGCOSHPSA-N 0.000 description 1
- NVJCMGGZHOJNBU-UFYCRDLUSA-N Tyr-Val-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N NVJCMGGZHOJNBU-UFYCRDLUSA-N 0.000 description 1
- 102000003425 Tyrosinase Human genes 0.000 description 1
- 108060008724 Tyrosinase Proteins 0.000 description 1
- DDRBQONWVBDQOY-GUBZILKMSA-N Val-Ala-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O DDRBQONWVBDQOY-GUBZILKMSA-N 0.000 description 1
- UEOOXDLMQZBPFR-ZKWXMUAHSA-N Val-Ala-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N UEOOXDLMQZBPFR-ZKWXMUAHSA-N 0.000 description 1
- REJBPZVUHYNMEN-LSJOCFKGSA-N Val-Ala-His Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](C(C)C)N REJBPZVUHYNMEN-LSJOCFKGSA-N 0.000 description 1
- AZSHAZJLOZQYAY-FXQIFTODSA-N Val-Ala-Ser Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O AZSHAZJLOZQYAY-FXQIFTODSA-N 0.000 description 1
- COYSIHFOCOMGCF-UHFFFAOYSA-N Val-Arg-Gly Natural products CC(C)C(N)C(=O)NC(C(=O)NCC(O)=O)CCCN=C(N)N COYSIHFOCOMGCF-UHFFFAOYSA-N 0.000 description 1
- HNWQUBBOBKSFQV-AVGNSLFASA-N Val-Arg-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N HNWQUBBOBKSFQV-AVGNSLFASA-N 0.000 description 1
- PFNZJEPSCBAVGX-CYDGBPFRSA-N Val-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](C(C)C)N PFNZJEPSCBAVGX-CYDGBPFRSA-N 0.000 description 1
- CVUDMNSZAIZFAE-TUAOUCFPSA-N Val-Arg-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1CCC[C@@H]1C(=O)O)N CVUDMNSZAIZFAE-TUAOUCFPSA-N 0.000 description 1
- CVUDMNSZAIZFAE-UHFFFAOYSA-N Val-Arg-Pro Natural products NC(N)=NCCCC(NC(=O)C(N)C(C)C)C(=O)N1CCCC1C(O)=O CVUDMNSZAIZFAE-UHFFFAOYSA-N 0.000 description 1
- AUMNPAUHKUNHHN-BYULHYEWSA-N Val-Asn-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N AUMNPAUHKUNHHN-BYULHYEWSA-N 0.000 description 1
- UDNYEPLJTRDMEJ-RCOVLWMOSA-N Val-Asn-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)NCC(=O)O)N UDNYEPLJTRDMEJ-RCOVLWMOSA-N 0.000 description 1
- LNYOXPDEIZJDEI-NHCYSSNCSA-N Val-Asn-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](C(C)C)N LNYOXPDEIZJDEI-NHCYSSNCSA-N 0.000 description 1
- JLFKWDAZBRYCGX-ZKWXMUAHSA-N Val-Asn-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N JLFKWDAZBRYCGX-ZKWXMUAHSA-N 0.000 description 1
- ISERLACIZUGCDX-ZKWXMUAHSA-N Val-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C(C)C)N ISERLACIZUGCDX-ZKWXMUAHSA-N 0.000 description 1
- CGGVNFJRZJUVAE-BYULHYEWSA-N Val-Asp-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N CGGVNFJRZJUVAE-BYULHYEWSA-N 0.000 description 1
- KXUKIBHIVRYOIP-ZKWXMUAHSA-N Val-Asp-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N KXUKIBHIVRYOIP-ZKWXMUAHSA-N 0.000 description 1
- TZVUSFMQWPWHON-NHCYSSNCSA-N Val-Asp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C(C)C)N TZVUSFMQWPWHON-NHCYSSNCSA-N 0.000 description 1
- ZSZFTYVFQLUWBF-QXEWZRGKSA-N Val-Asp-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCSC)C(=O)O)N ZSZFTYVFQLUWBF-QXEWZRGKSA-N 0.000 description 1
- OVLIFGQSBSNGHY-KKHAAJSZSA-N Val-Asp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C(C)C)N)O OVLIFGQSBSNGHY-KKHAAJSZSA-N 0.000 description 1
- SCBITHMBEJNRHC-LSJOCFKGSA-N Val-Asp-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](C(C)C)C(=O)O)N SCBITHMBEJNRHC-LSJOCFKGSA-N 0.000 description 1
- CWSIBTLMMQLPPZ-FXQIFTODSA-N Val-Cys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](C(C)C)N CWSIBTLMMQLPPZ-FXQIFTODSA-N 0.000 description 1
- FPCIBLUVDNXPJO-XPUUQOCRSA-N Val-Cys-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CS)C(=O)NCC(O)=O FPCIBLUVDNXPJO-XPUUQOCRSA-N 0.000 description 1
- YLHLNFUXDBOAGX-DCAQKATOSA-N Val-Cys-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N YLHLNFUXDBOAGX-DCAQKATOSA-N 0.000 description 1
- XTAUQCGQFJQGEJ-NHCYSSNCSA-N Val-Gln-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N XTAUQCGQFJQGEJ-NHCYSSNCSA-N 0.000 description 1
- HURRXSNHCCSJHA-AUTRQRHGSA-N Val-Gln-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N HURRXSNHCCSJHA-AUTRQRHGSA-N 0.000 description 1
- QHFQQRKNGCXTHL-AUTRQRHGSA-N Val-Gln-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QHFQQRKNGCXTHL-AUTRQRHGSA-N 0.000 description 1
- PWRITNSESKQTPW-NRPADANISA-N Val-Gln-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N PWRITNSESKQTPW-NRPADANISA-N 0.000 description 1
- NYTKXWLZSNRILS-IFFSRLJSSA-N Val-Gln-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](C(C)C)N)O NYTKXWLZSNRILS-IFFSRLJSSA-N 0.000 description 1
- UZDHNIJRRTUKKC-DLOVCJGASA-N Val-Gln-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](C(C)C)C(=O)O)N UZDHNIJRRTUKKC-DLOVCJGASA-N 0.000 description 1
- AHHJARQXFFGOKF-NRPADANISA-N Val-Glu-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N AHHJARQXFFGOKF-NRPADANISA-N 0.000 description 1
- YDPFWRVQHFWBKI-GVXVVHGQSA-N Val-Glu-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N YDPFWRVQHFWBKI-GVXVVHGQSA-N 0.000 description 1
- ZXAGTABZUOMUDO-GVXVVHGQSA-N Val-Glu-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N ZXAGTABZUOMUDO-GVXVVHGQSA-N 0.000 description 1
- FOADDSDHGRFUOC-DZKIICNBSA-N Val-Glu-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N FOADDSDHGRFUOC-DZKIICNBSA-N 0.000 description 1
- XWYUBUYQMOUFRQ-IFFSRLJSSA-N Val-Glu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C(C)C)N)O XWYUBUYQMOUFRQ-IFFSRLJSSA-N 0.000 description 1
- PIFJAFRUVWZRKR-QMMMGPOBSA-N Val-Gly-Gly Chemical compound CC(C)[C@H]([NH3+])C(=O)NCC(=O)NCC([O-])=O PIFJAFRUVWZRKR-QMMMGPOBSA-N 0.000 description 1
- PMDOQZFYGWZSTK-LSJOCFKGSA-N Val-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)C(C)C PMDOQZFYGWZSTK-LSJOCFKGSA-N 0.000 description 1
- URIRWLJVWHYLET-ONGXEEELSA-N Val-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)C(C)C URIRWLJVWHYLET-ONGXEEELSA-N 0.000 description 1
- SYOMXKPPFZRELL-ONGXEEELSA-N Val-Gly-Lys Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)O)N SYOMXKPPFZRELL-ONGXEEELSA-N 0.000 description 1
- PTFPUAXGIKTVNN-ONGXEEELSA-N Val-His-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)NCC(=O)O)N PTFPUAXGIKTVNN-ONGXEEELSA-N 0.000 description 1
- CPGJELLYDQEDRK-NAKRPEOUSA-N Val-Ile-Ala Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](C)C(O)=O CPGJELLYDQEDRK-NAKRPEOUSA-N 0.000 description 1
- VXDSPJJQUQDCKH-UKJIMTQDSA-N Val-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N VXDSPJJQUQDCKH-UKJIMTQDSA-N 0.000 description 1
- VHRLUTIMTDOVCG-PEDHHIEDSA-N Val-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)NC(=O)[C@H](C(C)C)N VHRLUTIMTDOVCG-PEDHHIEDSA-N 0.000 description 1
- FTKXYXACXYOHND-XUXIUFHCSA-N Val-Ile-Leu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O FTKXYXACXYOHND-XUXIUFHCSA-N 0.000 description 1
- SDUBQHUJJWQTEU-XUXIUFHCSA-N Val-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](C(C)C)N SDUBQHUJJWQTEU-XUXIUFHCSA-N 0.000 description 1
- PYXQBKJPHNCTNW-CYDGBPFRSA-N Val-Ile-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](C(C)C)N PYXQBKJPHNCTNW-CYDGBPFRSA-N 0.000 description 1
- JZWZACGUZVCQPS-RNJOBUHISA-N Val-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N JZWZACGUZVCQPS-RNJOBUHISA-N 0.000 description 1
- OVBMCNDKCWAXMZ-NAKRPEOUSA-N Val-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](C(C)C)N OVBMCNDKCWAXMZ-NAKRPEOUSA-N 0.000 description 1
- FEXILLGKGGTLRI-NHCYSSNCSA-N Val-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N FEXILLGKGGTLRI-NHCYSSNCSA-N 0.000 description 1
- LYERIXUFCYVFFX-GVXVVHGQSA-N Val-Leu-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N LYERIXUFCYVFFX-GVXVVHGQSA-N 0.000 description 1
- BZOSBRIDWSSTFN-AVGNSLFASA-N Val-Leu-Met Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](C(C)C)N BZOSBRIDWSSTFN-AVGNSLFASA-N 0.000 description 1
- SYSWVVCYSXBVJG-RHYQMDGZSA-N Val-Leu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](C(C)C)N)O SYSWVVCYSXBVJG-RHYQMDGZSA-N 0.000 description 1
- RWOGENDAOGMHLX-DCAQKATOSA-N Val-Lys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C(C)C)N RWOGENDAOGMHLX-DCAQKATOSA-N 0.000 description 1
- XXWBHOWRARMUOC-NHCYSSNCSA-N Val-Lys-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)N)C(=O)O)N XXWBHOWRARMUOC-NHCYSSNCSA-N 0.000 description 1
- WBAJDGWKRIHOAC-GVXVVHGQSA-N Val-Lys-Gln Chemical compound [H]N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O WBAJDGWKRIHOAC-GVXVVHGQSA-N 0.000 description 1
- DIOSYUIWOQCXNR-ONGXEEELSA-N Val-Lys-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O DIOSYUIWOQCXNR-ONGXEEELSA-N 0.000 description 1
- YMTOEGGOCHVGEH-IHRRRGAJSA-N Val-Lys-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O YMTOEGGOCHVGEH-IHRRRGAJSA-N 0.000 description 1
- VPGCVZRRBYOGCD-AVGNSLFASA-N Val-Lys-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O VPGCVZRRBYOGCD-AVGNSLFASA-N 0.000 description 1
- OJOMXGVLFKYDKP-QXEWZRGKSA-N Val-Met-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(=O)O)C(=O)O)N OJOMXGVLFKYDKP-QXEWZRGKSA-N 0.000 description 1
- OFQGGTGZTOTLGH-NHCYSSNCSA-N Val-Met-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N OFQGGTGZTOTLGH-NHCYSSNCSA-N 0.000 description 1
- YDVDTCJGBBJGRT-GUBZILKMSA-N Val-Met-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)O)N YDVDTCJGBBJGRT-GUBZILKMSA-N 0.000 description 1
- VNGKMNPAENRGDC-JYJNAYRXSA-N Val-Phe-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC1=CC=CC=C1 VNGKMNPAENRGDC-JYJNAYRXSA-N 0.000 description 1
- FMQGYTMERWBMSI-HJWJTTGWSA-N Val-Phe-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](C(C)C)N FMQGYTMERWBMSI-HJWJTTGWSA-N 0.000 description 1
- HJSLDXZAZGFPDK-ULQDDVLXSA-N Val-Phe-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](C(C)C)N HJSLDXZAZGFPDK-ULQDDVLXSA-N 0.000 description 1
- YKNOJPJWNVHORX-UNQGMJICSA-N Val-Phe-Thr Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)O)C(O)=O)CC1=CC=CC=C1 YKNOJPJWNVHORX-UNQGMJICSA-N 0.000 description 1
- AIWLHFZYOUUJGB-UFYCRDLUSA-N Val-Phe-Tyr Chemical compound C([C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 AIWLHFZYOUUJGB-UFYCRDLUSA-N 0.000 description 1
- NHXZRXLFOBFMDM-AVGNSLFASA-N Val-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)C(C)C NHXZRXLFOBFMDM-AVGNSLFASA-N 0.000 description 1
- DOFAQXCYFQKSHT-SRVKXCTJSA-N Val-Pro-Pro Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DOFAQXCYFQKSHT-SRVKXCTJSA-N 0.000 description 1
- SSYBNWFXCFNRFN-GUBZILKMSA-N Val-Pro-Ser Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O SSYBNWFXCFNRFN-GUBZILKMSA-N 0.000 description 1
- MIKHIIQMRFYVOR-RCWTZXSCSA-N Val-Pro-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C(C)C)N)O MIKHIIQMRFYVOR-RCWTZXSCSA-N 0.000 description 1
- AJNUKMZFHXUBMK-GUBZILKMSA-N Val-Ser-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N AJNUKMZFHXUBMK-GUBZILKMSA-N 0.000 description 1
- UGFMVXRXULGLNO-XPUUQOCRSA-N Val-Ser-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O UGFMVXRXULGLNO-XPUUQOCRSA-N 0.000 description 1
- DLLRRUDLMSJTMB-GUBZILKMSA-N Val-Ser-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)O)N DLLRRUDLMSJTMB-GUBZILKMSA-N 0.000 description 1
- HWNYVQMOLCYHEA-IHRRRGAJSA-N Val-Ser-Tyr Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N HWNYVQMOLCYHEA-IHRRRGAJSA-N 0.000 description 1
- NZYNRRGJJVSSTJ-GUBZILKMSA-N Val-Ser-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O NZYNRRGJJVSSTJ-GUBZILKMSA-N 0.000 description 1
- DLRZGNXCXUGIDG-KKHAAJSZSA-N Val-Thr-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N)O DLRZGNXCXUGIDG-KKHAAJSZSA-N 0.000 description 1
- YQYFYUSYEDNLSD-YEPSODPASA-N Val-Thr-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O YQYFYUSYEDNLSD-YEPSODPASA-N 0.000 description 1
- TVGWMCTYUFBXAP-QTKMDUPCSA-N Val-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](C(C)C)N)O TVGWMCTYUFBXAP-QTKMDUPCSA-N 0.000 description 1
- USXYVSTVPHELAF-RCWTZXSCSA-N Val-Thr-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](C(C)C)N)O USXYVSTVPHELAF-RCWTZXSCSA-N 0.000 description 1
- PDDJTOSAVNRJRH-UNQGMJICSA-N Val-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](C(C)C)N)O PDDJTOSAVNRJRH-UNQGMJICSA-N 0.000 description 1
- SUGRIIAOLCDLBD-ZOBUZTSGSA-N Val-Trp-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC(=O)O)C(=O)O)N SUGRIIAOLCDLBD-ZOBUZTSGSA-N 0.000 description 1
- VTIAEOKFUJJBTC-YDHLFZDLSA-N Val-Tyr-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N VTIAEOKFUJJBTC-YDHLFZDLSA-N 0.000 description 1
- PFMSJVIPEZMKSC-DZKIICNBSA-N Val-Tyr-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N PFMSJVIPEZMKSC-DZKIICNBSA-N 0.000 description 1
- JPBGMZDTPVGGMQ-ULQDDVLXSA-N Val-Tyr-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N JPBGMZDTPVGGMQ-ULQDDVLXSA-N 0.000 description 1
- PMKQKNBISAOSRI-XHSDSOJGSA-N Val-Tyr-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N2CCC[C@@H]2C(=O)O)N PMKQKNBISAOSRI-XHSDSOJGSA-N 0.000 description 1
- IECQJCJNPJVUSB-IHRRRGAJSA-N Val-Tyr-Ser Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](CO)C(O)=O IECQJCJNPJVUSB-IHRRRGAJSA-N 0.000 description 1
- BGTDGENDNWGMDQ-KJEVXHAQSA-N Val-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](C(C)C)N)O BGTDGENDNWGMDQ-KJEVXHAQSA-N 0.000 description 1
- OWFGFHQMSBTKLX-UFYCRDLUSA-N Val-Tyr-Tyr Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)N OWFGFHQMSBTKLX-UFYCRDLUSA-N 0.000 description 1
- RTJPAGFXOWEBAI-SRVKXCTJSA-N Val-Val-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N RTJPAGFXOWEBAI-SRVKXCTJSA-N 0.000 description 1
- VVIZITNVZUAEMI-DLOVCJGASA-N Val-Val-Gln Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCC(N)=O VVIZITNVZUAEMI-DLOVCJGASA-N 0.000 description 1
- WBPFYNYTYASCQP-CYDGBPFRSA-N Val-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](C(C)C)N WBPFYNYTYASCQP-CYDGBPFRSA-N 0.000 description 1
- AOILQMZPNLUXCM-AVGNSLFASA-N Val-Val-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN AOILQMZPNLUXCM-AVGNSLFASA-N 0.000 description 1
- YKZVPMUGEJXEOR-JYJNAYRXSA-N Val-Val-Tyr Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N YKZVPMUGEJXEOR-JYJNAYRXSA-N 0.000 description 1
- 241000700605 Viruses Species 0.000 description 1
- 241000307264 Zygorhynchus Species 0.000 description 1
- 238000002835 absorbance Methods 0.000 description 1
- 239000002253 acid Substances 0.000 description 1
- 230000002378 acidificating effect Effects 0.000 description 1
- 230000004913 activation Effects 0.000 description 1
- 239000012190 activator Substances 0.000 description 1
- 108700014220 acyltransferase activity proteins Proteins 0.000 description 1
- 229960000643 adenine Drugs 0.000 description 1
- 238000001042 affinity chromatography Methods 0.000 description 1
- 239000008272 agar Substances 0.000 description 1
- 230000002776 aggregation Effects 0.000 description 1
- 238000004220 aggregation Methods 0.000 description 1
- 235000004279 alanine Nutrition 0.000 description 1
- 108010078114 alanyl-tryptophyl-alanine Proteins 0.000 description 1
- 108010050025 alpha-glutamyltryptophan Proteins 0.000 description 1
- 230000004075 alteration Effects 0.000 description 1
- BFNBIHQBYMNNAN-UHFFFAOYSA-N ammonium sulfate Chemical compound N.N.OS(O)(=O)=O BFNBIHQBYMNNAN-UHFFFAOYSA-N 0.000 description 1
- 229910052921 ammonium sulfate Inorganic materials 0.000 description 1
- 235000011130 ammonium sulphate Nutrition 0.000 description 1
- 239000004599 antimicrobial Substances 0.000 description 1
- 101150008194 argB gene Proteins 0.000 description 1
- ODKSFYDXXFIFQN-UHFFFAOYSA-N arginine Natural products OC(=O)C(N)CCCNC(N)=N ODKSFYDXXFIFQN-UHFFFAOYSA-N 0.000 description 1
- 108010013835 arginine glutamate Proteins 0.000 description 1
- 108010052670 arginyl-glutamyl-glutamic acid Proteins 0.000 description 1
- 108010086780 arginyl-glycyl-aspartyl-alanine Proteins 0.000 description 1
- 108010018691 arginyl-threonyl-arginine Proteins 0.000 description 1
- 108010059459 arginyl-threonyl-phenylalanine Proteins 0.000 description 1
- 108010084758 arginyl-tyrosyl-aspartic acid Proteins 0.000 description 1
- 150000001491 aromatic compounds Chemical class 0.000 description 1
- 235000009582 asparagine Nutrition 0.000 description 1
- 229960001230 asparagine Drugs 0.000 description 1
- 235000003704 aspartic acid Nutrition 0.000 description 1
- 230000000712 assembly Effects 0.000 description 1
- 238000000429 assembly Methods 0.000 description 1
- QVGXLLKOCUKJST-UHFFFAOYSA-N atomic oxygen Chemical compound [O] QVGXLLKOCUKJST-UHFFFAOYSA-N 0.000 description 1
- 238000000376 autoradiography Methods 0.000 description 1
- 230000008901 benefit Effects 0.000 description 1
- 102000005936 beta-Galactosidase Human genes 0.000 description 1
- 108010005774 beta-Galactosidase Proteins 0.000 description 1
- 108010055059 beta-Mannosidase Proteins 0.000 description 1
- UCMIRNVEIXFBKS-UHFFFAOYSA-N beta-alanine Chemical compound NCCC(O)=O UCMIRNVEIXFBKS-UHFFFAOYSA-N 0.000 description 1
- OQFSQFPPLPISGP-UHFFFAOYSA-N beta-carboxyaspartic acid Natural products OC(=O)C(N)C(C(O)=O)C(O)=O OQFSQFPPLPISGP-UHFFFAOYSA-N 0.000 description 1
- 102000006635 beta-lactamase Human genes 0.000 description 1
- 108010066270 beta-lactorphin Proteins 0.000 description 1
- 238000004166 bioassay Methods 0.000 description 1
- 239000012620 biological material Substances 0.000 description 1
- 230000033228 biological regulation Effects 0.000 description 1
- 101150093596 bip1 gene Proteins 0.000 description 1
- 101150049515 bla gene Proteins 0.000 description 1
- AIYUHDOJVYHVIT-UHFFFAOYSA-M caesium chloride Chemical compound [Cl-].[Cs+] AIYUHDOJVYHVIT-UHFFFAOYSA-M 0.000 description 1
- LLSDKQJKOVVTOJ-UHFFFAOYSA-L calcium chloride dihydrate Chemical compound O.O.[Cl-].[Cl-].[Ca+2] LLSDKQJKOVVTOJ-UHFFFAOYSA-L 0.000 description 1
- 229940052299 calcium chloride dihydrate Drugs 0.000 description 1
- 239000001506 calcium phosphate Substances 0.000 description 1
- 229910000389 calcium phosphate Inorganic materials 0.000 description 1
- 235000011010 calcium phosphates Nutrition 0.000 description 1
- 229940041514 candida albicans extract Drugs 0.000 description 1
- 108010089934 carbohydrase Proteins 0.000 description 1
- 230000006860 carbon metabolism Effects 0.000 description 1
- 230000015556 catabolic process Effects 0.000 description 1
- 210000002421 cell wall Anatomy 0.000 description 1
- 235000013351 cheese Nutrition 0.000 description 1
- 239000007795 chemical reaction product Substances 0.000 description 1
- 239000003153 chemical reaction reagent Substances 0.000 description 1
- 238000011098 chromatofocusing Methods 0.000 description 1
- 210000000349 chromosome Anatomy 0.000 description 1
- 238000005352 clarification Methods 0.000 description 1
- 238000010367 cloning Methods 0.000 description 1
- 230000006690 co-activation Effects 0.000 description 1
- 238000000576 coating method Methods 0.000 description 1
- MPMSMUBQXQALQI-UHFFFAOYSA-N cobalt phthalocyanine Chemical compound [Co+2].C12=CC=CC=C2C(N=C2[N-]C(C3=CC=CC=C32)=N2)=NC1=NC([C]1C=CC=CC1=1)=NC=1N=C1[C]3C=CC=CC3=C2[N-]1 MPMSMUBQXQALQI-UHFFFAOYSA-N 0.000 description 1
- 230000000295 complement effect Effects 0.000 description 1
- 239000002299 complementary DNA Substances 0.000 description 1
- 229910052802 copper Inorganic materials 0.000 description 1
- 239000010949 copper Substances 0.000 description 1
- 230000008878 coupling Effects 0.000 description 1
- 238000010168 coupling process Methods 0.000 description 1
- 238000005859 coupling reaction Methods 0.000 description 1
- 238000004132 cross linking Methods 0.000 description 1
- 239000012228 culture supernatant Substances 0.000 description 1
- 208000031513 cyst Diseases 0.000 description 1
- 235000018417 cysteine Nutrition 0.000 description 1
- XUJNEKJLAYXESH-UHFFFAOYSA-N cysteine Natural products SCC(N)C(O)=O XUJNEKJLAYXESH-UHFFFAOYSA-N 0.000 description 1
- 210000000805 cytoplasm Anatomy 0.000 description 1
- 238000006731 degradation reaction Methods 0.000 description 1
- 239000002761 deinking Substances 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- 239000003599 detergent Substances 0.000 description 1
- 230000009025 developmental regulation Effects 0.000 description 1
- 235000021245 dietary protein Nutrition 0.000 description 1
- 210000002249 digestive system Anatomy 0.000 description 1
- 108010009297 diglycyl-histidine Proteins 0.000 description 1
- 150000002016 disaccharides Chemical class 0.000 description 1
- 230000002708 enhancing effect Effects 0.000 description 1
- 230000002255 enzymatic effect Effects 0.000 description 1
- 108010093305 exopolygalacturonase Proteins 0.000 description 1
- 239000000284 extract Substances 0.000 description 1
- 230000002349 favourable effect Effects 0.000 description 1
- 235000021050 feed intake Nutrition 0.000 description 1
- 239000000835 fiber Substances 0.000 description 1
- 239000000706 filtrate Substances 0.000 description 1
- 235000013305 food Nutrition 0.000 description 1
- 235000003599 food sweetener Nutrition 0.000 description 1
- 238000001641 gel filtration chromatography Methods 0.000 description 1
- 238000010353 genetic engineering Methods 0.000 description 1
- 108010017007 glucose-regulated proteins Proteins 0.000 description 1
- 125000002791 glucosyl group Chemical group C1([C@H](O)[C@@H](O)[C@H](O)[C@H](O1)CO)* 0.000 description 1
- 235000013922 glutamic acid Nutrition 0.000 description 1
- 239000004220 glutamic acid Substances 0.000 description 1
- ZDXPYRJPNDTMRX-UHFFFAOYSA-N glutamine Natural products OC(=O)C(N)CCC(N)=O ZDXPYRJPNDTMRX-UHFFFAOYSA-N 0.000 description 1
- 108010080575 glutamyl-aspartyl-alanine Proteins 0.000 description 1
- 108010042598 glutamyl-aspartyl-glycine Proteins 0.000 description 1
- 108010073628 glutamyl-valyl-phenylalanine Proteins 0.000 description 1
- 125000003630 glycyl group Chemical group [H]N([H])C([H])([H])C(*)=O 0.000 description 1
- 108010019832 glycyl-asparaginyl-glycine Proteins 0.000 description 1
- 108010072405 glycyl-aspartyl-glycine Proteins 0.000 description 1
- 108010081985 glycyl-cystinyl-aspartic acid Proteins 0.000 description 1
- 108010084264 glycyl-glycyl-cysteine Proteins 0.000 description 1
- 108010001064 glycyl-glycyl-glycyl-glycine Proteins 0.000 description 1
- 108010026364 glycyl-glycyl-leucine Proteins 0.000 description 1
- 108010054666 glycyl-leucyl-glycyl-glycine Proteins 0.000 description 1
- 108010050475 glycyl-leucyl-tyrosine Proteins 0.000 description 1
- 108010079413 glycyl-prolyl-glutamic acid Proteins 0.000 description 1
- 108010045126 glycyl-tyrosyl-glycine Proteins 0.000 description 1
- 108010084389 glycyltryptophan Proteins 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/14—Hydrolases (3)
- C12N9/24—Hydrolases (3) acting on glycosyl compounds (3.2)
- C12N9/2402—Hydrolases (3) acting on glycosyl compounds (3.2) hydrolysing O- and S- glycosyl compounds (3.2.1)
- C12N9/2405—Glucanases
- C12N9/2408—Glucanases acting on alpha -1,4-glucosidic bonds
- C12N9/2411—Amylases
- C12N9/2428—Glucan 1,4-alpha-glucosidase (3.2.1.3), i.e. glucoamylase
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/11—DNA or RNA fragments; Modified forms thereof; Non-coding nucleic acids having a biological activity
- C12N15/62—DNA sequences coding for fusion proteins
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/80—Vectors or expression systems specially adapted for eukaryotic hosts for fungi
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/0004—Oxidoreductases (1.)
- C12N9/0055—Oxidoreductases (1.) acting on diphenols and related substances as donors (1.10)
- C12N9/0057—Oxidoreductases (1.) acting on diphenols and related substances as donors (1.10) with oxygen as acceptor (1.10.3)
- C12N9/0061—Laccase (1.10.3.2)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/14—Hydrolases (3)
- C12N9/24—Hydrolases (3) acting on glycosyl compounds (3.2)
- C12N9/2402—Hydrolases (3) acting on glycosyl compounds (3.2) hydrolysing O- and S- glycosyl compounds (3.2.1)
- C12N9/2405—Glucanases
- C12N9/2434—Glucanases acting on beta-1,4-glucosidic bonds
- C12N9/2437—Cellulases (3.2.1.4; 3.2.1.74; 3.2.1.91; 3.2.1.150)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/14—Hydrolases (3)
- C12N9/48—Hydrolases (3) acting on peptide bonds (3.4)
- C12N9/50—Proteinases, e.g. Endopeptidases (3.4.21-3.4.25)
- C12N9/58—Proteinases, e.g. Endopeptidases (3.4.21-3.4.25) derived from fungi
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P21/00—Preparation of peptides or proteins
- C12P21/02—Preparation of peptides or proteins having a known sequence of two or more amino acids, e.g. glutathione
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y110/00—Oxidoreductases acting on diphenols and related substances as donors (1.10)
- C12Y110/03—Oxidoreductases acting on diphenols and related substances as donors (1.10) with an oxygen as acceptor (1.10.3)
- C12Y110/03002—Laccase (1.10.3.2)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y302/00—Hydrolases acting on glycosyl compounds, i.e. glycosylases (3.2)
- C12Y302/01—Glycosidases, i.e. enzymes hydrolysing O- and S-glycosyl compounds (3.2.1)
- C12Y302/01091—Cellulose 1,4-beta-cellobiosidase (3.2.1.91)
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K2319/00—Fusion polypeptide
- C07K2319/01—Fusion polypeptide containing a localisation/targetting motif
- C07K2319/02—Fusion polypeptide containing a localisation/targetting motif containing a signal sequence
Landscapes
- Life Sciences & Earth Sciences (AREA)
- Health & Medical Sciences (AREA)
- Chemical & Material Sciences (AREA)
- Genetics & Genomics (AREA)
- Organic Chemistry (AREA)
- Engineering & Computer Science (AREA)
- Wood Science & Technology (AREA)
- Zoology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- General Engineering & Computer Science (AREA)
- Biomedical Technology (AREA)
- Biotechnology (AREA)
- Biochemistry (AREA)
- General Health & Medical Sciences (AREA)
- Molecular Biology (AREA)
- Microbiology (AREA)
- Medicinal Chemistry (AREA)
- Biophysics (AREA)
- Physics & Mathematics (AREA)
- Mycology (AREA)
- Plant Pathology (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Chemical Kinetics & Catalysis (AREA)
- General Chemical & Material Sciences (AREA)
- Enzymes And Modification Thereof (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
- Preparation Of Compounds By Using Micro-Organisms (AREA)
- Peptides Or Proteins (AREA)
Abstract
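The present invention provides methods and compositions for improved protein production. The method comprises the steps of: (a) introducing into a host cell a first nucleic acid sequence comprising a signal sequence operably linked to a desired protein sequence; (b) expressing the first nucleic acid sequence; (c) co-expressing a second nucleic acid sequence encoding a chaperone or foldase selected from bip1, ero1, pdi1, tig1, prp1, ppi1, ppi2, prp3, prp4, calnexin and lhs1; and (d) collecting the desired protein secreted from the host cell. The first nucleic acid sequence optionally comprises an enzyme sequence between the signal sequence and the desired protein sequence.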
Description
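Field of the Invention
The present invention provides methods and compositions for improved protein production. In some embodiments, the methods provided herein involve the use of a signal sequence operably linked to a protein. In some embodiments, the signal sequence operably linked to the protein is expressed in a host cell in combination with at least one chaperone. In some embodiments, the protein is expressed in a filamentous fungal cell. In other embodiments, the methods of the invention involve fusion of the protein to the catalytic domain of an enzyme, for example glucoamylase or CBH1. Some embodiments provide a combination of a signal sequence with one or more chaperones, chaperonins and/or foldases, and/or fusion of the protein to a catalytic protein or domain.
Background of the Invention
Host cells such as yeast, filamentous fungi and bacteria have long been used to express and secrete foreign proteins. Typically, production of these foreign proteins in yeast, filamentous fungi and bacteria involves expressing the protein and partially or completely purifying it from the host cells or from the medium in which the cells are grown. Although some proteins must be purified from the intracellular environment of the host cell, purification is greatly simplified if the protein is secreted from the cell into the medium.
Extracellular protein secretion is complex, and it is an important aspect of protein production in a variety of cell expression systems. One factor associated with protein secretion is correct protein folding. At dilute concentrations in vitro, many proteins can be reversibly unfolded and refolded, because all of the information needed to specify the compact folded structure is present in the amino acid sequence of the protein. In vivo, however, protein folding takes place in an environment containing a high concentration of many proteins, in which the intramolecular folding process competes with intermolecular aggregation reactions. These complicating factors are more pronounced in eukaryotic expression systems than in prokaryotic systems.
The first step in the eukaryotic secretory pathway is translocation of the nascent polypeptide, in extended form, across the endoplasmic reticulum (ER) membrane. Correct folding and assembly of polypeptides passing through the secretory pathway occurs in the ER. In many cases, however, proteins are heavily overexpressed but poorly secreted. Indeed, in many cases, secretion signals that should facilitate such expression do not appear to perform that function. Interactions with other proteins further complicate expression of the desired protein. These factors become even more important when attempting to express, in one species, genus or family, a protein obtained from an organism of another species, genus or family. For example, basidiomycete (Basidiomycete) proteins (e.g., laccases) are generally poorly expressed in ascomycete (Ascomycete) hosts (e.g., Trichoderma). Indeed, despite extensive work in the field of fungal expression systems, there remains a need to improve extracellular expression of desired proteins.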
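Summary of the Invention
The invention provides methods and compositions for improved protein production. The methods involve the use of a signal sequence operably linked to a desired protein, the protein being expressed in a host cell in combination with at least one chaperone. In some embodiments, the protein is expressed in a filamentous fungal cell. In other embodiments, the methods of the invention involve fusion of the desired protein to a host protein, for example the catalytic domain of glucoamylase or CBH1.
In some embodiments, the invention provides methods and compositions for increasing protein production in a filamentous fungal host (e.g., an ascomycete) by combining the use of a secretion signal with expression of a chaperone protein obtained from the same organism as the protein. In some embodiments, the protein is a non-ascomycete protein fused to a secretion signal from an ascomycete host protein. In some other embodiments, at least one chaperone protein is used to increase expression of a protein fused to the catalytic domain of an ascomycete protein.
Some embodiments provide methods for producing at least one protein in an ascomycete host cell by: introducing into the host cell a polynucleotide comprising a desired protein operably linked to a signal sequence from the same phylum, genus and/or species as the host; co-expressing a chaperone from the same phylum, genus and/or species as the protein; culturing the host cell under culture conditions suitable for expression and production of the protein; and producing the protein. The methods optionally include recovering the protein produced. Some embodiments include fusing the protein to the catalytic domain of an enzyme from an ascomycete. Other embodiments include fusing the protein to a full-length enzyme from an ascomycete. In some embodiments, the ascomycete host cell is Trichoderma. In some embodiments, the chaperone is at least one of the following: BIP1, ERO1, PDI1, TIG1, PRP1, PPI1, PPI2, PRP3, PRP4, calnexin and LHS1.
The choice of protein is not limited and can include any of the following proteins from any genus, species and/or family: laccase, glucoamylase, alpha amylase, granular starch hydrolyzing enzyme, cellulase, lipase, xylanase, cutinase, hemicellulase, protease, oxidase and combinations thereof. Some embodiments include a signal sequence from the NSP24 or CBH1 gene. In some embodiments, the chaperone gene is bip1. Embodiments of the methods can also include an ascomycete promoter. In some embodiments, the host cell and the signal sequence are from the same ascomycete host. In some embodiments, the promoter is the CBH1 promoter from Trichoderma. In some embodiments, the protein is a basidiomycete protein. In some embodiments, the host cell is an ascomycete host cell. In some embodiments, the host cell is a basidiomycete host cell and the protein is an ascomycete protein.
Some other embodiments provide methods for producing at least one protein in an ascomycete host cell by: introducing into the ascomycete host cell a polynucleotide comprising a desired protein fused to the catalytic domain of an enzyme from an ascomycete, wherein the desired protein is a basidiomycete protein; co-expressing an ascomycete chaperone; culturing the ascomycete host cell under culture conditions suitable for expression and production of the protein; and producing the protein. In some embodiments, the protein produced is recovered. In some embodiments, the protein is operably linked to an ascomycete signal sequence.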
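Brief Description of the Drawings
Figure 1 shows a schematic of the Trichoderma expression plasmid pTrex4-laccase D opt. The polynucleotide sequence is shown as SEQ ID NO:1.
Figure 2 shows a schematic of the Trichoderma expression plasmid pTrex2g-Bip1. The polynucleotide sequence is shown as SEQ ID NO:2.
Figure 3 shows a schematic of the Trichoderma expression plasmid pTrex2g-Pdi1. The polynucleotide sequence is shown as SEQ ID NO:3.
Figure 4 shows a schematic of the Ero1 sequence used in the Trichoderma expression plasmid pTrex2g-Ero1. The polynucleotide sequence is shown as SEQ ID NO:4.
Figure 5 shows a schematic of the Trichoderma expression plasmid pTrGA-laccase D opt. The polynucleotide sequence is shown as SEQ ID NO:5.
Figure 6 shows a schematic of the Trichoderma expression plasmid pKB408. The polynucleotide sequence is shown as SEQ ID NO:6.
Figure 7 shows a schematic of the Trichoderma expression plasmid pKB410. The polynucleotide sequence is shown as SEQ ID NO:7.
Figures 8-1 to 8-4 show the Trichoderma reesei NSP24 open reading frame (ORF), SEQ ID NO:8. The signal peptide is the first 20 amino acids (SEQ ID NO:9).
Figures 9-1 to 9-2 show the Trichoderma reesei CBH1 ORF (SEQ ID NO:10). The signal peptide begins at base pair 210 and ends at base pair 260 (SEQ ID NO:11). The catalytic core runs from base pair 261 to base pair 1698 (SEQ ID NO:12) and includes intron 1 (base pairs 671 to 737) and intron 2 (base pairs 1435 to 1497). The linker sequence begins at base pair 1699 and ends at base pair 1770 (SEQ ID NO:13). The CBH1 protein sequence is shown as SEQ ID NO:14.
Figure 10 illustrates the improvement in laccase production obtained by fusing the gene encoding the Cerrena unicolor (C. unicolor) laccase to the full-length Trichoderma glucoamylase. Strain #8-2 is a CBH1-laccase fusion. Strains 1066-9, 1066-13 and 1066-15 are TrGA-laccase fusions.
Figure 11 illustrates the improvement in laccase production in shake flasks obtained by fusing the gene encoding the C. unicolor laccase to the CBH1 or NSP24 signal sequence. The Y axis shows laccase activity in units/ml. The X axis shows the strain (CBH fusion only, or with a signal sequence).
Figure 12 illustrates the improvement in laccase production in a fermenter obtained by fusing the gene encoding the C. unicolor laccase to the CBH1 or NSP24 signal sequence. The Y axis shows laccase activity in units/ml. The X axis shows fermentation time in hours.
Figure 13 illustrates the improvement in laccase production provided by the CBH1 signal sequence combined with expression of the BIP1 chaperone. The Y axis shows laccase activity in units/ml. The X axis shows fermentation time in hours.
Figure 14 illustrates the improvement in laccase production at days 3, 4 and 5 obtained by co-expressing a chaperone with the C. unicolor laccase in shake flasks. The Y axis shows laccase activity in units/ml. The X axis shows the strain (KB410-13, or co-expressed with bip).
Figure 15 illustrates the improvement in laccase production obtained by fusing the gene encoding the C. unicolor laccase to the CBH1 signal sequence, catalytic domain and linker, together with co-expression of the Bip1, pdi1 or ero1 chaperone. The Y axis shows laccase activity in units/ml. The X axis shows the strain.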
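Detailed Description of the Invention
Unless otherwise indicated, the practice of the present invention involves conventional techniques commonly used in molecular biology, protein engineering, recombinant DNA technology, microbiology, cell biology, cell culture, transgenic biology, immunology and protein purification, which are within the skill of the art. Such techniques are known to those skilled in the art and are described in numerous texts and reference works. All patents, patent applications, articles and publications mentioned herein, both above and below, are expressly incorporated herein by reference.
Definitions
The term "ascomycete" refers to a fungus belonging to the phylum Ascomycota. Members of this phylum are characterized by the presence of an ascus (i.e., a specialized sac-like cell containing ascospores).
The term "basidiomycete" refers to a fungus belonging to the phylum Basidiomycota. Members of this phylum are characterized by the production of basidiospores (i.e., sexual spores borne on the outside of a specialized club-shaped end cell called a basidium).
"Protease" means a protein, or a polypeptide domain of a protein or polypeptide, that has the ability to catalyze cleavage of peptide bonds at one or more positions of a protein backbone (e.g., E.C. 3.4). Proteases can be obtained from microorganisms (e.g., fungi or bacteria), plants and/or animals.
"Acid protease" refers to a protease that has the ability to hydrolyze proteins under acidic conditions.
As used herein, the term "chaperone" or "chaperone protein" refers to a protein that facilitates protein folding by shielding unfolded regions from surrounding proteins, but that does not increase the rate of protein folding. This can include proteins, and homologs thereof, that assist the folding and glycosylation of secreted proteins in the endoplasmic reticulum (ER). Chaperones can be localized in the ER. Exemplary chaperones include Bip (GRP78), GRP94 and yeast Lhs1p, as well as proteins that help secreted proteins fold by binding to hydrophobic regions exposed in the unfolded state and thereby preventing undesirable interactions. Chaperones also include proteins involved in the translocation of proteins across the ER membrane.
As used herein, a "chaperonin" is a protein that uses ATP to help proteins fold into their native (active) state. Typically, the protein subunits assemble into a large ring assembly. For example, chaperonins act by binding non-native proteins in a central cavity and then, upon binding ATP, releasing the substrate protein into the now-encapsulated cavity, where it folds efficiently.
"Foldase protein" means a protein that catalyzes a step in protein folding and thereby increases the rate of protein folding. For example, foldases can assist the formation of disulfide bonds and help the peptide chain adjacent to proline residues adopt the correct conformation. Exemplary foldases include protein disulfide isomerase (pdi) and homologs thereof, and prolyl-peptidyl cis-trans isomerase and homologs thereof.
As used herein, "NSP24 family protease" means an enzyme that has protease activity in its native or wild-type form and belongs to the family of NSP24 proteases. NSP24 proteases are acid proteases, for example acid fungal proteases. An NSP24 protease has at least 85%, at least 90%, at least 93%, at least 95%, at least 96%, at least 97%, at least 98% or at least 99% sequence identity to the amino acid sequence of SEQ ID NO:8 or to biologically active fragments thereof.
As used herein, the term "desired protein" means a protein of interest. In this application, "desired protein" and "protein of interest" are used interchangeably. In some embodiments, the desired protein is a commercially important industrial protein. The term is intended to encompass proteins encoded by naturally occurring genes, mutated genes and/or synthetic genes. The desired protein can be a protein native to the host cell or a protein not native (heterologous) to the host cell.
As used herein, "derivative" means a protein derived from a precursor or parent protein (e.g., the native protein) by addition of one or more amino acids to the C- and/or N-terminal end, substitution of one or more amino acids at one or more different sites in the amino acid sequence, deletion of one or more amino acids at either or both ends of the protein or at one or more sites in the amino acid sequence, or insertion of one or more amino acids at one or more sites in the amino acid sequence.
The term "recombinant" refers to a polynucleotide or polypeptide that does not naturally occur in a host cell. A recombinant molecule can comprise two or more naturally occurring sequences linked together in a way that does not occur naturally.
The terms "peptide", "protein" and "polypeptide" are used interchangeably herein.
As used herein, "percent (%) sequence identity" with respect to an amino acid or nucleotide sequence is defined as the percentage of amino acid residues or nucleotides in a candidate sequence that are identical to the amino acid residues or nucleotides in a sequence of interest (e.g., the NSP24 signal peptide sequence), after aligning the sequences and introducing gaps, if necessary, to achieve the maximum percent sequence identity, and without counting any conservative substitutions as part of the sequence identity.
As used herein, the term "alpha amylase (e.g., class E.C. 3.2.1.1)" refers to enzymes that catalyze the hydrolysis of α-1,4-glucosidic linkages. These enzymes have also been described as effecting the exo- or endohydrolysis of 1,4-α-D-glucosidic linkages in polysaccharides containing 1,4-α-linked D-glucose units. Another term used to describe these enzymes is "glycogenase". Exemplary enzymes include α-1,4-glucan 4-glucanohydrolase.
As used herein, the term "glucoamylase" refers to the amyloglucosidase class of enzymes (e.g., EC 3.2.1.3, glucoamylase, 1,4-α-D-glucan glucohydrolase). These are exo-acting enzymes that release glucosyl residues from the non-reducing ends of amylose and amylopectin molecules. The enzymes also hydrolyze α-1,6 and α-1,3 linkages, although at a much slower rate than α-1,4 linkages.
The term "promoter" means a regulatory sequence involved in binding RNA polymerase to initiate transcription of a gene.
As used herein, "heterologous promoter" refers to a promoter that is associated with a gene or a purified nucleic acid but is not naturally associated with that gene or purified nucleic acid.
As used herein, a "purified preparation" or a "substantially pure preparation" of a polypeptide means a polypeptide that has been separated from the cells, other proteins, lipids or nucleic acids with which it naturally occurs.
As used herein, "homologous" refers to the sequence similarity between two or more polypeptide molecules or between two or more nucleic acid molecules. When a position in the compared sequences is occupied by the same base or amino acid monomer subunit (e.g., if a position in each of two DNA molecules is occupied by adenine), the molecules are homologous at that position. The percent homology between two sequences is a function of the number of matching or homologous positions shared by the two sequences, divided by the number of positions compared and multiplied by 100. For example, if 6 of 10 positions in two sequences are matched or homologous, the two sequences are 60% homologous. By way of example, the DNA sequences ATTGCC and TATGGC share 50% homology. Generally, the comparison is made when the two sequences are aligned to give maximum homology. Herein, the term "% homology" is used interchangeably with the term "% identity" and refers to the level of nucleic acid or amino acid sequence identity between nucleic acid sequences or amino acid sequences when aligned using a sequence alignment program.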
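The position-by-position calculation in the definition above can be sketched in a few lines. This is a minimal illustration that assumes the two sequences are already aligned to equal length without gaps; the function name `percent_identity` is hypothetical and not part of the patent.

```python
def percent_identity(seq_a: str, seq_b: str) -> float:
    """Percentage of aligned positions occupied by the same residue.

    Assumes both sequences are pre-aligned, ungapped and of equal length.
    """
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    if not seq_a:
        raise ValueError("sequences must not be empty")
    # Count positions where both sequences carry the same base or amino acid.
    matches = sum(1 for a, b in zip(seq_a.upper(), seq_b.upper()) if a == b)
    return 100.0 * matches / len(seq_a)


# Worked example from the definition: positions 3, 4 and 6 match (3 of 6).
print(percent_identity("ATTGCC", "TATGGC"))
```

In practice the comparison is made after an alignment program has placed the sequences for maximum homology, so this sketch only covers the final counting step.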
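As used herein, the term "vector" refers to a polynucleotide sequence designed to introduce nucleic acids into one or more cell types. Vectors include cloning vectors, expression vectors, shuttle vectors, plasmids, phage particles, cassettes and the like.
As used herein, "expression vector" means a DNA construct comprising a DNA sequence operably linked to a suitable control sequence capable of effecting expression of the DNA in a suitable host.
The term "expression" means the process by which a polypeptide is produced based on the nucleic acid sequence of a gene.
The term "co-expression" means expression of at least two different genes within one cell. The genes can be exogenous or endogenous. They can be integrated into, or expressed from, the same or different plasmids, and they can be expressed from the same or different promoters.
As used herein, "operably linked" means that a regulatory region (e.g., a promoter, terminator, secretion signal or enhancer region) is associated with or linked to a structural gene and controls expression of that gene. A signal sequence is operably linked to a protein if it directs the protein through the secretion system of the host cell.
As used herein, "microorganism" refers to bacteria, fungi, viruses, protozoa and other microbes or microscopic organisms.
The term "filamentous fungi" refers to all filamentous forms of the subdivision Eumycotina known in the art. These fungi are characterized by a vegetative mycelium with a cell wall composed of chitin, cellulose and other complex polysaccharides. The filamentous fungi of the present invention are morphologically, physiologically and genetically distinct from yeasts. Filamentous fungi grow vegetatively by hyphal elongation, and their carbon catabolism is obligately aerobic.
As used herein, the terms "Trichoderma" and "Trichoderma sp." refer to any fungal genus previously or currently classified as Trichoderma.
As used herein, the term "culturing" refers to growing a population of microbial cells under suitable conditions in a liquid, semi-solid or solid medium. In some embodiments, culturing is carried out in a vessel or reactor, as known in the art. In some embodiments, culturing results in the fermentative bioconversion of a starch substrate (e.g., a substrate comprising granular starch) to an end product.
"Fermentation" refers to the enzymatic and anaerobic breakdown of organic substances by microorganisms to produce simpler organic compounds. Fermentation generally occurs under anaerobic conditions, but the term is not intended to be limited solely to strict anaerobic conditions, because fermentation can also occur in the presence of oxygen.
In the context of inserting a nucleic acid sequence into a cell, the term "introduced" means "transfection", "transformation" or "transduction", and includes reference to the incorporation of a nucleic acid sequence into a eukaryotic or prokaryotic cell, wherein the nucleic acid sequence is incorporated into the genome of the cell (e.g., chromosome, plasmid, plastid or mitochondrial DNA), converted into an autonomous replicon, or transiently expressed (e.g., transfected mRNA).
As used herein, the terms "transformed", "stably transformed" and "transgenic", when used in reference to a cell, mean that the cell has a non-native nucleic acid sequence that is integrated into its genome or maintained as an episomal plasmid through multiple generations.
As used herein, the term "heterologous", when used in reference to a polypeptide or polynucleotide encoding a desired protein, means that the polypeptide or polynucleotide does not naturally occur in the host cell.
The term "homologous" or "endogenous", with reference to a polypeptide or polynucleotide encoding a desired protein, means that the polypeptide or polynucleotide occurs naturally in the host cell or is naturally expressed by the host cell.
The term "overexpression" means the process of expressing a polypeptide in a host cell at a level greater than that produced by the wild-type host cell. In some embodiments, at least one polynucleotide is introduced into the host cell. In some other embodiments, the term refers to expression of a homologous polypeptide at a concentration greater than that of the same homologous polypeptide expressed by the wild-type cell.
As used herein, one aspect of the invention features a "substantially pure" nucleic acid comprising a nucleotide sequence encoding an NSP24 signal peptide or a CBH1 signal peptide operably linked to a protein, and/or equivalents of such nucleic acids. In these embodiments, the nucleic acid is isolated from other nucleic acids and/or cellular components.
The term "equivalent" refers to nucleotide sequences encoding functionally equivalent polypeptides. Equivalent nucleotide sequences encompass sequences that differ by one or more nucleotide substitutions, additions and/or deletions, such as allelic variants. For example, in some embodiments, because of the degeneracy of the genetic code, equivalent nucleotide sequences include sequences that differ from the nucleotide sequence of SEQ ID NO:8 but result in a polypeptide that is functionally equivalent to the polypeptide sequence encoded by SEQ ID NO:8.
The present invention provides methods for producing a desired protein. The method comprises the steps of: (a) introducing into a host cell a first nucleic acid sequence comprising a signal sequence operably linked to a desired protein sequence; (b) expressing the first nucleic acid sequence; (c) co-expressing a second nucleic acid sequence encoding a chaperone or foldase selected from bip1, ero1, pdi1, tig1, prp1, ppi1, ppi2, prp3, prp4, calnexin and lhs1; and (d) collecting the desired protein secreted from the host cell.
In one embodiment, the first nucleic acid sequence further comprises an enzyme sequence between the signal sequence and the desired protein sequence. For example, the enzyme sequence is obtained from glucoamylase or from the CBH1 enzyme. In one embodiment, the enzyme sequence is a full-length enzyme sequence comprising a catalytic domain, a linker and a binding domain. In another embodiment, the enzyme sequence comprises a catalytic domain sequence that is connected to the desired protein sequence through a linker. In some embodiments, the enzyme is a host protein that is highly expressed and/or secreted in its native host.
The first nucleic acid sequence further comprises a promoter upstream of the signal sequence. In one embodiment, the promoter is native to the host cell and is not naturally associated with the desired protein sequence.
The second nucleic acid sequence is operably linked to a promoter. In one embodiment, the promoter is native to the host cell and is not naturally associated with the second nucleic acid sequence.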
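Increased Protein Expression
The present invention provides methods for producing a desired protein in a host cell. Protein yield is increased by including a secretion signal (e.g., the NSP24 signal peptide or the CBH1 signal peptide) in combination with co-expression of a chaperone, chaperonin and/or foldase protein. In some embodiments, the secretion signal is from an ascomycete host protein. In some embodiments, the desired protein is fused to the catalytic domain of an enzyme.
The present invention provides significant advantages, particularly in view of the fact that it is difficult to produce large amounts of proteins from other fungal families in ascomycete hosts. Indeed, those skilled in the art know that it is generally difficult to produce any heterologous fungal protein in a fungal or bacterial host. The present invention provides methods and compositions suitable for producing any suitable protein in a suitable fungal or bacterial host. In some embodiments, the fungal host is an ascomycete and the protein is a basidiomycete protein, while in other embodiments the fungal host is a basidiomycete and the protein is an ascomycete protein.
In some embodiments, the present invention provides methods for increasing protein expression and/or secretion in a host by using a host signal peptide in combination with co-expression of one or more chaperones or foldases from the same organism from which the protein is derived. Thus, in some embodiments, a heterologous ascomycete protein is expressed in a basidiomycete host using a basidiomycete host signal peptide and an ascomycete chaperone. In some alternative embodiments, a heterologous basidiomycete protein is expressed in an ascomycete host using an ascomycete signal peptide and an ascomycete or basidiomycete chaperone. In some embodiments, the ascomycete host is a member of the genus Trichoderma. In some embodiments, the Trichoderma is Trichoderma reesei, including various strains of T. reesei. In some alternative embodiments, the basidiomycete is a member of the genus Cerrena, including but not limited to Cerrena unicolor.
In some embodiments of the invention, expression and/or secretion of the desired protein is increased by fusing the protein to a host enzyme, in combination with exogenous co-expression of one or more chaperones from the same organism as the desired protein. Co-expression is achieved from the same plasmid or from different plasmids.
In other embodiments, expression and/or secretion of the desired protein is increased by linking the protein to the catalytic domain of a host enzyme, in combination with operably linking the protein to a host signal sequence and with exogenous co-expression of one or more chaperones, chaperonins and/or foldases, preferably from the same organism as the protein.
It is contemplated that the elements discussed in the various embodiments provided herein can be used in any suitable combination. Thus, it is not intended that the embodiments be limited to the specific descriptions provided herein, because aspects of the various embodiments can be used in combination with each other.
Signal Peptides
The particular signal peptide used in the invention is not critical, as long as the signal peptide is effective in the host. An "effective signal peptide" is provided when a signal peptide operably linked to a protein increases secretion of the protein from the host cell. In some embodiments, the signal peptide is obtained from a strongly secreted protein and/or is a strong signal peptide. A "strong signal peptide" causes the native protein to be strongly secreted by its native host. In some embodiments, the signal peptide is obtained from an organism of the same phylum as the host cell. Indeed, in some embodiments this is advantageous. In some embodiments, the signal peptide and the host cell are from the same genus, while in some other embodiments the signal peptide and the host cell are from the same species. For example, in some embodiments the host cell is an ascomycete host cell and the signal peptide is obtained from an ascomycete. In some embodiments, the host cell is Trichoderma and the signal peptide is from Trichoderma. In some embodiments, the host cell is T. reesei and the signal peptide is from T. reesei. In some embodiments, the signal peptide is a strong signal peptide. In some alternative embodiments, the host cell is a basidiomycete host cell and the signal peptide is obtained from a basidiomycete. Some examples of signal peptides useful in the invention include, but are not limited to, the CBH1 and NSP24 signal peptides. A signal peptide can function in other members of a phylum, such as the phylum Ascomycota, but in some embodiments a signal peptide finds optimal use (i.e., provides strong secretion) when it is used in the genus from which it was obtained.
As used herein, a "strongly secreted protein" is any protein that makes up a significant amount of the total protein secreted from a cell. The total protein secreted from a cell is also referred to as "extracellular protein". For example, a strongly secreted protein makes up at least about 2% of the extracellular protein, at least about 3%, at least about 4%, at least about 5%, at least about 6%, at least about 7%, at least about 8%, at least about 9%, at least about 10%, at least about 15%, at least about 20%, at least about 30%, at least about 40%, at least about 50%, at least about 60%, at least about 70%, at least about 75%, at least about 80%, at least about 85%, at least about 90%, at least about 95%, or at least about 99%. In some embodiments, a strongly secreted protein makes up at least about 5% of the extracellular protein in the culture supernatant.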
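The "strongly secreted" criterion is a simple share-of-total calculation. The sketch below is illustrative only: the function names, the milligram inputs and the default 5% cut-off (one of the thresholds listed above) are assumptions chosen for the example, not a procedure given in the patent.

```python
def extracellular_share(protein_mg: float, total_extracellular_mg: float) -> float:
    """Share of one protein in the total secreted protein, in percent."""
    if total_extracellular_mg <= 0:
        raise ValueError("total extracellular protein must be positive")
    return 100.0 * protein_mg / total_extracellular_mg


def is_strongly_secreted(protein_mg: float,
                         total_extracellular_mg: float,
                         threshold_percent: float = 5.0) -> bool:
    """True if the protein reaches the chosen share of extracellular protein.

    The 5% default mirrors the culture-supernatant embodiment; any of the
    other listed thresholds (2%, 10%, 50%, ...) can be passed instead.
    """
    share = extracellular_share(protein_mg, total_extracellular_mg)
    return share >= threshold_percent


# Hypothetical measurements: 0.6 mg of the protein in 10 mg secreted protein.
print(is_strongly_secreted(0.6, 10.0))
```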
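CBHI Signal Peptide, Linker and Catalytic Domain
Trichoderma reesei produces several cellulases, including cellobiohydrolase I (CBHI), which folds into two separate domains (i.e., the catalytic and binding domains) separated by an extended linker region. Foreign polypeptides have been secreted in T. reesei as fusions with the catalytic domain and linker region of CBHI (see, e.g., Nyyssonen et al., Bio/Technol. 11:591-595 [1993]). T. longibrachiatum also produces CBHI, which can be used for fusions and for isolating the signal peptide and/or linker. The linker can be used to connect the catalytic domain of the enzyme to the desired polypeptide. Any suitable linker finds use in the present invention, as long as it forms an extended, semi-rigid spacer between independently folded domains. Such linker regions are found in several proteins, particularly in hydrolases (e.g., bacterial and fungal cellulases and hemicellulases; see, e.g., Libby et al., Protein Engineering, Design and Selection (1994) vol. 7, 1109-1114).
As shown in Figure 9, for CBHI (SEQ ID NO:10), the signal sequence begins at base pair 210 and ends at base pair 260 (SEQ ID NO:11). The catalytic core runs from base pair 261 to base pair 1698 (SEQ ID NO:12) and includes intron 1 (base pairs 671 to 737) and intron 2 (base pairs 1435 to 1497). The linker sequence begins at base pair 1699 and ends at base pair 1770 (SEQ ID NO:13). The cellulose binding domain runs from base pair 1771 to base pair 1878. Sequence and domain information for CBHI can be found at the expasy.org website under the designation uniprot/P62694. CBHI homologs have been identified in a number of other Trichoderma species and other filamentous fungi and, where appropriate, find use in the present invention.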
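As a quick consistency check on the coordinates above, the following sketch computes the length of each CBHI region and how many whole codons it holds once the introns are removed. It assumes the base-pair positions are 1-based and inclusive, which the text does not state explicitly; the dictionary and function names are illustrative.

```python
# Coordinates as quoted for SEQ ID NO:10 (assumed 1-based, inclusive).
CBH1_REGIONS = {
    "signal_sequence": (210, 260),
    "catalytic_core": (261, 1698),
    "linker": (1699, 1770),
    "cellulose_binding_domain": (1771, 1878),
}
CBH1_INTRONS = [(671, 737), (1435, 1497)]


def span_length(start: int, end: int) -> int:
    """Number of base pairs in an inclusive interval."""
    return end - start + 1


def coding_length(start: int, end: int, introns) -> int:
    """Length of a region after removing introns that lie fully inside it."""
    total = span_length(start, end)
    for intron_start, intron_end in introns:
        if start <= intron_start and intron_end <= end:
            total -= span_length(intron_start, intron_end)
    return total


for name, (start, end) in CBH1_REGIONS.items():
    bp = coding_length(start, end, CBH1_INTRONS)
    # A remainder of 0 means the region holds a whole number of codons.
    print(f"{name}: {bp} bp, {bp // 3} codons, remainder {bp % 3}")
```

Under the inclusive-coordinate assumption, every region comes out as a multiple of three base pairs, which is what one would expect if the quoted boundaries fall on codon boundaries.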
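NSP24 Signal Peptide and Polynucleotides
The NSP24 gene was isolated from T. reesei and sequenced (see, e.g., U.S. Patent No. 7,429,476, which is incorporated herein by reference in its entirety). Sequencing of the gene identified a sequence encoding an open reading frame of 407 amino acids (SEQ ID NO:8), as shown in Figure 8. The first 20 amino acids of SEQ ID NO:8 (MQTFGAFLVSFLAASGLAAA; SEQ ID NO:9) were identified as the signal peptide. NSP24 homologs have been identified in a number of other Trichoderma species and other filamentous fungi and, where appropriate, find use in the present invention. In some embodiments, the NSP24 signal sequence is used in an ascomycete organism. In some embodiments, the sequence is used in Trichoderma species, and in some even more particular embodiments, in T. reesei.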
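To make the "first 20 amino acids" statement concrete, the sketch below joins the quoted NSP24 signal peptide to a protein sequence and splits it off again. The mature sequence `ACDEFGHIK` is a made-up placeholder, and the helper names are hypothetical; real signal peptide processing happens in the cell, not by string slicing.

```python
# Signal peptide quoted in the text as SEQ ID NO:9.
NSP24_SIGNAL_PEPTIDE = "MQTFGAFLVSFLAASGLAAA"


def fuse_signal_peptide(signal: str, mature_protein: str) -> str:
    """Build an in-silico precursor: signal peptide followed by the protein."""
    return signal + mature_protein


def split_signal_peptide(precursor: str, signal: str = NSP24_SIGNAL_PEPTIDE):
    """Return (signal, remainder) if the precursor starts with the signal."""
    if not precursor.startswith(signal):
        raise ValueError("precursor does not start with the given signal peptide")
    return signal, precursor[len(signal):]


placeholder_protein = "ACDEFGHIK"  # hypothetical sequence, for illustration only
precursor = fuse_signal_peptide(NSP24_SIGNAL_PEPTIDE, placeholder_protein)
print(len(NSP24_SIGNAL_PEPTIDE), split_signal_peptide(precursor))
```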
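Thus, the present invention provides NSP24 family protease signal peptides that can be used to secrete proteins. In some embodiments, the NSP24 signal peptide is designated the "NSP24 aspartic protease signal peptide".
Polynucleotides of the Invention
The present invention provides a variety of polynucleotides, including but not limited to polynucleotides encoding desired proteins, signal peptides, catalytic domains, linkers, chaperones, chaperonins and foldases. In some embodiments, a polynucleotide comprises at least two of the above molecules. In other embodiments, a polynucleotide of the invention comprises at least three of the above molecules.
In some embodiments, the polynucleotide encodes a protein comprising at least one amino acid substitution, for example a "conservative amino acid substitution" using L-amino acids, in which one amino acid is replaced by another, biologically similar amino acid. Conservative amino acid substitutions are those that preserve the general charge, hydrophobicity/hydrophilicity and/or steric bulk of the amino acid being substituted. Examples of conservative substitutions are substitutions within the following groups: Gly/Ala, Val/Ile/Leu, Lys/Arg, Asn/Gln, Glu/Asp, Ser/Cys/Thr, and Phe/Trp/Tyr. In some embodiments, "derivative proteins" find use in the present invention. In some such embodiments, the derivative protein differs from the "parent" protein sequence by as few as about 1 to about 10 amino acid residues, for example about 6 to about 10, as few as about 5, as few as about 4, about 3, about 2, or even 1 amino acid residue. Table 1 provides exemplary conservative amino acid substitutions recognized in the art. In other embodiments, the substitutions involve one or more non-conservative amino acid substitutions, deletions or insertions that do not abolish signal peptide activity.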
Table 1: Conservative Amino Acid Replacements
Amino acid | One-letter code | Replace with any of |
Alanine | A | D-Ala, Gly, β-Ala, L-Cys, D-Cys |
Arginine | R | D-Arg, Lys, D-Lys, homo-Arg, D-homo-Arg, Met, Ile, D-Met, D-Ile, Orn, D-Orn |
Asparagine | N | D-Asn, Asp, D-Asp, Glu, D-Glu, Gln, D-Gln |
Aspartic acid | D | D-Asp, D-Asn, Asn, Glu, D-Glu, Gln, D-Gln |
Cysteine | C | D-Cys, S-Me-Cys, Met, D-Met, Thr, D-Thr |
Glutamine | Q | D-Gln, Asn, D-Asn, Glu, D-Glu, Asp, D-Asp |
Glutamic acid | E | D-Glu, D-Asp, Asp, Asn, D-Asn, Gln, D-Gln |
Glycine | G | Ala, D-Ala, Pro, D-Pro, β-Ala, Acp |
Isoleucine | I | D-Ile, Val, D-Val, Leu, D-Leu, Met, D-Met |
Leucine | L | D-Leu, Val, D-Val, Leu, D-Leu, Met, D-Met |
Lysine | K | D-Lys, Arg, D-Arg, homo-Arg, D-homo-Arg, Met, D-Met, Ile, D-Ile, Orn, D-Orn |
Methionine | M | D-Met, S-Me-Cys, Ile, D-Ile, Leu, D-Leu, Val, D-Val |
Phenylalanine | F | D-Phe, Tyr, D-Thr, L-Dopa, His, D-His, Trp, D-Trp, trans-3,4 or 5-phenylproline, cis-3,4 or 5-phenylproline |
Proline | P | D-Pro, L-1-thiazolidine-4-carboxylic acid, D- or L-1-oxazolidine-4-carboxylic acid |
Serine | S | D-Ser, Thr, D-Thr, allo-Thr, Met, D-Met, Met(O), D-Met(O), L-Cys, D-Cys |
Threonine | T | D-Thr, Ser, D-Ser, allo-Thr, Met, D-Met, Met(O), D-Met(O), Val, D-Val |
Tyrosine | Y | D-Tyr, Phe, D-Phe, L-Dopa, His, D-His |
Valine | V | D-Val, Leu, D-Leu, Ile, D-Ile, Met, D-Met |
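The conservative-substitution groups named in the text (Gly/Ala, Val/Ile/Leu, Lys/Arg, Asn/Gln, Glu/Asp, Ser/Cys/Thr, Phe/Trp/Tyr) lend themselves to a simple lookup. The sketch below is a minimal illustration using only those seven groups, not the broader replacement lists in Table 1; the function names are hypothetical, and equal-length, pre-aligned parent and derivative sequences are assumed.

```python
# Groups of one-letter codes taken from the list in the description.
CONSERVATIVE_GROUPS = [
    set("GA"), set("VIL"), set("KR"), set("NQ"),
    set("ED"), set("SCT"), set("FWY"),
]


def is_conservative(original: str, replacement: str) -> bool:
    """True if two different residues fall in the same substitution group."""
    o, r = original.upper(), replacement.upper()
    if o == r:
        return False  # identical residues are not a substitution
    return any(o in group and r in group for group in CONSERVATIVE_GROUPS)


def classify_substitutions(parent: str, derivative: str):
    """Count (conservative, non_conservative) differences between sequences."""
    if len(parent) != len(derivative):
        raise ValueError("sequences must have the same length")
    conservative = non_conservative = 0
    for p, d in zip(parent.upper(), derivative.upper()):
        if p == d:
            continue
        if is_conservative(p, d):
            conservative += 1
        else:
            non_conservative += 1
    return conservative, non_conservative


# Hypothetical parent/derivative pair: G->A, K->R and E->D are all in-group.
print(classify_substitutions("GAVKE", "AAVRD"))
```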
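In some embodiments, the polynucleotides of the invention are native sequences. In some embodiments, the native sequences are isolated from nature, while in other embodiments they are produced by recombinant or synthetic means. The term "native sequence" specifically encompasses naturally occurring truncated or secreted forms (e.g., biologically active fragments), as well as naturally occurring variant forms of the native sequence.
Because of the degeneracy of the genetic code, more than one codon can be used to encode a particular amino acid. Thus, in some embodiments, different DNA sequences are used to encode any given polypeptide, for example a signal peptide, protein, catalytic domain and/or chaperone. Indeed, the present invention is intended to encompass different polynucleotide sequences that encode the same polypeptide.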
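Degeneracy is easy to demonstrate: two different DNA strings can translate to the same peptide. The sketch below builds the standard genetic code table and translates two hypothetical nine-base sequences; the sequences themselves are invented for illustration and are not taken from the sequence listing.

```python
# Standard genetic code, built in T, C, A, G order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
               "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding DNA sequence; '*' marks a stop codon."""
    dna = dna.upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length must be a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


# Two different (hypothetical) DNA sequences that encode the same peptide.
seq_one = "ATGGCTCTG"
seq_two = "ATGGCCTTA"
print(translate(seq_one), translate(seq_two), seq_one != seq_two)
```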
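A nucleic acid is hybridizable to another nucleic acid sequence when a single-stranded form of the nucleic acid can anneal to the other nucleic acid under appropriate conditions of temperature and solution ionic strength. Hybridization and washing conditions for hybridization under low, medium, high and very high stringency are well known in the art. In general, hybridization involves a nucleotide probe and a homologous DNA sequence that form a stable double-stranded hybrid through extensive base pairing of complementary polynucleotides. In some embodiments, the filter carrying the probe and the homologous sequence is washed in 2X sodium chloride/sodium citrate (SSC), 0.5% SDS at about 60°C (medium stringency), 65°C (medium/high stringency), 70°C (high stringency) or about 75°C (very high stringency) (see, e.g., Current Protocols in Molecular Biology, John Wiley & Sons, New York, 1989, 6.3.1-6.3.6, incorporated herein by reference).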
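The wash temperatures above map onto stringency labels. The sketch below encodes that mapping; note that the text gives approximate point temperatures rather than ranges, so treating each value as a lower bound, and the label returned below 60°C, are assumptions made purely for illustration.

```python
# (minimum wash temperature in degrees C, stringency label), highest first.
# Washes are in 2X SSC, 0.5% SDS as described in the text.
WASH_STRINGENCY = [
    (75.0, "very high"),
    (70.0, "high"),
    (65.0, "medium/high"),
    (60.0, "medium"),
]


def stringency_for_wash(temp_c: float) -> str:
    """Return the stringency label for a wash temperature.

    Assumption: each listed temperature is treated as a lower bound.
    """
    for threshold, label in WASH_STRINGENCY:
        if temp_c >= threshold:
            return label
    return "below the listed range"


for temperature in (60, 65, 70, 75):
    print(temperature, stringency_for_wash(temperature))
```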
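The present invention encompasses allelic variants, natural mutants, induced mutants, and proteins encoded by DNA that hybridizes under high or low stringency conditions to nucleic acids encoding laccase, the NSP24 signal sequence, the CBHI signal sequence, catalytic domains, chaperones, chaperonins and foldases. The nucleic acids and polypeptides of the invention include sequences that differ from the sequences disclosed herein because of sequencing errors in the disclosed sequences.
"Homology of DNA sequences" is determined by the degree of identity between two DNA sequences. The homology or "percent identity" of polypeptide sequences and/or nucleotide sequences is generally determined using computer programs. Methods for performing sequence alignments and determining sequence identity are well known to those skilled in the art, can be carried out without undue experimentation, and yield definite calculated identity values. A variety of algorithms for aligning sequences and determining sequence identity are available and known to those skilled in the art. Computer programs that use these algorithms are also available and known in the art, including but not limited to ALIGN or Megalign (DNASTAR) software, or WU-BLAST-2, GAP, BESTFIT, BLAST, FASTA, TFASTA and CLUSTAL. Those skilled in the art know how to determine suitable parameters for measuring alignment, including the algorithms needed to achieve maximum alignment over the full length of the sequences being compared. Sequence identity can be determined using the default parameters specified by the program. In some embodiments, sequence identity is determined by the Smith-Waterman homology search algorithm (Smith and Waterman, Meth. Mol. Biol., 70:173-187 [1997]) as implemented in the MSPRCH program (Oxford Molecular), using an affine gap search with the following search parameters: a gap open penalty of 12 and a gap extension penalty of 1. Pairwise amino acid comparisons can be carried out using the GAP program of the GCG sequence analysis software package of Genetics Computer Group, Inc. (Madison, WI), employing the blosum62 amino acid substitution matrix with a gap weight of 12 and a length weight of 2. With respect to the optimal alignment of two amino acid sequences, the contiguous segment of the variant amino acid sequence can have additional or deleted amino acid residues relative to the reference amino acid sequence. The contiguous segment used for comparison with the reference amino acid sequence can comprise at least about 20 contiguous amino acid residues, and can be about 30, about 40, about 50 or more amino acid residues. In some embodiments, the increased sequence identity associated with including gaps in the derivative amino acid sequence is corrected for by assigning gap penalties.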
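For readers unfamiliar with affine gap scoring, the following is a minimal sketch of a Smith-Waterman local alignment score using the gap open penalty of 12 and gap extension penalty of 1 quoted above. It is not the MSPRCH or GCG implementation: the match/mismatch scores of +5/-4 stand in for a real substitution matrix such as blosum62, and the convention that the first gapped position costs the open penalty and each further position costs the extension penalty is an assumption.

```python
def smith_waterman_affine(a: str, b: str, match: int = 5, mismatch: int = -4,
                          gap_open: int = 12, gap_extend: int = 1) -> int:
    """Best local alignment score with affine gap penalties (Gotoh recurrences)."""
    neg_inf = float("-inf")
    rows, cols = len(a) + 1, len(b) + 1
    # best[i][j]: best score of a local alignment ending at a[i-1], b[j-1]
    best = [[0] * cols for _ in range(rows)]
    # gap_a[i][j]: best score ending with a gap in `a` (residue of b unpaired)
    gap_a = [[neg_inf] * cols for _ in range(rows)]
    # gap_b[i][j]: best score ending with a gap in `b` (residue of a unpaired)
    gap_b = [[neg_inf] * cols for _ in range(rows)]
    top_score = 0
    for i in range(1, rows):
        for j in range(1, cols):
            # Open a new gap or extend an existing one.
            gap_a[i][j] = max(best[i][j - 1] - gap_open,
                              gap_a[i][j - 1] - gap_extend)
            gap_b[i][j] = max(best[i - 1][j] - gap_open,
                              gap_b[i - 1][j] - gap_extend)
            pair = match if a[i - 1] == b[j - 1] else mismatch
            diagonal = best[i - 1][j - 1] + pair
            # A local alignment may restart anywhere, hence the floor of 0.
            best[i][j] = max(0, diagonal, gap_a[i][j], gap_b[i][j])
            top_score = max(top_score, best[i][j])
    return top_score


# Hypothetical sequences: the second carries a three-base insertion.
print(smith_waterman_affine("AAAAAAGGGGGG", "AAAAAATTTGGGGGG"))
```

With these illustrative scores, bridging the three-base insertion with one gap costs 12 + 1 + 1 = 14, so opening a single long gap is favored over several short ones, which is the point of the affine scheme.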
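In some embodiments, the proteins, signal peptides, enzyme catalytic domains, chaperones, chaperonins and/or foldases encompassed by the present invention are derived from bacteria or fungi, for example filamentous fungi. Exemplary filamentous fungi include Aspergillus spp. and Trichoderma spp. One exemplary Trichoderma species is T. reesei. However, in some embodiments, the signal peptide and/or the DNA encoding the signal peptide provided by the invention is derived from another genus or species of fungus, including but not limited to Absidia spp.; Acremonium spp.; Agaricus spp.; Anaeromyces spp.; Aspergillus spp., including but not limited to A. aculeatus, A. awamori, A. flavus, A. foetidus, A. fumaricus, A. fumigatus, A. nidulans, A. niger, A. oryzae, A. terreus and A. versicolor; Aeurobasidium spp.; Cerrena spp.; Cephalosporum spp.; Cephalosporium spp.; Chaetomium spp.; Coprinus spp.; Dactyllum spp.; Dactylium spp.; Fusarium spp., including F. conglomerans, F. decemcellulare, F. javanicum, F. lini, F. oxysporum and F. solani; Gliocladium spp.; Humicola spp., including H. insolens and H. lanuginosa; Mucor spp.; Neurospora spp., including N. crassa and N. sitophila; Neocallimastix spp.; Orpinomyces spp.; Penicillium spp.; Phanerochaete spp.; Phlebia spp.; Piromyces spp.; Rhizopus spp.; Schizophyllum spp.; Stachybotrys spp.; Trametes spp.; Trichoderma spp., including T. reesei, T. reesei (longibrachiatum) and T. viride; and Zygorhynchus spp.
Catalytic Domain Fusions
Fusing the desired protein to an enzyme generally increases expression and/or secretion of the desired protein. Typically, the enzyme sequence is located upstream of the desired protein sequence in the construct. For example, the enzyme is obtained from glucoamylase or from the CBH1 enzyme. In one embodiment, the enzyme sequence is a full-length enzyme sequence comprising a catalytic domain, a linker and a binding domain. In another embodiment, the enzyme sequence comprises a catalytic domain sequence that is connected to the desired protein sequence through a linker or part of a linker. In some embodiments, the enzyme is a host protein that is highly expressed and/or secreted in its native host. For example, when the host cell is a Trichoderma host cell, the enzyme is from a Trichoderma protein. It is understood, however, that a variety of filamentous fungal proteins can be fused to the protein and used successfully in other filamentous fungal hosts.
Chaperones, Chaperonins and Foldases
The particular chaperones, chaperonins and foldases used in the methods and polynucleotides encompassed by the invention are not critical. Furthermore, where the use of chaperones, chaperonins and foldases is described herein, they are used interchangeably in the methods. For example, where a method is described as using a chaperone, it should be understood that a foldase and/or chaperonin can be used in place of, or in addition to, the chaperone. Chaperones, chaperonins and/or foldases suitable for the invention are those that are active in the host cell and that act to increase expression of the desired protein.
In some embodiments, the chaperone, chaperonin and/or foldase is from an organism of the same phylum as the protein; it can also be from the same genus, or from the same genus and species. In some embodiments, the chaperone, chaperonin and/or foldase is from a basidiomycete and the protein is a basidiomycete protein. In some embodiments, chaperones, chaperonins and/or foldases are used in combination. In some embodiments, fragments of a chaperone, chaperonin and/or foldase that have substantially the same function as the full-length chaperone, chaperonin and/or foldase can be used. Exemplary chaperones, chaperonins and/or foldases include those disclosed in U.S. Patent Application 60/919,332 and WO2008/115596, which are incorporated herein by reference in their entirety. Exemplary chaperones, chaperonins and/or foldases include, but are not limited to, BIP1, CLX1, ERO1, LHS1, PRP3, PRP4, PRP1, TIG1, PDI1, PPI1, PPI2, SCJ1, ERV2, EDEM and SIL1.
Table 2 provides the sequences of a number of chaperones, chaperonins and/or foldases that find use in the present invention.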
分子生物学——启动子和表达载体
本发明利用了重组遗传性领域的常规技术,是本领域技术人员熟知的。在一些实施方案中,本发明提供了包括基因启动子序列(例如,来自丝状真菌)的异源基因,其通常在转化宿主细胞(例如,里氏木霉细胞)用于复制和/或表达前,克隆到中间载体中。这类中间载体通常是原核载体(例如,质粒或穿梭载体)。
通常,以任何合适的启动子实现所需蛋白质的表达。在一个实施方案中,对宿主非天然的启动子与编码所需蛋白质的多核苷酸有效的连接,所述蛋白质对宿主是天然或非天然的。在另一个实施方案中,对宿主天然的启动子与编码所需蛋白质的多核苷酸有效的连接,所述蛋白质对宿主是天然或非天然的。在一些实施方案中,以异源启动子表达所需蛋白质,所述异源启动子与所需蛋白质基因不是天然相关联的。而在其他一些实施方案中,以组成型或诱导性启动子表达所需蛋白质。在一些实施方案中,在具有纤维素酶启动子(例如,cbh1启动子)的木霉属表达系统中表达所需蛋白质。
如本文中使用的,术语“启动子”指发挥指导下游基因转录的功能的核酸序列。启动子可以包括靠近转录起始位点附近的必需核酸序列,例如,在II型聚合酶启动子的情况下的TATA元件。启动子以及其它的转录和翻译调控核酸序列,统称为“调控序列”,控制基因的表达。通常,调控序列包括但不限于:启动子序列、核糖体结合位点、转录起始和终止序列、翻译起始和终止序列,以及增强子或激活子序列。调控序列通常适合宿主并被其识别,所述宿主中表达下游基因。在一些实施方案中,使用的启动子来自与宿主细胞相同的门,在其他实施方案中,启动子来自与宿主细胞相同的属,在一些实施方案中,来自与宿主细胞相同的属和种。
“组成型启动子”是在大部分环境和发育条件下有活性的启动子。“可诱导的”或“可抑制的启动子”是在环境或发育调控下有活性的启动子。在一些实施方案中,由于环境因素的改变,启动子是可诱导的或可抑制的,如本领域已知的,所述因素包括但不限于:碳、氮或其他营养的可利用性、温度、pH、渗透性、重金属的存在、抑制剂浓度、压力,或前述的组合。在其他一些实施方案中,启动子是可以被代谢因素诱导或抑制的,如本领域已知的,例如一些碳源的水平、一些能量源的水平、一些分解代谢产物的水平,或前述的组合。
合适的启动子的非限制性实例包括cbh1、cbh2、egl1、egl2、egl3、egl4、egl5、xyn1和xyn2,产黄青霉(P.chrysogenum)的可抑制的酸性磷酸酶基因(phoA)启动子(参见,Graessle等人,Appl.Environ.Microbiol.,63:753-756[1997]),葡萄糖抑制性PCK1启动子(参见,Leuker等人,Gene 192:235-240[1997]),麦芽糖诱导性、葡萄糖抑制性MRP1启动子(参见,Munro等人,Mol.Microbiol.,391414-1426[2001]),甲硫氨酸抑制性MET3启动子(参见,Liu等人,Eukary.Cell 5:638-649[2006]),pKi启动子和cpc1启动子。
在本发明的一些实施方案中,报告基因构建体的启动子是温度敏感型启动子。在一些实施方案中,升高的温度抑制温度敏感型启动子的活性。在一些实施方案中,启动子是分解代谢产物抑制性启动子。在一些实施方案中,渗透性的改变抑制启动子。在一些实施方案中,启动子可以被培养基中存在的多糖、二糖或单糖水平诱导或抑制。
可用于本发明的诱导型启动子的实例是里氏木霉的cbh1启动子,其核苷酸序列以登录号D86235储存在GenBank中。其它示例性启动子包括在编码纤维素酶的基因调控中涉及的启动子,包括但不限于:cbh2、egl1、egl2、egl3、egl5、xyn1和xyn2。
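In some embodiments of the invention, to obtain high-level expression of the cloned gene, the heterologous gene is advantageously positioned at about the same distance from the promoter as in the naturally occurring gene. As is known in the art, however, some variation in this distance can be accommodated without loss of promoter function.
In some embodiments, a native promoter modified by replacement, substitution, addition or elimination of one or more nucleotides finds use in the invention, provided the modification does not alter promoter function. Indeed, the invention is intended to encompass, but not be limited by, such alterations to the promoter.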
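The expression vector/construct typically contains a transcription unit or expression cassette that contains all the additional elements required for expression of the heterologous sequence. A typical expression cassette thus contains a promoter operably linked to the heterologous nucleic acid sequence, together with the signals required for efficient polyadenylation of the transcript, a ribosome binding site, and translation termination. Additional elements of the cassette may include enhancers and, if genomic DNA is used as the structural gene, introns with functional splice donor and acceptor sites, secretion leader peptides, leader sequences, linkers and cleavage sites.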
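The practice of the invention is not constrained by the choice of promoter in the genetic construct. As noted above, exemplary promoters are the T. reesei cbh1, cbh2, egl1, egl2, egl3, egl5, xln1 and xln2 promoters. Other promoters that find use in the invention include those from the glucoamylase gene (glaA) of Aspergillus awamori and A. niger (see Nunberg et al., Mol. Cell Biol., 4:2306-2315 [1984]) and the promoter from A. nidulans acetamidase. An exemplary promoter for vectors used in Bacillus subtilis is the AprE promoter; an exemplary promoter for Escherichia coli is the Lac promoter; an exemplary promoter for Saccharomyces cerevisiae is PGK1; an exemplary promoter for A. niger is glaA; and an exemplary promoter for T. reesei is cbhI. However, the invention is not intended to be limited to these particular cells or these particular promoters; other cells and promoters also find use in various embodiments.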
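In some embodiments, in addition to the promoter sequence, the expression cassette also contains a transcription termination region downstream of the structural gene to provide efficient termination. In some embodiments, the termination region is obtained from the same gene as the promoter sequence, while in other embodiments it is obtained from a different gene.
Although any suitable functional fungal terminator finds use in the invention, some exemplary terminators include, but are not limited to, terminators from the following genes: the A. nidulans trpC gene (see Yelton et al., Proc. Natl. Acad. Sci. USA 81:1470-1474 (1984); Mullaney et al., Molecular Genetics and Genomics [MGG] 199:37-45 (1985)), the A. awamori or A. niger glucoamylase genes (see Nunberg et al., Mol. Cell Biol., 4:2306-2315 (1984); Boel et al., EMBO J., 3:1581-1585 (1984)), the A. oryzae TAKA amylase gene, the Mucor miehei carboxyl protease gene (European Patent Publication No. 0215594) and the T. reesei CBH1 gene.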
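It is not intended that the expression vector used to transport the genetic information into the host cell be limited to any particular vector. It is contemplated that any conventional vector used for expression in eukaryotic or prokaryotic cells finds use in the invention. Standard bacterial expression vectors include, but are not limited to, bacteriophages λ and M13, as well as plasmids such as pBR322-based plasmids, pSKF and pET23D, and fusion expression systems such as MBP, GST and LacZ. In some embodiments, epitope tags (e.g., c-myc) are added to the recombinant protein to provide a convenient method of isolation. Examples of suitable expression and/or integration vectors are well known to those skilled in the art (see, e.g., Bennett and Lasure (eds.), More Gene Manipulations in Fungi, Academic Press, pp. 70-76 and pp. 396-428 (1991); U.S. Patent No. 5,874,276). Useful vectors are available from various vendors (e.g., Promega, Invitrogen), as known to those skilled in the art. Some particularly useful vectors include, but are not limited to, pBR322, pUC18, pUC100, pDON™201, pENTR™, 3Z and 4Z. However, the invention is intended to encompass other expression vectors that serve equivalent functions and that are, or become, known in the art. Thus, a wide variety of host/expression vector combinations find use in expressing the DNA sequences of the invention. In some embodiments, useful expression vectors comprise chromosomal segments, non-chromosomal and/or synthetic DNA sequences (e.g., various known derivatives of SV40) and known bacterial plasmids (e.g., plasmids from E. coli, including col E1, pCR1, pBR322, pMb9, pUC 19, pSL1180 and their derivatives), wider host range plasmids (e.g., RP4), phage DNAs (e.g., the numerous derivatives of phage λ such as NM989, and other DNA phages such as M13 and filamentous single-stranded DNA phages), and yeast plasmids (e.g., the 2μ plasmid or derivatives thereof).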
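In some embodiments, the expression vector includes a selectable marker. Examples of selectable markers include those that confer resistance to antimicrobial agents. Nutritional markers also find use in the invention, including markers known in the art such as amdS, argB and pyr4. Markers useful for the transformation of Trichoderma are known in the art (see, e.g., Finkelstein, in Biotechnology of Filamentous Fungi, Finkelstein et al. (eds.), Butterworth-Heinemann, Boston MA, Chapter 6 (1992)). In some embodiments, the expression vector also includes a replicon, a gene encoding antibiotic resistance (to permit selection of bacteria that harbor recombinant plasmids), and/or unique restriction sites in nonessential regions of the plasmid (to allow insertion of heterologous sequences). It is intended that any suitable antibiotic resistance gene find use in the invention. In some embodiments in which T. reesei is the host cell, the prokaryotic sequences are preferably chosen so that they do not interfere with replication or integration of the DNA in T. reesei.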
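In some embodiments, the expression vector includes a reporter gene alone or, optionally, as a fusion with the protein of interest. Examples of reporter genes include, but are not limited to, fluorescent reporters, color-detectable reporters (e.g., β-galactosidase) and biotinylated reporters. In some embodiments, the expressed reporter molecule is used to determine whether a signal peptide is active in the host cell: if the signal peptide is active, the reporter molecule is secreted from the cell. In some embodiments, the signal peptide is first operably linked to the reporter in order to identify secretion from a particular host cell. Alternative methods, such as those using antibodies specific to the protein of interest and/or the signal peptide, can also be used to determine whether the protein of interest is secreted.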
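In some embodiments, the transformation methods of the invention result in stable integration of all or part of the transformation vector into the genome of the host cell, for example a filamentous fungal host cell. However, transformation resulting in the maintenance of a self-replicating extrachromosomal transformation vector is also contemplated.
A variety of standard transfection methods find use in the invention to produce bacterial and filamentous fungal (e.g., Aspergillus or Trichoderma) cell lines that express large quantities of protein. Methods for introducing DNA constructs into cellulase-producing strains of Trichoderma are well known to those skilled in the art (see, e.g., Lorito et al., Curr. Genet., 24:349-356 [1993]; Goldman et al., Curr. Genet., 17:169-174 [1990]; Penttila et al., Gene 6:155-164 [1987]; U.S. Patent No. 6,022,725; U.S. Patent No. 6,268,328; Nevalainen et al., "The Molecular Biology of Trichoderma and its Application to the Expression of Both Homologous and Heterologous Genes" in Molecular Industrial Mycology, Leong and Berka (eds.), Marcel Dekker Inc., NY [1992], pp. 129-148; Yelton et al., Proc. Natl. Acad. Sci. USA 81:1470-1474 [1984]; Bajar et al., Proc. Natl. Acad. Sci. USA 88:8202-8212 [1991]; Fernandez-Abalos et al., Microbiol., 149:1623-1632 [2003]; and Brigidi et al., FEMS Microbiol. Lett., 55:135-138 [1990]).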
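However, any of the well-known procedures for introducing foreign nucleotide sequences into host cells find use in the invention. As is well known to those skilled in the art, such methods include, but are not limited to, calcium phosphate transfection, Polybrene, protoplast fusion, electroporation, biolistics, liposomes, microinjection, plasmid vectors, viral vectors, and any other well-known method for introducing cloned genomic DNA, cDNA, synthetic DNA or other foreign genetic material into a host cell. Agrobacterium-mediated transfection methods can also be used (see, e.g., U.S. Patent No. 6,255,115). It is only necessary that the particular genetic engineering procedure used be capable of successfully introducing at least one gene into a host cell capable of expressing the gene. In some embodiments, the invention provides a method of producing a protein comprising the steps of: introducing into a host cell a polynucleotide comprising an NSP24 signal peptide linked to a nucleic acid encoding the protein; culturing the host cell under culture conditions suitable for expression and production of the protein; and producing the protein. In some embodiments, the protein is secreted from the host cell. In some alternative embodiments, the invention provides a method of producing a protein comprising the steps of: introducing into a host cell a polynucleotide comprising a CBH1 signal peptide operably linked to a nucleic acid encoding the protein; culturing the host cell under culture conditions suitable for expression and production of the protein; and producing the protein. In some embodiments, the protein is secreted from the host cell.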
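After the expression vector is introduced into the host cells, the transfected or transformed cells are cultured under conditions favoring expression of the gene under the control of the gene promoter sequence. In some embodiments, large batches of transformed cells are cultured. In some embodiments, the product (i.e., the protein) is harvested from the cells and/or recovered from the culture using standard techniques.
Thus, the invention herein provides for the expression and enhanced secretion of a desired protein, whose secretion is enhanced by signal peptide sequences, fusion DNA sequences and various heterologous constructs, as well as by the expression of chaperones, chaperonins and/or foldases. The invention also provides methods for expressing and secreting high levels of such desired polypeptides.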
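Desired Proteins
The term "desired protein" means any protein of interest. The desired protein may be a protein that is native to the host cell or a protein that is non-native (heterologous) to the host cell. In some embodiments, the desired protein is a fungal protein. In some embodiments, the host is an ascomycete host and the protein is any protein other than an ascomycete protein. In some embodiments, the host is a basidiomycete host and the protein is any protein other than a basidiomycete protein. In some embodiments, the protein is any protein other than a Trichoderma protein. In some embodiments, the protein is any protein other than an Aspergillus protein.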
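The invention is not intended to be limited to any particular type of protein. Indeed, it is intended to encompass any protein of interest. Non-limiting examples of desired proteins include glucoamylases, alpha amylases, granular starch hydrolyzing enzymes, cellulases, lipases, xylanases, cutinases, hemicellulases, proteases, oxidases, laccases and combinations thereof.
In some embodiments, the glucoamylase is a wild-type glucoamylase obtained from a filamentous fungal source, such as a strain of Aspergillus, Trichoderma or Rhizopus. In other embodiments, however, the glucoamylase is a protein-engineered glucoamylase (e.g., a variant of an A. niger glucoamylase). In some other embodiments, the compositions of the invention further include at least one protease and at least one alpha amylase. In some embodiments, the alpha amylase is obtained from a bacterial source (e.g., Bacillus spp.) or a fungal source (e.g., Aspergillus spp.). In some embodiments, the compositions also include at least one protease, and/or at least one glucoamylase, and/or at least one alpha amylase. In some embodiments, the protein is a laccase, such as a laccase obtained from a basidiomycete and, in some embodiments, a laccase obtained from Cerrena (e.g., Cerrena unicolor). Commercial sources of such enzymes are known and include, for example, Genencor International, Inc. and Novozymes A/S.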
Laccases and Laccase-Related Enzymes
In a preferred embodiment, laccases and laccase-related enzymes are the desired proteins. The invention is not intended to be limited to any particular laccase, as any laccase falling within the enzyme classification EC 1.10.3.2 is encompassed. In some embodiments, the laccase is obtained from a microbial or plant source. In some embodiments, the microbial laccase is derived from bacteria or fungi (including filamentous fungi and yeasts). Although the invention is not intended to be limited to particular laccases, suitable examples include laccases derived from: Aspergillus, Neurospora (e.g., N. crassa), Podospora, Botrytis, Collybia, Cerrena, Stachybotrys, Panus (e.g., Panus rudis), Thielavia, Fomes, Lentinus, Pleurotus, Trametes (e.g., T. villosa and T. versicolor), Rhizoctonia (e.g., R. solani), Coprinus (e.g., C. plicatilis and C. cinereus), Psathyrella, Myceliophthora (e.g., M. thermophila), Scytalidium, Phlebia (e.g., P. radiata; see, e.g., WO 92/01046), Coriolus (e.g., C. hirsutus; see, e.g., JP 2-238885), Spongipellis, Polyporus, Ceriporiopsis subvermispora, Ganoderma tsunodae and Trichoderma.
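In some embodiments, the laccases include Cerrena laccases A1, B1 and D2 from strain CBS115.075; Cerrena laccases A2, B2, C, D1 and E from strain CBS154.29; and Cerrena laccase B3 from strain ATCC20013 (see, e.g., U.S. Publication No. 2008/0196173, incorporated herein by reference in its entirety). Other optimized versions of such laccases also find use in the invention.
In another embodiment, the laccase comprises the mature protein of Cerrena laccase D expressed in Trichoderma, whose amino acid sequence is shown below (SEQ ID NO:45):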
AIGPVADLHIVNKDLAPDGVQRPTVLAGGTFPGTLITGQKGDNFQLNVIDDLTDDRM
LTPTSIHWHGFFQKGTAWADGPAFVTQCPIIADNSFLYDFDVPDQAGTFWYHSHLST
QYCDGLRGAFVVYDPNDPHKDLYDVDDGGTVITLADWYHVLAQTVVGAATPDSTLI
NGLGRSQTGPADAELAVISVEHNKRYRFRLVSISCDPNFTFSVDGHNMTVIEVDGVN
TRPLTVDSIQIFAGQRYSFVLNANQPEDNYWIRAMPNIGRNTTTLDGKNAAILRYKNA
SVEEPKTVGGPAQSPLNEADLRPLVPAPVPGNAVPGGADINHRLNLTFSNGLFSINNA
SFTNPSVPALLQILSGAQNAQDLLPTGSYIGLELGKVVELVIPPLAVGGPHPFHLHGHN
FWVVRSAGSDEYNFDDAILRDVVSIGAGTDEVTIRFVTDNPGPWFLHCHIDWHLEAG
LAIVFAEGINQTAAANPTPQAWDELCPKYNGLSASQKVKPKKGTAI
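A wrapped listing such as the one above usually has to be joined into a single string before it can be passed to alignment or analysis tools. The following is a minimal sketch of that step, not part of the disclosure: the function names are hypothetical, and the toy input uses only the first eleven residues of the first two listed lines.

```python
from collections import Counter

# The 20 standard one-letter amino acid codes.
STANDARD_RESIDUES = set("ACDEFGHIKLMNPQRSTVWY")

def join_wrapped_sequence(lines):
    """Join wrapped sequence lines into one uppercase string, dropping all whitespace."""
    return "".join("".join(line.split()) for line in lines).upper()

def residue_summary(sequence):
    """Return (length, per-residue counts, sorted list of non-standard characters)."""
    counts = Counter(sequence)
    unexpected = sorted(set(sequence) - STANDARD_RESIDUES)
    return len(sequence), counts, unexpected

# Toy input (illustrative): truncated copies of the first two lines of the listing.
toy_lines = ["AIGPVADLHIV", "LTPTSIHWHGF"]
seq = join_wrapped_sequence(toy_lines)
length, counts, unexpected = residue_summary(seq)
print(length, counts["H"], unexpected)
```

A non-empty `unexpected` list would point to extraction damage (digits, punctuation or stray markup) that should be resolved before the sequence is used.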
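Host Cells
The invention provides host cells transformed with the DNA constructs and vectors described herein. In some embodiments, the invention provides host cells transformed with a DNA construct that encodes a desired protein operably linked to an NSP24 or CBHI signal peptide as described herein. In some embodiments, the invention provides DNA constructs encoding at least one desired protein, such as a protease, laccase, alpha amylase, glucoamylase, xylanase or cellulase, wherein the construct is introduced into a host cell. In some embodiments, the invention provides for the expression and/or overexpression of a protein gene under the control of a gene promoter that is functional in bacterial and/or fungal host cells.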
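It is intended that any suitable host cell find use in the invention, which is not limited to any particular host cell. In some embodiments, the host cell is a cell in which the signal peptide is active in secreting the protein of interest. For example, host cells in which a T. reesei signal peptide can be used include, but are not limited to, fungal and bacterial cells. Host cells include filamentous fungal cells, including but not limited to Trichoderma spp. (e.g., T. viride and T. reesei, the asexual morph of Hypocrea jecorina, previously classified as T. longibrachiatum), Penicillium spp., Humicola spp. (e.g., H. insolens and H. grisea), Aspergillus spp. (e.g., A. niger, A. nidulans, A. oryzae and A. awamori), Fusarium spp. (e.g., F. graminum), Neurospora spp., Hypocrea spp. and Mucor spp. Alternative host cells include, but are not limited to, Bacillus spp. (e.g., B. subtilis, B. licheniformis, B. lentus, B. stearothermophilus and B. brevis) and Streptomyces spp. (e.g., S. coelicolor and S. lividans).
A variety of methods are known in the art for determining whether a protein is secreted from the host cell or retained in the cytoplasm. It is intended that any suitable method find use in identifying host cells in which the signal sequence is active.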
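Protein Expression
The desired proteins of the invention are produced by culturing cells transformed with a vector, such as an expression vector containing a gene whose secretion is enhanced by the NSP24 or CBH1 signal peptide sequence, a foldase, a chaperonin and/or a chaperone. The invention is particularly useful for enhancing the intracellular and/or extracellular production of proteins. As known to those skilled in the art, the optimal conditions for protein production vary with the choice of host cell and of the protein to be expressed. Such conditions are readily determined by those skilled in the art.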
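In some embodiments, the protein of interest is isolated or recovered and purified after expression. A variety of methods for protein isolation and purification are known to those skilled in the art, and any suitable method finds use in the invention. For example, standard purification methods that find use in the invention include, but are not limited to, electrophoretic, molecular, immunological and chromatographic techniques, including ion exchange, hydrophobic, affinity and reverse-phase HPLC chromatography, and chromatofocusing. For example, in some embodiments, the protein of interest is purified using a standard antibody column that carries antibodies against the protein of interest. Ultrafiltration and diafiltration techniques, in conjunction with protein concentration, also find use in some embodiments. As known to those skilled in the art, the degree of purification required varies with the intended use of the protein of interest. Indeed, in some embodiments, no purification is necessary.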
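In some embodiments, the protein of interest produced by the transformed host cells, as provided by the invention, is recovered from the culture medium by conventional procedures known to those skilled in the art. These methods include, but are not limited to, separating the host cells from the medium by centrifugation or filtration. In some embodiments, the cells are disrupted and the supernatant is removed from the cellular fraction and debris. In some embodiments, after clarification, the proteinaceous components of the supernatant or filtrate are precipitated by means of a salt (e.g., ammonium sulfate). The precipitated proteins are then solubilized and, in some embodiments, purified by any suitable method, including chromatographic procedures (e.g., ion exchange chromatography, gel filtration chromatography, affinity chromatography and other procedures known in the art).
In some other embodiments, antibodies against the peptides and proteins produced using the invention are generated by immunizing an animal (e.g., a rabbit or mouse), and the anti-protein and/or anti-NSP24 signal peptide antibodies are recovered using any suitable method known in the art. In some other embodiments, monoclonal antibodies are produced using any suitable method known in the art.
In some embodiments, assays known to those skilled in the art find use in the invention, including but not limited to those described in WO 99/34011 and U.S. Patent No. 6,605,458, both of which are incorporated herein by reference in their entirety.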
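Fusions
In some embodiments, the desired protein is produced as a fusion protein. In some other embodiments, the desired protein is fused to a protein that is efficiently secreted by filamentous fungi, and to the catalytic domain of an enzyme from the same phylum, genus and/or species as the host cell used to express the fusion protein. In some embodiments, the desired protein is fused to a CBHI polypeptide or a portion thereof. In some other embodiments, the desired protein is fused to a CBHI polypeptide, or a portion thereof, that has been altered to minimize or eliminate catalytic activity. In some other embodiments, the desired protein is fused to a Trichoderma glucoamylase polypeptide or a portion thereof. In some other embodiments, the desired protein is fused to a Trichoderma glucoamylase, or a portion thereof, that has been altered to minimize or eliminate catalytic activity. In some other embodiments, the desired protein is fused to a polypeptide to enhance secretion, facilitate subsequent purification and/or enhance stability.
In general, the first, second and/or third polynucleotide in an expression host of the invention is genetically inserted or integrated into the genomic makeup of the expression host (e.g., integrated into a chromosome of the expression host). However, in some embodiments, it is extrachromosomal (e.g., present as a replicating vector within the expression host). In some other embodiments, the extrachromosomal polynucleotide is expressed under suitable selection conditions for a selection marker located on the vector.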
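Determination of Secretion Levels
As described herein, the level of secretion of the desired polypeptide in the expression host is determined using any suitable method. For example, in some embodiments, the level of secretion depends on a variety of factors (e.g., the growth conditions of the host). However, in some embodiments, the level of secretion of the desired polypeptide expressed in the host is higher than the level of secretion of the desired polypeptide expressed in the absence of the secretion-enhancing protein. In some embodiments, when the host is grown in shake flasks in batch fermentation mode, the level of secretion of the desired polypeptide (e.g., a Cerrena unicolor laccase in an expression host such as T. reesei) is at least about 1 mg/liter, about 2 mg/liter, about 3 mg/liter, about 4 mg/liter or about 5 mg/liter; or, when the host is grown in a fermentor environment with controlled pH, feed rate and the like (e.g., fed-batch fermentation), the level of secretion is at least about 50 mg/liter, about 100 mg/liter, about 150 mg/liter, about 200 mg/liter, about 250 mg/liter, about 500 mg/liter, about 1000 mg/liter, about 2000 mg/liter, about 5000 mg/liter, about 10,000 mg/liter or about 20,000 mg/liter.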
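For example, to evaluate the expression and/or secretion of a secretable polypeptide, assays are carried out at the protein level, at the RNA level, and/or by use of functional bioassays appropriate to the activity and/or production of the secreted polypeptide. Exemplary assays for analyzing the expression and/or secretion of a secreted polypeptide include, but are not limited to, Northern blotting, dot blotting (DNA or RNA analysis), RT-PCR (reverse transcriptase polymerase chain reaction), or in situ hybridization using an appropriately labeled probe (based on the nucleic acid coding sequence), as well as conventional Southern blotting and autoradiography.
In some embodiments, the production, expression and/or secretion of a secretable polypeptide is measured directly in a sample. In some embodiments, the measurement uses assays for enzyme activity, expression and/or production. In some embodiments, protein expression is evaluated by immunological methods (e.g., immunohistochemical staining of cells and/or tissue sections, or immunoassay of tissue culture medium by Western blot or ELISA). Such immunoassays can be used to evaluate the expression of a secretable polypeptide quantitatively and/or qualitatively. These methods are known to those skilled in the art. Indeed, a variety of commercially available kits and reagents are available for such methods.
In some embodiments, the invention also provides extracts (e.g., solids or supernatants) obtained from the culture medium used to grow the expression host. In some embodiments, the supernatant does not contain an appreciable amount of the expression host, while in some alternative embodiments, the supernatant does not contain any amount of the expression host.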
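Cell Culture
As known in the art, the host cells and transformed cells of the invention can be cultured in conventional nutrient media. However, in some embodiments, the culture medium for transformed host cells is modified as appropriate to activate promoters and select transformants. The specific culture conditions, such as temperature, pH and the like, are generally those used for the host cell selected for expression and will be apparent to those skilled in the art. Media and conditions for culturing host cells are known to those skilled in the art. It should be noted that, in culture, stable transformants of fungal host cells (e.g., Trichoderma cells) are generally distinguishable from unstable transformants by their faster growth rate or by the formation of circular colonies with a smooth rather than ragged outline on solid culture medium.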
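Compositions
In some embodiments, the invention provides compositions and methods for expressing a desired protein using the NSP24 or CBH1 signal sequences, constructs and vectors. In some embodiments, the invention provides compositions comprising enzymes, including but not limited to laccases, glucoamylases, alpha amylases, granular starch hydrolyzing enzymes, cellulases, lipases, phospholipases, xylanases, cutinases, hemicellulases, oxidases, peroxidases, proteases, phytases, keratinases, pullulanases, pectinases, oxidoreductases, reductases, perhydrolases, phenol oxidases, lipoxygenases, ligninases, tannases, pentosanases, β-glucanases, arabinosidases, hyaluronidases, chondroitinases, mannanases, esterases, acyl transferases, and combinations thereof.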
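Applications
The desired proteins produced by the invention find use in any application appropriate for the protein. Examples of applications for proteins (e.g., enzymes) include, but are not limited to: animal feeds for improving feed intake and feed efficiency (e.g., proteases), dietary protein hydrolysates (e.g., for individuals with an impaired digestive system), leather treatment, treatment of protein fibers (e.g., wool and silk), cleaning, protein processing (e.g., removal of bitter peptides, enhancement of food flavor, and/or production of cheese and/or cocoa), personal care products (e.g., hair compositions), sweeteners (e.g., production of high-maltose or high-fructose syrups), and fermentation and bioethanol (e.g., alpha amylases and glucoamylases for processing grain in fermentation to produce bioethanol). Examples of applications for laccases include, but are not limited to: bleaching of pulp and paper, textile bleaching, wastewater treatment, de-inking of waste paper, polymerization of aromatic compounds or proteins, radical-mediated polymerization and cross-linking reactions (e.g., paints, coatings, biomaterials), dye activation, and coupling of organic compounds. Laccases also find use in cleaning compositions, including but not limited to laundry and other detergents.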
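Examples
The following examples are offered to illustrate, but not to limit, the claimed invention. It will be apparent to those skilled in the art that various modifications of the materials and methods may be made without departing from the scope of the invention.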
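The following abbreviations are used in the experimental disclosure below: M (molar); μM (micromolar); N (normal); mol (moles); mmol (millimoles); μmol (micromoles); nmol (nanomoles); g (grams); mg (milligrams); kg (kilograms); μg and ug (micrograms); L (liters); ml (milliliters); μl and ul (microliters); cm (centimeters); mm (millimeters); μm (micrometers); nm (nanometers); ℃ (degrees Celsius); h and hr (hours); min (minutes); sec (seconds); msec (milliseconds); V (volts); xg (times gravity); °F (degrees Fahrenheit); amdS (acetamidase, a selectable marker obtained from Aspergillus nidulans); lccD (laccase); BioRad (BioRad Laboratories, Hercules, CA); Difco (Difco Laboratories, Detroit, MI); Calbiochem (Calbiochem brand, owned by EMD Chemicals Inc., San Diego, CA); Sigma (Sigma Chemical Co., St. Louis, MO); Spectronic (Spectronic Devices, Ltd., Bedfordshire, UK); Advanced Kinetics (Advanced Kinetics and Technology Solutions, Switzerland).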
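Most of the expression vectors in the Examples were produced on the pSL1180 plasmid backbone, whose sequence is available in databases under the identifier U13865. Markers such as the amdS marker, chaperones or foldases, the laccase (lccD), signal sequences, TrGA fusions and terminators were added using polylinkers and/or PCR methods known in the art.
The sites on the plasmids are identified as follows: cbh1: cellobiohydrolase; Tcbh1: terminator from cbh1; TrGA: Trichoderma glucoamylase; lccD: laccase D; amdS marker: nutritional selectable marker; pSL1180: plasmid backbone; laccase D opt: optimized version of the laccase D gene, constructed with codon usage optimized for expression in the host (Trichoderma); Pcpc-1: promoter of the cross-pathway control-1 gene from Neurospora crassa; bla: β-lactamase gene (i.e., a selectable marker from E. coli); and HphR: hygromycin resistance gene (a selectable marker from E. coli).
To construct the expression plasmids, primers were designed and used in Herculase PCR reactions (Stratagene) containing the DNA template.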
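Example 1. Construction of the expression vector pTrex4-laccase D opt
This Example describes the steps involved in constructing the expression vector pTrex4-laccase D opt. The resulting plasmid is used to express the codon-optimized laccase D gene of Cerrena unicolor using the CBH1 promoter and the CBH1 signal sequence. The expression vector contains the codon-optimized laccase D gene fused to the CBH1 (cellobiohydrolase) core/linker and expressed from the CBH1 promoter. Figure 1 provides a schematic of the Trichoderma expression plasmid. The sequence of the pTrex4-laccase D opt plasmid is shown as SEQ ID NO:1. In constructing pTrex4-laccase D opt, the following DNA segments were assembled (see Figure 1). A fragment of T. reesei genomic DNA representing the CBH1 promoter, the CBH1 signal sequence and the CBH1 core/linker was inserted into the plasmid pSL1180 vector. A codon-optimized copy of the C. unicolor laccase D gene (laccase D opt) was inserted so that it was operably linked to CBH1 at the linker region.
The CBH1 terminator from T. reesei was operably linked to the laccase D gene. The amdS gene was added as a nutritional selectable marker. The bla gene (encoding β-lactamase, a selectable marker obtained from E. coli) is present in the pSL1180 vector.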
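The assembly described in Example 1 can be recorded as an ordered list of parts, which makes it easier to compare with the later constructs. The sketch below is illustrative only: the class, the `kind` labels, the placement of the markers after the terminator, and the check function are assumptions made for this example and are not taken from Figure 1 or SEQ ID NO:1.

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class Element:
    name: str    # label used in the text of Example 1
    kind: str    # hypothetical category: promoter, signal, carrier, cds, terminator, marker
    source: str  # organism of origin as stated in the text

# Parts named in Example 1; the relative position of the two markers is an assumption.
PTREX4_LACCASE_D_OPT = [
    Element("CBH1 promoter", "promoter", "Trichoderma reesei"),
    Element("CBH1 signal sequence", "signal", "Trichoderma reesei"),
    Element("CBH1 core/linker", "carrier", "Trichoderma reesei"),
    Element("laccase D opt", "cds", "Cerrena unicolor (codon-optimized)"),
    Element("CBH1 terminator", "terminator", "Trichoderma reesei"),
    Element("amdS", "marker", "Aspergillus nidulans"),
    Element("bla", "marker", "Escherichia coli (pSL1180 backbone)"),
]

def has_valid_cassette_order(elements):
    """Return True if the promoter precedes the coding sequence, which precedes the terminator."""
    kinds = [e.kind for e in elements]
    return kinds.index("promoter") < kinds.index("cds") < kinds.index("terminator")

print(has_valid_cassette_order(PTREX4_LACCASE_D_OPT))
```

The same structure could describe the constructs of Examples 5 to 7 by swapping the carrier or signal elements, for instance replacing the CBH1 core/linker with full-length TrGA.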
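Example 2. Construction of the expression vector pTrex2g-Bip1
The pTrex2g-Bip1 plasmid was generated to express the bip1 chaperone of T. reesei. Figure 2 provides a schematic of the Trichoderma expression plasmid pTrex2g-Bip1; the sequence of the plasmid is provided as SEQ ID NO:2. In constructing pTrex2g-Bip1, the following DNA segments were assembled. A 2276 bp fragment of T. reesei bip1 was inserted into the plasmid pSL1180 vector, operably linked to the Ppki promoter (T. reesei pyruvate kinase). The Trichoderma cbh1 terminator was operably linked to the bip1 gene. The HphR selectable marker from E. coli was included for selection, operably linked to the Pcpc-1 promoter (from the cross-pathway control-1 gene of Neurospora crassa) and the trpC terminator (from the tryptophan synthesis gene C of A. nidulans).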
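Example 3. Construction of the expression vector pTrex2g-Pdi1
The pTrex2g-Pdi1 plasmid was generated to express the chaperone pdi1 in the same manner as pTrex2g-Bip1 (see Example 2), except that the T. reesei pdi1 chaperone gene (2465 bp) was inserted in place of the bip1 chaperone gene. Figure 3 provides a schematic of the Trichoderma expression plasmid pTrex2g-Pdi1; the sequence of the plasmid is provided as SEQ ID NO:3.
Example 4. Construction of the expression vector pTrex2g-Ero1
The pTrex2g-Ero1 plasmid was generated to express the chaperone ero1 in the same manner as pTrex2g-Bip1 (see Example 2), except that the T. reesei ero1 chaperone gene (2465 bp) was inserted in place of the bip1 chaperone gene. Figure 4 provides a schematic of ero1 in the Trichoderma expression plasmid pTrex2g-Ero1. The sequence of ero1 is provided as SEQ ID NO:4.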
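Example 5. Construction of the expression vector pTrGA-laccase D opt
The pTrGA-laccase D opt plasmid was generated in a manner similar to Example 1, except that pTrGA-laccase D opt expresses a fusion of the full-length T. reesei glucoamylase and the codon-optimized C. unicolor laccase D. Figure 5 provides a schematic of the Trichoderma expression plasmid pTrGA-laccase D opt; the polynucleotide sequence is shown as SEQ ID NO:5.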
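Example 6. Construction of the expression vector pKB408
The pKB408 plasmid was produced to express C. unicolor laccase D opt operably fused to the T. reesei NSP24 signal peptide. The plasmid was constructed in a manner similar to that shown in Figure 1, except that a laccase D construct operably linked to the NSP24 signal peptide was inserted in place of the laccase D opt linked to the CBH1 signal sequence, catalytic domain and linker. Figure 6 provides a schematic of the Trichoderma expression plasmid pKB408; the polynucleotide sequence is shown as SEQ ID NO:6.
Example 7. Construction of the expression vector pKB410
The pKB410 plasmid was generated as described in Example 6, except that the T. reesei CBH1 signal sequence was used in place of the NSP24 signal peptide. Figure 7 provides a schematic of the Trichoderma expression plasmid pKB410; the polynucleotide sequence is shown as SEQ ID NO:7.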
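Example 8. Transformation of T. reesei and analysis of expression
In this Example, a stable recombinant T. reesei strain derived from RL-P37 (see Sheir-Neiss and Montenecourt, Appl. Microbiol. Biotechnol., 20:46-53 (1984)), from which the cbh1, cbh2, egl1 and egl2 genes had been deleted as described by Bower et al. (see Bower et al., Carbohydrases From Trichoderma reesei and Other Micro-organisms, Royal Society of Chemistry, Cambridge, pp. 327-334 (1998)), was transformed with the plasmids of Examples 1-14, alone or in various combinations. The plasmids were transformed using biolistic and electroporation methods, as described below.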
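Biolistic transformation
The expression plasmids were verified by DNA sequencing and transformed biolistically into the Trichoderma strain. Transformation of the Trichoderma strain by the biolistic method was carried out with a PDS-1000/He Particle Delivery System (Bio-Rad) following the manufacturer's instructions (see WO 05/001036 and U.S. Patent Application Publication No. 2006/0003408). Transformants were selected and transferred onto plates of minimal medium containing acetamide (MMA) and grown for 4 days at 28-30℃. A small plug of a single colony, including spores and mycelium, was transferred into 30 ml of NREL lactose defined broth (pH 6.2) containing 1 mM copper. The cultures were grown for 5 days at 28℃. The culture broths were centrifuged and the supernatants were analyzed for laccase activity using the ABTS assay described below.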
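Electroporation
Electroporation was carried out as described in U.S. Patent Application No. 60/931,072, which is incorporated herein by reference in its entirety. The T. reesei strain was grown on Potato Dextrose agar plates (Difco) for about 10-20 days until it sporulated. The spores were washed from the surface of the plates with water and purified by filtration through Miracloth (Calbiochem). The spores were collected by centrifugation (3000xg, 12 min), washed once with ice-cold water and once with ice-cold 1.1 M sorbitol. The spore pellet was resuspended in a small volume of cold 1.1 M sorbitol, and each 100 μl of spore suspension was mixed with about 8 μg of gel-purified DNA fragment isolated from plasmid DNA (pKB408 and pKB410, Figures 6 and 7). The mixture (100 μl) was placed in an electroporation cuvette (1 mm gap) and subjected to an electric pulse using the following electroporation parameters: voltage 6000-20000 V/cm, capacitance = 25 μF, resistance = 50 Ω. After electroporation, the spores were diluted about 100-fold into a 5:1 mixture of 1.1 M sorbitol and YEPD (1% yeast extract, 2% Bacto-peptone, 2% glucose, pH 5.5), placed in shake flasks and incubated for 16-18 hours on an orbital shaker (28℃ and 200 rpm). The spores were collected once again by centrifugation, resuspended in about 10 pellet volumes of 1.1 M sorbitol and plated onto two 15 cm Petri plates containing amdS modified medium (acetamide 0.6 g/l, cesium chloride 1.68 g/l, glucose 20 g/l, potassium dihydrogen phosphate 15 g/l, magnesium sulfate heptahydrate 0.6 g/l, calcium chloride dihydrate 0.6 g/l, iron(II) sulfate 5 mg/l, zinc sulfate 1.4 mg/l, cobalt(II) chloride 1 mg/l, manganese(II) sulfate 1.6 mg/l, agar 20 g/l, pH 4.25). Transformants appeared after about 1 week of incubation at 28-30℃.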
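The electroporation parameters above are given as a field strength (V/cm), whereas many pulse generators are set in volts. The following sketch converts between the two for the 1 mm cuvette and estimates the nominal RC time constant from the stated capacitance and resistance. The function names are hypothetical, and the time constant ignores the resistance of the sample itself, so it is only an upper-level approximation.

```python
def pulse_voltage_v(field_v_per_cm, gap_mm):
    """Voltage across the cuvette for a given field strength: V = E * d (gap converted mm -> cm)."""
    return field_v_per_cm * gap_mm / 10.0

def nominal_time_constant_ms(resistance_ohm, capacitance_uf):
    """Nominal RC time constant in ms (ohm * microfarad = microseconds); sample resistance ignored."""
    return resistance_ohm * capacitance_uf / 1000.0

# Values stated in the protocol: 6000-20000 V/cm, 1 mm gap, 25 uF, 50 ohm.
low_v = pulse_voltage_v(6000, 1.0)
high_v = pulse_voltage_v(20000, 1.0)
tau_ms = nominal_time_constant_ms(50, 25)
print(low_v, high_v, tau_ms)
```

Under these assumptions the stated field range corresponds to a voltage setting of roughly 0.6 to 2 kV for the 1 mm cuvette.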
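The ABTS assay was carried out as follows. An ABTS stock solution containing 4.5 mM ABTS in water was prepared (ABTS; Sigma Cat# A-1888). A buffer containing 0.1 M sodium acetate, pH 5.0, was prepared. Then 1.5 ml of buffer and 0.2 ml of ABTS stock solution were added to a cuvette (10×4×45 mm, No./REF 67.742) and mixed well. One additional cuvette was prepared as a blank. Then 50 μl of each enzyme sample to be tested (at various dilutions) was added to the mixture.
ABTS activity was measured in a Genesys2 instrument (Spectronic) using the ABTS kinetic assay program setup (Advanced Kinetics) as follows: wavelength 420 nm, interval time (sec) 2.0, total run time (sec) 14.0, factor 1.000, low limit -000000.00, high limit 999999.00, reaction order first.
The procedure involves adding 1.5 mL of NaOAc (120 mM NaOAc buffer, pH 5.0) to a cuvette, then adding 0.2 mL of 4.5 mM ABTS (the same is then added to the blank cuvette), adding 0.05 mL of the enzyme sample to the cuvette, mixing quickly and thoroughly and, finally, measuring the change in absorbance at 420 nm every 2 seconds for a total of 14 seconds. One ABTS unit is defined as the change in A420 per minute (without taking the sample dilution into account). Calculation of ABTS U/mL: (change in A420 per minute × dilution factor).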
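The unit calculation above can be sketched in code. The text defines activity as the change in A420 per minute multiplied by the dilution factor; it does not say how the rate is extracted from the eight readings taken over 14 seconds. The sketch below assumes an ordinary least-squares slope, which is one reasonable choice rather than the instrument's documented method; the function name and the example readings are hypothetical.

```python
def abts_units_per_ml(times_s, a420, dilution_factor=1.0):
    """Estimate ABTS activity as (slope of A420 vs. time, per minute) x dilution factor.

    The slope is fitted by ordinary least squares over all readings (an assumption).
    """
    if len(times_s) != len(a420) or len(times_s) < 2:
        raise ValueError("need at least two paired time/absorbance readings")
    n = len(times_s)
    mean_t = sum(times_s) / n
    mean_a = sum(a420) / n
    sxx = sum((t - mean_t) ** 2 for t in times_s)
    sxy = sum((t - mean_t) * (a - mean_a) for t, a in zip(times_s, a420))
    slope_per_s = sxy / sxx             # change in A420 per second
    return slope_per_s * 60.0 * dilution_factor

# Hypothetical readings every 2 s for 14 s, rising by 0.005 absorbance units per second.
times = [0, 2, 4, 6, 8, 10, 12, 14]
readings = [0.100 + 0.005 * t for t in times]
activity = abts_units_per_ml(times, readings, dilution_factor=10)
print(round(activity, 3))
```

Fitting all points rather than subtracting the first reading from the last makes the estimate less sensitive to noise in any single reading, but it still assumes the reaction is linear over the 14 second window.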
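Example 9. Analysis of expression of the laccase/glucoamylase fusion gene in T. reesei transformants
The culture medium of transformants obtained and cultured as described in Example 8 was separated from the mycelium by centrifugation (16000xg, 10 min), and the supernatants were analyzed for ABTS activity. The results are shown in Figure 10. Table 3 lists the strains depicted in Figure 10. Figure 10 illustrates the improvement in laccase production obtained with a fusion encoding the C. unicolor laccase and the full-length Trichoderma glucoamylase. The results show that laccase expression was 24-29% higher when the laccase was fused to the Trichoderma glucoamylase than when it was fused to CBH1.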
Table 3. Strains used in Figure 10
Strain ID | Strain type |
#8-2 | CBH1 laccase fusion |
1066-9 | TrGA laccase fusion |
1066-13 | TrGA laccase fusion |
1066-15 | TrGA laccase fusion |
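Improvements in these Examples are reported either as a percent increase over a control strain or as a fold change. The helper functions below show both calculations; the names and the activity values are hypothetical and are not data from Figure 10.

```python
def percent_increase(sample, control):
    """Percent increase of a sample activity over a control activity."""
    if control <= 0:
        raise ValueError("control activity must be positive")
    return (sample - control) / control * 100.0

def fold_change(sample, control):
    """Ratio of sample activity to control activity."""
    if control <= 0:
        raise ValueError("control activity must be positive")
    return sample / control

# Hypothetical ABTS activities (U/mL) for a control strain and one fusion strain.
control_activity = 1.0
fusion_activity = 1.25
print(percent_increase(fusion_activity, control_activity))
print(fold_change(fusion_activity, control_activity))
```

A 25% increase and a 1.25-fold change describe the same result, so it helps to state which convention a figure uses when comparing across Examples.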
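Example 10. Analysis of laccase production using the NSP24 and CBH1 signal sequences
When the T. reesei CBH1 signal sequence was operably linked to the laccase gene, expression increased 4-5 fold in shake flasks, and 5-6 fold in 14-liter fermentors, relative to the original CBH1 fusion strain #8-2 alone, as shown by the results provided in Figures 11 (shake flasks) and 12 (fermentors). When the T. reesei NSP24 signal sequence was used, expression increased 3-4 fold in shake flasks and 4-5 fold in 14-liter fermentors. Three clones for the CBH1 signal sequence (#7, #10 and #13) and two clones for the NSP24 signal sequence (#7 and #25) were analyzed in shake flasks, with expression analyzed on day 3 (first bar), day 4 (second bar) and day 5 (third bar). A single clone of each was analyzed in 14-liter fermentors; the results are shown in Figure 12. In that figure, diamonds represent the NSP24 signal sequence operably linked to laccase D, squares represent the CBH1 signal sequence operably linked to laccase D, and triangles represent the CBH1 fusion alone.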
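Example 11. Analysis of laccase production in fermentors using the CBH1 signal sequence and bip1 co-expression
The CBH1 signal sequence plasmid (operably linked to the laccase) was co-transformed with the T. reesei Bip1 plasmid, and expression was analyzed. The results are shown in Figure 13. In Figure 13, diamonds represent the data obtained for the CBH1 signal sequence (operably linked to the laccase) plus Bip1, and squares represent the data obtained for the CBH1 signal sequence (operably linked to the laccase) alone. Figure 13 illustrates the increase in laccase production provided by the CBH1 signal sequence combined with expression of the BIP1 chaperone, which increased expression in fermentors significantly, by more than 15%.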
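Example 12. Analysis of laccase production in shake flasks using the CBH1 signal sequence and bip1 co-expression
The CBH1 signal sequence plasmid (operably linked to the laccase) was co-transformed with the T. reesei bip1 plasmid, and the transformants were grown and analyzed for laccase expression using the ABTS assay. The results are shown in Figure 14. Five different clones were analyzed at 3 days (first bar), 4 days (second bar) and 5 days (third bar). KB410-13 is a control containing only the CBH1 signal sequence plasmid. The other four clones are co-transformants of KB410-13 with bip1: E32, E9, E16 and E10. Figure 14 illustrates the increase in production of the C. unicolor laccase in shake flasks through co-expression of a chaperone. Co-expression with bip1 increased expression in shake flasks significantly (14-41%).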
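Example 13. Analysis of laccase production using the CBH1-laccase D fusion and co-expression of various chaperones
An expression plasmid having the CBH1 signal sequence, catalytic domain and linker operably linked to the laccase was co-transformed with various T. reesei chaperone plasmids (BIP1, PDI1 and ERO1). The resulting transformed cells were grown in culture and analyzed for laccase expression. Figure 15 illustrates the improvement in laccase production obtained by co-expressing the fusion with the bip1, pdi1 and ero1 chaperones, where the fusion is a fusion of the gene encoding the C. unicolor laccase with the CBH1 signal sequence, catalytic domain and linker.
All strains have the CBH1 signal sequence, catalytic domain and linker linked to laccase D. Strains 1B1, 1B12 and 1B19 carry the bip1 expression cassette; they are three independent transformants that differ in bip1 plasmid copy number and integration site. Strains 3B2 and 3B8 carry the pdi1 expression cassette; they are two independent transformants that differ in pdi1 plasmid copy number and integration site. Strains 9B6 and 9B7 carry the ero1 expression cassette; they are two independent transformants that differ in ero1 plasmid copy number and integration site. #8-2 is the control strain, without a chaperone expression cassette.
The results in Figure 15 indicate that co-expression with the bip1 chaperone gave the greatest increase in expression.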
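Example 14. Analysis of laccase production using the CBH1 signal sequence and co-expression of various chaperones
The CBH1 signal sequence plasmid (i.e., operably linked to the laccase) was co-transformed with various T. reesei chaperone plasmids (bip1, lhs1, pdi1, ppi1, ppi2, tig1, prp1 and ero1), alone or in combination. Cultures were grown in shake flasks as known in the art and analyzed for laccase expression using the ABTS assay. The clones were analyzed in triplicate. The data provided in Table 4 show that adding more than one chaperone did not increase laccase expression beyond that obtained with bip1 alone. The data in Table 4 are for three independently spore-purified samples (or clones) from the same strain.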
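Because each clone is analyzed in triplicate, a mean and a sample standard deviation are the natural summary for each strain before strains are compared. The sketch below is illustrative only; the function name and the readings are hypothetical and are not values from Table 4.

```python
from statistics import mean, stdev

def summarize_triplicate(values):
    """Return (mean, sample standard deviation) for exactly three replicate readings."""
    if len(values) != 3:
        raise ValueError("expected exactly three replicate values")
    return mean(values), stdev(values)

# Hypothetical ABTS activities (U/mL) for three spore-purified clones of one strain.
replicates = [1.0, 2.0, 3.0]
avg, sd = summarize_triplicate(replicates)
print(avg, sd)
```

With only three replicates the standard deviation is a rough indicator of spread, so small differences between chaperone combinations should be interpreted cautiously.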
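Having now described the invention, and the manner and process of making and using it, in such full, clear, concise and exact terms as to enable any person skilled in the art to which it pertains to make and use the same, it is to be understood that the foregoing describes preferred embodiments of the invention and that modifications may be made therein without departing from the scope of the invention as set forth in the claims. To particularly point out and distinctly claim the subject matter regarded as the invention, the following claims conclude this specification.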
Sequence Listing
<110>DANISCO US, INC.
BAO, Kai
WANG, Huaming
<120>Signal sequences and co-expressed chaperones for improving protein production in a host cell
<130>02718.0027.00PC00
<150>US 60/984,430
<151>2007-11-01
<160>45
<170>PatentIn version 3.5
<210>1
<211>11689
<212>DNA
<213>Trichoderma
<400>1
aagcgcctgc agccacttgc agtcccgtgg aattctcacg gtgaatgtag gccttttgta 60
gggtaggaat tgtcactcaa gcacccccaa cctccattac gcctccccca tagagttccc 120
aatcagtgag tcatggcact gttctcaaat agattgggga gaagttgact tccgcccaga 180
gctgaaggtc gcacaaccgc atgatatagg gtcggcaacg gcaaaaaagc acgtggctca 240
ccgaaaagca agatgtttgc gatctaacat ccaggaacct ggatacatcc atcatcacgc 300
acgaccactt tgatctgctg gtaaactcgt attcgcccta aaccgaagtg acgtggtaaa 360
tctacacgtg ggcccctttc ggtatactgc gtgtgtcttc tctaggtgcc attcttttcc 420
cttcctctag tgttgaattg tttgtgttgg agtccgagct gtaactacct ctgaatctct 480
ggagaatggt ggactaacga ctaccgtgca cctgcatcat gtatataata gtgatcctga 540
gaaggggggt ttggagcaat gtgggacttt gatggtcatc aaacaaagaa cgaagacgcc 600
tcttttgcaa agttttgttt cggctacggt gaagaactgg atacttgttg tgtcttctgt 660
gtatttttgt ggcaacaaga ggccagagac aatctattca aacaccaagc ttgctctttt 720
gagctacaag aacctgtggg gtatatatct agagttgtga agtcggtaat cccgctgtat 780
agtaatacga gtcgcatcta aatactccga agctgctgcg aacccggaga atcgagatgt 840
gctggaaagc ttctagcgag cggctaaatt agcatgaaag gctatgagaa attctggaga 900
cggcttgttg aatcatggcg ttccattctt cgacaagcaa agcgttccgt cgcagtagca 960
ggcactcatt cccgaaaaaa ctcggagatt cctaagtagc gatggaaccg gaataatata 1020
ataggcaata cattgagttg cctcgacggt tgcaatgcag gggtactgag cttggacata 1080
actgttccgt accccacctc ttctcaacct ttggcgtttc cctgattcag cgtacccgta 1140
caagtcgtaa tcactattaa cccagactga ccggacgtgt tttgcccttc atttggagaa 1200
ataatgtcat tgcgatgtgt aatttgcctg cttgaccgac tggggctgtt cgaagcccga 1260
atgtaggatt gttatccgaa ctctgctcgt agaggcatgt tgtgaatctg tgtcgggcag 1320
gacacgcctc gaaggttcac ggcaagggaa accaccgata gcagtgtcta gtagcaacct 1380
gtaaagccgc aatgcagcat cactggaaaa tacaaaccaa tggctaaaag tacataagtt 1440
aatgcctaaa gaagtcatat accagcggct aataattgta caatcaagtg gctaaacgta 1500
ccgtaatttg ccaacggctt gtggggttgc agaagcaacg gcaaagcccc acttccccac 1560
gtttgtttct tcactcagtc caatctcagc tggtgatccc ccaattgggt cgcttgtttg 1620
ttccggtgaa gtgaaagaag acagaggtaa gaatgtctga ctcggagcgt tttgcataca 1680
accaagggca gtgatggaag acagtgaaat gttgacattc aaggagtatt tagccaggga 1740
tgcttgagtg tatcgtgtaa ggaggtttgt ctgccgatac gacgaatact gtatagtcac 1800
ttctgatgaa gtggtccata ttgaaatgta agtcggcact gaacaggcaa aagattgagt 1860
tgaaactgcc taagatctcg ggccctcggg ccttcggcct ttgggtgtac atgtttgtgc 1920
tccgggcaaa tgcaaagtgt ggtaggatcg aacacactgc tgcctttacc aagcagctga 1980
gggtatgtga taggcaaatg ttcaggggcc actgcatggt ttcgaataga aagagaagct 2040
tagccaagaa caatagccga taaagatagc ctcattaaac ggaatgagct agtaggcaaa 2100
gtcagcgaat gtgtatatat aaaggttcga ggtccgtgcc tccctcatgc tctccccatc 2160
tactcatcaa ctcagatcct ccaggagact tgtacaccat cttttgaggc acagaaaccc 2220
aatagtcaac cgcggactgc gcatcatgta tcggaagttg gccgtcatct cggccttctt 2280
ggccacagct cgtgctcagt cggcctgcac tctccaatcg gagactcacc cgcctctgac 2340
atggcagaaa tgctcgtctg gtggcacttg cactcaacag acaggctccg tggtcatcga 2400
cgccaactgg cgctggactc acgctacgaa cagcagcacg aactgctacg atggcaacac 2460
ttggagctcg accctatgtc ctgacaacga gacctgcgcg aagaactgct gtctggacgg 2520
tgccgcctac gcgtccacgt acggagttac cacgagcggt aacagcctct ccattggctt 2580
tgtcacccag tctgcgcaga agaacgttgg cgctcgcctt taccttatgg cgagcgacac 2640
gacctaccag gaattcaccc tgcttggcaa cgagttctct ttcgatgttg atgtttcgca 2700
gctgccgtaa gtgacttacc atgaacccct gacgtatctt cttgtgggct cccagctgac 2760
tggccaattt aaggtgcggc ttgaacggag ctctctactt cgtgtccatg gacgcggatg 2820
gtggcgtgag caagtatccc accaacaccg ctggcgccaa gtacggcacg gggtactgtg 2880
acagccagtg tccccgcgat ctgaagttca tcaatggcca ggccaacgtt gagggctggg 2940
agccgtcatc caacaacgca aacacgggca ttggaggaca cggaagctgc tgctctgaga 3000
tggatatctg ggaggccaac tccatctccg aggctcttac cccccaccct tgcacgactg 3060
tcggccagga gatctgcgag ggtgatgggt gcggcggaac ttactccgat aacagatatg 3120
gcggcacttg cgatcccgat ggctgcgact ggaacccata ccgcctgggc aacaccagct 3180
tctacggccc tggctcaagc tttaccctcg ataccaccaa gaaattgacc gttgtcaccc 3240
agttcgagac gtcgggtgcc atcaaccgat actatgtcca gaatggcgtc actttccagc 3300
agcccaacgc cgagcttggt agttactctg gcaacgagct caacgatgat tactgcacag 3360
ctgaggaggc agaattcggc ggatcctctt tctcagacaa gggcggcctg actcagttca 3420
agaaggctac ctctggcggc atggttctgg tcatgagtct gtgggatgat gtgagtttga 3480
tggacaaaca tgcgcgttga caaagagtca agcagctgac tgagatgtta cagtactacg 3540
ccaacatgct gtggctggac tccacctacc cgacaaacga gacctcctcc acacccggtg 3600
ccgtgcgcgg aagctgctcc accagctccg gtgtccctgc tcaggtcgaa tctcagtctc 3660
ccaacgccaa ggtcaccttc tccaacatca agttcggacc cattggcagc accggcaacc 3720
ctagcggcgg caaccctccc ggcggaaacc cgcctggcac caccaccacc cgccgcccag 3780
ccactaccac tggaagctct cccggaccta ctagtgtcgc cgtttacaaa cgcgctattg 3840
gaccagttgc tgatctgcac atcgttaaca aggatttggc cccagacggc gtccagcgcc 3900
caactgttct ggccggtgga acttttccgg gcacgctgat taccggtcaa aagggcgaca 3960
acttccagct gaacgtgatt gatgacctga ccgacgatcg catgttgacc cctacttcga 4020
tccattggca tggtttcttc cagaagggaa ccgcctgggc cgacggtccg gctttcgtta 4080
cacagtgccc tattatcgca gacaactcct tcctctacga tttcgacgtt cccgaccagg 4140
cgggcacctt ctggtaccac tcacacttgt ctacacagta ctgcgacggt ctgcgcggtg 4200
ccttcgttgt ttacgacccc aacgaccctc acaaggacct ttatgatgtc gatgacggtg 4260
gcacagttat cacattggct gactggtatc acgtcctcgc tcagaccgtt gtcggagctg 4320
ctacacccga ctctacgctg attaacggct tgggacgcag ccagactggc cccgccgacg 4380
ctgagctggc cgttatctct gttgaacaca acaagagata ccgtttcaga ctcgtctcca 4440
tctcgtgcga tcccaacttc acttttagcg tcgacggtca caacatgacg gttatcgagg 4500
ttgatggcgt gaatacccgc cctctcaccg tcgattccat tcaaattttc gccggccagc 4560
gatactcctt tgtgctgaat gccaatcagc ccgaggataa ctactggatc cgcgctatgc 4620
ctaacatcgg acgaaacacc actacccttg atggcaagaa tgccgctatc ctgcgataca 4680
agaacgccag cgttgaggag cccaaaaccg tcggaggacc cgcgcagagc ccattgaacg 4740
aggccgacct gcgacctctg gtgcccgctc ctgtccctgg caacgcagtt cctggtggtg 4800
cggacatcaa ccaccgcctg aacctgacat tcagcaacgg cctcttctct atcaataacg 4860
catcatttac aaaccccagc gtccctgcct tgttgcagat tctttccggc gcacaaaacg 4920
ctcaggatct gcttcccacc ggttcttata tcggcttgga gttgggcaag gtcgttgaac 4980
tcgtgatccc tcccttggcc gttggtggcc cccatccatt ccacttgcac ggccacaact 5040
tttgggtcgt ccgaagcgct ggttctgacg agtataattt cgacgatgca attttgcgcg 5100
acgtggtcag cattggcgcg ggaactgacg aggttactat ccgttttgtc actgataacc 5160
caggcccttg gttcctccat tgccacatcg actggcacct cgaagccggc ctcgccattg 5220
ttttcgccga aggcatcaat caaaccgcag ccgccaaccc gactccacag gcctgggacg 5280
aactctgccc caagtataac ggactctccg cttcccagaa agtgaagccc aagaagggaa 5340
cagccatcta aggcgcgccg cgcgccagct ccgtgcgaaa gcctgacgca ccggtagatt 5400
cttggtgagc ccgtatcatg acggcggcgg gagctacatg gccccgggtg atttattttt 5460
tttgtatcta cttctgaccc ttttcaaata tacggtcaac tcatctttca ctggagatgc 5520
ggcctgcttg gtattgcgat gttgtcagct tggcaaattg tggctttcga aaacacaaaa 5580
cgattcctta gtagccatgc attttaagat aacggaatag aagaaagagg aaattaaaaa 5640
aaaaaaaaaa acaaacatcc cgttcataac ccgtagaatc gccgctcttc gtgtatccca 5700
gtaccagttt attttgaata gctcgcccgc tggagagcat cctgaatgca agtaacaacc 5760
gtagaggctg acacggcagg tgttgctagg gagcgtcgtg ttctacaagg ccagacgtct 5820
tcgcggttga tatatatgta tgtttgactg caggctgctc agcgacgaca gtcaagttcg 5880
ccctcgctgc ttgtgcaata atcgcagtgg ggaagccaca ccgtgactcc catctttcag 5940
taaagctctg ttggtgttta tcagcaatac acgtaattta aactcgttag catggggctg 6000
atagcttaat taccgtttac cagtgccgcg gttctgcagc tttccttggc ccgtaaaatt 6060
cggcgaagcc agccaatcac cagctaggca ccagctaaac cctataatta gtctcttatc 6120
aacaccatcc gctcccccgg gatcaatgag gagaatgagg gggatgcggg gctaaagaag 6180
cctacataac cctcatgcca actcccagtt tacactcgtc gagccaacat cctgactata 6240
agctaacaca gaatgcctca atcctgggaa gaactggccg ctgataagcg cgcccgcctc 6300
gcaaaaacca tccctgatga atggaaagtc cagacgctgc ctgcggaaga cagcgttatt 6360
gatttcccaa agaaatcggg gatcctttca gaggccgaac tgaagatcac agaggcctcc 6420
gctgcagatc ttgtgtccaa gctggcggcc ggagagttga cctcggtgga agttacgcta 6480
gcattctgta aacgggcagc aatcgcccag cagttagtag ggtcccctct acctctcagg 6540
gagatgtaac aacgccacct tatgggacta tcaagctgac gctggcttct gtgcagacaa 6600
actgcgccca cgagttcttc cctgacgccg ctctcgcgca ggcaagggaa ctcgatgaat 6660
actacgcaaa gcacaagaga cccgttggtc cactccatgg cctccccatc tctctcaaag 6720
accagcttcg agtcaaggta caccgttgcc cctaagtcgt tagatgtccc tttttgtcag 6780
ctaacatatg ccaccagggc tacgaaacat caatgggcta catctcatgg ctaaacaagt 6840
acgacgaagg ggactcggtt ctgacaacca tgctccgcaa agccggtgcc gtcttctacg 6900
tcaagacctc tgtcccgcag accctgatgg tctgcgagac agtcaacaac atcatcgggc 6960
gcaccgtcaa cccacgcaac aagaactggt cgtgcggcgg cagttctggt ggtgagggtg 7020
cgatcgttgg gattcgtggt ggcgtcatcg gtgtaggaac ggatatcggt ggctcgattc 7080
gagtgccggc cgcgttcaac ttcctgtacg gtctaaggcc gagtcatggg cggctgccgt 7140
atgcaaagat ggcgaacagc atggagggtc aggagacggt gcacagcgtt gtcgggccga 7200
ttacgcactc tgttgagggt gagtccttcg cctcttcctt cttttcctgc tctataccag 7260
gcctccactg tcctcctttc ttgcttttta tactatatac gagaccggca gtcactgatg 7320
aagtatgtta gacctccgcc tcttcaccaa atccgtcctc ggtcaggagc catggaaata 7380
cgactccaag gtcatcccca tgccctggcg ccagtccgag tcggacatta ttgcctccaa 7440
gatcaagaac ggcgggctca atatcggcta ctacaacttc gacggcaatg tccttccaca 7500
ccctcctatc ctgcgcggcg tggaaaccac cgtcgccgca ctcgccaaag ccggtcacac 7560
cgtgaccccg tggacgccat acaagcacga tttcggccac gatctcatct cccatatcta 7620
cgcggctgac ggcagcgccg acgtaatgcg cgatatcagt gcatccggcg agccggcgat 7680
tccaaatatc aaagacctac tgaacccgaa catcaaagct gttaacatga acgagctctg 7740
ggacacgcat ctccagaagt ggaattacca gatggagtac cttgagaaat ggcgggaggc 7800
tgaagaaaag gccgggaagg aactggacgc catcatcgcg ccgattacgc ctaccgctgc 7860
ggtacggcat gaccagttcc ggtactatgg gtatgcctct gtgatcaacc tgctggattt 7920
cacgagcgtg gttgttccgg ttacctttgc ggataagaac atcgataaga agaatgagag 7980
tttcaaggcg gttagtgagc ttgatgccct cgtgcaggaa gagtatgatc cggaggcgta 8040
ccatggggca ccggttgcag tgcaggttat cggacggaga ctcagtgaag agaggacgtt 8100
ggcgattgca gaggaagtgg ggaagttgct gggaaatgtg gtgactccat agctaataag 8160
tgtcagatag caatttgcac aagaaatcaa taccagcaac tgtaaataag cgctgaagtg 8220
accatgccat gctacgaaag agcagaaaaa aacctgccgt agaaccgaag agatatgaca 8280
cgcttccatc tctcaaagga agaatccctt cagggttgcg tttccagtct agacacgtat 8340
aacggcacaa gtgtctctca ccaaatgggt tatatctcaa atgtgatcta aggatggaaa 8400
gcccagaatc taggcctatt aatattccgg agtatacgta gccggctaac gttaacaacc 8460
ggtacctcta gaactatagc tagcatgcgc aaatttaaag cgctgatatc gatcgcgcgc 8520
agatccatat atagggcccg ggttataatt acctcaggtc gacgtcccat ggccattcga 8580
attcgtaatc atggtcatag ctgtttcctg tgtgaaattg ttatccgctc acaattccac 8640
acaacatacg agccggaagc ataaagtgta aagcctgggg tgcctaatga gtgagctaac 8700
tcacattaat tgcgttgcgc tcactgcccg ctttccagtc gggaaacctg tcgtgccagc 8760
tgcattaatg aatcggccaa cgcgcgggga gaggcggttt gcgtattggg cgctcttccg 8820
cttcctcgct cactgactcg ctgcgctcgg tcgttcggct gcggcgagcg gtatcagctc 8880
actcaaaggc ggtaatacgg ttatccacag aatcagggga taacgcagga aagaacatgt 8940
gagcaaaagg ccagcaaaag gccaggaacc gtaaaaaggc cgcgttgctg gcgtttttcc 9000
ataggctccg cccccctgac gagcatcaca aaaatcgacg ctcaagtcag aggtggcgaa 9060
acccgacagg actataaaga taccaggcgt ttccccctgg aagctccctc gtgcgctctc 9120
ctgttccgac cctgccgctt accggatacc tgtccgcctt tctcccttcg ggaagcgtgg 9180
cgctttctca tagctcacgc tgtaggtatc tcagttcggt gtaggtcgtt cgctccaagc 9240
tgggctgtgt gcacgaaccc cccgttcagc ccgaccgctg cgccttatcc ggtaactatc 9300
gtcttgagtc caacccggta agacacgact tatcgccact ggcagcagcc actggtaaca 9360
ggattagcag agcgaggtat gtaggcggtg ctacagagtt cttgaagtgg tggcctaact 9420
acggctacac tagaagaaca gtatttggta tctgcgctct gctgaagcca gttaccttcg 9480
gaaaaagagt tggtagctct tgatccggca aacaaaccac cgctggtagc ggtggttttt 9540
ttgtttgcaa gcagcagatt acgcgcagaa aaaaaggatc tcaagaagat cctttgatct 9600
tttctacggg gtctgacgct cagtggaacg aaaactcacg ttaagggatt ttggtcatga 9660
gattatcaaa aaggatcttc acctagatcc ttttaaatta aaaatgaagt tttaaatcaa 9720
tctaaagtat atatgagtaa acttggtctg acagttacca atgcttaatc agtgaggcac 9780
ctatctcagc gatctgtcta tttcgttcat ccatagttgc ctgactcccc gtcgtgtaga 9840
taactacgat acgggagggc ttaccatctg gccccagtgc tgcaatgata ccgcgagacc 9900
cacgctcacc ggctccagat ttatcagcaa taaaccagcc agccggaagg gccgagcgca 9960
gaagtggtcc tgcaacttta tccgcctcca tccagtctat taattgttgc cgggaagcta 10020
gagtaagtag ttcgccagtt aatagtttgc gcaacgttgt tgccattgct acaggcatcg 10080
tggtgtcacg ctcgtcgttt ggtatggctt cattcagctc cggttcccaa cgatcaaggc 10140
gagttacatg atcccccatg ttgtgcaaaa aagcggttag ctccttcggt cctccgatcg 10200
ttgtcagaag taagttggcc gcagtgttat cactcatggt tatggcagca ctgcataatt 10260
ctcttactgt catgccatcc gtaagatgct tttctgtgac tggtgagtac tcaaccaagt 10320
cattctgaga atagtgtatg cggcgaccga gttgctcttg cccggcgtca atacgggata 10380
ataccgcgcc acatagcaga actttaaaag tgctcatcat tggaaaacgt tcttcggggc 10440
gaaaactctc aaggatctta ccgctgttga gatccagttc gatgtaaccc actcgtgcac 10500
ccaactgatc ttcagcatct tttactttca ccagcgtttc tgggtgagca aaaacaggaa 10560
ggcaaaatgc cgcaaaaaag ggaataaggg cgacacggaa atgttgaata ctcatactct 10620
tcctttttca atattattga agcatttatc agggttattg tctcatgagc ggatacatat 10680
ttgaatgtat ttagaaaaat aaacaaatag gggttccgcg cacatttccc cgaaaagtgc 10740
cacctgacgt ctaagaaacc attattatca tgacattaac ctataaaaat aggcgtatca 10800
cgaggccctt tcgtctcgcg cgtttcggtg atgacggtga aaacctctga cacatgcagc 10860
tcccggagac ggtcacagct tgtctgtaag cggatgccgg gagcagacaa gcccgtcagg 10920
gcgcgtcagc gggtgttggc gggtgtcggg gctggcttaa ctatgcggca tcagagcaga 10980
ttgtactgag agtgcaccat aaaattgtaa acgttaatat tttgttaaaa ttcgcgttaa 11040
atttttgtta aatcagctca ttttttaacc aataggccga aatcggcaaa atcccttata 11100
aatcaaaaga atagcccgag atagggttga gtgttgttcc agtttggaac aagagtccac 11160
tattaaagaa cgtggactcc aacgtcaaag ggcgaaaaac cgtctatcag ggcgatggcc 11220
cactacgtga accatcaccc aaatcaagtt ttttggggtc gaggtgccgt aaagcactaa 11280
atcggaaccc taaagggagc ccccgattta gagcttgacg gggaaagccg gcgaacgtgg 11340
cgagaaagga agggaagaaa gcgaaaggag cgggcgctag ggcgctggca agtgtagcgg 11400
tcacgctgcg cgtaaccacc acacccgccg cgcttaatgc gccgctacag ggcgcgtact 11460
atggttgctt tgacgtatgc ggtgtgaaat accgcacaga tgcgtaagga gaaaataccg 11520
catcaggcgc cattcgccat tcaggctgcg caactgttgg gaagggcgat cggtgcgggc 11580
ctcttcgcta ttacgccagc tggcgaaagg gggatgtgct gcaaggcgat taagttgggt 11640
aacgccaggg ttttcccagt cacgacgttg taaaacgacg gccagtgcc 11689
<210>2
<211>9375
<212>DNA
<213>Trichoderma
<400>2
agactagcgg ccggtcccct tatcccagct gttccacgtt ggcctgcccc tcagttagcg 60
ctcaactcaa tgcccctcac tggcgaggcg agggcaagga tggaggggca gcatcgcctg 120
agttggagca aagcggccgc catgggagca gcgaaccaac ggagggatgc cgtgctttgt 180
cgtggctgct gtggccaatc cgggcccttg gttggctcac agagcgttgc tgtgagacca 240
tgagctatta ttgctaggta cagtatagag agaggagaga gagagagaga gagagagggg 300
aaaaaaggtg aggttgaagt gagaaaaaaa aaaaaaaaaa aaatccaacc actgacggct 360
gccggctctg ccacccccct ccctccaccc cagaccacct gcacactcag cgcgcagcat 420
cacctaatct tggctcgcct tcccgcagct caggttgttt tttttttctc tctccctcgt 480
cgaagccgcc cttgttccct tatttatttc cctctccatc cttgtctgcc tttggtccat 540
ctgccccttt gtctgcatct cttttgcacg catcgcctta tcgtcgtctc ttttttcact 600
cacgggagct tgacgaagac ctgactcgtg agcctcacct gctgatttct ctccccccct 660
cccgaccggc ttgacttttg tttctcctcc agtaccttat cgcgaagccg gaagaaccct 720
ctttaacccc atcaaacaag tttgtacaaa aaagcaggct atggctcgtt cacggagctc 780
cctggccctc gggctgggcc tgctctgctg gatcacgctg ctcttcgctc ctctggcgtt 840
tgtcggaaag gccaatgccg cgagcgacga cgcggacaac tacggcactg ttatcggaat 900
tgtaagtcga ctgacggcag caaccccgcc attttcttgg tgttgatgct caggcagccc 960
tgctaacacg cttctcctcc gcccaggatc tcggaactac ctacagctgc gtcggtgtga 1020
tgcagaaggg caaggttgag attctcgtca acgaccaggg taaccgaatc actccctcct 1080
acgtggcctt taccgacgag gagcgtctgg ttggcgattc cgccaagaac caggccgccg 1140
ccaaccccac caacaccgtc tacgatgtca agtcagttct accgccctgt tggcttctat 1200
tgtataagtg gacaattagc taactgttgt cacaggcgat tgattggccg caaattcgac 1260
gagaaggaga tccaggccga catcaagcac ttcccctaca aggtcattga gaagaacggc 1320
aagcccgtcg tccaggtcca ggtcaacggc cagaagaagc agttcactcc cgaggagatt 1380
tctgccatga ttcttggcaa gatgaaggag gttgccgagt cgtacctggg caagaaggtt 1440
acccacgccg tcgtcaccgt ccctgcctac ttcaacgtga gtcttttccc cgaaattcct 1500
cgaggattcc aagagccatc tgctaacagc ccgataggac aaccagcgac aggccaccaa 1560
ggacgccggt accattgccg gcttgaacgt tctccgaatc gtcaacgaac ccaccgctgc 1620
cgctatcgcc tatggtctgg acaagaccga cggtgagcgc cagatcattg tctacgatct 1680
cggtggtggt acctttgatg tttctctcct gtccattgac aatggcgtct tcgaggtctt 1740
ggctaccgcc ggtgacaccc accttggtgg tgaggacttt gaccagcgca ttatcaacta 1800
cctggccaag gcctacaaca agaagaacaa cgtcgacatc tccaaggacc tcaaggccat 1860
gggcaagctc aagcgtgaag ccgaaaaggc caagcgtacc ctctcttccc agatgagcac 1920
tcgtatcgaa atcgaggcct tcttcgaggg caacgacttc tccgagactc tcacccgggc 1980
caagttcgag gagctcaaca tggacctctt caagaagacc ctgaagcctg tcgagcaggt 2040
tctcaaggac gccaacgtca agaagagcga ggttgacgac atcgttctgg tcggcggttc 2100
cacccgtatc cccaaggttc agtctcttat cgaggagtac tttaacggca agaaggcttc 2160
caagggtatc aaccccgacg aggctgttgc tttcggtgcc gccgtccagg ccggtgtcct 2220
ttctggtgag gaaggtaccg atgacattgt tctcatggac gtcaaccccc tgactctcgg 2280
tatcgagacc actggcggag tcatgaccaa gctcattccc cgcaacaccc ccatccccac 2340
tcgcaagagc cagatcttct cgactgctgc cgataaccag cccgtcgtcc tgatccaggt 2400
cttcgagggt gagcgttcca tgaccaagga caacaacctc ctgggcaagt tcgagcttac 2460
cggcattcct cctgcccccc gcggtgtccc ccagattgag gtttccttcg agttggatgc 2520
caacggtatc ctcaaggtct ccgctcacga caagggcacc ggcaagcagg agtccatcac 2580
catcaccaac gacaagggcc gtctcaccca ggaggagatt gaccgcatgg ttgccgaggc 2640
cgagaagttc gccgaggagg acaaggctac ccgtgagcgc atcgaggccc gtaacggtct 2700
tgagaactac gccttcagcc tgaagaacca ggtcaatgac gaggagggcc tcggcggcaa 2760
gattgacgag gaggacaagg agactgtaag ttgaagcgat ccatcactgc tttctgatgc 2820
ggacatgtca cactaacact tgaccagatt cttgacgccg tcaaggaggc taccgagtgg 2880
ctcgaggaga acggcgccga cgccactacc gaggactttg aggagcagaa ggagaagctg 2940
tccaacgtcg cctaccccat cacctccaag atgtaccagg gtgctggtgg ctccgaggac 3000
gatggcgact tccacgacga attgtaaacc cagctttctt gtacaaagtg gttcgatggt 3060
ttaggcgcgc cagctccgtg cgaaagcctg acgcaccggt agattcttgg tgagcccgta 3120
tcatgacggc ggcgggagct acatggcccc gggtgattta ttttttttgt atctacttct 3180
gacccttttc aaatatacgg tcaactcatc tttcactgga gatgcggcct gcttggtatt 3240
gcgatgttgt cagcttggca aattgtggct ttcgaaaaca caaaacgatt ccttagtagc 3300
catgcatttt aagataacgg aatagaagaa agaggaaatt aaaaaaaaaa aaaaaacaaa 3360
catcccgttc ataacccgta gaatcgccgc tcttcgtgta tcccagtacc agtttacctg 3420
tggcgccggt gatgccggcc acgatgcgtc cggcgtagag gatcctctag ctagaaagaa 3480
ggattacctc taaacaagtg tacctgtgca ttctgggtaa acgactcata ggagagttgt 3540
aaaaaagttt cggccggcgt attgggtgtt acggagcatt cactaggcaa ccatggttac 3600
tattgtatac ccatcttagt aggaatgatt ttcgaggttt atacctacga tgaatgtgtg 3660
tcctgtaggc ttgagagttc aaggaagaaa cagtgcaatt atctttgcga acccaggggc 3720
tggtgacgga attttcatag tcaagctatc agagttaaga agaggagcat gtcaaagtac 3780
aattagagac aaatatatag tcgcgtggag ccaagagcgg attcctcagt ctcgtaggtc 3840
tcttgacgac cgttgatctg cttgatctcg tctcccgaaa atgaaaatag actctgctaa 3900
gctattcttc tgcttcgccg gagcctgaag ggcgtactag ggttgcgagg tccaatgcat 3960
taatgcattg cagatgagct gtatctggaa gaggtaaacc cgaaacgcgt tttattcttg 4020
ttgacatgga gctattaaat cactagaagg cactctttgc tgcttggaca aatgaacgta 4080
tcttatcgag atcctgaaca ccatttgtct caactccgga gctgacatcg acaccaacga 4140
tcttatatcc agattcgtca agctgtttga tgatttcagt aacgttaagt ggatcccggt 4200
cggcatctac tctattcctt tgccctcgga cgagtgctgg ggcgtcggtt tccactatcg 4260
gcgagtactt ctacacagcc atcggtccag acggccgcgc ttctgcgggc gatttgtgta 4320
cgcccgacag tcccggctcc ggatcggacg attgcgtcgc atcgaccctg cgcccaagct 4380
gcatcatcga aattgccgtc aaccaagctc tgatagagtt ggtcaagacc aatgcggagc 4440
atatacgccc ggaggcgcgg cgatcctgca agctccggat gcctccgctc gaagtagcgc 4500
gtctgctgct ccatacaagc caaccacggc ctccagaaga agatgttggc gacctcgtat 4560
tgggaatccc cgaacatcgc ctcgctccag tcaatgaccg ctgttatgcg gccattgtcc 4620
gtcaggacat tgttggagcc gaaatccgcg tgcacgaggt gccggacttc ggggcagtcc 4680
tcggcccaaa gcatcagctc atcgagagcc tgcgcgacgg acgcactgac ggtgtcgtcc 4740
atcacagttt gccagtgata cacatgggga tcagcaatcg cgcatatgaa atcacgccat 4800
gtagtgtatt gaccgattcc ttgcggtccg aatgggccga acccgctcgt ctggctaaga 4860
tcggccgcag cgatcgcatc catggcctcc gcgaccggct gcagaacagc gggcagttcg 4920
gtttcaggca ggtcttgcaa cgtgacaccc tgtgcacggc gggagatgca ataggtcagg 4980
ctctcgctga attccccaat gtcaagcact tccggaatcg ggagcgcggc cgatgcaaag 5040
tgccgataaa cataacgatc tttgtagaaa ccatcggcgc agctatttac ccgcaggaca 5100
tatccacgcc ctcctacatc gaagctgaaa gcacgagatt cttcgccctc cgagagctgc 5160
atcaggtcgg agacgctgtc gaacttttcg atcagaaact tctcgacaga cgtcgcggtg 5220
agttcaggct ttttcatatg ggtacctgag aacatcttgt tgccctgctt tccgtgcgaa 5280
atactaccgg tacttttggg aaacaaggga acaggagggc gctgctgtgc gcggttctga 5340
gtgttcagga ttgaagctga agaaggtgct gaggaagcgt agaactgttg cggacgcgag 5400
ttctgagaag agctgtaccg attggtgaaa gccgaagaag tgagttggtg ccctgttgcc 5460
tggataatgt ttgcaactcg ctggttctgc agagacggag acaaatgctg gctacgatgt 5520
tgctgattca ggttgatacc tcggtcgaga tactgttttg gtttgatagg gtggatttgg 5580
ttgcagagaa gaagaaagga aggtcaaaga gggaaaactg ggcggaggga aggattttgt 5640
atcaggcagc aaactgccac tgcagtggcc ctggcagtgc cgggcgaggc acccacgcac 5700
ggccgcgcaa ccggttggtc cttgcccacc acgaaaccct tctgaaaggt cagatggaag 5760
tgtgcgacag tgcgcgtccc caagccaatg caggcgccat ggatccactc cccacccgca 5820
agatttcact gtgcgttctt attggttgcc gcaaggccag ccaaaggggg aagtatgagt 5880
cacagcaccg atacaagaaa attgcagaac taacatatgg atgcgcgcgc tattctgtag 5940
agctctgggc aaagcaccaa tcctgcgggt cggtacacac actagcactg ccccacctga 6000
ggcagtcagc cccgctgacc gaattgccaa gagccaatgg agacggaaag ccaacgctga 6060
tggagcacca tctgaatgga cctcgctcgc ttgcctggaa gggacaaggg acaccggaga 6120
cgcggccgca ctagtgcatg cgcaaattta aagcgctgat atcgatcgcg cgcagatcca 6180
tatatagggc ccgggttata attacctcag gtcgacgtcc catggccatt cgaattcgta 6240
atcatgtcat agctgtttcc tgtgtgaaat tgttatccgc tcacaattcc acacaacata 6300
cgagccggaa gcataaagtg taaagcctgg ggtgcctaat gagtgagcta actcacatta 6360
attgcgttgc gctcactgcc cgctttccag tcgggaaacc tgtcgtgcca gctgcattaa 6420
tgaatcggcc aacgcgcggg gagaggcggt ttgcgtattg ggcgctcttc cgcttcctcg 6480
ctcactgact cgctgcgctc ggtcgttcgg ctgcggcgag cggtatcagc tcactcaaag 6540
gcggtaatac ggttatccac agaatcaggg gataacgcag gaaagaacat gtgagcaaaa 6600
ggccagcaaa aggccaggaa ccgtaaaaag gccgcgttgc tggcgttttt ccataggctc 6660
cgcccccctg acgagcatca caaaaatcga cgctcaagtc agaggtggcg aaacccgaca 6720
ggactataaa gataccaggc gtttccccct ggaagctccc tcgtgcgctc tcctgttccg 6780
accctgccgc ttaccggata cctgtccgcc tttctccctt cgggaagcgt ggcgctttct 6840
catagctcac gctgtaggta tctcagttcg gtgtaggtcg ttcgctccaa gctgggctgt 6900
gtgcacgaac cccccgttca gcccgaccgc tgcgccttat ccggtaacta tcgtcttgag 6960
tccaacccgg taagacacga cttatcgcca ctggcagcag ccactggtaa caggattagc 7020
agagcgaggt atgtaggcgg tgctacagag ttcttgaagt ggtggcctaa ctacggctac 7080
actagaagaa cagtatttgg tatctgcgct ctgctgaagc cagttacctt cggaaaaaga 7140
gttggtagct cttgatccgg caaacaaacc accgctggta gcggtggttt ttttgtttgc 7200
aagcagcaga ttacgcgcag aaaaaaagga tctcaagaag atcctttgat cttttctacg 7260
gggtctgacg ctcagtggaa cgaaaactca cgttaaggga ttttggtcat gagattatca 7320
aaaaggatct tcacctagat ccttttaaat taaaaatgaa gttttaaatc aatctaaagt 7380
atatatgagt aaacttggtc tgacagttac caatgcttaa tcagtgaggc acctatctca 7440
gcgatctgtc tatttcgttc atccatagtt gcctgactcc ccgtcgtgta gataactacg 7500
atacgggagg gcttaccatc tggccccagt gctgcaatga taccgcgaga cccacgctca 7560
ccggctccag atttatcagc aataaaccag ccagccggaa gggccgagcg cagaagtggt 7620
cctgcaactt tatccgcctc catccagtct attaattgtt gccgggaagc tagagtaagt 7680
agttcgccag ttaatagttt gcgcaacgtt gttgccattg ctacaggcat cgtggtgtca 7740
cgctcgtcgt ttggtatggc ttcattcagc tccggttccc aacgatcaag gcgagttaca 7800
tgatccccca tgttgtgcaa aaaagcggtt agctccttcg gtcctccgat cgttgtcaga 7860
agtaagttgg ccgcagtgtt atcactcatg gttatggcag cactgcataa ttctcttact 7920
gtcatgccat ccgtaagatg cttttctgtg actggtgagt actcaaccaa gtcattctga 7980
gaatagtgta tgcggcgacc gagttgctct tgcccggcgt caatacggga taataccgcg 8040
ccacatagca gaactttaaa agtgctcatc attggaaaac gttcttcggg gcgaaaactc 8100
tcaaggatct taccgctgtt gagatccagt tcgatgtaac ccactcgtgc acccaactga 8160
tcttcagcat cttttacttt caccagcgtt tctgggtgag caaaaacagg aaggcaaaat 8220
gccgcaaaaa agggaataag ggcgacacgg aaatgttgaa tactcatact cttccttttt 8280
caatattatt gaagcattta tcagggttat tgtctcatga gcggatacat atttgaatgt 8340
atttagaaaa ataaacaaat aggggttccg cgcacatttc cccgaaaagt gccacctgac 8400
gtctaagaaa ccattattat catgacatta acctataaaa ataggcgtat cacgaggccc 8460
tttcgtctcg cgcgtttcgg tgatgacggt gaaaacctct gacacatgca gctcccggag 8520
acggtcacag cttgtctgta agcggatgcc gggagcagac aagcccgtca gggcgcgtca 8580
gcgggtgttg gcgggtgtcg gggctggctt aactatgcgg catcagagca gattgtactg 8640
agagtgcacc ataaaattgt aaacgttaat attttgttaa aattcgcgtt aaatttttgt 8700
taaatcagct cattttttaa ccaataggcc gaaatcggca aaatccctta taaatcaaaa 8760
gaatagcccg agatagggtt gagtgttgtt ccagtttgga acaagagtcc actattaaag 8820
aacgtggact ccaacgtcaa agggcgaaaa accgtctatc agggcgatgg cccactacgt 8880
gaaccatcac ccaaatcaag ttttttgggg tcgaggtgcc gtaaagcact aaatcggaac 8940
cctaaaggga gcccccgatt tagagcttga cggggaaagc cggcgaacgt ggcgagaaag 9000
gaagggaaga aagcgaaagg agcgggcgct agggcgctgg caagtgtagc ggtcacgctg 9060
cgcgtaacca ccacacccgc cgcgcttaat gcgccgctac agggcgcgta ctatggttgc 9120
tttgacgtat gcggtgtgaa ataccgcaca gatgcgtaag gagaaaatac cgcatcaggc 9180
gccattcgcc attcaggctg cgcaactgtt gggaagggcg atcggtgcgg gcctcttcgc 9240
tattacgcca gctggcgaaa gggggatgtg ctgcaaggcg attaagttgg gtaacgccag 9300
ggttttccca gtcacgacgt tgtaaaacga cggccagtgc caagcttaag gtgcacggcc 9360
cacgtggcca ctagt 9375
<210>3
<211>8813
<212>DNA
<213>Trichoderma
<400>3
agactagcgg ccggtcccct tatcccagct gttccacgtt ggcctgcccc tcagttagcg 60
ctcaactcaa tgcccctcac tggcgaggcg agggcaagga tggaggggca gcatcgcctg 120
agttggagca aagcggccgc catgggagca gcgaaccaac ggagggatgc cgtgctttgt 180
cgtggctgct gtggccaatc cgggcccttg gttggctcac agagcgttgc tgtgagacca 240
tgagctatta ttgctaggta cagtatagag agaggagaga gagagagaga gagagagggg 300
aaaaaaggtg aggttgaagt gagaaaaaaa aaaaaaaaaa aaatccaacc actgacggct 360
gccggctctg ccacccccct ccctccaccc cagaccacct gcacactcag cgcgcagcat 420
cacctaatct tggctcgcct tcccgcagct caggttgttt tttttttctc tctccctcgt 480
cgaagccgcc cttgttccct tatttatttc cctctccatc cttgtctgcc tttggtccat 540
ctgccccttt gtctgcatct cttttgcacg catcgcctta tcgtcgtctc ttttttcact 600
cacgggagct tgacgaagac ctgactcgtg agcctcacct gctgatttct ctccccccct 660
cccgaccggc ttgacttttg tttctcctcc agtaccttat cgcgaagccg gaagaaccct 720
ctttaacccc atcaaacaag tttgtacaaa aaagcaggct atgcaacaga agcgtcttac 780
tgctgccctg gtggccgctt tggccgctgt ggtctctgcc gagtcggatg tcaagtcctt 840
gaccaaggac accttcaacg acttcatcaa ctccaatgac ctcgtcctgg ctgagtgtat 900
gtctctctct ctctctctcc ccccctcccc tttgccttct gccctctcaa gcttctgcat 960
ctctcgaccc ctcccccgcc agccccccgg catcgagatc cccgctaaca gctgcaatct 1020
tccagtcttc gctccctggt gcggccactg caaggctctc gcccccgagt acgaggaggc 1080
ggccacgact ctcaaggaca agagcatcaa gctcgccaag gtcgactgtg tcgaggaggc 1140
tgacctctgc aaggagcatg gagttgaggg ctaccccacg ctcaaggtct tccgtggcct 1200
cgataaggtc gctccctaca ctggtccccg caaggctgac gggtaagctt tgaattgcac 1260
tgttctttgc atcaatccat tcattcgcta acgttggttg tcctttcagc atcacctcct 1320
acatggtgaa gcagtccctg cctgccgtct ccgccctcac caaggatacc ctcgaggact 1380
tcaagaccgc cgacaaggtc gtcctggtcg cctacatcgc cgccgatgac aaggcctcca 1440
acgagacctt cactgctctg gccaacgagc tgcgtgacac ctacctcttt ggtggcgtca 1500
acgatgctgc cgttgctgag gctgagggcg tcaagttccc ttccattgtc ctctacaagt 1560
ccttcgacga gggcaagaac gtcttcagcg agaagttcga tgctgaggcc attcgcaact 1620
ttgctcaggt tgccgccact cccctcgttg gcgaagttgg ccctgagacc tacgccggct 1680
acatgtctgc cggtatccct ctggcttaca tcttcgccga gaccgccgag gagcgtgaga 1740
acctggccaa gaccctcaag cccgtcgccg agaagtacaa gggcaagatc aacttcgcca 1800
ccatcgacgc caagaacttt ggctcgcacg ccggcaacat caacctcaag accgacaagt 1860
tccccgcctt tgccattcac gacattgaga agaacctcaa gttccccttt gaccagtcca 1920
aggagatcac cgagaaggac attgccgcct ttgtcgacgg cttctcctct ggcaagattg 1980
aggccagcat caagtccgag cccatccccg agacccagga gggccccgtc accgttgtcg 2040
ttgcccactc ttacaaggac attgtccttg acgacaagaa ggacgtcctg attgagttct 2100
acgctccctg gtgcggtcac tgcaaggctc tcgcccccaa gtacgatgag ctcgccagcc 2160
tgtatgccaa gagcgacttc aaggacaagg ttgtcatcgc caaggttgat gccactgcca 2220
acgacgtccc cgacgagatc cagggcttcc ccaccatcaa gctctacccc gccggtgaca 2280
agaagaaccc cgtcacctac agcggtgccc gcactgttga ggacttcatc gagttcatca 2340
aggagaacgg caagtacaag gccggcgtcg agatccccgc cgagcccacc gaggaggctg 2400
aggcttccga gtccaaggcc tctgaggagg ccaaggcttc cgaggagact cacgatgagc 2460
tgtaaaccca gctttcttgt acaaagtggt tcgatggttt aggcgcgcca gctccgtgcg 2520
aaagcctgac gcaccggtag attcttggtg agcccgtatc atgacggcgg cgggagctac 2580
atggccccgg gtgatttatt ttttttgtat ctacttctga cccttttcaa atatacggtc 2640
aactcatctt tcactggaga tgcggcctgc ttggtattgc gatgttgtca gcttggcaaa 2700
ttgtggcttt cgaaaacaca aaacgattcc ttagtagcca tgcattttaa gataacggaa 2760
tagaagaaag aggaaattaa aaaaaaaaaa aaaacaaaca tcccgttcat aacccgtaga 2820
atcgccgctc ttcgtgtatc ccagtaccag tttacctgtg gcgccggtga tgccggccac 2880
gatgcgtccg gcgtagagga tcctctagct agaaagaagg attacctcta aacaagtgta 2940
cctgtgcatt ctgggtaaac gactcatagg agagttgtaa aaaagtttcg gccggcgtat 3000
tgggtgttac ggagcattca ctaggcaacc atggttacta ttgtataccc atcttagtag 3060
gaatgatttt cgaggtttat acctacgatg aatgtgtgtc ctgtaggctt gagagttcaa 3120
ggaagaaaca gtgcaattat ctttgcgaac ccaggggctg gtgacggaat tttcatagtc 3180
aagctatcag agttaagaag aggagcatgt caaagtacaa ttagagacaa atatatagtc 3240
gcgtggagcc aagagcggat tcctcagtct cgtaggtctc ttgacgaccg ttgatctgct 3300
tgatctcgtc tcccgaaaat gaaaatagac tctgctaagc tattcttctg cttcgccgga 3360
gcctgaaggg cgtactaggg ttgcgaggtc caatgcatta atgcattgca gatgagctgt 3420
atctggaaga ggtaaacccg aaacgcgttt tattcttgtt gacatggagc tattaaatca 3480
ctagaaggca ctctttgctg cttggacaaa tgaacgtatc ttatcgagat cctgaacacc 3540
atttgtctca actccggagc tgacatcgac accaacgatc ttatatccag attcgtcaag 3600
ctgtttgatg atttcagtaa cgttaagtgg atcccggtcg gcatctactc tattcctttg 3660
ccctcggacg agtgctgggg cgtcggtttc cactatcggc gagtacttct acacagccat 3720
cggtccagac ggccgcgctt ctgcgggcga tttgtgtacg cccgacagtc ccggctccgg 3780
atcggacgat tgcgtcgcat cgaccctgcg cccaagctgc atcatcgaaa ttgccgtcaa 3840
ccaagctctg atagagttgg tcaagaccaa tgcggagcat atacgcccgg aggcgcggcg 3900
atcctgcaag ctccggatgc ctccgctcga agtagcgcgt ctgctgctcc atacaagcca 3960
accacggcct ccagaagaag atgttggcga cctcgtattg ggaatccccg aacatcgcct 4020
cgctccagtc aatgaccgct gttatgcggc cattgtccgt caggacattg ttggagccga 4080
aatccgcgtg cacgaggtgc cggacttcgg ggcagtcctc ggcccaaagc atcagctcat 4140
cgagagcctg cgcgacggac gcactgacgg tgtcgtccat cacagtttgc cagtgataca 4200
catggggatc agcaatcgcg catatgaaat cacgccatgt agtgtattga ccgattcctt 4260
gcggtccgaa tgggccgaac ccgctcgtct ggctaagatc ggccgcagcg atcgcatcca 4320
tggcctccgc gaccggctgc agaacagcgg gcagttcggt ttcaggcagg tcttgcaacg 4380
tgacaccctg tgcacggcgg gagatgcaat aggtcaggct ctcgctgaat tccccaatgt 4440
caagcacttc cggaatcggg agcgcggccg atgcaaagtg ccgataaaca taacgatctt 4500
tgtagaaacc atcggcgcag ctatttaccc gcaggacata tccacgccct cctacatcga 4560
agctgaaagc acgagattct tcgccctccg agagctgcat caggtcggag acgctgtcga 4620
acttttcgat cagaaacttc tcgacagacg tcgcggtgag ttcaggcttt ttcatatggg 4680
tacctgagaa catcttgttg ccctgctttc cgtgcgaaat actaccggta cttttgggaa 4740
acaagggaac aggagggcgc tgctgtgcgc ggttctgagt gttcaggatt gaagctgaag 4800
aaggtgctga ggaagcgtag aactgttgcg gacgcgagtt ctgagaagag ctgtaccgat 4860
tggtgaaagc cgaagaagtg agttggtgcc ctgttgcctg gataatgttt gcaactcgct 4920
ggttctgcag agacggagac aaatgctggc tacgatgttg ctgattcagg ttgatacctc 4980
ggtcgagata ctgttttggt ttgatagggt ggatttggtt gcagagaaga agaaaggaag 5040
gtcaaagagg gaaaactggg cggagggaag gattttgtat caggcagcaa actgccactg 5100
cagtggccct ggcagtgccg ggcgaggcac ccacgcacgg ccgcgcaacc ggttggtcct 5160
tgcccaccac gaaacccttc tgaaaggtca gatggaagtg tgcgacagtg cgcgtcccca 5220
agccaatgca ggcgccatgg atccactccc cacccgcaag atttcactgt gcgttcttat 5280
tggttgccgc aaggccagcc aaagggggaa gtatgagtca cagcaccgat acaagaaaat 5340
tgcagaacta acatatggat gcgcgcgcta ttctgtagag ctctgggcaa agcaccaatc 5400
ctgcgggtcg gtacacacac tagcactgcc ccacctgagg cagtcagccc cgctgaccga 5460
attgccaaga gccaatggag acggaaagcc aacgctgatg gagcaccatc tgaatggacc 5520
tcgctcgctt gcctggaagg gacaagggac accggagacg cggccgcact agtgcatgcg 5580
caaatttaaa gcgctgatat cgatcgcgcg cagatccata tatagggccc gggttataat 5640
tacctcaggt cgacgtccca tggccattcg aattcgtaat catgtcatag ctgtttcctg 5700
tgtgaaattg ttatccgctc acaattccac acaacatacg agccggaagc ataaagtgta 5760
aagcctgggg tgcctaatga gtgagctaac tcacattaat tgcgttgcgc tcactgcccg 5820
ctttccagtc gggaaacctg tcgtgccagc tgcattaatg aatcggccaa cgcgcgggga 5880
gaggcggttt gcgtattggg cgctcttccg cttcctcgct cactgactcg ctgcgctcgg 5940
tcgttcggct gcggcgagcg gtatcagctc actcaaaggc ggtaatacgg ttatccacag 6000
aatcagggga taacgcagga aagaacatgt gagcaaaagg ccagcaaaag gccaggaacc 6060
gtaaaaaggc cgcgttgctg gcgtttttcc ataggctccg cccccctgac gagcatcaca 6120
aaaatcgacg ctcaagtcag aggtggcgaa acccgacagg actataaaga taccaggcgt 6180
ttccccctgg aagctccctc gtgcgctctc ctgttccgac cctgccgctt accggatacc 6240
tgtccgcctt tctcccttcg ggaagcgtgg cgctttctca tagctcacgc tgtaggtatc 6300
tcagttcggt gtaggtcgtt cgctccaagc tgggctgtgt gcacgaaccc cccgttcagc 6360
ccgaccgctg cgccttatcc ggtaactatc gtcttgagtc caacccggta agacacgact 6420
tatcgccact ggcagcagcc actggtaaca ggattagcag agcgaggtat gtaggcggtg 6480
ctacagagtt cttgaagtgg tggcctaact acggctacac tagaagaaca gtatttggta 6540
tctgcgctct gctgaagcca gttaccttcg gaaaaagagt tggtagctct tgatccggca 6600
aacaaaccac cgctggtagc ggtggttttt ttgtttgcaa gcagcagatt acgcgcagaa 6660
aaaaaggatc tcaagaagat cctttgatct tttctacggg gtctgacgct cagtggaacg 6720
aaaactcacg ttaagggatt ttggtcatga gattatcaaa aaggatcttc acctagatcc 6780
ttttaaatta aaaatgaagt tttaaatcaa tctaaagtat atatgagtaa acttggtctg 6840
acagttacca atgcttaatc agtgaggcac ctatctcagc gatctgtcta tttcgttcat 6900
ccatagttgc ctgactcccc gtcgtgtaga taactacgat acgggagggc ttaccatctg 6960
gccccagtgc tgcaatgata ccgcgagacc cacgctcacc ggctccagat ttatcagcaa 7020
taaaccagcc agccggaagg gccgagcgca gaagtggtcc tgcaacttta tccgcctcca 7080
tccagtctat taattgttgc cgggaagcta gagtaagtag ttcgccagtt aatagtttgc 7140
gcaacgttgt tgccattgct acaggcatcg tggtgtcacg ctcgtcgttt ggtatggctt 7200
cattcagctc cggttcccaa cgatcaaggc gagttacatg atcccccatg ttgtgcaaaa 7260
aagcggttag ctccttcggt cctccgatcg ttgtcagaag taagttggcc gcagtgttat 7320
cactcatggt tatggcagca ctgcataatt ctcttactgt catgccatcc gtaagatgct 7380
tttctgtgac tggtgagtac tcaaccaagt cattctgaga atagtgtatg cggcgaccga 7440
gttgctcttg cccggcgtca atacgggata ataccgcgcc acatagcaga actttaaaag 7500
tgctcatcat tggaaaacgt tcttcggggc gaaaactctc aaggatctta ccgctgttga 7560
gatccagttc gatgtaaccc actcgtgcac ccaactgatc ttcagcatct tttactttca 7620
ccagcgtttc tgggtgagca aaaacaggaa ggcaaaatgc cgcaaaaaag ggaataaggg 7680
cgacacggaa atgttgaata ctcatactct tcctttttca atattattga agcatttatc 7740
agggttattg tctcatgagc ggatacatat ttgaatgtat ttagaaaaat aaacaaatag 7800
gggttccgcg cacatttccc cgaaaagtgc cacctgacgt ctaagaaacc attattatca 7860
tgacattaac ctataaaaat aggcgtatca cgaggccctt tcgtctcgcg cgtttcggtg 7920
atgacggtga aaacctctga cacatgcagc tcccggagac ggtcacagct tgtctgtaag 7980
cggatgccgg gagcagacaa gcccgtcagg gcgcgtcagc gggtgttggc gggtgtcggg 8040
gctggcttaa ctatgcggca tcagagcaga ttgtactgag agtgcaccat aaaattgtaa 8100
acgttaatat tttgttaaaa ttcgcgttaa atttttgtta aatcagctca ttttttaacc 8160
aataggccga aatcggcaaa atcccttata aatcaaaaga atagcccgag atagggttga 8220
gtgttgttcc agtttggaac aagagtccac tattaaagaa cgtggactcc aacgtcaaag 8280
ggcgaaaaac cgtctatcag ggcgatggcc cactacgtga accatcaccc aaatcaagtt 8340
ttttggggtc gaggtgccgt aaagcactaa atcggaaccc taaagggagc ccccgattta 8400
gagcttgacg gggaaagccg gcgaacgtgg cgagaaagga agggaagaaa gcgaaaggag 8460
cgggcgctag ggcgctggca agtgtagcgg tcacgctgcg cgtaaccacc acacccgccg 8520
cgcttaatgc gccgctacag ggcgcgtact atggttgctt tgacgtatgc ggtgtgaaat 8580
accgcacaga tgcgtaagga gaaaataccg catcaggcgc cattcgccat tcaggctgcg 8640
caactgttgg gaagggcgat cggtgcgggc ctcttcgcta ttacgccagc tggcgaaagg 8700
gggatgtgct gcaaggcgat taagttgggt aacgccaggg ttttcccagt cacgacgttg 8760
taaaacgacg gccagtgcca agcttaaggt gcacggccca cgtggccact agt 8813
<210>4
<211>1910
<212>DNA
<213>Trichoderma
<400>4
atgaagtcgg cgagcaaatt gttctttctc tccgtgtttt ccctatgggc gacgccgggc 60
gcatgctcaa gctcgtcaag tacatgcact gtacgtcaac ccaaccttgg cctcgtttcc 120
cctttggaag aatgctttgc gctgacagat tttgttgatc tagttctccc caaacgccat 180
cattgacgat ggatgcgttt cgtatgcgac tctcgataga ctcaatgtca aggtgaagcc 240
tgctatagac gaactcgttc agacgaccga cttcttttcg cactatcgct tgaacctctt 300
caacaaaaaa tgccccttct ggaacgacga agatggcatg tgcggtaaca ttgcctgcgc 360
cgtcgagacg ctggacaacg aagaagatat tcccgagata tggagggctc acgagcttag 420
caagctggaa ggccctcgag cgaagcatcc cggcaagcaa gagcagaggc agaaccctga 480
gcgaccgctg cagggagagc tgggggagga tgtaggggag agctgcgtgg ttgaatacga 540
cgacgagtgt gacgacagag actactgcgt ctgggacgac gaaggcgcaa cgtccaaggg 600
ggactacatc agcttgttgc gcaaccccga gcgcttcacc ggctatggcg gtcaaagtgc 660
aaagcaggtg tgggacgcca tctactcgga gaactgcttc aagaagagct cgtttcccaa 720
gtcggccgat ctaggcgtct cgcaccgccc aaccgaggcg gctgctctgg acttcaagca 780
ggtcctggac accgctggcc gccaggctca actggaacag cagcggcaga gcaacccaaa 840
cattcccttt gttgccaaca ctggctacga ggtggacgat gagtgtctgg agaagcgcgt 900
gttctaccgg gtggtgtcgg gaatgcacgc cagcatcagc gtccacctgt gctgggactt 960
cctgaaccag agcacggggc aatggcagcc caacttggac tgctacgaga gccgcctgca 1020
caagtttcca gaccgcatca gcaacctcta cttcaactac gctctcgtga ctcgcgccat 1080
tgcgaagctg ggcccgtatg tactgtcacc gcagtacacc ttttgcacag gggacccgtt 1140
gcaagaccag gagacgcgag acaagattgc ggccgtcacg aagcacgcgg ctagcgtccc 1200
gcagatcttt gacgagggcg tcatgtttgt caacggcgaa ggcccctcgc tcaaggaaga 1260
tttccgcaat cgcttccgca acatcagccg ggtcatggac tgcgtcggct gcgacaagtg 1320
ccgtctctgg ggcaagatcc agaccagcgg ctacggcacg gctttgaaga ttctgtttga 1380
gttcaacgag ggccagaagc cgccgcccct caagaggacc gagctggtgg ccctcttcaa 1440
cacgtatgcc agactcagct cgtcggtggc ggccgttggg cgattcaggg ccatgattga 1500
catgcgcgac aagatggcgt ccaagcccga cttcaagccc gaggatctct acacgctcat 1560
cgacgaggcg gacgaggaca tggacgagtt tatcaggatg caaaatcgtg ggagccacgg 1620
agatacgctg ggcgagcagg tcggaaacga atttgcccgc gtcatgatgg ccgtcaagat 1680
tgtgctcaag agttggatcc gaacgcccaa gatgatgtaa gtctcttctc tctttttttt 1740
ccccttcttc gagtggcaca aagctcttca ttgagatgga ctaacacaat tctagttggc 1800
aaattgtctc ggaagagacg tcgagattgt atcgcgcttg ggtcggtctg cctgcgcgac 1860
ccagacggta cgcgttcaga ctgcccaact tgaatagaga cgagttgtga 1910
<210>5
<211>12297
<212>DNA
<213>Trichoderma
<400>5
aagcttacta gtacttctcg agctctgtac atgtccggtc gcgacgtacg cgtatcgatg 60
gcgccagctg caggcggccg cctgcagcca cttgcagtcc cgtggaattc tcacggtgaa 120
tgtaggcctt ttgtagggta ggaattgtca ctcaagcacc cccaacctcc attacgcctc 180
ccccatagag ttcccaatca gtgagtcatg gcactgttct caaatagatt ggggagaagt 240
tgacttccgc ccagagctga aggtcgcaca accgcatgat atagggtcgg caacggcaaa 300
aaagcacgtg gctcaccgaa aagcaagatg tttgcgatct aacatccagg aacctggata 360
catccatcat cacgcacgac cactttgatc tgctggtaaa ctcgtattcg ccctaaaccg 420
aagtgcgtgg taaatctaca cgtgggcccc tttcggtata ctgcgtgtgt cttctctagg 480
tgccattctt ttcccttcct ctagtgttga attgtttgtg ttggagtccg agctgtaact 540
acctctgaat ctctggagaa tggtggacta acgactaccg tgcacctgca tcatgtatat 600
aatagtgatc ctgagaaggg gggtttggag caatgtggga ctttgatggt catcaaacaa 660
agaacgaaga cgcctctttt gcaaagtttt gtttcggcta cggtgaagaa ctggatactt 720
gttgtgtctt ctgtgtattt ttgtggcaac aagaggccag agacaatcta ttcaaacacc 780
aagcttgctc ttttgagcta caagaacctg tggggtatat atctagagtt gtgaagtcgg 840
taatcccgct gtatagtaat acgagtcgca tctaaatact ccgaagctgc tgcgaacccg 900
gagaatcgag atgtgctgga aagcttctag cgagcggcta aattagcatg aaaggctatg 960
agaaattctg gagacggctt gttgaatcat ggcgttccat tcttcgacaa gcaaagcgtt 1020
ccgtcgcagt agcaggcact cattcccgaa aaaactcgga gattcctaag tagcgatgga 1080
accggaataa tataataggc aatacattga gttgcctcga cggttgcaat gcaggggtac 1140
tgagcttgga cataactgtt ccgtacccca cctcttctca acctttggcg tttccctgat 1200
tcagcgtacc cgtacaagtc gtaatcacta ttaacccaga ctgaccggac gtgttttgcc 1260
cttcatttgg agaaataatg tcattgcgat gtgtaatttg cctgcttgac cgactggggc 1320
tgttcgaagc ccgaatgtag gattgttatc cgaactctgc tcgtagaggc atgttgtgaa 1380
tctgtgtcgg gcaggacacg cctcgaaggt tcacggcaag ggaaaccacc gatagcagtg 1440
tctagtagca acctgtaaag ccgcaatgca gcatcactgg aaaatacaaa ccaatggcta 1500
aaagtacata agttaatgcc taaagaagtc atataccagc ggctaataat tgtacaatca 1560
agtggctaaa cgtaccgtaa tttgccaacg gcttgtgggg ttgcagaagc aacggcaaag 1620
ccccacttcc ccacgtttgt ttcttcactc agtccaatct cagctggtga tcccccaatt 1680
gggtcgcttg tttgttccgg tgaagtgaaa gaagacagag gtaagaatgt ctgactcgga 1740
gcgttttgca tacaaccaag ggcagtgatg gaagacagtg aaatgttgac attcaaggag 1800
tatttagcca gggatgcttg agtgtatcgt gtaaggaggt ttgtctgccg atacgacgaa 1860
tactgtatag tcacttctga tgaagtggtc catattgaaa tgtaaagtcg gcactgaaca 1920
ggcaaaagat tgagttgaaa ctgcctaaga tctcgggccc tcgggccttc ggcctttggg 1980
tgtacatgtt tgtgctccgg gcaaatgcaa agtgtggtag gatcgaacac actgctgcct 2040
ttaccaagca gctgagggta tgtgataggc aaatgttcag gggccactgc atggtttcga 2100
atagaaagag aagcttagcc aagaacaata gccgataaag atagcctcat taaacggaat 2160
gagctagtag gcaaagtcag cgaatgtgta tatataaagg ttcgaggtcc gtgcctccct 2220
catgctctcc ccatctactc atcaactcag atcctccagg agacttgtac accatctttt 2280
gaggcacaga aacccaatag tcaaccatca caagtttgta caaaaaagca ggctccgcgg 2340
ccgccccctt caccatcatg cacgtcctgt cgactgcggt gctgctcggc tccgttgccg 2400
ttcaaaaggt cctgggaaga ccaggatcaa gcggtctgtc cgacgtcacc aagaggtctg 2460
ttgacgactt catcagcacc gagacgccta ttgcactgaa caatcttctt tgcaatgttg 2520
gtcctgatgg atgccgtgca ttcggcacat cagctggtgc ggtgattgca tctcccagca 2580
caattgaccc ggactgtaag ttggccttga tgaaccatat catatatcgc cgagaagtgg 2640
accgcgtgct gagactgaga cagactatta catgtggacg cgagatagcg ctcttgtctt 2700
caagaacctc atcgaccgct tcaccgaaac gtacgatgcg ggcctgcagc gccgcatcga 2760
gcagtacatt actgcccagg tcactctcca gggcctctct aacccctcgg gctccctcgc 2820
ggacggctct ggtctcggcg agcccaagtt tgagttgacc ctgaagcctt tcaccggcaa 2880
ctggggtcga ccgcagcggg atggcccagc tctgcgagcc attgccttga ttggatactc 2940
aaagtggctc atcaacaaca actatcagtc gactgtgtcc aacgtcatct ggcctattgt 3000
gcgcaacgac ctcaactatg ttgcccagta ctggtcagtg cttgcttgct cttgaattac 3060
gtctttgctt gtgtgtctaa tgcctccacc acaggaacca aaccggcttt gacctctggg 3120
aagaagtcaa tgggagctca ttctttactg ttgccaacca gcaccgaggt atgaagcaaa 3180
tcctcgacat tcgctgctac tgcacatgag cattgttact gaccagctct acagcacttg 3240
tcgagggcgc cactcttgct gccactcttg gccagtcggg aagcgcttat tcatctgttg 3300
ctccccaggt tttgtgcttt ctccaacgat tctgggtgtc gtctggtgga tacgtcgact 3360
ccaacagtat gtcttttcac tgtttatatg agattggcca atactgatag ctcgcctcta 3420
gtcaacacca acgagggcag gactggcaag gatgtcaact ccgtcctgac ttccatccac 3480
accttcgatc ccaaccttgg ctgtgacgca ggcaccttcc agccatgcag tgacaaagcg 3540
ctctccaacc tcaaggttgt tgtcgactcc ttccgctcca tctacggcgt gaacaagggc 3600
attcctgccg gtgctgccgt cgccattggc cggtatgcag aggatgtgta ctacaacggc 3660
aacccttggt atcttgctac atttgctgct gccgagcagc tgtacgatgc catctacgtc 3720
tggaagaaga cgggctccat cacggtgacc gccacctccc tggccttctt ccaggagctt 3780
gttcctggcg tgacggccgg gacctactcc agcagctctt cgacctttac caacatcatc 3840
aacgccgtct cgacatacgc cgatggcttc ctcagcgagg ctgccaagta cgtccccgcc 3900
gacggttcgc tggccgagca gtttgaccgc aacagcggca ctccgctgtc tgcgcttcac 3960
ctgacgtggt cgtacgcctc gttcttgaca gccacggccc gtcgggctgg catcgtgccc 4020
ccctcgtggg ccaacagcag cgctagcacg atcccctcga cgtgctccgg cgcgtccgtg 4080
gtcggatcct actcgcgtcc caccgccacg tcattccctc cgtcgcagac gcccaagcct 4140
ggcgtgcctt ccggtactcc ctacacgccc ctgccctgcg cgaccccaac ctccgtggcc 4200
gtcaccttcc acgagctcgt gtcgacacag tttggccaga cggtcaaggt ggcgggcaac 4260
gccgcggccc tgggcaactg gagcacgagc gccgccgtgg ctctggacgc cgtcaactat 4320
gccgataacc accccctgtg gattgggacg gtcaacctcg aggctggaga cgtcgtggag 4380
tacaagtaca tcaatgtggg ccaagatggc tccgtgacct gggagagtga tcccaaccac 4440
acttacacgg ttcctgcggt ggcttgtgtg acgcaggttg tcaaggagga cacctggcag 4500
tcggctattg gaccagttgc tgatctgcac atcgttaaca aggatttggc cccagacggc 4560
gtccagcgcc caactgttct ggccggtgga acttttccgg gcacgctgat taccggtcaa 4620
aagggcgaca acttccagct gaacgtgatt gatgacctga ccgacgatcg catgttgacc 4680
cctacttcga tccattggca tggtttcttc cagaagggaa ccgcctgggc cgacggtccg 4740
gctttcgtta cacagtgccc tattatcgca gacaactcct tcctctacga tttcgacgtt 4800
cccgaccagg cgggcacctt ctggtaccac tcacacttgt ctacacagta ctgcgacggt 4860
ctgcgcggtg ccttcgttgt ttacgacccc aacgaccctc acaaggacct ttatgatgtc 4920
gatgacggtg gcacagttat cacattggct gactggtatc acgtcctcgc tcagaccgtt 4980
gtcggagctg ctacacccga ctctacgctg attaacggct tgggacgcag ccagactggc 5040
cccgccgacg ctgagctggc cgttatctct gttgaacaca acaagagata ccgtttcaga 5100
ctcgtctcca tctcgtgcga tcccaacttc acttttagcg tcgacggtca caacatgacg 5160
gttatcgagg ttgatggcgt gaatacccgc cctctcaccg tcgattccat tcaaattttc 5220
gccggccagc gatactcctt tgtgctgaat gccaatcagc ccgaggataa ctactggatc 5280
cgcgctatgc ctaacatcgg acgaaacacc actacccttg atggcaagaa tgccgctatc 5340
ctgcgataca agaacgccag cgttgaggag cccaaaaccg tcggaggacc cgcgcagagc 5400
ccattgaacg aggccgacct gcgacctctg gtgcccgctc ctgtccctgg caacgcagtt 5460
cctggtggtg cggacatcaa ccaccgcctg aacctgacat tcagcaacgg cctcttctct 5520
atcaataacg catcatttac aaaccccagc gtccctgcct tgttgcagat tctttccggc 5580
gcacaaaacg ctcaggatct gcttcccacc ggttcttata tcggcttgga gttgggcaag 5640
gtcgttgaac tcgtgatccc tcccttggcc gttggtggcc cccatccatt ccacttgcac 5700
ggccacaact tttgggtcgt ccgaagcgct ggttctgacg agtataattt cgacgatgca 5760
attttgcgcg acgtggtcag cattggcgcg ggaactgacg aggttactat ccgttttgtc 5820
actgataacc caggcccttg gttcctccat tgccacatcg actggcacct cgaagccggc 5880
ctcgccattg ttttcgccga aggcatcaat caaaccgcag ccgccaaccc gactccacag 5940
gcctgggacg aactctgccc caagtataac ggactctccg cttcccagaa agtgaagccc 6000
aagaagggaa cagccatcta aaagggtggg cgcgccgacc cagctttctt gtacaaagtg 6060
gtgatcgcgc cagctccgtg cgaaagcctg acgcaccggt agattcttgg tgagcccgta 6120
tcatgacggc ggcgggagct acatggcccc gggtgattta ttttttttgt atctacttct 6180
gacccttttc aaatatacgg tcaactcatc tttcactgga gatgcggcct gcttggtatt 6240
gcgatgttgt cagcttggca aattgtggct ttcgaaaaca caaaacgatt ccttagtagc 6300
catgcatttt aagataacgg aatagaagaa agaggaaatt aaaaaaaaaa aaaaaacaaa 6360
catcccgttc ataacccgta gaatcgccgc tcttcgtgta tcccagtacc agtttatttt 6420
gaatagctcg cccgctggag agcatcctga atgcaagtaa caaccgtaga ggctgacacg 6480
gcaggtgttg ctagggagcg tcgtgttcta caaggccaga cgtcttcgcg gttgatatat 6540
atgtatgttt gactgcaggc tgctcagcga cgacagtcaa gttcgccctc gctgcttgtg 6600
caataatcgc agtggggaag ccacaccgtg actcccatct ttcagtaaag ctctgttggt 6660
gtttatcagc aatacacgta atttaaactc gttagcatgg ggctgatagc ttaattaccg 6720
tttaccagtg ccatggttct gcagctttcc ttggcccgta aaattcggcg aagccagcca 6780
atcaccagct aggcaccagc taaaccctat aattagtctc ttatcaacac catccgctcc 6840
cccgggatca atgaggagaa tgagggggat gcggggctaa agaagcctac ataaccctca 6900
tgccaactcc cagtttacac tcgtcgagcc aacatcctga ctataagcta acacagaatg 6960
cctcaatcct gggaagaact ggccgctgat aagcgcgccc gcctcgcaaa aaccatccct 7020
gatgaatgga aagtccagac gctgcctgcg gaagacagcg ttattgattt cccaaagaaa 7080
tcggggatcc tttcagaggc cgaactgaag atcacagagg cctccgctgc agatcttgtg 7140
tccaagctgg cggccggaga gttgacctcg gtggaagtta cgctagcatt ctgtaaacgg 7200
gcagcaatcg cccagcagtt agtagggtcc cctctacctc tcagggagat gtaacaacgc 7260
caccttatgg gactatcaag ctgacgctgg cttctgtgca gacaaactgc gcccacgagt 7320
tcttccctga cgccgctctc gcgcaggcaa gggaactcga tgaatactac gcaaagcaca 7380
agagacccgt tggtccactc catggcctcc ccatctctct caaagaccag cttcgagtca 7440
aggtacaccg ttgcccctaa gtcgttagat gtcccttttt gtcagctaac atatgccacc 7500
agggctacga aacatcaatg ggctacatct catggctaaa caagtacgac gaaggggact 7560
cggttctgac aaccatgctc cgcaaagccg gtgccgtctt ctacgtcaag acctctgtcc 7620
cgcagaccct gatggtctgc gagacagtca acaacatcat cgggcgcacc gtcaacccac 7680
gcaacaagaa ctggtcgtgc ggcggcagtt ctggtggtga gggtgcgatc gttgggattc 7740
gtggtggcgt catcggtgta ggaacggata tcggtggctc gattcgagtg ccggccgcgt 7800
tcaacttcct gtacggtcta aggccgagtc atgggcggct gccgtatgca aagatggcga 7860
acagcatgga gggtcaggag acggtgcaca gcgttgtcgg gccgattacg cactctgttg 7920
agggtgagtc cttcgcctct tccttctttt cctgctctat accaggcctc cactgtcctc 7980
ctttcttgct ttttatacta tatacgagac cggcagtcac tgatgaagta tgttagacct 8040
ccgcctcttc accaaatccg tcctcggtca ggagccatgg aaatacgact ccaaggtcat 8100
ccccatgccc tggcgccagt ccgagtcgga cattattgcc tccaagatca agaacggcgg 8160
gctcaatatc ggctactaca acttcgacgg caatgtcctt ccacaccctc ctatcctgcg 8220
cggcgtggaa accaccgtcg ccgcactcgc caaagccggt cacaccgtga ccccgtggac 8280
gccatacaag cacgatttcg gccacgatct catctcccat atctacgcgg ctgacggcag 8340
cgccgacgta atgcgcgata tcagtgcatc cggcgagccg gcgattccaa atatcaaaga 8400
cctactgaac ccgaacatca aagctgttaa catgaacgag ctctgggaca cgcatctcca 8460
gaagtggaat taccagatgg agtaccttga gaaatggcgg gaggctgaag aaaaggccgg 8520
gaaggaactg gacgccatca tcgcgccgat tacgcctacc gctgcggtac ggcatgacca 8580
gttccggtac tatgggtatg cctctgtgat caacctgctg gatttcacga gcgtggttgt 8640
tccggttacc tttgcggata agaacatcga taagaagaat gagagtttca aggcggttag 8700
tgagcttgat gccctcgtgc aggaagagta tgatccggag gcgtaccatg gggcaccggt 8760
tgcagtgcag gttatcggac ggagactcag tgaagagagg acgttggcga ttgcagagga 8820
agtggggaag ttgctgggaa atgtggtgac tccatagcta ataagtgtca gatagcaatt 8880
tgcacaagaa atcaatacca gcaactgtaa ataagcgctg aagtgaccat gccatgctac 8940
gaaagagcag aaaaaaacct gccgtagaac cgaagagata tgacacgctt ccatctctca 9000
aaggaagaat cccttcaggg ttgcgtttcc agtctagaca cgtataacgg cacaagtgtc 9060
tctcaccaaa tgggttatat ctcaaatgtg atctaaggat ggaaagccca gaatatcgat 9120
cgcgcgcaga tccatatata gggcccgggt tataattacc tcaggtcgac gtcccatggc 9180
cattcgaatt cgtaatcatg gtcatagctg tttcctgtgt gaaattgtta tccgctcaca 9240
attccacaca acatacgagc cggaagcata aagtgtaaag cctggggtgc ctaatgagtg 9300
agctaactca cattaattgc gttgcgctca ctgcccgctt tccagtcggg aaacctgtcg 9360
tgccagctgc attaatgaat cggccaacgc gcggggagag gcggtttgcg tattgggcgc 9420
tcttccgctt cctcgctcac tgactcgctg cgctcggtcg ttcggctgcg gcgagcggta 9480
tcagctcact caaaggcggt aatacggtta tccacagaat caggggataa cgcaggaaag 9540
aacatgtgag caaaaggcca gcaaaaggcc aggaaccgta aaaaggccgc gttgctggcg 9600
tttttccata ggctccgccc ccctgacgag catcacaaaa atcgacgctc aagtcagagg 9660
tggcgaaacc cgacaggact ataaagatac caggcgtttc cccctggaag ctccctcgtg 9720
cgctctcctg ttccgaccct gccgcttacc ggatacctgt ccgcctttct cccttcggga 9780
agcgtggcgc tttctcatag ctcacgctgt aggtatctca gttcggtgta ggtcgttcgc 9840
tccaagctgg gctgtgtgca cgaacccccc gttcagcccg accgctgcgc cttatccggt 9900
aactatcgtc ttgagtccaa cccggtaaga cacgacttat cgccactggc agcagccact 9960
ggtaacagga ttagcagagc gaggtatgta ggcggtgcta cagagttctt gaagtggtgg 10020
cctaactacg gctacactag aagaacagta tttggtatct gcgctctgct gaagccagtt 10080
accttcggaa aaagagttgg tagctcttga tccggcaaac aaaccaccgc tggtagcggt 10140
ggtttttttg tttgcaagca gcagattacg cgcagaaaaa aaggatctca agaagatcct 10200
ttgatctttt ctacggggtc tgacgctcag tggaacgaaa actcacgtta agggattttg 10260
gtcatgagat tatcaaaaag gatcttcacc tagatccttt taaattaaaa atgaagtttt 10320
aaatcaatct aaagtatata tgagtaaact tggtctgaca gttaccaatg cttaatcagt 10380
gaggcaccta tctcagcgat ctgtctattt cgttcatcca tagttgcctg actccccgtc 10440
gtgtagataa ctacgatacg ggagggctta ccatctggcc ccagtgctgc aatgataccg 10500
cgagacccac gctcaccggc tccagattta tcagcaataa accagccagc cggaagggcc 10560
gagcgcagaa gtggtcctgc aactttatcc gcctccatcc agtctattaa ttgttgccgg 10620
gaagctagag taagtagttc gccagttaat agtttgcgca acgttgttgc cattgctaca 10680
ggcatcgtgg tgtcacgctc gtcgtttggt atggcttcat tcagctccgg ttcccaacga 10740
tcaaggcgag ttacatgatc ccccatgttg tgcaaaaaag cggttagctc cttcggtcct 10800
ccgatcgttg tcagaagtaa gttggccgca gtgttatcac tcatggttat ggcagcactg 10860
cataattctc ttactgtcat gccatccgta agatgctttt ctgtgactgg tgagtactca 10920
accaagtcat tctgagaata gtgtatgcgg cgaccgagtt gctcttgccc ggcgtcaata 10980
cgggataata ccgcgccaca tagcagaact ttaaaagtgc tcatcattgg aaaacgttct 11040
tcggggcgaa aactctcaag gatcttaccg ctgttgagat ccagttcgat gtaacccact 11100
cgtgcaccca actgatcttc agcatctttt actttcacca gcgtttctgg gtgagcaaaa 11160
acaggaaggc aaaatgccgc aaaaaaggga ataagggcga cacggaaatg ttgaatactc 11220
atactcttcc tttttcaata ttattgaagc atttatcagg gttattgtct catgagcgga 11280
tacatatttg aatgtattta gaaaaataaa caaatagggg ttccgcgcac atttccccga 11340
aaagtgccac ctgacgtcta agaaaccatt attatcatga cattaaccta taaaaatagg 11400
cgtatcacga ggccctttcg tctcgcgcgt ttcggtgatg acggtgaaaa cctctgacac 11460
atgcagctcc cggagacggt cacagcttgt ctgtaagcgg atgccgggag cagacaagcc 11520
cgtcagggcg cgtcagcggg tgttggcggg tgtcggggct ggcttaacta tgcggcatca 11580
gagcagattg tactgagagt gcaccataaa attgtaaacg ttaatatttt gttaaaattc 11640
gcgttaaatt tttgttaaat cagctcattt tttaaccaat aggccgaaat cggcaaaatc 11700
ccttataaat caaaagaata gcccgagata gggttgagtg ttgttccagt ttggaacaag 11760
agtccactat taaagaacgt ggactccaac gtcaaagggc gaaaaaccgt ctatcagggc 11820
gatggcccac tacgtgaacc atcacccaaa tcaagttttt tggggtcgag gtgccgtaaa 11880
gcactaaatc ggaaccctaa agggagcccc cgatttagag cttgacgggg aaagccggcg 11940
aacgtggcga gaaaggaagg gaagaaagcg aaaggagcgg gcgctagggc gctggcaagt 12000
gtagcggtca cgctgcgcgt aaccaccaca cccgccgcgc ttaatgcgcc gctacagggc 12060
gcgtactatg gttgctttga cgtatgcggt gtgaaatacc gcacagatgc gtaaggagaa 12120
aataccgcat caggcgccat tcgccattca ggctgcgcaa ctgttgggaa gggcgatcgg 12180
tgcgggcctc ttcgctatta cgccagctgg cgaaaggggg atgtgctgca aggcgattaa 12240
gttgggtaac gccagggttt tcccagtcac gacgttgtaa aacgacggcc agtgccc 12297
<210>6
<211>10208
<212>DNA
<213>Trichoderma
<400>6
aagcttacta gtacttctcg agctctgtac atgtccggtc gcgacgtacg cgtatcgatg 60
gcgccagctg caggcggccg cctgcagcca cttgcagtcc cgtggaattc tcacggtgaa 120
tgtaggcctt ttgtagggta ggaattgtca ctcaagcacc cccaacctcc attacgcctc 180
ccccatagag ttcccaatca gtgagtcatg gcactgttct caaatagatt ggggagaagt 240
tgacttccgc ccagagctga aggtcgcaca accgcatgat atagggtcgg caacggcaaa 300
aaagcacgtg gctcaccgaa aagcaagatg tttgcgatct aacatccagg aacctggata 360
catccatcat cacgcacgac cactttgatc tgctggtaaa ctcgtattcg ccctaaaccg 420
aagtgcgtgg taaatctaca cgtgggcccc tttcggtata ctgcgtgtgt cttctctagg 480
tgccattctt ttcccttcct ctagtgttga attgtttgtg ttggagtccg agctgtaact 540
acctctgaat ctctggagaa tggtggacta acgactaccg tgcacctgca tcatgtatat 600
aatagtgatc ctgagaaggg gggtttggag caatgtggga ctttgatggt catcaaacaa 660
agaacgaaga cgcctctttt gcaaagtttt gtttcggcta cggtgaagaa ctggatactt 720
gttgtgtctt ctgtgtattt ttgtggcaac aagaggccag agacaatcta ttcaaacacc 780
aagcttgctc ttttgagcta caagaacctg tggggtatat atctagagtt gtgaagtcgg 840
taatcccgct gtatagtaat acgagtcgca tctaaatact ccgaagctgc tgcgaacccg 900
gagaatcgag atgtgctgga aagcttctag cgagcggcta aattagcatg aaaggctatg 960
agaaattctg gagacggctt gttgaatcat ggcgttccat tcttcgacaa gcaaagcgtt 1020
ccgtcgcagt agcaggcact cattcccgaa aaaactcgga gattcctaag tagcgatgga 1080
accggaataa tataataggc aatacattga gttgcctcga cggttgcaat gcaggggtac 1140
tgagcttgga cataactgtt ccgtacccca cctcttctca acctttggcg tttccctgat 1200
tcagcgtacc cgtacaagtc gtaatcacta ttaacccaga ctgaccggac gtgttttgcc 1260
cttcatttgg agaaataatg tcattgcgat gtgtaatttg cctgcttgac cgactggggc 1320
tgttcgaagc ccgaatgtag gattgttatc cgaactctgc tcgtagaggc atgttgtgaa 1380
tctgtgtcgg gcaggacacg cctcgaaggt tcacggcaag ggaaaccacc gatagcagtg 1440
tctagtagca acctgtaaag ccgcaatgca gcatcactgg aaaatacaaa ccaatggcta 1500
aaagtacata agttaatgcc taaagaagtc atataccagc ggctaataat tgtacaatca 1560
agtggctaaa cgtaccgtaa tttgccaacg gcttgtgggg ttgcagaagc aacggcaaag 1620
ccccacttcc ccacgtttgt ttcttcactc agtccaatct cagctggtga tcccccaatt 1680
gggtcgcttg tttgttccgg tgaagtgaaa gaagacagag gtaagaatgt ctgactcgga 1740
gcgttttgca tacaaccaag ggcagtgatg gaagacagtg aaatgttgac attcaaggag 1800
tatttagcca gggatgcttg agtgtatcgt gtaaggaggt ttgtctgccg atacgacgaa 1860
tactgtatag tcacttctga tgaagtggtc catattgaaa tgtaaagtcg gcactgaaca 1920
ggcaaaagat tgagttgaaa ctgcctaaga tctcgggccc tcgggccttc ggcctttggg 1980
tgtacatgtt tgtgctccgg gcaaatgcaa agtgtggtag gatcgaacac actgctgcct 2040
ttaccaagca gctgagggta tgtgataggc aaatgttcag gggccactgc atggtttcga 2100
atagaaagag aagcttagcc aagaacaata gccgataaag atagcctcat taaacggaat 2160
gagctagtag gcaaagtcag cgaatgtgta tatataaagg ttcgaggtcc gtgcctccct 2220
catgctctcc ccatctactc atcaactcag atcctccagg agacttgtac accatctttt 2280
gaggcacaga aacccaatag tcaaccatca caagtttgta caaaaaagca ggctccgcgg 2340
ccgccccctt caccatgcag acctttggag cttttctcgt ttccttcctc gccgccagcg 2400
gcctggccgc ggccgctatt ggaccagttg ctgatctgca catcgttaac aaggatttgg 2460
ccccagacgg cgtccagcgc ccaactgttc tggccggtgg aacttttccg ggcacgctga 2520
ttaccggtca aaagggcgac aacttccagc tgaacgtgat tgatgacctg accgacgatc 2580
gcatgttgac ccctacttcg atccattggc atggtttctt ccagaaggga accgcctggg 2640
ccgacggtcc ggctttcgtt acacagtgcc ctattatcgc agacaactcc ttcctctacg 2700
atttcgacgt tcccgaccag gcgggcacct tctggtacca ctcacacttg tctacacagt 2760
actgcgacgg tctgcgcggt gccttcgttg tttacgaccc caacgaccct cacaaggacc 2820
tttatgatgt cgatgacggt ggcacagtta tcacattggc tgactggtat cacgtcctcg 2880
ctcagaccgt tgtcggagct gctacacccg actctacgct gattaacggc ttgggacgca 2940
gccagactgg ccccgccgac gctgagctgg ccgttatctc tgttgaacac aacaagagat 3000
accgtttcag actcgtctcc atctcgtgcg atcccaactt cacttttagc gtcgacggtc 3060
acaacatgac ggttatcgag gttgatggcg tgaatacccg ccctctcacc gtcgattcca 3120
ttcaaatttt cgccggccag cgatactcct ttgtgctgaa tgccaatcag cccgaggata 3180
actactggat ccgcgctatg cctaacatcg gacgaaacac cactaccctt gatggcaaga 3240
atgccgctat cctgcgatac aagaacgcca gcgttgagga gcccaaaacc gtcggaggac 3300
ccgcgcagag cccattgaac gaggccgacc tgcgacctct ggtgcccgct cctgtccctg 3360
gcaacgcagt tcctggtggt gcggacatca accaccgcct gaacctgaca ttcagcaacg 3420
gcctcttctc tatcaataac gcatcattta caaaccccag cgtccctgcc ttgttgcaga 3480
ttctttccgg cgcacaaaac gctcaggatc tgcttcccac cggttcttat atcggcttgg 3540
agttgggcaa ggtcgttgaa ctcgtgatcc ctcccttggc cgttggtggc ccccatccat 3600
tccacttgca cggccacaac ttttgggtcg tccgaagcgc tggttctgac gagtataatt 3660
tcgacgatgc aattttgcgc gacgtggtca gcattggcgc gggaactgac gaggttacta 3720
tccgttttgt cactgataac ccaggccctt ggttcctcca ttgccacatc gactggcacc 3780
tcgaagccgg cctcgccatt gttttcgccg aaggcatcaa tcaaaccgca gccgccaacc 3840
cgactccaca ggcctgggac gaactctgcc ccaagtataa cggactctcc gcttcccaga 3900
aagtgaagcc caagaaggga acagccatct aaaagggtgg gcgcgccgac ccagctttct 3960
tgtacaaagt ggtgatcgcg ccagctccgt gcgaaagcct gacgcaccgg tagattcttg 4020
gtgagcccgt atcatgacgg cggcgggagc tacatggccc cgggtgattt attttttttg 4080
tatctacttc tgaccctttt caaatatacg gtcaactcat ctttcactgg agatgcggcc 4140
tgcttggtat tgcgatgttg tcagcttggc aaattgtggc tttcgaaaac acaaaacgat 4200
tccttagtag ccatgcattt taagataacg gaatagaaga aagaggaaat taaaaaaaaa 4260
aaaaaaacaa acatcccgtt cataacccgt agaatcgccg ctcttcgtgt atcccagtac 4320
cagtttattt tgaatagctc gcccgctgga gagcatcctg aatgcaagta acaaccgtag 4380
aggctgacac ggcaggtgtt gctagggagc gtcgtgttct acaaggccag acgtcttcgc 4440
ggttgatata tatgtatgtt tgactgcagg ctgctcagcg acgacagtca agttcgccct 4500
cgctgcttgt gcaataatcg cagtggggaa gccacaccgt gactcccatc tttcagtaaa 4560
gctctgttgg tgtttatcag caatacacgt aatttaaact cgttagcatg gggctgatag 4620
cttaattacc gtttaccagt gccatggttc tgcagctttc cttggcccgt aaaattcggc 4680
gaagccagcc aatcaccagc taggcaccag ctaaacccta taattagtct cttatcaaca 4740
ccatccgctc ccccgggatc aatgaggaga atgaggggga tgcggggcta aagaagccta 4800
cataaccctc atgccaactc ccagtttaca ctcgtcgagc caacatcctg actataagct 4860
aacacagaat gcctcaatcc tgggaagaac tggccgctga taagcgcgcc cgcctcgcaa 4920
aaaccatccc tgatgaatgg aaagtccaga cgctgcctgc ggaagacagc gttattgatt 4980
tcccaaagaa atcggggatc ctttcagagg ccgaactgaa gatcacagag gcctccgctg 5040
cagatcttgt gtccaagctg gcggccggag agttgacctc ggtggaagtt acgctagcat 5100
tctgtaaacg ggcagcaatc gcccagcagt tagtagggtc ccctctacct ctcagggaga 5160
tgtaacaacg ccaccttatg ggactatcaa gctgacgctg gcttctgtgc agacaaactg 5220
cgcccacgag ttcttccctg acgccgctct cgcgcaggca agggaactcg atgaatacta 5280
cgcaaagcac aagagacccg ttggtccact ccatggcctc cccatctctc tcaaagacca 5340
gcttcgagtc aaggtacacc gttgccccta agtcgttaga tgtccctttt tgtcagctaa 5400
catatgccac cagggctacg aaacatcaat gggctacatc tcatggctaa acaagtacga 5460
cgaaggggac tcggttctga caaccatgct ccgcaaagcc ggtgccgtct tctacgtcaa 5520
gacctctgtc ccgcagaccc tgatggtctg cgagacagtc aacaacatca tcgggcgcac 5580
cgtcaaccca cgcaacaaga actggtcgtg cggcggcagt tctggtggtg agggtgcgat 5640
cgttgggatt cgtggtggcg tcatcggtgt aggaacggat atcggtggct cgattcgagt 5700
gccggccgcg ttcaacttcc tgtacggtct aaggccgagt catgggcggc tgccgtatgc 5760
aaagatggcg aacagcatgg agggtcagga gacggtgcac agcgttgtcg ggccgattac 5820
gcactctgtt gagggtgagt ccttcgcctc ttccttcttt tcctgctcta taccaggcct 5880
ccactgtcct cctttcttgc tttttatact atatacgaga ccggcagtca ctgatgaagt 5940
atgttagacc tccgcctctt caccaaatcc gtcctcggtc aggagccatg gaaatacgac 6000
tccaaggtca tccccatgcc ctggcgccag tccgagtcgg acattattgc ctccaagatc 6060
aagaacggcg ggctcaatat cggctactac aacttcgacg gcaatgtcct tccacaccct 6120
cctatcctgc gcggcgtgga aaccaccgtc gccgcactcg ccaaagccgg tcacaccgtg 6180
accccgtgga cgccatacaa gcacgatttc ggccacgatc tcatctccca tatctacgcg 6240
gctgacggca gcgccgacgt aatgcgcgat atcagtgcat ccggcgagcc ggcgattcca 6300
aatatcaaag acctactgaa cccgaacatc aaagctgtta acatgaacga gctctgggac 6360
acgcatctcc agaagtggaa ttaccagatg gagtaccttg agaaatggcg ggaggctgaa 6420
gaaaaggccg ggaaggaact ggacgccatc atcgcgccga ttacgcctac cgctgcggta 6480
cggcatgacc agttccggta ctatgggtat gcctctgtga tcaacctgct ggatttcacg 6540
agcgtggttg ttccggttac ctttgcggat aagaacatcg ataagaagaa tgagagtttc 6600
aaggcggtta gtgagcttga tgccctcgtg caggaagagt atgatccgga ggcgtaccat 6660
ggggcaccgg ttgcagtgca ggttatcgga cggagactca gtgaagagag gacgttggcg 6720
attgcagagg aagtggggaa gttgctggga aatgtggtga ctccatagct aataagtgtc 6780
agatagcaat ttgcacaaga aatcaatacc agcaactgta aataagcgct gaagtgacca 6840
tgccatgcta cgaaagagca gaaaaaaacc tgccgtagaa ccgaagagat atgacacgct 6900
tccatctctc aaaggaagaa tcccttcagg gttgcgtttc cagtctagac acgtataacg 6960
gcacaagtgt ctctcaccaa atgggttata tctcaaatgt gatctaagga tggaaagccc 7020
agaatatcga tcgcgcgcag atccatatat agggcccggg ttataattac ctcaggtcga 7080
cgtcccatgg ccattcgaat tcgtaatcat ggtcatagct gtttcctgtg tgaaattgtt 7140
atccgctcac aattccacac aacatacgag ccggaagcat aaagtgtaaa gcctggggtg 7200
cctaatgagt gagctaactc acattaattg cgttgcgctc actgcccgct ttccagtcgg 7260
gaaacctgtc gtgccagctg cattaatgaa tcggccaacg cgcggggaga ggcggtttgc 7320
gtattgggcg ctcttccgct tcctcgctca ctgactcgct gcgctcggtc gttcggctgc 7380
ggcgagcggt atcagctcac tcaaaggcgg taatacggtt atccacagaa tcaggggata 7440
acgcaggaaa gaacatgtga gcaaaaggcc agcaaaaggc caggaaccgt aaaaaggccg 7500
cgttgctggc gtttttccat aggctccgcc cccctgacga gcatcacaaa aatcgacgct 7560
caagtcagag gtggcgaaac ccgacaggac tataaagata ccaggcgttt ccccctggaa 7620
gctccctcgt gcgctctcct gttccgaccc tgccgcttac cggatacctg tccgcctttc 7680
tcccttcggg aagcgtggcg ctttctcata gctcacgctg taggtatctc agttcggtgt 7740
aggtcgttcg ctccaagctg ggctgtgtgc acgaaccccc cgttcagccc gaccgctgcg 7800
ccttatccgg taactatcgt cttgagtcca acccggtaag acacgactta tcgccactgg 7860
cagcagccac tggtaacagg attagcagag cgaggtatgt aggcggtgct acagagttct 7920
tgaagtggtg gcctaactac ggctacacta gaagaacagt atttggtatc tgcgctctgc 7980
tgaagccagt taccttcgga aaaagagttg gtagctcttg atccggcaaa caaaccaccg 8040
ctggtagcgg tggttttttt gtttgcaagc agcagattac gcgcagaaaa aaaggatctc 8100
aagaagatcc tttgatcttt tctacggggt ctgacgctca gtggaacgaa aactcacgtt 8160
aagggatttt ggtcatgaga ttatcaaaaa ggatcttcac ctagatcctt ttaaattaaa 8220
aatgaagttt taaatcaatc taaagtatat atgagtaaac ttggtctgac agttaccaat 8280
gcttaatcag tgaggcacct atctcagcga tctgtctatt tcgttcatcc atagttgcct 8340
gactccccgt cgtgtagata actacgatac gggagggctt accatctggc cccagtgctg 8400
caatgatacc gcgagaccca cgctcaccgg ctccagattt atcagcaata aaccagccag 8460
ccggaagggc cgagcgcaga agtggtcctg caactttatc cgcctccatc cagtctatta 8520
attgttgccg ggaagctaga gtaagtagtt cgccagttaa tagtttgcgc aacgttgttg 8580
ccattgctac aggcatcgtg gtgtcacgct cgtcgtttgg tatggcttca ttcagctccg 8640
gttcccaacg atcaaggcga gttacatgat cccccatgtt gtgcaaaaaa gcggttagct 8700
ccttcggtcc tccgatcgtt gtcagaagta agttggccgc agtgttatca ctcatggtta 8760
tggcagcact gcataattct cttactgtca tgccatccgt aagatgcttt tctgtgactg 8820
gtgagtactc aaccaagtca ttctgagaat agtgtatgcg gcgaccgagt tgctcttgcc 8880
cggcgtcaat acgggataat accgcgccac atagcagaac tttaaaagtg ctcatcattg 8940
gaaaacgttc ttcggggcga aaactctcaa ggatcttacc gctgttgaga tccagttcga 9000
tgtaacccac tcgtgcaccc aactgatctt cagcatcttt tactttcacc agcgtttctg 9060
ggtgagcaaa aacaggaagg caaaatgccg caaaaaaggg aataagggcg acacggaaat 9120
gttgaatact catactcttc ctttttcaat attattgaag catttatcag ggttattgtc 9180
tcatgagcgg atacatattt gaatgtattt agaaaaataa acaaataggg gttccgcgca 9240
catttccccg aaaagtgcca cctgacgtct aagaaaccat tattatcatg acattaacct 9300
ataaaaatag gcgtatcacg aggccctttc gtctcgcgcg tttcggtgat gacggtgaaa 9360
acctctgaca catgcagctc ccggagacgg tcacagcttg tctgtaagcg gatgccggga 9420
gcagacaagc ccgtcagggc gcgtcagcgg gtgttggcgg gtgtcggggc tggcttaact 9480
atgcggcatc agagcagatt gtactgagag tgcaccataa aattgtaaac gttaatattt 9540
tgttaaaatt cgcgttaaat ttttgttaaa tcagctcatt ttttaaccaa taggccgaaa 9600
tcggcaaaat cccttataaa tcaaaagaat agcccgagat agggttgagt gttgttccag 9660
tttggaacaa gagtccacta ttaaagaacg tggactccaa cgtcaaaggg cgaaaaaccg 9720
tctatcaggg cgatggccca ctacgtgaac catcacccaa atcaagtttt ttggggtcga 9780
ggtgccgtaa agcactaaat cggaacccta aagggagccc ccgatttaga gcttgacggg 9840
gaaagccggc gaacgtggcg agaaaggaag ggaagaaagc gaaaggagcg ggcgctaggg 9900
cgctggcaag tgtagcggtc acgctgcgcg taaccaccac acccgccgcg cttaatgcgc 9960
cgctacaggg cgcgtactat ggttgctttg acgtatgcgg tgtgaaatac cgcacagatg 10020
cgtaaggaga aaataccgca tcaggcgcca ttcgccattc aggctgcgca actgttggga 10080
agggcgatcg gtgcgggcct cttcgctatt acgccagctg gcgaaagggg gatgtgctgc 10140
aaggcgatta agttgggtaa cgccagggtt ttcccagtca cgacgttgta aaacgacggc 10200
cagtgccc 10208
<210>7
<211>10199
<212>DNA
<213>Trichoderma
<400>7
aagcttacta gtacttctcg agctctgtac atgtccggtc gcgacgtacg cgtatcgatg 60
gcgccagctg caggcggccg cctgcagcca cttgcagtcc cgtggaattc tcacggtgaa 120
tgtaggcctt ttgtagggta ggaattgtca ctcaagcacc cccaacctcc attacgcctc 180
ccccatagag ttcccaatca gtgagtcatg gcactgttct caaatagatt ggggagaagt 240
tgacttccgc ccagagctga aggtcgcaca accgcatgat atagggtcgg caacggcaaa 300
aaagcacgtg gctcaccgaa aagcaagatg tttgcgatct aacatccagg aacctggata 360
catccatcat cacgcacgac cactttgatc tgctggtaaa ctcgtattcg ccctaaaccg 420
aagtgcgtgg taaatctaca cgtgggcccc tttcggtata ctgcgtgtgt cttctctagg 480
tgccattctt ttcccttcct ctagtgttga attgtttgtg ttggagtccg agctgtaact 540
acctctgaat ctctggagaa tggtggacta acgactaccg tgcacctgca tcatgtatat 600
aatagtgatc ctgagaaggg gggtttggag caatgtggga ctttgatggt catcaaacaa 660
agaacgaaga cgcctctttt gcaaagtttt gtttcggcta cggtgaagaa ctggatactt 720
gttgtgtctt ctgtgtattt ttgtggcaac aagaggccag agacaatcta ttcaaacacc 780
aagcttgctc ttttgagcta caagaacctg tggggtatat atctagagtt gtgaagtcgg 840
taatcccgct gtatagtaat acgagtcgca tctaaatact ccgaagctgc tgcgaacccg 900
gagaatcgag atgtgctgga aagcttctag cgagcggcta aattagcatg aaaggctatg 960
agaaattctg gagacggctt gttgaatcat ggcgttccat tcttcgacaa gcaaagcgtt 1020
ccgtcgcagt agcaggcact cattcccgaa aaaactcgga gattcctaag tagcgatgga 1080
accggaataa tataataggc aatacattga gttgcctcga cggttgcaat gcaggggtac 1140
tgagcttgga cataactgtt ccgtacccca cctcttctca acctttggcg tttccctgat 1200
tcagcgtacc cgtacaagtc gtaatcacta ttaacccaga ctgaccggac gtgttttgcc 1260
cttcatttgg agaaataatg tcattgcgat gtgtaatttg cctgcttgac cgactggggc 1320
tgttcgaagc ccgaatgtag gattgttatc cgaactctgc tcgtagaggc atgttgtgaa 1380
tctgtgtcgg gcaggacacg cctcgaaggt tcacggcaag ggaaaccacc gatagcagtg 1440
tctagtagca acctgtaaag ccgcaatgca gcatcactgg aaaatacaaa ccaatggcta 1500
aaagtacata agttaatgcc taaagaagtc atataccagc ggctaataat tgtacaatca 1560
agtggctaaa cgtaccgtaa tttgccaacg gcttgtgggg ttgcagaagc aacggcaaag 1620
ccccacttcc ccacgtttgt ttcttcactc agtccaatct cagctggtga tcccccaatt 1680
gggtcgcttg tttgttccgg tgaagtgaaa gaagacagag gtaagaatgt ctgactcgga 1740
gcgttttgca tacaaccaag ggcagtgatg gaagacagtg aaatgttgac attcaaggag 1800
tatttagcca gggatgcttg agtgtatcgt gtaaggaggt ttgtctgccg atacgacgaa 1860
tactgtatag tcacttctga tgaagtggtc catattgaaa tgtaaagtcg gcactgaaca 1920
ggcaaaagat tgagttgaaa ctgcctaaga tctcgggccc tcgggccttc ggcctttggg 1980
tgtacatgtt tgtgctccgg gcaaatgcaa agtgtggtag gatcgaacac actgctgcct 2040
ttaccaagca gctgagggta tgtgataggc aaatgttcag gggccactgc atggtttcga 2100
atagaaagag aagcttagcc aagaacaata gccgataaag atagcctcat taaacggaat 2160
gagctagtag gcaaagtcag cgaatgtgta tatataaagg ttcgaggtcc gtgcctccct 2220
catgctctcc ccatctactc atcaactcag atcctccagg agacttgtac accatctttt 2280
gaggcacaga aacccaatag tcaaccatca caagtttgta caaaaaagca ggctccgcgg 2340
ccgccccctt caccatgtat cggaagttgg ccgtcatctc ggccttcttg gccacagctc 2400
gtgctgctat tggaccagtt gctgatctgc acatcgttaa caaggatttg gccccagacg 2460
gcgtccagcg cccaactgtt ctggccggtg gaacttttcc gggcacgctg attaccggtc 2520
aaaagggcga caacttccag ctgaacgtga ttgatgacct gaccgacgat cgcatgttga 2580
cccctacttc gatccattgg catggtttct tccagaaggg aaccgcctgg gccgacggtc 2640
cggctttcgt tacacagtgc cctattatcg cagacaactc cttcctctac gatttcgacg 2700
ttcccgacca ggcgggcacc ttctggtacc actcacactt gtctacacag tactgcgacg 2760
gtctgcgcgg tgccttcgtt gtttacgacc ccaacgaccc tcacaaggac ctttatgatg 2820
tcgatgacgg tggcacagtt atcacattgg ctgactggta tcacgtcctc gctcagaccg 2880
ttgtcggagc tgctacaccc gactctacgc tgattaacgg cttgggacgc agccagactg 2940
gccccgccga cgctgagctg gccgttatct ctgttgaaca caacaagaga taccgtttca 3000
gactcgtctc catctcgtgc gatcccaact tcacttttag cgtcgacggt cacaacatga 3060
cggttatcga ggttgatggc gtgaataccc gccctctcac cgtcgattcc attcaaattt 3120
tcgccggcca gcgatactcc tttgtgctga atgccaatca gcccgaggat aactactgga 3180
tccgcgctat gcctaacatc ggacgaaaca ccactaccct tgatggcaag aatgccgcta 3240
tcctgcgata caagaacgcc agcgttgagg agcccaaaac cgtcggagga cccgcgcaga 3300
gcccattgaa cgaggccgac ctgcgacctc tggtgcccgc tcctgtccct ggcaacgcag 3360
ttcctggtgg tgcggacatc aaccaccgcc tgaacctgac attcagcaac ggcctcttct 3420
ctatcaataa cgcatcattt acaaacccca gcgtccctgc cttgttgcag attctttccg 3480
gcgcacaaaa cgctcaggat ctgcttccca ccggttctta tatcggcttg gagttgggca 3540
aggtcgttga actcgtgatc cctcccttgg ccgttggtgg cccccatcca ttccacttgc 3600
acggccacaa cttttgggtc gtccgaagcg ctggttctga cgagtataat ttcgacgatg 3660
caattttgcg cgacgtggtc agcattggcg cgggaactga cgaggttact atccgttttg 3720
tcactgataa cccaggccct tggttcctcc attgccacat cgactggcac ctcgaagccg 3780
gcctcgccat tgttttcgcc gaaggcatca atcaaaccgc agccgccaac ccgactccac 3840
aggcctggga cgaactctgc cccaagtata acggactctc cgcttcccag aaagtgaagc 3900
ccaagaaggg aacagccatc taaaagggtg ggcgcgccga cccagctttc ttgtacaaag 3960
tggtgatcgc gccagctccg tgcgaaagcc tgacgcaccg gtagattctt ggtgagcccg 4020
tatcatgacg gcggcgggag ctacatggcc ccgggtgatt tatttttttt gtatctactt 4080
ctgacccttt tcaaatatac ggtcaactca tctttcactg gagatgcggc ctgcttggta 4140
ttgcgatgtt gtcagcttgg caaattgtgg ctttcgaaaa cacaaaacga ttccttagta 4200
gccatgcatt ttaagataac ggaatagaag aaagaggaaa ttaaaaaaaa aaaaaaaaca 4260
aacatcccgt tcataacccg tagaatcgcc gctcttcgtg tatcccagta ccagtttatt 4320
ttgaatagct cgcccgctgg agagcatcct gaatgcaagt aacaaccgta gaggctgaca 4380
cggcaggtgt tgctagggag cgtcgtgttc tacaaggcca gacgtcttcg cggttgatat 4440
atatgtatgt ttgactgcag gctgctcagc gacgacagtc aagttcgccc tcgctgcttg 4500
tgcaataatc gcagtgggga agccacaccg tgactcccat ctttcagtaa agctctgttg 4560
gtgtttatca gcaatacacg taatttaaac tcgttagcat ggggctgata gcttaattac 4620
cgtttaccag tgccatggtt ctgcagcttt ccttggcccg taaaattcgg cgaagccagc 4680
caatcaccag ctaggcacca gctaaaccct ataattagtc tcttatcaac accatccgct 4740
cccccgggat caatgaggag aatgaggggg atgcggggct aaagaagcct acataaccct 4800
catgccaact cccagtttac actcgtcgag ccaacatcct gactataagc taacacagaa 4860
tgcctcaatc ctgggaagaa ctggccgctg ataagcgcgc ccgcctcgca aaaaccatcc 4920
ctgatgaatg gaaagtccag acgctgcctg cggaagacag cgttattgat ttcccaaaga 4980
aatcggggat cctttcagag gccgaactga agatcacaga ggcctccgct gcagatcttg 5040
tgtccaagct ggcggccgga gagttgacct cggtggaagt tacgctagca ttctgtaaac 5100
gggcagcaat cgcccagcag ttagtagggt cccctctacc tctcagggag atgtaacaac 5160
gccaccttat gggactatca agctgacgct ggcttctgtg cagacaaact gcgcccacga 5220
gttcttccct gacgccgctc tcgcgcaggc aagggaactc gatgaatact acgcaaagca 5280
caagagaccc gttggtccac tccatggcct ccccatctct ctcaaagacc agcttcgagt 5340
caaggtacac cgttgcccct aagtcgttag atgtcccttt ttgtcagcta acatatgcca 5400
ccagggctac gaaacatcaa tgggctacat ctcatggcta aacaagtacg acgaagggga 5460
ctcggttctg acaaccatgc tccgcaaagc cggtgccgtc ttctacgtca agacctctgt 5520
cccgcagacc ctgatggtct gcgagacagt caacaacatc atcgggcgca ccgtcaaccc 5580
acgcaacaag aactggtcgt gcggcggcag ttctggtggt gagggtgcga tcgttgggat 5640
tcgtggtggc gtcatcggtg taggaacgga tatcggtggc tcgattcgag tgccggccgc 5700
gttcaacttc ctgtacggtc taaggccgag tcatgggcgg ctgccgtatg caaagatggc 5760
gaacagcatg gagggtcagg agacggtgca cagcgttgtc gggccgatta cgcactctgt 5820
tgagggtgag tccttcgcct cttccttctt ttcctgctct ataccaggcc tccactgtcc 5880
tcctttcttg ctttttatac tatatacgag accggcagtc actgatgaag tatgttagac 5940
ctccgcctct tcaccaaatc cgtcctcggt caggagccat ggaaatacga ctccaaggtc 6000
atccccatgc cctggcgcca gtccgagtcg gacattattg cctccaagat caagaacggc 6060
gggctcaata tcggctacta caacttcgac ggcaatgtcc ttccacaccc tcctatcctg 6120
cgcggcgtgg aaaccaccgt cgccgcactc gccaaagccg gtcacaccgt gaccccgtgg 6180
acgccataca agcacgattt cggccacgat ctcatctccc atatctacgc ggctgacggc 6240
agcgccgacg taatgcgcga tatcagtgca tccggcgagc cggcgattcc aaatatcaaa 6300
gacctactga acccgaacat caaagctgtt aacatgaacg agctctggga cacgcatctc 6360
cagaagtgga attaccagat ggagtacctt gagaaatggc gggaggctga agaaaaggcc 6420
gggaaggaac tggacgccat catcgcgccg attacgccta ccgctgcggt acggcatgac 6480
cagttccggt actatgggta tgcctctgtg atcaacctgc tggatttcac gagcgtggtt 6540
gttccggtta cctttgcgga taagaacatc gataagaaga atgagagttt caaggcggtt 6600
agtgagcttg atgccctcgt gcaggaagag tatgatccgg aggcgtacca tggggcaccg 6660
gttgcagtgc aggttatcgg acggagactc agtgaagaga ggacgttggc gattgcagag 6720
gaagtgggga agttgctggg aaatgtggtg actccatagc taataagtgt cagatagcaa 6780
tttgcacaag aaatcaatac cagcaactgt aaataagcgc tgaagtgacc atgccatgct 6840
acgaaagagc agaaaaaaac ctgccgtaga accgaagaga tatgacacgc ttccatctct 6900
caaaggaaga atcccttcag ggttgcgttt ccagtctaga cacgtataac ggcacaagtg 6960
tctctcacca aatgggttat atctcaaatg tgatctaagg atggaaagcc cagaatatcg 7020
atcgcgcgca gatccatata tagggcccgg gttataatta cctcaggtcg acgtcccatg 7080
gccattcgaa ttcgtaatca tggtcatagc tgtttcctgt gtgaaattgt tatccgctca 7140
caattccaca caacatacga gccggaagca taaagtgtaa agcctggggt gcctaatgag 7200
tgagctaact cacattaatt gcgttgcgct cactgcccgc tttccagtcg ggaaacctgt 7260
cgtgccagct gcattaatga atcggccaac gcgcggggag aggcggtttg cgtattgggc 7320
gctcttccgc ttcctcgctc actgactcgc tgcgctcggt cgttcggctg cggcgagcgg 7380
tatcagctca ctcaaaggcg gtaatacggt tatccacaga atcaggggat aacgcaggaa 7440
agaacatgtg agcaaaaggc cagcaaaagg ccaggaaccg taaaaaggcc gcgttgctgg 7500
cgtttttcca taggctccgc ccccctgacg agcatcacaa aaatcgacgc tcaagtcaga 7560
ggtggcgaaa cccgacagga ctataaagat accaggcgtt tccccctgga agctccctcg 7620
tgcgctctcc tgttccgacc ctgccgctta ccggatacct gtccgccttt ctcccttcgg 7680
gaagcgtggc gctttctcat agctcacgct gtaggtatct cagttcggtg taggtcgttc 7740
gctccaagct gggctgtgtg cacgaacccc ccgttcagcc cgaccgctgc gccttatccg 7800
gtaactatcg tcttgagtcc aacccggtaa gacacgactt atcgccactg gcagcagcca 7860
ctggtaacag gattagcaga gcgaggtatg taggcggtgc tacagagttc ttgaagtggt 7920
ggcctaacta cggctacact agaagaacag tatttggtat ctgcgctctg ctgaagccag 7980
ttaccttcgg aaaaagagtt ggtagctctt gatccggcaa acaaaccacc gctggtagcg 8040
gtggtttttt tgtttgcaag cagcagatta cgcgcagaaa aaaaggatct caagaagatc 8100
ctttgatctt ttctacgggg tctgacgctc agtggaacga aaactcacgt taagggattt 8160
tggtcatgag attatcaaaa aggatcttca cctagatcct tttaaattaa aaatgaagtt 8220
ttaaatcaat ctaaagtata tatgagtaaa cttggtctga cagttaccaa tgcttaatca 8280
gtgaggcacc tatctcagcg atctgtctat ttcgttcatc catagttgcc tgactccccg 8340
tcgtgtagat aactacgata cgggagggct taccatctgg ccccagtgct gcaatgatac 8400
cgcgagaccc acgctcaccg gctccagatt tatcagcaat aaaccagcca gccggaaggg 8460
ccgagcgcag aagtggtcct gcaactttat ccgcctccat ccagtctatt aattgttgcc 8520
gggaagctag agtaagtagt tcgccagtta atagtttgcg caacgttgtt gccattgcta 8580
caggcatcgt ggtgtcacgc tcgtcgtttg gtatggcttc attcagctcc ggttcccaac 8640
gatcaaggcg agttacatga tcccccatgt tgtgcaaaaa agcggttagc tccttcggtc 8700
ctccgatcgt tgtcagaagt aagttggccg cagtgttatc actcatggtt atggcagcac 8760
tgcataattc tcttactgtc atgccatccg taagatgctt ttctgtgact ggtgagtact 8820
caaccaagtc attctgagaa tagtgtatgc ggcgaccgag ttgctcttgc ccggcgtcaa 8880
tacgggataa taccgcgcca catagcagaa ctttaaaagt gctcatcatt ggaaaacgtt 8940
cttcggggcg aaaactctca aggatcttac cgctgttgag atccagttcg atgtaaccca 9000
ctcgtgcacc caactgatct tcagcatctt ttactttcac cagcgtttct gggtgagcaa 9060
aaacaggaag gcaaaatgcc gcaaaaaagg gaataagggc gacacggaaa tgttgaatac 9120
tcatactctt cctttttcaa tattattgaa gcatttatca gggttattgt ctcatgagcg 9180
gatacatatt tgaatgtatt tagaaaaata aacaaatagg ggttccgcgc acatttcccc 9240
gaaaagtgcc acctgacgtc taagaaacca ttattatcat gacattaacc tataaaaata 9300
ggcgtatcac gaggcccttt cgtctcgcgc gtttcggtga tgacggtgaa aacctctgac 9360
acatgcagct cccggagacg gtcacagctt gtctgtaagc ggatgccggg agcagacaag 9420
cccgtcaggg cgcgtcagcg ggtgttggcg ggtgtcgggg ctggcttaac tatgcggcat 9480
cagagcagat tgtactgaga gtgcaccata aaattgtaaa cgttaatatt ttgttaaaat 9540
tcgcgttaaa tttttgttaa atcagctcat tttttaacca ataggccgaa atcggcaaaa 9600
tcccttataa atcaaaagaa tagcccgaga tagggttgag tgttgttcca gtttggaaca 9660
agagtccact attaaagaac gtggactcca acgtcaaagg gcgaaaaacc gtctatcagg 9720
gcgatggccc actacgtgaa ccatcaccca aatcaagttt tttggggtcg aggtgccgta 9780
aagcactaaa tcggaaccct aaagggagcc cccgatttag agcttgacgg ggaaagccgg 9840
cgaacgtggc gagaaaggaa gggaagaaag cgaaaggagc gggcgctagg gcgctggcaa 9900
gtgtagcggt cacgctgcgc gtaaccacca cacccgccgc gcttaatgcg ccgctacagg 9960
gcgcgtacta tggttgcttt gacgtatgcg gtgtgaaata ccgcacagat gcgtaaggag 10020
aaaataccgc atcaggcgcc attcgccatt caggctgcgc aactgttggg aagggcgatc 10080
ggtgcgggcc tcttcgctat tacgccagct ggcgaaaggg ggatgtgctg caaggcgatt 10140
aagttgggta acgccagggt tttcccagtc acgacgttgt aaaacgacgg ccagtgccc 10199
<210>8
<211>9931
<212>DNA
<213>Trichoderma reesei
<400>8
ctgcagccac ttgcagtccc gtggaattct cacggtgaat gtaggccttt tgtagggtag 60
gaattgtcac tcaagcaccc ccaacctcca ttacgcctcc cccatagagt tcccaatcag 120
tgagtcatgg cactgttctc aaatagattg gggagaagtt gacttccgcc cagagctgaa 180
ggtcgcacaa ccgcatgata tagggtcggc aacggcaaaa aagcacgtgg ctcaccgaaa 240
agcaagatgt ttgcgatcta acatccagga acctggatac atccatcatc acgcacgacc 300
actttgatct gctggtaaac tcgtattcgc cctaaaccga agtgcgtggt aaatctacac 360
gtgggcccct ttcggtatac tgcgtgtgtc ttctctaggt gccattcttt tcccttcctc 420
tagtgttgaa ttgtttgtgt tggagtccga gctgtaacta cctctgaatc tctggagaat 480
ggtggactaa cgactaccgt gcacctgcat catgtatata atagtgatcc tgagaagggg 540
ggtttggagc aatgtgggac tttgatggtc atcaaacaaa gaacgaagac gcctcttttg 600
caaagttttg tttcggctac ggtgaagaac tggatacttg ttgtgtcttc tgtgtatttt 660
tgtggcaaca agaggccaga gacaatctat tcaaacacca agcttgctct tttgagctac 720
aagaacctgt ggggtatata tctagagttg tgaagtcggt aatcccgctg tatagtaata 780
cgagtcgcat ctaaatactc cgaagctgct gcgaacccgg agaatcgaga tgtgctggaa 840
agcttctagc gagcggctaa attagcatga aaggctatga gaaattctgg agacggcttg 900
ttgaatcatg gcgttccatt cttcgacaag caaagcgttc cgtcgcagta gcaggcactc 960
attcccgaaa aaactcggag attcctaagt agcgatggaa ccggaataat ataataggca 1020
atacattgag ttgcctcgac ggttgcaatg caggggtact gagcttggac ataactgttc 1080
cgtaccccac ctcttctcaa cctttggcgt ttccctgatt cagcgtaccc gtacaagtcg 1140
taatcactat taacccagac tgaccggacg tgttttgccc ttcatttgga gaaataatgt 1200
cattgcgatg tgtaatttgc ctgcttgacc gactggggct gttcgaagcc cgaatgtagg 1260
attgttatcc gaactctgct cgtagaggca tgttgtgaat ctgtgtcggg caggacacgc 1320
ctcgaaggtt cacggcaagg gaaaccaccg atagcagtgt ctagtagcaa cctgtaaagc 1380
cgcaatgcag catcactgga aaatacaaac caatggctaa aagtacataa gttaatgcct 1440
aaagaagtca tataccagcg gctaataatt gtacaatcaa gtggctaaac gtaccgtaat 1500
ttgccaacgg cttgtggggt tgcagaagca acggcaaagc cccacttccc cacgtttgtt 1560
tcttcactca gtccaatctc agctggtgat cccccaattg ggtcgcttgt ttgttccggt 1620
gaagtgaaag aagacagagg taagaatgtc tgactcggag cgttttgcat acaaccaagg 1680
gcagtgatgg aagacagtga aatgttgaca ttcaaggagt atttagccag ggatgcttga 1740
gtgtatcgtg taaggaggtt tgtctgccga tacgacgaat actgtatagt cacttctgat 1800
gaagtggtcc atattgaaat gtaagtcggc actgaacagg caaaagattg agttgaaact 1860
gcctaagatc tcgggccctc gggccttcgg cctttgggtg tacatgtttg tgctccgggc 1920
aaatgcaaag tgtggtagga tcgaacacac tgctgccttt accaagcagc tgagggtatg 1980
tgataggcaa atgttcaggg gccactgcat ggtttcgaat agaaagagaa gcttagccaa 2040
gaacaatagc cgataaagat agcctcatta aacggaatga gctagtaggc aaagtcagcg 2100
aatgtgtata tataaaggtt cgaggtccgt gcctccctca tgctctcccc atctactcat 2160
caactcagat cctccaggag acttgtacac catcttttga ggcacagaaa cccaatagtc 2220
aaccatcaca agtttgtaca aaaaagcagg ctccgcggcc gcccccttca ccatgcagac 2280
ctttggagct tttctcgttt ccttcctcgc cgccagcggc ctggccgcgg ccctccccac 2340
cgagggtcag aagacggctt ccgtcgaggt ccagtacaac aagaactacg tcccccacgg 2400
ccctactgct ctcttcaagg ccaagagaaa gtatggcgct cccatcagcg acaacctgaa 2460
gtctctcgtg gctgccaggc aggccaagca ggctctcgcc aagcgccaga ccggctcggc 2520
gcccaaccac cccagtgaca gcgccgattc ggagtacatc acctccgtct ccatcggcac 2580
tccggctcag gtcctccccc tggactttga caccggctcc tccgacctgt gggtctttag 2640
ctccgagacg cccaagtctt cggccaccgg ccacgccatc tacacgccct ccaagtcgtc 2700
cacctccaag aaggtgtctg gcgccagctg gtccatcagc tacggcgacg gcagcagctc 2760
cagcggcgat gtctacaccg acaaggtcac catcggaggc ttcagcgtca acacccaggg 2820
cgtcgagtct gccacccgcg tgtccaccga gttcgtccag gacacggtca tctctggcct 2880
cgtcggcctt gcctttgaca gcggcaacca ggtcaggccg cacccgcaga agacgtggtt 2940
ctccaacgcc gccagcagcc tggctgagcc ccttttcact gccgacctga ggcacggaca 3000
gagtaagtag acactcactg gaattcgttc ctttcccgat catcatgaaa gcaagtagac 3060
tgactgaacc aaacaactag acggcagcta caactttggc tacatcgaca ccagcgtcgc 3120
caagggcccc gttgcctaca cccccgttga caacagccag ggcttctggg agttcactgc 3180
ctcgggctac tctgtcggcg gcggcaagct caaccgcaac tccatcgacg gcattgccga 3240
caccggcacc accctgctcc tcctcgacga caacgtcgtc gatgcctact acgccaacgt 3300
ccagtcggcc cagtacgaca accagcagga gggtgtcgtc ttcgactgcg acgaggacct 3360
cccttcgttc agcttcggtg ttggaagctc caccatcacc atccctggcg atctgctgaa 3420
cctgactccc ctcgaggagg gcagctccac ctgcttcggt ggcctccaga gcagctccgg 3480
cattggcatc aacatctttg gtgacgttgc cctcaaggct gccctggttg tctttgacct 3540
cggcaacgag cgcctgggct gggctcagaa ataaaagggt gggcgcgccg acccagcttt 3600
cttgtacaaa gtggtgatcg cgccagctcc gtgcgaaagc ctgacgcacc ggtagattct 3660
tggtgagccc gtatcatgac ggcggcggga gctacatggc cccgggtgat ttattttttt 3720
tgtatctact tctgaccctt ttcaaatata cggtcaactc atctttcact ggagatgcgg 3780
cctgcttggt attgcgatgt tgtcagcttg gcaaattgtg gctttcgaaa acacaaaacg 3840
attccttagt agccatgcat tttaagataa cggaatagaa gaaagaggaa attaaaaaaa 3900
aaaaaaaaac aaacatcccg ttcataaccc gtagaatcgc cgctcttcgt gtatcccagt 3960
accagtttat tttgaatagc tcgcccgctg gagagcatcc tgaatgcaag taacaaccgt 4020
agaggctgac acggcaggtg ttgctaggga gcgtcgtgtt ctacaaggcc agacgtcttc 4080
gcggttgata tatatgtatg tttgactgca ggctgctcag cgacgacagt caagttcgcc 4140
ctcgctgctt gtgcaataat cgcagtgggg aagccacacc gtgactccca tctttcagta 4200
aagctctgtt ggtgtttatc agcaatacac gtaatttaaa ctcgttagca tggggctgat 4260
agcttaatta ccgtttacca gtgccatggt tctgcagctt tccttggccc gtaaaattcg 4320
gcgaagccag ccaatcacca gctaggcacc agctaaaccc tataattagt ctcttatcaa 4380
caccatccgc tcccccggga tcaatgagga gaatgagggg gatgcggggc taaagaagcc 4440
tacataaccc tcatgccaac tcccagttta cactcgtcga gccaacatcc tgactataag 4500
ctaacacaga atgcctcaat cctgggaaga actggccgct gataagcgcg cccgcctcgc 4560
aaaaaccatc cctgatgaat ggaaagtcca gacgctgcct gcggaagaca gcgttattga 4620
tttcccaaag aaatcgggga tcctttcaga ggccgaactg aagatcacag aggcctccgc 4680
tgcagatctt gtgtccaagc tggcggccgg agagttgacc tcggtggaag ttacgctagc 4740
attctgtaaa cgggcagcaa tcgcccagca gttagtaggg tcccctctac ctctcaggga 4800
gatgtaacaa cgccacctta tgggactatc aagctgacgc tggcttctgt gcagacaaac 4860
tgcgcccacg agttcttccc tgacgccgct ctcgcgcagg caagggaact cgatgaatac 4920
tacgcaaagc acaagagacc cgttggtcca ctccatggcc tccccatctc tctcaaagac 4980
cagcttcgag tcaaggtaca ccgttgcccc taagtcgtta gatgtccctt tttgtcagct 5040
aacatatgcc accagggcta cgaaacatca atgggctaca tctcatggct aaacaagtac 5100
gacgaagggg actcggttct gacaaccatg ctccgcaaag ccggtgccgt cttctacgtc 5160
aagacctctg tcccgcagac cctgatggtc tgcgagacag tcaacaacat catcgggcgc 5220
accgtcaacc cacgcaacaa gaactggtcg tgcggcggca gttctggtgg tgagggtgcg 5280
atcgttggga ttcgtggtgg cgtcatcggt gtaggaacgg atatcggtgg ctcgattcga 5340
gtgccggccg cgttcaactt cctgtacggt ctaaggccga gtcatgggcg gctgccgtat 5400
gcaaagatgg cgaacagcat ggagggtcag gagacggtgc acagcgttgt cgggccgatt 5460
acgcactctg ttgagggtga gtccttcgcc tcttccttct tttcctgctc tataccaggc 5520
ctccactgtc ctcctttctt gctttttata ctatatacga gaccggcagt cactgatgaa 5580
gtatgttaga cctccgcctc ttcaccaaat ccgtcctcgg tcaggagcca tggaaatacg 5640
actccaaggt catccccatg ccctggcgcc agtccgagtc ggacattatt gcctccaaga 5700
tcaagaacgg cgggctcaat atcggctact acaacttcga cggcaatgtc cttccacacc 5760
ctcctatcct gcgcggcgtg gaaaccaccg tcgccgcact cgccaaagcc ggtcacaccg 5820
tgaccccgtg gacgccatac aagcacgatt tcggccacga tctcatctcc catatctacg 5880
cggctgacgg cagcgccgac gtaatgcgcg atatcagtgc atccggcgag ccggcgattc 5940
caaatatcaa agacctactg aacccgaaca tcaaagctgt taacatgaac gagctctggg 6000
acacgcatct ccagaagtgg aattaccaga tggagtacct tgagaaatgg cgggaggctg 6060
aagaaaaggc cgggaaggaa ctggacgcca tcatcgcgcc gattacgcct accgctgcgg 6120
tacggcatga ccagttccgg tactatgggt atgcctctgt gatcaacctg ctggatttca 6180
cgagcgtggt tgttccggtt acctttgcgg ataagaacat cgataagaag aatgagagtt 6240
tcaaggcggt tagtgagctt gatgccctcg tgcaggaaga gtatgatccg gaggcgtacc 6300
atggggcacc ggttgcagtg caggttatcg gacggagact cagtgaagag aggacgttgg 6360
cgattgcaga ggaagtgggg aagttgctgg gaaatgtggt gactccatag ctaataagtg 6420
tcagatagca atttgcacaa gaaatcaata ccagcaactg taaataagcg ctgaagtgac 6480
catgccatgc tacgaaagag cagaaaaaaa cctgccgtag aaccgaagag atatgacacg 6540
cttccatctc tcaaaggaag aatcccttca gggttgcgtt tccagtctag acacgtataa 6600
cggcacaagt gtctctcacc aaatgggtta tatctcaaat gtgatctaag gatggaaagc 6660
ccagaatatc gatcgcgcgc agatccatat atagggcccg ggttataatt acctcaggtc 6720
gacgtcccat ggccattcga attcgtaatc atggtcatag ctgtttcctg tgtgaaattg 6780
ttatccgctc acaattccac acaacatacg agccggaagc ataaagtgta aagcctgggg 6840
tgcctaatga gtgagctaac tcacattaat tgcgttgcgc tcactgcccg ctttccagtc 6900
gggaaacctg tcgtgccagc tgcattaatg aatcggccaa cgcgcgggga gaggcggttt 6960
gcgtattggg cgctcttccg cttcctcgct cactgactcg ctgcgctcgg tcgttcggct 7020
gcggcgagcg gtatcagctc actcaaaggc ggtaatacgg ttatccacag aatcagggga 7080
taacgcagga aagaacatgt gagcaaaagg ccagcaaaag gccaggaacc gtaaaaaggc 7140
cgcgttgctg gcgtttttcc ataggctccg cccccctgac gagcatcaca aaaatcgacg 7200
ctcaagtcag aggtggcgaa acccgacagg actataaaga taccaggcgt ttccccctgg 7260
aagctccctc gtgcgctctc ctgttccgac cctgccgctt accggatacc tgtccgcctt 7320
tctcccttcg ggaagcgtgg cgctttctca tagctcacgc tgtaggtatc tcagttcggt 7380
gtaggtcgtt cgctccaagc tgggctgtgt gcacgaaccc cccgttcagc ccgaccgctg 7440
cgccttatcc ggtaactatc gtcttgagtc caacccggta agacacgact tatcgccact 7500
ggcagcagcc actggtaaca ggattagcag agcgaggtat gtaggcggtg ctacagagtt 7560
cttgaagtgg tggcctaact acggctacac tagaagaaca gtatttggta tctgcgctct 7620
gctgaagcca gttaccttcg gaaaaagagt tggtagctct tgatccggca aacaaaccac 7680
cgctggtagc ggtggttttt ttgtttgcaa gcagcagatt acgcgcagaa aaaaaggatc 7740
tcaagaagat cctttgatct tttctacggg gtctgacgct cagtggaacg aaaactcacg 7800
ttaagggatt ttggtcatga gattatcaaa aaggatcttc acctagatcc ttttaaatta 7860
aaaatgaagt tttaaatcaa tctaaagtat atatgagtaa acttggtctg acagttacca 7920
atgcttaatc agtgaggcac ctatctcagc gatctgtcta tttcgttcat ccatagttgc 7980
ctgactcccc gtcgtgtaga taactacgat acgggagggc ttaccatctg gccccagtgc 8040
tgcaatgata ccgcgagacc cacgctcacc ggctccagat ttatcagcaa taaaccagcc 8100
agccggaagg gccgagcgca gaagtggtcc tgcaacttta tccgcctcca tccagtctat 8160
taattgttgc cgggaagcta gagtaagtag ttcgccagtt aatagtttgc gcaacgttgt 8220
tgccattgct acaggcatcg tggtgtcacg ctcgtcgttt ggtatggctt cattcagctc 8280
cggttcccaa cgatcaaggc gagttacatg atcccccatg ttgtgcaaaa aagcggttag 8340
ctccttcggt cctccgatcg ttgtcagaag taagttggcc gcagtgttat cactcatggt 8400
tatggcagca ctgcataatt ctcttactgt catgccatcc gtaagatgct tttctgtgac 8460
tggtgagtac tcaaccaagt cattctgaga atagtgtatg cggcgaccga gttgctcttg 8520
cccggcgtca atacgggata ataccgcgcc acatagcaga actttaaaag tgctcatcat 8580
tggaaaacgt tcttcggggc gaaaactctc aaggatctta ccgctgttga gatccagttc 8640
gatgtaaccc actcgtgcac ccaactgatc ttcagcatct tttactttca ccagcgtttc 8700
tgggtgagca aaaacaggaa ggcaaaatgc cgcaaaaaag ggaataaggg cgacacggaa 8760
atgttgaata ctcatactct tcctttttca atattattga agcatttatc agggttattg 8820
tctcatgagc ggatacatat ttgaatgtat ttagaaaaat aaacaaatag gggttccgcg 8880
cacatttccc cgaaaagtgc cacctgacgt ctaagaaacc attattatca tgacattaac 8940
ctataaaaat aggcgtatca cgaggccctt tcgtctcgcg cgtttcggtg atgacggtga 9000
aaacctctga cacatgcagc tcccggagac ggtcacagct tgtctgtaag cggatgccgg 9060
gagcagacaa gcccgtcagg gcgcgtcagc gggtgttggc gggtgtcggg gctggcttaa 9120
ctatgcggca tcagagcaga ttgtactgag agtgcaccat aaaattgtaa acgttaatat 9180
tttgttaaaa ttcgcgttaa atttttgtta aatcagctca ttttttaacc aataggccga 9240
aatcggcaaa atcccttata aatcaaaaga atagcccgag atagggttga gtgttgttcc 9300
agtttggaac aagagtccac tattaaagaa cgtggactcc aacgtcaaag ggcgaaaaac 9360
cgtctatcag ggcgatggcc cactacgtga accatcaccc aaatcaagtt ttttggggtc 9420
gaggtgccgt aaagcactaa atcggaaccc taaagggagc ccccgattta gagcttgacg 9480
gggaaagccg gcgaacgtgg cgagaaagga agggaagaaa gcgaaaggag cgggcgctag 9540
ggcgctggca agtgtagcgg tcacgctgcg cgtaaccacc acacccgccg cgcttaatgc 9600
gccgctacag ggcgcgtact atggttgctt tgacgtatgc ggtgtgaaat accgcacaga 9660
tgcgtaagga gaaaataccg catcaggcgc cattcgccat tcaggctgcg caactgttgg 9720
gaagggcgat cggtgcgggc ctcttcgcta ttacgccagc tggcgaaagg gggatgtgct 9780
gcaaggcgat taagttgggt aacgccaggg ttttcccagt cacgacgttg taaaacgacg 9840
gccagtgccc aagcttacta gtacttctcg agctctgtac atgtccggtc gcgacgtacg 9900
cgtatcgatg gcgccagctg caggcggccg c 9931
<210>9
<211>20
<212>DNA
<213>Trichoderma reesei
<400>9
ctgcagccac ttgcagtccc 20
<210>10
<211>2221
<212>DNA
<213>Trichoderma reesei
<220>
<221>misc_feature
<222>(166)..(166)
<223>n is a, c, g, or t
<400>10
aagcttagcc aagaacaata gccgataaag atagcctcat taaacggaat gagctagtag 60
gcaaagtcag cgaatgtgta tatataaagg ttcgaggtcc gtgcctccct catgctctcc 120
ccatctactc atcaactcag atcctccagg agacttgtac accatntttt gaggcacaga 180
aacccaatag tcaaccgcgg actggcatca tgtatcggaa gttggccgtc atctcggcct 240
tcttggccac agctcgtgct cagtcggcct gcactctcca atcggagact cacccgcctc 300
tgacatggca gaaatgctcg tctggtggca cttgcactca acagacaggc tccgtggtca 360
tcgacgccaa ctggcgctgg actcacgcta cgaacagcag cacgaactgc tacgatggca 420
acacttggag ctcgacccta tgtcctgaca acgagacctg cgcgaagaac tgctgtctgg 480
acggtgccgc ctacgcgtcc acgtacggag ttaccacgag cggtaacagc ctctccattg 540
gctttgtcac ccagtctgcg cagaagaacg ttggcgctcg cctttacctt atggcgagcg 600
acacgaccta ccaggaattc accctgcttg gcaacgagtt ctctttcgat gttgatgttt 660
cgcagctgcc gtaagtgact taccatgaac ccctgacgta tcttcttgtg ggctcccagc 720
tgactggcca atttaaggtg cggcttgaac ggagctctct acttcgtgtc catggacgcg 780
gatggtggcg tgagcaagta tcccaccaac accgctggcg ccaagtacgg cacggggtac 840
tgtgacagcc agtgtccccg cgatctgaag ttcatcaatg gccaggccaa cgttgagggc 900
tgggagccgt catccaacaa cgcaaacacg ggcattggag gacacggaag ctgctgctct 960
gagatggata tctgggaggc caactccatc tccgaggctc ttacccccca cccttgcacg 1020
actgtcggcc aggagatctg cgagggtgat gggtgcggcg gaacttactc cgataacaga 1080
tatggcggca cttgcgatcc cgatggctgc gactggaacc cataccgcct gggcaacacc 1140
agcttctacg gccctggctc aagctttacc ctcgatacca ccaagaaatt gaccgttgtc 1200
acccagttcg agacgtcggg tgccatcaac cgatactatg tccagaatgg cgtcactttc 1260
cagcagccca acgccgagct tggtagttac tctggcaacg agctcaacga tgattactgc 1320
acagctgagg aggcagaatt cggcggatcc tctttctcag acaagggcgg cctgactcag 1380
ttcaagaagg ctacctctgg cggcatggtt ctggtcatga gtctgtggga tgatgtgagt 1440
ttgatggaca aacatgcgcg ttgacaaaga gtcaagcagc tgactgagat gttacagtac 1500
tacgccaaca tgctgtggct ggactccacc tacccgacaa acgagacctc ctccacaccc 1560
ggtgccgtgc gcggaagctg ctccaccagc tccggtgtcc ctgctcaggt cgaatctcag 1620
tctcccaacg ccaaggtcac cttctccaac atcaagttcg gacccattgg cagcaccggc 1680
aaccctagcg gcggcaaccc tcccggcgga aaccgtggca ccaccaccac ccgccgccca 1740
gccactacca ctggaagctc tcccggacct acccagtctc actacggcca gtgcggcggt 1800
attggctaca gcggccccac ggtctgcgcc agcggcacaa cttgccaggt cctgaaccct 1860
tactactctc agtgcctgta aagctccgtg cgaaagcctg acgcaccggt agattcttgg 1920
tgagcccgta tcatgacggc ggcgggagct acatggcccc gggtgattta ttttttttgt 1980
atctacttct gacccttttc aaatatacgg tcaactcatc tttcactgga gatgcggcct 2040
gcttggtatt gcgatgttgt cagcttggca aattgtggct ttcgaaaaca caaaacgatt 2100
ccttagtagc catgcatttt aagataacgg aatagaagaa agaggaaatt aaaaaaaaaa 2160
aaaaaacaaa catcccgttc ataacccgta gaatcgccgc tcttcgtgta tcccagtacc 2220
a 2221
<210>11
<211>51
<212>DNA
<213>Trichoderma reesei
<400>11
atgtatcgga agttggccgt catctcggcc ttcttggcca cagctcgtgc t 51
<210>12
<211>1438
<212>DNA
<213>Trichoderma reesei
<400>12
cagtcggcct gcactctcca atcggagact cacccgcctc tgacatggca gaaatgctcg 60
tctggtggca cttgcactca acagacaggc tccgtggtca tcgacgccaa ctggcgctgg 120
actcacgcta cgaacagcag cacgaactgc tacgatggca acacttggag ctcgacccta 180
tgtcctgaca acgagacctg cgcgaagaac tgctgtctgg acggtgccgc ctacgcgtcc 240
acgtacggag ttaccacgag cggtaacagc ctctccattg gctttgtcac ccagtctgcg 300
cagaagaacg ttggcgctcg cctttacctt atggcgagcg acacgaccta ccaggaattc 360
accctgcttg gcaacgagtt ctctttcgat gttgatgttt cgcagctgcc gtaagtgact 420
taccatgaac ccctgacgta tcttcttgtg ggctcccagc tgactggcca atttaaggtg 480
cggcttgaac ggagctctct acttcgtgtc catggacgcg gatggtggcg tgagcaagta 540
tcccaccaac accgctggcg ccaagtacgg cacggggtac tgtgacagcc agtgtccccg 600
cgatctgaag ttcatcaatg gccaggccaa cgttgagggc tgggagccgt catccaacaa 660
cgcaaacacg ggcattggag gacacggaag ctgctgctct gagatggata tctgggaggc 720
caactccatc tccgaggctc ttacccccca cccttgcacg actgtcggcc aggagatctg 780
cgagggtgat gggtgcggcg gaacttactc cgataacaga tatggcggca cttgcgatcc 840
cgatggctgc gactggaacc cataccgcct gggcaacacc agcttctacg gccctggctc 900
aagctttacc ctcgatacca ccaagaaatt gaccgttgtc acccagttcg agacgtcggg 960
tgccatcaac cgatactatg tccagaatgg cgtcactttc cagcagccca acgccgagct 1020
tggtagttac tctggcaacg agctcaacga tgattactgc acagctgagg aggcagaatt 1080
cggcggatcc tctttctcag acaagggcgg cctgactcag ttcaagaagg ctacctctgg 1140
cggcatggtt ctggtcatga gtctgtggga tgatgtgagt ttgatggaca aacatgcgcg 1200
ttgacaaaga gtcaagcagc tgactgagat gttacagtac tacgccaaca tgctgtggct 1260
ggactccacc tacccgacaa acgagacctc ctccacaccc ggtgccgtgc gcggaagctg 1320
ctccaccagc tccggtgtcc ctgctcaggt cgaatctcag tctcccaacg ccaaggtcac 1380
cttctccaac atcaagttcg gacccattgg cagcaccggc aaccctagcg gcggcaac 1438
<210>13
<211>72
<212>DNA
<213>Trichoderma reesei
<400>13
cctcccggcg gaaaccgtgg caccaccacc acccgccgcc cagccactac cactggaagc 60
tctcccggac ct 72
<210>14
<211>513
<212>PRT
<213>Trichoderma
<400>14
Met Tyr Arg Lys Leu Ala Val Ile Ser Ala Phe Leu Ala Thr Ala Arg
1 5 10 15
Ala Gln Ser Ala Cys Thr Leu Gln Ser Glu Thr His Pro Pro Leu Thr
20 25 30
Trp Gln Lys Cys Ser Ser Gly Gly Thr Cys Thr Gln Gln Thr Gly Ser
35 40 45
Val Val Ile Asp Ala Asn Trp Arg Trp Thr His Ala Thr Asn Ser Ser
50 55 60
Thr Asn Cys Tyr Asp Gly Asn Thr Trp Ser Ser Thr Leu Cys Pro Asp
65 70 75 80
Asn Glu Thr Cys Ala Lys Asn Cys Cys Leu Asp Gly Ala Ala Tyr Ala
85 90 95
Ser Thr Tyr Gly Val Thr Thr Ser Gly Asn Ser Leu Ser Ile Gly Phe
100 105 110
Val Thr Gln Ser Ala Gln Lys Asn Val Gly Ala Arg Leu Tyr Leu Met
115 120 125
Ala Ser Asp Thr Thr Tyr Gln Glu Phe Thr Leu Leu Gly Asn Glu Phe
130 135 140
Ser Phe Asp Val Asp Val Ser Gln Leu Pro Cys Gly Leu Asn Gly Ala
145 150 155 160
Leu Tyr Phe Val Ser Met Asp Ala Asp Gly Gly Val Ser Lys Tyr Pro
165 170 175
Thr Asn Thr Ala Gly Ala Lys Tyr Gly Thr Gly Tyr Cys Asp Ser Gln
180 185 190
Cys Pro Arg Asp Leu Lys Phe Ile Asn Gly Gln Ala Asn Val Glu Gly
195 200 205
Trp Glu Pro Ser Ser Asn Asn Ala Asn Thr Gly Ile Gly Gly His Gly
210 215 220
Ser Cys Cys Ser Glu Met Asp Ile Trp Glu Ala Asn Ser Ile Ser Glu
225 230 235 240
Ala Leu Thr Pro His Pro Cys Thr Thr Val Gly Gln Glu Ile Cys Glu
245 250 255
Gly Asp Gly Cys Gly Gly Thr Tyr Ser Asp Asn Arg Tyr Gly Gly Thr
260 265 270
Cys Asp Pro Asp Gly Cys Asp Trp Asn Pro Tyr Arg Leu Gly Asn Thr
275 280 285
Ser Phe Tyr Gly Pro Gly Ser Ser Phe Thr Leu Asp Thr Thr Lys Lys
290 295 300
Leu Thr Val Val Thr Gln Phe Glu Thr Ser Gly Ala Ile Asn Arg Tyr
305 310 315 320
Tyr Val Gln Asn Gly Val Thr Phe Gln Gln Pro Asn Ala Glu Leu Gly
325 330 335
Ser Tyr Ser Gly Asn Glu Leu Asn Asp Asp Tyr Cys Thr Ala Glu Glu
340 345 350
Ala Glu Phe Gly Gly Ser Ser Phe Ser Asp Lys Gly Gly Leu Thr Gln
355 360 365
Phe Lys Lys Ala Thr Ser Gly Gly Met Val Leu Val Met Ser Leu Trp
370 375 380
Asp Asp Tyr Tyr Ala Asn Met Leu Trp Leu Asp Ser Thr Tyr Pro Thr
385 390 395 400
Asn Glu Thr Ser Ser Thr Pro Gly Ala Val Arg Gly Ser Cys Ser Thr
405 410 415
Ser Ser Gly Val Pro Ala Gln Val Glu Ser Gln Ser Pro Asn Ala Lys
420 425 430
Val Thr Phe Ser Asn Ile Lys Phe Gly Pro Ile Gly Ser Thr Gly Asn
435 440 445
Pro Ser Gly Gly Asn Pro Pro Gly Gly Asn Arg Gly Thr Thr Thr Thr
450 455 460
Arg Arg Pro Ala Thr Thr Thr Gly Ser Ser Pro Gly Pro Thr Gln Ser
465 470 475 480
His Tyr Gly Gln Cys Gly Gly Ile Gly Tyr Ser Gly Pro Thr Val Cys
485 490 495
Ala Ser Gly Thr Thr Cys Gln Val Leu Asn Pro Tyr Tyr Ser Gln Cys
500 505 510
Leu
<210>15
<211>2267
<212>DNA
<213>里氏木霉
<400>15
atggctcgtt cacggagctc cctggccctc gggctgggcc tgctctgctg gatcacgctg 60
ctcttcgctc ctctggcgtt tgtcggaaag gccaatgccg cgagcgacga cgcggacaac 120
tacggcactg ttatcggaat tgtaagtcga ctgacggcag caaccccgcc attttcttgg 180
tgttgatgct caggcagccc tgctaacacg cttctcctcc gcccaggatc tcggaactac 240
ctacagctgc gtcggtgtga tgcagaaggg caaggttgag attctcgtca acgaccaggg 300
taaccgaatc actccctcct acgtggcctt taccgacgag gagcgtctgg ttggcgattc 360
cgccaagaac caggccgccg ccaaccccac caacaccgtc tacgatgtca agtcagttct 420
accgccctgt tggcttctat tgtataagtg gacaattagc taactgttgt cacaggcgat 480
tgattggccg caaattcgac gagaaggaga tccaggccga catcaagcac ttcccctaca 540
aggtcattga gaagaacggc aagcccgtcg tccaggtcca ggtcaacggc cagaagaagc 600
agttcactcc cgaggagatt tctgccatga ttcttggcaa gatgaaggag gttgccgagt 660
cgtacctggg caagaaggtt acccacgccg tcgtcaccgt ccctgcctac ttcaacgtga 720
gtcttttccc cgaaattcct cgaggattcc aagagccatc tgctaacagc ccgataggac 780
aaccagcgac aggccaccaa ggacgccggt accattgccg gcttgaacgt tctccgaatc 840
gtcaacgaac ccaccgctgc cgctatcgcc tatggtctgg acaagaccga cggtgagcgc 900
cagatcattg tctacgatct cggtggtggt acctttgatg tttctctcct gtccattgac 960
aatggcgtct tcgaggtctt ggctaccgcc ggtgacaccc accttggtgg tgaggacttt 1020
gaccagcgca ttatcaacta cctggccaag gcctacaaca agaagaacaa cgtcgacatc 1080
tccaaggacc tcaaggccat gggcaagctc aagcgtgaag ccgaaaaggc caagcgtacc 1140
ctctcttccc agatgagcac tcgtatcgaa atcgaggcct tcttcgaggg caacgacttc 1200
tccgagactc tcacccgggc caagttcgag gagctcaaca tggacctctt caagaagacc 1260
ctgaagcctg tcgagcaggt tctcaaggac gccaacgtca agaagagcga ggttgacgac 1320
atcgttctgg tcggcggttc cacccgtatc cccaaggttc agtctcttat cgaggagtac 1380
tttaacggca agaaggcttc caagggtatc aaccccgacg aggctgttgc tttcggtgcc 1440
gccgtccagg ccggtgtcct ttctggtgag gaaggtaccg atgacattgt tctcatggac 1500
gtcaaccccc tgactctcgg tatcgagacc actggcggag tcatgaccaa gctcattccc 1560
cgcaacaccc ccatccccac tcgcaagagc cagatcttct cgactgctgc cgataaccag 1620
cccgtcgtcc tgatccaggt cttcgagggt gagcgttcca tgaccaagga caacaacctc 1680
ctgggcaagt tcgagcttac cggcattcct cctgcccccc gcggtgtccc ccagattgag 1740
gtttccttcg agttggatgc caacggtatc ctcaaggtct ccgctcacga caagggcacc 1800
ggcaagcagg agtccatcac catcaccaac gacaagggcc gtctcaccca ggaggagatt 1860
gaccgcatgg ttgccgaggc cgagaagttc gccgaggagg acaaggctac ccgtgagcgc 1920
atcgaggccc gtaacggtct tgagaactac gccttcagcc tgaagaacca ggtcaatgac 1980
gaggagggcc tcggcggcaa gattgacgag gaggacaagg agactgtaag ttgaagcgat 2040
ccatcactgc tttctgatgc ggacatgtca cactaacact tgaccagatt cttgacgccg 2100
tcaaggaggc taccgagtgg ctcgaggaga acggcgccga cgccactacc gaggactttg 2160
aggagcagaa ggagaagctg tccaacgtcg cctaccccat cacctccaag atgtaccagg 2220
gtgctggtgg ctccgaggac gatggcgact tccacgacga attgtaa 2267
<210>16
<211>1942
<212>DNA
<213>Trichoderma reesei
<400>16
atgaagttca acaccgtcgc ggccgctgcg gctctgctcg ctggtgtcgc gtatgccgag 60
gacgtcgagg agtccaaggc agtcccggag cttcccacct ttactgtgag tttgccctct 120
ctttcatctt tggaaaagga cccaaatgtt ggcgcttggc tccagcttgg agcaagcttc 180
ttggacgacg ggatatcatg aaccgctgct gacagttccc accaatcgct tagcccacct 240
ccatcaaggc ggacttcctc gagcagttca ccgacgactg ggagtcccgg tggaagcctt 300
cccacgccaa gaaggacacc agcggctccg acaaggacgc agaggaggaa tgggcctacg 360
tcggcgagtg ggcggtcgag gagccctacc agtacaaggg catcaacggc gacaagggcc 420
tcgttgtcaa gaaccctgcc gcgcaccacg ccatctcggc caagttcccc aagaagattg 480
acaacaaggg caagacgctc gtcgtgcagt acgaggtgaa gctccagagt aagtttggcc 540
tctgcaactc ccccgtgata accaaagcga gatgtggaca ttgtgctgac ctatacgctt 600
ccagagggac tggactgcgg cggtgcctac atgaagctgc tgcgcgacaa caaggctctc 660
caccaggatg agttcagcaa caccaccccc tacgtcatca tgtttggccc cgacaagtgc 720
ggccacaaca accgggtcca cttcatcgtc aaccacaaga accccaagac tggcgagtac 780
gaggagaagc acctcaactc ggccccggcc gtcaacattg tcaagacgac ggagctctac 840
accctcattg tccaccccaa caacaccttc tccatcaagc agaacggtgt cgagaccaag 900
gccggcagcc ttctcgagga cctgagccct cccatcaacc ctcccaagga gattgatgac 960
cccaaggact ccaagcccga cgactgggtc gacgaggctc gcattcccga ccccgaggcc 1020
gtcaagcccg aggactggga cgaggatgcg ccctttgaga ttgtcgacga ggaggccgtc 1080
aagcccgagg actggctcga ggacgagccc accacgatcc ccgaccccga ggcccagaag 1140
cccgaggact gggatgacga ggaggacggc gactggatcc ctcccaccgt ccccaacccc 1200
aagtgcgagg acgtctccgg ttgcggcccc tggaccaagc ccatggtcag gaaccccaac 1260
tacaagggca agtggactgc tccttacatt gacaaccctg cctacaaggg cgtctgggct 1320
ccccgcaaga tcaagaaccc cgactacttt gaggacaaga cgcccgccaa ctttgagccc 1380
atgggagctg taagtttcgt tcctttacca agaccttcat gacgctcgat tgctaaccag 1440
tgctcgacag attggcttcg agatctggac catgaccaac gacatcctct ttgacaacat 1500
ctacattggc cactccattg aggatgccga gaagctggcc aacgagacct tcttcgtcaa 1560
gcaccccatt gagaaggcgc ttgccgaggc tgatgagccc aagtttgacg acacccccaa 1620
gtcgccctct gacctcaagt tcctcgacga ccccgtgacc tttgtcaagg agaagcttga 1680
cctgttcctg accattgccc agcgcgaccc cgttgaggcc atcaagtttg ttcccgaggt 1740
cgccggtggc attgccgccg tcttcgtcac cctgattgcc atcattgtcg gtctggtcgg 1800
ccttggctcc tcatcggccg cccccaagaa ggccgccgcc actgctaagg agaaggccaa 1860
ggacgtttcc gaggctgttg caagcggtgc cgacaaggtc aagggagagg ttaccaagcg 1920
aaccacccgc agccagtcgt ag 1942
<210>17
<211>1910
<212>DNA
<213>Trichoderma reesei
<400>17
atgaagtcgg cgagcaaatt gttctttctc tccgtgtttt ccctatgggc gacgccgggc 60
gcatgctcaa gctcgtcaag tacatgcact gtacgtcaac ccaaccttgg cctcgtttcc 120
cctttggaag aatgctttgc gctgacagat tttgttgatc tagttctccc caaacgccat 180
cattgacgat ggatgcgttt cgtatgcgac tctcgataga ctcaatgtca aggtgaagcc 240
tgctatagac gaactcgttc agacgaccga cttcttttcg cactatcgct tgaacctctt 300
caacaaaaaa tgccccttct ggaacgacga agatggcatg tgcggtaaca ttgcctgcgc 360
cgtcgagacg ctggacaacg aagaagatat tcccgagata tggagggctc acgagcttag 420
caagctggaa ggccctcgag cgaagcatcc cggcaagcaa gagcagaggc agaaccctga 480
gcgaccgctg cagggagagc tgggggagga tgtaggggag agctgcgtgg ttgaatacga 540
cgacgagtgt gacgacagag actactgcgt ctgggacgac gaaggcgcaa cgtccaaggg 600
ggactacatc agcttgttgc gcaaccccga gcgcttcacc ggctatggcg gtcaaagtgc 660
aaagcaggtg tgggacgcca tctactcgga gaactgcttc aagaagagct cgtttcccaa 720
gtcggccgat ctaggcgtct cgcaccgccc aaccgaggcg gctgctctgg acttcaagca 780
ggtcctggac accgctggcc gccaggctca actggaacag cagcggcaga gcaacccaaa 840
cattcccttt gttgccaaca ctggctacga ggtggacgat gagtgtctgg agaagcgcgt 900
gttctaccgg gtggtgtcgg gaatgcacgc cagcatcagc gtccacctgt gctgggactt 960
cctgaaccag agcacggggc aatggcagcc caacttggac tgctacgaga gccgcctgca 1020
caagtttcca gaccgcatca gcaacctcta cttcaactac gctctcgtga ctcgcgccat 1080
tgcgaagctg ggcccgtatg tactgtcacc gcagtacacc ttttgcacag gggacccgtt 1140
gcaagaccag gagacgcgag acaagattgc ggccgtcacg aagcacgcgg ctagcgtccc 1200
gcagatcttt gacgagggcg tcatgtttgt caacggcgaa ggcccctcgc tcaaggaaga 1260
tttccgcaat cgcttccgca acatcagccg ggtcatggac tgcgtcggct gcgacaagtg 1320
ccgtctctgg ggcaagatcc agaccagcgg ctacggcacg gctttgaaga ttctgtttga 1380
gttcaacgag ggccagaagc cgccgcccct caagaggacc gagctggtgg ccctcttcaa 1440
cacgtatgcc agactcagct cgtcggtggc ggccgttggg cgattcaggg ccatgattga 1500
catgcgcgac aagatggcgt ccaagcccga cttcaagccc gaggatctct acacgctcat 1560
cgacgaggcg gacgaggaca tggacgagtt tatcaggatg caaaatcgtg ggagccacgg 1620
agatacgctg ggcgagcagg tcggaaacga atttgcccgc gtcatgatgg ccgtcaagat 1680
tgtgctcaag agttggatcc gaacgcccaa gatgatgtaa gtctcttctc tctttttttt 1740
ccccttcttc gagtggcaca aagctcttca ttgagatgga ctaacacaat tctagttggc 1800
aaattgtctc ggaagagacg tcgagattgt atcgcgcttg ggtcggtctg cctgcgcgac 1860
ccagacggta cgcgttcaga ctgcccaact tgaatagaga cgagttgtga 1910
<210>18
<211>3027
<212>DNA
<213>Trichoderma reesei
<400>18
atgaagtcac cgaggaaatc accgttgctg aagctcctcg gagccgcctt tctcttctcc 60
accaacgttc tcgccatctc cgctgttctc ggagtcgatc tgggaaccga gtacatcaag 120
gcggcgctgg tgaagcccgg catcccgctt gagattgtgc tcacgaaaga ttcccgacga 180
aaagaaacct cggccgtcgc cttcaagccg gcaaagggcg ccttaccgga gggccagtac 240
cccgaacgga gctatggcgc cgacgcaatg gcactcgccg cacgattccc cggcgaagta 300
tacccgaatc tgaagcccct gcttggactg ccagtggggg atgccattgt ccaagaatat 360
gcggccaggc accctgcgtt gaagctacag gcgcacccca cgcggggaac tgctgcgttc 420
aagacggaga cgctgtctcc ggaagaggag gcttggatgg tggaggagct gttggccatg 480
gagcttcaga gcatccagaa gaacgcagag gttaccgctg gcggcgactc ttcgatacgc 540
tccatcgtgc tcaccgtccc gccgttttac accatcgagg agaagcgagc cctgcagatg 600
gcagcagagc tcgccggctt caaggtcctg agccttgtca gcgacggact ggccgtgggc 660
ctcaactatg ccaccagtcg ccaattcccg aatatcaacg aaggcgccaa gccggaatac 720
cacttggtct ttgacatggg agcgggctcc acaactgcta cggtcatgag gttccaaagc 780
cgtacggtta aggacgtcgg caagttcaac aagacggttc aggagatcca ggttctcggc 840
agcggctggg acaggaccct cggaggagac tctctcaact cgctaatcat cgatgacatg 900
attgctcagt ttgtggaatc caagggtgct cagaagattt cggcaaccgc cgagcaggtt 960
cagtctcatg gccgcgccgt tgcaaagctg agcaaggaag ccgagcgtct ccgacacgtc 1020
ctcagcgcca accagaacac ccaagccagc tttgagggac tgtacgaaga tgttgacttc 1080
aagtacaaga tctctcgggc tgacttcgag accatggcaa aggctcatgt cgagcgagtc 1140
aacgctgcca tcaaggacgc tctgaaggcc gcgaacctcg agattggcga tctgacttcc 1200
gtcattcttc acggtggtgc gacccgtact ccgtttgtgc gagaggccat tgagaaagct 1260
cttggttctg gcgacaagat ccgtaccaat gtcaactctg atgaggcagc cgtctttggt 1320
gctgctttcc gggctgctga gctcagccca agcttccgtg tgaaggagat taggatttct 1380
gagggtgcaa actacgcagc tggcattact tggaaggctg cgaacggcaa ggtacaccgc 1440
caacgactct ggactgcccc gtcgccgctc ggtggcccgg ccaaggagat tacctttacg 1500
gaacaggagg actttactgg tttattctat caacaagttg acactgagga taagcccgtc 1560
aagtcgttct cgactaagaa ccttaccgcc tctgttgctg ctctgaaaga aaagtatccc 1620
acttgtgccg atactggcgt tcagttcaag gctgccgcga agctccgtac cgagaacggc 1680
gaggttgcca tcgtcaaggc ctttgtggag tgcgaggctg aagtcgttga gaaggaaggc 1740
tttgttgacg gcgttaagaa cctctttggc ttcgggaaga aagatcagaa gcccctcgcc 1800
gaaggaggag acaaggacag tgccgatgcg tctgcggatt ctgaggccga gacggaggaa 1860
gctagctctg cgacaaagtc ctcctcttcc accagcacca ccaagtccgg agatgctgcc 1920
gagtcaacag aggctgcaaa ggaagtcaag aagaagcagc ttgtttctat ccctgtcgaa 1980
gtcacgttgg aaaaggctgg aatccctcag cttaccaagg ccgagtggac caaggccaag 2040
gatcgactga aggcattcgc cgcctccgac aaggccaggc tgcagcgcga agaggccctg 2100
aaccagctcg aagcattcac ttacaaggtt cgcgaccttg tcgacaacga agccttcatc 2160
tccgcgtcta ccgaggcgga gcgacagacg ctctctgaaa aggctagcga agcaagtgac 2220
tggctttatg aggagggcga ctcggccacg aaagatgact ttgttgctaa gctcaaggct 2280
ctgcaagatc tcgtggcacc gatccagaac cgcctggacg aggctgagaa gcggcctggt 2340
ctgattagcg atctgagaaa cattctcaac accacaaatg tgtttattga cactgttcgt 2400
gggcagattg ctgcgtatga tgaatggaaa tccacagctt cagccaagtc ggctgaatca 2460
gccacctcga gtgctgccgc cgaggcgacg accaacgact ttgaagggct cgaggatgag 2520
gacgacagcc ccaaagaggc tgaggagaag cccgttccag aaaaggtcgt gcccccgctg 2580
cacaactctg aggagattga cacgctcgag gttctctaca aggagactct ggagtggctg 2640
aacaagctcg aacgccaaca ggcagatgtt cctctcaccg aagagcccgt gcttgttgtc 2700
agcgagctgg ttgccagacg agatgcgctt gacaaggcca gcttagacct cgcgctgaag 2760
agctacaccc aataccagaa gaacaagccc aagaagccca ccaagagcaa gaaggcgaag 2820
aagcaggaca agacgaagag cgccgacaag gctggcccga cgtttgagtt tcccgagggc 2880
agcgtgcccc tctccggcga ggagctggag gagctggtca agaagtacat gaaggaggag 2940
gaggagaccc gcaggcaggc cgagggcgga caggcagagg agaagccggc ggaagataca 3000
gagaagtcga gccatgacga gctctaa 3027
<210>19
<211>1417
<212>DNA
<213>Trichoderma reesei
<400>19
atggtagcca gattgtccag catctacgcc tgtgggctct tagcctggac gcacattgtt 60
tgcgcctctc agtttagcga cccgatgcaa ctacagaagc atcttgcaca gaatgactat 120
actttaattg cttgtaagtc atgacaatat cgccttctaa agtgtgtcaa ctcaggtaga 180
aataattgct aatagtagct tacagttgtt gctgtaagag ttgttcgggt caatatctta 240
cctagtcaag tcgagactcg aggctgacct taaggtatcg ctaccactta cagcctcaac 300
tgtaagtttt ggccaggcca agtttgaacc catctccctt aagaacaccg aacttaaaaa 360
aagtcaaacg gcagagaagc cagcaaactc ctcttagaag aatggcagac ggtccagcaa 420
catgtcgcct ccaccgccac catcgactgt ccgtccagcc ctaaactctg tcaggagatg 480
gacgtcgcct cctttcccgc tattcggctc taccgccagg atggctcagt aacacgttat 540
cgagggcctc gtcggaccgc accgtgagtt gacactttct tcgaattttg gagttaatct 600
ctcaaagcat gaagtgactg actgactacc ttacctccca ggatcgacgc ctttgtgaag 660
cgtgctctca aaccatccgt gcagaatgtt cctgggcagc aacttgccaa cttcatcacc 720
aacgacgact atgtattcat cgccaagctg caaggcgaga gcgagagcat caattctcac 780
tacagggatt ttgcgcaaga gtattctgat cgatactcgt ttggcatcat cacgagtggc 840
tctgtaccct ccaatggcgt ctggtgctac aacaacgtcg acggaaatca gcacgcggcg 900
acggacttga acgatccaaa tgccttgaag aagcttctca atctttgcac cgcggaggtc 960
attccccagc ttacacgacg caatgagatg acttatcttt ccgtatgtct tctgttctcc 1020
ctcctcactt ttaaaatgtt cagtagaaga agcttgggct tctgacccct tattccagtc 1080
aggccgatcc ttggtctatt acttctccaa caatgaagca gaccgcgaag catacgtcaa 1140
agcgctcaaa cccatcgccc agcgatacgc cgagttcctc cagttcgtca ccgtcgactc 1200
tggcgagtat cccgatatgc tgcgcaatct gggcgttcgc tccgccggag gcctggcagt 1260
gcaaaacgtc cacaacggac atattttccc cttcagagga gacgctgctg cttcgcctgg 1320
acaggttgac cagttcattg tggccatctc agaaggtagg gcgcagcctt gggatgggag 1380
gtttgacgag ggacaggagg cgcatgatga gctctga 1417
<210>20
<211>2174
<212>DNA
<213>Trichoderma reesei
<400>20
atgcggctaa catccttctt ctctggcctg gccgcctttg gccttctgtc atctccagca 60
ctggcagatg atgaagctga caacgtcccc gcgcccacat acttcgattc cgtcatggtg 120
cctcccttga cagaactaac gccagacaac ttcgaaaagg aggcaagcaa aaccaagtgg 180
cttcttgtga agcactacag gtactaagcc cttcagccat atcacaccac tccccgtctg 240
attcaagctg acgcgtagcc gctgtctagt ccatactgcc accattgtat cagctacgcc 300
ccgaccttcc agacaaccta cgaattctac tacacatcca agccagaagg agctggcgac 360
acgagcttca ccgacttcta cgacttcaag tttgctgccg tgaactgtat cgcctacagc 420
gacctttgcg ttgagaatgg cgtcaagcta taccccacta cggttctata cgagaacggc 480
aaagaggtca aggccgtaac gggtggccag aacatcacct tcctttctga tctcatcgaa 540
gaagctttgg agaagtcgaa gcctggatct cggcccaagt ctctcgcatt gccccaaccg 600
ggcgacaaag agcgccccaa atctgagccc gagacagcat cgaggagcgc aaccgaggag 660
aagaagccca agaagccggt tgccacgccg aacgaagacg gagtgtcagt ttccttgacg 720
gccgaaaact tccagcgcct ggtgactatg actcaggatc cctggttcat caagttttac 780
gcgccgtggt gcccccattg ccaagacatg gcgcctacct gggagcagct ggcgaagaac 840
atgaagggca agctcaacat tggagaggtc aactgtgaca aggagtcgcg attgtgcaaa 900
gacgttggtg cgcgggcgtt tcccactatc ctgttcttca agggtggaga gcgctcagag 960
tacgaggggc tccgaggcct gggcgacttt atcaaatatg ccgaaaacgc cgtcgacctc 1020
gctagcggag tgcctgacgt ggacttggca gcattcaagg ctctcgagca gaaggaagac 1080
gtcatctttg tctactttta cgaccacgcc accacatcgg aggacttcaa tgccctcgag 1140
aggctgcccc tgagtctcat cggacatgcc aaactggtta agactaagga tccggccatg 1200
tacgagcgct tcaagatcac gacatggccc agattcatgg tttcgaggga gggtcgccct 1260
acgtactacc ctcccctcac ccctaacgcg atgagagata cccaccaagt tctggactgg 1320
atgaggtcgg tttggcttcc ccttgtcccc gaactgttgg ttaccaacgc ccgccagatc 1380
atggacaaca aaattgttgt gctcggcgtc ctgaatcgag aagaccagga atccttccag 1440
agtgctcttc gggagatgaa gagcgcagcc aacgagtgga tggacaggca aatccaagag 1500
ttccagttgg agcggaagaa gctgcgagac gcgaagcaaa tgaggatcga ggaagctgag 1560
gaccgagacg atgagcgcgc cctgcgggcc gccaaggcga tccatattga catgaacaat 1620
tccggacgga gagaagtggc ctttgcgtgg gttgatggcg tagcgtggca gcgctggatt 1680
cgaaccacgt atggcattga tgttaaggac ggagaaagag tcattatcaa cgaccaagat 1740
gtaagcctca agctcacccc catttgtcct ccctctacaa tattgctttg cgtttcgaac 1800
atgaacgact aacaaaaaca tttgaacaga gccgcaagta ctgggacagc accgtgacgg 1860
gcaactacat cctcgtcagc cgcacgtcca tcctggagac gctcgacaag gtcgtctaca 1920
ccccgcaggc cctcaagccc aagctcacca tttcctcttt cgagaagatc tttttcgaca 1980
tccgcgtctc cttcaccgag cacccctacc tgaccctggg ctgcatcgtt ggcatcgcct 2040
ttggagcctt ctcctggctg cgtggccgct ctcgccgtgg acgcggccac ttccggctcg 2100
aggattccat cagcattaga gatttcaagg acgggttcct tggtggatct aacggcaaca 2160
ccaaggccga ctga 2174
<210>21
<211>1578
<212>DNA
<213>Trichoderma reesei
<400>21
atgcatcagc aaaccctcct cgccaccctc gcggcgagtc tcgctgctct tccttttgct 60
caggcgggct tctattcgaa gagctctccc gtgctgcaag tagacgccaa gtcgtacgac 120
cgcctcatca caaagtcgaa tcatacctct gtaagtatcc gtcctcacac actcacctca 180
ctcacaacgc gacatcatat ctcatacaca tccaccccaa accaccacaa acacaagaca 240
tatatcaagc tcaaacacat acacatacat acaaacacat acacacacag atacatacac 300
aactctcata tatatgaacc attcattgac atttccccca agattgtcga attctacgcc 360
ccctggtgcg gccactgcca aaacctcaag cccgcctacg aaaaggccgc ccgcaccctc 420
gacggcctgg ccaaggtcgc cgccgtcgac tgcgacgacg acgccaacaa ggccctctgc 480
ggctccctcg gcgtcaaggg cttccccacc ctcaagatcg tccgccccgg caagaagccc 540
ggccgccccg tcgtcgagga ctaccagggc cagcgcaccg cgggcgccat tgccgacgcc 600
gtcgtcgcca agatcaacaa ccacgtcgtc aagctgacgg acaaggacat tgatgccttt 660
ctggaaaagg acggcgacaa gccaaaggcc atcttgttca cggaaaaggg aactacgagt 720
gcgctgctga ggagccttgc tattgatttt ctcgacgccg tgaccattgg ccaggtccgc 780
aacaaggaaa aggctgccgt cgacaggttc ggcatctctt cgttcccttc cttcgtcctc 840
atccccggag gcggcaagga gcccgtcgtc tacagcggcg agctcaacaa gaaggacatg 900
gtcgagttcc tcaagcaggt cgccgagccc aaccccgacc cggccccctc aaacggcaag 960
tccggcaaga aggcctccac caaggacaag gccagcagca aggaggcccc ccaaaaggcc 1020
gccgccgccg acgagtcttc gtccgccgca tcctccgaga cctcaacggc cgccgcgccg 1080
gagtcgaccc tcatcgacat ccccgccctg acttccaagg cagagctcga ggagcactgt 1140
ctccaaccaa agtcccaaac ctgcgtcctc gcctttgtgc ccgcgtccgc ctcggagatg 1200
cgcaacaaga tcctttctgc cgtctcccag ctgcacacca agtacgtcca cggaaagcgc 1260
cacttcccct tcttctctgt cgacagcgac gtcgaaggct ctgccgccct caaggaagcc 1320
ctcggcctct cgggcaagat tgagctcgtt gccctcaacg cccgccgggg gtggtggagg 1380
cgatacgagg acggtgagtt cagcgttcac agcgtcgagt cctggattga cgccgttcgc 1440
atgggcgagg gcgagaagaa gaagcttccc gagggagtcg tcgtcgagaa ggcggagccg 1500
gcggaggaag caaagtctga gactgaagct gccgcagctg atgaggccac tgagaagcct 1560
gagcacgatg agctctaa 1578
<210>22
<211>1167
<212>DNA
<213>Trichoderma reesei
<400>22
atggtcttga tcaagagcct cgtgctcgcc gtcctggcca gctcggtggc tgccaagtcg 60
gccgtcatcg acctgattcc gtccaacttt gacaagcttg tcttctccgg aaagcccacg 120
cttgtcgagt tttttgctcc ctggtgcggc cactgcaaga accttgctcc cgtgtacgag 180
gagttggccc aggtgtttga gcatgctaag gacaaggtcc agattgcaaa ggtcgacgcc 240
gactcggagc gagacctcgg aaagcggttc ggcatccagg gcttccccac gctcaagttc 300
ttcgatggca agagcaagga gccgcaggag tacaagtcgg gccgtgatct ggacagcctg 360
accaagttca tcactgagaa gactggtgtc aagcccaaga agaagggcga gctgcccagc 420
agcgtggtga tgctgaacac taggaccttc cacgacactg ttggaggcga caagaatgtc 480
ctggtagcgt tcactgctcc ttggtgtggc cgtaagtgaa gcctcgaccc ccgactgagt 540
cttgattctc gcatatttac ctcttgacca gactgcaaga acctcgcccc cacttgggaa 600
aaggttgcca atgacttcgc gggtgatgag aacgttgtga ttgccaaggt cgatgccgag 660
ggcgctgaca gcaaggccgt cgccgaagag tacggcgtca ctggctaccc caccatcctc 720
ttcttccccg ctggcaccaa gaagcaggtt gactaccaag gcggccgatc ggagggtgac 780
tttgtcaact tcatcaacga gaaggccggc accttccgaa ccgagggcgg cgagctgaat 840
gacatcgccg gcaccgtggc gcccctcgac accatcgtgg ccaacttcct cagcggcacc 900
ggcttggccg aggctgctgc tgagatcaag gaggctgttg acctgcttac ggatgctgcg 960
gagaccaagt tcgccgagta ctacgtccgc gtcttcgaca agctgagcaa gaatgagaag 1020
tttgttaaca aggagcttgc gagactgcag ggcatcctgg ccaagggtgg ccttgcccct 1080
tctaagcggg atgagatcca gatcaagatc aacgtcctgc gcaaatttac ccccaaggag 1140
aacgaggacc agaaggacga gctgtga 1167
<210>23
<211>1705
<212>DNA
<213>Trichoderma reesei
<400>23
atgcaacaga agcgtcttac tgctgccctg gtggccgctt tggccgctgt ggtctctgcc 60
gagtcggatg tcaagtcctt gaccaaggac accttcaacg acttcatcaa ctccaatgac 120
ctcgtcctgg ctgagtgtat gtctctctct ctctctctcc ccccctcccc tttgccttct 180
gccctctcaa gcttctgcat ctctcgaccc ctcccccgcc agccccccgg catcgagatc 240
cccgctaaca gctgcaatct tccagtcttc gctccctggt gcggccactg caaggctctc 300
gcccccgagt acgaggaggc ggccacgact ctcaaggaca agagcatcaa gctcgccaag 360
gtcgactgtg tcgaggaggc tgacctctgc aaggagcatg gagttgaggg ctaccccacg 420
ctcaaggtct tccgtggcct cgataaggtc gctccctaca ctggtccccg caaggctgac 480
gggtaagctt tgaattgcac tgttctttgc atcaatccat tcattcgcta acgttggttg 540
tcctttcagc atcacctcct acatggtgaa gcagtccctg cctgccgtct ccgccctcac 600
caaggatacc ctcgaggact tcaagaccgc cgacaaggtc gtcctggtcg cctacatcgc 660
cgccgatgac aaggcctcca acgagacctt cactgctctg gccaacgagc tgcgtgacac 720
ctacctcttt ggtggcgtca acgatgctgc cgttgctgag gctgagggcg tcaagttccc 780
ttccattgtc ctctacaagt ccttcgacga gggcaagaac gtcttcagcg agaagttcga 840
tgctgaggcc attcgcaact ttgctcaggt tgccgccact cccctcgttg gcgaagttgg 900
ccctgagacc tacgccggct acatgtctgc cggtatccct ctggcttaca tcttcgccga 960
gaccgccgag gagcgtgaga acctggccaa gaccctcaag cccgtcgccg agaagtacaa 1020
gggcaagatc aacttcgcca ccatcgacgc caagaacttt ggctcgcacg ccggcaacat 1080
caacctcaag accgacaagt tccccgcctt tgccattcac gacattgaga agaacctcaa 1140
gttccccttt gaccagtcca aggagatcac cgagaaggac attgccgcct ttgtcgacgg 1200
cttctcctct ggcaagattg aggccagcat caagtccgag cccatccccg agacccagga 1260
gggccccgtc accgttgtcg ttgcccactc ttacaaggac attgtccttg acgacaagaa 1320
ggacgtcctg attgagttct acgctccctg gtgcggtcac tgcaaggctc tcgcccccaa 1380
gtacgatgag ctcgccagcc tgtatgccaa gagcgacttc aaggacaagg ttgtcatcgc 1440
caaggttgat gccactgcca acgacgtccc cgacgagatc cagggcttcc ccaccatcaa 1500
gctctacccc gccggtgaca agaagaaccc cgtcacctac agcggtgccc gcactgttga 1560
ggacttcatc gagttcatca aggagaacgg caagtacaag gccggcgtcg agatccccgc 1620
cgagcccacc gaggaggctg aggcttccga gtccaaggcc tctgaggagg ccaaggcttc 1680
cgaggagact cacgatgagc tgtaa 1705
<210>24
<211>982
<212>DNA
<213>Trichoderma reesei
<400>24
atgaaggcag ccctgctcct ctccgccctg gcctcgtgcg ccattggcct cgtcgccgcc 60
gccgccgagg acttcaagat cgaggtcacc caccccgtcg agtgcgaccg caagacgcaa 120
aagggcgaca agctgtccat gcactaccgc ggcacgctgg ccaagacggg cgacaagttc 180
gatgccagtg cgtttcttct attccctttc cctctttcct cccatttctc tcacacacca 240
atgacggtcc tccttttctt ttgatctcat tgactgacaa gttttggtct acctactcta 300
ggctacgatc gtaaccagcc attcaacttc aagctgggtg ctggccaggt gattaagggg 360
ttcgtcttgc ccaccccccc ctaacccacc cctctcgttc ttttatgacg acgacgacga 420
cgacgacgtt gggcgacgtt gaggctaacg gcttgtagat gggatcaggg tctccttgac 480
atgtgcattg gcgagaagag gtaagacgaa ccgaaccaac ccaactgcgt cgctcactgc 540
ctccttgggc ctctatcagg acgcaatgct gaccattaca tcaccaattc aggactctca 600
cgatccctcc cgagctgggc tacggccagc gcaacatggg ccccattccc gccggctcaa 660
ccctgagtac gtggctccta tcctccccta cctgaactcc caaacccaga gtttcaccca 720
cgccgcatgg aaaaccaggc cgcaggctaa caacacacga tgccatacag tctttgagac 780
cgagctcctc gccatcgagg gcgtcaaggc ccccgagaag aagcccgtcc ccgagacgcc 840
cattgtcgag aagcccgccg aagagacaga ggagagcgtc gtcgagaagg ccgccgaggc 900
agccgccagc gtggcctccg aggccgtcga cgccgccaag actgtctttg ccgacactga 960
cgagggtcac ggggagctgt aa 982
<210>25
<211>809
<212>DNA
<213>Trichoderma reesei
<400>25
atgctgacct ttaggcggct cttcaccacc gccatcgtcc tggtggtggg cctgctcttc 60
ttcgtcaaga cggccgaggc cgccaagggc cccaagatca cccacaaggt cttcttcgac 120
attgagcacg gcgacgagaa gctgggccgc atcgtcctgg gcctgtacgg caagacggtc 180
cccgagacgg ccgagaactt ccgggccctg gccaccggcg agaagggctt cggctacgag 240
ggctcgacct tccaccgcgt catcaagcag tttatgattc agggcggcga ctttaccaag 300
ggcgatggca ccggtggcaa gtcgagtaag ttgcctttgg ttcccaaata agcaatcaat 360
tgatcaatca attgggtggc atggcgtttg tcactgcatc tggctctggc tctggctaac 420
cttgagggct ccgtctagtc tacggcaaca agttcaagga cgagaacttc aagctgaagc 480
acaccaagaa gggcctgctg tccatggcca acgcgggacc cgacaccaac ggctcccagt 540
tcttcatcac cactgttgtt acctcgtatg atttccccac cctccttgga agatcctgga 600
taagaagtag gaccaatcta acgaacaact taaacagatg gctcgacggc cgacacgtcg 660
tcttcggcga ggttctcgag ggctacgaca ttgttgagaa gattgaaaac gtccagaccg 720
gccccggcga tcgcccagtg aagccggtca agattgccaa gagcggcgag ctggaggttc 780
cccccgaagg tattcacgtc gagctctaa 809
<210>26
<211>1372
<212>DNA
<213>Trichoderma reesei
<400>26
atgatactgc gcgcggcaat cttcgtcttg ctggcgctgg tatcgctggc ggtttgcgcc 60
gaggactttt acaaggtatg ccgggacgca atgcctcgaa tcaagcacgg agcgtgctga 120
cggacacatg acaggttcta ggagtcgaca agtctgcgtc agacaagcag ctcaagcagg 180
cctatcgcca gctctccaag aagttccacc cagacaagaa cccgtacgcc ctcctacagc 240
tacacgcagt ctcgccaacc ttctccaatg tgctaatcac tctactgctt ctagaggcga 300
tgaaacggcg cacgagaaat tcgtgctggt gtccgaggcc tacgaagttc tgagcgattc 360
cgagcttcgc aaagtctacg accgctacgg ccacgagggc gtcaagtccc accgtcaagg 420
cggcggcgga ggaggaggag gcgacccctt cgacctcttc agcaggttct ttggcggcca 480
tggccacttt gggagaaaca gccgcgagcc ccggggcagc aacattgagg tccgcatcga 540
gatttccctc cgcgactttt acaacggcgc cacgaccgag ttccagtggg agaagcagca 600
catatgcgaa aagtgcgagg gcacgggcag cgcggacgga aaggtcgaga cgtgcagcgt 660
ctgcggcgga cacggggttc ggattgtcaa gcagcagctc gttcccggca tgttccagca 720
gatgcagatg cgctgcgacc actgtggcgg ctcgggcaag accatcaaga acaagtgttc 780
cgtctgccac ggcagccgag tcgagcgcaa gccgacgact gtcagcctga ctgtcgagag 840
gggcattgct cgagatgcca aggtggtgtt tgagaacgaa gccgaccaga gccccgactg 900
ggttcctggt gatctcattg tcaacctggg cgagaaggcc ccgtcatacg aagacaaccc 960
cgatcgcgtc gacggcacct tcttccggcg caagggccat gacctgtact ggaccgaggt 1020
tctgtcgctg cgtgaggcct ggatgggtgg ctggacgcgt aacctcacgc acctcgacaa 1080
gcacgttgtg cgtcttggac gggagcgagg ccaggttgtt cagagtgggt tggtggaaac 1140
cattcccggc gaaggcatgc ccatatggca cgaagaggga gagagcgtct atcacacaca 1200
cgagtttgga aatctctacg tcacatacga agtcattttg ccggaccaga tggacaagaa 1260
gatggagagc gagttctggg acctgtggga gaagtggcgg tccaagaatg gtgtggacct 1320
gcaaaaggat ctcgggcggc ctgagccagg gcatgaccat gatgagttat ga 1372
<210>27
<211>685
<212>DNA
<213>Trichoderma reesei
<400>27
atggcgcgcc gccagcacct caccgcgaca gtcctgctgg ccgtcgtgct cttcttcagc 60
atcacgtacc tcctctcggg ctcgtccagc tccaatgcgg atcgaacgcg cgaggccgta 120
gtggcagagc ccaagtcgga attcaaggtg gattttgacg gcatgccggc caacctgctg 180
gagggagagt caatagcacc caagctggag aatgcgactc tcaagtacgt ttcccgcata 240
cccgaacctg ctcccatgag ccaccgacca tggcagtgtt tcaaaggata ccagttctga 300
cgcttttctg caattacata gagccgagct cggtcgcgca acatggaaat tcatgcacac 360
aatggtcgcc cgcttccccg agaagccctc gcccgaggag cgcaagacgc tcgagacctt 420
catctacctc ttcggccggc tgtacccctg cggcgactgc gcgaggcact tccggggcct 480
gctggcaaaa tatccgccgc agacgagtag ccggaatgcg gctgccggat ggctgtgttt 540
tgtgcacaac caggtcaacg agaggctgaa gaagcccata tttgactgca acaacattgg 600
cgacttttac gactgcggct gcggggacga gaagaaggac gggaaggagg aggccaaggt 660
tgatggcgaa ttggtgaagg aatag 685
<210>28
<211>3407
<212>DNA
<213>Trichoderma reesei
<400>28
atggtgatgc tggtggcgat cgcgctcgca tggctgggat gctcgctgct gcggccggta 60
gatgccatgc gcgcagacta tctggcccag ctgcggcagg agacggtgga catgttctat 120
cacggatata gcaactacat ggagcatgcg tttcccgaag acgaggtggg ttccgctgcg 180
atagaagatt gttgttgggg ctgctgctat gttccagctc ccggggggtc ggattctctc 240
atatagaact agacagctaa cgacttgtgc cttttccata tgcttagctg cgtcccatat 300
cgtgcactcc cctgacgcga gatcgagaca atccggggcg catcagcctc aacgatgccc 360
tcggcaacta ctctctgacc ctcatagaca gcctgtctac ccttgccatc ctggccggcg 420
gcccgcagaa cggcccttac acgggaccgc aggctctgag cgacttccag gatggcgtgg 480
ccgagtttgt gcgacactac ggagacgggc gatcggggcc ctccggcgct gggatacgtg 540
ccagaggctt tgatctcgac agcaaagttc aggtctttga gaccgtcatc cggggcgtgg 600
gcggtctcct tagcgcgcac ctgttcgcca ttggggagct gccgattacc ggatacgtgc 660
ccaggccgga gggagtcgca ggcgatgatc ctctggagct ggcccctatt ccgtggccca 720
atgggttcag gtacgatggc cagctgctga ggctcgcgct cgacctctcc gagaggctgc 780
ttcccgcctt ctacacgccg acgggcattc cgtatcctcg tgtcaatctc cgcagcggca 840
tcccctttta cgtcaactcg cctctccacc aaaacctggg cgaggcagtg gaggagcaga 900
gtggccgtcc tgaaattacc gagacctgca gcgccggggc gggaagcctg gttctcgaat 960
ttaccgtctt gagcaggctc acgggagacg ccaggtttga acaagccgcc aagcgagcat 1020
tctgggaggt ctggcatcgc aggagcgaaa ttggcttgat cgggaacggc atcgacgccg 1080
agcgcgggct gtggatcggc cctcacgcgg gcattggcgc gggcatggac agcttctttg 1140
aatatgcgct caagagccat atcctcctct cgggcctcgg tatgcccaac gcctccacgt 1200
cgcgccgaca gagcacaacc agctggctgg atccaaactc cctgcacccg ccgctgccac 1260
cagagatgca cacgtcagat gccttcctcc aggcatggca tcaggcgcac gcctcggtca 1320
agcggtacct gtacaccgac cggagccact tcccttatta ctccaacaac caccgtgcca 1380
cgggccagcc ctatgccatg tggatcgaca gcctgggcgc cttctatccg gggctcctcg 1440
ccctggccgg tgaggtggaa gaggccattg aggcgaacct cgtctacaca gccttgtgga 1500
cgcggtactc tgcgctgccc gaacgctggt ccgtccgcga aggcaacgtc gaggcaggca 1560
tcggctggtg gcccgggagg cccgagttca tcgagtcgac gtaccacatc taccgtgcaa 1620
cccgcgaccc gtggtatctg cacgttggcg agatggtcct ccgcgacatt cggcgtcggt 1680
gctatgcgga gtgcggctgg gccgggcttc aggacgtgca gacgggcgag aagcaggacc 1740
gcatggagag cttcttcttg ggagagacgg caaaatacat gtacctgctg ttcgacccag 1800
accatccact caacaagctg gatgccgcct acgtcttcac cacagaaggc catccgctta 1860
tcataccaaa gagcaaaagg ggtagcggct ctcacaacag acaggaccgc gctcgcaaag 1920
ccaagaagag ccgagacgtc gcagtctaca cctactacga tgaaagcttc acaaactctt 1980
gtccggcccc tcggccgcct tcagagcatc acctgatagg ctcggccacg gcggccaggc 2040
cagacttgtt ctccgtctct cgcttcacag acctgtacag aacgcccaac gtacacgggc 2100
ccctggagaa ggtggagatg cgagacaaga agaagggccg ggtggttcga tacagggcca 2160
cctcaaacca caccatcttc ccctggactc ttcccccagc catgctgccg gagaatggca 2220
cctgcgctgc tcccccggaa cgcatcatat ccttgattga gttcccggcc aacgacatca 2280
ccagtggaat cacgtcgcgg ttcggcaacc atctatcgtg gcagacgcat ctggggccaa 2340
cggtcaacat tctagaggga ctgaggctcc agctcgagca ggtgtcggac cctgccacgg 2400
gagaagacaa gtggaggatc acacacattg gcaacacgca gctggggcgc cacgagacag 2460
tcttcttcca cgcggaacac gtaaggcatc tcaaggacga ggtgttttcc tgccgcagaa 2520
ggagggacgc cgtggaaatc gagctcctgg tcgacaagcc gagcgatacc aacaacaaca 2580
acacgcttgc ctcgtccgat gacgatgtag tggtagatgc aaaagcagaa gagcaagacg 2640
gcatgctagc cgacgacgac ggcgacacac tcaacgcaga aacactctcc tccaactccc 2700
tcttccagtc cctcctccgc gccgtctcct ccgtcttcga gcccgtctac accgccatcc 2760
ccgagtccga ccccagcgcc ggcaccgcca aggtctacag tttcgacgcc tacacgtcca 2820
ccggccccgg cgcgtacccc atgccgtcca tctcggacac gcccatcccc ggcaacccct 2880
tttacaactt ccgcaacccg gcctccaact tcccctggtc gaccgtcttc ctcgccggcc 2940
aggcctgcga gggcccgctc cccgcgtccg cgccgcgcga gcaccaggtc attgtcatgc 3000
tccgcggcgg ctgctccttc agccgcaagc tggacaacat ccccagcttc tcgccccacg 3060
acagggcgct gcagctcgtc gttgtcctcg acgaaccgcc gccgccgccg ccgccgccgc 3120
cagccagtca gaacagcggc ggcgatgacg acgatgaaga tgacgaagac gaccacgacg 3180
ccgtcaacga caacgaagac gacaggcgcg acgtgacgcg gccactgctc gacacggagc 3240
agaccacgcc caagggcatg aagcgcctgc acggcatccc aatggtcctc gtccgagccg 3300
cgcggggcga ctacgagctt ttcgggcatg ccattggcgt gggcatgagg cgcaagtatc 3360
gggttgaaag ccaggggctt gtcgtggaga atgcggttgt gctgtga 3407
<210>29
<211>1221
<212>DNA
<213>Trichoderma reesei
<400>29
atgaggcctc tggcactcat atttgccctc atcttgggcc tattgctctg cttagcagcc 60
ccagcaacgg catcgtcatc atcatcacaa cactctcccc aagcggcatc agacgagtca 120
gatttaatat gtcacacatc aaacccagac gaatgctatc cccgggtctt cgtaccaacg 180
catgagttcc agccagtcca cgacgaccag caactcccaa acggcctcca tgtccgtctc 240
aacatctgga ccggccaaaa ggaagccaag atcaacgtcc ccgatgaggc caaccctgat 300
ctcgatggcc tgcccgtcga ccaagccgtg gttctcgtcg accaggagca gccagaaatt 360
atccagatcc ccaagggcgc accaaaatac gacaatgtcg gcaagatcaa ggaacccgcg 420
caagaaggag acgcccaaac ggaagccatt gcttttgcag agacgttcaa catgctcaag 480
accggcaagt cgccaagcgc cgaggagttc gacaacggac tggaaggcct ggaggagctc 540
tcccacgaca tctactacgg gctcaaaatc acagaggacg cggacgtggt caaggcgcta 600
ttctgcttga tgggggctcg cgacggcgac gcctcggagg gagccacgcc gcgcgaccag 660
caagcggccg cgatcctcgc cggcgccctg tccaacaatc cgtcggcact cgccgagata 720
gccaagatct ggcctgagct tctggactcg tcgtgtcctc gcgacggcgc caccatctct 780
gaccgtttct accaagacac cgtctccgtt gccgactctc cggcaaaggt caaggccgcc 840
gtctcggcca tcaacggcct gatcaaggac ggcgccatcc gaaagcagtt tctcgaaaac 900
agcggcatga agcagctcct ctcggtcctg tgccaagaga agccggagtg ggcgggagcg 960
cagcggaaag tcgctcagct ggtgctggac accttcctgg acgaggacat gggcgcccag 1020
cttggccagt ggcccagggg caaggcatcg aacaacgggg tgtgtgcggc gccggagacg 1080
gcgctcgatg acggatgctg ggactatcat gcggacagga tggtgaagct gcatgggacg 1140
ccgtggagca aggagttgaa gcagaggctg ggagatgcgc gcaaggcgaa cagcaagttg 1200
ccggatcatg gcgagctgta g 1221
<210>30
<211>648
<212>PRT
<213>Trichoderma reesei
<400>30
Met Ala Arg Ser Arg Ser Ser Leu Ala Leu Gly Leu Gly Leu Leu Cys
1 5 10 15
Trp Ile Thr Leu Leu Phe Ala Pro Leu Ala Phe Val Gly Lys Ala Asn
20 25 30
Ala Ala Ser Asp Asp Ala Asp Asn Tyr Gly Thr Val Ile Gly Ile Asp
35 40 45
Leu Gly Thr Thr Tyr Ser Cys Val Gly Val Met Gln Lys Gly Lys Val
50 55 60
Glu Ile Leu Val Asn Asp Gln Gly Asn Arg Ile Thr Pro Ser Tyr Val
65 70 75 80
Ala Phe Thr Asp Glu Glu Arg Leu Val Gly Asp Ser Ala Lys Asn Gln
85 90 95
Ala Ala Ala Asn Pro Thr Asn Thr Val Tyr Asp Val Lys Arg Leu Ile
100 105 110
Gly Arg Lys Phe Asp Glu Lys Glu Ile Gln Ala Asp Ile Lys His Phe
115 120 125
Pro Tyr Lys Val Ile Glu Lys Asn Gly Lys Pro Val Val Gln Val Gln
130 135 140
Val Asn Gly Gln Lys Lys Gln Phe Thr Pro Glu Glu Ile Ser Ala Met
145 150 155 160
Ile Leu Gly Lys Met Lys Glu Val Ala Glu Ser Tyr Leu Gly Lys Lys
165 170 175
Val Thr His Ala Val Val Thr Val Pro Ala Tyr Phe Asn Asp Asn Gln
180 185 190
Arg Gln Ala Thr Lys Asp Ala Gly Thr Ile Ala Gly Leu Asn Val Leu
195 200 205
Arg Ile Val Asn Glu Pro Thr Ala Ala Ala Ile Ala Tyr Gly Leu Asp
210 215 220
Lys Thr Asp Gly Glu Arg Gln Ile Ile Val Tyr Asp Leu Gly Gly Gly
225 230 235 240
Thr Phe Asp Val Ser Leu Leu Ser Ile Asp Asn Gly Val Phe Glu Val
245 250 255
Leu Ala Thr Ala Gly Asp Thr His Leu Gly Gly Glu Asp Phe Asp Gln
260 265 270
Arg Ile Ile Asn Tyr Leu Ala Lys Ala Tyr Asn Lys Lys Asn Asn Val
275 280 285
Asp Ile Ser Lys Asp Leu Lys Ala Met Gly Lys Leu Lys Arg Glu Ala
290 295 300
Glu Lys Ala Lys Arg Thr Leu Ser Ser Gln Met Ser Thr Arg Ile Glu
305 310 315 320
Ile Glu Ala Phe Phe Glu Gly Asn Asp Phe Ser Glu Thr Leu Thr Arg
325 330 335
Ala Lys Phe Glu Glu Leu Asn Met Asp Leu Phe Lys Lys Thr Leu Lys
340 345 350
Pro Val Glu Gln Val Leu Lys Asp Ala Asn Val Lys Lys Ser Glu Val
355 360 365
Asp Asp Ile Val Leu Val Gly Gly Ser Thr Arg Ile Pro Lys Val Gln
370 375 380
Ser Leu Ile Glu Glu Tyr Phe Asn Gly Lys Lys Ala Ser Lys Gly Ile
385 390 395 400
Asn Pro Asp Glu Ala Val Ala Phe Gly Ala Ala Val Gln Ala Gly Val
405 410 415
Leu Ser Gly Glu Glu Gly Thr Asp Asp Ile Val Leu Met Asp Val Asn
420 425 430
Pro Leu Thr Leu Gly Ile Glu Thr Thr Gly Gly Val Met Thr Lys Leu
435 440 445
Ile Pro Arg Asn Thr Pro Ile Pro Thr Arg Lys Ser Gln Ile Phe Ser
450 455 460
Thr Ala Ala Asp Asn Gln Pro Val Val Leu Ile Gln Val Phe Glu Gly
465 470 475 480
Glu Arg Ser Met Thr Lys Asp Asn Asn Leu Leu Gly Lys Phe Glu Leu
485 490 495
Thr Gly Ile Pro Pro Ala Pro Arg Gly Val Pro Gln Ile Glu Val Ser
500 505 510
Gly Thr Gly Lys Gln Glu Ser Ile Thr Ile Thr Asn Asp Lys Gly Arg
515 520 525
Leu Thr Gln Glu Glu Ile Asp Arg Met Val Ala Glu Ala Glu Lys Phe
530 535 540
Ala Glu Glu Asp Lys Ala Thr Arg Glu Arg Ile Glu Ala Arg Asn Gly
545 550 555 560
Leu Glu Asn Tyr Ala Phe Ser Leu Lys Asn Gln Val Asn Asp Glu Glu
565 570 575
Gly Leu Gly Gly Lys Ile Asp Glu Glu Asp Lys Glu Thr Ile Leu Asp
580 585 590
Ala Val Lys Glu Ala Thr Glu Trp Leu Glu Glu Asn Gly Ala Asp Ala
595 600 605
Thr Thr Glu Asp Phe Glu Glu Gln Lys Glu Lys Leu Ser Asn Val Ala
610 615 620
Tyr Pro Ile Thr Ser Lys Met Tyr Gln Gly Ala Gly Gly Ser Glu Asp
625 630 635 640
Asp Gly Asp Phe His Asp Glu Leu
645
<210>31
<211>558
<212>PRT
<213>Trichoderma reesei
<400>31
Met Lys Phe Asn Thr Val Ala Ala Ala Ala Ala Leu Leu Ala Gly Val
1 5 10 15
Ala Tyr Ala Glu Asp Val Glu Glu Ser Lys Ala Val Pro Glu Leu Pro
20 25 30
Thr Phe Thr Pro Thr Ser Ile Lys Ala Asp Phe Leu Glu Gln Phe Thr
35 40 45
Asp Asp Trp Glu Ser Arg Trp Lys Pro Ser His Ala Lys Lys Asp Thr
50 55 60
Ser Gly Ser Asp Lys Asp Ala Glu Glu Glu Trp Ala Tyr Val Gly Glu
65 70 75 80
Trp Ala Val Glu Glu Pro Tyr Gln Tyr Lys Gly Ile Asn Gly Asp Lys
85 90 95
Gly Leu Val Val Lys Asn Pro Ala Ala His His Ala Ile Ser Ala Lys
100 105 110
Phe Pro Lys Lys Ile Asp Asn Lys Gly Lys Thr Leu Val Val Gln Tyr
115 120 125
Glu Val Lys Leu Gln Lys Gly Leu Asp Cys Gly Gly Ala Tyr Met Lys
130 135 140
Leu Leu Arg Asp Asn Lys Ala Leu His Gln Asp Glu Phe Ser Asn Thr
145 150 155 160
Thr Pro Tyr Val Ile Met Phe Gly Pro Asp Lys Cys Gly His Asn Asn
165 170 175
Arg Val His Phe Ile Val Asn His Lys Asn Pro Lys Thr Gly Glu Tyr
180 185 190
Glu Glu Lys His Leu Asn Ser Ala Pro Ala Val Asn Ile Val Lys Thr
195 200 205
Thr Glu Leu Tyr Thr Leu Ile Val His Pro Asn Asn Thr Phe Ser Ile
210 215 220
Lys Gln Asn Gly Val Glu Thr Lys Ala Gly Ser Leu Leu Glu Asp Leu
225 230 235 240
Ser Pro Pro Ile Asn Pro Pro Lys Glu Ile Asp Asp Pro Lys Asp Ser
245 250 255
Lys Pro Asp Asp Trp Val Asp Glu Ala Arg Ile Pro Asp Pro Glu Ala
260 265 270
Val Lys Pro Glu Asp Trp Asp Glu Asp Ala Pro Phe Glu Ile Val Asp
275 280 285
Glu Glu Ala Val Lys Pro Glu Asp Trp Leu Glu Asp Glu Pro Thr Thr
290 295 300
Ile Pro Asp Pro Glu Ala Gln Lys Pro Glu Asp Trp Asp Asp Glu Glu
305 310 315 320
Asp Gly Asp Trp Ile Pro Pro Thr Val Pro Asn Pro Lys Cys Glu Asp
325 330 335
Val Ser Gly Cys Gly Pro Trp Thr Lys Pro Met Val Arg Asn Pro Asn
340 345 350
Tyr Lys Gly Lys Trp Thr Ala Pro Tyr Ile Asp Asn Pro Ala Tyr Lys
355 360 365
Gly Val Trp Ala Pro Arg Lys Ile Lys Asn Pro Asp Tyr Phe Glu Asp
370 375 380
Lys Thr Pro Ala Asn Phe Glu Pro Met Gly Ala Ile Gly Phe Glu Ile
385 390 395 400
Trp Thr Met Thr Asn Asp Ile Leu Phe Asp Asn Ile Tyr Ile Gly His
405 410 415
Ser Ile Glu Asp Ala Glu Lys Leu Ala Asn Glu Thr Phe Phe Val Lys
420 425 430
His Pro Ile Glu Lys Ala Leu Ala Glu Ala Asp Glu Pro Lys Phe Asp
435 440 445
Asp Thr Pro Lys Ser Pro Ser Asp Leu Lys Phe Leu Asp Asp Pro Val
450 455 460
Thr Phe Val Lys Glu Lys Leu Asp Leu Phe Leu Thr Ile Ala Gln Arg
465 470 475 480
Asp Pro Val Glu Ala Ile Lys Phe Val Pro Glu Val Ala Gly Gly Ile
485 490 495
Ala Ala Val Phe Val Thr Leu Ile Ala Ile Ile Val Gly Leu Val Gly
500 505 510
Leu Gly Ser Ser Ser Ala Ala Pro Lys Lys Ala Ala Ala Thr Ala Lys
515 520 525
Glu Lys Ala Lys Asp Val Ser Glu Ala Val Ala Ser Gly Ala Asp Lys
530 535 540
Val Lys Gly Glu Val Thr Lys Arg Thr Thr Arg Ser Gln Ser
545 550 555
<210>32
<211>585
<212>PRT
<213>Trichoderma reesei
<400>32
Met Lys Ser Ala Ser Lys Leu Phe Phe Leu Ser Val Phe Ser Leu Trp
1 5 10 15
Ala Thr Pro Gly Ala Cys Ser Ser Ser Ser Ser Thr Cys Thr Phe Ser
20 25 30
Pro Asn Ala Ile Ile Asp Asp Gly Cys Val Ser Tyr Ala Thr Leu Asp
35 40 45
Arg Leu Asn Val Lys Val Lys Pro Ala Ile Asp Glu Leu Val Gln Thr
50 55 60
Thr Asp Phe Phe Ser His Tyr Arg Leu Asn Leu Phe Asn Lys Lys Cys
65 70 75 80
Pro Phe Trp Asn Asp Glu Asp Gly Met Cys Gly Asn Ile Ala Cys Ala
85 90 95
Val Glu Thr Leu Asp Asn Glu Glu Asp Ile Pro Glu Ile Trp Arg Ala
100 105 110
His Glu Leu Ser Lys Leu Glu Gly Pro Arg Ala Lys His Pro Gly Lys
115 120 125
Gln Glu Gln Arg Gln Asn Pro Glu Arg Pro Leu Gln Gly Glu Leu Gly
130 135 140
Glu Asp Val Gly Glu Ser Cys Val Val Glu Tyr Asp Asp Glu Cys Asp
145 150 155 160
Asp Arg Asp Tyr Cys Val Trp Asp Asp Glu Gly Ala Thr Ser Lys Gly
165 170 175
Asp Tyr Ile Ser Leu Leu Arg Asn Pro Glu Arg Phe Thr Gly Tyr Gly
180 185 190
Gly Gln Ser Ala Lys Gln Val Trp Asp Ala Ile Tyr Ser Glu Asn Cys
195 200 205
Phe Lys Lys Ser Ser Phe Pro Lys Ser Ala Asp Leu Gly Val Ser His
210 215 220
Arg Pro Thr Glu Ala Ala Ala Leu Asp Phe Lys Gln Val Leu Asp Thr
225 230 235 240
Ala Gly Arg Gln Ala Gln Leu Glu Gln Gln Arg Gln Ser Asn Pro Asn
245 250 255
Ile Pro Phe Val Ala Asn Thr Gly Tyr Glu Val Asp Asp Glu Cys Leu
260 265 270
Glu Lys Arg Val Phe Tyr Arg Val Val Ser Gly Met His Ala Ser Ile
275 280 285
Ser Val His Leu Cys Trp Asp Phe Leu Asn Gln Ser Thr Gly Gln Trp
290 295 300
Gln Pro Asn Leu Asp Cys Tyr Glu Ser Arg Leu His Lys Phe Pro Asp
305 310 315 320
Arg Ile Ser Asn Leu Tyr Phe Asn Tyr Ala Leu Val Thr Arg Ala Ile
325 330 335
Ala Lys Leu Gly Pro Tyr Val Leu Ser Pro Gln Tyr Thr Phe Cys Thr
340 345 350
Gly Asp Pro Leu Gln Asp Gln Glu Thr Arg Asp Lys Ile Ala Ala Val
355 360 365
Thr Lys His Ala Ala Ser Val Pro Gln Ile Phe Asp Glu Gly Val Met
370 375 380
Phe Val Asn Gly Glu Gly Pro Ser Leu Lys Glu Asp Phe Arg Asn Arg
385 390 395 400
Phe Arg Asn Ile Ser Arg Val Met Asp Cys Val Gly Cys Asp Lys Cys
405 410 415
Arg Leu Trp Gly Lys Ile Gln Thr Ser Gly Tyr Gly Thr Ala Leu Lys
420 425 430
Ile Leu Phe Glu Phe Asn Glu Gly Gln Lys Pro Pro Pro Leu Lys Arg
435 440 445
Thr Glu Leu Val Ala Leu Phe Asn Thr Tyr Ala Arg Leu Ser Ser Ser
450 455 460
Val Ala Ala Val Gly Arg Phe Arg Ala Met Ile Asp Met Arg Asp Lys
465 470 475 480
Met Ala Ser Lys Pro Asp Phe Lys Pro Glu Asp Leu Tyr Thr Leu Ile
485 490 495
Asp Glu Ala Asp Glu Asp Met Asp Glu Phe Ile Arg Met Gln Asn Arg
500 505 510
Gly Ser His Gly Asp Thr Leu Gly Glu Gln Val Gly Asn Glu Phe Ala
515 520 525
Arg Val Met Met Ala Val Lys Ile Val Leu Lys Ser Trp Ile Arg Thr
530 535 540
Pro Lys Met Ile Trp Gln Ile Val Ser Glu Glu Thr Ser Arg Leu Tyr
545 550 555 560
Arg Ala Trp Val Gly Leu Pro Ala Arg Pro Arg Arg Tyr Ala Phe Arg
565 570 575
Leu Pro Asn Leu Asn Arg Asp Glu Leu
580 585
<210>33
<211>1008
<212>PRT
<213>Trichoderma reesei
<400>33
Met Lys Ser Pro Arg Lys Ser Pro Leu Leu Lys Leu Leu Gly Ala Ala
1 5 10 15
Phe Leu Phe Ser Thr Asn Val Leu Ala Ile Ser Ala Val Leu Gly Val
20 25 30
Asp Leu Gly Thr Glu Tyr Ile Lys Ala Ala Leu Val Lys Pro Gly Ile
35 40 45
Pro Leu Glu Ile Val Leu Thr Lys Asp Ser Arg Arg Lys Glu Thr Ser
50 55 60
Ala Val Ala Phe Lys Pro Ala Lys Gly Ala Leu Pro Glu Gly Gln Tyr
65 70 75 80
Pro Glu Arg Ser Tyr Gly Ala Asp Ala Met Ala Leu Ala Ala Arg Phe
85 90 95
Pro Gly Glu Val Tyr Pro Asn Leu Lys Pro Leu Leu Gly Leu Pro Val
100 105 110
Gly Asp Ala Ile Val Gln Glu Tyr Ala Ala Arg His Pro Ala Leu Lys
115 120 125
Leu Gln Ala His Pro Thr Arg Gly Thr Ala Ala Phe Lys Thr Glu Thr
130 135 140
Leu Ser Pro Glu Glu Glu Ala Trp Met Val Glu Glu Leu Leu Ala Met
145 150 155 160
Glu Leu Gln Ser Ile Gln Lys Asn Ala Glu Val Thr Ala Gly Gly Asp
165 170 175
Ser Ser Ile Arg Ser Ile Val Leu Thr Val Pro Pro Phe Tyr Thr Ile
180 185 190
Glu Glu Lys Arg Ala Leu Gln Met Ala Ala Glu Leu Ala Gly Phe Lys
195 200 205
Val Leu Ser Leu Val Ser Asp Gly Leu Ala Val Gly Leu Asn Tyr Ala
210 215 220
Thr Ser Arg Gln Phe Pro Asn Ile Asn Glu Gly Ala Lys Pro Glu Tyr
225 230 235 240
His Leu Val Phe Asp Met Gly Ala Gly Ser Thr Thr Ala Thr Val Met
245 250 255
Arg Phe Gln Ser Arg Thr Val Lys Asp Val Gly Lys Phe Asn Lys Thr
260 265 270
Val Gln Glu Ile Gln Val Leu Gly Ser Gly Trp Asp Arg Thr Leu Gly
275 280 285
Gly Asp Ser Leu Asn Ser Leu Ile Ile Asp Asp Met Ile Ala Gln Phe
290 295 300
Val Glu Ser Lys Gly Ala Gln Lys Ile Ser Ala Thr Ala Glu Gln Val
305 310 315 320
Gln Ser His Gly Arg Ala Val Ala Lys Leu Ser Lys Glu Ala Glu Arg
325 330 335
Leu Arg His Val Leu Ser Ala Asn Gln Asn Thr Gln Ala Ser Phe Glu
340 345 350
Gly Leu Tyr Glu Asp Val Asp Phe Lys Tyr Lys Ile Ser Arg Ala Asp
355 360 365
Phe Glu Thr Met Ala Lys Ala His Val Glu Arg Val Asn Ala Ala Ile
370 375 380
Lys Asp Ala Leu Lys Ala Ala Asn Leu Glu Ile Gly Asp Leu Thr Ser
385 390 395 400
Val Ile Leu His Gly Gly Ala Thr Arg Thr Pro Phe Val Arg Glu Ala
405 410 415
Ile Glu Lys Ala Leu Gly Ser Gly Asp Lys Ile Arg Thr Asn Val Asn
420 425 430
Ser Asp Glu Ala Ala Val Phe Gly Ala Ala Phe Arg Ala Ala Glu Leu
435 440 445
Ser Pro Ser Phe Arg Val Lys Glu Ile Arg Ile Ser Glu Gly Ala Asn
450 455 460
Tyr Ala Ala Gly Ile Thr Trp Lys Ala Ala Asn Gly Lys Val His Arg
465 470 475 480
Gln Arg Leu Trp Thr Ala Pro Ser Pro Leu Gly Gly Pro Ala Lys Glu
485 490 495
Ile Thr Phe Thr Glu Gln Glu Asp Phe Thr Gly Leu Phe Tyr Gln Gln
500 505 510
Val Asp Thr Glu Asp Lys Pro Val Lys Ser Phe Ser Thr Lys Asn Leu
515 520 525
Thr Ala Ser Val Ala Ala Leu Lys Glu Lys Tyr Pro Thr Cys Ala Asp
530 535 540
Thr Gly Val Gln Phe Lys Ala Ala Ala Lys Leu Arg Thr Glu Asn Gly
545 550 555 560
Glu Val Ala Ile Val Lys Ala Phe Val Glu Cys Glu Ala Glu Val Val
565 570 575
Glu Lys Glu Gly Phe Val Asp Gly Val Lys Asn Leu Phe Gly Phe Gly
580 585 590
Lys Lys Asp Gln Lys Pro Leu Ala Glu Gly Gly Asp Lys Asp Ser Ala
595 600 605
Asp Ala Ser Ala Asp Ser Glu Ala Glu Thr Glu Glu Ala Ser Ser Ala
610 615 620
Thr Lys Ser Ser Ser Ser Thr Ser Thr Thr Lys Ser Gly Asp Ala Ala
625 630 635 640
Glu Ser Thr Glu Ala Ala Lys Glu Val Lys Lys Lys Gln Leu Val Ser
645 650 655
Ile Pro Val Glu Val Thr Leu Glu Lys Ala Gly Ile Pro Gln Leu Thr
660 665 670
Lys Ala Glu Trp Thr Lys Ala Lys Asp Arg Leu Lys Ala Phe Ala Ala
675 680 685
Ser Asp Lys Ala Arg Leu Gln Arg Glu Glu Ala Leu Asn Gln Leu Glu
690 695 700
Ala Phe Thr Tyr Lys Val Arg Asp Leu Val Asp Asn Glu Ala Phe Ile
705 710 715 720
Ser Ala Ser Thr Glu Ala Glu Arg Gln Thr Leu Ser Glu Lys Ala Ser
725 730 735
Glu Ala Ser Asp Trp Leu Tyr Glu Glu Gly Asp Ser Ala Thr Lys Asp
740 745 750
Asp Phe Val Ala Lys Leu Lys Ala Leu Gln Asp Leu Val Ala Pro Ile
755 760 765
Gln Asn Arg Leu Asp Glu Ala Glu Lys Arg Pro Gly Leu Ile Ser Asp
770 775 780
Leu Arg Asn Ile Leu Asn Thr Thr Asn Val Phe Ile Asp Thr Val Arg
785 790 795 800
Gly Gln Ile Ala Ala Tyr Asp Glu Trp Lys Ser Thr Ala Ser Ala Lys
805 810 815
Ser Ala Glu Ser Ala Thr Ser Ser Ala Ala Ala Glu Ala Thr Thr Asn
820 825 830
Asp Phe Glu Gly Leu Glu Asp Glu Asp Asp Ser Pro Lys Glu Ala Glu
835 840 845
Glu Lys Pro Val Pro Glu Lys Val Val Pro Pro Leu His Asn Ser Glu
850 855 860
Glu Ile Asp Thr Leu Glu Val Leu Tyr Lys Glu Thr Leu Glu Trp Leu
865 870 875 880
Asn Lys Leu Glu Arg Gln Gln Ala Asp Val Pro Leu Thr Glu Glu Pro
885 890 895
Val Leu Val Val Ser Glu Leu Val Ala Arg Arg Asp Ala Leu Asp Lys
900 905 910
Ala Ser Leu Asp Leu Ala Leu Lys Ser Tyr Thr Gln Tyr Gln Lys Asn
915 920 925
Lys Pro Lys Lys Pro Thr Lys Ser Lys Lys Ala Lys Lys Gln Asp Lys
930 935 940
Thr Lys Ser Ala Asp Lys Ala Gly Pro Thr Phe Glu Phe Pro Glu Gly
945 950 955 960
Ser Val Pro Leu Ser Gly Glu Glu Leu Glu Glu Leu Val Lys Lys Tyr
965 970 975
Met Lys Glu Glu Glu Glu Thr Arg Arg Gln Ala Glu Gly Gly Gln Ala
980 985 990
Glu Glu Lys Pro Ala Glu Asp Thr Glu Lys Ser Ser His Asp Glu Leu
995 1000 1005
<210>34
<211>363
<212>PRT
<213>Trichoderma reesei
<400>34
Met Val Ala Arg Leu Ser Ser Ile Tyr Ala Cys Gly Leu Leu Ala Trp
1 5 10 15
Thr His Ile Val Cys Ala Ser Gln Phe Ser Asp Pro Met Gln Leu Gln
20 25 30
Lys His Leu Ala Gln Asn Asp Tyr Thr Leu Ile Ala Phe Val Ala Ser
35 40 45
Arg Leu Glu Ala Asp Leu Lys Val Ser Leu Pro Leu Thr Ala Ser Thr
50 55 60
Ser Asn Gly Arg Glu Ala Ser Lys Leu Leu Leu Glu Glu Trp Gln Thr
65 70 75 80
Val Gln Gln His Val Ala Ser Thr Ala Thr Ile Asp Cys Pro Ser Ser
85 90 95
Pro Lys Leu Cys Gln Glu Met Asp Val Ala Ser Phe Pro Ala Ile Arg
100 105 110
Leu Tyr Arg Gln Asp Gly Ser Val Thr Arg Tyr Arg Gly Pro Arg Arg
115 120 125
Thr Ala Pro Ile Asp Ala Phe Val Lys Arg Ala Leu Lys Pro Ser Val
130 135 140
Gln Asn Val Pro Gly Gln Gln Leu Ala Asn Phe Ile Thr Asn Asp Asp
145 150 155 160
Tyr Val Phe Ile Ala Lys Leu Gln Gly Glu Ser Glu Ser Ile Asn Ser
165 170 175
His Tyr Arg Asp Phe Ala Gln Glu Tyr Ser Asp Arg Tyr Ser Phe Gly
180 185 190
Ile Ile Thr Ser Gly Ser Val Pro Ser Asn Gly Val Trp Cys Tyr Asn
195 200 205
Asn Val Asp Gly Asn Gln His Ala Ala Thr Asp Leu Asn Asp Pro Asn
210 215 220
Ala Leu Lys Lys Leu Leu Asn Leu Cys Thr Ala Glu Val Ile Pro Gln
225 230 235 240
Leu Thr Arg Arg Asn Glu Met Thr Tyr Leu Ser Ser Gly Arg Ser Leu
245 250 255
Val Tyr Tyr Phe Ser Asn Asn Glu Ala Asp Arg Glu Ala Tyr Val Lys
260 265 270
Ala Leu Lys Pro Ile Ala Gln Arg Tyr Ala Glu Phe Leu Gln Phe Val
275 280 285
Thr Val Asp Ser Gly Glu Tyr Pro Asp Met Leu Arg Asn Leu Gly Val
290 295 300
Arg Ser Ala Gly Gly Leu Ala Val Gln Asn Val His Asn Gly His Ile
305 310 315 320
Phe Pro Phe Arg Gly Asp Ala Ala Ala Ser Pro Gly Gln Val Asp Gln
325 330 335
Phe Ile Val Ala Ile Ser Glu Gly Arg Ala Gln Pro Trp Asp Gly Arg
340 345 350
Phe Asp Glu Gly Gln Glu Ala His Asp Glu Leu
355 360
<210>35
<211>688
<212>PRT
<213>Trichoderma reesei
<400>35
Met Arg Leu Thr Ser Phe Phe Ser Gly Leu Ala Ala Phe Gly Leu Leu
1 5 10 15
Ser Ser Pro Ala Leu Ala Asp Asp Glu Ala Asp Asn Val Pro Ala Pro
20 25 30
Thr Tyr Phe Asp Ser Val Met Val Pro Pro Leu Thr Glu Leu Thr Pro
35 40 45
Asp Asn Phe Glu Lys Glu Ala Ser Lys Thr Lys Trp Leu Leu Val Lys
50 55 60
His Tyr Ser Pro Tyr Cys His His Cys Ile Ser Tyr Ala Pro Thr Phe
65 70 75 80
Gln Thr Thr Tyr Glu Phe Tyr Tyr Thr Ser Lys Pro Glu Gly Ala Gly
85 90 95
Asp Thr Ser Phe Thr Asp Phe Tyr Asp Phe Lys Phe Ala Ala Val Asn
100 105 110
Cys Ile Ala Tyr Ser Asp Leu Cys Val Glu Asn Gly Val Lys Leu Tyr
115 120 125
Pro Thr Thr Val Leu Tyr Glu Asn Gly Lys Glu Val Lys Ala Val Thr
130 135 140
Gly Gly Gln Asn Ile Thr Phe Leu Ser Asp Leu Ile Glu Glu Ala Leu
145 150 155 160
Glu Lys Ser Lys Pro Gly Ser Arg Pro Lys Ser Leu Ala Leu Pro Gln
165 170 175
Pro Gly Asp Lys Glu Arg Pro Lys Ser Glu Pro Glu Thr Ala Ser Arg
180 185 190
Ser Ala Thr Glu Glu Lys Lys Pro Lys Lys Pro Val Ala Thr Pro Asn
195 200 205
Glu Asp Gly Val Ser Val Ser Leu Thr Ala Glu Asn Phe Gln Arg Leu
210 215 220
Val Thr Met Thr Gln Asp Pro Trp Phe Ile Lys Phe Tyr Ala Pro Trp
225 230 235 240
Cys Pro His Cys Gln Asp Met Ala Pro Thr Trp Glu Gln Leu Ala Lys
245 250 255
Asn Met Lys Gly Lys Leu Asn Ile Gly Glu Val Asn Cys Asp Lys Glu
260 265 270
Ser Arg Leu Cys Lys Asp Val Gly Ala Arg Ala Phe Pro Thr Ile Leu
275 280 285
Phe Phe Lys Gly Gly Glu Arg Ser Glu Tyr Glu Gly Leu Arg Gly Leu
290 295 300
Gly Asp Phe Ile Lys Tyr Ala Glu Asn Ala Val Asp Leu Ala Ser Gly
305 310 315 320
Val Pro Asp Val Asp Leu Ala Ala Phe Lys Ala Leu Glu Gln Lys Glu
325 330 335
Asp Val Ile Phe Val Tyr Phe Tyr Asp His Ala Thr Thr Ser Glu Asp
340 345 350
Phe Asn Ala Leu Glu Arg Leu Pro Leu Ser Leu Ile Gly His Ala Lys
355 360 365
Leu Val Lys Thr Lys Asp Pro Ala Met Tyr Glu Arg Phe Lys Ile Thr
370 375 380
Thr Trp Pro Arg Phe Met Val Ser Arg Glu Gly Arg Pro Thr Tyr Tyr
385 390 395 400
Pro Pro Leu Thr Pro Asn Ala Met Arg Asp Thr His Gln Val Leu Asp
405 410 415
Trp Met Arg Ser Val Trp Leu Pro Leu Val Pro Glu Leu Leu Val Thr
420 425 430
Asn Ala Arg Gln Ile Met Asp Asn Lys Ile Val Val Leu Gly Val Leu
435 440 445
Asn Arg Glu Asp Gln Glu Ser Phe Gln Ser Ala Leu Arg Glu Met Lys
450 455 460
Ser Ala Ala Asn Glu Trp Met Asp Arg Gln Ile Gln Glu Phe Gln Leu
465 470 475 480
Glu Arg Lys Lys Leu Arg Asp Ala Lys Gln Met Arg Ile Glu Glu Ala
485 490 495
Glu Asp Arg Asp Asp Glu Arg Ala Leu Arg Ala Ala Lys Ala Ile His
500 505 510
Ile Asp Met Asn Asn Ser Gly Arg Arg Glu Val Ala Phe Ala Trp Val
515 520 525
Asp Gly Val Ala Trp Gln Arg Trp Ile Arg Thr Thr Tyr Gly Ile Asp
530 535 540
Val Lys Asp Gly Glu Arg Val Ile Ile Asn Asp Gln Asp Val Ser Leu
545 550 555 560
Lys Leu Thr Pro Ile Cys Pro Pro Ser Thr Ile Leu Leu Cys Ser Arg
565 570 575
Lys Tyr Trp Asp Ser Thr Val Thr Gly Asn Tyr Ile Leu Val Ser Arg
580 585 590
Thr Ser Ile Leu Glu Thr Leu Asp Lys Val Val Tyr Thr Pro Gln Ala
595 600 605
Leu Lys Pro Lys Leu Thr Ile Ser Ser Phe Glu Lys Ile Phe Phe Asp
610 615 620
Ile Arg Val Ser Phe Thr Glu His Pro Tyr Leu Thr Leu Gly Cys Ile
625 630 635 640
Val Gly Ile Ala Phe Gly Ala Phe Ser Trp Leu Arg Gly Arg Ser Arg
645 650 655
Arg Gly Arg Gly His Phe Arg Leu Glu Asp Ser Ile Ser Ile Arg Asp
660 665 670
Phe Lys Asp Gly Phe Leu Gly Gly Ser Asn Gly Asn Thr Lys Ala Asp
675 680 685
<210>36
<211>461
<212>PRT
<213>Trichoderma reesei
<400>36
Met His Gln Gln Thr Leu Leu Ala Thr Leu Ala Ala Ser Leu Ala Ala
1 5 10 15
Leu Pro Phe Ala Gln Ala Gly Phe Tyr Ser Lys Ser Ser Pro Val Leu
20 25 30
Gln Val Asp Ala Lys Ser Tyr Asp Arg Leu Ile Thr Lys Ser Asn His
35 40 45
Thr Ser Ile Val Glu Phe Tyr Ala Pro Trp Cys Gly His Cys Gln Asn
50 55 60
Leu Lys Pro Ala Tyr Glu Lys Ala Ala Arg Thr Leu Asp Gly Leu Ala
65 70 75 80
Lys Val Ala Ala Val Asp Cys Asp Asp Asp Ala Asn Lys Ala Leu Cys
85 90 95
Gly Ser Leu Gly Val Lys Gly Phe Pro Thr Leu Lys Ile Val Arg Pro
100 105 110
Gly Lys Lys Pro Gly Arg Pro Val Val Glu Asp Tyr Gln Gly Gln Arg
115 120 125
Thr Ala Gly Ala Ile Ala Asp Ala Val Val Ala Lys Ile Asn Asn His
130 135 140
Val Val Lys Leu Thr Asp Lys Asp Ile Asp Ala Phe Leu Glu Lys Asp
145 150 155 160
Gly Asp Lys Pro Lys Ala Ile Leu Phe Thr Glu Lys Gly Thr Thr Ser
165 170 175
Ala Leu Leu Arg Ser Leu Ala Ile Asp Phe Leu Asp Ala Val Thr Ile
180 185 190
Gly Gln Val Arg Asn Lys Glu Lys Ala Ala Val Asp Arg Phe Gly Ile
195 200 205
Ser Ser Phe Pro Ser Phe Val Leu Ile Pro Gly Gly Gly Lys Glu Pro
210 215 220
Val Val Tyr Ser Gly Glu Leu Asn Lys Lys Asp Met Val Glu Phe Leu
225 230 235 240
Lys Gln Val Ala Glu Pro Asn Pro Asp Pro Ala Pro Ser Asn Gly Lys
245 250 255
Ser Gly Lys Lys Ala Ser Thr Lys Asp Lys Ala Ser Ser Lys Glu Ala
260 265 270
Pro Gln Lys Ala Ala Ala Ala Asp Glu Ser Ser Ser Ala Ala Ser Ser
275 280 285
Glu Thr Ser Thr Ala Ala Ala Pro Glu Ser Thr Leu Ile Asp Ile Pro
290 295 300
Ala Leu Thr Ser Lys Ala Glu Leu Glu Glu His Cys Leu Gln Pro Lys
305 310 315 320
Ser Gln Thr Cys Val Leu Ala Phe Val Pro Ala Ser Ala Ser Glu Met
325 330 335
Arg Asn Lys Ile Leu Ser Ala Val Ser Gln Leu His Thr Lys Tyr Val
340 345 350
His Gly Lys Arg His Phe Pro Phe Phe Ser Val Asp Ser Asp Val Glu
355 360 365
Gly Ser Ala Ala Leu Lys Glu Ala Leu Gly Leu Ser Gly Lys Ile Glu
370 375 380
Leu Val Ala Leu Asn Ala Arg Arg Gly Trp Trp Arg Arg Tyr Glu Asp
385 390 395 400
Gly Glu Phe Ser Val His Ser Val Glu Ser Trp Ile Asp Ala Val Arg
405 410 415
Met Gly Glu Gly Glu Lys Lys Lys Leu Pro Glu Gly Val Val Val Glu
420 425 430
Lys Ala Glu Pro Ala Glu Glu Ala Lys Ser Glu Thr Glu Ala Ala Ala
435 440 445
Ala Asp Glu Ala Thr Glu Lys Pro Glu His Asp Glu Leu
450 455 460
<210>37
<211>368
<212>PRT
<213>Trichoderma reesei
<400>37
Met Val Leu Ile Lys Ser Leu Val Leu Ala Val Leu Ala Ser Ser Val
1 5 10 15
Ala Ala Lys Ser Ala Val Ile Asp Leu Ile Pro Ser Asn Phe Asp Lys
20 25 30
Leu Val Phe Ser Gly Lys Pro Thr Leu Val Glu Phe Phe Ala Pro Trp
35 40 45
Cys Gly His Cys Lys Asn Leu Ala Pro Val Tyr Glu Glu Leu Ala Gln
50 55 60
Val Phe Glu His Ala Lys Asp Lys Val Gln Ile Ala Lys Val Asp Ala
65 70 75 80
Asp Ser Glu Arg Asp Leu Gly Lys Arg Phe Gly Ile Gln Gly Phe Pro
85 90 95
Thr Leu Lys Phe Phe Asp Gly Lys Ser Lys Glu Pro Gln Glu Tyr Lys
100 105 110
Ser Gly Arg Asp Leu Asp Ser Leu Thr Lys Phe Ile Thr Glu Lys Thr
115 120 125
Gly Val Lys Pro Lys Lys Lys Gly Glu Leu Pro Ser Ser Val Val Met
130 135 140
Leu Asn Thr Arg Thr Phe His Asp Thr Val Gly Gly Asp Lys Asn Val
145 150 155 160
Leu Val Ala Phe Thr Ala Pro Trp Cys Gly His Cys Lys Asn Leu Ala
165 170 175
Pro Thr Trp Glu Lys Val Ala Asn Asp Phe Ala Gly Asp Glu Asn Val
180 185 190
Val Ile Ala Lys Val Asp Ala Glu Gly Ala Asp Ser Lys Ala Val Ala
195 200 205
Glu Glu Tyr Gly Val Thr Gly Tyr Pro Thr Ile Leu Phe Phe Pro Ala
210 215 220
Gly Thr Lys Lys Gln Val Asp Tyr Gln Gly Gly Arg Ser Glu Gly Asp
225 230 235 240
Phe Val Asn Phe Ile Asn Glu Lys Ala Gly Thr Phe Arg Thr Glu Gly
245 250 255
Gly Glu Leu Asn Asp Ile Ala Gly Thr Val Ala Pro Leu Asp Thr Ile
260 265 270
Val Ala Asn Phe Leu Ser Gly Thr Gly Leu Ala Glu Ala Ala Ala Glu
275 280 285
Ile Lys Glu Ala Val Asp Leu Leu Thr Asp Ala Ala Glu Thr Lys Phe
290 295 300
Ala Glu Tyr Tyr Val Arg Val Phe Asp Lys Leu Ser Lys Asn Glu Lys
305 310 315 320
Phe Val Asn Lys Glu Leu Ala Arg Leu Gln Gly Ile Leu Ala Lys Gly
325 330 335
Gly Leu Ala Pro Ser Lys Arg Asp Glu Ile Gln Ile Lys Ile Asn Val
340 345 350
Leu Arg Lys Phe Thr Pro Lys Glu Asn Glu Asp Gln Lys Asp Glu Leu
355 360 365
<210>38
<211>502
<212>PRT
<213>Trichoderma reesei
<400>38
Met Gln Gln Lys Arg Leu Thr Ala Ala Leu Val Ala Ala Leu Ala Ala
1 5 10 15
Val Val Ser Ala Glu Ser Asp Val Lys Ser Leu Thr Lys Asp Thr Phe
20 25 30
Asn Asp Phe Ile Asn Ser Asn Asp Leu Val Leu Ala Glu Phe Phe Ala
35 40 45
Pro Trp Cys Gly His Cys Lys Ala Leu Ala Pro Glu Tyr Glu Glu Ala
50 55 60
Ala Thr Thr Leu Lys Asp Lys Ser Ile Lys Leu Ala Lys Val Asp Cys
65 70 75 80
Val Glu Glu Ala Asp Leu Cys Lys Glu His Gly Val Glu Gly Tyr Pro
85 90 95
Thr Leu Lys Val Phe Arg Gly Leu Asp Lys Val Ala Pro Tyr Thr Gly
100 105 110
Pro Arg Lys Ala Asp Gly Ile Thr Ser Tyr Met Val Lys Gln Ser Leu
115 120 125
Pro Ala Val Ser Ala Leu Thr Lys Asp Thr Leu Glu Asp Phe Lys Thr
130 135 140
Ala Asp Lys Val Val Leu Val Ala Tyr Ile Ala Ala Asp Asp Lys Ala
145 150 155 160
Ser Asn Glu Thr Phe Thr Ala Leu Ala Asn Glu Leu Arg Asp Thr Tyr
165 170 175
Leu Phe Gly Gly Val Asn Asp Ala Ala Val Ala Glu Ala Glu Gly Val
180 185 190
Lys Phe Pro Ser Ile Val Leu Tyr Lys Ser Phe Asp Glu Gly Lys Asn
195 200 205
Val Phe Ser Glu Lys Phe Asp Ala Glu Ala Ile Arg Asn Phe Ala Gln
210 215 220
Val Ala Ala Thr Pro Leu Val Gly Glu Val Gly Pro Glu Thr Tyr Ala
225 230 235 240
Gly Tyr Met Ser Ala Gly Ile Pro Leu Ala Tyr Ile Phe Ala Glu Thr
245 250 255
Ala Glu Glu Arg Glu Asn Leu Ala Lys Thr Leu Lys Pro Val Ala Glu
260 265 270
Lys Tyr Lys Gly Lys Ile Asn Phe Ala Thr Ile Asp Ala Lys Asn Phe
275 280 285
Gly Ser His Ala Gly Asn Ile Asn Leu Lys Thr Asp Lys Phe Pro Ala
290 295 300
Phe Ala Ile His Asp Ile Glu Lys Asn Leu Lys Phe Pro Phe Asp Gln
305 310 315 320
Ser Lys Glu Ile Thr Glu Lys Asp Ile Ala Ala Phe Val Asp Gly Phe
325 330 335
Ser Ser Gly Lys Ile Glu Ala Ser Ile Lys Ser Glu Pro Ile Pro Glu
340 345 350
Thr Gln Glu Gly Pro Val Thr Val Val Val Ala His Ser Tyr Lys Asp
355 360 365
Ile Val Leu Asp Asp Lys Lys Asp Val Leu Ile Glu Phe Tyr Ala Pro
370 375 380
Trp Cys Gly His Cys Lys Ala Leu Ala Pro Lys Tyr Asp Glu Leu Ala
385 390 395 400
Ser Leu Tyr Ala Lys Ser Asp Phe Lys Asp Lys Val Val Ile Ala Lys
405 410 415
Val Asp Ala Thr Ala Asn Asp Val Pro Asp Glu Ile Gln Gly Phe Pro
420 425 430
Thr Ile Lys Leu Tyr Pro Ala Gly Asp Lys Lys Asn Pro Val Thr Tyr
435 440 445
Ser Gly Ala Arg Thr Val Glu Asp Phe Ile Glu Phe Ile Lys Glu Asn
450 455 460
Gly Lys Tyr Lys Ala Gly Val Glu Ile Pro Ala Glu Pro Thr Glu Glu
465 470 475 480
Ala Glu Ala Ser Glu Ser Lys Ala Ser Glu Glu Ala Lys Ala Ser Glu
485 490 495
Glu Thr His Asp Glu Leu
500
<210>39
<211>190
<212>PRT
<213>Trichoderma reesei
<400>39
Met Lys Ala Ala Leu Leu Leu Ser Ala Leu Ala Ser Cys Ala Ile Gly
1 5 10 15
Leu Val Ala Ala Ala Ala Glu Asp Phe Lys Ile Glu Val Thr His Pro
20 25 30
Val Glu Cys Asp Arg Lys Thr Gln Lys Gly Asp Lys Leu Ser Met His
35 40 45
Tyr Arg Gly Thr Leu Ala Lys Thr Gly Asp Lys Phe Asp Ala Ser Tyr
50 55 60
Asp Arg Asn Gln Pro Phe Asn Phe Lys Leu Gly Ala Gly Gln Val Ile
65 70 75 80
Lys Gly Trp Asp Gln Gly Leu Leu Asp Met Cys Ile Gly Glu Lys Arg
85 90 95
Thr Leu Thr Ile Pro Pro Glu Leu Gly Tyr Gly Gln Arg Asn Met Gly
100 105 110
Pro Ile Pro Ala Gly Ser Thr Leu Ile Phe Glu Thr Glu Leu Leu Ala
115 120 125
Ile Glu Gly Val Lys Ala Pro Glu Lys Lys Pro Val Pro Glu Thr Pro
130 135 140
Ile Val Glu Lys Pro Ala Glu Glu Thr Glu Glu Ser Val Val Glu Lys
145 150 155 160
Ala Ala Glu Ala Ala Ala Ser Val Ala Ser Glu Ala Val Asp Ala Ala
165 170 175
Lys Thr Val Phe Ala Asp Thr Asp Glu Gly His Gly Glu Leu
180 185 190
<210>40
<211>207
<212>PRT
<213>Trichoderma reesei
<400>40
Met Leu Thr Phe Arg Arg Leu Phe Thr Thr Ala Ile Val Leu Val Val
1 5 10 15
Gly Leu Leu Phe Phe Val Lys Thr Ala Glu Ala Ala Lys Gly Pro Lys
20 25 30
Ile Thr His Lys Val Phe Phe Asp Ile Glu His Gly Asp Glu Lys Leu
35 40 45
Gly Arg Ile Val Leu Gly Leu Tyr Gly Lys Thr Val Pro Glu Thr Ala
50 55 60
Glu Asn Phe Arg Ala Leu Ala Thr Gly Glu Lys Gly Phe Gly Tyr Glu
65 70 75 80
Gly Ser Thr Phe His Arg Val Ile Lys Gln Phe Met Ile Gln Gly Gly
85 90 95
Asp Phe Thr Lys Gly Asp Gly Thr Gly Gly Lys Ser Ile Tyr Gly Asn
100 105 110
Lys Phe Lys Asp Glu Asn Phe Lys Leu Lys His Thr Lys Lys Gly Leu
115 120 125
Leu Ser Met Ala Asn Ala Gly Pro Asp Thr Asn Gly Ser Gln Phe Phe
130 135 140
Ile Thr Thr Val Val Thr Ser Trp Leu Asp Gly Arg His Val Val Phe
145 150 155 160
Gly Glu Val Leu Glu Gly Tyr Asp Ile Val Glu Lys Ile Glu Asn Val
165 170 175
Gln Thr Gly Pro Gly Asp Arg Pro Val Lys Pro Val Lys Ile Ala Lys
180 185 190
Ser Gly Glu Leu Glu Val Pro Pro Glu Gly Ile His Val Glu Leu
195 200 205
<210>41
<211>413
<212>PRT
<213>Trichoderma reesei
<400>41
Met Ile Leu Arg Ala Ala Ile Phe Val Leu Leu Ala Leu Val Ser Leu
1 5 10 15
Ala Val Cys Ala Glu Asp Phe Tyr Lys Val Leu Gly Val Asp Lys Ser
20 25 30
Ala Ser Asp Lys Gln Leu Lys Gln Ala Tyr Arg Gln Leu Ser Lys Lys
35 40 45
Phe His Pro Asp Lys Asn Pro Gly Asp Glu Thr Ala His Glu Lys Phe
50 55 60
Val Leu Val Ser Glu Ala Tyr Glu Val Leu Ser Asp Ser Glu Leu Arg
65 70 75 80
Lys Val Tyr Asp Arg Tyr Gly His Glu Gly Val Lys Ser His Arg Gln
85 90 95
Gly Gly Gly Gly Gly Gly Gly Gly Asp Pro Phe Asp Leu Phe Ser Arg
100 105 110
Phe Phe Gly Gly His Gly His Phe Gly Arg Asn Ser Arg Glu Pro Arg
115 120 125
Gly Ser Asn Ile Glu Val Arg Ile Glu Ile Ser Leu Arg Asp Phe Tyr
130 135 140
Asn Gly Ala Thr Thr Glu Phe Gln Trp Glu Lys Gln His Ile Cys Glu
145 150 155 160
Lys Cys Glu Gly Thr Gly Ser Ala Asp Gly Lys Val Glu Thr Cys Ser
165 170 175
Val Cys Gly Gly His Gly Val Arg Ile Val Lys Gln Gln Leu Val Pro
180 185 190
Gly Met Phe Gln Gln Met Gln Met Arg Cys Asp His Cys Gly Gly Ser
195 200 205
Gly Lys Thr Ile Lys Asn Lys Cys Ser Val Cys His Gly Ser Arg Val
210 215 220
Glu Arg Lys Pro Thr Thr Val Ser Leu Thr Val Glu Arg Gly Ile Ala
225 230 235 240
Arg Asp Ala Lys Val Val Phe Glu Asn Glu Ala Asp Gln Ser Pro Asp
245 250 255
Trp Val Pro Gly Asp Leu Ile Val Asn Leu Gly Glu Lys Ala Pro Ser
260 265 270
Tyr Glu Asp Asn Pro Asp Arg Val Asp Gly Thr Phe Phe Arg Arg Lys
275 280 285
Gly His Asp Leu Tyr Trp Thr Glu Val Leu Ser Leu Arg Glu Ala Trp
290 295 300
Met Gly Gly Trp Thr Arg Asn Leu Thr His Leu Asp Lys His Val Val
305 310 315 320
Arg Leu Gly Arg Glu Arg Gly Gln Val Val Gln Ser Gly Leu Val Glu
325 330 335
Thr Ile Pro Gly Glu Gly Met Pro Ile Trp His Glu Glu Gly Glu Ser
340 345 350
Val Tyr His Thr His Glu Phe Gly Asn Leu Tyr Val Thr Tyr Glu Val
355 360 365
Ile Leu Pro Asp Gln Met Asp Lys Lys Met Glu Ser Glu Phe Trp Asp
370 375 380
Leu Trp Glu Lys Trp Arg Ser Lys Asn Gly Val Asp Leu Gln Lys Asp
385 390 395 400
Leu Gly Arg Pro Glu Pro Gly His Asp His Asp Glu Leu
405 410
<210>42
<211>182
<212>PRT
<213>Trichoderma reesei
<400>42
Met Ala Arg Arg Gln His Leu Thr Ala Thr Val Leu Leu Ala Val Val
1 5 10 15
Leu Phe Phe Ser Ile Thr Tyr Leu Leu Ser Gly Ser Ser Ser Ser Asn
20 25 30
Ala Asp Arg Thr Arg Glu Ala Val Val Ala Glu Pro Lys Ser Glu Phe
35 40 45
Lys Val Asp Phe Asp Gly Met Pro Ala Asn Leu Leu Glu Gly Glu Ser
50 55 60
Ile Ala Pro Lys Leu Glu Asn Ala Thr Leu Lys Ala Glu Leu Gly Arg
65 70 75 80
Ala Thr Trp Lys Phe Met His Thr Met Val Ala Arg Phe Pro Glu Lys
85 90 95
Pro Ser Pro Glu Glu Arg Lys Thr Leu Glu Thr Phe Ile Tyr Leu Phe
100 105 110
Gly Arg Leu Tyr Pro Cys Gly Asp Cys Ala Arg His Phe Arg Gly Leu
115 120 125
Leu Ala Lys Tyr Pro Pro Gln Thr Ser Ser Arg Asn Ala Ala Ala Gly
130 135 140
Trp Leu Cys Phe Val His Asn Gln Val Asn Glu Arg Leu Lys Lys Pro
145 150 155 160
Ile Phe Asp Cys Asn Asn Ile Gly Asp Phe Tyr Asp Cys Gly Cys Gly
165 170 175
Asp Glu Lys Lys Asp Gly
180
<210>43
<211>1070
<212>PRT
<213>Trichoderma reesei
<400>43
Met Val Met Leu Val Ala Ile Ala Leu Ala Trp Leu Gly Cys Ser Leu
1 5 10 15
Leu Arg Pro Val Asp Ala Met Arg Ala Asp Tyr Leu Ala Gln Leu Arg
20 25 30
Gln Glu Thr Val Asp Met Phe Tyr His Gly Tyr Ser Asn Tyr Met Glu
35 40 45
His Ala Phe Pro Glu Asp Glu Leu Arg Pro Ile Ser Cys Thr Pro Leu
50 55 60
Thr Arg Asp Arg Asp Asn Pro Gly Arg Ile Ser Leu Asn Asp Ala Leu
65 70 75 80
Gly Asn Tyr Ser Leu Thr Leu Ile Asp Ser Leu Ser Thr Leu Ala Ile
85 90 95
Leu Ala Gly Gly Pro Gln Asn Gly Pro Tyr Thr Gly Pro Gln Ala Leu
100 105 110
Ser Asp Phe Gln Asp Gly Val Ala Glu Phe Val Arg His Tyr Gly Asp
115 120 125
Gly Arg Ser Gly Pro Ser Gly Ala Gly Ile Arg Ala Arg Gly Phe Asp
130 135 140
Leu Asp Ser Lys Val Gln Val Phe Glu Thr Val Ile Arg Gly Val Gly
145 150 155 160
Gly Leu Leu Ser Ala His Leu Phe Ala Ile Gly Glu Leu Pro Ile Thr
165 170 175
Gly Tyr Val Pro Arg Pro Glu Gly Val Ala Gly Asp Asp Pro Leu Glu
180 185 190
Leu Ala Pro Ile Pro Trp Pro Asn Gly Phe Arg Tyr Asp Gly Gln Leu
195 200 205
Leu Arg Leu Ala Leu Asp Leu Ser Glu Arg Leu Leu Pro Ala Phe Tyr
210 215 220
Thr Pro Thr Gly Ile Pro Tyr Pro Arg Val Asn Leu Arg Ser Gly Ile
225 230 235 240
Pro Phe Tyr Val Asn Ser Pro Leu His Gln Asn Leu Gly Glu Ala Val
245 250 255
Glu Glu Gln Ser Gly Arg Pro Glu Ile Thr Glu Thr Cys Ser Ala Gly
260 265 270
Ala Gly Ser Leu Val Leu Glu Phe Thr Val Leu Ser Arg Leu Thr Gly
275 280 285
Asp Ala Arg Phe Glu Gln Ala Ala Lys Arg Ala Phe Trp Glu Val Trp
290 295 300
His Arg Arg Ser Glu Ile Gly Leu Ile Gly Asn Gly Ile Asp Ala Glu
305 310 315 320
Arg Gly Leu Trp Ile Gly Pro His Ala Gly Ile Gly Ala Gly Met Asp
325 330 335
Ser Phe Phe Glu Tyr Ala Leu Lys Ser His Ile Leu Leu Ser Gly Leu
340 345 350
Gly Met Pro Asn Ala Ser Thr Ser Arg Arg Gln Ser Thr Thr Ser Trp
355 360 365
Leu Asp Pro Asn Ser Leu His Pro Pro Leu Pro Pro Glu Met His Thr
370 375 380
Ser Asp Ala Phe Leu Gln Ala Trp His Gln Ala His Ala Ser Val Lys
385 390 395 400
Arg Tyr Leu Tyr Thr Asp Arg Ser His Phe Pro Tyr Tyr Ser Asn Asn
405 410 415
His Arg Ala Thr Gly Gln Pro Tyr Ala Met Trp Ile Asp Ser Leu Gly
420 425 430
Ala Phe Tyr Pro Gly Leu Leu Ala Leu Ala Gly Glu Val Glu Glu Ala
435 440 445
Ile Glu Ala Asn Leu Val Tyr Thr Ala Leu Trp Thr Arg Tyr Ser Ala
450 455 460
Leu Pro Glu Arg Trp Ser Val Arg Glu Gly Asn Val Glu Ala Gly Ile
465 470 475 480
Gly Trp Trp Pro Gly Arg Pro Glu Phe Ile Glu Ser Thr Tyr His Ile
485 490 495
Tyr Arg Ala Thr Arg Asp Pro Trp Tyr Leu His Val Gly Glu Met Val
500 505 510
Leu Arg Asp Ile Arg Arg Arg Cys Tyr Ala Glu Cys Gly Trp Ala Gly
515 520 525
Leu Gln Asp Val Gln Thr Gly Glu Lys Gln Asp Arg Met Glu Ser Phe
530 535 540
Phe Leu Gly Glu Thr Ala Lys Tyr Met Tyr Leu Leu Phe Asp Pro Asp
545 550 555 560
His Pro Leu Asn Lys Leu Asp Ala Ala Tyr Val Phe Thr Thr Glu Gly
565 570 575
His Pro Leu Ile Ile Pro Lys Ser Lys Arg Gly Ser Gly Ser His Asn
580 585 590
Arg Gln Asp Arg Ala Arg Lys Ala Lys Lys Ser Arg Asp Val Ala Val
595 600 605
Tyr Thr Tyr Tyr Asp Glu Ser Phe Thr Asn Ser Cys Pro Ala Pro Arg
610 615 620
Pro Pro Ser Glu His His Leu Ile Gly Ser Ala Thr Ala Ala Arg Pro
625 630 635 640
Asp Leu Phe Ser Val Ser Arg Phe Thr Asp Leu Tyr Arg Thr Pro Asn
645 650 655
Val His Gly Pro Leu Glu Lys Val Glu Met Arg Asp Lys Lys Lys Gly
660 665 670
Arg Val Val Arg Tyr Arg Ala Thr Ser Asn His Thr Ile Phe Pro Trp
675 680 685
Thr Leu Pro Pro Ala Met Leu Pro Glu Asn Gly Thr Cys Ala Ala Pro
690 695 700
Pro Glu Arg Ile Ile Ser Leu Ile Glu Phe Pro Ala Asn Asp Ile Thr
705 710 715 720
Ser Gly Ile Thr Ser Arg Phe Gly Asn His Leu Ser Trp Gln Thr His
725 730 735
Leu Gly Pro Thr Val Asn Ile Leu Glu Gly Leu Arg Leu Gln Leu Glu
740 745 750
Gln Val Ser Asp Pro Ala Thr Gly Glu Asp Lys Trp Arg Ile Thr His
755 760 765
Ile Gly Asn Thr Gln Leu Gly Arg His Glu Thr Val Phe Phe His Ala
770 775 780
Glu His Val Arg His Leu Lys Asp Glu Val Phe Ser Cys Arg Arg Arg
785 790 795 800
Arg Asp Ala Val Glu Ile Glu Leu Leu Val Asp Lys Pro Ser Asp Thr
805 810 815
Asn Asn Asn Asn Thr Leu Ala Ser Ser Asp Asp Asp Val Val Val Asp
820 825 830
Ala Lys Ala Glu Glu Gln Asp Gly Met Leu Ala Asp Asp Asp Gly Asp
835 840 845
Thr Leu Asn Ala Glu Thr Leu Ser Ser Asn Ser Leu Phe Gln Ser Leu
850 855 860
Leu Arg Ala Val Ser Ser Val Phe Glu Pro Val Tyr Thr Ala Ile Pro
865 870 875 880
Glu Ser Asp Pro Ser Ala Gly Thr Ala Lys Val Tyr Ser Phe Asp Ala
885 890 895
Tyr Thr Ser Thr Gly Pro Gly Ala Tyr Pro Met Pro Ser Ile Ser Asp
900 905 910
Thr Pro Ile Pro Gly Asn Pro Phe Tyr Asn Phe Arg Asn Pro Ala Ser
915 920 925
Asn Phe Pro Trp Ser Thr Val Phe Leu Ala Gly Gln Ala Cys Glu Gly
930 935 940
Pro Leu Pro Ala Ser Ala Pro Arg Glu His Gln Val Ile Val Met Leu
945 950 955 960
Arg Gly Gly Cys Ser Phe Ser Arg Lys Leu Asp Asn Ile Pro Ser Phe
965 970 975
Ser Pro His Asp Arg Ala Leu Gln Leu Val Val Val Leu Asp Glu Pro
980 985 990
Pro Pro Pro Pro Pro Pro Pro Pro Ala Asn Asp Arg Arg Asp Val Thr
995 1000 1005
Arg Pro Leu Leu Asp Thr Glu Gln Thr Thr Pro Lys Gly Met Lys
1010 1015 1020
Arg Leu His Gly Ile Pro Met Val Leu Val Arg Ala Ala Arg Gly
1025 1030 1035
Asp Tyr Glu Leu Phe Gly His Ala Ile Gly Val Gly Met Arg Arg
1040 1045 1050
Lys Tyr Arg Val Glu Ser Gln Gly Leu Val Val Glu Asn Ala Val
1055 1060 1065
Val Leu
1070
<210>44
<211>406
<212>PRT
<213>Trichoderma reesei
<400>44
Met Arg Pro Leu Ala Leu Ile Phe Ala Leu Ile Leu Gly Leu Leu Leu
1 5 10 15
Cys Leu Ala Ala Pro Ala Thr Ala Ser Ser Ser Ser Ser Gln His Ser
20 25 30
Pro Gln Ala Ala Ser Asp Glu Ser Asp Leu Ile Cys His Thr Ser Asn
35 40 45
Pro Asp Glu Cys Tyr Pro Arg Val Phe Val Pro Thr His Glu Phe Gln
50 55 60
Pro Val His Asp Asp Gln Gln Leu Pro Asn Gly Leu His Val Arg Leu
65 70 75 80
Asn Ile Trp Thr Gly Gln Lys Glu Ala Lys Ile Asn Val Pro Asp Glu
85 90 95
Ala Asn Pro Asp Leu Asp Gly Leu Pro Val Asp Gln Ala Val Val Leu
100 105 110
Val Asp Gln Glu Gln Pro Glu Ile Ile Gln Ile Pro Lys Gly Ala Pro
115 120 125
Lys Tyr Asp Asn Val Gly Lys Ile Lys Glu Pro Ala Gln Glu Gly Asp
130 135 140
Ala Gln Thr Glu Ala Ile Ala Phe Ala Glu Thr Phe Asn Met Leu Lys
145 150 155 160
Thr Gly Lys Ser Pro Ser Ala Glu Glu Phe Asp Asn Gly Leu Glu Gly
165 170 175
Leu Glu Glu Leu Ser His Asp Ile Tyr Tyr Gly Leu Lys Ile Thr Glu
180 185 190
Asp Ala Asp Val Val Lys Ala Leu Phe Cys Leu Met Gly Ala Arg Asp
195 200 205
Gly Asp Ala Ser Glu Gly Ala Thr Pro Arg Asp Gln Gln Ala Ala Ala
210 215 220
Ile Leu Ala Gly Ala Leu Ser Asn Asn Pro Ser Ala Leu Ala Glu Ile
225 230 235 240
Ala Lys Ile Trp Pro Glu Leu Leu Asp Ser Ser Cys Pro Arg Asp Gly
245 250 255
Ala Thr Ile Ser Asp Arg Phe Tyr Gln Asp Thr Val Ser Val Ala Asp
260 265 270
Ser Pro Ala Lys Val Lys Ala Ala Val Ser Ala Ile Asn Gly Leu Ile
275 280 285
Lys Asp Gly Ala Ile Arg Lys Gln Phe Leu Glu Asn Ser Gly Met Lys
290 295 300
Gln Leu Leu Ser Val Leu Cys Gln Glu Lys Pro Glu Trp Ala Gly Ala
305 310 315 320
Gln Arg Lys Val Ala Gln Leu Val Leu Asp Thr Phe Leu Asp Glu Asp
325 330 335
Met Gly Ala Gln Leu Gly Gln Trp Pro Arg Gly Lys Ala Ser Asn Asn
340 345 350
Gly Val Cys Ala Ala Pro Glu Thr Ala Leu Asp Asp Gly Cys Trp Asp
355 360 365
Tyr His Ala Asp Arg Met Val Lys Leu His Gly Thr Pro Trp Ser Lys
370 375 380
Glu Leu Lys Gln Arg Leu Gly Asp Ala Arg Lys Ala Asn Ser Lys Leu
385 390 395 400
Pro Asp His Gly Glu Leu
405
<210>45
<211>505
<212>PRT
<213>Trichoderma sp.
<400>45
Ala Ile Gly Pro Val Ala Asp Leu His Ile Val Asn Lys Asp Leu Ala
1 5 10 15
Pro Asp Gly Val Gln Arg Pro Thr Val Leu Ala Gly Gly Thr Phe Pro
20 25 30
Gly Thr Leu Ile Thr Gly Gln Lys Gly Asp Asn Phe Gln Leu Asn Val
35 40 45
Ile Asp Asp Leu Thr Asp Asp Arg Met Leu Thr Pro Thr Ser Ile His
50 55 60
Trp His Gly Phe Phe Gln Lys Gly Thr Ala Trp Ala Asp Gly Pro Ala
65 70 75 80
Phe Val Thr Gln Cys Pro Ile Ile Ala Asp Asn Ser Phe Leu Tyr Asp
85 90 95
Phe Asp Val Pro Asp Gln Ala Gly Thr Phe Trp Tyr His Ser His Leu
100 105 110
Ser Thr Gln Tyr Cys Asp Gly Leu Arg Gly Ala Phe Val Val Tyr Asp
115 120 125
Pro Asn Asp Pro His Lys Asp Leu Tyr Asp Val Asp Asp Gly Gly Thr
130 135 140
Val Ile Thr Leu Ala Asp Trp Tyr His Val Leu Ala Gln Thr Val Val
145 150 155 160
Gly Ala Ala Thr Pro Asp Ser Thr Leu Ile Asn Gly Leu Gly Arg Ser
165 170 175
Gln Thr Gly Pro Ala Asp Ala Glu Leu Ala Val Ile Ser Val Glu His
180 185 190
Asn Lys Arg Tyr Arg Phe Arg Leu Val Ser Ile Ser Cys Asp Pro Asn
195 200 205
Phe Thr Phe Ser Val Asp Gly His Asn Met Thr Val Ile Glu Val Asp
210 215 220
Gly Val Asn Thr Arg Pro Leu Thr Val Asp Ser Ile Gln Ile Phe Ala
225 230 235 240
Gly Gln Arg Tyr Ser Phe Val Leu Asn Ala Asn Gln Pro Glu Asp Asn
245 250 255
Tyr Trp Ile Arg Ala Met Pro Asn Ile Gly Arg Asn Thr Thr Thr Leu
260 265 270
Asp Gly Lys Asn Ala Ala Ile Leu Arg Tyr Lys Asn Ala Ser Val Glu
275 280 285
Glu Pro Lys Thr Val Gly Gly Pro Ala Gln Ser Pro Leu Asn Glu Ala
290 295 300
Asp Leu Arg Pro Leu Val Pro Ala Pro Val Pro Gly Asn Ala Val Pro
305 310 315 320
Gly Gly Ala Asp Ile Asn His Arg Leu Asn Leu Thr Phe Ser Asn Gly
325 330 335
Leu Phe Ser Ile Asn Asn Ala Ser Phe Thr Asn Pro Ser Val Pro Ala
340 345 350
Leu Leu Gln Ile Leu Ser Gly Ala Gln Asn Ala Gln Asp Leu Leu Pro
355 360 365
Thr Gly Ser Tyr Ile Gly Leu Glu Leu Gly Lys Val Val Glu Leu Val
370 375 380
Ile Pro Pro Leu Ala Val Gly Gly Pro His Pro Phe His Leu His Gly
385 390 395 400
His Asn Phe Trp Val Val Arg Ser Ala Gly Ser Asp Glu Tyr Asn Phe
405 410 415
Asp Asp Ala Ile Leu Arg Asp Val Val Ser Ile Gly Ala Gly Thr Asp
420 425 430
Glu Val Thr Ile Arg Phe Val Thr Asp Asn Pro Gly Pro Trp Phe Leu
435 440 445
His Cys His Ile Asp Trp His Leu Glu Ala Gly Leu Ala Ile Val Phe
450 455 460
Ala Glu Gly Ile Asn Gln Thr Ala Ala Ala Asn Pro Thr Pro Gln Ala
465 470 475 480
Trp Asp Glu Leu Cys Pro Lys Tyr Asn Gly Leu Ser Ala Ser Gln Lys
485 490 495
Val Lys Pro Lys Lys Gly Thr Ala Ile
500 505
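The residue lines in the listing above alternate with position-number lines, and each record declares its length in the `<211>` field. As a minimal sketch (not part of the patent), the following Python converts such lines to one-letter code and compares the result with a declared length. The function names `to_one_letter` and `matches_declared_length` are hypothetical, and the sketch assumes only the 20 standard residues appear.

```python
# Three-letter to one-letter codes for the 20 standard amino acids.
THREE_TO_ONE = {
    "Ala": "A", "Arg": "R", "Asn": "N", "Asp": "D", "Cys": "C",
    "Gln": "Q", "Glu": "E", "Gly": "G", "His": "H", "Ile": "I",
    "Leu": "L", "Lys": "K", "Met": "M", "Phe": "F", "Pro": "P",
    "Ser": "S", "Thr": "T", "Trp": "W", "Tyr": "Y", "Val": "V",
}


def to_one_letter(lines):
    """Convert sequence-listing lines to a one-letter string.

    Lines made up only of integers (the position rulers) are skipped.
    Any token that is not a standard three-letter code raises ValueError,
    which also flags fused or misspelled tokens left by text extraction.
    """
    residues = []
    for line in lines:
        tokens = line.split()
        if not tokens or all(tok.isdigit() for tok in tokens):
            continue  # blank line or position ruler
        for tok in tokens:
            if tok not in THREE_TO_ONE:
                raise ValueError(f"unrecognised residue token: {tok!r}")
            residues.append(THREE_TO_ONE[tok])
    return "".join(residues)


def matches_declared_length(sequence, declared_length):
    """Return True if the sequence length equals the <211> value."""
    return len(sequence) == declared_length


if __name__ == "__main__":
    # Final residue line and ruler of SEQ ID NO: 38, used as a short excerpt.
    tail = to_one_letter(["Glu Thr His Asp Glu Leu", "500"])
    print(tail)  # ETHDEL
```

In practice one would pass all lines of a record and compare against its own `<211>` value, for example 502 for SEQ ID NO: 38.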
Claims (21)
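1. A method of producing a desired protein, comprising the steps of:
(a) introducing a first nucleic acid sequence into a host cell, the first nucleic acid sequence comprising a signal sequence operably linked to a desired protein sequence;
(b) expressing the first nucleic acid sequence;
(c) co-expressing a second nucleic acid sequence encoding a chaperone or foldase, the chaperone or foldase being selected from bip1, ero1, pdi1, tig1, prp1, ppi1, ppi2, prp3, prp4, calnexin, and lhs1; and
(d) collecting the desired protein secreted from the host cell.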
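2. The method of claim 1, wherein the first nucleic acid sequence further comprises an enzyme sequence between the signal sequence and the desired protein sequence.
3. The method of claim 2, wherein the enzyme sequence is obtained from a glucoamylase or a CBH1 enzyme.
4. The method of claim 2, wherein the enzyme sequence comprises a full-length enzyme sequence.
5. The method of claim 2, wherein the enzyme sequence comprises a catalytic core domain sequence.
6. The method of claim 5, wherein the first nucleic acid sequence further comprises a linker sequence between the catalytic core domain sequence and the desired protein sequence.
7. The method of claim 1, wherein the desired protein is a laccase.
8. The method of claim 7, wherein the laccase is derived from a filamentous fungus or a yeast.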
9. The method of claim 8, wherein the laccase is derived from Aspergillus, Neurospora, Podospora, Botrytis, Collybia, Cerrena, Stachybotrys, Panus, Thielavia, Fomes, Lentinus, Pleurotus, Trametes, Rhizoctonia, Coprinus, Psathyrella, Myceliophthora, Scytalidium, Phlebia, Coriolus, Spongipellis, Polyporus, Ceriporiopsis subvermispora, Ganoderma tsunodae, or Trichoderma.
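10. The method of claim 9, wherein the laccase is derived from Cerrena laccase A1, A2, B1, B2, B3, C, D1, D2, or E.
11. The method of claim 9, wherein the laccase is derived from the mature protein of Cerrena laccase D.
12. The method of claim 1, wherein the signal sequence encodes a cellobiohydrolase I signal peptide or an NSP24 signal peptide.
13. The method of claim 1, wherein the host is a filamentous fungus.
14. The method of claim 13, wherein the host is an ascomycete.
15. The method of claim 14, wherein the host is Trichoderma.
16. The method of claim 1, wherein the first nucleic acid sequence further comprises a promoter upstream of the signal sequence.
17. The method of claim 16, wherein the promoter is native to the host cell and is not naturally associated with the desired protein sequence.
18. The method of claim 1, wherein the chaperone is BIP1.
19. The method of claim 1, wherein the second nucleic acid sequence is operably linked to a promoter.
20. The method of claim 19, wherein the promoter is native to the host cell and is not naturally associated with the second nucleic acid sequence.
21. The method of claim 2, wherein the desired protein is a laccase, and the laccase is produced as a fusion protein with the enzyme.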
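The element order recited in claims 1, 2, 6 and 16 (optional promoter, signal sequence, optional enzyme sequence, optional linker, desired protein) and the helper list of claim 1(c) can be pictured with a small data model. This is an illustrative sketch only, not a construct taken from the examples of the patent. The class `FirstConstruct`, the function `is_claimed_helper`, and the string labels used in the usage lines are all hypothetical.

```python
from dataclasses import dataclass
from typing import List, Optional

# Chaperones and foldases listed in claim 1(c), lower-cased for comparison.
CLAIMED_HELPERS = {
    "bip1", "ero1", "pdi1", "tig1", "prp1", "ppi1",
    "ppi2", "prp3", "prp4", "calnexin", "lhs1",
}


def is_claimed_helper(name: str) -> bool:
    """Return True if the name is in the claim 1(c) list (case-insensitive)."""
    return name.lower() in CLAIMED_HELPERS


@dataclass
class FirstConstruct:
    """Hypothetical model of the first nucleic acid sequence, 5' to 3'."""
    signal_sequence: str                   # claim 1(a)
    desired_protein: str                   # claim 1(a)
    promoter: Optional[str] = None         # claim 16: upstream of the signal sequence
    carrier_enzyme: Optional[str] = None   # claim 2: between signal and desired protein
    linker: Optional[str] = None           # claim 6: between enzyme core and desired protein

    def layout(self) -> List[str]:
        """Return the elements that are present, in their claimed order."""
        if self.linker and not self.carrier_enzyme:
            # Claim 6 depends on claims 2 and 5, so a linker implies an enzyme sequence.
            raise ValueError("a linker requires an enzyme sequence")
        parts = [self.promoter, self.signal_sequence,
                 self.carrier_enzyme, self.linker, self.desired_protein]
        return [p for p in parts if p]


if __name__ == "__main__":
    construct = FirstConstruct(
        signal_sequence="CBH1 signal peptide",
        desired_protein="laccase",
        carrier_enzyme="CBH1 catalytic core",
        linker="linker",
    )
    print(" > ".join(construct.layout()))
    print(is_claimed_helper("BIP1"))
```

The model only encodes ordering and claim dependencies; it says nothing about which combinations actually improve secretion.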
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US98443007P | 2007-11-01 | 2007-11-01 | |
US60/984,430 | 2007-11-01 | ||
PCT/US2008/081721 WO2009058956A1 (en) | 2007-11-01 | 2008-10-30 | Signal sequences and co-expressed chaperones for improving protein production in a host cell |
Publications (1)
Publication Number | Publication Date |
---|---|
CN101842479A (zh) | 2010-09-22 |
Family
ID=40304979
Family Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN200880114429A Pending CN101842479A (zh) | 2007-11-01 | 2008-10-30 | Signal sequences and co-expressed chaperones for improving protein production in a host cell |
Country Status (10)
Country | Link |
---|---|
US (1) | US20090221030A1 (zh) |
EP (1) | EP2203556A1 (zh) |
JP (3) | JP5252746B2 (zh) |
CN (1) | CN101842479A (zh) |
AU (1) | AU2008318644B2 (zh) |
BR (1) | BRPI0818273A2 (zh) |
CA (1) | CA2704548A1 (zh) |
CO (1) | CO6210760A2 (zh) |
MX (1) | MX2010004325A (zh) |
WO (1) | WO2009058956A1 (zh) |
Cited By (10)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
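CN104694560A (zh) * | 2013-12-04 | 2015-06-10 | 中国科学院天津工业生物技术研究所 | Disulfide isomerase gene Trpdi2 derived from Trichoderma reesei and use thereof |
CN106795503A (zh) * | 2014-08-15 | 2017-05-31 | 丹尼斯科美国公司 | Compositions and methods for improved protein production |
CN109890834A (zh) * | 2016-08-01 | 2019-06-14 | 艾杜罗生物科技公司 | Protein expression enhancer sequences and uses thereof |
CN112391402A (zh) * | 2020-11-17 | 2021-02-23 | 华中科技大学 | Method for increasing the expression level of a target protein in Yarrowia lipolytica |
CN112501141A (zh) * | 2020-10-22 | 2021-03-16 | 重庆中元汇吉生物技术有限公司 | Reagent and expression method for increasing the yield of human thyroid peroxidase |
CN112955547A (zh) * | 2018-06-27 | 2021-06-11 | 贝林格尔·英格海姆Rcv两合公司 | Means and methods for increasing protein expression by using transcription factors |
CN113736817A (zh) * | 2021-10-08 | 2021-12-03 | 枣庄市杰诺生物酶有限公司 | Method for improving the secretion efficiency and enzyme activity of alkaline lipase in Pichia pastoris |
CN114761418A (zh) * | 2019-12-02 | 2022-07-15 | 株式会社Lg化学 | CHO cell-derived protein secretion factor and expression vector comprising the same |
CN117903294A (zh) * | 2024-03-19 | 2024-04-19 | 北京国科星联科技有限公司 | Kluyveromyces marxianus for fermentative production of lactoferrin, and construction method and use thereof |
CN117903295A (zh) * | 2024-03-19 | 2024-04-19 | 北京国科星联科技有限公司 | Kluyveromyces marxianus for secretory expression of lactoferrin, and construction method and use thereof |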
Families Citing this family (17)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
KR20110117064A (ko) | 2008-12-24 | 2011-10-26 | 다니스코 유에스 인크. | Laccases and methods of use thereof at low temperature |
MX2011008924A (es) | 2009-03-03 | 2011-09-21 | Danisco Us Inc | Oxidative decolorization of dyes with enzymatically generated peracid: method, composition and kit of parts. |
FI20095615A0 (fi) * | 2009-06-02 | 2009-06-02 | Oulun Yliopisto | Method for producing natively folded proteins in a prokaryotic host |
EP2630291A1 (en) | 2010-10-18 | 2013-08-28 | Danisco US Inc. | Local color modification of dyed fabrics using a laccase system |
EP2694647A1 (en) | 2011-04-06 | 2014-02-12 | Danisco US Inc. | Laccase variants having increased expression and/or activity |
JP2012235767A (ja) * | 2011-04-27 | 2012-12-06 | Toyota Motor Corp | Mutant microorganism of the genus Trichoderma and method for producing protein using the same |
EP3358000A3 (en) * | 2012-01-05 | 2018-12-19 | Glykos Finland Oy | Protease deficient filamentous fungal cells and methods of use thereof |
CN104066841B (zh) * | 2012-01-23 | 2018-01-02 | 旭硝子株式会社 | Expression vector and method for producing protein |
WO2013174927A1 (en) | 2012-05-23 | 2013-11-28 | Novartis International Pharmaceutical Limited | Production of fucosylated glycoproteins |
WO2014164727A1 (en) | 2013-03-12 | 2014-10-09 | Wisconsin Alumni Research Foundation | A method of treating fungal infection |
RU2016104064A (ru) | 2013-07-10 | 2017-08-15 | Новартис Аг | Filamentous fungal cells with multiple protease deficiency and methods of use thereof |
WO2015158808A2 (en) | 2014-04-17 | 2015-10-22 | Boehringer Ingelheim Rcv Gmbh & Co Kg | Recombinant host cell engineered to overexpress helper proteins |
EP3132039B1 (en) | 2014-04-17 | 2019-07-24 | Boehringer Ingelheim RCV GmbH & Co KG | Recombinant host cell for expressing proteins of interest |
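CN108064266A (zh) | 2014-07-21 | 2018-05-22 | 格利科斯芬兰公司 | Production of glycoproteins with mammalian-like N-glycans in filamentous fungi |
CN112175977B (zh) * | 2020-10-30 | 2022-04-15 | 华中农业大学 | Aspergillus oryzae keratinase gene, expression vector and use thereof |
CN112409464B (zh) * | 2020-11-23 | 2022-04-22 | 江南大学 | Signal peptide mutants for improving extracellular production of recombinant proteins in Bacillus subtilis, and use thereof |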
EP4399295A1 (en) * | 2021-09-09 | 2024-07-17 | International N&H Denmark ApS | Over expression of foldases and chaperones improves protein production |
Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5773245A (en) * | 1992-10-02 | 1998-06-30 | Research Corporation Technologies, Inc. | Methods for increasing secretion of overexpressed proteins |
Family Cites Families (15)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
DK0562003T4 (en) * | 1990-12-10 | 2015-07-13 | Danisco Us Inc | Improved saccharification of cellulose by cloning and amplification of the β-glucosidase gene from Trichoderma reesei |
US5861271A (en) * | 1993-12-17 | 1999-01-19 | Fowler; Timothy | Cellulase enzymes and systems for their expressions |
US6008029A (en) * | 1995-08-25 | 1999-12-28 | Novo Nordisk Biotech Inc. | Purified coprinus laccases and nucleic acids encoding the same |
CA2286307A1 (en) * | 1997-04-07 | 1998-10-15 | Unilever Plc | Agrobacterium mediated transformation of moulds, in particular those belonging to the genus aspergillus |
DE69830743T2 (de) * | 1997-11-21 | 2006-04-27 | Novozymes A/S | Protease variants and compositions |
US6268328B1 (en) * | 1998-12-18 | 2001-07-31 | Genencor International, Inc. | Variant EGIII-like cellulase compositions |
CA2403842C (en) * | 2000-03-24 | 2012-09-25 | Genencor International, Inc. | Increased production of secreted proteins by recombinant eukaryotic cells |
US6358733B1 (en) * | 2000-05-19 | 2002-03-19 | Apolife, Inc. | Expression of heterologous multi-domain proteins in yeast |
JP2002065282A (ja) * | 2000-09-04 | 2002-03-05 | Iwate Prefecture | Shiitake laccase gene |
JP4834554B2 (ja) * | 2003-11-06 | 2011-12-14 | ジェネンコー・インターナショナル・インク | Expression of protease inhibitors and variants thereof in filamentous fungi |
EP1692160B1 (en) * | 2003-11-06 | 2010-10-27 | Danisco US Inc. | Fgf-5 binding and supported peptides |
CA2567570C (en) * | 2004-05-27 | 2014-07-08 | Genencor International, Inc. | Aspergillus kawachi acid-stable alpha amylase and applications in granular starch hydrolysis |
BRPI0519766A2 (pt) * | 2004-12-30 | 2009-03-10 | Genencor Int | Fungal acid proteases |
MX2009005734A (es) * | 2006-12-18 | 2009-06-10 | Danisco Us Inc Genencor Div | Laccase mediators and methods of use. |
WO2008115596A2 (en) * | 2007-03-21 | 2008-09-25 | Danisco Us, Inc., Genencor Division | Over expression of foldases and chaperones improves protein production |
-
2008
- 2008-10-30 CN CN200880114429A patent/CN101842479A/zh active Pending
- 2008-10-30 EP EP08844940A patent/EP2203556A1/en not_active Withdrawn
- 2008-10-30 CA CA2704548A patent/CA2704548A1/en not_active Abandoned
- 2008-10-30 US US12/261,306 patent/US20090221030A1/en not_active Abandoned
- 2008-10-30 JP JP2010532232A patent/JP5252746B2/ja not_active Expired - Fee Related
- 2008-10-30 BR BRPI0818273-6A2A patent/BRPI0818273A2/pt not_active IP Right Cessation
- 2008-10-30 AU AU2008318644A patent/AU2008318644B2/en not_active Ceased
- 2008-10-30 MX MX2010004325A patent/MX2010004325A/es active IP Right Grant
- 2008-10-30 WO PCT/US2008/081721 patent/WO2009058956A1/en active Application Filing
-
2010
- 2010-04-23 CO CO10048157A patent/CO6210760A2/es not_active Application Discontinuation
-
2013
- 2013-01-22 JP JP2013009289A patent/JP2013121354A/ja active Pending
-
2014
- 2014-09-30 JP JP2014200112A patent/JP2015027308A/ja not_active Withdrawn
Non-Patent Citations (3)
Title |
---|
JARI J RAUTIO ET AL.: "Physiological evaluation of the filamentous fungus Trichoderma", 《BMC BIOTECHNOLOGY》 * |
LAURA-LEENA KIISKINEN ET AL.: "Expression of Melanocarpus albomyces laccase in Trichoderma reesei and characterization of the purified enzyme", 《MICROBIOLOGY》 * |
P. J. PUNT ET AL.: "Analysis of the role of the gene bipA, encoding the major endoplasmic reticulum chaperone protein in the secretion of homologous and heterologous proteins in black Aspergilli", 《APPL MICROBIOL BIOTECHNOL》 * |
Cited By (12)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
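CN104694560A (zh) * | 2013-12-04 | 2015-06-10 | Tianjin Institute of Industrial Biotechnology, Chinese Academy of Sciences | Disulfide bond isomerase gene Trpdi2 derived from Trichoderma reesei and use thereof |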
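CN106795503A (zh) * | 2014-08-15 | 2017-05-31 | Danisco US Inc. | Compositions and methods for improved protein production |
CN109890834A (zh) * | 2016-08-01 | 2019-06-14 | Aduro Biotech | Protein expression enhancer sequences and uses thereof |
CN112955547A (zh) * | 2018-06-27 | 2021-06-11 | Boehringer Ingelheim RCV GmbH & Co KG | Means and methods for increasing protein expression by using transcription factors |
CN114761418A (zh) * | 2019-12-02 | 2022-07-15 | LG Chem, Ltd. | CHO cell-derived protein secretion factor and expression vector comprising the same |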
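CN112501141A (zh) * | 2020-10-22 | 2021-03-16 | Chongqing Zhongyuan Huiji Biotechnology Co., Ltd. | Reagent and expression method for increasing the yield of human thyroid peroxidase |
CN112391402A (zh) * | 2020-11-17 | 2021-02-23 | Huazhong University of Science and Technology | Method for increasing the expression level of a target protein in Yarrowia lipolytica |
CN113736817A (zh) * | 2021-10-08 | 2021-12-03 | Zaozhuang Jienuo Enzyme Co., Ltd. | Method for improving the secretion efficiency and enzyme activity of alkaline lipase in Pichia pastoris |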
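CN117903294A (zh) * | 2024-03-19 | 2024-04-19 | Beijing Guoke Xinglian Technology Co., Ltd. | Kluyveromyces marxianus for fermentative production of lactoferrin, and construction method and application thereof |
CN117903295A (zh) * | 2024-03-19 | 2024-04-19 | Beijing Guoke Xinglian Technology Co., Ltd. | Kluyveromyces marxianus for secretory expression of lactoferrin, and construction method and application thereof |
CN117903295B (zh) * | 2024-03-19 | 2024-05-31 | Beijing Guoke Xinglian Technology Co., Ltd. | Kluyveromyces marxianus for secretory expression of lactoferrin, and construction method and application thereof |
CN117903294B (zh) * | 2024-03-19 | 2024-06-14 | Beijing Guoke Xinglian Technology Co., Ltd. | Kluyveromyces marxianus for fermentative production of lactoferrin, and construction method and application thereof |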
Also Published As
Publication number | Publication date |
---|---|
BRPI0818273A2 (pt) | 2014-10-14 |
AU2008318644B2 (en) | 2013-05-02 |
JP2013121354A (ja) | 2013-06-20 |
JP2015027308A (ja) | 2015-02-12 |
AU2008318644A1 (en) | 2009-05-07 |
MX2010004325A (es) | 2010-05-20 |
JP5252746B2 (ja) | 2013-07-31 |
CO6210760A2 (es) | 2010-10-20 |
CA2704548A1 (en) | 2009-05-07 |
JP2011502483A (ja) | 2011-01-27 |
EP2203556A1 (en) | 2010-07-07 |
US20090221030A1 (en) | 2009-09-03 |
WO2009058956A1 (en) | 2009-05-07 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN101842479A (zh) | Signal sequences and co-expressed chaperones for improving protein production in a host cell | |
KR102274445B1 (ko) | Methods for genomic integration | |
AU2016203445B2 (en) | Integration of a polynucleotide encoding a polypeptide that catalyzes pyruvate to acetolactate conversion | |
CN109563505A (zh) | Assembly system for eukaryotic cells | |
WO2017165859A1 (en) | Modified viral capsid proteins | |
KR102711998B1 (ko) | Engineered and fully functional customized glycoproteins | |
AU2018280116B2 (en) | Enhanced modified viral capsid proteins | |
AU2013337574B2 (en) | Cytochrome P450 and cytochrome P450 reductase polypeptides, encoding nucleic acid molecules and uses thereof | |
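CN108138121B (zh) | High-level production of long-chain dicarboxylic acids with microbes | |
KR102593668B1 (ko) | Protein production in filamentous fungal cells in the absence of inducing substrates | |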
US20030167538A1 (en) | Use of the maize x112 mutant ahas 2 gene and imidazolinone herbicides for selection of transgenic monocots, maize, rice and wheat plants resistant to the imidazolinone herbicides | |
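KR20180053684A (ko) | Fungal production of FDCA | |
KR20140099224A (ko) | Keto-isovalerate decarboxylase enzymes and methods of use thereof | |
DK2443248T3 (en) | Improvement of long-chain polyunsaturated omega-3 and omega-6 fatty acid biosynthesis by expression of acyl-CoA lysophospholipid acyltransferases | |
CN114181957B (zh) | Stable T7 expression system based on a viral capping enzyme, and method for expressing proteins in eukaryotes using the same | |
KR20220161297A (ko) | Novel cell line | |
CN111094569A (zh) | Light-controllable viral protein, gene thereof, and viral vector comprising the gene | |
CN101883843A (zh) | Disruption of peroxisome biogenesis factor protein (PEX) to alter polyunsaturated fatty acid and total lipid content in oleaginous eukaryotic organisms | |
KR102287880B1 (ko) | Method for modifying a target site of double-stranded DNA in a cell | |
TW202228728A (zh) | Compositions and methods for simultaneously modulating expression of genes | |
CN112513072A (zh) | Use of lentiviral vector-transduced T-Rapa cells for ameliorating lysosomal storage disorders | |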
US20240165154A1 (en) | Methods and agents for modulating adoptive immunotherapy | |
KR20230042073A (ko) | Methods of using modified enzymes to produce substituted cannabinoids and precursors, and cells containing the same | |
WO1998049320A1 (en) | SECRETED α-AMYLASE AS A REPORTER GENE | |
CN116710108A (zh) | Compositions and methods for simultaneously modulating expression of genes |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
REG | Reference to a national code | Ref country code: HK; Ref legal event code: DE; Ref document number: 1148556; Country of ref document: HK |
C02 | Deemed withdrawal of patent application after publication (patent law 2001) | ||
WD01 | Invention patent application deemed withdrawn after publication | Application publication date: 20100922 |
REG | Reference to a national code | Ref country code: HK; Ref legal event code: WD; Ref document number: 1148556; Country of ref document: HK |