CN114599778A - 生产多胺缀合物的酵母 - Google Patents
生产多胺缀合物的酵母 Download PDFInfo
- Publication number
- CN114599778A CN114599778A CN202080075687.9A CN202080075687A CN114599778A CN 114599778 A CN114599778 A CN 114599778A CN 202080075687 A CN202080075687 A CN 202080075687A CN 114599778 A CN114599778 A CN 114599778A
- Authority
- CN
- China
- Prior art keywords
- dna
- synthase
- primer
- leu
- glutathione
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 229920000768 polyamine Polymers 0.000 title claims abstract description 115
- 240000004808 Saccharomyces cerevisiae Species 0.000 title claims description 75
- 108090000623 proteins and genes Proteins 0.000 claims abstract description 153
- RWSXRVCMGQZWBV-WDSKDSINSA-N glutathione Natural products OC(=O)[C@@H](N)CCC(=O)N[C@@H](CS)C(=O)NCC(O)=O RWSXRVCMGQZWBV-WDSKDSINSA-N 0.000 claims abstract description 103
- 229960003180 glutathione Drugs 0.000 claims abstract description 70
- 210000005253 yeast cell Anatomy 0.000 claims abstract description 68
- 108010024636 Glutathione Proteins 0.000 claims abstract description 56
- 238000004519 manufacturing process Methods 0.000 claims abstract description 26
- 108010089000 polyamine oxidase Proteins 0.000 claims abstract description 18
- 108090000364 Ligases Proteins 0.000 claims abstract description 17
- 102000003960 Ligases Human genes 0.000 claims abstract description 17
- 102100037209 Peroxisomal N(1)-acetyl-spermine/spermidine oxidase Human genes 0.000 claims abstract description 17
- ATHGHQPFGPMSJY-UHFFFAOYSA-N spermidine Chemical compound NCCCCNCCCN ATHGHQPFGPMSJY-UHFFFAOYSA-N 0.000 claims description 82
- 235000014680 Saccharomyces cerevisiae Nutrition 0.000 claims description 72
- PFNFFQXMRSDOHW-UHFFFAOYSA-N spermine Chemical compound NCCCNCCCCNCCCN PFNFFQXMRSDOHW-UHFFFAOYSA-N 0.000 claims description 52
- 210000004027 cell Anatomy 0.000 claims description 51
- 239000002773 nucleotide Substances 0.000 claims description 48
- 125000003729 nucleotide group Chemical group 0.000 claims description 48
- 108090000765 processed proteins & peptides Proteins 0.000 claims description 45
- 229940063673 spermidine Drugs 0.000 claims description 41
- 102000004196 processed proteins & peptides Human genes 0.000 claims description 38
- 229920001184 polypeptide Polymers 0.000 claims description 37
- 108010051753 Spermidine Synthase Proteins 0.000 claims description 30
- 102100030413 Spermidine synthase Human genes 0.000 claims description 29
- 229940063675 spermine Drugs 0.000 claims description 26
- 241000223104 Trypanosoma Species 0.000 claims description 25
- 238000000034 method Methods 0.000 claims description 25
- 108010071698 Spermine synthase Proteins 0.000 claims description 22
- 102100037616 Spermine synthase Human genes 0.000 claims description 22
- 230000002018 overexpression Effects 0.000 claims description 19
- 229960003067 cystine Drugs 0.000 claims description 16
- KIDHWZJUCRJVML-UHFFFAOYSA-N putrescine Chemical compound NCCCCN KIDHWZJUCRJVML-UHFFFAOYSA-N 0.000 claims description 16
- 241000588724 Escherichia coli Species 0.000 claims description 13
- 241000219195 Arabidopsis thaliana Species 0.000 claims description 8
- 239000005700 Putrescine Substances 0.000 claims description 8
- 241000223105 Trypanosoma brucei Species 0.000 claims description 8
- 239000001963 growth medium Substances 0.000 claims description 8
- VHRGRCVQAFMJIZ-UHFFFAOYSA-N cadaverine Chemical compound NCCCCCN VHRGRCVQAFMJIZ-UHFFFAOYSA-N 0.000 claims description 6
- 102000015833 Cystatin Human genes 0.000 claims description 5
- 108050004038 cystatin Proteins 0.000 claims description 5
- 241000346858 Senecio vernalis Species 0.000 claims description 4
- XFNJVJPLKCPIBV-UHFFFAOYSA-N trimethylenediamine Chemical compound NCCCN XFNJVJPLKCPIBV-UHFFFAOYSA-N 0.000 claims description 4
- QYPPJABKJHAVHS-UHFFFAOYSA-N Agmatine Natural products NCCCCNC(N)=N QYPPJABKJHAVHS-UHFFFAOYSA-N 0.000 claims description 3
- 241000190944 Blastochloris viridis Species 0.000 claims description 3
- QYPPJABKJHAVHS-UHFFFAOYSA-P agmatinium(2+) Chemical compound NC(=[NH2+])NCCCC[NH3+] QYPPJABKJHAVHS-UHFFFAOYSA-P 0.000 claims description 3
- 238000012258 culturing Methods 0.000 claims description 3
- 241000219194 Arabidopsis Species 0.000 claims description 2
- 108010036164 Glutathione synthase Proteins 0.000 claims description 2
- 240000003705 Senecio vulgaris Species 0.000 claims description 2
- 108700026220 vif Genes Proteins 0.000 claims description 2
- LEVWYRKDKASIDU-QWWZWVQMSA-N D-cystine Chemical compound OC(=O)[C@H](N)CSSC[C@@H](N)C(O)=O LEVWYRKDKASIDU-QWWZWVQMSA-N 0.000 claims 4
- 241001008148 Aster indicus Species 0.000 claims 1
- 101710187553 Spermine synthase SPE4 Proteins 0.000 claims 1
- 108020004414 DNA Proteins 0.000 description 270
- 150000007523 nucleic acids Chemical class 0.000 description 46
- 102000039446 nucleic acids Human genes 0.000 description 32
- 108020004707 nucleic acids Proteins 0.000 description 32
- 102000004190 Enzymes Human genes 0.000 description 29
- 108090000790 Enzymes Proteins 0.000 description 29
- OKKJLVBELUTLKV-UHFFFAOYSA-N Methanol Chemical compound OC OKKJLVBELUTLKV-UHFFFAOYSA-N 0.000 description 27
- 235000001014 amino acid Nutrition 0.000 description 22
- 102000004169 proteins and genes Human genes 0.000 description 22
- 229940024606 amino acid Drugs 0.000 description 21
- 150000001413 amino acids Chemical class 0.000 description 20
- 230000004048 modification Effects 0.000 description 20
- 238000012986 modification Methods 0.000 description 20
- 235000018102 proteins Nutrition 0.000 description 20
- 125000003275 alpha amino acid group Chemical group 0.000 description 19
- 230000000694 effects Effects 0.000 description 18
- 239000013612 plasmid Substances 0.000 description 18
- 238000006243 chemical reaction Methods 0.000 description 17
- 230000002255 enzymatic effect Effects 0.000 description 17
- 108091028043 Nucleic acid sequence Proteins 0.000 description 16
- 230000015572 biosynthetic process Effects 0.000 description 16
- 239000000047 product Substances 0.000 description 15
- 238000006467 substitution reaction Methods 0.000 description 15
- 108020004999 messenger RNA Proteins 0.000 description 14
- 238000013518 transcription Methods 0.000 description 14
- 230000035897 transcription Effects 0.000 description 14
- 230000009466 transformation Effects 0.000 description 14
- 238000000855 fermentation Methods 0.000 description 13
- 230000004151 fermentation Effects 0.000 description 13
- 108091033319 polynucleotide Proteins 0.000 description 13
- 102000040430 polynucleotide Human genes 0.000 description 13
- 239000002157 polynucleotide Substances 0.000 description 13
- LEVWYRKDKASIDU-IMJSIDKUSA-N L-cystine Chemical compound [O-]C(=O)[C@@H]([NH3+])CSSC[C@H]([NH3+])C([O-])=O LEVWYRKDKASIDU-IMJSIDKUSA-N 0.000 description 12
- 238000012217 deletion Methods 0.000 description 12
- 230000037430 deletion Effects 0.000 description 12
- 230000002068 genetic effect Effects 0.000 description 12
- 230000037361 pathway Effects 0.000 description 12
- WHUUTDBJXJRKMK-UHFFFAOYSA-N Glutamic acid Natural products OC(=O)C(N)CCC(O)=O WHUUTDBJXJRKMK-UHFFFAOYSA-N 0.000 description 11
- 230000014509 gene expression Effects 0.000 description 11
- 239000002609 medium Substances 0.000 description 11
- -1 phenol amides Chemical class 0.000 description 11
- 241000894007 species Species 0.000 description 11
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 10
- 241000186226 Corynebacterium glutamicum Species 0.000 description 10
- 239000012634 fragment Substances 0.000 description 10
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 9
- 230000001965 increasing effect Effects 0.000 description 9
- 239000000523 sample Substances 0.000 description 9
- 108020004705 Codon Proteins 0.000 description 8
- AHLPHDHHMVZTML-BYPYZUCNSA-N L-Ornithine Chemical compound NCCC[C@H](N)C(O)=O AHLPHDHHMVZTML-BYPYZUCNSA-N 0.000 description 8
- 101100010928 Saccharolobus solfataricus (strain ATCC 35092 / DSM 1617 / JCM 11322 / P2) tuf gene Proteins 0.000 description 8
- 101150001810 TEAD1 gene Proteins 0.000 description 8
- 101150074253 TEF1 gene Proteins 0.000 description 8
- 102100029898 Transcriptional enhancer factor TEF-1 Human genes 0.000 description 8
- 230000035772 mutation Effects 0.000 description 8
- 101000881210 Arabidopsis thaliana Spermine synthase Proteins 0.000 description 7
- 241000235648 Pichia Species 0.000 description 7
- 241000235347 Schizosaccharomyces pombe Species 0.000 description 7
- 238000012239 gene modification Methods 0.000 description 7
- 230000005017 genetic modification Effects 0.000 description 7
- 235000013617 genetically modified food Nutrition 0.000 description 7
- 238000009396 hybridization Methods 0.000 description 7
- 230000022532 regulation of transcription, DNA-dependent Effects 0.000 description 7
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 7
- IJGRMHOSHXDMSA-UHFFFAOYSA-N Atomic nitrogen Chemical compound N#N IJGRMHOSHXDMSA-UHFFFAOYSA-N 0.000 description 6
- 241000196324 Embryophyta Species 0.000 description 6
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 6
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 6
- 108010013835 arginine glutamate Proteins 0.000 description 6
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 6
- 230000004060 metabolic process Effects 0.000 description 6
- 239000000203 mixture Substances 0.000 description 6
- 238000013519 translation Methods 0.000 description 6
- 101000579123 Homo sapiens Phosphoglycerate kinase 1 Proteins 0.000 description 5
- 102100023162 L-serine dehydratase/L-threonine deaminase Human genes 0.000 description 5
- 108010043075 L-threonine 3-dehydrogenase Proteins 0.000 description 5
- 241000880493 Leptailurus serval Species 0.000 description 5
- KJWZYMMLVHIVSU-IYCNHOCDSA-N PGK1 Chemical compound CCCCC[C@H](O)\C=C\[C@@H]1[C@@H](CCCCCCC(O)=O)C(=O)CC1=O KJWZYMMLVHIVSU-IYCNHOCDSA-N 0.000 description 5
- 102100028251 Phosphoglycerate kinase 1 Human genes 0.000 description 5
- 241000223252 Rhodotorula Species 0.000 description 5
- MEFKEPWMEQBLKI-AIRLBKTGSA-N S-adenosyl-L-methioninate Chemical compound O[C@@H]1[C@H](O)[C@@H](C[S+](CC[C@H](N)C([O-])=O)C)O[C@H]1N1C2=NC=NC(N)=C2N=C1 MEFKEPWMEQBLKI-AIRLBKTGSA-N 0.000 description 5
- 102100033598 Triosephosphate isomerase Human genes 0.000 description 5
- 238000007792 addition Methods 0.000 description 5
- 229910052799 carbon Inorganic materials 0.000 description 5
- 108010018006 histidylserine Proteins 0.000 description 5
- 238000003780 insertion Methods 0.000 description 5
- 230000037431 insertion Effects 0.000 description 5
- 238000012261 overproduction Methods 0.000 description 5
- 108010070643 prolylglutamic acid Proteins 0.000 description 5
- 230000009467 reduction Effects 0.000 description 5
- 108010061238 threonyl-glycine Proteins 0.000 description 5
- 108091033409 CRISPR Proteins 0.000 description 4
- OKTJSMMVPCPJKN-UHFFFAOYSA-N Carbon Chemical compound [C] OKTJSMMVPCPJKN-UHFFFAOYSA-N 0.000 description 4
- 101150009006 HIS3 gene Proteins 0.000 description 4
- 241000222697 Leishmania infantum Species 0.000 description 4
- HASRFYOMVPJRPU-SRVKXCTJSA-N Leu-Arg-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(O)=O)C(O)=O HASRFYOMVPJRPU-SRVKXCTJSA-N 0.000 description 4
- 108700026244 Open Reading Frames Proteins 0.000 description 4
- AHLPHDHHMVZTML-UHFFFAOYSA-N Orn-delta-NH2 Natural products NCCCC(N)C(O)=O AHLPHDHHMVZTML-UHFFFAOYSA-N 0.000 description 4
- 101100394989 Rhodopseudomonas palustris (strain ATCC BAA-98 / CGA009) hisI gene Proteins 0.000 description 4
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 4
- 108020004566 Transfer RNA Proteins 0.000 description 4
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 4
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 4
- 230000001580 bacterial effect Effects 0.000 description 4
- 210000003527 eukaryotic cell Anatomy 0.000 description 4
- 238000000605 extraction Methods 0.000 description 4
- 108010049041 glutamylalanine Proteins 0.000 description 4
- 108010050848 glycylleucine Proteins 0.000 description 4
- 108010057821 leucylproline Proteins 0.000 description 4
- 238000009630 liquid culture Methods 0.000 description 4
- 108010003700 lysyl aspartic acid Proteins 0.000 description 4
- BDAGIHXWWSANSR-UHFFFAOYSA-N methanoic acid Natural products OC=O BDAGIHXWWSANSR-UHFFFAOYSA-N 0.000 description 4
- 229930014626 natural product Natural products 0.000 description 4
- 229910052757 nitrogen Inorganic materials 0.000 description 4
- 229960003104 ornithine Drugs 0.000 description 4
- 239000000243 solution Substances 0.000 description 4
- 239000000126 substance Substances 0.000 description 4
- 230000008685 targeting Effects 0.000 description 4
- FQVLRGLGWNWPSS-BXBUPLCLSA-N (4r,7s,10s,13s,16r)-16-acetamido-13-(1h-imidazol-5-ylmethyl)-10-methyl-6,9,12,15-tetraoxo-7-propan-2-yl-1,2-dithia-5,8,11,14-tetrazacycloheptadecane-4-carboxamide Chemical compound N1C(=O)[C@@H](NC(C)=O)CSSC[C@@H](C(N)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](C)NC(=O)[C@@H]1CC1=CN=CN1 FQVLRGLGWNWPSS-BXBUPLCLSA-N 0.000 description 3
- 101710163881 5,6-dihydroxyindole-2-carboxylic acid oxidase Proteins 0.000 description 3
- CSCPPACGZOOCGX-UHFFFAOYSA-N Acetone Chemical compound CC(C)=O CSCPPACGZOOCGX-UHFFFAOYSA-N 0.000 description 3
- WEVYAHXRMPXWCK-UHFFFAOYSA-N Acetonitrile Chemical compound CC#N WEVYAHXRMPXWCK-UHFFFAOYSA-N 0.000 description 3
- 102100034035 Alcohol dehydrogenase 1A Human genes 0.000 description 3
- 102100036826 Aldehyde oxidase Human genes 0.000 description 3
- 101100288313 Arabidopsis thaliana KTI4 gene Proteins 0.000 description 3
- 101000600602 Aspergillus flavus (strain ATCC MYA-384 / AF70) Endopolygalacturonase A Proteins 0.000 description 3
- 101000600608 Aspergillus flavus (strain ATCC MYA-384 / AF70) Endopolygalacturonase B Proteins 0.000 description 3
- 241000894006 Bacteria Species 0.000 description 3
- 101100327917 Caenorhabditis elegans chup-1 gene Proteins 0.000 description 3
- 108010078791 Carrier Proteins Proteins 0.000 description 3
- 241001508786 Citeromyces matritensis Species 0.000 description 3
- 101150094690 GAL1 gene Proteins 0.000 description 3
- 101150038242 GAL10 gene Proteins 0.000 description 3
- 101150037782 GAL2 gene Proteins 0.000 description 3
- 102100028501 Galanin peptides Human genes 0.000 description 3
- 102100024637 Galectin-10 Human genes 0.000 description 3
- 102100021735 Galectin-2 Human genes 0.000 description 3
- 102100039555 Galectin-7 Human genes 0.000 description 3
- 101000892220 Geobacillus thermodenitrificans (strain NG80-2) Long-chain-alcohol dehydrogenase 1 Proteins 0.000 description 3
- PMSMKNYRZCKVMC-DRZSPHRISA-N Glu-Phe-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CCC(=O)O)N PMSMKNYRZCKVMC-DRZSPHRISA-N 0.000 description 3
- UIQGJYUEQDOODF-KWQFWETISA-N Gly-Tyr-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 UIQGJYUEQDOODF-KWQFWETISA-N 0.000 description 3
- 102100031181 Glyceraldehyde-3-phosphate dehydrogenase Human genes 0.000 description 3
- 102100036669 Glycerol-3-phosphate dehydrogenase [NAD(+)], cytoplasmic Human genes 0.000 description 3
- 101100246753 Halobacterium salinarum (strain ATCC 700922 / JCM 11081 / NRC-1) pyrF gene Proteins 0.000 description 3
- 101000780443 Homo sapiens Alcohol dehydrogenase 1A Proteins 0.000 description 3
- 101000928314 Homo sapiens Aldehyde oxidase Proteins 0.000 description 3
- 101100121078 Homo sapiens GAL gene Proteins 0.000 description 3
- 101000608772 Homo sapiens Galectin-7 Proteins 0.000 description 3
- 101001072574 Homo sapiens Glycerol-3-phosphate dehydrogenase [NAD(+)], cytoplasmic Proteins 0.000 description 3
- 101000951145 Homo sapiens Succinate dehydrogenase [ubiquinone] cytochrome b small subunit, mitochondrial Proteins 0.000 description 3
- 241000235649 Kluyveromyces Species 0.000 description 3
- 241001138401 Kluyveromyces lactis Species 0.000 description 3
- JRJLGNFWYFSJHB-HOCLYGCPSA-N Leu-Gly-Trp Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O JRJLGNFWYFSJHB-HOCLYGCPSA-N 0.000 description 3
- 108091005461 Nucleic proteins Proteins 0.000 description 3
- 101001126848 Rhizobium radiobacter Polygalacturonase Proteins 0.000 description 3
- 241000221523 Rhodotorula toruloides Species 0.000 description 3
- 241000235070 Saccharomyces Species 0.000 description 3
- 101100434411 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) ADH1 gene Proteins 0.000 description 3
- 101100478264 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) SPE4 gene Proteins 0.000 description 3
- 241000235346 Schizosaccharomyces Species 0.000 description 3
- 241000311088 Schwanniomyces Species 0.000 description 3
- 102100038014 Succinate dehydrogenase [ubiquinone] cytochrome b small subunit, mitochondrial Human genes 0.000 description 3
- 101150033985 TPI gene Proteins 0.000 description 3
- 101150032817 TPI1 gene Proteins 0.000 description 3
- 108700015934 Triose-phosphate isomerases Proteins 0.000 description 3
- 101150050575 URA3 gene Proteins 0.000 description 3
- 241000235029 Zygosaccharomyces bailii Species 0.000 description 3
- 101150102866 adc1 gene Proteins 0.000 description 3
- 108010005233 alanylglutamic acid Proteins 0.000 description 3
- 230000003321 amplification Effects 0.000 description 3
- 230000000692 anti-sense effect Effects 0.000 description 3
- 108010077245 asparaginyl-proline Proteins 0.000 description 3
- 108010069205 aspartyl-phenylalanine Proteins 0.000 description 3
- 108010047857 aspartylglycine Proteins 0.000 description 3
- 238000003556 assay Methods 0.000 description 3
- 239000000872 buffer Substances 0.000 description 3
- 239000007853 buffer solution Substances 0.000 description 3
- 238000007796 conventional method Methods 0.000 description 3
- 238000001514 detection method Methods 0.000 description 3
- 239000003814 drug Substances 0.000 description 3
- 239000003623 enhancer Substances 0.000 description 3
- 238000002474 experimental method Methods 0.000 description 3
- 238000003804 extraction from natural source Methods 0.000 description 3
- 239000007789 gas Substances 0.000 description 3
- 238000002873 global sequence alignment Methods 0.000 description 3
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 3
- 108010079547 glutamylmethionine Proteins 0.000 description 3
- 108020004445 glyceraldehyde-3-phosphate dehydrogenase Proteins 0.000 description 3
- 108010019832 glycyl-asparaginyl-glycine Proteins 0.000 description 3
- 108010089804 glycyl-threonine Proteins 0.000 description 3
- 108010015792 glycyllysine Proteins 0.000 description 3
- 108010081551 glycylphenylalanine Proteins 0.000 description 3
- 230000005764 inhibitory process Effects 0.000 description 3
- 230000010354 integration Effects 0.000 description 3
- 108010044374 isoleucyl-tyrosine Proteins 0.000 description 3
- 108010009298 lysylglutamic acid Proteins 0.000 description 3
- 108010017391 lysylvaline Proteins 0.000 description 3
- 239000000463 material Substances 0.000 description 3
- 238000003199 nucleic acid amplification method Methods 0.000 description 3
- 238000003752 polymerase chain reaction Methods 0.000 description 3
- 108010029020 prolylglycine Proteins 0.000 description 3
- 108010015796 prolylisoleucine Proteins 0.000 description 3
- 230000001105 regulatory effect Effects 0.000 description 3
- 150000003384 small molecules Chemical class 0.000 description 3
- 230000001052 transient effect Effects 0.000 description 3
- 238000005406 washing Methods 0.000 description 3
- 102000040650 (ribonucleotides)n+m Human genes 0.000 description 2
- ZAXCZCOUDLENMH-UHFFFAOYSA-N 3,3,3-tetramine Chemical compound NCCCNCCCNCCCN ZAXCZCOUDLENMH-UHFFFAOYSA-N 0.000 description 2
- OSWFIVFLDKOXQC-UHFFFAOYSA-N 4-(3-methoxyphenyl)aniline Chemical compound COC1=CC=CC(C=2C=CC(N)=CC=2)=C1 OSWFIVFLDKOXQC-UHFFFAOYSA-N 0.000 description 2
- FWMNVWWHGCHHJJ-SKKKGAJSSA-N 4-amino-1-[(2r)-6-amino-2-[[(2r)-2-[[(2r)-2-[[(2r)-2-amino-3-phenylpropanoyl]amino]-3-phenylpropanoyl]amino]-4-methylpentanoyl]amino]hexanoyl]piperidine-4-carboxylic acid Chemical compound C([C@H](C(=O)N[C@H](CC(C)C)C(=O)N[C@H](CCCCN)C(=O)N1CCC(N)(CC1)C(O)=O)NC(=O)[C@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 FWMNVWWHGCHHJJ-SKKKGAJSSA-N 0.000 description 2
- 229930024421 Adenine Natural products 0.000 description 2
- GFFGJBXGBJISGV-UHFFFAOYSA-N Adenine Chemical compound NC1=NC=NC2=C1N=CN2 GFFGJBXGBJISGV-UHFFFAOYSA-N 0.000 description 2
- 102100029457 Adenine phosphoribosyltransferase Human genes 0.000 description 2
- 108010024223 Adenine phosphoribosyltransferase Proteins 0.000 description 2
- UCIYCBSJBQGDGM-LPEHRKFASA-N Ala-Arg-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1CCC[C@@H]1C(=O)O)N UCIYCBSJBQGDGM-LPEHRKFASA-N 0.000 description 2
- TTXMOJWKNRJWQJ-FXQIFTODSA-N Ala-Arg-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CCCN=C(N)N TTXMOJWKNRJWQJ-FXQIFTODSA-N 0.000 description 2
- BTYTYHBSJKQBQA-GCJQMDKQSA-N Ala-Asp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C)N)O BTYTYHBSJKQBQA-GCJQMDKQSA-N 0.000 description 2
- VCSABYLVNWQYQE-UHFFFAOYSA-N Ala-Lys-Lys Natural products NCCCCC(NC(=O)C(N)C)C(=O)NC(CCCCN)C(O)=O VCSABYLVNWQYQE-UHFFFAOYSA-N 0.000 description 2
- PXAFZDXYEIIUTF-LKTVYLICSA-N Ala-Trp-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(O)=O)C(O)=O PXAFZDXYEIIUTF-LKTVYLICSA-N 0.000 description 2
- 108020005544 Antisense RNA Proteins 0.000 description 2
- RCFGLXMZDYNRSC-CIUDSAMLSA-N Asn-Lys-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O RCFGLXMZDYNRSC-CIUDSAMLSA-N 0.000 description 2
- WCFCYFDBMNFSPA-ACZMJKKPSA-N Asp-Asp-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(O)=O WCFCYFDBMNFSPA-ACZMJKKPSA-N 0.000 description 2
- PDECQIHABNQRHN-GUBZILKMSA-N Asp-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC(O)=O PDECQIHABNQRHN-GUBZILKMSA-N 0.000 description 2
- KHGPWGKPYHPOIK-QWRGUYRKSA-N Asp-Gly-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O KHGPWGKPYHPOIK-QWRGUYRKSA-N 0.000 description 2
- DPNWSMBUYCLEDG-CIUDSAMLSA-N Asp-Lys-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O DPNWSMBUYCLEDG-CIUDSAMLSA-N 0.000 description 2
- 241000228212 Aspergillus Species 0.000 description 2
- 241000223651 Aureobasidium Species 0.000 description 2
- 101000950981 Bacillus subtilis (strain 168) Catabolic NAD-specific glutamate dehydrogenase RocG Proteins 0.000 description 2
- 108010088278 Branched-chain-amino-acid transaminase Proteins 0.000 description 2
- 241000722885 Brettanomyces Species 0.000 description 2
- 238000010453 CRISPR/Cas method Methods 0.000 description 2
- 241000222120 Candida <Saccharomycetales> Species 0.000 description 2
- 102100024633 Carbonic anhydrase 2 Human genes 0.000 description 2
- 102000014914 Carrier Proteins Human genes 0.000 description 2
- 241000222716 Crithidia Species 0.000 description 2
- 241001527609 Cryptococcus Species 0.000 description 2
- 102000053602 DNA Human genes 0.000 description 2
- 108700033480 EC 2.5.1.45 Proteins 0.000 description 2
- 238000002965 ELISA Methods 0.000 description 2
- 101150015836 ENO1 gene Proteins 0.000 description 2
- CKRUHITYRFNUKW-WDSKDSINSA-N Glu-Asn-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O CKRUHITYRFNUKW-WDSKDSINSA-N 0.000 description 2
- JPHYJQHPILOKHC-ACZMJKKPSA-N Glu-Asp-Asp Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O JPHYJQHPILOKHC-ACZMJKKPSA-N 0.000 description 2
- VIPDPMHGICREIS-GVXVVHGQSA-N Glu-Val-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O VIPDPMHGICREIS-GVXVVHGQSA-N 0.000 description 2
- WQZGKKKJIJFFOK-GASJEMHNSA-N Glucose Natural products OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-GASJEMHNSA-N 0.000 description 2
- 102000016901 Glutamate dehydrogenase Human genes 0.000 description 2
- GGEJHJIXRBTJPD-BYPYZUCNSA-N Gly-Asn-Gly Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O GGEJHJIXRBTJPD-BYPYZUCNSA-N 0.000 description 2
- UFPXDFOYHVEIPI-BYPYZUCNSA-N Gly-Gly-Asp Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O UFPXDFOYHVEIPI-BYPYZUCNSA-N 0.000 description 2
- OLPPXYMMIARYAL-QMMMGPOBSA-N Gly-Gly-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CNC(=O)CN OLPPXYMMIARYAL-QMMMGPOBSA-N 0.000 description 2
- VBOBNHSVQKKTOT-YUMQZZPRSA-N Gly-Lys-Ala Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O VBOBNHSVQKKTOT-YUMQZZPRSA-N 0.000 description 2
- VNNRLUNBJSWZPF-ZKWXMUAHSA-N Gly-Ser-Ile Chemical compound [H]NCC(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VNNRLUNBJSWZPF-ZKWXMUAHSA-N 0.000 description 2
- TVTZEOHWHUVYCG-KYNKHSRBSA-N Gly-Thr-Thr Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O TVTZEOHWHUVYCG-KYNKHSRBSA-N 0.000 description 2
- GJHWILMUOANXTG-WPRPVWTQSA-N Gly-Val-Arg Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GJHWILMUOANXTG-WPRPVWTQSA-N 0.000 description 2
- 108020005004 Guide RNA Proteins 0.000 description 2
- 244000286779 Hansenula anomala Species 0.000 description 2
- 235000014683 Hansenula anomala Nutrition 0.000 description 2
- 101000760643 Homo sapiens Carbonic anhydrase 2 Proteins 0.000 description 2
- 101000801742 Homo sapiens Triosephosphate isomerase Proteins 0.000 description 2
- 108030001875 Homospermidine synthases Proteins 0.000 description 2
- NKRJALPCDNXULF-BYULHYEWSA-N Ile-Asp-Gly Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O NKRJALPCDNXULF-BYULHYEWSA-N 0.000 description 2
- PMMMQRVUMVURGJ-XUXIUFHCSA-N Ile-Leu-Pro Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(O)=O PMMMQRVUMVURGJ-XUXIUFHCSA-N 0.000 description 2
- PXKACEXYLPBMAD-JBDRJPRFSA-N Ile-Ser-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)O)N PXKACEXYLPBMAD-JBDRJPRFSA-N 0.000 description 2
- 108010065920 Insulin Lispro Proteins 0.000 description 2
- 241000235058 Komagataella pastoris Species 0.000 description 2
- HGCNKOLVKRAVHD-UHFFFAOYSA-N L-Met-L-Phe Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 HGCNKOLVKRAVHD-UHFFFAOYSA-N 0.000 description 2
- WHUUTDBJXJRKMK-VKHMYHEASA-N L-glutamic acid Chemical compound OC(=O)[C@@H](N)CCC(O)=O WHUUTDBJXJRKMK-VKHMYHEASA-N 0.000 description 2
- FFEARJCKVFRZRR-BYPYZUCNSA-N L-methionine Chemical compound CSCC[C@H](N)C(O)=O FFEARJCKVFRZRR-BYPYZUCNSA-N 0.000 description 2
- 241000222722 Leishmania <genus> Species 0.000 description 2
- LJHGALIOHLRRQN-DCAQKATOSA-N Leu-Ala-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N LJHGALIOHLRRQN-DCAQKATOSA-N 0.000 description 2
- ULXYQAJWJGLCNR-YUMQZZPRSA-N Leu-Asp-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O ULXYQAJWJGLCNR-YUMQZZPRSA-N 0.000 description 2
- DLCOFDAHNMMQPP-SRVKXCTJSA-N Leu-Asp-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O DLCOFDAHNMMQPP-SRVKXCTJSA-N 0.000 description 2
- PNUCWVAGVNLUMW-CIUDSAMLSA-N Leu-Cys-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O PNUCWVAGVNLUMW-CIUDSAMLSA-N 0.000 description 2
- HGFGEMSVBMCFKK-MNXVOIDGSA-N Leu-Ile-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O HGFGEMSVBMCFKK-MNXVOIDGSA-N 0.000 description 2
- JKSIBWITFMQTOA-XUXIUFHCSA-N Leu-Ile-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O JKSIBWITFMQTOA-XUXIUFHCSA-N 0.000 description 2
- LXKNSJLSGPNHSK-KKUMJFAQSA-N Leu-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N LXKNSJLSGPNHSK-KKUMJFAQSA-N 0.000 description 2
- XZNJZXJZBMBGGS-NHCYSSNCSA-N Leu-Val-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O XZNJZXJZBMBGGS-NHCYSSNCSA-N 0.000 description 2
- CSNNHWWHGAXBCP-UHFFFAOYSA-L Magnesium sulfate Chemical compound [Mg+2].[O-][S+2]([O-])([O-])[O-] CSNNHWWHGAXBCP-UHFFFAOYSA-L 0.000 description 2
- ONGCSGVHCSAATF-CIUDSAMLSA-N Met-Ala-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O ONGCSGVHCSAATF-CIUDSAMLSA-N 0.000 description 2
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 2
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 2
- BQVUABVGYYSDCJ-UHFFFAOYSA-N Nalpha-L-Leucyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)CC(C)C)C(O)=O)=CNC2=C1 BQVUABVGYYSDCJ-UHFFFAOYSA-N 0.000 description 2
- 108020004485 Nonsense Codon Proteins 0.000 description 2
- 102100021079 Ornithine decarboxylase Human genes 0.000 description 2
- 108700005126 Ornithine decarboxylases Proteins 0.000 description 2
- 241001542817 Phaffia Species 0.000 description 2
- DDYIRGBOZVKRFR-AVGNSLFASA-N Phe-Asp-Glu Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N DDYIRGBOZVKRFR-AVGNSLFASA-N 0.000 description 2
- MSSXKZBDKZAHCX-UNQGMJICSA-N Phe-Thr-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O MSSXKZBDKZAHCX-UNQGMJICSA-N 0.000 description 2
- 108091000080 Phosphotransferase Proteins 0.000 description 2
- HAAQQNHQZBOWFO-LURJTMIESA-N Pro-Gly-Gly Chemical compound OC(=O)CNC(=O)CNC(=O)[C@@H]1CCCN1 HAAQQNHQZBOWFO-LURJTMIESA-N 0.000 description 2
- QKDIHFHGHBYTKB-IHRRRGAJSA-N Pro-Ser-Phe Chemical compound N([C@@H](CO)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C(=O)[C@@H]1CCCN1 QKDIHFHGHBYTKB-IHRRRGAJSA-N 0.000 description 2
- SRTCFKGBYBZRHA-ACZMJKKPSA-N Ser-Ala-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SRTCFKGBYBZRHA-ACZMJKKPSA-N 0.000 description 2
- VAUMZJHYZQXZBQ-WHFBIAKZSA-N Ser-Asn-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O VAUMZJHYZQXZBQ-WHFBIAKZSA-N 0.000 description 2
- NLOAIFSWUUFQFR-CIUDSAMLSA-N Ser-Leu-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O NLOAIFSWUUFQFR-CIUDSAMLSA-N 0.000 description 2
- XUDRHBPSPAPDJP-SRVKXCTJSA-N Ser-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CO XUDRHBPSPAPDJP-SRVKXCTJSA-N 0.000 description 2
- 108030001813 Thermospermine synthases Proteins 0.000 description 2
- UKBSDLHIKIXJKH-HJGDQZAQSA-N Thr-Arg-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O UKBSDLHIKIXJKH-HJGDQZAQSA-N 0.000 description 2
- 241000223230 Trichosporon Species 0.000 description 2
- UHXOYRWHIQZAKV-SZMVWBNQSA-N Trp-Pro-Arg Chemical compound O=C([C@H](CC=1C2=CC=CC=C2NC=1)N)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O UHXOYRWHIQZAKV-SZMVWBNQSA-N 0.000 description 2
- IYHNBRUWVBIVJR-IHRRRGAJSA-N Tyr-Gln-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 IYHNBRUWVBIVJR-IHRRRGAJSA-N 0.000 description 2
- KHCSOLAHNLOXJR-BZSNNMDCSA-N Tyr-Leu-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O KHCSOLAHNLOXJR-BZSNNMDCSA-N 0.000 description 2
- ISAKRJDGNUQOIC-UHFFFAOYSA-N Uracil Chemical compound O=C1C=CNC(=O)N1 ISAKRJDGNUQOIC-UHFFFAOYSA-N 0.000 description 2
- LTFLDDDGWOVIHY-NAKRPEOUSA-N Val-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C)NC(=O)[C@H](C(C)C)N LTFLDDDGWOVIHY-NAKRPEOUSA-N 0.000 description 2
- JTWIMNMUYLQNPI-WPRPVWTQSA-N Val-Gly-Arg Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCNC(N)=N JTWIMNMUYLQNPI-WPRPVWTQSA-N 0.000 description 2
- YLRAFVVWZRSZQC-DZKIICNBSA-N Val-Phe-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N YLRAFVVWZRSZQC-DZKIICNBSA-N 0.000 description 2
- YKNOJPJWNVHORX-UNQGMJICSA-N Val-Phe-Thr Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)O)C(O)=O)CC1=CC=CC=C1 YKNOJPJWNVHORX-UNQGMJICSA-N 0.000 description 2
- 241000235013 Yarrowia Species 0.000 description 2
- 241000235015 Yarrowia lipolytica Species 0.000 description 2
- 241000235017 Zygosaccharomyces Species 0.000 description 2
- 241001464778 Zymobacter Species 0.000 description 2
- 101150039629 aah1 gene Proteins 0.000 description 2
- 125000004442 acylamino group Chemical group 0.000 description 2
- 229960000643 adenine Drugs 0.000 description 2
- 108010047495 alanylglycine Proteins 0.000 description 2
- 108010087924 alanylproline Proteins 0.000 description 2
- 108010068380 arginylarginine Proteins 0.000 description 2
- 108010038633 aspartylglutamate Proteins 0.000 description 2
- 230000002238 attenuated effect Effects 0.000 description 2
- 230000001851 biosynthetic effect Effects 0.000 description 2
- OTBHHUPVCYLGQO-UHFFFAOYSA-N bis(3-aminopropyl)amine Chemical compound NCCCNCCCN OTBHHUPVCYLGQO-UHFFFAOYSA-N 0.000 description 2
- 230000006652 catabolic pathway Effects 0.000 description 2
- 230000001413 cellular effect Effects 0.000 description 2
- 238000005119 centrifugation Methods 0.000 description 2
- 230000008859 change Effects 0.000 description 2
- 238000010367 cloning Methods 0.000 description 2
- 239000002299 complementary DNA Substances 0.000 description 2
- 239000003184 complementary RNA Substances 0.000 description 2
- 150000001875 compounds Chemical class 0.000 description 2
- 230000001419 dependent effect Effects 0.000 description 2
- 238000001212 derivatisation Methods 0.000 description 2
- 238000011161 development Methods 0.000 description 2
- 230000018109 developmental process Effects 0.000 description 2
- 201000010099 disease Diseases 0.000 description 2
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 2
- 229940079593 drug Drugs 0.000 description 2
- 230000002708 enhancing effect Effects 0.000 description 2
- 235000019253 formic acid Nutrition 0.000 description 2
- 230000004927 fusion Effects 0.000 description 2
- 239000008103 glucose Substances 0.000 description 2
- 229960002989 glutamic acid Drugs 0.000 description 2
- 235000013922 glutamic acid Nutrition 0.000 description 2
- 239000004220 glutamic acid Substances 0.000 description 2
- 108010040856 glutamyl-cysteinyl-alanine Proteins 0.000 description 2
- XKUKSGPZAADMRA-UHFFFAOYSA-N glycyl-glycyl-glycine Natural products NCC(=O)NCC(=O)NCC(O)=O XKUKSGPZAADMRA-UHFFFAOYSA-N 0.000 description 2
- 108010078326 glycyl-glycyl-valine Proteins 0.000 description 2
- 108010048994 glycyl-tyrosyl-alanine Proteins 0.000 description 2
- 238000004128 high performance liquid chromatography Methods 0.000 description 2
- FDGQSTZJBFJUBT-UHFFFAOYSA-N hypoxanthine Chemical compound O=C1NC=NC2=C1NC=N2 FDGQSTZJBFJUBT-UHFFFAOYSA-N 0.000 description 2
- 230000001939 inductive effect Effects 0.000 description 2
- 239000003112 inhibitor Substances 0.000 description 2
- 150000002500 ions Chemical class 0.000 description 2
- 108010034529 leucyl-lysine Proteins 0.000 description 2
- 108010090333 leucyl-lysyl-proline Proteins 0.000 description 2
- 108010047926 leucyl-lysyl-tyrosine Proteins 0.000 description 2
- 238000004895 liquid chromatography mass spectrometry Methods 0.000 description 2
- 108010064235 lysylglycine Proteins 0.000 description 2
- 239000003550 marker Substances 0.000 description 2
- 229930182817 methionine Natural products 0.000 description 2
- 229960004452 methionine Drugs 0.000 description 2
- 108010068488 methionylphenylalanine Proteins 0.000 description 2
- 239000002679 microRNA Substances 0.000 description 2
- 230000000813 microbial effect Effects 0.000 description 2
- 244000005700 microbiome Species 0.000 description 2
- 230000002438 mitochondrial effect Effects 0.000 description 2
- 238000010369 molecular cloning Methods 0.000 description 2
- RPSHOHKFHRFBAZ-UHFFFAOYSA-N n'-[4-(4-aminobutylamino)butyl]butane-1,4-diamine Chemical compound NCCCCNCCCCNCCCCN RPSHOHKFHRFBAZ-UHFFFAOYSA-N 0.000 description 2
- 238000007857 nested PCR Methods 0.000 description 2
- 230000037434 nonsense mutation Effects 0.000 description 2
- 108010012581 phenylalanylglutamate Proteins 0.000 description 2
- 102000020233 phosphotransferase Human genes 0.000 description 2
- 229920000642 polymer Polymers 0.000 description 2
- 239000002510 pyrogen Substances 0.000 description 2
- 229960002181 saccharomyces boulardii Drugs 0.000 description 2
- 238000002864 sequence alignment Methods 0.000 description 2
- 108010026333 seryl-proline Proteins 0.000 description 2
- 239000011780 sodium chloride Substances 0.000 description 2
- 239000001509 sodium citrate Substances 0.000 description 2
- 239000000758 substrate Substances 0.000 description 2
- 239000006228 supernatant Substances 0.000 description 2
- 238000003786 synthesis reaction Methods 0.000 description 2
- 108010080629 tryptophan-leucine Proteins 0.000 description 2
- 108010038745 tryptophylglycine Proteins 0.000 description 2
- 108010073969 valyllysine Proteins 0.000 description 2
- AXFMEGAFCUULFV-BLFANLJRSA-N (2s)-2-[[(2s)-1-[(2s,3r)-2-amino-3-methylpentanoyl]pyrrolidine-2-carbonyl]amino]pentanedioic acid Chemical compound CC[C@@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O AXFMEGAFCUULFV-BLFANLJRSA-N 0.000 description 1
- HNSDLXPSAYFUHK-UHFFFAOYSA-N 1,4-bis(2-ethylhexyl) sulfosuccinate Chemical compound CCCCC(CC)COC(=O)CC(S(O)(=O)=O)C(=O)OCC(CC)CCCC HNSDLXPSAYFUHK-UHFFFAOYSA-N 0.000 description 1
- UHDGCWIWMRVCDJ-UHFFFAOYSA-N 1-beta-D-Xylofuranosyl-NH-Cytosine Natural products O=C1N=C(N)C=CN1C1C(O)C(O)C(CO)O1 UHDGCWIWMRVCDJ-UHFFFAOYSA-N 0.000 description 1
- 108010034457 5'-methylthioadenosine phosphorylase Proteins 0.000 description 1
- SEHFUALWMUWDKS-UHFFFAOYSA-N 5-fluoroorotic acid Chemical compound OC(=O)C=1NC(=O)NC(=O)C=1F SEHFUALWMUWDKS-UHFFFAOYSA-N 0.000 description 1
- LRSASMSXMSNRBT-UHFFFAOYSA-N 5-methylcytosine Chemical compound CC1=CNC(=O)N=C1N LRSASMSXMSNRBT-UHFFFAOYSA-N 0.000 description 1
- CKOMXBHMKXXTNW-UHFFFAOYSA-N 6-methyladenine Chemical compound CNC1=NC=NC2=C1N=CN2 CKOMXBHMKXXTNW-UHFFFAOYSA-N 0.000 description 1
- 230000005730 ADP ribosylation Effects 0.000 description 1
- 101710165738 Acetylornithine aminotransferase Proteins 0.000 description 1
- 108010052875 Adenine deaminase Proteins 0.000 description 1
- 208000000230 African Trypanosomiasis Diseases 0.000 description 1
- 229920001817 Agar Polymers 0.000 description 1
- 102100036601 Aggrecan core protein Human genes 0.000 description 1
- DKJPOZOEBONHFS-ZLUOBGJFSA-N Ala-Ala-Asp Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O DKJPOZOEBONHFS-ZLUOBGJFSA-N 0.000 description 1
- RLMISHABBKUNFO-WHFBIAKZSA-N Ala-Ala-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O RLMISHABBKUNFO-WHFBIAKZSA-N 0.000 description 1
- SVBXIUDNTRTKHE-CIUDSAMLSA-N Ala-Arg-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O SVBXIUDNTRTKHE-CIUDSAMLSA-N 0.000 description 1
- JBGSZRYCXBPWGX-BQBZGAKWSA-N Ala-Arg-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)[C@@H](N)C)CCCN=C(N)N JBGSZRYCXBPWGX-BQBZGAKWSA-N 0.000 description 1
- STACJSVFHSEZJV-GHCJXIJMSA-N Ala-Asn-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O STACJSVFHSEZJV-GHCJXIJMSA-N 0.000 description 1
- JYEBJTDTPNKQJG-FXQIFTODSA-N Ala-Asn-Met Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCSC)C(=O)O)N JYEBJTDTPNKQJG-FXQIFTODSA-N 0.000 description 1
- XCVRVWZTXPCYJT-BIIVOSGPSA-N Ala-Asn-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N XCVRVWZTXPCYJT-BIIVOSGPSA-N 0.000 description 1
- ZIWWTZWAKYBUOB-CIUDSAMLSA-N Ala-Asp-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O ZIWWTZWAKYBUOB-CIUDSAMLSA-N 0.000 description 1
- BUDNAJYVCUHLSV-ZLUOBGJFSA-N Ala-Asp-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O BUDNAJYVCUHLSV-ZLUOBGJFSA-N 0.000 description 1
- ZODMADSIQZZBSQ-FXQIFTODSA-N Ala-Gln-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZODMADSIQZZBSQ-FXQIFTODSA-N 0.000 description 1
- FUSPCLTUKXQREV-ACZMJKKPSA-N Ala-Glu-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O FUSPCLTUKXQREV-ACZMJKKPSA-N 0.000 description 1
- NWVVKQZOVSTDBQ-CIUDSAMLSA-N Ala-Glu-Arg Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NWVVKQZOVSTDBQ-CIUDSAMLSA-N 0.000 description 1
- BGNLUHXLSAQYRQ-FXQIFTODSA-N Ala-Glu-Gln Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O BGNLUHXLSAQYRQ-FXQIFTODSA-N 0.000 description 1
- PAIHPOGPJVUFJY-WDSKDSINSA-N Ala-Glu-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O PAIHPOGPJVUFJY-WDSKDSINSA-N 0.000 description 1
- HXNNRBHASOSVPG-GUBZILKMSA-N Ala-Glu-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O HXNNRBHASOSVPG-GUBZILKMSA-N 0.000 description 1
- BEMGNWZECGIJOI-WDSKDSINSA-N Ala-Gly-Glu Chemical compound [H]N[C@@H](C)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O BEMGNWZECGIJOI-WDSKDSINSA-N 0.000 description 1
- LMFXXZPPZDCPTA-ZKWXMUAHSA-N Ala-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N LMFXXZPPZDCPTA-ZKWXMUAHSA-N 0.000 description 1
- MQIGTEQXYCRLGK-BQBZGAKWSA-N Ala-Gly-Pro Chemical compound C[C@H](N)C(=O)NCC(=O)N1CCC[C@H]1C(O)=O MQIGTEQXYCRLGK-BQBZGAKWSA-N 0.000 description 1
- TZDNWXDLYFIFPT-BJDJZHNGSA-N Ala-Ile-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O TZDNWXDLYFIFPT-BJDJZHNGSA-N 0.000 description 1
- HHRAXZAYZFFRAM-CIUDSAMLSA-N Ala-Leu-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O HHRAXZAYZFFRAM-CIUDSAMLSA-N 0.000 description 1
- AWZKCUCQJNTBAD-SRVKXCTJSA-N Ala-Leu-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCCN AWZKCUCQJNTBAD-SRVKXCTJSA-N 0.000 description 1
- VHVVPYOJIIQCKS-QEJZJMRPSA-N Ala-Leu-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 VHVVPYOJIIQCKS-QEJZJMRPSA-N 0.000 description 1
- RGQCNKIDEQJEBT-CQDKDKBSSA-N Ala-Leu-Tyr Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 RGQCNKIDEQJEBT-CQDKDKBSSA-N 0.000 description 1
- IAUSCRHURCZUJP-CIUDSAMLSA-N Ala-Lys-Cys Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CS)C(O)=O IAUSCRHURCZUJP-CIUDSAMLSA-N 0.000 description 1
- PMQXMXAASGFUDX-SRVKXCTJSA-N Ala-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CCCCN PMQXMXAASGFUDX-SRVKXCTJSA-N 0.000 description 1
- XUCHENWTTBFODJ-FXQIFTODSA-N Ala-Met-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O XUCHENWTTBFODJ-FXQIFTODSA-N 0.000 description 1
- IHRGVZXPTIQNIP-NAKRPEOUSA-N Ala-Met-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](C)N IHRGVZXPTIQNIP-NAKRPEOUSA-N 0.000 description 1
- GFEDXKNBZMPEDM-KZVJFYERSA-N Ala-Met-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GFEDXKNBZMPEDM-KZVJFYERSA-N 0.000 description 1
- 108010011667 Ala-Phe-Ala Proteins 0.000 description 1
- OSRZOHXQCUFIQG-FPMFFAJLSA-N Ala-Phe-Pro Chemical compound C([C@H](NC(=O)[C@@H]([NH3+])C)C(=O)N1[C@H](CCC1)C([O-])=O)C1=CC=CC=C1 OSRZOHXQCUFIQG-FPMFFAJLSA-N 0.000 description 1
- GMGWOTQMUKYZIE-UBHSHLNASA-N Ala-Pro-Phe Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 GMGWOTQMUKYZIE-UBHSHLNASA-N 0.000 description 1
- KLALXKYLOMZDQT-ZLUOBGJFSA-N Ala-Ser-Asn Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(N)=O KLALXKYLOMZDQT-ZLUOBGJFSA-N 0.000 description 1
- RMAWDDRDTRSZIR-ZLUOBGJFSA-N Ala-Ser-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O RMAWDDRDTRSZIR-ZLUOBGJFSA-N 0.000 description 1
- NZGRHTKZFSVPAN-BIIVOSGPSA-N Ala-Ser-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N NZGRHTKZFSVPAN-BIIVOSGPSA-N 0.000 description 1
- XQNRANMFRPCFFW-GCJQMDKQSA-N Ala-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C)N)O XQNRANMFRPCFFW-GCJQMDKQSA-N 0.000 description 1
- PHQXWZGXKAFWAZ-ZLIFDBKOSA-N Ala-Trp-Lys Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CCCCN)C(O)=O)=CNC2=C1 PHQXWZGXKAFWAZ-ZLIFDBKOSA-N 0.000 description 1
- MRXZVZVKYZELRU-UHFFFAOYSA-N Ala-Trp-Ser-Ser Chemical compound C1=CC=C2C(CC(NC(=O)C(N)C)C(=O)NC(CO)C(=O)NC(CO)C(O)=O)=CNC2=C1 MRXZVZVKYZELRU-UHFFFAOYSA-N 0.000 description 1
- XSLGWYYNOSUMRM-ZKWXMUAHSA-N Ala-Val-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O XSLGWYYNOSUMRM-ZKWXMUAHSA-N 0.000 description 1
- BVLPIIBTWIYOML-ZKWXMUAHSA-N Ala-Val-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O BVLPIIBTWIYOML-ZKWXMUAHSA-N 0.000 description 1
- YJHKTAMKPGFJCT-NRPADANISA-N Ala-Val-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O YJHKTAMKPGFJCT-NRPADANISA-N 0.000 description 1
- DHONNEYAZPNGSG-UBHSHLNASA-N Ala-Val-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 DHONNEYAZPNGSG-UBHSHLNASA-N 0.000 description 1
- NLYYHIKRBRMAJV-AEJSXWLSSA-N Ala-Val-Pro Chemical compound C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N NLYYHIKRBRMAJV-AEJSXWLSSA-N 0.000 description 1
- REWSWYIDQIELBE-FXQIFTODSA-N Ala-Val-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O REWSWYIDQIELBE-FXQIFTODSA-N 0.000 description 1
- 206010001935 American trypanosomiasis Diseases 0.000 description 1
- 101100490194 Arabidopsis thaliana ACL5 gene Proteins 0.000 description 1
- 101100366322 Arabidopsis thaliana ADC1 gene Proteins 0.000 description 1
- 101100366333 Arabidopsis thaliana ADC2 gene Proteins 0.000 description 1
- OOBVTWHLKYJFJH-FXQIFTODSA-N Arg-Ala-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O OOBVTWHLKYJFJH-FXQIFTODSA-N 0.000 description 1
- DFCIPNHFKOQAME-FXQIFTODSA-N Arg-Ala-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O DFCIPNHFKOQAME-FXQIFTODSA-N 0.000 description 1
- OLDOLPWZEMHNIA-PJODQICGSA-N Arg-Ala-Trp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O OLDOLPWZEMHNIA-PJODQICGSA-N 0.000 description 1
- VWVPYNGMOCSSGK-GUBZILKMSA-N Arg-Arg-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O VWVPYNGMOCSSGK-GUBZILKMSA-N 0.000 description 1
- UXJCMQFPDWCHKX-DCAQKATOSA-N Arg-Arg-Glu Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(O)=O)C(O)=O UXJCMQFPDWCHKX-DCAQKATOSA-N 0.000 description 1
- JGDGLDNAQJJGJI-AVGNSLFASA-N Arg-Arg-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCCN=C(N)N)N JGDGLDNAQJJGJI-AVGNSLFASA-N 0.000 description 1
- NABSCJGZKWSNHX-RCWTZXSCSA-N Arg-Arg-Thr Chemical compound NC(N)=NCCC[C@@H](C(=O)N[C@@H]([C@H](O)C)C(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N NABSCJGZKWSNHX-RCWTZXSCSA-N 0.000 description 1
- JTKLCCFLSLCCST-SZMVWBNQSA-N Arg-Arg-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCCN=C(N)N)N)C(O)=O)=CNC2=C1 JTKLCCFLSLCCST-SZMVWBNQSA-N 0.000 description 1
- OZNSCVPYWZRQPY-CIUDSAMLSA-N Arg-Asp-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O OZNSCVPYWZRQPY-CIUDSAMLSA-N 0.000 description 1
- YUGFLWBWAJFGKY-BQBZGAKWSA-N Arg-Cys-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CS)C(=O)NCC(O)=O YUGFLWBWAJFGKY-BQBZGAKWSA-N 0.000 description 1
- YHQGEARSFILVHL-HJGDQZAQSA-N Arg-Gln-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N)O YHQGEARSFILVHL-HJGDQZAQSA-N 0.000 description 1
- MZRBYBIQTIKERR-GUBZILKMSA-N Arg-Glu-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MZRBYBIQTIKERR-GUBZILKMSA-N 0.000 description 1
- OHYQKYUTLIPFOX-ZPFDUUQYSA-N Arg-Glu-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O OHYQKYUTLIPFOX-ZPFDUUQYSA-N 0.000 description 1
- UFBURHXMKFQVLM-CIUDSAMLSA-N Arg-Glu-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O UFBURHXMKFQVLM-CIUDSAMLSA-N 0.000 description 1
- GOWZVQXTHUCNSQ-NHCYSSNCSA-N Arg-Glu-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O GOWZVQXTHUCNSQ-NHCYSSNCSA-N 0.000 description 1
- AQPVUEJJARLJHB-BQBZGAKWSA-N Arg-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CCCN=C(N)N AQPVUEJJARLJHB-BQBZGAKWSA-N 0.000 description 1
- OQCWXQJLCDPRHV-UWVGGRQHSA-N Arg-Gly-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O OQCWXQJLCDPRHV-UWVGGRQHSA-N 0.000 description 1
- ZJEDSBGPBXVBMP-PYJNHQTQSA-N Arg-His-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ZJEDSBGPBXVBMP-PYJNHQTQSA-N 0.000 description 1
- OTZMRMHZCMZOJZ-SRVKXCTJSA-N Arg-Leu-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O OTZMRMHZCMZOJZ-SRVKXCTJSA-N 0.000 description 1
- DNUKXVMPARLPFN-XUXIUFHCSA-N Arg-Leu-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O DNUKXVMPARLPFN-XUXIUFHCSA-N 0.000 description 1
- BNYNOWJESJJIOI-XUXIUFHCSA-N Arg-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCN=C(N)N)N BNYNOWJESJJIOI-XUXIUFHCSA-N 0.000 description 1
- XUGATJVGQUGQKY-ULQDDVLXSA-N Arg-Lys-Phe Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 XUGATJVGQUGQKY-ULQDDVLXSA-N 0.000 description 1
- INXWADWANGLMPJ-JYJNAYRXSA-N Arg-Phe-Arg Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCCNC(N)=N)C(O)=O)CC1=CC=CC=C1 INXWADWANGLMPJ-JYJNAYRXSA-N 0.000 description 1
- FIQKRDXFTANIEJ-ULQDDVLXSA-N Arg-Phe-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N FIQKRDXFTANIEJ-ULQDDVLXSA-N 0.000 description 1
- IGFJVXOATGZTHD-UHFFFAOYSA-N Arg-Phe-His Natural products NC(CCNC(=N)N)C(=O)NC(Cc1ccccc1)C(=O)NC(Cc2c[nH]cn2)C(=O)O IGFJVXOATGZTHD-UHFFFAOYSA-N 0.000 description 1
- NIELFHOLFTUZME-HJWJTTGWSA-N Arg-Phe-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NIELFHOLFTUZME-HJWJTTGWSA-N 0.000 description 1
- UGZUVYDKAYNCII-ULQDDVLXSA-N Arg-Phe-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O UGZUVYDKAYNCII-ULQDDVLXSA-N 0.000 description 1
- XSPKAHFVDKRGRL-DCAQKATOSA-N Arg-Pro-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O XSPKAHFVDKRGRL-DCAQKATOSA-N 0.000 description 1
- AMIQZQAAYGYKOP-FXQIFTODSA-N Arg-Ser-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O AMIQZQAAYGYKOP-FXQIFTODSA-N 0.000 description 1
- JOTRDIXZHNQYGP-DCAQKATOSA-N Arg-Ser-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCCN=C(N)N)N JOTRDIXZHNQYGP-DCAQKATOSA-N 0.000 description 1
- FRBAHXABMQXSJQ-FXQIFTODSA-N Arg-Ser-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O FRBAHXABMQXSJQ-FXQIFTODSA-N 0.000 description 1
- UZSQXCMNUPKLCC-FJXKBIBVSA-N Arg-Thr-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O UZSQXCMNUPKLCC-FJXKBIBVSA-N 0.000 description 1
- LFWOQHSQNCKXRU-UFYCRDLUSA-N Arg-Tyr-Phe Chemical compound C([C@H](NC(=O)[C@H](CCCN=C(N)N)N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 LFWOQHSQNCKXRU-UFYCRDLUSA-N 0.000 description 1
- XEOXPCNONWHHSW-AVGNSLFASA-N Arg-Val-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N XEOXPCNONWHHSW-AVGNSLFASA-N 0.000 description 1
- RZVVKNIACROXRM-ZLUOBGJFSA-N Asn-Ala-Asp Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N RZVVKNIACROXRM-ZLUOBGJFSA-N 0.000 description 1
- IARGXWMWRFOQPG-GCJQMDKQSA-N Asn-Ala-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IARGXWMWRFOQPG-GCJQMDKQSA-N 0.000 description 1
- XHFXZQHTLJVZBN-FXQIFTODSA-N Asn-Arg-Asn Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N)CN=C(N)N XHFXZQHTLJVZBN-FXQIFTODSA-N 0.000 description 1
- HOIFSHOLNKQCSA-FXQIFTODSA-N Asn-Arg-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O HOIFSHOLNKQCSA-FXQIFTODSA-N 0.000 description 1
- BDMIFVIWCNLDCT-CIUDSAMLSA-N Asn-Arg-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O BDMIFVIWCNLDCT-CIUDSAMLSA-N 0.000 description 1
- DQTIWTULBGLJBL-DCAQKATOSA-N Asn-Arg-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC(=O)N)N DQTIWTULBGLJBL-DCAQKATOSA-N 0.000 description 1
- PCKRJVZAQZWNKM-WHFBIAKZSA-N Asn-Asn-Gly Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O PCKRJVZAQZWNKM-WHFBIAKZSA-N 0.000 description 1
- IOTKDTZEEBZNCM-UGYAYLCHSA-N Asn-Asn-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O IOTKDTZEEBZNCM-UGYAYLCHSA-N 0.000 description 1
- NLCDVZJDEXIDDL-BIIVOSGPSA-N Asn-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)N)N)C(=O)O NLCDVZJDEXIDDL-BIIVOSGPSA-N 0.000 description 1
- BHQQRVARKXWXPP-ACZMJKKPSA-N Asn-Asp-Glu Chemical compound C(CC(=O)O)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)N)N BHQQRVARKXWXPP-ACZMJKKPSA-N 0.000 description 1
- SPIPSJXLZVTXJL-ZLUOBGJFSA-N Asn-Cys-Ser Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O SPIPSJXLZVTXJL-ZLUOBGJFSA-N 0.000 description 1
- SQZIAWGBBUSSPJ-ZKWXMUAHSA-N Asn-Cys-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC(=O)N)N SQZIAWGBBUSSPJ-ZKWXMUAHSA-N 0.000 description 1
- PQAIOUVVZCOLJK-FXQIFTODSA-N Asn-Gln-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N PQAIOUVVZCOLJK-FXQIFTODSA-N 0.000 description 1
- DMLSCRJBWUEALP-LAEOZQHASA-N Asn-Glu-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O DMLSCRJBWUEALP-LAEOZQHASA-N 0.000 description 1
- WONGRTVAMHFGBE-WDSKDSINSA-N Asn-Gly-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N WONGRTVAMHFGBE-WDSKDSINSA-N 0.000 description 1
- OPEPUCYIGFEGSW-WDSKDSINSA-N Asn-Gly-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O OPEPUCYIGFEGSW-WDSKDSINSA-N 0.000 description 1
- GWNMUVANAWDZTI-YUMQZZPRSA-N Asn-Gly-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N GWNMUVANAWDZTI-YUMQZZPRSA-N 0.000 description 1
- HYQYLOSCICEYTR-YUMQZZPRSA-N Asn-Gly-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O HYQYLOSCICEYTR-YUMQZZPRSA-N 0.000 description 1
- FTCGGKNCJZOPNB-WHFBIAKZSA-N Asn-Gly-Ser Chemical compound NC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O FTCGGKNCJZOPNB-WHFBIAKZSA-N 0.000 description 1
- SGAUXNZEFIEAAI-GARJFASQSA-N Asn-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CC(=O)N)N)C(=O)O SGAUXNZEFIEAAI-GARJFASQSA-N 0.000 description 1
- GQRDIVQPSMPQME-ZPFDUUQYSA-N Asn-Ile-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O GQRDIVQPSMPQME-ZPFDUUQYSA-N 0.000 description 1
- YVXRYLVELQYAEQ-SRVKXCTJSA-N Asn-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N YVXRYLVELQYAEQ-SRVKXCTJSA-N 0.000 description 1
- DJIMLSXHXKWADV-CIUDSAMLSA-N Asn-Leu-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(N)=O DJIMLSXHXKWADV-CIUDSAMLSA-N 0.000 description 1
- NYGILGUOUOXGMJ-YUMQZZPRSA-N Asn-Lys-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O NYGILGUOUOXGMJ-YUMQZZPRSA-N 0.000 description 1
- NLDNNZKUSLAYFW-NHCYSSNCSA-N Asn-Lys-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O NLDNNZKUSLAYFW-NHCYSSNCSA-N 0.000 description 1
- YXVAESUIQFDBHN-SRVKXCTJSA-N Asn-Phe-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O YXVAESUIQFDBHN-SRVKXCTJSA-N 0.000 description 1
- JTXVXGXTRXMOFJ-FXQIFTODSA-N Asn-Pro-Asn Chemical compound NC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O JTXVXGXTRXMOFJ-FXQIFTODSA-N 0.000 description 1
- GKKUBLFXKRDMFC-BQBZGAKWSA-N Asn-Pro-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O GKKUBLFXKRDMFC-BQBZGAKWSA-N 0.000 description 1
- JWQWPRCDYWNVNM-ACZMJKKPSA-N Asn-Ser-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(=O)N)N JWQWPRCDYWNVNM-ACZMJKKPSA-N 0.000 description 1
- JXMREEPBRANWBY-VEVYYDQMSA-N Asn-Thr-Arg Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O JXMREEPBRANWBY-VEVYYDQMSA-N 0.000 description 1
- QUMKPKWYDVMGNT-NUMRIWBASA-N Asn-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O QUMKPKWYDVMGNT-NUMRIWBASA-N 0.000 description 1
- XIDSGDJNUJRUHE-VEVYYDQMSA-N Asn-Thr-Met Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(O)=O XIDSGDJNUJRUHE-VEVYYDQMSA-N 0.000 description 1
- JPPLRQVZMZFOSX-UWJYBYFXSA-N Asn-Tyr-Ala Chemical compound NC(=O)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=C(O)C=C1 JPPLRQVZMZFOSX-UWJYBYFXSA-N 0.000 description 1
- BEHQTVDBCLSCBY-CFMVVWHZSA-N Asn-Tyr-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BEHQTVDBCLSCBY-CFMVVWHZSA-N 0.000 description 1
- CBHVAFXKOYAHOY-NHCYSSNCSA-N Asn-Val-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O CBHVAFXKOYAHOY-NHCYSSNCSA-N 0.000 description 1
- XOQYDFCQPWAMSA-KKHAAJSZSA-N Asn-Val-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XOQYDFCQPWAMSA-KKHAAJSZSA-N 0.000 description 1
- KRXIWXCXOARFNT-ZLUOBGJFSA-N Asp-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC(O)=O KRXIWXCXOARFNT-ZLUOBGJFSA-N 0.000 description 1
- XBQSLMACWDXWLJ-GHCJXIJMSA-N Asp-Ala-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O XBQSLMACWDXWLJ-GHCJXIJMSA-N 0.000 description 1
- PBVLJOIPOGUQQP-CIUDSAMLSA-N Asp-Ala-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O PBVLJOIPOGUQQP-CIUDSAMLSA-N 0.000 description 1
- VPPXTHJNTYDNFJ-CIUDSAMLSA-N Asp-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N VPPXTHJNTYDNFJ-CIUDSAMLSA-N 0.000 description 1
- NJIKKGUVGUBICV-ZLUOBGJFSA-N Asp-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC(O)=O NJIKKGUVGUBICV-ZLUOBGJFSA-N 0.000 description 1
- UGKZHCBLMLSANF-CIUDSAMLSA-N Asp-Asn-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O UGKZHCBLMLSANF-CIUDSAMLSA-N 0.000 description 1
- FANQWNCPNFEPGZ-WHFBIAKZSA-N Asp-Asp-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O FANQWNCPNFEPGZ-WHFBIAKZSA-N 0.000 description 1
- SBHUBSDEZQFJHJ-CIUDSAMLSA-N Asp-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(O)=O SBHUBSDEZQFJHJ-CIUDSAMLSA-N 0.000 description 1
- CELPEWWLSXMVPH-CIUDSAMLSA-N Asp-Asp-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(O)=O CELPEWWLSXMVPH-CIUDSAMLSA-N 0.000 description 1
- PMEHKVHZQKJACS-PEFMBERDSA-N Asp-Gln-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O PMEHKVHZQKJACS-PEFMBERDSA-N 0.000 description 1
- SPKRHJOVRVDJGG-CIUDSAMLSA-N Asp-Gln-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)O)N SPKRHJOVRVDJGG-CIUDSAMLSA-N 0.000 description 1
- KIJLEFNHWSXHRU-NUMRIWBASA-N Asp-Gln-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KIJLEFNHWSXHRU-NUMRIWBASA-N 0.000 description 1
- VAWNQIGQPUOPQW-ACZMJKKPSA-N Asp-Glu-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O VAWNQIGQPUOPQW-ACZMJKKPSA-N 0.000 description 1
- KHBLRHKVXICFMY-GUBZILKMSA-N Asp-Glu-Lys Chemical compound N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O KHBLRHKVXICFMY-GUBZILKMSA-N 0.000 description 1
- VIRHEUMYXXLCBF-WDSKDSINSA-N Asp-Gly-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O VIRHEUMYXXLCBF-WDSKDSINSA-N 0.000 description 1
- PSLSTUMPZILTAH-BYULHYEWSA-N Asp-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(O)=O PSLSTUMPZILTAH-BYULHYEWSA-N 0.000 description 1
- PZXPWHFYZXTFBI-YUMQZZPRSA-N Asp-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(O)=O PZXPWHFYZXTFBI-YUMQZZPRSA-N 0.000 description 1
- LDGUZSIPGSPBJP-XVYDVKMFSA-N Asp-His-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC(=O)O)N LDGUZSIPGSPBJP-XVYDVKMFSA-N 0.000 description 1
- UBPMOJLRVMGTOQ-GARJFASQSA-N Asp-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CC(=O)O)N)C(=O)O UBPMOJLRVMGTOQ-GARJFASQSA-N 0.000 description 1
- HOBNTSHITVVNBN-ZPFDUUQYSA-N Asp-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CC(=O)O)N HOBNTSHITVVNBN-ZPFDUUQYSA-N 0.000 description 1
- SPKCGKRUYKMDHP-GUDRVLHUSA-N Asp-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N SPKCGKRUYKMDHP-GUDRVLHUSA-N 0.000 description 1
- JNNVNVRBYUJYGS-CIUDSAMLSA-N Asp-Leu-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O JNNVNVRBYUJYGS-CIUDSAMLSA-N 0.000 description 1
- AYFVRYXNDHBECD-YUMQZZPRSA-N Asp-Leu-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O AYFVRYXNDHBECD-YUMQZZPRSA-N 0.000 description 1
- CJUKAWUWBZCTDQ-SRVKXCTJSA-N Asp-Leu-Lys Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O CJUKAWUWBZCTDQ-SRVKXCTJSA-N 0.000 description 1
- ORRJQLIATJDMQM-HJGDQZAQSA-N Asp-Leu-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O ORRJQLIATJDMQM-HJGDQZAQSA-N 0.000 description 1
- LBOVBQONZJRWPV-YUMQZZPRSA-N Asp-Lys-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O LBOVBQONZJRWPV-YUMQZZPRSA-N 0.000 description 1
- NVFSJIXJZCDICF-SRVKXCTJSA-N Asp-Lys-Lys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N NVFSJIXJZCDICF-SRVKXCTJSA-N 0.000 description 1
- MYLZFUMPZCPJCJ-NHCYSSNCSA-N Asp-Lys-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O MYLZFUMPZCPJCJ-NHCYSSNCSA-N 0.000 description 1
- XFQOQUWGVCVYON-DCAQKATOSA-N Asp-Met-His Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 XFQOQUWGVCVYON-DCAQKATOSA-N 0.000 description 1
- HXVILZUZXFLVEN-DCAQKATOSA-N Asp-Met-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O HXVILZUZXFLVEN-DCAQKATOSA-N 0.000 description 1
- YRZIYQGXTSBRLT-AVGNSLFASA-N Asp-Phe-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O YRZIYQGXTSBRLT-AVGNSLFASA-N 0.000 description 1
- GPPIDDWYKJPRES-YDHLFZDLSA-N Asp-Phe-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O GPPIDDWYKJPRES-YDHLFZDLSA-N 0.000 description 1
- ZKAOJVJQGVUIIU-GUBZILKMSA-N Asp-Pro-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ZKAOJVJQGVUIIU-GUBZILKMSA-N 0.000 description 1
- KESWRFKUZRUTAH-FXQIFTODSA-N Asp-Pro-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O KESWRFKUZRUTAH-FXQIFTODSA-N 0.000 description 1
- AHWRSSLYSGLBGD-CIUDSAMLSA-N Asp-Pro-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O AHWRSSLYSGLBGD-CIUDSAMLSA-N 0.000 description 1
- YFGUZQQCSDZRBN-DCAQKATOSA-N Asp-Pro-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O YFGUZQQCSDZRBN-DCAQKATOSA-N 0.000 description 1
- FAUPLTGRUBTXNU-FXQIFTODSA-N Asp-Pro-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O FAUPLTGRUBTXNU-FXQIFTODSA-N 0.000 description 1
- DINOVZWPTMGSRF-QXEWZRGKSA-N Asp-Pro-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O DINOVZWPTMGSRF-QXEWZRGKSA-N 0.000 description 1
- WMLFFCRUSPNENW-ZLUOBGJFSA-N Asp-Ser-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O WMLFFCRUSPNENW-ZLUOBGJFSA-N 0.000 description 1
- DRCOAZZDQRCGGP-GHCJXIJMSA-N Asp-Ser-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O DRCOAZZDQRCGGP-GHCJXIJMSA-N 0.000 description 1
- QSFHZPQUAAQHAQ-CIUDSAMLSA-N Asp-Ser-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O QSFHZPQUAAQHAQ-CIUDSAMLSA-N 0.000 description 1
- IWLZBRTUIVXZJD-OLHMAJIHSA-N Asp-Thr-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O IWLZBRTUIVXZJD-OLHMAJIHSA-N 0.000 description 1
- QOCFFCUFZGDHTP-NUMRIWBASA-N Asp-Thr-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(O)=O QOCFFCUFZGDHTP-NUMRIWBASA-N 0.000 description 1
- JDDYEZGPYBBPBN-JRQIVUDYSA-N Asp-Thr-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JDDYEZGPYBBPBN-JRQIVUDYSA-N 0.000 description 1
- ITGFVUYOLWBPQW-KKHAAJSZSA-N Asp-Thr-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O ITGFVUYOLWBPQW-KKHAAJSZSA-N 0.000 description 1
- CZIVKMOEXPILDK-SRVKXCTJSA-N Asp-Tyr-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O CZIVKMOEXPILDK-SRVKXCTJSA-N 0.000 description 1
- XWKPSMRPIKKDDU-RCOVLWMOSA-N Asp-Val-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O XWKPSMRPIKKDDU-RCOVLWMOSA-N 0.000 description 1
- GIKOVDMXBAFXDF-NHCYSSNCSA-N Asp-Val-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O GIKOVDMXBAFXDF-NHCYSSNCSA-N 0.000 description 1
- 238000010354 CRISPR gene editing Methods 0.000 description 1
- 102000004031 Carboxy-Lyases Human genes 0.000 description 1
- 108090000489 Carboxy-Lyases Proteins 0.000 description 1
- 108090000994 Catalytic RNA Proteins 0.000 description 1
- 102000053642 Catalytic RNA Human genes 0.000 description 1
- 208000024699 Chagas disease Diseases 0.000 description 1
- 108020004638 Circular DNA Proteins 0.000 description 1
- 108091026890 Coding region Proteins 0.000 description 1
- ZJBWJHQDOIMVLM-WHFBIAKZSA-N Cys-Cys-Gly Chemical compound SC[C@H](N)C(=O)N[C@@H](CS)C(=O)NCC(O)=O ZJBWJHQDOIMVLM-WHFBIAKZSA-N 0.000 description 1
- CFQVGYWKSLKWFX-KBIXCLLPSA-N Cys-Glu-Ile Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O CFQVGYWKSLKWFX-KBIXCLLPSA-N 0.000 description 1
- UXIYYUMGFNSGBK-XPUUQOCRSA-N Cys-Gly-Val Chemical compound [H]N[C@@H](CS)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O UXIYYUMGFNSGBK-XPUUQOCRSA-N 0.000 description 1
- UXBYDFKFHMYYPL-XIRDDKMYSA-N Cys-His-Trp Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O UXBYDFKFHMYYPL-XIRDDKMYSA-N 0.000 description 1
- ABLJDBFJPUWQQB-DCAQKATOSA-N Cys-Leu-Arg Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CS)N ABLJDBFJPUWQQB-DCAQKATOSA-N 0.000 description 1
- UCSXXFRXHGUXCQ-SRVKXCTJSA-N Cys-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CS)N UCSXXFRXHGUXCQ-SRVKXCTJSA-N 0.000 description 1
- SWJYSDXMTPMBHO-FXQIFTODSA-N Cys-Pro-Ser Chemical compound [H]N[C@@H](CS)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O SWJYSDXMTPMBHO-FXQIFTODSA-N 0.000 description 1
- KVCJEMHFLGVINV-ZLUOBGJFSA-N Cys-Ser-Asn Chemical compound SC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(N)=O KVCJEMHFLGVINV-ZLUOBGJFSA-N 0.000 description 1
- IRDBEBCCTCNXGZ-AVGNSLFASA-N Cys-Tyr-Gln Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CS)N)O IRDBEBCCTCNXGZ-AVGNSLFASA-N 0.000 description 1
- GQNZIAGMRXOFJX-GUBZILKMSA-N Cys-Val-Met Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCSC)C(O)=O GQNZIAGMRXOFJX-GUBZILKMSA-N 0.000 description 1
- QQAYIVHVRFJICE-AEJSXWLSSA-N Cys-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CS)N QQAYIVHVRFJICE-AEJSXWLSSA-N 0.000 description 1
- ATFSDBMHRCDLBV-BPUTZDHNSA-N Cys-Val-Trp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CS)N ATFSDBMHRCDLBV-BPUTZDHNSA-N 0.000 description 1
- UHDGCWIWMRVCDJ-PSQAKQOGSA-N Cytidine Natural products O=C1N=C(N)C=CN1[C@@H]1[C@@H](O)[C@@H](O)[C@H](CO)O1 UHDGCWIWMRVCDJ-PSQAKQOGSA-N 0.000 description 1
- HMFHBZSHGGEWLO-SOOFDHNKSA-N D-ribofuranose Chemical compound OC[C@H]1OC(O)[C@H](O)[C@@H]1O HMFHBZSHGGEWLO-SOOFDHNKSA-N 0.000 description 1
- 102000016928 DNA-directed DNA polymerase Human genes 0.000 description 1
- 108010014303 DNA-directed DNA polymerase Proteins 0.000 description 1
- XPDXVDYUQZHFPV-UHFFFAOYSA-N Dansyl Chloride Chemical compound C1=CC=C2C(N(C)C)=CC=CC2=C1S(Cl)(=O)=O XPDXVDYUQZHFPV-UHFFFAOYSA-N 0.000 description 1
- 241000235035 Debaryomyces Species 0.000 description 1
- 241001465321 Eremothecium Species 0.000 description 1
- 241000588722 Escherichia Species 0.000 description 1
- 241000233866 Fungi Species 0.000 description 1
- KVYVOGYEMPEXBT-GUBZILKMSA-N Gln-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(N)=O KVYVOGYEMPEXBT-GUBZILKMSA-N 0.000 description 1
- NUMFTVCBONFQIQ-DRZSPHRISA-N Gln-Ala-Phe Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O NUMFTVCBONFQIQ-DRZSPHRISA-N 0.000 description 1
- INFBPLSHYFALDE-ACZMJKKPSA-N Gln-Asn-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O INFBPLSHYFALDE-ACZMJKKPSA-N 0.000 description 1
- LLVXTGUTDYMJLY-GUBZILKMSA-N Gln-Asn-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)N)N LLVXTGUTDYMJLY-GUBZILKMSA-N 0.000 description 1
- LMPBBFWHCRURJD-LAEOZQHASA-N Gln-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)N)N LMPBBFWHCRURJD-LAEOZQHASA-N 0.000 description 1
- CRRFJBGUGNNOCS-PEFMBERDSA-N Gln-Asp-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O CRRFJBGUGNNOCS-PEFMBERDSA-N 0.000 description 1
- JFSNBQJNDMXMQF-XHNCKOQMSA-N Gln-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)N)N)C(=O)O JFSNBQJNDMXMQF-XHNCKOQMSA-N 0.000 description 1
- COYGBRTZEVWZBW-XKBZYTNZSA-N Gln-Cys-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CS)NC(=O)[C@@H](N)CCC(N)=O COYGBRTZEVWZBW-XKBZYTNZSA-N 0.000 description 1
- ZDJZEGYVKANKED-NRPADANISA-N Gln-Cys-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(O)=O ZDJZEGYVKANKED-NRPADANISA-N 0.000 description 1
- LPYPANUXJGFMGV-FXQIFTODSA-N Gln-Gln-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)N)N LPYPANUXJGFMGV-FXQIFTODSA-N 0.000 description 1
- JHPFPROFOAJRFN-IHRRRGAJSA-N Gln-Glu-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)N)N)O JHPFPROFOAJRFN-IHRRRGAJSA-N 0.000 description 1
- CLPQUWHBWXFJOX-BQBZGAKWSA-N Gln-Gly-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(O)=O CLPQUWHBWXFJOX-BQBZGAKWSA-N 0.000 description 1
- XSBGUANSZDGULP-IUCAKERBSA-N Gln-Gly-Lys Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)NCC(=O)N[C@@H](CCCCN)C(O)=O XSBGUANSZDGULP-IUCAKERBSA-N 0.000 description 1
- DWDBJWAXPXXYLP-SRVKXCTJSA-N Gln-His-Arg Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N DWDBJWAXPXXYLP-SRVKXCTJSA-N 0.000 description 1
- GIVHPCWYVWUUSG-HVTMNAMFSA-N Gln-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCC(=O)N)N GIVHPCWYVWUUSG-HVTMNAMFSA-N 0.000 description 1
- CAXXTYYGFYTBPV-IUCAKERBSA-N Gln-Leu-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O CAXXTYYGFYTBPV-IUCAKERBSA-N 0.000 description 1
- VUVKKXPCKILIBD-AVGNSLFASA-N Gln-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCC(=O)N)N VUVKKXPCKILIBD-AVGNSLFASA-N 0.000 description 1
- PSERKXGRRADTKA-MNXVOIDGSA-N Gln-Leu-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O PSERKXGRRADTKA-MNXVOIDGSA-N 0.000 description 1
- CELXWPDNIGWCJN-WDCWCFNPSA-N Gln-Lys-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CELXWPDNIGWCJN-WDCWCFNPSA-N 0.000 description 1
- SWDSRANUCKNBLA-AVGNSLFASA-N Gln-Phe-Asp Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N SWDSRANUCKNBLA-AVGNSLFASA-N 0.000 description 1
- DBNLXHGDGBUCDV-KKUMJFAQSA-N Gln-Phe-Met Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(O)=O DBNLXHGDGBUCDV-KKUMJFAQSA-N 0.000 description 1
- ZMXZGYLINVNTKH-DZKIICNBSA-N Gln-Val-Phe Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ZMXZGYLINVNTKH-DZKIICNBSA-N 0.000 description 1
- BPDVTFBJZNBHEU-HGNGGELXSA-N Glu-Ala-His Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 BPDVTFBJZNBHEU-HGNGGELXSA-N 0.000 description 1
- RLZBLVSJDFHDBL-KBIXCLLPSA-N Glu-Ala-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O RLZBLVSJDFHDBL-KBIXCLLPSA-N 0.000 description 1
- ATRHMOJQJWPVBQ-DRZSPHRISA-N Glu-Ala-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ATRHMOJQJWPVBQ-DRZSPHRISA-N 0.000 description 1
- WOSRKEJQESVHGA-CIUDSAMLSA-N Glu-Arg-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O WOSRKEJQESVHGA-CIUDSAMLSA-N 0.000 description 1
- SYDJILXOZNEEDK-XIRDDKMYSA-N Glu-Arg-Trp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O SYDJILXOZNEEDK-XIRDDKMYSA-N 0.000 description 1
- AKJRHDMTEJXTPV-ACZMJKKPSA-N Glu-Asn-Ala Chemical compound C[C@H](NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CCC(O)=O)C(O)=O AKJRHDMTEJXTPV-ACZMJKKPSA-N 0.000 description 1
- RDPOETHPAQEGDP-ACZMJKKPSA-N Glu-Asp-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O RDPOETHPAQEGDP-ACZMJKKPSA-N 0.000 description 1
- IESFZVCAVACGPH-PEFMBERDSA-N Glu-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCC(O)=O IESFZVCAVACGPH-PEFMBERDSA-N 0.000 description 1
- GZWOBWMOMPFPCD-CIUDSAMLSA-N Glu-Asp-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)O)N GZWOBWMOMPFPCD-CIUDSAMLSA-N 0.000 description 1
- PAQUJCSYVIBPLC-AVGNSLFASA-N Glu-Asp-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 PAQUJCSYVIBPLC-AVGNSLFASA-N 0.000 description 1
- SBCYJMOOHUDWDA-NUMRIWBASA-N Glu-Asp-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SBCYJMOOHUDWDA-NUMRIWBASA-N 0.000 description 1
- SAEBUDRWKUXLOM-ACZMJKKPSA-N Glu-Cys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CS)NC(=O)[C@@H](N)CCC(O)=O SAEBUDRWKUXLOM-ACZMJKKPSA-N 0.000 description 1
- XHUCVVHRLNPZSZ-CIUDSAMLSA-N Glu-Gln-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O XHUCVVHRLNPZSZ-CIUDSAMLSA-N 0.000 description 1
- ILGFBUGLBSAQQB-GUBZILKMSA-N Glu-Glu-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ILGFBUGLBSAQQB-GUBZILKMSA-N 0.000 description 1
- QQLBPVKLJBAXBS-FXQIFTODSA-N Glu-Glu-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O QQLBPVKLJBAXBS-FXQIFTODSA-N 0.000 description 1
- BUZMZDDKFCSKOT-CIUDSAMLSA-N Glu-Glu-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O BUZMZDDKFCSKOT-CIUDSAMLSA-N 0.000 description 1
- LGYZYFFDELZWRS-DCAQKATOSA-N Glu-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O LGYZYFFDELZWRS-DCAQKATOSA-N 0.000 description 1
- CUXJIASLBRJOFV-LAEOZQHASA-N Glu-Gly-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O CUXJIASLBRJOFV-LAEOZQHASA-N 0.000 description 1
- LRPXYSGPOBVBEH-IUCAKERBSA-N Glu-Gly-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O LRPXYSGPOBVBEH-IUCAKERBSA-N 0.000 description 1
- OPAINBJQDQTGJY-JGVFFNPUSA-N Glu-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CCC(=O)O)N)C(=O)O OPAINBJQDQTGJY-JGVFFNPUSA-N 0.000 description 1
- CXRWMMRLEMVSEH-PEFMBERDSA-N Glu-Ile-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O CXRWMMRLEMVSEH-PEFMBERDSA-N 0.000 description 1
- LGYCLOCORAEQSZ-PEFMBERDSA-N Glu-Ile-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O LGYCLOCORAEQSZ-PEFMBERDSA-N 0.000 description 1
- HVYWQYLBVXMXSV-GUBZILKMSA-N Glu-Leu-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O HVYWQYLBVXMXSV-GUBZILKMSA-N 0.000 description 1
- IRXNJYPKBVERCW-DCAQKATOSA-N Glu-Leu-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O IRXNJYPKBVERCW-DCAQKATOSA-N 0.000 description 1
- NWOUBJNMZDDGDT-AVGNSLFASA-N Glu-Leu-His Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 NWOUBJNMZDDGDT-AVGNSLFASA-N 0.000 description 1
- IVGJYOOGJLFKQE-AVGNSLFASA-N Glu-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N IVGJYOOGJLFKQE-AVGNSLFASA-N 0.000 description 1
- NJCALAAIGREHDR-WDCWCFNPSA-N Glu-Leu-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NJCALAAIGREHDR-WDCWCFNPSA-N 0.000 description 1
- OQXDUSZKISQQSS-GUBZILKMSA-N Glu-Lys-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O OQXDUSZKISQQSS-GUBZILKMSA-N 0.000 description 1
- SJJHXJDSNQJMMW-SRVKXCTJSA-N Glu-Lys-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O SJJHXJDSNQJMMW-SRVKXCTJSA-N 0.000 description 1
- OCJRHJZKGGSPRW-IUCAKERBSA-N Glu-Lys-Gly Chemical compound NCCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O OCJRHJZKGGSPRW-IUCAKERBSA-N 0.000 description 1
- FMBWLLMUPXTXFC-SDDRHHMPSA-N Glu-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CCC(=O)O)N)C(=O)O FMBWLLMUPXTXFC-SDDRHHMPSA-N 0.000 description 1
- QDMVXRNLOPTPIE-WDCWCFNPSA-N Glu-Lys-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QDMVXRNLOPTPIE-WDCWCFNPSA-N 0.000 description 1
- LHIPZASLKPYDPI-AVGNSLFASA-N Glu-Phe-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O LHIPZASLKPYDPI-AVGNSLFASA-N 0.000 description 1
- FQFWFZWOHOEVMZ-IHRRRGAJSA-N Glu-Phe-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O FQFWFZWOHOEVMZ-IHRRRGAJSA-N 0.000 description 1
- ZIYGTCDTJJCDDP-JYJNAYRXSA-N Glu-Phe-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N ZIYGTCDTJJCDDP-JYJNAYRXSA-N 0.000 description 1
- PAZQYODKOZHXGA-SRVKXCTJSA-N Glu-Pro-His Chemical compound N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O PAZQYODKOZHXGA-SRVKXCTJSA-N 0.000 description 1
- SYWCGQOIIARSIX-SRVKXCTJSA-N Glu-Pro-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O SYWCGQOIIARSIX-SRVKXCTJSA-N 0.000 description 1
- ZKONLKQGTNVAPR-DCAQKATOSA-N Glu-Pro-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CCC(=O)O)N ZKONLKQGTNVAPR-DCAQKATOSA-N 0.000 description 1
- SWDNPSMMEWRNOH-HJGDQZAQSA-N Glu-Pro-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O SWDNPSMMEWRNOH-HJGDQZAQSA-N 0.000 description 1
- BPLNJYHNAJVLRT-ACZMJKKPSA-N Glu-Ser-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O BPLNJYHNAJVLRT-ACZMJKKPSA-N 0.000 description 1
- HZISRJBYZAODRV-XQXXSGGOSA-N Glu-Thr-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O HZISRJBYZAODRV-XQXXSGGOSA-N 0.000 description 1
- QCMVGXDELYMZET-GLLZPBPUSA-N Glu-Thr-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QCMVGXDELYMZET-GLLZPBPUSA-N 0.000 description 1
- MXJYXYDREQWUMS-XKBZYTNZSA-N Glu-Thr-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O MXJYXYDREQWUMS-XKBZYTNZSA-N 0.000 description 1
- ZQNCUVODKOBSSO-XEGUGMAKSA-N Glu-Trp-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C)C(O)=O ZQNCUVODKOBSSO-XEGUGMAKSA-N 0.000 description 1
- YOTHMZZSJKKEHZ-SZMVWBNQSA-N Glu-Trp-Lys Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H](CCCCN)C(O)=O)NC(=O)[C@@H](N)CCC(O)=O)=CNC2=C1 YOTHMZZSJKKEHZ-SZMVWBNQSA-N 0.000 description 1
- UCZXXMREFIETQW-AVGNSLFASA-N Glu-Tyr-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O UCZXXMREFIETQW-AVGNSLFASA-N 0.000 description 1
- QOOFKCCZZWTCEP-AVGNSLFASA-N Glu-Tyr-Cys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O QOOFKCCZZWTCEP-AVGNSLFASA-N 0.000 description 1
- VXEFAWJTFAUDJK-AVGNSLFASA-N Glu-Tyr-Ser Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O VXEFAWJTFAUDJK-AVGNSLFASA-N 0.000 description 1
- ZALGPUWUVHOGAE-GVXVVHGQSA-N Glu-Val-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCC(=O)O)N ZALGPUWUVHOGAE-GVXVVHGQSA-N 0.000 description 1
- RMWAOBGCZZSJHE-UMNHJUIQSA-N Glu-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N RMWAOBGCZZSJHE-UMNHJUIQSA-N 0.000 description 1
- SOYWRINXUSUWEQ-DLOVCJGASA-N Glu-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCC(O)=O SOYWRINXUSUWEQ-DLOVCJGASA-N 0.000 description 1
- 108010063907 Glutathione Reductase Proteins 0.000 description 1
- 102100036442 Glutathione reductase, mitochondrial Human genes 0.000 description 1
- 102100034294 Glutathione synthetase Human genes 0.000 description 1
- RLFSBAPJTYKSLG-WHFBIAKZSA-N Gly-Ala-Asp Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O RLFSBAPJTYKSLG-WHFBIAKZSA-N 0.000 description 1
- MZZSCEANQDPJER-ONGXEEELSA-N Gly-Ala-Phe Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MZZSCEANQDPJER-ONGXEEELSA-N 0.000 description 1
- RJIVPOXLQFJRTG-LURJTMIESA-N Gly-Arg-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N RJIVPOXLQFJRTG-LURJTMIESA-N 0.000 description 1
- KRRMJKMGWWXWDW-STQMWFEESA-N Gly-Arg-Phe Chemical compound NC(=N)NCCC[C@H](NC(=O)CN)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KRRMJKMGWWXWDW-STQMWFEESA-N 0.000 description 1
- QSTLUOIOYLYLLF-WDSKDSINSA-N Gly-Asp-Glu Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QSTLUOIOYLYLLF-WDSKDSINSA-N 0.000 description 1
- CEXINUGNTZFNRY-BYPYZUCNSA-N Gly-Cys-Gly Chemical compound [NH3+]CC(=O)N[C@@H](CS)C(=O)NCC([O-])=O CEXINUGNTZFNRY-BYPYZUCNSA-N 0.000 description 1
- BYYNJRSNDARRBX-YFKPBYRVSA-N Gly-Gln-Gly Chemical compound NCC(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O BYYNJRSNDARRBX-YFKPBYRVSA-N 0.000 description 1
- XTQFHTHIAKKCTM-YFKPBYRVSA-N Gly-Glu-Gly Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O XTQFHTHIAKKCTM-YFKPBYRVSA-N 0.000 description 1
- LHRXAHLCRMQBGJ-RYUDHWBXSA-N Gly-Glu-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)CN LHRXAHLCRMQBGJ-RYUDHWBXSA-N 0.000 description 1
- KMSGYZQRXPUKGI-BYPYZUCNSA-N Gly-Gly-Asn Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC(N)=O KMSGYZQRXPUKGI-BYPYZUCNSA-N 0.000 description 1
- XPJBQTCXPJNIFE-ZETCQYMHSA-N Gly-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)CN XPJBQTCXPJNIFE-ZETCQYMHSA-N 0.000 description 1
- BUEFQXUHTUZXHR-LURJTMIESA-N Gly-Gly-Pro zwitterion Chemical compound NCC(=O)NCC(=O)N1CCC[C@H]1C(O)=O BUEFQXUHTUZXHR-LURJTMIESA-N 0.000 description 1
- UQJNXZSSGQIPIQ-FBCQKBJTSA-N Gly-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)CN UQJNXZSSGQIPIQ-FBCQKBJTSA-N 0.000 description 1
- UPADCCSMVOQAGF-LBPRGKRZSA-N Gly-Gly-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)CNC(=O)CN)C(O)=O)=CNC2=C1 UPADCCSMVOQAGF-LBPRGKRZSA-N 0.000 description 1
- UTYGDAHJBBDPBA-BYULHYEWSA-N Gly-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)CN UTYGDAHJBBDPBA-BYULHYEWSA-N 0.000 description 1
- HKSNHPVETYYJBK-LAEOZQHASA-N Gly-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)CN HKSNHPVETYYJBK-LAEOZQHASA-N 0.000 description 1
- AAHSHTLISQUZJL-QSFUFRPTSA-N Gly-Ile-Ile Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O AAHSHTLISQUZJL-QSFUFRPTSA-N 0.000 description 1
- ULZCYBYDTUMHNF-IUCAKERBSA-N Gly-Leu-Glu Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O ULZCYBYDTUMHNF-IUCAKERBSA-N 0.000 description 1
- LRQXRHGQEVWGPV-NHCYSSNCSA-N Gly-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)CN LRQXRHGQEVWGPV-NHCYSSNCSA-N 0.000 description 1
- GMTXWRIDLGTVFC-IUCAKERBSA-N Gly-Lys-Glu Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O GMTXWRIDLGTVFC-IUCAKERBSA-N 0.000 description 1
- PTIIBFKSLCYQBO-NHCYSSNCSA-N Gly-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)CN PTIIBFKSLCYQBO-NHCYSSNCSA-N 0.000 description 1
- MHZXESQPPXOING-KBPBESRZSA-N Gly-Lys-Phe Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O MHZXESQPPXOING-KBPBESRZSA-N 0.000 description 1
- CVFOYJJOZYYEPE-KBPBESRZSA-N Gly-Lys-Tyr Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CVFOYJJOZYYEPE-KBPBESRZSA-N 0.000 description 1
- FJWSJWACLMTDMI-WPRPVWTQSA-N Gly-Met-Val Chemical compound [H]NCC(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(O)=O FJWSJWACLMTDMI-WPRPVWTQSA-N 0.000 description 1
- JBCLFWXMTIKCCB-VIFPVBQESA-N Gly-Phe Chemical compound NCC(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-VIFPVBQESA-N 0.000 description 1
- GAFKBWKVXNERFA-QWRGUYRKSA-N Gly-Phe-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 GAFKBWKVXNERFA-QWRGUYRKSA-N 0.000 description 1
- YYXJFBMCOUSYSF-RYUDHWBXSA-N Gly-Phe-Gln Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O YYXJFBMCOUSYSF-RYUDHWBXSA-N 0.000 description 1
- WZSHYFGOLPXPLL-RYUDHWBXSA-N Gly-Phe-Glu Chemical compound NCC(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@@H](CCC(O)=O)C(O)=O WZSHYFGOLPXPLL-RYUDHWBXSA-N 0.000 description 1
- YLEIWGJJBFBFHC-KBPBESRZSA-N Gly-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 YLEIWGJJBFBFHC-KBPBESRZSA-N 0.000 description 1
- WNZOCXUOGVYYBJ-CDMKHQONSA-N Gly-Phe-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)CN)O WNZOCXUOGVYYBJ-CDMKHQONSA-N 0.000 description 1
- VDCRBJACQKOSMS-JSGCOSHPSA-N Gly-Phe-Val Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O VDCRBJACQKOSMS-JSGCOSHPSA-N 0.000 description 1
- GGLIDLCEPDHEJO-BQBZGAKWSA-N Gly-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)CN GGLIDLCEPDHEJO-BQBZGAKWSA-N 0.000 description 1
- BMWFDYIYBAFROD-WPRPVWTQSA-N Gly-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN BMWFDYIYBAFROD-WPRPVWTQSA-N 0.000 description 1
- ZZWUYQXMIFTIIY-WEDXCCLWSA-N Gly-Thr-Leu Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O ZZWUYQXMIFTIIY-WEDXCCLWSA-N 0.000 description 1
- LLWQVJNHMYBLLK-CDMKHQONSA-N Gly-Thr-Phe Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LLWQVJNHMYBLLK-CDMKHQONSA-N 0.000 description 1
- FOKISINOENBSDM-WLTAIBSBSA-N Gly-Thr-Tyr Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O FOKISINOENBSDM-WLTAIBSBSA-N 0.000 description 1
- GULGDABMYTYMJZ-STQMWFEESA-N Gly-Trp-Asp Chemical compound [H]NCC(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(O)=O)C(O)=O GULGDABMYTYMJZ-STQMWFEESA-N 0.000 description 1
- UMRIXLHPZZIOML-OALUTQOASA-N Gly-Trp-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)NC(=O)CN UMRIXLHPZZIOML-OALUTQOASA-N 0.000 description 1
- PNUFMLXHOLFRLD-KBPBESRZSA-N Gly-Tyr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 PNUFMLXHOLFRLD-KBPBESRZSA-N 0.000 description 1
- GBYYQVBXFVDJPJ-WLTAIBSBSA-N Gly-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)CN)O GBYYQVBXFVDJPJ-WLTAIBSBSA-N 0.000 description 1
- GWCJMBNBFYBQCV-XPUUQOCRSA-N Gly-Val-Ala Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O GWCJMBNBFYBQCV-XPUUQOCRSA-N 0.000 description 1
- ZVXMEWXHFBYJPI-LSJOCFKGSA-N Gly-Val-Ile Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ZVXMEWXHFBYJPI-LSJOCFKGSA-N 0.000 description 1
- COZMNNJEGNPDED-HOCLYGCPSA-N Gly-Val-Trp Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O COZMNNJEGNPDED-HOCLYGCPSA-N 0.000 description 1
- IZVICCORZOSGPT-JSGCOSHPSA-N Gly-Val-Tyr Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O IZVICCORZOSGPT-JSGCOSHPSA-N 0.000 description 1
- KSOBNUBCYHGUKH-UWVGGRQHSA-N Gly-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)CN KSOBNUBCYHGUKH-UWVGGRQHSA-N 0.000 description 1
- RVKIPWVMZANZLI-UHFFFAOYSA-N H-Lys-Trp-OH Natural products C1=CC=C2C(CC(NC(=O)C(N)CCCCN)C(O)=O)=CNC2=C1 RVKIPWVMZANZLI-UHFFFAOYSA-N 0.000 description 1
- 241001149669 Hanseniaspora Species 0.000 description 1
- XINDHUAGVGCNSF-QSFUFRPTSA-N His-Ala-Ile Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O XINDHUAGVGCNSF-QSFUFRPTSA-N 0.000 description 1
- AWASVTXPTOLPPP-MBLNEYKQSA-N His-Ala-Thr Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O AWASVTXPTOLPPP-MBLNEYKQSA-N 0.000 description 1
- ZZLWLWSUIBSMNP-CIUDSAMLSA-N His-Asp-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O ZZLWLWSUIBSMNP-CIUDSAMLSA-N 0.000 description 1
- UPGJWSUYENXOPV-HGNGGELXSA-N His-Gln-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CN=CN1)N UPGJWSUYENXOPV-HGNGGELXSA-N 0.000 description 1
- HVCRQRQPIIRNLY-IUCAKERBSA-N His-Gln-Gly Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)NCC(=O)O)N HVCRQRQPIIRNLY-IUCAKERBSA-N 0.000 description 1
- YTKOTXRIWQHSAZ-GUBZILKMSA-N His-Glu-Cys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N YTKOTXRIWQHSAZ-GUBZILKMSA-N 0.000 description 1
- AKEDPWJFQULLPE-IUCAKERBSA-N His-Glu-Gly Chemical compound N[C@@H](Cc1cnc[nH]1)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O AKEDPWJFQULLPE-IUCAKERBSA-N 0.000 description 1
- VTMLJMNQHKBPON-QWRGUYRKSA-N His-Gly-His Chemical compound C([C@H](N)C(=O)NCC(=O)N[C@@H](CC=1NC=NC=1)C(O)=O)C1=CN=CN1 VTMLJMNQHKBPON-QWRGUYRKSA-N 0.000 description 1
- CTGZVVQVIBSOBB-AVGNSLFASA-N His-His-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(O)=O CTGZVVQVIBSOBB-AVGNSLFASA-N 0.000 description 1
- NDKSHNQINMRKHT-PEXQALLHSA-N His-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC1=CN=CN1)N NDKSHNQINMRKHT-PEXQALLHSA-N 0.000 description 1
- MFQVZYSPCIZFMR-MGHWNKPDSA-N His-Ile-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N MFQVZYSPCIZFMR-MGHWNKPDSA-N 0.000 description 1
- BXOLYFJYQQRQDJ-MXAVVETBSA-N His-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC1=CN=CN1)N BXOLYFJYQQRQDJ-MXAVVETBSA-N 0.000 description 1
- SKOKHBGDXGTDDP-MELADBBJSA-N His-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N SKOKHBGDXGTDDP-MELADBBJSA-N 0.000 description 1
- PGRPSOUCWRBWKZ-DLOVCJGASA-N His-Lys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CN=CN1 PGRPSOUCWRBWKZ-DLOVCJGASA-N 0.000 description 1
- GJMHMDKCJPQJOI-IHRRRGAJSA-N His-Lys-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CN=CN1 GJMHMDKCJPQJOI-IHRRRGAJSA-N 0.000 description 1
- JUIOPCXACJLRJK-AVGNSLFASA-N His-Lys-Glu Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N JUIOPCXACJLRJK-AVGNSLFASA-N 0.000 description 1
- MVZASEMJYJPJSI-IHPCNDPISA-N His-Lys-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC3=CN=CN3)N MVZASEMJYJPJSI-IHPCNDPISA-N 0.000 description 1
- HJUPAYWVVVRYFQ-PYJNHQTQSA-N His-Met-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CC1=CN=CN1)N HJUPAYWVVVRYFQ-PYJNHQTQSA-N 0.000 description 1
- SAPLASXFNUYUFE-CQDKDKBSSA-N His-Phe-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CC2=CN=CN2)N SAPLASXFNUYUFE-CQDKDKBSSA-N 0.000 description 1
- XJFITURPHAKKAI-SRVKXCTJSA-N His-Pro-Gln Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CCC(N)=O)C(O)=O)C1=CN=CN1 XJFITURPHAKKAI-SRVKXCTJSA-N 0.000 description 1
- FHKZHRMERJUXRJ-DCAQKATOSA-N His-Ser-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CN=CN1 FHKZHRMERJUXRJ-DCAQKATOSA-N 0.000 description 1
- WKEABZIITNXXQZ-CIUDSAMLSA-N His-Ser-Cys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N WKEABZIITNXXQZ-CIUDSAMLSA-N 0.000 description 1
- VIJMRAIWYWRXSR-CIUDSAMLSA-N His-Ser-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CN=CN1 VIJMRAIWYWRXSR-CIUDSAMLSA-N 0.000 description 1
- FFKJUTZARGRVTH-KKUMJFAQSA-N His-Ser-Tyr Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O FFKJUTZARGRVTH-KKUMJFAQSA-N 0.000 description 1
- 101000916644 Homo sapiens Macrophage colony-stimulating factor 1 receptor Proteins 0.000 description 1
- UGQMRVRMYYASKQ-UHFFFAOYSA-N Hypoxanthine nucleoside Natural products OC1C(O)C(CO)OC1N1C(NC=NC2=O)=C2N=C1 UGQMRVRMYYASKQ-UHFFFAOYSA-N 0.000 description 1
- JXUGDUWBMKIJDC-NAKRPEOUSA-N Ile-Ala-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O JXUGDUWBMKIJDC-NAKRPEOUSA-N 0.000 description 1
- JRHFQUPIZOYKQP-KBIXCLLPSA-N Ile-Ala-Glu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O JRHFQUPIZOYKQP-KBIXCLLPSA-N 0.000 description 1
- AQCUAZTZSPQJFF-ZKWXMUAHSA-N Ile-Ala-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O AQCUAZTZSPQJFF-ZKWXMUAHSA-N 0.000 description 1
- WUEIUSDAECDLQO-NAKRPEOUSA-N Ile-Ala-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)O)N WUEIUSDAECDLQO-NAKRPEOUSA-N 0.000 description 1
- LVQDUPQUJZWKSU-PYJNHQTQSA-N Ile-Arg-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N LVQDUPQUJZWKSU-PYJNHQTQSA-N 0.000 description 1
- QYZYJFXHXYUZMZ-UGYAYLCHSA-N Ile-Asn-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N QYZYJFXHXYUZMZ-UGYAYLCHSA-N 0.000 description 1
- NCSIQAFSIPHVAN-IUKAMOBKSA-N Ile-Asn-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N NCSIQAFSIPHVAN-IUKAMOBKSA-N 0.000 description 1
- UDLAWRKOVFDKFL-PEFMBERDSA-N Ile-Asp-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N UDLAWRKOVFDKFL-PEFMBERDSA-N 0.000 description 1
- JQLFYZMEXFNRFS-DJFWLOJKSA-N Ile-Asp-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N JQLFYZMEXFNRFS-DJFWLOJKSA-N 0.000 description 1
- BGZIJZJBXRVBGJ-SXTJYALSSA-N Ile-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N BGZIJZJBXRVBGJ-SXTJYALSSA-N 0.000 description 1
- LLZLRXBTOOFODM-QSFUFRPTSA-N Ile-Asp-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](C(C)C)C(=O)O)N LLZLRXBTOOFODM-QSFUFRPTSA-N 0.000 description 1
- FADXGVVLSPPEQY-GHCJXIJMSA-N Ile-Cys-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)N)C(=O)O)N FADXGVVLSPPEQY-GHCJXIJMSA-N 0.000 description 1
- HOLOYAZCIHDQNS-YVNDNENWSA-N Ile-Gln-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N HOLOYAZCIHDQNS-YVNDNENWSA-N 0.000 description 1
- CYHJCEKUMCNDFG-LAEOZQHASA-N Ile-Gln-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)NCC(=O)O)N CYHJCEKUMCNDFG-LAEOZQHASA-N 0.000 description 1
- WNQKUUQIVDDAFA-ZPFDUUQYSA-N Ile-Gln-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCSC)C(=O)O)N WNQKUUQIVDDAFA-ZPFDUUQYSA-N 0.000 description 1
- LPXHYGGZJOCAFR-MNXVOIDGSA-N Ile-Glu-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N LPXHYGGZJOCAFR-MNXVOIDGSA-N 0.000 description 1
- SLQVFYWBGNNOTK-BYULHYEWSA-N Ile-Gly-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)N)C(=O)O)N SLQVFYWBGNNOTK-BYULHYEWSA-N 0.000 description 1
- ODPKZZLRDNXTJZ-WHOFXGATSA-N Ile-Gly-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N ODPKZZLRDNXTJZ-WHOFXGATSA-N 0.000 description 1
- KOPIAUWNLKKELG-SIGLWIIPSA-N Ile-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N KOPIAUWNLKKELG-SIGLWIIPSA-N 0.000 description 1
- LNJLOZYNZFGJMM-DEQVHRJGSA-N Ile-His-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N2CCC[C@@H]2C(=O)O)N LNJLOZYNZFGJMM-DEQVHRJGSA-N 0.000 description 1
- URWXDJAEEGBADB-TUBUOCAGSA-N Ile-His-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N URWXDJAEEGBADB-TUBUOCAGSA-N 0.000 description 1
- FZWVCYCYWCLQDH-NHCYSSNCSA-N Ile-Leu-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)NCC(=O)O)N FZWVCYCYWCLQDH-NHCYSSNCSA-N 0.000 description 1
- RQQCJTLBSJMVCR-DSYPUSFNSA-N Ile-Leu-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N RQQCJTLBSJMVCR-DSYPUSFNSA-N 0.000 description 1
- RMNMUUCYTMLWNA-ZPFDUUQYSA-N Ile-Lys-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)O)C(=O)O)N RMNMUUCYTMLWNA-ZPFDUUQYSA-N 0.000 description 1
- RFMDODRWJZHZCR-BJDJZHNGSA-N Ile-Lys-Cys Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CS)C(O)=O RFMDODRWJZHZCR-BJDJZHNGSA-N 0.000 description 1
- GVNNAHIRSDRIII-AJNGGQMLSA-N Ile-Lys-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)O)N GVNNAHIRSDRIII-AJNGGQMLSA-N 0.000 description 1
- IMRKCLXPYOIHIF-ZPFDUUQYSA-N Ile-Met-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N IMRKCLXPYOIHIF-ZPFDUUQYSA-N 0.000 description 1
- RCMNUBZKIIJCOI-ZPFDUUQYSA-N Ile-Met-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N RCMNUBZKIIJCOI-ZPFDUUQYSA-N 0.000 description 1
- XLXPYSDGMXTTNQ-DKIMLUQUSA-N Ile-Phe-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@@H](CC(C)C)C(O)=O XLXPYSDGMXTTNQ-DKIMLUQUSA-N 0.000 description 1
- XLXPYSDGMXTTNQ-UHFFFAOYSA-N Ile-Phe-Leu Natural products CCC(C)C(N)C(=O)NC(C(=O)NC(CC(C)C)C(O)=O)CC1=CC=CC=C1 XLXPYSDGMXTTNQ-UHFFFAOYSA-N 0.000 description 1
- FGBRXCZYVRFNKQ-MXAVVETBSA-N Ile-Phe-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)O)N FGBRXCZYVRFNKQ-MXAVVETBSA-N 0.000 description 1
- VEPIBPGLTLPBDW-URLPEUOOSA-N Ile-Phe-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N VEPIBPGLTLPBDW-URLPEUOOSA-N 0.000 description 1
- XQLGNKLSPYCRMZ-HJWJTTGWSA-N Ile-Phe-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(=O)O)N XQLGNKLSPYCRMZ-HJWJTTGWSA-N 0.000 description 1
- BATWGBRIZANGPN-ZPFDUUQYSA-N Ile-Pro-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(=O)N)C(=O)O)N BATWGBRIZANGPN-ZPFDUUQYSA-N 0.000 description 1
- CAHCWMVNBZJVAW-NAKRPEOUSA-N Ile-Pro-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)O)N CAHCWMVNBZJVAW-NAKRPEOUSA-N 0.000 description 1
- JHNJNTMTZHEDLJ-NAKRPEOUSA-N Ile-Ser-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O JHNJNTMTZHEDLJ-NAKRPEOUSA-N 0.000 description 1
- JZNVOBUNTWNZPW-GHCJXIJMSA-N Ile-Ser-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)O)C(=O)O)N JZNVOBUNTWNZPW-GHCJXIJMSA-N 0.000 description 1
- YCKPUHHMCFSUMD-IUKAMOBKSA-N Ile-Thr-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N YCKPUHHMCFSUMD-IUKAMOBKSA-N 0.000 description 1
- NAFIFZNBSPWYOO-RWRJDSDZSA-N Ile-Thr-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N NAFIFZNBSPWYOO-RWRJDSDZSA-N 0.000 description 1
- FXJLRZFMKGHYJP-CFMVVWHZSA-N Ile-Tyr-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N FXJLRZFMKGHYJP-CFMVVWHZSA-N 0.000 description 1
- BCISUQVFDGYZBO-QSFUFRPTSA-N Ile-Val-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC(O)=O BCISUQVFDGYZBO-QSFUFRPTSA-N 0.000 description 1
- AUIYHFRUOOKTGX-UKJIMTQDSA-N Ile-Val-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N AUIYHFRUOOKTGX-UKJIMTQDSA-N 0.000 description 1
- KXUKTDGKLAOCQK-LSJOCFKGSA-N Ile-Val-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O KXUKTDGKLAOCQK-LSJOCFKGSA-N 0.000 description 1
- YHFPHRUWZMEOIX-CYDGBPFRSA-N Ile-Val-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(=O)O)N YHFPHRUWZMEOIX-CYDGBPFRSA-N 0.000 description 1
- 229930010555 Inosine Natural products 0.000 description 1
- UGQMRVRMYYASKQ-KQYNXXCUSA-N Inosine Chemical compound O[C@@H]1[C@H](O)[C@@H](CO)O[C@H]1N1C2=NC=NC(O)=C2N=C1 UGQMRVRMYYASKQ-KQYNXXCUSA-N 0.000 description 1
- 108091092195 Intron Proteins 0.000 description 1
- FADYJNXDPBKVCA-UHFFFAOYSA-N L-Phenylalanyl-L-lysin Natural products NCCCCC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FADYJNXDPBKVCA-UHFFFAOYSA-N 0.000 description 1
- SITWEMZOJNKJCH-UHFFFAOYSA-N L-alanine-L-arginine Natural products CC(N)C(=O)NC(C(O)=O)CCCNC(N)=N SITWEMZOJNKJCH-UHFFFAOYSA-N 0.000 description 1
- CKLJMWTZIZZHCS-REOHCLBHSA-N L-aspartic acid Chemical compound OC(=O)[C@@H](N)CC(O)=O CKLJMWTZIZZHCS-REOHCLBHSA-N 0.000 description 1
- 125000003338 L-glutaminyl group Chemical group O=C([*])[C@](N([H])[H])([H])C([H])([H])C([H])([H])C(=O)N([H])[H] 0.000 description 1
- LHSGPCFBGJHPCY-UHFFFAOYSA-N L-leucine-L-tyrosine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 LHSGPCFBGJHPCY-UHFFFAOYSA-N 0.000 description 1
- TYYLDKGBCJGJGW-UHFFFAOYSA-N L-tryptophan-L-tyrosine Natural products C=1NC2=CC=CC=C2C=1CC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 TYYLDKGBCJGJGW-UHFFFAOYSA-N 0.000 description 1
- 108091026898 Leader sequence (mRNA) Proteins 0.000 description 1
- 208000004554 Leishmaniasis Diseases 0.000 description 1
- CQQGCWPXDHTTNF-GUBZILKMSA-N Leu-Ala-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O CQQGCWPXDHTTNF-GUBZILKMSA-N 0.000 description 1
- WNGVUZWBXZKQES-YUMQZZPRSA-N Leu-Ala-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O WNGVUZWBXZKQES-YUMQZZPRSA-N 0.000 description 1
- DQPQTXMIRBUWKO-DCAQKATOSA-N Leu-Ala-Met Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC(C)C)N DQPQTXMIRBUWKO-DCAQKATOSA-N 0.000 description 1
- BQSLGJHIAGOZCD-CIUDSAMLSA-N Leu-Ala-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O BQSLGJHIAGOZCD-CIUDSAMLSA-N 0.000 description 1
- KSZCCRIGNVSHFH-UWVGGRQHSA-N Leu-Arg-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O KSZCCRIGNVSHFH-UWVGGRQHSA-N 0.000 description 1
- QUAAUWNLWMLERT-IHRRRGAJSA-N Leu-Arg-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(C)C)C(O)=O QUAAUWNLWMLERT-IHRRRGAJSA-N 0.000 description 1
- YOZCKMXHBYKOMQ-IHRRRGAJSA-N Leu-Arg-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O)N YOZCKMXHBYKOMQ-IHRRRGAJSA-N 0.000 description 1
- VKOAHIRLIUESLU-ULQDDVLXSA-N Leu-Arg-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O VKOAHIRLIUESLU-ULQDDVLXSA-N 0.000 description 1
- WUFYAPWIHCUMLL-CIUDSAMLSA-N Leu-Asn-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O WUFYAPWIHCUMLL-CIUDSAMLSA-N 0.000 description 1
- IGUOAYLTQJLPPD-DCAQKATOSA-N Leu-Asn-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IGUOAYLTQJLPPD-DCAQKATOSA-N 0.000 description 1
- WGNOPSQMIQERPK-UHFFFAOYSA-N Leu-Asn-Pro Natural products CC(C)CC(N)C(=O)NC(CC(=O)N)C(=O)N1CCCC1C(=O)O WGNOPSQMIQERPK-UHFFFAOYSA-N 0.000 description 1
- DLFAACQHIRSQGG-CIUDSAMLSA-N Leu-Asp-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O DLFAACQHIRSQGG-CIUDSAMLSA-N 0.000 description 1
- MMEDVBWCMGRKKC-GARJFASQSA-N Leu-Asp-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N MMEDVBWCMGRKKC-GARJFASQSA-N 0.000 description 1
- KAFOIVJDVSZUMD-UHFFFAOYSA-N Leu-Gln-Gln Natural products CC(C)CC(N)C(=O)NC(CCC(N)=O)C(=O)NC(CCC(N)=O)C(O)=O KAFOIVJDVSZUMD-UHFFFAOYSA-N 0.000 description 1
- DZQMXBALGUHGJT-GUBZILKMSA-N Leu-Glu-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O DZQMXBALGUHGJT-GUBZILKMSA-N 0.000 description 1
- RVVBWTWPNFDYBE-SRVKXCTJSA-N Leu-Glu-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O RVVBWTWPNFDYBE-SRVKXCTJSA-N 0.000 description 1
- NEEOBPIXKWSBRF-IUCAKERBSA-N Leu-Glu-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O NEEOBPIXKWSBRF-IUCAKERBSA-N 0.000 description 1
- HQUXQAMSWFIRET-AVGNSLFASA-N Leu-Glu-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN HQUXQAMSWFIRET-AVGNSLFASA-N 0.000 description 1
- WQWSMEOYXJTFRU-GUBZILKMSA-N Leu-Glu-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O WQWSMEOYXJTFRU-GUBZILKMSA-N 0.000 description 1
- LAPSXOAUPNOINL-YUMQZZPRSA-N Leu-Gly-Asp Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O LAPSXOAUPNOINL-YUMQZZPRSA-N 0.000 description 1
- POZULHZYLPGXMR-ONGXEEELSA-N Leu-Gly-Val Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O POZULHZYLPGXMR-ONGXEEELSA-N 0.000 description 1
- KXODZBLFVFSLAI-AVGNSLFASA-N Leu-His-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(C)C)CC1=CN=CN1 KXODZBLFVFSLAI-AVGNSLFASA-N 0.000 description 1
- CFZZDVMBRYFFNU-QWRGUYRKSA-N Leu-His-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)NCC(O)=O CFZZDVMBRYFFNU-QWRGUYRKSA-N 0.000 description 1
- OYQUOLRTJHWVSQ-SRVKXCTJSA-N Leu-His-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O OYQUOLRTJHWVSQ-SRVKXCTJSA-N 0.000 description 1
- OHZIZVWQXJPBJS-IXOXFDKPSA-N Leu-His-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OHZIZVWQXJPBJS-IXOXFDKPSA-N 0.000 description 1
- USLNHQZCDQJBOV-ZPFDUUQYSA-N Leu-Ile-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O USLNHQZCDQJBOV-ZPFDUUQYSA-N 0.000 description 1
- SEMUSFOBZGKBGW-YTFOTSKYSA-N Leu-Ile-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O SEMUSFOBZGKBGW-YTFOTSKYSA-N 0.000 description 1
- HNDWYLYAYNBWMP-AJNGGQMLSA-N Leu-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(C)C)N HNDWYLYAYNBWMP-AJNGGQMLSA-N 0.000 description 1
- TVEOVCYCYGKVPP-HSCHXYMDSA-N Leu-Ile-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CC(C)C)N TVEOVCYCYGKVPP-HSCHXYMDSA-N 0.000 description 1
- NRFGTHFONZYFNY-MGHWNKPDSA-N Leu-Ile-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 NRFGTHFONZYFNY-MGHWNKPDSA-N 0.000 description 1
- RZXLZBIUTDQHJQ-SRVKXCTJSA-N Leu-Lys-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O RZXLZBIUTDQHJQ-SRVKXCTJSA-N 0.000 description 1
- LVTJJOJKDCVZGP-QWRGUYRKSA-N Leu-Lys-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O LVTJJOJKDCVZGP-QWRGUYRKSA-N 0.000 description 1
- REPBGZHJKYWFMJ-KKUMJFAQSA-N Leu-Lys-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N REPBGZHJKYWFMJ-KKUMJFAQSA-N 0.000 description 1
- FKQPWMZLIIATBA-AJNGGQMLSA-N Leu-Lys-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FKQPWMZLIIATBA-AJNGGQMLSA-N 0.000 description 1
- RTIRBWJPYJYTLO-MELADBBJSA-N Leu-Lys-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(=O)O)N RTIRBWJPYJYTLO-MELADBBJSA-N 0.000 description 1
- VCHVSKNMTXWIIP-SRVKXCTJSA-N Leu-Lys-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O VCHVSKNMTXWIIP-SRVKXCTJSA-N 0.000 description 1
- ONPJGOIVICHWBW-BZSNNMDCSA-N Leu-Lys-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 ONPJGOIVICHWBW-BZSNNMDCSA-N 0.000 description 1
- LQUIENKUVKPNIC-ULQDDVLXSA-N Leu-Met-Tyr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O LQUIENKUVKPNIC-ULQDDVLXSA-N 0.000 description 1
- YESNGRDJQWDYLH-KKUMJFAQSA-N Leu-Phe-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CS)C(=O)O)N YESNGRDJQWDYLH-KKUMJFAQSA-N 0.000 description 1
- KTOIECMYZZGVSI-BZSNNMDCSA-N Leu-Phe-His Chemical compound C([C@H](NC(=O)[C@@H](N)CC(C)C)C(=O)N[C@@H](CC=1NC=NC=1)C(O)=O)C1=CC=CC=C1 KTOIECMYZZGVSI-BZSNNMDCSA-N 0.000 description 1
- PJWOOBTYQNNRBF-BZSNNMDCSA-N Leu-Phe-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)O)N PJWOOBTYQNNRBF-BZSNNMDCSA-N 0.000 description 1
- XWEVVRRSIOBJOO-SRVKXCTJSA-N Leu-Pro-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O XWEVVRRSIOBJOO-SRVKXCTJSA-N 0.000 description 1
- UCBPDSYUVAAHCD-UWVGGRQHSA-N Leu-Pro-Gly Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O UCBPDSYUVAAHCD-UWVGGRQHSA-N 0.000 description 1
- YRRCOJOXAJNSAX-IHRRRGAJSA-N Leu-Pro-Lys Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)O)N YRRCOJOXAJNSAX-IHRRRGAJSA-N 0.000 description 1
- PWPBLZXWFXJFHE-RHYQMDGZSA-N Leu-Pro-Thr Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O PWPBLZXWFXJFHE-RHYQMDGZSA-N 0.000 description 1
- KZZCOWMDDXDKSS-CIUDSAMLSA-N Leu-Ser-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O KZZCOWMDDXDKSS-CIUDSAMLSA-N 0.000 description 1
- SBANPBVRHYIMRR-UHFFFAOYSA-N Leu-Ser-Pro Natural products CC(C)CC(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O SBANPBVRHYIMRR-UHFFFAOYSA-N 0.000 description 1
- LFSQWRSVPNKJGP-WDCWCFNPSA-N Leu-Thr-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCC(O)=O LFSQWRSVPNKJGP-WDCWCFNPSA-N 0.000 description 1
- GZRABTMNWJXFMH-UVOCVTCTSA-N Leu-Thr-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GZRABTMNWJXFMH-UVOCVTCTSA-N 0.000 description 1
- BCUVPZLLSRMPJL-XIRDDKMYSA-N Leu-Trp-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CS)C(=O)O)N BCUVPZLLSRMPJL-XIRDDKMYSA-N 0.000 description 1
- VJGQRELPQWNURN-JYJNAYRXSA-N Leu-Tyr-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O VJGQRELPQWNURN-JYJNAYRXSA-N 0.000 description 1
- JGKHAFUAPZCCDU-BZSNNMDCSA-N Leu-Tyr-Leu Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C([O-])=O)CC1=CC=C(O)C=C1 JGKHAFUAPZCCDU-BZSNNMDCSA-N 0.000 description 1
- BGGTYDNTOYRTTR-MEYUZBJRSA-N Leu-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC(C)C)N)O BGGTYDNTOYRTTR-MEYUZBJRSA-N 0.000 description 1
- VQHUBNVKFFLWRP-ULQDDVLXSA-N Leu-Tyr-Val Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CC1=CC=C(O)C=C1 VQHUBNVKFFLWRP-ULQDDVLXSA-N 0.000 description 1
- FBNPMTNBFFAMMH-UHFFFAOYSA-N Leu-Val-Arg Natural products CC(C)CC(N)C(=O)NC(C(C)C)C(=O)NC(C(O)=O)CCCN=C(N)N FBNPMTNBFFAMMH-UHFFFAOYSA-N 0.000 description 1
- CGHXMODRYJISSK-NHCYSSNCSA-N Leu-Val-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC(O)=O CGHXMODRYJISSK-NHCYSSNCSA-N 0.000 description 1
- AIMGJYMCTAABEN-GVXVVHGQSA-N Leu-Val-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O AIMGJYMCTAABEN-GVXVVHGQSA-N 0.000 description 1
- YQFZRHYZLARWDY-IHRRRGAJSA-N Leu-Val-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN YQFZRHYZLARWDY-IHRRRGAJSA-N 0.000 description 1
- VKVDRTGWLVZJOM-DCAQKATOSA-N Leu-Val-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O VKVDRTGWLVZJOM-DCAQKATOSA-N 0.000 description 1
- QESXLSQLQHHTIX-RHYQMDGZSA-N Leu-Val-Thr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QESXLSQLQHHTIX-RHYQMDGZSA-N 0.000 description 1
- 229910009891 LiAc Inorganic materials 0.000 description 1
- 241001149698 Lipomyces Species 0.000 description 1
- VHXMZJGOKIMETG-CQDKDKBSSA-N Lys-Ala-Tyr Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CCCCN)N VHXMZJGOKIMETG-CQDKDKBSSA-N 0.000 description 1
- NTSPQIONFJUMJV-AVGNSLFASA-N Lys-Arg-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O NTSPQIONFJUMJV-AVGNSLFASA-N 0.000 description 1
- DEFGUIIUYAUEDU-ZPFDUUQYSA-N Lys-Asn-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O DEFGUIIUYAUEDU-ZPFDUUQYSA-N 0.000 description 1
- LZWNAOIMTLNMDW-NHCYSSNCSA-N Lys-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCCN)N LZWNAOIMTLNMDW-NHCYSSNCSA-N 0.000 description 1
- OVIVOCSURJYCTM-GUBZILKMSA-N Lys-Asp-Glu Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(O)=O OVIVOCSURJYCTM-GUBZILKMSA-N 0.000 description 1
- WGCKDDHUFPQSMZ-ZPFDUUQYSA-N Lys-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCCCN WGCKDDHUFPQSMZ-ZPFDUUQYSA-N 0.000 description 1
- LMVOVCYVZBBWQB-SRVKXCTJSA-N Lys-Asp-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN LMVOVCYVZBBWQB-SRVKXCTJSA-N 0.000 description 1
- KWUKZRFFKPLUPE-HJGDQZAQSA-N Lys-Asp-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KWUKZRFFKPLUPE-HJGDQZAQSA-N 0.000 description 1
- MQMIRLVJXQNTRJ-SDDRHHMPSA-N Lys-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCCN)N)C(=O)O MQMIRLVJXQNTRJ-SDDRHHMPSA-N 0.000 description 1
- GJJQCBVRWDGLMQ-GUBZILKMSA-N Lys-Glu-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O GJJQCBVRWDGLMQ-GUBZILKMSA-N 0.000 description 1
- PBIPLDMFHAICIP-DCAQKATOSA-N Lys-Glu-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O PBIPLDMFHAICIP-DCAQKATOSA-N 0.000 description 1
- GQFDWEDHOQRNLC-QWRGUYRKSA-N Lys-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCCCN GQFDWEDHOQRNLC-QWRGUYRKSA-N 0.000 description 1
- PRCHKVGXZVTALR-KKUMJFAQSA-N Lys-His-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CCCCN)N PRCHKVGXZVTALR-KKUMJFAQSA-N 0.000 description 1
- IVFUVMSKSFSFBT-NHCYSSNCSA-N Lys-Ile-Gly Chemical compound OC(=O)CNC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCCCN IVFUVMSKSFSFBT-NHCYSSNCSA-N 0.000 description 1
- ZXFRGTAIIZHNHG-AJNGGQMLSA-N Lys-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CCCCN)N ZXFRGTAIIZHNHG-AJNGGQMLSA-N 0.000 description 1
- PINHPJWGVBKQII-SRVKXCTJSA-N Lys-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCCN)N PINHPJWGVBKQII-SRVKXCTJSA-N 0.000 description 1
- SKRGVGLIRUGANF-AVGNSLFASA-N Lys-Leu-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SKRGVGLIRUGANF-AVGNSLFASA-N 0.000 description 1
- WRODMZBHNNPRLN-SRVKXCTJSA-N Lys-Leu-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O WRODMZBHNNPRLN-SRVKXCTJSA-N 0.000 description 1
- RIJCHEVHFWMDKD-SRVKXCTJSA-N Lys-Lys-Asn Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O RIJCHEVHFWMDKD-SRVKXCTJSA-N 0.000 description 1
- YUAXTFMFMOIMAM-QWRGUYRKSA-N Lys-Lys-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O YUAXTFMFMOIMAM-QWRGUYRKSA-N 0.000 description 1
- YXPJCVNIDDKGOE-MELADBBJSA-N Lys-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CCCCN)N)C(=O)O YXPJCVNIDDKGOE-MELADBBJSA-N 0.000 description 1
- WKUXWMWQTOYTFI-SRVKXCTJSA-N Lys-Met-Gln Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CCCCN)N WKUXWMWQTOYTFI-SRVKXCTJSA-N 0.000 description 1
- KFSALEZVQJYHCE-AVGNSLFASA-N Lys-Met-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CCCCN)N KFSALEZVQJYHCE-AVGNSLFASA-N 0.000 description 1
- TWPCWKVOZDUYAA-KKUMJFAQSA-N Lys-Phe-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O TWPCWKVOZDUYAA-KKUMJFAQSA-N 0.000 description 1
- LNMKRJJLEFASGA-BZSNNMDCSA-N Lys-Phe-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O LNMKRJJLEFASGA-BZSNNMDCSA-N 0.000 description 1
- LUAJJLPHUXPQLH-KKUMJFAQSA-N Lys-Phe-Ser Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCCCN)N LUAJJLPHUXPQLH-KKUMJFAQSA-N 0.000 description 1
- CNGOEHJCLVCJHN-SRVKXCTJSA-N Lys-Pro-Glu Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O CNGOEHJCLVCJHN-SRVKXCTJSA-N 0.000 description 1
- PDIDTSZKKFEDMB-UWVGGRQHSA-N Lys-Pro-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O PDIDTSZKKFEDMB-UWVGGRQHSA-N 0.000 description 1
- LECIJRIRMVOFMH-ULQDDVLXSA-N Lys-Pro-Phe Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 LECIJRIRMVOFMH-ULQDDVLXSA-N 0.000 description 1
- YSPZCHGIWAQVKQ-AVGNSLFASA-N Lys-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCCCN YSPZCHGIWAQVKQ-AVGNSLFASA-N 0.000 description 1
- SQXZLVXQXWILKW-KKUMJFAQSA-N Lys-Ser-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SQXZLVXQXWILKW-KKUMJFAQSA-N 0.000 description 1
- UWHCKWNPWKTMBM-WDCWCFNPSA-N Lys-Thr-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(O)=O UWHCKWNPWKTMBM-WDCWCFNPSA-N 0.000 description 1
- DLCAXBGXGOVUCD-PPCPHDFISA-N Lys-Thr-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O DLCAXBGXGOVUCD-PPCPHDFISA-N 0.000 description 1
- SEZADXQOJJTXPG-VFAJRCTISA-N Lys-Thr-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CCCCN)N)O SEZADXQOJJTXPG-VFAJRCTISA-N 0.000 description 1
- VHTOGMKQXXJOHG-RHYQMDGZSA-N Lys-Thr-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O VHTOGMKQXXJOHG-RHYQMDGZSA-N 0.000 description 1
- KXYLFJIQDIMURW-IHPCNDPISA-N Lys-Trp-Leu Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)[C@@H](N)CCCCN)=CNC2=C1 KXYLFJIQDIMURW-IHPCNDPISA-N 0.000 description 1
- YUTZYVTZDVZBJJ-IHPCNDPISA-N Lys-Trp-Lys Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O)=CNC2=C1 YUTZYVTZDVZBJJ-IHPCNDPISA-N 0.000 description 1
- GVKINWYYLOLEFQ-XIRDDKMYSA-N Lys-Trp-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CO)C(O)=O GVKINWYYLOLEFQ-XIRDDKMYSA-N 0.000 description 1
- VVURYEVJJTXWNE-ULQDDVLXSA-N Lys-Tyr-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O VVURYEVJJTXWNE-ULQDDVLXSA-N 0.000 description 1
- TXTZMVNJIRZABH-ULQDDVLXSA-N Lys-Val-Phe Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 TXTZMVNJIRZABH-ULQDDVLXSA-N 0.000 description 1
- HMZPYMSEAALNAE-ULQDDVLXSA-N Lys-Val-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O HMZPYMSEAALNAE-ULQDDVLXSA-N 0.000 description 1
- IKXQOBUBZSOWDY-AVGNSLFASA-N Lys-Val-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CCCCN)N IKXQOBUBZSOWDY-AVGNSLFASA-N 0.000 description 1
- 101150110199 MEU1 gene Proteins 0.000 description 1
- 241000218922 Magnoliophyta Species 0.000 description 1
- BLIPQDLSCFGUFA-GUBZILKMSA-N Met-Arg-Asn Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O BLIPQDLSCFGUFA-GUBZILKMSA-N 0.000 description 1
- GODBLDDYHFTUAH-CIUDSAMLSA-N Met-Asp-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(O)=O GODBLDDYHFTUAH-CIUDSAMLSA-N 0.000 description 1
- XOMXAVJBLRROMC-IHRRRGAJSA-N Met-Asp-Phe Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 XOMXAVJBLRROMC-IHRRRGAJSA-N 0.000 description 1
- HGKJFNCLOHKEHS-FXQIFTODSA-N Met-Cys-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CC(O)=O HGKJFNCLOHKEHS-FXQIFTODSA-N 0.000 description 1
- HLYIDXAXQIJYIG-CIUDSAMLSA-N Met-Gln-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O HLYIDXAXQIJYIG-CIUDSAMLSA-N 0.000 description 1
- YORIKIDJCPKBON-YUMQZZPRSA-N Met-Glu-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O YORIKIDJCPKBON-YUMQZZPRSA-N 0.000 description 1
- VZBXCMCHIHEPBL-SRVKXCTJSA-N Met-Glu-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN VZBXCMCHIHEPBL-SRVKXCTJSA-N 0.000 description 1
- YCUSPBPZVJDMII-YUMQZZPRSA-N Met-Gly-Glu Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O YCUSPBPZVJDMII-YUMQZZPRSA-N 0.000 description 1
- UZWMJZSOXGOVIN-LURJTMIESA-N Met-Gly-Gly Chemical compound CSCC[C@H](N)C(=O)NCC(=O)NCC(O)=O UZWMJZSOXGOVIN-LURJTMIESA-N 0.000 description 1
- FZUNSVYYPYJYAP-NAKRPEOUSA-N Met-Ile-Ala Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O FZUNSVYYPYJYAP-NAKRPEOUSA-N 0.000 description 1
- RVYDCISQIGHAFC-ZPFDUUQYSA-N Met-Ile-Gln Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O RVYDCISQIGHAFC-ZPFDUUQYSA-N 0.000 description 1
- HWROAFGWPQUPTE-OSUNSFLBSA-N Met-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CCSC)N HWROAFGWPQUPTE-OSUNSFLBSA-N 0.000 description 1
- UROWNMBTQGGTHB-DCAQKATOSA-N Met-Leu-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O UROWNMBTQGGTHB-DCAQKATOSA-N 0.000 description 1
- HZVXPUHLTZRQEL-UWVGGRQHSA-N Met-Leu-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O HZVXPUHLTZRQEL-UWVGGRQHSA-N 0.000 description 1
- WYDFQSJOARJAMM-GUBZILKMSA-N Met-Pro-Asp Chemical compound CSCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O WYDFQSJOARJAMM-GUBZILKMSA-N 0.000 description 1
- QEDGNYFHLXXIDC-DCAQKATOSA-N Met-Pro-Gln Chemical compound CSCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O QEDGNYFHLXXIDC-DCAQKATOSA-N 0.000 description 1
- DSZFTPCSFVWMKP-DCAQKATOSA-N Met-Ser-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN DSZFTPCSFVWMKP-DCAQKATOSA-N 0.000 description 1
- KYXDADPHSNFWQX-VEVYYDQMSA-N Met-Thr-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(O)=O KYXDADPHSNFWQX-VEVYYDQMSA-N 0.000 description 1
- CIIJWIAORKTXAH-FJXKBIBVSA-N Met-Thr-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O CIIJWIAORKTXAH-FJXKBIBVSA-N 0.000 description 1
- IHRFZLQEQVHXFA-RHYQMDGZSA-N Met-Thr-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCCN IHRFZLQEQVHXFA-RHYQMDGZSA-N 0.000 description 1
- QZUCCDSNETVAIS-RYQLBKOJSA-N Met-Trp-Pro Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N3CCC[C@@H]3C(=O)O)N QZUCCDSNETVAIS-RYQLBKOJSA-N 0.000 description 1
- FZDOBWIKRQORAC-ULQDDVLXSA-N Met-Tyr-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CCSC)N FZDOBWIKRQORAC-ULQDDVLXSA-N 0.000 description 1
- VWFHWJGVLVZVIS-QXEWZRGKSA-N Met-Val-Asn Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O VWFHWJGVLVZVIS-QXEWZRGKSA-N 0.000 description 1
- OVTOTTGZBWXLFU-QXEWZRGKSA-N Met-Val-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC(O)=O OVTOTTGZBWXLFU-QXEWZRGKSA-N 0.000 description 1
- PVSPJQWHEIQTEH-JYJNAYRXSA-N Met-Val-Tyr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 PVSPJQWHEIQTEH-JYJNAYRXSA-N 0.000 description 1
- IQJMEDDVOGMTKT-SRVKXCTJSA-N Met-Val-Val Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O IQJMEDDVOGMTKT-SRVKXCTJSA-N 0.000 description 1
- 241001465754 Metazoa Species 0.000 description 1
- RFMMMVDNIPUKGG-YFKPBYRVSA-N N-acetyl-L-glutamic acid Chemical compound CC(=O)N[C@H](C(O)=O)CCC(O)=O RFMMMVDNIPUKGG-YFKPBYRVSA-N 0.000 description 1
- 108010072610 N-acetyl-gamma-glutamyl-phosphate reductase Proteins 0.000 description 1
- 101710202061 N-acetyltransferase Proteins 0.000 description 1
- AJHCSUXXECOXOY-UHFFFAOYSA-N N-glycyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)CN)C(O)=O)=CNC2=C1 AJHCSUXXECOXOY-UHFFFAOYSA-N 0.000 description 1
- 108010079364 N-glycylalanine Proteins 0.000 description 1
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 1
- 108010087066 N2-tryptophyllysine Proteins 0.000 description 1
- XJLXINKUBYWONI-NNYOXOHSSA-O NADP(+) Chemical compound NC(=O)C1=CC=C[N+]([C@H]2[C@@H]([C@H](O)[C@@H](COP(O)(=O)OP(O)(=O)OC[C@@H]3[C@H]([C@@H](OP(O)(O)=O)[C@@H](O3)N3C4=NC=NC(N)=C4N=C3)O)O2)O)=C1 XJLXINKUBYWONI-NNYOXOHSSA-O 0.000 description 1
- 241000193596 Nadsonia Species 0.000 description 1
- 238000000636 Northern blotting Methods 0.000 description 1
- 239000004677 Nylon Substances 0.000 description 1
- 241000320412 Ogataea angusta Species 0.000 description 1
- 108091034117 Oligonucleotide Proteins 0.000 description 1
- 108020005187 Oligonucleotide Probes Proteins 0.000 description 1
- 102000007981 Ornithine carbamoyltransferase Human genes 0.000 description 1
- 101710113020 Ornithine transcarbamylase, mitochondrial Proteins 0.000 description 1
- 238000012408 PCR amplification Methods 0.000 description 1
- 101150067591 PRS5 gene Proteins 0.000 description 1
- 206010034133 Pathogen resistance Diseases 0.000 description 1
- YRKFKTQRVBJYLT-CQDKDKBSSA-N Phe-Ala-His Chemical compound C([C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC=1NC=NC=1)C(O)=O)C1=CC=CC=C1 YRKFKTQRVBJYLT-CQDKDKBSSA-N 0.000 description 1
- BBDSZDHUCPSYAC-QEJZJMRPSA-N Phe-Ala-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O BBDSZDHUCPSYAC-QEJZJMRPSA-N 0.000 description 1
- WGXOKDLDIWSOCV-MELADBBJSA-N Phe-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O WGXOKDLDIWSOCV-MELADBBJSA-N 0.000 description 1
- XMPUYNHKEPFERE-IHRRRGAJSA-N Phe-Asp-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 XMPUYNHKEPFERE-IHRRRGAJSA-N 0.000 description 1
- ZENDEDYRYVHBEG-SRVKXCTJSA-N Phe-Asp-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 ZENDEDYRYVHBEG-SRVKXCTJSA-N 0.000 description 1
- IUVYJBMTHARMIP-PCBIJLKTSA-N Phe-Asp-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O IUVYJBMTHARMIP-PCBIJLKTSA-N 0.000 description 1
- PDUVELWDJZOUEI-IHRRRGAJSA-N Phe-Cys-Arg Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O PDUVELWDJZOUEI-IHRRRGAJSA-N 0.000 description 1
- OMHMIXFFRPMYHB-SRVKXCTJSA-N Phe-Cys-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)N)C(=O)O)N OMHMIXFFRPMYHB-SRVKXCTJSA-N 0.000 description 1
- OWCLJDXHHZUNEL-IHRRRGAJSA-N Phe-Cys-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(O)=O OWCLJDXHHZUNEL-IHRRRGAJSA-N 0.000 description 1
- UNLYPPYNDXHGDG-IHRRRGAJSA-N Phe-Gln-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 UNLYPPYNDXHGDG-IHRRRGAJSA-N 0.000 description 1
- MPFGIYLYWUCSJG-AVGNSLFASA-N Phe-Glu-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 MPFGIYLYWUCSJG-AVGNSLFASA-N 0.000 description 1
- BFYHIHGIHGROAT-HTUGSXCWSA-N Phe-Glu-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BFYHIHGIHGROAT-HTUGSXCWSA-N 0.000 description 1
- UAMFZRNCIFFMLE-FHWLQOOXSA-N Phe-Glu-Tyr Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)N UAMFZRNCIFFMLE-FHWLQOOXSA-N 0.000 description 1
- ZLGQEBCCANLYRA-RYUDHWBXSA-N Phe-Gly-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O ZLGQEBCCANLYRA-RYUDHWBXSA-N 0.000 description 1
- FINLZXKJWTYYLC-ACRUOGEOSA-N Phe-His-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1N=CNC=1)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 FINLZXKJWTYYLC-ACRUOGEOSA-N 0.000 description 1
- JQLQUPIYYJXZLJ-ZEWNOJEFSA-N Phe-Ile-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 JQLQUPIYYJXZLJ-ZEWNOJEFSA-N 0.000 description 1
- XMQSOOJRRVEHRO-ULQDDVLXSA-N Phe-Leu-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 XMQSOOJRRVEHRO-ULQDDVLXSA-N 0.000 description 1
- RMKGXGPQIPLTFC-KKUMJFAQSA-N Phe-Lys-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O RMKGXGPQIPLTFC-KKUMJFAQSA-N 0.000 description 1
- WLYPRKLMRIYGPP-JYJNAYRXSA-N Phe-Lys-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=CC=C1 WLYPRKLMRIYGPP-JYJNAYRXSA-N 0.000 description 1
- SCKXGHWQPPURGT-KKUMJFAQSA-N Phe-Lys-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O SCKXGHWQPPURGT-KKUMJFAQSA-N 0.000 description 1
- WKLMCMXFMQEKCX-SLFFLAALSA-N Phe-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CC3=CC=CC=C3)N)C(=O)O WKLMCMXFMQEKCX-SLFFLAALSA-N 0.000 description 1
- QARPMYDMYVLFMW-KKUMJFAQSA-N Phe-Pro-Glu Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CCC(O)=O)C(O)=O)C1=CC=CC=C1 QARPMYDMYVLFMW-KKUMJFAQSA-N 0.000 description 1
- MMJJFXWMCMJMQA-STQMWFEESA-N Phe-Pro-Gly Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)NCC(O)=O)C1=CC=CC=C1 MMJJFXWMCMJMQA-STQMWFEESA-N 0.000 description 1
- ZVRJWDUPIDMHDN-ULQDDVLXSA-N Phe-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC1=CC=CC=C1 ZVRJWDUPIDMHDN-ULQDDVLXSA-N 0.000 description 1
- BPCLGWHVPVTTFM-QWRGUYRKSA-N Phe-Ser-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)NCC(O)=O BPCLGWHVPVTTFM-QWRGUYRKSA-N 0.000 description 1
- AOKZOUGUMLBPSS-PMVMPFDFSA-N Phe-Trp-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(C)C)C(O)=O AOKZOUGUMLBPSS-PMVMPFDFSA-N 0.000 description 1
- GTMSCDVFQLNEOY-BZSNNMDCSA-N Phe-Tyr-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N GTMSCDVFQLNEOY-BZSNNMDCSA-N 0.000 description 1
- DBNGDEAQXGFGRA-ACRUOGEOSA-N Phe-Tyr-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CCCCN)C(=O)O)N DBNGDEAQXGFGRA-ACRUOGEOSA-N 0.000 description 1
- FRMKIPSIZSFTTE-HJOGWXRNSA-N Phe-Tyr-Phe Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O FRMKIPSIZSFTTE-HJOGWXRNSA-N 0.000 description 1
- GOUWCZRDTWTODO-YDHLFZDLSA-N Phe-Val-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O GOUWCZRDTWTODO-YDHLFZDLSA-N 0.000 description 1
- FXEKNHAJIMHRFJ-ULQDDVLXSA-N Phe-Val-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N FXEKNHAJIMHRFJ-ULQDDVLXSA-N 0.000 description 1
- VIIRRNQMMIHYHQ-XHSDSOJGSA-N Phe-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N VIIRRNQMMIHYHQ-XHSDSOJGSA-N 0.000 description 1
- IEIFEYBAYFSRBQ-IHRRRGAJSA-N Phe-Val-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N IEIFEYBAYFSRBQ-IHRRRGAJSA-N 0.000 description 1
- IWNOFCGBMSFTBC-CIUDSAMLSA-N Pro-Ala-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O IWNOFCGBMSFTBC-CIUDSAMLSA-N 0.000 description 1
- KIZQGKLMXKGDIV-BQBZGAKWSA-N Pro-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 KIZQGKLMXKGDIV-BQBZGAKWSA-N 0.000 description 1
- XQLBWXHVZVBNJM-FXQIFTODSA-N Pro-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 XQLBWXHVZVBNJM-FXQIFTODSA-N 0.000 description 1
- HFZNNDWPHBRNPV-KZVJFYERSA-N Pro-Ala-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O HFZNNDWPHBRNPV-KZVJFYERSA-N 0.000 description 1
- LCRSGSIRKLXZMZ-BPNCWPANSA-N Pro-Ala-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O LCRSGSIRKLXZMZ-BPNCWPANSA-N 0.000 description 1
- NHDVNAKDACFHPX-GUBZILKMSA-N Pro-Arg-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O NHDVNAKDACFHPX-GUBZILKMSA-N 0.000 description 1
- CYQQWUPHIZVCNY-GUBZILKMSA-N Pro-Arg-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O CYQQWUPHIZVCNY-GUBZILKMSA-N 0.000 description 1
- SMCHPSMKAFIERP-FXQIFTODSA-N Pro-Asn-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@@H]1CCCN1 SMCHPSMKAFIERP-FXQIFTODSA-N 0.000 description 1
- XROLYVMNVIKVEM-BQBZGAKWSA-N Pro-Asn-Gly Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O XROLYVMNVIKVEM-BQBZGAKWSA-N 0.000 description 1
- TXPUNZXZDVJUJQ-LPEHRKFASA-N Pro-Asn-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)N)C(=O)N2CCC[C@@H]2C(=O)O TXPUNZXZDVJUJQ-LPEHRKFASA-N 0.000 description 1
- SWXSLPHTJVAWDF-VEVYYDQMSA-N Pro-Asn-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SWXSLPHTJVAWDF-VEVYYDQMSA-N 0.000 description 1
- SGCZFWSQERRKBD-BQBZGAKWSA-N Pro-Asp-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(O)=O)NC(=O)[C@@H]1CCCN1 SGCZFWSQERRKBD-BQBZGAKWSA-N 0.000 description 1
- WVOXLKUUVCCCSU-ZPFDUUQYSA-N Pro-Glu-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WVOXLKUUVCCCSU-ZPFDUUQYSA-N 0.000 description 1
- CLNJSLSHKJECME-BQBZGAKWSA-N Pro-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H]1CCCN1 CLNJSLSHKJECME-BQBZGAKWSA-N 0.000 description 1
- FKLSMYYLJHYPHH-UWVGGRQHSA-N Pro-Gly-Leu Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O FKLSMYYLJHYPHH-UWVGGRQHSA-N 0.000 description 1
- JUJGNDZIKKQMDJ-IHRRRGAJSA-N Pro-His-His Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CNC=N1)C(O)=O JUJGNDZIKKQMDJ-IHRRRGAJSA-N 0.000 description 1
- LPGSNRSLPHRNBW-AVGNSLFASA-N Pro-His-Val Chemical compound C([C@@H](C(=O)N[C@@H](C(C)C)C([O-])=O)NC(=O)[C@H]1[NH2+]CCC1)C1=CN=CN1 LPGSNRSLPHRNBW-AVGNSLFASA-N 0.000 description 1
- BBFRBZYKHIKFBX-GMOBBJLQSA-N Pro-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@@H]1CCCN1 BBFRBZYKHIKFBX-GMOBBJLQSA-N 0.000 description 1
- KWMUAKQOVYCQJQ-ZPFDUUQYSA-N Pro-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@@H]1CCCN1 KWMUAKQOVYCQJQ-ZPFDUUQYSA-N 0.000 description 1
- BCNRNJWSRFDPTQ-HJWJTTGWSA-N Pro-Ile-Phe Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O BCNRNJWSRFDPTQ-HJWJTTGWSA-N 0.000 description 1
- AUQGUYPHJSMAKI-CYDGBPFRSA-N Pro-Ile-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H]1CCCN1 AUQGUYPHJSMAKI-CYDGBPFRSA-N 0.000 description 1
- RUDOLGWDSKQQFF-DCAQKATOSA-N Pro-Leu-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O RUDOLGWDSKQQFF-DCAQKATOSA-N 0.000 description 1
- GURGCNUWVSDYTP-SRVKXCTJSA-N Pro-Leu-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O GURGCNUWVSDYTP-SRVKXCTJSA-N 0.000 description 1
- MCWHYUWXVNRXFV-RWMBFGLXSA-N Pro-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 MCWHYUWXVNRXFV-RWMBFGLXSA-N 0.000 description 1
- SRBFGSGDNNQABI-FHWLQOOXSA-N Pro-Leu-Trp Chemical compound N([C@@H](CC(C)C)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(O)=O)C(=O)[C@@H]1CCCN1 SRBFGSGDNNQABI-FHWLQOOXSA-N 0.000 description 1
- SUENWIFTSTWUKD-AVGNSLFASA-N Pro-Leu-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O SUENWIFTSTWUKD-AVGNSLFASA-N 0.000 description 1
- XQPHBAKJJJZOBX-SRVKXCTJSA-N Pro-Lys-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O XQPHBAKJJJZOBX-SRVKXCTJSA-N 0.000 description 1
- AWQGDZBKQTYNMN-IHRRRGAJSA-N Pro-Phe-Asp Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)N[C@@H](CC(=O)O)C(=O)O AWQGDZBKQTYNMN-IHRRRGAJSA-N 0.000 description 1
- GOMUXSCOIWIJFP-GUBZILKMSA-N Pro-Ser-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GOMUXSCOIWIJFP-GUBZILKMSA-N 0.000 description 1
- JDJMFMVVJHLWDP-UNQGMJICSA-N Pro-Thr-Phe Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JDJMFMVVJHLWDP-UNQGMJICSA-N 0.000 description 1
- AIOWVDNPESPXRB-YTWAJWBKSA-N Pro-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2)O AIOWVDNPESPXRB-YTWAJWBKSA-N 0.000 description 1
- VVAWNPIOYXAMAL-KJEVXHAQSA-N Pro-Thr-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O VVAWNPIOYXAMAL-KJEVXHAQSA-N 0.000 description 1
- DMNANGOFEUVBRV-GJZGRUSLSA-N Pro-Trp-Gly Chemical compound N([C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)NCC(=O)O)C(=O)[C@@H]1CCCN1 DMNANGOFEUVBRV-GJZGRUSLSA-N 0.000 description 1
- UCTIUWKCVNGEFH-OBJOEFQTSA-N Pro-Val-Gly-Pro Chemical compound N([C@@H](C(C)C)C(=O)NCC(=O)N1[C@@H](CCC1)C(O)=O)C(=O)[C@@H]1CCCN1 UCTIUWKCVNGEFH-OBJOEFQTSA-N 0.000 description 1
- VDHGTOHMHHQSKG-JYJNAYRXSA-N Pro-Val-Phe Chemical compound CC(C)[C@H](NC(=O)[C@@H]1CCCN1)C(=O)N[C@@H](Cc1ccccc1)C(O)=O VDHGTOHMHHQSKG-JYJNAYRXSA-N 0.000 description 1
- PYMYPHUHKUWMLA-LMVFSUKVSA-N Ribose Natural products OC[C@@H](O)[C@@H](O)[C@@H](O)C=O PYMYPHUHKUWMLA-LMVFSUKVSA-N 0.000 description 1
- 102100035914 S-adenosylmethionine decarboxylase proenzyme Human genes 0.000 description 1
- 108050004491 S-adenosylmethionine decarboxylase proenzyme Proteins 0.000 description 1
- 102100026115 S-adenosylmethionine synthase isoform type-1 Human genes 0.000 description 1
- 108050008511 S-adenosylmethionine synthases Proteins 0.000 description 1
- 102100034187 S-methyl-5'-thioadenosine phosphorylase Human genes 0.000 description 1
- 108091006419 SLC25A12 Proteins 0.000 description 1
- 101150032645 SPE1 gene Proteins 0.000 description 1
- 101150089804 SPE2 gene Proteins 0.000 description 1
- 101100390989 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) FMS1 gene Proteins 0.000 description 1
- 101100409550 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) PRE9 gene Proteins 0.000 description 1
- 101100366397 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) SPE3 gene Proteins 0.000 description 1
- 101100261151 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) TPO5 gene Proteins 0.000 description 1
- 241001146210 Senecio scandens Species 0.000 description 1
- 241001123731 Senecio viridis Species 0.000 description 1
- ZUGXSSFMTXKHJS-ZLUOBGJFSA-N Ser-Ala-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O ZUGXSSFMTXKHJS-ZLUOBGJFSA-N 0.000 description 1
- HRNQLKCLPVKZNE-CIUDSAMLSA-N Ser-Ala-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O HRNQLKCLPVKZNE-CIUDSAMLSA-N 0.000 description 1
- YQHZVYJAGWMHES-ZLUOBGJFSA-N Ser-Ala-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YQHZVYJAGWMHES-ZLUOBGJFSA-N 0.000 description 1
- JPIDMRXXNMIVKY-VZFHVOOUSA-N Ser-Ala-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JPIDMRXXNMIVKY-VZFHVOOUSA-N 0.000 description 1
- RZUOXAKGNHXZTB-GUBZILKMSA-N Ser-Arg-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(O)=O RZUOXAKGNHXZTB-GUBZILKMSA-N 0.000 description 1
- YMEXHZTVKDAKIY-GHCJXIJMSA-N Ser-Asn-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CO)C(O)=O YMEXHZTVKDAKIY-GHCJXIJMSA-N 0.000 description 1
- FIDMVVBUOCMMJG-CIUDSAMLSA-N Ser-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CO FIDMVVBUOCMMJG-CIUDSAMLSA-N 0.000 description 1
- VGNYHOBZJKWRGI-CIUDSAMLSA-N Ser-Asn-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CO VGNYHOBZJKWRGI-CIUDSAMLSA-N 0.000 description 1
- DKKGAAJTDKHWOD-BIIVOSGPSA-N Ser-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CO)N)C(=O)O DKKGAAJTDKHWOD-BIIVOSGPSA-N 0.000 description 1
- KNZQGAUEYZJUSQ-ZLUOBGJFSA-N Ser-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CO)N KNZQGAUEYZJUSQ-ZLUOBGJFSA-N 0.000 description 1
- CNIIKZQXBBQHCX-FXQIFTODSA-N Ser-Asp-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O CNIIKZQXBBQHCX-FXQIFTODSA-N 0.000 description 1
- BNFVPSRLHHPQKS-WHFBIAKZSA-N Ser-Asp-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O BNFVPSRLHHPQKS-WHFBIAKZSA-N 0.000 description 1
- SFZKGGOGCNQPJY-CIUDSAMLSA-N Ser-Asp-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CO)N SFZKGGOGCNQPJY-CIUDSAMLSA-N 0.000 description 1
- QPFJSHSJFIYDJZ-GHCJXIJMSA-N Ser-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CO QPFJSHSJFIYDJZ-GHCJXIJMSA-N 0.000 description 1
- MMAPOBOTRUVNKJ-ZLUOBGJFSA-N Ser-Asp-Ser Chemical compound C([C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CO)N)C(=O)O MMAPOBOTRUVNKJ-ZLUOBGJFSA-N 0.000 description 1
- COLJZWUVZIXSSS-CIUDSAMLSA-N Ser-Cys-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CO)N COLJZWUVZIXSSS-CIUDSAMLSA-N 0.000 description 1
- WKLJLEXEENIYQE-SRVKXCTJSA-N Ser-Cys-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O WKLJLEXEENIYQE-SRVKXCTJSA-N 0.000 description 1
- SQBLRDDJTUJDMV-ACZMJKKPSA-N Ser-Glu-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O SQBLRDDJTUJDMV-ACZMJKKPSA-N 0.000 description 1
- YQQKYAZABFEYAF-FXQIFTODSA-N Ser-Glu-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O YQQKYAZABFEYAF-FXQIFTODSA-N 0.000 description 1
- VQBCMLMPEWPUTB-ACZMJKKPSA-N Ser-Glu-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O VQBCMLMPEWPUTB-ACZMJKKPSA-N 0.000 description 1
- UQFYNFTYDHUIMI-WHFBIAKZSA-N Ser-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CO UQFYNFTYDHUIMI-WHFBIAKZSA-N 0.000 description 1
- WSTIOCFMWXNOCX-YUMQZZPRSA-N Ser-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CO)N WSTIOCFMWXNOCX-YUMQZZPRSA-N 0.000 description 1
- SFTZWNJFZYOLBD-ZDLURKLDSA-N Ser-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CO SFTZWNJFZYOLBD-ZDLURKLDSA-N 0.000 description 1
- XERQKTRGJIKTRB-CIUDSAMLSA-N Ser-His-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](CO)N)CC1=CN=CN1 XERQKTRGJIKTRB-CIUDSAMLSA-N 0.000 description 1
- DLPXTCTVNDTYGJ-JBDRJPRFSA-N Ser-Ile-Cys Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CS)C(O)=O DLPXTCTVNDTYGJ-JBDRJPRFSA-N 0.000 description 1
- MQQBBLVOUUJKLH-HJPIBITLSA-N Ser-Ile-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MQQBBLVOUUJKLH-HJPIBITLSA-N 0.000 description 1
- FUMGHWDRRFCKEP-CIUDSAMLSA-N Ser-Leu-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O FUMGHWDRRFCKEP-CIUDSAMLSA-N 0.000 description 1
- UBRMZSHOOIVJPW-SRVKXCTJSA-N Ser-Leu-Lys Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O UBRMZSHOOIVJPW-SRVKXCTJSA-N 0.000 description 1
- VZQRNAYURWAEFE-KKUMJFAQSA-N Ser-Leu-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 VZQRNAYURWAEFE-KKUMJFAQSA-N 0.000 description 1
- JWOBLHJRDADHLN-KKUMJFAQSA-N Ser-Leu-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JWOBLHJRDADHLN-KKUMJFAQSA-N 0.000 description 1
- IXZHZUGGKLRHJD-DCAQKATOSA-N Ser-Leu-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O IXZHZUGGKLRHJD-DCAQKATOSA-N 0.000 description 1
- CRJZZXMAADSBBQ-SRVKXCTJSA-N Ser-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CO CRJZZXMAADSBBQ-SRVKXCTJSA-N 0.000 description 1
- PMCMLDNPAZUYGI-DCAQKATOSA-N Ser-Lys-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O PMCMLDNPAZUYGI-DCAQKATOSA-N 0.000 description 1
- JJUNLJTUIKFPRF-BPUTZDHNSA-N Ser-Met-Trp Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CO)N JJUNLJTUIKFPRF-BPUTZDHNSA-N 0.000 description 1
- RRVFEDGUXSYWOW-BZSNNMDCSA-N Ser-Phe-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O RRVFEDGUXSYWOW-BZSNNMDCSA-N 0.000 description 1
- BSXKBOUZDAZXHE-CIUDSAMLSA-N Ser-Pro-Glu Chemical compound [H]N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O BSXKBOUZDAZXHE-CIUDSAMLSA-N 0.000 description 1
- QUGRFWPMPVIAPW-IHRRRGAJSA-N Ser-Pro-Phe Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 QUGRFWPMPVIAPW-IHRRRGAJSA-N 0.000 description 1
- HHJFMHQYEAAOBM-ZLUOBGJFSA-N Ser-Ser-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O HHJFMHQYEAAOBM-ZLUOBGJFSA-N 0.000 description 1
- PPCZVWHJWJFTFN-ZLUOBGJFSA-N Ser-Ser-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O PPCZVWHJWJFTFN-ZLUOBGJFSA-N 0.000 description 1
- PYTKULIABVRXSC-BWBBJGPYSA-N Ser-Ser-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PYTKULIABVRXSC-BWBBJGPYSA-N 0.000 description 1
- VGQVAVQWKJLIRM-FXQIFTODSA-N Ser-Ser-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O VGQVAVQWKJLIRM-FXQIFTODSA-N 0.000 description 1
- BDMWLJLPPUCLNV-XGEHTFHBSA-N Ser-Thr-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O BDMWLJLPPUCLNV-XGEHTFHBSA-N 0.000 description 1
- XPVIVVLLLOFBRH-XIRDDKMYSA-N Ser-Trp-Lys Chemical compound NCCCC[C@H](NC(=O)[C@H](Cc1c[nH]c2ccccc12)NC(=O)[C@@H](N)CO)C(O)=O XPVIVVLLLOFBRH-XIRDDKMYSA-N 0.000 description 1
- VAIWUNAAPZZGRI-IHPCNDPISA-N Ser-Trp-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)NC(=O)[C@H](CO)N VAIWUNAAPZZGRI-IHPCNDPISA-N 0.000 description 1
- UBTNVMGPMYDYIU-HJPIBITLSA-N Ser-Tyr-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UBTNVMGPMYDYIU-HJPIBITLSA-N 0.000 description 1
- JZRYFUGREMECBH-XPUUQOCRSA-N Ser-Val-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O JZRYFUGREMECBH-XPUUQOCRSA-N 0.000 description 1
- YEDSOSIKVUMIJE-DCAQKATOSA-N Ser-Val-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O YEDSOSIKVUMIJE-DCAQKATOSA-N 0.000 description 1
- JGUWRQWULDWNCM-FXQIFTODSA-N Ser-Val-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O JGUWRQWULDWNCM-FXQIFTODSA-N 0.000 description 1
- VYPSYNLAJGMNEJ-UHFFFAOYSA-N Silicium dioxide Chemical compound O=[Si]=O VYPSYNLAJGMNEJ-UHFFFAOYSA-N 0.000 description 1
- 238000002105 Southern blotting Methods 0.000 description 1
- 108700005078 Synthetic Genes Proteins 0.000 description 1
- 101710194092 Thiamine-phosphate synthase 1 Proteins 0.000 description 1
- IGROJMCBGRFRGI-YTLHQDLWSA-N Thr-Ala-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O IGROJMCBGRFRGI-YTLHQDLWSA-N 0.000 description 1
- STGXWWBXWXZOER-MBLNEYKQSA-N Thr-Ala-His Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 STGXWWBXWXZOER-MBLNEYKQSA-N 0.000 description 1
- ZUXQFMVPAYGPFJ-JXUBOQSCSA-N Thr-Ala-Lys Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN ZUXQFMVPAYGPFJ-JXUBOQSCSA-N 0.000 description 1
- KEGBFULVYKYJRD-LFSVMHDDSA-N Thr-Ala-Phe Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KEGBFULVYKYJRD-LFSVMHDDSA-N 0.000 description 1
- CAJFZCICSVBOJK-SHGPDSBTSA-N Thr-Ala-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CAJFZCICSVBOJK-SHGPDSBTSA-N 0.000 description 1
- TWLMXDWFVNEFFK-FJXKBIBVSA-N Thr-Arg-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O TWLMXDWFVNEFFK-FJXKBIBVSA-N 0.000 description 1
- MQBTXMPQNCGSSZ-OSUNSFLBSA-N Thr-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)[C@@H](C)O)CCCN=C(N)N MQBTXMPQNCGSSZ-OSUNSFLBSA-N 0.000 description 1
- JHBHMCMKSPXRHV-NUMRIWBASA-N Thr-Asn-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O JHBHMCMKSPXRHV-NUMRIWBASA-N 0.000 description 1
- QGXCWPNQVCYJEL-NUMRIWBASA-N Thr-Asn-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QGXCWPNQVCYJEL-NUMRIWBASA-N 0.000 description 1
- OHAJHDJOCKKJLV-LKXGYXEUSA-N Thr-Asp-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O OHAJHDJOCKKJLV-LKXGYXEUSA-N 0.000 description 1
- VUKVQVNKIIZBPO-HOUAVDHOSA-N Thr-Asp-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N)O VUKVQVNKIIZBPO-HOUAVDHOSA-N 0.000 description 1
- QWMPARMKIDVBLV-VZFHVOOUSA-N Thr-Cys-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](C)C(O)=O QWMPARMKIDVBLV-VZFHVOOUSA-N 0.000 description 1
- OYTNZCBFDXGQGE-XQXXSGGOSA-N Thr-Gln-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](C)C(=O)O)N)O OYTNZCBFDXGQGE-XQXXSGGOSA-N 0.000 description 1
- DIPIPFHFLPTCLK-LOKLDPHHSA-N Thr-Gln-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N)O DIPIPFHFLPTCLK-LOKLDPHHSA-N 0.000 description 1
- KGKWKSSSQGGYAU-SUSMZKCASA-N Thr-Gln-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N)O KGKWKSSSQGGYAU-SUSMZKCASA-N 0.000 description 1
- VGYBYGQXZJDZJU-XQXXSGGOSA-N Thr-Glu-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O VGYBYGQXZJDZJU-XQXXSGGOSA-N 0.000 description 1
- KCRQEJSKXAIULJ-FJXKBIBVSA-N Thr-Gly-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O KCRQEJSKXAIULJ-FJXKBIBVSA-N 0.000 description 1
- YZUWGFXVVZQJEI-PMVVWTBXSA-N Thr-Gly-His Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N)O YZUWGFXVVZQJEI-PMVVWTBXSA-N 0.000 description 1
- CRZNCABIJLRFKZ-IUKAMOBKSA-N Thr-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N CRZNCABIJLRFKZ-IUKAMOBKSA-N 0.000 description 1
- DDDLIMCZFKOERC-SVSWQMSJSA-N Thr-Ile-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N DDDLIMCZFKOERC-SVSWQMSJSA-N 0.000 description 1
- XTCNBOBTROGWMW-RWRJDSDZSA-N Thr-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N XTCNBOBTROGWMW-RWRJDSDZSA-N 0.000 description 1
- IMDMLDSVUSMAEJ-HJGDQZAQSA-N Thr-Leu-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O IMDMLDSVUSMAEJ-HJGDQZAQSA-N 0.000 description 1
- MEJHFIOYJHTWMK-VOAKCMCISA-N Thr-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)[C@@H](C)O MEJHFIOYJHTWMK-VOAKCMCISA-N 0.000 description 1
- NCXVJIQMWSGRHY-KXNHARMFSA-N Thr-Leu-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N)O NCXVJIQMWSGRHY-KXNHARMFSA-N 0.000 description 1
- KZSYAEWQMJEGRZ-RHYQMDGZSA-N Thr-Leu-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O KZSYAEWQMJEGRZ-RHYQMDGZSA-N 0.000 description 1
- CJXURNZYNHCYFD-WDCWCFNPSA-N Thr-Lys-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O CJXURNZYNHCYFD-WDCWCFNPSA-N 0.000 description 1
- KDGBLMDAPJTQIW-RHYQMDGZSA-N Thr-Met-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)O)N)O KDGBLMDAPJTQIW-RHYQMDGZSA-N 0.000 description 1
- KPNSNVTUVKSBFL-ZJDVBMNYSA-N Thr-Met-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N)O KPNSNVTUVKSBFL-ZJDVBMNYSA-N 0.000 description 1
- GUHLYMZJVXUIPO-RCWTZXSCSA-N Thr-Met-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(O)=O GUHLYMZJVXUIPO-RCWTZXSCSA-N 0.000 description 1
- WVVOFCVMHAXGLE-LFSVMHDDSA-N Thr-Phe-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(O)=O WVVOFCVMHAXGLE-LFSVMHDDSA-N 0.000 description 1
- BIBYEFRASCNLAA-CDMKHQONSA-N Thr-Phe-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=CC=C1 BIBYEFRASCNLAA-CDMKHQONSA-N 0.000 description 1
- XHWCDRUPDNSDAZ-XKBZYTNZSA-N Thr-Ser-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N)O XHWCDRUPDNSDAZ-XKBZYTNZSA-N 0.000 description 1
- SGAOHNPSEPVAFP-ZDLURKLDSA-N Thr-Ser-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SGAOHNPSEPVAFP-ZDLURKLDSA-N 0.000 description 1
- VUXIQSUQQYNLJP-XAVMHZPKSA-N Thr-Ser-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N)O VUXIQSUQQYNLJP-XAVMHZPKSA-N 0.000 description 1
- UQCNIMDPYICBTR-KYNKHSRBSA-N Thr-Thr-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O UQCNIMDPYICBTR-KYNKHSRBSA-N 0.000 description 1
- NHQVWACSJZJCGJ-FLBSBUHZSA-N Thr-Thr-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NHQVWACSJZJCGJ-FLBSBUHZSA-N 0.000 description 1
- IJKNKFJZOJCKRR-GBALPHGKSA-N Thr-Trp-Ser Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)[C@H](O)C)C(=O)N[C@@H](CO)C(O)=O)=CNC2=C1 IJKNKFJZOJCKRR-GBALPHGKSA-N 0.000 description 1
- BKIOKSLLAAZYTC-KKHAAJSZSA-N Thr-Val-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O BKIOKSLLAAZYTC-KKHAAJSZSA-N 0.000 description 1
- KPMIQCXJDVKWKO-IFFSRLJSSA-N Thr-Val-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O KPMIQCXJDVKWKO-IFFSRLJSSA-N 0.000 description 1
- ILUOMMDDGREELW-OSUNSFLBSA-N Thr-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)[C@@H](C)O ILUOMMDDGREELW-OSUNSFLBSA-N 0.000 description 1
- BKVICMPZWRNWOC-RHYQMDGZSA-N Thr-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)[C@@H](C)O BKVICMPZWRNWOC-RHYQMDGZSA-N 0.000 description 1
- 108091036066 Three prime untranslated region Proteins 0.000 description 1
- 108090000340 Transaminases Proteins 0.000 description 1
- 102000003929 Transaminases Human genes 0.000 description 1
- 108700009124 Transcription Initiation Site Proteins 0.000 description 1
- 241000242541 Trematoda Species 0.000 description 1
- 241001480014 Trigonopsis Species 0.000 description 1
- BDWDMRSGCXEDMR-WFBYXXMGSA-N Trp-Ala-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N BDWDMRSGCXEDMR-WFBYXXMGSA-N 0.000 description 1
- MQVGIFJSFFVGFW-XEGUGMAKSA-N Trp-Ala-Glu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O MQVGIFJSFFVGFW-XEGUGMAKSA-N 0.000 description 1
- NIWAGRRZHCMPOY-GMVOTWDCSA-N Trp-Ala-Tyr Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N NIWAGRRZHCMPOY-GMVOTWDCSA-N 0.000 description 1
- PXYJUECTGMGIDT-WDSOQIARSA-N Trp-Arg-Leu Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(C)C)C(O)=O)=CNC2=C1 PXYJUECTGMGIDT-WDSOQIARSA-N 0.000 description 1
- XZLHHHYSWIYXHD-XIRDDKMYSA-N Trp-Gln-Arg Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O XZLHHHYSWIYXHD-XIRDDKMYSA-N 0.000 description 1
- YRXXUYPYPHRJPB-RXVVDRJESA-N Trp-Gly-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)NCC(=O)N[C@@H](CC3=CNC4=CC=CC=C43)C(=O)O)N YRXXUYPYPHRJPB-RXVVDRJESA-N 0.000 description 1
- LDMUNXDDIDAPJH-VMBFOHBNSA-N Trp-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N LDMUNXDDIDAPJH-VMBFOHBNSA-N 0.000 description 1
- UJRIVCPPPMYCNA-HOCLYGCPSA-N Trp-Leu-Gly Chemical compound CC(C)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N UJRIVCPPPMYCNA-HOCLYGCPSA-N 0.000 description 1
- NWQCKAPDGQMZQN-IHPCNDPISA-N Trp-Lys-Leu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O NWQCKAPDGQMZQN-IHPCNDPISA-N 0.000 description 1
- WKQNLTQSCYXKQK-VFAJRCTISA-N Trp-Lys-Thr Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WKQNLTQSCYXKQK-VFAJRCTISA-N 0.000 description 1
- FJHXNRKNOXEIIO-OYDLWJJNSA-N Trp-Met-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CC=3C4=CC=CC=C4NC=3)CCSC)C(O)=O)=CNC2=C1 FJHXNRKNOXEIIO-OYDLWJJNSA-N 0.000 description 1
- GIAMKIPJSRZVJB-IHPCNDPISA-N Trp-Phe-Ser Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N GIAMKIPJSRZVJB-IHPCNDPISA-N 0.000 description 1
- JEYRCNVVYHTZMY-SZMVWBNQSA-N Trp-Pro-Val Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O JEYRCNVVYHTZMY-SZMVWBNQSA-N 0.000 description 1
- QHWMVGCEQAPQDK-UMPQAUOISA-N Trp-Thr-Arg Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N)O QHWMVGCEQAPQDK-UMPQAUOISA-N 0.000 description 1
- ZZDFLJFVSNQINX-HWHUXHBOSA-N Trp-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N)O ZZDFLJFVSNQINX-HWHUXHBOSA-N 0.000 description 1
- 241000224557 Trypanosoma brucei brucei Species 0.000 description 1
- VTFWAGGJDRSQFG-MELADBBJSA-N Tyr-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)C(=O)O VTFWAGGJDRSQFG-MELADBBJSA-N 0.000 description 1
- JWHOIHCOHMZSAR-QWRGUYRKSA-N Tyr-Asp-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 JWHOIHCOHMZSAR-QWRGUYRKSA-N 0.000 description 1
- WPVGRKLNHJJCEN-BZSNNMDCSA-N Tyr-Asp-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 WPVGRKLNHJJCEN-BZSNNMDCSA-N 0.000 description 1
- CRHFOYCJGVJPLE-AVGNSLFASA-N Tyr-Gln-Asn Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)O CRHFOYCJGVJPLE-AVGNSLFASA-N 0.000 description 1
- NGALWFGCOMHUSN-AVGNSLFASA-N Tyr-Gln-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O NGALWFGCOMHUSN-AVGNSLFASA-N 0.000 description 1
- CWQZAUYFWRLITN-AVGNSLFASA-N Tyr-Gln-Cys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N)O CWQZAUYFWRLITN-AVGNSLFASA-N 0.000 description 1
- QUILOGWWLXMSAT-IHRRRGAJSA-N Tyr-Gln-Gln Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O QUILOGWWLXMSAT-IHRRRGAJSA-N 0.000 description 1
- WVRUKYLYMFGKAN-IHRRRGAJSA-N Tyr-Glu-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 WVRUKYLYMFGKAN-IHRRRGAJSA-N 0.000 description 1
- JKUZFODWJGEQAP-KBPBESRZSA-N Tyr-Gly-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)O)N)O JKUZFODWJGEQAP-KBPBESRZSA-N 0.000 description 1
- QAYSODICXVZUIA-WLTAIBSBSA-N Tyr-Gly-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O QAYSODICXVZUIA-WLTAIBSBSA-N 0.000 description 1
- KEANSLVUGJADPN-LKTVYLICSA-N Tyr-His-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC2=CC=C(C=C2)O)N KEANSLVUGJADPN-LKTVYLICSA-N 0.000 description 1
- ADECJAKCRKPSOR-ULQDDVLXSA-N Tyr-His-Arg Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N)O ADECJAKCRKPSOR-ULQDDVLXSA-N 0.000 description 1
- OSXNCKRGMSHWSQ-ACRUOGEOSA-N Tyr-His-Tyr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O OSXNCKRGMSHWSQ-ACRUOGEOSA-N 0.000 description 1
- AXWBYOVVDRBOGU-SIUGBPQLSA-N Tyr-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N AXWBYOVVDRBOGU-SIUGBPQLSA-N 0.000 description 1
- YKCXQOBTISTQJD-BZSNNMDCSA-N Tyr-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N YKCXQOBTISTQJD-BZSNNMDCSA-N 0.000 description 1
- DAOREBHZAKCOEN-ULQDDVLXSA-N Tyr-Leu-Met Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(O)=O DAOREBHZAKCOEN-ULQDDVLXSA-N 0.000 description 1
- DMWNPLOERDAHSY-MEYUZBJRSA-N Tyr-Leu-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O DMWNPLOERDAHSY-MEYUZBJRSA-N 0.000 description 1
- OLYXUGBVBGSZDN-ACRUOGEOSA-N Tyr-Leu-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 OLYXUGBVBGSZDN-ACRUOGEOSA-N 0.000 description 1
- VTCKHZJKWQENKX-KBPBESRZSA-N Tyr-Lys-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O VTCKHZJKWQENKX-KBPBESRZSA-N 0.000 description 1
- UBKKNELWDCBNCF-STQMWFEESA-N Tyr-Met-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCSC)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 UBKKNELWDCBNCF-STQMWFEESA-N 0.000 description 1
- ZMKDQRJLMRZHRI-ACRUOGEOSA-N Tyr-Phe-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CC3=CC=C(C=C3)O)N ZMKDQRJLMRZHRI-ACRUOGEOSA-N 0.000 description 1
- YYLHVUCSTXXKBS-IHRRRGAJSA-N Tyr-Pro-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O YYLHVUCSTXXKBS-IHRRRGAJSA-N 0.000 description 1
- QFXVAFIHVWXXBJ-AVGNSLFASA-N Tyr-Ser-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O QFXVAFIHVWXXBJ-AVGNSLFASA-N 0.000 description 1
- UMSZZGTXGKHTFJ-SRVKXCTJSA-N Tyr-Ser-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 UMSZZGTXGKHTFJ-SRVKXCTJSA-N 0.000 description 1
- XUIOBCQESNDTDE-FQPOAREZSA-N Tyr-Thr-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N)O XUIOBCQESNDTDE-FQPOAREZSA-N 0.000 description 1
- NZBSVMQZQMEUHI-WZLNRYEVSA-N Tyr-Thr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N NZBSVMQZQMEUHI-WZLNRYEVSA-N 0.000 description 1
- XFEMMSGONWQACR-KJEVXHAQSA-N Tyr-Thr-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N)O XFEMMSGONWQACR-KJEVXHAQSA-N 0.000 description 1
- AOIZTZRWMSPPAY-KAOXEZKKSA-N Tyr-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)O AOIZTZRWMSPPAY-KAOXEZKKSA-N 0.000 description 1
- KLOZTPOXVVRVAQ-DZKIICNBSA-N Tyr-Val-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 KLOZTPOXVVRVAQ-DZKIICNBSA-N 0.000 description 1
- DRTQHJPVMGBUCF-XVFCMESISA-N Uridine Natural products O[C@@H]1[C@H](O)[C@@H](CO)O[C@H]1N1C(=O)NC(=O)C=C1 DRTQHJPVMGBUCF-XVFCMESISA-N 0.000 description 1
- YFOCMOVJBQDBCE-NRPADANISA-N Val-Ala-Glu Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N YFOCMOVJBQDBCE-NRPADANISA-N 0.000 description 1
- ASQFIHTXXMFENG-XPUUQOCRSA-N Val-Ala-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O ASQFIHTXXMFENG-XPUUQOCRSA-N 0.000 description 1
- RUCNAYOMFXRIKJ-DCAQKATOSA-N Val-Ala-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN RUCNAYOMFXRIKJ-DCAQKATOSA-N 0.000 description 1
- YODDULVCGFQRFZ-ZKWXMUAHSA-N Val-Asp-Ser Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O YODDULVCGFQRFZ-ZKWXMUAHSA-N 0.000 description 1
- OVLIFGQSBSNGHY-KKHAAJSZSA-N Val-Asp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C(C)C)N)O OVLIFGQSBSNGHY-KKHAAJSZSA-N 0.000 description 1
- VXCAZHCVDBQMTP-NRPADANISA-N Val-Cys-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N VXCAZHCVDBQMTP-NRPADANISA-N 0.000 description 1
- IRLYZKKNBFPQBW-XGEHTFHBSA-N Val-Cys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](C(C)C)N)O IRLYZKKNBFPQBW-XGEHTFHBSA-N 0.000 description 1
- VFOHXOLPLACADK-GVXVVHGQSA-N Val-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](C(C)C)N VFOHXOLPLACADK-GVXVVHGQSA-N 0.000 description 1
- VVZDBPBZHLQPPB-XVKPBYJWSA-N Val-Glu-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O VVZDBPBZHLQPPB-XVKPBYJWSA-N 0.000 description 1
- ZXAGTABZUOMUDO-GVXVVHGQSA-N Val-Glu-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N ZXAGTABZUOMUDO-GVXVVHGQSA-N 0.000 description 1
- FOADDSDHGRFUOC-DZKIICNBSA-N Val-Glu-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N FOADDSDHGRFUOC-DZKIICNBSA-N 0.000 description 1
- WDIGUPHXPBMODF-UMNHJUIQSA-N Val-Glu-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N WDIGUPHXPBMODF-UMNHJUIQSA-N 0.000 description 1
- UEHRGZCNLSWGHK-DLOVCJGASA-N Val-Glu-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O UEHRGZCNLSWGHK-DLOVCJGASA-N 0.000 description 1
- CHWRZUGUMAMTFC-IHRRRGAJSA-N Val-His-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC1=CNC=N1 CHWRZUGUMAMTFC-IHRRRGAJSA-N 0.000 description 1
- JPPXDMBGXJBTIB-ULQDDVLXSA-N Val-His-Tyr Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)N JPPXDMBGXJBTIB-ULQDDVLXSA-N 0.000 description 1
- BZMIYHIJVVJPCK-QSFUFRPTSA-N Val-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N BZMIYHIJVVJPCK-QSFUFRPTSA-N 0.000 description 1
- VHRLUTIMTDOVCG-PEDHHIEDSA-N Val-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)NC(=O)[C@H](C(C)C)N VHRLUTIMTDOVCG-PEDHHIEDSA-N 0.000 description 1
- SDUBQHUJJWQTEU-XUXIUFHCSA-N Val-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](C(C)C)N SDUBQHUJJWQTEU-XUXIUFHCSA-N 0.000 description 1
- BMOFUVHDBROBSE-DCAQKATOSA-N Val-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](C(C)C)N BMOFUVHDBROBSE-DCAQKATOSA-N 0.000 description 1
- SJLVYVZBFDTRCG-DCAQKATOSA-N Val-Lys-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CS)C(=O)O)N SJLVYVZBFDTRCG-DCAQKATOSA-N 0.000 description 1
- WBAJDGWKRIHOAC-GVXVVHGQSA-N Val-Lys-Gln Chemical compound [H]N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O WBAJDGWKRIHOAC-GVXVVHGQSA-N 0.000 description 1
- DIOSYUIWOQCXNR-ONGXEEELSA-N Val-Lys-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O DIOSYUIWOQCXNR-ONGXEEELSA-N 0.000 description 1
- YMTOEGGOCHVGEH-IHRRRGAJSA-N Val-Lys-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O YMTOEGGOCHVGEH-IHRRRGAJSA-N 0.000 description 1
- CXWJFWAZIVWBOS-XQQFMLRXSA-N Val-Lys-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(=O)O)N CXWJFWAZIVWBOS-XQQFMLRXSA-N 0.000 description 1
- VPGCVZRRBYOGCD-AVGNSLFASA-N Val-Lys-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O VPGCVZRRBYOGCD-AVGNSLFASA-N 0.000 description 1
- MBGFDZDWMDLXHQ-GUBZILKMSA-N Val-Met-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](C(C)C)N MBGFDZDWMDLXHQ-GUBZILKMSA-N 0.000 description 1
- RSGHLMMKXJGCMK-JYJNAYRXSA-N Val-Met-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N RSGHLMMKXJGCMK-JYJNAYRXSA-N 0.000 description 1
- VNGKMNPAENRGDC-JYJNAYRXSA-N Val-Phe-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC1=CC=CC=C1 VNGKMNPAENRGDC-JYJNAYRXSA-N 0.000 description 1
- HPOSMQWRPMRMFO-GUBZILKMSA-N Val-Pro-Cys Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CS)C(=O)O)N HPOSMQWRPMRMFO-GUBZILKMSA-N 0.000 description 1
- SSYBNWFXCFNRFN-GUBZILKMSA-N Val-Pro-Ser Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O SSYBNWFXCFNRFN-GUBZILKMSA-N 0.000 description 1
- MIKHIIQMRFYVOR-RCWTZXSCSA-N Val-Pro-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C(C)C)N)O MIKHIIQMRFYVOR-RCWTZXSCSA-N 0.000 description 1
- DEGUERSKQBRZMZ-FXQIFTODSA-N Val-Ser-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O DEGUERSKQBRZMZ-FXQIFTODSA-N 0.000 description 1
- UGFMVXRXULGLNO-XPUUQOCRSA-N Val-Ser-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O UGFMVXRXULGLNO-XPUUQOCRSA-N 0.000 description 1
- UJMCYJKPDFQLHX-XGEHTFHBSA-N Val-Ser-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](C(C)C)N)O UJMCYJKPDFQLHX-XGEHTFHBSA-N 0.000 description 1
- SDHZOOIGIUEPDY-JYJNAYRXSA-N Val-Ser-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CO)NC(=O)[C@@H](N)C(C)C)C(O)=O)=CNC2=C1 SDHZOOIGIUEPDY-JYJNAYRXSA-N 0.000 description 1
- UVHFONIHVHLDDQ-IFFSRLJSSA-N Val-Thr-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N)O UVHFONIHVHLDDQ-IFFSRLJSSA-N 0.000 description 1
- PDDJTOSAVNRJRH-UNQGMJICSA-N Val-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](C(C)C)N)O PDDJTOSAVNRJRH-UNQGMJICSA-N 0.000 description 1
- PGBMPFKFKXYROZ-UFYCRDLUSA-N Val-Tyr-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)O)N PGBMPFKFKXYROZ-UFYCRDLUSA-N 0.000 description 1
- DFQZDQPLWBSFEJ-LSJOCFKGSA-N Val-Val-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(=O)N)C(=O)O)N DFQZDQPLWBSFEJ-LSJOCFKGSA-N 0.000 description 1
- AEFJNECXZCODJM-UWVGGRQHSA-N Val-Val-Gly Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](C(C)C)C(=O)NCC([O-])=O AEFJNECXZCODJM-UWVGGRQHSA-N 0.000 description 1
- WBPFYNYTYASCQP-CYDGBPFRSA-N Val-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](C(C)C)N WBPFYNYTYASCQP-CYDGBPFRSA-N 0.000 description 1
- 206010047505 Visceral leishmaniasis Diseases 0.000 description 1
- 238000009825 accumulation Methods 0.000 description 1
- 108010081404 acein-2 Proteins 0.000 description 1
- 230000002378 acidificating effect Effects 0.000 description 1
- 230000009471 action Effects 0.000 description 1
- 230000010933 acylation Effects 0.000 description 1
- 238000005917 acylation reaction Methods 0.000 description 1
- 239000000654 additive Substances 0.000 description 1
- 229960001570 ademetionine Drugs 0.000 description 1
- 239000008272 agar Substances 0.000 description 1
- 239000003905 agrochemical Substances 0.000 description 1
- 108010070944 alanylhistidine Proteins 0.000 description 1
- 108010011559 alanylphenylalanine Proteins 0.000 description 1
- 108010070783 alanyltyrosine Proteins 0.000 description 1
- HMFHBZSHGGEWLO-UHFFFAOYSA-N alpha-D-Furanose-Ribose Natural products OCC1OC(O)C(O)C1O HMFHBZSHGGEWLO-UHFFFAOYSA-N 0.000 description 1
- 230000004075 alteration Effects 0.000 description 1
- 238000007112 amidation reaction Methods 0.000 description 1
- 150000001412 amines Chemical class 0.000 description 1
- 125000000539 amino acid group Chemical group 0.000 description 1
- 125000003277 amino group Chemical group 0.000 description 1
- 108010052670 arginyl-glutamyl-glutamic acid Proteins 0.000 description 1
- 108010072041 arginyl-glycyl-aspartic acid Proteins 0.000 description 1
- 108010060035 arginylproline Proteins 0.000 description 1
- 108010010430 asparagine-proline-alanine Proteins 0.000 description 1
- 235000003704 aspartic acid Nutrition 0.000 description 1
- 108010093581 aspartyl-proline Proteins 0.000 description 1
- 108010092854 aspartyllysine Proteins 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 230000008901 benefit Effects 0.000 description 1
- DRTQHJPVMGBUCF-PSQAKQOGSA-N beta-L-uridine Natural products O[C@H]1[C@@H](O)[C@H](CO)O[C@@H]1N1C(=O)NC(=O)C=C1 DRTQHJPVMGBUCF-PSQAKQOGSA-N 0.000 description 1
- OQFSQFPPLPISGP-UHFFFAOYSA-N beta-carboxyaspartic acid Natural products OC(=O)C(N)C(C(O)=O)C(O)=O OQFSQFPPLPISGP-UHFFFAOYSA-N 0.000 description 1
- 238000005842 biochemical reaction Methods 0.000 description 1
- 230000004071 biological effect Effects 0.000 description 1
- 230000006696 biosynthetic metabolic pathway Effects 0.000 description 1
- 238000004422 calculation algorithm Methods 0.000 description 1
- 238000004364 calculation method Methods 0.000 description 1
- 125000003178 carboxy group Chemical group [H]OC(*)=O 0.000 description 1
- 101150038500 cas9 gene Proteins 0.000 description 1
- 230000015556 catabolic process Effects 0.000 description 1
- 230000003197 catalytic effect Effects 0.000 description 1
- 230000002759 chromosomal effect Effects 0.000 description 1
- 210000000349 chromosome Anatomy 0.000 description 1
- 238000005094 computer simulation Methods 0.000 description 1
- 230000001276 controlling effect Effects 0.000 description 1
- 108010016616 cysteinylglycine Proteins 0.000 description 1
- 108010069495 cysteinyltyrosine Proteins 0.000 description 1
- UHDGCWIWMRVCDJ-ZAKLUEHWSA-N cytidine Chemical compound O=C1N=C(N)C=CN1[C@H]1[C@H](O)[C@@H](O)[C@H](CO)O1 UHDGCWIWMRVCDJ-ZAKLUEHWSA-N 0.000 description 1
- 230000003013 cytotoxicity Effects 0.000 description 1
- 231100000135 cytotoxicity Toxicity 0.000 description 1
- 230000003247 decreasing effect Effects 0.000 description 1
- 238000006731 degradation reaction Methods 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- FSXRLASFHBWESK-UHFFFAOYSA-N dipeptide phenylalanyl-tyrosine Natural products C=1C=C(O)C=CC=1CC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FSXRLASFHBWESK-UHFFFAOYSA-N 0.000 description 1
- 108010054812 diprotin A Proteins 0.000 description 1
- 108010054813 diprotin B Proteins 0.000 description 1
- 229940000406 drug candidate Drugs 0.000 description 1
- 238000000132 electrospray ionisation Methods 0.000 description 1
- 238000010828 elution Methods 0.000 description 1
- 150000002085 enols Chemical class 0.000 description 1
- 238000006911 enzymatic reaction Methods 0.000 description 1
- 230000005284 excitation Effects 0.000 description 1
- 239000013604 expression vector Substances 0.000 description 1
- 230000008124 floral development Effects 0.000 description 1
- 230000004907 flux Effects 0.000 description 1
- 235000012041 food component Nutrition 0.000 description 1
- 239000005417 food ingredient Substances 0.000 description 1
- 239000000446 fuel Substances 0.000 description 1
- 108020001507 fusion proteins Proteins 0.000 description 1
- 102000037865 fusion proteins Human genes 0.000 description 1
- 230000006251 gamma-carboxylation Effects 0.000 description 1
- 108010006664 gamma-glutamyl-glycyl-glycine Proteins 0.000 description 1
- 108010063718 gamma-glutamylaspartic acid Proteins 0.000 description 1
- 238000001502 gel electrophoresis Methods 0.000 description 1
- 238000003209 gene knockout Methods 0.000 description 1
- 238000003208 gene overexpression Methods 0.000 description 1
- 238000010353 genetic engineering Methods 0.000 description 1
- 238000010362 genome editing Methods 0.000 description 1
- 108010050322 glutamate acetyltransferase Proteins 0.000 description 1
- 108010080575 glutamyl-aspartyl-alanine Proteins 0.000 description 1
- 230000013595 glycosylation Effects 0.000 description 1
- 238000006206 glycosylation reaction Methods 0.000 description 1
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 1
- 108010000434 glycyl-alanyl-leucine Proteins 0.000 description 1
- 108010075431 glycyl-alanyl-phenylalanine Proteins 0.000 description 1
- 108010072405 glycyl-aspartyl-glycine Proteins 0.000 description 1
- 108010067216 glycyl-glycyl-glycine Proteins 0.000 description 1
- 108010051307 glycyl-glycyl-proline Proteins 0.000 description 1
- 108010010096 glycyl-glycyl-tyrosine Proteins 0.000 description 1
- 108010077435 glycyl-phenylalanyl-glycine Proteins 0.000 description 1
- 108010082286 glycyl-seryl-alanine Proteins 0.000 description 1
- 108010008671 glycyl-tryptophyl-methionine Proteins 0.000 description 1
- 108010059898 glycyl-tyrosyl-lysine Proteins 0.000 description 1
- 108010010147 glycylglutamine Proteins 0.000 description 1
- 108010020688 glycylhistidine Proteins 0.000 description 1
- 108010077515 glycylproline Proteins 0.000 description 1
- 108010084389 glycyltryptophan Proteins 0.000 description 1
- 230000009643 growth defect Effects 0.000 description 1
- HNDVDQJCIGZPNO-UHFFFAOYSA-N histidine Natural products OC(=O)C(N)CC1=CN=CN1 HNDVDQJCIGZPNO-UHFFFAOYSA-N 0.000 description 1
- 108010050343 histidyl-alanyl-glutamine Proteins 0.000 description 1
- 108010036413 histidylglycine Proteins 0.000 description 1
- 108010028295 histidylhistidine Proteins 0.000 description 1
- 108010025306 histidylleucine Proteins 0.000 description 1
- 108010092114 histidylphenylalanine Proteins 0.000 description 1
- 108010085325 histidylproline Proteins 0.000 description 1
- 208000029080 human African trypanosomiasis Diseases 0.000 description 1
- 230000033444 hydroxylation Effects 0.000 description 1
- 238000005805 hydroxylation reaction Methods 0.000 description 1
- 230000006872 improvement Effects 0.000 description 1
- 238000000338 in vitro Methods 0.000 description 1
- 238000002347 injection Methods 0.000 description 1
- 239000007924 injection Substances 0.000 description 1
- 229960003786 inosine Drugs 0.000 description 1
- 108010027338 isoleucylcysteine Proteins 0.000 description 1
- 150000002611 lead compounds Chemical class 0.000 description 1
- 108010073093 leucyl-glycyl-glycyl-glycine Proteins 0.000 description 1
- 108010051673 leucyl-glycyl-phenylalanine Proteins 0.000 description 1
- 108010012058 leucyltyrosine Proteins 0.000 description 1
- 150000002632 lipids Chemical class 0.000 description 1
- 230000004807 localization Effects 0.000 description 1
- 108010043322 lysyl-tryptophyl-alpha-lysine Proteins 0.000 description 1
- 108010054155 lysyllysine Proteins 0.000 description 1
- 108010038320 lysylphenylalanine Proteins 0.000 description 1
- 229910052943 magnesium sulfate Inorganic materials 0.000 description 1
- 229940049920 malate Drugs 0.000 description 1
- BJEPYKJPYRNKOW-UHFFFAOYSA-L malate(2-) Chemical compound [O-]C(=O)C(O)CC([O-])=O BJEPYKJPYRNKOW-UHFFFAOYSA-L 0.000 description 1
- 238000005259 measurement Methods 0.000 description 1
- 230000007246 mechanism Effects 0.000 description 1
- 238000002844 melting Methods 0.000 description 1
- 230000008018 melting Effects 0.000 description 1
- 230000037353 metabolic pathway Effects 0.000 description 1
- 229910052751 metal Inorganic materials 0.000 description 1
- 239000002184 metal Substances 0.000 description 1
- 150000002739 metals Chemical class 0.000 description 1
- 108010016686 methionyl-alanyl-serine Proteins 0.000 description 1
- 108010056582 methionylglutamic acid Proteins 0.000 description 1
- 108010085203 methionylmethionine Proteins 0.000 description 1
- 108010034507 methionyltryptophan Proteins 0.000 description 1
- 108091070501 miRNA Proteins 0.000 description 1
- 239000003068 molecular probe Substances 0.000 description 1
- 229920001778 nylon Polymers 0.000 description 1
- 229940046166 oligodeoxynucleotide Drugs 0.000 description 1
- 239000002751 oligonucleotide probe Substances 0.000 description 1
- 238000005457 optimization Methods 0.000 description 1
- 150000002894 organic compounds Chemical class 0.000 description 1
- 230000008520 organization Effects 0.000 description 1
- 244000052769 pathogen Species 0.000 description 1
- 230000001717 pathogenic effect Effects 0.000 description 1
- 230000010412 perfusion Effects 0.000 description 1
- 108010024607 phenylalanylalanine Proteins 0.000 description 1
- 108010073025 phenylalanylphenylalanine Proteins 0.000 description 1
- 150000004713 phosphodiesters Chemical group 0.000 description 1
- 229910052698 phosphorus Inorganic materials 0.000 description 1
- 229930000223 plant secondary metabolite Natural products 0.000 description 1
- 239000013600 plasmid vector Substances 0.000 description 1
- 230000008119 pollen development Effects 0.000 description 1
- 230000001323 posttranslational effect Effects 0.000 description 1
- 230000003389 potentiating effect Effects 0.000 description 1
- 239000002243 precursor Substances 0.000 description 1
- 125000002924 primary amino group Chemical group [H]N([H])* 0.000 description 1
- 238000012545 processing Methods 0.000 description 1
- 108010020755 prolyl-glycyl-glycine Proteins 0.000 description 1
- 108010031719 prolyl-serine Proteins 0.000 description 1
- AOHJOMMDDJHIJH-UHFFFAOYSA-N propylenediamine Chemical compound CC(N)CN AOHJOMMDDJHIJH-UHFFFAOYSA-N 0.000 description 1
- 238000010926 purge Methods 0.000 description 1
- 238000000746 purification Methods 0.000 description 1
- ADRDEXBBJTUCND-UHFFFAOYSA-N pyrrolizidine Chemical class C1CCN2CCCC21 ADRDEXBBJTUCND-UHFFFAOYSA-N 0.000 description 1
- 229930002356 pyrrolizidine alkaloid Natural products 0.000 description 1
- 239000011541 reaction mixture Substances 0.000 description 1
- 108091092562 ribozyme Proteins 0.000 description 1
- 229920006395 saturated elastomer Polymers 0.000 description 1
- 238000012216 screening Methods 0.000 description 1
- 230000024053 secondary metabolic process Effects 0.000 description 1
- 230000008117 seed development Effects 0.000 description 1
- 230000035945 sensitivity Effects 0.000 description 1
- 238000012163 sequencing technique Methods 0.000 description 1
- 108010048818 seryl-histidine Proteins 0.000 description 1
- 108010048397 seryl-lysyl-leucine Proteins 0.000 description 1
- 108010071207 serylmethionine Proteins 0.000 description 1
- 201000002612 sleeping sickness Diseases 0.000 description 1
- 239000002904 solvent Substances 0.000 description 1
- 239000007921 spray Substances 0.000 description 1
- 238000010561 standard procedure Methods 0.000 description 1
- 230000019635 sulfation Effects 0.000 description 1
- 238000005670 sulfation reaction Methods 0.000 description 1
- 239000013589 supplement Substances 0.000 description 1
- 230000002194 synthesizing effect Effects 0.000 description 1
- AGGKEGLBGGJEBZ-UHFFFAOYSA-N tetramethylenedisulfotetramine Chemical compound C1N(S2(=O)=O)CN3S(=O)(=O)N1CN2C3 AGGKEGLBGGJEBZ-UHFFFAOYSA-N 0.000 description 1
- 150000003573 thiols Chemical class 0.000 description 1
- 108010031491 threonyl-lysyl-glutamic acid Proteins 0.000 description 1
- 230000005030 transcription termination Effects 0.000 description 1
- 238000012546 transfer Methods 0.000 description 1
- 238000000844 transformation Methods 0.000 description 1
- 238000011426 transformation method Methods 0.000 description 1
- HRXKRNGNAMMEHJ-UHFFFAOYSA-K trisodium citrate Chemical compound [Na+].[Na+].[Na+].[O-]C(=O)CC(O)(CC([O-])=O)C([O-])=O HRXKRNGNAMMEHJ-UHFFFAOYSA-K 0.000 description 1
- 229940038773 trisodium citrate Drugs 0.000 description 1
- 108010044292 tryptophyltyrosine Proteins 0.000 description 1
- 108010051110 tyrosyl-lysine Proteins 0.000 description 1
- 108010003137 tyrosyltyrosine Proteins 0.000 description 1
- 238000001195 ultra high performance liquid chromatography Methods 0.000 description 1
- 238000011144 upstream manufacturing Methods 0.000 description 1
- 229940035893 uracil Drugs 0.000 description 1
- DRTQHJPVMGBUCF-UHFFFAOYSA-N uracil arabinoside Natural products OC1C(O)C(CO)OC1N1C(=O)NC(=O)C=C1 DRTQHJPVMGBUCF-UHFFFAOYSA-N 0.000 description 1
- 229940045145 uridine Drugs 0.000 description 1
- IBIDRSSEHFLGSD-UHFFFAOYSA-N valinyl-arginine Natural products CC(C)C(N)C(=O)NC(C(O)=O)CCCN=C(N)N IBIDRSSEHFLGSD-UHFFFAOYSA-N 0.000 description 1
- 239000013598 vector Substances 0.000 description 1
- 229930003231 vitamin Natural products 0.000 description 1
- 239000011782 vitamin Substances 0.000 description 1
- 235000013343 vitamin Nutrition 0.000 description 1
- 229940088594 vitamin Drugs 0.000 description 1
- 150000003722 vitamin derivatives Chemical class 0.000 description 1
- 238000001262 western blot Methods 0.000 description 1
- 229910052727 yttrium Inorganic materials 0.000 description 1
Images
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/93—Ligases (6)
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K5/00—Peptides containing up to four amino acids in a fully defined sequence; Derivatives thereof
- C07K5/02—Peptides containing up to four amino acids in a fully defined sequence; Derivatives thereof containing at least one abnormal peptide link
- C07K5/0215—Peptides containing up to four amino acids in a fully defined sequence; Derivatives thereof containing at least one abnormal peptide link containing natural amino acids, forming a peptide bond via their side chain functional group, e.g. epsilon-Lys, gamma-Glu
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N1/00—Microorganisms, e.g. protozoa; Compositions thereof; Processes of propagating, maintaining or preserving microorganisms or compositions thereof; Processes of preparing or isolating a composition containing a microorganism; Culture media therefor
- C12N1/14—Fungi; Culture media therefor
- C12N1/16—Yeasts; Culture media therefor
- C12N1/165—Yeast isolates
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/11—DNA or RNA fragments; Modified forms thereof; Non-coding nucleic acids having a biological activity
- C12N15/52—Genes encoding for enzymes or proenzymes
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/80—Vectors or expression systems specially adapted for eukaryotic hosts for fungi
- C12N15/81—Vectors or expression systems specially adapted for eukaryotic hosts for fungi for yeasts
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/0004—Oxidoreductases (1.)
- C12N9/0012—Oxidoreductases (1.) acting on nitrogen containing compounds as donors (1.4, 1.5, 1.6, 1.7)
- C12N9/0026—Oxidoreductases (1.) acting on nitrogen containing compounds as donors (1.4, 1.5, 1.6, 1.7) acting on CH-NH groups of donors (1.5)
- C12N9/0032—Oxidoreductases (1.) acting on nitrogen containing compounds as donors (1.4, 1.5, 1.6, 1.7) acting on CH-NH groups of donors (1.5) with oxygen as acceptor (1.5.3)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/10—Transferases (2.)
- C12N9/1085—Transferases (2.) transferring alkyl or aryl groups other than methyl groups (2.5)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P21/00—Preparation of peptides or proteins
- C12P21/02—Preparation of peptides or proteins having a known sequence of two or more amino acids, e.g. glutathione
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y105/00—Oxidoreductases acting on the CH-NH group of donors (1.5)
- C12Y105/03—Oxidoreductases acting on the CH-NH group of donors (1.5) with oxygen as acceptor (1.5.3)
- C12Y105/03016—Spermine oxidase (1.5.3.16)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y205/00—Transferases transferring alkyl or aryl groups, other than methyl groups (2.5)
- C12Y205/01—Transferases transferring alkyl or aryl groups, other than methyl groups (2.5) transferring alkyl or aryl groups, other than methyl groups (2.5.1)
- C12Y205/01022—Spermine synthase (2.5.1.22)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y205/00—Transferases transferring alkyl or aryl groups, other than methyl groups (2.5)
- C12Y205/01—Transferases transferring alkyl or aryl groups, other than methyl groups (2.5) transferring alkyl or aryl groups, other than methyl groups (2.5.1)
- C12Y205/01044—Homospermidine synthase (2.5.1.44)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y205/00—Transferases transferring alkyl or aryl groups, other than methyl groups (2.5)
- C12Y205/01—Transferases transferring alkyl or aryl groups, other than methyl groups (2.5) transferring alkyl or aryl groups, other than methyl groups (2.5.1)
- C12Y205/01079—Thermospermine synthase (2.5.1.79)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y603/00—Ligases forming carbon-nitrogen bonds (6.3)
- C12Y603/01—Acid-ammonia (or amine)ligases (amide synthases)(6.3.1)
- C12Y603/01008—Glutathionylspermidine synthase (6.3.1.8)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y603/00—Ligases forming carbon-nitrogen bonds (6.3)
- C12Y603/01—Acid-ammonia (or amine)ligases (amide synthases)(6.3.1)
- C12Y603/01009—Trypanothione synthase (6.3.1.9)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y105/00—Oxidoreductases acting on the CH-NH group of donors (1.5)
- C12Y105/03—Oxidoreductases acting on the CH-NH group of donors (1.5) with oxygen as acceptor (1.5.3)
- C12Y105/03017—Non-specific polyamine oxidase (1.5.3.17)
Landscapes
- Life Sciences & Earth Sciences (AREA)
- Health & Medical Sciences (AREA)
- Chemical & Material Sciences (AREA)
- Organic Chemistry (AREA)
- Genetics & Genomics (AREA)
- Engineering & Computer Science (AREA)
- Wood Science & Technology (AREA)
- Zoology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- General Engineering & Computer Science (AREA)
- Biochemistry (AREA)
- General Health & Medical Sciences (AREA)
- Biotechnology (AREA)
- Biomedical Technology (AREA)
- Molecular Biology (AREA)
- Microbiology (AREA)
- Medicinal Chemistry (AREA)
- Mycology (AREA)
- Biophysics (AREA)
- Physics & Mathematics (AREA)
- Plant Pathology (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- General Chemical & Material Sciences (AREA)
- Chemical Kinetics & Catalysis (AREA)
- Botany (AREA)
- Tropical Medicine & Parasitology (AREA)
- Virology (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
- Preparation Of Compounds By Using Micro-Organisms (AREA)
- Enzymes And Modification Thereof (AREA)
Abstract
本发明涉及在能够生产至少一种多胺的酵母细胞中的多胺缀合物的生产。该酵母细胞还包含多胺:谷胱甘肽连接酶编码基因和至少一种多胺合酶编码基因,但是缺少多胺氧化酶编码基因或包含扰乱的多胺氧化酶编码基因。该酵母细胞能够生产各种多胺‑谷胱甘肽缀合物。
Description
技术领域
本发明总体涉及遗传工程化酵母,并且特别涉及能够生产多胺缀合物的这类酵母。
背景技术
需要具有新颖作用机制的小分子来帮助处理当今最具挑战性的生物医学问题和农业问题。然而,几十年间,由目标是识别用于药物和农用化学品开发的先导化合物的合成化学产生的超过1.35亿的小分子,尽管通过了初始筛选,但是被逐步淘汰。显然,改善小分子的复杂性和多样性是必要的。实际上,在自然界中多样的化合物,即具有复杂结构的天然产物和它们的衍生物是容易得到的,数千年来其在疾病的研究和治疗中发挥了重要作用。特别地,多胺缀合物是一组多样并且数量上主要的植物次级代谢物,其展示出广阔的结构多样性和复杂性。由此,已经报道多项对这些化合物的有益性质的研究。例如,已经提议多胺-谷胱甘肽缀合物,也被称为锥虫胱甘肽(trypanothione),作为药物化学中的特异性分子探针和用于谷胱甘肽还原酶抑制剂的开发。
遗憾的是,由于其结构复杂性和在自然界中的低丰度,从传统的合成化学或自天然来源的提取均难以获得多胺类似物。已经寻求在快速生长、遗传上易处理的物种中基于微生物的生产作为天然产物及它们的衍生物的传统供应链的替代方案。特别地,面包师的酵母酿酒酵母(Saccharomyces cerevisiae)已经用做细胞工厂用于生产许多不同的燃料、化学品、食品配料和医药品,特别是为了它的天然产物生产。
因此,期望开发用于多胺缀合物的发现和生产的新方法。
发明内容
提供能够生产多胺缀合物的酵母细胞是总体的目的。
通过实施方案满足该目的以及其他目的。
在独立权利要求中定义了本发明。在从属权利要求中定义了本发明进一步的实施方案。
本发明涉及能够生产谷胱甘肽-多胺缀合物的酵母细胞。该酵母细胞能够生产至少一种多胺。该酵母细胞还包含多胺:谷胱甘肽连接酶编码基因和至少一种多胺合酶编码基因,但是缺少多胺氧化酶编码基因或包含扰乱的多胺氧化酶编码基因。
本发明还涉及生产谷胱甘肽-多胺缀合物的方法。该方法包含在培养基中并且在适合于通过根据本发明的酵母细胞生产谷胱甘肽-多胺缀合物的培养条件下培养所述酵母细胞。该方法还包含从培养基和/或从酵母细胞中收集谷胱甘肽-多胺缀合物。
本发明提供用于各种谷胱甘肽-多胺缀合物的生产的有效手段,所述谷胱甘肽-多胺缀合物包括单取代和/或多取代多胺,诸如锥虫胱甘肽。因此本发明可以用作涉及传统的合成化学或自天然来源的提取以获得多胺缀合物的现有技术方法的有成本效益的替代方案。
附图说明
通过参考以下与附图一起的描述可以最好地理解实施方案,连同其进一步的目的和优点,其中:
图1a至1f说明用于亚精胺和高级多胺(higher-polyamine)过量合成的工程酵母代谢。
图2a和2b说明酵母中锥虫胱甘肽-(SH)2的生产。
图3a和3b说明用于酵母中复杂的酚酰胺和锥虫胱甘肽-(SH)2的生物合成的工程化途径。
具体实施方案
为了使得能够有效获取多胺缀合物的多样性,我们工程化了酵母代谢以过量生产一类复杂的多胺,例如亚精胺、高亚精胺(homo-spermidine)、热精胺(thermospermine)和精胺。通过具有按需定制途径(tailoring pathway)的多样的多胺缀合物的生物合成证明该酵母平台的多用性。具体地,由计算机模拟辅助,我们系统地重构了酵母中心碳代谢和氮代谢、甲硫氨酸补救途径、腺嘌呤的补救途径、多胺转运机器和多胺降解途径,由此使得酵母能够在深孔(deep-well)规模发酵中生产>400 mg/l的亚精胺。此外,通过插入按需定制途径以及产生合成聚生体,我们证明了酵母中多胺缀合物(包括锥虫胱甘肽)的从头生物合成。
现在将参考其中展示本发明的实施方案的附图和实施例在下文中描述本发明。该描述并非旨在成为可以实施本发明的所有不同方式或可以添加至本发明的所有特征的详细目录。例如,关于一个实施方案说明的特征可以并入到其他实施方案中,并且关于特定的实施方案说明的特征可以从该实施方案中删除。因此,本发明设想在本发明的一些实施方案中,可以排除或省略载于本文的任何特征或特征的组合。此外,鉴于本公开,本文提出的对各种实施方案的很多变化和添加对于本领域技术人员将是显而易见的,它们不背离本发明。因此,以下描述旨在说明本发明的一些特定的实施方案,并且并非旨在详尽地指定其所有的排列、组合和变化。
除非本文另有定义,本文所使用的科学术语和技术术语将具有本领域普通的技术人员通常所理解的含义。
一般地,本文所述的与生物化学、酶学、分子和细胞生物学、微生物学、遗传学以及蛋白质和核酸化学以及杂交的技术有关使用的名词是在本领域众所周知且通常使用的那些。
本文提及的常规方法和技术在,例如Molecular Cloning, a laboratory manual[second edition] Sambrook et al. Cold Spring Harbor Laboratory, 1989中,例如,在其第1.21章节"Extraction And Purification Of Plasmid DNA"、第1.53章节"Strategies For Cloning In Plasmid Vectors"、第1.85章节"Identification OfBacterial Colonies That Contain Recombinant Plasmids"、第6章节"GelElectrophoresis Of DNA"、第14章节 "In vitro Amplification Of DNA By ThePolymerase Chain Reaction"和第17章节"Expression Of Cloned Genes InEscherichia coli")中更详细地解释。
本说明书通篇涉及的酶委员会(Enzyme Commission)(EC)编号(本文也称为"种类")是根据国际生物化学与分子生物学联合会命名委员会(Nomenclature Committee ofthe International Union of Biochemistry and Molecular Biology) (NC-IUBMB)在其例如作为以下得到的资源"Enzyme Nomenclature" (1992,包括附录6-17)中: "Enzymenomenclature 1992: recommendations of the Nomenclature Committee of theInternational Union of Biochemistry and Molecular Biology on the nomenclatureand classification of enzymes", Webb, E. C. (1992), San Diego: 由AcademicPress为国际生物化学与分子生物学联合会(the International Union of Biochemistryand Molecular Biology)出版(ISBN 0-12-227164-5)。这是基于由每个酶种类催化的化学反应的数字分类方案。
除非上下文另有指示,具体地意指本文所述的本发明的各种特征可以以任何组合使用。此外,本发明还设想在本发明的一些实施方案中,可以排除或省略载于本文的任何特征或特征的组合。为了说明,如果本说明书声明组合物包含组分A、B和C,其具体地意指可以单独地或以任何组合省略和放弃A、B或C或其组合的任一个。
如在本发明的描述和所附权利要求中所使用的,除非上下文另有清楚指示,单数形式"一(a)"、"一(an)"和"该/所述(the)"也旨在包括复数形式。也如本文所使用的,"和/或"指的是并且包括一种或多种关联的所列项目的任何和所有可能的组合,以及在替代方案("或")中解释时为组合的缺少。
贯穿该说明书的描述和权利要求,词语"包含(comprise)"和"含有(contain)"以及词语的变化,例如"包含(comprising)"和"包含(comprises)",意为"包括但不限于"并且不排除其他的部分、添加物、组分、整数或步骤。贯穿该说明书的描述和权利要求,除非上下文另有要求,单数包括复数。特别地,在使用不定冠词的情况下,除非上下文另有要求,本说明书应理解为设想复数以及单数。
如本文所使用的,过渡词组基本上由…"组成"意为权利要求的范围应解释为包括在权利要求中所列举的指定的材料或步骤以及实质上不影响所要求保护的发明的基本和新颖特性的那些材料或步骤。因此,当在本发明的权利要求中使用时,术语基本上由…"组成"并非旨在解释为等效于"包含"。
为了便于理解本发明,下面定义了许多术语。
如本文所使用的,术语"多胺"指的是具有两个或更多个伯氨基基团的有机化合物。多胺的示例包括腐胺(Put)、亚精胺(Spd)、精胺(Spm)、热精胺(Tspm)、对称-高亚精胺(sym-homospermidine)(Hspd)、1,2-二氨基丙烷、尸胺、胍丁胺、对称-降亚精胺(sym-norspermidine)和降精胺(norspermine)。
如本文所使用的,术语"多胺缀合物"或"谷胱甘肽-多胺缀合物"指的是至少一个谷胱甘肽(GSH)分子和多胺之间的缀合物。优选地,谷胱甘肽-多胺缀合物包含在GSH的羧基基团和多胺的胺基团之间形成的酰胺键。谷胱甘肽-多胺缀合物的非限制性而是说明性的示例包括锥虫胱甘肽(N 1 ,N 10 -双(谷胱甘酰基)亚精胺)、N 1 -谷胱甘酰基亚精胺、N 10 -谷胱甘酰基亚精胺、N 1 ,N 10 -双(谷胱甘酰基)精胺、N 1 ,N 5 ,N 10 -三(谷胱甘酰基)精胺、N 1 ,N 5 ,N 10 ,N 14 -四(谷胱甘酰基)精胺。因此,谷胱甘肽-多胺缀合物可以是一个多胺(诸如亚精胺或精胺)和一个、两个或更多个(诸如三个或四个)谷胱甘肽分子之间的缀合物。
也如本文所使用的,术语"核苷酸序列"、"核酸"、"核酸分子"、"寡核苷酸"和"多核苷酸"指的是RNA或DNA,包括cDNA、DNA片段或部分、基因组DNA、合成DNA、质粒DNA、mRNA和反义RNA,其中的任何一种可以是单链或双链的、线性或分支的、或其杂合体。本文所提供的核酸分子和/或核苷酸序列以5'至3'方向、从左至右呈现于本文,并且使用如载于美国序列细则 37 CFR §§1.821 - 1.825和世界知识产权组织(the World Intellectual PropertyOrganization)(WIPO)标准ST.25中的用于代表核苷酸性状的标准代码来代表。当人工合成地生产dsRNA时,不太常见的碱基,诸如肌苷、5-甲基胞嘧啶、6-甲基腺嘌呤、次黄嘌呤等也可以用于反义、dsRNA和核酶配对。例如,已经证明包含尿苷和胞苷的C-5丙炔类似物的多核苷酸以高亲和力结合RNA并且是基因表达的有力反义抑制剂。也可以进行其他的修饰,诸如对磷酸二酯主链或RNA的核糖糖基团中的2'-羟基的修饰。如本文所使用的术语"重组体"在使用时意为特定的核酸(DNA或RNA)是克隆、限制和/或连接步骤的各种组合的产物,所述各种组合造成具有结构编码序列或非编码序列的可区别于天然系统中发现的内源核酸的构建体。
如本文所使用的,术语"基因"指的是能够用于生产mRNA、反义RNA、miRNA、抗-微RNA反义寡聚脱氧核苷酸(anti-microRNA antisense oligodeoxyribonucleotide)(AMO)等的核酸分子。基因可以能够或不能够用于生产功能蛋白或功能基因产物。基因可以包括编码区域和非编码区域(例如,内含子、调控元件、启动子、增强子、终止序列和/或5'非翻译区域和3'非翻译区域)两者。基因可以是"分离的",其意为核酸实质上或基本上没有通常发现的与处于其天然状态的核酸关联的组分。这类组分包括其他细胞材料、来自重组体生产的培养基和/或在化学合成核酸中使用的各种化学品。
如本文所定义的"扰乱的基因"涉及造成部分或完全无功能基因和基因产物的对基因的任何突变或修饰。这种突变或修饰包括但不限于错义突变、无义突变、缺失、取代、插入、靶向序列的添加等。此外或备选地,基因的扰乱还可以通过控制基因转录的控制元件的突变或修饰(诸如启动子、终止子和/或增强元件中的突变和修饰)来达到。在这种情况下,这种突变或修饰造成基因转录的部分或完全丧失,即,当与天然和未修饰的控制元件相比,转录较低或减少。作为结果,转录和翻译之后可用的基因产物的量(如果有的话)将减少。此外,基因的扰乱可能还需要向基因中添加或从中去除定位信号,造成基因产物在其天然的亚细胞区室中的存在降低。
基因扰乱的目的是减少基因产物的可用的量,包括完全阻止基因产物的任何生产,或表达当与天然或野生型基因产物相比缺少或具有较低的酶活性的基因产物。
如本文所使用的术语"缺失"或"敲除"指的是无效或敲除的基因。
术语"减弱的活性"在涉及酶时指的是与对照或野生型状态相比酶在其天然区室中的活性降低。造成减弱的酶活性的操作包括但不限于错义突变、无义突变、缺失、取代、插入、靶向序列的添加、靶向序列的去除等。包含造成减弱的酶活性的修饰的细胞与不包含这类修饰的细胞相比将具有较低的酶活性。减弱的酶活性可以通过编码无功能基因产物(例如,具有基本上无活性例如,当与野生型多肽的活性相比活性小于约10%或者甚至5%的多肽)来达到。
基因的密码子优化版本指的是引入到细胞中的外源基因,并且其中基因的密码子已经关于特定的细胞进行了优化。一般地,并非所有的tRNA横跨物种都是同等地或者在相同水平表达。由此,基因序列的密码子优化涉及改变密码子以匹配最优势的tRNA,即,在给定细胞中,用由相对更优势的tRNA识别的同义密码子去改变由低优势的tRNA识别的密码子。这样,来自密码子优化基因的mRNA将被更有效地翻译。优选地,密码子和同义密码子编码相同的氨基酸。
如本文所使用的,术语"肽"、"多肽"和"蛋白"可互换使用以指示氨基酸残基的聚合物。术语"肽'、"多肽"和"蛋白"还包括修饰,所述修饰包括但不限于脂质附着、糖基化、糖基化、硫酸酯化、羟基化、L-谷氨酸残基的γ-羧基化和ADP-核糖基化。
如本文所使用的,术语"酶"定义为在细胞中催化化学或生化反应的蛋白。通常地,根据本发明,编码酶的核苷酸序列可操作地连接至核苷酸序列(启动子),该核苷酸序列(启动子)引起相应基因在细胞中的充分表达以赋予细胞生产亚精胺的能力。
如本文所使用的,术语"开放阅读框(ORF)"指的是编码多肽、肽或蛋白的RNA或DNA区域。
如本文所使用的,术语"基因组"包括宿主细胞中的质粒和染色体两者。例如,无论它们是染色体整合的还是质粒定位的,引入到宿主细胞中的本公开的编码核酸都可以是基因组的一部分。
如本文所使用的,术语"启动子"指的是具有控制一个或多个基因转录的功能的核酸序列,其位于关于基因的转录起始位点的转录方向的上游。在该上下文中合适的启动子包括组成型天然启动子和诱导型天然启动子两者,以及本领域的技术人员所熟知的工程化启动子。
用于在酵母细胞中使用的合适的启动子包括但不限于PDC、GPD1、TEF1、PGK1和TDH的启动子。其他合适的启动子包括GAL1、GAL2、GAL10、GAL7、CUP1、HIS3、CYC1、ADH1、PGL、GAPDH、ADC1、URA3、TRP1、LEU2、TPI、AOX1和ENOl的启动子。
如本文所使用的,如果没有另外注释,术语"终止子"指的是"转录终止信号"。终止子是阻碍或停止聚合酶的转录的序列。
如本文所使用的,根据本公开的"重组真核细胞"定义为包含内源核酸序列的额外多份或一份拷贝的细胞,或用在真核细胞中不天然存在的多肽或核苷酸序列转化或遗传修饰的细胞。如本文所使用的,野生型真核细胞定义为重组真核细胞的亲本细胞。
如本文所使用的,术语"增加(increase)"、"增加(increased)"、"增加(increasing)"、"增强(enhance)"、"增强(enhanced)"、"增强(enhancing)"和"增强(enhancement)"(及其语法变化)指示当与对照相比,至少约1%、2%、3%、4%、5%、6%、7%、8%、9%、10%、15%、20%、25%、30%、35%、40%、45%、50%、55%、60%、65%、70%、75%、80%、85%、90%、95%、100%、150%、200%、300%、400%、500% 或更多,或其中任何范围的提高。
如本文所使用的,术语"减少(reduce)"、"减少(reduced)"、"减少(reduction)"、"缩减"、"抑制"和"降低"以及相似的术语意为当与对照相比,至少约1%、2%、3%、4%、5%、6%、7%、8%、9%、10%、15%、20%、25%、30%、35%、40%、45%、50%、55%、60%、65%、70%、75%、80%、85%、90%、95%、100%、150%、200%、300%、400%、500% 或更多,或其中任何范围的降低。
如本文所使用的减少的基因表达涉及减少基因的转录、减少转录自基因的mRNA的翻译和/或减少翻译自mRNA的蛋白的翻译后加工的遗传修饰。这类遗传修饰包括应用于基因的控制序列(诸如启动子和增强子)的插入、缺失、置换或突变。例如,基因的启动子可以由较少活性的启动子或诱导型启动子替换,从而造成减少的基因转录。启动子的敲除也会造成减少的基因表达(典型地为0)。
如本文所使用的,术语本发明的核苷酸序列的"部分"或"片段"将理解为意为核苷酸序列相对于参考核酸或参考核苷酸序列长度减少,并且包含与参考核酸或参考核苷酸序列同一或几乎同一的(例如,80%、81%、82%、83%、84%、85%、86%、87%、88%、89%、90%、91%、92%、93%、94%、95%、96%、98%、99%同一性)连续核苷酸的核苷酸序列、基本上由之组成和/或由之组成。根据本发明的这种核酸片段或部分可以在适当的情况下包含于较大的多核苷酸中作为其组分。
具有同源性的不同的核酸或蛋白在本文中称为"同源物"。术语同源物包括来自相同物种和其他物种的同源序列,以及来自相同物种和其他物种的直系同源序列。"同源性"指的是用位置同一性的百分比(即,序列相似性或同一性)表示的在两个或更多个核酸和/或氨基酸序列之间的相似性水平。同源性还指的是不同的核酸或蛋白中相似的功能性质的概念。因此,本发明的组合物和方法进一步包含本发明的核苷酸序列和多肽序列的同源物。如本文所使用的,"直系同源的"指的是不同物种中的在物种形成期间由共同的祖先基因产生的同源核苷酸序列和/或氨基酸序列。本发明的核苷酸序列的同源物与所述核苷酸序列具有实质的序列同一性,例如,至少约70%、75%、80%、81%、82%、83%、84%、85%、86%、87%、88%、89%、90%、91%、92%、93%、94%、95%、96%、97%、98%、99%和/或100%。
如本文所使用的术语"过量表达(overexpress)"或"过量表达(overexpression)"指的是,与在其天然或对照状态下(例如,未用特定的过量表达的异源或重组多肽转化)在细胞中相比,更高水平的基因活性(例如,基因的转录)、更高水平的mRNA翻译为蛋白和/或更高水平的基因产物(例如,多肽)的生产。过量表达基因的典型示例是在当与基因的天然启动子相比另一启动子的转录控制下的基因。另外或备选地,基因的控制元件(诸如增强子)中的其他改变也可以用于过量表达特定的基因。此外,如本文所使用的,影响(即,增加)转录自基因的mRNA的翻译的修饰可以备选地或额外用于达到基因过量表达。这些术语也可以指的是细胞中基因的拷贝数量的增加和/或mRNA和/或基因产物的量的增加。还可以通过包括来自不同物种的编码相同或同源基因产物(诸如酶)的基因达到过量表达。过量表达可以造成当与对照水平相比细胞中25%、50%、75%、100%、200%、300%、400%、500%、750%、1000%、1500%、2000%或更高,或其中任何范围的水平。
如本文所使用的,术语"外源的"或"异源的"在关于核酸(RNA或DNA)、蛋白或基因使用时指的是,核酸、蛋白或基因作为将其导入的细胞、生物体、基因组、RNA或DNA序列的部分非天然地存在,包括天然存在的核苷酸序列的非天然存在的多份拷贝。这种外源基因可以是来自另一物种或株系的基因、在宿主细胞中天然存在的基因的修饰、突变或进化版本或在宿主细胞中天然存在的基因的嵌合版本或融合基因。在这些前例中,修饰、突变或进化引起基因的核苷酸序列的改变,从而获得当与宿主细胞中天然存在的基因相比具有另一核苷酸序列的修饰、突变或进化的基因。进化基因指的是编码进化基因并且通过遗传修饰(诸如突变或暴露于进化压力)以衍生出当与野生型或天然基因相比具有不同的核苷酸序列的新基因而获得的基因。嵌合基因是通过将一个或多个编码序列的部分组合以产生新基因而形成的。这些修饰与将整个基因序列合并到单个阅读框中的融合基因不同,并且经常保留它们的原始功能。
"内源的"、"天然的"或"野生型"核酸、核苷酸序列、多肽或氨基酸序列指的是天然存在的或内源的核酸、核苷酸序列、多肽或氨基酸序列。因此,例如,"野生型mRNA"是天然存在于生物体中或生物体内源的mRNA。
如本文所使用的,术语"修饰的",当其关于生物体使用时,指的是当与除了未进行如此修饰之外相同的宿主生物体相比,已进行修饰以生产多胺缀合物的宿主生物体。原则上,根据本公开的这类"修饰"可以包含当与除了未经过修饰之外相同的生物体相比,在宿主生物体中适当地变更多胺缀合物的生产的任何生理、遗传、化学或其他的修饰。然而,在大多数实施方案中,修饰将包含遗传修饰。在某些实施方案中,如本文所述,修饰包含将基因引入到宿主细胞中。提升多肽活性的遗传修饰包括但不限于:引入一份或多份拷贝的编码多肽的基因(其可能区别于已存在于宿主细胞中的编码具有相同活性的多肽的任何基因);变更存在于细胞中的基因以增加基因的转录或翻译(例如,变更例如调节序列、启动子序列或其他序列、向其添加额外的序列、置换其一个或多个核苷酸、从其缺失序列或将其交换);以及变更编码多肽的基因的序列(例如非编码或编码的)以提升活性(例如通过增加酶活性、降低反馈抑制、靶向特定的亚细胞定位、提升mRNA稳定性、提升蛋白稳定性)。减少多肽活性的遗传修饰包括但不限于:缺失编码多肽的基因的部分或全部;插入扰乱编码多肽的基因的核酸序列;改变存在于细胞中的基因以减少基因的转录或翻译或减少由基因编码的mRNA或多肽的稳定性(例如通过添加额外的序列至启动子序列、调控序列或其他序列、将其变更、从其缺失序列、置换其一个或多个核苷酸或将其交换例如置换其一个或多个氨基酸)。术语"过量生产"在本文中关于宿主细胞中产物的生产使用,并且指示依靠编码涉及宿主细胞的代谢途径的不同多肽的核酸序列的引入或作为其他的修饰的结果,当与未修饰宿主细胞或野生型细胞相比宿主细胞生产更多的产物。
如本文所使用的术语"载体"定义为线性或环状DNA分子,其包含编码本发明的多肽的多核苷酸,并且所述多核苷酸可操作地连接至确保其表达的额外核苷酸。
"引入"在酵母细胞的上下文中意为以核酸分子得以进入细胞的内部的方式使核酸分子与细胞接触。相应地,多核苷酸和/或核酸分子可以在单个的转化事件中、在分开的转化事件中引入酵母细胞。因此,如本文所使用的术语"转化"指的是将异源核酸引入到细胞中。酵母细胞的转化可以是稳定的或瞬时的。
"瞬时转化"在多核苷酸的上下文中意为多核苷酸被引入到细胞中并且不整合到细胞的基因组中。
在将多核苷酸引入到细胞中的上下文中提到"稳定地引入"或"稳定地引入的"意指引入的多核苷酸稳定地并入到细胞的基因组中,并且因此多核苷酸稳定地转化了细胞。如本文所使用的"稳定转化"或"稳定地转化的"意为核酸分子被引入到细胞中并且整合到细胞的基因组中。因此,整合的核酸分子能够由其后代,更特别地,由多个连续世代的后代遗传。如本文所使用的稳定转化也可以指的是染色体外(例如作为微型染色体)维持的核酸分子。
瞬时转化可以通过例如酶联免疫吸附测定(ELISA)或蛋白质印迹来检测,其可以检测由一个或多个引入到生物体中的核酸分子编码的肽或多肽的存在。细胞的稳定转化可以通过例如细胞的基因组DNA与核酸序列的DNA印迹杂交测定来检测,所述核酸序列特异性地与引入到生物体(例如,酵母)中的核酸分子的核苷酸序列杂交。细胞的稳定转化可以通过例如细胞的RNA与核酸序列的RNA印迹杂交测定来检测,所述核酸序列特异性地与引入到酵母或其他生物体中的核酸分子的核苷酸序列杂交。细胞的稳定转化还可以通过例如本领域所熟知的聚合酶链式反应(PCR)或其他扩增反应来检测,所述反应采用与核酸分子的靶标序列杂交的特异性引物序列,造成靶标序列的扩增,其可以根据标准方法检测。转化也可以通过本领域熟知的直接测序和/或杂交方案来检测。
本发明的实施方案还包括如本文所述的多肽的变体。如本文所使用的,"变体"意为其中氨基酸序列与它衍生自的基础序列的不同在于序列内一个或多个氨基酸由其他氨基酸取代的多肽。例如,SEQ ID NO: 1的变体可以具有氨基酸序列,该氨基酸序列与SEQ IDNO: 1具有至少约50%同一性(例如,至少约80%、85%、90%、91%、92%、93%、94%、95%、96%、97%、98%、99%或约100%同一性)。该变体和/或片段是功能变体/片段,因为变体序列具有与酶相似或同一的功能酶活性特性,所述酶具有本文所指定的非变体氨基酸序列(并且这是如本说明书通篇所使用的术语"功能变体"的含义)。
因此,任何呈现的氨基酸序列的"功能变体"或"功能片段"是保持在与非变体序列相同酶类别(即,具有相同EC编号)内的任何氨基酸序列。确定酶是否落在特定类别内的方法是技术人员所熟知的,技术人员可以在不使用创造性技能的情况下确定酶类别。例如,合适的方法可以从国际生物化学与分子生物学联合会(the International Union ofBiochemistry and Molecular Biology)获得。
氨基酸由具有大致相似的性质的不同的氨基酸置换的氨基酸取代可以视为"保守的"。非保守取代是其中氨基酸由不同类型的氨基酸置换。
"保守取代"意为一种氨基酸由相同种类的另一氨基酸取代,其中种类定义如下:
种类 氨基酸示例
非极性的:A、V、L、I、P、M、F、W
不带电荷的极性的:G、S、T、C、Y、N、Q
酸性的:D、E
碱性的:K、R、H。
如本领域技术人员所熟知的,通过保守取代变更多肽的一级结构可能不会显著变更该多肽的活性,因为插入到序列中的氨基酸的侧链可能能够形成与已被取代掉的氨基酸的侧链相似的键和接触。甚至当取代在对确定多肽构象关键的区域中时也是如此。
在本发明的实施方案中,如本文别处所定义的,假如非保守取代不干扰多肽的酶活性,它们就是可能的。酶的取代版本必须保留特性,以使它们保持在与非取代酶相同的酶种类中,所述酶种类使用以上所讨论的NC-IUBMB命名法确定。
一般来说,在不变更多肽的生物活性的情况下,比保守取代更少的非保守取代将是可能的。确定任何取代(以及事实上任何氨基酸缺失或插入)的影响完全在技术人员的常规能力内,技术人员可以容易地确定变体多肽是否保留根据本发明的方面的酶活性。例如,当确定多肽的变体是否落在本发明的范围内(即,为如上所定义的"功能变体或片段")时,技术人员将确定该变体或片段是否保留如本文别处所提及的参考NC-IUBMB命名法所定义的底物转换酶活性。所有这类变体都在本发明的范围内。
除本文所公开的那些之外,使用标准的遗传代码,技术人员可以容易地构思和制造编码多肽的进一步的核酸序列。核酸序列可以是DNA或RNA,并且当它是DNA分子时,它可以例如包含cDNA或基因组DNA。如本文别处所述,核酸可以包含在表达载体内。
因此,本发明的实施方案包括编码本发明的实施方案所设想的多肽的变体核酸序列。与核酸序列相关的术语"变体"意为多核苷酸序列的任何取代、变化、修饰、置换、缺失或自其或向其添加一个或多个核苷酸,只要由该多核苷酸编码的所得多肽序列表现出与由基础序列编码的多肽至少相同或相似的酶性质。该术语包括等位基因的变体并且还包括多核苷酸("探针序列"),该多核苷酸实质上与本发明的实施方案的多核苷酸序列杂交。这类杂交可能发生在低严谨性条件下和高严谨性条件下或两者之间。概括地,低严谨性条件可以定义为其中洗涤步骤发生在低于探针序列的计算或实际解链温度(Tm)约40°C-48°C(例如,约实验室环境温度至约55°C)的温度下在0.330-0.825 M NaCl缓冲液溶液中的杂交,而高严谨性条件涉及在低于探针序列的计算或实际Tm约5°C-10°C(例如,约65°C)的温度下在0.0165-0.0330 M N缓冲液溶液中的洗涤。缓冲液溶液可以是,例如盐水-柠檬酸钠(SSC)缓冲液(0.15M NaCl和0.015M 柠檬酸三钠),其中低严谨性洗涤发生在3 × SSC缓冲液中,并且高严谨性洗涤发生在0.1 × SSC 缓冲液中。涉及核酸序列的杂交的步骤已经在例如Molecular Cloning, a laboratory manual [second edition] Sambrook et al. ColdSpring Harbor Laboratory, 1989中,例如在其第11章节"Synthetic OligonucleotideProbes"中描述。
优选地,核酸序列变体具有约80%或更多的与本发明的实施方案的核酸序列相同的核苷酸,更优选至少85%或甚至90%、91%、92%、93%、94%、95%、96%、97%、98%或99%或更大的序列同一性。
本发明的变体核酸可以进行密码子优化以用于在特定的宿主细胞中表达。
如本文所使用的,"序列同一性"指的是两个核苷酸序列或两个肽序列或蛋白序列之间的序列相似性。通过序列比对确定相似性,以确定序列之间的结构和/或功能关系。
氨基酸序列之间的序列同一性可以通过使用从the National Center forBiotechnology Information (NCBI), Bethesda, Md., USA得到的Needleman-Wunsch 全局序列比对工具(Needleman-Wunsch Global Sequence Alignment Tool),例如,经由http://blast.ncbi. nlm.nih.gov/Blast.cgi,使用默认参数设置(对于蛋白比对,Gapcosts Existence: 11 Extension: 1)比较序列的比对来确定。本说明书中提及的序列比较和百分比同一性已使用该软件确定。当将序列同一性水平与例如SEQ ID NO: 1比较时,优选地,这应相对于SEQ ID NO: 1的整个长度进行(即,使用全局比对方法),以避免短区域的高同一性重叠造成同一性的高总体评价。例如,具有例如五个氨基酸的短多肽片段可能具有与整个SEQ ID NO: 1内的五个氨基酸区域具有100%同一性的序列,但这不提供100%的氨基酸同一性,除非该片段形成较长序列的部分,该较长序列在等效于SEQ ID NO: 1中的位置的其他位置也具有同一的氨基酸。当比较序列中的等效位置由相同的氨基酸占据时,那么分子在该位置是同一的。将比对评分为同一性的百分比是与比较序列共享的位置处同一的氨基酸的数量的函数。在比较序列时,最佳比对可能要求将空位引入到一个或多个序列中,以考虑序列中可能的插入和缺失。序列比较方法可以采用空位罚分,以便对于被比较的序列中相同数量的同一的分子,具有尽可能少的空位的序列比对,其反映两个比较序列之间较高的相关性,将达到比具有许多空位的序列更高的分数。最大百分比同一性的计算涉及考虑空位罚分的最佳比对的产生。如上所提及的,可以使用Needleman-Wunsch全局序列比对工具,使用默认参数设置确定百分比序列同一性。Needleman-Wunsch算法出版于J. Mol. Biol. (1970) 48卷: 443-453。
本发明的一个方面涉及能够生产谷胱甘肽-多胺缀合物的酵母细胞。该酵母细胞能够生产至少一种多胺并且该酵母细胞包含多胺:谷胱甘肽连接酶编码基因和至少一种多胺合酶编码基因,但是缺少多胺氧化酶编码基因或包含扰乱的多胺氧化酶编码基因。
在一个实施方案中,酵母细胞被工程化以用于多胺:谷胱甘肽连接酶的过量表达。
在一个实施方案中,多胺:谷胱甘肽连接酶的过量表达通过将多胺:谷胱甘肽连接酶编码基因置于在酵母细胞中高活性的启动子的转录控制下来达到。用于在酵母细胞中使用的合适的启动子包括但不限于,PDC、GPD、GPD1、TEF1、PGK1、TDH和TDH3的启动子。其他合适的启动子包括GAL1、GAL2、GAL10、GAL7、CUP1、HIS3、CYC1、ADH1、PGL、GAPDH、ADC1、URA3、TRP1、LEU2、TPI、AOX1和ENO1的启动子。
酵母细胞可以包含一份或多份(即至少两份)拷贝的多胺:谷胱甘肽连接酶编码基因,从而增加多胺:谷胱甘肽连接酶的mRNA的拷贝数量并且从而增加由酵母细胞生产的多胺:谷胱甘肽连接酶的量。在这种情况下,多胺:谷胱甘肽连接酶编码基因的多份拷贝可以在一个启动子的转录控制下,或每个多胺:谷胱甘肽连接酶编码基因可以在各自的启动子的转录控制下。在后一种情况下,相同类型的启动子可以用于控制各个多胺:谷胱甘肽连接酶编码基因的转录,或可以使用不同类型的启动子。
在一个实施方案中,谷胱甘肽-多胺缀合物选自锥虫胱甘肽(N 1 ,N 10 -双(谷胱甘酰基)亚精胺)、N 1 -谷胱甘酰基亚精胺、N 10 -谷胱甘酰基亚精胺、N 1 ,N 10 -双(谷胱甘酰基)精胺、N 1 ,N 5 ,N 10 -三(谷胱甘酰基)精胺、N 1 ,N 5 ,N 10 ,N 14 -四(谷胱甘酰基)精胺。因此,缀合物可以是一个多胺(诸如亚精胺或精胺)和一个、两个或多个(诸如三个或四个)谷胱甘肽分子之间的缀合物。
在一个实施方案中,多胺:谷胱甘肽连接酶编码基因选自锥虫胱甘肽合酶(EC6.3.1.9)、谷胱甘酰基亚精胺合酶(EC 6.3.1.8)及其组合。
在一个实施方案中,锥虫胱甘肽合酶(TryS)编码基因选自布氏锥虫指名亚种(Trypanosoma brucei brucei)锥虫胱甘肽合酶(TbbTryS)和核苷酸序列,该核苷酸序列编码与锥虫胱甘肽合酶TbbTryS具有至少80%的序列同一性的锥虫胱甘肽合酶。在一个实施方案中,该核苷酸序列编码与布氏锥虫指名亚种TrrS具有至少85%或甚至90%、91%、92%、93%、94%、95%、96%、97%、98%或99%的序列同一性的锥虫胱甘肽合酶。在一个实施方案中,该具有至少80%的序列同一性的锥虫胱甘肽合酶能够催化谷胱甘肽和多胺转换为谷胱甘酰基多胺和/或催化谷胱甘肽和谷胱甘酰基多胺转换为双(谷胱甘酰基)多胺,优选地,能够催化谷胱甘肽和亚精胺转换为谷胱甘酰基亚精胺和催化谷胱甘肽和谷胱甘酰基亚精胺转换为N 1 , N 10 -双(谷胱甘酰基)亚精胺。具有至少80%的序列同一性的锥虫胱甘肽合酶的酶效力可以低于、实质上等于或高于TbbTrrS相应的酶效力,优选至少实质上相等或较高的酶效力。
TbbTryS的氨基酸序列展示于SEQ ID NO: 34中并且TbbTrrS的核苷酸序列展示于SEQ ID NO: 35中。
在一个实施方案中,谷胱甘酰基亚精胺合酶(GSS)编码基因选自大肠杆菌(Escherichia coli)谷胱甘酰基亚精胺合酶(EcGSS)和核苷酸序列,该核苷酸序列编码与谷胱甘酰基亚精胺合酶EcGSS具有至少80%的序列同一性的谷胱甘酰基亚精胺合酶。在一个实施方案中,该核苷酸序列编码与大肠杆菌EcGSS具有至少85%或甚至90%、91%、92%、93%、94%、95%、96%、97%、98%或99%的序列同一性的谷胱甘酰基亚精胺合酶。在一个实施方案中,该具有至少80%的序列同一性的谷胱甘酰基亚精胺合酶能够催化谷胱甘肽和多胺转换为谷胱甘酰基多胺,优选地,能够催化谷胱甘肽和亚精胺转换为谷胱甘酰基亚精胺。具有至少80%的序列同一性的谷胱甘酰基亚精胺合酶的酶效力可以低于、实质上等于或高于EcGSS相应的酶效力,优选至少实质上相等或较高的酶效力。
EcGSS的氨基酸序列展示于SEQ ID NO: 259中并且EcGSS的核苷酸序列展示于SEQID NO: 260中。
在一个实施方案中,至少一种多胺选自精胺、热精胺、对称-高亚精胺、1,3-二氨基丙烷、腐胺、尸胺、胍丁胺、亚精胺、对称-降亚精胺、降精胺及其组合。
本发明的酵母细胞缺少多胺氧化酶(EC 1.5.3.17)编码基因或包含扰乱的多胺氧化酶编码基因。该酵母细胞还包含至少一种多胺合酶编码基因。
由酵母细胞表达的至少一种多胺合酶在酵母细胞中催化至少一种多胺的生产。多胺氧化酶是催化精胺转换回亚精胺的酶。因此,酵母细胞缺少任何多胺氧化酶编码基因或包含扰乱的多胺氧化酶编码基因。这意为优选酵母细胞缺少任何多胺氧化酶,或如果酵母细胞中表达这种多胺氧化酶,优选该多胺氧化酶为酶促非活性的或至少当与天然多胺氧化酶相比具有显著较低的酶效力。
在一个实施方案中,酵母细胞被工程化以用于至少一种多胺合酶的过量表达。
在一个实施方案中,至少一种多胺合酶的过量表达通过将至少一种至少一种多胺合酶编码基因置于在酵母细胞中高活性的启动子的转录控制下来达到。用于在酵母细胞中使用的合适的启动子包括但不限于,PDC、GPD、GPD1、TEF1、PGK1、TDH和TDH3的启动子。其他合适的启动子包括GAL1、GAL2、GAL10、GAL7、CUP1、HIS3、CYC1、ADH1、PGL、GAPDH、ADC1、URA3、TRP1、LEU2、TPI、AOX1和ENO1的启动子。
酵母细胞可以包含一份或多份拷贝的多胺合酶编码基因,从而增加多胺合酶的mRNA的拷贝数量并且从而增加由酵母细胞生产的多胺合酶的量。在这种情况下,多胺合酶编码基因的多份拷贝可以在一个启动子的转录控制下,或每个多胺合酶编码基因可以在各自的启动子的转录控制下。在后一种情况下,相同类型的启动子可以用于控制各个多胺合酶编码基因的转录,或可以使用不同类型的启动子。
在一个实施方案中,多胺合酶编码基因选自精胺合酶(EC 2.5.1.22)编码基因、热精胺合酶(EC 2.5.1.79)编码基因和高亚精胺合酶(EC 2.5.1.44或EC 2.5.1.45)编码基因。
在一个实施方案中,精胺合酶编码基因选自酿酒酵母精胺合酶(优选ScSPE4)、拟南芥精胺合酶(AtSPMS)和核苷酸序列,该核苷酸序列编码与精胺合酶ScSPE4或精胺合酶AtSPMS具有至少80%的序列同一性的精胺合酶。在一个实施方案中,该核苷酸序列编码与ScSPE4或AtSPMS具有至少85%或甚至90%、91%、92%、93%、94%、95%、96%、97%、98%或99%的序列同一性的精胺合酶。在一个实施方案中,该具有至少80%的序列同一性的精胺合酶能够催化亚精胺转换为精胺。具有至少80%的序列同一性的精胺合酶的酶效力可以低于、实质上等于或高于ScSPE4或AtSPES相应的酶效力,优选至少实质上相等或较高的酶效力。
ScSPE4的氨基酸序列展示于SEQ ID NO: 1中并且ScSPE4的核苷酸序列展示于SEQID NO: 2中。AtSPMS相应的氨基酸序列展示于SEQ ID NO: 3中并且AtSPMS的核苷酸序列展示于SEQ ID NO: 4中。
在一个实施方案中,热精胺合酶编码基因选自拟南芥热精胺合酶(优选AtACL5)和核苷酸序列,该核苷酸序列编码与热精胺合酶AtACL5具有至少80%的序列同一性的热精胺合酶。在一个实施方案中,该核苷酸序列编码与AtACL5具有至少85%或甚至90%、91%、92%、93%、94%、95%、96%、97%、98%或99%的序列同一性的热精胺合酶。在一个实施方案中,该具有至少80%的序列同一性的热精胺合酶能够催化亚精胺转换为热精胺。具有至少80%的序列同一性的热精胺合酶的酶效力可以低于、实质上等于或高于AtACL5相应的酶效力,优选至少实质上相等或较高的酶效力。
AtACL5的氨基酸序列展示于SEQ ID NO: 5中并且AtACL5的核苷酸序列展示于SEQID NO: 6中。
在一个实施方案中,高亚精胺合酶(HSS)编码基因选自春千里光(Senecio vernalis)高亚精胺合酶(SvHSS)、绿色芽生绿菌(Blastochloris viridis)高亚精胺合酶(BvHSS)和核苷酸序列,该核苷酸序列编码与高亚精胺合酶SvHSS或高亚精胺合酶BvHSS具有至少80%的序列同一性的高亚精胺合酶。在一个实施方案中,该核苷酸序列编码与SvHSS或BvHSS具有至少85%或甚至90%、91%、92%、93%、94%、95%、96%、97%、98%或99%的序列同一性的高亚精胺合酶。在一个实施方案中,该具有至少80%的序列同一性的高亚精胺合酶能够催化腐胺转换为对称-高亚精胺,或催化腐胺或亚精胺转换为对称-高亚精胺。具有至少80%的序列同一性的高亚精胺合酶的酶效力可以低于、实质上等于或高于SvHSS或BvHSS相应的酶效力,优选至少实质上相等或较高的酶效力。
SvHSS的氨基酸序列展示于SEQ ID NO: 7中并且SvHSS的核苷酸序列展示于SEQID NO: 8中。BvHSS相应的氨基酸序列展示于SEQ ID NO: 9中并且BvHSS的核苷酸序列展示于SEQ ID NO: 10中。
在一个实施方案中,酵母细胞选自酵母属(Saccharomyces)、克鲁维酵母菌属(Kluyveromyces)、接合酵母属(Zygosaccharomyces)、假丝酵母属(Candida)、有孢汉逊酵母属(Hanseniaspora)、毕赤酵母属(Pichia)、汉逊酵母属(Hansenula)、裂殖酵母属(Schizosaccharomyces)、三角酵母属(Trigonopsis)、酒香酵母属(Brettanomyces)、德巴利酵母属(Debaromyces)、拿逊酵母属(Nadsonia)、油脂酵母属(Lipomyces)、隐球菌属(Cryptococcus)、短梗霉属(Aureobasidium)、丝孢酵母属(Trichosporon)、红酵母属(Rhodotorula)、耶罗威亚酵母属(Yarrowia)、红冬孢酵母属(Rhodosporidium)、法夫酵母属(Phaffia)、许旺酵母属(Schwanniomyces)、曲霉属(Aspergillus)和阿氏酵母属(Ashbya)。在特定的实施方案中,酵母细胞选自酿酒酵母、布拉氏酵母(Saccharomyces boulardii)、拜耳接合酵母(Zygosaccharomyces bailii)、乳酸克鲁维酵母(Kluyveromyces lactis)、圆红冬孢酵母(Rhodosporidium toruloides)、解脂耶罗威亚酵母(Yarrowia lipolytica)、粟酒裂殖酵母(Schizosaccharomyces pombe)、巴斯德毕赤酵母(Pichia pastoris)、异常汉逊酵母(Hansenula anomala)、圆球形假丝酵母(Candida sphaerica)或解苹果酸裂殖酵母(Schizosaccharomyces malidevorans)。酿酒酵母是优选的酵母物种。
在一个实施方案中,酵母细胞是酿酒酵母细胞并且多胺氧化酶是FMS1。因此,在一个实施方案中,酿酒酵母细胞缺少FMS1或包含扰乱的FMS1。
本发明的另一方面涉及能够生产谷胱甘肽-多胺缀合物的酵母细胞。该酵母细胞能够生产至少一种多胺并且该酵母细胞包含多胺:谷胱甘肽连接酶编码基因。
如前文中所述的酵母细胞的各种实施方案也可以应用于本发明的这个方面。
本发明的另一方面涉及生产谷胱甘肽-多胺缀合物的方法。该方法包含在培养基中并且在适合于通过根据本发明的酵母细胞生产谷胱甘肽-多胺缀合物的培养条件下培养所述酵母细胞。该方法还包含从培养基和/或从酵母细胞中收集谷胱甘肽-多胺缀合物。
在本发明的这个方面中的培养基可以是其中可以培养酵母细胞以生产谷胱甘肽-多胺缀合物的任何培养基。培养可以是,例如分批培养、补料分批培养或灌注培养或发酵、生物反应器发酵等的形式。
实施例
实施例1:通过系统地重新布线酵母中的天然代谢来改善亚精胺生产
在该实施例1中,我们系统地重构了酵母菌株中的代谢,包括中心碳代谢和氮代谢、甲硫氨酸补救途径、腺嘌呤的补救途径、多胺转运机器和多胺消耗/降解途径。此外,我们还引入了另外潜在的阳性遗传靶标。该酵母菌株用新的模块化遗传设计建造。具体地,从头Spd生物合成途径分成多个遗传模块,其包含很多生物合成酶的编码序列以将更大的碳通量从糖碳来源转移至Spd。
设计以增加L-鸟氨酸(Orn)的积累的前体过量生产模块(I)包括八种蛋白的过量表达:来自酿酒酵母的NADP(+)-依赖的谷氨酸脱氢酶(GDH1)[SEQ ID NO: 11]、来自酿酒酵母的线粒体天冬氨酸和谷氨酸载体蛋白(AGC1)[SEQ ID NO: 12]、来自酿酒酵母的线粒体L-鸟氨酸载体蛋白(ORT1)[SEQ ID NO: 13]、来自大肠杆菌的谷氨酸N-乙酰基转移酶(EcargA)[SEQ ID NO: 14]、来自大肠杆菌的乙酰基谷氨酸激酶(EcargB)[SEQ ID NO:15]、来自谷氨酸棒状杆菌(Corynebacteriumglutamicum)的N-乙酰基-γ-谷氨酰基磷酸还原酶(CgargC)[SEQ ID NO: 16]、来自谷氨酸棒状杆菌的乙酰基鸟氨酸氨基转移酶(CgargD)[SEQ ID NO: 17]和来自谷氨酸棒状杆菌的鸟氨酸乙酰基转移酶(CgargJ)[SEQID NO: 18]。此外,该模块(I)中还包括两种蛋白的减弱或去除:通过用较弱的启动子PKEX2交换其天然启动子PARG3来减弱酵母天然的鸟氨酸氨甲酰基转移酶(ARG3)[SEQ ID NO:19],和通过敲除CAR2来去除L-鸟氨酸转氨酶(CAR2)[SEQ ID NO: 20]的活性。
设计以从L-鸟氨酸过量生产Put的腐胺(Put)模块(II)包括两个遗传修饰;来自酿酒酵母的鸟氨酸脱羧酶(SPE1)[SEQ ID NO: 21]的过量表达和天然的鸟氨酸脱羧酶抗酶(OAZ1)[SEQ ID NO: 22]的缺失。
设计亚精胺生物合成模块(III)用于从腐胺过量生产亚精胺(Spd),并且其特点在于来自酿酒酵母的两种蛋白的过量表达:腺苷甲硫氨酸脱羧酶(AdoMetDC;SPE2)[SEQ IDNO: 23]和亚精胺合酶(SpdSyn;SPE3)[SEQ ID NO: 24]。该模块还包括两种天然蛋白的缺失以避免亚精胺消耗或降解:编码精胺合酶的SPE4 [SEQ ID NO: 2]和编码非特异性多胺氧化酶的FMS1 [SEQ ID NO: 25]的缺失。
设计S-腺苷-L-甲硫氨酸(AdoMet)模块(IV)以增加辅因子AdoMet的可达性。这些修饰包括很多蛋白的过量表达:5'-甲硫腺苷磷酸化酶(MEU1)[SEQ ID NO: 26](来自酿酒酵母)、支链氨基酸氨基转移酶(BAT2)[SEQ ID NO: 27](来自酿酒酵母)、腺嘌呤磷酸核糖基转移酶(APT1)[SEQ ID NO: 28](来自酿酒酵母)、核糖磷酸焦磷酸激酶(PRS5)[SEQ IDNO: 29](来自酿酒酵母)和来自婴儿利什曼原虫(Leishmania infantum)的S-腺苷甲硫氨酸合酶(LiMAT)[SEQ ID NO: 30]。该模块还包括腺嘌呤脱氨酶活性(AAH1)[SEQ ID NO:31]的缺失。
设计多胺流出模块(V)以减轻对细胞的细胞毒性或减轻对多胺生物合成的抑制。该模块包括由TPO5[SEQ ID NO: 32]编码的酵母天然的多胺转运蛋白的过量表达。
最后但重要地,设计另外的亚精胺生物合成模块(VI)用于从腐胺和AdoMet过量生产亚精胺。该模块包括由SPE2-SPE3[SEQ ID NO: 33]编码的AdoMetDC-SpdSyn融合蛋白的过量表达。
该实施例1中的基因的过量表达经由基于CRISPR/cas9系统或传统的遗传标记物的方法,通过染色体整合至作为整合基因座的我们预期没有生长缺陷和活性表达的区域来获得。按照由Mans等人2015开发的方案执行基于CRISPR/cas9的基因组编辑。特别地,包含使Cas9能够组成型表达的具有HIS3标记物的质粒pL-CAS9-HIS的酿酒酵母菌株CEN.PK113-11C为用于所有遗传操作的开始菌株。为了使得能够在所选择的基因座中有效地编辑基因组,构建了多个引导RNA(gRNA)质粒。遗传模块根据重叠延伸PCR (OE-PCR)程序构建为整合盒,所述遗传模块包括遗传部分(即,启动子、终止子、ORF和同源臂)的各种组合。该实施例1中使用以下基因和启动子组合:TPI1p-ORT1-pYX212t、tHXT7p-AGC1-CYC1t、TEF1p-GDH1-DIT1t 、PGK1p-SPE3-pYX212t、TEF1p-SPE1-PRM9t、TDH3p-SPE2-DIT1t、TDH3p-CgargJ-TDH2t、PGK1p--EcargB-ADH1t、TEF1p-CgargC-FBA1t、tHXT7p-CgargD-TPI1t、TPI1p-EcargA-CYC1t、TPI1p-MEU1p-FBA1t、PGK1p-BAT2-CYC1t、TDH3p-APT1-DIT1t、TEF1p-PRS5-PRM9t、TEF1p-LiMAT-PRM9、TDH3p-TPO5-CYC1t、TEF1p-SPE2-SPE3-PRM9t。
所有天然的遗传部分(即天然的启动子、终止子、ORF和同源臂)是使用CEN.PK113-11C基因组DNA作为模板进行PCR扩增的。对于优化的异源基因,使用合成的片段或质粒(从GenScript获得)用于PCR扩增。贯穿整个分子克隆程序使用High-fidelity Phusion DNA聚合酶。通过标准的LiAc/SS DNA/PEG转化方法将盒或质粒引入到酵母中。在合成的不含尿嘧啶完全介质培养基(SC-URA)上选择包含基于URA3的质粒或盒的菌株,所述培养基由6.7 g/L不含氨基酸的酵母氮源基础(YNB)、0.77 g/l不含尿嘧啶的完全补充混合物(CSM-URA)、20g/l葡萄糖和20 g/l琼脂组成。将URA3标记物去除,并在5-氟乳清酸(5'-FOA)平板上选择。此外,基于CRISPR/cas9的系统也用于实践AAH1、SPE4和FMS1的缺失。其他基因敲除实验通过传统的方法进行。本文所使用的所有引物在表1中列出,所有质粒在表2中列出并且所有菌株在表3中列出。
表1-引物
表2-质粒
表3-菌株
使用将深孔规模发酵和高效液相色谱(HPLC)组合的测定以评估所得菌株JQSPD_AA。特别地,所得JQSPD_AA菌株用于多胺的生产的24孔深孔分批发酵在由Verduyn等人1992开发的基本培养基中实行。在0.2的初始OD600下用2 ml基本培养基在24孔深孔平板中自24h预培养物接种培养物,并且在300 rpm,30°C下培育120小时。基本培养基包含7.5 g/l(NH4)2SO4、14.4 g/l KH2PO4、0.5 g/l MgSO4∙7H2O、20 g/l葡萄糖、2 ml/l微量金属和1 ml/l维生素溶液,如果需要,用40 mg/l尿嘧啶、40 mg/l组氨酸补充,pH调节至4.5。样品通过取0.1 ml的液体培养物制备并且经过热水(HW)提取。在该方法中,我们使用在深孔平板的发酵中的基本培养基作为提取背景。将包含0.9 ml的发酵培养基的管在水浴中100°C下预热10 min。然后,将热发酵培养基迅速地倾倒至0.1 ml的液体培养物上;立即涡旋混合物,并且将样品放置于水浴中。30 min后,将每个管放置于冰上5 min。离心后,上清液直接用于衍生化。为了衍生化,将0.125 ml的饱和NaHCO3溶液和0.25 ml的丹磺酰氯溶液(5 mg/ml在丙酮中)添加至0.25 ml的样品。然后反应混合物在40°C下在黑暗中偶尔摇动(occasionalshaking)孵育1 h。通过添加0.275 ml的甲醇停止反应。通过25 mm针头式滤器(0.45 µmNylon)过滤的样品用于HPLC检测。使用以下色谱条件:C18 (100 mm × 4.6 mm i.d., 2.6µm, Phenomenex Kinetex),激发波长340 nm,发射波长515 nm,样品注入1.5 µl,柱温40°C,检测器灵敏度7,采集开始于4.0 min。流动相为水和甲醇,其速度为1 ml/min。洗脱程序如下:0-5min 50%至65%甲醇,5-7.5 min 65%至75%甲醇,7.5-9.5 min 75%至87.5%甲醇,9.5-10.5 min 87.5%至100%甲醇,10.5-11.5 min 100%甲醇,11.5-13.5 min 100%至50%甲醇,13.5-16 50% 甲醇。
菌株JQSPD_AA生产的Spd滴度在浓度> 400 mg/l,与如本文所使用的仅有部分的修饰的菌株(见WO 2016/144247和WO 2019/013696中的实施例)相比显著地增加了Spd滴度。
实施例2:酵母中高级多胺的生产
生命已经进化出各式各样的途径来合成多胺的结构变体。事实上,虽然Put和Spd作为常见的多胺典型地发现于大多数细胞中,但不常见的多胺,诸如对称-高亚精胺(Hspd)、热精胺(Tspm)、精胺(Spm)、支链多胺和长链多胺(LCPAs)在自然界中也已被找到。该实施例2通过设计遗传模块(VII)并将其引入到来自实施例1的Spd平台菌株JQSPD_AA中,调查了对称-高亚精胺(Hspd)、热精胺(Tspm)和精胺(Spm)的生物合成。
我们首先着手异源合成三胺Hspd,其存在于植物和细菌两者中。在植物中,Hspd是吡咯里西啶生物碱生物合成中第一个途径特异性的中间体,其由高亚精胺合酶(植物HSS;EC 2.5.1.45)形成。该酶比细菌高亚精胺合酶(细菌HSS;EC 2.5.1.44)更具特异性,因为后者不能使用Put作为氨基丁基基团的供体。为了探索植物HSS和细菌HSS两者用于Hspd微生物生产的可能性,设计以在酵母中生物合成Hspd的遗传子模块(VII-a)和(VII-b)分别编码了春千里光SvHSS和绿色芽生绿菌BvHSS13的表达。子模块作为从GenScipt订购的并且分别包含酵母密码子优化的SvHSS基因[SEQ ID NO: 8]和BvHSS基因[SEQ ID NO: 10]的高拷贝质粒SvHSS_p426GPD和BvHSS_p426GPD引入到Spd平台菌株JQSPD_AA中。转化实验按照与实施例1中相同的程序。用与实施例1中所描述的相同的程序测定所得菌株JQSPD_AA (SvHSS_p426GPD)和JQSPD_AA (BvHSS_p426GPD)的Hspd生产。
我们发现两种HSS的过量表达都使得能够生物合成Hspd。特别地,SvHSS使Hspd滴度能够在40.9 mg/,而BvHSS使Hspd滴度能够在31.1mg/(见图1a和1d)。
随后,我们还通过引入子模块(VII-c)、子模块(VII-d)和子模块(VII-e),利用Spd平台(实施例1)用于四胺Spm和Tspm的生产。Spm是在所有后生动物中、在有花植物中以及在酵母中发现的最常见的四胺。特异性氨基丙基转移酶,即精胺合酶(SpmSyn;EC 2.5.1.22)负责Spm生物合成。我们首先探索了酵母天然的SpmSyn Spe4p用于Spm过量生产。
当在JQSPD_AA中过量表达作为高拷贝质粒SPE4_p426GPD (子模块(VII-c))的密码子优化的酵母SPE4[SEQ ID NO: 2]时,获得了53.1 mg/l的Spm(见图1c和1f)。我们还通过在JQSPD_AA菌株中过量表达作为高拷贝质粒AtSPMS_p426GPD(子模块(VII-d))的AtSPMS[SEQ ID NO: 4]测试了来自拟南芥的SpmSyn。这造成了Spm的生产(41.8 mg/l;见图1c和1f)。植物ACL5氨基丙基转移酶(TspmSyn;EC 2.5.1.79)(来自拟南芥)被证明合成Spm的异构体Tspm。因此我们还在JQSPD_AA菌株中过量表达作为高拷贝质粒AtACL5_p426GPD (子模块(VII-e))的AtACL5[SEQ ID NO: 6]。该策略使得能够生产43.8 mg/l的Tspm (见图1b和1e)。所有包含酵母密码子优化的基因的质粒购买自GenScript。在该实施例2中使用了与实施例1中所使用的相同的转化和产物测定。所有质粒在表2中列出并且所有菌株在表3中列出。
图3a说明用于酵母中亚精胺和高级多胺的生物合成的工程化途径。
实施例3:酵母中锥虫胱甘肽的生物合成
酰胺键无疑是自然界中最重要的结构基序之一。大约四分之一的所有市售药物和三分之二的所有药物候选物都携带至少一个酰胺键,并且胺的酰基化是制药工业中实践最广泛的反应之一。多胺存在独特的骨架以连接其他部分,并且经常被并入到特化代谢中,导致具有复杂结构的多样的包含酰胺键的天然产物的生物合成。例如,锥虫胱甘肽(N 1 ,N 10 -双(谷胱甘酰基)亚精胺),T(SH)2)是短膜虫属(Crithidia)、锥虫属(Trypanosoma)和利什曼原虫属(Leishmania)的锥虫(trypanosomatid)中主要的低分子量硫醇,后两者的属包含威胁生命的疾病或致残疾病(诸如非洲昏睡病、黑热病、恰加斯病(Chagas’ disease)和美洲利什曼病或东方疖)的病原体。同时,在植物中可以合成丰富的含有多胺的羟基肉桂酰胺,已假定其涉及花、花粉和种子的发育以及病原体抗性。然而,从传统的合成化学或自天然来源的提取均难以获得它们。首先,这些多胺缀合物在自然界中表现出低丰度(Li等人2018)。另一方面,多年来建立用于直接酰胺化反应的有效催化系统仍然是有机化学中难以解决的挑战(Wang 2019)。在该实施例3中,我们通过引入模块(VIII)在多胺缀合物的生物合成中利用了来自实施例1的多胺平台。
我们首先着手酵母中[T(SH)2]的异源生物合成。[T(SH)2]是致病的锥虫中主要的氧化还原介体,其在法式短膜虫(Crithidia fasciculate)中由两个相异的酶(即谷胱甘酰基亚精胺合酶(GspS;EC 6.3.1.8)和锥虫胱甘肽合酶(TryS;EC 6.3.1.9))从谷胱甘肽(GSH)和亚精胺(Spd)逐步合成,然而在布氏锥虫指名亚种中两个步骤都由罕见的具有宽广的底物特异性的TryS (EC 6.3.1.9)催化。为了探索TbbTryS用于[T(SH)2]微生物生产,设计以在酵母中合成[T(SH)2]的遗传子模块(VIII-a)编码布氏锥虫指名亚种TryS(TbbTryS)的表达。子模块作为从GenScript订购的包含酵母密码子优化的TbbTryS基因[SEQ ID NO: 35]的高拷贝质粒TbbTryS_p426GPD引入至Spd平台菌株JQSPD_AA。按照与实施例1中所描述的相同的程序构建转化实验。所得菌株JQSPD_AA (TbbTryS _p426GPD)用于与实施例1中所描述的相同的程序的基于24孔深孔的发酵。通过取0.1 ml的液体培养物制备发酵样品。发酵样品经过热水(HW)提取,在我们的方法中我们使用发酵培养基提取。将包含0.9 ml的发酵培养基的管在水浴中100°C下预热10 min。然后,将热发酵培养基迅速地倾倒至0.1 ml的液体培养物上;立即涡旋混合物,并且将样品放置于水浴中。30 min后,将每个管放置于冰上5 min。离心后,上清液直接用于检测。
多胺缀合物的检测通过在连接到Orbitrap Fusion Mass Spectrometer (ThermoFisher Scientific, San Jose, CA)的Dionex UltiMate 3000 UHPLC (FisherScientific, San Jose, CA)上的液相色谱-质谱(LC-MS)测量来进行。该系统使用保持在35°C下的Agilent Zorbax Eclipse Plus C18 2.1 x 100 mm, 1.8 µm柱。流速为0.350mL/min,用0.1%甲酸(A)和在乙腈中的0.1%甲酸(B)作为流动相。梯度开始为5% B 1min并且然后接着线性梯度至95% B直到5 min。该溶剂组合物持续1.5 min,其后将它改变至5% B并且持续直到8 min。样品(5 µl)传递至装备有正离子或正离子模式的加热电喷雾离子化源(HESI)的MS,其中鞘气设置为50(a.u.),辅助气设置为10(a.u.)并且吹扫气设置为1(a.u.)。锥温和探针温度分别为325°C和380°C,并且喷雾电压为3500 V。扫描范围为80 Da至500 Da并且扫描之间的时间为50 ms。符合TbbTrs催化两个GSH部分到亚精胺上的性质,该酶在来自实施例1的Spd平台JQSPD_AA中的过量表达造成了[T(SH)2]的生产。特别地,检测到m/z值对应于722.2977[M + H]+或361.6524[M + H]2+的新的单个LC-MS峰,指示[T(SH)2]的存在(见图2a和2b)。所有质粒在表2中列出并且所有菌株在表3中列出。
图3b说明用于酵母中谷胱甘肽-多胺缀合物的生物合成的工程化途径。
以上所述的实施方案应理解为本发明的一些说明性示例。本领域的技术人员将理解可以在不背离本发明范围的情况下对实施方案进行各种修饰、组合和改变。特别地,在技术上可能的情况下,可以将不同实施方案中的不同部分技术方案合并成其他配置。然而,本发明的范围由所附权利要求定义。
参考文献
Bienz, S., Detterbeck R., Ensch C., Guggisberg A., Häusermann U.,Meisterhans C., Wendt B., Werner C. & Hesse M. Putrescine, spermidine,spermine, and related polyamine alkaloids. Alkaloids Chem Biol. 58, 83-338(2002).
Kim, S.-K., Jin, Y.-S., Choi, I.-G., Park, Y.-C. & Seo, J.-H.Enhanced tolerance of Saccharomyces cerevisiae to multiple lignocellulose-derived inhibitors through modulation of spermidine contents. MetabolicEngineering 29, 46-55, doi:http://dx.doi.org/10.1016/j.ymben.2015.02.004(2015).
Li, S., Li, Y. & Smolke, C. D. Strategies for microbial synthesis ofhigh-value phytochemicals. Nat Chem 10, 395-404, doi:10.1038/s41557-018-0013-z (2018).
Mans, R., van Rossum, H. M., Wijsman, M., Backx, A., Kuijpers, N. G.,van den Broek, M., Daran-Lapujade, P., Pronk, J. T., van Maris, A. J. &Daran, J. M. CRISPR/Cas9: a molecular Swiss army knife for simultaneousintroduction of multiple genetic modifications in Saccharomyces cerevisiae.FEMS Yeast Res. 15, fov004-fov004, doi:10.1093/femsyr/fov004 (2015).
Michael, A. J. Polyamines in Eukaryotes, Bacteria, and Archaea. J.Biol. Chem. 291, 14896-14903, doi:10.1074/jbc.R116.734780 (2016).
Nielsen, J. Yeast cell factories on the horizon. Science 349, 1050-1051, doi:10.1126/science.aad2081 (2015).
Nielsen, J. Cell factory engineering for improved production ofnatural products. Natural Product Reports, doi:10.1039/C9NP00005D (2019).
Nielsen, J. & Keasling, Jay D. Engineering Cellular Metabolism. Cell164, 1185-1197, doi:https://doi.org/10.1016/j.cell.2016.02.004 (2016).
Ober, D. & Hartmann, T. Homospermidine synthase, the first pathway-specific enzyme of pyrrolizidine alkaloid biosynthesis, evolved fromdeoxyhypusine synthase. Proceedings of the National Academy of Sciences 96,14777-14782, doi:10.1073/pnas.96.26.14777 (1999).
Verduyn, C., Postma, E., Scheffers, W.A., Van Dijken, J.P. Effect ofbenzoic acid on metabolic fluxes in yeasts: A continuous-culture study on theregulation of respiration and alcoholic fermentation. Yeast 8, 501–517, doi:10.1002/yea.320080703 (1992).
Wang, X. Challenges and outlook for catalytic direct amidationreactions. Nature Catalysis 2, 98-102, doi:10.1038/s41929-018-0215-1 (2019).
Yu, T. Zhou, Y. J., Huang, M., Liu, Q., Pereira, R., David, F., &Nielsen, J. Reprogramming Yeast Metabolism from Alcoholic Fermentation toLipogenesis. Cell 174, 1549-1558. e1514, doi:https://doi.org/10.1016/j.cell.2018.07.013 (2018).
Zhou, Y. J., Buijs, N. A., Zhu, Z., Qin, J., Siewers, V. & Nielsen,J. Production of fatty acid-derived oleochemicals and biofuels by syntheticyeast cell factories. Nat. Commun 7, 11709, doi:10.1038/ncomms11709, https://www.nature.com/articles/ncomms11709#supplementary-information (2016)。
<110> Chrysea Limited
<120> 生产多胺缀合物的酵母
<130> HSJ103132P.WOP
<150> SE 1951232-6
<151> 2019-10-28
<160> 260
<170> PatentIn version 3.5
<210> 1
<211> 300
<212> PRT
<213> 酿酒酵母(Saccharomyces cerevisiae)
<400> 1
Met Val Asn Asn Ser Gln His Ser Tyr Ile Lys Asp Gly Trp Phe Arg
1 5 10 15
Glu Ile Asn Asp Lys Ser Phe Pro Gly Gln Ala Phe Thr Met Thr Val
20 25 30
Asp Ser Ile Leu Tyr Glu Ala Arg Ser Glu Phe Gln Asp Ile Leu Ile
35 40 45
Phe Arg Asn Lys Val Tyr Gly Thr Val Leu Val Leu Asp Gly Ile Val
50 55 60
Gln Cys Thr Glu Phe Asp Glu Phe Ala Tyr Gln Glu Met Ile Thr His
65 70 75 80
Ile Ala Met Phe Ala His Ser Asn Pro Lys Arg Val Leu Ile Ile Gly
85 90 95
Gly Gly Asp Gly Gly Val Leu Arg Glu Val Ala Lys His Ser Cys Val
100 105 110
Glu Asp Ile Thr Met Val Glu Ile Asp Ser Ser Val Ile Glu Leu Ser
115 120 125
Arg Lys Phe Leu Pro Thr Leu Ser Asn Gly Ala Phe Asp Asp Glu Arg
130 135 140
Leu Asp Leu Lys Leu Cys Asp Gly Phe Lys Phe Leu Gln Asp Ile Gly
145 150 155 160
Ala Ser Asp Val His Lys Lys Phe Asp Val Ile Ile Thr Asp Ser Ser
165 170 175
Asp Pro Glu Gly Pro Ala Glu Ala Phe Phe Gln Glu Arg Tyr Phe Glu
180 185 190
Leu Leu Lys Asp Ala Leu Asn Pro Asn Gly Val Val Ile Met Gln Ser
195 200 205
Ser Glu Asn Phe Trp Leu Asn Leu Lys Tyr Leu His Asp Leu Lys Asn
210 215 220
Thr Ala Lys Lys Val Phe Pro Asn Thr Glu Tyr Cys Tyr Thr Met Val
225 230 235 240
Pro Thr Tyr Thr Ser Gly Gln Leu Gly Leu Ile Val Cys Ser Asn Asn
245 250 255
Ala Asn Ile Pro Leu Asn Ile Pro Gln Arg Lys Ile Ser Glu Gln Glu
260 265 270
Gln Gly Lys Leu Lys Tyr Tyr Asn Pro Gln Ile His Ser Ser Ala Phe
275 280 285
Val Leu Pro Thr Trp Ala Asp Lys Val Ile Asn Glu
290 295 300
<210> 2
<211> 904
<212> DNA
<213> 酿酒酵母
<400> 2
atggttaaca actctcaaca tccatacatc aaggatggtt ggttcagaga aattaatgat 60
aagtcattcc caggtcaagc ttttactatg acagttgatt ctatcttgta cgaagcaaga 120
tcagaatttc aagatatctt gatttttaga aataaggttt acggtactgt tttggttttg 180
gatggtatcg ttcaatgtac agaatttgat gaatttgctt accaagaaat gatcactcat 240
atcgctatgt tcgcacattc taacccaaaa agagttttga tcattggtgg tggtgacggt 300
ggtgttttga gagaagttgc aaagcattca tgtgttgaag atatcactat ggttgaaatc 360
gattcttcag ttattgaatt gtctagaaag ttcttaccaa cattgtcaaa tggtgctttc 420
gatgatgaaa gattggattt gaagttgtgt gatggtttta aattcttgca agatatcggt 480
gcatctgatg ttcataagaa attcgatgtt atcatcactg attcttcaga tccagaaggt 540
ccagctgaag ctttctttca agaaagatac ttcgaattgt tgaaggatgc tttgaaccca 600
aacggtgttg ttattatgca atcttcagaa aacttctggt tgaatttgaa gtatttgcat 660
gatttgaaaa atacagctaa gaaagttttc ccaaacactg aatactgtta cacaatggtt 720
ccaacttaca catctggtca attaggtttg atcgtttgtt caaacaacgc taacatccca 780
ttgaacatcc cacaaagaaa aatttctgaa caagaacagg gtaaattgaa gtactacaac 840
ccacaaatcc attcttcagc ttttgttttg ccaacatggg cagataaagt tattaatgaa 900
taag 904
<210> 3
<211> 359
<212> PRT
<213> 拟南芥(Arabidopsis thaliana)
<400> 3
Met Glu Gly Asp Val Gly Lys Gly Leu Val Cys Gln Asn Thr Met Asp
1 5 10 15
Gly Lys Ala Ser Asn Gly Asn Gly Leu Glu Lys Thr Val Pro Ser Cys
20 25 30
Cys Leu Lys Ala Met Ala Cys Val Pro Glu Asp Asp Ala Lys Cys His
35 40 45
Ser Thr Val Val Ser Gly Trp Phe Ser Glu Pro His Pro Arg Ser Gly
50 55 60
Lys Lys Gly Gly Lys Ala Val Tyr Phe Asn Asn Pro Met Trp Pro Gly
65 70 75 80
Glu Ala His Ser Leu Lys Val Glu Lys Val Leu Phe Lys Asp Lys Ser
85 90 95
Asp Phe Gln Glu Val Leu Val Phe Glu Ser Ala Thr Tyr Gly Lys Val
100 105 110
Leu Val Leu Asp Gly Ile Val Gln Leu Thr Glu Lys Asp Glu Cys Ala
115 120 125
Tyr Gln Glu Met Ile Ala His Leu Pro Leu Cys Ser Ile Ser Ser Pro
130 135 140
Lys Asn Val Leu Val Val Gly Gly Gly Asp Gly Gly Val Leu Arg Glu
145 150 155 160
Ile Ser Arg His Ser Ser Val Glu Val Ile Asp Ile Cys Glu Ile Asp
165 170 175
Lys Met Val Ile Asp Val Ser Lys Lys Phe Phe Pro Glu Leu Ala Val
180 185 190
Gly Phe Asp Asp Pro Arg Val Gln Leu His Ile Gly Asp Ala Ala Glu
195 200 205
Phe Leu Arg Lys Ser Pro Glu Gly Lys Tyr Asp Ala Ile Ile Val Asp
210 215 220
Ser Ser Asp Pro Val Gly Pro Ala Leu Ala Leu Val Glu Lys Pro Phe
225 230 235 240
Phe Glu Thr Leu Ala Arg Ala Leu Lys Pro Gly Gly Val Leu Cys Asn
245 250 255
Met Ala Glu Ser Met Trp Leu His Thr His Leu Ile Glu Asp Met Ile
260 265 270
Ser Ile Cys Arg Gln Thr Phe Lys Ser Val His Tyr Ala Trp Ser Ser
275 280 285
Val Pro Thr Tyr Pro Ser Gly Val Ile Gly Phe Val Leu Cys Ser Thr
290 295 300
Glu Gly Pro Ala Val Asp Phe Lys Asn Pro Ile Asn Pro Ile Glu Lys
305 310 315 320
Leu Asp Gly Ala Met Thr His Lys Arg Glu Leu Lys Phe Tyr Asn Ser
325 330 335
Asp Met His Arg Ala Ala Phe Ala Leu Pro Thr Phe Leu Arg Arg Glu
340 345 350
Val Ala Ser Leu Leu Ala Ser
355
<210> 4
<211> 1080
<212> DNA
<213> 拟南芥
<400> 4
atggaaggtg acgttggtaa aggtttggtt tgtcaaaata ctatggatgg taaagcttca 60
aatggtaatg gtttggaaaa gactgttcca tcttgttgtt taaaagctat ggcatgtgtt 120
ccagaagatg atgcaaaatg tcattctact gttgtttcag gttggttttc tgaaccacat 180
ccaagatcag gtaaaaaggg tggtaaagct gtttacttca acaacccaat gtggccaggt 240
gaagcacatt ctttgaaggt tgaaaaggtt ttgtttaaag acaagtcaga ttttcaagaa 300
gttttggttt tcgaatctgc tacttacggt aaagttttgg ttttggatgg tatcgttcaa 360
ttgacagaaa aggatgaatg tgcttaccaa gaaatgattg cacatttgcc attgtgttct 420
atctcttcac ctaaaaatgt tttggttgtt ggtggtggtg acggtggtgt tttgagagaa 480
atctcaagac attcttcagt tgaagttatt gatatctgtg aaatcgataa gatggttatt 540
gatgtttcta agaaattttt cccagaatta gctgttggtt ttgatgatcc aagagttcaa 600
ttgcatattg gtgacgctgc agaatttttg agaaagtcac cagagggtaa atacgatgca 660
atcatcgttg attcttcaga tccagttggt ccagctttgg cattggttga aaagccattt 720
ttcgaaacat tggctagagc attaaaacca ggtggtgttt tgtgtaatat ggctgaatct 780
atgtggttgc atactcattt gatcgaagat atgatctcaa tctgtagaca aacttttaaa 840
tctgttcatt acgcatggtc ttcagttcca acttacccat caggtgttat tggtttcgtt 900
ttgtgttcta cagaaggtcc agctgttgat ttcaagaacc caattaatcc aatcgaaaaa 960
ttggatggtg caatgactca taagagagaa ttgaagttct acaattctga tatgcataga 1020
gctgcatttg ctttgccaac atttttaaga agagaagttg cttcattgtt agcatcttaa 1080
<210> 5
<211> 339
<212> PRT
<213> 拟南芥
<400> 5
Met Gly Glu Ala Val Glu Val Met Phe Gly Asn Gly Phe Pro Glu Ile
1 5 10 15
His Lys Ala Thr Ser Pro Thr Gln Thr Leu His Ser Asn Gln Gln Asp
20 25 30
Cys His Trp Tyr Glu Glu Thr Ile Asp Asp Asp Leu Lys Trp Ser Phe
35 40 45
Ala Leu Asn Ser Val Leu His Gln Gly Thr Ser Glu Tyr Gln Asp Ile
50 55 60
Ala Leu Leu Asp Thr Lys Arg Phe Gly Lys Val Leu Val Ile Asp Gly
65 70 75 80
Lys Met Gln Ser Ala Glu Arg Asp Glu Phe Ile Tyr His Glu Cys Leu
85 90 95
Ile His Pro Ala Leu Leu Phe His Pro Asn Pro Lys Thr Val Phe Ile
100 105 110
Met Gly Gly Gly Glu Gly Ser Ala Ala Arg Glu Ile Leu Lys His Thr
115 120 125
Thr Ile Glu Lys Val Val Met Cys Asp Ile Asp Gln Glu Val Val Asp
130 135 140
Phe Cys Arg Arg Phe Leu Thr Val Asn Ser Asp Ala Phe Cys Asn Lys
145 150 155 160
Lys Leu Glu Leu Val Ile Lys Asp Ala Lys Ala Glu Leu Glu Lys Arg
165 170 175
Glu Glu Lys Phe Asp Ile Ile Val Gly Asp Leu Ala Asp Pro Val Glu
180 185 190
Gly Gly Pro Cys Tyr Gln Leu Tyr Thr Lys Ser Phe Tyr Gln Asn Ile
195 200 205
Leu Lys Pro Lys Leu Ser Pro Asn Gly Ile Phe Val Thr Gln Ala Gly
210 215 220
Pro Ala Gly Ile Phe Thr His Lys Glu Val Phe Thr Ser Ile Tyr Asn
225 230 235 240
Thr Met Lys Gln Val Phe Lys Tyr Val Lys Ala Tyr Thr Ala His Val
245 250 255
Pro Ser Phe Ala Asp Thr Trp Gly Trp Val Met Ala Ser Asp His Glu
260 265 270
Phe Asp Val Glu Val Asp Glu Met Asp Arg Arg Ile Glu Glu Arg Val
275 280 285
Asn Gly Glu Leu Met Tyr Leu Asn Ala Pro Ser Phe Val Ser Ala Ala
290 295 300
Thr Leu Asn Lys Thr Ile Ser Leu Ala Leu Glu Lys Glu Thr Glu Val
305 310 315 320
Tyr Ser Glu Glu Asn Ala Arg Phe Ile His Gly His Gly Val Ala Tyr
325 330 335
Arg His Ile
<210> 6
<211> 1020
<212> DNA
<213> 拟南芥
<400> 6
atgggtgaag cagtagaagt aatgttcggt aacggtttcc cagaaatctt aaaagccaca 60
agtccaactc aaaccttgca ctccaatcaa caagattgtc attggtacga agaaactatc 120
gatgatgatt tgaagtggtc tttcgcttta aattctgttt tgcatcaagg tacttctgaa 180
taccaagata tcgcattgtt ggatacaaag agattcggta aagttttggt tattgatggt 240
aaaatgcaat cagctgaaag agatgagttt atatatcatg aatgtttgat ccatccagca 300
ttgttgttcc atccaaaccc aaagactgtt tttattatgg gtggtggtga aggttctgct 360
gcaagagaaa tcttgaagca tactacaatc gaaaaagttg ttatgtgtga tatcgatcaa 420
gaagttgttg atttctgtag aagatttttg acagttaatt cagatgcttt ctgtaataag 480
aaattggaat tagttattaa agatgctaag gcagaattgg aaaagagaga agaaaagttc 540
gatattattg ttggtgactt ggctgatcca gttgaaggtg gtccatgtta tcaattgtac 600
actaagtctt tctaccaaaa cattttgaaa ccaaaattat caccaaatgg tatttttgtt 660
actcaagctg gtccagcagg tatttttaca cataaagaag tttttacttc tatctataac 720
acaatgaagc aagtttttaa atacgttaaa gcttacactg cacatgttcc atcttttgct 780
gatacatggg gttgggttat ggcatcagat catgaatttg atgttgaagt tgatgaaatg 840
gatagaagaa tcgaagaaag agttaacggt gaattgatgt acttaaatgc tccatctttt 900
gtttcagctg caactttgaa taagacaatc tcattggcat tggaaaagga aacagaagtt 960
tactccgaag aaaatgctag attcatccac ggtcacggtg ttgcctacag acatatctaa 1020
<210> 7
<211> 370
<212> PRT
<213> 春千里光(Senecio vernalis)
<400> 7
Met Ala Glu Ser Asn Lys Glu Ala Ile Asp Ser Ala Arg Ser Asn Val
1 5 10 15
Phe Lys Glu Ser Glu Ser Leu Glu Gly Thr Cys Ala Lys Ile Gly Gly
20 25 30
Tyr Asp Phe Asn Asn Gly Ile Asp His Ser Lys Leu Leu Lys Ser Met
35 40 45
Val Ser Thr Gly Phe Gln Ala Ser Asn Leu Gly Asp Ala Met Ile Ile
50 55 60
Thr Asn Gln Met Leu Asp Trp Arg Leu Ser His Asp Glu Val Pro Glu
65 70 75 80
Asn Cys Ser Glu Glu Glu Lys Lys Asn Arg Glu Ser Val Lys Cys Lys
85 90 95
Ile Phe Leu Gly Phe Thr Ser Asn Leu Ile Ser Ser Gly Val Arg Glu
100 105 110
Thr Ile Cys Tyr Leu Thr Gln His Arg Met Val Asp Val Leu Val Thr
115 120 125
Thr Thr Gly Gly Ile Glu Glu Asp Phe Ile Lys Cys Leu Ala Ser Thr
130 135 140
Tyr Lys Gly Lys Phe Ser Leu Pro Gly Ala Asp Leu Arg Ser Lys Gly
145 150 155 160
Leu Asn Arg Ile Gly Asn Leu Ile Val Pro Asn Asp Asn Tyr Ile Lys
165 170 175
Phe Glu Asp Trp Ile Ile Pro Ile Phe Asp Gln Met Leu Ile Glu Gln
180 185 190
Lys Thr Gln Asn Val Leu Trp Thr Pro Ser Arg Met Ile Ala Arg Leu
195 200 205
Gly Lys Glu Ile Asn Asn Glu Thr Ser Tyr Leu Tyr Trp Ala Tyr Lys
210 215 220
Asn Asn Ile Pro Val Phe Cys Pro Ser Ile Thr Asp Gly Ser Ile Gly
225 230 235 240
Asp Met Leu Tyr Phe His Ser Val Ser Asn Pro Gly Pro Gly Leu Val
245 250 255
Val Asp Ile Val Gln Asp Val Ile Ala Met Asp Asn Glu Ala Val His
260 265 270
Ala Ser Pro Gln Lys Thr Gly Ile Ile Ile Leu Gly Gly Gly Leu Pro
275 280 285
Lys His His Ile Cys Asn Ala Asn Met Met Arg Asn Gly Ala Asp Phe
290 295 300
Ala Val Phe Ile Asn Thr Ala Gln Glu Tyr Asp Gly Ser Asp Ser Gly
305 310 315 320
Ala Arg Pro Asp Glu Ala Val Ser Trp Gly Lys Ile Ser Ser Thr Gly
325 330 335
Lys Ala Val Lys Val His Cys Asp Ala Thr Ile Ala Phe Pro Leu Leu
340 345 350
Val Ala Glu Thr Phe Ala Val Lys Lys Glu Lys Ala Ser Lys Val Asn
355 360 365
Gly Phe
370
<210> 8
<211> 1113
<212> DNA
<213> 春千里光
<400> 8
atggctgaat ctaataagga agctatcgat tctgcaagat caaacgtttt taaagaatct 60
gaatcattag aaggtacatg tgcaaagatc ggtggttacg atttcaacaa cggtatcgat 120
cattcaaagt tgttgaagtc tatggtttca acaggtttcc aagcttctaa tttgggtgac 180
gcaatgatca tcactaacca aatgttggat tggagattat ctcatgatga agttccagaa 240
aactgttcag aagaagaaaa gaaaaataga gaatctgtta agtgtaagat tttcttgggt 300
tttacatcaa atttgatctc ttcaggtgtt agagaaacaa tctgttattt gactcaacat 360
agaatggttg atgttttggt tactacaact ggtggtatcg aagaagattt catcaagtgt 420
ttagcttcta cttacaaggg taaattttca ttgccaggtg cagatttgag atctaagggt 480
ttgaacagaa ttggtaattt gatcgttcca aacgataact acatcaagtt cgaagattgg 540
attattccaa tttttgatca aatgttaatt gaacaaaaga ctcaaaatgt tttgtggact 600
ccatcaagaa tgattgctag attgggtaaa gaaattaata acgaaacatc ttatttgtac 660
tgggcataca agaacaacat cccagttttc tgtccatcta ttactgatgg ttcaattggt 720
gacatgttgt acttccattc tgtttcaaac ccaggtccag gtttggttgt tgatatcgtt 780
caagatgtta tcgctatgga taatgaagct gttcatgcat ctccacaaaa gactggtatc 840
atcatcttgg gtggtggttt accaaagcat catatctgta acgctaacat gatgagaaac 900
ggtgctgatt tcgcagtttt tattaacaca gcacaagaat acgatggttc tgattcaggt 960
gctagaccag atgaagcagt ttcatggggt aaaatctctt caactggtaa agctgttaaa 1020
gttcattgtg atgctacaat tgcatttcca ttgttagttg ctgaaacttt cgcagttaag 1080
aaagaaaagg cttctaaagt taatggtttt taa 1113
<210> 9
<211> 280
<212> PRT
<213> 绿色芽生绿菌(Blastochloris viridis)
<400> 9
Met Thr Asp Trp Pro Val Tyr His Arg Ile Asp Gly Pro Ile Val Met
1 5 10 15
Ile Gly Phe Gly Ser Ile Gly Arg Gly Thr Leu Pro Leu Ile Glu Arg
20 25 30
His Phe Ala Phe Asp Arg Ser Lys Leu Val Val Ile Asp Pro Ser Asp
35 40 45
Glu Ala Arg Lys Leu Ala Glu Ala Arg Gly Val Arg Phe Ile Gln Gln
50 55 60
Ala Val Thr Arg Asp Asn Tyr Arg Asp Leu Leu Val Pro Leu Leu Thr
65 70 75 80
Ala Gly Pro Gly Gln Gly Phe Cys Val Asn Leu Ser Val Asp Thr Ser
85 90 95
Ser Leu Asp Ile Met Glu Leu Ala Arg Glu Asn Gly Ala Leu Tyr Ile
100 105 110
Asp Thr Val Val Glu Pro Trp Leu Gly Phe Tyr Phe Asp Pro Asp Leu
115 120 125
Lys Pro Glu Ala Arg Ser Asn Tyr Ala Leu Arg Glu Thr Val Leu Ala
130 135 140
Ala Arg Arg Asn Lys Pro Gly Gly Thr Thr Ala Val Ser Cys Cys Gly
145 150 155 160
Ala Asn Pro Gly Met Val Ser Trp Phe Val Lys Gln Ala Leu Val Asn
165 170 175
Leu Ala Ala Asp Leu Gly Val Thr Arg Glu Glu Pro Thr Thr Arg Glu
180 185 190
Glu Trp Ala Arg Leu Ala Met Asp Leu Gly Val Lys Gly Ile His Ile
195 200 205
Ala Glu Arg Asp Thr Gln Arg Ala Asn Phe Pro Lys Pro Phe Asp Val
210 215 220
Phe Val Asn Thr Trp Ser Val Glu Gly Phe Val Ser Glu Gly Leu Gln
225 230 235 240
Pro Ala Glu Leu Gly Trp Gly Thr Phe Glu Arg Trp Met Pro Asp Asn
245 250 255
Ala Arg Gly His Asp Ser Gly Cys Gly Ala Gly Ile Tyr Leu Leu Gln
260 265 270
Pro Gly Ala Asn Thr Arg Val Arg
275 280
<210> 10
<211> 1434
<212> DNA
<213> 绿色芽生绿菌
<400> 10
atgacagatt ggccagttta ccatagaatc gatggtccaa tcgttatgat tggttttggt 60
tctattggta gaggtacttt gccattgatc gaaagacatt tcgcattcga tagatctaag 120
ttggttgtta ttgatccatc agatgaagct agaaaattgg ctgaagcaag aggtgttaga 180
ttcattcaac aagcagttac aagagataac tacagagatt tgttggttcc attgttaact 240
gctggtccag gtcaaggttt ctgtgttaat ttgtctgttg atacatcttc attggatatc 300
atggaattgg ctagagaaaa tggtgcattg tatattgata ctgttgttga accatggttg 360
ggtttctact tcgatccaga tttgaagcca gaagctagat caaactacgc attgagagaa 420
acagttttag ctgcaagaag aaataagcca ggtggtacta cagctgtttc ttgttgtggt 480
gcaaatcctg gtatggtttc atggttcgtt aagcaagctt tggttaattt ggctgcagat 540
ttgggtgtta ctagagaaga accaactaca agagaagaat gggctagatt ggcaatggat 600
ttgggtgtta agggtattca tatcgctgaa agagatacac aaagagcaaa cttcccaaag 660
ccattcgatg ttttcgttaa cacttggtct gttgaaggtt ttgtttcaga aggtttgcaa 720
ccagctgaat taggttgggg tacatttgaa agatggatgc cagataatgc tagaggtcat 780
gattctggtt gtggtgcagg tatatatttg ttacaaccag gtgctaatac tagagttaga 840
tcatggactc caacagctac tgcacaatac ggtttcttgg ttacacataa cgaatctatc 900
tcaatcgcag atttcttgac tgttagagat gctgcaggtc aagctgttta tagaccaaca 960
tgtcattatg cttaccatcc atgtaacgat gcagttttgt ctttgcatga aatgtttggt 1020
tctggtaaaa gacaatcaga ttggttgatc ttggatgaaa ctgaaatcgt tgatggtatc 1080
gatgaattgg gtgttttgtt gtacggtcat ggtaaaaatg cttattggta cggttctcaa 1140
ttgtcaatcg aagaaacaag aagaattgct ccagatcaaa atgcaactgg tttgcaagtt 1200
tcttcagctg ttttagctgg tatggtttgg gctttggaaa atccaaaagc tggtattgtt 1260
gaagcagatg atttggatta cagaagatgt ttggaagttc aaacaccata tttgggtcca 1320
gttgttggtg tttacacaga ttggactcca ttggcaggta gaccaggttt atttccagaa 1380
gatattgatg cttcagatcc atggcaattc agaaatgttt tggttagaga ttaa 1434
<210> 11
<211> 1365
<212> DNA
<213> 酿酒酵母
<400> 11
atgtcagagc cagaatttca acaagcttac gaagaagttg tctcctcttt ggaagactct 60
actcttttcg aacaacaccc agaatacaga aaggttttgc caattgtttc tgttccagaa 120
agaatcatac aattcagagt cacctgggaa aatgacaagg gtgaacaaga agttgctcaa 180
ggttacagag tgcaatataa ctccgccaag ggtccataca agggtggtct acgtttccat 240
ccttccgtga acttgtctat cttgaaattc ttgggtttcg aacaaatctt caagaactcc 300
ttgaccggcc tagacatggg tggtggtaaa ggtggtctat gtgtggactt gaagggaaga 360
tctaataacg aaatcagaag aatctgttat gctttcatga gagaattgag cagacacatt 420
ggtcaagaca ctgacgtgcc agctggtgat atcggtgttg gtggtcgtga aattggttac 480
ctgttcggtg cttacagatc atacaagaac tcctgggaag gtgtcttaac cggtaagggt 540
ttgaactggg gtggttcttt gatcagacca gaagccactg gttacggttt agtttactat 600
actcaagcta tgatcgacta tgccacaaac ggtaaggaat ctttcgaagg taagcgcgtc 660
accatctctg gtagtggtaa cgttgctcaa tacgctgcct tgaaggttat tgagctaggt 720
ggtactgtcg tttccctatc tgactccaag ggttgtatca tctctgaaac tggtatcacc 780
tccgaacaag tcgctgatat ttccagtgct aaggtcaact tcaagtcctt ggaacaaatc 840
gtcaacgaat actctacttt ctccgaaaac aaagtgcaat acattgctgg tgctcgtcca 900
tggacccacg tccaaaaggt cgacattgct ttgccatgtg ccacccaaaa tgaagtcagc 960
ggtgaagaag ccaaggcctt ggttgctcaa ggtgtcaagt ttattgccga aggttccaac 1020
atgggttcca ctccagaagc tattgccgtc tttgaaactg ctcgttccac cgccactgga 1080
ccaagcgaag ctgtttggta cggtccacca aaggctgcta acttgggtgg tgttgctgtt 1140
tctggtttag aaatggcaca aaactctcaa agaatcacat ggactagcga aagagttgac 1200
caagagttga agagaattat gatcaactgt ttcaatgaat gtatcgacta tgccaagaag 1260
tacactaagg acggtaaggt cttgccatct ttggtcaaag gtgctaatat cgcaagtttc 1320
atcaaggtct ctgatgctat gtttgaccaa ggtgatgtat tttaa 1365
<210> 12
<211> 2709
<212> DNA
<213> 酿酒酵母
<400> 12
atggagcaaa tcaattcgaa cagtagaaaa aagaagcaac aattggaagt attcaaatat 60
tttgcaagtg tccttacaaa agaggacaag cctattagta tcagtaatgg tatgttagat 120
atgccgacag tgaactccag taaactcaca gcaggaaatg ggaaacctga cacggagaag 180
cttacaggag aactaatttt aacatacgac gatttcattg aactgatatc tagctcaaag 240
actatttatt cgaagtttac ggaccattcg ttcaatttga accagatacc caagaacgtt 300
ttcgggtgta ttttcttcgc tattgatgaa caaaacaagg gatatctgac gcttaatgat 360
tggttttatt ttaataattt attagaatat gataattatc atctcattat tctatatgag 420
ttctttagga aatttgatgt agagaatttg aaggcaaaac aaaaaaaaga gcttggtagt 480
tcgtcgttta atttaaaggc tgcagatgat cgaattaagt caattaatta tggtaacaga 540
tttctaagct ttgatgatct tcttttgaat ctgaaccaat tcaaagatac tatccgcctg 600
ttgcacgaat ctattgatga taattttgtt aaagataaca aattactact tgattggaat 660
gactttcgat ttctgaaatt ttacaaatgt tatcatgaaa atgaagagta tttgagttta 720
aactctctgg tcacgatttt acaaaatgat cttaagaatg aaaaaatatt tataggtttt 780
gataggttgg cacagatgga ctcacaaggg catcgtttag ccctaagcaa aaatcaactc 840
acctatcttc taaggttatt ttactctcac agggtgtctg cagatatatt ttcctccttg 900
aatctatcaa acaccgaatt actaaaagcg gacaataatt ccattccgta caatgtattc 960
aaggatatat tttatttatt tcaaaatttt gacctactga accaaatatt tcacaagtat 1020
gttactgaaa ataatttgaa tgagcaggat attagggaac aaatagttac taaaaatgac 1080
tttatgacag ttttaaacgc ccagtataac aaagtaaaca atatcattga gttctctcct 1140
tcccaaatca acctactatt ttctatcgtc gcaaattcaa aggaaaacag aagattaaga 1200
aagagaaatc aagatcgaga tgacgagcta ttaaatgatc accattatga ttcagatatt 1260
gattttttta tccataatga gtatttgcat ggagtaagca gatccagaaa aaatttagaa 1320
agttttaatg actattatca tgatctctcg gatggatttg accaagactc tggtgttaaa 1380
aaagcttcaa aagcgagtac tggcttgttt gaatctgtat ttggaggtaa aaaagataaa 1440
gcaacgatgc gttctgactt aacaattgaa gatttcatga aaattttgaa cccaaattac 1500
ctgaacgact tagttcacca aatggaattg caaaaaaatc aaaatgagtc attgtatatt 1560
aattactact tttatccaat tttcgattcg ttgtacaatt tctccttggg ttctattgcg 1620
ggttgtattg gtgcaactgt agtataccca atagacttta taaaaacaag gatgcaagcc 1680
caaagatctt tagcccaata caaaaactca attgattgtt tgttgaagat tatatcccgc 1740
gaaggaataa aaggtctcta ctctggctta gggccacaat taataggagt tgctcctgaa 1800
aaggcgataa aattgactgt caatgatttt atgagaaaca ggttgactga taaaaacggc 1860
aagctaagcc tttttcctga aattatttct ggcgcttcag ctggtgcatg tcaagttata 1920
tttactaatc cgttagagat tgtaaaaatt aggctacagg tccaatccga ctatgttggt 1980
gaaaacatac aacaagccaa tgaaactgcc actcaaatag tcaaaaaatt aggactgagg 2040
ggcttgtaca atggtgtagc cgcatgttta atgagagatg ttccattctc tgctatttat 2100
tttcccactt atgcacattt aaaaaaagat ctctttgatt ttgatccaaa tgataaaaca 2160
aagaggaatc gattaaaaac atgggagctt ttaactgccg gtgccattgc tggtatgcca 2220
gctgccttct tgactactcc ttttgatgtt ataaaaacaa ggctccagat agatcctcga 2280
aaaggtgaga caaagtataa cggtatattt catgctatcc gaactatctt aaaggaagag 2340
agctttagaa gctttttcaa aggtggtgga gcccgtgtcc taagaagttc tccccaattt 2400
gggttcactc tggccgccta tgaattattc aagggcttta ttccctcccc cgataacaaa 2460
ttaaaaagca gagagggtag gaagagattt tgtatcgatg acgacgcagg caatgaagag 2520
acagtagttc atagtaacgg tgaactccca cagcaaaagt tttactctga tgatagaaaa 2580
catgccaatt attactataa aagctgtcaa attgcgaaaa cattcattga tttggacaat 2640
aacttttcta ggtttgactc ttcagtttat aaaaactttc aagagcacct aagaagcatt 2700
aacgggtga 2709
<210> 13
<211> 879
<212> DNA
<213> 酿酒酵母
<400> 13
atggaggaca gtaaaaagaa aggattaata gaaggcgcta tactcgatat aataaacggt 60
tccattgcag gcgcctgtgg taaggtgatc gagtttcctt tcgatactgt gaaagtcagg 120
ttgcaaacac aagcatccaa cgtgttccca acaacatggt cttgtataaa atttacttac 180
caaaatgaag gaatagcacg agggtttttt caaggcattg cttcaccttt agttggagca 240
tgtctggaga acgcgacatt atttgtgtct tataaccaat gttctaaatt tttagaaaaa 300
catacaaacg tttccccgtt ggggcaaatc ctgatctctg gtggagtagc gggttcatgt 360
gctagtttag ttttgacacc cgtggagctg gtgaagtgta agttgcaggt tgcgaactta 420
caagttgcat cagctaaaac gaaacataca aaggtgttgc ctacaataaa agcaattata 480
actgagagag gattggcagg attgtggcaa gggcaatcgg gcacttttat tcgagaaagc 540
ttcggtggtg ttgcctggtt tgcaacctac gaaatagtta agaagtcgtt gaaagatagg 600
cactcccttg atgacccaaa aagagatgaa agtaagatat gggaactact tattagtgga 660
gggagcgctg gattggcatt caacgccagt atttttcctg cggatactgt gaaatcagta 720
atgcaaactg aacatataag cctcaccaat gcggtgaaga agatatttgg caaatttgga 780
ctaaagggtt tttatcgagg actgggtata acccttttta gggcagtacc agcaaacgct 840
gcagtttttt acatctttga gactctttct gcactttaa 879
<210> 14
<211> 1332
<212> DNA
<213> 大肠杆菌(Escherichia coli)
<400> 14
atggtaaagg aacgtaaaac cgagttggtc gagggattcc gccattcggt tccctatatc 60
aatacccacc ggggaaaaac gtttgtcatc atgctcggcg gtgaagccat tgagcatgag 120
aatttctcca gtatcgttaa tgatatcggg ttgttgcaca gcctcggcat ccgtctggtg 180
gtggtctatg gcgcacgtcc gcagatcgac gcaaatctgg ctgcgcatca ccacgaaccg 240
ctgtatcaca agaatatacg tgtgaccgac gccaaaacac tggaactggt gaagcaggct 300
gcgggaacat tgcaactgga tattactgct cgcctgtcga tgagtctcaa taacacgccg 360
ctgcagggcg cgcatatcaa cgtcgtcagt ggcaatttta ttattgccca gccgctgggc 420
gtcgatgacg gcgtggatta ctgccatagc gggcgtatcc ggcggattga tgaagacgcg 480
atccatcgtc aactggacag cggtgcaata gtgctaatgg ggccggtcgc tgtttcagtc 540
actggcgaga gctttaacct gacctcggaa gagattgcca ctcaactggc catcaaactg 600
aaagctgaaa agatgattgg tttttgctct tcccagggcg tcactaatga cgacggtgat 660
attgtctccg aacttttccc taacgaagcg caagcgcggg tagaagccca ggaagagaaa 720
ggcgattaca actccggtac ggtgcgcttt ttgcgtggcg cagtgaaagc ctgccgcagc 780
ggcgtgcgtc gctgtcattt aatcagttat caggaagatg gcgcgctgtt gcaagagttg 840
ttctcacgcg acggtatcgg tacgcagatt gtgatggaaa gcgccgagca gattcgtcgc 900
gcaacaatca acgatattgg cggtattctg gagttgattc gcccactgga gcagcaaggt 960
attctggtac gccgttctcg cgagcagctg gagatggaaa tcgacaaatt caccattatt 1020
cagcgcgata acacgactat tgcctgcgcc gcgctctatc cgttcccgga agagaagatt 1080
ggggaaatgg cctgtgtggc agttcacccg gattaccgca gttcatcaag gggtgaagtt 1140
ctgctggaac gcattgccgc tcaggcgaag cagagcggct taagcaaatt gtttgtgctg 1200
accacgcgca gtattcactg gttccaggaa cgtggattta ccccagtgga tattgattta 1260
ctgcccgaga gcaaaaagca gttgtacaac taccagcgta aatccaaagt gttgatggcg 1320
gatttagggt aa 1332
<210> 15
<211> 777
<212> DNA
<213> 大肠杆菌
<400> 15
atgatgaatc cattaattat caaactgggc ggcgtactgc tggatagtga agaggcgctg 60
gaacgtctgt ttagcgcact ggtgaattat cgtgagtcac atcagcgtcc gctggtgatt 120
gtgcacggcg gcggttgcgt ggtggatgag ctgatgaaag ggctgaatct gccggtgaaa 180
aagaaaaacg gcctgcgggt gacgcctgct gatcagatag acattatcac cggagcactg 240
gcgggaacgg caaataaaac cctgttggca tgggcgaaga aacatcagat tgcggccgta 300
ggtttgtttc tcggtgacgg cgacagcgtc aaagtgaccc agcttgatga agagttaggt 360
catgttggac tggcgcagcc aggttcgcct aagcttatca actccttgct ggagaacggt 420
tatctgccgg tggtcagctc cattggcgta acagacgaag ggcaactgat gaacgtcaat 480
gccgaccagg cggcaacggc gctggcggca acgctgggcg cggatctgat tttgctctcc 540
gacgtcagcg gcattctcga cggcaaaggg caacgcattg ccgaaatgac cgccgcgaaa 600
gcagaacaac tgattgagca gggcattatt actgacggca tgatagtgaa agtgaacgcg 660
gcgctggatg cggcccgcac gctgggccgt ccggtagata tcgcctcctg gcgtcatgcg 720
gagcagcttc cggcactgtt taacggtatg ccgatgggta cgcggatttt agcttaa 777
<210> 16
<211> 1074
<212> DNA
<213> 谷氨酸棒状杆菌(Corynebacterium glutamicum)
<400> 16
atgataatgc acaatgtcta tggtgttaca atgactatta aggtcgcaat cgcaggtgcc 60
tcaggttacg caggtggtga aatcttgaga ttgttattgg gtcatccagc atatgcctct 120
ggtgaattag aaataggtgc attgaccgct gcatccactg ccggtagtac attgggtgaa 180
ttgatgccac atattcctca attagctgat agagttatac aagacactac agctgaaaca 240
ttggcaggtc atgatgttgt ctttttaggt ttgccacacg gtttctcagc agaaatagcc 300
ttacaattgg gtcctgatgt cacagtaatc gattgtgccg ctgactttag attacaaaat 360
gcagccgact gggaaaaatt ctatggttcc gaacatcaag gtacctggcc atacggtatt 420
ccagaaatgc ctggtcacag agaagccttg agaggtgcta agagagttgc agtcccaggt 480
tgctttccta caggtgctac cttagcatta ttgccagccg ttcaagctgg tttgatcgaa 540
cctgatgtat ctgtagtttc aattaccggt gtttccggtg caggtaaaaa ggctagtgtt 600
gccttattgg gttctgaaac tatgggttca ttgaaggcat acaacacctc aggtaaacat 660
agacacactc cagaaatcgc tcaaaacttg ggtgaagttt ctgacaaacc agtaaaggtt 720
tcattcacac ctgttttagc tccattgcct agaggtattt taaccactgc tacagcacct 780
ttgaaagaag gtgtcaccgc cgaacaagcc agagctgttt acgaagaatt ctacgctcaa 840
gaaactttcg tccatgtatt accagaaggt gcccaacctc aaacacaagc tgttttgggt 900
tccaacatgt gtcacgttca agtcgaaatt gatgaagaag ctggtaaagt attggttact 960
agtgcaatcg acaatttgac taagggtaca gcaggtgctg cagttcaatg catgaactta 1020
tctgtcggtt ttgatgaagc cgctggtttg ccacaagtcg gtgtagctcc ttaa 1074
<210> 17
<211> 1176
<212> DNA
<213> 谷氨酸棒状杆菌
<400> 17
atgtctacat tggaaacctg gcctcaagtc atcatcaaca catacggtac tcctcctgtc 60
gaattggtct ctggtaaagg tgctacagta accgatgacc agggtaacgt ttacatcgat 120
ttgttggctg gtatagcagt taacgccttg ggtcatgctc acccagcaat aatcgaagct 180
gtaactaacc aaataggtca attgggtcat gtttctaact tatttgcatc aagacctgtt 240
gtcgaagttg ccgaagaatt aattaagaga ttctctttgg atgacgcaac attagctgca 300
caaaccagag ttttcttttg taattcaggt gcagaagcca acgaagccgc ttttaaaatc 360
gctagattga caggtagatc cagaatttta gcagccgttc atggtttcca cggtagaacc 420
atgggtagtt tggcattaac tggtcaacca gataagagag aagcattttt gccaatgcct 480
tccggtgttg aattctatcc ttacggtgac actgactatt tgagaaaaat ggtcgaaacc 540
aatccaactg atgtagctgc aatcttttta gaacctattc aaggtgaaac aggtgtagtt 600
ccagcccctg aaggtttctt gaaggctgtt agagaattgt gtgatgaata cggtatcttg 660
atgatcactg acgaagtaca aacaggtgtt ggtagaaccg gtgacttttt cgcacatcaa 720
cacgatggtg tcgtaccaga cgttgtcact atggctaaag gtttgggtgg tggtttacct 780
attggtgcct gcttggctac aggtagagcc gctgaattaa tgaccccagg taaacatggt 840
actacatttg gtggtaaccc tgttgcttgt gcagccgcta aagcagtctt gtcagtagtt 900
gatgacgcat tttgcgccga agttgctaga aagggtgaat tattcaagga attgttggct 960
aaggttgatg gtgtcgtaga cgtcagaggt agaggtttga tgttaggtgt tgtcttggaa 1020
agagatgtcg caaagcaagc cgtattggac ggttttaaac acggtgttat tttaaatgct 1080
ccagcagata acatcattag attgactcca cctttagtca taacagatga agaaattgcc 1140
gacgctgtta aagcaattgc cgaaacaata gcttaa 1176
<210> 18
<211> 1167
<212> DNA
<213> 谷氨酸棒状杆菌
<400> 18
atggccgaaa aaggtataac agctccaaaa ggtttcgttg cctctgctac tacagccggt 60
atcaaggctt caggtaatcc agatatggca ttggttgtca accaaggtcc tgaattttct 120
gctgcagccg ttttcactag aaatagagtc tttgctgcac ctgttaaagt ctctagagaa 180
aacgttgctg atggtcaaat tagagctgtc ttgtataatg ctggtaatgc aaacgcctgt 240
aacggtttac aaggtgaaaa ggatgcaaga gaatccgtaa gtcatttggc ccaaaatttg 300
ggtttagaag attccgacat cggtgtttgc agtacaggtt tgattggtga attgttgcca 360
atggataagt tgaacgctgg tatcgaccaa ttgaccgccg aaggtgcttt aggtgacaac 420
ggtgccgctg cagccaaagc tatcatgacc actgataccg ttgacaagga aactgtagtt 480
tttgcagatg gttggacagt aggtggtatg ggtaaaggtg ttggtatgat ggcaccttca 540
ttggccacca tgttagtatg tttaacaacc gatgcctccg ttactcaaga aatggctcaa 600
attgctttgg caaatgccac cgctgtcact ttcgacacat tagatataga cggttctaca 660
tcaaccaacg atactgtttt cttgttagca tctggtgcct caggtatcac tccaacacaa 720
gatgaattga atgacgctgt ttacgctgca tgctctgata ttgccgctaa attacaagca 780
gacgccgaag gtgttacaaa gagagtagca gttaccgtcg taggtactac aaataacgaa 840
caagctatta atgcagccag aacagttgca agagataact tgtttaaatg tgccatgttc 900
ggttctgacc caaattgggg tagagtctta gctgcagttg gtatggctga tgcagacatg 960
gaacctgaaa agatatccgt ctttttcaac ggtcaagctg tatgcttgga tagtactggt 1020
gctcctggtg caagagaagt cgacttgtct ggtgctgata ttgacgttag aatagatttg 1080
ggtacttcag gtgaaggtca agcaacagtt agaaccactg atttgtcctt tagttacgtc 1140
gaaattaatt ccgcttactc ttcataa 1167
<210> 19
<211> 1017
<212> DNA
<213> 酿酒酵母
<400> 19
atgtcaacca cagcatccac gccttcatct ttacgtcatt tgatttctat aaaagatctt 60
tctgatgaag aattcagaat cttagtacaa agagctcaac atttcaagaa tgtttttaaa 120
gcaaataaaa cgaatgattt ccaatccaac catctgaaac tattgggtag aactatagcc 180
ttaatattta ctaaaagatc aactagaacg agaatttcga ccgaaggtgc agccaccttc 240
tttggtgccc aaccgatgtt tttaggtaaa gaggatattc agcttggtgt caatgaatca 300
ttttacgata ccaccaaggt tgtatcatct atggtttcat gtatttttgc ccgtgtgaac 360
aaacatgaag acatacttgc tttttgcaag gattcctctg taccgatcat caactctcta 420
tgtgacaaat tccacccttt gcaagcaatt tgtgatcttt taacaataat cgaaaacttc 480
aatatatctc tagatgaagt aaataaggga atcaattcaa aattgaagat ggcatggatt 540
ggtgatgcca ataatgtcat aaatgatatg tgcatcgcat gtctgaaatt cggtataagt 600
gtcagtattt ccactccccc cggtattgaa atggattccg atattgtcga tgaagcaaag 660
aaagttgctg agagaaacgg tgcgacattt gaattaacac acgactcttt aaaggcctcc 720
accaatgcca atatattagt aaccgatact ttcgtttcca tgggtgaaga atttgcgaaa 780
caggccaagc tgaaacaatt caaaggtttt caaatcaatc aagaacttgt ctctgtggct 840
gatccaaact acaaatttat gcattgtctg ccaagacatc aagaagaagt tagtgatgat 900
gtcttttatg gagagcattc catagtcttt gaagaagcag aaaacagatt atatgcagct 960
atgtctgcca ttgatatctt tgttaataat aaaggtaatt tcaaggactt gaaataa 1017
<210> 20
<211> 1275
<212> DNA
<213> 酿酒酵母
<400> 20
atgtccgaag ctaccctctc ctccaagcaa accattgaat gggaaaacaa atactccgcc 60
cacaactacc accccttgcc cgtcgttttt cacaaggcta agggcgcaca tgtgtgggac 120
ccggagggta agctgtacct cgacttcctg agcgcttatt ctgccgtcaa ccagggccat 180
tgccatcctc acatcatcaa ggctttgacg gagcaagcac aaacactaac attgtcctcc 240
agagcgttcc acaacgatgt ttacgcgcaa ttcgccaagt tcgtgaccga attcttcggg 300
ttcgaaaccg ttttgcccat gaacaccggt gcagaagccg tggaaactgc tttgaagttg 360
gccagaagat gggggtacat gaagaagaac atccctcaag ataaagccat cattctgggt 420
gccgagggta acttccacgg gagaaccttc ggtgctatca gtttgagtac cgactacgag 480
gactccaagt tgcatttcgg gcctttcgtg cctaacgttg ccagtggtca ctccgtgcac 540
aagatcagat acggccacgc agaagatttc gtccctatct tggaatctcc tgaaggtaag 600
aacgttgccg ccatcattct agagccaatt cagggtgaag ccggtatcgt cgtgcccccc 660
gcagactact tcccaaaggt ctccgcatta tgccgtaagc acaacgtcct attgatcgtt 720
gacgaaattc aaaccggtat cggtagaacc ggtgagttgc tttgctacga ccactacaag 780
gcagaggcca agcctgatat tgttttgtta ggtaaggctc tctcaggtgg tgttcttccc 840
gtctcatgtg ttctgtcttc tcacgacatc atgtcttgct ttaccccagg atctcacggt 900
tctactttcg gcggtaatcc tttggcttcc cgcgttgcca tcgccgccct cgaggtcatc 960
cgcgacgaga agctgtgcca aagagccgcc caactgggta gctctttcat cgcccaattg 1020
aaagctctcc aagccaaatc taacggtata atctctgagg tgcgtggtat gggactgctt 1080
accgccatcg taatcgaccc atccaaggcc aatggtaaga ccgcttggga cttgtgtcta 1140
ttgatgaagg atcacggcct cttggctaag cccacccacg accacatcat cagattggct 1200
cctcctttgg tcatctccga agaggacttg caaaccggtg tcgaaaccat tgccaagtgt 1260
atcgatctgt tataa 1275
<210> 21
<211> 1401
<212> DNA
<213> 酿酒酵母
<400> 21
atgtctagta ctcaagtagg aaatgctcta tctagttcca ctactacttt agtggacttg 60
tctaattcta cggttaccca aaagaagcaa tattataaag atggcgagac gctgcacaat 120
cttttgcttg aactaaagaa taaccaagat ttggaacttt taccgcatga acaagcgcat 180
cctaaaatat ttcaagcgct caaggctcgt attggtagaa ttaataatga aacgtgcgac 240
cccggtgagg agaactcgtt tttcatatgc gatttgggag aagtcaagag attattcaac 300
aactgggtga aggagcttcc tagaattaag ccattttatg ccgtcaaatg taatcctgat 360
accaaggttt tgtcattatt agcagagttg ggcgttaatt tcgattgcgc ttccaaagtg 420
gaaattgaca gagtattatc gatgaacatc tcgccggata gaattgttta cgctaatcct 480
tgtaaagtag catctttcat tagatatgca gcttcaaaaa atgtaatgaa gtctactttt 540
gacaatgtag aagaattgca taaaatcaaa aagtttcatc ctgagtctca gttgttatta 600
agaatcgcta ccgatgactc taccgctcaa tgtcgacttt ccaccaaata tggctgtgaa 660
atggaaaacg tagacgtttt attaaaggct ataaaggaac taggtttaaa cctggctggt 720
gtttctttcc acgtcggttc aggcgcttct gattttacaa gcttatacaa agccgttaga 780
gatgcaagaa cggtatttga caaagctgct aacgaatacg ggttgccccc tttgaagatt 840
ttggatgtag gtggtggatt tcaatttgaa tccttcaaag aatcaactgc tgttttgcgt 900
ctagcgctag aggaattttt ccctgtaggt tgtggtgttg atataattgc agagcctggc 960
agatactttg tagctacagc gttcactttg gcatctcatg tgattgcgaa gagaaaactg 1020
tctgagaatg aagcaatgat ttacactaac gatggtgtat acgggaacat gaattgtatt 1080
ttattcgatc atcaagagcc ccatccaaga accctttatc ataatttgga atttcattac 1140
gacgattttg aatccactac tgcggtcctc gactctatca acaaaacaag atctgagtat 1200
ccatataaag tttccatctg gggacccaca tgtgatggtt tggattgtat tgccaaagag 1260
tattacatga agcatgatgt tatagtcggt gattggtttt attttcctgc cctgggtgcc 1320
tacacatcat cggcggctac tcaattcaac ggctttgagc agactgcgga tatagtatac 1380
atagactctg aactcgattg a 1401
<210> 22
<211> 879
<212> DNA
<213> 酿酒酵母
<400> 22
atgtatgaag taatacagaa aaggaaaaca aaaataataa acgttttaca gagtcctgaa 60
ctcatgaggc tcatagagga cccatcaaat ctgggtattt ctttacattt tccagtaagt 120
tcactgctaa aaagtaataa gtgcacacca atgcctaaac tttctacgta tagtttggct 180
agtgggggat ttaaggattg gtgcgcggac atccctctag acgttccacc agagattgat 240
atcatcgatt tttactggga tgttatttta tgcatggaat ctcaattcat attagattac 300
aatgttccgt caaaaaataa ggggaacaat cagaagtctg ttgctaagct gttgaaaaat 360
aagcttgtaa acgatatgaa aactacgtta aaaagactaa tttataatga aaataccaag 420
caatataaaa ataataatag ccacgatggt tacaattgga gaaaactagg ctcgcagtat 480
ttcatactgt atcttcccct atttacgcag gaactgattt ggtgtaaact taatgaaaac 540
tatttccatg ttgtattacc atctttactg aatagtagga acgttcatga taaccacagt 600
acctatataa ataaagattg gttacttgcc cttttagagc taacttccaa cctgaaccaa 660
aacttcaaat tcgaatacat gaaattgaga ttgtatattt taagagatga tttaattaat 720
aatggtttgg atcttttgaa aaatcttaac tgggtcggtg ggaaactgat taaaaatgaa 780
gatagagaag tcttgttgaa ctcgaccgat ttagctacgg attctatttc tcatttatta 840
ggtgatgaaa actttgttat tttagagttt gaatgctaa 879
<210> 23
<211> 1191
<212> DNA
<213> 酿酒酵母
<400> 23
atgactgtca ccataaaaga attgactaac cacaactaca ttgaccacga actatcagcc 60
actttagact caacggatgc gttcgagggt cccgagaagt tgctggaaat ctggttcttc 120
cctcacaaga agtccatcac gaccgaaaag acattaagaa atattggcat ggatagatgg 180
atcgagattt tgaaattagt gaaatgcgaa gttctttcca tgaagaagac taaagaactg 240
gatgcctttt tgttgagtga gtcttccctc ttcgtcttcg atcacaaatt gacgatgaag 300
acgtgcggta ctacaaccac attgttctgt ctcgaaaagc ttttccagat cgttgagcaa 360
gagttatcgt gggctttccg cacaacacaa gggggcaagt acaaaccatt taaagtgttt 420
tattctagac gatgtttcct tttcccctgt aagcaagccg ctatccatca aaactgggct 480
gacgaagtcg actatttgaa caaatttttc gacaatggta aaagttattc cgtgggaaga 540
aatgacaaga gcaaccactg gaacctgtac gtcaccgaga cggaccgctc cacacctaag 600
ggaaaggagt acatcgagga tgacgacgaa actttcgaag tactgatgac ggagctggac 660
ccagaatgcg ctagtaagtt tgtttgcggg cctgaggcat ccacaaccgc tctcgtggag 720
ccaaacgaag ataagggcca caacctcggc taccaaatga ctaaaaatac aaggcttgac 780
gaaatatatg tcaactcggc ccaagactcc gatttatcat ttcaccacga tgcatttgcg 840
ttcacgccat gtggatactc atccaatatg attctcgctg aaaaatacta ttacaccctg 900
cacgtgactc cggaaaaggg ttggtcttac gcctctttcg aaagtaacat acccgtattt 960
gacatttccc aagggaagca agacaacttg gacgttcttc tacatattct gaacgttttt 1020
caaccaagag agttctcgat gacctttttt accaaaaatt atcagaacca atccttccaa 1080
aaactactaa gcatcaacga gtcactgccc gactacatca agttagacaa aattgtttat 1140
gatctggacg actaccacct tttctatatg aaattgcaga agaaaatatg a 1191
<210> 24
<211> 882
<212> DNA
<213> 酿酒酵母
<400> 24
atggcacaag aaatcactca cccaactatt gtagacggct ggttcagaga aatttctgat 60
accatgtggc caggccaggc catgacttta aaagtggaga aagttttaca ccatgagaag 120
tcaaaatatc aagacgtttt gatcttcaaa tccactacat atggtaatgt tctagtttta 180
gataatgtaa ttcaagccac cgaaagggat gaatttgcct accaagaaat gattgcccat 240
cttgccttga attcccatcc aaatcctaag aaggttcttg ttattggtgg gggtgatggt 300
ggtgttttga gagaggttgt caagcatgat tccgttgagg aagcctggtt atgtgacatt 360
gatgaagctg ttattagact atcaaaggag tacctaccag aaatggctgc ctcttattct 420
cacccaaagg ttaagaccca cattggtgat ggtttccaat ttttaagaga ttaccaaaac 480
acatttgacg taatcattac tgactcttct gacccagaag gtccagctga aacccttttc 540
caaaaggaat atttccaatt gttgaacagt gcgttgacag aaaagggtgt aatcactaca 600
caagcagaaa gtatgtggat tcacttgcca atcattaagg acttaaagaa agcctgttct 660
gaagttttcc cagttgcaga atactctttc gttactattc caacttaccc aactggtacg 720
attggtttta tggtttgctc caaagataaa acttgcaatg tcaagaagcc actacgtgaa 780
atctctgatg agaaggaggc tgaattatac agatactata acaagaaaat tcacgaagct 840
tcctttgttc taccaacctg ggcagccaag gaattaaatt ag 882
<210> 25
<211> 1527
<212> DNA
<213> 酿酒酵母
<400> 25
atgaatacag tttcaccagc caaaaaaaag gttattataa ttggtgccgg tattgctggg 60
cttaaagctg catctacgct acaccaaaac ggtattcaag attgtcttgt tcttgaggcc 120
agagatcggg tcggtggtag gttgcaaact gtcacaggct atcaaggtcg gaaatatgat 180
ataggtgcta gctggcacca tgatacgttg acaaaccctt tatttttgga agaggctcaa 240
ctgagtttga atgatgggag aacgaggttt gtttttgatg acgataattt tatttatatc 300
gacgaagaac gtggaagggt agaccatgac aaggaactgc ttcttgaaat tgtggacaat 360
gaaatgagca aattcgcaga gttagaattc catcaacact taggagtttc agattgctcc 420
ttttttcaat tagtaatgaa atacttacta caaagacgcc aatttctcac aaatgaccaa 480
ataagatatt tgccacaact ctgtcgatat ctggaattgt ggcacggctt agattggaag 540
cttttgagtg ccaaggatac atacttcggt caccaaggaa ggaacgcctt tgctttgaac 600
tatgattctg tggttcaaag aattgctcaa agctttcctc aaaattggtt aaagctaagt 660
tgtgaagtga aatcaattac acgagaacct tcaaaaaatg tgacagtgaa ctgtgaagat 720
ggtactgtgt acaatgctga ttatgttatt attacagtac ctcaaagtgt attgaatttg 780
tctgtacaac ctgaaaaaaa tttacgggga agaatagaat ttcaaccacc cttgaaacca 840
gtgattcaag atgcttttga caagatccat tttggagcgc taggtaaagt aatttttgag 900
tttgaagaat gttgttggtc gaacgaaagt tcaaaaattg taactttggc taactctacc 960
aatgaatttg tcgaaatagt acgtaatgcg gaaaatttag atgaattaga ctctatgcta 1020
gaaagggaag attctcaaaa gcatacgagt gttacttgtt ggagccagcc tttatttttc 1080
gtaaatttgt caaaaagcac aggagtagca agctttatga tgttgatgca ggcaccgctt 1140
acaaatcaca tagaatccat tagagaagat aaagagcgtc tttttagttt tttccaacct 1200
gtgctgaaca agattatgaa gtgtctagat tctgaggatg tcatcgacgg aatgaggccg 1260
atagaaaaca ttgcaaacgc taataaacca gtcttaagaa acatcatcgt tagcaactgg 1320
acacgcgatc cttactcacg cggtgcttat tcggcctgtt ttccaggaga tgatccagtt 1380
gatatggttg ttgcaatgtc taatggtcaa gactcccgca taagatttgc aggcgaacat 1440
actatcatgg acggcgccgg ctgtgcctat ggtgcttggg aaagcggaag acgggaggcg 1500
actcgaatct ctgacttact gaaatag 1527
<210> 26
<211> 1014
<212> DNA
<213> 酿酒酵母
<400> 26
atgaacagga ttaagaatac attttctgtt gctaagagat taaaactaag caaagttatg 60
acgaactcag aattaccgag catattcgaa ggaactgttg atttagggat tattggtggt 120
acaggtttat ataaccttga ctgtctggag cccatcgctt tgcttccacc catggtaaca 180
ccatggggta ccacatcgtc tcctgtcaca atctctcagt tcgtaggaac taacagccac 240
tttcacgttg cgttcatagc cagacacggt attaaccacg aatacccacc cactaaagtc 300
ccatttagag caaacatggc ggccttaaag aacttaaatt gtaaagccgt tctttctttt 360
agtgccgtgg ggtctttaca accccatata aagcctagag attttgtgtt accacagcaa 420
ataatcgaca gaactaaagg cataagacat tcttcatatt tcaacgatga aggcttggta 480
ggtcacgttg gtttcggaca gccgttctct caaaaattcg cagagtatat ctatcaattc 540
aagaacgaga taacaaatcc tgaatccgaa gaaccgtgcc atttgcatta cgacaaggat 600
atgaccgttg tgtgtatgga aggcccacaa ttctccacgc gcgctgaatc caagatgtac 660
agaatgtttg gtggccatgt tattaacatg agtgttattc cagaagccaa attggcgcgt 720
gagtgtgagc tgccttacca gatgatttgt atgtctaccg attacgacgc atggagagat 780
gaggcagaac ctgttaccgt agaaaccgtt attggtaatt tgacgaataa tgggcgcaat 840
gcaaatattt tagcttctaa gatcatcgtc tcaatggcca aggaaatccc agagttcatg 900
catactggcg atgggctgcg cggttccatc aagaaatcta tctctaccaa accagaggct 960
atgtccaagg aaaccttaga aagactaaga tacttatttc caaactattg gtaa 1014
<210> 27
<211> 1131
<212> DNA
<213> 酿酒酵母
<400> 27
atgaccttgg cacccctaga cgcctccaaa gttaagataa ctaccacaca acatgcatct 60
aagccaaaac cgaacagtga gttagtgttt ggcaagagct tcacggacca catgttaact 120
gcggaatgga cagctgaaaa agggtggggt accccagaga ttaaacctta tcaaaatctg 180
tctttagacc cttccgcggt ggttttccat tatgcttttg agctattcga agggatgaag 240
gcttacagaa cggtggacaa caaaattaca atgtttcgtc cagatatgaa tatgaagcgc 300
atgaataagt ctgctcagag aatctgtttg ccaacgttcg acccagaaga gttgattacc 360
ctaattggga aactgatcca gcaagataag tgcttagttc ctgaaggaaa aggttactct 420
ttatatatca ggcctacatt aatcggcact acggccggtt taggggtttc cacgcctgat 480
agagccttgc tatatgtcat ttgctgccct gtgggtcctt attacaaaac tggatttaag 540
gcggtcagac tggaagccac tgattatgcc acaagagctt ggccaggagg ctgtggtgac 600
aagaaactag gtgcaaacta cgccccctgc gtcctgccac aattgcaagc tgcttcaagg 660
ggttaccaac aaaatttatg gctatttggt ccaaataaca acattactga agtcggcacc 720
atgaatgctt ttttcgtgtt taaagatagt aaaacgggca agaaggaact agttactgct 780
ccactagacg gtaccatttt ggaaggtgtt actagggatt ccattttaaa tcttgctaaa 840
gaaagactcg aaccaagtga atggaccatt agtgaacgct acttcactat aggcgaagtt 900
actgagagat ccaagaacgg tgaactactt gaagcctttg gttctggtac tgctgcgatt 960
gtttctccca ttaaggaaat cggctggaaa ggcgaacaaa ttaatattcc gttgttgccc 1020
ggcgaacaaa ccggtccatt ggccaaagaa gttgcacaat ggattaatgg aatccaatat 1080
ggcgagactg agcatggcaa ttggtcaagg gttgttactg atttgaactg a 1131
<210> 28
<211> 564
<212> DNA
<213> 酿酒酵母
<400> 28
atgtctatag caagttatgc ccaagagttg aagttggctt tacatcaata tccaaacttc 60
cctagtgaag gcattctctt cgaagatttc ttacccattt tcaggaaccc aggtcttttc 120
cagaagttga tcgatgcttt caaactgcat ttagaagaag cttttccaga agttaaaatt 180
gattatatcg tcgggttgga atcccgtggg ttcttgttcg gaccaacttt agctttggcc 240
ctaggtgttg gtttcgttcc agtcaggaag gcaggtaagc tacctggcga atgttttaag 300
gctacgtacg aaaaggagta cggttctgat ctttttgaga tacagaaaaa cgctattcca 360
gcaggttcca acgttatcat tgttgatgac attattgcca ctggtggttc tgctgctgca 420
gccggcgaat tagttgaaca actcgaagcc aaccttttgg aatataactt tgttatggag 480
ttggatttct tgaaaggcag gagtaagttg aatgctccag tgttcacttt actgaacgct 540
caaaaggaag cgttgaaaaa atga 564
<210> 29
<211> 1491
<212> DNA
<213> 酿酒酵母
<400> 29
atgtcaatga gtaatattgt tgtttttgga ggggactcgc accccgagtt agttactaag 60
atctgtgaaa atttggacat tcacccatcg aaagtagaat tagggaagtt ttctaatggg 120
gaaacgaaca ttgctcttcg cgaatctgtt cgtgaaaagg atgtatatat catccagagt 180
ggttgtggcc aggtgaacga cacgttcatg cagttgctga ttttaatcag tgcctgcaag 240
tccgcttctg cctcgagggt tacagccgta atgccatatc tctgctactc gagacagcca 300
gatattccat atactgccaa gggtgctccc ataatttcca agcctaaaga aaactatact 360
tttgaatcgc atccaggcac acccgtgtca tcttctttaa tgacgcaaag accaggtgct 420
gagagctcgt tgaagagttt ggatagtgca atacgatcaa ctatcaactt agaaaatcct 480
caacctatca gaacaccaaa cagcagtgct acggcgaata acaatttcga catcaagaag 540
acgctttctt tttcaagaat tcctatgatt cccggtggta agttacaaaa tacaagcaat 600
agcacggacg ctggtgaatt gttcaacgct caaaatgcag gctataagct atgggtagta 660
caagccggta ctttgattgc tcatttgttg agtgctgcag gtgctgacca tgtgatcaca 720
atggatttgc acgatccaca gttccctggg ttttttgaca ttccagtgga taatctctac 780
tgtaaaccca ttgcacaaaa ctacatccag catcgcattc cagattatca ggatgctgtg 840
atcgtttctc cagatgctgg tggtgcaaag agagctacgg ctattgcaga cgccttggaa 900
ttgtccttcg ccctaattca taaagaaaga agatctcagt tattgaaggg ccctccagat 960
gcgacgttaa cctctggtgg ttcgttacca gtatctccaa ggccattagt tactactttg 1020
gtttcctccc aaaatactac ttcttcaggt gccactgggg ttgcggccct tgaaatgaag 1080
aaaacaactt caacatcttc cacctcgtcg caatcttcta attcgtccaa gttcgttcaa 1140
actaccatgc ttgttggcga tgttagaaac aaggtgtgta ttatagtcga cgacttggtg 1200
gatacttcat acactattac aagagctgcg aaattgttga aggatcaagg atctaccaaa 1260
gtttatgcct taataacgca cggtgttttt tccggtgatg cgctagaaag aatcggccaa 1320
agtagtatag ataagttgat catttctaac acggttcctc aagatagaac actacagtac 1380
ctaggtaagg acagagtgga tgttattgat gtctcctgca taatcggtga agcaattaga 1440
agaatccata acggtgaatc catttctatg ttgttcgagc atggatggta g 1491
<210> 30
<211> 1197
<212> DNA
<213> 婴儿利什曼原虫(Leishmania infantum)
<400> 30
gttgtttcta gaaaaacaat gtctgttcac tctatcttgt tctcttctga acacgttact 60
gaaggtcacc cagacaagtt gtgtgaccaa gtttctgacg ctgttttgga cgcttgtttg 120
gctggtgacc cattctctaa ggttgcttgt gaatcttgtg ctaagactgg tatggttatg 180
gttttcggtg aaatcactac taaggctgtt ttggactacc aaaagatcgt tagaaacact 240
atcaaggaca tcggtttcga ctctgctgac aagggtttgg actacgaatc ttgtaacgtt 300
ttggttgcta tcgaacaaca atctccagac atctgtcaag gtttgggtaa cttcgactct 360
gaagacttgg gtgctggtga ccaaggtatg atgttcggtt acgctactga cgaaactgaa 420
actttgatgc cattgactta cgaattggct agaggtttgg ctaagaagta ctctgaattg 480
agaagagacg gttctttgga atgggctaga ccagacgcta agactcaagt tactgttgaa 540
tacgactacg acactagaga aggtaagcaa gttttgactc caaagagagt tgctgttgtt 600
ttgatctctg ctcaacacga cgaacacgtt actaacgaca agatctctgt tgacttgatg 660
gaaaaggtta tcaaggctgt tatcccagct aacatgttgg acgctgaaac taagtactgg 720
ttgaacccat ctggtagatt cgttagaggt ggtccacacg gtgacgctgg tttgactggt 780
agaaagatca tcgttgacac ttacggtggt tggggtgctc acggtggtgg tgctttctct 840
ggtaaggacc catctaaggt tgacagatct gctgcttacg ctgctagatg gatcgctaag 900
tctatcgttg ctggtggttt ggctagaaga tgtttggttc aattggctta cgctatcggt 960
gttgctgaac cattgtctat gcacgttgaa acttacggta ctggtaagta cgacgacgct 1020
aagttgttgg aaatcgttaa gcaaaacttc aagttgagac catacgacat catccaagaa 1080
ttgaacttga gaagaccaat ctactacgaa acttctcgtt tcggtcactt cggtagaaag 1140
gacgaattgg gtactggtgg tttcacttgg gaagttccaa agaagatggt tgaataa 1197
<210> 31
<211> 1044
<212> DNA
<213> 酿酒酵母
<400> 31
atggtttctg tggagttttt acaggagtta ccaaaatgtg agcatcactt gcatttggaa 60
ggtactctag aacctgacct attgttccca ttagctaaaa gaaacgatat aattctacct 120
gaaggttttc ctaaatcggt cgaggaatta aacgaaaagt ataagaagtt tcgtgatctg 180
caggatttct tagattacta ttatattggt actaatgtct tgattagtga acaagatttc 240
tttgatttgg cgtgggccta ttttaaaaaa gttcacaaac aaggcttggt ccatgctgaa 300
gtgttttacg accctcagtc acatacatct aggggcatct ccatagaaac agtcactaaa 360
ggtttccaaa gagcttgtga caaagccttc tctgaatttg gtattacatc caagctaatt 420
atgtgtctgt taagacacat tgaaccagag gaatgtttga aaactatcga agaagctacc 480
ccatttatta aagatggtac tatctctgcc ttaggattag attctgctga gaaaccattt 540
cccccacatt tatttgttga atgttacgga aaggccgcct cattgaataa agatttaaaa 600
ctaactgcac acgcaggtga agaaggcccc gctcaattcg tctcggatgc tttagacttg 660
ttgcaagtaa caagaatcga tcacggtatc aacagtcaat acgacgagga gttattggat 720
aggttgtcgc gcgaccagac catgctaact atttgtcctc tctccaacgt gaagctacaa 780
gtagtccaat ccgtttcaga gttaccacta caaaagtttc ttgacagaga tgttccattt 840
tctttaaatt ctgatgaccc cgcctatttt ggtggttata tcttagatgt ctacactcaa 900
gtttcgaaag atttcccaca ctgggaccat gaaacatggg gtcgtatcgc taagaacgcc 960
attaaaggtt catggtgtga cgataaaaga aagaacggtt tgttaagtag agtggacgaa 1020
gtagtcacta aatattcgca ttag 1044
<210> 32
<211> 1857
<212> DNA
<213> 酿酒酵母
<400> 32
atgccagagt atacgctact ggctgataat ataagggaga atatcgttca tttcgatccg 60
aatggtttgt ttgataactt gcacaccatt gttcatgaag atgacagtca agagaacgag 120
gaggccgagc atttcaatta tgatcaggtg ttggataaat cgttattgtc aagaggttct 180
attgtcggtc tcggtttagg actaatgagt cccgttttag gaatgtgcac tagtatggcc 240
attgggctaa ttaatggtgg tccgttaact ataatgctag gttttttaat cagtggagtg 300
tgtatatggt tttcgtcgct ttctcttggt gagattgttt caaaatttcc gatggaactg 360
catgttggga gtgccatgtt ggccccggag aaattgaaat tagtatgttc gtggtacact 420
ggctggttaa tgctcatagg gaattggact atgagtacca gtattacttt tgcaggcgct 480
caacttacca tttctttgat tctgatgacg aactccaacc taatatccga ggcacacttg 540
attttttaca cagtcattgt attttactta gttgtgactg ttgtaggcct cgtgaatttg 600
aaatttgcaa gatttattga aacaataaac aaagtctgtg tttattggat catatatgcc 660
attatattta ttgatattct tctactagta ttccacaaag gtaaatttcg atctttgaag 720
tacgcgctat ttcactttga taataatcta tcagggtata aaagcgcatt tctttccttc 780
atcattggat tccaacagtc taatttcacg ttacaaggtt tcagtatgtt acctgcttta 840
gctgacgaag tcaaagttcc tgagaaggat attccacgtg gtatgtcgaa tgcggtattg 900
ttatccgcgt tctctggagt catttttctt ataccaataa tgttaatcct gccagataat 960
gatttgcttt ttaccaatca taaggttcta ccaatagtga acatttttac aaaatcgact 1020
gattcggtgg tcttgtcttt ttttttagtg ctcctaattt taggaaactt actgttttcc 1080
ggaattggct cgattactac atcttctcgt gcggtatata gttttagtcg tgaccaggct 1140
ataccatact acgataaatg gacctacgtc gaaccggatt ctcagtcaaa agtccccaag 1200
aattctgttg tattgagtat gataatatca tactttttag gtctgctagc tttgatttca 1260
acggccgcat ttaatgcttt tataggcgct gcagtgctct gtctttgttc tgcgactttc 1320
attccgttag tcttggtgct gtttacgaga agaagagcta tccgaagcgc gccagtaaaa 1380
atcaggtata agtttggttg gttcatcaac attgtttcta ttgtgtggct cttgttatct 1440
atggtttctg tttgcctacc aacgcaagtg cctgtaactt tcaaaacaat gaattatgct 1500
ttaatggtgt acgtattctg cattttagtt atcactggtc tttatttcaa atgggggaag 1560
tataatttta gattaccctt ggcagatgac atcaaggctc caattcccag tgatgcggaa 1620
gaaactgttt ttgaactaga ggatagcaat gttgaacata ctctaaactc gggaaccaca 1680
gtgaaagagt ctgtagaaaa taattctgaa gaaggtttca tcaaggtgca tcctaaaagt 1740
agtacagaaa atccctttga ggaaaatgag gaaaacgtga taaccgatta tggtgatgag 1800
caccatacag cagaacaaga atttgatctt gccgatgatc gtagatatga tatatga 1857
<210> 33
<211> 2088
<212> DNA
<213> 酿酒酵母
<400> 33
atgactgtca ccataaaaga attgactaac cacaactaca ttgaccacga actatcagcc 60
actttagact caacggatgc gttcgagggt cccgagaagt tgctggaaat ctggttcttc 120
cctcacaaga agtccatcac gaccgaaaag acattaagaa atattggcat ggatagatgg 180
atcgagattt tgaaattagt gaaatgcgaa gttctttcca tgaagaagac taaagaactg 240
gatgcctttt tgttgagtga gtcttccctc ttcgtcttcg atcacaaatt gacgatgaag 300
acgtgcggta ctacaaccac attgttctgt ctcgaaaagc ttttccagat cgttgagcaa 360
gagttatcgt gggctttccg cacaacacaa gggggcaagt acaaaccatt taaagtgttt 420
tattctagac gatgtttcct tttcccctgt aagcaagccg ctatccatca aaactgggct 480
gacgaagtcg actatttgaa caaatttttc gacaatggta aaagttattc cgtgggaaga 540
aatgacaaga gcaaccactg gaacctgtac gtcaccgaga cggaccgctc cacacctaag 600
ggaaaggagt acatcgagga tgacgacgaa actttcgaag tactgatgac ggagctggac 660
ccagaatgcg ctagtaagtt tgtttgcggg cctgaggcat ccacaaccgc tctcgtggag 720
ccaaacgaag ataagggcca caacctcggc taccaaatga ctaaaaatac aaggcttgac 780
gaaatatatg tcaactcggc ccaagactcc gatttatcat ttcaccacga tgcatttgcg 840
ttcacgccat gtggatactc atccaatatg attctcgctg aaaaatacta ttacaccctg 900
cacgtgactc cggaaaaggg ttggtcttac gcctctttcg aaagtaacat acccgtattt 960
gacatttccc aagggaagca agacaacttg gacgttcttc tacatattct gaacgttttt 1020
caaccaagag agttctcgat gacctttttt accaaaaatt atcagaacca atccttccaa 1080
aaactactaa gcatcaacga gtcactgccc gactacatca agttagacaa aattgtttat 1140
gatctggacg actaccacct tttctatatg aaattgcaga agaaaatagg atctggttct 1200
ggttctatgg cacaagaaat cactcaccca actattgtag acggctggtt cagagaaatt 1260
tctgatacca tgtggccagg ccaggccatg actttaaaag tggagaaagt tttacaccat 1320
gagaagtcaa aatatcaaga cgttttgatc ttcaaatcca ctacatatgg taatgttcta 1380
gttttagata atgtaattca agccaccgaa agggatgaat ttgcctacca agaaatgatt 1440
gcccatcttg ccttgaattc ccatccaaat cctaagaagg ttcttgttat tggtgggggt 1500
gatggtggtg ttttgagaga ggttgtcaag catgattccg ttgaggaagc ctggttatgt 1560
gacattgatg aagctgttat tagactatca aaggagtacc taccagaaat ggctgcctct 1620
tattctcacc caaaggttaa gacccacatt ggtgatggtt tccaattttt aagagattac 1680
caaaacacat ttgacgtaat cattactgac tcttctgacc cagaaggtcc agctgaaacc 1740
cttttccaaa aggaatattt ccaattgttg aacagtgcgt tgacagaaaa gggtgtaatc 1800
actacacaag cagaaagtat gtggattcac ttgccaatca ttaaggactt aaagaaagcc 1860
tgttctgaag ttttcccagt tgcagaatac tctttcgtta ctattccaac ttacccaact 1920
ggtacgattg gttttatggt ttgctccaaa gataaaactt gcaatgtcaa gaagccacta 1980
cgtgaaatct ctgatgagaa ggaggctgaa ttatacagat actataacaa gaaaattcac 2040
gaagcttcct ttgttctacc aacctgggca gccaaggaat taaattag 2088
<210> 34
<211> 627
<212> PRT
<213> 布氏锥虫(Trypanosoma brucei)
<400> 34
Met Thr Lys Ser Ala Leu Ala Asp Thr Lys Glu Glu Pro His Val Pro
1 5 10 15
Phe Gly Glu Ile Gln Gly Tyr Thr Pro Cys Gly Val Pro Ala Tyr Ser
20 25 30
Asn Gly His Asp Gly Phe Phe Ser Gly Glu Arg Ser Ile Asp Gly Asn
35 40 45
Leu Phe Cys Gly Phe Lys Tyr Gln Cys Val Glu Phe Ala Arg Arg Trp
50 55 60
Leu Tyr Glu Ala Lys Gly Leu Val Leu Pro Asp Val Asn Trp Ala Ala
65 70 75 80
His Ile Phe Asp Leu Thr Glu Val His Asp Ala Ser Thr Ala Thr Pro
85 90 95
Val Pro Cys Val Lys Val Ser Asn Gly Thr Ala Ala Lys Pro Val Ala
100 105 110
Asp Ser Leu Leu Ile Tyr Ala Val Asn Glu Asp Ala Pro Trp Gly His
115 120 125
Val Ala Val Ile Thr Glu Val Gly Asp Lys Trp Val Arg Ile Ala Asp
130 135 140
Gln Asn His Arg Phe His Lys Trp Lys Gly Thr Tyr Ser Ala Glu Leu
145 150 155 160
Leu Leu Lys His Glu Gly Gly Val Trp Thr Val Glu Asp His Ala Ala
165 170 175
Glu Gly Ile Phe Val Pro Leu Gly Trp Val Thr Phe Pro Ser Arg Pro
180 185 190
Asn Arg Asn Pro Lys Glu Pro Leu Val Leu His Glu Ser Leu Tyr Phe
195 200 205
Lys Gln Pro Glu Lys Pro Phe Leu Arg Arg Val Val Phe Thr Pro Glu
210 215 220
Asn Arg Lys Thr Asp Trp Leu Asp Leu Thr Asn Glu Ala Glu Ala Glu
225 230 235 240
Phe Tyr Lys Thr Phe Gly Lys Glu Ala Thr Arg Gly Gly Val Tyr Glu
245 250 255
Ser Cys Tyr Tyr Leu Met Asn Arg Glu Leu Tyr Leu Asn Cys Val Arg
260 265 270
Tyr Gly Thr Gln Leu His Ser Phe Phe Leu Glu Ala Thr Lys Gln Val
275 280 285
Leu Glu Ser Asp Asp Lys Leu Arg Arg Phe Arg Ile Pro Glu Glu Tyr
290 295 300
Trp Pro Arg Ile Arg His Ser Trp Lys Thr Gln Pro His Ala Ile Thr
305 310 315 320
Gly Arg Phe Asp Phe Val Phe Asp Glu Asn Thr Gln Glu Phe Lys Cys
325 330 335
Phe Glu Tyr Asn Ala Asp Ser Ala Ser Thr Leu Leu Glu Cys Ala Val
340 345 350
Ile Gln Glu Lys Trp Ala Asn Ser Val Gly Leu Asp Asp Asn Ala Thr
355 360 365
Arg Ser Ser Gly Lys Phe Met Pro Gln Thr Leu Val Arg Ala Trp Glu
370 375 380
Met Thr Gly Leu Lys Gly Arg Val His Phe Leu Val Asp Asp Asp Gly
385 390 395 400
Glu Glu Arg Tyr Thr Ala Leu Tyr Val Met Glu Lys Ala Arg Glu Ala
405 410 415
Gly Ile Asp Ala Lys Leu Cys Val Met Phe Asp Glu Phe His Phe Asp
420 425 430
Glu Lys Gly Ala Val Val Asp Ser Asp Gly Ile Pro Ala Thr Ala Val
435 440 445
Trp Lys Thr Trp Met Trp Glu Thr Ala Ile Ser Asp His Gln Ala Ala
450 455 460
Arg Glu Gln Arg Gly Ala Glu Trp Lys Pro Thr Pro Lys Asp Lys Val
465 470 475 480
Arg Leu Cys Asp Ile Leu Leu Gly Asn Asn Trp Asp Val Arg Val Phe
485 490 495
Glu Pro Met Trp Lys Leu Ile Pro Ser Asn Lys Ala Ile Leu Pro Ile
500 505 510
Ile Tyr Asn Asn His Pro Asp His Pro Ala Ile Leu Pro Ala Ser Tyr
515 520 525
Glu Leu Thr Asp Glu Leu Arg Arg Thr Gly Tyr Ala Lys Lys Pro Ile
530 535 540
Val Gly Arg Val Gly Arg Asn Val Thr Val Thr Glu Pro Asp Gly Lys
545 550 555 560
Val Leu Ala Glu Ser Asp Gly Asn Phe Ser Asn Arg Asp Met Val Tyr
565 570 575
Gln Gln Leu Phe Arg Ile Pro Lys Arg Gly Asp Tyr Tyr Ala Ile Leu
580 585 590
Gly Gly Trp Met Leu Gly Asp Thr Tyr Ser Gly Thr Gly Val Arg Glu
595 600 605
Asp Lys Lys Leu Thr Thr Gly Leu Glu Ser Pro Phe Gly Pro Val Arg
610 615 620
Ile Gln Met
625
<210> 35
<211> 1884
<212> DNA
<213> 布氏锥虫
<400> 35
atgactaaat cagcattagc agatacaaaa gaagaacctc acgtcccatt cggtgaaata 60
caaggttata ccccatgtgg tgtcccagct tattcaaatg gtcatgatgg tttcttttct 120
ggtgaaagat caatcgatgg taatttgttt tgtggtttta aataccaatg tgttgaattt 180
gctagaagat ggttgtacga agcaaaaggt ttagttttgc cagatgttaa ttgggctgca 240
catatcttcg atttgactga agttcatgat gcttctactg caacaccagt tccatgtgtt 300
aaagtttcaa atggtacagc tgcaaaacca gttgctgatt ctttgttgat ctatgctgtt 360
aacgaagatg caccatgggg tcatgttgct gttattactg aagttggtga caaatgggtt 420
agaatcgcag atcaaaacca tagattccat aagtggaagg gtacttactc agcagaattg 480
ttattgaaac atgaaggtgg tgtttggaca gttgaagatc atgctgcaga aggtattttt 540
gttccattag gttgggttac atttccatca agaccaaaca gaaacccaaa ggaaccattg 600
gttttgcatg aatctttgta cttcaagcaa ccagaaaagc catttttgag aagagttgtt 660
tttacaccag aaaacagaaa gactgattgg ttggatttga caaacgaagc tgaagcagaa 720
ttttacaaga ctttcggtaa agaagctaca agaggtggtg tttacgaatc ttgttactac 780
ttgatgaaca gagaattgta tttgaattgt gttagatacg gtactcaatt gcattctttc 840
tttttggaag ctacaaagca agttttggaa tctgatgata agttgagaag atttcgtatt 900
ccagaagaat attggccaag aattagacat tcatggaaaa ctcaaccaca tgcaattaca 960
ggtagattcg atttcgtttt cgatgaaaac actcaagagt ttaaatgttt tgaatacaat 1020
gctgattctg catcaacatt gttggaatgt gctgttattc aagaaaaatg ggcaaattct 1080
gttggtttag atgataatgc tactagatct tctggtaaat tcatgccaca aactttggtt 1140
agagcatggg aaatgacagg tttaaaaggt agagttcatt tcttggttga tgatgatggt 1200
gaagaaagat acacagcttt gtacgttatg gaaaaagcta gagaagcagg tatcgatgca 1260
aagttgtgtg ttatgttcga tgaatttcat ttcgatgaaa aaggtgctgt tgttgattca 1320
gatggtattc cagctactgc agtttggaaa acatggatgt gggaaactgc tatttctgat 1380
catcaagctg caagagaaca aagaggtgca gaatggaagc caactccaaa ggataaagtt 1440
agattgtgtg atatcttgtt gggtaacaac tgggatgtta gagttttcga accaatgtgg 1500
aagttgatcc catcaaataa ggctatcttg ccaatcatct ataacaacca tccagatcat 1560
ccagctattt tgccagcatc ttacgaattg actgatgaat tgagaagaac aggttacgct 1620
aagaaaccaa ttgttggtag agttggtaga aatgttactg ttacagaacc agatggtaaa 1680
gttttggcag aatctgatgg taacttctca aacagagata tggtttacca acaattgttt 1740
agaataccaa aacgtggtga ctattacgct attttaggtg gttggatgtt gggtgacact 1800
tactctggta caggtgttag agaagataag aagttgacta caggtttaga atcccctttc 1860
ggtccagtta gaatccaaat gtga 1884
<210> 36
<211> 27
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 36
gtatgaagta atacagaaaa ggaaaac 27
<210> 37
<211> 46
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 37
gacctgcagc gtacgaagct tcagcaatct ctggtggaac gtctag 46
<210> 38
<211> 24
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 38
ctgaagcttc gtacgctgca ggtc 24
<210> 39
<211> 24
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 39
ggccactagt ggatctgata tcac 24
<210> 40
<211> 52
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 40
gtgatatcag atccactagt ggcccctgaa ccaaaacttc aaattcgaat ac 52
<210> 41
<211> 25
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 41
gcattcaaac tctaaaataa caaag 25
<210> 42
<211> 48
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 42
gtgatatcag atccactagt ggccatagct tcaaaatgtt tctactcc 48
<210> 43
<211> 28
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 43
tttgtaatta aaacttagat tagattgc 28
<210> 44
<211> 54
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 44
gcaatctaat ctaagtttta attacaaaat gtctagtact caagtaggaa atgc 54
<210> 45
<211> 51
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 45
gtgctagtgt ctcccgtctt ctgttcaatc gagttcagag tctatgtata c 51
<210> 46
<211> 24
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 46
acagaagacg ggagacacta gcac 24
<210> 47
<211> 26
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 47
attttcaaca tcgtattttc cgaagc 26
<210> 48
<211> 54
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 48
gcttcggaaa atacgatgtt gaaaatcctg aaccaaaact tcaaattcga atac 54
<210> 49
<211> 50
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 49
gtgatatcag atccactagt ggcctgccgt aaaccactaa atcggaaccc 50
<210> 50
<211> 26
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 50
tagggcccac aagcttacgc gtcgac 26
<210> 51
<211> 51
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 51
gtcgacgcgt aagcttgtgg gccctactaa tttaattcct tggctgccca g 51
<210> 52
<211> 52
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 52
ctacttttta caacaaatat aacaaaatgg cacaagaaat cactcaccca ac 52
<210> 53
<211> 26
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 53
tttgttatat ttgttgtaaa aagtag 26
<210> 54
<211> 26
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 54
acgcacagat attataacat ctgcac 26
<210> 55
<211> 50
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 55
gtgcagatgt tataatatct gtgcgtatag cttcaaaatg tttctactcc 50
<210> 56
<211> 51
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 56
cttcggaaaa tacgatgttg aaaatgttac tccgcaacgc ttttctgaac g 51
<210> 57
<211> 28
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 57
taaagtaaga gcgctacatt ggtctacc 28
<210> 58
<211> 56
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 58
ggtagaccaa tgtagcgctc ttactttatc atattttctt ctgcaatttc atatag 56
<210> 59
<211> 56
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 59
gtttcgaata aacacacata aacaaacaaa atgactgtca ccataaaaga attgac 56
<210> 60
<211> 30
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 60
tttgtttgtt tatgtgtgtt tattcgaaac 30
<210> 61
<211> 26
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 61
tcgagtttat cattatcaat actgcc 26
<210> 62
<211> 54
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 62
ggcagtattg ataatgataa actcgacctg aaccaaaact tcaaattcga atac 54
<210> 63
<211> 54
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 63
ggcagtattg ataatgataa actcgagttt aaagattacg gatatttaac ttac 54
<210> 64
<211> 34
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 64
ttttagttta tgtatgtgtt ttttgtagtt atag 34
<210> 65
<211> 61
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 65
ctataactac aaaaaacaca tacataaact aaaaatggtt aataattcac agcatcctta 60
c 61
<210> 66
<211> 53
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 66
gactaataat tcttagttaa aagcacttca ttcattaatg accttgtctg ccc 53
<210> 67
<211> 27
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 67
agtgctttta actaagaatt attagtc 27
<210> 68
<211> 26
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 68
aggtatcatc tccatctccc atatgc 26
<210> 69
<211> 54
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 69
gcatatggga gatggagatg atacctcctg aaccaaaact tcaaattcga atac 54
<210> 70
<211> 45
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 70
cgtcctctcg aaaggtggtt taaagattac ggatatttaa cttac 45
<210> 71
<211> 56
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 71
ctacaaaaaa cacatacata aactaaaaat gaacaggatt aagaatacat tttctg 56
<210> 72
<211> 53
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 72
ggtcgacgcg taagcttgtg ggccctatta ccaatagttt ggaaataagt atc 53
<210> 73
<211> 38
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 73
caggtggtca tggccctttg ccgtaaacca ctaaatcg 38
<210> 74
<211> 49
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 74
ctacaaaaaa cacatacata aactaaaaat gaccttggca cccctagac 49
<210> 75
<211> 49
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 75
ggtcgacgcg taagcttgtg ggccctatca gttcaaatca gtaacaacc 49
<210> 76
<211> 60
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 76
cattaaaaaa ctatatcaat taatttgaat taacttacca atagtttgga aataagtatc 60
<210> 77
<211> 49
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 77
catgactcga ggtcgacggt atctcagttc aaatcagtaa caacccttg 49
<210> 78
<211> 55
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 78
gtaattatct actttttaca acaaatataa caaaatgacc ttggcacccc tagac 55
<210> 79
<211> 46
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 79
caggtggtca tggccctttt tgtaattaaa acttagatta gattgc 46
<210> 80
<211> 58
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 80
gcaatctaat ctaagtttta attacaaaat gtccaagagc aaaactttct tatttacc 58
<210> 81
<211> 53
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 81
ctaaatcatt aaagtaactt aaggagttaa atttaaaatt ccaatttctt tgg 53
<210> 82
<211> 32
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 82
atttaactcc ttaagttact ttaatgattt ag 32
<210> 83
<211> 39
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 83
caggtggtca tggcccttgc gaatttctta tgatttatg 39
<210> 84
<211> 21
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 84
gcgaatttct tatgatttat g 21
<210> 85
<211> 48
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 85
cataaatcat aagaaattcg ctcatttttt caacgcttcc ttttgagc 48
<210> 86
<211> 51
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 86
cgaataaaca cacataaaca aacaaaatgt ctatagcaag ttatgcccaa g 51
<210> 87
<211> 47
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 87
caggtggtca tggccctttt tttgattaaa attaaaaaaa ctttttg 47
<210> 88
<211> 29
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 88
tttttgatta aaattaaaaa aactttttg 29
<210> 89
<211> 58
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 89
caaaaagttt ttttaatttt aatcaaaaaa tgtcaatgag taatattgtt gtttttgg 58
<210> 90
<211> 51
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 90
cgtcctctcg aaaggtggtt aattcaaatt aattgatata gttttttaat g 51
<210> 91
<211> 81
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 91
aagggccatg accacctgat gcaccaatta ggtaggtctg gctatgtcta tacctctggc 60
aattcgccct atagtgagtc g 81
<210> 92
<211> 87
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 92
cacctttcga gaggacgatg cccgtgtcta aatgattcga ccagcctaag aatgttcaac 60
gagctccagc ttttgttccc tttagtg 87
<210> 93
<211> 47
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 93
gcatcgtcct ctcgaaaggt gtcgagttta tcattatcaa tactgcc 47
<210> 94
<211> 32
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 94
atttaactcc ttaagttact ttaatgattt ag 32
<210> 95
<211> 21
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 95
gcgaatttct tatgatttat g 21
<210> 96
<211> 37
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 96
gattaatata attatataaa aatattatct tcttttc 37
<210> 97
<211> 23
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 97
ctggtttgtt ttacaaccaa aag 23
<210> 98
<211> 47
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 98
aactctttca taaaatggta tctttaactt tttatttaat cgtaatg 47
<210> 99
<211> 19
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 99
aatgacaagt ttcatcatc 19
<210> 100
<211> 54
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 100
gcaatctaat ctaagtttta attacaaagt tactccgcaa cgcttttctg aacg 54
<210> 101
<211> 56
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 101
gcaatctaat ctaagtttta attacaaaat gaacaggatt aagaatacat tttctg 56
<210> 102
<211> 47
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 102
ctagtgtctc ccgtcttctg tttaccaata gtttggaaat aagtatc 47
<210> 103
<211> 55
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 103
ctataactac aaaaaacaca tacataaact aaaaatgacc ttggcacccc tagac 55
<210> 104
<211> 49
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 104
gactaataat tcttagttaa aagcacttca gttcaaatca gtaacaacc 49
<210> 105
<211> 50
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 105
gtcgacgcgt aagcttgtgg gccctatcat gatgctgtaa tagcagaatc 50
<210> 106
<211> 51
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 106
ctacttttta caacaaatat aacaaaatgg aaggtggtgg tgctagaaat g 51
<210> 107
<211> 55
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 107
ctataactac aaaaaacaca tacataaact aaaaatgcca gagtatacgc tactg 55
<210> 108
<211> 52
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 108
gactaataat tcttagttaa aagcacttca tatatcatat ctacgatcat cg 52
<210> 109
<211> 22
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 109
gtgaaggaac aactcgtgtc tc 22
<210> 110
<211> 22
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 110
gaggatacgt acatatgcaa gc 22
<210> 111
<211> 25
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 111
tgccgtaaac cactaaatcg gaacc 25
<210> 112
<211> 70
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 112
tcagacttct taactcctgt aaaaacaaaa aaaaaaaaag gcatagcaat aagctggagc 60
tcatagcttc 70
<210> 113
<211> 81
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 113
gtgcctattg atgatctggc ggaatgtctg ccgtgccata gccatgcctt cacatatagt 60
ccgcaaatta aagccttcga g 81
<210> 114
<211> 81
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 114
actatatgtg aaggcatggc tatggcacgg cagacattcc gccagatcat caataggcac 60
cttcgtacgc tgcaggtcga c 81
<210> 115
<211> 120
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 115
tgcgcatgtt tcggcgttcg aaacttctcc gcagtgaaag ataaatgatc actcatcaaa 60
ttccgtacat gttttagagc tagaaatagc aagttaaaat aaggctagtc cgttatcaac 120
<210> 116
<211> 120
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 116
gttgataacg gactagcctt attttaactt gctatttcta gctctaaaac atgtacggaa 60
tttgatgagt gatcatttat ctttcactgc ggagaagttt cgaacgccga aacatgcgca 120
<210> 117
<211> 120
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 117
ataaaatatc aaaacgccga tgagacaggc aggataaagt gacagattca gttatacatt 60
tttattagca ttgatattat tattttaaaa agtctattta cttgtatatt tatccgaata 120
<210> 118
<211> 120
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 118
tattcggata aatatacaag taaatagact ttttaaaata ataatatcaa tgctaataaa 60
aatgtataac tgaatctgtc actttatcct gcctgtctca tcggcgtttt gatattttat 120
<210> 119
<211> 20
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 119
ttttgcaaca tccgggcatg 20
<210> 120
<211> 20
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 120
cggctagctg gtatggatcg 20
<210> 121
<211> 24
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 121
atagcttcaa aatgtttcta ctcc 24
<210> 122
<211> 20
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 122
tgcgacagaa gaaagggaag 20
<210> 123
<211> 21
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 123
cgtctatgag gagactgtta g 21
<210> 124
<211> 43
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 124
ggagtagaaa cattttgaag ctatacgtga ccacttcgag agc 43
<210> 125
<211> 52
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 125
gcaatctaat ctaagtttta attacaaaat gactgtcacc ataaaagaat tg 52
<210> 126
<211> 52
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 126
gtgctagtgt ctcccgtctt ctgttcatat tttcttctgc aatttcatat ag 52
<210> 127
<211> 45
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 127
gcttcggaaa atacgatgtt gaaaatcctg cataatcggc ctcac 45
<210> 128
<211> 20
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 128
ctcgccaagg cattaccatc 20
<210> 129
<211> 19
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 129
gagaacgaga ggacccaac 19
<210> 130
<211> 44
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 130
ggttccgatt tagtggttta cggcaacgtg accacttcga gagc 44
<210> 131
<211> 54
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 131
gtcgacgcgt aagcttgtgg gccctatcat attttcttct gcaatttcat atag 54
<210> 132
<211> 50
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 132
ctacttttta caacaaatat aacaaaatga ctgtcaccat aaaagaattg 50
<210> 133
<211> 52
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 133
ggtagaccaa tgtagcgctc ttactttatc attttttcaa cgcttccttt tg 52
<210> 134
<211> 55
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 134
gtttcgaata aacacacata aacaaacaaa atgtctatag caagttatgc ccaag 55
<210> 135
<211> 45
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 135
ggcagtattg ataatgataa actcgacctg cataatcggc ctcac 45
<210> 136
<211> 49
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 136
gactaataat tcttagttaa aagcactcta ccatccatgc tcgaacaac 49
<210> 137
<211> 49
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 137
gactaataat tcttagttaa aagcactcta ccatccatgc tcgaacaac 49
<210> 138
<211> 45
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 138
gcatatggga gatggagatg atacctcctg cataatcggc ctcac 45
<210> 139
<211> 36
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 139
gggtaccggc cgcaaattaa agccttcgag cgtccc 36
<210> 140
<211> 22
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 140
gtgttcattg tacgtcctag ac 22
<210> 141
<211> 19
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 141
gtgcccaaag ctaagagtc 19
<210> 142
<211> 18
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 142
ctgctcttga atggcgac 18
<210> 143
<211> 38
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 143
gtcgccattc aagagcagca tcgtcctctc gaaaggtg 38
<210> 144
<211> 36
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 144
cgaatcttcc catgcctgca ggtggtcatg gccctt 36
<210> 145
<211> 18
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 145
caggcatggg aagattcg 18
<210> 146
<211> 21
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 146
ctggtgagga tttacggtat g 21
<210> 147
<211> 20
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 147
gtgcgttatc gggttcttac 20
<210> 148
<211> 22
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 148
caggttagtt acttgctcta tg 22
<210> 149
<211> 41
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 149
cagtattgat aatgataaac tcgaaatcag acgcacgctt g 41
<210> 150
<211> 40
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 150
ctttaatttg cggccggtac ccttacgtgg attgagccag 40
<210> 151
<211> 24
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 151
gattgtcata ataggagcta tttg 24
<210> 152
<211> 25
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 152
ccatagtatt actattggtg ttcat 25
<210> 153
<211> 22
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 153
gttatcggtt gtgatattgt tc 22
<210> 154
<211> 22
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 154
ttaagctatt gtttcggcaa tt 22
<210> 155
<211> 29
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 155
cgtgcgauct ctataaaaaa tgtgcgaac 29
<210> 156
<211> 27
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 156
atgacagaut ggtgttgtgg ttctgtg 27
<210> 157
<211> 21
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 157
gagatctttg tgttcggtta c 21
<210> 158
<211> 18
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 158
agtctcgtat gtcggctc 18
<210> 159
<211> 18
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 159
tgtgtccgcg tttctaag 18
<210> 160
<211> 21
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 160
gaggtggtta ttgatcacca g 21
<210> 161
<211> 18
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 161
acgaatcgtt aggcacag 18
<210> 162
<211> 17
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 162
gtgcaatacc aaaatcg 17
<210> 163
<211> 28
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 163
gcagttgttt ggattaaaaa gctgtacg 28
<210> 164
<211> 25
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 164
ccttgtgtca tcatttactc caggc 25
<210> 165
<211> 26
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 165
gtagagtctt agctgcagtt ggtatg 26
<210> 166
<211> 24
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 166
cagggcatta ttactgacgg catg 24
<210> 167
<211> 26
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 167
ctacagcacc tttgaaagaa ggtgtc 26
<210> 168
<211> 23
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 168
gttgatggtg tcgtagacgt cag 23
<210> 169
<211> 20
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 169
ggaacgtgga tttaccccag 20
<210> 170
<211> 26
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 170
gttatcggtt gtgatattgt tcctgc 26
<210> 171
<211> 20
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 171
caaagcgatg ggctccagac 20
<210> 172
<211> 23
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 172
cattccgcag ttaacatgtg gtc 23
<210> 173
<211> 28
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 173
gtgttcattg tacgtcctag actcaaac 28
<210> 174
<211> 21
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 174
cgtgcgttat cgggttctta c 21
<210> 175
<211> 59
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 175
ggaacactgg ggcaataggc tgtcgccatt caagagcagc atcgtcctct cgaaaggtg 59
<210> 176
<211> 60
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 176
ctattgtaat tcaaaaaaaa aaagcgaatc ttcccatgcc tgcaggtggt catggccctt 60
<210> 177
<211> 24
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 177
cttgcataaa ttggtcaatg caag 24
<210> 178
<211> 25
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 178
cgatgacctc ccattgatat ttaag 25
<210> 179
<211> 24
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 179
catcgtcaat ttgtgatcga agac 24
<210> 180
<211> 19
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 180
cattcgccag gtagcttac 19
<210> 181
<211> 21
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 181
tgcattttga gcgttgaaca a 21
<210> 182
<211> 20
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 182
gtgccctgtt ctctgtagtt 20
<210> 183
<211> 60
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 183
atcgggccct ccttactgct ctccttccgt gtaacgcgtt tgccgtaaac cactaaatcg 60
<210> 184
<211> 57
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 184
cgttaagaaa aatttcgaga gagtcgccga tagtagattt tcaacatcgt attttcc 57
<210> 185
<211> 57
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 185
cctccttact gctctccttc cgtgtaacgc gttatagctt caaaatgttt ctactcc 57
<210> 186
<211> 30
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 186
ttatcgagct aactattttc gacacacatg 30
<210> 187
<211> 22
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 187
cgtcgcccag taagtgagac ta 22
<210> 188
<211> 59
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 188
gaaagcatag caatctaatc taagttttaa ttacaaaatg tcaatgagta atattgttg 59
<210> 189
<211> 51
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 189
aagttgtgtg ctagtgtctc ccgtcttctg tctaccatcc atgctcgaac a 51
<210> 190
<211> 60
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 190
ctcgcctagt aaataaacga taaacaaatt tgaagtagta gatacacgta tctcgacatg 60
<210> 191
<211> 59
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 191
gaatgcaaca ccgtagcatg aatcttgaga ttgcatctga taatgggtta gtagtttat 59
<210> 192
<211> 70
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 192
tctccgcagt gaaagataaa tgatcaattt acgaaaaata aaggcgtttt agagctagaa 60
atagcaagtt 70
<210> 193
<211> 70
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 193
aacttgctat ttctagctct aaaacgcctt tatttttcgt aaattgatca tttatctttc 60
actgcggaga 70
<210> 194
<211> 60
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 194
aataaaggca aaaacagtgg tcgtgtgaga aatctatttt ttcgaaatta cttacacttt 60
<210> 195
<211> 60
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 195
aaagtgtaag taatttcgaa aaaatagatt tctcacacga ccactgtttt tgcctttatt 60
<210> 196
<211> 20
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 196
ggtcacccac ccatatacgg 20
<210> 197
<211> 20
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 197
tgtcctccgg ataactgcac 20
<210> 198
<211> 120
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 198
tgcgcatgtt tcggcgttcg aaacttctcc gcagtgaaag ataaatgatc aatttacgaa 60
aaataaaggc gttttagagc tagaaatagc aagttaaaat aaggctagtc cgttatcaac 120
<210> 199
<211> 120
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 199
gttgataacg gactagcctt attttaactt gctatttcta gctctaaaac gcctttattt 60
ttcgtaaatt gatcatttat ctttcactgc ggagaagttt cgaacgccga aacatgcgca 120
<210> 200
<211> 120
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 200
tctttttttg ttcccaacaa gaagtgagtt aataaaggca aaaacagtgg tcgtgtgaga 60
aatctatttt ttcgaaatta cttacacttt tgacggctag aaaaggatat acatacatat 120
<210> 201
<211> 120
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 201
atatgtatgt atatcctttt ctagccgtca aaagtgtaag taatttcgaa aaaatagatt 60
tctcacacga ccactgtttt tgcctttatt aactcacttc ttgttgggaa caaaaaaaga 120
<210> 202
<211> 120
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 202
tgcgcatgtt tcggcgttcg aaacttctcc gcagtgaaag ataaatgatc atcttcaaat 60
ccactacata gttttagagc tagaaatagc aagttaaaat aaggctagtc cgttatcaac 120
<210> 203
<211> 120
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 203
gttgataacg gactagcctt attttaactt gctatttcta gctctaaaac tatgtagtgg 60
atttgaagat gatcatttat ctttcactgc ggagaagttt cgaacgccga aacatgcgca 120
<210> 204
<211> 120
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 204
ttgtacgctt cacatagtag ttcagtcaag aagagcaaac actaataagc aataaatcta 60
ggagaatata catatatatg catatgtttg tttagctaaa taattttatt gagctttgct 120
<210> 205
<211> 120
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 205
agcaaagctc aataaaatta tttagctaaa caaacatatg catatatatg tatattctcc 60
tagatttatt gcttattagt gtttgctctt cttgactgaa ctactatgtg aagcgtacaa 120
<210> 206
<211> 21
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 206
acaccaatat tctgcacctg c 21
<210> 207
<211> 20
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 207
tgctggagaa gatcgtacgc 20
<210> 208
<211> 120
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 208
tgcgcatgtt tcggcgttcg aaacttctcc gcagtgaaag ataaatgatc ttagtagttt 60
ttggaaggat gttttagagc tagaaatagc aagttaaaat aaggctagtc cgttatcaac 120
<210> 209
<211> 120
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 209
gttgataacg gactagcctt attttaactt gctatttcta gctctaaaac atccttccaa 60
aaactactaa gatcatttat ctttcactgc ggagaagttt cgaacgccga aacatgcgca 120
<210> 210
<211> 120
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 210
ttctttttta tattttttag gttttcatat agtgtcttac gcaaataggc ggaccataga 60
aaagccgcca tttgtgtctc ctcatactta catagaatag ccctcttcta ttatccttcg 120
<210> 211
<211> 120
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 211
cgaaggataa tagaagaggg ctattctatg taagtatgag gagacacaaa tggcggcttt 60
tctatggtcc gcctatttgc gtaagacact atatgaaaac ctaaaaaata taaaaaagaa 120
<210> 212
<211> 20
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 212
tacagctcgc tccttgcatc 20
<210> 213
<211> 20
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 213
gcttgcttgg agggcttttc 20
<210> 214
<211> 120
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 214
tgcgcatgtt tcggcgttcg aaacttctcc gcagtgaaag ataaatgatc aagaaccctt 60
tatcataatt gttttagagc tagaaatagc aagttaaaat aaggctagtc cgttatcaac 120
<210> 215
<211> 120
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 215
gttgataacg gactagcctt attttaactt gctatttcta gctctaaaac aattatgata 60
aagggttctt gatcatttat ctttcactgc ggagaagttt cgaacgccga aacatgcgca 120
<210> 216
<211> 120
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 216
ttttttgatt gttctacaac tttttcatag taatcaaaac ctttgaattt caaacttact 60
aggatatatt taaccacgac tttcgcaaga gagacggagg gggtgggaaa aggctgaatg 120
<210> 217
<211> 120
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 217
cattcagcct tttcccaccc cctccgtctc tcttgcgaaa gtcgtggtta aatatatcct 60
agtaagtttg aaattcaaag gttttgatta ctatgaaaaa gttgtagaac aatcaaaaaa 120
<210> 218
<211> 20
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 218
tcatccaggt ttcagcacgg 20
<210> 219
<211> 20
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 219
agctcgaaca aggtgtcagg 20
<210> 220
<211> 60
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 220
gattacttac caatgtgcca taaactccgt gcaccaatag cttcaaaatg tttctactcc 60
<210> 221
<211> 60
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 221
cttgggttgt gggcaattgg gtgtactatg aagcattttc aacatcgtat tttccgaagc 60
<210> 222
<211> 47
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 222
gcaatctaat ctaagtttta attacaaaat ggcacaagaa atcactc 47
<210> 223
<211> 45
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 223
gtgctagtgt ctcccgtctt ctgtctaatt taattccttg gctgc 45
<210> 224
<211> 25
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 224
gaccatcact aaagcttctc tctta 25
<210> 225
<211> 25
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 225
ttgagcaatt catcgacaac aagag 25
<210> 226
<211> 43
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 226
agaaccagaa ccagatccta ttttcttctg caatttcata tag 43
<210> 227
<211> 37
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 227
ggatctggtt ctggttctat ggcacaagaa atcactc 37
<210> 228
<211> 36
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 228
agaaccagaa ccagatccat ttaattcctt ggctgc 36
<210> 229
<211> 42
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 229
ggatctggtt ctggttctat gactgtcacc ataaaagaat tg 42
<210> 230
<211> 60
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 230
gattacttac caatgtgcca taaactccgt gcaccatgcc gtaaaccact aaatcggaac 60
<210> 231
<211> 120
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 231
tgcgcatgtt tcggcgttcg aaacttctcc gcagtgaaag ataaatgatc ttcttagatt 60
actattatat gttttagagc tagaaatagc aagttaaaat aaggctagtc cgttatcaac 120
<210> 232
<211> 120
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 232
gttgataacg gactagcctt attttaactt gctatttcta gctctaaaac atataatagt 60
aatctaagaa gatcatttat ctttcactgc ggagaagttt cgaacgccga aacatgcgca 120
<210> 233
<211> 120
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 233
atagttattt tgaaataata actaccatta gaactaacaa aagaaaagaa aaaaaaaata 60
taccatttgc aagacattgt ataatatttt tgttgaaagt ctttttcgat tcataagcgc 120
<210> 234
<211> 120
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 234
gcgcttatga atcgaaaaag actttcaaca aaaatattat acaatgtctt gcaaatggta 60
tatttttttt ttcttttctt ttgttagttc taatggtagt tattatttca aaataactat 120
<210> 235
<211> 20
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 235
ctcatcgcat gccaacgaag 20
<210> 236
<211> 20
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 236
gcagcaaagc caacccttac 20
<210> 237
<211> 49
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 237
gcaatctaat ctaagtttta attacaaaat gtctgttcac tctatcttg 49
<210> 238
<211> 45
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 238
gtgctagtgt ctcccgtctt ctgtttattc aaccatcttc tttgg 45
<210> 239
<211> 45
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 239
caatgtagcg ctcttacttt attatttcaa gtccttgaaa ttacc 45
<210> 240
<211> 22
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 240
ctggagctca gtttatcatt at 22
<210> 241
<211> 21
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 241
actatagggc gaattgggta c 21
<210> 242
<211> 40
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 242
ataatgataa actgagctcc agagtctcgt atgtcggctc 40
<210> 243
<211> 39
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 243
gtacccaatt cgccctatag ttgtgtccgc gtttctaag 39
<210> 244
<211> 52
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 244
ggtagaccaa tgtagcgctc ttactttatt attcaaccat cttctttgga ac 52
<210> 245
<211> 52
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 245
gtttcgaata aacacacata aacaaacaaa atgtctgttc actctatctt gt 52
<210> 246
<211> 57
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 246
ctataactac aaaaaacaca tacataaact aaaaatgccg tttggaatag acaacac 57
<210> 247
<211> 50
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 247
gactaataat tcttagttaa aagcacttta ccagacatct tcttggtatc 50
<210> 248
<211> 19
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 248
tggtcacaca acttgtctg 19
<210> 249
<211> 20
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 249
ggtactggtg gtttcacttg 20
<210> 250
<211> 21
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 250
gtagtgatca ttggcttaac g 21
<210> 251
<211> 46
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 251
gttccgattt agtggtttac ggcagtgaca ataaattcaa accggt 46
<210> 252
<211> 46
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 252
gcttcggaaa atacgatgtt gaaaatcaac tcagaagttt gacagc 46
<210> 253
<211> 21
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 253
tcgttagatt ctgtatccct a 21
<210> 254
<211> 46
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 254
ggagtagaaa cattttgaag ctatgtgaca ataaattcaa accggt 46
<210> 255
<211> 22
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 255
atttgtgatc gaagacgaag ag 22
<210> 256
<211> 18
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 256
tcaagaagcc actacgtg 18
<210> 257
<211> 24
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 257
actagaacat taccatatgt agtg 24
<210> 258
<211> 20
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 258
caacttggac gttcttctac 20
<210> 259
<211> 619
<212> PRT
<213> 大肠杆菌
<400> 259
Met Ser Lys Gly Thr Thr Ser Leu Asp Ala Pro Phe Gly Thr Leu Leu
1 5 10 15
Gly Tyr Ala Pro Gly Gly Val Ala Ile Tyr Ser Ser Asp Tyr Ser Ser
20 25 30
Leu Asp Pro Gln Glu Tyr Glu Asp Asp Ala Val Phe Arg Ser Tyr Ile
35 40 45
Asp Asp Glu Tyr Met Gly His Lys Trp Gln Cys Val Glu Phe Ala Arg
50 55 60
Arg Phe Leu Phe Leu Asn Tyr Gly Val Val Phe Thr Asp Val Gly Met
65 70 75 80
Ala Trp Glu Ile Phe Ser Leu Arg Phe Leu Arg Glu Val Val Asn Asp
85 90 95
Asn Ile Leu Pro Leu Gln Ala Phe Pro Asn Gly Ser Pro Arg Ala Pro
100 105 110
Val Ala Gly Ala Leu Leu Ile Trp Asp Lys Gly Gly Glu Phe Lys Asp
115 120 125
Thr Gly His Val Ala Ile Ile Thr Gln Leu His Gly Asn Lys Val Arg
130 135 140
Ile Ala Glu Gln Asn Val Ile His Thr Pro Leu Pro Gln Gly Gln Gln
145 150 155 160
Trp Thr Arg Glu Leu Glu Met Val Val Glu Asn Gly Gly Tyr Thr Leu
165 170 175
Lys Asp Thr Phe Asp Asp Thr Thr Ile Leu Gly Trp Met Ile Gln Thr
180 185 190
Glu Asp Thr Glu Tyr Ser Leu Pro Gln Pro Glu Ile Ala Gly Glu Leu
195 200 205
Leu Lys Ile Ser Gly Ala Arg Leu Glu Asn Lys Gly Gln Phe Asp Gly
210 215 220
Lys Trp Leu Asp Glu Lys Asp Pro Leu Gln Asn Ala Tyr Val Gln Ala
225 230 235 240
Asn Gly Gln Val Ile Asn Gln Asp Pro Tyr His Tyr Tyr Thr Ile Thr
245 250 255
Glu Ser Ala Glu Gln Glu Leu Ile Lys Ala Thr Asn Glu Leu His Leu
260 265 270
Met Tyr Leu His Ala Thr Asp Lys Val Leu Lys Asp Asp Asn Leu Leu
275 280 285
Ala Leu Phe Asp Ile Pro Lys Ile Leu Trp Pro Arg Leu Arg Leu Ser
290 295 300
Trp Gln Arg Arg Arg His His Met Ile Thr Gly Arg Met Asp Phe Cys
305 310 315 320
Met Asp Glu Arg Gly Leu Lys Val Tyr Glu Tyr Asn Ala Asp Ser Ala
325 330 335
Ser Cys His Thr Glu Ala Gly Leu Ile Leu Glu Arg Trp Ala Glu Gln
340 345 350
Gly Tyr Lys Gly Asn Gly Phe Asn Pro Ala Glu Gly Leu Ile Asn Glu
355 360 365
Leu Ala Gly Ala Trp Lys His Ser Arg Ala Arg Pro Phe Val His Ile
370 375 380
Met Gln Asp Lys Asp Ile Glu Glu Asn Tyr His Ala Gln Phe Met Glu
385 390 395 400
Gln Ala Leu Gln Gln Ala Gly Phe Glu Thr Arg Ile Leu Arg Gly Leu
405 410 415
Asp Glu Leu Gly Trp Asp Ala Ala Gly Gln Leu Ile Asp Gly Glu Gly
420 425 430
Arg Leu Val Asn Cys Val Trp Lys Thr Trp Ala Trp Glu Thr Ala Phe
435 440 445
Asp Gln Ile Arg Glu Val Ser Asp Arg Glu Phe Ala Ala Val Pro Ile
450 455 460
Arg Thr Gly His Pro Gln Asn Glu Val Arg Leu Ile Asp Val Leu Leu
465 470 475 480
Arg Pro Glu Val Leu Val Phe Glu Pro Leu Trp Thr Val Ile Pro Gly
485 490 495
Asn Lys Ala Ile Leu Pro Ile Leu Trp Ser Leu Phe Pro His His Arg
500 505 510
Tyr Leu Leu Asp Thr Asp Phe Thr Val Asn Asp Glu Leu Val Lys Thr
515 520 525
Gly Tyr Ala Val Lys Pro Ile Ala Gly Arg Cys Gly Ser Asn Ile Asp
530 535 540
Leu Val Ser His His Glu Glu Val Leu Asp Gln Thr Ser Gly Lys Phe
545 550 555 560
Ala Glu Gln Lys Asn Ile Tyr Gln Gln Leu Trp Cys Leu Pro Lys Val
565 570 575
Asp Gly Lys Tyr Ile Gln Val Cys Thr Phe Thr Val Gly Gly Asn Tyr
580 585 590
Gly Gly Thr Cys Leu Arg Gly Asp Glu Ser Leu Val Ile Lys Lys Glu
595 600 605
Ser Asp Ile Glu Pro Leu Ile Val Val Lys Glu
610 615
<210> 260
<211> 1860
<212> DNA
<213> 大肠杆菌
<400> 260
atgagtaaag ggaccaccag tttggacgcc ccttttggaa cattgttagg atacgcccct 60
gggggggtag caatttatag cagtgactat tcctcactgg acccacaaga atatgaagat 120
gatgccgtct tccgtagtta tatagatgat gaatacatgg gccataaatg gcaatgcgtt 180
gagtttgcta ggcgtttctt attcttgaac tatggtgtcg tatttactga cgtgggaatg 240
gcttgggaga tattctctct aagatttctg cgtgaggtcg tcaatgataa catcctgccg 300
ttacaagcat ttccaaatgg ttcacccaga gcccctgtag ctggtgctct actaatatgg 360
gataaaggag gtgaatttaa ggatactggg cacgtagcga taattacgca attacacggc 420
aacaaagtaa gaattgcaga acaaaatgtc atccatacac cactacctca aggacaacag 480
tggactagag agctagaaat ggttgtggaa aacggtgggt atacgttaaa agatacgttc 540
gacgatacca ccattcttgg ttggatgatc cagacggaag acactgagta ctctctgcca 600
cagcctgaga ttgccggcga attattgaaa atcagcggag cacgtttgga aaacaaaggg 660
caattcgacg ggaagtggct ggatgagaaa gatcctttac agaacgccta cgtacaagcc 720
aacgggcagg ttatcaacca agacccgtat cattattaca cgattacaga aagtgccgag 780
caggagctaa tcaaggccac aaacgagtta caccttatgt acttgcatgc cacggacaag 840
gtcttgaaag acgacaatct tttagcccta tttgacattc ctaagatcct atggcctagg 900
ctacgtttat cctggcagcg taggcgtcac cacatgatca cgggacgtat ggacttctgc 960
atggatgaac gtgggctgaa ggtttatgaa tacaacgccg actctgccag ttgccataca 1020
gaagctggcc ttatcctgga gagatgggct gagcaggggt acaaaggaaa cggattcaac 1080
ccagccgaag gattaatcaa cgaactggcc ggtgcatgga agcatagcag ggctcgtcca 1140
ttcgtccata tcatgcagga caaggatatc gaggaaaatt accatgcgca gtttatggaa 1200
caagcactac aacaagctgg gtttgagacc cgtatactaa ggggccttga tgagcttggc 1260
tgggacgctg ctggccagtt gattgatgga gaaggacgtc ttgtgaattg cgtttggaaa 1320
acgtgggcat gggagacagc cttcgatcag ataagagaag tatcagatag agaatttgct 1380
gctgtaccga ttagaacagg ccacccgcaa aacgaagtaa gactgataga cgtattgtta 1440
agacctgagg tgctggtttt tgaacctctt tggacagtaa tacctggaaa caaagcaata 1500
cttccgatct tatggtcact attcccgcac cacaggtatt tgttggacac cgattttact 1560
gtcaacgatg aactggtcaa gacggggtac gcggtcaaac ccatagctgg ccgttgtggc 1620
agtaatatag atttggtatc tcatcacgag gaggtcttgg atcaaactag tgggaaattt 1680
gccgaacaga aaaatattta tcagcagctg tggtgcttgc ctaaagttga tgggaaatat 1740
attcaagtct gtacttttac agtgggaggt aattacggag gcacgtgttt gagaggtgac 1800
gaatcacttg taataaagaa agagtctgat attgagccct taattgtagt gaaggaatag 1860
Claims (14)
1.能够生产谷胱甘肽-多胺缀合物的酵母细胞,其中
所述酵母细胞能够生产至少一种多胺;
所述酵母细胞包含多胺:谷胱甘肽连接酶编码基因;
所述酵母细胞包含至少一种多胺合酶编码基因;并且
所述酵母细胞缺少多胺氧化酶编码基因或包含扰乱的多胺氧化酶编码基因。
2.根据权利要求1所述的酵母细胞,其中所述酵母细胞被工程化以用于多胺:谷胱甘肽连接酶的过量表达。
3.根据权利要求1或2所述的酵母细胞,其中所述谷胱甘肽-多胺缀合物选自锥虫胱甘肽、N 1 -谷胱甘酰基亚精胺、N 10 -谷胱甘酰基亚精胺、N 1 ,N 10 -双(谷胱甘酰基)精胺、N 1 ,N 5 ,N 10 -三(谷胱甘酰基)精胺和N 1 ,N 5 ,N 10 ,N 14 -四(谷胱甘酰基)精胺。
4.根据权利要求1-3的任一项所述的酵母细胞,其中所述多胺:谷胱甘肽连接酶编码基因选自锥虫胱甘肽合酶、谷胱甘酰基亚精胺合酶及其组合。
5. 根据权利要求4所述的酵母细胞,其中所述锥虫胱甘肽合酶编码基因选自布氏锥虫指名亚种(Trypanosoma brucei brucei) TbbTryS和核苷酸序列,所述核苷酸序列编码与锥虫胱甘肽合酶TbbTryS具有至少80%的序列同一性的锥虫胱甘肽合酶。
6. 根据权利要求4或5所述的酵母细胞,其中所述谷胱甘酰基亚精胺合酶(GSS)编码基因选自大肠杆菌(Escherichia coli) EcGSS和核苷酸序列,所述核苷酸序列编码与谷胱甘酰基亚精胺合酶EcGSS具有至少80%的序列同一性的谷胱甘酰基亚精胺合酶。
7.根据权利要求1-6的任一项所述的酵母细胞,其中所述至少一种多胺选自精胺、热精胺、对称-高亚精胺、1,3-二氨基丙烷、腐胺、尸胺、胍丁胺、亚精胺、对称-降亚精胺、降精胺及其组合。
8.根据权利要求1-7的任一项所述的酵母细胞,其中所述酵母细胞被工程化以用于所述至少一种多胺合酶的过量表达。
9.根据权利要求1-8的任一项所述的酵母细胞,其中所述多胺合酶编码基因选自精胺合酶编码基因、热精胺合酶编码基因和高亚精胺合酶编码基因。
10. 根据权利要求9所述的酵母细胞,其中所述精胺合酶编码基因选自酿酒酵母(Saccharomcyes cerevisiae) SPE4、拟南芥(Arabidopsis thaliana) AtSPMS和核苷酸序列,所述核苷酸序列编码与精胺合酶SPE4或精胺合酶AtSPMS具有至少80%的序列同一性的精胺合酶。
11.根据权利要求9或10所述的酵母细胞,其中所述热精胺合酶编码基因选自拟南芥AtACL5和核苷酸序列,所述核苷酸序列编码与热精胺合酶AtACL5具有至少80%的序列同一性的热精胺合酶。
12. 根据权利要求9-11的任一项所述的酵母细胞,其中所述高亚精胺合酶编码基因选自春千里光(Senecio vernalis) SvHSS和绿色芽生绿菌(Blastochloris viridis) BvHSS和核苷酸序列,所述核苷酸序列编码与高亚精胺合酶SvHSS或高亚精胺合酶BvHSS具有至少80%的序列同一性的高亚精胺合酶。
13.根据权利要求1-12的任一项所述的酵母细胞,其中所述酵母细胞为酿酒酵母细胞并且所述多胺氧化酶为FMS1。
14. 生产谷胱甘肽-多胺缀合物的方法,所述方法包含:
在培养基中并且在适合于通过根据权利要求1-13的任一项所述的酵母细胞生产所述谷胱甘肽-多胺缀合物的培养条件下培养所述酵母细胞;以及
从所述培养基和/或从所述酵母细胞中收集所述谷胱甘肽-多胺缀合物。
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
SE1951232-6 | 2019-10-28 | ||
SE1951232 | 2019-10-28 | ||
PCT/EP2020/080140 WO2021083870A1 (en) | 2019-10-28 | 2020-10-27 | Polyamine conjugate producing yeasts |
Publications (1)
Publication Number | Publication Date |
---|---|
CN114599778A true CN114599778A (zh) | 2022-06-07 |
Family
ID=73030124
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202080075687.9A Pending CN114599778A (zh) | 2019-10-28 | 2020-10-27 | 生产多胺缀合物的酵母 |
Country Status (10)
Country | Link |
---|---|
US (1) | US20240294927A1 (zh) |
EP (1) | EP4051801A1 (zh) |
JP (1) | JP2023501189A (zh) |
KR (1) | KR20220088729A (zh) |
CN (1) | CN114599778A (zh) |
AU (1) | AU2020376487A1 (zh) |
BR (1) | BR112022006280A2 (zh) |
CA (1) | CA3156097A1 (zh) |
MX (1) | MX2022004991A (zh) |
WO (1) | WO2021083870A1 (zh) |
Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP1757607A1 (en) * | 2005-08-24 | 2007-02-28 | Molisa GmbH | N5-substituted benzo¬2,3|azepino¬4,5-b|indol-6-ones for treating tropical diseases |
WO2019013696A1 (en) * | 2017-07-14 | 2019-01-17 | Biopetrolia Ab | MICROBIAL CELLS FOR THE PRODUCTION OF SPERMIDINE |
Family Cites Families (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US10544435B2 (en) | 2015-03-12 | 2020-01-28 | Chrysea Limited | L-ornithine production in eukaryotic cells |
-
2020
- 2020-10-27 EP EP20797739.8A patent/EP4051801A1/en active Pending
- 2020-10-27 WO PCT/EP2020/080140 patent/WO2021083870A1/en unknown
- 2020-10-27 JP JP2022525087A patent/JP2023501189A/ja active Pending
- 2020-10-27 BR BR112022006280A patent/BR112022006280A2/pt unknown
- 2020-10-27 MX MX2022004991A patent/MX2022004991A/es unknown
- 2020-10-27 KR KR1020227016457A patent/KR20220088729A/ko active Pending
- 2020-10-27 CA CA3156097A patent/CA3156097A1/en active Pending
- 2020-10-27 US US17/768,618 patent/US20240294927A1/en active Pending
- 2020-10-27 CN CN202080075687.9A patent/CN114599778A/zh active Pending
- 2020-10-27 AU AU2020376487A patent/AU2020376487A1/en not_active Abandoned
Patent Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP1757607A1 (en) * | 2005-08-24 | 2007-02-28 | Molisa GmbH | N5-substituted benzo¬2,3|azepino¬4,5-b|indol-6-ones for treating tropical diseases |
WO2019013696A1 (en) * | 2017-07-14 | 2019-01-17 | Biopetrolia Ab | MICROBIAL CELLS FOR THE PRODUCTION OF SPERMIDINE |
Non-Patent Citations (2)
Title |
---|
EMMANUEL TETAUD等: "Cloning and Characterization of the Two Enzymes Responsible for Trypanothione Biosynthesis in Crithidia fasciculata", 《THE JOURNAL OF BIOLOGICAL CHEMISTRY》, vol. 273, no. 31, pages 19383 - 19390, XP055767446, DOI: 10.1074/jbc.273.31.19383 * |
SANDRA L. OZA等: "A Single Enzyme Catalyses Formation of Trypanothione from Glutathione and Spermidine in Trypanosoma cruzi", 《THE JOURNAL OF BIOLOGICAL CHEMISTRY》, vol. 277, no. 39, pages 35853 - 35861, XP055767299, DOI: 10.1074/jbc.M204403200 * |
Also Published As
Publication number | Publication date |
---|---|
CA3156097A1 (en) | 2021-05-06 |
WO2021083870A1 (en) | 2021-05-06 |
MX2022004991A (es) | 2022-05-16 |
US20240294927A1 (en) | 2024-09-05 |
JP2023501189A (ja) | 2023-01-18 |
BR112022006280A2 (pt) | 2022-06-28 |
EP4051801A1 (en) | 2022-09-07 |
AU2020376487A1 (en) | 2022-04-14 |
KR20220088729A (ko) | 2022-06-28 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
EP2024501B1 (en) | Improved production of sphingoid bases using genetically engineered microbial strains | |
KR20180083350A (ko) | 니코틴아미드 리보시드의 미생물학적 제조 | |
JP2008507970A (ja) | アラニン2,3アミノムターゼ | |
KR20210126061A (ko) | 미생물 내 디오스민 및/또는 헤스페리딘의 생합성 방법 | |
KR101718681B1 (ko) | 가용성 단백질 발현량 및 활성이 증대된 헬리코박터 파일로리 유래 α-1,3 푸코실 전달효소의 유전자와 단백질 및 α-1,3 푸코실올리고당 생산에의 응용 | |
WO2013054447A1 (ja) | γ‐Glu‐X‐Glyまたはその塩の製造方法、および変異型グルタチオン合成酵素 | |
CN113748204A (zh) | 在微生物中生物合成香叶木素和/或橙皮素的方法 | |
US9611490B2 (en) | Biological method for producing cis-5-hydroxy-L-pipecolic acid | |
JP2007181452A (ja) | デキストラン生成酵素遺伝子、デキストラン生成酵素およびその製造方法、デキストランの製造方法 | |
EP4424834A2 (en) | Microorganism having increased activity of 3-methyl-2-oxobutanoate hydroxymethyltransferase, and use thereof | |
EP2071022B1 (en) | S-adenosylmethionine synthetase mutants, the dnas encoding the same and uses of the mutants | |
CN114585727A (zh) | 生产多胺类似物的酵母 | |
JP2018518189A (ja) | エフェドリンおよび関連アルカロイド化合物を製造するための方法および組成物 | |
KR20220152728A (ko) | 3-메틸-2-옥소뷰타노에이트 하이드록시 메틸트랜스퍼라아제의 활성이 강화된 미생물, 및 이의 용도 | |
CN115335514A (zh) | 罗汉果甙的生物合成 | |
JP6553963B2 (ja) | 改変スターメレラ属微生物 | |
CN114599778A (zh) | 生产多胺缀合物的酵母 | |
KR20020029767A (ko) | 환상 뎁시펩티드 합성효소 및 그의 유전자 및 환상뎁시펩티드의 대량생산계 | |
WO2018180568A1 (ja) | 変異型2-デオキシ-シロ-イノソース合成酵素 | |
WO2001083787A1 (fr) | Nouveaux promoteurs et methode d'expression genique reposant sur l'utilisation desdits promoteurs | |
KR101713885B1 (ko) | sn-1 위치 특이적 리파아제 및 그 생산방법 | |
KR101071274B1 (ko) | 미생물 유래의 사슬형 트랜스아미나제를 이용한 L-6-hydroxynorleucine의 생산방법 | |
EP1485473B1 (en) | Production of ugppase | |
WO2003078616A1 (en) | Production of ugppase | |
Kwak et al. | Expression, Purification, and Crystallization of Glutamyl-tRNA [supGln] Specific Amidotransferase from Bacillus stearothermophilus. |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
REG | Reference to a national code |
Ref country code: HK Ref legal event code: DE Ref document number: 40075728 Country of ref document: HK |