CN113502297B - Recombinant pichia pastoris for synthesizing guanosine diphosphate fucose, construction method and application thereof - Google Patents
Recombinant pichia pastoris for synthesizing guanosine diphosphate fucose, construction method and application thereof Download PDFInfo
- Publication number
- CN113502297B CN113502297B CN202110654277.6A CN202110654277A CN113502297B CN 113502297 B CN113502297 B CN 113502297B CN 202110654277 A CN202110654277 A CN 202110654277A CN 113502297 B CN113502297 B CN 113502297B
- Authority
- CN
- China
- Prior art keywords
- leu
- ala
- gly
- ser
- val
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- LQEBEXMHBLQMDB-QIXZNPMTSA-N GDP-L-fucose Chemical compound O[C@H]1[C@H](O)[C@H](O)[C@H](C)OC1OP(O)(=O)OP(O)(=O)OC[C@@H]1[C@@H](O)[C@@H](O)[C@H](N2C3=C(C(NC(N)=N3)=O)N=C2)O1 LQEBEXMHBLQMDB-QIXZNPMTSA-N 0.000 title claims abstract description 18
- 241000235058 Komagataella pastoris Species 0.000 title claims abstract description 18
- 238000010276 construction Methods 0.000 title abstract description 9
- 230000002194 synthesizing effect Effects 0.000 title abstract description 6
- 108010083136 fucokinase Proteins 0.000 claims abstract description 21
- 102100040648 L-fucose kinase Human genes 0.000 claims abstract description 17
- PTVXQARCLQPGIR-SXUWKVJYSA-N beta-L-fucose 1-phosphate Chemical compound C[C@@H]1O[C@H](OP(O)(O)=O)[C@@H](O)[C@H](O)[C@@H]1O PTVXQARCLQPGIR-SXUWKVJYSA-N 0.000 claims description 15
- 238000000855 fermentation Methods 0.000 claims description 12
- 230000004151 fermentation Effects 0.000 claims description 12
- 239000013598 vector Substances 0.000 claims description 8
- 238000000034 method Methods 0.000 claims description 7
- 239000007788 liquid Substances 0.000 claims description 5
- 238000002360 preparation method Methods 0.000 claims description 3
- 241000235648 Pichia Species 0.000 claims 7
- 125000003275 alpha amino acid group Chemical group 0.000 claims 2
- 241001506991 Komagataella phaffii GS115 Species 0.000 abstract description 7
- 102000004357 Transferases Human genes 0.000 abstract description 2
- 108090000992 Transferases Proteins 0.000 abstract description 2
- 238000012986 modification Methods 0.000 abstract 1
- 230000004048 modification Effects 0.000 abstract 1
- 210000004027 cell Anatomy 0.000 description 17
- SHZGCJCMOBCMKK-UHFFFAOYSA-N D-mannomethylose Natural products CC1OC(O)C(O)C(O)C1O SHZGCJCMOBCMKK-UHFFFAOYSA-N 0.000 description 11
- PNNNRSAQSRJVSB-SLPGGIOYSA-N Fucose Natural products C[C@H](O)[C@@H](O)[C@H](O)[C@H](O)C=O PNNNRSAQSRJVSB-SLPGGIOYSA-N 0.000 description 10
- SHZGCJCMOBCMKK-DHVFOXMCSA-N L-fucopyranose Chemical compound C[C@@H]1OC(O)[C@@H](O)[C@H](O)[C@@H]1O SHZGCJCMOBCMKK-DHVFOXMCSA-N 0.000 description 10
- 241000282898 Sus scrofa Species 0.000 description 10
- 230000015572 biosynthetic process Effects 0.000 description 7
- 238000003786 synthesis reaction Methods 0.000 description 7
- QGWNDRXFNXRZMB-UUOKFMHZSA-N GDP Chemical compound C1=2NC(N)=NC(=O)C=2N=CN1[C@@H]1O[C@H](COP(O)(=O)OP(O)(O)=O)[C@@H](O)[C@H]1O QGWNDRXFNXRZMB-UUOKFMHZSA-N 0.000 description 6
- XKUKSGPZAADMRA-UHFFFAOYSA-N glycyl-glycyl-glycine Natural products NCC(=O)NCC(=O)NCC(O)=O XKUKSGPZAADMRA-UHFFFAOYSA-N 0.000 description 6
- QGWNDRXFNXRZMB-UHFFFAOYSA-N guanidine diphosphate Natural products C1=2NC(N)=NC(=O)C=2N=CN1C1OC(COP(O)(=O)OP(O)(O)=O)C(O)C1O QGWNDRXFNXRZMB-UHFFFAOYSA-N 0.000 description 6
- 239000002609 medium Substances 0.000 description 6
- 239000013612 plasmid Substances 0.000 description 6
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 5
- 108010047495 alanylglycine Proteins 0.000 description 5
- 108010050848 glycylleucine Proteins 0.000 description 5
- 108010057821 leucylproline Proteins 0.000 description 5
- 230000037361 pathway Effects 0.000 description 5
- 108090000623 proteins and genes Proteins 0.000 description 5
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 4
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 4
- 238000010586 diagram Methods 0.000 description 4
- MVMSCBBUIHUTGJ-UHFFFAOYSA-N 10108-97-1 Natural products C1=2NC(N)=NC(=O)C=2N=CN1C(C(C1O)O)OC1COP(O)(=O)OP(O)(=O)OC1OC(CO)C(O)C(O)C1O MVMSCBBUIHUTGJ-UHFFFAOYSA-N 0.000 description 3
- WEVYAHXRMPXWCK-UHFFFAOYSA-N Acetonitrile Chemical compound CC#N WEVYAHXRMPXWCK-UHFFFAOYSA-N 0.000 description 3
- 108090000790 Enzymes Proteins 0.000 description 3
- LQEBEXMHBLQMDB-UHFFFAOYSA-N GDP-L-fucose Natural products OC1C(O)C(O)C(C)OC1OP(O)(=O)OP(O)(=O)OCC1C(O)C(O)C(N2C3=C(C(N=C(N)N3)=O)N=C2)O1 LQEBEXMHBLQMDB-UHFFFAOYSA-N 0.000 description 3
- MVMSCBBUIHUTGJ-GDJBGNAASA-N GDP-alpha-D-mannose Chemical compound C([C@H]1O[C@H]([C@@H]([C@@H]1O)O)N1C=2N=C(NC(=O)C=2N=C1)N)OP(O)(=O)OP(O)(=O)O[C@H]1O[C@H](CO)[C@@H](O)[C@H](O)[C@@H]1O MVMSCBBUIHUTGJ-GDJBGNAASA-N 0.000 description 3
- LQEBEXMHBLQMDB-JGQUBWHWSA-N GDP-beta-L-fucose Chemical compound O[C@H]1[C@H](O)[C@H](O)[C@H](C)O[C@@H]1OP(O)(=O)OP(O)(=O)OC[C@@H]1[C@@H](O)[C@@H](O)[C@H](N2C3=C(C(NC(N)=N3)=O)N=C2)O1 LQEBEXMHBLQMDB-JGQUBWHWSA-N 0.000 description 3
- PEDCQBHIVMGVHV-UHFFFAOYSA-N Glycerine Chemical compound OCC(O)CO PEDCQBHIVMGVHV-UHFFFAOYSA-N 0.000 description 3
- NYHBQMYGNKIUIF-UUOKFMHZSA-N Guanosine Natural products C1=NC=2C(=O)NC(N)=NC=2N1[C@@H]1O[C@H](CO)[C@@H](O)[C@H]1O NYHBQMYGNKIUIF-UUOKFMHZSA-N 0.000 description 3
- YFBBUHJJUXXZOF-UWVGGRQHSA-N Leu-Gly-Pro Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N1CCC[C@H]1C(O)=O YFBBUHJJUXXZOF-UWVGGRQHSA-N 0.000 description 3
- RXGLHDWAZQECBI-SRVKXCTJSA-N Leu-Leu-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O RXGLHDWAZQECBI-SRVKXCTJSA-N 0.000 description 3
- OKKJLVBELUTLKV-UHFFFAOYSA-N Methanol Chemical compound OC OKKJLVBELUTLKV-UHFFFAOYSA-N 0.000 description 3
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 3
- 108010069020 alanyl-prolyl-glycine Proteins 0.000 description 3
- 150000001413 amino acids Chemical group 0.000 description 3
- 108010043240 arginyl-leucyl-glycine Proteins 0.000 description 3
- 108010069205 aspartyl-phenylalanine Proteins 0.000 description 3
- 230000000694 effects Effects 0.000 description 3
- 125000002446 fucosyl group Chemical group C1([C@@H](O)[C@H](O)[C@H](O)[C@@H](O1)C)* 0.000 description 3
- 108010077515 glycylproline Proteins 0.000 description 3
- 229920001542 oligosaccharide Polymers 0.000 description 3
- 108010029020 prolylglycine Proteins 0.000 description 3
- 108010031491 threonyl-lysyl-glutamic acid Proteins 0.000 description 3
- 238000012795 verification Methods 0.000 description 3
- DVWVZSJAYIJZFI-FXQIFTODSA-N Ala-Arg-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O DVWVZSJAYIJZFI-FXQIFTODSA-N 0.000 description 2
- VGPWRRFOPXVGOH-BYPYZUCNSA-N Ala-Gly-Gly Chemical compound C[C@H](N)C(=O)NCC(=O)NCC(O)=O VGPWRRFOPXVGOH-BYPYZUCNSA-N 0.000 description 2
- PDIYGFYAMZZFCW-JIOCBJNQSA-N Asp-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N)O PDIYGFYAMZZFCW-JIOCBJNQSA-N 0.000 description 2
- OKTJSMMVPCPJKN-UHFFFAOYSA-N Carbon Chemical compound [C] OKTJSMMVPCPJKN-UHFFFAOYSA-N 0.000 description 2
- MIKUYHXYGGJMLM-GIMIYPNGSA-N Crotonoside Natural products C1=NC2=C(N)NC(=O)N=C2N1[C@H]1O[C@@H](CO)[C@H](O)[C@@H]1O MIKUYHXYGGJMLM-GIMIYPNGSA-N 0.000 description 2
- NYHBQMYGNKIUIF-UHFFFAOYSA-N D-guanosine Natural products C1=2NC(N)=NC(=O)C=2N=CN1C1OC(CO)C(O)C1O NYHBQMYGNKIUIF-UHFFFAOYSA-N 0.000 description 2
- 102000004190 Enzymes Human genes 0.000 description 2
- XFAUJGNLHIGXET-AVGNSLFASA-N Gln-Leu-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O XFAUJGNLHIGXET-AVGNSLFASA-N 0.000 description 2
- HPCOBEHVEHWREJ-DCAQKATOSA-N Gln-Lys-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O HPCOBEHVEHWREJ-DCAQKATOSA-N 0.000 description 2
- UGSVSNXPJJDJKL-SDDRHHMPSA-N Glu-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N UGSVSNXPJJDJKL-SDDRHHMPSA-N 0.000 description 2
- GMVCSRBOSIUTFC-FXQIFTODSA-N Glu-Ser-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O GMVCSRBOSIUTFC-FXQIFTODSA-N 0.000 description 2
- VXEFAWJTFAUDJK-AVGNSLFASA-N Glu-Tyr-Ser Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O VXEFAWJTFAUDJK-AVGNSLFASA-N 0.000 description 2
- PUUYVMYCMIWHFE-BQBZGAKWSA-N Gly-Ala-Arg Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N PUUYVMYCMIWHFE-BQBZGAKWSA-N 0.000 description 2
- UGVQELHRNUDMAA-BYPYZUCNSA-N Gly-Ala-Gly Chemical compound [NH3+]CC(=O)N[C@@H](C)C(=O)NCC([O-])=O UGVQELHRNUDMAA-BYPYZUCNSA-N 0.000 description 2
- QSDKBRMVXSWAQE-BFHQHQDPSA-N Gly-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN QSDKBRMVXSWAQE-BFHQHQDPSA-N 0.000 description 2
- UESJMAMHDLEHGM-NHCYSSNCSA-N Gly-Ile-Leu Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O UESJMAMHDLEHGM-NHCYSSNCSA-N 0.000 description 2
- XKMLYUALXHKNFT-UUOKFMHZSA-N Guanosine-5'-triphosphate Chemical compound C1=2NC(N)=NC(=O)C=2N=CN1[C@@H]1O[C@H](COP(O)(=O)OP(O)(=O)OP(O)(O)=O)[C@@H](O)[C@H]1O XKMLYUALXHKNFT-UUOKFMHZSA-N 0.000 description 2
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 2
- 241000880493 Leptailurus serval Species 0.000 description 2
- WUFYAPWIHCUMLL-CIUDSAMLSA-N Leu-Asn-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O WUFYAPWIHCUMLL-CIUDSAMLSA-N 0.000 description 2
- PVMPDMIKUVNOBD-CIUDSAMLSA-N Leu-Asp-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O PVMPDMIKUVNOBD-CIUDSAMLSA-N 0.000 description 2
- LXKNSJLSGPNHSK-KKUMJFAQSA-N Leu-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N LXKNSJLSGPNHSK-KKUMJFAQSA-N 0.000 description 2
- KPYAOIVPJKPIOU-KKUMJFAQSA-N Leu-Lys-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O KPYAOIVPJKPIOU-KKUMJFAQSA-N 0.000 description 2
- HVAUKHLDSDDROB-KKUMJFAQSA-N Lys-Lys-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O HVAUKHLDSDDROB-KKUMJFAQSA-N 0.000 description 2
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 2
- BPCLGWHVPVTTFM-QWRGUYRKSA-N Phe-Ser-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)NCC(O)=O BPCLGWHVPVTTFM-QWRGUYRKSA-N 0.000 description 2
- FEPSEIDIPBMIOS-QXEWZRGKSA-N Pro-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 FEPSEIDIPBMIOS-QXEWZRGKSA-N 0.000 description 2
- UBRMZSHOOIVJPW-SRVKXCTJSA-N Ser-Leu-Lys Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O UBRMZSHOOIVJPW-SRVKXCTJSA-N 0.000 description 2
- YUJLIIRMIAGMCQ-CIUDSAMLSA-N Ser-Leu-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YUJLIIRMIAGMCQ-CIUDSAMLSA-N 0.000 description 2
- SYCFMSYTIFXWAJ-DCAQKATOSA-N Ser-Val-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CO)N SYCFMSYTIFXWAJ-DCAQKATOSA-N 0.000 description 2
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 2
- BSNZTJXVDOINSR-JXUBOQSCSA-N Thr-Ala-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O BSNZTJXVDOINSR-JXUBOQSCSA-N 0.000 description 2
- UDQBCBUXAQIZAK-GLLZPBPUSA-N Thr-Glu-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O UDQBCBUXAQIZAK-GLLZPBPUSA-N 0.000 description 2
- SCSVNSNWUTYSFO-WDCWCFNPSA-N Thr-Lys-Glu Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O SCSVNSNWUTYSFO-WDCWCFNPSA-N 0.000 description 2
- AKLNEFNQWLHIGY-QWRGUYRKSA-N Tyr-Gly-Asp Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)O)C(=O)O)N)O AKLNEFNQWLHIGY-QWRGUYRKSA-N 0.000 description 2
- JTWIMNMUYLQNPI-WPRPVWTQSA-N Val-Gly-Arg Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCNC(N)=N JTWIMNMUYLQNPI-WPRPVWTQSA-N 0.000 description 2
- CPGJELLYDQEDRK-NAKRPEOUSA-N Val-Ile-Ala Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](C)C(O)=O CPGJELLYDQEDRK-NAKRPEOUSA-N 0.000 description 2
- 238000000246 agarose gel electrophoresis Methods 0.000 description 2
- 108010076324 alanyl-glycyl-glycine Proteins 0.000 description 2
- 108010044940 alanylglutamine Proteins 0.000 description 2
- 108010087924 alanylproline Proteins 0.000 description 2
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 2
- 108010013835 arginine glutamate Proteins 0.000 description 2
- 108010047857 aspartylglycine Proteins 0.000 description 2
- 238000010170 biological method Methods 0.000 description 2
- 229910052799 carbon Inorganic materials 0.000 description 2
- 238000004520 electroporation Methods 0.000 description 2
- 108010049041 glutamylalanine Proteins 0.000 description 2
- 108010026364 glycyl-glycyl-leucine Proteins 0.000 description 2
- 108010089804 glycyl-threonine Proteins 0.000 description 2
- 239000001963 growth medium Substances 0.000 description 2
- 229940029575 guanosine Drugs 0.000 description 2
- 238000004128 high performance liquid chromatography Methods 0.000 description 2
- 108010092114 histidylphenylalanine Proteins 0.000 description 2
- 108010018006 histidylserine Proteins 0.000 description 2
- 238000011081 inoculation Methods 0.000 description 2
- 230000037353 metabolic pathway Effects 0.000 description 2
- 150000002482 oligosaccharides Chemical class 0.000 description 2
- 239000000843 powder Substances 0.000 description 2
- 108010070643 prolylglutamic acid Proteins 0.000 description 2
- 108010053725 prolylvaline Proteins 0.000 description 2
- 239000000243 solution Substances 0.000 description 2
- BRPMXFSTKXXNHF-IUCAKERBSA-N (2s)-1-[2-[[(2s)-pyrrolidine-2-carbonyl]amino]acetyl]pyrrolidine-2-carboxylic acid Chemical compound OC(=O)[C@@H]1CCCN1C(=O)CNC(=O)[C@H]1NCCC1 BRPMXFSTKXXNHF-IUCAKERBSA-N 0.000 description 1
- 108010044087 AS-I toxin Proteins 0.000 description 1
- SKHCUBQVZJHOFM-NAKRPEOUSA-N Ala-Arg-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O SKHCUBQVZJHOFM-NAKRPEOUSA-N 0.000 description 1
- WXERCAHAIKMTKX-ZLUOBGJFSA-N Ala-Asp-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O WXERCAHAIKMTKX-ZLUOBGJFSA-N 0.000 description 1
- WDIYWDJLXOCGRW-ACZMJKKPSA-N Ala-Asp-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WDIYWDJLXOCGRW-ACZMJKKPSA-N 0.000 description 1
- UQJUGHFKNKGHFQ-VZFHVOOUSA-N Ala-Cys-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)O)C(O)=O UQJUGHFKNKGHFQ-VZFHVOOUSA-N 0.000 description 1
- OQCPATDFWYYDDX-HGNGGELXSA-N Ala-Gln-His Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O OQCPATDFWYYDDX-HGNGGELXSA-N 0.000 description 1
- CZPAHAKGPDUIPJ-CIUDSAMLSA-N Ala-Gln-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(O)=O CZPAHAKGPDUIPJ-CIUDSAMLSA-N 0.000 description 1
- MVBWLRJESQOQTM-ACZMJKKPSA-N Ala-Gln-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O MVBWLRJESQOQTM-ACZMJKKPSA-N 0.000 description 1
- PAIHPOGPJVUFJY-WDSKDSINSA-N Ala-Glu-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O PAIHPOGPJVUFJY-WDSKDSINSA-N 0.000 description 1
- HXNNRBHASOSVPG-GUBZILKMSA-N Ala-Glu-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O HXNNRBHASOSVPG-GUBZILKMSA-N 0.000 description 1
- BEMGNWZECGIJOI-WDSKDSINSA-N Ala-Gly-Glu Chemical compound [H]N[C@@H](C)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O BEMGNWZECGIJOI-WDSKDSINSA-N 0.000 description 1
- TZDNWXDLYFIFPT-BJDJZHNGSA-N Ala-Ile-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O TZDNWXDLYFIFPT-BJDJZHNGSA-N 0.000 description 1
- OKIKVSXTXVVFDV-MMWGEVLESA-N Ala-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C)N OKIKVSXTXVVFDV-MMWGEVLESA-N 0.000 description 1
- CCDFBRZVTDDJNM-GUBZILKMSA-N Ala-Leu-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O CCDFBRZVTDDJNM-GUBZILKMSA-N 0.000 description 1
- DPNZTBKGAUAZQU-DLOVCJGASA-N Ala-Leu-His Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N DPNZTBKGAUAZQU-DLOVCJGASA-N 0.000 description 1
- XHNLCGXYBXNRIS-BJDJZHNGSA-N Ala-Lys-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O XHNLCGXYBXNRIS-BJDJZHNGSA-N 0.000 description 1
- VCSABYLVNWQYQE-UHFFFAOYSA-N Ala-Lys-Lys Natural products NCCCCC(NC(=O)C(N)C)C(=O)NC(CCCCN)C(O)=O VCSABYLVNWQYQE-UHFFFAOYSA-N 0.000 description 1
- GKAZXNDATBWNBI-DCAQKATOSA-N Ala-Met-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)O)N GKAZXNDATBWNBI-DCAQKATOSA-N 0.000 description 1
- PEIBBAXIKUAYGN-UBHSHLNASA-N Ala-Phe-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 PEIBBAXIKUAYGN-UBHSHLNASA-N 0.000 description 1
- YCRAFFCYWOUEOF-DLOVCJGASA-N Ala-Phe-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 YCRAFFCYWOUEOF-DLOVCJGASA-N 0.000 description 1
- CYBJZLQSUJEMAS-LFSVMHDDSA-N Ala-Phe-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](C)N)O CYBJZLQSUJEMAS-LFSVMHDDSA-N 0.000 description 1
- XAXHGSOBFPIRFG-LSJOCFKGSA-N Ala-Pro-His Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O XAXHGSOBFPIRFG-LSJOCFKGSA-N 0.000 description 1
- DCVYRWFAMZFSDA-ZLUOBGJFSA-N Ala-Ser-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O DCVYRWFAMZFSDA-ZLUOBGJFSA-N 0.000 description 1
- MSWSRLGNLKHDEI-ACZMJKKPSA-N Ala-Ser-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O MSWSRLGNLKHDEI-ACZMJKKPSA-N 0.000 description 1
- IOFVWPYSRSCWHI-JXUBOQSCSA-N Ala-Thr-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](C)N IOFVWPYSRSCWHI-JXUBOQSCSA-N 0.000 description 1
- BGGAIXWIZCIFSG-XDTLVQLUSA-N Ala-Tyr-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O BGGAIXWIZCIFSG-XDTLVQLUSA-N 0.000 description 1
- XAXMJQUMRJAFCH-CQDKDKBSSA-N Ala-Tyr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=C(O)C=C1 XAXMJQUMRJAFCH-CQDKDKBSSA-N 0.000 description 1
- QRIYOHQJRDHFKF-UWJYBYFXSA-N Ala-Tyr-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=C(O)C=C1 QRIYOHQJRDHFKF-UWJYBYFXSA-N 0.000 description 1
- IYKVSFNGSWTTNZ-GUBZILKMSA-N Ala-Val-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IYKVSFNGSWTTNZ-GUBZILKMSA-N 0.000 description 1
- BOKLLPVAQDSLHC-FXQIFTODSA-N Ala-Val-Cys Chemical compound C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CS)C(=O)O)N BOKLLPVAQDSLHC-FXQIFTODSA-N 0.000 description 1
- YJHKTAMKPGFJCT-NRPADANISA-N Ala-Val-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O YJHKTAMKPGFJCT-NRPADANISA-N 0.000 description 1
- VHAQSYHSDKERBS-XPUUQOCRSA-N Ala-Val-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O VHAQSYHSDKERBS-XPUUQOCRSA-N 0.000 description 1
- UXJCMQFPDWCHKX-DCAQKATOSA-N Arg-Arg-Glu Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(O)=O)C(O)=O UXJCMQFPDWCHKX-DCAQKATOSA-N 0.000 description 1
- DXQIQUIQYAGRCC-CIUDSAMLSA-N Arg-Asp-Gln Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)CN=C(N)N DXQIQUIQYAGRCC-CIUDSAMLSA-N 0.000 description 1
- TTXYKSADPSNOIF-IHRRRGAJSA-N Arg-Asp-Phe Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O TTXYKSADPSNOIF-IHRRRGAJSA-N 0.000 description 1
- JVMKBJNSRZWDBO-FXQIFTODSA-N Arg-Cys-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O JVMKBJNSRZWDBO-FXQIFTODSA-N 0.000 description 1
- MZRBYBIQTIKERR-GUBZILKMSA-N Arg-Glu-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MZRBYBIQTIKERR-GUBZILKMSA-N 0.000 description 1
- OGUPCHKBOKJFMA-SRVKXCTJSA-N Arg-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N OGUPCHKBOKJFMA-SRVKXCTJSA-N 0.000 description 1
- AQPVUEJJARLJHB-BQBZGAKWSA-N Arg-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CCCN=C(N)N AQPVUEJJARLJHB-BQBZGAKWSA-N 0.000 description 1
- CYXCAHZVPFREJD-LURJTMIESA-N Arg-Gly-Gly Chemical compound NC(=N)NCCC[C@H](N)C(=O)NCC(=O)NCC(O)=O CYXCAHZVPFREJD-LURJTMIESA-N 0.000 description 1
- GNYUVVJYGJFKHN-RVMXOQNASA-N Arg-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N GNYUVVJYGJFKHN-RVMXOQNASA-N 0.000 description 1
- YBZMTKUDWXZLIX-UWVGGRQHSA-N Arg-Leu-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O YBZMTKUDWXZLIX-UWVGGRQHSA-N 0.000 description 1
- NMRHDSAOIURTNT-RWMBFGLXSA-N Arg-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N NMRHDSAOIURTNT-RWMBFGLXSA-N 0.000 description 1
- BSYKSCBTTQKOJG-GUBZILKMSA-N Arg-Pro-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O BSYKSCBTTQKOJG-GUBZILKMSA-N 0.000 description 1
- HGKHPCFTRQDHCU-IUCAKERBSA-N Arg-Pro-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O HGKHPCFTRQDHCU-IUCAKERBSA-N 0.000 description 1
- UULLJGQFCDXVTQ-CYDGBPFRSA-N Arg-Pro-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UULLJGQFCDXVTQ-CYDGBPFRSA-N 0.000 description 1
- DNLQVHBBMPZUGJ-BQBZGAKWSA-N Arg-Ser-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O DNLQVHBBMPZUGJ-BQBZGAKWSA-N 0.000 description 1
- KMFPQTITXUKJOV-DCAQKATOSA-N Arg-Ser-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O KMFPQTITXUKJOV-DCAQKATOSA-N 0.000 description 1
- QTAIIXQCOPUNBQ-QXEWZRGKSA-N Arg-Val-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O QTAIIXQCOPUNBQ-QXEWZRGKSA-N 0.000 description 1
- BVLIJXXSXBUGEC-SRVKXCTJSA-N Asn-Asn-Tyr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O BVLIJXXSXBUGEC-SRVKXCTJSA-N 0.000 description 1
- LUVODTFFSXVOAG-ACZMJKKPSA-N Asn-Cys-Glu Chemical compound C(CC(=O)O)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC(=O)N)N LUVODTFFSXVOAG-ACZMJKKPSA-N 0.000 description 1
- ZMWDUIIACVLIHK-GHCJXIJMSA-N Asn-Cys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC(=O)N)N ZMWDUIIACVLIHK-GHCJXIJMSA-N 0.000 description 1
- GWNMUVANAWDZTI-YUMQZZPRSA-N Asn-Gly-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N GWNMUVANAWDZTI-YUMQZZPRSA-N 0.000 description 1
- BXUHCIXDSWRSBS-CIUDSAMLSA-N Asn-Leu-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O BXUHCIXDSWRSBS-CIUDSAMLSA-N 0.000 description 1
- AYOAHKWVQLNPDM-HJGDQZAQSA-N Asn-Lys-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O AYOAHKWVQLNPDM-HJGDQZAQSA-N 0.000 description 1
- YXVAESUIQFDBHN-SRVKXCTJSA-N Asn-Phe-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O YXVAESUIQFDBHN-SRVKXCTJSA-N 0.000 description 1
- YUOXLJYVSZYPBJ-CIUDSAMLSA-N Asn-Pro-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O YUOXLJYVSZYPBJ-CIUDSAMLSA-N 0.000 description 1
- UGXYFDQFLVCDFC-CIUDSAMLSA-N Asn-Ser-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC(N)=O UGXYFDQFLVCDFC-CIUDSAMLSA-N 0.000 description 1
- BIGRHVNFFJTHEB-UBHSHLNASA-N Asn-Trp-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(O)=O)C(O)=O BIGRHVNFFJTHEB-UBHSHLNASA-N 0.000 description 1
- NJPLPRFQLBZAMH-IHRRRGAJSA-N Asn-Tyr-Met Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCSC)C(O)=O NJPLPRFQLBZAMH-IHRRRGAJSA-N 0.000 description 1
- OERMIMJQPQUIPK-FXQIFTODSA-N Asp-Arg-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O OERMIMJQPQUIPK-FXQIFTODSA-N 0.000 description 1
- SBHUBSDEZQFJHJ-CIUDSAMLSA-N Asp-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(O)=O SBHUBSDEZQFJHJ-CIUDSAMLSA-N 0.000 description 1
- QQXOYLWJQUPXJU-WHFBIAKZSA-N Asp-Cys-Gly Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CS)C(=O)NCC(O)=O QQXOYLWJQUPXJU-WHFBIAKZSA-N 0.000 description 1
- YNCHFVRXEQFPBY-BQBZGAKWSA-N Asp-Gly-Arg Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N YNCHFVRXEQFPBY-BQBZGAKWSA-N 0.000 description 1
- WSXDIZFNQYTUJB-SRVKXCTJSA-N Asp-His-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(O)=O WSXDIZFNQYTUJB-SRVKXCTJSA-N 0.000 description 1
- HOBNTSHITVVNBN-ZPFDUUQYSA-N Asp-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CC(=O)O)N HOBNTSHITVVNBN-ZPFDUUQYSA-N 0.000 description 1
- LDLZOAJRXXBVGF-GMOBBJLQSA-N Asp-Ile-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC(=O)O)N LDLZOAJRXXBVGF-GMOBBJLQSA-N 0.000 description 1
- HKEZZWQWXWGASX-KKUMJFAQSA-N Asp-Leu-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 HKEZZWQWXWGASX-KKUMJFAQSA-N 0.000 description 1
- FQHBAQLBIXLWAG-DCAQKATOSA-N Asp-Lys-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)O)N FQHBAQLBIXLWAG-DCAQKATOSA-N 0.000 description 1
- LTCKTLYKRMCFOC-KKUMJFAQSA-N Asp-Phe-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O LTCKTLYKRMCFOC-KKUMJFAQSA-N 0.000 description 1
- KPSHWSWFPUDEGF-FXQIFTODSA-N Asp-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC(O)=O KPSHWSWFPUDEGF-FXQIFTODSA-N 0.000 description 1
- AHWRSSLYSGLBGD-CIUDSAMLSA-N Asp-Pro-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O AHWRSSLYSGLBGD-CIUDSAMLSA-N 0.000 description 1
- MGSVBZIBCCKGCY-ZLUOBGJFSA-N Asp-Ser-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O MGSVBZIBCCKGCY-ZLUOBGJFSA-N 0.000 description 1
- JSHWXQIZOCVWIA-ZKWXMUAHSA-N Asp-Ser-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O JSHWXQIZOCVWIA-ZKWXMUAHSA-N 0.000 description 1
- JSNWZMFSLIWAHS-HJGDQZAQSA-N Asp-Thr-Leu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O JSNWZMFSLIWAHS-HJGDQZAQSA-N 0.000 description 1
- UXIPUCUHQBIQOS-SRVKXCTJSA-N Asp-Tyr-Cys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O UXIPUCUHQBIQOS-SRVKXCTJSA-N 0.000 description 1
- 241000894006 Bacteria Species 0.000 description 1
- 108010006654 Bleomycin Proteins 0.000 description 1
- 108700010070 Codon Usage Proteins 0.000 description 1
- AMRLSQGGERHDHJ-FXQIFTODSA-N Cys-Ala-Arg Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O AMRLSQGGERHDHJ-FXQIFTODSA-N 0.000 description 1
- GRNOCLDFUNCIDW-ACZMJKKPSA-N Cys-Ala-Glu Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CS)N GRNOCLDFUNCIDW-ACZMJKKPSA-N 0.000 description 1
- UXIYYUMGFNSGBK-XPUUQOCRSA-N Cys-Gly-Val Chemical compound [H]N[C@@H](CS)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O UXIYYUMGFNSGBK-XPUUQOCRSA-N 0.000 description 1
- CUXIOFHFFXNUGG-HTFCKZLJSA-N Cys-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)NC(=O)[C@H](CS)N CUXIOFHFFXNUGG-HTFCKZLJSA-N 0.000 description 1
- BLGNLNRBABWDST-CIUDSAMLSA-N Cys-Leu-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CS)N BLGNLNRBABWDST-CIUDSAMLSA-N 0.000 description 1
- WVLZTXGTNGHPBO-SRVKXCTJSA-N Cys-Leu-Leu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O WVLZTXGTNGHPBO-SRVKXCTJSA-N 0.000 description 1
- HBHMVBGGHDMPBF-GARJFASQSA-N Cys-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CS)N HBHMVBGGHDMPBF-GARJFASQSA-N 0.000 description 1
- SRIRHERUAMYIOQ-CIUDSAMLSA-N Cys-Leu-Ser Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O SRIRHERUAMYIOQ-CIUDSAMLSA-N 0.000 description 1
- SMEYEQDCCBHTEF-FXQIFTODSA-N Cys-Pro-Ala Chemical compound [H]N[C@@H](CS)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O SMEYEQDCCBHTEF-FXQIFTODSA-N 0.000 description 1
- JTEGHEWKBCTIAL-IXOXFDKPSA-N Cys-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CS)N)O JTEGHEWKBCTIAL-IXOXFDKPSA-N 0.000 description 1
- FCXJJTRGVAZDER-FXQIFTODSA-N Cys-Val-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O FCXJJTRGVAZDER-FXQIFTODSA-N 0.000 description 1
- 108010090461 DFG peptide Proteins 0.000 description 1
- 101150086992 FCSK gene Proteins 0.000 description 1
- 101150112609 FPGT gene Proteins 0.000 description 1
- 229920000855 Fucoidan Polymers 0.000 description 1
- 108010045674 Fucose-1-phosphate guanylyltransferase Proteins 0.000 description 1
- 102100037047 Fucose-1-phosphate guanylyltransferase Human genes 0.000 description 1
- 102000006471 Fucosyltransferases Human genes 0.000 description 1
- 108010019236 Fucosyltransferases Proteins 0.000 description 1
- 108010092526 GKPV peptide Proteins 0.000 description 1
- 108700039691 Genetic Promoter Regions Proteins 0.000 description 1
- KWUSGAIFNHQCBY-DCAQKATOSA-N Gln-Arg-Arg Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O KWUSGAIFNHQCBY-DCAQKATOSA-N 0.000 description 1
- PRBLYKYHAJEABA-SRVKXCTJSA-N Gln-Arg-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O PRBLYKYHAJEABA-SRVKXCTJSA-N 0.000 description 1
- INFBPLSHYFALDE-ACZMJKKPSA-N Gln-Asn-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O INFBPLSHYFALDE-ACZMJKKPSA-N 0.000 description 1
- QYTKAVBFRUGYAU-ACZMJKKPSA-N Gln-Asp-Asn Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O QYTKAVBFRUGYAU-ACZMJKKPSA-N 0.000 description 1
- QYKBTDOAMKORGL-FXQIFTODSA-N Gln-Gln-Asp Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N QYKBTDOAMKORGL-FXQIFTODSA-N 0.000 description 1
- KVXVVDFOZNYYKZ-DCAQKATOSA-N Gln-Gln-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O KVXVVDFOZNYYKZ-DCAQKATOSA-N 0.000 description 1
- MCAVASRGVBVPMX-FXQIFTODSA-N Gln-Glu-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O MCAVASRGVBVPMX-FXQIFTODSA-N 0.000 description 1
- PNENQZWRFMUZOM-DCAQKATOSA-N Gln-Glu-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O PNENQZWRFMUZOM-DCAQKATOSA-N 0.000 description 1
- VSXBYIJUAXPAAL-WDSKDSINSA-N Gln-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CCC(N)=O VSXBYIJUAXPAAL-WDSKDSINSA-N 0.000 description 1
- FGYPOQPQTUNESW-IUCAKERBSA-N Gln-Gly-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCC(=O)N)N FGYPOQPQTUNESW-IUCAKERBSA-N 0.000 description 1
- JXFLPKSDLDEOQK-JHEQGTHGSA-N Gln-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCC(N)=O JXFLPKSDLDEOQK-JHEQGTHGSA-N 0.000 description 1
- JNEITCMDYWKPIW-GUBZILKMSA-N Gln-His-Cys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)N)N JNEITCMDYWKPIW-GUBZILKMSA-N 0.000 description 1
- FTIJVMLAGRAYMJ-MNXVOIDGSA-N Gln-Ile-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(N)=O FTIJVMLAGRAYMJ-MNXVOIDGSA-N 0.000 description 1
- LGIKBBLQVSWUGK-DCAQKATOSA-N Gln-Leu-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O LGIKBBLQVSWUGK-DCAQKATOSA-N 0.000 description 1
- ZBKUIQNCRIYVGH-SDDRHHMPSA-N Gln-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)N)N ZBKUIQNCRIYVGH-SDDRHHMPSA-N 0.000 description 1
- GURIQZQSTBBHRV-SRVKXCTJSA-N Gln-Lys-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GURIQZQSTBBHRV-SRVKXCTJSA-N 0.000 description 1
- NPMFDZGLKBNFOO-SRVKXCTJSA-N Gln-Pro-His Chemical compound NC(=O)CC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CN=CN1 NPMFDZGLKBNFOO-SRVKXCTJSA-N 0.000 description 1
- SXFPZRRVWSUYII-KBIXCLLPSA-N Gln-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)N)N SXFPZRRVWSUYII-KBIXCLLPSA-N 0.000 description 1
- LPIKVBWNNVFHCQ-GUBZILKMSA-N Gln-Ser-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O LPIKVBWNNVFHCQ-GUBZILKMSA-N 0.000 description 1
- OTQSTOXRUBVWAP-NRPADANISA-N Gln-Ser-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O OTQSTOXRUBVWAP-NRPADANISA-N 0.000 description 1
- XMWNHGKDDIFXQJ-NWLDYVSISA-N Gln-Thr-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O XMWNHGKDDIFXQJ-NWLDYVSISA-N 0.000 description 1
- RNPGPFAVRLERPP-QEJZJMRPSA-N Gln-Trp-Asn Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(N)=O)C(O)=O RNPGPFAVRLERPP-QEJZJMRPSA-N 0.000 description 1
- NVHJGTGTUGEWCG-ZVZYQTTQSA-N Gln-Trp-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C(C)C)C(O)=O NVHJGTGTUGEWCG-ZVZYQTTQSA-N 0.000 description 1
- JKDBRTNMYXYLHO-JYJNAYRXSA-N Gln-Tyr-Leu Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)CC1=CC=C(O)C=C1 JKDBRTNMYXYLHO-JYJNAYRXSA-N 0.000 description 1
- VCUNGPMMPNJSGS-JYJNAYRXSA-N Gln-Tyr-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O VCUNGPMMPNJSGS-JYJNAYRXSA-N 0.000 description 1
- QGWXAMDECCKGRU-XVKPBYJWSA-N Gln-Val-Gly Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)CCC(N)=O)C(=O)NCC(O)=O QGWXAMDECCKGRU-XVKPBYJWSA-N 0.000 description 1
- VYOILACOFPPNQH-UMNHJUIQSA-N Gln-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)N)N VYOILACOFPPNQH-UMNHJUIQSA-N 0.000 description 1
- SOEXCCGNHQBFPV-DLOVCJGASA-N Gln-Val-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O SOEXCCGNHQBFPV-DLOVCJGASA-N 0.000 description 1
- LKDIBBOKUAASNP-FXQIFTODSA-N Glu-Ala-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O LKDIBBOKUAASNP-FXQIFTODSA-N 0.000 description 1
- ITYRYNUZHPNCIK-GUBZILKMSA-N Glu-Ala-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O ITYRYNUZHPNCIK-GUBZILKMSA-N 0.000 description 1
- DSPQRJXOIXHOHK-WDSKDSINSA-N Glu-Asp-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O DSPQRJXOIXHOHK-WDSKDSINSA-N 0.000 description 1
- UMIRPYLZFKOEOH-YVNDNENWSA-N Glu-Gln-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UMIRPYLZFKOEOH-YVNDNENWSA-N 0.000 description 1
- HNVFSTLPVJWIDV-CIUDSAMLSA-N Glu-Glu-Gln Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O HNVFSTLPVJWIDV-CIUDSAMLSA-N 0.000 description 1
- LYCDZGLXQBPNQU-WDSKDSINSA-N Glu-Gly-Cys Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CS)C(O)=O LYCDZGLXQBPNQU-WDSKDSINSA-N 0.000 description 1
- DVLZZEPUNFEUBW-AVGNSLFASA-N Glu-His-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCC(=O)O)N DVLZZEPUNFEUBW-AVGNSLFASA-N 0.000 description 1
- ZWABFSSWTSAMQN-KBIXCLLPSA-N Glu-Ile-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O ZWABFSSWTSAMQN-KBIXCLLPSA-N 0.000 description 1
- ATVYZJGOZLVXDK-IUCAKERBSA-N Glu-Leu-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O ATVYZJGOZLVXDK-IUCAKERBSA-N 0.000 description 1
- IVGJYOOGJLFKQE-AVGNSLFASA-N Glu-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N IVGJYOOGJLFKQE-AVGNSLFASA-N 0.000 description 1
- YKBUCXNNBYZYAY-MNXVOIDGSA-N Glu-Lys-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O YKBUCXNNBYZYAY-MNXVOIDGSA-N 0.000 description 1
- ILWHFUZZCFYSKT-AVGNSLFASA-N Glu-Lys-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O ILWHFUZZCFYSKT-AVGNSLFASA-N 0.000 description 1
- MCGNJCNXIMQCMN-DCAQKATOSA-N Glu-Met-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CCSC)NC(=O)[C@@H](N)CCC(O)=O MCGNJCNXIMQCMN-DCAQKATOSA-N 0.000 description 1
- TWYFJOHWGCCRIR-DCAQKATOSA-N Glu-Pro-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O TWYFJOHWGCCRIR-DCAQKATOSA-N 0.000 description 1
- DXVOKNVIKORTHQ-GUBZILKMSA-N Glu-Pro-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O DXVOKNVIKORTHQ-GUBZILKMSA-N 0.000 description 1
- HZISRJBYZAODRV-XQXXSGGOSA-N Glu-Thr-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O HZISRJBYZAODRV-XQXXSGGOSA-N 0.000 description 1
- QCMVGXDELYMZET-GLLZPBPUSA-N Glu-Thr-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QCMVGXDELYMZET-GLLZPBPUSA-N 0.000 description 1
- HBMRTXJZQDVRFT-DZKIICNBSA-N Glu-Tyr-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O HBMRTXJZQDVRFT-DZKIICNBSA-N 0.000 description 1
- LZEUDRYSAZAJIO-AUTRQRHGSA-N Glu-Val-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O LZEUDRYSAZAJIO-AUTRQRHGSA-N 0.000 description 1
- FVGOGEGGQLNZGH-DZKIICNBSA-N Glu-Val-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 FVGOGEGGQLNZGH-DZKIICNBSA-N 0.000 description 1
- KKBWDNZXYLGJEY-UHFFFAOYSA-N Gly-Arg-Pro Natural products NCC(=O)NC(CCNC(=N)N)C(=O)N1CCCC1C(=O)O KKBWDNZXYLGJEY-UHFFFAOYSA-N 0.000 description 1
- GGEJHJIXRBTJPD-BYPYZUCNSA-N Gly-Asn-Gly Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O GGEJHJIXRBTJPD-BYPYZUCNSA-N 0.000 description 1
- OCDLPQDYTJPWNG-YUMQZZPRSA-N Gly-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)CN OCDLPQDYTJPWNG-YUMQZZPRSA-N 0.000 description 1
- LURCIJSJAKFCRO-QWRGUYRKSA-N Gly-Asn-Tyr Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O LURCIJSJAKFCRO-QWRGUYRKSA-N 0.000 description 1
- FZQLXNIMCPJVJE-YUMQZZPRSA-N Gly-Asp-Leu Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O FZQLXNIMCPJVJE-YUMQZZPRSA-N 0.000 description 1
- ZRZILYKEJBMFHY-BQBZGAKWSA-N Gly-Asp-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)CN ZRZILYKEJBMFHY-BQBZGAKWSA-N 0.000 description 1
- XXGQRGQPGFYECI-WDSKDSINSA-N Gly-Cys-Glu Chemical compound NCC(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CCC(O)=O XXGQRGQPGFYECI-WDSKDSINSA-N 0.000 description 1
- IANBSEOVTQNGBZ-BQBZGAKWSA-N Gly-Cys-Met Chemical compound [H]NCC(=O)N[C@@H](CS)C(=O)N[C@@H](CCSC)C(O)=O IANBSEOVTQNGBZ-BQBZGAKWSA-N 0.000 description 1
- QCTLGOYODITHPQ-WHFBIAKZSA-N Gly-Cys-Ser Chemical compound [H]NCC(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O QCTLGOYODITHPQ-WHFBIAKZSA-N 0.000 description 1
- VUUOMYFPWDYETE-WDSKDSINSA-N Gly-Gln-Cys Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)CN VUUOMYFPWDYETE-WDSKDSINSA-N 0.000 description 1
- KAJAOGBVWCYGHZ-JTQLQIEISA-N Gly-Gly-Phe Chemical compound [NH3+]CC(=O)NCC(=O)N[C@H](C([O-])=O)CC1=CC=CC=C1 KAJAOGBVWCYGHZ-JTQLQIEISA-N 0.000 description 1
- CQIIXEHDSZUSAG-QWRGUYRKSA-N Gly-His-His Chemical compound C([C@H](NC(=O)CN)C(=O)N[C@@H](CC=1NC=NC=1)C(O)=O)C1=CN=CN1 CQIIXEHDSZUSAG-QWRGUYRKSA-N 0.000 description 1
- PAWIVEIWWYGBAM-YUMQZZPRSA-N Gly-Leu-Ala Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O PAWIVEIWWYGBAM-YUMQZZPRSA-N 0.000 description 1
- CCBIBMKQNXHNIN-ZETCQYMHSA-N Gly-Leu-Gly Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O CCBIBMKQNXHNIN-ZETCQYMHSA-N 0.000 description 1
- YSDLIYZLOTZZNP-UWVGGRQHSA-N Gly-Leu-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)CN YSDLIYZLOTZZNP-UWVGGRQHSA-N 0.000 description 1
- VBOBNHSVQKKTOT-YUMQZZPRSA-N Gly-Lys-Ala Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O VBOBNHSVQKKTOT-YUMQZZPRSA-N 0.000 description 1
- FXLVSYVJDPCIHH-STQMWFEESA-N Gly-Phe-Arg Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FXLVSYVJDPCIHH-STQMWFEESA-N 0.000 description 1
- SSFWXSNOKDZNHY-QXEWZRGKSA-N Gly-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN SSFWXSNOKDZNHY-QXEWZRGKSA-N 0.000 description 1
- SOEGEPHNZOISMT-BYPYZUCNSA-N Gly-Ser-Gly Chemical compound NCC(=O)N[C@@H](CO)C(=O)NCC(O)=O SOEGEPHNZOISMT-BYPYZUCNSA-N 0.000 description 1
- WNGHUXFWEWTKAO-YUMQZZPRSA-N Gly-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)CN WNGHUXFWEWTKAO-YUMQZZPRSA-N 0.000 description 1
- ABPRMMYHROQBLY-NKWVEPMBSA-N Gly-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)CN)C(=O)O ABPRMMYHROQBLY-NKWVEPMBSA-N 0.000 description 1
- LCRDMSSAKLTKBU-ZDLURKLDSA-N Gly-Ser-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)CN LCRDMSSAKLTKBU-ZDLURKLDSA-N 0.000 description 1
- FKYQEVBRZSFAMJ-QWRGUYRKSA-N Gly-Ser-Tyr Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 FKYQEVBRZSFAMJ-QWRGUYRKSA-N 0.000 description 1
- ZLCLYFGMKFCDCN-XPUUQOCRSA-N Gly-Ser-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CO)NC(=O)CN)C(O)=O ZLCLYFGMKFCDCN-XPUUQOCRSA-N 0.000 description 1
- MYXNLWDWWOTERK-BHNWBGBOSA-N Gly-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN)O MYXNLWDWWOTERK-BHNWBGBOSA-N 0.000 description 1
- FOKISINOENBSDM-WLTAIBSBSA-N Gly-Thr-Tyr Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O FOKISINOENBSDM-WLTAIBSBSA-N 0.000 description 1
- RIYIFUFFFBIOEU-KBPBESRZSA-N Gly-Tyr-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 RIYIFUFFFBIOEU-KBPBESRZSA-N 0.000 description 1
- DNAZKGFYFRGZIH-QWRGUYRKSA-N Gly-Tyr-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 DNAZKGFYFRGZIH-QWRGUYRKSA-N 0.000 description 1
- BNMRSWQOHIQTFL-JSGCOSHPSA-N Gly-Val-Phe Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 BNMRSWQOHIQTFL-JSGCOSHPSA-N 0.000 description 1
- COZMNNJEGNPDED-HOCLYGCPSA-N Gly-Val-Trp Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O COZMNNJEGNPDED-HOCLYGCPSA-N 0.000 description 1
- RVKIPWVMZANZLI-UHFFFAOYSA-N H-Lys-Trp-OH Natural products C1=CC=C2C(CC(NC(=O)C(N)CCCCN)C(O)=O)=CNC2=C1 RVKIPWVMZANZLI-UHFFFAOYSA-N 0.000 description 1
- CJGDTAHEMXLRMB-ULQDDVLXSA-N His-Arg-Phe Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O CJGDTAHEMXLRMB-ULQDDVLXSA-N 0.000 description 1
- JWTKVPMQCCRPQY-SRVKXCTJSA-N His-Asn-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O JWTKVPMQCCRPQY-SRVKXCTJSA-N 0.000 description 1
- YADRBUZBKHHDAO-XPUUQOCRSA-N His-Gly-Ala Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H](C)C(O)=O YADRBUZBKHHDAO-XPUUQOCRSA-N 0.000 description 1
- FDQYIRHBVVUTJF-ZETCQYMHSA-N His-Gly-Gly Chemical compound [O-]C(=O)CNC(=O)CNC(=O)[C@@H]([NH3+])CC1=CN=CN1 FDQYIRHBVVUTJF-ZETCQYMHSA-N 0.000 description 1
- FYTCLUIYTYFGPT-YUMQZZPRSA-N His-Gly-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H](CO)C(O)=O FYTCLUIYTYFGPT-YUMQZZPRSA-N 0.000 description 1
- NTXIJPDAHXSHNL-ONGXEEELSA-N His-Gly-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O NTXIJPDAHXSHNL-ONGXEEELSA-N 0.000 description 1
- NDKSHNQINMRKHT-PEXQALLHSA-N His-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC1=CN=CN1)N NDKSHNQINMRKHT-PEXQALLHSA-N 0.000 description 1
- SKYULSWNBYAQMG-IHRRRGAJSA-N His-Leu-Arg Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O SKYULSWNBYAQMG-IHRRRGAJSA-N 0.000 description 1
- YEKYGQZUBCRNGH-DCAQKATOSA-N His-Pro-Ser Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC2=CN=CN2)N)C(=O)N[C@@H](CO)C(=O)O YEKYGQZUBCRNGH-DCAQKATOSA-N 0.000 description 1
- ZHHLTWUOWXHVQJ-YUMQZZPRSA-N His-Ser-Gly Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CO)C(=O)NCC(=O)O)N ZHHLTWUOWXHVQJ-YUMQZZPRSA-N 0.000 description 1
- PZAJPILZRFPYJJ-SRVKXCTJSA-N His-Ser-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O PZAJPILZRFPYJJ-SRVKXCTJSA-N 0.000 description 1
- ZHMZWSFQRUGLEC-JYJNAYRXSA-N His-Tyr-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZHMZWSFQRUGLEC-JYJNAYRXSA-N 0.000 description 1
- SACHLUOUHCVIKI-GMOBBJLQSA-N Ile-Arg-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N SACHLUOUHCVIKI-GMOBBJLQSA-N 0.000 description 1
- ASCFJMSGKUIRDU-ZPFDUUQYSA-N Ile-Arg-Gln Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O ASCFJMSGKUIRDU-ZPFDUUQYSA-N 0.000 description 1
- RPZFUIQVAPZLRH-GHCJXIJMSA-N Ile-Asp-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](C)C(=O)O)N RPZFUIQVAPZLRH-GHCJXIJMSA-N 0.000 description 1
- LOXMWQOKYBGCHF-JBDRJPRFSA-N Ile-Cys-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](C)C(O)=O LOXMWQOKYBGCHF-JBDRJPRFSA-N 0.000 description 1
- BSWLQVGEVFYGIM-ZPFDUUQYSA-N Ile-Gln-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N BSWLQVGEVFYGIM-ZPFDUUQYSA-N 0.000 description 1
- PHIXPNQDGGILMP-YVNDNENWSA-N Ile-Glu-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N PHIXPNQDGGILMP-YVNDNENWSA-N 0.000 description 1
- LPXHYGGZJOCAFR-MNXVOIDGSA-N Ile-Glu-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N LPXHYGGZJOCAFR-MNXVOIDGSA-N 0.000 description 1
- DFFTXLCCDFYRKD-MBLNEYKQSA-N Ile-Gly-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(=O)O)N DFFTXLCCDFYRKD-MBLNEYKQSA-N 0.000 description 1
- KLBVGHCGHUNHEA-BJDJZHNGSA-N Ile-Leu-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)O)N KLBVGHCGHUNHEA-BJDJZHNGSA-N 0.000 description 1
- GAZGFPOZOLEYAJ-YTFOTSKYSA-N Ile-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N GAZGFPOZOLEYAJ-YTFOTSKYSA-N 0.000 description 1
- HPCFRQWLTRDGHT-AJNGGQMLSA-N Ile-Leu-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O HPCFRQWLTRDGHT-AJNGGQMLSA-N 0.000 description 1
- PHRWFSFCNJPWRO-PPCPHDFISA-N Ile-Leu-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N PHRWFSFCNJPWRO-PPCPHDFISA-N 0.000 description 1
- USXAYNCLFSUSBA-MGHWNKPDSA-N Ile-Phe-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N USXAYNCLFSUSBA-MGHWNKPDSA-N 0.000 description 1
- XLXPYSDGMXTTNQ-DKIMLUQUSA-N Ile-Phe-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@@H](CC(C)C)C(O)=O XLXPYSDGMXTTNQ-DKIMLUQUSA-N 0.000 description 1
- XLXPYSDGMXTTNQ-UHFFFAOYSA-N Ile-Phe-Leu Natural products CCC(C)C(N)C(=O)NC(C(=O)NC(CC(C)C)C(O)=O)CC1=CC=CC=C1 XLXPYSDGMXTTNQ-UHFFFAOYSA-N 0.000 description 1
- FHPZJWJWTWZKNA-LLLHUVSDSA-N Ile-Phe-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N2CCC[C@@H]2C(=O)O)N FHPZJWJWTWZKNA-LLLHUVSDSA-N 0.000 description 1
- ZLFNNVATRMCAKN-ZKWXMUAHSA-N Ile-Ser-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)NCC(=O)O)N ZLFNNVATRMCAKN-ZKWXMUAHSA-N 0.000 description 1
- CNMOKANDJMLAIF-CIQUZCHMSA-N Ile-Thr-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O CNMOKANDJMLAIF-CIQUZCHMSA-N 0.000 description 1
- PZWBBXHHUSIGKH-OSUNSFLBSA-N Ile-Thr-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N PZWBBXHHUSIGKH-OSUNSFLBSA-N 0.000 description 1
- DGTOKVBDZXJHNZ-WZLNRYEVSA-N Ile-Thr-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N DGTOKVBDZXJHNZ-WZLNRYEVSA-N 0.000 description 1
- QHUREMVLLMNUAX-OSUNSFLBSA-N Ile-Thr-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)O)N QHUREMVLLMNUAX-OSUNSFLBSA-N 0.000 description 1
- NXRNRBOKDBIVKQ-CXTHYWKRSA-N Ile-Tyr-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)N NXRNRBOKDBIVKQ-CXTHYWKRSA-N 0.000 description 1
- YJRSIJZUIUANHO-NAKRPEOUSA-N Ile-Val-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(=O)O)N YJRSIJZUIUANHO-NAKRPEOUSA-N 0.000 description 1
- NUEHSWNAFIEBCQ-NAKRPEOUSA-N Ile-Val-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CS)C(=O)O)N NUEHSWNAFIEBCQ-NAKRPEOUSA-N 0.000 description 1
- UYODHPPSCXBNCS-XUXIUFHCSA-N Ile-Val-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC(C)C UYODHPPSCXBNCS-XUXIUFHCSA-N 0.000 description 1
- WIYDLTIBHZSPKY-HJWJTTGWSA-N Ile-Val-Phe Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 WIYDLTIBHZSPKY-HJWJTTGWSA-N 0.000 description 1
- 206010061218 Inflammation Diseases 0.000 description 1
- 108010065920 Insulin Lispro Proteins 0.000 description 1
- PMGDADKJMCOXHX-UHFFFAOYSA-N L-Arginyl-L-glutamin-acetat Natural products NC(=N)NCCCC(N)C(=O)NC(CCC(N)=O)C(O)=O PMGDADKJMCOXHX-UHFFFAOYSA-N 0.000 description 1
- SHZGCJCMOBCMKK-PQMKYFCFSA-N L-Fucose Natural products C[C@H]1O[C@H](O)[C@@H](O)[C@@H](O)[C@@H]1O SHZGCJCMOBCMKK-PQMKYFCFSA-N 0.000 description 1
- UGTHTQWIQKEDEH-BQBZGAKWSA-N L-alanyl-L-prolylglycine zwitterion Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O UGTHTQWIQKEDEH-BQBZGAKWSA-N 0.000 description 1
- PNNNRSAQSRJVSB-UHFFFAOYSA-N L-rhamnose Natural products CC(O)C(O)C(O)C(O)C=O PNNNRSAQSRJVSB-UHFFFAOYSA-N 0.000 description 1
- MJOZZTKJZQFKDK-GUBZILKMSA-N Leu-Ala-Gln Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(N)=O MJOZZTKJZQFKDK-GUBZILKMSA-N 0.000 description 1
- KWTVLKBOQATPHJ-SRVKXCTJSA-N Leu-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(C)C)N KWTVLKBOQATPHJ-SRVKXCTJSA-N 0.000 description 1
- DQPQTXMIRBUWKO-DCAQKATOSA-N Leu-Ala-Met Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC(C)C)N DQPQTXMIRBUWKO-DCAQKATOSA-N 0.000 description 1
- XIRYQRLFHWWWTC-QEJZJMRPSA-N Leu-Ala-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 XIRYQRLFHWWWTC-QEJZJMRPSA-N 0.000 description 1
- WSGXUIQTEZDVHJ-GARJFASQSA-N Leu-Ala-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@@H]1C(O)=O WSGXUIQTEZDVHJ-GARJFASQSA-N 0.000 description 1
- XBBKIIGCUMBKCO-JXUBOQSCSA-N Leu-Ala-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XBBKIIGCUMBKCO-JXUBOQSCSA-N 0.000 description 1
- JUWJEAPUNARGCF-DCAQKATOSA-N Leu-Arg-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O JUWJEAPUNARGCF-DCAQKATOSA-N 0.000 description 1
- NTRAGDHVSGKUSF-AVGNSLFASA-N Leu-Arg-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O NTRAGDHVSGKUSF-AVGNSLFASA-N 0.000 description 1
- CNNQBZRGQATKNY-DCAQKATOSA-N Leu-Arg-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CS)C(=O)O)N CNNQBZRGQATKNY-DCAQKATOSA-N 0.000 description 1
- KSZCCRIGNVSHFH-UWVGGRQHSA-N Leu-Arg-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O KSZCCRIGNVSHFH-UWVGGRQHSA-N 0.000 description 1
- IGUOAYLTQJLPPD-DCAQKATOSA-N Leu-Asn-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IGUOAYLTQJLPPD-DCAQKATOSA-N 0.000 description 1
- RIMMMMYKGIBOSN-DCAQKATOSA-N Leu-Asn-Met Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O RIMMMMYKGIBOSN-DCAQKATOSA-N 0.000 description 1
- TWQIYNGNYNJUFM-NHCYSSNCSA-N Leu-Asn-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O TWQIYNGNYNJUFM-NHCYSSNCSA-N 0.000 description 1
- BPANDPNDMJHFEV-CIUDSAMLSA-N Leu-Asp-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O BPANDPNDMJHFEV-CIUDSAMLSA-N 0.000 description 1
- MMEDVBWCMGRKKC-GARJFASQSA-N Leu-Asp-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N MMEDVBWCMGRKKC-GARJFASQSA-N 0.000 description 1
- IIKJNQWOQIWWMR-CIUDSAMLSA-N Leu-Cys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC(C)C)N IIKJNQWOQIWWMR-CIUDSAMLSA-N 0.000 description 1
- VQPPIMUZCZCOIL-GUBZILKMSA-N Leu-Gln-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O VQPPIMUZCZCOIL-GUBZILKMSA-N 0.000 description 1
- LOLUPZNNADDTAA-AVGNSLFASA-N Leu-Gln-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LOLUPZNNADDTAA-AVGNSLFASA-N 0.000 description 1
- GLBNEGIOFRVRHO-JYJNAYRXSA-N Leu-Gln-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O GLBNEGIOFRVRHO-JYJNAYRXSA-N 0.000 description 1
- CQGSYZCULZMEDE-UHFFFAOYSA-N Leu-Gln-Pro Natural products CC(C)CC(N)C(=O)NC(CCC(N)=O)C(=O)N1CCCC1C(O)=O CQGSYZCULZMEDE-UHFFFAOYSA-N 0.000 description 1
- GPICTNQYKHHHTH-GUBZILKMSA-N Leu-Gln-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O GPICTNQYKHHHTH-GUBZILKMSA-N 0.000 description 1
- WMTOVWLLDGQGCV-GUBZILKMSA-N Leu-Glu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N WMTOVWLLDGQGCV-GUBZILKMSA-N 0.000 description 1
- NEEOBPIXKWSBRF-IUCAKERBSA-N Leu-Glu-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O NEEOBPIXKWSBRF-IUCAKERBSA-N 0.000 description 1
- HPBCTWSUJOGJSH-MNXVOIDGSA-N Leu-Glu-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HPBCTWSUJOGJSH-MNXVOIDGSA-N 0.000 description 1
- ZFNLIDNJUWNIJL-WDCWCFNPSA-N Leu-Glu-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZFNLIDNJUWNIJL-WDCWCFNPSA-N 0.000 description 1
- HVJVUYQWFYMGJS-GVXVVHGQSA-N Leu-Glu-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O HVJVUYQWFYMGJS-GVXVVHGQSA-N 0.000 description 1
- POZULHZYLPGXMR-ONGXEEELSA-N Leu-Gly-Val Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O POZULHZYLPGXMR-ONGXEEELSA-N 0.000 description 1
- YWYQSLOTVIRCFE-SRVKXCTJSA-N Leu-His-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(O)=O YWYQSLOTVIRCFE-SRVKXCTJSA-N 0.000 description 1
- KVOFSTUWVSQMDK-KKUMJFAQSA-N Leu-His-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(C)C)CC1=CN=CN1 KVOFSTUWVSQMDK-KKUMJFAQSA-N 0.000 description 1
- MPSBSKHOWJQHBS-IHRRRGAJSA-N Leu-His-Met Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CCSC)C(=O)O)N MPSBSKHOWJQHBS-IHRRRGAJSA-N 0.000 description 1
- OYQUOLRTJHWVSQ-SRVKXCTJSA-N Leu-His-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O OYQUOLRTJHWVSQ-SRVKXCTJSA-N 0.000 description 1
- IFMPDNRWZZEZSL-SRVKXCTJSA-N Leu-Leu-Cys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(O)=O IFMPDNRWZZEZSL-SRVKXCTJSA-N 0.000 description 1
- YOKVEHGYYQEQOP-QWRGUYRKSA-N Leu-Leu-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O YOKVEHGYYQEQOP-QWRGUYRKSA-N 0.000 description 1
- FKQPWMZLIIATBA-AJNGGQMLSA-N Leu-Lys-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FKQPWMZLIIATBA-AJNGGQMLSA-N 0.000 description 1
- VCHVSKNMTXWIIP-SRVKXCTJSA-N Leu-Lys-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O VCHVSKNMTXWIIP-SRVKXCTJSA-N 0.000 description 1
- LZHJZLHSRGWBBE-IHRRRGAJSA-N Leu-Lys-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O LZHJZLHSRGWBBE-IHRRRGAJSA-N 0.000 description 1
- CPONGMJGVIAWEH-DCAQKATOSA-N Leu-Met-Ala Chemical compound CSCC[C@H](NC(=O)[C@@H](N)CC(C)C)C(=O)N[C@@H](C)C(O)=O CPONGMJGVIAWEH-DCAQKATOSA-N 0.000 description 1
- IZPVWNSAVUQBGP-CIUDSAMLSA-N Leu-Ser-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O IZPVWNSAVUQBGP-CIUDSAMLSA-N 0.000 description 1
- DAYQSYGBCUKVKT-VOAKCMCISA-N Leu-Thr-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O DAYQSYGBCUKVKT-VOAKCMCISA-N 0.000 description 1
- ODRREERHVHMIPT-OEAJRASXSA-N Leu-Thr-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ODRREERHVHMIPT-OEAJRASXSA-N 0.000 description 1
- ILDSIMPXNFWKLH-KATARQTJSA-N Leu-Thr-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O ILDSIMPXNFWKLH-KATARQTJSA-N 0.000 description 1
- YWFZWQKWNDOWPA-XIRDDKMYSA-N Leu-Trp-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(N)=O)C(O)=O YWFZWQKWNDOWPA-XIRDDKMYSA-N 0.000 description 1
- FPFOYSCDUWTZBF-IHPCNDPISA-N Leu-Trp-Leu Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H]([NH3+])CC(C)C)C(=O)N[C@@H](CC(C)C)C([O-])=O)=CNC2=C1 FPFOYSCDUWTZBF-IHPCNDPISA-N 0.000 description 1
- JGKHAFUAPZCCDU-BZSNNMDCSA-N Leu-Tyr-Leu Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C([O-])=O)CC1=CC=C(O)C=C1 JGKHAFUAPZCCDU-BZSNNMDCSA-N 0.000 description 1
- FBNPMTNBFFAMMH-AVGNSLFASA-N Leu-Val-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N FBNPMTNBFFAMMH-AVGNSLFASA-N 0.000 description 1
- FBNPMTNBFFAMMH-UHFFFAOYSA-N Leu-Val-Arg Natural products CC(C)CC(N)C(=O)NC(C(C)C)C(=O)NC(C(O)=O)CCCN=C(N)N FBNPMTNBFFAMMH-UHFFFAOYSA-N 0.000 description 1
- CGHXMODRYJISSK-NHCYSSNCSA-N Leu-Val-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC(O)=O CGHXMODRYJISSK-NHCYSSNCSA-N 0.000 description 1
- AIMGJYMCTAABEN-GVXVVHGQSA-N Leu-Val-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O AIMGJYMCTAABEN-GVXVVHGQSA-N 0.000 description 1
- VKVDRTGWLVZJOM-DCAQKATOSA-N Leu-Val-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O VKVDRTGWLVZJOM-DCAQKATOSA-N 0.000 description 1
- RVOMPSJXSRPFJT-DCAQKATOSA-N Lys-Ala-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O RVOMPSJXSRPFJT-DCAQKATOSA-N 0.000 description 1
- KEPWSUPUFAPBRF-DKIMLUQUSA-N Lys-Ile-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O KEPWSUPUFAPBRF-DKIMLUQUSA-N 0.000 description 1
- NJNRBRKHOWSGMN-SRVKXCTJSA-N Lys-Leu-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O NJNRBRKHOWSGMN-SRVKXCTJSA-N 0.000 description 1
- IPTUBUUIFRZMJK-ACRUOGEOSA-N Lys-Phe-Phe Chemical compound C([C@H](NC(=O)[C@@H](N)CCCCN)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 IPTUBUUIFRZMJK-ACRUOGEOSA-N 0.000 description 1
- MEQLGHAMAUPOSJ-DCAQKATOSA-N Lys-Ser-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O MEQLGHAMAUPOSJ-DCAQKATOSA-N 0.000 description 1
- PLOUVAYOMTYJRG-JXUBOQSCSA-N Lys-Thr-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O PLOUVAYOMTYJRG-JXUBOQSCSA-N 0.000 description 1
- JHNOXVASMSXSNB-WEDXCCLWSA-N Lys-Thr-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O JHNOXVASMSXSNB-WEDXCCLWSA-N 0.000 description 1
- SQRLLZAQNOQCEG-KKUMJFAQSA-N Lys-Tyr-Ser Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CC1=CC=C(O)C=C1 SQRLLZAQNOQCEG-KKUMJFAQSA-N 0.000 description 1
- 241000124008 Mammalia Species 0.000 description 1
- YRAWWKUTNBILNT-FXQIFTODSA-N Met-Ala-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O YRAWWKUTNBILNT-FXQIFTODSA-N 0.000 description 1
- ULNXMMYXQKGNPG-LPEHRKFASA-N Met-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCSC)N ULNXMMYXQKGNPG-LPEHRKFASA-N 0.000 description 1
- IIPHCNKHEZYSNE-DCAQKATOSA-N Met-Arg-Gln Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O IIPHCNKHEZYSNE-DCAQKATOSA-N 0.000 description 1
- IHITVQKJXQQGLJ-LPEHRKFASA-N Met-Asn-Pro Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N IHITVQKJXQQGLJ-LPEHRKFASA-N 0.000 description 1
- UZVWDRPUTHXQAM-FXQIFTODSA-N Met-Asp-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O UZVWDRPUTHXQAM-FXQIFTODSA-N 0.000 description 1
- DRINJBAHUGXNFC-DCAQKATOSA-N Met-Asp-His Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(O)=O DRINJBAHUGXNFC-DCAQKATOSA-N 0.000 description 1
- DJDFBVNNDAUPRW-GUBZILKMSA-N Met-Glu-Gln Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O DJDFBVNNDAUPRW-GUBZILKMSA-N 0.000 description 1
- FYRUJIJAUPHUNB-IUCAKERBSA-N Met-Gly-Arg Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCNC(N)=N FYRUJIJAUPHUNB-IUCAKERBSA-N 0.000 description 1
- USBFEVBHEQBWDD-AVGNSLFASA-N Met-Leu-Val Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O USBFEVBHEQBWDD-AVGNSLFASA-N 0.000 description 1
- 206010027476 Metastases Diseases 0.000 description 1
- WUGMRIBZSVSJNP-UHFFFAOYSA-N N-L-alanyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C)C(O)=O)=CNC2=C1 WUGMRIBZSVSJNP-UHFFFAOYSA-N 0.000 description 1
- XZFYRXDAULDNFX-UHFFFAOYSA-N N-L-cysteinyl-L-phenylalanine Natural products SCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XZFYRXDAULDNFX-UHFFFAOYSA-N 0.000 description 1
- PESQCPHRXOFIPX-UHFFFAOYSA-N N-L-methionyl-L-tyrosine Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 PESQCPHRXOFIPX-UHFFFAOYSA-N 0.000 description 1
- 108010087066 N2-tryptophyllysine Proteins 0.000 description 1
- BQVUABVGYYSDCJ-UHFFFAOYSA-N Nalpha-L-Leucyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)CC(C)C)C(O)=O)=CNC2=C1 BQVUABVGYYSDCJ-UHFFFAOYSA-N 0.000 description 1
- 206010028980 Neoplasm Diseases 0.000 description 1
- 239000001888 Peptone Substances 0.000 description 1
- 108010080698 Peptones Proteins 0.000 description 1
- XWBJLKDCHJVKAK-KKUMJFAQSA-N Phe-Arg-Gln Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N XWBJLKDCHJVKAK-KKUMJFAQSA-N 0.000 description 1
- IUVYJBMTHARMIP-PCBIJLKTSA-N Phe-Asp-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O IUVYJBMTHARMIP-PCBIJLKTSA-N 0.000 description 1
- WIVCOAKLPICYGY-KKUMJFAQSA-N Phe-Asp-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N WIVCOAKLPICYGY-KKUMJFAQSA-N 0.000 description 1
- KJJROSNFBRWPHS-JYJNAYRXSA-N Phe-Glu-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O KJJROSNFBRWPHS-JYJNAYRXSA-N 0.000 description 1
- NAXPHWZXEXNDIW-JTQLQIEISA-N Phe-Gly-Gly Chemical compound OC(=O)CNC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 NAXPHWZXEXNDIW-JTQLQIEISA-N 0.000 description 1
- HNFUGJUZJRYUHN-JSGCOSHPSA-N Phe-Gly-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 HNFUGJUZJRYUHN-JSGCOSHPSA-N 0.000 description 1
- FXPZZKBHNOMLGA-HJWJTTGWSA-N Phe-Ile-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N FXPZZKBHNOMLGA-HJWJTTGWSA-N 0.000 description 1
- NJJBATPLUQHRBM-IHRRRGAJSA-N Phe-Pro-Ser Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)N)C(=O)N[C@@H](CO)C(=O)O NJJBATPLUQHRBM-IHRRRGAJSA-N 0.000 description 1
- BONHGTUEEPIMPM-AVGNSLFASA-N Phe-Ser-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O BONHGTUEEPIMPM-AVGNSLFASA-N 0.000 description 1
- UNBFGVQVQGXXCK-KKUMJFAQSA-N Phe-Ser-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O UNBFGVQVQGXXCK-KKUMJFAQSA-N 0.000 description 1
- MVIJMIZJPHQGEN-IHRRRGAJSA-N Phe-Ser-Val Chemical compound CC(C)[C@@H](C([O-])=O)NC(=O)[C@H](CO)NC(=O)[C@@H]([NH3+])CC1=CC=CC=C1 MVIJMIZJPHQGEN-IHRRRGAJSA-N 0.000 description 1
- GNRMAQSIROFNMI-IXOXFDKPSA-N Phe-Thr-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O GNRMAQSIROFNMI-IXOXFDKPSA-N 0.000 description 1
- JLDZQPPLTJTJLE-IHPCNDPISA-N Phe-Trp-Asp Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)N[C@@H](CC(=O)O)C(=O)O)N JLDZQPPLTJTJLE-IHPCNDPISA-N 0.000 description 1
- NHHZWPNMYQUNEH-ACRUOGEOSA-N Phe-Tyr-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CC3=CN=CN3)C(=O)O)N NHHZWPNMYQUNEH-ACRUOGEOSA-N 0.000 description 1
- DXWNFNOPBYAFRM-IHRRRGAJSA-N Phe-Val-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N DXWNFNOPBYAFRM-IHRRRGAJSA-N 0.000 description 1
- KUSYCSMTTHSZOA-DZKIICNBSA-N Phe-Val-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N KUSYCSMTTHSZOA-DZKIICNBSA-N 0.000 description 1
- IEIFEYBAYFSRBQ-IHRRRGAJSA-N Phe-Val-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N IEIFEYBAYFSRBQ-IHRRRGAJSA-N 0.000 description 1
- DZZCICYRSZASNF-FXQIFTODSA-N Pro-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 DZZCICYRSZASNF-FXQIFTODSA-N 0.000 description 1
- KIZQGKLMXKGDIV-BQBZGAKWSA-N Pro-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 KIZQGKLMXKGDIV-BQBZGAKWSA-N 0.000 description 1
- CGBYDGAJHSOGFQ-LPEHRKFASA-N Pro-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 CGBYDGAJHSOGFQ-LPEHRKFASA-N 0.000 description 1
- XQLBWXHVZVBNJM-FXQIFTODSA-N Pro-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 XQLBWXHVZVBNJM-FXQIFTODSA-N 0.000 description 1
- VCYJKOLZYPYGJV-AVGNSLFASA-N Pro-Arg-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O VCYJKOLZYPYGJV-AVGNSLFASA-N 0.000 description 1
- UVKNEILZSJMKSR-FXQIFTODSA-N Pro-Asn-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H]1CCCN1 UVKNEILZSJMKSR-FXQIFTODSA-N 0.000 description 1
- SGCZFWSQERRKBD-BQBZGAKWSA-N Pro-Asp-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(O)=O)NC(=O)[C@@H]1CCCN1 SGCZFWSQERRKBD-BQBZGAKWSA-N 0.000 description 1
- XUSDDSLCRPUKLP-QXEWZRGKSA-N Pro-Asp-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H]1CCCN1 XUSDDSLCRPUKLP-QXEWZRGKSA-N 0.000 description 1
- NOXSEHJOXCWRHK-DCAQKATOSA-N Pro-Cys-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@@H]1CCCN1 NOXSEHJOXCWRHK-DCAQKATOSA-N 0.000 description 1
- JFNPBBOGGNMSRX-CIUDSAMLSA-N Pro-Gln-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O JFNPBBOGGNMSRX-CIUDSAMLSA-N 0.000 description 1
- CMOIIANLNNYUTP-SRVKXCTJSA-N Pro-Gln-His Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O CMOIIANLNNYUTP-SRVKXCTJSA-N 0.000 description 1
- CLNJSLSHKJECME-BQBZGAKWSA-N Pro-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H]1CCCN1 CLNJSLSHKJECME-BQBZGAKWSA-N 0.000 description 1
- XQSREVQDGCPFRJ-STQMWFEESA-N Pro-Gly-Phe Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O XQSREVQDGCPFRJ-STQMWFEESA-N 0.000 description 1
- DXTOOBDIIAJZBJ-BQBZGAKWSA-N Pro-Gly-Ser Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CO)C(O)=O DXTOOBDIIAJZBJ-BQBZGAKWSA-N 0.000 description 1
- FMLRRBDLBJLJIK-DCAQKATOSA-N Pro-Leu-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 FMLRRBDLBJLJIK-DCAQKATOSA-N 0.000 description 1
- CLJLVCYFABNTHP-DCAQKATOSA-N Pro-Leu-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O CLJLVCYFABNTHP-DCAQKATOSA-N 0.000 description 1
- FXGIMYRVJJEIIM-UWVGGRQHSA-N Pro-Leu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 FXGIMYRVJJEIIM-UWVGGRQHSA-N 0.000 description 1
- VTFXTWDFPTWNJY-RHYQMDGZSA-N Pro-Leu-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VTFXTWDFPTWNJY-RHYQMDGZSA-N 0.000 description 1
- ABSSTGUCBCDKMU-UWVGGRQHSA-N Pro-Lys-Gly Chemical compound NCCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H]1CCCN1 ABSSTGUCBCDKMU-UWVGGRQHSA-N 0.000 description 1
- AWQGDZBKQTYNMN-IHRRRGAJSA-N Pro-Phe-Asp Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)N[C@@H](CC(=O)O)C(=O)O AWQGDZBKQTYNMN-IHRRRGAJSA-N 0.000 description 1
- AJBQTGZIZQXBLT-STQMWFEESA-N Pro-Phe-Gly Chemical compound C([C@@H](C(=O)NCC(=O)O)NC(=O)[C@H]1NCCC1)C1=CC=CC=C1 AJBQTGZIZQXBLT-STQMWFEESA-N 0.000 description 1
- GFHXZNVJIKMAGO-IHRRRGAJSA-N Pro-Phe-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O GFHXZNVJIKMAGO-IHRRRGAJSA-N 0.000 description 1
- RFWXYTJSVDUBBZ-DCAQKATOSA-N Pro-Pro-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 RFWXYTJSVDUBBZ-DCAQKATOSA-N 0.000 description 1
- ITUDDXVFGFEKPD-NAKRPEOUSA-N Pro-Ser-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ITUDDXVFGFEKPD-NAKRPEOUSA-N 0.000 description 1
- STGVYUTZKGPRCI-GUBZILKMSA-N Pro-Val-Cys Chemical compound SC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 STGVYUTZKGPRCI-GUBZILKMSA-N 0.000 description 1
- JXVXYRZQIUPYSA-NHCYSSNCSA-N Pro-Val-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O JXVXYRZQIUPYSA-NHCYSSNCSA-N 0.000 description 1
- IMNVAOPEMFDAQD-NHCYSSNCSA-N Pro-Val-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O IMNVAOPEMFDAQD-NHCYSSNCSA-N 0.000 description 1
- FHJQROWZEJFZPO-SRVKXCTJSA-N Pro-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 FHJQROWZEJFZPO-SRVKXCTJSA-N 0.000 description 1
- HRNQLKCLPVKZNE-CIUDSAMLSA-N Ser-Ala-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O HRNQLKCLPVKZNE-CIUDSAMLSA-N 0.000 description 1
- IDCKUIWEIZYVSO-WFBYXXMGSA-N Ser-Ala-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CO)C)C(O)=O)=CNC2=C1 IDCKUIWEIZYVSO-WFBYXXMGSA-N 0.000 description 1
- GXXTUIUYTWGPMV-FXQIFTODSA-N Ser-Arg-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O GXXTUIUYTWGPMV-FXQIFTODSA-N 0.000 description 1
- NLQUOHDCLSFABG-GUBZILKMSA-N Ser-Arg-Arg Chemical compound NC(N)=NCCC[C@H](NC(=O)[C@H](CO)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O NLQUOHDCLSFABG-GUBZILKMSA-N 0.000 description 1
- TYYBJUYSTWJHGO-ZKWXMUAHSA-N Ser-Asn-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O TYYBJUYSTWJHGO-ZKWXMUAHSA-N 0.000 description 1
- MMAPOBOTRUVNKJ-ZLUOBGJFSA-N Ser-Asp-Ser Chemical compound C([C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CO)N)C(=O)O MMAPOBOTRUVNKJ-ZLUOBGJFSA-N 0.000 description 1
- DSSOYPJWSWFOLK-CIUDSAMLSA-N Ser-Cys-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(O)=O DSSOYPJWSWFOLK-CIUDSAMLSA-N 0.000 description 1
- XWCYBVBLJRWOFR-WDSKDSINSA-N Ser-Gln-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O XWCYBVBLJRWOFR-WDSKDSINSA-N 0.000 description 1
- HVKMTOIAYDOJPL-NRPADANISA-N Ser-Gln-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O HVKMTOIAYDOJPL-NRPADANISA-N 0.000 description 1
- LALNXSXEYFUUDD-GUBZILKMSA-N Ser-Glu-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LALNXSXEYFUUDD-GUBZILKMSA-N 0.000 description 1
- QKQDTEYDEIJPNK-GUBZILKMSA-N Ser-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CO QKQDTEYDEIJPNK-GUBZILKMSA-N 0.000 description 1
- UQFYNFTYDHUIMI-WHFBIAKZSA-N Ser-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CO UQFYNFTYDHUIMI-WHFBIAKZSA-N 0.000 description 1
- QBUWQRKEHJXTOP-DCAQKATOSA-N Ser-His-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O QBUWQRKEHJXTOP-DCAQKATOSA-N 0.000 description 1
- UGHCUDLCCVVIJR-VGDYDELISA-N Ser-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CO)N UGHCUDLCCVVIJR-VGDYDELISA-N 0.000 description 1
- GJFYFGOEWLDQGW-GUBZILKMSA-N Ser-Leu-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CO)N GJFYFGOEWLDQGW-GUBZILKMSA-N 0.000 description 1
- VZQRNAYURWAEFE-KKUMJFAQSA-N Ser-Leu-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 VZQRNAYURWAEFE-KKUMJFAQSA-N 0.000 description 1
- MUJQWSAWLLRJCE-KATARQTJSA-N Ser-Leu-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MUJQWSAWLLRJCE-KATARQTJSA-N 0.000 description 1
- FBLNYDYPCLFTSP-IXOXFDKPSA-N Ser-Phe-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FBLNYDYPCLFTSP-IXOXFDKPSA-N 0.000 description 1
- GZGFSPWOMUKKCV-NAKRPEOUSA-N Ser-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO GZGFSPWOMUKKCV-NAKRPEOUSA-N 0.000 description 1
- BMKNXTJLHFIAAH-CIUDSAMLSA-N Ser-Ser-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O BMKNXTJLHFIAAH-CIUDSAMLSA-N 0.000 description 1
- PYTKULIABVRXSC-BWBBJGPYSA-N Ser-Ser-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PYTKULIABVRXSC-BWBBJGPYSA-N 0.000 description 1
- VGQVAVQWKJLIRM-FXQIFTODSA-N Ser-Ser-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O VGQVAVQWKJLIRM-FXQIFTODSA-N 0.000 description 1
- WMZVVNLPHFSUPA-BPUTZDHNSA-N Ser-Trp-Arg Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CO)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)=CNC2=C1 WMZVVNLPHFSUPA-BPUTZDHNSA-N 0.000 description 1
- SDFUZKIAHWRUCS-QEJZJMRPSA-N Ser-Trp-Glu Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CO)N SDFUZKIAHWRUCS-QEJZJMRPSA-N 0.000 description 1
- ZWSZBWAFDZRBNM-UBHSHLNASA-N Ser-Trp-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CO)C(O)=O ZWSZBWAFDZRBNM-UBHSHLNASA-N 0.000 description 1
- RTXKJFWHEBTABY-IHPCNDPISA-N Ser-Trp-Tyr Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC3=CC=C(C=C3)O)C(=O)O)NC(=O)[C@H](CO)N RTXKJFWHEBTABY-IHPCNDPISA-N 0.000 description 1
- HSWXBJCBYSWBPT-GUBZILKMSA-N Ser-Val-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CO)C(C)C)C(O)=O HSWXBJCBYSWBPT-GUBZILKMSA-N 0.000 description 1
- DGDCHPCRMWEOJR-FQPOAREZSA-N Thr-Ala-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 DGDCHPCRMWEOJR-FQPOAREZSA-N 0.000 description 1
- JMZKMSTYXHFYAK-VEVYYDQMSA-N Thr-Arg-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)O JMZKMSTYXHFYAK-VEVYYDQMSA-N 0.000 description 1
- OJRNZRROAIAHDL-LKXGYXEUSA-N Thr-Asn-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O OJRNZRROAIAHDL-LKXGYXEUSA-N 0.000 description 1
- APIQKJYZDWVOCE-VEVYYDQMSA-N Thr-Asp-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O APIQKJYZDWVOCE-VEVYYDQMSA-N 0.000 description 1
- SHOMROOOQBDGRL-JHEQGTHGSA-N Thr-Glu-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O SHOMROOOQBDGRL-JHEQGTHGSA-N 0.000 description 1
- KBLYJPQSNGTDIU-LOKLDPHHSA-N Thr-Glu-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N)O KBLYJPQSNGTDIU-LOKLDPHHSA-N 0.000 description 1
- XOTBWOCSLMBGMF-SUSMZKCASA-N Thr-Glu-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XOTBWOCSLMBGMF-SUSMZKCASA-N 0.000 description 1
- WYKJENSCCRJLRC-ZDLURKLDSA-N Thr-Gly-Cys Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H](CS)C(=O)O)N)O WYKJENSCCRJLRC-ZDLURKLDSA-N 0.000 description 1
- XSTGOZBBXFKGHA-YJRXYDGGSA-N Thr-His-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N)O XSTGOZBBXFKGHA-YJRXYDGGSA-N 0.000 description 1
- RRRRCRYTLZVCEN-HJGDQZAQSA-N Thr-Leu-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O RRRRCRYTLZVCEN-HJGDQZAQSA-N 0.000 description 1
- MEJHFIOYJHTWMK-VOAKCMCISA-N Thr-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)[C@@H](C)O MEJHFIOYJHTWMK-VOAKCMCISA-N 0.000 description 1
- UJQVSMNQMQHVRY-KZVJFYERSA-N Thr-Met-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O UJQVSMNQMQHVRY-KZVJFYERSA-N 0.000 description 1
- MROIJTGJGIDEEJ-RCWTZXSCSA-N Thr-Pro-Pro Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 MROIJTGJGIDEEJ-RCWTZXSCSA-N 0.000 description 1
- IVDFVBVIVLJJHR-LKXGYXEUSA-N Thr-Ser-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O IVDFVBVIVLJJHR-LKXGYXEUSA-N 0.000 description 1
- WPSKTVVMQCXPRO-BWBBJGPYSA-N Thr-Ser-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O WPSKTVVMQCXPRO-BWBBJGPYSA-N 0.000 description 1
- ILUOMMDDGREELW-OSUNSFLBSA-N Thr-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)[C@@H](C)O ILUOMMDDGREELW-OSUNSFLBSA-N 0.000 description 1
- TZNNEYFZZAHLBL-BPUTZDHNSA-N Trp-Arg-Asp Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O TZNNEYFZZAHLBL-BPUTZDHNSA-N 0.000 description 1
- HYNAKPYFEYJMAS-XIRDDKMYSA-N Trp-Arg-Glu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O HYNAKPYFEYJMAS-XIRDDKMYSA-N 0.000 description 1
- GTNCSPKYWCJZAC-XIRDDKMYSA-N Trp-Asp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N GTNCSPKYWCJZAC-XIRDDKMYSA-N 0.000 description 1
- WACMTVIJWRNVSO-CWRNSKLLSA-N Trp-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N)C(=O)O WACMTVIJWRNVSO-CWRNSKLLSA-N 0.000 description 1
- OBWQLWYNNZPWGX-QEJZJMRPSA-N Trp-Gln-Asp Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O OBWQLWYNNZPWGX-QEJZJMRPSA-N 0.000 description 1
- RWAYYYOZMHMEGD-XIRDDKMYSA-N Trp-Leu-Ser Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O)=CNC2=C1 RWAYYYOZMHMEGD-XIRDDKMYSA-N 0.000 description 1
- RNDWCRUOGGQDKN-UBHSHLNASA-N Trp-Ser-Asp Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O RNDWCRUOGGQDKN-UBHSHLNASA-N 0.000 description 1
- MBLJBGZWLHTJBH-SZMVWBNQSA-N Trp-Val-Arg Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)=CNC2=C1 MBLJBGZWLHTJBH-SZMVWBNQSA-N 0.000 description 1
- DXUVJJRTVACXSO-KKUMJFAQSA-N Tyr-Gln-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CC=C(C=C1)O)N DXUVJJRTVACXSO-KKUMJFAQSA-N 0.000 description 1
- KIJLSRYAUGGZIN-CFMVVWHZSA-N Tyr-Ile-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O KIJLSRYAUGGZIN-CFMVVWHZSA-N 0.000 description 1
- WDGDKHLSDIOXQC-ACRUOGEOSA-N Tyr-Leu-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 WDGDKHLSDIOXQC-ACRUOGEOSA-N 0.000 description 1
- DMWNPLOERDAHSY-MEYUZBJRSA-N Tyr-Leu-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O DMWNPLOERDAHSY-MEYUZBJRSA-N 0.000 description 1
- HSBZWINKRYZCSQ-KKUMJFAQSA-N Tyr-Lys-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O HSBZWINKRYZCSQ-KKUMJFAQSA-N 0.000 description 1
- CWVHKVVKAQIJKY-ACRUOGEOSA-N Tyr-Lys-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC2=CC=C(C=C2)O)N CWVHKVVKAQIJKY-ACRUOGEOSA-N 0.000 description 1
- UBKKNELWDCBNCF-STQMWFEESA-N Tyr-Met-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCSC)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 UBKKNELWDCBNCF-STQMWFEESA-N 0.000 description 1
- QPOUERMDWKKZEG-HJPIBITLSA-N Tyr-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 QPOUERMDWKKZEG-HJPIBITLSA-N 0.000 description 1
- UUBKSZNKJUJQEJ-JRQIVUDYSA-N Tyr-Thr-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N)O UUBKSZNKJUJQEJ-JRQIVUDYSA-N 0.000 description 1
- XTOCLOATLKOZAU-JBACZVJFSA-N Tyr-Trp-Glu Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC3=CC=C(C=C3)O)N XTOCLOATLKOZAU-JBACZVJFSA-N 0.000 description 1
- YFOCMOVJBQDBCE-NRPADANISA-N Val-Ala-Glu Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N YFOCMOVJBQDBCE-NRPADANISA-N 0.000 description 1
- HNWQUBBOBKSFQV-AVGNSLFASA-N Val-Arg-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N HNWQUBBOBKSFQV-AVGNSLFASA-N 0.000 description 1
- CVUDMNSZAIZFAE-UHFFFAOYSA-N Val-Arg-Pro Natural products NC(N)=NCCCC(NC(=O)C(N)C(C)C)C(=O)N1CCCC1C(O)=O CVUDMNSZAIZFAE-UHFFFAOYSA-N 0.000 description 1
- UBTBGUDNDFZLGP-SRVKXCTJSA-N Val-Arg-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](C(C)C)C(=O)O)N UBTBGUDNDFZLGP-SRVKXCTJSA-N 0.000 description 1
- OVLIFGQSBSNGHY-KKHAAJSZSA-N Val-Asp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C(C)C)N)O OVLIFGQSBSNGHY-KKHAAJSZSA-N 0.000 description 1
- XKVXSCHXGJOQND-ZOBUZTSGSA-N Val-Asp-Trp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N XKVXSCHXGJOQND-ZOBUZTSGSA-N 0.000 description 1
- HIZMLPKDJAXDRG-FXQIFTODSA-N Val-Cys-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)O)N HIZMLPKDJAXDRG-FXQIFTODSA-N 0.000 description 1
- HURRXSNHCCSJHA-AUTRQRHGSA-N Val-Gln-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N HURRXSNHCCSJHA-AUTRQRHGSA-N 0.000 description 1
- VFOHXOLPLACADK-GVXVVHGQSA-N Val-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](C(C)C)N VFOHXOLPLACADK-GVXVVHGQSA-N 0.000 description 1
- PWRITNSESKQTPW-NRPADANISA-N Val-Gln-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N PWRITNSESKQTPW-NRPADANISA-N 0.000 description 1
- XGJLNBNZNMVJRS-NRPADANISA-N Val-Glu-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O XGJLNBNZNMVJRS-NRPADANISA-N 0.000 description 1
- CVIXTAITYJQMPE-LAEOZQHASA-N Val-Glu-Asn Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O CVIXTAITYJQMPE-LAEOZQHASA-N 0.000 description 1
- GBESYURLQOYWLU-LAEOZQHASA-N Val-Glu-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N GBESYURLQOYWLU-LAEOZQHASA-N 0.000 description 1
- ROLGIBMFNMZANA-GVXVVHGQSA-N Val-Glu-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C(C)C)N ROLGIBMFNMZANA-GVXVVHGQSA-N 0.000 description 1
- PMXBARDFIAPBGK-DZKIICNBSA-N Val-Glu-Tyr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 PMXBARDFIAPBGK-DZKIICNBSA-N 0.000 description 1
- KZKMBGXCNLPYKD-YEPSODPASA-N Val-Gly-Thr Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O KZKMBGXCNLPYKD-YEPSODPASA-N 0.000 description 1
- PTFPUAXGIKTVNN-ONGXEEELSA-N Val-His-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)NCC(=O)O)N PTFPUAXGIKTVNN-ONGXEEELSA-N 0.000 description 1
- XBRMBDFYOFARST-AVGNSLFASA-N Val-His-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](C(C)C)C(=O)O)N XBRMBDFYOFARST-AVGNSLFASA-N 0.000 description 1
- OTJMMKPMLUNTQT-AVGNSLFASA-N Val-Leu-Arg Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](C(C)C)N OTJMMKPMLUNTQT-AVGNSLFASA-N 0.000 description 1
- AGXGCFSECFQMKB-NHCYSSNCSA-N Val-Leu-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N AGXGCFSECFQMKB-NHCYSSNCSA-N 0.000 description 1
- HGJRMXOWUWVUOA-GVXVVHGQSA-N Val-Leu-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N HGJRMXOWUWVUOA-GVXVVHGQSA-N 0.000 description 1
- XTDDIVQWDXMRJL-IHRRRGAJSA-N Val-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](C(C)C)N XTDDIVQWDXMRJL-IHRRRGAJSA-N 0.000 description 1
- ZRSZTKTVPNSUNA-IHRRRGAJSA-N Val-Lys-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)C(C)C)C(O)=O ZRSZTKTVPNSUNA-IHRRRGAJSA-N 0.000 description 1
- XPKCFQZDQGVJCX-RHYQMDGZSA-N Val-Lys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C(C)C)N)O XPKCFQZDQGVJCX-RHYQMDGZSA-N 0.000 description 1
- YDVDTCJGBBJGRT-GUBZILKMSA-N Val-Met-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)O)N YDVDTCJGBBJGRT-GUBZILKMSA-N 0.000 description 1
- AIWLHFZYOUUJGB-UFYCRDLUSA-N Val-Phe-Tyr Chemical compound C([C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 AIWLHFZYOUUJGB-UFYCRDLUSA-N 0.000 description 1
- MJOUSKQHAIARKI-JYJNAYRXSA-N Val-Phe-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CC1=CC=CC=C1 MJOUSKQHAIARKI-JYJNAYRXSA-N 0.000 description 1
- ZXYPHBKIZLAQTL-QXEWZRGKSA-N Val-Pro-Asp Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)O)C(=O)O)N ZXYPHBKIZLAQTL-QXEWZRGKSA-N 0.000 description 1
- DOFAQXCYFQKSHT-SRVKXCTJSA-N Val-Pro-Pro Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DOFAQXCYFQKSHT-SRVKXCTJSA-N 0.000 description 1
- UGFMVXRXULGLNO-XPUUQOCRSA-N Val-Ser-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O UGFMVXRXULGLNO-XPUUQOCRSA-N 0.000 description 1
- VHIZXDZMTDVFGX-DCAQKATOSA-N Val-Ser-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](C(C)C)N VHIZXDZMTDVFGX-DCAQKATOSA-N 0.000 description 1
- GBIUHAYJGWVNLN-AEJSXWLSSA-N Val-Ser-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N GBIUHAYJGWVNLN-AEJSXWLSSA-N 0.000 description 1
- GBIUHAYJGWVNLN-UHFFFAOYSA-N Val-Ser-Pro Natural products CC(C)C(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O GBIUHAYJGWVNLN-UHFFFAOYSA-N 0.000 description 1
- BZDGLJPROOOUOZ-XGEHTFHBSA-N Val-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](C(C)C)N)O BZDGLJPROOOUOZ-XGEHTFHBSA-N 0.000 description 1
- LNWSJGJCLFUNTN-ZOBUZTSGSA-N Val-Trp-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC(=O)N)C(=O)O)N LNWSJGJCLFUNTN-ZOBUZTSGSA-N 0.000 description 1
- ZHWZDZFWBXWPDW-GUBZILKMSA-N Val-Val-Cys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CS)C(O)=O ZHWZDZFWBXWPDW-GUBZILKMSA-N 0.000 description 1
- 239000002253 acid Substances 0.000 description 1
- 108010024078 alanyl-glycyl-serine Proteins 0.000 description 1
- 108010086434 alanyl-seryl-glycine Proteins 0.000 description 1
- 108010045350 alanyl-tyrosyl-alanine Proteins 0.000 description 1
- 108010041407 alanylaspartic acid Proteins 0.000 description 1
- 108010005233 alanylglutamic acid Proteins 0.000 description 1
- 108010050025 alpha-glutamyltryptophan Proteins 0.000 description 1
- 230000033115 angiogenesis Effects 0.000 description 1
- 108010072041 arginyl-glycyl-aspartic acid Proteins 0.000 description 1
- 108010091092 arginyl-glycyl-proline Proteins 0.000 description 1
- 108010029539 arginyl-prolyl-proline Proteins 0.000 description 1
- 108010068380 arginylarginine Proteins 0.000 description 1
- 108010062796 arginyllysine Proteins 0.000 description 1
- 108010060035 arginylproline Proteins 0.000 description 1
- 108010077245 asparaginyl-proline Proteins 0.000 description 1
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 230000031018 biological processes and functions Effects 0.000 description 1
- 229960001561 bleomycin Drugs 0.000 description 1
- OYVAGSVQBOHSSS-UAPAGMARSA-O bleomycin A2 Chemical compound N([C@H](C(=O)N[C@H](C)[C@@H](O)[C@H](C)C(=O)N[C@@H]([C@H](O)C)C(=O)NCCC=1SC=C(N=1)C=1SC=C(N=1)C(=O)NCCC[S+](C)C)[C@@H](O[C@H]1[C@H]([C@@H](O)[C@H](O)[C@H](CO)O1)O[C@@H]1[C@H]([C@@H](OC(N)=O)[C@H](O)[C@@H](CO)O1)O)C=1N=CNC=1)C(=O)C1=NC([C@H](CC(N)=O)NC[C@H](N)C(N)=O)=NC(N)=C1C OYVAGSVQBOHSSS-UAPAGMARSA-O 0.000 description 1
- 108010083912 bleomycin N-acetyltransferase Proteins 0.000 description 1
- 239000000872 buffer Substances 0.000 description 1
- 238000006555 catalytic reaction Methods 0.000 description 1
- 238000006243 chemical reaction Methods 0.000 description 1
- 239000003153 chemical reaction reagent Substances 0.000 description 1
- 238000003776 cleavage reaction Methods 0.000 description 1
- 239000011248 coating agent Substances 0.000 description 1
- 238000000576 coating method Methods 0.000 description 1
- 108010016616 cysteinylglycine Proteins 0.000 description 1
- 108010060199 cysteinylproline Proteins 0.000 description 1
- 210000000805 cytoplasm Anatomy 0.000 description 1
- 238000010511 deprotection reaction Methods 0.000 description 1
- 238000001514 detection method Methods 0.000 description 1
- FSXRLASFHBWESK-UHFFFAOYSA-N dipeptide phenylalanyl-tyrosine Natural products C=1C=C(O)C=CC=1CC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FSXRLASFHBWESK-UHFFFAOYSA-N 0.000 description 1
- 239000001177 diphosphate Substances 0.000 description 1
- XPPKVPWEQAFLFU-UHFFFAOYSA-J diphosphate(4-) Chemical compound [O-]P([O-])(=O)OP([O-])([O-])=O XPPKVPWEQAFLFU-UHFFFAOYSA-J 0.000 description 1
- 235000011180 diphosphates Nutrition 0.000 description 1
- 108010054813 diprotin B Proteins 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 239000013604 expression vector Substances 0.000 description 1
- 238000012262 fermentative production Methods 0.000 description 1
- -1 fucosyl oligosaccharides Chemical class 0.000 description 1
- 230000033581 fucosylation Effects 0.000 description 1
- 238000010353 genetic engineering Methods 0.000 description 1
- 108010078144 glutaminyl-glycine Proteins 0.000 description 1
- 108010042598 glutamyl-aspartyl-glycine Proteins 0.000 description 1
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 1
- 108010073628 glutamyl-valyl-phenylalanine Proteins 0.000 description 1
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 1
- 108010027668 glycyl-alanyl-valine Proteins 0.000 description 1
- 108010019832 glycyl-asparaginyl-glycine Proteins 0.000 description 1
- 108010067216 glycyl-glycyl-glycine Proteins 0.000 description 1
- 108010010096 glycyl-glycyl-tyrosine Proteins 0.000 description 1
- 108010010147 glycylglutamine Proteins 0.000 description 1
- 108010081551 glycylphenylalanine Proteins 0.000 description 1
- RQFCJASXJCIDSX-UUOKFMHZSA-N guanosine 5'-monophosphate Chemical compound C1=2NC(N)=NC(=O)C=2N=CN1[C@@H]1O[C@H](COP(O)(O)=O)[C@@H](O)[C@H]1O RQFCJASXJCIDSX-UUOKFMHZSA-N 0.000 description 1
- 108010040030 histidinoalanine Proteins 0.000 description 1
- 108010025306 histidylleucine Proteins 0.000 description 1
- 235000020256 human milk Nutrition 0.000 description 1
- 210000004251 human milk Anatomy 0.000 description 1
- 230000000899 immune system response Effects 0.000 description 1
- 208000015181 infectious disease Diseases 0.000 description 1
- 230000004054 inflammatory process Effects 0.000 description 1
- 230000028709 inflammatory response Effects 0.000 description 1
- 238000002347 injection Methods 0.000 description 1
- 239000007924 injection Substances 0.000 description 1
- 239000002054 inoculum Substances 0.000 description 1
- 108010044374 isoleucyl-tyrosine Proteins 0.000 description 1
- 108010076756 leucyl-alanyl-phenylalanine Proteins 0.000 description 1
- 108010083708 leucyl-aspartyl-valine Proteins 0.000 description 1
- 108010073472 leucyl-prolyl-proline Proteins 0.000 description 1
- 108010091871 leucylmethionine Proteins 0.000 description 1
- 238000009630 liquid culture Methods 0.000 description 1
- 108010054155 lysyllysine Proteins 0.000 description 1
- 108010038320 lysylphenylalanine Proteins 0.000 description 1
- 239000003550 marker Substances 0.000 description 1
- 238000004949 mass spectrometry Methods 0.000 description 1
- 230000009401 metastasis Effects 0.000 description 1
- 108010090114 methionyl-tyrosyl-lysine Proteins 0.000 description 1
- 108010056582 methionylglutamic acid Proteins 0.000 description 1
- 108010005942 methionylglycine Proteins 0.000 description 1
- 239000002773 nucleotide Substances 0.000 description 1
- 125000003729 nucleotide group Chemical group 0.000 description 1
- 244000052769 pathogen Species 0.000 description 1
- 230000001717 pathogenic effect Effects 0.000 description 1
- 235000019319 peptone Nutrition 0.000 description 1
- 108010070409 phenylalanyl-glycyl-glycine Proteins 0.000 description 1
- 108010084525 phenylalanyl-phenylalanyl-glycine Proteins 0.000 description 1
- 108010024654 phenylalanyl-prolyl-alanine Proteins 0.000 description 1
- 108010084572 phenylalanyl-valine Proteins 0.000 description 1
- 108010073025 phenylalanylphenylalanine Proteins 0.000 description 1
- 239000008363 phosphate buffer Substances 0.000 description 1
- 108010077112 prolyl-proline Proteins 0.000 description 1
- 108010031719 prolyl-serine Proteins 0.000 description 1
- 108010090894 prolylleucine Proteins 0.000 description 1
- 102000004169 proteins and genes Human genes 0.000 description 1
- 238000011084 recovery Methods 0.000 description 1
- 230000007017 scission Effects 0.000 description 1
- 238000012216 screening Methods 0.000 description 1
- 239000011780 sodium chloride Substances 0.000 description 1
- 239000007858 starting material Substances 0.000 description 1
- 239000000126 substance Substances 0.000 description 1
- 108010061238 threonyl-glycine Proteins 0.000 description 1
- 230000008467 tissue growth Effects 0.000 description 1
- 230000009466 transformation Effects 0.000 description 1
- AVBGNFCMKJOFIN-UHFFFAOYSA-N triethylammonium acetate Chemical compound CC(O)=O.CCN(CC)CC AVBGNFCMKJOFIN-UHFFFAOYSA-N 0.000 description 1
- 239000012137 tryptone Substances 0.000 description 1
- 108010051110 tyrosyl-lysine Proteins 0.000 description 1
- 108010015385 valyl-prolyl-proline Proteins 0.000 description 1
- 108010009962 valyltyrosine Proteins 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/10—Transferases (2.)
- C12N9/12—Transferases (2.) transferring phosphorus containing groups, e.g. kinases (2.7)
- C12N9/1241—Nucleotidyltransferases (2.7.7)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/80—Vectors or expression systems specially adapted for eukaryotic hosts for fungi
- C12N15/81—Vectors or expression systems specially adapted for eukaryotic hosts for fungi for yeasts
- C12N15/815—Vectors or expression systems specially adapted for eukaryotic hosts for fungi for yeasts for yeasts other than Saccharomyces
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/10—Transferases (2.)
- C12N9/12—Transferases (2.) transferring phosphorus containing groups, e.g. kinases (2.7)
- C12N9/1205—Phosphotransferases with an alcohol group as acceptor (2.7.1), e.g. protein kinases
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P19/00—Preparation of compounds containing saccharide radicals
- C12P19/26—Preparation of nitrogen-containing carbohydrates
- C12P19/28—N-glycosides
- C12P19/30—Nucleotides
- C12P19/32—Nucleotides having a condensed ring system containing a six-membered ring having two N-atoms in the same ring, e.g. purine nucleotides, nicotineamide-adenine dinucleotide
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y207/00—Transferases transferring phosphorus-containing groups (2.7)
- C12Y207/07—Nucleotidyltransferases (2.7.7)
- C12Y207/0703—Fucose-1-phosphate guanylyltransferase (2.7.7.30)
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y02—TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
- Y02P—CLIMATE CHANGE MITIGATION TECHNOLOGIES IN THE PRODUCTION OR PROCESSING OF GOODS
- Y02P20/00—Technologies relating to chemical industry
- Y02P20/50—Improvements relating to the production of bulk chemicals
- Y02P20/55—Design of synthesis routes, e.g. reducing the use of auxiliary or protecting groups
Landscapes
- Life Sciences & Earth Sciences (AREA)
- Health & Medical Sciences (AREA)
- Chemical & Material Sciences (AREA)
- Organic Chemistry (AREA)
- Genetics & Genomics (AREA)
- Engineering & Computer Science (AREA)
- Zoology (AREA)
- Wood Science & Technology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- General Engineering & Computer Science (AREA)
- Biotechnology (AREA)
- Molecular Biology (AREA)
- Biochemistry (AREA)
- General Health & Medical Sciences (AREA)
- Microbiology (AREA)
- Biomedical Technology (AREA)
- Mycology (AREA)
- Medicinal Chemistry (AREA)
- Physics & Mathematics (AREA)
- Biophysics (AREA)
- Plant Pathology (AREA)
- Chemical Kinetics & Catalysis (AREA)
- General Chemical & Material Sciences (AREA)
- Preparation Of Compounds By Using Micro-Organisms (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
Abstract
Description
技术领域technical field
本发明属于遗传工程的技术领域,具体涉及一种合成鸟苷二磷酸岩藻糖的重组毕赤酵母 及其构建方法与应用。The invention belongs to the technical field of genetic engineering, and specifically relates to a recombinant Pichia pastoris for synthesizing guanosine diphosphate fucose and its construction method and application.
背景技术Background technique
鸟苷酸二磷酸岩藻糖(GDP-L-Fuc)是岩藻糖(L-fucose)的核苷酸形式,它为岩藻糖基化反 应提供岩藻糖基受体,在生命体的组织生长、血管再生、受精细胞黏附、炎症和肿瘤转移等 方面起重要作用。同时,含有岩藻糖基的寡糖是母乳的主要组成部分,其功能涉及广泛的生 物学过程,例如,预防病原体感染,改善免疫系统反应,减少炎症反应等。藻糖基化的寡糖是在岩藻糖基转移酶的催化下以鸟苷二磷酸岩藻糖(GDP-L-fucose)作为岩藻糖基的供体合成 的。因为岩藻糖基寡糖的广泛作用而产生巨大需求,所以许多制药公司试图通过化学方法和 生物方法高效合成充足的鸟苷二磷酸岩藻糖。在化学合成中,鸟苷二磷酸岩藻糖是以L-岩藻 吡喃糖基四乙酸为起始原料进行合成,但化学合成伴随有多个保护和脱保护步骤,这种复杂性使得化学合成对于工业应用而言是不现实的。生物合成是基于在生物体中发现了两种合成 GDP-L-fucose的代谢途径:补救途径和从头途径,这两种代谢途径均发生在细胞质中。鸟苷 二磷酸岩藻糖生物合成如图1所示。Guanylate diphosphate fucose (GDP-L-Fuc) is the nucleotide form of fucose (L-fucose), which provides fucosyl receptors for fucosylation reactions, and is used in living organisms It plays an important role in tissue growth, angiogenesis, adhesion of fertilized cells, inflammation and tumor metastasis. At the same time, oligosaccharides containing fucosyl groups are the main components of breast milk, and their functions involve a wide range of biological processes, such as preventing pathogen infection, improving immune system responses, and reducing inflammatory responses, etc. Cocosylated oligosaccharides are synthesized under the catalysis of fucosyltransferase using guanosine diphosphate fucose (GDP-L-fucose) as a fucosyl donor. Because of the huge demand due to the wide range of effects of fucosyl oligosaccharides, many pharmaceutical companies are trying to efficiently synthesize sufficient guanosine diphosphate fucose through chemical and biological methods. In chemical synthesis, guanosine diphosphate fucose is synthesized from L-fucopyranosyltetraacetic acid as the starting material, but chemical synthesis is accompanied by multiple protection and deprotection steps. This complexity makes chemical Synthesis is impractical for industrial applications. Biosynthesis is based on the discovery of two metabolic pathways for the synthesis of GDP-L-fucose in organisms: the salvage pathway and the de novo pathway, both of which occur in the cytoplasm. The biosynthesis of fucose guanosine diphosphate is shown in Figure 1.
补救途径是在人类的代谢途径中发现的,外源的岩藻糖转移到胞内,由岩藻糖激酶消耗 ATP被磷酸化(EC2.7.1.52)继而形成岩藻糖-1-磷酸(Fuc-1-P)。Fuc-1-P结合鸟苷三磷酸(GTP)在 岩藻糖-1-磷酸鸟苷酰基转移酶(L-fucose-1-phosphateguanylyltransferase)(EC2.7.7.30)的催化下 生成GDP-L-岩藻糖。从头合成途径它首先在细菌中发现,然后被发现在植物、哺乳动物和无 脊椎动物中均存在,在此途径中,GDP-岩藻糖是由GDP-甘露糖通过鸟苷二磷酸甘露糖脱水 酶(GMD,EC4.2.1.47)和鸟苷二磷酸甘露糖差向异构酶(WCAG,EC1.1.1.271)催化合成的。The salvage pathway is found in the human metabolic pathway, exogenous fucose is transferred into the cell, ATP is consumed by fucose kinase and phosphorylated (EC2.7.1.52) to form fucose-1-phosphate ( Fuc-1-P). Fuc-1-P combines with guanosine triphosphate (GTP) to generate GDP-L- Fucose. The de novo synthesis pathway, which was first discovered in bacteria and then found in plants, mammals, and invertebrates, is a pathway in which GDP-fucose is dehydrated from GDP-mannose via guanosine diphosphate-mannose Enzyme (GMD, EC4.2.1.47) and guanosine diphosphate mannose epimerase (WCAG, EC1.1.1.271) catalyzed synthesis.
毕赤酵母作为外源蛋白表达的优良宿主被广泛应用,其产品被FDA认证为“generally regarded as safe”(GRAS)安全级别。Pichia pastoris is widely used as an excellent host for exogenous protein expression, and its products have been certified by the FDA as "generally regarded as safe" (GRAS) safety level.
因此,如何利用毕赤酵母,通过生物方法高效合成鸟苷二磷酸岩藻糖,仍是本领域亟待 解决的问题。Therefore, how to use Pichia pastoris to efficiently synthesize fucose guanosine diphosphate through biological methods is still an urgent problem in this field.
发明内容Contents of the invention
本发明的目的在于提供一种合成鸟苷二磷酸岩藻糖的重组毕赤酵母的构建方法与应用, 构建的重组毕赤酵母可以高效合成鸟苷二磷酸岩藻糖。The object of the present invention is to provide a construction method and application of recombinant Pichia pastoris for synthesizing guanosine diphosphate fucose. The constructed recombinant Pichia pastoris can efficiently synthesize guanosine diphosphate fucose.
本发明所采取的技术方案是:The technical scheme that the present invention takes is:
本发明的第一个方面,提供一种载体,所述载体含有岩藻糖-1-磷酸鸟苷转移酶的表达序 列和/或岩藻糖激酶的表达序列。The first aspect of the present invention provides a vector containing the expression sequence of fucose-1-phosphate guanosyltransferase and/or the expression sequence of fucose kinase.
在本发明的一些实施方式中,所述岩藻糖-1-磷酸鸟苷转移酶的氨基酸序列如SEQID NO.1所示,所述岩藻糖激酶的氨基酸序列如SEQ ID NO.2所示。In some embodiments of the present invention, the amino acid sequence of the fucose-1-phosphate guanosyltransferase is shown in SEQ ID NO.1, and the amino acid sequence of the fucose kinase is shown in SEQ ID NO.2 .
在本发明的一些实施方式中,所述岩藻糖-1-磷酸鸟苷转移酶来源于野猪(Susscrofa)的 FPGT基因,所述岩藻糖激酶基因来源于野猪(Sus scrofa)的FCSK基因。In some embodiments of the present invention, the fucose-1-phosphate guanosyltransferase is derived from the FPGT gene of wild boar (Susscrofa), and the fucose kinase gene is derived from the FCSK gene of wild boar (Sus scrofa).
在本发明的一些实施方式中,所述岩藻糖-1-磷酸鸟苷转移酶的表达序列如SEQID NO.3 所示,所述岩藻糖激酶的表达序列如SEQ ID NO.4所示。In some embodiments of the present invention, the expression sequence of the fucose-1-phosphate guanosyltransferase is shown in SEQ ID NO.3, and the expression sequence of the fucose kinase is shown in SEQ ID NO.4 .
在本发明的一些实施方式中,所述载体的序列如SEQ ID NO.5所示。In some embodiments of the present invention, the sequence of the vector is shown in SEQ ID NO.5.
本发明的第二个方面,提供一种重组细胞,所述重组细胞包含本发明第一方面所述的表 达载体,其中所述细胞为非植物细胞。A second aspect of the present invention provides a recombinant cell comprising the expression vector described in the first aspect of the present invention, wherein the cell is a non-plant cell.
在本发明的一些实施方式中,所述细胞为毕赤酵母。In some embodiments of the invention, the cell is Pichia pastoris.
在本发明的一些优选实施方式中,所述细胞为毕赤酵母GS115。In some preferred embodiments of the present invention, the cell is Pichia pastoris GS115.
本发明的第三个方面,提供本发明第二方面所述的重组细胞的构建方法,包括以下步骤: 将本发明第一方面所述载体转入细胞中。The third aspect of the present invention provides the method for constructing the recombinant cell described in the second aspect of the present invention, comprising the following steps: transferring the vector described in the first aspect of the present invention into the cell.
在本发明的一些实施方式中,所述构建方法具体为将本发明第一方面所述载体电转化毕 赤酵母GS115的感受态细胞。In some embodiments of the present invention, the construction method is specifically to electrotransform the competent cells of Pichia pastoris GS115 with the vector described in the first aspect of the present invention.
本发明的第四个方面,提供本发明第一方面所述的载体或本发明第二方面所述重组细胞 的应用。The fourth aspect of the present invention provides the use of the vector described in the first aspect of the present invention or the recombinant cell described in the second aspect of the present invention.
在本发明的一些优选实施方式中,所述应用具体为在制备鸟苷二磷酸岩藻糖中的应用。In some preferred embodiments of the present invention, the application is specifically the application in the preparation of fucose guanosine diphosphate.
本发明的第五个方面,提供一种制备鸟苷二磷酸岩藻糖的方法,通过本发明第二方面所 述的重组细胞生成。The fifth aspect of the present invention provides a method for preparing guanosine diphosphate fucose, which is produced by the recombinant cell described in the second aspect of the present invention.
在本发明的一些实施方式,所述发酵条件为将重组细胞种子液以OD600值为0.5~1.5的 接种量接入发酵培养基,在25~35℃,200~300rpm条件下培养100~140h。In some embodiments of the present invention, the fermentation condition is that the recombinant cell seed solution is inserted into the fermentation medium with an inoculation amount of OD600 of 0.5-1.5, and cultured at 25-35°C and 200-300rpm for 100-140h.
在本发明的一些优选实施方式中,所述发酵条件为将重组细胞种子液以OD值为1的接 种量接入发酵培养基,在30℃,250rpm条件下培养120h。In some preferred embodiments of the present invention, the fermentation condition is that the recombinant cell seed liquid is inserted into the fermentation medium with an inoculum amount of OD value of 1, and cultivated for 120 h at 30 ° C and 250 rpm.
本发明还提供一种鸟苷二磷酸岩藻糖,由本发明第五方面所述的方法制备而成。The present invention also provides guanosine diphosphate fucose prepared by the method described in the fifth aspect of the present invention.
本发明的有益效果是:The beneficial effects of the present invention are:
本发明提供了一种载体,所述载体含有岩藻糖-1-磷酸鸟苷转移酶的表达序列和/或岩藻糖 激酶的表达序列。并且提供了一种重组细胞,是在毕赤酵母GS115的基础上表达岩藻糖-1-磷 酸鸟苷转移酶和岩藻糖激酶基因得到的,通过改造获得了可合成鸟苷二磷酸岩藻糖的重组菌 株。本发明重组毕赤酵母的构建方法简单,便于使用,具有很好的应用前景。The present invention provides a vector, which contains the expression sequence of fucose-1-phosphate guanosyltransferase and/or the expression sequence of fucose kinase. It also provides a recombinant cell, which is obtained by expressing fucose-1-phosphate guanylyltransferase and fucose kinase genes on the basis of Pichia pastoris GS115, through transformation to obtain fucoidan that can synthesize guanosine diphosphate Sugar recombinant strains. The construction method of the recombinant Pichia pastoris of the present invention is simple, easy to use, and has good application prospects.
附图说明Description of drawings
图1为鸟苷二磷酸岩藻糖生物合成示意图。Figure 1 is a schematic diagram of the biosynthesis of fucose guanosine diphosphate.
图2为本发明实施例1合成鸟苷二磷酸岩藻糖质粒示意图。Fig. 2 is a schematic diagram of synthesis of guanosine diphosphate fucose plasmid in Example 1 of the present invention.
图3为本发明实施例1中质粒转化表达FPGT和FCSK基因的酵母菌落PCR验证琼脂糖凝胶电泳图。Fig. 3 is an agarose gel electrophoresis diagram of PCR verification of yeast colonies expressing FPGT and FCSK genes transformed with plasmids in Example 1 of the present invention.
图4为本发明实施例2中的高效液相色谱图。Figure 4 is a high performance liquid chromatogram in Example 2 of the present invention.
图5为本发明实施例2中的质谱图。Fig. 5 is the mass spectrogram in embodiment 2 of the present invention.
具体实施方式Detailed ways
以下将结合实施例对本发明的构思及产生的技术效果进行清楚、完整地描述,以充分地 理解本发明的目的、特征和效果。显然,所描述的实施例只是本发明的一部分实施例,而不 是全部实施例,基于本发明的实施例,本领域的技术人员在不付出创造性劳动的前提下所获得的其他实施例,均属于本发明保护的范围。The concept of the present invention and the technical effects produced will be clearly and completely described below in conjunction with the embodiments, so as to fully understand the purpose, characteristics and effects of the present invention. Apparently, the described embodiments are only some of the embodiments of the present invention, rather than all of them. Based on the embodiments of the present invention, other embodiments obtained by those skilled in the art without creative efforts belong to The protection scope of the present invention.
高效液相色谱(HPLC)检测法:Agilent,UV检测器,C18柱(250×4.6mm,5μm),流动相 A:20mM pH6.0三乙胺乙酸缓冲液(TEAA),流动相B:乙腈,流速0.35mL/min,柱温40℃,进样体积为10μL。High-performance liquid chromatography (HPLC) detection method: Agilent, UV detector, C18 column (250×4.6mm, 5μm), mobile phase A: 20mM pH6.0 triethylamine acetic acid buffer (TEAA), mobile phase B: acetonitrile , the flow rate is 0.35mL/min, the column temperature is 40°C, and the injection volume is 10μL.
实施例1重组毕赤酵母的构建The construction of embodiment 1 recombinant Pichia pastoris
根据NCBI上公布的野猪[Sus scrofa(pig)]的岩藻糖-1-磷酸鸟苷转移酶(FPGT)和岩藻糖 激酶(FCSK)的序列,其中岩藻糖-1-磷酸鸟苷转移酶基因在NCBI上的Gene ID:100513664, 其氨基酸序列如SEQ ID NO.1所示;岩藻糖激酶基因在NCBI上的Gene ID:100625448,其 序列如SEQ ID NO.2所示,上述两基因均按照毕赤酵母密码子偏好性进行优化。其优化后的岩藻糖-1-磷酸鸟苷转移酶的碱基序列如SEQ ID NO.3所示,岩藻糖激酶的碱基序列如SEQ ID NO.4所示;通过酶切连接,构建序列如SEQ ID NO.5所示的重组质粒。其质粒示意图如 图2所示。According to the sequence of fucose-1-phosphate guanosine transferase (FPGT) and fucose kinase (FCSK) of wild boar [Sus scrofa (pig)] published on NCBI, in which fucose-1-phosphate guanosine transfer The Gene ID of the enzyme gene on NCBI: 100513664, its amino acid sequence is shown in SEQ ID NO.1; the Gene ID of the fucose kinase gene on NCBI: 100625448, its sequence is shown in SEQ ID NO.2, the above two Genes were optimized according to the codon bias of Pichia pastoris. The base sequence of the optimized fucose-1-phosphate guanosyltransferase is shown in SEQ ID NO.3, and the base sequence of fucose kinase is shown in SEQ ID NO.4; by enzyme cleavage and connection, A recombinant plasmid whose sequence is shown in SEQ ID NO.5 was constructed. The schematic diagram of the plasmid is shown in Figure 2.
其中:in:
SEQ ID NO.5:SEQ ID NO.5:
在SEQ ID NO.5中“_”为基因FPGT区域,“~”为FCSK区域,“…”为启动子区域。In SEQ ID NO.5, " _ " is the gene FPGT region, "~" is the FCSK region, and "..." is the promoter region.
将构建好的重组质粒电转化重组毕赤酵母GS115的感受态细胞,重组质粒的添加量为 800~1500ng,电转化条件:电压2.5kV,电击试剂5.6ms,30℃复苏1.5h,涂布终浓度为100mg/L 博莱霉素抗性的YPDZ平板,30℃培养96h,挑选若干单菌落。The constructed recombinant plasmid was electrotransformed into competent cells of recombinant Pichia pastoris GS115, the amount of recombinant plasmid added was 800-1500ng, the electroporation conditions were: voltage 2.5kV, electroporation reagent 5.6ms, recovery at 30°C for 1.5h, and final coating. Concentration of 100mg/L bleomycin-resistant YPDZ plate, cultivated at 30°C for 96h, and selected several single colonies.
通过博莱霉素抗性平板筛选,菌落PCR验证,确证岩藻糖-1-磷酸鸟苷转移酶(FPGT) 和岩藻糖激酶(FCSK)成功整合到酵母基因组。琼脂糖凝胶电泳鉴定结果如图3所示,其中条 带1、2、3、4、5均为菌落样品,M为Marker。可以看出菌落PCR验证有特殊条带,证明岩 藻糖-1-磷酸鸟苷转移酶和岩藻糖激酶成功整合的毕赤酵母GS115基因组中。确认野猪来源的岩藻糖-1-磷酸鸟苷转移酶和岩藻糖激酶表达成功,得到重组毕赤酵母GS115。Through bleomycin resistance plate screening and colony PCR verification, it was confirmed that fucose-1-phosphate guanosyltransferase (FPGT) and fucose kinase (FCSK) were successfully integrated into the yeast genome. The results of agarose gel electrophoresis identification are shown in Figure 3, in which bands 1, 2, 3, 4, and 5 are colony samples, and M is Marker. It can be seen that there are special bands in the colony PCR verification, which proves that fucose-1-phosphate guanosyltransferase and fucokinase are successfully integrated in the Pichia pastoris GS115 genome. It was confirmed that the fucose-1-phosphate guanosyltransferase and fucokinase derived from wild boar were successfully expressed, and the recombinant Pichia pastoris GS115 was obtained.
实施例2发酵生产鸟苷二磷酸岩藻糖Example 2 Fermentative production of fucose guanosine diphosphate
将实施例1中重组毕赤酵母制成种子液,种子液培养基的配方为:甘油10g/L、胰蛋白胨 10g/L、酵母粉5g/L、NaCl 10g/L;种子液制作方法为:挑取新鲜平板上的单菌落在种子培养 基中,培养24h。The recombinant Pichia pastoris in Example 1 is made into seed liquid, and the formula of seed liquid culture medium is: glycerol 10g/L, tryptone 10g/L, yeast powder 5g/L, NaCl 10g/L; Seed liquid preparation method is: Pick a single colony on a fresh plate and place it in the seed medium for 24 hours.
将种子液以OD值为1的接种量接入到发酵培养基中,发酵培养基的配方为:蛋白胨20g/L、酵母粉10g/L、1.34%YNB;10%10×pH6.0磷酸缓冲液,于30℃、250rpm条件下培养120h,每隔24h流加1%碳源,碳源为甲醇,在发酵48h后每隔24h向培养基中添加1g/L岩 藻糖至发酵结束。Insert the seed solution into the fermentation medium with an inoculation amount of OD value of 1. The formula of the fermentation medium is: peptone 20g/L, yeast powder 10g/L, 1.34% YNB; 10% 10×pH6.0 phosphate buffer culture medium at 30°C and 250rpm for 120h, adding 1% carbon source every 24h, the carbon source being methanol, and adding 1g/L fucose to the medium every 24h after fermentation for 48h until the end of fermentation.
发酵结束后,使用高效液相色谱测定发酵细胞中的鸟苷二磷酸岩藻糖,结果如图4所示, 并通过质谱进一步确证,结果如图5所示,鸟苷二磷酸岩藻糖的含量为289.8mg/L。After the fermentation, use high performance liquid chromatography to measure the guanosine diphosphate fucose in the fermented cells, the results are shown in Figure 4, and further confirmed by mass spectrometry, the results are shown in Figure 5, the guanosine diphosphate fucose The content is 289.8mg/L.
上述具体实施方式对本发明作了详细说明,但是本发明不限于上述实施例,在所属技术 领域普通技术人员所具备的知识范围内,还可以在不脱离本发明宗旨的前提下作出各种变化。 此外,在不冲突的情况下,本发明的实施例及实施例中的特征可以相互组合。The above-mentioned specific embodiments have described the present invention in detail, but the present invention is not limited to the above-mentioned embodiments, within the scope of knowledge possessed by those of ordinary skill in the art, various changes can also be made without departing from the gist of the present invention. In addition, the embodiments of the present invention and the features in the embodiments can be combined with each other if there is no conflict.
SEQUENCE LISTINGSEQUENCE LISTING
<110> 华南理工大学<110> South China University of Technology
<120> 一种合成鸟苷二磷酸岩藻糖的重组毕赤酵母及其构建方法与应用<120> A recombinant Pichia pastoris for synthesizing guanosine diphosphate fucose and its construction method and application
<130><130>
<160> 5<160> 5
<170> PatentIn version 3.5<170> PatentIn version 3.5
<210> 1<210> 1
<211> 599<211> 599
<212> PRT<212> PRT
<213> Sus scrofa<213> Sus scrofa
<400> 1<400> 1
Met Ala Ala Ala Ser Ala Pro Leu Gly Val Ser Leu Gln Glu Ala ThrMet Ala Ala Ala Ser Ala Pro Leu Gly Val Ser Leu Gln Glu Ala Thr
1 5 10 151 5 10 15
Gln Arg Arg Leu Arg Arg Phe Ser Glu Leu Arg Gly Lys Pro Val AlaGln Arg Arg Leu Arg Arg Phe Ser Glu Leu Arg Gly Lys Pro Val Ala
20 25 30 20 25 30
Ala Gly Glu Phe Trp Asp Ile Val Ala Ile Thr Ala Ala Asp Glu LysAla Gly Glu Phe Trp Asp Ile Val Ala Ile Thr Ala Ala Asp Glu Lys
35 40 45 35 40 45
Gln Glu Leu Ala Tyr Lys Gln Gln Leu Ser Glu Lys Leu Lys Lys LysGln Glu Leu Ala Tyr Lys Gln Gln Leu Ser Glu Lys Leu Lys Lys Lys
50 55 60 50 55 60
Glu Leu Pro Leu Gly Val Gln Tyr Leu Val Phe Val Asp Pro Ala GlyGlu Leu Pro Leu Gly Val Gln Tyr Leu Val Phe Val Asp Pro Ala Gly
65 70 75 8065 70 75 80
Ala Lys Ile Gly Asn Gly Gly Ser Thr Leu Cys Ala Leu Arg Cys LeuAla Lys Ile Gly Asn Gly Gly Ser Thr Leu Cys Ala Leu Arg Cys Leu
85 90 95 85 90 95
Glu Lys Leu Tyr Gly Asp Gln Trp Asn Ser Phe Thr Ile Leu Leu IleGlu Lys Leu Tyr Gly Asp Gln Trp Asn Ser Phe Thr Ile Leu Leu Ile
100 105 110 100 105 110
His Ser Gly Gly Tyr Ser Gln Arg Leu Pro Asn Ala Ser Ala Leu GlyHis Ser Gly Gly Tyr Ser Gln Arg Leu Pro Asn Ala Ser Ala Leu Gly
115 120 125 115 120 125
Lys Ile Phe Thr Ala Leu Pro Phe Gly Ser Pro Ile Tyr Gln Met LeuLys Ile Phe Thr Ala Leu Pro Phe Gly Ser Pro Ile Tyr Gln Met Leu
130 135 140 130 135 140
Glu Leu Lys Leu Ala Met Tyr Ile Asp Phe Pro Ser His Met Asn ProGlu Leu Lys Leu Ala Met Tyr Ile Asp Phe Pro Ser His Met Asn Pro
145 150 155 160145 150 155 160
Gly Ile Leu Val Thr Cys Ala Asp Asp Ile Glu Leu Tyr Ser Ile GlyGly Ile Leu Val Thr Cys Ala Asp Asp Ile Glu Leu Tyr Ser Ile Gly
165 170 175 165 170 175
Glu Ser Glu Phe Ile Arg Phe Asp Lys Pro Gly Phe Thr Ala Leu AlaGlu Ser Glu Phe Ile Arg Phe Asp Lys Pro Gly Phe Thr Ala Leu Ala
180 185 190 180 185 190
His Pro Ser Ser Leu Thr Val Gly Thr Thr His Gly Val Phe Val LeuHis Pro Ser Ser Leu Thr Val Gly Thr Thr His His Gly Val Phe Val Leu
195 200 205 195 200 205
Glu Pro Phe Asn Arg Leu Glu Tyr Arg Asp Leu Glu Tyr Arg Cys CysGlu Pro Phe Asn Arg Leu Glu Tyr Arg Asp Leu Glu Tyr Arg Cys Cys
210 215 220 210 215 220
His Arg Phe Leu His Lys Pro Ser Ile Glu Met Met Tyr Lys Phe AspHis Arg Phe Leu His Lys Pro Ser Ile Glu Met Met Tyr Lys Phe Asp
225 230 235 240225 230 235 240
Ala Val Cys Arg Pro Gly Asn Phe Ser Gln Gln Asp Phe Gly Gly GlyAla Val Cys Arg Pro Gly Asn Phe Ser Gln Gln Asp Phe Gly Gly Gly
245 250 255 245 250 255
Asp Thr Pro Ser Leu Lys Leu Asp Pro Glu Tyr Val Tyr Thr Asp SerAsp Thr Pro Ser Leu Lys Leu Asp Pro Glu Tyr Val Tyr Thr Asp Ser
260 265 270 260 265 270
Val Phe Tyr Met Asp His Lys Thr Ala Lys Lys Leu Leu Ala Phe TyrVal Phe Tyr Met Asp His Lys Thr Ala Lys Lys Leu Leu Ala Phe Tyr
275 280 285 275 280 285
Glu Lys Ile Asp Thr Leu Asn Cys Glu Ile Asp Ala Tyr Gly Asp PheGlu Lys Ile Asp Thr Leu Asn Cys Glu Ile Asp Ala Tyr Gly Asp Phe
290 295 300 290 295 300
Leu Gln Ala Leu Gly Pro Gly Ala Thr Val Glu Tyr Thr Arg Asn ThrLeu Gln Ala Leu Gly Pro Gly Ala Thr Val Glu Tyr Thr Arg Asn Thr
305 310 315 320305 310 315 320
Ser Asn Val Thr Lys Glu Glu Ser Glu Leu Val Asp Met Arg Gln ArgSer Asn Val Thr Lys Glu Glu Ser Glu Leu Val Asp Met Arg Gln Arg
325 330 335 325 330 335
Ile Phe His Leu Leu Lys Gly Thr Pro Leu Asn Val Ile Val Leu AsnIle Phe His Leu Leu Lys Gly Thr Pro Leu Asn Val Ile Val Leu Asn
340 345 350 340 345 350
Asn Ser Lys Phe Tyr His Ile Gly Thr Thr Lys Glu Tyr Leu Phe HisAsn Ser Lys Phe Tyr His Ile Gly Thr Thr Lys Glu Tyr Leu Phe His
355 360 365 355 360 365
Phe Thr Ser Asp Ser Ser Leu Lys Ser Glu Leu Gly Leu Gln Ser IlePhe Thr Ser Asp Ser Ser Leu Lys Ser Glu Leu Gly Leu Gln Ser Ile
370 375 380 370 375 380
Ala Phe Ser Ile Phe Pro Ala Ile Pro Glu Tyr Ser Gly Asn Lys SerAla Phe Ser Ile Phe Pro Ala Ile Pro Glu Tyr Ser Gly Asn Lys Ser
385 390 395 400385 390 395 400
Cys Ile Ile Gln Ser Ile Leu Asp Ser Arg Cys Ser Leu Ala Pro GlyCys Ile Ile Gln Ser Ile Leu Asp Ser Arg Cys Ser Leu Ala Pro Gly
405 410 415 405 410 415
Ser Val Val Glu Tyr Ser Arg Leu Gly Pro Asp Val Ser Val Gly GluSer Val Val Glu Tyr Ser Arg Leu Gly Pro Asp Val Ser Val Gly Glu
420 425 430 420 425 430
Asn Cys Ile Ile Ser Gly Ser His Ile Ile Thr Arg Ala Ile Leu ProAsn Cys Ile Ile Ser Gly Ser His Ile Ile Thr Arg Ala Ile Leu Pro
435 440 445 435 440 445
Ala Tyr Ser Phe Val Cys Ser Leu Ser Leu Lys Ile Asn Gly His IleAla Tyr Ser Phe Val Cys Ser Leu Ser Leu Lys Ile Asn Gly His Ile
450 455 460 450 455 460
Lys Tyr Ser Thr Met Ala Cys Gly Val Gln Asp Asn Leu Lys Lys AsnLys Tyr Ser Thr Met Ala Cys Gly Val Gln Asp Asn Leu Lys Lys Asn
465 470 475 480465 470 475 480
Val Lys Thr Leu Ser Asp Val Lys Leu Leu Gln Phe Phe Gly Val SerVal Lys Thr Leu Ser Asp Val Lys Leu Leu Gln Phe Phe Gly Val Ser
485 490 495 485 490 495
Leu Leu Ser Cys Leu Asp Val Trp Asn Leu Glu Val Thr Glu Glu LeuLeu Leu Ser Cys Leu Asp Val Trp Asn Leu Glu Val Thr Glu Glu Leu
500 505 510 500 505 510
Phe Ser Gly Asn Lys Thr Cys Leu Ser Leu Trp Asn Ala Arg Ile PhePhe Ser Gly Asn Lys Thr Cys Leu Ser Leu Trp Asn Ala Arg Ile Phe
515 520 525 515 520 525
Pro Val Cys Ser Ser Leu Ser Asp Ser Val Ile Ala Ser Leu Lys MetPro Val Cys Ser Ser Leu Ser Asp Ser Val Ile Ala Ser Leu Lys Met
530 535 540 530 535 540
Leu Asn Ala Val Gln Ser Lys Ser Val Phe Ser Leu Asn Asn Tyr LysLeu Asn Ala Val Gln Ser Lys Ser Val Phe Ser Leu Asn Asn Tyr Lys
545 550 555 560545 550 555 560
Leu Leu Ser Ile Glu Glu Met Leu Val Tyr Lys Asp Val Glu Asp MetLeu Leu Ser Ile Glu Glu Met Leu Val Tyr Lys Asp Val Glu Asp Met
565 570 575 565 570 575
Ile Thr Tyr Arg Glu Gln Ile Phe Leu Glu Ile Ala Leu Asn Arg LysIle Thr Tyr Arg Glu Gln Ile Phe Leu Glu Ile Ala Leu Asn Arg Lys
580 585 590 580 585 590
Gln Ser Val Leu Glu Thr SerGln Ser Val Leu Glu Thr Ser
595 595
<210> 2<210> 2
<211> 1083<211> 1083
<212> PRT<212> PRT
<213> Sus scrofa<213> Sus scrofa
<400> 2<400> 2
Met Glu Gln Pro Lys Gly Val Asp Trp Thr Val Ile Ile Leu Thr CysMet Glu Gln Pro Lys Gly Val Asp Trp Thr Val Ile Ile Leu Thr Cys
1 5 10 151 5 10 15
Gln Tyr Lys Asp Ser Val Glu Val Phe Gln Lys Glu Leu Glu Ile ArgGln Tyr Lys Asp Ser Val Glu Val Phe Gln Lys Glu Leu Glu Ile Arg
20 25 30 20 25 30
Gln Lys Arg Glu Gln Ile Pro Ala Ser Thr Leu Leu Leu Ala Val GluGln Lys Arg Glu Gln Ile Pro Ala Ser Thr Leu Leu Leu Ala Val Glu
35 40 45 35 40 45
Asp Pro Glu Val His Val Gly Ser Gly Gly Ala Thr Leu Asn Ala LeuAsp Pro Glu Val His Val Gly Ser Gly Gly Ala Thr Leu Asn Ala Leu
50 55 60 50 55 60
Leu Val Ala Ala Glu His Leu Ser Ala Arg Ala Gly Phe Thr Val ValLeu Val Ala Ala Glu His Leu Ser Ala Arg Ala Gly Phe Thr Val Val
65 70 75 8065 70 75 80
Thr Ser Asp Val Leu His Ser Ala Trp Ile Leu Ile Leu His Met GlyThr Ser Asp Val Leu His Ser Ala Trp Ile Leu Ile Leu His Met Gly
85 90 95 85 90 95
Arg Asp Phe Pro Phe Asp Asp Cys Gly Arg Ala Phe Thr Cys Leu ProArg Asp Phe Pro Phe Asp Asp Cys Gly Arg Ala Phe Thr Cys Leu Pro
100 105 110 100 105 110
Val Glu Asn Pro Gln Ala Pro Val Glu Ala Val Val Cys Asn Leu AspVal Glu Asn Pro Gln Ala Pro Val Glu Ala Val Val Cys Asn Leu Asp
115 120 125 115 120 125
Cys Leu Leu Asp Ile Met Ser His Arg Leu Gly Pro Gly Ser Pro ProCys Leu Leu Asp Ile Met Ser His Arg Leu Gly Pro Gly Ser Pro Pro
130 135 140 130 135 140
Gly Val Trp Val Cys Ser Thr Asp Met Leu Leu Ser Val Pro Pro AsnGly Val Trp Val Cys Ser Thr Asp Met Leu Leu Ser Val Pro Pro Asn
145 150 155 160145 150 155 160
Pro Gly Ile Asn Trp Asp Gly Phe Arg Gly Ala Arg Val Ile Ala LeuPro Gly Ile Asn Trp Asp Gly Phe Arg Gly Ala Arg Val Ile Ala Leu
165 170 175 165 170 175
Pro Gly Ser Thr Ala Tyr Ala Arg Asn His Gly Val Tyr Leu Thr AspPro Gly Ser Thr Ala Tyr Ala Arg Asn His Gly Val Tyr Leu Thr Asp
180 185 190 180 185 190
Ser Gln Gly Phe Val Leu Asp Ile Tyr Tyr Gln Gly Thr Glu Ala GluSer Gln Gly Phe Val Leu Asp Ile Tyr Tyr Gln Gly Thr Glu Ala Glu
195 200 205 195 200 205
Ile Gln Arg Cys Ala Arg Pro Asp Gly Gln Val Pro Leu Val Ser GlyIle Gln Arg Cys Ala Arg Pro Asp Gly Gln Val Pro Leu Val Ser Gly
210 215 220 210 215 220
Ile Val Phe Phe Ser Val Glu Thr Ala Glu His Leu Leu Ala Thr HisIle Val Phe Phe Ser Val Glu Thr Ala Glu His Leu Leu Ala Thr His
225 230 235 240225 230 235 240
Val Ser Pro Pro Leu Asp Ala Cys Thr Tyr Met Gly Leu Asp Ser GlyVal Ser Pro Pro Leu Asp Ala Cys Thr Tyr Met Gly Leu Asp Ser Gly
245 250 255 245 250 255
Ala Gln Pro Val Gln Leu Ser Leu Phe Phe Asp Ile Leu Leu Cys MetAla Gln Pro Val Gln Leu Ser Leu Phe Phe Asp Ile Leu Leu Cys Met
260 265 270 260 265 270
Ala Arg Asn Val Arg Arg Glu Asp Phe Leu Val Gly Arg Pro Pro GluAla Arg Asn Val Arg Arg Glu Asp Phe Leu Val Gly Arg Pro Pro Glu
275 280 285 275 280 285
Met Gly Arg Gly Asp Met Glu Thr Glu Gly Tyr Leu Arg Gly Ala ArgMet Gly Arg Gly Asp Met Glu Thr Glu Gly Tyr Leu Arg Gly Ala Arg
290 295 300 290 295 300
Ala Glu Leu Trp Arg Glu Leu Arg Asp Gln Pro Leu Thr Leu Ala TyrAla Glu Leu Trp Arg Glu Leu Arg Asp Gln Pro Leu Thr Leu Ala Tyr
305 310 315 320305 310 315 320
Val Pro Asp Gly Ser Tyr Asn Tyr Met Thr Asn Ser Ala Ser Glu PheVal Pro Asp Gly Ser Tyr Asn Tyr Met Thr Asn Ser Ala Ser Glu Phe
325 330 335 325 330 335
Leu His Ser Leu Thr Phe Pro Gly Ala Ser Gly Ala Gln Val Val HisLeu His Ser Leu Thr Phe Pro Gly Ala Ser Gly Ala Gln Val Val His
340 345 350 340 345 350
Ser Gln Val Glu Glu Gln Gln Leu Leu Gly Ala Gly Ser Ser Val ValSer Gln Val Glu Glu Gln Gln Leu Leu Gly Ala Gly Ser Ser Val Val
355 360 365 355 360 365
Ser Cys Leu Leu Glu Gly Pro Val Gln Leu Gly Pro Gly Ser Val LeuSer Cys Leu Leu Glu Gly Pro Val Gln Leu Gly Pro Gly Ser Val Leu
370 375 380 370 375 380
Gln His Cys His Leu Arg Gly Pro Ile His Ile Gly Thr Gly Cys PheGln His Cys His Leu Arg Gly Pro Ile His Ile Gly Thr Gly Cys Phe
385 390 395 400385 390 395 400
Val Ser Gly Leu Asp Ala Ala Gln Ser Glu Ala Leu His Ser Leu GluVal Ser Gly Leu Asp Ala Ala Gln Ser Glu Ala Leu His Ser Leu Glu
405 410 415 405 410 415
Leu His Asp Leu Val Leu Gln Gly His His Leu Gln Leu His Gly AlaLeu His Asp Leu Val Leu Gln Gly His His Leu Gln Leu His Gly Ala
420 425 430 420 425 430
Pro Ser Arg Ala Phe Thr Leu Val Gly Arg Leu Asp Ser Trp Glu ArgPro Ser Arg Ala Phe Thr Leu Val Gly Arg Leu Asp Ser Trp Glu Arg
435 440 445 435 440 445
Gln Gly Ala Gly Thr Tyr Leu Asn Met Ser Trp Ser Lys Phe Phe GlnGln Gly Ala Gly Thr Tyr Leu Asn Met Ser Trp Ser Lys Phe Phe Gln
450 455 460 450 455 460
Lys Thr Gly Ile Arg Asp Trp Asp Leu Trp Asp Pro Asp Thr Pro ProLys Thr Gly Ile Arg Asp Trp Asp Leu Trp Asp Pro Asp Thr Pro Pro
465 470 475 480465 470 475 480
Thr Glu Arg Cys Leu Leu Ser Ala Arg Leu Phe Pro Val Leu His ProThr Glu Arg Cys Leu Leu Ser Ala Arg Leu Phe Pro Val Leu His Pro
485 490 495 485 490 495
Leu Arg Ala Leu Gly Pro Gln Asp Met Leu Trp Met Leu Asp Pro GlnLeu Arg Ala Leu Gly Pro Gln Asp Met Leu Trp Met Leu Asp Pro Gln
500 505 510 500 505 510
Glu Asp Gly Gly Lys Ala Leu Arg Ala Trp Arg Asp Ser Trp Arg LeuGlu Asp Gly Gly Lys Ala Leu Arg Ala Trp Arg Asp Ser Trp Arg Leu
515 520 525 515 520 525
Ser Trp Glu Gln Leu Gln Pro Cys Leu Asp Arg Ala Ala Thr Leu AlaSer Trp Glu Gln Leu Gln Pro Cys Leu Asp Arg Ala Ala Thr Leu Ala
530 535 540 530 535 540
Ser Arg Arg Asp Leu Phe Phe Arg Gln Ala Leu His Lys Ala Arg HisSer Arg Arg Asp Leu Phe Phe Arg Gln Ala Leu His Lys Ala Arg His
545 550 555 560545 550 555 560
Val Leu Glu Ala Arg Gln Asp Leu Ser Leu Arg Pro Leu Ile Arg AlaVal Leu Glu Ala Arg Gln Asp Leu Ser Leu Arg Pro Leu Ile Arg Ala
565 570 575 565 570 575
Ala Val Arg Glu Gly Cys Pro Gly Pro Leu Met Ala Thr Leu Asp GlnAla Val Arg Glu Gly Cys Pro Gly Pro Leu Met Ala Thr Leu Asp Gln
580 585 590 580 585 590
Val Ala Ala Gly Ala Gly Asp Pro Gly Val Ala Ala Arg Ala Leu AlaVal Ala Ala Gly Ala Gly Asp Pro Gly Val Ala Ala Arg Ala Leu Ala
595 600 605 595 600 605
Cys Val Ala Asp Ile Leu Gly Cys Met Ala Glu Gly Arg Gly Gly LeuCys Val Ala Asp Ile Leu Gly Cys Met Ala Glu Gly Arg Gly Gly Leu
610 615 620 610 615 620
Arg Ser Gly Pro Ala Ala Asn Pro Glu Trp Val Arg Pro Phe Ser TyrArg Ser Gly Pro Ala Ala Asn Pro Glu Trp Val Arg Pro Phe Ser Tyr
625 630 635 640625 630 635 640
Leu Glu Cys Gly Asp Leu Ala Gly Gly Val Glu Ala Leu Ala Gln GluLeu Glu Cys Gly Asp Leu Ala Gly Gly Val Glu Ala Leu Ala Gln Glu
645 650 655 645 650 655
Arg Glu Lys Trp Leu Ser Arg Pro Ala Leu Leu Val Arg Ala Ala ArgArg Glu Lys Trp Leu Ser Arg Pro Ala Leu Leu Val Arg Ala Ala Arg
660 665 670 660 665 670
His Tyr Glu Gly Ala Gly Gln Ile Leu Ile Arg Gln Ala Val Met SerHis Tyr Glu Gly Ala Gly Gln Ile Leu Ile Arg Gln Ala Val Met Ser
675 680 685 675 680 685
Ala Gln His Phe Val Ser Thr Glu Pro Val Glu Leu Pro Ala Pro GlyAla Gln His Phe Val Ser Thr Glu Pro Val Glu Leu Pro Ala Pro Gly
690 695 700 690 695 700
Gln Trp Val Val Ala Glu Cys Pro Ala Arg Val Asp Phe Ser Gly GlyGln Trp Val Val Ala Glu Cys Pro Ala Arg Val Asp Phe Ser Gly Gly
705 710 715 720705 710 715 720
Trp Ser Asp Thr Pro Pro Leu Ala Tyr Glu His Gly Gly Ala Val LeuTrp Ser Asp Thr Pro Pro Leu Ala Tyr Glu His Gly Gly Ala Val Leu
725 730 735 725 730 735
Gly Leu Ala Val Arg Val Asp Gly Arg Arg Pro Ile Gly Ala Arg AlaGly Leu Ala Val Arg Val Asp Gly Arg Arg Pro Ile Gly Ala Arg Ala
740 745 750 740 745 750
Arg Arg Ile Pro Glu Pro Glu Leu Trp Leu Ala Val Gly Pro Gln HisArg Arg Ile Pro Glu Pro Glu Leu Trp Leu Ala Val Gly Pro Gln His
755 760 765 755 760 765
Asp Lys Met Ala Met Lys Ile Val Cys Arg Ser Leu Asp Asp Leu GlnAsp Lys Met Ala Met Lys Ile Val Cys Arg Ser Leu Asp Asp Leu Gln
770 775 780 770 775 780
Asp Tyr Cys Gln Pro His Ala Pro Gly Ala Leu Leu Lys Ala Ala PheAsp Tyr Cys Gln Pro His Ala Pro Gly Ala Leu Leu Lys Ala Ala Phe
785 790 795 800785 790 795 800
Ile Cys Ala Gly Ile Leu Ser Val His Ser Glu Leu Ser Leu Ser GluIle Cys Ala Gly Ile Leu Ser Val His Ser Glu Leu Ser Leu Ser Glu
805 810 815 805 810 815
Gln Leu Leu Cys Thr Phe Gly Gly Gly Phe Glu Leu Gln Thr Trp SerGln Leu Leu Cys Thr Phe Gly Gly Gly Phe Glu Leu Gln Thr Trp Ser
820 825 830 820 825 830
Glu Leu Pro His Gly Ser Gly Leu Gly Thr Ser Ser Ile Leu Ala GlyGlu Leu Pro His Gly Ser Gly Leu Gly Thr Ser Ser Ile Leu Ala Gly
835 840 845 835 840 845
Ala Ala Leu Ala Ala Leu Gln Arg Val Ala Gly Arg Ala Val Gly ThrAla Ala Leu Ala Ala Leu Gln Arg Val Ala Gly Arg Ala Val Gly Thr
850 855 860 850 855 860
Glu Ala Leu Ile His Ala Val Leu His Leu Glu Gln Val Leu Thr ThrGlu Ala Leu Ile His Ala Val Leu His Leu Glu Gln Val Leu Thr Thr
865 870 875 880865 870 875 880
Gly Gly Gly Trp Gln Asp Gln Val Gly Gly Leu Met Pro Gly Ile LysGly Gly Gly Trp Gln Asp Gln Val Gly Gly Leu Met Pro Gly Ile Lys
885 890 895 885 890 895
Val Gly Arg Ser Arg Ala Gln Leu Pro Leu Lys Val Glu Val Glu GluVal Gly Arg Ser Arg Ala Gln Leu Pro Leu Lys Val Glu Val Glu Glu
900 905 910 900 905 910
Ile Thr Val Pro Ala Gly Phe Val Gln Lys Leu Asn Asp His Leu LeuIle Thr Val Pro Ala Gly Phe Val Gln Lys Leu Asn Asp His Leu Leu
915 920 925 915 920 925
Leu Val Tyr Thr Gly Lys Thr Arg Leu Ala Arg Asn Leu Leu Gln AspLeu Val Tyr Thr Gly Lys Thr Arg Leu Ala Arg Asn Leu Leu Gln Asp
930 935 940 930 935 940
Val Leu Arg Ser Trp Tyr Ala Arg Leu Pro Pro Val Val Gln Asn AlaVal Leu Arg Ser Trp Tyr Ala Arg Leu Pro Pro Val Val Gln Asn Ala
945 950 955 960945 950 955 960
His Asn Leu Val Gln Gln Thr Glu Glu Cys Ala Glu Ala Phe Arg GlnHis Asn Leu Val Gln Gln Thr Glu Glu Cys Ala Glu Ala Phe Arg Gln
965 970 975 965 970 975
Gly Ser Leu Pro Arg Leu Gly Gln Cys Leu Thr Ser Tyr Trp Glu GlnGly Ser Leu Pro Arg Leu Gly Gln Cys Leu Thr Ser Tyr Trp Glu Gln
980 985 990 980 985 990
Lys Lys Leu Met Ala Pro Gly Cys Glu Pro Leu Ala Val Arg His MetLys Lys Leu Met Ala Pro Gly Cys Glu Pro Leu Ala Val Arg His Met
995 1000 1005 995 1000 1005
Met Asp Ala Leu Ala Pro His Val His Gly Gln Ser Leu Ala GlyMet Asp Ala Leu Ala Pro His Val His Gly Gln Ser Leu Ala Gly
1010 1015 1020 1010 1015 1020
Ala Gly Gly Gly Gly Phe Leu Tyr Leu Leu Thr Lys Glu Pro ArgAla Gly Gly Gly Gly Phe Leu Tyr Leu Leu Thr Lys Glu Pro Arg
1025 1030 1035 1025 1030 1035
Gln Lys Glu Ala Leu Glu Ala Val Leu Ala Lys Thr Glu Gly LeuGln Lys Glu Ala Leu Glu Ala Val Leu Ala Lys Thr Glu Gly Leu
1040 1045 1050 1040 1045 1050
Gly Asn Tyr Ser Val His Leu Val Glu Val Asp Thr Gln Gly LeuGly Asn Tyr Ser Val His Leu Val Glu Val Asp Thr Gln Gly Leu
1055 1060 1065 1055 1060 1065
Ser Leu Gln Leu Leu Gly Thr Glu Thr Ser Thr Gly Cys Ser PheSer Leu Gln Leu Leu Gly Thr Glu Thr Ser Ser Thr Gly Cys Ser Phe
1070 1075 1080 1070 1075 1080
<210> 3<210> 3
<211> 1800<211> 1800
<212> DNA<212>DNA
<213> 人工序列<213> Artificial sequence
<400> 3<400> 3
atggctgctg cttcagctcc attgggagtt tccttgcaag aagctactca aagaagattg 60atggctgctg cttcagctcc attgggagtt tccttgcaag aagctactca aagaagattg 60
agaagatttt ccgaattgag aggtaagcca gttgctgctg gtgaattttg ggatattgtt 120agaagatttt ccgaattgag aggtaagcca gttgctgctg gtgaattttg ggatattgtt 120
gctattactg ctgctgatga aaagcaagaa ttggcttaca agcaacaatt gtccgaaaag 180gctattactg ctgctgatga aaagcaagaa ttggcttaca agcaacaatt gtccgaaaag 180
ttgaagaaga aggaattgcc attgggagtt caatacttgg tttttgttga tccagctggt 240ttgaagaagaaga aggaattgcc attgggagtt caatacttgg tttttgttga tccagctggt 240
gctaagattg gtaacggagg tagcactttg tgtgctttga gatgtttgga aaagttgtac 300gctaagattg gtaacggagg tagcactttg tgtgctttga gatgtttgga aaagttgtac 300
ggtgatcaat ggaactcctt tactattttg ttgattcata gcgggggata ctcccaaaga 360ggtgatcaat ggaactcctt tactattttg ttgattcata gcgggggata ctcccaaaga 360
ttgcctaacg cttcagcttt gggtaagatt tttactgctt tgccatttgg ttcacctatt 420ttgcctaacg cttcagcttt gggtaagatt tttactgctt tgccatttgg ttcacctatt 420
taccaaatgt tggaattgaa gttggctatg tacattgatt ttccatcaca tatgaaccct 480taccaaatgt tggaattgaa gttggctatg tacattgatt ttccatcaca tatgaaccct 480
ggtattttgg ttacttgtgc tgatgatatt gaattgtact ccattggaga atccgaattt 540ggtattttgg ttacttgtgc tgatgatatt gaattgtact ccattggaga atccgaattt 540
attagatttg ataagccagg atttactgct ttggctcatc catcatcctt gactgttggt 600attagatttg ataagccagg atttactgct ttggctcatc catcatcctt gactgttggt 600
actactcatg gagtttttgt tttggaacca tttaacagat tggaatacag agatttggaa 660actactcatg gagtttttgt tttggaacca tttaacagat tggaatacag agattggaa 660
tacagatgtt gtcatagatt tttgcataag ccatctattg aaatgatgta caagtttgat 720tacagatgtt gtcatagatt tttgcataag ccatctattg aaatgatgta caagtttgat 720
gctgtttgta gacctggtaa cttttcccaa caagattttg gtggaggtga tactccatcc 780gctgtttgta gacctggtaa cttttcccaa caagattttg gtggaggtga tactccatcc 780
ttgaagctag atccagaata cgtttacact gattccgttt tttacatgga tcataagact 840ttgaagctag atccagaata cgtttacact gattccgttt tttacatgga tcataagact 840
gctaagaagt tgttggcttt ttacgaaaag attgatactt tgaactgtga aattgatgct 900gctaagaagt tgttggcttt ttacgaaaag attgatactt tgaactgtga aattgatgct 900
tacggtgatt ttttgcaagc tttgggacca ggagctactg ttgaatacac tagaaacact 960tacggtgatt ttttgcaagc tttgggacca ggagctactg ttgaatacac tagaaacact 960
tctaacgtta ctaaggaaga atccgaattg gttgatatga gacaaagaat ttttcatttg 1020tctaacgtta ctaaggaaga atccgaattg gttgatatga gacaaagaat ttttcatttg 1020
ttgaagggta ctccattgaa cgttattgtt ttgaacaact ctaagtttta ccatattggt 1080ttgaagggta ctccattgaa cgttattgtt ttgaacaact ctaagtttta ccatattggt 1080
actactaagg aatacttgtt tcattttact tctgattctt ccttgaagtc cgaattggga 1140actactaagg aatacttgtt tcattttact tctgattctt ccttgaagtc cgaattggga 1140
ttgcaatcaa ttgctttttc tatttttcca gctattccag aatactcagg taacaagtca 1200ttgcaatcaa ttgctttttc tatttttcca gctattccag aatactcagg taacaagtca 1200
tgtattattc aatctatttt ggattctaga tgttccttgg ctccaggatc agttgttgaa 1260tgtattattc aatctatttt ggattctaga tgttccttgg ctccaggatc agttgttgaa 1260
tactctagat tgggacctga tgtttccgtt ggtgaaaact gtattatttc aggttcacat 1320tactctagat tgggacctga tgtttccgtt ggtgaaaact gtattatttc aggttcacat 1320
attattacta gagctatttt gccagcttac tcctttgttt gttccttgtc cttgaagatt 1380attattacta gagctatttt gccagcttac tcctttgttt gttccttgtc cttgaagatt 1380
aacggacata ttaagtactc aactatggct tgtggagttc aagataactt gaagaagaac 1440aacggacata ttaagtactc aactatggct tgtggagttc aagataactt gaagaagaac 1440
gttaagactt tgtccgatgt taagttgttg caattttttg gagtttcctt gttgtcctgt 1500gttaagactt tgtccgatgt taagttgttg caattttttg gagtttcctt gttgtcctgt 1500
ttggatgttt ggaacttgga agttactgaa gaattgtttt caggtaacaa gacttgtttg 1560ttggatgttt ggaacttgga agttactgaa gaattgtttt caggtaacaa gacttgtttg 1560
tccttgtgga acgctagaat ttttccagtt tgttcttcct tgtccgattc cgttattgct 1620tccttgtgga acgctagaat ttttccagtt tgttcttcct tgtccgattc cgttattgct 1620
tccttgaaga tgttgaacgc tgttcaatct aagtcagttt tttctttgaa caactacaag 1680tccttgaaga tgttgaacgc tgttcaatct aagtcagttt tttctttgaa caactacaag 1680
ttgttgtcaa ttgaagaaat gttggtttac aaggatgttg aagatatgat tacttacaga 1740ttgttgtcaa ttgaagaaat gttggtttac aaggatgttg aagatatgat tacttacaga 1740
gaacaaattt ttttggaaat tgctttgaac agaaagcaat ccgttttgga aacttcttaa 1800gaacaaattt ttttggaaat tgctttgaac agaaagcaat ccgttttgga aacttcttaa 1800
<210> 4<210> 4
<211> 3252<211> 3252
<212> DNA<212>DNA
<213> 人工序列<213> Artificial sequence
<400> 4<400> 4
atggaacaac ctaagggagt tgattggact gttattattt tgacttgtca atacaaggat 60atggaacaac ctaagggagt tgattggact gttattattt tgacttgtca atacaaggat 60
tccgttgaag tttttcaaaa ggaattggaa attagacaaa agagagaaca aattccagct 120tccgttgaag tttttcaaaa ggaattggaa attagacaaa agagagaaca aattccagct 120
tctactttgt tgttggctgt tgaagatcct gaagttcatg ttggtagtgg aggagctact 180tctactttgt tgttggctgt tgaagatcct gaagttcatg ttggtagtgg aggagctact 180
ttgaacgctt tgttggttgc tgctgaacat ttgtccgcta gagctggatt tactgttgtt 240ttgaacgctt tgttggttgc tgctgaacat ttgtccgcta gagctggatt tactgttgtt 240
acttccgatg ttttgcattc cgcttggatt ttgattttgc atatgggtag agattttcca 300acttccgatg ttttgcattc cgcttggatt ttgattttgc atatgggtag agattttcca 300
tttgatgatt gtggtagagc ttttacttgt ttgccagttg aaaacccaca agctccagtt 360tttgatgatt gtggtagagc ttttacttgt ttgccagttg aaaacccaca agctccagtt 360
gaagctgttg tttgtaactt ggattgtttg ttggatatta tgtcacatag attgggacca 420gaagctgttg tttgtaactt ggattgtttg ttggatatta tgtcacatag attgggacca 420
ggatcacctc caggagtttg ggtttgttct actgatatgt tgttgtcagt tccacctaac 480ggatcacctc caggagtttg ggtttgttct actgatatgt tgttgtcagt tccacctaac 480
ccaggtatta actgggatgg atttagaggt gctagagtta ttgctttgcc aggaagcact 540ccaggttatta actgggatgg atttagaggt gctagagtta ttgctttgcc aggaagcact 540
gcttacgcta gaaaccatgg agtttacttg actgattccc aaggatttgt tttggatatt 600gcttacgcta gaaaccatgg agtttacttg actgattccc aaggatttgt tttggatatt 600
tactaccaag gaactgaagc tgaaattcaa agatgtgcta gacctgatgg tcaagttcca 660tactaccaag gaactgaagc tgaaattcaa agatgtgcta gacctgatgg tcaagttcca 660
ttggtttccg gtattgtttt tttttccgtt gaaactgctg aacatttgtt ggctactcat 720ttggtttccg gtattgtttt tttttccgtt gaaactgctg aacatttgtt ggctactcat 720
gtttccccac cattggatgc ttgtacttac atgggattgg attcaggtgc tcaaccagtt 780gtttccccac cattggatgc ttgtacttac atggattgg attcaggtgc tcaaccagtt 780
caattgtcat tgttttttga tattttgttg tgtatggcta gaaacgttag aagagaagat 840caattgtcat tgttttttga tattttgttg tgtatggcta gaaacgttag aagagaagat 840
tttttggttg gtagaccacc agaaatgggt agaggtgata tggaaactga aggttacttg 900tttttggttg gtagacccacc agaaatgggt agaggtgata tggaaactga aggttacttg 900
agaggtgcta gagctgaatt gtggagagaa ttgagagatc aaccattgac tttggcttac 960agaggtgcta gagctgaatt gtggagagaa ttgagagatc aaccattgac tttggcttac 960
gttcctgatg gttcttacaa ctacatgact aactccgctt cagaattttt gcattccttg 1020gttcctgatg gttcttacaa ctacatgact aactccgctt cagaattttt gcattccttg 1020
acttttccag gtgcttccgg tgctcaagtt gttcattccc aagttgaaga acaacaattg 1080acttttccag gtgcttccgg tgctcaagtt gttcattccc aagttgaaga acaacaattg 1080
ttgggtgctg gttcttccgt tgtttcctgt ttgttggaag gaccagttca attgggacca 1140ttgggtgctg gttcttccgt tgtttcctgt ttgttggaag gaccagttca attgggacca 1140
ggttccgttt tgcaacattg tcatttgaga ggacctattc atattggtac tggatgtttt 1200ggttccgttt tgcaacattg tcatttgaga ggacctattc atattggtac tggatgtttt 1200
gtttcgggct tggatgctgc tcaatccgaa gctttgcatt ccttggaatt gcatgatttg 1260gtttcgggct tggatgctgc tcaatccgaa gctttgcatt ccttggaatt gcatgatttg 1260
gttttgcaag gtcatcattt gcaattgcat ggtgctccat ccagagcttt tactttggtt 1320gttttgcaag gtcatcattt gcaattgcat ggtgctccat ccagagcttt tactttggtt 1320
ggtagattgg attcctggga aagacaaggt gctggtactt acttgaatat gtcctggtct 1380ggtagattgg attcctggga aagacaaggt gctggtactt acttgaatat gtcctggtct 1380
aagttttttc aaaagactgg tattagagat tgggatttgt gggaccctga tactcctcca 1440aagttttttc aaaagactgg tattatagagat tgggatttgt gggaccctga tactcctcca 1440
actgaaagat gtttgttgtc cgctagattg tttccagttt tgcatccatt gagagctttg 1500actgaaagat gtttgttgtc cgctagattg tttccagttt tgcatccatt gagagctttg 1500
ggaccacaag atatgttgtg gatgttagat cctcaagaag atggaggtaa ggctttgaga 1560ggaccacaag atatgttgtg gatgttagat cctcaagaag atggaggtaa ggctttgaga 1560
gcttggagag attcatggag attgtcatgg gaacaattgc aaccatgttt ggatagagct 1620gcttggagag attcatggag attgtcatgg gaacaattgc aaccatgttt ggatagagct 1620
gctactttgg cttctagaag agatttgttt tttagacaag ctttgcataa ggctcgccat 1680gctactttgg cttctagaag agatttgttt tttagacaag ctttgcataa ggctcgccat 1680
gttttggaag ctagacaaga tttgtccttg agaccattga ttagagctgc tgttagagaa 1740gttttggaag ctagacaaga tttgtccttg agaccattga ttagagctgc tgttagagaa 1740
ggatgtccag gtccattgat ggctactttg gatcaagttg ctgctggagc tggtgatcca 1800ggatgtccag gtccatgat ggctactttg gatcaagttg ctgctggagc tggtgatcca 1800
ggagttgctg ctagagcttt ggcttgtgtt gctgatattt tgggatgtat ggctgaaggt 1860ggagttgctg ctagagcttt ggcttgtgtt gctgatattt tgggatgtat ggctgaaggt 1860
agaggtggtt tgagatcagg acctgctgct aacccagaat gggttagacc attttcttac 1920agaggtggtt tgagatcagg acctgctgct aacccagaat gggttagacc attttcttac 1920
ttggaatgtg gtgatttggc tggaggagtt gaagctttgg ctcaagaaag agaaaagtgg 1980ttggaatgtg gtgatttggc tggaggagtt gaagctttgg ctcaagaaag agaaaagtgg 1980
ttgtctagac ctgctttgtt ggttagagct gctagacatt acgaaggtgc tggacaaatt 2040ttgtctagac ctgctttgtt ggttagagct gctagacatt acgaaggtgc tggacaaatt 2040
ttgattagac aagctgttat gtccgctcaa cattttgttt ctactgaacc agttgaattg 2100ttgattagac aagctgttat gtccgctcaa cattttgttt ctactgaacc agttgaattg 2100
ccagctccag gtcaatgggt tgttgctgaa tgtccagcta gagttgattt ttcaggagga 2160ccagctccag gtcaatgggt tgttgctgaa tgtccagcta gagttgattt ttcaggagga 2160
tggtccgata ctccaccatt ggcttacgaa catggaggtg ctgttttggg attggctgtt 2220tggtccgata ctccaccatt ggcttacgaa catggaggtg ctgttttggg attggctgtt 2220
agagttgatg gtagaagacc tattggtgct agagctagac gtattccaga accagaattg 2280agagttgatg gtagaagacc tattggtgct agagctagac gtattccaga accagaattg 2280
tggttggctg ttggacctca acatgataag atggctatga agattgtttg tagatccttg 2340tggttggctg ttggacctca acatgataag atggctatga agattgtttg tagatccttg 2340
gatgatttgc aagattactg tcaacctcat gctccaggag ctttgttgaa ggctgctttt 2400gatgatttgc aagattactg tcaacctcat gctccaggag ctttgttgaa ggctgctttt 2400
atttgtgctg gtattttgtc cgttcattca gaattgtcct tgtccgaaca attgttgtgt 2460atttgtgctg gtattttgtc cgttcattca gaattgtcct tgtccgaaca attgttgtgt 2460
acttttggag gaggatttga attgcaaact tggtccgaat tgccacatgg ttcaggattg 2520acttttggag gaggatttga attgcaaact tggtccgaat tgccacatgg ttcaggattg 2520
ggtacttctt ctattttggc tggtgctgct ttggctgctt tgcaaagagt tgctggtaga 2580ggtacttctt ctattttggc tggtgctgct ttggctgctt tgcaaagagt tgctggtaga 2580
gctgttggta ctgaagcttt gattcatgct gttttgcatt tggaacaagt tttgactact 2640gctgttggta ctgaagcttt gattcatgct gttttgcatt tggaacaagt tttgactact 2640
ggaggaggat ggcaagatca agttggagga ttgatgccag gtattaaggt tggaagatcc 2700ggaggaggat ggcaagatca agttggagga ttgatgccag gtattaaggt tggaagatcc 2700
cgtgcgcaat tgccattgaa ggttgaagtt gaagaaatta ctgttcctgc tggttttgtt 2760cgtgcgcaat tgccattgaa ggttgaagtt gaagaaatta ctgttcctgc tggttttgtt 2760
caaaagttga acgatcattt gttgttggtt tacactggta agactagatt ggctagaaac 2820caaaagttga acgatcattt gttgttggtt tacactggta agactagatt ggctagaaac 2820
ttgttgcaag atgttttgag atcctggtac gctagattgc caccagttgt tcaaaacgct 2880ttgttgcaag atgttttgag atcctggtac gctagattgc caccagttgt tcaaaacgct 2880
cataacttgg ttcaacaaac tgaagaatgt gctgaagctt ttagacaagg ttccttgcca 2940cataacttgg ttcaacaaac tgaagaatgt gctgaagctt ttagacaagg ttccttgcca 2940
agattgggtc aatgtttgac ttcttactgg gaacaaaaga agttgatggc tccaggatgt 3000agattgggtc aatgtttgac ttcttactgg gaacaaaaga agttgatggc tccaggatgt 3000
gaaccattgg ctgttagaca tatgatggat gctttggctc ctcatgttca tggtcaatcc 3060gaaccattgg ctgttagaca tatgatggat gctttggctc ctcatgttca tggtcaatcc 3060
ttggctggtg ctggtggagg aggttttttg tacttgttga ctaaggaacc tagacaaaag 3120ttggctggtg ctggtggagg aggttttttg tacttgttga ctaaggaacc tagacaaaag 3120
gaagctttgg aagctgtttt ggctaagact gaaggattgg gtaactactc agttcatttg 3180gaagctttgg aagctgtttt ggctaagact gaaggattgg gtaactactc agttcatttg 3180
gttgaagttg atactcaagg attgtccttg caattgttgg gaactgaaac ttctactgga 3240gttgaagttg atactcaagg attgtccttg caattgttgg gaactgaaac ttctactgga 3240
tgttcctttt aa 3252tgttcctttt aa 3252
<210> 5<210> 5
<211> 13774<211> 13774
<212> DNA<212>DNA
<213> 人工序列<213> Artificial sequence
<400> 5<400> 5
tttccatagg ctccgccccc ctgacgagca tcacaaaaat cgacgctcaa gtcagaggtg 60tttccatagg ctccgccccc ctgacgagca tcacaaaaat cgacgctcaa gtcagaggtg 60
gcgaaacccg acaggactat aaagatacca ggcgtttccc cctggaagct ccctcgtgcg 120gcgaaacccg acaggactat aaagatacca ggcgtttccc cctggaagct ccctcgtgcg 120
ctctcctgtt ccgaccctgc cgcttaccgg atacctgtcc gcctttctcc cttcgggaag 180ctctcctgtt ccgaccctgc cgcttaccgg atacctgtcc gcctttctcc cttcgggaag 180
cgtggcgctt tctcaatgct cacgctgtag gtatctcagt tcggtgtagg tcgttcgctc 240cgtggcgctt tctcaatgct cacgctgtag gtatctcagt tcggtgtagg tcgttcgctc 240
caagctgggc tgtgtgcacg aaccccccgt tcagcccgac cgctgcgcct tatccggtaa 300caagctgggc tgtgtgcacg aacccccccgt tcagcccgac cgctgcgcct tatccggtaa 300
ctatcgtctt gagtccaacc cggtaagaca cgacttatcg ccactggcag cagccactgg 360ctatcgtctt gagtccaacc cggtaagaca cgacttatcg ccactggcag cagccactgg 360
taacaggatt agcagagcga ggtatgtagg cggtgctaca gagttcttga agtggtggcc 420taacaggatt agcagagcga ggtatgtagg cggtgctaca gagttcttga agtggtggcc 420
taactacggc tacactagaa ggacagtatt tggtatctgc gctctgctga agccagttac 480taactacggc tacactagaa ggacagtatt tggtatctgc gctctgctga agccagttac 480
cttcggaaaa agagttggta gctcttgatc cggcaaacaa accaccgctg gtagccggtg 540cttcggaaaa agagttggta gctcttgatc cggcaaacaa accaccgctg gtagccggtg 540
gtttttttgt ttgcaagcag cagattacgc gcagaaaaaa aggatctcaa gaagatcctt 600gtttttttgt ttgcaagcag cagattacgc gcagaaaaaa aggatctcaa gaagatcctt 600
tgatcttttc tacggggtct gacgctcagt ggaacgaaaa ctcacgttaa gggattttgg 660tgatcttttc tacggggtct gacgctcagt ggaacgaaaa ctcacgttaa gggattttgg 660
tcatgagatc agatcttttt tgtagaaatg tcttggtgtc ctcgtccaat caggtagcca 720tcatgagatc agatcttttt tgtagaaatg tcttggtgtc ctcgtccaat caggtagcca 720
tctctgaaat atctggctcc gttgcaactc cgaacgacct gctggcaacg taaaattctc 780tctctgaaat atctggctcc gttgcaactc cgaacgacct gctggcaacg taaaattctc 780
cggggtaaaa cttaaatgtg gagtaatgga accagaaacg tctcttccct tctctctcct 840cggggtaaaa cttaaatgtg gagtaatgga accagaaacg tctcttccct tctctctcct 840
tccaccgccc gttaccgtcc ctaggaaatt ttactctgct ggagagcttc ttctacggcc 900tccaccgccc gttaccgtcc ctaggaaatt ttactctgct ggagagcttc ttctacggcc 900
cccttgcagc aatgctcttc ccagcattac gttgcgggta aaacggaggt cgtgtacccg 960cccttgcagc aatgctcttc ccagcattac gttgcgggta aaacggaggt cgtgtacccg 960
acctagcagc ccagggatgg aaaagtcccg gccgtcgctg gcaataatag cgggcggacg 1020acctagcagc ccagggatgg aaaagtcccg gccgtcgctg gcaataatag cgggcggacg 1020
catgtcatga gattattgga aaccaccaga atcgaatata aaaggcgaac acctttccca 1080catgtcatga gattattgga aaccaccaga atcgaatata aaaggcgaac acctttccca 1080
attttggttt ctcctgaccc aaagacttta aatttaattt atttgtccct atttcaatca 1140atttggttt ctcctgaccc aaagacttta aatttaattt atttgtccct atttcaatca 1140
attgaacaac tatatggctg ctgcttcagc tccattggga gtttccttgc aagaagctac 1200attgaacaac tatatggctg ctgcttcagc tccattggga gtttccttgc aagaagctac 1200
tcaaagaaga ttgagaagat tttccgaatt gagaggtaag ccagttgctg ctggtgaatt 1260tcaaagaaga ttgagaagat tttccgaatt gagaggtaag ccagttgctg ctggtgaatt 1260
ttgggatatt gttgctatta ctgctgctga tgaaaagcaa gaattggctt acaagcaaca 1320ttgggatatt gttgctatta ctgctgctga tgaaaagcaa gaattggctt acaagcaaca 1320
attgtccgaa aagttgaaga agaaggaatt gccattggga gttcaatact tggtttttgt 1380attgtccgaa aagttgaaga agaaggaatt gccattggga gttcaatact tggtttttgt 1380
tgatccagct ggtgctaaga ttggtaacgg aggtagcact ttgtgtgctt tgagatgttt 1440tgatccagct ggtgctaaga ttggtaacgg aggtagcact ttgtgtgctt tgagatgttt 1440
ggaaaagttg tacggtgatc aatggaactc ctttactatt ttgttgattc atagcggggg 1500ggaaaagttg tacggtgatc aatggaactc ctttactatt ttgttgattc atagcggggg 1500
atactcccaa agattgccta acgcttcagc tttgggtaag atttttactg ctttgccatt 1560atactcccaa agattgccta acgcttcagc tttgggtaag atttttactg ctttgccatt 1560
tggttcacct atttaccaaa tgttggaatt gaagttggct atgtacattg attttccatc 1620tggttcacct atttaccaaa tgttggaatt gaagttggct atgtacattg attttccatc 1620
acatatgaac cctggtattt tggttacttg tgctgatgat attgaattgt actccattgg 1680acatatgaac cctggtattt tggttacttg tgctgatgat attgaattgt actccattgg 1680
agaatccgaa tttattagat ttgataagcc aggatttact gctttggctc atccatcatc 1740agaatccgaa tttattagat ttgataagcc aggatttact gctttggctc atccatcatc 1740
cttgactgtt ggtactactc atggagtttt tgttttggaa ccatttaaca gattggaata 1800cttgactgtt ggtactactc atggagtttt tgttttggaa ccatttaaca gattggaata 1800
cagagatttg gaatacagat gttgtcatag atttttgcat aagccatcta ttgaaatgat 1860cagagatttg gaatacagat gttgtcatag atttttgcat aagccatcta ttgaaatgat 1860
gtacaagttt gatgctgttt gtagacctgg taacttttcc caacaagatt ttggtggagg 1920gtacaagttt gatgctgttt gtagacctgg taacttttcc caacaagatt ttggtggagg 1920
tgatactcca tccttgaagc tagatccaga atacgtttac actgattccg ttttttacat 1980tgatactcca tccttgaagc tagatccaga atacgtttac actgattccg ttttttacat 1980
ggatcataag actgctaaga agttgttggc tttttacgaa aagattgata ctttgaactg 2040ggatcataag actgctaaga agttgttggc tttttacgaa aagattgata ctttgaactg 2040
tgaaattgat gcttacggtg attttttgca agctttggga ccaggagcta ctgttgaata 2100tgaaattgat gcttacggtg attttttgca agctttggga ccaggagcta ctgttgaata 2100
cactagaaac acttctaacg ttactaagga agaatccgaa ttggttgata tgagacaaag 2160cactagaaac acttctaacg ttactaagga agaatccgaa ttggttgata tgagacaaag 2160
aatttttcat ttgttgaagg gtactccatt gaacgttatt gttttgaaca actctaagtt 2220aatttttcat ttgttgaagg gtactccatt gaacgttatt gttttgaaca actctaagtt 2220
ttaccatatt ggtactacta aggaatactt gtttcatttt acttctgatt cttccttgaa 2280ttaccatatt ggtactacta aggaatactt gtttcatttt acttctgatt cttccttgaa 2280
gtccgaattg ggattgcaat caattgcttt ttctattttt ccagctattc cagaatactc 2340gtccgaattg ggattgcaat caattgcttt ttctattttt ccagctattc cagaatactc 2340
aggtaacaag tcatgtatta ttcaatctat tttggattct agatgttcct tggctccagg 2400aggtaacaag tcatgtatta ttcaatctat tttggattct agatgttcct tggctccagg 2400
atcagttgtt gaatactcta gattgggacc tgatgtttcc gttggtgaaa actgtattat 2460atcagttgtt gaatactcta gattgggacc tgatgtttcc gttggtgaaa actgtattat 2460
ttcaggttca catattatta ctagagctat tttgccagct tactcctttg tttgttcctt 2520ttcaggttca catattatta ctagagctat tttgccagct tactcctttg tttgttcctt 2520
gtccttgaag attaacggac atattaagta ctcaactatg gcttgtggag ttcaagataa 2580gtccttgaag attaacggac atattaagta ctcaactatg gcttgtggag ttcaagataa 2580
cttgaagaag aacgttaaga ctttgtccga tgttaagttg ttgcaatttt ttggagtttc 2640cttgaagaag aacgttaaga ctttgtccga tgttaagttg ttgcaatttt ttggagtttc 2640
cttgttgtcc tgtttggatg tttggaactt ggaagttact gaagaattgt tttcaggtaa 2700cttgttgtcc tgtttggatg tttggaactt ggaagttact gaagaattgt tttcaggtaa 2700
caagacttgt ttgtccttgt ggaacgctag aatttttcca gtttgttctt ccttgtccga 2760caagacttgt ttgtccttgt ggaacgctag aatttttcca gtttgttctt ccttgtccga 2760
ttccgttatt gcttccttga agatgttgaa cgctgttcaa tctaagtcag ttttttcttt 2820ttccgttatt gcttcccttga agatgttgaa cgctgttcaa tctaagtcag ttttttcttt 2820
gaacaactac aagttgttgt caattgaaga aatgttggtt tacaaggatg ttgaagatat 2880gaacaactac aagttgttgt caattgaaga aatgttggtt tacaaggatg ttgaagatat 2880
gattacttac agagaacaaa tttttttgga aattgctttg aacagaaagc aatccgtttt 2940gattacttac agagaacaaa tttttttgga aattgctttg aacagaaagc aatccgtttt 2940
ggaaacttct taagaacaaa aactcatctc agaagaggat ctgaatagcg ccgtcgacca 3000ggaaacttct taagaacaaa aactcatctc agaagaggat ctgaatagcg ccgtcgacca 3000
tcatcatcat catcattgag ttttagcctt agacatgact gttcctcagt tcaagttggg 3060tcatcatcat catcattgag ttttagcctt agacatgact gttcctcagt tcaagttggg 3060
cacttacgag aagaccggtc ttgctagatt ctaatcaaga ggatgtcaga atgccatttg 3120cacttacgag aagaccggtc ttgctagatt ctaatcaaga ggatgtcaga atgccatttg 3120
cctgagagat gcaggcttca tttttgatac ttttttattt gtaacctata tagtatagga 3180cctgagagat gcaggcttca tttttgatac ttttttattt gtaacctata tagtatagga 3180
ttttttttgt cattttgttt cttctcgtac gagcttgctc ctgatcagcc tatctcgcag 3240ttttttttgt cattttgttt cttctcgtac gagcttgctc ctgatcagcc tatctcgcag 3240
ctgatgaata tcttgtggta ggggtttggg aaaatcattc gagtttgatg tttttcttgg 3300ctgatgaata tcttgtggta ggggtttggg aaaatcattc gagtttgatg tttttcttgg 3300
tatttcccac tcctcttcag agtacagaag attaagtgag aacgcgtaga tcttttttgt 3360tatttcccac tcctcttcag agtacagaag attaagtgag aacgcgtaga tcttttttgt 3360
agaaatgtct tggtgtcctc gtccaatcag gtagccatct ctgaaatatc tggctccgtt 3420agaaatgtct tggtgtcctc gtccaatcag gtagccatct ctgaaatatc tggctccgtt 3420
gcaactccga acgacctgct ggcaacgtaa aattctccgg ggtaaaactt aaatgtggag 3480gcaactccga acgacctgct ggcaacgtaa aattctccgg ggtaaaactt aaatgtggag 3480
taatggaacc agaaacgtct cttcccttct ctctccttcc accgcccgtt accgtcccta 3540taatggaacc agaaacgtct cttcccttct ctctccttcc accgcccgtt accgtcccta 3540
ggaaatttta ctctgctgga gagcttcttc tacggccccc ttgcagcaat gctcttccca 3600ggaaatttta ctctgctgga gagcttcttc tacggccccc ttgcagcaat gctcttccca 3600
gcattacgtt gcgggtaaaa cggaggtcgt gtacccgacc tagcagccca gggatggaaa 3660gcattacgtt gcgggtaaaa cggaggtcgt gtacccgacc tagcagccca gggatggaaa 3660
agtcccggcc gtcgctggca ataatagcgg gcggacgcat gtcatgagat tattggaaac 3720agtcccggcc gtcgctggca ataatagcgg gcggacgcat gtcatgagat tattggaaac 3720
caccagaatc gaatataaaa ggcgaacacc tttcccaatt ttggtttctc ctgacccaaa 3780caccagaatc gaatataaaa ggcgaacacc tttcccaatt ttggtttctc ctgacccaaa 3780
gactttaaat ttaatttatt tgtccctatt tcaatcaatt gaacaactat ttcgaaacga 3840gactttaaat ttaatttatt tgtccctatt tcaatcaatt gaacaactat ttcgaaacga 3840
ggaattcatg gaacaaccta agggagttga ttggactgtt attattttga cttgtcaata 3900ggaattcatg gaacaaccta agggagttga ttggactgtt attattttga cttgtcaata 3900
caaggattcc gttgaagttt ttcaaaagga attggaaatt agacaaaaga gagaacaaat 3960caaggattcc gttgaagttt ttcaaaagga attggaaatt agacaaaaga gagaacaaat 3960
tccagcttct actttgttgt tggctgttga agatcctgaa gttcatgttg gtagtggagg 4020tccagcttct actttgttgt tggctgttga agatcctgaa gttcatgttg gtagtggagg 4020
agctactttg aacgctttgt tggttgctgc tgaacatttg tccgctagag ctggatttac 4080agctactttg aacgctttgt tggttgctgc tgaacatttg tccgctagag ctggatttac 4080
tgttgttact tccgatgttt tgcattccgc ttggattttg attttgcata tgggtagaga 4140tgttgttact tccgatgttt tgcattccgc ttggattttg attttgcata tgggtagaga 4140
ttttccattt gatgattgtg gtagagcttt tacttgtttg ccagttgaaa acccacaagc 4200ttttccattt gatgattgtg gtagagcttt tacttgtttg ccagttgaaa accccacaagc 4200
tccagttgaa gctgttgttt gtaacttgga ttgtttgttg gatattatgt cacatagatt 4260tccagttgaa gctgttgttt gtaacttgga ttgtttgttg gatattatgt cacatagatt 4260
gggaccagga tcacctccag gagtttgggt ttgttctact gatatgttgt tgtcagttcc 4320gggaccagga tcacctccag gagtttgggt ttgttctact gatatgttgt tgtcagttcc 4320
acctaaccca ggtattaact gggatggatt tagaggtgct agagttattg ctttgccagg 4380acctaaccca ggtattaact gggatggatt tagaggtgct agagttatg ctttgccagg 4380
aagcactgct tacgctagaa accatggagt ttacttgact gattcccaag gatttgtttt 4440aagcactgct tacgctagaa accatggagt ttacttgact gattcccaag gatttgtttt 4440
ggatatttac taccaaggaa ctgaagctga aattcaaaga tgtgctagac ctgatggtca 4500ggatattac taccaaggaa ctgaagctga aattcaaaga tgtgctagac ctgatggtca 4500
agttccattg gtttccggta ttgttttttt ttccgttgaa actgctgaac atttgttggc 4560agttccattg gtttccggta ttgtttttttttccgttgaa actgctgaac atttgttggc 4560
tactcatgtt tccccaccat tggatgcttg tacttacatg ggattggatt caggtgctca 4620tactcatgtt tccccaccat tggatgcttg tacttacatg ggattggatt caggtgctca 4620
accagttcaa ttgtcattgt tttttgatat tttgttgtgt atggctagaa acgttagaag 4680accagttcaa ttgtcattgt tttttgatat tttgttgtgt atggctagaa acgttagaag 4680
agaagatttt ttggttggta gaccaccaga aatgggtaga ggtgatatgg aaactgaagg 4740agaagatttt ttggttggta gaccaccaga aatgggtaga ggtgattgg aaactgaagg 4740
ttacttgaga ggtgctagag ctgaattgtg gagagaattg agagatcaac cattgacttt 4800ttacttgaga ggtgctagag ctgaattgtg gagagaattg agagatcaac cattgacttt 4800
ggcttacgtt cctgatggtt cttacaacta catgactaac tccgcttcag aatttttgca 4860ggcttacgtt cctgatggtt cttacaacta catgactaac tccgcttcag aatttttgca 4860
ttccttgact tttccaggtg cttccggtgc tcaagttgtt cattcccaag ttgaagaaca 4920ttccttgact tttccaggtg cttccggtgc tcaagttgtt cattcccaag ttgaagaaca 4920
acaattgttg ggtgctggtt cttccgttgt ttcctgtttg ttggaaggac cagttcaatt 4980acaattgttg ggtgctggtt cttccgttgt ttcctgtttg ttggaaggac cagttcaatt 4980
gggaccaggt tccgttttgc aacattgtca tttgagagga cctattcata ttggtactgg 5040gggaccaggt tccgttttgc aacattgtca tttgagagga cctattcata ttggtactgg 5040
atgttttgtt tcgggcttgg atgctgctca atccgaagct ttgcattcct tggaattgca 5100atgttttgtt tcgggcttgg atgctgctca atccgaagct ttgcattcct tggaattgca 5100
tgatttggtt ttgcaaggtc atcatttgca attgcatggt gctccatcca gagcttttac 5160tgatttggtt ttgcaaggtc atcatttgca attgcatggt gctccatcca gagcttttac 5160
tttggttggt agattggatt cctgggaaag acaaggtgct ggtacttact tgaatatgtc 5220tttggttggt agattggatt cctgggaaag acaaggtgct ggtacttact tgaatatgtc 5220
ctggtctaag ttttttcaaa agactggtat tagagattgg gatttgtggg accctgatac 5280ctggtctaag ttttttcaaa agactggtat tagagattgg gatttgtggg accctgatac 5280
tcctccaact gaaagatgtt tgttgtccgc tagattgttt ccagttttgc atccattgag 5340tcctccaact gaaagatgtt tgttgtccgc tagattgttt ccagttttgc atccattgag 5340
agctttggga ccacaagata tgttgtggat gttagatcct caagaagatg gaggtaaggc 5400agctttggga ccacaagata tgttgtggat gttagatcct caagaagatg gaggtaaggc 5400
tttgagagct tggagagatt catggagatt gtcatgggaa caattgcaac catgtttgga 5460tttgagagct tggagagatt catggagatt gtcatgggaa caattgcaac catgtttgga 5460
tagagctgct actttggctt ctagaagaga tttgtttttt agacaagctt tgcataaggc 5520tagagctgct actttggctt ctagaagaga tttgtttttt agacaagctt tgcataaggc 5520
tcgccatgtt ttggaagcta gacaagattt gtccttgaga ccattgatta gagctgctgt 5580tcgccatgtt ttggaagcta gacaagattt gtccttgaga ccattgatta gagctgctgt 5580
tagagaagga tgtccaggtc cattgatggc tactttggat caagttgctg ctggagctgg 5640tagagaagga tgtccaggtc cattgatggc tactttggat caagttgctg ctggagctgg 5640
tgatccagga gttgctgcta gagctttggc ttgtgttgct gatattttgg gatgtatggc 5700tgatccagga gttgctgcta gagctttggc ttgtgttgct gatattttgg gatgtatggc 5700
tgaaggtaga ggtggtttga gatcaggacc tgctgctaac ccagaatggg ttagaccatt 5760tgaaggtaga ggtggtttga gatcaggacc tgctgctaac ccagaatggg ttagaccatt 5760
ttcttacttg gaatgtggtg atttggctgg aggagttgaa gctttggctc aagaaagaga 5820ttcttacttg gaatgtggtg atttggctgg aggagttgaa gctttggctc aagaaagaga 5820
aaagtggttg tctagacctg ctttgttggt tagagctgct agacattacg aaggtgctgg 5880aaagtggttg tctagacctg ctttgttggt tagagctgct agacattacg aaggtgctgg 5880
acaaattttg attagacaag ctgttatgtc cgctcaacat tttgtttcta ctgaaccagt 5940acaaattttg attagacaag ctgttatgtc cgctcaacat tttgtttcta ctgaaccagt 5940
tgaattgcca gctccaggtc aatgggttgt tgctgaatgt ccagctagag ttgatttttc 6000tgaattgcca gctccaggtc aatgggttgt tgctgaatgt ccagctagag ttgatttttc 6000
aggaggatgg tccgatactc caccattggc ttacgaacat ggaggtgctg ttttgggatt 6060aggaggatgg tccgatactc caccattggc ttacgaacat ggaggtgctg ttttgggatt 6060
ggctgttaga gttgatggta gaagacctat tggtgctaga gctagacgta ttccagaacc 6120ggctgttaga gttgatggta gaagacctat tggtgctaga gctagacgta ttccagaacc 6120
agaattgtgg ttggctgttg gacctcaaca tgataagatg gctatgaaga ttgtttgtag 6180agaattgtgg ttggctgttg gacctcaaca tgataagatg gctatgaaga ttgtttgtag 6180
atccttggat gatttgcaag attactgtca acctcatgct ccaggagctt tgttgaaggc 6240atccttggat gatttgcaag attackgtca acctcatgct ccaggagctt tgttgaaggc 6240
tgcttttatt tgtgctggta ttttgtccgt tcattcagaa ttgtccttgt ccgaacaatt 6300tgcttttatt tgtgctggta ttttgtccgt tcattcagaa ttgtccttgt ccgaacaatt 6300
gttgtgtact tttggaggag gatttgaatt gcaaacttgg tccgaattgc cacatggttc 6360gttgtgtact tttggaggag gatttgaatt gcaaacttgg tccgaattgc cacatggttc 6360
aggattgggt acttcttcta ttttggctgg tgctgctttg gctgctttgc aaagagttgc 6420aggattgggt acttcttcta ttttggctgg tgctgctttg gctgctttgc aaagagttgc 6420
tggtagagct gttggtactg aagctttgat tcatgctgtt ttgcatttgg aacaagtttt 6480tggtagagct gttggtactg aagctttgat tcatgctgtt ttgcatttgg aacaagtttt 6480
gactactgga ggaggatggc aagatcaagt tggaggattg atgccaggta ttaaggttgg 6540gactactgga ggaggatggc aagatcaagt tggaggattg atgccaggta ttaaggttgg 6540
aagatcccgt gcgcaattgc cattgaaggt tgaagttgaa gaaattactg ttcctgctgg 6600aagatcccgt gcgcaattgc cattgaaggt tgaagttgaa gaaattactg ttcctgctgg 6600
ttttgttcaa aagttgaacg atcatttgtt gttggtttac actggtaaga ctagattggc 6660ttttgttcaa aagttgaacg atcatttgtt gttggtttac actggtaaga ctagattggc 6660
tagaaacttg ttgcaagatg ttttgagatc ctggtacgct agattgccac cagttgttca 6720tagaaacttg ttgcaagatg ttttgagatc ctggtacgct agattgccac cagttgttca 6720
aaacgctcat aacttggttc aacaaactga agaatgtgct gaagctttta gacaaggttc 6780aaacgctcat aacttggttc aacaaactga agaatgtgct gaagctttta gacaaggttc 6780
cttgccaaga ttgggtcaat gtttgacttc ttactgggaa caaaagaagt tgatggctcc 6840cttgccaaga ttgggtcaat gtttgacttc ttactgggaa caaaagaagt tgatggctcc 6840
aggatgtgaa ccattggctg ttagacatat gatggatgct ttggctcctc atgttcatgg 6900aggatgtgaa ccattggctg ttagacatat gatggatgct ttggctcctc atgttcatgg 6900
tcaatccttg gctggtgctg gtggaggagg ttttttgtac ttgttgacta aggaacctag 6960tcaatccttg gctggtgctg gtggaggagg ttttttgtac ttgttgacta aggaacctag 6960
acaaaaggaa gctttggaag ctgttttggc taagactgaa ggattgggta actactcagt 7020acaaaaggaa gctttggaag ctgttttggc taagactgaa ggattgggta actactcagt 7020
tcatttggtt gaagttgata ctcaaggatt gtccttgcaa ttgttgggaa ctgaaacttc 7080tcatttggtt gaagttgata ctcaaggatt gtccttgcaa ttgttgggaa ctgaaacttc 7080
tactggatgt tccttttaag cggccgccag cttgggcccg aacaaaaact catctcagaa 7140tactggatgt tccttttaag cggccgccag cttgggcccg aacaaaaact catctcagaa 7140
gaggatctga atagcgccgt cgaccatcat catcatcatc attgagtttt agccttagac 7200gaggatctga atagcgccgt cgaccatcat catcatcatc attgagtttt agccttagac 7200
atgactgttc ctcagttcaa gttgggcact tacgagaaga ccggtcttgc tagattctaa 7260atgactgttc ctcagttcaa gttgggcact tacgagaaga ccggtcttgc tagattctaa 7260
tcaagaggat gtcagaatgc catttgcctg agagatgcag gcttcatttt tgatactttt 7320tcaagaggat gtcagaatgc catttgcctg agagatgcag gcttcatttt tgatactttt 7320
ttatttgtaa cctatatagt ataggatttt ttttgtcatt ttgtttcttc tcgtacgagc 7380ttatttgtaa cctatatagt ataggatttt ttttgtcatt ttgtttcttc tcgtacgagc 7380
ttgctcctga tcagcctatc tcgcagctga tgaatatctt gtggtagggg tttgggaaaa 7440ttgctcctga tcagcctatc tcgcagctga tgaatatctt gtggtagggg tttgggaaaa 7440
tcattcgagt ttgatgtttt tcttggtatt tcccactcct cttcagagta cagaagatta 7500tcattcgagt ttgatgtttt tcttggtatt tcccactcct cttcagagta cagaagatta 7500
agtgagacct tcgtttgtgc ggatccatga catttccctt gctacctgca tacgcaagtg 7560agtgagacct tcgtttgtgc ggatccatga catttccctt gctacctgca tacgcaagtg 7560
ttgcagagtt tgataattcc ttgagtttgg taggaaaagc cgtgtttccc tatgctgctg 7620ttgcagagtt tgataattcc ttgagtttgg taggaaaagc cgtgtttccc tatgctgctg 7620
accagctgca caacctgatc aagttcactc aatcgactga gcttcaagtt aatgtgcaag 7680accagctgca caacctgatc aagttcactc aatcgactga gcttcaagtt aatgtgcaag 7680
ttgagtcatc cgttacagag gaccaatttg aggagctgat cgacaacttg ctcaagttgt 7740ttgagtcatc cgttacagag gaccaatttg aggagctgat cgacaacttg ctcaagttgt 7740
acaataatgg tatcaatgaa gtgattttgg acctagattt ggcagaaaga gttgtccaaa 7800acaataatgg tatcaatgaa gtgattttgg acctagattt ggcagaaaga gttgtccaaa 7800
ggatgatccc aggcgctagg gttatctata ggaccctggt tgataaagtt gcatccttgc 7860ggatgatccc aggcgctagg gttatctata ggaccctggt tgataaagtt gcatccttgc 7860
ccgctaatgc tagtatcgct gtgccttttt cttctccact gggcgatttg aaaagtttca 7920ccgctaatgc tagtatcgct gtgccttttt cttctccact gggcgatttg aaaagtttca 7920
ctaatggcgg tagtagaact gtttatgctt tttctgagac cgcaaagttg gtagatgtga 7980ctaatggcgg tagtagaact gtttatgctt tttctgagac cgcaaagttg gtagatgtga 7980
cttccactgt tgcttctggt ataatcccca ttattgatgc tcggcaattg actactgaat 8040cttccactgt tgcttctggt ataatcccca ttattgaatgc tcggcaattg actactgaat 8040
acgaactttc tgaagatgtc aaaaagttcc ctgtcagtga aattttgttg gcgtctttga 8100acgaactttc tgaagatgtc aaaaagttcc ctgtcagtga aattttgttg gcgtctttga 8100
ctactgaccg ccccgatggt ctattcacta ctttggtggc tgactcttct aattactcgt 8160ctactgaccg ccccgatggt ctattcacta ctttggtggc tgactcttct aattactcgt 8160
tgggcctggt gtactcgtcc aaaaagtcta ttccggaggc tataaggaca caaactggag 8220tgggcctggt gtactcgtcc aaaaagtcta ttccggaggc tataaggaca caaactggag 8220
tctaccaatc tcgtcgtcac ggtttgtggt ataaaggtgc tacatctgga gcaactcaaa 8280tctaccaatc tcgtcgtcac ggtttgtggt ataaaggtgc tacatctgga gcaactcaaa 8280
agttgctggg tatcgaattg gattgtgatg gagactgctt gaaatttgtg gttgaacaaa 8340agttgctggg tatcgaattg gattgtgatg gagactgctt gaaatttgtg gttgaacaaa 8340
caggtgttgg tttctgtcac ttggaacgca cttcctgttt tggccaatca aagggtctta 8400caggtgttgg tttctgtcac ttggaacgca cttcctgttt tggccaatca aagggtctta 8400
gagccatgga agccaccttg tgggatcgta agagcaatgc tccagaaggt tcttatacca 8460gagccatgga agccaccttg tgggatcgta agagcaatgc tccagaaggt tcttatacca 8460
aacggttatt tgacgacgaa gttttgttga acgctaaaat tagggaggaa gctgatgaac 8520aacggttattgacgacgaa gttttgttga acgctaaaat tagggaggaa gctgatgaac 8520
ttgcagaagc taaatccaag gaagatatag cctgggaatg tgctgactta ttttattttg 8580ttgcagaagc taaatccaag gaagatatag cctgggaatg tgctgactta ttttattttg 8580
cattagttag atgtgccaag tacggtgtga cgttggacga ggtggagaga aacctggata 8640cattagttag atgtgccaag tacggtgtga cgttggacga ggtggagaga aacctggata 8640
tgaagtccct aaaggtcact agaaggaaag gagatgccaa gccaggatac accaaggaac 8700tgaagtccct aaaggtcact agaaggaaag gagatgccaa gccaggatac accaaggaac 8700
aacctaaaga agaatccaaa cctaaagaag tcccttctga aggtcgtatt gaattgtgca 8760aacctaaaga agaatccaaa cctaaagaag tcccttctga aggtcgtatt gaattgtgca 8760
aaattgacgt ttctaaggcc tcctcacaag aaattgaaga tgcccttcgt cgtcctatcc 8820aaattgacgt ttctaaggcc tcctcacaag aaattgaaga tgcccttcgt cgtcctatcc 8820
agaaaacgga acagattatg gaattagtca aaccaattgt cgacaatgtt cgtcaaaatg 8880agaaaacgga acagattatg gaattagtca aaccaattgt cgacaatgtt cgtcaaaatg 8880
gtgacaaagc ccttttagaa ctaactgcca agtttgatgg agtcgctttg aagacacctg 8940gtgacaaagc ccttttagaa ctaactgcca agtttgatgg agtcgctttg aagacacctg 8940
tgttagaagc tcctttccca gaggaactta tgcaattgcc agataacgtt aagagagcca 9000tgttagaagc tcctttccca gaggaactta tgcaattgcc agataacgtt aagagagcca 9000
ttgatctctc tatagataac gtcaggaaat tccatgaagc tcaactaacg gagacgttgc 9060ttgatctctc tatagataac gtcaggaaat tccatgaagc tcaactaacg gagacgttgc 9060
aagttgagac ttgccctggt gtagtctgct ctcgttttgc aagacctatt gagaaagttg 9120aagttgagac ttgccctggt gtagtctgct ctcgttttgc aagacctatt gagaaagttg 9120
gcctctatat tcctggtgga accgcaattc tgccttccac ttccctgatg ctgggtgttc 9180gcctctatat tcctggtgga accgcaattc tgccttccac ttccctgatg ctgggtgttc 9180
ctgccaaagt tgctggttgc aaagaaattg tttttgcatc tccacctaag aaggatggta 9240ctgccaaagt tgctggttgc aaagaaattg tttttgcatc tccacctaag aaggatggta 9240
cccttacccc agaagtcatc tacgttgccc acaaggttgg tgctaagtgt atcgtgctag 9300cccttacccc agaagtcatc tacgttgccc acaaggttgg tgctaagtgt atcgtgctag 9300
caggaggcgc ccaggcagta gctgctatgg cttacggaac agaaactgtt cctaagtgtg 9360caggaggcgc ccaggcagta gctgctatgg cttacggaac agaaactgtt cctaagtgtg 9360
acaaaatatt tggtccagga aaccagttcg ttactgctgc caagatgatg gttcaaaatg 9420acaaaatatt tggtccagga aaccagttcg ttactgctgc caagatgatg gttcaaaatg 9420
acacatcagc cctgtgtagt attgacatgc ctgctgggcc ttctgaagtt ctagttattg 9480acacatcagc cctgtgtagt attgacatgc ctgctgggcc ttctgaagtt ctagttattg 9480
ctgataaata cgctgatcca gatttcgttg cctcagacct tctgtctcaa gctgaacatg 9540ctgataaata cgctgatcca gatttcgttg cctcagacct tctgtctcaa gctgaacatg 9540
gtattgattc ccaggtgatt ctgttggctg tcgatatgac agacaaggag cttgccagaa 9600gtattgattc ccaggtgatt ctgttggctg tcgatatgac agacaaggag cttgccagaa 9600
ttgaagatgc tgttcacaac caagctgtgc agttgccaag ggttgaaatt gtacgcaagt 9660ttgaagatgc tgttcacaac caagctgtgc agttgccaag ggttgaaatt gtacgcaagt 9660
gtattgcaca ctctacaacc ctatcggttg caacctacga gcaggctttg gaaatgtcca 9720gtattgcaca ctctacaacc ctatcggttg caacctacga gcaggctttg gaaatgtcca 9720
atcagtacgc tcctgaacac ttgatcctgc aaatcgagaa tgcttcttct tatgttgatc 9780atcagtacgc tcctgaacac ttgatcctgc aaatcgagaa tgcttcttct tatgttgatc 9780
aagtacaaca cgctggatct gtgtttgttg gtgcctactc tccagagagt tgtggagatt 9840aagtacaaca cgctggatct gtgtttgttg gtgcctactc tccagagagt tgtggagatt 9840
actcctccgg taccaaccac actttgccaa cgtacggata tgcccgtcaa tacagcggag 9900actcctccgg taccaaccac actttgccaa cgtacggata tgcccgtcaa tacagcggag 9900
ttaacactgc aaccttccag aagttcatca cttcacaaga cgtaactcct gagggactga 9960ttaacactgc aaccttccag aagttcatca cttcacaaga cgtaactcct gagggactga 9960
aacatattgg ccaagcagtg atggatctgg ctgctgttga aggtctagat gctcaccgca 10020aacatattgg ccaagcagtg atggatctgg ctgctgttga aggtctagat gctcaccgca 10020
atgctgttaa ggttcgtatg gagaaactgg gacttattta aggatcctac cgttcgtata 10080atgctgttaa ggttcgtatg gagaaactgg gacttattta aggatcctac cgttcgtata 10080
gcatacatta tacgaagtta taacatccaa agacgaaagg ttgaatgaaa cctttttgcc 10140gcatacatta tacgaagtta taacatccaa agacgaaagg ttgaatgaaa cctttttgcc 10140
atccgacatc cacaggtcca ttctcacaca taagtgccaa acgcaacagg aggggataca 10200atccgacatc cacaggtcca ttctcacaca taagtgccaa acgcaacagg aggggtaca 10200
ctagcagcag accgttgcaa acgcaggacc tccactcctc ttctcctcaa cacccacttt 10260ctagcagcag accgttgcaa acgcaggacc tccactcctc ttctcctcaa cacccacttt 10260
tgccatcgaa aaaccagccc agttattggg cttgattgga gctcgctcat tccaattcct 10320tgccatcgaa aaaccagccc agttattggg cttgattgga gctcgctcat tccaattcct 10320
tctattaggc tactaacacc atgactttat tagcctgtct atcctggccc ccctggcgag 10380tctattaggc tactaacacc atgactttat tagcctgtct atcctggccc ccctggcgag 10380
gttcatgttt gtttatttcc gaatgcaaca agctccgcat tacacccgaa catcactcca 10440gttcatgttt gtttatttcc gaatgcaaca agctccgcat tacacccgaa catcactcca 10440
gatgagggct ttctgagtgt ggggtcaaat agtttcatgt tccccaaatg gcccaaaact 10500gatgagggct ttctgagtgtggggtcaaat agtttcatgt tccccaaatg gcccaaaact 10500
gacagtttaa acgctgtctt ggaacctaat atgacaaaag cgtgatctca tccaagatga 10560gacagtttaa acgctgtctt ggaacctaat atgacaaaag cgtgatctca tccaagatga 10560
actaagtttg gttcgttgaa atgctaacgg ccagttggtc aaaaagaaac ttccaaaagt 10620actaagtttg gttcgttgaa atgctaacgg ccagttggtc aaaaagaaac ttccaaaagt 10620
cggcataccg tttgtcttgt ttggtattga ttgacgaatg ctcaaaaata atctcattaa 10680cggcataccg tttgtcttgt ttggtattga ttgacgaatg ctcaaaaata atctcattaa 10680
tgcttagcgc agtctctcta tcgcttctga accccggtgc acctgtgccg aaacgcaaat 10740tgcttagcgc agtctctcta tcgcttctga accccggtgc acctgtgccg aaacgcaaat 10740
ggggaaacac ccgctttttg gatgattatg cattgtctcc acattgtatg cttccaagat 10800ggggaaacac ccgctttttg gatgattatg cattgtctcc aattgtatg cttccaagat 10800
tctggtggga atactgctga tagcctaacg ttcatgatca aaatttaact gttctaaccc 10860tctggtggga atactgctga tagcctaacg ttcatgatca aaatttaact gttctaaccc 10860
ctacttgaca gcaatatata aacagaagga agctgccctg tcttaaacct ttttttttat 10920ctacttgaca gcaatatata aacagaagga agctgccctg tcttaaacct ttttttttat 10920
catcattatt agcttacttt cataattgcg actggttcca attgacaagc ttttgatttt 10980catcattatt agcttacttt cataattgcg actggttcca attgacaagc ttttgatttt 10980
aacgactttt aacgacaact tgagaagatc aaaaaacaac taattattcg aaacggaatt 11040aacgactttt aacgacaact tgagaagatc aaaaaacaac taattattcg aaacggaatt 11040
gtgagcggat aacaaaggat gtccaattta ctgaccgtac accaaaattt gcctgcatta 11100gtgagcggat aacaaaggat gtccaattta ctgaccgtac accaaaattt gcctgcatta 11100
ccggtcgatg caacgagtga tgaggttcgc aagaacctga tggacatgtt cagggatcgc 11160ccggtcgatg caacgagtga tgaggttcgc aagaacctga tggacatgtt cagggatcgc 11160
caggcgtttt ctgagcatac ctggaaaatg cttctgtccg tttgccggtc gtgggcggca 11220caggcgtttt ctgagcatac ctggaaaatg cttctgtccg tttgccggtc gtgggcggca 11220
tggtgcaagt tgaataaccg gaaatggttt cccgcagaac ctgaagatgt tcgcgattat 11280tggtgcaagt tgaataaccg gaaatggttt cccgcagaac ctgaagatgt tcgcgattat 11280
cttctatatc ttcaggcgcg cggtctggca gtaaaaacta tccagcaaca tttgggccag 11340cttctatatc ttcaggcgcg cggtctggca gtaaaaacta tccagcaaca tttgggccag 11340
ctaaacatgc ttcatcgtcg gtccgggctg ccacgaccaa gtgacagcaa tgctgtttca 11400ctaaacatgc ttcatcgtcg gtccgggctg ccacgaccaa gtgacagcaa tgctgtttca 11400
ctggttatgc ggcgcatccg aaaagaaaac gttgatgccg gtgaacgtgc aaaacaggct 11460ctggttatgc ggcgcatccg aaaagaaaac gttgatgccg gtgaacgtgc aaaacaggct 11460
ctagcgttcg aacgcactga tttcgaccag gttcgttcac tcatggaaaa tagcgatcgc 11520ctagcgttcg aacgcactga tttcgaccag gttcgttcac tcatggaaaa tagcgatcgc 11520
tgccaggata tacgtaatct ggcatttctg gggattgctt ataacaccct gttacgtata 11580tgccaggata tacgtaatct ggcatttctg gggattgctt ataacaccct gttacgtata 11580
gccgaaattg ccaggatcag ggttaaagat atctcacgta ctgacggtgg gagaatgtta 11640gccgaaattg ccaggatcag ggttaaagat atctcacgta ctgacggtgg gagaatgtta 11640
atccatattg gcagaacgaa aacgctggtt agcaccgcag gtgtagagaa ggcacttagc 11700atccatattg gcagaacgaa aacgctggtt agcaccgcag gtgtagagaa ggcacttagc 11700
ctgggggtaa ctaaactggt cgagcgatgg atttccgtct ctggtgtagc tgatgatccg 11760ctgggggtaa ctaaactggt cgagcgatgg atttccgtct ctggtgtagc tgatgatccg 11760
aataactacc tgttttgccg ggtcagaaaa aatggtgttg ccgcgccatc tgccaccagc 11820aataactacc tgttttgccg ggtcagaaaa aatggtgttg ccgcgccatc tgccaccagc 11820
cagctatcaa ctcgcgccct ggaagggatt tttgaagcaa ctcatcgatt gatttacggc 11880cagctatcaa ctcgcgccct ggaagggatt tttgaagcaa ctcatcgatt gatttacggc 11880
gctaaggatg actctggtca gagatacctg gcctggtctg gacacagtgc ccgtgtcgga 11940gctaaggatg actctggtca gagatacctg gcctggtctg gacacagtgc ccgtgtcgga 11940
gccgcgcgag atatggcccg cgctggagtt tcaataccgg agatcatgca agctggtggc 12000gccgcgcgag atatggcccg cgctggagtt tcaataccgg agatcatgca agctggtggc 12000
tggaccaatg taaatattgt catgaactat atccgtaccc tggatagtga aacaggggca 12060tggaccaatg taaatattgt catgaactat atccgtaccc tggatagtga aacaggggca 12060
atggtgcgcc tgctggaaga tggcgattag cgaacaaaaa ctcatctcag aagaggatct 12120atggtgcgcc tgctggaaga tggcgattag cgaacaaaaa ctcatctcag aagaggatct 12120
gaatagcgcc gtcgaccatc atcatcatca tcattgagtt ttagccttag acatgactgt 12180gaatagcgcc gtcgaccatc atcatcatca tcattgagtt ttagccttag acatgactgt 12180
tcctcagttc aagttgggca cttacgagaa gaccggtctt gctagattct aatcaagagg 12240tcctcagttc aagttgggca ccttacgagaa gaccggtctt gctagattct aatcaagagg 12240
atgtcagaat gccatttgcc tgagagatgc aggcttcatt tttgatactt ttttatttgt 12300atgtcagaat gccatttgcc tgagagatgc aggcttcatt tttgatactt ttttatttgt 12300
aacctatata gtataggatt ttttttgtca ttttgtttct tctcgtacga gcttgctcct 12360aacctatata gtataggatt ttttttgtca ttttgtttct tctcgtacga gcttgctcct 12360
gatcagccta tctcgcagct gatgaatatc ttgtggtagg ggtttgggaa aatcattcga 12420gatcagccta tctcgcagct gatgaatatc ttgtggtagg ggtttgggaa aatcattcga 12420
gtttgatgtt tttcttggta tttcccactc ctcttcagag tacagaagat taagtgagac 12480gtttgatgtt tttcttggta tttcccactc ctcttcagag tacagaagat taagtgagac 12480
cttcgtttgt gcggaacccc cacacaccat agcttcaaaa tgtttctact ccttttttac 12540cttcgtttgt gcggaaccccc cacacaccat agcttcaaaa tgtttctact ccttttttac 12540
tcttccagat tttctcggac tccgcgcatc gccgtaccac ttcaaaacac ccaagcacag 12600tcttccagat tttctcggac tccgcgcatc gccgtaccac ttcaaaacac ccaagcacag 12600
catactaaat tttccctctt tcttcctcta gggtgtcgtt aattacccgt actaaaggtt 12660catactaaat tttccctctt tcttccctcta gggtgtcgtt aattacccgt actaaaggtt 12660
tggaaaagaa aaaagagacc gcctcgtttc tttttcttcg tcgaaaaagg caataaaaat 12720tggaaaagaaaaaagagacc gcctcgtttc tttttcttcg tcgaaaaagg caataaaaat 12720
ttttatcacg tttctttttc ttgaaatttt tttttttagt ttttttctct ttcagtgacc 12780ttttatcacg tttctttttc ttgaaatttt tttttttagt ttttttctct ttcagtgacc 12780
tccattgata tttaagttaa taaacggtct tcaatttctc aagtttcagt ttcatttttc 12840tccattgata tttaagttaa taaacggtct tcaatttctc aagtttcagt ttcatttttc 12840
ttgttctatt acaacttttt ttacttcttg ttcattagaa agaaagcata gcaatctaat 12900ttgttctatt acaacttttt ttacttcttg ttcattagaa agaaagcata gcaatctaat 12900
ctaaggggcg gtgttgacaa ttaatcatcg gcatagtata tcggcatagt ataatacgac 12960ctaaggggcg gtgttgacaa ttaatcatcg gcatagtata tcggcatagt ataatacgac 12960
aaggtgagga actaaaccat ggccaagttg accagtgccg ttccggtgct caccgcgcgc 13020aaggtgagga actaaaccat ggccaagttg accagtgccg ttccggtgct caccgcgcgc 13020
gacgtcgccg gagcggtcga gttctggacc gaccggctcg ggttctcccg ggacttcgtg 13080gacgtcgccg gagcggtcga gttctggacc gaccggctcg ggttctcccg ggacttcgtg 13080
gaggacgact tcgccggtgt ggtccgggac gacgtgaccc tgttcatcag cgcggtccag 13140gaggacgact tcgccggtgt ggtccgggac gacgtgaccc tgttcatcag cgcggtccag 13140
gaccaggtgg tgccggacaa caccctggcc tgggtgtggg tgcgcggcct ggacgagctg 13200gaccaggtgg tgccggacaa caccctggcc tgggtgtggg tgcgcggcct ggacgagctg 13200
tacgccgagt ggtcggaggt cgtgtccacg aacttccggg acgcctccgg gccggccatg 13260tacgccgagt ggtcggaggt cgtgtccacg aacttccggg acgcctccgg gccggccatg 13260
accgagatcg gcgagcagcc gtgggggcgg gagttcgccc tgcgcgaccc ggccggcaac 13320accgagatcg gcgagcagcc gtgggggcgg gagttcgccc tgcgcgaccc ggccggcaac 13320
tgcgtgcact tcgtggccga ggagcaggac tgacacgtcc gacggcggcc cacgggtccc 13380tgcgtgcact tcgtggccga ggagcaggac tgacacgtcc gacggcggcc cacgggtccc 13380
aggcctcgga gatccgtccc ccttttcctt tgtcgatatc atgtaattag ttatgtcacg 13440aggcctcgga gatccgtccc ccttttcctt tgtcgatatc atgtaattag ttatgtcacg 13440
cttacattca cgccctcccc ccacatccgc tctaaccgaa aaggaaggag ttagacaacc 13500cttacattca cgccctcccc ccacatccgc tctaaccgaa aaggaaggag ttagacaacc 13500
tgaagtctag gtccctattt atttttttat agttatgtta gtattaagaa cgttatttat 13560tgaagtctag gtccctattt atttttttat agttatgtta gttattaagaa cgttatttt 13560
atttcaaatt tttctttttt ttctgtacag ataacttcgt atagcataca ttatacgaac 13620atttcaaatt tttcttttttttctgtacag ataacttcgt atagcataca ttatacgaac 13620
ggtaacgcgt gtacgcatgt aacattatac tgaaaacctt gcttgagaag gttttgggac 13680ggtaacgcgt gtacgcatgt aacattatac tgaaaacctt gcttgagaag gttttgggac 13680
gctcgaaggc tttaatttgc aagctggaga ccaacatgtg agcaaaaggc cagcaaaagg 13740gctcgaaggc tttaatttgc aagctggaga ccaacatgtg agcaaaaggc cagcaaaagg 13740
ccaggaaccg taaaaaggcc gcgttgctgg cgtt 13774ccaggaaccg taaaaaggcc gcgttgctgg cgtt 13774
Claims (8)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202110654277.6A CN113502297B (en) | 2021-06-11 | 2021-06-11 | Recombinant pichia pastoris for synthesizing guanosine diphosphate fucose, construction method and application thereof |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202110654277.6A CN113502297B (en) | 2021-06-11 | 2021-06-11 | Recombinant pichia pastoris for synthesizing guanosine diphosphate fucose, construction method and application thereof |
Publications (2)
Publication Number | Publication Date |
---|---|
CN113502297A CN113502297A (en) | 2021-10-15 |
CN113502297B true CN113502297B (en) | 2023-08-18 |
Family
ID=78009907
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202110654277.6A Active CN113502297B (en) | 2021-06-11 | 2021-06-11 | Recombinant pichia pastoris for synthesizing guanosine diphosphate fucose, construction method and application thereof |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN113502297B (en) |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2012191905A (en) * | 2011-03-17 | 2012-10-11 | National Institute Of Advanced Industrial Science & Technology | Synthesis method of sugar nucleotide using yeast |
CN107699535A (en) * | 2017-11-08 | 2018-02-16 | 光明乳业股份有限公司 | A kind of recombined bacillus subtilis for inducing synthesis guanosine diphosphate fucose and its construction method and application |
CN109749976A (en) * | 2019-01-30 | 2019-05-14 | 光明乳业股份有限公司 | A kind of recombined bacillus subtilis efficiently synthesizing guanosine diphosphate fucose and its construction method and application |
CN111471605A (en) * | 2020-03-17 | 2020-07-31 | 山东大学 | Saccharomyces cerevisiae engineering strain for high yield of fucosyllactose and application thereof |
CN111471606A (en) * | 2020-03-17 | 2020-07-31 | 山东大学 | An optimized strain of Saccharomyces cerevisiae with high production of fucosyllactose and its application |
Family Cites Families (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20150093782A1 (en) * | 2013-10-01 | 2015-04-02 | The University Of Wyoming | Compositions and methods for reducing fucosylation of glycoproteins in insect cells and methods of use thereof for production of recombinant glycoproteins |
-
2021
- 2021-06-11 CN CN202110654277.6A patent/CN113502297B/en active Active
Patent Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2012191905A (en) * | 2011-03-17 | 2012-10-11 | National Institute Of Advanced Industrial Science & Technology | Synthesis method of sugar nucleotide using yeast |
CN107699535A (en) * | 2017-11-08 | 2018-02-16 | 光明乳业股份有限公司 | A kind of recombined bacillus subtilis for inducing synthesis guanosine diphosphate fucose and its construction method and application |
CN109749976A (en) * | 2019-01-30 | 2019-05-14 | 光明乳业股份有限公司 | A kind of recombined bacillus subtilis efficiently synthesizing guanosine diphosphate fucose and its construction method and application |
CN111471605A (en) * | 2020-03-17 | 2020-07-31 | 山东大学 | Saccharomyces cerevisiae engineering strain for high yield of fucosyllactose and application thereof |
CN111471606A (en) * | 2020-03-17 | 2020-07-31 | 山东大学 | An optimized strain of Saccharomyces cerevisiae with high production of fucosyllactose and its application |
Non-Patent Citations (1)
Title |
---|
发酵法生产L-岩藻糖的研究进展;马巍等;《食品与发酵工业》;20210114;第47卷(第16期);第308-312页 * |
Also Published As
Publication number | Publication date |
---|---|
CN113502297A (en) | 2021-10-15 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN1703507B (en) | Sesquiterpene synthases and methods of use thereof | |
CN110484572B (en) | Method for increasing yield of saccharomyces cerevisiae nerolidol | |
CN110819650A (en) | An engineering strain for producing β-elemene and its application | |
CN108138167A (en) | The manufacturing method of linalool | |
Wang et al. | Homologous overexpression of genes in Cordyceps militaris improves the production of polysaccharides | |
AU749114C (en) | Fatty acid elongases | |
CN110511272A (en) | A kind of maize ZmbHLH55 transcription factor and its application | |
CN108138168A (en) | Linalool composition and its manufacturing method | |
CN113502297B (en) | Recombinant pichia pastoris for synthesizing guanosine diphosphate fucose, construction method and application thereof | |
CN109097378B (en) | A kind of isoprene synthase and its encoding gene, expression vector, engineering bacteria and method and application of producing isoprene | |
CN102676563B (en) | Method for preparing human serum albumin-human parathyroid hormone | |
CN110804561A (en) | Saccharomyces cerevisiae with high yield of C6-C10 ethyl ester and construction method and application thereof | |
CN110042068B (en) | Method for improving enterokinase secretion expression quantity by modifying signal peptide | |
CN110582568A (en) | Sesquiterpene synthases for the production of butanol and mixtures thereof | |
CN112852859B (en) | Method for improving synthesis capacity of filamentous fungi organic acid | |
CN113881579A (en) | Method for synthesizing and improving antibacterial activity of long-chain antimicrobial peptides from Trichoderma | |
CN111363711B (en) | Method for producing lysine by adsorption immobilized fermentation of recombinant corynebacterium glutamicum | |
CN103649318B (en) | Method for accelerating plant growth and increasing yield by modifying expression levels of kinases and phosphatases | |
BR112016025905B1 (en) | DRIMENOL SYNTASES II | |
EP3802783A1 (en) | Microorganisms and the production of fine chemicals | |
CN117511835A (en) | Recombinant pseudomonas aeruginosa and construction method and application thereof | |
CN102086455A (en) | Flocculation gene of flocculating yeast and expression product and application thereof | |
CN113412329B (en) | Phloroglucinol resistant cells, particularly yeasts | |
CN111269930A (en) | Method for detecting genetic stability of filamentous fungus transformation system | |
CN111548946B (en) | A kind of recombinant yeast engineering bacteria producing tanshinone diene |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |