CN111440814A - Insect-resistant fusion gene mCry1AbVip3A, expression vector and application thereof
- Publication number
- CN111440814A CN202010294835.8A
- Authority
- CN
- China
- Prior art keywords
- leu
- ser
- asn
- ile
- thr
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Concepts
- 108090000623 proteins and genes Proteins 0.000 title claims abstract description 64
- 230000004927 fusion Effects 0.000 title claims abstract description 26
- 241000238631 Hexapoda Species 0.000 title claims abstract description 23
- 239000013604 expression vector Substances 0.000 title description 11
- 102000004169 proteins and genes Human genes 0.000 claims abstract description 23
- 241000196324 Embryophyta Species 0.000 claims abstract description 17
- 239000002773 nucleotide Substances 0.000 claims abstract description 15
- 125000003729 nucleotide group Chemical group 0.000 claims abstract description 15
- 230000009261 transgenic effect Effects 0.000 claims abstract description 13
- FWMNVWWHGCHHJJ-SKKKGAJSSA-N 4-amino-1-[(2r)-6-amino-2-[[(2r)-2-[[(2r)-2-[[(2r)-2-amino-3-phenylpropanoyl]amino]-3-phenylpropanoyl]amino]-4-methylpentanoyl]amino]hexanoyl]piperidine-4-carboxylic acid Chemical compound C([C@H](C(=O)N[C@H](CC(C)C)C(=O)N[C@H](CCCCN)C(=O)N1CCC(N)(CC1)C(O)=O)NC(=O)[C@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 FWMNVWWHGCHHJJ-SKKKGAJSSA-N 0.000 claims abstract description 5
- 108091023040 Transcription factor Proteins 0.000 claims abstract description 3
- 102000040945 Transcription factor Human genes 0.000 claims abstract description 3
- 238000009396 hybridization Methods 0.000 claims abstract description 3
- 240000008042 Zea mays Species 0.000 claims description 11
- 235000002017 Zea mays subsp mays Nutrition 0.000 claims description 11
- 239000013598 vector Substances 0.000 claims description 7
- 150000001413 amino acids Chemical class 0.000 claims description 5
- 229920000742 Cotton Polymers 0.000 claims description 4
- 240000007594 Oryza sativa Species 0.000 claims description 4
- 235000007164 Oryza sativa Nutrition 0.000 claims description 4
- 235000009566 rice Nutrition 0.000 claims description 4
- 235000005824 Zea mays ssp. parviglumis Nutrition 0.000 claims description 3
- 235000005822 corn Nutrition 0.000 claims description 3
- 235000013399 edible fruits Nutrition 0.000 claims description 3
- 235000013311 vegetables Nutrition 0.000 claims description 3
- 241000219146 Gossypium Species 0.000 claims description 2
- 244000061456 Solanum tuberosum Species 0.000 claims description 2
- 235000002595 Solanum tuberosum Nutrition 0.000 claims description 2
- 230000000295 complement effect Effects 0.000 claims description 2
- 125000003275 alpha amino acid group Chemical group 0.000 claims 1
- 230000000749 insecticidal effect Effects 0.000 abstract description 15
- 241000346285 Ostrinia furnacalis Species 0.000 abstract description 14
- 238000000338 in vitro Methods 0.000 abstract description 2
- 241000209510 Liliopsida Species 0.000 abstract 1
- 241000256250 Spodoptera littoralis Species 0.000 abstract 1
- 231100000820 toxicity test Toxicity 0.000 abstract 1
- 241000256251 Spodoptera frugiperda Species 0.000 description 11
- 239000012634 fragment Substances 0.000 description 9
- 235000016383 Zea mays subsp huehuetenangensis Nutrition 0.000 description 8
- 210000004027 cell Anatomy 0.000 description 8
- 235000009973 maize Nutrition 0.000 description 8
- 238000012360 testing method Methods 0.000 description 7
- 230000001988 toxicity Effects 0.000 description 7
- 231100000419 toxicity Toxicity 0.000 description 7
- 108020001507 fusion proteins Proteins 0.000 description 6
- 102000037865 fusion proteins Human genes 0.000 description 6
- 108091008146 restriction endonucleases Proteins 0.000 description 6
- 108020004414 DNA Proteins 0.000 description 5
- 241000607479 Yersinia pestis Species 0.000 description 5
- 210000002777 columnar cell Anatomy 0.000 description 5
- 238000001514 detection method Methods 0.000 description 5
- 230000014509 gene expression Effects 0.000 description 5
- 239000007788 liquid Substances 0.000 description 5
- 108010061238 threonyl-glycine Proteins 0.000 description 5
- 239000003053 toxin Substances 0.000 description 5
- 231100000765 toxin Toxicity 0.000 description 5
- 108700012359 toxins Proteins 0.000 description 5
- 108020004705 Codon Proteins 0.000 description 4
- 230000002195 synergetic effect Effects 0.000 description 4
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Chemical compound O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 4
- 241000589158 Agrobacterium Species 0.000 description 3
- 241000880493 Leptailurus serval Species 0.000 description 3
- 108091028043 Nucleic acid sequence Proteins 0.000 description 3
- 108010005233 alanylglutamic acid Proteins 0.000 description 3
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 3
- 230000015572 biosynthetic process Effects 0.000 description 3
- 210000002175 goblet cell Anatomy 0.000 description 3
- PCHJSUWPFVWCPO-UHFFFAOYSA-N gold Chemical compound [Au] PCHJSUWPFVWCPO-UHFFFAOYSA-N 0.000 description 3
- 230000000968 intestinal effect Effects 0.000 description 3
- 238000000034 method Methods 0.000 description 3
- 239000000047 product Substances 0.000 description 3
- 230000009465 prokaryotic expression Effects 0.000 description 3
- 238000011084 recovery Methods 0.000 description 3
- 239000006228 supernatant Substances 0.000 description 3
- 230000001018 virulence Effects 0.000 description 3
- KXOPYFNQLVUOAQ-FXQIFTODSA-N Arg-Ser-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O KXOPYFNQLVUOAQ-FXQIFTODSA-N 0.000 description 2
- 241000672609 Escherichia coli BL21 Species 0.000 description 2
- MGJMFSBEMSNYJL-AVGNSLFASA-N Gln-Asn-Tyr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MGJMFSBEMSNYJL-AVGNSLFASA-N 0.000 description 2
- XKBASPWPBXNVLQ-WDSKDSINSA-N Gln-Gly-Asn Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O XKBASPWPBXNVLQ-WDSKDSINSA-N 0.000 description 2
- LKDIBBOKUAASNP-FXQIFTODSA-N Glu-Ala-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O LKDIBBOKUAASNP-FXQIFTODSA-N 0.000 description 2
- NJCALAAIGREHDR-WDCWCFNPSA-N Glu-Leu-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NJCALAAIGREHDR-WDCWCFNPSA-N 0.000 description 2
- GGAPHLIUUTVYMX-QWRGUYRKSA-N Gly-Phe-Ser Chemical compound OC[C@@H](C([O-])=O)NC(=O)[C@@H](NC(=O)C[NH3+])CC1=CC=CC=C1 GGAPHLIUUTVYMX-QWRGUYRKSA-N 0.000 description 2
- WNZOCXUOGVYYBJ-CDMKHQONSA-N Gly-Phe-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)CN)O WNZOCXUOGVYYBJ-CDMKHQONSA-N 0.000 description 2
- YDIDLLVFCYSXNY-RCOVLWMOSA-N Gly-Val-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)CN YDIDLLVFCYSXNY-RCOVLWMOSA-N 0.000 description 2
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 2
- IVXJIMGDOYRLQU-XUXIUFHCSA-N Ile-Pro-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O IVXJIMGDOYRLQU-XUXIUFHCSA-N 0.000 description 2
- JODPUDMBQBIWCK-GHCJXIJMSA-N Ile-Ser-Asn Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O JODPUDMBQBIWCK-GHCJXIJMSA-N 0.000 description 2
- QGXQHJQPAPMACW-PPCPHDFISA-N Ile-Thr-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)O)N QGXQHJQPAPMACW-PPCPHDFISA-N 0.000 description 2
- ILJREDZFPHTUIE-GUBZILKMSA-N Leu-Asp-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ILJREDZFPHTUIE-GUBZILKMSA-N 0.000 description 2
- USLNHQZCDQJBOV-ZPFDUUQYSA-N Leu-Ile-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O USLNHQZCDQJBOV-ZPFDUUQYSA-N 0.000 description 2
- IRMLZWSRWSGTOP-CIUDSAMLSA-N Leu-Ser-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O IRMLZWSRWSGTOP-CIUDSAMLSA-N 0.000 description 2
- KZZCOWMDDXDKSS-CIUDSAMLSA-N Leu-Ser-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O KZZCOWMDDXDKSS-CIUDSAMLSA-N 0.000 description 2
- ZDJQVSIPFLMNOX-RHYQMDGZSA-N Leu-Thr-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N ZDJQVSIPFLMNOX-RHYQMDGZSA-N 0.000 description 2
- YFGWNAROEYWGNL-GUBZILKMSA-N Lys-Gln-Asn Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O YFGWNAROEYWGNL-GUBZILKMSA-N 0.000 description 2
- AIRZWUMAHCDDHR-KKUMJFAQSA-N Lys-Leu-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O AIRZWUMAHCDDHR-KKUMJFAQSA-N 0.000 description 2
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 2
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 2
- 101150014068 PPIP5K1 gene Proteins 0.000 description 2
- 108091005804 Peptidases Proteins 0.000 description 2
- RVEVENLSADZUMS-IHRRRGAJSA-N Phe-Pro-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O RVEVENLSADZUMS-IHRRRGAJSA-N 0.000 description 2
- YMIZSYUAZJSOFL-SRVKXCTJSA-N Phe-Ser-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O YMIZSYUAZJSOFL-SRVKXCTJSA-N 0.000 description 2
- 239000004365 Protease Substances 0.000 description 2
- SFTZWNJFZYOLBD-ZDLURKLDSA-N Ser-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CO SFTZWNJFZYOLBD-ZDLURKLDSA-N 0.000 description 2
- YUJLIIRMIAGMCQ-CIUDSAMLSA-N Ser-Leu-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YUJLIIRMIAGMCQ-CIUDSAMLSA-N 0.000 description 2
- JAWGSPUJAXYXJA-IHRRRGAJSA-N Ser-Phe-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](CO)N)CC1=CC=CC=C1 JAWGSPUJAXYXJA-IHRRRGAJSA-N 0.000 description 2
- 241000256247 Spodoptera exigua Species 0.000 description 2
- LXWZOMSOUAMOIA-JIOCBJNQSA-N Thr-Asn-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N)O LXWZOMSOUAMOIA-JIOCBJNQSA-N 0.000 description 2
- VRUFCJZQDACGLH-UVOCVTCTSA-N Thr-Leu-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VRUFCJZQDACGLH-UVOCVTCTSA-N 0.000 description 2
- IJVNLNRVDUTWDD-MEYUZBJRSA-N Thr-Leu-Tyr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O IJVNLNRVDUTWDD-MEYUZBJRSA-N 0.000 description 2
- SGFIXFAHVWJKTD-KJEVXHAQSA-N Tyr-Arg-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SGFIXFAHVWJKTD-KJEVXHAQSA-N 0.000 description 2
- LIQJSDDOULTANC-QSFUFRPTSA-N Val-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](C(C)C)N LIQJSDDOULTANC-QSFUFRPTSA-N 0.000 description 2
- 241000700605 Viruses Species 0.000 description 2
- 230000009471 action Effects 0.000 description 2
- 238000004458 analytical method Methods 0.000 description 2
- 108010013835 arginine glutamate Proteins 0.000 description 2
- 108010038633 aspartylglutamate Proteins 0.000 description 2
- 230000001580 bacterial effect Effects 0.000 description 2
- 230000005779 cell damage Effects 0.000 description 2
- 210000000170 cell membrane Anatomy 0.000 description 2
- 238000003776 cleavage reaction Methods 0.000 description 2
- 238000010367 cloning Methods 0.000 description 2
- 239000012153 distilled water Substances 0.000 description 2
- 230000000694 effects Effects 0.000 description 2
- 239000002158 endotoxin Substances 0.000 description 2
- 238000002474 experimental method Methods 0.000 description 2
- 239000013613 expression plasmid Substances 0.000 description 2
- 238000000855 fermentation Methods 0.000 description 2
- 230000004151 fermentation Effects 0.000 description 2
- 230000006870 function Effects 0.000 description 2
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 2
- 108010010147 glycylglutamine Proteins 0.000 description 2
- 108010050848 glycylleucine Proteins 0.000 description 2
- 230000006872 improvement Effects 0.000 description 2
- 210000003000 inclusion body Anatomy 0.000 description 2
- 108010044374 isoleucyl-tyrosine Proteins 0.000 description 2
- BPHPUYQFMNQIOC-NXRLNHOXSA-N isopropyl beta-D-thiogalactopyranoside Chemical compound CC(C)S[C@@H]1O[C@H](CO)[C@H](O)[C@H](O)[C@H]1O BPHPUYQFMNQIOC-NXRLNHOXSA-N 0.000 description 2
- 229930027917 kanamycin Natural products 0.000 description 2
- SBUJHOSQTJFQJX-NOAMYHISSA-N kanamycin Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CN)O[C@@H]1O[C@H]1[C@H](O)[C@@H](O[C@@H]2[C@@H]([C@@H](N)[C@H](O)[C@@H](CO)O2)O)[C@H](N)C[C@@H]1N SBUJHOSQTJFQJX-NOAMYHISSA-N 0.000 description 2
- 229960000318 kanamycin Drugs 0.000 description 2
- 229930182823 kanamycin A Natural products 0.000 description 2
- 108010034529 leucyl-lysine Proteins 0.000 description 2
- 108010003700 lysyl aspartic acid Proteins 0.000 description 2
- 230000007246 mechanism Effects 0.000 description 2
- 230000001404 mediated effect Effects 0.000 description 2
- 239000000203 mixture Substances 0.000 description 2
- 238000012986 modification Methods 0.000 description 2
- 230000004048 modification Effects 0.000 description 2
- 239000000575 pesticide Substances 0.000 description 2
- 108010031719 prolyl-serine Proteins 0.000 description 2
- 108010015796 prolylisoleucine Proteins 0.000 description 2
- 230000007017 scission Effects 0.000 description 2
- 239000000126 substance Substances 0.000 description 2
- 238000003786 synthesis reaction Methods 0.000 description 2
- 210000001519 tissue Anatomy 0.000 description 2
- 108010038745 tryptophylglycine Proteins 0.000 description 2
- 108010073969 valyllysine Proteins 0.000 description 2
- QKNYBSVHEMOAJP-UHFFFAOYSA-N 2-amino-2-(hydroxymethyl)propane-1,3-diol;hydron;chloride Chemical compound Cl.OCC(N)(CO)CO QKNYBSVHEMOAJP-UHFFFAOYSA-N 0.000 description 1
- 241000566547 Agrotis ipsilon Species 0.000 description 1
- JBVSSSZFNTXJDX-YTLHQDLWSA-N Ala-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@H](C)N JBVSSSZFNTXJDX-YTLHQDLWSA-N 0.000 description 1
- LBJYAILUMSUTAM-ZLUOBGJFSA-N Ala-Asn-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O LBJYAILUMSUTAM-ZLUOBGJFSA-N 0.000 description 1
- PXKLCFFSVLKOJM-ACZMJKKPSA-N Ala-Asn-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O PXKLCFFSVLKOJM-ACZMJKKPSA-N 0.000 description 1
- WDIYWDJLXOCGRW-ACZMJKKPSA-N Ala-Asp-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WDIYWDJLXOCGRW-ACZMJKKPSA-N 0.000 description 1
- YSMPVONNIWLJML-FXQIFTODSA-N Ala-Asp-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(O)=O YSMPVONNIWLJML-FXQIFTODSA-N 0.000 description 1
- IFTVANMRTIHKML-WDSKDSINSA-N Ala-Gln-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O IFTVANMRTIHKML-WDSKDSINSA-N 0.000 description 1
- YHKANGMVQWRMAP-DCAQKATOSA-N Ala-Leu-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YHKANGMVQWRMAP-DCAQKATOSA-N 0.000 description 1
- SOBIAADAMRHGKH-CIUDSAMLSA-N Ala-Leu-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O SOBIAADAMRHGKH-CIUDSAMLSA-N 0.000 description 1
- SDZRIBWEVVRDQI-CIUDSAMLSA-N Ala-Lys-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O SDZRIBWEVVRDQI-CIUDSAMLSA-N 0.000 description 1
- MDNAVFBZPROEHO-DCAQKATOSA-N Ala-Lys-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O MDNAVFBZPROEHO-DCAQKATOSA-N 0.000 description 1
- MDNAVFBZPROEHO-UHFFFAOYSA-N Ala-Lys-Val Natural products CC(C)C(C(O)=O)NC(=O)C(NC(=O)C(C)N)CCCCN MDNAVFBZPROEHO-UHFFFAOYSA-N 0.000 description 1
- ZBLQIYPCUWZSRZ-QEJZJMRPSA-N Ala-Phe-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CC1=CC=CC=C1 ZBLQIYPCUWZSRZ-QEJZJMRPSA-N 0.000 description 1
- LSMDIAAALJJLRO-XQXXSGGOSA-N Ala-Thr-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O LSMDIAAALJJLRO-XQXXSGGOSA-N 0.000 description 1
- DBKNLHKEVPZVQC-LPEHRKFASA-N Arg-Ala-Pro Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@@H]1C(O)=O DBKNLHKEVPZVQC-LPEHRKFASA-N 0.000 description 1
- NABSCJGZKWSNHX-RCWTZXSCSA-N Arg-Arg-Thr Chemical compound NC(N)=NCCC[C@@H](C(=O)N[C@@H]([C@H](O)C)C(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N NABSCJGZKWSNHX-RCWTZXSCSA-N 0.000 description 1
- CPSHGRGUPZBMOK-CIUDSAMLSA-N Arg-Asn-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O CPSHGRGUPZBMOK-CIUDSAMLSA-N 0.000 description 1
- JUWQNWXEGDYCIE-YUMQZZPRSA-N Arg-Gln-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O JUWQNWXEGDYCIE-YUMQZZPRSA-N 0.000 description 1
- NKBQZKVMKJJDLX-SRVKXCTJSA-N Arg-Glu-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NKBQZKVMKJJDLX-SRVKXCTJSA-N 0.000 description 1
- AUFHLLPVPSMEOG-YUMQZZPRSA-N Arg-Gly-Glu Chemical compound NC(N)=NCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O AUFHLLPVPSMEOG-YUMQZZPRSA-N 0.000 description 1
- AGVNTAUPLWIQEN-ZPFDUUQYSA-N Arg-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N AGVNTAUPLWIQEN-ZPFDUUQYSA-N 0.000 description 1
- LLUGJARLJCGLAR-CYDGBPFRSA-N Arg-Ile-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N LLUGJARLJCGLAR-CYDGBPFRSA-N 0.000 description 1
- COXMUHNBYCVVRG-DCAQKATOSA-N Arg-Leu-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O COXMUHNBYCVVRG-DCAQKATOSA-N 0.000 description 1
- YFHATWYGAAXQCF-JYJNAYRXSA-N Arg-Pro-Phe Chemical compound NC(N)=NCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 YFHATWYGAAXQCF-JYJNAYRXSA-N 0.000 description 1
- ICRHGPYYXMWHIE-LPEHRKFASA-N Arg-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O ICRHGPYYXMWHIE-LPEHRKFASA-N 0.000 description 1
- OGZBJJLRKQZRHL-KJEVXHAQSA-N Arg-Thr-Tyr Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 OGZBJJLRKQZRHL-KJEVXHAQSA-N 0.000 description 1
- XRNXPIGJPQHCPC-RCWTZXSCSA-N Arg-Thr-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CCCNC(N)=N)[C@@H](C)O)C(O)=O XRNXPIGJPQHCPC-RCWTZXSCSA-N 0.000 description 1
- VJIQPOJMISSUPO-BVSLBCMMSA-N Arg-Trp-Tyr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O VJIQPOJMISSUPO-BVSLBCMMSA-N 0.000 description 1
- QMQZYILAWUOLPV-JYJNAYRXSA-N Arg-Tyr-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)CC1=CC=C(O)C=C1 QMQZYILAWUOLPV-JYJNAYRXSA-N 0.000 description 1
- BWMMKQPATDUYKB-IHRRRGAJSA-N Arg-Tyr-Asn Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(N)=O)C(O)=O)CC1=CC=C(O)C=C1 BWMMKQPATDUYKB-IHRRRGAJSA-N 0.000 description 1
- PJOPLXOCKACMLK-KKUMJFAQSA-N Arg-Tyr-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O PJOPLXOCKACMLK-KKUMJFAQSA-N 0.000 description 1
- WHLDJYNHXOMGMU-JYJNAYRXSA-N Arg-Val-Tyr Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 WHLDJYNHXOMGMU-JYJNAYRXSA-N 0.000 description 1
- ANAHQDPQQBDOBM-UHFFFAOYSA-N Arg-Val-Tyr Natural products CC(C)C(NC(=O)C(N)CCNC(=N)N)C(=O)NC(Cc1ccc(O)cc1)C(=O)O ANAHQDPQQBDOBM-UHFFFAOYSA-N 0.000 description 1
- YNDLOUMBVDVALC-ZLUOBGJFSA-N Asn-Ala-Ala Chemical compound C[C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](CC(=O)N)N YNDLOUMBVDVALC-ZLUOBGJFSA-N 0.000 description 1
- ACRYGQFHAQHDSF-ZLUOBGJFSA-N Asn-Asn-Asn Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ACRYGQFHAQHDSF-ZLUOBGJFSA-N 0.000 description 1
- DAPLJWATMAXPPZ-CIUDSAMLSA-N Asn-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC(N)=O DAPLJWATMAXPPZ-CIUDSAMLSA-N 0.000 description 1
- VKCOHFFSTKCXEQ-OLHMAJIHSA-N Asn-Asn-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VKCOHFFSTKCXEQ-OLHMAJIHSA-N 0.000 description 1
- JRVABKHPWDRUJF-UBHSHLNASA-N Asn-Asn-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)N)N JRVABKHPWDRUJF-UBHSHLNASA-N 0.000 description 1
- XVVOVPFMILMHPX-ZLUOBGJFSA-N Asn-Asp-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O XVVOVPFMILMHPX-ZLUOBGJFSA-N 0.000 description 1
- BGINHSZTXRJIPP-FXQIFTODSA-N Asn-Asp-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)N)N BGINHSZTXRJIPP-FXQIFTODSA-N 0.000 description 1
- PQAIOUVVZCOLJK-FXQIFTODSA-N Asn-Gln-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N PQAIOUVVZCOLJK-FXQIFTODSA-N 0.000 description 1
- IICZCLFBILYRCU-WHFBIAKZSA-N Asn-Gly-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O IICZCLFBILYRCU-WHFBIAKZSA-N 0.000 description 1
- JQSWHKKUZMTOIH-QWRGUYRKSA-N Asn-Gly-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N JQSWHKKUZMTOIH-QWRGUYRKSA-N 0.000 description 1
- FTCGGKNCJZOPNB-WHFBIAKZSA-N Asn-Gly-Ser Chemical compound NC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O FTCGGKNCJZOPNB-WHFBIAKZSA-N 0.000 description 1
- NVWJMQNYLYWVNQ-BYULHYEWSA-N Asn-Ile-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O NVWJMQNYLYWVNQ-BYULHYEWSA-N 0.000 description 1
- SEKBHZJLARBNPB-GHCJXIJMSA-N Asn-Ile-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O SEKBHZJLARBNPB-GHCJXIJMSA-N 0.000 description 1
- SPCONPVIDFMDJI-QSFUFRPTSA-N Asn-Ile-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O SPCONPVIDFMDJI-QSFUFRPTSA-N 0.000 description 1
- ZMUQQMGITUJQTI-CIUDSAMLSA-N Asn-Leu-Asn Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O ZMUQQMGITUJQTI-CIUDSAMLSA-N 0.000 description 1
- WIDVAWAQBRAKTI-YUMQZZPRSA-N Asn-Leu-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O WIDVAWAQBRAKTI-YUMQZZPRSA-N 0.000 description 1
- TZFQICWZWFNIKU-KKUMJFAQSA-N Asn-Leu-Tyr Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 TZFQICWZWFNIKU-KKUMJFAQSA-N 0.000 description 1
- FBODFHMLALOPHP-GUBZILKMSA-N Asn-Lys-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O FBODFHMLALOPHP-GUBZILKMSA-N 0.000 description 1
- KSGAFDTYQPKUAP-GMOBBJLQSA-N Asn-Met-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KSGAFDTYQPKUAP-GMOBBJLQSA-N 0.000 description 1
- JTXVXGXTRXMOFJ-FXQIFTODSA-N Asn-Pro-Asn Chemical compound NC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O JTXVXGXTRXMOFJ-FXQIFTODSA-N 0.000 description 1
- XTMZYFMTYJNABC-ZLUOBGJFSA-N Asn-Ser-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(=O)N)N XTMZYFMTYJNABC-ZLUOBGJFSA-N 0.000 description 1
- OOXUBGLNDRGOKT-FXQIFTODSA-N Asn-Ser-Arg Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O OOXUBGLNDRGOKT-FXQIFTODSA-N 0.000 description 1
- NPZJLGMWMDNQDD-GHCJXIJMSA-N Asn-Ser-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NPZJLGMWMDNQDD-GHCJXIJMSA-N 0.000 description 1
- SNYCNNPOFYBCEK-ZLUOBGJFSA-N Asn-Ser-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O SNYCNNPOFYBCEK-ZLUOBGJFSA-N 0.000 description 1
- FMNBYVSGRCXWEK-FOHZUACHSA-N Asn-Thr-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O FMNBYVSGRCXWEK-FOHZUACHSA-N 0.000 description 1
- XEGZSHSPQNDNRH-JRQIVUDYSA-N Asn-Tyr-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XEGZSHSPQNDNRH-JRQIVUDYSA-N 0.000 description 1
- MJIJBEYEHBKTIM-BYULHYEWSA-N Asn-Val-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N MJIJBEYEHBKTIM-BYULHYEWSA-N 0.000 description 1
- SLHOOKXYTYAJGQ-XVYDVKMFSA-N Asp-Ala-His Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CNC=N1 SLHOOKXYTYAJGQ-XVYDVKMFSA-N 0.000 description 1
- NECWUSYTYSIFNC-DLOVCJGASA-N Asp-Ala-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 NECWUSYTYSIFNC-DLOVCJGASA-N 0.000 description 1
- UGKZHCBLMLSANF-CIUDSAMLSA-N Asp-Asn-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O UGKZHCBLMLSANF-CIUDSAMLSA-N 0.000 description 1
- DXQOQMCLWWADMU-ACZMJKKPSA-N Asp-Gln-Ser Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O DXQOQMCLWWADMU-ACZMJKKPSA-N 0.000 description 1
- XJQRWGXKUSDEFI-ACZMJKKPSA-N Asp-Glu-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O XJQRWGXKUSDEFI-ACZMJKKPSA-N 0.000 description 1
- XAJRHVUUVUPFQL-ACZMJKKPSA-N Asp-Glu-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O XAJRHVUUVUPFQL-ACZMJKKPSA-N 0.000 description 1
- LDGUZSIPGSPBJP-XVYDVKMFSA-N Asp-His-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC(=O)O)N LDGUZSIPGSPBJP-XVYDVKMFSA-N 0.000 description 1
- NHSDEZURHWEZPN-SXTJYALSSA-N Asp-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)NC(=O)[C@H](CC(=O)O)N NHSDEZURHWEZPN-SXTJYALSSA-N 0.000 description 1
- HOBNTSHITVVNBN-ZPFDUUQYSA-N Asp-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CC(=O)O)N HOBNTSHITVVNBN-ZPFDUUQYSA-N 0.000 description 1
- LDLZOAJRXXBVGF-GMOBBJLQSA-N Asp-Ile-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC(=O)O)N LDLZOAJRXXBVGF-GMOBBJLQSA-N 0.000 description 1
- CLUMZOKVGUWUFD-CIUDSAMLSA-N Asp-Leu-Asn Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O CLUMZOKVGUWUFD-CIUDSAMLSA-N 0.000 description 1
- RQHLMGCXCZUOGT-ZPFDUUQYSA-N Asp-Leu-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O RQHLMGCXCZUOGT-ZPFDUUQYSA-N 0.000 description 1
- ORRJQLIATJDMQM-HJGDQZAQSA-N Asp-Leu-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O ORRJQLIATJDMQM-HJGDQZAQSA-N 0.000 description 1
- CTWCFPWFIGRAEP-CIUDSAMLSA-N Asp-Lys-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O CTWCFPWFIGRAEP-CIUDSAMLSA-N 0.000 description 1
- GKWFMNNNYZHJHV-SRVKXCTJSA-N Asp-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC(O)=O GKWFMNNNYZHJHV-SRVKXCTJSA-N 0.000 description 1
- USNJAPJZSGTTPX-XVSYOHENSA-N Asp-Phe-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O USNJAPJZSGTTPX-XVSYOHENSA-N 0.000 description 1
- DRCOAZZDQRCGGP-GHCJXIJMSA-N Asp-Ser-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O DRCOAZZDQRCGGP-GHCJXIJMSA-N 0.000 description 1
- ZQFRDAZBTSFGGW-SRVKXCTJSA-N Asp-Ser-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ZQFRDAZBTSFGGW-SRVKXCTJSA-N 0.000 description 1
- UTLCRGFJFSZWAW-OLHMAJIHSA-N Asp-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O UTLCRGFJFSZWAW-OLHMAJIHSA-N 0.000 description 1
- KACWACLNYLSVCA-VHWLVUOQSA-N Asp-Trp-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KACWACLNYLSVCA-VHWLVUOQSA-N 0.000 description 1
- PLOKOIJSGCISHE-BYULHYEWSA-N Asp-Val-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O PLOKOIJSGCISHE-BYULHYEWSA-N 0.000 description 1
- 241000193830 Bacillus <bacterium> Species 0.000 description 1
- 241000894006 Bacteria Species 0.000 description 1
- 241000254173 Coleoptera Species 0.000 description 1
- 101710151559 Crystal protein Proteins 0.000 description 1
- ODDOYXKAHLKKQY-MMWGEVLESA-N Cys-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CS)N ODDOYXKAHLKKQY-MMWGEVLESA-N 0.000 description 1
- 241000233866 Fungi Species 0.000 description 1
- NNQHEEQNPQYPGL-FXQIFTODSA-N Gln-Ala-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O NNQHEEQNPQYPGL-FXQIFTODSA-N 0.000 description 1
- SSWAFVQFQWOJIJ-XIRDDKMYSA-N Gln-Arg-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCC(=O)N)N SSWAFVQFQWOJIJ-XIRDDKMYSA-N 0.000 description 1
- TWHDOEYLXXQYOZ-FXQIFTODSA-N Gln-Asn-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N TWHDOEYLXXQYOZ-FXQIFTODSA-N 0.000 description 1
- KVXVVDFOZNYYKZ-DCAQKATOSA-N Gln-Gln-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O KVXVVDFOZNYYKZ-DCAQKATOSA-N 0.000 description 1
- KDXKFBSNIJYNNR-YVNDNENWSA-N Gln-Glu-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KDXKFBSNIJYNNR-YVNDNENWSA-N 0.000 description 1
- GNMQDOGFWYWPNM-LAEOZQHASA-N Gln-Gly-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)CNC(=O)[C@@H](N)CCC(N)=O)C(O)=O GNMQDOGFWYWPNM-LAEOZQHASA-N 0.000 description 1
- ORYMMTRPKVTGSJ-XVKPBYJWSA-N Gln-Gly-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCC(N)=O ORYMMTRPKVTGSJ-XVKPBYJWSA-N 0.000 description 1
- HXOLDXKNWKLDMM-YVNDNENWSA-N Gln-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N HXOLDXKNWKLDMM-YVNDNENWSA-N 0.000 description 1
- ITZWDGBYBPUZRG-KBIXCLLPSA-N Gln-Ile-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O ITZWDGBYBPUZRG-KBIXCLLPSA-N 0.000 description 1
- CAXXTYYGFYTBPV-IUCAKERBSA-N Gln-Leu-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O CAXXTYYGFYTBPV-IUCAKERBSA-N 0.000 description 1
- SFAFZYYMAWOCIC-KKUMJFAQSA-N Gln-Phe-Arg Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N SFAFZYYMAWOCIC-KKUMJFAQSA-N 0.000 description 1
- UEILCTONAMOGBR-RWRJDSDZSA-N Gln-Thr-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UEILCTONAMOGBR-RWRJDSDZSA-N 0.000 description 1
- UQKVUFGUSVYJMQ-IRIUXVKKSA-N Gln-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CCC(=O)N)N)O UQKVUFGUSVYJMQ-IRIUXVKKSA-N 0.000 description 1
- VYOILACOFPPNQH-UMNHJUIQSA-N Gln-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)N)N VYOILACOFPPNQH-UMNHJUIQSA-N 0.000 description 1
- JJKKWYQVHRUSDG-GUBZILKMSA-N Glu-Ala-Lys Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(O)=O JJKKWYQVHRUSDG-GUBZILKMSA-N 0.000 description 1
- RSUVOPBMWMTVDI-XEGUGMAKSA-N Glu-Ala-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CCC(O)=O)C)C(O)=O)=CNC2=C1 RSUVOPBMWMTVDI-XEGUGMAKSA-N 0.000 description 1
- PCBBLFVHTYNQGG-LAEOZQHASA-N Glu-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N PCBBLFVHTYNQGG-LAEOZQHASA-N 0.000 description 1
- CYHBMLHCQXXCCT-AVGNSLFASA-N Glu-Asp-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CYHBMLHCQXXCCT-AVGNSLFASA-N 0.000 description 1
- UMIRPYLZFKOEOH-YVNDNENWSA-N Glu-Gln-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UMIRPYLZFKOEOH-YVNDNENWSA-N 0.000 description 1
- PVBBEKPHARMPHX-DCAQKATOSA-N Glu-Gln-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CCC(O)=O PVBBEKPHARMPHX-DCAQKATOSA-N 0.000 description 1
- ZHNHJYYFCGUZNQ-KBIXCLLPSA-N Glu-Ile-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(O)=O ZHNHJYYFCGUZNQ-KBIXCLLPSA-N 0.000 description 1
- KRRFFAHEAOCBCQ-SIUGBPQLSA-N Glu-Ile-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KRRFFAHEAOCBCQ-SIUGBPQLSA-N 0.000 description 1
- HVYWQYLBVXMXSV-GUBZILKMSA-N Glu-Leu-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O HVYWQYLBVXMXSV-GUBZILKMSA-N 0.000 description 1
- FBEJIDRSQCGFJI-GUBZILKMSA-N Glu-Leu-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O FBEJIDRSQCGFJI-GUBZILKMSA-N 0.000 description 1
- BCYGDJXHAGZNPQ-DCAQKATOSA-N Glu-Lys-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O BCYGDJXHAGZNPQ-DCAQKATOSA-N 0.000 description 1
- AOCARQDSFTWWFT-DCAQKATOSA-N Glu-Met-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O AOCARQDSFTWWFT-DCAQKATOSA-N 0.000 description 1
- PMSMKNYRZCKVMC-DRZSPHRISA-N Glu-Phe-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CCC(=O)O)N PMSMKNYRZCKVMC-DRZSPHRISA-N 0.000 description 1
- KJBGAZSLZAQDPV-KKUMJFAQSA-N Glu-Phe-Arg Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N KJBGAZSLZAQDPV-KKUMJFAQSA-N 0.000 description 1
- UERORLSAFUHDGU-AVGNSLFASA-N Glu-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N UERORLSAFUHDGU-AVGNSLFASA-N 0.000 description 1
- KXTAGESXNQEZKB-DZKIICNBSA-N Glu-Phe-Val Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CC1=CC=CC=C1 KXTAGESXNQEZKB-DZKIICNBSA-N 0.000 description 1
- JPUNZXVHHRZMNL-XIRDDKMYSA-N Glu-Pro-Trp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O JPUNZXVHHRZMNL-XIRDDKMYSA-N 0.000 description 1
- VNCNWQPIQYAMAK-ACZMJKKPSA-N Glu-Ser-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O VNCNWQPIQYAMAK-ACZMJKKPSA-N 0.000 description 1
- CQGBSALYGOXQPE-HTUGSXCWSA-N Glu-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O CQGBSALYGOXQPE-HTUGSXCWSA-N 0.000 description 1
- BPCLDCNZBUYGOD-BPUTZDHNSA-N Glu-Trp-Glu Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCC(O)=O)N)C(=O)N[C@@H](CCC(O)=O)C(O)=O)=CNC2=C1 BPCLDCNZBUYGOD-BPUTZDHNSA-N 0.000 description 1
- HBMRTXJZQDVRFT-DZKIICNBSA-N Glu-Tyr-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O HBMRTXJZQDVRFT-DZKIICNBSA-N 0.000 description 1
- LZEUDRYSAZAJIO-AUTRQRHGSA-N Glu-Val-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O LZEUDRYSAZAJIO-AUTRQRHGSA-N 0.000 description 1
- FGGKGJHCVMYGCD-UKJIMTQDSA-N Glu-Val-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FGGKGJHCVMYGCD-UKJIMTQDSA-N 0.000 description 1
- QRWPTXLWHHTOCO-DZKIICNBSA-N Glu-Val-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O QRWPTXLWHHTOCO-DZKIICNBSA-N 0.000 description 1
- CIMULJZTTOBOPN-WHFBIAKZSA-N Gly-Asn-Asn Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O CIMULJZTTOBOPN-WHFBIAKZSA-N 0.000 description 1
- FMNHBTKMRFVGRO-FOHZUACHSA-N Gly-Asn-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)CN FMNHBTKMRFVGRO-FOHZUACHSA-N 0.000 description 1
- GRIRDMVMJJDZKV-RCOVLWMOSA-N Gly-Asn-Val Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O GRIRDMVMJJDZKV-RCOVLWMOSA-N 0.000 description 1
- XQHSBNVACKQWAV-WHFBIAKZSA-N Gly-Asp-Asn Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O XQHSBNVACKQWAV-WHFBIAKZSA-N 0.000 description 1
- XBWMTPAIUQIWKA-BYULHYEWSA-N Gly-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)CN XBWMTPAIUQIWKA-BYULHYEWSA-N 0.000 description 1
- MHHUEAIBJZWDBH-YUMQZZPRSA-N Gly-Asp-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)CN MHHUEAIBJZWDBH-YUMQZZPRSA-N 0.000 description 1
- ZRZILYKEJBMFHY-BQBZGAKWSA-N Gly-Asp-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)CN ZRZILYKEJBMFHY-BQBZGAKWSA-N 0.000 description 1
- DHDOADIPGZTAHT-YUMQZZPRSA-N Gly-Glu-Arg Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N DHDOADIPGZTAHT-YUMQZZPRSA-N 0.000 description 1
- STVHDEHTKFXBJQ-LAEOZQHASA-N Gly-Glu-Ile Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O STVHDEHTKFXBJQ-LAEOZQHASA-N 0.000 description 1
- HQRHFUYMGCHHJS-LURJTMIESA-N Gly-Gly-Arg Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N HQRHFUYMGCHHJS-LURJTMIESA-N 0.000 description 1
- XMPXVJIDADUOQB-RCOVLWMOSA-N Gly-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C([O-])=O)NC(=O)CNC(=O)C[NH3+] XMPXVJIDADUOQB-RCOVLWMOSA-N 0.000 description 1
- UESJMAMHDLEHGM-NHCYSSNCSA-N Gly-Ile-Leu Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O UESJMAMHDLEHGM-NHCYSSNCSA-N 0.000 description 1
- ITZOBNKQDZEOCE-NHCYSSNCSA-N Gly-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)CN ITZOBNKQDZEOCE-NHCYSSNCSA-N 0.000 description 1
- XVYKMNXXJXQKME-XEGUGMAKSA-N Gly-Ile-Tyr Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 XVYKMNXXJXQKME-XEGUGMAKSA-N 0.000 description 1
- TWTPDFFBLQEBOE-IUCAKERBSA-N Gly-Leu-Gln Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O TWTPDFFBLQEBOE-IUCAKERBSA-N 0.000 description 1
- GAFKBWKVXNERFA-QWRGUYRKSA-N Gly-Phe-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 GAFKBWKVXNERFA-QWRGUYRKSA-N 0.000 description 1
- VDCRBJACQKOSMS-JSGCOSHPSA-N Gly-Phe-Val Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O VDCRBJACQKOSMS-JSGCOSHPSA-N 0.000 description 1
- JJGBXTYGTKWGAT-YUMQZZPRSA-N Gly-Pro-Glu Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O JJGBXTYGTKWGAT-YUMQZZPRSA-N 0.000 description 1
- SSFWXSNOKDZNHY-QXEWZRGKSA-N Gly-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN SSFWXSNOKDZNHY-QXEWZRGKSA-N 0.000 description 1
- IRJWAYCXIYUHQE-WHFBIAKZSA-N Gly-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)CN IRJWAYCXIYUHQE-WHFBIAKZSA-N 0.000 description 1
- VNNRLUNBJSWZPF-ZKWXMUAHSA-N Gly-Ser-Ile Chemical compound [H]NCC(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VNNRLUNBJSWZPF-ZKWXMUAHSA-N 0.000 description 1
- WNGHUXFWEWTKAO-YUMQZZPRSA-N Gly-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)CN WNGHUXFWEWTKAO-YUMQZZPRSA-N 0.000 description 1
- JSLVAHYTAJJEQH-QWRGUYRKSA-N Gly-Ser-Phe Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 JSLVAHYTAJJEQH-QWRGUYRKSA-N 0.000 description 1
- WCORRBXVISTKQL-WHFBIAKZSA-N Gly-Ser-Ser Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O WCORRBXVISTKQL-WHFBIAKZSA-N 0.000 description 1
- LCRDMSSAKLTKBU-ZDLURKLDSA-N Gly-Ser-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)CN LCRDMSSAKLTKBU-ZDLURKLDSA-N 0.000 description 1
- DBUNZBWUWCIELX-JHEQGTHGSA-N Gly-Thr-Glu Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O DBUNZBWUWCIELX-JHEQGTHGSA-N 0.000 description 1
- FFALDIDGPLUDKV-ZDLURKLDSA-N Gly-Thr-Ser Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O FFALDIDGPLUDKV-ZDLURKLDSA-N 0.000 description 1
- 241000258937 Hemiptera Species 0.000 description 1
- VSLXGYMEHVAJBH-DLOVCJGASA-N His-Ala-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O VSLXGYMEHVAJBH-DLOVCJGASA-N 0.000 description 1
- JENKOCSDMSVWPY-SRVKXCTJSA-N His-Leu-Asn Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O JENKOCSDMSVWPY-SRVKXCTJSA-N 0.000 description 1
- RNMNYMDTESKEAJ-KKUMJFAQSA-N His-Leu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CN=CN1 RNMNYMDTESKEAJ-KKUMJFAQSA-N 0.000 description 1
- MJUUWJJEUOBDGW-IHRRRGAJSA-N His-Leu-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CN=CN1 MJUUWJJEUOBDGW-IHRRRGAJSA-N 0.000 description 1
- QEYUCKCWTMIERU-SRVKXCTJSA-N His-Lys-Asp Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)O)C(=O)O)N QEYUCKCWTMIERU-SRVKXCTJSA-N 0.000 description 1
- MDOBWSFNSNPENN-PMVVWTBXSA-N His-Thr-Gly Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O MDOBWSFNSNPENN-PMVVWTBXSA-N 0.000 description 1
- UWSMZKRTOZEGDD-CUJWVEQBSA-N His-Thr-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O UWSMZKRTOZEGDD-CUJWVEQBSA-N 0.000 description 1
- ZHMZWSFQRUGLEC-JYJNAYRXSA-N His-Tyr-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZHMZWSFQRUGLEC-JYJNAYRXSA-N 0.000 description 1
- GGXUJBKENKVYNV-ULQDDVLXSA-N His-Val-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N GGXUJBKENKVYNV-ULQDDVLXSA-N 0.000 description 1
- 206010020649 Hyperkeratosis Diseases 0.000 description 1
- QTUSJASXLGLJSR-OSUNSFLBSA-N Ile-Arg-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N QTUSJASXLGLJSR-OSUNSFLBSA-N 0.000 description 1
- QYZYJFXHXYUZMZ-UGYAYLCHSA-N Ile-Asn-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N QYZYJFXHXYUZMZ-UGYAYLCHSA-N 0.000 description 1
- PJLLMGWWINYQPB-PEFMBERDSA-N Ile-Asn-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N PJLLMGWWINYQPB-PEFMBERDSA-N 0.000 description 1
- SCHZQZPYHBWYEQ-PEFMBERDSA-N Ile-Asn-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N SCHZQZPYHBWYEQ-PEFMBERDSA-N 0.000 description 1
- HDODQNPMSHDXJT-GHCJXIJMSA-N Ile-Asn-Ser Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O HDODQNPMSHDXJT-GHCJXIJMSA-N 0.000 description 1
- UMYZBHKAVTXWIW-GMOBBJLQSA-N Ile-Asp-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N UMYZBHKAVTXWIW-GMOBBJLQSA-N 0.000 description 1
- NKRJALPCDNXULF-BYULHYEWSA-N Ile-Asp-Gly Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O NKRJALPCDNXULF-BYULHYEWSA-N 0.000 description 1
- BGZIJZJBXRVBGJ-SXTJYALSSA-N Ile-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N BGZIJZJBXRVBGJ-SXTJYALSSA-N 0.000 description 1
- AQTWDZDISVGCAC-CFMVVWHZSA-N Ile-Asp-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N AQTWDZDISVGCAC-CFMVVWHZSA-N 0.000 description 1
- BALLIXFZYSECCF-QEWYBTABSA-N Ile-Gln-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N BALLIXFZYSECCF-QEWYBTABSA-N 0.000 description 1
- PHIXPNQDGGILMP-YVNDNENWSA-N Ile-Glu-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N PHIXPNQDGGILMP-YVNDNENWSA-N 0.000 description 1
- FUOYNOXRWPJPAN-QEWYBTABSA-N Ile-Glu-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N FUOYNOXRWPJPAN-QEWYBTABSA-N 0.000 description 1
- PNDMHTTXXPUQJH-RWRJDSDZSA-N Ile-Glu-Thr Chemical compound N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H]([C@H](O)C)C(=O)O PNDMHTTXXPUQJH-RWRJDSDZSA-N 0.000 description 1
- ODPKZZLRDNXTJZ-WHOFXGATSA-N Ile-Gly-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N ODPKZZLRDNXTJZ-WHOFXGATSA-N 0.000 description 1
- UWLHDGMRWXHFFY-HPCHECBXSA-N Ile-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N1CCC[C@@H]1C(=O)O)N UWLHDGMRWXHFFY-HPCHECBXSA-N 0.000 description 1
- OUUCIIJSBIBCHB-ZPFDUUQYSA-N Ile-Leu-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O OUUCIIJSBIBCHB-ZPFDUUQYSA-N 0.000 description 1
- TVYWVSJGSHQWMT-AJNGGQMLSA-N Ile-Leu-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N TVYWVSJGSHQWMT-AJNGGQMLSA-N 0.000 description 1
- FFJQAEYLAQMGDL-MGHWNKPDSA-N Ile-Lys-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 FFJQAEYLAQMGDL-MGHWNKPDSA-N 0.000 description 1
- MASWXTFJVNRZPT-NAKRPEOUSA-N Ile-Met-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(=O)O)N MASWXTFJVNRZPT-NAKRPEOUSA-N 0.000 description 1
- FQYQMFCIJNWDQZ-CYDGBPFRSA-N Ile-Pro-Pro Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 FQYQMFCIJNWDQZ-CYDGBPFRSA-N 0.000 description 1
- JHNJNTMTZHEDLJ-NAKRPEOUSA-N Ile-Ser-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O JHNJNTMTZHEDLJ-NAKRPEOUSA-N 0.000 description 1
- ZLFNNVATRMCAKN-ZKWXMUAHSA-N Ile-Ser-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)NCC(=O)O)N ZLFNNVATRMCAKN-ZKWXMUAHSA-N 0.000 description 1
- NAFIFZNBSPWYOO-RWRJDSDZSA-N Ile-Thr-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N NAFIFZNBSPWYOO-RWRJDSDZSA-N 0.000 description 1
- DTPGSUQHUMELQB-GVARAGBVSA-N Ile-Tyr-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=C(O)C=C1 DTPGSUQHUMELQB-GVARAGBVSA-N 0.000 description 1
- NGKPIPCGMLWHBX-WZLNRYEVSA-N Ile-Tyr-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N NGKPIPCGMLWHBX-WZLNRYEVSA-N 0.000 description 1
- YWCJXQKATPNPOE-UKJIMTQDSA-N Ile-Val-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N YWCJXQKATPNPOE-UKJIMTQDSA-N 0.000 description 1
- RQZFWBLDTBDEOF-RNJOBUHISA-N Ile-Val-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N RQZFWBLDTBDEOF-RNJOBUHISA-N 0.000 description 1
- 108010065920 Insulin Lispro Proteins 0.000 description 1
- HGCNKOLVKRAVHD-UHFFFAOYSA-N L-Met-L-Phe Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 HGCNKOLVKRAVHD-UHFFFAOYSA-N 0.000 description 1
- KFKWRHQBZQICHA-STQMWFEESA-N L-leucyl-L-phenylalanine Natural products CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KFKWRHQBZQICHA-STQMWFEESA-N 0.000 description 1
- ZRLUISBDKUWAIZ-CIUDSAMLSA-N Leu-Ala-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O ZRLUISBDKUWAIZ-CIUDSAMLSA-N 0.000 description 1
- VCSBGUACOYUIGD-CIUDSAMLSA-N Leu-Asn-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O VCSBGUACOYUIGD-CIUDSAMLSA-N 0.000 description 1
- KTFHTMHHKXUYPW-ZPFDUUQYSA-N Leu-Asp-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KTFHTMHHKXUYPW-ZPFDUUQYSA-N 0.000 description 1
- HUEBCHPSXSQUGN-GARJFASQSA-N Leu-Cys-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)N1CCC[C@@H]1C(=O)O)N HUEBCHPSXSQUGN-GARJFASQSA-N 0.000 description 1
- LOLUPZNNADDTAA-AVGNSLFASA-N Leu-Gln-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LOLUPZNNADDTAA-AVGNSLFASA-N 0.000 description 1
- GLBNEGIOFRVRHO-JYJNAYRXSA-N Leu-Gln-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O GLBNEGIOFRVRHO-JYJNAYRXSA-N 0.000 description 1
- GPICTNQYKHHHTH-GUBZILKMSA-N Leu-Gln-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O GPICTNQYKHHHTH-GUBZILKMSA-N 0.000 description 1
- RVVBWTWPNFDYBE-SRVKXCTJSA-N Leu-Glu-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O RVVBWTWPNFDYBE-SRVKXCTJSA-N 0.000 description 1
- YVKSMSDXKMSIRX-GUBZILKMSA-N Leu-Glu-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O YVKSMSDXKMSIRX-GUBZILKMSA-N 0.000 description 1
- NEEOBPIXKWSBRF-IUCAKERBSA-N Leu-Glu-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O NEEOBPIXKWSBRF-IUCAKERBSA-N 0.000 description 1
- HYIFFZAQXPUEAU-QWRGUYRKSA-N Leu-Gly-Leu Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(C)C HYIFFZAQXPUEAU-QWRGUYRKSA-N 0.000 description 1
- QJXHMYMRGDOHRU-NHCYSSNCSA-N Leu-Ile-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O QJXHMYMRGDOHRU-NHCYSSNCSA-N 0.000 description 1
- LIINDKYIGYTDLG-PPCPHDFISA-N Leu-Ile-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LIINDKYIGYTDLG-PPCPHDFISA-N 0.000 description 1
- JKSIBWITFMQTOA-XUXIUFHCSA-N Leu-Ile-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O JKSIBWITFMQTOA-XUXIUFHCSA-N 0.000 description 1
- RXGLHDWAZQECBI-SRVKXCTJSA-N Leu-Leu-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O RXGLHDWAZQECBI-SRVKXCTJSA-N 0.000 description 1
- FKQPWMZLIIATBA-AJNGGQMLSA-N Leu-Lys-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FKQPWMZLIIATBA-AJNGGQMLSA-N 0.000 description 1
- RTIRBWJPYJYTLO-MELADBBJSA-N Leu-Lys-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(=O)O)N RTIRBWJPYJYTLO-MELADBBJSA-N 0.000 description 1
- INCJJHQRZGQLFC-KBPBESRZSA-N Leu-Phe-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(O)=O INCJJHQRZGQLFC-KBPBESRZSA-N 0.000 description 1
- CHJKEDSZNSONPS-DCAQKATOSA-N Leu-Pro-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O CHJKEDSZNSONPS-DCAQKATOSA-N 0.000 description 1
- PWPBLZXWFXJFHE-RHYQMDGZSA-N Leu-Pro-Thr Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O PWPBLZXWFXJFHE-RHYQMDGZSA-N 0.000 description 1
- IZPVWNSAVUQBGP-CIUDSAMLSA-N Leu-Ser-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O IZPVWNSAVUQBGP-CIUDSAMLSA-N 0.000 description 1
- AKVBOOKXVAMKSS-GUBZILKMSA-N Leu-Ser-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O AKVBOOKXVAMKSS-GUBZILKMSA-N 0.000 description 1
- SBANPBVRHYIMRR-UHFFFAOYSA-N Leu-Ser-Pro Natural products CC(C)CC(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O SBANPBVRHYIMRR-UHFFFAOYSA-N 0.000 description 1
- BRTVHXHCUSXYRI-CIUDSAMLSA-N Leu-Ser-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O BRTVHXHCUSXYRI-CIUDSAMLSA-N 0.000 description 1
- ZJZNLRVCZWUONM-JXUBOQSCSA-N Leu-Thr-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O ZJZNLRVCZWUONM-JXUBOQSCSA-N 0.000 description 1
- LCNASHSOFMRYFO-WDCWCFNPSA-N Leu-Thr-Gln Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCC(N)=O LCNASHSOFMRYFO-WDCWCFNPSA-N 0.000 description 1
- ODRREERHVHMIPT-OEAJRASXSA-N Leu-Thr-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ODRREERHVHMIPT-OEAJRASXSA-N 0.000 description 1
- KLSUAWUZBMAZCL-RHYQMDGZSA-N Leu-Thr-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(O)=O KLSUAWUZBMAZCL-RHYQMDGZSA-N 0.000 description 1
- GZRABTMNWJXFMH-UVOCVTCTSA-N Leu-Thr-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GZRABTMNWJXFMH-UVOCVTCTSA-N 0.000 description 1
- VHTIZYYHIUHMCA-JYJNAYRXSA-N Leu-Tyr-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O VHTIZYYHIUHMCA-JYJNAYRXSA-N 0.000 description 1
- WFCKERTZVCQXKH-KBPBESRZSA-N Leu-Tyr-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(O)=O WFCKERTZVCQXKH-KBPBESRZSA-N 0.000 description 1
- VQHUBNVKFFLWRP-ULQDDVLXSA-N Leu-Tyr-Val Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CC1=CC=C(O)C=C1 VQHUBNVKFFLWRP-ULQDDVLXSA-N 0.000 description 1
- MVJRBCJCRYGCKV-GVXVVHGQSA-N Leu-Val-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O MVJRBCJCRYGCKV-GVXVVHGQSA-N 0.000 description 1
- DGAAQRAUOFHBFJ-CIUDSAMLSA-N Lys-Asn-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O DGAAQRAUOFHBFJ-CIUDSAMLSA-N 0.000 description 1
- 108010062166 Lys-Asn-Asp Proteins 0.000 description 1
- BYPMOIFBQPEWOH-CIUDSAMLSA-N Lys-Asn-Asp Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N BYPMOIFBQPEWOH-CIUDSAMLSA-N 0.000 description 1
- KPJJOZUXFOLGMQ-CIUDSAMLSA-N Lys-Asp-Asn Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N KPJJOZUXFOLGMQ-CIUDSAMLSA-N 0.000 description 1
- AAORVPFVUIHEAB-YUMQZZPRSA-N Lys-Asp-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O AAORVPFVUIHEAB-YUMQZZPRSA-N 0.000 description 1
- RZHLIPMZXOEJTL-AVGNSLFASA-N Lys-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCCN)N RZHLIPMZXOEJTL-AVGNSLFASA-N 0.000 description 1
- NKKFVJRLCCUJNA-QWRGUYRKSA-N Lys-Gly-Lys Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCCN NKKFVJRLCCUJNA-QWRGUYRKSA-N 0.000 description 1
- RFQATBGBLDAKGI-VHSXEESVSA-N Lys-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CCCCN)N)C(=O)O RFQATBGBLDAKGI-VHSXEESVSA-N 0.000 description 1
- FHIAJWBDZVHLAH-YUMQZZPRSA-N Lys-Gly-Ser Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O FHIAJWBDZVHLAH-YUMQZZPRSA-N 0.000 description 1
- MUXNCRWTWBMNHX-SRVKXCTJSA-N Lys-Leu-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O MUXNCRWTWBMNHX-SRVKXCTJSA-N 0.000 description 1
- WRODMZBHNNPRLN-SRVKXCTJSA-N Lys-Leu-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O WRODMZBHNNPRLN-SRVKXCTJSA-N 0.000 description 1
- WBSCNDJQPKSPII-KKUMJFAQSA-N Lys-Lys-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O WBSCNDJQPKSPII-KKUMJFAQSA-N 0.000 description 1
- KJIXWRWPOCKYLD-IHRRRGAJSA-N Lys-Lys-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCCN)N KJIXWRWPOCKYLD-IHRRRGAJSA-N 0.000 description 1
- WWEWGPOLIJXGNX-XUXIUFHCSA-N Lys-Met-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CCCCN)N WWEWGPOLIJXGNX-XUXIUFHCSA-N 0.000 description 1
- ZJSZPXISKMDJKQ-JYJNAYRXSA-N Lys-Phe-Glu Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCC(O)=O)C(O)=O)CC1=CC=CC=C1 ZJSZPXISKMDJKQ-JYJNAYRXSA-N 0.000 description 1
- PDIDTSZKKFEDMB-UWVGGRQHSA-N Lys-Pro-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O PDIDTSZKKFEDMB-UWVGGRQHSA-N 0.000 description 1
- DNWBUCHHMRQWCZ-GUBZILKMSA-N Lys-Ser-Gln Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(N)=O DNWBUCHHMRQWCZ-GUBZILKMSA-N 0.000 description 1
- PLOUVAYOMTYJRG-JXUBOQSCSA-N Lys-Thr-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O PLOUVAYOMTYJRG-JXUBOQSCSA-N 0.000 description 1
- QVTDVTONTRSQMF-WDCWCFNPSA-N Lys-Thr-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CCCCN QVTDVTONTRSQMF-WDCWCFNPSA-N 0.000 description 1
- RPWTZTBIFGENIA-VOAKCMCISA-N Lys-Thr-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O RPWTZTBIFGENIA-VOAKCMCISA-N 0.000 description 1
- RMOKGALPSPOYKE-KATARQTJSA-N Lys-Thr-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O RMOKGALPSPOYKE-KATARQTJSA-N 0.000 description 1
- NYTDJEZBAAFLLG-IHRRRGAJSA-N Lys-Val-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(O)=O NYTDJEZBAAFLLG-IHRRRGAJSA-N 0.000 description 1
- HMZPYMSEAALNAE-ULQDDVLXSA-N Lys-Val-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O HMZPYMSEAALNAE-ULQDDVLXSA-N 0.000 description 1
- IYXDSYWCVVXSKB-CIUDSAMLSA-N Met-Asn-Glu Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O IYXDSYWCVVXSKB-CIUDSAMLSA-N 0.000 description 1
- DRXODWRPPUFIAY-DCAQKATOSA-N Met-Asn-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCCN DRXODWRPPUFIAY-DCAQKATOSA-N 0.000 description 1
- ZMYHJISLFYTQGK-FXQIFTODSA-N Met-Asp-Asn Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ZMYHJISLFYTQGK-FXQIFTODSA-N 0.000 description 1
- JQHYVIKEFYETEW-IHRRRGAJSA-N Met-Phe-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CC1=CC=CC=C1 JQHYVIKEFYETEW-IHRRRGAJSA-N 0.000 description 1
- LQTGGXSOMDSWTQ-UNQGMJICSA-N Met-Phe-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CCSC)N)O LQTGGXSOMDSWTQ-UNQGMJICSA-N 0.000 description 1
- LXCSZPUQKMTXNW-BQBZGAKWSA-N Met-Ser-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O LXCSZPUQKMTXNW-BQBZGAKWSA-N 0.000 description 1
- 102000016943 Muramidase Human genes 0.000 description 1
- 108010014251 Muramidase Proteins 0.000 description 1
- 108010062010 N-Acetylmuramoyl-L-alanine Amidase Proteins 0.000 description 1
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 1
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 1
- 108010079364 N-glycylalanine Proteins 0.000 description 1
- 108010034522 NNQQ peptide Proteins 0.000 description 1
- 108010065395 Neuropep-1 Proteins 0.000 description 1
- 102000035195 Peptidases Human genes 0.000 description 1
- BKWJQWJPZMUWEG-LFSVMHDDSA-N Phe-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 BKWJQWJPZMUWEG-LFSVMHDDSA-N 0.000 description 1
- NOFBJKKOPKJDCO-KKXDTOCCSA-N Phe-Ala-Tyr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O NOFBJKKOPKJDCO-KKXDTOCCSA-N 0.000 description 1
- SEPNOAFMZLLCEW-UBHSHLNASA-N Phe-Ala-Val Chemical compound N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)O SEPNOAFMZLLCEW-UBHSHLNASA-N 0.000 description 1
- JEGFCFLCRSJCMA-IHRRRGAJSA-N Phe-Arg-Ser Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CO)C(=O)O)N JEGFCFLCRSJCMA-IHRRRGAJSA-N 0.000 description 1
- CSYVXYQDIVCQNU-QWRGUYRKSA-N Phe-Asp-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O CSYVXYQDIVCQNU-QWRGUYRKSA-N 0.000 description 1
- OYQBFWWQSVIHBN-FHWLQOOXSA-N Phe-Glu-Phe Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O OYQBFWWQSVIHBN-FHWLQOOXSA-N 0.000 description 1
- NPLGQVKZFGJWAI-QWHCGFSZSA-N Phe-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O NPLGQVKZFGJWAI-QWHCGFSZSA-N 0.000 description 1
- FXYXBEZMRACDDR-KKUMJFAQSA-N Phe-His-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(O)=O FXYXBEZMRACDDR-KKUMJFAQSA-N 0.000 description 1
- GYEPCBNTTRORKW-PCBIJLKTSA-N Phe-Ile-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O GYEPCBNTTRORKW-PCBIJLKTSA-N 0.000 description 1
- GXDPQJUBLBZKDY-IAVJCBSLSA-N Phe-Ile-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O GXDPQJUBLBZKDY-IAVJCBSLSA-N 0.000 description 1
- YTILBRIUASDGBL-BZSNNMDCSA-N Phe-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 YTILBRIUASDGBL-BZSNNMDCSA-N 0.000 description 1
- XZQYIJALMGEUJD-OEAJRASXSA-N Phe-Lys-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XZQYIJALMGEUJD-OEAJRASXSA-N 0.000 description 1
- AFNJAQVMTIQTCB-DLOVCJGASA-N Phe-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=CC=C1 AFNJAQVMTIQTCB-DLOVCJGASA-N 0.000 description 1
- ILGCZYGFYQLSDZ-KKUMJFAQSA-N Phe-Ser-His Chemical compound N[C@@H](Cc1ccccc1)C(=O)N[C@@H](CO)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O ILGCZYGFYQLSDZ-KKUMJFAQSA-N 0.000 description 1
- MVIJMIZJPHQGEN-IHRRRGAJSA-N Phe-Ser-Val Chemical compound CC(C)[C@@H](C([O-])=O)NC(=O)[C@H](CO)NC(=O)[C@@H]([NH3+])CC1=CC=CC=C1 MVIJMIZJPHQGEN-IHRRRGAJSA-N 0.000 description 1
- BPIMVBKDLSBKIJ-FCLVOEFKSA-N Phe-Thr-Phe Chemical compound C([C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 BPIMVBKDLSBKIJ-FCLVOEFKSA-N 0.000 description 1
- BAONJAHBAUDJKA-BZSNNMDCSA-N Phe-Tyr-Asp Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CC(O)=O)C(O)=O)C1=CC=CC=C1 BAONJAHBAUDJKA-BZSNNMDCSA-N 0.000 description 1
- MMPBPRXOFJNCCN-ZEWNOJEFSA-N Phe-Tyr-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MMPBPRXOFJNCCN-ZEWNOJEFSA-N 0.000 description 1
- 241000500437 Plutella xylostella Species 0.000 description 1
- APKRGYLBSCWJJP-FXQIFTODSA-N Pro-Ala-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O APKRGYLBSCWJJP-FXQIFTODSA-N 0.000 description 1
- MLQVJYMFASXBGZ-IHRRRGAJSA-N Pro-Asn-Tyr Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O MLQVJYMFASXBGZ-IHRRRGAJSA-N 0.000 description 1
- ZCXQTRXYZOSGJR-FXQIFTODSA-N Pro-Asp-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O ZCXQTRXYZOSGJR-FXQIFTODSA-N 0.000 description 1
- FISHYTLIMUYTQY-GUBZILKMSA-N Pro-Gln-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H]1CCCN1 FISHYTLIMUYTQY-GUBZILKMSA-N 0.000 description 1
- CLNJSLSHKJECME-BQBZGAKWSA-N Pro-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H]1CCCN1 CLNJSLSHKJECME-BQBZGAKWSA-N 0.000 description 1
- BBFRBZYKHIKFBX-GMOBBJLQSA-N Pro-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@@H]1CCCN1 BBFRBZYKHIKFBX-GMOBBJLQSA-N 0.000 description 1
- FXGIMYRVJJEIIM-UWVGGRQHSA-N Pro-Leu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 FXGIMYRVJJEIIM-UWVGGRQHSA-N 0.000 description 1
- POQFNPILEQEODH-FXQIFTODSA-N Pro-Ser-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O POQFNPILEQEODH-FXQIFTODSA-N 0.000 description 1
- FNGOXVQBBCMFKV-CIUDSAMLSA-N Pro-Ser-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O FNGOXVQBBCMFKV-CIUDSAMLSA-N 0.000 description 1
- ITUDDXVFGFEKPD-NAKRPEOUSA-N Pro-Ser-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ITUDDXVFGFEKPD-NAKRPEOUSA-N 0.000 description 1
- 101150025008 RBM38 gene Proteins 0.000 description 1
- 102100025859 RNA-binding protein 38 Human genes 0.000 description 1
- 102100037486 Reverse transcriptase/ribonuclease H Human genes 0.000 description 1
- FIDMVVBUOCMMJG-CIUDSAMLSA-N Ser-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CO FIDMVVBUOCMMJG-CIUDSAMLSA-N 0.000 description 1
- DKKGAAJTDKHWOD-BIIVOSGPSA-N Ser-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CO)N)C(=O)O DKKGAAJTDKHWOD-BIIVOSGPSA-N 0.000 description 1
- BYIROAKULFFTEK-CIUDSAMLSA-N Ser-Asp-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CO BYIROAKULFFTEK-CIUDSAMLSA-N 0.000 description 1
- GWMXFEMMBHOKDX-AVGNSLFASA-N Ser-Gln-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 GWMXFEMMBHOKDX-AVGNSLFASA-N 0.000 description 1
- DGPGKMKUNGKHPK-QEJZJMRPSA-N Ser-Gln-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CO)N DGPGKMKUNGKHPK-QEJZJMRPSA-N 0.000 description 1
- LALNXSXEYFUUDD-GUBZILKMSA-N Ser-Glu-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LALNXSXEYFUUDD-GUBZILKMSA-N 0.000 description 1
- OHKFXGKHSJKKAL-NRPADANISA-N Ser-Glu-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O OHKFXGKHSJKKAL-NRPADANISA-N 0.000 description 1
- BPMRXBZYPGYPJN-WHFBIAKZSA-N Ser-Gly-Asn Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O BPMRXBZYPGYPJN-WHFBIAKZSA-N 0.000 description 1
- MUARUIBTKQJKFY-WHFBIAKZSA-N Ser-Gly-Asp Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O MUARUIBTKQJKFY-WHFBIAKZSA-N 0.000 description 1
- IXCHOHLPHNGFTJ-YUMQZZPRSA-N Ser-Gly-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CO)N IXCHOHLPHNGFTJ-YUMQZZPRSA-N 0.000 description 1
- JFWDJFULOLKQFY-QWRGUYRKSA-N Ser-Gly-Phe Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JFWDJFULOLKQFY-QWRGUYRKSA-N 0.000 description 1
- UIGMAMGZOJVTDN-WHFBIAKZSA-N Ser-Gly-Ser Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O UIGMAMGZOJVTDN-WHFBIAKZSA-N 0.000 description 1
- JIPVNVNKXJLFJF-BJDJZHNGSA-N Ser-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N JIPVNVNKXJLFJF-BJDJZHNGSA-N 0.000 description 1
- GVMUJUPXFQFBBZ-GUBZILKMSA-N Ser-Lys-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O GVMUJUPXFQFBBZ-GUBZILKMSA-N 0.000 description 1
- BSXKBOUZDAZXHE-CIUDSAMLSA-N Ser-Pro-Glu Chemical compound [H]N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O BSXKBOUZDAZXHE-CIUDSAMLSA-N 0.000 description 1
- RHAPJNVNWDBFQI-BQBZGAKWSA-N Ser-Pro-Gly Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O RHAPJNVNWDBFQI-BQBZGAKWSA-N 0.000 description 1
- CKDXFSPMIDSMGV-GUBZILKMSA-N Ser-Pro-Val Chemical compound [H]N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O CKDXFSPMIDSMGV-GUBZILKMSA-N 0.000 description 1
- FZXOPYUEQGDGMS-ACZMJKKPSA-N Ser-Ser-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O FZXOPYUEQGDGMS-ACZMJKKPSA-N 0.000 description 1
- PYTKULIABVRXSC-BWBBJGPYSA-N Ser-Ser-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PYTKULIABVRXSC-BWBBJGPYSA-N 0.000 description 1
- NADLKBTYNKUJEP-KATARQTJSA-N Ser-Thr-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O NADLKBTYNKUJEP-KATARQTJSA-N 0.000 description 1
- ZKOKTQPHFMRSJP-YJRXYDGGSA-N Ser-Thr-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ZKOKTQPHFMRSJP-YJRXYDGGSA-N 0.000 description 1
- YEDSOSIKVUMIJE-DCAQKATOSA-N Ser-Val-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O YEDSOSIKVUMIJE-DCAQKATOSA-N 0.000 description 1
- SIEBDTCABMZCLF-XGEHTFHBSA-N Ser-Val-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SIEBDTCABMZCLF-XGEHTFHBSA-N 0.000 description 1
- HSWXBJCBYSWBPT-GUBZILKMSA-N Ser-Val-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CO)C(C)C)C(O)=O HSWXBJCBYSWBPT-GUBZILKMSA-N 0.000 description 1
- LVHHEVGYAZGXDE-KDXUFGMBSA-N Thr-Ala-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)N1CCC[C@@H]1C(=O)O)N)O LVHHEVGYAZGXDE-KDXUFGMBSA-N 0.000 description 1
- CAGTXGDOIFXLPC-KZVJFYERSA-N Thr-Arg-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CCCN=C(N)N CAGTXGDOIFXLPC-KZVJFYERSA-N 0.000 description 1
- NRUPKQSXTJNQGD-XGEHTFHBSA-N Thr-Cys-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NRUPKQSXTJNQGD-XGEHTFHBSA-N 0.000 description 1
- JMGJDTNUMAZNLX-RWRJDSDZSA-N Thr-Glu-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JMGJDTNUMAZNLX-RWRJDSDZSA-N 0.000 description 1
- LHEZGZQRLDBSRR-WDCWCFNPSA-N Thr-Glu-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LHEZGZQRLDBSRR-WDCWCFNPSA-N 0.000 description 1
- XPNSAQMEAVSQRD-FBCQKBJTSA-N Thr-Gly-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)NCC(O)=O XPNSAQMEAVSQRD-FBCQKBJTSA-N 0.000 description 1
- JQAWYCUUFIMTHE-WLTAIBSBSA-N Thr-Gly-Tyr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JQAWYCUUFIMTHE-WLTAIBSBSA-N 0.000 description 1
- AMXMBCAXAZUCFA-RHYQMDGZSA-N Thr-Leu-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O AMXMBCAXAZUCFA-RHYQMDGZSA-N 0.000 description 1
- KRDSCBLRHORMRK-JXUBOQSCSA-N Thr-Lys-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O KRDSCBLRHORMRK-JXUBOQSCSA-N 0.000 description 1
- SPVHQURZJCUDQC-VOAKCMCISA-N Thr-Lys-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O SPVHQURZJCUDQC-VOAKCMCISA-N 0.000 description 1
- QNCFWHZVRNXAKW-OEAJRASXSA-N Thr-Lys-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QNCFWHZVRNXAKW-OEAJRASXSA-N 0.000 description 1
- XSEPSRUDSPHMPX-KATARQTJSA-N Thr-Lys-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O XSEPSRUDSPHMPX-KATARQTJSA-N 0.000 description 1
- QHUWWSQZTFLXPQ-FJXKBIBVSA-N Thr-Met-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(=O)NCC(O)=O QHUWWSQZTFLXPQ-FJXKBIBVSA-N 0.000 description 1
- WRUWXBBEFUTJOU-XGEHTFHBSA-N Thr-Met-Ser Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)O)N)O WRUWXBBEFUTJOU-XGEHTFHBSA-N 0.000 description 1
- WTMPKZWHRCMMMT-KZVJFYERSA-N Thr-Pro-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O WTMPKZWHRCMMMT-KZVJFYERSA-N 0.000 description 1
- OLFOOYQTTQSSRK-UNQGMJICSA-N Thr-Pro-Phe Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 OLFOOYQTTQSSRK-UNQGMJICSA-N 0.000 description 1
- NQQMWWVVGIXUOX-SVSWQMSJSA-N Thr-Ser-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NQQMWWVVGIXUOX-SVSWQMSJSA-N 0.000 description 1
- XZUBGOYOGDRYFC-XGEHTFHBSA-N Thr-Ser-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(O)=O XZUBGOYOGDRYFC-XGEHTFHBSA-N 0.000 description 1
- WPSKTVVMQCXPRO-BWBBJGPYSA-N Thr-Ser-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O WPSKTVVMQCXPRO-BWBBJGPYSA-N 0.000 description 1
- RVMNUBQWPVOUKH-HEIBUPTGSA-N Thr-Ser-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O RVMNUBQWPVOUKH-HEIBUPTGSA-N 0.000 description 1
- AAZOYLQUEQRUMZ-GSSVUCPTSA-N Thr-Thr-Asn Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(N)=O AAZOYLQUEQRUMZ-GSSVUCPTSA-N 0.000 description 1
- BKVICMPZWRNWOC-RHYQMDGZSA-N Thr-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)[C@@H](C)O BKVICMPZWRNWOC-RHYQMDGZSA-N 0.000 description 1
- DNUJCLUFRGGSDJ-YLVFBTJISA-N Trp-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC1=CNC2=CC=CC=C21)N DNUJCLUFRGGSDJ-YLVFBTJISA-N 0.000 description 1
- MKDXQPMIQPTTAW-SIXJUCDHSA-N Trp-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N MKDXQPMIQPTTAW-SIXJUCDHSA-N 0.000 description 1
- 102000004142 Trypsin Human genes 0.000 description 1
- 108090000631 Trypsin Proteins 0.000 description 1
- LGEYOIQBBIPHQN-UWJYBYFXSA-N Tyr-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 LGEYOIQBBIPHQN-UWJYBYFXSA-N 0.000 description 1
- GFZQWWDXJVGEMW-ULQDDVLXSA-N Tyr-Arg-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O)N)O GFZQWWDXJVGEMW-ULQDDVLXSA-N 0.000 description 1
- OEVJGIHPQOXYFE-SRVKXCTJSA-N Tyr-Asn-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O OEVJGIHPQOXYFE-SRVKXCTJSA-N 0.000 description 1
- DWJQKEZKLQCHKO-SRVKXCTJSA-N Tyr-Asn-Cys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N)O DWJQKEZKLQCHKO-SRVKXCTJSA-N 0.000 description 1
- XMNDQSYABVWZRK-BZSNNMDCSA-N Tyr-Asn-Phe Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O XMNDQSYABVWZRK-BZSNNMDCSA-N 0.000 description 1
- VFJIWSJKZJTQII-SRVKXCTJSA-N Tyr-Asp-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O VFJIWSJKZJTQII-SRVKXCTJSA-N 0.000 description 1
- UABYBEBXFFNCIR-YDHLFZDLSA-N Tyr-Asp-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O UABYBEBXFFNCIR-YDHLFZDLSA-N 0.000 description 1
- ARPONUQDNWLXOZ-KKUMJFAQSA-N Tyr-Gln-Arg Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ARPONUQDNWLXOZ-KKUMJFAQSA-N 0.000 description 1
- KEHKBBUYZWAMHL-DZKIICNBSA-N Tyr-Gln-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O KEHKBBUYZWAMHL-DZKIICNBSA-N 0.000 description 1
- MVFQLSPDMMFCMW-KKUMJFAQSA-N Tyr-Leu-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O MVFQLSPDMMFCMW-KKUMJFAQSA-N 0.000 description 1
- NSGZILIDHCIZAM-KKUMJFAQSA-N Tyr-Leu-Ser Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N NSGZILIDHCIZAM-KKUMJFAQSA-N 0.000 description 1
- BGFCXQXETBDEHP-BZSNNMDCSA-N Tyr-Phe-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O BGFCXQXETBDEHP-BZSNNMDCSA-N 0.000 description 1
- BIVIUZRBCAUNPW-JRQIVUDYSA-N Tyr-Thr-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O BIVIUZRBCAUNPW-JRQIVUDYSA-N 0.000 description 1
- AOIZTZRWMSPPAY-KAOXEZKKSA-N Tyr-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)O AOIZTZRWMSPPAY-KAOXEZKKSA-N 0.000 description 1
- FZADUTOCSFDBRV-RNXOBYDBSA-N Tyr-Tyr-Trp Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(O)=O)C1=CC=C(O)C=C1 FZADUTOCSFDBRV-RNXOBYDBSA-N 0.000 description 1
- SQUMHUZLJDUROQ-YDHLFZDLSA-N Tyr-Val-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O SQUMHUZLJDUROQ-YDHLFZDLSA-N 0.000 description 1
- VKYDVKAKGDNZED-STECZYCISA-N Tyr-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CC1=CC=C(C=C1)O)N VKYDVKAKGDNZED-STECZYCISA-N 0.000 description 1
- PFNZJEPSCBAVGX-CYDGBPFRSA-N Val-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](C(C)C)N PFNZJEPSCBAVGX-CYDGBPFRSA-N 0.000 description 1
- ZMDCGGKHRKNWKD-LAEOZQHASA-N Val-Asn-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N ZMDCGGKHRKNWKD-LAEOZQHASA-N 0.000 description 1
- QHDXUYOYTPWCSK-RCOVLWMOSA-N Val-Asp-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)NCC(=O)O)N QHDXUYOYTPWCSK-RCOVLWMOSA-N 0.000 description 1
- XLDYBRXERHITNH-QSFUFRPTSA-N Val-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)C(C)C XLDYBRXERHITNH-QSFUFRPTSA-N 0.000 description 1
- YODDULVCGFQRFZ-ZKWXMUAHSA-N Val-Asp-Ser Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O YODDULVCGFQRFZ-ZKWXMUAHSA-N 0.000 description 1
- XGJLNBNZNMVJRS-NRPADANISA-N Val-Glu-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O XGJLNBNZNMVJRS-NRPADANISA-N 0.000 description 1
- ZTKGDWOUYRRAOQ-ULQDDVLXSA-N Val-His-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)O)N ZTKGDWOUYRRAOQ-ULQDDVLXSA-N 0.000 description 1
- SDUBQHUJJWQTEU-XUXIUFHCSA-N Val-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](C(C)C)N SDUBQHUJJWQTEU-XUXIUFHCSA-N 0.000 description 1
- OVBMCNDKCWAXMZ-NAKRPEOUSA-N Val-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](C(C)C)N OVBMCNDKCWAXMZ-NAKRPEOUSA-N 0.000 description 1
- FEXILLGKGGTLRI-NHCYSSNCSA-N Val-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N FEXILLGKGGTLRI-NHCYSSNCSA-N 0.000 description 1
- UMPVMAYCLYMYGA-ONGXEEELSA-N Val-Leu-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O UMPVMAYCLYMYGA-ONGXEEELSA-N 0.000 description 1
- MJFSRZZJQWZHFQ-SRVKXCTJSA-N Val-Met-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(=O)O)N MJFSRZZJQWZHFQ-SRVKXCTJSA-N 0.000 description 1
- CKTMJBPRVQWPHU-JSGCOSHPSA-N Val-Phe-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)O)N CKTMJBPRVQWPHU-JSGCOSHPSA-N 0.000 description 1
- YKNOJPJWNVHORX-UNQGMJICSA-N Val-Phe-Thr Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)O)C(O)=O)CC1=CC=CC=C1 YKNOJPJWNVHORX-UNQGMJICSA-N 0.000 description 1
- XBJKAZATRJBDCU-GUBZILKMSA-N Val-Pro-Ala Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O XBJKAZATRJBDCU-GUBZILKMSA-N 0.000 description 1
- DOFAQXCYFQKSHT-SRVKXCTJSA-N Val-Pro-Pro Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DOFAQXCYFQKSHT-SRVKXCTJSA-N 0.000 description 1
- RYHUIHUOYRNNIE-NRPADANISA-N Val-Ser-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N RYHUIHUOYRNNIE-NRPADANISA-N 0.000 description 1
- VIKZGAUAKQZDOF-NRPADANISA-N Val-Ser-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O VIKZGAUAKQZDOF-NRPADANISA-N 0.000 description 1
- PGQUDQYHWICSAB-NAKRPEOUSA-N Val-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](C(C)C)N PGQUDQYHWICSAB-NAKRPEOUSA-N 0.000 description 1
- VHIZXDZMTDVFGX-DCAQKATOSA-N Val-Ser-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](C(C)C)N VHIZXDZMTDVFGX-DCAQKATOSA-N 0.000 description 1
- DLLRRUDLMSJTMB-GUBZILKMSA-N Val-Ser-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)O)N DLLRRUDLMSJTMB-GUBZILKMSA-N 0.000 description 1
- CEKSLIVSNNGOKH-KZVJFYERSA-N Val-Thr-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](C(C)C)N)O CEKSLIVSNNGOKH-KZVJFYERSA-N 0.000 description 1
- PDDJTOSAVNRJRH-UNQGMJICSA-N Val-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](C(C)C)N)O PDDJTOSAVNRJRH-UNQGMJICSA-N 0.000 description 1
- QHSSPPHOHJSTML-HOCLYGCPSA-N Val-Trp-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)NCC(=O)O)N QHSSPPHOHJSTML-HOCLYGCPSA-N 0.000 description 1
- PDASTHRLDFOZMG-JYJNAYRXSA-N Val-Tyr-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC1=CC=C(O)C=C1 PDASTHRLDFOZMG-JYJNAYRXSA-N 0.000 description 1
- 238000012271 agricultural production Methods 0.000 description 1
- 108010044940 alanylglutamine Proteins 0.000 description 1
- 108010047495 alanylglycine Proteins 0.000 description 1
- 108010070944 alanylhistidine Proteins 0.000 description 1
- 108010087924 alanylproline Proteins 0.000 description 1
- 108010070783 alanyltyrosine Proteins 0.000 description 1
- 108010009111 arginyl-glycyl-glutamic acid Proteins 0.000 description 1
- 108010069926 arginyl-glycyl-serine Proteins 0.000 description 1
- 108010062796 arginyllysine Proteins 0.000 description 1
- 108010010430 asparagine-proline-alanine Proteins 0.000 description 1
- 108010077245 asparaginyl-proline Proteins 0.000 description 1
- 108010047857 aspartylglycine Proteins 0.000 description 1
- 108010068265 aspartyltyrosine Proteins 0.000 description 1
- 238000003556 assay Methods 0.000 description 1
- 230000037429 base substitution Effects 0.000 description 1
- 210000002469 basement membrane Anatomy 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 230000008901 benefit Effects 0.000 description 1
- GINJFDRNADDBIN-FXQIFTODSA-N bilanafos Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCP(C)(O)=O GINJFDRNADDBIN-FXQIFTODSA-N 0.000 description 1
- 230000004071 biological effect Effects 0.000 description 1
- 208000037887 cell injury Diseases 0.000 description 1
- 210000000349 chromosome Anatomy 0.000 description 1
- 230000004186 co-expression Effects 0.000 description 1
- 239000013078 crystal Substances 0.000 description 1
- 230000009089 cytolysis Effects 0.000 description 1
- 230000006378 damage Effects 0.000 description 1
- 238000007405 data analysis Methods 0.000 description 1
- 230000034994 death Effects 0.000 description 1
- 230000003111 delayed effect Effects 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- 230000029087 digestion Effects 0.000 description 1
- 230000001079 digestive effect Effects 0.000 description 1
- 102000038379 digestive enzymes Human genes 0.000 description 1
- 108091007734 digestive enzymes Proteins 0.000 description 1
- FSXRLASFHBWESK-UHFFFAOYSA-N dipeptide phenylalanyl-tyrosine Natural products C=1C=C(O)C=CC=1CC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FSXRLASFHBWESK-UHFFFAOYSA-N 0.000 description 1
- 108010054813 diprotin B Proteins 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 238000003912 environmental pollution Methods 0.000 description 1
- 210000005081 epithelial layer Anatomy 0.000 description 1
- 239000011536 extraction buffer Substances 0.000 description 1
- 239000012530 fluid Substances 0.000 description 1
- 230000037406 food intake Effects 0.000 description 1
- 235000011389 fruit/vegetable juice Nutrition 0.000 description 1
- 108010063718 gamma-glutamylaspartic acid Proteins 0.000 description 1
- 210000001035 gastrointestinal tract Anatomy 0.000 description 1
- 238000010353 genetic engineering Methods 0.000 description 1
- 108010078144 glutaminyl-glycine Proteins 0.000 description 1
- 108010008237 glutamyl-valyl-glycine Proteins 0.000 description 1
- 108010049041 glutamylalanine Proteins 0.000 description 1
- 108010079547 glutamylmethionine Proteins 0.000 description 1
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 1
- 108010077435 glycyl-phenylalanyl-glycine Proteins 0.000 description 1
- 108010079413 glycyl-prolyl-glutamic acid Proteins 0.000 description 1
- 108010074027 glycyl-seryl-phenylalanine Proteins 0.000 description 1
- 108010089804 glycyl-threonine Proteins 0.000 description 1
- 108010020688 glycylhistidine Proteins 0.000 description 1
- 108010015792 glycyllysine Proteins 0.000 description 1
- 108010081551 glycylphenylalanine Proteins 0.000 description 1
- 108010077515 glycylproline Proteins 0.000 description 1
- 230000036541 health Effects 0.000 description 1
- 230000002363 herbicidal effect Effects 0.000 description 1
- 239000004009 herbicide Substances 0.000 description 1
- 108010092114 histidylphenylalanine Proteins 0.000 description 1
- 230000007062 hydrolysis Effects 0.000 description 1
- 238000006460 hydrolysis reaction Methods 0.000 description 1
- 230000001900 immune effect Effects 0.000 description 1
- 238000001727 in vivo Methods 0.000 description 1
- 230000006698 induction Effects 0.000 description 1
- 239000002917 insecticide Substances 0.000 description 1
- 210000002490 intestinal epithelial cell Anatomy 0.000 description 1
- 108010031424 isoleucyl-prolyl-proline Proteins 0.000 description 1
- 108010053037 kyotorphin Proteins 0.000 description 1
- 108010044311 leucyl-glycyl-glycine Proteins 0.000 description 1
- 108010090333 leucyl-lysyl-proline Proteins 0.000 description 1
- 108010057821 leucylproline Proteins 0.000 description 1
- 210000003750 lower gastrointestinal tract Anatomy 0.000 description 1
- 239000012139 lysis buffer Substances 0.000 description 1
- 229960000274 lysozyme Drugs 0.000 description 1
- 239000004325 lysozyme Substances 0.000 description 1
- 235000010335 lysozyme Nutrition 0.000 description 1
- 108010054155 lysyllysine Proteins 0.000 description 1
- 108010017391 lysylvaline Proteins 0.000 description 1
- 108010016686 methionyl-alanyl-serine Proteins 0.000 description 1
- 210000000110 microvilli Anatomy 0.000 description 1
- 238000002156 mixing Methods 0.000 description 1
- 239000013642 negative control Substances 0.000 description 1
- 230000003204 osmotic effect Effects 0.000 description 1
- 239000000447 pesticide residue Substances 0.000 description 1
- 108010012581 phenylalanylglutamate Proteins 0.000 description 1
- 108010051242 phenylalanylserine Proteins 0.000 description 1
- 230000008569 process Effects 0.000 description 1
- 108700042769 prolyl-leucyl-glycine Proteins 0.000 description 1
- 108010079317 prolyl-tyrosine Proteins 0.000 description 1
- 238000002331 protein detection Methods 0.000 description 1
- 238000000746 purification Methods 0.000 description 1
- 238000003908 quality control method Methods 0.000 description 1
- 108020003175 receptors Proteins 0.000 description 1
- 102000005962 receptors Human genes 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 238000012216 screening Methods 0.000 description 1
- 230000003248 secreting effect Effects 0.000 description 1
- 108010048818 seryl-histidine Proteins 0.000 description 1
- 108010026333 seryl-proline Proteins 0.000 description 1
- 230000019491 signal transduction Effects 0.000 description 1
- 238000001228 spectrum Methods 0.000 description 1
- 230000004083 survival effect Effects 0.000 description 1
- 230000008961 swelling Effects 0.000 description 1
- 208000024891 symptom Diseases 0.000 description 1
- 108010031491 threonyl-lysyl-glutamic acid Proteins 0.000 description 1
- 238000013518 transcription Methods 0.000 description 1
- 230000035897 transcription Effects 0.000 description 1
- 230000009466 transformation Effects 0.000 description 1
- 238000013519 translation Methods 0.000 description 1
- 230000001960 triggered effect Effects 0.000 description 1
- 239000012588 trypsin Substances 0.000 description 1
- 108010020532 tyrosyl-proline Proteins 0.000 description 1
- 210000002438 upper gastrointestinal tract Anatomy 0.000 description 1
- 238000011144 upstream manufacturing Methods 0.000 description 1
- 108010015385 valyl-prolyl-proline Proteins 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/195—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from bacteria
- C07K14/32—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from bacteria from Bacillus (G)
- C07K14/325—Bacillus thuringiensis crystal peptides, i.e. delta-endotoxins
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/82—Vectors or expression systems specially adapted for eukaryotic hosts for plant cells, e.g. plant artificial chromosomes (PACs)
- C12N15/8241—Phenotypically and genetically modified plants via recombinant DNA technology
- C12N15/8261—Phenotypically and genetically modified plants via recombinant DNA technology with agronomic (input) traits, e.g. crop yield
- C12N15/8271—Phenotypically and genetically modified plants via recombinant DNA technology with agronomic (input) traits, e.g. crop yield for stress resistance, e.g. heavy metal resistance
- C12N15/8279—Phenotypically and genetically modified plants via recombinant DNA technology with agronomic (input) traits, e.g. crop yield for stress resistance, e.g. heavy metal resistance for biotic stress resistance, pathogen resistance, disease resistance
- C12N15/8286—Phenotypically and genetically modified plants via recombinant DNA technology with agronomic (input) traits, e.g. crop yield for stress resistance, e.g. heavy metal resistance for biotic stress resistance, pathogen resistance, disease resistance for insect resistance
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K2319/00—Fusion polypeptide
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2800/00—Nucleic acids vectors
- C12N2800/22—Vectors comprising a coding region that has been codon optimised for expression in a respective host
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2800/00—Nucleic acids vectors
- C12N2800/80—Vectors containing sites for inducing double-stranded breaks, e.g. meganuclease restriction sites
Landscapes
- Health & Medical Sciences (AREA)
- Genetics & Genomics (AREA)
- Life Sciences & Earth Sciences (AREA)
- Chemical & Material Sciences (AREA)
- Organic Chemistry (AREA)
- Molecular Biology (AREA)
- Engineering & Computer Science (AREA)
- Biochemistry (AREA)
- General Engineering & Computer Science (AREA)
- General Health & Medical Sciences (AREA)
- Zoology (AREA)
- Biophysics (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Biotechnology (AREA)
- Wood Science & Technology (AREA)
- Biomedical Technology (AREA)
- Pest Control & Pesticides (AREA)
- Physics & Mathematics (AREA)
- Cell Biology (AREA)
- Insects & Arthropods (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Plant Pathology (AREA)
- Medicinal Chemistry (AREA)
- Microbiology (AREA)
- Gastroenterology & Hepatology (AREA)
- Crystallography & Structural Chemistry (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
- Peptides Or Proteins (AREA)
Abstract
Description
Technical Field
The present invention relates to the fields of genetic engineering and biological control, and in particular to an insect-resistant fusion gene, its expression vector, and applications thereof.
Background Art
Insect pests cause losses of about US$8 billion per year to global agricultural production. Pest control currently relies mainly on chemical pesticides, but pesticide residues are harmful to human health. Transgenic crops that produce insecticidal toxins have therefore been adopted successfully for insect resistance, and transgenic insect-resistant maize and cotton are now planted on a large scale.
Substances with strong insecticidal activity against pests such as the black cutworm (Agrotis ipsilon), the beet armyworm (Spodoptera exigua) and the fall armyworm (Spodoptera frugiperda) were found in the fermentation supernatant of strain Bt AB88. Because they are expressed from the mid-logarithmic growth phase through the sporulation stage, they were named vegetative insecticidal proteins (Vip) (Estruch, 1996). Bt genes from Bacillus thuringiensis also encode insecticidal crystal proteins: during sporulation the bacterium produces parasporal crystal proteins, the δ-endotoxins, which are highly insecticidal. They act as follows. The insecticidal protein is dissolved by the alkaline gut fluid and hydrolyzed into a smaller active toxin fragment, the core fragment (Höfte and Whiteley, 1989). This active fragment resists further hydrolysis by proteases. The activated protein binds to the brush border vesicles of the insect gut and causes perforation, which disturbs the osmotic balance; the cells swell and lyse, and the target organism stops feeding and eventually dies. Studies have shown that the gut epithelial cells of many target pests carry high-affinity binding sites for Bt proteins (Höfte and Whiteley, 1989).
Compared with the Cry proteins, research on the insecticidal mechanism of the Vip3 proteins has only just begun. Initial mechanistic studies in the susceptible insect Agrotis ipsilon found that 24 h after feeding on Vip3A toxin the epithelial columnar cells had swollen into spheres and the goblet cells had changed slightly in shape, with no obvious cell damage at this stage. After 48 h the columnar cells had deteriorated further, the gut lumen was filled with debris from collapsed cells, and the goblet cells were clearly damaged but still attached to the basement membrane. After 72 h the epithelial layer had detached completely and the insect died, although no obvious cell damage appeared in the foregut or hindgut. The symptoms caused by ingestion of Vip3A in susceptible insects resemble those caused by δ-endotoxins but are delayed in time. In vivo immunolocalization experiments also showed that Vip3A binds specifically to the apical microvilli, mostly on columnar cells, with no signal detected in goblet cells. Columnar cells in the midgut mainly secrete digestive enzymes and absorb digestion products. After Vip3A is activated by proteases in the gut fluid and taken up by the columnar cells, its binding to certain receptor proteins on the cell membrane may perforate the membrane or trigger a signaling pathway, which ultimately kills the insect (Yu C G, Mullins M A et al., 1997).
In recent years the vegetative insecticidal proteins (Vip) have been studied extensively. The Vip proteins discovered so far fall into four classes: Vip1, Vip2, Vip3 and Vip4. Vip1 and Vip2 together form a binary toxin that acts mainly on Coleoptera and Hemiptera and has been studied relatively little (Ellis R T and Stamp L, 2002). Huazhong Agricultural University identified one vip4-type gene, but its insecticidal mode of action and target specificity have not yet been reported.
Saraswathy et al. constructed a chimeric protein of Vip3Aa14 and Cry1Ac. The chimeric protein was present mainly as inclusion bodies and was fairly stable to trypsin, but it retained only the insecticidal activity of Cry1Ac and showed no synergism. In addition, the C-terminus of Cry1C markedly enhances the synthesis of Vip3Aa7 and helps it form inclusion bodies in strain BMB171, but the biological activity is significantly lower than that of the Vip3Aa7 protoxin (Song R et al., 2008). The reduced toxicity of these fusion proteins may be due to incorrect folding of the fusion protein. By contrast, the fusion protein constructed by Dong et al., in which Vip3Aa7 is fused to the N-terminus of Cry9Ca, reached a synergism factor of 4.79 against the diamondback moth (Plutella xylostella), far higher than the synergism obtained by mixing the two protoxins. Yu et al. co-expressed Vip3Aa29 with Cyt2Aa3, and the co-expressed proteins acted synergistically against the beet armyworm (Spodoptera exigua) and the rice striped stem borer (Chilo suppressalis). Fusing or co-expressing Vip3 with insecticidal toxins from different sources can effectively increase its toxicity to target insects, broaden the insecticidal spectrum and delay the development of insect resistance. Vip3 can also be integrated into the genome of an insect virus, which both spreads Vip3A effectively and increases the insecticidal virulence of the virus.
本发明的抗虫融合基因较之单独的Cry1Ab或VIP3A基因可以兼抗玉米螟和草地贪夜蛾。Compared with the single Cry1Ab or VIP3A gene, the anti-insect fusion gene of the present invention can resist both corn borer and Spodoptera frugiperda.
SUMMARY OF THE INVENTION
An object of the present invention is to provide an insect-resistant fusion gene that can be expressed stably and efficiently in plants, and an expression vector thereof.
Another object of the present invention is to provide the use of the insect-resistant fusion gene in improving the insect resistance of transgenic plants.
The objects of the present invention can be further achieved by the following technical measures:
While keeping the amino acid composition of the protein translated from the Vip3A sequence unchanged overall, bases are substituted using codons preferred by monocotyledonous plants to obtain a preliminary modified DNA sequence; AT-rich sequences that cause transcriptional instability in plants and a commonly used restriction site (SacI) present in the DNA sequence are identified and then eliminated by codon replacement; the Cry1Ab-Ma sequence, with its terminator removed, is then placed at the N-terminus of the synthetic Vip3A; the modified fusion gene mCry1AbVip3A is finalized and chemically synthesized, its nucleotide sequence being shown in SEQ ID No. 1; and, as required for cloning, restriction endonuclease recognition sites are added at both ends of the sequence, which is then constructed into the corresponding expression vector.
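The fusion step described above can be sketched in Python. This is a minimal illustration only: it reads "terminator" as the stop codon of the N-terminal partner (an interpretation), the helper name `fuse_in_frame` is hypothetical, and the toy sequences are not the real Cry1Ab-Ma or Vip3A genes.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def fuse_in_frame(n_term_orf: str, c_term_orf: str) -> str:
    """Join two open reading frames so they are read as one coding sequence.

    The stop codon of the N-terminal partner is removed (if present) and the
    C-terminal partner is appended directly, with no linker.
    """
    n_term = n_term_orf.upper()
    c_term = c_term_orf.upper()
    for name, orf in (("N-terminal", n_term), ("C-terminal", c_term)):
        if len(orf) % 3 != 0:
            raise ValueError(f"{name} ORF length is not a multiple of 3")
    if n_term[-3:] in STOP_CODONS:
        n_term = n_term[:-3]
    # An internal stop codon in the N-terminal part would truncate the fusion.
    for i in range(0, len(n_term), 3):
        if n_term[i:i + 3] in STOP_CODONS:
            raise ValueError(f"internal stop codon at nucleotide {i + 1}")
    return n_term + c_term


# Toy sequences for illustration only.
toy_n = "ATGGACAACTAG"  # Met-Asp-Asn-stop
toy_c = "ATGAACAAGTAG"  # Met-Asn-Lys-stop
fusion = fuse_in_frame(toy_n, toy_c)
print(fusion)
```

Checking that both partners are whole numbers of codons before joining them is what keeps the C-terminal partner in the same reading frame as the N-terminal one.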
Operably linking the fusion gene of the present invention to a prokaryotic expression vector allows rapid preliminary testing of the toxicity of the synthetic Bt gene expression product to the corn borer and Spodoptera frugiperda. The fusion gene can also be operably linked to a plant expression vector, the expression vector introduced into a suitable Agrobacterium strain, and genetic transformation carried out by the Agrobacterium-mediated method to breed insect-resistant transgenic maize. Other crops or fruit trees can likewise be genetically transformed to give them insect-resistant activity.
Those skilled in the art can also transform bacteria or fungi with the gene of the present invention, produce the insect-resistant fusion protein by large-scale fermentation, and formulate it as an insecticide for controlling crop pests.
To achieve the object of the present invention, the present invention first provides an insect-resistant fusion gene mCry1AbVip3A, whose nucleotide sequence is as shown in (a), (b) or (c):
(a) the nucleotide sequence shown in SEQ ID NO: 1; or
(b) a nucleotide sequence encoding the amino acid sequence shown in SEQ ID NO: 2; or
(c) a nucleotide sequence capable of hybridizing with the complementary sequence of SEQ ID NO: 1 under stringent hybridization conditions, wherein the protein encoded by the nucleotide sequence has the function of the mCry1AbVip3A transcription factor.
Further, the present invention provides the amino acid sequence encoded by the insect-resistant fusion gene mCry1AbVip3A, as shown in SEQ ID NO: 2.
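The relationship between a coding nucleotide sequence and the amino acid sequence it encodes, as in items (a) and (b), can be illustrated with a short translation sketch. It assumes the standard genetic code; the function name `translate` is illustrative, and only the first 60 nucleotides of SEQ ID NO: 1 are used as input.

```python
# Standard genetic code, bases ordered T, C, A, G at each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding sequence (one-letter code), stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces used in sequence listings
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        residue = CODON_TABLE[seq[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 60 nucleotides of SEQ ID NO: 1 as given in the sequence listing.
first_60_nt = "atggacaaca acccgaacat caacgagtgc atcccctaca actgcctctc gaacccggag"
peptide = translate(first_60_nt)
print(peptide)

# Residues expected from a 4215-nt coding sequence that ends in one stop codon.
expected_residues = 4215 // 3 - 1
print(expected_residues)
```

The 4215 nucleotides of SEQ ID NO: 1 correspond to 1405 codons, that is, 1404 residues plus one stop codon, which agrees with the length of 1404 given for SEQ ID NO: 2.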
Further, the present invention provides a vector containing the insect-resistant fusion gene, preferably a bivalent vector.
Further, the present invention provides a host cell containing the above vector, preferably EHA105.
Further, the present invention provides transformed plant cells containing the insect-resistant fusion gene.
Furthermore, the present invention provides the use of the insect-resistant fusion gene in improving the insect resistance of transgenic plants. Preferably, the plant is a field crop, fruit tree or vegetable, such as maize, rice, potato or cotton.
The present invention has at least the following advantages and beneficial effects:
Compared with Cry1Ab-Ma or Vip3A alone, the insect-resistant fusion protein of the present invention combines the corn borer resistance of Cry1Ab-Ma with the Spodoptera frugiperda resistance of Vip3A.
After the insect-resistant fusion gene is introduced into maize, stably inherited transformants can be obtained. The gene can also be transformed into cotton, rice, vegetables and other crops to give them the corresponding insect-resistant activity, thereby reducing pesticide use and environmental pollution. It therefore has important economic value and broad application prospects.
Description of drawings
Figure 1. Map of the plant expression vector containing the mCry1AbVip3A gene fragment.
Figure 2. Colloidal gold test strip results for mCry1AbVip3A in transgenic maize. On each strip, the upper colored band is the quality control line and the lower band is the target protein test line; the leftmost strip is the negative control and the others are transgenic maize samples.
Detailed description of embodiments
The following examples illustrate the present invention but do not limit its scope. Unless otherwise specified, the technical means used in the examples are conventional means well known to those skilled in the art.
Example 1 Synthesis of the insect-resistant fusion gene
Starting from the original nucleotide sequence of the Vip3A gene (gene ID DQ539888.1) and keeping the amino acid composition of its protein unchanged overall, bases were substituted using codons preferred by monocotyledonous plants to obtain a preliminary modified DNA sequence. AT-rich sequences that cause transcriptional instability in plants (such as ATTTA and AATGAA) and a commonly used restriction site (SacI) present in the DNA sequence were identified and then eliminated by codon replacement. The Cry1Ab-Ma sequence was added at the 5' end. The modified insect-resistant fusion gene mCry1AbVip3A was then finalized and chemically synthesized; its nucleotide sequence is shown in SEQ ID No. 1. As required for cloning, restriction endonuclease recognition sites were added at both ends of the sequence, which was then constructed into the corresponding expression vector.
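The screening step in Example 1 can be sketched as a simple motif scan. The two AT-rich motifs come from the text; GAGCTC is the SacI recognition sequence. The function name `find_motifs` and the toy input are illustrative assumptions, and the sketch only reports positions; choosing synonymous replacement codons is not shown.

```python
import re

DESTABILIZING_MOTIFS = ("ATTTA", "AATGAA")  # examples named in the text
SACI_SITE = "GAGCTC"                        # SacI recognition sequence


def find_motifs(seq: str, motifs) -> dict:
    """Return 1-based start positions of every motif occurrence in seq."""
    seq = seq.upper()
    hits = {}
    for motif in motifs:
        # A lookahead pattern also reports overlapping occurrences.
        hits[motif] = [m.start() + 1 for m in re.finditer(f"(?={motif})", seq)]
    return hits


# Toy sequence for illustration only; it is not part of SEQ ID No. 1.
toy_sequence = "ATGGAGCTCAAATTTAGCAATGAATAA"
report = find_motifs(toy_sequence, DESTABILIZING_MOTIFS + (SACI_SITE,))
for motif, positions in report.items():
    print(motif, positions)
```

Each reported position marks a codon neighbourhood where a synonymous codon could be swapped in so that the motif disappears while the encoded residue stays the same.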
Example 2 Expression of the insect-resistant fusion gene in a prokaryotic system and toxicity testing of the product
To examine the in vitro expression of the modified insect-resistant gene and its toxicity to the corn borer, we constructed the prokaryotic expression vector pET-mCry1AbVip3A.
The following primers were designed. Upstream primers: F1 (SEQ ID No. 3), 5'-GACGACGACAAGGCCATGGATGGACAACAACCCG-3'; F2 (SEQ ID No. 4), 5'-ACGACGACAAGGCCATGGATGAACAAGAACAAC-3'. Downstream primers: R1 (SEQ ID No. 5), 5'-GGTGGTGGTGGTGCTCGAGCTTGATGCTCACGTC-3'; R2 (SEQ ID No. 6), 5'-GGTGGTGGTGGTGCTCGAGGTACTCCGCCTCGAA-3'.
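As a quick consistency check on the primer design, the following sketch looks for the NcoI (CCATGG) and XhoI (CTCGAG) recognition sequences in F1 and R1 and finds the 3' portion of each primer that matches the ends of SEQ ID No. 1. The helper names are illustrative; the two 30-nt gene fragments are copied from the first and last lines of the sequence listing.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def annealing_suffix(primer: str, target: str) -> str:
    """Longest 3' suffix of the primer that occurs in the target strand."""
    primer = primer.upper()
    for start in range(len(primer)):
        if primer[start:] in target:
            return primer[start:]
    return ""


NCOI, XHOI = "CCATGG", "CTCGAG"
F1 = "GACGACGACAAGGCCATGGATGGACAACAACCCG"
R1 = "GGTGGTGGTGGTGCTCGAGCTTGATGCTCACGTC"

# First and last 30 nt of SEQ ID No. 1 (coding strand, 5' to 3').
GENE_5P = "ATGGACAACAACCCGAACATCAACGAGTGC"
GENE_3P = "GTGCACTTCTACGACGTGAGCATCAAGTAG"

f1_match = annealing_suffix(F1, GENE_5P)
# The reverse primer anneals to the coding strand, so it is compared with
# the reverse complement of the gene's 3' end.
r1_match = annealing_suffix(R1, reverse_complement(GENE_3P))

print("NcoI in F1:", NCOI in F1, "| XhoI in R1:", XHOI in R1)
print("F1 gene-specific part:", f1_match)
print("R1 gene-specific part:", r1_match)
```

In this check the gene-specific part of R1 ends immediately upstream of the TAG stop codon, so the stop codon itself is not part of the amplified fragment.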
Using the constructed plant expression plasmid pTF101.1-mCry1AbVip3A (Figure 1) as the template and F1 and R1 as primers, the insect-resistant fusion gene was amplified, and the fragment was recovered and purified with a gel recovery kit (Tiangen Biotech (Beijing) Co., Ltd., DP209). pET30a+ was digested with the restriction endonucleases NcoI and XhoI, and the 4.2 kb fragment was recovered and purified with the gel recovery kit. The recovered fragment was ligated with the digested pET30a+ fragment, and the resulting prokaryotic expression plasmid was named pET-mCry1AbVip3A. Digestion with different restriction endonucleases confirmed that the vector was constructed correctly.
The prokaryotic vector pET-mCry1AbVip3A was transformed into BL21(DE3) competent cells. The transformed Escherichia coli BL21 culture was induced with IPTG, the Bt protein was extracted, and insect bioassays were performed with the corn borer and Spodoptera frugiperda. The specific steps are as follows:
① Inoculate a single colony of Escherichia coli BL21 into 5 ml of LB liquid medium (containing kanamycin) and culture overnight at 37°C.
② Inoculate the culture into 500 ml of LB liquid medium (containing kanamycin) and grow to OD≈0.5.
③ Add IPTG to a final concentration of 0.5 mM and continue shaking for 4 hours to induce expression.
④ Centrifuge at 4000 rpm for 10 min and collect the cells.
⑤ Add 36 ml of lysis buffer (2 mM Tris-HCl; 0.2 mM CaCl2; pH 8.0), resuspend the cells, add lysozyme to a final concentration of 1 mg/ml, and keep on ice for 30 min.
⑥ Disrupt the cells by sonication (10 s on, 5 s off, 40 cycles, in an ice bath), centrifuge at 4000 rpm for 10 min, and collect the supernatant.
⑦ Add the collected supernatant to the diet and feed it to corn borer and Spodoptera frugiperda larvae.
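The reagent amounts implied by steps ③, ⑤ and ⑥ can be worked out with a few lines of arithmetic. The sketch below assumes a 1 M IPTG stock, which is a hypothetical value not given in the protocol, and ignores the small volume change on addition.

```python
def stock_volume_ml(final_conc: float, stock_conc: float, final_volume_ml: float) -> float:
    """Volume of stock needed (C1*V1 = C2*V2); concentrations in the same unit."""
    return final_conc * final_volume_ml / stock_conc


# Step 3: 0.5 mM IPTG in a 500 ml culture.
IPTG_STOCK_MM = 1000.0  # assumption: 1 M stock, not stated in the text
iptg_ml = stock_volume_ml(0.5, IPTG_STOCK_MM, 500.0)

# Step 5: lysozyme at 1 mg/ml in 36 ml of lysis buffer.
lysozyme_mg = 1.0 * 36.0

# Step 6: 40 sonication cycles of 10 s on and 5 s off.
sonication_on_s = 10 * 40
sonication_total_s = (10 + 5) * 40

print(f"IPTG stock to add: {iptg_ml} ml")
print(f"Lysozyme to add: {lysozyme_mg} mg")
print(f"Sonication: {sonication_on_s} s on, {sonication_total_s} s in total")
```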
Toxicity assay against the corn borer: 5 g of artificial diet for the Asian corn borer was weighed into a Petri dish and cut into slices 1-2 mm thick. 1 ml of protein solution (50 ng/µl) was spread evenly over the diet surface and left to soak in for 2 h. Ten corn borer larvae, 24 hours after hatching, were then placed in each dish. The same volume of distilled water was used as the control. Each treatment had 5 replicates. The dishes were kept in an artificial climate incubator at 28°C, with a photoperiod of 16 h:8 h (L:D) and a relative humidity of 80%. The number of surviving larvae was recorded every other day.
Toxicity assay against Spodoptera frugiperda: 5 g of artificial diet was weighed into an insect rearing box and cut into slices 1-2 mm thick. 1 ml of protein solution (50 ng/µl) was spread evenly over the diet surface and left to soak in for 2 h. One Spodoptera frugiperda larva, 24 hours after hatching, was then placed in each box. The same volume of distilled water was used as the control. Each treatment had 30 replicates. The boxes were kept in an artificial climate incubator at 26°C, with a photoperiod of 16 h:8 h (L:D) and a relative humidity of 60%. The number of surviving larvae was recorded daily.
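The protein dose applied in both assays follows from the stated volume, concentration and diet mass. The short calculation below makes the unit conversion explicit; the function name is illustrative.

```python
def protein_per_gram_diet_ug(volume_ml: float, conc_ng_per_ul: float, diet_g: float) -> float:
    """Micrograms of protein applied per gram of diet."""
    total_ng = volume_ml * 1000.0 * conc_ng_per_ul  # 1 ml = 1000 µl
    total_ug = total_ng / 1000.0                    # 1 µg = 1000 ng
    return total_ug / diet_g


dose = protein_per_gram_diet_ug(volume_ml=1.0, conc_ng_per_ul=50.0, diet_g=5.0)
print(f"{dose} µg protein per g diet")

# Larvae per treatment, from the replicate numbers stated in the text.
corn_borer_larvae = 10 * 5   # 10 larvae per dish, 5 replicates
armyworm_larvae = 1 * 30     # 1 larva per box, 30 replicates
print(corn_borer_larvae, armyworm_larvae)
```

Under these figures each 5 g portion of diet carries 50 µg of protein in total, or 10 µg per gram.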
Data analysis: Excel 2007 was used for preliminary data processing and to calculate the survival rates of corn borer and Spodoptera frugiperda larvae at different feeding times; SPSS 23 was used for analysis of variance and significance testing of the corn borer data. By day 6, no corn borer or Spodoptera frugiperda larvae fed on the mCry1AbVip3A protein had survived (Table 1, Table 2), indicating that mCry1AbVip3A has strong insecticidal activity.
Table 1: Toxicity to corn borer larvae
Table 2: Toxicity to Spodoptera frugiperda larvae
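The survival rate calculation described in the data analysis can be sketched as follows. The control counts are hypothetical placeholders, not data from Table 1 or Table 2; the treated counts of zero reflect only the statement that no larvae survived by day 6. Abbott's correction is a common way to adjust for control mortality and is added here as an assumption; the text does not say it was used.

```python
def survival_rate(survivors_per_replicate, larvae_per_replicate: int) -> float:
    """Percentage of larvae still alive, pooled over all replicates."""
    total_start = larvae_per_replicate * len(survivors_per_replicate)
    return 100.0 * sum(survivors_per_replicate) / total_start


def abbott_corrected_mortality(treated_survival: float, control_survival: float) -> float:
    """Abbott's formula: mortality in the treatment corrected for control mortality."""
    if control_survival <= 0:
        raise ValueError("control survival must be greater than zero")
    return 100.0 * (1.0 - treated_survival / control_survival)


# Hypothetical survivor counts, 10 larvae per dish and 5 replicates.
control_counts = [10, 9, 10, 10, 9]  # placeholder values, not measured data
treated_counts = [0, 0, 0, 0, 0]     # no survivors by day 6, as stated in the text

control_pct = survival_rate(control_counts, 10)
treated_pct = survival_rate(treated_counts, 10)
corrected = abbott_corrected_mortality(treated_pct, control_pct)
print(control_pct, treated_pct, corrected)
```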
Example 3 Expression of the mCry1AbVip3A gene in transgenic maize
The insert of pTF101.1-mCry1AbVip3A (Figure 1) was introduced into callus of the recipient maize by Agrobacterium-mediated transformation (strain EHA105), and transgenic maize was obtained after selection with the herbicide bialaphos. The target protein mCry1AbVip3A was detected (Bt-Cry1Ab/1Ac colloidal gold immunoassay) as follows:
① Place about 1 cm² of fresh young leaf tissue in a 1.5 ml Eppendorf tube, add 500 µl of SEB4 sample extraction buffer, and grind the tissue.
② Take out a test strip (Beijing Gennis Biotechnology Co., Ltd., item number STX06200/0050), hold it by the top, and insert the marked end into the centrifuge tube. The quality control line appears within 3-5 minutes; if the sample is positive, the test line also appears.
Expression of mCry1AbVip3A in the leaves of 6 transgenic plants was determined with the colloidal gold test strips. The results showed that the mCry1AbVip3A protein was expressed in transgenic maize leaf tissue (Figure 2).
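The reading of a test strip can be expressed as a small decision rule. Treating a strip without a quality control line as invalid is a general lateral-flow convention assumed here; the text itself only states when the control and test lines appear. The function name is illustrative.

```python
def interpret_strip(control_line: bool, test_line: bool) -> str:
    """Classify one strip from the presence of its two lines."""
    if not control_line:
        # Assumption: without a control line the strip cannot be interpreted.
        return "invalid"
    return "positive" if test_line else "negative"


print(interpret_strip(control_line=True, test_line=True))
print(interpret_strip(control_line=True, test_line=False))
```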
Although the present invention has been described in detail above by way of general description and specific embodiments, it will be obvious to those skilled in the art that modifications or improvements can be made on the basis of the present invention. Such modifications or improvements, made without departing from the spirit of the present invention, fall within the scope of protection claimed by the present invention.
Sequence listing
<110> Institute of Crop Science, Chinese Academy of Agricultural Sciences; Jilin Academy of Agricultural Sciences
<120> Insect-resistant fusion gene mCry1AbVip3A, its expression vector and application
<130> P200143
<141> 2020-04-15
<160> 6
<170> SIPOSequenceListing 1.0
<210> 1
<211> 4215
<212> DNA
<213> Artificial sequence
<400> 1
atggacaaca acccgaacat caacgagtgc atcccctaca actgcctctc gaacccggag 60
gtcgaggtcc tcggaggcga gcggatcgag acggggtata cgccgatcga tatctcgctc 120
tcgctcacgc agttcctcct gtccgaattc gtcccgggcg cgggtttcgt ccttgggctc 180
gtcgacatca tctgggggat cttcgggccg tcccagtggg acgcgttcct cgtccagatc 240
gagcaactca tcaaccagcg gatcgaggaa ttcgcgcgga atcaagcgat cagccggctc 300
gaggggctct ccaacttgta ccagatctac gcggaatcct tccgggaatg ggaggcggac 360
ccgacgaatc cggcgttgag ggaagagatg aggatccagt tcaacgacat gaactccgcg 420
ctcacgacgg cgatcccgct cttcgcggtc cagaactatc aggtcccgct actctcggtc 480
tatgtccagg cggcgaacct ccatttgtcg gtcctccggg acgtgtcggt cttcggtcag 540
cgctgggggt tcgacgcggc gacgatcaac tcgcggtaca atgacctcac gcgcctcatc 600
gggaactaca cggatcacgc ggtccggtgg tacaacacgg ggctcgagcg ggtgtggggc 660
ccggactcca gggactggat ccgctacaac caattccgtc gggagcttac gttgacggtc 720
ctcgatatcg tcagcttgtt ccctaactac gattcgagga cgtatccgat ccggacggtc 780
tcgcagctca cgagggagat ttacacgaac ccggtcctcg agaacttcga tgggtccttc 840
cgggggtccg cgcaggggat cgaggggtcg atccgctccc cgcacctcat ggatatcctc 900
aactcgatca cgatctacac ggacgcgcac cggggggagt actattggtc cgggcaccag 960
atcatggcgt cgccggtggg cttctcgggc ccggagttca cgttcccgct gtacgggacg 1020
atggggaacg cggccccgca gcagcggatc gtcgcgcagc tcgggcaggg cgtgtaccgc 1080
acgctcagca gcacgctcta ccgccgcccg ttcaacatcg ggatcaacaa ccaacagctc 1140
tcggtcctcg atgggacgga gttcgcgtac gggacgagca gcaacctccc gtcggcggtc 1200
taccggaagt cagggacggt cgactcgctc gatgagatcc cgccgcagaa caataacgtc 1260
ccgccgcggc aggggttctc gcaccggctc tcgcacgtct cgatgttccg gtcggggttc 1320
tcgaactcgt cggtctcgat catccgcgcg ccgatgttct cgtggatcca ccggtcggcg 1380
gaattcaaca acatcattcc gtcgtcgcag atcacgcaga tcccgctcac gaagtcgacg 1440
aacctcgggt cggggacgtc ggtcgtcaag gggccggggt tcacgggtgg ggacatcctc 1500
cggcggacga gcccggggca gatctcgaca ttgcgggtca acatcacggc gccgctctcc 1560
cagcgctacc gggtgcgaat ccggtacgcg tcgacgacga acctccagtt ccacacgtcg 1620
atcgacggtc ggccgatcaa ccagggaaac ttctcggcga cgatgtcctc ggggtcgaac 1680
ctccagtcgg gttcgttccg gacggtaggc ttcacgacgc cgttcaactt ctccaacggc 1740
tcgtcggtct tcacgctctc ggcgcacgtc ttcaactccg ggaacgaggt ctacatcgac 1800
aggatcgagt tcgtcccggc ggaggtcacg ttcgaggcgg agtacatgaa caagaacaac 1860
accaagctga gcacccgcgc cctgccgagc ttcatcgact acttcaacgg catctacggc 1920
ttcgccaccg gcatcaagga catcatgaac atgatcttca agaccgacac cggcggcgac 1980
ctgaccctgg acgagatcct gaagaaccag cagctgctga acgacatcag cggcaagctg 2040
gacggcgtga acggcagcct gaacgacctg atcgcccagg gcaacctgaa caccgagctg 2100
agcaaggaga tcctcaagat cgccaacgag cagaaccagg tgctgaacga cgtgaacaac 2160
aagctggacg ccatcaacac catgctgcgc gtgtacctgc cgaagatcac cagcatgctg 2220
agcgacgtga tcaagcagaa ctacgccctg agcctgcaga tcgagtacct gagcaagcag 2280
ctgcaggaga tcagcgacaa gctggacatc atcaacgtga acgtcctgat caacagcacc 2340
ctgaccgaga tcaccccggc ctaccagcgc atcaagtacg tgaacgagaa gttcgaagag 2400
ctgaccttcg ccaccgagac cagcagcaag gtgaagaagg acggcagccc ggccgacatc 2460
ctggacgagc tgaccgagct gaccgagctg gcgaagagcg tgaccaagaa cgacgtggac 2520
ggcttcgagt tctacctgaa caccttccac gacgtgatgg tgggcaacaa cctgttcggc 2580
cgcagcgccc tgaagaccgc cagcgagctg atcaccaagg agaacgtgaa gaccagcggc 2640
agcgaggtgg gcaacgtgta caacttcctg atcgtgctga ccgccctgca ggcccaggcc 2700
ttcctgaccc tgaccacctg ccgcaagctg ctgggcctgg ccgacatcga ctacaccagc 2760
atcatgaacg agcacctgaa caaggagaag gaggagttcc gcgtgaacat cctgccgacc 2820
ctgagcaaca ccttcagcaa cccgaactac gccaaggtga agggcagcga cgaggacgcc 2880
aagatgatcg tggaggccaa gccgggccac gcgctgatcg gcttcgagat cagcaacgac 2940
agcatcaccg tgctgaaggt gtacgaggcc aagctgaagc agaactacca ggtggacaag 3000
gacagcctga gcgaggtgat ctacggcgac atggacaagc tgctgtgccc ggaccagagc 3060
gagcagatct actacaccaa caacatcgtg ttcccgaacg agtacgtgat caccaagatc 3120
gacttcacca agaagatgaa gaccctgcgc tacgaggtga ccgccaactt ctacgacagc 3180
agcaccggcg agatcgacct gaacaagaag aaggtggaga gcagcgaggc cgagtaccgc 3240
accctgagcg cgaacgacga cggcgtctac atgccactgg gcgtgatcag cgagaccttc 3300
ctgaccccga tcaacggctt cggcctgcag gccgacgaga acagccgcct gatcaccctg 3360
acctgtaaga gctacctgcg cgagctgctg ctagccaccg acctgagcaa caaggagacc 3420
aagctgatcg tgccaccgag cggcttcatc agcaacatcg tggagaacgg cagcatcgag 3480
gaggacaacc tggagccgtg gaaggccaac aacaagaacg cctacgtcga ccacaccggc 3540
ggcgtgaacg gcaccaaggc cctgtacgtg cacaaggacg gcggcatcag ccagttcatc 3600
ggcgacaagc tgaagccgaa gaccgagtac gtgatccagt acaccgtgaa gggcaagcca 3660
tcgatccacc tgaaggacga gaacaccggc tacatccact acgaggacac caacaacaac 3720
ctggaggact accagaccat caacaagcgc ttcaccaccg gcaccgacct gaagggcgtg 3780
tacctgatcc tgaagagcca gaacggcgac gaggcctggg gcgacaactt catcatcctg 3840
gagatcagcc cgagcgagaa gctgctgagc ccggagctga tcaacaccaa caactggacc 3900
agcaccggca gcaccaacat cagcggcaac accctgaccc tgtaccaggg cggccgcggc 3960
atcctgaagc agaacctgca gctggacagc ttcagcacct accgcgtgta cttcagcgtg 4020
agcggcgacg ccaacgtgcg catccgcaac tcccgcgagg tgctgttcga gaagaggtac 4080
atgagcggcg ccaaggacgt gagcgagatg ttcaccacca agttcgagaa ggacaacttc 4140
tacatcgagc tgagccaggg caacaacctg tacggcggcc cgatcgtgca cttctacgac 4200
gtgagcatca agtag 4215
<210> 2
<211> 1404
<212> PRT
<213> Artificial sequence
<400> 2
Met Asp Asn Asn Pro Asn Ile Asn Glu Cys Ile Pro Tyr Asn Cys Leu
1 5 10 15
Ser Asn Pro Glu Val Glu Val Leu Gly Gly Glu Arg Ile Glu Thr Gly
20 25 30
Tyr Thr Pro Ile Asp Ile Ser Leu Ser Leu Thr Gln Phe Leu Leu Ser
35 40 45
Glu Phe Val Pro Gly Ala Gly Phe Val Leu Gly Leu Val Asp Ile Ile
50 55 60
Trp Gly Ile Phe Gly Pro Ser Gln Trp Asp Ala Phe Leu Val Gln Ile
65 70 75 80
Glu Gln Leu Ile Asn Gln Arg Ile Glu Glu Phe Ala Arg Asn Gln Ala
85 90 95
Ile Ser Arg Leu Glu Gly Leu Ser Asn Leu Tyr Gln Ile Tyr Ala Glu
100 105 110
Ser Phe Arg Glu Trp Glu Ala Asp Pro Thr Asn Pro Ala Leu Arg Glu
115 120 125
Glu Met Arg Ile Gln Phe Asn Asp Met Asn Ser Ala Leu Thr Thr Ala
130 135 140
Ile Pro Leu Phe Ala Val Gln Asn Tyr Gln Val Pro Leu Leu Ser Val
145 150 155 160
Tyr Val Gln Ala Ala Asn Leu His Leu Ser Val Leu Arg Asp Val Ser
165 170 175
Val Phe Gly Gln Arg Trp Gly Phe Asp Ala Ala Thr Ile Asn Ser Arg
180 185 190
Tyr Asn Asp Leu Thr Arg Leu Ile Gly Asn Tyr Thr Asp His Ala Val
195 200 205
Arg Trp Tyr Asn Thr Gly Leu Glu Arg Val Trp Gly Pro Asp Ser Arg
210 215 220
Asp Trp Ile Arg Tyr Asn Gln Phe Arg Arg Glu Leu Thr Leu Thr Val
225 230 235 240
Leu Asp Ile Val Ser Leu Phe Pro Asn Tyr Asp Ser Arg Thr Tyr Pro
245 250 255
Ile Arg Thr Val Ser Gln Leu Thr Arg Glu Ile Tyr Thr Asn Pro Val
260 265 270
Leu Glu Asn Phe Asp Gly Ser Phe Arg Gly Ser Ala Gln Gly Ile Glu
275 280 285
Gly Ser Ile Arg Ser Pro His Leu Met Asp Ile Leu Asn Ser Ile Thr
290 295 300
Ile Tyr Thr Asp Ala His Arg Gly Glu Tyr Tyr Trp Ser Gly His Gln
305 310 315 320
Ile Met Ala Ser Pro Val Gly Phe Ser Gly Pro Glu Phe Thr Phe Pro
325 330 335
Leu Tyr Gly Thr Met Gly Asn Ala Ala Pro Gln Gln Arg Ile Val Ala
340 345 350
Gln Leu Gly Gln Gly Val Tyr Arg Thr Leu Ser Ser Thr Leu Tyr Arg
355 360 365
Arg Pro Phe Asn Ile Gly Ile Asn Asn Gln Gln Leu Ser Val Leu Asp
370 375 380
Gly Thr Glu Phe Ala Tyr Gly Thr Ser Ser Asn Leu Pro Ser Ala Val
385 390 395 400
Tyr Arg Lys Ser Gly Thr Val Asp Ser Leu Asp Glu Ile Pro Pro Gln
405 410 415
Asn Asn Asn Val Pro Pro Arg Gln Gly Phe Ser His Arg Leu Ser His
420 425 430
Val Ser Met Phe Arg Ser Gly Phe Ser Asn Ser Ser Val Ser Ile Ile
435 440 445
Arg Ala Pro Met Phe Ser Trp Ile His Arg Ser Ala Glu Phe Asn Asn
450 455 460
Ile Ile Pro Ser Ser Gln Ile Thr Gln Ile Pro Leu Thr Lys Ser Thr
465 470 475 480
Asn Leu Gly Ser Gly Thr Ser Val Val Lys Gly Pro Gly Phe Thr Gly
485 490 495
Gly Asp Ile Leu Arg Arg Thr Ser Pro Gly Gln Ile Ser Thr Leu Arg
500 505 510
Val Asn Ile Thr Ala Pro Leu Ser Gln Arg Tyr Arg Val Arg Ile Arg
515 520 525
Tyr Ala Ser Thr Thr Asn Leu Gln Phe His Thr Ser Ile Asp Gly Arg
530 535 540
Pro Ile Asn Gln Gly Asn Phe Ser Ala Thr Met Ser Ser Gly Ser Asn
545 550 555 560
Leu Gln Ser Gly Ser Phe Arg Thr Val Gly Phe Thr Thr Pro Phe Asn
565 570 575
Phe Ser Asn Gly Ser Ser Val Phe Thr Leu Ser Ala His Val Phe Asn
580 585 590
Ser Gly Asn Glu Val Tyr Ile Asp Arg Ile Glu Phe Val Pro Ala Glu
595 600 605
Val Thr Phe Glu Ala Glu Tyr Met Asn Lys Asn Asn Thr Lys Leu Ser
610 615 620
Thr Arg Ala Leu Pro Ser Phe Ile Asp Tyr Phe Asn Gly Ile Tyr Gly
625 630 635 640
Phe Ala Thr Gly Ile Lys Asp Ile Met Asn Met Ile Phe Lys Thr Asp
645 650 655
Thr Gly Gly Asp Leu Thr Leu Asp Glu Ile Leu Lys Asn Gln Gln Leu
660 665 670
Leu Asn Asp Ile Ser Gly Lys Leu Asp Gly Val Asn Gly Ser Leu Asn
675 680 685
Asp Leu Ile Ala Gln Gly Asn Leu Asn Thr Glu Leu Ser Lys Glu Ile
690 695 700
Leu Lys Ile Ala Asn Glu Gln Asn Gln Val Leu Asn Asp Val Asn Asn
705 710 715 720
Lys Leu Asp Ala Ile Asn Thr Met Leu Arg Val Tyr Leu Pro Lys Ile
725 730 735
Thr Ser Met Leu Ser Asp Val Ile Lys Gln Asn Tyr Ala Leu Ser Leu
740 745 750
Gln Ile Glu Tyr Leu Ser Lys Gln Leu Gln Glu Ile Ser Asp Lys Leu
755 760 765
Asp Ile Ile Asn Val Asn Val Leu Ile Asn Ser Thr Leu Thr Glu Ile
770 775 780
Thr Pro Ala Tyr Gln Arg Ile Lys Tyr Val Asn Glu Lys Phe Glu Glu
785 790 795 800
Leu Thr Phe Ala Thr Glu Thr Ser Ser Lys Val Lys Lys Asp Gly Ser
805 810 815
Pro Ala Asp Ile Leu Asp Glu Leu Thr Glu Leu Thr Glu Leu Ala Lys
820 825 830
Ser Val Thr Lys Asn Asp Val Asp Gly Phe Glu Phe Tyr Leu Asn Thr
835 840 845
Phe His Asp Val Met Val Gly Asn Asn Leu Phe Gly Arg Ser Ala LeuPhe His Asp Val Met Val Gly Asn Asn Leu Phe Gly Arg Ser Ala Leu
850 855 860 850 855 860
Lys Thr Ala Ser Glu Leu Ile Thr Lys Glu Asn Val Lys Thr Ser GlyLys Thr Ala Ser Glu Leu Ile Thr Lys Glu Asn Val Lys Thr Ser Gly
865 870 875 880865 870 875 880
Ser Glu Val Gly Asn Val Tyr Asn Phe Leu Ile Val Leu Thr Ala LeuSer Glu Val Gly Asn Val Tyr Asn Phe Leu Ile Val Leu Thr Ala Leu
885 890 895 885 890 895
Gln Ala Gln Ala Phe Leu Thr Leu Thr Thr Cys Arg Lys Leu Leu GlyGln Ala Gln Ala Phe Leu Thr Leu Thr Thr Cys Arg Lys Leu Leu Gly
900 905 910 900 905 910
Leu Ala Asp Ile Asp Tyr Thr Ser Ile Met Asn Glu His Leu Asn Lys
915 920 925
Glu Lys Glu Glu Phe Arg Val Asn Ile Leu Pro Thr Leu Ser Asn Thr
930 935 940
Phe Ser Asn Pro Asn Tyr Ala Lys Val Lys Gly Ser Asp Glu Asp Ala
945 950 955 960
Lys Met Ile Val Glu Ala Lys Pro Gly His Ala Leu Ile Gly Phe Glu
965 970 975
Ile Ser Asn Asp Ser Ile Thr Val Leu Lys Val Tyr Glu Ala Lys Leu
980 985 990
Lys Gln Asn Tyr Gln Val Asp Lys Asp Ser Leu Ser Glu Val Ile Tyr
995 1000 1005
Gly Asp Met Asp Lys Leu Leu Cys Pro Asp Gln Ser Glu Gln Ile Tyr
1010 1015 1020
Tyr Thr Asn Asn Ile Val Phe Pro Asn Glu Tyr Val Ile Thr Lys Ile
1025 1030 1035 1040
Asp Phe Thr Lys Lys Met Lys Thr Leu Arg Tyr Glu Val Thr Ala Asn
1045 1050 1055
Phe Tyr Asp Ser Ser Thr Gly Glu Ile Asp Leu Asn Lys Lys Lys Val
1060 1065 1070
Glu Ser Ser Glu Ala Glu Tyr Arg Thr Leu Ser Ala Asn Asp Asp Gly
1075 1080 1085
Val Tyr Met Pro Leu Gly Val Ile Ser Glu Thr Phe Leu Thr Pro Ile
1090 1095 1100
Asn Gly Phe Gly Leu Gln Ala Asp Glu Asn Ser Arg Leu Ile Thr Leu
1105 1110 1115 1120
Thr Cys Lys Ser Tyr Leu Arg Glu Leu Leu Leu Ala Thr Asp Leu Ser
1125 1130 1135
Asn Lys Glu Thr Lys Leu Ile Val Pro Pro Ser Gly Phe Ile Ser Asn
1140 1145 1150
Ile Val Glu Asn Gly Ser Ile Glu Glu Asp Asn Leu Glu Pro Trp Lys
1155 1160 1165
Ala Asn Asn Lys Asn Ala Tyr Val Asp His Thr Gly Gly Val Asn Gly
1170 1175 1180
Thr Lys Ala Leu Tyr Val His Lys Asp Gly Gly Ile Ser Gln Phe Ile
1185 1190 1195 1200
Gly Asp Lys Leu Lys Pro Lys Thr Glu Tyr Val Ile Gln Tyr Thr Val
1205 1210 1215
Lys Gly Lys Pro Ser Ile His Leu Lys Asp Glu Asn Thr Gly Tyr Ile
1220 1225 1230
His Tyr Glu Asp Thr Asn Asn Asn Leu Glu Asp Tyr Gln Thr Ile Asn
1235 1240 1245
Lys Arg Phe Thr Thr Gly Thr Asp Leu Lys Gly Val Tyr Leu Ile Leu
1250 1255 1260
Lys Ser Gln Asn Gly Asp Glu Ala Trp Gly Asp Asn Phe Ile Ile Leu
1265 1270 1275 1280
Glu Ile Ser Pro Ser Glu Lys Leu Leu Ser Pro Glu Leu Ile Asn Thr
1285 1290 1295
Asn Asn Trp Thr Ser Thr Gly Ser Thr Asn Ile Ser Gly Asn Thr Leu
1300 1305 1310
Thr Leu Tyr Gln Gly Gly Arg Gly Ile Leu Lys Gln Asn Leu Gln Leu
1315 1320 1325
Asp Ser Phe Ser Thr Tyr Arg Val Tyr Phe Ser Val Ser Gly Asp Ala
1330 1335 1340
Asn Val Arg Ile Arg Asn Ser Arg Glu Val Leu Phe Glu Lys Arg Tyr
1345 1350 1355 1360
Met Ser Gly Ala Lys Asp Val Ser Glu Met Phe Thr Thr Lys Phe Glu
1365 1370 1375
Lys Asp Asn Phe Tyr Ile Glu Leu Ser Gln Gly Asn Asn Leu Tyr Gly
1380 1385 1390
Gly Pro Ile Val His Phe Tyr Asp Val Ser Ile Lys
1395 1400
<210> 3
<211> 34
<212> DNA
<213> Artificial sequence
<400> 3
gacgacgaca aggccatgga tggacaacaa cccg 34
<210> 4
<211> 33
<212> DNA
<213> Artificial sequence
<400> 4
acgacgacaa ggccatggat gaacaagaac aac 33
<210> 5
<211> 34
<212> DNA
<213> Artificial sequence
<400> 5
ggtggtggtg gtgctcgagc ttgatgctca cgtc 34
<210> 6
<211> 34
<212> DNA
<213> Artificial sequence
<400> 6
ggtggtggtg gtgctcgagg tactccgcct cgaa 34
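The four artificial DNA records above (SEQ ID NO: 3 to 6) each declare a length in the `<211>` field, so they can be sanity-checked mechanically. The following is a minimal Python sketch, not part of the patent: it strips the block spacing, compares the base count with the declared length, and looks for the NcoI (`ccatgg`) and XhoI (`ctcgag`) recognition sequences. The helper names `normalize` and `check_primer` are hypothetical, and the idea that these oligonucleotides were designed around those two restriction sites is an assumption; the listing itself does not say so.

```python
# Records transcribed from SEQ ID NO: 3-6 of the listing:
# (declared length from the <211> field, sequence from the <400> field).
PRIMERS = {
    3: (34, "gacgacgaca aggccatgga tggacaacaa cccg"),
    4: (33, "acgacgacaa ggccatggat gaacaagaac aac"),
    5: (34, "ggtggtggtg gtgctcgagc ttgatgctca cgtc"),
    6: (34, "ggtggtggtg gtgctcgagg tactccgcct cgaa"),
}

# Recognition sequences of two common restriction enzymes.
# ASSUMPTION: the listing does not state which sites, if any, were intended.
SITES = {"NcoI": "ccatgg", "XhoI": "ctcgag"}


def normalize(seq):
    """Remove the 10-base block spacing and lowercase the sequence."""
    return "".join(seq.split()).lower()


def check_primer(declared_len, seq):
    """Compare a record against its declared length and scan for known sites."""
    bases = normalize(seq)
    return {
        "length_ok": len(bases) == declared_len,
        "valid_bases": set(bases) <= set("acgt"),
        "sites": [name for name, site in SITES.items() if site in bases],
    }


if __name__ == "__main__":
    for seq_id, (declared_len, seq) in PRIMERS.items():
        print(f"SEQ ID NO: {seq_id}", check_primer(declared_len, seq))
```

Under these assumptions, all four records match their declared lengths; SEQ ID NO: 3 and 4 contain the NcoI motif, and SEQ ID NO: 5 and 6 contain the XhoI motif.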
Claims (10)
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202010119623 | 2020-02-26 | ||
CN2020101196236 | 2020-02-26 | ||
Publications (1)
Publication Number | Publication Date |
---|---|
CN111440814A (en) | 2020-07-24 |
Family
ID=71648217
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202010294835.8A Pending CN111440814A (en) | 2020-02-26 | 2020-04-15 | Insect-resistant fusion gene mCry1AbVip3A, expression vector and application thereof |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN111440814A (en) |
Citations (17)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2006012366A2 (en) * | 2004-07-20 | 2006-02-02 | Phyllom Llc | Methods for making and using recombinant bacillus thuringiensis spores |
CN1818067A (en) * | 2006-02-27 | 2006-08-16 | 浙江大学 | Zoophobous fusion protein and use thereof |
CN102094030A (en) * | 2010-11-30 | 2011-06-15 | 中国农业科学院作物科学研究所 | Pesticidal protein encoding gene Cry1Ab-Ma and expression vector and application thereof |
CN102584961A (en) * | 2012-02-28 | 2012-07-18 | 中国农业科学院作物科学研究所 | Anti-insect protein Cry1A.401 and expression vector and application thereof |
CN102633868A (en) * | 2012-02-28 | 2012-08-15 | 中国农业科学院作物科学研究所 | Insecticidal protein Cryl A. 301, expression vector and application thereof |
CN103421816A (en) * | 2012-12-05 | 2013-12-04 | 北京大北农科技集团股份有限公司 | Insecticidal gene and application thereof |
CN104725516A (en) * | 2015-03-24 | 2015-06-24 | 吉林省农业科学院 | Novel recombinant insect resistant protein as well as preparation method and application thereof |
CN105483238A (en) * | 2015-12-26 | 2016-04-13 | 吉林省农业科学院 | LAMP (Loop-mediated isothermal amplification) detection primer group and kit and detection method for gene vip3A of transgenic plants |
CN105543360A (en) * | 2016-01-06 | 2016-05-04 | 天津出入境检验检疫局动植物与食品检测中心 | Real-time fluorescence detection method for Vip3A gene detection and kit adopted by same |
CN105624177A (en) * | 2016-02-04 | 2016-06-01 | 浙江大学 | Insect-fusion-resistant gene, coding protein, carrier and application thereof |
CN106047907A (en) * | 2016-02-15 | 2016-10-26 | 达尼亚·贾德·库雷西·索·贾德·萨利姆·库雷西 | A polynucleotide of a herbicide-resistant insect-resistant plant, an expression cassette comprising the polynucleotide, and applications thereof |
CN107603990A (en) * | 2006-06-03 | 2018-01-19 | 先正达参股股份有限公司 | Corn event mir 162 |
CN107723303A (en) * | 2017-09-29 | 2018-02-23 | 杭州瑞丰生物科技有限公司 | Insect-resistant fusion gene, encoding proteins, carrier and its application |
CN109536490A (en) * | 2018-11-09 | 2019-03-29 | 中国农业科学院作物科学研究所 | Transgenic pest-resistant herbicide-resistant corn C M8101 external source Insert Fragment flanking sequence and its application |
CN110903361A (en) * | 2019-12-24 | 2020-03-24 | 隆平生物技术(海南)有限公司 | Plant insect-resistant gene mVip3Aa, and vector and application thereof |
CN111088265A (en) * | 2019-12-16 | 2020-05-01 | 广州小草农业科技有限公司 | Insect-resistant fusion gene, encoding protein thereof, expression vector thereof and application thereof |
CN111315218A (en) * | 2019-08-09 | 2020-06-19 | 北京大北农生物技术有限公司 | Use of insecticidal proteins |
Non-Patent Citations (4)
Title |
---|
WEN-BO CHEN et al.: "Transgenic cotton coexpressing Vip3A and Cry1Ac has a broad insecticidal spectrum against lepidopteran pests", JOURNAL OF INVERTEBRATE PATHOLOGY * |
X. CHANG et al.: "Efficacy Evaluation of Two Transgenic Maize Events Expressing Fused Proteins to Cry1Ab-Susceptible and -Resistant Ostrinia furnacalis (Lepidoptera: Crambidae)", JOURNAL OF ECONOMIC ENTOMOLOGY * |
ZEYU WANG et al.: "Specific binding between Bacillus thuringiensis Cry9Aa and Vip3Aa toxins synergizes their toxicity against Asiatic rice borer (Chilo suppressalis)", MICROBIOLOGY * |
LI WANGSHU et al.: "Analysis of the insect-resistance efficiency of the fusion protein mCry1AbVip3A against Asian corn borer and fall armyworm", JOURNAL OF MAIZE SCIENCES * |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN114107344A (en) * | 2021-11-15 | 2022-03-01 | 山东省农业科学院 | Insect-resistant fusion gene M2CryAb-VIP3A, expression vector and product thereof, and application thereof |
CN114107344B (en) * | 2021-11-15 | 2023-07-28 | 山东省农业科学院 | Insect-resistant fusion gene M2CryAb-VIP3A, expression vector, product and application thereof |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US11060103B2 (en) | Genes encoding insecticidal proteins | |
Weng et al. | Transgenic sugarcane plants expressing high levels of modified cry1Ac provide effective control against stem borers in field trials | |
AU2004267355B2 (en) | Insecticidal proteins secreted from Bacillus thuringiensis and uses therefor | |
CN100427600C (en) | Anti-insect fusion gene, fusion protein and application thereof | |
Weng et al. | Regeneration of sugarcane elite breeding lines and engineering of stem borer resistance | |
RU2613778C2 (en) | Insecticidal proteins | |
CN102094030B (en) | Pesticidal protein encoding gene Cry1Ab-Ma and expression vector and application thereof | |
CN106832001B (en) | A kind of insecticidal fusion protein, encoding gene and application thereof | |
EP2360179A1 (en) | Novel bacillus thuringiensis insecticidal proteins | |
CN102584961B (en) | Anti-insect protein Cry1A.401 and expression vector and application thereof | |
CN111041036B (en) | Coding insecticidal protein insect-resistant fusion gene mCryAb-VIP3A, expression vector and application thereof | |
CN108892721A (en) | Pteromalus Puparum L venom Kazal-type serpin PpSPI20 albumen and application | |
CN111440814A (en) | Insect-resistant fusion gene mCry1AbVip3A, expression vector and application thereof | |
EP3439461A1 (en) | Insecticidal cry toxins | |
CN102633868B (en) | Insecticidal protein Cryl A. 301, expression vector and application thereof | |
EP1490397B1 (en) | Novel bacillus thuringiensis insecticidal proteins | |
CN100389124C (en) | Non-induced expression genetic engineering strain and its construction method and application | |
CN111995690B (en) | Artificially synthesized insect-resistant protein mCry1Ia2 and preparation method and application thereof | |
CN103215290A (en) | Insect-resistant fusion gene as well as insect-resistant fusion protein and application of insect-resistant fusion gene and insect-resistant fusion protein | |
CN102260348A (en) | Pteromalus puparum venom serine protease inhibitor Pp-PI polypeptide and application thereof | |
JP5054267B2 (en) | New toxin | |
CN108822210B (en) | Chrysalid pteromalid venom Kazal-type serine protease inhibitor PpSPI24 protein and application | |
CN102899304A (en) | Protein having phytopathogen killing activity, and coding gene and application thereof | |
AU2012258422B2 (en) | Novel genes encoding insecticidal proteins | |
CA2924415A1 (en) | Novel genes encoding insecticidal proteins |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
RJ01 | Rejection of invention patent application after publication | Application publication date: 20200724 |