CN115894642A - 果型控制基因SlGT-2及其同源基因与应用 - Google Patents
果型控制基因SlGT-2及其同源基因与应用 Download PDFInfo
- Publication number
- CN115894642A CN115894642A CN202110955324.0A CN202110955324A CN115894642A CN 115894642 A CN115894642 A CN 115894642A CN 202110955324 A CN202110955324 A CN 202110955324A CN 115894642 A CN115894642 A CN 115894642A
- Authority
- CN
- China
- Prior art keywords
- sequence
- gene
- slgt
- slgtl1
- protein
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 108090000623 proteins and genes Proteins 0.000 title claims abstract description 177
- 235000013399 edible fruits Nutrition 0.000 title claims abstract description 57
- 102000004169 proteins and genes Human genes 0.000 claims abstract description 66
- 235000007688 Lycopersicon esculentum Nutrition 0.000 claims abstract description 49
- 108091033409 CRISPR Proteins 0.000 claims abstract description 25
- 238000010354 CRISPR gene editing Methods 0.000 claims abstract description 11
- 230000001105 regulatory effect Effects 0.000 claims abstract description 10
- 239000000463 material Substances 0.000 claims abstract description 8
- 241000227653 Lycopersicon Species 0.000 claims abstract 4
- 241000196324 Embryophyta Species 0.000 claims description 84
- 240000003768 Solanum lycopersicum Species 0.000 claims description 46
- 108020004414 DNA Proteins 0.000 claims description 29
- 238000000034 method Methods 0.000 claims description 22
- 239000002773 nucleotide Substances 0.000 claims description 22
- 125000003729 nucleotide group Chemical group 0.000 claims description 22
- 125000003275 alpha amino acid group Chemical group 0.000 claims description 18
- 108091026890 Coding region Proteins 0.000 claims description 17
- 101150050863 T gene Proteins 0.000 claims description 16
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 claims description 15
- 102000053602 DNA Human genes 0.000 claims description 13
- 108091033380 Coding strand Proteins 0.000 claims description 12
- 101150070093 AG gene Proteins 0.000 claims description 10
- 239000003153 chemical reaction reagent Substances 0.000 claims description 8
- 239000003795 chemical substances by application Substances 0.000 claims description 8
- 239000013612 plasmid Substances 0.000 claims description 8
- 108091027544 Subgenomic mRNA Proteins 0.000 claims description 7
- 238000010362 genome editing Methods 0.000 claims description 7
- 239000001814 pectin Substances 0.000 claims description 7
- 229920001277 pectin Polymers 0.000 claims description 7
- 235000010987 pectin Nutrition 0.000 claims description 7
- 125000000539 amino acid group Chemical group 0.000 claims description 6
- 239000002299 complementary DNA Substances 0.000 claims description 5
- 108010002537 Fruit Proteins Proteins 0.000 claims description 4
- 238000012217 deletion Methods 0.000 claims description 4
- 230000037430 deletion Effects 0.000 claims description 4
- 108020001507 fusion proteins Proteins 0.000 claims description 4
- 102000037865 fusion proteins Human genes 0.000 claims description 4
- 238000006467 substitution reaction Methods 0.000 claims description 4
- 108091008042 inhibitory receptors Proteins 0.000 claims description 3
- 241000208292 Solanaceae Species 0.000 claims description 2
- 239000004480 active ingredient Substances 0.000 claims description 2
- 230000002401 inhibitory effect Effects 0.000 claims description 2
- 230000001276 controlling effect Effects 0.000 claims 1
- 238000005516 engineering process Methods 0.000 abstract description 4
- 231100000221 frame shift mutation induction Toxicity 0.000 abstract description 4
- 230000037433 frameshift Effects 0.000 abstract description 4
- 230000035772 mutation Effects 0.000 description 11
- 230000008685 targeting Effects 0.000 description 10
- 239000013598 vector Substances 0.000 description 9
- 108091029865 Exogenous DNA Proteins 0.000 description 7
- SBVPYBFMIGDIDX-SRVKXCTJSA-N Pro-Pro-Pro Chemical compound OC(=O)[C@@H]1CCCN1C(=O)[C@H]1N(C(=O)[C@H]2NCCC2)CCC1 SBVPYBFMIGDIDX-SRVKXCTJSA-N 0.000 description 7
- 239000012634 fragment Substances 0.000 description 7
- XKUKSGPZAADMRA-UHFFFAOYSA-N glycyl-glycyl-glycine Chemical compound NCC(=O)NCC(=O)NCC(O)=O XKUKSGPZAADMRA-UHFFFAOYSA-N 0.000 description 7
- 108010087924 alanylproline Proteins 0.000 description 6
- 238000002360 preparation method Methods 0.000 description 6
- 241000589158 Agrobacterium Species 0.000 description 5
- 238000012408 PCR amplification Methods 0.000 description 5
- CZMRCDWAGMRECN-UGDNZRGBSA-N Sucrose Chemical compound O[C@H]1[C@H](O)[C@@H](CO)O[C@@]1(CO)O[C@@H]1[C@H](O)[C@@H](O)[C@H](O)[C@@H](CO)O1 CZMRCDWAGMRECN-UGDNZRGBSA-N 0.000 description 5
- 229930006000 Sucrose Natural products 0.000 description 5
- 239000013604 expression vector Substances 0.000 description 5
- 150000003839 salts Chemical class 0.000 description 5
- 238000012216 screening Methods 0.000 description 5
- 239000005720 sucrose Substances 0.000 description 5
- 229920001817 Agar Polymers 0.000 description 4
- 108020004705 Codon Proteins 0.000 description 4
- HZVXPUHLTZRQEL-UWVGGRQHSA-N Met-Leu-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O HZVXPUHLTZRQEL-UWVGGRQHSA-N 0.000 description 4
- COYHRQWNJDJCNA-NUJDXYNKSA-N Thr-Thr-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O COYHRQWNJDJCNA-NUJDXYNKSA-N 0.000 description 4
- 239000008272 agar Substances 0.000 description 4
- 210000000349 chromosome Anatomy 0.000 description 4
- 238000002474 experimental method Methods 0.000 description 4
- OVBPIULPVIDEAO-LBPRGKRZSA-N folic acid Chemical compound C=1N=C2NC(N)=NC(=O)C2=NC=1CNC1=CC=C(C(=O)N[C@@H](CCC(O)=O)C(O)=O)C=C1 OVBPIULPVIDEAO-LBPRGKRZSA-N 0.000 description 4
- 239000001963 growth medium Substances 0.000 description 4
- 108010077112 prolyl-proline Proteins 0.000 description 4
- 108010048818 seryl-histidine Proteins 0.000 description 4
- 108010071207 serylmethionine Proteins 0.000 description 4
- 108010084932 tryptophyl-proline Proteins 0.000 description 4
- NCQMBSJGJMYKCK-ZLUOBGJFSA-N Ala-Ser-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O NCQMBSJGJMYKCK-ZLUOBGJFSA-N 0.000 description 3
- 108020005544 Antisense RNA Proteins 0.000 description 3
- XAJRHVUUVUPFQL-ACZMJKKPSA-N Asp-Glu-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O XAJRHVUUVUPFQL-ACZMJKKPSA-N 0.000 description 3
- FNAJNWPDTIXYJN-CIUDSAMLSA-N Gln-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCC(N)=O FNAJNWPDTIXYJN-CIUDSAMLSA-N 0.000 description 3
- XXCDTYBVGMPIOA-FXQIFTODSA-N Glu-Asp-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O XXCDTYBVGMPIOA-FXQIFTODSA-N 0.000 description 3
- ZJWIXBZTAAJERF-IHRRRGAJSA-N Lys-Lys-Arg Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CCCN=C(N)N ZJWIXBZTAAJERF-IHRRRGAJSA-N 0.000 description 3
- LXVLKXPFIDDHJG-CIUDSAMLSA-N Pro-Glu-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O LXVLKXPFIDDHJG-CIUDSAMLSA-N 0.000 description 3
- ZMLRZBWCXPQADC-TUAOUCFPSA-N Pro-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 ZMLRZBWCXPQADC-TUAOUCFPSA-N 0.000 description 3
- YMTLKLXDFCSCNX-BYPYZUCNSA-N Ser-Gly-Gly Chemical compound OC[C@H](N)C(=O)NCC(=O)NCC(O)=O YMTLKLXDFCSCNX-BYPYZUCNSA-N 0.000 description 3
- 108091027967 Small hairpin RNA Proteins 0.000 description 3
- 108020004459 Small interfering RNA Proteins 0.000 description 3
- PZTZYZUTCPZWJH-FXQIFTODSA-N Val-Ser-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)O)N PZTZYZUTCPZWJH-FXQIFTODSA-N 0.000 description 3
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 3
- 239000007864 aqueous solution Substances 0.000 description 3
- 108010062796 arginyllysine Proteins 0.000 description 3
- 108010077245 asparaginyl-proline Proteins 0.000 description 3
- 239000003184 complementary RNA Substances 0.000 description 3
- 108010006664 gamma-glutamyl-glycyl-glycine Proteins 0.000 description 3
- 238000003780 insertion Methods 0.000 description 3
- 230000037431 insertion Effects 0.000 description 3
- 108010057821 leucylproline Proteins 0.000 description 3
- 239000007788 liquid Substances 0.000 description 3
- 108010009298 lysylglutamic acid Proteins 0.000 description 3
- 239000002609 medium Substances 0.000 description 3
- 108091070501 miRNA Proteins 0.000 description 3
- 239000002679 microRNA Substances 0.000 description 3
- 239000004055 small Interfering RNA Substances 0.000 description 3
- 238000012360 testing method Methods 0.000 description 3
- UZKQTCBAMSWPJD-UQCOIBPSSA-N trans-Zeatin Natural products OCC(/C)=C\CNC1=NC=NC2=C1N=CN2 UZKQTCBAMSWPJD-UQCOIBPSSA-N 0.000 description 3
- UZKQTCBAMSWPJD-FARCUNLSSA-N trans-zeatin Chemical compound OCC(/C)=C/CNC1=NC=NC2=C1N=CN2 UZKQTCBAMSWPJD-FARCUNLSSA-N 0.000 description 3
- 229940023877 zeatin Drugs 0.000 description 3
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 2
- RLMISHABBKUNFO-WHFBIAKZSA-N Ala-Ala-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O RLMISHABBKUNFO-WHFBIAKZSA-N 0.000 description 2
- CXRCVCURMBFFOL-FXQIFTODSA-N Ala-Ala-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O CXRCVCURMBFFOL-FXQIFTODSA-N 0.000 description 2
- VNYMOTCMNHJGTG-JBDRJPRFSA-N Ala-Ile-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O VNYMOTCMNHJGTG-JBDRJPRFSA-N 0.000 description 2
- DEWWPUNXRNGMQN-LPEHRKFASA-N Ala-Met-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N1CCC[C@@H]1C(=O)O)N DEWWPUNXRNGMQN-LPEHRKFASA-N 0.000 description 2
- XWFWAXPOLRTDFZ-FXQIFTODSA-N Ala-Pro-Ser Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O XWFWAXPOLRTDFZ-FXQIFTODSA-N 0.000 description 2
- LZLCLRQMUQWUHJ-GUBZILKMSA-N Asn-Lys-Gln Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N LZLCLRQMUQWUHJ-GUBZILKMSA-N 0.000 description 2
- AMGQTNHANMRPOE-LKXGYXEUSA-N Asn-Thr-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O AMGQTNHANMRPOE-LKXGYXEUSA-N 0.000 description 2
- DTNUIAJCPRMNBT-WHFBIAKZSA-N Asp-Gly-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](C)C(O)=O DTNUIAJCPRMNBT-WHFBIAKZSA-N 0.000 description 2
- BNCKELUXXUYRNY-GUBZILKMSA-N Cys-Lys-Glu Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CS)N BNCKELUXXUYRNY-GUBZILKMSA-N 0.000 description 2
- 230000004568 DNA-binding Effects 0.000 description 2
- 108090000790 Enzymes Proteins 0.000 description 2
- 102000004190 Enzymes Human genes 0.000 description 2
- LFQSCWFLJHTTHZ-UHFFFAOYSA-N Ethanol Chemical compound CCO LFQSCWFLJHTTHZ-UHFFFAOYSA-N 0.000 description 2
- 108700024394 Exon Proteins 0.000 description 2
- QFJPFPCSXOXMKI-BPUTZDHNSA-N Gln-Gln-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)N)N QFJPFPCSXOXMKI-BPUTZDHNSA-N 0.000 description 2
- MCAVASRGVBVPMX-FXQIFTODSA-N Gln-Glu-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O MCAVASRGVBVPMX-FXQIFTODSA-N 0.000 description 2
- HYPVLWGNBIYTNA-GUBZILKMSA-N Gln-Leu-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O HYPVLWGNBIYTNA-GUBZILKMSA-N 0.000 description 2
- ITYRYNUZHPNCIK-GUBZILKMSA-N Glu-Ala-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O ITYRYNUZHPNCIK-GUBZILKMSA-N 0.000 description 2
- RSUVOPBMWMTVDI-XEGUGMAKSA-N Glu-Ala-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CCC(O)=O)C)C(O)=O)=CNC2=C1 RSUVOPBMWMTVDI-XEGUGMAKSA-N 0.000 description 2
- ZJFNRQHUIHKZJF-GUBZILKMSA-N Glu-His-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(O)=O ZJFNRQHUIHKZJF-GUBZILKMSA-N 0.000 description 2
- ZWMYUDZLXAQHCK-CIUDSAMLSA-N Glu-Met-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(O)=O ZWMYUDZLXAQHCK-CIUDSAMLSA-N 0.000 description 2
- GDOZQTNZPCUARW-YFKPBYRVSA-N Gly-Gly-Glu Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O GDOZQTNZPCUARW-YFKPBYRVSA-N 0.000 description 2
- YWAQATDNEKZFFK-BYPYZUCNSA-N Gly-Gly-Ser Chemical compound NCC(=O)NCC(=O)N[C@@H](CO)C(O)=O YWAQATDNEKZFFK-BYPYZUCNSA-N 0.000 description 2
- HFPVRZWORNJRRC-UWVGGRQHSA-N Gly-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN HFPVRZWORNJRRC-UWVGGRQHSA-N 0.000 description 2
- PDSUIXMZYNURGI-AVGNSLFASA-N His-Arg-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CC1=CN=CN1 PDSUIXMZYNURGI-AVGNSLFASA-N 0.000 description 2
- NULSANWBUWLTKN-NAKRPEOUSA-N Ile-Arg-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CO)C(=O)O)N NULSANWBUWLTKN-NAKRPEOUSA-N 0.000 description 2
- BSWLQVGEVFYGIM-ZPFDUUQYSA-N Ile-Gln-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N BSWLQVGEVFYGIM-ZPFDUUQYSA-N 0.000 description 2
- YVKSMSDXKMSIRX-GUBZILKMSA-N Leu-Glu-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O YVKSMSDXKMSIRX-GUBZILKMSA-N 0.000 description 2
- LXKNSJLSGPNHSK-KKUMJFAQSA-N Leu-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N LXKNSJLSGPNHSK-KKUMJFAQSA-N 0.000 description 2
- MVJRBCJCRYGCKV-GVXVVHGQSA-N Leu-Val-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O MVJRBCJCRYGCKV-GVXVVHGQSA-N 0.000 description 2
- VKVDRTGWLVZJOM-DCAQKATOSA-N Leu-Val-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O VKVDRTGWLVZJOM-DCAQKATOSA-N 0.000 description 2
- IMAKMJCBYCSMHM-AVGNSLFASA-N Lys-Glu-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN IMAKMJCBYCSMHM-AVGNSLFASA-N 0.000 description 2
- XGZDDOKIHSYHTO-SZMVWBNQSA-N Lys-Trp-Glu Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O)=CNC2=C1 XGZDDOKIHSYHTO-SZMVWBNQSA-N 0.000 description 2
- YRAWWKUTNBILNT-FXQIFTODSA-N Met-Ala-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O YRAWWKUTNBILNT-FXQIFTODSA-N 0.000 description 2
- OVBPIULPVIDEAO-UHFFFAOYSA-N N-Pteroyl-L-glutaminsaeure Natural products C=1N=C2NC(N)=NC(=O)C2=NC=1CNC1=CC=C(C(=O)NC(CCC(O)=O)C(O)=O)C=C1 OVBPIULPVIDEAO-UHFFFAOYSA-N 0.000 description 2
- BQVUABVGYYSDCJ-UHFFFAOYSA-N Nalpha-L-Leucyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)CC(C)C)C(O)=O)=CNC2=C1 BQVUABVGYYSDCJ-UHFFFAOYSA-N 0.000 description 2
- 108700026244 Open Reading Frames Proteins 0.000 description 2
- FMMIYCMOVGXZIP-AVGNSLFASA-N Phe-Glu-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O FMMIYCMOVGXZIP-AVGNSLFASA-N 0.000 description 2
- KDYPMIZMXDECSU-JYJNAYRXSA-N Phe-Leu-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 KDYPMIZMXDECSU-JYJNAYRXSA-N 0.000 description 2
- RYQWALWYQWBUKN-FHWLQOOXSA-N Phe-Phe-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O RYQWALWYQWBUKN-FHWLQOOXSA-N 0.000 description 2
- MCWHYUWXVNRXFV-RWMBFGLXSA-N Pro-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 MCWHYUWXVNRXFV-RWMBFGLXSA-N 0.000 description 2
- CGSOWZUPLOKYOR-AVGNSLFASA-N Pro-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 CGSOWZUPLOKYOR-AVGNSLFASA-N 0.000 description 2
- AIOWVDNPESPXRB-YTWAJWBKSA-N Pro-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2)O AIOWVDNPESPXRB-YTWAJWBKSA-N 0.000 description 2
- YDTUEBLEAVANFH-RCWTZXSCSA-N Pro-Val-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 YDTUEBLEAVANFH-RCWTZXSCSA-N 0.000 description 2
- HEUVHBXOVZONPU-BJDJZHNGSA-N Ser-Leu-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HEUVHBXOVZONPU-BJDJZHNGSA-N 0.000 description 2
- GZSZPKSBVAOGIE-CIUDSAMLSA-N Ser-Lys-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O GZSZPKSBVAOGIE-CIUDSAMLSA-N 0.000 description 2
- GYDFRTRSSXOZCR-ACZMJKKPSA-N Ser-Ser-Glu Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O GYDFRTRSSXOZCR-ACZMJKKPSA-N 0.000 description 2
- JURQXQBJKUHGJS-UHFFFAOYSA-N Ser-Ser-Ser-Ser Chemical compound OCC(N)C(=O)NC(CO)C(=O)NC(CO)C(=O)NC(CO)C(O)=O JURQXQBJKUHGJS-UHFFFAOYSA-N 0.000 description 2
- SNXUIBACCONSOH-BWBBJGPYSA-N Ser-Thr-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CO)C(O)=O SNXUIBACCONSOH-BWBBJGPYSA-N 0.000 description 2
- 108091081024 Start codon Proteins 0.000 description 2
- TYVAWPFQYFPSBR-BFHQHQDPSA-N Thr-Ala-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)NCC(O)=O TYVAWPFQYFPSBR-BFHQHQDPSA-N 0.000 description 2
- WPSKTVVMQCXPRO-BWBBJGPYSA-N Thr-Ser-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O WPSKTVVMQCXPRO-BWBBJGPYSA-N 0.000 description 2
- CZWIHKFGHICAJX-BPUTZDHNSA-N Trp-Glu-Glu Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O)=CNC2=C1 CZWIHKFGHICAJX-BPUTZDHNSA-N 0.000 description 2
- SSYBNWFXCFNRFN-GUBZILKMSA-N Val-Pro-Ser Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O SSYBNWFXCFNRFN-GUBZILKMSA-N 0.000 description 2
- UGFMVXRXULGLNO-XPUUQOCRSA-N Val-Ser-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O UGFMVXRXULGLNO-XPUUQOCRSA-N 0.000 description 2
- OJOBTAOGJIWAGB-UHFFFAOYSA-N acetosyringone Chemical compound COC1=CC(C(C)=O)=CC(OC)=C1O OJOBTAOGJIWAGB-UHFFFAOYSA-N 0.000 description 2
- 108010044940 alanylglutamine Proteins 0.000 description 2
- 108010052670 arginyl-glutamyl-glutamic acid Proteins 0.000 description 2
- 108010068380 arginylarginine Proteins 0.000 description 2
- 108010060035 arginylproline Proteins 0.000 description 2
- 108010047857 aspartylglycine Proteins 0.000 description 2
- 230000027455 binding Effects 0.000 description 2
- 230000015572 biosynthetic process Effects 0.000 description 2
- 238000004364 calculation method Methods 0.000 description 2
- 238000007796 conventional method Methods 0.000 description 2
- 230000004069 differentiation Effects 0.000 description 2
- 239000003337 fertilizer Substances 0.000 description 2
- 229960000304 folic acid Drugs 0.000 description 2
- 235000019152 folic acid Nutrition 0.000 description 2
- 239000011724 folic acid Substances 0.000 description 2
- 108010078144 glutaminyl-glycine Proteins 0.000 description 2
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 2
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 2
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 2
- 108010001064 glycyl-glycyl-glycyl-glycine Proteins 0.000 description 2
- 108010050848 glycylleucine Proteins 0.000 description 2
- 229930027917 kanamycin Natural products 0.000 description 2
- SBUJHOSQTJFQJX-NOAMYHISSA-N kanamycin Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CN)O[C@@H]1O[C@H]1[C@H](O)[C@@H](O[C@@H]2[C@@H]([C@@H](N)[C@H](O)[C@@H](CO)O2)O)[C@H](N)C[C@@H]1N SBUJHOSQTJFQJX-NOAMYHISSA-N 0.000 description 2
- 229960000318 kanamycin Drugs 0.000 description 2
- 229930182823 kanamycin A Natural products 0.000 description 2
- 108010003700 lysyl aspartic acid Proteins 0.000 description 2
- 108010054155 lysyllysine Proteins 0.000 description 2
- 239000006870 ms-medium Substances 0.000 description 2
- 108010031719 prolyl-serine Proteins 0.000 description 2
- 108010015796 prolylisoleucine Proteins 0.000 description 2
- 108010053725 prolylvaline Proteins 0.000 description 2
- 108091008146 restriction endonucleases Proteins 0.000 description 2
- 239000012882 rooting medium Substances 0.000 description 2
- 230000035040 seed growth Effects 0.000 description 2
- 108010007375 seryl-seryl-seryl-arginine Proteins 0.000 description 2
- 239000000126 substance Substances 0.000 description 2
- PQFMROVJTOPVDF-JBDRJPRFSA-N (2s)-2-[[(2s)-2-[[(2s)-2-[[(2s)-2-amino-3-carboxypropanoyl]amino]-3-carboxypropanoyl]amino]-4-carboxybutanoyl]amino]butanedioic acid Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O PQFMROVJTOPVDF-JBDRJPRFSA-N 0.000 description 1
- HKZAAJSTFUZYTO-LURJTMIESA-N (2s)-2-[[2-[[2-[[2-[(2-aminoacetyl)amino]acetyl]amino]acetyl]amino]acetyl]amino]-3-hydroxypropanoic acid Chemical compound NCC(=O)NCC(=O)NCC(=O)NCC(=O)N[C@@H](CO)C(O)=O HKZAAJSTFUZYTO-LURJTMIESA-N 0.000 description 1
- XWTNPSHCJMZAHQ-QMMMGPOBSA-N 2-[[2-[[2-[[(2s)-2-amino-4-methylpentanoyl]amino]acetyl]amino]acetyl]amino]acetic acid Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)NCC(=O)NCC(O)=O XWTNPSHCJMZAHQ-QMMMGPOBSA-N 0.000 description 1
- AAQGRPOPTAUUBM-ZLUOBGJFSA-N Ala-Ala-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O AAQGRPOPTAUUBM-ZLUOBGJFSA-N 0.000 description 1
- ZFXQNADNEBRERM-BJDJZHNGSA-N Ala-Ala-Pro-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 ZFXQNADNEBRERM-BJDJZHNGSA-N 0.000 description 1
- JBVSSSZFNTXJDX-YTLHQDLWSA-N Ala-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@H](C)N JBVSSSZFNTXJDX-YTLHQDLWSA-N 0.000 description 1
- FVSOUJZKYWEFOB-KBIXCLLPSA-N Ala-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](C)N FVSOUJZKYWEFOB-KBIXCLLPSA-N 0.000 description 1
- OMMDTNGURYRDAC-NRPADANISA-N Ala-Glu-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O OMMDTNGURYRDAC-NRPADANISA-N 0.000 description 1
- NBTGEURICRTMGL-WHFBIAKZSA-N Ala-Gly-Ser Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O NBTGEURICRTMGL-WHFBIAKZSA-N 0.000 description 1
- LDLSENBXQNDTPB-DCAQKATOSA-N Ala-Lys-Arg Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N LDLSENBXQNDTPB-DCAQKATOSA-N 0.000 description 1
- SDZRIBWEVVRDQI-CIUDSAMLSA-N Ala-Lys-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O SDZRIBWEVVRDQI-CIUDSAMLSA-N 0.000 description 1
- PMQXMXAASGFUDX-SRVKXCTJSA-N Ala-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CCCCN PMQXMXAASGFUDX-SRVKXCTJSA-N 0.000 description 1
- VCSABYLVNWQYQE-UHFFFAOYSA-N Ala-Lys-Lys Natural products NCCCCC(NC(=O)C(N)C)C(=O)NC(CCCCN)C(O)=O VCSABYLVNWQYQE-UHFFFAOYSA-N 0.000 description 1
- OLVCTPPSXNRGKV-GUBZILKMSA-N Ala-Pro-Pro Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 OLVCTPPSXNRGKV-GUBZILKMSA-N 0.000 description 1
- YYAVDNKUWLAFCV-ACZMJKKPSA-N Ala-Ser-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O YYAVDNKUWLAFCV-ACZMJKKPSA-N 0.000 description 1
- VNFSAYFQLXPHPY-CIQUZCHMSA-N Ala-Thr-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VNFSAYFQLXPHPY-CIQUZCHMSA-N 0.000 description 1
- BGGAIXWIZCIFSG-XDTLVQLUSA-N Ala-Tyr-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O BGGAIXWIZCIFSG-XDTLVQLUSA-N 0.000 description 1
- REWSWYIDQIELBE-FXQIFTODSA-N Ala-Val-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O REWSWYIDQIELBE-FXQIFTODSA-N 0.000 description 1
- UXJCMQFPDWCHKX-DCAQKATOSA-N Arg-Arg-Glu Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(O)=O)C(O)=O UXJCMQFPDWCHKX-DCAQKATOSA-N 0.000 description 1
- XVLLUZMFSAYKJV-GUBZILKMSA-N Arg-Asp-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O XVLLUZMFSAYKJV-GUBZILKMSA-N 0.000 description 1
- HKRXJBBCQBAGIM-FXQIFTODSA-N Arg-Asp-Ser Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CO)C(=O)O)N)CN=C(N)N HKRXJBBCQBAGIM-FXQIFTODSA-N 0.000 description 1
- SVHRPCMZTWZROG-DCAQKATOSA-N Arg-Cys-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCCN=C(N)N)N SVHRPCMZTWZROG-DCAQKATOSA-N 0.000 description 1
- BJNUAWGXPSHQMJ-DCAQKATOSA-N Arg-Gln-Met Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O BJNUAWGXPSHQMJ-DCAQKATOSA-N 0.000 description 1
- QAXCZGMLVICQKS-SRVKXCTJSA-N Arg-Glu-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCCN=C(N)N)N QAXCZGMLVICQKS-SRVKXCTJSA-N 0.000 description 1
- YKBHOXLMMPZPHQ-GMOBBJLQSA-N Arg-Ile-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O YKBHOXLMMPZPHQ-GMOBBJLQSA-N 0.000 description 1
- UHFUZWSZQKMDSX-DCAQKATOSA-N Arg-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N UHFUZWSZQKMDSX-DCAQKATOSA-N 0.000 description 1
- DNBMCNQKNOKOSD-DCAQKATOSA-N Arg-Pro-Gln Chemical compound NC(N)=NCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O DNBMCNQKNOKOSD-DCAQKATOSA-N 0.000 description 1
- XSPKAHFVDKRGRL-DCAQKATOSA-N Arg-Pro-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O XSPKAHFVDKRGRL-DCAQKATOSA-N 0.000 description 1
- KXOPYFNQLVUOAQ-FXQIFTODSA-N Arg-Ser-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O KXOPYFNQLVUOAQ-FXQIFTODSA-N 0.000 description 1
- SYFHFLGAROUHNT-VEVYYDQMSA-N Arg-Thr-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O SYFHFLGAROUHNT-VEVYYDQMSA-N 0.000 description 1
- DDBMKOCQWNFDBH-RHYQMDGZSA-N Arg-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)O DDBMKOCQWNFDBH-RHYQMDGZSA-N 0.000 description 1
- QUBKBPZGMZWOKQ-SZMVWBNQSA-N Arg-Trp-Arg Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCCN=C(N)N)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)=CNC2=C1 QUBKBPZGMZWOKQ-SZMVWBNQSA-N 0.000 description 1
- LLQIAIUAKGNOSE-NHCYSSNCSA-N Arg-Val-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCN=C(N)N LLQIAIUAKGNOSE-NHCYSSNCSA-N 0.000 description 1
- SLKLLQWZQHXYSV-CIUDSAMLSA-N Asn-Ala-Lys Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(O)=O SLKLLQWZQHXYSV-CIUDSAMLSA-N 0.000 description 1
- XHFXZQHTLJVZBN-FXQIFTODSA-N Asn-Arg-Asn Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N)CN=C(N)N XHFXZQHTLJVZBN-FXQIFTODSA-N 0.000 description 1
- HOIFSHOLNKQCSA-FXQIFTODSA-N Asn-Arg-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O HOIFSHOLNKQCSA-FXQIFTODSA-N 0.000 description 1
- CQMQJWRCRQSBAF-BPUTZDHNSA-N Asn-Arg-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC(=O)N)N CQMQJWRCRQSBAF-BPUTZDHNSA-N 0.000 description 1
- IOTKDTZEEBZNCM-UGYAYLCHSA-N Asn-Asn-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O IOTKDTZEEBZNCM-UGYAYLCHSA-N 0.000 description 1
- VKCOHFFSTKCXEQ-OLHMAJIHSA-N Asn-Asn-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VKCOHFFSTKCXEQ-OLHMAJIHSA-N 0.000 description 1
- KLKHFFMNGWULBN-VKHMYHEASA-N Asn-Gly Chemical compound NC(=O)C[C@H](N)C(=O)NCC(O)=O KLKHFFMNGWULBN-VKHMYHEASA-N 0.000 description 1
- GJFYPBDMUGGLFR-NKWVEPMBSA-N Asn-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CC(=O)N)N)C(=O)O GJFYPBDMUGGLFR-NKWVEPMBSA-N 0.000 description 1
- JGIAYNNXZKKKOW-KKUMJFAQSA-N Asn-His-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CC(=O)N)N JGIAYNNXZKKKOW-KKUMJFAQSA-N 0.000 description 1
- PTSDPWIHOYMRGR-UGYAYLCHSA-N Asn-Ile-Asn Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O PTSDPWIHOYMRGR-UGYAYLCHSA-N 0.000 description 1
- ZMUQQMGITUJQTI-CIUDSAMLSA-N Asn-Leu-Asn Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O ZMUQQMGITUJQTI-CIUDSAMLSA-N 0.000 description 1
- YXVAESUIQFDBHN-SRVKXCTJSA-N Asn-Phe-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O YXVAESUIQFDBHN-SRVKXCTJSA-N 0.000 description 1
- YUOXLJYVSZYPBJ-CIUDSAMLSA-N Asn-Pro-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O YUOXLJYVSZYPBJ-CIUDSAMLSA-N 0.000 description 1
- NJSNXIOKBHPFMB-GMOBBJLQSA-N Asn-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CC(=O)N)N NJSNXIOKBHPFMB-GMOBBJLQSA-N 0.000 description 1
- HPBNLFLSSQDFQW-WHFBIAKZSA-N Asn-Ser-Gly Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O HPBNLFLSSQDFQW-WHFBIAKZSA-N 0.000 description 1
- NPZJLGMWMDNQDD-GHCJXIJMSA-N Asn-Ser-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NPZJLGMWMDNQDD-GHCJXIJMSA-N 0.000 description 1
- HNXWVVHIGTZTBO-LKXGYXEUSA-N Asn-Ser-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC(N)=O HNXWVVHIGTZTBO-LKXGYXEUSA-N 0.000 description 1
- JXMREEPBRANWBY-VEVYYDQMSA-N Asn-Thr-Arg Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O JXMREEPBRANWBY-VEVYYDQMSA-N 0.000 description 1
- QIRJQYQOIKBPBZ-IHRRRGAJSA-N Asn-Tyr-Arg Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O QIRJQYQOIKBPBZ-IHRRRGAJSA-N 0.000 description 1
- PBVLJOIPOGUQQP-CIUDSAMLSA-N Asp-Ala-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O PBVLJOIPOGUQQP-CIUDSAMLSA-N 0.000 description 1
- CNKAZIGBGQIHLL-GUBZILKMSA-N Asp-Arg-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC(=O)O)N CNKAZIGBGQIHLL-GUBZILKMSA-N 0.000 description 1
- UQBGYPFHWFZMCD-ZLUOBGJFSA-N Asp-Asn-Asn Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O UQBGYPFHWFZMCD-ZLUOBGJFSA-N 0.000 description 1
- ZELQAFZSJOBEQS-ACZMJKKPSA-N Asp-Asn-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZELQAFZSJOBEQS-ACZMJKKPSA-N 0.000 description 1
- WCFCYFDBMNFSPA-ACZMJKKPSA-N Asp-Asp-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(O)=O WCFCYFDBMNFSPA-ACZMJKKPSA-N 0.000 description 1
- ZCKYZTGLXIEOKS-CIUDSAMLSA-N Asp-Asp-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)O)N ZCKYZTGLXIEOKS-CIUDSAMLSA-N 0.000 description 1
- YNCHFVRXEQFPBY-BQBZGAKWSA-N Asp-Gly-Arg Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N YNCHFVRXEQFPBY-BQBZGAKWSA-N 0.000 description 1
- PZXPWHFYZXTFBI-YUMQZZPRSA-N Asp-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(O)=O PZXPWHFYZXTFBI-YUMQZZPRSA-N 0.000 description 1
- MGSVBZIBCCKGCY-ZLUOBGJFSA-N Asp-Ser-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O MGSVBZIBCCKGCY-ZLUOBGJFSA-N 0.000 description 1
- UXRVDHVARNBOIO-QSFUFRPTSA-N Asp-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CC(=O)O)N UXRVDHVARNBOIO-QSFUFRPTSA-N 0.000 description 1
- BLGNLNRBABWDST-CIUDSAMLSA-N Cys-Leu-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CS)N BLGNLNRBABWDST-CIUDSAMLSA-N 0.000 description 1
- JUNZLDGUJZIUCO-IHRRRGAJSA-N Cys-Pro-Tyr Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CS)N)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O JUNZLDGUJZIUCO-IHRRRGAJSA-N 0.000 description 1
- SOBBAYVQSNXYPQ-ACZMJKKPSA-N Gln-Asn-Asn Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O SOBBAYVQSNXYPQ-ACZMJKKPSA-N 0.000 description 1
- TWHDOEYLXXQYOZ-FXQIFTODSA-N Gln-Asn-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N TWHDOEYLXXQYOZ-FXQIFTODSA-N 0.000 description 1
- LLVXTGUTDYMJLY-GUBZILKMSA-N Gln-Asn-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)N)N LLVXTGUTDYMJLY-GUBZILKMSA-N 0.000 description 1
- WMOMPXKOKASNBK-PEFMBERDSA-N Gln-Asn-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WMOMPXKOKASNBK-PEFMBERDSA-N 0.000 description 1
- CITDWMLWXNUQKD-FXQIFTODSA-N Gln-Gln-Asn Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N CITDWMLWXNUQKD-FXQIFTODSA-N 0.000 description 1
- LWDGZZGWDMHBOF-FXQIFTODSA-N Gln-Glu-Asn Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O LWDGZZGWDMHBOF-FXQIFTODSA-N 0.000 description 1
- SNLOOPZHAQDMJG-CIUDSAMLSA-N Gln-Glu-Glu Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O SNLOOPZHAQDMJG-CIUDSAMLSA-N 0.000 description 1
- KKCJHBXMYYVWMX-KQXIARHKSA-N Gln-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)N)N KKCJHBXMYYVWMX-KQXIARHKSA-N 0.000 description 1
- QKCZZAZNMMVICF-DCAQKATOSA-N Gln-Leu-Glu Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O QKCZZAZNMMVICF-DCAQKATOSA-N 0.000 description 1
- IULKWYSYZSURJK-AVGNSLFASA-N Gln-Leu-Lys Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O IULKWYSYZSURJK-AVGNSLFASA-N 0.000 description 1
- LURQDGKYBFWWJA-MNXVOIDGSA-N Gln-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCC(=O)N)N LURQDGKYBFWWJA-MNXVOIDGSA-N 0.000 description 1
- JRHPEMVLTRADLJ-AVGNSLFASA-N Gln-Lys-Lys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N JRHPEMVLTRADLJ-AVGNSLFASA-N 0.000 description 1
- ZEEPYMXTJWIMSN-GUBZILKMSA-N Gln-Lys-Ser Chemical compound NCCCC[C@@H](C(=O)N[C@@H](CO)C(O)=O)NC(=O)[C@@H](N)CCC(N)=O ZEEPYMXTJWIMSN-GUBZILKMSA-N 0.000 description 1
- XUZQMPGBGFQJMY-SRVKXCTJSA-N Gln-Met-His Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCC(=O)N)N XUZQMPGBGFQJMY-SRVKXCTJSA-N 0.000 description 1
- PBYFVIQRFLNQCO-GUBZILKMSA-N Gln-Pro-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O PBYFVIQRFLNQCO-GUBZILKMSA-N 0.000 description 1
- HMIXCETWRYDVMO-GUBZILKMSA-N Gln-Pro-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O HMIXCETWRYDVMO-GUBZILKMSA-N 0.000 description 1
- UWMDGPFFTKDUIY-HJGDQZAQSA-N Gln-Pro-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O UWMDGPFFTKDUIY-HJGDQZAQSA-N 0.000 description 1
- JILRMFFFCHUUTJ-ACZMJKKPSA-N Gln-Ser-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O JILRMFFFCHUUTJ-ACZMJKKPSA-N 0.000 description 1
- NHMRJKKAVMENKJ-WDCWCFNPSA-N Gln-Thr-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O NHMRJKKAVMENKJ-WDCWCFNPSA-N 0.000 description 1
- VLOLPWWCNKWRNB-LOKLDPHHSA-N Gln-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O VLOLPWWCNKWRNB-LOKLDPHHSA-N 0.000 description 1
- ZFBBMCKQSNJZSN-AUTRQRHGSA-N Gln-Val-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZFBBMCKQSNJZSN-AUTRQRHGSA-N 0.000 description 1
- VYOILACOFPPNQH-UMNHJUIQSA-N Gln-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)N)N VYOILACOFPPNQH-UMNHJUIQSA-N 0.000 description 1
- HNAUFGBKJLTWQE-IFFSRLJSSA-N Gln-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CCC(=O)N)N)O HNAUFGBKJLTWQE-IFFSRLJSSA-N 0.000 description 1
- FYBSCGZLICNOBA-XQXXSGGOSA-N Glu-Ala-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FYBSCGZLICNOBA-XQXXSGGOSA-N 0.000 description 1
- CVPXINNKRTZBMO-CIUDSAMLSA-N Glu-Arg-Asn Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)CN=C(N)N CVPXINNKRTZBMO-CIUDSAMLSA-N 0.000 description 1
- WOSRKEJQESVHGA-CIUDSAMLSA-N Glu-Arg-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O WOSRKEJQESVHGA-CIUDSAMLSA-N 0.000 description 1
- GLWXKFRTOHKGIT-ACZMJKKPSA-N Glu-Asn-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O GLWXKFRTOHKGIT-ACZMJKKPSA-N 0.000 description 1
- JRCUFCXYZLPSDZ-ACZMJKKPSA-N Glu-Asp-Ser Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O JRCUFCXYZLPSDZ-ACZMJKKPSA-N 0.000 description 1
- AUTNXSQEVVHSJK-YVNDNENWSA-N Glu-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O AUTNXSQEVVHSJK-YVNDNENWSA-N 0.000 description 1
- IQACOVZVOMVILH-FXQIFTODSA-N Glu-Glu-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O IQACOVZVOMVILH-FXQIFTODSA-N 0.000 description 1
- QJCKNLPMTPXXEM-AUTRQRHGSA-N Glu-Glu-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O QJCKNLPMTPXXEM-AUTRQRHGSA-N 0.000 description 1
- OAGVHWYIBZMWLA-YFKPBYRVSA-N Glu-Gly-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)NCC(O)=O OAGVHWYIBZMWLA-YFKPBYRVSA-N 0.000 description 1
- LGYCLOCORAEQSZ-PEFMBERDSA-N Glu-Ile-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O LGYCLOCORAEQSZ-PEFMBERDSA-N 0.000 description 1
- ZCOJVESMNGBGLF-GRLWGSQLSA-N Glu-Ile-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ZCOJVESMNGBGLF-GRLWGSQLSA-N 0.000 description 1
- ZHNHJYYFCGUZNQ-KBIXCLLPSA-N Glu-Ile-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(O)=O ZHNHJYYFCGUZNQ-KBIXCLLPSA-N 0.000 description 1
- GJBUAAAIZSRCDC-GVXVVHGQSA-N Glu-Leu-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O GJBUAAAIZSRCDC-GVXVVHGQSA-N 0.000 description 1
- OCJRHJZKGGSPRW-IUCAKERBSA-N Glu-Lys-Gly Chemical compound NCCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O OCJRHJZKGGSPRW-IUCAKERBSA-N 0.000 description 1
- ZQYZDDXTNQXUJH-CIUDSAMLSA-N Glu-Met-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CCC(=O)O)N ZQYZDDXTNQXUJH-CIUDSAMLSA-N 0.000 description 1
- DCBSZJJHOTXMHY-DCAQKATOSA-N Glu-Pro-Pro Chemical compound OC(=O)CC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DCBSZJJHOTXMHY-DCAQKATOSA-N 0.000 description 1
- NNQDRRUXFJYCCJ-NHCYSSNCSA-N Glu-Pro-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O NNQDRRUXFJYCCJ-NHCYSSNCSA-N 0.000 description 1
- ALMBZBOCGSVSAI-ACZMJKKPSA-N Glu-Ser-Asn Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)N)C(=O)O)N ALMBZBOCGSVSAI-ACZMJKKPSA-N 0.000 description 1
- MRWYPDWDZSLWJM-ACZMJKKPSA-N Glu-Ser-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O MRWYPDWDZSLWJM-ACZMJKKPSA-N 0.000 description 1
- RGJKYNUINKGPJN-RWRJDSDZSA-N Glu-Thr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](CCC(=O)O)N RGJKYNUINKGPJN-RWRJDSDZSA-N 0.000 description 1
- YQAQQKPWFOBSMU-WDCWCFNPSA-N Glu-Thr-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O YQAQQKPWFOBSMU-WDCWCFNPSA-N 0.000 description 1
- PYTZFYUXZZHOAD-WHFBIAKZSA-N Gly-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)CN PYTZFYUXZZHOAD-WHFBIAKZSA-N 0.000 description 1
- JXYMPBCYRKWJEE-BQBZGAKWSA-N Gly-Arg-Ala Chemical compound [H]NCC(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O JXYMPBCYRKWJEE-BQBZGAKWSA-N 0.000 description 1
- DWUKOTKSTDWGAE-BQBZGAKWSA-N Gly-Asn-Arg Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N DWUKOTKSTDWGAE-BQBZGAKWSA-N 0.000 description 1
- CIMULJZTTOBOPN-WHFBIAKZSA-N Gly-Asn-Asn Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O CIMULJZTTOBOPN-WHFBIAKZSA-N 0.000 description 1
- GGEJHJIXRBTJPD-BYPYZUCNSA-N Gly-Asn-Gly Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O GGEJHJIXRBTJPD-BYPYZUCNSA-N 0.000 description 1
- FMNHBTKMRFVGRO-FOHZUACHSA-N Gly-Asn-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)CN FMNHBTKMRFVGRO-FOHZUACHSA-N 0.000 description 1
- LCNXZQROPKFGQK-WHFBIAKZSA-N Gly-Asp-Ser Chemical compound NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O LCNXZQROPKFGQK-WHFBIAKZSA-N 0.000 description 1
- BYYNJRSNDARRBX-YFKPBYRVSA-N Gly-Gln-Gly Chemical compound NCC(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O BYYNJRSNDARRBX-YFKPBYRVSA-N 0.000 description 1
- DHDOADIPGZTAHT-YUMQZZPRSA-N Gly-Glu-Arg Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N DHDOADIPGZTAHT-YUMQZZPRSA-N 0.000 description 1
- UFPXDFOYHVEIPI-BYPYZUCNSA-N Gly-Gly-Asp Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O UFPXDFOYHVEIPI-BYPYZUCNSA-N 0.000 description 1
- FXGRXIATVXUAHO-WEDXCCLWSA-N Gly-Lys-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCCN FXGRXIATVXUAHO-WEDXCCLWSA-N 0.000 description 1
- BBTCXWTXOXUNFX-IUCAKERBSA-N Gly-Met-Arg Chemical compound CSCC[C@H](NC(=O)CN)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O BBTCXWTXOXUNFX-IUCAKERBSA-N 0.000 description 1
- ZWRDOVYMQAAISL-UWVGGRQHSA-N Gly-Met-Lys Chemical compound CSCC[C@H](NC(=O)CN)C(=O)N[C@H](C(O)=O)CCCCN ZWRDOVYMQAAISL-UWVGGRQHSA-N 0.000 description 1
- GAAHQHNCMIAYEX-UWVGGRQHSA-N Gly-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN GAAHQHNCMIAYEX-UWVGGRQHSA-N 0.000 description 1
- KBBFOULZCHWGJX-KBPBESRZSA-N Gly-Tyr-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)CN)O KBBFOULZCHWGJX-KBPBESRZSA-N 0.000 description 1
- 108020005004 Guide RNA Proteins 0.000 description 1
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 1
- RVKIPWVMZANZLI-UHFFFAOYSA-N H-Lys-Trp-OH Natural products C1=CC=C2C(CC(NC(=O)C(N)CCCCN)C(O)=O)=CNC2=C1 RVKIPWVMZANZLI-UHFFFAOYSA-N 0.000 description 1
- HVLSXIKZNLPZJJ-TXZCQADKSA-N HA peptide Chemical compound C([C@@H](C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](C)C(O)=O)NC(=O)[C@H]1N(CCC1)C(=O)[C@@H](N)CC=1C=CC(O)=CC=1)C1=CC=C(O)C=C1 HVLSXIKZNLPZJJ-TXZCQADKSA-N 0.000 description 1
- SQUHHTBVTRBESD-UHFFFAOYSA-N Hexa-Ac-myo-Inositol Natural products CC(=O)OC1C(OC(C)=O)C(OC(C)=O)C(OC(C)=O)C(OC(C)=O)C1OC(C)=O SQUHHTBVTRBESD-UHFFFAOYSA-N 0.000 description 1
- ZPVJJPAIUZLSNE-DCAQKATOSA-N His-Arg-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O ZPVJJPAIUZLSNE-DCAQKATOSA-N 0.000 description 1
- VYMGAXSNYUFVCK-GUBZILKMSA-N His-Gln-Asn Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N VYMGAXSNYUFVCK-GUBZILKMSA-N 0.000 description 1
- VHHYJBSXXMPQGZ-AVGNSLFASA-N His-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CN=CN1)N VHHYJBSXXMPQGZ-AVGNSLFASA-N 0.000 description 1
- TVRMJKNELJKNRS-GUBZILKMSA-N His-Glu-Asn Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N TVRMJKNELJKNRS-GUBZILKMSA-N 0.000 description 1
- CTGZVVQVIBSOBB-AVGNSLFASA-N His-His-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(O)=O CTGZVVQVIBSOBB-AVGNSLFASA-N 0.000 description 1
- STOOMQFEJUVAKR-KKUMJFAQSA-N His-His-His Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1N=CNC=1)C(=O)N[C@@H](CC=1N=CNC=1)C(O)=O)C1=CNC=N1 STOOMQFEJUVAKR-KKUMJFAQSA-N 0.000 description 1
- BZKDJRSZWLPJNI-SRVKXCTJSA-N His-His-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O BZKDJRSZWLPJNI-SRVKXCTJSA-N 0.000 description 1
- ORZGPQXISSXQGW-IHRRRGAJSA-N His-His-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C(C)C)C(O)=O ORZGPQXISSXQGW-IHRRRGAJSA-N 0.000 description 1
- UMBKDWGQESDCTO-KKUMJFAQSA-N His-Lys-Lys Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O UMBKDWGQESDCTO-KKUMJFAQSA-N 0.000 description 1
- CWSZWFILCNSNEX-CIUDSAMLSA-N His-Ser-Asn Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)N)C(=O)O)N CWSZWFILCNSNEX-CIUDSAMLSA-N 0.000 description 1
- DPTBVFUDCPINIP-JURCDPSOSA-N Ile-Ala-Phe Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 DPTBVFUDCPINIP-JURCDPSOSA-N 0.000 description 1
- FJWYJQRCVNGEAQ-ZPFDUUQYSA-N Ile-Asn-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N FJWYJQRCVNGEAQ-ZPFDUUQYSA-N 0.000 description 1
- KUHFPGIVBOCRMV-MNXVOIDGSA-N Ile-Gln-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(C)C)C(=O)O)N KUHFPGIVBOCRMV-MNXVOIDGSA-N 0.000 description 1
- OVPYIUNCVSOVNF-ZPFDUUQYSA-N Ile-Gln-Pro Natural products CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(O)=O OVPYIUNCVSOVNF-ZPFDUUQYSA-N 0.000 description 1
- DFJJAVZIHDFOGQ-MNXVOIDGSA-N Ile-Glu-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N DFJJAVZIHDFOGQ-MNXVOIDGSA-N 0.000 description 1
- SLQVFYWBGNNOTK-BYULHYEWSA-N Ile-Gly-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)N)C(=O)O)N SLQVFYWBGNNOTK-BYULHYEWSA-N 0.000 description 1
- UAQSZXGJGLHMNV-XEGUGMAKSA-N Ile-Gly-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N UAQSZXGJGLHMNV-XEGUGMAKSA-N 0.000 description 1
- URWXDJAEEGBADB-TUBUOCAGSA-N Ile-His-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N URWXDJAEEGBADB-TUBUOCAGSA-N 0.000 description 1
- YSGBJIQXTIVBHZ-AJNGGQMLSA-N Ile-Lys-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O YSGBJIQXTIVBHZ-AJNGGQMLSA-N 0.000 description 1
- MASWXTFJVNRZPT-NAKRPEOUSA-N Ile-Met-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(=O)O)N MASWXTFJVNRZPT-NAKRPEOUSA-N 0.000 description 1
- IITVUURPOYGCTD-NAKRPEOUSA-N Ile-Pro-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O IITVUURPOYGCTD-NAKRPEOUSA-N 0.000 description 1
- FQYQMFCIJNWDQZ-CYDGBPFRSA-N Ile-Pro-Pro Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 FQYQMFCIJNWDQZ-CYDGBPFRSA-N 0.000 description 1
- PXKACEXYLPBMAD-JBDRJPRFSA-N Ile-Ser-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)O)N PXKACEXYLPBMAD-JBDRJPRFSA-N 0.000 description 1
- SAEWJTCJQVZQNZ-IUKAMOBKSA-N Ile-Thr-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N SAEWJTCJQVZQNZ-IUKAMOBKSA-N 0.000 description 1
- COWHUQXTSYTKQC-RWRJDSDZSA-N Ile-Thr-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N COWHUQXTSYTKQC-RWRJDSDZSA-N 0.000 description 1
- JJQQGCMKLOEGAV-OSUNSFLBSA-N Ile-Thr-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(=O)O)N JJQQGCMKLOEGAV-OSUNSFLBSA-N 0.000 description 1
- WCNWGAUZWWSYDG-SVSWQMSJSA-N Ile-Thr-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)O)N WCNWGAUZWWSYDG-SVSWQMSJSA-N 0.000 description 1
- YJRSIJZUIUANHO-NAKRPEOUSA-N Ile-Val-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(=O)O)N YJRSIJZUIUANHO-NAKRPEOUSA-N 0.000 description 1
- PMGDADKJMCOXHX-UHFFFAOYSA-N L-Arginyl-L-glutamin-acetat Natural products NC(=N)NCCCC(N)C(=O)NC(CCC(N)=O)C(O)=O PMGDADKJMCOXHX-UHFFFAOYSA-N 0.000 description 1
- FADYJNXDPBKVCA-UHFFFAOYSA-N L-Phenylalanyl-L-lysin Natural products NCCCCC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FADYJNXDPBKVCA-UHFFFAOYSA-N 0.000 description 1
- RCFDOSNHHZGBOY-UHFFFAOYSA-N L-isoleucyl-L-alanine Natural products CCC(C)C(N)C(=O)NC(C)C(O)=O RCFDOSNHHZGBOY-UHFFFAOYSA-N 0.000 description 1
- 241000880493 Leptailurus serval Species 0.000 description 1
- CQQGCWPXDHTTNF-GUBZILKMSA-N Leu-Ala-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O CQQGCWPXDHTTNF-GUBZILKMSA-N 0.000 description 1
- STAVRDQLZOTNKJ-RHYQMDGZSA-N Leu-Arg-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O STAVRDQLZOTNKJ-RHYQMDGZSA-N 0.000 description 1
- WGNOPSQMIQERPK-GARJFASQSA-N Leu-Asn-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N WGNOPSQMIQERPK-GARJFASQSA-N 0.000 description 1
- WGNOPSQMIQERPK-UHFFFAOYSA-N Leu-Asn-Pro Natural products CC(C)CC(N)C(=O)NC(CC(=O)N)C(=O)N1CCCC1C(=O)O WGNOPSQMIQERPK-UHFFFAOYSA-N 0.000 description 1
- QCSFMCFHVGTLFF-NHCYSSNCSA-N Leu-Asp-Val Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O QCSFMCFHVGTLFF-NHCYSSNCSA-N 0.000 description 1
- VQPPIMUZCZCOIL-GUBZILKMSA-N Leu-Gln-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O VQPPIMUZCZCOIL-GUBZILKMSA-N 0.000 description 1
- FQZPTCNSNPWHLJ-AVGNSLFASA-N Leu-Gln-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(O)=O FQZPTCNSNPWHLJ-AVGNSLFASA-N 0.000 description 1
- WIDZHJTYKYBLSR-DCAQKATOSA-N Leu-Glu-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WIDZHJTYKYBLSR-DCAQKATOSA-N 0.000 description 1
- HQUXQAMSWFIRET-AVGNSLFASA-N Leu-Glu-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN HQUXQAMSWFIRET-AVGNSLFASA-N 0.000 description 1
- KEVYYIMVELOXCT-KBPBESRZSA-N Leu-Gly-Phe Chemical compound CC(C)C[C@H]([NH3+])C(=O)NCC(=O)N[C@H](C([O-])=O)CC1=CC=CC=C1 KEVYYIMVELOXCT-KBPBESRZSA-N 0.000 description 1
- BKTXKJMNTSMJDQ-AVGNSLFASA-N Leu-His-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N BKTXKJMNTSMJDQ-AVGNSLFASA-N 0.000 description 1
- OMHLATXVNQSALM-FQUUOJAGSA-N Leu-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(C)C)N OMHLATXVNQSALM-FQUUOJAGSA-N 0.000 description 1
- HRTRLSRYZZKPCO-BJDJZHNGSA-N Leu-Ile-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O HRTRLSRYZZKPCO-BJDJZHNGSA-N 0.000 description 1
- LVTJJOJKDCVZGP-QWRGUYRKSA-N Leu-Lys-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O LVTJJOJKDCVZGP-QWRGUYRKSA-N 0.000 description 1
- ONPJGOIVICHWBW-BZSNNMDCSA-N Leu-Lys-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 ONPJGOIVICHWBW-BZSNNMDCSA-N 0.000 description 1
- GNRPTBRHRRZCMA-RWMBFGLXSA-N Leu-Met-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N1CCC[C@@H]1C(=O)O)N GNRPTBRHRRZCMA-RWMBFGLXSA-N 0.000 description 1
- KWLWZYMNUZJKMZ-IHRRRGAJSA-N Leu-Pro-Leu Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O KWLWZYMNUZJKMZ-IHRRRGAJSA-N 0.000 description 1
- QONKWXNJRRNTBV-AVGNSLFASA-N Leu-Pro-Met Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCSC)C(=O)O)N QONKWXNJRRNTBV-AVGNSLFASA-N 0.000 description 1
- IRMLZWSRWSGTOP-CIUDSAMLSA-N Leu-Ser-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O IRMLZWSRWSGTOP-CIUDSAMLSA-N 0.000 description 1
- KZZCOWMDDXDKSS-CIUDSAMLSA-N Leu-Ser-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O KZZCOWMDDXDKSS-CIUDSAMLSA-N 0.000 description 1
- JIHDFWWRYHSAQB-GUBZILKMSA-N Leu-Ser-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O JIHDFWWRYHSAQB-GUBZILKMSA-N 0.000 description 1
- MVHXGBZUJLWZOH-BJDJZHNGSA-N Leu-Ser-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MVHXGBZUJLWZOH-BJDJZHNGSA-N 0.000 description 1
- KLSUAWUZBMAZCL-RHYQMDGZSA-N Leu-Thr-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(O)=O KLSUAWUZBMAZCL-RHYQMDGZSA-N 0.000 description 1
- WGAZVKFCPHXZLO-SZMVWBNQSA-N Leu-Trp-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N WGAZVKFCPHXZLO-SZMVWBNQSA-N 0.000 description 1
- WSXTWLJHTLRFLW-SRVKXCTJSA-N Lys-Ala-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(O)=O WSXTWLJHTLRFLW-SRVKXCTJSA-N 0.000 description 1
- JGAMUXDWYSXYLM-SRVKXCTJSA-N Lys-Arg-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O JGAMUXDWYSXYLM-SRVKXCTJSA-N 0.000 description 1
- SJNZALDHDUYDBU-IHRRRGAJSA-N Lys-Arg-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(O)=O SJNZALDHDUYDBU-IHRRRGAJSA-N 0.000 description 1
- HGZHSNBZDOLMLH-DCAQKATOSA-N Lys-Asn-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCCN)N HGZHSNBZDOLMLH-DCAQKATOSA-N 0.000 description 1
- GKFNXYMAMKJSKD-NHCYSSNCSA-N Lys-Asp-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O GKFNXYMAMKJSKD-NHCYSSNCSA-N 0.000 description 1
- MRWXLRGAFDOILG-DCAQKATOSA-N Lys-Gln-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MRWXLRGAFDOILG-DCAQKATOSA-N 0.000 description 1
- VEGLGAOVLFODGC-GUBZILKMSA-N Lys-Glu-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O VEGLGAOVLFODGC-GUBZILKMSA-N 0.000 description 1
- RFQATBGBLDAKGI-VHSXEESVSA-N Lys-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CCCCN)N)C(=O)O RFQATBGBLDAKGI-VHSXEESVSA-N 0.000 description 1
- IVFUVMSKSFSFBT-NHCYSSNCSA-N Lys-Ile-Gly Chemical compound OC(=O)CNC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCCCN IVFUVMSKSFSFBT-NHCYSSNCSA-N 0.000 description 1
- AHFOKDZWPPGJAZ-SRVKXCTJSA-N Lys-Lys-Cys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CS)C(=O)O)N AHFOKDZWPPGJAZ-SRVKXCTJSA-N 0.000 description 1
- HVAUKHLDSDDROB-KKUMJFAQSA-N Lys-Lys-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O HVAUKHLDSDDROB-KKUMJFAQSA-N 0.000 description 1
- QQPSCXKFDSORFT-IHRRRGAJSA-N Lys-Lys-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCCN QQPSCXKFDSORFT-IHRRRGAJSA-N 0.000 description 1
- UDXSLGLHFUBRRM-OEAJRASXSA-N Lys-Phe-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CCCCN)N)O UDXSLGLHFUBRRM-OEAJRASXSA-N 0.000 description 1
- LECIJRIRMVOFMH-ULQDDVLXSA-N Lys-Pro-Phe Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 LECIJRIRMVOFMH-ULQDDVLXSA-N 0.000 description 1
- MGKFCQFVPKOWOL-CIUDSAMLSA-N Lys-Ser-Asp Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)O)C(=O)O)N MGKFCQFVPKOWOL-CIUDSAMLSA-N 0.000 description 1
- IOQWIOPSKJOEKI-SRVKXCTJSA-N Lys-Ser-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O IOQWIOPSKJOEKI-SRVKXCTJSA-N 0.000 description 1
- TVOOGUNBIWAURO-KATARQTJSA-N Lys-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCCN)N)O TVOOGUNBIWAURO-KATARQTJSA-N 0.000 description 1
- YUTZYVTZDVZBJJ-IHPCNDPISA-N Lys-Trp-Lys Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O)=CNC2=C1 YUTZYVTZDVZBJJ-IHPCNDPISA-N 0.000 description 1
- LMMBAXJRYSXCOQ-ACRUOGEOSA-N Lys-Tyr-Phe Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](Cc1ccccc1)C(O)=O LMMBAXJRYSXCOQ-ACRUOGEOSA-N 0.000 description 1
- NYTDJEZBAAFLLG-IHRRRGAJSA-N Lys-Val-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(O)=O NYTDJEZBAAFLLG-IHRRRGAJSA-N 0.000 description 1
- MVQGZYIOMXAFQG-GUBZILKMSA-N Met-Ala-Arg Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCNC(N)=N MVQGZYIOMXAFQG-GUBZILKMSA-N 0.000 description 1
- VHGIWFGJIHTASW-FXQIFTODSA-N Met-Ala-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O VHGIWFGJIHTASW-FXQIFTODSA-N 0.000 description 1
- ULNXMMYXQKGNPG-LPEHRKFASA-N Met-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCSC)N ULNXMMYXQKGNPG-LPEHRKFASA-N 0.000 description 1
- WXHHTBVYQOSYSL-FXQIFTODSA-N Met-Ala-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O WXHHTBVYQOSYSL-FXQIFTODSA-N 0.000 description 1
- CTVJSFRHUOSCQQ-DCAQKATOSA-N Met-Arg-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O CTVJSFRHUOSCQQ-DCAQKATOSA-N 0.000 description 1
- AHZNUGRZHMZGFL-GUBZILKMSA-N Met-Arg-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CCCNC(N)=N AHZNUGRZHMZGFL-GUBZILKMSA-N 0.000 description 1
- SBSIKVMCCJUCBZ-GUBZILKMSA-N Met-Asn-Arg Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCNC(N)=N SBSIKVMCCJUCBZ-GUBZILKMSA-N 0.000 description 1
- GODBLDDYHFTUAH-CIUDSAMLSA-N Met-Asp-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(O)=O GODBLDDYHFTUAH-CIUDSAMLSA-N 0.000 description 1
- DRINJBAHUGXNFC-DCAQKATOSA-N Met-Asp-His Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(O)=O DRINJBAHUGXNFC-DCAQKATOSA-N 0.000 description 1
- GMMLGMFBYCFCCX-KZVJFYERSA-N Met-Thr-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O GMMLGMFBYCFCCX-KZVJFYERSA-N 0.000 description 1
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 1
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 1
- 108091028043 Nucleic acid sequence Proteins 0.000 description 1
- WGXOKDLDIWSOCV-MELADBBJSA-N Phe-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O WGXOKDLDIWSOCV-MELADBBJSA-N 0.000 description 1
- UEEVBGHEGJMDDV-AVGNSLFASA-N Phe-Asp-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 UEEVBGHEGJMDDV-AVGNSLFASA-N 0.000 description 1
- PPHFTNABKQRAJV-JYJNAYRXSA-N Phe-His-Gln Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N PPHFTNABKQRAJV-JYJNAYRXSA-N 0.000 description 1
- IFMDQWDAJUMMJC-DCAQKATOSA-N Pro-Ala-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O IFMDQWDAJUMMJC-DCAQKATOSA-N 0.000 description 1
- XQLBWXHVZVBNJM-FXQIFTODSA-N Pro-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 XQLBWXHVZVBNJM-FXQIFTODSA-N 0.000 description 1
- HPXVFFIIGOAQRV-DCAQKATOSA-N Pro-Arg-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O HPXVFFIIGOAQRV-DCAQKATOSA-N 0.000 description 1
- FISHYTLIMUYTQY-GUBZILKMSA-N Pro-Gln-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H]1CCCN1 FISHYTLIMUYTQY-GUBZILKMSA-N 0.000 description 1
- SKICPQLTOXGWGO-GARJFASQSA-N Pro-Gln-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCC(=O)N)C(=O)N2CCC[C@@H]2C(=O)O SKICPQLTOXGWGO-GARJFASQSA-N 0.000 description 1
- PULPZRAHVFBVTO-DCAQKATOSA-N Pro-Glu-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O PULPZRAHVFBVTO-DCAQKATOSA-N 0.000 description 1
- KIPIKSXPPLABPN-CIUDSAMLSA-N Pro-Glu-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1 KIPIKSXPPLABPN-CIUDSAMLSA-N 0.000 description 1
- MGDFPGCFVJFITQ-CIUDSAMLSA-N Pro-Glu-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O MGDFPGCFVJFITQ-CIUDSAMLSA-N 0.000 description 1
- FDINZVJXLPILKV-DCAQKATOSA-N Pro-His-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(N)=O)C(O)=O FDINZVJXLPILKV-DCAQKATOSA-N 0.000 description 1
- BBFRBZYKHIKFBX-GMOBBJLQSA-N Pro-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@@H]1CCCN1 BBFRBZYKHIKFBX-GMOBBJLQSA-N 0.000 description 1
- FKVNLUZHSFCNGY-RVMXOQNASA-N Pro-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 FKVNLUZHSFCNGY-RVMXOQNASA-N 0.000 description 1
- FKYKZHOKDOPHSA-DCAQKATOSA-N Pro-Leu-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O FKYKZHOKDOPHSA-DCAQKATOSA-N 0.000 description 1
- SRBFGSGDNNQABI-FHWLQOOXSA-N Pro-Leu-Trp Chemical compound N([C@@H](CC(C)C)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(O)=O)C(=O)[C@@H]1CCCN1 SRBFGSGDNNQABI-FHWLQOOXSA-N 0.000 description 1
- MHHQQZIFLWFZGR-DCAQKATOSA-N Pro-Lys-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O MHHQQZIFLWFZGR-DCAQKATOSA-N 0.000 description 1
- ANESFYPBAJPYNJ-SDDRHHMPSA-N Pro-Met-Pro Chemical compound CSCC[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 ANESFYPBAJPYNJ-SDDRHHMPSA-N 0.000 description 1
- ZVEQWRWMRFIVSD-HRCADAONSA-N Pro-Phe-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)N3CCC[C@@H]3C(=O)O ZVEQWRWMRFIVSD-HRCADAONSA-N 0.000 description 1
- DWPXHLIBFQLKLK-CYDGBPFRSA-N Pro-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 DWPXHLIBFQLKLK-CYDGBPFRSA-N 0.000 description 1
- POQFNPILEQEODH-FXQIFTODSA-N Pro-Ser-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O POQFNPILEQEODH-FXQIFTODSA-N 0.000 description 1
- SNGZLPOXVRTNMB-LPEHRKFASA-N Pro-Ser-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CO)C(=O)N2CCC[C@@H]2C(=O)O SNGZLPOXVRTNMB-LPEHRKFASA-N 0.000 description 1
- MKGIILKDUGDRRO-FXQIFTODSA-N Pro-Ser-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H]1CCCN1 MKGIILKDUGDRRO-FXQIFTODSA-N 0.000 description 1
- DYJTXTCEXMCPBF-UFYCRDLUSA-N Pro-Tyr-Phe Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CC3=CC=CC=C3)C(=O)O DYJTXTCEXMCPBF-UFYCRDLUSA-N 0.000 description 1
- MWMKFWJYRRGXOR-ZLUOBGJFSA-N Ser-Ala-Asn Chemical compound N[C@H](C(=O)N[C@H](C(=O)N[C@H](C(=O)O)CC(N)=O)C)CO MWMKFWJYRRGXOR-ZLUOBGJFSA-N 0.000 description 1
- IYCBDVBJWDXQRR-FXQIFTODSA-N Ser-Ala-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CCSC)C(O)=O IYCBDVBJWDXQRR-FXQIFTODSA-N 0.000 description 1
- BRKHVZNDAOMAHX-BIIVOSGPSA-N Ser-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N BRKHVZNDAOMAHX-BIIVOSGPSA-N 0.000 description 1
- JPIDMRXXNMIVKY-VZFHVOOUSA-N Ser-Ala-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JPIDMRXXNMIVKY-VZFHVOOUSA-N 0.000 description 1
- QFBNNYNWKYKVJO-DCAQKATOSA-N Ser-Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CCCN=C(N)N QFBNNYNWKYKVJO-DCAQKATOSA-N 0.000 description 1
- HZWAHWQZPSXNCB-BPUTZDHNSA-N Ser-Arg-Trp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O HZWAHWQZPSXNCB-BPUTZDHNSA-N 0.000 description 1
- TYYBJUYSTWJHGO-ZKWXMUAHSA-N Ser-Asn-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O TYYBJUYSTWJHGO-ZKWXMUAHSA-N 0.000 description 1
- MMAPOBOTRUVNKJ-ZLUOBGJFSA-N Ser-Asp-Ser Chemical compound C([C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CO)N)C(=O)O MMAPOBOTRUVNKJ-ZLUOBGJFSA-N 0.000 description 1
- IXUGADGDCQDLSA-FXQIFTODSA-N Ser-Gln-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CO)N IXUGADGDCQDLSA-FXQIFTODSA-N 0.000 description 1
- BRGQQXQKPUCUJQ-KBIXCLLPSA-N Ser-Glu-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BRGQQXQKPUCUJQ-KBIXCLLPSA-N 0.000 description 1
- UBRMZSHOOIVJPW-SRVKXCTJSA-N Ser-Leu-Lys Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O UBRMZSHOOIVJPW-SRVKXCTJSA-N 0.000 description 1
- XXNYYSXNXCJYKX-DCAQKATOSA-N Ser-Leu-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(O)=O XXNYYSXNXCJYKX-DCAQKATOSA-N 0.000 description 1
- VZQRNAYURWAEFE-KKUMJFAQSA-N Ser-Leu-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 VZQRNAYURWAEFE-KKUMJFAQSA-N 0.000 description 1
- YUJLIIRMIAGMCQ-CIUDSAMLSA-N Ser-Leu-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YUJLIIRMIAGMCQ-CIUDSAMLSA-N 0.000 description 1
- CRJZZXMAADSBBQ-SRVKXCTJSA-N Ser-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CO CRJZZXMAADSBBQ-SRVKXCTJSA-N 0.000 description 1
- LRZLZIUXQBIWTB-KATARQTJSA-N Ser-Lys-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LRZLZIUXQBIWTB-KATARQTJSA-N 0.000 description 1
- NQZFFLBPNDLTPO-DLOVCJGASA-N Ser-Phe-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CO)N NQZFFLBPNDLTPO-DLOVCJGASA-N 0.000 description 1
- AZWNCEBQZXELEZ-FXQIFTODSA-N Ser-Pro-Ser Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O AZWNCEBQZXELEZ-FXQIFTODSA-N 0.000 description 1
- CUXJENOFJXOSOZ-BIIVOSGPSA-N Ser-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CO)N)C(=O)O CUXJENOFJXOSOZ-BIIVOSGPSA-N 0.000 description 1
- XQJCEKXQUJQNNK-ZLUOBGJFSA-N Ser-Ser-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O XQJCEKXQUJQNNK-ZLUOBGJFSA-N 0.000 description 1
- XJDMUQCLVSCRSJ-VZFHVOOUSA-N Ser-Thr-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O XJDMUQCLVSCRSJ-VZFHVOOUSA-N 0.000 description 1
- ZSDXEKUKQAKZFE-XAVMHZPKSA-N Ser-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N)O ZSDXEKUKQAKZFE-XAVMHZPKSA-N 0.000 description 1
- VLMIUSLQONKLDV-HEIBUPTGSA-N Ser-Thr-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VLMIUSLQONKLDV-HEIBUPTGSA-N 0.000 description 1
- XPVIVVLLLOFBRH-XIRDDKMYSA-N Ser-Trp-Lys Chemical compound NCCCC[C@H](NC(=O)[C@H](Cc1c[nH]c2ccccc12)NC(=O)[C@@H](N)CO)C(O)=O XPVIVVLLLOFBRH-XIRDDKMYSA-N 0.000 description 1
- JGUWRQWULDWNCM-FXQIFTODSA-N Ser-Val-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O JGUWRQWULDWNCM-FXQIFTODSA-N 0.000 description 1
- 238000000692 Student's t-test Methods 0.000 description 1
- DWYAUVCQDTZIJI-VZFHVOOUSA-N Thr-Ala-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O DWYAUVCQDTZIJI-VZFHVOOUSA-N 0.000 description 1
- GARULAKWZGFIKC-RWRJDSDZSA-N Thr-Gln-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O GARULAKWZGFIKC-RWRJDSDZSA-N 0.000 description 1
- DKDHTRVDOUZZTP-IFFSRLJSSA-N Thr-Gln-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)[C@@H](C)O)C(O)=O DKDHTRVDOUZZTP-IFFSRLJSSA-N 0.000 description 1
- LGNBRHZANHMZHK-NUMRIWBASA-N Thr-Glu-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O LGNBRHZANHMZHK-NUMRIWBASA-N 0.000 description 1
- GKWNLDNXMMLRMC-GLLZPBPUSA-N Thr-Glu-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O GKWNLDNXMMLRMC-GLLZPBPUSA-N 0.000 description 1
- YJCVECXVYHZOBK-KNZXXDILSA-N Thr-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H]([C@@H](C)O)N YJCVECXVYHZOBK-KNZXXDILSA-N 0.000 description 1
- FQPDRTDDEZXCEC-SVSWQMSJSA-N Thr-Ile-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O FQPDRTDDEZXCEC-SVSWQMSJSA-N 0.000 description 1
- VTVVYQOXJCZVEB-WDCWCFNPSA-N Thr-Leu-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O VTVVYQOXJCZVEB-WDCWCFNPSA-N 0.000 description 1
- NCXVJIQMWSGRHY-KXNHARMFSA-N Thr-Leu-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N)O NCXVJIQMWSGRHY-KXNHARMFSA-N 0.000 description 1
- ZSPQUTWLWGWTPS-HJGDQZAQSA-N Thr-Lys-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O ZSPQUTWLWGWTPS-HJGDQZAQSA-N 0.000 description 1
- MROIJTGJGIDEEJ-RCWTZXSCSA-N Thr-Pro-Pro Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 MROIJTGJGIDEEJ-RCWTZXSCSA-N 0.000 description 1
- GVMXJJAJLIEASL-ZJDVBMNYSA-N Thr-Pro-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O GVMXJJAJLIEASL-ZJDVBMNYSA-N 0.000 description 1
- FWTFAZKJORVTIR-VZFHVOOUSA-N Thr-Ser-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O FWTFAZKJORVTIR-VZFHVOOUSA-N 0.000 description 1
- WKGAAMOJPMBBMC-IXOXFDKPSA-N Thr-Ser-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O WKGAAMOJPMBBMC-IXOXFDKPSA-N 0.000 description 1
- QJIODPFLAASXJC-JHYOHUSXSA-N Thr-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N)O QJIODPFLAASXJC-JHYOHUSXSA-N 0.000 description 1
- KZTLZZQTJMCGIP-ZJDVBMNYSA-N Thr-Val-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KZTLZZQTJMCGIP-ZJDVBMNYSA-N 0.000 description 1
- KRCPXGSWDOGHAM-XIRDDKMYSA-N Trp-Lys-Asp Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O KRCPXGSWDOGHAM-XIRDDKMYSA-N 0.000 description 1
- UHXOYRWHIQZAKV-SZMVWBNQSA-N Trp-Pro-Arg Chemical compound O=C([C@H](CC=1C2=CC=CC=C2NC=1)N)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O UHXOYRWHIQZAKV-SZMVWBNQSA-N 0.000 description 1
- XDQGKIMTRSVSBC-WDSOQIARSA-N Trp-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC1=CNC2=CC=CC=C12 XDQGKIMTRSVSBC-WDSOQIARSA-N 0.000 description 1
- XHALUUQSNXSPLP-UFYCRDLUSA-N Tyr-Arg-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 XHALUUQSNXSPLP-UFYCRDLUSA-N 0.000 description 1
- CKKFTIQYURNSEI-IHRRRGAJSA-N Tyr-Asn-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 CKKFTIQYURNSEI-IHRRRGAJSA-N 0.000 description 1
- IYHNBRUWVBIVJR-IHRRRGAJSA-N Tyr-Gln-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 IYHNBRUWVBIVJR-IHRRRGAJSA-N 0.000 description 1
- WVGKPKDWYQXWLU-BZSNNMDCSA-N Tyr-His-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)N[C@@H](CCCCN)C(=O)O)N)O WVGKPKDWYQXWLU-BZSNNMDCSA-N 0.000 description 1
- JLKVWTICWVWGSK-JYJNAYRXSA-N Tyr-Lys-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 JLKVWTICWVWGSK-JYJNAYRXSA-N 0.000 description 1
- CNNVVEPJTFOGHI-ACRUOGEOSA-N Tyr-Lys-Tyr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CNNVVEPJTFOGHI-ACRUOGEOSA-N 0.000 description 1
- LRHBBGDMBLFYGL-FHWLQOOXSA-N Tyr-Phe-Glu Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CCC(O)=O)C(O)=O)C1=CC=C(O)C=C1 LRHBBGDMBLFYGL-FHWLQOOXSA-N 0.000 description 1
- PSALWJCUIAQKFW-ACRUOGEOSA-N Tyr-Phe-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N PSALWJCUIAQKFW-ACRUOGEOSA-N 0.000 description 1
- SYFHQHYTNCQCCN-MELADBBJSA-N Tyr-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)C(=O)O SYFHQHYTNCQCCN-MELADBBJSA-N 0.000 description 1
- LUMQYLVYUIRHHU-YJRXYDGGSA-N Tyr-Ser-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LUMQYLVYUIRHHU-YJRXYDGGSA-N 0.000 description 1
- KKHRWGYHBZORMQ-NHCYSSNCSA-N Val-Arg-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N KKHRWGYHBZORMQ-NHCYSSNCSA-N 0.000 description 1
- QPZMOUMNTGTEFR-ZKWXMUAHSA-N Val-Asn-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](C(C)C)N QPZMOUMNTGTEFR-ZKWXMUAHSA-N 0.000 description 1
- SZTTYWIUCGSURQ-AUTRQRHGSA-N Val-Glu-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O SZTTYWIUCGSURQ-AUTRQRHGSA-N 0.000 description 1
- YTPLVNUZZOBFFC-SCZZXKLOSA-N Val-Gly-Pro Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N1CCC[C@@H]1C(O)=O YTPLVNUZZOBFFC-SCZZXKLOSA-N 0.000 description 1
- APEBUJBRGCMMHP-HJWJTTGWSA-N Val-Ile-Phe Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 APEBUJBRGCMMHP-HJWJTTGWSA-N 0.000 description 1
- MHHAWNPHDLCPLF-ULQDDVLXSA-N Val-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC1=CC=CC=C1 MHHAWNPHDLCPLF-ULQDDVLXSA-N 0.000 description 1
- AJNUKMZFHXUBMK-GUBZILKMSA-N Val-Ser-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N AJNUKMZFHXUBMK-GUBZILKMSA-N 0.000 description 1
- VHIZXDZMTDVFGX-DCAQKATOSA-N Val-Ser-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](C(C)C)N VHIZXDZMTDVFGX-DCAQKATOSA-N 0.000 description 1
- RLVTVHSDKHBFQP-ULQDDVLXSA-N Val-Tyr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC1=CC=C(O)C=C1 RLVTVHSDKHBFQP-ULQDDVLXSA-N 0.000 description 1
- AEFJNECXZCODJM-UWVGGRQHSA-N Val-Val-Gly Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](C(C)C)C(=O)NCC([O-])=O AEFJNECXZCODJM-UWVGGRQHSA-N 0.000 description 1
- JVGDAEKKZKKZFO-RCWTZXSCSA-N Val-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](C(C)C)N)O JVGDAEKKZKKZFO-RCWTZXSCSA-N 0.000 description 1
- 108010024078 alanyl-glycyl-serine Proteins 0.000 description 1
- 238000004458 analytical method Methods 0.000 description 1
- 108010013835 arginine glutamate Proteins 0.000 description 1
- 108010008355 arginyl-glutamine Proteins 0.000 description 1
- 108010001271 arginyl-glutamyl-arginine Proteins 0.000 description 1
- 108010029539 arginyl-prolyl-proline Proteins 0.000 description 1
- 108010094001 arginyl-tryptophyl-arginine Proteins 0.000 description 1
- 108010021908 aspartyl-aspartyl-glutamyl-aspartic acid Proteins 0.000 description 1
- 108010038633 aspartylglutamate Proteins 0.000 description 1
- 230000001580 bacterial effect Effects 0.000 description 1
- 238000010276 construction Methods 0.000 description 1
- 108010060199 cysteinylproline Proteins 0.000 description 1
- 238000001514 detection method Methods 0.000 description 1
- 230000002068 genetic effect Effects 0.000 description 1
- 108010042598 glutamyl-aspartyl-glycine Proteins 0.000 description 1
- 108010049041 glutamylalanine Proteins 0.000 description 1
- 108010019832 glycyl-asparaginyl-glycine Proteins 0.000 description 1
- 108010067216 glycyl-glycyl-glycine Proteins 0.000 description 1
- 108010081551 glycylphenylalanine Proteins 0.000 description 1
- 108010077515 glycylproline Proteins 0.000 description 1
- 108010087823 glycyltyrosine Proteins 0.000 description 1
- 108010037850 glycylvaline Proteins 0.000 description 1
- 230000012010 growth Effects 0.000 description 1
- 108010092114 histidylphenylalanine Proteins 0.000 description 1
- 238000000338 in vitro Methods 0.000 description 1
- JTEDVYBZBROSJT-UHFFFAOYSA-N indole-3-butyric acid Chemical compound C1=CC=C2C(CCCC(=O)O)=CNC2=C1 JTEDVYBZBROSJT-UHFFFAOYSA-N 0.000 description 1
- 208000015181 infectious disease Diseases 0.000 description 1
- 238000001802 infusion Methods 0.000 description 1
- CDAISMWEOUEBRE-GPIVLXJGSA-N inositol Chemical compound O[C@H]1[C@H](O)[C@@H](O)[C@H](O)[C@H](O)[C@@H]1O CDAISMWEOUEBRE-GPIVLXJGSA-N 0.000 description 1
- 229960000367 inositol Drugs 0.000 description 1
- 108010031424 isoleucyl-prolyl-proline Proteins 0.000 description 1
- 108010083708 leucyl-aspartyl-valine Proteins 0.000 description 1
- 108010073093 leucyl-glycyl-glycyl-glycine Proteins 0.000 description 1
- 108010051673 leucyl-glycyl-phenylalanine Proteins 0.000 description 1
- 108010047926 leucyl-lysyl-tyrosine Proteins 0.000 description 1
- 108010064235 lysylglycine Proteins 0.000 description 1
- 108010038320 lysylphenylalanine Proteins 0.000 description 1
- 238000004519 manufacturing process Methods 0.000 description 1
- 238000005259 measurement Methods 0.000 description 1
- 230000001404 mediated effect Effects 0.000 description 1
- 108010016686 methionyl-alanyl-serine Proteins 0.000 description 1
- 108010056582 methionylglutamic acid Proteins 0.000 description 1
- 235000016709 nutrition Nutrition 0.000 description 1
- 108010073025 phenylalanylphenylalanine Proteins 0.000 description 1
- 108010051242 phenylalanylserine Proteins 0.000 description 1
- 229920001184 polypeptide Polymers 0.000 description 1
- 108090000765 processed proteins & peptides Proteins 0.000 description 1
- 102000004196 processed proteins & peptides Human genes 0.000 description 1
- 108010090894 prolylleucine Proteins 0.000 description 1
- 238000000746 purification Methods 0.000 description 1
- 238000012113 quantitative test Methods 0.000 description 1
- 238000005215 recombination Methods 0.000 description 1
- CDAISMWEOUEBRE-UHFFFAOYSA-N scyllo-inosotol Natural products OC1C(O)C(O)C(O)C(O)C1O CDAISMWEOUEBRE-UHFFFAOYSA-N 0.000 description 1
- 238000012163 sequencing technique Methods 0.000 description 1
- 108010026333 seryl-proline Proteins 0.000 description 1
- 239000002924 silencing RNA Substances 0.000 description 1
- SUKJFIGYRHOWBL-UHFFFAOYSA-N sodium hypochlorite Chemical compound [Na+].Cl[O-] SUKJFIGYRHOWBL-UHFFFAOYSA-N 0.000 description 1
- 239000000243 solution Substances 0.000 description 1
- 239000008223 sterile water Substances 0.000 description 1
- 239000000725 suspension Substances 0.000 description 1
- 238000012353 t test Methods 0.000 description 1
- 108010061238 threonyl-glycine Proteins 0.000 description 1
- 230000009261 transgenic effect Effects 0.000 description 1
- 108010073969 valyllysine Proteins 0.000 description 1
- 235000013311 vegetables Nutrition 0.000 description 1
- 238000012795 verification Methods 0.000 description 1
Images
Landscapes
- Breeding Of Plants And Reproduction By Means Of Culturing (AREA)
Abstract
本发明公开了调控果型的SlGT‑2蛋白及其同源SlGTL1蛋白。本发明采用CRISPR/Cas9技术,定点编辑SlGT‑2基因和SlGTL1基因,通过造成移码突变,敲除了番茄SlGT‑2基因和SlGTL1基因,使圆形多汁番茄材料转变成方形少汁番茄材料。本发明具有重大的应用推广价值。
Description
技术领域
本发明涉及生物技术领域中果型控制基因SlGT-2及其同源基因与应用。
背景技术
番茄是全球消费最大的蔬菜之一,同时也是一种重要的水果。果型是番茄果实最直观的性状,不仅影响消费者的选择,也会影响到果实的口感或营养品质。各种作物的果实大多是圆形、椭圆形、扁圆形、长形,极少有方形果实。方形作为一种极其稀有的形状,对消费者具有非常大的吸引力,生产上也有很大需求,但是培育方形果实极其困难,目前还没有克隆到一个方形果实调控基因。
发明内容
本发明所要解决的技术问题是如何调控植物果实形状,如何调控果胶含水量。
为了解决以上技术问题,本发明的第一个目的是提供蛋白质,所述蛋白质是SlGT-2和/或其同源蛋白SlGTL1;
所述SlGT-2是如下A1)、A2)或A3)的蛋白质:
A1)氨基酸序列是序列表中序列1的蛋白质;
A2)将序列表中序列1中任一种所示的氨基酸序列经过一个以上氨基酸残基的取代和/或缺失和/或添加得到的与A1)所示的蛋白质衍生得到且与植物果实形状相关的蛋白质;
A3)在A1)或A2)的N末端或/和C末端连接蛋白标签得到的融合蛋白质;
所述SlGTL1是如下A4)、A5)或A6)的蛋白质:
A4)氨基酸序列是序列表中序列2的蛋白质;
A5)将序列表中序列2中任一种所示的氨基酸序列经过一个以上氨基酸残基的取代和/或缺失和/或添加得到的与A4)所示的蛋白质衍生得到且与植物果实形状相关的蛋白质;
A6)在A5)或A6)的N末端或/和C末端连接蛋白标签得到的融合蛋白质。
其中,序列表中的序列1由654个氨基酸残基组成,序列表中的序列2由651个氨基酸残基组成。
上述蛋白质可来源于番茄(Lycopersicon esculentum)。
上述蛋白质可人工合成,也可先合成其编码基因,再进行生物表达得到。
上述蛋白质中,所述蛋白质标签(protein-tag)是指利用DNA体外重组技术,与目的蛋白质一起融合表达的一种多肽或者蛋白质,以便于目的蛋白质的表达、检测、示踪和/或纯化。所述蛋白质标签可为Flag标签、His标签、MBP标签、HA标签、myc标签、GST标签和/或SUMO标签等。
上述蛋白质中,A2)所述蛋白质为SlGT-2+T,所述SlGT-2+T的氨基酸序列是序列表中序列9;A5)所述蛋白质为SlGTL1+AG,所述SlGTL1+AG的氨基酸序列是序列表中序列10。
本发明还提供基因,所述基因为编码所述SlGT-2的SlGT-2基因或编码所述SlGTL1的SlGTL1基因、或编码所述SlGT-2+T的SlGT-2+T基因或编码所述SlGTL1+AG的SlGTL1+AG基因;
所述SlGT-2基因为如下B1)或B2)所示的基因:
B1)编码链的编码序列(CDS)是序列表中序列3的cDNA分子或DNA分子;
B2)编码链的核苷酸是序列表中序列5的DNA分子。
其中,序列表中的序列3由1965个核苷酸组成,编码序列表中的序列1所示的蛋白质。
所述SlGTL1基因为如下B3)或B4)所示的基因:
B3)编码链的编码序列(CDS)是序列表中序列4的cDNA分子或DNA分子;
B4)编码链的核苷酸是序列表中序列6的DNA分子。
其中,序列表中的序列4由1956个核苷酸组成,编码序列表中的序列2所示的蛋白质。
所述SlGT-2+T基因是编码链的核苷酸是序列表中序列11的DNA分子;
所述SlGTL1+AG基因是编码链的核苷酸是序列表中序列12的DNA分子。
本发明还提供了一种调控植物果实形状的方法,包括如下步骤:抑制受体圆形果植物中所述SlGT-2基因及所述SlGTL1基因的表达,得到方形果的目的植物。
上述方法中,所述抑制受体圆形果植物中权利要求2所述SlGT-2基因及权利要求2所述SlGTL1基因的表达是通过对植物中所述SlGT-2基因和所述SlGTL1基因的进行基因编辑实现的。所述基因编辑是借助CRISPR/Cas9系统实现的。
上述方法中,所述CRISPR/Cas9系统包括表达含有Cas9和sgRNA的质粒,所述sgRNA可为针对靶序列1的sgRNA1和针对靶序列2的sgRNA2,所述靶序列1可为序列表中序列5的第40-59位,所述靶序列2可为序列表中序列6的第116-136位。
上述方法中,所述sgRNA1的核苷酸序列如序列表中序列7所示,所述sgRNA2的核苷酸序列如序列表中序列8所示。
上述方法中,所述目的植物为满足如下条件的植物:在靶序列1和靶序列2区域均发生突变的植物。
上述方法中,所述抑制受体圆形果植物中所述SlGT-2基因及所述SlGTL1基因的表达为下述X1和X2:
X1将序列表中序列5所示的所述SlGT-2基因突变为SlGT-2+T基因,所述SlGT-2+T基因的编码序列如序列11所示;
X2将序列表中序列6所示的所述SlGTL1基因突变为SlGTL1+AG基因,所述SlGTL1+AG基因的编码序列如序列12所示。
上述方法中,所述目的植物中突变后的SlGT-2蛋白质为的氨基酸序列如序列9所示,其编码序列如序列11所示;突变后的SlGTL1蛋白质的氨基酸序列如序列10所示,其编码序列如序列12所示。
上述方法中,目的植物可以是所述SlGT-2基因和SlGTL1基因均发生纯合突变的植株的植物。
上文所述植物可为F1)-F4)中的任一种:
F1)管状花目植物;
F2)茄科植物;
F3)番茄属植物;
F4)番茄。
为了解决上述技术问题,本发明还提供了调控植物果型的试剂,所述试剂的活性成分为抑制编码所述SlGT-2基因和所述SlGTL1基因的表达、降低所述SlGT-2蛋白和SlGTL1蛋白的丰度、和/或敲除所述SlGT-2基因和所述SlGTL1基因的物质。
上述试剂中,所述物质含有下述F1)、F2)或F3):
F1)靶向所述基因的sgRNA、siRNA、shRNA、miRNA或反义RNA;
F2)产生靶向所述基因的sgRNA的DNA分子、产生靶向所述基因的siRNA的DNA分子、产生靶向所述基因的shRNA的DNA分子、产生靶向所述基因的miRNA的DNA分子或产生靶向所述基因的反义RNA的DNA分子;
F3)产生靶向所述基因的sgRNA的表达载体、产生靶向所述基因的siRNA的表达载体、产生靶向所述基因的shRNA的表达载体、产生靶向所述基因的miRNA的表达载体或产生靶向所述基因的反义RNA的表达载体。
上述的试剂的活性成分还可含有其他生物成分或/和非生物成分,上述药剂的其他活性成分本领域技术人员可根据植物的果型确定。
本发明还提供所述蛋白、所述基因、或所述试剂在调控植物果实形状和/或果胶含水量中的应用。
本发明还提供所述方法在调控植物果胶含水量中的应用。
本发明的发明人利用本实验室从番茄中分离克隆的一个与果型相关的蛋白SlGT-2及其同源蛋白SlGTL1,采用CRISPR/Cas9技术,定点编辑SlGT-2基因和SlGTL1基因,通过造成移码突变,敲除了番茄中SlGT-2基因和SlGTL1基因,可以使圆形多汁番茄材料转变成方形少汁番茄材料。本发明具有重大的应用推广价值。
附图说明
图1为本发明实施例1中1-SlGT-2-SlGTL1基因编辑植株和对照野生型番茄品种AC果实的外形照片。
图2为本发明实施例1中1-SlGT-2-SlGTL1基因编辑植株和对照野生型番茄品种AC果实的纵切照片。
图3为本发明方度计算公式中果实纵向顶端5%位置的宽度以及果实纵向中间位置的宽度的量取位置。其中,从上往下数第一条横线指示果实纵向顶端5%位置,从上往下数第二条横线指示果实纵向中间位置。
图4为本发明实施例1中1-SlGT-2-SlGTL1基因编辑植株和对照野生型番茄品种AC果实方度统计结果图。所示数据为平均值±标准差,重复数为15,以t-test分析各组显著性差异,**代表显著性分析结果为P<0.01。
具体实施方式
下面结合具体实施方式对本发明进行进一步的详细描述,给出的实施例仅为了阐明本发明,而不是为了限制本发明的范围。以下提供的实施例可作为本技术领域普通技术人员进行进一步改进的指南,并不以任何方式构成对本发明的限制。
下述实施例中的定量试验,均设置三次重复实验,结果取平均值。
下述实施例中的实验方法,如无特殊说明,均为常规方法。下述实施例中所用的材料、试剂等,如无特殊说明,均可从商业途径得到。
下述实施例中的番茄品种AC(Ailsa Craig)为美国番茄遗传资源中心产品(TGRC,http://tgrc.ucdavis.edu/),编号为LA2838A。
下述实施例中的CRISPR/Cas9载体pTX041,记载于非专利文献“Deng et al.,Efficient generation ofpink-fruited tomatoes using CRISPR/Cas9 system.Journalof Genetics and Genomics 45(2018)51-54”,公众可以从中国科学院遗传与发育生物学研究所获得,以重复本申请实验,不可作为其它用途使用。
下述实施例中的种子生长培养基的配制方法为(1L为例):将2.2g MS盐和10g蔗糖溶解于水中定容至1L,用1mol/L KOH调pH至5.8~6.0之间,加入8g琼脂,高压灭菌。
下述实施例中的预培养培养基的配制方法为(1L为例):将4.4g MS盐、1.0mg玉米素(Zeatin)和30g蔗糖溶于水中定容至1L,用1mol/L KOH调pH至5.8~6.0之间,加入8g琼脂,高压灭菌。
下述实施例中的液体MS培养基的配制方法为(1L为例):将4.4g MS盐和30g蔗糖溶解于水中定容至1L,用1mol/L KOH调pH至5.8~6.0之间,高压灭菌。
下述实施例中的筛选分化培养基的配制方法为(1L为例):将4.4g MS盐、2.0mg玉米素、50mg卡那霉素、100mg肌醇、0.5mg叶酸和30g蔗糖溶于水中定容至1L,用1mol/L KOH调pH至5.8~6.0之间,加入8g琼脂,高压灭菌。
下述实施例中的生根培养基的配制方法为(1L为例):将4.4g MS盐、50mg卡那霉素、0.5mg叶酸、0.5mg吲哚丁酸和30g蔗糖溶于水中定容至1L,用1mol/L KOH调pH至5.8~6.0之间,加入8g琼脂,高压灭菌。
实施例1利用CRISPR/Cas9方法编辑番茄SlGT-2基因及其同源基因SlGTL1获得方形少汁果实的番茄
发明人首次在番茄中发现了SlGT-2蛋白和其同源蛋白SlGTL1,该类蛋白控制着方形果实的形成。SlGT-2蛋白的氨基酸序列为序列表序列1,编码SlGT-2蛋白的基因为SlGT-2基因,SlGT-2基因在基因组DNA中起始密码子至终止密码子之间的核苷酸序列为序列表序列5(其中第1-284位和第1037-2717位为外显子),其cDNA中的开放阅读框为序列表的序列3。SlGTL1蛋白的氨基酸序列为序列表序列2,编码SlGTL1蛋白的基因为SlGTL1基因,SlGTL1基因在基因组DNA中起始密码子至终止密码子之间的核苷酸序列为序列表序列6(其中第1-320位和第729-2364位为外显子),其cDNA中的开放阅读框为序列表的序列4。
一、重组载体的构建
1、选定靶序列:针对SlGT-2基因选择序列表中序列5第40-59位作为靶序列1;针对SlGTL1基因,选择序列表中序列6第116-136位作为靶序列2。
合成两条PCR引物1和2,然后从模板质粒pTX041载体上扩出目标片段,目标片段含有两个gRNA及U6启动子序列,将目标片段纯化回收。
两条PCR引物F和R的核苷酸序列分别如下:
F:5’-ATATATGTTTGGGTGATAATCCAGAAAGCGGGTTTTAGAGCTAGAAATAGC-3’(下划线指示的序列与序列表中序列5第40-59位特异结合,波浪线指示的序列为BsaI酶识别位点);
R:5’-ATTATTGAAACACCACTCTCCGGAGCTTCTTCAAACTACACTGTTAGATTC-3’(下划线指示的序列与序列表中序列6第116-136位特异结合,波浪线指示的序列为BsaI酶识别位点)。
2、将以上回收的目标片段用限制性内切酶BsaI进行酶切,纯化回收,得到酶切目标片段。将CRISPR/Cas9空载体pTX041,用限制性内切酶BsaI进行酶切,纯化回收,得到线性化的pTX041载体骨架。
3、将步骤2得到的酶切目标片段与pTX041载体骨架连接,得到将酶切目标片段插入pTX041载体BsaI酶切位点之间,保持pTX041载体的其它序列不变,得到的重组质粒。经测序验证,该重组质粒表达序列表中序列7所示的sgRNA1(针对靶序列1)和序列表中序列8所示的sgRNA2(针对靶序列2),将该重组质粒命名为pTX041-sgRNA1-sgRNA2。
sgRNA1针对靶序列1(序列表中序列5第40-59位),sgRNA1中的靶序列结合区如序列表的序列7第2-21位核苷酸所示。
sgRNA2针对靶序列2(序列表中序列6第116-136位),sgRNA1中的靶序列结合区如序列表的序列8第2-21位核苷酸所示。
二、基因编辑植株的制备
1、取步骤一制备的重组质粒pTX041-sgRNA1-sgRNA2,导入农杆菌GV3101,得到重组农杆菌,命名为GV3101-sgRNA1-sgRNA2。
2、取重组农杆菌GV3101-sgRNA1-sgRNA2,通过农杆菌介导的方法对野生型番茄品种AC进行遗传转化,得到T0代植株,具体过程如下:
(1)取野生型番茄品种AC的种子,挑选饱满、粒大的种子,用75%乙醇水溶液浸泡2min,然后用10%NaClO水溶液浸泡10min,然后用无菌水冲洗7遍,然后将种子播种于种子生长培养基并培养8天。取子叶,在无菌条件下将子叶切成小方块,将子叶块接种于预培养培养基并培养2天,得到的子叶块作为外植体。
(2)取重组农杆菌GV3101-sgRNA1-sgRNA2,用50mL液体MS培养基重悬,得到OD600nm=0.4的菌悬液,然后加入50μL 0.074mol/L乙酰丁香酮水溶液,即为侵染液。
(3)取步骤(1)得到的外植体,在步骤(2)得到的浸染液中浸没10min,然后接种至预培养培养基上培养2天。
(4)完成步骤(3)后,取外植体,接种至筛选分化培养基培养8周(每2周继代一次),此时外植体长出抗性芽。
(5)步骤(4)中抗性芽芽长为3cm时,切取抗性芽,转接至生根培养基进行培养,生根的植株即为T0代植株。
整个过程的培养条件:25℃、16h光照/8h黑暗。
3、从T0代植株中筛选出SlGT-2基因及其同源基因SlGTL1都发生纯合突变的植株。
筛选方法:取植株叶片的基因组DNA,分别用引物对1(由F1和R1组成)及引物对2(由F2和R2组成)两对引物进行PCR扩增,回收PCR扩增产物并进行测序。
F1:5′-GGTGTAATTGCTAACTTGCTTGG-3′;
R1:5′-GCATGATGAAACTGGTGCAGC-3′;
F2:5′-ATGCTTGGTGTTTCTTCAAG-3′;
R2:5′-AACTTCTTCCCATAACGGTCC-3′。
筛选到1株SlGT-2基因及其同源基因SlGTL1都发生纯合突变的植株,命名为植株1。
与野生型番茄品种AC相比,植株1中SlGT-2基因发生了插入一个核苷酸的突变,具体是的序列表中序列5第56位和第57位之间插入了一个核苷酸“T”,且为纯合型突变(即两条染色体发生了相同突变)。上述插入导致基因CDS序列从第57位发生移码突变,提前产生终止密码子,翻译出丧失DNA结合结构域的截短蛋白,以致SlGT-2蛋白质功能的丧失(植株中不含有SlGT-2蛋白)。将突变后的基因命名为SlGT-2+T基因,SlGT-2+T基因编码的蛋白质是SlGT-2+T,SlGT-2+T的氨基酸序列是序列9。同时,与野生型番茄品种AC相比,植株1中SlGTL1基因发生了插入两个核苷酸的突变,具体是的序列表中序列6第131位和第132位之间插入了“AG”两个核苷酸,且为纯合型突变(即两条染色体发生了相同突变)。上述插入导致基因CDS序列从第132位发生移码突变,提前产生终止密码子,翻译出丧失DNA结合结构域的截短蛋白,以致SlGTL1蛋白质功能的丧失(植株中不含有SlGTL1蛋白)。将突变后的基因命名为SlGTL1+T基因,SlGTL1+T基因编码的蛋白质是SlGT-2+T,SlGT-2+T的氨基酸序列是序列10。
也就是,植株1与野生型番茄品种AC相比,对于SlGT-2基因,两条同源染色体中的SlGT-2基因均突变为SlGT-2+T基因,SlGT-2+T基因的编码序列是序列11;对于SlGTL1基因,两条同源染色体中的SlGTL1基因均突变为SlGTL1+AG基因,SlGTL1+AG基因的编码序列是序列12。
4、正常培养植株1,自交获得种子,即为T1代种子,将T1代种子培育为植株群体,即为T1代植株1群体。
5、从T1代植株1群体中筛选不含外源DNA的植株。
筛选方法:将T1代植株1群体的每个单株的叶片单独提取基因组DNA,采用F3和R3组成的引物对进行PCR扩增(F3和R3组成的引物对的靶序列位于CRISPR/Cas9载体pTX041中的Cas9基因,扩增产物预期为约402bp),如果得到扩增产物说明植株中含有外源DNA,如果没有得到扩增产物说明植株中不含外源DNA。
F3:5′-TTGACAAGCTGTTCATCCAG-3′;
R3:5′-CCTTCGTAATCTCGGTGTTC-3′。
T1代植株1群体中约1/4的个体不含外源DNA。
6、取步骤5筛选得到的不含外源DNA的植株,提取叶片的基因组DNA,分别采用F1和R1及F2和R2组成的引物对进行PCR扩增,回收PCR扩增产物并进行测序。
从T1代植株1群体中筛选得到的不含外源DNA的植株,SlGT-2基因及其同源基因SlGTL1均发生了与植株1相同的纯合型插入。结果表明,将步骤一制备的重组质粒pTX041-sgRNA1-sgRNA2导入野生型番茄品种AC产生的突变能够从T0代稳定遗传到T1代。
将从T1代植株1群体中筛选得到的不含外源DNA的植株均命名为1-SlGT-2-SlGTL1基因编辑植株。
三、果实观察
供试植株为:野生型番茄品种AC、1-SlGT-2-SlGTL1基因编辑植株。
1、25℃、16h光照/8h黑暗条件下培养供试植株幼苗,直至植株长至4-5个叶片。
2、完成步骤1后,将植株(每种供试植株10株,且长势一致)移栽到大棚中。株距、行距在60cm以上;供试植株随机分布;正常的水肥管理,保证所有植株水肥条件基本一致。
移栽3个月后,植株开始进入结果期,结果期持续约6个月。整个结果期,收集成熟果实。
1-SlGT-2-SlGTL1基因编辑植株,平均每株植株在整个结果期收获的成熟果实数量为55个,均为方形果实。
野生型番茄品种AC,平均每株植株在整个结果期收获的成熟果实数量为58个,均为圆形果实。
部分果实的示例性果实外形见图1。
部分果实的示例性果实纵切见图2。
方度的计算公式如下:
方度=果实纵向顶端5%位置的宽度/果实纵向中间位置的宽度(见图3)。
每种供试植株的果实方度统计结果见图4。
与野生型番茄品种AC的果实相比,1-SlGT-2-SlGTL1基因编辑植株的果实由圆形转变成方形,且果胶含水量由多变少。
以上结果表明,SlGT-2基因及其同源基因SlGTL1可以调控番茄的果型,通过对SlGT-2基因及其同源基因SlGTL1进行基因编辑可以使圆形多汁番茄材料转变成方形少汁番茄材料。
综上,发明人首次在番茄中发现了SlGT-2蛋白和其同源蛋白SlGTL1,该类蛋白控制着方形果实的形成,利用非转基因的基因编辑技术将这些基因敲除以后就能使圆形果实变成方形,同时果胶状态,特别是含水量发生明显改变。利用基因编辑技术对本发明提供的靶标基因能够快速培育出方形果实,在番茄中只需半年到一年时间。
以上对本发明进行了详述。对于本领域技术人员来说,在不脱离本发明的宗旨和范围,以及无需进行不必要的实验情况下,可在等同参数、浓度和条件下,在较宽范围内实施本发明。虽然本发明给出了特殊的实施例,应该理解为,可以对本发明作进一步的改进。总之,按本发明的原理,本申请欲包括任何变更、用途或对本发明的改进,包括脱离了本申请中已公开范围,而用本领域已知的常规技术进行的改变。按以下附带的权利要求的范围,可以进行一些基本特征的应用。
序列表
<110> 中国科学院遗传与发育生物学研究所
<120> 果型控制基因SlGT-2及其同源基因与应用
<130> GNCSY212244
<160> 12
<170> SIPOSequenceListing 1.0
<210> 1
<211> 654
<212> PRT
<213> 番茄(Lycopersicon esculentum)
<400> 1
Met Leu Gly Val Ser Gly Leu Val Ser Ser Glu Gly Gly Gly Asp Asn
1 5 10 15
Pro Glu Ser Gly Gly Gly Ala Gly Ser Gly Gly Ser Ser Glu Ile Gly
20 25 30
Leu Gly Gly Gly Ser Gly Gly Gly Gly Gly Ser Ser Gly Gly Phe Met
35 40 45
Thr Glu Asp Gly Glu Arg Asn Ser Gly Gly Asn Arg Trp Pro Arg Gln
50 55 60
Glu Thr Ile Ala Leu Leu Lys Ile Arg Ser Glu Met Asp Val Ile Phe
65 70 75 80
Arg Asp Ser Ser Leu Lys Gly Pro Leu Trp Glu Glu Val Ser Arg Lys
85 90 95
Met Ala Asp Leu Gly Phe His Arg Ser Ser Lys Lys Cys Lys Glu Lys
100 105 110
Phe Glu Asn Val Tyr Lys Tyr His Lys Arg Thr Lys Asp Gly Arg Ala
115 120 125
Ser Lys Ala Asp Gly Lys Asn Tyr Arg Phe Phe Glu Gln Leu Glu Ala
130 135 140
Leu Glu Asn Ile Thr Ser His His Ser Leu Met Pro Val Pro Ser Ser
145 150 155 160
Asn Thr Arg Pro Pro Pro Pro Pro Leu Glu Ala Thr Pro Ile Asn Met
165 170 175
Ala Met Pro Met Ala Ser Ser Asn Val Gln Val Thr Ala Ser Gln Gly
180 185 190
Thr Ile Pro His His Val Thr Ile Ser Ser Ala Pro Pro Pro Pro Asn
195 200 205
Ser Leu Phe Ala Pro Ser His Gln Asn Ala Pro Ser Ser Ser Pro Val
210 215 220
Pro Leu Pro Pro Pro Pro Ser Gln Gln Pro Ser Pro Gln Pro Ala Val
225 230 235 240
Asn Pro Ile Asn Asn Ile Pro Gln Gln Val Asn Ala Ser Ala Met Ser
245 250 255
Tyr Ser Thr Ser Ser Ser Thr Ser Ser Asp Glu Asp Ile Gln Arg Arg
260 265 270
His Lys Lys Lys Arg Lys Trp Lys Asp Tyr Phe Glu Lys Phe Thr Lys
275 280 285
Asp Val Ile Asn Lys Gln Glu Glu Ser His Arg Arg Phe Leu Glu Lys
290 295 300
Leu Glu Lys Arg Glu His Asp Arg Met Val Arg Glu Glu Ala Trp Lys
305 310 315 320
Val Glu Glu Met Ala Arg Met Asn Arg Glu His Asp Leu Leu Val Gln
325 330 335
Glu Arg Ala Met Ala Ala Ala Lys Asp Ala Ala Val Ile Ser Phe Leu
340 345 350
Gln Lys Ile Thr Glu Gln Gln Asn Ile Gln Ile Pro Asn Ser Ile Asn
355 360 365
Val Gly Pro Pro Ser Ala Gln Val Gln Ile Gln Leu Pro Glu Asn Pro
370 375 380
Leu Ser Ala Pro Val Pro Thr Gln Ile Gln Pro Thr Thr Val Thr Ala
385 390 395 400
Ala Ala Pro Pro Gln Pro Ala Pro Val Pro Val Ser Leu Pro Val Thr
405 410 415
Ile Pro Ala Pro Val Pro Ala Leu Ile Pro Ser Leu Ser Leu Pro Leu
420 425 430
Thr Pro Pro Val Pro Ser Lys Asn Met Glu Leu Val Pro Lys Ser Asp
435 440 445
Asn Gly Gly Asp Ser Tyr Ser Pro Ala Ser Ser Ser Arg Trp Pro Lys
450 455 460
Ala Glu Val Glu Ala Leu Ile Lys Leu Arg Thr Asn Leu Asp Val Lys
465 470 475 480
Tyr Gln Glu Asn Gly Pro Lys Gly Pro Leu Trp Glu Glu Ile Ser Ser
485 490 495
Gly Met Lys Lys Ile Gly Tyr Asn Arg Asn Ala Lys Arg Cys Lys Glu
500 505 510
Lys Trp Glu Asn Ile Asn Lys Tyr Phe Lys Lys Val Lys Glu Ser Asn
515 520 525
Lys Lys Arg Pro Glu Asp Ser Lys Thr Cys Pro Tyr Phe His Gln Leu
530 535 540
Asp Ala Leu Tyr Lys Glu Lys Ala Lys Asn Pro Glu Thr Ala Ser Ser
545 550 555 560
Thr Ser Ser Phe Asn Pro Ser Phe Ala Leu Asn Pro Asp Asn Asn Gln
565 570 575
Met Ala Pro Ile Met Ala Arg Pro Glu Gln Gln Trp Pro Leu Pro Gln
580 585 590
His His Glu Ser Thr Thr Arg Ile Asp His Glu Asn Glu Ser Asp Asn
595 600 605
Met Asp Glu Asp Asp His Asp Asp Glu Glu Asp Glu Asp Asp Glu Asp
610 615 620
Glu Asn Asn Ala Tyr Glu Ile Val Ala Asn Lys Gln Gln Ser Ser Met
625 630 635 640
Ala Ala Ala Asn Thr Thr Thr Ser Thr Ala Thr Thr Thr Val
645 650
<210> 2
<211> 651
<212> PRT
<213> 番茄(Lycopersicon esculentum)
<400> 2
Met Leu Gly Val Ser Ser Ser Leu Ile Ala Ser Ser Asn Thr Ser Ile
1 5 10 15
Thr Ala Gly Ala Ala Gly Asp Gly Ala Ala Ile Ser Ala Ala Pro Ser
20 25 30
Gln Leu Ala Pro Pro Pro Gln Glu Ala Pro Glu Ser Gly Gly Ser Ser
35 40 45
Glu Gly Gly Gly Gly Gly Gly Asp Leu Ser Ile Gly Gly Glu Asp Gly
50 55 60
Glu Arg Asn Ser Gly Gly Asn Arg Trp Pro Arg Gln Glu Thr Leu Ala
65 70 75 80
Leu Leu Lys Ile Arg Ser Glu Met Asp Val Val Phe Lys Asp Ser Ser
85 90 95
Leu Lys Gly Pro Leu Trp Glu Glu Val Ser Arg Lys Leu Ala Glu Leu
100 105 110
Gly Tyr His Arg Ser Ala Lys Lys Cys Lys Glu Lys Phe Glu Asn Val
115 120 125
Tyr Lys Tyr His Arg Arg Thr Lys Asp Gly Arg Ala Ser Lys Ala Asp
130 135 140
Gly Lys Thr Tyr Arg Phe Phe Asp Gln Leu Gln Ala Leu Glu Asn Asn
145 150 155 160
Pro Ser Ser His Ser Asn Ile Pro Pro Pro Pro Leu Ala Ala Thr Pro
165 170 175
Ile Thr Met Ala Met Pro Met Arg Ser Gly Asn Asn Ser Ala Asn Pro
180 185 190
Pro Met Pro Thr Pro Thr Pro Thr Pro Gln Asn His Asn His Phe Phe
195 200 205
Ser Val Ser Gln Lys Ser Val Val Thr Gly Ala Ala Gln Pro Ala Val
210 215 220
Met Thr Ala Pro Ala Leu Pro Leu Ser Gln Val Pro Ile Gly Asn Asn
225 230 235 240
Asn Leu Asn Gln Met His Arg Pro Gln Gly Asn Thr Thr Thr Thr Lys
245 250 255
Thr Ser Phe Leu Ser Asn Ser Thr Ser Ser Ser Ser Ser Thr Ser Ser
260 265 270
Asp Glu Asp Ile Gln Arg Arg Gln Met Lys Lys Arg Lys Trp Lys Glu
275 280 285
Phe Phe Glu Ser Leu Met Lys Asp Val Ile Glu Lys Gln Glu Glu Leu
290 295 300
Gln Lys Lys Phe Leu Glu Thr Leu Glu Lys Arg Glu Arg Asp Arg Leu
305 310 315 320
Met Arg Glu Glu Ala Trp Arg Val Gln Glu Met Ala Arg Leu Asn Arg
325 330 335
Glu His Asp Leu Leu Val Gln Glu Arg Ser Met Ala Ala Ala Lys Asp
340 345 350
Ala Thr Ile Ile Ala Phe Leu Gln Lys Ile Thr Glu Gln Gln Asn Thr
355 360 365
Gln Thr Pro Asn Ser Thr Asn Asn Thr Ser Pro Ser Pro Phe Pro Ile
370 375 380
Ala Gln Ile Gln Leu Lys Leu Ser Glu Lys Pro Phe Ser Thr Pro Pro
385 390 395 400
Gln Pro Gln Pro Gln Pro Ser Ala Thr Ala Val Ser Leu Pro Met Thr
405 410 415
Ile His Thr Pro Thr Pro Ala Pro Pro Gln Thr Leu Thr Leu Pro Val
420 425 430
Val Ser Ser Lys Ser Leu Glu Pro Pro Lys Ser Asp Asn Gly Gly Glu
435 440 445
Asn Phe Ser Pro Ala Ser Ser Ser Arg Trp Pro Lys Glu Glu Ile Glu
450 455 460
Ala Leu Ile Ser Leu Arg Thr Cys Leu Asp Leu Lys Tyr Gln Glu Asn
465 470 475 480
Gly Pro Lys Gly Pro Leu Trp Glu Glu Ile Ser Ser Gly Met Arg Lys
485 490 495
Ile Gly Tyr Asn Arg Asn Ala Lys Arg Cys Lys Glu Lys Trp Glu Asn
500 505 510
Ile Asn Lys Tyr Phe Lys Lys Val Lys Glu Ser Asn Lys Lys Arg Pro
515 520 525
Glu Asp Ser Lys Thr Cys Pro Tyr Phe His Gln Leu Glu Ala Leu Tyr
530 535 540
Lys Glu Lys Ala Lys Leu Glu Pro Val Pro His Asn Thr Thr Phe Gly
545 550 555 560
Leu Thr Pro Gln Asn Asn Pro Pro Pro Pro Pro Pro Pro Ile Met Ala
565 570 575
Gln Pro Glu Gln Gln Trp Pro Ile Pro Gln Asn Gln Leu His Gln Gln
580 585 590
Asn Arg Asp His His His Asp Asn Glu Ser Asp Ser Met Asp His Asp
595 600 605
Leu Glu Glu Asp Glu Asp Glu Asp Glu Glu Asp Glu Gly Asn Gly Tyr
610 615 620
Glu Ile Ile Ile Thr Asn Lys Gln Gln Ser Ser Ser Met Ala Ala Thr
625 630 635 640
Pro Val Thr Thr Thr Thr Ser Ala Ala Ala Val
645 650
<210> 3
<211> 1965
<212> DNA
<213> 番茄(Lycopersicon esculentum)
<400> 3
atgctgggcg tttccggctt agtaagtagt gaaggtggtg gtgataatcc agaaagcggt 60
ggaggagctg gaagcggagg gagtagtgag attggattag gcggtggaag tggcggcggt 120
ggaggtagta gcggtggatt catgacggaa gatggagaaa gaaattcagg tggaaataga 180
tggccaagac aagaaacaat tgctttgctg aaaataaggt ctgaaatgga tgttattttt 240
agagactcaa gtcttaaagg acctttatgg gaagaagttt ccaggaaaat ggcagacctt 300
gggttccaca gaagttccaa gaaatgcaag gagaagttcg aaaatgtata caaatatcac 360
aagagaacca aggatggccg agcatcgaaa gcggatggaa agaattatag gtttttcgag 420
caattggaag ccctggagaa cattacatct catcattctc taatgccagt accgtcgtct 480
aatacgcgtc ctccaccccc tccgttggaa gctactccaa taaatatggc tatgccaatg 540
gcatcatcaa atgtacaagt cacggcttca caaggtacta ttcctcatca tgttactatt 600
tcatcagcac caccgccacc gaatagcctt tttgctcctt ctcatcaaaa tgctccgtca 660
agttcacccg tgccactacc accaccgcca tcacagcaac catcaccgca gccagctgtc 720
aatccgatta ataatattcc tcaacaagtg aacgcttcag caatgtcgta ttcaacttct 780
tcgtctactt cctcggatga ggatatacaa agaaggcata agaagaagag gaaatggaag 840
gattattttg agaagttcac caaggatgtg attaataagc aggaggaatc gcacaggagg 900
ttcttggaga agcttgagaa gcgggaacat gatcggatgg ttcgagaaga agcatggaaa 960
gtagaggaaa tggcaaggat gaatagggag catgatcttt tagttcaaga aagagcaatg 1020
gcggcagcca aggatgcagc tgttatttct tttttacaaa agataactga acagcaaaac 1080
attcaaattc caaatagtat caacgttggc cctccatcag cacaagtaca aatacaattg 1140
cctgaaaacc cactatccgc gcctgtacca acacaaatac aaccgacgac tgttacagca 1200
gcagcaccac ctcaaccagc accggtccca gtatcgttgc cagtaacaat accagctcca 1260
gtaccagcat taataccatc attgtcgcta ccactgacac caccagtgcc atccaagaac 1320
atggagttag taccaaaaag cgataacgga ggtgatagtt acagtccagc aagctcttca 1380
aggtggccaa aagcagaagt tgaagcattg attaaacttc gtacaaattt agatgtcaaa 1440
taccaagaga acggacctaa aggtccactt tgggaagaga tatcatctgg aatgaagaaa 1500
attggataca atcggaatgc aaagagatgc aaagaaaaat gggaaaacat caacaaatac 1560
ttcaagaagg tgaaggagag caacaaaaaa cgacccgaag attccaaaac ttgcccatat 1620
ttccaccagc tcgatgcact gtacaaggag aaagccaaaa accccgaaac agcttcttca 1680
acgtcttcgt tcaatccttc attcgcttta aaccccgata acaaccaaat ggctcccatc 1740
atggctcgtc cagaacagca atggccactt ccacaacacc atgaaagcac cacccgtatc 1800
gaccacgaaa acgagagcga caacatggat gaagatgatc acgatgatga ggaggatgaa 1860
gatgacgagg acgaaaacaa cgcttatgag atagtagcaa acaagcaaca atcctcaatg 1920
gcggccgcaa acaccactac cagcaccgca acaacaacag tttga 1965
<210> 4
<211> 1956
<212> DNA
<213> 番茄(Lycopersicon esculentum)
<400> 4
atgcttggtg tttcttcaag tttaatagct agcagtaata ctagtattac tgctggtgct 60
gcaggtgatg gagctgccat ttcggcagct ccatcacagt tagcaccgcc accacaagaa 120
gctccggaga gtggtgggag tagtgaaggt ggtggcggtg gaggagattt gtcgattggc 180
ggtgaagatg gagaaaggaa ctcaggtgga aatcgatggc caaggcaaga aactttagct 240
ttactgaaaa ttagatcgga aatggatgtt gttttcaaag attcaagtct taaaggaccg 300
ttatgggaag aagtttccag aaaactcgcg gagttgggtt atcatcgaag tgctaagaaa 360
tgtaaagaga aattcgagaa tgtttacaag tatcacagga gaaccaaaga tggtcgtgct 420
tcgaaagcag atggaaaaac ttatcgattc tttgatcagt tacaggcttt ggaaaacaat 480
ccatcttctc attctaacat accgccacct ccattagcag caacacccat aacaatggca 540
atgccaatgc gatcaggaaa caattcagca aatcctccaa tgccgacgcc aacgccaact 600
ccacaaaatc ataatcattt ttttagtgtt tcgcagaaaa gtgttgtgac aggagcagcg 660
cagcctgctg ttatgactgc acctgcgctg ccactgtcac aagtgccgat aggtaataat 720
aacttgaacc agatgcatcg gcctcaaggt aatactacta ctacaaaaac aagtttcctg 780
tcgaattcaa cttcatcatc atcttcaact tcgtcggatg aggatataca aaggaggcag 840
atgaagaagc ggaaatggaa ggaattcttt gagagtttaa tgaaggatgt gattgagaag 900
caagaggaat tgcagaagaa gtttttggaa acgctcgaga agcgcgagag ggataggttg 960
atgagagagg aggcatggag agtgcaagag atggctagat tgaataggga acatgatctt 1020
ttagtccaag agagatcaat ggcagcagct aaagacgcaa caatcatcgc cttcttgcaa 1080
aaaataactg aacagcaaaa cacacaaacc ccgaatagta caaataacac ttctccttct 1140
ccttttccaa ttgctcaaat tcaattaaaa ttgtccgaaa agccattcag tacaccacca 1200
caaccacaac cacaaccatc agctaccgcg gtatcactgc caatgacaat acatacacca 1260
acaccagcac caccacagac actgacatta cctgtagtat catcaaaatc acttgaacct 1320
ccaaaatccg ataatggtgg tgagaatttc tctccagcaa gctcgtcaag atggccgaaa 1380
gaagaaatcg aagcattgat aagtctccga acctgtttag atctaaaata ccaagaaaat 1440
ggaccgaaag gaccactgtg ggaagaaatt tcatctggaa tgagaaagat aggatacaac 1500
aggaatgcaa agagatgcaa ggaaaaatgg gagaacatca acaagtactt caagaaggta 1560
aaagaaagca acaaaaaaag accagaagat tccaaaactt gcccatattt ccaccagctg 1620
gaagcactgt acaaagaaaa agccaagctc gaacctgtac cacacaacac taccttcgga 1680
ttaacacccc aaaacaatcc tcctcctcct cctcctccca tcatggctca acccgagcaa 1740
caatggccaa ttcctcaaaa tcaacttcac cagcaaaatc gtgatcatca tcacgataat 1800
gaaagcgaca gcatggatca cgatttggaa gaggacgagg atgaggacga agaagatgaa 1860
ggtaatggct atgaaataat aatcacaaat aaacaacaat catcatcaat ggcggctacc 1920
ccagtaacaa caacaacttc tgctgctgca gtttaa 1956
<210> 5
<211> 2717
<212> DNA
<213> 番茄(Lycopersicon esculentum)
<400> 5
atgctgggcg tttccggctt agtaagtagt gaaggtggtg gtgataatcc agaaagcggt 60
ggaggagctg gaagcggagg gagtagtgag attggattag gcggtggaag tggcggcggt 120
ggaggtagta gcggtggatt catgacggaa gatggagaaa gaaattcagg tggaaataga 180
tggccaagac aagaaacaat tgctttgctg aaaataaggt ctgaaatgga tgttattttt 240
agagactcaa gtcttaaagg acctttatgg gaagaagttt ccaggtaatt aaattcaatt 300
tcattattcc aatttcttca cctgaccttc tcaatcatta ttaagctgca ccagtttcat 360
catgcataaa taaaaattga tagaaatgga atctttattt aatttttttt ttcaatttct 420
acttttggga aaaaaataat taatagaatg atttttattt tttgggaaat gaaaagatag 480
atctatggat cagaattcca ttgatttatt gcttttttga ttaaaagggt tattgttttt 540
cagttcattt cactacaaac aatacaacaa aaatacaatt gttgaggaaa ttcagattcc 600
ctccttccgg gttttgagcc aaattcagtt ttgctttttt ggcgtttttt ctttctctgc 660
caattccagc aacaaatttt ggaaactaat ttactcatct tttttgtatt agagttccaa 720
ctttatgaac tacctttttt taaatttagc aaataaataa gtttggtaat catcaaatct 780
aataattaag caagtaaaaa aacaagattt atgattgaga aaaatgtggt ttccatagag 840
tgtttcaatt gtctcctact tgtttaatta attgatttct taattacctt aatcttgatt 900
aataatctca tttttatttt atgtggtgaa tagtatttta ctattgaatt caattaccaa 960
ggatttaaat tattgtactt gtttatttac taccattttt tctaatactt atgccaactg 1020
ttgttatcat gagcaggaaa atggcagacc ttgggttcca cagaagttcc aagaaatgca 1080
aggagaagtt cgaaaatgta tacaaatatc acaagagaac caaggatggc cgagcatcga 1140
aagcggatgg aaagaattat aggtttttcg agcaattgga agccctggag aacattacat 1200
ctcatcattc tctaatgcca gtaccgtcgt ctaatacgcg tcctccaccc cctccgttgg 1260
aagctactcc aataaatatg gctatgccaa tggcatcatc aaatgtacaa gtcacggctt 1320
cacaaggtac tattcctcat catgttacta tttcatcagc accaccgcca ccgaatagcc 1380
tttttgctcc ttctcatcaa aatgctccgt caagttcacc cgtgccacta ccaccaccgc 1440
catcacagca accatcaccg cagccagctg tcaatccgat taataatatt cctcaacaag 1500
tgaacgcttc agcaatgtcg tattcaactt cttcgtctac ttcctcggat gaggatatac 1560
aaagaaggca taagaagaag aggaaatgga aggattattt tgagaagttc accaaggatg 1620
tgattaataa gcaggaggaa tcgcacagga ggttcttgga gaagcttgag aagcgggaac 1680
atgatcggat ggttcgagaa gaagcatgga aagtagagga aatggcaagg atgaataggg 1740
agcatgatct tttagttcaa gaaagagcaa tggcggcagc caaggatgca gctgttattt 1800
cttttttaca aaagataact gaacagcaaa acattcaaat tccaaatagt atcaacgttg 1860
gccctccatc agcacaagta caaatacaat tgcctgaaaa cccactatcc gcgcctgtac 1920
caacacaaat acaaccgacg actgttacag cagcagcacc acctcaacca gcaccggtcc 1980
cagtatcgtt gccagtaaca ataccagctc cagtaccagc attaatacca tcattgtcgc 2040
taccactgac accaccagtg ccatccaaga acatggagtt agtaccaaaa agcgataacg 2100
gaggtgatag ttacagtcca gcaagctctt caaggtggcc aaaagcagaa gttgaagcat 2160
tgattaaact tcgtacaaat ttagatgtca aataccaaga gaacggacct aaaggtccac 2220
tttgggaaga gatatcatct ggaatgaaga aaattggata caatcggaat gcaaagagat 2280
gcaaagaaaa atgggaaaac atcaacaaat acttcaagaa ggtgaaggag agcaacaaaa 2340
aacgacccga agattccaaa acttgcccat atttccacca gctcgatgca ctgtacaagg 2400
agaaagccaa aaaccccgaa acagcttctt caacgtcttc gttcaatcct tcattcgctt 2460
taaaccccga taacaaccaa atggctccca tcatggctcg tccagaacag caatggccac 2520
ttccacaaca ccatgaaagc accacccgta tcgaccacga aaacgagagc gacaacatgg 2580
atgaagatga tcacgatgat gaggaggatg aagatgacga ggacgaaaac aacgcttatg 2640
agatagtagc aaacaagcaa caatcctcaa tggcggccgc aaacaccact accagcaccg 2700
caacaacaac agtttga 2717
<210> 6
<211> 2364
<212> DNA
<213> 番茄(Lycopersicon esculentum)
<400> 6
atgcttggtg tttcttcaag tttaatagct agcagtaata ctagtattac tgctggtgct 60
gcaggtgatg gagctgccat ttcggcagct ccatcacagt tagcaccgcc accacaagaa 120
gctccggaga gtggtgggag tagtgaaggt ggtggcggtg gaggagattt gtcgattggc 180
ggtgaagatg gagaaaggaa ctcaggtgga aatcgatggc caaggcaaga aactttagct 240
ttactgaaaa ttagatcgga aatggatgtt gttttcaaag attcaagtct taaaggaccg 300
ttatgggaag aagtttccag gtactgtttt tttttggtca ttttgattaa ctctttcatc 360
atcatcatat gcatagaatt cagaaaatta aaagatctat tatatttgga attaaaattc 420
gtatttgatg gaaagtgtta ttttttttta gttttgttgt tataccattt tctgctgatt 480
ccagcaagaa atttaggatg agtttaattt ctctactcat cttcaacact ttttgtgctt 540
ttccctattt tccagaaaat tcaagctaag atgttgatga ttgatggttt attatgtttt 600
attttatgta tataaaaata gtatgagatt ttgttttttt attactaatg aatgatgaat 660
atgagatgag ataattaaga caaggtgttt ttctttttgt atccattttg aactttgttg 720
tttatcagaa aactcgcgga gttgggttat catcgaagtg ctaagaaatg taaagagaaa 780
ttcgagaatg tttacaagta tcacaggaga accaaagatg gtcgtgcttc gaaagcagat 840
ggaaaaactt atcgattctt tgatcagtta caggctttgg aaaacaatcc atcttctcat 900
tctaacatac cgccacctcc attagcagca acacccataa caatggcaat gccaatgcga 960
tcaggaaaca attcagcaaa tcctccaatg ccgacgccaa cgccaactcc acaaaatcat 1020
aatcattttt ttagtgtttc gcagaaaagt gttgtgacag gagcagcgca gcctgctgtt 1080
atgactgcac ctgcgctgcc actgtcacaa gtgccgatag gtaataataa cttgaaccag 1140
atgcatcggc ctcaaggtaa tactactact acaaaaacaa gtttcctgtc gaattcaact 1200
tcatcatcat cttcaacttc gtcggatgag gatatacaaa ggaggcagat gaagaagcgg 1260
aaatggaagg aattctttga gagtttaatg aaggatgtga ttgagaagca agaggaattg 1320
cagaagaagt ttttggaaac gctcgagaag cgcgagaggg ataggttgat gagagaggag 1380
gcatggagag tgcaagagat ggctagattg aatagggaac atgatctttt agtccaagag 1440
agatcaatgg cagcagctaa agacgcaaca atcatcgcct tcttgcaaaa aataactgaa 1500
cagcaaaaca cacaaacccc gaatagtaca aataacactt ctccttctcc ttttccaatt 1560
gctcaaattc aattaaaatt gtccgaaaag ccattcagta caccaccaca accacaacca 1620
caaccatcag ctaccgcggt atcactgcca atgacaatac atacaccaac accagcacca 1680
ccacagacac tgacattacc tgtagtatca tcaaaatcac ttgaacctcc aaaatccgat 1740
aatggtggtg agaatttctc tccagcaagc tcgtcaagat ggccgaaaga agaaatcgaa 1800
gcattgataa gtctccgaac ctgtttagat ctaaaatacc aagaaaatgg accgaaagga 1860
ccactgtggg aagaaatttc atctggaatg agaaagatag gatacaacag gaatgcaaag 1920
agatgcaagg aaaaatggga gaacatcaac aagtacttca agaaggtaaa agaaagcaac 1980
aaaaaaagac cagaagattc caaaacttgc ccatatttcc accagctgga agcactgtac 2040
aaagaaaaag ccaagctcga acctgtacca cacaacacta ccttcggatt aacaccccaa 2100
aacaatcctc ctcctcctcc tcctcccatc atggctcaac ccgagcaaca atggccaatt 2160
cctcaaaatc aacttcacca gcaaaatcgt gatcatcatc acgataatga aagcgacagc 2220
atggatcacg atttggaaga ggacgaggat gaggacgaag aagatgaagg taatggctat 2280
gaaataataa tcacaaataa acaacaatca tcatcaatgg cggctacccc agtaacaaca 2340
acaacttctg ctgctgcagt ttaa 2364
<210> 7
<211> 97
<212> RNA
<213> 人工序列(Artificial Sequence)
<400> 7
gggugauaau ccagaaagcg gguuuuagag cuagaaauag caaguuaaaa uaaggcuagu 60
ccguuaucaa cuugaaaaag uggcaccgag ucggugc 97
<210> 8
<211> 97
<212> RNA
<213> 人工序列(Artificial Sequence)
<400> 8
gaagaagcuc cggagagugg uguuuuagag cuagaaauag caaguuaaaa uaaggcuagu 60
ccguuaucaa cuugaaaaag uggcaccgag ucggugc 97
<210> 9
<211> 28
<212> PRT
<213> 人工序列(Artificial Sequence)
<400> 9
Met Leu Gly Val Ser Gly Leu Val Ser Ser Glu Gly Gly Gly Asp Asn
1 5 10 15
Pro Glu Ser Arg Trp Arg Ser Trp Lys Arg Arg Glu
20 25
<210> 10
<211> 79
<212> PRT
<213> 人工序列(Artificial Sequence)
<400> 10
Met Leu Gly Val Ser Ser Ser Leu Ile Ala Ser Ser Asn Thr Ser Ile
1 5 10 15
Thr Ala Gly Ala Ala Gly Asp Gly Ala Ala Ile Ser Ala Ala Pro Ser
20 25 30
Gln Leu Ala Pro Pro Pro Gln Glu Ala Pro Glu Arg Val Val Gly Val
35 40 45
Val Lys Val Val Ala Val Glu Glu Ile Cys Arg Leu Ala Val Lys Met
50 55 60
Glu Lys Gly Thr Gln Val Glu Ile Asp Gly Gln Gly Lys Lys Leu
65 70 75
<210> 11
<211> 1966
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 11
atgctgggcg tttccggctt agtaagtagt gaaggtggtg gtgataatcc agaaagtcgg 60
tggaggagct ggaagcggag ggagtagtga gattggatta ggcggtggaa gtggcggcgg 120
tggaggtagt agcggtggat tcatgacgga agatggagaa agaaattcag gtggaaatag 180
atggccaaga caagaaacaa ttgctttgct gaaaataagg tctgaaatgg atgttatttt 240
tagagactca agtcttaaag gacctttatg ggaagaagtt tccaggaaaa tggcagacct 300
tgggttccac agaagttcca agaaatgcaa ggagaagttc gaaaatgtat acaaatatca 360
caagagaacc aaggatggcc gagcatcgaa agcggatgga aagaattata ggtttttcga 420
gcaattggaa gccctggaga acattacatc tcatcattct ctaatgccag taccgtcgtc 480
taatacgcgt cctccacccc ctccgttgga agctactcca ataaatatgg ctatgccaat 540
ggcatcatca aatgtacaag tcacggcttc acaaggtact attcctcatc atgttactat 600
ttcatcagca ccaccgccac cgaatagcct ttttgctcct tctcatcaaa atgctccgtc 660
aagttcaccc gtgccactac caccaccgcc atcacagcaa ccatcaccgc agccagctgt 720
caatccgatt aataatattc ctcaacaagt gaacgcttca gcaatgtcgt attcaacttc 780
ttcgtctact tcctcggatg aggatataca aagaaggcat aagaagaaga ggaaatggaa 840
ggattatttt gagaagttca ccaaggatgt gattaataag caggaggaat cgcacaggag 900
gttcttggag aagcttgaga agcgggaaca tgatcggatg gttcgagaag aagcatggaa 960
agtagaggaa atggcaagga tgaataggga gcatgatctt ttagttcaag aaagagcaat 1020
ggcggcagcc aaggatgcag ctgttatttc ttttttacaa aagataactg aacagcaaaa 1080
cattcaaatt ccaaatagta tcaacgttgg ccctccatca gcacaagtac aaatacaatt 1140
gcctgaaaac ccactatccg cgcctgtacc aacacaaata caaccgacga ctgttacagc 1200
agcagcacca cctcaaccag caccggtccc agtatcgttg ccagtaacaa taccagctcc 1260
agtaccagca ttaataccat cattgtcgct accactgaca ccaccagtgc catccaagaa 1320
catggagtta gtaccaaaaa gcgataacgg aggtgatagt tacagtccag caagctcttc 1380
aaggtggcca aaagcagaag ttgaagcatt gattaaactt cgtacaaatt tagatgtcaa 1440
ataccaagag aacggaccta aaggtccact ttgggaagag atatcatctg gaatgaagaa 1500
aattggatac aatcggaatg caaagagatg caaagaaaaa tgggaaaaca tcaacaaata 1560
cttcaagaag gtgaaggaga gcaacaaaaa acgacccgaa gattccaaaa cttgcccata 1620
tttccaccag ctcgatgcac tgtacaagga gaaagccaaa aaccccgaaa cagcttcttc 1680
aacgtcttcg ttcaatcctt cattcgcttt aaaccccgat aacaaccaaa tggctcccat 1740
catggctcgt ccagaacagc aatggccact tccacaacac catgaaagca ccacccgtat 1800
cgaccacgaa aacgagagcg acaacatgga tgaagatgat cacgatgatg aggaggatga 1860
agatgacgag gacgaaaaca acgcttatga gatagtagca aacaagcaac aatcctcaat 1920
ggcggccgca aacaccacta ccagcaccgc aacaacaaca gtttga 1966
<210> 12
<211> 1958
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 12
atgcttggtg tttcttcaag tttaatagct agcagtaata ctagtattac tgctggtgct 60
gcaggtgatg gagctgccat ttcggcagct ccatcacagt tagcaccgcc accacaagaa 120
gctccggaga gagtggtggg agtagtgaag gtggtggcgg tggaggagat ttgtcgattg 180
gcggtgaaga tggagaaagg aactcaggtg gaaatcgatg gccaaggcaa gaaactttag 240
ctttactgaa aattagatcg gaaatggatg ttgttttcaa agattcaagt cttaaaggac 300
cgttatggga agaagtttcc agaaaactcg cggagttggg ttatcatcga agtgctaaga 360
aatgtaaaga gaaattcgag aatgtttaca agtatcacag gagaaccaaa gatggtcgtg 420
cttcgaaagc agatggaaaa acttatcgat tctttgatca gttacaggct ttggaaaaca 480
atccatcttc tcattctaac ataccgccac ctccattagc agcaacaccc ataacaatgg 540
caatgccaat gcgatcagga aacaattcag caaatcctcc aatgccgacg ccaacgccaa 600
ctccacaaaa tcataatcat ttttttagtg tttcgcagaa aagtgttgtg acaggagcag 660
cgcagcctgc tgttatgact gcacctgcgc tgccactgtc acaagtgccg ataggtaata 720
ataacttgaa ccagatgcat cggcctcaag gtaatactac tactacaaaa acaagtttcc 780
tgtcgaattc aacttcatca tcatcttcaa cttcgtcgga tgaggatata caaaggaggc 840
agatgaagaa gcggaaatgg aaggaattct ttgagagttt aatgaaggat gtgattgaga 900
agcaagagga attgcagaag aagtttttgg aaacgctcga gaagcgcgag agggataggt 960
tgatgagaga ggaggcatgg agagtgcaag agatggctag attgaatagg gaacatgatc 1020
ttttagtcca agagagatca atggcagcag ctaaagacgc aacaatcatc gccttcttgc 1080
aaaaaataac tgaacagcaa aacacacaaa ccccgaatag tacaaataac acttctcctt 1140
ctccttttcc aattgctcaa attcaattaa aattgtccga aaagccattc agtacaccac 1200
cacaaccaca accacaacca tcagctaccg cggtatcact gccaatgaca atacatacac 1260
caacaccagc accaccacag acactgacat tacctgtagt atcatcaaaa tcacttgaac 1320
ctccaaaatc cgataatggt ggtgagaatt tctctccagc aagctcgtca agatggccga 1380
aagaagaaat cgaagcattg ataagtctcc gaacctgttt agatctaaaa taccaagaaa 1440
atggaccgaa aggaccactg tgggaagaaa tttcatctgg aatgagaaag ataggataca 1500
acaggaatgc aaagagatgc aaggaaaaat gggagaacat caacaagtac ttcaagaagg 1560
taaaagaaag caacaaaaaa agaccagaag attccaaaac ttgcccatat ttccaccagc 1620
tggaagcact gtacaaagaa aaagccaagc tcgaacctgt accacacaac actaccttcg 1680
gattaacacc ccaaaacaat cctcctcctc ctcctcctcc catcatggct caacccgagc 1740
aacaatggcc aattcctcaa aatcaacttc accagcaaaa tcgtgatcat catcacgata 1800
atgaaagcga cagcatggat cacgatttgg aagaggacga ggatgaggac gaagaagatg 1860
aaggtaatgg ctatgaaata ataatcacaa ataaacaaca atcatcatca atggcggcta 1920
ccccagtaac aacaacaact tctgctgctg cagtttaa 1958
Claims (11)
1.蛋白质,其特征在于,包括如下步骤:所述蛋白质是SlGT-2和/或其同源蛋白SlGTL1;
所述SlGT-2是如下A1)、A2)或A3)的蛋白质:
A1)氨基酸序列是序列表中序列1的蛋白质;
A2)将序列表中序列1中任一种所示的氨基酸序列经过一个以上氨基酸残基的取代和/或缺失和/或添加得到的与A1)所示的蛋白质衍生得到且与植物果实形状相关的蛋白质;
A3)在A1)或A2)的N末端或/和C末端连接蛋白标签得到的融合蛋白质;
所述SlGTL1是如下A4)、A5)或A6)的蛋白质:
A4)氨基酸序列是序列表中序列2的蛋白质;
A5)将序列表中序列2中任一种所示的氨基酸序列经过一个以上氨基酸残基的取代和/或缺失和/或添加得到的与A4)所示的蛋白质衍生得到且与植物果实形状相关的蛋白质;
A6)在A5)或A6)的N末端或/和C末端连接蛋白标签得到的融合蛋白质。
2.根据权利要求1所述的蛋白质,其特征在于:A2)所述蛋白质为SlGT-2+T,所述SlGT-2+T的氨基酸序列是序列表中序列9;
A5)所述蛋白质为SlGTL1+AG,所述SlGTL1+AG的氨基酸序列是序列表中序列10。
3.基因,其特征在于:所述基因为编码权利要求1所述SlGT-2的SlGT-2基因或编码权利要求1所述SlGTL1的SlGTL1基因、或编码权利要求2所述SlGT-2+T的SlGT-2+T基因或编码权利要求2所述SlGTL1+AG的SlGTL1+AG基因;
所述SlGT-2基因为如下B1)或B2)所示的基因:
B1)编码链的编码序列是序列表中序列3的cDNA分子或DNA分子;
B2)编码链的核苷酸是序列表中序列5的DNA分子;
所述SlGTL1基因,为如下B3)或B4)所示的基因:
B3)编码链的编码序列(CDS)是序列表中序列4的cDNA分子或DNA分子;
B4)编码链的核苷酸是序列表中序列6的DNA分子;
所述SlGT-2+T基因是编码链的核苷酸是序列表中序列11的DNA分子;
所述SlGTL1+AG基因是编码链的核苷酸是序列表中序列12的DNA分子。
4.一种调控植物果实形状的方法,其特征在于:包括如下步骤:抑制受体圆形果植物中权利要求3所述SlGT-2基因及权利要求3所述SlGTL1基因的表达,得到方形果的目的植物。
5.根据权利要求4所述的方法,其特征在于:所述抑制受体圆形果植物中权利要求3所述SlGT-2基因及权利要求3所述SlGTL1基因的表达是通过对植物中所述SlGT-2基因和所述SlGTL1基因的进行基因编辑实现的,所述基因编辑是借助CRISPR/Cas9系统实现的,所述CRISPR/Cas9系统包括表达含有Cas9和sgRNA的质粒,所述sgRNA为针对靶序列1的sgRNA1和针对靶序列2的sgRNA2,所述靶序列1为序列表中序列5的第40-59位,所述靶序列2为序列表中序列6的第116-136位。
6.根据权利要求5所述的方法,其特征在于:所述sgRNA1的核苷酸序列如序列表中序列7所示,所述sgRNA2的核苷酸序列如序列表中序列8所示。
7.根据权利要求6所述的方法,其特征在于:所述抑制受体圆形果植物中权利要求3所述SlGT-2基因及权利要求3所述SlGTL1基因的表达为下述X1和X2:
X1将序列表中序列5所示的所述SlGT-2基因突变为SlGT-2+T基因,所述SlGT-2+T基因的编码序列如序列11所示;
X2将序列表中序列6所示的所述SlGTL1基因突变为SlGTL1+AG基因,所述SlGTL1+AG基因的编码序列如序列12所示。
8.根据权利要求4-7任一所述的方法,其特征在于:所述植物为F1)-F4)中的任一种:
F1)管状花目植物;
F2)茄科植物;
F3)番茄属植物;
F4)番茄。
9.调控植物果型的试剂,其特征在于:所述试剂的活性成分为抑制权利要求3所述SlGT-2基因和权利要求3所述SlGTL1基因的表达、降低权利要求1所述SlGT-2蛋白和权利要求1所述SlGTL1蛋白的丰度、和/或敲除权利要求3所述SlGT-2基因和权利要求3所述SlGTL1基因的物质。
10.权利要求1所述的蛋白、权利要求3所述的基因或权利要求9所述的试剂在调控植物果实形状和/或果胶含水量中的应用。
11.权利要求4-8所述的方法在调控植物果胶含水量中的应用。
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202110955324.0A CN115894642B (zh) | 2021-08-19 | 2021-08-19 | 果型控制基因SlGT-2及其同源基因与应用 |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202110955324.0A CN115894642B (zh) | 2021-08-19 | 2021-08-19 | 果型控制基因SlGT-2及其同源基因与应用 |
Publications (2)
Publication Number | Publication Date |
---|---|
CN115894642A true CN115894642A (zh) | 2023-04-04 |
CN115894642B CN115894642B (zh) | 2024-04-02 |
Family
ID=86474917
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202110955324.0A Active CN115894642B (zh) | 2021-08-19 | 2021-08-19 | 果型控制基因SlGT-2及其同源基因与应用 |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN115894642B (zh) |
Citations (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1919867A (zh) * | 2006-08-09 | 2007-02-28 | 中国科学院遗传与发育生物学研究所 | 大豆Trihelix转录因子及其编码基因与应用 |
US20090138981A1 (en) * | 1998-09-22 | 2009-05-28 | Mendel Biotechnology, Inc. | Biotic and abiotic stress tolerance in plants |
CN103626856A (zh) * | 2012-08-24 | 2014-03-12 | 中国科学院遗传与发育生物学研究所 | 转录因子AtGT4及其编码基因与应用 |
US20190194682A1 (en) * | 2013-10-14 | 2019-06-27 | Koch Biological Solutions, Llc | Yield improvement in plants |
CN111808869A (zh) * | 2020-07-31 | 2020-10-23 | 浙江省农业科学院 | 参与番茄果实大小、番茄红素及β-胡萝卜素调控的基因SlOPT7及其应用 |
CN112457380A (zh) * | 2019-09-09 | 2021-03-09 | 中国科学院遗传与发育生物学研究所 | 调控植物果形和/或果汁含量的蛋白质及其相关生物材料和应用 |
CN113264992A (zh) * | 2020-02-14 | 2021-08-17 | 中国科学院遗传与发育生物学研究所 | 一种梨形番茄材料的制备方法 |
-
2021
- 2021-08-19 CN CN202110955324.0A patent/CN115894642B/zh active Active
Patent Citations (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20090138981A1 (en) * | 1998-09-22 | 2009-05-28 | Mendel Biotechnology, Inc. | Biotic and abiotic stress tolerance in plants |
CN1919867A (zh) * | 2006-08-09 | 2007-02-28 | 中国科学院遗传与发育生物学研究所 | 大豆Trihelix转录因子及其编码基因与应用 |
CN103626856A (zh) * | 2012-08-24 | 2014-03-12 | 中国科学院遗传与发育生物学研究所 | 转录因子AtGT4及其编码基因与应用 |
US20190194682A1 (en) * | 2013-10-14 | 2019-06-27 | Koch Biological Solutions, Llc | Yield improvement in plants |
CN112457380A (zh) * | 2019-09-09 | 2021-03-09 | 中国科学院遗传与发育生物学研究所 | 调控植物果形和/或果汁含量的蛋白质及其相关生物材料和应用 |
CN113264992A (zh) * | 2020-02-14 | 2021-08-17 | 中国科学院遗传与发育生物学研究所 | 一种梨形番茄材料的制备方法 |
CN111808869A (zh) * | 2020-07-31 | 2020-10-23 | 浙江省农业科学院 | 参与番茄果实大小、番茄红素及β-胡萝卜素调控的基因SlOPT7及其应用 |
Non-Patent Citations (5)
Title |
---|
CHUYING YU 等: ""Genome-wide identification and expression profiling analysis of trihelix gene family in tomato"", 《BIOCHEMICAL AND BIOPHYSICAL RESEARCH COMMUNICATIONS》, vol. 468, no. 4, pages 653 - 659, XP029357835, DOI: 10.1016/j.bbrc.2015.11.010 * |
NCBI: ""PREDICTED: Solanum lycopersicum trihelix transcription factor GT-2 (LOC101267228), mRNA"", 《GENBANK》, pages 004237741 * |
NCBI: ""PREDICTED: Solanum lycopersicum trihelix transcription factor GT-2-like (LOC101262091), mRNA"", 《GENBANK》, pages 010316178 * |
QIANG ZHU 等: ""Redesigning the tomato fruit shape for mechanized production"", 《NAT PLANTS》, vol. 9, no. 10, pages 1659 - 1674 * |
姬雅静 等: ""番茄果实形状的调控机制研究进展"", 《番茄果实形状的调控机制研究进展》, vol. 50, no. 9, pages 2015 - 2030 * |
Also Published As
Publication number | Publication date |
---|---|
CN115894642B (zh) | 2024-04-02 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN109321582B (zh) | 粗山羊草Yr4DS基因在麦族植物抗条锈病育种的应用 | |
CN108368517B (zh) | 用于快速植物转化的方法和组合物 | |
CN110317250B (zh) | Myb6基因及其编码蛋白在调控植物对黄萎病抗性中的应用 | |
CN110088284A (zh) | 高水平蒂巴因罂粟及其生产的方法 | |
CN113481176B (zh) | GA3ox1蛋白在调控苜蓿株型中的应用 | |
CN108395472A (zh) | 一种控制稻类粒长和粒重的基因及其应用 | |
CN105646684A (zh) | 一种水稻粒型相关蛋白glw2及其编码基因与应用 | |
CN110691509A (zh) | 用于改良植物性状的方法 | |
CN112724213B (zh) | 甘薯花青苷合成和抗逆相关蛋白IbMYB4及其编码基因与应用 | |
CN115927445A (zh) | OsPIL15基因在调控水稻节水抗旱中的应用 | |
CN112812161B (zh) | 蛋白质IbMYC2在调控植物抗旱性中的应用 | |
CN106496313A (zh) | 抗病相关蛋白IbSWEET10及其编码基因与应用 | |
CN107266543B (zh) | 抗逆相关蛋白IbRAP2-12及其编码基因与应用 | |
CN115894642B (zh) | 果型控制基因SlGT-2及其同源基因与应用 | |
CN107936099B (zh) | Lhap1蛋白及其编码基因在调控植物光合作用中的应用 | |
CN116024228B (zh) | 水稻Ospep5基因及其编码小肽在调控植物耐盐性中的应用 | |
CN108395473B (zh) | 植物类胡萝卜素合成相关蛋白及其编码基因与应用 | |
CN105198977A (zh) | 植物白粉病抗性相关蛋白Pm-2F及其编码基因与应用 | |
CN112458105B (zh) | 一种普通野生稻粒型相关编码基因及其应用 | |
CN113264992B (zh) | 一种梨形番茄材料的制备方法 | |
CN111154770B (zh) | 水稻基因OsABCC2在调节农药的吸收转运中的应用 | |
CN114736278A (zh) | 一种马铃薯花色素苷生物合成负调控基因、转录因子及应用 | |
CN112430613A (zh) | 一种宽编辑范围的SpG基因及其应用 | |
CN101575366B (zh) | 水稻株型基因及其应用 | |
CN113968899B (zh) | 一种长果番茄材料的制备方法 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |