CN114989280A - A rice male fertility control gene STS1, its encoded protein and application
- Publication number
- CN114989280A (application CN202210535082.4A)
- Authority
- CN
- China
- Prior art keywords
- sts1
- gene
- rice
- ala
- sequence
- Legal status (an assumption, not a legal conclusion)
- Granted
- AFFKUNVPPLQUGA-DCAQKATOSA-N Met-Leu-Ala Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O AFFKUNVPPLQUGA-DCAQKATOSA-N 0.000 description 1
- 240000005561 Musa balbisiana Species 0.000 description 1
- 235000018290 Musa x paradisiaca Nutrition 0.000 description 1
- WUGMRIBZSVSJNP-UHFFFAOYSA-N N-L-alanyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C)C(O)=O)=CNC2=C1 WUGMRIBZSVSJNP-UHFFFAOYSA-N 0.000 description 1
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 1
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 1
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 1
- 101150005851 NOS gene Proteins 0.000 description 1
- BQVUABVGYYSDCJ-UHFFFAOYSA-N Nalpha-L-Leucyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)CC(C)C)C(O)=O)=CNC2=C1 BQVUABVGYYSDCJ-UHFFFAOYSA-N 0.000 description 1
- 240000002582 Oryza sativa Indica Group Species 0.000 description 1
- 240000008467 Oryza sativa Japonica Group Species 0.000 description 1
- 101000708283 Oryza sativa subsp. indica Protein Rf1, mitochondrial Proteins 0.000 description 1
- 241001668545 Pascopyrum Species 0.000 description 1
- 235000010627 Phaseolus vulgaris Nutrition 0.000 description 1
- 244000046052 Phaseolus vulgaris Species 0.000 description 1
- LBSARGIQACMGDF-WBAXXEDZSA-N Phe-Ala-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 LBSARGIQACMGDF-WBAXXEDZSA-N 0.000 description 1
- YYRCPTVAPLQRNC-ULQDDVLXSA-N Phe-Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H](N)CC1=CC=CC=C1 YYRCPTVAPLQRNC-ULQDDVLXSA-N 0.000 description 1
- JJHVFCUWLSKADD-ONGXEEELSA-N Phe-Gly-Ala Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](C)C(O)=O JJHVFCUWLSKADD-ONGXEEELSA-N 0.000 description 1
- 235000008331 Pinus X rigitaeda Nutrition 0.000 description 1
- 241000018646 Pinus brutia Species 0.000 description 1
- 235000011613 Pinus brutia Nutrition 0.000 description 1
- 108700001094 Plant Genes Proteins 0.000 description 1
- 239000002202 Polyethylene glycol Substances 0.000 description 1
- DBALDZKOTNSBFM-FXQIFTODSA-N Pro-Ala-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O DBALDZKOTNSBFM-FXQIFTODSA-N 0.000 description 1
- XYHMFGGWNOFUOU-QXEWZRGKSA-N Pro-Ile-Gly Chemical compound OC(=O)CNC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H]1CCCN1 XYHMFGGWNOFUOU-QXEWZRGKSA-N 0.000 description 1
- FKYKZHOKDOPHSA-DCAQKATOSA-N Pro-Leu-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O FKYKZHOKDOPHSA-DCAQKATOSA-N 0.000 description 1
- OFGUOWQVEGTVNU-DCAQKATOSA-N Pro-Lys-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O OFGUOWQVEGTVNU-DCAQKATOSA-N 0.000 description 1
- MKGIILKDUGDRRO-FXQIFTODSA-N Pro-Ser-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H]1CCCN1 MKGIILKDUGDRRO-FXQIFTODSA-N 0.000 description 1
- HRIXMVRZRGFKNQ-HJGDQZAQSA-N Pro-Thr-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(O)=O HRIXMVRZRGFKNQ-HJGDQZAQSA-N 0.000 description 1
- VGFFUEVZKRNRHT-ULQDDVLXSA-N Pro-Trp-Glu Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)N[C@@H](CCC(=O)O)C(=O)O VGFFUEVZKRNRHT-ULQDDVLXSA-N 0.000 description 1
- VDHGTOHMHHQSKG-JYJNAYRXSA-N Pro-Val-Phe Chemical compound CC(C)[C@H](NC(=O)[C@@H]1CCCN1)C(=O)N[C@@H](Cc1ccccc1)C(O)=O VDHGTOHMHHQSKG-JYJNAYRXSA-N 0.000 description 1
- FIODMZKLZFLYQP-GUBZILKMSA-N Pro-Val-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O FIODMZKLZFLYQP-GUBZILKMSA-N 0.000 description 1
- 108010076504 Protein Sorting Signals Proteins 0.000 description 1
- 244000184734 Pyrus japonica Species 0.000 description 1
- 108091030071 RNAI Proteins 0.000 description 1
- 240000000528 Ricinus communis Species 0.000 description 1
- 235000004443 Ricinus communis Nutrition 0.000 description 1
- 244000235659 Rubus idaeus Species 0.000 description 1
- 240000000111 Saccharum officinarum Species 0.000 description 1
- 235000007201 Saccharum officinarum Nutrition 0.000 description 1
- 244000082988 Secale cereale Species 0.000 description 1
- 235000007238 Secale cereale Nutrition 0.000 description 1
- 241000228160 Secale cereale x Triticum aestivum Species 0.000 description 1
- OHKLFYXEOGGGCK-ZLUOBGJFSA-N Ser-Asp-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O OHKLFYXEOGGGCK-ZLUOBGJFSA-N 0.000 description 1
- BYIROAKULFFTEK-CIUDSAMLSA-N Ser-Asp-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CO BYIROAKULFFTEK-CIUDSAMLSA-N 0.000 description 1
- SMIDBHKWSYUBRZ-ACZMJKKPSA-N Ser-Glu-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O SMIDBHKWSYUBRZ-ACZMJKKPSA-N 0.000 description 1
- UQFYNFTYDHUIMI-WHFBIAKZSA-N Ser-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CO UQFYNFTYDHUIMI-WHFBIAKZSA-N 0.000 description 1
- YMTLKLXDFCSCNX-BYPYZUCNSA-N Ser-Gly-Gly Chemical compound OC[C@H](N)C(=O)NCC(=O)NCC(O)=O YMTLKLXDFCSCNX-BYPYZUCNSA-N 0.000 description 1
- GZFAWAQTEYDKII-YUMQZZPRSA-N Ser-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CO GZFAWAQTEYDKII-YUMQZZPRSA-N 0.000 description 1
- JFWDJFULOLKQFY-QWRGUYRKSA-N Ser-Gly-Phe Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JFWDJFULOLKQFY-QWRGUYRKSA-N 0.000 description 1
- SFTZWNJFZYOLBD-ZDLURKLDSA-N Ser-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CO SFTZWNJFZYOLBD-ZDLURKLDSA-N 0.000 description 1
- XERQKTRGJIKTRB-CIUDSAMLSA-N Ser-His-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](CO)N)CC1=CN=CN1 XERQKTRGJIKTRB-CIUDSAMLSA-N 0.000 description 1
- VMLONWHIORGALA-SRVKXCTJSA-N Ser-Leu-Leu Chemical compound CC(C)C[C@@H](C([O-])=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]([NH3+])CO VMLONWHIORGALA-SRVKXCTJSA-N 0.000 description 1
- OCWWJBZQXGYQCA-DCAQKATOSA-N Ser-Lys-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(O)=O OCWWJBZQXGYQCA-DCAQKATOSA-N 0.000 description 1
- WLJPJRGQRNCIQS-ZLUOBGJFSA-N Ser-Ser-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O WLJPJRGQRNCIQS-ZLUOBGJFSA-N 0.000 description 1
- RXUOAOOZIWABBW-XGEHTFHBSA-N Ser-Thr-Arg Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N RXUOAOOZIWABBW-XGEHTFHBSA-N 0.000 description 1
- WUXCHQZLUHBSDJ-LKXGYXEUSA-N Ser-Thr-Asp Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CC(O)=O)C(O)=O WUXCHQZLUHBSDJ-LKXGYXEUSA-N 0.000 description 1
- SNXUIBACCONSOH-BWBBJGPYSA-N Ser-Thr-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CO)C(O)=O SNXUIBACCONSOH-BWBBJGPYSA-N 0.000 description 1
- 235000003434 Sesamum indicum Nutrition 0.000 description 1
- 244000040738 Sesamum orientale Species 0.000 description 1
- PXQUBKWZENPDGE-CIQUZCHMSA-N Thr-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)O)N PXQUBKWZENPDGE-CIQUZCHMSA-N 0.000 description 1
- UHBPFYOQQPFKQR-JHEQGTHGSA-N Thr-Gln-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O UHBPFYOQQPFKQR-JHEQGTHGSA-N 0.000 description 1
- AHOLTQCAVBSUDP-PPCPHDFISA-N Thr-Ile-Lys Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](N)[C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O AHOLTQCAVBSUDP-PPCPHDFISA-N 0.000 description 1
- FLPZMPOZGYPBEN-PPCPHDFISA-N Thr-Leu-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FLPZMPOZGYPBEN-PPCPHDFISA-N 0.000 description 1
- VRUFCJZQDACGLH-UVOCVTCTSA-N Thr-Leu-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VRUFCJZQDACGLH-UVOCVTCTSA-N 0.000 description 1
- WRQLCVIALDUQEQ-UNQGMJICSA-N Thr-Phe-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O WRQLCVIALDUQEQ-UNQGMJICSA-N 0.000 description 1
- NLWDSYKZUPRMBJ-IEGACIPQSA-N Thr-Trp-Leu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC(C)C)C(=O)O)N)O NLWDSYKZUPRMBJ-IEGACIPQSA-N 0.000 description 1
- 235000019714 Triticale Nutrition 0.000 description 1
- 240000000359 Triticum dicoccon Species 0.000 description 1
- 235000001468 Triticum dicoccon Nutrition 0.000 description 1
- 240000000581 Triticum monococcum Species 0.000 description 1
- 240000003834 Triticum spelta Species 0.000 description 1
- 235000004240 Triticum spelta Nutrition 0.000 description 1
- NXJZCPKZIKTYLX-XEGUGMAKSA-N Trp-Glu-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N NXJZCPKZIKTYLX-XEGUGMAKSA-N 0.000 description 1
- ILDJYIDXESUBOE-HSCHXYMDSA-N Trp-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N ILDJYIDXESUBOE-HSCHXYMDSA-N 0.000 description 1
- HIZDHWHVOLUGOX-BPUTZDHNSA-N Trp-Ser-Val Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O HIZDHWHVOLUGOX-BPUTZDHNSA-N 0.000 description 1
- MPYZGXUYLNPSNF-NAZCDGGXSA-N Trp-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N)O MPYZGXUYLNPSNF-NAZCDGGXSA-N 0.000 description 1
- NSGZILIDHCIZAM-KKUMJFAQSA-N Tyr-Leu-Ser Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N NSGZILIDHCIZAM-KKUMJFAQSA-N 0.000 description 1
- KHPLUFDSWGDRHD-SLFFLAALSA-N Tyr-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CC3=CC=C(C=C3)O)N)C(=O)O KHPLUFDSWGDRHD-SLFFLAALSA-N 0.000 description 1
- -1 UidA Proteins 0.000 description 1
- QHDXUYOYTPWCSK-RCOVLWMOSA-N Val-Asp-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)NCC(=O)O)N QHDXUYOYTPWCSK-RCOVLWMOSA-N 0.000 description 1
- NXRAUQGGHPCJIB-RCOVLWMOSA-N Val-Gly-Asn Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O NXRAUQGGHPCJIB-RCOVLWMOSA-N 0.000 description 1
- APEBUJBRGCMMHP-HJWJTTGWSA-N Val-Ile-Phe Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 APEBUJBRGCMMHP-HJWJTTGWSA-N 0.000 description 1
- YDVDTCJGBBJGRT-GUBZILKMSA-N Val-Met-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)O)N YDVDTCJGBBJGRT-GUBZILKMSA-N 0.000 description 1
- MJOUSKQHAIARKI-JYJNAYRXSA-N Val-Phe-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CC1=CC=CC=C1 MJOUSKQHAIARKI-JYJNAYRXSA-N 0.000 description 1
- UVHFONIHVHLDDQ-IFFSRLJSSA-N Val-Thr-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N)O UVHFONIHVHLDDQ-IFFSRLJSSA-N 0.000 description 1
- JLCPHMBAVCMARE-UHFFFAOYSA-N [3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-hydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methyl [5-(6-aminopurin-9-yl)-2-(hydroxymethyl)oxolan-3-yl] hydrogen phosphate Polymers 
Cc1cn(C2CC(OP(O)(=O)OCC3OC(CC3OP(O)(=O)OCC3OC(CC3O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c3nc(N)[nH]c4=O)C(COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3CO)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cc(C)c(=O)[nH]c3=O)n3cc(C)c(=O)[nH]c3=O)n3ccc(N)nc3=O)n3cc(C)c(=O)[nH]c3=O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)O2)c(=O)[nH]c1=O JLCPHMBAVCMARE-UHFFFAOYSA-N 0.000 description 1
- 230000002159 abnormal effect Effects 0.000 description 1
- 206010000210 abortion Diseases 0.000 description 1
- 231100000176 abortion Toxicity 0.000 description 1
- 239000002253 acid Substances 0.000 description 1
- 230000009471 action Effects 0.000 description 1
- 230000002411 adverse Effects 0.000 description 1
- 108010005233 alanylglutamic acid Proteins 0.000 description 1
- 108010044940 alanylglutamine Proteins 0.000 description 1
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 1
- 230000003321 amplification Effects 0.000 description 1
- 238000000137 annealing Methods 0.000 description 1
- 229930002877 anthocyanin Natural products 0.000 description 1
- 235000010208 anthocyanin Nutrition 0.000 description 1
- 239000004410 anthocyanin Substances 0.000 description 1
- 150000004636 anthocyanins Chemical class 0.000 description 1
- 230000000692 anti-sense effect Effects 0.000 description 1
- 108010062796 arginyllysine Proteins 0.000 description 1
- 210000004436 artificial bacterial chromosome Anatomy 0.000 description 1
- 210000001106 artificial yeast chromosome Anatomy 0.000 description 1
- 108010093581 aspartyl-proline Proteins 0.000 description 1
- 230000033228 biological regulation Effects 0.000 description 1
- 210000000081 body of the sternum Anatomy 0.000 description 1
- 235000019577 caloric intake Nutrition 0.000 description 1
- 210000003855 cell nucleus Anatomy 0.000 description 1
- 235000013339 cereals Nutrition 0.000 description 1
- 239000003153 chemical reaction reagent Substances 0.000 description 1
- 229960005091 chloramphenicol Drugs 0.000 description 1
- WIIZWVCIJKGZOK-RKDXNWHRSA-N chloramphenicol Chemical compound ClC(Cl)C(=O)N[C@H](CO)[C@H](O)C1=CC=C([N+]([O-])=O)C=C1 WIIZWVCIJKGZOK-RKDXNWHRSA-N 0.000 description 1
- 210000003763 chloroplast Anatomy 0.000 description 1
- 239000013611 chromosomal DNA Substances 0.000 description 1
- 235000016213 coffee Nutrition 0.000 description 1
- 235000013353 coffee beverage Nutrition 0.000 description 1
- 230000001276 controlling effect Effects 0.000 description 1
- 238000007796 conventional method Methods 0.000 description 1
- 235000012343 cottonseed oil Nutrition 0.000 description 1
- 238000009402 cross-breeding Methods 0.000 description 1
- 108010082025 cyan fluorescent protein Proteins 0.000 description 1
- 230000007547 defect Effects 0.000 description 1
- 238000004925 denaturation Methods 0.000 description 1
- 230000036425 denaturation Effects 0.000 description 1
- 108010054813 diprotin B Proteins 0.000 description 1
- 238000004520 electroporation Methods 0.000 description 1
- 230000004720 fertilization Effects 0.000 description 1
- 235000004426 flaxseed Nutrition 0.000 description 1
- 238000010230 functional analysis Methods 0.000 description 1
- 230000030279 gene silencing Effects 0.000 description 1
- 230000009368 gene silencing by RNA Effects 0.000 description 1
- 238000012226 gene silencing method Methods 0.000 description 1
- 108010042598 glutamyl-aspartyl-glycine Proteins 0.000 description 1
- 108010008237 glutamyl-valyl-glycine Proteins 0.000 description 1
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 1
- 108010000434 glycyl-alanyl-leucine Proteins 0.000 description 1
- 108010075431 glycyl-alanyl-phenylalanine Proteins 0.000 description 1
- 108010001064 glycyl-glycyl-glycyl-glycine Proteins 0.000 description 1
- 108010081551 glycylphenylalanine Proteins 0.000 description 1
- 235000015810 grayleaf red raspberry Nutrition 0.000 description 1
- 101150054900 gus gene Proteins 0.000 description 1
- 108010028295 histidylhistidine Proteins 0.000 description 1
- 238000002744 homologous recombination Methods 0.000 description 1
- 230000006801 homologous recombination Effects 0.000 description 1
- 229930027917 kanamycin Natural products 0.000 description 1
- 229960000318 kanamycin Drugs 0.000 description 1
- SBUJHOSQTJFQJX-NOAMYHISSA-N kanamycin Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CN)O[C@@H]1O[C@H]1[C@H](O)[C@@H](O[C@@H]2[C@@H]([C@@H](N)[C@H](O)[C@@H](CO)O2)O)[C@H](N)C[C@@H]1N SBUJHOSQTJFQJX-NOAMYHISSA-N 0.000 description 1
- 229930182823 kanamycin A Natural products 0.000 description 1
- 108010057821 leucylproline Proteins 0.000 description 1
- 239000002502 liposome Substances 0.000 description 1
- 108010043322 lysyl-tryptophyl-alpha-lysine Proteins 0.000 description 1
- 239000003550 marker Substances 0.000 description 1
- 230000001404 mediated effect Effects 0.000 description 1
- 238000000520 microinjection Methods 0.000 description 1
- 210000003470 mitochondria Anatomy 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 239000003471 mutagenic agent Substances 0.000 description 1
- 231100000707 mutagenic chemical Toxicity 0.000 description 1
- 230000003505 mutagenic effect Effects 0.000 description 1
- 238000003199 nucleic acid amplification method Methods 0.000 description 1
- 238000007899 nucleic acid hybridization Methods 0.000 description 1
- 239000002777 nucleoside Substances 0.000 description 1
- 125000003835 nucleoside group Chemical group 0.000 description 1
- 210000004940 nucleus Anatomy 0.000 description 1
- 235000016709 nutrition Nutrition 0.000 description 1
- 230000035764 nutrition Effects 0.000 description 1
- 239000003960 organic solvent Substances 0.000 description 1
- 235000020232 peanut Nutrition 0.000 description 1
- 108010072637 phenylalanyl-arginyl-phenylalanine Proteins 0.000 description 1
- 239000013612 plasmid Substances 0.000 description 1
- 239000013600 plasmid vector Substances 0.000 description 1
- 210000002706 plastid Anatomy 0.000 description 1
- 230000010152 pollination Effects 0.000 description 1
- 229920001223 polyethylene glycol Polymers 0.000 description 1
- 108091033319 polynucleotide Proteins 0.000 description 1
- 102000040430 polynucleotide Human genes 0.000 description 1
- 239000002157 polynucleotide Substances 0.000 description 1
- 238000002360 preparation method Methods 0.000 description 1
- 230000008569 process Effects 0.000 description 1
- 238000012545 processing Methods 0.000 description 1
- 238000003259 recombinant expression Methods 0.000 description 1
- 238000004153 renaturation Methods 0.000 description 1
- 230000010076 replication Effects 0.000 description 1
- 230000027272 reproductive process Effects 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 108010026333 seryl-proline Proteins 0.000 description 1
- 238000002741 site-directed mutagenesis Methods 0.000 description 1
- 229960000268 spectinomycin Drugs 0.000 description 1
- UNFWWIHTNXNPBV-WXKVUWSESA-N spectinomycin Chemical compound O([C@@H]1[C@@H](NC)[C@@H](O)[C@H]([C@@H]([C@H]1O1)O)NC)[C@]2(O)[C@H]1O[C@H](C)CC2=O UNFWWIHTNXNPBV-WXKVUWSESA-N 0.000 description 1
- 238000003860 storage Methods 0.000 description 1
- 229960005322 streptomycin Drugs 0.000 description 1
- 229940124530 sulfonamide Drugs 0.000 description 1
- 150000003456 sulfonamides Chemical class 0.000 description 1
- 108010061238 threonyl-glycine Proteins 0.000 description 1
- 230000005026 transcription initiation Effects 0.000 description 1
- 238000012546 transfer Methods 0.000 description 1
- 230000010474 transient expression Effects 0.000 description 1
- 230000007704 transition Effects 0.000 description 1
- 230000014616 translation Effects 0.000 description 1
- 108010080629 tryptophan-leucine Proteins 0.000 description 1
- 230000009105 vegetative growth Effects 0.000 description 1
- 238000012795 verification Methods 0.000 description 1
Images
Classifications
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/415—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from plants
-
- A—HUMAN NECESSITIES
- A01—AGRICULTURE; FORESTRY; ANIMAL HUSBANDRY; HUNTING; TRAPPING; FISHING
- A01H—NEW PLANTS OR NON-TRANSGENIC PROCESSES FOR OBTAINING THEM; PLANT REPRODUCTION BY TISSUE CULTURE TECHNIQUES
- A01H1/00—Processes for modifying genotypes ; Plants characterised by associated natural traits
- A01H1/02—Methods or apparatus for hybridisation; Artificial pollination ; Fertility
-
- A—HUMAN NECESSITIES
- A01—AGRICULTURE; FORESTRY; ANIMAL HUSBANDRY; HUNTING; TRAPPING; FISHING
- A01H—NEW PLANTS OR NON-TRANSGENIC PROCESSES FOR OBTAINING THEM; PLANT REPRODUCTION BY TISSUE CULTURE TECHNIQUES
- A01H1/00—Processes for modifying genotypes ; Plants characterised by associated natural traits
- A01H1/02—Methods or apparatus for hybridisation; Artificial pollination ; Fertility
- A01H1/022—Genic fertility modification, e.g. apomixis
- A01H1/023—Male sterility
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/82—Vectors or expression systems specially adapted for eukaryotic hosts for plant cells, e.g. plant artificial chromosomes (PACs)
- C12N15/8241—Phenotypically and genetically modified plants via recombinant DNA technology
- C12N15/8261—Phenotypically and genetically modified plants via recombinant DNA technology with agronomic (input) traits, e.g. crop yield
- C12N15/8287—Phenotypically and genetically modified plants via recombinant DNA technology with agronomic (input) traits, e.g. crop yield for fertility modification, e.g. apomixis
- C12N15/8289—Male sterility
Landscapes
- Life Sciences & Earth Sciences (AREA)
- Health & Medical Sciences (AREA)
- Genetics & Genomics (AREA)
- Chemical & Material Sciences (AREA)
- Organic Chemistry (AREA)
- General Health & Medical Sciences (AREA)
- Molecular Biology (AREA)
- Botany (AREA)
- Engineering & Computer Science (AREA)
- Biophysics (AREA)
- Biomedical Technology (AREA)
- Zoology (AREA)
- Biochemistry (AREA)
- Developmental Biology & Embryology (AREA)
- Environmental Sciences (AREA)
- Wood Science & Technology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- General Engineering & Computer Science (AREA)
- Biotechnology (AREA)
- Medicinal Chemistry (AREA)
- Gastroenterology & Hepatology (AREA)
- Plant Pathology (AREA)
- Physics & Mathematics (AREA)
- Microbiology (AREA)
- Cell Biology (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Breeding Of Plants And Reproduction By Means Of Culturing (AREA)
Abstract
The invention provides a rice male fertility control gene, STS1, the protein it encodes, and applications thereof, and belongs to the field of plant biotechnology. The nucleotide sequence of the rice male fertility control gene STS1 is one of the following: 1) the nucleotide sequence shown in SEQ ID NO.1; 2) a DNA sequence having at least 90% similarity to the nucleotide sequence shown in SEQ ID NO.1 and having the same function; 3) a DNA sequence capable of hybridizing, under certain conditions, with the DNA of the sequence in 1); 4) a DNA sequence complementary to any of the sequences in 1) to 3). The invention also protects a method of affecting plant fertility by altering the nucleotide sequence of STS1 or by regulating the transcription and expression of the STS1 gene, as well as the use of conventional genetic transformation to introduce the STS1 gene into a rice male-sterile line obtained by the foregoing application, thereby restoring the wild-type phenotype in the mutant.
Description
Technical Field
The invention relates to the technical field of plant genes, and in particular to a rice male fertility control gene STS1, the protein it encodes, and applications thereof.
Background
Rice is one of the most important crops for human nutrition and caloric intake, supplying more than one fifth of the calories consumed by humans. Exploiting heterosis is the main means of breeding new rice varieties and an important route to higher rice yields. Rice is a self-pollinating plant whose pistil and stamens are borne in the same very small spikelet, so manual emasculation cannot supply the large quantities of hybrid seed needed for production; hybrid rice breeding therefore depends on discovering and using male-sterile germplasm. In other words, by using naturally male-sterile rice as the female parent, together with artificially assisted pollination, hybrid seed can be produced in large quantities. For a long time, however, the discovery and use of male-sterile germplasm has been limited overall. The types in wide use today are only cytoplasmic-nuclear interaction male sterility (the three-line system) and thermosensitive genic male sterility (the two-line system). The former is constrained by restorer-maintainer relationships, and the latter is easily affected by the environment. More importantly, the sterility gene resources actually used in both types are narrow, which limits further gains in hybrid rice heterosis and also carries considerable risk.
With the development of biotechnology, research in recent years has shown that a technique called Seed Production Technology (SPT) allows rice nuclear male-sterile mutants to be used directly in hybrid rice. Isolating and cloning new types of nuclear male fertility control genes and creating new sterile materials is therefore of great significance for broadening the range of sterile-line germplasm urgently needed for hybrid rice.
Summary of the Invention
In view of the problems and deficiencies of the prior art, the present invention provides a rice male fertility control gene STS1, the protein it encodes, and applications thereof. Using this gene and its encoded protein to regulate rice male fertility, mutating the coding gene by gene editing or similar means to obtain new rice sterile lines, and restoring fertility in the mutant by transgenic technology all have important application value in agricultural production. The technical solution of the present invention is as follows:
In a first aspect, the present invention provides a rice male fertility control gene STS1, derived from rice, whose nucleotide sequence is one of the following: 1) the nucleotide sequence shown in SEQ ID NO.1; 2) a DNA sequence having at least 90% similarity to the nucleotide sequence shown in SEQ ID NO.1 and having the same function; 3) a DNA sequence capable of hybridizing, under certain conditions, with the DNA of the sequence in 1); 4) a DNA sequence complementary to any of the sequences in 1) to 3).
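As an illustration of group 4) above, a complementary DNA sequence can be derived computationally from a given strand. The following is a minimal sketch only: the function names and the short example fragment are hypothetical and are not taken from SEQ ID NO.1.

```python
# Minimal sketch: derive the complementary strand of a DNA sequence.
# The example fragment is a made-up placeholder, NOT part of SEQ ID NO.1.

# Translation table mapping each base to its Watson-Crick partner.
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def complement(seq: str) -> str:
    """Return the base-by-base complement (A<->T, C<->G), same orientation."""
    return seq.translate(_COMPLEMENT)


def reverse_complement(seq: str) -> str:
    """Return the complementary strand written in the 5'->3' direction."""
    return complement(seq)[::-1]


if __name__ == "__main__":
    fragment = "ATGGCCTTA"  # hypothetical fragment for illustration
    print(complement(fragment))
    print(reverse_complement(fragment))
```

Taking the reverse complement twice returns the original strand, which is a convenient sanity check when handling sequences of either orientation.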
It is well known in the art that the rice male fertility control gene of the present invention includes functional equivalent sequences that are highly homologous to the STS1 gene and have the same fertility control function. Such highly homologous functional equivalent sequences include DNA sequences capable of hybridizing, under certain conditions, with the nucleotide sequence of the STS1 gene disclosed in the present invention. The "certain conditions" used in the present invention are common general knowledge and include carrying out nucleic acid hybridization by means such as denaturation (e.g., at 90-96°C, at a given pH, and in the presence of an organic solvent), annealing (renaturation at 40-65°C), and extension (at 68-75°C under the action of a polymerase).
功能等价体序列还包括与本发明所公开的STS1基因所示的序列有至少90%、95%、96%、97%、98%、或99%序列相似性,且具有育性调控功能的DNA序列,可以从任何植物中分离获得。其中,序列相似性的百分比可以通过公知的生物信息学算法来获得,包括Myers和Miller算法(Bioinformatics,4(1):11-17,1988)、Needleman-Wunsch全局比对法(J.Mol.Biol.,48(3):443-53,1970)、Smith-Waterman局部比对法(J.Mol.Biol.,147:195-197,1981)、Pearson和Lipman相似性搜索法(PNAS,85(8):2444-2448,1988)、Karlin和Altschul的算法(Altschul等,J.Mol.Biol.,215(3):403-410,1990;PNAS,90:5873-5877,1993)。这对于本领域技术人员来说是熟悉的。The functional equivalent sequence also includes at least 90%, 95%, 96%, 97%, 98%, or 99% sequence similarity with the sequence shown in the STS1 gene disclosed in the present invention, and has fertility regulation function. DNA sequences can be isolated from any plant. Among them, the percentage of sequence similarity can be obtained by well-known bioinformatics algorithms, including Myers and Miller algorithm (Bioinformatics, 4(1): 11-17, 1988), Needleman-Wunsch global alignment method (J.Mol. Biol., 48(3): 443-53, 1970), Smith-Waterman local alignment (J. Mol. Biol., 147: 195-197, 1981), Pearson and Lipman similarity search method (PNAS, 85 (8): 2444-2448, 1988), the algorithm of Karlin and Altschul (Altschul et al., J. Mol. Biol., 215(3): 403-410, 1990; PNAS, 90: 5873-5877, 1993). This is familiar to those skilled in the art.
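As an illustration of how such a percentage might be computed, the following is a minimal sketch of Needleman-Wunsch global alignment followed by a percent-identity calculation. The scoring values (match +1, mismatch -1, gap -1), the choice of identity over the aligned length as the measure, and the function names are assumptions made for this example; the patent does not specify them.

```python
def needleman_wunsch(a: str, b: str, match: int = 1, mismatch: int = -1, gap: int = -1):
    """Globally align a and b; return the two gapped strings (assumed unit scores)."""
    n, m = len(a), len(b)
    score = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        score[i][0] = i * gap
    for j in range(1, m + 1):
        score[0][j] = j * gap
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            s = match if a[i - 1] == b[j - 1] else mismatch
            score[i][j] = max(score[i - 1][j - 1] + s,
                              score[i - 1][j] + gap,
                              score[i][j - 1] + gap)
    # Trace back from the bottom-right corner to recover one optimal alignment.
    out_a, out_b = [], []
    i, j = n, m
    while i > 0 or j > 0:
        s = match if i > 0 and j > 0 and a[i - 1] == b[j - 1] else mismatch
        if i > 0 and j > 0 and score[i][j] == score[i - 1][j - 1] + s:
            out_a.append(a[i - 1]); out_b.append(b[j - 1]); i -= 1; j -= 1
        elif i > 0 and score[i][j] == score[i - 1][j] + gap:
            out_a.append(a[i - 1]); out_b.append("-"); i -= 1
        else:
            out_a.append("-"); out_b.append(b[j - 1]); j -= 1
    return "".join(reversed(out_a)), "".join(reversed(out_b))


def percent_identity(a: str, b: str) -> float:
    """Identical aligned positions as a percentage of the alignment length."""
    al_a, al_b = needleman_wunsch(a.upper(), b.upper())
    if not al_a:
        return 0.0
    matches = sum(x == y and x != "-" for x, y in zip(al_a, al_b))
    return 100.0 * matches / len(al_a)


# Two 10-nt toy sequences differing at the last base.
print(percent_identity("ATGGCCACGA", "ATGGCCACGT"))  # 90.0
```

For full-length genes, an established alignment tool would normally be used instead of this quadratic-memory sketch.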
The plants referred to above by "can be isolated from any plant" include, but are not limited to, Brassica, maize, wheat, sorghum, Crambe, white mustard, castor bean, sesame, cottonseed, linseed, soybean, Arabidopsis, Phaseolus, peanut, alfalfa, oat, rapeseed, barley, rye, millet, milo, triticale, einkorn, spelt, emmer, flax, grama grass, Tripsacum, teosinte, fescue, perennial ryegrass, sugarcane, cranberry, papaya, banana, safflower, oil palm, muskmelon, apple, cucumber, Dendrobium, gladiolus, chrysanthemum, Liliaceae, cotton, eucalyptus, sunflower, canola, sugar beet, coffee, ornamental plants, and pines. Preferably, the plants include maize, soybean, safflower, mustard, wheat, barley, rye, rice, cotton, and sorghum.
In a second aspect, the present invention also provides biological material related to the rice male fertility control gene STS1, which is any one of the following 1) to 5):
1) the protein encoded by the STS1 gene;
2) an expression cassette containing the STS1 gene;
3) a recombinant vector containing the STS1 gene;
4) a recombinant microorganism containing the STS1 gene;
5) a recombinant microorganism containing the recombinant vector of 3).
Further, the amino acid sequence of the protein encoded by the STS1 gene is the sequence shown in SEQ ID NO.2, or an amino acid sequence that is similar to the sequence shown in SEQ ID NO.2 and has the same function.
Further, in preparing the expression cassette containing the STS1 gene, various DNA fragments can be manipulated to provide DNA sequences in the proper orientation or in the correct reading frame. To this end, adapters or linkers can be used to join the DNA fragments, or other manipulations can be included to provide convenient restriction sites and the like. In addition, the expression cassette or recombinant vector provided by the invention can be inserted into a plasmid, cosmid, yeast artificial chromosome, bacterial artificial chromosome, or any other vector suitable for transformation into a host cell. Preferred host cells are bacterial cells, especially those used for cloning or storing polynucleotides or for transforming plant cells, such as Escherichia coli, Agrobacterium tumefaciens, and Agrobacterium rhizogenes. When the host cell is a plant cell, the expression cassette or recombinant vector can be inserted into the genome of the transformed plant cell. Insertion may be targeted or random; preferably, it is achieved by, for example, homologous recombination. Alternatively, the expression cassette or recombinant vector may be maintained extrachromosomally. The expression cassette or recombinant vector of the invention may be present in the nucleus, chloroplasts, mitochondria, and/or plastids of a plant cell. Preferably, it is inserted into the chromosomal DNA of the plant cell nucleus.
Further, the recombinant microorganism can be obtained by conventional methods or by gene editing, for example by operably linking a promoter sequence to a reporter gene to form a transformable construct and then transferring the construct into a plant, or by subcloning the construct into an expression vector for transient expression experiments and testing the function of the promoter or its regulatory region in such experiments. The choice of an appropriate expression vector for testing the function of the promoter or regulatory region depends on the host and on the method used to introduce the expression vector into the host; such methods are well known to those of ordinary skill in the art. For eukaryotes, the regions in the vector include regions that control transcription initiation and regions that control processing. These regions are operably linked to a reporter gene, such as the YFP, UidA, GUS, or luciferase gene. An expression vector containing a putative regulatory region located in a genomic fragment can be introduced into intact tissue, such as staged pollen, or into callus.
In a third aspect, the application of the present invention is as follows: the invention provides a method of affecting plant fertility by altering the nucleotide sequence of STS1 or by regulating the transcription and expression of the STS1 gene, thereby obtaining new rice male sterile lines that can be used to produce hybrid seed.
The method of affecting plant fertility means changing the fertility of the plant, for example rendering it male sterile, by regulating the expression of the STS1 gene. Depending on the needs of the specific application, expression of the STS1 gene in the plant can be influenced in a variety of ways to regulate male fertility. More specifically, expression of the STS1 gene can be regulated with many tools available to those of ordinary skill in the art: mutation, mutagenesis, introduction of antisense genes, co-suppression, introduction of hairpin structures, and gene editing can all be used to disrupt normal expression of the STS1 gene and thereby obtain male sterile plants.
In addition, the present invention provides sterile mutant sequences of the STS1 gene and the corresponding male sterile mutant materials. More specifically, the male sterile mutant material is obtained by mutating the endogenous STS1 gene of rice, or the nucleotide sequence of a gene highly homologous to it, so that the plant loses male fertility. "Mutation" includes, but is not limited to, gene mutation induced by physical or chemical methods, the chemical methods including treatment with a mutagen such as EMS. The mutation may be a point mutation or a DNA deletion or insertion, and may also be produced by gene silencing or gene editing techniques such as RNAi and site-directed mutagenesis.
Those skilled in the art will appreciate that the recipient materials used to create male sterile lines include, but are not limited to, all currently known conventional indica and japonica rice varieties.
Specifically, the present invention used the above methods to construct four rice male sterile mutants, each carrying a mutated male sterility gene whose nucleotide sequence is shown in SEQ ID NO.6 (sts1-1), SEQ ID NO.7 (sts1-2), SEQ ID NO.8 (sts1-3), and SEQ ID NO.9 (sts1-4), respectively. Compared with the wild type: in sts1-1, a nucleotide at a splice recognition site in an intron within the coding region of the gene is substituted, producing three aberrant transcripts; in sts1-2, a single-nucleotide substitution in exon 7 creates a premature stop codon; in sts1-3, a one-base insertion in exon 2 creates a premature stop codon; and in sts1-4, a two-base deletion in exon 9 creates a premature stop codon. In all four mutants, the mutation ultimately causes premature termination of protein translation.
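The effect of these mutation classes on translation can be illustrated with a short sketch. The coding sequence below is a hypothetical toy sequence chosen for illustration, not the STS1 sequence, and the mutation positions are likewise invented; the sketch only shows how a substitution, a one-base insertion, or a two-base deletion can each introduce a premature stop codon under the standard genetic code.

```python
# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# Hypothetical toy CDS (NOT the STS1 sequence): ATG GCT GCT GAA CTG AAG GCT TAA
wild_type = "ATGGCTGCTGAACTGAAGGCTTAA"

substitution = wild_type[:9] + "T" + wild_type[10:]  # GAA -> TAA (nonsense mutation)
insertion = wild_type[:3] + "C" + wild_type[3:]      # +1 base: frameshift
deletion = wild_type[:3] + wild_type[5:]             # -2 bases: frameshift

print(translate(wild_type))     # MAAELKA
print(translate(substitution))  # MAA
print(translate(insertion))     # MRC
print(translate(deletion))      # MC
```

In this toy case every mutant protein is shorter than the wild-type product, which mirrors the premature termination described for sts1-2, sts1-3, and sts1-4.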
In a fourth aspect, the present invention also relates to a method of restoring fertility to a rice male sterile line, comprising the step of introducing the STS1 gene, by conventional genetic transformation, into a rice male sterile line obtained by the application described above, so that the mutant reverts to the wild-type phenotype.
In a fifth aspect, the present invention also relates to the use of a rice sterile line in rice seed production, in which a rice male sterile mutant line constructed as described above is used as the female parent for hybrid breeding. Specifically, in certain embodiments, the STS1 gene provided by the invention can be used to propagate and maintain male sterile lines obtained by mutation of STS1 or of other similar fertility-related genes.
More specifically, propagation and maintenance of the male sterile line means that a homozygous recessive nuclear male sterile mutant is used as the transformation recipient, and three tightly linked target genes are transformed into the sterile mutant recipient plant. The three target genes are a fertility restoration gene, a pollen inactivation gene, and a screening gene. The fertility restoration gene restores fertility to the sterile recipient; the pollen inactivation gene inactivates pollen carrying the introduced exogenous genes, that is, such pollen loses its ability to fertilize; and the screening gene is used to sort transgenic from non-transgenic seeds. The sorted non-transgenic seeds serve as the sterile line for hybrid seed production, while the transgenic seeds serve as the maintainer line, providing a continuous and stable supply of the sterile line. The pollen-specific promoter provided by the invention can be used for specific expression of exogenous genes in pollen, avoiding the adverse effects of continuous expression in other plant tissues. It can also be used for functional analysis and identification of genes related to pollen growth and development, for creating male sterile lines and restorer lines, and in pollen abortion experiments, thereby avoiding biosafety problems caused by transgene drift or pollen escape. It is therefore of great significance for creating plant male sterile lines and restorer lines.
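The seed sorting described above can be illustrated by enumerating the gametes of a selfed maintainer plant. This is a minimal sketch under stated assumptions that go beyond the text: the maintainer is taken to carry a single hemizygous copy of the three-gene cassette (written `T`, with `-` for its absence) in the homozygous sterile background, and pollen inactivation is taken to be complete.

```python
from collections import Counter
from itertools import product


def self_maintainer() -> dict:
    """Seed classes from selfing a maintainer plant (assumed hemizygous T/-)."""
    eggs = ["T", "-"]     # the cassette is transmitted normally through the egg
    pollen = ["T", "-"]   # pollen genotypes produced before inactivation
    # Assumption: the pollen inactivation gene removes every T-carrying pollen grain.
    viable_pollen = [p for p in pollen if p != "T"]

    seeds = Counter()
    for egg, pol in product(eggs, viable_pollen):
        if "T" in (egg, pol):
            seeds["transgenic (maintainer)"] += 1
        else:
            seeds["non-transgenic (sterile)"] += 1
    total = sum(seeds.values())
    return {kind: count / total for kind, count in seeds.items()}


print(self_maintainer())
# {'transgenic (maintainer)': 0.5, 'non-transgenic (sterile)': 0.5}
```

Under these assumptions, selfing the maintainer gives the two seed classes in equal proportion, and the screening gene is what allows them to be separated.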
Those skilled in the art will appreciate that suitable screening genes include, but are not limited to, chloramphenicol resistance genes, hygromycin resistance genes, streptomycin resistance genes, spectinomycin resistance genes, sulfonamide resistance genes, glyphosate resistance genes, and glufosinate resistance genes. The selectable marker gene may also be a red fluorescent protein gene, cyan fluorescent protein gene, yellow fluorescent protein gene, luciferase gene, green fluorescent protein gene, the anthocyanin p1 gene, or the like.
In addition, the transgenic plants of the present invention are prepared using transformation methods known to those skilled in plant biotechnology. Any method may be used to transform the recombinant expression vector into plant cells to produce the transgenic plants of the invention. Transformation methods include direct and indirect methods. Suitable direct methods include polyethylene glycol-induced DNA uptake, liposome-mediated transformation, particle bombardment with a gene gun, electroporation, and microinjection. In specific embodiments, the invention uses Agrobacterium-based transformation.
Compared with the prior art, the present invention can achieve the following technical effects:
1) By manipulating the rice male fertility gene STS1 and its encoded protein, the invention yields rice variants with altered male reproductive development, enabling deliberate control of the rice reproductive process.
2) The rice mutants obtained show no obvious difference from their source parent during the vegetative stage. After entering the reproductive stage, male reproductive development is abnormal and the pollen aborts, giving completely male sterile lines whose sterility is stable, is unaffected by environmental conditions, and can be restored by a wild-type transgene.
3) The gene, and the sterile lines produced by mutating it, provide necessary elements for building a third-generation hybrid breeding system. The male sterile lines produced by mutation of the gene can be used to produce hybrid seed, which is significant for overcoming the limitations of, and improving on, existing "three-line" and "two-line" hybrid technologies and has great application value in hybrid rice development and agricultural production.
Of course, a product implementing the invention need not achieve all of the above technical effects at the same time.
Description of the drawings
Fig. 1 shows the phenotype resulting from mutation of the STS1 gene in Example 1 of the present invention, wherein A is the plant architecture of wild-type 9311 and the mutant; B is the panicles of the wild type and the mutant; C is the florets of the wild type and the mutant; D is the anther morphology of the wild type and the mutant; and E is iodine staining of pollen from the mutant and the wild type.
Fig. 2 shows the cloning of the STS1 gene in Example 2 of the present invention, wherein A and B are the SNP-index maps of mutants sts1-1 and sts1-2, respectively; C is the gene structure of STS1 with the variant sites and types; D shows the three aberrant transcripts produced by the variant in sts1-1; and E is the verification that the variant sites of sts1-1 and sts1-2 co-segregate with the phenotype.
Fig. 3 shows the phenotypes of the mutants obtained by gene editing of STS1 in Example 3 of the present invention, wherein A is the gene structure of STS1 with the editing sites of mutants sts1-3 and sts1-4 and the base changes after successful editing; B is the plant architecture of wild-type Nipponbare and mutants sts1-3 and sts1-4; C is the anther morphology of wild-type Nipponbare and mutants sts1-3 and sts1-4; and D is iodine staining of pollen from wild-type Nipponbare and mutants sts1-3 and sts1-4.
Fig. 4 shows the phenotype of the fertility-restored lines of the sts1-1 mutant in Example 4 of the present invention, wherein A is the structure of the STS1 gene used to restore fertility; B is the plant architecture of wild-type 9311, the mutant, and the complementation line; C is the florets of wild-type 9311, the mutant, and the complementation line; D is the anther morphology of wild-type 9311, the mutant, and the complementation line; and E is iodine staining of pollen from wild-type 9311, the mutant, and the complementation line.
Detailed description of the embodiments
In the description of the present invention, it should be noted that where specific conditions are not indicated in the examples, conventional conditions or the conditions recommended by the manufacturer were used. Reagents or instruments for which no manufacturer is indicated are conventional, commercially available products.
Using bioinformatics techniques well known to those skilled in the art, the inventors identified in the rice genome a complete rice male fertility control gene, STS1, whose nucleotide sequence is shown in SEQ ID NO.1 and whose encoded protein has the amino acid sequence shown in SEQ ID NO.2. The full-length ORF of the STS1 gene is 1986 bp and encodes 661 amino acids.
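The relationship between the ORF length and the protein length quoted above can be checked with simple arithmetic. The sketch below assumes that the 1986 bp figure includes the terminal stop codon; the function name is illustrative.

```python
def orf_protein_length(orf_bp: int, includes_stop: bool = True) -> int:
    """Number of amino acids encoded by an ORF of the given length in bp."""
    if orf_bp % 3 != 0:
        raise ValueError("an ORF length must be a multiple of 3")
    codons = orf_bp // 3
    # The stop codon occupies one codon but does not encode an amino acid.
    return codons - 1 if includes_stop else codons


print(orf_protein_length(1986))  # 661 (662 codons, one of which is the stop codon)
```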
The present invention is described in further detail below with reference to the accompanying drawings and specific examples, which explain rather than limit the invention.
Example 1
Phenotype of STS1 gene mutation
Conventional rice variety 9311 was mutagenized using mutagenesis techniques routine in the art, and two male sterile mutants, sts1-1 and sts1-2, were found in the resulting mutant library; their nucleotide sequences are shown in SEQ ID NO.6 and SEQ ID NO.7, respectively. After several generations of crossing with the wild type (WT), both mutants were stably inherited. Phenotypic analysis showed that the mutants resembled the wild type in vegetative growth but could not produce mature pollen grains, that is, they exhibited pollen-free male sterility (Fig. 1). The same gene, STS1, was subsequently cloned from both mutants by genome resequencing; see Example 2 for the specific steps.
Example 2
Cloning of the STS1 gene
1. Cloning of the STS1 mutations by resequencing
The two mutants sts1-1 and sts1-2 were each crossed, as female parent, with the wild type to construct segregating populations. Male sterile and normally fertile individuals from each segregating population were pooled separately and sequenced, and the sequencing data were compared with the wild-type genome to obtain maps of sites with an SNP index of 1 (Fig. 2, panels A and B). The variant sites in these maps were analyzed further, and co-segregation of each variant site with the phenotype was verified in individual plants of the segregating populations by PCR sequencing. In each population, a single SNP was found to co-segregate with the phenotype; the primers used to verify these sites were sts1-1F (SEQ ID NO.10) and sts1-1R (SEQ ID NO.11), and sts1-2F (SEQ ID NO.12) and sts1-2R (SEQ ID NO.13), respectively. The gene containing these sites is STS1. Fig. 2, panels C, D, and E show the variants in sts1-1 and sts1-2, the three aberrant transcripts produced by the sts1-1 mutation, and the co-segregation analysis for sts1-1 and sts1-2. This confirmed that sts1-1 and sts1-2 are different variants of the same gene, STS1. In sts1-1, a nucleotide at a splice recognition site in an intron within the coding region is substituted, giving the DNA sequence shown in SEQ ID NO.6 and producing three aberrant transcripts. In sts1-2, a single nucleotide in exon 7 of the coding region is substituted, giving the DNA sequence shown in SEQ ID NO.7; this creates a premature stop codon and prematurely terminates protein translation, yielding a male sterile line.
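The SNP-index filtering step can be sketched as follows. The SNP index at a site is taken here as the fraction of reads in a pool that carry the variant allele, so a recessive causal variant is expected to reach an index of 1 in the male sterile pool. The read counts, chromosome names, and minimum-depth cutoff below are hypothetical values for illustration and do not come from the patent.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Site:
    chrom: str
    pos: int
    ref_reads: int  # reads matching the wild-type genome
    alt_reads: int  # reads carrying the variant allele


def snp_index(site: Site) -> float:
    """Fraction of reads at the site that carry the variant allele."""
    total = site.ref_reads + site.alt_reads
    return site.alt_reads / total if total else 0.0


def candidate_sites(sterile_pool: List[Site], min_depth: int = 10,
                    threshold: float = 1.0) -> List[Site]:
    """Keep sufficiently covered sites whose SNP index reaches the threshold."""
    return [s for s in sterile_pool
            if s.ref_reads + s.alt_reads >= min_depth and snp_index(s) >= threshold]


# Hypothetical read counts from a male sterile pool.
pool = [
    Site("chr1", 100, ref_reads=5, alt_reads=15),  # index 0.75: not fixed in the pool
    Site("chr3", 200, ref_reads=0, alt_reads=20),  # index 1.0: candidate
    Site("chr5", 300, ref_reads=0, alt_reads=4),   # index 1.0 but too few reads
]
for site in candidate_sites(pool):
    print(site.chrom, site.pos, snp_index(site))
```

Sites that pass such a filter are only candidates; as the text describes, each must still be confirmed by co-segregation with the phenotype in individual plants.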
The sequence of the wild-type STS1 gene is shown in SEQ ID NO.1, and the sequence of the encoded protein is shown in SEQ ID NO.2. The full-length ORF of the STS1 gene is 1986 bp and encodes 661 amino acids.
2. Cloning of the full-length STS1 cDNA
First-strand cDNA was synthesized using RNA from the normal wild-type rice variety Nipponbare as template. Oligonucleotides corresponding to the 5' and 3' ends of the STS1 ORF (SEQ ID NO.14 and SEQ ID NO.15) were used as PCR primers, and amplification with the high-fidelity polymerase PrimeSTAR gave a full-length cDNA product of the STS1 gene of about 1.9 kb. The product was cloned by EcoRI digestion into the corresponding restriction site in the multiple cloning site of vector pEntry, and the recombinant was verified by sequencing. The resulting intermediate plasmid was named pEntry-STS1.
The oligonucleotide sequence at the 5' end is (SEQ ID NO.14):
5'-aaaaaagcaggctccgaattcatggccacgacgctgacg-3'
The oligonucleotide sequence at the 3' end is (SEQ ID NO.15):
5'-caagaaagctgggtcgaattcttactgtgcaactgttccatctgtt-3'
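The design of these two primers can be checked with a few lines of code. The sketch below splits each primer at the EcoRI recognition site (gaattc), compares the gene-specific part of the 5' primer with the first 18 nucleotides of SEQ ID NO.1, and reverse-complements the gene-specific part of the 3' primer to show that it ends in a stop codon. The variable and function names are illustrative.

```python
def reverse_complement(seq: str) -> str:
    """Reverse complement of a lowercase DNA sequence."""
    comp = {"a": "t", "t": "a", "g": "c", "c": "g"}
    return "".join(comp[base] for base in reversed(seq.lower()))


FWD = "aaaaaagcaggctccgaattcatggccacgacgctgacg"         # SEQ ID NO.14
REV = "caagaaagctgggtcgaattcttactgtgcaactgttccatctgtt"  # SEQ ID NO.15
ORF_START = "atggccacgacgctgacg"  # first 18 nt of SEQ ID NO.1
ECORI = "gaattc"                  # EcoRI recognition site

# Everything after the EcoRI site is the gene-specific part of each primer.
fwd_gene_part = FWD.split(ECORI, 1)[1]
rev_gene_part = reverse_complement(REV.split(ECORI, 1)[1])

print(fwd_gene_part == ORF_START)  # True: the 5' primer starts at the ATG
print(rev_gene_part)               # aacagatggaacagttgcacagtaa
print(rev_gene_part[-3:])          # taa: the 3' primer covers the stop codon
```

Because the 3' primer includes the stop codon, the amplified ORF ends in TAA, which is consistent with an ORF length that counts the stop codon.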
Example 3
Creating new STS1 mutant lines by gene editing
The commercially available CRISPR/Cas9 gene editing vector BGK030 was used as the rice transformation vector. The vector carries a bacterial origin of replication (ori), a kanamycin resistance gene (Kanr), a hygromycin resistance gene (Hygr), the gene editing effector gene (Cas9), a 35S promoter, the NOS gene termination signal sequence, and a multiple cloning site (MCS) for ligating the fragment of interest. Two guide sequences (sgRNAs) targeting the STS1 gene, shown in SEQ ID NO.16 and SEQ ID NO.17, were each inserted at the restriction site, giving the transformation plasmids BGK030-STS1-3 and BGK030-STS1-4.
4. Rice transformation with the STS1 constructs
BGK030-STS1-3 and BGK030-STS1-4 were introduced into Agrobacterium strain EHA105 by the freeze-thaw method and used to infect callus of the wild-type japonica variety Nipponbare; transgenic plants were obtained after culture and differentiation. Transformed plants were selected with hygromycin; total leaf DNA was extracted from positive plants, and transformants were confirmed by PCR. Fertility was scored in the transgenic T1 generation to verify the function of the STS1 gene. As shown in Fig. 3, plants with a one-base insertion or a two-base deletion at the STS1 target sites were male sterile; the resulting mutants are sts1-3 and sts1-4, whose nucleotide sequences are shown in SEQ ID NO.8 and SEQ ID NO.9, respectively. That is, sts1-3 carries a one-nucleotide insertion in exon 2 of the STS1 coding region, and sts1-4 carries a two-nucleotide deletion in exon 9 of the STS1 coding region.
Example 4
Fertility restoration of STS1 mutants
Construction of the STS1 complementation vector: in this example, pCAMBIA1300 was first digested with PstI and SacI. A wild-type STS1 complementation fragment of about 6.3 kb containing the promoter, whose sequence is shown in SEQ ID NO.5, was amplified with the primers of SEQ ID NO.3 and SEQ ID NO.4 and ligated into the PstI and SacI sites of pCAMBIA1300; the recombinant was verified by restriction digestion and sequencing. Transgenic plants of the mutant expressing the STS1 complementation construct were then obtained by the transformation method described in Example 3 and examined in the T1 generation. As shown in Fig. 4, transgene-positive plants of the sts1-1 mutant that had received the complementation vector recovered fertility.
In summary, the present invention uses cloning and gene editing to substitute, insert, or delete individual nucleotides of STS1, causing loss of function of the encoded protein and thereby yielding male sterile lines. The resulting male sterile lines are completely sterile, are unaffected by environmental conditions, and regain fertility when the wild-type gene is introduced. The male fertility control gene provided by the invention, and the sterile lines based on it, provide the sterile-line germplasm needed for new hybrid breeding systems and are of great significance for hybrid rice breeding.
The examples above describe only several embodiments of the present invention. Although the description is specific and detailed, it should not be construed as limiting the scope of the patent. It should be noted that those of ordinary skill in the art can make modifications and improvements without departing from the concept of the invention, and these all fall within its scope of protection. The scope of protection of this patent is therefore defined by the appended claims.
Sequence listing
<110> Sichuan Agricultural University
<120> A rice male fertility control gene STS1, its encoded protein and application
<160> 17
<170> SIPOSequenceListing 1.0
<210> 1
<211> 1986
<212> DNA
<213> STS1 gene (Oryza sativa)
<400> 1
atggccacga cgctgacgcc gacgcagcgc tacgcggcgg gggcgctgct ggcgctcgcg 60
ctccgccagg cccagatcca ccagtccgtc ctcctcggcg cccaccacca ccacgacgac 120
gacgacgagg aacaggggcg caccagcacc agcagcggcg gcggcggcgg cagctcctcc 180
tcctcctcca actccggcgc cggcgccgac gccgacctct ggacacacga ctcccacggc 240
ctcctccgcc ccgtcttcag gttcctggag atcgatccca aggcgtggtc ggggctggag 300
gagacggcgg cgtcgtccga ggccaagcat cacatcggcg cgtttctaag gataatattc 360
gaagaagatg gcgaaagctc ctcagacaga tccgttcagg agcttgcctt ggcaaaagga 420
gttgatgtga tggtaatgag tttgggcaat gacagtgaag tgggtaacac aattaaaggt 480
ggggatcaag atgctttgcc tagcagctct ggaacagata aatccccagg ggaaagctct 540
catgatgacc agttaggaat caataaattg accttagatg atattcctgc taacaatcac 600
cggaagatgg cattgctgtt tgctcttctc tcggcttgtg ttgctgataa acctgtatca 660
caagaggaag aggacagaaa atcaacccgc ttcagaaagg gttatgatgc tcggcatcgt 720
gttgctcttc ggctgctatc tacatggctt gatgtgaagt ggatcaaaat ggaagcaatt 780
gaggttatgg ttgcatgctc tgccatggca gcagcgaaag aacaagaaca atcacaagaa 840
agcgcatcac ccaaaagtaa atgggaaaaa tggaagcgtg gaggaataat tggtgcagct 900
gctttgactg gaggagcact gttggctatt actgggggtt tagctgctcc agcaattgct 960
gcagggtttg gtgctcttgc tccaaccctg ggcacacttg tccctgttat aggagctagc 1020
ggatttgctg ctatggctac cgctgcagga tctgttgctg gctctgtggc agttgctgca 1080
tcatttggag ctgctggagc tggcttgacg gggagcaaaa tggccagaag aattgggagc 1140
gtgaaagaat ttgagttcaa acctattggt gaaaaccata atcagggtcg gcttgcagtt 1200
ggcatcttaa tttccggatt tgcttttgat gaggatgact tttgtagacc ttgggaagga 1260
tggcaggata acctagagag gtacatcctt caatgggagt ctaagcatat aattgcagtg 1320
agtacagcaa tacaggattg gctgacatca agacttgcta tggaattgat gaagcaaggt 1380
gcaatgagaa ccgtattaag tggtcttctt gcagcatttg catggccagc tacactgctt 1440
gcggctacag actttattga tagtaaatgg tctgttgcta ttgacagatc agataaggca 1500
ggaaaaatgc ttgctgaagt gcttctcaag ggattgcaag gaaataggcc tgtgactctc 1560
attgggtttt cactaggtgc acgtgttata tttaagtgtt tacaggaact tgctttatcg 1620
agtgataatg agggacttgt tgagagggtg gttcttcttg gtgctccagt ttctgtgaaa 1680
ggcgagcggt gggaggctgc aaggaagatg gttgctggaa gatttgtgaa tgtgtactcc 1740
acagatgact ggattcttgg tgttactttt cgtgcgagtc tgctaacaca agggttggct 1800
ggcatccagg ccattgatgt ccctggtgtt gaaaatgtcg atgttacgga gcttgtcgat 1860
ggccattcgt cttacctgtc ggcagcacaa cagatactgg agcatcttga actaaatacc 1920
tactatccgg tttttgttcc cctgagcgca gccaacgaag aaacagatgg aacagttgca 1980
cagtaa 1986
<210> 2
<211> 661
<212> PRT
<213> Protein encoded by the STS1 gene (Oryza sativa)
<400> 2
Met Ala Thr Thr Leu Thr Pro Thr Gln Arg Tyr Ala Ala Gly Ala Leu
1 5 10 15
Leu Ala Leu Ala Leu Arg Gln Ala Gln Ile His Gln Ser Val Leu Leu
20 25 30
Gly Ala His His His His Asp Asp Asp Asp Glu Glu Gln Gly Arg Thr
35 40 45
Ser Thr Ser Ser Gly Gly Gly Gly Gly Ser Ser Ser Ser Ser Ser Asn
50 55 60
Ser Gly Ala Gly Ala Asp Ala Asp Leu Trp Thr His Asp Ser His Gly
65 70 75 80
Leu Leu Arg Pro Val Phe Arg Phe Leu Glu Ile Asp Pro Lys Ala Trp
85 90 95
Ser Gly Leu Glu Glu Thr Ala Ala Ser Ser Glu Ala Lys His His Ile
100 105 110
Gly Ala Phe Leu Arg Ile Ile Phe Glu Glu Asp Gly Glu Ser Ser Ser
115 120 125
Asp Arg Ser Val Gln Glu Leu Ala Leu Ala Lys Gly Val Asp Val Met
130 135 140
Val Met Ser Leu Gly Asn Asp Ser Glu Val Gly Asn Thr Ile Lys Gly
145 150 155 160
Gly Asp Gln Asp Ala Leu Pro Ser Ser Ser Gly Thr Asp Lys Ser Pro
165 170 175
Gly Glu Ser Ser His Asp Asp Gln Leu Gly Ile Asn Lys Leu Thr Leu
180 185 190
Asp Asp Ile Pro Ala Asn Asn His Arg Lys Met Ala Leu Leu Phe Ala
195 200 205
Leu Leu Ser Ala Cys Val Ala Asp Lys Pro Val Ser Gln Glu Glu Glu
210 215 220
Asp Arg Lys Ser Thr Arg Phe Arg Lys Gly Tyr Asp Ala Arg His Arg
225 230 235 240
Val Ala Leu Arg Leu Leu Ser Thr Trp Leu Asp Val Lys Trp Ile Lys
245 250 255
Met Glu Ala Ile Glu Val Met Val Ala Cys Ser Ala Met Ala Ala Ala
260 265 270
Lys Glu Gln Glu Gln Ser Gln Glu Ser Ala Ser Pro Lys Ser Lys Trp
275 280 285
Glu Lys Trp Lys Arg Gly Gly Ile Ile Gly Ala Ala Ala Leu Thr Gly
290 295 300
Gly Ala Leu Leu Ala Ile Thr Gly Gly Leu Ala Ala Pro Ala Ile Ala
305 310 315 320
Ala Gly Phe Gly Ala Leu Ala Pro Thr Leu Gly Thr Leu Val Pro Val
325 330 335
Ile Gly Ala Ser Gly Phe Ala Ala Met Ala Thr Ala Ala Gly Ser Val
340 345 350
Ala Gly Ser Val Ala Val Ala Ala Ser Phe Gly Ala Ala Gly Ala Gly
355 360 365
Leu Thr Gly Ser Lys Met Ala Arg Arg Ile Gly Ser Val Lys Glu Phe
370 375 380
Glu Phe Lys Pro Ile Gly Glu Asn His Asn Gln Gly Arg Leu Ala Val
385 390 395 400
Gly Ile Leu Ile Ser Gly Phe Ala Phe Asp Glu Asp Asp Phe Cys Arg
405 410 415
Pro Trp Glu Gly Trp Gln Asp Asn Leu Glu Arg Tyr Ile Leu Gln Trp
420 425 430
Glu Ser Lys His Ile Ile Ala Val Ser Thr Ala Ile Gln Asp Trp Leu
435 440 445
Thr Ser Arg Leu Ala Met Glu Leu Met Lys Gln Gly Ala Met Arg Thr
450 455 460
Val Leu Ser Gly Leu Leu Ala Ala Phe Ala Trp Pro Ala Thr Leu Leu
465 470 475 480
Ala Ala Thr Asp Phe Ile Asp Ser Lys Trp Ser Val Ala Ile Asp Arg
485 490 495
Ser Asp Lys Ala Gly Lys Met Leu Ala Glu Val Leu Leu Lys Gly Leu
500 505 510
Gln Gly Asn Arg Pro Val Thr Leu Ile Gly Phe Ser Leu Gly Ala Arg
515 520 525
Val Ile Phe Lys Cys Leu Gln Glu Leu Ala Leu Ser Ser Asp Asn Glu
530 535 540
Gly Leu Val Glu Arg Val Val Leu Leu Gly Ala Pro Val Ser Val Lys
545 550 555 560
Gly Glu Arg Trp Glu Ala Ala Arg Lys Met Val Ala Gly Arg Phe Val
565 570 575
Asn Val Tyr Ser Thr Asp Asp Trp Ile Leu Gly Val Thr Phe Arg Ala
580 585 590
Ser Leu Leu Thr Gln Gly Leu Ala Gly Ile Gln Ala Ile Asp Val Pro
595 600 605
Gly Val Glu Asn Val Asp Val Thr Glu Leu Val Asp Gly His Ser Ser
610 615 620
Tyr Leu Ser Ala Ala Gln Gln Ile Leu Glu His Leu Glu Leu Asn Thr
625 630 635 640
Tyr Tyr Pro Val Phe Val Pro Leu Ser Ala Ala Asn Glu Glu Thr Asp
645 650 655
Gly Thr Val Ala Gln
660
<210> 3
<211> 19
<212> DNA
<213> Artificial Sequence
<400> 3
gctcatcttc gggctcacc 19
<210> 4
<211> 20
<212> DNA
<213> Artificial Sequence
<400> 4
gacacggcca gaaacagcaa 20
<210> 5
<211> 6353
<212> DNA
<213> STS1 gene (Oryza sativa)
<400> 5
gctcatcttc gggctcacca aggacgtcat ctttcgcgcc gccttcggca cgcgcgacgg 60
cggcggccac ggcgagctgg aggttctcct gcaggagttc tccaagctgt tcggcgcgtt 120
caacgtcggg gacttcatcc cctggctcgc gtggctcgac ccgcacggca tcaaccgacg 180
gctccgcgcg gcgcgcgccg ccctcgacag cgtcatcgac aggatcatcg acgagcacgt 240
cagcaacccc gccggcgacg aggacgccga catggtggac gacatgctcg cgttcctcga 300
cgaggccgga cgagaccaaa ccggcggcgg cggcgagctg cagggcacgc tccggctgac 360
gcgcgacaac atcaaggcca tcataatggt acgtgccatt ctgacaccac tgccatggca 420
tgcacgtcgc gttgcttgat ggagttggcg acgacaagtg caggacttcg tcttcggcgg 480
gacggagacg gtggcgtcgg cgatcgagtg ggcgatggcg gagttgctgc acagccccgg 540
cgatctccgg cggctgcagg cggagctcgc cgacgtggtg gggctcgggc gcggcgtgga 600
ggagggcgac ctggagaagc tccccttcct caggtgcgtc gccatggaga cgctgcgtct 660
ccacccgccg atcccgctgc tcctccacga ggcggcggcg gactgcgtcg tcggcgggta 720
ctccgtgccg aggggcgccc gcgtggtggt caacgtgtgg agcgtcggcc gcgacgccgg 780
cgcgtggaag ggcgacgccg gcgcgttccg gccggcgagg ttcatggcgg gcggcgaggc 840
ggcggggatg gacctcaggg gcggatgctt cgagctcctg ccgttcgggt cggggcggag 900
ggcgtgcccc gccatcgtgc tcgggatgta cgagctggag ctcgtcgtcg cgcgcctcgt 960
ccacgcgttc gggtgggcgc cgccgggcgg cgtggcgccg gaggagctcg acatggcgga 1020
cggcttcggc ctcaccgcgc cgcgcgccgc gaggctccgc gccgtgccga cgccccggct 1080
cacctgcccg atgtgacgcg cgttctcgat ttggcgcgag agatgcgcaa cgcagctcga 1140
agtgtgactg tgacagtgtg agcgtgtaat ctggtgtgga catcgtatgc atcatgtgtg 1200
cttatggata attaaatata taaataatat ttggatgtgg atttcggctg aaattttctg 1260
tcatttcgaa aaaaattcgt cctagaattc aaatagaata ttttttttct cattattttt 1320
tatagttgca cattggctac attatctgta gaagcttctt gattcacgag atctagcggg 1380
cgtgacaata tggtcaaaga attctttacc ctagacatgg gcgtcgatat ggtcgtaaag 1440
ctgctcttca gcttagtagc cacatccata atttaatctt aggattattt ttacagttga 1500
ttccgatatt ttttcatcgt aaacatgggg ttttacatca ttaaaaaccc acatataaaa 1560
tcctttccct taaaagaatg atggtctttg aagcccttat ttcttttttt aatcttcgag 1620
tatgtatgaa tagtttgatt acgtgcattt taattatgta gagatttgga atgctattcg 1680
acaattatat tatcttaatg ttacattaag tgagatgctt ttattattta aaaatcgatt 1740
atattttaaa acaaaatctg aagcactccg tggaccgtgg ttacacgtgc caggtcgacg 1800
tgttcttctg gaagaaatct atgtctactt ttgaaatgga atatactacc tccgttaaaa 1860
ataagtacaa acgtaaataa ataagtacaa acgtaaatat ccgtatgaac gttcgtctta 1920
ttccaaaaat ttataaaaat aaattataaa aaaacagtca catataaaat attatttatg 1980
ttttataatc taatagtaac aaaaaaatat taatcataaa agataaagaa aaatttaaat 2040
aagataaacg ctgaatataa ataatatttt ataaagctgt acttatttta agacggaaag 2100
agtaaaaaat tttggaaaaa tttctagtgc agcatgctgc tcatttggga ggtgtgtgct 2160
gctgctctcc agtctccccc aaccaaaatt tgaccgagtt tggcacgaaa gaaaccgcct 2220
cgacgccgtc caacggaagc cagccgaggc gaccaaccca agggtaggcc taggcgtaaa 2280
ccctgcacct gaatcgatcg ccggcgatgg ccacgacgct gacgccgacg cagcgctacg 2340
cggcgggggc gctgctggcg ctcgcgctcc gccaggccca gatccaccag tccgtcctcc 2400
tcggcgccca ccaccaccac gacgacgacg acgaggaaca ggggcgcacc agcaccagca 2460
gcggcggcgg cggcggcagc tcctcctcct cctccaactc cggcgccggc gccgacgccg 2520
acctctggac acacgactcc cacggcctcc tccgccccgt cttcaggtgt tttctcgctc 2580
ctctccgctt gtaaatttgg ggggtttgag atgaatgctt ccgctttact tggcgatcgg 2640
ggggctctgc taggttcctg gagatcgatc ccaaggcgtg gtcggggctg gaggagacgg 2700
cggcgtcgtc cgaggccaag catcacatcg gcgcggtacg gcactcactg cattttctgc 2760
cctcgtattg taattggccg tgcgttttgg tcagtgttgc agtgatggtt tggggcttag 2820
gtgtttgatt atatgcctac aagactgttc aacttcaatt tccgcgtgtg cttgaccagt 2880
gaaatttgtg ttttcgtagc atctcatatc actctcattt tgtagtttct aaggataata 2940
ttcgaagaag atggcgaaag ctcctcagac agatccgttc aggagcttgc cttggcaaaa 3000
ggagttgatg tgatggtaat gagtttgggc aatgacagtg aagtgggtaa cacaattaaa 3060
ggtggggatc aagatgcttt gcctagcagc tctggaacag ataaatcccc aggggaaagc 3120
tctcatgatg accagttagg aatcaataaa ttgaccttag atgatattcc tgctaacaat 3180
caccggaaga tggcattgct gtttgctctt ctctcggctt gtgttgctga taaacctgta 3240
tcacaagagg aagaggacag aaaatcaacc cgcttcagaa agggttatga tgctcggcat 3300
cgtgttgctc ttcggctgct atctacatgg cttgatgtga agtggatcaa aatggtttgt 3360
acagactaaa cttttttgga ttcagcagag taatcactat gggtaccaaa caatatccta 3420
aggcaattaa agaaaaaaga acccccaatg tttttcatgt ttcaccaaat ttgtcaccaa 3480
ccatgctttt tcgctgcagg aagcaattga ggttatggtt gcatgctctg ccatggcagc 3540
agcgaaagaa caagaacaat cacaagaaag cgcatcaccc aaaagtaaat gggaaaaatg 3600
gaagcgtgga ggaataattg gtgcagctgc tttgactgga ggagcactgt tggctattac 3660
tgggggtatg aaaaaacttc agaaattctg ctggacgctg aacatctgaa ctagtttctg 3720
tcgctcatat tactctaaac ggttgatatt ctcttccaac ttgacaggtt tagctgctcc 3780
agcaattgct gcagggtttg gtgctcttgc tccaaccctg ggcacacttg tccctgttat 3840
aggagctagc ggatttgctg ctatggctac cgctgcagga tctgttgctg gctctgtggc 3900
agttgctgca tcatttggag gtgtgtctac atgttcttca aaacccttct ggtgatgttg 3960
ttttgagctt tcgtttcgtc atgctacatt actatcagct gcagtctccc tttccttttg 4020
tcatgagggt tgcaatattg tttagtgaac ttacccaaga atatgcacac aacactttta 4080
ggggtgcatt ttacaattag ataatagttc gatggattac cagaagaagg ataagactta 4140
tacaatgtat gacccaaaat tgtgagaaac atatggatga aagcgtaatg agaatagttg 4200
tggtgtggaa attttctatt tattcatgct tatcctttga aatgatattt ttcaattagc 4260
tgctggagct ggcttgacgg ggagcaaaat ggccagaaga attgggagcg tgaaagaatt 4320
tgagttcaaa cctattggtg aaaaccataa tcagggtgtg agttcctttt ctctttctga 4380
tgtcacaaca aatttcattc tactagattt ttgttcaccc ttttagtagt caaactgatc 4440
tggtctggtt aaatgttctc aacactttca gcggcttgca gttggcatct taatttccgg 4500
atttgctttt gatgaggatg acttttgtag accttgggaa ggatggcagg ataacctaga 4560
gaggtataga gtggtttttt tgacatgttc tttttgctga tcactattct agtactttgt 4620
catgtgttac cagctaagag catccttaaa aatataattt caagagttga ataatgtctg 4680
gcaggtacat ccttcaatgg gagtctaagc atataattgc agtgagtaca gcaatacagg 4740
attggctgac atcaagtaat atttgtagtt ttgcccattt acactatcag caaaaacttc 4800
taatttacac tgttatttat gattcaggac ttgctatgga attgatgaag caaggtgcaa 4860
tgagaaccgt attaagtggt cttcttgcag catttgcatg gccagctaca ctgcttgcgg 4920
ctacagactt tattgatagt aaatggtctg ttgctattga caggtaattg caaactctcc 4980
catttctctt atgttgcttg cagagttgca gtgcacactg actatctaca gtattggtat 5040
tcttctgttg atcgacacat aagtttgtgt attcaaattt atgaagatta tgacgtaaca 5100
tttagacaaa cttttaatgt atatatccca cagtacatat attgcagtta aagacatcct 5160
agcaatttgt taacttagcc aatcaaaatc aactgtatct tctcagatca gataaggcag 5220
gaaaaatgct tgctgaagtg cttctcaagg gattgcaagg aaataggtaa ctccatatgc 5280
atttatttca ttagcgatgt tggccgctca gaaatatgcc cccctaatgc caattttttc 5340
ccaggcctgt gactctcatt gggttttcac taggtgcacg tgttatattt aagtgtttac 5400
aggaacttgc tttatcgagt gataatggta cctgttctgt aaccatctcc ttcatccata 5460
agattttcct ttcttggtgg caaattatca gttttaacca tctccatctc ttgtttgcaa 5520
ccagagggac ttgttgagag ggtggttctt cttggtgctc cagtttctgt gaaaggcgag 5580
cggtgggagg ctgcaaggaa ggtaatggcg atgaaatcat ttttttctct attatctctg 5640
taggaatggt gcaacgtaac aacaagaaac agtgaagttg ctgttcattt tatcccagct 5700
tcttccctcg gcgcttaatt cactcacttg tcctcatggc agatggttgc tggaagattt 5760
gtgaatgtgt actccacaga tgactggatt cttggtgtta cttttcgtgc gaggtaaaca 5820
gatttatgat ttcagagctc ctagaaagct catatcacca ttcccccatg tgagctatga 5880
cttaagtaat cttctcgtgc agtctgctaa cacaagggtt ggctggcatc caggccattg 5940
atgtccctgg tgttgaaaat gtaagcatct ggggtgccat tcttgtgttc ttacgtgatt 6000
ttcatctatc ttttacacgt ctttgtgtac aatgcaattt acaggtcgat gttacggagc 6060
ttgtcgatgg ccattcgtct tacctgtcgg cagcacaaca gatactggag catcttgaac 6120
taaataccta ctatccggtt tttgttcccc tgagcgcagc caacgaagaa acagatggaa 6180
cagttgcaca gtaacgtgtg ggcaggattc gatgaaaatc gtatgtagac aaccaaatgt 6240
aacagactat atatacaacc aaatgtaaca gactaactcg ttcaaactgt aaaccagtct 6300
cagcctgagg cgaatgtgtc tggttggtga ggcttgctgt ttctggccgt gtc 6353
<210> 6
<211> 4183
<212> DNA
<213> Rice male sterile mutant sts1-1 (Oryza sativa)
<400> 6
agtctccccc aaccaaaatt tgaccgagtt tggcacgaaa gaaaccgcct cgacgccgtc 60
caacggaagc cagccgaggc gaccaaccca agggtaggcc taggcgtaaa ccctgcacct 120
gaatcgatcg ccggcgatgg ccacgacgct gacgccgacg cagcgctacg cggcgggggc 180
gctgctggcg ctcgcgctcc gccaggccca gatccaccag tccgtcctcc tcggcgccca 240
ccaccaccac gacgacgacg acgaggaaca ggggcgcacc agcaccagca gcggcggcgg 300
cggcggcagc tcctcctcct cctccaactc cggcgccggc gccgacgccg acctctggac 360
acacgactcc cacggcctcc tccgccccgt cttcaggtgt tttctcgctc ctctccgctt 420
gtaaatttgg ggggtttgag atgaatgctt ccgctttact tggcgatcgg ggggctctgc 480
taggttcctg gagatcgatc ccaaggcgtg gtcggggctg gaggagacgg cggcgtcgtc 540
cgaggccaag catcacatcg gcgcggtacg gcactcactg cattttctgc cctcgtattg 600
taattggccg tgcgttttgg tcagtgttgc agtgatggtt tggggcttag gtgtttgatt 660
atatgcctac aagactgttc aacttcaatt tccgcgtgtg cttgaccagt gaaatttgtg 720
ttttcgtagc atctcatatc actctcattt tgtagtttct aaggataata ttcgaagaag 780
atggcgaaag ctcctcagac agatccgttc aggagcttgc cttggcaaaa ggagttgatg 840
tgatggtaat gagtttgggc aatgacagtg aagtgggtaa cacaattaaa ggtggggatc 900
aagatgcttt gcctagcagc tctggaacag ataaatcccc aggggaaagc tctcatgatg 960
accagttagg aatcaataaa ttgaccttag atgatattcc tgctaacaat caccggaaga 1020
tggcattgct gtttgctctt ctctcggctt gtgttgctga taaacctgta tcacaagagg 1080
aagaggacag aaaatcaacc cgcttcagaa agggttatga tgctcggcat cgtgttgctc 1140
ttcggctgct atctacatgg cttgatgtga agtggatcaa aatggtttgt acagactaaa 1200
cttttttgga ttcagcagag taatcactat gggtaccaaa caatatccta aggcaattaa 1260
agaaaaaaga acccccaatg tttttcatgt ttcaccaaat ttgtcaccaa ccatgctttt 1320
tcgctgcagg aagcaattga ggttatggtt gcatgctctg ccatggcagc agcgaaagaa 1380
caagaacaat cacaagaaag cgcatcaccc aaaagtaaat gggaaaaatg gaagcgtgga 1440
ggaataattg gtgcagctgc tttgactgga ggagcactgt tggctattac tgggggtatg 1500
aaaaaacttc agaaattctg ctggacgctg aacatctgaa ctagtttctg tcgctcatat 1560
tactctaaac ggttgatatt ctcttccaac ttgacaggtt tagctgctcc agcaattgct 1620
gcagggtttg gtgctcttgc tccaaccctg ggcacacttg tccctgttat aggagctagc 1680
ggatttgctg ctatggctac cgctgcagga tctgttgctg gctctgtggc agttgctgca 1740
tcatttggag gtgtgtctac atgttcttca aaacccttct ggtgatgttg ttttgagctt 1800
tcgtttcgtc atgctacatt actatcagct gcagtctccc tttccttttg tcatgagggt 1860
tgcaatattg tttagtgaac ttacccaaga atatgcacac aacactttta ggggtgcatt 1920
ttacaattag ataatagttc gatggattac cagaagaagg ataagactta tacaatgtat 1980
gacccaaaat tgtgagaaac atatggatga aagcgtaatg agaatagttg tggtgtggaa 2040
attttctatt tattcatgct tatcctttga aatgatattt ttcaattagc tgctggagct 2100
ggcttgacgg ggagcaaaat ggccagaaga attgggagcg tgaaagaatt tgagttcaaa 2160
cctattggtg aaaaccataa tcagggtgtg agttcctttt ctctttctga tgtcacaaca 2220
aatttcattc tactagattt ttgttcaccc ttttagtagt caaactgatc tggtctggtt 2280
aaatgttctc aacactttca gcggcttgca gttggcatct taatttccgg atttgctttt 2340
gatgaggatg acttttgtag accttgggaa ggatggcagg ataacctaga gaggtataga 2400
gtggtttttt tgacatgttc tttttgctga tcactattct agtactttgt catgtgttac 2460
cagctaagag catccttaaa aatataattt caagagttga ataatgtctg gcaggtacat 2520
ccttcaatgg gagtctaagc atataattgc agtgagtaca gcaatacagg attggctgac 2580
atcaaataat atttgtagtt ttgcccattt acactatcag caaaaacttc taatttacac 2640
tgttatttat gattcaggac ttgctatgga attgatgaag caaggtgcaa tgagaaccgt 2700
attaagtggt cttcttgcag catttgcatg gccagctaca ctgcttgcgg ctacagactt 2760
tattgatagt aaatggtctg ttgctattga caggtaattg caaactctcc catttctctt 2820
atgttgcttg cagagttgca gtgcacactg actatctaca gtattggtat tcttctgttg 2880
atcgacacat aagtttgtgt attcaaattt atgaagatta tgacgtaaca tttagacaaa 2940
cttttaatgt atatatccca cagtacatat attgcagtta aagacatcct agcaatttgt 3000
taacttagcc aatcaaaatc aactgtatct tctcagatca gataaggcag gaaaaatgct 3060
tgctgaagtg cttctcaagg gattgcaagg aaataggtaa ctccatatgc atttatttca 3120
ttagcgatgt tggccgctca gaaatatgcc cccctaatgc caattttttc ccaggcctgt 3180
gactctcatt gggttttcac taggtgcacg tgttatattt aagtgtttac aggaacttgc 3240
tttatcgagt gataatggta cctgttctgt aaccatctcc ttcatccata agattttcct 3300
ttcttggtgg caaattatca gttttaacca tctccatctc ttgtttgcaa ccagagggac 3360
ttgttgagag ggtggttctt cttggtgctc cagtttctgt gaaaggcgag cggtgggagg 3420
ctgcaaggaa ggtaatggcg atgaaatcat ttttttctct attatctctg taggaatggt 3480
gcaacgtaac aacaagaaac agtgaagttg ctgttcattt tatcccagct tcttccctcg 3540
gcgcttaatt cactcacttg tcctcatggc agatggttgc tggaagattt gtgaatgtgt 3600
actccacaga tgactggatt cttggtgtta cttttcgtgc gaggtaaaca gatttatgat 3660
ttcagagctc ctagaaagct catatcacca ttcccccatg tgagctatga cttaagtaat 3720
cttctcgtgc agtctgctaa cacaagggtt ggctggcatc caggccattg atgtccctgg 3780
tgttgaaaat gtaagcatct ggggtgccat tcttgtgttc ttacgtgatt ttcatctatc 3840
ttttacacgt ctttgtgtac aatgcaattt acaggtcgat gttacggagc ttgtcgatgg 3900
ccattcgtct tacctgtcgg cagcacaaca gatactggag catcttgaac taaataccta 3960
ctatccggtt tttgttcccc tgagcgcagc caacgaagaa acagatggaa cagttgcaca 4020
gtaacgtgtg ggcaggattc gatgaaaatc gtatgtagac aaccaaatgt aacagactat 4080gtaacgtgtg ggcaggattc gatgaaaatc gtatgtagac aaccaaatgt aacagactat 4080
atatacaacc aaatgtaaca gactaactcg ttcaaactgt aaaccagtct cagcctgagg 4140atatacaacc aaatgtaaca gactaactcg ttcaaactgt aaaccagtct cagcctgagg 4140
cgaatgtgtc tggttggtga ggcttgctgt ttctggccgt gtc 4183cgaatgtgtc tggttggtga ggcttgctgt ttctggccgt gtc 4183
<210> 7
<211> 4183
<212> DNA
<213> Rice male sterile mutant sts1-2 (Oryza sativa)
<400> 7
agtctccccc aaccaaaatt tgaccgagtt tggcacgaaa gaaaccgcct cgacgccgtc 60
caacggaagc cagccgaggc gaccaaccca agggtaggcc taggcgtaaa ccctgcacct 120
gaatcgatcg ccggcgatgg ccacgacgct gacgccgacg cagcgctacg cggcgggggc 180
gctgctggcg ctcgcgctcc gccaggccca gatccaccag tccgtcctcc tcggcgccca 240
ccaccaccac gacgacgacg acgaggaaca ggggcgcacc agcaccagca gcggcggcgg 300
cggcggcagc tcctcctcct cctccaactc cggcgccggc gccgacgccg acctctggac 360
acacgactcc cacggcctcc tccgccccgt cttcaggtgt tttctcgctc ctctccgctt 420
gtaaatttgg ggggtttgag atgaatgctt ccgctttact tggcgatcgg ggggctctgc 480
taggttcctg gagatcgatc ccaaggcgtg gtcggggctg gaggagacgg cggcgtcgtc 540
cgaggccaag catcacatcg gcgcggtacg gcactcactg cattttctgc cctcgtattg 600
taattggccg tgcgttttgg tcagtgttgc agtgatggtt tggggcttag gtgtttgatt 660
atatgcctac aagactgttc aacttcaatt tccgcgtgtg cttgaccagt gaaatttgtg 720
ttttcgtagc atctcatatc actctcattt tgtagtttct aaggataata ttcgaagaag 780
atggcgaaag ctcctcagac agatccgttc aggagcttgc cttggcaaaa ggagttgatg 840
tgatggtaat gagtttgggc aatgacagtg aagtgggtaa cacaattaaa ggtggggatc 900
aagatgcttt gcctagcagc tctggaacag ataaatcccc aggggaaagc tctcatgatg 960
accagttagg aatcaataaa ttgaccttag atgatattcc tgctaacaat caccggaaga 1020
tggcattgct gtttgctctt ctctcggctt gtgttgctga taaacctgta tcacaagagg 1080
aagaggacag aaaatcaacc cgcttcagaa agggttatga tgctcggcat cgtgttgctc 1140
ttcggctgct atctacatgg cttgatgtga agtggatcaa aatggtttgt acagactaaa 1200
cttttttgga ttcagcagag taatcactat gggtaccaaa caatatccta aggcaattaa 1260
agaaaaaaga acccccaatg tttttcatgt ttcaccaaat ttgtcaccaa ccatgctttt 1320
tcgctgcagg aagcaattga ggttatggtt gcatgctctg ccatggcagc agcgaaagaa 1380
caagaacaat cacaagaaag cgcatcaccc aaaagtaaat gggaaaaatg gaagcgtgga 1440
ggaataattg gtgcagctgc tttgactgga ggagcactgt tggctattac tgggggtatg 1500
aaaaaacttc agaaattctg ctggacgctg aacatctgaa ctagtttctg tcgctcatat 1560
tactctaaac ggttgatatt ctcttccaac ttgacaggtt tagctgctcc agcaattgct 1620
gcagggtttg gtgctcttgc tccaaccctg ggcacacttg tccctgttat aggagctagc 1680
ggatttgctg ctatggctac cgctgcagga tctgttgctg gctctgtggc agttgctgca 1740
tcatttggag gtgtgtctac atgttcttca aaacccttct ggtgatgttg ttttgagctt 1800
tcgtttcgtc atgctacatt actatcagct gcagtctccc tttccttttg tcatgagggt 1860
tgcaatattg tttagtgaac ttacccaaga atatgcacac aacactttta ggggtgcatt 1920
ttacaattag ataatagttc gatggattac cagaagaagg ataagactta tacaatgtat 1980
gacccaaaat tgtgagaaac atatggatga aagcgtaatg agaatagttg tggtgtggaa 2040
attttctatt tattcatgct tatcctttga aatgatattt ttcaattagc tgctggagct 2100
ggcttgacgg ggagcaaaat ggccagaaga attgggagcg tgaaagaatt tgagttcaaa 2160
cctattggtg aaaaccataa tcagggtgtg agttcctttt ctctttctga tgtcacaaca 2220
aatttcattc tactagattt ttgttcaccc ttttagtagt caaactgatc tggtctggtt 2280
aaatgttctc aacactttca gcggcttgca gttggcatct taatttccgg atttgctttt 2340
gatgaggatg acttttgtag accttgggaa ggatgacagg ataacctaga gaggtataga 2400
gtggtttttt tgacatgttc tttttgctga tcactattct agtactttgt catgtgttac 2460
cagctaagag catccttaaa aatataattt caagagttga ataatgtctg gcaggtacat 2520
ccttcaatgg gagtctaagc atataattgc agtgagtaca gcaatacagg attggctgac 2580
atcaagtaat atttgtagtt ttgcccattt acactatcag caaaaacttc taatttacac 2640
tgttatttat gattcaggac ttgctatgga attgatgaag caaggtgcaa tgagaaccgt 2700
attaagtggt cttcttgcag catttgcatg gccagctaca ctgcttgcgg ctacagactt 2760
tattgatagt aaatggtctg ttgctattga caggtaattg caaactctcc catttctctt 2820
atgttgcttg cagagttgca gtgcacactg actatctaca gtattggtat tcttctgttg 2880
atcgacacat aagtttgtgt attcaaattt atgaagatta tgacgtaaca tttagacaaa 2940
cttttaatgt atatatccca cagtacatat attgcagtta aagacatcct agcaatttgt 3000
taacttagcc aatcaaaatc aactgtatct tctcagatca gataaggcag gaaaaatgct 3060
tgctgaagtg cttctcaagg gattgcaagg aaataggtaa ctccatatgc atttatttca 3120
ttagcgatgt tggccgctca gaaatatgcc cccctaatgc caattttttc ccaggcctgt 3180
gactctcatt gggttttcac taggtgcacg tgttatattt aagtgtttac aggaacttgc 3240
tttatcgagt gataatggta cctgttctgt aaccatctcc ttcatccata agattttcct 3300
ttcttggtgg caaattatca gttttaacca tctccatctc ttgtttgcaa ccagagggac 3360
ttgttgagag ggtggttctt cttggtgctc cagtttctgt gaaaggcgag cggtgggagg 3420
ctgcaaggaa ggtaatggcg atgaaatcat ttttttctct attatctctg taggaatggt 3480
gcaacgtaac aacaagaaac agtgaagttg ctgttcattt tatcccagct tcttccctcg 3540
gcgcttaatt cactcacttg tcctcatggc agatggttgc tggaagattt gtgaatgtgt 3600
actccacaga tgactggatt cttggtgtta cttttcgtgc gaggtaaaca gatttatgat 3660
ttcagagctc ctagaaagct catatcacca ttcccccatg tgagctatga cttaagtaat 3720
cttctcgtgc agtctgctaa cacaagggtt ggctggcatc caggccattg atgtccctgg 3780
tgttgaaaat gtaagcatct ggggtgccat tcttgtgttc ttacgtgatt ttcatctatc 3840
ttttacacgt ctttgtgtac aatgcaattt acaggtcgat gttacggagc ttgtcgatgg 3900
ccattcgtct tacctgtcgg cagcacaaca gatactggag catcttgaac taaataccta 3960
ctatccggtt tttgttcccc tgagcgcagc caacgaagaa acagatggaa cagttgcaca 4020
gtaacgtgtg ggcaggattc gatgaaaatc gtatgtagac aaccaaatgt aacagactat 4080
atatacaacc aaatgtaaca gactaactcg ttcaaactgt aaaccagtct cagcctgagg 4140
cgaatgtgtc tggttggtga ggcttgctgt ttctggccgt gtc 4183
<210> 8
<211> 4184
<212> DNA
<213> Rice male sterile mutant sts1-3 (Oryza sativa)
<400> 8
agtctccccc aaccaaaatt tgaccgagtt tggcacgaaa gaaaccgcct cgacgccgtc 60
caacggaagc cagccgaggc gaccaaccca agggtaggcc taggcgtaaa ccctgcacct 120
gaatcgatcg ccggcgatgg ccacgacgct gacgccgacg cagcgctacg cggcgggggc 180
gctgctggcg ctcgcgctcc gccaggccca gatccaccag tccgtcctcc tcggcgccca 240
ccaccaccac gacgacgacg acgaggaaca ggggcgcacc agcaccagca gcggcggcgg 300
cggcggcagc tcctcctcct cctccaactc cggcgccggc gccgacgccg acctctggac 360
acacgactcc cacggcctcc tccgccccgt cttcaggtgt tttctcgctc ctctccgctt 420
gtaaatttgg ggggtttgag atgaatgctt ccgctttact tggcgatcgg ggggctctgc 480
taggttcctg gagatcgatc ccaaggcgtg gtcggggctg gaggagacgg cggcgtcgtc 540
cgaggccaag catcacatcg ggcgcggtac ggcactcact gcattttctg ccctcgtatt 600
gtaattggcc gtgcgttttg gtcagtgttg cagtgatggt ttggggctta ggtgtttgat 660
tatatgccta caagactgtt caacttcaat ttccgcgtgt gcttgaccag tgaaatttgt 720
gttttcgtag catctcatat cactctcatt ttgtagtttc taaggataat attcgaagaa 780
gatggcgaaa gctcctcaga cagatccgtt caggagcttg ccttggcaaa aggagttgat 840
gtgatggtaa tgagtttggg caatgacagt gaagtgggta acacaattaa aggtggggat 900
caagatgctt tgcctagcag ctctggaaca gataaatccc caggggaaag ctctcatgat 960
gaccagttag gaatcaataa attgacctta gatgatattc ctgctaacaa tcaccggaag 1020
atggcattgc tgtttgctct tctctcggct tgtgttgctg ataaacctgt atcacaagag 1080
gaagaggaca gaaaatcaac ccgcttcaga aagggttatg atgctcggca tcgtgttgct 1140
cttcggctgc tatctacatg gcttgatgtg aagtggatca aaatggtttg tacagactaa 1200
acttttttgg attcagcaga gtaatcacta tgggtaccaa acaatatcct aaggcaatta 1260
aagaaaaaag aacccccaat gtttttcatg tttcaccaaa tttgtcacca accatgcttt 1320
ttcgctgcag gaagcaattg aggttatggt tgcatgctct gccatggcag cagcgaaaga 1380
acaagaacaa tcacaagaaa gcgcatcacc caaaagtaaa tgggaaaaat ggaagcgtgg 1440
aggaataatt ggtgcagctg ctttgactgg aggagcactg ttggctatta ctgggggtat 1500
gaaaaaactt cagaaattct gctggacgct gaacatctga actagtttct gtcgctcata 1560
ttactctaaa cggttgatat tctcttccaa cttgacaggt ttagctgctc cagcaattgc 1620
tgcagggttt ggtgctcttg ctccaaccct gggcacactt gtccctgtta taggagctag 1680
cggatttgct gctatggcta ccgctgcagg atctgttgct ggctctgtgg cagttgctgc 1740
atcatttgga ggtgtgtcta catgttcttc aaaacccttc tggtgatgtt gttttgagct 1800
ttcgtttcgt catgctacat tactatcagc tgcagtctcc ctttcctttt gtcatgaggg 1860
ttgcaatatt gtttagtgaa cttacccaag aatatgcaca caacactttt aggggtgcat 1920
tttacaatta gataatagtt cgatggatta ccagaagaag gataagactt atacaatgta 1980
tgacccaaaa ttgtgagaaa catatggatg aaagcgtaat gagaatagtt gtggtgtgga 2040
aattttctat ttattcatgc ttatcctttg aaatgatatt tttcaattag ctgctggagc 2100
tggcttgacg gggagcaaaa tggccagaag aattgggagc gtgaaagaat ttgagttcaa 2160
acctattggt gaaaaccata atcagggtgt gagttccttt tctctttctg atgtcacaac 2220
aaatttcatt ctactagatt tttgttcacc cttttagtag tcaaactgat ctggtctggt 2280
taaatgttct caacactttc agcggcttgc agttggcatc ttaatttccg gatttgcttt 2340
tgatgaggat gacttttgta gaccttggga aggatggcag gataacctag agaggtatag 2400
agtggttttt ttgacatgtt ctttttgctg atcactattc tagtactttg tcatgtgtta 2460
ccagctaaga gcatccttaa aaatataatt tcaagagttg aataatgtct ggcaggtaca 2520
tccttcaatg ggagtctaag catataattg cagtgagtac agcaatacag gattggctga 2580
catcaagtaa tatttgtagt tttgcccatt tacactatca gcaaaaactt ctaatttaca 2640
ctgttattta tgattcagga cttgctatgg aattgatgaa gcaaggtgca atgagaaccg 2700
tattaagtgg tcttcttgca gcatttgcat ggccagctac actgcttgcg gctacagact 2760
ttattgatag taaatggtct gttgctattg acaggtaatt gcaaactctc ccatttctct 2820
tatgttgctt gcagagttgc agtgcacact gactatctac agtattggta ttcttctgtt 2880
gatcgacaca taagtttgtg tattcaaatt tatgaagatt atgacgtaac atttagacaa 2940
acttttaatg tatatatccc acagtacata tattgcagtt aaagacatcc tagcaatttg 3000
ttaacttagc caatcaaaat caactgtatc ttctcagatc agataaggca ggaaaaatgc 3060
ttgctgaagt gcttctcaag ggattgcaag gaaataggta actccatatg catttatttc 3120
attagcgatg ttggccgctc agaaatatgc ccccctaatg ccaatttttt cccaggcctg 3180
tgactctcat tgggttttca ctaggtgcac gtgttatatt taagtgttta caggaacttg 3240
ctttatcgag tgataatggt acctgttctg taaccatctc cttcatccat aagattttcc 3300
tttcttggtg gcaaattatc agttttaacc atctccatct cttgtttgca accagaggga 3360
cttgttgaga gggtggttct tcttggtgct ccagtttctg tgaaaggcga gcggtgggag 3420
gctgcaagga aggtaatggc gatgaaatca tttttttctc tattatctct gtaggaatgg 3480
tgcaacgtaa caacaagaaa cagtgaagtt gctgttcatt ttatcccagc ttcttccctc 3540
ggcgcttaat tcactcactt gtcctcatgg cagatggttg ctggaagatt tgtgaatgtg 3600
tactccacag atgactggat tcttggtgtt acttttcgtg cgaggtaaac agatttatga 3660
tttcagagct cctagaaagc tcatatcacc attcccccat gtgagctatg acttaagtaa 3720
tcttctcgtg cagtctgcta acacaagggt tggctggcat ccaggccatt gatgtccctg 3780
gtgttgaaaa tgtaagcatc tggggtgcca ttcttgtgtt cttacgtgat tttcatctat 3840
cttttacacg tctttgtgta caatgcaatt tacaggtcga tgttacggag cttgtcgatg 3900
gccattcgtc ttacctgtcg gcagcacaac agatactgga gcatcttgaa ctaaatacct 3960
actatccggt ttttgttccc ctgagcgcag ccaacgaaga aacagatgga acagttgcac 4020
agtaacgtgt gggcaggatt cgatgaaaat cgtatgtaga caaccaaatg taacagacta 4080
tatatacaac caaatgtaac agactaactc gttcaaactg taaaccagtc tcagcctgag 4140
gcgaatgtgt ctggttggtg aggcttgctg tttctggccg tgtc 4184
<210> 9
<211> 4122
<212> DNA
<213> Rice male sterile mutant sts1-4 (Oryza sativa)
<400> 9
agtctccccc aaccaaaatt tgaccgagtt tggcacgaaa gaaaccgcct cgacgccgtc 60
caacggaagc cagccgaggc gaccaaccca agggtaggcc taggcgtaaa ccctgcacct 120
gaatcgatcg ccggcgatgg ccacgacgct gacgccgacg cagcgctacg cggcgggggc 180
gctgctggcg ctcgcgctcc gccaggccca gatccaccag tccgtcctcc tcggcgccca 240
ccaccaccac gacgacgacg acgaggaaca ggggcgcacc agcaccagca gcggcggcgg 300
cggcggcagc tcctcctcct cctccaactc cggcgccggc gccgacgccg acctctggac 360
acacgactcc cacggcctcc tccgccccgt cttcaggtgt tttctcgctc ctctccgctt 420
gtaaatttgg ggggtttgag atgaatgctt ccgctttact tggcgatcgg ggggctctgc 480
taggttcctg gagatcgatc ccaaggcgtg gtcggggctg gaggagacgg cggcgtcgtc 540
cgaggccaag catcacatcg gcgcggtacg gcactcactg cattttctgc cctcgtattg 600
taattggccg tgcgttttgg tcagtgttgc agtgatggtt tggggcttag gtgtttgatt 660
atatgcctac aagactgttc aacttcaatt tccgcgtgtg cttgaccagt gaaatttgtg 720
ttttcgtagc atctcatatc actctcattt tgtagtttct aaggataata ttcgaagaag 780
atggcgaaag ctcctcagac agatccgttc aggagcttgc cttggcaaaa ggagttgatg 840
tgatggtaat gagtttgggc aatgacagtg aagtgggtaa cacaattaaa ggtggggatc 900
aagatgcttt gcctagcagc tctggaacag ataaatcccc aggggaaagc tctcatgatg 960
accagttagg aatcaataaa ttgaccttag atgatattcc tgctaacaat caccggaaga 1020
tggcattgct gtttgctctt ctctcggctt gtgttgctga taaacctgta tcacaagagg 1080
aagaggacag aaaatcaacc cgcttcagaa agggttatga tgctcggcat cgtgttgctc 1140
ttcggctgct atctacatgg cttgatgtga agtggatcaa aatggtttgt acagactaaa 1200
cttttttgga ttcagcagag taatcactat gggtaccaaa caatatccta aggcaattaa 1260
agaaaaaaga acccccaatg tttttcatgt ttcaccaaat ttgtcaccaa ccatgctttt 1320
tcgctgcagg aagcaattga ggttatggtt gcatgctctg ccatggcagc agcgaaagaa 1380
caagaacaat cacaagaaag cgcatcaccc aaaagtaaat gggaaaaatg gaagcgtgga 1440
ggaataattg gtgcagctgc tttgactgga ggagcactgt tggctattac tgggggtatg 1500
aaaaaacttc agaaattctg ctggacgctg aacatctgaa ctagtttctg tcgctcatat 1560
tactctaaac ggttgatatt ctcttccaac ttgacaggtt tagctgctcc agcaattgct 1620
gcagggtttg gtgctcttgc tccaaccctg ggcacacttg tccctgttat aggagctagc 1680
ggatttgctg ctatggctac cgctgcagga tctgttgctg gctctgtggc agttgctgca 1740
tcatttggag gtgtgtctac atgttcttca aaacccttct ggtgatgttg ttttgagctt 1800
tcgtttcgtc atgctacatt actatcagct gcagtctccc tttccttttg tcatgagggt 1860
tgcaatattg tttagtgaac ttacccaaga atatgcacac aacactttta ggggtgcatt 1920
ttacaattag ataatagttc gatggattac cagaagaagg ataagactta tacaatgtat 1980
gacccaaaat tgtgagaaac atatggatga aagcgtaatg agaatagttg tggtgtggaa 2040
attttctatt tattcatgct tatcctttga aatgatattt ttcaattagc tgctggagct 2100
ggcttgacgg ggagcaaaat ggccagaaga attgggagcg tgaaagaatt tgagttcaaa 2160
cctattggtg aaaaccataa tcagggtgtg agttcctttt ctctttctga tgtcacaaca 2220
aatttcattc tactagattt ttgttcaccc ttttagtagt caaactgatc tggtctggtt 2280
aaatgttctc aacactttca gcggcttgca gttggcatct taatttccgg atttgctttt 2340
gatgaggatg acttttgtag accttgggaa ggatggcagg ataacctaga gaggtataga 2400
gtggtttttt tgacatgttc tttttgctga tcactattct agtactttgt catgtgttac 2460
cagctaagag catccttaaa aatataattt caagagttga ataatgtctg gcaggtacat 2520
ccttcaatgg gagtctaagc atataattgc agtgagtaca gcaatacagg attggctgac 2580
atcaagtaat atttgtagtt ttgcccattt acactatcag caaaaacttc taatttacac 2640
tgttatttat gattcaggac ttgctatgga attgatgaaa aggtgcaatg agaaccgtat 2700
taagtggtct tcttgcagca tttgcatggc cagctacact gcttgcggct acagacttta 2760
ttgatagtaa atggtctgtt gctattgaca ggtaattgca aactctccca tttctcttat 2820
gttgcttgca gagttgcagt gcacactgac tatctacagt attggtattc ttctgttgat 2880
cgacacataa gtttgtgtat tcaaatttat gaagattatg acgtaacatt tagacaaact 2940
tttaatgtat atatcccaca gtacatatat tgcagttaaa gacatcctag caatttgtta 3000
acttagccaa tcaaaatcaa ctgtatcttc tcagatcaga taaggcagga aaaatgcttg 3060
ctgaagtgct tctcaaggga ttgcaaggaa ataggtaact ccatatgcat ttatttcatt 3120
agcgatgttg gccgctcaga aatatgcccc cctaatgcca attttttccc aggcctgtga 3180
ctctcattgg gttttcacta ggtgcacgtg ttatatttaa gtgtttacag gaacttgctt 3240
tatcgagtga taatggtacc tgttctgtaa ccatctcctt catccataag attttccttt 3300
cttggtggca aattatcagt tttaaccatc tccatctctt gtttgcaacc agagggactt 3360
gttgagaggg tggttcttct tggtgctcca gtttctgtga aaggcgagcg gtgggaggct 3420
gcaaggaagg taatggcgat gaaatcattt ttttctctat tatctctgta ggaatggtgc 3480
aacgtaacaa caagaaacag tgaagttgct gttcatttta tcccagcttc ttccctcggc 3540
gcttaattca ctcacttgtc ctcatggcag atggttgctg gaagatttgt gaatgtgtac 3600
tccacagatg actggattct tggtgttact tttcgtgcga ggtaaacaga tttatgattt 3660
cagagctcct agaaagctca tatcaccatt cccccatgtg agctatgact taagtaatct 3720
tctcgtgcag tctgctaaca caagggttgg ctggcatcca ggccattgat gtccctggtg 3780
ttgaaaatgt aagcatctgg ggtgccattc ttgtgttctt acgtgatttt catctatctt 3840
ttacacgtct ttgtgtacaa tgcaatttac aggtcgatgt tacggagctt gtcgatggcc 3900
attcgtctta cctgtcggca gcacaacaga tactggagca tcttgaacta aatacctact 3960
atccggtttt tgttcccctg agcgcagcca acgaagaaac agatggaaca gttgcacagt 4020
aacgtgtggg caggattcga tgaaaatcgt atgtagacaa ccaaatgtaa cagactatat 4080
atacaaccaa atgtaacaga ctaactcgtt caaactgtaa ac 4122
<210> 10
<211> 16
<212> DNA
<213> Artificial Sequence
<400> 10
ctgtagccgc aagcag 16
<210> 11
<211> 16
<212> DNA
<213> Artificial Sequence
<400> 11
ttgggagcgt gaaaga 16
<210> 12
<211> 18
<212> DNA
<213> Artificial Sequence
<400> 12
caaatgctgc aagaagac 18
<210> 13
<211> 20
<212> DNA
<213> Artificial Sequence
<400> 13
tgaaagcgta atgagaatag 20
<210> 14
<211> 39
<212> DNA
<213> STS1 gene (Oryza sativa)
<400> 14
aaaaaagcag gctccgaatt catggccacg acgctgacg 39
<210> 15
<211> 46
<212> DNA
<213> STS1 gene (Oryza sativa)
<400> 15
caagaaagct gggtcgaatt cttactgtgc aactgttcca tctgtt 46
<210> 16
<211> 23
<212> DNA
<213> Artificial Sequence
<400> 16
ggccaagcat cacatcggcg cgg 23
<210> 17
<211> 23
<212> DNA
<213> Artificial Sequence
<400> 17
gctatggaat tgatgaagca agg 23
Claims (9)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202210535082.4A CN114989280B (en) | 2022-05-17 | 2022-05-17 | A rice male fertility control gene STS1, its encoding protein and application |
Publications (2)
Publication Number | Publication Date |
---|---|
CN114989280A (en) | 2022-09-02 |
CN114989280B (en) | 2024-12-20 |
Family
ID=83026802
Family Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202210535082.4A Active CN114989280B (en) | 2022-05-17 | 2022-05-17 | A rice male fertility control gene STS1, its encoding protein and application |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN114989280B (en) |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20040123343A1 (en) * | 2000-04-19 | 2004-06-24 | La Rosa Thomas J. | Rice nucleic acid molecules and other molecules associated with plants and uses thereof for plant improvement |
US20060123505A1 (en) * | 2002-05-30 | 2006-06-08 | National Institute Of Agrobiological Sciences | Full-length plant cDNA and uses thereof |
CN106834305A (en) * | 2017-03-16 | 2017-06-13 | 四川农业大学 | A rice male fertility regulation gene OsSTRL2 and its application |
CN114262699A (en) * | 2021-12-28 | 2022-04-01 | 中国水稻研究所 | Rice male sterility gene OsLDDT1 and its molecular markers and applications |
Non-Patent Citations (4)
Title |
---|
KIKUCHI, S. et al.: "Oryza sativa Japonica Group cDNA clone:J023105E01, full insert sequence" * |
NCBI: "PREDICTED: Oryza sativa Japonica Group transmembrane and coiled-coil domain-containing protein 4 (LOC4331373), mRNA" * |
ZHIHAO SUN et al.: "OsLDDT1, encoding a transmembrane structural DUF726 family protein, is essential for tapetum degradation and pollen formation in rice" * |
Zou Ting (邹挺): "Cloning and functional study of several plant male fertility control genes" * |
Also Published As
Publication number | Publication date |
---|---|
CN114989280B (en) | 2024-12-20 |
Similar Documents
Publication | Title | Publication Date |
---|---|---|
AU2021225142B2 (en) | Generation of haploid plants | |
CN105602952B (en) | A fertility gene and its application | |
CN107267527B (en) | The method of maintaining male fertility and its application | |
US20040205839A1 (en) | Methods for obtaining plant varieties | |
CN106998665A (en) | The generation of haplophyte | |
CN105518141B (en) | The application of male nuclear sterile gene and its mutant in crossbreeding | |
CN111118030B (en) | DNA sequence regulating corn leaf angle and its mutants, molecular markers, detection primers and applications | |
CN106834305B (en) | Rice male fertility regulation gene OsSTRL2 and application thereof | |
CN112522259A (en) | Method for cultivating plant type improved rice material with Oslg1 mutant phenotype through haploid mediation | |
CN115466747B (en) | Glycosyltransferase ZmKOB1 gene and application thereof in regulation and control of maize female ear set character or development | |
CN108441499A (en) | Male fertile related gene HT2925 and its application | |
CN101525630B (en) | Rapeseed plant recessive cytoblast sterile restoring gene BnCYP704B1 and application | |
CN113832179A (en) | Application of ZmELF3.1 protein and functional deletion mutant thereof in regulating and controlling branch number of tassel of crop | |
CN110373418B (en) | Gene for regulating and controlling plant seed size and application thereof | |
CN114989280B (en) | A rice male fertility control gene STS1, its encoding protein and application | |
CN108660139B (en) | Plant Fertility Regulating Gene NP2 and Its Encoded Protein and Application | |
CN113817033B (en) | Application of ZmELF3.1 protein and its functional deletion mutant in regulating and controlling crop aerial root number or layer number | |
CN113151295B (en) | Rice Thermosensitive Male Sterile Gene OsFMS1 and Its Application | |
CN114540366B (en) | Rice fertility regulating gene GMS3, mutant and application thereof | |
CN113754746A (en) | Rice male fertility regulation gene, application thereof and method for regulating rice fertility by using CRISPR-Cas9 | |
CN105886516B (en) | A fertility regulation gene OsRPLP0 and its application | |
CN110846325B (en) | Application of a rice multiflora gene MOF1 and its encoded protein | |
CN116640767A (en) | A rice leaf color regulation gene OsWLS7 and its encoded protein and application | |
CN115044660A (en) | Targeting knockout method of rice heading stage regulatory gene Hd6, mutant and application thereof | |
EA048205B1 (en) | HAPLOID INDUCTOR | |
Legal Events
Code | Title | Description |
---|---|---|
PB01 | Publication | |
SE01 | Entry into force of request for substantive examination | |
GR01 | Patent grant | |